Slide 1

Slide 1 text

Testing the Intelligence of your AI Iosif Itkin, CEO and co-founder Elena Treshcheva, Researcher Iosif Itkin, CEO and co-founder Elena Treshcheva, Business Development Manager and Researcher

Slide 2

Slide 2 text

Exactpro Overview ● A specialist firm focused on functional and non-functional testing of exchanges, clearing houses, depositories, trade repositories and other financial market infrastructures. ● Incorporated in 2009 with 10 people, our company has experienced significant growth and is now employing over 550 specialists. ● We were part of the London Stock Exchange Group (LSEG) from May 2015 till January 2018. Exactpro management buyout from LSEG was successfully completed in January 2018. We are headquartered in the UK and have operations in the US, Georgia and Russia.

Slide 3

Slide 3 text

Exactpro Client Network

Slide 4

Slide 4 text

AI-based Systems in Finance Machine Learning in financial organizations: - already passed an initial development phase - the usage of live ML applications is about to dramatically increase over the next three years https://www.bankofengland.co.uk/-/media/boe/files/report/ 2019/machine-learning-in-uk-financial-services.pdf

Slide 5

Slide 5 text

AI-based Systems in Finance Machine Learning in financial organizations: - already passed an initial development phase - the usage of live ML applications is about to dramatically increase over the next three years ● Market Surveillance Systems ● Conversational Assistants ● Algo Trading Systems ● Pricing Calculators ● Machine Readable News ● Insurance Claims https://www.bankofengland.co.uk/-/media/boe/files/report/ 2019/machine-learning-in-uk-financial-services.pdf

Slide 6

Slide 6 text

AI-based Systems’ Quality Characteristics: - Ability to learn: The capacity of the system to learn from use for the system itself, or data and events it is exposed to. - Ability to generalize: The ability of the system to apply to different and previously unseen scenarios. - Trustworthiness: The degree to which the system is trusted by stakeholders, for example a health diagnostic A4Q AI and Software Testing Foundation Syllabus https://www.gasq.org/en/exam-modules/a4q-ai-and-software-testing.html Testing the Intelligence of your AI

Slide 7

Slide 7 text

Ability to Learn: https://www.deeplearning.ai/ • Training set — Which you run your learning algorithm on. • Development set — Which you use to tune parameters, select features, and make other decisions regarding the learning algorithm. Sometimes also called the hold-out cross validation set. • Test set — which you use to evaluate the performance of the algorithm, but not to make any decisions regarding what learning algorithm or parameters to use.

Slide 8

Slide 8 text

Trustworthiness: https://innovation.defense.gov/ai/ During the DIB’s quarterly public meeting on October 31, 2019, the DIB members voted to approve the proposed AI Principles.

Slide 9

Slide 9 text

Trustworthiness: https://www.mas.gov.sg/news/media-releases/2019/mas-partners-financial-industry -to-create-framework-for-responsible-use-of-ai

Slide 10

Slide 10 text

Ability to Generalize: Scope of End-to-End and Negative Testing

Slide 11

Slide 11 text

Congruence bias Confirmation bias Law of triviality Zero-risk bias Anthropocentric thinking Illusion of control Cognitive Biases Affecting Software Testing of AI-based Systems Automation bias

Slide 12

Slide 12 text

AI-based Systems: Machine-Readable News

Slide 13

Slide 13 text

Confirmation Bias

Slide 14

Slide 14 text

Salman, I. (2016). Cognitive biases in software quality and testing. Proceedings of the 38th International Conference on Software Engineering Companion - ICSE ’16. Pp. 823-826.

Slide 15

Slide 15 text

Mohanani, R., Salman, I., Turhan, B., Rodríguez, P., & Ralph, P. (2018). Cognitive Biases in Software Engineering: A Systematic Mapping Study. IEEE Transactions on Software Engineering

Slide 16

Slide 16 text

AI-based Systems: Conversational Assistants (Chatbots) Chatbot

Slide 17

Slide 17 text

Anthropocentric Bias We should not humanize computers.

Slide 18

Slide 18 text

Anthropocentric bias They dislike it a lot!

Slide 19

Slide 19 text

Anthropocentric Bias: Testing a Mine-Defusing Robot

Slide 20

Slide 20 text

Anthropocentric Bias: Why We Treat Robots Like Humans Darling, Kate and Nandy, Palash and Breazeal, Cynthia “Empathic Concern and the Effect of Stories in Human-Robot Interaction” (2015). Proceedings of the IEEE International Workshop on Robot and Human Communication (ROMAN), 2015. 6 p. https://www.ted.com/talks/kate_darling_why_we_ha ve_an_emotional_connection_to_robots

Slide 21

Slide 21 text

Anthropocentric Bias: Testing Chatbots Anaphora / Context Human: I bought 500 Company X shares two years ago. The stocks’ cost was 60,000 USD. What’s their today’s cost? Chatbot: What currency would you like to have for the rate? X Spelling / overall correctness Human: What is the setlement date of the tradeId XXX?? Chatbot: ???

Slide 22

Slide 22 text

AI-based Systems: Algo Trading

Slide 23

Slide 23 text

Congruence Bias Direct Testing Indirect Testing Indirect Testing

Slide 24

Slide 24 text

Indirect Testing Methods Information extraction and Machine learning End-to-End Automated Test Library Whatever it takes! Test execution data and log analysis Passive Testing Whatever it takes!

Slide 25

Slide 25 text

Applications of the Proposed Approach https://unsplash.com/search/photos/san-francisco The First IEEE International Conference on Artificial Intelligence Testing (IEEE AITest 2019), April 4-9 2019, San Francisco East Bay, CA, USA User-Assisted Log Analysis for Quality Control of Distributed Fintech Systems Iosif Itkin, Anna Gromova, Anton Sitnikov, Rostislav Yavorskiy, Evgenii Tsymbalov, Andrey Novikov and Kirill Rudakov.

Slide 26

Slide 26 text

AI-based Systems: Pricing Calculator

Slide 27

Slide 27 text

Law of Triviality (the Bike-Shed Effect)

Slide 28

Slide 28 text

Automation Bias

Slide 29

Slide 29 text

AI-based Systems: Fraud Detection and Market Surveillance

Slide 30

Slide 30 text

Build Software to Test Software Click to know more about Exactpro Test Tools

Slide 31

Slide 31 text

AI-based Systems: Insurance Claims

Slide 32

Slide 32 text

Zero-Risk Bias

Slide 33

Slide 33 text

Non-deterministic Systems: Financial Market Infrastructures

Slide 34

Slide 34 text

The Illusion of Control and Happiness Sherman, G. D., Lee, J. J., Cuddy, A. J. C., Renshon, J., Oveis, C., Gross, J. J., & Lerner, J. S. (2012). Leadership is associated with lower levels of stress. Proceedings of the National Academy of Sciences, 109(44), 17903–17907.

Slide 35

Slide 35 text

Fenton-O’Creevy, M., Nicholson, N., Soane, E., & Willman, P. (2003). “Trading on illusions: Unrealistic perceptions of control and trading performance”. Journal of Occupational and Organizational Psychology, 76(1), 53–68. The Illusion of Control and Performance

Slide 36

Slide 36 text

Thank you