Test Design for AI Systems

1 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks BUILD
SOFTWARE TO TEST SOFTWARE exactpro.com Test Design for AI Systems Murad Mamedov AI Researcher, Exactpro 13 MAY | 3 PM SLST

2 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Table
of Contents - neural network architecture - ML development process overview - current QA activities in industry - why test-design is important - black-box testing - white-box testing - data-box testing - conclusion

3 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Neural
Net Architecture

4 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks ML
Development Process Overview

5 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Current
Activities in Industry The emerging risk makes governments and businesses respond with quality assurance activities. The regulatory activities are also leveraging monitoring and control of AI development. - USA Data and Trust Alliance - EU AI Regulation Draft - ISO/IEC TR 29119-11:2020 Guidelines on the testing of AI-based systems

6 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Current
Activities in Industry

7 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Why
Test Design is Important

8 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Black
Box Strategy: Mutational Approach It can be applied to: - to an algorithm itself - train data - test data Original Program Mutant Program Output Compare the results of both programs

Box Strategy: Combinatorial Approach The picture represents a decision map application to the Boldness and Discontinuity features, in order to generate use cases from high-level scenarios

Box Strategy: Business Logic Approach The approaches based on the business logic are closer to validation-level ones, since they are going directly to the question of whether a system meets the stakeholders’ expectations or not. An example of merging approaches: Model-based exploration of the frontier of behaviours for deep learning system testing Input Database

Box Strategy: Business Logic Approach The approaches based on the business logic are closer to validation-level ones, since they are going directly to the question of whether a system meets the stakeholders’ expectations or not. An example of merging approaches: Model-based exploration of the frontier of behaviours for deep learning system testing Input Database Literature Features Libelled Inputs Open Coding Metric Identiﬁcation Candidate Metrics Design Metrics Validation and Correlation Metrics Initial Labelling Consensus Meeting Final Labelling Feature name Score [5pt] Candidate Metrics

12 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks White
Box Strategy: Activation Testing What can be tested: - if a neuron is activated - which value it’s activated with - how the neurons interact - how the layers interact

Box Strategy: Tools

Box Strategy: What’s Missing? Activation testing focuses mostly on neurons/layers behaviour, and pays less attention to the predictions

15 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Data
Box Strategy Combinatorial EDA helps to represent the data from a use cases perspective and enhances the further ML testing activities such as oracle education, test generation, etc.

16 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Conclusion

17 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks AI
Testing Talks Thank You!

Test Design for AI Systems

Test Design for AI Systems

Exactpro PRO

More Decks by Exactpro

Other Decks in Technology

Featured

Transcript

1 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks BUILD

2 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Table

3 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Neural

4 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks ML

5 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Current

6 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Current

7 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Why

8 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Black

9 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Black

10 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Black

11 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Black

12 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks White

13 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks White

14 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks White

15 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Data

16 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks Conclusion

17 BUILD SOFTWARE TO TEST SOFTWARE AI Testing Talks AI