Mutation Testing - Better code by making bugs

MUTATION TESTING Better code by making bugs 1

ABOUT ME 2 @tﬁdry @theoﬁdry Théo Fidry Web developer London
(UK)

PROJECTS 3 Alice Humbug & Infection  PHP-Scoper & Box API-Platform

LET’S DO A SHOW OF HANDS 4

WHO DOES • UNIT TESTING • TEST-DRIVEN DEVELOPMENT • CONTINUOUS
INTEGRATION • MEASURE CODE COVERAGE • MUTATION TESTING 5

HOW TO DEFINE SOFTWARE QUALITY? 6

7 Marcello Duarte Creator of PhpSpec ex-Head of Training @Inviqa
The extent to what the software takes into account what matters most for the customer & the maintainability of the source code DEFINING SOFTWARE QUALITY Internal Quality External Quality https://speakerdeck.com/jakzal/building-in-quality

8 EXTERNAL QUALITY

EXTERNAL QUALITY • Conformity to the user expectation • Reliability
• Accuracy • Ergonomics • Design • … 9

THE QUALITY PERCEIVED BY THE USER 10

11 INTERNAL QUALITY https://insight.sensiolabs.com/projects/208b1a4c-4e7b-44b6-90a2-d7a2d91d431e/analyses/18744

INTERNAL QUALITY • Maintainability • Concision • Cohesion • Simplicity
• Clarity • … 12

THE QUALITY PERCEIVED BY THE DEVELOPER 13

SHOULD WE CARE? 14

Cumulative functioncality Time 15 PROJECT STAMINA Payoﬀ line https://martinfowler.com/bliki/DesignStaminaHypothesis.html

THE LESS YOU CARE THE HARDER IT WILL BE TO
ADD NEW FEATURES 16

COST OF BUG / PHASE 17 http://www.ifpug.org/Documents/Jones-CostPerDefectMetricVersion4.pdf Cost US$1,666.67 US$3,333.33
US$5,000.00 US$6,666.67 US$8,333.33 US$10,000.00 Development phase Dev CI/Review QA Prod US$100 US$500 US$1,500 US$10,000 https://plus.google.com/u/1/+LaurentBossavit/posts/8QLBPXA9miZ

ORIGIN OF THE COSTS • Implementation • Debug • Repair
• Technical debt • Delay: the extra you are paying for not ﬁxing bugs earlier on 18

ORIGIN OF THE COSTS 19 Technical debt Build cost Build
cost Cost of delay Technical debt Cost of debug Cost of repair Right feature built wrong Right feature IDEA

THE MORE YOU DELAY THE MORE EXPANSIVE IT GETS 20

YOU SHOULD CARE 21

HOW TO IMPROVE (INTERNAL) QUALITY? 22 BY ADDING TESTS

TESTS ARE CODE 23

TESTS ARE CODE • You need to write them •
You need to make sure they work • You need to refactor them • You need to maintain them 24

TESTS ARE EXPENSIVE 25

PROBLEMS • How do I safely refactor my tests? •
How do I know I can trust a test suite? • How do I ensure my team is writing eﬀective tests • How do I know if I’ve retroﬁtted enough tests to safely refactor a piece of legacy code? 26

HOW DO I ASSESS THE QUALITY? 27

28 NO TEST MAX TEST SHORT-TERM HIGH VELOCITY SHORT-TERM LOW
VELOCITY TESTS QUALITY Unless you are Jakub or Marco Level of quality

HOW DO I ASSESS THE QUALITY OF THE TEST SUITE?
29

COMMON ANSWERS • Don’t worry, it’ll be ﬁne • I’m
a ninja rockstar, I know my tests are good • I do TDD, I know my tests are good • What about the tests you didn’t write? • How do you test drive changes to tests? • Code review • Inconsistent + Labour intensive • Code coverage 30

CODE COVERAGE MEASURE DOES NOT TELL YOU WHICH PART HAS
BEEN TESTED 31

32 EXAMPLE https://github.com/theoﬁdry/mutation-testing-demo

34 RUN PHPUNIT WITH COVERAGE REPORT 100% code coverage

IS IT GOOD ENOUGH? 35

36 LET’S INTRODUCE A BUG https://github.com/theoﬁdry/mutation-testing-demo

37 RUN PHPUNIT WITH COVERAGE REPORT

OUR TESTS STILL PASS. OUR TEST SUITE IS DEFICIENT 38

CODE COVERAGE MEASURE DOES NOT TELL YOU WHICH PART HAS
BEEN TESTED 39

WHAT CODE COVERAGE DOES TELL YOU 40

EXECUTED ≠ TESTED Executed Tested 41

CODE COVERAGE TELLS YOU ONLY WHAT HAS NOT BEEN TESTED
42

43 A TEST CASE IS MISSING https://github.com/theoﬁdry/mutation-testing-demo

HOW TO DETECT IF A TEST SUITE IS DEFICIENT? 45

INTRODUCE A BUG 46

MUTATION TESTING 47

CREATE A MUTANT SOURCE CODE MUTATOR MUTATION PROCESS MUTANT 48

EXAMPLE OF A MUTANT 49

MUTATOR EXAMPLES 50 Name Original Mutated Plus + - GreaterThanOrEqualTo
>= > Spaceship $a <=> $b $b <=> $a TrueValue return true; return false; https://infection.github.io/guide/mutators.html

COLLECT THE SOURCE FILES 51 Counter.php Foo.php FILE COLLECTOR

GENERATE MUTANTS 52 Counter.php Foo.php MUTATOR MUTATOR MUTATOR MUTATOR MUTATOR
MUTATOR MUTANTS

GENERATED MUTANTS 53 … and more

APPLY MUTANTS 54 PROCESS BUILDER MUTANT PROCESS WITH MUTATED CODE
RESULT Runs tests TESTS RUNNER

IF A MUTANT DOES NOT CAUSE THE TESTS TO FAIL,
IT SURVIVED 55

IF A MUTANT DOES CAUSE THE TESTS TO FAIL, IT
WAS KILLED 56

MUTATION SCORE Nbr of mutant killed Nbr of mutant generated
Mutation score = 57

CODE COVERAGE HIGHLIGHTS CODE THAT IS DEFINITELY NOT TESTED MUTATION
SCORE HIGHLIGHTS CODE THAT IS DEFINITELY TESTED 58 HOW TO DETECT IF A TEST SUITE IS DEFICIENT?

DOES IT WORK? “Complex faults are coupled to simple faults
in such a way that a test data set that detects all simple faults in a program will detect most complex faults” Demonstrated in 1995 by K. Wah, “Fault coupling in finite bijective functions” 59

DEMO 60

MUTATION TESTING IN PHP infection/infection 61 symfony/dependency-injection

INSTALLATION 62 Infection installation guide: https://infection.github.io/guide/installation.html PHPUnit installation guide: https://phpunit.de/getting-started/phpunit-7.html

63 CONFIGURATION infection.json.dist https://infection.github.io/guide/usage.html#Conﬁguration

64 RUNNING INFECTION $ php infection.phar

65 RESULT

66 REPORT

67 RUNNING ON DIFF https://blog.alejandrocelaya.com/2018/02/17/mutation-testing-with-infection-in-big-php-projects/

DEMO DONE 68

IT IS NOT NEW… - HISTORY • Begins in 1971,
R. Lipton, “Fault Diagnosis of Computer Programs” • Generally accepted in 1978, R. Lipton and al, “Hints on test data selection: Help for the practicing programmer” 69

WHY IS IT NOT WIDELY USED? 70 http://knowyourmeme.com/memes/family-guy-why-are-we-not-funding-this

WHY IS IT NOT WIDELY USED? Maturity Problem: Because testing
is not widely used yet (Although it is increasing) 71

WHY IS IT NOT WIDELY USED? Integration Problem: Inability to
successfully integrated it into software development process (TDD plays a key role now) 72

WHY IS IT NOT WIDELY USED? Technical Problem: It is
a brute force technique! 73

BRUTE FORCE TECHNIQUE N: Number of tests M: Number of
mutants NxM 74

THEORETICAL RUN • 672 tests in 25.29 seconds   (0.03763s/test)
• 3573 mutants • 2,401,056 tests • ~53h With basic Mutation Testing 75

OPTIMISATION STRATEGIES • Mutate only covered code • Incremental analysis
• Parallelism • Equivalent mutant (~30%) • Diﬀerent levels of requirements 77

OPTIMISATION STRATEGIES • 672 tests in 25.29 seconds   (0.03763s/test)
• 3573 mutants • 2,401,056 tests • ~10min With Infection 78

INFECTION IS STILL YOUNG 79

FUTURE WORK • Test framework support • Equivalent mutant •
Performance optimisations • Test framework support • Granular conﬁguration (Proﬁles) 80

WRAP UP 81 https://www.pinterest.com/pin/140526450845150336/

THE GOOD PARTS • Gives you feedback on your tests
• Test your tests with little eﬀort • They are automatic • They discover dead code • Helps to refactor your tests 82

THE NOT SO GOOD PARTS • Can be slow •
Handful of young libraries • Writing complex mutant tests is diﬃcult • Side eﬀects with integration tests 83

MUTATION TESTING LIBRARIES HTTPS://GITHUB.COM/THEOFIDRY/MUTATION-TESTING 84

QUESTIONS? 85 https://imgﬂip.com/memegenerator/Shrek-Cat

THANK YOU! 86

Mutation Testing - Better code by making bugs

Mutation Testing - Better code by making bugs

More Decks by Théo FIDRY

Other Decks in Programming

Featured

Transcript