The Best Test Data is Random Test Data

An introduction to property-based testing
Fraser Tweedale
Red Hat, Inc.
@hackuador
February 7, 2015

Introduction

About me
- Developer at Red Hat
- FreeIPA identity management and Dogtag PKI
- Mostly Python and Java at work
- Mostly Haskell for other projects

This talk
- Introduce property-based testing; motivate with examples
- Concepts will be demonstrated in Haskell using QuickCheck
- A brief look at property-based testing in other languages
- Discussion of limitations
- Alternative approaches

Property-based testing
A property-based testing framework:
1. Gives you a way to state properties of functions
2. Gives you a way to declare how to generate arbitrary values of your types
3. Provides generators for standard types (usually)
4. Attempts to falsify your properties and reports counterexamples
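The four points above can be sketched in QuickCheck roughly as follows. The Colour type and the next function are hypothetical, invented only for this illustration; Arbitrary, elements and quickCheck are QuickCheck's own names.

```haskell
import Test.QuickCheck

-- Hypothetical type, used only for illustration.
data Colour = Red | Green | Blue
  deriving (Eq, Show, Enum, Bounded)

-- (2) Declare how to generate arbitrary values of our own type.
instance Arbitrary Colour where
  arbitrary = elements [minBound .. maxBound]

-- Hypothetical function under test: rotate to the next colour, wrapping around.
next :: Colour -> Colour
next c
  | c == maxBound = minBound
  | otherwise     = succ c

-- (1) State a property: three rotations return to the starting colour.
prop_nextCycle :: Colour -> Bool
prop_nextCycle c = next (next (next c)) == c

-- (3) Generators for standard types such as Int and lists are provided.
prop_lengthAppend :: [Int] -> [Int] -> Bool
prop_lengthAppend xs ys = length (xs ++ ys) == length xs + length ys
```

For point 4, running quickCheck prop_nextCycle or quickCheck prop_lengthAppend in GHCi makes the library try to falsify the property with random inputs and print a counterexample if it finds one.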

Applications
- Check laws and invariants of algorithms, data, abstractions
- Check code against a model implementation
- Properties are meaningful documentation
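As a minimal sketch of the first two applications, assume a hand-written insertion sort (isort, a hypothetical function made up for this example) that is checked against Data.List.sort as the model and against an ordering invariant.

```haskell
import Data.List (sort)
import Test.QuickCheck

-- Hypothetical implementation under test: insertion sort.
isort :: Ord a => [a] -> [a]
isort = foldr insert' []
  where
    insert' x [] = [x]
    insert' x (y:ys)
      | x <= y    = x : y : ys
      | otherwise = y : insert' x ys

-- Check against a model implementation: Data.List.sort.
prop_sortModel :: [Int] -> Bool
prop_sortModel xs = isort xs == sort xs

-- Check an invariant: the output is in non-decreasing order.
prop_sortOrdered :: [Int] -> Bool
prop_sortOrdered xs = ordered (isort xs)
  where
    ordered ys = and (zipWith (<=) ys (drop 1 ys))
```

The two type signatures also serve as documentation: they say what isort promises without describing how it works.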

Examples

Reversing a list

    rev :: [a] -> [a]
    rev []     = []
    rev (x:xs) = rev xs ++ [x]

    prop_RevUnit :: Int -> Bool
    prop_RevUnit x = rev [x] == [x]

    prop_RevApp :: [Int] -> [Int] -> Bool
    prop_RevApp xs ys = rev (xs ++ ys) == rev ys ++ rev xs
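To see what a reported counterexample looks like, here is a sketch of a deliberately wrong variant of the append property. The name prop_RevAppWrong is made up for this illustration; rev is repeated so the block stands alone.

```haskell
import Test.QuickCheck

rev :: [a] -> [a]
rev []     = []
rev (x:xs) = rev xs ++ [x]

-- Deliberately wrong: the operands on the right-hand side are not swapped.
prop_RevAppWrong :: [Int] -> [Int] -> Bool
prop_RevAppWrong xs ys = rev (xs ++ ys) == rev xs ++ rev ys
```

Running quickCheck prop_RevAppWrong in GHCi should report a failure together with a pair of lists that breaks the property, shrunk towards small values. The exact counterexample varies from run to run.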

Expression transformation

Gotchas

Exhaustion
- Naïve use of preconditions resulting in not enough test cases
- Solution: custom generator to ensure precondition satisfied
- Better solution: redesign data types such that precondition is invariant
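The three options can be sketched with the classic ordered-insert example. The property names and the ordered helper are made up for this illustration; orderedList and OrderedList are provided by QuickCheck.

```haskell
import Data.List (insert)
import Test.QuickCheck

-- Helper: is the list in non-decreasing order?
ordered :: Ord a => [a] -> Bool
ordered xs = and (zipWith (<=) xs (drop 1 xs))

-- Naive precondition: most random lists are not ordered, so most
-- generated cases are discarded and QuickCheck may give up.
prop_insertPre :: Int -> [Int] -> Property
prop_insertPre x xs = ordered xs ==> ordered (insert x xs)

-- Custom generator: orderedList only ever produces ordered lists.
prop_insertGen :: Int -> Property
prop_insertGen x =
  forAll orderedList $ \xs -> ordered (insert x (xs :: [Int]))

-- Precondition as a type invariant: OrderedList carries the guarantee,
-- so the property needs neither a precondition nor an explicit generator.
prop_insertOrd :: Int -> OrderedList Int -> Bool
prop_insertOrd x (Ordered xs) = ordered (insert x xs)
```

In your own code the third option usually means a newtype with a smart constructor, so that an unordered value cannot be built in the first place.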

Trivial test data
- Trivial test data can result in tests passing vacuously
- Use collect or cover to inspect distribution
- Use frequency to govern distribution
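A rough sketch of inspecting and steering the distribution, again using ordered insert. All names other than the QuickCheck functions are hypothetical. The sketch uses collect and classify rather than cover because, as far as I know, cover's argument order differs between QuickCheck releases.

```haskell
import Data.List (insert, sort)
import Test.QuickCheck

ordered :: Ord a => [a] -> Bool
ordered xs = and (zipWith (<=) xs (drop 1 xs))

-- collect records the length of each list that survives the precondition.
-- The report typically shows that nearly all of them are very short.
prop_insertCollected :: Int -> [Int] -> Property
prop_insertCollected x xs =
  ordered xs ==> collect (length xs) (ordered (insert x xs))

-- frequency weights the choice of length: one case in five is tiny,
-- the rest have between 3 and 20 elements.
genOrdered :: Gen [Int]
genOrdered = do
  n  <- frequency [(1, choose (0, 2)), (4, choose (3, 20))]
  xs <- vectorOf n arbitrary
  pure (sort xs)

-- classify labels the tiny cases so their share appears in the report.
prop_insertBiased :: Int -> Property
prop_insertBiased x =
  forAll genOrdered $ \xs ->
    classify (length xs < 3) "trivial" (ordered (insert x xs))
```

Comparing the reports of the two properties shows how much of the first property's apparent coverage comes from lists with zero, one or two elements.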

Infinite data structures
- Useful, but be careful what you evaluate
- Use sized when defining generators for recursive data
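Both points can be sketched briefly. The Tree type, its generator and the property names are assumptions made for this example; sized, oneof and NonEmptyList come from QuickCheck.

```haskell
import Test.QuickCheck

-- Hypothetical recursive type.
data Tree = Leaf Int | Node Tree Tree
  deriving (Eq, Show)

-- Halving the size budget at each Node guarantees termination.
genTree :: Int -> Gen Tree
genTree n
  | n <= 0    = Leaf <$> arbitrary
  | otherwise = oneof
      [ Leaf <$> arbitrary
      , Node <$> genTree (n `div` 2) <*> genTree (n `div` 2)
      ]

instance Arbitrary Tree where
  arbitrary = sized genTree

leaves, nodes :: Tree -> Int
leaves (Leaf _)   = 1
leaves (Node l r) = leaves l + leaves r
nodes (Leaf _)    = 0
nodes (Node l r)  = 1 + nodes l + nodes r

-- A binary tree has exactly one more leaf than it has inner nodes.
prop_leavesNodes :: Tree -> Bool
prop_leavesNodes t = leaves t == nodes t + 1

-- cycle xs is infinite; the property only ever inspects a finite prefix.
prop_cyclePrefix :: NonEmptyList Int -> Bool
prop_cyclePrefix (NonEmpty xs) = take (length xs) (cycle xs) == xs
```

Without the size budget, a generator that picks Node half of the time can produce enormous trees or fail to terminate. Likewise, comparing cycle xs with itself using == would never finish.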

Other languages

Property-based testing implementations
- Most languages have at least one implementation
- Incomplete list: https://en.wikipedia.org/wiki/QuickCheck
- Some decent or popular implementations are missing
- Python: pyqcy
- Java: Functional Java (fj.test)

Python example

    from pyqcy import *

    def rev(l):
        return list(reversed(l))

    @qc
    def prop_rev_unit(x=int_()):
        assert rev([x]) == [x]

    @qc
    def prop_rev_app(xs=list_(of=int), ys=list_(of=int)):
        assert rev(xs + ys) == rev(ys) + rev(xs)

    if __name__ == '__main__':
        main()

Java example
junit-quickcheck: https://github.com/pholser/junit-quickcheck/

    static <A> List<A> rev(List<A> xs);
    static <A> List<A> app(List<A> xs, List<A> ys);

Java example

    import static org.junit.Assert.*;
    import org.junit.contrib.theories.*;
    import org.junit.runner.RunWith;
    import com.pholser.junit.quickcheck.ForAll;

    @RunWith(Theories.class)
    public class RevTestCase {
        // next slide
    }

Java example

    @Theory
    public void revUnit(@ForAll Integer x) {
        // Parameterised type instead of a raw ArrayList
        ArrayList<Integer> xs = new ArrayList<>();
        xs.add(x);
        assertEquals(rev(xs), xs);
    }

    @Theory
    public void revApp(
        @ForAll ArrayList<Integer> xs,
        @ForAll ArrayList<Integer> ys
    ) {
        assertEquals(
            rev(app(xs, ys)),
            app(rev(ys), rev(xs))
        );
    }

Limitations

Bugs
- Incorrect Arbitrary instances
- Incorrect properties
- Incomplete properties

Randomness

    prop_verify_eq :: Password -> Bool
    prop_verify_eq s = verify (hash s) s

    prop_verify_neq :: Password -> Password -> Property
    prop_verify_neq s s' =
      not (s == s') ==> not (verify (hash s) s')

Randomness
- Previous slide: what if hash truncates input before hashing?
- Some bugs are unlikely to be found with random data
- Workaround: mutate or fuzz data in domain-relevant way

Randomness

    fuzz :: Password -> Gen Password
    fuzz = {- truncation / extension / permutation / etc -}

    prop_verify_fuzzed :: Password -> Property
    prop_verify_fuzzed s = forAll (fuzz s) (prop_verify_neq s)
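One possible way to fill in fuzz is sketched below. Password, Hash, hash and verify are toy stand-ins invented for this example, not the talk's definitions, and this hash deliberately contains the truncation bug: it looks only at the first eight characters.

```haskell
import Test.QuickCheck

-- Toy stand-ins, assumed for illustration only.
newtype Password = Password String deriving (Eq, Show)
newtype Hash = Hash String deriving (Eq)

instance Arbitrary Password where
  arbitrary = Password <$> listOf (choose ('a', 'z'))

-- Buggy toy "hash": everything after the eighth character is ignored.
hash :: Password -> Hash
hash (Password s) = Hash (take 8 s)

verify :: Hash -> Password -> Bool
verify h p = hash p == h

prop_verify_neq :: Password -> Password -> Property
prop_verify_neq s s' = s /= s' ==> not (verify (hash s) s')

-- Derive a related password by truncation, extension or permutation.
fuzz :: Password -> Gen Password
fuzz (Password s) = oneof [truncated, extended, permuted]
  where
    truncated = do
      n <- choose (0, length s)
      pure (Password (take n s))
    extended = do
      extra <- listOf1 (choose ('a', 'z'))
      pure (Password (s ++ extra))
    permuted = Password <$> shuffle s

prop_verify_fuzzed :: Password -> Property
prop_verify_fuzzed s = forAll (fuzz s) (prop_verify_neq s)
```

Two independently generated passwords almost never share an eight-character prefix, so prop_verify_neq on its own tends to pass. A password and its extension share one by construction, which is why the fuzzed property is likely to expose the bug.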

Failure cases
- Arbitrary is great for generating random valid data
- How to specify behaviour given invalid data?

Failure cases

    dump :: JSON -> String
    load :: String -> Maybe JSON

    prop_dumpLoad :: JSON -> Bool
    prop_dumpLoad a = load (dump a) == Just a

    loadSpec :: Spec
    loadSpec = describe "load" $
      it "fails on bogus input" $
        load "bogus" `shouldBe` Nothing
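Besides a hand-picked example such as "bogus", invalid input can sometimes be generated as well. The sketch below assumes a toy format in which a value is a non-negative integer written in decimal digits; dump, load and genInvalid here are stand-ins for illustration, not the JSON functions from the slide.

```haskell
import Data.Char (isDigit)
import Test.QuickCheck

-- Toy stand-ins: a value is a non-negative Integer in decimal notation.
dump :: Integer -> String
dump = show

load :: String -> Maybe Integer
load s
  | not (null s) && all isDigit s = Just (read s)
  | otherwise                     = Nothing

-- Valid data round-trips.
prop_dumpLoad :: NonNegative Integer -> Bool
prop_dumpLoad (NonNegative n) = load (dump n) == Just n

-- Invalid data: the empty string, or any string with a non-digit in it.
genInvalid :: Gen String
genInvalid = oneof
  [ pure ""
  , arbitrary `suchThat` any (not . isDigit)
  ]

-- Behaviour on invalid data: load must reject it.
prop_loadRejectsInvalid :: Property
prop_loadRejectsInvalid = forAll genInvalid $ \s -> load s == Nothing
```

This only works when "invalid" is easy to characterise. For a richer format, writing a generator of invalid documents may be as hard as writing the parser, which is one reason example-based tests keep their place.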

Conclusion
- Property-based testing is true automated testing
- More thorough testing in less time ($$$)
- Relieves developer of burden of finding and manually writing tests for corner cases
- Properties are meaningful documentation
- The best test data is random test data, but...
  - a bit of domain-specific non-randomness is sometimes useful
  - examples still have their place

Alternative approaches

Exhaustive testing
- The best test data is all of the data
- Check that property holds for all values
- Supports existential properties
- Available in several languages: SmallCheck (Haskell), smallcheck4scala, autocheck (C++), ocamlcheck, python-doublecheck
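The idea can be illustrated without any library for a type small enough to enumerate completely. This is a sketch using only base and is not SmallCheck's API; SmallCheck instead enumerates values up to a depth bound. All names below are made up for the example.

```haskell
import Data.Word (Word8)

-- Every value of a small, bounded type.
universe :: (Bounded a, Enum a) => [a]
universe = [minBound .. maxBound]

-- Universal property: addition on Word8 commutes.
prop_addCommutes :: Word8 -> Word8 -> Bool
prop_addCommutes x y = x + y == y + x

-- Checked for all 256 * 256 pairs, not for a random sample.
checkAddCommutes :: Bool
checkAddCommutes =
  and [prop_addCommutes x y | x <- universe, y <- universe]

-- Existential property: for every x there exists a y with x + y == 0
-- (Word8 arithmetic wraps around modulo 256).
prop_hasInverse :: Word8 -> Bool
prop_hasInverse x = any (\y -> x + y == 0) universe

checkHasInverse :: Bool
checkHasInverse = all prop_hasInverse universe
```

Random testing has no good way to express "there exists": failing to find a witness in a random sample proves nothing, whereas an exhaustive search over a finite domain settles the question.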

Proof
- The best test data is no test data
- Some languages have theorem-proving capabilities
- Properties become theorems; no proof, no program
- Program extraction to other languages
- Completeness proofs
- rev example: http://is.gd/EhanO1

Resources
- QuickCheck: A Lightweight Tool for Random Testing of Haskell Programs (2000), Koen Claessen, John Hughes: http://is.gd/mpsY7G
- Automated Unit Testing your Java using ScalaCheck by Tony Morris: http://is.gd/j0R7qq
- UCSD CSE 230 lecture: http://is.gd/0YfxOr
- QuickCheck: Beyond the Basics by Dave Laing: http://is.gd/pGKnhg
- Recommended Haskell learning path: https://github.com/bitemyapp/learnhaskell

Thanks for listening
Copyright 2015 Fraser Tweedale
This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
- Feedback: http://devconf.cz/f/72
- Slides: https://github.com/frasertweedale/talks/
- Email: [email protected]
- Twitter: @hackuador

Questions?
