reality is expensive: a better way of thinking about mock objects

imagine: testing landing gear

if our tests prove that our code works

then "fake" means "cheat," right?

we're all cheaters

selenium

A user who moves the mouse & keyboard exactly how you tell it to. Over and over. Every time.

test data

Runner cleans the database on every test run. Got years of data migrations? No problem!

stamina

Runtime is always started fresh for tests. Subtle memory leak? Tests won't mind!

web scale

Tests run against a single-process runner or server. Race conditions? No deadlocks here!

reality is expensive

why not budget it?

so, instead of asking

is this test too fake?

perhaps, we should ask

how much reality does this test need?

we talked about unknown fakeness

but what about known fakeness?

know thy fakeness

1. identify fake things we already consciously do in our tests
2. understand those fake things well enough to appraise their costs & benefits
3. write more valuable tests by strategically deciding how much reality to give them

oft-overheard at conferences

"Mocks are Good!" "Mocks are Bad!"

"Only Mock What You Own!" "Only Mock external systems!"

"Mock everything!" "Avoid over-mocking!"

I'd like some nuance

I'd like some rigor

I'd like some thoughtfulness

types of tests

end-to-end test: Tester → Dog App

integration test: DogFeederTest → DogFeeder → DogRepo, BoneRepo → Database

"unit" test: DogFeederTest → DogFeeder → DogRepo, BoneRepo → Fake Database

isolation test: DogFeederTest → DogFeeder → Fake DogRepo, Fake BoneRepo
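
The difference between the "unit" test and the isolation test can be sketched in Python. The class names come from the slides, but every method name (`hungry_dogs`, `take_bone`, `feed_all`) and the shape of `FakeDatabase` are assumptions made up for illustration; the deck does not show any code.

```python
from unittest.mock import Mock


class FakeDatabase:
    """In-memory stand-in for a real database (hypothetical interface)."""

    def __init__(self, dogs, bones):
        self.dogs = list(dogs)
        self.bones = bones


class DogRepo:
    def __init__(self, db):
        self.db = db

    def hungry_dogs(self):
        return list(self.db.dogs)


class BoneRepo:
    def __init__(self, db):
        self.db = db

    def take_bone(self):
        if self.db.bones == 0:
            raise LookupError("out of bones")
        self.db.bones -= 1
        return "bone"


class DogFeeder:
    def __init__(self, dog_repo, bone_repo):
        self.dog_repo = dog_repo
        self.bone_repo = bone_repo

    def feed_all(self):
        # One bone per hungry dog.
        return {dog: self.bone_repo.take_bone()
                for dog in self.dog_repo.hungry_dogs()}


def unit_test_with_fake_database():
    # "unit" test: the repos are real, only the database is fake.
    db = FakeDatabase(dogs=["Rex", "Fido"], bones=2)
    feeder = DogFeeder(DogRepo(db), BoneRepo(db))
    assert feeder.feed_all() == {"Rex": "bone", "Fido": "bone"}
    assert db.bones == 0


def isolation_test_with_fake_repos():
    # isolation test: every collaborator of DogFeeder is a test double.
    dog_repo = Mock()
    dog_repo.hungry_dogs.return_value = ["Rex"]
    bone_repo = Mock()
    bone_repo.take_bone.return_value = "bone"
    feeder = DogFeeder(dog_repo, bone_repo)
    assert feeder.feed_all() == {"Rex": "bone"}
    bone_repo.take_bone.assert_called_once_with()


unit_test_with_fake_database()
isolation_test_with_fake_repos()
```

The first test buys more reality (the repos really run) at the price of more setup; the second only specifies how `DogFeeder` talks to its collaborators.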

mock, n. \ˈmäk, ˈmȯk\
The popular term for a fake object that takes the place of a real object when a test is executed.

test double, n. \ˈtest də-bəl\
The precise term for a fake object that takes the place of a real object when a test is executed.

think "stunt double"

test double: our silly catch-all term

fake: an alternative implementation that stands in for something you depend on

stub: replies to certain messages with responses that help you write tests

mock: ensures certain messages are received (and explodes upon receiving any unexpected messages)

spy: stealthily records all interactions, allowing you to make assertions about them later
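
A rough Python sketch of the four kinds of test double defined above. `InMemoryBoneRepo`, `StubBoneRepo`, and `StrictMock` are hypothetical names written by hand for this illustration; only `unittest.mock.Mock` is a real library class, and it is used here as a spy because it records calls and lets you assert on them afterwards.

```python
from unittest.mock import Mock


class InMemoryBoneRepo:
    """fake: a working alternative implementation."""

    def __init__(self):
        self._bones = []

    def add(self, bone):
        self._bones.append(bone)

    def count(self):
        return len(self._bones)


class StubBoneRepo:
    """stub: replies to a message with a canned response."""

    def count(self):
        return 3


class StrictMock:
    """mock: expectations are declared up front; anything else explodes."""

    def __init__(self, expected):
        self._expected = list(expected)  # message names, in order

    def __getattr__(self, name):
        # Only reached for names that are not real attributes.
        def receive(*args, **kwargs):
            if not self._expected or self._expected[0] != name:
                raise AssertionError(f"unexpected message: {name}")
            self._expected.pop(0)
        return receive

    def verify(self):
        assert not self._expected, f"never received: {self._expected}"


fake = InMemoryBoneRepo()
fake.add("femur")
assert fake.count() == 1

assert StubBoneRepo().count() == 3

mock = StrictMock(expected=["add"])
mock.add("femur")
mock.verify()

spy = Mock()
spy.add("femur")                           # recorded silently
spy.add.assert_called_once_with("femur")   # asserted later
```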

how should we use them?

hold your fake horses!

before the how

why should we use them?

why do we test at all?

ACCEPTANCE: prove the app works as promised

SPECIFICATION: examples of how the code behaves

REGRESSION: prevent bugs from coming back

DESIGN: shape code by listening to tests

CHARACTERIZE: safely improve legacy code

ask, "what value will this test provide?"

value-oriented test strategies

prove it works

minimize fakeness: only write end-to-end & integration tests

value
• confidence that passing equals working
• fewer tests → less effort spent writing tests

cost
• tests can't provide many design cues
• slow feedback
• high coverage is infeasible

[chart: build duration over the passage of time; what we expect vs. what actually happens once you account for growth of the system]

mocking boundaries

fake remote systems, never your app's code; primarily write unit & end-to-end tests

value
• each object is exercised by many tests
• practical tests without deep knowledge of test doubles

cost
• tests give less feedback about interactions
• one change → many test fixes
• end-to-end is extra redundant
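
A small sketch of what "fake remote systems, never your app's code" might look like in Python. All names here (`FakeVetApi`, `ReminderFormatter`, `CheckupScheduler`, and their methods) are invented for illustration and do not appear in the deck. The remote service is the only test double; both app objects stay real.

```python
class FakeVetApi:
    """Stands in for a remote HTTP service: the only fake in the test."""

    def __init__(self):
        self.booked = []

    def book(self, dog, day):
        self.booked.append((dog, day))
        return {"confirmation": f"{dog}-{day}"}


class ReminderFormatter:
    """App code: stays real."""

    def format(self, dog, confirmation):
        return f"{dog} is booked ({confirmation})"


class CheckupScheduler:
    """App code: stays real, and is the subject of the test."""

    def __init__(self, vet_api, formatter):
        self.vet_api = vet_api
        self.formatter = formatter

    def schedule(self, dog, day):
        reply = self.vet_api.book(dog, day)
        return self.formatter.format(dog, reply["confirmation"])


def test_schedule():
    vet_api = FakeVetApi()
    scheduler = CheckupScheduler(vet_api, ReminderFormatter())
    assert scheduler.schedule("Rex", "monday") == "Rex is booked (Rex-monday)"
    assert vet_api.booked == [("Rex", "monday")]


test_schedule()
```

Note how the trade-offs listed above show up: `ReminderFormatter` is exercised by the scheduler's test as well as its own, so changing its output format would require fixing both.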

case-by-case

tailor your testing approach to the needs of each situation

value
• developers free to choose best approach
• your team is likely already doing it!

cost
• test doubles easily abused
• no strategy → murky test value
• "wanna mock that?" timesink

the GOOS way (Growing Object-Oriented Software, Guided by Tests)

isolation tests to drive design; end-to-end tests to prove it works

value
• TDD gives rich feedback about interactions
• consistent, fast, complete
• easy to extract from app to libs

cost
• requires discipline & practice
• awkward "in-between" tests
• frameworks dislike isolation

my favorite strategy

1. agree on literally any clear strategy
2. follow that strategy consistently

test double smells

smell #1: test doubles are used in integration tests
perhaps: test integrity was compromised to address painful setup or assertions
suggestion: respond to pain by changing your code, not your tests

smell #2: test replaces methods on the subject-under-test with test doubles
perhaps: subject has too many responsibilities
suggestion: extract a new object as a collaborator of the subject
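
One way this smell and its suggested fix could look in Python. The classes (`DogFeederBefore`, `BonePicker`, `DogFeederAfter`) and their methods are hypothetical, made up for this sketch; `patch.object` and `Mock(spec=...)` are real `unittest.mock` features.

```python
from unittest.mock import Mock, patch


class DogFeederBefore:
    def feed(self, dog):
        return f"{dog} ate {self.pick_bone()}"

    def pick_bone(self):
        raise RuntimeError("talks to a real warehouse")


def smelly_test():
    feeder = DogFeederBefore()
    # Smell: the test replaces a method on the very object it is testing.
    with patch.object(feeder, "pick_bone", return_value="femur"):
        assert feeder.feed("Rex") == "Rex ate femur"


class BonePicker:
    """The extracted responsibility, now a collaborator."""

    def pick_bone(self):
        raise RuntimeError("talks to a real warehouse")


class DogFeederAfter:
    def __init__(self, bone_picker):
        self.bone_picker = bone_picker

    def feed(self, dog):
        return f"{dog} ate {self.bone_picker.pick_bone()}"


def cleaner_test():
    # The double replaces a collaborator; the subject itself stays whole.
    picker = Mock(spec=BonePicker)
    picker.pick_bone.return_value = "femur"
    assert DogFeederAfter(picker).feed("Rex") == "Rex ate femur"


smelly_test()
cleaner_test()
```

Both tests pass, but only the second one exercises the subject exactly as it runs in production.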

smell #3: your objects extend framework classes that you don't own
perhaps: inherited behavior inhibits good test isolation
suggestion: avoid such extension or forgo isolation testing of those classes

smell #4: test contains lots of code just to configure test doubles
perhaps: the unit's number of collaborators or their collective surface area is too large
suggestion: reduce the count of methods on which the subject depends

smell #5: test doubles are used to stand in for 3rd party APIs
perhaps: test pain isn't actionable, because design of 3rd party code can't be changed
suggestion: wrap third-party libraries in small abstractions you own
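
A minimal sketch of the wrapping suggestion, in Python. `HttpGateway`, `BreedLookup`, and the `example.invalid` URL are hypothetical; the standard library's `urllib.request.urlopen` plays the role of the code you don't own here only so the example stays self-contained. Tests replace the small wrapper, never the library itself.

```python
import urllib.request
from unittest.mock import Mock


class HttpGateway:
    """Small wrapper we own; the only place that knows about urllib."""

    def get_text(self, url):
        with urllib.request.urlopen(url, timeout=5) as response:
            return response.read().decode("utf-8")


class BreedLookup:
    """App code that depends on our wrapper, not on the library."""

    def __init__(self, gateway):
        self.gateway = gateway

    def breed_of(self, dog):
        url = f"https://example.invalid/breeds/{dog}"  # placeholder URL
        return self.gateway.get_text(url).strip()


def test_breed_lookup():
    # The double stands in for an interface we control, so any pain in
    # this test can be fixed by changing HttpGateway's design.
    gateway = Mock(spec=HttpGateway)
    gateway.get_text.return_value = "beagle\n"
    assert BreedLookup(gateway).breed_of("Rex") == "beagle"
    gateway.get_text.assert_called_once_with(
        "https://example.invalid/breeds/Rex")


test_breed_lookup()
```

The wrapper itself is left to an integration test against the real service, since faking `urlopen` would bring the smell back.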

blame the code before the test

blame the code before the test double

www.testdouble.com @andrewvida @jasonkarns @dmosher @joelhelbling @searls @kbaribeau @timwingfield @toddkaufman


by Justin Searls
