A Practical Taxonomy of Bugs and How to Squash Them - We Rise 2017

A COMMON TAXONOMY OF BUGS AND HOW TO SQUASH THEM

Table of Contents
• Instinctual Indications
• Research Methods
• Practical Taxonomy
• Bohrbug
• Schrödinbug
• Fractalbug
• Heisenbug
• Mandelbug
• Resources

Debugging Skills

“As you familiarize yourself with the application, you’ll build up
some debugging instincts.”

“Whenever I see something like this happening, the first thing
I do is scan the logs to see if this process is completing or is sending a weird message.”

Debugging Instincts

“Whenever I see #{x}, I always check #{y}”

Instincts are just internalized rulesets
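
One way to picture an internalized ruleset is a lookup table from an observed symptom to the first thing to check. This is a minimal sketch only; the symptoms, checks, and the `first_check` helper are hypothetical illustrations, not rules given in the talk.

```python
# Hypothetical ruleset: each entry reads
# "Whenever I see <symptom>, I always check <first check>".
DEBUGGING_RULES = {
    "background job never finishes": "scan the logs for the job's completion message",
    "connection refused errors everywhere": "check disk usage on the server",
}


def first_check(symptom: str) -> str:
    """Return the internalized first check for a symptom, or a default."""
    return DEBUGGING_RULES.get(
        symptom, "observe and classify before changing anything"
    )
```

Writing the rules down this way is also what makes them shareable with teammates who have not built the same instincts yet.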

Research Methods
• containment sometimes takes priority over squashing
• we can only work with facts
• we can’t squash every bug in this talk

Observable Attributes

Phenetics

Warning: Contrived Scenarios Ahead

Taxonomy of Bugs: Upsettingly Observable, Wildly Chaotic

Upsettingly Observable Bugs

Wildly Chaotic Bugs

How to Squash Them

Upsettingly Observable Bug #1

Observable Attributes
• is the bug observable in production?
• can it be reproduced locally?
• does it seem to be restricted to one area?

Bohrbug
• deterministic, highly reproducible
• commonly found in code, sometimes on the server
• likes to hide in complex branching in functions, classes, or config
• in the wild: validation
• easy to squash

Reproduction & Resolution
• replicate locally and in test
• write the simple solution
• rewrite to be highly readable and extendable
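
The three steps above might look like this for a contrived validation Bohrbug. The rule (a quantity is valid from 1 to 100 inclusive) and every name below are assumptions made up for illustration; the talk does not give a concrete example.

```python
# Contrived rule (an assumption for illustration): a quantity is valid
# when it is a whole number between 1 and 100 inclusive.


def is_valid_quantity_buggy(value: int) -> bool:
    # Deterministic bug hiding in branching: the upper bound is excluded.
    if value < 1:
        return False
    elif value < 100:
        return True
    return False


def is_valid_quantity(value: int) -> bool:
    # Readable rewrite: one expression that states the rule directly.
    return 1 <= value <= 100


def test_upper_bound_is_accepted() -> None:
    # Written first to replicate the bug: it fails every time against the
    # buggy version and passes against the rewrite.
    assert is_valid_quantity(100)
```

Because a Bohrbug is deterministic, the failing test doubles as the reproduction and as the regression guard once the fix is in.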

Upsettingly Observable Bug #2

Observable Attributes
• how does this work?
• does this work?
• wait, what is this even testing?
• did this ever work?

Schrödinbug
• stick-like body; appendages look like twigs
• likes to pretend to be working code; on close inspection, reveals itself to be a bug

Schrödinbug Type I
• code that never worked
• reveals itself via side effects
• in the wild: the UI shows an update but the database entry is not updated

Schrödinbug Type II
• code that doesn’t work how you thought
• in the wild: the same function being called multiple times

Basic Reproduction & Resolution
• replicate locally and in test
• write the simple solution
• rewrite to be highly readable and extendable

“How can I reproduce this without knowing exactly what
is happening?” - You

Tool: Logging as Verification
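
As a rough sketch of logging used for verification, the example below logs every call to a function so that a Type II symptom (the same function being called multiple times) shows up as repeated log lines. It uses Python's standard `logging` module; the function names and the duplicated call are hypothetical stand-ins, not code from the talk.

```python
import logging

logger = logging.getLogger("schrodinbug_demo")


def send_welcome_email(user_id: int, outbox: list) -> None:
    # The log line records every call so the call count can be verified.
    logger.info("send_welcome_email called for user_id=%s", user_id)
    outbox.append(user_id)


def register_user(user_id: int, outbox: list) -> None:
    send_welcome_email(user_id, outbox)
    # Hypothetical bug: a second code path triggers the same function again.
    send_welcome_email(user_id, outbox)


if __name__ == "__main__":
    logging.basicConfig(level=logging.INFO, format="%(asctime)s %(message)s")
    register_user(42, [])
```

Two log lines for one registration turn a suspicion into a fact that can be worked with.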

Tool: Git Bisect
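
`git bisect` (started with `git bisect start`, then marked with `git bisect good` and `git bisect bad`) narrows down the commit that introduced a problem by binary search between a known-good and a known-bad revision. The sketch below only illustrates that search idea in Python; it is not how Git implements it, and `first_bad_commit` and the commit names are hypothetical.

```python
from typing import Callable, Sequence


def first_bad_commit(commits: Sequence[str], is_good: Callable[[str], bool]) -> str:
    """Binary-search an ordered history for the first commit that fails.

    Assumes commits[0] is known good, commits[-1] is known bad, and every
    commit after the first bad one is also bad.
    """
    good, bad = 0, len(commits) - 1
    while bad - good > 1:
        middle = (good + bad) // 2
        if is_good(commits[middle]):
            good = middle  # the breakage happened after this commit
        else:
            bad = middle  # the breakage happened at or before this commit
    return commits[bad]


if __name__ == "__main__":
    history = ["a1", "b2", "c3", "d4", "e5"]  # oldest to newest
    # Stand-in for running the test suite at each commit.
    print(first_bad_commit(history, lambda c: history.index(c) < 3))
```

Each check halves the remaining range, which is why bisecting stays practical even over a long history.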

Reproduction & Resolution
• reproduce the “broken” state locally and in test
• add log statements until you can verify what causes the broken state
• if the code did work at some point, find the point at which it did work
• write tests to represent the configuration and flow of the fixed state

Wildly Chaotic Bug #1

Observable Attributes
• does it appear non-deterministic?
• does it seem to disappear once you observe or debug it?

Heisenbug
• “now you see it, now you don’t”
• Type I: lives in code
• Type II: lives in data

“How can I reproduce this without testing on production?” - You

Tool: Profiling for Verification (https://kcachegrind.github.io/html/CallgrindFormat.html)

Tool: Flame Graphs (http://www.brendangregg.com/FlameGraphs/cpu-mysql-updated.svg)

Heisenbug Type I: profiling can reveal what is being called and when

Heisenbug Type II: profiling can reveal how much time is being spent
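
A small sketch of both ideas, using Python's standard `cProfile` and `pstats` modules in place of the callgrind tooling linked above. The report lists which functions were called, how many times, and how much cumulative time each took. `slow_lookup` and `handle_request` are hypothetical stand-ins, and note that `cProfile` itself adds some overhead to the code it measures.

```python
import cProfile
import io
import pstats
import time


def slow_lookup() -> None:
    # Hypothetical stand-in for a call that is slow only for certain data.
    time.sleep(0.05)


def handle_request() -> None:
    for _ in range(3):
        slow_lookup()


def profile_report() -> str:
    profiler = cProfile.Profile()
    profiler.enable()
    handle_request()
    profiler.disable()

    # Render call counts and cumulative time per function into a string.
    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    stats.sort_stats("cumulative").print_stats(10)
    return buffer.getvalue()


if __name__ == "__main__":
    print(profile_report())
```

The call count column speaks to the Type I question (what is being called), and the cumulative time column to the Type II question (where the time goes).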

Reproduction & Resolution
• use profiling to find the trigger state
• use the app (not fixtures or DB manipulation) to get the data into this state
• recreate that state in test
• follow the Bohrbug instructions

Wildly Chaotic Bug #2

Observable Attributes
• is everything broken?
• all of it?
• send help??

Mandelbug
• seems like everything is broken at once
• people are very upset with you
• likely an issue with your system, not your code

“The bug is huge and everywhere at once. SQL: could
not connect to server: Connection refused was bubbling up all over the place. Jobs won’t run, emails won’t send, every submit button on the site fatal errored.”
- on-call log

Tool: Disk Usage (df -h)

Tool: Logging as Verification

Reproduction & Resolution
• use df -h to find out whether all the storage is being used
• attempt to connect to the server and view logs
• can the culprit be restarted, rotated, or killed at this time?
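
The first step above can also be scripted. This is a minimal sketch using Python's standard `shutil.disk_usage`, which is a different tool from `df -h`, so its percentage may not match `df`'s Use% column exactly. The helper names and the 95% threshold are assumptions for illustration.

```python
import shutil


def disk_usage_percent(path: str = "/") -> float:
    """Return the percentage of the filesystem containing `path` that is used."""
    usage = shutil.disk_usage(path)
    return 100.0 * usage.used / usage.total


def is_disk_nearly_full(path: str = "/", threshold_percent: float = 95.0) -> bool:
    # The 95% default threshold is an arbitrary assumption for illustration.
    return disk_usage_percent(path) >= threshold_percent


if __name__ == "__main__":
    print(f"{disk_usage_percent('/'):.1f}% used on /")
```

A check like this could run on a schedule so that a filling disk is noticed before everything breaks at once.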

A Practical Taxonomy of Bugs
• Upsettingly Observable: bohrbug, schrödinbug
• Wildly Chaotic: heisenbug, mandelbug

“Debugging Instincts”

Debugging Skills

• Observe & Classify
• Verify with logging and time travel
• Verify without changing state by profiling
• Use server tools to observe entire process

Build Up Your Own Toolkit and Share it

Resources & Further Study
• “Linux Debugging Tools I Love”, Julia Evans
• Systems Performance, Brendan Gregg
• “Why Do Computers Stop and What Can Be Done About It?”, Jim Gray
• “Debug Patterns for Efficient High-level SystemC Debugging”, Frank Rogin, Erhard Fehlauer, Christian Haufe, Sebastian Ohnewald
