A Practical Taxonomy of Bugs and How to Squash Them-RubyConf Italy 2016

A PRACTICAL TAXONOMY OF BUGS AND HOW TO SQUASH THEM

Instinctual Indications…6 Research Methods…9 Practical Taxonomy…13 Bohrbug…17 Schrödinbug…25 Fractalbug…24 Heisenbug…35
Mandelbug…44 Resources…55 Table of Contents

Debugging Skills

“As you familiarize yourself with the application, you’ll build up
some debugging instincts"

“Whenever I see something like this happening, the first thing
I do is scan the logs to see if this process is completing or is sending a weird message.”

Debugging Instincts “ ”

“Whenever I see #{x}, I always check #{y}”

Research Methods • containment sometimes takes priority over squashing •
we can only work with facts • we can’t squash every bug in this talk

Observable Attributes

Phenetics { ]

Warning: Contrived Scenarios Ahead

A Practical Taxonomy of Bugs Upsettingly Observable Wildly Chaotic {

Upsettingly Observable

Wildly Chaotic

How to Squash Them

upsettingly observable bug #1 UPSETTINGLY OBSERVABLE

Observable Attributes is the bug observable in production? can it
be reproduced locally? does it seem to be restricted to one area?

Bohrbug deterministic, highly reproducible UPSETTINGLY OBSERVABLE

Bohrbug Commonly found in code,sometimes on server UPSETTINGLY OBSERVABLE

Bohrbug likes to hide in complex branching in functions, classes
or config UPSETTINGLY OBSERVABLE

Bohrbug In the wild: validation UPSETTINGLY OBSERVABLE

Reproduction & Resolution replicate locally and in test write the
simple solution rewrite to be highly readable and extendable UPSETTINGLY OBSERVABLE

Bohrbug UPSETTINGLY OBSERVABLE

upsettingly observable bug #2 UPSETTINGLY OBSERVABLE

Observable Attributes how does this work? does this work? wait,
what is this even testing? did this ever work?

Schrödinbug stick-like body appendages look like twigs UPSETTINGLY OBSERVABLE

Schrödinbug Likes to pretend to be working code. On close
inspection, reveals itself to be a bug. UPSETTINGLY OBSERVABLE

Schrödinbug In the wild: code that never worked UPSETTINGLY OBSERVABLE

Schrödinbug In the wild: it didn’t work how you thought
it did UPSETTINGLY OBSERVABLE

Logging as Verification Tool

Git Bisect Tool

Reproduction & Resolution reproduce the “broken” state locally and in
test add log statements until you can verify what causes the broken state. if the bug did work at some point, find the point at which it did work. write tests to represent the configuration and flow of the fixed state

Schrödinbug UPSETTINGLY OBSERVABLE

wildly chaotic bug #1 WILDLY CHAOTIC

Observable Attributes Does it appear non-deterministic? Does it seem to
disappear once you observe or debug it?

Heisenbug “now you see it, now you don’t” WILDLY CHAOTIC

Heisenbug WILDLY CHAOTIC In the wild: a heisenbug that lives
in code

Heisenbug WILDLY CHAOTIC In the wild: a heisenbug that lives
in data

Profiling for Verification https://kcachegrind.github.io/html/CallgrindFormat.html Tool

FLAME GRAPHS http://www.brendangregg.com/FlameGraphs/cpu-mysql-updated.svg Tool

Reproduction & Resolution use profiling to find the trigger state
use the app (not fixtures or DB manipulation) to get the data in this state recreate that state in test follow borhbug instruction

Heisenbug WILDLY CHAOTIC

wildly chaotic bug #2 WILDLY CHAOTIC

Observable Attributes is everything broken? all of it? send help??

Mandelbug WILDLY CHAOTIC

Mandelbug seems like everything is broken at once WILDLY CHAOTIC

Mandelbug people are very upset with you WILDLY CHAOTIC

Mandelbug likely an issue with your system, not code WILDLY
CHAOTIC

“The bug is huge and everywhere at once. SQL: could
not connect to server: Connection refused was bubbling up all over the place. Jobs won’t run, emails won’t send, every submit button on the site fatal errored.” on-call log 24 June 2014 WILDLY CHAOTIC

Disk Usage Tool df -h

Reproduction & Resolution attempt to connect to server & view
logs use df -h to find if all the storage is being used can that be restarted, rotated or killed at this time?

Mandelbug WILDLY CHAOTIC

A Practical Taxonomy of Bugs Upsettingly Observable Wildly Chaotic {
bohrbug schrödinbug mandelbug heisenbug

“Debugging Instincts”

Debugging Skills

Observe & Classify

Verify with logging and time travel

Verify without changing state by profiling

Use linux server tools to observe entire process

Observe & Classify Verify with logging and time travel Verify
without changing state by profiling Use linux server tools to observe entire process

Build Up Your Own Toolkit and Share it

Resources & Further Study • “Linux Debugging Tools I Love”,
Julia Evans • Systems Performance, Brendan Gregg • Site Reliability Engineering, Betsy Beyer, Chris Jones, Jennifer Petoff, Niall Richard Murphy • “Why Do Computers Stop and What Can Be Done About It?”, Jim Gray • “Debug Patterns for Efficient High- levelSystemC Debugging”, Frank Rogin, Erhard Fehlauer, Christian Haufe, Sebastian Ohnewald

A Practical Taxonomy of Bugs and How to Squash ...

A Practical Taxonomy of Bugs and How to Squash Them-RubyConf Italy 2016

More Decks by Kylie

Other Decks in Programming

Featured

Transcript