Fuck Spreadsheets - first steps to become a data-driven company

Steven
August 07, 2015

A short introduction to the biggest problems with spreadsheets and Google Docs, and to methods for tackling them on the way to becoming a data-driven company.

Transcript

  1. “... Simply put, spreadsheets are good for quick and dirty work, but they are not designed for serious and reliable work. ... Spreadsheets make code review difficult. The code is hidden away in dozens if not hundreds of little cells. If you are not reviewing your code carefully and if you make it difficult for others to review it, how do you expect it to be reliable?” – Daniel Lemire
  2. “… once the FT cleaned up and simplified the data, the European numbers do not show any tendency towards rising wealth inequality after 1970. An independent specialist in measuring inequality shared the FT’s concerns.” – Financial Times, May 23rd 2014
  3. “I work in a large company and I can’t help but notice the way the business team uses Excel for everything. There are times when emergency meetings are called because the numbers don’t add up. Sometimes the issue is a single cell among 60,000 containing a typo in the formula (a dollar sign missing).” – Anonymous
  4. “Were the analyses in the AdWords reporting document used any further? I noticed an error in the formulas, which I am now going to correct. It has existed since calendar week 19 (inclusive), when additional campaigns were activated. Sorry about that. Specifically, values beyond row 55 were not included in the grand total at the bottom.” – Anonymous
  5. • Import of genetic data into databases
     • Genes: mrtA
     • Proteins: Mrta
     • Excel’s autocapitalisation sucks -> several thousand database entries were wrong
  6. “Documents have been and still are used in daily work that may originally be "owned" by employees who are no longer with the company. As a result, we had to restore one or two accounts. However, this is only possible until Wednesday. After that, data from deleted accounts cannot be recovered!” – Anonymous
  7. CONCLUSION
     • 88% of spreadsheets contain errors *
     • a simple mistake like misplacing a decimal point can result in huge errors
     * Studies (MarketWatch, 2013)
  8. CONCLUSION
     • You will lose your data. It might be a hard disk crash, or a computer virus, or maybe even a rogue employee deliberately entering incorrect information on a spreadsheet.
  9. CONCLUSION
     • Spreadsheets do not support testing. For anything that matters, you should validate and test your code automatically and systematically.
  10. CONCLUSION
      • Spreadsheets make code reviews impractical. To inspect the code, you need to look at every cell. In practice, this means that you cannot reasonably ask someone to read over your formulas to make sure that there is no mistake.
  11. CONCLUSION
      • Spreadsheets encourage redundancies. Spreadsheets encourage copy-and-paste. Though copying and pasting is sometimes the right tool, it also creates redundancies. These redundancies make it very difficult to update a spreadsheet: are you absolutely sure that you have changed the formula throughout?
  12. ADVANTAGES
      • Version control
      • you can revert versions
      • you can jump to older versions
      • changelog
  13. ADVANTAGES
      • Linked data
      • connect data cells with more context
      • object-based thinking
      • put comments in comment fields
  14. ADVANTAGES
      • Accessibility
      • everyone can work on the same data
      • prevent data islands
      • no problems with sets of different Excel sheets
  15. SHORT FACTS
      • previous company was sold to Salesforce
      • funding 2015: ca. 11 million dollars
      • pitched in Silicon Valley and supported by Ashton Kutcher :-P
  16. OTHER POSSIBILITIES
      • Google Docs API (but access rights should be restricted)
      • A self-developed app as an Excel replacement (please no)
  17. BENEFITS OF R
      • what is done in spreadsheets can be done in R
      • faster than spreadsheets
      • open-source
      • code review
      • version control
  18. but … we should be realistic: a department would need a person willing to learn and use R
  19. OTHER OPTIONS
      • every app visualises its own data
      • e.g. Jobmensa with New Relic Insights
      • a dedicated app or service that visualises all data gathered at our company
  20. DATA DRIVEN COMPANY
      • decisions based on data
      • if something bad happens, it’s possible to inspect all possible error sources
      • real-time marketing
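
Slides 4 and 9 describe two sides of the same problem: a total that silently stopped at row 55, and no automatic test that would have caught it. A minimal sketch of how that could look once the calculation lives in code follows. It is written in Python only for illustration (the deck itself recommends R), and the function name `total_cost` and the column names `campaign` and `cost` are hypothetical, not taken from the original report.

```python
import csv
import io
from decimal import Decimal


def total_cost(csv_text: str, column: str = "cost") -> Decimal:
    """Sum one column over every data row, however many rows there are."""
    reader = csv.DictReader(io.StringIO(csv_text))
    # Decimal avoids binary floating-point rounding in money values.
    return sum((Decimal(row[column]) for row in reader), Decimal("0"))


def test_total_includes_rows_added_later() -> None:
    # 60 campaigns at 1.50 each: rows past any fixed range must still count.
    lines = ["campaign,cost"] + [f"campaign_{i},1.50" for i in range(60)]
    assert total_cost("\n".join(lines)) == Decimal("90.00")


def test_empty_report_totals_zero() -> None:
    # A report with a header but no rows should total zero, not fail.
    assert total_cost("campaign,cost\n") == Decimal("0")


if __name__ == "__main__":
    test_total_includes_rows_added_later()
    test_empty_report_totals_zero()
    print("all checks passed")
```

Because the sum iterates over whatever rows exist, there is no range to forget to extend when new campaigns are added, and the two small tests can run on every change under version control.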