The Hardest Problem in Data

Ronnie Chen

August 24, 2017

Transcript

  1. The Hardest Problem in Data. Ronnie Chen (@rondoftw), Data Engineering, Slack. WriteSpeakCode 2017.
  2. → Machine learning → Predictive modeling → Neural networks → Artificial intelligence
  3. The Rules: 1. Only one number. 2. Convince me it's correct.
  4. Will I get the same number if... I ask every person you know if they consider you their friend?
  5. Will I get the same number if... I ask every person that knows you if they think you would consider them a friend?
  6. Is this the number of people that you'd tell a secret to?
  7. user_id | name  | email     | deleted
     1       | Alice | alice@*** |
     2       | Bob   | bob@***   | true
     3       | Carol |           |
  8. SELECT COUNT(*) FROM prod.users WHERE deleted IS NOT TRUE AND email IS NOT NULL
  9. SELECT COUNT(*) FROM prod.users WHERE last_active > '2017-07-24'
  10. Okay, I get it. But what's the big deal?
  11. 26% of professional computing jobs were held by women in 2016
  12. Numbers give you authority and the appearance of objectivity
  13. Counts can determine funding, set agendas, and shift priorities
  14. "Machine learning is like money laundering for bias" (Maciej Cegłowski, founder of @Pinboard)
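
As a hedged illustration of how one question ("how many users?") can return several different numbers, the sketch below loads the three rows from item 7 into an in-memory SQLite database and runs variants of the counting queries from items 8 and 9. The `last_active` values, the use of SQLite in place of the `prod.users` table, and the `count` helper are assumptions made for this example; none of them come from the talk.

```python
import sqlite3

# In-memory stand-in for prod.users (assumption: the real table is not available).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE users ("
    "user_id INTEGER, name TEXT, email TEXT, deleted BOOLEAN, last_active TEXT)"
)

# Rows from the example table; blank cells are stored as NULL.
# The last_active dates are hypothetical values invented for this sketch.
rows = [
    (1, "Alice", "alice@***", None, "2017-08-01"),
    (2, "Bob", "bob@***", True, "2017-08-10"),
    (3, "Carol", None, None, "2017-06-15"),
]
conn.executemany("INSERT INTO users VALUES (?, ?, ?, ?, ?)", rows)


def count(where: str) -> int:
    """Hypothetical helper: count the rows matching a WHERE clause."""
    return conn.execute(f"SELECT COUNT(*) FROM users WHERE {where}").fetchone()[0]


# SQLite stores booleans as 0/1, so "true" is written as 1 here.
counts = {
    "all rows": count("1 = 1"),
    # Comparing anything to NULL with != yields NULL, which filters the row out.
    "compared with NULL": count("deleted != 1 AND email != NULL"),
    # NULL-aware version: not marked deleted, and has an email address.
    "NULL-aware": count("(deleted IS NULL OR deleted != 1) AND email IS NOT NULL"),
    # Quoted date literal: compared as an ISO 8601 date string.
    "active since 2017-07-24": count("last_active > '2017-07-24'"),
    # Unquoted: 2017-07-24 is integer arithmetic (1986), not a date.
    "unquoted date": count("last_active > 2017-07-24"),
}

for label, n in counts.items():
    print(f"{label}: {n}")
```

Each variant is a plausible reading of "count the users", yet the results differ depending on how NULLs, deleted accounts, and date literals are handled, which is why a single number needs its definition stated alongside it.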