Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Best Practices for Using Machine Learning in Businesses in 2018 - Keynote at Budapest BI Forum Conference - Budapest, November 2018

szilard
November 04, 2018
100

Best Practices for Using Machine Learning in Businesses in 2018 - Keynote at Budapest BI Forum Conference - Budapest, November 2018

szilard

November 04, 2018
Tweet

More Decks by szilard

Transcript

  1. Best Practices for Using Machine Learning in Businesses in 2018

    Szilárd Pafka, PhD Chief Scientist, Epoch (USA) Budapest BI Forum Conference November 2018
  2. Disclaimer: I am not representing my employer (Epoch) in this

    talk I cannot confirm nor deny if Epoch is using any of the methods, tools, results etc. mentioned in this talk
  3. *

  4. 10x

  5. ML training: lots of CPU cores lots of RAM limited

    time ML scoring: separated servers
  6. “people that know what they’re doing just use open source

    [...] the same open source tools that the MLaaS services offer” - Bradford Cross
  7. already pre-processed data less domain knowledge (or deliberately hidden) AUC

    0.0001 increases "relevant" no business metric no actual deployment models too complex no online evaluation no monitoring data leakage
  8. Aggregation 100M rows 1M groups Join 100M rows x 1M

    rows time [s] time [s] “Motherfucka!”
  9. AI?