Machine Learning in Production with R or Python - Budapest Data Forum - June 2017

szilard
June 08, 2017

Transcript

  1. Machine Learning in Production with R. Szilárd Pafka, PhD, Chief Scientist, Epoch. Budapest Data Forum, June 2017
  2. Machine Learning in Production with R or Python. Szilárd Pafka, PhD, Chief Scientist, Epoch. Budapest Data Forum, June 2017
  3. Machine Learning in Production with R or maybe Python. Szilárd Pafka, PhD, Chief Scientist, Epoch. Budapest Data Forum, June 2017
  4. Disclaimer: I am not representing my employer (Epoch) in this talk. I can neither confirm nor deny whether Epoch is using any of the methods, tools, results, etc. mentioned in this talk.
  5. EC2
  6. n = 10K, 100K, 1M, 10M, 100M. Training time, RAM usage, AUC, CPU % by core. Read data, pre-process, score test data.
  7. n = 10K, 100K, 1M, 10M, 100M. Training time, RAM usage, AUC, CPU % by core. Read data, pre-process, score test data.
  8. 10x
  9. learn_rate = 0.1, max_depth = 6, n_trees = 300; learn_rate = 0.01, max_depth = 16, n_trees = 1000
  10. ...
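
The two hyperparameter settings on slide 9 can be written down as plain data. The sketch below is only an illustration: the transcript does not say which library or algorithm the settings were used with, so the assumption that they describe a gradient boosting model, the labels `setting_a` and `setting_b`, and the helper `to_sklearn_kwargs` are all hypothetical. The only library-specific fact relied on is that scikit-learn's `GradientBoostingClassifier` accepts arguments named `learning_rate`, `max_depth`, and `n_estimators`.

```python
# Hypothetical sketch: the two settings from slide 9 as plain dictionaries.
# The labels "setting_a" and "setting_b" are illustrative, not from the talk.
SLIDE_SETTINGS = {
    "setting_a": {"learn_rate": 0.1, "max_depth": 6, "n_trees": 300},
    "setting_b": {"learn_rate": 0.01, "max_depth": 16, "n_trees": 1000},
}

# Assumed mapping from the slide's parameter names to the argument names
# of scikit-learn's GradientBoostingClassifier.
SKLEARN_NAMES = {
    "learn_rate": "learning_rate",
    "max_depth": "max_depth",
    "n_trees": "n_estimators",
}


def to_sklearn_kwargs(setting):
    """Rename the slide's parameter names to scikit-learn style keyword arguments."""
    return {SKLEARN_NAMES[name]: value for name, value in setting.items()}


if __name__ == "__main__":
    for label, setting in SLIDE_SETTINGS.items():
        print(label, to_sklearn_kwargs(setting))
```

If scikit-learn is installed, the resulting dictionary could be passed as `GradientBoostingClassifier(**to_sklearn_kwargs(...))`; other libraries use different argument names, so the mapping would need to be adapted.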