A Benchmark of Open Source Tools for Machine Learning from R - useR! 2017 Conference - Brussels, July 2017

szilard
July 02, 2017

Transcript

  1. A Benchmark of Open Source Tools for Machine Learning from R

    Szilárd Pafka, PhD, Chief Scientist, Epoch. useR! 2017 Conference, Brussels, July 2017
  2. Disclaimer: I am not representing my employer (Epoch) in this talk.

    I can neither confirm nor deny whether Epoch is using any of the methods, tools, results, etc. mentioned in this talk.
  3. EC2

  4. n = 10K, 100K, 1M, 10M, 100M

    Training time, RAM usage, AUC, CPU % by core. Read data, pre-process, score test data.
  5. n = 10K, 100K, 1M, 10M, 100M

    Training time, RAM usage, AUC, CPU % by core. Read data, pre-process, score test data.
  6. 10x

  7. learn_rate = 0.1, max_depth = 6, n_trees = 300

    learn_rate = 0.01, max_depth = 16, n_trees = 1000
  8. ...

  9. R++
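
The measurement loop implied by slides 4, 5 and 7 (train at several sizes n, record training time, score held-out data, report AUC) might look roughly like the sketch below. It is a minimal illustration in standard-library Python, not the code behind the talk. The names `train_stub`, `make_data`, `run_benchmark`, and the config labels `shallow` and `deep` are hypothetical. The stub trainer ignores the hyperparameters, which a real tree-boosting library would presumably consume. RAM and per-core CPU monitoring are left out.

```python
import random
import time

# Hyperparameter settings transcribed from slide 7; the labels are hypothetical.
CONFIGS = {
    "shallow": {"learn_rate": 0.1, "max_depth": 6, "n_trees": 300},
    "deep": {"learn_rate": 0.01, "max_depth": 16, "n_trees": 1000},
}

# Training-set sizes from slides 4 and 5.
SIZES = [10_000, 100_000, 1_000_000, 10_000_000, 100_000_000]


def auc(labels, scores):
    """Area under the ROC curve via the rank-sum (Mann-Whitney) identity."""
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC needs at least one positive and one negative label")
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        # Tied scores share the average of their 1-based ranks.
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    pos_rank_sum = sum(r for r, y in zip(ranks, labels) if y == 1)
    return (pos_rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def make_data(n, seed):
    """Synthetic binary data: one Gaussian feature shifted by the label."""
    rng = random.Random(seed)
    labels = [rng.randint(0, 1) for _ in range(n)]
    features = [rng.gauss(y, 1.0) for y in labels]
    return features, labels


def train_stub(features, labels, params):
    """Hypothetical stand-in for a real trainer; it ignores params."""
    pos = [x for x, y in zip(features, labels) if y == 1]
    neg = [x for x, y in zip(features, labels) if y == 0]
    midpoint = (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2
    return lambda x: x - midpoint  # higher score means "more likely positive"


def run_benchmark(sizes, configs, train_fn=train_stub, n_test=2_000):
    """Time training for each (n, config) pair and score a fixed test set."""
    test_x, test_y = make_data(n_test, seed=0)
    rows = []
    for n in sizes:
        train_x, train_y = make_data(n, seed=n)
        for name, params in configs.items():
            start = time.perf_counter()
            model = train_fn(train_x, train_y, params)
            seconds = time.perf_counter() - start  # training time only
            score = auc(test_y, [model(x) for x in test_x])
            rows.append({"n": n, "config": name,
                         "train_seconds": seconds, "auc": score})
    return rows


if __name__ == "__main__":
    # Only the two smallest sizes, so the sketch finishes quickly.
    for row in run_benchmark(SIZES[:2], CONFIGS):
        print(row)
```

Swapping `train_stub` for a wrapper around an actual library keeps the rest of the loop unchanged. The timer deliberately surrounds the training call alone, so data reading, pre-processing and test scoring stay outside the reported training time.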