Machine Learning in Production - LA Data Science/Machine Learning Meetup - May 2017

szilard
May 10, 2017

Transcript

  1. Machine Learning in Production. Szilárd Pafka, PhD, Chief Scientist, Epoch. LA Data Science/Machine Learning Meetup, May 2017
  2. Disclaimer: I am not representing my employer (Epoch) in this talk. I cannot confirm nor deny if Epoch is using any of the methods, tools, results etc. mentioned in this talk.
  3. ...

  4. Machine Learning as a Service

    [...] The bottom line on why it doesn’t work: the people that know what they’re doing just use open source, and the people that don’t will not get anything to work, ever, even with APIs. [...] Amazon, Google, and Microsoft are all trying to sell a MLaaS layer as a component of their cloud strategy. [...] The problem here is a very practical matter; the MLaaS solutions have no customer segment -- they serve neither the competent nor the incompetent customer segment. The competent segment: you need machine learning people to build real production machine learning models, because it is hard to train and debug these things properly, and it requires a mix of understanding both theory and practice. These machine learning people tend to just use the same open source tools that the MLaaS services offer. So this knocks out the competent customer segment. [...] The incompetent segment: [...]

    http://www.bradfordcross.com/blog/2017/3/3/five-ai-startup-predictions-for-2017
  5. - Kaggle: already pre-processed data (focus on ML)
    - sometimes less domain knowledge (or deliberately hidden)
    - focus on some measure of accuracy only; 0.0001 increases are "relevant"
    - ensembles create overly complex models
    - never deployed in prod
    - data leakage: hard to fix on Kaggle even if discovered
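
One possible reading of the point about 0.0001 accuracy increases is that gains this small are often below the sampling noise of the evaluation itself. The sketch below is an illustration, not something from the talk: it assumes a hypothetical test set of 100,000 independent examples and an accuracy of 0.90, and uses the simple binomial standard error as a rough yardstick (a paired comparison of two models on the same test set would be more precise).

```python
import math


def accuracy_standard_error(accuracy: float, n_test: int) -> float:
    """Binomial standard error of an accuracy estimate on n_test independent examples."""
    return math.sqrt(accuracy * (1.0 - accuracy) / n_test)


# Hypothetical numbers, chosen only for illustration (not from the talk).
acc = 0.90
n_test = 100_000
gain = 0.0001  # the size of improvement mentioned on the slide

se = accuracy_standard_error(acc, n_test)

print(f"standard error of accuracy: {se:.5f}")
print(f"gain as a fraction of one standard error: {gain / se:.2f}")
```

Under these assumed numbers the gain is a small fraction of one standard error, so it could not be told apart from noise on a test set of that size.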