Upgrade to Pro — share decks privately, control downloads, hide ads and more …

10 Pitfalls in Data Science - LA Data Science Meetup Kick-Off - Feb 2014

szilard
March 18, 2014
160

10 Pitfalls in Data Science - LA Data Science Meetup Kick-Off - Feb 2014

szilard

March 18, 2014
Tweet

More Decks by szilard

Transcript

  1. 10 Pitfalls in Data Science Szilárd Pafka, PhD Chief Scientist,

    Epoch LA Machine Learning Meetup Data Science Track Feb 2014
  2. (Some) Pitfalls • DS = IT project • DS isolated

    from business • Restricted access to data • Not enough EDA/cleaning • Data leakage • Overfitting • Optimizing wrong metric • Skip model validation • Too complex to deploy • Poor communication