Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Applying deployment oriented mindset for buildi...

Applying deployment oriented mindset for building Machine Learning models at Data Science fwdays '19

Developing a complicated ensemble model with hundreds of features fetched from a bunch of different sources? Give me two! Showing great metrics to the stakeholders and already discussing how it will hit a home run in production? Why not! And then getting stuck for months trying to deploy the model and fighting with data inconsistency and bugs? Sounds familiar?

This talk will focus on providing guidelines on how to build your model development process keeping in mind the deployment phase to come later on.

Marianna Diachuk

September 07, 2019
Tweet

More Decks by Marianna Diachuk

Other Decks in Programming

Transcript

  1. AGENDA ➔ Why am I telling you this? ➔ What

    I mean by deployment? ➔ Deployment problems ➔ Model development process ➔ What deployment oriented mindset gives you 2
  2. WHY AM TELLING YOU THIS? - I developed and deployed

    to production 7 scoring and antifraud ensemble models - I leaded a small but proud team of 2 data scientists and 1 data engineer - I’m a mother of 3 dragons ducks 3
  3. DEPLOYMENT PROBLEMS (SOME OF THEM) ➢ Model response inconsistency 7

    research environment development environment
  4. DEPLOYMENT PROBLEMS (SOME OF THEM) ➢ Model response inconsistency 8

    research environment development environment dataset
  5. DEPLOYMENT PROBLEMS (SOME OF THEM) ➢ Model response inconsistency ➢

    Impossibility to implement features calculations ➢ Features inconsistency ➢ Model is not scalable and so on and so on… 10
  6. How fast should our model respond? Are there any lim

    itations for deploym ent? Can we fetch the data from DB quickly? BUSINESS UNDERSTANDING STAGE Can developers help us with deploym ent? Should we worry about features calculation time? 14
  7. BUSINESS UNDERSTANDING STAGE Pay attention to: ! Model response time

    ! Feature calculation time ! Database response ! Human resources availability 16
  8. Ouch… It was one of the top features :( We

    removed one field from the application form. WORK CLOSELY WITH YOUR COLLEAGUES. 20
  9. REFACTOR YOUR CODE 1. Write your feature calculation 4. Improve

    your code efficiency 2. Test the feature 3. Improve your code readability 22
  10. WHAT DEPLOYMENT ORIENTED MINDSET GIVES YOU. - Easier to estimate

    tasks - Less risks in the late phases -> less postponed deadlines - Better to understand responsibilities distribution 30