Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Making machine learning model deployment boring

Ruurtjan
September 19, 2019

Making machine learning model deployment boring

The free lunch for machine learning is over. Organizations are quickly ramping up their abilities to automate and professionalize their machine learning processes and infrastructure. As a consequence organizational goals, processes and requirements put an increasing burden on teams to put machine learning models in production. We believe much of this burden relates to engineering issues, which with proper abstractions can be greatly reduced for product teams. In this presentation we will talk about the organizational context of ING and the design our Machine Learning Platform. In the first part we will sketch some organizational context and the requirements it brings. Next, we will picture the kind of use cases and user journey we have in mind. Finally, we will present how these considerations led the platform design we are currently deploying.

Ruurtjan

September 19, 2019
Tweet

More Decks by Ruurtjan

Other Decks in Technology

Transcript

  1. Machine learning helps many companies • Match candidates and vacancies

    • Recommend customer treatments to call center agents • Pick the right time for airplane maintenance
  2. Machine learning helps many companies • Match candidates and vacancies

    • Recommend customer treatments to call center agents • Pick the right time for airplane maintenance
  3. Machine learning helps many companies • Match candidates and vacancies

    • Recommend customer treatments to call center agents • Pick the right time for airplane maintenance
  4. Machine learning requires many capabilities Statistics Algorithms Math Scripting Programming

    CI/CD Feature engineering Data plumbing Monitoring Stability Reliability Infrastructure Business value Compliance and Risk Integration Process Domain knowledge
  5. Machine learning requires many capabilities Statistics Algorithms Math Scripting Programming

    CI/CD Feature engineering Data plumbing Monitoring Stability Reliability Infrastructure Business value Compliance and Risk Integration Process Domain knowledge
  6. Machine learning requires many capabilities Statistics Algorithms Math Scripting Programming

    CI/CD Data plumbing Monitoring Stability Reliability Infrastructure Programming CI/CD Feature engineering Data plumbing Monitoring Stability Reliability Infrastructure Business value Compliance and Risk Integration Process Domain knowledge
  7. Machine learning requires many capabilities Statistics Algorithms Math Scripting Programming

    CI/CD Feature engineering Data plumbing Monitoring Stability Reliability Infrastructure Statistics Algorithms Math Scripting Monitoring Stability Reliability Infrastructure Business value Compliance and Risk Integration Process Domain knowledge
  8. Machine learning requires many capabilities Statistics Algorithms Math Scripting Programming

    CI/CD Feature engineering Data plumbing Monitoring Stability Reliability Infrastructure Statistics Algorithms Math Scripting Programming CI/CD Data plumbing Business value Compliance and Risk Integration Process Domain knowledge
  9. The machine learning platform • The platform is a service

    built by ING engineers, for ING engineers and data scientists • The platform is responsible for hosting machine learning models, and orchestrating them
  10. The machine learning platform • Orchestrator as an interface between

    models and outside world • Kafka for streaming data • REST API for real-time and batch
  11. • Allow teams to fully embrace the power and capabilities

    of machine learning • Only software engineers, data scientists and product owners are responsible The machine learning platform
  12. A journey: the feedback use case • ING gathers user

    feedback • Feedback covers many categories
  13. • Automatically categorize user feedback using machine learning • Establish

    business goals Inception of the idea Product owner
  14. • Data scientists iteratively build a model • Incorporate feedback

    from product owner • Configure the model according to platform standards Building the model Data scientist
  15. • Data scientist and software engineer agree on configuration of

    the model • Software engineer builds interface for consuming the platform • Data scientist deploys model on pipeline Packaging the model Software engineer & data scientist
  16. • Automated pipelines deploy the model on the platform •

    Technologies such as Ansible, Docker and Gitlab CI • No actions are required by the platform team The packaged model Platform
  17. The productionized model • Once in production, data scientists, software

    engineers and product owners can monitor their model • The model can be changed iteratively and quickly deployed again
  18. The productionized model, versioning • Models are tracked by their

    version number • Using historic data we can track the performance of the models
  19. Centralized teams should handle cross-cutting concerns ING’s platform team provides

    machine learning deployment as a service Allowing data scientists to easily and quickly deploy models makes machine learning boring Key takeaways