Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Practical DevOps for the busy data scientist

Practical DevOps for the busy data scientist

Tania Allard

October 09, 2019
Tweet

More Decks by Tania Allard

Other Decks in Programming

Transcript

  1. What you’ll learn 01 02 Why MLOps/ DevOps ? Who

    is responsible? 03 04 Getting started Getting from A to B
  2. DevOps is the union of people, process, and products to

    enable continuous delivery of value into production - Donovan Brown What is devops @ixek bit.ly/PyConDE-mlops
  3. MlOps Aims to reduce the end-to-end cycle time and friction

    of data analytics/science from the origin of ideas to the creation of data artifacts. What is devops @ixek bit.ly/PyConDE-mlops
  4. But I do not work in a big company with

    many ML engineers @ixek bit.ly/PyConDE-mlops
  5. Your saviour Source control • Code and comments only (not

    Jupyter output) • Plus every part of the pipeline • And Infrastructure and dependencies • And maybe a subset of data @ixek bit.ly/PyConDE-mlops
  6. Everything should be in source control!! Except your training data

    which should be a known, shared data source Do not touch the raw data! Not even with a stick Your saviour @ixek bit.ly/PyConDE-mlops
  7. Adopt automation • Orchestration for Continuous Integration and Continuous Delivery

    • Gates, tasks, and processes for quality • Integration with other services • Triggers on code and non-code events @ixek bit.ly/PyConDE-mlops
  8. In brief Deterministic environments Use pipelines Continuous integration and delivery

    Source control (done right) Code, infrastructure, everything! Ensure production readiness For repeatable workflows Detect errors early and seamless deployments @ixek bit.ly/PyConDE-mlops