Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Practical DevOps for the busy data scientist
Search
Tania Allard
October 09, 2019
Programming
1
790
Practical DevOps for the busy data scientist
Tania Allard
October 09, 2019
Tweet
Share
More Decks by Tania Allard
See All by Tania Allard
The RSE hiring and career progression pipelines: Top tips to navigate them efficiently
trallard
0
140
Mentored Sprints - 2023
trallard
0
140
Mentored Sprints 2022 - kickoff
trallard
3
200
Como participar en el mercado emergente del codigo abierto
trallard
4
200
El presente y futuro del computo cientifico con Python
trallard
0
180
Foss for fun and profit
trallard
3
240
Open source for fun and profit: rethinking the long road of sustainability.
trallard
0
120
Docker and Python: making them play nicely and securely for Ml and DS
trallard
1
530
Python and Docker for ML and Data Science
trallard
1
360
Other Decks in Programming
See All in Programming
デフォルトにして至高、RubyMineの大好きな所
ruzia
0
230
MetricKitで予期せぬ終了を検知する話 / Detect unexpected termination with MetricKit
nekowen
0
180
見た目から始める生産性向上
ikumatadokoro
7
780
Folding Cheat Sheet #3
philipschwarz
PRO
0
120
PHP8.3の機能を振り返る / Review of PHP 8.3 features
seike460
PRO
1
110
Komplexe Oberflächen mit SVG und der Web Animation API
joergneumann
0
670
スキーマ駆動開発による品質とスピードの両立 - 私達は何故、スキーマを書くのか
kentaroutakeda
0
160
チーム力を高めるスクラム実践法:カンバン公開と課題攻略について - ニフティのスクラムトーク Vol. 2 - NIFTY Tech Talk #18
niftycorp
PRO
1
110
⼤規模⾔語モデルの拡張(RAG)が 終わったかも知れない件について
nearme_tech
23
15k
R言語の環境構築と基礎 Tokyo.R 112
bob3bob3
0
260
使ってみよう Azure AI Document Intelligence
kosmosebi
2
270
Micro Frontends for Java Microservices - Devnexus 2024
mraible
PRO
0
480
Featured
See All Featured
Designing for Performance
lara
601
67k
Intergalactic Javascript Robots from Outer Space
tanoku
266
26k
Designing for humans not robots
tammielis
248
25k
Automating Front-end Workflow
addyosmani
1355
200k
How GitHub Uses GitHub to Build GitHub
holman
468
290k
For a Future-Friendly Web
brad_frost
171
8.9k
Learning to Love Humans: Emotional Interface Design
aarron
266
39k
Six Lessons from altMBA
skipperchong
20
3k
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
273
13k
Atom: Resistance is Futile
akmur
258
25k
Building Flexible Design Systems
yeseniaperezcruz
318
37k
Designing on Purpose - Digital PM Summit 2013
jponch
110
6.5k
Transcript
Practical DevOps for the busy data Scientist
bit.ly/PyConDE-mlops Slides
What you’ll learn 01 02 Why MLOps/ DevOps ? Who
is responsible? 03 04 Getting started Getting from A to B
About Me
Software engineering Algorithm Data Answers @ixek bit.ly/PyConDE-mlops
Machine learning Answers Data Algorithm @ixek bit.ly/PyConDE-mlops
Machine learning Answers Data Model @ixek bit.ly/PyConDE-mlops @ixek bit.ly/PyConDE-mlops
Machine learning Answers Data Model Answers Predictions @ixek bit.ly/PyConDE-mlops
The data cycle Magic? R&D Generation @ixek bit.ly/PyConDE-mlops
Anyone? @ixek bit.ly/PyConDE-mlops
A common scenario @ixek bit.ly/PyConDE-mlops
@ixek bit.ly/PyConDE-mlops
If you had one wish? @ixek bit.ly/PyConDE-mlops
Replacing the magic ML Ops and robust pipelines R&D Generation
@ixek bit.ly/PyConDE-mlops
How skills are perceived @ixek bit.ly/PyConDE-mlops
Better @ixek bit.ly/PyConDE-mlops
How they really are @ixek bit.ly/PyConDE-mlops
DevOps is the union of people, process, and products to
enable continuous delivery of value into production - Donovan Brown What is devops @ixek bit.ly/PyConDE-mlops
MlOps Aims to reduce the end-to-end cycle time and friction
of data analytics/science from the origin of ideas to the creation of data artifacts. What is devops @ixek bit.ly/PyConDE-mlops
But I do not work in a big company with
many ML engineers @ixek bit.ly/PyConDE-mlops
Build your own MLOps Platform @ixek bit.ly/PyConDE-mlops
None
None
Practical steps @ixek bit.ly/PyConDE-mlops
We have the notebooks in source control @ixek bit.ly/PyConDE-mlops
Your saviour Source control • Code and comments only (not
Jupyter output) • Plus every part of the pipeline • And Infrastructure and dependencies • And maybe a subset of data @ixek bit.ly/PyConDE-mlops
Everything should be in source control!! Except your training data
which should be a known, shared data source Do not touch the raw data! Not even with a stick Your saviour @ixek bit.ly/PyConDE-mlops
Deterministic environments @ixek bit.ly/PyConDE-mlops
Whatever that environment is @ixek bit.ly/PyConDE-mlops
Your laptop is not a production environment… so ensure reproducibility
@ixek bit.ly/PyConDE-mlops
@ixek bit.ly/PyConDE-mlops
Use pipelines for repeatability and reproducibility @ixek bit.ly/PyConDE-mlops
ml.azure.com
@ixek bit.ly/PyConDE-mlops
@ixek bit.ly/PyConDE-mlops
Automate wisely @ixek bit.ly/PyConDE-mlops
Adopt automation • Orchestration for Continuous Integration and Continuous Delivery
• Gates, tasks, and processes for quality • Integration with other services • Triggers on code and non-code events @ixek bit.ly/PyConDE-mlops
Complete pipeline @ixek bit.ly/PyConDE-mlops
Kubeflow example https://www.kubeflow.org/docs/azure/azureendtoend/ @ixek bit.ly/PyConDE-mlops
Build pipeline- https://azure.microsoft.com/en-us/services/devops/https://azure.microsoft.com/e n-us/services/devops/
Code event trigger @ixek bit.ly/PyConDE-mlops
Release / deploy @ixek bit.ly/PyConDE-mlops
In brief Deterministic environments Use pipelines Continuous integration and delivery
Source control (done right) Code, infrastructure, everything! Ensure production readiness For repeatable workflows Detect errors early and seamless deployments @ixek bit.ly/PyConDE-mlops
Want to learn more? • ml.azure.com • https://azure.microsoft.com/en-us/services/devops/ • https://docs.microsoft.com/en-us/azure/machine-learning/ser
vice/concept-ml-pipelines @ixek bit.ly/PyConDE-mlops
Come talk to us! @ ixek
[email protected]