Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Practical DevOps for the busy data scientist
Search
Tania Allard
June 27, 2019
Technology
0
370
Practical DevOps for the busy data scientist
Tania Allard
June 27, 2019
Tweet
Share
More Decks by Tania Allard
See All by Tania Allard
2024_pydata_lndn.pdf
trallard
1
220
The RSE hiring and career progression pipelines: Top tips to navigate them efficiently
trallard
0
260
Mentored Sprints - 2023
trallard
0
230
Mentored Sprints 2022 - kickoff
trallard
3
280
Como participar en el mercado emergente del codigo abierto
trallard
4
300
El presente y futuro del computo cientifico con Python
trallard
0
260
Foss for fun and profit
trallard
3
340
Open source for fun and profit: rethinking the long road of sustainability.
trallard
0
190
Docker and Python: making them play nicely and securely for Ml and DS
trallard
1
640
Other Decks in Technology
See All in Technology
The Future of SEO: The Impact of AI on Search
badams
0
200
Data-centric AI入門第6章:Data-centric AIの実践例
x_ttyszk
1
410
2.5Dモデルのすべて
yu4u
2
870
Oracle Cloud Infrastructure:2025年2月度サービス・アップデート
oracle4engineer
PRO
1
220
エンジニアの育成を支える爆速フィードバック文化
sansantech
PRO
3
1.1k
バックエンドエンジニアのためのフロントエンド入門 #devsumiC
panda_program
18
7.5k
抽象化をするということ - 具体と抽象の往復を身につける / Abstraction and concretization
soudai
19
7.5k
Helm , Kustomize に代わる !? 次世代 k8s パッケージマネージャー Glasskube 入門 / glasskube-entry
parupappa2929
0
250
CZII - CryoET Object Identification 参加振り返り・解法共有
tattaka
0
380
SA Night #2 FinatextのSA思想/SA Night #2 Finatext session
satoshiimai
1
140
プロセス改善による品質向上事例
tomasagi
2
2.6k
エンジニアのためのドキュメント力基礎講座〜構造化思考から始めよう〜(2025/02/15jbug広島#15発表資料)
yasuoyasuo
17
6.8k
Featured
See All Featured
Facilitating Awesome Meetings
lara
52
6.2k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
330
21k
実際に使うSQLの書き方 徹底解説 / pgcon21j-tutorial
soudai
174
51k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
129
19k
A Philosophy of Restraint
colly
203
16k
Unsuck your backbone
ammeep
669
57k
Large-scale JavaScript Application Architecture
addyosmani
511
110k
Measuring & Analyzing Core Web Vitals
bluesmoon
6
240
Java REST API Framework Comparison - PWX 2021
mraible
28
8.4k
Scaling GitHub
holman
459
140k
What’s in a name? Adding method to the madness
productmarketing
PRO
22
3.3k
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
251
21k
Transcript
Tania Allard, PhD @ixek Developer Advocate @Microsoft Practical DevOps for
the busy Data Scientist http://bit.ly/MancML-trallard
2 A bit of background never hurt anyone About us
3 @ixek
4 @ixek
5 Top top view… @ixek Stable model/application ready to be
productised R&D - develop, iterate fast, usually local or cloud Magic Is it live??
6 How I would like everything to work…. @ixek It
works…. now send it over to production R&D - develop, iterate fast, usually local or cloud Push code, tag, tag data* Worry free deployment! Wait and relax
7 @ixek
8
9 @ixek DevOps / DataOps / MLOps
10 DevOps is the union of people, process, and products
to enable continuous delivery of value into production What is DevOps anyway? @ixek
11 Sort of DevOps applied to data-intensive applications. Requires close
collaboration between engineers, data scientists, architects, data engineers and Ops. How does it fit for DS? @ixek
12 @ixek Aims to reduce the end-to-end cycle time of
data analytics/science from the origin of ideas to the creation of data artifacts.
13
14
15 7 steps to DS
16 Keep everything in source control - but allow for
experimentation
17
18 Standardize and define your environments in code (conda, pipfiles,
Docker)
19 Use canonical data sources - always know what data
you are using (where it comes and goes)
20
21 Automate wisely
22 https://xkcd.com/1205/
23
24 Use pipelines for repeatability and explainability
25 Deploy portable models
26
27 Test continuously and monitor production: shift left
28
29 Thank you @ixek http://bit.ly/MancML-trallard