Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Scaling Machine Learning at Holiday Extras (Big...
Search
Rebecca Vickery
November 13, 2019
Technology
0
130
Scaling Machine Learning at Holiday Extras (Big Data LDN 2019))
Rebecca Vickery
November 13, 2019
Tweet
Share
More Decks by Rebecca Vickery
See All by Rebecca Vickery
Pair Programming with AI
rebeccavickery
1
88
Machine Learning for Everyone
rebeccavickery
0
23
Data Preparation and the Importance of How Machines Learn
rebeccavickery
0
150
Scaling_Machine_Learning_at_Holiday_Extras_-_MUC.pdf
rebeccavickery
0
1.2k
Gender Bias, Why we Need More Women in Tech
rebeccavickery
0
1.2k
The Fastest Way to Learn Data Science
rebeccavickery
0
54
Employing Google Cloud Machine Learning Engine to Develop Models in Production
rebeccavickery
0
1.3k
Other Decks in Technology
See All in Technology
スマートファクトリーの第一歩 〜AWSマネージドサービスで 実現する予知保全と生成AI活用まで
ganota
2
220
2つのフロントエンドと状態管理
mixi_engineers
PRO
3
110
人工衛星のファームウェアをRustで書く理由
koba789
15
8k
Generative AI Japan 第一回生成AI実践研究会「AI駆動開発の現在地──ブレイクスルーの鍵を握るのはデータ領域」
shisyu_gaku
0
280
下手な強制、ダメ!絶対! 「ガードレール」を「檻」にさせない"ガバナンス"の取り方とは?
tsukaman
2
450
Webアプリケーションにオブザーバビリティを実装するRust入門ガイド
nwiizo
7
850
RSCの時代にReactとフレームワークの境界を探る
uhyo
10
3.4k
いま注目のAIエージェントを作ってみよう
supermarimobros
0
310
Firestore → Spanner 移行 を成功させた段階的移行プロセス
athug
1
490
DroidKaigi 2025 Androidエンジニアとしてのキャリア
mhidaka
2
330
COVESA VSSによる車両データモデルの標準化とAWS IoT FleetWiseの活用
osawa
1
290
Rustから学ぶ 非同期処理の仕組み
skanehira
1
140
Featured
See All Featured
10 Git Anti Patterns You Should be Aware of
lemiorhan
PRO
656
61k
GraphQLとの向き合い方2022年版
quramy
49
14k
Unsuck your backbone
ammeep
671
58k
Site-Speed That Sticks
csswizardry
10
820
Dealing with People You Can't Stand - Big Design 2015
cassininazir
367
27k
Scaling GitHub
holman
463
140k
Balancing Empowerment & Direction
lara
3
620
Raft: Consensus for Rubyists
vanstee
140
7.1k
For a Future-Friendly Web
brad_frost
180
9.9k
The Illustrated Children's Guide to Kubernetes
chrisshort
48
50k
CoffeeScript is Beautiful & I Never Want to Write Plain JavaScript Again
sstephenson
162
15k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
112
20k
Transcript
Scaling Machine Learning at Holiday Extras REBECCA VICKERY | DATA
SCIENTIST @vickdata
Travel planning is time consuming Airport parking Airport hotels Airport
lounges Travel insurance Holiday money Port products Car hire Airport transfers 582 minutes Over 46 days* Travel Planning *Facebook commissioned consumer research company GfK
Optimising consumer decision making Airport parking Airport hotels Airport lounges
Travel insurance Holiday money Port products Car hire Airport transfers Less Hassle. More Holiday Trip recommendations
Automated bidding Ad targeting Channel optimisation 1 Ad spend 2
Commercial 3 Customer Experience 4 Marketing Lots of other processes to optimise Automated pricing Allocation Revenue optimisation Automated call handling Personalised experiences Intelligent messaging Optimise send frequency
How to scale Use Cases and Buy in (Input Team
Deployment
How to scale Use Cases and Buy in (Input Team
Deployment
“Ideas are worth nothing unless executed”, Derek Sivers
Deploying machine learning is hard Scaling is even harder
Tools - Data Scientists Open source Lack Software Development expertise
Mainly Python c Flaticon
Tools - Software Engineers Different tools Lack ML/Data expertise Mainly
Javascript c Flaticon
Data science process The wrong kind of independence c Flaticon
People Small data science team Science + software experts are
rare c Flaticon
Two types of deployment
Bespoke Solutions “Ideas are worth nothing unless executed”, Derek Sivers
c Daniel Moyo
Unused Models Many models never make it to production “Ideas
are worth nothing unless executed”, Derek Sivers
Time to model deployment Model development = days to weeks
Model deployment = weeks to never! “Ideas are worth nothing unless executed”, Derek Sivers
The technology
c Flaticon init.py task.py setup.py model.py Model Package
Repeatable, Reusable Process init.py task.py setup.py model.py Model Package
Data transformations Scikit-learn pipelines + custom transformers Transformation occurs in
the model
Solution for other libraries too Add preprocess file to the
package Image taken from Google Cloud documentation
Further customisation Custom scoring Custom prediction routines
None
Faster time to production c flaticon Fully Managed service
Not Quite!
Collaborative Project
ML Proxy (bespoke ML microservice)
Model Versioning
Monitoring - Model Performance
Monitoring - AI Platform Performance
Time to model deployment Model development = days to weeks
Model deployment = hours to days “Ideas are worth nothing unless executed”, Derek Sivers
How to scale Use Cases and Buy in (Input Team
Deployment
The right kind of independence c flaticon Data Scientists have
full ownership over models
The right kind of independence c flaticon Data scientists work
closely together
The right kind of independence c flaticon But they also
work closely with other teams
Use cases and buy in c flaticon Focus on problems
to solve
Use cases and buy in c flaticon Don’t start in
the highest value area
Use cases and buy in Deploy a first version (not
the best) as fast as possible
Test and learn Photo by Alex Kondratiev on Unsplash Use
cases and buy in
Thank you @vickdata