Rebecca Vickery
November 13, 2019
Scaling Machine Learning at Holiday Extras (Big Data LDN 2019)
Transcript
Scaling Machine Learning at Holiday Extras. Rebecca Vickery | Data Scientist | @vickdata
Travel planning is time consuming: airport parking, airport hotels, airport lounges, travel insurance, holiday money, port products, car hire, airport transfers. Travel planning takes 582 minutes over 46 days*. (*Research by consumer research company GfK, commissioned by Facebook.)
Optimising consumer decision making: airport parking, airport hotels, airport lounges, travel insurance, holiday money, port products, car hire, airport transfers. Less Hassle. More Holiday. Trip recommendations.
Lots of other processes to optimise:
1. Ad spend: automated bidding, ad targeting, channel optimisation
2. Commercial: automated pricing, allocation, revenue optimisation
3. Customer experience: automated call handling, personalised experiences
4. Marketing: intelligent messaging, optimise send frequency
How to scale: Use cases and buy in, Team, Deployment
“Ideas are worth nothing unless executed”, Derek Sivers
Deploying machine learning is hard. Scaling is even harder.
Tools - Data Scientists: open source, mainly Python, lack software development expertise. (Icons © Flaticon)
Tools - Software Engineers: different tools, mainly JavaScript, lack ML/data expertise.
Data science process: the wrong kind of independence.
People: small data science team. Science + software experts are rare.
Two types of deployment
Bespoke solutions (image © Daniel Moyo)
Unused models: many models never make it to production.
Time to model deployment: model development = days to weeks; model deployment = weeks to never!
The technology
Model package: init.py, task.py, setup.py, model.py
Repeatable, reusable process. Model package: init.py, task.py, setup.py, model.py
Data transformations: scikit-learn pipelines + custom transformers. Transformation occurs in the model.
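A minimal sketch of what "transformation occurs in the model" could look like with a scikit-learn pipeline. The transformer name `LogLeadTime`, the column layout, and the toy data are hypothetical and do not come from the talk; the point is only that preprocessing is fitted and saved as part of the model object, so raw features can be sent at prediction time.

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler


class LogLeadTime(BaseEstimator, TransformerMixin):
    """Hypothetical custom transformer: log-scales one numeric column."""

    def __init__(self, column=0):
        self.column = column

    def fit(self, X, y=None):
        # Stateless transformer, so there is nothing to learn here.
        return self

    def transform(self, X):
        X = np.asarray(X, dtype=float).copy()
        X[:, self.column] = np.log1p(X[:, self.column])
        return X


def build_pipeline():
    # Preprocessing steps and the estimator live in one object,
    # so the same transformations run at training and prediction time.
    return Pipeline([
        ("log_lead_time", LogLeadTime(column=0)),
        ("scale", StandardScaler()),
        ("clf", LogisticRegression()),
    ])


# Toy data (assumed): column 0 is a lead time in days, column 1 a party size.
X_train = np.array([[1, 2], [30, 1], [3, 4], [90, 2], [7, 1], [120, 3]])
y_train = np.array([0, 1, 0, 1, 0, 1])

model = build_pipeline().fit(X_train, y_train)
predictions = model.predict(np.array([[2, 2], [100, 2]]))
```

Because the transformer is a pipeline step, serialising `model` also serialises the preprocessing, which is one way to keep the deployed artefact self-contained.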
Solution for other libraries too: add a preprocess file to the package. (Image taken from Google Cloud documentation)
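For libraries without a pipeline abstraction, a preprocess file might hold a small stateful class like the sketch below. The class name `MeanCenterer` and its behaviour (learn column means on first use, reuse them afterwards) are assumptions for illustration, not the contents of the file shown in the talk.

```python
import numpy as np


class MeanCenterer:
    """Hypothetical preprocess.py content: centres features on training means."""

    def __init__(self):
        self._means = None

    def preprocess(self, data):
        data = np.asarray(data, dtype=float)
        if self._means is None:
            # First call is assumed to be on training data: remember the means.
            self._means = data.mean(axis=0)
        return data - self._means


centerer = MeanCenterer()
train_centered = centerer.preprocess([[1.0, 2.0], [3.0, 4.0]])
serve_centered = centerer.preprocess([[2.0, 3.0]])  # reuses the stored means
```

The fitted object would then be saved alongside the model so that prediction code applies exactly the same statistics.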
Further customisation: custom scoring, custom prediction routines.
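A sketch of a custom prediction routine, assuming the interface described in the Google Cloud documentation for AI Platform (a class exposing `predict(instances, **kwargs)` and a `from_path(model_dir)` classmethod). The class name `ProbabilityPredictor`, the `model.pkl` file name, the `threshold` keyword, and the toy model are all hypothetical.

```python
import os
import pickle
import tempfile

import numpy as np
from sklearn.linear_model import LogisticRegression


class ProbabilityPredictor:
    """Hypothetical predictor returning a label plus a custom score."""

    def __init__(self, model):
        self._model = model

    def predict(self, instances, **kwargs):
        inputs = np.asarray(instances, dtype=float)
        probabilities = self._model.predict_proba(inputs)
        # Assumed keyword argument: decision threshold for the positive class.
        threshold = kwargs.get("threshold", 0.5)
        return [
            {"label": int(p[1] >= threshold), "score": float(p[1])}
            for p in probabilities
        ]

    @classmethod
    def from_path(cls, model_dir):
        # Load the pickled model from the directory the service provides.
        with open(os.path.join(model_dir, "model.pkl"), "rb") as f:
            return cls(pickle.load(f))


# Local round trip: train a toy model, save it, load it through from_path.
with tempfile.TemporaryDirectory() as model_dir:
    X = np.array([[0.0], [1.0], [2.0], [3.0]])
    y = np.array([0, 0, 1, 1])
    with open(os.path.join(model_dir, "model.pkl"), "wb") as f:
        pickle.dump(LogisticRegression().fit(X, y), f)
    predictor = ProbabilityPredictor.from_path(model_dir)
    results = predictor.predict([[0.5], [2.5]], threshold=0.5)
```

Returning a score next to the label is one way to read "custom scoring"; the talk does not say which scoring logic was actually used.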
Faster time to production: fully managed service.
Not Quite!
Collaborative Project
ML Proxy (bespoke ML microservice)
Model Versioning
Monitoring - Model Performance
Monitoring - AI Platform Performance
Time to model deployment: model development = days to weeks; model deployment = hours to days.
How to scale: Use cases and buy in, Team, Deployment
The right kind of independence: data scientists have full ownership over models.
The right kind of independence: data scientists work closely together.
The right kind of independence: but they also work closely with other teams.
Use cases and buy in: focus on problems to solve.
Use cases and buy in: don't start in the highest value area.
Use cases and buy in: deploy a first version (not the best) as fast as possible.
Use cases and buy in: test and learn. (Photo by Alex Kondratiev on Unsplash)
Thank you @vickdata