Rebecca Vickery
November 13, 2019
Scaling Machine Learning at Holiday Extras (Big Data LDN 2019)
Transcript
Scaling Machine Learning at Holiday Extras. Rebecca Vickery | Data Scientist | @vickdata
Travel planning is time consuming: 582 minutes of travel planning over 46 days*. Airport parking, airport hotels, airport lounges, travel insurance, holiday money, port products, car hire, airport transfers.
*Research by consumer research company GfK, commissioned by Facebook.
Optimising consumer decision making: trip recommendations. Airport parking, airport hotels, airport lounges, travel insurance, holiday money, port products, car hire, airport transfers. Less Hassle. More Holiday.
Lots of other processes to optimise:
1. Ad spend: automated bidding, ad targeting, channel optimisation
2. Commercial: automated pricing, allocation, revenue optimisation
3. Customer experience: automated call handling, personalised experiences
4. Marketing: intelligent messaging, optimise send frequency
How to scale: Use cases and buy-in, Team, Deployment
“Ideas are worth nothing unless executed”, Derek Sivers
Deploying machine learning is hard. Scaling is even harder.
Tools - Data Scientists: open source, mainly Python, lack software development expertise. (Icon © Flaticon)
Tools - Software Engineers: different tools, mainly JavaScript, lack ML/data expertise.
Data science process: the wrong kind of independence.
People: small data science team. Science + software experts are rare.
Two types of deployment
Bespoke Solutions
© Daniel Moyo
Unused Models: many models never make it to production.
Time to model deployment: model development = days to weeks; model deployment = weeks to never!
The technology
Model Package: init.py, task.py, setup.py, model.py
Repeatable, Reusable Process. Model Package: init.py, task.py, setup.py, model.py
Data transformations: Scikit-learn pipelines + custom transformers. Transformation occurs in the model.
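The slide above does not show the pipeline itself, so the following is only a minimal sketch of the idea, assuming scikit-learn and NumPy are installed. The `LogTransformer` class, the toy data and the choice of `LogisticRegression` are hypothetical and are not taken from the talk. The point it illustrates is that when the custom transformer is a step of the `Pipeline`, the fitted pipeline object carries its own preprocessing, so raw features can be passed straight to `predict`.

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler


class LogTransformer(BaseEstimator, TransformerMixin):
    """Hypothetical custom transformer: applies log(1 + x) to every feature."""

    def fit(self, X, y=None):
        # Stateless transformer, so there is nothing to learn from the data.
        return self

    def transform(self, X):
        return np.log1p(np.asarray(X, dtype=float))


def build_pipeline():
    # Preprocessing steps and the estimator live in a single object.
    return Pipeline([
        ("log", LogTransformer()),
        ("scale", StandardScaler()),
        ("clf", LogisticRegression()),
    ])


# Toy, made-up training data: two features, two well-separated classes.
X_train = np.array([[1.0, 10.0], [2.0, 20.0], [50.0, 500.0], [60.0, 600.0]])
y_train = np.array([0, 0, 1, 1])

model = build_pipeline().fit(X_train, y_train)

# Raw, untransformed features go in; the pipeline transforms them internally.
predictions = model.predict(np.array([[1.5, 15.0], [55.0, 550.0]]))
```

Because the whole pipeline is one estimator, it can be serialised and deployed as a single artefact, which is one way to read "transformation occurs in the model".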
Solution for other libraries too: add a preprocess file to the package. (Image taken from Google Cloud documentation.)
Further customisation: custom scoring, custom prediction routines.
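As a rough illustration of a custom prediction routine, the sketch below follows the general shape of the predictor interface described in the Google Cloud AI Platform documentation: a `predict(instances, **kwargs)` method plus a `from_path(model_dir)` class method. Everything else is an assumption made for the example: the class name `MyPredictor`, the `model.pkl` file name, the optional `preprocessor` callable and the toy `SumModel` are hypothetical and do not come from the talk.

```python
import os
import pickle


class SumModel:
    """Toy stand-in for a trained model (hypothetical): sums each input row."""

    def predict(self, rows):
        return [sum(row) for row in rows]


class MyPredictor:
    """Sketch of a predictor in the style of a custom prediction routine."""

    def __init__(self, model, preprocessor=None):
        self._model = model
        self._preprocessor = preprocessor

    def predict(self, instances, **kwargs):
        # Apply optional preprocessing before calling the model.
        inputs = instances
        if self._preprocessor is not None:
            inputs = self._preprocessor(inputs)
        outputs = self._model.predict(inputs)
        # Return plain floats so the response is JSON serialisable.
        return [float(value) for value in outputs]

    @classmethod
    def from_path(cls, model_dir):
        # Assumption: the model was pickled as "model.pkl" in model_dir.
        # Only unpickle files from a trusted source.
        with open(os.path.join(model_dir, "model.pkl"), "rb") as handle:
            model = pickle.load(handle)
        return cls(model)


# Local usage with an in-memory model and a hypothetical doubling preprocessor.
predictor = MyPredictor(
    SumModel(),
    preprocessor=lambda rows: [[2 * value for value in row] for row in rows],
)
results = predictor.predict([[1, 2], [3, 4]])
```

Keeping preprocessing and any custom scoring logic inside the predictor means callers only ever send raw instances, which fits the earlier slide's point that transformation occurs in the model.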
Faster time to production: fully managed service.
Not Quite!
Collaborative Project
ML Proxy (bespoke ML microservice)
Model Versioning
Monitoring - Model Performance
Monitoring - AI Platform Performance
Time to model deployment: model development = days to weeks; model deployment = hours to days.
How to scale: Use cases and buy-in, Team, Deployment
The right kind of independence: data scientists have full ownership over models.
The right kind of independence: data scientists work closely together.
The right kind of independence: but they also work closely with other teams.
Use cases and buy-in: focus on problems to solve.
Use cases and buy-in: don't start in the highest value area.
Use cases and buy-in: deploy a first version (not the best) as fast as possible.
Use cases and buy-in: test and learn. (Photo by Alex Kondratiev on Unsplash)
Thank you @vickdata