Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Scaling Machine Learning at Holiday Extras (Big...
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
Rebecca Vickery
November 13, 2019
Technology
130
0
Share
Scaling Machine Learning at Holiday Extras (Big Data LDN 2019))
Rebecca Vickery
November 13, 2019
More Decks by Rebecca Vickery
See All by Rebecca Vickery
Pair Programming with AI
rebeccavickery
1
100
Machine Learning for Everyone
rebeccavickery
0
27
Data Preparation and the Importance of How Machines Learn
rebeccavickery
0
160
Scaling_Machine_Learning_at_Holiday_Extras_-_MUC.pdf
rebeccavickery
0
1.2k
Gender Bias, Why we Need More Women in Tech
rebeccavickery
0
1.2k
The Fastest Way to Learn Data Science
rebeccavickery
0
57
Employing Google Cloud Machine Learning Engine to Develop Models in Production
rebeccavickery
0
1.3k
Other Decks in Technology
See All in Technology
OpenClaw初心者向けセミナー / OpenClaw Beginner Seminar
cmhiranofumio
0
250
機能・非機能の学びを一つに!Agent Skillsで月間レポート作成始めてみた / Unifying Bug & Infra Insights — Building Monthly Quality Reports with Agent Skills
bun913
2
620
第26回FA設備技術勉強会 - Claude/Claude_codeでデータ分析 -
happysamurai294
0
360
Why we keep our community?
kawaguti
PRO
0
380
互換性のある(らしい)DBへの移行など考えるにあたってたいへんざっくり
sejima
PRO
0
530
OCI技術資料 : 証明書サービス概要
ocise
1
7.2k
FlutterでPiP再生を実装した話
s9a17
0
240
Bref でサービスを運用している話
sgash708
0
220
Oracle AI Database@AWS:サービス概要のご紹介
oracle4engineer
PRO
3
2.1k
JAWS DAYS 2026でAIの「もやっと」感が解消された話
smt7174
1
120
Databricks Lakehouse Federationで 運用負荷ゼロのデータ連携
nek0128
0
110
Amazon Qはアマコネで頑張っています〜 Amazon Q in Connectについて〜
yama3133
1
170
Featured
See All Featured
CSS Pre-Processors: Stylus, Less & Sass
bermonpainter
360
30k
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
54k
"I'm Feeling Lucky" - Building Great Search Experiences for Today's Users (#IAC19)
danielanewman
231
22k
Unlocking the hidden potential of vector embeddings in international SEO
frankvandijk
0
230
Navigating Algorithm Shifts & AI Overviews - #SMXNext
aleyda
1
1.2k
A Modern Web Designer's Workflow
chriscoyier
698
190k
Test your architecture with Archunit
thirion
1
2.2k
HU Berlin: Industrial-Strength Natural Language Processing with spaCy and Prodigy
inesmontani
PRO
0
300
Rails Girls Zürich Keynote
gr2m
96
14k
BBQ
matthewcrist
89
10k
Producing Creativity
orderedlist
PRO
348
40k
My Coaching Mixtape
mlcsv
0
92
Transcript
Scaling Machine Learning at Holiday Extras REBECCA VICKERY | DATA
SCIENTIST @vickdata
Travel planning is time consuming Airport parking Airport hotels Airport
lounges Travel insurance Holiday money Port products Car hire Airport transfers 582 minutes Over 46 days* Travel Planning *Facebook commissioned consumer research company GfK
Optimising consumer decision making Airport parking Airport hotels Airport lounges
Travel insurance Holiday money Port products Car hire Airport transfers Less Hassle. More Holiday Trip recommendations
Automated bidding Ad targeting Channel optimisation 1 Ad spend 2
Commercial 3 Customer Experience 4 Marketing Lots of other processes to optimise Automated pricing Allocation Revenue optimisation Automated call handling Personalised experiences Intelligent messaging Optimise send frequency
How to scale Use Cases and Buy in (Input Team
Deployment
How to scale Use Cases and Buy in (Input Team
Deployment
“Ideas are worth nothing unless executed”, Derek Sivers
Deploying machine learning is hard Scaling is even harder
Tools - Data Scientists Open source Lack Software Development expertise
Mainly Python c Flaticon
Tools - Software Engineers Different tools Lack ML/Data expertise Mainly
Javascript c Flaticon
Data science process The wrong kind of independence c Flaticon
People Small data science team Science + software experts are
rare c Flaticon
Two types of deployment
Bespoke Solutions “Ideas are worth nothing unless executed”, Derek Sivers
c Daniel Moyo
Unused Models Many models never make it to production “Ideas
are worth nothing unless executed”, Derek Sivers
Time to model deployment Model development = days to weeks
Model deployment = weeks to never! “Ideas are worth nothing unless executed”, Derek Sivers
The technology
c Flaticon init.py task.py setup.py model.py Model Package
Repeatable, Reusable Process init.py task.py setup.py model.py Model Package
Data transformations Scikit-learn pipelines + custom transformers Transformation occurs in
the model
Solution for other libraries too Add preprocess file to the
package Image taken from Google Cloud documentation
Further customisation Custom scoring Custom prediction routines
None
Faster time to production c flaticon Fully Managed service
Not Quite!
Collaborative Project
ML Proxy (bespoke ML microservice)
Model Versioning
Monitoring - Model Performance
Monitoring - AI Platform Performance
Time to model deployment Model development = days to weeks
Model deployment = hours to days “Ideas are worth nothing unless executed”, Derek Sivers
How to scale Use Cases and Buy in (Input Team
Deployment
The right kind of independence c flaticon Data Scientists have
full ownership over models
The right kind of independence c flaticon Data scientists work
closely together
The right kind of independence c flaticon But they also
work closely with other teams
Use cases and buy in c flaticon Focus on problems
to solve
Use cases and buy in c flaticon Don’t start in
the highest value area
Use cases and buy in Deploy a first version (not
the best) as fast as possible
Test and learn Photo by Alex Kondratiev on Unsplash Use
cases and buy in
Thank you @vickdata