EuroPython 2017: How Booking.com serves Deep Learning Predictions at Large Scale by Sahil Dua

@sahildua2305 How Booking.com serves Deep Learning Predictions at Large Scale.
Sahil Dua

@sahildua2305 I am ... ➔ Backend Developer developing Deep Learning
Infrastructure ➔ Machine Learning Enthusiast ➔ Open Source Contributor (Git, Pandas, Kinto, go-github, etc.) ➔ Tech Speaker I am not ... ➔ A Data Scientist ➔ A Machine Learning Expert

@sahildua2305 Agenda ➔ Applications of Deep Learning ➔ Life-cycle of
a model ➔ Our Deep Learning Production Pipeline

@sahildua2305 Applications of Deep Learning at Booking.com

@sahildua2305 1.3 million+ active properties in 220+ countries 1,200,000+ room
nights booked every 24 hours Scale highlights.

@sahildua2305 Image Tagging

@sahildua2305 Image Tagging Sea view: 6.38 Balcony/Terrace: 4.82 Photo of
the whole room: 4.21 Bed: 3.47 Decorative details: 3.15 Seating area: 2.70

@sahildua2305

@sahildua2305 Image Tagging Using the image tag information in the
right context Swimming pool, Breakfast Buffet, etc.

@sahildua2305 Recommendation Engine User X booked hotel Y User Z
... ? Objective: Find probability of booking a hotel User Features: country, language, etc. Contextual Features: day of week, season, etc. Item Features: price, location of the hotel, etc.

@sahildua2305 Applications of Deep Learning at Booking.com

@sahildua2305 Lifecycle of a model

@sahildua2305 Deploy Lifecycle of a model Train Code

@sahildua2305 Training a Model - on laptop

@sahildua2305 Training a Model Server Training

@sahildua2305 Training a Model Server Training GPU support

@sahildua2305 Training a Model

@sahildua2305 Training a Model Training Data

@sahildua2305 Training a Model Model Checkpoints Training

@sahildua2305 Training a Model

@sahildua2305 Deploying a Model ➔ Python app running in container
➔ Model weights from Hadoop storage ➔ Loads model in memory ➔ Get a nice URL to get predictions

@sahildua2305 Deploying a Model App Client Input Features Prediction

@sahildua2305 Deploying a Model App App App App App App
Load Balancer Input Features Prediction Client

@sahildua2305 Deploying a Model Load Balancer Input Features Prediction Client

@sahildua2305 Deploying a Model Load Balancer Client Input Features Prediction

@sahildua2305 Performance PredictionTime = RequestOverhead + N*ComputationTime N is the
number of instances to predict on

@sahildua2305 Optimizing for Latency ➔ Do not predict if you
can precompute ➔ Reduce Request Overhead ➔ Predict for one instance ➔ Quantization (float 32 => fixed 8) ➔ TensorFlow specific: freeze network & optimize for inference

@sahildua2305 Optimizing for Throughput ➔ Do not predict if you
can precompute ➔ Batch requests ➔ Parallelize requests

@sahildua2305 Summary

@sahildua2305 Summary ➔ Training models in containers ➔ Serving models
from containers using Kubernetes ➔ Optimizing serving for latency/throughput

http://workingatbooking.com We are hiring! Roles • Software Developer • Data
Scientist • ... Work with • MapReduce • Spark • Recommender Systems • NLP • ...

@sahildua2305 Want to get in touch? LinkedIn @sahildua2305 GitHub Twitter
@sahildua2305 @sahildua2305 Website www.sahildua.com

@sahildua2305 THANK YOU @sahildua2305

EuroPython 2017: How Booking.com serves Deep Le...

EuroPython 2017: How Booking.com serves Deep Learning Predictions at Large Scale by Sahil Dua

More Decks by Sahil Dua

Other Decks in Technology

Featured

Transcript