Slide 1

Slide 1 text

Scalable, real-time Machine Learning using Apache Kafka

Slide 2

Slide 2 text

Agenda
● Traditional model deployment process
● 90 seconds to WoW
● Let’s process the incoming stream
● Demo
● What’s more?

Slide 3

Slide 3 text

$ whoami
● Personalisation lead at Hotstar
● Led Data Infrastructure team at Grofers and TinyOwl
● Kafka fanboy
● Usually rant on Twitter @jayeshsidhwani

Slide 4

Slide 4 text

Machine Learning @ Hotstar
● ~150 mn users
● 4.8 mn peak concurrency
● 120K peak recommendation requests per second
● Diverse content in diverse languages

Slide 5

Slide 5 text

Traditional model deployment process
● One-day / few-hours batch pre-compute
● Slow time to react
Diagram labels: Model Training, Data Lake, Serialized Model, Batch Predictions, Recommendation APIs (stages marked Offline and Online)
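The batch pattern on this slide can be sketched roughly as follows. This is a minimal illustration, not Hotstar's pipeline: `batch_precompute`, `serve` and `toy_model` are hypothetical names, and the model is a trivial stand-in. It shows why the approach is slow to react: predictions are as old as the last batch run, and a user who arrives after the run has nothing stored.

```python
from datetime import datetime, timedelta


def toy_model(features):
    """Hypothetical stand-in for a trained model: recommend by favourite genre."""
    return [f"top-{features['genre']}-1", f"top-{features['genre']}-2"]


def batch_precompute(model, user_features, run_at):
    """Offline step: score every known user once and stamp the batch time."""
    return {
        user_id: {"items": model(features), "computed_at": run_at}
        for user_id, features in user_features.items()
    }


def serve(predictions, user_id, now):
    """Online step: look up the stored prediction and report how stale it is."""
    entry = predictions.get(user_id)
    if entry is None:
        return None  # user appeared after the batch run, so nothing was pre-computed
    return entry["items"], now - entry["computed_at"]


batch_time = datetime(2019, 1, 1, 2, 0)
table = batch_precompute(toy_model, {"u1": {"genre": "cricket"}}, batch_time)

request_time = batch_time + timedelta(hours=20)
items, age = serve(table, "u1", request_time)  # served, but 20 hours stale
new_user = serve(table, "u2", request_time)    # None: signed up after the batch
```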

Slide 6

Slide 6 text

Sense of urgency?
● 90 seconds to convert a new user
● To power their experience, we need to know the user’s gender, interests and more
● Need an always-thinking machine

Slide 7

Slide 7 text

Thinking streams
Data at rest
● Slow
● Batch-y
Data in motion
● Fast
● Sub-second

Slide 8

Slide 8 text

Enter Apache Kafka
● Kafka is a scalable, fault-tolerant, distributed message queue
● Producers and Consumers
● Uses
  ○ Real-time applications such as intelligent notifications, anomaly detection, etc.
  ○ Asynchronous communication in event-driven architectures
Diagram credits: http://kafka.apache.org
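The producer/consumer idea can be illustrated with a small in-memory stand-in. This is not the Kafka client API: `TopicLog` and `Consumer` are hypothetical classes that only mimic one property of a Kafka topic partition, namely that records are appended to a log and each consumer tracks its own read offset, so several applications can read the same events independently.

```python
class TopicLog:
    """In-memory stand-in for one topic partition: an append-only list of records."""

    def __init__(self):
        self._records = []

    def append(self, record):
        """Producer side: add a record and return its offset."""
        self._records.append(record)
        return len(self._records) - 1

    def read_from(self, offset):
        """Return every record at or after the given offset."""
        return self._records[offset:]


class Consumer:
    """Reads from a log and remembers how far it has read."""

    def __init__(self, log):
        self._log = log
        self.offset = 0

    def poll(self):
        records = self._log.read_from(self.offset)
        self.offset += len(records)
        return records


log = TopicLog()
log.append({"user": "u1", "event": "page-click"})
log.append({"user": "u2", "event": "video-play"})

notifications = Consumer(log)  # e.g. an intelligent-notifications application
anomaly = Consumer(log)        # e.g. an anomaly-detection application

first_batch = notifications.poll()   # the two records produced so far
log.append({"user": "u1", "event": "video-play"})
second_batch = notifications.poll()  # only the new record
anomaly_batch = anomaly.poll()       # all three: its offset is independent
```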

Slide 9

Slide 9 text

Real-time infrastructure at Hotstar
● All clickstream data pushed into Apache Kafka
● Apache Kafka Streams to process events as they happen
● Incoming data available for everyone
Diagram labels: Apple TV, iOS, Android, Roku; Stream Processing Framework (Filter, Window, Join); Intelligence (Anomaly, Machine Learning)
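Two of the operations named in the diagram, filter and join, can be sketched with plain Python generators. This is not the Kafka Streams DSL; the function names, the event fields and the `profiles` table are assumptions made for illustration. Each event is handled as it arrives, and the join is an inner join against a lookup table keyed by user.

```python
def filter_events(events, predicate):
    """Pass through only the events that satisfy the predicate."""
    for event in events:
        if predicate(event):
            yield event


def join_with_table(events, table, key):
    """Enrich each event with the matching table row; drop events with no match."""
    for event in events:
        row = table.get(event[key])
        if row is not None:
            yield {**event, **row}


# Hypothetical clickstream events from different client platforms.
clicks = [
    {"user": "u1", "event": "video-play", "platform": "iOS"},
    {"user": "u2", "event": "page-click", "platform": "Roku"},
    {"user": "u3", "event": "video-play", "platform": "Android"},
]

# Hypothetical user-profile table.
profiles = {"u1": {"language": "hi"}, "u2": {"language": "ta"}}

plays = filter_events(clicks, lambda e: e["event"] == "video-play")
enriched = list(join_with_table(plays, profiles, "user"))
```

Here `enriched` holds a single record: the play by `u1`, extended with that user's language. The play by `u3` is dropped because no profile exists for it.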

Slide 10

Slide 10 text

Demo
Predict in real time whether a flight is delayed
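The slides do not include the demo's code, model or features, so the following is only a sketch of the general shape such a demo might take: score each flight event the moment it arrives rather than in a nightly batch. The logistic model, its hand-picked weights and the field names (`dep_delay_min`, `distance_km`) are all assumptions.

```python
import math

# Hypothetical, hand-picked weights; a real model would be trained offline.
WEIGHTS = {"dep_delay_min": 0.08, "distance_km": 0.0002}
BIAS = -2.0


def delay_probability(flight):
    """Logistic score for a single flight event."""
    z = BIAS + sum(weight * flight[name] for name, weight in WEIGHTS.items())
    return 1.0 / (1.0 + math.exp(-z))


def predict_stream(flights, threshold=0.5):
    """Score flights one at a time, yielding a prediction per incoming event."""
    for flight in flights:
        p = delay_probability(flight)
        yield {"flight": flight["flight"], "p_delayed": p, "delayed": p >= threshold}


incoming = [
    {"flight": "AA100", "dep_delay_min": 0, "distance_km": 1000},
    {"flight": "BB200", "dep_delay_min": 45, "distance_km": 1000},
]
results = list(predict_stream(incoming))
```

With these toy weights the on-time departure scores below the threshold and the flight that left 45 minutes late scores above it.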

Slide 11

Slide 11 text

How to process a stream?
Diagram label: ML

Slide 12

Slide 12 text

Advanced use-cases
Diagram labels: page-clicks, video-plays, predict-gender, predict-interest, 5-min trending videos, Recommended for You, Hotstar Streaming Platform
Legend: Processor nodes, Source / Sink nodes
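The "5-min trending videos" node can be approximated by a windowed count over play events. The sketch below assumes tumbling (non-overlapping) five-minute windows and events shaped as `(timestamp_seconds, video_id)`; the slide does not say which window type or event schema is used, and `trending_per_window` is a hypothetical name.

```python
from collections import Counter, defaultdict

WINDOW_SECONDS = 5 * 60


def trending_per_window(video_plays, top_n=3):
    """Count plays per video in each 5-minute tumbling window, most played first."""
    counts = defaultdict(Counter)
    for timestamp, video_id in video_plays:
        # Align the event to the start of the window it falls in.
        window_start = timestamp - timestamp % WINDOW_SECONDS
        counts[window_start][video_id] += 1
    return {
        start: [video for video, _ in counter.most_common(top_n)]
        for start, counter in sorted(counts.items())
    }


plays = [
    (0, "a"), (10, "b"), (20, "a"), (299, "a"),  # first window: 0 to 299 s
    (300, "c"), (310, "c"), (320, "b"),          # second window: 300 to 599 s
]
trending = trending_per_window(plays, top_n=2)
```

For this input the first window ranks `a` ahead of `b`, and the second ranks `c` ahead of `b`. A streaming engine would emit these results incrementally as windows close instead of collecting all events first.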

Slide 13

Slide 13 text

Questions?
tech.hotstar.com