Deploying Models to production with Azure ML | Scottish Summit

Deploying models is an important part of building end-to-end ML applications. I first show how models can be registered with Azure ML so that they are accessible and can be loaded for deployment. I then show how to build configurations for deploying those models with Azure ML. Azure ML makes it easy to deploy models for low-latency, real-time inference, which many applications require, so I focus mainly on this and also show how to consume the deployed models. I further show how to build batch inference pipelines. If time permits, I will also show demos of each of these.

Rishit Dagli

May 11, 2021

Transcript

  1. #ScottishSummit2021
    Rishit Dagli
    Deploying Models to Production with Azure ML
    Methven, Sat 17:30
    @rishit_dagli
    Rishit-dagli
    www.rishit.tech

  2. Our Sponsors

  3. “Most models don’t get deployed.”

  4. 90%
    Of models don’t get deployed

  5. Source: Laurence Moroney

  6. TEDx, TED-Ed Speaker
    High School
    Rishit Dagli
    @rishit_dagli
    Rishit-dagli
    www.rishit.tech

  7. • High School Student
    • TEDx and TED-Ed Speaker
    • ♡ Hackathons and competitions
    • ♡ Research
    • My coordinates - www.rishit.tech
    $whoami

  8. Acknowledgements
    • Henk Boelman (Microsoft)
    • Dawood Iddris (AI MVP)

  9. • Devs who have worked on creating Machine Learning Models
    • Devs looking for ways to make their models production-ready
    Ideal Audience

  10. Why care about ML deployments?
    Source: memegenerator.net

  11. (image-only slide)

  12. • Package the model
    What things to take care of?

  13. • Package the model
    • Post the model on Server
    What things to take care of?

  14. • Package the model
    • Post the model on Server
    • Maintain the server
    What things to take care of?

  15. • Package the model
    • Post the model on Server
    • Maintain the server
    o Auto-scale
    What things to take care of?

  16. • Package the model
    • Post the model on Server
    • Maintain the server
    o Auto-scale
    What things to take care of?

  17. • Package the model
    • Post the model on Server
    • Maintain the server
    o Auto-scale
    o Global Availability
    What things to take care of?

  18. • Package the model
    • Post the model on Server
    • Maintain the server
    o Auto-scale
    o Global Availability
    o Latency
    What things to take care of?

  19. • Package the model
    • Post the model on Server
    • Maintain the server
    • API
    What things to take care of?

  20. • Package the model
    • Post the model on Server
    • Maintain the server
    • API
    • Model Versioning
    What things to take care of?

  21. • Package the model
    • Post the model on Server
    • Maintain the server
    • API
    • Model Versioning
    • Batch Predictions
    What things to take care of?

  22. Simple Deployments
    Why are they Inefficient?

  23. (image-only slide)

  24. Simple Deployments
    Why are they Inefficient?
    • No consistent API
    • No model versioning
    • No mini-batching
    • Inefficient for large models
    Source: Hannes Hapke

  25. Deploying a Model
    A Walkthrough

  26. What do we need?
    • Register Your Model
    • Load the Model
    • Perform Inference
    • Deploy the model

  27. What do we need?
    • Register Your Model
    • Load the Model
    • Perform Inference
    Do it at Scale

  28. Register a Model

  29. Register a Model
    TensorFlow 2
    Saved Model

  30. Register a Model
    .onnx
    .pkl
    .pt

  31. Register a Model
    With the run object

  32. Register a Model
    With the run object
    .onnx
    .pkl
    .pt
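    The registration call itself was a screenshot on the slide; below is a minimal sketch of what it could look like with the Azure ML Python SDK (v1), assuming a model file saved to outputs/model.pkl and the hypothetical name "demo-model".

    from azureml.core import Workspace
    from azureml.core.model import Model

    ws = Workspace.from_config()  # reads the workspace details from config.json

    # Register a model file from disk
    model = Model.register(workspace=ws,
                           model_path="outputs/model.pkl",   # local path to the saved model
                           model_name="demo-model",          # name used to look the model up later
                           description="Model for the Scottish Summit demo")

    # Or register straight from a training run, keeping the lineage to that run
    # (run is the azureml.core.Run object of the training job):
    # model = run.register_model(model_name="demo-model",
    #                            model_path="outputs/model.pkl")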

  33. Creating an Inference Service
    • Load the Model

  34. Creating an Inference Service
    • Load the Model
    • Inference from the Model

  35. Creating an Inference Service
    • Load the Model
    • Inference from the Model
    Environment

  36. Creating an Inference Service

  37. Creating an Inference Service
    Load the registered model
    Really do the inference

  38. Load a model

  39. Load a model
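    The loading code was shown on the slide; a minimal sketch of the init() half of the entry (scoring) script, assuming a scikit-learn model registered as "demo-model".

    import joblib
    from azureml.core.model import Model

    model = None

    def init():
        """Runs once when the service starts: resolve and load the registered model."""
        global model
        model_path = Model.get_model_path("demo-model")  # path of the model inside the service container
        model = joblib.load(model_path)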

  40. Let’s run inference with the model

  41. Let’s run inference with the model

  42. Let’s run inference with the model

  43. Let’s run inference with the model
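    The inference code was also a screenshot; a sketch of the run() half of the same entry script, assuming a JSON payload of the form {"data": [[...], ...]}.

    import json
    import numpy as np

    def run(raw_data):
        """Runs on every request: parse the payload, predict, return something JSON-serialisable."""
        data = np.array(json.loads(raw_data)["data"])
        predictions = model.predict(data)  # model was loaded in init()
        return predictions.tolist()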

  44. And that’s it

  45. Set up an environment
    Customizable
    • Can use a Docker Image directly
    • Can manage the dependencies yourself too
    • Can specify a custom interpreter
    • Customizable Spark Settings
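    A minimal sketch of one way to build such an environment with azureml.core.Environment, assuming the scoring script needs scikit-learn and joblib.

    from azureml.core import Environment
    from azureml.core.conda_dependencies import CondaDependencies

    env = Environment(name="inference-env")
    env.python.conda_dependencies = CondaDependencies.create(
        pip_packages=["scikit-learn", "joblib", "azureml-defaults"]  # azureml-defaults is required for web services
    )

    # Or point at a Docker image you manage yourself:
    # env.docker.base_image = "<your-registry>/<your-image>:<tag>"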

  46. Let’s deploy it!

  47. Let’s deploy it!

  48. Let’s deploy it!

  49. Let’s deploy it!

  50. Deployment Configuration
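    A sketch of an AKS deployment configuration, with illustrative resource and autoscale values.

    from azureml.core.webservice import AksWebservice

    deployment_config = AksWebservice.deploy_configuration(
        cpu_cores=1,                # per replica
        memory_gb=2,
        autoscale_enabled=True,     # let the service scale replicas with load
        autoscale_min_replicas=1,
        autoscale_max_replicas=4,
    )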

  51. And Deploy to AKS
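    Putting the pieces together; a sketch that assumes an AKS inference cluster named "aks-cluster" is already attached to the workspace, and reuses ws, model, env and deployment_config from the earlier snippets.

    from azureml.core.compute import AksCompute
    from azureml.core.model import InferenceConfig, Model

    inference_config = InferenceConfig(entry_script="score.py", environment=env)
    aks_target = AksCompute(ws, "aks-cluster")

    service = Model.deploy(workspace=ws,
                           name="demo-service",
                           models=[model],
                           inference_config=inference_config,
                           deployment_config=deployment_config,
                           deployment_target=aks_target)
    service.wait_for_deployment(show_output=True)
    print(service.scoring_uri)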

  52. Inference with REST
    • JSON response
    • Can specify a particular version

  53. Inference with REST
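    A sketch of calling the deployed endpoint over REST, assuming the JSON contract used in run() above and the service object from the deployment sketch.

    import json
    import requests

    headers = {"Content-Type": "application/json"}
    # For an AKS service with key auth enabled, also send the key:
    # headers["Authorization"] = "Bearer " + service.get_keys()[0]

    payload = json.dumps({"data": [[5.1, 3.5, 1.4, 0.2]]})
    response = requests.post(service.scoring_uri, data=payload, headers=headers)
    print(response.json())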

  54. Inference with gRPC
    • Better connections
    • Data converted to protocol buffers
    • Requests have a designated protobuf type
    • Payload converted to base64
    • Use gRPC stubs
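    The exact client depends on how the model is served; as a rough sketch, this is what a gRPC call looks like against a TensorFlow Serving style prediction service (not an Azure ML SDK API), with host, model and tensor names as placeholders.

    import grpc
    import tensorflow as tf
    from tensorflow_serving.apis import predict_pb2, prediction_service_pb2_grpc

    channel = grpc.insecure_channel("<host>:8500")  # gRPC port of the serving container
    stub = prediction_service_pb2_grpc.PredictionServiceStub(channel)

    request = predict_pb2.PredictRequest()           # requests have a designated protobuf type
    request.model_spec.name = "demo-model"
    request.model_spec.signature_name = "serving_default"
    request.inputs["input"].CopyFrom(tf.make_tensor_proto([[5.1, 3.5, 1.4, 0.2]]))

    result = stub.Predict(request, timeout=10.0)
    print(result.outputs)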

  55. Batch Inferences

  56. Batch Inferences
    • Use hardware efficiently
    • Save costs and compute resources
    • Take multiple requests and process them together
    • Super cool😎for large models

  57. Batch Inferences
    • Update the run() function
    • Runs on each batch of data

  58. Batch Inferences
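    The batch entry script follows the ParallelRunStep contract: init() loads the model once per worker and run() is called once per mini-batch. A minimal sketch, assuming the input is a FileDataset of CSV files and the model registered earlier.

    import joblib
    import pandas as pd
    from azureml.core.model import Model

    def init():
        global model
        model = joblib.load(Model.get_model_path("demo-model"))

    def run(mini_batch):
        # For a FileDataset input, mini_batch is a list of file paths.
        results = []
        for file_path in mini_batch:
            data = pd.read_csv(file_path)
            predictions = model.predict(data)
            results.append(f"{file_path}: {predictions.tolist()}")
        return results  # one output row per processed file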

  59. Configure the ParallelRun
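    A sketch of the corresponding ParallelRunConfig, with illustrative sizes; env is the environment from earlier and compute_target is assumed to be an AmlCompute cluster attached to the workspace.

    from azureml.pipeline.steps import ParallelRunConfig

    parallel_run_config = ParallelRunConfig(
        source_directory="scripts",        # folder containing the batch entry script
        entry_script="batch_score.py",     # the init()/run() script sketched above
        mini_batch_size="5",               # number of files handed to each run() call
        error_threshold=10,                # how many failed items to tolerate before aborting
        output_action="append_row",        # collect run() return values into a single output file
        environment=env,
        compute_target=compute_target,
        node_count=2,
    )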

  60. Create the pipeline
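    A sketch of wiring the config into a ParallelRunStep and a pipeline, assuming the input files are available as a registered dataset (here called "batch-data").

    from azureml.core import Dataset
    from azureml.pipeline.core import Pipeline, PipelineData
    from azureml.pipeline.steps import ParallelRunStep

    batch_dataset = Dataset.get_by_name(ws, "batch-data")
    output_dir = PipelineData(name="inferences", datastore=ws.get_default_datastore())

    batch_step = ParallelRunStep(
        name="batch-scoring",
        parallel_run_config=parallel_run_config,
        inputs=[batch_dataset.as_named_input("batch_data")],
        output=output_dir,
        allow_reuse=False,
    )

    pipeline = Pipeline(workspace=ws, steps=[batch_step])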

  61. Publish the pipeline
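    Finally, a sketch of running the pipeline once and publishing it so it can be triggered again later over REST.

    from azureml.core import Experiment

    pipeline_run = Experiment(ws, "batch-inference").submit(pipeline)
    pipeline_run.wait_for_completion(show_output=True)

    published = pipeline_run.publish_pipeline(
        name="batch-inference-pipeline",
        description="Scores new data in batches",
        version="1.0",
    )
    print(published.endpoint)  # REST endpoint that re-runs the published pipeline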

  62. Demos!

  63. #ScottishSummit2021
    Thank You
    @rishit_dagli
    Rishit-dagli
    www.rishit.tech
