
DevOps for Machine Learning

With machine learning becoming more and more an engineering problem, the need to track ML experiments, collaborate on them, and deploy them easily with integrated CI/CD tooling is more relevant than ever.

In this session we take a deep dive into the DevOps process that comes with Azure Machine Learning service, a cloud service you can use to track your work as you build, train, deploy and manage models. We zoom in on how the data science process can be made traceable, and deploy the model to a Kubernetes cluster with Azure DevOps.

At the end of this session you will have a good grasp of the technological building blocks of Azure Machine Learning service and be able to bring a machine learning project safely into production.

Henk Boelman

October 08, 2019

Transcript

  1. Machine Learning on Azure
     Sophisticated pretrained models, to simplify solution development: Cognitive Services (Vision, Speech, Language, Azure Search, …)
     Popular frameworks, to build advanced deep learning solutions: TensorFlow, Keras, PyTorch, ONNX
     Productive services, to empower data science and development teams: Azure Databricks, Machine Learning VMs, Azure Machine Learning
     Powerful infrastructure, to accelerate deep learning
     Flexible deployment, to deploy and manage models on intelligent cloud and edge: on-premises, cloud, edge

  2. “DevOps is the union of people, process, and products to enable continuous delivery of value to your end users.”

  3. The data science process
     Ask a sharp question → Collect the data → Prepare the data → Select the algorithm → Train the model → Use the answer

  4. Azure Machine Learning Service
     A fully-managed cloud service that enables you to easily build, deploy, and share predictive analytics solutions.

  5. Machine Learning on Azure
     Sophisticated pretrained models, to simplify solution development: Cognitive Services (Vision, Speech, Language, Azure Search, …)
     Popular frameworks, to build advanced deep learning solutions: TensorFlow, Keras, PyTorch, ONNX
     Productive services, to empower data science and development teams: Azure Databricks, Machine Learning VMs, Azure Machine Learning
     Powerful infrastructure, to accelerate deep learning
     Flexible deployment, to deploy and manage models on intelligent cloud and edge: on-premises, cloud, edge

  6. Create a workspace

     from azureml.core import Workspace

     # create the workspace and save its config locally
     ws = Workspace.create(name='<NAME>',
                           subscription_id='<SUBSCRIPTION ID>',
                           resource_group='<RESOURCE GROUP>',
                           location='westeurope')
     ws.write_config()

     # later, reload the workspace from the saved config
     ws = Workspace.from_config()

  7. Azure Machine Learning Service
     Datasets – registered, known data sets
     Experiments – training runs
     Models – registered, versioned models
     Endpoints:
       Real-time Endpoints – deployed model endpoints
       Pipeline Endpoints – training workflows
     Compute – managed compute
     Datastores – connections to data

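These building blocks map directly onto classes in the Python SDK. A minimal sketch of how each is reached from code; the datastore, dataset, experiment and model names below are hypothetical:

     from azureml.core import Workspace, Experiment, Model, Datastore, Dataset

     ws = Workspace.from_config()                   # the workspace ties everything together
     ds = Datastore.get(ws, 'workspaceblobstore')   # a datastore: connection to data
     data = Dataset.get_by_name(ws, 'simpsons')     # a registered, known data set
     exp = Experiment(ws, 'simpsons-classifier')    # an experiment groups training runs
     model = Model(ws, name='MargeOrHomer')         # latest version of a registered model
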
  8. Create compute

     from azureml.core.compute import AmlCompute, ComputeTarget

     # provision an autoscaling GPU cluster (1–6 STANDARD_NC6 nodes)
     cfg = AmlCompute.provisioning_configuration(vm_size='STANDARD_NC6',
                                                 min_nodes=1,
                                                 max_nodes=6)
     cc = ComputeTarget.create(ws, '<NAME>', cfg)

  9. Azure Machine Learning Service Pipelines
     Workflows of steps that can use data sources, datasets and compute targets
     Unattended runs
     Reusability
     Tracking and versioning

  10. Azure Pipelines
      Orchestration for continuous integration and continuous delivery
      Gates, tasks and processes for quality
      Integration with other services
      Trigger on code and non-code events

  11. Create a pipeline step
      A step runs a script on a compute target in a Docker container, wired up with inputs, outputs and parameters.

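The preparation step used later in slide 15 (preProcessStep) is not shown in the deck; a minimal sketch of such a step as a PythonScriptStep, assuming a hypothetical prep.py script and reusing ws, computeCluster and script_folder from the other slides:

     from azureml.pipeline.core import PipelineData
     from azureml.pipeline.steps import PythonScriptStep

     # intermediate output handed from this step to the training step
     processed_data = PipelineData('processed_data',
                                   datastore=ws.get_default_datastore())

     preProcessStep = PythonScriptStep(name='Prepare data',
                                       script_name='prep.py',   # hypothetical script
                                       arguments=['--output', processed_data],
                                       outputs=[processed_data],
                                       compute_target=computeCluster,
                                       source_directory=script_folder)
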
  12. Create a pipeline
      Dataset of Simpsons images (Blob Storage Account) → prepare data → processed dataset → train the model with a TensorFlow estimator → model → register the model (Model Management)

  13. Create an estimator

      from azureml.train.dnn import TensorFlow

      # mount the default datastore so train.py can read the training data
      params = {'--data-folder': ws.get_default_datastore().as_mount()}

      trainEstimator = TensorFlow(source_directory=script_folder,
                                  script_params=params,
                                  compute_target=computeCluster,
                                  entry_script='train.py',
                                  use_gpu=True,
                                  conda_packages=['scikit-learn', 'keras', 'opencv'],
                                  framework_version='1.10')

  14. Create an estimator step

      from azureml.pipeline.steps import EstimatorStep

      trainOnGpuStep = EstimatorStep(name='Train Estimator Step',
                                     estimator=trainEstimator,
                                     inputs=[training_data_location],
                                     outputs=[model],
                                     compute_target=computeCluster,
                                     estimator_entry_script_arguments=estimator_script_params)

  15. Create, publish and run a pipeline

      from azureml.pipeline.core import Pipeline

      prep_train_register = [preProcessStep, trainOnGpuStep, registerStep]
      pipeline = Pipeline(workspace=ws, steps=[prep_train_register])
      mlpipeline = pipeline.publish(name="Marge Or Homer")
      pipeline_run = mlpipeline.submit(ws, experiment_name)

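The registerStep script itself is not shown; a minimal sketch of what it could do, assuming the trained model was written to outputs/model and a hypothetical model name:

     # register.py – hypothetical register-step script
     from azureml.core import Run
     from azureml.core.model import Model

     run = Run.get_context()            # the step's run context
     ws = run.experiment.workspace
     Model.register(workspace=ws,
                    model_name='MargeOrHomer',   # hypothetical name
                    model_path='outputs/model')  # hypothetical path
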
  16. Demo: Create an Azure ML Pipeline
      Steps so far: create an experiment, create a training file, create an estimator, submit to the AI cluster

  17. Diagram of a run: Azure Notebook, Compute Target, Experiment, Docker Image, Data store
      1. Snapshot folder and send to experiment
      2. Create Docker image
      3. Deploy Docker image and snapshot to compute
      4. Mount datastore to compute
      5. Launch the script
      6. Stream stdout, logs, metrics
      7. Copy over outputs

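From the notebook side this whole sequence is kicked off by a single submit call; a minimal sketch, reusing trainEstimator from slide 13 with a hypothetical experiment name:

     from azureml.core import Experiment

     exp = Experiment(ws, 'simpsons-classifier')   # hypothetical experiment name
     run = exp.submit(trainEstimator)              # steps 1–5: snapshot, image, deploy, mount, launch
     run.wait_for_completion(show_output=True)     # step 6: stream stdout, logs and metrics
     # step 7: files written to ./outputs are copied back into the run history
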
  18. Demo: Test the model
      Steps so far: create an experiment, create a training file, create an estimator, submit to the AI cluster, register the model

  19. Source control
      Code and comments only (not Jupyter output)
      Plus every part of the pipeline
      And infrastructure and dependencies
      And maybe a subset of the data

  20. Everything should be in source control! Except your training data,
      which should be a known, shared data source.

  21. Continuous Integration
      Triggered on code change
      Refresh and execute the AML pipeline
      Code quality, linting, and unit testing
      Pull request process
      aka.ms/mitt/azuredevops

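A CI job can refresh and execute the published training pipeline with a few lines of SDK code; a minimal sketch, assuming the pipeline name from slide 15 and a hypothetical experiment name:

     from azureml.core import Workspace
     from azureml.pipeline.core import PublishedPipeline

     ws = Workspace.from_config()
     published = next(p for p in PublishedPipeline.list(ws)
                      if p.name == 'Marge Or Homer')
     run = published.submit(ws, experiment_name='ci-retrain')   # hypothetical name
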
  22. Project
      app.py => run the model as an API
      Dockerfile => Docker configuration
      requirements.txt => Python packages to be installed
      aml_config/config.json => Azure Machine Learning config
      downloadmodel.py => build step to retrieve the latest model (sketched below)

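A minimal sketch of the downloadmodel.py build step, assuming the workspace config is checked in under aml_config/ and reusing the hypothetical model name from above:

     # downloadmodel.py – fetch the latest registered model at build time
     from azureml.core import Workspace
     from azureml.core.model import Model

     ws = Workspace.from_config(path='aml_config/config.json')
     model = Model(ws, name='MargeOrHomer')             # latest version by default
     model.download(target_dir='model', exist_ok=True)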