GDD X Mollie

From notebook to production in Vertex AI Wessel Huising &
Daniel van der Ende

• Data & AI Consultancy and Training • We help
organizations be successful with Data and AI • Mix of Data Engineers, Data Scientists, Machine Learning Engineers, Analytics Engineers & Analytics Translators • Based in Amsterdam & Eindhoven • Part of Xebia Data & AI • The best Payment Service Provider out there • Founded in 2004 by Adriaan Mol • Mission to simplify financial services by creating world-class products • Currently active for merchants in European Economic Area (EEA), Switzerland, and the United Kingdom

The challenge

• Low effort • Highly depending on one specific Data
Scientist • Prone to human errors • Labor intensive Manual predictions are convenient Data Scientists ≠ Software Engineers • Data Scientist tend not to have traditional Software Engineering background • Tend to lack understanding of DevOps ML models are something else • Different than regular software artifacts • There is significant overlap Machine Learning models to production is hard

The classic ML routine

INPUT DATA The classic ML routine

INPUT DATA The classic ML routine Lack of a centralized
data source

INPUT DATA The classic ML routine Code not living in
a Python package

INPUT DATA The classic ML routine Missing lineage tracking of
artifacts

INPUT DATA The classic ML routine No version control of
the code

INPUT DATA The classic ML routine No automatic or scheduled
predictions

INPUT DATA The classic ML routine Predictions all over the
place

What is MLOps? and should you want it?

• Bringing a model into production state • Reliable &
automated service but comes with extra costs • MLOps is a End-to-End process • Results in great ML software MLOps = Data + ML + DevOps

The big decision

The big decision Open Source solutions Managed solutions

What is Vertex AI

“Build, deploy, and scale ML models faster, with pre-trained and
custom tooling within a unified artificial intelligence platform.” What does Google tell us?

Vertex AI has a few (important) components The Buzzword Bingo
And more… Metadata Endpoints Models Pipelines Workbench Features Datasets

From notebook to production

Goal: Use machine learning to create a model that predicts
which passengers survived the Titanic shipwreck

In a Workbench we can do “traditional” Data Science without
any limitations or strings attached. Each DS has their own Virtual Machine Each VM: • Has access to data • Is persistent • Can be configured to work with VSCode or PyCharm on your local machine • Can have the specs you need/want Step 1: Let’s explore the problem! Workbench

Time to move our code out of notebooks! In this
step, we’ll: • Use the Pipeline components that Google provides out of the box to setup a training pipeline. • Train our model and output Datasets and Models. • Deploy our model to an Endpoint so our model is available for consumption by downstream users. Step 2: Deploy it as if you’re Google Metadata Models Pipelines Datasets Endpoints

Let’s take a step back… Good Not so good ❌
Automated Train/Test splits are nice, but also opaque and not very configurable ❌ Where is the model evaluation step? ❌ Everything disappears into one big “train the model” step, including preprocessing. ✅ We have an ML Pipeline ✅ It’s all code ✅ It can be scheduled and kicked off automatically ✅ Everything now is traceable

To mitigate the downsides, while keeping the upsides, we use
the Kubeflow API It integrates well with Vertex AI and can be easily customized Mollievert is our package with customized components to simplify and clarify our ML Pipelines Step 3: Now Mollie-fy it! Metadata Models Pipelines Datasets Endpoints

Wrapping up Mollie’s ML Platform enables robust MLOps practices by:
• Making it easier to go to production without lowering the bar • Empowering Data Scientists and ML Engineers with tooling • Defining a ‘Golden Path’ to production, but allowing customization if desired.

GDD X Mollie

GDD X Mollie

Marketing OGZ
PRO

More Decks by Marketing OGZ

Featured

Transcript

From notebook to production in Vertex AI Wessel Huising &

• Data & AI Consultancy and Training • We help

The challenge

• Low effort • Highly depending on one specific Data

The classic ML routine

INPUT DATA The classic ML routine

INPUT DATA The classic ML routine Lack of a centralized

INPUT DATA The classic ML routine Code not living in

INPUT DATA The classic ML routine Missing lineage tracking of

INPUT DATA The classic ML routine No version control of

INPUT DATA The classic ML routine No automatic or scheduled

INPUT DATA The classic ML routine Predictions all over the

What is MLOps? and should you want it?

• Bringing a model into production state • Reliable &

The big decision

The big decision Open Source solutions Managed solutions

What is Vertex AI

“Build, deploy, and scale ML models faster, with pre-trained and

Vertex AI has a few (important) components The Buzzword Bingo

Vertex AI has a few (important) components The Buzzword Bingo

From notebook to production

Goal: Use machine learning to create a model that predicts

In a Workbench we can do “traditional” Data Science without

Time to move our code out of notebooks! In this

Let’s take a step back… Good Not so good ❌

To mitigate the downsides, while keeping the upsides, we use

Wrapping up Mollie’s ML Platform enables robust MLOps practices by: