Slide 1

Slide 1 text

Machine Learning Infrastructure with Feature Store and Vertex AI Li Software Engineer (Machine Learning)

Slide 2

Slide 2 text

Li / @Li Software Engineer (Machine Learning) ● 2017/4 joined Yahoo! JAPAN ○ FrontEnd Engineer ○ 2019/1~: Machine Learning Engineer ● 2021/9 joined Merpay ○ Machine Learning Engineer ○ currently in charge of fraud prevention-related development.

Slide 3

Slide 3 text

Agenda 01 Background 02 Machine Learning Infrastructure 03 What we think were good after 1 year of operation 04 Future work

Slide 4

Slide 4 text

Agenda 01 Background 02 Machine Learning Infrastructure 03 What we think were good after 1 year of operation 04 Future work

Slide 5

Slide 5 text

Fraud Prevention Models ● Alert Filtering (multiple ML models) ○ ref: https://engineering.mercari.com/blog/entry/alertfiltering-ml/ ● ChargeBack Detection (ML model) ○ ref: https://engineering.mercari.com/en/blog/entry/chargeback-ml/ ● Sub Account Detection ● Suspicious Account Detection (rule-based logics) ● Suspicious Action Detection (complex network) ○ ref: https://engineering.mercari.com/blog/entry/complex-network-ml/ ● (new) Account-based Chargeback Detection ● (new) Account Takeover Detection ● (new) Continuous transaction Detection ● (new) Overdue Payment Detection

Slide 6

Slide 6 text

t 2020 2021 Alert Filtering Model ChargeBack Detection Sub Account Detection Dimension ~ 40 Alert Filtering Model × 4 sub model ChargeBack Detection latest version Sub Account Detection Suspicious Account Detection × ~ 4 detection Suspicious Action Detection × 2 mass fraud detection Dimension ~ 170 Features ● feature dimension increased by 4 times within 1 year ● feature used by multiple models

Slide 7

Slide 7 text

Agenda 01 Background 02 Machine Learning Infrastructure 03 What we think were good after 1 year of operation 04 Future work

Slide 8

Slide 8 text

machine learning infrastructure

Slide 9

Slide 9 text

Vertex AI ref: https://cloud.google.com/vertex-ai/docs/start/introduction-unified-platform?hl=ja

Slide 10

Slide 10 text

Feature Store(Feast) ref: https://docs.feast.dev/

Slide 11

Slide 11 text

Before Feature Store BigQuery GCS Model A Training Model B Training Model C Training …

Slide 12

Slide 12 text

After Feature Store FeatureView1 FeatureView2 FeatureView3 Feature Service1 Feature Service2 Feature Service3 FeatureView4 … … Model A Training Model B Training Model C Training … BigQuery GCS FEAST

Slide 13

Slide 13 text

Agenda 01 Background 02 Machine Learning Infrastructure 03 What we think were good after 1 year of operation 04 Future work

Slide 14

Slide 14 text

● components ○ performs one step in ML workflow ● common components ○ commonly used components such as execute_bigquery_query ● pipelines ○ ML workflow, including all of the components in the workflow Elements Definition component pipeline

Slide 15

Slide 15 text

● supervised learning process was similar between multiple models ● use the same training pipeline with different config files to train multiple models which reduce the cost of modeling Common Pipelines

Slide 16

Slide 16 text

Common Pipelines

Slide 17

Slide 17 text

Version Control def train(): … pipeline.py component.py com1.yaml … com1.yaml com2.yaml compile compile

Slide 18

Slide 18 text

Without Version Control

Slide 19

Slide 19 text

With Version Control

Slide 20

Slide 20 text

pipeline uses different versions of common components Version Control

Slide 21

Slide 21 text

● we use one-for-all versioning for components and pipeline files ● components and pipeline files are compiled and saved to gcs. ● we use different gcs path for different versions Version Control

Slide 22

Slide 22 text

Scheduled/On-demand Execution need to satisfy ● train pipeline ○ on-demand execution ● predict pipeline ○ scheduled execution ● common pipeline ○ execute with different config files

Slide 23

Slide 23 text

Scheduled/On-demand Execution

Slide 24

Slide 24 text

Continuous A/B Test

Slide 25

Slide 25 text

ML Monitoring with Slack Notifier

Slide 26

Slide 26 text

Agenda 01 Background 02 Machine Learning Infrastructure 03 What we think were good after 1 year of operation 04 Future work

Slide 27

Slide 27 text

Feature Online Store with Stream Ingest WIP

Slide 28

Slide 28 text

WIP Data Drift Detection and Re-Train

Slide 29

Slide 29 text

No content