popmon: population shift monitoring made easy

Tracking model performance is crucial to guarantee that a model keeps behaving as originally designed. Predictions may concern events far in the future, so performance can only be verified much later, for example after a year; taking action at that point may already be too late. Typical model performance questions are: is the model performing as expected, and are predictions made on current incoming data still valid?

Model performance depends directly on the data used for training and the data used to make predictions. Changes in the latter (e.g. certain word frequencies, user demographics, etc.) can affect the performance and make predictions unreliable. Given that input data often change over time, it is important to periodically track changes in both input distributions and delivered predictions, and to take action when they differ significantly, for example by diagnosing and retraining an incorrect model in production.

To make monitoring both more consistent and semi-automatic, at ING we have developed a generic Python package called "popmon" that monitors the stability of data populations over time, using techniques from statistical process control. In this talk, the speaker will present multiple scenarios of population shift, the motivation and challenges of population monitoring, and our open-source solution to them.


Tomas Sostak

June 16, 2020


Slide 1: popmon: population shift monitoring made easy. Tomas Sostak, 16 June 2020. @tomassostak. ING Wholesale Banking Advanced Analytics.
Slide 2: Demo, use case. About me: Tomas Sostak, Data Scientist @ ING Wholesale Banking Advanced Analytics. @tomassostak.
Slide 3: Our why
- Our ML models and data were not being monitored carefully enough
- No good open-source solution available
- Past experience in doing this right
Slide 4: Motive
‣ Running reliable and consistent models in production
‣ Are newly incoming data and predictions consistent with the historical data on which the model was originally trained and tested?
‣ If input features change, the tested performance is no longer guaranteed
‣ Full control over continuous retraining of deployed models
‣ Think twice before retraining your model if the new data has a different distribution than the old
‣ Reporting: audit, paper trail
Slide 18: [Chart] One year of data (about 52,000 records) split into weekly histograms over categories A, B, C, D: Week 1, Week 2, Week 3, …, Week 52.
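The weekly category histograms sketched on this slide can be built with a few lines of plain Python. The category names A-D follow the slide; the counts and category weights below are made up for illustration:

```python
import random
from collections import Counter

random.seed(0)

# One year of synthetic categorical records: (week number, category).
# 1,000 records per week, categories A-D with invented frequencies.
records = [
    (week, random.choices("ABCD", weights=[5, 3, 1, 1])[0])
    for week in range(1, 53)
    for _ in range(1000)
]

# One histogram (category -> count) per week.
weekly_hists = {}
for week, category in records:
    weekly_hists.setdefault(week, Counter())[category] += 1

print(len(weekly_hists))              # 52 weekly histograms
print(sum(weekly_hists[1].values()))  # 1000 records in week 1
```

Each weekly histogram can then be compared against a reference to detect shifts, which is what the statistical tests on the next slides do.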
Slide 23: Statistical tests on the weekly histograms (categories A-D):
• Chi-squared
• Kolmogorov-Smirnov
• Pearson's correlation
• Your own tests
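As a minimal illustration of the first test on the slide, Pearson's chi-squared statistic can be computed by hand for two weekly category histograms; all counts below are hypothetical:

```python
# Hypothetical counts over categories A-D for a reference week and a new week.
reference = {"A": 520, "B": 300, "C": 110, "D": 70}
current   = {"A": 400, "B": 310, "C": 180, "D": 110}

n_ref = sum(reference.values())
n_cur = sum(current.values())

# Pearson's chi-squared statistic: sum over bins of (observed - expected)^2 / expected,
# where the expected counts are the reference frequencies scaled to the current total.
chi2 = sum(
    (current[c] - reference[c] * n_cur / n_ref) ** 2 / (reference[c] * n_cur / n_ref)
    for c in reference
)
print(round(chi2, 2))  # 95.43

# With 4 categories there are 3 degrees of freedom; the 5% critical value
# of the chi-squared distribution is 7.815, so this week is flagged as shifted.
print(chi2 > 7.815)  # True
```

In practice one would use a library routine such as `scipy.stats.chisquare` for the p-value; the manual version just shows what the test measures.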
Slide 24: Why histograms?
• Aggregated information (data privacy)
• Small size: easy to store, light to send over APIs
• Monitoring works identically for big and small data
• More visual: adds information (the distribution)
• Useful for applying all sorts of statistical tests
Slide 26: For data scientists and data engineers
- Great for data exploration (seeing data patterns, trends, seasonality, outliers)
- Very valuable for early inspection of covariate shift
- Data-ingestion pipelines (monitor your incoming data to prevent a drop in performance); stitching is available (e.g. for data coming in batches over a certain period or a number of records)
Slide 28:
• Profiling: count, mean, std, filled, nan, min, max, p01, p05, p25, p50, p75, p95, p99, …
• Reference points: self, reference (train), rolling (sliding), expanding
• Statistical comparisons: chi-squared, Kolmogorov-Smirnov, Pearson's correlation, trend detection (linear regression), custom tests
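The difference between the rolling (sliding) and expanding reference points on this slide can be sketched in a few lines; the weekly profile values are invented, and popmon's actual implementation is richer than this:

```python
# Weekly profile values (e.g. the mean of one feature per week); numbers invented.
weekly_means = [0.50, 0.51, 0.49, 0.52, 0.50, 0.48, 0.61, 0.63, 0.62]

def rolling_reference(series, i, window=3):
    """Reference = the `window` points immediately before index i (sliding)."""
    return series[max(0, i - window):i]

def expanding_reference(series, i):
    """Reference = everything observed before index i."""
    return series[:i]

i = len(weekly_means) - 1                    # compare the latest week...
print(rolling_reference(weekly_means, i))    # [0.48, 0.61, 0.63]
print(expanding_reference(weekly_means, i))  # all eight earlier weeks
```

A "self" reference compares the data against itself, and a fixed "reference" compares against a static set such as the training data; rolling adapts quickly to recent behaviour, while expanding is more stable over time.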
Slide 35: popmon, released April 2020
- Use popmon to monitor the stability of a pandas or Spark dataset
- Automatically detect changes over time: trends, shifts, peaks, outliers, anomalies, changing correlations, etc.
- Alerting based on static or dynamic business rules
- Easy to extend: combine your own data pipelines (with preferred configurations) with your own statistical tests, and everything automatically shows up in the report
- Supports 1D and 2D histograms
Slide 42: Internal use case. [Chart: chi-squared statistic over time.] A switch to a new data source shows up as a jump in the chi-squared statistic. Training on all data: AUC model performance 0.972; training on the new data only: AUC model performance 0.995.