Slide 105
Slide 105 text
©2024 Databricks Inc. — All rights reserved
Profiling
Table
Drift
Table
Dashboard
Monitoring a table in the Lakehouse
Table
🔎monitor
Alerts Webhooks
DBSQL
How does it work?
Distributional statistics for
inputs, outputs
Minimum, maximum, standard deviation,
quantiles, top occurring value, …
Model quality metrics (if labels
are provided)
Classification: Accuracy, F1, precision,
recall
Regression: MSE, RMSE, MAE, R2, …
Anomaly detection and drift for
training-vs-scoring and
scoring-vs-scoring
Delta/changes in nulls and counts, PSI, KS
divergence, Mean shift, Total Variation
distance, L-inf distance, χ2 test, Wasserstein
distance, …
Custom metrics
Expressed as SQL expressions