Machine Learning Drift: spatial and temporal analysis

ML Model Drifting Spatial & Temporal Analysis Foto de Mike
Hindle en Unsplash Andrés L. Martínez Ortiz @davilagrau

About me… Andrés-Leonardo Martínez-Ortiz a.k.a almo, holds a PhD on
Software, Systems and Computing and a Master on Computer Science. Based on Zurich, almo is a member of the Google Machine Learning Site Reliability Engineering team, leading several programs aiming for reliability, efficiency & convergence. He is also a member of IEEE, ACM, Linux Foundation and Computer Society. @davilagrau almo

Agenda Machine Learning Operations: • Efficiency, Reliability and convergence. •
Model Drift Spatial Drift Temporal Drift Risk Modelling References Photo by Bradyn Shock on Unsplash

Machine Learning Operations Photo by Philipp Katzenberger on Unsplash

Machine Learning Operations

MLOps: Model Drifting Machine Learning Abstract Model Data Drifting Concept
Drifting Temporal Drifting Spatial Drifting Temporal Drifting

Spatial Drifting Photo by Pawel Czerwinski on Unsplash

Spatial Drift: Challenges and research areas • Detection under unstructured
and noise datasets • Understanding of the model drift is required for a proper treatment. • Reacting to model drift, adapting the life cycle. Photo by Harole Ethan on Unsplash

Detection General Framework Source: arXiv:2004.05785 Inferring data distribution Extracting sensitive
features Severity of the drift Drift detection Accuracy

General drift partners and algorithms’ performance Source: arXiv:2004.05785 Source: Expert
Systems with Applications 41 (2014) 8144–8156

Drift Understanding Time series analysis Synthetic data Degradation patterns datasets
Explainable analysis (symbolic regression) For critical applications, detection is not enough. ML drift presents high dependency on the application, making diﬃcult general solutions. Open dataset, synthetic data are opportunities for new developments. Massive Online Analysis (MOA)

Temporal Drift Photo by Jon Tyson on Unsplash

What is the temporal ml drifting? Temporal degradation of ml
models affecting • Penalized Regression • Random Forest • Gradient Boosting • Neural network over • long life datasets (3-5 years) • with no data or concept drifting • Multi-domain: weather, financial, hospital planning and flight delays. Foto de Dustin Humes en Unsplash Vela, D., Sharp, A., Zhang, R. et al. Temporal quality degradation in AI models. Sci Rep 12, 11654 (2022).

Evaluation framework Vela, D., Sharp, A., Zhang, R. et al.
Temporal quality degradation in AI models. Sci Rep 12, 11654 (2022). You need to deﬁne your own evaluation framework.

How does the temporal ML drifting look like? No degradation
or gradual Vela, D., Sharp, A., Zhang, R. et al. Temporal quality degradation in AI models. Sci Rep 12, 11654 (2022).

How does the temporal ML drifting look like? Explosive degradation
Vela, D., Sharp, A., Zhang, R. et al. Temporal quality degradation in AI models. Sci Rep 12, 11654 (2022).

How does the temporal ML drifting look like? Increasing variability
Vela, D., Sharp, A., Zhang, R. et al. Temporal quality degradation in AI models. Sci Rep 12, 11654 (2022).

How does the temporal ML drifting look like? Exotic patterns:
chaos and periodic Vela, D., Sharp, A., Zhang, R. et al. Temporal quality degradation in AI models. Sci Rep 12, 11654 (2022).

Implications for ML Operations Long lasting models demands temporal degradation
analysis. Numerical analysis and dynamic systems analysis are required. Automatic re-training is not (always) an option: • Lack of clear thresholds • Lack of training data • Lack of convergence • Catastrophic forgetting • Random seeds dependencies Recommendations Extend drift analysis, including temporal drift. Evaluation should include models, hyperparameters and size of training data. Feature drifting analysis Real time or high frequency monitoring Your models life is now longer than ever. Temporal drift analysis is a must.

Risk Modelling and Mitigation Photo by Edge2Edge Media on Unsplash

Risks Vulnerabilities in complex systems (hardwares, software, data) Deceived Human
Interaction: misleading reporting Unpredictable sequential planning (collusion) Difficulties being shut down Photo by Tom Morel on Unsplash Deployment of ML systems required risk analysis, including technological, business and legal perspectives

Model Evaluation along the life cycle Internal: multi-layer APIs: red
teaming Auditing Evaluation requires proper development, documentation and deployment, adding extra complexity External

Model Evaluation Source: arXiv:2305.15324

Limitations and hazards Limitations Complex system integrations: unpredictable interactions Unknown
unknown Hidden Features Over-promising Evaluation technology Hazards Impact of model evaluation Superficial improvements to model safety

Organizational Implications Communications • Incident analysis and reporting, including external
parties. • Auditability • Scientific peer-review • Internal communication for business units and non technical staff. Security • Intensive evaluation strategies • AI-based monitoring • Fast responses protocols • Integrity verification, authorization and auditing Photo by Alexander Grey on Unsplash

Thank you! Questions?

References • Lu J., Liu A., Dong F., Gu F.,
Gama J. and Zhang G. Learning under Concept Drift: A Review, arXiv:2004.05785 (2020). (link) • Zeniseka, J., Holzingera, F. and Affenzellera, M. Machine learning based concept drift detection for predictive maintenance, Computers & Industrial Engineering 137 (2019) 106031. • Gonçalves P. M., Carvalho Santos S.G.T., Barros, R.S.M. and Vieira D.C.L. A comparative study on concept drift detectors, Expert Systems with Applications 41 (2014) 8144–8156 • Vela, D., Sharp, A., Zhang, R. et al. Temporal quality degradation in AI models. Sci Rep 12, 11654 (2022). (link) • Shevlane T., Farquhar S., Garfinkel B., Phuong M., Whittlestone J., Leung J, Kokotajlo D., Marchal N., Anderljung M., Kolt N., Ho L., Siddarth D., Avin S., Hawkins W., Kim B., Gabriel I., Bolina V., Clark J., Bengio Y., Christiano P. and Dafoe A. Model evaluation for extreme risks, arXiv:2305.15324 (link)

Machine Learning Drift: spatial and temporal an...

Machine Learning Drift: spatial and temporal analysis

almo

More Decks by almo

Other Decks in Research

Featured

Transcript

ML Model Drifting Spatial & Temporal Analysis Foto de Mike

About me… Andrés-Leonardo Martínez-Ortiz a.k.a almo, holds a PhD on

Agenda Machine Learning Operations: • Efficiency, Reliability and convergence. •

Machine Learning Operations Photo by Philipp Katzenberger on Unsplash

Machine Learning Operations

MLOps: Model Drifting Machine Learning Abstract Model Data Drifting Concept

Spatial Drifting Photo by Pawel Czerwinski on Unsplash

Spatial Drift: Challenges and research areas • Detection under unstructured

Detection General Framework Source: arXiv:2004.05785 Inferring data distribution Extracting sensitive

General drift partners and algorithms’ performance Source: arXiv:2004.05785 Source: Expert

Drift Understanding Time series analysis Synthetic data Degradation patterns datasets

Temporal Drift Photo by Jon Tyson on Unsplash

What is the temporal ml drifting? Temporal degradation of ml

Evaluation framework Vela, D., Sharp, A., Zhang, R. et al.

How does the temporal ML drifting look like? No degradation

How does the temporal ML drifting look like? Explosive degradation

How does the temporal ML drifting look like? Increasing variability

How does the temporal ML drifting look like? Exotic patterns:

Implications for ML Operations Long lasting models demands temporal degradation

Risk Modelling and Mitigation Photo by Edge2Edge Media on Unsplash

Risks Vulnerabilities in complex systems (hardwares, software, data) Deceived Human

Model Evaluation along the life cycle Internal: multi-layer APIs: red

Model Evaluation Source: arXiv:2305.15324

Limitations and hazards Limitations Complex system integrations: unpredictable interactions Unknown

Organizational Implications Communications • Incident analysis and reporting, including external

Thank you! Questions?

References • Lu J., Liu A., Dong F., Gu F.,