• Have impact at scale ◦ Automation ◦ Standards for best practices ◦ Reduce time to implementation ◦ Reduce time to correction • Applies to problems we have ◦ Find fake and exceptional data ◦ Estimate missing data ◦ Triage data by risk ◦ Predict demand and trends Why Use Machine Learning?
experiment Challenges • Current digital data not collected with ML in mind • Data sovereignty often requires on-premises deployments Working with governments and NGOs
• State-of-the-art open source solutions Challenges • Use ML as a means not an end • Data consistency across different implementations Scaling Data Management Platforms