Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Predicted Cloud Spend / Consumption - an #Ensem...

Laura Edell
February 26, 2018

Predicted Cloud Spend / Consumption - an #EnsembleLearning Approach

an @Azure #machinelearning / #deeplearning use case walk through - How to avoid #spuriousCorrelations, understand drivers of cloud spend (consumption) and accurately predicted future spend based on those drivers.

Laura Edell

February 26, 2018
Tweet

More Decks by Laura Edell

Other Decks in Technology

Transcript

  1. Data Preparation Data Pre processing Transformations Data collection Data source

    Identification Initial Dataset creation Feature Engineering Feature extraction Feature transformation Feature selection Model Building Model Architecture Model Stacking Model Validation Picking the right accuracy measures Model Deployment Using Microsoft Azure
  2. Identify the right sample based on business requirements Population Selection

    Noise/Outlier treatment Missing values Data Cleansing Normalization / Log transformation Aggregation and Encoding Data Transformation Constant values/ Zero variance Highly correlated Data Reduction
  3. Data Preparation Data Pre processing Transformations Data collection Data source

    Identification Initial Dataset creation Feature Engineering Feature extraction Feature transformation Feature selection Model Building Model Architecture Model Stacking Model Validation Picking the right accuracy measures Model Deployment Using Microsoft Azure
  4. Customer Spend Prior spend Offering Geo Industry Customers with higher

    historic run rate (spending/rate) are more likely to spend more on Azure Prior Spend Customers subscribed to higher tier offers are likely to spend more on Azure What offer they subscribed to.. Customers from developed countries are likely to have higher spend Customers from tech dominant states/regions are more likely to spend more What country they belong to.. Customer associated with professional services, tech industry are more likely to spend more on Azure Which industry they belong to..
  5. Data Preparation Data Pre processing Transformations Data collection Data source

    Identification Initial Dataset creation Feature Engineering Feature extraction Feature transformation Feature selection Model Building Model Architecture Model Stacking Model Validation Picking the right accuracy measures Model Deployment Using Microsoft Azure
  6. Data Preparation Data Pre processing Transformations Data collection Data source

    Identification Initial Dataset creation Feature Engineering Feature extraction Feature transformation Feature selection Model Building Model Architecture Model Stacking Model Validation Picking the right accuracy measures Model Deployment Using Microsoft Azure
  7. Metric Formulae Pro Cons R^2 % the variance in response

    explained by predictors Can be misleading MAE Outlier resistant No sense of direction RMSE General purpose Metric Outlier sensitive
  8. Data Preparation Data Pre processing Transformations Data collection Data source

    Identification Initial Dataset creation Feature Engineering Feature extraction Feature transformation Feature selection Model Building Model Architecture Model Stacking Model Validation Picking the right accuracy measures Model Deployment Using Microsoft Azure
  9. Lorem Ipsum Lorem Ipsum 01 04 Data preparation Feature Engineering

    Deep Learning Benchmark Crowdsourcing Stacking Model Evaluation Model Tuning