Upgrade to Pro — share decks privately, control downloads, hide ads and more …

AzureML - Zero to Hero

AzureML - Zero to Hero

The presentation for SQLUG meetup.

Govind Kanshi

August 02, 2014
Tweet

More Decks by Govind Kanshi

Other Decks in Technology

Transcript

  1. 1

  2. 2

  3. 3

  4. 4

  5. 5

  6. 6

  7. 7

  8. 8

  9. AzureML - where experiments are done and deployed as web

    services AzureML studio has “toolbar” which has modules for data ingestion/transformation, statistics, machine learning. Some of them have properties which can be set. AzureML has Datasets which can be bought in at runtime or persisted inside. It has public datasets too. AzureML 9
  10. 10

  11. Classification algorithms can be measured by these metrics Regression have

    just RMSE which many people are questioning in present circumstances (Sum through all instances (actual class value - predicted one)) Clustering has different mechanism and requires tests/re-runs to ensure grouped/clustered points have cohesion of somekind Types of classification errors often incur different costs. Total error = (FP+FN)/(TP+FP+TN+FN) Lift charts Sort instances by their predicted probability of being a true positive (TP). X axis is sample size and Y axis is number of true positives (TP). ROC curves (ROC means receiver operating characteristic, a term from signal processing) X axis shows %of false positives (FP) Y axis shows %of true positives (TP). Recall - precision (IR world- search world has these terms too ): Precision (retrieved relevant / total retrieved) = TP / (TP+FP) Recall (retrieved relevant / total relevant) = TP / (TP + FN) 11
  12. 12

  13. 13

  14. Desirables Ipython like “executable” documented – DS – how to

    achieve in simple way Model interpretation More visualization HMM Native Time series Text analysis – IR integration 14