Machine learning: How it can help your business - Microsoft Future Decoded - Budapest, March 2018

Machine Learning: How It Can Help Your Business Szilárd Pafka,
PhD Chief Scientist, Epoch (USA) Microsoft Future Decoded, Budapest March 2018

Disclaimer: I am not representing my employer (Epoch) in this
talk I cannot confirm nor deny if Epoch is using any of the methods, tools, results etc. mentioned in this talk

Source: Andrew Ng

y = f(x) “Learn” f from data Source: Hastie etal,
ESL 2ed

Machine Learning linear/logistic regression decision trees neural networks support vector
machines random forests gradient boosting deep learning neural networks

Machine Learning linear/logistic regression (early 1900s/60s) decision trees (60s/80s) neural
networks (60s/80s) support vector machines (90s) random forests (90s) gradient boosting (90s) deep learning neural networks (2000s)

data mining Source: Szilard Pafka

data science Source: Szilard Pafka

CRISP-DM, 1999

data $$$

Source: Andrew Ng

Source: @iamdevloper (twitter)

structured/tabular data: GBM (or RF) very small data: LR very
large sparse data: LR with SGD images/videos, speech: DL

large sparse data: LR with SGD images/videos, speech: DL better answer: it depends

large sparse data: LR with SGD images/videos, speech: DL better answer: it depends alternative answer: try them all

large sparse data: LR with SGD images/videos, speech: DL better answer: it depends alternative answer: try them all extra accuracy: combine them (ensembles)

ML training: lots of CPU cores lots of RAM limited
time

Source: Szilard Pafka

Random forest GBM GBM + cross validation GBM + hyperparameter
tuning Logistic regression Neural Nets / Deep Learning Ensembles

Backup Slides

Source: Szilard Pafka: 10 Pitfalls in Data Science, LA Data
Science Meetup, February, 2014

Machine learning: How it can help your business...

Machine learning: How it can help your business - Microsoft Future Decoded - Budapest, March 2018

szilard

More Decks by szilard

Featured

Transcript

Machine Learning: How It Can Help Your Business Szilárd Pafka,

Disclaimer: I am not representing my employer (Epoch) in this

Source: Andrew Ng

y = f(x) “Learn” f from data Source: Hastie etal,

Machine Learning linear/logistic regression decision trees neural networks support vector

Machine Learning linear/logistic regression (early 1900s/60s) decision trees (60s/80s) neural

data mining Source: Szilard Pafka

data science Source: Szilard Pafka

data science Source: Szilard Pafka

CRISP-DM, 1999

data $$$

How?

Source: Andrew Ng

Source: @iamdevloper (twitter)

structured/tabular data: GBM (or RF) very small data: LR very

structured/tabular data: GBM (or RF) very small data: LR very

structured/tabular data: GBM (or RF) very small data: LR very

structured/tabular data: GBM (or RF) very small data: LR very

10x

ML training: lots of CPU cores lots of RAM limited

Source: Szilard Pafka

Random forest GBM GBM + cross validation GBM + hyperparameter

Backup Slides

10x

Source: Szilard Pafka: 10 Pitfalls in Data Science, LA Data