Lock in $30 Savings on PRO—Offer Ends Soon! ⏳
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Machine learning: How it can help your business...
Search
szilard
March 21, 2018
0
180
Machine learning: How it can help your business - Microsoft Future Decoded - Budapest, March 2018
szilard
March 21, 2018
Tweet
Share
More Decks by szilard
See All by szilard
Gradient Boosting Machines (GBM): From Zero to Hero (with R and Python Code) - Data Con LA - Oct 2020
szilard
0
200
Make Machine Learning Boring Again: Best Practices for Using Machine Learning in Businesses - Albuquerque Machine Learning Meetup (Online) - Aug 2020
szilard
0
150
Better than Deep Learning: Gradient Boosting Machines (GBM) - eRum conference - invited talk - June 2020
szilard
0
130
Gradient Boosting Machines (GBM): From Zero to Hero (with R and Python Code) - LA Data Science Meetup - February 2020
szilard
0
120
A Random Walk in Data Science and Machine Learning in Practice - CEU, Business Analytics Masters - Budapest, Febr 2020
szilard
0
310
Better than My Meetup/Conference Talks: Going Deeper in Various GBM Topics - GBM Advanced Workshop - Budapest, Nov 2019
szilard
0
87
Gradient Boosting Machines (GBM): From Zero to Hero (with R and Python Code) - Budapest BI Forum, Budapest, Nov 2019
szilard
0
150
Make Machine Learning Boring Again: Best Practices for Using Machine Learning in Businesses - LA Data Science Meetup - Playa Vista, August 2019
szilard
0
130
Better than Deep Learning: Gradient Boosting Machines (GBM) / 2019 edition - Budapest R and Data Science Meetups - Budapest, June 2019
szilard
0
99
Featured
See All Featured
jQuery: Nuts, Bolts and Bling
dougneiner
65
8.2k
We Have a Design System, Now What?
morganepeng
54
7.9k
A better future with KSS
kneath
240
18k
Being A Developer After 40
akosma
91
590k
KATA
mclloyd
PRO
32
15k
"I'm Feeling Lucky" - Building Great Search Experiences for Today's Users (#IAC19)
danielanewman
231
22k
Reflections from 52 weeks, 52 projects
jeffersonlam
355
21k
Understanding Cognitive Biases in Performance Measurement
bluesmoon
32
2.7k
Product Roadmaps are Hard
iamctodd
PRO
55
12k
Distributed Sagas: A Protocol for Coordinating Microservices
caitiem20
333
22k
4 Signs Your Business is Dying
shpigford
186
22k
Sharpening the Axe: The Primacy of Toolmaking
bcantrill
46
2.6k
Transcript
Machine Learning: How It Can Help Your Business Szilárd Pafka,
PhD Chief Scientist, Epoch (USA) Microsoft Future Decoded, Budapest March 2018
None
Disclaimer: I am not representing my employer (Epoch) in this
talk I cannot confirm nor deny if Epoch is using any of the methods, tools, results etc. mentioned in this talk
None
Source: Andrew Ng
None
y = f(x) “Learn” f from data Source: Hastie etal,
ESL 2ed
Machine Learning linear/logistic regression decision trees neural networks support vector
machines random forests gradient boosting deep learning neural networks
Machine Learning linear/logistic regression (early 1900s/60s) decision trees (60s/80s) neural
networks (60s/80s) support vector machines (90s) random forests (90s) gradient boosting (90s) deep learning neural networks (2000s)
None
data mining Source: Szilard Pafka
data science Source: Szilard Pafka
data science Source: Szilard Pafka
CRISP-DM, 1999
data $$$
How?
None
None
Source: Andrew Ng
None
None
None
Source: @iamdevloper (twitter)
None
None
None
structured/tabular data: GBM (or RF) very small data: LR very
large sparse data: LR with SGD images/videos, speech: DL
structured/tabular data: GBM (or RF) very small data: LR very
large sparse data: LR with SGD images/videos, speech: DL better answer: it depends
structured/tabular data: GBM (or RF) very small data: LR very
large sparse data: LR with SGD images/videos, speech: DL better answer: it depends alternative answer: try them all
structured/tabular data: GBM (or RF) very small data: LR very
large sparse data: LR with SGD images/videos, speech: DL better answer: it depends alternative answer: try them all extra accuracy: combine them (ensembles)
None
None
10x
None
None
None
ML training: lots of CPU cores lots of RAM limited
time
None
None
None
None
None
Source: Szilard Pafka
None
Random forest GBM GBM + cross validation GBM + hyperparameter
tuning Logistic regression Neural Nets / Deep Learning Ensembles
None
None
None
None
None
None
Backup Slides
None
10x
None
None
None
Source: Szilard Pafka: 10 Pitfalls in Data Science, LA Data
Science Meetup, February, 2014
None
None