Better than My Meetup/Conference Talks: Going Deeper in Various GBM Topics - GBM Advanced Workshop - Budapest, Nov 2019

Szilard Pafka, PhD
Chief Scientist, Epoch (USA)
GBM Advanced Workshop, Budapest, Nov 2019

Why GBMs

meetup/conference talks
going deeper
section dividers

Disclaimer: I am not representing my employer (Epoch) in this talk.
I cannot confirm nor deny if Epoch is using any of the methods, tools, results etc. mentioned in this talk.

Source: Andrew Ng

...

http://lowrank.net/nikos/pubs/empirical.pdf
http://www.cs.cornell.edu/~alexn/papers/empirical.icml06.pdf

2007
top algos (RF, boosting), all features
most algos (lin, tree, nnet)
worst algos (knn, NB)
top algos, removed top feature(s)

Source: Hastie et al., ESL, 2nd ed.

GBM libs

10x

Scoring

* very first request not shown: >500 ms (JVM “warmup”)
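The footnote above excludes the first request from the latency figures because of JVM warm-up. A minimal sketch of that bookkeeping: time each scoring call and drop the first measurement before summarizing. The `toy_score` function and the request data are hypothetical stand-ins for a real scoring endpoint, and a plain Python function has no JVM warm-up, so this only illustrates how the measurement could be set up.

```python
import statistics
import time


def measure_latencies_ms(score_fn, requests, warmup=1):
    """Time each call to score_fn (in ms) and drop the first `warmup` measurements."""
    latencies = []
    for request in requests:
        start = time.perf_counter()
        score_fn(request)
        latencies.append((time.perf_counter() - start) * 1000.0)
    return latencies[warmup:]


def toy_score(row):
    # Hypothetical stand-in for a real GBM scoring call.
    return sum(row) / len(row)


# 101 made-up requests: the first one is treated as warm-up and discarded.
requests = [[float(i), float(i + 1), float(i + 2)] for i in range(101)]
latencies = measure_latencies_ms(toy_score, requests, warmup=1)
print(f"n={len(latencies)} median={statistics.median(latencies):.4f} ms")
```

Reporting the median (or other percentiles) rather than the mean keeps a single slow call from dominating the summary.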

GBM-perf github repo

multi-core/socket

CPU 1 CPU 2

5x 3.5x

zero

Spark

GPU

catboost

API / tuning

http://www.jmlr.org/papers/volume13/bergstra12a/bergstra12a.pdf

http://www.argmin.net/2016/06/20/hypertuning/

time ordered data: train sample, test sample (slightly different distribution)
train sample resampled 80-10-10 (~CV): proper train, early stopping, model selection
random search over lightgbm
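The setup on this slide can be sketched in plain Python: split time-ordered data into a train and a test sample, repeatedly resample the train sample 80-10-10 into proper-train, early-stopping and model-selection parts, and run a random search over configurations. Several things here are assumptions rather than the talk's actual code: that the train sample is the earlier part of the data, the 80% time cut, the parameter ranges, and the `toy_fit_and_score` function, which is a hypothetical stand-in for training lightgbm with early stopping and scoring on the model-selection part. The parameter names themselves are real lightgbm parameters.

```python
import random


def time_split(rows, train_frac=0.8):
    """Split time-ordered rows: earlier part for train, later part for test."""
    cut = int(len(rows) * train_frac)
    return rows[:cut], rows[cut:]


def resample_80_10_10(train_rows, rng):
    """Shuffle the train sample and cut it into 80% / 10% / 10% parts."""
    shuffled = list(train_rows)
    rng.shuffle(shuffled)
    n = len(shuffled)
    a, b = int(n * 0.8), int(n * 0.9)
    return shuffled[:a], shuffled[a:b], shuffled[b:]


def sample_params(rng):
    """Draw one random configuration (ranges are illustrative assumptions)."""
    return {
        "num_leaves": rng.choice([15, 31, 63, 127, 255]),
        "learning_rate": 10 ** rng.uniform(-2.0, -0.5),
        "min_data_in_leaf": rng.choice([5, 10, 20, 50, 100]),
        "feature_fraction": rng.uniform(0.5, 1.0),
        "bagging_fraction": rng.uniform(0.5, 1.0),
    }


def random_search(train_rows, fit_and_score, n_configs=20, n_resamples=3, seed=0):
    """Return the configuration with the best mean score over the resamples."""
    rng = random.Random(seed)
    best_params, best_score = None, float("-inf")
    for _ in range(n_configs):
        params = sample_params(rng)
        scores = []
        for _ in range(n_resamples):
            proper, stop, select = resample_80_10_10(train_rows, rng)
            scores.append(fit_and_score(params, proper, stop, select))
        mean_score = sum(scores) / len(scores)
        if mean_score > best_score:
            best_params, best_score = params, mean_score
    return best_params, best_score


def toy_fit_and_score(params, proper, stop, select):
    # Hypothetical stand-in: a real version would train lightgbm on `proper`,
    # early-stop on `stop`, and return a metric such as AUC on `select`.
    return -abs(params["learning_rate"] - 0.1)


rows = list(range(1000))  # made-up, already time-ordered data
train_rows, test_rows = time_split(rows)
best_params, best_score = random_search(train_rows, toy_fit_and_score)
print(best_params, best_score)
```

The test sample is never touched during the search, so it can serve as the final check of how the selected model holds up on later data with a slightly different distribution.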

Closing

Source: https://www.linkedin.com/pulse/winning-solution-kaggledays-2019-competition-san-francisco-mark-peng/

More:
