Make hyperparameters great again


Tuning the hyperparameters of machine learning algorithms is computationally expensive, but it is also vital for good predictive performance. Tuning methods range from manual search to more sophisticated procedures such as Bayesian optimization. This talk demonstrates current methods for finding good hyperparameter sets within a fixed time budget for common algorithms such as XGBoost.


MunichDataGeeks

October 07, 2017

Transcript

  1. Make Hyperparameters great again! Daniel Kühn @ DataGeeks Data Day 2017
  2. Intro

  3. XGBoost: http://xgboost.readthedocs.io/en/latest/model.html

  4. XGBoost hyperparameters: eta, min_child_weight, max_depth, gamma, nrounds, subsample, colsample_bytree, colsample_bylevel
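
The parameter list on this slide maps onto a configuration like the following (a dependency-free sketch using the Python-style parameter names; in the R package, nrounds is passed separately as the number of boosting rounds, and the values shown are illustrative, not recommendations):

```python
# Illustrative XGBoost configuration covering the hyperparameters on the slide.
params = {
    "eta": 0.1,                # learning rate: shrinkage applied to each new tree
    "min_child_weight": 1,     # minimum sum of instance weights required in a leaf
    "max_depth": 6,            # maximum depth of each tree
    "gamma": 0.0,              # minimum loss reduction required to split further
    "subsample": 0.8,          # fraction of training rows sampled per tree
    "colsample_bytree": 0.8,   # fraction of columns sampled per tree
    "colsample_bylevel": 1.0,  # fraction of columns sampled per tree level
}
nrounds = 500                  # number of boosting rounds, i.e. trees to fit
```
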
  5. https://www.openml.org/d/151

  6.

  7. Better

  8. How to tune the hyperparameters of XGBoost to get a good result?
  9. Grid search

  10. XGBoost hyperparameters: eta, min_child_weight, max_depth, gamma, nrounds, subsample, colsample_bytree, colsample_bylevel
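
Grid search evaluates every combination of a few hand-picked values per parameter, so its cost multiplies with each parameter added; over all eight parameters above, even three values each would already mean 3^8 = 6561 training runs. A minimal sketch over three of the parameters (the value lists are hypothetical):

```python
import itertools

# Hypothetical grid over three of the XGBoost hyperparameters.
grid = {
    "eta": [0.01, 0.1, 0.3],
    "max_depth": [3, 6, 9],
    "subsample": [0.5, 0.8, 1.0],
}

# Every combination would be trained and cross-validated.
names = list(grid)
configs = [dict(zip(names, combo)) for combo in itertools.product(*grid.values())]
print(len(configs))  # 3 * 3 * 3 = 27 training runs
```
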
  11.

  12. Better

  13. Random search

  14.

  15. Better

  16. BERGSTRA, BENGIO (2012): "Grid search and manual search are the most widely used strategies for hyper-parameter optimization. This paper shows empirically and theoretically that randomly chosen trials are more efficient for hyper-parameter optimization than trials on a grid. […] Granting random search the same computational budget, random search finds better models by effectively searching a larger, less promising configuration space. […] this work shows that random search is a natural baseline against which to judge progress in the development of adaptive (sequential) hyper-parameter optimization algorithms."
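
The quoted result can be sketched in a few lines: instead of a fixed grid, each trial draws every hyperparameter independently from its range, so a budget of n trials probes n distinct values of every parameter rather than a few per axis (the ranges below are illustrative, not from the talk):

```python
import random

rng = random.Random(42)  # fixed seed for reproducibility

def sample_config():
    # One random-search trial: each hyperparameter is drawn independently.
    return {
        "eta": 10 ** rng.uniform(-3, -0.5),   # log-uniform is common for learning rates
        "max_depth": rng.randint(3, 12),
        "subsample": rng.uniform(0.5, 1.0),
    }

budget = 30  # same budget a 30-point grid would get
configs = [sample_config() for _ in range(budget)]
# Unlike a grid, all 30 trials land on distinct eta values.
print(len({c["eta"] for c in configs}))
```
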
  17. Bayesian optimization

  18. SNOEK, LAROCHELLE, ADAMS (2012): "In this work, we consider the automatic tuning problem within the framework of Bayesian optimization, in which a learning algorithm's generalization performance is modeled as a sample from a Gaussian process (GP). […] We show that these proposed algorithms improve on previous automatic procedures and can reach or surpass human expert-level optimization on a diverse set of contemporary algorithms including latent Dirichlet allocation, structured SVMs and convolutional neural networks."
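
The GP-based loop the quote describes can be sketched end to end in plain Python: fit a Gaussian-process surrogate to the evaluations so far, pick the next point with an acquisition function (an upper-confidence bound here, rather than the expected improvement most tools default to), evaluate it, and repeat. Everything below is a toy under stated assumptions: a made-up 1-D objective standing in for cross-validated performance, a fixed RBF kernel, and no noise handling; real tools like mlrMBO handle all of this far more carefully.

```python
import math
import random

def objective(x):
    # Hypothetical validation score to maximize (stands in for CV accuracy).
    return math.exp(-((x - 0.3) ** 2) / 0.05) + 0.1 * math.sin(10 * x)

def kernel(a, b, ls=0.15):
    # Squared-exponential (RBF) kernel with length scale ls.
    return math.exp(-((a - b) ** 2) / (2 * ls ** 2))

def solve(A, b):
    # Gaussian elimination with partial pivoting; fine for tiny systems.
    n = len(A)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x

def gp_posterior(xs, ys, x):
    # Zero-mean GP posterior mean and variance at x, with a small jitter.
    K = [[kernel(a, b) + (1e-6 if i == j else 0.0)
          for j, b in enumerate(xs)] for i, a in enumerate(xs)]
    k = [kernel(a, x) for a in xs]
    alpha = solve(K, ys)           # K^{-1} y
    v = solve(K, k)                # K^{-1} k
    mean = sum(ki * ai for ki, ai in zip(k, alpha))
    var = max(kernel(x, x) - sum(ki * vi for ki, vi in zip(k, v)), 1e-12)
    return mean, var

rng = random.Random(0)
xs = [rng.random() for _ in range(3)]   # initial random design
ys = [objective(x) for x in xs]
candidates = [i / 200 for i in range(201)]
for _ in range(10):                     # sequential BO iterations
    def ucb(x):                         # acquisition: mean + 2 * stddev
        m, v = gp_posterior(xs, ys, x)
        return m + 2 * math.sqrt(v)
    nxt = max(candidates, key=ucb)      # most promising point under the surrogate
    xs.append(nxt)
    ys.append(objective(nxt))
best = max(ys)
print(round(xs[ys.index(best)], 2), round(best, 3))
```
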
  19. https://cran.r-project.org/web/packages/mlrMBO/vignettes/mlrMBO.html

  20. https://cran.r-project.org/web/packages/mlrMBO/vignettes/mlrMBO.html

  21. https://cran.r-project.org/web/packages/mlrMBO/vignettes/mlrMBO.html

  22.

  23.

  24. Better

  25. Open Machine Learning bot

  26. https://www.openml.org/u/2702

  27.

  28. Random Forest

  29. PREDICT

      Dataset  ETA    NROUNDS  …  AUC
      1        0.27   903      …  0.84
      1        0.12   2841     …  0.92
      …        …      …        …  …

  30. PREDICT: How to find good values?

      Dataset  ETA    NROUNDS  …  AUC
      1        0.27   903      …  0.84
      1        0.12   2841     …  0.92
      2        …      …        …  …

  31. PREDICT

      Dataset  ETA    NROUNDS  …  AUC
      1        0.27   903      …  0.84
      1        0.12   2841     …  0.92
      2        0.05   1750     …  ?
      2        0.072  2411     …  ?
      2        …      …        …  ?

  32. PREDICT

      Dataset  ETA    NROUNDS  …  AUC
      1        0.27   903      …  0.84
      1        0.12   2841     …  0.92
      2        0.05   1750     …  0.97
      2        0.072  2411     …  0.91
      2        …      …        …  …

  33. PREDICT: Take the best!

      Dataset  ETA    NROUNDS  …  AUC
      1        0.27   903      …  0.84
      1        0.12   2841     …  0.92
      2        0.05   1750     …  0.97
      2        0.072  2411     …  0.91
      2        …      …        …  …
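
The loop on these slides (predict the AUC of unseen configurations from logged runs, then actually evaluate only the most promising one) can be sketched as follows. The talk trains a random forest surrogate on OpenML run data across datasets; to keep this sketch dependency-free, a k-nearest-neighbour average stands in for the forest, a single dataset is assumed, and all numbers are illustrative.

```python
# Logged runs from earlier experiments: (eta, nrounds, observed AUC).
past_runs = [
    (0.27, 903, 0.84),
    (0.12, 2841, 0.92),
    (0.05, 1750, 0.97),
    (0.072, 2411, 0.91),
]

def predict_auc(eta, nrounds, k=2):
    # Surrogate model: mean AUC of the k most similar past runs,
    # with features roughly rescaled before measuring distance.
    def dist(run):
        return ((run[0] - eta) / 0.3) ** 2 + ((run[1] - nrounds) / 3000) ** 2
    nearest = sorted(past_runs, key=dist)[:k]
    return sum(r[2] for r in nearest) / k

# Fresh candidate configurations whose AUC has never been measured.
candidates = [(0.06, 1800), (0.2, 1200), (0.1, 2600)]

# "Take the best": spend the expensive real evaluation only on the
# candidate the surrogate rates highest.
best = max(candidates, key=lambda c: predict_auc(*c))
print(best)  # (0.06, 1800)
```
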
  34. Random Forest

  35. Better

  36. Wrap up