
Gradient Boosting Machines (GBM): From Zero to Hero (with R and Python Code) - Data Con LA - Oct 2020

szilard
October 09, 2020

Transcript

  1. Gradient Boosting Machines (GBM):
    From Zero to Hero (with R and Python Code)
    Szilard Pafka, PhD
    Chief Scientist, Epoch
    Data Con LA (Online)
    Oct 2020


  3. Disclaimer:
    I am not representing my employer (Epoch) in this talk
    I can neither confirm nor deny whether Epoch is using any of the methods, tools,
    results, etc. mentioned in this talk


  4–6. [Image slides] Source: Andrew Ng


  12. [Image] Source: https://twitter.com/iamdevloper/


  22. y = f(x₁, x₂, ..., xₙ)
    “Learn” f from data

  25. Supervised Learning
    Data: X (n obs, p features), y (labels)
    Regression, classification
    Train/learn/fit f from data (model)
    Score: for new x, get f(x)
    Algos: LR, k-NN, DT, RF, GBM, NN/DL, SVM, NB…
    Goal: max acc / min err on new data
    Metrics: MSE, AUC (ROC)
    Bad: measure on train set. Need: test set/cross-validation (CV)
    Hyperparameters, model capacity, overfitting
    Regularization
    Model selection
    Hyperparameter search (grid, random)
    Ensembles
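
    A minimal sketch of this workflow in Python, using scikit-learn's
    GradientBoostingClassifier on a synthetic dataset (the dataset and all
    hyperparameter values here are illustrative assumptions, not the talk's code):

        from sklearn.datasets import make_classification
        from sklearn.ensemble import GradientBoostingClassifier
        from sklearn.metrics import roc_auc_score
        from sklearn.model_selection import train_test_split

        # X: n obs x p features, y: labels (synthetic data for illustration)
        X, y = make_classification(n_samples=10000, n_features=20, random_state=42)

        # never measure error on the train set: hold out a test set
        X_train, X_test, y_train, y_test = train_test_split(
            X, y, test_size=0.3, random_state=42)

        # train/learn/fit f from data; hyperparameters control model capacity
        model = GradientBoostingClassifier(
            n_estimators=100, max_depth=3, learning_rate=0.1)
        model.fit(X_train, y_train)

        # score: for new x, get f(x); evaluate with AUC on the held-out set
        p = model.predict_proba(X_test)[:, 1]
        print("AUC:", roc_auc_score(y_test, p))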



  29–30. [Figures] Source: Hastie et al., The Elements of Statistical Learning, 2nd ed.
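
    The core idea of gradient boosting: fit small trees to the residuals (the
    negative gradient of the loss) and add them up with shrinkage. A minimal
    from-scratch sketch for squared-error regression, assuming scikit-learn's
    DecisionTreeRegressor as the base learner (an illustration of the idea,
    not the talk's code):

        from sklearn.tree import DecisionTreeRegressor

        def gbm_fit(X, y, n_trees=100, learning_rate=0.1, max_depth=3):
            # start from a constant prediction: the mean minimizes squared error
            f0 = y.mean()
            residual = y - f0
            trees = []
            for _ in range(n_trees):
                # fit a small tree to the current residuals (the negative
                # gradient of squared-error loss), then take a shrunken step
                tree = DecisionTreeRegressor(max_depth=max_depth)
                tree.fit(X, residual)
                residual = residual - learning_rate * tree.predict(X)
                trees.append(tree)
            return f0, trees

        def gbm_predict(X, f0, trees, learning_rate=0.1):
            # the shrunken tree predictions summed on top of the constant
            return f0 + learning_rate * sum(t.predict(X) for t in trees)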


  50. no-one is using this crap


  54. Live Demo
    Summary of the demo for those reading just the
    slides (e.g. those who did not attend the talk):
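
    The demo slides themselves were screenshots and did not survive this
    transcript. As a hedged stand-in, here is a minimal Python sketch of the
    kind of train-and-score run such a GBM demo typically walks through,
    assuming the xgboost package (the dataset and parameter values are made
    up for illustration, not the talk's actual demo code):

        import xgboost as xgb
        from sklearn.datasets import make_classification
        from sklearn.metrics import roc_auc_score
        from sklearn.model_selection import train_test_split

        X, y = make_classification(n_samples=100000, n_features=20, random_state=0)
        X_train, X_test, y_train, y_test = train_test_split(
            X, y, test_size=0.3, random_state=0)

        # the main knobs: number of trees, tree depth, learning rate (shrinkage)
        model = xgb.XGBClassifier(
            n_estimators=300, max_depth=10, learning_rate=0.1, n_jobs=-1)
        model.fit(X_train, y_train)

        p = model.predict_proba(X_test)[:, 1]
        print("AUC:", roc_auc_score(y_test, p))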



  68. Bergstra & Bengio, “Random Search for Hyper-Parameter Optimization”, JMLR 13 (2012):
    http://www.jmlr.org/papers/volume13/bergstra12a/bergstra12a.pdf
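
    The linked paper argues that random search beats grid search for
    hyper-parameter optimization. A minimal sketch with scikit-learn's
    RandomizedSearchCV around an xgboost model (the distributions and the
    search budget are illustrative assumptions; X_train, y_train as in the
    sketch above):

        import xgboost as xgb
        from scipy.stats import randint, uniform
        from sklearn.model_selection import RandomizedSearchCV

        # sample hyperparameters at random instead of walking a fixed grid
        param_distributions = {
            "n_estimators": randint(100, 1000),
            "max_depth": randint(2, 12),
            "learning_rate": uniform(0.01, 0.3),   # uniform on [0.01, 0.31]
            "subsample": uniform(0.5, 0.5),        # uniform on [0.5, 1.0]
        }
        search = RandomizedSearchCV(
            xgb.XGBClassifier(n_jobs=-1),
            param_distributions,
            n_iter=20,          # evaluation budget: 20 random configurations
            scoring="roc_auc",
            cv=3,
            random_state=0)
        search.fit(X_train, y_train)
        print(search.best_params_, search.best_score_)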

  69. End of Demo

