ML as a tool Output ● Prediction ● Score Input ● Data (features) ● Task ○ Target ○ Regression or Classification ○ Metrics Auto ML 1. Define a good task 2. Validate results 3. Accelerate your work!
4. Modeling ● Choose model (& hyper parameters) ■ model = Model(params) ● Training ○ Data (Features), Target ■ model.fit(train_x, train_y) ● Prediction ○ Test data ■ pred = model.predict(test_x)
4. Modeling: Model ● Linear model ● k-nearest neighbor algorithm: kNN ● Random forest ○ Decision tree + bagging ● Neural network: NN ● Gradient boosting decision tree: GBDT ○ Decision tree + Gradient boosting