Slide 12
Slide 12 text
FEATURE ENGINEERING
• Numerical - Log, Log(1 + x), Normalization, Binarization
• Categorical - One-hot-encode, TF-IDF (text), Weight-of-Evidence
• Timeseries - Stats, FFT, MFCC (audio), ERP (EEG)
• Numerical/Timeseries to Categorical - RF/GBM*
* http://www.csie.ntu.edu.tw/~r01922136/kaggle-2014-criteo.pdf