Slide 7
Slide 7 text
不均衡なデータへの主な対策
● Data-Centric
○ Sampling Method
■ Over-sampling
■ Under-sampling
■ Combination
○ Active Learning Method
[1]H.He, Y.Ma, “IMBALANCED LEARNING:Foundations, Algorithms, and Applications”, Wiley IEEE Press, 2013.
[2]P.Branco, L.Torgo, R.P.Ribeiro, “A Survey of Predictive Modeling on Imbalanced Domains”, ACM Computing Surveys(CSUR), No. 42(2),
2016.
※これらを組み合わせたHybridな手法もある
● Model-Centric
○ Cost-Sensitive Method
■ Loss weighting
■ Ranking etc...
○ Ensemble(+resampling) Method
■ Boosting
■ Bagging etc...
○ One-Class Learning Method
■ Anomaly Detection etc...
imbalanced-learnで対応