Slide 25
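Some findings
● When averaging, a model with a better score is not necessarily a better addition than one with a worse score.
○ My RF scored better than luckyGB, but
○ adding RF made the averaged score worse.
○ adding luckyGB made the averaged score better.
● Features are still what matters most: the first-place finisher said that their best single model alone would have placed in the top three.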
● The top-ranked models can actually be quite compact. The fourth-place solution uses a weighted average of 6 GBT models plus a special post-processing step; its best single model has only 22 hand-picked features and could take 5th place on its own:
c("WeekOfMonth", "month", "week", "day", "Store", "Promo", "DayOfWeek", "year",
  "SchoolHoliday", "CompDist0", "CompOpenSince0", "Promo2Since0",
  "MeanLogSalesByStore", "MeanLogSalesByState", "MeanLogSalesByStateHoliday",
  "MeanLogSalesByAssortment", "MeanLogSalesByPromoInterval",
  "MeanLogSalesByStorePromoDOW", "MeanLogCustByStorePromoDOW",
  "MeanLogSalesBySchoolHoliday2Type", "Max_TemperatureC", "SONNENSCHEINDAUER")
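One plausible reading of the averaging observation above is that what a model adds to a blend depends on how its errors line up with the errors already in the blend, not only on its standalone score. A minimal Python sketch with made-up numbers follows. The arrays, the names `current_blend`, `better_single` and `worse_single`, the plain RMSE metric, and the unweighted mean are all assumptions for illustration; they are not the actual RF or luckyGB predictions or the competition metric.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def average(*predictions):
    """Unweighted element-wise mean of several prediction lists."""
    return [sum(values) / len(values) for values in zip(*predictions)]


# Hypothetical targets and predictions (illustrative numbers only).
y = [10.0, 12.0, 9.0, 11.0]

# Existing blend: errors of +0.5, -0.5, +0.5, -0.5.
current_blend = [10.5, 11.5, 9.5, 10.5]

# Better standalone score, but its errors point the same way as the blend's.
better_single = [10.8, 11.2, 9.8, 10.2]

# Worse standalone score, but its errors point the opposite way.
worse_single = [9.0, 13.0, 8.0, 12.0]

print("better_single alone:", rmse(y, better_single))
print("worse_single alone: ", rmse(y, worse_single))
print("current blend:      ", rmse(y, current_blend))
print("blend + better:     ", rmse(y, average(current_blend, better_single)))
print("blend + worse:      ", rmse(y, average(current_blend, worse_single)))
```

In this toy setup `better_single` makes larger errors than the blend in the same direction, so averaging it in pulls the blend toward those errors. `worse_single` errs in the opposite direction, so part of its error cancels against the blend's and the averaged error shrinks.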
Ensemble