blindness before it’s too late Build a model to help identify diabetic retinopathy automatically ü Aravind Eye Hospital technicians travel to rural areas to capture images ü Shortage of high trained doctors to review the images and provide diagnosis in rural areas of India Aravind Eye Hospital Madurai, Tamil Nadu Rural areas
: ~ 11,000 ü png format images ü target label 0 : No DR 1 : Mild 2 : Moderate 3 : Severe 4 : Proliferative DR 0 : No DR 1 : Mild 2 : Moderate 3 : Severe 4 : Proliferative DR
B4(MSE) 2. Dataset ü Use 2015 + 2019 while training(for generalization) ü Remove confusing label(duplicate, confusing) 3. Augmentation ü Rotate, Horizonal/Vertical Flip, Zoom, Lightning 4. Pseudo Labeling ü Model Average ü Add training data, and retrain models 5. Ensemble ü Average
EfficientNet B4(400) ü EfficientNet B5(456) 2. Preprocessing ü Crop From Gray(Both Training/Predicting) ü Apply Image Type 3. Augmentation ü Dihedral, Random Crop, Rotation, Contrast Brightness, Cutout, PerspectiveTransform, CLAHE
ü Remove Crop ü Apply Type using image boundary 3. Augmentations(Albumentations) ü Dihedral, Random Crop, Rotation, Contrast Brightness, Cutout, Perspective Transform, CLAHE 4. Simple Average Ensemble
ResNetV2, 2 * inception V4, 2 * SE-ResNeXt50, 2 * SE-ResNeXt101 ü Loss is Smooth L1 ü Replace Average Pooling to Generalized Mean 2. Dataset 3. Preprocessing 4. Pseudo Labeling(Soft) 5. Validation is Public LB!
use many different models and image size 1. e.g. SE-ResNeXt 50 + 101 + Inception ResNetV2 Image sizes are different 2. Recently, use Efficient Net 2. Get model diversity, and get robustness score. 3. We get high score using ensemble(averaging/stacking/blending)
labeling. 2. Pseudo labeling 1. Hard 2. Soft( I haven’t never choose) 3. Many team use training data + pseudo labeling testing data with CV 1. Our team(28th) only used pretrain(2015 + train) phase. but Public LB is low.(we get high score on Private LB .) So we couldn’t choose that submission. 2. We have to get high score pseudo labeling. (If pseudo labeling is low, we cannot get high score)