• Grapheme root(169 class) • Vowel diacritic(11 class) • Consonant diacritic(7 class) • Grapheme is 1295 class • But test data have unseen graphemes • Code Competition • 9hours(CPU)/2hours(GPU)
Face • Binary Cross Entropy • Model • Efficient Net/Se-ResNeXt • 3 target or individual 3 target(1 target per model) • Data Augmentation • FMix • Generate Character • Cycle GAN • From Font
• FMix is better than cutmix • https://arxiv.org/abs/2002.12047 • Individual Prediction (3 target) • Unseen • train model from font data. • Blending https://www.kaggle.com/c/bengaliai-cv19/discussion/135966
1295 class(grapheme) • If all class predictions are low, itʼs judged unseen class. • Seen Model • Efficientnet B7 based 14784 class (Grapheme * Consonant * Vowel) • Unseen Model • Cycle GAN(Font and Image) + Efficientnet B0(2 font models) https://www.kaggle.com/c/bengaliai-cv19/discussion/135984
is better than 1ttf.(ttf file is font) • Convert original image to Font image and predict Efficient net b0 • Get a narrow portion of the handwriting. • Abstract character structure. • Detail in the interview article. • https://medium.com/kaggle-blog/top-marks-for-student-kaggler-in-bengali- ai-a-winners-interview-with-linsho-kaku-dd321b324c74
this competition. • Arc Face/Binary Cross Entropy • Character Generation can get generalized feature • Cycle GAN can get a absolute feature and impressive. • Character generation from font also can get generalization. • Mixed augmentation can get better result(like FMix)