the Knowledge in a Neural Network (Hinton et al., 2015) 2. Recent Research 1. Transformer to CNN:Label-scarce distillation for efficient text classification (Chia et al., 2018 NIPS Workshop) 2. BAM!:Born-Again Multi-Task Networks for Natural Language Understanding (Clark et al., 2019 arXiv) 3. Well-Read Students Learn Better: The Impact of Student Initialization on Knowledge Distillation(Turc et al., 2019 arXiv) 4. Patient Knowledge Distillation for BERT Model Compression (Sun et al., 2019 EMNLP)
Transfer Data 5FBDIFSݽ؛۽ࠗఠࢤࢿ ഛೞחঋ /-*١ٜ݅ӝয۰ Student Training ۄݫఠ ೠNFNPSZMBUFODZ 5FBDIFS0VUQVUਸਊೞৈण Labeled Data Unlabeled Data Transfer Data Process