Research Question : Classify conversational dialogues into one of the emotions category using GoEmotions dataset • DL models : BERT and DistilBERT • ML models : Support Vector Machine, Logistics Regression, Multinomial Naive Bayes, and Random Forest • New possibilities : Natural Language Understanding and Natural Language Generation, improve Human-AI collaboration & human attributes in dialogues • Future work: classify multilabel emotions from multimodal data for LLMs
carefully curated comments extracted from Reddit, with human annotations to 27 emotion categories + Neutral: admiration amusement approval caring desire excitement gratitude joy love optimism pride relief anger annoyance disappointment disapproval disgust embarrassment fear grief nervousness remorse sadness confusion curiosity realization surprise neutral We used the filtered version included in this dataset based on reter-agreement: training dataset: 43,410 + test dataset: 5,427 + validation dataset: 5,426
is a small, fast, cheap and light Transformer model trained by distilling BERT base. It has 40% less parameters than google-bert/bert-base-uncased, runs 60% faster while preserving over 95% of BERT’s performances as measured on the GLUE language understanding benchmark.