Annotation Guideline: Approaches in Literature
Background
● Varying definitions of what is hateful
● Varying labels hate, offense, toxic,
profane, abusive.
○ Some go into finer details of
offense being sexist, racist,
islamophobic etc.
● NO STANDARD DEFINITION of
hate speech in NLP
○ AKA no benchmark dataset or
leaderboard for hate speech.
Current approaches
● Expert annotations
● Crowdsourced annotations
● Mixtures of both