corpus. (Misra et al., 2016)
◦ 6,000 sentential argument pairs.
◦ From social media dialogs on three controversial topics.
▪ gun control, gay marriage, and death penalty.
◦ Annotated on a scale from 0 to 5.
▪ 0: different topic, 5: completely equivalent.
◦ The notion of similarity is fairly different from that in STS datasets.
▪ To be considered similar, arguments must not only make similar claims, but also provide similar reasoning.
▪ Simple unsupervised methods perform badly on this dataset (Reimers et al., 2019).
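
To make "simple unsupervised method" concrete, here is a minimal sketch of one such baseline: token-overlap (Jaccard) similarity, scored against 0 to 5 gold labels with Spearman rank correlation. The choice of Jaccard and Spearman is an assumption for illustration, not necessarily the setup of the cited work, and the sentence pairs and gold scores are invented toy data, not taken from the corpus.

```python
import re


def tokens(text):
    """Lowercase a sentence and return its set of word tokens."""
    return set(re.findall(r"[a-z']+", text.lower()))


def jaccard(a, b):
    """Token-overlap (Jaccard) similarity in [0, 1]."""
    ta, tb = tokens(a), tokens(b)
    if not ta and not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def ranks(values):
    """1-based ranks; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation; returns 0.0 if either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman correlation = Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


if __name__ == "__main__":
    # Invented toy pairs with hypothetical 0-5 gold scores (NOT corpus data).
    pairs = [
        ("Guns make homes safer because owners can defend themselves.",
         "Guns make homes less safe because accidents happen.", 2),
        ("The death penalty deters crime.",
         "Capital punishment discourages would-be murderers.", 5),
        ("Marriage is a civil right.",
         "Background checks reduce gun violence.", 0),
    ]
    predicted = [jaccard(a, b) for a, b, _ in pairs]
    gold = [score for _, _, score in pairs]
    print("Jaccard scores:", predicted)
    print("Spearman vs. gold:", spearman(predicted, gold))
```

The toy pairs are chosen to show the failure mode described above: two arguments can share most of their words while giving opposite reasoning, and two arguments can share almost no words while making the same point, so lexical overlap alone is a weak proxy for this similarity notion.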