corpus. (Misra et al., 2016)
 ◦ 6,000 sentential argument pairs.
 ◦ From social media dialogs on three controversial topics.
 ▪ gun control, gay marriage, and death penalty.
 ◦ Annotated on a scale from 0 to 5
 ▪ 0: different topic, 5: completely equivalent.
 ◦ The similarity notion is fairly different to STS datasets.
 ▪ To be considered similar, arguments must not only make similar claims, but also provide a similar reasoning.
 ▪ Simple unsupervised methods perform badly on this dataset (Reimers et al., 2019)
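
To make the evaluation setup concrete, here is a minimal sketch of how a simple unsupervised baseline could be scored against 0 to 5 annotations. It assumes a bag-of-words cosine similarity as the baseline and Spearman rank correlation as the metric; both are illustrative choices, not necessarily the setup used in the cited papers. The helper names (`bow_cosine`, `ranks`, `spearman`) and the three argument pairs are hypothetical and were invented for illustration; they are not taken from the corpus.

```python
import math
from collections import Counter


def bow_cosine(a: str, b: str) -> float:
    """Cosine similarity between bag-of-words count vectors."""
    ca, cb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(ca[w] * cb[w] for w in ca)
    norm_a = math.sqrt(sum(v * v for v in ca.values()))
    norm_b = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def ranks(values):
    """1-based ranks; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def spearman(x, y) -> float:
    """Spearman correlation = Pearson correlation of the ranks."""
    rx, ry = ranks(x), ranks(y)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy) if sx and sy else 0.0


# Hypothetical toy pairs with made-up gold scores (NOT from the corpus).
toy_pairs = [
    ("guns make homes less safe", "owning a gun raises risk at home", 4.0),
    ("guns make homes less safe", "guns are a constitutional right", 1.0),
    ("guns make homes less safe", "marriage is a civil right", 0.0),
]

predicted = [bow_cosine(a, b) for a, b, _ in toy_pairs]
gold = [score for _, _, score in toy_pairs]
print("Spearman correlation on toy pairs:", spearman(predicted, gold))
```

Word overlap only captures shared vocabulary, so a baseline like this has no way to tell whether two arguments with similar wording also share the same reasoning, which is the property the annotation scale rewards.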