Slide 2
Slide 2 text
Overview
2
● This paper presents our exploration of BERT-based Bi-Encoder
approach for predicting the similarity of two multilingual news.
● There are several findings such as pretrained models, pooling
methods, translation, data separation, and the number of tokens.
● The weighted average ensemble of the four models achieved the
competitive result and ranked in the top 12.