common task, and useful for evaluating the quality of headlines and thumbnail images. • However, the CTR prediction model trained with users log data is heavily affected by the display position. • Therefore, this research proposes a method for generating a pairwise dataset for training the CTR prediction model through a framework of pairwise learning-to-rank. • We verified its usefulness by experiments and discussed the potential for editing support.
Total print and digital subscribers of the Nikkei reach around 3 million. • The Nikkei is known as the must-read paper for Japanese professionals with extensive coverage of Japan's economy, industry and markets. • With more than 40 affiliated companies, the group business spreads to publishing, broadcasting, events, database services and index business. • Financial Times is also part of the Nikkei.
displays a list of articles, and individual article pages often provide guidance on related articles. • Readers also decide whether to move on to the article page based on the information displayed in the external inflow, such as social networking services and browser searches.
Inc. utilizes the pattern tests for providing multiple options. Randomized controlled trial Published Pattern test with multi-armed bandit Published Distribution rate
where it is desirable to present uniform information to all readers for news of high importance. • The possibility that low-quality options may negatively affect the user experience during experiments must be taken into account. • The workload of editors would be increased in terms of the need to produce several candidates of sufficiently high quality to present to the readers.
the position, the higher its CTR. • If the raw CTR data is simply used as a training dataset, there is a concern that a prediction model would be created that focuses with more importance on the display position than on the information of the article itself.
the similarity of display positions and contents. • We build a model with learning-to-rank framework: focusing more on contents information by learning to compare the two pairs of articles. model CTR: 0.05, 0.01
learning [25, 26], Multi-modal [13] • Position bias: Pairwise learning-to-rank [9, 23] • Editing support: CTR prediction [17], Headline generation [16, 24] Case study on Yahoo! News, which is similar in problem setting.
[25, 26], Multi-modal [13] • Position bias: Pairwise learning-to-rank [9, 23] • Editing support: CTR prediction [17], Headline generation [16, 24] 1. Consideration of position bias derived from service UI. 2. Not only headlines but also thumbnail images. 3. Discussion on use case of headline generation.
Generating a pairwise dataset display position = 1 cluster number = 1 display position = 1 cluster number = 2 … display position = 10 cluster number = 1000 Extracting two pairs of articles from a set that satisfy the set size condition Building a model for predicting CTR using pairwise learning-to-rank model CTR: 0.05, 0.01
• Hyperparameters: The number of clusters Clustering and creating candidate sets 16 CTR of individual articles Generating a pairwise dataset display position = 1 cluster number = 1 display position = 1 cluster number = 2 … display position = 10 cluster number = 1000
Generating a pairwise dataset display position = 1 cluster number = 1 display position = 1 cluster number = 2 … display position = 10 cluster number = 1000 📝 Notes: • Hyperparameters: Maximum set size Extracting two pairs of articles from a set that satisfy the set size condition
articles Generating a pairwise dataset display position = 1 cluster number = 1 display position = 1 cluster number = 2 … display position = 10 cluster number = 1000 Extracting two pairs of articles from a set that satisfy the set size condition Building a model for predicting CTR using pairwise learning-to-rank model CTR: 0.05, 0.01
CTR data • PatternCTR: ◦ Pattern test results. ◦ We use its accuracy for evaluation metric. • PairwiseCTR: ◦ Generated from SingleCTR. ◦ We use it for training and validation.
and thumbnail image. • Baseline + display position + published date time: including information as input. • Baseline + fixed CTR: correcting the CTR of the training dataset. • Proposed method: trained with PairwiseCTR. Models 22 headline BERT thumbnail image EfficientNet display position published date time fully connected layer
bias. • Baseline + display position + published date time: showed improvement for headlines, while no clear performance improvement could be confirmed for thumbnail images. • Baseline + fixed CTR: did not contribute to the performance. • Proposed method: showed particularly high performance for thumbnail images. There was also a certain improvement for headlines compared to the baseline, in some cases obtaining results as good as 0.720.
be assisted with the predicted CTR. • It should also be available as for one perspective for summarization. • We can also present a visualization of the weights.
necessary to be aware of the clickbait issues. • Even if the CTR is high, headlines and thumbnail images that do not match the body text would damage the user experience. • We also tackle this issue, for example creating a recognizing textual entailment model.
method to generate a pairwise dataset for creating the CTR prediction model in the framework of pairwise learning-to-rank considering position bias. • The experiment reported the better performance potential, and the practical use as editing support was explained. • The future work is to expand the evaluation dataset for larger scale performance evaluation. 📧 [email protected] 📘 https://speakerdeck.com/upura/