Mutual Collaboration between Data Analysis and Machine Learning Development at LINE: Continuous Improvement in Sticker Recommendation Engine

Self Introduction - Joined LINE Corporation as a Data Scientist
in February 2019 - Working on data analysis for cross-service projects such as LINE POINT CLUB - Yu Otsuka

Agenda - Data Science & Machine Learning department - A
case of DS & ML collaboration - Continuous improvement in sticker recommendation engine

- DS - Estimate business impact - Perform statistical analysis
- ML - Provide tailor-made ML engine - Develop general ML platform DS & ML Department Data Science department Machine Learning department ML privacy team LPML department Data Science center

Continuous improvement of ML System Plan ML design Check A/B
test Action Find improvement Do ML integration .- .- %4 .- %4

Continuous improvement in sticker recommendation engine A case of DS
& ML collaboration

Stickers For You - Personalized recommendation

Stickers For You % purchase (within the top page) 40%
# clicks per day 1.2M # packages 11M Purchase: As of September 2021 Click: As of August 2021 Package: As of April 2021

Role of “Stickers For You” is changed Smart Channel -
Many sticker touchpoints out of sticker shop - Smart Channel - Home Tab - LINE Official Account - and more … Home Tab

Role of “Stickers For You” is changed 89% Paid users
via both Stickers For You & others Paid users via Stickers For You only - Paid users via Stickers For You also use other touchpoints Period: FY2020

Role of “Stickers For You” is changed As-Is - Offering
stickers that all you need - Require simple recommendation To-Be - Offering stickers that enhances serendipity - Require highly diverse recommendation - Re-definition of Stickers For You

As-Is Recommendation Model Naive Bayes-based collaborative filtering Business Requirement ›Simple
recommendation Model Summary Pros ›Small number of parameters ›Very irrelevant items won’t be recommended Cons ›Poor exploration leads to no fresh items ›Poor generalization leads to cold start problem !(item' |f* , f, , . . . , f. ) ∝ !(item' )∏ 2 !(f2 |item' )

New Recommendation Model Two-tower model based on Graph Convolution Network
Business Requirement ›Enhance serendipity Model Summary Pros ›Highly diverse recommendation ›Better performance reported [1,2] Cons ›Large number of parameters ›ANN search sometimes picks unexpected items

Metrics for Model Assessment Sales via Stickers For You Total
sales Sales divided into major and minor package Contents Via Stickers For You Other than Stickers For You Major package Minor package

Total Sales Lifted, but … Major package Minor package Total
Sales Lift (%) from As-Is model

Total Sales Lifted, but … Major package Minor package Total
Sales Impression Gap between sales and impression lifts Lift (%) from As-Is model

What happened? - e.g. Users bought Osaka-dialect sticker → Model
offers [another prefecture]-dialect sticker - Feedback to Machine Learning department - Too-much feature abstraction on sticker metadata

Improved Recommendation Model Simplified two-tower model based on GCN Business
Requirements ›Enhance serendipity ›Moderate feature abstraction Model Summary ›Simplification by using Light Graph Convolution layer [3]. ›Automated hyperparameter optimization. ›Better performance than the previous model in an offline test.

Total Sales Lifted even more with enough serendipity Lift (%)
from As-Is model Major package Minor package Total Sales Impression 1st model

Total Sales Lifted even more with enough serendipity Lift (%)
from As-Is model Major package Minor package Total Major package Minor package Total Sales Impression 1st model 2nd model

Wrap-up

Continuous improvement of ML System Plan ML design Check A/B
test Action Find improvement Do ML integration .- .- %4 .- %4

Thank you

[1] Inductive Representation Learning on Large Graphs - https://arxiv.org/abs/1706.02216 [2]
Controllable Multi-Interest Framework for Recommendation - https://arxiv.org/abs/2005.09347 [3] LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation - https://arxiv.org/abs/2002.02126 Reference

Mutual Collaboration between Data Analysis and ...

Mutual Collaboration between Data Analysis and Machine Learning Development at LINE: Continuous Improvement in Sticker Recommendation Engine

LINE DEVDAY 2021

More Decks by LINE DEVDAY 2021

Other Decks in Technology

Featured

Transcript

Self Introduction - Joined LINE Corporation as a Data Scientist

Agenda - Data Science & Machine Learning department - A

- DS - Estimate business impact - Perform statistical analysis

Continuous improvement of ML System Plan ML design Check A/B

Continuous improvement in sticker recommendation engine A case of DS

Stickers For You - Personalized recommendation

Stickers For You % purchase (within the top page) 40%

Role of “Stickers For You” is changed Smart Channel -

Role of “Stickers For You” is changed 89% Paid users

Role of “Stickers For You” is changed As-Is - Offering

As-Is Recommendation Model Naive Bayes-based collaborative filtering Business Requirement ›Simple

New Recommendation Model Two-tower model based on Graph Convolution Network

Metrics for Model Assessment Sales via Stickers For You Total

Metrics for Model Assessment Sales via Stickers For You Total

Total Sales Lifted, but … Major package Minor package Total

Total Sales Lifted, but … Major package Minor package Total

What happened? - e.g. Users bought Osaka-dialect sticker → Model

Improved Recommendation Model Simplified two-tower model based on GCN Business

Total Sales Lifted even more with enough serendipity Lift (%)

Total Sales Lifted even more with enough serendipity Lift (%)

Wrap-up

Continuous improvement of ML System Plan ML design Check A/B

Thank you

[1] Inductive Representation Learning on Large Graphs - https://arxiv.org/abs/1706.02216 [2]