Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Brickmaster: Scenario-Wise Recommender Engine @...

Brickmaster: Scenario-Wise Recommender Engine @ TECHPULSE 2023

- Speaker: Vila Lin
- Event: http://techpulse.line.me/

Brickmaster 是 LINE 購物於 2022 年推出的情境感知深度學習推薦系統。本系統將解決傳統推薦系統常遇到的三個問題:推薦商品過時、無法跟上使用者喜好以及與組織 KPI 脫鉤。系統架構也提供可再用性與知識共享,將大幅減少所需開發人力與時間。

LINE Developers Taiwan

February 21, 2023
Tweet

More Decks by LINE Developers Taiwan

Other Decks in Technology

Transcript

  1. 1

  2. Agenda › Introduction › What is Scenario › Issue ›

    Goal › Brickmaster › Architecture › Flow
  3. Issue 1: Items become out of date Popularity Time Item

    still be recommended ! Introduction Growth Mature Decline
  4. Issue Issue 1 Outdated Easily Item Issue 2 Lack of

    Recency User Intention Issue 3 Ambiguous Reasoning Business
  5. Issue 3: Hard to measure potential impacts OA Web APP

    Search View Click Clickout Order CTR GMV Engagement Action Performance Channel ?
  6. Tech Stack Computing - PySpark - PyTorch 
 (PyTorch Lightning)

    - Modin + Ray Storage - HDFS - Hive - Redis Serving - FastAPI - BentoML - Dash Scheduler - Airflow ML lifecycle - MLflow
  7. Architecture Candidate Generation Ranking Feature Generation User 
 Engagement Recommendations

    User 
 Web/App
 Log User
 Profile Context Item
 Profile Serving Brickmaster
  8. Architecture Candidate Generation Ranking Feature Generation User 
 Engagement Recommendations

    User 
 Web/App
 Log User
 Profile Context Item
 Profile Serving Brickmaster Candidate Generation Ranking Feature Generation Serving
  9. Flow of Brickmaster Feature
 Generation Billions
 of Features Thousands
 of

    Items Hundreds 
 of Items Candidate
 Generation Ranking Serving
  10. Flow of Brickmaster 19 Feature
 Generation Candidate
 Generation Ranking Serving

    Recall Precision Billions
 of Features Thousands
 of Items Hundreds 
 of Items
  11. Flow of Brickmaster Feature
 Generation Candidate
 Generation Ranking Serving Recall

    Precision Billions
 of Features Thousands
 of Items Hundreds 
 of Items
  12. Recall Precision Scalability and Flexibility Feature
 Generation Candidate
 Generation Ranking

    Serving Enlarge User Population Rankers for BIZ Idea Billions
 of Features Thousands
 of Items Hundreds 
 of Items Scenario
 Feature
  13. Recall Precision Scalability and Flexibility Feature
 Generation Candidate
 Generation Ranking

    Serving Enlarge User Population Rankers for BIZ Idea Billions
 of Features Thousands
 of Items Hundreds 
 of Items Scenario
 Feature
  14. Feature Generation User, Product, Context and Scenario User Pro fi

    le › Age › Area › Gender User Browsing › Click › Like › Search › UTM Product Pro fi le › Age › Gender › Text › Shop › Category Product Price › Mean › Median › IQR › Price change Date Time › Festival event › Next EC event › Holiday Scenario › Clickout › Impression › Last clickout time › Last view time
  15. Feature Generation User, Product, Context and Scenario User Pro fi

    le › Age › Area › Gender User Browsing › Click › Like › Search › UTM Product Pro fi le › Age › Gender › Text › Shop › Category Product Price › Mean › Median › IQR › Price change Date Time › Festival event › Next EC event › Holiday Scenario › Clickout › Impression › Last clickout time › Last view time Product Price › Mean › Median › IQR › Price change
  16. Feature Generation User, Product, Context and Scenario User Pro fi

    le › Age › Area › Gender User Browsing › Click › Like › Search › UTM Product Pro fi le › Age › Gender › Text › Shop › Category Product Price › Mean › Median › IQR › Price change Date Time › Festival event › Next EC event › Holiday Scenario › Clickout › Impression › Last clickout time › Last view time Product Price › Mean › Median › IQR › Price change
  17. Feature Generation User, Product, Context and Scenario User Pro fi

    le › Age › Area › Gender User Browsing › Click › Like › Search › UTM Product Pro fi le › Age › Gender › Text › Shop › Category Product Price › Mean › Median › IQR › Price change Date Time › Festival event › Next EC event › Holiday Scenario › Clickout › Impression › Last clickout time › Last view time Product Price › Mean › Median › IQR › Price change
  18. Feature Generation User, Product, Context and Scenario User Pro fi

    le › Age › Area › Gender User Browsing › Click › Like › Search › UTM Product Pro fi le › Age › Gender › Text › Shop › Category Product Price › Mean › Median › IQR › Price change Date Time › Festival event › Next EC event › Holiday Scenario › Clickout › Impression › Last clickout time › Last view time Product Price › Mean › Median › IQR › Price change
  19. Feature Generation User, Product, Context and Scenario User Pro fi

    le › Age › Area › Gender User Browsing › Click › Like › Search › UTM Product Pro fi le › Age › Gender › Text › Shop › Category Product Price › Mean › Median › IQR › Price change Date Time › Festival event › Next EC event › Holiday Scenario › Clickout › Impression › Last clickout time › Last view time Product Price › Mean › Median › IQR › Price change
  20. Feature Generation Product Features Product Token, Embedding NLP-based
 Classifier Model-based

    Classifier Product
 Name Product
 Text
 Feature Product
 Gender
 Feature Product
 Text
 Feature
  21. Feature Generation Product Features Product Token, Embedding NLP-based
 Classifier Model-based

    Classifier Product
 Name Product
 Text
 Feature Product
 Gender
 Feature Product
 Text
 Feature
  22. Candidate Generation Feature Engineering User
 Product
 Feature Training Data Generation

    Processed
 Dataset Modeling
 Dataset DNN Model Training Candidate
 Generator 
 Matcher
 (LSH) Model Inference User & Product Id Mapping User
 Product
 Candidate User & Product Embedding Imputation 
 Transformation Scaling Negative Sampling
  23. Candidate Generation Feature Engineering User
 Product
 Feature Training Data Generation

    Processed
 Dataset Modeling
 Dataset DNN Model Training Candidate
 Generator 
 Matcher
 (LSH) Model Inference User & Product Id Mapping User
 Product
 Candidate User & Product Embedding Imputation 
 Transformation Scaling Negative Sampling
  24. Candidate Generation Feature Engineering User
 Product
 Feature Training Data Generation

    Processed
 Dataset Modeling
 Dataset DNN Model Training Candidate
 Generator 
 Matcher
 (LSH) Model Inference User & Product Id Mapping User
 Product
 Candidate User & Product Embedding Imputation 
 Transformation Scaling Negative Sampling
  25. Candidate Generation Feature Engineering User
 Product
 Feature Training Data Generation

    Processed
 Dataset Modeling
 Dataset DNN Model Training Candidate
 Generator 
 Matcher
 (LSH) Model Inference User & Product Id Mapping User
 Product
 Candidate User & Product Embedding Imputation 
 Transformation Scaling Negative Sampling
  26. Candidate Generation Feature Engineering User
 Product
 Feature Training Data Generation

    Processed
 Dataset Modeling
 Dataset DNN Model Training Candidate
 Generator 
 Matcher
 (LSH) Model Inference User & Product Id Mapping User
 Product
 Candidate User & Product Embedding Imputation 
 Transformation Scaling Negative Sampling
  27. Ranking Feature Engineering User
 Product
 Candidate Training Data Generation Processed


    Dataset Modeling
 Dataset Scenario A
 Feature DNN Model Training Ranker A Model Inference Personalized
 Ranking A Imputation 
 Transformation Scaling Negative Sampling
  28. Ranking Feature Engineering User
 Product
 Candidate Training Data Generation Processed


    Dataset Modeling
 Dataset Scenario A
 Feature DNN Model Training Ranker A Model Inference Personalized
 Ranking A Imputation 
 Transformation Scaling Negative Sampling
  29. Ranking Feature Engineering User
 Product
 Candidate Training Data Generation Processed


    Dataset Modeling
 Dataset Scenario A
 Feature DNN Model Training Ranker A Model Inference Personalized
 Ranking A Imputation 
 Transformation Scaling Negative Sampling
  30. Health Checking Original DataSet Transformed
 Dataset Check DAG Success Check

    Data Splitting Training Data Generation Feature Engineering Imputation Outlier Detection Noti fi cation
  31. Health Checking Original DataSet Transformed
 Dataset Check DAG Success Check

    Data Splitting Training Data Generation Feature Engineering Imputation Outlier Detection Noti fi cation Pipeline
 Level
  32. Health Checking Original DataSet Transformed
 Dataset Check DAG Success Check

    Data Splitting Training Data Generation Feature Engineering Imputation Outlier Detection Noti fi cation Data
 Level
  33. Personalized Recommendation › Match business goal (e.g. clickout, GMV) ›

    We use proximal concept to solve business problems Business Goal Science Clickout User
 Engagement Min(Next Clickout Time) Max(Session Duration)
  34. Personalized Recommendation › Match business goal (e.g. clickout, GMV) ›

    We use proximal concept to solve business problems Business Goal Science Clickout User
 Engagement Min(Next Clickout Time) Max(Session Duration)
  35. Trending Inspire user to 
 fi nd out potential needs

    Popularity Introduction Growth Mature Decline
  36. Recap Design of Brickmaster Business Target Stage User Preference KPI

    -Oriented Objective 1st stage - Candidate Generation 2nd stage - Scenario Rankers
  37. Recap Design of Brickmaster Business Target Stage User Preference KPI

    -Oriented Objective 1st stage - Candidate Generation 2nd stage - Scenario Rankers