Upgrade to Pro — share decks privately, control downloads, hide ads and more …

HOW ML POWERS LINE SERVICES

HOW ML POWERS LINE SERVICES

by Shawn Tsai @ LINE Developer Meetup 13 https://linegroup.kktix.cc/events/20200918

LINE Developers Taiwan

September 18, 2020
Tweet

More Decks by LINE Developers Taiwan

Other Decks in Technology

Transcript

  1. Shawn Tsai LINE Taiwan Data Dev About Me What I

    really enjoy about being part of the Data team is the flexibility you are allowed in your daily work and the cross- functional nature of the projects you can be involved in, which stems from the Data team ultimately being in charge of all things data at LINE.
  2. Obtain Scrub Explore Model Interpret LINE Taiwan Data Dev Team

    Data Engineer Data Analyst Data Scientist
  3. USERS > 21M TODAY > 1M articles/y SHOPPING > 5M

    queries/m OA > 1B interactions/m Data in LINE We Are Facing
  4. LINE客服小幫手 ȡ虻碘㬵რ    ȡ虻碘㬵რ    #POS

    #CNN #Tokenization #Word2Vec #SiameseNet #LSTM #BERT #FastText
  5. Feature Extraction – LSTM-CNN , VHQWHQFHPDWUL[ /670HQFRGHU HPEHGGLQJGLP ORYH P\

    QHZ LSKRQH  3$'! 3$'! 3$'! FRQYROXWLRQDOIHD WXUHPDS HQFRGLQJGLP PD[SRROLQJ Q  Q  Q  DFWLYDWLRQ
  6. Answer Selection – Siamese Network Input A Input B Neural

    Network Architecture Neural Network Architecture Output A Output B + Shared Structure Shared Weights
  7. LINE訊息查證 ȡ虻碘㬵რ    ȡ虻碘㬵რ    #BERT

    #Word2Vec #TFIDF #POS #Tokenization #Classifier #NER
  8. How ML Helps LINE Fact Check Verified Messages: 300+ Similar

    Messages : 35k+ Total Messages : 40k/d Near-Duplication Classification
  9. How Users Are Influenced 33 25% 33% 46% • XVHUVUHFRJQL]HWKHVXVSLFLRXVQHZV

    • XVHUVDFWLYHO\GRWKHIDFWFKHFN • XVHUVZRXOGVKDUHWKHFKHFNHGUHVXOWV ü[WLPHVRIIDFWVFKHFNHG ü0LPSUHVVLRQRIWKHFKHFNHGIDFWV
  10. Customer Segmentation by Conversion and Treatment Churn if not treated

    Churn if treated High Low Low High Sleeping Dog Lost Cause Sure Thing Persuadables
  11. Target Customers Who Are Persuadable Churn if not treated Churn

    if treated High Low Low High Sleeping Dog Lost Cause Sure Thing Persuadables Uplift M odel
  12. Uplift By Declines Decile Persuadables Lost causes/ sure things Sleeping

    dog Reference: Causal Inference and Uplift Modeling A review of the literature
  13. Manual, Inconsistent and Separated Prepare Data • Integrate raw data

    from multiple sources • Difficult to track data used for a model Build Model • Track experiments manually • Hard to reproduce experiments Deploy Model • Tightly coupled deployment options • Different monitoring approach for each framework
  14. Model Training Is The Only One Part Of It Reference:

    Hidden Technical Debt in Machine Learning Systems, NIPS 2015