How Machine Learning helps LINE Fact Checker

How Machine Learning helps LINE Fact Checker

BY Jim Horng @LINE TECHPULSE 2019 https://techpulse.line.me/

2102a6b8760bd6f57f672805723dd83a?s=128

line_developers_tw

December 04, 2019
Tweet

Transcript

  1. None
  2. > Jim Horng / LINE TODAY | Data TF How

    Machine Learning helps LINE Fact Checker
  3. Agenda > The Need for Fact Checker > Overview of

    ML Components > Overview of ML System > Challenges and Future Work
  4. ※Source from Digital Society Project Severity of Fake Messages

  5. Fact Checker OA, Dashboard OA - Query OA - Report

    Dashboard
  6. Agenda > The Need for Fact Checker > Overview of

    ML Components > Overview of ML System > Challenges and Future Work
  7. AI Knows Everything?

  8. How ML Helps?

  9. How ML Helps? Similar Messages Total Messages Verified Messages ML

    Near-Duplication ML Classification
  10. Near-Duplication - Use Cases Verified Fake Message: "footage on Captain's

    Instagram Stories showed them wearing wedding rings on their both hands, which proves Captain America and Captain Marvel get married in Las Vegas" Query Result Type Captain America and Captain Marvel get married in Las Vegas True Partial The wedding in Las Vegas is hosted by Captain America and Captain Marvel couple True Semantically Ironman and Black Widow get married in Los Angeles False Syntactically
  11. Near-Duplication - Flow > Long Text ➔ Full Match •

    performs faster and trustworthy > Short Text ➔ Partial Match + Fuzzy Tolerance • 20% user query are partial texts of original messages
 ( E.g. sentence, topic of an article )
  12. Full Match - BERT Based ※Source from https://towardsdatascience.com/bert-explained-state-of-the-art-language-model-for-nlp- f8b21a9b6270 >

    Has Chinese pre-trained model > Can extract sentence vector from
 Upstream > Can capture semantics based on Context
  13. Topic probability: 
 traffic: 3.7%, life: 38%, art: 49%, health:

    2.54%, others: 1.6%, sport: 4.3%, education: 0.7%, law: 0.16% "They were in Vegas for the BillBoard Music Awards but, a few hours later, footage on Captain's Instagram Stories showed them wearing wedding rings on their both hands, which proves Captain America and Captain Marvel get married in Las Vegas " Message Classification
  14. Message Classification > BERT + NN Layer > Fine-Tuned (Accuracy)

    vs. 
 Pre-Trained (Speed)
  15. Agenda > The Need for Fact Checker > Overview of

    ML Components > Overview of ML System > Challenges and Future Work
  16. ML System Under the Hood Serving Training Data Ingestion Async

    Execution Model Management Model Deployment Index-based Vector Search (ANN) Scheduling, Orchestration
  17. Agenda > The Need for Fact Checker > Overview of

    ML Components > Overview of ML System > Challenges and Future Work
  18. Similarity 65% 46% 82% Identify Message With Image > VGG16

    (ConvNet Configuration D) > Convolution network extracts image features, Able to capture objects and shape.
  19. Duplicated Reported Messages Training Store as New If New, by

    Near- Duplication Report Message Check By Cache / Search Engine Store As Cache / Search Engine House Keeping: Find Duplicates By Near- Duplication and Merge Training Store as New If New, by Near- Duplication Report Message
  20. Knowledge Graph Model > Knowledge Graph Triples > Inference >

    Fusion with other models
  21. Thank You