Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Unleashing the Power of NLP : Innovations from LINE Data Dev

Unleashing the Power of NLP : Innovations from LINE Data Dev

Event: 臺北醫學大學企業參訪
Speaker: Danny Lo

LINE Developers Taiwan

March 03, 2023
Tweet

More Decks by LINE Developers Taiwan

Other Decks in Technology

Transcript

  1. Danny Lo Data Dev, TECH FRESH • NTU CSIE •

    Research Assistant @MSLAB • LINE TECH FRESH @LINE Data Dev
  2. Data Dev 的⼯作內容~ Data Dev LINE Family Services LINE SHOPPING

    LINE SPOT LINE MUSIC LINE Sticker LINE VOOM LINE Reward Fact Checker LINE HELP TW LINE Travel NLP Knowledge Graph Uplift Modeling NER Classifier Duplication Detector Auto completion Keyword Extraction Related Search Text Generation User Tagging Data Analytics Recom- mendation CLV LINE TODAY
  3. Data Dev 成員組成 • Build and optimize da ta pipeline

    architectu re • Assemble large, comp lex data sets that mee t requirements Data Engineer Data Analyst Big data infra, SQL, ETL, message queuing • Interpret data, analyz e results using statisti cal techniques • Identify, analyze, and interpret trends or pa tterns in complex dat a sets Statistics, Data Visualiza tion, Business Knowled ge SKILL RESPONSIBILITY • Select appropriate da tasets and data repre sentation methods • Research and implem ent appropriate ML al gorithms Data Scientist Machine learning, deep learning, CV, NLP, Speec h ML Svc Engineer • Build and scale machi ne learning infrastruc ture • Monitor model perfor mance System infrastructure d esign, DevOps
  4. SmartText – 如何建立 ML pipeline? DS DE MLE DA PM

    Biz DS DE DS DS DE DA MLE Data preparation Scaling Performance Model decay Data drift EDA Model build Hyper-parameter tu ning Evaluation Feature Engineering Error analysis MLE MLE MLE DE
  5. Q&A