Slide 1

Slide 1 text

From No. 1001 to No. 333

Slide 2

Slide 2 text

Shandy
• National Yang Ming Chiao Tung University (陽明交通大學), 百川學士學位學程 bachelor's degree program, Class of 111
• Data Scientist @LINE Taiwan
• TECH FRESH @LINE Taiwan
• Consulting Service Intern @Microsoft Taiwan
• Research Assistant @NYCU CIRDA
• Software Engineer Intern @ITRI Yu

Slide 3

Slide 3 text

Like most people... we all once sat in the audience!
Source: LINE Engineering Blog

Slide 4

Slide 4 text

What does Data Dev do?
LINE Family Services: LINE TODAY, LINE SHOPPING, LINE SPOT, LINE INVOICE, LINE STICKER, LINE VOOM, LINE TRAVEL
Data Dev: NLP, Generative AI, Data Analytics
Capabilities: NER, Classifier, Duplication Detector, Auto completion, Keyword Extraction, Related Search, Text Generation, User Tagging, Uplift Modeling, Recommendation, CLV
Source: Penny LINE OA

Slide 5

Slide 5 text

NLPaaS: SmartText
SmartText 1.0 (Automatic NLP)
• Classifier
• Multi-label Classifier
• Topic Detection
SmartText 2.0 (Generative NLP)
• Summarization
• Paraphrasing
• Question-Answering
Beyond NLP
• Image Search
• Image Generation
• Audio Interaction

Slide 6

Slide 6 text

What does Data Dev do?
LINE Family Services: LINE TODAY, LINE SHOPPING, LINE SPOT, LINE INVOICE, LINE STICKER, LINE VOOM, LINE TRAVEL
Data Dev: NLP, Generative AI, Data Analytics
Capabilities: NER, Classifier, Duplication Detector, Auto completion, Keyword Extraction, Related Search, Text Generation, User Tagging, Uplift Modeling, Recommendation, CLV
Source: Penny LINE OA

Slide 7

Slide 7 text

Common organizational structure

Data Engineer (Pipeline)
RESPONSIBILITY
• Build and optimize data pipeline architecture
• Assemble large, complex data sets that meet requirements
SKILL: Big data infra, SQL, ETL, message queuing

Data Analyst (Biz)
RESPONSIBILITY
• Interpret data, analyze results using statistical techniques
• Identify, analyze, and interpret trends or patterns in complex data sets
SKILL: Statistics, Data Visualization, Business Knowledge

Data Scientist (Model)
RESPONSIBILITY
• Select appropriate datasets and data representation methods
• Research and implement appropriate ML algorithms
SKILL: Machine learning, deep learning, CV, NLP, Speech

ML Engineer (Service)
RESPONSIBILITY
• Build and scale machine learning infrastructure
• Monitor model performance
SKILL: System infrastructure design, DevOps

Slide 8

Slide 8 text

As a Data Scientist @Data Dev ???

Slide 9

Slide 9 text

As a Data Scientist @Data Dev
Data Scientist
• Build prediction models
• Study cutting-edge technologies
Chart labels: 30%
Source: Tomaz Bratanic (2021 Jul 20). Turn a Harry Potter Book into a Knowledge Graph. https://neo4j.com/developer-blog/turn-a-harry-potter-book-into-a-knowledge-graph/

Slide 10

Slide 10 text

As a Data Scientist @Data Dev
Data Analyst
• Discover the data
• Design BI platform
Chart labels: 30%, 30%

Slide 11

Slide 11 text

As a Data Scientist @Data Dev
Data Engineer
• Clean the data
• Design the ETL
• Design the automatic pipeline
Chart labels: 30%, 30%, 20%
Diagram labels: Extract, Load, Transform, Source DB, Data Warehouse
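
To make the "Clean the data" and "Design the ETL" bullets concrete, here is a minimal sketch of an extract, transform, load flow from a source DB into a data warehouse. It is not LINE's actual pipeline: the table names (`raw_posts`, `clean_posts`), the columns, the cleaning rules, and the use of in-memory SQLite for both databases are all assumptions made for illustration.

```python
import sqlite3


def make_demo_source():
    """Create a hypothetical in-memory source DB with messy hashtag rows."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE raw_posts (user_id INTEGER, hashtag TEXT)")
    conn.executemany(
        "INSERT INTO raw_posts VALUES (?, ?)",
        [(1, "#Food "), (1, "food"), (2, None), (3, " #Travel")],
    )
    conn.commit()
    return conn


def extract(source):
    """Extract: read the raw rows from the source DB."""
    return source.execute("SELECT user_id, hashtag FROM raw_posts").fetchall()


def transform(rows):
    """Transform: drop missing tags, normalize text, remove duplicates."""
    cleaned = set()
    for user_id, hashtag in rows:
        if hashtag is None:
            continue
        tag = hashtag.strip().lstrip("#").lower()
        if tag:
            cleaned.add((user_id, tag))
    return sorted(cleaned)


def load(warehouse, rows):
    """Load: write the cleaned rows into the warehouse table."""
    warehouse.execute(
        "CREATE TABLE IF NOT EXISTS clean_posts (user_id INTEGER, hashtag TEXT)"
    )
    warehouse.executemany("INSERT INTO clean_posts VALUES (?, ?)", rows)
    warehouse.commit()


def run_etl(source, warehouse):
    """Run the three steps in order."""
    load(warehouse, transform(extract(source)))
```

Keeping each step as its own function lets the cleaning rules be tested without a database, and makes it straightforward to hand the steps to a scheduler later for the "automatic pipeline" bullet.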

Slide 12

Slide 12 text

As a Data Scientist @Data Dev
Others
• Requirements Discussion
• Internal/External Speech
• Training
Chart labels: 30%, 30%, 20%, 10%

Slide 13

Slide 13 text

VOOM Hashtag Analysis

Slide 14

Slide 14 text

VOOM Hashtag Analysis

Slide 15

Slide 15 text

Dependency Between Pipelines (AS-IS)
Pipelines: Post, Hashtag, Creator, User Behavior
Diagram label: delay

Slide 16

Slide 16 text

Dependency Between Pipelines (TO-BE)
Diagram labels: IU, User Behavior Performance, Creator Performance, Post Performance, Hashtag Performance
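
One way to reason about pipeline dependencies like those on the AS-IS and TO-BE slides is to write them down as a graph and derive the run order from it. The sketch below uses Python's standard `graphlib` module (Python 3.9+). The slides do not state the actual edges between pipelines, so both dependency maps, and the `shared_source` node, are hypothetical: one shows a chained layout where each pipeline waits for the previous one, the other a layout where every pipeline reads from one shared upstream.

```python
from graphlib import TopologicalSorter

# Hypothetical maps of {pipeline: pipelines it must wait for}.
# These edges are illustrative assumptions, not taken from the slides.
chained = {
    "hashtag": {"post"},
    "creator": {"hashtag"},
    "user_behavior": {"creator"},
}

fan_out = {
    "post": {"shared_source"},
    "hashtag": {"shared_source"},
    "creator": {"shared_source"},
    "user_behavior": {"shared_source"},
}


def run_stages(deps):
    """Group pipelines into stages; pipelines in one stage can run in parallel."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()  # raises graphlib.CycleError if the graph has a cycle
    stages = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())
        stages.append(ready)
        sorter.done(*ready)
    return stages
```

Under these assumed edges, the chained layout needs four sequential stages, so a delay in an early pipeline pushes back everything after it, while the fan-out layout needs only two stages.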

Slide 17

Slide 17 text

Hackathon
• Cross-functional Team Work
• Time Management
• User Rule

Slide 18

Slide 18 text

Learning Materials
• Study Group
• Online Learning Platform
• Workshop

Slide 19

Slide 19 text

Life in LINE is not only work
