Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
網路爬蟲與文字探勘工作坊
Search
tlyu0419
November 15, 2021
Technology
550
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
網路爬蟲與文字探勘工作坊
tlyu0419
November 15, 2021
More Decks by tlyu0419
See All by tlyu0419
網路爬蟲與文字探勘 證券公司 App 評論分析的資料科學旅程
tlyu0419
0
130
網頁爬蟲技術於人力資源管理的應用
tlyu0419
0
360
Topic Modeling with Python: What do Customers Care about Digital Banking Apps?
tlyu0419
0
230
資料血緣: 營運機器/深度學習模型的秘密武器
tlyu0419
0
380
Mastering Feature Engineering: Mining the Hidden Salary Formula with CakeResume
tlyu0419
0
400
Spark_Task_Optimization_Journey_How_I_Increased_10x_Speed_by_Performance_Tuning
tlyu0419
0
400
Why we want to become PyCon TW volunteers
tlyu0419
0
230
Regular expression in Python - From zero to hero
tlyu0419
0
310
資料視覺化工作坊
tlyu0419
0
270
Other Decks in Technology
See All in Technology
アンオフィシャルな、オフィシャルからのお願い
wyamazak_devrel
0
110
2026 TECHFRESH 畢業分享會 - 開發日常大解密!從領域驅動到企業級上線
line_developers_tw
PRO
0
1k
Chainlitで作るお手軽チャットUI
ynt0485
0
240
Bucharest Tech Week 2026 - Reinventing testing practices in the AI era
edeandrea
PRO
1
160
FDE という解 ― 暗黙知と明示知をつなぐ、伴走型エンジニアリング ―
otanet
0
160
白金鉱業Meetup_Vol.24_「AIエージェントは分けるほど良い」は本当か? / Is it true that “the more you divide AI agents, the better”?
brainpadpr
1
370
連合学習と機密コンピューティング
lycorptech_jp
PRO
0
120
2026 TECHFRESH 畢業分享會 - AI-Native 重塑軟體工程與虛擬講師
line_developers_tw
PRO
0
1k
SONiCのLinuxベースを活かしたZabbix監視
sonic
0
160
中期計画、2回作ってみた ~業務委託と正社員、両方の視点から~
demaecan
1
750
RSA暗号を手計算したくなること、ありますよね?? (20260615_orestudy6_rsa)
thousanda
0
410
やさしいA2A入門
minorun365
PRO
12
1.9k
Featured
See All Featured
AI in Enterprises - Java and Open Source to the Rescue
ivargrimstad
0
1.3k
svc-hook: hooking system calls on ARM64 by binary rewriting
retrage
2
300
It's Worth the Effort
3n
188
29k
Exploring anti-patterns in Rails
aemeredith
3
410
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
141
35k
The browser strikes back
jonoalderson
0
1.2k
The Invisible Side of Design
smashingmag
302
52k
Docker and Python
trallard
47
3.9k
How GitHub (no longer) Works
holman
316
150k
Accessibility Awareness
sabderemane
1
140
Bash Introduction
62gerente
615
220k
Art, The Web, and Tiny UX
lynnandtonic
304
22k
Transcript
None
None
None
None
None
None
None
None
None
• • • • •
• • • • • •
• • • •
None
• • • • • • • • •
None
None
Ans:
◼
◼
None
None
None
None
◼ ◼ ◼ ◼
None
None
None
• ➢ ➢ ➢ ➢ ➢
None
CONTENTS
None
None
None
None
Ref: LDA - How to grid search best topic models?
None
None
None
None
None
None
None
None
SOURCE_NAME SOURCE TARGET_NAME TARGET TIME TEXT Linda**** 1795**** **** 1000****
2020-01-01 19:53 **** 1000**** Tsai Ing-wen 4625**** 2019-11-19 15:24 ... **** 1000**** Tsai Ing-wen 4625**** 2019-11-13 20:37 Hsu**** 1000**** Tsai Ing-wen 4625**** 2019-11-19 18:59 Ingwen**** 1000**** Tsai Ing-wen 4625**** 2019-11-30 05:31 ... Faithé**** 1000**** Faithé**** 1000**** 2020-01-01 22:00 ... ... ... ... ... ...
None
None
None
• • •
None
None
• •
None
None
Study: Twitter Sentiment Mirrored Facebook’s Stock Price Today
CONTENTS
None
• •
None
None
None
None
None
None
• •
•
•
• •
• • •
• • • • • • •
• • • • • • •
None
None
None
• • • • • •
None
None
None
None
None
None
None
None
None
None
None
None
None
• •
• • • • • •
None
None
None
None
• • • • • • • • • •
None
None
None
None
None
None
None
None
(?
! ? ( 0.11 -> 0.28)
None
• • • • • • • • • •
What’s the next?
CONTENTS
None
•
None
None
None
None
• • • • • • • • • •
• • >“<
None
None
•
None
None
None
• >”<
None
None
None
None
沒錢、沒人的話可以「借」別人的模型 借完再拿來當目標變數 Train 自己的模型XD 解釋力最強,但需要花時間溝通>”< 小心別用壞別人的網站 有點吃翻譯的效度XD
None
0
What’s the next?
None
None
None
• • • • • • • • • •
• • >“<
None
None
None
/ ( ) ( )
None
None
None
CONTENTS
None
None
/
• • • • • • • • • •
• • • • •
None