Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
pycon_delhi_lightening
Search
Devashish Deshpande
September 24, 2016
Technology
0
1.5k
pycon_delhi_lightening
Lightening talk delivered at PyCon India 2016
Devashish Deshpande
September 24, 2016
Tweet
Share
Other Decks in Technology
See All in Technology
なぜ あなたはそんなに re:Invent に行くのか?
miu_crescent
PRO
0
220
2025年のデザインシステムとAI 活用を振り返る
leveragestech
0
360
Cloud WAN MCP Serverから考える新しいネットワーク運用 / 20251228 Masaki Okuda
shift_evolve
PRO
0
110
ESXi のAIOps だ!2025冬
unnowataru
0
400
アプリにAIを正しく組み込むための アーキテクチャ── 国産LLMの現実と実践
kohju
0
240
AgentCore BrowserとClaude Codeスキルを活用した 『初手AI』を実現する業務自動化AIエージェント基盤
ruzia
7
1.7k
Claude Codeを使った情報整理術
knishioka
13
9.8k
[Neurogica] 採用ポジション/ Recruitment Position
neurogica
1
130
ハッカソンから社内プロダクトへ AIエージェント ko☆shi 開発で学んだ4つの重要要素
leveragestech
0
270
業務の煩悩を祓うAI活用術108選 / AI 108 Usages
smartbank
9
15k
Snowflake Industry Days 2025 Nowcast
takumimukaiyama
0
130
AWS re:Invent 2025~初参加の成果と学び~
kubomasataka
1
200
Featured
See All Featured
Building Adaptive Systems
keathley
44
2.9k
The untapped power of vector embeddings
frankvandijk
1
1.5k
Game over? The fight for quality and originality in the time of robots
wayneb77
1
67
Git: the NoSQL Database
bkeepers
PRO
432
66k
Efficient Content Optimization with Google Search Console & Apps Script
katarinadahlin
PRO
0
260
Building Experiences: Design Systems, User Experience, and Full Site Editing
marktimemedia
0
340
Connecting the Dots Between Site Speed, User Experience & Your Business [WebExpo 2025]
tammyeverts
10
760
Designing Powerful Visuals for Engaging Learning
tmiket
0
190
Practical Orchestrator
shlominoach
190
11k
A Modern Web Designer's Workflow
chriscoyier
698
190k
Visual Storytelling: How to be a Superhuman Communicator
reverentgeek
2
400
Darren the Foodie - Storyboard
khoart
PRO
0
2k
Transcript
News classification with Gensim Devashish Deshpande Undergraduate student RaRe Technologies
Incubator Program Github: dsquareindia Blogs: https://rare-technologies.com/blog/
Gensim: Topic modeling in python
Problem of News (mis)classification
Screenshots from play newsstand
Topic-word coloring with LDA Image taken from LDA paper by
David Blei
What is a good LDA model? • Come up with
good topics • Infer topic distribution (United topic): mourinho, red_devils, old_trafford, bad_team... (Arsenal topic): wenger, henry, invincibles,.... (City topic): aguero, etihad, england, premier_league (Chelsea topic): blues, football, roman, bridge,... Football LDA model
Evaluating topic models • Manually – Look at the topics.
See if they are interpretable. – Comparing different topic models Qualititative
None
Topic Coherence • Quantitave
Topic Coherence • Assign a number to the human interpretability!
Comparing topic models becomes much easier
Topic Coherence • Better LDA -> Better topics -> Better
classification Topics from topic modeling tutorial on Lee corpus
Join the community! • Pick up issues from: https://github.com/RaRe-Technologies/gensim •
Come for the sprint!