Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
pycon_delhi_lightening
Search
Sponsored
·
SiteGround - Reliable hosting with speed, security, and support you can count on.
→
Devashish Deshpande
September 24, 2016
Technology
1.6k
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
pycon_delhi_lightening
Lightening talk delivered at PyCon India 2016
Devashish Deshpande
September 24, 2016
Other Decks in Technology
See All in Technology
Kiro Ambassador を目指す話
k_adachi_01
0
110
【Snowflake Summit 2026 Recap!!】Snowflake Summit Deep Dive: Security & Governance
civitaspo
1
270
AIのReact習熟度を測る
uhyo
2
660
Comment regagner la souveraineté de vos données tout en étant payé grâce à Nostr !
rlifchitz
0
110
攻撃者視点で考えるDetection Engineering
cryptopeg
3
2k
徹底討論!ECS vs EKS!
daitak
3
1.1k
FPGAの開発コンペでZephyrを使ってみた
iotengineer22
0
160
Lightning近況報告
kozy4324
0
210
2026TECHFRESH畢業分享會 - 原生還是跨平台? App 開發踩坑實錄
line_developers_tw
PRO
0
1.4k
脆弱性対応、どこで線を引くか
rymiyamoto
1
420
アンオフィシャルな、オフィシャルからのお願い
wyamazak_devrel
0
140
螺旋型キャリアの生存戦略 / kinoko-conf2026
rakus_dev
0
150
Featured
See All Featured
Improving Core Web Vitals using Speculation Rules API
sergeychernyshev
21
1.5k
How to build a perfect <img>
jonoalderson
1
5.7k
Understanding Cognitive Biases in Performance Measurement
bluesmoon
32
2.9k
Typedesign – Prime Four
hannesfritz
42
3.1k
[SF Ruby Conf 2025] Rails X
palkan
2
1.1k
Music & Morning Musume
bryan
47
7.2k
New Earth Scene 8
popppiees
3
2.3k
Dominate Local Search Results - an insider guide to GBP, reviews, and Local SEO
greggifford
PRO
0
200
How STYLIGHT went responsive
nonsquared
100
6.2k
Mozcon NYC 2025: Stop Losing SEO Traffic
samtorres
1
260
The Organizational Zoo: Understanding Human Behavior Agility Through Metaphoric Constructive Conversations (based on the works of Arthur Shelley, Ph.D)
kimpetersen
PRO
0
360
Conquering PDFs: document understanding beyond plain text
inesmontani
PRO
4
2.8k
Transcript
News classification with Gensim Devashish Deshpande Undergraduate student RaRe Technologies
Incubator Program Github: dsquareindia Blogs: https://rare-technologies.com/blog/
Gensim: Topic modeling in python
Problem of News (mis)classification
Screenshots from play newsstand
Topic-word coloring with LDA Image taken from LDA paper by
David Blei
What is a good LDA model? • Come up with
good topics • Infer topic distribution (United topic): mourinho, red_devils, old_trafford, bad_team... (Arsenal topic): wenger, henry, invincibles,.... (City topic): aguero, etihad, england, premier_league (Chelsea topic): blues, football, roman, bridge,... Football LDA model
Evaluating topic models • Manually – Look at the topics.
See if they are interpretable. – Comparing different topic models Qualititative
None
Topic Coherence • Quantitave
Topic Coherence • Assign a number to the human interpretability!
Comparing topic models becomes much easier
Topic Coherence • Better LDA -> Better topics -> Better
classification Topics from topic modeling tutorial on Lee corpus
Join the community! • Pick up issues from: https://github.com/RaRe-Technologies/gensim •
Come for the sprint!