Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
pycon_delhi_lightening
Search
Devashish Deshpande
September 24, 2016
Technology
1.6k
0
Share
pycon_delhi_lightening
Lightening talk delivered at PyCon India 2016
Devashish Deshpande
September 24, 2016
Other Decks in Technology
See All in Technology
MLOps導入のための組織作りの第一歩
akasan
0
330
QGISプラグイン CMChangeDetector
naokimuroki
1
400
AWS DevOps Agentはチームメイトになれるのか?/ Can AWS DevOps Agent become a teammate
kinunori
6
740
ハーネスエンジニアリングの概要と設計思想
sergicalsix
9
5k
昔はシンプルだった_AmazonS3
kawaji_scratch
0
330
AI時代のガードレールとしてのAPIガバナンス
nagix
0
280
Master Dataグループ紹介資料
sansan33
PRO
1
4.6k
クラウドネイティブな開発 ~ 認知負荷に立ち向かうためのコンテナ活用
literalice
0
130
AI와 협업하는 조직으로의 여정
arawn
0
420
自立を加速させる神器 - EMOasis #11
stanby_inc
0
140
コードや知識を組み込む / Incorporate Code and Knowledge
ks91
PRO
0
150
Chasing Real-Time Observability for CRuby
whitegreen
0
120
Featured
See All Featured
WCS-LA-2024
lcolladotor
0
540
Code Reviewing Like a Champion
maltzj
528
40k
We Are The Robots
honzajavorek
0
220
Gemini Prompt Engineering: Practical Techniques for Tangible AI Outcomes
mfonobong
2
370
Scaling GitHub
holman
464
140k
Public Speaking Without Barfing On Your Shoes - THAT 2023
reverentgeek
1
370
How to build an LLM SEO readiness audit: a practical framework
nmsamuel
1
710
Future Trends and Review - Lecture 12 - Web Technologies (1019888BNR)
signer
PRO
0
3.5k
Mind Mapping
helmedeiros
PRO
1
150
Build your cross-platform service in a week with App Engine
jlugia
234
18k
Faster Mobile Websites
deanohume
310
31k
Utilizing Notion as your number one productivity tool
mfonobong
4
290
Transcript
News classification with Gensim Devashish Deshpande Undergraduate student RaRe Technologies
Incubator Program Github: dsquareindia Blogs: https://rare-technologies.com/blog/
Gensim: Topic modeling in python
Problem of News (mis)classification
Screenshots from play newsstand
Topic-word coloring with LDA Image taken from LDA paper by
David Blei
What is a good LDA model? • Come up with
good topics • Infer topic distribution (United topic): mourinho, red_devils, old_trafford, bad_team... (Arsenal topic): wenger, henry, invincibles,.... (City topic): aguero, etihad, england, premier_league (Chelsea topic): blues, football, roman, bridge,... Football LDA model
Evaluating topic models • Manually – Look at the topics.
See if they are interpretable. – Comparing different topic models Qualititative
None
Topic Coherence • Quantitave
Topic Coherence • Assign a number to the human interpretability!
Comparing topic models becomes much easier
Topic Coherence • Better LDA -> Better topics -> Better
classification Topics from topic modeling tutorial on Lee corpus
Join the community! • Pick up issues from: https://github.com/RaRe-Technologies/gensim •
Come for the sprint!