Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
pycon_delhi_lightening
Search
Devashish Deshpande
September 24, 2016
Technology
0
1.4k
pycon_delhi_lightening
Lightening talk delivered at PyCon India 2016
Devashish Deshpande
September 24, 2016
Tweet
Share
Other Decks in Technology
See All in Technology
LLVM/ASMを使った有限体の高速実装
herumi
0
120
突撃! 隣のAmazon Bedrockユーザー 〜YouはどうしてAWSで?〜
minorun365
PRO
3
380
Analytics-Backed App Widget Development - Served with Jetpack Glance
miyabigouji
0
550
エンジニア視点で見る、 組織で運用されるデザインシステムにするには
shunya078
1
300
チームビルディングは"感性"で向き合おう / Team Building with Awareness
kohzas
0
210
Oracle Autonomous Database:サービス概要のご紹介
oracle4engineer
PRO
1
7k
スタッフエンジニアの道: The Staff Engineer’s Path
snoozer05
PRO
44
14k
ロリポップ! for Gamersを支えるインフラ/lolipop for gamers infrastructure
takumakume
0
130
ネットワークだけ隔離されたコンテナ作成デモ / Kichijoji.pm36
tenforward
1
190
サーバー管理しないサーバーサービスManaged DevOps Pool
kkamegawa
0
130
SORACOMで実現するIoTのマルチクラウド対応 - IoTでのクリーンアーキテクチャの実現 -
kenichirokimura
0
380
AI活用したくてもできなかった不動産SaaSの今とこれから
nealle
0
330
Featured
See All Featured
Teambox: Starting and Learning
jrom
131
8.7k
Typedesign – Prime Four
hannesfritz
39
2.3k
Done Done
chrislema
180
16k
Product Roadmaps are Hard
iamctodd
PRO
48
10k
Build your cross-platform service in a week with App Engine
jlugia
228
18k
Unsuck your backbone
ammeep
667
57k
How GitHub (no longer) Works
holman
310
140k
A better future with KSS
kneath
235
17k
Practical Tips for Bootstrapping Information Extraction Pipelines
honnibal
PRO
5
480
Fantastic passwords and where to find them - at NoRuKo
philnash
48
2.8k
The MySQL Ecosystem @ GitHub 2015
samlambert
250
12k
Exploring the Power of Turbo Streams & Action Cable | RailsConf2023
kevinliebholz
25
3.9k
Transcript
News classification with Gensim Devashish Deshpande Undergraduate student RaRe Technologies
Incubator Program Github: dsquareindia Blogs: https://rare-technologies.com/blog/
Gensim: Topic modeling in python
Problem of News (mis)classification
Screenshots from play newsstand
Topic-word coloring with LDA Image taken from LDA paper by
David Blei
What is a good LDA model? • Come up with
good topics • Infer topic distribution (United topic): mourinho, red_devils, old_trafford, bad_team... (Arsenal topic): wenger, henry, invincibles,.... (City topic): aguero, etihad, england, premier_league (Chelsea topic): blues, football, roman, bridge,... Football LDA model
Evaluating topic models • Manually – Look at the topics.
See if they are interpretable. – Comparing different topic models Qualititative
None
Topic Coherence • Quantitave
Topic Coherence • Assign a number to the human interpretability!
Comparing topic models becomes much easier
Topic Coherence • Better LDA -> Better topics -> Better
classification Topics from topic modeling tutorial on Lee corpus
Join the community! • Pick up issues from: https://github.com/RaRe-Technologies/gensim •
Come for the sprint!