$30 off During Our Annual Pro Sale. View Details »
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
OpenTalks.AI - Алексей Бурнаков, Тематическое м...
Search
OpenTalks.AI
February 21, 2020
Science
0
2.1k
OpenTalks.AI - Алексей Бурнаков, Тематическое моделирование новостей на основе детекции цитирований
OpenTalks.AI
February 21, 2020
Tweet
Share
More Decks by OpenTalks.AI
See All by OpenTalks.AI
OpenTalks.AI - Виктор Лемпицкий, Моделирование 3Д сцен: новые подходы в 2020 году
opentalks
0
490
OpenTalks.AI - Алексей Чернявский, Нейросетевые алгоритмы для повышения качества медицинских изображений
opentalks
0
430
OpenTalks.AI - Александр Громов, Устойчивость нейросетевых моделей при анализе КТ/НДКТ-исследований
opentalks
0
370
OpenTalks.AI - Денис Тимонин, Megatron-LM: Обучение мультимиллиардных LMs при помощи техники Model Parallelism
opentalks
0
510
OpenTalks.AI - Егор Филимонов, Возможности платформы Huawei Atlas и эффективный гетерогенный инференс.
opentalks
0
150
OpenTalks.AI - Александр Прозоров, Референсная архитектура робота сервисного центра в отраслях с изменчивыми бизнес-процессами
opentalks
0
380
OpenTalks.AI - Наталья Лукашевич, Анализ тональности по отношению к компании — с чем не справился BERT
opentalks
0
340
OpenTalks.AI - Константин Воронцов, Фейковые новости и другие типы потенциально опасного дискурса: типология, подходы, датасеты, соревнования
opentalks
0
440
OpenTalks.AI - Дмитрий Ветров, Фрактальность функции потерь, эффект двойного спуска и степенные законы в глубинном обучении - фрагменты одной мозаики
opentalks
0
470
Other Decks in Science
See All in Science
Accelerating operator Sinkhorn iteration with overrelaxation
tasusu
0
140
DMMにおけるABテスト検証設計の工夫
xc6da
1
1.4k
Cross-Media Technologies, Information Science and Human-Information Interaction
signer
PRO
3
31k
データマイニング - ノードの中心性
trycycle
PRO
0
320
academist Prize 4期生 研究トーク延長戦!「美は世界を救う」っていうけど、どうやって?
jimpe_hitsuwari
0
460
Algorithmic Aspects of Quiver Representations
tasusu
0
130
PPIのみを用いたAIによる薬剤–遺伝子–疾患 相互作用の同定
tagtag
0
120
データマイニング - ウェブとグラフ
trycycle
PRO
0
220
機械学習 - ニューラルネットワーク入門
trycycle
PRO
0
910
MCMCのR-hatは分散分析である
moricup
0
540
Kaggle: NeurIPS - Open Polymer Prediction 2025 コンペ 反省会
calpis10000
0
290
Hakonwa-Quaternion
hiranabe
1
160
Featured
See All Featured
The Curse of the Amulet
leimatthew05
0
4.8k
Are puppies a ranking factor?
jonoalderson
0
2.4k
brightonSEO & MeasureFest 2025 - Christian Goodrich - Winning strategies for Black Friday CRO & PPC
cargoodrich
2
66
CoffeeScript is Beautiful & I Never Want to Write Plain JavaScript Again
sstephenson
162
16k
How to audit for AI Accessibility on your Front & Back End
davetheseo
0
120
Navigating the Design Leadership Dip - Product Design Week Design Leaders+ Conference 2024
apolaine
0
120
Paper Plane
katiecoart
PRO
0
44k
Utilizing Notion as your number one productivity tool
mfonobong
2
190
Chrome DevTools: State of the Union 2024 - Debugging React & Beyond
addyosmani
9
1k
Navigating the moral maze — ethical principles for Al-driven product design
skipperchong
1
210
Easily Structure & Communicate Ideas using Wireframe
afnizarnur
194
17k
Save Time (by Creating Custom Rails Generators)
garrettdimon
PRO
32
1.9k
Transcript
NEWS TOPIC MODELLING BASED ON CITATION DETECTION Alexey Burnakov, TASS
ITAR-TASS www.tass.ru 115th birthday It's been a while…
News Media Market: A Complex Graph
Citation Detection
News Specific Citation Detection Personal data Headline Editorial Cite number
Cite index Citing media Date News rating Editor rating Board rating Organization rating
Methods I Cosine similarity Bag of words / tf-idf Generalized
linear models
Methods II PageRank Random-Walk Graph Partitioning
Citation Detection: results precision = 0.89 recall = 0.87 Logistic
regression output F1 score = 0.88 MCC score = 0.88 AUC: 0.998 We did good at a train dataset
PageRank: results. TOP-25 of the Russian Mass Media
NLP Pipeline Raw text Tokenization Who cites TASS Which news
was cited Topic modelling Customer facing
Topic Modelling I Motivation: Are there big topics today? Notre-Dame
de Paris’s on fire : (
Topic Modelling II Airbus emergency landing Motivation: Are there big
topics today?
Topic Modelling III Flood in the Irkutsk Region :( `Losharik`
Submarine deadly accident :( Motivation: Are there big topics today?
Topic Report Ex-Kyrgyz president Atambaev seizure by special forces
Competition Snapshot Which agency did a good job?
Daily Competition Snapshot Who is the hero of the day?
Personal data Personal data Personal data Personal data Personal data
Daily Competition Snapshot https://www.gazeta.ru/politics/2019/08/08_a_12564199.shtml
THANK YOU!