Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
OpenTalks.AI - Алексей Бурнаков, Тематическое м...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
OpenTalks.AI
February 21, 2020
Science
2.1k
0
Share
OpenTalks.AI - Алексей Бурнаков, Тематическое моделирование новостей на основе детекции цитирований
OpenTalks.AI
February 21, 2020
More Decks by OpenTalks.AI
See All by OpenTalks.AI
OpenTalks.AI - Виктор Лемпицкий, Моделирование 3Д сцен: новые подходы в 2020 году
opentalks
0
500
OpenTalks.AI - Алексей Чернявский, Нейросетевые алгоритмы для повышения качества медицинских изображений
opentalks
0
440
OpenTalks.AI - Александр Громов, Устойчивость нейросетевых моделей при анализе КТ/НДКТ-исследований
opentalks
0
380
OpenTalks.AI - Денис Тимонин, Megatron-LM: Обучение мультимиллиардных LMs при помощи техники Model Parallelism
opentalks
0
530
OpenTalks.AI - Егор Филимонов, Возможности платформы Huawei Atlas и эффективный гетерогенный инференс.
opentalks
0
170
OpenTalks.AI - Александр Прозоров, Референсная архитектура робота сервисного центра в отраслях с изменчивыми бизнес-процессами
opentalks
0
390
OpenTalks.AI - Наталья Лукашевич, Анализ тональности по отношению к компании — с чем не справился BERT
opentalks
0
340
OpenTalks.AI - Константин Воронцов, Фейковые новости и другие типы потенциально опасного дискурса: типология, подходы, датасеты, соревнования
opentalks
0
450
OpenTalks.AI - Дмитрий Ветров, Фрактальность функции потерь, эффект двойного спуска и степенные законы в глубинном обучении - фрагменты одной мозаики
opentalks
0
490
Other Decks in Science
See All in Science
白金鉱業Vol.21【初学者向け発表枠】身近な例から学ぶ数理最適化の基礎 / Learning the Basics of Mathematical Optimization Through Everyday Examples
brainpadpr
1
720
アクシズを探せ! 各勢力の位置関係についての考察
miu_crescent
PRO
1
270
Bear-safety-running
akirun_run
0
140
(メタ)科学コミュニケーターからみたAI for Scienceの同床異夢
rmaruy
0
220
主成分分析に基づく教師なし特徴抽出法を用いたコラーゲン-グリコサミノグリカンメッシュの遺伝子発現への影響
tagtag
PRO
0
250
PPIのみを用いたAIによる薬剤–遺伝子–疾患 相互作用の同定
tagtag
PRO
0
210
Vibecoding for Product Managers
ibknadedeji
0
160
Fairfax County’s Tree Canopy: Examining the Effects of Land Development Regulations on Tree Canopy Conservation
pwiseman
1
110
Accelerating operator Sinkhorn iteration with overrelaxation
tasusu
0
310
Algorithmic Aspects of Quiver Representations
tasusu
0
320
KISHIMOTO Atsuo
genomethica
0
130
【論文紹介】Is CLIP ideal? No. Can we fix it?Yes! 第65回 コンピュータビジョン勉強会@関東
shun6211
5
2.4k
Featured
See All Featured
Digital Ethics as a Driver of Design Innovation
axbom
PRO
1
280
Reflections from 52 weeks, 52 projects
jeffersonlam
356
21k
The AI Search Optimization Roadmap by Aleyda Solis
aleyda
1
5.8k
Noah Learner - AI + Me: how we built a GSC Bulk Export data pipeline
techseoconnect
PRO
0
180
Optimising Largest Contentful Paint
csswizardry
37
3.7k
Helping Users Find Their Own Way: Creating Modern Search Experiences
danielanewman
31
3.2k
Self-Hosted WebAssembly Runtime for Runtime-Neutral Checkpoint/Restore in Edge–Cloud Continuum
chikuwait
0
510
The Impact of AI in SEO - AI Overviews June 2024 Edition
aleyda
5
1.1k
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
141
35k
The Illustrated Children's Guide to Kubernetes
chrisshort
51
52k
The Success of Rails: Ensuring Growth for the Next 100 Years
eileencodes
47
8.1k
Testing 201, or: Great Expectations
jmmastey
46
8.1k
Transcript
NEWS TOPIC MODELLING BASED ON CITATION DETECTION Alexey Burnakov, TASS
ITAR-TASS www.tass.ru 115th birthday It's been a while…
News Media Market: A Complex Graph
Citation Detection
News Specific Citation Detection Personal data Headline Editorial Cite number
Cite index Citing media Date News rating Editor rating Board rating Organization rating
Methods I Cosine similarity Bag of words / tf-idf Generalized
linear models
Methods II PageRank Random-Walk Graph Partitioning
Citation Detection: results precision = 0.89 recall = 0.87 Logistic
regression output F1 score = 0.88 MCC score = 0.88 AUC: 0.998 We did good at a train dataset
PageRank: results. TOP-25 of the Russian Mass Media
NLP Pipeline Raw text Tokenization Who cites TASS Which news
was cited Topic modelling Customer facing
Topic Modelling I Motivation: Are there big topics today? Notre-Dame
de Paris’s on fire : (
Topic Modelling II Airbus emergency landing Motivation: Are there big
topics today?
Topic Modelling III Flood in the Irkutsk Region :( `Losharik`
Submarine deadly accident :( Motivation: Are there big topics today?
Topic Report Ex-Kyrgyz president Atambaev seizure by special forces
Competition Snapshot Which agency did a good job?
Daily Competition Snapshot Who is the hero of the day?
Personal data Personal data Personal data Personal data Personal data
Daily Competition Snapshot https://www.gazeta.ru/politics/2019/08/08_a_12564199.shtml
THANK YOU!