Joint Topic Models
Kento Nozawa
March 29, 2016
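Presented on March 29, 2016 as a lightning talk at the final session of the reading group for 『トピックモデルによる統計的潜在意味解析』 (Statistical Latent Semantic Analysis with Topic Models), the "Issei Sato special".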
Transcript
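Joint Topic Models
Final session of the reading group for 『トピックモデルによる統計的潜在意味解析』 (Statistical Latent Semantic Analysis with Topic Models), "Issei Sato special"
Kento Nozawa (@nzw0301), 2016-03-29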
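Self-introduction
Nice to meet you. Kento Nozawa (@nzw0301)
M1 (first-year master's student) at the University of Tsukuba from this spring
Interests
• Machine learning, NLP, graphs, DL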
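Joint topic models
Learn from document data together with the information that corresponds to it
• Japanese and English
• Recipes and ingredients
• Words and their parts of speech
[Figure: graphical model of the joint topic model. Plates D, N1, N2, K; nodes α, θ, z^1_i, z^2_i, w^1_i, w^2_i, φ^1, φ^2, β^1, β^2]
Reference: graphical model of standard LDA
[Figure: graphical model of standard LDA. Plates D, N, K; nodes α, θ, z_i, w_i, φ, β]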
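Sampling equation for z
• Can be derived by following Chapter 3, p. 55
• Details: http://nzw0301.github.io/2016/02/jointTopicModelsEquation
• Derive the Gibbs sampling equation from the following expression
$$p(z^1_{d,i} = k \mid w^1_{d,i} = v, W^1_{\backslash d,i}, W^2, Z^1_{\backslash d,i}, Z^2, \alpha, \beta^1, \beta^2)$$
Reference: standard LDA
$$p(z^1_{d,i} = k \mid w^1_{d,i} = v, W^1_{\backslash d,i}, Z^1_{\backslash d,i}, \alpha, \beta^1)$$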
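Result of the derivation
• Sampling equation of the joint topic model
• The φ term is the same as in standard LDA
• In the θ term, the number of terms grows with the number of auxiliary information sources
$$p(z^1_{d,i} = k \mid w^1_{d,i} = v, W^1_{\backslash d,i}, W^2, Z^1_{\backslash d,i}, Z^2, \alpha, \beta^1, \beta^2) \propto \frac{n^1_{k,v,\backslash d,i} + \beta^1_v}{\sum_{v'} \left(n^1_{k,v',\backslash d,i} + \beta^1_{v'}\right)} \cdot \frac{n^1_{d,k,\backslash d,i} + n^2_{d,k} + \alpha_k}{\sum_{k'} \left(n^1_{d,k',\backslash d,i} + n^2_{d,k'} + \alpha_{k'}\right)}$$
Reference: sampling equation of standard LDA
$$p(z^1_{d,i} = k \mid w^1_{d,i} = v, W^1_{\backslash d,i}, Z^1_{\backslash d,i}, \alpha, \beta^1) \propto \frac{n^1_{k,v,\backslash d,i} + \beta^1_v}{\sum_{v'} \left(n^1_{k,v',\backslash d,i} + \beta^1_{v'}\right)} \cdot \frac{n^1_{d,k,\backslash d,i} + \alpha_k}{\sum_{k'} \left(n^1_{d,k',\backslash d,i} + \alpha_{k'}\right)}$$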
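As a minimal sketch of how the joint sampling equation on this slide could be evaluated, the NumPy function below computes the normalized distribution over topics for a single token of modality 1. The function name `topic_conditional`, the array names, and the toy counts are illustrative assumptions and do not come from the slides. The count arrays passed in are assumed to already exclude the token being resampled.

```python
import numpy as np

def topic_conditional(v, d, n_kv_1, n_dk_1, n_dk_2, alpha, beta_1):
    """Normalized p(z^1_{d,i} = k | ...) for k = 0..K-1.

    n_kv_1: (K, V1) topic-word counts for modality 1
    n_dk_1, n_dk_2: (D, K) document-topic counts for modalities 1 and 2
    alpha: (K,) Dirichlet prior on theta
    beta_1: (V1,) Dirichlet prior on phi^1
    """
    # phi term: identical to standard LDA.
    word_term = (n_kv_1[:, v] + beta_1[v]) / (n_kv_1 + beta_1).sum(axis=1)
    # theta term: counts from the second modality are added in.
    doc_counts = n_dk_1[d] + n_dk_2[d] + alpha
    doc_term = doc_counts / doc_counts.sum()
    p = word_term * doc_term
    return p / p.sum()

# Toy counts (hypothetical): K = 2 topics, V1 = 3 word types, D = 1 document.
n_kv_1 = np.array([[2.0, 0.0, 1.0], [0.0, 3.0, 0.0]])
n_dk_1 = np.array([[3.0, 3.0]])
n_dk_2 = np.array([[4.0, 0.0]])
alpha = np.array([0.5, 0.5])
beta_1 = np.array([0.1, 0.1, 0.1])

p_joint = topic_conditional(0, 0, n_kv_1, n_dk_1, n_dk_2, alpha, beta_1)
# Setting the second modality's counts to zero recovers the standard LDA case.
p_lda = topic_conditional(0, 0, n_kv_1, n_dk_1, np.zeros_like(n_dk_2), alpha, beta_1)
```

In this toy setting the second modality assigns all of its tokens in the document to topic 0, so `p_joint` puts more mass on topic 0 than `p_lda` does.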
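Example run
• A simple example assuming a parallel corpus
• The languages differ, but the topics are shared
http://nzw0301.github.io/2016/02/jointTopicModelsEquation
Implementation: store the statistics used in the LDA sampling equation in arrays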
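The idea of holding the sampling statistics in arrays can be sketched as follows. This is not the author's implementation (that is at the URL above). It is a hypothetical collapsed Gibbs sampler on a made-up two-language corpus, assuming symmetric priors `alpha` and `beta` and word ids that are local to each language. The document-topic array `n_dk` is shared by both languages, which is what ties their topics together, while each language keeps its own topic-word array.

```python
import numpy as np

rng = np.random.default_rng(0)
K, alpha, beta = 2, 0.1, 0.01  # hypothetical settings, symmetric priors

# Toy parallel corpus: docs[m][d] is the word-id list of document d in language m.
docs = [
    [[0, 1, 0, 1], [2, 3, 3, 2]],  # language 1
    [[0, 0, 1], [2, 2, 3, 3]],     # language 2
]
V = [4, 4]          # vocabulary size per language
D = len(docs[0])

# Sufficient statistics held as arrays.
n_dk = np.zeros((D, K))                          # shared across languages
n_kv = [np.zeros((K, V[m])) for m in range(2)]   # one array per language

# Random initial topic assignments, then fill the count arrays.
z = [[rng.integers(K, size=len(doc)) for doc in docs[m]] for m in range(2)]
for m in range(2):
    for d, doc in enumerate(docs[m]):
        for i, v in enumerate(doc):
            k = z[m][d][i]
            n_dk[d, k] += 1
            n_kv[m][k, v] += 1

for sweep in range(50):
    for m in range(2):
        for d, doc in enumerate(docs[m]):
            for i, v in enumerate(doc):
                # Remove the current token from the statistics.
                k = z[m][d][i]
                n_dk[d, k] -= 1
                n_kv[m][k, v] -= 1
                # Unnormalized conditional; the theta denominator is constant in k.
                p = (n_kv[m][:, v] + beta) / (n_kv[m].sum(axis=1) + beta * V[m])
                p = p * (n_dk[d] + alpha)
                k = rng.choice(K, p=p / p.sum())
                # Add the token back with its new topic.
                z[m][d][i] = k
                n_dk[d, k] += 1
                n_kv[m][k, v] += 1
```

Because `n_dk[d]` already sums the counts of both languages, the update needs no separate term for the other language's assignments.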
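References
• Tomoharu Iwata. トピックモデル (Topic Models). Kodansha. 2015. (MLP Machine Learning Professional Series).
• Issei Sato. トピックモデルによる統計的潜在意味解析 (Statistical Latent Semantic Analysis with Topic Models). Corona Publishing. 2015. (Natural Language Processing Series, 8).
• David Mimno, Hanna M. Wallach, Jason Naradowsky, David A. Smith, and Andrew McCallum. 2009. Polylingual Topic Models. In EMNLP.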