×
Copy
Open
Link
Embed
Share
Beginning
This slide
Copy link URL
Copy link URL
Copy iframe embed code
Copy iframe embed code
Copy javascript embed code
Copy javascript embed code
Share
Tweet
Share
Tweet
Slide 1
Slide 1 text
Title: Topic Modeling of Short Texts: A Pseudo-Document View Authors: Yuan Zuo, Junjie Wu, Hui Zhang, Hao Lin, Fei Wang, Ke Xu, Hui Xiong @KDD 2016ษڧձ NOZAWA Kento (ஜେM1/AIST RA)
Slide 2
Slide 2 text
LDAͷ՝: จॻ͕͍ͱτϐοΫͷֶशʹࣦഊ • ݪҼ: ڞىใ͕ेʹಘΒΕͳ͍ͨΊ • ղܾࡦ: ڞىใΛ૿͢Α͏ͳ • จॻΛΫϥελϦϯάٖͨ͠ࣅจॻͰֶश ػցֶश -%" /-1 ػցֶश ϕΠζਪ ࠷దԽ ࠓى͖ͨ Εͳ͍ շປ short text ٖࣅจॻ ΫϥελϦϯά ࠓى͖ͨ Εͳ͍ շປ ػցֶश -%"/-1 ػցֶश ϕΠζਪ ࠷దԽ
Slide 3
Slide 3 text
D N K z ✓ w ↵ D N K P l ✓ z w ↵ Graphical models • K: τϐοΫ • D: จॻ • N: จॻͷ୯ޠ • P: ٖࣅจॻ େখؔ: K
Slide 4
Slide 4 text
D N K P l ✓ z w ↵ Generative process of PTM PTM • ٖࣅจॻ: จॻͷϋʔυΫϥελ • ٖࣅจॻ1ͭʹτϐοΫθ͕ఆٛ ٖࣅจॻID
Slide 5
Slide 5 text
• 1จॻ͝ͱʹٖࣅจॻ Λαϯϓϧ • ୯ޠ͝ͱʹτϐοΫ z Λαϯϓϧ short textͩͱ΄΅0খ͍͔͞͠ͱΒͳ͍ ରͯ͠ ରԠ͢ΔٖࣅจॻશମͰͷස͕͑Δ Inference by collapsed Gibbs sampling Nz lds Nz ds l p(zs,i = z|rest) / (Nz ds + ↵)( N wds,i z + Nz + V ) (LDA) p(zs,i = z|rest) / (Nz lds + ↵)( N wds,i z + Nz + V ) (PTM)
Slide 6
Slide 6 text
• SPTM • Spike and Slab prior ΛٖࣅจॻͷτϐοΫʹ͍ΕΔ • EPTM • ෳͷٖࣅจॻʹଐͤΔ PTMΛ֦ுͨ͠ϞσϧఏҊ
Slide 7
Slide 7 text
σʔληοτͱֶशύϥϝʔλ * Questions : தࠃޠ ύϥϝʔλ • τϐοΫ K: 100 • ٖࣅจॻP: 1000
Slide 8
Slide 8 text
࣮ݧ༰ 1. จॻྨ • ༩ͨ͠τϐοΫΛͱʹSVMͰྨ 2. UCI topic Coherence • wikipediaͷσʔλΛͬͯܭࢉ • NewsͱDBLPͷΈ 3. ύϥϝʔλൺֱ • ٖࣅจॻɼֶशσʔλɼϞσϧൺֱ 4. ༩ͨ͠τϐοΫͷྫ • লུ
Slide 9
Slide 9 text
จॻྨ • จॻ͕খ͍͞(News)ͱSPTM > PTM • ֶशσʔλ͕গͳͯ͘Fେ͖͘Լ͕Βͳ͍
Slide 10
Slide 10 text
UCI topic Coherence จॻྨͱಉ༷ɼจॻ͕খ͍͞ͱSPTM > PTM
Slide 11
Slide 11 text
• ٖࣅจॻ͋Δఔେ͖͘͢Δඞཁ͕͋Δ • ٖࣅจॻ͕গͳ͍ͱSPTM͕Α͍ • NewsͰٖࣅจॻΛ૿͢ͱ SPTM < PTMʹͳΔ ԣ࣠: ٖࣅจॻ ύϥϝʔλൺֱ
Slide 12
Slide 12 text
ఏҊख๏ؒͰͷൺֱ • EPTM͕࠷ѱ͍
Slide 13
Slide 13 text
·ͱΊ short textͰ͏·ֶ͘श͢ΔτϐοΫϞσϧͷఏҊ • จॻू߹ΛΫϥελϦϯάٖͨ͠ࣅจॻΛજࡏมʹಋೖ • 3ͭͷϞσϧΛఏҊ • σʔλٖࣅจॻ͕গͳ͍߹SPTM • ͦΕҎ֎PTM