Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
【ICML読み会】Unsupervised Deep Embedding for Cluste...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Hayato Maki
July 16, 2016
Technology
1.4k
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
【ICML読み会】Unsupervised Deep Embedding for Clustering Analysis
Hayato Maki
July 16, 2016
More Decks by Hayato Maki
See All by Hayato Maki
Billion-scale Embedding for E-commerce Recommendation in Alibaba
hamaki
0
120
Today was a Good Day: The Daily Life of Software Developers
hamaki
0
120
論文紹介:Relaxed Softmax for PU Learning
hamaki
3
1.1k
MIRU 2019 Lunch on Seminar
hamaki
1
290
コーディネート整合性を考慮したカテゴリ間推薦
hamaki
0
1.2k
Regularization_The Element of Statical Learning
hamaki
0
220
Neural Activity During Sentence Processing as Reflected in Theta, Alpha, Beta, and Gamma Oscillations
hamaki
0
260
Other Decks in Technology
See All in Technology
iOS アプリの「これって不具合ですか?」を AI に調べてもらう
miichan
0
110
40代で“やっとエンジニアになれた”――閉じた学びを開き、空の青さを知る / 20260628 Naoki Takahashi
shift_evolve
PRO
4
120
フィジカル版Github Onshapeの紹介
shiba_8ro
0
300
FPC(フレキシブル)基板にZephyr実装してみた。
iotengineer22
0
130
20260619 私の日常業務での生成 AI 活用
masaruogura
1
230
Oracle AI Database@Google Cloud:サービス概要のご紹介
oracle4engineer
PRO
6
1.6k
攻撃者視点で考えるDetection Engineering
cryptopeg
3
2k
アンオフィシャルな、オフィシャルからのお願い
wyamazak_devrel
0
140
AWS Security Agent といっしょに脅威モデリングをやってみよう
amarelo_n24
1
190
気軽に使える"情報のハブ"としてのNotion活用 〜フロー情報の集積点 と、 Claude Code × Notion AI〜
syucream
1
160
就職⽀援サービスにおけるキャリアアドバイザーのシフトスケジューリング
recruitengineers
PRO
1
150
Bucharest Tech Week 2026 - Reinventing testing practices in the AI era
edeandrea
PRO
1
170
Featured
See All Featured
Beyond borders and beyond the search box: How to win the global "messy middle" with AI-driven SEO
davidcarrasco
3
160
Raft: Consensus for Rubyists
vanstee
141
7.5k
Pawsitive SEO: Lessons from My Dog (and Many Mistakes) on Thriving as a Consultant in the Age of AI
davidcarrasco
0
160
Organizational Design Perspectives: An Ontology of Organizational Design Elements
kimpetersen
PRO
1
750
Build The Right Thing And Hit Your Dates
maggiecrowley
39
3.2k
Build your cross-platform service in a week with App Engine
jlugia
234
18k
Between Models and Reality
mayunak
4
340
Are puppies a ranking factor?
jonoalderson
1
3.6k
CoffeeScript is Beautiful & I Never Want to Write Plain JavaScript Again
sstephenson
162
16k
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
55k
Lightning talk: Run Django tests with GitHub Actions
sabderemane
0
200
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
287
14k
Transcript
ICML2016จհ Unsupervised Deep Embedding for Clustering Analysis Ross Girshick Jungian
Xie Ali Farhadi University of Washington Facebook AI Research University of Washington ൃදऀ ਅ༐ਓ ಸྑઌՊֶٕज़େֶӃେֶ ใՊֶݚڀՊ ത࢜ޙظ՝ఔ ೳίϛϡχέʔγϣϯݚڀࣨ 2016/07/16 @NAIST
3ߦͰཁ • ରɿݹయతͳΫϥελϦϯά • ख๏ɿਂֶशΛར༻ͨ࣍͠ݩݮύ ϥϝλͱΫϥελϦϯάͷಉ࣌ ࠷దԽ • ݁Ռɿैདྷख๏ΑΓߴ͍ਫ਼ɼ͍ ܭࢉ࣌ؒΛ࣮ݱ
ΫϥελϦϯάͷؔ࿈ݚڀ • k-means ٴͼ ࠞ߹ਖ਼نϞσϧ(GMM) • ೖྗͷ࣍ݩ͕ߴ͍ͱࣦഊ͍͢͠ • ࣍ݩݮͱΫϥελϦϯάΛಉ࣌ʹߦ͏ख๏ •
࣍ݩۭؒʹࣸ૾ɼࣸ૾ͨ͠ઌͰΫϥελϦϯά • ैདྷख๏ઢܗࣸ૾ͷΈ • εϖΫτϥϧɾΫϥελϦϯά • σʔλͷάϥϑߏΛར༻͢Δख๏ • k-meansΑΓྑ͍݁ՌʹͳΔ͜ͱ͕ଟ͍ • ܭࢉྔ͕αϯϓϧͷ̎·ͨ̐ʹൺྫ
ه߸ • σʔλɿ • σʔλɿ • Ϋϥελͷʢࣄલʹܾఆʣɿ • ࣸ૾ɿ •
ɹ ͷ࣍ݩ <<< ͷ࣍ݩ • ࣸ૾ͷύϥϝλ ΛDNNͰֶश • ࣸ૾ઌͷσʔλɿ • ηϯτϩΠυʢΫϥελΛද͢Δʣɿ n { xi 2 X }n i=1 k zi = f✓( xi) ✓ {zi 2 Z}n i=1 {µj 2 Z}k i=1 zi = f✓( xi) zi = f✓( xi)
ఏҊ๏ͷྲྀΕ ॳظԽ ࣍ݩݮ ΫϥελׂΓͯ KL divergenceܭࢉ ύϥϝλߋ৽
࣍ݩݮ • ਂֶशΛར༻ͨ͠ඇઢܗͳ࣍ݩԽࣸ૾ f✓ : X ! Z { xi
2 X }n i=1 zi = • ڭࢣͳֶ͠शͷͨΊɼަ ࠩݕূ๏ʹΑΔϋΠύʔύ ϥϝλͷௐͰ͖ͳ͍ • ͦͷͨΊɼΑ͘ΘΕΔ ωοτϫʔΫߏΛ༻ • ֤ͷ࣍ݩ (input)-500-500-2000-10 • શ݁߹ [van der Maaten, 09]
ΫϥελׂΓͯ Soft Asignment • ࣸ૾͞Εͨσʔλ ͱηϯτϩΠυ ͷྨࣅ ई (soft assignment)
ɼ ͕̹൪ͷΫϥελʹೖΔ֬ͱͯ͠ ղऍͰ͖Δɽ qij = 1 + kzi µj0 k2/↵ (↵+1)/2 P j0 (1 + kzi µj0 k2/↵) (↵+1)/2 ↵ = 1 {zi 2 Z}n i=1 µj [van der Maaten & Hinton, 08] qij = 1 + kzi µj0 k2/↵ (↵+1)/2 P j0 (1 + kzi µj0 k2/↵) (↵+1)/2 {zi 2 Z}n i=1 • ڭࢣͳֶ͠शʹ͓͍ͯɼަࠩݕূ๏͑ͳ͍ ͨΊɼ ʹݻఆɽ
KLμΠόʔδΣϯεʹΑΔDNNֶश • ఆతͳׂΓͯ • ඪʢཧతͳΫϥελϦϯάΛߦ͏ͱߟ͑Β ΕΔʣ • PͱQͷKLμΠόʔδΣϯεΛ࠷খԽ͢ΔΑ͏ʹDNN Λֶश •
Pͷઃఆ͕ຊख๏ͷΩϞ
ύϥϝλߋ৽ • DNNͷύϥϝλθ ͱ ηϯτϩΠυ μj Λߋ৽ • SGDͰߋ৽ (θόοΫϓϩύήʔγϣϯ)
ॳظԽ • DNNͷॳظԽɿ Stacked Auto Encoder Λར༻ • ηϯτϩΠυͷॳظԽɿॳظԽDNNΛར༻ͯ࣍͠ݩ ݮ͠ɼࣸ૾ઌͰk-means
࣮ݧ • σʔληοτ • ൺֱख๏ • k-means • LDGMI (εϖΫτϥϧɾΫϥελϦϯά)
• SEC (εϖΫτϥϧɾΫϥελϦϯά) • Without back propagation
࣮ • Stacked Auto EncoderͷॳظԽ • ฏۉ0ɼඪ४ภࠩ0.01ͷਖ਼نΛͬͨ ཚͰॏΈΛॳظԽ • ֤͝ͱʹ50000ճ෮ʢ20%Dropoutʣ
• Auto EncoderશମͰ100000ճ෮ͯ͠ fine tuning (Dropoutແ͠) • ϛχόοναΠζ=256 • ֶश=0.1 ←20000෮ຖʹ1/10
࣮ • ηϯτϩΠυͷॳظԽ • ҟͳΔॳظͰ20ճ࣮ߦͯ͠ϕετͳ ͷΛબ • KLμΠόʔδΣϯεͷ࠷খԽ • ֶश=0.01
(ݻఆ) • ऩଋఆ • ΫϥελͷׂΓ͕ͯมԽ͢Δσʔλ͕ 0.1%ҎԼʹͳΔ·Ͱ
ධՁج४ • Unsupervised Clustering Accuracy (ACC) • pi ɿਅͷϥϕϧ •
qi ɿΞϧΰϦζϜ͕ग़ྗͨ͠ϥϕϧ • map()ɿϥϕϧ͔ΒΫϥελͷ࠷దͳϚοϐϯά
݁Ռ • ఏҊ๏͕ϕετͷੑೳ • REUTERSʹ͍ͭͯ • ఏҊ๏ͷֶश࣌ؒ30ఔ • LDMGIͱSECϲ݄Ҏ্ͷܭࢉ࣌ؒͱςϥ ୯ҐͷϝϞϦ͕ඞཁ
ϋΠύʔύϥϝλʹର͢Δؤ݈ੑ • ҟͳΔ9ͭͷϋΠύʔύϥϝλʢΞχʔϦϯάʣ ͰੑೳΛൺֱ • ఏҊ๏ϋΠύʔύϥϝλͷมಈʹରͯ͠ؤ݈ɼ ͔ͭσʔληοτʹඇґଘ • ڭࢣͳֶ͠शʹ͓͍ͯॏཁͳੑ࣭
ඪPͷੑ࣭ • qij ͕େ͖΄Ͳ(֬৴͕େ͖͍΄Ͳ)ɼޯ͕ େ͖͘ͳΔ ˠP·͍͠ੑ࣭Λ͍࣋ͬͯΔ
࣍ݩݮͷ෮࠷దԽʹΑΔޮՌ • t-SNEΛར༻ͨ͠ՄࢹԽ [van der Maaten & Hinton, 08] •
ߋ৽͕ਐΉ΄ͲΫϥελʔͷ͕໌֬ʹ
Auto EncoderʹΑΔಛநग़ͷޮՌ • Auto EncoderͰಛநग़ˠ֤ΞϧΰϦζϜͰॲཧ • Auto EncoderʹΑΔߩݙ͕େ͖͍
ෆۉҰͳσʔληοτʹର͢Δؤ݈ੑ • αϯϓϧ͕࠷খͷΫϥεͷαϯϓϧΛɼ ࠷େͷαϯϓϧͷΫϥεͷrmin ഒʹઃఆɼ ͦͷଞͷΫϥε0.1ͣͭ૿͍ͯ͘͠ • ఏҊ๏αϯϓϧͷෆۉҰੑʹରͯ͠ؤ݈
݁ • ఏҊ๏ɼਂֶशΛར༻ͨ࣍͠ݩݮʹΑΓΫϥ ελϦϯάͷੑೳΛ্ • ࣍ݩݮͷύϥϝλͱΫϥελϦϯάͷ݁ՌΛಉ ࣌ʹ࠷దԽ • ΫϥελϦϯάఆతͳग़ྗͱඪͱͷKL μΠόʔδΣϯεΛଛࣦؔͱͯ͠όοΫϓϩύ
ήʔγϣϯ • ैདྷ๏ΑΓߴਫ਼͔ͭߴʢܭࢉ࣌ؒαϯϓϧ ʹରͯ͠ઢܗʹൺྫʣɼσʔληοτඇґଘɼϋΠύʔ ύϥϝλඇґଘɼαϯϓϧͷෆۉҰੑʹରͯ͠ؤ݈
Ҏ্