Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
統計的学習理論の基礎 II
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Masanari Kimura
March 05, 2021
Research
410
3
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
統計的学習理論の基礎 II
Masanari Kimura
March 05, 2021
More Decks by Masanari Kimura
See All by Masanari Kimura
Equivalence of Geodesics and Importance Weighting from the Perspective of Information Geometry
mkimura
0
370
機械学習における重要度重み付けとその応用
mkimura
3
3.4k
Paper Intro: Human Rademacher Complexity
mkimura
0
240
On the principle of Invariant Risk Minimization
mkimura
0
400
論文紹介:Clustering with Bregman Divergences: an Asymptotic Analysis
mkimura
0
620
Generalization Bounds for Set-to-Set Matching with Negative Sampling
mkimura
0
190
論文紹介:On the Importance of Gradients for Detecting Distributional Shifts in the Wild
mkimura
2
900
論文紹介:Dangers of Bayesian Model Averaging under Covariate Shift
mkimura
0
380
Information Geometry of Dropout Training
mkimura
0
350
Other Decks in Research
See All in Research
LLM の Attention 機構まとめ — 数式・計算量・メモリ
puwaer
8
2.2k
2026 東京科学大 情報通信系 研究室紹介 (大岡山)
icttitech
0
3.9k
「車1割削減、渋滞半減、公共交通2倍」を 熊本から岡山へ@RACDA設立30周年記念都市交通フォーラム2026
trafficbrain
1
1.2k
ScoreMatchingRiesz for Automatic Debiased Machine Learning and Policy Path Estimation with an Application to Japanese Monetary Policy Evaluation
masakat0
0
290
Claude Code × autoresearch 実践
mathbullet
0
170
LLMアプリケーションの透明性について
fufufukakaka
0
240
Apache Gravitinoで実現する Icebergカタログ統合とアクセスの一元化
matsumooon
0
300
衛星×エッジAI勉強会 衛星上におけるAI処理制約とそ取組について
satai
4
570
[BlackHatAsia2026] Hidden Telemetry: Uncovering TraceLogging ETW Providers You're Not Using (Yet)
asuna_jp
1
550
LLM Compute Infrastructure Overview
karakurist
2
1.5k
世界モデルにおける分布外データ対応の方法論
koukyo1994
7
2.2k
Data Visualization Tools in the Age of AI
flekschas
0
160
Featured
See All Featured
How to Build an AI Search Optimization Roadmap - Criteria and Steps to Take #SEOIRL
aleyda
1
2.1k
Have SEOs Ruined the Internet? - User Awareness of SEO in 2025
akashhashmi
0
370
The State of eCommerce SEO: How to Win in Today's Products SERPs - #SEOweek
aleyda
2
11k
Bioeconomy Workshop: Dr. Julius Ecuru, Opportunities for a Bioeconomy in West Africa
akademiya2063
PRO
1
150
ピンチをチャンスに:未来をつくるプロダクトロードマップ #pmconf2020
aki_iinuma
128
56k
Documentation Writing (for coders)
carmenintech
77
5.4k
Claude Code のすすめ
schroneko
67
230k
Exploring anti-patterns in Rails
aemeredith
3
430
Max Prin - Stacking Signals: How International SEO Comes Together (And Falls Apart)
techseoconnect
PRO
0
190
Thoughts on Productivity
jonyablonski
76
5.2k
Connecting the Dots Between Site Speed, User Experience & Your Business [WebExpo 2025]
tammyeverts
11
950
The Spectacular Lies of Maps
axbom
PRO
1
820
Transcript
CompML ౷ܭతֶशཧͷجૅ II Masanari Kimura (@machinery81)
CompML TL;DR • ౷ܭతֶशཧͷجૅతͳࣄ߲ͷ·ͱΊ • ୈೋճҎԼͷτϐοΫʹ͍ͭͯ • ू߹ͷ֓೦ • VC-Dimension
• Pseudo-Dimension • Fat-Shattering Dimension • VCόϯυ 2
CompML VC-Dimension
CompML VC-Dimension ఆٛ 1.ʢVC-࣍ݩʣՄଌۭؒ ͷ͋Δू߹Λ ͱ͢Δɽશͯͷ෦ू߹ ʹ͍ͭͯɼ ͱͳΔΑ͏ͳ ͕ଘࡏ͢Δͱ͖ɼू߹
Ͱ͞ ΕΔͱ͍͏ɽ ͷVapnik-Chervonenkis࣍ݩ ɼ ʹΑͬͯ͞ΕΔू ߹ͷجͷ࠷େʹ͍͠ɽ (𝑋, 𝑆) 𝒜 ⊂ 𝑆 𝐵 ⊂ 𝑆 𝑆 ∩ 𝐴 = 𝐵 𝐴 ∈ 𝒜 𝑆 𝒜 𝒜 𝑉𝐶𝑑𝑖𝑚(𝒜) 𝒜 Photo by Wikipedia.
CompML The Pseudo-Dimension ఆٛ2.ʢ -࣍ݩʣՄଌۭؒ ͷ্ͷՄଌؔͷू߹Λ ͱ͢Δɽ ू߹ ҎԼ͕Γཱͭͱ͖ -shatteredͰ͋Δͱ͍͏ɿ
ҙͷ2ϕΫτϧ ͱͦΕʹରԠ͢Δؔ ʹ͍ͭͯɼ ্هͷ݅ΛHeavisideؔ Ͱॻ͖͑Δͱ ؔΫϥε ͷ -࣍ݩ ʹΑͬͯ -shatteredͱͳΔΑ͏ͳू߹ͷجͷ࠷େͰఆٛ͞Εɼ ͱॻ͔ΕΔɽ 𝑃 (𝑋, 𝑆 ) ℱ ⊂ [0,𝑅] 𝑋 𝑆 = {𝑥1 , …, 𝑥𝑛} ⊂ 𝑋 𝑃 𝑒 ∈ {0,1}𝑛 𝑓𝑒 ∈ ℱ { 𝑓𝑒(𝑥𝑖) ≥ 𝑐𝑖 𝑖𝑓 𝑒𝑖 = 1, 𝑓𝑒(𝑥𝑖) < 𝑐𝑖 𝑖𝑓 𝑒𝑖 = 0. 𝜂(𝑧) 𝜂[𝑓𝑒(𝑥𝑖) − 𝑐𝑖] = 𝑒𝑖 , ∀𝑖, ∀𝑒 . ℱ 𝑃 ℱ 𝑃 𝑃𝑑𝑖𝑚(ℱ)
CompML Illustration of P-Shattering 𝑥1 𝑥2 𝑥3 𝑓 [01…1] 𝑓
[00…1] 𝑓 [11…0] 𝑐1 𝑐2 𝑐3 { 𝑓𝑒(𝑥𝑖) ≥ 𝑐𝑖 𝑖𝑓 𝑒𝑖 = 1, 𝑓𝑒(𝑥𝑖) < 𝑐𝑖 𝑖𝑓 𝑒𝑖 = 0.
CompML VC࣍ݩͱ -࣍ݩͷಉ݅ 𝑃 ิ1ɽ ʹ͍ͭͯɼҎԼͷΑ͏ʹ Λఆٛ͢Δɿ ͜ͷͱ͖ɼ ℱ =
{𝑓:𝑋 → [0,𝑅]} ¯ ℱ ¯ ℱ = { ¯ 𝑓(𝑥, 𝑐) = 𝜂[𝑓(𝑥) − 𝑐] :𝑓 ∈ ℱ} . 𝑃𝑑𝑖𝑚( ¯ ℱ) = 𝑉𝐶𝑑𝑖𝑚( ¯ ℱ) .
CompML The Fat-Shattering Dimension ఆٛɽʢFat-Shattering࣍ݩʣ Մଌۭؒ ͷ্ͷՄଌؔͷू߹Λ ͱ͢Δɽू߹ Ҏ Լ͕Γཱͭͱ͖෯
͓Αͼਫ਼ Ͱfat-shatteredͰ͋Δͱ͍͏ɿ ҙͷ2ϕΫτϧ ͱͦΕʹରԠ͢Δؔ ʹ͍ͭͯɼ ؔΫϥε ͷFat-Shattering࣍ݩ ʹΑͬͯfat-shatteredͱͳΔΑ͏ͳू߹ͷج ͷ࠷େͰఆٛ͞Εɼ ͱॻ͔ΕΔɽ (𝑋, 𝑆) ℱ ⊂ [0,𝑅] 𝑋 S = {x1 , …, xn } γ c 𝑒 ∈ {0,1}𝑛 𝑓𝑒 ∈ ℱ { fe (xi ) ≥ ci + γ if ei = 1, fe (xi ) < ci − γ if ei = 0. ℱ ℱ Fdim(ℱ, γ)
CompML VC Generalization Bound ఆཧɽظޡࠩ ͓Αͼܦݧޡࠩ ʹ͍ͭͯɼVC࣍ݩΛ ͱॻ͘ͱɼ ͕ຬ͞ΕΔɽ ൚Խޡ͕ࠩVC࣍ݩΛ༻͍ͯ͑ΒΕΔɽ
R(h) ̂ R(h) dVC R(h) − ̂ R(h) ≤ 8dVC(ln 2m dVC + 1) + 8 ln 4 δ m
CompML LemmaʢSymmetrizationʣ ิɽ ͱͳΔΑ͏ͳ ʹ͍ͭͯɼ ͕Γཱͭɽ͜͜Ͱ ؔͷظͱܦݧͷࠩɼಠཱʹಘΒΕͨೋछྨͷܦݧͷࠩͰ͑ΒΕΔɽ t ≥ 2/m
t > 0 P( sup f∈ℱ | f − ̂ f | ) ≤ 2P( sup f∈ℱ | ̂ f′ − ̂ f | ≥ t/2) f = 𝔼[ f ] ̂ f = 1 m m ∑ i=1 f(xi , yi ) ̂ f′ = 1 m m ∑ i=1 f(x′ i , y′ i )
CompML ࢀߟจݙ • Shalev-Shwartz, S., Ben-David, S. (2014). Understanding Machine
Learning - From Theory to Algorithms.. Cambridge University Press. ISBN: 978-1-10-705713-5 • Mohri, Mehryar, Afshin Rostamizadeh, and Ameet Talwalkar. Foundations of machine learning. MIT press, 2018.