Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
An introduction of statistical learning
Search
Nakamura, Ryotaro
June 28, 2017
Science
0
42
An introduction of statistical learning
Nakamura, Ryotaro
June 28, 2017
Tweet
Share
More Decks by Nakamura, Ryotaro
See All by Nakamura, Ryotaro
Duct for beginners.
nryotaro
0
4.1k
Learn Go in 15 minutes
nryotaro
0
36
Seven architectural patterns
nryotaro
1
100
Improving Performance with Parallel Programming
nryotaro
0
53
Other Decks in Science
See All in Science
Masseyのレーティングを用いたフォーミュラレースドライバーの実績評価手法の開発 / Development of a Performance Evaluation Method for Formula Race Drivers Using Massey Ratings
konakalab
0
190
点群ライブラリPDALをGoogleColabにて実行する方法の紹介
kentaitakura
1
400
データベース04: SQL (1/3) 単純質問 & 集約演算
trycycle
PRO
0
990
機械学習 - 授業概要
trycycle
PRO
0
240
Transport information Geometry: Current and Future II
lwc2017
0
200
baseballrによるMLBデータの抽出と階層ベイズモデルによる打率の推定 / TokyoR118
dropout009
1
560
ド文系だった私が、 KaggleのNCAAコンペでソロ金取れるまで
wakamatsu_takumu
2
1.3k
テンソル分解による糖尿病の組織特異的遺伝子発現の統合解析を用いた関連疾患の予測
tagtag
2
240
ttl2html (RDF/Turtle to HTML)
masao
0
110
動的トリートメント・レジームを推定するDynTxRegimeパッケージ
saltcooky12
0
190
Lean4による汎化誤差評価の形式化
milano0017
1
300
研究って何だっけ / What is Research?
ks91
PRO
1
120
Featured
See All Featured
Into the Great Unknown - MozCon
thekraken
40
2k
jQuery: Nuts, Bolts and Bling
dougneiner
64
7.9k
Java REST API Framework Comparison - PWX 2021
mraible
33
8.8k
Bootstrapping a Software Product
garrettdimon
PRO
307
110k
Helping Users Find Their Own Way: Creating Modern Search Experiences
danielanewman
29
2.9k
CSS Pre-Processors: Stylus, Less & Sass
bermonpainter
358
30k
Fight the Zombie Pattern Library - RWD Summit 2016
marcelosomers
234
17k
The Illustrated Children's Guide to Kubernetes
chrisshort
48
50k
RailsConf 2023
tenderlove
30
1.2k
KATA
mclloyd
32
14k
Designing Experiences People Love
moore
142
24k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
31
2.5k
Transcript
ػցֶशษڧձ ୈ 1 ճ தଜ ྒྷଠ June 13, 2017
Table of contents Supervised Learning 1. Classification 2. Perceptron 3.
Regression Unsupervised Learning 4. Clustering 1
ࠓͷඪ ࣍ճҎ߱ʹֶͿΞϧΰϦζϜͷ֓ཁΛΔ ΞϧΰϦζϜͱద༻ྫ ΞϧΰϦζϜ ద༻ྫ ྨ εύϜϝʔϧఆ ճؼੳ ച্༧ଌ ΫϥελϦϯά
ը૾ͷݮ৭ॲཧ 2
ύϥϝτϦοΫ๏ ϞσϧʢࣜʣΛԾఆ͠ɼϞσϧͷ࠷దͳύϥϝλΛֶश͢Δ ύϥϝτϦοΫ๏ͷखॱ 1. σʔλͷ༧ଌϞσϧΛԾఆ 2. Ϟσϧͷύϥϝλͷ ධՁج४ΛܾΊΔ 3. ύϥϝλΛܾΊΔ
0.0 0.2 0.4 0.6 0.8 1.0 −0.2 0.0 0.2 0.4 0.6 0.8 1.0 Ұ࣍ؔͷϞσϧͷύϥϝλௐ 3
Classification
ྨ Ϋϥεʹྨ͞ΕͨطଘσʔλΛݩʹ৽نσʔλΛྨ͢Δ ΞϧΰϦζϜ • ύʔηϓτϩϯ • ϩδεςΟοΫճؼ 4
Perceptron
ύʔηϓτϩϯ, Ϟσϧ ઢܗͳϞσϧ f Λઃఆ͢Δ f (x, y) = w0
+ w1x + w2y f (x, y) > 0 ⇒ t = +1 f (x, y) < 0 ⇒ t = −1 −20 −10 0 10 20 30 x −30 −20 −10 0 10 20 y t = +1 t = -1 ଐੑ t = ±1 Λͭσʔλ܈ ઢ্ͷ (x′, y′) f (x′, y′) = 0 ΛΈͨ͢ 5
ύʔηϓτϩϯ, ධՁج४ʢޡࠩؔʣ ޡࠩؔ E ͕࠷খʹͳΔ wi ΛٻΊΔ E = N
∑ i=1 {− (w0 + w1x + w2y) ti } = N ∑ i=1 (−f (xi , yi )ti ) • N σʔλ • ޡྨͩͱ −f (xi , yi )ti > 0 −20 −10 0 10 20 30 x −30 −20 −10 0 10 20 y t = +1 t = -1 ଐੑ t = ±1 Λͭσʔλ܈ 6
ϩδεςΟοΫճؼ, Ϟσϧ ύʔηϓτϩϯͱಉ͘͡ઢܗϞσϧ f Λઃఆ͢Δ f (x, y) = w0
+ w1x + w2y f (x, y) > 0 ⇒ t = +1 f (x, y) < 0 ⇒ t = −1 −30 −20 −10 0 10 20 30 x −20 −15 −10 −5 0 5 10 15 20 y t = +1 t = -1 f (x, y) ͕૿Ճ͢Δ͖ 7
ϩδεςΟοΫճؼ, Ϟσϧ ͨͩ͠ɼ|f | ͕େ͖͍΄Ͳ t Ͱ͋Δ͕֬ߴ͍ͱ͢Δ ϩδεςΟοΫؔ σ (α)
= 1 1 + e−α Λಋೖ͠ɼ (x′, y′) ͕ t = 1 Ͱ͋Δ֬Λ 0 < σ ( f ( x′, y′ )) < 1 ͱ͢Δ −4 −3 −2 −1 0 1 2 3 4 α 0.0 0.2 0.4 0.6 0.8 1.0 σ (α) ϩδεςΟοΫؔͷάϥϑ 8
ϩδεςΟοΫճؼ, ධՁج४ʢ࠷ਪఆʣ ܇࿅σʔλ͕ಘΒΕΔ֬ P Λ࠷େʹ͢Δ wi ΛٻΊΔ p(x, y) =
σ(x0 + w1x + w2y) P = N ∏ i p (xi , yi )tn {1 − p (xi , yi )}1−tn ܇࿅σʔλ࠷ൃੜ͕֬ߴ͍σʔλͰ͋ΔͱԾఆ͍ͯ͠Δ 9
Regression
ճؼੳ, ϞσϧͱධՁج४ʢ࠷খೋʣ σʔλ͕ M ࣍ଟ߲ࣜ f ʹै͏ͱͯ͠ɼೋޡࠩ ED Λ࠷খʹ͢Δ ύϥϝλ
wi ΛબͿ f (x) = M ∑ m=0 wmxm ED = 1 2 N ∑ n=1 {f (xn) − tn}2 0 2 4 6 8 10 −15 −10 −5 0 5 ground truth degree 3 degree 4 degree 5 training points M ∈ {3, 4, 5} ͷଟ߲ࣜۙࣅྫ 10
Clustering
k ฏۉ๏ σʔλؒͷڑΛٻΊɼσʔλΛ k ݸͷΫϥελʹ͚Δ −2 −1 0 1 2
3 0 1 2 3 4 5 σʔλू߹ −2 −1 0 1 2 3 0 1 2 3 4 5 cluster 1 cluster 2 cluster 3 centroids k = 3 ͷΫϥελ Ϋϥελ͝ͱʹදσʔλΛܾΊɼදͷۙ͘ͷσʔλू߹Ͱ ΫϥελΛ࡞Δ 11
k ฏۉ๏ͷΞϧΰϦζϜ ೖྗ: σʔλू߹ D = { x(1), x(2), ·
· · , x(|D|) } : Ϋϥελ k ແ࡞ҝʹ m1, m2 · · · , mk ΛܾΊΔ until ऩଋ foreach x(i) ∈ D cmax = arg max c sim ( x(i), mc ) σʔλू߹ͷׂ insert x(i)into cmax end foreach ∀c, mc = 1 |c| ∑ x(i)∈c x(i) දϕΫτϧΛ࠶ܭࢉ end until 12