â¾
4.1 An Overview of Classification
4.2 Why Not Linear Regression?
4.3 Logistic Regression
4.3.1 The Logistic Model
4.3.2 Estimating the Regression Coefficients
4.3.3 Making Predictions
4.3.4 Multiple Logistic Regression
4.3.5 Logistic Regression for >2 Response Classes
4.4 Linear Discriminant Analysis
4.4.1 Using Bayes’ Theorem for Classification
4.4.2 Linear Discriminant Analysis for p=1
4.4.3 Linear Discriminant Analysis for p>1
4.4.4 Quadratic Discriminant Analysis
4.5 A Comparison of Classification Methods
4.6 Lab: Logistic Regression, LDA, QDA, and KNN
4.7 Exercises 2
FŸTŹ20%ŸĘÍƪDŽƦŸƲƣƫƞƽƹŬƌ
ŢŸƌřŶƱƽƺDŽƦŝ,ű0ŜŮűŘƏYHŹƵƖƤ
0ŐQƓö<ŶěóŦƏŢųŝŲŞƏƌ
ƍƐūƪDŽƦŜƍĂ340²ŶƌƏÆfZÛƓœ
ŸgĂŲéŦƌ
4.4.2 Linear Discriminant Analysis
for D = 1
55
Slide 56
Slide 56 text
gŃŶŹ!ŝÀĖ0rŶřų0ŜŮűŘūųŤűƇ
J", … , JG, :", … , :G, E#ƓfŦƏ ĔŝŗƏƌ
Ă340²(LDA)ŹƱƽƺDŽƦŸf&ƓţŮŞŸy
ŶĴÚŤűƵƖƤ0ŐQƓƏƌ
Ÿf&ŝƌşÚŘƍƐƏƌ
4.4.2 Linear Discriminant Analysis
for D = 1
56
Slide 57
Slide 57 text
NŹ,ĘÍ&Ÿ¢Ŕ N;
ŹƝƽƣ<ŸĘÍ&Ÿ¢
ŸƓfŦƏ ĔŝŗƏƌ
J;
ŹƝƽƣ<Ÿ<øŵuVŬƌ
E#ŹGƝƽƣŸ¼°0¡Ÿ8ĹuVųƅƐƏƌ
:;
ŹƪDŽƦŸ7HŲŘŘƌ
ůƄƎ:
O; = N;/NŬƌ
4.4.2 Linear Discriminant Analysis
for D = 1
57
Slide 58
Slide 58 text
ąƎĮŦŠŴLDAŹĘÍ&ŝƝƽƣU¯ŸuV&Ɠ
ŮűŘűŔ-ıŸ0¡ƓůÀĖ0rŶřųf
ŦƏƌ
ũŤűŢƐƍŸƱƽƺDŽƦŸf&ƓƵƖƤ0ŐQŶĴ
ÚŦƏƌ
4.4.4ôŲŹƝƽƣũƐŪƐŝU¯Ÿ0¡E;
#ƓůųŘ
řţƍŶ{ŘfƓśşƌ
4.4.2 Linear Discriminant Analysis
for D = 1
58
Slide 59
Slide 59 text
4.4.3 Linear Discriminant Analysis
for D > 1
Ă340²(D > 1)
Slide 60
Slide 60 text
4.4.3 Linear Discriminant Analysis
for D > 1
LDAƓē¢Ÿġ§\¢ŸYHŶ|ŦƏƌ
ũŸūƆŶŔĘÍ&ŹŔGƝƽƣŝU¯ŸuVƵƝƫƿ
ųŔ,Ɲƽƣ-ıŸ0¡-0¡đ2Ɠů^\ĺÀĖ
0rŶřųfŦƏƌ
^\ĺÀĖ0rŹũƐŪƐŸ\¢ŝ1¾*ŸÀĖ0
rŶřųfŤűŔţƍŶũƐŪƐŸ\¢ŹäĽĽ$
Ɠůƌ
60
Slide 61
Slide 61 text
4.4.3 Linear Discriminant Analysis
for D > 1
D = 2Ÿ^\ĺÀĖ0rŸixĽ¢ŸƞƽƳƓéŦƌ
ŴŮŭŸTƇx"
ĨƄūŹx#
ĨŶũŮűTƓ1¤ŦƏų¤
ʼnŹ1¾*ŸÀĖ0rŶŵƏƌ
pŹVar !" = Var !#
ŲCor !", !# = 0Ŭƌ
FŹ!"
ų!#
Ŷ0.7ŸäĽŝŗƏƌ
61
Slide 62
Slide 62 text
4.4.3 Linear Discriminant Analysis
for D > 1
^\ĺÀĖ0rŸixĽ¢Ź
ŢƔŵŬƌ
!~V(J, Σ)Ůű¬şƌ
E ! = JŹD¾*ŸuVƵƝƫƿŬƌ
Cov ! = ΣŹD×DŸ0¡-0¡đ2Ŭƌ
62
Slide 63
Slide 63 text
=;(! = A)Ɠ Ŷ+ŤűŔţŮŞų
Iť»Ŷ¢ěóƓđřų
ųŵƏƌ
D = 1ŸųŞŹŢƔŵŬŮūŷ
4.4.3 Linear Discriminant Analysis
for D > 1
63
Slide 64
Slide 64 text
!ƓéŦƌ
ůŸ¸.ŹũƐŪƐŸ¯èÖ95%ŸŌWŬƌ
çĂŹƵƖƤŸÆfZÛŬƌ
GƝƽƣŜƍŸ20%ŸĘÍ&ųŔũƐŜƍLDAŲěóţ
ƐūÆfZÛ(œŸgĂ)ƓFŸTŶéŦƌ
4.4.3 Linear Discriminant Analysis
for D > 1
64
Slide 65
Slide 65 text
ƵƖƤŸÆfZÛŹH; A = HZ(A)ųŵƏAŸńHƓĒ
ŤűŘƏƌ
ůƄƎ
ƓÎūŦƌ(< ≠ \)
4.4.3 Linear Discriminant Analysis
for D > 1
65
Slide 66
Slide 66 text
ţŮŞŸƝǀƢƨƫƛDŽƬŸƪDŽƦŶLDAƓĴÚŦƏƌ
ġ§\¢ŹbalanceųstudentŬƌ
10000%ŸĜăƪDŽƦƓÚŘūƌ
ĜăƪDŽƦŸƘƽDŽÖŹ2.75%ŬŮūƌDž
ŒŘ÷xŶĕŚƏŠŴůÉŝ ĔŬƌ
4.4.3 Linear Discriminant Analysis
for D > 1
66
Slide 67
Slide 67 text
żųůƆ
ĜăƪDŽƦŸƘƽDŽÖŹŔgŃŸƪDŽƦŸƘƽDŽÖƌƎ
ƇŘ&ŶŵƏŸŹ~ÑŬƌ
ůƄƎŔ¥ŤŘƪDŽƦƓŮűŞűŢŸ0ŐQŶıŦų
ƘƽDŽÖŝŝƏŢųŝŲŞƏƌ
ĜăƪDŽƦŲřƄşŘş»ŵƱƽƺDŽƦƓfŤūŜƍŷ
ŢƐŹD ųNŸÃŝ_ŞŘųŔƌƎŏďŬƌ
RŸYHŹD = 2ŲN = 10000ŬŜƍƂųƔŴŊ
ŹŵŘųřŠŴ
4.4.3 Linear Discriminant Analysis
for D > 1
67
Slide 68
Slide 68 text
žūůƆ
ĜăƪDŽƦŸřŭdefaultŸ7HŹ3.33%ŬŮūƌ
ůƄƎŔþjdefaultŶŵƍŵŘųÍŦƏ0ŐQŲƇ
3.33%ŸƘƽDŽÖŶŵƏƌ
LDAŸ0ŐQƌƎlŤŬŠŒŘƘƽDŽÖŶŵƏƌ
4.4.3 Linear Discriminant Analysis
for D > 1
68
Slide 69
Slide 69 text
ƘƽDŽŶŹíŐŗƏƌ
defaultŦƏŶdefaultŤŵŘų34ŦƏƘƽDŽųŔ
defaultŤŵŘŶdefaultŦƏų34ŦƏƘƽDŽ
ƘƽDŽŸíŐŶůŘűćŚƏŢųŹ_
Ŭƌ
ŸƌřŵĒ(ËIđ2)ŝ#5Ŭƌ
4.4.3 Linear Discriminant Analysis
for D > 1
69
Slide 70
Slide 70 text
ĒŶƌƏųLDAŹ104ŝdefaultŦƏųŤūƌ
gŃŶŹ81ŝdefaultŤű23ŝdefaultŤŵŜŮūƌ
ůƄƎdefaultŤŵŜŮū9667Ÿřŭ23ŝļIJŚūƌ
÷xŒŘ»ŶƅŚƏƌŷ
4.4.3 Linear Discriminant Analysis
for D > 1
70
Slide 71
Slide 71 text
ŬŠŴdefaultŤū333Ÿřŭ252ƇĕįŤūƌDž
,áŵƘƽDŽÖŹŘ»ŶĕŚƏŠŴdefaultŤū
Ÿ ŸƘƽDŽÖŹŒŘƌ
ƾƣƝŝŒŘƓÔfŤƌřųŤűŘƏƛDŽƬêŸĘ
ÏŜƍŦƏųŔ252/333=75.7%ųŘř¢bŹDŠ+Ɛƍ
ƐŵŘEĉŝŗƏƌ
4.4.3 Linear Discriminant Analysis
for D > 1
71
Slide 72
Slide 72 text
0ŐŸĉŹĐØdųŜÙÓdŲƇĹĔŬƌ
xųÔÞxųŘřÚğŝŗƏƌ
xŹßŝŗƏųŞŶ·µŲÀŤşłŶŵƏÖŲŗ
ƎŔŢŸYHŹ24.3%ųŘ
ÔÞxŹßŝŵŘųŞŶ·µŲÀŤşŀŶŵƏÖ
ŲŗƎŔŢŸYHŹ99.8%ųŒŘ
4.4.3 Linear Discriminant Analysis
for D > 1
72
Slide 73
Slide 73 text
ŵƔŲŢƔŵŶxŝŘŸŜLJ
ƵƖƤ0ŐQŹ,űŸ0ŐQŸ ŲƇĠqÖŝ
Řƌ(ƜƗƣƻƪƿŝÀŤŘYH)
ŢƐŹŔƘƽDŽŝŴŸƝƽƣŜƍ±ūŜŶŜŜƒƍŧŔ
ũŸƘƽDŽŸÿNƓkĿŶŚƏųŘřKŬƌ
¦ƛDŽƬêŹgŃŶŹdefaultųŵƏƓļIJŮű
0ŐŦƏŢųŹĶŠūŘŜƇŤƐŵŘƌ
ũřŘŮūYHƛDŽƬêŸƮDŽƤŶjŤűLDAƓ\
«ŤūƎŦƏƌ
4.4.3 Linear Discriminant Analysis
for D > 1
73
Slide 74
Slide 74 text
ƵƖƤ0ŐQŹ
èÖD;(!)ŝƇ_ŞşŵƏƝƽƣ
ŶĘÍ&Ɠ7Ǝ~űƏƌ
!Śź2ƝƽƣŶ0ŠƏdefaultŸYH
Ÿ¨ŹŔĘÍ&Źdefault=YesŶ0ŐţƐƏƌ
ůƄƎŔ
èÖŶjŤű50%Ÿľ&Ų0ŐŤűŘƏų
ĚŚƏŷ
defaultŲŗƏŸĠ·æÖƓšūŘYHŹŔŢŸľ
&ƓšƏųčŘŷ
4.4.3 Linear Discriminant Analysis
for D > 1
74
Slide 75
Slide 75 text
ľ&Ɠ20%ŶšūYHŸû³ƓĒŶéŦƌ
defaultŤū333ŸřŭĕįŤūŸŹ138(41.4%)
ŬŮūƌ
ţŮŞŸ50%Ÿľ&Ÿ¨Ź75.7%ŬŮūƌŷ
ŧŘſƔPŤūŷ
4.4.3 Linear Discriminant Analysis
for D > 1
75
Slide 76
Slide 76 text
ŬŠŴƇŭƑƔčŘŢųźŜƎťƈŵŘƌ
,ŸƘƽDŽÖųŤűŹ2.75%Ŝƍ3.73%ŶŝŮūƌ
ƝǀƢƨƫêŜƍŦƏųľ&20%Ÿ¦ŝčŘŜƇŤƐ
ŵŘƌŷ
4.4.3 Linear Discriminant Analysis
for D > 1
76
Slide 77
Slide 77 text
ľ&Ɠ\«ŤūYHŸƘƽDŽÖŸƞƽƳƓīŨƏƌ
½ĨŝţŮŞŸľ&Ŭƌ
ŘƑŘƑŵƘƽDŽÖŝľ&ŸĽ¢ųŤűƴǁƨƫţƐű
ŘƏƌ
4.4.3 Linear Discriminant Analysis
for D > 1
77
Slide 78
Slide 78 text
œŘgĂŝ,ŸƘƽDŽÖŬƌ
ŇŘçĂŝdefaultŦƏŶdefaultŤŵŘų34ŤūƘ
ƽDŽÖŬƌ
ƚǀǂƢŸÏĂŹdefaultŤŵŘŶdefaultŦƏų34
ŤūƘƽDŽÖŬƌ
4.4.3 Linear Discriminant Analysis
for D > 1
78
Slide 79
Slide 79 text
ľ&ŝ0.5Ÿ¨ŝ,ŸƘƽDŽÖŹÝŘƌ
ŬŠŴŔŴŸƘƽDŽÖƓšūŘŜŶƌŮűľ&Ź\
:ŦƏƌ
ƇŭƑƔŔŶƇ»ŖŵXŜƍľ&ƓÆfŦƏƌ
4.4.3 Linear Discriminant Analysis
for D > 1
79
Slide 80
Slide 80 text
ROCªĂƓéŦƌ
ĄĨŹdefaultŦƏŶdefaultŦƏų34ŤūÖŬƌ
ůƄƎŔxŸŢųŬƌ
4.4.3 Linear Discriminant Analysis
for D > 1
½ĨŹdefaultŤŵŘŶdefault
ŦƏų34ŤūÖŬƌ
ůƄƎŔ1-ÔÞxŬƌ
ŗƍƊƏľ&ŲŔůŸÖƓě
óŤűƴǁƨƫŤūŸŝROCªĂ
Ŭƌ
80
Slide 81
Slide 81 text
4.4.3 Linear Discriminant Analysis
for D > 1
81
Slide 82
Slide 82 text
4.4.3 Linear Discriminant Analysis
for D > 1
ŇĂŸŸʼnîŝ_ŞŠƐźŔčŘ0ŐŝŲŞƏƌ
ũŸʼnîŹAUCųLźƐƏƌ
RŸYHAUCŹ0.95ųňtŶŒŘƌ
£ƆŸçĂŹŔŗƍƊƏľ&Ŷ
śŘűůŸÖŝòŤŘĂŬƌ
6ƶDŽƢŸTŲŘřųŔ0rŝ
ĹŵŮűŘƏųŞųŜŶŵƏƌ
ŢŸYH34Ÿû³ŹŸ
XƇŚŵŘƌ
82
Slide 83
Slide 83 text
4.4.3 Linear Discriminant Analysis
for D > 1
ČáŶ0ŐŤūû³ŹĒŸƌřŶŵƏƌ
“dž”Ɠ·/ŤūŘƌŔjïġŬƌ
“DŽ”ũŸAjŬƌŔsÐġŬƌ
defaultƪDŽƦŸYHŹ”dž”ŝdefault=YesŬƌ
83
Slide 84
Slide 84 text
4.4.3 Linear Discriminant Analysis
for D > 1
ĹĔŵýěĺƓŸĒŶéŦƌ
False Pos. rateųTrue Pos. rateŸ0ÂŹGƝƽƣŸgŃ
Ÿ%¢Ŭƌ
Pos. pred. valueųNeg. Pred. valueŸ0ÂŹGƝƽƣŸ
ÍŤū%¢Ŭƌ
84