CW: Confidence Weighted algorithm [Dredze+ 2008]
‣ Idea: weights for frequent features are more “confident” than those for rare ones
‣ Consider a Gaussian distribution for the weights; update its mean and variance
Figure from http://kazoo04.hatenablog.com/entry/2012/12/20/000000 (labels: Previous: no memory. CW: more “confident”.)
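To make the Gaussian idea concrete: if the weights follow N(μ, Σ), the signed margin y(w·x) is itself Gaussian with mean M = y(μ·x) and variance V = xᵀΣx, so the probability of a correct prediction is Φ(M/√V). The sketch below assumes a diagonal Σ stored as a list of variances; the function name `prob_correct` and the toy numbers are made up for illustration.

```python
import math
from statistics import NormalDist

def prob_correct(mu, var, x, y):
    """P[y * (w . x) >= 0] when w ~ N(mu, diag(var)); y is +1 or -1.

    Assumes the margin variance is strictly positive.
    """
    mean_margin = y * sum(m * xi for m, xi in zip(mu, x))
    var_margin = sum(s * xi * xi for s, xi in zip(var, x))
    return NormalDist().cdf(mean_margin / math.sqrt(var_margin))

# Same mean weight for both features, but feature 0 was seen often
# (small variance) while feature 1 is rare (large variance).
mu = [1.0, 1.0]
var = [0.1, 1.0]
p_frequent = prob_correct(mu, var, [1.0, 0.0], +1)
p_rare = prob_correct(mu, var, [0.0, 1.0], +1)
```

With equal mean weights, the prediction that relies on the low-variance (frequent) feature comes out as the more confident one.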
‣ Minimally update
‣ Correctly classify with prob ≥ η
It has a closed-form solution (cf. [Dredze+ 2008]).
Often only the diagonal of Σ is used instead of the full Σ to make it simpler (not much performance change).
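A minimal sketch of one CW step, under stated assumptions: it uses the variance-linearized constraint M ≥ φV with φ = Φ⁻¹(η), which to my understanding is the form whose closed-form solution is given in [Dredze+ 2008], and it keeps the full (symmetric) Σ as a list of rows. The name `cw_update` and the default η = 0.9 are illustrative choices, not taken from the slides.

```python
import math
from statistics import NormalDist

def cw_update(mu, sigma, x, y, eta=0.9):
    """One CW step. mu: list, sigma: symmetric matrix as list of rows, y: +1/-1."""
    phi = NormalDist().inv_cdf(eta)      # needs eta > 0.5 so that phi > 0
    n = len(x)
    sx = [sum(sigma[i][j] * x[j] for j in range(n)) for i in range(n)]  # Sigma x
    m = y * sum(mu[i] * x[i] for i in range(n))   # mean margin M
    v = sum(x[i] * sx[i] for i in range(n))       # margin variance V
    if v <= 0.0:
        return mu, sigma
    # Positive root of 2*phi*V^2*a^2 + V*(1 + 2*phi*M)*a + (M - phi*V) = 0.
    b = 1.0 + 2.0 * phi * m
    alpha = (-b + math.sqrt(b * b - 8.0 * phi * (m - phi * v))) / (4.0 * phi * v)
    alpha = max(0.0, alpha)
    if alpha == 0.0:                     # constraint already satisfied
        return mu, sigma
    new_mu = [mu[i] + alpha * y * sx[i] for i in range(n)]
    # Sigma^-1 += 2*alpha*phi*x*x^T, applied to Sigma via Sherman-Morrison.
    k = 2.0 * alpha * phi / (1.0 + 2.0 * alpha * phi * v)
    new_sigma = [[sigma[i][j] - k * sx[i] * sx[j] for j in range(n)]
                 for i in range(n)]
    return new_mu, new_sigma

x, y = [1.0, 2.0], +1
mu, sigma = cw_update([0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]], x, y)
```

After the step, the constraint holds with equality on the example just seen, and the variance along x has shrunk. The diagonal-only variant mentioned above would keep just the diagonal of Σ; it is cheaper, but the equality then holds only approximately.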
Inside AROW
‣ Minimally update
‣ Minimize loss
‣ More data, more confident
The values of the hyper-parameters λ are not so important (e.g. 0.1).
It has a closed-form solution (cf. [Crammer+ 2009]).
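The closed-form AROW step can be sketched as follows, assuming a diagonal Σ stored as a list of variances. The single parameter `r` is an assumption of this sketch: in the formulation of [Crammer+ 2009], as far as I recall, it corresponds to 1/(2λ) when both λ's are set equal. The name `arow_update` is illustrative.

```python
def arow_update(mu, var, x, y, r=1.0):
    """One AROW step with diagonal covariance. y is +1 or -1, r > 0."""
    margin = sum(m * xi for m, xi in zip(mu, x))
    loss = max(0.0, 1.0 - y * margin)            # hinge loss on the mean margin
    if loss == 0.0:
        return mu, var                           # margin >= 1: no update
    v = sum(s * xi * xi for s, xi in zip(var, x))  # confidence x^T Sigma x
    beta = 1.0 / (v + r)
    alpha = loss * beta
    # Mean moves along Sigma x, scaled by how badly the margin was violated.
    new_mu = [m + alpha * y * s * xi for m, s, xi in zip(mu, var, x)]
    # Variance shrinks only for features present in x.
    new_var = [s - beta * (s * xi) ** 2 for s, xi in zip(var, x)]
    return new_mu, new_var

mu, var = arow_update([0.0, 0.0], [1.0, 1.0], [1.0, 0.0], +1, r=1.0)
```

In this toy call the first feature's variance drops from 1.0 to 0.5 while the unseen second feature keeps its variance of 1.0, which is the "more data, more confident" behaviour.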
SCW: Soft Confidence Weighted learning [Wang+ 2012]
‣ Large margin training
‣ Confidence weighting
‣ Handling non-separable data
‣ Adaptive margin
Can be seen as the PA-I/PA-II equivalent of CW.
Summary
‣ CW: considers the confidence of weights. Too aggressive / weak to noise
‣ AROW / SCW: soften the constraint of CW
‣ AROW and SCW: comparable performance
‣ SCW if you don't mind finding optimal hyper-parameters…
‣ AROW otherwise…
… but it all depends on the data sets!