the K-fold cross validation References K-fold cross validation ▶ データセットを K 分割して学習と評価 K 回くりかえす ▶ 計算時間はかかるけれど学習データをフル活用できていそう Dataset Train set Validation set Train set Validation set Validation set Train set K-fold 7/14
the K-fold cross validation References Moments of the error discrepancy Theorem For all m ≥ 1, we have E [ |ˆ eK − ¯ eK|m ] ≤ E [ |ˆ e1 − ¯ e1 |m ] . (1) Proof. Jensen の不等式から,すべての凸関数 f と実数 xi ∈ R について f x1 + x2 + · · · + xn n ≤ f(x1) + f(x2) + · · · + f(xn) n が成り立つ.|x|m は ∀m ≥ 1 で凸関数なので, |ˆ eK − ¯ eK|m = ˆ e1 − ¯ e1 + · · · ˆ eK − ¯ eK K m ≤ |ˆ e1 − ¯ e1|m + · · · + |ˆ eK − ¯ eK|m K = E |ˆ e1 − ¯ e1|m 10/14
the K-fold cross validation References Strict Inequality for the Moments of the Error Discrepancy Theorem Suppose the X is finite, the learning algorithm is insensitive to example ordering, and Pr[ˆ e1 ̸= ¯ e1 ] > 0. Then, for 2 < k < n and m ≥ 2, we have E [ |ˆ eK − ¯ eK|m ] < E [ |ˆ e1 − ¯ e1 |m ] . (2) Proof. (証明のスケッチ) .Jensen の不等式において等式が成り立つのは,∀i = {1, . . . , K} で ˆ ei − ¯ ei が等しいときに限 る.つまり,ˆ ei − ¯ ei ̸= ˆ ej − ¯ ej となるような i, j を 1 組でも見つければ証明は完了する.ここで前述した順序不変 性や Pr[ˆ e1 ̸= ¯ e1] > 0 が効いて,定理が証明される. 12/14
the K-fold cross validation References References Avrim Blum, Adam Kalai, and John Langford. Beating the hold-out: Bounds for k-fold and progressive cross-validation. In Proceedings of the twelfth annual conference on Computational learning theory, pages 203–208, 1999. Michael Kearns and Dana Ron. Algorithmic stability and sanity-check bounds for leave-one-out cross-validation. Neural computation, 11(6):1427–1453, 1999. 14/14