Dai, A. M., Olah, C., & Le, Q. V. (2015). Document embedding with paragraph vectors. arXiv. https://arxiv.org/abs/1507.07998

Wikipedia nearest neighbours to “Lady Gaga” - “American” + “Japanese”: Ayumi Hamasaki

2024 5-6 t — 2024-12-16 – p.14/39
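Two populations: $N(\mu_A, \sigma_A^2)$ and $N(\mu_B, \sigma_B^2)$

Samples $x_A$, $x_B$ with sample means $\bar{x}_A$, $\bar{x}_B$

Difference of the sample means: $(\bar{x}_A - \bar{x}_B)$

Mean of $(\bar{x}_A - \bar{x}_B)$: $(\mu_A - \mu_B)$

Variance of $(\bar{x}_A - \bar{x}_B)$, for independent samples of sizes $n_A$ and $n_B$:

\[ \frac{\sigma_A^2}{n_A} + \frac{\sigma_B^2}{n_B} \]

so that

\[ (\bar{x}_A - \bar{x}_B) \sim N\!\left(\mu_A - \mu_B,\ \frac{\sigma_A^2}{n_A} + \frac{\sigma_B^2}{n_B}\right) \]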
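The mean and variance of the difference of sample means can be checked with a small simulation. The sketch below uses only the Python standard library; the population parameters, sample sizes, trial count and the function name `simulate_mean_differences` are hypothetical choices made for illustration and do not come from the slides.

```python
import random
import statistics


def simulate_mean_differences(mu_a, sigma_a, n_a, mu_b, sigma_b, n_b,
                              trials=10000, seed=0):
    """Draw independent samples from two normal populations many times
    and return the list of differences of their sample means."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(trials):
        mean_a = statistics.fmean(rng.gauss(mu_a, sigma_a) for _ in range(n_a))
        mean_b = statistics.fmean(rng.gauss(mu_b, sigma_b) for _ in range(n_b))
        diffs.append(mean_a - mean_b)
    return diffs


# Hypothetical population parameters and sample sizes (illustration only).
mu_a, sigma_a, n_a = 10.0, 2.0, 20
mu_b, sigma_b, n_b = 8.0, 3.0, 30

diffs = simulate_mean_differences(mu_a, sigma_a, n_a, mu_b, sigma_b, n_b)

# Theoretical values from the formulas above.
theory_mean = mu_a - mu_b
theory_var = sigma_a ** 2 / n_a + sigma_b ** 2 / n_b

print("mean of differences:    ", statistics.fmean(diffs), "theory:", theory_mean)
print("variance of differences:", statistics.pvariance(diffs), "theory:", theory_var)
```

With enough trials the simulated mean and variance should land close to the theoretical values, up to sampling noise.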
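Example: sample size $n = 20$, degrees of freedom $df = n - 1 = 19$. The population parameters $\mu$ and $\sigma$ are unknown.

With $\sigma$ known, the 95% confidence interval for $\mu$ comes from $-z_{0.05} \le z \le +z_{0.05}$ (equation 5), where $z \sim N(0, 1^2)$.

Replacing $\sigma$ by the sample standard deviation $s$ (small $n$): $z \to t$, and $t$ follows the t distribution.

The t distribution depends on $n$ through the degrees of freedom $df$ (notation: $t(df)$). Critical value: $t_{0.05}(df)$.

95% confidence interval:

\[ \left[\ \bar{x} - t_{0.05}(df) \times \frac{s}{\sqrt{n}},\quad \bar{x} + t_{0.05}(df) \times \frac{s}{\sqrt{n}}\ \right] \]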
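The interval above can be computed without a statistics package. The following is a minimal sketch, assuming that $t_{0.05}(df)$ denotes the two-sided 5% point (the value with 95% of the t distribution between $-t$ and $+t$). It finds that value by integrating the t density numerically (Simpson's rule) and bisecting. The function names and the 20 data values are hypothetical.

```python
import math
import statistics


def t_two_sided_critical(df, alpha=0.05, steps=2000):
    """Return t* > 0 such that P(-t* <= T <= t*) = 1 - alpha for T ~ t(df).

    Uses Simpson's rule on the t density plus bisection (stdlib only).
    """
    # Normalising constant of the Student t density.
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))

    def pdf(x):
        return c * (1 + x * x / df) ** (-(df + 1) / 2)

    def central_mass(t):
        # Integral of the density over [-t, t]: twice the integral over [0, t].
        h = t / steps
        total = pdf(0.0) + pdf(t)
        for i in range(1, steps):
            total += (4 if i % 2 else 2) * pdf(i * h)
        return 2 * total * h / 3

    lo, hi = 0.0, 100.0
    for _ in range(100):
        mid = (lo + hi) / 2
        if central_mass(mid) < 1 - alpha:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def t_confidence_interval(sample, alpha=0.05):
    """Two-sided (1 - alpha) confidence interval for the population mean."""
    n = len(sample)
    x_bar = statistics.mean(sample)
    s = statistics.stdev(sample)  # sample standard deviation (divides by n - 1)
    half_width = t_two_sided_critical(n - 1, alpha) * s / math.sqrt(n)
    return x_bar - half_width, x_bar + half_width


# Hypothetical sample of n = 20 measurements (df = 19), for illustration only.
sample = [52.1, 47.3, 55.0, 49.8, 51.2, 46.9, 53.4, 50.7, 48.5, 54.1,
          49.0, 51.9, 47.8, 52.6, 50.2, 48.9, 53.0, 49.5, 51.4, 50.0]

lower, upper = t_confidence_interval(sample)
print("t critical value for df = 19:", t_two_sided_critical(19))
print("95% confidence interval:", (lower, upper))
```

In practice a library routine such as `scipy.stats.t.ppf(0.975, df)` would replace the hand-written quantile search; the standard-library version is used here only to keep the sketch self-contained.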
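For a sample of size $n$ (degrees of freedom $df$), $t$ follows Student's t distribution. With 95% probability (e.g. sample size 20, degrees of freedom 19):

\[ -t_{0.05}(df) \le t = \frac{(\bar{x} - \mu)\sqrt{n}}{s} \le +t_{0.05}(df) \]

Solving for $\mu$:

\begin{align}
-t_{0.05}(df) &\le \frac{(\bar{x} - \mu)\sqrt{n}}{s} \tag{6}\\
\Rightarrow\ -t_{0.05}(df) \times \frac{s}{\sqrt{n}} &\le \bar{x} - \mu \tag{7}\\
\Rightarrow\ \mu &\le \bar{x} + t_{0.05}(df) \times \frac{s}{\sqrt{n}} \quad \text{(upper limit)} \tag{8}\\
\frac{(\bar{x} - \mu)\sqrt{n}}{s} &\le +t_{0.05}(df) \tag{9}\\
\Rightarrow\ \bar{x} - \mu &\le +t_{0.05}(df) \times \frac{s}{\sqrt{n}} \tag{10}\\
\Rightarrow\ \bar{x} - t_{0.05}(df) \times \frac{s}{\sqrt{n}} &\le \mu \quad \text{(lower limit)} \tag{11}
\end{align}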
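The derivation says that "$t$ lies between $\pm t_{0.05}(df)$" and "$\mu$ lies inside the interval" are the same event. The sketch below checks this numerically on simulated samples and also estimates how often the interval covers $\mu$. It assumes the standard table value 2.093 for the two-sided 5% point of $t(19)$; the population parameters, the trial count and the function name `check_equivalence` are hypothetical.

```python
import math
import random
import statistics

# Two-sided 5% point of t(19), taken from a standard t table (assumed value).
T_CRIT_DF19 = 2.093


def check_equivalence(mu=50.0, sigma=10.0, n=20, trials=5000, seed=1):
    """Simulate samples of size n and compare the two formulations.

    Returns (mismatches, coverage): the number of samples where the two
    conditions disagree, and the fraction of intervals that contain mu.
    """
    rng = random.Random(seed)
    mismatches = 0
    covered = 0
    for _ in range(trials):
        sample = [rng.gauss(mu, sigma) for _ in range(n)]
        x_bar = statistics.fmean(sample)
        s = statistics.stdev(sample)

        # Starting point of the derivation: the t statistic is within +/- t*.
        t = (x_bar - mu) * math.sqrt(n) / s
        t_in_range = -T_CRIT_DF19 <= t <= T_CRIT_DF19

        # End point, equations (8) and (11): mu is within the interval.
        half_width = T_CRIT_DF19 * s / math.sqrt(n)
        mu_in_interval = x_bar - half_width <= mu <= x_bar + half_width

        if t_in_range != mu_in_interval:
            mismatches += 1
        if mu_in_interval:
            covered += 1
    return mismatches, covered / trials


mismatches, coverage = check_equivalence()
print("samples where the two conditions disagree:", mismatches)
print("fraction of intervals containing mu:", coverage)
```

The two conditions should agree on every sample, and the coverage fraction should come out near 0.95, up to simulation noise.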