loss function c(x, y) • Two approach standard RFF • naive: W1, . . . , WD iid ∼ ˆ q, ϕi D = [ϕ(xi, W1) · · · ϕ(xi, WD)]⊤ arg min θ { n ∑ i=1 c ( 1 √ D θ⊤ϕi D , yi ) + r(θ) } (13) Proposed • efficient for D ≥ nnz(ˆ q): For w1, . . . wNw ∈ P0 (used before), arg min θ { n ∑ i=1 c(θ⊤diag(ˆ q)1/2ϕi, yi) + r(θ) } (14) 15 / 24