Slide 15
Slide 15 text
0.
Discretization and entropy regularization
Primal problem
minimize
M
⟨M,C⟩+
1
2
T
∑
t=1
∥GPt(M)−yt∥2
2
+εD(M)
• With n discretization points in each margin, M and C are tensors in
RnT
:= Rn×n×...×n.
• D(M) = ∑i1
,...,iT
[M]i1
,...,iT
log[M]i1
,...,iT
−1 +1.
• Impossible to solve directly, but the entropy term leads to a nice dual
problem.
Dual problem
maximize
λ
λ
λt ∈RN ,t=1,...,T
−ε⟨U,K⟩−
1
2
T
∑
t=1
∥λ
λ
λt∥2
2
+
T
∑
t=1
⟨λ
λ
λt,yt⟩
• Here U = u1 ⊗u2 ⊗...⊗uT , with ut = exp 1
ε
GT λ
λ
λt . K = exp −1
ε
C .
• The primal solution is M = U⊙K.
• Dimension of dual is considerably smaller than that of the primal.
• The dual can be solved efficiently, with computational bottleneck being
computation of projections Pt (U⊙K). Often, structure in K can be
exploited.