Physical regularization of Optimal Transport

Physical Regularization of Optimal Transport Nicolas Papadakis Transport optimal en
apprentissage statistique et traitement du signal 1 / 50

Optimal Transport (OT) Basic ingredients • OT deﬁnes a family
of distances between densities of probability • Transport a mass ρ0 onto ρ1: • Deﬁne a cost C(x, y) of mass transport between locations x and y • OT: application with mimimal global cost that transfers ρ0 onto ρ1 • If C(x, y) = ||x − y||p, Lp Wasserstein distance • Concave cost (Economy), Truncated cost (Computer Vision) 1 / 50

Applications in IP, CV and ML Robust dissimilarity measure (Optimal
transport cost) • Image retrieval (EMD) [Rubner et al. ’00] • 3D shape recognitions [Ruzon and Tomasi, ’01] • SIFT matching [Pele and Werman ’08] • Object segmentation [Ni et al. ’09, Rabin et al. ’11, ’15], • Denoising [Burger et al. ’12, Tartavel et al. ’16] • Loss function [Frogner et al. ’15, Genevay et al. ’17] • Generative models [Arjovsky et al. ’17] 2 / 50

transport cost) • Image retrieval (EMD) [Rubner et al. ’00] • 3D shape recognitions [Ruzon and Tomasi, ’01] • SIFT matching [Pele and Werman ’08] • Object segmentation [Ni et al. ’09, Rabin et al. ’11, ’15], • Denoising [Burger et al. ’12, Tartavel et al. ’16] • Loss function [Frogner et al. ’15, Genevay et al. ’17] • Generative models [Arjovsky et al. ’17] Why is it robust? Discrete bin-to-bin metrics are not informative for disjoint supports 2 / 50

transport cost) • Image retrieval (EMD) [Rubner et al. ’00] • 3D shape recognitions [Ruzon and Tomasi, ’01] • SIFT matching [Pele and Werman ’08] • Object segmentation [Ni et al. ’09, Rabin et al. ’11, ’15], • Denoising [Burger et al. ’12, Tartavel et al. ’16] • Loss function [Frogner et al. ’15, Genevay et al. ’17] • Generative models [Arjovsky et al. ’17] Why is it robust? Discrete bin-to-bin metrics are not informative for disjoint supports Transport map T explains how far are the distributions 2 / 50

Optimal Transport Map • The transport map: • Interpolate between
densities, compute barycenters or geodesics in the Wasserstein space ρ0 ρ1 3 / 50

Applications in IP, CV and ML Tool for matching/interpolation (Optimal
transport map) • Image interpolation, registration [Angenent et al. ’04] Medical image registration [Rehman et al. ’09] • Color transfer [Delon, ’04, Pitié et al. ’07, Bonneel et al. ‘11] • Shape matching [Rabin et al. ’10, Schmitzer and Schnörr ’14] • Texture synthesis [Xia et al. ’13, Galerne et al. ’18, Leclaire et al. ’19] • Geodesic PCA [Bigot et al. ‘13, Seguy et al. ‘15, Cazelles et al. ‘18] • Domain adaptation [Courty et al. ’15, Redko et al. ’17] • Generative models [Seguy et al. ’18] 4 / 50

Applications in IP, CV and ML Tool for matching/interpolation (Optimal
transport map) • Image interpolation, registration [Angenent et al. ’04] Medical image registration [Rehman et al. ’09] • Color transfer [Delon, ’04, Pitié et al. ’07, Bonneel et al. ‘11] • Shape matching [Rabin et al. ’10, Schmitzer and Schnörr ’14] • Texture synthesis [Xia et al. ’13, Galerne et al. ’18, Leclaire et al. ’19] • Geodesic PCA [Bigot et al. ‘13, Seguy et al. ‘15, Cazelles et al. ‘18] • Domain adaptation [Courty et al. ’15, Redko et al. ’17] • Generative models [Seguy et al. ’18] Today: Use of the transport map for Image Processing applications 4 / 50

Formulations, Numerical methods, Limitations Continuous [Benamou - Brenier ’00] Semi-discrete
[Mérigot ’11] Discrete [Cuturi ’13] 5 / 50

Formulations, Numerical methods, Limitations Continuous Semi-discrete Discrete Irregularity of the
transport map Interpolation µt between images: ρ0 ρ1 5 / 50

transport map Interpolation µt between images: ρ0 ρ1 ⇒ Objects contained in the scene are not preserved 5 / 50

transport map Exact Wasserstein for GAN 5 / 50

transport map Exact Wasserstein for GAN Generator 5 / 50

transport map Exact Wasserstein for GAN Wasserstein discriminator Generator 5 / 50

transport map Exact Wasserstein for GAN ⇒ Over-ﬁtting 5 / 50

transport map Transfer of colors between images 5 / 50

transport map Transfer of colors between images ⇒ Artifacts appear with exact prescription of color histograms 5 / 50

Formulations, Numerical methods, Limitations Continuous Semi-discrete Discrete Objective: Generalized OT
models with regularized transport maps 5 / 50

Formulations, Numerical methods, Limitations Continuous Semi-discrete Discrete • Images: densities
on support Ω • Mass transport in a ﬂuid mechanics framework on Ω Velocity ﬁeld T : Ω → Ω 5 / 50

Formulations, Numerical methods, Limitations Continuous Semi-discrete Discrete • Images: densities
on support Ω • Mass transport in a ﬂuid mechanics framework on Ω Velocity ﬁeld T : Ω → Ω • Distributions of image features • Transport between normalized histograms of size N and M Coupling matrix P of size M × N 5 / 50

Overview Part I - Continuous formulation • Dynamic optimal transport
• Generalization of the transport cost • Non-convex model with physical priors ⇒ Application to data interpolation Part II - Discrete formulation • Relaxation and regularization of static transport matrix • Non-convex model to cancel mass spreading ⇒ Application to color transfer 6 / 50

Part I Continuous Optimal Transport 7 / 50

Context • National project on numerical algorithms for optimal transport
• Collaborations with oceanographers: Sea Surface Height: creation of vortexes in Cap Point (output of model NEMO) • Objective: Image interpolation • Problem: How to deal with the coast?p 8 / 50

• Collaborations with oceanographers: Sea Surface Height: creation of vortexes in Cap Point (output of model NEMO) • Objective: Image interpolation • Problem: How to deal with the coast?pOptical ﬂow 8 / 50

• Collaborations with oceanographers: Sea Surface Height: creation of vortexes in Cap Point (output of model NEMO) • Objective: Image interpolation • Problem: How to deal with the coast?p (((((( hhhhhh Optical ﬂow 8 / 50

• Collaborations with oceanographers: Sea Surface Height: creation of vortexes in Cap Point (output of model NEMO) • Objective: Image interpolation • Problem: How to deal with the coast?p (((((( hhhhhh Optical ﬂow Discrete OT 8 / 50

• Collaborations with oceanographers: Sea Surface Height: creation of vortexes in Cap Point (output of model NEMO) • Objective: Image interpolation • Problem: How to deal with the coast?p (((((( hhhhhh Optical ﬂow (((((( hhhhhh Discrete OT 8 / 50

Overview • Dynamic optimal transport • Generalization of the transport
cost • Optimal transport with physical priors 9 / 50

Continuous Optimal Transport • Densities ρ0 and ρ1 deﬁned from
x ∈ [0, 1]d to [0, 1] • Mass preserving transport map T: T (ρ0, ρ1) := {T : [0, 1]d → [0, 1]d such that ρ1 = T ρ0} • An optimal transport T solves min T∈T (ρ0,ρ1) C(x, T(x))ρ0(x) dx where C(x, y) 0 is the cost of assigning x ∈ [0, 1]d to y ∈ [0, 1]d 10 / 50

Lp optimal transport Properties C(x, y) = ||x − y||p
⇒ p−Wasserstein distance between ρ0 and ρ1 • For p > 1, T is unique • For p = 2,T = ∇ψ, with ψ convex [Brenier ’91] and optimal mass transfer follows straight lines 11 / 50

⇒ p−Wasserstein distance between ρ0 and ρ1 • For p > 1, T is unique • For p = 2,T = ∇ψ, with ψ convex [Brenier ’91] and optimal mass transfer follows straight lines Explicit computation in 1D with cumulative functions 11 / 50

⇒ p−Wasserstein distance between ρ0 and ρ1 • For p > 1, T is unique • For p = 2,T = ∇ψ, with ψ convex [Brenier ’91] and optimal mass transfer follows straight lines Explicit computation in 1D with cumulative functions Can not be extended to higher dimensions x ∈ Rd , d > 1 11 / 50

Estimation of optimal transport map • A transport map T
∈ T (ρ0, ρ1) satisﬁes the gradient equation ρ0(x) = ρ1(T(x))| det(∂T(x))| • For p = 2, T = ∇ψ ⇒ Monge-Ampere equation: det(D2ψ) = ρ0(x) ρ1(∇ψ(x)) [Oliker and Prussner ’88, Oberman ’08, Froese ’12, Benamou et al. ’12, ’16] 12 / 50

∈ T (ρ0, ρ1) satisﬁes the gradient equation ρ0(x) = ρ1(T(x))| det(∂T(x))| • For p = 2, T = ∇ψ ⇒ Monge-Ampere equation: det(D2ψ) = ρ0(x) ρ1(∇ψ(x)) [Oliker and Prussner ’88, Oberman ’08, Froese ’12, Benamou et al. ’12, ’16] Fast algorithms (second order methods) 12 / 50

∈ T (ρ0, ρ1) satisﬁes the gradient equation ρ0(x) = ρ1(T(x))| det(∂T(x))| • For p = 2, T = ∇ψ ⇒ Monge-Ampere equation: det(D2ψ) = ρ0(x) ρ1(∇ψ(x)) [Oliker and Prussner ’88, Oberman ’08, Froese ’12, Benamou et al. ’12, ’16] Fast algorithms (second order methods) ρ1 should be lipschitz continuous with convex support ρ0 ρ1 12 / 50

∈ T (ρ0, ρ1) satisﬁes the gradient equation ρ0(x) = ρ1(T(x))| det(∂T(x))| • For p = 2, T = ∇ψ ⇒ Monge-Ampere equation: det(D2ψ) = ρ0(x) ρ1(∇ψ(x)) [Oliker and Prussner ’88, Oberman ’08, Froese ’12, Benamou et al. ’12, ’16] Fast algorithms (second order methods) ρ1 should be lipschitz continuous with convex support ρ0 ρ1 ρ0 ρ1 12 / 50

∈ T (ρ0, ρ1) satisﬁes the gradient equation ρ0(x) = ρ1(T(x))| det(∂T(x))| • For p = 2, T = ∇ψ ⇒ Monge-Ampere equation: det(D2ψ) = ρ0(x) ρ1(∇ψ(x)) [Oliker and Prussner ’88, Oberman ’08, Froese ’12, Benamou et al. ’12, ’16] Fast algorithms (second order methods) ρ1 should be lipschitz continuous with convex support ρ0 ρ1 ρ0 ρ1 • Regularized potential ψ [Paty et al. ’19] 12 / 50

Estimation of optimal transport map Use of Knothe rearrangement •
The Knothe transport solves: min T∈T (ρ0,ρ1) d i=1 x C(xi , T(x)i ) • Can be computed explicitly 13 / 50

The Knothe transport solves: min T∈T (ρ0,ρ1) d i=1 x C(xi , T(x)i ) • Can be computed explicitly PDE initialized with Knothe rearrangement • T = ∇ψ, then curl(T) = ∇ × T = 0 . Penalization of curl(T) [Angenent et al. ’03, Haber et al. ’10] . PDE on T in the transport map space • Lagrangian formulation using straight lines [Iollo and Lombardi, ’11] • PDE on ψ on the torus [Carlier et al. ’10, Bonnotte ’13] 13 / 50

The Knothe transport solves: min T∈T (ρ0,ρ1) d i=1 x C(xi , T(x)i ) • Can be computed explicitly PDE initialized with Knothe rearrangement • T = ∇ψ, then curl(T) = ∇ × T = 0 . Penalization of curl(T) [Angenent et al. ’03, Haber et al. ’10] . PDE on T in the transport map space • Lagrangian formulation using straight lines [Iollo and Lombardi, ’11] • PDE on ψ on the torus [Carlier et al. ’10, Bonnotte ’13] All these methods are limited to non vanishing densities 13 / 50

Fluid mechanics formulation [Benamou-Brenier ’00] • Parameterization with t ∈
[0, 1] of the geodesic path ρ(x, t): ρ(x, t) = ((1 − t)Id + tT(x)) ρ0 14 / 50

[0, 1] of the geodesic path ρ(x, t): ρ(x, t) = ((1 − t)Id + tT(x)) ρ0 • Non-convex problem over ρ(x, t) ∈ R and velocity ﬁeld v(x, t) ∈ R2: W2(ρ0, ρ1)2 = min (v,ρ)∈Cv 1 2 [0,1]2 1 0 ρ(x, t)||v(x, t)||2dtdx, under the set of non-linear constraints Cv = (v, ρ) ; ∂t ρ + divx (ρv) = 0, v(0, ·) = v(1, ·) = 0, ρ(·, 0) = ρ0, ρ(·, 1) = ρ1 14 / 50

[0, 1] of the geodesic path ρ(x, t): ρ(x, t) = ((1 − t)Id + tT(x)) ρ0 • Non-convex problem over ρ(x, t) ∈ R and velocity ﬁeld v(x, t) ∈ R2: W2(ρ0, ρ1)2 = min (v,ρ)∈Cv 1 2 [0,1]2 1 0 ρ(x, t)||v(x, t)||2dtdx, under the set of non-linear constraints Cv = (v, ρ) ; ∂t ρ + divx (ρv) = 0, v(0, ·) = v(1, ·) = 0, ρ(·, 0) = ρ0, ρ(·, 1) = ρ1 Change of variable (v, µ) → (m, µ), with m = µv: Convex cost J and linear constraints C 14 / 50

[0, 1] of the geodesic path ρ(x, t): ρ(x, t) = ((1 − t)Id + tT(x)) ρ0 • Non-convex problem over ρ(x, t) ∈ R and velocity ﬁeld v(x, t) ∈ R2: W2(ρ0, ρ1)2 = min (v,ρ)∈Cv 1 2 [0,1]2 1 0 ρ(x, t)||v(x, t)||2dtdx, under the set of non-linear constraints Cv = (v, ρ) ; ∂t ρ + divx (ρv) = 0, v(0, ·) = v(1, ·) = 0, ρ(·, 0) = ρ0, ρ(·, 1) = ρ1 Change of variable (v, µ) → (m, µ), with m = µv: Convex cost J and linear constraints C No estimation of the transport map T, only the geodesic ρ(x, t) 14 / 50

Minimization The problem is min (m,ρ) J (m, ρ) +
ιC(m, ρ) where J et ιCm are non-smooth convex functions 15 / 50

Minimization The problem is min (m,ρ) J (m, ρ) +
ιC(m, ρ) where J et ιCm are non-smooth convex functions [P., Peyré et Oudet, 2014] • Staggered grid discretization • Optimisation with proximal splitting algorithms: - ADMM/Douglas-Rachford [Lions et Mercier ‘79, Combettes and Pesquet ‘07] - Generalized Forward-Backward [Raguet et al. ‘13] - Primal-Dual [Chambolle and Pock ‘11] • Generalized costs 15 / 50

Illustration: linear transport 16 / 50

Comparison of optimization algorithms 100 101 102 103 104 102
103 104 105 Iterations k J(mk,fk) ADMM DR PD J (m( ), ρ( )) Mininimum value of ρ( ) Convergence of transport cost is fast.... 17 / 50

Convergence speed ||ρ − ρ( )|| ||m − m( )||
... but convergence of iterates is slow ((ρ∗, m∗) is the reference solution) 18 / 50

Example: vanishing densities 19 / 50

Example: vanishing densities Limitation: Dealing with non convex domains 19
/ 50

Generalization of the transport cost Functional deﬁnition • Transport cost
function: min (v,ρ)∈Cv 1 2 [0,1]2 1 0 ρ(x, t)||v(x, t)||2dtdx, • Set of constraints Cv = (v, ρ) ; ∂t ρ + divx (ρv) = 0, ρ(·, 0) = ρ0, ρ(·, 1) = ρ1 20 / 50

Generalization of the transport cost Functional deﬁnition • Generalization of
the transport cost function: min (v,ρ)∈Cv 1 2 [0,1]2 1 0 ρβ(x, t)||v(x, t)||2dtdx, • Set of constraints Cv = (v, ρ) ; ∂t ρ + divx (ρv) = 0, ρ(·, 0) = ρ0, ρ(·, 1) = ρ1 • β ∈ [0; 1]: from H−1 to L2 -Wasserstein distances [Dolbeault et al. ’09, Cardaliaguet et al. ’12] 20 / 50

the transport cost function: min (v,ρ)∈Cv 1 2 [0,1]2 1 0 w(x, t)ρ(x, t)||v(x, t)||2dtdx, • Set of constraints Cv = (v, ρ) ; ∂t ρ + divx (ρv) = 0, ρ(·, 0) = ρ0, ρ(·, 1) = ρ1 • β ∈ [0; 1]: from H−1 to L2 -Wasserstein distances [Dolbeault et al. ’09, Cardaliaguet et al. ’12] • Riemannian manifold with 0 < w(x, t) = w(x) +∞ (existence and uniqueness [Mc Cann ’01] ): deal with obstacles 20 / 50

the transport cost function: min (v,ρ)∈Cv 1 2 [0,1]2 1 0 ρ(x, t)||A(x, t)v(x, t)||2dtdx, • Set of constraints Cv = (v, ρ) ; ∂t ρ + divx (ρv) = 0, ρ(·, 0) = ρ0, ρ(·, 1) = ρ1 • β ∈ [0; 1]: from H−1 to L2 -Wasserstein distances [Dolbeault et al. ’09, Cardaliaguet et al. ’12] • Riemannian manifold with 0 < w(x, t) = w(x) +∞ (existence and uniqueness [Mc Cann ’01] ): deal with obstacles • Anisotropic transport [Hug et al. ’15] 20 / 50

the transport cost function: min (v,ρ)∈Cv 1 2 [0,1]2 1 0 ρ(x, t)||v(x, t)||2dtdx+ 1 2 [0,1]2 1 0 ρ(x, t)||f(x, t)||2, • Set of constraints Cv = (v, ρ) ; ∂t ρ + divx (ρv) = f, ρ(·, 0) = ρ0, ρ(·, 1) = ρ1 • β ∈ [0; 1]: from H−1 to L2 -Wasserstein distances [Dolbeault et al. ’09, Cardaliaguet et al. ’12] • Riemannian manifold with 0 < w(x, t) = w(x) +∞ (existence and uniqueness [Mc Cann ’01] ): deal with obstacles • Anisotropic transport [Hug et al. ’15] • Unbalanced Transport [Chizat et al. ’18] 20 / 50

Generalization of the transport cost Example: linear transport with ||m||2/ρβ
β = 0 β = 0.5 β = 0.25 β = 0.75 21 / 50

Anisotropic Optimal Transport Illustration: • Rotations: OT (A=Id) A1 A2
A3 22 / 50

Generalization of the transport cost Synthetic oceanography application 2D OT
23 / 50

Generalization of the transport cost Synthetic oceanography application 2D OT
in a complex domain, with w(x, t) = w(x) ∈ {1; +∞} 24 / 50

Generalization of the transport cost Example: labyrinth 1 w(x, t)
= w(x) ∈ {1; +∞} 25 / 50

Generalization of the transport cost Example: labyrinth 1 Piecewise straight
lines 25 / 50

Generalization of the transport cost Example: labyrinth 2 Moving walls
w(x, t) ∈ {1; +∞} 26 / 50

Generalization of the transport cost Example: labyrinth 2 Piecewise constant
speed 26 / 50

= w(x) ∈ {1; +∞} 27 / 50

= w(x) ∈ {1; +∞} Deal with obstacles Preservation of structures 27 / 50

Optimal transport with physical priors Problems • Optimal mass transfer
follows straight lines • Can we include other physical priors on the transport ? 28 / 50

follows straight lines • Can we include other physical priors on the transport ? Reintroduction of the velocity v [Hug et al. ’15, Maas et al. ’15] • Coupling with a smooth and non-convex penalization: K(m, ρ, v) = 1 2 [0;1]2 1 0 ||m − ρv||2dtdx • Regularity priors R(v): incompressibility (div(v) = 0), rigidity... 28 / 50

follows straight lines • Can we include other physical priors on the transport ? Reintroduction of the velocity v [Hug et al. ’15, Maas et al. ’15] • Coupling with a smooth and non-convex penalization: K(m, ρ, v) = 1 2 [0;1]2 1 0 ||m − ρv||2dtdx • Regularity priors R(v): incompressibility (div(v) = 0), rigidity... • Non-convex model F(m, ρ, v) = J (m, ρ) + ιC (m, ρ) + λK(m, ρ, v) + αR(v), ⇒ Block coordinates descent [Tseng ’01, Ochs et al. ’14] 28 / 50

Optimal transport with physical priors Illustration OT Incompressible Rigid 29
/ 50

Optimal transport with physical priors Oceanography application 2D OT in
a complex domain with divergence-free penalization 30 / 50

Conclusion Proximal splitting methods • Solving dynamical OT problem •
Adding constraints and generalizing cost function 31 / 50

Adding constraints and generalizing cost function Extension of Dynamic OT • Discrete surfaces [Lavenant et al. ’18] • Sphere [Lang and P. ’19?] 31 / 50

Adding constraints and generalizing cost function Extension of Dynamic OT • Discrete surfaces [Lavenant et al. ’18] • Sphere [Lang and P. ’19?] Open problems • Study the existence of solution for non static domains • Modeling of data occlusions (clouds) for data assimilation in oceanography 31 / 50

Part II Discrete Optimal Transport 32 / 50

Context General color transfer problem • Manipulate some statistical features
of an image (color, luminance, texture, etc) • Preserve the other characteristics (geometry, edges, contrast, etc) and avoid artifacts 33 / 50

Examples of color transfer applications Color Harmonization (before) u1 u2
34 / 50

Examples of color transfer applications Color Harmonization(after) T(u1) u2 34
/ 50

Examples of color transfer applications Color Harmonization(after) T(u1) u2 3D
Reconstruction [P., Provenzi and Caselles ’11] 34 / 50

Examples of color transfer applications • Transfer style from movies
Amélie Poulain Transformers Result Sources: [Bonneel et al. ’13] • Prior on foreground/background segmentation [Frigo et al. ’15] 35 / 50

Examples of color transfer applications Attracting students 36 / 50

Examples of color transfer applications Attracting students Place near a
random university 36 / 50

Examples of color transfer applications Attracting students Place near Color
Palette a random university 36 / 50

Palette Warm place a random university near the sea 36 / 50

Palette Warm place a random university near the sea Advertisement: Erasmus Mundus Master Program IPCV (Bordeaux, Budapest, Madrid) 36 / 50

Unsupervised color transfer u v Tu→v (u) Same color mean
37 / 50

Unsupervised color transfer u v Tu→v (u) Same color mean
Parametric methoods: transfer of color statistics [Reinhard 2001,Tai et al. ’05] 37 / 50

Unsupervised color transfer u v Tu→v (u) Same color histogram
Optimal transport: transfer of color palette [Pitié et al. ’07, Rabin et al. ’11] 37 / 50

Monge-Kantorovitch problem for d 1 • Histograms: µ = M
i=1 µi δXi et ν = N j=1 νj δYj , Xi , Yj ∈ Rd • Wasserstein distance W2(µ, ν)2 = min P∈Pµ,ν { P , C = i,j Pi,j Ci,j } Pµ,ν =        P ∈ RM×N , Pi,j 0, i,j Pi,j = 1, j Pi,j = µi , i Pi,j = νj        • Pi,j is the mass transported from µi to νj • Cost matrix between locations Xi and Yj : Ci,j = d k=1 ||Xk i − Yk j ||2 Do not depend on feature dimension d Limited to low dimensions M and N 38 / 50

Solving the discrete problem • Linear programing: simplex, interior points
• Assignment problems: Hungarian, Auction • Acceleration for L1 costs [Ling and Okada ’07 Pele and Werman ’08] • Sliced Wasserstein [Rabin et al. ’11] • Multiscale [Oberman and Ruan ’15]. • Fast approximation: Sinkhorn [Cuturi ’13] • Shortcuts L2 [Schmitzer ’15] 39 / 50

Limitations of Optimal Transport • Exact Transfert of color proportions
• Irregularity of transport map (spatial / color) • Dimension: limited to low dimensional images Solutions • Relaxation of mass preservation constraint • Estimation of regularised transport maps (color consistency) • Pixel clustering (spatial consistency) 40 / 50

Relaxation of mass conservation constraint • Transport of color histogram
µ = M i=1 µi δXi to ν µi : proportion of color for bin Xi • Introduction of capacity variables κj • Joint estimation of P and κ {P , κ } ∈ argmin P∈Pκ(µ,ν) κ∈RN ,κ≥0, κ, ν =1 P, C + µ||κ − 1||1 • Relaxed constraints [Ferradans et al. ’14]: Pκ (µ, ν) =        Pi,j 0, i,j Pi,j = 1, j Pi,j = µi , i Pi,j = κj νj        Still a linear program 41 / 50

µ = M i=1 µi δXi to ν µi : proportion of color for bin Xi • Introduction of capacity variables κj • Joint estimation of P and κ {P , κ } ∈ argmin P∈Pκ(µ,ν) κ∈RN ,κ≥0, κ, ν =1 P, C + µ||κ − 1||1 • Relaxed constraints [Ferradans et al. ’14]: Pκ (µ, ν) =        Pi,j 0, i,j Pi,j = 1, j Pi,j = µi , i Pi,j = κj νj        Optimal transport Still a linear program 41 / 50

µ = M i=1 µi δXi to ν µi : proportion of color for bin Xi • Introduction of capacity variables κj • Joint estimation of P and κ {P , κ } ∈ argmin P∈Pκ(µ,ν) κ∈RN ,κ≥0, κ, ν =1 P, C + µ||κ − 1||1 • Relaxed constraints [Ferradans et al. ’14]: Pκ (µ, ν) =        Pi,j 0, i,j Pi,j = 1, j Pi,j = µi , i Pi,j = κj νj        Relaxed optimal transport Still a linear program 41 / 50

Illustration of relaxed model Target OT Relaxed OT Source
Joint estimation of the proportion of color to transfer No spatial nor colorimetric regularisation 42 / 50

Regularity of transport map • Global regularization: NP-hard problem •
Mean transport map [Ferradans et al. ’13, Seguy et al. ’17] TP(Xi ) = Yi = 1 j Pij j Pij Yj = (Dµ PY)i 43 / 50

Regularity of transport map for color transfer • Mean transport:
Vi = TP(Xi) − Xi • Spatial consistency: graph of similarity ωij between Xi and Xj ⇒ Close pixels with similar colors should be matched together • Graph-laplacian of the mean transport ﬁeld V (∆V)i := j∈EX (i) ωij d =1 (Vi − Vj ), • Color consistency penalize color shift to avoid artifacts R(P) = i |∆V|i • Still a linear program • Symetric formulation • Barycenter computation Graph 44 / 50

Vi = TP(Xi) − Xi • Spatial consistency: graph of similarity ωij between Xi and Xj ⇒ Close pixels with similar colors should be matched together • Graph-laplacian of the mean transport ﬁeld V (∆V)i := j∈EX (i) ωij d =1 (Vi − Vj ), • Color consistency penalize color shift to avoid artifacts R(P) = i |∆V|i • Still a linear program • Symetric formulation • Barycenter computation Relaxed OT 44 / 50

Vi = TP(Xi) − Xi • Spatial consistency: graph of similarity ωij between Xi and Xj ⇒ Close pixels with similar colors should be matched together • Graph-laplacian of the mean transport ﬁeld V (∆V)i := j∈EX (i) ωij d =1 (Vi − Vj ), • Color consistency penalize color shift to avoid artifacts R(P) = i |∆V|i • Still a linear program • Symetric formulation • Barycenter computation Relaxed and regularized OT 44 / 50

Color transfer algorithm 1. Superpixel clustering [Achanta et al. ’12,
Giraud et al. ’18] Image Superpixels 2. Graph built from superpixel similarities (spatial+color) 3. Estimation of relaxed and regularised transport map 4. Final synthesis at pixel scale 45 / 50

Color transfer Images: Nicolas Le Dilhuit 46 / 50

Limitation Creation of new drab colors: • Interpolation Xi
→ ¯ Yi • Ampliﬁed with explicit regularisation Implicit regularisation of the transport matrix elements does not help • Entropic regularisation [Cuturi ’13] • Convex/sparse regularisation [Blondel et al. ’17, Dessein et al. ’19] Solution: deal with the color dispersion 47 / 50

Color dispersion • Measure variance of transfered color [Rabin and
P. ’15] Var(Y)i := Y − Yi 2 i • Minimisation of a concave term of mass dispersion: α i µi Var(Y)i ⇒ Associate a single color to Xi • Non-smooth and non-convex problem Forward-Backard [Attouch et al. ’13, Ochs et al. ’14], DC programming [Tao ’05] 48 / 50

Inﬂuence of color dispersion with parameter α Input α =
0 Example with high regularisation parameter 49 / 50

Conclusion For image processing applications: • Relaxation of mass conservation
constraint is necessary • Spatial regularization into transport map deals with artifacts • Non convex models prevent from creating new dull colors To be fair: • Doing color transfer with optimal transport is currently time consuming (1 minute for HD image) • Semi-automatic methods (high level segmentation, semantic analysis, simple optimal transport) [Bonneel et al. ’13, Frigo et al. ’14] give fast and accurate color transfer results even for videos But: • Enhancing OT framework will improve semi-automatic methods • Dealing with artifacts allows deﬁning robust dissimilarity measures 50 / 50

Physical regularization of Optimal Transport

Physical regularization of Optimal Transport

More Decks by npapadakis

Other Decks in Science

Featured

Transcript