
Network-to-Network Translation with Conditional Invertible Neural Networks

Udon
December 15, 2020

Robin Rombach, Patrick Esser, Björn Ommer
NeurIPS 2020 (oral). Code at this https URL
https://arxiv.org/abs/2005.13580


Transcript

  1. 2020/12/15 @udoooom
    Network-to-Network Translation
    with Conditional Invertible Neural
    Networks
    Robin Rombach∗, Patrick Esser∗, Björn Ommer
    IWR, HCI, Heidelberg University
    https://papers.nips.cc/paper/2020/file/1cfa81af29c6f2d8cacb44921722e753-Paper.pdf
    https://papers.nips.cc/paper/2020/file/1cfa81af29c6f2d8cacb44921722e753-Supplemental.pdf


  2. Problems
    • Supervised models have achieved great success on tasks such as:
    • Image Classification, Segmentation (ResNet, DeepLab Series)
    • Question Answering (BERT, GPT-3)
    • Image Generation, Translation (BigGAN, StyleGAN)
    • We need to find new ways to reuse such expert models!


  3. Problems
    • Pre-trained models have arbitrary fixed representations
    • StyleGAN: Image Generation
    • BERT: Sentence Embedding
    • We need domain (modality) translation while keeping their full capabilities!


  4. Contribution
    • Propose the conditional invertible neural network (cINN), a model that can relate
    different existing representations to each other without altering them.
    • The cINN needs no gradients of the expert models.


  5. Related Works
    Invertible Neural Networks (INN): Generative Models
    [Figure: an INN maps between a base distribution and a target distribution (e.g. Image2StyleGAN); generation can additionally take conditions. Source: https://openai.com/blog/generative-models/]


  6. Related Works
    Invertible Neural Networks (INN): Generative Models
    [Figure: the same INN diagram; this work extends the idea to network-to-network translation. Source: https://openai.com/blog/generative-models/]
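
    As a rough illustration of the invertibility that INNs rely on (not the architecture used in the paper), the sketch below implements a single RealNVP-style affine coupling layer in PyTorch: the same parameters map samples forward and exactly back, which is what lets a flow move between a base and a target distribution. All names and sizes are illustrative assumptions.

    ```python
    import torch
    import torch.nn as nn

    class AffineCoupling(nn.Module):
        """One RealNVP-style coupling layer: exactly invertible by construction."""
        def __init__(self, dim, hidden=128):
            super().__init__()
            self.half = dim // 2
            # Small net predicting log-scale and shift for the second half from the first half.
            self.net = nn.Sequential(
                nn.Linear(self.half, hidden), nn.ReLU(),
                nn.Linear(hidden, 2 * (dim - self.half)),
            )

        def forward(self, x):
            x1, x2 = x[:, :self.half], x[:, self.half:]
            log_s, t = self.net(x1).chunk(2, dim=1)
            y2 = x2 * torch.exp(log_s) + t                   # affine transform of the second half
            return torch.cat([x1, y2], dim=1), log_s.sum(1)  # output and log|det J|

        def inverse(self, y):
            y1, y2 = y[:, :self.half], y[:, self.half:]
            log_s, t = self.net(y1).chunk(2, dim=1)
            x2 = (y2 - t) * torch.exp(-log_s)                # exact inverse of the forward pass
            return torch.cat([y1, x2], dim=1)

    # Round trip: mapping forward and back recovers the input up to float error.
    layer = AffineCoupling(dim=16)
    v = torch.randn(4, 16)
    z, logdet = layer(v)
    print(torch.allclose(layer.inverse(z), v, atol=1e-5))    # True
    ```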


  7. Proposed Method
    Motivation
    • Learn relationships and transfer between representations of different domains


  8. Proposed Method
    Motivation
    • D_x, D_y : two target domains
    • f(x) : desired output for x ∈ D_x
    • z_Φ = Φ(x) : latent representation
    • The fixed experts factorize as f(x) = Ψ(Φ(x)) and g(y) = Λ(Θ(y))
    • To realize domain translation, it needs to be described probabilistically as
    sampling from p(z_Θ | z_Φ)
    • Denote z_Θ = τ(v | z_Φ), with τ the translation function and v the residuals
    [Figure: an input x ∈ D_x is encoded to z_Φ = Φ(x); conditioned on z_Φ, different residuals v are decoded via Λ into different outputs y_1, y_2 ∈ D_y, e.g. the captions "The dog is cute" and "The dog is lovely"]
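
    For intuition, the factorization f(x) = Ψ(Φ(x)) just means cutting a fixed expert model at an intermediate layer: everything up to the cut is the encoder Φ, the rest is the head Ψ. The toy model and the split point in this sketch are assumptions for illustration only, not the experts used in the paper.

    ```python
    import torch
    import torch.nn as nn

    # Pretend this is a pre-trained expert f; in the paper the experts stay frozen.
    expert = nn.Sequential(
        nn.Flatten(),
        nn.Linear(3 * 32 * 32, 256), nn.ReLU(),   # early layers
        nn.Linear(256, 128), nn.ReLU(),           # representation layer
        nn.Linear(128, 10),                       # task head
    )
    expert.requires_grad_(False)                  # no gradients of the expert are needed

    Phi = expert[:5]                              # encoder Phi: input -> z_Phi
    Psi = expert[5:]                              # head Psi: z_Phi -> task output

    x = torch.randn(2, 3, 32, 32)
    z_phi = Phi(x)                                # z_Phi = Phi(x), the representation the cINN conditions on
    assert torch.allclose(expert(x), Psi(z_phi))  # f(x) == Psi(Phi(x))
    ```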


  9. Proposed Method
    Learning a Domain Translation τ
    • v must capture all information of z_Θ not represented in z_Φ, but no information that is
    already represented in z_Φ
    • The cINN computes the inverse: v = τ⁻¹(z_Θ | z_Φ)


  10. Proposed Method
    Learning a Domain Translation τ
    • v discards all information of z_Φ if v and z_Φ are independent
    • Minimize KL(p(v | z_Φ) || q(v)), where q(v) is a standard normal distribution
    • This achieves the goal of sampling from p(z_Θ | z_Φ): z_Θ = τ(v | z_Φ), with v sampled from q(v)
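
    A minimal sketch of the training objective, assuming the usual normalizing-flow formulation: because τ is invertible given z_Φ, pushing v = τ⁻¹(z_Θ | z_Φ) towards a standard normal q(v) can be written as a negative log-likelihood with a change-of-variables term. The tiny conditional flow below is a stand-in interface (it returns v and log|det J|), not the paper's architecture.

    ```python
    import torch
    import torch.nn as nn

    class TinyConditionalFlow(nn.Module):
        """Elementwise affine flow: v = (z_theta - t(z_phi)) * exp(-s(z_phi)).
        A stand-in for the cINN; returns v and log|det dv/dz_theta|."""
        def __init__(self, dim, cond_dim):
            super().__init__()
            self.net = nn.Linear(cond_dim, 2 * dim)     # predicts log-scale s and shift t from z_phi

        def forward(self, z_theta, cond):
            s, t = self.net(cond).chunk(2, dim=1)
            v = (z_theta - t) * torch.exp(-s)
            logdet = -s.sum(dim=1)                      # Jacobian of v w.r.t. z_theta is diag(exp(-s))
            return v, logdet

    def cinn_nll(flow, z_theta, z_phi):
        # Maximizing the likelihood of z_theta | z_phi pushes v towards q(v) = N(0, I),
        # i.e. it minimizes KL(p(v|z_phi) || q(v)) up to a constant.
        v, logdet = flow(z_theta, cond=z_phi)
        log_q_v = -0.5 * (v ** 2).sum(dim=1)            # log N(v; 0, I), dropping the constant
        return -(log_q_v + logdet).mean()

    flow = TinyConditionalFlow(dim=64, cond_dim=32)
    loss = cinn_nll(flow, torch.randn(8, 64), torch.randn(8, 32))
    loss.backward()                                     # z_theta, z_phi come from the frozen experts
    ```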


  11. Proposed Method
    Domain Transfer Between Fixed Models

    • Algorithm (see the sketch below)
    • 1. Sample x from p(x)
    • 2. Encode x into z_Φ = Φ(x)
    • 3. Sample v from q(v)
    • 4. Transform z_Θ = τ(v | z_Φ)
    • 5. Decode z_Θ into y = Λ(z_Θ)
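
    Putting the five steps together, here is a minimal sketch of the transfer procedure. Phi (source expert encoder), Lambda (target expert decoder), and tau (the trained cINN) are assumed to be given and frozen; all names are placeholders, not the paper's code.

    ```python
    import torch

    @torch.no_grad()
    def network_to_network_transfer(x, Phi, tau, Lambda, v_dim):
        z_phi = Phi(x)                          # 2. encode x into z_Phi = Phi(x)
        v = torch.randn(x.shape[0], v_dim)      # 3. sample residual v from q(v) = N(0, I)
        z_theta = tau(v, cond=z_phi)            # 4. transform: z_Theta = tau(v | z_Phi)
        return Lambda(z_theta)                  # 5. decode z_Theta into y = Lambda(z_Theta)

    # 1. sample x from p(x): here a dummy batch, with dummy stand-ins for the experts and flow.
    Phi = lambda x: x.mean(dim=(2, 3))                  # dummy encoder -> (B, 3)
    tau = lambda v, cond: v + cond[:, : v.shape[1]]     # dummy conditional transform
    Lambda = lambda z: z                                # dummy decoder
    y = network_to_network_transfer(torch.randn(2, 3, 64, 64), Phi, tau, Lambda, v_dim=3)
    ```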


  12. Experiments
    1. BERT-to-BigGAN Translation
    • Compare IS and FID against baselines on the COCO-Stuff dataset
    [Table: baselines published at ICCV17, CVPR18, and CVPR19]


  13. Experiments
    2. Reusing a single target generator
    • Encoders: (a, b) DeepLab; (c, d) ResNet-50
    [Figure panels: outputs for the different encoders; one panel shows super-resolution with an autoencoder]


  14. Experiments
    2. Reusing a single target generator
    • Visualization of how invariances increase with increasing layer depth


  15. Experiments
    3. Image Editing: Conditional I2I
    [8] StarGAN [Choi+, CVPR18]


  16. Experiments
    3. Image Editing: Exemplar-Guided Translation and Unsupervised Disentangling


  17. Experiments
    3. Image Editing: Unpaired I2I


  18. Conclusion
    • Propose cINN technique for reusing pre-trained models
    • NLP-to-Image
    • Image-to-Image
    • Label-to-Image
    • An eco-friendly approach: expensive pre-trained expert models are reused instead of retrained
