Deep Learning Image Manipulation
Leszek Rybicki
May 18, 2017
Illustrated guide to some image manipulation methods, with demonstration.
Transcript
Deep Learning Image Manipulation: an illustrated guide. ML Kitchen
About me: • Leszek Rybicki • github @lunardog • born in Poland • ML Researcher at Cookpad • I like natto
careers@cookpad.com Want to work with us?
Convolutional Arithmetic OCIKE
Images to features
Convolution: input, kernel, output. http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html
Stride: 2px. http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html
Padding: 2px. http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html
Stride and padding. http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html
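The effect of stride and padding on the output size can be checked with a few lines of arithmetic. This is a minimal sketch of the standard formula from the convolution arithmetic guide linked on these slides; the function name `conv_output_size` is made up for this illustration and is not from the deck.

```python
def conv_output_size(input_size, kernel_size, stride=1, padding=0):
    """Output length along one axis: floor((i + 2p - k) / s) + 1."""
    if input_size + 2 * padding < kernel_size:
        raise ValueError("kernel does not fit in the padded input")
    return (input_size + 2 * padding - kernel_size) // stride + 1


# 4px input, 3px kernel, no padding, unit stride
print(conv_output_size(4, 3))                       # 2
# 5px input, 3px kernel, stride 2
print(conv_output_size(5, 3, stride=2))             # 2
# 5px input, 3px kernel, stride 2, 1px of padding on each side
print(conv_output_size(5, 3, stride=2, padding=1))  # 3
```

The same formula applies independently to the width and the height of an image.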
Transposed: simulated here with padding; also called “deconvolution” or “fractional stride”. http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html
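A transposed convolution can be pictured as every input value stamping a scaled copy of the kernel into a larger output. The sketch below shows this in one dimension with plain Python lists; `conv_transpose_1d` is a hypothetical helper written for this illustration, not code from the talk.

```python
def conv_transpose_1d(signal, kernel, stride=1):
    """Each input value adds a scaled copy of the kernel to the output."""
    out_len = (len(signal) - 1) * stride + len(kernel)
    out = [0.0] * out_len
    for i, x in enumerate(signal):
        for j, w in enumerate(kernel):
            # Copies start `stride` positions apart, so the output grows.
            out[i * stride + j] += x * w
    return out


# Two input values become five output values with stride 2.
print(conv_transpose_1d([1, 2], [1, 1, 1], stride=2))
# [1.0, 1.0, 3.0, 2.0, 2.0]
```

Where the stamped copies overlap (the middle position here), their contributions add up.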
Downsampling: RGB image (input) → convolutional layer or layers → features or small resolution image (output)
Upsampling: features or small resolution image (input) → upsampling CNN layer or layers → RGB image (output)
Encoder-Decoder: image in → Encoder (E) → feature space → Decoder (D) → image out
Fully Connected Classifier: data or features → class (approve loan / reject). Also called “Dense” layer.
CNN Classifier: food, person, plant, other. AlexNet, LeNet, VGG…
FoodNet™: food / not food
@teenybiscuit
Generative Adversarial Networks
Generator, Discriminator: “Couple of bots studying for the Turing Test”. https://speakerdeck.com/lunardog/deep-convolutional-voight-kampf-test
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks. Alec Radford, Luke Metz, Soumith Chintala (submitted 19 Nov 2015 (v1), last revised 7 Jan 2016 (v2)). https://arxiv.org/abs/1511.06434
Generator (G), Discriminator (D): “looks legit” / “totally shopped”
Generator training: D(G(noise)) → real. Discriminator acts as the teacher.
Discriminator training: D(G(noise)) → fake, D(photo) → real. Generator provides negative examples.
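The two training targets can be written as a pair of loss functions. This is a minimal sketch assuming the discriminator outputs a probability that its input is real and that binary cross-entropy is the loss, as in the usual GAN formulation; the function names are made up for this illustration and the slides do not specify a loss.

```python
import math

REAL, FAKE = 1.0, 0.0


def bce(prediction, target):
    """Binary cross-entropy for a single probability."""
    eps = 1e-12  # keep log() away from zero
    p = min(max(prediction, eps), 1.0 - eps)
    return -(target * math.log(p) + (1.0 - target) * math.log(1.0 - p))


def generator_loss(d_on_generated):
    # Generator training: push D(G(noise)) toward "real".
    return bce(d_on_generated, REAL)


def discriminator_loss(d_on_photo, d_on_generated):
    # Discriminator training: D(photo) -> real, D(G(noise)) -> fake.
    return bce(d_on_photo, REAL) + bce(d_on_generated, FAKE)


# The generator is rewarded when the discriminator is fooled.
print(generator_loss(0.9) < generator_loss(0.1))                     # True
# The discriminator is rewarded when it tells the two apart.
print(discriminator_loss(0.9, 0.1) < discriminator_loss(0.1, 0.9))   # True
```

In a real training loop the two losses are minimized in alternation, each one updating only its own network.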
https://www.youtube.com/watch?v=rs3aI7bACGc ©Yota Ishida
Conditional GANs
Conditional Generator: G(noise | conditions). Conditions: male, female, child, elderly.
Conditional Discriminator: D answers right / wrong for an image together with its conditions (male, female, child, elderly).
Conditional GAN: Generator and Discriminator receive the same condition. Conditional Generative Adversarial Nets. Mehdi Mirza, Simon Osindero (submitted 6 Nov 2014). https://arxiv.org/abs/1411.1784
Conditional GAN: Generator G(noise | conditions) → Discriminator D (right / wrong)
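One common way to feed conditions into both networks is to append them to the input as a 0/1 vector. The sketch below assumes that encoding and assumes the four conditions can be combined freely; the helper names and the noise size are made up for this illustration, and the slides do not say how the conditions are encoded.

```python
import random

CONDITIONS = ["male", "female", "child", "elderly"]


def encode_conditions(active):
    """One 0/1 flag per condition, in a fixed order."""
    return [1.0 if name in active else 0.0 for name in CONDITIONS]


def generator_input(active, noise_size=8, rng=random):
    """G(noise | conditions): noise vector followed by condition flags."""
    noise = [rng.gauss(0.0, 1.0) for _ in range(noise_size)]
    return noise + encode_conditions(active)


def discriminator_input(image_features, active):
    """D sees the image (here: a feature list) with the same condition flags."""
    return list(image_features) + encode_conditions(active)


vec = generator_input({"female", "child"})
print(len(vec), vec[-4:])  # 12 [0.0, 1.0, 1.0, 0.0]
```

Because the discriminator also sees the flags, it can reject a realistic image that does not match the requested condition.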
FaceApp: https://www.faceapp.com/ Disclaimer: FaceApp authors don’t disclose their method. This is only my guess. It may have nothing to do with GANs. (Images: original photos next to FaceApp results.)
Artistic Style Transfer: Improved!
https://prisma-ai.com/
Transformation network and pre-trained loss network: content image, style image, Gram matrices in feature space. https://arxiv.org/abs/1603.08155
“Gram matrices in feature space” https://en.wikipedia.org/wiki/Gramian_matrix
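A Gram matrix in feature space holds the inner product of every pair of feature channels, which records which features fire together while discarding where they fire. The sketch below uses plain lists; dividing by the number of positions is an assumption made here for readability, and `gram_matrix` is a name chosen for this illustration.

```python
def gram_matrix(features):
    """features: C channels, each a flat list of H*W activations.

    Returns a C x C matrix of channel-to-channel inner products,
    averaged over spatial positions (normalization is an assumption).
    """
    n = len(features[0])
    return [
        [sum(a * b for a, b in zip(fi, fj)) / n for fj in features]
        for fi in features
    ]


# Two channels over two spatial positions.
print(gram_matrix([[1, 2], [3, 4]]))
# [[2.5, 5.5], [5.5, 12.5]]
```

A style loss then compares the Gram matrices of the generated image and the style image, layer by layer.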
https://www.youtube.com/watch?v=xVJwwWQlQ1o
CycleGAN
https://github.com/junyanz/CycleGAN
Generator: transformation network. https://arxiv.org/abs/1603.08155
Discriminator: fully convolutional, judges patches of the input image (fake / horse, fake / horse, …). https://arxiv.org/abs/1603.08155
Adversarial Loss: generators F and G, discriminators X (fake / zebra) and Y (fake / horse). X(F(horse)) → classify as zebra. Y(G(zebra)) → classify as horse.
Cycle Loss: G(F(image)) → the same image. F(G(image)) → the same image.
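The cycle loss asks that translating an image to the other domain and back returns the original. The sketch below assumes a mean absolute (L1) difference as the distance and uses toy list-based functions in place of the generators F and G; all names here are made up for this illustration.

```python
def l1_distance(a, b):
    """Mean absolute difference between two equally sized images."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def cycle_loss(F, G, horse, zebra):
    """G(F(image)) and F(G(image)) should both give back the same image."""
    return l1_distance(G(F(horse)), horse) + l1_distance(F(G(zebra)), zebra)


# Toy stand-ins for the generators (hypothetical, not real networks).
def brighten(img):
    return [x + 1.0 for x in img]


def darken(img):
    return [x - 1.0 for x in img]


def identity(img):
    return list(img)


horse = [0.0, 0.5, 1.0]
zebra = [1.0, 0.0, 1.0]
print(cycle_loss(brighten, darken, horse, zebra))    # 0.0
print(cycle_loss(brighten, identity, horse, zebra))  # 2.0
```

The first pair undoes each other exactly, so the loss is zero; the second pair does not, and the loss grows with the mismatch.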
https://www.youtube.com/watch?v=9reHvktowLY
The End