Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Deep Learning Image Manipulation
Search
Leszek Rybicki
May 18, 2017
Research
2
190
Deep Learning Image Manipulation
Illustrated guide to some image manipulation methods, with demonstration.
Leszek Rybicki
May 18, 2017
Tweet
Share
More Decks by Leszek Rybicki
See All by Leszek Rybicki
Let's talk about Fakes
lunardog
0
88
How to Patch Image Classifiers
lunardog
0
1.6k
Towards Realistic Predictors - EN
lunardog
0
1.5k
Towards Realistic Predictors
lunardog
1
1.9k
Deep Learning Hot Dog Detector
lunardog
0
220
Finding beans in burgers: paper reading notes
lunardog
0
1.2k
Kelner: Serve Your Models
lunardog
0
98
Image Analysis at Cookpad
lunardog
1
1.5k
Kelner: serve your models
lunardog
1
300
Other Decks in Research
See All in Research
Minimum Bayes-Risk Decoding における性能変動の理解に向けて(2024年6月5日 第59回 NLPコロキウム)
atsumoto
0
210
JMED-LLM: 日本語医療LLM評価データセットの公開
fta98
1
350
仮説検定とP値
shuntaros
6
7.3k
RCEへの近道
kawakatz
1
620
[Human-AI Decision Making勉強会] 説明の更新はユーザにどのような影響をもたらすか
okoso
1
310
自然言語とVision&Language
kuehara
19
4.4k
SSII2024 [SS1] 拡散モデルの今 〜 2024年の研究動向 〜
ssii
PRO
2
1.9k
Conducting AI Research on High-Performance Computing (HPC) Systems
yoshipon
2
460
SSII2024 [OS2] GPT-4Vで画像認識は終わるのか(オープニング)
ssii
PRO
0
640
論文紹介 AST: Audio Spectrogram Transformer
kazu07
0
190
SSII2024 [PD] 画像センシングの未来
ssii
PRO
0
290
機械学習と数理最適化の融合-文脈付き確率的最短路を例として-
mickey_kubo
2
670
Featured
See All Featured
Code Review Best Practice
trishagee
58
16k
Visualization
eitanlees
139
14k
KATA
mclloyd
20
13k
Product Roadmaps are Hard
iamctodd
PRO
48
10k
The Invisible Side of Design
smashingmag
294
50k
Faster Mobile Websites
deanohume
303
30k
Building Effective Engineering Teams - LeadDev
addyosmani
47
2.2k
YesSQL, Process and Tooling at Scale
rocio
166
14k
Typedesign – Prime Four
hannesfritz
37
2.2k
It's Worth the Effort
3n
181
27k
Designing Dashboards & Data Visualisations in Web Apps
destraynor
226
52k
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
129
32k
Transcript
%FFQ-FBSOJOH *NBHF.BOJQVMBUJPO BOJMMVTUSBUFEHVJEF .-,JUDIFO
"CPVUNF w -FT[FL3ZCJDLJ w HJUIVC!MVOBSEPH w CPSOJO1PMBOE w .-3FTFBSDIFSBU$PPLQBE w
*MJLFOBUUP
DBSFFST!DPPLQBEDPN 8BOUUPXPSLXJUIVT
$POWPMVUJPOBM "SJUINFUJD OCIKE
*NBHFTUPGFBUVSFT
$POWPMVUJPO http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html input output input output kernel
4USJEF http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html 2px 2px 2px 2px
1BEEJOH http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html 2px 2px
4USJEF QBEEJOH http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html
5SBOTQPTFE http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html simulated here with padding also called “deconvolution” “fractional
stride”
%PXOTBNQMJOH features or small resolution image convolutional layer or layers
RGB image input output
6QTBNQMJOH upsampling CNN layer or layers RGB image features or
small resolution image input output
&ODPEFS%FDPEFS D E image in Decoder Encoder image out feature
space
'VMMZ$POOFDUFE $MBTTJpFS approve loan reject class data or features also
called “Dense” layer
$//$MBTTJpFS food person plant other AlexNet, LeNet, VGG…
'PPE/FU ™ food not food
@teenybiscuit
None
@teenybiscuit
@teenybiscuit
@teenybiscuit
@teenybiscuit
@teenybiscuit
(FOFSBUJWF "EWFSTBSJBM /FUXPSLT
Generator Discriminator https://speakerdeck.com/lunardog/deep-convolutional-voight-kampf-test “Couple of bots studying for the Turing
Test”
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Alec
Radford, Luke Metz, Soumith Chintala (Submitted on 19 Nov 2015 (v1), last revised 7 Jan 2016 (this version, v2)) https://arxiv.org/abs/1511.06434
Generator Discriminator G MPPLTMFHJU UPUBMMZTIPQQFE D
G SFBM GBLF D D(G(noise)) ˠ real (FOFSBUPSUSBJOJOH Discriminator acts
as the teacher
G SFBM GBLF D SFBM GBLF D D(G(noise)) ˠ fake
D(photo) ˠ real %JTDSJNJOBUPSUSBJOJOH Generator provides negative examples
None
https://www.youtube.com/watch?v=rs3aI7bACGc ©Yota Ishida
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Alec
Radford, Luke Metz, Soumith Chintala (Submitted on 19 Nov 2015 (v1), last revised 7 Jan 2016 (this version, v2)) https://arxiv.org/abs/1511.06434
$POEJUJPOBM ("/T
G NBMF GFNBMF DIJME FMEFSMZ G(noise | conditions) $POEJUJPOBM(FOFSBUPS
SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D $POEJUJPOBM%JTDSJNJOBUPS
SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D SJHIU XSPOH NBMF
GFNBMF DIJME FMEFSMZ SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D D
SJHIU XSPOH D $POEJUJPOBM("/ https://arxiv.org/abs/1411.1784 Conditional Generative Adversarial Nets Mehdi
Mirza, Simon Osindero (Submitted on 6 Nov 2014) Generator Discriminator NBMF GFNBMF DIJME FMEFSMZ G NBMF GFNBMF DIJME FMEFSMZ same condition
G NBMF GFNBMF DIJME FMEFSMZ SJHIU XSPOH NBMF GFNBMF DIJME
FMEFSMZ D $POEJUJPOBM("/ Discriminator Generator
https://www.faceapp.com/ Disclaimer: FaceApp authors don’t disclose their method. This is
only my guess. It may have nothing to do with GANs. original
original https://www.faceapp.com/
https://www.faceapp.com/ original
"SUJTUJD4UZMF5SBOTGFS Improved!
https://prisma-ai.com/
https://prisma-ai.com/ https://prisma-ai.com/
https://prisma-ai.com/ https://prisma-ai.com/
https://prisma-ai.com/ https://prisma-ai.com/
https://arxiv.org/abs/1603.08155 transformation network loss network Gram matrices in feature space
pre-trained content image style image
“Gram matrices in feature space” https://en.wikipedia.org/wiki/Gramian_matrix
https://www.youtube.com/watch?v=xVJwwWQlQ1o
$ZDMF("/
https://github.com/junyanz/CycleGAN
https://github.com/junyanz/CycleGAN
https://github.com/junyanz/CycleGAN
(FOFSBUPS transformation network https://arxiv.org/abs/1603.08155
GBLF IPSTF GBLF IPSTF … %JTDSJNJOBUPS fully convolutional judges patches
of the input image https://arxiv.org/abs/1603.08155
"EWFSTBSJBM-PTT X F G Y GBLF [FCSB GBLF [FCSB …
GBLF IPSTF GBLF IPSTF … X(F(horse)) ˠ classify as zebra Y(F(zebra)) ˠ classify as horse
$ZDMF-PTT G F G(F(image))ˠ the same image F G F(G(image))ˠ
the same image
https://www.youtube.com/watch?v=9reHvktowLY
5IF&OE