Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Deep Learning Image Manipulation
Search
Leszek Rybicki
May 18, 2017
Research
2
220
Deep Learning Image Manipulation
Illustrated guide to some image manipulation methods, with demonstration.
Leszek Rybicki
May 18, 2017
Tweet
Share
More Decks by Leszek Rybicki
See All by Leszek Rybicki
Let's talk about Fakes
lunardog
0
150
How to Patch Image Classifiers
lunardog
0
2.5k
Towards Realistic Predictors - EN
lunardog
0
2.4k
Towards Realistic Predictors
lunardog
1
2.3k
Deep Learning Hot Dog Detector
lunardog
0
290
Finding beans in burgers: paper reading notes
lunardog
0
1.8k
Kelner: Serve Your Models
lunardog
0
130
Image Analysis at Cookpad
lunardog
1
1.8k
Kelner: serve your models
lunardog
1
400
Other Decks in Research
See All in Research
ローテーション別のサイドアウト戦略 ~なぜあのローテは回らないのか?~
vball_panda
0
280
An Open and Reproducible Deep Research Agent for Long-Form Question Answering
ikuyamada
0
260
社内データ分析AIエージェントを できるだけ使いやすくする工夫
fufufukakaka
1
890
LLM-jp-3 and beyond: Training Large Language Models
odashi
1
760
学習型データ構造:機械学習を内包する新しいデータ構造の設計と解析
matsui_528
6
3.1k
R&Dチームを起ち上げる
shibuiwilliam
1
150
POI: Proof of Identity
katsyoshi
0
140
【NICOGRAPH2025】Photographic Conviviality: ボディペイント・ワークショップによる 同時的かつ共生的な写真体験
toremolo72
0
160
OWASP KansaiDAY 2025.09_文系OSINTハンズオン
owaspkansai
0
110
財務諸表監査のための逐次検定
masakat0
1
250
"主観で終わらせない"定性データ活用 ― プロダクトディスカバリーを加速させるインサイトマネジメント / Utilizing qualitative data that "doesn't end with subjectivity" - Insight management that accelerates product discovery
kaminashi
15
20k
ロボット学習における大規模検索技術の展開と応用
denkiwakame
1
210
Featured
See All Featured
Fireside Chat
paigeccino
41
3.8k
How to Create Impact in a Changing Tech Landscape [PerfNow 2023]
tammyeverts
55
3.2k
Collaborative Software Design: How to facilitate domain modelling decisions
baasie
0
140
Raft: Consensus for Rubyists
vanstee
141
7.3k
Measuring Dark Social's Impact On Conversion and Attribution
stephenakadiri
1
120
Money Talks: Using Revenue to Get Sh*t Done
nikkihalliwell
0
150
A Soul's Torment
seathinner
5
2.3k
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
26
3.3k
The AI Search Optimization Roadmap by Aleyda Solis
aleyda
1
5.2k
End of SEO as We Know It (SMX Advanced Version)
ipullrank
3
3.9k
Everyday Curiosity
cassininazir
0
130
Bash Introduction
62gerente
615
210k
Transcript
%FFQ-FBSOJOH *NBHF.BOJQVMBUJPO BOJMMVTUSBUFEHVJEF .-,JUDIFO
"CPVUNF w -FT[FL3ZCJDLJ w HJUIVC!MVOBSEPH w CPSOJO1PMBOE w .-3FTFBSDIFSBU$PPLQBE w
*MJLFOBUUP
DBSFFST!DPPLQBEDPN 8BOUUPXPSLXJUIVT
$POWPMVUJPOBM "SJUINFUJD OCIKE
*NBHFTUPGFBUVSFT
$POWPMVUJPO http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html input output input output kernel
4USJEF http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html 2px 2px 2px 2px
1BEEJOH http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html 2px 2px
4USJEF QBEEJOH http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html
5SBOTQPTFE http://deeplearning.net/software/theano/tutorial/conv_arithmetic.html simulated here with padding also called “deconvolution” “fractional
stride”
%PXOTBNQMJOH features or small resolution image convolutional layer or layers
RGB image input output
6QTBNQMJOH upsampling CNN layer or layers RGB image features or
small resolution image input output
&ODPEFS%FDPEFS D E image in Decoder Encoder image out feature
space
'VMMZ$POOFDUFE $MBTTJpFS approve loan reject class data or features also
called “Dense” layer
$//$MBTTJpFS food person plant other AlexNet, LeNet, VGG…
'PPE/FU ™ food not food
@teenybiscuit
None
@teenybiscuit
@teenybiscuit
@teenybiscuit
@teenybiscuit
@teenybiscuit
(FOFSBUJWF "EWFSTBSJBM /FUXPSLT
Generator Discriminator https://speakerdeck.com/lunardog/deep-convolutional-voight-kampf-test “Couple of bots studying for the Turing
Test”
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Alec
Radford, Luke Metz, Soumith Chintala (Submitted on 19 Nov 2015 (v1), last revised 7 Jan 2016 (this version, v2)) https://arxiv.org/abs/1511.06434
Generator Discriminator G MPPLTMFHJU UPUBMMZTIPQQFE D
G SFBM GBLF D D(G(noise)) ˠ real (FOFSBUPSUSBJOJOH Discriminator acts
as the teacher
G SFBM GBLF D SFBM GBLF D D(G(noise)) ˠ fake
D(photo) ˠ real %JTDSJNJOBUPSUSBJOJOH Generator provides negative examples
None
https://www.youtube.com/watch?v=rs3aI7bACGc ©Yota Ishida
Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks Alec
Radford, Luke Metz, Soumith Chintala (Submitted on 19 Nov 2015 (v1), last revised 7 Jan 2016 (this version, v2)) https://arxiv.org/abs/1511.06434
$POEJUJPOBM ("/T
G NBMF GFNBMF DIJME FMEFSMZ G(noise | conditions) $POEJUJPOBM(FOFSBUPS
SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D $POEJUJPOBM%JTDSJNJOBUPS
SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D SJHIU XSPOH NBMF
GFNBMF DIJME FMEFSMZ SJHIU XSPOH NBMF GFNBMF DIJME FMEFSMZ D D
SJHIU XSPOH D $POEJUJPOBM("/ https://arxiv.org/abs/1411.1784 Conditional Generative Adversarial Nets Mehdi
Mirza, Simon Osindero (Submitted on 6 Nov 2014) Generator Discriminator NBMF GFNBMF DIJME FMEFSMZ G NBMF GFNBMF DIJME FMEFSMZ same condition
G NBMF GFNBMF DIJME FMEFSMZ SJHIU XSPOH NBMF GFNBMF DIJME
FMEFSMZ D $POEJUJPOBM("/ Discriminator Generator
https://www.faceapp.com/ Disclaimer: FaceApp authors don’t disclose their method. This is
only my guess. It may have nothing to do with GANs. original
original https://www.faceapp.com/
https://www.faceapp.com/ original
"SUJTUJD4UZMF5SBOTGFS Improved!
https://prisma-ai.com/
https://prisma-ai.com/ https://prisma-ai.com/
https://prisma-ai.com/ https://prisma-ai.com/
https://prisma-ai.com/ https://prisma-ai.com/
https://arxiv.org/abs/1603.08155 transformation network loss network Gram matrices in feature space
pre-trained content image style image
“Gram matrices in feature space” https://en.wikipedia.org/wiki/Gramian_matrix
https://www.youtube.com/watch?v=xVJwwWQlQ1o
$ZDMF("/
https://github.com/junyanz/CycleGAN
https://github.com/junyanz/CycleGAN
https://github.com/junyanz/CycleGAN
(FOFSBUPS transformation network https://arxiv.org/abs/1603.08155
GBLF IPSTF GBLF IPSTF … %JTDSJNJOBUPS fully convolutional judges patches
of the input image https://arxiv.org/abs/1603.08155
"EWFSTBSJBM-PTT X F G Y GBLF [FCSB GBLF [FCSB …
GBLF IPSTF GBLF IPSTF … X(F(horse)) ˠ classify as zebra Y(F(zebra)) ˠ classify as horse
$ZDMF-PTT G F G(F(image))ˠ the same image F G F(G(image))ˠ
the same image
https://www.youtube.com/watch?v=9reHvktowLY
5IF&OE