Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
CVPR2019参加速報 本会議 3日目 / CVPR2019 Personal Memo: Day 3
Search
Atsushi
June 21, 2019
Technology
0
260
CVPR2019参加速報 本会議 3日目 / CVPR2019 Personal Memo: Day 3
チラシの裏チラシの裏チラシの裏チラシの裏チラシの裏チラシの裏
Atsushi
June 21, 2019
Tweet
Share
More Decks by Atsushi
See All by Atsushi
CVPR2019参加速報 本会議 2日目 / CVPR2019 Personal Memo: Day 2
atsushihashimoto
0
220
CVPR2019 参加速報 本会議1日目 / CVPR2019 Personal Memo: Day 1
atsushihashimoto
0
350
Other Decks in Technology
See All in Technology
モーダル間の変換後の一致性とジャンル表を用いた解釈可能性の考察 ~Text-to-MusicとText-To-ImageかつImage-to-Musicを例に~
otanet
0
310
2023年度にEMとして頑張ったこと
ikefukurou777
0
100
AWSに詳しくない人でも始められるコスト最適化ガイド
yuhta28
2
400
Building a RAG-poweredAI chat appwith Python and VS Code
pamelafox
0
160
エンジニア候補者向け資料2024.04.24.pdf
macloud
0
3.4k
M5と自作基板をくっつけてみた〜M5 Japan Tour 2024 Spring 福冈 (Fukuoka|福岡)〜
keropiyo
0
150
2024春 注目のWeb系 OSS & SaaS 3選
makies
0
180
GrafanaMeetup_AmazonManagedGrafanaのアクセス制御機能とマルチテナント環境下でのアクセス制御について
daitak
0
400
よく聞くけど使ったことないソフトウェアNo.1 KafkaとSnowflake
foursue
4
510
VSCodeの拡張機能を作っている話
ebarakazuhiro
1
810
Gradle Build Scanを使ってビルドのことを知ろう potatotips #87
tomorrowkey
2
160
TechFeed Experts Night#27 〜 フロントエンドフレームワーク最前線 (Svelte)
baseballyama
2
590
Featured
See All Featured
The Pragmatic Product Professional
lauravandoore
26
5.8k
Intergalactic Javascript Robots from Outer Space
tanoku
266
26k
ReactJS: Keep Simple. Everything can be a component!
pedronauck
660
120k
How to Ace a Technical Interview
jacobian
273
22k
Fireside Chat
paigeccino
22
2.6k
Faster Mobile Websites
deanohume
300
30k
Visualization
eitanlees
137
14k
What's new in Ruby 2.0
geeforr
337
31k
How to Create Impact in a Changing Tech Landscape [PerfNow 2023]
tammyeverts
20
1.6k
Raft: Consensus for Rubyists
vanstee
133
6.3k
Statistics for Hackers
jakevdp
790
220k
A designer walks into a library…
pauljervisheath
201
23k
Transcript
CVPR2019 Main Conference the 3rd day
Deep LearningΛ͍ͬͯΔ ͷ͕ͩɺLosslessͰը૾Λѹ ॖ͢Δख๏ɻ ֤ղ૾ͰEntropyූ߸ԽΛ ͬͨEncoding ˠղ૾ͷ͍ը૾ͱͯ͠ ͕ࠩͰΔ(͜͜ͷৄࡉ͕Θ͔ Βͳ͔ͬͨ) ˠ(܁Γฦ͠)
ˠখ͞ͳը૾͕ࠩ0ʹͳ ΔΑ͏ʹEntropyූ߸Խ σίʔυٯॱͰɻ Poster൪߸1൪ʹ ϙελʔηογϣϯ ։࢝ޙʹ͍͘ͱ શ͚ۙͮ͘ͳ͍ͷͰ ҙɻ •
1࣍ݩʹslice͢ΔWasserstein Distance ˠn࣍ݩʹslice͢ΔWD(͜͜·Ͱैདྷख๏) n͕Exponentialʹେ͖͘ͳΔ͕͋ͬ ͨɻ ͦͷͨΊɺn࣍ݩͷmaxΛڑͱ͢ΔWD ΛఏҊɻڑͷެཧΛຬͨ͢ɻ ·ͨɺnఆͰྑ͍ɻ
Ϩγϐʹ͓͚Δ ը૾⁶खॱจॻͷ ຒΊࠐΈɻ ςΩετ͔Βը૾Λੜ ͠ɺGANͰຊͬΆ͘͢Δɻ ڞ༗જࡏ্ۭؒͷಛྔ͚ͩ Ͱͳ͘ɺը૾ͷྨࣅߟྀ ʹ͍Εͯݕࡧ͢Δ͜ͱͰੑೳ ্͕ͱͷ͜ͱɻ (ٯʹݴ͑ɺಛྔͷൺֱͩ
͚Ͱ͋·ΓޮՌ͕ͳ͍ʁ)
ࡢͷStudent Best PaperͰ͋Δ Total Captureͷଓฤɻ ࡢإମɺखͷ࢟ͷਪఆ ผʑʹͬͯɺଓ෦Ͱແཧ ΓҐஔ߹Θ͍͚ͤͯͨ͠Ͳɺ͜Ε Λ͏গ͠౷߹తͳਪఆʹͨ͠ ͷɻओʹɺखإ౷߹ͨ͠Part
Orientation FieldsͱɺMesh TrackingʹΑΔମද໘ͷಈ͖ͷਪ ఆ͔ΒͳΔɻ ֶश༻σʔλ૿༷͑ͨɻ Tired
ը૾͔Β࢟Λਪఆ͢Δω οτϫʔΫΛ͍ͭͭɺը ૾Λճసͤͯ͞ɺ͝ʹΐ͝ ʹΐͱ͢Δ͜ͱͰ࢟Ҏ֎ ͷಛΛநग़͢Δωοτϫ ʔΫΛఏҊɻ
Flickerͷhash tagΛͬͯɺͱʹ͔ ͨ͘͘͞ΜσʔλΛͬͯpre- training͢Δ͜ͱΛతͱͨ͠ख ๏ɻhash tagͷ͏ͪɺಈ࡞ͱؔ ͳ͍ͷଟ͍ͷͰɺطଘͷσʔ ληοτͷΞϊςʔγϣϯͱྨࣅ ͨ͠λάͷΈΛऑڭࢣσʔλͱ͠ ͯ࠾༻ɻҎ্ͷॲཧΛશͯࣗಈ
Ͱɺ1ϲֶ݄͔͚ۙͯ͘शͨ͠ͱͷ ͜ͱɻ 0 -_- ͍ͭͯ ͱͯ͠
લఏ: ಉ͡खॱͷ࡞ۀΛͨ͠ಈը͕ෳ ͋Δɻ త: ࡞ۀͷதͷಈ࡞ΧςΰϦΛɺͦͷ ಛྔͱͱʹࣗಈͰಛఆɻ ํ๏: ࡞ۀͷ։͔࢝Βऴྃ·Ͱͷ࣌ࠁtΛ 0ʙ1Ͱද͢ɻ ֤࣌ࠁͷը૾ྻΛೖྗͱͯ͠ɺtΛਪఆ
͢Δself-supervised learningΛߦ͏ɻ (ಈ࡞ͷॱ൪ʹ͕͋ΔͳΒಛྔͱt ͕݁ͼͭͣ͘ɺͱ͍͏Ծఆ) ͜͏ͯ͠ಘΒΕͨಛྔΛΫϥελϦϯ ά͢Δͱɺಈ࡞ΫϥεʹͳΓಘΔɻ
Grassman Manifolds্ͷʹରͯ͠ɺࣜ(3)ͷ Α͏ʹೋͭͷΧʔωϧΛ༻͍ͨڑ(?)Λ spectral clusteringʹΑΓ࠷খԽɻ ΧʔωϧΛೋͭ༻͍Δ͜ͱͷϝϦοτྑ͘ Θ͔Βͳ͔ͬͨɻGrassman Manifolds্Ͱ Spectral Clusteringɺͱ͍͏ͷੲMulti-layer
Spectral Clusteringͱ͍͏จͰಡΜͩΑ͏ʹ ࢥ͏͠ɺͦ͜ʹৄ͘͠ॻ͍ͯ͋ͬͨͣͳͷ ͰɺͪΌΜͱཧղ͠Α͏ͱࢥͬͨΒɺ͔ͦ͜ ΒಡΈͯ͠ྑ͍͔ɻ
͜ΕɺD2AE(CVPR2018)Λ VAEͬͯ៉ྷʹ࣮ͨ͠ ͚ͩͰ???
Monocular DepthͰɺ࣌ܥྻ؍ଌΛੵ ͢Δ͜ͱͰਫ਼Λ͋͛Δख๏ɻ
Event Camera (ޫྔͷ૿ݮͷΈΛଊ ͑ΔΧϝϥ)ʹ͓͚Δػցֶश༻ͷଛ ࣦؔͷྑ͠ѱ͠Λௐͨαʔϕ Πɻ ͜͜ʹͨ͘͞Μྻڍ͞Ε͍ͯΔɻ Β Ұ
Monocular DepthΛ͢Δ࣌ ʹྨࣅ͢ΔλεΫ(optical flowͱ͔৭ʑ)Λͬͨmulti- taskʹ͠Α͏ɺͱ͍͏ͱ͜ Ζ·ͰڈʹͲ͔͜Ͱݟ ͨؾ͕͢Δɻ ͪΐͬͱͲ͜ʹ৽نੑ͕͋ Δͷ͔·Ͱ͑ΒΕ͍ͯ ͳ͍ɻ
Unsupervised Domain AdaptationΛ͢Δͱ͖ʹɺGANΛͬͯυϝΠϯΛ૿ͯ͠ؤ݈ੑ Λ ͋͛Δख๏ɻී௨ͷreconstruction loss + adv. loss, color
consistency loss + adv. loss, fullͷ 3छྨͰυϝΠϯΛ3ͭ૿͢(Domain Diversification... ͍ͩͿώϡʔϦεςΟοΫ͕ͩ). ͦͷ্ͰɺMulti-source domain adaptationΛߦ͏ɻମݕग़ͰධՁɻ
ࣸਅʹରͯ͠ɺࣗવݴޠͰ༩͑ͨΫΤ ϦʹΑͬͯɺͲͷ෦͕ݴٴ͞Ε͔ͨ Λattentionͱͯ͠औΓग़͢ख๏ɻ
ࣝผ݁Ռʹରͯ͠ɺͳͥͦ͏அͨ͠ͷ͔ ͑ΒΕΔΑ͏ʹͨ͠Ϟσϧɻ આ໌ ςΩετ+BBͰ༩͑ΒΕΔɻ આ໌ͷੜ෦ࣄલʹڭࢣ͋ΓͰֶ शɻ ʔ
ಉ͘͡આ໌Λੜ͢Δݚ ڀɻ # ྆ํͱ౦େݪాݚ ࣸਅͰͪΐ͏ͲӅΕͯ͠ ·͍ͬͯΔ͕ɺઆ໌Λ Because ࣝผ݁Ռͷௗͷಛ ͷઆ໌ as
ը૾1, not as ը ૾2ɺΈ͍ͨͳܗͰ 1. ಉ͡ಛΛͭଞͷௗ 2. ͦͷಛΛ࣋ͨͳ͍͕ࣅ ͨௗ Λදࣔ͢Δɻ
Hand-crafted ͳWarpingؔΛೖ ΕΔ͜ͱͰdisentanglingΛ͢Δख ๏ɻwarpingؔͰRealͳը૾ ΛੜͰ͖ͳ͍ͷͰɺ݁ՌͷΫΦ ϦςΟͪΐͬͱѱ͍Α͏ͳؾ ͢Δɻ
Video frameͷॱংΛsort͢Δself-supervised learningख๏CVPR2017Ͱൃද͞Ε͍ͯΔ ͕ɺframeͰͳ͘ɺvideo clipͷsortΛ͢Δख ๏ɻ 2017ͷࠒΑΓGPUͷmemoryαΠζ͕૿͑ ͯྑ͔ͬͨɻ
UDAΛ͢Δͱ͖ͷDiscriminator ͱͯ͠1࣍ݩͷSliced Wasserstein DistanceΛ༻͍ Δɻ ΧςΰϦؒʹҙͷॱংؔΛ ࡞ΓɺΧςΰϦؒͷWDΛ(ॱং ʹԠͯ͡)େ͖͘͠ͳ͕Βɺυ ϝΠϯؒͷڑΛখ͘͢͞Δ͜ ͱͰΑΓΑ͍UDAΛ࣮ݱɻ
ޙड़͢Δ͕ɺ͜Ε͔ͳΓ͍͍ ײ͕ͩ͡SOTAͰͳ͍ɻ
ಈըͷதʹɺ΄͔ͷಈը͔ΒಘΒΕ ͨಈମͷը૾ྻΛࣗવʹຒΊࠐΉ ͜ͱͰɺମݕग़ͷ܇࿅σʔλΛਫ ૿͠ɻ
࠷ऴϑϨʔϜ͔Βͷܦա࣌ࠁΛ conditionͱͯ͠ར༻͢Δ͜ͱͰɺ ҙͷ࣌ࠁͷanticipationΛ͢Δख๏ɻ Epic KitchenͰ60ඵઌ·Ͱ༧ଌɻ ಉ͡࡞ۀ͕ଓ͔͘Ͳ͏͔ͯΒΕ ͍ͯΔ͕ɺ֬৴ͷΑ͏ͳͷಉ ࣌ʹग़ྗ͍ͨ͠ͱײͨ͡ɻ ͳ͓ɺ͋·Γԕ͘ͷະདྷ·Ͱཉுͬ ֶͯशͤ͞Δͱਫ਼͕Լ͕Δɺͱ͍
͏ͷͬͯΈͨΒ͍͠ɻ ࠷ॳɺEpic KitchenͷAction Anticipation͜͏͍͏λεΫͩͱࢥ ͬͯͨɻ1ඵઌΛ༧ଌ͢ΔΜ͡Όͳ͘ ͬͯ...ɻ
إͷdepthը૾Domain InvariantͰ͋Δɺͱ ͍͏ԾઆͷԼɺdepthΛ༧ଌ͢Δmain branch ͱɺdomainຖͷdomain specific branchΛֶ शɻ֤domain specific branchͷಛ͕main
branchͱ۠ผ͔ͭͳ͘ͳΔΑ͏ʹ͢Δ͜ͱͰ generalizationΛߦ͏ɻ depthdomainඇґଘɺͱ͍͏ࣄલ͕ࣝॏ ཁʹݟ͑Δɻٯʹݴ͑ɺdepthηϯαʔͷछ ྨإͷ֯ɺڑͳͲ͕ҧͬͨΒɺ͜ͷख ๏͏·͍͔͘ͳ͍ͣɻ
खͷಛΛ͍ͯ͠Δͣͩ ͕ɺஶऀ͍ͳ͍͠ Inference modelsͷઆ໌ෆ໌ͳ ͷͰɺৄࡉΘ͔Βͣɻ
ೕ༮ࣇͷֶशաఔΛ฿͢ Δ͜ͱͰɺIncrementalͳ ମࣝผΛ࣮ݱ͠Α͏ɺͱ͍ ͏νϟϨϯδɻ σʔληοτɺΧϝϥ͕ ମಈ༳ͷΑ͏ͳಈ͖Λ͠ ͓ͯΓɺͦͷதͰମ ͕(࣋͞Εͯ৭ʑͳ֯ ʹ͞Ε͍ͯΔ͔ͷΑ͏ʹ)͘ Δ͘Δճ͍ͬͯΔɻ
ϥϯμϜʹ৭ʑͳମ͕ॱ ൪ʹ࣋͞ΕΔɻ ;ͭ͏ʹΔͱ Catastrophic Forgetting͕ ى͖ΔͷͰͦΕΛͲ͏ճආ ͠·͠ΐ͏ʁͱ͍͏ఏ ى(ͱ؆୯ͳbase lineΆ͍
खͰ࣋ͨ͠෦ͷԹ͕͔͋ͨͨ ͘ͳΔ͜ͱΛར༻ͯ͠ ͞·͟·ͳମ(ͷ3DϓϦϯλͰҹ ͨ͠ϨϓϦΧ)ΛඃݧऀʹѲͬͯ ΒͬͯɺͲ͜Λ৮͍͔ͬͯͨΛճస ςʔϒϧʹͤͯ360ࡱӨͨ͠σ ʔληοτɻ ϩϘοτͷ࣋Λҙ͍ࣝͯ͠ΔΒ͠ ͍͕ɺखͷܗ͕ਓͱϩϘοτͰҧ͏ ͔ΒɺͲ͏͑ΔΜͩΖ͏͔ɺͱͪ
ΐͬͱࢥ͏͕ɺগͳ͘ͱਓ͕৮ͬ ͯྑ͍ͱஅͨ͠ॴͳͲΘ͔Δ ͔ͳɻ
1ϑϨʔϜ͚ͩਖ਼ղͷsegmentation͕͍͍ͯͯɺͦ ΕΛ͢ΔΑ͏ͳઃఆʹ͓͍ͯɺͦͷ1ϑϨʔ ϜΛͲ͜ʹ͢ΔͱҰ൪ਫ਼͕ߴ͘ͳΔ͔Λఆ͢Δ ख๏ͷఏҊɻجຊతʹը૾2ຕΛ͍ΕͯɺͲͪΒ͕ ྑ͍͔ఆ͢ΔωοτϫʔΫΛ͍ɺόϒϧιʔτ Λͤ͞Δɻ CVPRҰٳ͞Μ͔ͳʁͱࢥͬͨݩڟͷݚڀɻ
UDAͷSOTAɻଞͷख๏͕Adv. Training DiscrepancyΛ࠷খԽ͢Δͷʹର͠ ͯɺਅ໘ʹυϝΠϯຖͷฏۉͱࢄ Λܭࢉͯ͠ Batch Normalizationͷ࣌ʹฏۉ0,ࢄ ͕EͱͳΔΑ͏ʹwhitening͢Δɺͭ· Γਅ໘ͳύϥϝλਪఆʹΑΔख๏ɻ ͜ΕʹՃ͑ͯMin-Entropy
Consensus Lossͱ͍͏͍ͯ͠Δ͕ɺͦΕ͕ ͳͯ͘SOTAɻͳ͓ɺTarget Domain ୯ମͷࣝผ݁ՌΑΓਫ਼͕ߴ͍ɻ
͜ΕɺҰൃ͚ܳͩͲ໘ന͔ͬͨɻ semantic segmentationʹՃ͑ͯɺମத৺ ͕ͦͷըૉ͔ΒΈͯͲͪΒ(&ڑ)ʹ͋Δ͔ɺ ͱ͍͏ํͷใֶशɻ͜ΕʹΑΓɺಉҰ ମͷsegmentͰ͋ͬͯinstanceͷڥքΛ ܭࢉՄೳʹ͍ͯ͠Δɻ ํͷใΛOptical FlowΈ͍ͨʹՄࢹԽ͠ ͨͷ
ʔ
Motion SegmentationͰRegion ProposalΛunsupervisedʹ܇࿅͢ Δख๏ɻ Medical Imagingͱ͔Ͱʹཱͪ ͦ͏ɻ
ӴࣸਅͳͲͷൺֱ࣌ʹɺΑ͋͘ ΔมԽ(রͷࡱӨ݅มԽ)ͱ ϨΞͳมԽ(ݐ͕૿͑ͨɺͳͲ) Λ۠ผ͢Δख๏ɻ 2ຕͷը૾ʹڞ௨͢ΔbranchΛ ͬͨreconstruction݁Ռͱɺೖྗ ը૾ͷޡࠩʹج͍ͮͯϨΞ͔Ͳ͏ ͔Λఆ(ݫີʹBackprop.ͨ͠ ͱ͖ͷgradientͷେ͖͞ɺͱ͍ͬ ͍͕ͯͨɺࠓʹͳͬͯɺޡࠩͰ
ఆ͢ΔͷͱԿ͕ҧ͏ͷ͔Θ͔Βͳ ͘ͳ͍ͬͯΔͳ͏)
ըૉຖʹԻݯΛ͢ Δख๏ͳͲͰɺैདྷ ख๏ͰɺԻݯ͕ө૾ ͷϑϨʔϜʹ͋Δ͜ ͱ͕Ծఆ͞Ε͍ͯͨɻ ͜ͷԾఆΛͣͨ͢Ί ʹɺڞ༗જࡏಛΛֶ शͨ͋͠ͱͰɺಈըຖ ʹΫϥελϦϯάʹΑ ͬͯରԠؔΛٻΊ
ΔɻԻ͔Βಘͨಛͩ ͚͔ΒͳΔΫϥελ ɺը૾ʹ͍ࣸͬͯ ͳ͍ԻݯͱఆͰ͖ Δɻ
Deep NNʹAdaBoostΛద༻ ͢Δͱɺաֶशͯ݁͠Ռ͕ѱ ͘ͳΔɻ͜ͷݪҼΛɺαϯϓ ϧຖͷWeightίϯτϩʔϧ ʹ͋Δʢͭ·Γɺޡࣝผ͢ Δαϯϓϧ͕গͳ͗͢Δͱɺ ޙଓͷClassifier͕ͦΕΒʹ overfit͢Δʣͱߟ͑ɺ category-wiseͳॏΈͷߋ৽
ʹΑΔAdaBoost-likeͳΞϯ αϯϒϧख๏ΛఏҊɻ AdaBoostʹৄ͔ͬͨ͠Βൃ දऀʹײಈ͞Εͨͷ͕ͩɺ ͜Ε(δΣωϨʔγϣϯ)Ϊ ϟοϓ๖͑ͷҰछͩΖ͏͔ʁ
ֶशͷޮԽͷͨΊʹConvolutionͷνϟϯ ωϧΛάϧʔϓʹΘ͚Δʢάϧʔϓؒͷࢬ͕ ফ͑ΔͷͰύϥϝλ͕ݮΔʣɺͱ͍͏ख๏ ʹ͓͍ͯɺGroupֶ͚शͰಘΑ͏ɺͱ͍ ͏ख๏ɻ
ֶशʹ༻͍Δ͖Ͱͳ͍όΠΞεΛֶशͨ͠ ߹ʹϖφϧςΟΛ༩͑Δख๏ɻ ೦͗͢ΔͷόΠΞε͕طͰ͋Δͱ͍͏ ԾఆɻόΠΞεΛ͍ͬͯ͠ΕDAͰ͖Δ͠ɺ DAͰ͖ͨΒόΠΞεΘ͔ΔͷͰɺܲͱཛͷ ؔʹ͋ΔͣͳͷʹɺҰํΛطͱͨ͠Β ͩΊͰ͠ΐ...ࠪಡऀࣄ͠ɺ͛;Μɻ