Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
医療ディープラーニング勉強会 DL勉強会 第3回 2020.4
Search
M.Inomata
April 01, 2020
Technology
560
0
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
医療ディープラーニング勉強会 DL勉強会 第3回 2020.4
https://deeplearning-b.connpass.com/event/169952/
M.Inomata
April 01, 2020
More Decks by M.Inomata
See All by M.Inomata
20221005_AI勉強会
ecoopnet
0
580
20220112_AI勉強会
ecoopnet
0
580
AI最新論文読み会2021年11月
ecoopnet
0
370
AI最新論文読み会2021年5月
ecoopnet
0
580
AI最新論文読み会2020年12月
ecoopnet
0
450
AI最新技術Update会 8月
ecoopnet
0
510
Kotlin Coroutines ことはじめ
ecoopnet
2
380
ハマった時に役立つ通信系デバッグの話
ecoopnet
3
1.9k
モバイルアプリ開発に使える設計の話
ecoopnet
1
1.7k
Other Decks in Technology
See All in Technology
サイバーセキュリティ概論 / Introduction to Cybersecurity
ks91
PRO
0
160
正解のないAIプロダクトをどう導くか?dodaが挑む、ユーザーの『本音』を構造化する評価設計と検証のリアル
techtekt
PRO
0
180
【Gen-AX】20260530開催_JJUG CCC 2026 Spring
genax
0
420
[モダンアプリ勉強会]今更聞けないGit/GitHub入門
tsukuboshi
0
280
ブロックチェーン / Blockchain
ks91
PRO
0
110
Diagnosing performance problems without the guesswork
elenatanasoiu
0
170
GoとSIMDとWasmの今。
askua
3
510
Platform engineering for developers, architects & the rest of us (AI agents)
danielbryantuk
0
180
SIer20年! 培ったスキルがスタートアップで輝く時
shucho0103
0
440
PHP と TypeScript の型システム比較:AI 時代の「型」は誰のためにあるのか? #frontend_phpcon_do / frontend_phpcon_do_2026
shogogg
1
250
AIプラットフォームを運用し続けるための可観測性
tanimuyk
4
1.1k
BigQuery の Cross-cloud Lakehouse への歩み
phaya72
2
570
Featured
See All Featured
The Limits of Empathy - UXLibs8
cassininazir
1
350
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
254
22k
Building Experiences: Design Systems, User Experience, and Full Site Editing
marktimemedia
0
520
Balancing Empowerment & Direction
lara
6
1.1k
Unlocking the hidden potential of vector embeddings in international SEO
frankvandijk
0
830
エンジニアに許された特別な時間の終わり
watany
107
250k
Beyond borders and beyond the search box: How to win the global "messy middle" with AI-driven SEO
davidcarrasco
3
150
Money Talks: Using Revenue to Get Sh*t Done
nikkihalliwell
0
240
How to train your dragon (web standard)
notwaldorf
97
6.7k
Organizational Design Perspectives: An Ontology of Organizational Design Elements
kimpetersen
PRO
1
720
Being A Developer After 40
akosma
91
590k
Navigating the Design Leadership Dip - Product Design Week Design Leaders+ Conference 2024
apolaine
1
340
Transcript
ҩྍσΟʔϓϥʔχϯάษڧձ ୈ3ճ DLษڧձ ᷂tech vein ழມ ॆԝ
ࣗݾհ ழມ ॆԝ (͍ͷ·ͨ ΈͭͻΖ) גࣜձࣾ tech vein දऔక ݉
σϕϩούʔ twitter: @ino2222 IUUQTXXXUFDIWFJODPN
ΞδΣϯμ Archive Sanity (arxiv-sanity.com) ͔ΒϐοΫΞο ϓͨ͠աڈ1ϲ݄ؒͷจհɻ ɾtop recentͷจτοϓ10 ɾtop hype
ͷจτοϓ10 ɾҰ൪ؾʹͳͬͨจ1ͭ
Archive Sanity? https://www.arxiv-sanity.com/top
Arxiv Sanity Top recent: Best10
ᶃA Primer in BERTology: What we know about how BERT
works BERTֶͷೖॻɻBERT͕ͲͷΑ͏ʹػೳ͢Δ͔ʹ͍͍ͭͯͬͯΔ͜ͱ τϥϯεϑΥʔϚʔܕϞσϧ(Transformer-based models)ݱ ࡏɺNLPͰ͘ΘΕ͍ͯ·͕͢ɺͦͷ෦ͷΈʹ͍ͭ ͯ·ͩ͋·Γཧղ͞Ε͍ͯ·ͤΜɻຊߘͰɺ༗໊ͳBERT Ϟσϧ(Devlin et al. 2019)ʹ͍ͭͯɺ40Ҏ্ͷղੳݚڀΛ߹ ͯ͠ɺ͜Ε·ͰʹΒΕ͍ͯΔ͜ͱΛઆ໌͠·͢ɻ·ͨɺ ఏҊ͞Ε͍ͯΔϞσϧͷमਖ਼ͱͦͷ܇࿅ϨδʔϜͷ֓ཁΛઆ ໌͠·͢ɻͦͯ͠ɺ͞ΒͳΔݚڀͷํੑΛ֓આ͠·͢ɻ https://arxiv.org/abs/2002.12327v1
ᶄOn Feature Normalization and Data Augmentation ಛͷਖ਼نԽͱσʔλ֦ுʹ͍ͭͯ ݱͷχϡʔϥϧωοτϫʔΫ܇࿅ɺҰൠԽΛվળ͢ΔͨΊʹσʔλͷ૿ େʹେ͖͘ґଘ͍ͯ͠·͢ɻϥϕϧอଘܕͷ૿େ๏͕࠷ॳʹޭͨ͠ޙɺ࠷ۙ Ͱɺֶश͞Εܾͨఆ໘ΛΒ͔ʹ͢ΔͨΊʹɺֶशαϯϓϧશମͷಛͱ
ϥϕϧΛΈ߹ΘͤΔϥϕϧઁಈ๏ͷؔ৺͕ߴ·͍ͬͯ·͢ɻຊจͰɺ ಛͷਖ਼نԽʹΑͬͯநग़͞Εͨୈ1ͱୈ2ͷϞʔϝϯτΛར༻ͨ͠৽͍͠૿ ڧ๏ΛఏҊ͠·͢ɻֶशͨ͠ಛྔͷϞʔϝϯτΛผͷֶशը૾ͷϞʔϝϯτ ʹஔ͖͑ΔͱͱʹɺඪϥϕϧΛิؒ͢ΔɻզʑͷΞϓϩʔνߴͰ ͋ΓɺಛۭؒશମͰಈ࡞͠ɺैདྷͷख๏ͱҟͳΔ৴߸Λࠞ߹͢ΔͨΊɺ طଘͷ૿ڧख๏ͱޮՌతʹΈ߹ΘͤΔ͜ͱ͕Ͱ͖·͢ɻզʑɺίϯ ϐϡʔλϏδϣϯɺԻɺࣗવݴޠॲཧͷϕϯνϚʔΫσʔληοτʹ͓͍ ͯɺͦͷ༗ޮੑΛ࣮ূ͠·ͨ͠ɻ https://arxiv.org/abs/2002.11102v2
None
ᶅKnowledge Graphs φϨοδάϥϑ ຊจͰɺφϨοδάϥϑʹ͍ͭͯแׅతʹհ͠·͢ɻφϨοδάϥϑɺଟ ༷ͰಈతͳେنσʔλͷίϨΫγϣϯΛར༻͢Δ͜ͱΛඞཁͱ͢ΔγφϦΦʹ͓ ͍ͯɺۙɺ࢈ۀքͱֶज़քͷํ͔Βେ͖ͳΛूΊ͍ͯ·͢ɻҰൠతͳհ ͷޙɺφϨοδάϥϑʹ༻͞ΕΔ༷ʑͳάϥϑϕʔεͷσʔλϞσϧͱΫΤϦݴ ޠͷಈػ͚ͱରൺΛߦ͍·͢ɻφϨοδάϥϑʹ͓͚ΔεΩʔϚɺಉҰੑɺίϯ ςΩετͷׂʹ͍ͭͯٞ͠·͢ɻԋ៷తٕज़ͱؼೲతٕज़ͷΈ߹ΘͤΛ༻͍ ͯɺ͕ࣝͲͷΑ͏ʹදݱ͞Εɺநग़͞ΕΔ͔Λઆ໌͠·͢ɻφϨοδάϥϑͷ࡞
ɺॆ࣮ɺ࣭ධՁɺચ࿅ɺެ։ͷͨΊͷํ๏Λ·ͱΊ͍ͯ·͢ɻஶ໊ͳΦʔϓϯ φϨοδάϥϑͱΤϯλʔϓϥΠζφϨοδάϥϑͷ֓ཁɺͦΕΒͷΞϓϦέʔ γϣϯɺ͓ΑͼͦΕΒ্͕ड़ͷٕज़ΛͲͷΑ͏ʹ༻͍ͯ͠Δ͔Λઆ໌͠·͢ɻ࠷ ޙʹɺφϨοδάϥϑͷকདྷͷݚڀͷํੑʹ͍ͭͯड़·͢ɻ https://arxiv.org/abs/2003.02320v1
None
ᶆBatch Normalization Biases Deep Residual Networks Towards Shallow Paths όονਖ਼نԽਂ͍ࠩωοτϫʔΫΛઙ͍ܦ࿏ʹภΒͤΔ
όονਖ਼نԽʹෳͷϝϦοτ͕͋Γ·͢ɻόονਖ਼نԽଛࣦϥϯυεέʔϓ ͷ͚݅Λվળ͠ɺڻ͘΄ͲޮՌతͳਖ਼ଇԽΛߦ͍·͢ɻ͔͠͠ɺόονਖ਼نԽ ͷ࠷ॏཁͳརࠩωοτϫʔΫ(Residual Network)ʹ͓͍ͯੜ͡·͢ɻॳظ Խͷࡍɺόονਖ਼نԽɺωοτϫʔΫͷਂ͞ͷฏํࠜʹൺྫͨ͠ਖ਼نԽʹ ΑͬͯɺεΩοϓଓʹର͢ΔࠩذΛμϯεέʔϧ͠·͢ɻ͜ΕʹΑΓɺτ Ϩʔχϯάͷॳظஈ֊Ͱɺਂ͍ਖ਼نԽ͞ΕͨࠩωοτϫʔΫʹΑͬͯܭࢉ͞Ε ͨؔɺྑͳޯΛ࣋ͭઙ͍ύεʹΑͬͯࢧ͞ΕΔ͜ͱ͕อূ͞Ε·͢ɻ͜ ͷಎΛ༻͍ͯɺਖ਼نԽͳ͠Ͱඇৗʹਂ͍ࠩωοτϫʔΫΛ܇࿅Ͱ͖Δ؆୯ͳॳ ظԽεΩʔϜΛ։ൃͨ͠ɻ·ͨɺόονਖ਼نԽΑΓେ͖ͳֶशͰ҆ఆֶͨ͠श ΛՄೳʹ͠·͕͢ɺ͜ͷརେ͖ͳόοναΠζͷֶशΛฒྻԽ͍ͨ͠߹ʹͷ Έ༗༻Ͱ͋Δ͜ͱΛ໌Β͔ʹ͠·ͨ͠ɻզʑͷ݁ՌɺҟͳΔΞʔΩςΫνϟʹ͓ ͚Δόονਖ਼نԽͷརΛ͢Δͷʹཱͪ·͢ɻ https://arxiv.org/abs/2002.10444v1
ᶇAutoML-Zero: Evolving Machine Learning Algorithms From Scratch. AutoML-Zero: εΫϥον͔ΒͷػցֶशΞϧΰϦζϜͷਐԽ ػցֶशͷݚڀɺϞσϧߏֶशํ๏ͳͲଟ໘తʹਐΜͰ͍·͢ɻAutoMLͱͯ͠ΒΕΔ͜
ͷΑ͏ͳݚڀΛࣗಈԽ͠Α͏ͱ͢Δྗ·ͨɺେ͖ͳਐาΛ͖͛ͯ·ͨ͠ɻ͔͠͠ɺ͜ͷਐา ओʹχϡʔϥϧωοτϫʔΫͷΞʔΩςΫνϟʹযΛͯͨͷͰ͋Γɺ͜͜ͰɺϏϧσΟ ϯάϒϩοΫͱͯ͠ߴͳઐՈ͕ઃܭͨ͠ʹґଘ͍ͯ͠·ͨ͠--͋Δ͍ಉ༷ʹ੍ݶͷ͋Δ୳ ࡧۭؒʹґଘ͍ͯ͠·ͨ͠ɻࢲͨͪͷඪɺAutoML͕͞ΒʹਐԽͰ͖Δ͜ͱΛࣔ͢͜ͱͰ͋Γ ·͢ɻզʑɺҰൠతͳݕࡧۭؒΛ௨ͯ͠ਓؒͷόΠΞεΛେ෯ʹݮ͢Δ৽͍͠ϑϨʔϜϫʔΫ Λಋೖ͢Δ͜ͱʹΑͬͯɺ͜ΕΛ࣮ূ͠·͢ɻ͜ͷۭؒͷେ͞ʹ͔͔ΘΒͣɺਐԽత୳ࡧ όοΫϓϩύήʔγϣϯʹΑͬͯ܇࿅͞Εͨ2ͷχϡʔϥϧωοτϫʔΫΛൃݟ͢Δ͜ͱ͕Ͱ͖ ·͢ɻ͜ΕΒͷ୯७ͳχϡʔϥϧωοτϫʔΫɺͦͷޙɺؔ৺ͷ͋ΔλεΫɺྫ͑CIFAR-10ͷ มछͰਐԽͤ͞Δ͜ͱͰɺόΠϦχΞΠϯλϥΫγϣϯɺਖ਼نԽޯɺॏΈฏۉԽͳͲͷτο ϓΞϧΰϦζϜʹݱతͳٕज़͕ݱΕΔ͜ͱͰ͙྇͜ͱ͕Ͱ͖·͢ɻ͞ΒʹɺਐԽΞϧΰϦζϜ ΛҟͳΔλεΫλΠϓʹదԠͤ͞·͢ɻθϩ͔ΒػցֶशΞϧΰϦζϜΛൃݟͨ͜͠ΕΒͷ༧උత ͳޭɺ͜ͷͷ༗ͳ৽͍͠ํੑΛ͍ࣔͯ͠Δͱ৴͍ͯ͡·͢ɻ https://arxiv.org/abs/2003.03384v1
None
None
ᶈHyper-Parameter Optimization: A Review of Algorithms and Applications. ϋΠύʔύϥϝʔλ࠷దԽ. ΞϧΰϦζϜͱΞϓϦέʔγϣϯͷϨϏϡʔ
σΟʔϓχϡʔϥϧωοτϫʔΫ͕։ൃ͞ΕͯҎདྷɺৗੜ׆ʹଟେͳߩݙΛ͖ͯ͠·ͨ͠ɻػ ցֶशɺৗੜ׆ͷ΄΅ͯ͢ͷଆ໘ʹ͓͍ͯɺਓ͕ؒͰ͖ΔҎ্ͷ߹ཧతͳΞυόΠεΛఏ ڙͯ͘͠Ε·͢ɻ͔͠͠ɺ͜ͷΑ͏ͳՌʹ͔͔ΘΒͣɺχϡʔϥϧωοτϫʔΫͷઃܭͱ܇ ࿅ɺґવͱͯ͠ࠔͰ༧ଌෆՄೳͳखॱͰ͢ɻҰൠతͳϢʔβʔͷٕज़తͳᮢΛԼ͛ΔͨΊ ʹɺࣗಈԽ͞ΕͨϋΠύʔύϥϝʔλ࠷దԽ(HPO)ɺֶज़తʹ࢈ۀతʹਓؾͷ͋Δτϐο Ϋͱͳ͍ͬͯ·͢ɻຊจͰɺϋΠύʔύϥϝʔλ࠷దԽʹؔ͢Δ࠷ॏཁͳτϐοΫͷϨ ϏϡʔΛߦ͍·͢ɻ࠷ॳʹɺϞσϧͷֶशߏʹؔ࿈͢ΔओཁͳϋΠύʔύϥϝʔλΛհ ͠ɺͦͷॏཁੑͱҬΛఆٛ͢Δํ๏Λ͡·͢ɻ࣍ʹɺओཁͳ࠷దԽΞϧΰϦζϜͱͦͷద༻ ੑʹযΛͯɺಛʹਂֶशωοτϫʔΫʹର͢Δޮͱਫ਼Λཏ͍ͯ͠·͢ɻ࣍ʹɺHPO ͷͨΊͷओཁͳαʔϏεπʔϧΩοτΛϨϏϡʔ͠ɺ࠷ઌͷݕࡧΞϧΰϦζϜͷରԠɺओ ཁͳਂֶशϑϨʔϜϫʔΫͰͷ࣮ݱੑɺϢʔβ͕ઃܭͨ͠৽͍͠Ϟδϡʔϧͷ֦ுੑΛൺֱ ͠·͢ɻ࠷ޙʹɺHPOΛਂֶशʹద༻ͨ͠߹ͷɺ࠷దԽΞϧΰϦζϜؒͷൺֱɺݶΒ ΕͨܭࢉࢿݯͰͷϞσϧධՁͷͨΊͷஶ໊ͳΞϓϩʔνΛհ͠ɺจΛకΊ͘͘Γ·͢ɻ https://arxiv.org/abs/2003.05689v1
ᶉA Survey on Contextual Embeddings จ຺తΤϯϕοσΟϯάʹؔ͢Δௐࠪ ELMoBERTͳͲͷจ຺ʹج͍ͮͨΤϯϕοσΟϯάɺ Word2VecͷΑ͏ͳάϩʔόϧͳ୯ޠදݱΛ͑ͯɺ෯͍ࣗવݴ ޠॲཧλεΫʹ͓͍ͯըظతͳύϑΥʔϚϯεΛ࣮ݱ͠·͢ɻจ຺ ʹج͍ͮͨΤϯϕοσΟϯάɺ֤୯ޠʹͦͷจ຺ʹج͍ͮͨදݱ
ΛׂΓͯΔ͜ͱͰɺ༷ʑͳจ຺Ͱͷ୯ޠͷ༻Λัଊ͠ɺݴޠؒ Ͱୡ͞ΕΔࣝΛූ߸Խ͠·͢ɻຊௐࠪͰɺطଘͷจ຺ʹجͮ ͘ຒΊࠐΈϞσϧɺݴޠԣஅతͳϙϦάϩοτͷࣄલ܇࿅ɺԼྲྀλ εΫʹ͓͚Δจ຺ʹجͮ͘ຒΊࠐΈͷԠ༻ɺϞσϧѹॖɺϞσϧղ ੳΛϨϏϡʔ͠·͢ɻ https://arxiv.org/abs/2003.07278v1
ᶊReZero is All You Need: Fast Convergence at Large Depth.
ReZero͕͋ΕେৎɻେਂͰͷߴऩଋ σΟʔϓωοτϫʔΫɺྖҬΛ͑ͯେ෯ͳੑೳ্ΛՄೳʹ͠·͕ͨ͠ɺଟ͘ͷ߹ɺফ ࣦ/രൃతͳޯʹ·͞Ε͍ͯ·͢ɻ͜ΕಛʹτϥϯεϑΥʔϚʔΞʔΩςΫνϟʹͯ ·Γɺେنͳσʔληοτܭࢉ༧ࢉ͕ͳ͍ͱ12Λ͑Δਂ͞ͷֶश͕ࠔͰ͢ɻҰൠత ʹɺඇޮͳ৴߸͕σΟʔϓωοτϫʔΫͷֶशΛ્͢Δ͜ͱ͕Θ͔͍ͬͯ·͢ɻτϥ ϯεͰɺϚϧνϔουͷࣗݾҙ͕͜ͷѱ͍৴߸ͷओͳݪҼͱͳ͍ͬͯ·͢ɻਂ৴߸ Λଅਐ͢ΔͨΊʹɺզʑReZeroΛఏҊ͠·͢ɻ͜ΕΞʔΩςΫνϟΛ؆୯ʹมߋͨ͠ ͷͰɺϨΠϠʔ͝ͱʹ1ͭͷՃֶशύϥϝʔλΛ༻ͯ͠ɺҙͷϨΠϠʔΛಉҰੑϚο ϓͱͯ͠ॳظԽ͢ΔͷͰ͢ɻզʑ͜ͷٕज़ΛݴޠϞσϦϯάʹద༻͠ɺ100Ҏ্ͷ ReZero-τϥϯεϑΥʔϚʔωοτϫʔΫΛ؆୯ʹ܇࿅Ͱ͖Δ͜ͱΛൃݟ͠·ͨ͠ɻ12ͷτ ϥϯεϑΥʔϚʔʹద༻͢Δͱɺenwiki8ͰReZero56%͘ऩଋ͠·͢ɻReZero TransformerΛ͑ͯଞͷࠩωοτϫʔΫʹద༻͞Εɺਂ͍શʹଓ͞ΕͨωοτϫʔΫ Ͱ1,500%͘ऩଋ͠ɺCIFAR 10Ͱ܇࿅͞ΕͨResNet-56Ͱ32%͘ऩଋ͠·͢ɻ https://arxiv.org/abs/2003.04887v1
ᶋLagrangian Neural Networks ϥάϥϯδϡχϡʔϥϧωοτϫʔΫ ੈքͷਖ਼֬ͳϞσϧɺͦͷجૅͱͳΔରশੑͷ֓೦ʹج͍ͮͯߏங͞Ε͍ͯ·͢ɻཧֶ Ͱɺ͜ΕΒͷରশੑΤωϧΪʔӡಈྔͳͲͷอଘଇʹରԠ͍ͯ͠·͢ɻ͔͠͠ɺ χϡʔϥϧωοτϫʔΫϞσϧཧֶͰͷར༻͕૿͍͑ͯΔʹ͔͔ΘΒͣɺ͜ΕΒ ͷରশੑΛֶश͢Δͷʹۤ࿑͍ͯ͠·͢ɻຊจͰɺχϡʔϥϧωοτϫʔΫΛ༻͍ͯ ҙͷϥάϥϯδΞϯΛύϥϝʔλԽͰ͖ΔϥάϥϯδΞϯχϡʔϥϧωοτϫʔΫ(LNN)Λ ఏҊ͠·͢ɻϋϛϧτχΞϯΛֶश͢ΔϞσϧͱରরతʹɺLNNਖ਼४࠲ඪΛඞཁͱ͠ͳ
͍ͨΊɺਖ਼४ӡಈྔ͕ෆ໌Ͱ͋ͬͨΓɺܭࢉ͕ࠔͳ߹ʹ༗ޮͰ͢ɻ͜Ε·ͰͷΞϓϩʔ νͱҟͳΓɺզʑͷख๏ֶश͞ΕͨΤωϧΪʔͷؔܗࣜΛ੍ݶͤͣɺ༷ʑͳλεΫͷ ͨΊͷΤωϧΪʔอଘϞσϧΛੜ͠·͢ɻզʑɺೋॏৼΓࢠͱ૬ରతཻࢠͰզʑͷΞ ϓϩʔνΛςετ͠ɺϕʔεϥΠϯΞϓϩʔνͰࢄҳ͕ൃੜ͢ΔΤωϧΪʔอଘΛ࣮ূ ͠ɺϋϛϧτχΞϯΞϓϩʔνͰࣦഊ͢Δਖ਼४࠲ඪͷͳ͍૬ରੑཧΛϞσϧԽ͠·͢ɻ ࠷ޙʹɺϥάϥϯδϡάϥϑωοτϫʔΫΛ༻͍ͯɺ͜ͷϞσϧ͕ͲͷΑ͏ʹάϥϑ࿈ଓ ܥʹద༻Ͱ͖Δ͔Λࣔ͠ɺ1࣍ݩಈํఔ্ࣜͰ࣮ূ͠·͢ɻ https://arxiv.org/abs/2003.04630v1
None
ᶌSet-Structured Latent Representations ू߹ߏԽજࡏදݱ ߏԽ͞Ε͍ͯͳ͍σʔλɺγʔϯͷΠϝʔδͷதͷΦϒδΣΫτͷΑ͏ ʹɺજࡏతͳߏཁૉͷߏΛ͍࣋ͬͯΔ͜ͱ͕ଟ͍Ͱ͢ɻ͜ͷΑ͏ͳঢ়گͰ ɺແடংͳίϨΫγϣϯ set ͕જࡏతͳߏͱͳΓ·͢ɻ͔͠͠ɼ͜ͷΑ ͏ͳදݱΛσʔλ͔Βֶश͢Δ͜ͱɼࢄతͰແடংͳߏͷͨΊࠔ
Ͱ͢ɻ ͜͜Ͱɼू߹ߏΛ࣋ͭજࡏදݱΛඍՄೳʹֶश͢ΔͨΊͷϑϨʔ ϜϫʔΫΛ։ൃ͠·͢ɻ͜ͷϑϨʔϜϫʔΫΛ༻͍ͯɺը૾ͳͲͷσʔλΛࣗ વʹղऍՄೳͰҙຯͷ͋Δͷू߹ʹղ͢Δํ๏Λࣔ͠ɺطଘͷख๏Ͱ ؔ࿈͢ΔߏΛదʹΓ͢͜ͱ͕Ͱ͖ͳ͍͜ͱΛࣔ͠·͢ɻ·ͨɺզʑͷ ํ๏Λɺηοτݻ༗ͷૢ࡞Λ༻͢ΔηοτϚονϯάͷΑ͏ͳԼྲྀͷλε Ϋʹ·Ͱ֦ு͢Δํ๏ࣔ͠·͢ɻզʑͷίʔυͪ͜Βͷhttps URL͔Βೖख ՄೳͰ͢ɻ https://arxiv.org/abs/2003.04448v1
None
Arxiv Sanity Top hype: Best10
ᶃLearning to Simulate Complex Physics with Graph Networks άϥϑωοτϫʔΫΛ༻͍ͨෳࡶͳཧֶͷγϛϡϨʔγϣϯͷֶश ͜͜ͰɺγϛϡϨʔγϣϯֶशͷͨΊͷҰൠతͳϑϨʔϜϫʔΫΛఏࣔ͠ɺྲྀମɺ߶ମɺ
มܗՄೳͳ࣭͕૬ޓʹ࡞༻͍ͯ͠Δ༷ʑͳཧྖҬͰ࠷ઌͷੑೳΛൃش͢Δ୯ҰϞσϧ ͷ࣮Λఏڙ͠·͢ɻզʑͷϑϨʔϜϫʔΫʢզʑ͕ʮάϥϑωοτϫʔΫϕʔεγϛϡϨʔ λʯʢGNSʣͱݺͿʣɺཧγεςϜͷঢ়ଶΛཻࢠͰදݱ͠ɺάϥϑͷϊʔυͱͯ͠දݱ ͠ɺֶश͞ΕͨϝοηʔδύογϯάΛհͯ͠μΠφϛΫεΛܭࢉ͠·͢ɻͦͷ݁Ռɺզʑͷ ϞσϧɺֶशதͷઍݸͷύʔςΟΫϧΛ༻͍ͨγϯάϧλΠϜεςοϓͷ༧ଌ͔Βɺҟͳ Δॳظ݅ɺઍݸͷλΠϜεςοϓɺࢼݧ࣌ʹগͳ͘ͱҰܻҎ্ͷύʔςΟΫϧΛ༻͍ ͨ༧ଌͱҰൠԽͰ͖Δ͜ͱ͕ࣔ͞Ε·ͨ͠ɻզʑͷϞσϧɺ༷ʑͳධՁࢦඪͷϋΠύʔ ύϥϝʔλͷબʹରͯ͠ϩόετͰͨ͠ɻظతͳੑೳͷओͳܾఆཁҼɺϝοηʔδ௨ աεςοϓͷͱɺ܇࿅σʔλΛϊΠζͰഁյ͢Δ͜ͱʹΑΔΤϥʔͷੵΛܰݮ͢Δ͜ͱ Ͱͨ͠ɻզʑͷGNSϑϨʔϜϫʔΫɺ͜Ε·ͰͰ࠷ਖ਼֬ͳ൚༻ֶशཧγϛϡϨʔλͰ ͋Γɺෳࡶͳॱํ͓ΑͼٯํͷΛ෯͘ղ͘͜ͱ͕ظ͞Ε͍ͯ·͢ɻ
None
None
ᶄA Primer in BERTology: What we know about how BERT
works BERTֶ ͷೖॻɻBERT͕ͲͷΑ͏ʹػೳ͢Δ͔ʹ͍͍ͭͯͬͯΔ͜ͱ τϥϯεϑΥʔϚʔܕϞσϧ(Transformer-based models)ݱ ࡏɺNLPͰ͘ΘΕ͍ͯ·͕͢ɺͦͷ෦ͷΈʹ͍ͭ ͯ·ͩ͋·Γཧղ͞Ε͍ͯ·ͤΜɻຊߘͰɺ༗໊ͳBERT Ϟσϧ(Devlin et al. 2019)ʹ͍ͭͯɺ40Ҏ্ͷղੳݚڀΛ߹ ͯ͠ɺ͜Ε·ͰʹΒΕ͍ͯΔ͜ͱΛઆ໌͠·͢ɻ·ͨɺఏ Ҋ͞Ε͍ͯΔϞσϧͷमਖ਼ͱͦͷ܇࿅ϨδʔϜͷ֓ཁΛઆ໌ ͠·͢ɻͦͯ͠ɺ͞ΒͳΔݚڀͷํੑΛ֓આ͠·͢ɻ https://arxiv.org/abs/2002.12327v1 ॏෳ
ᶅLearning to Shade Hand-drawn Sketches खඳ͖εέονͷӄӨΛֶͿ ઢըεέονͱর໌ํͷϖΞ͔Βɺৄࡉ͔ͭਖ਼֬ͳܳज़తͳӨΛੜ͢ΔͨΊͷશࣗಈ ख๏Λఏࣔ͠·͢ɻ·ͨɺઢըͱӨͷϖΞ͔Βɺর໌ํͱλά͚͞Εͨ1,000ྫͷ৽ ͍͠σʔληοτΛఏڙ͠·͢ɻڻ͖͘͜ͱʹɼੜ͞ΕͨӨɼεέον͞Εͨγʔ ϯͷجૅͱͳΔ3DߏΛૉૣ͑͘·͢ɽͦͷ݁ՌɺզʑͷΞϓϩʔνʹΑͬͯੜ͞
ΕͨӨɺ༻͢Δ͜ͱɺΞʔςΟετͷͨΊͷ༏Εͨग़ൃͱͯ͠༻͢Δ͜ͱ Ͱ͖·͢ɻզʑ͕ఏҊ͢ΔσΟʔϓϥʔχϯάωοτϫʔΫ͕ɺखඳ͖ͷεέονΛड ͚औΓɺજࡏۭؒʹ3DϞσϧΛߏங͠ɺͦͷ݁Ռͱͯ͠ੜ͞ΕͨӨΛϨϯμϦϯά͢ Δ͜ͱΛ࣮ূ͍ͯ͠·͢ɻੜ͞ΕͨӨɺखඳ͖ͷઢͱͦͷԼͷ3࣍ݩۭؒΛଚॏ͠ɺ ࣗӨޮՌͷΑ͏ͳચ࿅͞Εͨਖ਼֬ͳσΟςʔϧΛؚΜͰ͍·͢ɻ͞Βʹɺੜ͞Εͨγϟ υʹɺैདྷͷ3DϨϯμϦϯάख๏Ͱ࣮ݱͰ͖ͳ͔ͬͨɺϦϜϥΠςΟϯάόο ΫϥΠςΟϯά͔ΒݱΕΔϋϩʔͳͲͷܳज़తͳޮՌؚ͕·Ε͍ͯ·͢ɻ https://arxiv.org/abs/2002.11812v1
None
None
ᶆStyleGAN2 Distillation for Feed-forward Image Manipulation. StyleGAN2 ϑΟʔυϑΥϫʔυը૾ૢ࡞ͷͨΊͷৠཹ StyleGAN2ɺϦΞϧͳը૾Λੜ͢ΔͨΊͷ࠷ઌͷωοτϫʔΫͰ͢ɻStyleGAN2 ɺજࡏۭؒͰͷํੑ͕ҟͳΔΑ͏ʹ໌ࣔతʹ܇࿅͞Ε͓ͯΓɺજࡏҼࢠΛมԽͤ͞
ͯޮతͳը૾ૢ࡞ΛՄೳʹ͠·͢ɻطଘͷը૾Λฤू͢Δʹɺ༩͑ΒΕͨը૾Λ StyleGAN2ͷજࡏۭؒʹຒΊࠐΉඞཁ͕͋Γ·͢ɻόοΫϓϩύήʔγϣϯΛ༻͍ͨજࡏ ίʔυ࠷దԽɺ࣮ੈքͷը૾ͷ࣭తຒΊࠐΈʹҰൠతʹ༻͍ΒΕ͍ͯ·͕͢ɺଟ͘ͷΞ ϓϦέʔγϣϯͰ๏֎ʹ͕͔͔࣌ؒΓ·͢ɻզʑɺStyleGAN2ͷಛఆͷը૾ૢ࡞Λɺ ରʹͳֶͬͯश͞Εͨը૾ରը૾ωοτϫʔΫʹৠཹ͢Δํ๏ΛఏҊ͢Δɻ݁Ռͱͯ͠ಘ ΒΕΔύΠϓϥΠϯɺطଘͷGANͷସͱͯ͠ɺରʹͳ͍ͬͯͳ͍σʔλΛ༻ֶ͍ͯश ͞Ε·͢ɻຊݚڀͰɺਓؒͷإͷม݁ՌΛఏڙ͠·͢ɿੑผަɺՃྸɾएฦΓɺε λΠϧมɺը૾ϞʔϑΟϯάɻզʑͷख๏Λ༻͍ͨੜͷ࣭ɺ͜ΕΒͷಛఆͷλεΫ ʹ͓͍ͯɺStyleGAN2όοΫϓϩύήʔγϣϯݱࡏͷ࠷ઌͷख๏ͱಉͰ͋Δ͜ͱΛ ࣔ͠·͢ɻ https://arxiv.org/abs/2003.03581v1
None
None
None
ᶇAutoML-Zero: Evolving Machine Learning Algorithms From Scratch. AutoML-Zero: εΫϥον͔ΒͷػցֶशΞϧΰϦζϜͷਐԽ ػցֶशͷݚڀɺϞσϧߏֶशํ๏ͳͲଟ໘తʹਐΜͰ͍ΔɻAutoMLͱͯ͠ΒΕΔ͜ͷ
Α͏ͳݚڀΛࣗಈԽ͠Α͏ͱ͢Δྗ·ͨɺେ͖ͳਐาΛ͖͛ͯ·ͨ͠ɻ͔͠͠ɺ͜ͷਐา ओʹχϡʔϥϧωοτϫʔΫͷΞʔΩςΫνϟʹযΛͯͨͷͰ͋Γɺ͜͜ͰɺϏϧσΟϯ άϒϩοΫͱͯ͠ߴͳઐՈ͕ઃܭͨ͠ʹґଘ͍ͯ͠·ͨ͠--͋Δ͍ಉ༷ʹ੍ݶͷ͋Δ୳ࡧ ۭؒʹґଘ͍ͯ͠·ͨ͠ɻࢲͨͪͷඪɺAutoML͕͞ΒʹਐԽͰ͖Δ͜ͱΛࣔ͢͜ͱͰ͋Γ· ͢ɻզʑɺҰൠతͳݕࡧۭؒΛ௨ͯ͠ਓؒͷόΠΞεΛେ෯ʹݮ͢Δ৽͍͠ϑϨʔϜϫʔΫΛ ಋೖ͢Δ͜ͱʹΑͬͯɺ͜ΕΛ࣮ূ͠·͢ɻ͜ͷۭؒͷେ͞ʹ͔͔ΘΒͣɺਐԽత୳ࡧόο ΫϓϩύήʔγϣϯʹΑͬͯ܇࿅͞Εͨ2ͷχϡʔϥϧωοτϫʔΫΛൃݟ͢Δ͜ͱ͕Ͱ͖· ͢ɻ͜ΕΒͷ୯७ͳχϡʔϥϧωοτϫʔΫɺͦͷޙɺؔ৺ͷ͋ΔλεΫɺྫ͑CIFAR-10ͷม छͰਐԽͤ͞Δ͜ͱͰɺόΠϦχΞΠϯλϥΫγϣϯɺਖ਼نԽޯɺॏΈฏۉԽͳͲͷτοϓ ΞϧΰϦζϜʹݱతͳٕज़͕ݱΕΔ͜ͱͰ͙྇͜ͱ͕Ͱ͖·͢ɻ͞ΒʹɺਐԽΞϧΰϦζϜΛ ҟͳΔλεΫλΠϓʹదԠͤ͞·͢ɻθϩ͔ΒػցֶशΞϧΰϦζϜΛൃݟͨ͜͠ΕΒͷ༧උతͳ ޭɺ͜ͷͷ༗ͳ৽͍͠ํੑΛ͍ࣔͯ͠Δͱ৴͍ͯ͡·͢ɻ https://arxiv.org/abs/2003.03384v1 ॏෳ
ᶈLagrangian Neural Networks ϥάϥϯδϡχϡʔϥϧωοτϫʔΫ ੈքͷਖ਼֬ͳϞσϧɺͦͷجૅͱͳΔରশੑͷ֓೦ʹج͍ͮͯߏங͞Ε͍ͯ·͢ɻཧֶ Ͱɺ͜ΕΒͷରশੑΤωϧΪʔӡಈྔͳͲͷอଘଇʹରԠ͍ͯ͠·͢ɻ͔͠͠ɺ χϡʔϥϧωοτϫʔΫϞσϧཧֶͰͷར༻͕૿͍͑ͯΔʹ͔͔ΘΒͣɺ͜ΕΒ ͷରশੑΛֶश͢Δͷʹۤ࿑͍ͯ͠·͢ɻຊจͰɺχϡʔϥϧωοτϫʔΫΛ༻͍ͯ ҙͷϥάϥϯδΞϯΛύϥϝʔλԽͰ͖ΔϥάϥϯδΞϯχϡʔϥϧωοτϫʔΫ(LNN)Λఏ Ҋ͠·͢ɻϋϛϧτχΞϯΛֶश͢ΔϞσϧͱରরతʹɺLNNਖ਼४࠲ඪΛඞཁͱ͠ͳ͍
ͨΊɺਖ਼४ӡಈྔ͕ෆ໌Ͱ͋ͬͨΓɺܭࢉ͕ࠔͳ߹ʹ༗ޮͰ͢ɻ͜Ε·ͰͷΞϓϩʔν ͱҟͳΓɺզʑͷख๏ֶश͞ΕͨΤωϧΪʔͷؔܗࣜΛ੍ݶͤͣɺ༷ʑͳλεΫͷͨ ΊͷΤωϧΪʔอଘϞσϧΛੜ͠·͢ɻզʑɺೋॏৼΓࢠͱ૬ରతཻࢠͰզʑͷΞϓ ϩʔνΛςετ͠ɺϕʔεϥΠϯΞϓϩʔνͰࢄҳ͕ൃੜ͢ΔΤωϧΪʔอଘΛ࣮ূ͠ɺ ϋϛϧτχΞϯΞϓϩʔνͰࣦഊ͢Δਖ਼४࠲ඪͷͳ͍૬ରੑཧΛϞσϧԽ͠·͢ɻ࠷ޙ ʹɺϥάϥϯδϡάϥϑωοτϫʔΫΛ༻͍ͯɺ͜ͷϞσϧ͕ͲͷΑ͏ʹάϥϑ࿈ଓܥʹ ద༻Ͱ͖Δ͔Λࣔ͠ɺ1࣍ݩಈํఔ্ࣜͰ࣮ূ͠·͢ɻ https://arxiv.org/abs/2003.04630v1 ॏෳ
ᶉMLIR: A Compiler Infrastructure for the End of Moore's Law.
MLIR: ϜʔΞͷ๏ଇͷऴᖼͷͨΊͷίϯύΠϥج൫ ຊݚڀͰɺ࠶ར༻ՄೳͰ֦ுՄೳͳίϯύΠϥج൫Λߏங͢ΔͨΊͷ৽͍͠ΞϓϩʔνͰ͋Δ MLIRΛհ͠·͢ɻMLIRͷతɺιϑτΣΞͷஅยԽʹରॲ͠ɺҟछϋʔυΣΞͷίϯύ ΠϧΛվળ͠ɺυϝΠϯݻ༗ͷίϯύΠϥΛߏங͢ΔͨΊͷίετΛେ෯ʹݮ͠ɺطଘͷίϯύ ΠϥΛଓ͢ΔͷΛॿ͚Δ͜ͱͰ͢ɻMLIRɺҟͳΔநϨϕϧͰͷίʔυੜثɺτϥϯε ϨʔλɺΦϓςΟϚΠβͷઃܭͱ࣮Λ༰қʹ͠ɺ·ͨɺΞϓϦέʔγϣϯυϝΠϯɺϋʔυΣ Ξλʔήοτɺ࣮ߦڥʹ·͕ͨΔίʔυੜثɺτϥϯεϨʔλɺΦϓςΟϚΠβͷઃܭͱ࣮ Λ༰қʹ͠·͢ɻຊݚڀͰɼ(1)֦ுͱਐԽͷͨΊʹߏங͞ΕͨݚڀՌͱͯ͠ͷMLIRʹ͍ͭ ͯٞ͠ɼઃܭɼηϚϯςΟΫεɼ࠷దԽ༷ɼγεςϜɼΤϯδχΞϦϯάʹ͓͍ͯɼ͜ͷ৽͠ ͍ઃܭϙΠϯτ͕ͨΒ͢՝ͱػձΛ໌Β͔ʹ͠·͢ɽ(2) ίϯύΠϥߏஙίετΛݮ͢ΔҰ ൠԽ͞ΕͨΠϯϑϥͱͯ͠ͷMLIRͷධՁ-ଟ༷ͳϢʔεέʔεΛհ͠ɼকདྷͷϓϩάϥϛϯάݴ ޠɼίϯύΠϥɼ࣮ߦڥɼίϯϐϡʔλΞʔΩςΫνϟͷݚڀͱڭҭͷػձΛࣔ͠·͢ɽ·ͨɼ MLIRͷཧతࠜڌɼಠࣗͷઃܭݪཧɼߏɼҙຯʹ͍ͭͯհ͍ͯ͠·͢ɽ https://arxiv.org/abs/2002.11054v2 લճͱॏෳ
ᶊSLIDE : In Defense of Smart Algorithms over Hardware Acceleration
for Large-Scale Deep Learning Systems. SLIDE : େنσΟʔϓϥʔχϯάγεςϜͷͨΊͷϋʔυΣΞΞΫηϥϨʔ γϣϯΑΓεϚʔτΞϧΰϦζϜΛकΔͨΊʹɻ σΟʔϓϥʔχϯάʢDLʣΞϧΰϦζϜɺݱͷػցֶशγεςϜͷத৺తͳযͱͳ͍ͬͯ·͢ɻσʔ λྔͷ૿Ճʹ͍ɺԯݸͷύϥϝʔλΛ࣋ͭେنͳχϡʔϥϧωοτϫʔΫΛ܇࿅ͯ͠ɺ͜ΕΒͷσʔ λྔΛهԱ͠ɺ࠷ઌͷਫ਼ΛಘΔͷʹेͳ༰ྔΛҡ࣋͢Δ͜ͱ͕Ұൠతʹͳ͖͍ͬͯͯ·͢ɻେنͳ Ϟσϧͱσʔλʹؔ࿈͢ΔߴֹͳܭࢉΛճආ͢ΔͨΊʹɺίϛϡχςΟͰϞσϧֶशͷͨΊͷઐ༻ϋʔυ ΣΞͷࢿ͕૿Ճ͍ͯ͠·͢ɻ͔͠͠ɺಛघͳϋʔυΣΞߴՁͰ͋Γɺଟ͘ͷλεΫʹҰൠԽ͢Δ ͜ͱࠔͰ͢ɻΞϧΰϦζϜͷਐาɺNVIDIA-V100 GPUͷΑ͏ͳڧྗͳϋʔυΣΞʹରͯ͠త ͳ༏ҐੑΛࣔ͢͜ͱ͕Ͱ͖·ͤΜͰͨ͠ɻ͜ͷจྫ֎Λఏڙ͠·͢ɻզʑɺεϚʔτͳϥϯμϜԽΞ ϧΰϦζϜͱϚϧνίΞฒྻԽͱϫʔΫϩʔυ࠷దԽΛಠࣗʹ༥߹ͤͨ͞SLIDE (Sub-LInear Deep learning Engine)ΛఏҊ͠·͢ɻSLIDECPUͷΈΛ༻͢Δ͜ͱͰɺ࠷దԽ͞ΕͨTensorflow(TF)ͷ࣮ ΛGPU্Ͱ࣮ߦͨ͠߹ʹൺͯɺֶशͱਪͷ྆ํͷܭࢉྔΛେ෯ʹݮ͢Δ͜ͱ͕Ͱ͖·͢ɻۀքن ͷਪσʔληοτΛ༻͍ͨධՁͰɺ44ίΞͷCPUͰSLIDEΛ༻ͨ͠߹ɺTesla V100ͰTFΛ༻͠ ֶͯशͨ͠߹ͱൺֱͯ͠ɺҙͷਫ਼ϨϕϧͰ3.5ഒҎ্(1࣌ؒର3.5࣌ؒ)ͷͰֶश͕ՄೳͰ͋Δ͜ ͱ͕ࣔ͞Ε͍ͯ·͢ɻಉ͡CPUͷϋʔυΣΞ্ͰɺSLIDETFΑΓ10ഒҎ্ߴͰ͢ɻ࠶ݱੑͷͨΊͷ ίʔυͱεΫϦϓτΛఏڙ͠·͢ɻ https://arxiv.org/abs/1903.03129v2
ᶋAn Empirical Evaluation of Generic Convolutional and Recurrent Networks for
Sequence Modeling. γʔέϯεϞσϦϯάͷͨΊͷ൚༻ίϯϘϦϡʔγϣϯɾϦΧϨϯτωοτϫʔ Ϋͷ࣮ূతධՁ ΄ͱΜͲͷਂֶशͷઐՈʹͱͬͯɺγʔέϯεϞσϦϯάϦΧϨϯτωοτϫʔΫͱಉٛͰ ͢ɻ ͔͠͠ɺ࠷ۙͷ݁ՌɺԻ߹ػց༁ͳͲͷλεΫʹ͓͍ͯɺίϯϘϦϡʔγϣϯɾ ΞʔΩςΫνϟ͕ϦΧϨϯτɾωοτϫʔΫΑΓ༏Ε͍ͯΔ͜ͱΛ͍ࣔͯ͠·͢ɻ ৽͍͠γʔέ ϯεϞσϦϯάͷλεΫσʔληοτ͕ൃੜͨ͠߹ɺͲͷΞʔΩςΫνϟΛ༻͖͔͢ʁ զʑɺγʔέϯεϞσϦϯάͷͨΊͷҰൠతͳΈࠐΈ͓ΑͼϦΧϨϯτΞʔΩςΫνϟͷମܥ తͳධՁΛߦ͍ͬͯ·͢ɻ ͜ΕΒͷϞσϧɺϦΧϨϯτɾωοτϫʔΫͷϕϯνϚʔΫʹҰൠత ʹ༻͞Ε͍ͯΔ෯͍ඪ४λεΫͰධՁ͞Ε͍ͯ·͢ɻ ͦͷ݁Ռɺ୯७ͳΈࠐΈΞʔΩςΫ νϟɺଟ༷ͳλεΫσʔληοτʹ͓͍ͯɺLSTMͷΑ͏ͳਖ਼نͷϦΧϨϯτωοτϫʔΫΑ Γ༏Ε͍ͯΔ͜ͱ͕ࣔ͞ΕɺҰํͰɺΑΓ͍༗ޮϝϞϦΛࣔ͠·ͨ͠ɻ զʑɺγʔέϯε ϞσϦϯάͱϦΧϨϯτωοτϫʔΫͷؒͷڞ௨ͷؔ࿈ੑΛ࠶ߟ͖͢Ͱ͋ΓɺΈࠐΈωοτ ϫʔΫγʔέϯεϞσϦϯάλεΫͷࣗવͳग़ൃͱΈͳ͖͢Ͱ͋Δͱ͚݁·ͨ͠ɻ ؔ ࿈͢Δ࡞ۀΛࢧԉ͢ΔͨΊʹɺզʑ͜ͷhttp URLͰίʔυΛར༻Ͱ͖ΔΑ͏ʹ͠·ͨ͠ɻ https://arxiv.org/abs/1803.01271v2
TCN Լ:ೖྗ=࣌ܥྻσʔλ(ӈ͔Βॱ൪ʹ0...nඵޙ) ্:ग़ྗ=nඵؒͷ݁Ռ(ӈ͔Βॱʹɺnඵલɺ2nඵલ…) https://github.com/philipperemy/keras-tcn
TCNΛॏͶͨྫ https://github.com/philipperemy/keras-tcn
CNN(TCN) vs RNN܈
ᶌSparse Orthogonal Variational Inference for Gaussian Processes. ΨεաఔͷͨΊͷૄͳަมྔਪ. ༠ಋΛ༻͍ͨΨεաఔͷεύʔεมۙࣅͷ৽͍͠ղऍΛհ͠ ·͢ɻ͜ΕɺΨεաఔΛ2ͭͷಠཱͨ͠աఔͷͱͯ͠ղ͢Δ
͜ͱʹج͍͍ͮͯ·͢ɻ1ͭ༠ಋͷ༗ݶجఈʹ·͕͓ͨͬͯΓɺ ͏1ͭΓͷมಈΛัଊ͠·͢ɻ͜ͷఆࣜԽ͕طଘͷۙࣅΛճ෮ ͢Δͱಉ࣌ʹɺݶքͷΑΓݫ͍͠Լݶͱ৽͍֬͠తมਪ ΞϧΰϦζϜΛಘΔ͜ͱ͕Ͱ͖Δ͜ͱΛࣔ͠·͢ɻඪ४ճؼ͔Βʢਂ ͍ʣΈࠐΈΨεաఔΛ༻͍ͨଟΫϥεྨ·Ͱɺ͍͔ͭ͘ͷΨ εաఔϞσϧʹ͓͍ͯ͜ΕΒͷΞϧΰϦζϜͷޮੑΛ࣮ূ͠ɺ७ਮ ʹGPϕʔεͷϞσϧͷதͰCIFAR-10Ͱͷ࠷৽ͷ݁ՌΛใࠂ͠·͢ɻ https://arxiv.org/abs/1910.10596v3
ΨεաఔϞσϧͷ༧ଌྫ • ☓: ࣮ଌσʔλ • ೱ੨ઢ: ࠷֬ͷߴ͍༧ • ബ੨ଳ:ى͜Γ͏ΔՄೳੑͷߴ͍ൣғ (=σ2)
http://machine-learning.hatenablog.com/entry/2018/01/13/142612
My favorite
ᶆStyleGAN2 Distillation for Feed-forward Image Manipulation. StyleGAN2 ϑΟʔυϑΥϫʔυը૾ૢ࡞ͷͨΊͷৠཹ StyleGAN2ɺϦΞϧͳը૾Λੜ͢ΔͨΊͷ࠷ઌͷωοτϫʔΫͰ͢ɻStyleGAN2 ɺજࡏۭؒͰͷํੑ͕ҟͳΔΑ͏ʹ໌ࣔతʹ܇࿅͞Ε͓ͯΓɺજࡏҼࢠΛมԽͤ͞
ͯޮతͳը૾ૢ࡞ΛՄೳʹ͠·͢ɻطଘͷը૾Λฤू͢Δʹɺ༩͑ΒΕͨը૾Λ StyleGAN2ͷજࡏۭؒʹຒΊࠐΉඞཁ͕͋Γ·͢ɻόοΫϓϩύήʔγϣϯΛ༻͍ͨજࡏ ίʔυ࠷దԽɺ࣮ੈքͷը૾ͷ࣭తຒΊࠐΈʹҰൠతʹ༻͍ΒΕ͍ͯ·͕͢ɺଟ͘ͷΞ ϓϦέʔγϣϯͰ๏֎ʹ͕͔͔࣌ؒΓ·͢ɻզʑɺStyleGAN2ͷಛఆͷը૾ૢ࡞Λɺ ରʹͳֶͬͯश͞Εͨը૾ରը૾ωοτϫʔΫʹৠཹ͢Δํ๏ΛఏҊ͢Δɻ݁Ռͱͯ͠ಘ ΒΕΔύΠϓϥΠϯɺطଘͷGANͷସͱͯ͠ɺରʹͳ͍ͬͯͳ͍σʔλΛ༻ֶ͍ͯश ͞Ε·͢ɻຊݚڀͰɺਓؒͷإͷม݁ՌΛఏڙ͠·͢ɿੑผަɺՃྸɾएฦΓɺε λΠϧมɺը૾ϞʔϑΟϯάɻզʑͷख๏Λ༻͍ͨੜͷ࣭ɺ͜ΕΒͷಛఆͷλεΫ ʹ͓͍ͯɺStyleGAN2όοΫϓϩύήʔγϣϯݱࡏͷ࠷ઌͷख๏ͱಉͰ͋Δ͜ͱΛ ࣔ͠·͢ɻ https://arxiv.org/abs/2003.03581v1
աڈจ: StyleGAN2 ͱ • ελΠϧΛϊΠζͱͯ͠ϥϯμϜͳը૾ੜ͢ Δ࠷৽ͷGAN • ελΠϧʹը෩ɾྠֲɾ͖ɾͷ৭ܗɾ ͷ৭ܗͳͲԿͰ
StyleGAN2ͷྫ1 https://arxiv.org/abs/1912.04958
StyleGAN2ͷྫ2 https://ai-scholar.tech/articles/others/stylegan-fashion-339
ࠓճͷจ:StyleGAN2ͷৠཹ • StyleGAN2 ͚ͩͰ͍͕͢͝ɺৠཹʢબผɾ࠶ ֶशʣ͢Δ͜ͱͰྸΛएฦΒͤͨΓՃྸͨ͠ ΓɺੑผΛΑΓࣗવʹม͑ͨإࣸਅΛੜ͢Δ͜ ͱ͕Ͱ͖ΔΑ͏ʹͳͬͨͱ͍͏จɻ • ԽɾएฦΓੑผͷೖΕସ͑ͷֶशσʔληο τΛ࡞Δͷݱ࣮తʹෆՄೳ͕ͩɺͲ͏ͬͯͦ
ΕΛΓӽ͑ͯσʔλΛ࡞͔ͬͨʁ͕ϙΠϯτɻ
StyleGAN2ৠཹͷՌྫ • ੑస • एฦΓɾԽ
StyleGAN2ͷৠཹɾֶश • 1.StyleGAN2͔Βֶशݩσʔλੜ • 2.ݩσʔλͷྸɾੑผม • 3.ผϞσϧͰ࠶ֶशͯ͠ར༻
step1 : ֶशݩσʔλੜ (StyleGAN2+إྨϞσϧ)
step1 : ֶशݩσʔλੜ (StyleGAN2+إྨϞσϧ) 1.StyleGAN2Ͱ ϥϯμϜʹը૾Λ࡞Δ ੜը૾
step1 : ֶशݩσʔλੜ (StyleGAN2+إྨϞσϧ) إը૾ྨϞσϧͰ ը૾ఆ͢Δ ৴༻ ྸ ੑผ
step1 : ֶशݩσʔλੜ (StyleGAN2+إྨϞσϧ) X4UZMF("/ͷ ը૾ੜͷॏΈσʔλ ੜը૾ͷಛ
step1 : ֶशݩσʔλੜ (StyleGAN2+إྨϞσϧ) • e ॏΈ ৴༻ ྸ ੑผΛ̍ͱ͢Δ
ˠ͜ΕΛͨ͘͞Μ࡞ͬͯɺ৴༻ͷߴ͍ͷΛσʔληοτʹɻ
ৠཹstep2:ݩσʔλͷྸɾੑผม (ྫྸ͕ͩɺੑผಉ͡) 1. ྸผʹॏΈσʔλ܈ͷதԝΛܭࢉͯ͠ɺ ྸ͕มΘͬͨͱ͖ͷॏΈͷࠩΛٻΊΔɻ 2. ݩͷॏΈσʔλʹࠩΛͨ͠ΓҾ͍ͨΓ͠ ͯԽɾएฦΓը૾Λύλʔϯ࡞Δ 3. ΑΓࣗવͳͷΛϐοΫΞοϓͨ͠ΒಉҰਓ
ͷԽɾएฦΓͷڭࢣσʔληοτ͕
࡞ͬͨॏΈσʔλΛ Ճݮͯ͠ը૾ੜ ͦΕͬΆ͍ͷΛબͿ
step3: ࠶ֶशɾར༻ pix2pixHD ͰɺಉҰਓͰྸɾੑผΛม͑ͨ ͷͷΈ߹ΘͤΛͦΕͧΕݸผʹֶश͢Δ ͋ͱՃྸ͚ͨ͠ΕɺՃྸֶशͨ͠pix2pixϞ σϧɺੑస͚ͨ͠Εੑసͨ͠pix2pixϞσ ϧɺͱ͍ͬͨܗͰը૾ੜ͢Δ͚ͩɻ
None
None
None
͜ͷจΛબΜͩཧ༝ γϯϓϧͳख๏ΛΈ߹Θͤͯطଘͷٕज़Λ ͍͑ͯΔͷ͕໘ന͍ɻ ৠཹͰݱ࣮తʹ༻ҙࠔͳσʔληοτΛ࡞ͬ ͍ͯΔɻଞͷͳʹ͔ʹԠ༻Ͱ͖Δ͔ɻ
Special Thanks
DeepL Translator (deepl.com) https://www.deepl.com/en/translator