Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Vision Proで広告フリーな世界を実現したい
Search
Shuhei Shitamori
December 12, 2024
Programming
0
81
Vision Proで広告フリーな世界を実現したい
MIERUNE BBQ #14発表資料
Shuhei Shitamori
December 12, 2024
Tweet
Share
More Decks by Shuhei Shitamori
See All by Shuhei Shitamori
Apple SharePlayで 非対称クロスプレイ チャレンジ (SharePlay使ってみた編)
shitamori1272
0
20
Foundation Models触ってみた - iPhone Dev Sapporo — WWDC25 Recap
shitamori1272
0
80
Wallet API, Verifier APIで実現するIDカード on iPhoneの世界
shitamori1272
1
2.7k
Other Decks in Programming
See All in Programming
RDoc meets YARD
okuramasafumi
4
160
FindyにおけるTakumi活用と脆弱性管理のこれから
rvirus0817
0
430
🔨 小さなビルドシステムを作る
momeemt
3
660
Protocol Buffersの型を超えて拡張性を得る / Beyond Protocol Buffers Types Achieving Extensibility
linyows
0
110
More Approvers for Greater OSS and Japan Community
tkikuc
1
110
個人軟體時代
ethanhuang13
0
310
もうちょっといいRubyプロファイラを作りたい (2025)
osyoyu
0
320
ソフトウェアテスト徹底指南書の紹介
goyoki
1
140
Laravel Boost 超入門
fire_arlo
2
210
TanStack DB ~状態管理の新しい考え方~
bmthd
2
480
Flutter with Dart MCP: All You Need - 박제창 2025 I/O Extended Busan
itsmedreamwalker
0
140
AI時代のUIはどこへ行く?
yusukebe
16
8.2k
Featured
See All Featured
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
51
5.6k
Rails Girls Zürich Keynote
gr2m
95
14k
RailsConf & Balkan Ruby 2019: The Past, Present, and Future of Rails at GitHub
eileencodes
139
34k
Practical Orchestrator
shlominoach
190
11k
StorybookのUI Testing Handbookを読んだ
zakiyama
31
6.1k
Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure
mongodb
48
9.7k
Making the Leap to Tech Lead
cromwellryan
135
9.5k
Large-scale JavaScript Application Architecture
addyosmani
512
110k
What's in a price? How to price your products and services
michaelherold
246
12k
A Tale of Four Properties
chriscoyier
160
23k
Fashionably flexible responsive web design (full day workshop)
malarkey
407
66k
The Straight Up "How To Draw Better" Workshop
denniskardys
236
140k
Transcript
Լɹपฏ 2024/11/20 Vision ProͰࠂϑϦʔͳੈքΛ ࣮ݱ͍ͨ͠ MIERUNE BBQ
Լ पฏ w ࠓͷ݄͔Βࡳຈࡏॅ ग़Γ w Χφμͷσδλϧ*%ελʔτΞοϓͰJ04ΤϯδχΞ w 7JTJPO1SPങͬͨ
2 #MVFTLZ -JOLFEJO
XRͱ • Ծੈքͱݱ࣮ੈքͷΈ߹ΘͤʹΑͬͯ ৽ͨͳମݧΛੜΈग़ٕ͢ज़ͷ૯শ https://www.canon-its.co.jp/solution/mr/vr-ar-mr/
AR(֦ுݱ࣮)ͰͰ͖Δମݧ https://k-tai.watch.impress.co.jp/docs/news/1203694.html
ใྔ͕ଟ͍ͱετϨε • ใྔ͕ଟ͍ͱετϨεΛײ͍͢͡ • λεΫύϑΥʔϚϯεʹӨڹ • ใΛݮΒͨ͢ΊͷऔΓΈॏཁ
Diminished Reality(ݮଛݱ࣮) • ARͱରʹ࣮ࡍʹଘࡏ͢ΔͷΛϦΞϧλΠϜͰݟ͑ͳ͘͢Δٕज़ • ΠϠϗϯͷϊΠζΩϟϯηϦϯάͷࢹ֮όʔδϣϯ https://solution.itage.jp/2021/12/16/16738/
Vision ProͰDRΛͬͯΈΔ • ֗த͔ΒࠂுΓࢴΛফͯ͠ೝෛՙͷ͍ੈքΛ࣮ݱ͍ͨ͠
Vision Proͱ • Apple͕2023ʹൢച։࢝ͨ͠MRϔουηοτ • MacBookͱಉͷM2 νοϓࡌ • ߴղ૾ͷө૾ •
ϓϥΠόγʔΛྀͨ͠ମݧઃܭ
ࠂDRʹඞཁͳٕज़ Vision Proͷಛٕज़ཁ݅ʹϚον͍ͯ͠Δʂ • 1. ࢹ֮ใ(Χϝϥ)͔Βࠂͷݕग़ • Vision ProʹߴੑೳͳΧϝϥ͕ࡌ͞ΕͯΔ •
2. ࠂΛফͨ͢Ίͷഎܠը૾Λੜ • Vision ProʹAIΛಈ͔ͨ͢Ίͷߴੑೳͳνοϓ͕ࡌ͞ΕͯΔ • 3. ੜͨ͠ը૾ΛࠂʹॏͶͯදࣔ • Vision ProʹԾମΛۭؒʹஔͰ͖Δ
ࠂDRʹඞཁͳٕज़ Vision Proͷಛٕज़ཁ݅ʹϚον͍ͯ͠Δʂ͠ͳ͍… • 1. ࢹ֮ใ(Χϝϥ)͔Βࠂͷݕग़ • Vision ProʹߴੑೳͳΧϝϥ͕ࡌ͞ΕͯΔ͕ɺΧϝϥө૾ʹΞΫηεͰ͖ͳ͍ •
2. ࠂΛফͨ͢Ίͷഎܠը૾Λੜ • Vision ProʹAIΛಈ͔ͨ͢Ίͷߴੑೳͳνοϓ͕ࡌ͞ΕͯΔ͕ɺAI༻ΤϯδϯʹΞΫηεͰ͖ͳ͍ • 3. ੜͨ͠ը૾ΛࠂʹॏͶͯදࣔ • Vision ProʹԾମΛۭؒʹஔͰ͖Δ
ࠓճͷॲཧϑϩʔ • PCͰࣄલʹʮݕ͍ͨ͠ࠂʯͱʮॏͶ͍ͨഎܠը૾ͷੜʯΛΔ • Vision Proʮੜͨ͠ը૾ΛࠂʹॏͶͯදࣔʯ͚ͩ • ࢹ֮ใ(Χϝϥ)͔Β ࠂͷݕग़ 1.
ࠂΛࡱӨͯ͠σʔληοτ࡞ 2. ը૾͔ΒࠂྖҬΛݕग़ 3. ը૾ੜ༻ʹྖҬΛܗ ࠂΛফͨ͢Ίͷ എܠը૾ͷੜ 1. ࠂʹସΘΔഎܠը૾Λੜ 2. Vision Pro༻ʹը૾Λܗ ੜͨ͠ը૾Λ ࠂʹॏͶͯදࣔ 1. Vision Pro༻ͷΞϓϦΛ࡞ PCͰࣄલʹ AVPͰϦΞϧλΠϜʹ
ࢹ֮ใ(Χϝϥ)͔Βࠂͷݕग़ ࠂΛࡱӨͯ͠σʔληοτ࡞ • ԼమӺʹܝࡌ͞ΕͨࠂுΓࢴΛΧϝϥͰࡱӨ
ࢹ֮ใ(Χϝϥ)͔Βࠂͷݕग़ ը૾͔ΒࠂྖҬΛݕग़ • ը૾͔ΒҙͷΦϒδΣΫτΛݕग़Ͱ͖Δ Segmented Anything Model(SAM)ϕʔεͷϞσϧΛར༻ • prompt=“Advertisement”Ͱ֘͢ΔྖҬΛࣗಈͰݕग़ ”Advertisement”
https://github.com/hustvl/EVF-SAM
ࢹ֮ใ(Χϝϥ)͔Βࠂͷݕग़ ྖҬݕग़݁Ռͷܗ • ΪβΪβ݀ݕग़ྖҬͱͯ͠ෆਖ਼֬ͳͷͰ࢛֯ܗͱͯ͠ܗ • ܗޙͷը૾Λॏը૾ੜͷϚεΫͱͯ͠ར༻
ࠂΛফͨ͢Ίͷഎܠը૾ͷੜ ݕग़ྖҬΛੜAIͰ࠶ඳը • ը૾෮ݩʹಛԽͨ͠stable-di ff usionϞσϧͰϚεΫྖҬΛ࠶ੜ ޭ ύλʔϯ ࣦഊ ύλʔϯ
https://huggingface.co/stabilityai/stable-di ff usion-2-inpainting
ࠂΛফͨ͢Ίͷഎܠը૾ͷੜ Vision ProͰ͏σʔληοτΛ࡞ • Vision ProͰར༻͢Δը૾σʔληοτΛ࡞ • ݕग़༻ͷReference, ॏ༻ͷGeneratedΛ࡞ Reference
Generated
ੜͨ͠ը૾ΛࠂʹॏͶͯදࣔ Vision ProΞϓϦͷ࡞ • Reference ImageΛݕग़ͨ͠Β Generated ImageΛಉ͡Ґஔʹදࣔͤ͞ΔγϯϓϧͳΞϓϦ Reference ൃݟʂ
Generated ࠂ্ʹ ॏͶͯදࣔʂ
͍͟ɺӺߏͰ࣮ݧ ͦͷ1
͍͟ɺӺߏͰ࣮ݧ ͦͷ2
݁Ռ • Reference ImageΛ͖ͪΜͱݕग़ͯ͠Generated ImageΛॏදࣔͰ͖ͨ • ҰํͰ • ࠂʹेʹ͔ۙͮͳ͍ͱVision Pro͕ReferenceΛݕ͠ͳ͍
• Generated Imageʹมͳͷ͕ೖΓɺʮফ͢ʯ͜ͱ͕Ͱ͖ͳ͍࣌͋ͬͨ
ࠓޙվળ͢ΔͳΒ • ेʹࠂʹ͔ۙͮͳ͍ͱVision Pro͕ReferenceΛݕ͠ͳ͍ -> ͋Β͔͡Ί্ۭؒʹGenerated ImageΛஔ͓͚ͯ͠ ReferenceͷݕΛඞཁͱ͠ͳ͍͔ʁ • Generated
Imageʹมͳͷ͕ೖΓɺʮফ͢ʯ͜ͱ͕Ͱ͖ͳ͍࣌͋ͬͨ -> SAMʹΑΔࠂྖҬݕग़ͷਫ਼͕ෆेͩͬͨͷͰɺଞͷख๏ݕ౼ -> Generated ImageΛੜ͢Δࡍʹ”น”ͱ໌ࣔͯ͠ྑ͔͔ͬͨ