WWDC2025 Session Sharing: Reading Documents with the Vision Framework
ni_san2000
June 26, 2025
Technology
in Swift愛好会 Spin-off: WWDC25 Session Summary Meetup @DeNA (2025/06/26)
Transcript
WWDC2025 Session Sharing: Reading Documents with the Vision Framework, in Swift愛好会 Spin-off: WWDC25 Session Summary Meetup @DeNA (2025/06/26), にーさん (@ni_san2000)
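Self-introduction
• にーさん (@ni_san2000)
• Builds iOS apps as a day job (3)
• An Apple devotee who has watched the keynotes for years
• The macOS naming stories
• Craig Federighi's parkour
• Hobbies
• Camera / houseplants / coffee / esports
• Lately interested in photo contests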
About the session covered here
• Reading documents with the Vision framework
• An AI-related topic that is "not Foundation Models"
Source: https://developer.apple.com/jp/videos/play/wwdc…
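Vision framework recap
• A framework that provides AI-model features specialized for video and images
• Image recognition, object detection, face recognition, and so on
• Features that would otherwise require an .mlmodel are available as easy-to-use APIs
• As of iOS 18 it provided 31 APIs
Source: https://developer.apple.com/jp/machine-learning/models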
Vision framework recap
CalculateImageAestheticsScoresRequest
ClassifyImageRequest
CoreMLRequest
DetectAnimalBodyPoseRequest
DetectBarcodesRequest
DetectContoursRequest
DetectDocumentSegmentationRequest
DetectFaceCaptureQualityRequest
DetectFaceLandmarksRequest
DetectFaceRectanglesRequest
DetectHorizonRequest
DetectHumanBodyPose3DRequest
DetectHumanBodyPoseRequest
DetectHumanHandPoseRequest
DetectHumanRectanglesRequest
DetectRectanglesRequest
DetectTextRectanglesRequest
DetectTrajectoriesRequest
GenerateAttentionBasedSaliencyIma
GenerateForegroundInstanceMaskRequest
GenerateImageFeaturePrintRequest
GenerateOpticalFlowRequest
GeneratePersonInstanceMaskRequest
GeneratePersonSegmentationRequest
RecognizeAnimalsRequest
RecognizeTextRequest
TrackHomographicImageRegistration
TrackObjectRequest
TrackOpticalFlowRequest
TrackRectangleRequest
TrackTranslationalImageRegistrationRequest
RecognizeDocumentRequest (New)
DetectLensSmudgeRequest (New)
Document recognition until now
• Visible characters can be recognized
• However, the context of the text structure and of tables is lost
Source: https://developer.apple.com/jp/videos/play/wwdc…
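What becomes possible from xOS 26
• Understanding the structure of a document and recognizing its elements
• Context recognition for paragraphs, bullet lists, and tables
• Identification of text that follows a specific format
• Extraction of QR codes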
Understanding the structure of a document and recognizing its elements
Source: https://developer.apple.com/jp/videos/play/wwdc…
① Prepare the request class (RecognizeDocumentRequest)
② Pass in the image
③ Receive a DocumentObservation
④ Get each element from document
⑤ That's it!
Detected structure of DocumentObservation
• A Container holds elements such as List, Table, Text, and Barcodes
• Cell and Items also hold Container elements, so they can nest further
Source: https://developer.apple.com/jp/videos/play/wwdc…
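The nesting described on this slide can be sketched in plain Swift. This is only an illustrative model: `Element`, `Container`, and `allText(in:)` are hypothetical stand-ins made up for this example, not the Vision framework's own types. It shows why a recursive walk is the natural way to read a structure in which list items and table cells can themselves hold containers.

```swift
// Hypothetical stand-in types; these are NOT the Vision framework's own types.
enum Element {
    case text(String)
    case list(items: [Container])     // each list item is itself a container
    case table(rows: [[Container]])   // each table cell is itself a container
    case barcode(payload: String)
}

struct Container {
    var elements: [Element]
}

// Recursively collect every text string, descending into list items and table cells.
func allText(in container: Container) -> [String] {
    var result: [String] = []
    for element in container.elements {
        switch element {
        case .text(let string):
            result.append(string)
        case .list(let items):
            for item in items { result += allText(in: item) }
        case .table(let rows):
            for row in rows {
                for cell in row { result += allText(in: cell) }
            }
        case .barcode:
            break // a barcode carries a payload, not text
        }
    }
    return result
}

// A small document: a title, a one-item list, and a one-row table
// whose second cell contains a nested list.
let nestedCell = Container(elements: [
    .list(items: [Container(elements: [.text("Nested")])])
])
let document = Container(elements: [
    .text("Title"),
    .list(items: [Container(elements: [.text("Item A")])]),
    .table(rows: [[Container(elements: [.text("Cell 1")]), nestedCell]])
])
let texts = allText(in: document)
```

The walk visits elements in document order, so the collected strings keep the reading order of the modelled document.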
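DataDetector for identifying text
• 10 kinds of recognizable formats
• Japanese is still "Not supported"
• Specifying a Text Attribute at display time will likely become unnecessary

Description | Type | Notes
Calendar | CalendarEvent |
Email address | EmailAddress |
Flight number | FlightNumber |
URL | Link |
Measurement (with unit) | Measurement | Units that can be expressed with Dimension
Monetary amount (with currency) | MoneyAmount | Units that can be expressed with Locale.Currency
Payment status | PaymentIdentifier | Unified Payments Interface (UPI)
Phone number | PhoneNumber |
Address | PostalAddress | e.g. street, city, state
Shipment number | ShipmentTrackingNumber |

Source: https://developer.apple.com/jp/videos/play/wwdc…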
Getting the data as each format
• Just a switch-case
• The format has already been identified for each piece of text
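The "just a switch-case" idea can be illustrated with a small, self-contained Swift sketch. `DetectedFormat` and `describe(_:)` are hypothetical names invented here, covering only a few of the ten format types; they are not the framework's own enum. The point is that once each piece of text already carries its format, the caller only has to branch on the case and read the associated value.

```swift
// Hypothetical enum mirroring a few of the format types; not the real framework type.
enum DetectedFormat {
    case emailAddress(String)
    case phoneNumber(String)
    case flightNumber(String)
    case moneyAmount(amount: Double, currencyCode: String)
}

// Branch on the already-identified format and build a display string.
func describe(_ format: DetectedFormat) -> String {
    switch format {
    case .emailAddress(let address):
        return "Email: \(address)"
    case .phoneNumber(let number):
        return "Phone: \(number)"
    case .flightNumber(let number):
        return "Flight: \(number)"
    case .moneyAmount(let amount, let currencyCode):
        return "Amount: \(amount) \(currencyCode)"
    }
}

let emailSummary = describe(.emailAddress("user@example.com"))
let flightSummary = describe(.flightNumber("NH106"))
```

Because the switch is exhaustive, adding a new case to the enum makes the compiler flag every place that still needs handling.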
Understanding the structure of a document and recognizing its elements
Source: https://developer.apple.com/jp/videos/play/wwdc…
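RecognizeDocumentRequest summary
• A new API in the Vision framework
• Can analyze the content of a document structurally
• A Container holds elements such as Table, List, Cell, and Text
• A Cell or a List Item can hold a nested Container
• Text can recognize formats from the characters it holds
• Many new data structures under DataDetector.Match.SemanticDetails.XXXXX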
Bonus: the two remaining new Vision-related features
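Observation that detects dirt and fogging on the lens surface
• DetectLensSmudgeRequest / SmudgeObservation were added
• Determines whether the input image is smudged
• Setting a threshold lets you head off analysis errors before they happen
Source: https://developer.apple.com/jp/videos/play/wwdc…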
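The threshold idea can be sketched as a simple gate in front of the analysis step. Everything here is an assumption for illustration: `smudgeConfidence` stands in for whatever score the smudge observation reports (assumed to lie in 0...1, with higher meaning the lens is more likely dirty), and `CaptureDecision`, `decide`, and the default threshold of 0.5 are hypothetical.

```swift
// Hypothetical decision type for what to do with a captured image.
enum CaptureDecision: Equatable {
    case analyze
    case askUserToCleanLens
}

// Assumption: `smudgeConfidence` is in 0...1 and higher means "more likely smudged".
// The default threshold is an arbitrary illustrative value, not a recommended one.
func decide(smudgeConfidence: Double, threshold: Double = 0.5) -> CaptureDecision {
    smudgeConfidence >= threshold ? .askUserToCleanLens : .analyze
}

let cleanShot = decide(smudgeConfidence: 0.1)
let dirtyShot = decide(smudgeConfidence: 0.9)
```

Gating like this keeps a smudged capture from reaching document analysis at all, instead of handling a poor result afterwards.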
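HandPose Detection analysis model update
• The model that detects hand joint positions was updated
• Inference accuracy, among other things, improved
• An update from the model announced at WWDC2021
Source: https://developer.apple.com/jp/videos/play/wwdc…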
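The end
• Feature additions to the Vision framework
• Can analyze the content of a document structurally
• Can classify the content of text
• Two other Vision framework updates
• DetectLensSmudgeRequest / SmudgeObservation
• HandPose Detection analysis model update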