Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
WWDC2025セッション共有: Visionフレームワークによるドキュメントの読み込み
Search
ni_san2000
June 26, 2025
Technology
0
130
WWDC2025セッション共有: Visionフレームワークによるドキュメントの読み込み
in Swift愛好会スピンオフ WWDC25セッション要約会@DeNA (2025/06/26)
ni_san2000
June 26, 2025
Tweet
Share
More Decks by ni_san2000
See All by ni_san2000
Appleの“ホーム”アプリを使いたい! 2024年の国内スマートホーム事情
ryosism
0
130
初めて参加したiOSDCは○○のようだった!
ryosism
0
1.9k
Other Decks in Technology
See All in Technology
帳票Vibe Coding
terurou
0
150
小さなチーム 大きな仕事 - 個人開発でAIをフル活用する
himaratsu
0
140
Kubernetes における cgroup v2 でのOut-Of-Memory 問題の解決
pfn
PRO
0
250
モダンフロントエンド 開発研修
recruitengineers
PRO
6
4.3k
microCMS 最新リリース情報(microCMS Meetup 2025)
microcms
0
250
ドキュメントはAIの味方!スタートアップのアジャイルを加速するADR
kawauso
3
450
「AI2027」を紐解く ― AGI・ASI・シンギュラリティ
masayamoriofficial
0
130
知られざるprops命名の慣習 アクション編
uhyo
11
2.8k
『FailNet~やらかし共有SNS~』エレベーターピッチ
yokomachi
1
170
実践アプリケーション設計 ②トランザクションスクリプトへの対応
recruitengineers
PRO
4
1k
KiroでGameDay開催してみよう(準備編)
yuuuuuuu168
1
150
コスト削減の基本の「キ」~ コスト消費3大リソースへの対策 ~
smt7174
2
280
Featured
See All Featured
Creating an realtime collaboration tool: Agile Flush - .NET Oxford
marcduiker
31
2.2k
Scaling GitHub
holman
462
140k
10 Git Anti Patterns You Should be Aware of
lemiorhan
PRO
656
61k
The Art of Delivering Value - GDevCon NA Keynote
reverentgeek
15
1.6k
Documentation Writing (for coders)
carmenintech
73
5k
Embracing the Ebb and Flow
colly
87
4.8k
Improving Core Web Vitals using Speculation Rules API
sergeychernyshev
18
1.1k
CoffeeScript is Beautiful & I Never Want to Write Plain JavaScript Again
sstephenson
161
15k
The Straight Up "How To Draw Better" Workshop
denniskardys
236
140k
Build your cross-platform service in a week with App Engine
jlugia
231
18k
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
34
3.1k
YesSQL, Process and Tooling at Scale
rocio
173
14k
Transcript
WWDC2025ηογϣϯڞ༗: VisionϑϨʔϜϫʔΫʹΑΔυΩϡϝϯτͷ ಡΈࠐΈ in SwiftѪձεϐϯΦϑ WWDC25ηογϣϯཁձ@DeNA (2025/06/26) ʹʔ͞Μ(@ni_san2000)
ࣗݾհ • ʹʔ͞Μ(@ni_san2000) • ۀͰiOSΞϓϦΛ࡞͍ͬͯ·͢(3) • Apple৴ऀͳͷͰجௐߨԋੲ͔ΒݟͯΔ • macOSͷωʔϛϯάετʔϦʔ •
ΫϨΠάɾϑΣσϦΪͷύϧΫʔϧ • झຯ • Χϝϥ / ؍༿২ / ίʔώʔ / eεϙʔπ • ࠷ۙࣸਅίϯςετͱ͔ڵຯ͋Γ·͢ 2
͢ηογϣϯʹ͍ͭͯ • VisionϑϨʔϜϫʔΫʹΑΔυΩϡϝϯτͷಡΈࠐΈ • “Foundation ModelsͰͳ͍”ɺAIؔ࿈ͷ͓Ͱ͢ 3 ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
VisionϑϨʔϜϫʔΫʹ͍͓ͭͯ͞Β͍ • ಈը૾ʹಛԽͨ͠AIϞσϧͷػೳΛఏڙ͢ΔϑϨʔϜϫʔΫ • ը૾ೝࣝ, ମݕग़, إೝࣝͳͲ • .mlmodelΛཁ͢ΔػೳΛखܰʹAPIͱͯ͠ར༻Ͱ͖Δ •
iOS18Ͱ31ݸͷAPIΛఏڙ͍ͯͨ͠ 4 ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQNBDIJOFMFBSOJOHNPEFMT
VisionϑϨʔϜϫʔΫʹ͍͓ͭͯ͞Β͍ CalculateImageAestheticsScoresRequest ClassifyImageRequest CoreMLRequest DetectAnimalBodyPoseRequest DetectBarcodesRequest DetectContoursRequest DetectDocumentSegmentationRequest DetectFaceCaptureQualityRequest DetectFaceLandmarksRequest
DetectFaceRectanglesRequest DetectHorizonRequest DetectHumanBodyPose3DRequest DetectHumanBodyPoseRequest DetectHumanHandPoseRequest DetectHumanRectanglesRequest DetectRectanglesRequest DetectTextRectanglesRequest DetectTrajectoriesRequest GenerateAttentionBasedSaliencyIma GenerateForegroundInstanceMaskRequest GenerateImageFeaturePrintRequest GenerateOpticalFlowRequest GeneratePersonInstanceMaskRequest GeneratePersonSegmentationRequest RecognizeAnimalsRequest RecognizeTextRequest TrackHomographicImageRegistration TrackObjectRequest TrackOpticalFlowRequest TrackRectangleRequest TrackTranslationalImageRegistrationRequest 5 RecognizeDocumentRequest /FX DetectLensSmudgeRequest /FX
͜Ε·ͰͷυΩϡϝϯτೝࣝ • ݟ͍͑ͯΔจࣈೝࣝͰ͖Δ • ͚ͲɺจষߏදͷίϯςΩετࣦΘΕΔ 6 ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
xOS26͔ΒͰ͖ΔΑ͏ʹͳΔ͜ͱ • υΩϡϝϯτ͔ΒจॻͷߏΛཧղͯ͠ཁૉͷೝ͕ࣝͰ͖Δ • ஈམɾՕॻ͖දͷίϯςΩετೝࣝ • ಛఆͷϑΥʔϚοτʹଈͨ͠ςΩετͷࣝผ • QRίʔυͷऔΓग़͠ 7
υΩϡϝϯτ͔ΒจॻͷߏΛཧղͯ͠ཁૉͷೝ͕ࣝͰ͖Δ ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
ᶃ RequestΫϥεͷ༻ҙ (RecognizeDocumentRequest) ᶄ ը૾Λ͢ ᶅ DocumentObservationΛड͚औΔ ᶆ document͔Β֤ཁૉΛऔಘ ᶇ
That’s it !
DocumentObservationͷݕग़ߏ • ContainerList, Table, Text, BarcodesͳͲͷཁૉΛ࣋ͭ • $FMM*UFNT$POUBJOFSͷཁૉ͍࣋ͬͯΔͷͰɺ͞ΒʹแͰ͖Δ 10 ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
☝
• ࣝผϑΥʔϚοτ10छྨ • ຊޠ·ͩNot supported • දࣔ͢ΔࡍʹText AttributeΛࢦఆ͢Δඞཁ͕ͳ͘ͳΓͦ͏ ςΩετΛࣝผ͢ΔDataDetector 11
આ໌ 5ZQF ิ ΧϨϯμʔ CalendarEvent ϝʔϧΞυϨε EmailAddress ϑϥΠτ൪߸ FlightNumber 63- Link ଌఆ ୯Ґ͖ Measurement %JNFOTJPOͰදݱͰ͖Δ୯Ґ ֹۚ ௨՟͖ MoneyAmount -PDBMF$VSSFODZͰදݱͰ͖Δ୯Ґ ࢧ͍ঢ়گ PaymentIdentifier 6OJ fi FE1BZNFOUT*OUFSGBDF 61* ి൪߸ PhoneNumber ॅॴ PostalAddress FHTUSFFU DJUZ TUBUF ૹ൪߸ ShipmentTrackingNumber ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
֤ϑΥʔϚοτͱͯ͠σʔλΛऔಘ • switch-case͢Δ͚ͩ • ͲͷϑΥʔϚοτ͔ςΩετ୯ҐͰࣝผࡁΈ 12
υΩϡϝϯτ͔ΒจॻͷߏΛཧղͯ͠ཁૉͷೝ͕ࣝͰ͖Δ ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
• VisionϑϨʔϜϫʔΫͷ৽نAPI • υΩϡϝϯτͷ༰ΛߏతʹੳͰ͖Δ • ContainerTable, List, Cell, TextͳͲͷཁૉΛ͍࣋ͬͯΔ •
CellListͷItemೖΕࢠͰContainerΛ֨ೲͰ͖Δ • Text֨ೲ͞Εͨจࣈ͔ΒϑΥʔϚοτΛೝࣝͰ͖Δ • DataDetector.Match.SemanticDetails.XXXXXʹଟͷ৽نσʔλߏ RecognizeDocumentRequest·ͱΊ 14
ΓͷVisionؔ࿈ͷ৽ཁૉ̎ͭ ͓·͚ 15
• DetectLensSmudgeRequest / SmudgeObservation͕Ճ • ೖྗͨ͠ը૾͕ԚΕ͍ͯΔ͔Λผ͢Δ • ᮢΛઃ͚Δ͜ͱͰੳΤϥʔΛະવʹ͛Δ Ϩϯζද໘ͷԚΕಶΓΛݕग़͢ΔObservation 16
ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
• खͷؔઅҐஔΛݕग़͢ΔϞσϧ͕ߋ৽͞Εͨ • ਪͷਫ਼ͱ্͕ • WWDC2021Ͱൃද͞ΕͨϞσϧ͔ΒΞοϓσʔτ HandPose DetectionͷੳϞσϧߋ৽ 17 ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
• VisionϑϨʔϜϫʔΫͷػೳՃ • υΩϡϝϯτͷ༰ΛߏతʹੳͰ͖Δ • ςΩετͷ༰ΛྨͰ͖Δ • ͦͷଞʹ2ͭͷVisionϑϨʔϜϫʔΫͷΞοϓσʔτ • DetectLensSmudgeRequest
/ SmudgeObservation • HandPose DetectionͷੳϞσϧߋ৽ ͓͠·͍ 18