Upgrade to PRO for Only $50/Year—Limited-Time Offer! 🔥
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
WWDC2025セッション共有: Visionフレームワークによるドキュメントの読み込み
Search
ni_san2000
June 26, 2025
Technology
0
180
WWDC2025セッション共有: Visionフレームワークによるドキュメントの読み込み
in Swift愛好会スピンオフ WWDC25セッション要約会@DeNA (2025/06/26)
ni_san2000
June 26, 2025
Tweet
Share
More Decks by ni_san2000
See All by ni_san2000
Appleの“ホーム”アプリを使いたい! 2024年の国内スマートホーム事情
ryosism
0
140
初めて参加したiOSDCは○○のようだった!
ryosism
0
1.9k
Other Decks in Technology
See All in Technology
5分で知るMicrosoft Ignite
taiponrock
PRO
0
380
エンジニアとPMのドメイン知識の溝をなくす、 AIネイティブな開発プロセス
applism118
4
1.3k
MapKitとオープンデータで実現する地図情報の拡張と可視化
zozotech
PRO
1
140
AIと二人三脚で育てた、個人開発アプリグロース術
zozotech
PRO
1
730
regrowth_tokyo_2025_securityagent
hiashisan
0
250
寫了幾年 Code,然後呢?軟體工程師必須重新認識的 DevOps
cheng_wei_chen
1
1.4k
ログ管理の新たな可能性?CloudWatchの新機能をご紹介
ikumi_ono
1
770
MLflowで始めるプロンプト管理、評価、最適化
databricksjapan
1
250
Lookerで実現するセキュアな外部データ提供
zozotech
PRO
0
140
乗りこなせAI駆動開発の波
eltociear
1
1.1k
学習データって増やせばいいんですか?
ftakahashi
2
350
【AWS re:Invent 2025速報】AIビルダー向けアップデートをまとめて解説!
minorun365
4
530
Featured
See All Featured
RailsConf 2023
tenderlove
30
1.3k
Music & Morning Musume
bryan
46
7k
Building an army of robots
kneath
306
46k
Automating Front-end Workflow
addyosmani
1371
200k
Refactoring Trust on Your Teams (GOTO; Chicago 2020)
rmw
35
3.3k
For a Future-Friendly Web
brad_frost
180
10k
Save Time (by Creating Custom Rails Generators)
garrettdimon
PRO
32
1.8k
Imperfection Machines: The Place of Print at Facebook
scottboms
269
13k
Building Flexible Design Systems
yeseniaperezcruz
330
39k
Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure
mongodb
48
9.8k
Mobile First: as difficult as doing things right
swwweet
225
10k
Practical Tips for Bootstrapping Information Extraction Pipelines
honnibal
25
1.6k
Transcript
WWDC2025ηογϣϯڞ༗: VisionϑϨʔϜϫʔΫʹΑΔυΩϡϝϯτͷ ಡΈࠐΈ in SwiftѪձεϐϯΦϑ WWDC25ηογϣϯཁձ@DeNA (2025/06/26) ʹʔ͞Μ(@ni_san2000)
ࣗݾհ • ʹʔ͞Μ(@ni_san2000) • ۀͰiOSΞϓϦΛ࡞͍ͬͯ·͢(3) • Apple৴ऀͳͷͰجௐߨԋੲ͔ΒݟͯΔ • macOSͷωʔϛϯάετʔϦʔ •
ΫϨΠάɾϑΣσϦΪͷύϧΫʔϧ • झຯ • Χϝϥ / ؍༿২ / ίʔώʔ / eεϙʔπ • ࠷ۙࣸਅίϯςετͱ͔ڵຯ͋Γ·͢ 2
͢ηογϣϯʹ͍ͭͯ • VisionϑϨʔϜϫʔΫʹΑΔυΩϡϝϯτͷಡΈࠐΈ • “Foundation ModelsͰͳ͍”ɺAIؔ࿈ͷ͓Ͱ͢ 3 ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
VisionϑϨʔϜϫʔΫʹ͍͓ͭͯ͞Β͍ • ಈը૾ʹಛԽͨ͠AIϞσϧͷػೳΛఏڙ͢ΔϑϨʔϜϫʔΫ • ը૾ೝࣝ, ମݕग़, إೝࣝͳͲ • .mlmodelΛཁ͢ΔػೳΛखܰʹAPIͱͯ͠ར༻Ͱ͖Δ •
iOS18Ͱ31ݸͷAPIΛఏڙ͍ͯͨ͠ 4 ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQNBDIJOFMFBSOJOHNPEFMT
VisionϑϨʔϜϫʔΫʹ͍͓ͭͯ͞Β͍ CalculateImageAestheticsScoresRequest ClassifyImageRequest CoreMLRequest DetectAnimalBodyPoseRequest DetectBarcodesRequest DetectContoursRequest DetectDocumentSegmentationRequest DetectFaceCaptureQualityRequest DetectFaceLandmarksRequest
DetectFaceRectanglesRequest DetectHorizonRequest DetectHumanBodyPose3DRequest DetectHumanBodyPoseRequest DetectHumanHandPoseRequest DetectHumanRectanglesRequest DetectRectanglesRequest DetectTextRectanglesRequest DetectTrajectoriesRequest GenerateAttentionBasedSaliencyIma GenerateForegroundInstanceMaskRequest GenerateImageFeaturePrintRequest GenerateOpticalFlowRequest GeneratePersonInstanceMaskRequest GeneratePersonSegmentationRequest RecognizeAnimalsRequest RecognizeTextRequest TrackHomographicImageRegistration TrackObjectRequest TrackOpticalFlowRequest TrackRectangleRequest TrackTranslationalImageRegistrationRequest 5 RecognizeDocumentRequest /FX DetectLensSmudgeRequest /FX
͜Ε·ͰͷυΩϡϝϯτೝࣝ • ݟ͍͑ͯΔจࣈೝࣝͰ͖Δ • ͚ͲɺจষߏදͷίϯςΩετࣦΘΕΔ 6 ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
xOS26͔ΒͰ͖ΔΑ͏ʹͳΔ͜ͱ • υΩϡϝϯτ͔ΒจॻͷߏΛཧղͯ͠ཁૉͷೝ͕ࣝͰ͖Δ • ஈམɾՕॻ͖දͷίϯςΩετೝࣝ • ಛఆͷϑΥʔϚοτʹଈͨ͠ςΩετͷࣝผ • QRίʔυͷऔΓग़͠ 7
υΩϡϝϯτ͔ΒจॻͷߏΛཧղͯ͠ཁૉͷೝ͕ࣝͰ͖Δ ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
ᶃ RequestΫϥεͷ༻ҙ (RecognizeDocumentRequest) ᶄ ը૾Λ͢ ᶅ DocumentObservationΛड͚औΔ ᶆ document͔Β֤ཁૉΛऔಘ ᶇ
That’s it !
DocumentObservationͷݕग़ߏ • ContainerList, Table, Text, BarcodesͳͲͷཁૉΛ࣋ͭ • $FMM*UFNT$POUBJOFSͷཁૉ͍࣋ͬͯΔͷͰɺ͞ΒʹแͰ͖Δ 10 ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
☝
• ࣝผϑΥʔϚοτ10छྨ • ຊޠ·ͩNot supported • දࣔ͢ΔࡍʹText AttributeΛࢦఆ͢Δඞཁ͕ͳ͘ͳΓͦ͏ ςΩετΛࣝผ͢ΔDataDetector 11
આ໌ 5ZQF ิ ΧϨϯμʔ CalendarEvent ϝʔϧΞυϨε EmailAddress ϑϥΠτ൪߸ FlightNumber 63- Link ଌఆ ୯Ґ͖ Measurement %JNFOTJPOͰදݱͰ͖Δ୯Ґ ֹۚ ௨՟͖ MoneyAmount -PDBMF$VSSFODZͰදݱͰ͖Δ୯Ґ ࢧ͍ঢ়گ PaymentIdentifier 6OJ fi FE1BZNFOUT*OUFSGBDF 61* ి൪߸ PhoneNumber ॅॴ PostalAddress FHTUSFFU DJUZ TUBUF ૹ൪߸ ShipmentTrackingNumber ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
֤ϑΥʔϚοτͱͯ͠σʔλΛऔಘ • switch-case͢Δ͚ͩ • ͲͷϑΥʔϚοτ͔ςΩετ୯ҐͰࣝผࡁΈ 12
υΩϡϝϯτ͔ΒจॻͷߏΛཧղͯ͠ཁૉͷೝ͕ࣝͰ͖Δ ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
• VisionϑϨʔϜϫʔΫͷ৽نAPI • υΩϡϝϯτͷ༰ΛߏతʹੳͰ͖Δ • ContainerTable, List, Cell, TextͳͲͷཁૉΛ͍࣋ͬͯΔ •
CellListͷItemೖΕࢠͰContainerΛ֨ೲͰ͖Δ • Text֨ೲ͞Εͨจࣈ͔ΒϑΥʔϚοτΛೝࣝͰ͖Δ • DataDetector.Match.SemanticDetails.XXXXXʹଟͷ৽نσʔλߏ RecognizeDocumentRequest·ͱΊ 14
ΓͷVisionؔ࿈ͷ৽ཁૉ̎ͭ ͓·͚ 15
• DetectLensSmudgeRequest / SmudgeObservation͕Ճ • ೖྗͨ͠ը૾͕ԚΕ͍ͯΔ͔Λผ͢Δ • ᮢΛઃ͚Δ͜ͱͰੳΤϥʔΛະવʹ͛Δ Ϩϯζද໘ͷԚΕಶΓΛݕग़͢ΔObservation 16
ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
• खͷؔઅҐஔΛݕग़͢ΔϞσϧ͕ߋ৽͞Εͨ • ਪͷਫ਼ͱ্͕ • WWDC2021Ͱൃද͞ΕͨϞσϧ͔ΒΞοϓσʔτ HandPose DetectionͷੳϞσϧߋ৽ 17 ႄႨჭğIUUQTEFWFMPQFSBQQMFDPNKQWJEFPTQMBZXXED
• VisionϑϨʔϜϫʔΫͷػೳՃ • υΩϡϝϯτͷ༰ΛߏతʹੳͰ͖Δ • ςΩετͷ༰ΛྨͰ͖Δ • ͦͷଞʹ2ͭͷVisionϑϨʔϜϫʔΫͷΞοϓσʔτ • DetectLensSmudgeRequest
/ SmudgeObservation • HandPose DetectionͷੳϞσϧߋ৽ ͓͠·͍ 18