Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
SpeechAnalyzerによるSpeech to Textの進化を探る
Search
Masashi
October 02, 2025
Programming
11
0
Share
SpeechAnalyzerによるSpeech to Textの進化を探る
extension DC 2025 Day2 @Sansan
https://sansan.connpass.com/event/362403/
Masashi
October 02, 2025
More Decks by Masashi
See All by Masashi
Speech Frameworkを使った音声認識の基本
kawabe
0
79
Eight iOSを支えるアーキテクチャ
kawabe
1
630
これだけは伝えたい設計の技術
kawabe
0
1.3k
EightのUI Component化の取り組み
kawabe
0
140
Other Decks in Programming
See All in Programming
Swift Concurrency Type System
inamiy
1
560
検索設計から 推論設計への重心移動と Recall-First Retrieval
po3rin
4
1.3k
Vibe NLP for Applied NLP
inesmontani
PRO
0
540
AIと共に生きる技術選定 2026
sgash708
0
110
PicoRuby for IoT: Connecting to the Cloud with MQTT
yuuu
2
710
JOAI2026 1st solution - heron0519 -
heron0519
0
160
GNU Makeの使い方 / How to use GNU Make
kaityo256
PRO
16
5.6k
GitHubCopilotCLIをはじめよう.pdf
htkym
0
300
ソフトウェア設計の結合バランス #phperkaigi
kajitack
0
160
AI時代のPhpStorm最新事情 #phpcon_odawara
yusuke
0
240
AI-DLC Deep Dive
yuukiyo
9
5k
個人的に嬉しかったpnpmの新機能・3選
matsuo_atsushi
0
110
Featured
See All Featured
The Art of Programming - Codeland 2020
erikaheidi
57
14k
30 Presentation Tips
portentint
PRO
1
280
Data-driven link building: lessons from a $708K investment (BrightonSEO talk)
szymonslowik
1
1k
A brief & incomplete history of UX Design for the World Wide Web: 1989–2019
jct
1
360
Deep Space Network (abreviated)
tonyrice
0
130
Exploring the Power of Turbo Streams & Action Cable | RailsConf2023
kevinliebholz
37
6.4k
Primal Persuasion: How to Engage the Brain for Learning That Lasts
tmiket
0
330
What does AI have to do with Human Rights?
axbom
PRO
1
2.1k
GraphQLの誤解/rethinking-graphql
sonatard
75
12k
Visualization
eitanlees
150
17k
Facilitating Awesome Meetings
lara
57
6.8k
A Guide to Academic Writing Using Generative AI - A Workshop
ks91
PRO
1
280
Transcript
SpeechAnalyzer ʹΑΔ Speech to Text ͷਐԽΛ୳Δ
Տล խ࢙ɹMasashi Kawabe NOT A HOTEL גࣜձࣾ Smart Home νʔϜ
ιϑτΣΞΤϯδχΞ
ࣨʹεΠονϦϞίϯͳ͘ɺͯ͢ Home Controller͔Βૢ࡞ɻ ੈքதɺͲ͜ͷNOT A HOTELʹߦͬͯɺ ·ΔͰࣗͷΑ͏ʹ໎͏͜ͱͳͯ͘͢ͷػ ثͷૢ࡞͕Ͱ͖·͢ɻ Home Controller
͓෦ʹ͋ΔػثʢΤΞίϯɺর໌ɺ ΧʔςϯͳͲʣΛɺ؆୯ͳԻίϚϯυͰૢ ࡞Ͱ͖·͢ɻ ϦϞίϯεΠονʹ৮ΕΔ͜ͱͳ͘ɺ ͓෦ΛշదʹௐͰ͖·͢ɻ Իίϯτϩʔϧ
SpeechAnalyzer • iOS 26 Ͱಋೖ͞ΕͨɺԻͷੳ, ੳηογϣϯΛཧ͢ΔAPI • SpeechModule Protocol ʹ४ڌ͍ͯ͠ΔϞδϡʔϧΛՃ͢Δ͜ͱͰɺ
ಛఆͷछྨͷੳΛ࣮ߦͰ͖Δ • SpeechTranscriber • Speech to Text • DictationTranscriber • SpeechTranscriber ͕ରԠ͍ͯ͠ͳ͍ݴޠ σόΠεͰͷ Speech to Text • SpeechDetector • Ի۠ؒݕग़ ( VAD )
SFSpeechRecognizer • SpeechAnalyzer Λར༻Ͱ͖ͳ͍ iOS 26 ະຬͰɺͪ͜ΒΛར༻ • iOS 10.0
Ҏ߱Ͱར༻Մೳ • ΦϯσόΠεͰͷ Speech to Text ͕Մೳ • ຊޠαϙʔτ
SpeechTranscriber • iOS 26.0 Ҏ߱Ͱར༻Մೳ • ΦϯσόΠεͰͷ Speech to Text
͕Մೳ • ࣌ؒͷԻʹରԠ • ϚΠΫ͔ΒΕͨҐஔͰͷԻʹରԠ • Siri ΩʔϘʔυͷԻೖྗͷ༗ޮԽ͕ෆཁ • ຊޠαϙʔτ
SpeechTranscriber • iOS 26.0 Ҏ߱Ͱར༻Մೳ • ΦϯσόΠεͰͷ Speech to Text
͕Մೳ • ࣌ؒͷԻʹରԠ • ϚΠΫ͔ΒΕͨҐஔͰͷԻʹରԠ • Siri ΩʔϘʔυͷԻೖྗͷ༗ޮԽ͕ෆཁ • ຊޠαϙʔτ
ϚΠΫ͔ΒΕͨҐஔͰҰఆͷਫ਼Λ୲อ SFSpeechRecognizer SpeechTranscriber
SpeechTranscriber • iOS 26.0 Ҏ߱Ͱར༻Մೳ • ΦϯσόΠεͰͷ Speech to Text
͕Մೳ • ࣌ؒͷԻʹରԠ • ϚΠΫ͔ΒΕͨҐஔͰͷԻʹରԠ • Siri ΩʔϘʔυͷԻೖྗͷ༗ޮԽ͕ෆཁ • ຊޠαϙʔτ
Իೖྗͷ༗ޮԽपΓͷۤ࿑͕ͳ͘ͳΔ • SFSpeechRecognizer ͰɺiPadOS ͷόʔδϣϯΛ্͛Δʹ Իೖྗͷ༗ޮԽ͕ඞཁͩͬͨ • OSͷόʔδϣϯΞοϓσʔτͱ߹ΘͤͯԻೖྗͷ༗ޮԽΛ ͠ͳ͔ͬͨ߹ɺҎԼͷΤϥʔʹͳΔ •
Error Domain=kLSRErrorDomain Code=201 "Siri and Dictation are disabled" UserInfo={NSLocalizedDescription=Siri and Dictation are disabled}
( ༨ஊ ) SpeechDetector • SpeechModule Protocol ʹ·ͩ४ڌ͍ͯ͠ͳ͍ • Apple
Developer Forum ͰɺࣗͰ SpeechModule Protocol ʹ४ڌͤ͞Δํ๏͕հ͞Ε͍ͯΔ • There's wrong with speech detector ios26 • https://developer.apple.com/forums/thread/794439