Exploring the Evolution of Speech to Text with SpeechAnalyzer
Masashi
October 02, 2025
Programming
extension DC 2025 Day2 @Sansan
https://sansan.connpass.com/event/362403/
Transcript
Exploring the Evolution of Speech to Text with SpeechAnalyzer
Masashi Kawabe
Software Engineer, Smart Home Team, NOT A HOTEL Inc.
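Home Controller
There are no switches or remote controls in the rooms; everything is operated from the Home Controller. At any NOT A HOTEL anywhere in the world, you can operate every device without hesitation, just as if you were at home.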
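Voice Control
Devices in the room (air conditioner, lighting, curtains, and so on) can be operated with simple voice commands. You can adjust the room to a comfortable state without touching a remote control or a switch.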
SpeechAnalyzer
• An API introduced in iOS 26 that analyzes audio and manages analysis sessions
• Adding modules that conform to the SpeechModule protocol lets it run specific kinds of analysis
  • SpeechTranscriber: Speech to Text
  • DictationTranscriber: Speech to Text for languages and devices that SpeechTranscriber does not support
  • SpeechDetector: voice activity detection (VAD)
SFSpeechRecognizer
• Use this on versions earlier than iOS 26, where SpeechAnalyzer is unavailable
• Available on iOS 10.0 and later
• Supports on-device Speech to Text
• Supports Japanese
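The version-based fallback described on this slide can be sketched as a runtime availability check. This is a minimal sketch, not the speaker's implementation: the `SpeechBackend` enum and the `selectSpeechBackend()` function are hypothetical names introduced here for illustration, and the sketch only chooses a backend without calling the Speech framework.

```swift
import Foundation

/// Hypothetical enum naming the two Speech to Text backends from the slides.
enum SpeechBackend: String {
    case speechTranscriber   // SpeechAnalyzer + SpeechTranscriber (iOS 26.0 and later)
    case sfSpeechRecognizer  // SFSpeechRecognizer (iOS 10.0 and later)
}

/// Chooses a backend from the OS version at runtime.
/// On platforms other than iOS, the `*` wildcard makes the check pass.
func selectSpeechBackend() -> SpeechBackend {
    if #available(iOS 26.0, *) {
        return .speechTranscriber
    } else {
        return .sfSpeechRecognizer
    }
}

print(selectSpeechBackend().rawValue)
```

Keeping the decision in one function means the rest of an app can depend on the returned case rather than scattering `#available` checks across call sites.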
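SpeechTranscriber
• Available on iOS 26.0 and later
• Supports on-device Speech to Text
• Handles long-form audio
• Handles speech captured at a distance from the microphone
• Does not require enabling Siri or keyboard dictation
• Supports Japanese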
Maintains a certain level of accuracy at a distance from the microphone (comparison of SFSpeechRecognizer and SpeechTranscriber)
No more trouble around enabling dictation
• With SFSpeechRecognizer, dictation had to be enabled again when the iPadOS version was upgraded
• If dictation is not enabled alongside the OS version update, the following error occurs:
  Error Domain=kLSRErrorDomain Code=201 "Siri and Dictation are disabled" UserInfo={NSLocalizedDescription=Siri and Dictation are disabled}
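An app that still uses SFSpeechRecognizer could detect this specific failure and prompt the user to enable dictation. The following is a minimal sketch assuming the error surfaces as an `NSError` with exactly the domain and code quoted above; the function name `isDictationDisabledError` is hypothetical, and the sample error is reconstructed from the slide rather than produced by the Speech framework.

```swift
import Foundation

/// Returns true when the error matches the "Siri and Dictation are disabled"
/// failure quoted on the slide (domain kLSRErrorDomain, code 201).
func isDictationDisabledError(_ error: Error) -> Bool {
    let nsError = error as NSError
    return nsError.domain == "kLSRErrorDomain" && nsError.code == 201
}

// Rebuild the error from the slide to exercise the check.
let sample = NSError(
    domain: "kLSRErrorDomain",
    code: 201,
    userInfo: [NSLocalizedDescriptionKey: "Siri and Dictation are disabled"]
)
print(isDictationDisabledError(sample))  // prints "true"
```

Matching on domain and code rather than on the localized description keeps the check independent of the message text.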
(Aside) SpeechDetector
• Does not yet conform to the SpeechModule protocol
• A thread on the Apple Developer Forums shows how to add the SpeechModule conformance yourself
  • There's wrong with speech detector ios26
  • https://developer.apple.com/forums/thread/794439