Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
SpeechAnalyzerによるSpeech to Textの進化を探る
Search
Masashi
October 02, 2025
Programming
0
10
SpeechAnalyzerによるSpeech to Textの進化を探る
extension DC 2025 Day2 @Sansan
https://sansan.connpass.com/event/362403/
Masashi
October 02, 2025
Tweet
Share
More Decks by Masashi
See All by Masashi
Speech Frameworkを使った音声認識の基本
kawabe
0
78
Eight iOSを支えるアーキテクチャ
kawabe
1
630
これだけは伝えたい設計の技術
kawabe
0
1.3k
EightのUI Component化の取り組み
kawabe
0
140
Other Decks in Programming
See All in Programming
モダンOBSプラグイン開発
umireon
0
180
ネイティブアプリとWebフロントエンドのAPI通信ラッパーにおける共通化の勘所
suguruooki
0
190
最初からAWS CDKで技術検証してもいいんじゃない?
akihisaikeda
4
170
守る「だけ」の優しいEMを抜けて、 事業とチームを両方見る視点を身につけた話
maroon8021
3
1.4k
Cyrius ーLinux非依存にコンテナをネイティブ実行する専用OSー
n4mlz
0
250
実践ハーネスエンジニアリング #MOSHTech
kajitack
7
4.3k
How to stabilize UI tests using XCTest
akkeylab
0
140
今年もTECHSCOREブログを書き続けます!
hiraoku101
0
160
ベクトル検索のフィルタを用いた機械学習モデルとの統合 / python-meetup-fukuoka-06-vector-attr
monochromegane
2
550
Smarter Angular mit Transformers.js & Prompt API
christianliebel
PRO
1
100
PHP でエミュレータを自作して Ubuntu を動かそう
m3m0r7
PRO
2
150
Goの型安全性で実現する複数プロダクトの権限管理
ishikawa_pro
2
1.4k
Featured
See All Featured
BBQ
matthewcrist
89
10k
StorybookのUI Testing Handbookを読んだ
zakiyama
31
6.6k
brightonSEO & MeasureFest 2025 - Christian Goodrich - Winning strategies for Black Friday CRO & PPC
cargoodrich
3
130
From π to Pie charts
rasagy
0
160
Efficient Content Optimization with Google Search Console & Apps Script
katarinadahlin
PRO
1
440
Ruling the World: When Life Gets Gamed
codingconduct
0
180
What Being in a Rock Band Can Teach Us About Real World SEO
427marketing
0
200
Exploring the relationship between traditional SERPs and Gen AI search
raygrieselhuber
PRO
2
3.7k
RailsConf 2023
tenderlove
30
1.4k
How To Speak Unicorn (iThemes Webinar)
marktimemedia
1
420
Save Time (by Creating Custom Rails Generators)
garrettdimon
PRO
32
2.6k
The agentic SEO stack - context over prompts
schlessera
0
720
Transcript
SpeechAnalyzer ʹΑΔ Speech to Text ͷਐԽΛ୳Δ
Տล խ࢙ɹMasashi Kawabe NOT A HOTEL גࣜձࣾ Smart Home νʔϜ
ιϑτΣΞΤϯδχΞ
ࣨʹεΠονϦϞίϯͳ͘ɺͯ͢ Home Controller͔Βૢ࡞ɻ ੈքதɺͲ͜ͷNOT A HOTELʹߦͬͯɺ ·ΔͰࣗͷΑ͏ʹ໎͏͜ͱͳͯ͘͢ͷػ ثͷૢ࡞͕Ͱ͖·͢ɻ Home Controller
͓෦ʹ͋ΔػثʢΤΞίϯɺর໌ɺ ΧʔςϯͳͲʣΛɺ؆୯ͳԻίϚϯυͰૢ ࡞Ͱ͖·͢ɻ ϦϞίϯεΠονʹ৮ΕΔ͜ͱͳ͘ɺ ͓෦ΛշదʹௐͰ͖·͢ɻ Իίϯτϩʔϧ
SpeechAnalyzer • iOS 26 Ͱಋೖ͞ΕͨɺԻͷੳ, ੳηογϣϯΛཧ͢ΔAPI • SpeechModule Protocol ʹ४ڌ͍ͯ͠ΔϞδϡʔϧΛՃ͢Δ͜ͱͰɺ
ಛఆͷछྨͷੳΛ࣮ߦͰ͖Δ • SpeechTranscriber • Speech to Text • DictationTranscriber • SpeechTranscriber ͕ରԠ͍ͯ͠ͳ͍ݴޠ σόΠεͰͷ Speech to Text • SpeechDetector • Ի۠ؒݕग़ ( VAD )
SFSpeechRecognizer • SpeechAnalyzer Λར༻Ͱ͖ͳ͍ iOS 26 ະຬͰɺͪ͜ΒΛར༻ • iOS 10.0
Ҏ߱Ͱར༻Մೳ • ΦϯσόΠεͰͷ Speech to Text ͕Մೳ • ຊޠαϙʔτ
SpeechTranscriber • iOS 26.0 Ҏ߱Ͱར༻Մೳ • ΦϯσόΠεͰͷ Speech to Text
͕Մೳ • ࣌ؒͷԻʹରԠ • ϚΠΫ͔ΒΕͨҐஔͰͷԻʹରԠ • Siri ΩʔϘʔυͷԻೖྗͷ༗ޮԽ͕ෆཁ • ຊޠαϙʔτ
SpeechTranscriber • iOS 26.0 Ҏ߱Ͱར༻Մೳ • ΦϯσόΠεͰͷ Speech to Text
͕Մೳ • ࣌ؒͷԻʹରԠ • ϚΠΫ͔ΒΕͨҐஔͰͷԻʹରԠ • Siri ΩʔϘʔυͷԻೖྗͷ༗ޮԽ͕ෆཁ • ຊޠαϙʔτ
ϚΠΫ͔ΒΕͨҐஔͰҰఆͷਫ਼Λ୲อ SFSpeechRecognizer SpeechTranscriber
SpeechTranscriber • iOS 26.0 Ҏ߱Ͱར༻Մೳ • ΦϯσόΠεͰͷ Speech to Text
͕Մೳ • ࣌ؒͷԻʹରԠ • ϚΠΫ͔ΒΕͨҐஔͰͷԻʹରԠ • Siri ΩʔϘʔυͷԻೖྗͷ༗ޮԽ͕ෆཁ • ຊޠαϙʔτ
Իೖྗͷ༗ޮԽपΓͷۤ࿑͕ͳ͘ͳΔ • SFSpeechRecognizer ͰɺiPadOS ͷόʔδϣϯΛ্͛Δʹ Իೖྗͷ༗ޮԽ͕ඞཁͩͬͨ • OSͷόʔδϣϯΞοϓσʔτͱ߹ΘͤͯԻೖྗͷ༗ޮԽΛ ͠ͳ͔ͬͨ߹ɺҎԼͷΤϥʔʹͳΔ •
Error Domain=kLSRErrorDomain Code=201 "Siri and Dictation are disabled" UserInfo={NSLocalizedDescription=Siri and Dictation are disabled}
( ༨ஊ ) SpeechDetector • SpeechModule Protocol ʹ·ͩ४ڌ͍ͯ͠ͳ͍ • Apple
Developer Forum ͰɺࣗͰ SpeechModule Protocol ʹ४ڌͤ͞Δํ๏͕հ͞Ε͍ͯΔ • There's wrong with speech detector ios26 • https://developer.apple.com/forums/thread/794439