Exploring the Evolution of Speech to Text with SpeechAnalyzer
Masashi
October 02, 2025
extension DC 2025 Day2 @Sansan
https://sansan.connpass.com/event/362403/
Transcript
Exploring the Evolution of Speech to Text with SpeechAnalyzer
Masashi Kawabe
NOT A HOTEL Inc., Smart Home team
Software Engineer
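Home Controller
Guest rooms have no switches or remote controls; everything is operated from the Home Controller.
At any NOT A HOTEL anywhere in the world, you can operate every device without confusion, just as if you were at home.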
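Voice Control
Devices in the room (air conditioner, lighting, curtains, and so on) can be operated with simple voice commands.
You can adjust the room to a comfortable state without touching a remote control or a switch.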
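SpeechAnalyzer
• An API introduced in iOS 26 that analyzes audio and manages analysis sessions
• Adding modules that conform to the SpeechModule protocol lets it run specific kinds of analysis
• SpeechTranscriber
  • Speech to Text
• DictationTranscriber
  • Speech to Text for languages and devices that SpeechTranscriber does not support
• SpeechDetector
  • Voice activity detection (VAD)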
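The module arrangement above can be sketched roughly as follows: one SpeechTranscriber is handed to a SpeechAnalyzer, and the analyzer is pointed at an audio file. This is a minimal sketch, not the speaker's code. The function name `transcribeFile(at:locale:)` is hypothetical, the call shapes follow Apple's published SpeechAnalyzer sample as best recalled and should be checked against the current SDK, and the model assets for the locale are assumed to be installed already.

```swift
import AVFoundation
import Speech

// Hypothetical helper: transcribes one audio file with SpeechTranscriber.
// Assumes the speech model assets for `locale` are already installed.
@available(iOS 26.0, macOS 26.0, *)
func transcribeFile(at url: URL, locale: Locale) async throws -> String {
    // Module: performs the speech-to-text analysis.
    let transcriber = SpeechTranscriber(locale: locale,
                                        transcriptionOptions: [],
                                        reportingOptions: [],
                                        attributeOptions: [])

    // Read results concurrently and concatenate their text.
    // `result.text` is an AttributedString, so take its characters.
    async let transcript = try transcriber.results.reduce("") { partial, result in
        partial + String(result.text.characters)
    }

    // The analyzer owns the session and feeds audio to its modules.
    let analyzer = SpeechAnalyzer(modules: [transcriber])
    let file = try AVAudioFile(forReading: url)

    if let lastSample = try await analyzer.analyzeSequence(from: file) {
        // Flush pending results up to the last analyzed sample, then finish.
        try await analyzer.finalizeAndFinish(through: lastSample)
    } else {
        // No audio was analyzed; end the session immediately.
        await analyzer.cancelAndFinishNow()
    }

    return try await transcript
}
```

Swapping in DictationTranscriber for unsupported languages or devices would change only the module passed to `SpeechAnalyzer(modules:)`; that variant is not shown here.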
SFSpeechRecognizer
• Used on versions below iOS 26, where SpeechAnalyzer is not available
• Available on iOS 10.0 and later
• On-device Speech to Text is possible
• Japanese is supported
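One way to express the "below iOS 26, fall back to SFSpeechRecognizer" rule is an availability check. The sketch below is an illustration under assumptions: `TranscriptionBackend`, `preferredBackend()`, and `makeOnDeviceRequest(locale:)` are hypothetical names, and `ja-JP` is used only because the slides mention Japanese support.

```swift
import Speech

// Hypothetical enum naming the two backends discussed in the slides.
enum TranscriptionBackend {
    case speechTranscriber    // SpeechAnalyzer + SpeechTranscriber (iOS 26+)
    case sfSpeechRecognizer   // older fallback
}

// Picks the backend from the OS version at runtime.
func preferredBackend() -> TranscriptionBackend {
    if #available(iOS 26.0, macOS 26.0, *) {
        return .speechTranscriber
    }
    return .sfSpeechRecognizer
}

// Builds an on-device SFSpeechRecognizer request, or returns nil when the
// recognizer is missing, unavailable, or cannot run on device for the locale.
@available(iOS 13.0, macOS 10.15, *)
func makeOnDeviceRequest(
    locale: Locale = Locale(identifier: "ja-JP")
) -> SFSpeechAudioBufferRecognitionRequest? {
    guard let recognizer = SFSpeechRecognizer(locale: locale),
          recognizer.isAvailable,
          recognizer.supportsOnDeviceRecognition else {
        return nil
    }
    let request = SFSpeechAudioBufferRecognitionRequest()
    request.requiresOnDeviceRecognition = true   // keep audio on the device
    request.shouldReportPartialResults = true    // stream interim text
    return request
}
```

On-device recognition was added to SFSpeechRecognizer in iOS 13, which is why the request helper is gated separately from the iOS 10 baseline.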
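SpeechTranscriber
• Available on iOS 26.0 and later
• On-device Speech to Text is possible
• Handles long-form audio
• Handles speech from a position away from the microphone
• Does not require enabling Siri or keyboard dictation
• Japanese is supported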
Maintains a certain level of accuracy even at a distance from the microphone
(Comparison: SFSpeechRecognizer / SpeechTranscriber)
No more trouble around enabling dictation
• With SFSpeechRecognizer, dictation had to be enabled again each time the iPadOS version was upgraded
• If dictation was not enabled along with the OS version update, the following error occurs:
  • Error Domain=kLSRErrorDomain Code=201 "Siri and Dictation are disabled" UserInfo={NSLocalizedDescription=Siri and Dictation are disabled}
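For apps that still use SFSpeechRecognizer, the failure quoted above can be recognized by its domain and code so the user can be prompted to turn dictation back on. This is a minimal sketch; the helper name `isDictationDisabledError` is hypothetical, and the domain string and code are taken verbatim from the error message in the slide.

```swift
import Foundation

// Hypothetical helper: true when `error` matches the
// "Siri and Dictation are disabled" failure (kLSRErrorDomain, code 201).
func isDictationDisabledError(_ error: Error) -> Bool {
    let nsError = error as NSError
    return nsError.domain == "kLSRErrorDomain" && nsError.code == 201
}

// Example: rebuild the error from the slide and classify it.
let sample = NSError(
    domain: "kLSRErrorDomain",
    code: 201,
    userInfo: [NSLocalizedDescriptionKey: "Siri and Dictation are disabled"]
)
print(isDictationDisabledError(sample))  // prints "true"
```

A check like this would typically sit in the result handler of a recognition task, next to whatever UI asks the user to re-enable dictation.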
(Aside) SpeechDetector
• Does not yet conform to the SpeechModule protocol
• The Apple Developer Forums describe a way to make it conform to the SpeechModule protocol yourself
  • There's wrong with speech detector ios26
  • https://developer.apple.com/forums/thread/794439