Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
SpeechAnalyzerによるSpeech to Textの進化を探る
Search
Masashi
October 02, 2025
Programming
0
10
SpeechAnalyzerによるSpeech to Textの進化を探る
extension DC 2025 Day2 @Sansan
https://sansan.connpass.com/event/362403/
Masashi
October 02, 2025
Tweet
Share
More Decks by Masashi
See All by Masashi
Speech Frameworkを使った音声認識の基本
kawabe
0
77
Eight iOSを支えるアーキテクチャ
kawabe
1
620
これだけは伝えたい設計の技術
kawabe
0
1.3k
EightのUI Component化の取り組み
kawabe
0
140
Other Decks in Programming
See All in Programming
公共交通オープンデータ × モバイルUX 複雑な運行情報を 『直感』に変換する技術
tinykitten
PRO
0
180
Grafana:建立系統全知視角的捷徑
blueswen
0
260
GISエンジニアから見たLINKSデータ
nokonoko1203
0
190
re:Invent 2025 トレンドからみる製品開発への AI Agent 活用
yoskoh
0
560
Navigating Dependency Injection with Metro
l2hyunwoo
1
200
クラウドに依存しないS3を使った開発術
simesaba80
0
200
生成AIを利用するだけでなく、投資できる組織へ
pospome
2
430
ELYZA_Findy AI Engineering Summit登壇資料_AIコーディング時代に「ちゃんと」やること_toB LLMプロダクト開発舞台裏_20251216
elyza
2
860
開発に寄りそう自動テストの実現
goyoki
2
1.6k
AI Agent Tool のためのバックエンドアーキテクチャを考える #encraft
izumin5210
5
1.5k
C-Shared Buildで突破するAI Agent バックテストの壁
po3rin
0
420
Python札幌 LT資料
t3tra
7
1.1k
Featured
See All Featured
Building a Modern Day E-commerce SEO Strategy
aleyda
45
8.4k
コードの90%をAIが書く世界で何が待っているのか / What awaits us in a world where 90% of the code is written by AI
rkaga
58
41k
The Curse of the Amulet
leimatthew05
0
6.5k
Primal Persuasion: How to Engage the Brain for Learning That Lasts
tmiket
0
200
Dealing with People You Can't Stand - Big Design 2015
cassininazir
367
27k
The Invisible Side of Design
smashingmag
302
51k
Product Roadmaps are Hard
iamctodd
PRO
55
12k
Imperfection Machines: The Place of Print at Facebook
scottboms
269
13k
Building a A Zero-Code AI SEO Workflow
portentint
PRO
0
210
Noah Learner - AI + Me: how we built a GSC Bulk Export data pipeline
techseoconnect
PRO
0
76
Exploring the Power of Turbo Streams & Action Cable | RailsConf2023
kevinliebholz
37
6.2k
The Organizational Zoo: Understanding Human Behavior Agility Through Metaphoric Constructive Conversations (based on the works of Arthur Shelley, Ph.D)
kimpetersen
PRO
0
210
Transcript
SpeechAnalyzer ʹΑΔ Speech to Text ͷਐԽΛ୳Δ
Տล խ࢙ɹMasashi Kawabe NOT A HOTEL גࣜձࣾ Smart Home νʔϜ
ιϑτΣΞΤϯδχΞ
ࣨʹεΠονϦϞίϯͳ͘ɺͯ͢ Home Controller͔Βૢ࡞ɻ ੈքதɺͲ͜ͷNOT A HOTELʹߦͬͯɺ ·ΔͰࣗͷΑ͏ʹ໎͏͜ͱͳͯ͘͢ͷػ ثͷૢ࡞͕Ͱ͖·͢ɻ Home Controller
͓෦ʹ͋ΔػثʢΤΞίϯɺর໌ɺ ΧʔςϯͳͲʣΛɺ؆୯ͳԻίϚϯυͰૢ ࡞Ͱ͖·͢ɻ ϦϞίϯεΠονʹ৮ΕΔ͜ͱͳ͘ɺ ͓෦ΛշదʹௐͰ͖·͢ɻ Իίϯτϩʔϧ
SpeechAnalyzer • iOS 26 Ͱಋೖ͞ΕͨɺԻͷੳ, ੳηογϣϯΛཧ͢ΔAPI • SpeechModule Protocol ʹ४ڌ͍ͯ͠ΔϞδϡʔϧΛՃ͢Δ͜ͱͰɺ
ಛఆͷछྨͷੳΛ࣮ߦͰ͖Δ • SpeechTranscriber • Speech to Text • DictationTranscriber • SpeechTranscriber ͕ରԠ͍ͯ͠ͳ͍ݴޠ σόΠεͰͷ Speech to Text • SpeechDetector • Ի۠ؒݕग़ ( VAD )
SFSpeechRecognizer • SpeechAnalyzer Λར༻Ͱ͖ͳ͍ iOS 26 ະຬͰɺͪ͜ΒΛར༻ • iOS 10.0
Ҏ߱Ͱར༻Մೳ • ΦϯσόΠεͰͷ Speech to Text ͕Մೳ • ຊޠαϙʔτ
SpeechTranscriber • iOS 26.0 Ҏ߱Ͱར༻Մೳ • ΦϯσόΠεͰͷ Speech to Text
͕Մೳ • ࣌ؒͷԻʹରԠ • ϚΠΫ͔ΒΕͨҐஔͰͷԻʹରԠ • Siri ΩʔϘʔυͷԻೖྗͷ༗ޮԽ͕ෆཁ • ຊޠαϙʔτ
SpeechTranscriber • iOS 26.0 Ҏ߱Ͱར༻Մೳ • ΦϯσόΠεͰͷ Speech to Text
͕Մೳ • ࣌ؒͷԻʹରԠ • ϚΠΫ͔ΒΕͨҐஔͰͷԻʹରԠ • Siri ΩʔϘʔυͷԻೖྗͷ༗ޮԽ͕ෆཁ • ຊޠαϙʔτ
ϚΠΫ͔ΒΕͨҐஔͰҰఆͷਫ਼Λ୲อ SFSpeechRecognizer SpeechTranscriber
SpeechTranscriber • iOS 26.0 Ҏ߱Ͱར༻Մೳ • ΦϯσόΠεͰͷ Speech to Text
͕Մೳ • ࣌ؒͷԻʹରԠ • ϚΠΫ͔ΒΕͨҐஔͰͷԻʹରԠ • Siri ΩʔϘʔυͷԻೖྗͷ༗ޮԽ͕ෆཁ • ຊޠαϙʔτ
Իೖྗͷ༗ޮԽपΓͷۤ࿑͕ͳ͘ͳΔ • SFSpeechRecognizer ͰɺiPadOS ͷόʔδϣϯΛ্͛Δʹ Իೖྗͷ༗ޮԽ͕ඞཁͩͬͨ • OSͷόʔδϣϯΞοϓσʔτͱ߹ΘͤͯԻೖྗͷ༗ޮԽΛ ͠ͳ͔ͬͨ߹ɺҎԼͷΤϥʔʹͳΔ •
Error Domain=kLSRErrorDomain Code=201 "Siri and Dictation are disabled" UserInfo={NSLocalizedDescription=Siri and Dictation are disabled}
( ༨ஊ ) SpeechDetector • SpeechModule Protocol ʹ·ͩ४ڌ͍ͯ͠ͳ͍ • Apple
Developer Forum ͰɺࣗͰ SpeechModule Protocol ʹ४ڌͤ͞Δํ๏͕հ͞Ε͍ͯΔ • There's wrong with speech detector ios26 • https://developer.apple.com/forums/thread/794439