LINEヤフーTech (LY Corporation Tech)
July 01, 2025
Technology
LY Corporation's Speech AI Vision: Towards Realtime Spoken Dialogue through Advanced ASR and TTS
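This talk introduces application examples of LY Corporation's speech recognition and speech synthesis technologies, along with the company's own work on LLM-based realtime spoken dialogue technology, which has attracted attention in recent years.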
Transcript
LY Corporation's Speech AI Vision: Towards Realtime Speech-to-Speech through Advanced ASR and TTS
Speech and Acoustic AI Dept., Data Science Group
Jumpei Miyake, Taiki Kinoshita

Agenda
• LY Corporation's Speech AI
• LY Corporation's ASR/TTS: introduction to LY Corporation's speech recognition and speech synthesis
• Realtime Speech-to-Speech: our work on developing realtime speech-to-speech technology
• Future Works

LY Corporation's Speech AI
• Technologies: Speech Recognition, Speech Generation, Video and Audio Content Analysis
• Applications: Video/Audio Contents, Call Center, Meeting, Voice User Interface, Video/Audio Content and Call Analysis
Photo materials provided by Aflo

Introduction of LY Corporation's ASR/TTS
• ASR: Automatic Speech Recognition
• TTS: Text-To-Speech

YJVOICE: Streaming ASR
Features:
• High accuracy for web search and the LY Corporation domain
• Resolves homonyms and customizes easily
• Provides a Web API and on-device modules
Efficient domain adaptation:
• Efficiently adapts to target domains
• A strategy based on compact models without external language models
• Domain adaptation without target audio data
• Diagram labels: paired speech-text data, unpaired text data, Base Model, Adaptation Model, Speech, Text
Phrase boosting with dynamic user dictionaries:
• Boosts phrases with user dictionaries
• Example prompt: "Would you like to start the navigation via this route?"
• Service-specific dictionary: "Yes", "No", "Prioritize expressways", "Prioritize general roads"
• × Yeast → ○ Yes
• × Know → ○ No
• × Prioritize general loads → ○ Prioritize general roads
Resolves homonyms (same written form, different readings):
• For example, 日本橋 (ニホンバシ, Nihonbashi) is a location in Tokyo, while 日本橋 (ニッポンバシ, Nipponbashi) is a location in Osaka
• The end-to-end ASR jointly predicts both the surface form and the reading, e.g. 日本橋 / ニホンバシ, まで / マデ, の / ノ, …

Achoris: Expressive TTS
Features:
• Control emotion intensity with expression styles
• … preset speaker options with human-like quality
• Provides a Web API, on-device modules, and web editing tools
Achoris Editor: text-to-speech editing tool

Achoris: Expressive text-to-speech
Control over speaker, emotion, and intensity

Examples of Application: speech recognition in services
• Yahoo! JAPAN App's Voice Search (iOS/Android)
• Voice Search is implemented in most Yahoo! JAPAN services, including Maps, Transit, and Shopping
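
The user-dictionary phrase boosting described for YJVOICE can be illustrated with a small n-best rescoring sketch. This is not YJVOICE's actual method, which the slides do not describe; it only assumes that the recognizer returns several hypotheses with log scores and that a hypothesis containing a dictionary phrase receives a fixed bonus. The function names, the `bonus` value, and the scores are all hypothetical.

```python
def contains_phrase(text, phrase):
    """True if the words of `phrase` appear contiguously in `text` (case-insensitive)."""
    words = text.lower().split()
    target = phrase.lower().split()
    n = len(target)
    return any(words[i:i + n] == target for i in range(len(words) - n + 1))


def boost_with_dictionary(nbest, phrases, bonus=2.0):
    """Pick the best hypothesis after adding `bonus` per matched dictionary phrase.

    nbest:   list of (text, log_score) pairs from a recognizer (hypothetical format)
    phrases: service-specific dictionary entries
    """
    def rescored(item):
        text, score = item
        hits = sum(1 for phrase in phrases if contains_phrase(text, phrase))
        return score + bonus * hits

    return max(nbest, key=rescored)[0]


# Dictionary taken from the navigation example on the slide.
service_dictionary = ["Yes", "No", "Prioritize expressways", "Prioritize general roads"]

# Hypothetical n-best lists: the acoustically best guess is wrong in each case.
print(boost_with_dictionary([("Yeast", -1.0), ("Yes", -1.4)], service_dictionary))
print(boost_with_dictionary([("Know", -0.8), ("No", -1.2)], service_dictionary))
print(boost_with_dictionary(
    [("Prioritize general loads", -2.0), ("Prioritize general roads", -2.1)],
    service_dictionary,
))
```

Matching on whole words rather than substrings matters here: a plain substring test would find "no" inside "Know" and boost the wrong hypothesis.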

Examples of Application: speech synthesis in services
• Navigation voice in the Yahoo! JAPAN Car Navigation App
• On-device neural text-to-speech engine called "Achoris Lite"
• Generates a …-second audio waveform in … seconds (Real-Time Factor (RTF) is … on iPhone …)
• To minimize app size, we've implemented various optimizations in both the inference libraries and the model size
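
The Real-Time Factor quoted for Achoris Lite is conventionally the synthesis time divided by the duration of the generated audio, so values below 1 mean faster than real time. The sketch below shows that calculation. The slide's own figures are not legible in the transcript, so the numbers here are made up, and `dummy_synthesize` is a hypothetical stand-in for a real TTS engine.

```python
import time


def real_time_factor(synthesis_seconds, audio_seconds):
    """RTF = time spent synthesizing / duration of the audio produced."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


def measure_rtf(synthesize, text, sample_rate):
    """Time one call to `synthesize` and relate it to the length of its output."""
    start = time.perf_counter()
    samples = synthesize(text)
    elapsed = time.perf_counter() - start
    return real_time_factor(elapsed, len(samples) / sample_rate)


def dummy_synthesize(text):
    """Hypothetical engine: returns 0.1 s of silence per character at 16 kHz."""
    return [0.0] * (1600 * len(text))


# Illustrative numbers only: 0.5 s of compute for 5 s of audio.
print(real_time_factor(0.5, 5.0))  # 0.1

rtf = measure_rtf(dummy_synthesize, "turn left ahead", sample_rate=16000)
print(f"measured RTF: {rtf:.6f}")
```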
"MBCBQQGPSSFBMUJNFTQPLFOEJBMPHVF CBTFEPOB--. VOEFSEFWFMPQJOH IUUQTXXXMZDPSQDPKQKBUFDIOPMPHZEFTJHOMBCT &YBNQMFTPG"QQMJDBUJPO ϦΞϧλΠϜԻରͷ࣮ݧΞϓϦ ։ൃத

Realtime Speech-to-Speech
Our work on developing realtime speech-to-speech technology

Realtime Speech-to-Speech Trend
• OpenAI: ChatGPT Advanced Voice Mode (https://openai.com/ja-JP/chatgpt/overview/)
• Google: Gemini Live (https://gemini.google/overview/gemini-live/)
• Kyutai: Moshi (https://moshi.chat)
• Nagoya Univ.: J-Moshi (https://nu-dialogue.github.io/j-moshi/)

Speech-to-Speech Architecture
• Inputs: Prompt, Speech
• Speech Encoder
• Audio Adapter: modality alignment between text and audio
• Large Language Model
• Text-Guided Speech Generation: low-latency speech generation using audio tokens or a streaming TTS module
• Single-Stream: user's speech only
• Multi-Stream: user's speech + LLM-generated speech
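
The input side of this architecture (speech encoder, audio adapter, LLM) can be sketched in a few lines. The slide does not say how the adapter is built, so the sketch assumes one common design: stack a few consecutive encoder frames to shorten the sequence, then apply a linear projection into the LLM's embedding size. All dimensions, weights, and function names are hypothetical, and plain Python lists stand in for tensors.

```python
import random


def stack_frames(frames, k):
    """Concatenate every k consecutive frames; an incomplete tail is dropped."""
    usable = len(frames) - len(frames) % k
    return [sum(frames[i:i + k], []) for i in range(0, usable, k)]


def linear(vector, weight):
    """Multiply an (out_dim x in_dim) weight matrix by a vector."""
    return [sum(w * v for w, v in zip(row, vector)) for row in weight]


def audio_adapter(frames, weight, k):
    """Downsample encoder output by k and project it into the LLM embedding space."""
    return [linear(stacked, weight) for stacked in stack_frames(frames, k)]


random.seed(0)
ENCODER_DIM, LLM_DIM, STACK = 4, 3, 2  # toy sizes, not those of any real model

# Stand-in for speech-encoder output: 5 frames of ENCODER_DIM features each.
encoder_frames = [[random.random() for _ in range(ENCODER_DIM)] for _ in range(5)]
# Stand-in for the adapter weights (random here, learned in practice).
weight = [[random.uniform(-1, 1) for _ in range(ENCODER_DIM * STACK)]
          for _ in range(LLM_DIM)]
# Stand-in for the embedded text prompt: 3 tokens.
prompt_embeddings = [[0.0] * LLM_DIM for _ in range(3)]

audio_embeddings = audio_adapter(encoder_frames, weight, STACK)
llm_inputs = prompt_embeddings + audio_embeddings  # sequence handed to the LLM
print(len(audio_embeddings), len(llm_inputs))  # 2 5
```

Because the adapter output has the same width as the prompt embeddings, the LLM can consume text and audio as a single sequence, which is what "modality alignment" refers to in this sketch.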

Pros of Integrating Speech Encoder with LLMs
• Leverage LLM capabilities: makes use of the LLM's advanced foundational abilities
• Prompt-driven flexibility: high customizability through prompts
• Bypass ASR errors: avoids the impact of speech recognition errors

Evaluation of Task Performance (Speech LLM)
Tasks and metrics:
• JSQuAD (question answering): char_f1
• ALT (translation from Japanese to English): BERTScore
Input conditions:
• Ground-truth text (text-to-text)
• Transcribed text (text-to-text)
• Speech (speech-to-text)
Setup:
• LLM: gemma-…-…b-it
• ASR model: whisper-small
• Speech Encoder: whisper-small
• Training toolkit: SLAM-LLM (https://github.com/X-LANCE/SLAM-LLM)
• Evaluation toolkit: llm-jp-eval (https://github.com/llm-jp/llm-jp-eval)
Results:
• Better performance on the QA task
• Comparable performance on the translation task
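
To give a feel for a character-level F1 score such as the one reported for JSQuAD, here is a minimal sketch that treats each answer as a bag of characters and computes the harmonic mean of precision and recall. This is an illustrative definition only; llm-jp-eval's own `char_f1` implementation may compute the score differently, and the example strings are made up.

```python
from collections import Counter


def char_overlap_f1(prediction, reference):
    """Bag-of-characters F1 between two strings, ignoring whitespace."""
    pred = [c for c in prediction if not c.isspace()]
    ref = [c for c in reference if not c.isspace()]
    if not pred or not ref:
        # Both empty counts as a match; one empty counts as a miss.
        return float(pred == ref)
    overlap = sum((Counter(pred) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


print(char_overlap_f1("大阪", "大阪"))              # exact match
print(round(char_overlap_f1("東京都", "東京"), 3))  # one extra character
print(char_overlap_f1("東京", "大阪"))              # no shared characters
```

Character-level scoring is a natural fit for Japanese, where answers are not whitespace-tokenized and a word-level F1 would need a tokenizer.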

Evaluation of Inference Speed
[Figure: generated characters per second (axis 0 to 60) for vllm and slam-llm (transformers); a further axis is labeled Number of Tokens]
• Faster: …x faster
• Generates "How can I help you today?" in … seconds
Reference: vLLM: Kwon, Woosuk, et al. "Efficient memory management for large language model serving with PagedAttention." Proceedings of the 29th Symposium on Operating Systems Principles. 2023.

Future Works
We are developing:
• Realtime Speech-to-Speech integration with LLM
• Multilingual Speech-To-Text/Text-To-Speech
Use cases:
• Voice Control in a Car
• Human-like and Natural Conversational Search
• Spoken Dialogue via Call
[Diagram labels: AI Agent, OR, Search, Weather, Podcast]