Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
LINEヤフーの音声AIがもたらす未来:ASR/TTSと対話技術の新たな可能性 / LY Co...
Search
LINEヤフーTech (LY Corporation Tech)
PRO
July 01, 2025
Technology
0
220
LINEヤフーの音声AIがもたらす未来:ASR/TTSと対話技術の新たな可能性 / LY Corporation's Speech AI Vision: Towards Realtime Spoken Dialogue through Advanced ASR and TTS
LINEヤフーの音声認識と音声合成技術を活用した応用事例と、近年注目されているLLM基盤のリアルタイム音声対話技術の自社の取り組みについて紹介します。
LINEヤフーTech (LY Corporation Tech)
PRO
July 01, 2025
Tweet
Share
More Decks by LINEヤフーTech (LY Corporation Tech)
See All by LINEヤフーTech (LY Corporation Tech)
Java Virtual Threads, Kotlin Coroutines, Go Goroutinesの比較
lycorptech_jp
PRO
0
26
マイクロサービスアーキテクチャのトレードオフとコンポーネント増加について〜Yahoo!ニュース〜
lycorptech_jp
PRO
0
24
AIプラットフォームにおけるMLflowの利用について
lycorptech_jp
PRO
2
250
MLflowダイエット大作戦
lycorptech_jp
PRO
1
210
4%ルールとN1思考──不確実性に対抗するディスカバリー検証
lycorptech_jp
PRO
1
160
初めてのOSS貢献の雑ガイド
lycorptech_jp
PRO
1
48
LINEスタンプ開発の日常
lycorptech_jp
PRO
1
670
LINEスタンプサーバーサイド
lycorptech_jp
PRO
0
670
Yahoo!ファイナンスにおける生成AIを活用した新機能紹介
lycorptech_jp
PRO
0
720
Other Decks in Technology
See All in Technology
Web Intelligence and Visual Media Analytics
weblyzard
PRO
1
6.8k
Data Intelligence on Lakehouse Paradigm
scotthsieh825
0
170
Introduction to Sansan for Engineers / エンジニア向け会社紹介
sansan33
PRO
6
63k
[PR] はじめてのデジタルアイデンティティという本を書きました
ritou
1
820
「リリースファースト」の実感を届けるには 〜停滞するチームに変化を起こすアプローチ〜 #RSGT2026
kintotechdev
0
1.1k
GitHub Copilot CLI 現状確認会議
torumakabe
8
2.4k
SwiftDataを覗き見る
akidon0000
0
280
Bill One 開発エンジニア 紹介資料
sansan33
PRO
4
17k
Kaggleコンペティション「MABe Challenge - Social Action Recognition in Mice」振り返り
yu4u
1
570
AI Agent Agentic Workflow の可観測性 / Observability of AI Agent Agentic Workflow
yuzujoe
4
2.1k
Databricks Free Editionで始めるLakeflow SDP
taka_aki
0
140
Introduction to Sansan, inc / Sansan Global Development Center, Inc.
sansan33
PRO
0
2.9k
Featured
See All Featured
Practical Tips for Bootstrapping Information Extraction Pipelines
honnibal
25
1.7k
[SF Ruby Conf 2025] Rails X
palkan
0
710
Helping Users Find Their Own Way: Creating Modern Search Experiences
danielanewman
31
3.1k
Heart Work Chapter 1 - Part 1
lfama
PRO
5
35k
WCS-LA-2024
lcolladotor
0
420
Why You Should Never Use an ORM
jnunemaker
PRO
61
9.7k
Tell your own story through comics
letsgokoyo
1
790
The Spectacular Lies of Maps
axbom
PRO
1
440
Self-Hosted WebAssembly Runtime for Runtime-Neutral Checkpoint/Restore in Edge–Cloud Continuum
chikuwait
0
290
How to Align SEO within the Product Triangle To Get Buy-In & Support - #RIMC
aleyda
1
1.4k
Measuring & Analyzing Core Web Vitals
bluesmoon
9
730
The Impact of AI in SEO - AI Overviews June 2024 Edition
aleyda
5
700
Transcript
-:$PSQPSBUJPOT4QFFDI"*7JTJPO 5PXBSET3FBMUJNF4QFFDIUP4QFFDI UISPVHI"EWBODFE"43BOE554 4QFFDIBOE"DPVTUJD"*%FQU %BUB4DJFODF(SPVQ +VNQFJ .JZBLF 5BJLJ,JOPTIJUB -*/&ϠϑʔͷԻ"*͕ͨΒ͢ະདྷɿ"43554ͱରٕज़ͷ৽ͨͳՄೳੑ
"HFOEB -:$PSQPSBUJPO`T4QFFDI"* -*/&ϠϑʔͷԻ"*ʹ͍ͭͯ -:$PSQPSBUJPO`T"43554 -*/&ϠϑʔͷԻೝࣝɾԻ߹ͷհ 3FBMUJNF4QFFDIUP4QFFDI ϦΞϧλΠϜ4QFFDIUP4QFFDIٕज़։ൃͷऔΓΈʹ͍ͭͯ
'VUVSF8PSLT ࠓޙͷల
-:$PSQPSBUJPO`T4QFFDI"* -*/&ϠϑʔͷԻ"*ʹ͍ͭͯ 7JEFPBOE"VEJP $POUFOU"OBMZTJT 4QFFDI 3FDPHOJUJPO 4QFFDI (FOFSBUJPO 7JEFP"VEJP$POUFOUT $BMM$FOUFS
.FFUJOH 7PJDF6TFS*OUFSGBDF 7JEFP"VEJP$POUFOUBOE$BMM"OBMZTJT ࣸਅૉࡐఏڙΞϑϩ
-*/&ϠϑʔͷԻೝࣝͱԻ߹ͷհ *OUSPEVDUJPOPG -:$PSQPSBUJPO`T"43554 "43"VUPNBUJD4QFFDI3FDPHOJUJPO 5545FYU5P4QFFDI
:+70*$&4USFBNJOH"43 :+70*$&ετϦʔϛϯάԻೝࣝ &GGJDJFOUMZBEBQUTUPUBSHFUEPNBJOT • "TUSBUFHZCBTFEPODPNQBDUNPEFMTXJUIPVU FYUFSOBMMBOHVBHFNPEFMT • %PNBJOBEBQUBUJPOXJUIPVUUBSHFUBVEJPEBUB 1BJSFETQFFDIUFYUEBUB 6OQBJSFEUFYUEBUB
#BTF.PEFM "EBQUBUJPO .PEFM 4QFFDI 5FYU 5FYU #PPTUTQISBTFXJUIVTFSEJDUJPOBSJFT 4QFFDI 3FDPHOJUJPO 8PVMEZPV MJLFUPTUBSU UIFOBWJHBUJPO WJBUIJTSPVUF 4FSWJDF4QFDJGJD %JDUJPOBSZ :FT /P 1SJPSJUJ[FFYQSFTTXBZT 1SJPSJUJ[FHFOFSBMSPBET ʷ :FBTU ˠ ˓ :FT ʷ ,OPX ˠ ˓ /P ʷ 1SJPSJUJ[FHFOFSBMMPBET ˣ ˓ 1SJPSJUJ[FHFOFSBM SPBET 3FTPMWFTIPNPOZNT ຊ ڮ χ ϗ ϯ ό γ · Ͱ Ϛ σ ͷ ϊʜ 4VSGBDF 3FBE 4VSGBDF 3FBE &OEUP&OE "43 4QFFDI • JF ɾຊڮ χϗϯόγ JTBMPDBUJPOJO5PLZP ɾຊڮ χοϙϯόγ JTBMPDBUJPOJO0TBLB • +PJOUQSFEJDUJPOPGCPUITVSGBDFBOESFBEJOH ಉදهҟԻޠ ޮతͳυϝΠϯదԠ ಈతϢʔβࣙॻʹΑΔϑϨʔζೝࣝڧԽ ˞"CPVU'FBUVSF 'FBUVSF)JHIBDDVSBDZGPSXFCTFBSDIBOE-:$PSQPSBUJPOEPNBJO 'FBUVSF 3FTPMWFTIPNPOZNTBOEDVTUPNJ[FTFBTJMZ 'FBUVSF1SPWJEFT8FC"1*BOEPOEFWJDFNPEVMFT
"DIPSJT&YQSFTTJWF554 "DIPSJT දݱྗ͕๛͔ͳԻ߹ 'FBUVSF$POUSPMFNPUJPOJOUFOTJUZXJUIFYQSFTTJPOTUZMFT 'FBUVSF QSFTFUTQFBLFSPQUJPOTXJUIIVNBOMJLFRVBMJUZ 'FBUVSF1SPWJEFT8FC"1* POEFWJDFNPEVMFTBOEFEJUJOHXFCUPPMT "DIPSJT &EJUPS5FYUUPTQFFDIFEJUJOHUPPM
"DIPSJT &YQSFTTJWFUFYUUPTQFFDI $POUSPM0WFS4QFBLFS &NPUJPO BOE*OUFOTJUZ
ԻೝࣝͷαʔϏε׆༻ࣄྫ • :BIPP +"1"/"QQ`T7PJDF4FBSDI • 7PJDF4FBSDIJTJNQMFNFOUFEJONPTU:BIPP+"1"/4FSWJDFT JFTFSWJDFTJODMVEJOH.BQT 5SBOTJU BOETIPQQJOH :BIPP+"1"/"QQ
J04"OESPJE &YBNQMFTPG"QQMJDBUJPO
Ի߹ͷαʔϏε׆༻ࣄྫ • /BWJHBUJPOWPJDFJO:BIPP+"1"/$BS/BWJHBUJPO"QQ • 0O%FWJDF/FVSBM5FYU5P4QFFDI&OHJOF DBMMFEl"DIPSJT -JUFz • (FOFSBUFBTFDPOEBVEJPXBWFGPSNJO
TFDPOET ˎ 3FBM5JNF'BDUPS 35' JTJOJ1IPOF • 5PNJOJNJ[FBQQTJ[F XFWFJNQMFNFOUFE WBSJPVTPQUJNJ[BUJPOTJOCPUIJOGFSFODFMJCSBSJFTBOENPEFMTJ[F :BIPP+"1"/$BS/BWJHBUJPO"QQ &YBNQMFTPG"QQMJDBUJPO
"MBCBQQGPSSFBMUJNFTQPLFOEJBMPHVF CBTFEPOB--. VOEFSEFWFMPQJOH IUUQTXXXMZDPSQDPKQKBUFDIOPMPHZEFTJHOMBCT &YBNQMFTPG"QQMJDBUJPO ϦΞϧλΠϜԻରͷ࣮ݧΞϓϦ ։ൃத
ϦΞϧλΠϜ4QFFDIUP4QFFDIٕज़։ൃͷऔΓΈ 3FBMUJNF4QFFDIUP4QFFDI
3FBMUJNF4QFFDIUP4QFFDI5SFOE ϦΞϧλΠϜ4QFFDIUP4QFFDIͷٕज़ಈ IUUQTPQFOBJDPNKB+1DIBUHQUPWFSWJFX IUUQTHFNJOJHPPHMFPWFSWJFXHFNJOJMJWF IUUQTNPTIJDIBU IUUQTOVEJBMPHVFHJUIVCJPKNPTIJ
0QFO"* $IBU(15 "EWBODFE7PJDF.PEF ,ZVUBJ .PTIJ /BHPZBVOJW +.PTIJ (PPHMF (FNJOJ-JWF
4QFFDIUP4QFFDI"SDIJUFDUVSF 4QFFDIUP4QFFDIͷϞσϧߏ -BSHF-BOHVBHF.PEFM 5FYU(VJEFE4QFFDI (FOFSBUJPO Low-latency speech generation using audio
tokens or a streaming TTS module "VEJP"EBQUFS 4QFFDI &ODPEFS .PEBMJUZBMJHONFOU CFUXFFOUFYUBOEBVEJP 1SPNQU 4QFFDI 4JOHMF4USFBN 6TFSTTQFFDIPOMZ .VMUJ4USFBN 6TFSTTQFFDI --.HFOFSBUFETQFFDI
1SPTPG*OUFHSBUJOH4QFFDI&ODPEFS XJUI--.T Իͱ--.Tͷ౷߹ʹΑΔར -FWFSBHF--. $BQBCJMJUJFT 1SPNQU%SJWFO 'MFYJCJMJUZ #ZQBTT "43&SSPST --.ͷߴͳج൫ೳྗͷ׆༻
ϓϩϯϓτʹΑΔߴ͍ΧελϚΠζੑ ԻೝࣝޡΓͷӨڹΛճආ
&WBMVBUJPOPG5BTL1FSGPSNBODF 4QFFDI--.ͷλεΫੑೳͷධՁ JOQVU +42V"% 2VFTUJPO"OTXFS DIBS@G "-5 5SBOTMBUJPOGSPNKQ UPFO
#FSU4DPSF (SPVOEUSVUIUFYU UFYUUPUFYU 5SBOTDSJCFEUFYU UFYUUPUFYU 4QFFDI TQFFDIUPUFYU --.HFNNBCJU "43NPEFMXIJTQFSTNBMM 4QFFDI&ODPEFSXIJTQFSTNBMM 5SBJOJOH5PPMLJU4-".--. &WBMVBUJPO5PPMLJUMMNKQFWBM 4-".--.IUUQTHJUIVCDPN9-"/$&4-".--. MMNKQFWBMIUUQTHJUIVCDPNMMNKQMMNKQFWBM $PNQBSBCMF QFSGPSNBODFPO USBOTMBUJPO UBTL #FUUFS QFSGPSNBODFPO 2"UBTL
&WBMVBUJPOPG*OGFSFODF4QFFE ਪͷධՁ 0 10 20 30 40 50
60 vllm slam-llm (transformers) Generated Characters per Second W--.,XPO 8PPTVL FUBM&GGJDJFOUNFNPSZNBOBHFNFOUGPSMBSHFMBOHVBHFNPEFMTFSWJOHXJUIQBHFEBUUFOUJPO1SPDFFEJOHTPGUIFUI4ZNQPTJVNPO0QFSBUJOH4ZTUFNT1SJODJQMFT 'BTUFS YGBTUFS (FOFSBUFT)PXDBO*IFMQZPVUPEBZ JOTFDPOET /VNCFSPG5PLFOT
'VUVSF8PSLT ࠓޙͷల 8FBSFEFWFMPQJOH • 3FBMUJNF4QFFDIUP4QFFDI JOUFHSBUJPOXJUI--. • .VMUJMJOHVBM4QFFDI5P5FYU5FYU5P4QFFDI 7PJDF$POUSPMJOB$BS )VNBOMJLFBOE/BUVSBM
$POWFSTBUJPOBM4FBSDI 4QPLFO%JBMPHVFWJB$BMM "*"HFOU 03 4FBSDI 8FBUIFS 1PEDBTU "*"HFOU "*"HFOU "*"HFOU
EOP