Upgrade to Pro — share decks privately, control downloads, hide ads and more …

An attempt to reproduce WaveNet-based text-to-speech synthesis

An attempt to reproduce WaveNet-based text-to-speech synthesis

Ryuichi Yamamoto

June 15, 2018
Tweet

More Decks by Ryuichi Yamamoto

Other Decks in Technology

Transcript

  1. ➭䩛岀הך嫰鯰 (1/2) 7 Text: Scientists at the CERN laboratory say

    they have discovered a new particle Deep Voice3 [Ping ‘18]. (trained on LJSpeech, w/o WN) Tacotron 2 [Shen; ‘18]. (trained on LJSpeech , w/ WN) Tacotron 2 [Shen; ’18]. (trained on proprietary corpus , w/ WN)
  2. ➭䩛岀הך嫰鯰 (2/2) 8 Text: Generative adversarial network or variational auto-encoder.

    Deep Voice3 [Ping ‘18]. (trained on LJSpeech, w/o WN) Tacotron 2 [Shen; ‘18]. (trained on LJSpeech , w/ WN) Tacotron 2 [Shen; ’18]. (trained on proprietary corpus , w/ WN)
  3. References • <%FFQ.JOEˎ>8BWF/FU"(FOFSBUJWF.PEFMGPS3BX"VEJP IUUQTEFFQNJOEDPNCMPHXBWFOFUHFOFSBUJWF NPEFMSBXBVEJP • <WBOEFO0PSEˏD>"BSPOWBOEFO0PSE 4BOEFS%JFMFNBO )FJHB ;FO

    FUBM 8BWF/FU"(FOFSBUJWF.PEFMGPS 3BX"VEJP BS9JW 4FQ • <1JOHˎ>8FJ1JOH ,BJOBO 1FOH "OESFX(JCJBOTLZ FUBM ˑ%FFQ7PJDF4DBMJOH5FYUUP4QFFDIXJUI $POWPMVUJPOBM4FRVFODF-FBSOJOH˒ 1SPDPG*$-3  • <4IFOˏ>+POBUIBO4IFO 3VPNJOH 1BOH 3PO+8FJTT FUBM /BUVSBM5544ZOUIFTJTCZ$POEJUJPOJOH8BWF/FU PO.FM4QFDUSPHSBN1SFEJDUJPOT 1SPDPG*$"441  16