Upgrade to Pro — share decks privately, control downloads, hide ads and more …

wavenet

Sponsored · Your Podcast. Everywhere. Effortlessly. Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
Avatar for soymsk soymsk
April 27, 2017

 wavenet

Avatar for soymsk

soymsk

April 27, 2017
Tweet

More Decks by soymsk

Other Decks in Technology

Transcript

  1. Wavenet • 2016೥ʹDeepMind͕ൃදͨ͠Ի੠߹੒ΞϧΰϦζϜ • Text to Speech(TTS)ͷ෼໺Ͱߴ͍Ի੠߹੒ͷਫ਼౓Λୡ੒͠ ͨɻ • ࣮૷͕ެ։͞Ε͓ͯΒͣɺ·ͨ਺ࣜ΋গͳ͘ɺ࣮ࡍʹͲͷΑ

    ͏ʹͳ͍ͬͯΔ͔ෆ໌ͳॴ΋ଟ͍ • Concatenate Text to Speech • parametric TTS parametric TTS • PixelRNN • PixelCNN 8BWFOFU +
  2. ैདྷͷख๏ • Concatenate Text to Speech • ୹͍Ի੠σʔλΛେྔʹσʔλϕʔεʹ֨ೲ͠ɺͦΕΛͭͳ͗߹ΘͤΔख๏ • طଘͷσʔλΛͭͳ͗߹ΘͤΔ͚ͩͳͷͰɺڧௐɾ੠৭มߋͳͲ͕ۤखɻ·

    ͨɺ߹੒ޙͷԻ੠ͷͭͳ͕Γ΋ෆࣗવʹͳΓ͕ͪ • parametric TTS • ੜ੒ϞσϧʹΑͬͯԻ੠߹੒͢Δख๏ • ൃ࿩಺༰΍ൃ࿩ऀͷಛ௃ΛϞσϧͷೖྗͱͯ͠ίϯτϩʔϧͤ͞Δ͜ͱ͕Ͱ ͖ΔΑ͏ʹͳͬͨɻ • ͨͩ͠ɺࣗવͳൃ࿩ɺͱ͸ݴ͍೉͍
  3. Dilated causal convolution • 44100ͷೖྗ΋16૚ͷDilated causal convolution ͰΈΔ͜ͱ͕Մೳ • WavenetͰ͸ɺ࠷େDilation=512·Ͱ૚ΛॏͶ(

    1- block )ɺblockΛෳ਺ੵΈॏͶΔߏ଄Λऔ͍ͬͯ Δɻ • ૚Λਂͯ͘͠΋ֶशͰ͖ΔΑ͏ʹResidualNetΛར ༻
  4. Conditional Wavenet • Conditional Pixel CNN ͱಉ༷ɺWavenetʹ೚ҙͷύϥϝʔλhಋೖ͢Δ ͜ͱͰɺWavenetΛύϥϝʔλͰૢ࡞ • Global

    conditions: Wavenetʹൃ࿩ऀͷಛ௃Λֶशͤ͞Δ ύϥϝʔλhʹΑͬͯൃ࿩શମͷதͰͷൃ࿩ऀͷಛ௃Λ࠶ݱͰ͖Δ ex: ฼ࠃޠ͕ҟͳΔൃ࿩ऀͷಛ௃ શͯͷ࣌ؒεςοϓͰ࡞༻͢Δ߲
  5. ࢀߟ • https://arxiv.org/abs/1609.03499 • ݪஶPDF • https://deepmind.com/blog/wavenet-generative-model-raw-audio/ • σϞ݁ՌͳͲ •

    http://musyoku.github.io/2016/09/18/wavenet-a-generative-model-for-raw- audio/ • Chainer࣮૷΍Dilationͷ෦෼͕Θ͔Γ΍͍͢ • https://www.slideshare.net/DeepLearningJP2016/dlwavenet-a-generative- model-for-raw-audio