Upgrade to Pro — share decks privately, control downloads, hide ads and more …

wavenet

soymsk
April 27, 2017

 wavenet

soymsk

April 27, 2017
Tweet

More Decks by soymsk

Other Decks in Technology

Transcript

  1. Wavenet • 2016೥ʹDeepMind͕ൃදͨ͠Ի੠߹੒ΞϧΰϦζϜ • Text to Speech(TTS)ͷ෼໺Ͱߴ͍Ի੠߹੒ͷਫ਼౓Λୡ੒͠ ͨɻ • ࣮૷͕ެ։͞Ε͓ͯΒͣɺ·ͨ਺ࣜ΋গͳ͘ɺ࣮ࡍʹͲͷΑ

    ͏ʹͳ͍ͬͯΔ͔ෆ໌ͳॴ΋ଟ͍ • Concatenate Text to Speech • parametric TTS parametric TTS • PixelRNN • PixelCNN 8BWFOFU +
  2. ैདྷͷख๏ • Concatenate Text to Speech • ୹͍Ի੠σʔλΛେྔʹσʔλϕʔεʹ֨ೲ͠ɺͦΕΛͭͳ͗߹ΘͤΔख๏ • طଘͷσʔλΛͭͳ͗߹ΘͤΔ͚ͩͳͷͰɺڧௐɾ੠৭มߋͳͲ͕ۤखɻ·

    ͨɺ߹੒ޙͷԻ੠ͷͭͳ͕Γ΋ෆࣗવʹͳΓ͕ͪ • parametric TTS • ੜ੒ϞσϧʹΑͬͯԻ੠߹੒͢Δख๏ • ൃ࿩಺༰΍ൃ࿩ऀͷಛ௃ΛϞσϧͷೖྗͱͯ͠ίϯτϩʔϧͤ͞Δ͜ͱ͕Ͱ ͖ΔΑ͏ʹͳͬͨɻ • ͨͩ͠ɺࣗવͳൃ࿩ɺͱ͸ݴ͍೉͍
  3. Dilated causal convolution • 44100ͷೖྗ΋16૚ͷDilated causal convolution ͰΈΔ͜ͱ͕Մೳ • WavenetͰ͸ɺ࠷େDilation=512·Ͱ૚ΛॏͶ(

    1- block )ɺblockΛෳ਺ੵΈॏͶΔߏ଄Λऔ͍ͬͯ Δɻ • ૚Λਂͯ͘͠΋ֶशͰ͖ΔΑ͏ʹResidualNetΛར ༻
  4. Conditional Wavenet • Conditional Pixel CNN ͱಉ༷ɺWavenetʹ೚ҙͷύϥϝʔλhಋೖ͢Δ ͜ͱͰɺWavenetΛύϥϝʔλͰૢ࡞ • Global

    conditions: Wavenetʹൃ࿩ऀͷಛ௃Λֶशͤ͞Δ ύϥϝʔλhʹΑͬͯൃ࿩શମͷதͰͷൃ࿩ऀͷಛ௃Λ࠶ݱͰ͖Δ ex: ฼ࠃޠ͕ҟͳΔൃ࿩ऀͷಛ௃ શͯͷ࣌ؒεςοϓͰ࡞༻͢Δ߲
  5. ࢀߟ • https://arxiv.org/abs/1609.03499 • ݪஶPDF • https://deepmind.com/blog/wavenet-generative-model-raw-audio/ • σϞ݁ՌͳͲ •

    http://musyoku.github.io/2016/09/18/wavenet-a-generative-model-for-raw- audio/ • Chainer࣮૷΍Dilationͷ෦෼͕Θ͔Γ΍͍͢ • https://www.slideshare.net/DeepLearningJP2016/dlwavenet-a-generative- model-for-raw-audio