Upgrade to Pro — share decks privately, control downloads, hide ads and more …

PyData Meetup Amsterdam - Keras & Wavenet

Bas Veeling
December 08, 2016

PyData Meetup Amsterdam - Keras & Wavenet

Bas Veeling

December 08, 2016
Tweet

More Decks by Bas Veeling

Other Decks in Science

Transcript

  1. About me • AI at UvA, MSc thesis with Philips

    Research • Internship at Google Research, Scyfer. • ~2 years experience with Deep learning 2
  2. WaveNet[1] Teaser • Can generate music/speech from examples. • Generative

    Auto-Regressive Deep Neural Network • Not fully text to speech: needs TTS features [1] Oord, Aaron van den, et al. "Wavenet: A generative model for raw audio." 3
  3. WaveNet[1] Teaser • Can generate music/speech from examples. • Generative

    Auto-Regressive Deep Neural Network • Not fully text to speech: needs TTS features Speaker 1 Speaker 2 Music [1] Oord, Aaron van den, et al. "Wavenet: A generative model for raw audio." 3
  4. Keras • Author: François Chollet ( @fchollet) & others •

    High-level neural network library • Python • Uses tensorflow or theano to run on GPU/CPU 4
  5. 5 in = Input(8) out = Dense(9)(in) out = Dense(9)(out)

    out = Dense(9)(out) out = Dense(4)(out) model = Model(in, out) model.compile() model.fit(X,y)
  6. About you • Know the basic principles of deep learning?

    • Worked with a machine learning framework? • Worked with Keras? • Worked with computation graph frameworks (tensorflow, theano, etc)? • Rough understanding of WaveNet? 6
  7. Goals of this talk • Show the magic of Keras

    • Accessible without machine learning experience. • Tips and tricks on working efficiently. • Leave out on details (but feel free to ask!) • Non-goal: an objective comparison between frameworks. 7
  8. Quick Deep Learning Intro • Image -> blackbox -> target

    ‘cat’ or ‘dog’ • Neural network consisting of layers of neurons that compute intermediate steps. • These layers have weights, which determine the outcome. • We ‘train’ these weights: show an input and adjust the weights towards target. 8
  9. Computational graph libraries • Deep Learning Models are expensive •

    Run on GPU and other hardware • Solution: computational graph libary. • Keras uses Theano or Tensorflow, and provides common interface: import keras.backend as K • Example time: adding two numbers together. 9
  10. WaveNet • x = raw audio (300ms or 4.800 samples)

    • ??????? • y = next unseen sample 10
  11. Lots of other tricks! • Gated activation Unit[1] • Non-linear

    quantization • Categorical Distribution as output • Conditional Wavenet for TTS [1] Oord, Aaron van den, Nal Kalchbrenner, and Koray Kavukcuoglu. "Pixel recurrent neural networks." 13
  12. 1. Conditional WaveNet 2. Printing intermediate output w/ K.print_tensor() 3.

    PyCharm and working with remote GPU servers/Cluster 4. Having Keras in pycharm project with editable install. 5. Smart Experiment Management with IDSIA-Sacred 6. TDD with Keras 18 If we had more time: