Introductory talk on deep learning


  Models: AE / SAE, RBM / DBN, CNN, RNN / LSTM, MemNet / NTM

    What? Why? How? Next?

  What? An AI technique for learning multiple levels of abstraction directly from raw information.
  Primitive rule-based AI: tailored systems

    Input -> Hand-crafted program -> Output
  Classical machine learning: learning from custom features

    Input -> Hand-crafted features -> Learning system -> Output
  Deep learning based AI: learn everything

    Input -> Learned features (lower level) -> Learned features (higher level) -> Learning system -> Output
  Why the buzz?

  [Figure: Google Trends for "Deep Learning"]


  Crude timeline of neural networks (axis: 1950, 1980, 1990, 2000, 2010), in order:

    Perceptron -> Backprop & applications -> NN winter -> Stacking RBMs -> Deep learning fuss

  HUGE DATA: Large Synoptic Survey Telescope (2022), 30 TB/night

  HUGE CAPABILITIES: GPGPU (~20x speedup), powerful clusters

  HUGE SUCCESS: speech and text understanding, robotics / computer vision, business / big data, Artificial General Intelligence (AGI)
  How it's done?

  Shallow Network

    $h = f(x, W_0)$
    $\hat{y} = f'(h, W_1)$
    $\ell = \mathrm{loss}(y, \hat{y})$
    minimize $\ell$
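  The shallow network can be sketched in plain Python. This is a minimal illustration under assumptions the slide does not state: a tanh hidden layer, a linear output, a squared-error loss, and plain gradient descent. The names `forward`, `loss`, `sgd_step`, `W0` and `W1` are chosen here for illustration.

```python
import math

def forward(x, W0, W1):
    """Hidden layer h = tanh(W0 x), output y_hat = W1 . h (both choices assumed)."""
    h = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in W0]
    y_hat = sum(w * hi for w, hi in zip(W1, h))
    return h, y_hat

def loss(y, y_hat):
    """Squared error between the target and the prediction."""
    return 0.5 * (y - y_hat) ** 2

def sgd_step(x, y, W0, W1, lr=0.05):
    """One gradient descent step on a single (x, y) example."""
    h, y_hat = forward(x, W0, W1)
    err = y_hat - y  # d loss / d y_hat
    grad_W1 = [err * hj for hj in h]
    # Backpropagate through tanh: d tanh(z) / dz = 1 - tanh(z)^2
    grad_W0 = [[err * W1[j] * (1.0 - h[j] ** 2) * xi for xi in x]
               for j in range(len(W0))]
    new_W0 = [[w - lr * g for w, g in zip(row, grow)]
              for row, grow in zip(W0, grad_W0)]
    new_W1 = [w - lr * g for w, g in zip(W1, grad_W1)]
    return new_W0, new_W1
```

  Calling `sgd_step` repeatedly on the same example should drive `loss` down, which is all that "minimize" means here.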
  Deep Network

    Upside: more abstract features, stellar performance
    Downside: vanishing gradient, overfitting
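  The vanishing gradient can be illustrated with a toy chain of one-unit sigmoid layers. This is a sketch rather than a real network: the function name `gradient_through_chain` and the single-unit layers are assumptions made for brevity. Because the sigmoid derivative never exceeds 0.25, each extra layer shrinks the backpropagated gradient.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def gradient_through_chain(x, weights):
    """Return d(output)/d(input) for a chain of one-unit sigmoid layers."""
    a, grad = x, 1.0
    for w in weights:
        a = sigmoid(w * a)
        # Chain rule: d sigmoid(w * a_prev) / d a_prev = w * a * (1 - a)
        grad *= w * a * (1.0 - a)
    return grad

shallow = gradient_through_chain(0.5, [1.0] * 2)   # 2 layers
deep = gradient_through_chain(0.5, [1.0] * 10)     # 10 layers, far smaller gradient
```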
  Autoencoder: unsupervised feature learning
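  A minimal sketch of an autoencoder, assuming a linear 2 -> 1 -> 2 architecture trained by per-sample gradient descent on squared reconstruction error. The architecture, learning rate and the names `train_autoencoder` and `reconstruction_error` are illustrative choices, not taken from the talk. No labels are used: the input is its own target.

```python
def train_autoencoder(data, lr=0.05, epochs=200):
    """Fit a linear 2 -> 1 -> 2 autoencoder by per-sample gradient descent."""
    w_enc = [0.1, 0.1]  # encoder: h = w_enc . x
    w_dec = [0.1, 0.1]  # decoder: x_hat[i] = w_dec[i] * h
    for _ in range(epochs):
        for x in data:
            h = w_enc[0] * x[0] + w_enc[1] * x[1]
            err = [w_dec[i] * h - x[i] for i in range(2)]   # x_hat - x
            grad_h = err[0] * w_dec[0] + err[1] * w_dec[1]  # gradient w.r.t. the code h
            for i in range(2):
                w_dec[i] -= lr * err[i] * h
                w_enc[i] -= lr * grad_h * x[i]
    return w_enc, w_dec

def reconstruction_error(data, w_enc, w_dec):
    """Mean squared reconstruction error over the data set."""
    total = 0.0
    for x in data:
        h = w_enc[0] * x[0] + w_enc[1] * x[1]
        total += sum((w_dec[i] * h - x[i]) ** 2 for i in range(2))
    return total / len(data)

# Points on the line x2 = 2 * x1, so a single hidden unit can encode them.
data = [[1.0, 2.0], [-1.0, -2.0], [0.5, 1.0], [-0.5, -1.0]]
```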

  Stacked Autoencoder

    Y. Bengio et al.; Greedy Layer-Wise Training of Deep Networks
  Stacked Autoencoder

    1. Unsupervised, layer-by-layer pretraining
    2. Supervised fine-tuning
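  The control flow of step 1 can be sketched as below. The helper `train_autoencoder(inputs, n_hidden)` is a hypothetical callback assumed to fit one autoencoder and return its encoder as a function. Nothing here is specific to any library, and step 2 is only described in a comment.

```python
def greedy_pretrain(data, layer_sizes, train_autoencoder):
    """Greedy layer-wise pretraining: one autoencoder per layer.

    train_autoencoder is a hypothetical callback (inputs, n_hidden) -> encoder,
    where encoder maps one input vector to its hidden code.
    """
    encoders, inputs = [], data
    for n_hidden in layer_sizes:
        encode = train_autoencoder(inputs, n_hidden)  # step 1: unsupervised
        encoders.append(encode)
        inputs = [encode(x) for x in inputs]          # codes become the next layer's data
    # Step 2 (not shown): stack the encoders, add an output layer,
    # and fine-tune the whole network with labels.
    return encoders
```

  Each layer only ever sees the codes produced by the layers below it, which is what makes the procedure greedy.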
  Deep Belief Network: 2006 breakthrough, stacking Restricted Boltzmann Machines (RBMs)

    Hinton, G. E., Osindero, S. and Teh, Y.; A fast learning algorithm for deep belief nets
  Rethinking Computer Vision

  Traditional image classification pipeline

    Feature extraction (SIFT, SURF, etc.) -> Classifier (SVM, NN, etc.)
  Convolutional Neural Network Images taken from
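  The core operation of a convolutional layer can be sketched in a few lines of plain Python. This is an illustration only: it assumes stride 1, no padding and a single channel, and the function name `conv2d` is chosen here. As in most deep learning libraries, the kernel is not flipped, so strictly speaking this is cross-correlation.

```python
def conv2d(image, kernel):
    """Slide the kernel over the image (stride 1, no padding) and sum the products."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [sum(image[i + a][j + b] * kernel[a][b]
             for a in range(kh) for b in range(kw))
         for j in range(out_w)]
        for i in range(out_h)
    ]

# A 1x2 kernel that responds to a dark-to-bright vertical edge.
image = [[0, 0, 1, 1],
         [0, 0, 1, 1]]
edges = conv2d(image, [[-1, 1]])  # [[0, 1, 0], [0, 1, 0]]
```

  In a CNN the kernel values are learned rather than hand-picked, which is the contrast with the SIFT/SURF style pipeline.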

  The Starry Night, Vincent van Gogh

    Leon A. Gatys, Alexander S. Ecker and Matthias Bethge; A Neural Algorithm of Artistic Style

  Scene Description: CNN + RNN

    Oriol Vinyals et al.; Show and Tell: A Neural Image Caption Generator
  Learning Sequences

  Recurrent Neural Network (simple Elman version)

    $h_t = f(x_t, h_{t-1}, W_0, W_1)$
    $\hat{y}_t = f'(h_t, W_2)$
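  A minimal sketch of the Elman recurrence in plain Python, assuming tanh for the hidden update and a linear readout. The slide does not name the activations, so both are assumptions, as are the names `rnn_step`, `run_rnn`, `W0`, `W1` and `W2`. The hidden state is the only thing carried from one time step to the next.

```python
import math

def dot(row, vec):
    return sum(a * b for a, b in zip(row, vec))

def rnn_step(x_t, h_prev, W0, W1, W2):
    """h_t = tanh(W0 x_t + W1 h_prev), y_t = W2 h_t (activations assumed)."""
    h_t = [math.tanh(dot(W0[i], x_t) + dot(W1[i], h_prev)) for i in range(len(W0))]
    y_t = [dot(row, h_t) for row in W2]
    return h_t, y_t

def run_rnn(xs, h0, W0, W1, W2):
    """Apply the same weights at every time step, threading the hidden state."""
    h, ys = h0, []
    for x_t in xs:
        h, y_t = rnn_step(x_t, h, W0, W1, W2)
        ys.append(y_t)
    return ys, h

# Same input twice: the second output differs because of the carried state.
ys, h_last = run_rnn([[1.0], [1.0]], [0.0], [[1.0]], [[0.5]], [[1.0]])
```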
  Long Short Term Memory (LSTM): add memory cells, learn the access mechanism

    Sepp Hochreiter and Jürgen Schmidhuber; Long short-term memory
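  One step of an LSTM cell can be sketched with scalars. This assumes the now-common variant with a forget gate, which is a later addition and not part of the cited paper. The parameter layout `p[gate] = (w_x, w_h, bias)` and the name `lstm_step` are illustrative.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def lstm_step(x, h_prev, c_prev, p):
    """One step of a scalar LSTM cell; p maps gate name -> (w_x, w_h, bias)."""
    def pre(name):
        w_x, w_h, b = p[name]
        return w_x * x + w_h * h_prev + b

    i = sigmoid(pre("input"))         # how much new content to write
    f = sigmoid(pre("forget"))        # how much old memory to keep
    o = sigmoid(pre("output"))        # how much memory to expose
    g = math.tanh(pre("candidate"))   # candidate content
    c = f * c_prev + i * g            # memory cell update
    h = o * math.tanh(c)              # hidden state read from the cell
    return h, c

# Gates saturated to "keep everything, write nothing": the cell holds its value.
hold = {"input": (0.0, 0.0, -20.0), "forget": (0.0, 0.0, 20.0),
        "output": (0.0, 0.0, 20.0), "candidate": (1.0, 0.0, 0.0)}
h, c = lstm_step(1.0, 0.0, 0.7, hold)
```

  The gates are the "access mechanism": since they are learned, the network decides when to store, keep and read the memory cell.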

  Fooling Deep Networks

    Anh Nguyen, Jason Yosinski, Jeff Clune; Deep Neural Networks are Easily Fooled
  Next: cool things to try

  Hyperparameter optimization: Bayesian
  Optimization methods: adadelta, rmsprop . . .
  Regularization: dropout, dither . . .
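  Two of the items above are small enough to sketch directly. The following is a rough illustration, assuming the usual textbook forms of the RMSprop update and of inverted dropout. The function names, default hyperparameters and the toy objective are choices made here.

```python
import math
import random

def rmsprop_minimize(grad, w, lr=0.01, decay=0.9, eps=1e-8, steps=1000):
    """Minimize a 1-D function given its gradient, using the RMSprop update."""
    cache = 0.0
    for _ in range(steps):
        g = grad(w)
        cache = decay * cache + (1.0 - decay) * g * g  # running mean of squared gradients
        w -= lr * g / (math.sqrt(cache) + eps)         # step scaled by gradient magnitude
    return w

def dropout(activations, keep_prob, rng):
    """Inverted dropout: zero units at random, rescale survivors by 1 / keep_prob."""
    return [a / keep_prob if rng.random() < keep_prob else 0.0 for a in activations]

# Toy objective (w - 3)^2 with gradient 2 * (w - 3).
w_min = rmsprop_minimize(lambda w: 2.0 * (w - 3.0), w=0.0)
dropped = dropout([1.0] * 1000, keep_prob=0.5, rng=random.Random(0))
```

  Dropout is applied only during training. With the inverted form, no rescaling is needed at test time.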
  Attention & Memory: NTMs, Memory Networks, Stack RNNs . . .
  NLP: translation, description

  Cognitive Hardware: FPGA, GPU, neuromorphic chips
  Scalable DL: map-reduce, compute clusters

  Deep Reinforcement Learning: deepmindish things, deep Q learning
  Energy models: RBMs, DBNs . . .

  Theano (Python) | Torch (Lua) | Caffe (C++)

  GitHub is a friend
  @AbhinavTushar