Predictive Modeling and Deep Learning

Predictive Modeling & Deep Learning
Olivier Grisel - ENMI - Paris 2015

Outline
• Predictive Modeling & Artificial Intelligence
• Deep Learning
• Computer Vision
• Natural Language Understanding and Machine Translation
• Learning to Reason and Answer Questions

Predictive Modeling

samples (train): four feature columns, one target column (sold)

type (category)  # rooms (int)  surface (float m²)  public trans (boolean)  sold (float k€)
Apartment        3              50                  TRUE                    450
House            5              254                 FALSE                   430
Duplex           4              68                  TRUE                    712
Apartment        2              32                  TRUE                    234

samples (train) and samples (test): four feature columns, one target column (sold)

type (category)  # rooms (int)  surface (float m²)  public trans (boolean)  sold (float k€)
samples (train)
Apartment        3              50                  TRUE                    450
House            5              254                 FALSE                   430
Duplex           4              68                  TRUE                    712
Apartment        2              32                  TRUE                    234
samples (test)
Apartment        2              33                  TRUE                    ?
House            4              210                 TRUE                    ?
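As a minimal sketch of how the table above becomes numbers, the Python below encodes each sample as a feature vector and fills in the two "?" cells. The one-hot encoding of the category and the 1-nearest-neighbor rule are illustrative assumptions, not methods named in the slides; any regression model could take their place.

```python
# Assumed column order for the one-hot encoding of the "type" category.
TYPES = ["Apartment", "House", "Duplex"]

def encode(row):
    """Turn one (type, rooms, surface, public_trans) record into a numeric vector."""
    kind, rooms, surface, public_trans = row
    one_hot = [1.0 if kind == t else 0.0 for t in TYPES]
    return one_hot + [float(rooms), float(surface), 1.0 if public_trans else 0.0]

# samples (train): features and target are kept in two parallel lists.
train_rows = [
    ("Apartment", 3, 50.0, True),
    ("House", 5, 254.0, False),
    ("Duplex", 4, 68.0, True),
    ("Apartment", 2, 32.0, True),
]
y_train = [450.0, 430.0, 712.0, 234.0]  # target: sale price in k€

# samples (test): same features, unknown target.
test_rows = [
    ("Apartment", 2, 33.0, True),
    ("House", 4, 210.0, True),
]

X_train = [encode(r) for r in train_rows]
X_test = [encode(r) for r in test_rows]

def predict_nearest(x, X, y):
    """1-nearest-neighbor regression: copy the target of the closest training sample."""
    def sq_dist(a, b):
        return sum((ai - bi) ** 2 for ai, bi in zip(a, b))
    best = min(range(len(X)), key=lambda i: sq_dist(x, X[i]))
    return y[best]

predictions = [predict_nearest(x, X_train, y_train) for x in X_test]
print(predictions)
```

The features are left unscaled here, so the surface column dominates the distance; a real pipeline would normally rescale the columns first.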

Predictive Modeling Data Flow
[Diagram: Training (text docs, images, sounds, transactions) -> Feature vectors + Labels -> Machine Learning Algorithm -> Model]

Predictive Modeling Data Flow
[Diagram, training: Training (text docs, images, sounds, transactions) -> Feature vectors + Labels -> Machine Learning Algorithm -> Model]
[Diagram, prediction: New (text doc, image, sound, transaction) -> Feature vector -> Model -> Expected Label]
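The data flow above can be sketched in a few lines of Python. Everything specific here is an assumption made for illustration: the toy documents, the word-count features, and the NearestCentroid class, which stands in for whatever machine learning algorithm is actually used.

```python
from collections import Counter

def vectorize(doc, vocabulary):
    """Feature extraction: map a text document to a word-count vector."""
    counts = Counter(doc.lower().split())
    return [counts[word] for word in vocabulary]

def sq_dist(a, b):
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))

class NearestCentroid:
    """Toy learning algorithm: the model is one mean feature vector per label."""

    def fit(self, X, labels):
        sums, counts = {}, Counter(labels)
        for x, label in zip(X, labels):
            acc = sums.setdefault(label, [0.0] * len(x))
            for i, value in enumerate(x):
                acc[i] += value
        self.centroids_ = {
            label: [v / counts[label] for v in total]
            for label, total in sums.items()
        }
        return self

    def predict(self, x):
        return min(self.centroids_, key=lambda label: sq_dist(x, self.centroids_[label]))

# Training: documents + labels -> feature vectors -> algorithm -> model.
train_docs = [
    "cheap pills buy now",
    "buy cheap watches now",
    "meeting agenda for monday",
    "monday project meeting notes",
]
labels = ["spam", "spam", "ham", "ham"]
vocabulary = sorted({word for doc in train_docs for word in doc.lower().split()})
X_train = [vectorize(doc, vocabulary) for doc in train_docs]
model = NearestCentroid().fit(X_train, labels)

# Prediction: new document -> feature vector -> model -> expected label.
new_doc = "buy cheap pills"
expected_label = model.predict(vectorize(new_doc, vocabulary))
print(expected_label)
```

The same vectorize function is applied at training and prediction time, which is the point of the two parallel paths in the diagram.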

Predictive modeling examples
• Inventory forecasting & trends detection
• Personalized radios
• Fraud detection
• Virality and readers engagement
• Predictive maintenance
• Personality matching

[Diagram built up over several slides. Labels: Artificial Intelligence; Predictive Modeling (Data Analytics). Examples placed on the diagram: Self-driving cars, IBM Watson, Movie recommendations, Predictive Maintenance. Regions added in later builds: Hand-crafted symbolic reasoning systems, Machine Learning, Deep Learning.]

Deep Learning
• Neural Networks from the 90’s rebranded in 2006+
• « Neuron » is a loose inspiration (not important)
• Stacked architecture of modules that compute internal abstract representations from the data
• Parameters are tuned from labeled examples
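To make "stacked architecture of modules" concrete, here is a minimal sketch in plain Python. The Linear, ReLU and Sequential classes are hypothetical names chosen for this example, and the weights are set by hand so that the result can be checked; in practice they would be tuned from labeled examples, typically by gradient descent.

```python
class Linear:
    """Module computing a weighted sum plus bias for each output unit."""

    def __init__(self, weights, bias):
        self.weights = weights  # one row of weights per output unit
        self.bias = bias

    def __call__(self, x):
        return [
            sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(self.weights, self.bias)
        ]

class ReLU:
    """Module applying max(0, v) to every component."""

    def __call__(self, x):
        return [max(0.0, v) for v in x]

class Sequential:
    """Stack of modules: the output of one is the input of the next."""

    def __init__(self, *modules):
        self.modules = modules

    def __call__(self, x):
        for module in self.modules:
            x = module(x)
        return x

# Hand-set parameters (an assumption for illustration) that compute XOR.
hidden = Sequential(Linear([[1.0, 1.0], [1.0, 1.0]], [0.0, -1.0]), ReLU())
output = Linear([[1.0, -2.0]], [0.0])
network = Sequential(hidden, output)

for x in ([0.0, 0.0], [0.0, 1.0], [1.0, 0.0], [1.0, 1.0]):
    # hidden(x) is the internal representation, network(x) the final output.
    print(x, hidden(x), network(x))
```

XOR cannot be computed by a single linear module on the raw inputs, but it becomes a simple weighted sum of the hidden representation, which is the role of the intermediate layers.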

Deep Learning for Computer Vision

Deep Learning in the 90’s
• Yann LeCun invented Convolutional Networks
• First NN successfully trained with many layers
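The core operation of a convolutional network can be sketched as follows: a small kernel slides over the image and the same weights are reused at every position. The image and the edge-detecting kernel below are made up for illustration; in a trained network the kernel values are learned parameters.

```python
def conv2d(image, kernel):
    """'Valid' 2D cross-correlation, the operation usually called convolution in ConvNets."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(
                image[i + di][j + dj] * kernel[di][dj]
                for di in range(kh)
                for dj in range(kw)
            )
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]

# Toy 4x4 image: dark left half, bright right half.
image = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]

# Hand-written kernel that responds to dark-to-bright vertical edges.
kernel = [
    [-1, 1],
    [-1, 1],
]

feature_map = conv2d(image, kernel)
for row in feature_map:
    print(row)
```

The feature map is non-zero only in the column where the edge sits, wherever that edge appears vertically in the image.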

Early success at OCR

Natural image classification until 2012
[Diagram: image -> Feature Extractions (data independent) -> Classification (supervised learning) -> label, e.g. dog or cat]

Image classification today
[Diagram: image -> NN Layer -> NN Layer -> NN Layer -> label, e.g. dog or cat; each layer is marked "Supervised Learning"]

ImageNet Challenge 2012
• 1.2M images labeled with 1000 object categories
• AlexNet from the deep learning team of U. of Toronto wins with 15% error rate vs 26% for the second (traditional CV pipeline)
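The slides quote error rates without naming the metric. Assuming they refer to the top-5 classification error usually reported for this challenge (a prediction counts as correct when the true category is among the five highest-scored ones), the computation can be sketched like this. The scores are made-up numbers over three classes, so the example uses k=1 and k=2 instead of k=5.

```python
def top_k_error(scores, true_labels, k=5):
    """Fraction of samples whose true label is not among the k highest-scored classes."""
    errors = 0
    for sample_scores, truth in zip(scores, true_labels):
        ranked = sorted(
            range(len(sample_scores)),
            key=lambda c: sample_scores[c],
            reverse=True,
        )
        if truth not in ranked[:k]:
            errors += 1
    return errors / len(true_labels)

# Hypothetical model scores for 4 images over 3 classes.
scores = [
    [0.1, 0.6, 0.3],  # true class 1 is ranked first
    [0.5, 0.2, 0.3],  # true class 2 is ranked second
    [0.7, 0.1, 0.2],  # true class 1 is ranked last
    [0.2, 0.3, 0.5],  # true class 2 is ranked first
]
true_labels = [1, 2, 1, 2]

print(top_k_error(scores, true_labels, k=1))
print(top_k_error(scores, true_labels, k=2))
```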

ImageNet Challenge 2013
• Clarifai ConvNet model wins at 11% error rate
• Many other participants used ConvNets

ImageNet Challenge 2014
• Monster model: GoogLeNet at 6.7% error rate

GoogLeNet vs Andrej
• Andrej Karpathy evaluated human performance (himself): ~5% error rate
• "It is clear that humans will soon only be able to outperform state of the art image classification models by use of significant effort, expertise, and time."
source: What I learned from competing against a ConvNet on ImageNet

ImageNet Challenge 2015
• Microsoft Research Asia wins with networks with depths ranging from 34 to 152 layers
• New record: 3.6% error rate

Recurrent Neural Networks

source: The Unreasonable Effectiveness of RNNs

Applications of RNNs
• Natural Language Processing (e.g. Language Modeling, Sentiment Analysis)
• Machine Translation (e.g. English to French)
• Speech recognition: audio to text
• Speech synthesis: text to audio
• Biological sequence modeling (DNA, Proteins)
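All the applications above consume a sequence one element at a time while carrying a hidden state forward. A minimal sketch of that recurrence, assuming the vanilla update h = tanh(W_xh x + W_hh h_prev + b) and arbitrary hand-picked weights (a trained network would learn them):

```python
import math

def rnn_step(x, h_prev, W_xh, W_hh, b):
    """One vanilla RNN update: h = tanh(W_xh x + W_hh h_prev + b)."""
    return [
        math.tanh(
            sum(w * xi for w, xi in zip(W_xh[i], x))
            + sum(w * hj for w, hj in zip(W_hh[i], h_prev))
            + b[i]
        )
        for i in range(len(b))
    ]

def one_hot(index, size):
    return [1.0 if i == index else 0.0 for i in range(size)]

# Two-character vocabulary and a 2-unit hidden state, both chosen for illustration.
VOCAB = "ab"
W_xh = [[0.5, -0.5], [0.3, 0.8]]   # input-to-hidden weights
W_hh = [[0.1, 0.4], [-0.6, 0.2]]   # hidden-to-hidden weights
b = [0.0, 0.1]

def encode(text):
    """Run the recurrence over a string and return the final hidden state."""
    h = [0.0, 0.0]
    for ch in text:
        h = rnn_step(one_hot(VOCAB.index(ch), len(VOCAB)), h, W_xh, W_hh, b)
    return h

print(encode("ab"))
print(encode("ba"))
```

The two strings contain the same characters but yield different final states, because the hidden state depends on the order in which the inputs arrive.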

Language modeling
source: The Unreasonable Effectiveness of RNNs

Shakespeare
source: The Unreasonable Effectiveness of RNNs

Linux source code

Attentional architectures for Machine Translation

Neural MT
source: From language modeling to machine translation

Attentional Neural MT
source: From language modeling to machine translation

Attention == Alignment
source: Neural MT by Jointly Learning to Align and Translate
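The "Attention == Alignment" idea can be sketched as a softmax over similarity scores between the decoder state and each source position. This example uses plain dot-product scores as a simplification, which is an assumption of this sketch rather than the scoring function of the cited work; the vectors and words are toy values.

```python
import math

def softmax(scores):
    """Normalize scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attend(query, encoder_states):
    """Return attention weights over source positions and the weighted context vector."""
    scores = [sum(q * k for q, k in zip(query, state)) for state in encoder_states]
    weights = softmax(scores)
    dim = len(encoder_states[0])
    context = [
        sum(w * state[d] for w, state in zip(weights, encoder_states))
        for d in range(dim)
    ]
    return weights, context

# Toy encoder states, one per source word (hypothetical values).
source_words = ["the", "cat", "sleeps"]
encoder_states = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
]

# Hypothetical decoder state while producing the translation of "cat".
decoder_query = [0.0, 4.0, 0.0]

weights, context = attend(decoder_query, encoder_states)
aligned_word = source_words[weights.index(max(weights))]
print(weights, aligned_word)
```

Reading the weights row by row for each output word gives a soft alignment matrix between source and target sentences.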

source: Show, Attend and Tell

Learning to answer questions

Paraphrases from web news

source: Teaching Machines to Read and Comprehend

Conclusion
• ML and DL progress is fast paced
• Many applications already in production (e.g. speech, image indexing, translation, face recognition)
• Very promising results for QA and robot control
• Machine Learning is now moving from pattern recognition to higher level reasoning

Thank you!
http://twitter.com/ogrisel
http://speakerdeck.com/ogrisel
TIP: download the PDF version of the slides to click on the source links
