Machines are Learning: Bringing Powerful Artificial Intelligence to All Developers

Machines are Learning Danilo Poccia, AWS Technical Evangelist CODEMOTION MILAN
- SPECIAL EDITION 10 – 11 NOVEMBER 2017

Machines are Learning Bringing Powerful Artificial Intelligence to All Developers
Danilo Poccia AWS Technical Evangelist @danilop [email protected] danilop

Credit: Gerry Cranham/Fox Photos/Getty Images http://www.telegraph.co.uk/travel/destinations/europe/united-kingdom/england/london/galleries/The-history-of-the-Tube-in-pictures-150-years-of-London-Underground/1939-ticket-examin/

1939 London Underground Credit: Gerry Cranham/Fox Photos/Getty Images http://www.telegraph.co.uk/travel/destinations/europe/united-kingdom/england/london/galleries/The-history-of-the-Tube-in-pictures-150-years-of-London-Underground/1939-ticket-examin/

Data Predictions

Data Model Predictions

http://www.thehudsonvalley.com/articles/60-years-ago-today-local-technology-demonstrated-artificial-intelligence-for-the-first-time 1959 Arthur Samuel

Machine Learning

Machine Learning Supervised Learning Inferring a model from labeled training
data

Machine Learning Supervised Learning Unsupervised Learning Inferring a model from
labeled training data Inferring a model to describe hidden structure from unlabeled data

Reinforcement Learning Perform a certain goal in a dynamic environment
Machine Learning Supervised Learning Unsupervised Learning

Driving a vehicle Playing a game against an opponent

Clustering

Tip: Try topic modeling with your own emails ;-) Topic
Modeling Discovering abstract “topics” that occur in a collection of documents For example, looking for “infrequent” words that are used more often in a document

Regression “How many bikes will be rented tomorrow?” Happy, Sad,
Angry, Confused, Disgusted, Surprised, Calm, Unknown Binary Classification Multi-Class Classification “Is this email spam?” “What is the sentiment of this tweet, or of this social media comment?” 1, 0, 100K Yes / No True / False %

Training the Model Minimizing the Error of using the Model
on the Labeled Data

Validation How well is this Model working on New Data?

Be Careful of Overfitting

Better Fitting

Different Models ⇒ Different Predictions

Labeled Data

Labeled Data 70% 30% Training Validation

Neural Networks

1943 Warren McCulloch, Walter Pitts Threshold Logic Units

1962 Frank Rosenblatt Perceptron

∑ w1 w2 w3 wn w0 = output weights (parameters)
activation function input

f(∑) w1 w2 w3 wn w0 = weights (parameters) activation
function output input

f(∑) input output

1969 Marvin Minsky, Seymour Papert Perceptrons: An Introduction to Computational
Geometry A perceptron can only solve linearly separable functions (e.g. no XOR)

f(∑) f(∑) f(∑) f(∑) f(∑) f(∑) f(∑) f(∑) f(∑) input
layer hidden layer output layer input output Multiple Layers Lots of Parameters Backpropagation

Microprocessor Transistor Counts 1971-2011 Intel Xeon CPU 28 cores NVIDIA
V100 GPU 5,120 CUDA Cores 640 Tensor Cores https://en.wikipedia.org/wiki/Moore's_law

LeCun, Gradient-Based Learning Applied to Document Recognition,1998 Hinton, A Fast
Learning Algorithm for Deep Belief Nets, 2006 Bengio, Learning Deep Architectures for AI, 2009 Advances in Research 1998-2009

“Stacks of differentiable non-linear functions with lots of parameters solve
nearly any predictive modeling problem” —Jeremy Howard, fast.ai

Image Processing

f(∑) f(∑) f(∑) f(∑) f(∑) f(∑) f(∑) f(∑) f(∑) output
How to give images in input to a Neural Network? Photo by David Iliff. License: CC-BY-SA 3.0 https://commons.wikimedia.org/wiki/File:Colosseum_in_Rome,_Italy_-_April_2007.jpg

Convolution Matrix 0 0 0 0 1 0 0 0
0 Identity Photo by David Iliff. License: CC-BY-SA 3.0 https://commons.wikimedia.org/wiki/File:Colosseum_in_Rome,_Italy_-_April_2007.jpg

Convolution Matrix 1 0 -1 2 0 -2 1 0
-1 Left Edges Photo by David Iliff. License: CC-BY-SA 3.0 https://commons.wikimedia.org/wiki/File:Colosseum_in_Rome,_Italy_-_April_2007.jpg

Convolution Matrix -1 0 1 -2 0 2 -1 0
1 Right Edges Photo by David Iliff. License: CC-BY-SA 3.0 https://commons.wikimedia.org/wiki/File:Colosseum_in_Rome,_Italy_-_April_2007.jpg

Convolution Matrix 1 2 1 0 0 0 -1 -2
-1 Top Edges Photo by David Iliff. License: CC-BY-SA 3.0 https://commons.wikimedia.org/wiki/File:Colosseum_in_Rome,_Italy_-_April_2007.jpg

Convolution Matrix -1 -2 -1 0 0 0 1 2
1 Bottom Edges Photo by David Iliff. License: CC-BY-SA 3.0 https://commons.wikimedia.org/wiki/File:Colosseum_in_Rome,_Italy_-_April_2007.jpg

Convolution Matrix 0.6 -0.6 1.2 -1.4 1.2 -1.6 0.8 -1.4
1.6 Random Values Photo by David Iliff. License: CC-BY-SA 3.0 https://commons.wikimedia.org/wiki/File:Colosseum_in_Rome,_Italy_-_April_2007.jpg

Convolutional Neural Networks (CNNs) https://en.wikipedia.org/wiki/Convolutional_neural_network

ImageNet Classification Error Over Time 0 5 10 15 20
25 30 2010 2011 2012 2013 2014 2015 2016 2017 Classification Error CNNs

2012 ImageNet Classification with Deep Convolutional Neural Networks

SuperVision: 8 layers, 60M parameters 0

2013 Visualizing and Understanding Convolutional Networks

How Do Neural Networks Learn? ? More generic and can
be reused as feature extractor for other visual tasks Specific to task Cat Dog 0

The Challenge For Machine Learning: Scale Aggressive migration New data
created on AWS PBs of existing data Data

The Challenge For Machine Learning: Scale Tons of GPUs Elastic
capacity Pre-built images Aggressive migration New data created on AWS PBs of existing data Data Training

The Challenge For Machine Learning: Scale Tons of GPUs and
CPUs Serverless At the edge, on IoT Devices Tons of GPUs Elastic capacity Pre-built images Aggressive migration New data created on AWS PBs of existing data Data Training Prediction

Natural Language Processing Experiment: Topic Modeling EC2 Spot Instances 1.1
Million vCPUs

Fulfilment & logistics Search & discovery Existing products New products
Thousands Of Amazon Engineers Focused On Machine Learning

Machine Learning On AWS Today

Artificial Intelligence In The Hands Of Every Developer S E
R V I C E S P L A T F O R M S E N G I N E S I N F R A S T R U C T U R E GPU CPU IoT Mobile Apache MXNet Caffe 2 Theano PyTorch CNTK TensorFlow

Early Detection Of Diabetic Complications

FDA-approved Medical Imaging

Sports Analytics

Autonomous Driving Systems

Real Time, Per Pixel Object Segmentation

Centimeter-accurate Positioning

Computation Knowledge Engine

S E R V I C E S P L
A T F O R M S E N G I N E S I N F R A S T R U C T U R E Amazon ML Spark & EMR Kinesis Batch ECS GPU CPU IoT Mobile Apache MXNet Caffe 2 Theano PyTorch CNTK TensorFlow Artificial Intelligence In The Hands Of Every Developer

S E R V I C E S P L
A T F O R M S Vision Amazon Rekognition E N G I N E S I N F R A S T R U C T U R E Amazon ML Spark & EMR Kinesis Batch ECS GPU CPU IoT Mobile Apache MXNet Caffe 2 Theano PyTorch CNTK TensorFlow Artificial Intelligence In The Hands Of Every Developer

Mona Lisa (Leonardo da Vinci)

Mona Lisa (Prado's version)

Portrait of Maddalena Doni (Raphael)

Bynder allows you to easily create, find and use content
for branding automation and marketing solutions. With our new AI capabilities, Bynder’s software… now allows users to save hours of admin labor when uploading and organizing their files, adding exponentially more value. Chris Hall CEO, Bynder ” “ With Rekognition, Bynder revolutionizes marketing admin tasks with AI capabilities

S E R V I C E S P L
A T F O R M S Speech Amazon Polly Vision Amazon Rekognition E N G I N E S I N F R A S T R U C T U R E Amazon ML Spark & EMR Kinesis Batch ECS GPU CPU IoT Mobile Apache MXNet Caffe 2 Theano PyTorch CNTK TensorFlow Artificial Intelligence In The Hands Of Every Developer

Generate Lifelike Speech With Amazon Polly 24 languages “The temperature
in Milanis 16 degrees Celsius” “The temperature in Milan is 16˚C” Amazon Polly 50 voices

aws polly synthesize-speech --text "It was nice to live such
a wonderful live show." --output-format mp3 --voice-id Joanna --text-type text output.mp3

“Nel mezzo del cammin di nostra vita mi ritrovai per
una selva oscura ché la diritta via era smarrita.” https://commons.wikimedia.org/wiki/File:Portrait_de_Dante.jpg

Duolingo voices its language learning service Using Polly Duolingo is
a free language learning service where users help translate the web and rate translations. With Amazon Polly our users benefit from the most lifelike Text-to-Speech voices available on the market. Severin Hacker CTO, Duolingo ” “ • Spoken language crucial for language learning • Accurate pronunciation matters • Faster iteration thanks to TTS • As good as natural human speech

” “ Royal National Institute of Blind People creates and
distributes accessible information in the form of synthesized content Amazon Polly delivers incredibly lifelike voices which captivate and engage our readers. John Worsfold Solutions Implementation Manager, RNIB • RNIB delivers largest library of audiobooks in the UK for nearly 2 million people with sight loss • Naturalness of generated speech is critical to captivate and engage readers • No restrictions on speech redistributions enables RNIB to create and distribute accessible information in a form of synthesized content RNIB provides the largest library in the UK for people with sight loss

S E R V I C E S P L
A T F O R M S Chat Amazon Lex Speech Amazon Polly Vision Amazon Rekognition E N G I N E S I N F R A S T R U C T U R E Amazon ML Spark & EMR Kinesis Batch ECS GPU CPU IoT Mobile Apache MXNet Caffe 2 Theano PyTorch CNTK TensorFlow Artificial Intelligence In The Hands Of Every Developer

Amazon Lex Speech recognition and natural language understanding Automatic speech
recognition Natural language understanding “What’s the weather forecast?” Weather forecast Amazon Lex

Amazon Lex Speech recognition and natural language understanding “It will
be sunny and 16C” Automatic speech recognition Natural language understanding “What’s the weather forecast?” Weather forecast Amazon Lex

“It will be sunny and 16 degrees Celsius” Amazon Polly
Amazon Lex “It will be sunny and 16C” Automatic speech recognition Natural language understanding “What’s the weather forecast?” Weather forecast Speech recognition and natural language understanding Amazon Lex

” “ Finding missing persons: ~100,000 active missing persons cases
in the U.S. at any given time ~60% are adults, ~40% are children • Motorola Solutions applies Amazon Rekognition, Amazon Polly and Amazon Lex • Image analytics and facial recognition can continually monitor for missing persons • Tools that understand natural language can enable officers to keep eyes up and hands free Motorola Solutions is using AI to help finding missing persons Motorola Solutions keeps utility workers connected and visible to each other with real-time voice and data communication across the smart grid.

S E R V I C E S P L
A T F O R M S Chat Amazon Lex Speech Amazon Polly Vision Amazon Rekognition E N G I N E S I N F R A S T R U C T U R E Amazon ML Spark & EMR Kinesis Batch ECS GPU CPU IoT Mobile Apache MXNet Caffe 2 Theano PyTorch CNTK TensorFlow

There’s Never Been A Better Time To Build Smart Apps

https://github.com/danilop/security-camera

Machines are Learning Bringing Powerful Artificial Intelligence to All Developers
Danilo Poccia AWS Technical Evangelist @danilop [email protected] danilop

Machines are Learning: Bringing Powerful Artifi...

Machines are Learning: Bringing Powerful Artificial Intelligence to All Developers

More Decks by Danilo Poccia

Other Decks in Programming

Featured

Transcript