Making Sense of Neural Network Training

Making Sense of Neural Network Training Pivotal (2018/02/20)

@JohnEstropia Developer since 2008 (mostly Mobile apps) Principal Engineer @
Other hobby projects http://github.com/JohnEstropia/CoreStore

Today's talk My motivation with Machine Learning Rundown of Neural
Networks in image recognition Some interesting insights

Why I started using ML

Who's that Pokemon? PokeRater's image processing

Optical Character Recognition (Tesseract) PokeRater's image processing

Current solution

Current (incomplete) solution

Other issues with traditional OCRs Inﬁnite possibilities of misreads PLKACHUʢPIKACHUʣ
ϏΨνϡϫʢϐΧνϡ΢ʣ Non-English OCRs are not reliable Pokemon names are in 9 languages

Neural Networks

"Charmander" Neurons = Cells

"Pikachu" Neurons = Cells

"Pikachu" "Charmander" Neurons = Weights (of features)

Neurons = Weights (of features) *Clip: The Game Theorists (Youtube
channel)

Neurons = Weights (of features) Features extracted using Convolution ﬁlters

Training a Neural Network

Common Neural Network Creation Flow Front-end Back-end Model

Common Neural Network Creation Flow Front-end: - Training code (usually
Python) - Loads and processes all training images - Template codes are abundant! (most NNs are set up very similarly)

Common Neural Network Creation Flow Back-end: Computes and builds the
"weights" network

Common Neural Network Creation Flow Model ﬁle: What apps will
use Example: Core ML

Insights on Neural Network Concepts

Training a Neural Network Teaching a kid From here on
we’ll call Neural Network “N-chan”

Tons of images (100~ each) "Pikachu" "Charmander"

Training data =~ Flash cards

Teaching = Repetition Takes about a day on decent-sized data
size GPU hardware is recommended

Repetitions → Misunderstandings Depending on our training data (or lack
thereof), N-chan may misunderstand some things “Overﬁtting” Three

Countering Overﬁtting: “Dropout” Randomly force N-chan to “forget” a learned
item Good example: Math Exams memorizing is not necessarily a good thing

Countering Overﬁtting: Optimizers Tweak the "learning rate" Example: N-chan is
studying for an exam Read all book chapters then take a mock exam (slow but extensive) Take a mock exam then check the answers (trial and error)

Countering Overﬁtting: Optimizers 0% accuracy 100% accuracy loss (noise) loss
(noise) loss (noise) speed = learning rate

Today's Key Points Neural Networks are better at analyzing unknown
data than traditional image recognition systems (ex: OCR) Many template projects for training Neural Networks exist (esp. Keras) Training Neural Networks is like teaching a kid

References https://shibberu.com/2016/04/26/ma-490-deep-learning/ https://www.youtube.com/watch?v=ZCPauvMxV7Q&t=568s https://blog.keras.io/building-powerful-image-classiﬁcation- models-using-very-little-data.html https://adeshpande3.github.io/A-Beginner%27s-Guide-To- Understanding-Convolutional-Neural-Networks/ http://cs231n.github.io/convolutional-networks/#overview

Thanks!

Making Sense of Neural Network Training

Making Sense of Neural Network Training

John Estropia

More Decks by John Estropia

Other Decks in Technology

Featured

Transcript