Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Making Sense of Neural Network Training
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
John Estropia
February 20, 2018
Technology
0
84
Making Sense of Neural Network Training
Presented at Pivotal Labs, Tokyo (2018/2/20)
John Estropia
February 20, 2018
Tweet
Share
More Decks by John Estropia
See All by John Estropia
WWDC Party Intro to CoreStore
johnestropia
0
64
Making an Efficient Deploy Bot
johnestropia
0
450
Creating Keyboard Extensions
johnestropia
0
81
My journey taming Core Data: An intro to the CoreStore library
johnestropia
0
170
Fun with Swift 4 KeyPaths
johnestropia
1
650
Pairs JP Team's iOS Deployment
johnestropia
0
950
Making Slackbots deploy iOS apps for you
johnestropia
2
230
OSSの作法(本題)
johnestropia
0
80
Other Decks in Technology
See All in Technology
日本の85%が使う公共SaaSは、どう育ったのか
taketakekaho
1
240
Oracle Base Database Service 技術詳細
oracle4engineer
PRO
15
93k
30万人の同時アクセスに耐えたい!新サービスの盤石なリリースを支える負荷試験 / SRE Kaigi 2026
genda
4
1.4k
マーケットプレイス版Oracle WebCenter Content For OCI
oracle4engineer
PRO
5
1.6k
配列に見る bash と zsh の違い
kazzpapa3
3
170
(技術的には)社内システムもOKなブラウザエージェントを作ってみた!
har1101
0
150
AWS Network Firewall Proxyを触ってみた
nagisa53
1
240
SREのプラクティスを用いた3領域同時 マネジメントへの挑戦 〜SRE・情シス・セキュリティを統合した チーム運営術〜
coconala_engineer
2
770
StrandsとNeptuneを使ってナレッジグラフを構築する
yakumo
1
120
AI駆動開発を事業のコアに置く
tasukuonizawa
1
360
~Everything as Codeを諦めない~ 後からCDK
mu7889yoon
3
480
Agile Leadership Summit Keynote 2026
m_seki
1
670
Featured
See All Featured
Ethics towards AI in product and experience design
skipperchong
2
200
Being A Developer After 40
akosma
91
590k
Intergalactic Javascript Robots from Outer Space
tanoku
273
27k
Neural Spatial Audio Processing for Sound Field Analysis and Control
skoyamalab
0
170
Code Reviewing Like a Champion
maltzj
527
40k
The Impact of AI in SEO - AI Overviews June 2024 Edition
aleyda
5
740
Darren the Foodie - Storyboard
khoart
PRO
2
2.4k
How to Get Subject Matter Experts Bought In and Actively Contributing to SEO & PR Initiatives.
livdayseo
0
67
Designing for Timeless Needs
cassininazir
0
130
Cheating the UX When There Is Nothing More to Optimize - PixelPioneers
stephaniewalter
287
14k
Visual Storytelling: How to be a Superhuman Communicator
reverentgeek
2
430
Bootstrapping a Software Product
garrettdimon
PRO
307
120k
Transcript
Making Sense of Neural Network Training Pivotal (2018/02/20)
@JohnEstropia Developer since 2008 (mostly Mobile apps) Principal Engineer @
Other hobby projects http://github.com/JohnEstropia/CoreStore
Today's talk My motivation with Machine Learning Rundown of Neural
Networks in image recognition Some interesting insights
Why I started using ML
Who's that Pokemon? PokeRater's image processing
Optical Character Recognition (Tesseract) PokeRater's image processing
Current solution
Current (incomplete) solution
Other issues with traditional OCRs Infinite possibilities of misreads PLKACHUʢPIKACHUʣ
ϏΨνϡϫʢϐΧνϡʣ Non-English OCRs are not reliable Pokemon names are in 9 languages
Neural Networks
"Charmander" Neurons = Cells
"Pikachu" Neurons = Cells
"Pikachu" "Charmander" Neurons = Weights (of features)
Neurons = Weights (of features) *Clip: The Game Theorists (Youtube
channel)
Neurons = Weights (of features) Features extracted using Convolution filters
Training a Neural Network
Common Neural Network Creation Flow Front-end Back-end Model
Common Neural Network Creation Flow Front-end: - Training code (usually
Python) - Loads and processes all training images - Template codes are abundant! (most NNs are set up very similarly)
Common Neural Network Creation Flow Back-end: Computes and builds the
"weights" network
Common Neural Network Creation Flow Model file: What apps will
use Example: Core ML
Insights on Neural Network Concepts
Training a Neural Network Teaching a kid From here on
we’ll call Neural Network “N-chan”
Tons of images (100~ each) "Pikachu" "Charmander"
Training data =~ Flash cards
Teaching = Repetition Takes about a day on decent-sized data
size GPU hardware is recommended
Repetitions → Misunderstandings Depending on our training data (or lack
thereof), N-chan may misunderstand some things “Overfitting” Three
Countering Overfitting: “Dropout” Randomly force N-chan to “forget” a learned
item Good example: Math Exams memorizing is not necessarily a good thing
Countering Overfitting: Optimizers Tweak the "learning rate" Example: N-chan is
studying for an exam Read all book chapters then take a mock exam (slow but extensive) Take a mock exam then check the answers (trial and error)
Countering Overfitting: Optimizers 0% accuracy 100% accuracy loss (noise) loss
(noise) loss (noise) speed = learning rate
Today's Key Points Neural Networks are better at analyzing unknown
data than traditional image recognition systems (ex: OCR) Many template projects for training Neural Networks exist (esp. Keras) Training Neural Networks is like teaching a kid
References https://shibberu.com/2016/04/26/ma-490-deep-learning/ https://www.youtube.com/watch?v=ZCPauvMxV7Q&t=568s https://blog.keras.io/building-powerful-image-classification- models-using-very-little-data.html https://adeshpande3.github.io/A-Beginner%27s-Guide-To- Understanding-Convolutional-Neural-Networks/ http://cs231n.github.io/convolutional-networks/#overview
Thanks!