$30 off During Our Annual Pro Sale. View Details »
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Making Sense of Neural Network Training
Search
John Estropia
February 20, 2018
Technology
0
83
Making Sense of Neural Network Training
Presented at Pivotal Labs, Tokyo (2018/2/20)
John Estropia
February 20, 2018
Tweet
Share
More Decks by John Estropia
See All by John Estropia
WWDC Party Intro to CoreStore
johnestropia
0
64
Making an Efficient Deploy Bot
johnestropia
0
450
Creating Keyboard Extensions
johnestropia
0
80
My journey taming Core Data: An intro to the CoreStore library
johnestropia
0
170
Fun with Swift 4 KeyPaths
johnestropia
1
650
Pairs JP Team's iOS Deployment
johnestropia
0
950
Making Slackbots deploy iOS apps for you
johnestropia
2
230
OSSの作法(本題)
johnestropia
0
80
Other Decks in Technology
See All in Technology
Oracle Cloud Infrastructure IaaS 新機能アップデート 2025/09 - 2025/11
oracle4engineer
PRO
0
160
JEDAI認定プログラム JEDAI Order 2026 エントリーのご案内 / JEDAI Order 2026 Entry
databricksjapan
0
140
コンテキスト情報を活用し個社最適化されたAI Agentを実現する4つのポイント
kworkdev
PRO
1
1.5k
まだ間に合う! Agentic AI on AWSの現在地をやさしく一挙おさらい
minorun365
5
310
[デモです] NotebookLM で作ったスライドの例
kongmingstrap
0
160
AlmaLinux + KVM + Cockpit で始めるお手軽仮想化基盤 ~ 開発環境などでの利用を想定して ~
koedoyoshida
0
110
プロンプトやエージェントを自動的に作る方法
shibuiwilliam
13
11k
re:Invent 2025 ~何をする者であり、どこへいくのか~
tetutetu214
0
220
業務のトイルをバスターせよ 〜AI時代の生存戦略〜
staka121
PRO
2
220
今からでも間に合う!速習Devin入門とその活用方法
ismk
1
750
文字列の並び順 / Unicode Collation
tmtms
3
610
RAG/Agent開発のアップデートまとめ
taka0709
0
190
Featured
See All Featured
How to train your dragon (web standard)
notwaldorf
97
6.4k
The Art of Programming - Codeland 2020
erikaheidi
56
14k
The Illustrated Children's Guide to Kubernetes
chrisshort
51
51k
Chrome DevTools: State of the Union 2024 - Debugging React & Beyond
addyosmani
9
1k
JavaScript: Past, Present, and Future - NDC Porto 2020
reverentgeek
52
5.8k
[RailsConf 2023] Rails as a piece of cake
palkan
58
6.2k
Statistics for Hackers
jakevdp
799
230k
Docker and Python
trallard
47
3.7k
How to Think Like a Performance Engineer
csswizardry
28
2.4k
Intergalactic Javascript Robots from Outer Space
tanoku
273
27k
Improving Core Web Vitals using Speculation Rules API
sergeychernyshev
21
1.3k
Templates, Plugins, & Blocks: Oh My! Creating the theme that thinks of everything
marktimemedia
31
2.6k
Transcript
Making Sense of Neural Network Training Pivotal (2018/02/20)
@JohnEstropia Developer since 2008 (mostly Mobile apps) Principal Engineer @
Other hobby projects http://github.com/JohnEstropia/CoreStore
Today's talk My motivation with Machine Learning Rundown of Neural
Networks in image recognition Some interesting insights
Why I started using ML
Who's that Pokemon? PokeRater's image processing
Optical Character Recognition (Tesseract) PokeRater's image processing
Current solution
Current (incomplete) solution
Other issues with traditional OCRs Infinite possibilities of misreads PLKACHUʢPIKACHUʣ
ϏΨνϡϫʢϐΧνϡʣ Non-English OCRs are not reliable Pokemon names are in 9 languages
Neural Networks
"Charmander" Neurons = Cells
"Pikachu" Neurons = Cells
"Pikachu" "Charmander" Neurons = Weights (of features)
Neurons = Weights (of features) *Clip: The Game Theorists (Youtube
channel)
Neurons = Weights (of features) Features extracted using Convolution filters
Training a Neural Network
Common Neural Network Creation Flow Front-end Back-end Model
Common Neural Network Creation Flow Front-end: - Training code (usually
Python) - Loads and processes all training images - Template codes are abundant! (most NNs are set up very similarly)
Common Neural Network Creation Flow Back-end: Computes and builds the
"weights" network
Common Neural Network Creation Flow Model file: What apps will
use Example: Core ML
Insights on Neural Network Concepts
Training a Neural Network Teaching a kid From here on
we’ll call Neural Network “N-chan”
Tons of images (100~ each) "Pikachu" "Charmander"
Training data =~ Flash cards
Teaching = Repetition Takes about a day on decent-sized data
size GPU hardware is recommended
Repetitions → Misunderstandings Depending on our training data (or lack
thereof), N-chan may misunderstand some things “Overfitting” Three
Countering Overfitting: “Dropout” Randomly force N-chan to “forget” a learned
item Good example: Math Exams memorizing is not necessarily a good thing
Countering Overfitting: Optimizers Tweak the "learning rate" Example: N-chan is
studying for an exam Read all book chapters then take a mock exam (slow but extensive) Take a mock exam then check the answers (trial and error)
Countering Overfitting: Optimizers 0% accuracy 100% accuracy loss (noise) loss
(noise) loss (noise) speed = learning rate
Today's Key Points Neural Networks are better at analyzing unknown
data than traditional image recognition systems (ex: OCR) Many template projects for training Neural Networks exist (esp. Keras) Training Neural Networks is like teaching a kid
References https://shibberu.com/2016/04/26/ma-490-deep-learning/ https://www.youtube.com/watch?v=ZCPauvMxV7Q&t=568s https://blog.keras.io/building-powerful-image-classification- models-using-very-little-data.html https://adeshpande3.github.io/A-Beginner%27s-Guide-To- Understanding-Convolutional-Neural-Networks/ http://cs231n.github.io/convolutional-networks/#overview
Thanks!