Bootstrapping the Machine Learning Training Process

AppBuilders (https://appbuilders.ch)
April 2018
Lugano, Switzerland

When is it valuable to consider using machine learning to help solve your iOS app’s problems? We will go through criteria to evaluate that, practical use cases of incorporating ML models on device, and how to bootstrap the training process for creating machine learning models that will run on device (iOS). Using techniques such as transfer learning and taking inspiration from the plentiful resources from industry and academia, you can get started doing this now. In this talk, we will go through some examples across different domains, and, yes, we will move beyond the dog / cat classifier.

Meghan Kane

April 17, 2018

Transcript

  1. ML vs. not ML
     basic unit of solving a problem = a function (the "model")
     ML: enable a machine to learn the function on its own, e.g. classify sign language alphabet images
     not ML: explicitly define the function, e.g. determine if a number is even/odd (a minimal code contrast follows below)
  2. What's available from Apple?
     image classification of 1000 common categories
     → trees, animals, food, vehicles, people
     → SqueezeNet (5 MB), MobileNet (17 MB), Inception V3 (95 MB), ResNet50 (103 MB), VGG16 (554 MB)
     scene classification of 205 categories
     → airport terminal, bedroom, forest, coast
     → Places205-GoogLeNet (25 MB)
  3. If not, train a custom ML model
     1. use a framework for training: TensorFlow, Keras, Turi Create, Caffe, etc. (⚠ warning, there are a lot of them)
     2. convert to the .mlmodel format with open source tools
        → coremltools: github.com/apple/coremltools
        → tf-coreml: github.com/tf-coreml
     (a hedged conversion sketch follows below)
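
     A minimal sketch of the conversion step with coremltools' Keras converter (2018-era API); the file names, input name, and preprocessing are my assumptions, not the talk's exact setup:

        import coremltools

        # Assumes a trained Keras model saved as 'asl_classifier.h5'
        # that takes a 224x224 RGB image as input.
        mlmodel = coremltools.converters.keras.convert(
            'asl_classifier.h5',
            input_names=['image'],
            image_input_names=['image'],   # expose the input as an image (CVPixelBuffer on iOS)
            class_labels='labels.txt',     # one class label per line
            image_scale=1 / 255.0,         # match the preprocessing used during training
        )

        mlmodel.author = 'Your name here'
        mlmodel.short_description = 'ASL alphabet classifier (transfer learned from MobileNet)'
        mlmodel.save('ASLClassifier.mlmodel')   # drag this file into your Xcode project
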
  4. Some questions you may have
     is it practical for my app?
     barriers to entry? data, compute power, PhD, time?
     which framework should I use for training?
     how do I navigate all of the material out there?
  5. Is it practical for my app?
     image classification, audio classification, speech recognition, text classification,
     object localization, gesture recognition, optical character recognition (OCR), translation
  6. Mobile-specific concerns
     1. size of the model
     2. time it takes to run predictions
     3. supported layers
     (a sketch for checking the first and third follows below)
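
     As a hedged sketch (my addition), you can check a converted model's file size and its layer types from Python with coremltools; the file name is an assumption, and the layer listing only applies to neural network model types:

        import os
        import coremltools

        path = 'ASLClassifier.mlmodel'   # hypothetical converted model

        # 1. Size of the model: it ships inside your app bundle.
        print('Model size: %.1f MB' % (os.path.getsize(path) / 1e6))

        # 3. Supported layers: list the layer types the network uses, then check
        # them against the Core ML layers available on your minimum iOS version.
        spec = coremltools.utils.load_spec(path)
        if spec.WhichOneof('Type') == 'neuralNetworkClassifier':
            for layer in spec.neuralNetworkClassifier.layers:
                print(layer.name, '->', layer.WhichOneof('layer'))

        # 2. Prediction time is best measured on an actual device (e.g. with Instruments).
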
  7. End-to-end process as a developer
     1. define the problem
     2. gather data
     3. train the ML model
     4. convert to Core ML (.mlmodel)
     5. import into your Xcode project
     6. run predictions using the Core ML framework
     (a sanity-check sketch between steps 4 and 5 follows below)
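
     One optional check between steps 4 and 5 (my addition, not from the slides): coremltools can run the converted model on macOS, so you can verify predictions before importing it into Xcode; the file names and input name are assumptions:

        import coremltools
        from PIL import Image

        # Load the converted model and run a prediction on macOS.
        mlmodel = coremltools.models.MLModel('ASLClassifier.mlmodel')

        # Image inputs are passed as PIL images, resized to the model's expected input size.
        img = Image.open('test_letter_a.jpg').resize((224, 224))
        prediction = mlmodel.predict({'image': img})

        # Output keys depend on how the converter named them,
        # e.g. {'classLabel': 'A', ...} plus per-class probabilities.
        print(prediction)
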
  8. Barriers to entry?
     1. A large dataset
     2. Access to high-end compute power
     3. A PhD in machine learning
     4. All the time in the world
     ...nope!
  9. Enter... transfer learning
     use knowledge learned from a source task (MobileNet) → to train a target task (ASL classifier)
     (a hedged Keras sketch follows below)
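
     A minimal transfer learning sketch in Keras; the head architecture and hyperparameters are my assumptions, not necessarily what the talk used:

        from keras.applications.mobilenet import MobileNet
        from keras.layers import GlobalAveragePooling2D, Dense
        from keras.models import Model

        # Source task: MobileNet trained on ImageNet, without its classification head.
        base = MobileNet(weights='imagenet', include_top=False, input_shape=(224, 224, 3))

        # Freeze the pretrained layers so only the new head is trained at first.
        for layer in base.layers:
            layer.trainable = False

        # Target task: a new head for the 26 letters of the ASL alphabet.
        x = GlobalAveragePooling2D()(base.output)
        x = Dense(128, activation='relu')(x)
        predictions = Dense(26, activation='softmax')(x)

        model = Model(inputs=base.input, outputs=predictions)
        model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

        # model.fit(...) on your (much smaller) labeled ASL dataset.
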
  10. Quick Review: Deep Learning
      a neural network model with many layers
      deep = many layers → deep neural network
      https://heartbeat.fritz.ai/mobile-machine-learning-101-glossary-7a4ee36e0a1a
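
     Just to illustrate "many layers" (my addition, not a slide), a tiny Keras example; the layer sizes are arbitrary:

        from keras.models import Sequential
        from keras.layers import Dense

        # "Deep" simply means several layers stacked on top of each other.
        model = Sequential([
            Dense(64, activation='relu', input_shape=(100,)),
            Dense(64, activation='relu'),
            Dense(64, activation='relu'),
            Dense(10, activation='softmax'),
        ])
        model.summary()
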
  11. Still some (more recent) time in B.C.
      people stand on the shoulders of giants' work, utilizing transfer learning
  12. What if I don't have enough data? Data augmentation!
      deeplearning.ai video lesson: https://www.youtube.com/watch?v=JI8saFjK84o
      (a hedged Keras sketch follows below)
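
     A hedged sketch of image data augmentation with Keras' ImageDataGenerator; the transform ranges and directory layout are my assumptions:

        from keras.preprocessing.image import ImageDataGenerator

        # Create extra training examples by randomly transforming the images you already have.
        datagen = ImageDataGenerator(
            rotation_range=15,       # small random rotations
            width_shift_range=0.1,   # random horizontal shifts
            height_shift_range=0.1,  # random vertical shifts
            zoom_range=0.1,          # random zoom in/out
            horizontal_flip=False,   # flipping would change the meaning of many ASL signs
            rescale=1 / 255.0,
        )

        # Stream augmented batches from a directory with one subfolder per class.
        train_generator = datagen.flow_from_directory(
            'data/asl_train',        # hypothetical path
            target_size=(224, 224),
            batch_size=32,
            class_mode='categorical',
        )

        # model.fit_generator(train_generator, epochs=10)   # 2018-era Keras training call
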
  13. Performance: so how did our training go?
      ~20 min to run on my MacBook
      95.3% accuracy on the test data
  14. Convert .wav to spectrogram: represent audio as an image (3 dimensions)
      1st dimension: time (x-axis)
      2nd dimension: frequency (y-axis)
      3rd dimension: sound intensity (color)
      (a hedged librosa sketch follows below)
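
     A hedged sketch of the wav-to-spectrogram step; the slides do not name a library, so librosa and the file names here are my assumptions:

        import numpy as np
        import librosa
        import librosa.display
        import matplotlib.pyplot as plt

        # Load the audio file (librosa resamples to 22,050 Hz by default).
        y, sr = librosa.load('sample.wav')   # hypothetical file name

        # Mel spectrogram: time on the x-axis, frequency on the y-axis.
        S = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=128)
        S_db = librosa.power_to_db(S, ref=np.max)   # intensity in decibels

        # Render it as an image, with color encoding sound intensity,
        # so it can be fed to an image classifier.
        plt.figure(figsize=(4, 4))
        librosa.display.specshow(S_db, sr=sr, x_axis='time', y_axis='mel')
        plt.axis('off')
        plt.savefig('sample_spectrogram.png', bbox_inches='tight', pad_inches=0)
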
  15. Performance: so how did our training go?
      ~1 hour to run on my MacBook
      77.1% accuracy on the test data
  16. Reproduce results
      research papers often include this
      make sure to do the same if you publish
      check licensing + give proper attribution
  17. Review
      is it practical for my app?
      barriers to entry? data, compute power, PhD, time?
      which framework should I use for training?
      how do I navigate all of the material out there?