The State of ML for iOS: On the Advent of WWDC 2018 🕯

The State of ML for iOS On the Advent of
WWDC 2018 Meghan Kane, @meghafon NSLondon May 2018

! Hey, I'm Meghan! @meghafon iOS Engineer @ Novoda Berlin

wwdc 2018

! big picture " when is it practical to use
ML for iOS? # what's available to us? $ end-to-end examples !

barriers to entry? 1. A large dataset 2. Access to
high end compute power 3. PhD in machine learning 4. All the time in the world ...nope!

Is it practical for my app? image classification audio classification
speech recognition text classification gesture recognition optical character recognition (OCR) translation voice synthesis

embrace idea generation & experimentation

is it just hype?

machine learning is a powerful tool but, it is still
just another tool

how can we think about ML as ! developers?

Can this be solved without ML? if so, choose that

ML vs not ML basic unit of solving problem =
function ("model") ML: enabling a machine to learn function on its own classify sign language alphabet images not ML: explicitly deﬁning function determining if a number is even/odd

If you decide to use ML still go with the
simplest solution

Why do ML (predictions) on mobile? → low latency user
experience → user privacy

What's available from Apple? image classiﬁcation of 1000 common categories
→ trees, animals, food, vehicles, people → SqueezeNet (5 MB), MobileNet (17 MB), Inception V3 (95 MB), ResNet50 (103 MB), VGG16 (554 MB) scene classiﬁcation of 205 categories → airport terminal, bedroom, forest, coast → Places205-GoogLeNet (25 MB)

If not, train custom ML model step 1: use framework
for training TensorFlow, keras, Turi Create , Caffe, etc ⚠ warning, there are a lot of them step 2: convert to .mlmodel format (OSS) →  coremltools github.com/apple/coremltools → tf-coreml github.com/tf-coreml

It has been quite a year

beyond the cat/dog classiﬁer (TM)

End-to-end Process as a developer? 0. Deﬁne problem 1. Collect
data 2. Train ML model 3. Convert to coreml .mlmodel 4. Import into Xcode project 5. Predict using Core ML (+Vision) framework

Mobile speciﬁc concerns size of model time it takes to
run predictions supported layers

examples!

0.Deﬁne problem American Sign Language (ASL) alphabet classiﬁer

1.Collect data !

2. Train model

Quick Review: Deep Learning neural network model with many layers
deep = many layers -> deep neural network Mobile Machine Learning 101: Glossary Jameson Toole on Heartbeat blog

sometime way back in B.C. people used to train deep
neural network from scratch

still some (more recent) time in B.C. people stand on
the shoulders of giants' work utilizing transfer learning

enter.. transfer learning ! use knowledge learned from source task
(MobileNet) --> to train target task (ASL classiﬁer)

don't reinvent the wheel

Why Transfer Learning works neural networks are universal approximators in
theory, they can approximate any function

how much data do i need? depends on problem just
100s images per category

where can i get it? kaggle google for them... record
video + extract frames (using e.g. FFmpeg)

what if i don't have enough? data augmentation! Deeplearning.ai: C4W2L10
Data Augmentation (~10 min video)

Let's start training... → can we use Swift for Tensor
Flow? → for now, stick with regular Tensor Flow

confession: this is in python

Performance so how did our training go? ~20 min to
run on my MacBook 95.3% accuracy on the test data

3. Convert to .mlmodel tf-coreml

It just works !

4. Import into Xcode project drag + drop

It actually just works

5. Predict using Core ML (+Vision) framework vision + core
ml

audio classiﬁcation 0.Deﬁne problem 1. Gather data 2. Train ML
model

0.Deﬁne problem Audio classiﬁer of urban sounds air conditioner, car
horn, children playing, drilling, siren, etc

1.Gather data UrbanSound 8K open dataset Urban Sound Datasets, NYU
CUSP

should we use raw audio (.wav)? no, it's too computationally
expensive

convert wav to spectrogram represent audio as image (3 dimensions)
1st dimension: time (x-axis) 2nd dimension: frequency (y-axis) 3rd dimension: sound intensity (color)

2. Train Model

Performance so how did our training go? ~1 hour to
run on my MacBook 77.1% accuracy on the test data

what to focus on

Where to ﬁnd inspiration look at open datasets read research
papers! follow heartbeat blog, openAI

Reproduce results research papers often include this make sure to
do the same if you publish check licensing + attribute proper credit

Looking forward to the future ML interpretability swift for TensorFlow

Review ! big picture " when is it practical to
use ML for iOS? # what's available to us? $ end-to-end examples

Attributions & Mentions (1/4) Apple Machine Learning WWDC 2017 Videos
TensorFlow for Poets Google codelabs tutorial Apple coremltools GitHub repo tf-coreml GitHub repo: TensorFlow->core ml converter

Attributions & Mentions (2/4) Heartbeat by fritz.ai blog: Machine Learning
at the edge ASL Datasets Kaggle Sign Language MNIST Urban Sound Datasets, NYU CUSP deeplearning.ai course: Data Augmentation

Attributions & Mentions (3/4) Swift for TensorFlow GitHub repo Dockerized
Swift for TF GitHub repo, Alexis Gallager themorningpaper by Adrian Colyer OpenAI Research "The Building Blocks of Interpretability" Google: C. Olah et al

Attributions & Mentions (4/4) "Strategically Ignorant" Devon Zuegel "Transfer Learning
of Temporal Information for Driver Action Classiﬁcation" J. Lemley et al "Transfer Learning for Sound Classiﬁcation" TataLab

Further Learning (1/3) fast.ai Deep Learning course My Udacity Core
ML course machinethink, ! ML for iOS blog by Matthijs Hollemans TensorFlow Dev Summit 2018 Videos TensorFlow playground

Further Learning (2/3) Building Mobile Apps w/ Tensor Flow Pete
Warden Neural Networks & Deep Learning Michael Nielsen Stanford's Computer Vision course (CS231n)

Further Learning (3/3) "Distilling the Knowledge in a Neural Network"
Geoffrey Hinton et al. "Transfer Learning - Machine Learning's Next Frontier" ! Sebastian Ruder "Transfer learning for music classiﬁcation and regression tasks" ! Keunwoo Choi et al.

Thank you Keep in touch! twitter: @meghafon

The State of ML for iOS: On the Advent of WWDC ...

The State of ML for iOS: On the Advent of WWDC 2018 🕯

More Decks by Meghan Kane

Other Decks in Technology

Featured

Transcript