Heat the Neurons of Your Smartphone with Deep Learning

jinqian
April 10, 2017

[AndroidMakers 2017] TensorFlow on Android: Magritte is an educational application that lets people learn a language with on-device deep learning.

Demo video: https://www.youtube.com/watch?v=FCzrWMmrbYQ
Talk video: https://www.youtube.com/watch?v=znjGEW6pESQ

Transcript

  1. Heat the Neurons of Your
    Smartphone with Deep Learning
    Qian Jin @bonbonking
    Yoann Benoit @yoannbenoit
    AndroidMakers Paris | 10th April 2017

  2. On-Device Intelligence

  3. 3
    Image Credit: Google Research Blog

  4. 4

  5. “In my 34 years in the semiconductor industry, I have witnessed the advertised
    death of Moore’s Law no less than four times. As we progress from 14 nanometer
    technology to 10 nanometer and plan for 7 nanometer and 5 nanometer and even
    beyond, our plans are proof that Moore’s Law is alive and well.”
    5

  6. 6
    Ref: https://www.qualcomm.com/news/snapdragon/2017/01/09/tensorflow-machine-learning-now-optimized-snapdragon-835-and-hexagon-682

  7. 7
    Ref: https://9to5google.com/2017/01/10/qualcomm-snapdragon-835-machine-learning-tensorflow/

  8. The ultimate goal of on-device
    intelligence is to improve mobile
    devices’ ability to understand the world.
    8

  9. Magritte
    Ceci n’est pas une pomme. (This is not an apple.)

  10. 10

  11. #datamobile
    Chat History of the Slack channel
    11

  12. 12

  13. Build TensorFlow
    Android Example With Bazel
    13

  14. 14

  15. Android Developer
    Deep Learning Noob

  16. NEURONS
    NEURONS EVERYWHERE
    16

  17. WE CAN RECOGNIZE ALL
    THE THINGS!
    17

  18. I THOUGHT THERE WERE
    MODELS FOR EVERYTHING...
    18

  19. Neural Networks in a Nutshell

  20. Here’s a Neural Network
    20

  21. Prediction on an image - Inference
    21

  22. Prediction on an image - Inference
    22

  23. Prediction on an image - Inference
    Apple: 0.98
    Banana: 0.02
    23
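    The two numbers behave like probabilities because the last layer of a classifier like this is typically a softmax. As a reminder (z_i is the raw score for class i; the notation is ours, not from the slides):
    p_i = exp(z_i) / sum_j exp(z_j)
    so the outputs are positive and sum to 1, as in 0.98 + 0.02 here.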

  24. Training a model

  25. 25

  26. Training a model - Back Propagation
    26

  27. Training a model - Back Propagation
    Apple: 0.34
    Banana: 0.66
    27

  28. Training a model - Back Propagation
    Apple: 0.34
    Banana: 0.66
    Prediction
    error
    28

  29. Training a model - Back Propagation
    Apple: 0.34
    Banana: 0.66
    Prediction
    error
    29

  30. Training a model - Back Propagation
    Apple: 0.34
    Banana: 0.66
    Prediction
    error
    30

  31. Training a model - Back Propagation
    Apple: 0.27
    Banana: 0.73
    31
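    Each back-propagation pass computes the prediction error and nudges every weight a small step against its gradient. As a one-line summary (learning rate η and loss L are standard notation, not from the slides):
    w <- w - η * ∂L/∂w
    Repeating this over many labelled images is what moves the prediction from apple 0.34 / banana 0.66 towards apple 0.27 / banana 0.73.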

  32. Transfer Learning

  33. Deep Convolutional Neural Network
    33

  34. Transfer Learning
    • Use a pre-trained Deep Neural Network
    • Keep all operations but the last one
    • Re-train only the last operation to specialize your network to your classes
    Keep all weights identical
    except these ones
    34

  35. Save the model
    • 2 things to save
    • Execution graph
    • Weights for each operation
    • 2 outputs
    • Model as protobuf file
    • Labels in text file
    35
    model.pb label.txt
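    A minimal sketch of how the Android app can load these two files, assuming they are bundled under assets/ and that the TensorFlowInferenceInterface from the TensorFlow Android contrib library (the same feed/run/fetch API used later in this deck) is on the classpath; the asset path prefix and class name are assumptions:

    import android.content.Context;
    import org.tensorflow.contrib.android.TensorFlowInferenceInterface;
    import java.io.BufferedReader;
    import java.io.IOException;
    import java.io.InputStreamReader;
    import java.util.ArrayList;
    import java.util.List;

    final class ModelLoader {
        // Reads label.txt: one class name per line, in the order the network outputs them.
        static List<String> loadLabels(Context context) throws IOException {
            List<String> labels = new ArrayList<>();
            try (BufferedReader reader = new BufferedReader(
                    new InputStreamReader(context.getAssets().open("label.txt")))) {
                String line;
                while ((line = reader.readLine()) != null) {
                    labels.add(line);
                }
            }
            return labels;
        }

        // Loads the frozen graph shipped as model.pb in the APK assets.
        static TensorFlowInferenceInterface loadModel(Context context) {
            return new TensorFlowInferenceInterface(
                    context.getAssets(), "file:///android_asset/model.pb");
        }
    }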

  36. java.lang.UnsupportedOperationException:
    Op BatchNormWithGlobalNormalization is not
    available in GraphDef version 21.
    36

  37. Unsupported Operation
    • Only keep the operations dedicated to the inference step
    • Remove decoding, training, loss and evaluation operations
    37

  38. Data Scientist
    Android Development Noob

  39. CLICK 7 TIMES ON BUILD NUMBER
    39

  40. Build Standalone App

  41. Standalone App
    • Use nightly build
    • Library .so
    • Java API jar

    android {
        //…
        sourceSets {
            main {
                jniLibs.srcDirs = ['libs']
            }
        }
    }
    41
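    Depending on the version of the inference library, the native .so may also have to be loaded explicitly before the first call into TensorFlow. A hypothetical snippet (the library name depends on the nightly artifact you ship):

    static {
        // Loads libtensorflow_inference.so from the jniLibs directory configured above.
        System.loadLibrary("tensorflow_inference");
    }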

  42. App size ~80MB
    42

  43. Reducing model size

  44. WHO CARES?
    MODEL SIZE
    44

  45. Model Size
    All weights are stored as they are (32-bit floats) => 80 MB
    45

  46. Weights quantization
    6.372638493746383 => 6.4
    80 MB => 20 MB
    46

  47. Architecture Underneath

  48. Android SDK (Java) / Android NDK (C++): data flow of the classifier
    1. Camera Preview -> Image (Bitmap) -> Classifier Implementation (Java)
    2. input_tensor -> TensorFlow JNI wrapper (C++) running the Trained Model
    3. top_results returned to the Java side
    4. Classifications + Confidence -> Overlay Display
    Ref: https://jalammar.github.io/Supercharging-android-apps-using-tensorflow/
    48
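    The boundary between the two sides is easiest to read as an interface. A minimal sketch, loosely following the Classifier interface of the TensorFlow Android example (field and method names here are illustrative):

    import android.graphics.Bitmap;
    import java.util.List;

    public interface Classifier {
        // One recognized class with its confidence, e.g. "apple" / 0.98f.
        class Recognition {
            public final String title;
            public final Float confidence;

            public Recognition(String title, Float confidence) {
                this.title = title;
                this.confidence = confidence;
            }
        }

        // Runs inference on one camera frame and returns the top results.
        List<Recognition> recognizeImage(Bitmap bitmap);
    }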

  49. Image Sampling
    Get image from camera preview -> Crop the center square -> Resize sampled image
    49
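    A minimal sketch of that sampling step with the Android Bitmap API (the helper name and the exact input size are ours; the size depends on the network, e.g. 299 for Inception v3):

    import android.graphics.Bitmap;

    public final class ImageSampler {
        // Crops the largest centered square out of the preview frame,
        // then scales it to the size the network expects.
        public static Bitmap centerCropAndResize(Bitmap frame, int inputSize) {
            int size = Math.min(frame.getWidth(), frame.getHeight());
            int x = (frame.getWidth() - size) / 2;
            int y = (frame.getHeight() - size) / 2;
            Bitmap square = Bitmap.createBitmap(frame, x, y, size, size);
            return Bitmap.createScaledBitmap(square, inputSize, inputSize, true);
        }
    }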

  50. Converts YUV420 to ARGB8888
    public static native void convertYUV420ToARGB8888(
        byte[] y,
        byte[] u,
        byte[] v,
        int[] output,
        int width,
        int height,
        int yRowStride,
        int uvRowStride,
        int uvPixelStride,
        boolean halfSize);
    50

  51. Create Input Tensor From RGB values
    // Preprocess the image data from 0-255 int to normalized float based
    // on the provided parameters.
    bitmap.getPixels(intValues, 0, bitmap.getWidth(), 0, 0,
        bitmap.getWidth(), bitmap.getHeight());
    for (int i = 0; i < intValues.length; ++i) {
        final int val = intValues[i];
        floatValues[i * 3 + 0] = (((val >> 16) & 0xFF) - imageMean) / imageStd;
        floatValues[i * 3 + 1] = (((val >> 8) & 0xFF) - imageMean) / imageStd;
        floatValues[i * 3 + 2] = ((val & 0xFF) - imageMean) / imageStd;
    }
    inferenceInterface.feed(inputName, floatValues, 1, inputSize, inputSize, 3);
    51
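    The feed() call above only stages the input. A minimal sketch of the remaining steps, assuming the feed/run/fetch API of TensorFlowInferenceInterface, an output node name such as "final_result" (it depends on the trained graph), the labels list loaded from label.txt, and the Recognition class sketched earlier:

    // Run the graph up to the output node, then copy the class scores out.
    inferenceInterface.run(new String[] {outputName});
    float[] outputs = new float[labels.size()];
    inferenceInterface.fetch(outputName, outputs);

    // Rank the labels by confidence; the head of the queue is the best guess,
    // e.g. "apple: 0.98".
    PriorityQueue<Classifier.Recognition> topResults = new PriorityQueue<>(
            outputs.length, (a, b) -> Float.compare(b.confidence, a.confidence));
    for (int i = 0; i < outputs.length; ++i) {
        topResults.add(new Classifier.Recognition(labels.get(i), outputs[i]));
    }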

  52. Adding new models

  53. Adding a new model
    53
    2 * 20 MB = 40 MB

  54. Model Stacking
    54
    • Start from previous model to keep all specific operations in the graph
    • Specify all operations to keep when optimizing for inference
    graph_util.convert_variables_to_constants(sess, graph.as_graph_def(),
        ["final_result_fruits", "final_result_vegetables"])

  55. Demo Time

  56. 56

  57. What’s next?
    57

  58. Next: Pulling Model From the Cloud
    58
    Ref: https://www.youtube.com/watch?v=EnFyneRScQ8

  59. Next: Federated Learning
    Collaborative Machine Learning without Centralized Training Data
    59
    Ref: https://research.googleblog.com/2017/04/federated-learning-collaborative.html

  60. Thank you! Questions?
    GitHub: https://github.com/xebia-france/magritte
