
Face Recognition with Vision & Core ML

Simone Civetta

June 20, 2018
Transcript

  1. Vision ▼ Face and face landmark detection ▼ Text detection ▼ Barcode recognition ▼ Feature tracking ▼ Custom Core ML models
  2. Face Rectangle Request

     let imageRequestHandler = VNImageRequestHandler(cvPixelBuffer: pixelBuffer,
                                                     orientation: .right,
                                                     options: [:])
     let request = VNDetectFaceRectanglesRequest()
     try imageRequestHandler.perform([request])
     guard let results = request.results as? [VNFaceObservation] else { return }
     for result in results {
         result.confidence  // 0...1
         result.landmarks   // Face landmarks
         result.boundingBox // Rectangle around the face
     }
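Vision reports `boundingBox` in a normalized coordinate space with a lower-left origin, so it has to be flipped and scaled before drawing over a UIKit view. A minimal sketch (the helper name is ours, not from the deck):

```swift
import UIKit

// Sketch: map Vision's normalized, lower-left-origin boundingBox into
// UIKit view coordinates (points, upper-left origin).
func viewRect(for boundingBox: CGRect, in viewSize: CGSize) -> CGRect {
    return CGRect(
        x: boundingBox.origin.x * viewSize.width,
        // Flip the y axis: Vision's origin is at the bottom-left.
        y: (1 - boundingBox.origin.y - boundingBox.height) * viewSize.height,
        width: boundingBox.width * viewSize.width,
        height: boundingBox.height * viewSize.height
    )
}
```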
  3. Core ML: Execution on CPU or GPU? Diving Deeper ▼ CPU or GPU chosen according to the device ▼ CPU execution runs on top of Accelerate framework
  4. Core ML: Execution on CPU or GPU? Diving Deeper ▼ CPU or GPU chosen according to the device ▼ CPU execution runs on top of Accelerate framework ▽ Abstraction for leveraging the vector-processing capabilities of the CPU
  5. Core ML: Execution on CPU or GPU? Diving Deeper ▼ CPU or GPU chosen according to the device ▼ CPU execution runs on top of Accelerate framework ▽ Abstraction for leveraging the vector-processing capabilities of the CPU ▼ iOS chooses for You /!\
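iOS normally picks the compute unit for you; with Core ML 2 (in beta at the time of this talk, mentioned in the closing slides) the choice can at least be constrained. A hedged sketch, assuming the iOS 12 `MLModelConfiguration` API; `modelURL` is a placeholder for a compiled model:

```swift
import CoreML

// Sketch (assumes iOS 12 / Core ML 2): restrict execution to the CPU
// instead of letting iOS decide. `modelURL` points at a compiled .mlmodelc.
let config = MLModelConfiguration()
config.computeUnits = .cpuOnly // or .cpuAndGPU / .all
let model = try MLModel(contentsOf: modelURL, configuration: config)
```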
  6. Core ML Tools Diving Deeper

     import coremltools
     from inception_resnet_v1 import *

     model = InceptionResNetV1(weights_path='facenet_keras_weights.h5')
     coreml_model = coremltools.converters.keras.convert(
         model,
         input_names="image",
         image_input_names="image",
         output_names="output"
     )
     coreml_model.save('facenet_keras_weights_coreml.mlmodel')
  7. Core ML Tools: Input Normalization and Conversion Diving Deeper

     import coremltools
     from inception_resnet_v1 import *

     model = InceptionResNetV1(weights_path='facenet_keras_weights.h5')
     coreml_model = coremltools.converters.keras.convert(
         model,
         input_names="image",
         image_input_names="image",
         output_names="output",
         add_custom_layers=True,
         image_scale=2/255.0,
         red_bias=-1,
         green_bias=-1,
         blue_bias=-1,
         custom_conversion_functions={"Lambda": convert_lambda}
     )
     coreml_model.save('facenet_keras_weights_coreml.mlmodel')
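The `image_scale` and per-channel bias values above implement the normalization pixel' = pixel * (2/255) - 1, mapping 0...255 pixel values onto the -1...1 range the FaceNet weights expect. A quick arithmetic check of the two extremes:

```swift
// Each pixel is scaled then biased: pixel * (2/255) + (-1).
let scale = 2.0 / 255.0
let bias = -1.0
let black = 0.0 * scale + bias   // -> -1.0 (lower bound of the range)
let white = 255.0 * scale + bias // ->  1.0 (upper bound of the range)
```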
  8. Core ML in Swift Diving Deeper

     let model = try VNCoreMLModel(for: facenet_keras_weights_coreml().model)
     let coreMLRequest = VNCoreMLRequest(model: model)

     // Options
     coreMLRequest.imageCropAndScaleOption = .scaleFit
     // ...
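To actually run the request and read back the face embedding, the request would be performed through an image request handler. A sketch; the observation handling is our assumption that the converted FaceNet model has a single multi-array output:

```swift
import Vision

// Sketch: run the FaceNet request on a camera frame and extract the
// embedding vector for comparison against known faces.
let handler = VNImageRequestHandler(cvPixelBuffer: pixelBuffer,
                                    orientation: .right,
                                    options: [:])
try handler.perform([coreMLRequest])
if let observation = coreMLRequest.results?.first as? VNCoreMLFeatureValueObservation,
   let embedding = observation.featureValue.multiArrayValue {
    // Compare embeddings (e.g. Euclidean distance) to identify the person.
}
```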
  9. Diving Deeper Easier Said Than Done ▼ Input Normalization ▼ Custom Layers ▽ Performance
  10. Custom Layers: No Optimization

     @objc(scaling) class Scaling: NSObject, MLCustomLayer {
         // ...
         func evaluate(inputs: [MLMultiArray], outputs: [MLMultiArray]) throws {
             for i in 0..<inputs.count {
                 let input = inputs[i]
                 let output = outputs[i]
                 assert(input.shape == output.shape)
                 for j in 0..<input.count {
                     let x = input[j].doubleValue
                     let y = x * scale
                     output[j] = NSNumber(value: y)
                 }
             }
         }
     }
  11. Custom Layers: Using Accelerate.framework

     @objc(scaling) class Scaling: NSObject, MLCustomLayer {
         // ...
         func evaluate(inputs: [MLMultiArray], outputs: [MLMultiArray]) throws {
             var scale = Float(self.scale)
             for i in 0..<inputs.count {
                 let input = inputs[i]
                 let output = outputs[i]
                 assert(input.shape == output.shape)
                 let count = input.count
                 let inputPointer = UnsafeMutablePointer<Float>(OpaquePointer(input.dataPointer))
                 let outputPointer = UnsafeMutablePointer<Float>(OpaquePointer(output.dataPointer))
                 vDSP_vsmul(inputPointer, 1, &scale, outputPointer, 1, vDSP_Length(count))
             }
         }
     }
  12. Custom Layers: Using Metal Shader (GPU)

     #include <metal_stdlib>
     using namespace metal;

     kernel void scaling(
         texture2d_array<half, access::read> inTexture [[texture(0)]],
         texture2d_array<half, access::write> outTexture [[texture(1)]],
         constant float& scale [[buffer(0)]],
         ushort3 gid [[thread_position_in_grid]])
     {
         if (gid.x >= outTexture.get_width() || gid.y >= outTexture.get_height()) {
             return;
         }
         const float4 x = float4(inTexture.read(gid.xy, gid.z));
         const float4 y = x * scale;
         outTexture.write(half4(y), gid.xy, gid.z);
     }
  13. Custom Layers: Using Metal Shader (GPU)

     @objc(scaling) class Scaling: NSObject, MLCustomLayer {
         // ...
         if let encoder = commandBuffer.makeComputeCommandEncoder() {
             for i in 0..<inputs.count {
                 encoder.setTexture(inputs[i], index: 0)
                 encoder.setTexture(outputs[i], index: 1)
                 var scale = self.scale
                 encoder.setBytes(&scale, length: MemoryLayout<Float>.size, index: 0)
                 encoder.dispatch(pipeline: scalingPipeline, texture: inputs[i])
             }
             encoder.endEncoding() // end encoding once, after all dispatches
         }
     }
  14. Adding 3D Objects

     func paintFaceGeometry(at rect: CGRect, personIdentifier: String) {
         let targetRect = rect.transformed(to: arSceneView.frame.size)
         let targetRectCenter = CGPoint(x: targetRect.midX, y: targetRect.midY)
         guard let point = findAverageHitTest(for: targetRectCenter) else { return }
  15. Adding 3D Objects

     func paintFaceGeometry(at rect: CGRect, personIdentifier: String) {
         let targetRect = rect.transformed(to: arSceneView.frame.size)
         let targetRectCenter = CGPoint(x: targetRect.midX, y: targetRect.midY)
         guard let point = findAverageHitTest(for: targetRectCenter) else { return }
         let pointerNode = SCNNode.createPointerNode(text: personIdentifier)
         pointerNode.position = point
  16. Adding 3D Objects

     func paintFaceGeometry(at rect: CGRect, personIdentifier: String) {
         let targetRect = rect.transformed(to: arSceneView.frame.size)
         let targetRectCenter = CGPoint(x: targetRect.midX, y: targetRect.midY)
         guard let point = findAverageHitTest(for: targetRectCenter) else { return }
         let pointerNode = SCNNode.createPointerNode(text: personIdentifier)
         pointerNode.position = point
         let constraint = SCNBillboardConstraint()
         pointerNode.constraints = [constraint] // keep the label facing the camera
         arSceneView.scene.rootNode.addChildNode(pointerNode)
     }
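`findAverageHitTest` is not shown in the deck; a hypothetical implementation, consistent with its name, could average several ARKit feature-point hit tests to get a steadier anchor for the label:

```swift
import ARKit
import SceneKit

// Hypothetical helper (not from the deck): average the world positions
// of all feature-point hit tests at the given screen point.
func findAverageHitTest(for point: CGPoint) -> SCNVector3? {
    let results = arSceneView.hitTest(point, types: .featurePoint)
    guard !results.isEmpty else { return nil }
    var sum = SCNVector3Zero
    for result in results {
        let t = result.worldTransform.columns.3
        sum = SCNVector3(sum.x + t.x, sum.y + t.y, sum.z + t.z)
    }
    let n = Float(results.count)
    return SCNVector3(sum.x / n, sum.y / n, sum.z / n)
}
```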
  17. Summing Up What’s Next ▼ Optimizing Model Size ▽ Quantization ▼ Better Performance ▼ Core ML 2 (Beta) ▼ Create ML
  18. Summing Up Recap ▼ What is DL ▼ How to use DL for Face Recognition ▼ How to import a face recognition model inside a mobile app ▼ How to make use of an ML model to create an AR experience on a modern phone