TensorFlow - La IA detrás de Google

+Israel Blancas @iblancasa TensorFlow La IA detrás de Google #DevFestGRX
GDG Granada

How Can You Get Started with Machine Learning? Three ways,
with varying complexity: (1) Use a Cloud-based or Mobile API (Vision, Natural Language, etc.) (2) Use an existing model architecture, and retrain it or fine tune on your dataset (3) Develop your own machine learning models for new problems More flexible, but more effort required #DevFestGRX GDG Granada

Cloud Machine Learning APIs See, Hear and Understand the world
#DevFestGRX GDG Granada

Cloud Natural Language Cloud Speech Cloud Translate Cloud Vision #DevFestGRX
GDG Granada

5 5 Vision API https://cloud.google.com/vision/ #DevFestGRX GDG Granada

Faces Faces, facial landmarks, emotions OCR Read and extract text,
with support for > 10 languages Label Detect entities from furniture to transportation Logos Identify product logos Landmarks & Image Properties Detect landmarks & dominant color of image Safe Search Detect explicit content - adult, violent, medical and spoof Cloud Vision API #DevFestGRX GDG Granada

API Usage: Detect Objects in an Image Image Detected Items
Vision API Create JSON request with the image or pointer to an image Process the JSON response Call the REST API 1 2 3 #DevFestGRX GDG Granada

8 8 Cloud Natural Language API https://cloud.google.com/natural-language/ #DevFestGRX GDG Granada

Confidential & Proprietary Google Cloud Platform 9 Cloud Natural Language
API Extract sentence, identify parts of speech and create dependency parse trees for each sentence. Identify entities and label by types such as person, organization, location, events, products and media. Understand the overall sentiment of a block of text. Syntax Analysis Entity Recognition Sentiment Analysis #DevFestGRX GDG Granada

10 10 Cloud Speech API Demo https://cloud.google.com/speech/ #DevFestGRX GDG Granada

Confidential & Proprietary Google Cloud Platform 11 Cloud Speech API
Automatic Speech Recognition (ASR) powered by deep learning neural networking to power your applications like voice search or speech transcription. Recognizes over 80 languages and variants with an extensive vocabulary. Returns partial recognition results immediately, as they become available. Filter inappropriate content in text results. Audio input can be captured by an application’s microphone or sent from a pre-recorded audio file. Multiple audio file formats are supported, including FLAC, AMR, PCMU and linear-16. Handles noisy audio from many environments without requiring additional noise cancellation. Audio files can be uploaded in the request and, in future releases, integrated with Google Cloud Storage. Automatic Speech Recognition Global Vocabulary Inappropriate Content Filtering Streaming Recognition Real-time or Buffered Audio Support Noisy Audio Handling Integrated API #DevFestGRX GDG Granada

Mobile Vision API Providing on-device vision for applications #DevFestGRX GDG
Granada

Face API faces, facial landmarks, eyes open, smiling Barcode API
1D and 2D barcodes Text API Latin-based text / structure Common Mobile Vision API Support for fast image and video on-device detection and tracking. #DevFestGRX GDG Granada

Googly Eyes Android App Video credit Google 1. Create a
face detector for facial landmarks (e.g., eyes) 3. For each face, draw the eyes FaceDetector detector = new FaceDetector.Builder() .setLandmarkType(FaceDetector.ALL_LANDMARKS) .build(); SparseArray<Face> faces = detector.detect(image); for (int i = 0; i < faces.size(); ++i) { Face face = faces.valueAt(i); for (Landmark landmark : face.getLandmarks()) { // Draw eyes 2. Detect faces in the image #DevFestGRX GDG Granada

Face API Photo credit developers.google.com/vision #DevFestGRX GDG Granada

Easy to use Java API image detected items Detector 1.
Create a detector object 2. detectedItems = detector.detect(image) Photo credit developers.google.com/vision #DevFestGRX GDG Granada

Text Detection Latin based language Understand text structure Photo credit
Getty Images #DevFestGRX GDG Granada

Text Structure Blocks Lines Words Lines Words Words Words #DevFestGRX
GDG Granada

Barcode Detection 1D barcodes EAN-13/8 UPC-A/E Code-39/93/128 ITF Codabar 2D
barcodes QR Code Data Matrix PDF-417 AZTEC UPC DataMatrix QR Code PDF 417 Video and image credit Google #DevFestGRX GDG Granada

Combined Vision & Translation #DevFestGRX GDG Granada

• Open source Machine Learning library • Especially useful for
Deep Learning • For research and production • Apache 2.0 license #DevFestGRX GDG Granada

Architecture • Core in C++ • Different front ends ◦
Python and C++ today, community may add more Core TensorFlow Execution System CPU GPU Android iOS ... C++ front end Python front end ... #DevFestGRX GDG Granada

Raspberry Pi Datacenters Your laptop Android iOS Portable & Scalable
#DevFestGRX GDG Granada

/Serving Open-source solution for serving trained models #DevFestGRX GDG Granada

From the whitepaper: “TensorFlow is an interface for expressing machine
learning algorithms, and an implementation for executing such algorithms.” In short: TensorFlow is Theano++. • Symbolic ML dataflow framework that compiles to native / GPU code What is TensorFlow? #DevFestGRX GDG Granada

Data Flow Graphs Computation is defined as a directed acyclic
graph (DAG) to optimize an objective function • Graph is defined in high-level language (Python) • Graph is compiled and optimized • Graph is executed (in parts or fully) on available low level devices (CPU, GPU) • Data (tensors) flow through the graph • TensorFlow can compute gradients automatically #DevFestGRX GDG Granada

Graph? #DevFestGRX GDG Granada

Variables are 0-ary stateful nodes which output their current value.
(State is retained across multiple executions of a graph.) (parameters, gradient stores, eligibility traces, …) Graph? #DevFestGRX GDG Granada

Placeholders are 0-ary nodes whose value is fed in at
execution time. (inputs, variable learning rates, …) Graph? #DevFestGRX GDG Granada

Mathematical operations: MatMul: Multiply two matrix values. Add: Add elementwise
(with broadcasting). ReLU: Activate with elementwise rectified linear function. Graph? #DevFestGRX GDG Granada

In code, please! 1. Create model weights, including initialization a.
W ~ Uniform(-1, 1); b = 0 2. Create input placeholder x a. m * 784 input matrix 3. Create computation graph import tensorflow as tf b = tf.Variable(tf.zeros((100,))) W = tf.Variable(tf.random_uniform((784, 100), -1, 1)) x = tf.placeholder(tf.float32, (None, 784)) h_i = tf.nn.relu(tf.matmul(x, W) + b) 1 2 3 #DevFestGRX GDG Granada

So far we have defined a graph. We can deploy
this graph with a session: a binding to a particular execution context (e.g. CPU, GPU) How do we run it? #DevFestGRX GDG Granada

Getting output sess.run(fetches, feeds) Fetches: List of graph nodes. Return
the outputs of these nodes. Feeds: Dictionary mapping from graph nodes to concrete values. Specifies the value of each graph node given in the dictionary. import numpy as np import tensorflow as tf b = tf.Variable(tf.zeros((100,))) W = tf.Variable(tf.random_uniform((784, 100),-1, 1)) x = tf.placeholder(tf.float32, (None, 784)) h_i = tf.nn.relu(tf.matmul(x, W) + b) 1 2 3 sess = tf.Session() sess.run(tf.initialize_all_variables()) sess.run(h_i, {x: np.random.random(64, 784)}) #DevFestGRX GDG Granada

1. Build a graph a. Graph contains parameter specifications, model
architecture, optimization process, … 2. Initialize a session 3. Fetch and feed data with Session.run a. Compilation, optimization, etc. happens at this step — you probably won’t notice Basic flow #DevFestGRX GDG Granada

Questions? #DevFestGRX GDG Granada

tensorflow.org github.com/tensorflow Want to learn more? Udacity class on Deep
Learning, goo.gl/iHssII Guides, codelabs, videos MNIST for Beginners, goo.gl/tx8R2b TF Learn Quickstart, goo.gl/uiefRn TensorFlow for Poets, goo.gl/bVjFIL ML Recipes, goo.gl/KewA03 TensorFlow and Deep Learning without a PhD, goo.gl/pHeXe7 What's Next #DevFestGRX GDG Granada

+Israel Blancas @iblancasa TensorFlow La IA detrás de Google #DevFestGRX
GDG Granada

TensorFlow - La IA detrás de Google

TensorFlow - La IA detrás de Google

More Decks by Israel Blancas

Other Decks in Technology

Featured

Transcript