Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Exploring Machine Learning With Google Cloud Vision API

Exploring Machine Learning With Google Cloud Vision API

My talk during the awesome #DevFestNairobi 2018 about exploring Machine Learning using Google Cloud Vision API for Web and Mobile applications.

Google Cloud Vision API allows you to explore powerful image analysis with
-Pretrained models
-Ability to build custom models using AutoML Vision

And provides access via an easy-to-use REST API

Olayinka Peter Oluwafemi

September 22, 2018
Tweet

More Decks by Olayinka Peter Oluwafemi

Other Decks in Technology

Transcript

  1. Exploring Machine Learning With Google Cloud Vision API +OlayinkaPeterOluwafemi @olayinkapeter_

    Organizer, Google Cloud Developer Community, Ado-Ekiti, Nigeria
  2. Sometime in late 2017, I published Toodoo, a Firebase-powered app

    that basically lets you add tasks you want to be reminded of, but with some Machine Learning capabilities.
  3. Toodoo is not special. Except that it solves the problem

    of: Users spending large amount of time and/or having trouble in trying to read small printed material in a voluminous page
  4. Hence, allowing users to: • Hunt for text material (in

    languages such as Chinese, English, Finnish, French, German, Japanese, Korean, Portuguese & Spanish) with a device camera to capture an image. • Process the image and extract the text from it using powerful Google Cloud vision Optical Character Recognition (OCR) support. • Copy the extracted image to clipboard, and add them as to-dos
  5. Powerful image analysis Google Cloud Vision API allows you to

    explore powerful image analysis with • Pretrained models • Ability to build custom models using AutoML Vision Easy-to-use REST API
  6. Easily detect broad sets of objects in your images With

    AutoML Vision, you can create custom models that highlight specific concepts from your images. This enables use cases ranging from categorizing product images to diagnosing diseases.
  7. Extracts and identifies text Optical Character Recognition (OCR) enables you

    to detect text within your images or via the camera, along with automatic language identification. Remember Toodoo?
  8. Power of the web Vision API uses the power of

    Google Image Search to find topical entities like celebrities, logos, or news events. Millions of entities are supported, so you can be confident that the latest relevant images are available.
  9. Moderates content With the power of Google SafeSearch, one can

    easily moderate content and detect inappropriate content from crowd-sourced images. That is, detect different types of inappropriate content, from adult to violent content.
  10. Google Cloud Vision API allows you to do all that

    Machine Learning, and a bit more...
  11. Getting started with the Mobile Vision API usage The Mobile

    Vision package consists of common base functionality, and subpackages for specific Machine Learning implementations. These subpackages are easily accessed by adding their package URL to the project dependencies. Note that the common functionality package must first be added. • Common functionality: com.google.android.gms.vision • Face detector: com.google.android.gms.vision.face • Barcode detector: com.google.android.gms.vision.barcode • Text detector: com.google.android.gms.vision.text