Upgrade to Pro — share decks privately, control downloads, hide ads and more …

ML Video Intelligence

Ahmed Sobhy
August 13, 2020
18

ML Video Intelligence

Ahmed Sobhy

August 13, 2020
Tweet

Transcript

  1. What Is an Algorithm ? - An algorithm is a

    set of rules to be followed when solving problems. In machine learning, algorithms take in data and perform calculations to find an answer. - Algorithms need to be trained to learn how to classify and process information. The efficiency and accuracy of the algorithm are dependent on how well the algorithm was trained. - Using an algorithm to calculate something does not automatically mean machine learning or AI was being used. All squares are rectangles, but not all rectangles are squares.
  2. The dimension of data are: • CPU: 1 X 1

    data unit • GPU: 1 X N data unit • TPU: N X N data unit As a comparison, consider this: • CPU can handle tens of operation per cycle • GPU can handle tens of thousands of operation per cycle • TPU can handle upto 128000 operations per cycle CPU GPU TPU
  3. At the most basic level, machines equipped with AI imitate

    the human thought process such as the ability to identify an apple and orange. Source ( qualcomm)
  4. With machine learning, a system improves its pattern recognition capabilities

    by learning from its own mistakes. Source ( qualcomm)
  5. Through deep learning, a machine can process large amounts of

    data, recognize complex patterns, and provide more detailed insights. Source ( qualcomm)
  6. Three ways, with varying complexity: (1) Use a Cloud-based or

    Mobile API (Vision, Natural Language, etc.) (2) Use an existing model architecture, and retrain it or fine tune on your dataset (3) Develop your own machine learning models for new problems How Can You Get Started with Machine Learning? More flexible, but more effort required
  7. Faces Faces, facial landmarks, emotions OCR Read and extract text,

    with support for > 10 languages Label Detect entities from furniture to transportation Logos Identify product logos Landmarks & Image Properties Detect landmarks & dominant color of image Safe Search Detect explicit content - adult, violent, medical and spoof Cloud Vision API
  8. API Usage: Detect Objects in an Image Image Detected Items

    Vision API Create JSON request with the image or pointer to an image Process the JSON response Call the REST API 1 2 3
  9. Cloud Speech API Automatic Speech Recognition (ASR) powered by deep

    learning neural networking to power your applications like voice search or speech transcription. Recognizes over 80 languages and variants with an extensive vocabulary. Returns partial recognition results immediately, as they become available. Filter inappropriate content in text results. Audio input can be captured by an application’s microphone or sent from a pre-recorded audio file. Multiple audio file formats are supported, including FLAC, AMR, PCMU and linear-16. Handles noisy audio from many environments without requiring additional noise cancellation. Audio files can be uploaded in the request and, in future releases, integrated with Google Cloud Storage. Automatic Speech Recognition Global Vocabulary Inappropriate Content Filtering Streaming Recognition Real-time or Buffered Audio Support Noisy Audio Handling Integrated API
  10. Extract sentence, identify parts of speech and create dependency parse

    trees for each sentence. Identify entities and label by types such as person, organization, location, events, products and media. Understand the overall sentiment of a block of text. Syntax Analysis Entity Recognition Sentiment Analysis Cloud Natural Language API
  11. • https://cloud.google.com/free/ • Free Trial: $300 and 12 months •

    Free Tier extends beyond 12 months too (Always Free)