Mobile Vision API + Android
Enzo Lizama Paredes
July 06, 2018
Transcript
MAKE YOUR APP SEE WITH MOBILE VISION API
ENZO LIZAMA PAREDES @enzoftware
HOW DO WE SEE ?
HOW DO MACHINES SEE ?
WHAT IS COMPUTER VISION ?
COMPUTER VISION IS EVERYWHERE!
On robotics
On security
On social networks
On medicine
On gaming
On cars
And also on mobile
MOBILE VISION API OVERVIEW
FUNCTIONALITY • Face API • Barcode API • Text API (OCR)
FACE API What can we do with faces?
FACE TRACKING
LANDMARKS DETECTION
FACE CLASSIFICATION : EYES OPEN (probability from 0.0 to 1.0)
FACE CLASSIFICATION : SMILING (probability from 0.0 to 1.0)
FACE CLASSIFICATION : POSE ANGLE
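The classification slides describe scores between 0.0 and 1.0. In the Mobile Vision library these come from `Face.getIsLeftEyeOpenProbability()`, `Face.getIsRightEyeOpenProbability()` and `Face.getIsSmilingProbability()`, which return `Face.UNCOMPUTED_PROBABILITY` (-1.0f) when no score is available. Below is a minimal sketch in plain Java of how an app might turn those scores into labels. The class name `FaceClassification` and the 0.5 cut-off are assumptions for illustration, not part of the deck or the library.

```java
/** Hypothetical helper: turns Face API classification scores into labels. */
public class FaceClassification {
    // Mirrors the value of Face.UNCOMPUTED_PROBABILITY in the Mobile Vision library.
    public static final float UNCOMPUTED = -1.0f;
    // Assumed cut-off; tune it for your own app.
    public static final float THRESHOLD = 0.5f;

    /** Returns "label: yes", "label: no", or "label: unknown" for an uncomputed score. */
    public static String describe(String label, float probability) {
        if (probability == UNCOMPUTED) {
            return label + ": unknown";
        }
        return label + ": " + (probability >= THRESHOLD ? "yes" : "no");
    }

    /** True only when both eye scores were computed and reach the threshold. */
    public static boolean bothEyesOpen(float leftEye, float rightEye) {
        return leftEye >= THRESHOLD && rightEye >= THRESHOLD;
    }

    public static void main(String[] args) {
        System.out.println(describe("smiling", 0.92f));
        System.out.println(describe("left eye open", UNCOMPUTED));
        System.out.println(bothEyesOpen(0.8f, 0.3f));
    }
}
```

In a real app the float arguments would come straight from a detected `Face` object. The pose angle is exposed separately through `Face.getEulerY()` and `Face.getEulerZ()`.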
BARCODE API What can we do with barcodes?
BARCODE TRACKING
TEXT API Reading text from images
READING TEXT FROM IMAGES
TEXT STRUCTURE
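The Text API reports recognized text as a hierarchy: blocks contain lines, and lines contain words (`TextBlock`, `Line` and `Element` in the library). The sketch below only illustrates that nesting in plain Java by splitting a string. It does not call the Mobile Vision library, and the class name `TextStructure` and its splitting rules are assumptions for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical illustration of the block > line > word hierarchy. */
public class TextStructure {

    /** Splits text into blocks (separated by blank lines), then lines, then words. */
    public static List<List<List<String>>> parse(String text) {
        List<List<List<String>>> blocks = new ArrayList<>();
        for (String block : text.trim().split("\\n\\s*\\n")) {
            List<List<String>> lines = new ArrayList<>();
            for (String line : block.split("\\n")) {
                String trimmed = line.trim();
                if (trimmed.isEmpty()) {
                    continue; // skip empty lines inside a block
                }
                lines.add(Arrays.asList(trimmed.split("\\s+")));
            }
            if (!lines.isEmpty()) {
                blocks.add(lines);
            }
        }
        return blocks;
    }

    public static void main(String[] args) {
        String recognized = "MOBILE VISION\nAPI OVERVIEW\n\nText API";
        List<List<List<String>>> blocks = parse(recognized);
        for (int b = 0; b < blocks.size(); b++) {
            System.out.println("Block " + b);
            for (List<String> line : blocks.get(b)) {
                System.out.println("  Line: " + line);
            }
        }
    }
}
```

With the real API the same traversal would walk the `getComponents()` list of each `TextBlock` and then of each `Line`.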
MOBILE VISION API USAGE 1. Add dependencies 2. Add meta-data on manifest 3. Create detector 4. Check if detector is operational 5. Create frame 6. Detect faces
1. ADD DEPENDENCIES
2. ADD META-DATA ON MANIFEST
3. CREATE DETECTOR
4. CHECK IF DETECTOR IS OPERATIONAL
5. CREATE FRAME
6. DETECT FACES
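The code shown on the six step slides is not in the transcript. As a rough sketch of what the steps can look like for a still image, the Java below uses the public Mobile Vision face classes. The class name `FaceCounter`, the -1 return value and the builder options are assumptions for illustration, and the dependency version is deliberately left as a placeholder. The snippet only compiles inside an Android project that includes Google Play services.

```java
import android.content.Context;
import android.graphics.Bitmap;
import android.util.SparseArray;

import com.google.android.gms.vision.Frame;
import com.google.android.gms.vision.face.Face;
import com.google.android.gms.vision.face.FaceDetector;

// Step 1 (module build.gradle): add the Play services vision dependency, e.g.
//   implementation 'com.google.android.gms:play-services-vision:<version>'
// Step 2 (AndroidManifest.xml, inside <application>): request the face model, e.g.
//   <meta-data android:name="com.google.android.gms.vision.DEPENDENCIES"
//              android:value="face" />

/** Hypothetical helper that runs steps 3 to 6 on a single bitmap. */
public final class FaceCounter {

    /** Returns the number of faces found, or -1 if the detector is not ready yet. */
    public static int countFaces(Context context, Bitmap bitmap) {
        // 3. Create detector
        FaceDetector detector = new FaceDetector.Builder(context)
                .setTrackingEnabled(false) // a single still image, no tracking across frames
                .setLandmarkType(FaceDetector.ALL_LANDMARKS)
                .setClassificationType(FaceDetector.ALL_CLASSIFICATIONS)
                .build();
        try {
            // 4. Check if detector is operational (the native model may still be downloading)
            if (!detector.isOperational()) {
                return -1;
            }
            // 5. Create frame
            Frame frame = new Frame.Builder().setBitmap(bitmap).build();
            // 6. Detect faces
            SparseArray<Face> faces = detector.detect(frame);
            return faces.size();
        } finally {
            // Free the native resources held by the detector.
            detector.release();
        }
    }
}
```

Creating and releasing the detector on every call keeps the sketch short. An app that processes many images would more likely build one detector and reuse it.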
DEMO - FaceSpotter - dOCReader - Barcode Recognition
AND NOW WHAT? GITHUB SAMPLES https://github.com/googlesamples/android-vision
AND NOW WHAT? Mobile Vision Docs https://developers.google.com/vision/
AND NOW WHAT? Google Codelabs https://codelabs.developers.google.com/
@enzoftware