Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Food Image Object Detection and Classification
Search
Leszek Rybicki
February 16, 2017
Research
2
14k
Food Image Object Detection and Classification
Part 1: Detection
Leszek Rybicki
February 16, 2017
Tweet
Share
More Decks by Leszek Rybicki
See All by Leszek Rybicki
Let's talk about Fakes
lunardog
0
110
How to Patch Image Classifiers
lunardog
0
1.7k
Towards Realistic Predictors - EN
lunardog
0
1.6k
Towards Realistic Predictors
lunardog
1
2k
Deep Learning Hot Dog Detector
lunardog
0
230
Finding beans in burgers: paper reading notes
lunardog
0
1.3k
Kelner: Serve Your Models
lunardog
0
100
Image Analysis at Cookpad
lunardog
1
1.6k
Kelner: serve your models
lunardog
1
340
Other Decks in Research
See All in Research
さんかくのテスト.pdf
sankaku0724
0
520
The Fellowship of Trust in AI
tomzimmermann
0
150
FOSS4G 山陰 Meetup 2024@砂丘 はじめの挨拶
wata909
1
120
熊本から日本の都市交通政策を立て直す~「車1割削減、渋滞半減、公共交通2倍」の実現へ~@公共交通マーケティング研究会リスタートセミナー
trafficbrain
0
180
LLM時代にLabは何をすべきか聞いて回った1年間
hargon24
1
530
データサイエンティストをめぐる環境の違い 2024年版〈一般ビジネスパーソン調査の国際比較〉
datascientistsociety
PRO
0
780
marukotenant01/tenant-20240826
marketing2024
0
520
日本語医療LLM評価ベンチマークの構築と性能分析
fta98
3
780
ダイナミックプライシング とその実例
skmr2348
3
480
秘伝:脆弱性診断をうまく活用してセキュリティを確保するには
okdt
PRO
4
770
Weekly AI Agents News! 9月号 プロダクト/ニュースのアーカイブ
masatoto
2
170
RSJ2024「基盤モデルの実ロボット応用」チュートリアルA(河原塚)
haraduka
3
700
Featured
See All Featured
A Philosophy of Restraint
colly
203
16k
Visualizing Your Data: Incorporating Mongo into Loggly Infrastructure
mongodb
44
9.3k
Fight the Zombie Pattern Library - RWD Summit 2016
marcelosomers
232
17k
Music & Morning Musume
bryan
46
6.2k
What’s in a name? Adding method to the madness
productmarketing
PRO
22
3.2k
GraphQLの誤解/rethinking-graphql
sonatard
67
10k
Imperfection Machines: The Place of Print at Facebook
scottboms
266
13k
Why Our Code Smells
bkeepers
PRO
335
57k
The Invisible Side of Design
smashingmag
298
50k
Optimizing for Happiness
mojombo
376
70k
Raft: Consensus for Rubyists
vanstee
137
6.7k
Rails Girls Zürich Keynote
gr2m
94
13k
Transcript
Food Image Object Detection and Classification Challenges and Solutions
Part 1: Detection
自己紹介 • リビツキ レシェック • ポーランド出身 • 2016~ クックパッド • github:
lunardog
Warning! This presentation contains images that may cause severe drooling
and stomach grumbling. @cookpad
History 歴史
ImageNet KWWSLPDJHQHWRUJ
ImageNet Large Scale Visual Recognition Competition KWWSZZZLPDJHQHWRUJFKDOOHQJHV/695&
ILSVRC 2010 task Classification )RUHDFKLPDJHDOJRULWKPV ZLOOSURGXFHDOLVWRIDWPRVW REMHFWFDWHJRULHVLQWKH GHVFHQGLQJRUGHURI FRQILGHQFH KWWSZZZLPDJHQHWRUJFKDOOHQJHV/695&
ILSVRC 2011 tasks 1. Classification 2. *Classification with localization *tester
task
KWWSFVQVWDQIRUGHGXV\OODEXVKWPO Classification + Localization
ILSVRC 2012 tasks 1. Classification 2. Classification with localization 3.
Fine-grained classification
Fine-grained classification KWWSZZZLPDJHQHWRUJFKDOOHQJHV/695&
AlexNet ,PDJHQHWFODVVLILFDWLRQZLWKGHHSFRQYROXWLRQDOQHXUDOQHWZRUNV $.UL]KHYVN\,6XWVNHYHU*(+LQWRQ$GYDQFHVLQQHXUDOLQIRUPDWLRQ SURFHVVLQJV\VWHPV
ILSVRC 2013 tasks 1. Detection 2. Classification 3. Classification with
localization
ILSVRC 2014 tasks 1. Detection 2. Classification 3. Classification with
localization
Object Detection KWWSFVQVWDQIRUGHGXV\OODEXVKWPO
Deep Learning KWWSVGHYEORJVQYLGLDFRP
ILSVRC 2015 tasks 1. Object detection 2. Object localization 3.
*Object detection from video 4. *Scene classification
ILSVRC 2016 tasks 1. Object localization 2. Object detection 3.
Object detection from video 4. Scene classification 5. Scene parsing
Cookpad 2016
画像データセット 1997年~ レシピ数:国内約260万 + 国外 + つくれぽ + 手順写真 17言語、60カ国
※数字は2017年02月時点のものです
画像解析の研究関心 • これは料理ですか? • どの料理ですか? • 料理はどこですか? • 。。。 Part
2
Where is the food? 料理はどこですか?
ゴール )LQGIRRGLQWKHLPDJHGUDZ DERXQGLQJER[DURXQGWKH IRRGLWHPLQFOXGLQJWKH GLVKLIYLVLEOH
,IWKHUHDUHPXOWLSOHLWHPV GUDZDERXQGLQJER[ DURXQGHDFKRQH ゴール
ground truth bounding box > 0.9 We count it as
a positive detection if Intersection over Union ratio is greater than 0.9. ƴ
QXPEHURIWUXHSRVLWLYHV QXPEHURIJURXQGWUXWKER[HV ƴ ƴ ƴ QXPEHURIWUXHSRVLWLYHV QXPEHURIJHQHUDWHGER[HV 再現率 (precision) (recall)
ƴ ƴ
Methods
1. Build a classifier 2. Pick Regions of Interest 3.
Run classifier on each region 4. Remove duplicate detections IDEA
Fast, Faster R-CNN 5LFKIHDWXUHKLHUDUFKLHVIRUDFFXUDWHREMHFWGHWHFWLRQDQGVHPDQWLFVHJPHQWDWLRQ 5RVV*LUVKLFN-HII'RQDKXH7UHYRU'DUUHOO-LWHQGUD0DOLN )DVWHU5&117RZDUGV5HDO7LPH2EMHFW'HWHFWLRQZLWK5HJLRQ3URSRVDO1HWZRUNV 6KDRTLQJ5HQ.DLPLQJ+H5RVV*LUVKLFN-LDQ6XQ
)DVW5&11 5RVV*LUVKLFN
問題 1. Computational cost 2. Context is important 3. ...but
context can be confusing. KDQG IRRG JUDVV IRRG KWWSSL[DED\FRP
Single Shot Detector 66'6LQJOH6KRW0XOWL%R['HWHFWRU :HL/LX'UDJRPLU$QJXHORY'XPLWUX(UKDQ&KULVWLDQ6]HJHG\ 6FRWW5HHG&KHQJ<DQJ)X$OH[DQGHU&%HUJ
Either The Least Or Most Employable Person Ever 7KH+XIILQJWRQ3RVW JLWKXEFRPSMUHGGLH
SMUHGGLHFRPGDUNQHW ZZZNDJJOHFRPSMUHGGLH Joseph Redmon
You Only Look Once <RX2QO\/RRN2QFH8QLILHG 5HDO7LPH2EMHFW'HWHFWLRQ -RVHSK5HGPRQ6DQWRVK'LYYDOD5RVV *LUVKLFN$OL)DUKDGL 'HF
<2/2%HWWHU)DVWHU 6WURQJHU -RVHSK5HGPRQ$OL)DUKDGL
<RX2QO\/RRN2QFH8QLILHG5HDO7LPH2EMHFW'HWHFWLRQ -RVHSK5HGPRQ6DQWRVK'LYYDOD5RVV*LUVKLFN$OL)DUKDGL YOLO in Context
None