Slide 31
Supplement: Datasets
Example datasets (columns: Language & Script, Data Difficulty, Annotation)

ICDAR 2017 MLT dataset (MLT17)
  Language & Script: 9 languages representing 6 different scripts equally
  Data Difficulty: multi-oriented scene text
  Annotation: quadrangle bounding boxes

ICDAR 2019 MLT dataset (MLT19)
  Language & Script: 10 languages representing 7 different scripts
  Data Difficulty: multi-oriented scene text
  Annotation: quadrangle bounding boxes

Total-Text dataset
  Language & Script: English
  Data Difficulty: wide variety of horizontal, multi-oriented, and curved text
  Annotation: word-level, polygon bounding boxes

ICDAR 2019 ArT dataset (ArT19)
  Language & Script: English and Chinese
  Data Difficulty: highly challenging arbitrarily shaped text
  Annotation: polygons with an arbitrary number of vertices

ICDAR 2017 RCTW dataset (RCTW17)
  Language & Script: Chinese
  Data Difficulty: scene text in Chinese
  Annotation: polygons drawn around every text line

ICDAR 2019 LSVT dataset (LSVT19)
  Language & Script: Chinese, with about 20% of labels in English words
  Data Difficulty: street view text in Chinese
  Annotation: polygons drawn around every text line

ICDAR 2013 dataset (IC13)
  Language & Script: English
  Data Difficulty: horizontal text
  Annotation: word-level, rectangular bounding boxes

ICDAR 2015 dataset (IC15)
  Language & Script: English
  Data Difficulty: multi-oriented scene text
  Annotation: word-level, quadrangle bounding boxes
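
The datasets above use three annotation geometries: rectangular boxes, quadrangles, and polygons with an arbitrary number of vertices. The following is a minimal sketch of how such annotations could be normalized to one common vertex-list form. The function names and the in-memory representations (a four-value rectangle, a flat coordinate list) are assumptions for illustration and are not the datasets' actual file formats.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]


def rect_to_polygon(x_min: float, y_min: float,
                    x_max: float, y_max: float) -> List[Point]:
    """Expand an axis-aligned rectangle into its four corner vertices."""
    return [(x_min, y_min), (x_max, y_min), (x_max, y_max), (x_min, y_max)]


def flat_to_polygon(coords: Sequence[float]) -> List[Point]:
    """Turn a flat [x1, y1, x2, y2, ...] list into vertices.

    Works for quadrangles (8 values) and for polygons with any
    number of vertices (at least 3 points, i.e. 6 values).
    """
    if len(coords) % 2 != 0 or len(coords) < 6:
        raise ValueError("expected an even number of values, at least 6")
    return [(coords[i], coords[i + 1]) for i in range(0, len(coords), 2)]


def polygon_area(poly: Sequence[Point]) -> float:
    """Area of a simple polygon via the shoelace formula."""
    total = 0.0
    for i, (x1, y1) in enumerate(poly):
        x2, y2 = poly[(i + 1) % len(poly)]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


# Hypothetical annotations of the same 2 x 1 region in two styles.
rect_poly = rect_to_polygon(0, 0, 2, 1)
quad_poly = flat_to_polygon([0, 0, 2, 0, 2, 1, 0, 1])
```

Once every annotation is a vertex list, the same downstream code (area, overlap, drawing) can handle horizontal, multi-oriented, and curved text alike.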
Character-level annotation is missing, and the language coverage is skewed.