Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Speaker Deck
PRO
Sign in
Sign up for free
OCRFeeder: OCR Made Easy on GNOME
Joaquim Rocha
July 27, 2012
Programming
1
210
OCRFeeder: OCR Made Easy on GNOME
A presentation of what OCRFeeder is and what is does.
Joaquim Rocha
July 27, 2012
Tweet
Share
More Decks by Joaquim Rocha
See All by Joaquim Rocha
Git: Best Practices
jrocha
3
3.7k
Skeltrack: Open Source Skeleton Tracking
jrocha
1
170
Introduction to Django
jrocha
5
3.5k
Skeltrack - Open Source Skeleton Tracking
jrocha
0
150
Skeltrack: Open Source Skeleton Tracking
jrocha
1
2.8k
Other Decks in Programming
See All in Programming
Meet Swift Regex
usamik26
0
370
Scrum Fest Osaka 2022/5年で200人になったスタートアップの アジャイル開発の歴史とリアル
atamaplus
1
970
Beyond Micro Frontends: Frontend Moduliths for the Enterprise @wad2022
manfredsteyer
PRO
0
140
iOS 16からのロック画面Widget争奪戦に備える
tsuzuki817
0
270
クックパッドマートの失敗したデータ設計 Before / After 大放出
mokuzon
0
180
「混ぜるな危険」を推進する設計
minodriven
8
2.2k
Modern Android Developer ~ 안내서
pluu
1
660
Independently together: better developer experience & App performance
bcinarli
0
190
[DevTrends - Jun/2022] Arquitetura baseada em eventos
camilacampos
0
160
GDG Seoul IO Extended 2022 - Android Compose
taehwandev
0
340
Android Compose Component - mapping.
taehwandev
0
140
シェーダー氷山発掘記
logilabo
0
150
Featured
See All Featured
Why Our Code Smells
bkeepers
PRO
324
55k
Rails Girls Zürich Keynote
gr2m
86
12k
Building Flexible Design Systems
yeseniaperezcruz
310
34k
Learning to Love Humans: Emotional Interface Design
aarron
261
37k
I Don’t Have Time: Getting Over the Fear to Launch Your Podcast
jcasabona
12
930
Responsive Adventures: Dirty Tricks From The Dark Corners of Front-End
smashingmag
237
19k
"I'm Feeling Lucky" - Building Great Search Experiences for Today's Users (#IAC19)
danielanewman
212
20k
Product Roadmaps are Hard
iamctodd
34
6.6k
Debugging Ruby Performance
tmm1
65
10k
Agile that works and the tools we love
rasmusluckow
319
19k
WebSockets: Embracing the real-time Web
robhawkes
57
5.4k
Code Reviewing Like a Champion
maltzj
506
37k
Transcript
static void _f_do_barnacle_install_properties(GObjectClass *gobject_class) { GParamSpec *pspec; /* Party code
attribute */ pspec = g_param_spec_uint64 (F_DO_BARNACLE_CODE, "Barnacle code.", "Barnacle code", 0, G_MAXUINT64, G_MAXUINT64 /* default value */, G_PARAM_READABLE | G_PARAM_WRITABLE | G_PARAM_PRIVATE); g_object_class_install_property (gobject_class, F_DO_BARNACLE_PROP_CODE, Joaquim Rocha jrocha@igalia.com OCRFeeder OCR Made Easy on GNOME July 27 2012
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 What is
it? Document Analysis and Optical Character Recognition for GNOME
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Why? Paper
has a number of problems No applications for GNU/Linux to do a fair job
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Paper problems:
Security CC Photo by: http://www.flickr.com/photos/badwsky/
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Paper problems:
Preservation CC Photo by: http://www.flickr.com/photos/98469445@N00/
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Paper problems:
Data processing CC Photo by: http://www.flickr.com/photos/hugovk/
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Paper problems:
Ecology CC Photo by: http://www.flickr.com/photos/pranavsingh/
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Paper problems:
Accessibility CC Photo by: http://www.flickr.com/photos/illustrator/
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 No fair
conversion apps for GNU/Linux apart from OCR engines, but...
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 OCR !=
Document Conversion (it only deals with chars) (does not consider the layout) (does not distinguish contents)
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 What's needed
is Document Analysis and Recognition (conversion of documents to an electronic format) (first projects in the 80s)
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 How it
works
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 So many
layouts... CC Photo by: http://www.flickr.com/photos/uber-tuber/
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Layouts vary
with the type of document What works on detecting one, won't work on others
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 OCRFeeder focuses
on contents, not on layouts!
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Key concept:
If a document image can be divided in windows of 1 (content) or 0 (not content), then it is possible to group all the 1s and outline the contents
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Recognition: System-wide
OCR engines are used Engines are configured from the GUI or XML files
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Most known
free OCR engines are detected and configured automatically: * Tesseract * GOCR * OCRAD * Cuneiform
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Exportation formats:
ODT HTML Plain text PDF
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 User interaction:
Users can edit everything and review the algorithm's results So, UI can work in attended and unattended ways CLI only works in an unattended mode
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Demo time!
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Other features:
* PDF importation * Unpaper preprocessor * Font style edition * Image deskewing * OCR results cleaning * Project saving/loading
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Future: *
More exportation formats: HOCR, etc. * Make OCR engines' management easier
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Webpage: http://live.gnome.org/OCRFeeder
git: http://git.gnome.org/ocrfeeder Bugzilla: http://bugzilla.gnome.org product: OCRFeeder
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Thank you!