Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
OCRFeeder: OCR Made Easy on GNOME
Search
Joaquim Rocha
July 27, 2012
Programming
1
250
OCRFeeder: OCR Made Easy on GNOME
A presentation of what OCRFeeder is and what is does.
Joaquim Rocha
July 27, 2012
Tweet
Share
More Decks by Joaquim Rocha
See All by Joaquim Rocha
Git: Best Practices
jrocha
3
3.8k
Skeltrack: Open Source Skeleton Tracking
jrocha
1
190
Introduction to Django
jrocha
5
3.7k
Skeltrack - Open Source Skeleton Tracking
jrocha
0
160
Skeltrack: Open Source Skeleton Tracking
jrocha
1
3k
Other Decks in Programming
See All in Programming
大規模Reactアプリのリアーキテクチャ~8万行のTanStack Query移行の軌跡~
kj455
4
1k
Git Rebase
bkuhlmann
11
1.6k
"config" ってなんだ? / What is "config"?
okashoi
0
300
MetricKitで予期せぬ終了を検知する話 / Detect unexpected termination with MetricKit
nekowen
1
200
DMMプラットフォームがTiDB Cloudを採用した背景
pospome
9
4.2k
Elm Form Validation
bkuhlmann
0
510
Implementing Design Systems in Swift
seyfoyun
1
460
PHPはいつから死んでいるかの調査
chiroruxx
2
420
From Spring Boot 2 to Spring Boot 3 with Java 21 and Jakarta EE
ivargrimstad
0
520
FigmaとPHPで作る1ミリたりとも表示崩れしない最強の帳票印刷ソリューション
ttskch
43
19k
GitHub Copilotのススメ
marcy731
1
220
Fast JSX: Don't clone props object #28768
yossydev
1
170
Featured
See All Featured
In The Pink: A Labor of Love
frogandcode
138
21k
Fireside Chat
paigeccino
22
2.6k
GraphQLとの向き合い方2022年版
quramy
33
12k
Infographics Made Easy
chrislema
238
18k
How To Stay Up To Date on Web Technology
chriscoyier
782
250k
A Philosophy of Restraint
colly
197
16k
Learning to Love Humans: Emotional Interface Design
aarron
267
39k
Building Effective Engineering Teams - LeadDev
addyosmani
31
1.9k
Happy Clients
brianwarren
92
6.4k
Reflections from 52 weeks, 52 projects
jeffersonlam
345
19k
Helping Users Find Their Own Way: Creating Modern Search Experiences
danielanewman
21
1.9k
個人開発の失敗を避けるイケてる考え方 / tips for indie hackers
panda_program
65
14k
Transcript
static void _f_do_barnacle_install_properties(GObjectClass *gobject_class) { GParamSpec *pspec; /* Party code
attribute */ pspec = g_param_spec_uint64 (F_DO_BARNACLE_CODE, "Barnacle code.", "Barnacle code", 0, G_MAXUINT64, G_MAXUINT64 /* default value */, G_PARAM_READABLE | G_PARAM_WRITABLE | G_PARAM_PRIVATE); g_object_class_install_property (gobject_class, F_DO_BARNACLE_PROP_CODE, Joaquim Rocha
[email protected]
OCRFeeder OCR Made Easy on GNOME July 27 2012
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 What is
it? Document Analysis and Optical Character Recognition for GNOME
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Why? Paper
has a number of problems No applications for GNU/Linux to do a fair job
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Paper problems:
Security CC Photo by: http://www.flickr.com/photos/badwsky/
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Paper problems:
Preservation CC Photo by: http://www.flickr.com/photos/98469445@N00/
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Paper problems:
Data processing CC Photo by: http://www.flickr.com/photos/hugovk/
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Paper problems:
Ecology CC Photo by: http://www.flickr.com/photos/pranavsingh/
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Paper problems:
Accessibility CC Photo by: http://www.flickr.com/photos/illustrator/
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 No fair
conversion apps for GNU/Linux apart from OCR engines, but...
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 OCR !=
Document Conversion (it only deals with chars) (does not consider the layout) (does not distinguish contents)
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 What's needed
is Document Analysis and Recognition (conversion of documents to an electronic format) (first projects in the 80s)
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 How it
works
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 So many
layouts... CC Photo by: http://www.flickr.com/photos/uber-tuber/
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Layouts vary
with the type of document What works on detecting one, won't work on others
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 OCRFeeder focuses
on contents, not on layouts!
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Key concept:
If a document image can be divided in windows of 1 (content) or 0 (not content), then it is possible to group all the 1s and outline the contents
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Recognition: System-wide
OCR engines are used Engines are configured from the GUI or XML files
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Most known
free OCR engines are detected and configured automatically: * Tesseract * GOCR * OCRAD * Cuneiform
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Exportation formats:
ODT HTML Plain text PDF
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 User interaction:
Users can edit everything and review the algorithm's results So, UI can work in attended and unattended ways CLI only works in an unattended mode
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Demo time!
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Other features:
* PDF importation * Unpaper preprocessor * Font style edition * Image deskewing * OCR results cleaning * Project saving/loading
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Future: *
More exportation formats: HOCR, etc. * Make OCR engines' management easier
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Webpage: http://live.gnome.org/OCRFeeder
git: http://git.gnome.org/ocrfeeder Bugzilla: http://bugzilla.gnome.org product: OCRFeeder
Joaquim Rocha (Igalia) · OCRFeeder · GUADEC 2012 Thank you!