- OCRs didn’t work well on images
in Nikkei print replica editions
- By assuming that characters will
be arranged in blocks, detecting
rectangular blocks become easier
Why we started developing this technology in-house
Idea
- Detects rectangular blocks in the
following order: (1) paragraphs,
(2) lines, and (3) characters
8