Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Stroke Width Transform

Stroke Width Transform

A novel image operator that seeks to find the value of stroke width for each image pixel, and its use on the task of text detection in natural images.

dennis1088

June 28, 2012
Tweet

More Decks by dennis1088

Other Decks in Research

Transcript

  1. Text Detection • Image retrieval • Help for visually impaired

    • Translation • Business geocoding • Robotic navigation
  2. Natural Images • Large variability of fonts • Varied illumination

    • Image noise • Blur • No page layout
  3. Region Based Text Detection • Pixels aggregated based on local

    features • Connected components extracted, deemed letter candidates • Letter candidates grouped into line text, geometric checks applied
  4. Computing Stroke Width 4 4 4 4 4 4 4

    4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 3 4 4 3 3 3 4 4 4 4 4 4 4 4 4 4 4 3 3 3 4 4 4 4 4 4 4 4 4 4 4 3 3 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4
  5. Geometric Checks • Reject components high stroke width variance •

    Limit aspect ratio of components • Components too large or too small ignored
  6. Issues • Gradient direction will not always point towards opposite

    stroke edge point • Will travel in opposite direction than intended
  7. Solution • SWT must be done twice • Dark text

    on light background • Light text on dark background • Inverse of gradient direction
  8. Future • Detect reading in eye tracking data • Use

    gaze data to bound text detection to a window • Preform text detection • Preform OCR • Use for different purposes