NYPL Labs Building Inspector: Extracting Data from Historic Maps

Slides for the talk given by Mauricio Giraldo Arteaga at OpenVisConf 2014 http://openvisconf.com/

Video here: https://www.youtube.com/watch?v=Oph1o3IZEFU

Errata (Apr 30, 2014): There's some (kind of cool) dispute on the bronze map. More info: http://exhibitions.nypl.org/treasures/items/show/163 and http://news.nationalgeographic.com/news/2013/08/130821-ostrich-globe-map-discovery-science-nation/

Mauricio Giraldo

April 25, 2014

Transcript

  1. completely enclosed by black lines; dashed lines are not walls; > 20 m² (~180 ft²); < 3,000 m² (~27,000 ft²)
  2. completely enclosed by black lines; dashed lines are not walls; > 20 m² (~180 ft²); < 3,000 m² (~27,000 ft²); not paper-colored
  3. completely enclosed by black lines; dashed lines are not walls; > 20 m² (~180 ft²); < 3,000 m² (~27,000 ft²); not paper-colored
  4. completely enclosed by black lines; dashed lines are not walls; > 20 m² (~180 ft²); < 3,000 m² (~27,000 ft²); not paper-colored
  5. completely enclosed by black lines; dashed lines are not walls; > 20 m² (~180 ft²); < 3,000 m² (~27,000 ft²); not paper-colored (✔ ✔)
  6. alpha shape; convex hull; with a sample point set; cran.r-project.org/web/packages/alphahull/ (*code basically stolen wholesale from rpubs.com/geospacedman/alphasimple)
  7. completely enclosed by black lines; dashed lines are not walls; > 20 m² (~180 ft²); < 3,000 m² (~27,000 ft²); not paper-colored (✔ ✔ ✔ ✔)
  8. [218, 211, 209] paper; [199, 179, 173], [179, 155, 157], [206, 193, 189], [199, 195, 163], [207, 204, 179], [195, 189, 154], [209, 203, 181], [255, 225, 40], [194, 198, 192], [161, 175, 190], [137, 174, 163], [166, 176, 172], [149, 156, 141] [205, 200, 186] not paper
  9. ✔ completely enclosed by black lines; ✔ dashed lines are not walls; ✔ > 20 m² (~180 ft²); ✔ < 3,000 m² (~27,000 ft²); ✔ not paper-colored
  10. are people willing to spend time checking building footprints? insurance atlases are not exactly the coolest type of maps
  11. 420k+ flags*; 70k+ unique polygons! consensus: ~84% YES, 7% FIX, 9% NO (*a “flag” is a YES/NO/FIX by one person for a given polygon)
  12. gracias bocoup! mauricio giraldo arteaga, @mgiraldo, NYPL Labs; images from giphy, wikimedia commons & nypl digital collections
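
The footprint criteria on the slides (area between 20 m² and 3,000 m², not paper-colored) and the YES/NO/FIX flags can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the code behind Building Inspector: the function names, the color tolerance, and the simple-majority rule are all hypothetical, and treating [218, 211, 209] as the paper sample is one reading of the slide. The "completely enclosed by black lines" check is an image-processing step and is not covered here.

```python
from collections import Counter
from math import dist

MIN_AREA_M2 = 20.0      # lower bound from the slides
MAX_AREA_M2 = 3000.0    # upper bound from the slides
PAPER_RGB = (218, 211, 209)  # assumed: the sample labeled "paper" on the slide
PAPER_TOLERANCE = 15.0       # hypothetical RGB distance, not from the talk


def polygon_area(points):
    """Area of a simple polygon via the shoelace formula.

    `points` is a list of (x, y) vertices, assumed to be in metres.
    """
    total = 0.0
    for (x1, y1), (x2, y2) in zip(points, points[1:] + points[:1]):
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def is_paper_colored(rgb):
    """True if `rgb` is within the assumed tolerance of the paper sample."""
    return dist(rgb, PAPER_RGB) <= PAPER_TOLERANCE


def looks_like_building(points, mean_rgb):
    """Apply the area bounds and the not-paper-colored test to one polygon."""
    area = polygon_area(points)
    return MIN_AREA_M2 < area < MAX_AREA_M2 and not is_paper_colored(mean_rgb)


def consensus(flags):
    """Most common flag and its share of all flags (assumed simple majority)."""
    counts = Counter(flags)
    winner, votes = counts.most_common(1)[0]
    return winner, votes / len(flags)


if __name__ == "__main__":
    footprint = [(0, 0), (10, 0), (10, 10), (0, 10)]  # a 10 m x 10 m square
    print(looks_like_building(footprint, (199, 179, 173)))
    print(consensus(["YES", "YES", "FIX", "YES"]))
```

A fixed distance to a single paper color is the simplest possible test; comparing against both the paper and not-paper samples from the slide (nearest reference color) would be a natural refinement.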