Upgrade to Pro — share decks privately, control downloads, hide ads and more …

The 4 V's of Big Data

Carsten Keßler
September 20, 2012

The 4 V's of Big Data

My slides from the big data panel at GIScience 2012 in Columbus, OH.

Carsten Keßler

September 20, 2012
Tweet

More Decks by Carsten Keßler

Other Decks in Science

Transcript

  1. The 3 Vs Carsten Keßler | Institute for Geoinformatics |

    University of Münster, Germany http://carsten.io | @carstenkessler of Big Data 4
  2. ‣ Integrating data of 70 different types (and counting): 1

    | Volume LODUM Project LODUM Linked Open Data University of Münster
  3. ‣ Integrating data of 70 different types (and counting): ‣

    publications, persons, buildings, courses, cafeteria menus, urban history data, … 1 | Volume LODUM Project LODUM Linked Open Data University of Münster
  4. ‣ Integrating data of 70 different types (and counting): ‣

    publications, persons, buildings, courses, cafeteria menus, urban history data, … ‣ Challenge: reconciliation and de-duplication 1 | Volume LODUM Project LODUM Linked Open Data University of Münster
  5. ‣ Integrating data of 70 different types (and counting): ‣

    publications, persons, buildings, courses, cafeteria menus, urban history data, … ‣ Challenge: reconciliation and de-duplication > 800k resources, > 3 mio triples 1 | Volume LODUM Project LODUM Linked Open Data University of Münster
  6. 2 | Velocity Semantic Sensor Networks ‣ Does Linked Sensor

    Data make sense at all? ‣ Do we want to provide all sensor data as Linked Sensor Data? / /
  7. 2 | Velocity Semantic Sensor Networks ‣ Does Linked Sensor

    Data make sense at all? ‣ Do we want to provide all sensor data as Linked Sensor Data? ‣ Data streams ‣ Stream processing ‣ Automatically calibrate update frequency / /
  8. 3 | Variety Humanitarian eXchange Language ‣ United Nations Office

    for the Coordination of Humanitarian Affairs (OCHA) hxl.humanitarianresponse.info
  9. 3 | Variety Humanitarian eXchange Language ‣ United Nations Office

    for the Coordination of Humanitarian Affairs (OCHA) ‣ Integrating messy data from a wide range of humanitarian organizations in a number of formats hxl.humanitarianresponse.info
  10. 3 | Variety Humanitarian eXchange Language ‣ United Nations Office

    for the Coordination of Humanitarian Affairs (OCHA) ‣ Integrating messy data from a wide range of humanitarian organizations in a number of formats ‣ Approach: ‣ semantic annotations ‣ space + time as organization principles hxl.humanitarianresponse.info
  11. 4 | Veracity Trust and Reputation in VGI ‣ Parsing

    the history of single features ‣ Trying to compute credibility on a per-user basis
  12. 4 | Veracity Trust and Reputation in VGI ‣ Parsing

    the history of single features ‣ Trying to compute credibility on a per-user basis ‣ Volume: ‣ Full OSM history dump: 500 GB ‣ Decompressing it takes ~36hours on a decent laptop