Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Getting to understand Linked Open Data in 60 minutes

Getting to understand Linked Open Data in 60 minutes

Guest lecture at Ghent University

Pieter Colpaert

May 08, 2017
Tweet

More Decks by Pieter Colpaert

Other Decks in Technology

Transcript

  1. Will engineering a better information system help us build a

    type I civilization? https://en.wikipedia.org/wiki/Kardashev_scale
  2. Open Data in the world For example Data Portal from

    Worldbank http://data.worldbank.org
  3. SNCB STIB De Lijn TEC Schedules shared shared shared open

    Real-time shared shared shared planned Tickets no no yes no Historic no no no open Status of e.g., Public Transit in BE?
  4. Sharing data between 2 systems Your system Third party system

    Agree on a protocol Will determine which questions can be answered in a timely fashion Can ask questions to your system as previously agreed
  5. Sharing data on the Web Your system ? ? ?

    ? ? ? Maximizing reuse → need to raise the interoperability
  6. ↓ Querying syntactic semantic technical legal When I have got

    2 datasets, how easy is it to use them as if they were 1?
  7. As a reuser, you need certainty that you won’t get

    sued https://github.com/iRail/stations
  8. A story of raising interoperability ↓ Querying syntactic semantic technical

    legal When I have 2 datasets, how easy is it to turn them into 1 dataset? → Open Definition & open licenses → The Internet: exchanging data world-wide → JSON, XML, CSV, HTML… Open Standards
  9. name type city population StP-Plein Parking Gent 257k { “StP-Plein”

    : { “type” : “Parking”, “city” : “Gent, “population” : “257k” } } <StP-Plein> <type>Parking</type> <city>Gent</city> <population> 257k </population> </StP-Plein> Table / CSV / Spreadsheet JSON XML Serialisations ↓ Querying syntactic semantic technical legal
  10. { “StP-Plein” : { “type” : “Parking”, “city” : “Gent,

    “population” : “257k” } } name type city population StP-Plein Parking Gent 257k <StP-Plein> <type>Parking</type> <city>Gent</city> <population> 257k </population> </StP-Plein> <StP-Plein> <type> <Parking> . <StP-Plein> <city> <Gent> . <Gent> <population> “257k” . Table / CSV / Spreadsheet 3 time a datum Triples JSON XML
  11. World Wide Web St-P Plein city Gent St Pietersplein type

    Parking Gent population 257k HTTP Machine 1 HTTP Machine 2 HTTP Machine 3 Thought experiment: decentralizing publishing A user agent visiting each machine knows more than any of the machines independently
  12. Solution Sint Pietersplein → https://stad.gent/id/parking/P10 is a → http://www.w3.org/1999/02/22-rdf-syntax-ns#type Parking

    → http://vocab.datex.org/terms#UrbanParkingSite Uniform Resource Identifiers (URIs)
  13. A story of raising interoperability ↓ Querying syntactic semantic technical

    legal When I have 2 datasets, how easy is it to turn them into 1 dataset? → Open Definition & open licenses → The Internet: exchanging data world-wide → JSON, XML, CSV, HTML… Open Standards → Linked Data
  14. Open Data is only a legal definition… but: The 5

    stars of Linked Open Data http://5stardata.info
  15. Sharing data between 2 systems Your system Third party system

    Agree on a protocol Will determine which questions can be answered in a timely fashion Can ask questions to your system as previously agreed
  16. Sharing data on the Web Your system ? ? ?

    ? ? ? Maximizing reuse → need to raise the interoperability
  17. data dump Ask any question How to allow for asking

    any kind of query? Your system 3d party Your system ? ? ? ? ? ?
  18. data dump Ask any question Asking questions Your system 3d

    party Your system ? ? ? ? ? ? Data publishing: Cacheable Dataset split in fragments
  19. A long tail for for e.g., transport data services ...

    Hard to guess which kind of queries will be needed … More specific features Size of audience Google maps Proximus CityMapper Go-OV Ally Transit App NextTrain smartwatch
  20. data dump Route planning algorithms as a service Asking questions

    Your system 3d party Your system ? ? ? ? ? ? Does not scale: Extra users comes with extra load Does not give necessary flexibility to companies
  21. Discover all the necessary data on the Web Just like

    websites, we want your data to be high available
  22. API fanboys Real data reusers Need Open Data Want services

    on top of data What we ask data owners
  23. What we ask data owners Data dumps Smart servers Data

    publishing (cheap/reliable) Data services (rather expensive/unreliable) Entire query languages over HTTP Dataset split in fragments Smart agents algorithms as a service Read more at http://linkeddatafragments.org API fanboys Open Data
  24. Business model? API fanboys Real data reusers Need Open Data

    Need services on top of data Business opportunity?
  25. Servers publishing Open Data e.g., • all the planned and

    actual arrivals and departures • the network of roads in a certain region worldwide web-services e.g., • a route planner: from → to • the closest station to your current location? Scalable businesses $$$ $ $ $$$ end-users
  26. We want a world where knowledge creates power for the

    many, not the few. Looking for a student job in Open Data? Check out http://summerofcode.be