Ric Roberts: Data Platforms and the Data Value Chain

Ric Roberts: Data Platforms and the Data Value Chain

This deck supports Ric's talk about the wider perspective of data publishing, where the act of publishing itself sits within that, what it entails and the advantages it brings. For all talks from the day, head to: http://power-of-data-2017.swirrl.com/

D3be0eabd21ee937951fc9c512f44891?s=128

Swirrl

June 15, 2017
Tweet

Transcript

  1. Data Platforms and the Data Value Chain CTO, @RicRoberts

  2. data

  3. data You are here.

  4. profit!

  5. profit! You want to be here.

  6. profit! Examples • Deciding the best place to put a

    new school • Benchmarking hospitals • Working out the impact of poor air quality on health • Calculating the cost of increased flood risk
  7. None
  8. data profit! ?

  9. data collect clean curate

  10. Examples • Surveys of various sorts • Administrative systems •

    Sensors e.g. detecting river flow • Social media data collect clean curate
  11. data ? profit!

  12. data use profit!

  13. use • exploring • filtering • aggregating • downloading •

    exporting • analysing (data science!) • generating reports (xls, pdf, doc, ppt) • using it in interactive apps or visualisations • sharing results
  14. data ? use profit!

  15. data connect use profit!

  16. connect • A common set of names for the things

    in the data. • A shared, documented and understood model of the data. • An agreed set of technologies for communicating and manipulating the data (standards!). • The data needs to be in a place people can get to it, in an relevant format (with a licence).
  17. https://www.flickr.com/photos/kewl/7006904747

  18. connect In computing, linked data is a method of publishing

    structured data so that it can be interlinked and become more useful through semantic queries. It builds upon standard Web technologies such as HTTP, RDF and URIs, but rather than using them to serve web pages for human readers, it extends them to share information in a way that can be read automatically by computers. This enables data from different sources to be connected and queried. — Wikipedia
  19. connect • A common set of names for the things

    in the data. • A common set of names for the things in the data. • An agreed set of technologies for communicating and manipulating the data (standards!). • The data needs to be in a place people can get to it, in an relevant format (with a licence).
  20. connect • A common set of names for the things

    in the data. • A shared, documented and understood model of the data. • An agreed set of technologies for communicating and manipulating the data (standards!). • The data needs to be in a place people can get to it, in an relevant format (with a licence).
  21. connect • A common set of names for the things

    in the data. • A shared, documented and understood model of the data. • An agreed set of technologies for communicating and manipulating the data (standards!). • The data needs to be in a place people can get to it, in an relevant format (with a licence).
  22. connect • A common set of names for the things

    in the data. • A shared, documented and understood model of the data. • An agreed set of technologies for communicating and manipulating the data (standards!). • The data needs to be in a place people can get to it, in an relevant format (with a licence).
  23. https://www.flickr.com/photos/iwannt/8596885627

  24. data ? connect use profit!

  25. data publish connect use profit!

  26. swirrl.com

  27. publish An (RDF) Graph Store Apache Jena

  28. publish Extract, Transform, Load (ETL) grafter.org github.com/swirrl/grafter

  29. publish Drafting and publication workflow

  30. publish A User Interface

  31. publish APIs

  32. None
  33. data publish connect use profit! collect clean curate What’s limiting

    the effectiveness of this value chain? • Cottage industry of skilled individuals • Data preparation is not always considering bigger picture • Those expending the costs != those reaping the benefits • Availability of skilled data analysts • Lack of guidance and standardisation
  34. data publish connect use profit! collect clean curate

  35. Data Platforms and the Data Value Chain CTO, @RicRoberts