of the verb, ‘to know.’ It used to mean ‘having information stored in one’s memory.’ It now means the process of having access to information and knowing how to use it.” ---Herbert Simon
heard about it! Is it real? Where are the applications for it? Is there a “killer app?” Who is using it? What role (if any) will it play in the Future Web? The Semantic Web is alive and well in Linked Open Data (LOD)!
of being composed of pages, documents We have been able to insert some data Images <img src=“….”> Multimedia Web 2.0 mashups provided a new way of thinking about a “Web of Data” but it was awkward to obtain APIs “Screen-scraping”
Designed for Human consumption Primary objects Documents (or sub-parts of) Links between Documents (or sub-parts of) Degree of structure in objects Fairly low Semantics of content and links Implicit
data, untyped links, disconnected data Integration Show me all the publications from HKU PhD students in Computer Science Querying Which papers have I written with colleagues outside the US?
database Designed for Machines first, humans later Primary objects Things (or descriptions of things) Links between Things Degree of structure in (descriptions of ) things High Semantics of content and links Explicit
the Web that Encourages reuse Reduces redundancy Maximizes its (real and potential) inter-connectedness Enables network effects to add value to data
Resource Identifier (URI) provides a simple and extensible means for identifying a resource” – RFC 3986 Many different schemes – http://, ftp://, tel:, urn:, mailto: Some URIs for “real world” things http://www.bebowhite.com/ http://dbpedia.org/page/University_of_Hong_Kong http://sws.geonames.org/1819729/
proper infrastructure for a real Web of Data Data is available on the Web Accessible via standard Web technologies Data are interlinked over the Web ie, data can be integrated over the Web This is where Semantic Web technologies come in
id_xyz Social Media Tools and Platforms in Learning Environments id_qpr 2011 ID Name Homepage id_xyz White, Bebo http://www.bebowhite.com ID Publisher’s name City id_qpr Springer New York
not necessarily mean physical conversion of the data Relations can be generated on-the-fly at query time Via SQL “bridges” Scraping HTML pages Extracting data from Excel sheets etc. One can export part of the data
We “connected” the data… But a simple connection is not enough… data should be named somehow Hence the RDF Triples: a labelled connection between two resources
that: • “s”, “p” are URI-s, ie, resources on the Web; “o” is a URI or a literal • “s”, “p”, and “o” stand for “subject”, “property”, and “object” • here is the complete triple: (<http://…isbn…6682>, <http://…/original>, <http://…isbn…409X>) • RDF is a general model for such triples • With machine readable formats like RDF/XML, Turtle, N3, RDFa, …
http://www.example.org/file.html#home http://www.example.org/file2.xml#xpath(//q[@a=b]) http://www.example.org/form?a=b&c=d RDF triples form a directed, labeled graph (the best way to think about them!)
de Medias Sociaux</f:titre> <f:original rdf:resource="http://…/isbn/3642203914"/> </rdf:Description> (Note: namespaces are used to simplify the URI-s) http://…isbn/2020386682 Outils de Medias Sociaux… http://…isbn/ 3642203914
Bristol Lab): A “Model” object is created The RDF file is parsed and results stored in the Model The Model offers methods to retrieve: triples (property,object) pairs for a specific subject (subject,property) pairs for specific object etc. The rest is conventional programming… Similar tools exist in Python, PHP, etc.
data onto an abstract data representation Make the data independent of its internal representation… Merge the resulting representations Start making queries on the whole! Queries not possible on the individual data sets
Anything, not just documents You are not your homepage Information resources and non-information resources Use HTTP URIs Globally unique names, distributed ownership Allows people to look up those names Provide useful information in RDF When someone looks up a URI Include RDF links to other URIs To enable discovery of related information
cases it’s our data! Efficiency, reducing redundancy Promotes a digital society Opens the door to data innovation and discovery Holds the promise of creating from data Knowledge Wisdom Benefit for all