Markus Kroetzsch: Wikidata: A Free Collaborative Knowledge Base

Transcript

  1. July 29, 2013, University of Oxford, Department of Computer Science

     Wikidata: A Free Collaborative Knowledge Base
     Markus Krötzsch, University of Oxford
  2. July 29, 2013 Markus Krötzsch: Big Deduction Page 4 Wikidata

     • Official “Wikipedia Database”
     • For all 285 language editions
     • Very recent:
         • Live since November 2012
         • Enabled on all Wikipedia editions since March 2013
     • Ongoing development led by Wikimedia Germany
  3. Contents and Data Model

     • Language links
     • Labels, descriptions, aliases
     • Statements
     • Ongoing work: more datatypes, ranking statements, …
  4. Growth

     • Status as of 17 July 2013:
         • 800k-900k human edits per month
         • 13M items, 14M statements
         • 15M edits in May 2013 (500k per day)
  5. Application Areas

     • Labels and descriptions
     • Identifiers
     • Data access
     • Advanced analytics
  6. Getting the Data

     • See www.wikidata.org/wiki/Wikidata:Data_access
     • Direct access per item (Web API, RDF/JSON/...)
     • Database dumps (full dumps + daily changes)
     • Full dumps in more convenient formats planned
  7. Conclusions

     • Wikidata is developing rapidly
         • Data size
         • Vocabulary size
         • Technical features and community processes
     • A platform for data integration
         • Including links to many other databases
     • Data access is easy, both legally and technically
         • Further improvements planned for exports
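
The "Getting the Data" slide mentions direct access per item through the Web API, and the "Contents and Data Model" slide lists what an item holds: language links, labels, descriptions, aliases, and statements. The following is a minimal Python sketch of how the two might fit together. The slides give no endpoint, so the URL, the `wbgetentities` action, and the JSON keys (`labels`, `descriptions`, `aliases`, `sitelinks`, `claims`) are assumptions based on the present-day MediaWiki/Wikibase API and may differ from what was available in 2013. The function names and the User-Agent string are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Assumption: present-day API endpoint; the slides do not state it.
API_URL = "https://www.wikidata.org/w/api.php"


def build_entity_url(item_id, language="en"):
    """Build a request URL for one item via the wbgetentities action."""
    params = {
        "action": "wbgetentities",
        "ids": item_id,
        "languages": language,
        "format": "json",
    }
    return API_URL + "?" + urlencode(params)


def fetch_entity(item_id, language="en"):
    """Download the JSON record of one item (requires network access)."""
    request = Request(
        build_entity_url(item_id, language),
        # Hypothetical client name; use one that identifies your own tool.
        headers={"User-Agent": "wikidata-transcript-example/0.1"},
    )
    with urlopen(request, timeout=30) as response:
        payload = json.load(response)
    return payload["entities"][item_id]


def summarise_entity(entity, language="en"):
    """Reduce an item record to the parts named on the data model slide."""
    labels = entity.get("labels", {})
    descriptions = entity.get("descriptions", {})
    aliases = entity.get("aliases", {})
    claims = entity.get("claims", {})  # statements, grouped by property
    return {
        "label": labels.get(language, {}).get("value"),
        "description": descriptions.get(language, {}).get("value"),
        "aliases": [alias["value"] for alias in aliases.get(language, [])],
        # Site keys of the linked wiki pages, e.g. "enwiki" or "dewiki".
        "language_links": sorted(entity.get("sitelinks", {})),
        "statement_count": sum(len(group) for group in claims.values()),
    }
```

A call such as `summarise_entity(fetch_entity("Q42"))` would fetch a single item and summarise it. For bulk analysis, the database dumps mentioned on the same slide are likely the better route than per-item requests.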