Slide 1

Slide 1 text

July 29, 2013
University of Oxford, Department of Computer Science
Wikidata: A Free Collaborative Knowledge Base
Markus Krötzsch, University of Oxford

Slide 2

Slide 2 text

July 29, 2013 Markus Krötzsch: Big Deduction Page 2

Slide 3

Slide 3 text


Slide 4

Slide 4 text

Wikidata
• Official “Wikipedia Database”
• For all 285 language editions
• Very recent:
  • Live since November 2012
  • Enabled on all Wikipedia editions since March 2013
• Ongoing development led by Wikimedia Germany

Slide 5

Slide 5 text

Contents and Data Model
• Language links
• Labels, descriptions, aliases
• Statements
• Ongoing work: more datatypes, ranking statements, …
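The parts of an item listed on this slide can be pictured as a small data structure. The following is only a minimal sketch for illustration: the class names `Item` and `Statement`, their field names, and the `label` helper are assumptions made here, not Wikidata's actual internal representation, and statements are reduced to plain property-value pairs. The sample values are illustrative only.

```python
from dataclasses import dataclass, field


@dataclass
class Statement:
    """Hypothetical simplification: one property-value pair about an item."""
    property_id: str   # e.g. "P31"
    value: object      # another item id, a string, a number, ...


@dataclass
class Item:
    """Hypothetical sketch of the item parts named on the slide."""
    item_id: str
    labels: dict = field(default_factory=dict)          # language code -> label
    descriptions: dict = field(default_factory=dict)    # language code -> description
    aliases: dict = field(default_factory=dict)         # language code -> list of aliases
    language_links: dict = field(default_factory=dict)  # wiki site id -> page title
    statements: list = field(default_factory=list)      # list of Statement

    def label(self, language, fallback="en"):
        """Return the label in `language`, then the fallback, then the bare id."""
        return self.labels.get(language, self.labels.get(fallback, self.item_id))


# Illustrative sample item (values chosen for the example only).
example = Item(
    item_id="Q42",
    labels={"en": "Douglas Adams"},
    descriptions={"en": "English writer"},
    aliases={"en": ["Douglas Noel Adams"]},
    language_links={"enwiki": "Douglas Adams"},
    statements=[Statement("P31", "Q5")],
)
print(example.label("de"))  # no German label stored, so the English one is used
```

Keeping labels, descriptions, and aliases keyed by language code mirrors the idea that one item serves all language editions at once.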

Slide 6

Slide 6 text

Growth
• Status as of 17 July 2013:
  • 800k-900k human edits per month
  • 13M items, 14M statements
  • 15M edits in May 2013 (500k per day)

Slide 7

Slide 7 text

Growth

Slide 8

Slide 8 text

Application Areas
• Labels and descriptions
• Identifiers
• Data access
• Advanced analytics

Slide 9

Slide 9 text

Third-party applications

Slide 10

Slide 10 text

Person Data
• See www.wikidata.org

Slide 11

Slide 11 text

Getting the Data
• See www.wikidata.org/wiki/Wikidata:Data_access
• Direct access per item (Web API, RDF/JSON/...)
• Database dumps (full dumps + daily changes)
• Full dumps in more convenient formats planned
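Direct access per item through the Web API might look roughly like the sketch below. It assumes the `wbgetentities` action of the MediaWiki API at www.wikidata.org and the JSON layout that action returns; the helper names (`build_entity_url`, `extract_labels`, `fetch_labels`) and the User-Agent string are made up for this example. The data access page linked above remains the authoritative description.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

API_ENDPOINT = "https://www.wikidata.org/w/api.php"


def build_entity_url(item_id, languages=("en",)):
    """Build a wbgetentities request URL for one item (hypothetical helper)."""
    params = {
        "action": "wbgetentities",
        "ids": item_id,
        "props": "labels|descriptions",
        "languages": "|".join(languages),
        "format": "json",
    }
    return API_ENDPOINT + "?" + urlencode(params)


def extract_labels(response, item_id):
    """Pull a {language: label} mapping out of a decoded API response."""
    entity = response.get("entities", {}).get(item_id, {})
    return {lang: entry["value"] for lang, entry in entity.get("labels", {}).items()}


def fetch_labels(item_id, languages=("en",)):
    """Fetch labels over the network; requires internet access."""
    request = Request(
        build_entity_url(item_id, languages),
        headers={"User-Agent": "wikidata-slides-example/0.1"},  # placeholder value
    )
    with urlopen(request, timeout=10) as reply:
        return extract_labels(json.load(reply), item_id)


# Offline illustration with a hand-written response in the assumed layout.
sample = {"entities": {"Q42": {"labels": {"en": {"language": "en", "value": "Douglas Adams"}}}}}
print(build_entity_url("Q42"))
print(extract_labels(sample, "Q42"))
```

Calling `fetch_labels("Q42", ("en", "de"))` would perform the live request. For bulk processing, the database dumps mentioned on the slide are likely the better fit than many per-item calls.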

Slide 12

Slide 12 text

Conclusions
• Wikidata is developing rapidly
  • Data size
  • Vocabulary size
  • Technical features and community processes
• A platform for data integration
  • Including links to many other databases
• Data access is easy, both legally and technically
  • Further improvements planned for exports