Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Wiki[mp]edia data sources & the MediaWiki API

Wiki[mp]edia data sources & the MediaWiki API

Presented at Melbourne Hack Weekend, 2009.

Avatar for Brianna Laugher

Brianna Laugher

November 09, 2009
Tweet

More Decks by Brianna Laugher

Other Decks in Technology

Transcript

  1. ...

  2. {{Infobox Company |name = Lonely Planet |logo = |type =

    [[United Kingdom|British]] [[Government-owned company|government-owned]] (subsidiary of [[BBC Worldwide]]) |genre = [[Guide book|Travel guides]] |foundation = 1972 |founder = Tony Wheeler<br /> Maureen Wheeler |location_city = [[Footscray, Victoria]] |location_country = [[Australia]] |location = |origins = |key_people = Matt Goldberg <small>(Global [[CEO]])</small> |area_served = Worldwide |industry = [[Multi media]] |products = Travel [[guidebook, digital applications, online travel community]] |services =
  3. Wiktionary 5M+ entries 170+ languages 13 languages > 100K entries

    French biggest at 1.5M (English second at 1.4M)
  4.  Users  Logs  Pages, subpages, talk pages 

    Links, backlinks  Templates  Categories MediaWiki structure
  5. DBpedia Community project extracting structured data from Wikipedia and making

    it available Can download data sets or query them online Ontology++ e.g. dbpedia.org/page/Lonely_Planet
  6. toolserver.org Server for community-developed plugins, addons, extensions, stats and hacks

    – tools Tools often explicitly implements implicit editing community standards (“community API”) Toolserver