Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Getting data & why it matters

Getting data & why it matters

Presented at Nepal Data Bootcamp with various live demos, including:

* Example project: Dollars for docs- http://projects.propublica.org/docdollars/

Catalogues:

* Nepal Bureau of Statistics - http://cbs.gov.np/
* World Bank Data Catalogue - http://data.worldbank.org/
* WB Country Factsheet - http://data.worldbank.org/country/nepal
* WB Country Map - http://maps.worldbank.org/sa/nepal
* UN Data search - http://data.un.org/Search.aspx?q=nepal
* Collaborative data hub - http://datahub.io/
* Open Nepal data portal - http://opendatanepal.org/

Search:

* Searching google with Operators: https://www.google.de/advanced_search?q=inurl:gov.np+filetype:pdf&hl=en&biw=1024&bih=670&noj=1

Everything is data:

* Nepal Stock index: http://www.myrepublica.com/portal/index.php?action=stocks
* Energy downtimes: http://www.myrepublica.com/portal/index.php?action=pages&page_id=8
* CA members: http://can.gov.np/en/ca_members/index
* Movies on Wikipedia: http://en.wikipedia.org/wiki/List_of_Nepalese_films

Extraction tools

* Google Chrome Scraper Extension - https://chrome.google.com/webstore/detail/scraper/mbigbapnjcgaffohmbkdlecaccepngjd?hl=en
* Google Spreadsheets ImportHTML: http://support.google.com/drive/bin/answer.py?hl=en&answer=155182
* ScraperWiki for little programs: https://scraperwiki.com/

Dealing with PDFs:

* Budget 2012/13 as a PDF: http://mof.gov.np/article-Content-bWlbjg==
* Tabula: http://tabula.nerdpower.org/
* Cometdocs: http://www.cometdocs.com/
* FineReader (Desktop application) - http://finereader.abbyy.com/

Friedrich Lindenberg

June 03, 2013
Tweet

More Decks by Friedrich Lindenberg

Other Decks in Technology

Transcript

  1. Checklist •Who made it? Why? When? •How did they make

    it? •What is missing? •Which other data relates to it?