Slide 1

Slide 1 text

Tools for data journalism

Slide 2

Slide 2 text

I’m @pudo I make software for journalists at @occrp

Slide 3

Slide 3 text

No content

Slide 4

Slide 4 text

Find data where no (wo)man has gone before

Slide 5

Slide 5 text

Dig into bureaucracy

Slide 6

Slide 6 text

No content

Slide 7

Slide 7 text

No content

Slide 8

Slide 8 text

Everything is data

Slide 9

Slide 9 text

No content

Slide 10

Slide 10 text

Voluntary Release Involuntary Release Active acquisition FoI Scraping Passive acquisition Open Data Leaks

Slide 11

Slide 11 text

All web pages are data! import.io / Chrome Scraper / ScraperWiki

Slide 12

Slide 12 text

Documents, too! Tabula PDF / CometDocs / ABBYY

Slide 13

Slide 13 text

Collect a treasure (and share it!)

Slide 14

Slide 14 text

No content

Slide 15

Slide 15 text

Interview the data

Slide 16

Slide 16 text

Data cleaning with Google Refine

Slide 17

Slide 17 text

Google Sheets & Pivot Tables

Slide 18

Slide 18 text

WARNING: Use your brain

Slide 19

Slide 19 text

Make a point

Slide 20

Slide 20 text

No content

Slide 21

Slide 21 text

No content

Slide 22

Slide 22 text

DataWrapper

Slide 23

Slide 23 text

CartoDB

Slide 24

Slide 24 text

When is a map a map?

Slide 25

Slide 25 text

No content

Slide 26

Slide 26 text

No content

Slide 27

Slide 27 text

No content

Slide 28

Slide 28 text

If you like, learn to code

Slide 29

Slide 29 text

JavaScript/D3 SQL/Databases Python for scraping

Slide 30

Slide 30 text

friedrich@pudo.org