Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Tools for Data Journalism | MediaLab Prado DDJ ...
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Friedrich Lindenberg
October 23, 2015
0
250
Tools for Data Journalism | MediaLab Prado DDJ Workshop
Presented Fri, 23.10.2015
Friedrich Lindenberg
October 23, 2015
Tweet
Share
More Decks by Friedrich Lindenberg
See All by Friedrich Lindenberg
Introducción a OCCRP Data
pudo
0
420
Getting started with OCCRP Data
pudo
0
1.6k
#nr16: Recherche-Tools
pudo
1
110
data.occrp.org
pudo
0
170
Digitial Research Tools for Investigative Reporters
pudo
0
11k
Grano: A Python tool for investigating influence
pudo
1
290
Data doesn't grow in tables
pudo
2
280
Dr. Freezefile
pudo
2
440
Intro presentation for Naivasha
pudo
1
170
Featured
See All Featured
Conquering PDFs: document understanding beyond plain text
inesmontani
PRO
4
2.3k
Information Architects: The Missing Link in Design Systems
soysaucechin
0
780
A better future with KSS
kneath
240
18k
A Soul's Torment
seathinner
5
2.3k
Design of three-dimensional binary manipulators for pick-and-place task avoiding obstacles (IECON2024)
konakalab
0
350
Lightning talk: Run Django tests with GitHub Actions
sabderemane
0
120
The browser strikes back
jonoalderson
0
390
Six Lessons from altMBA
skipperchong
29
4.2k
Skip the Path - Find Your Career Trail
mkilby
0
57
Beyond borders and beyond the search box: How to win the global "messy middle" with AI-driven SEO
davidcarrasco
1
56
Marketing to machines
jonoalderson
1
4.6k
How People are Using Generative and Agentic AI to Supercharge Their Products, Projects, Services and Value Streams Today
helenjbeal
1
130
Transcript
Tools for data journalism
I’m @pudo I make software for journalists at @occrp
None
Find data where no (wo)man has gone before
Dig into bureaucracy
None
None
Everything is data
None
Voluntary Release Involuntary Release Active acquisition FoI Scraping Passive acquisition
Open Data Leaks
All web pages are data! import.io / Chrome Scraper /
ScraperWiki
Documents, too! Tabula PDF / CometDocs / ABBYY
Collect a treasure (and share it!)
None
Interview the data
Data cleaning with Google Refine
Google Sheets & Pivot Tables
WARNING: Use your brain
Make a point
None
None
DataWrapper
CartoDB
When is a map a map?
None
None
None
If you like, learn to code
JavaScript/D3 SQL/Databases Python for scraping
[email protected]