Citizen Science with Python
@IanOzsvald – ianozsvald.com
Ian Ozsvald
PyConLT 2019 Keynote
Slide 2
Slide 2 text
Interim Chief Data Scientist
19+ years experience
How do we “do good” with data?
Introductions
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 3
Slide 3 text
Short stories on Citizen Science
Tips for how you could do the same
Short demo
Goals today
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 4
Slide 4 text
Macedonian air quality
By [ian]@ianozsvald[.com] Ian Ozsvald
The “smelly fog” in Skopje
Gorjan Jovanovski (when 21)
Government open data showed “4* more pollution than
Beijing and 20* EU limits”
www.theair.app
Photo: https://www.facebook.com/ilijoski
Slide 5
Slide 5 text
Initially a JSON dump
1 mil. People in 1 mo.
Visualisation in Parliament
Challenged by a Minister
Drove Government policy
Political debate and change
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 6
Slide 6 text
Get a public dataset – most people can’t but you can
Load it, investigate
Make some graphs with matplotlib
Can you tell a story?
How to start?
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 7
Slide 7 text
Updating outdated medical results
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 8
Slide 8 text
Updating outdated medical results
By [ian]@ianozsvald[.com] Ian Ozsvald
Friedman 1955
Stages of labour
by cervix dilation
Different drugs,
ages, technologies
Significant medical
decisions based on the result
Slide 9
Slide 9 text
Updating outdated medical results
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 10
Slide 10 text
Actionable results
By [ian]@ianozsvald[.com] Ian Ozsvald
First or ‘many’ births?
Weight (before delivery)
Age
Slide 11
Slide 11 text
Check for out-dated assumptions
Gather data & visualise
Produce simple models (if/then models are great!)
Build up iteratively as you gain trust
How to start?
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 12
Slide 12 text
Dirk Gorissen – track 6 Orangutangs in 2000 km^2 of
Bornean jungle
Radio pings, drones and signal processing
Where’s Bunga the Orangutang
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 13
Slide 13 text
Sw Defined Radio, Python processing
Automated flight paths
SEE VIDEO
Success on second round!
Where’s Bunga the Orangutang
By [ian]@ianozsvald[.com] Ian Ozsvald
https://dirkgorissen.com/2019/01/06/wheres-pinoh-tracking-orangutans-with-drones-and-gnu-radio/
Slide 14
Slide 14 text
Automate a manual process
Note – drones are complex, don’t start here!
Collect data, analyse, visualise signal, make decisions
How to start?
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 15
Slide 15 text
Hackathon
Can we get more people registered to vote?
Using only open data and adwords
Thanks Alex, Emiliano, Harry, John, Laurens, Nick
Improving political engagement
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 16
Slide 16 text
Improving political engagement
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 17
Slide 17 text
Improving political engagement
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 18
Slide 18 text
Jupyter Notebooks and open data
Can we challenge a Guardian news article and fix it?
Thanks to Tom Forth @thomasforth
Short demo
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 19
Slide 19 text
Tell useful data stories
Where’s PyData Vilnius? Start it?
Thank your organisers & speakers!
Send me a postcard if I taught you something?
Summary
By [ian]@ianozsvald[.com] Ian Ozsvald
Slide 20
Slide 20 text
“Monitoring Personal Air Quality”
By [ian]@ianozsvald[.com] Ian Ozsvald
Github.com/OxygenLithium
PyLondinium 2018 talk by Robin & Oliver