open data open source open access open educational resources infra & data technology open science transparency research integrity scholarly publishing increased collaboration speed-up production results increased'openness' throughout the researchcycle
what is open data? (from OKF) availability& access re-use and redistribution universal participation open data High-quality sensor networks, simulations small number of collectors usual suspects: administrativedata: PDOK, CBS climate & RS: copernicus, KNMI data tech: UCI ML repo, Kaggle
Alive and kicking in env sci and ecology for 2 decades, recently taking off in the climate sciences citizen science Variable data quality, but volume and pervasiveness is difficult to ignore principleof enablingindividuals with an electronic device to take measurements of the most immediate surroundings
Privacy Citizen science data might contain personal information: • Individual privacy must be protected under European laws • Organizations can't share personal information with the general public • Consequence: conflicting with the "open data" principle
Privacy personal experience • Legal: data transfer agreement between WUR, RIVM and ITC (UTwente) ❖ Tick data can be used for research within ITC ❖ Redistribution is prohibited ❖ Data was anonymized, but there was a location • Transfer: yearly update of observations • Be in sync: ❖ Progress meetings: how are you using data? ❖ Collaboration in research articles • Overall: positive experience ❖ Organizations were open to do research ❖ Terms and conditions were established ❖ Data was prepared fast (thanks!) ❖ Interested in getting feedback
Privacy walk in my shoes • Sometimes citizen science data can't be open and that might be a good thing • Open questions: • Do people actually need the (raw) observations? • What about making... ...aggregated products? ...services and real-time indicators?
Conclusion • Citizen science has the potential of contributing to open science: • Ubiquity of humans means we can monitor anything • Privacy issues need to be considered case by case • Researchers: better understand the importance of open science • Promote the use of open-source software and programming languages (reproducibility) • Depositing your research data in a public repository (findable, accessible, interoperable) • Organizations: assess whether data can be (partially) open • Open the door to new research lines • Explore ways of anonymizingdata • Release aggregated products instead of raw