Democratizing Data Science By: Rayna Harris @raynamharris Rebecca Calisi @BeccaCalisi With thanks to Titus Brown @ctitusbrown Greg Wilson @gvwilson Suggested reading: Reproducible Research in Computational Science by Roger D. Peng @rdpeng https://science.sciencemag.org/content/334/6060/1226 https://sci-hub.tw/https://science.sciencemag.org/content/334/6060/1226 May 2, 2019
tested as part of a National Geographic funded research project to sample the genomes of 100 human populations. Later, the data generated by her DNA is used to support a scientific claim <insert headline> and implicate you in a crime <insert headline>. How would you evaluate the data and the results?
within a society The minimum standard for judging scientific claims when full independent replication of a study is not possible • 1. Introduce a democratic system to data science. 2. Make data science accessible to everyone.
within a society The minimum standard for judging scientific claims when full independent replication of a study is not possible • 1. Introduce a democratic system to data science. 2. Make data science accessible to everyone. Methodological advances Compute power Public data sets
within a society The minimum standard for judging scientific claims when full independent replication of a study is not possible • 1. Introduce a democratic system to data science. 2. Make data science accessible to everyone. Community governance Open access education Methodological advances Compute power Public data sets
within a society The minimum standard for judging scientific claims when full independent replication of a study is not possible • 1. Introduce a democratic system to data science. 2. Make data science accessible to everyone. Community governance Open access education Cultural barriers Technology barriers Methodological advances Compute power Public data sets