Slide 1

Slide 1 text

CC-BY 4.0 Are (data) scientists bad at science? Munich Datageeks May Edition 2023 Heidi Seibold Heidiseibold.com Slides: https://bit.ly/3IvAzT0

Slide 2

Slide 2 text

Please ask questions!

Slide 3

Slide 3 text

Image: https://the-turing-way.netlify.app

Slide 4

Slide 4 text

Is this what we want? DOI: 10.1371/journal.pmed.0020124

Slide 5

Slide 5 text

No content

Slide 6

Slide 6 text

https://the-turing-way.netlify.app/reproducible-research/overview/overview-definitions.html

Slide 7

Slide 7 text

No content

Slide 8

Slide 8 text

No content

Slide 9

Slide 9 text

No content

Slide 10

Slide 10 text

How can we do better?

Slide 11

Slide 11 text

What are you doing well already?

Slide 12

Slide 12 text

No content

Slide 13

Slide 13 text

Project management = good organisation Let's not pretend: we're not geniuses ;P http://www.quickmeme.com/meme/3r98zx

Slide 14

Slide 14 text

Good organisation … starts simple ● Good naming ● Nice file organisation

Slide 15

Slide 15 text

Naming ● Myabstract.docx ● Joe’s Filenames Use Spaces and Punctuation.xlsx ● figure 1.png ● fig 2.png ● JW7d^(2sl@deletethisandyourcareerisoverWx2*.txt ● 2014-06-08_abstract-for-sla.docx ● Joes-filenames-are-getting-better.xlsx ● Fig01_scatterplot-talk-length-vs-interest.png ● Fig02_histogram-talk-attendance.png ● 1986-01-28_raw-data-from-challenger-o-rings.txt NO YES See slides by Jenny Brian

Slide 16

Slide 16 text

Naming ● 2014-06-08_abstract-for-sla.docx ● Joes-filenames-are-getting-better.xlsx ● Fig01_scatterplot-talk-length-vs-interest.png ● Fig02_histogram-talk-attendance.png ● 1986-01-28_raw-data-from-challenger-o-rings.txt YES File names should be: ➔ Machine readable ➔ Human readable ➔ Optional: Consistent ➔ Optional: Play well with default ordering

Slide 17

Slide 17 text

Organise your files and folders well . ├── analysis <- all things data analysis │ └── src <- functions and other source files ├── comm │ ├── internal-comm <- internal communication such as meeting notes │ └── journal-comm <- communication with the journal, e.g. peer review ├── data │ ├── data_clean <- clean version of the data │ └── data_raw <- raw data (don't touch) ├── dissemination │ ├── manuscripts │ ├── posters │ └── presentations ├── documentation <- documentation, e.g. data management plan └── misc <- miscellaneous files that don't fit elsewhere https://github.com/HeidiSeibold/research-project-template

Slide 18

Slide 18 text

What can data scientists do?

Slide 19

Slide 19 text

What can data scientists do? ● Work according to good practices ● Be a role model ● Collaborate ● Teach

Slide 20

Slide 20 text

heidiseibold.ck.page/posts/f eedback-wanted-building- a-digital-research-academy

Slide 21

Slide 21 text

More… Slides: bit.ly/3IvAzT0