Slide 1

Slide 1 text

RESEARCH DATA MANAGEMENT + SHARING
Brianna Marshall | @notsosternlib
UW-Madison Libraries | #uwECR

Slide 2

Slide 2 text

about me
Brianna Marshall
Digital Curation Coordinator, UW Libraries
Lead, Research Data Services
– Education + training
– Consultations
– Data management plans (DMPs)
researchdata.wisc.edu
@UWMadRschSvcs

Slide 3

Slide 3 text

caveats
• Given limited time, I’ve chosen to mention things - even just briefly! - rather than forgo them entirely
• Hopefully you’ll get ideas for concepts and tools to explore later
• Expectations and best practices are often field-specific, so it’s tough to generalize

Slide 4

Slide 4 text

hopes + dreams
you’ll find something I talk about today useful + put it into practice
and / or
you’ll be reenergized enough by the topic to find something else that works for you
you’ll share your top data management tips with us
you’ll tell me what you want to know more about for future workshops!

Slide 5

Slide 5 text

what is research data?
“the recorded factual information commonly accepted in the scientific community as necessary to validate research findings.”
INCLUDES: code, figures, statistics, interviews, transcripts
EXCLUDES: preliminary analyses, drafts of papers, plans for further research, communication + peer reviews, physical samples
– OMB Circular, White House

Slide 6

Slide 6 text

time to ponder
Can you still access your data from…
– 10 years ago?
– 5 years ago?
– 1 year ago?
Let’s talk about the data you’ve kept and lost.

Slide 7

Slide 7 text

time to ponder
Can you still access your digital stuff from…
– 10 years ago?
– 5 years ago?
– 1 year ago?
Let’s talk about the digital stuff you’ve kept and lost.

Slide 8

Slide 8 text

data horror stories
Image courtesy of Flickr user wolfgangfoto (CC BY ND)

Slide 9

Slide 9 text

you can’t find it.
https://i.imgflip.com/ntnjb.jpg

Slide 10

Slide 10 text

or you can’t understand it.
http://cdn.meme.am/instances/58392702.jpg

Slide 11

Slide 11 text

or it’s long gone.
https://community.spiceworks.com/topic/813225-best-backup-recovery-memes

Slide 12

Slide 12 text

federal funding requirements
Data management plans (DMPs) are required by all federal funding agencies.
Office of Science and Technology Policy (OSTP) memo
– Released spring 2013; took effect fall 2015
– Requires open sharing of published articles and data
– Publication repository is provided; data repository is not
– Applies to agencies with $100M+ in R&D

Slide 13

Slide 13 text

dmptool.org

Slide 14

Slide 14 text

what’s up with researchers?
Ample technology to generate data but few skills to manage it effectively
Movement toward openness, impacted by OSTP and spurred by early career researcher expectations
Disciplinary culture shifts toward data reuse + reproducibility
Need for multi-purpose online spaces to collaborate, share, store, and archive research outputs (including data)

Slide 15

Slide 15 text

DATA MANAGEMENT

Slide 16

Slide 16 text

data management basics
File organization
• Is your data organized meaningfully or jumbled together? Do you know where your data is?
Documentation
• How much contextual information accompanies your data? Can you understand it? Can a stranger understand it?
Storage & backup
• Where is your data stored and backed up? Could you recover from hardware failure or accidental deletion?
Media obsolescence
• Do you know how the software, hardware, and file formats you use will impact your data’s readability in the future?

Slide 17

Slide 17 text

file naming conventions
• Use them any time you have related files
• Consistent
• Short yet descriptive
• Avoid spaces and special characters
example
File001.xls vs. Project_instrument_location_YYYYMMDD.xls
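A naming convention is easier to keep consistent when a small script builds and checks the names. The following is a minimal sketch, assuming the Project_instrument_location_YYYYMMDD pattern from the slide; the function names and the allowed character set are illustrative assumptions, not part of the original talk.

```python
import re
from datetime import date

# Each part may contain letters, digits and hyphens only.
# Underscores are reserved as the separator between parts.
SAFE_PART = re.compile(r"[A-Za-z0-9-]+")


def build_filename(project: str, instrument: str, location: str,
                   when: date, extension: str) -> str:
    """Return a name shaped like Project_instrument_location_YYYYMMDD.ext."""
    parts = [project, instrument, location]
    for part in parts:
        if not SAFE_PART.fullmatch(part):
            raise ValueError(f"spaces or special characters in {part!r}")
    stamp = when.strftime("%Y%m%d")  # YYYYMMDD sorts chronologically
    return "_".join(parts + [stamp]) + "." + extension.lstrip(".")


def follows_convention(filename: str) -> bool:
    """Check for three safe parts, an 8-digit date stamp, and an extension."""
    pattern = r"[A-Za-z0-9-]+_[A-Za-z0-9-]+_[A-Za-z0-9-]+_\d{8}\.[A-Za-z0-9]+"
    return re.fullmatch(pattern, filename) is not None
```

For example, `build_filename("WaterQuality", "sonde", "LakeMendota", date(2014, 10, 30), "xls")` gives a name in the recommended shape (the instrument name "sonde" is a made-up placeholder), while `follows_convention("File001.xls")` flags the uninformative name.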

Slide 18

Slide 18 text

directory/folder organization
Lots of possibilities, so consider what makes sense for your project
– File type
– Date
– Type of analysis
example:
MyDocuments\Research\Sample12.tiff vs. C:\NSFGrant01234\WaterQuality\Images\LakeMendota_20141030.tiff
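The organized example path can be read as grant / topic / file type / file. A minimal sketch of composing paths that way with Python's `pathlib` follows; the function name and the choice of levels are assumptions for illustration, and another project might order them by date or type of analysis instead.

```python
from pathlib import PureWindowsPath


def organized_path(root: str, grant: str, topic: str,
                   file_type: str, filename: str) -> PureWindowsPath:
    """Compose root / grant / topic / file type / file into one path."""
    return PureWindowsPath(root, grant, topic, file_type, filename)


# Rebuild the example from the slide, one level at a time.
path = organized_path("C:\\", "NSFGrant01234", "WaterQuality",
                      "Images", "LakeMendota_20141030.tiff")
```

Because every level is an explicit argument, a file cannot be saved without deciding where it belongs.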

Slide 19

Slide 19 text

retroactive organization
• Do a data inventory. List all the places where your data lives (both physical and digital)
• Make a plan for consolidating – follow the rule of 3, not the rule of 17
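For the digital half of a data inventory, a short script can list what is stored where. This is a minimal sketch, assuming the locations are folders reachable from one computer; the function name and the recorded fields are illustrative choices, and physical media still have to be listed by hand.

```python
import os
from datetime import datetime
from pathlib import Path


def inventory(locations):
    """List every file under the given folders with size and last-modified date."""
    rows = []
    for location in locations:
        for folder, _subfolders, files in os.walk(location):
            for name in files:
                path = Path(folder) / name
                info = path.stat()
                rows.append({
                    "location": str(location),
                    "path": str(path),
                    "bytes": info.st_size,
                    "modified": datetime.fromtimestamp(info.st_mtime).date().isoformat(),
                })
    # Sort by path so the same inventory run twice is easy to compare.
    return sorted(rows, key=lambda row: row["path"])
```

Calling `inventory([...])` with each drive or folder you use returns one row per file, which can then be reviewed when planning what to consolidate.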

Slide 20

Slide 20 text

documentation
Coded SPSS survey responses
(Useless without the original questionnaires)

Slide 21

Slide 21 text

document on many levels
Project- & folder-level
– Create a readme file. (Good example located here: http://hdl.handle.net/2022/17155)
– Document any data processing and analyses.
– Don’t forget written notes.
Item-level
– Remember the importance of file names for conveying descriptive information.
– Find and adhere to disciplinary metadata standards
  • XML
  • Dublin Core

Slide 22

Slide 22 text

what’s in a good readme file?
• Names + contact information for people associated with the project
• List of files, including a description of their relationship to one another
• Copyright + licensing information
• Limitations of the data
• Funding sources / institutional support
tl;dr
Any information necessary for someone with no knowledge of your research to understand and / or replicate your work.
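The checklist above maps naturally onto a readme template. Below is a minimal sketch that assembles a plain-text readme from those five items; the function name, section headings, and layout are assumptions for illustration, not a required format.

```python
def readme_text(title, contacts, files, license_info, limitations, funding):
    """Assemble a plain-text readme from the items on the checklist.

    contacts: list of (name, email) pairs
    files: dict mapping file name -> description, including how files relate
    """
    lines = [title, "=" * len(title), ""]
    lines += ["CONTACTS"] + [f"- {name} <{email}>" for name, email in contacts] + [""]
    lines += ["FILES"] + [f"- {name}: {text}" for name, text in files.items()] + [""]
    lines += ["COPYRIGHT + LICENSING", license_info, ""]
    lines += ["LIMITATIONS", limitations, ""]
    lines += ["FUNDING", funding, ""]
    return "\n".join(lines)
```

The returned string can be saved as a readme file in the project folder; keeping every section as a required argument makes it harder to leave one out.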

Slide 23

Slide 23 text

example readme file

Slide 24

Slide 24 text

storage & backup
storage = working files. The files you access regularly and change frequently. In general, losing your storage means losing current versions of the data.
backup = regular process of copying data separate from storage. You don’t really need it until you lose data, but when you need to restore a file it will be the most important process you have in place.
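To make the storage/backup distinction concrete, here is a minimal sketch of one backup step: copy a working file to a separate folder and confirm the copy matches. The function names are illustrative assumptions, and a real backup routine would also run on a schedule and keep older versions.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 checksum of a file, read in 1 MiB chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def back_up(working_file, backup_folder):
    """Copy a working file into a backup folder and verify the copy."""
    backup_folder = Path(backup_folder)
    backup_folder.mkdir(parents=True, exist_ok=True)
    # copy2 also preserves timestamps where the platform allows it.
    copy = Path(shutil.copy2(working_file, backup_folder))
    if sha256_of(copy) != sha256_of(working_file):
        raise IOError(f"backup of {working_file} does not match the original")
    return copy
```

Comparing checksums is what turns "I copied it" into "I can restore it": a silently corrupted copy is caught at backup time rather than at restore time.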

Slide 25

Slide 25 text

rule of 3
Keep THREE copies of your data
– TWO onsite
– ONE offsite
Example
– One: Network drive
– Two: External hard drive
– Three: Cloud storage
This ensures that your storage and backups are not all in the same place – that’s too risky!
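The rule of 3 can be written down as a simple check. This is a minimal sketch, assuming each copy is described only by a label and whether it is offsite; the class and function names are illustrative.

```python
from dataclasses import dataclass


@dataclass
class DataCopy:
    label: str
    offsite: bool


def follows_rule_of_three(copies):
    """True when there are at least two onsite copies and one offsite copy."""
    onsite = [c for c in copies if not c.offsite]
    offsite = [c for c in copies if c.offsite]
    return len(copies) >= 3 and len(onsite) >= 2 and len(offsite) >= 1


# The example from the slide: two onsite copies plus one offsite copy.
example = [
    DataCopy("Network drive", offsite=False),
    DataCopy("External hard drive", offsite=False),
    DataCopy("Cloud storage", offsite=True),
]
```

Here `follows_rule_of_three(example)` is true, while three copies that all sit in the same office would fail the check.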

Slide 26

Slide 26 text

Original clipart from http://cliparts.co/clipart/2532461. Modified version made available as CC0.

Slide 27

Slide 27 text

evaluating cloud services
• Lots of options out there – and not all are created equal
• Read the Terms of Service!
• While at UW, use your free UW Box or Google Drive accounts

Slide 28

Slide 28 text

http://www.doit.wisc.edu/news/collaboration-tools-google-docs-vs-box-2

Slide 29

Slide 29 text

media obsolescence
• software
• hardware
• file formats
CC image by Flickr user wlef70

Slide 30

Slide 30 text

thwarting obsolescence
• You can’t.
• Today’s popular software can become obsolete through business deals, new versions, or a gradual decline in user base. (Consider WordPerfect.)
• Anticipate average lifespan of media to be 3-5 years. Migrate your files every few years, if not more frequently!

Slide 31

Slide 31 text

thwarting obsolescence
• Some file formats are less susceptible to obsolescence than others
– Open, non-proprietary formats (pick TXT over DOCX, CSV over XLSX, TIF over JPG)
– Wide adoption
– History of backward compatibility
– Metadata support in open format (XML)
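The format pairs on the slide can be turned into a quick check over a list of file names. This is a minimal sketch; the mapping holds only the three pairs named above, and the function name is an illustrative assumption. It suggests a target format but does not convert anything, since conversion can lose formatting or formulas.

```python
from pathlib import Path

# Pairs taken from the slide: current format -> suggested more durable format.
OPEN_ALTERNATIVES = {".docx": ".txt", ".xlsx": ".csv", ".jpg": ".tif"}


def suggest_open_format(filename):
    """Return the suggested extension, or None if no suggestion applies."""
    return OPEN_ALTERNATIVES.get(Path(filename).suffix.lower())
```

Running this over a data inventory gives a short list of files worth migrating first.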

Slide 32

Slide 32 text

back to (data management) basics
File organization
• Is your data organized meaningfully or jumbled together? Do you know where your data is?
Documentation
• How much contextual information accompanies your data? Can you understand it? Can a stranger understand it?
Storage & backup
• Where is your data stored and backed up? Could you recover from hardware failure or accidental deletion?
Media obsolescence
• Do you know how the software, hardware, and file formats you use will impact your data’s readability in the future?

Slide 33

Slide 33 text

DATA SHARING + PUBLICATION

Slide 34

Slide 34 text

get credit for your data
• Many ways to share/publish your data!
– Institutional + disciplinary repositories
– Data papers/journals
• If your research is federally funded, remember that you’ll now have to share your data
• Data is not copyrightable; best practice is to apply a Creative Commons 0 license
• There’s even a proven citation advantage to sharing your data*
*Piwowar HA, Vision TJ. (2013) Data reuse and the open data citation advantage. PeerJ 1:e175 https://dx.doi.org/10.7717/peerj.175

Slide 35

Slide 35 text

minds.wisconsin.edu

Slide 36

Slide 36 text

dryad.org

Slide 37

Slide 37 text

figshare.com
http://figshare.com/articles/Prevalence_and_use_of_Twitter_among_scholars/104629

Slide 38

Slide 38 text

zenodo.org

Slide 39

Slide 39 text

nature.com/sdata/

Slide 40

Slide 40 text

github.com

Slide 41

Slide 41 text

No content

Slide 42

Slide 42 text

re3data browse functionality

Slide 43

Slide 43 text

Image courtesy of Flickr user davegray (CC BY ND)

Slide 44

Slide 44 text

Image courtesy of Flickr user mabi (CC BY SA)

Slide 45

Slide 45 text

FINAL THOUGHTS

Slide 46

Slide 46 text

final thoughts
• Think about how your existing data management practices will impact your ability to access your data days/weeks/years from now.
• If organizing retroactively, prioritize your most important research.
• Managing digital stuff requires a LOT of decision making, so embrace it!
• Any plan is better than no plan at all. Start today. Ask for help.

Slide 47

Slide 47 text

my suggestion?
Grant or not, start new projects with a data management plan compiled by project leaders.
The plan should cover:
• Organization + naming
• Documentation + metadata
• Storage + sharing
• Any and all other pertinent details. (The more the better; it’ll save you headaches later.)
The plan should be actively revisited and adapted as needed throughout the project.

Slide 48

Slide 48 text

GOOD DATA MANAGEMENT + sharing = BETTER, FASTER research

Slide 49

Slide 49 text

No content

Slide 50

Slide 50 text

researchdata.wisc.edu

Slide 51

Slide 51 text

upcoming digital scholarship workshops
An Introduction to Open Research DECEMBER 10 AVAILABLE ONLINE Project Management + Productivity Tools Crafting Your Digital Identity
Steenbock Library BioCommons | 4-5pm

Slide 52

Slide 52 text

BRIANNA MARSHALL
@notsosternlib
[email protected]