Goals What metrics for research data do researchers and data managers want? Do data repositories make these metrics available? Build services to collect these metrics for all datasets in DataONE repository network
MDC Team Peter Slaughter Dave Vieglais Matt Jones Stephen Abrams John Kratz Patricia Cruse Carly Strasser Jennifer Lin Kristen Ratan John Chodacki Martin Fenner Project Partners California Digital Library (CDL) DataONE Public Library of Science (PLOS)
THE VALUE OF RESEARCH DATA Metrics for datasets from a cultural and technical point of view http://repository.jisc.ac.uk/6205/1/Value_of_Research_Data.pdf
How interested would you be to know each of the following about the impact of your data? http://doi.org/10.1038/sdata.2015.39 http://www.dx.doi.org/10.5060/D8H59D
Wrote import pipeline to regularly import new DataONE datasets. Handles persistent identifiers beyond DOIs, including URLs. http://dlm.datacite.org/status
Wrote new sources for data metrics, including DataONE usage stats and data citations found in open access content https://dlm.datacite.org/sources/europe_pmc_fulltext
Metadata of articles References are part of the metadata deposited to CrossRef Cited-by service aggregates these citations for CrossRef DOIs Work is underway to include Crossref DOI <-> DataCite DOI links
http://doi.org/10.1038/ncomms9212 … For instance, although there are estimated 18,000 butterfly species, there are currently only 6 butterfly genome sequences7, 8, 9, 10, 11… Citations 7-11 are all for journal articles, not datasets. Second Order Events
Next Steps Analyze usage statistics in more detail Analyze second order citations Analyze influence of persistent identifier Do similar project with scientific software Turn research project into service