Slide 1

Slide 1 text

Data curation as “publishing” for digital humanists Trevor Muñoz University of Maryland CLI Annual Conference 2013

Slide 2

Slide 2 text

@trevormunoz #ciccli13

Slide 3

Slide 3 text

Assistant Dean for Digital Humanities Research, University Libraries Associate Director, Maryland Institute for Technology in the Humanities (MITH)

Slide 4

Slide 4 text

Humanities Data Curation Summit June 2011 http://hdl.handle.net/2142/30852

Slide 5

Slide 5 text

DH Curation Guide July 2012 http://guide.dhcuration.org

Slide 6

Slide 6 text

Digital Humanities Winter Institute January 2013 http://ter.ps/dhwidc

Slide 7

Slide 7 text

Digital Humanities Data Curation Institute Summer 2013 - Spring 2014 http://www.dhcuration.org/institute

Slide 8

Slide 8 text

and more anon ...

Slide 9

Slide 9 text

Data curation as a “publishing” activity is increasingly important to digital humanists

Slide 10

Slide 10 text

Data curation as publishing is not the same as data publication

Slide 11

Slide 11 text

Data curation as publishing draws directly on the unique skills of librarians (unique value proposition!)

Slide 12

Slide 12 text

Data curation as publishing aligns directly with library missions and values

Slide 13

Slide 13 text

Data curation involves maintaining digital information that is produced in the course of research in a manner that preserves its meaning and usefulness as a potential input for further research.” “

Slide 14

Slide 14 text

Data curation is the active and on-going management of data through its lifecycle of interest and usefulness to scholarship, science, and education.” “ http://hdl.handle.net/2142/3493

Slide 15

Slide 15 text

curation activities enable data discovery and retrieval, maintain quality, add value, and provide for re-use over time.” “ http://hdl.handle.net/2142/3493

Slide 16

Slide 16 text

connections between data curation and publishing are not new

Slide 17

Slide 17 text

Digital Curation and E-Publishing: Libraries Make the Connection Choudhury, Furlough, and Ray 2009 http://bit.ly/maketheconnection

Slide 18

Slide 18 text

Now and Future of Data Publishing 2013 #nfdp13 Symposium at Oxford University

Slide 19

Slide 19 text

Humanists have data. Digital humanists need data curation.

Slide 20

Slide 20 text

A PhD student investigating historical demographics of American religion posts tables & plots of population estimates to GitHub http://bit.ly/mullendemo

Slide 21

Slide 21 text

Literature scholars trying to understand the scope and structure of literary history collect, clean, and normalize data from HathiTrust http://usesofscale.com/

Slide 22

Slide 22 text

Data sets as enticement for digital humanists

Slide 23

Slide 23 text

Help - anyone have (or know of) openly licensed datasets (preferably heritage related) that can be used in @chi_initiative fieldschool?” “ http://bit.ly/canhazdata

Slide 24

Slide 24 text

IndexCat National Library of Medicine http://bit.ly/indexcat

Slide 25

Slide 25 text

What’s on the Menu? New York Public Library http://menus.nypl.org/data

Slide 26

Slide 26 text

Baldwin Library of Historical Children’s Literature University of Florida Libraries http://bit.ly/baldwinlib

Slide 27

Slide 27 text

and many more

Slide 28

Slide 28 text

and many more

Slide 29

Slide 29 text

Data curation as a “publishing” activity is increasingly important to digital humanists

Slide 30

Slide 30 text

Digital Curation and E-Publishing: Libraries Make the Connection Choudhury, Furlough, and Ray 2009 http://bit.ly/maketheconnection

Slide 31

Slide 31 text

Data curation and publishing are mutually reinforcing activities where publishers and libraries can “make the connection”

Slide 32

Slide 32 text

Yes, and ...

Slide 33

Slide 33 text

Data curation itself is legible as a publishing activity

Slide 34

Slide 34 text

curation activities enable data discovery and retrieval, maintain quality, add value, and provide for re-use over time.” “ http://hdl.handle.net/2142/3493

Slide 35

Slide 35 text

Data curation as publishing is not the same as data publication

Slide 36

Slide 36 text

Is Data Publication the Right Metaphor? Parsons and Fox 2013 http://bit.ly/parsonsfox

Slide 37

Slide 37 text

There is ... little emphasis on data discovery and interoperability across systems. Data are often presented as they were created without explicit considerations of data integration or significant reuse” “

Slide 38

Slide 38 text

Adopting only the data publication model expands recognizable publisher and library activities to a new class of scholarly objects but in many ways perpetuates the (problematic) status quo.

Slide 39

Slide 39 text

Little emphasis on data discovery and interoperability across systems” “

Slide 40

Slide 40 text

Data are often presented as they were created without explicit considerations of data integration or significant reuse” “

Slide 41

Slide 41 text

issues such as latency, rapid versioning and reprocessing, and computational demands.” “

Slide 42

Slide 42 text

Data curation as publishing is not the same as data publication

Slide 43

Slide 43 text

Data curation itself is legible as a publishing activity

Slide 44

Slide 44 text

curation activities enable data discovery and retrieval, maintain quality, add value, and provide for re-use over time.” “ http://hdl.handle.net/2142/3493

Slide 45

Slide 45 text

The library and information science meta-science perspective articulated by [Marcia] Bates (1999) has always been fundamental to the role of providing broad, useable information collections and services, especially to support interdisciplinary research.” “ http://bit.ly/reusevalue

Slide 46

Slide 46 text

Data curation as publishing draws directly on the unique skills of librarians (unique value proposition!)

Slide 47

Slide 47 text

This is a type publishing “business” that libraries should be in

Slide 48

Slide 48 text

Data curation as publishing aligns directly with library missions and values

Slide 49

Slide 49 text

Little emphasis on data discovery and interoperability across systems” “

Slide 50

Slide 50 text

curation activities enable data discovery and retrieval, maintain quality, add value, and provide for re-use over time.” “ http://hdl.handle.net/2142/3493

Slide 51

Slide 51 text

Ok, what might this really look like?

Slide 52

Slide 52 text

Digital Curation and E-Publishing: Libraries Make the Connection Choudhury, Furlough, and Ray 2009 http://bit.ly/maketheconnection

Slide 53

Slide 53 text

Open Context eds. Kansa, Kansa, and Deblauwe http://opencontext.org/about/

Slide 54

Slide 54 text

Thank You. Trevor Muñoz [email protected] @trevormunoz

Slide 55

Slide 55 text

Data curation as a “publishing” activity is increasingly important to digital humanists Data curation as publishing is not the same as data publication Data curation as publishing draws directly on the unique skills of librarians Data curation as publishing aligns directly with library missions and values