Session Title: 
E-science: Exploring the Librarian's Role 

Location: 
Georgetown East, Concourse Level 

Session Time: 
Sunday, May 23, 2010, 2:45 pm - 4:15 pm 

Control Number: 
08-IS-522-MLA 

Activity: 
Invited Speaker 

Publishing Title: 
Data Curation and Research Librarianship in the Age of E-science 

Author Block: 
Carole L. Palmer, Professor and Director, Center for Informatics Research in Science and Scholarship, Graduate School 
of Library and Information Science, University of Illinois-Urbana-Champaign, Champaign, IL 

Abstract Body: 
Description: Research libraries are evolving in the age of e-science. They are part of the growing, globally distributed 
network of digital information and services that supports the conduct of research. In this information landscape, digital 
data are now recognized as valuable asset--research resources that can be aggregated and integrated across multiple scales 
of size, time, and orders of complexity, and across discipline--to address the grand research challenges facing society. 
Data curation is a new responsibility for our field, but it is not ours alone. Our contributions need to be developed in 
concert with related activities in the scientific disciplines and in collaboration with scientists and technologists whose 
knowledge and expertise are essential in creating large-scale, high-performance information systems, where data are 
shared and linked to the literature. A new initiative in our program of e-science research and education is the Data 
Conservancy (DC), one of the first two National Science Foundation DataNet awards. Led by Sayeed Choudhury at Johns 
Hopkins University Library, DC is an international group of uniquely qualified domain scientists, information and 
computer scientists, librarians, and engineers. We are designing and implementing an integrated data curation strategy and 
infrastructure to address the urgent need to collect, organize, validate, and preserve data to support scientific inquiry. As a 
model for the future research library, the DC has a broad purview, covering astronomy, biology, earth science, and social 
science, and is working toward cross-disciplinary solutions. Research aims include a data model for observational data 
and a general framework for data collection identity and description. Technical development will be informed by a 
systematic comparative analysis of data practices and curation requirements across the research communities served by 
DC. Metadata and ontologies will be central to the organization and functionality of the repository. For example, the Open 
Archives Initiative, Object Reuse and Exchange protocol will be applied for linking data to associated literature and 
tracking provenance, and existing taxonomies and ontologies in the life sciences will be integrated to organize species 
information to support queries related to climate change. To strengthen the data curation workforce, DC educational 
initiatives will build on the biological information specialist master's degree in bioinformatics and the data curation 
specialization in the master's of science in library and information science, begun at Illinois in 2006. The "Summer 
Institute in Data Curation" will be extended to more in-service professionals, internship and mentorship options for 
students will be expanded, and educational materials will be coordinated and shared among the full set of DataNet 
partners. 

See the interactive content here