Giving them what they want: Using Data Curation Profiles to guide Datastar development

Author(s)
First Name: 
Sarah
Last Name: 
Wright
Affiliation: 
Cornell University
First Name: 
Wendy
Last Name: 
Kozlowski
Affiliation: 
Cornell University
First Name: 
Dianne
Last Name: 
Dietrich
Affiliation: 
Cornell University
First Name: 
Huda
Last Name: 
Khan
Affiliation: 
Cornell University
First Name: 
Leslie
Last Name: 
McIntosh
Affiliation: 
Washington University at St. Louis
First Name: 
Gail
Last Name: 
Steinhart
Affiliation: 
Cornell University
Keywords: 
data registry; research data; publication; access; discovery
Track: 
General conference
Poster
Abstract: 

The Datastar project, which first received NSF funding in 2007, is one of Cornell University Library’s (CUL) efforts in the area of digital data discovery (Steinhart 2010; Dietrich 2010; Khan 2011). Originally envisioned as a “data staging repository,” where researchers could upload data, create minimal metadata, and share data with selected colleagues, Datastar has been reconceived as a data registry to support discovery of datasets. Datastar is now being developed in partnership with Washington University at St. Louis (WUSTL) as a data registry that can be used either as a standalone tool or in conjunction with VIVO, Cornell’s open source semantic web application for research and scholarship networks. These more recent efforts were funded by the U.S. Institute of Museum and Library Services.
With the new focus, we wanted to ensure development decisions were driven by real user needs. To that end, we used the Data Curation Toolkit (http://datacurationprofiles.org/) to conduct interviews and create a set of Data Curation Profiles (DCPs) with participants selected at CU and WUSTL (Witt and others 2009). Designed to elicit information needs associated with a data set or collection, the structured interviews provided a general framework for discussing data with researchers and to create a profile covering the stages of the data lifecycle. Participants from a broad range of disciplines were invited to participate, and eight completed interviews.
After evaluation and prioritization of the findings from the interviews, a set of particularly relevant responses emerged. These highly relevant responses had a direct influence on Datastar development. In contrast, some findings from the DCPs were out of scope for the current iteration of Datastar, but they were still interesting because they helped to provide additional details about the ways researchers prefer to interact with their data. Although not all of the findings will result in functionalities in current or later iterations of Datastar, they will almost certainly inform future development. Using the DCPs allowed us to focus Datastar development efforts on addressing real user needs, saving wasted effort. This presentation will discuss both the results of our interviews and the development efforts that were prioritized as a result of our findings.

AttachmentSize
DatastarPoster_OR2013_Final.docx18.68 KB