National Academies Press: OpenBook

For Attribution: Developing Data Attribution and Citation Practices and Standards: Summary of an International Workshop (2012)

Chapter: 16- Data Center-Library Cooperation in Data Publication in Ocean Science

Suggested Citation:"16- Data Center-Library Cooperation in Data Publication in Ocean Science." National Research Council. 2012. For Attribution: Developing Data Attribution and Citation Practices and Standards: Summary of an International Workshop. Washington, DC: The National Academies Press. doi: 10.17226/13564.

16- Data Center-Library Cooperation in Data Publication in Ocean Science

Roy Lowry¹
British Oceanographic Data Center

Let me start by providing some background information about the key players in the partnership that has come together to foster data publication in the ocean sciences. The Scientific Committee on Oceanic Research (SCOR) is an international non-governmental organization formed by the International Council of Scientific Unions (ICSU, now the International Council for Science) in 1957. The Committee has scientists from 36 countries participating in different working groups and steering committees. It promotes international cooperation through planning and conducting oceanographic research, and solving methodological and conceptual problems that hinder research.

The second partner organization is the International Oceanographic Data and Information Exchange (IODE), a data and information exchange program of UNESCO’s Intergovernmental Oceanographic Commission (IOC) that was established in 1961. The main goal of this program is to establish national oceanographic data centers or coordinators in IOC member states in order to acquire, enhance, and exchange oceanographic data and information. It also aims to extend the national oceanographic data center network through training and capacity building.

The last player in this partnership is the Marine Biological Laboratory Woods Hole Oceanographic Institution (MBLWHOI) Library. The Woods Hole scientific community library has a strong interest in data publication in digital libraries. The Digital Library Archive (DLA) contains:

•  WHOI archives;

•  Historical photographs and oceanographic instruments;

•  Scientific data, e.g., echo sounding records from WHOI research vessel expeditions;

•  Technical report collections; and

•  Maps, nautical charts, geologic and bathymetric maps, and cruise tracks.

The group held a series of meetings between June 2008 and April 2010, and another meeting is scheduled for November 2011. The group’s objectives are to:

______________________

1 Presentation given by Sarah Callaghan; slides are available at http://sites.nationalacademies.org/PGA/brdi/PGA_064019.

•  Engage the IODE data center and marine library communities in data publication issues.

•  Provide a network of hosts for cited data.

•  Motivate scientists through reward for depositing data in data centers.

•  Promote scientific clarity and re-use of data.

However, engaging IODE data centers effectively in data publication runs into a problem: data centers and digital libraries follow different approaches to data distribution. The first model, the one followed by data centers, is as follows.

Data can change significantly as additional value is added by the data center through metadata generation, quality control (e.g., flagging outliers), and the like.

The “best available” data are served by the data center to other users during data evolution, which means that the dataset is continually changing with no snapshots preserved or formal versioning during work-up. This makes it difficult to go back and get the same data that you got a year or six months ago.

The second model is the Digital Library Paradigm.

A dataset is a “bucket of bytes” that is:

•  Fixed (its checksum should be a metadata item)

•  Versioned, so that any change generates a new version of the dataset

•  Persistent, so that previous versions remain available

•  Accessible online via a permanent identifier

•  Usable on a decadal timescale (using standards such as the Open Archival Information System, OAIS)

•  Citable in the scientific literature to provide links to marine libraries

•  Discoverable
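The properties listed above can be illustrated with a minimal sketch in Python. It is not the design of any system described in this talk: the class names, the `base_id.vN` identifier scheme, and the choice of SHA-256 for the checksum are all assumptions made for illustration. The sketch shows a store in which each published byte sequence is fixed, carries its checksum as metadata, and is never overwritten when the data change.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass(frozen=True)
class DatasetVersion:
    """One fixed 'bucket of bytes' (all field names are hypothetical)."""
    identifier: str  # assumed identifier scheme: "<base_id>.v<version>"
    version: int
    checksum: str    # SHA-256 hex digest, kept as a metadata item
    payload: bytes


@dataclass
class DatasetStore:
    """In-memory stand-in for a repository that never discards versions."""
    _versions: Dict[str, List[DatasetVersion]] = field(default_factory=dict)

    def publish(self, base_id: str, payload: bytes) -> DatasetVersion:
        history = self._versions.setdefault(base_id, [])
        checksum = hashlib.sha256(payload).hexdigest()
        # Identical bytes are the same version; nothing new is minted.
        if history and history[-1].checksum == checksum:
            return history[-1]
        # Any change generates a new version; earlier ones stay in the list.
        version = len(history) + 1
        record = DatasetVersion(f"{base_id}.v{version}", version, checksum, payload)
        history.append(record)
        return record

    def get(self, base_id: str, version: int) -> DatasetVersion:
        record = self._versions[base_id][version - 1]
        # Verify fixity before handing the bytes back.
        if hashlib.sha256(record.payload).hexdigest() != record.checksum:
            raise ValueError(f"checksum mismatch for {record.identifier}")
        return record
```

With a store like this, a user who retrieved version 1 a year ago can ask for version 1 again and receive the same bytes, which is exactly what the "best available" data center model cannot guarantee.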

To summarize these data distribution paradigm issues, the problem is to find ways for IODE data centers to engage in digital library practices while leaving current infrastructure largely intact. Change should happen gradually through evolution and not revolution. Probably the best way to do that is through pilot projects at the British Oceanographic Data Center (BODC) and WHOI.

To that end, BODC has started a pilot project, beginning with a decision to establish a repository at IODE called Published Ocean Data (POD), where data will be accessible to many data centers, with technical quality control and good long-term stewardship credentials in place. The process has taken longer than anticipated because of extended discussions and limited resource availability. However, specifications are now being produced and accepted, and the actual building of the systems will start in the fall of 2011.

As for the WHOI pilot project, the MBLWHOI Library has loaded a number of datasets from the National Science Foundation’s (NSF) Biological and Chemical Oceanography Data Management Office (BCO-DMO). The datasets have been associated with published journal articles. For example, dx.doi.org/10.1575/1912/4199 resolves to https://darchive.mblwhoilibrary.org/handle/1912/4199.
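The relationship between a DOI and the address it resolves to can be sketched in a few lines of Python. The function names are hypothetical, and the `https://` scheme on the resolver is an assumption. The rule that the DOI suffix doubles as the repository handle is inferred only from the single example quoted above and may not hold for other records. The sketch builds the URLs without contacting any server.

```python
from typing import Tuple

# Resolver host as cited in the text; the https scheme is an assumption.
RESOLVER = "https://dx.doi.org"
# Repository handle base as cited in the text.
REPOSITORY = "https://darchive.mblwhoilibrary.org/handle"


def split_doi(doi: str) -> Tuple[str, str]:
    """Split a DOI into its registrant prefix and its suffix."""
    prefix, sep, suffix = doi.partition("/")
    if not prefix.startswith("10.") or not sep or not suffix:
        raise ValueError(f"not a DOI: {doi!r}")
    return prefix, suffix


def resolver_url(doi: str) -> str:
    """URL a reader would follow from a citation."""
    split_doi(doi)  # validate only
    return f"{RESOLVER}/{doi}"


def handle_url(doi: str) -> str:
    """Landing page, ASSUMING the DOI suffix equals the repository handle."""
    _, suffix = split_doi(doi)
    return f"{REPOSITORY}/{suffix}"


print(resolver_url("10.1575/1912/4199"))
print(handle_url("10.1575/1912/4199"))
```

The point of the indirection is that a citation carries only the DOI; if the repository later moves, the resolver record is updated and published citations remain valid.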

The group is also working with a scientist who is submitting a paper to the American Geophysical Union in September. The paper will serve as a use case for the complete publishing process, including the assignment of DOIs to the datasets supporting specific figures. These dataset citations will be incorporated in the final version of the paper, subject to publisher approval. Furthermore, talks are underway about including the Woods Hole Open Access Server (WHOAS) repository in the data management plan of an NSF proposal. Finally, the partnership plans to collaborate with BCO-DMO to develop an automated publication system for all data center accessions.

Let me conclude with a summary of our future plans. We will:

•  Complete the pilot projects identified earlier.

•  Engage other data centers in data publication through reporting our experiences and disseminating knowledge through appropriate routes, such as workshops, conferences and other publications.

•  Engage SeaDataNet II when it starts later in 2011.

•  Continue outreach activities to scientific, data management, and marine library communities.

•  Hold a further meeting in Liverpool, UK, on November 3-4, 2011.

•  Expand BODC activities into an operational service.

•  Develop the MBLWHOI Library BCO-DMO ingest system.

For Attribution: Developing Data Attribution and Citation Practices and Standards: Summary of an International Workshop

The growth of electronic publishing of literature has created new challenges, such as the need for mechanisms for citing online references in ways that can assure discoverability and retrieval for many years into the future. The growth in online datasets presents related, yet more complex, challenges. Meeting them depends upon the ability to reliably identify, locate, access, interpret, and verify the version, integrity, and provenance of digital datasets. Data citation standards and good practices can form the basis for increased incentives, recognition, and rewards for scientific data activities that are currently lacking in many fields of research. The rapidly expanding universe of online digital data holds the promise of allowing peer examination and review of conclusions or analysis based on experimental or observational data, the integration of data into new forms of scholarly publishing, and the ability for subsequent users to make new and unforeseen uses and analyses of the same data, either in isolation or in combination with other datasets.

The problem of citing online data is complicated by the lack of established practices for referring to portions or subsets of data. A number of initiatives are already underway in different organizations, countries, and disciplines. An important set of technical and policy approaches has already been launched by the U.S. National Information Standards Organization (NISO) and other standards bodies regarding persistent identifiers and online linking.

The workshop summarized in For Attribution -- Developing Data Attribution and Citation Practices and Standards: Summary of an International Workshop was organized by a steering committee under the National Research Council's (NRC's) Board on Research Data and Information, in collaboration with an international CODATA-ICSTI Task Group on Data Citation Standards and Practices. The purpose of the symposium was to examine a number of key issues related to data identification, attribution, citation, and linking to help coordinate activities in this area internationally, and to promote common practices and standards in the scientific community.
