National Academies Press: OpenBook

Life-Cycle Decisions for Biomedical Data: The Challenge of Forecasting Costs (2020)

Chapter: Appendix B: Active Data Management Plans as a Planning Tool

« Previous: Appendix A: Meetings and Presentations
Suggested Citation:"Appendix B: Active Data Management Plans as a Planning Tool." National Academies of Sciences, Engineering, and Medicine. 2020. Life-Cycle Decisions for Biomedical Data: The Challenge of Forecasting Costs. Washington, DC: The National Academies Press. doi: 10.17226/25639.
×

B

Active Data Management Plans as a Planning Tool

A data management plan (DMP) is a formal document that outlines data types and formats, dissemination and sharing plans, roles and responsibilities, and preservation plans for data generated by a project. Various federal research funders, including the National Science Foundation (NSF) and various institutes of the National Institutes of Health (NIH) require grantees to propose a DMP, but specific requirements are not uniform among the agencies. Quality and utility of DMPs are an issue, with the perception that DMPs are an “annoying administrative exercise” (Simms et al., 2017). The typical DMP is a text document written as a verbose narrative.

Various guidance documents and templates for DMP production evolved (e.g., the National Network of Libraries of Medicine [NNLM]1), and a first generation of tools to facilitate the production of DMPs was created, and are widely available. Some examples of tools include DMPTool (California Digital Libraries),2 the DMPOnline (Digital Curation Centre—UK),3 and the Interdisciplinary Earth Data Alliance (IEDA) DMP Tool.4 Generally, those tools guided the creation of Microsoft Word or PDF documents and incorporated templates for use with various funders. More recently, a second generation of those tools was created, with the goal of making DMPs “machine actionable” (i.e., written so that computer programs can parse the content cleanly and take action based on the information in such machine-actionable DMPs [maDMPs]). Newer DMP tools incorporate richer information, such as a list of acceptable repositories, and better guidance, with richer prefillable information.

While the creation of a DMP guides the researcher in formulating a data management process, the role of the DMP in the evaluation of grant proposals and in post-award evaluations is less clear. Most grant agencies require a DMP but do not explicitly score DMPs or integrate them formally into scoring a proposal under consideration for funding. For instance, NIH scoring guidelines5 make no reference to DMPs, although data-sharing plans are a required element of proposals (in fact, the word “data” does not appear in the scoring guide). The NSF-wide

___________________

1 The NNLM website providing a collection of data management guides is https://nnlm.gov/data/data-management-plan, accessed January 14, 2020.

2 The website for DMPTool is https://dmptool.org/, accessed January 14, 2020.

3 The website for DMPOnline is https://dmponline.dcc.ac.uk/, accessed January 14, 2020.

4 The website for the IEDA DMP is https://www.iedadata.org/dmp/, accessed January 14, 2020.

5 The website describing the NIH scoring guidance is https://grants.nih.gov/grants/policy/review/rev_prep/scoring.htm, accessed January 14, 2020.

Suggested Citation:"Appendix B: Active Data Management Plans as a Planning Tool." National Academies of Sciences, Engineering, and Medicine. 2020. Life-Cycle Decisions for Biomedical Data: The Challenge of Forecasting Costs. Washington, DC: The National Academies Press. doi: 10.17226/25639.
×

Proposal and Award Policies and Procedures Guide6 notes a requirement for (two-page) DMPs but only as a supplementary material, leaving the evaluation thereof in the realm of individual divisions and program managers. The IEDA, acting as a repository for observational geoscience data, provides researchers with the ability to generate a Data Compliance Report7 based on their NSF award number. The resulting output can be used “to demonstrate that your data are registered with IEDA systems and you are compliant with NSF Data Policies,” but it is not clear how much weight that carries in post-award evaluation or in subsequent proposals. The NSF Directorate for Biological Sciences notes a requirement for inclusion in annual and final reports, as well as part of “Results of prior NSF Support” in subsequent proposals, but, again, it is not clear what weight these are given during the evaluation process.8 The Canadian “Tri-Agency Statement of Principles on Digital Data Management”9 highlights researcher and institution responsibilities only with respect to the development of and compliance with DMPs, and the Canadian Institutes of Health Research’s Peer Review Manual10 does not include DMPs in its scoring criteria.

Halbert (2013) and Keralis and colleagues (2013) identify lack of consistency across funding agencies as a barrier for a consistent response by researchers and data librarians to data management challenges (see also Williams et al., 2017). The development of the second generation of DMP tools may be seen as a direct result of these findings and the attempt to construct a “meta-DMP” that provides consistent guidance regardless of the underlying agency reporting requirement.

maDMPs, also referred to as dynamic DMPs (Simms and Jones, 2017; Simms et al., 2017) or data management records (Morgan and Janke, 2017), may be useful for forecasting of costs of data preservation. They are specifically proposed as a more formal (machine-readable) document, allowing for data exchange across various entities, in particular across the entire data life cycle. Integration with funders, as well as institutional and community capacity planning, are specifically identified (Simms et al., 2017). maDMPs are evolving, and a standard has not yet emerged, although several use cases and implementations (Morgan and Janke, 2017) exist. At the time of writing, working groups at the Research Data Alliance11 and FORCE1112 (Chodacki et al., 2016) are working on use cases from a variety of disciplines and coordinating on standards. In particular, maDMPs target metadata such as quantity and type of data, regardless of storage location, allowing for an assessment of time-varying cost of storing such data. They strongly encourage use of persistant identifiers for people, institutions, and assets, so that maDMPs are globally intelligible (Bakos et al., 2018).

REFERENCES

Bakos, A., T. Miksa, and A. Rauber. 2018. Research data preservation using process engines and machine-actionable data management plans. Digital Libraries for Open Knowledge. https://doi.org/10.1007/978-3-030-00066-0_6.

Chodacki, J., M. Crosas, M. Martone, and S.-A. Sansone. 2016. FAIR DMP. FORCE11, May 3. https://www.force11.org/group/fairdmp.

Halbert, M. 2013. The problematic future of research data management: Challenges, opportunities and emerging patterns identified by the DataRes Project. International Journal of Digital Curation. 8(2):111-122. https://doi.org/10.2218/ijdc.v8i2.276.

Keralis, S.D.C., S. Stark, M. Halbert, and W.E. Moen. 2013. Research data management in policy and practice: The DataRes Project. In Research Data Management: Principles, Practices, and Prospects, 16-38. Council on Library and Information Resources. https://libres.uncg.edu/ir/uncg/f/M_Halbert_Research_2013.pdf.

Morgan, H., and A. Janke. 2017. DMRs, making DMPs relevant again. May 22. http://andscentral.blogspot.com/2017/05/dmrs-making-dmps-relevant-again.html.

___________________

6 The NSF Proposal and Award Policies and Procedures Guide can be found at https://www.nsf.gov/publications/pub_summ.jsp?ods_key=pappg&WT.z_pims_id=0, accessed January 14, 2020.

7 The IEDA data compliance reporting tool can be found at http://app.iedadata.org/dcr/report.php, accessed January 14, 2020.

8 The website for the NSF Directorate for Biological Sciences updated information about DMP is https://www.nsf.gov/bio/pubs/BIODMP061511.pdf, accessed January 14, 2020.

9 The Government of Canada website with the 2015 Tri-Agency Statement of Principles on Digital Data Management is http://science.gc.ca/eic/site/063.nsf/eng/h_83F7624E.html, accessed January 14, 2020.

10 The Canadian Institutes for Health Research Peer Review Manual can be found at https://cihr-irsc.gc.ca/e/49564.html, accessed January 14, 2020.

11 The website for the Data Research Alliance is https://www.rd-alliance.org/, accessed January 14, 2020.

12 The website for FORCE11 is https://www.force11.org/, accessed January 14, 2020.

Suggested Citation:"Appendix B: Active Data Management Plans as a Planning Tool." National Academies of Sciences, Engineering, and Medicine. 2020. Life-Cycle Decisions for Biomedical Data: The Challenge of Forecasting Costs. Washington, DC: The National Academies Press. doi: 10.17226/25639.
×

Simms, S., and S. Jones. 2017. Next-generation data management plans: Global, machine-actionable, FAIR. International Journal of Digital Curation 12(1):36. https://doi.org/10.2218/ijdc.v12i1.513.

Simms, S., S. Jones, D. Mietchen, and T. Miksa. 2017. Machine-actionable data management plans (maDMPs). Research Ideas and Outcomes 3:e13086.

Williams, M., J. Bagwell, and M.N. Zozus. 2017. Data management plans: The missing perspective. Journal of Biomedical Informatics 71(July):130-142.

Suggested Citation:"Appendix B: Active Data Management Plans as a Planning Tool." National Academies of Sciences, Engineering, and Medicine. 2020. Life-Cycle Decisions for Biomedical Data: The Challenge of Forecasting Costs. Washington, DC: The National Academies Press. doi: 10.17226/25639.
×
Page 137
Suggested Citation:"Appendix B: Active Data Management Plans as a Planning Tool." National Academies of Sciences, Engineering, and Medicine. 2020. Life-Cycle Decisions for Biomedical Data: The Challenge of Forecasting Costs. Washington, DC: The National Academies Press. doi: 10.17226/25639.
×
Page 138
Suggested Citation:"Appendix B: Active Data Management Plans as a Planning Tool." National Academies of Sciences, Engineering, and Medicine. 2020. Life-Cycle Decisions for Biomedical Data: The Challenge of Forecasting Costs. Washington, DC: The National Academies Press. doi: 10.17226/25639.
×
Page 139
Next: Appendix C: Identifying Salary Ranges for Jobs Relevant to the Data Life Cycle »
Life-Cycle Decisions for Biomedical Data: The Challenge of Forecasting Costs Get This Book
×
 Life-Cycle Decisions for Biomedical Data: The Challenge of Forecasting Costs
Buy Paperback | $75.00 Buy Ebook | $59.99
MyNAP members save 10% online.
Login or Register to save!
Download Free PDF

Biomedical research results in the collection and storage of increasingly large and complex data sets. Preserving those data so that they are discoverable, accessible, and interpretable accelerates scientific discovery and improves health outcomes, but requires that researchers, data curators, and data archivists consider the long-term disposition of data and the costs of preserving, archiving, and promoting access to them.

Life Cycle Decisions for Biomedical Data examines and assesses approaches and considerations for forecasting costs for preserving, archiving, and promoting access to biomedical research data. This report provides a comprehensive conceptual framework for cost-effective decision making that encourages data accessibility and reuse for researchers, data managers, data archivists, data scientists, and institutions that support platforms that enable biomedical research data preservation, discoverability, and use.

READ FREE ONLINE

  1. ×

    Welcome to OpenBook!

    You're looking at OpenBook, NAP.edu's online reading room since 1999. Based on feedback from you, our users, we've made some improvements that make it easier than ever to read thousands of publications on our website.

    Do you want to take a quick tour of the OpenBook's features?

    No Thanks Take a Tour »
  2. ×

    Show this book's table of contents, where you can jump to any chapter by name.

    « Back Next »
  3. ×

    ...or use these buttons to go back to the previous chapter or skip to the next one.

    « Back Next »
  4. ×

    Jump up to the previous page or down to the next one. Also, you can type in a page number and press Enter to go directly to that page in the book.

    « Back Next »
  5. ×

    Switch between the Original Pages, where you can read the report as it appeared in print, and Text Pages for the web version, where you can highlight and search the text.

    « Back Next »
  6. ×

    To search the entire text of this book, type in your search term here and press Enter.

    « Back Next »
  7. ×

    Share a link to this book page on your preferred social network or via email.

    « Back Next »
  8. ×

    View our suggested citation for this chapter.

    « Back Next »
  9. ×

    Ready to take your reading offline? Click here to buy this book in print or download it as a free PDF, if available.

    « Back Next »
Stay Connected!