Reproducibility Needs Reshape Scientific Data Governance
Authors: Paul Meijer, Yousef Aggoune, Madeline Ambrose, Aldan Beaubien, James Harvey, Nicole Howard, Neelima Inala, Ed Johnson, Autumn Kelsey, Melissa Kinsey, Jessica Liang, Paul Mariz, Stark Pister, Sathya Subramanian, Vitalii Tereshchenko, Anne Vetto
Abstract:
Scientific data governance should prioritize maximizing the utility of data throughout the research lifecycle. Research software systems that enable analysis reproducibility inform data governance policies and assist administrators in setting clear guidelines for data reuse, data retention, and the management of scientific computing needs. Proactive analysis reproducibility and data governance are integral and interconnected components of research lifecycle management.
Submitted 29 September, 2024; originally announced October 2024.
Provide Proactive Reproducible Analysis Transparency with Every Publication
Authors: Paul Meijer, Nicole Howard, Jessica Liang, Autumn Kelsey, Sathya Subramanian, Ed Johnson, Paul Mariz, James Harvey, Madeline Ambrose, Vitalii Tereshchenko, Aldan Beaubien, Neelima Inala, Yousef Aggoune, Stark Pister, Anne Vetto, Melissa Kinsey, Tom Bumol, Ananda Goldrath, Xiaojun Li, Troy Torgerson, Peter Skene, Lauren Okada, Christian La France, Zach Thomson, Lucas Graybuck
Abstract:
The high incidence of irreproducible research has led to urgent appeals for transparency and equitable practices in open science. For the scientific disciplines that rely on computationally intensive analyses of large data sets, a granular understanding of the analysis methodology is an essential component of reproducibility. This paper discusses the guiding principles of a computational reproducibility framework that enables a scientist to proactively generate a complete reproducible trace as analysis unfolds, and share data, methods and executable tools as part of a scientific publication, allowing other researchers to verify results and easily re-execute the steps of the scientific investigation.
Submitted 17 August, 2024; originally announced August 2024.