Introduction to Computational Reproducibility (...

January 03, 2017

440

Introduction to Computational Reproducibility (and why we care)

Please cite as:
Barba, Lorena A. (2017): Introduction to Computational Reproducibility (and why we care). figshare.
https://dx.doi.org/10.6084/m9.figshare.4509419.v1

Introduction & motivation on the first day of the workshop "Essential skills for reproducible research computing," at Universidad Técnica Federico Santa María (January 2017):
https://barbagroup.github.io/essential_skills_RRC/

[slide 7]
The American Physical Society (APS) in its Ethics & Values document (1999) explains their position on "What is Science?" It says: "The success and credibility of science are anchored in the willingness of scientists to […] Expose their ideas and results to independent testing and replication by others. This requires the open exchange of data, procedures and materials."

[slide 8]
The journal Nature published a News Feature article on October 2010 discussing the failings of computational science: http://www.nature.com/news/2010/101013/full/467775a.html
The article mentions that coding problems can sometimes cause substantial harm, and have forced some scientists to retract papers. It tells the story of a structural-biology group at Scripps Institute, led by Geoffrey Chang ... in 2006, the team realized that a code they were using had a sign error, which reversed two columns of data, causing their protein structures to be completely wrong.

[slide 9]
Screenshot from Science, 22 December 2006.
Chang and co-authors were forced to retract five papers published in Science, the Journal of Molecular Biology and the Proceedings of the National Academy of Sciences, between the years 2001 and 2005.

[slide 10]
Quote from Nature (2010):
As a general rule, researchers do not test or document their programs rigorously, and they rarely release their codes, making it almost impossible to reproduce and verify published results generated by scientific software …

[slide 11]
Quote from Nature (2010):
"There are terrifying statistics showing that almost all of what scientists know about coding is self-taught," says Wilson. "They just don't know how bad they are."
The Nature piece quotes Greg Wilson, leader of the “Software Carpentry” workshop series ... he ran an online survey in 2008 of 2,000 researchers working with computers in one way or another.
– only 47% of scientists have good understanding of software testing
– only 34% of scientists think that formal training in developing software is important
– 38% of scientists spend at least 1/5 of their time developing software

[slide 12]
And it continues to happen. A paper in the Journal of Clinical Oncology, published online in January 2016 (March 2016 in print) contained analysis that mislabeled a data in a column, affecting how a substantial set of clinical results from 1990 to 2008 entered into the conclusions. Some of the conclusions were incorrect and the paper had to be retracted.

The principal investigator said that the coding error was made by a doctoral student, but gave no specifics.

You could say that this is just bad luck, that the PI can’t really have avoided this, mistakes happen, etc. But the fact is that there are engineering practices to ensure quality of research software that could have prevented this: these practices are part of what we call “Reproducible Research” and include version control, code reviews, code testing, study replication, and others.
See Retraction Watch: http://retractionwatch.com/2016/09/26/error-in-one-line-of-code-sinks-2016-seer-cancer-study/

[slide 13]
Screenshot from The New York Times: "The Excel Depression"
Two economists at Harvard University, Carmen Reinhart and Kenneth Rogoff, published a study in 2010 titled “Growth in a time of Debt,” suggesting a negative effect on growth from the national debt. It appeared in a non peer-reviewed issue of the American Economic Review.
The main conclusion was that average annual growth was –0.1 % in countries with episodes of gross government debt equal to 90 % or more of GDP between 1945 and 2009.
The Reinhart-Rogoff study came out just after Greece fell into crisis, and it was widely cited by fiscal-conservative politicians to call for austerity measures.
Nobel-prize winner Paul Krugman called it “the most influential economic analysis of recent years.”
Critics of the article rightly pointed out that it could be a case of “reverse causation,” that is, it is not the debt that impacts negatively on growth, but that low growth leads to high debt.
Soon, a more serious problem—other researchers tried to replicate the Reinhart-Rogoff study with similar data, but could not reach a similar finding.

[slide 14]
Screenshots from Business Insider and The Wall Street Journal.
University of Massachusetts graduate student Thomas Herndon started a replication exercise for an econometrics term paper. After repeated failures to replicate, he approached Reinhart and Rogoff to ask for their data and their spreadsheet, and they provided it.
Herndon found that 5 out of 20 countries had been left out of the calculation, due to a botched formula in the spreadsheet.

[slide 15]
Screenshot from Genome Biology.
A spreadsheet error was not the only problem in the Reinhart and Rogoff study—there was omission of some countries from the analysis and questionable statistical analysis to boot.
But in other fields, Microsoft Excel has wreaked havoc.
In genomics, researchers estimate that 1 out of 5 publications using Excel for gene lists contain errors. The problem here is that Excel automatically converts some gene names to other formats, like dates or floating point numbers.
The gene SEPT2 (Septin 2) gets converted to the date 2-Sept, and the identifier ‘2310009E13’ gets converted to a floating-point number of order 10 to the 13th power.
See Genome Biology: http://genomebiology.biomedcentral.com/articles/10.1186/s13059-016-1044-7

[slide 16]
Screenshot of a tweet by Philip B. Stark, Professor of Statistics, University of California Berkeley:
https://twitter.com/philipbstark/status/498683914592862208

[slide 17]
One of the first milestones of the Reproducibility movement was the “Yale Roundtable,” which resulted in a jointly-authored “Data and Code Sharing Declaration.”
About 30 experts got together ... their fields: computer science, applied mathematics, law, biostatistics, information sciences, astronomy, biochemistry.

[slide 19]
Screenshot from Science magazine.
On December 2011, Science had a special issue on Replication & Reproducibility.

[slide 20]
Screenshot from R. Peng's article in that issue.
The standard of reproducibility calls for the data and the computer code used to analyze the data be made available to others.
... aim of the reproducibility standard is to fill the gap in the scientific evidence-generating process between full replication of a study and no replication
... a study may be more or less reproducible than another depending on what data and code are made available
... A critical barrier to reproducibility in many cases is that the computer code is no longer available.

[slide 25]
We aim to carry out all research with attention to reproducibility, making all research code open-source and publishing data, plotting scripts, figures and cite our figshare repository when including the figure in the paper ... all in the aim of facilitating reproducibility of our results. We include a reproducibility statement in the papers.

[slide 27]
We created this course to share what we’ve learned from years of thinking about reproducibility in computational science.
I provides an introduction to the tools and techniques that we consider fundamental for responsible use of computers in scientific research.

[slide 28]
Syllabus of the workshop

Lorena A. Barba

January 03, 2017

More Decks by Lorena A. Barba

See All by Lorena A. Barba

Design for Reproducibility

labarba

610

Science Reproducibility Taxonomy

labarba

400

Just Do It: Reproducible Research in CFD

labarba

240

How to run a lab for reproducible research

labarba

1.1k

A short lecture on Open Licensing

labarba

830

Pedagogical Purpose of Open Sharing

labarba

1.6k

A pathway in Open Teaching

labarba

290

Engineering Gender Balance

labarba

2.2k

Plagiarism explainer for students

labarba

860

Other Decks in Education

See All in Education

Visionary Initiative: Materials-Positive Society — Evolving “Things,” empowering a positive society | Science Tokyo

sciencetokyo

PRO

130

Soluciones al examen de Geografía 2026. JUNIO (Convocatoria Ordinaria)

juanmartin2026

6.5k

Data Management and Analytics Specialisation

signer

PRO

1.9k

[2026前期火５] 論理学（京都大学文学部前期第3回）「形式言語と四つのキーワード：メタ・構成・意味論・ハーモニー」

yatabe

600

DECADE_ゴルフ_コースマネジメント完全ガイド.pdf

ozekinote

120

[2026前期火５] 論理学（京都大学文学部前期第2回）「論理的な正しさはどこにあるのか」

yatabe

AI-Based Speaking Assessment of a Short-Term Study Abroad Program

uranoken

370

Visionary Initiative: Materials-Positive Society 「モノの進化をポジティブな社会の原動力に」｜Science Tokyo（東京科学大学）

sciencetokyo

PRO

710

[2026前期火５] 論理学（京都大学文学部前期第4回）「ならば（→）の導入と証明ネット」

yatabe

510

[2026前期火５] 論理学（京都大学文学部前期第14回）「計算は、証明ではない——ハルシネーションを三層ハーモニーで診る」

yatabe

100

2026年度春学期　統計学　第9回　確からしさを記述するー確率 (2026. 5. 28)

akiraasano

PRO

150

2026年度春学期　統計学　第5回　分布をまとめるー記述統計量（平均・分散など） (2026. 5. 7)

akiraasano

PRO

210

Featured

See All Featured

ラッコキーワードサービス紹介資料

rakko

Improving Core Web Vitals using Speculation Rules API

sergeychernyshev

1.5k

HDC tutorial

michielstock

750

The World Runs on Bad Software

bkeepers

PRO

12k

How to train your dragon (web standard)

notwaldorf

6.7k

Bootstrapping a Software Product

garrettdimon

PRO

307

120k

The Impact of AI in SEO - AI Overviews June 2024 Edition

aleyda

1.1k

Distributed Sagas: A Protocol for Coordinating Microservices

caitiem20

333

23k

Evolving SEO for Evolving Search Engines

ryanjones

240

Designing Dashboards & Data Visualisations in Web Apps

destraynor

231

55k

Code Review Best Practice

trishagee

20k

Agile Leadership in an Agile Organization

kimpetersen

PRO

190

Transcript

Universidad Técnica Federico Santa María Valparaíso, 3 January 2017 Introduction
to Computational Reproducibility (and why we care) Prof. Lorena A. Barba Mechanical and Aerospace Engineering Department  The George Washington University @LorenaABarba
Acknowledgements NSF CAREER award NVIDIA CUDA Fellows Program
About us
http://lorenabarba.com
“Essential skills for reproducible research computing” Universidad Técnica Federico Santa
María First week of January 2017 A Barba-group workshop for graduate students https://barbagroup.github.io/essential_skills_RRC/
with Barba-group members: Gilbert Forsyth @gforsyth @gilforsyth Natalia Clementi @ncclementi
@ncclementi
What is Science? ‣ American Physical Society: - Ethics and
Values, 1999 "The success and credibility of science are anchored in the willingness of scientists to […] Expose their ideas and results to independent testing and replication by others. This requires the open exchange of data, procedures and materials." https://www.aps.org/policy/statements/99_6.cfm
None
None
As a general rule, researchers do not test or document
their programs rigorously, and they rarely release their codes, making it almost impossible to reproduce and verify published results generated by scientific software … 14 OCTOBER 2010 | VOL 467 | NATURE | 775
QUOTE: "There are terrifying statistics showing that almost all of
what scientists know about coding is self-taught," says Wilson. "They just don't know how bad they are." 14 OCTOBER 2010 | VOL 467 | NATURE | 775
None
None
None
None
None
2009 Yale Data and Code Sharing Roundtable ‣ 14 contributed
thought pieces ‣ “Data and Code Sharing Declaration”  ... demanding a resolution to the credibility crisis from the lack of reproducible research in computational science. SEPT/OCT 2010 | COMPUTING IN SCIENCE AND ENGINEERING
Practicing safe software ... ‣ Use a version-control system ‣
Track your materials ‣ Write testable software ‣ Test the software ‣ Encourage sharing of software 14 OCTOBER 2010 | VOL 467 | NATURE | 775
None
None
http://icerm.brown.edu/tw12-5-rcem/
http://lorenabarba.com/gallery/reproducibility-pi-manifesto/
‣ I teach my graduate students about reproducibility ‣ All
our research code (and writing) is under version control ‣ We always carry out veriﬁcation & validation (and make them public) ‣ For main results, we share data, plotting script & ﬁgure under CC-BY ‣ We upload preprint to arXiv at the time of submission to a journal ‣ We release code at the time of submission of a paper to a journal ‣ We add a “Reproducibility” declaration at the end of each paper ‣ I develop a consistent open-science policy & keep an up-to-date web presence Reproducibility PI Manifesto
None
None
Why does it matter? We use computers to create scientiﬁc
knowledge.
“Essential skills for reproducible research computing”
A syllabus for research computing 1. command line utilities in
Unix/Linux 2. an open-source scientiﬁc software ecosystem (our favorite is Python's) 3. software version control (we advocate the distributed kind: our favorite is git) 4. good practices for scientiﬁc software development: code hygiene and testing 5. knowledge of licensing options for sharing software https://barbagroup.github.io/essential_skills_RRC/