Reproducibility and Statistical Methodology

Anthony Almudevar; Jacob Almudevar

Reproducibility and Statistical Methodology

Anthony Almudevar, Jacob Almudevar

Abstract

In 2015 the Open Science Collaboration (OSC) (Nosek et al 2015) published a highly influential paper which claimed that a large fraction of published results in the psychological sciences were not reproducible. In this article we review this claim from several points of view. We first offer an extended analysis of the methods used in that study. We show that the OSC methodology induces a bias that is able by itself to explain the discrepancy between the OSC estimates of reproducibility and other more optimistic estimates made by similar studies. The article also offers a more general literature review and discussion of reproducibility in experimental science. We argue, for both scientific and ethical reasons, that a considered balance of false positive and false negative rates is preferable to a single-minded concentration on false positive rates alone.

Reproducibility and Statistical Methodology

Abstract

Paper Structure (22 sections, 21 equations, 7 figures, 4 tables)

This paper contains 22 sections, 21 equations, 7 figures, 4 tables.

Introduction
A simple mathematical model for reproducibility
Model definition
What value can we expect effect prevalence $\pi$ to be?
Prevalence $\pi$ for clinical trials
Equipoise
Other interpretation of the reproducibility model
On the use of preliminary data for power analyses
A simple power analysis model
One- versus two-sided tests
Two-stage power calculation
Estimation of effect size
The OSC-RP revisited
Other reproducibility projects
Prevailing views on effect size estimation and power analyses
...and 7 more sections

Figures (7)

Figure 1: Decision tree representation of reproducibility model.
Figure 2: Contour plots of $\beta_u(\eta,t,\alpha, \beta)$ and $\beta_c(\eta,t,\alpha, \beta)$ for fixed nominal type I, II errors $\alpha = 0.05$, $\beta = 0.1$. Contour for $n^*/n = 1$ is superimposed using a dashed line [- - -]. Values $t = z_{\alpha/2}$ and $\eta = z_{\alpha/2} + z_\beta$ are indicated for convenience.
Figure 3: Contour plots of $R^*_u(\eta,t,\alpha, \beta)$ and $R^*_c(\eta,t,\alpha, \beta)$ for fixed nominal type I, II errors $\alpha = 0.05$, $\beta = 0.1$. Contour for $n^*/n = 1$ is indicated by a dashed line [- - -]. Values $t = z_{\alpha/2}$ and $\eta = z_{\alpha/2} + z_\beta$ are indicated for convenience.
Figure 4: Contour plots of $\beta_c(\eta,t,0.05, 0.05)$ and $R^*_c(\eta,t,0.05, 0.05)$. Contour for $n^*/n = 1$ is indicated by a dashed line [- - -]. Values $t = z_{\alpha/2}$ and $\eta = z_{\alpha/2} + z_\beta$ are indicated for convenience.
Figure 5: Maximum likelihood estimate of $\eta$ based on a single observation of $z_p \in [2.0,5.0]$, truncated at $z_p \geq t = z_{0.025}$. The identity is indicated by a dashed line [- - -].
...and 2 more figures

Reproducibility and Statistical Methodology

Abstract

Reproducibility and Statistical Methodology

Authors

Abstract

Table of Contents

Figures (7)