Cancer Biology

Reproducibility in Cancer Biology: Challenges for assessing replicability in preclinical cancer biology

Center for Open Science, United States
Science Exchange, United States
University of Virginia, United States

Dec 7, 2021

Open access
Copyright information

Download
Cite
CommentOpen annotations (there are currently 0 annotations on this page).
Share

Article
Figures and data
Abstract
Introduction
The challenges encountered when designing experiments
Challenges during experiments and peer review
Discussion
Conclusion
Materials and methods
Note
Appendix 1
Data availability
References
Decision letter
Author response
Article and author information
Metrics

Abstract

We conducted the Reproducibility Project: Cancer Biology to investigate the replicability of preclinical research in cancer biology. The initial aim of the project was to repeat 193 experiments from 53 high-impact papers, using an approach in which the experimental protocols and plans for data analysis had to be peer reviewed and accepted for publication before experimental work could begin. However, the various barriers and challenges we encountered while designing and conducting the experiments meant that we were only able to repeat 50 experiments from 23 papers. Here we report these barriers and challenges. First, many original papers failed to report key descriptive and inferential statistics: the data needed to compute effect sizes and conduct power analyses was publicly accessible for just 4 of 193 experiments. Moreover, despite contacting the authors of the original papers, we were unable to obtain these data for 68% of the experiments. Second, none of the 193 experiments were described in sufficient detail in the original paper to enable us to design protocols to repeat the experiments, so we had to seek clarifications from the original authors. While authors were extremely or very helpful for 41% of experiments, they were minimally helpful for 9% of experiments, and not at all helpful (or did not respond to us) for 32% of experiments. Third, once experimental work started, 67% of the peer-reviewed protocols required modifications to complete the research and just 41% of those modifications could be implemented. Cumulatively, these three factors limited the number of experiments that could be repeated. This experience draws attention to a basic and fundamental concern about replication – it is hard to assess whether reported findings are credible.

Introduction

Science is a system for accumulating knowledge. The credibility of knowledge claims relies, in part, on the transparency and repeatability of the evidence used to support them. As a social system, science operates with norms and processes to facilitate the critical appraisal of claims, and transparency and skepticism are virtues endorsed by most scientists (Anderson et al., 2007). Science is also relatively non-hierarchical in that there are no official arbiters of the truth or falsity of claims. However, the interrogation of new claims and evidence by peers occurs continuously, and most formally in the peer review of manuscripts prior to publication. Once new claims are made public, other scientists may question, challenge, or extend them by trying to replicate the evidence or to conduct novel research. The evaluative processes of peer review and replication are the basis for believing that science is self-correcting. Self-correction is necessary because mistakes and false starts are expected when pushing the boundaries of knowledge. Science works because it efficiently identifies those false starts and redirects resources to new possibilities.

We believe everything we wrote in the previous paragraph except for one word in the last sentence – efficiently. Science advances knowledge and is self-correcting, but we do not believe it is doing so very efficiently. Many parts of research could improve to accelerate discovery. In this paper, we report the challenges confronted during a large-scale effort to replicate findings in cancer biology, and describe how improving transparency and sharing can make it easier to assess rigor and replicability and, therefore, to increase research efficiency.

Transparency is essential in any system that seeks to evaluate the credibility of scientific claims. To evaluate a scientific claim one needs access to the evidence supporting the claim – the methodology and materials used, the data generated, and the process of drawing conclusions from those data. The standard process for providing this information is to write a research paper that details the methodology and outcomes. However, this process is imperfect. For example, selectively reporting experiments or analyses, particularly reporting only those that 'worked', biases the literature by ignoring negative or null results (Fanelli, 2010; Fanelli, 2011; Ioannidis, 2005; Rosenthal, 1979; Sterling, 1959; Sterling et al., 1995). And the combined effect of constraints related to the research paper format (including word limits, and only reporting what can be described in words), the tendency of authors to report what they perceive to be important, and rewards for exciting, innovative outcomes is an emphasis on reporting outcomes and their implications, rather than a comprehensive description of the methodology (Kilkenny et al., 2009; Landis et al., 2012; Moher et al., 2008).

The sharing of data, materials, and code can also increase the efficiency of research in a number of ways (Molloy, 2011; Murray-Rust et al., 2010; Nosek et al., 2015). For example, sharing provides opportunities for independent observers to evaluate both the evidence reported in papers and the credibility of the claims based on this evidence; it allows other researchers to analyze the data in different ways (by, for example, using different rules for data exclusion); and it helps other researchers to perform replications to determine if similar evidence can be observed independently of the original context. Moreover, giving other researchers access to data, materials, and code may allow them to identify important features of the research that were not appreciated by the original researchers, or to identify errors in analysis or reporting.

Transparency and sharing therefore contribute to assessment of research reproducibility, robustness, and replicability. Reproducibility refers to whether the reported findings are repeatable using the same analysis on the same data as the original study. Robustness refers to whether the reported findings are repeatable using reasonable alternative analysis strategies on the same data as the original study. Replicability refers to whether the reported findings are repeatable using new data (NAS, 2019). By these definitions, all reported findings should be reproducible in principle; variability in robustness may imply fragility of the phenomenon or greater uncertainty in its evidence base; and variability in replicability may imply fragility, more limited scope of applicability than originally presumed, or uncertainty in the conditions necessary for observing supporting evidence (Nosek and Errington, 2020a). All three are important for assessing the credibility of claims and to make self-corrective processes as efficient as possible.

From 2013 to 2020, as part of the Reproducibility Project: Cancer Biology, we tried to replicate selected results in high-impact preclinical papers in the field of cancer biology (Errington et al., 2014; Table 1). The aim of the project was not to repeat every experiment in each paper: rather it was to repeat a selection of key experiments from each paper. The project also adopted an approach in which a Registered Report describing the experimental protocols and plans for data analysis had to be peer reviewed and accepted for publication before experimental work could begin. The Replication Study reporting the results of the experiments was then peer reviewed to ensure that the experiments had been conducted and analyzed according to the procedures outlined in the Registered Report: crucially, reviewers were asked not to take the ’success' or 'failure' of the experiments into account when reviewing Replication Studies.

Table 1

The 53 papers selected for replication in the RP:CB.

Original paper	Experiments selected	Registered report	Experiments registered	Replication study*	Experiments completed	Data, digital materials, and code
Poliseno et al., 2010	11	Khan et al., 2015	6	Kerwin et al., 2020	5	https://osf.io/yyqas/
Sharma et al., 2010	8	Haven et al., 2016	8	N/A	0	https://osf.io/xbign/
Gupta et al., 2010	2	N/A	0	N/A	0	https://osf.io/4bokd/
Figueroa et al., 2010	6	N/A	0	N/A	0	https://osf.io/xdojz/
Ricci-Vitiani et al., 2010	3	Chroscinski et al., 2015b	2	Errington et al., 2021a	1	https://osf.io/mpyvx/
Kan et al., 2010	3	Sharma et al., 2016a	3	Errington et al., 2021a	1	https://osf.io/jpeqg/
Heidorn et al., 2010	8	Bhargava et al., 2016a	5	Errington et al., 2021a	1	https://osf.io/b1aw6/
Hatzivassiliou et al., 2010	4	Bhargava et al., 2016b	3	Pelech et al., 2021	2	https://osf.io/0hezb/
Vermeulen et al., 2010	4	Evans et al., 2015a	3	Essex et al., 2019	3	https://osf.io/pgjhx/
Carro et al., 2010	8	N/A	0	N/A	0	https://osf.io/mfxpj/
Nazarian et al., 2010	5	N/A	0	N/A	0	https://osf.io/679uw/
Johannessen et al., 2010	5	Sharma et al., 2016b	5	Errington et al., 2021a	2	https://osf.io/lmhjg/
Poulikakos et al., 2010	5	N/A	0	N/A	0	https://osf.io/acpq7/
Sugahara et al., 2010	4	Kandela et al., 2015a	3	Mantis et al., 2017	3	https://osf.io/xu1g2/
Ward et al., 2010	3	Fiehn et al., 2016	3	Showalter et al., 2017	3	https://osf.io/8l4ea/
Ko et al., 2010	3	N/A	0	N/A	0	https://osf.io/udw78/
Zuber et al., 2011	3	N/A	0	N/A	0	https://osf.io/devog/
Delmore et al., 2011	2	Kandela et al., 2015b	2	Aird et al., 2017	2	https://osf.io/7zqxp/
Goetz et al., 2011	2	Fiering et al., 2015	2	Sheen et al., 2019	2	https://osf.io/7yqmp/
Sirota et al., 2011	1	Kandela et al., 2015c	1	Kandela et al., 2017	1	https://osf.io/hxrmm/
Raj et al., 2011	4	N/A	0	N/A	0	https://osf.io/uvapt/
Possemato et al., 2011	3	N/A	0	N/A	0	https://osf.io/u1mfn/
Tay et al., 2011	5	Phelps et al., 2016	5	Wang et al., 2020	4	https://osf.io/oblj1/
Xu et al., 2011	5	Evans et al., 2015b	5	N/A	0	https://osf.io/kvshc/
DeNicola et al., 2011	4	N/A	0	N/A	0	https://osf.io/i0yka/
Zhu et al., 2011	3	N/A	0	N/A	0	https://osf.io/oi7jj/
Liu et al., 2011	4	Li et al., 2015	3	Yan et al., 2019	3	https://osf.io/gb7sr/
Dawson et al., 2011	3	Fung et al., 2015	3	Shan et al., 2017	3	https://osf.io/hcqqy/
Qian et al., 2011	3	N/A	0	N/A	0	https://osf.io/ckpsn/
Sumazin et al., 2011	3	N/A	0	N/A	0	https://osf.io/wcasz/
Chaffer et al., 2011	2	N/A	0	N/A	0	https://osf.io/u6m4z/
Opitz et al., 2011	5	N/A	0	N/A	0	https://osf.io/o2xpf/
Kang et al., 2011	2	Raouf et al., 2015	2	N/A	0	https://osf.io/82nfe/
Chen et al., 2012	2	N/A	0	N/A	0	https://osf.io/egoni/
Driessens et al., 2012	2	N/A	0	N/A	0	https://osf.io/znixv/
Garnett et al., 2012	3	Vanden Heuvel et al., 2016	3	Vanden Heuvel et al., 2018	3	https://osf.io/nbryi/
Schepers et al., 2012	3	N/A	0	N/A	0	https://osf.io/1ovqn/
Willingham et al., 2012	2	Chroscinski et al., 2015a	1	Horrigan and Reproducibility Project: Cancer Biology, 2017a	1	https://osf.io/9pbos/
Straussman et al., 2012	4	Blum et al., 2014	4	N/A	0	https://osf.io/p4lzc/
Arthur et al., 2012	2	Eaton et al., 2015	2	Eaton et al., 2018	2	https://osf.io/y4tvd/
Peinado et al., 2012	3	Lesnik et al., 2016	2	Kim et al., 2018	2	https://osf.io/ewqzf/
Malanchi et al., 2011	3	Incardona et al., 2015	2	N/A	0	https://osf.io/vseix/
Berger et al., 2012	1	Chroscinski et al., 2014	1	Horrigan et al., 2017b	1	https://osf.io/jvpnw/
Prahallad et al., 2012	4	N/A	0	N/A	0	https://osf.io/ecy85/
Wilson et al., 2012	3	Greenfield et al., 2014	2	N/A	0	https://osf.io/h0pnz/
Lu et al., 2012	5	Richarson et al., 2016	3	Errington et al., 2021a	2	https://osf.io/vfsbo/
Lin et al., 2012	2	Blum et al., 2015	2	Lewis et al., 2018	2	https://osf.io/mokeb/
Lee et al., 2012	3	N/A	0	N/A	0	https://osf.io/i25y8/
Castellarin et al., 2012	1	Repass et al., 2016	1	Repass and Reproducibility Project: Cancer Biology, 2018	1	https://osf.io/v4se2/
Crasta et al., 2012	3	N/A	0	N/A	0	https://osf.io/47xy6/
Png et al., 2011	5	N/A	0	N/A	0	https://osf.io/tkzme/
Metallo et al., 2011	5	N/A	0	N/A	0	https://osf.io/isdbh/
Morin et al., 2010	1	N/A	0	N/A	0	https://osf.io/6kuy8/

193 experiments in 53 papers were selected for replication. The papers are listed in column 1, and the number of experiments selected from each paper is listed in column 2. Registered Reports for 87 experiments from 29 papers were published in eLife. The Registered Reports are listed in column 3, and the number of experiments included in each Registered Report is listed in column 4. 50 experiments from 23 Registered Reports were completed. 17 Replication Studies reporting the results of 41 experiments were published in eLife; the results of another nine experiments from the six remaining Registered Reports were published in an aggregate paper (Errington et al., 2021a). The Replication Studies are listed in column 5, and the number of experiments included in each study is listed in column 6. Column seven contains a link to data, digital materials, and code.

The initial goal was to repeat 193 experiments from 53 high-impact papers published between 2010 and 2012, but the obstacles we encountered at every phase of the research lifecycle meant that we were only able to repeat 50 experiments from 23 papers. In a separate paper we report a meta-analysis of the results of those 50 experiments (Errington et al., 2021b). In this paper, we describe the challenges we confronted during the different phases of the research lifecycle. A completed replication attempt passed through six phases: designing the experiment (and writing the Registered Report); peer reviewing the Registered Report; preparing the experiments; conducting the experiments; analysing the data (and writing the Replication Study); and peer reviewing the Replication Study.

The next section discusses in detail the challenges faced during the first of these phases. A subsequent section covers the challenges encountered when conducting the experiments and during the peer review of the Replication Studies.

The challenges encountered when designing experiments

Sampling papers

At the start of the project in 2013 we searched various databases to identify basic research papers in cancer biology published between 2010 and 2012 that were having a substantial impact as indexed by citation rates and readership in multiple databases. We selected the highest impact papers from each year that met inclusion criteria (Errington et al., 2014). We excluded papers that reported exclusively genomics, proteomics, and high-throughput assays. This resulted in 50 included papers for which we initiated the process of preparing a replication. During inquiries with original authors, two papers were identified that we determined would be unfeasible to attempt and we decided to halt the effort; for another paper we requested, but did not receive, a key material (i.e., mouse model) so replication was not feasible. We decided to go back to the sampling pool and pull the next available papers, bringing the effective sample to 53 papers. Observing that challenges like this were relatively common, we did not return to the pool for resampling again for the rest of the project. Among the 53 selected papers, 35 were published in the Nature family of journals, 11 in the Cell family of journals, 4 in the Science family of journals, and three in other journals.

From each paper, we identified a subset of experiments for potential replication with an emphasis on those supporting the main conclusions of the paper and attending to resource constraints (Table 1). In total, 193 experiments were identified for replication across the 53 papers for an average of 3.6 per paper (SD = 1.9; range 1–11). Figure 1 illustrates the fate of all the experiments that we attempted to replicate. Below, we summarize the findings by experiment; similar findings are observed when aggregating the data by paper (Figure 1—figure supplement 1).

Figure 1 with 1 supplement see all

Download asset Open asset

Barriers to conducting replications – by experiment.

During the design phase of the project the 193 experiments selected for replication were coded according to six criteria: availability and sharing of data; reporting of statistical analysis (i.e., did the paper describe the tests used in statistical analysis?; if such tests were not used, did the paper report on biological variation (e.g., graph reporting error bars) or representative images?); availability and sharing of analytic code; did the original authors offer to share key reagents?; what level of protocol clarifications were needed from the original authors?; how helpful were the responses to those requests? The 29 Registered Reports published by the project included protocols for 87 experiments, and these experiments were coded according to three criteria: were reagents shared by the original authors?; did the replication authors have to make modifications to the protocol?; were these modifications implemented? A total of 50 experiments were completed.

Searching for data from the original experiments

We planned to conduct replications with at least 0.80 power to detect the effect size reported in the original paper at p < .05 using two-tailed tests. However, in a number of cases only representative images or graphs were reported in the original paper. This occurred for 53 of the 193 experiments (27%). Additionally, it was uncommon for papers to include the summary statistics (such as sample size, means, standard deviations, and inferential statistics) that were needed to calculate the original effect size. We searched the original paper and supplemental files for the original data. When data were not publicly accessible, we requested them from the original authors. At least some data were open or included in the paper for four experiments (2%), raw data were shared for 31 experiments (16%), summary data were shared for 27 experiments (14%), and nothing was shared for 131 experiments (68%).

Failure to report sample size, variability information from sampling, or inferential tests in the original paper makes it difficult or impossible to calculate effect sizes. Further, failure to share data upon request – even summary statistics – leaves the nature of the original research and inference testing opaque. When we could not obtain the data we needed we estimated means and variability from the available information reported in the original papers (e.g., estimating bar heights and error bars from graphs). In cases where there was no information to estimate, such as only a representative image, we treated the extracted representative data point as the mean and estimated a range of variances to determine the replication sample size (Errington et al., 2014).

Analytic code availability was not common, although, unlike data, we did not explicitly request it for all experiments. Statistical analyses were reported for 78 of the 193 experiments (40%). When the outcome of analyses were reported (e.g., p-value) it was unclear what statistical test was used in 16 of the 78 experiments (21%). Of the experiments that reported an outcome from statistical analyses, at least some analysis code was open for one experiment (1%), code was shared by the original authors for 10 experiments (13%), additional analysis details were shared for four experiments (5%), and nothing was shared for 63 experiments (81%).

Independent development of replication protocols

To carry out rigorous replication studies, we needed to understand the original methodology. We read each paper and supplementary information closely to design a protocol. We coded if we requested a key reagent (i.e., cell lines, plasmids, model organisms, antibodies) that was not available commercially or in a repository. We requested key reagents for 136 of the 193 experiments (70%) and for 45 of the 53 papers (85%).

We coded the frequency with which we were able to design a complete protocol for repeating an experiment based on the original paper without having to contact the original authors to clarify some aspect of the original experiment (see Case study in Box 1). Zero experiments needed no clarifications (0%), 17 experiments needed few clarifications (9%), 77 experiments needed some clarifications (40%), 60 experiments needed moderate clarifications (31%), 29 experiments needed strong clarifications (15%), 10 experiments needed extreme clarifications (5%). To illustrate, one experiment needing few clarifications was missing reagent identifying information (e.g., catalog numbers), cell density at time of transfection (or harvest), and some specific details about the gas chromatography-mass spectrometry methodology (e.g., ramping, derivatization volume, injection volume). An experiment needing moderate clarifications was missing reagent identifying information, specific details about the transfection and infection methodologies (e.g., cell density, amount of plasmid/viral titer), and specific details about the flow cytometry methodology (e.g., cell dissociation technique, specific gating strategy). And, an experiment needing extreme clarifications was missing reagent identifying information, specific details about the transfection and infection methodologies, specific details for injecting mice with cells (e.g., number of cells and volume injected, injection methodology), specific details about the bioluminescence imaging (e.g., amount and location of luciferin injected, time post-injection until measurement), and clarification of measurement details (e.g., the exact days post-injection when measurements were taken, how the reported ratio values were calculated).

Box 1

Case study: Designing a replication protocol by reading the original paper.

Designing the replication protocol (Kandela et al., 2015a) for measuring the effect of doxorubicin alone or in combination with a tumor penetrating peptide in mice bearing orthotopic prostate tumors was challenged by a lack of details in the original paper (Sugahara et al., 2010). There was no detailed protocol for the peptide generation in the paper or the cited references. Instead, the sequence and a general description of the ‘standard’ technique was briefly described. Data variability, sample size, and statistical analyses were reported; however, no raw data was available. The strain and sex of the mice and the cell type and number of cells implanted were provided; however, there were no detailed protocols available for generating or harvesting the orthotopic prostate tumors which meant these details were filled in based on the standard approach used in the replicating laboratory. Most end-point measurements were described or discernable; however, there was no description of how ‘positive area’ was determined for TUNEL staining which meant this needed to be surmised and articulated for the replication attempt. This paper was coded as no data available beyond what was reported in graphs and images in the original paper, statistical analysis reported with tests described with no code available beyond the reported analysis, and “strong clarification” needed about the published experimental methodology.

Requesting assistance from original authors

We sought assistance from original authors to clarify the experimental protocols and to obtain original materials and reagents when necessary. We sent authors the drafted experimental protocols, clarification questions, and requests for materials. Some original authors were helpful and generous with their time providing feedback (see Case study in Box 2), others were not. We coded if original authors were willing to share key reagents. Of the 45 papers for which we requested a key reagent, the authors of 33 papers (73%) offered to share at least one key material. By experiment, of the 136 experiments for which we requested a key reagent, the authors were willing to share for 94 of them (69%).

Box 2

Case study: Feedback from original authors.

The replication protocol (Fiering et al., 2015) for evaluating the impact stromal caveolin-1 has on remodeling the intratumoral microenvironment was challenged by a lack of details in the original paper (Goetz et al., 2011). However, the original authors supplied us with most of the missing details. Based on the description in the paper, multiple strains of knockout mice could have been used for the replications. The authors provided strain stock numbers ensuring the same genetic background was selected. The authors also shared the raw data and statistical analysis: this was particularly helpful for understanding the original effects and sample size planning because the data did not have a normal distribution. The tumor cells used in the original study, engineered to express luciferase, were not available in a repository but the original authors provided them upon request. The authors also provided detailed protocol information and clarified uncertainties with reporting in the original paper. This included the age of the mice, injection details of the cells and luciferin (e.g., location, timing, procedural details), a detailed immunostaining and microscopy protocol (e.g., number of fields taken per section, magnification and specs of the objective), and the euthanasia criteria that was approved by the original study’s ethics committee. The latter determined the number of days the mice were maintained in the original study. The authors also shared an additional assay, which was included in the published Replication Study (Sheen et al., 2019), that demonstrated the extracellular matrix remodeling capabilities of the cells that was not shown in the original paper because journal policy restricted the number of supplemental figures. This paper was coded as raw data shared by original authors, statistical analysis reported with tests described and code shared by original authors, original authors offered to share key reagents, and “extremely helpful” response from the original authors to the “moderate clarification” needed about the published experimental methodology.

We also coded the degree to which authors were helpful in providing feedback and materials for designing the replication experiments. Authors were extremely helpful for 51 experiments (26%), very helpful for 28 experiments (15%), moderately helpful for 18 experiments (9%), somewhat helpful for 18 experiments (9%), minimally helpful for 17 experiments (9%), and not at all helpful/no response for 61 experiments (32%). An example of an extremely helpful response was the corresponding author reaching out to the other authors (who since moved to other institutions) to help with the requests, sharing detailed protocol and reagent information, providing additional information beyond what we requested to help ensure the experimental details were complete, and providing additional feedback on any known deviations that were needed (e.g., different instrumentation) to help ensure a good-faith replication would be designed. An example of a moderately helpful response was replying to all of our requests with the necessary information and providing additional clarifications when follow-up requests were made, but where some parts of the response were not very helpful. For example, a request for specific protocol details was responded with “a standard procedure was used.” Examples of not at all helpful responses include non-response to multiple requests (6/53 papers [11%]) or responses questioning the value of conducting replications and declining to assist.

An obvious hypothesis is that the helpfulness of the original authors was determined by the extent of clarifications requested because of the workload. If only minimal clarification were needed, then authors would be helpful. If lots of clarifications were needed, then authors would not be helpful. The correlation between extent of clarifications and helpfulness was –0.24 (95% CI [–0.48, 0.03]) across papers and –0.20 (95% CI [–0.33, –0.06]) across experiments. Larger requests were only modestly associated with less helpfulness. The variability in this relationship is visualized in Figure 2. We also explored whether the extent of clarifications or helpfulness varied by experimental techniques and found the relationship was similar across different categories of experimental techniques (Figure 2—figure supplement 1; Figure 2—figure supplement 2).

Figure 2 with 2 supplements see all

Download asset Open asset

Relationship between extent of clarification needed and helpfulness of authors.

Fluctuation plots showing the coded ratings for extent of clarifications needed from original authors and the degree to which authors were helpful in providing feedback and materials for designing the replication experiments. The size of the square shows the number (Freq) of papers/experiments for each combination of extent of clarification needed and helpfulness. (A) To characterize papers (N = 53), coded ratings were averaged across experiments for each paper. The average number of experiments per paper was 3.6 (SD = 1.9; range = 1–11). The Spearman rank-order correlation between extent of clarification needed and helpfulness was –0.24 (95% CI [–0.48, 0.03]) across papers. (B) For experiments (N = 193), the Spearman rank-order correlation between extent of clarification needed and helpfulness was –0.20 (95% CI [–0.33, –0.06]).

Preparing the Registered Report for peer review

Depending on feedback and materials received from original authors, some protocols were easier to design than others. To design experiments with at least 0.80 power to detect the effect size reported in the original paper at p < .05 using two-tailed tests, we often needed a larger sample size for the replication than what was reported in the original experiment. As an illustration, the average sample size of animal experiments in the replication protocols (average = 30; SD = 16; median = 26; IQR = 18–41) were 25% higher than the sample size of the original experiments (average = 24; SD = 14; median = 22; IQR = 16–30). Also, some experiments proved challenging to complete, or were discontinued, due to delays and cost increases that emerged when the replications were being designed and/or conducted (e.g., when the original authors declined to share reagents, or it became clear that the material transfer agreement process was going to take a very long time (see Case study in Box 3)). This included discontinuing some viable experiments that were still near the start of the design phase to ensure that experiments that were further along in the process could be completed.

Box 3

Case study: Gathering original materials.

The replication protocols (Khan et al., 2015) for evaluating the impact of PTENP1 on cellular PTEN expression and function required a plasmid that overexpressed the 3’UTR of PTEN. The original paper (Poliseno et al., 2010) described the generation of this plasmid and the original authors agreed to share this plasmid, as indicated in the Registered Report (Khan et al., 2015). A material transfer agreement (MTA) was initiated to obtain the plasmid. More than one year passed without the MTA being finalized preventing us from acquiring the plasmid. To complete the replication study, we regenerated the plasmid adding time and cost to the study. The regenerated plasmid for the replication study was deposited in Addgene (plasmid# 97204; RRID:Addgene_97204) for the research community to easily access for future research. These experiments were coded as key reagents offered to be shared, but not actually shared.

Ultimately, 32 Registered Reports covering 97 experiments were submitted for peer review. One or more authors from the original paper was always invited by eLife to participate in the peer-review process. None of the papers were accepted without revision, one was not resubmitted after resource consideration of requested revisions, and two were rejected. As such, 29 papers with 87 experiments were published as Registered Reports (Table 1). We will now discuss some of the problems and challenges encountered in subsequent phases of the project.

Challenges during experiments and peer review

The challenges encountered when conducting experiments

Once accepted as Registered Reports, experiments could begin in the replication labs. Despite often obtaining original materials and reagents and having fully specified and peer reviewed protocols, it was common that the preregistered procedure had to be modified to complete the experiments (see Case study in Box 4). Sometimes just minor modifications were needed (e.g., changing antibody concentrations or blocking reagents to detect the protein of interest during a Western blot assay). Sometimes moderate modifications were needed. In some cases, for example, despite attempts to adjust the conditions, we were still unable to produce the expected intermediate results (e.g., obtaining the desired transfection efficiency as indicated by a reporter system) and an additional protocol step, different reagent source, or change in instrumentation was needed (e.g., including an enrichment step, such as fluorescence-activated cell sorting [FACS], to increase the number of transfected cells). And in some cases extreme modifications to the preregistered procedure were needed. For example, in one case (Yan et al., 2019) the preregistered protocol did not result in the generation of tumors in mice that were needed for a downstream infection and tumorigenicity assay, so substantial changes had to be made to this protocol to proceed with the experiment (e.g., changing the source of the tumor cells, modifying the timing and technique of infection to achieve the desired transduction efficiency, and using a different technique to detect the molecule of interest).

Box 4

Case study: Solving challenges during data collection.

The replication protocol (Kandela et al., 2015b) for evaluating BET bromodomain inhibition as a therapeutic strategy to target c-Myc described the timeframe for tumor cell inoculation, injection with luciferin to image the tumor progression, and injection with a BET bromodomain inhibitor. This followed the same timing as the original study (Delmore et al., 2011). An initial attempt was unsuccessful in detecting bioluminescence even though disease progression was observed, indicating tumor cell inoculation occurred, and the luciferase expressing tumor cells had a strong luminescent signal prior to injection. Lack of bioluminescence meant that we could not test the BET bromodomain inhibitor because the predetermined baseline bioluminescence indicating disease progression for inhibitor administration was never achieved. We modified the preregistered protocol and selected for highly expressing cells to enrich the tumor cells. We also designed a pilot study to identify a modified time frame in which mice could establish the same detectable baseline bioluminescence as the original study before administration of the inhibitor. We included the initial preregistered study, the pilot, and the modified replication in the published Replication Study (Aird et al., 2017) and discussion of the variability of the timing from tumor cell inoculation until baseline disease detection, comparing the original study, the replication, and other published studies using the same model. This experiment was coded as “completely implemented” for the “extreme modifications” needed for the experiment.

We coded each experiment on the extent to which modifications were needed to conduct a fair replication of the original findings. No modifications were required for 25 of the 87 experiments (29%), few modifications for 18 (21%), some modifications for 12 (14%), moderate modifications for 8 (9%), strong modifications for 6 (7%), and extreme modifications for 7 (8%). We did not start 11 experiments and thus did not assess the level of modification required for these. This means that a total of 76 experiments were started.

The implementation of the modifications varied. When modifications could be carried out, in some cases they were completely implemented (see Case study in Box 4) and in others they were only partially implemented. For example, modifications were successfully implemented to reach some preregistered end-point measurements, but not all (e.g., modifications were implemented to enable quantification of one protein of interest, while continued challenges detecting another protein of interest was eventually halted). Not all modifications could be carried out. In some cases this was due to feasibility or resource constraints; and in other cases it was due to pronounced differences in the behavior of model systems or experimental protocols from what was reported in the original paper that had no obvious strategy for modification (see Case study in Box 5). We coded the extent to which we were able to implement the needed modifications to complete the replication experiments. Modifications were not needed for 25 of the 87 experiments (29%). We completely implemented modifications for 21 experiments (24%), mostly implemented them for four experiments (5%), moderately implemented them for four experiments (5%), implemented some of them for six experiments (7%), implemented few of them for four experiments (5%), and did not implement any for 12 experiments (14%). As before, the 11 experiments that were not started were not assessed. Excluding papers that needed no modifications or were not assessed, the correlation between extent of modification needed and implementation of modifications was –0.01 (95% CI [–0.42, 0.40]) across papers and 0.01 (95% CI [–0.27, 0.28]) across all experiments (Figure 3).

Box 5

Case study: Failing to solve challenges during data collection.

The replication protocol (Chroscinski et al., 2015b) for evaluating whether glioblastoma stem-like cell-derived endothelial cells contribute to tumor growth in vivo required generating cancer cells that stably expressed the thymidine kinase gene under the control of the transcriptional regulatory elements of the endothelial marker Tie2. The preregistered protocol required achieving at least 80% positive expression of the gene, based on a GFP reporter, among the cell populations before proceeding with the xenograft experiment. However, after multiple attempts the required expression level could not be achieved despite obtaining new cells, plasmids, and incorporating changes to the protocol suggested by the original authors to improve the infection efficiency and enrich GFP expressing cells. Eventually, the replication attempt was stopped because of the increasing costs associated with multiple optimization attempts with no feasible path to a solution. The original finding (Ricci-Vitiani et al., 2010), that selective killing of tumor-derived endothelial cells results in tumor reduction and degeneration, was not tested. This experiment was coded as “some implemented” for the “moderate modifications” needed for the experiment.

Figure 3

Download asset Open asset

Relationship between extent of modifications needed and implementation of modifications.

Fluctuation plots showing the coded ratings for extent of modifications needed in order to conduct the replication experiments, and the extent to which the replication authors were able to implement these modifications for experiments that were conducted. The size of the square shows the number (Freq) of papers/experiments for each combination. (A) To characterize papers (N = 29), coded ratings were averaged across the experiments conducted for each paper. The average number of experiments conducted per paper was 2.6 (SD = 1.3; range = 1–6), and the Spearman rank-order correlation between extent of modifications needed and implementation was –0.01 (95% CI [–0.42, 0.40]). (B) For the experiments that were started (N = 76), the Spearman rank-order correlation was 0.01 (95% CI [–0.27, 0.28]).

Having original materials, a fully specified protocol, and peer review from experts was not always sufficient to ensure that the replication protocol behaved as expected to test the original claim. The observed implementation challenges could mean that the original finding is not valid because the model system or other parts of the protocol do not operate that way in reality – for example, the original procedure was confounded or influenced by some unrecognized factors. It might also be that, in some cases, a failure to replicate was caused by the replication team deviating from the protocol in some way that was not recognized, or that a key part of the procedure was left out of the protocol inadvertently. It is also possible that the effect reported in the original paper depended on methodological factors that were not identified by original authors, the replication team, or any other experts involved in the peer review of the original paper or the Registered Report.

Whatever the reason, all of these factors are barriers to replicability and causes of friction in efficiency of replication and discovery. Failures during implementation leave untested the original claim because the original experiment could not be carried out as described. That does not falsify the original claim because the replication does not test it. But, depending on the reasons for failure in implementation, it could raise doubt about the reliability of the claim if it seems that the original methodology could not have produced the reported outcomes either. For example, if the replication study suggests that the model system does not behave as reported in the original study, it could indicate a flaw in the original study or an unknown feature that is necessary to obtain the original behavior.

The challenges encountered during peer review of the Replication Studies

In total, we completed 50 experiments from 23 of the original papers (Table 1). This means that no experiments were completed for six of the original papers for which Registered Reports were published. For 18 of the original papers we were able to complete all experiments described in the Registered Report, so for each of these we prepared and submitted a Replication Study that reported the results of the completed experiments. For five of the original papers we were only able to complete some of the experiments described in the Registered Report: in these cases the results of the completed experiments were reported in an aggregate paper (Errington et al., 2021a).

In the Registered Report/Replication Study model (https://cos.io/rr/), peer review of the Replication Study is supposed to be independent of outcome to mitigate publication bias, suppression of negative results, and results-contingent motivated reasoning interfering with publication (Nosek and Lakens, 2014; Chambers, 2019). Reviewers examine whether the authors completed the experiments as proposed, appropriately interpreted the outcomes, and met any outcome-independent quality control criteria that were defined during the review of the Registered Report. Usually the review process played out according to these ideals, occasionally it did not. This is understandable, partly because the Registered Report model is new for many reviewers, and partly because when observed outcomes differ from expectations it provokes immediate reasoning and rationalizing about why it occurred (Nosek and Errington, 2020a). Indeed, such interrogation of unexpected outcomes is productive for hypothesis generation and exploration of what could be studied next.

A presumed virtue of Registered Reports is that it incorporates preregistration (Camerer et al., 2018) to very clearly separate hypothesis testing (confirmatory) and hypothesis generating (exploratory) modes of analysis. Another virtue is that expert feedback is incorporated during design to improve the quality of the experiments (Soderberg et al., 2021). During peer review of the Registered Reports the reviewers ensure that the proposed experiments are appropriately designed and fair tests of the original findings. That precommitment, by both replication authors and reviewers, is a mechanism to ensure that all results are taken seriously whether they confirm or disconfirm the original finding (Nosek and Errington, 2020b).

During peer review of the Replication Study, the authors and reviewers observe the outcomes and wrestle with what they mean. But, because they made precommitments to the experiments being diagnostic tests of the original finding, new ideas that occur following observation of the outcomes are clearly designated as hypothesis generating ideas for what should be studied next. For example, when an outcome is inconsistent with the original finding, it is common for reviewers to return and re-evaluate methodology (see Case study in Box 6). Features or differences from the original experiments that seemed immaterial a priori become potentially important post facto. The risk, of course, is that the post facto reasoning is just rationalization to maintain belief in an original finding that should now be questioned (Kerr, 1998; Kunda, 1990). Registered Reports mitigates that risk with the precommitment to publishing regardless of outcomes, and then speculations for the causes of different results from the original experiments can be actively and independently tested to assess their veracity.

Box 6

Case study: Peer review of protocols prior to conducting the experiments and after the results are known.

The replication protocol (Vanden Heuvel et al., 2016) for testing the sensitivity of Ewing’s sarcoma cell lines to PARP inhibitors was based on the original paper (Garnett et al., 2012), like all replication protocols. The original authors provided additional feedback to ensure a good-faith replication protocol. Peer review of the protocols further increased the rigor and accuracy of the experimental designs, such as including additional measurements of proliferation to ensure all cell lines were replicating at the time of drug treatment and specifying the minimal number of colonies in the control condition before stopping the experiment. Peer review of the protocols also acknowledged the challenge we faced of not having access to all of the exact same cell lines as the original study and did not raise any concerns when cell lines of the same disease/status were proposed (e.g., different Ewing’ sarcoma cell lines). After the experiments were conducted, the results were submitted for peer review, and the reviewer comments were largely focused on trying to reconcile the differences between the results of the original study and the results of the replication (Vanden Heuvel et al., 2018). A lack of concern about inexact comparability of cell lines in the reviews before the results were known was replaced with highlighted concern that this difference accounted for the lack of statistically significant results in the replication after the results were known. Similarly, after the fact, reviewers raised concerns about the timing of an experiment as potentially not allowing for the effect to be measurable due to the need for cells to be in a proliferative state despite the fact that the design was identical between the replication and original experiments. Some speculations, such as the use of different sarcoma cell lines and the level of knockdown efficiency, are possible explanations for the different results in the replication experiments, but they require follow-up tests to assess whether they actually account for the observed differences. We included these possibilities when discussing the results in the Replication Study, but disagreed with a request from the reviewers that the speculations justified labeling the replication result as “inconclusive”.

This mostly occurred as intended in this project. Of the 18 Replication Studies submitted to eLife, 17 were accepted and one was rejected. The rejected Replication Study was posted as a preprint (Pelech et al., 2021). eLife makes reviewer comments and author responses to reviews public with the published papers. Links to all published papers and reviewer comments are in Table 1. With rejection of one completed Replication Study, the Registered Reports model was mostly effective at eliminating publication bias against negative results (Allen and Mehler, 2019; Scheel et al., 2020). With peer review in advance, the Registered Reports model was effective at fostering precommitments among authors and reviewers to the replication experimental designs (Nosek and Errington, 2020b). And, as evidenced by the diversity of reactions in the open reviews and commentaries on the final Replication Studies, the Registered Reports model did not eliminate divergence and disagreement among researchers about the meaning and implications of the replication findings. As long as all outcomes are reported, such divergence after the fact may be productive for stimulating critical inquiry and generating hypotheses even when it is indicative of intransigence or motivated reasoning to preserve prior claims.

The duration of the different phases in the project

On average the gap between paper selection and the submission of a Registered Report was 30 weeks (mean), and the gap between submission and acceptance for publication was 19 weeks (Figure 4). It then took an average of 12 weeks to prepare experiments for data collection. The gap between the start of experimental work and final data delivery was 90 weeks, and another 24 weeks were needed to analyse the data and write the Replication Study. The gap between submission of the Replication Study and acceptance for publication was 22 weeks. On average the process took 197 weeks.

Figure 4

Download asset Open asset

The different phases of the replication process.

Graph showing the number of papers entering each of the six phases of the replication process, and the mean duration of each phase in weeks. 53 papers entered the design phase, which started with the selection of papers for replication and ended with submission of a Registered Report (mean = 30 weeks; median = 31; IQR = 21–37). 32 papers entered the protocol peer reviewed phase, which ended with the acceptance of a Registered Report (mean = 19 weeks; median = 18; IQR = 15–24). 29 papers entered the preparation phase (Prep), which ended when experimental work began (mean = 12 weeks; median = 3; IQR = 0–11). The mean for the prep phase was much higher than the median (and outside the IQR) because this phase took less than a week for many studies, but much longer for a small number of studies. The same 29 papers entered the conducted phase, which ended when the final experimental data were delivered (mean = 90 weeks; median = 88; IQR = 44–127), and the analysis and writing phase started, which ended with the submission of a Replication Study (mean = 24 weeks; median = 23; IQR = 7–32). 18 papers entered the results peer review phase, which ended with the acceptance of a Replication Study (mean = 22 weeks; median = 18; IQR = 15–26). In the end, 17 Replication Studies were accepted for publication. The entire process had a mean length of 197 weeks and a median length of 181 weeks (IQR = 102–257).

All the experimental details (e.g., additional protocol details, data, analysis files) are openly available at https://osf.io/collections/rpcb/ (see Table 1 for links to individual studies), or domain specific repositories (e.g., https://www.metabolomicsworkbench.org); physical materials (e.g., plasmids) were made openly available where possible (e.g., https://www.addgene.org).

Discussion

Much of the concern about replicability in science is whether reported findings are credible (Begley and Ellis, 2012; Camerer et al., 2016; Camerer et al., 2018; Errington et al., 2021b; Open Science Collaboration, 2015; Prinz et al., 2011). Our experience conducting this project identifies a much more basic and fundamental concern about replication – it is hard to assess whether reported findings are credible. We attempted to replicate 193 experiments from 53 papers, but we experienced reproducibility challenges at every phase of the research lifecycle. Many original papers failed to report key descriptive and inferential statistics including 27 % of experiments just presenting representative images and 21% of experiments reporting inferential test outcomes not reporting which test was conducted. Raw data was publicly accessible for just 2% of experiments to reproduce the findings, compute effect sizes, and conduct power analyses. After requesting original data from authors, we acquired raw data for 16%, summary data for 14%, and nothing for 68% of experiments. None of the 193 experiments was described completely enough to design a replication protocol without requesting clarifying details from the original authors.

Authors were bimodal in their helpfulness in sharing data and materials and providing feedback, 32% were not at all helpful/no response and 26 % were extremely helpful. Implementation of peer-reviewed and preregistered protocols often led to unexpected challenges such as model systems behaving differently than originally reported, requiring modifications to protocols. Just 33% of experiments required no modifications. Of those needing modifications, 41% were implemented completely. Cumulatively, these process challenges for assessing replicability slowed the project and increased costs (see Appendix 1). After an extended data collection period, we completed replications of 50 experiments from 23 papers.

Original papers do not include enough information about the methodology and results. Original data and materials are not archived and accessible in repositories. Original authors are variably willing or able to clarify information gaps and share data, materials, and reagents to facilitate assessment of original findings. These challenges slowed progress, inflated costs, and made it harder to design and conduct replication studies. None of this is a direct indication of whether any particular original finding is credible, but all of it is a challenge for credibility of research in general (Begley and Ioannidis, 2015; Ioannidis et al., 2014). Credibility of scientific claims is rooted in their independent verifiability (Nosek and Errington, 2020a; Putnam, 1975; Schmidt, 2009). Pervasive impediments to verification mean that research is not living up to the “show me” ethos of science and is functionally operating as a “trust me” enterprise.

Practical barriers to the assessment of replicability compounds the credibility risk that is already present with a research culture that prizes innovation at the expense of verification (Martin, 1992; Sovacool, 2008). Publication is achieved, grants are given, and careers are made on the production of positive results not negative results, tidy evidence and explanation not uncertainty and exceptions, and novel findings not replications or incremental extensions of prior work (Giner-Sorolla, 2012; Mahoney, 1977; Nosek et al., 2012). These incentives encourage publication bias against negative results and selective reporting to indicate stronger, cleaner findings than the reality of the evidence – and the behaviors that produce these outcomes could occur without intention or control via motivated reasoning (Hart et al., 2009; Kunda, 1990), confirmation bias (Nickerson, 1998), and hindsight bias (Christensen-Szalanski and Willham, 1991; Fischhoff and Beyth, 1975). Lack of documentation and transparency of the research process makes it difficult to identify these behaviors. And, even if researchers are motivated to conduct independent verification, not only are there disincentives to spend resources on reproduction and replication and cultural resistance to the practice, there are also mundane practical barriers to doing so because of lack of documentation, transparency, and sharing. In short, we have created a research culture in which assessing replicability and reproducibility is unrewarded, unnecessarily difficult, and potentially career damaging.

If the published literature were highly credible, and if false starts were efficiently weeded out of the literature, then the lack of reward and feasibility for verification and replication efforts might not be a cause for concern. However, the present evidence suggests that we should be concerned. As reported in Errington et al., 2021b, replication efforts frequently produced evidence that was weaker or inconsistent with original studies. These results corroborate similar efforts by pharmaceutical companies to replicate findings in cancer biology (Begley and Ellis, 2012; Prinz et al., 2011), efforts by a non-profit biotech to replicate findings of potential drugs in a mouse model of amyotrophic lateral sclerosis (Perrin, 2014), and systematic replication efforts in other disciplines (Camerer et al., 2016; Camerer et al., 2018; Cova et al., 2018; Ebersole et al., 2016; Ebersole et al., 2019; Klein et al., 2014; Klein et al., 2018; Open Science Collaboration, 2015; Steward et al., 2012). Moreover, the evidence for self-corrective processes in the scientific literature is underwhelming: extremely few replication studies are published (Makel et al., 2012; Makel and Plucker, 2014); preclinical findings are often advanced to clinical trials before they have been verified and replicated by other laboratories (Chalmers et al., 2014; Drucker, 2016; Ramirez et al., 2017); and many papers continue to be cited even after they have been retracted (Budd et al., 1999; Lu et al., 2013; Madlock-Brown and Eichmann, 2015; Pfeifer and Snodgrass, 1990). If replicability is low and the self-correction processes in science are not efficiently separating the credible from the not credible, then the culture of modern research is creating unnecessary friction in the pace of discovery.

Fundamentally, the problem with practical barriers to assessing replicability and reproducibility is that it increases uncertainty in the credibility of scientific claims. Are we building on solid foundations? Do we know what we think we know? Assessing replicability and reproducibility are important mechanisms for identifying whether findings are credible, for clarifying boundary conditions on circumscribed findings, and for generalizing findings to untested circumstances (Nosek and Errington, 2020a). There are open questions about the appropriate distribution of resource investment between innovation and verification efforts. Here, for example, though costs increased because of the unexpected impediments, the final cost per experiment of approximately $53,000 might be seen as comparatively modest compared to the losses incurred by follow-on research for findings that are unreplicable or much more limited than initially believed. DARPA’s Friend or Foe program might be a case in point in which a portion of the program budget is invested in independent verification and validation (Raphael et al., 2020). In any case, an efficient science would not impose unnecessary practical barriers to verification just as it should not impose unnecessary practical barriers to innovation. We can do better. Fortunately, there are mechanisms that could greatly enhance the ability to assess whether reported findings are credible, and reduce the barriers to verification efforts more generally. Moreover, some mechanisms are in practice already demonstrating their feasibility for broad implementation.

Improving documentation and reporting

Reading the paper and supplementary materials was sufficient to design the replication study for none of the 193 experiments. Lack of interest or attention to methods, space constraints, and an absence of standards may all contribute to weaknesses in documentation of how the research was done. Better reporting will improve research efficiency by helping authors and peer reviewers identify errors or other potential limitations. Better reporting will also improve research efficiency by helping readers who wish to replicate or extend the research to develop accurate experimental designs. Our sample of papers came from articles published between 2010 and 2012. Since then, some publishers have taken steps to improve reporting and standards have emerged to promote consistency, clarity, and accuracy (Marcus, 2016; Nature, 2013), and a coalition of publishers and other stakeholders are promoting minimum reporting standards for life science (Macleod et al., 2021). Also, increasing frequency of citation of data, materials, reagents, and antibodies highlights improving reporting standards (Han et al., 2017; Macleod, 2017).

There is still a long way to go before strong reporting is normative (Baker et al., 2014; Gulin et al., 2015), but the efforts to establish reporting standards and requirements has positioned the community for significant improvement in making it possible to understand how the research was conducted (Glasziou et al., 2014; Macleod et al., 2014). A potential negative consequence of improving documentation and reporting is additional burden on researchers without compensatory benefits for actually improving research. Regardless of their benefits, implementations of reporting standards should make them easy and efficient to adopt and attentive to diminishing returns. The sweet spot of reporting standards is to provide sufficient structure, specificity, and support to make the research process transparent and simultaneously to avoid turning a good practice into just another bureaucratic burden.

Improving data, code, and materials transparency and sharing

For many of the experiments we examined, we could not determine key details of the original results from the paper such as sample size, effect size, or variability. Data and code were almost never available in public repositories, and requests for sharing the original data mostly failed. It is not possible to assess reproducibility or robustness if data are not available. And, policies that data are to be made available “upon request” are recognized as ineffective (McNutt et al., 2016). One obvious reason is that such requests come long after the original researchers have moved on from the project, making the data difficult, impossible, or time-consuming to recover. Hundreds of journals have strengthened their policies to promote data and code sharing, and the rates of sharing are improving, if slowly (Camerer et al., 2018; Serghiou et al., 2021; Stodden et al., 2013; see journal transparency policies at https://topfactor.org). The infrastructure for sharing and archiving data and code has blossomed with domain-specific repositories for a wide variety of data types such as GenBank, Protein DataBank, and Cancer Imaging Archive, and emergence of metadata standards more generally (Wilkinson et al., 2016). Generalist repositories such as OSF, Zenodo, and Figshare offer archiving solutions for digital data of almost any kind.

Repositories are likewise available for sharing digital materials such as protocols, additional images, IACUC or IRB documentation, or any other content. For example, the OSF projects for these replication efforts include cell line authentication (e.g., STR and mycoplasma testing), plasmid verification (e.g., sequencing files), maintenance records (e.g., cell culture, animal husbandry), and all raw images (e.g., Western blot, immunohistochemistry, bioluminescence images) for relevant experiments alongside the data and code. Another challenge to address to improve research efficiency is burdens and delays for sharing physical materials such as cells, plasmids, animals, and antibodies. Repositories are available for sharing physical materials (e.g., https://www.addgene.org, https://www.mmrrc.org, https://www.atcc.org) and relieves scientists of having to maintain and distribute to other researchers minimizing costs associated when researchers have to make them again (see Case study in Box 3; Lloyd et al., 2015). When not available in a repository, we experienced a variety of unnecessary barriers and delays navigating material transfer agreements with institutions because of lack of interest, infrastructure, or policy for facilitating material sharing. There is an opportunity for substantial improvement not only for replications but also for novel research that builds upon published research by having better funding and legal structures for sharing materials. For example, the initiation of replication experiments at replication labs was significantly accelerated by the existence of standard master services agreements already in place with all replicating labs via the Science Exchange marketplace.

Potential negative consequences of improved sharing can occur if the scholarly reward systems fail to catch up. At present, some researchers see risk and little reward for sharing because of lack of credit for doing so. Evidence suggests that there is more benefit than cost (McKiernan et al., 2016), but altering reward systems toward treating data, materials, and code as citable scholarly contributions will ease the perceived risks.

Improving preregistration of experiments and analysis plans

Two key factors undermining the credibility and replicability of research are publication bias and questionable research practices like p-hacking. With publication bias, negative findings are much less likely to be reported than positive findings (Greenwald, 1975; Rosenthal, 1979). With questionable research practices, discretion in data analysis and selective reporting of outcomes can lead to intentional or unintentional manufacturing and reporting of positive outcomes that are more favorable for publication (Casadevall and Fang, 2012; Gelman and Loken, 2013; Ioannidis, 2005; John et al., 2012; Kaplan and Irvin, 2015; van der Naald et al., 2020; Simmons et al., 2011). These lead to a biased literature with exaggerated claims and incredible evidence (Begley and Ellis, 2012; Open Science Collaboration, 2015; Prinz et al., 2011; Smaldino and McElreath, 2016).

One solution to these challenges is preregistration (Nosek et al., 2019; Nosek et al., 2018; Wagenmakers et al., 2012). Preregistration of experiments mitigates publication bias by making all research discoverable whether or not it is ultimately published. Preregistration of analysis plans solves selective reporting by making clear what analyses were planned a priori and what was determined and conducted after the fact. Planned and unplanned analyses both contribute to advancement of knowledge, the latter often being the source of unexpected discoveries. But, unplanned analyses that occur after observing the data are usually more tentative and uncertain. Preregistration helps increase visibility of that uncertainty and reduce the likelihood of inadvertently mistaking an uncertain exploratory result as a confirmatory test of an existing hypothesis. For areas of research that have been investigating replicability, such as psychology and economics, preregistration has gained rapid adoption (Christensen et al., 2019; Nosek and Lindsay, 2018).

Preregistration is still relatively rare in basic and preclinical research in the life sciences, but the potential for improving replicability and research efficiency is pronounced. In life science experiments involving animals, there are significant ethical implications for not publishing negative results derived from these experiments. Recent studies suggest the data from only 26 % of animals used in life science experiments are ever published (van der Naald et al., 2020). One could argue that ensuring outcome reporting of all animal experiments is an ethical issue, and IACUC’s could incentivize or require preregistration as a compliance mechanism. A recent NIH committee report focusing on improving research rigor and reproducibility recommended piloting preregistration in animal research to test its effectiveness (Wold et al., 2021).

Like improving reporting standards, a potential risk of preregistration is creating bureaucratic burden that does not exceed the benefits of instituting the process. Technology supporting preregistration can minimize that burden with efficient workflows that researchers perceive as supporting effective research planning rather than imposing reporting burdens. Also, misperceptions that preregistration discourages exploratory or discovery oriented research could interfere with effective adoption and application. As such, education and training are essential components of effective adoption.

Improving rigor, reporting, and incentives with Registered Reports

All replication studies were peer reviewed at eLife prior to conducting the research, a publishing model called Registered Reports. With Registered Reports, expert critique improves experimental designs before they are conducted rather than just pointing out the errors and problems after the work is completed. Preregistration is built into the process eliminating publication bias and providing a clear distinction between planned analyses and exploratory discoveries. Publication decisions are made based on the importance of the research question and the quality of the methodology proposed to test the question, not whether the observed outcomes are exciting or as expected. Incentives for exciting findings, regardless of credibility, are removed. Researchers are instead incentivized to ask important questions and design creative and compelling tests of those questions (Chambers, 2019).

As of late 2021, more than 300 journals have adopted Registered Reports as a submission option, mostly in the social-behavioral sciences and neuroscience. Evidence to date suggests that Registered Reports are effective at eliminating publication bias. In a sample of 71 Registered Reports and 152 comparison articles from the same outlets published around the same time, 56 % of primary outcomes were negative results for Registered Reports and 4 % for comparison articles (Scheel et al., 2020). Moreover, despite the increase of supposedly “boring” negative results, a sample of Registered Reports received similar or greater altmetric attention and citation impact as comparison articles (Hummer et al., 2017). An observational study also found evidence that Registered Reports outperform comparison articles on all 19 outcome measures from slightly on measures of novelty and creativity to strongly on measures of quality and rigor (Soderberg et al., 2021).

Some funders and journals are conducting partnerships via Registered Reports in which a single peer review process results in in-principle acceptance of the paper and funding to conduct the experiments such as programs sponsored by the Children’s Tumor Foundation (https://www.ctf.org/research/drug-discovery-initiative-registered-reports-ddirr) and The Flu Lab (https://cos.io/flulab/). This offers a compelling incentive alignment for researchers and opportunity for journals to receive and publish high-quality, funded projects and funders to maximize their return on investment by ensuring that funded studies don’t wind up in the file drawer. Like preregistration, a potential unintended negative consequence of Registered Reports is if the model shifts the culture away from valuing exploratory and discovery-oriented research. Ideally, both practices facilitate clarity of when research is testing versus generating hypotheses without fostering the perception that research progress can occur with one and without the other.

Improving incentives for replication

With a research culture that prizes innovation and novelty, verification and replication gets pushed aside. Innovation without verification creates a fragile and fragmented evidence base that may slow the pace of knowledge accumulation (Chalmers et al., 2014). Replication is essential for advancing theory because it provides an opportunity to confront and refine current understanding (Nosek and Errington, 2020a). Investigations of the prevalence of replication studies in the published literature yield extremely low estimates in different disciplines (Makel and Plucker, 2014; Makel et al., 2012; Pridemore et al., 2018; Valentine et al., 2011). There is no known systematic investigation of the prevalence of replication studies in cancer biology, but like other fields – with a strong emphasis on innovation and novelty in cancer biology – there is little encouragement by journals or funders for proposing, conducting, or reporting replications. Without reward systems for replication research, it is unlikely that the near exclusive emphasis on innovation will be reduced.

Simultaneously, it is not clear that a dramatic shift in the proportion of studies for replication is needed. Only a small portion of the research literature has a substantial impact on the direction and investment in research. By focusing replication resources on the research that is having significant impact and spurring new investment, even a small infusion of funding, journal space, and institutional reward for replications could have a dramatic effect on improving clarity about credibility and replicability – emboldening investments on productive paths and saving resources from dead ends. That is not to suggest that replications provide definitive evidence to confirm or disconfirm original findings. Rather, successful replications promote confidence that new findings are reliable and justify further investigation into their validity and applicability, and unsuccessful replication prompt questions to look closer at the phenomenon to determine whether the failure is due to a false positive in the original research, a flaw in the replication, or previously unidentified conditions that influence whether the phenomenon is observed (Errington et al., 2021b; Nosek and Errington, 2020b; Nosek and Errington, 2020a).

There is some movement toward valuing replication research more explicitly with some journals providing explicit statements in their policies about publishing replications and funders like DARPA in the US explicitly investing in independent verification and validation as part of ongoing programs pursuing research innovations (Raphael et al., 2020). Also, funders are occasionally launching programs to support replication studies such as the NWO in the Netherlands (Baker, 2016) and the NSF in the US (Cook, 2016). A potential risk of increased rewards for replication is if expectations for conducting or achieving replicability become so high that they discourage risk-taking and pursuit of highly resource intensive investigations. For example, in early phases of research, low replicability is not surprising because researchers are often pursuing ideas that have low prior odds of success. The optimal mixture of investment in innovation versus verification research is unknown.

Collectively, these improvements to transparency and sharing are captured by the Transparency and Openness Promotion Guidelines (TOP; https://cos.io/top), a policy framework for publishers, funders, and institutions to set standards for transparency of the research process and outputs by their authors, grantees, or staff (Camerer et al., 2018). As of 2020, more than 1,000 journals have implemented TOP compliant policies for one or more of the categories of improvements. TOP Factor (https://topfactor.org) rates journal policies on promoting transparency, openness, and reproducibility, and the web interface makes it easy to compare across journals. Pervasive adoption of open behaviors and policies by all stakeholders would help shift norms and set higher standards for transparency, sharing, and reproducibility of research.

Simultaneously, an active metascience research community that evaluates the impact of these new behaviors and policies will help identify unintended negative consequences, improve their implementation, and optimize their adoption for facilitating research progress. Stakeholders in the cancer biology community including researchers, funders, societies, and institutional representatives could facilitate and support research investigations of the scientific process so that decisions about adopting these behaviors at scale can be evidence-based and clearly represent both the costs and benefits.

Conclusion

We experienced substantial challenges when designing protocols to replicate experiments from published papers because the papers often did not contain the information required for such replications (such as raw data and identifiers for reagents and research materials). There is substantial opportunity to improve the seemingly mundane but critical behaviors of documentation, transparency, and open sharing of protocols, data, and research materials, if the scientific community is to improve the reproducibility, replicability, and reuse of research. Initiatives to improve the reporting of methods and results – including preregistration of experiments, and the reporting of both negative and positive results – have started to make headway in the life sciences, and some are becoming mainstream in neighboring disciplines. Collectively, these initiatives offer substantial opportunities to improve the replicability and credibility of research and, ultimately, to advance scientific knowledge in a way that is both efficient and reliable.

Materials and methods

Paper and experiment selection strategy

Request a detailed protocol

50 papers published in 2010, 2011 or 2012 were selected as described in Errington et al., 2014 . After the project started one paper was replaced because it contained sequencing and proteomic experiments and should not have been selected in the first place. During the course of the project, after contacting the original authors, we determined that it would not be feasible to conduct replications for three papers, so these papers were replaced.

Experiments for replication were identified as described in Errington et al., 2014. Corresponding authors were contacted and shared the drafted replication protocols based on information from the original papers. Specific questions were highlighted including requests for original data, key materials that were identified, and protocol clarifications. We also asked for any additional information that could improve the quality of the replication attempt. Following initial author feedback, we shared replication protocols with research providers from the Science Exchange marketplace, which consists of a database of searchable scientific service providers that have been qualified and contracted under a standard already negotiated master services agreement. On average it took 6 days from placing requests to receiving a quote from replicating labs (median = 2 days; IQR = 1–8). In total 48 providers participated in the project (22 academic shared resource facilities and 26 contract research organizations [CROs]) by reviewing and contributing to replication protocols, including describing deviations from the original study (e.g., different instrumentation), and conducting the replication experiments themselves. Experimental designs and protocols were iterated based on comments and suggestions from original authors, when possible, and the replicating researchers. Experiments were then submitted as a Registered Report to eLife where it underwent peer review and if approved began experimentation. In all, 193 experiments were included in the project: 188 experiments were identified at the start of the project; three were added during peer review of the Registered Reports, and two were added following the exchange of comments and suggestions with original authors. At the same time, 83 experiments were dropped following exchanges with original authors. Of the 110 experiments that continued, 97 were included in Registered Reports that we submitted to eLife. The 29 Registered Reports that were accepted for publication included 87 experiments.

Coding

Request a detailed protocol

Papers were coded for metadata and whether corresponding authors responded to any email requests. Experiments were coded on a number of variables from the papers, requests and input from the original authors, and information about the replication attempt. Experiments were linked to specific figures and tables in the original papers. Variables were coded as described in the data dictionary and figures, figure legends, main text, methods, and supplementary figures/tables were searched for the information. Variables about requests and input from the original authors were coded based on the protocol documents shared with original authors for input and the responses received and were either objective or subjective. Information about the replication attempts were coded based on objective features or our subjective experience of the process. For subjective variables coded responses were given according to a Likert scale with examples given in the main text to provide illustrations of the subjective coding. Data dictionaries describing all of the variables are available at https://osf.io/e5nvr/.

Statistical analysis and reporting

Request a detailed protocol

Descriptive and exploratory statistics were used to analyze coded variables in R software (RRID:SCR_001905), version 4.0.3 (R Development Core Team, 2021). Figures 2—4 were generated using the ggplot2 (version 3.3.3) package. Exploratory analysis (Spearman rank-order correlation) was conducted after data were checked to ensure assumptions were met.

Note

All eLife content related to the Reproducibility Project: Cancer Biology is available at: https://elifesciences.org/collections/9b1e83d1/reproducibility-project-cancer-biology.

All underlying data, code, and digital materials for the project is available at: https://osf.io/collections/rpcb/.

Appendix 1

Costs associated with evaluating reproducibility

In conducting research, there is a constant interplay of evaluating available resources for making decisions about research investments. This is a combination of time, cost, and accessibility of materials and reagents to do the research. In typical work, that decision matrix also includes questions about potential viability of the research direction, potential impact of success, and confidence in the current evidence (shall we replicate or proceed with the evidence we have?). These latter items were not complicating factors for our project, but the former ones were substantial as we had many papers, many experiments, lab sourcing, design feedback, cost projections, time projections, and coordination decisions to resolve. One of the most concrete ways to express that challenge is the evolving cost estimates for conducting the research over the course of the project.

At the start of the project we budgeted $25,000 per paper, and updated this figure as the project progressed. By the time peer review of the Registered Reports began, the estimated cost per paper had increased to $35,750 (median = $33,822; IQR = $26,448–$44,260). At the onset of data collection, the average estimate was $42,017 (median = $39,843; IQR = $28,069–$55,750). And, the actual average cost on completion was $52,574 per replication study (median = $53,089; IQR = $33,994–$61,496). In total, $1,524,640 was spent on replication experiments. Not included in these costs are project administration costs, particularly personnel costs, accrued as the project took longer to complete than originally estimated because of the unexpected challenges of getting feedback, obtaining materials, carrying out the experiments, internal delays in project management, and the common delays in peer review. Additionally, not counted in these costs are donated reagents from scientific suppliers and replicating labs that provided discounted costs to support the project.

Delays and increasing costs were practical challenges for investigating reproducibility. If data and materials were readily available, the time and cost of designing experiments would have been lower. If original papers were more comprehensive in reporting methodology, the time and cost of designing protocols would have been lower. If providing feedback on replication designs were normative, the time and cost of confirming protocols would have been lower. Improving sharing, documentation, and feedback are fixable with changes to norms and policies.

Some delays and costs were due to experimental systems (e.g., tumor growth in animals) not behaving as they had in the original study. Whether or not it is possible to make experimental systems more consistent in their behavior depends on whether these inconsistencies are an inherent feature of working with complex biological systems that cannot be avoided, or if they are due to weaknesses in methodology that could be addressed.

Delays and increasing costs also had other consequences: for example, experiments were staggered over time, so as some experiments were completed, we were able to update time and cost estimates for experiments not yet started. Decisions to end individual experiments were influenced partly by challenges unique to that experiment, and partly by factors related to time and cost estimates across the project as a whole. A decision to end an experiment does not necessarily mean that the original finding is unreplicable. It is possible that devoting more project resources to any one finding would have ultimately resolved the challenges and replicated the result successfully. Likewise, a decision to end an experiment does not validate the original finding. It is possible that the practical challenges are indicators of deeper issues with the original findings.

Data availability

All experimental details (e.g., additional protocol details, data, analysis files) of the individual replications and data, code, and materials for the overall project are openly available at https://osf.io/collections/rpcb/; see Table 1 of the present article for links to individual studies. Master data files, containing the aggregate coded variables, are available for exploratory analysis at https://osf.io/e5nvr/.

The following data sets were generated

1. Errington TM
2. Denis A
(2021) Open Science Framework
ID e5nvr. Replication Data from the Reproducibility Project: Cancer Biology.

https://osf.io/e5nvr/

References

(2017) Replication Study: BET bromodomain inhibition as a therapeutic strategy to target c-Myc
eLife 6:e21253.

https://doi.org/10.7554/eLife.21253
- PubMed
- Google Scholar
1. Allen C
2. Mehler DMA
(2019) Open science challenges, benefits and tips in early career and beyond
PLOS Biology 17:e3000246.

https://doi.org/10.1371/journal.pbio.3000246
- PubMed
- Google Scholar
(2007) Normative dissonance in science: results from a national survey of U.S. scientists
Journal of Empirical Research on Human Research Ethics 2:3–14.

https://doi.org/10.1525/jer.2007.2.4.3
- PubMed
- Google Scholar
1. Arthur JC
2. Perez-Chanona E
3. Mühlbauer M
4. Tomkovich S
5. Uronis JM
6. Fan TJ
7. Campbell BJ
8. Abujamel T
9. Dogan B
10. Rogers AB
11. Rhodes JM
12. Stintzi A
13. Simpson KW
14. Hansen JJ
15. Keku TO
16. Fodor AA
17. Jobin C
(2012) Intestinal inflammation targets cancer-inducing activity of the microbiota
Science 338:120–123.

https://doi.org/10.1126/science.1224820
- PubMed
- Google Scholar
1. Baker D
2. Lidster K
3. Sottomayor A
4. Amor S
(2014) Two years later: journals are not yet enforcing the ARRIVE guidelines on reporting standards for pre-clinical animal studies
PLOS Biology 12:e1001756.

https://doi.org/10.1371/journal.pbio.1001756
- PubMed
- Google Scholar
1. Baker M
(2016) Dutch agency launches first grants programme dedicated to replication
Nature.

https://doi.org/10.1038/nature.2016.20287
- Google Scholar
1. Begley CG
2. Ellis LM
(2012) Drug development: Raise standards for preclinical cancer research
Nature 483:531–533.

https://doi.org/10.1038/483531a
- PubMed
- Google Scholar
1. Begley CG
2. Ioannidis JPA
(2015) Reproducibility in science: Improving the standard for basic and preclinical research
Circulation Research 116:116–126.

https://doi.org/10.1161/CIRCRESAHA.114.303819
- PubMed
- Google Scholar
1. Berger MF
2. Hodis E
3. Heffernan TP
4. Deribe YL
5. Lawrence MS
6. Protopopov A
7. Ivanova E
8. Watson IR
9. Nickerson E
10. Ghosh P
11. Zhang H
12. Zeid R
13. Ren X
14. Cibulskis K
15. Sivachenko AY
16. Wagle N
17. Sucker A
18. Sougnez C
19. Onofrio R
20. Ambrogio L
21. Auclair D
22. Fennell T
23. Carter SL
24. Drier Y
25. Stojanov P
26. Singer MA
27. Voet D
28. Jing R
29. Saksena G
30. Barretina J
31. Ramos AH
32. Pugh TJ
33. Stransky N
34. Parkin M
35. Winckler W
36. Mahan S
37. Ardlie K
38. Baldwin J
39. Wargo J
40. Schadendorf D
41. Meyerson M
42. Gabriel SB
43. Golub TR
44. Wagner SN
45. Lander ES
46. Getz G
47. Chin L
48. Garraway LA
(2012) Melanoma genome sequencing reveals frequent PREX2 mutations
Nature 485:502–506.

https://doi.org/10.1038/nature11071
- PubMed
- Google Scholar
(2016a) Registered Report: Kinase-dead BRAF and oncogenic RAS cooperate to drive tumor progression through CRAF
eLife 5:e11999.

https://doi.org/10.7554/eLife.11999
- Google Scholar
(2016b) Registered Report: RAF inhibitors prime wild-type RAF to activate the MAPK pathway and enhance growth
eLife 5:e09976.

https://doi.org/10.7554/eLife.09976
- PubMed
- Google Scholar
(2014) Registered Report: Tumour micro-environment elicits innate resistance to RAF inhibitors through HGF secretion
eLife 3:e04034.

https://doi.org/10.7554/eLife.04034
- PubMed
- Google Scholar
(2015) Registered Report: Transcriptional amplification in tumor cells with elevated c-Myc
eLife 4:e04024.

https://doi.org/10.7554/eLife.04024
- PubMed
- Google Scholar
(1999)
Effects of article retraction on citation and practice in medicine

Bulletin of the Medical Library Association 87:437–443.
- PubMed
- Google Scholar
1. Camerer CF
2. Dreber A
3. Forsell E
4. Ho TH
5. Huber J
6. Johannesson M
7. Kirchler M
8. Almenberg J
9. Altmejd A
10. Chan T
11. Heikensten E
12. Holzmeister F
13. Imai T
14. Isaksson S
15. Nave G
16. Pfeiffer T
17. Razen M
18. Wu H
(2016) Evaluating replicability of laboratory experiments in economics
Science 351:1433–1436.

https://doi.org/10.1126/science.aaf0918
- PubMed
- Google Scholar
1. Camerer CF
2. Dreber A
3. Holzmeister F
4. Ho TH
5. Huber J
6. Johannesson M
7. Kirchler M
8. Nave G
9. Nosek BA
10. Pfeiffer T
11. Altmejd A
12. Buttrick N
13. Chan T
14. Chen Y
15. Forsell E
16. Gampa A
17. Heikensten E
18. Hummer L
19. Imai T
20. Isaksson S
21. Manfredi D
22. Rose J
23. Wagenmakers EJ
24. Wu H
(2018) Evaluating the replicability of social science experiments in nature and science between 2010 and 2015
Nature Human Behaviour 2:637–644.

https://doi.org/10.1038/s41562-018-0399-z
- PubMed
- Google Scholar
1. Carro MS
2. Lim WK
3. Alvarez MJ
4. Bollo RJ
5. Zhao X
6. Snyder EY
7. Sulman EP
8. Anne SL
9. Doetsch F
10. Colman H
11. Lasorella A
12. Aldape K
13. Califano A
14. Iavarone A
(2010) The transcriptional network for mesenchymal transformation of brain tumours
Nature 463:318–325.

https://doi.org/10.1038/nature08712
- PubMed
- Google Scholar
1. Casadevall A
2. Fang FC
(2012) Reforming science: methodological and cultural reforms
Infection and Immunity 80:891–896.

https://doi.org/10.1128/IAI.06183-11
- PubMed
- Google Scholar
1. Castellarin M
2. Warren RL
3. Freeman JD
4. Dreolini L
5. Krzywinski M
6. Strauss J
7. Barnes R
8. Watson P
9. Allen-Vercoe E
10. Moore RA
11. Holt RA
(2012) Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma
Genome Research 22:299–306.

https://doi.org/10.1101/gr.126516.111
- PubMed
- Google Scholar
1. Chaffer CL
2. Brueckmann I
3. Scheel C
4. Kaestli AJ
5. Wiggins PA
6. Rodrigues LO
7. Brooks M
8. Reinhardt F
9. Su Y
10. Polyak K
11. Arendt LM
12. Kuperwasser C
13. Bierie B
14. Weinberg RA
(2011) Normal and neoplastic nonstem cells can spontaneously convert to a stem-like state
PNAS 108:7950–7955.

https://doi.org/10.1073/pnas.1102454108
- PubMed
- Google Scholar
(2014) How to increase value and reduce waste when research priorities are set
Lancet 383:156–165.

https://doi.org/10.1016/S0140-6736(13)62229-1
- PubMed
- Google Scholar
1. Chambers C
(2019) What’s next for registered reports
Nature 573:187–189.

https://doi.org/10.1038/d41586-019-02674-6
- PubMed
- Google Scholar
1. Chen J
2. Li Y
3. Yu TS
4. McKay RM
5. Burns DK
6. Kernie SG
7. Parada LF
(2012) A restricted cell population propagates glioblastoma growth after chemotherapy
Nature 488:522–526.

https://doi.org/10.1038/nature11287
- PubMed
- Google Scholar
Preprint
1. Christensen G
2. Wang Z
3. Paluck EL
4. Swanson N
5. Birke DJ
6. Miguel E
7. Littman R
(2019) Open science practices are on the rise: The State of Social Science (3S) Survey
OSF Preprints.

https://doi.org/10.31222/osf.io/5rksu
- Google Scholar
1. Christensen-Szalanski JJJ
2. Willham CF
(1991) The hindsight bias: A meta-analysis
Organizational Behavior and Human Decision Processes 48:147–168.

https://doi.org/10.1016/0749-5978(91)90010-Q
- Google Scholar
(2014) Registered Report: Melanoma genome sequencing reveals frequent PREX2 mutations
eLife 3:e04180.

https://doi.org/10.7554/eLife.04180
- PubMed
- Google Scholar
(2015a) Registered Report: The CD47-signal regulated protein alpha (SIRPa) interaction is a therapeutic target for human solid tumors
eLife 4:e04586.

https://doi.org/10.7554/eLife.04586
- PubMed
- Google Scholar
(2015b) Registered Report: Tumour vascularization via endothelial differentiation of glioblastoma stem-like cells
eLife 4:e04363.

https://doi.org/10.7554/eLife.04363
- PubMed
- Google Scholar
Website
1. Cook FL
(2016) Dear colleague letter: Robust and reliable research in the social, behavioral, and economic sciences
National Science Foundation. Accessed August 5, 2021.

https://www.nsf.gov/pubs/2016/nsf16137/nsf16137.jsp
1. Cova F
2. Strickland B
3. Abatista A
4. Allard A
5. Andow J
6. Attie M
7. Beebe J
8. Berniūnas R
9. Boudesseul J
10. Colombo M
11. Cushman F
12. Diaz R
13. N’Djaye Nikolai van Dongen N
14. Dranseika V
15. Earp BD
16. Torres AG
17. Hannikainen I
18. Hernández-Conde JV
19. Hu W
20. Jaquet F
21. Khalifa K
22. Kim H
23. Kneer M
24. Knobe J
25. Kurthy M
26. Lantian A
27. Liao S
28. Machery E
29. Moerenhout T
30. Mott C
31. Phelan M
32. Phillips J
33. Rambharose N
34. Reuter K
35. Romero F
36. Sousa P
37. Sprenger J
38. Thalabard E
39. Tobia K
40. Viciana H
41. Wilkenfeld D
42. Zhou X
(2018) Estimating the reproducibility of experimental philosophy
Review of Philosophy and Psychology 12:9–44.

https://doi.org/10.1007/s13164-018-0400-9
- Google Scholar
1. Crasta K
2. Ganem NJ
3. Dagher R
4. Lantermann AB
5. Ivanova EV
6. Pan Y
7. Nezi L
8. Protopopov A
9. Chowdhury D
10. Pellman D
(2012) DNA breaks and chromosome pulverization from errors in mitosis
Nature 482:53–58.

https://doi.org/10.1038/nature10802
- PubMed
- Google Scholar
1. Dawson MA
2. Prinjha RK
3. Dittmann A
4. Giotopoulos G
5. Bantscheff M
6. Chan WI
7. Robson SC
8. Chung C
9. Hopf C
10. Savitski MM
11. Huthmacher C
12. Gudgin E
13. Lugo D
14. Beinke S
15. Chapman TD
16. Roberts EJ
17. Soden PE
18. Auger KR
19. Mirguet O
20. Doehner K
21. Delwel R
22. Burnett AK
23. Jeffrey P
24. Drewes G
25. Lee K
26. Huntly BJP
27. Kouzarides T
(2011) Inhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukaemia
Nature 478:529–533.

https://doi.org/10.1038/nature10509
- PubMed
- Google Scholar
1. Delmore JE
2. Issa GC
3. Lemieux ME
4. Rahl PB
5. Shi J
6. Jacobs HM
7. Kastritis E
8. Gilpatrick T
9. Paranal RM
10. Qi J
11. Chesi M
12. Schinzel AC
13. McKeown MR
14. Heffernan TP
15. Vakoc CR
16. Bergsagel PL
17. Ghobrial IM
18. Richardson PG
19. Young RA
20. Hahn WC
21. Anderson KC
22. Kung AL
23. Bradner JE
24. Mitsiades CS
(2011) BET bromodomain inhibition as a therapeutic strategy to target c-Myc
Cell 146:904–917.

https://doi.org/10.1016/j.cell.2011.08.017
- PubMed
- Google Scholar
1. DeNicola GM
2. Karreth FA
3. Humpton TJ
4. Gopinathan A
5. Wei C
6. Frese K
7. Mangal D
8. Yu KH
9. Yeo CJ
10. Calhoun ES
11. Scrimieri F
12. Winter JM
13. Hruban RH
14. Iacobuzio-Donahue C
15. Kern SE
16. Blair IA
17. Tuveson DA
(2011) Oncogene-induced Nrf2 transcription promotes ROS detoxification and tumorigenesis
Nature 475:106–109.

https://doi.org/10.1038/nature10189
- PubMed
- Google Scholar
1. Driessens G
2. Beck B
3. Caauwe A
4. Simons BD
5. Blanpain C
(2012) Defining the mode of tumour growth by clonal analysis
Nature 488:527–530.

https://doi.org/10.1038/nature11344
- PubMed
- Google Scholar
1. Drucker DJ
(2016) Never waste a good crisis: Confronting reproducibility in translational research
Cell Metabolism 24:348–360.

https://doi.org/10.1016/j.cmet.2016.08.006
- PubMed
- Google Scholar
(2015) Registered Report: Intestinal inflammation targets cancer-inducing activity of the microbiota
eLife 4:e04186.

https://doi.org/10.7554/eLife.04186
- PubMed
- Google Scholar
(2018) Replication Study: Intestinal inflammation targets cancer-inducing activity of the microbiota
eLife 7:e34364.

https://doi.org/10.7554/eLife.34364
- PubMed
- Google Scholar
1. Ebersole CR
2. Atherton OE
3. Belanger AL
4. Skulborstad HM
5. Allen JM
6. Banks JB
7. Baranski E
8. Bernstein MJ
9. Bonfiglio DBV
10. Boucher L
11. Brown ER
12. Budiman NI
13. Cairo AH
14. Capaldi CA
15. Chartier CR
16. Chung JM
17. Cicero DC
18. Coleman JA
19. Conway JG
20. Davis WE
21. Devos T
22. Fletcher MM
23. German K
24. Grahe JE
25. Hermann AD
26. Hicks JA
27. Honeycutt N
28. Humphrey B
29. Janus M
30. Johnson DJ
31. Joy-Gaba JA
32. Juzeler H
33. Keres A
34. Kinney D
35. Kirshenbaum J
36. Klein RA
37. Lucas RE
38. Lustgraaf CJN
39. Martin D
40. Menon M
41. Metzger M
42. Moloney JM
43. Morse PJ
44. Prislin R
45. Razza T
46. Re DE
47. Rule NO
48. Sacco DF
49. Sauerberger K
50. Shrider E
51. Shultz M
52. Siemsen C
53. Sobocko K
54. Weylin Sternglanz R
55. Summerville A
56. Tskhay KO
57. van Allen Z
58. Vaughn LA
59. Walker RJ
60. Weinberg A
61. Wilson JP
62. Wirth JH
63. Wortman J
64. Nosek BA
(2016) Many Labs 3: Evaluating participant pool quality across the academic semester via replication
Journal of Experimental Social Psychology 67:68–82.

https://doi.org/10.1016/j.jesp.2015.10.012
- Google Scholar
Preprint
1. Ebersole CR
2. Mathur MB
3. Baranski E
4. Bart-Plange DJ
5. Buttrick N
6. Chartier CR
7. Corker KS
8. Corley M
9. Hartshorne JK
10. IJzerman H
11. Lazarevic LB
12. Rabagliati H
13. Ropovik I
14. Aczel B
15. Aeschbach L
16. Andrighetto L
17. Arnal JD
18. Arrow H
19. Babincak P
20. Nosek BA
(2019) Many Labs 5: Testing pre-data collection peer review as an intervention to increase replicability
PsyArXiv.

https://doi.org/10.31234/osf.io/sxfm2
- Google Scholar
1. Errington TM
2. Iorns E
3. Gunn W
4. Tan FE
5. Lomax J
6. Nosek BA
(2014) An open investigation of the reproducibility of cancer biology research
eLife 3:e04333.

https://doi.org/10.7554/eLife.04333
- PubMed
- Google Scholar
1. Errington TM
2. Denis A
3. Allison AB
4. Araiza R
5. Aza-Blanc P
6. Bower LR
7. Campos J
8. Chu H
9. Denson S
10. Donham C
11. Harr K
12. Haven B
13. Iorns E
14. Kwok J
15. McDonald E
16. Pelech S
17. Perfito N
18. Pike A
19. Sampey D
20. Settles M
21. Scott DA
22. Sharma V
23. Tolentino T
24. Trinh A
25. Tsui R
26. Willis B
27. Wood J
28. Young L
(2021a) Experiments from unfinished Registered Reports in the Reproducibility Project: Cancer Biology
eLife 10:e73430.

https://doi.org/10.7554/eLife.73430
- Google Scholar
1. Errington TM
2. Mathur MB
3. Soderberg CK
4. Denis A
5. Perfito N
6. Iorns E
7. Nosek BA
(2021b) Investigating the replicability of preclinical cancer biology
eLife 10:e71601.

https://doi.org/10.7554/eLife.71601
- Google Scholar
(2019) Replication Study: Wnt activity defines colon cancer stem cells and is regulated by the microenvironment
eLife 8:e45426.

https://doi.org/10.7554/eLife.45426
- PubMed
- Google Scholar
(2015a) Registered report: Wnt activity defines colon cancer stem cells and is regulated by the microenvironment
eLife 4:e07301.

https://doi.org/10.7554/eLife.07301
- PubMed
- Google Scholar
(2015b) Registered Report: Oncometabolite 2-hydroxyglutarate is a competitive inhibitor of α-ketoglutarate-dependent dioxygenases
eLife 4:e07420.

https://doi.org/10.7554/eLife.07420
- PubMed
- Google Scholar
1. Fanelli D
(2010) “Positive” results increase down the hierarchy of the sciences
PLOS ONE 5:e10068.

https://doi.org/10.1371/journal.pone.0010068
- PubMed
- Google Scholar
1. Fanelli D
(2011) Negative results are disappearing from most disciplines and countries
Scientometrics 90:891–904.

https://doi.org/10.1007/s11192-011-0494-7
- Google Scholar
(2016) Registered Report: The common feature of leukemia-associated IDH1 and IDH2 mutations is a neomorphic enzyme activity converting alpha-ketoglutarate to 2-hydroxyglutarate
eLife 5:e12626.

https://doi.org/10.7554/eLife.12626
- PubMed
- Google Scholar
(2015) Registered Report: Biomechanical remodeling of the microenvironment by stromal caveolin-1 favors tumor invasion and metastasis
eLife 4:e04796.

https://doi.org/10.7554/eLife.04796
- PubMed
- Google Scholar
1. Figueroa ME
2. Abdel-Wahab O
3. Lu C
4. Ward PS
5. Patel J
6. Shih A
7. Li Y
8. Bhagwat N
9. Vasanthakumar A
10. Fernandez HF
11. Tallman MS
12. Sun Z
13. Wolniak K
14. Peeters JK
15. Liu W
16. Choe SE
17. Fantin VR
18. Paietta E
19. Löwenberg B
20. Licht JD
21. Godley LA
22. Delwel R
23. Valk PJM
24. Thompson CB
25. Levine RL
26. Melnick A
(2010) Leukemic IDH1 and IDH2 mutations result in a hypermethylation phenotype, disrupt TET2 function, and impair hematopoietic differentiation
Cancer Cell 18:553–567.

https://doi.org/10.1016/j.ccr.2010.11.015
- PubMed
- Google Scholar
1. Fischhoff B
2. Beyth R
(1975) I knew it would happen: Remembered probabilities of once—future things
Organizational Behavior and Human Performance 13:1–16.

https://doi.org/10.1016/0030-5073(75)90002-1
- Google Scholar
(2015) Registered Report: Inhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukemia
eLife 4:e08997.

https://doi.org/10.7554/eLife.08997
- PubMed
- Google Scholar
1. Garnett MJ
2. Edelman EJ
3. Heidorn SJ
4. Greenman CD
5. Dastur A
6. Lau KW
7. Greninger P
8. Thompson IR
9. Luo X
10. Soares J
11. Liu Q
12. Iorio F
13. Surdez D
14. Chen L
15. Milano RJ
16. Bignell GR
17. Tam AT
18. Davies H
19. Stevenson JA
20. Barthorpe S
21. Lutz SR
22. Kogera F
23. Lawrence K
24. McLaren-Douglas A
25. Mitropoulos X
26. Mironenko T
27. Thi H
28. Richardson L
29. Zhou W
30. Jewitt F
31. Zhang T
32. O’Brien P
33. Boisvert JL
34. Price S
35. Hur W
36. Yang W
37. Deng X
38. Butler A
39. Choi HG
40. Chang JW
41. Baselga J
42. Stamenkovic I
43. Engelman JA
44. Sharma SV
45. Delattre O
46. Saez-Rodriguez J
47. Gray NS
48. Settleman J
49. Futreal PA
50. Haber DA
51. Stratton MR
52. Ramaswamy S
53. McDermott U
54. Benes CH
(2012) Systematic identification of genomic markers of drug sensitivity in cancer cells
Nature 483:570–575.

https://doi.org/10.1038/nature11005
- PubMed
- Google Scholar
Website
1. Gelman A
2. Loken E
(2013) The Garden of Forking Paths: Why Multiple Comparisons Can Be a Problem, Even When There Is No “Fishing Expedition” or “p-Hacking” and the Research Hypothesis Was Posited Ahead of Time
Department of Statistics, Columbia University. Accessed August 5, 2021.

http://www.stat.columbia.edu/~gelman/research/unpublished/forking.pdf
1. Giner-Sorolla R
(2012) Science or art? How aesthetic standards grease the way through the publication bottleneck but undermine science
Perspectives on Psychological Science 7:562–571.

https://doi.org/10.1177/1745691612457576
- PubMed
- Google Scholar
1. Glasziou P
2. Altman DG
3. Bossuyt P
4. Boutron I
5. Clarke M
6. Julious S
7. Michie S
8. Moher D
9. Wager E
(2014) Reducing waste from incomplete or unusable reports of biomedical research
Lancet 383:267–276.

https://doi.org/10.1016/S0140-6736(13)62228-X
- PubMed
- Google Scholar
(2011) Biomechanical remodeling of the microenvironment by stromal caveolin-1 favors tumor invasion and metastasis
Cell 146:148–163.

https://doi.org/10.1016/j.cell.2011.05.040
- PubMed
- Google Scholar
(2014) Registered Report: Widespread potential for growth factor-driven resistance to anticancer kinase inhibitors
eLife 3:e04037.

https://doi.org/10.7554/eLife.04037
- PubMed
- Google Scholar
1. Greenwald AG
(1975) Consequences of prejudice against the null hypothesis
Psychological Bulletin 82:1–20.

https://doi.org/10.1037/h0076157
- Google Scholar
(2015) Quality of reporting and adherence to arrive guidelines in animal studies for chagas disease preclinical drug research: A systematic review
PLOS Neglected Tropical Diseases 9:e0004194.

https://doi.org/10.1371/journal.pntd.0004194
- PubMed
- Google Scholar
1. Gupta RA
2. Shah N
3. Wang KC
4. Kim J
5. Horlings HM
6. Wong DJ
7. Tsai MC
8. Hung T
9. Argani P
10. Rinn JL
11. Wang Y
12. Brzoska P
13. Kong B
14. Li R
15. West RB
16. van de Vijver MJ
17. Sukumar S
18. Chang HY
(2010) Long non-coding RNA HOTAIR reprograms chromatin state to promote cancer metastasis
Nature 464:1071–1076.

https://doi.org/10.1038/nature08975
- PubMed
- Google Scholar
1. Han S
2. Olonisakin TF
3. Pribis JP
4. Zupetic J
5. Yoon JH
6. Holleran KM
7. Jeong K
8. Shaikh N
9. Rubio DM
10. Lee JS
(2017) A checklist is associated with increased quality of reporting preclinical biomedical research: A systematic review
PLOS ONE 12:e0183591.

https://doi.org/10.1371/journal.pone.0183591
- PubMed
- Google Scholar
1. Hart W
2. Albarracín D
3. Eagly AH
4. Brechan I
5. Lindberg MJ
6. Merrill L
(2009) Feeling validated versus being correct: A meta-analysis of selective exposure to information
Psychological Bulletin 135:555–588.

https://doi.org/10.1037/a0015701
- PubMed
- Google Scholar
1. Hatzivassiliou G
2. Song K
3. Yen I
4. Brandhuber BJ
5. Anderson DJ
6. Alvarado R
7. Ludlam MJC
8. Stokoe D
9. Gloor SL
10. Vigers G
11. Morales T
12. Aliagas I
13. Liu B
14. Sideris S
15. Hoeflich KP
16. Jaiswal BS
17. Seshagiri S
18. Koeppen H
19. Belvin M
20. Friedman LS
21. Malek S
(2010) RAF inhibitors prime wild-type RAF to activate the MAPK pathway and enhance growth
Nature 464:431–435.

https://doi.org/10.1038/nature08833
- PubMed
- Google Scholar
(2016) Registered Report: A chromatin-mediated reversible drug-tolerant state in cancer cell subpopulations
eLife 5:e09462.

https://doi.org/10.7554/eLife.09462
- PubMed
- Google Scholar
(2010) Kinase-dead BRAF and oncogenic RAS cooperate to drive tumor progression through CRAF
Cell 140:209–221.

https://doi.org/10.1016/j.cell.2009.12.040
- PubMed
- Google Scholar
1. Horrigan SK
2. Reproducibility Project: Cancer Biology
(2017a) Replication Study: The CD47-signal regulatory protein alpha (SIRPa) interaction is a therapeutic target for human solid tumors
eLife 6:e18173.

https://doi.org/10.7554/eLife.18173
- PubMed
- Google Scholar
(2017b) Replication Study: Melanoma genome sequencing reveals frequent PREX2 mutations
eLife 6:e21634.

https://doi.org/10.7554/eLife.21634
- PubMed
- Google Scholar
Data
(authors) (2017) Evaluating Registered Reports: A Naturalistic Comparative Study of Article Impact
OSF.

https://doi.org/10.31219/osf.io/5y8w7
(2015) Registered Report: Interactions between cancer stem cells and their niche govern metastatic colonization
eLife 4:e06938.

https://doi.org/10.7554/eLife.06938
- PubMed
- Google Scholar
1. Ioannidis JPA
(2005) Why most published research findings are false
PLOS Medicine 2:e124.

https://doi.org/10.1371/journal.pmed.0020124
- PubMed
- Google Scholar
(2014) Increasing value and reducing waste in research design, conduct, and analysis
Lancet 383:166–175.

https://doi.org/10.1016/S0140-6736(13)62227-8
- PubMed
- Google Scholar
1. Johannessen CM
2. Boehm JS
3. Kim SY
4. Thomas SR
5. Wardwell L
6. Johnson LA
7. Emery CM
8. Stransky N
9. Cogdill AP
10. Barretina J
11. Caponigro G
12. Hieronymus H
13. Murray RR
14. Salehi-Ashtiani K
15. Hill DE
16. Vidal M
17. Zhao JJ
18. Yang X
19. Alkan O
20. Kim S
21. Harris JL
22. Wilson CJ
23. Myer VE
24. Finan PM
25. Root DE
26. Roberts TM
27. Golub T
28. Flaherty KT
29. Dummer R
30. Weber BL
31. Sellers WR
32. Schlegel R
33. Wargo JA
34. Hahn WC
35. Garraway LA
(2010) COT drives resistance to RAF inhibition through MAP kinase pathway reactivation
Nature 468:968–972.

https://doi.org/10.1038/nature09627
- PubMed
- Google Scholar
(2012) Measuring the prevalence of questionable research practices with incentives for truth telling
Psychological Science 23:524–532.

https://doi.org/10.1177/0956797611430953
- PubMed
- Google Scholar
1. Kan Z
2. Jaiswal BS
3. Stinson J
4. Janakiraman V
5. Bhatt D
6. Stern HM
7. Yue P
8. Haverty PM
9. Bourgon R
10. Zheng J
11. Moorhead M
12. Chaudhuri S
13. Tomsho LP
14. Peters BA
15. Pujara K
16. Cordes S
17. Davis DP
18. Carlton VEH
19. Yuan W
20. Li L
21. Wang W
22. Eigenbrot C
23. Kaminker JS
24. Eberhard DA
25. Waring P
26. Schuster SC
27. Modrusan Z
28. Zhang Z
29. Stokoe D
30. de Sauvage FJ
31. Faham M
32. Seshagiri S
(2010) Diverse somatic mutation patterns and pathway alterations in human cancers
Nature 466:869–873.

https://doi.org/10.1038/nature09208
- PubMed
- Google Scholar
(2015a) Registered Report: Coadministration of a tumor-penetrating peptide enhances the efficacy of cancer drugs
eLife 4:e06959.

https://doi.org/10.7554/eLife.06959
- PubMed
- Google Scholar
(2015b) Registered Report: BET bromodomain inhibition as a therapeutic strategy to target c-Myc
eLife 4:e07072.

https://doi.org/10.7554/eLife.07072
- PubMed
- Google Scholar
(2015c) Registered Report: Discovery and preclinical validation of drug indications using compendia of public gene expression data
eLife 4:e06847.

https://doi.org/10.7554/eLife.06847
- PubMed
- Google Scholar
(2017) Replication Study: Discovery and preclinical validation of drug indications using compendia of public gene expression data
eLife 6:e17044.

https://doi.org/10.7554/eLife.17044
- PubMed
- Google Scholar
1. Kang T-W
2. Yevsa T
3. Woller N
4. Hoenicke L
5. Wuestefeld T
6. Dauch D
7. Hohmeyer A
8. Gereke M
9. Rudalska R
10. Potapova A
11. Iken M
12. Vucur M
13. Weiss S
14. Heikenwalder M
15. Khan S
16. Gil J
17. Bruder D
18. Manns M
19. Schirmacher P
20. Tacke F
21. Ott M
22. Luedde T
23. Longerich T
24. Kubicka S
25. Zender L
(2011) Senescence surveillance of pre-malignant hepatocytes limits liver cancer development
Nature 479:547–551.

https://doi.org/10.1038/nature10599
- PubMed
- Google Scholar
1. Kaplan RM
2. Irvin VL
(2015) Likelihood of null effects of large NHLBI clinical trials has Increased over time
PLOS ONE 10:e0132382.

https://doi.org/10.1371/journal.pone.0132382
- PubMed
- Google Scholar
1. Kerr NL
(1998) HARKing: hypothesizing after the results are known
Personality and Social Psychology Review 2:196–217.

https://doi.org/10.1207/s15327957pspr0203_4
- PubMed
- Google Scholar
(2020) Replication Study: A coding-independent function of gene and pseudogene mRNAs regulates tumour biology
eLife 9:e51019.

https://doi.org/10.7554/eLife.51019
- PubMed
- Google Scholar
(2015) Registered Report: A coding-independent function of gene and pseudogene mRNAs regulates tumour biology
eLife 4:e08245.

https://doi.org/10.7554/eLife.08245
- PubMed
- Google Scholar
1. Kilkenny C
2. Parsons N
3. Kadyszewski E
4. Festing MFW
5. Cuthill IC
6. Fry D
7. Hutton J
8. Altman DG
(2009) Survey of the quality of experimental design, statistical analysis and reporting of research using animals
PLOS ONE 4:e7824.

https://doi.org/10.1371/journal.pone.0007824
- PubMed
- Google Scholar
(2018) Replication study: Melanoma exosomes educate bone marrow progenitor cells toward a pro-metastatic phenotype through MET
eLife 7:e39944.

https://doi.org/10.7554/eLife.39944
- PubMed
- Google Scholar
1. Klein R. A
2. Ratliff KA
3. Vianello M
4. Adams RB
5. Bahník Š
6. Bernstein MJ
7. Bocian K
8. Brandt MJ
9. Brooks B
10. Brumbaugh CC
11. Cemalcilar Z
12. Chandler J
13. Cheong W
14. Davis WE
15. Devos T
16. Eisner M
17. Frankowska N
18. Furrow D
19. Galliani EM
20. Nosek BA
(2014) Investigating variation in replicability: A “Many Labs” replication project
Social Psychology 45:142–152.

https://doi.org/10.1027/1864-9335/a000178
- Google Scholar
1. Klein RA
2. Vianello M
3. Hasselman F
4. Adams BG
5. Adams RB
6. Alper S
7. Aveyard M
8. Axt JR
9. Babalola MT
10. Bahník Š
11. Batra R
12. Berkics M
13. Bernstein MJ
14. Berry DR
15. Bialobrzeska O
16. Binan ED
17. Bocian K
18. Brandt MJ
19. Busching R
20. Rédei AC
21. Cai H
22. Cambier F
23. Cantarero K
24. Carmichael CL
25. Ceric F
26. Chandler J
27. Chang JH
28. Chatard A
29. Chen EE
30. Cheong W
31. Cicero DC
32. Coen S
33. Coleman JA
34. Collisson B
35. Conway MA
36. Corker KS
37. Curran PG
38. Cushman F
39. Dagona ZK
40. Dalgar I
41. Dalla Rosa A
42. Davis WE
43. de Bruijn M
44. De Schutter L
45. Devos T
46. de Vries M
47. Doğulu C
48. Dozo N
49. Dukes KN
50. Dunham Y
51. Durrheim K
52. Ebersole CR
53. Edlund JE
54. Eller A
55. English AS
56. Finck C
57. Frankowska N
58. Freyre MÁ
59. Friedman M
60. Galliani EM
61. Gandi JC
62. Ghoshal T
63. Giessner SR
64. Gill T
65. Gnambs T
66. Gómez Á
67. González R
68. Graham J
69. Grahe JE
70. Grahek I
71. Green EGT
72. Hai K
73. Haigh M
74. Haines EL
75. Hall MP
76. Heffernan ME
77. Hicks JA
78. Houdek P
79. Huntsinger JR
80. Huynh HP
81. IJzerman H
82. Inbar Y
83. Innes-Ker ÅH
84. Jiménez-Leal W
85. John MS
86. Joy-Gaba JA
87. Kamiloğlu RG
88. Kappes HB
89. Karabati S
90. Karick H
91. Keller VN
92. Kende A
93. Kervyn N
94. Knežević G
95. Kovacs C
96. Krueger LE
97. Kurapov G
98. Kurtz J
99. Lakens D
100. Lazarević LB
101. Levitan CA
102. Lewis NA
103. Lins S
104. Lipsey NP
105. Losee JE
106. Maassen E
107. Maitner AT
108. Malingumu W
109. Mallett RK
110. Marotta SA
111. Međedović J
112. Mena-Pacheco F
113. Milfont TL
114. Morris WL
115. Murphy SC
116. Myachykov A
117. Neave N
118. Neijenhuijs K
119. Nelson AJ
120. Neto F
121. Lee Nichols A
122. Ocampo A
123. O’Donnell SL
124. Oikawa H
125. Oikawa M
126. Ong E
127. Orosz G
128. Osowiecka M
129. Packard G
130. Pérez-Sánchez R
131. Petrović B
132. Pilati R
133. Pinter B
134. Podesta L
135. Pogge G
136. Pollmann MMH
137. Rutchick AM
138. Saavedra P
139. Saeri AK
140. Salomon E
141. Schmidt K
142. Schönbrodt FD
143. Sekerdej MB
144. Sirlopú D
145. Skorinko JLM
146. Smith MA
147. Smith-Castro V
148. Smolders K
149. Sobkow A
150. Sowden W
151. Spachtholz P
152. Srivastava M
153. Steiner TG
154. Stouten J
155. Street CNH
156. Sundfelt OK
157. Szeto S
158. Szumowska E
159. Tang ACW
160. Tanzer N
161. Tear MJ
162. Theriault J
163. Thomae M
164. Torres D
165. Traczyk J
166. Tybur JM
167. Ujhelyi A
168. van Aert RCM
169. van Assen M
170. van der Hulst M
171. van Lange PAM
172. van ’t Veer AE
173. Vásquez- Echeverría A
174. Ann Vaughn L
175. Vázquez A
176. Vega LD
177. Verniers C
178. Verschoor M
179. Voermans IPJ
180. Vranka MA
181. Welch C
182. Wichman AL
183. Williams LA
184. Wood M
185. Woodzicka JA
186. Wronska MK
187. Young L
188. Zelenski JM
189. Zhijia Z
190. Nosek BA
(2018) Many Labs 2: Investigating variation in replicability across samples and settings
Advances in Methods and Practices in Psychological Science 1:443–490.

https://doi.org/10.1177/2515245918810225
- Google Scholar
1. Ko M
2. Huang Y
3. Jankowska AM
4. Pape UJ
5. Tahiliani M
6. Bandukwala HS
7. An J
8. Lamperti ED
9. Koh KP
10. Ganetzky R
11. Liu XS
12. Aravind L
13. Agarwal S
14. Maciejewski JP
15. Rao A
(2010) Impaired hydroxylation of 5-methylcytosine in myeloid cancers with mutant TET2
Nature 468:839–843.

https://doi.org/10.1038/nature09586
- PubMed
- Google Scholar
1. Kunda Z
(1990) The case for motivated reasoning
Psychological Bulletin 108:480–498.

https://doi.org/10.1037/0033-2909.108.3.480
- PubMed
- Google Scholar
1. Landis SC
2. Amara SG
3. Asadullah K
4. Austin CP
5. Blumenstein R
6. Bradley EW
7. Crystal RG
8. Darnell RB
9. Ferrante RJ
10. Fillit H
11. Finkelstein R
12. Fisher M
13. Gendelman HE
14. Golub RM
15. Goudreau JL
16. Gross RA
17. Gubitz AK
18. Hesterlee SE
19. Howells DW
20. Huguenard J
21. Kelner K
22. Koroshetz W
23. Krainc D
24. Lazic SE
25. Levine MS
26. Macleod MR
27. McCall JM
28. Narasimhan K
29. Noble LJ
30. Perrin S
31. Porter JD
32. Steward O
33. Unger E
34. Utz U
35. Silberberg SD
(2012) A call for transparent reporting to optimize the predictive value of preclinical research
Nature 490:187–191.

https://doi.org/10.1038/nature11556
- PubMed
- Google Scholar
1. Lee MJ
2. Ye AS
3. Gardino AK
4. Heijink AM
5. Sorger PK
6. MacBeath G
7. Yaffe MB
(2012) Sequential application of anticancer drugs enhances cell death by rewiring apoptotic signaling networks
Cell 149:780–794.

https://doi.org/10.1016/j.cell.2012.03.031
- PubMed
- Google Scholar
(2016) Registered Report: Melanoma exosomes educate bone marrow progenitor cells toward a pro-metastatic phenotype through MET
eLife 5:e07383.

https://doi.org/10.7554/eLife.07383
- PubMed
- Google Scholar
(2018) Replication Study: Transcriptional amplification in tumor cells with elevated c-Myc
eLife 7:e30274.

https://doi.org/10.7554/eLife.30274
- PubMed
- Google Scholar
(2015) Registered Report: The microRNA miR-34a inhibits prostate cancer stem cells and metastasis by directly repressing CD44
eLife 4:e06434.

https://doi.org/10.7554/eLife.06434
- PubMed
- Google Scholar
1. Lin CY
2. Lovén J
3. Rahl PB
4. Paranal RM
5. Burge CB
6. Bradner JE
7. Lee TI
8. Young RA
(2012) Transcriptional amplification in tumor cells with elevated c-Myc
Cell 151:56–67.

https://doi.org/10.1016/j.cell.2012.08.026
- PubMed
- Google Scholar
1. Liu C
2. Kelnar K
3. Liu B
4. Chen X
5. Calhoun-Davis T
6. Li H
7. Patrawala L
8. Yan H
9. Jeter C
10. Honorio S
11. Wiggins JF
12. Bader AG
13. Fagin R
14. Brown D
15. Tang DG
(2011) The microRNA miR-34a inhibits prostate cancer stem cells and metastasis by directly repressing CD44
Nature Medicine 17:211–215.

https://doi.org/10.1038/nm.2284
- PubMed
- Google Scholar
1. Lloyd K
2. Franklin C
3. Lutz C
4. Magnuson T
(2015) Reproducibility: Use mouse biobanks or lose them
Nature 522:151–153.

https://doi.org/10.1038/522151a
- PubMed
- Google Scholar
1. Lu C
2. Ward PS
3. Kapoor GS
4. Rohle D
5. Turcan S
6. Abdel-Wahab O
7. Edwards CR
8. Khanin R
9. Figueroa ME
10. Melnick A
11. Wellen KE
12. O’Rourke DM
13. Berger SL
14. Chan TA
15. Levine RL
16. Mellinghoff IK
17. Thompson CB
(2012) IDH mutation impairs histone demethylation and results in a block to cell differentiation
Nature 483:474–478.

https://doi.org/10.1038/nature10860
- PubMed
- Google Scholar
1. Lu SF
2. Jin GZ
3. Uzzi B
4. Jones B
(2013) The retraction penalty: Evidence from the Web of Science
Scientific Reports 3:3146.

https://doi.org/10.1038/srep03146
- PubMed
- Google Scholar
(2014) Biomedical research: Increasing value, reducing waste
Lancet 383:101–104.

https://doi.org/10.1016/S0140-6736(13)62329-6
- PubMed
- Google Scholar
Preprint
1. Macleod MR
(2017) Findings of a Retrospective, Controlled Cohort Study of the Impact of a Change in Nature Journals’ Editorial Policy for Life Sciences Research on the Completeness of Reporting Study Design and Execution
bioRxiv.

https://doi.org/10.1101/187245
- Google Scholar
1. Macleod M
2. Collings AM
3. Graf C
4. Kiermer V
5. Mellor D
6. Swaminathan S
7. Sweet D
8. Vinson V
(2021) The MDAR (Materials Design Analysis Reporting) Framework for transparent reporting in the life sciences
PNAS 118:e2103238118.

https://doi.org/10.1073/pnas.2103238118
- PubMed
- Google Scholar
1. Madlock-Brown CR
2. Eichmann D
(2015) The (lack of) impact of retraction on citation networks
Science and Engineering Ethics 21:127–137.

https://doi.org/10.1007/s11948-014-9532-1
- PubMed
- Google Scholar
1. Mahoney MJ
(1977) Publication prejudices: An experimental study of confirmatory bias in the peer review system
Cognitive Therapy and Research 1:161–175.

https://doi.org/10.1007/BF01173636
- Google Scholar
(2012) Replications in psychology research: How often do they really occur?
Perspectives on Psychological Science 7:537–542.

https://doi.org/10.1177/1745691612460688
- PubMed
- Google Scholar
1. Makel MC
2. Plucker JA
(2014) Facts are more important than novelty: Replication in the education sciences
Educational Researcher 43:304–316.

https://doi.org/10.3102/0013189X14545513
- Google Scholar
(2011) Interactions between cancer stem cells and their niche govern metastatic colonization
Nature 481:85–89.

https://doi.org/10.1038/nature10694
- PubMed
- Google Scholar
(2017) Replication Study: Coadministration of a tumor-penetrating peptide enhances the efficacy of cancer drugs
eLife 6:e17584.

https://doi.org/10.7554/eLife.17584
- PubMed
- Google Scholar
1. Marcus E
(2016) A STAR Is born
Cell 166:1059–1060.

https://doi.org/10.1016/j.cell.2016.08.021
- PubMed
- Google Scholar
1. Martin B
(1992) Scientific fraud and the power structure of science
Prometheus 10:83–98.

https://doi.org/10.1080/08109029208629515
- Google Scholar
1. McKiernan EC
2. Bourne PE
3. Brown CT
4. Buck S
5. Kenall A
6. Lin J
7. McDougall D
8. Nosek BA
9. Ram K
10. Soderberg CK
11. Spies JR
12. Thaney K
13. Updegrove A
14. Woo KH
15. Yarkoni T
(2016) How open science helps researchers succeed
eLife 5:e16800.

https://doi.org/10.7554/eLife.16800
- PubMed
- Google Scholar
1. McNutt M
2. Lehnert K
3. Hanson B
4. Nosek BA
5. Ellison AM
6. King JL
(2016) Liberating field science samples and data
Science 351:1024–1026.

https://doi.org/10.1126/science.aad7048
- PubMed
- Google Scholar
1. Metallo CM
2. Gameiro PA
3. Bell EL
4. Mattaini KR
5. Yang J
6. Hiller K
7. Jewell CM
8. Johnson ZR
9. Irvine DJ
10. Guarente L
11. Kelleher JK
12. Vander Heiden MG
13. Iliopoulos O
14. Stephanopoulos G
(2011) Reductive glutamine metabolism by IDH1 mediates lipogenesis under hypoxia
Nature 481:380–384.

https://doi.org/10.1038/nature10602
- PubMed
- Google Scholar
1. Moher D
2. Simera I
3. Schulz KF
4. Hoey J
5. Altman DG
(2008) Helping editors, peer reviewers and authors improve the clarity, completeness and transparency of reporting health research
BMC Medicine 6:13.

https://doi.org/10.1186/1741-7015-6-13
- PubMed
- Google Scholar
1. Molloy JC
(2011) The Open Knowledge Foundation: open data means better science
PLOS Biology 9:e1001195.

https://doi.org/10.1371/journal.pbio.1001195
- PubMed
- Google Scholar
1. Morin RD
2. Johnson NA
3. Severson TM
4. Mungall AJ
5. An J
6. Goya R
7. Paul JE
8. Boyle M
9. Woolcock BW
10. Kuchenbauer F
11. Yap D
12. Humphries RK
13. Griffith OL
14. Shah S
15. Zhu H
16. Kimbara M
17. Shashkin P
18. Charlot JF
19. Tcherpakov M
20. Corbett R
21. Tam A
22. Varhol R
23. Smailus D
24. Moksa M
25. Zhao Y
26. Delaney A
27. Qian H
28. Birol I
29. Schein J
30. Moore R
31. Holt R
32. Horsman DE
33. Connors JM
34. Jones S
35. Aparicio S
36. Hirst M
37. Gascoyne RD
38. Marra MA
(2010) Somatic mutations altering EZH2 (Tyr641) in follicular and diffuse large B-cell lymphomas of germinal-center origin
Nature Genetics 42:181–185.

https://doi.org/10.1038/ng.518
- PubMed
- Google Scholar
Website
(2010) Panton Principles, Principles for Open Data in Science
Accessed August 5, 2021.

http://pantonprinciples.org
Book
1. NAS
(2019) Reproducibility and Replicability in Science
Washington, D.C: The National Academies Press.

https://doi.org/10.17226/25303
- Google Scholar
1. Nature
(2013) Announcement: Reducing our irreproducibility
Nature 496:398.

https://doi.org/10.1038/496398a
- Google Scholar
1. Nazarian R
2. Shi H
3. Wang Q
4. Kong X
5. Koya RC
6. Lee H
7. Chen Z
8. Lee M-K
9. Attar N
10. Sazegar H
11. Chodon T
12. Nelson SF
13. McArthur G
14. Sosman JA
15. Ribas A
16. Lo RS
(2010) Melanomas acquire resistance to B-RAF(V600E) inhibition by RTK or N-RAS upregulation
Nature 468:973–977.

https://doi.org/10.1038/nature09626
- PubMed
- Google Scholar
1. Nickerson RS
(1998) Confirmation bias: A ubiquitous phenomenon in many guises
Review of General Psychology 2:175–220.

https://doi.org/10.1037/1089-2680.2.2.175
- Google Scholar
(2012) Scientific Utopia: II. Restructuring incentives and practices to promote truth over publishability
Perspectives on Psychological Science 7:615–631.

https://doi.org/10.1177/1745691612459058
- PubMed
- Google Scholar
1. Nosek B. A
2. Lakens D
(2014) Registered Reports: A method to increase the credibility of published results
Social Psychology 45:137–141.

https://doi.org/10.1027/1864-9335/a000192
- Google Scholar
1. Nosek BA
2. Alter G
3. Banks GC
4. Borsboom D
5. Bowman SD
6. Breckler SJ
7. Buck S
8. Chambers CD
9. Chin G
10. Christensen G
11. Contestabile M
12. Dafoe A
13. Eich E
14. Freese J
15. Glennerster R
16. Goroff D
17. Green DP
18. Hesse B
19. Humphreys M
20. Ishiyama J
21. Karlan D
22. Kraut A
23. Lupia A
24. Mabry P
25. Madon T
26. Malhotra N
27. Mayo-Wilson E
28. McNutt M
29. Miguel E
30. Paluck EL
31. Simonsohn U
32. Soderberg C
33. Spellman BA
34. Turitto J
35. VandenBos G
36. Vazire S
37. Wagenmakers EJ
38. Wilson R
39. Yarkoni T
(2015) Promoting an open research culture
Science 348:1422–1425.

https://doi.org/10.1126/science.aab2374
- Google Scholar
(2018) The preregistration revolution
PNAS 115:2600–2606.

https://doi.org/10.1073/pnas.1708274114
- PubMed
- Google Scholar
Website
1. Nosek BA
2. Lindsay DS
(2018) Preregistration Becoming the Norm in Psychological Science
Accessed August 5, 2021.

https://www.psychologicalscience.org/observer/preregistration-becoming-the-norm-in-psychological-science
1. Nosek BA
2. Beck ED
3. Campbell L
4. Flake JK
5. Hardwicke TE
6. Mellor DT
7. van ’t Veer AE
8. Vazire S
(2019) Preregistration Is hard, and worthwhile
Trends in Cognitive Sciences 23:815–818.

https://doi.org/10.1016/j.tics.2019.07.009
- PubMed
- Google Scholar
1. Nosek BA
2. Errington TM
(2020a) The best time to argue about what a replication means? Before you do it
Nature 583:518–520.

https://doi.org/10.1038/d41586-020-02142-6
- PubMed
- Google Scholar
1. Nosek BA
2. Errington TM
(2020b) What is replication?
PLOS Biology 18:e3000691.

https://doi.org/10.1371/journal.pbio.3000691
- PubMed
- Google Scholar
1. Open Science Collaboration
(2015) Psychology Estimating the reproducibility of psychological science
Science 349:aac4716.

https://doi.org/10.1126/science.aac4716
- PubMed
- Google Scholar
1. Opitz CA
2. Litzenburger UM
3. Sahm F
4. Ott M
5. Tritschler I
6. Trump S
7. Schumacher T
8. Jestaedt L
9. Schrenk D
10. Weller M
11. Jugold M
12. Guillemin GJ
13. Miller CL
14. Lutz C
15. Radlwimmer B
16. Lehmann I
17. von Deimling A
18. Wick W
19. Platten M
(2011) An endogenous tumour-promoting ligand of the human aryl hydrocarbon receptor
Nature 478:197–203.

https://doi.org/10.1038/nature10491
- PubMed
- Google Scholar
1. Peinado H
2. Alečković M
3. Lavotshkin S
4. Matei I
5. Costa-Silva B
6. Moreno-Bueno G
7. Hergueta-Redondo M
8. Williams C
9. García-Santos G
10. Ghajar C
11. Nitadori-Hoshino A
12. Hoffman C
13. Badal K
14. Garcia BA
15. Callahan MK
16. Yuan J
17. Martins VR
18. Skog J
19. Kaplan RN
20. Brady MS
21. Wolchok JD
22. Chapman PB
23. Kang Y
24. Bromberg J
25. Lyden D
(2012) Melanoma exosomes educate bone marrow progenitor cells toward a pro-metastatic phenotype through MET
Nature Medicine 18:883–891.

https://doi.org/10.1038/nm.2753
- PubMed
- Google Scholar
Preprint
1. Pelech S
2. Gallagher C
3. Sutter C
4. Yue L
5. Kerwin J
6. Bhargava A
7. Iorns E
8. Tsui R
9. Denis A
10. Perfito N
11. Errington TM
(2021) Replication Study: RAF Inhibitors Prime Wild-Type RAF to Activate the MAPK Pathway and Enhance Growth
bioRxiv.

https://doi.org/10.1101/2021.11.30.470372
- Google Scholar
1. Perrin S
(2014) Preclinical research: Make mouse studies work
Nature 507:423–425.

https://doi.org/10.1038/507423a
- PubMed
- Google Scholar
1. Pfeifer MP
2. Snodgrass GL
(1990) The continued use of retracted, invalid scientific literature
JAMA 263:1420–1423.

https://doi.org/10.1001/jama.1990.03440100140020
- PubMed
- Google Scholar
(2016) Registered Report: Coding-independent regulation of the tumor suppressor PTEN by competing endogenous mRNAs
eLife 5:e12470.

https://doi.org/10.7554/eLife.12470
- PubMed
- Google Scholar
(2011) A microRNA regulon that mediates endothelial recruitment and metastasis by cancer cells
Nature 481:190–194.

https://doi.org/10.1038/nature10661
- PubMed
- Google Scholar
1. Poliseno L
2. Salmena L
3. Zhang J
4. Carver B
5. Haveman WJ
6. Pandolfi PP
(2010) A coding-independent function of gene and pseudogene mRNAs regulates tumour biology
Nature 465:1033–1038.

https://doi.org/10.1038/nature09144
- PubMed
- Google Scholar
1. Possemato R
2. Marks KM
3. Shaul YD
4. Pacold ME
5. Kim D
6. Birsoy K
7. Sethumadhavan S
8. Woo H-K
9. Jang HG
10. Jha AK
11. Chen WW
12. Barrett FG
13. Stransky N
14. Tsun Z-Y
15. Cowley GS
16. Barretina J
17. Kalaany NY
18. Hsu PP
19. Ottina K
20. Chan AM
21. Yuan B
22. Garraway LA
23. Root DE
24. Mino-Kenudson M
25. Brachtel EF
26. Driggers EM
27. Sabatini DM
(2011) Functional genomics reveal that the serine synthesis pathway is essential in breast cancer
Nature 476:346–350.

https://doi.org/10.1038/nature10350
- PubMed
- Google Scholar
1. Poulikakos PI
2. Zhang C
3. Bollag G
4. Shokat KM
5. Rosen N
(2010) RAF inhibitors transactivate RAF dimers and ERK signalling in cells with wild-type BRAF
Nature 464:427–430.

https://doi.org/10.1038/nature08902
- PubMed
- Google Scholar
(2012) Unresponsiveness of colon cancer to BRAF(V600E) inhibition through feedback activation of EGFR
Nature 483:100–103.

https://doi.org/10.1038/nature10868
- PubMed
- Google Scholar
(2018) Replication in criminology and the social sciences
Annual Review of Criminology 1:19–38.

https://doi.org/10.1146/annurev-criminol-032317-091849
- Google Scholar
(2011) Believe it or not: How much can we rely on published data on potential drug targets
Nature Reviews Drug Discovery 10:712.

https://doi.org/10.1038/nrd3439-c1
- PubMed
- Google Scholar
Book
1. Putnam H
(1975)
Philosophical Papers: Mathematics, Matter, and Method

Cambridge: Cambridge University Press.
- Google Scholar
1. Qian B-Z
2. Li J
3. Zhang H
4. Kitamura T
5. Zhang J
6. Campion LR
7. Kaiser EA
8. Snyder LA
9. Pollard JW
(2011) CCL2 recruits inflammatory monocytes to facilitate breast-tumour metastasis
Nature 475:222–225.

https://doi.org/10.1038/nature10138
- PubMed
- Google Scholar
Software
1. R Development Core Team
(2021) R: A Language and Environment for Statistical Computing
R Foundation for Statistical Computing, Vienna, Austria.

https://www.R-project.org/
1. Raj L
2. Ide T
3. Gurkar AU
4. Foley M
5. Schenone M
6. Li X
7. Tolliday NJ
8. Golub TR
9. Carr SA
10. Shamji AF
11. Stern AM
12. Mandinova A
13. Schreiber SL
14. Lee SW
(2011) Selective killing of cancer cells by a small molecule targeting the stress response to ROS
Nature 475:231–234.

https://doi.org/10.1038/nature10167
- PubMed
- Google Scholar
1. Ramirez FD
2. Motazedian P
3. Jung RG
4. Di Santo P
5. MacDonald ZD
6. Moreland R
7. Simard T
8. Clancy AA
9. Russo JJ
10. Welch VA
11. Wells GA
12. Hibbert B
(2017) Methodological rigor in preclinical cardiovascular studies
Circulation Research 120:1916–1926.

https://doi.org/10.1161/CIRCRESAHA.117.310628
- Google Scholar
(2015) Registered Report: Senescence surveillance of pre-malignant hepatocytes limits liver cancer development
eLife 4:e04105.

https://doi.org/10.7554/eLife.04105
- PubMed
- Google Scholar
(2020) A controlled trial for reproducibility
Nature 579:190–192.

https://doi.org/10.1038/d41586-020-00672-7
- PubMed
- Google Scholar
(2016) Registered Report: Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma
eLife 5:e10012.

https://doi.org/10.7554/eLife.10012
- PubMed
- Google Scholar
1. Repass J
2. Reproducibility Project: Cancer Biology
(2018) Replication Study: Fusobacterium nucleatum infection is prevalent in human colorectal carcinoma
eLife 7:e25801.

https://doi.org/10.7554/eLife.25801
- PubMed
- Google Scholar
1. Ricci-Vitiani L
2. Pallini R
3. Biffoni M
4. Todaro M
5. Invernici G
6. Cenci T
7. Maira G
8. Parati EA
9. Stassi G
10. Larocca LM
11. De Maria R
(2010) Tumour vascularization via endothelial differentiation of glioblastoma stem-like cells
Nature 468:824–828.

https://doi.org/10.1038/nature09557
- PubMed
- Google Scholar
(2016) Registered Report: IDH mutation impairs histone demethylation and results in a block to cell differentiation
eLife 5:e10860.

https://doi.org/10.7554/eLife.10860
- PubMed
- Google Scholar
1. Rosenthal R
(1979) The file drawer problem and tolerance for null results
Psychological Bulletin 86:638–641.

https://doi.org/10.1037/0033-2909.86.3.638
- Google Scholar
(2020) An excess of positive results: Comparing the standard psychology literature with Registered Reports
Advances in Methods and Practices in Psychological Science 4:251524592110074.

https://doi.org/10.1177/25152459211007467
- Google Scholar
(2012) Lineage tracing reveals Lgr5+ stem cell activity in mouse intestinal adenomas
Science 337:730–735.

https://doi.org/10.1126/science.1224676
- PubMed
- Google Scholar
1. Schmidt S
(2009) Shall we really do it again? The powerful concept of replication is neglected in the social sciences
Review of General Psychology 13:90–100.

https://doi.org/10.1037/a0015108
- Google Scholar
(2021) Assessment of transparency indicators across the biomedical literature: How open is open
PLOS Biology 19:e3001107.

https://doi.org/10.1371/journal.pbio.3001107
- PubMed
- Google Scholar
(2017) Replication Study: Inhibition of BET recruitment to chromatin as an effective treatment for MLL-fusion leukaemia
eLife 6:e25306.

https://doi.org/10.7554/eLife.25306
- PubMed
- Google Scholar
1. Sharma SV
2. Lee DY
3. Li B
4. Quinlan MP
5. Takahashi F
6. Maheswaran S
7. McDermott U
8. Azizian N
9. Zou L
10. Fischbach MA
11. Wong K-K
12. Brandstetter K
13. Wittner B
14. Ramaswamy S
15. Classon M
16. Settleman J
(2010) A chromatin-mediated reversible drug-tolerant state in cancer cell subpopulations
Cell 141:69–80.

https://doi.org/10.1016/j.cell.2010.02.027
- PubMed
- Google Scholar
(2016a) Registered Report: Diverse somatic mutation patterns and pathway alterations in human cancers
eLife 5:e11566.

https://doi.org/10.7554/eLife.11566
- PubMed
- Google Scholar
(2016b) Registered Report: COT drives resistance to RAF inhibition through MAP kinase pathway reactivation
eLife 5:e11414.

https://doi.org/10.7554/eLife.11414
- PubMed
- Google Scholar
(2019) Replication Study: Biomechanical remodeling of the microenvironment by stromal caveolin-1 favors tumor invasion and metastasis
eLife 8:e45120.

https://doi.org/10.7554/eLife.45120
- PubMed
- Google Scholar
(2017) Replication Study: The common feature of leukemia-associated IDH1 and IDH2 mutations is a neomorphic enzyme activity converting alpha-ketoglutarate to 2-hydroxyglutarate
eLife 6:e26030.

https://doi.org/10.7554/eLife.26030
- PubMed
- Google Scholar
(2011) False-positive psychology: undisclosed flexibility in data collection and analysis allows presenting anything as significant
Psychological Science 22:1359–1366.

https://doi.org/10.1177/0956797611417632
- PubMed
- Google Scholar
1. Sirota M
2. Dudley JT
3. Kim J
4. Chiang AP
5. Morgan AA
6. Sweet-Cordero A
7. Sage J
8. Butte AJ
(2011) Discovery and preclinical validation of drug indications using compendia of public gene expression data
Science Translational Medicine 3:96ra77.

https://doi.org/10.1126/scitranslmed.3001318
- PubMed
- Google Scholar
1. Smaldino PE
2. McElreath R
(2016) The natural selection of bad science
Royal Society Open Science 3:160384.

https://doi.org/10.1098/rsos.160384
- PubMed
- Google Scholar
(2021) Initial evidence of research quality of registered reports compared with the standard publishing model
Nature Human Behaviour 5:990–997.

https://doi.org/10.1038/s41562-021-01142-4
- PubMed
- Google Scholar
1. Sovacool BK
(2008) Exploring scientific misconduct: Isolated individuals, impure institutions, or an inevitable idiom of modern science
Journal of Bioethical Inquiry 5:271–282.

https://doi.org/10.1007/s11673-008-9113-6
- Google Scholar
1. Sterling TD
(1959) Publication decisions and their possible effects on inferences drawn from tests of significance—or vice versa
Journal of the American Statistical Association 54:30–34.

https://doi.org/10.1080/01621459.1959.10501497
- Google Scholar
(1995) Publication Decisions Revisited: The Effect of the Outcome of Statistical Tests on the Decision to Publish and Vice Versa
The American Statistician 49:108–112.

https://doi.org/10.1080/00031305.1995.10476125
- Google Scholar
(2012) Replication and reproducibility in spinal cord injury research
Experimental Neurology 233:597–605.

https://doi.org/10.1016/j.expneurol.2011.06.017
- PubMed
- Google Scholar
1. Stodden V
2. Guo P
3. Ma Z
(2013) Toward reproducible computational research: An empirical analysis of data and code policy adoption by journals
PLOS ONE 8:e67111.

https://doi.org/10.1371/journal.pone.0067111
- PubMed
- Google Scholar
1. Straussman R
2. Morikawa T
3. Shee K
4. Barzily-Rokni M
5. Qian ZR
6. Du J
7. Davis A
8. Mongare MM
9. Gould J
10. Frederick DT
11. Cooper ZA
12. Chapman PB
13. Solit DB
14. Ribas A
15. Lo RS
16. Flaherty KT
17. Ogino S
18. Wargo JA
19. Golub TR
(2012) Tumour micro-environment elicits innate resistance to RAF inhibitors through HGF secretion
Nature 487:500–504.

https://doi.org/10.1038/nature11183
- PubMed
- Google Scholar
(2010) Coadministration of a tumor-penetrating peptide enhances the efficacy of cancer drugs
Science 328:1031–1035.

https://doi.org/10.1126/science.1183057
- PubMed
- Google Scholar
1. Sumazin P
2. Yang X
3. Chiu H-S
4. Chung W-J
5. Iyer A
6. Llobet-Navas D
7. Rajbhandari P
8. Bansal M
9. Guarnieri P
10. Silva J
11. Califano A
(2011) An extensive microRNA-mediated network of RNA-RNA interactions regulates established oncogenic pathways in glioblastoma
Cell 147:370–381.

https://doi.org/10.1016/j.cell.2011.09.041
- PubMed
- Google Scholar
1. Tay Y
2. Kats L
3. Salmena L
4. Weiss D
5. Tan SM
6. Ala U
7. Karreth F
8. Poliseno L
9. Provero P
10. Di Cunto F
11. Lieberman J
12. Rigoutsos I
13. Pandolfi PP
(2011) Coding-independent regulation of the tumor suppressor PTEN by competing endogenous mRNAs
Cell 147:344–357.

https://doi.org/10.1016/j.cell.2011.09.029
- PubMed
- Google Scholar
1. Valentine JC
2. Biglan A
3. Boruch RF
4. Castro FG
5. Collins LM
6. Flay BR
7. Kellam S
8. Mościcki EK
9. Schinke SP
(2011) Replication in prevention science
Prevention Science 12:103–117.

https://doi.org/10.1007/s11121-011-0217-6
- PubMed
- Google Scholar
(2020) Publication rate in preclinical research: a plea for preregistration
BMJ Open Science 4:e100051.

https://doi.org/10.1136/bmjos-2019-100051
- Google Scholar
(2016) Registered Report: Systematic identification of genomic markers of drug sensitivity in cancer cells
eLife 5:e13620.

https://doi.org/10.7554/eLife.13620
- PubMed
- Google Scholar
(2018) Replication Study: Systematic identification of genomic markers of drug sensitivity in cancer cells
eLife 7:e29747.

https://doi.org/10.7554/eLife.29747
- PubMed
- Google Scholar
1. Vermeulen L
2. De Sousa E Melo F
3. van der Heijden M
4. Cameron K
5. de Jong JH
6. Borovski T
7. Tuynman JB
8. Todaro M
9. Merz C
10. Rodermond H
11. Sprick MR
12. Kemper K
13. Richel DJ
14. Stassi G
15. Medema JP
(2010) Wnt activity defines colon cancer stem cells and is regulated by the microenvironment
Nature Cell Biology 12:468–476.

https://doi.org/10.1038/ncb2048
- PubMed
- Google Scholar
(2012) An agenda for purely confirmatory research
Perspectives on Psychological Science 7:632–638.

https://doi.org/10.1177/1745691612463078
- PubMed
- Google Scholar
(2020) Replication Study: Coding-independent regulation of the tumor suppressor PTEN by competing endogenous mRNAs
eLife 9:e56651.

https://doi.org/10.7554/eLife.56651
- PubMed
- Google Scholar
1. Ward PS
2. Patel J
3. Wise DR
4. Abdel-Wahab O
5. Bennett BD
6. Coller HA
7. Cross JR
8. Fantin VR
9. Hedvat CV
10. Perl AE
11. Rabinowitz JD
12. Carroll M
13. Su SM
14. Sharp KA
15. Levine RL
16. Thompson CB
(2010) The common feature of leukemia-associated IDH1 and IDH2 mutations is a neomorphic enzyme activity converting alpha-ketoglutarate to 2-hydroxyglutarate
Cancer Cell 17:225–234.

https://doi.org/10.1016/j.ccr.2010.01.020
- PubMed
- Google Scholar
1. Wilkinson MD
2. Dumontier M
3. Aalbersberg IJJ
4. Appleton G
5. Axton M
6. Baak A
7. Blomberg N
8. Boiten J-W
9. da Silva Santos LB
10. Bourne PE
11. Bouwman J
12. Brookes AJ
13. Clark T
14. Crosas M
15. Dillo I
16. Dumon O
17. Edmunds S
18. Evelo CT
19. Finkers R
20. Gonzalez-Beltran A
21. Gray AJG
22. Groth P
23. Goble C
24. Grethe JS
25. Heringa J
26. ’t Hoen PAC
27. Hooft R
28. Kuhn T
29. Kok R
30. Kok J
31. Lusher SJ
32. Martone ME
33. Mons A
34. Packer AL
35. Persson B
36. Rocca-Serra P
37. Roos M
38. van Schaik R
39. Sansone S-A
40. Schultes E
41. Sengstag T
42. Slater T
43. Strawn G
44. Swertz MA
45. Thompson M
46. van der Lei J
47. van Mulligen E
48. Velterop J
49. Waagmeester A
50. Wittenburg P
51. Wolstencroft K
52. Zhao J
53. Mons B
(2016) The FAIR Guiding Principles for scientific data management and stewardship
Scientific Data 3:160018.

https://doi.org/10.1038/sdata.2016.18
- PubMed
- Google Scholar
1. Willingham SB
2. Volkmer JP
3. Gentles AJ
4. Sahoo D
5. Dalerba P
6. Mitra SS
7. Wang J
8. Contreras-Trujillo H
9. Martin R
10. Cohen JD
11. Lovelace P
12. Scheeren FA
13. Chao MP
14. Weiskopf K
15. Tang C
16. Volkmer AK
17. Naik TJ
18. Storm TA
19. Mosley AR
20. Edris B
21. Schmid SM
22. Sun CK
23. Chua MS
24. Murillo O
25. Rajendran P
26. Cha AC
27. Chin RK
28. Kim D
29. Adorno M
30. Raveh T
31. Tseng D
32. Jaiswal S
33. Enger PØ
34. Steinberg GK
35. Li G
36. So SK
37. Majeti R
38. Harsh GR
39. van de Rijn M
40. Teng NNH
41. Sunwoo JB
42. Alizadeh AA
43. Clarke MF
44. Weissman IL
(2012) The CD47-signal regulatory protein alpha (SIRPa) interaction is a therapeutic target for human solid tumors
PNAS 109:6662–6667.

https://doi.org/10.1073/pnas.1121623109
- PubMed
- Google Scholar
1. Wilson TR
2. Fridlyand J
3. Yan Y
4. Penuel E
5. Burton L
6. Chan E
7. Peng J
8. Lin E
9. Wang Y
10. Sosman J
11. Ribas A
12. Li J
13. Moffat J
14. Sutherlin DP
15. Koeppen H
16. Merchant M
17. Neve R
18. Settleman J
(2012) Widespread potential for growth-factor-driven resistance to anticancer kinase inhibitors
Nature 487:505–509.

https://doi.org/10.1038/nature11249
- PubMed
- Google Scholar
Website
1. Wold B
2. Tabak LA
3. Ator N
4. Berro L
5. Bliss-Moreau E
6. Gonzalez-Villalobos RA
7. Hankenson C
8. Kiermer V
9. Mathis KW
10. Nusser S
11. Nuzzo R
12. Prager E
13. Ramirez FD
14. Svenson K
15. Berridge B
16. Brown P
17. Clayton J
18. Gordon JA
19. Lauer M
20. Wolinetz C
(2021) ACD Working Group on Enhancing Rigor, Transparency, and Translatability in Animal Research
Accessed August 5, 2021.

https://www.acd.od.nih.gov/documents/presentations/06112021_ACD_WorkingGroup_FinalReport.pdf
1. Xu W
2. Yang H
3. Liu Y
4. Yang Y
5. Wang P
6. Kim S-H
7. Ito S
8. Yang C
9. Wang P
10. Xiao M-T
11. Liu L
12. Jiang W
13. Liu J
14. Zhang J
15. Wang B
16. Frye S
17. Zhang Y
18. Xu Y
19. Lei Q
20. Guan K-L
21. Zhao S
22. Xiong Y
(2011) Oncometabolite 2-hydroxyglutarate is a competitive inhibitor of α-ketoglutarate-dependent dioxygenases
Cancer Cell 19:17–30.

https://doi.org/10.1016/j.ccr.2010.12.014
- PubMed
- Google Scholar
(2019) Replication Study: The microRNA miR-34a inhibits prostate cancer stem cells and metastasis by directly repressing CD44
eLife 8:e43511.

https://doi.org/10.7554/eLife.43511
- PubMed
- Google Scholar
1. Zhu Q
2. Pao GM
3. Huynh AM
4. Suh H
5. Tonnu N
6. Nederlof PM
7. Gage FH
8. Verma IM
(2011) BRCA1 tumour suppression occurs via heterochromatin-mediated silencing
Nature 477:179–184.

https://doi.org/10.1038/nature10371
- PubMed
- Google Scholar
1. Zuber J
2. Shi J
3. Wang E
4. Rappaport AR
5. Herrmann H
6. Sison EA
7. Magoon D
8. Qi J
9. Blatt K
10. Wunderlich M
11. Taylor MJ
12. Johns C
13. Chicas A
14. Mulloy JC
15. Kogan SC
16. Brown P
17. Valent P
18. Bradner JE
19. Lowe SW
20. Vakoc CR
(2011) RNAi screen identifies Brd4 as a therapeutic target in acute myeloid leukaemia
Nature 478:524–528.

https://doi.org/10.1038/nature10334
- PubMed
- Google Scholar

Decision letter

Peter Rodgers

Reviewing Editor; eLife, United Kingdom
Eduardo Franco

Senior Editor; McGill University, Canada

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Decision letter after peer review:

Thank you for submitting your article "Challenges for Assessing Reproducibility and Replicability Across the Research Lifecycle in Preclinical Cancer Biology" to eLife for consideration as a Feature Article. Your article has been reviewed by three peer reviewers, including the eLife Features Editor Peter Rodgers and the evaluation has been overseen by Eduardo Franco as the Senior Editor.

The reviewers and editors have discussed the reviews and we have drafted this decision letter to help you prepare a revised submission.

Summary:

This is a well written paper that clearly expresses the various issues concerning interactions with scientists to acquire reagents etc. to test data reproducibility for "high impact" papers published during 2010-2012.

For the purpose of this review, I refer to the authors of the series as the "replicators" to distinguish them from the "authors" of the 2010-2012 publications. The current manuscript summarizes the heterogeneity of the outcomes of their communications with the authors of the papers whose experiments they sought to replicate. Many authors do not appear to have been willing to provide additional data when requested by the replicators. In this regard, Figure 3 presents an interesting analysis that suggests a limited correlation between the amount of information requested by the replicators and the degree of cooperation by the authors. However, a majority of the authors were willing to provide key reagents.

Essential revisions:

1. The Discussion makes several recommendations for increasing transparency of preclinical experiments under several headings: improving documentation and reporting; improving data, code, and materials transparency and sharing; improving preregistration of studies and analysis plans; improving rigor, reporting, and incentives with Registered Reports; and improving incentives for replication. The recommendations are presented as though none of them might have any unintended negative consequences, an unlikely scenario.

Please include a brief discussion of the possible unintended negative consequences of the recommendations.

2. In several instances, the replicators note that changes in the direction they are recommending have already been instituted by journals, which implies that this landscape is changing. It would seem prudent to suggest that an appropriately constituted committee consider the current situation before deciding what further changes might be appropriate.

Please include a brief discussion of the next steps for the preclinical cancer research community in response to your findings.

3. It is not clear how devoting more resources to the replication of experiments will help the overall process, as such experiments can only evaluate a small proportion of the published literature. In this regard, the replicators seem to have two beliefs that may be flawed.

First, they seem to believe that the scientific community will accept that the inability of replicators to replicate the findings of a particular experiment means that the initial experiment was flawed and that other investigators will be skeptical of the conclusion of the publication in question. However, that may not be the case.

Conversely, the replicators seem to believe that if other laboratories in the course of their regular studies fail to replicate the findings, either they will not publish the negative results or, if they do publish them, those results will not be accepted by the scientific community as implying that they should be skeptical of the conclusion of the initial publication. Again, this belief may be wrong.

Please comment on these two points in the manuscript.

4. Were there any particular types of experiments/techniques for which published papers lacked adequate details and/or for which authors seemed less likely to provide you with information and/or materials (such a cell lines and reagents)?

5. Please comment on how the numbers of mice you calculated were needed for the various replication experiments compares with the numbers of mice that were used in the original experiments?

Editorial points:

a) There is a tendency to represent the authors own published results in the third person (e.g. Errington and colleagues…..), giving the impression that the quoted material is from others. Please address this.

b) The manuscript cites two unpublished manuscripts:

Errington, 2021;

Errington, Mathur et al., 2021.

Please clarify how these two manuscripts are related to the present manuscript.

c) Given that a key finding of this manuscript is that many published papers do not adequately describe the methods used to obtain the results being reported, it is fitting that these authors describe what they have done in exhaustive detail! However, this can make the manuscript difficult to read in places, and most of the comments below are of an editorial nature and are intended to make the manuscript more readable.

i) The title and abstract are both too long.

ii) The passage “Replication experiments across the research life cycle” would be more readable if some of the medians and IQRs were moved to the caption for figure.

iii) The section "Design phase" would be more readable if the six sentences that start "By experiment... " were less obtrusive.

iv) There are a few passages in the discussion that unnecessarily repeat material from earlier in the article.

v) Table 1 requires a short caption to explain what are shown in columns 2, 4 and 6, and to explain why these numbers are usually different.

https://doi.org/10.7554/eLife.67995.sa1

Author response

Essential revisions:

1. The Discussion makes several recommendations for increasing transparency of preclinical experiments under several headings: improving documentation and reporting; improving data, code, and materials transparency and sharing; improving preregistration of studies and analysis plans; improving rigor, reporting, and incentives with Registered Reports; and improving incentives for replication. The recommendations are presented as though none of them might have any unintended negative consequences, an unlikely scenario.

Please include a brief discussion of the possible unintended negative consequences of the recommendations.

We have added a brief discussion of potential unintended consequences for each of the solutions offered and pointed readers to empirical evidence when available.

2. In several instances, the replicators note that changes in the direction they are recommending have already been instituted by journals, which implies that this landscape is changing. It would seem prudent to suggest that an appropriately constituted committee consider the current situation before deciding what further changes might be appropriate.

Please include a brief discussion of the next steps for the preclinical cancer research community in response to your findings.

We closed the discussion by pointing out the importance of evaluating the impact of these interventions to optimize their benefit and minimize their costs and highlighted the importance of bringing stakeholders together to evaluate the evidence in scaling up such interventions. Specifically, on Page 28 we added “an active metascience research community that evaluates the impact of these new behaviors and policies will help identify unintended negative consequences, improve their implementation, and optimize their adoption for facilitating research progress. Stakeholders in the cancer biology community including researchers, funders, societies, and institutional representatives could facilitate and support research investigations of the scientific process so that decisions about adopting these behaviors at scale can be evidence-based and clearly represent both the costs and benefits.”

3. It is not clear how devoting more resources to the replication of experiments will help the overall process, as such experiments can only evaluate a small proportion of the published literature. In this regard, the replicators seem to have two beliefs that may be flawed.

First, they seem to believe that the scientific community will accept that the inability of replicators to replicate the findings of a particular experiment means that the initial experiment was flawed and that other investigators will be skeptical of the conclusion of the publication in question. However, that may not be the case.

Conversely, the replicators seem to believe that if other laboratories in the course of their regular studies fail to replicate the findings, either they will not publish the negative results or, if they do publish them, those results will not be accepted by the scientific community as implying that they should be skeptical of the conclusion of the initial publication. Again, this belief may be wrong.

Please comment on these two points in the manuscript.

We added discussion of these issues on pages 26-27 of the manuscript. We highlight both issues and refer readers to in-depth treatments of the role and interpretation of replication studies in advancing research.

4. Were there any particular types of experiments/techniques for which published papers lacked adequate details and/or for which authors seemed less likely to provide you with information and/or materials (such a cell lines and reagents)?

We added results, including the addition of two supplementary figures (Figure 3—figure supplement 1 and Figure 3—figure supplement 2), to page 12 of the manuscript. There was little variation in the extent of clarifications or helpfulness by category of experimental technique.

5. Please comment on how the numbers of mice you calculated were needed for the various replication experiments compares with the numbers of mice that were used in the original experiments?

We added these comparisons (replications were 25% higher in average sample size compared to original experiments) to page 12 of the manuscript.

Editorial points:

a) There is a tendency to represent the authors own published results in the third person (e.g. Errington and colleagues…..), giving the impression that the quoted material is from others. Please address this.

The authorship lists between the papers are not perfectly overlapping. Using “we” is reasonable with recognition that it is a rough approximation of who contributed to the underlying research.

b) The manuscript cites two unpublished manuscripts:

Errington, 2021;

Errington, Mathur et al., 2021.

Please clarify how these two manuscripts are related to the present manuscript.

Errington (2021) reports individual experiments that were completed as part of the Reproducibility Project: Cancer Biology but did not make it into a published Replication Study because other experiments from that Registered Report were not performed or completed.

Errington, Mathur, et al. (2021) is a meta-analysis of the statistical outcomes from the replication studies that were completed as part of the Reproducibility Project: Cancer Biology. It is now under review at eLife as a companion paper to this piece.

c) Given that a key finding of this manuscript is that many published papers do not adequately describe the methods used to obtain the results being reported, it is fitting that these authors describe what they have done in exhaustive detail! However, this can make the manuscript difficult to read in places, and most of the comments below are of an editorial nature and are intended to make the manuscript more readable.

i) The title and abstract are both too long.

Revised and shortened. The title is now “Challenges for Assessing Reproducibility and Replicability in Preclinical Cancer Biology” and the abstract is now 200 words.

ii) The passage “Replication experiments across the research life cycle” would be more readable if some of the medians and IQRs were moved to the caption for figure.

Done.

iii) The section "Design phase" would be more readable if the six sentences that start "By experiment... " were less obtrusive.

We removed the sentences summarizing the findings by paper and retained the sentences summarizing the findings by experiment. And, we referred the reader to Figure 1—figure supplement 1 to see the data represented by paper (very similar results). We retained summary data at the papers-level in a couple of places in the manuscript when it was particularly relevant to characterize the findings for the paper as a whole.

iv) There are a few passages in the discussion that unnecessarily repeat material from earlier in the article.

We reviewed and edited the manuscript to cleanly present the findings and avoid unnecessary redundancy. We did retain the opening paragraph of the Discussion section that briefly summarizes the key findings from the results.

v) Table 1 requires a short caption to explain what are shown in columns 2, 4 and 6, and to explain why these numbers are usually different.

Done.

https://doi.org/10.7554/eLife.67995.sa2

Article and author information

Author details

Timothy M Errington

Timothy M Errington is at the Center for Open Science, Charlottesville, United States

Contribution
Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Supervision, Validation, Visualization, Writing – original draft, Writing – review and editing

For correspondence
tim@cos.io

Competing interests
Employed by the nonprofit Center for Open Science that has a mission to increase openness, integrity, and reproducibility of research

"This ORCID iD identifies the author of this article:" 0000-0002-4959-5143
Alexandria Denis

Alexandria Denis is at the Center for Open Science, Charlottesville, United States

Present address
Fordham University School of Law, New York, United States

Contribution
Data curation, Investigation, Writing – review and editing

Competing interests
Employed by the nonprofit Center for Open Science that has a mission to increase openness, integrity, and reproducibility of research
Nicole Perfito

Nicole Perfito is at Science Exchange, Palo Alto, United States

Present address
Rarebase, Palo Alto, United States

Contribution
Conceptualization, Investigation, Methodology, Project administration, Writing – review and editing

Competing interests
Employed by and hold shares in Science Exchange Inc

"This ORCID iD identifies the author of this article:" 0000-0001-9546-215X
Elizabeth Iorns

Elizabeth Iorns is at Science Exchange, Palo Alto, United States

Contribution
Conceptualization, Data curation, Funding acquisition, Investigation, Methodology, Project administration, Supervision, Validation, Writing – review and editing

Competing interests
Employed by and hold shares in Science Exchange Inc

"This ORCID iD identifies the author of this article:" 0000-0002-5515-1258
Brian A Nosek

Brian A Nosek is at the Center for Open Science and the University of Virginia, Charlottesville, United States

Contribution
Conceptualization, Funding acquisition, Methodology, Supervision, Writing – original draft, Writing – review and editing

Competing interests
Employed by the nonprofit Center for Open Science that has a mission to increase openness, integrity, and reproducibility of research

"This ORCID iD identifies the author of this article:" 0000-0001-6797-5476

Funding

Arnold Ventures

Timothy M Errington
Alexandria Denis
Nicole Perfito
Elizabeth Iorns
Brian A Nosek

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Acknowledgements

This work was supported by a grant from Arnold Ventures (formerly known as the Laura and John Arnold Foundation), provided to the Center for Open Science in collaboration with Science Exchange. We thank Anne Chestnut for assistance creating Figure 1. We thank Fraser Tan, Joelle Lomax, Rachel Tsui, and Stephen Williams for helping in coordination efforts during the course of the project. We thank all Science Exchange providers who provided their services, and all employees at Science Exchange and the Center for Open Science who contributed to administrative and platform development efforts that enabled this project to occur.

Publication history

Received: March 2, 2021
Accepted: July 20, 2021
Version of Record published: December 7, 2021
Version of Record updated: June 14, 2023

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.