Methods for randomized, blinded, controlled evaluation of putative disease interventions in multilaboratory, preclinical assessment networks

Lab Animal, Feb 2026

Science faces a reproducibility crisis, and public trust in science declines when large clinical trials, which had been qualified by promising preclinical studies, fail. While some clinical trial designs may have been inadequate, preclinical assessments of disease interventions might have lacked key elements of rigor such as treatment concealment, randomization, blinded outcomes, prespecified and adequate sample sizes, and models including comorbidities. Here, to demonstrate feasibility and practicality of enhanced rigor in preclinical assessment, we designed a six-laboratory network that implemented rigorous study elements, using acute ischemic stroke for demonstration. This network enrolled 2,615 rodents in 5 different models and implemented a multistage, multiarm statistical design that sequentially eliminated candidate interventions during interim analyses. The methods included centralized intervention packaging, randomization, data quality assessment and data archiving. Blinded analysis of 9,274 video-recorded behavioral tasks and 3,652 magnetic resonance images were evaluated. All tools and protocols are presented and could be adapted to preclinical assessment in other disease areas.

Article PDF cannot be displayed. You can download it here:

https://www.nature.com/articles/s41684-026-01683-z.pdf

Methods for randomized, blinded, controlled evaluation of putative disease interventions in multilaboratory, preclinical assessment networks

lab animal Article https://doi.org/10.1038/s41684-026-01683-z Methods for randomized, blinded, controlled evaluation of putative disease interventions in multilaboratory, preclinical assessment networks Check for updates Jessica Lamb    1 , Karisma Nagarkatti1, Marcio A. Diniz    2,26, Ryan Cabeen3, Monica Estrada3, Karen L. Crawford3, Andre Rogatko    2, Sungjin Kim2, Cenk Ayata    4,5, David C. Hess6, Mohammad Badruzzaman Khan6, Rakesh B. Patel    7, Mariia Kumskova7, Enrique C. Leira8,9,10, Anil K. Chauhan7, SPAN Consortium* & Patrick Lyden1,11 Science faces a reproducibility crisis, and public trust in science declines when large clinical trials, which had been qualified by promising preclinical studies, fail. While some clinical trial designs may have been inadequate, preclinical assessments of disease interventions might have lacked key elements of rigor such as treatment concealment, randomization, blinded outcomes, prespecified and adequate sample sizes, and models including comorbidities. Here, to demonstrate feasibility and practicality of enhanced rigor in preclinical assessment, we designed a six-laboratory network that implemented rigorous study elements, using acute ischemic stroke for demonstration. This network enrolled 2,615 rodents in 5 different models and implemented a multistage, multiarm statistical design that sequentially eliminated candidate interventions during interim analyses. The methods included centralized intervention packaging, randomization, data quality assessment and data archiving. Blinded analysis of 9,274 video-recorded behavioral tasks and 3,652 magnetic resonance images were evaluated. All tools and protocols are presented and could be adapted to preclinical assessment in other disease areas. Science faces skepticism from the lay public, and scientists have described problems with rigor, transparency and reproducibility. Many published findings, selected from high-impact journals, have failed replication outside of the original laboratories1–3. Many factors contribute to reproducibility issues in science: inadequate sample size and proper power analysis before initiating experiments, lack of control for repeated significance testing (‘P-hacking’), inadequate blinding of the investigators or insufficient or inappropriate controls, among other deficiencies1,4–7. 1 Many groups, including the National Academy of Science, have called on grant agencies and journals to enforce higher standards of rigor and experimental design to address these deficiencies. However, appropriate methods to implement greater scientific rigor may be lacking or insufficiently developed8. Here, we address one important type of scientific study: the use of preclinical animal disease models to assess the efficacy of proposed candidate interventions. Before launching pivotal clinical trials in patients, Department of Physiology and Neuroscience of the Zilkha Neurogenetic Institute of the Keck School of Medicine, Los Angeles, CA, USA. 2Biostatistics and Bioinformatics Research Center, Samuel Oschin Comprehensive Cancer Center, Cedars-Sinai Medical Center, Los Angeles, CA, USA. 3USC Stevens Neuroimaging and Informatics Institute, Los Angeles, CA, USA. 4Neurovascular Research Unit, Department of Neurology, Harvard Medical School, Massachusetts General Hospital, Boston, MA, USA. 5Department of Radiology, Harvard Medical School, Massachusetts General Hospital, Charlestown, MA, USA. 6Department of Neurology, Medical College of Georgia, Augusta University, Augusta, GA, USA. 7Department of Internal Medicine, Division of Hematology, Oncology and Blood and Marrow Transplantation, Carver College of Medicine, University of Iowa, Iowa City, IA, USA. 8Department of Neurology, Carver College of Medicine, University of Iowa, Iowa City, IA, USA. 9Department of Neurosurgery, Carver College of Medicine, University of Iowa, Iowa City, IA, USA. 10Department of Epidemiology, College of Public Health, University of Iowa, Iowa City, IA, USA. 11Department of Neurology of the Keck School of Medicine, Los Angeles, CA, USA. 26Present address: Department of Population Health Science and Policy, Icahn School of Medicine at Mount Sinai, New York, NY, USA. *A list of authors and their affiliations appear at the end of the paper. e-mail: Lab Animal Article https://doi.org/10.1038/s41684-026-01683-z a b d Publication requests MRI images Behavior videos SOP approval LONI Communication /website MGH Yale LONI IDA USC Stats Image analysis EAB SC decisions Protocol MRI acquisition UT-Houston c Primary outcome MCAo, NDS Corner Day –7 Corner, grid, hanging wire* Corner, grid, hanging wire*, NDS 7 28 NDS 0 1 2 5 Animal records EOS, tissue banking 29 30 Statistics raw Results REDCap forms by stage 21 MRI piled Com data Database e Timeline NDS, MRI Behavior scores Model choice Histology Augusta Stage-specific treatment Number of forms Univ. of Iowa Johns Hopkins NINDS 1 21 19 18 2 3 4 Stage Treatments as prescribed Fig. 1 | Description of the network. a, Geographical representation of the six laboratories marked with a yellow dot (Augusta, University of Iowa, Johns Hopkins, Mass General Hospital (MGH), Yale and UT-Houston) sending data to the CC at University of Southern California (USC), marked with a yellow star, where the data repositories are located, including IDA of LONI and Statistics. The External Advisory Board (EAB) provides feedback to the National Institute of Neurological Disorders and Stroke (NINDS) in Washington DC, which also advises the network. b, Graphical representation of the decisions the SC makes, including approving SOPs, communication and website development, stagespecific treatment protocols, the model choice, behavioral outcome measures and the experimental protocol, which includes when MRI images are collected as well as histology decisions and publication requests from within the network and outside the network. c, General experimental timeline through end of study (EOS): each animal starts with baseline corner testing performed seven days before MCAo surgery. NDS are collected on the day of surgery, day 1, day 2 and day 28. MRI is performed at day 2 and day 29. In addition to the baseline, corner testing is performed at day 7 and day 28, along with grid testing and hanging wire testing. *The hanging wire test was discontinued after stage 1. d, Pathways of data flow from the research laboratories to central storage and analysis. Data collected at the laboratories included animal records, MRI images and behavior videos, which are sent to either other laboratories, the database or LONI for analysis. Once compiled, all raw data are sent to statistics for results. e, Total number of data entry forms in the REDCap database for each stage (1–4) of the trial. Panels a–d created in BioRender; Lamb, J. https://biorender.com/sae4s8q (2026). many funders, sponsors and regulators require that therapeutic efficacy b (...truncated)


This is a preview of a remote PDF: https://www.nature.com/articles/s41684-026-01683-z.pdf
Article home page: https://www.nature.com/articles/s41684-026-01683-z

Lamb, Jessica, Nagarkatti, Karisma, Diniz, Marcio A., Cabeen, Ryan, Estrada, Monica, Crawford, Karen L., Rogatko, Andre, Kim, Sungjin, Ayata, Cenk, Hess, David C., Khan, Mohammad Badruzzaman, Patel, Rakesh B., Kumskova, Mariia, Leira, Enrique C., Chauhan, Anil K., Lyden, Patrick. Methods for randomized, blinded, controlled evaluation of putative disease interventions in multilaboratory, preclinical assessment networks, Lab Animal, 2026, DOI: 10.1038/s41684-026-01683-z