Hybrid algorithms for generating optimal designs for discriminating multiple nonlinear models under various error distributional assumptions

PLOS ONE, Oct 2020

Finding a model-based optimal design that can optimally discriminate among a class of plausible models is a difficult task because the design criterion is non-differentiable and requires 2 or more layers of nested optimization. We propose hybrid algorithms based on particle swarm optimization (PSO) to solve such optimization problems, including cases when the optimal design is singular, the mean response of some models are not fully specified and problems that involve 4 layers of nested optimization. Using several classical examples, we show that the proposed PSO-based algorithms are not models or criteria specific, and with a few repeated runs, can produce either an optimal design or a highly efficient design. They are also generally faster than the current algorithms, which are generally slow and work for only specific models or discriminating criteria. As an application, we apply our techniques to find optimal discriminating designs for a dose-response study in toxicology with 5 possible models and compare their performances with traditional and a recently proposed algorithm. In the supplementary material, we provide a R package to generate different types of discriminating designs and evaluate efficiencies of competing designs so that the user can implement an informed design.

Hybrid algorithms for generating optimal designs for discriminating multiple nonlinear models under various error distributional assumptions

PLOS ONE RESEARCH ARTICLE Hybrid algorithms for generating optimal designs for discriminating multiple nonlinear models under various error distributional assumptions Ray-Bing Chen1,2, Ping-Yang Chen1, Cheng-Lin Hsu1, Weng Kee Wong ID3* a1111111111 a1111111111 a1111111111 a1111111111 a1111111111 1 Department of Statistics, National Cheng Kung University, Tainan, Taiwan, 2 Institute of Data Science, National Cheng Kung University, Tainan, Taiwan, 3 Department of Biostatistics, University of California, Los Angeles, California, United States of America * Abstract OPEN ACCESS Citation: Chen R-B, Chen P-Y, Hsu C-L, Wong WK (2020) Hybrid algorithms for generating optimal designs for discriminating multiple nonlinear models under various error distributional assumptions. PLoS ONE 15(10): e0239864. https://doi.org/10.1371/journal.pone.0239864 Editor: Ping He, Jinan University, China, HONG KONG Received: April 15, 2020 Accepted: September 14, 2020 Published: October 5, 2020 Peer Review History: PLOS recognizes the benefits of transparency in the peer review process; therefore, we enable the publication of all of the content of peer review and author responses alongside final, published articles. The editorial history of this article is available here: https://doi.org/10.1371/journal.pone.0239864 Copyright: © 2020 Chen et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Data Availability Statement: They are no real data involved because this work describes how to collect data to identify the right statistical models in the most efficient manner under various criteria Finding a model-based optimal design that can optimally discriminate among a class of plausible models is a difficult task because the design criterion is non-differentiable and requires 2 or more layers of nested optimization. We propose hybrid algorithms based on particle swarm optimization (PSO) to solve such optimization problems, including cases when the optimal design is singular, the mean response of some models are not fully specified and problems that involve 4 layers of nested optimization. Using several classical examples, we show that the proposed PSO-based algorithms are not models or criteria specific, and with a few repeated runs, can produce either an optimal design or a highly efficient design. They are also generally faster than the current algorithms, which are generally slow and work for only specific models or discriminating criteria. As an application, we apply our techniques to find optimal discriminating designs for a dose-response study in toxicology with 5 possible models and compare their performances with traditional and a recently proposed algorithm. In the supplementary material, we provide a R package to generate different types of discriminating designs and evaluate efficiencies of competing designs so that the user can implement an informed design. Introduction Much of the work in optimal design of experiments assumes a known parametric model, apart from the unknown model parameters and the objective is to develop a plan to collect data judiciously for accurate statistical inference. For example, one may wish to design a study to estimate parameters in a nonlinear regression model. In practice, the model is rarely known with certainty and it is likely that there are a few plausible models. Optimal design problems concern identifying the best design, i.e. how to collect data to judiciously select the right model among the plausible models. When there are 2 models and errors are normally distributed and one of the 2 models is fully known, [1] introduced T-optimality as a design discrimination PLOS ONE | https://doi.org/10.1371/journal.pone.0239864 October 5, 2020 1 / 30 PLOS ONE and assumptions. However codes are available to reproduce the results in the paper. Funding: Authors P-YC, R-BC, and WKW received partial support for this study in the form of a grant from the National Institute of General Medical Sciences of the National Institutes of Health under Award Number R01GM107639. R-BC is also partially supported by the Mathematics Division of the National Center for Theoretical Sciences in Taiwan. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Competing interests: The authors have declared that no competing interests exist. Hybrid algorithms for discriminating multiple nonlinear models using various criteria criterion based on the squared difference between the 2 mean predictions. [2] reviewed optimal discriminating design problems and since then, locally T-optimal designs have been applied and studied in various setups, see for example, [3–8] and [9]. When the outcomes are binary [10] or model errors are not normally distributed, [11] proposed KL-optimality criterion based on the Kullback-Leibler (KL) divergence as the distance measure between the 2 competing models. Analytical descriptions of optimal discriminating designs rarely exist unless there are simple settings, such as when we want to find an optimal design to discriminate between a constant model and a quadratic model, and both models have homoscedastic errors [1]. When there are multiple models to discriminate, [12] proposed a Fedorov-Wynn type algorithm to find a T-optimal design and the convergence of such an algorithm to the optimal discriminating design was established recently under some restrictive conditions [13]. Over time, there were several modifications of the algorithm to find various optimal designs, including [11], who amended it to find KL-optimal designs. Algorithms are a practical way to find optimal discriminating designs. Recently, natureinspired metaheuristic algorithms have been repeatedly shown to be fast, flexible and efficient for solving hard and high dimensional optimization problems in engineering and computer science. 2 such algorithms are differential evolutionary (DE) algorithm proposed by [14] and particle swarm optimization (PSO) proposed by [15]. [16] was the first to show that PSO outperformed traditional algorithms in statistics for finding a variety of optimal designs. Maximin design problems are much harder problems to solve because the design criterion is non-differentiable and require multiple nested optimization. [17] developed hybridized PSO-based algorithms to solve more complicated optimal design problems such as the standardized maximin optimal criteria, which includes the simpler minimax design problems. Most recently, [18] applied DE to find optimal approximate designs for logistic models with up to 5 factors with all pairwise interaction terms. The number of variables to optimize for such a model is at least 95 if the optimal design is minimally supported; otherwise, there will be many more vari (...truncated)


This is a preview of a remote PDF: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0239864&type=printable
Article home page: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0239864

Ray-Bing Chen, Ping-Yang Chen, Cheng-Lin Hsu, Weng Kee Wong. Hybrid algorithms for generating optimal designs for discriminating multiple nonlinear models under various error distributional assumptions, PLOS ONE, 2020, Volume 15, Issue 10, DOI: 10.1371/journal.pone.0239864