Best Linear Unbiased Prediction for Multifidelity Computer Experiments

Jun 2018

Recently it becomes a growing trend to study complex systems which contain multiple computer codes with different levels of accuracy, and a number of hierarchical Gaussian process models are proposed to handle such multiple-fidelity codes. This paper derives the best linear unbiased prediction for three popular classes of multiple-level Gaussian process models. The predictors all have explicit expressions at each untried point. Empirical best linear unbiased predictors are also provided by plug-in methods with generalized maximum likelihood estimators of unknown parameters.

Article PDF cannot be displayed. You can download it here:

http://downloads.hindawi.com/journals/mpe/2018/8525736.pdf

Best Linear Unbiased Prediction for Multifidelity Computer Experiments

Hindawi Mathematical Problems in Engineering Volume 2018, Article ID 8525736, 7 pages https://doi.org/10.1155/2018/8525736 Research Article Best Linear Unbiased Prediction for Multifidelity Computer Experiments Weiyan Mu 1 ,1 Qiuyue Wei,1 Dongli Cui,1 and Shifeng Xiong2 School of Science, Beijing University of Civil Engineering and Architecture, Beijing 100044, China NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China 2 Correspondence should be addressed to Weiyan Mu; Received 26 September 2017; Revised 26 April 2018; Accepted 8 May 2018; Published 7 June 2018 Academic Editor: Elisa Francomano Copyright Β© 2018 Weiyan Mu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Recently it becomes a growing trend to study complex systems which contain multiple computer codes with different levels of accuracy, and a number of hierarchical Gaussian process models are proposed to handle such multiple-fidelity codes. This paper derives the best linear unbiased prediction for three popular classes of multiple-level Gaussian process models. The predictors all have explicit expressions at each untried point. Empirical best linear unbiased predictors are also provided by plug-in methods with generalized maximum likelihood estimators of unknown parameters. 1. Introduction With the rapid development of computer technology, computer experiments have been widely used in engineering and science [1]. Consider a computer code with inputs x ∈ [0, 1]𝑑 . The common statistical modeling method is to model the response of the code as 𝑝 π‘Œ (x) = βˆ‘π‘“π‘— (x) 𝛽𝑗 + 𝑍 (x) = 𝑓󸀠 (x) 𝛽 + 𝑍 (x) , (1) 𝑗=1 where 𝑓1 (β‹…), . . . , 𝑓𝑝 (β‹…) are known regression functions, 𝛽 = (𝛽1 , . . . , 𝛽𝑝 )σΈ€  is a vector of unknown regression coefficients, and 𝑍(β‹…) is a stationary Gaussian process on [0, 1]𝑑 having zero mean, variance πœŽπ‘§2 , and correlation function 𝑅(β‹…). Let Y = (𝑦1 , . . . , 𝑦𝑛 )σΈ€  be the responses corresponding to the design points x1 , . . . , x𝑛 ∈ [0, 1]𝑑 . With a known correlation function 𝑅(β‹…), the best linear unbiased predictor (BLUP) is given by Μ‚ Μ‚ + rσΈ€  π‘…βˆ’1 (Y𝑛 βˆ’ F𝛽) Μ‚ (x0 ) = π‘Œ Μ‚0 ≑ f0 σΈ€  𝛽 π‘Œ 0 (2) for x0 ∈ [0, 1]𝑑 , where r0 = (𝑅(x0 βˆ’ x1 ), . . . , 𝑅(x0 βˆ’ x𝑛 ))σΈ€  , Μ‚ = R = (𝑅(x𝑖 βˆ’ x𝑗 ))𝑖,𝑗=1,...,𝑛 , F = (f(x1 ), . . . , f(x𝑛 ))σΈ€  , and 𝛽 (FσΈ€  Rβˆ’1 F)βˆ’1 FσΈ€  Rβˆ’1 Y is the generalized least squares estimator of 𝛽 [1]. It is a growing trend to study the complex system which contains multifidelity computer codes with different levels of accuracy. For example, a Bayesian approach is described to predict and analyze complex computer codes which can be run at different levels of sophistication [2]. A novel approach is taken to integrate data from approximate and detailed simulations to build a surrogate model that describes the relationship between output and input parameters [3]. The Bayesian hierarchical Gaussian process (BHGP) models are introduced to integrate low-accuracy and high-accuracy [4]. A class of nonstationary Gaussian process models are proposed to link the computer outputs of different mesh densities [5]. However, there are few papers that provide BLUPs for multifidelity computer experiments. The purpose of this article is to find BLUPs for multifidelity computer experiments. The structure of the article is as follows. In Section 2, BLUPs for two levels of accuracy [4] are discussed. In Section 3, BLUPs for general π‘˜-level cases in autoregressive model described by Kennedy and O’Hagan [2] are illustrated. In Section 4, BLUPs for continuous level in nonstationary Gaussian process model described by Tuo, Wu, and Yu [5] are demonstrated. We present a real application in Section 5. Concluding remarks are given in Section 6. 2 Mathematical Problems in Engineering 2. BLUPs for Two-Level Cases + πœŽπ‘2 aσΈ€ 1 Rπœƒ11 a1 + πœŽπ‘2 𝜎𝜌2 aσΈ€ 2 V1 a2 + 𝜌02 πœŽπ‘2 aσΈ€ 2 Rπœƒ21 a2 The two experiments considered in this section are named as the low-accuracy experiment (LE) and high-accuracy experiment (HE). Let GP(πœ‡, 𝜎2 , πœƒ) denote the Gaussian process with mean πœ‡, variance 𝜎2 , and correlation parameters πœƒ. Let 𝐷𝑙 = (x1 , . . . , x𝑛 ) and π·β„Ž = (s1 , . . . , sπ‘š ) denote the design set for the LE and HE, respectively. Following Qian and Wu [4], for any x𝑖 ∈ 𝐷𝑙 , the LE is described by + (𝜎𝜌2 + 𝜌02 ) πœŽπ‘2 + 2𝜌0 πœŽπ‘2 aσΈ€ 1 Rπœƒ31 a2 βˆ’ 2𝜌0 πœŽπ‘2 aσΈ€ 1 r1πœƒ1 π‘Œπ‘™ (x𝑖 ) = f σΈ€  (x𝑖 ) 𝛽 + 𝑍 (x𝑖 ) , (3) where 𝑍(β‹…) ∼ GP(0, πœŽπ‘2 , πœƒ1 ) and f(x) = [𝑓1 (x), . . . , π‘“π‘ž (x)]σΈ€  is a set of prespecified regression functions. For any s𝑖 ∈ π·β„Ž , the HE can be described by π‘Œβ„Ž (s𝑖 ) = 𝜌 (s𝑖 ) 𝑦𝑙 (s𝑖 ) + 𝛿 (s𝑖 ) + πœ€ (s𝑖 ) , (4) where the scale changes from LE to HE 𝜌(β‹…) ∼ GP(𝜌0 , 𝜎𝜌2 , πœƒ3 ), the location adjustment 𝛿(β‹…) ∼ GP(𝛿0 , πœŽπ›Ώ2 , πœƒ2 ), the measurement error πœ€ ∼ 𝑁(0, πœŽπœ€2 ) and 𝑍(β‹…), 𝛿(β‹…), and πœ€ are jointly independent. Let Y𝑙 = (π‘Œπ‘™ (x1 ), . . . , π‘Œπ‘™ (x𝑛 ))σΈ€  and Yβ„Ž = (π‘Œπ‘™ (s1 ), . . . , π‘Œπ‘™ (sπ‘š ))σΈ€  . 𝑑 βˆ’ 2πœŽπ‘2 𝜎𝜌2 aσΈ€ 2 k1 βˆ’ 2𝜌02 πœŽπ‘2 aσΈ€ 2 r2πœƒ1 + πœŽπ›Ώ2 (aσΈ€ 2 Rπœƒ2 a2 βˆ’ 2aσΈ€ 2 rπœƒ2 + 1) + πœŽπœ–2 (aσΈ€ 2 a2 + 1) , (8) where V = (π‘…πœƒ3 (s𝑖 βˆ’ s𝑗 )f(s𝑖 )f σΈ€  (s𝑗 )), V1 = (π‘…πœƒ1 (s𝑖 βˆ’ s𝑗 )π‘…πœƒ3 (s𝑖 βˆ’ s𝑗 )), Rπœƒ21 = (π‘…πœƒ1 (s𝑖 βˆ’ s𝑗 )), Rπœƒ2 = (π‘…πœƒ2 (s𝑖 βˆ’ s𝑗 )), 1 ≀ 𝑖, 𝑗 ≀ π‘š; k = (π‘…πœƒ3 (x0 βˆ’ s𝑖 )f σΈ€  (s𝑖 )), k1 = (π‘…πœƒ1 (x0 βˆ’ s𝑖 )π‘…πœƒ3 (x0 βˆ’ s𝑖 )), r2πœƒ1 = (π‘…πœƒ1 (x0 βˆ’ s𝑖 ))σΈ€  , rπœƒ2 = (π‘…πœƒ2 (x0 βˆ’ s𝑖 ))σΈ€  , 1 ≀ 𝑖 ≀ π‘š; Rπœƒ11 = (π‘…πœƒ1 (x𝑖 βˆ’ x𝑗 )),1 ≀ 𝑖, 𝑗 ≀ 𝑛; Rπœƒ31 = (π‘…πœƒ1 (x𝑖 βˆ’ s𝑗 )),1 ≀ 𝑖 ≀ π‘š, 1 ≀ 𝑗 ≀ 𝑛; r1πœƒ1 = (π‘…πœƒ1 (x0 βˆ’ x𝑖 ))σΈ€  , 1 ≀ 𝑖 ≀ 𝑛. Thus the Lagrange multipliers can be used to solve BLUP corresponding to a1 and a2 that minimize (5) subject to aσΈ€ 1 F1 + 𝜌0 aσΈ€ 2 F2 = 𝜌0 f σΈ€  (x0 ) , aσΈ€ 2 1π‘š = 1. The Lagrange function is σΈ€  Theorem 1. For x0 ∈ [0, 1] , the BLUP of π‘Œβ„Ž (x0 ) is a Y, where 𝐿 = 𝜎𝜌2 (aσΈ€ 2 𝛽󸀠 V𝛽a2 + f σΈ€  (x0 ) 𝛽𝛽󸀠 f (x0 ) a βˆ’1 βˆ’1 = Bβˆ’1 A (AσΈ€  Bβˆ’1 A) k1 + [Bβˆ’1 βˆ’ Bβˆ’1 A (AσΈ€  Bβˆ’1 A) AσΈ€  Bβˆ’1 ] k2 , βˆ’ 2aσΈ€ 2 k𝛽𝛽󸀠 f (x0 )) + πœŽπ‘2 aσΈ€ 1 Rπœƒ11 a1 πœŽπ‘2 𝜎𝜌2 aσΈ€ 2 V1 a2 (5) + 𝜌02 πœŽπ‘2 aσΈ€ 2 Rπœƒ21 a2 + (𝜎𝜌2 + 𝜌02 ) πœŽπ‘2 + 2𝜌0 πœŽπ‘2 aσΈ€ 1 Rπœƒ31 a2 F1 0 A=[ ], 𝜌0 F2 1π‘š k1 = [ 𝜌0 f (x0 ) 1 (9) βˆ’ 2𝜌0 πœŽπ‘2 aσΈ€ 1 r1πœƒ1 βˆ’ 2πœŽπ‘2 𝜎𝜌2 aσΈ€ 2 k1 βˆ’ 2𝜌02 πœŽπ‘2 aσΈ€ 2 r2πœƒ1 (10) + πœŽπ›Ώ2 (aσΈ€ 2 Rπœƒ2 a2 βˆ’ 2aσΈ€ 2 rπœƒ2 + 1) + πœŽπœ–2 (aσΈ€ 2 a2 + 1) ], 𝜌0 πœŽπ‘2 r1πœƒ1 k2 = [ 2 ], 𝜎𝜌 k𝛽𝛽󸀠 f (x0 ) + 𝜎𝜌2 πœŽπ‘2 k1 + 𝜌02 πœŽπ‘2 r2πœƒ1 + πœŽπ›Ώ2 rπœƒ2 + 2πœ†σΈ€ 1 (FσΈ€ 1 a1 + 𝜌0 FσΈ€ 2 a2 βˆ’ 𝜌0 f (x0 )) + 2πœ† 2 (1σΈ€ π‘š a2 (6) Let the gradient with respect to a1 , a2 , πœ†1 , πœ† 2 be zero, and we have πœ† k1 0 AσΈ€  [ (11) ] [ ] = [ ], k2 a A B and B πœŽπ‘2 Rπœƒ11 𝜌0 πœŽπ‘2 Rπœƒ31 ]. =[ 2 3 σΈ€  2 σΈ€  𝜌𝜎 R 𝜎𝜌 𝛽 V𝛽 + 𝜎𝜌2 πœŽπ‘2 V1 + 𝜌02 πœŽπ‘2 Rπœƒ21 + πœŽπ›Ώ2 Rπœƒ2 + πœŽπœ–2 Iπ‘š [ 0 𝑍 πœƒ1 ] Μ‚β„Ž (x0 ) = aσΈ€  Y𝑙 + aσΈ€  Yβ„Ž + 𝐢 based Proof. The linear predictor π‘Œ 1 2 on training data Y𝐿 and Y𝐻 at an untried point x0 is unbiased for π‘Œβ„Ž (x0 ) provided aσΈ€ 1 F1 + 𝜌0 aσΈ€ 2 F2 = 𝜌0 f σΈ€  (x0 ) , aσΈ€ 2 1π‘š = 1, (7) 𝐢 = 0. For any linear unbiased predictor (LUP) of π‘Œ0 = π‘Œβ„Ž (x0 ), Μ‚β„Ž (x0 ) = aσΈ€  Y𝑙 + aσΈ€  Yβ„Ž , the mean squared prediction error π‘Œ 1 2 (MSPE) of aσΈ€ 1 Y𝑙 + aσΈ€ 2 Yβ„Ž is 𝐸{𝑀2 } = 𝐸{(aσΈ€ 1 Y𝑙 + aσΈ€ 2 Yβ„Ž βˆ’ π‘Œ0 )2 }. We have 𝐸 {𝑀2 } = 𝜎𝜌2 (aσΈ€ 2 𝛽󸀠 (...truncated)


This is a preview of a remote PDF: http://downloads.hindawi.com/journals/mpe/2018/8525736.pdf
Article home page: https://www.hindawi.com/journals/mpe/2018/8525736/

Weiyan Mu, Qiuyue Wei, Dongli Cui, Shifeng Xiong. Best Linear Unbiased Prediction for Multifidelity Computer Experiments, 2018, 2018, DOI: 10.1155/2018/8525736