Best Linear Unbiased Prediction for Multifidelity Computer Experiments
Hindawi
Mathematical Problems in Engineering
Volume 2018, Article ID 8525736, 7 pages
https://doi.org/10.1155/2018/8525736
Research Article
Best Linear Unbiased Prediction for Multifidelity
Computer Experiments
Weiyan Mu
1
,1 Qiuyue Wei,1 Dongli Cui,1 and Shifeng Xiong2
School of Science, Beijing University of Civil Engineering and Architecture, Beijing 100044, China
NCMIS, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing 100190, China
2
Correspondence should be addressed to Weiyan Mu;
Received 26 September 2017; Revised 26 April 2018; Accepted 8 May 2018; Published 7 June 2018
Academic Editor: Elisa Francomano
Copyright Β© 2018 Weiyan Mu et al. This is an open access article distributed under the Creative Commons Attribution License,
which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Recently it becomes a growing trend to study complex systems which contain multiple computer codes with different levels of
accuracy, and a number of hierarchical Gaussian process models are proposed to handle such multiple-fidelity codes. This paper
derives the best linear unbiased prediction for three popular classes of multiple-level Gaussian process models. The predictors all
have explicit expressions at each untried point. Empirical best linear unbiased predictors are also provided by plug-in methods with
generalized maximum likelihood estimators of unknown parameters.
1. Introduction
With the rapid development of computer technology, computer experiments have been widely used in engineering and
science [1]. Consider a computer code with inputs x β [0, 1]π .
The common statistical modeling method is to model the
response of the code as
π
π (x) = βππ (x) π½π + π (x) = πσΈ (x) π½ + π (x) ,
(1)
π=1
where π1 (β
), . . . , ππ (β
) are known regression functions, π½ =
(π½1 , . . . , π½π )σΈ is a vector of unknown regression coefficients,
and π(β
) is a stationary Gaussian process on [0, 1]π having
zero mean, variance ππ§2 , and correlation function π
(β
). Let Y =
(π¦1 , . . . , π¦π )σΈ be the responses corresponding to the design
points x1 , . . . , xπ β [0, 1]π . With a known correlation function
π
(β
), the best linear unbiased predictor (BLUP) is given
by
Μ
Μ + rσΈ π
β1 (Yπ β Fπ½)
Μ (x0 ) = π
Μ0 β‘ f0 σΈ π½
π
0
(2)
for x0 β [0, 1]π , where r0 = (π
(x0 β x1 ), . . . , π
(x0 β xπ ))σΈ ,
Μ =
R = (π
(xπ β xπ ))π,π=1,...,π , F = (f(x1 ), . . . , f(xπ ))σΈ , and π½
(FσΈ Rβ1 F)β1 FσΈ Rβ1 Y is the generalized least squares estimator
of π½ [1].
It is a growing trend to study the complex system which
contains multifidelity computer codes with different levels
of accuracy. For example, a Bayesian approach is described
to predict and analyze complex computer codes which can
be run at different levels of sophistication [2]. A novel
approach is taken to integrate data from approximate and
detailed simulations to build a surrogate model that describes
the relationship between output and input parameters [3].
The Bayesian hierarchical Gaussian process (BHGP) models
are introduced to integrate low-accuracy and high-accuracy
[4]. A class of nonstationary Gaussian process models are
proposed to link the computer outputs of different mesh
densities [5]. However, there are few papers that provide
BLUPs for multifidelity computer experiments.
The purpose of this article is to find BLUPs for multifidelity computer experiments. The structure of the article is
as follows. In Section 2, BLUPs for two levels of accuracy [4]
are discussed. In Section 3, BLUPs for general π-level cases
in autoregressive model described by Kennedy and OβHagan
[2] are illustrated. In Section 4, BLUPs for continuous level in
nonstationary Gaussian process model described by Tuo, Wu,
and Yu [5] are demonstrated. We present a real application in
Section 5. Concluding remarks are given in Section 6.
2
Mathematical Problems in Engineering
2. BLUPs for Two-Level Cases
+ ππ2 aσΈ 1 Rπ11 a1 + ππ2 ππ2 aσΈ 2 V1 a2 + π02 ππ2 aσΈ 2 Rπ21 a2
The two experiments considered in this section are named as
the low-accuracy experiment (LE) and high-accuracy experiment (HE). Let GP(π, π2 , π) denote the Gaussian process
with mean π, variance π2 , and correlation parameters π. Let
π·π = (x1 , . . . , xπ ) and π·β = (s1 , . . . , sπ ) denote the design set
for the LE and HE, respectively. Following Qian and Wu [4],
for any xπ β π·π , the LE is described by
+ (ππ2 + π02 ) ππ2 + 2π0 ππ2 aσΈ 1 Rπ31 a2 β 2π0 ππ2 aσΈ 1 r1π1
ππ (xπ ) = f σΈ (xπ ) π½ + π (xπ ) ,
(3)
where π(β
) βΌ GP(0, ππ2 , π1 ) and f(x) = [π1 (x), . . . , ππ (x)]σΈ is a
set of prespecified regression functions. For any sπ β π·β , the
HE can be described by
πβ (sπ ) = π (sπ ) π¦π (sπ ) + πΏ (sπ ) + π (sπ ) ,
(4)
where the scale changes from LE to HE π(β
) βΌ GP(π0 , ππ2 , π3 ),
the location adjustment πΏ(β
) βΌ GP(πΏ0 , ππΏ2 , π2 ), the measurement error π βΌ π(0, ππ2 ) and π(β
), πΏ(β
), and π are jointly independent. Let Yπ = (ππ (x1 ), . . . , ππ (xπ ))σΈ and Yβ = (ππ (s1 ), . . . ,
ππ (sπ ))σΈ .
π
β 2ππ2 ππ2 aσΈ 2 k1 β 2π02 ππ2 aσΈ 2 r2π1
+ ππΏ2 (aσΈ 2 Rπ2 a2 β 2aσΈ 2 rπ2 + 1) + ππ2 (aσΈ 2 a2 + 1) ,
(8)
where V = (π
π3 (sπ β sπ )f(sπ )f σΈ (sπ )), V1 = (π
π1 (sπ β sπ )π
π3 (sπ β
sπ )), Rπ21 = (π
π1 (sπ β sπ )), Rπ2 = (π
π2 (sπ β sπ )), 1 β€ π, π β€ π;
k = (π
π3 (x0 β sπ )f σΈ (sπ )), k1 = (π
π1 (x0 β sπ )π
π3 (x0 β sπ )), r2π1 =
(π
π1 (x0 β sπ ))σΈ , rπ2 = (π
π2 (x0 β sπ ))σΈ , 1 β€ π β€ π; Rπ11 = (π
π1 (xπ β
xπ )),1 β€ π, π β€ π; Rπ31 = (π
π1 (xπ β sπ )),1 β€ π β€ π, 1 β€ π β€ π;
r1π1 = (π
π1 (x0 β xπ ))σΈ , 1 β€ π β€ π.
Thus the Lagrange multipliers can be used to solve BLUP
corresponding to a1 and a2 that minimize (5) subject to
aσΈ 1 F1 + π0 aσΈ 2 F2 = π0 f σΈ (x0 ) ,
aσΈ 2 1π = 1.
The Lagrange function is
σΈ
Theorem 1. For x0 β [0, 1] , the BLUP of πβ (x0 ) is a Y, where
πΏ = ππ2 (aσΈ 2 π½σΈ Vπ½a2 + f σΈ (x0 ) π½π½σΈ f (x0 )
a
β1
β1
= Bβ1 A (AσΈ Bβ1 A) k1 + [Bβ1 β Bβ1 A (AσΈ Bβ1 A) AσΈ Bβ1 ] k2 ,
β 2aσΈ 2 kπ½π½σΈ f (x0 )) + ππ2 aσΈ 1 Rπ11 a1 ππ2 ππ2 aσΈ 2 V1 a2
(5)
+ π02 ππ2 aσΈ 2 Rπ21 a2 + (ππ2 + π02 ) ππ2 + 2π0 ππ2 aσΈ 1 Rπ31 a2
F1 0
A=[
],
π0 F2 1π
k1 = [
π0 f (x0 )
1
(9)
β 2π0 ππ2 aσΈ 1 r1π1 β 2ππ2 ππ2 aσΈ 2 k1 β 2π02 ππ2 aσΈ 2 r2π1
(10)
+ ππΏ2 (aσΈ 2 Rπ2 a2 β 2aσΈ 2 rπ2 + 1) + ππ2 (aσΈ 2 a2 + 1)
],
π0 ππ2 r1π1
k2 = [ 2
],
ππ kπ½π½σΈ f (x0 ) + ππ2 ππ2 k1 + π02 ππ2 r2π1 + ππΏ2 rπ2
+ 2πσΈ 1 (FσΈ 1 a1 + π0 FσΈ 2 a2 β π0 f (x0 )) + 2π 2 (1σΈ π a2
(6)
Let the gradient with respect to a1 , a2 , π1 , π 2 be zero, and we
have
π
k1
0 AσΈ
[
(11)
] [ ] = [ ],
k2
a
A B
and B
ππ2 Rπ11
π0 ππ2 Rπ31
].
=[ 2 3 σΈ 2 σΈ
ππ R
ππ π½ Vπ½ + ππ2 ππ2 V1 + π02 ππ2 Rπ21 + ππΏ2 Rπ2 + ππ2 Iπ
[ 0 π π1
]
Μβ (x0 ) = aσΈ Yπ + aσΈ Yβ + πΆ based
Proof. The linear predictor π
1
2
on training data YπΏ and Yπ» at an untried point x0 is unbiased
for πβ (x0 ) provided
aσΈ 1 F1 + π0 aσΈ 2 F2 = π0 f σΈ (x0 ) ,
aσΈ 2 1π = 1,
(7)
πΆ = 0.
For any linear unbiased predictor (LUP) of π0 = πβ (x0 ),
Μβ (x0 ) = aσΈ Yπ + aσΈ Yβ , the mean squared prediction error
π
1
2
(MSPE) of aσΈ 1 Yπ + aσΈ 2 Yβ is πΈ{π2 } = πΈ{(aσΈ 1 Yπ + aσΈ 2 Yβ β π0 )2 }.
We have
πΈ {π2 }
= ππ2 (aσΈ 2 π½σΈ (...truncated)