High-Precision Kriging Modeling Method Based on Hybrid Sampling Criteria

Shi, Junjun; Shen, Jingfang; Li, Yaohui

doi:10.3390/math9050536

Open AccessArticle

High-Precision Kriging Modeling Method Based on Hybrid Sampling Criteria

by

Junjun Shi

¹,

Jingfang Shen

^1,* and

Yaohui Li

^1,2

¹

College of Science, Huazhong Agricultural University, Wuhan 430070, China

²

School of Mechanical and Electrical Engineering, Xuchang University, Xuchang 461000, China

^*

Author to whom correspondence should be addressed.

Mathematics 2021, 9(5), 536; https://doi.org/10.3390/math9050536

Submission received: 7 February 2021 / Revised: 26 February 2021 / Accepted: 26 February 2021 / Published: 4 March 2021

(This article belongs to the Special Issue Surrogate Modeling and Related Methods in Science and Engineering)

Download

Browse Figures

Versions Notes

Abstract

:

Finding new valuable sampling points and making these points better distributed in the design space is the key to determining the approximate effect of Kriging. To this end, a high-precision Kriging modeling method based on hybrid sampling criteria (HKM-HS) is proposed to solve this problem. In the HKM-HS method, two infilling sampling strategies based on MSE (Mean Square Error) are optimized to obtain new candidate points. By maximizing MSE (MMSE) of Kriging model, it can generate the first candidate point that is likely to appear in a sparse area. To avoid the ill-conditioned correlation matrix caused by the too close distance between any two sampling points, the MC (MSE and Correlation function) criterion formed by combining the MSE and the correlation function through multiplication and division is minimized to generate the second candidate point. Furthermore, a new screening method is used to select the final expensive evaluation point from the two candidate points. Finally, the test results of sixteen benchmark functions and a house heating case show that the HKM-HS method can effectively enhance the modeling accuracy and stability of Kriging in contrast with other approximate modeling methods.

Keywords:

surrogate model; Kriging; infill sampling criteria; multiplication and division

1. Introduction

In the field of engineering design and scientific practice, simulation models are widely used to imitate and analyze expensive black box problems. The rapid development and advancement of technology makes some black box problems have greater nonlinearity and computational complexity. Therefore, the simulation models will undoubtedly become more complex and the computing cost will be more expensive. All of these bring additional challenges to the black box simulation problems. Under these circumstances, approximate models, especially for the surrogate models [1,2] (also known as meta-models or response surfaces), are widely applied to better simulate complex black box problems. Using surrogate models to approximately replace expensive simulations can not only significantly reduce the time spent, but also effectively improve the modeling accuracy.

Due to the actual demand of engineering and science, different types of surrogate models have sprung up. The classics mainly include Support Vector Regression (SVR) [3,4], Polynomial Response Surface (PRS) [5,6], Radial Basis Function (RBF) [7,8], Multiple Adaptive Regression Spline (MARS) [9] and Kriging [10,11,12]. The PRS model can solve low-order, low-dimensional and easy-to-access engineering design problems. However, it is not suitable for solving high-dimensional nonlinear multi-modal design problems. RBF is an interpolation method that uses the sum of the weights of the basic functions to approximate complex black box problems. The RBF method can more accurately approximate nonlinear and high-dimensional black box problems, but the approximate requires it to use more function evaluations, which in turn increases the time cost. The computational complexity of the SVR model depends on the number of support vectors rather than the dimension of the sample space. In this way, the dimension problem can be avoided. However, the time cost of the SVR model is relatively high, and it is difficult to handle the problem of large-scale training samples. MARS is a nonlinear nonparametric regression method which can use piecewise linear splines to simulate the relationship between the dependent and independent variables and has good adaptability. However, with the expansion of the sample space, the MARS method is difficult to divide the space effectively, and the recursive partitioning makes the model lack continuity. In the high-dimensional case, it does not have the ability to capture relatively simple functional relationships. However, the Kriging model always has good approximation properties. The reason is that Kriging model not only has smaller errors and higher accuracy, but also can handle linear or nonlinear, simple or complex, low-dimensional or high-dimensional problems. Secondly, the Kriging can estimate the uncertainty of the model, and its basis function usually has adjustable parameters. Moreover, the Kriging model can ensure the smoothness of the function and has high execution efficiency and good accuracy. These excellent properties make the Kriging model popular. Craven et al. applied the Kriging surrogate model to medicine, in which Kriging has a good effect on predicting the power law coefficient of hemolysis [13]. Song et al. applied the Kriging surrogate model to the layout design of Vortex-Induced Piezoelectric Energy Converters (VIPECs) [14]. Yu et al. applied the Kriging model to the tip leakage flow control of the multi-stage DBD (dielectric barrier discharge) plasma actuator blade cascade [15]. The Kriging method can increase the efficiency of permanent magnet synchronous generators (PMSG) for vehicles [16].

Although the Kriging model shows superior performance and has been successfully applied in various fields, there are still some problems in the Kriging model that need to be further studied [17]. The specific problems are as follows. (1) The infilling sampling criterion will affect the approximation accuracy of the Kriging model. How to find a good infilling sampling strategy to select the next sampling point is the core of the iterative process of the adaptive sequence experiment design method based on Kriging. (2) What are the internal relationships between the accuracy of the Kriging model and its eight kernel functions? (3) How to determine the best applicable field of each kernel function is an issue that needs attention. We mainly discuss the construction problem of infilling sampling criterion for Kriging in this work.

A good sampling strategy can directly enhance the accuracy and efficiency of the surrogate model, which is very helpful for solving computationally expensive simulation systems and complex engineering design problems. Therefore, many scholars put forward the sampling strategy of the Kriging model and applied it to the modeling process of the Kriging model. Among them, the adaptive sampling strategy based on active learning [18] has received more and more attention in recent years. The next update point of the simplest adaptive sampling strategy is selected according to the maximum mean square error (MMSE) [19]. The MMSE method is widely used because of its high accuracy and low calculation time. An adaptive sampling method in the framework of bias-variance decomposition was proposed in [20]. The sampling criterion of this method is to maximize the expected prediction error of the bias and variance information to select the sampling point. An adaptive Bayesian sequential sampling method was proposed in [21], which determines the new sampling point by maximizing the determinant of the adjusted covariance matrix. Sacks et al. select new points by maximizing the expected reduction in integrated mean square error (IMSE) in the entire design space [22]. Ping proposed a Kriging meta-model adaptive sampling strategy (KMDT) based on Delaunay triangulation and TOPSIS [23]. In addition, there is also the maximum entropy sampling method (ME) [24], parallel adaptive sampling strategy (PAS) [25], maximum curvature minimum distance sequence sampling [26], etc. These are all effective methods to increase the update points, but these methods are limited to adding a new sample point in one iteration cycle and cannot use more computer resources to improve the calculation efficiency.

To generate multiple sampling points in each cycle, a multi-point plus point strategy was presented by considering the predicted response value and uncertainty of the Kriging model. A multi-point sampling strategy based on Kriging (MPSK) was presented in [27]. This strategy assumes that every unobserved point in the region may be the next update point. During the sampling process, the expected improvement function is transformed into a cumulative probability distribution function, and an appropriate new sample is intelligently drawn within a certain probability region. In addition, Li proposed a Kriging-based multi-point sequential sampling optimization (KMSSO) method [28]. So far, there are few relevant literatures and research results to solve this problem. The reasons for this phenomenon are as follows. When adding multiple points at the same time, the correlation between each point needs to be considered, otherwise it may lead to an ill-conditioned matrix, or find the same point and cause modeling failure. Adding multiple points at the same time may take too much time, resulting in low modeling efficiency. Adding multiple points at the same time does not guarantee that each point is promising and may fail to improve the accuracy.

To overcome this bottleneck problem, a high-precision Kriging modeling method based on hybrid sampling criteria (HKM-HS) is proposed. Two sampling strategies, MMSE and MC, are included in HKM-HS. First, the MMSE sampling strategy is used to generate the first candidate point. If the second candidate point is directly generated by MMSE, the distance between the two candidate points may be very close, resulting in the ill-conditioned correlation matrix. Therefore, the correlation function is added as a selection condition when selecting the second candidate point. MMSE is to ensure that the candidate points have the maximum amount of information, while the minimum correlation function is to ensure that the distance between the candidate points and the existing sample points is not too close, thereby ensuring that the new correlation matrix will not be ill-conditioned. The MC criterion is obtained by combining the maximum mean square error and the minimum correlation function through the multiplication and division method, and the second candidate point is obtained by MC sampling criterion. Finally, the evaluation points are selected from two candidate points.

The rest of this work is organized as follows. The second part discusses the basic background of the Kriging model. The third part proposes two sampling criteria and gives specific steps for determining new sampling points. The fourth part tests 16 benchmark functions and two actual cases and gives the test results. The fifth part is the application of HKM-HS method in house heating. The sixth part is the conclusion.

2. Related Work

2.1. Kriging

In statistics, the interpolation process of Kriging is finished by a Gaussian process controlled by the priori covariance. Under appropriate a priori assumptions, the Kriging model can provide the best linear unbiased prediction. The specific details are stated as follows.

Suppose there are m sample points

X = {[x_{1}, \dots, x_{m}]}^{T}

,

X \in ℜ^{m \times n}

and object response

Y = {[y_{1}, \dots, y_{m}]}^{T}

. The Kriging model is expressed as follows.

y (x) = μ (x) + z (x)

(1)

The Equation consists of two parts. The first part

μ (x)

is the trend function. When

μ (x)

is equal to 0,

μ

and

\sum_{i = 1}^{p} β_{i} f_{i} (x)

are called simple Kriging, ordinary Kriging, and standard Kriging models, respectively. Where

f_{i} (x)

and

β_{i}

are the regression function and the corresponding regression function coefficients, respectively. Standard Kriging uses the regression function to determine the change of process mean [29].

The second part of Equation (1) is a stochastic process model established by observing the data and quantifying the correlation of data.

z (x)

is considered as the realization of the stochastic process

Z (x)

with mean of 0 and variance of

σ^{2}

. The covariance between sampling points is shown in Equation (2).

E [Z (x_{i}) Z (x_{j})] = σ^{2} R (θ, x_{i}, x_{j})

(2)

where process variance

σ^{2}

is a scalar and

θ

is a key parameter of the kernel function. By optimizing

θ

, the correlation between the design points can be adjusted adaptively. The expression of the kernel function is as follows.

R (θ, x_{i}, x_{j}) = \prod_{k = 1}^{n} R_{k} (θ_{k}, x_{i}^{k} - x_{j}^{k})

(3)

There are many options for spatial correlation functions. However, the most widely used correlation function is the Gaussian function. The expression of the Gaussian model is as follows.

R_{k} (θ_{k}, x_{i}^{k} - x_{j}^{k}) = \exp (- θ_{k} {|x_{i}^{k} - x_{j}^{k}|}^{2})

(4)

Under the above assumptions, the best linear unbiased estimate of

y (x)

is

\hat{y} (x) = f^{T} (x) \hat{β} + r^{T} (x) R^{- 1} (Y - F \hat{β})

(5)

r

and

R

in Equation (5) are

r (x) = \{R (θ, x, x_{1}), R (θ, x, x_{2}), \dots, R (θ, x, x_{m})\}

(6)

R_{m \times m} = [\begin{matrix} R (θ, x_{1}, x_{1}) & \dots & R (θ, x_{1}, x_{m}) \\ ⋮ & ⋱ & ⋮ \\ R (θ, x_{m}, x_{1}) & \dots & R (θ, x_{m}, x_{m}) \end{matrix}]

(7)

Among them, the least square estimation

\hat{β}

of

β

can be obtained by calculating Equation (8).

\hat{β} = {(F^{T} R^{- 1} F)}^{- 1} F^{T} R^{- 1} Y

(8)

In addition, the predicted mean square error

\hat{s} (x)

of

\hat{y} (x)

can be calculated by Equation (9).

\hat{s} (x) = M S E [\hat{y} (x)] = {\hat{σ}}^{2} \{1 - [f {(x)}^{T} r {(x)}^{T}] [\begin{matrix} 0 & F^{T} \\ F & R \end{matrix}] [\begin{matrix} f (x) \\ r (x) \end{matrix}]\}

(9)

where

{\hat{σ}}^{2} = \frac{1}{m} {(Y - F \hat{β})}^{T} R^{- 1} (Y - F \hat{β})

(10)

It can be seen from Equations (3), (8) and (9) that the matrix

R

, vector

\hat{β}

, and the estimated variance

\hat{s} (x)

of the unknown point all depend on the value of the parameter

θ

. Based on the maximum likelihood estimation theory, the unconstrained optimization algorithm is used to maximize Equation (11) to obtain the optimal

θ

value [30].

θ = \arg \max (- \frac{m}{2} \ln {\hat{σ}}^{2} - \frac{1}{2} \ln |R|)

(11)

In fact, the best approximation curve is not necessarily obtained from the optimal value, as long as the value is close to the optimal value, there will be a good approximation result.

2.2. AME

The AME (adaptive maximum entropy) [21] sampling method determines the new sampling point by maximizing the determinant of the adjusted covariance matrix. The adjusted covariance matrix expression is as follows.

cov (x_{i}, x_{j}) = σ^{2} \prod_{k = 1}^{n} \exp (- η_{i} η_{j} θ_{k} {|d_{k}|}^{2}) = σ^{2} \prod_{k = 1}^{n} \exp (- {(\frac{e_{i}}{e_{\max}})}^{γ} {(\frac{e_{j}}{e_{\max}})}^{γ} θ_{k} {|d_{k}|}^{2})

(12)

where

η_{i}

and

η_{j}

are the adjust factors for points

x_{i}

and

x_{j}

, respectively. They are determined by the errors at

x_{i}

and

x_{j}

. The parameter

γ

can be adjusted to

η_{i}

to balance local exploitation and global exploration. The adjusted covariance matrix not only considers the distance factor between two points, but also considers the error of the two points. Sampling more points in the area with a larger error can improve the accuracy of the meta-model.

When

x_{i}

is a known sample point,

e_{i} = |y (x_{i}) - \hat{y} (x_{i})|

. When

x_{i}

is the unsampled point,

e_{i} = |y (x_{j}) - \hat{y} (x_{j})|

, where

x_{j}

is the sampled point closest to

x_{i}

.

3. HKM-HS Method

The purpose of this paper is to add one or two new sampling points in an optimization iteration of the Kriging model. The HKM-HS method with two infilling sampling criteria can achieve this purpose. The specific details of the two infilling sampling criteria are as follows.

3.1. Maximizing Mean Square Error (MMSE)

A good sampling criterion can not only improve the accuracy and efficiency of modeling, but also reduce the cost of modeling. The criterion for evaluating the merits and demerits of the sampling methods are as follows. Whether the newly discovered sampling points have more information in the design field and whether the sampling points can be more evenly distributed in the entire design space under the premise of maintaining certain independence has yet to be determined. The common sampling criteria are maximum mean square error (MMSE), integrated mean square error (IMSE) [22], an adaptive Bayesian sequential sampling method [21] and maximum entropy sampling method (ME) [24]. The ME method tends to add candidate points far away from the current sample points, without considering the objective information of the design point. The adaptive Bayesian sequential sampling method considers not only the distance factor but also the error factor when adding design points, but the running time is slightly longer than the MMSE and IMSE methods. Both MMSE and IMSE methods can obtain the next sampling point by combining the known sample data with the Kriging model. The ideas of the two methods are similar, but the MMSE method is simpler and easier. The MMSE method can obtain sampling points with potentially useful information from the current Kriging model. In the sampling process, the distance between sampling points is considered to ensure that the new sampling data is evenly distributed in the entire design space. Therefore, MMSE sampling criterion is selected as the final sampling criterion. The established model is as follows.

\max f_{1} (x) = \hat{s} (x)

(13)

where

\hat{s} (x) = {\hat{σ}}^{2} \{1 - [f {(x)}^{T} r {(x)}^{T}] [\begin{matrix} 0 & F^{T} \\ F & R \end{matrix}] [\begin{matrix} f (x) \\ r (x) \end{matrix}]\}

is a function about the unknown point

x

. The purpose of Equation (13) is to find the candidate point

x

when the mean square error is maximized.

3.2. MC Criterion

The spatial correlation function (SCF) can affect the smoothness of the Kriging model. This correlation applies to not only the known sample points, but also to the quantified observations. There are eight commonly used correlation function models in SCF [31,32,33]. See Table 1 for details.

The following is an example to prove the advantages of the Gaussian correlation function. A simple one-dimensional problem is expressed as [25]:

f (x) = {(6 x - 2)}^{2} \sin (12 x - 4) x \in [0, 1]

Different correlation functions are used in Kriging modeling process. Figure 1 shows the trend and error of the approximate function.

AE (absolute error) in Figure 1b is the absolute error between the actual value and the predicted value. The Figure 1a shows the relationship between the original function and the seven Kriging approximate functions based on different correlation functions. The approximate results show that the selection of correlation function does have a certain influence on the accuracy of the Kriging modeling and the selection of new samples. When the kernel function of the Kriging is the Gaussian function, the approximate function is closest to the original function. Figure 1b indicates that the Kriging approximation function with Gaussian kernel function has the smallest error, which shows that the Kriging approximation function has the highest prediction accuracy.

To further verify the effectiveness of the Gaussian kernel function, a three-dimensional function and a five-dimensional function are used as examples to verify it. Figure 2a,b are graphs of the average absolute error of each experiment run 20 times. Obviously, the Kriging model based on the Gaussian kernel function has the smallest average absolute error, which proves the effectiveness of the Gaussian kernel function.

f (x) = (x_{3} + 2) x_{2} x_{1}^{2} x_{i} \in [0, 2]

f (x) = \sqrt{2} (x_{1} + x_{2} + x_{3} + x_{4} + x_{5}) x_{i} \in [0, 1]

When comparing the effectiveness of multiple kernel functions, the Exponential Gaussian kernel function does not participate in the comparison. There are two main reasons. One is that the specific expression varies according to the parameter

θ_{n + 1}

, so there is no fixed expression. The other is that the Exponential kernel function and the Gaussian kernel function are both special forms of the Exponential Gaussian kernel function; that is, the exponential kernel and the Gaussian kernel belong to the Exponential Gaussian kernel.

According to these analyses, the Gaussian kernel function is finally selected as the correlation function. Its specific expression is shown in Equation (4).

R_{k} (θ_{k}, x_{i}^{k} - x_{j}^{k}) = \exp (- θ_{k} {|x_{i}^{k} - x_{j}^{k}|}^{2})

The smaller the correlation between the new sampling point and the known sampling points, the better. If the correlation value is large, the matrix

R

may be ill-conditioned. In view of this, the requirement of minimizing the correlation function in Equation (13) need be met to minimize the correlation between new sampling points and all design points.

\min f_{2} (x) = \sqrt{r_{1}^{2} + \dots + r_{m}^{2}}

(14)

where parameter

r_{i} = R (θ, x, x_{i}) i = 1, \dots, m

is the correlation between new sampling point and the

i

th sample point.

Multiplication and division can be used to deal with multi-objective optimization problems [34,35]. The multiplication or division of multiple objective optimization functions can make multi-objective optimization problems have a clearer meaning and can transform the multi-objective optimization problem into a single-objective optimization problem, so that the single-objective contains the optimization intent of each single objective in the multi-objective. If multiple objective functions need to be maximized, we can multiply them together to form a new objective function

f (x) = f_{1} (x) \cdot f_{2} (x) \dots f_{k} (x)

; or we can use a weighted geometric mean to construct a new objective function, such as

f (x) = {[f_{1} (x)]}^{α_{1}} \cdot {[f_{2} (x)]}^{α_{2}} \dots {[f_{k} (x)]}^{α_{k}}

, where

α_{i} \geq 0, 1 \leq i \leq k, \sum_{i = 1}^{k} α_{i} = 1

. If some of the multiple objective functions need to be maximized and others need to be minimized, the product of all objective functions to be maximized can be divided by the product of all objective functions that are minimized and then maximized, or the product of all the objective functions that need to be minimized is divided by the product of all the objective functions to be maximized and then minimized, and so on. In this way, the problem can be transformed into a problem of maximizing or minimizing the value in the form of “multiplication and division.” For the problems raised in this paper, we can get the following single-objective optimization model:

\min G (x) = \frac{f_{2} (x)}{f_{1} (x)} = \frac{{\hat{σ}}^{2} \{1 - [f {(x)}^{T} r {(x)}^{T}] [\begin{matrix} 0 & F^{T} \\ F & R \end{matrix}] [\begin{matrix} f (x) \\ r (x) \end{matrix}]\}}{\sqrt{r_{1}^{2} + \dots + r_{m}^{2}}}

(15)

where

f_{1} (x)

represents the mean square error function, as shown in Equation (13);

f_{2} (x)

represents the correlation function, as shown in Equation (14). Equation (15) is obtained by combining the two objective functions of Equations (13) and (14) through multiplication and division. The significance of building model (15) is to transform two objective functions into a single objective function.

The range of each

r_{i} (i = 1, \dots, m)

in function

f_{2}

is

[0, 1]

, the corresponding range of

f_{2}

is

[0, \sqrt{m}]

. The range of

f_{1}

cannot be determined because the MSE of different functions is different, but the value of

f_{1}

is always greater than 0.

In addition, the following situations may exist. When the range of

f_{1}

is much smaller than that of

f_{2}

, a small change of the function

f_{1}

will greatly affect the optimization results. In this situation, the function

f_{1}

plays a leading role, and the selection of point pays more attention to the correlation between candidate point and all sample points. Similarly, when the range of

f_{1}

is much bigger than that of

f_{2}

, the function

f_{2}

dominates the optimization results, and the selection of point pays more attention to the amount of information of the candidate points. In either case, the point found can be used as a candidate point. Therefore, Equation (15) can be optimized directly. The optimal value obtained at this time is the second candidate sampling point.

3.3. Screening Method

Two candidate sampling points were generated through two infilling sampling criteria. In this paper, the PSO (particle swarm optimization) algorithm of MATLAB was used to optimize the sampling criteria. The first candidate point

x_{i}

was obtained by optimizing the MMSE function (Equation (13)) through the PSO algorithm. The second candidate point

x_{j}

was obtained by optimizing the

G (x)

function (Equation (15)) with the PSO algorithm. Then, one or two candidate sampling points were selected as new sampling points. The method of selecting sampling points is described as follows.

First, set the correlation threshold to 0.001. The range of correlation is

[0, 1]

. The closer the correlation value is to 0, the higher the mutual independence between the two points. Similarly, the closer the correlation value is to 1, the lower the mutual independence between the two sample points. In addition, the distance between two points with low independence may be too close, resulting in an ill-conditioned correlation matrix. Set the correlation threshold to 0.001, which is a value close enough to 0. This setting can ensure that the two candidate points have independent information and avoids the appearance of an ill-conditioned correlation matrix, which may lead to the consequences of modeling failure.

Second, determine the correlation value between two candidate points. The correlation value between two candidates

x_{i}

and

x_{j}

is determined by Gaussian correlation function. If the correlation value is less than 0.001, it means that the correlation between the two candidates

x_{i}

and

x_{j}

is very small. At this time, both

x_{i}

and

x_{j}

are selected as new sampling points. If the correlation value is equal to or greater than 0.001, it means that the correlation between the two candidates

x_{i}

and

x_{j}

is large. At this time, if both

x_{i}

and

x_{j}

are selected as new sampling points, the correlation matrix may be ill-conditioned. Therefore, in this case, select the point with large mean square error as the new sampling point.

3.4. Implementation of the HKM-HS Method

This section introduces the specific implementation process of the HKM-HS model. Table 2 shows the specific implementation steps of the HKM-HS model. Table 3 is the pseudocode of the HM-HS algorithm. Figure 3 is the flowchart of the HKM-HS model.

4. Numerical Experiment

To test the modeling accuracy and stability of the HKM-HS method, the 16 benchmark functions were tested with HKM-HS, MMSE, LHD and AME [21] methods, respectively, and the test results of the four methods were compared. Then the HKM-HS algorithm was applied to the problems of spring design, simple pendulum system and a house heating case.

4.1. Benchmark Function Test

In this paper, a total of 16 benchmark functions were tested, and the dimensional range of these functions was 2 to 16 dimensions. Among them, there were 6 two-dimensional benchmark functions. Descriptions of these functions can be found in Table 4.

These benchmark functions were tested by the HKM-HS, MMSE, LHD and AME algorithms, respectively. The number of design points to build these four models is the same (step 6), and the size of initial design point was set as 2n (step 1). There are two main motivations. (a) There is little difference between the initial point setting in the following papers and that in this paper. In the papers [36,37], 2n sample points were selected as the initial points in some experiments. This shows that it is appropriate to select 2n as the number of initial sample points. (b) The accuracy of the initial model is relatively low when the size of the initial design point is fixed to 2n. Experiments performed under this condition can better show the improvement of model accuracy, which can verify the effectiveness of the proposed method. In the modeling process, new sample points are obtained (step 4–5) and the Kriging model is updated constantly (step 2–3).

The six two-dimensional functions were classified according to their similarity in significant physical properties and shapes, so as to more accurately determine the influence of the algorithm on different types of functions. The characteristic of the first type of function is that it has many local minimums. Bukin and Schwefel functions belong to the first type. The results of the MSE obtained by the test are shown in Figure 4 and Figure 5.

The characteristic of the second type of benchmark function is that it is the plate-shaped function. The Mccormick function belongs to the second type. Figure 6 indicates the test results of MSE. The third type of benchmark function is a valley-shaped function. The sixHump function belongs to the third type. The results of its MSE test are shown in Figure 7. The remaining two functions are divided into the fourth type. Figure 8 and Figure 9 are their MSE results.

The mean square error results of different types of two-dimensional benchmark functions tested by the HKM-HS, MMSE, LHD and AME algorithms are shown in Figure 4, Figure 5, Figure 6, Figure 7, Figure 8 and Figure 9. These figures show, based on the test results of these six benchmark functions, that the HKM-HS algorithm has the smallest mean square error, which indicates that the HKM-HS algorithm has good test results for different types of two-dimensional benchmark functions. Compared with other three modeling methods, the HKM-HS modeling method has higher accuracy.

There are ten high-dimensional benchmark functions. The benchmark functions are divided into four types. The characteristic of the first type of function is that it has many local minima. The Levy function and Rastrigin function belong to the first type of function. The property of the second type of function is that it is the plate-shaped function. The DixonPrice and Rosenbrock functions belong to the second type. The third type of function has a steep ridge. The Michalewicz function and Michalewicz 10 function belong to the third type of function. The remaining four functions are divided into the fourth type. Each benchmark function is tested 30 times. The average value and standard deviation of the 30 test results is taken as the data in Table 5.

The RMSE results in Table 5 are obtained by leaving a cross-validation method. The calculation formula of RSME is as follows.

R M S E = \frac{1}{k} \sqrt{\sum_{i = 1}^{k} {\hat{s}}^{2}_{i}}

(16)

where

k

is the maximum expensive evaluation number. Assuming that

x_{i}

is one of the sample points, the variance at

x_{i}

is evaluated as

{\hat{s}}_{i}^{2}

.

{\hat{s}}_{i}^{2}

is the predicted variance obtained from the Kriging model constructed by the remaining

k - 1

sample data.

Figure 10, Figure 11, Figure 12, Figure 13 and Figure 14 are box plots of the test results of each type benchmark function.

The RMSE results of 10 benchmark functions with different dimensions tested using HKM-HS, AME, MMSE and LHD algorithms are listed in Table 5. Table 5 concluded that in the test of 10 benchmark functions, the test result of 8 functions showed that the HKM-HS algorithm has the smallest average value of RMSE. For the Levy function, the LHD method has the smallest average value and standard deviation of RMSE. The AME algorithm has a better result in the DixonPrice function. The test results of these four methods were compared, and the comparison results show that the accuracy and stability of the HKM-HS method is slightly higher than the other three methods.

The Levy and DixonPrice benchmark functions belong to the first and second types of functions, instead of focusing on one type of function. This shows that the method proposed in this paper is applicable to various types of functions. Figure 10, Figure 11, Figure 12, Figure 13 and Figure 14 also show that the results obtained by the HKM-HS algorithm have better stability and accuracy. In short, the test results of 10 benchmark functions show that compared with AME, MMSE and LHD, the HKM-HS method has higher modeling accuracy and stability. Therefore, it is feasible to apply the HKM-HS method to practical cases.

4.2. Examples

4.2.1. Spring Design Problem

Figure 15 is a schematic diagram of the spring design. Arora was the first to propose the problem of spring design [38]. The purpose of designing the spring is to make the weight of the spring as small as possible. The minimum deflection, shear stress, and oscillation frequency are used as the constraints of the model.

The model expression formula is:

\min f (x) = (x_{3} + 2) x_{2} x_{1}^{2}

The constraints are:

g_{1} (x) = 1 - \frac{x_{2}^{3} x_{3}}{71785 x_{1}^{4}} \leq 0

g_{2} (x) = \frac{4 x_{2}^{2} - x_{1} x_{2}}{12566 (x_{2} x_{1}^{3} - x_{1}^{4})} + \frac{1}{5108 x_{1}^{2}} - 1 \leq 0

g_{3} (x) = 1 - \frac{140.45 x_{1}}{x_{2}^{2} x_{3}} \leq 0

g_{4} (x) = \frac{x_{2} + x_{1}}{1.5} - 1 \leq 0

The HKM-HS method studies unconstrained problems, so it is necessary to transform the constraint problem into an unconstrained problem. If any variable in the variable interval satisfies all constraints, then the problem in the interval is an unconstrained problem. The nature of the constraints determines that the three design variables are closely related and influence each other, so it is impossible to obtain an exact variable interval. Therefore, we only need to find the subinterval which satisfies all the constraints. By simplifying and calculating the constraints, the subinterval that satisfies all the constraints obtained in this paper is

x_{1} \in [- 3, - 2.6]

,

x_{2} \in [- 5 \times 10^{4}, - 4 \times 10^{4}]

and

x_{3} \in [- 1.1 \times 10^{- 7}, - 9.1 \times 10^{- 8}]

.

HKM-HS, MMSE, LHD and AME methods are used to model the spring design problem. The modeling results are shown in Table 6 and Figure 16.

The average value and standard deviation of RMSE obtained by running each method 30 times are listed in Table 6. In Table 6, the average RMSE value obtained by the HKM-HS method is the smallest. This shows that in the spring design problem, the accuracy of modeling with the HKM-HS method is higher. Figure 16 and the standard deviation of RMSE reflect the higher stability of the HKM-HS method.

4.2.2. Ten-Bar Planar Truss Problem

Figure 17 is a schematic diagram of a ten-bar planar truss. Barthelemyy was the first to study the ten-bar planar truss [39,40]. The purpose of the study is to minimize the weight of the truss system under the condition that each steel bar stress

σ_{i}

meets the constraint. The ten design variables

x_{i} (i = 1, \dots, 10)

are the cross-sectional areas of each steel bar.

The problem is expressed as

f (x) = 36 (x_{1} + x_{2} + x_{3} + x_{4} + x_{5} + x_{6} + \sqrt{2} (x_{7} + x_{8} + x_{9} + x_{10}))

subject to

0.645 {cm}^{2} \leq x_{i} \leq 64.5 {cm}^{2} i = 1, \dots, 10

- 172375 kpa \leq σ_{i} \leq 172375 kpa i = 1, \dots, 10

Table 7 shows the RMSE results of 30 runs of each method. Figure 18 is the box plot of RMSE. From Table 7 and Figure 18, it can be seen that compared with AME, MMSE and LHD, the HKM-HS has the smallest average value and standard deviation of RMSE. This shows that the HKM-HS method has the highest modeling accuracy and stability. Therefore, compared with the other three methods, the HKM-HS method is more suitable for solving this problem.

5. House Heating

With the continuous progress of society, the problem of an energy shortage cannot be ignored. Energy conservation has received widespread attention worldwide, and China is no exception. Every winter, house heating is a way to consume energy that cannot be underestimated. In order to reduce energy consumption and reduce pollution to the external environment as much as possible, it is necessary to discuss the issue of house heating [41,42]. There are many factors that affect the heating cost of a house, such as the size of the house, the thermal properties of the house materials, the thermal resistance of the house, the characteristics of the heater, the cost of electricity, and the indoor and outdoor temperatures. Different influencing factors have different degrees of influence on the heating cost of houses. Therefore, predicting the heating cost under different conditions has great practical significance for reducing energy consumption, reducing economic costs and reducing environmental pollution.

Figure 19 is a schematic diagram of house heating. In order to study the relationship between house heating cost and influencing factors, we need to set some influencing factors as variables. Define the length, width and height of the house as variables

x_{1}

,

x_{2}

and

x_{3}

, respectively. Suppose the number of windows in the house is six and set the length and width of the windows as variables

x_{4}

and

x_{5}

. Set the initial room temperature (outdoor temperature) as variable

x_{6}

. Set the hot air temperature of the heater as variable

x_{7}

. Set the thickness of the glass wool on the wall as variable

x_{8}

. In addition, it is assumed that the flow rate of the heater is

3600 kg / hr

, the electricity cost is

0.09 $ / kWhr

and the constant pressure specific heat capacity of air is

1005.4 J / kg

.

The house heating simulation model was established by MATLAB and SIMULINK. The variables

x_{1}, \dots, x_{8}

are used as the input parameters of the simulation, and the heating cost of the house is used as the output variable. The heating cost of the house is related to time. Here we take the day as the unit and the heating cost of the house per day as the output variable. The HKM-HS method was used to determine the new valuation point, that is, to determine the different values of 8 variables (influencing factors), and then estimate the heating cost of a day through simulation. From this, the degree of influence of different influencing factors on heating costs can be explored.

In the process of using the HKM-HS method to achieve house heating, first randomly select 8 initial sample points

x_{i} (i = 1, \dots, 8)

through the LHD method and simulate the initial sample points to obtain

y_{i} (i = 1, \dots, 8)

. Secondly, select new estimation points by the method proposed in this paper and perform simulation until the total number of sample points is 24. In order to ensure the accuracy of the experiment, the average RMSE value of the ten results is taken as the error. RMSE is obtained by the cross-validation method. Table 8 shows the average value and standard deviation of ten RMSE values obtained by applying HKM-HS, MMSE, LHD and AME methods to house heating. The error results show that the modeling accuracy of the HKM-HS method is very good. Figure 20, Figure 21 and Figure 22 is an analysis of the housing heating problem.

Figure 20, Figure 21 and Figure 22 show the relationship between different variables and the heating cost of a house. Through the analysis of the experimental results, we can draw the following conclusions. (1) From Figure 22, the changing trend of

x_{6}

and

x_{7}

is almost the same as that of the objective value y. This shows that the hot air temperature of the heater and the initial room temperature of the house have a great influence on the heating cost of the house. This may be because the temperature setting of the heater and initial room temperature determine the use time and start-up frequency of the heater. The higher the temperature setting of the heater, the greater the electricity consumption, which will lead to increased electricity cost. On the contrary, the lower the initial room temperature of the house, the higher the electricity cost. (2) In Figure 20, the changing trends of

x_{1}

,

x_{2}

, and

x_{3}

have some similarities with the changing trends of the objective value y, and the similarity of

x_{1}

is more obvious. This indicates that the geometric structure of the house also has a certain influence on the heating cost. The length of the house has the most obvious influence on the heating cost, because the length occupies the largest weight of the house size. The larger the house, the more electricity the heater consumes for heating, and the higher the heating cost. (3) It can be seen in Figure 21 that the changing trends of

x_{4}

,

x_{5}

, and

x_{8}

are less consistent with that of the objective value y. This implies that these factors have little influence on the heating cost of houses. In short, the HKM-HS method can predict the heating cost of influencing factors under different conditions. Setting a suitable heater temperature according to the size of the house and the initial room temperature can effectively reduce heating cost.

6. Conclusions

The paper proposes a high-precision Kriging modeling method based on hybrid sampling criteria (HKM-HS). First, LHD was used to select the initial sampling point in this method. Secondly, MMSE and MC criteria were optimized to find two candidate points, which were further screened to obtain the final sampling points. Then the HKM-HS modeling method was used to test 16 benchmark functions and three engineering cases. The test results show that the HKM-HS is almost better than other three Kriging-based modeling methods in terms of accuracy and stability. Furthermore, as an actual simulation problem, a housing heating case was used by the proposed method to show that the HKM-HS has a certain degree of engineering application.

However, it is not appropriate for this method to deal with higher dimensional problems. It is the main reason why the increase in dimensions not only causes the construction cost of the Kriging model to rise rapidly, but also cannot guarantee the final modeling accuracy. In view of this, the exploration of high-dimensional problems based on the Kriging model has attracted the attention of some researchers. The key point is how to achieve dimensionality reduction of the high-dimensional Kriging model while ensuring certain accuracy requirements. At present, there are two main ideas to deal with this key issue. One is to transform high-dimensional modeling problems into low-dimensional modeling problems through effective dimensionality reduction methods, thereby increasing the modeling efficiency of Kriging; the other core idea is to reduce the amount of calculation of hyper-parameters in the Kriging modeling process because optimizing hyper-parameters is a very time-consuming task. These research ideas and directions are yet to be further explored and developed by researchers.

Author Contributions

Methodology, J.S. (Junjun Shi) and Y.L.; software, J.S. (Junjun Shi); writing—original draft, J.S. (Junjun Shi) and J.S.; writing—review & editing, J.S. (Jingfang Shen) and Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the National Natural Science Foundation of China (No. 51775472), Science & Technology Innovation Talents in Universities of Henan Province (No. 21HASTIT027), Henan Excellent Youth Fund Project (202300410346), Training Plan of Young Backbone Teachers in Colleges and Universities of Henan Province (2020GGJS209) and Cooperative Research and Model Research Based on Computational Medicinal Chemistry (G20200017074).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Jensen, W.A. Response surface methodology: Process and product optimization using designed experiments. J. Qual. Technol. 2017, 49, 186. [Google Scholar] [CrossRef]
Fan, Y.; Lu, W.; Miao, T.; Li, J.; Lin, J. Multiobjective optimization of the groundwater exploitation layout in coastal areas based on multiple surrogate models. Environ. Sci. Pollut. Res. Int. 2020, 27, 19561–19576. [Google Scholar] [CrossRef] [PubMed]
Yan, C.; Shen, X.; Guo, F. An improved support vector regression using least squares method. Struct. Multidiscip. Optim. 2017, 57, 2431–2445. [Google Scholar] [CrossRef]
Hamed, Y.; Alzahrani, A.I.; Mustaffa, Z.; Ismail, M.C.; Eng, K.K. Two steps hybrid calibration algorithm of support vector regression and K-nearest neighbors. Alex. Eng. J. 2020, 59, 1181–1190. [Google Scholar] [CrossRef]
Fan, C.; Huang, Y.; Wang, Q. Sparsity-promoting polynomial response surface: A new surrogate model for response prediction. Adv. Eng. Softw. 2014, 77, 48–65. [Google Scholar] [CrossRef]
Rashki, M.; Azarkish, H.; Rostamian, M.; Bahrpeyma, A. Classification correction of polynomial response surface methods for accurate reliability estimation. Struct. Saf. 2019, 81, 101869. [Google Scholar] [CrossRef]
Dou, S.-q.; Li, J.-j.; Kang, F. Health diagnosis of concrete dams using hybrid FWA with RBF-based surrogate model. Water Sci. Eng. 2019, 12, 188–195. [Google Scholar] [CrossRef]
Durantin, C.; Rouxel, J.; Désidéri, J.A.; Glière, A. Multifidelity surrogate modeling based on radial basis functions. Struct. Multidiscip. Optim. 2017, 56, 1061–1075. [Google Scholar] [CrossRef] [Green Version]
Keshtegar, B.; Mert, C.; Kisi, O. Comparison of four heuristic regression techniques in solar radiation modeling: Kriging method vs RSM, MARS and M5 model tree. Renew. Sustain. Energy Rev. 2018, 81, 330–341. [Google Scholar] [CrossRef]
Li, T.; Yang, X. An efficient uniform design for Kriging-based response surface method and its application. Comput. Geotech. 2019, 109, 12–22. [Google Scholar] [CrossRef]
van Stein, B.; Wang, H.; Kowalczyk, W.; Emmerich, M.; Bäck, T. Cluster-based Kriging approximation algorithms for complexity reduction. Appl. Intell. 2019, 50, 778–791. [Google Scholar] [CrossRef] [Green Version]
Namura, N.; Shimoyama, K.; Obayashi, S. Kriging surrogate model with coordinate transformation based on likelihood and gradient. J. Glob. Optim. 2017, 68, 827–849. [Google Scholar] [CrossRef]
Craven, B.A.; Aycock, K.I.; Herbertson, L.H.; Malinauskas, R.A. A CFD-based Kriging surrogate modeling approach for predicting device-specific hemolysis power law coefficients in blood-contacting medical devices. Biomech. Model. Mechanobiol. 2019, 18, 1005–1030. [Google Scholar] [CrossRef] [PubMed]
An, X.; Song, B.; Mao, Z.; Ma, C. Layout Optimization Design of Two Vortex Induced Piezoelectric Energy Converters (VIPECs) Using the Combined Kriging Surrogate Model and Particle Swarm Optimization Method. Energies 2018, 11, 2069. [Google Scholar] [CrossRef] [Green Version]
Yu, J.; Wang, Z.; Chen, F.; Yu, J.; Wang, C. Kriging surrogate model applied in the mechanism study of tip leakage flow control in turbine cascade by multiple DBD plasma actuators. Aerosp. Sci. Technol. 2019, 85, 216–228. [Google Scholar] [CrossRef]
Zeng, F.; Bu, J.; Yu, Y.; Bian, H.; Yang, L.; Zi, X. Optimum design of permanent magnet synchronous generator based on MaxPro sampling and kriging surrogate model. IEEJ Trans. Electr. Electron. Eng. 2020, 15, 278–290. [Google Scholar] [CrossRef]
Palar, P.S.; Liem, R.P.; Zuhal, L.R.; Shimoyama, K. On the use of surrogate models in engineering design optimization and exploration: The key issues. In Proceedings of the Genetic and Evolutionary Computation Conference Companion, Prague, Czech Republic, 13–17 July 2019. [Google Scholar]
Settles, B. Active Learning Literature Survey; University of Wisconsin-Madison Department of Computer Sciences: Madison, WI, USA, 2009. [Google Scholar]
Jin, R.; Chen, W.; Sudjianto, A. On sequential sampling for global metamodeling in engineering design. In Proceedings of the ASME 2002 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference, Montreal, QC, Canada, 29 September–2 October 2002. American Society of Mechanical Engineers Digital Collection. [Google Scholar]
Liu, H.; Cai, J.; Ong, Y.-S. An adaptive sampling approach for kriging metamodeling by maximizing expected prediction error. Comput. Chem. Eng. 2017, 106, 171–182. [Google Scholar] [CrossRef]
Liu, H.; Xu, S.; Ma, Y.; Chen, X.; Wang, X. An Adaptive Bayesian Sequential Sampling Approach for Global Metamodeling. J. Mech. Des. 2016, 138, 011404. [Google Scholar] [CrossRef]
Sacks, J.; Welch, W.J.; Mitchell, T.J.; Wynn, H.P. Design and analysis of computer experiments. Stat. Sci. 1989, 4, 409–423. [Google Scholar] [CrossRef]
Jiang, P.; Zhang, Y.; Zhou, Q.; Shao, X.; Hu, J.; Shu, L. An adaptive sampling strategy for Kriging metamodel based on Delaunay triangulation and TOPSIS. Appl. Intell. 2018, 48, 1644–1656. [Google Scholar] [CrossRef]
Xu, J.; Dang, C. A novel fractional moments-based maximum entropy method for high-dimensional reliability analysis. Appl. Math. Model. 2019, 75, 749–768. [Google Scholar] [CrossRef]
Zeng, W.; Sun, W.; Song, H.; Ren, T.; Sun, Y. A parallel adaptive sampling strategy to accelerate the sampling process during the modeling of a Kriging metamodel. J. Chin. Inst. Eng. 2019, 42, 676–689. [Google Scholar] [CrossRef]
Wei, X.; Wu, Y.-Z.; Chen, L.-P. A new sequential optimal sampling method for radial basis functions. Appl. Math. Comput. 2012, 218, 9635–9646. [Google Scholar] [CrossRef]
Cai, X.; Qiu, H.; Gao, L.; Yang, P.; Shao, X. A multi-point sampling method based on kriging for global optimization. Struct. Multidiscip. Optim. 2017, 56, 71–88. [Google Scholar] [CrossRef]
Li, Y. A Kriging-based multi-point sequential sampling optimization method for complex black-box problem. Evol. Intell. 2020, 1–10. [Google Scholar] [CrossRef]
Liu, B.; Xie, L. An Improved Structural Reliability Analysis Method Based on Local Approximation and Parallelization. Mathematics 2020, 8, 209. [Google Scholar] [CrossRef] [Green Version]
Nielsen, H.B. Aspects of the Matlab Toolbox DACE; Technical University of Denmark: Kongens Lyngby, Denmark, 2002. [Google Scholar]
Lophaven, S.; Nielsen, H.; Sondergaard, J.; Dace, A. A Matlab Kriging Toolbox; Technical Report, No. IMMTR-2002-12; Technical University of Denmark: Kongens Lyngby, Denmark, 2002. [Google Scholar]
Rasmussen, C.E.; Williams, C.K.I. Gaussian Processes for Machine Learning; The MIT Press: Cambridge, MA, USA, 2005. [Google Scholar]
Palar, P.S.; Shimoyama, K. Efficient global optimization with ensemble and selection of kernel functions for engineering design. Struct. Multidiscip. Optim. 2018, 59, 93–116. [Google Scholar] [CrossRef]
Yuan, G.L.; Yu, T.; Du, J. Multi-Objective Optimal Load Distribution Based on Sub Goals Multiplication and Division in Power Plants. Appl. Mech. Mater. 2014, 494–495, 1715–1718. [Google Scholar] [CrossRef]
Yun, Y.; Chuluunsukh, A.; Gen, M. Sustainable Closed-Loop Supply Chain Design Problem: A Hybrid Genetic Algorithm Approach. Mathematics 2020, 8, 84. [Google Scholar] [CrossRef] [Green Version]
Schobi, R.; Sudret, B.; Wiart, J. Polynomial-chaos-based kriging. Int. J. Uncertain. Quantif. 2015, 5, 171–193. [Google Scholar] [CrossRef]
Xiong, Y.; Chen, W.; Apley, D.; Ding, X. A non-stationary covariance-based Kriging method for metamodelling in engineering design. Int. J. Numer. Methods Eng. 2010, 71, 733–756. [Google Scholar] [CrossRef]
Arora, J.S. Optimum Design with Excel Solver. In Introduction to Optimum Design; Academic Press: Cambridge, MA, USA, 2012; pp. 213–273. [Google Scholar]
Sobieszczanski-Sobieski, J.; Barthelemy, J.-F.; Riley, K.M. Sensitivity of Optimum Solutions of Problem Parameters. AIAA J. 1982, 20, 1291–1299. [Google Scholar] [CrossRef] [Green Version]
Schmit, L.A.; Farshi, B. Some Approximation Concepts for Structural Synthesis. AIAA J. 1974, 12, 692–699. [Google Scholar] [CrossRef]
Fan, Y.; Zhao, X.; Li, J.; Li, G.; Myers, S.; Cheng, Y.; Badiei, A.; Yu, M.; Akhlaghi, Y.G.; Shittu, S.; et al. Economic and environmental analysis of a novel rural house heating and cooling system using a solar-assisted vapour injection heat pump. Appl. Energy 2020, 275, 115323. [Google Scholar] [CrossRef]
Bezyan, B.; Zmeureanu, R. Machine Learning for Benchmarking Models of Heating Energy Demand of Houses in Northern Canada. Energies 2020, 13, 1158. [Google Scholar] [CrossRef] [Green Version]

Figure 1. (a) Kriging approximate function based on different correlation functions; (b) absolute error (AE) of Kriging approximation functions.

Figure 2. (a) AE of Kriging approximation functions in the three-dimensional question; (b) AE of Kriging approximation functions in the five-dimensional question.

Figure 3. The flowchart of the HKM-HS method.

Figure 4. (a)Test result of Bukin function by HKM-HS; (b) Test result of Bukin function by maximum mean square error (MMSE); (c) Test result of Bukin function by Latin Hypercube Design (LHD); (d) Test result of Bukin function by adaptive maximum entropy (AME).

Figure 5. (a)Test result of Schwefel function by HKM-HS; (b) Test result of Schwefel function by maximum mean square error (MMSE); (c) Test result of Schwefel function by Latin Hypercube Design (LHD); (d) Test result of Schwefel function by adaptive maximum entropy (AME).

Figure 6. (a)Test result of Mccormick function by HKM-HS; (b) Test result of Mccormick function by maximum mean square error (MMSE); (c) Test result of Mccormick function by Latin Hypercube Design (LHD); (d) Test result of Mccormick function by adaptive maximum entropy (AME).

Figure 7. (a)Test result of sixHump function by HKM-HS; (b) Test result of sixHump function by maximum mean square error (MMSE); (c) Test result of sixHump function by Latin Hypercube Design (LHD); (d) Test result of sixHump function by adaptive maximum entropy (AME).

Figure 8. (a)Test result of Alpine function by HKM-HS; (b) Test result of Alpine function by maximum mean square error (MMSE); (c) Test result of Alpine function by Latin Hypercube Design (LHD); (d) Test result of Alpine function by adaptive maximum entropy (AME).

Figure 9. (a)Test result of Gp function by HKM-HS; (b) Test result of Gp function by maximum mean square error (MMSE); (c) Test result of Gp function by Latin Hypercube Design (LHD); (d) Test result of Gp function by adaptive maximum entropy (AME).

Figure 10. (a) Box plot of test results for Levy function; (b) Box plot of test results for Rastrigin function.

Figure 11. (a) Box plot of test results for DixonPrice function; (b) Box plot of test results for Rosenbrock function.

Figure 12. (a) Box plot of test results for Michalewicz function; (b) Box plot of test results for Michalewice10 function.

Figure 13. (a) Box plot of test results for Hartman3 function; (b) Box plot of test results for Shekel function.

Figure 14. (a) Box plot of test results for Hartman6 function; (b) Box plot of test results for F16 function.

Figure 15. Spring design.

Figure 16. Box plot of the RMSE values of the four methods.

Figure 17. Ten-bar planar truss.

Figure 18. Box plot of the RMSE values of the four methods.

Figure 19. House heating.

Figure 20. Trends of design variables

x_{1}

,

x_{2}

,

x_{3}

and objective function obtained by HKM-HS.

Figure 20. Trends of design variables

x_{1}

,

x_{2}

,

x_{3}

and objective function obtained by HKM-HS.

Figure 21. Trends of design variables

x_{4}

,

x_{5}

,

x_{8}

and objective function obtained by HKM-HS.

Figure 21. Trends of design variables

x_{4}

,

x_{5}

,

x_{8}

and objective function obtained by HKM-HS.

Figure 22. Trends of design variables

x_{6}

,

x_{7}

and objective function obtained by HKM-HS.

Figure 22. Trends of design variables

x_{6}

,

x_{7}

and objective function obtained by HKM-HS.

Table 1. Expressions of different correlation functions.

$Spatial Correlation Function Model R_{k} (θ_{k}, d_{k})$ $, where d_{k} = x_{i}^{k} - x_{j}^{k}$
Exponential model	$R_{k} (θ_{k}, d_{k}) = \exp (- θ_{k} \|d_{k}\|)$
Exponential Gaussian model	$R_{k} (θ_{k}, d_{k}) = \exp (- θ_{k} {\|d_{k}\|}^{θ_{n + 1}}), θ_{n + 1} \in (0, 2]$
Gaussian model	$R_{k} (θ_{k}, d_{k}) = \exp (- θ_{k} {\|d_{k}\|}^{2})$
Linear model	$R_{k} (θ_{k}, d_{k}) = \max (0, 1 - θ_{k} \|d_{k}\|)$
Spherical model	$R_{k} (θ_{k}, d_{k}) = 1 - 1.5 ε_{k} + 0.5 ε_{k}, ε_{k} = \min \{1, θ_{k} \|d_{k}\|\}$
Cubic model	$R_{k} (θ_{k}, d_{k}) = 1 - 3 ε_{k}^{2} + 2 ε_{k}^{3}, ε_{k} = \min \{1, θ_{k} \|d_{k}\|\}$
Spline model	$R_{k} (θ_{k}, d_{k}) = \{\begin{matrix} 1 - 15 ε_{k}^{2} + 30 ε_{k}^{3}, & ε_{k} \in (0, 0.2] \\ 1.25 {(1 - ε_{k})}^{3}, & ε_{k} \in (0.2, 1) \\ 0, & ε_{k} \in [1, \infty) \end{matrix}$
Matérn model	$R_{k} (θ_{k}, d_{k}) = (1 + \sqrt{3} θ_{k} \|d_{k}\|) \exp (- \sqrt{3} θ_{k} \|d_{k}\|)$

Table 2. The specific steps of high-precision Kriging modeling method based on hybrid sampling criteria (HKM-HS).

HKM-HS Method

Step 1. Generate initial design points. The LHD (Latin Hypercube Design) method is used to generate

m = 2 n

initial design point

x_{i} (i = 1, \dots, m)

. Then, set the initial sample set to

X

.
Step 2. Determine the sample sets

X

and

Y

. Estimate the expensive objective function of each initial sample point

x_{i} (i = 1, \dots, m)

to obtain

y (x_{i})

. Then, set

Y

to be the set of all

y (x_{i})

. For one or two update points obtained by the infilling sampling criterion, their expensive function evaluation values can be determined. Then the update point and its function evaluation value are added to the sample sets

X

and

Y

, respectively. If there is only one new sampling point, then

m = m + 1

; if there are two new sampling points, then

m = m + 2

.
Step 3. Build Kriging model. The Kriging model will be established by the new data sets

X

and

Y

formed by Step 2, and the construction of Kriging is realized by DACE toolbox in MATLAB.
Step 4. Infill sampling criteria. The new infilling sampling method has two criteria, one is MMSE and the other is the MC sampling strategy based on mean square error and correlation function. See Section 3.1 and Section 3.2 for details. The PSO algorithm of MATLAB is used to optimize the sampling criteria. The first candidate point

x_{i}

is obtained by optimizing the MMSE function (Equation (12)) through PSO algorithm. The second candidate point

x_{j}

is obtained by optimizing the

G (x)

function (Equation (14)) with PSO algorithm.
Step 5. Determine new sampling point. First, two candidate sampling points

x_{i}

and

x_{j}

are generated with the new infilling sampling criteria. Then, one or two candidate sampling points are selected as new sampling points. See Section 3.3 for details.
Step 6. Stopping criterion. The maximum number of function evaluations is set as

N_{\max} = 20 n

. Determine the relationship between m and

N_{\max}

. If the number of sampling points

m

is greater than

N_{\max}

, stop adding points and go to step 7. If the condition is not met, go back to step 2.
Step 7. Stop. This is the output global approximate Kriging model.

Table 3. The pseudocode of HKM-HS.

Algorithm for HKM-HS

1. Begin
2. X and

Y

: Use LHD to generate m initial sample points and perform expensive functional evaluation on them.
3. Update

X

and

Y

: Add the new sample point

x_{i}

to

X

, and add

y (x_{i})

to

Y

.
4. Update m:

m = m + 1

or

m = m + 2

.

5. y (x)

:

The Kriging is established with new sets X and

Y

.

6. x_{i}

and

x_{j}

: Optimize the MMSE function and

G (x)

function by PSO.
7. New sampling points: Screening method (Section 3.3).

8. If m < 20 n

Return to line 3or elsestop if this is the end.
9. End

Table 4. Benchmark functions.

Function	Dimension	Expression
Alpine	2	$y = \|x_{1} \sin (x_{1}) + 0.1 x_{1}\| + \|x_{2} \sin (x_{2}) + 0.1 x_{2}\| x_{i} \in [- 10, 10], i = 1, 2$
Bukin	2	$y = 100 \sqrt{\|x_{2} - 0.01 x_{1}^{2}\|} + 0.01 \|x_{1} + 10\| x_{1} \in [- 15, - 5], x_{2} \in [- 3, 3]$
Gp	2	$\begin{array}{c} y = (30 + {(2 x_{1} - 3 x_{2})}^{2} (18 - 32 x_{1} + 12 x_{1}^{2} + 48 x_{2} - 36 x_{1} x_{2} + 27 x_{2}^{2})) \cdot \\ (1 + {(x_{1} + x_{2} + 1)}^{2} (19 - 14 x_{1} + 3 x_{1}^{2} - 14 x_{2} + 6 x_{1} x_{2} + 3 x_{2}^{2})) x_{i} \in [- 2, 2], i = 1, 2 \end{array}$
Mccormick	2	$y = \sin (x_{1} + x_{2}) + {(x_{1} - x_{2})}^{2} - 1.5 x_{1} + 2.5 x_{2} + 1 x_{1} \in [- 1.5, 4], x_{2} \in [- 3, 4]$
Schwefel	2	$y = 837.9658 - \sum_{i = 1}^{2} x_{i} \sin (\sqrt{\|x_{i}\|}) x_{i} \in [- 500, 500], i = 1, 2$
sixHump	2	$y = 4 x_{1}^{2} - 2.1 x_{1}^{4} + x_{1}^{6} / 3 + x_{1} x_{2} - 4 x_{2}^{2} + 4 x_{2}^{4} x_{i} \in [- 2, 2], i = 1, 2$

Table 5. RMSE results of HKM-HS, MMSE, LHD and AME algorithms tested on 10 benchmark functions.

D	Type	Function	AKSM-MC		MSE		LHD		AME
			RMSE		RMSE		RMSE		RMSE
			Mean	Std	Mean	Std	Mean	Std	Mean	Std
3	Fourth	Hartman3	0.0350	0.0025	0.0443	0.0044	0.0460	0.0033	0.0379	0.0079
4	Fourth	Shekel	0.0038	0.0006	0.0048	0.0010	0.0075	0.0019	0.0054	0.0008
5	Third	Michalewicz	0.0329	0.0020	0.0402	0.0036	0.0478	0.0028	0.0401	0.0028
6	Fourth	Hartman6	0.0121	0.0024	0.0166	0.0030	0.0167	0.0018	0.0134	0.0041
8	First	Levy	6.5634	0.3191	6.7909	0.5061	3.1332	0.2700	3.1550	0.3943
9	Second	DixonPrice	3.15 × 10⁴	2.09 × 10³	4.65 × 10⁴	3.98 × 10³	1.64 × 10⁴	3.12 × 10³	1.17 × 10⁴	1.42 × 10³
10	Third	Michalewicz10	0.0461	0.0025	0.0476	0.0047	0.0499	0.0028	0.0513	0.0018
10	Second	Rosenbrock	228.5093	9.0254	377.0992	21.4293	230.2005	9.0255	231.2139	22.2353
12	First	Rastrigin	0.5446	0.0215	0.7339	0.1015	0.6045	0.0215	0.6608	0.0537
16	Fourth	F16	1.3398	0.1280	1.4068	0.4496	1.4281	0.4921	1.3414	0.0535

Table 6. Comparison of spring modeling results.

		HKM-HS	MMSE	LHD	AME
RMSE	Meanstd	258.1106 15.7369	305.5048 17.8805	436.2186 43.3815	284.0081 34.8651

Table 7. Comparison of modeling results of ten-bar planar truss.

		HKM-HS	MMSE	LHD	AME
RMSE	mean	219.0163	238.4223	241.7911	219.7608
RMSE	std	5.8617	12.7105	8.0089	6.5274

Table 8. Comparison of modeling results of house heating.

		HKM-HS	MMSE	LHD	AME
RMSE	mean	0.5738	0.6028	0.6577	0.5866
RMSE	std	0.0358	0.0418	0.0338	0.0505

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Shi, J.; Shen, J.; Li, Y. High-Precision Kriging Modeling Method Based on Hybrid Sampling Criteria. Mathematics 2021, 9, 536. https://doi.org/10.3390/math9050536

AMA Style

Shi J, Shen J, Li Y. High-Precision Kriging Modeling Method Based on Hybrid Sampling Criteria. Mathematics. 2021; 9(5):536. https://doi.org/10.3390/math9050536

Chicago/Turabian Style

Shi, Junjun, Jingfang Shen, and Yaohui Li. 2021. "High-Precision Kriging Modeling Method Based on Hybrid Sampling Criteria" Mathematics 9, no. 5: 536. https://doi.org/10.3390/math9050536

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

High-Precision Kriging Modeling Method Based on Hybrid Sampling Criteria

Abstract

1. Introduction

2. Related Work

2.1. Kriging

2.2. AME

3. HKM-HS Method

3.1. Maximizing Mean Square Error (MMSE)

3.2. MC Criterion

3.3. Screening Method

3.4. Implementation of the HKM-HS Method

4. Numerical Experiment

4.1. Benchmark Function Test

4.2. Examples

4.2.1. Spring Design Problem

4.2.2. Ten-Bar Planar Truss Problem

5. House Heating

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI