Dual Kriging with a Nonlinear Hybrid Gaussian RBF–Polynomial Trend: The Theory and Application to PM2.5 Estimation in Northern Thailand

Utudee, Somlak; Chanthorn, Pharunyou; Moonchai, Sompop

doi:10.3390/math13172811


by Somlak Utudee ^1,2,3, Pharunyou Chanthorn ^1,2,3 and Sompop Moonchai ^1,2,3,*

¹ Advanced Research Center for Computational Simulation, Chiang Mai University, Chiang Mai 50200, Thailand

² Department of Mathematics, Faculty of Science, Chiang Mai University, Chiang Mai 50200, Thailand

³ Centre of Excellence in Mathematics, MHESI, Bangkok 10400, Thailand

* Author to whom correspondence should be addressed.

Mathematics 2025, 13(17), 2811; https://doi.org/10.3390/math13172811

Submission received: 26 June 2025 / Revised: 18 August 2025 / Accepted: 27 August 2025 / Published: 1 September 2025

(This article belongs to the Special Issue Statistical Data Modeling and Machine Learning with Applications, 3rd Edition)


Abstract

Accurate spatial interpolation of environmental data requires flexible models that can capture complex spatial patterns. In this paper, we present two improved dual kriging (DK) models whose nonlinear trend function combines Gaussian radial basis functions with a first-order polynomial. The first model, DK–RBFP, sets the basis-function parameters by k-means clustering; its extension, DK–RBFPGA, additionally uses a genetic algorithm for parameter optimization. Both models exhibit enhanced performance in capturing spatial variation. The complete monotonicity of the covariance function and the strict positive definiteness of the coefficient matrix provide theoretical support for the uniqueness of the DK solution. When applied to datasets of PM_2.5 concentrations for northern Thailand, both models perform better than the conventional DK model using a second-order polynomial trend (DK–POLY), as evidenced by accuracy metrics including the mean absolute percentage error (MAPE), the mean squared error (MSE), and the root mean square error (RMSE). The outcomes indicate that integrating nonlinear trend components with data-driven optimization significantly enhances accuracy and flexibility in environmental spatial predictions.

Keywords:

spatial interpolation; dual kriging; kriging with external drift; Gaussian radial basis function; k-means; distance correlation coefficient; genetic algorithm

MSC:

62P12; 68T09

1. Introduction

Spatial interpolation refers to methods used to estimate the value of a target variable at a specific location based on observed values from nearby sites. Common techniques include inverse distance weighting (IDW), geographically weighted regression (GWR), and kriging. Kriging is a geostatistical approach that provides the best linear unbiased prediction with minimum variance by modeling a spatial process as the sum of a deterministic trend and a stochastic residual. Ordinary kriging (OK), the most widely used variant, assumes a constant mean across the study area. In contrast, kriging with an external drift (KED) incorporates auxiliary variables that are highly correlated with the target variable, defining the trend as a function of these variables to improve estimation accuracy [1].

One of the main computational challenges in KED is that a distinct kriging system must be solved for each interpolation point to calculate the linear estimator weights. To overcome this, DK, which is derived from KED, uses a single system to compute weights for all points while producing identical estimation results [2]. Compared with OK, DK offers greater flexibility by simultaneously modeling global trends and local variations. Its trend component enables the capturing of complex, nonlinear spatial structures, while solving a single system makes DK more computationally efficient for large datasets. Unlike machine learning (ML) methods such as support vector machines (SVMs) or neural networks, DK explicitly incorporates spatial autocorrelation, providing more interpretable spatial predictions. ML approaches, by contrast, require large datasets and are not inherently designed to account for spatial dependencies [3]. Recent studies have confirmed the effectiveness of DK in relevant research areas [4,5,6,7].

The trend component in DK is critical for representing large-scale spatial variation. By explicitly modeling the trend, DK can better capture nonlinear spatial patterns, integrate auxiliary information, and improve prediction accuracy and interpretability. Polynomial functions are commonly used to describe trends in KED and DK; however, they often fail to represent nonlinear interactions between predictors and the response variable. While several studies have explored KED and DK, few focus on nonlinear trend functions. Notable exceptions include Snepvangers et al. [8], who applied a logarithmic trend, and Freier and Lieres [9], who extended universal kriging to include nonlinear trends using Taylor-based linearization. More recently, Baisad et al. [10] employed least squares support vector regression (LSSVR) as the trend in KED. Despite these advances, research on nonlinear trends in DK remains limited. The choice of trend function directly influences the structure of the coefficient matrix in the DK system. This structure determines the existence and uniqueness of the optimal weight solutions. Therefore, selecting an appropriate trend function is a vital step in ensuring that DK provides reliable and consistent estimates.

In this paper, we propose a nonlinear trend function for the DK framework that integrates a radial basis function (RBF) with a first-order polynomial to capture both linear and nonlinear dependencies of the target variable on auxiliary variables. RBFs are widely used in machine learning and geostatistics for interpolation and function approximation, as they model relationships based on distances from a set of centers, enabling flexible representation of nonlinear spatial patterns [11,12,13]. Commonly used RBFs include the thin-plate spline (TPS), the multiquadric function (MQF), the inverse multiquadric function (IMQF), and the Gaussian function (GF) [14]. Among these, the GF, also known as the Gaussian radial basis function (GRBF), is particularly popular for spatial interpolation, regression [15,16,17], and kernel-based ML methods [18,19].

The performance of the GRBF depends on two key parameters: the center, which defines where the function peaks, and the bandwidth, which controls its width. Selecting centers typically involves data reduction methods such as k-means clustering or subset selection methods to identify representative points from the dataset [20,21], while bandwidth is often estimated using cross-validation, maximum likelihood, or rule-of-thumb approaches based on inter-center distances [22,23]. One practical approach is to set the bandwidth proportional to the average distance between centers [24]. In this study, k-means clustering is used to choose centers, and a genetic algorithm (GA) is employed to estimate the bandwidth. GAs are population-based optimization techniques inspired by natural selection [25,26], and they have been increasingly applied to estimate parameters in nonlinear regression models [27,28,29,30].

Finally, we apply the proposed DK approach with nonlinear trend functions to interpolate PM_2.5 concentrations using air pressure and relative humidity as auxiliary variables and to compare its performance against DK with a second-order polynomial trend.

The following summarizes the main contributions of this work: (1) the development of a novel nonlinear trend function within the DK framework by combining GRBFs with a first-order polynomial; (2) a theoretical guarantee of the uniqueness of the DK system solution, established by demonstrating the nonsingularity of the coefficient matrix based on the theorems of complete monotonicity and strict positive definiteness; and (3) the implementation of a hybrid approach in which the centers of the GRBFs are determined using k-means clustering, and the bandwidth parameter is estimated using GA.

The remainder of this paper is organized as follows: Section 2 outlines the mathematical backgrounds of KED and DK, as well as the theoretical concepts related to strictly positive definite matrices and completely monotone functions. In Section 3, we present a hybrid nonlinear trend model that combines the Gaussian radial basis function with the first-order polynomial. Section 4 addresses the requirements necessary to ensure the DK system’s nonsingularity. Section 5 describes the dataset and the use of distance correlation coefficients to select appropriate auxiliary variables; the performance of the proposed DK method is then tested by estimating PM_2.5 concentrations in northern Thailand. Conclusions and discussions are given in Section 6.

2. Theoretical Background

2.1. Kriging with an External Drift

Let Z(s) be a random function at spatial locations s ∈ D, where $D \subset \mathbb{R}^{d}$ is the spatial domain of interest and d is the spatial dimension. The random function Z(s) for KED is modeled as

\begin{matrix} Z (s) = μ (s) + ϵ (s), \end{matrix}

(1)

where μ(s) is the drift or trend component and ϵ(s) is a stochastic residual with zero mean and a stationary covariance.

The drift component μ(s) is a deterministic function, often referred to as the trend function. It incorporates external covariates or auxiliary variables, which are predetermined variables having a systematic effect on the underlying spatial process. The trend function μ(s) is expressed as

μ (s) = \sum_{l = 0}^{L} α_{l} f_{l} (X (s)),

(2)

where α_ℓ denotes an unknown coefficient to be estimated and $X (s) = {[X_{1} (s), \dots, X_{m} (s)]}^{T}$ represents the vector of m auxiliary variables at location s. The function f_ℓ denotes a basis function defined in terms of the auxiliary variables. In general, f₀ is defined as the constant function (e.g., f₀(x) = 1), and L + 1 is the total number of basis functions.

We can rewrite Equation (1) in the following form:

Z (s) = F^{T} α + ϵ (s),

(3)

where $F = {[f_{0} (X (s)), f_{1} (X (s)), \dots, f_{L} (X (s))]}^{T}$ and $α = {[α_{0}, α_{1}, \dots, α_{L}]}^{T}$.

The residual term ϵ(s) is assumed to be a second-order stationary random function with a mean of zero and a covariance function C(h) [31], where h is the spatial lag vector between two locations. Specifically, the residual satisfies the following properties:

E (ϵ (s)) = 0, Cov (ϵ (s), ϵ (s + h)) = C (h),

(4)

where $s, s + h \in D \subseteq \mathbb{R}^{d}$.

Typically, the covariance function is modeled using the variogram γ(h) of the residual process and is defined as

\begin{matrix} γ (h) & = \frac{1}{2} Var (ϵ (s + h) - ϵ (s)) \\ & = \frac{1}{2} E {(ϵ (s + h) - ϵ (s))}^{2}, \end{matrix}

(5)

which leads to

C (h) = σ^{2} - γ (h),

(6)

where σ² is the variance in the residual process [32].

Given n observed values, $Z (s_{1}), \dots, Z (s_{n})$, at sample locations $s_{1}, s_{2}, \dots, s_{n}$, the KED estimator at an unobserved location s₀ is expressed as a linear combination of the observed values Z(s_i) for $i = 1, \dots, n$:

{\hat{Z}}_{K E D} (s_{0}) = \sum_{i = 1}^{n} w_{i} Z (s_{i}) = Z^{T} W,

(7)

where w_i denotes the KED weight associated with Z(s_i), $Z = {[Z (s_{1}), \dots, Z (s_{n})]}^{T}$ is the vector of the observed values of the target variable, and $W = {[w_{1}, w_{2}, \dots, w_{n}]}^{T}$ represents the vector of KED weights. The optimal weights are derived by minimizing the estimation error variance under the constraint of unbiasedness, resulting in the following formulation of the KED system:

\begin{matrix} [\begin{matrix} C & | & F \\ - - & + & - - \\ F^{T} & | & 0 \end{matrix}] [\begin{matrix} W \\ - - \\ L \end{matrix}] = [\begin{matrix} C_{0} \\ - - \\ F_{0} \end{matrix}], \end{matrix}

(8)

where L = η + m; 0 denotes the (L + 1) × (L + 1) zero matrix; $L = {[λ_{0}, λ_{1}, \dots, λ_{L}]}^{T}$ represents the vector of Lagrange multipliers; $C_{0} = {[C (s_{1} - s_{0}), C (s_{2} - s_{0}), \dots, C (s_{n} - s_{0})]}^{T}$; $F_{0} = {[f_{0} (X (s_{0})), f_{1} (X (s_{0})), \dots, f_{L} (X (s_{0}))]}^{T}$;

C = [\begin{matrix} C (s_{1} - s_{1}) & \dots & C (s_{1} - s_{n}) \\ C (s_{2} - s_{1}) & \dots & C (s_{2} - s_{n}) \\ ⋮ & ⋱ & ⋮ \\ C (s_{n} - s_{1}) & \dots & C (s_{n} - s_{n}) \end{matrix}];

and

F = [\begin{matrix} f_{0} (X (s_{1})) & f_{1} (X (s_{1})) & \dots & f_{L} (X (s_{1})) \\ f_{0} (X (s_{2})) & f_{1} (X (s_{2})) & \dots & f_{L} (X (s_{2})) \\ ⋮ & ⋮ & ⋱ & ⋮ \\ f_{0} (X (s_{n})) & f_{1} (X (s_{n})) & \dots & f_{L} (X (s_{n})) \end{matrix}].
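As a rough numerical illustration of Equation (8), the following Python sketch assembles the bordered KED matrix and solves it for the weights at a single target location. The exponential covariance with unit parameters, the toy coordinates, the single auxiliary variable, and the names `exp_cov` and `ked_weights` are assumptions made for this example; none of them come from the study's data or code.

```python
import numpy as np

def exp_cov(h, k1=1.0, k2=1.0):
    """Exponential covariance k2 * exp(-h / k1); an assumed model for this demo."""
    return k2 * np.exp(-np.asarray(h, dtype=float) / k1)

def ked_weights(C, F, C0, F0):
    """Solve the KED system of Equation (8) for one target location.

    Returns the weight vector W (length n) and the Lagrange multipliers.
    """
    n, p = F.shape
    K = np.block([[C, F], [F.T, np.zeros((p, p))]])
    sol = np.linalg.solve(K, np.concatenate([C0, F0]))
    return sol[:n], sol[n:]

# Hypothetical toy data: 5 sites in the plane with one auxiliary variable.
coords = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.2]])
x_aux = np.array([1.0, 2.0, 1.5, 3.0, 2.2])
z = np.array([10.0, 14.0, 12.0, 18.0, 15.0])
s0, x0 = np.array([0.4, 0.6]), 1.8          # target site and its auxiliary value

F = np.column_stack([np.ones(len(z)), x_aux])   # basis functions f0 = 1, f1 = X1
F0 = np.array([1.0, x0])
C = exp_cov(np.linalg.norm(coords[:, None] - coords[None, :], axis=2))
C0 = exp_cov(np.linalg.norm(coords - s0, axis=1))

W, lagrange = ked_weights(C, F, C0, F0)
z_hat = float(W @ z)                            # Equation (7)
print("KED estimate at s0:", z_hat)
```

The second block row of the system enforces the unbiasedness constraint, so the computed weights should satisfy F^T W = F_0 up to rounding. A new target location requires a new right-hand side and therefore a new solve, which is the cost that DK avoids.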

2.2. Dual Kriging

Using the KED system as presented in Equation (8) and assuming an invertible coefficient matrix, the solution can be formalized as [1,2]

\begin{matrix} [\begin{matrix} W \\ - - \\ L \end{matrix}] = [\begin{matrix} U & | & V \\ - - & + & - - \\ V^{T} & | & Ω \end{matrix}] [\begin{matrix} C_{0} \\ - - \\ F_{0} \end{matrix}], \end{matrix}

(9)

where $U$, $V$ and Ω denote matrices of dimensions n × n, n × (L + 1), and (L + 1) × (L + 1), respectively. Since the coefficient matrix in Equation (8) is symmetric, its inverse is also symmetric. Consequently, the matrix $U$ is symmetric.

Substituting the weight vector $W$, as given in Equation (9), into the estimator formulation in Equation (7) yields the following estimated value:

\begin{matrix} {\hat{Z}}_{K E D} (s_{0}) = Z^{T} V F_{0} + Z^{T} U C_{0} . \end{matrix}

(10)

Let α and β be vectors of dimensions (L + 1) × 1 and n × 1, respectively, defined as follows:

\begin{matrix} α^{T} = [α_{0}, \dots, α_{L}] = Z^{T} V \quad \text{and} \quad β^{T} = [β_{1}, \dots, β_{n}] = Z^{T} U = Z^{T} U^{T} . \end{matrix}

(11)

As a result, Equation (10) can be reformulated to yield the dual kriging (DK) estimator, which is expressed as follows:

\begin{matrix} {\hat{Z}}_{DK} (s_{0}) = α^{T} F_{0} + β^{T} C_{0} . \end{matrix}

(12)

Therefore, the vectors α and β represent the weighting coefficients used in the DK estimator. According to Equation (11), these vectors can be expressed in matrix form as follows:

\begin{matrix} [\begin{matrix} β \\ - - \\ α \end{matrix}] = [\begin{matrix} U & | & A \\ - - & + & - - \\ V^{T} & | & B \end{matrix}] [\begin{matrix} Z \\ - - \\ 0_{L + 1} \end{matrix}], \end{matrix}

(13)

where $A$ and $B$ denote arbitrary matrices of dimensions n × (L + 1) and (L + 1) × (L + 1), respectively. The term $0_{L + 1}$ refers to a zero column vector of size (L + 1) × 1.

By setting $A = V$ and $B = Ω$, the block matrix in Equation (13) becomes the inverse of the coefficient matrix in Equation (8). Multiplying both sides by that coefficient matrix shows that the coefficient vectors α and β are the solution to the DK system:

\begin{matrix} [\begin{matrix} C & | & F \\ - - & + & - - \\ F^{T} & | & 0 \end{matrix}] [\begin{matrix} β \\ - - \\ α \end{matrix}] = [\begin{matrix} Z \\ - - \\ 0_{L + 1} \end{matrix}] . \end{matrix}

(14)

The estimated value of the target variable at the unobserved location s₀ can be computed by substituting the coefficient vectors α and β into Equation (12).
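The practical consequence of Equations (12) and (14) can be sketched in Python as follows: the DK system is solved once, and each prediction afterwards costs only two dot products. The exponential covariance with unit parameters (the model adopted later in Equation (25)), the toy data, and the function names `bordered_matrix`, `dk_fit`, and `dk_predict` are assumptions made for illustration only.

```python
import numpy as np

def exp_cov(h, k1=1.0, k2=1.0):
    """Exponential covariance k2 * exp(-h / k1); parameters are assumed."""
    return k2 * np.exp(-np.asarray(h, dtype=float) / k1)

def bordered_matrix(coords, F):
    """Coefficient matrix shared by Equations (8) and (14)."""
    p = F.shape[1]
    dist = np.linalg.norm(coords[:, None] - coords[None, :], axis=2)
    return np.block([[exp_cov(dist), F], [F.T, np.zeros((p, p))]])

def dk_fit(coords, F, z):
    """Solve Equation (14) once; returns (beta, alpha)."""
    n, p = F.shape
    rhs = np.concatenate([z, np.zeros(p)])
    sol = np.linalg.solve(bordered_matrix(coords, F), rhs)
    return sol[:n], sol[n:]

def dk_predict(coords, beta, alpha, s0, f0):
    """Equation (12): alpha^T F0 + beta^T C0."""
    c0 = exp_cov(np.linalg.norm(coords - s0, axis=1))
    return float(alpha @ f0 + beta @ c0)

# Hypothetical toy data: 5 sites, trend basis [1, X1].
coords = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.2]])
x_aux = np.array([1.0, 2.0, 1.5, 3.0, 2.2])
z = np.array([10.0, 14.0, 12.0, 18.0, 15.0])
F = np.column_stack([np.ones(len(z)), x_aux])

beta, alpha = dk_fit(coords, F, z)

# Prediction at a new site, and the KED value for the same site for comparison.
s0, f0 = np.array([0.4, 0.6]), np.array([1.0, 1.8])
z_dk = dk_predict(coords, beta, alpha, s0, f0)
c0 = exp_cov(np.linalg.norm(coords - s0, axis=1))
w = np.linalg.solve(bordered_matrix(coords, F), np.concatenate([c0, f0]))[:len(z)]
z_ked = float(w @ z)
z_at_site = dk_predict(coords, beta, alpha, coords[2], F[2])
print(z_dk, z_ked, z_at_site)
```

Because the coefficient matrix is symmetric, the DK and KED values agree up to rounding. With no nugget effect, the predictor also reproduces the observed value at a sample site.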

The solution of the individual systems used to determine the optimal weights in dual kriging (DK) depends on the invertibility of the coefficient matrices, which ensures a unique solution. Achieving this property requires appropriate selection of variogram or covariance functions and trend models. Myers [33] noted that these coefficient matrices are nonsingular if the covariance function is expressed in terms of linearly independent, real-valued functions of the trend model and is strictly positive definite with respect to the basis functions. In the following section, we introduce the definitions and theorems of positive definite matrices and completely monotone functions to establish the conditions necessary for the invertibility of the dual kriging system.

2.3. The Relationship Between the Positive Definite Matrix and the Completely Monotone Function

The definitions and theorems presented in this section are adapted from the foundational work [34,35], which provides a comprehensive treatment of positive definite matrices, completely monotone functions, and their applications in approximation theory.

Definition 1.

Let $C$ be an $n \times n$ symmetric matrix and $u \in \mathbb{R}^{n}$.

1. $C$ is positive definite (denoted by $C \geq 0$) if $u^{T} C u \geq 0$ for all $u \neq 0$.
2. $C$ is strictly positive definite (denoted by $C > 0$) if $u^{T} C u > 0$ for all $u \neq 0$.

Definition 2.

Let $f : [0, \infty) \to \mathbb{R}$. The function f is called a completely monotone function if

1. $f \in C [0, \infty) \cap C^{\infty} (0, \infty)$;
2. ${(- 1)}^{k} f^{(k)} (h) \geq 0$ for all $k \in N$ and $h \in (0, \infty)$.

To illustrate the concept of complete monotonicity, we now present a specific function that satisfies the conditions outlined above.

Example 1.

The function $f (h) = e^{- \sqrt{h} / a}$ is completely monotone for every a > 0.

The function $f (h)$ is positive, continuous on $[0, \infty)$, and infinitely differentiable on $(0, \infty)$. We will verify that ${(- 1)}^{k} f^{(k)} \geq 0$ for all $k \in N$ by strong induction. For $h > 0$,

(- 1) f^{'} (h) = \frac{1}{2 a \sqrt{h}} e^{- \sqrt{h} / a} \geq 0 .

Next, we assume that

{(- 1)}^{i} f^{(i)} (h) \geq 0 \text{ for all } i = 0, 1, \dots, k .

Write $f^{'} = - f g$ with $g (h) = \frac{1}{2 a} h^{- 1 / 2}$, so that $g^{(i)} (h) = \frac{{(- 1)}^{i}}{2 a} \frac{(2 i - 1)!!}{2^{i} h^{(2 i + 1) / 2}}$. Using the Leibniz rule, where

{(f g)}^{(k)} = \sum_{i = 0}^{k} \binom{k}{i} f^{(k - i)} g^{(i)},

we compute the following:

\begin{matrix} {(- 1)}^{k + 1} f^{(k + 1)} (h) & = \frac{1}{2 a} \sum_{i = 0}^{k} \binom{k}{i} \frac{(2 i - 1)!!}{2^{i} h^{(2 i + 1) / 2}} {(- 1)}^{k - i} f^{(k - i)} (h) \geq 0, \end{matrix}

where $(2 i - 1)!! = (2 i - 1) (2 i - 3) \dots 3 \cdot 1$ and $(- 1)!! = 1$. Every term of the sum is nonnegative by the induction hypothesis. We then conclude that

{(- 1)}^{k} f^{(k)} (h) \geq 0 \text{ for all } k \in N .

The following theorem provides a powerful criterion for ensuring strict positive definiteness of matrices formed from completely monotone functions.

Theorem 1 (Schoenberg’s Theorem). If f is a completely monotone function and not constant on $[0, \infty)$, then for any distinct points $s_{1}, \dots, s_{n}$, the matrix $C = [f (∥ s_{i} - s_{j} ∥^{2})]$ is strictly positive definite.

We now turn to a related result concerning the nonsingularity of a specific matrix structure known as the bordered Gramian matrix.

Theorem 2.

Let $C$ be a positive definite matrix of size $n \times n$, and let $F$ be an $n \times (L + 1)$ matrix. The bordered Gramian matrix

[\begin{matrix} C & | & F \\ - - & + & - - \\ F^{T} & | & 0 \end{matrix}]

is nonsingular if and only if

rank (F) = L + 1 \quad \text{and} \quad C + F F^{T} > 0 .
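Theorems 1 and 2 can be checked numerically on a small example. The sketch below, in Python with NumPy, builds C from the completely monotone function of Example 1 evaluated at squared distances, then inspects the eigenvalues of C and of C + F F^T and the rank of the bordered matrix. The random sites, the random auxiliary values, and the parameters k1 and k2 are assumptions chosen for the demonstration. A numerical check of this kind illustrates the statements but does not prove them.

```python
import numpy as np

rng = np.random.default_rng(0)
n, m = 12, 2
sites = rng.uniform(0.0, 10.0, size=(n, 2))   # distinct random sites (assumed)
X = rng.normal(size=(n, m))                   # assumed auxiliary variables
F = np.column_stack([np.ones(n), X])          # n x (L + 1) with L + 1 = 3

# C_ij = f(||s_i - s_j||^2) with f(h) = k2 * exp(-sqrt(h) / k1).
k1, k2 = 3.0, 2.0
dist = np.linalg.norm(sites[:, None] - sites[None, :], axis=2)
C = k2 * np.exp(-dist / k1)

min_eig_C = np.linalg.eigvalsh(C).min()              # Theorem 1: should be > 0
rank_F = np.linalg.matrix_rank(F)                    # Theorem 2, first condition
min_eig_sum = np.linalg.eigvalsh(C + F @ F.T).min()  # Theorem 2, second condition

K = np.block([[C, F], [F.T, np.zeros((m + 1, m + 1))]])
rank_K = np.linalg.matrix_rank(K)                    # full rank means nonsingular

print(min_eig_C, rank_F, min_eig_sum, rank_K)
```

If both conditions of Theorem 2 hold, the rank of the bordered matrix should equal n + L + 1, which is 15 here.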

3. A New Approach to Nonlinear Trend Modeling in Dual Kriging

This section presents the trend functions employed in the DK approach adopted in this study. These include a second-order polynomial function and the proposed nonlinear trend function, which is constructed by integrating GRBFs with a first-order polynomial. Additionally, the parameter estimation methods for these two trend models are presented.

Let the dataset at each sample location $s_{i} \in \mathbb{R}^{d}$ be denoted by $\{(X (s_{i}), Z (s_{i}))\}_{i = 1}^{n}$, where $X (s_{i}) = {[X_{1} (s_{i}), \dots, X_{m} (s_{i})]}^{T} \in \mathbb{R}^{m}$ is a vector comprising m auxiliary variables and Z(s_i) denotes the observed response at location s_i.

The conventional trend functions employed in KED and DK methods are typically first-order (linear) or second-order (quadratic) polynomial models. In this study, we focus on the second-order polynomial model, which is defined as follows [36,37]:

μ (s) = b_{0} + \sum_{j = 1}^{m} b_{j} X_{j} (s) + \sum_{i = 1}^{m} b_{i i} X_{i} {(s)}^{2} + \sum_{i = 1}^{m - 1} \sum_{j = i + 1}^{m} b_{i j} X_{i} (s) X_{j} (s) .

(15)

Given that $\frac{(m + 1) (m + 2)}{2} \leq n$, the unknown coefficients in Equation (15) can be estimated using the ordinary least squares (OLS) method [38,39], which is given by

\hat{B} = {(X^{T} X)}^{- 1} X^{T} Z,

(16)

where $\hat{B}$ is a $\frac{(m + 1) (m + 2)}{2} \times 1$ vector of estimated coefficients, $X$ is an $n \times \frac{(m + 1) (m + 2)}{2}$ matrix whose rows are the vectors

X (s_{i}) = [1, X_{1} (s_{i}), \dots, X_{m} (s_{i}), X_{1}^{2} (s_{i}), \dots, X_{m}^{2} (s_{i}), X_{1} (s_{i}) X_{2} (s_{i}), \dots, X_{m - 1} (s_{i}) X_{m} (s_{i})],

and $Z = {[Z (s_{1}), Z (s_{2}), \dots, Z (s_{n})]}^{T}$.

In addition, the matrix $X$ is assumed to have full column rank, so that $X^{T} X$ is invertible.
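A minimal Python sketch of Equations (15) and (16) is given below. It builds the second-order design matrix in the column order stated above and estimates the coefficients with `numpy.linalg.lstsq`, which gives the same solution as the normal-equation formula when the design matrix has full column rank. The function names and the synthetic data are assumptions for illustration.

```python
import numpy as np
from itertools import combinations

def quadratic_design(X):
    """Rows [1, X_1..X_m, X_1^2..X_m^2, X_i X_j (i < j)], as in Equation (15)."""
    X = np.asarray(X, dtype=float)
    n, m = X.shape
    cols = [np.ones(n)]
    cols += [X[:, j] for j in range(m)]
    cols += [X[:, j] ** 2 for j in range(m)]
    cols += [X[:, i] * X[:, j] for i, j in combinations(range(m), 2)]
    return np.column_stack(cols)

def ols(design, z):
    """Least squares estimate of the trend coefficients, Equation (16)."""
    coef, *_ = np.linalg.lstsq(design, z, rcond=None)
    return coef

# Synthetic check (assumed data): m = 2 gives (m + 1)(m + 2) / 2 = 6 columns.
rng = np.random.default_rng(0)
X = rng.normal(size=(12, 2))
true_coef = np.array([1.0, 2.0, -1.0, 0.5, 0.25, -0.75])
D = quadratic_design(X)
z = D @ true_coef                      # noiseless response for the demo
B_hat = ols(D, z)
print(B_hat)
```

With noiseless synthetic data and more observations than coefficients, the estimate should recover the generating coefficients up to rounding.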

The proposed nonlinear trend function is created by integrating GRBFs with a first-order polynomial. GRBFs are widely used for modeling and estimating functions in both spatial and temporal contexts. Their formal definition is provided as follows [15]:

φ_{i} (X (s), P_{i}) = φ_{i} (X (s), Ψ_{i}, σ_{i}) = exp [- {(\frac{‖ X (s) - Ψ_{i} ‖}{σ_{i}})}^{2}] .

(17)

In Equation (17), the symbol $∥ \cdot ∥$ represents some norm on $\mathbb{R}^{m}$, typically the Euclidean norm. Each parameter vector P_i is defined as $P_{i} = [Ψ_{i}, σ_{i}]$, where Ψ_i and σ_i denote the center (or mean) and the bandwidth (or scale) parameters, respectively.

Given the effectiveness of GRBFs in capturing highly nonlinear relationships, the proposed trend model is augmented with a first-order polynomial to account for the linear component of the trend. According to Equation (1), the resulting trend function is defined as follows:

μ (s) = a_{0} + \sum_{i = 1}^{m} a_{i} X_{i} (s) + \sum_{j = 1}^{η} b_{j} φ_{j} (X (s), P_{j}),

(18)

where a_i and b_j denote unknown coefficients for $i = 0, 1, \dots, m$ and $j = 1, \dots, η$, respectively. The vector $X (s) = {[X_{1} (s), \dots, X_{m} (s)]}^{T}$ represents the m auxiliary variables at location s, while φ_j(X(s), P_j) is the radial basis function with parameter P_j. Additionally, η denotes the total number of radial basis functions.

In this study, we consider the parameters of the trend model (Model (18)) in two cases. In the first case, Ψ_j and σ_j are estimated using the method proposed in [15]. Subsequently, the parameters a_i and b_j for $i = 0, 1, \dots, m$ and $j = 1, \dots, η$ are determined using the least squares method.

In the second case, the trend model is treated as a nonlinear model. The parameter Ψ_j is obtained using the same method as in the first case, while the parameters σ_j, a_i, and b_j for $i = 0, 1, \dots, m$ and $j = 1, \dots, η$ are estimated using a genetic algorithm implemented via the ga function in MATLAB (Version 2018a). The algorithm is configured with a population size of 30, a crossover probability of 0.8, a mutation probability of 0.2, and a maximum of 20,000 generations. Stochastic uniform selection is employed as the selection method, combined with scattered crossover and Gaussian mutation strategies.
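To make the second case more concrete, the following Python sketch runs a small real-coded genetic algorithm over the parameter vector [σ_1..σ_η, a_0..a_m, b_1..b_η] of Model (18), minimizing the sum of squared errors. It is not a reimplementation of MATLAB's ga function: it keeps the population size of 30 and the crossover and mutation probabilities of 0.8 and 0.2 from the text, but it uses binary tournament selection in place of stochastic uniform selection, adds elitism, and applies the mutation probability per gene. The search bounds, mutation scale, generation count, fixed centers, and synthetic data are all assumptions.

```python
import numpy as np

def trend_sse(params, X, centers, z):
    """Sum of squared errors of Model (18) for one candidate parameter vector."""
    m, eta = X.shape[1], len(centers)
    sigma = params[:eta]
    a = params[eta:eta + m + 1]
    b = params[eta + m + 1:]
    d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    phi = np.exp(-(d / sigma) ** 2)
    mu = a[0] + X @ a[1:] + phi @ b
    return float(np.sum((z - mu) ** 2))

def ga_minimize(objective, lower, upper, pop_size=30, p_cross=0.8,
                p_mut=0.2, generations=200, mut_scale=0.1, seed=0):
    """Minimal elitist GA; returns (best vector, best value, best-value history)."""
    rng = np.random.default_rng(seed)
    lower, upper = np.asarray(lower, float), np.asarray(upper, float)
    dim = lower.size
    pop = rng.uniform(lower, upper, size=(pop_size, dim))
    fit = np.array([objective(x) for x in pop])
    history = [float(fit.min())]
    for _ in range(generations):
        children = [pop[fit.argmin()].copy()]            # elitism: keep the best
        while len(children) < pop_size:
            i, j = rng.integers(pop_size, size=2)        # binary tournament 1
            p1 = pop[i] if fit[i] < fit[j] else pop[j]
            i, j = rng.integers(pop_size, size=2)        # binary tournament 2
            p2 = pop[i] if fit[i] < fit[j] else pop[j]
            child = p1.copy()
            if rng.random() < p_cross:                   # scattered crossover
                child = np.where(rng.random(dim) < 0.5, p1, p2)
            mutate = rng.random(dim) < p_mut             # Gaussian mutation per gene
            child = child + mutate * rng.normal(0.0, mut_scale * (upper - lower))
            children.append(np.clip(child, lower, upper))
        pop = np.array(children)
        fit = np.array([objective(x) for x in pop])
        history.append(float(fit.min()))
    return pop[fit.argmin()], float(fit.min()), history

# Synthetic demo (assumed): m = 2 auxiliary variables, eta = 2 fixed centers.
rng = np.random.default_rng(1)
X = rng.uniform(0.0, 1.0, size=(20, 2))
centers = np.array([[0.25, 0.25], [0.75, 0.75]])
bump = np.exp(-(np.linalg.norm(X - centers[0], axis=1) / 0.3) ** 2)
z = 1.0 + 2.0 * X[:, 0] - X[:, 1] + 3.0 * bump
eta, m = 2, 2
lower = np.concatenate([np.full(eta, 0.05), np.full(m + 1 + eta, -10.0)])
upper = np.concatenate([np.full(eta, 2.0), np.full(m + 1 + eta, 10.0)])
best, best_sse, history = ga_minimize(
    lambda p: trend_sse(p, X, centers, z), lower, upper, generations=60)
print("best SSE:", best_sse)
```

Elitism guarantees that the best objective value never increases from one generation to the next. A short run like this one is not expected to reach the quality of a 20,000-generation search.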

In the first case, to estimate the parameters Ψ_j and σ_j, the dataset is partitioned into η clusters, denoted by $\{G_{1}, \dots, G_{η}\}$, using k-means clustering. As an unsupervised machine learning algorithm, k-means groups data points based on their similarity by assigning them to the nearest cluster centroid [40]. Following the method proposed in [15], the parameters Ψ_j and σ_j are estimated as follows:

Ψ_{j} = \frac{1}{n_{j}} \sum_{X_{α} \in G_{j}} X_{α},

(19)

and

σ_{j}^{2} = \frac{1}{n_{j}} \sum_{X_{α} \in G_{j}} ‖ X_{α} - Ψ_{j} ‖^{2},

(20)

where n_j denotes the number of observations belonging to the j-th cluster G_j. Subsequently, Equation (18) can be expressed in matrix form as

μ (s) = Y^{T} (s) θ,

(21)

where $Y (s) = {[1, X_{1} (s), \dots, X_{m} (s), φ_{1} (X (s)), φ_{2} (X (s)), \dots, φ_{η} (X (s))]}^{T}$ and $θ = {[a_{0}, a_{1}, \dots, a_{m}, b_{1}, \dots, b_{η}]}^{T}$.

Assuming that 1 + m + η ≤ n, the parameter matrix θ can be estimated using the least squares method, and it is given by

\hat{θ} = {(Y^{T} Y)}^{- 1} Y^{T} Z,

(22)

where Y represents an n × (1 + m + η) matrix whose rows are the vectors $Y^{T} (s_{i})$. Furthermore, Y is of full column rank.
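The first-case procedure in Equations (19) to (22) can be sketched in Python as follows. The clustering here is a plain Lloyd-style k-means written with NumPy, standing in for whatever k-means implementation the study used. The small floor on σ_j, which guards against a single-point cluster giving a zero bandwidth, is an added assumption, as are the function names and the synthetic data.

```python
import numpy as np

def kmeans_labels(X, eta, n_iter=100, seed=0):
    """Plain Lloyd iterations; returns a cluster label for each row of X."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=eta, replace=False)].copy()
    for _ in range(n_iter):
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        new_centers = centers.copy()
        for j in range(eta):
            if np.any(labels == j):
                new_centers[j] = X[labels == j].mean(axis=0)
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    return d.argmin(axis=1)

def grbf_parameters(X, labels, eta, min_sigma=1e-6):
    """Centers from Equation (19) and bandwidths from Equation (20)."""
    centers = np.zeros((eta, X.shape[1]))
    sigmas = np.zeros(eta)
    for j in range(eta):
        members = X[labels == j]
        if len(members) == 0:
            raise ValueError(f"cluster {j} is empty")
        centers[j] = members.mean(axis=0)
        sigmas[j] = np.sqrt(np.mean(np.sum((members - centers[j]) ** 2, axis=1)))
    return centers, np.maximum(sigmas, min_sigma)   # floor is an assumption

def trend_design(X, centers, sigmas):
    """Rows Y^T(s_i) = [1, X_1..X_m, phi_1..phi_eta], as in Equation (21)."""
    d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
    phi = np.exp(-(d / sigmas) ** 2)
    return np.column_stack([np.ones(len(X)), X, phi])

# Synthetic demo (assumed): two groups of auxiliary vectors, eta = 2.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0.0, 0.5, size=(10, 2)),
               rng.normal(5.0, 0.5, size=(10, 2))])
z = 2.0 + X[:, 0] + rng.normal(0.0, 0.1, size=20)

labels = kmeans_labels(X, eta=2)
centers, sigmas = grbf_parameters(X, labels, eta=2)
Y = trend_design(X, centers, sigmas)
theta_hat, *_ = np.linalg.lstsq(Y, z, rcond=None)   # Equation (22)
print(theta_hat)
```

Using `lstsq` rather than forming the inverse of Y^T Y explicitly is a numerical convenience; both give the same estimate when Y has full column rank.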

4. Nonsingularity of the Dual Kriging System

The following result combines the preceding theoretical results to establish conditions under which the bordered Gramian matrix, constructed from a completely monotone function and a full-rank matrix, is guaranteed to be nonsingular.

Theorem 3.

Let $X (s_{1}), X (s_{2}), \dots, X (s_{n})$ be distinct points in $\mathbb{R}^{m}$, and assume $n \geq 1 + m + η$. Define the matrix $F \in \mathbb{R}^{n \times (1 + m + η)}$ by

F = [\begin{matrix} 1 & X_{1} (s_{1}) & \dots & X_{m} (s_{1}) & φ_{1} (X (s_{1})) & \dots & φ_{η} (X (s_{1})) \\ 1 & X_{1} (s_{2}) & \dots & X_{m} (s_{2}) & φ_{1} (X (s_{2})) & \dots & φ_{η} (X (s_{2})) \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋱ & ⋮ \\ 1 & X_{1} (s_{n}) & \dots & X_{m} (s_{n}) & φ_{1} (X (s_{n})) & \dots & φ_{η} (X (s_{n})) \end{matrix}],

where for $j = 1, \dots, η$,

φ_{j} (X (s)) = \exp [- {(\frac{∥ X (s) - Ψ_{j} ∥}{σ_{j}})}^{2}], \quad σ_{j} > 0,

are Gaussian radial basis functions with distinct centers $Ψ_{j} \in \mathbb{R}^{m}$. Then $rank (F) = 1 + m + η$.

Proof.

We will show that the columns of F are linearly independent. To establish linear independence, it suffices to show that the homogeneous system $F c = 0$ has only the trivial solution c = 0, where

c = {[c_{0}, c_{1}, \dots, c_{m}, c_{m + 1}, c_{m + 2}, \dots, c_{m + η}]}^{T} \in \mathbb{R}^{1 + m + η} .

For each $i = 1, \dots, n$, the relation Fc = 0 can be written as

c_{0} + \sum_{k = 1}^{m} c_{k} X_{k} (s_{i}) + \sum_{j = 1}^{η} c_{m + j} φ_{j} (X (s_{i})) = 0 .

(23)

We decompose the matrix F as

F = [1 | P | Φ],

where $1 \in \mathbb{R}^{n}$ is the all-ones column, $P \in \mathbb{R}^{n \times m}$ with P_ik = X_k(s_i), and $Φ \in \mathbb{R}^{n \times η}$ with Φ_ij = φ_j(X(s_i)).

For distinct sites X(s_i) and GRBFs with distinct centers and σ_j > 0, Φ has full column rank when n ≥ η. Furthermore, for distinct sites with n ≥ η + m + 1 > m + 1, the polynomial block $[1 | P]$ has rank m + 1, since it spans the affine-linear functions in $\mathbb{R}^{m}$.

Case 1: Suppose $c_{j} \neq 0$ for some $j \in \{m + 1, \dots, m + η\}$. Then by (23), the nontrivial RBF combination $\sum_{j = 1}^{η} c_{m + j} φ_{j}$ coincides with an affine-linear function on the finite set $\{X (s_{i})\}_{i = 1}^{n}$. Since Φ and $[1 | P]$ are of full rank and span disjoint subspaces under the stated conditions, this leads to a contradiction. Hence this case is impossible.

Case 2: Assume $c_{m + 1} = \dots = c_{m + η} = 0$. Then (23) reduces to

c_{0} + \sum_{k = 1}^{m} c_{k} X_{k} (s_{i}) = 0, \quad i = 1, \dots, n .

Because n > m + 1 and $[1 | P]$ has full column rank 1 + m, it follows that $c_{0} = c_{1} = \dots = c_{m} = 0$.

Since both cases imply c = 0, we conclude that the columns of F are linearly independent.

□

Remark 1.

This theorem guarantees uniqueness in interpolation schemes that combine Gaussian radial basis functions with an affine polynomial tail. The assumptions of distinct centers and sufficiently many sample sites are essential for ensuring full rank in both the GRBF and polynomial blocks.

For the covariance structure, this study assumes isotropy, implying that the variogram is a function of the distance (i.e., the magnitude of the lag vector h) rather than its direction. The exponential variogram is used in its isotropic form without a nugget effect. This model is commonly applied in a wide range of spatial analysis contexts and is defined as follows [31]:

γ (h) = k_{2} (1 - e^{- h / k_{1}}),

(24)

where h = ∥h∥ is the separation distance, k₁ is the range parameter, and k₂ represents the sill, corresponding to the variance in the spatial process.
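As a small worked illustration, the Python sketch below evaluates the exponential variogram of Equation (24) and the covariance it implies through Equation (6) with σ² equal to the sill k₂. The parameter values and function names are assumptions chosen only for the demonstration.

```python
import math

def exp_variogram(h, k1, k2):
    """Isotropic exponential variogram without nugget, Equation (24)."""
    return k2 * (1.0 - math.exp(-h / k1))

def exp_covariance(h, k1, k2):
    """Covariance from Equation (6), taking sigma^2 equal to the sill k2."""
    return k2 - exp_variogram(h, k1, k2)

# Assumed parameters: range parameter k1 = 50 (distance units), sill k2 = 4.
k1, k2 = 50.0, 4.0
for h in (0.0, 25.0, 50.0, 200.0):
    print(h, round(exp_variogram(h, k1, k2), 4), round(exp_covariance(h, k1, k2), 4))
```

At h = 0 the variogram vanishes and the covariance equals the sill. As h grows, the variogram approaches k₂ and the covariance decays towards zero.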

The variogram model is developed using spatial data in conjunction with an empirical variogram estimator, which captures the degree of spatial dependence between observations across different distances. In this study, we adopt the empirical variogram estimator proposed by Matheron [41], a widely recognized and commonly applied approach in geostatistics.
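For reference, Matheron's classical estimator averages half the squared differences over all pairs of sites whose separation falls within a lag class. The Python sketch below implements that definition with simple distance bins. The bin edges, the toy coordinates and values, and the function name are assumptions; in a DK setting the values passed in would be trend residuals rather than raw observations.

```python
import numpy as np

def empirical_variogram(coords, values, bin_edges):
    """Matheron estimator: gamma(h) = sum of (z_i - z_j)^2 over N(h), divided by 2|N(h)|.

    Returns mean lag, semivariance, and pair count for each non-empty bin.
    """
    coords = np.asarray(coords, dtype=float)
    values = np.asarray(values, dtype=float)
    i, j = np.triu_indices(len(values), k=1)          # each pair once
    dist = np.linalg.norm(coords[i] - coords[j], axis=1)
    sqdiff = (values[i] - values[j]) ** 2
    lags, gammas, counts = [], [], []
    for lo, hi in zip(bin_edges[:-1], bin_edges[1:]):
        mask = (dist >= lo) & (dist < hi)
        if mask.any():                                # skip empty lag classes
            lags.append(dist[mask].mean())
            gammas.append(0.5 * sqdiff[mask].mean())
            counts.append(int(mask.sum()))
    return np.array(lags), np.array(gammas), np.array(counts)

# Toy example (assumed): three collinear sites.
coords = [[0.0, 0.0], [1.0, 0.0], [2.0, 0.0]]
values = [1.0, 2.0, 4.0]
lags, gammas, counts = empirical_variogram(coords, values, [0.0, 1.5, 2.5])
print(lags, gammas, counts)
```

The parameters k₁ and k₂ of Equation (24) would then be fitted to the resulting (lag, semivariance) pairs, for example by least squares.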

Theorem 4.

The DK system in Equation (14) admits a unique solution.

Proof.

From Equation (6), the covariance function can be derived as follows:

C (h) = k_{2} - [k_{2} (1 - e^{- h / k_{1}})] = k_{2} e^{- h / k_{1}} .

(25)

According to Example 1, with a = k₁ and multiplication by the positive constant k₂, the function $f (h) = k_{2} e^{- \sqrt{h} / k_{1}}$ is a completely monotone function and not constant on $[0, \infty)$. According to Schoenberg’s Theorem, for any distinct points $s_{1}, \dots, s_{n}$, the matrix $C = [f (∥ s_{i} - s_{j} ∥^{2})] = [C (s_{i} - s_{j})]$ in Equation (14) is strictly positive definite. Since the matrix FF^T is always positive definite in the sense of Definition 1, it follows that the sum C + FF^T is strictly positive definite. Furthermore, Theorem 3 assures that rank(F) = 1 + m + η = L + 1. Combining these two facts, Theorem 2 applies and ensures that the coefficient matrix in the dual kriging system (Equation (14)) is nonsingular.

Consequently, the DK system admits a unique solution. □

5. Application to PM_2.5 Spatial Estimation in Northern Thailand

5.1. Study Area and Data Description

In this study, fine particulate matter (PM_2.5) pollution over northern Thailand is investigated. This region is characterized by its mountainous landscape and persistent air quality issues, particularly during the dry season [42,43]. Northern Thailand consists of 15 provinces and is influenced by a mix of complex topography, intensive agricultural practices, and seasonal burning activities, which together contribute to severe haze events and transboundary air pollution [44,45]. As illustrated in Figure 1, the study area encompasses several provinces in the northern part of the country. The inset map provides a national reference, while the enlarged panel highlights the specific provinces included in the analysis, shown in orange. Fifteen monitoring stations, each corresponding to a province and marked with green circles, were employed to collect data on PM_2.5 concentrations along with associated meteorological variables.

Model accuracy was assessed using data collected during March and April of both 2023 and 2024. PM_2.5 concentration data (μg/m³) were obtained from the Pollution Control Department (PCD), while meteorological variables, including air pressure (hPa) and relative humidity (%), were retrieved from the Thai Meteorological Department (TMD). All datasets were aggregated weekly to reduce short-term variability and highlight broader spatial and temporal patterns.

The relationship between PM_2.5 concentrations and meteorological conditions was investigated using the distance correlation coefficient (DCC). This method enabled the identification of spatial dependencies between air pollution levels and meteorological factors across the study region.

For model validation, the 15 monitoring stations were divided into two subsets. Twelve stations were used for model training, while the remaining three served as an independent test set. This partitioning allowed for an objective evaluation of the model’s predictive performance and ensured a reliable basis for spatial interpolation of PM_2.5 in areas lacking direct observations.

5.2. Performance Metrics and Quantitative Evaluation

This study compares the accuracy of the DK constructed using three different trend functions. The first model (DK–POLY) utilizes a second-order polynomial trend, with its parameters estimated via ordinary least squares (OLS). The second approach (DK–RBFP) incorporates a non-linear trend function based on the GRBF combined with a first-order polynomial function. In this method, the mean and scale parameters of the GRBF are estimated using the methods described in Equations (19) and (20), respectively, while the remaining parameters are estimated using OLS. The third model (DK–RBFPGA) also employs a GRBF-based trend combined with a first-order polynomial function, where the mean parameters are estimated using the approach in Equation (19), and the remaining parameters are optimized using the GA.

The three models were compared using k-fold cross-validation [46]. The dataset was randomly divided into five equal subsets (folds). In each of the five iterations, one fold served as the validation set and the remaining four were used for model training, so that each fold was used for validation exactly once. Model accuracy was then computed by averaging over all iterations. Three metrics were used to assess model performance, namely the mean absolute percentage error (MAPE) [10], the mean squared error (MSE) [47], and the root mean square error (RMSE) [10], defined as follows:

MAPE = \frac{100\%}{n} \sum_{i = 1}^{n} \left| \frac{Z (s_{i}) - \hat{Z} (s_{i})}{Z (s_{i})} \right|,

(26)

MSE = \frac{1}{n} \sum_{i = 1}^{n} {(Z (s_{i}) - \hat{Z} (s_{i}))}^{2},

(27)

RMSE = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(Z (s_{i}) - \hat{Z} (s_{i}))}^{2}},

(28)

where Z(s_i) denotes the observed value at location s_i, \hat{Z}(s_i) is the estimated value at location s_i, and n represents the number of observations.
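The three metrics in Equations (26)–(28) and the five-fold split can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions rather than the code used in the study: the function names, the random seed, and the toy values are hypothetical. Note that MAPE is undefined whenever an observed value equals zero.

```python
import numpy as np

def mape(z, z_hat):
    """Mean absolute percentage error, Equation (26); requires nonzero observations."""
    z, z_hat = np.asarray(z, dtype=float), np.asarray(z_hat, dtype=float)
    return 100.0 * np.mean(np.abs((z - z_hat) / z))

def mse(z, z_hat):
    """Mean squared error, Equation (27)."""
    z, z_hat = np.asarray(z, dtype=float), np.asarray(z_hat, dtype=float)
    return np.mean((z - z_hat) ** 2)

def rmse(z, z_hat):
    """Root mean square error, Equation (28)."""
    return np.sqrt(mse(z, z_hat))

def k_fold_indices(n, k=5, seed=0):
    """Randomly partition the indices 0..n-1 into k folds of (nearly) equal size."""
    rng = np.random.default_rng(seed)
    return np.array_split(rng.permutation(n), k)

# Toy observed and predicted concentrations (hypothetical values, in ug/m^3).
z_obs = [40.0, 50.0, 80.0]
z_pred = [44.0, 45.0, 80.0]
scores = (mape(z_obs, z_pred), mse(z_obs, z_pred), rmse(z_obs, z_pred))

# With 15 stations and 5 folds, each fold holds 3 validation stations.
folds = k_fold_indices(15, k=5)
for i, val_idx in enumerate(folds):
    # The training set is the union of the other four folds.
    train_idx = np.concatenate([f for j, f in enumerate(folds) if j != i])
```

In a full evaluation, each model would be fitted on `train_idx`, evaluated on `val_idx`, and the resulting metric values averaged over the five iterations.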

5.3. The Distance Correlation Coefficient

The distance correlation coefficient (DCC), introduced by Székely et al. [48], measures the statistical dependence between two random vectors, U \in R^{p} and V \in R^{q}. It has proven to be a flexible and effective statistical measure, with recent research highlighting its broad applicability and extension across various fields [49,50,51,52,53]. Its strength lies in its ability to detect both linear and nonlinear relationships, making it particularly well-suited for analyzing complex and high-dimensional data [54].

Given a sample of n paired observations {(U_{i}, V_{i})}_{i = 1}^{n}, the calculation is carried out as follows [48,55]:

First, the empirical distance covariance, dC(U, V), is computed as:

d C (U, V) = \frac{1}{n} \sqrt{\sum_{i = 1}^{n} \sum_{j = 1}^{n} A_{i j} B_{i j}}

(29)

where A_ij and B_ij are the entries of the doubly centered distance matrices of U and V, respectively, obtained by double-centering the pairwise Euclidean distance matrices of the two samples. Specifically, the entries of the centered distance matrix for U are given by

A_{i j} = ∥ U_{i} - U_{j} ∥_{R^{p}} - \frac{1}{n} \sum_{l = 1}^{n} ∥ U_{i} - U_{l} ∥_{R^{p}} - \frac{1}{n} \sum_{k = 1}^{n} ∥ U_{k} - U_{j} ∥_{R^{p}} + \frac{1}{n^{2}} \sum_{k = 1}^{n} \sum_{l = 1}^{n} {∥ U_{k} - U_{l} ∥}_{R^{p}}

(30)

Similarly, the centered distance matrix for V is defined as

B_{i j} = ∥ V_{i} - V_{j} ∥_{R^{q}} - \frac{1}{n} \sum_{l = 1}^{n} ∥ V_{i} - V_{l} ∥_{R^{q}} - \frac{1}{n} \sum_{k = 1}^{n} ∥ V_{k} - V_{j} ∥_{R^{q}} + \frac{1}{n^{2}} \sum_{k = 1}^{n} \sum_{l = 1}^{n} {∥ V_{k} - V_{l} ∥}_{R^{q}}

(31)

In these equations, the notations ∥ \cdot ∥_{R^{p}} and ∥ \cdot ∥_{R^{q}} denote the Euclidean norm in p-dimensional and q-dimensional spaces, respectively.

Next, the empirical distance variances for U and V are calculated as

d V (U) = \frac{1}{n} \sqrt{\sum_{i = 1}^{n} \sum_{j = 1}^{n} A_{i j}^{2}}

(32)

and

d V (V) = \frac{1}{n} \sqrt{\sum_{i = 1}^{n} \sum_{j = 1}^{n} B_{i j}^{2}} .

(33)

Finally, the empirical distance correlation coefficient is given by

d R (U, V) = \begin{cases} \frac{d C (U, V)}{\sqrt{d V (U) \, d V (V)}}, & \text{if } d V (U) > 0 \text{ and } d V (V) > 0, \\ 0, & \text{otherwise}. \end{cases}

(34)

The distance correlation coefficient dR(U, V) takes values between 0 and 1, where a value of 0 indicates complete independence between U and V, while a value of 1 indicates perfect dependence, regardless of whether the relationship is linear or nonlinear.
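The computation in Equations (29)–(34) can be sketched directly with NumPy. This is a minimal illustration of the sample statistic as defined above, not the code used in the study; the function names and toy data are assumptions for the example.

```python
import numpy as np

def centered_distances(X):
    """Doubly centered pairwise Euclidean distance matrix, as in Equations (30) and (31)."""
    X = np.asarray(X, dtype=float)
    if X.ndim == 1:
        X = X[:, None]   # treat a 1-D sample as n observations of a scalar
    D = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    # Subtract row means and column means, then add back the grand mean.
    return D - D.mean(axis=1, keepdims=True) - D.mean(axis=0, keepdims=True) + D.mean()

def distance_correlation(U, V):
    """Empirical distance correlation dR(U, V), Equation (34)."""
    A = centered_distances(U)
    B = centered_distances(V)
    n = A.shape[0]
    # max(..., 0) guards against a tiny negative sum caused by rounding.
    d_cov = np.sqrt(max((A * B).sum(), 0.0)) / n   # Equation (29)
    d_var_u = np.sqrt((A * A).sum()) / n           # Equation (32)
    d_var_v = np.sqrt((B * B).sum()) / n           # Equation (33)
    if d_var_u > 0 and d_var_v > 0:
        return d_cov / np.sqrt(d_var_u * d_var_v)
    return 0.0

# Toy data: a symmetric sample and a purely quadratic response.
u = np.linspace(-1.0, 1.0, 21)
v_quadratic = u ** 2
pearson = np.corrcoef(u, v_quadratic)[0, 1]
dcc_quadratic = distance_correlation(u, v_quadratic)
dcc_linear = distance_correlation(u, 2.0 * u + 1.0)
```

In this toy case the Pearson correlation is zero up to rounding because the sample is symmetric about zero, whereas the distance correlation is clearly positive, which illustrates why the DCC is suited to screening auxiliary variables whose relationship with the target may be nonlinear. An exactly linear relationship gives a value of 1.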

5.4. Results

The trend function of DK relies on auxiliary variables, which must exhibit a relationship with the target variable, in this case, the PM_2.5 concentration. As such, it is essential to quantify the relationship between the candidate auxiliary variables and the PM_2.5 concentration. In this study, the distance correlation coefficient was employed to evaluate this relationship, with air pressure and relative humidity identified as the candidate auxiliary variables.

Table 1 and Table 2 present the weekly distance correlations between PM_2.5 concentrations and two auxiliary meteorological variables, namely air pressure and relative humidity, during March and April for the years 2023 and 2024, respectively.

Table 1 reports the correlation results for 2023. Air pressure was moderately associated with PM_2.5 concentrations, with average correlation values of 0.464 in March and 0.481 in April. The correlation peaked at 0.737 in Week 4 of April, which suggests that air pressure may have a short-term association with PM_2.5 levels. In contrast, relative humidity showed slightly weaker correlations, with average values of 0.402 in March and 0.453 in April, reflecting a comparatively lower association with PM_2.5 during this period.

In comparison, Table 2 shows that the 2024 results exhibited generally stronger correlations, particularly for relative humidity. In March 2024, the correlation between relative humidity and PM_2.5 climbed progressively from Week 1 to Week 4, reaching a high of 0.905 and an overall monthly average of 0.700. Although the average correlation dropped to 0.528 in April, it remained higher than the values observed in 2023. Air pressure also showed stronger correlations in March 2024, averaging 0.534, though the average decreased to 0.452 in April. These findings highlight that the association between meteorological factors and PM_2.5 concentrations can vary from year to year and suggest that relative humidity may play a more significant role under certain atmospheric conditions.

The results indicate that the correlations between PM_2.5 concentrations and the auxiliary variables, air pressure and relative humidity, vary across different periods and years. Relative humidity shows the stronger association in 2024, most notably in March, whereas in 2023 air pressure was slightly more strongly correlated with PM_2.5. Both variables exhibit at least a moderate relationship with PM_2.5 on average, although the weekly values fluctuate noticeably over time.

Figure 2, Figure 3 and Figure 4 present a comparative evaluation of the predictive capabilities of DK with three different trend models: DK–POLY, DK–RBFP, and DK–RBFPGA. The assessment focuses on their performance during March and April of both 2023 and 2024. Each figure highlights a specific error metric, including the MAPE, MSE, and RMSE, to assess the accuracy of each model.

Figure 2 presents a comparison of the MAPE values for the three models across the specified months and years. In 2023, DK–POLY recorded the highest MAPE, indicating lower predictive accuracy compared to the other models. Conversely, both DK–RBFP and DK–RBFPGA demonstrated superior performance, with DK–RBFPGA showing a marginal advantage. A consistent trend of improvement was observed in 2024, as the MAPE values decreased for all three models. Throughout this period, DK–RBFPGA consistently yielded the lowest MAPE, indicating its enhanced reliability in prediction.

A comparison of the MSE for the same models and time periods is shown in Figure 3. In 2023, DK–POLY showed a significantly larger MSE, which was consistent with the MAPE results and confirmed its lower accuracy. DK–RBFPGA, on the other hand, had the lowest MSE, suggesting the most accurate predictions. In 2024, the MSE dropped for all methods, but DK–POLY continued to exhibit the largest error and DK–RBFPGA retained the lowest, further demonstrating its superior prediction accuracy.

The RMSE for the three DK approaches over the same time periods is compared in Figure 4. DK–POLY had the greatest RMSE in 2023, while the DK–RBFP and DK–RBFPGA approaches demonstrated reduced errors, with DK–RBFPGA performing slightly better. These results are consistent with the MAPE and MSE results. The trend of reduced error continued in 2024, with all methods exhibiting lower RMSE values. However, DK–POLY again presented the highest RMSE, while DK–RBFPGA again achieved the lowest, highlighting its improved accuracy.

In summary, Figure 2, Figure 3 and Figure 4 consistently demonstrate that DK–POLY produced the least accurate predictions, with significantly higher MAPE, MSE, and RMSE values, particularly in 2023. In contrast, DK–RBFPGA achieved the highest prediction accuracy, consistently yielding the lowest error metrics for both 2023 and 2024. Furthermore, over the two-year period, the average MAPE, MSE, and RMSE values for DK–RBFPGA improved by 9.0%, 19.8%, and 8.7%, respectively, compared to DK–RBFP. Nevertheless, the MAPE values for both DK–RBFP and DK–RBFPGA, ranging from 18.5% to 30.7%, indicate relatively high prediction errors. This may be attributed to the limited number of data points, which restricts the development of reliable trend and variogram models.

Based on the results indicating that the DK–RBFPGA method provides the highest prediction accuracy, its estimates were subsequently used to generate maps illustrating the spatial distribution of PM_2.5 for Weeks 1 to 4 of March and April in both 2023 and 2024. The data used for this mapping were taken from fold 2. The maps were produced using QGIS (Quantum Geographic Information System) software, version 3.40.4, with the study area divided into a grid of square cells, each measuring 0.01 degrees per side. The resulting spatial distribution patterns of weekly mean PM_2.5 concentrations are presented in Figure 5, Figure 6, Figure 7 and Figure 8.
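For readers who wish to reproduce a comparable prediction grid outside QGIS, the following Python sketch enumerates the centers of square cells measuring 0.01 degrees per side over a bounding box. It is an assumption-laden illustration: the bounding box below is a small placeholder rather than the extent of the study area, and `make_grid` is a hypothetical helper, not part of the workflow described above.

```python
import numpy as np

def make_grid(lon_min, lon_max, lat_min, lat_max, cell=0.01):
    """Return an (N, 2) array of (longitude, latitude) cell centers for square cells."""
    n_lon = int(round((lon_max - lon_min) / cell))
    n_lat = int(round((lat_max - lat_min) / cell))
    # Cell centers sit half a cell inside the lower-left corner of each cell.
    lons = lon_min + cell * (np.arange(n_lon) + 0.5)
    lats = lat_min + cell * (np.arange(n_lat) + 0.5)
    lon_grid, lat_grid = np.meshgrid(lons, lats)
    return np.column_stack([lon_grid.ravel(), lat_grid.ravel()])

# Placeholder bounding box (hypothetical): 0.10 x 0.05 degrees gives 10 x 5 cells.
grid = make_grid(98.0, 98.1, 18.0, 18.05, cell=0.01)
```

Each row of `grid` is a location at which a fitted model would be evaluated to produce one cell of the map.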

Figure 5 illustrates the spatial distribution of weekly mean PM_2.5 concentrations across northern Thailand for March 2023. During the first two weeks (Figure 5a,b), moderate concentration levels were observed in most areas, with slightly elevated values in the central region during Week 1 and in both the central and northwestern regions during Week 2. In Week 3, there was a significant reduction in PM_2.5 concentrations (Figure 5c). However, concentrations increased again in Week 4 (Figure 5d), particularly in the northwestern part of the region.

The spatial distribution of weekly mean PM_2.5 concentrations for April 2023 is depicted in Figure 6. High concentrations were observed in the northern and northeastern areas during Week 1 (Figure 6a). PM_2.5 levels declined in Week 2 (Figure 6b), and this downward trend continued through Weeks 3 and 4 (Figure 6c,d). By Week 4, concentrations had significantly decreased across the region.

Figure 7 presents the spatial distribution of weekly mean PM_2.5 concentrations for March 2024. During Week 1 (Figure 7a), concentrations were generally low across the region, although some moderate hotspots appeared in the central part. Week 2 (Figure 7b) shows an increase in PM_2.5 levels, particularly in the central and northwestern regions. In Week 3 (Figure 7c), concentrations rose slightly compared to Week 2. However, by Week 4 (Figure 7d), levels had returned to lower values.

Figure 8 shows the spatial distribution of weekly mean PM_2.5 concentrations over northern Thailand for April 2024. In Week 1 (Figure 8a), concentrations were mostly moderate, with elevated values particularly in the northwestern region. A decrease in PM_2.5 concentrations was observed across most areas in Week 2 (Figure 8b), and this trend persisted through Week 3 (Figure 8c). However, in Week 4 (Figure 8d), concentrations increased slightly compared to the preceding week.

Based on Figure 5 and Figure 6, PM_2.5 concentrations were significantly elevated during Week 4 of March 2023, as well as Weeks 1 and 2 of April 2023. These increases were likely influenced by forest fires and cross-border agricultural burning. This interpretation aligns with previous studies that identified March and April as peak periods for biomass burning and transboundary haze transport in northern Thailand [56,57]. In contrast, as shown in Figure 7 and Figure 8, air quality in 2024 showed notable improvement compared to 2023. This improvement is primarily attributed to the systematic implementation of the national action plan on combating airborne dust. Key measures included reducing open burning, monitoring black smoke emissions, strengthening the enforcement of environmental laws, and enhancing regional cooperation to address transboundary haze. These coordinated efforts contributed to more tangible progress in controlling air pollution [58].

6. Conclusions and Discussion

This study improves the dual kriging (DK) method by introducing a novel nonlinear trend function that combines the Gaussian radial basis functions (GRBFs) with the first-order polynomial, resulting in two enhanced models: DK–RBFP and DK–RBFPGA. The proposed trend structure enhances the model’s ability to capture complex spatial patterns. Theoretical support for the uniqueness of the DK solution is provided by the complete monotonicity of the covariance function and the positive definiteness of the coefficient matrix. To further increase flexibility and accuracy, the DK–RBFPGA model employs k-means clustering to identify GRBF centers and utilizes a genetic algorithm to optimize the bandwidth and other trend parameters. When tested on the spatial interpolation of PM_2.5 concentrations in northern Thailand, both enhanced models consistently outperformed the standard polynomial-based DK model, achieving significantly lower MAPE, MSE, and RMSE values. These results highlight the advantages of incorporating nonlinear trend structures and data-driven parameter optimization, which together provide a more accurate and flexible framework for environmental data analysis and spatial prediction.

Although the proposed trend model in the DK method has demonstrated strong performance in this application, its effectiveness relies on the appropriate selection of the number of clusters, which determines the number of GRBFs used in the trend function. This selection should be suited to the characteristics and quantity of the data. Moreover, accurate estimation of the GRBF parameters, including the centers and bandwidths, is crucial for achieving effective spatial interpolation.

As seen in this case study, DK–RBFPGA achieves greater predictive accuracy than DK–RBFP. However, employing the GA to estimate the trend parameters incurs a much higher computational cost than the least squares method used in DK–RBFP, especially for large volumes of data. The least squares method relies on direct matrix calculations, which are efficient and require little memory. The GA, by contrast, performs an iterative population-based search with repeated fitness evaluations, in which the algorithm assesses how well each candidate solution solves the problem, making it much slower and more costly in terms of resources. The choice between the two methods is therefore a trade-off between higher predictive accuracy and computational efficiency.
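The cost difference can be made concrete with a deliberately small sketch. The code below is not the GA used in DK–RBFPGA: it is a simplified illustration that searches over a single bandwidth `sigma` only, with an OLS fit inside each fitness evaluation, and every name, operator choice (elitism, blend crossover, Gaussian mutation), and data value in it is an assumption made for the example.

```python
import numpy as np

def sse_for_sigma(sigma, x, z, centers):
    """Fitness: residual sum of squares of an OLS trend fit for a given bandwidth."""
    rbf = np.exp(-((x[:, None] - centers[None, :]) ** 2) / (2.0 * sigma ** 2))
    F = np.column_stack([rbf, np.ones_like(x), x])
    coef, *_ = np.linalg.lstsq(F, z, rcond=None)
    return float(((z - F @ coef) ** 2).sum())

def tiny_ga(fitness, low, high, pop_size=20, generations=30, seed=0):
    """Minimize `fitness` over [low, high] with elitism, blend crossover, and mutation."""
    rng = np.random.default_rng(seed)
    pop = rng.uniform(low, high, size=pop_size)
    history = []
    for _ in range(generations):
        scores = np.array([fitness(s) for s in pop])   # one OLS solve per individual
        order = np.argsort(scores)
        history.append(scores[order[0]])
        elite = pop[order[: pop_size // 2]]            # keep the better half unchanged
        n_children = pop_size - elite.size
        p1 = rng.choice(elite, size=n_children)
        p2 = rng.choice(elite, size=n_children)
        w = rng.uniform(size=n_children)
        children = w * p1 + (1.0 - w) * p2             # blend crossover
        children += rng.normal(0.0, 0.05 * (high - low), n_children)  # mutation
        pop = np.concatenate([elite, np.clip(children, low, high)])
    scores = np.array([fitness(s) for s in pop])
    best = int(np.argmin(scores))
    history.append(scores[best])
    return pop[best], history

# Toy 1-D data generated with a bandwidth of 0.2 (hypothetical values).
x = np.linspace(0.0, 1.0, 12)
centers = np.array([0.3, 0.7])
z = 2.0 * np.exp(-((x - 0.3) ** 2) / (2.0 * 0.2 ** 2)) + 1.0 + 0.5 * x

best_sigma, history = tiny_ga(lambda s: sse_for_sigma(s, x, z, centers), 0.05, 1.0)
```

With these settings the search performs 20 x 31 = 620 least squares solves, compared with a single solve when the bandwidth is fixed in advance, which is the kind of overhead the preceding paragraph describes.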

Furthermore, the proposed models may be adapted for real-time prediction by using a local neighborhood approach that decreases the computational complexity for large datasets. In this method, the prediction at a given location is based only on the closest observations, which drastically reduces the amount of data that needs to be processed. This approach helps the models remain responsive under the data volume and velocity requirements that are typical of real-time systems.
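A minimal sketch of the neighbor selection step is given below. It assumes plain Euclidean distance on the station coordinates and a fixed neighborhood size `k`; the helper name and the toy coordinates are hypothetical, and the paper does not prescribe a particular search strategy.

```python
import numpy as np

def nearest_neighbours(target, coords, k):
    """Indices of the k observations closest to `target`, ordered by increasing distance."""
    coords = np.asarray(coords, dtype=float)
    dist = np.linalg.norm(coords - np.asarray(target, dtype=float), axis=1)
    k = min(k, dist.size)
    # argpartition finds the k smallest distances without sorting the whole array.
    idx = np.argpartition(dist, k - 1)[:k]
    return idx[np.argsort(dist[idx])]

# Toy coordinates (hypothetical) and a prediction location at the origin.
coords = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 2.0], [5.0, 5.0], [0.5, 0.5]])
local_idx = nearest_neighbours([0.0, 0.0], coords, k=3)
```

The prediction at the target location would then be computed from a system assembled from the `k` selected observations only, so the size of the linear system depends on `k` rather than on the total number of observations.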

In future work, we aim to explore alternative variogram models and nonlinear trend functions that guarantee the uniqueness of the DK solution and to extend our approach for broader applicability across diverse data types. In addition, further research will focus on the computational complexity and asymptotic properties of parameter estimation for the trend function.

Author Contributions

Conceptualization, S.U. and S.M.; methodology, S.U. and S.M.; software, S.M.; validation, S.M.; formal analysis, S.U., P.C. and S.M.; investigation, S.U. and S.M.; resources, S.M.; data curation, S.M.; writing—original draft preparation, S.U., P.C. and S.M.; writing—review and editing, S.U. and S.M.; visualization, S.M.; supervision, S.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Fundamental Fund 2025, Chiang Mai University.

Data Availability Statement

The data utilized in this study were sourced from the Pollution Control Department (PCD) and the Thai Meteorological Department (TMD).

Acknowledgments

This research was partially supported by Chiang Mai University and the Fundamental Fund 2025, Chiang Mai University.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

OK	Ordinary kriging
KED	Kriging with an external drift
DK	Dual kriging
RBF	Radial basis function
GRBFs	Gaussian radial basis functions
GA	Genetic algorithm
DCC	Distance correlation coefficient
MAPE	Mean absolute percentage error
MSE	Mean squared error
RMSE	Root mean square error
OLS	Ordinary least squares
PCD	Pollution Control Department
TMD	Thai Meteorological Department

DK–POLY	Dual kriging with a second-order polynomial trend
DK–RBFP	Dual kriging with a trend function based on the GRBF and a first-order polynomial
DK–RBFPGA	DK–RBFP with parameters estimated using a genetic algorithm

References

1. Wackernagel, H. Multivariate Geostatistics: An Introduction with Applications; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2003.
2. Chaveesuk, R.; Smith, A.E. Dual Kriging: An exploratory use in economic metamodeling. Eng. Econ. 2005, 50, 247–271.
3. Erdogan Erten, G.; Yavuz, M.; Deutsch, C.V. Combination of machine learning and kriging for spatial estimation of geological attributes. Nat. Resour. Res. 2022, 31, 191–213.
4. Gravier, V.; Trochu, F.; Aubin, C.-É.; Plouznikoff, A.; Ozell, B. Interpolation by dual kriging of a moving flow front and conservation of the fluid mass. In Fourth Canada-Japan Workshop on Composites; CRC Press: Boca Raton, FL, USA, 2020; pp. 255–262.
5. He, Y.; Sun, J.; Song, P.; Wang, X. Dual Kriging assisted efficient global optimization of expensive problems with evaluation failures. Aerosp. Sci. Technol. 2020, 105, 106006.
6. Trochu, F.; Vernet, N.; Sun, Y.; Echaabi, J.; Makradi, A.; Belouettar, S. Hybrid twin models of fiber compaction for composite manufacturing based on dual kriging. Int. J. Mater. Form. 2022, 15, 36.
7. Kongsanun, C.; Chutsagulprom, N.; Moonchai, S. Spatio-temporal dual kriging with adaptive coefficient drift function. Mathematics 2024, 12, 400.
8. Snepvangers, J.J.J.C.; Heuvelink, G.B.M.; Huisman, J.A. Soil water content interpolation using spatio-temporal kriging with external drift. Geoderma 2003, 112, 253–271.
9. Freier, L.; Wiechert, W.; von Lieres, E. Kriging with trend functions nonlinear in their parameters: Theory and application in enzyme kinetics. Eng. Life Sci. 2017, 17, 916–922.
10. Baisad, K.; Chutsagulprom, N.; Moonchai, S. A non-linear trend function for kriging with external drift using least squares support vector regression. Mathematics 2023, 11, 4799.
11. Rusu, C.; Rusu, V. Radial basis functions versus geostatistics in spatial interpolations. In IFIP International Conference on Artificial Intelligence in Theory and Practice; Springer: Berlin/Heidelberg, Germany, 2006; pp. 119–128.
12. Majdisova, Z.; Skala, V. Radial basis function approximations: Comparison and applications. Appl. Math. Model. 2017, 51, 728–743.
13. Mai-Duy, N.; Tran-Cong, T. Approximation of function and its derivatives using radial basis function networks. Appl. Math. Model. 2003, 27, 197–220.
14. Arora, G.; Bala, K.; Emadifar, H.; Khademi, M. A review of radial basis function with applications explored. J. Egypt. Math. Soc. 2023, 31, 1–14.
15. Kawano, S.; Konishi, S. Nonlinear regression modeling via regularized Gaussian basis functions. Bull. Inform. Cybernet. 2007, 3, 83–96.
16. Karimi, N.; Kazem, S.; Ahmadian, D.; Adibi, H.; Ballestra, L.V. On a generalized Gaussian radial basis function: Analysis and applications. Eng. Anal. Bound. Elem. 2020, 112, 46–57.
17. Jasek, K.; Pasternak, M.; Miluski, W.; Bugaj, J.; Grabka, M. Application of Gaussian radial basis functions for fast spatial imaging of ground penetration radar data obtained on an irregular grid. Electronics 2021, 10, 2965.
18. Shawe-Taylor, J.; Sun, S. Kernel methods and support vector machines. In Academic Press Library in Signal Processing; Elsevier: Amsterdam, The Netherlands, 2014; Volume 1, pp. 857–881.
19. Awad, M.; Khanna, R. Support vector machines for classification. In Efficient Learning Machines: Theories, Concepts, and Applications for Engineers and System Designers; Springer: Berlin/Heidelberg, Germany, 2015; pp. 39–66.
20. Orr, M.J.L. Introduction to Radial Basis Function Networks; Technical Report; Center for Cognitive Science, University of Edinburgh: Edinburgh, UK, 1996.
21. Rippa, S. An algorithm for selecting a good value for the parameter c in radial basis function interpolation. Adv. Comput. Math. 1999, 11, 193–210.
22. Fasshauer, G.E.; Zhang, J.G. On choosing “optimal” shape parameters for RBF approximation. Numer. Algorithms 2007, 45, 345–368.
23. Wang, J.G.; Liu, G.R. On the optimal shape parameters of radial basis functions used for 2-D meshless methods. Comput. Methods Appl. Mech. Eng. 2002, 191, 2611–2630.
24. Schaback, R. Error estimates and condition numbers for radial basis function interpolation. Adv. Comput. Math. 1995, 3, 251–264.
25. Goldberg, D.E. Genetic Algorithms in Search, Optimization, and Machine Learning; Addison-Wesley: Boston, MA, USA, 1989.
26. Holland, J.H. Genetic algorithms. Sci. Am. 1992, 267, 66–73.
27. Shi, T.; Meng, S. Optimization of nonlinear functions based on improved genetic algorithm. In Proceedings of the Third International Conference on Computer Science and Communication Technology (ICCSCT 2022); SPIE: Bellingham, WA, USA, 2022; Volume 12506, pp. 272–277.
28. Emmanuel, S.; Okoye, I.; Ezenweke, C.; Shobanke, D.; Adeniyi, I. Estimating nonlinear regression parameters using particle swarm optimization and genetic algorithm. FUDMA J. Sci. 2022, 6, 202–213.
29. Onoghojobi, B.; Olewuezi, N.P.; Omojarabi, O. Nonlinear regression parameter estimates using genetic algorithms. J. Niger. Assoc. Math. Phys. 2023, 65, 173–178.
30. Sang, N.D. The genetic algorithm and its application in calculating the kinetic parameters of the thermoluminescence curve. In Genetic Algorithms-Theory, Design and Programming; IntechOpen: London, UK, 2024.
31. Montero, J.-M.; Fernández-Avilés, G.; Mateu, J. Spatial and Spatio-Temporal Geostatistical Modeling and Kriging; John Wiley & Sons: Hoboken, NJ, USA, 2015.
32. Zimmerman, D.L.; Stein, M.; Gelfand, A.E.; Diggle, P.J. Classical geostatistical methods. In Handbook of Spatial Statistics; CRC Press: Boca Raton, FL, USA, 2010; pp. 29–44.
33. Myers, D.E. Smoothing and interpolation with radial basis functions. WIT Trans. Model. Simul. 2024, 23.
34. Cheney, W.; Light, W. A Course in Approximation Theory; Brooks/Cole Publishing Company: Pacific Grove, CA, USA, 1999.
35. Karim, M.A.; Jan, R.M. Matrix Algebra; Cambridge University Press: New York, NY, USA, 2005.
36. Gaspar, B.; Teixeira, A.P.; Soares, C.G. Assessment of the efficiency of kriging surrogate models for structural reliability analysis. Probabilistic Eng. Mech. 2014, 37, 24–34.
37. Kleijnen, J.P.C. Regression and Kriging metamodels with their experimental designs in simulation: A review. Eur. J. Oper. Res. 2017, 256, 1–16.
38. Lark, R.M.; Webster, R. Geostatistical mapping of geomorphic variables in the presence of trend. Earth Surf. Process. Landf. 2006, 31, 862–874.
39. Yoshida, R. Linear Algebra and Its Applications with R; Chapman and Hall/CRC: Boca Raton, FL, USA, 2021.
40. Barua, T.; Hiran, K.K.; Jain, R.K.; Doshi, R. Machine Learning with Python; Walter de Gruyter GmbH & Co., KG: Berlin, Germany, 2024.
41. Matheron, G. Principles of geostatistics. Econ. Geol. 1963, 58, 1246–1266.
42. Junpen, A.; Pansuk, J.; Kamnoet, O.; Cheewaphongphan, P.; Garivait, S. Emission of air pollutants from rice residue open burning in Thailand, 2018. Atmosphere 2018, 9, 449.
43. Jainontee, K.; Pongkiatkul, P.; Wang, Y.-L.; Weng, R.J.F.; Lu, Y.-T.; Wang, T.-S.; Chen, W.-K. Strategy design of PM_2.5 controlling for northern Thailand. Aerosol Air Qual. Res. 2023, 23, 220432.
44. Mueller, W.; Loh, M.; Vardoulakis, S.; Johnston, H.J.; Steinle, S.; Precha, N.; Kliengchuay, W.; Tantrakarnapa, K.; Cherrie, J.W. Ambient particulate matter and biomass burning: An ecological time series study of respiratory and cardiovascular hospital visits in northern Thailand. Environ. Health 2020, 19, 77.
45. Thongtip, S.; Srivichai, P.; Chaitiang, N.; Tantrakarnapa, K. The influence of air pollution on disease and related health problems in northern Thailand. Sains Malays. 2022, 51, 1993–2002.
46. Goud, M.P.; Magadum, A. Artificial Intelligence and Machine Learning; RK Publication: Jalandhar, India, 2024.
47. Carter, N. Data Science for Mathematicians; CRC Press: Boca Raton, FL, USA, 2020.
48. Székely, G.J.; Rizzo, M.L.; Bakirov, N.K. Measuring and testing independence by correlation of distances. Ann. Stat. 2007, 35, 2769–2794.
49. Hou, J.; Ye, X.; Feng, W.; Zhang, Q.; Han, Y.; Liu, Y.; Li, Y.; Wei, Y. Distance correlation application to gene co-expression network analysis. BMC Bioinform. 2022, 23, 81.
50. Ratnasingam, S.; Muñoz-Lopez, J. Distance correlation-based feature selection in random forest. Entropy 2023, 25, 1250.
51. Rui, D.; Hui, J. High-accuracy transient fuel consumption model based on distance correlation analysis. Fuel 2022, 321, 123927.
52. Park, S.; Moon, J.; Jung, S.; Rho, S.; Hwang, E. Explainable influenza forecasting scheme using DCC-based feature selection. Data Knowl. Eng. 2024, 149, 102256.
53. Ren, Z.; Zhang, J.; Zhou, Y.; Ji, X. Prediction of PM_2.5 with a piecewise affine model considering spatial-temporal correlation. J. Intell. Fuzzy Syst. 2024, 46, 9525–9542.
54. Yao, S.; Zhang, X.; Shao, X. Testing mutual independence in high dimension via distance covariance. J. R. Stat. Soc. Ser. B Stat. Methodol. 2018, 80, 455–480.
55. Martínez-Gómez, E.; Richards, M.T.; Richards, D.S.P. Distance correlation methods for discovering associations in large astrophysical databases. Astrophys. J. 2014, 781, 39.
56. Suriyawong, P.; Chuetor, S.; Samae, H.; Piriyakarnsakul, S.; Amin, M.; Furuuchi, M.; Hata, M.; Inerb, M.; Phairuang, W. Airborne particulate matter from biomass burning in Thailand: Recent issues, challenges, and options. Heliyon 2023, 9, e14261.
57. Inlaung, K.; Chotamonsak, C.; Macatangay, R.; Surapipith, V. Assessment of transboundary PM_2.5 from biomass burning in northern Thailand using the WRF-chem model. Toxics 2024, 12, 462.
58. Pollution Control Department. Air and Noise Pollution Situation and Management in Thailand 2024; Ministry of Natural Resources and Environment: Bangkok, Thailand, 2024. Available online: https://www.pcd.go.th/publication/36289/ (accessed on 5 August 2025). (In Thai)

Figure 1. Distribution of PM_2.5 monitoring stations in northern Thailand.

Figure 2. Comparison of the mean absolute percentage error in dual kriging predictions utilizing three different trend functions for the months of March and April in (a) the year 2023 and (b) the year 2024.

Figure 3. Comparison of the mean squared error in dual kriging predictions using three different trend functions for the months of March and April in (a) the year 2023 and (b) the year 2024.

Figure 4. Average root mean squared error of dual kriging predictions for three different trend functions in March and April: (a) 2023 and (b) 2024.

Figure 5. Spatial distribution maps of weekly mean PM_2.5 concentrations for March 2023: (a) Week 1, (b) Week 2, (c) Week 3, and (d) Week 4.

Figure 6. Spatial distribution maps of weekly mean PM_2.5 concentrations for April 2023: (a) Week 1, (b) Week 2, (c) Week 3, and (d) Week 4.

Figure 7. Spatial distribution maps of weekly mean PM_2.5 concentrations for March 2024: (a) Week 1, (b) Week 2, (c) Week 3, and (d) Week 4.

Figure 8. Spatial distribution maps of weekly mean PM_2.5 concentrations for April 2024: (a) Week 1, (b) Week 2, (c) Week 3, and (d) Week 4.

Table 1. Weekly distance correlation coefficients for PM_2.5 concentrations with air pressure and relative humidity during March and April 2023, based on the complete dataset.

Auxiliary Variable	Month	Week 1	Week 2	Week 3	Week 4	Average
Air Pressure	March 2023	0.522	0.470	0.449	0.414	0.464
Air Pressure	April 2023	0.458	0.393	0.338	0.737	0.481
Relative Humidity	March 2023	0.424	0.440	0.401	0.345	0.402
Relative Humidity	April 2023	0.428	0.490	0.417	0.477	0.453

Table 2. Weekly distance correlation coefficients for PM_2.5 concentrations with air pressure and relative humidity during March and April 2024, based on the complete dataset.

Auxiliary Variable	Month	Week 1	Week 2	Week 3	Week 4	Average
Air Pressure	March 2024	0.446	0.511	0.604	0.574	0.534
Air Pressure	April 2024	0.368	0.542	0.459	0.440	0.452
Relative Humidity	March 2024	0.546	0.587	0.762	0.905	0.700
Relative Humidity	April 2024	0.744	0.499	0.424	0.445	0.528

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).


