In this part, we will cover the linear/nonlinear ROM formulation of the AC equation. First, we briefly outline the conventional PCA method, followed by an explanation of KPCA, the nonlinear counterpart of the PCA. The process involves two steps, starting with vectors from the column space of a data matrix, known as the input space, moving to the reduced space via projection, and then mapping back from the reduced space to the input space to get the reduced approximation. Because the ROM approach we are examining relies on the given data, we will focus in the following sections on the data matrix
where the
jth column vector
represents the (full-order) solution vector of the AC equation at time
, derived from the full discrete system (
5). We make a point to simplify notation by beginning the initial superscript of the snapshot vectors in Equation (
6) at 1 instead of 0. This means that the column vector
in Equation (
6) corresponds to the solution vector
,
, of the AC equation derived from the system (
5).
3.1. Linear Dimension Reduction (PCA)
In the case of a data matrix (
6) containing the solution vectors for the AC equation, PCA aims to find a linear model of a reduced dimension
, which can accurately represent the variance in the vectors
. We assume that the matrix
U has zero column sum; otherwise, we can achieve this by subtracting the average column value
from each column. The covariance matrix
C of the matrix
U is defined by
whose diagonalization is given by
where the column vectors of the orthogonal matrix
represent the eigenvectors related to the (sorted) eigenvalues
lined up on the diagonal entries of the diagonal matrix
. Next, the eigenvectors
of the covariance matrix
C with the
k largest eigenvalues
are considered as the basis of the
k-dimensional linear subspace in a straightforward manner [
18]. Ultimately, any random vector
in the input space can be roughly expressed by the pre-image
as a linear combination of the eigenvectors
with the matrix
being composed of the initial
k columns of the orthogonal matrix
P. The real coefficients
represent the components of the projection vector
, obtained by projecting
onto the smaller linear subspace. The
-dimensional vector
lies in the reduced space, which is the projection of the
m-dimensional vector
from the input space. For more details on PCA, we refer the reader to the works [
9,
10,
12,
13,
14,
20,
21,
26] and references therein.
3.2. Nonlinear Dimension Reduction (KPCA)
PCA is restricted to reducing dimensionality in a linear manner. However, standard PCA may be inefficient when confronted with data featuring intricate structures that cannot be accurately represented within a linear subspace. Yet, KPCA provides a resolution by allowing us to expand the linear PCA for nonlinear dimensionality reduction [
12,
14,
16].
By utilizing KPCA, one maps the vectors from the
m-dimensional input space to a larger
-dimensional (potentially infinite-dimensional) space known as the feature space, through a nonlinear map
. Then, the traditional PCA is subsequently implemented on the vectors within this feature space. In order to achieve this, we label as
the converted data matrix created as
where the columns consist of the converted vectors
corresponding to the input space vectors
. In most cases, the matrix
associated with the arbitrary map
does not necessarily possess zero column sum for PCA. Subtracting the mean
from each column results in a data matrix
with zero column sum, where the columns are
, and H is the centering matrix defined by
where
is the
n-dimensional identity matrix and
is the vector of ones. Next, we apply the standard PCA steps described earlier to the data matrix
, which has columns that represent the feature space. This involves identifying the eigenvectors of the covariance matrix
Currently, the KPCA has two significant disadvantages. In the first place, the size
M could be so large that it becomes nearly impossible to calculate the eigenvectors of the
M-dimensional covariance matrix
. Second, the random nonlinear function
is often inaccessible. To tackle these problems, a kernel trick is utilized [
16,
18]. To comprehend this technique, let us examine the eigenvalue problem
where
represents the eigenpair of the covariance matrix
. It is important to understand that, by definition, the eigenvectors cover the feature space, which is equivalent to the column space of the altered data matrix
. Consequently, there are real coefficients
for each eigenvector
that satisfy the linear combination
By replacing the connection (
11) and the identity (
9) into the Equation (
10), we obtain
All eigenvectors are within the range of the altered vectors
, so for
, we can examine the equivalent equations by projecting onto the vectors
, resulting in
At this stage, a kernel function
is defined so that
with the goal of depicting the Euclidean inner products
of non-centered transformed vectors in the feature space using input space vectors. Different types of kernel functions are utilized in various studies, including linear, polynomial, and Gaussian kernels [
16,
26]. On the other hand, in order to be able to construct a non-iterative solution the formula, it is crucial to take into account a kernel function that can be purely written in terms of the distance metric
. For this reason, we employ the Gaussian kernel defined for any vector
by
where
represents the Euclidean norm (distance metric) and
is a parameter. The Gaussian kernel, also known as the radial basis function kernel, is the most used kernel function among the others, because of its nice properties such as smoothness and easy computation. During the simulations, we take the value of the parameter
so that we have
. Indeed, any positive value of the parameter
can be taken, since the results are invariant under transformation [
27,
28,
29]. This invariance is also demonstrated numerically for the one-dimensional example in the numerical results section.
In addition, to depict the Euclidean inner products
of transformed vectors that are centered in the feature space, we utilize the framework [
14]
where the kernel matrix
and the vector
are defined as
We note that it is not necessary to compute all the vectors
, as they are simply the
ith columns of the symmetric kernel matrix
K. The use of the kernel function results in an equivalent equation to (
12) being
The Equation (
13) can be represented in matrix-vector form as
for the coefficient vector
, where
and
H is the centering matrix given in (
8). We note here that the Gaussian kernel leads to a kernel matrix
K being symmetric and positive definite. As a result, it is guaranteed that the eigenvalue problem (
14) has stable solutions with positive eigenvalues. We also note that since we use only the first
eigenvectors related to the first
k dominant eigenvalue, it no eigenvalue clustering or near-degeneracy issues occur. In simulations, we compute the eigenvalues using the MatLab’s built-in function
eig.
Ultimately, for any vector
in the input space, its transformed vector
in the feature space is found. The projection vector
represents the projection of
onto the reduced
k-dimensional space (
) spanned by the eigenvectors
associated with the largest
k eigenvalues
. After calculating the coefficients
from the eigenvalue problem (
14) and utilizing the identity (
11) of the eigenvectors
, the projection vectors’ entries
can be determined using the kernel function as the following
Reconstructing the original image of any random vector
in the input space can be achieved by approximating its pre-image
using standard PCA as shown in Equation (
7). Nonetheless, KPCA does not follow this pattern. We denote
as the projection of
onto the reduced space spanned by the eigenvectors
, namely,
Next, we can find an estimated pre-image
such that the vector
after transformation is the vector closest to the projected vector
. This involves determining the minimum of the objective functional
using a least-squares approach. By imposing
, we obtain the following equation to be solved
where we set
The implicit Equation (
15) can be solved by fixed-point iteration or other Newton-type nonlinear iterative solvers. But the nonlinear iteration technique can be unstable and depends highly on the initial estimation [
14,
26]. In the KPCA method presented in this study, the pre-image is calculated using a non-iterative approach [
14], which relies on the correlation between the distance of vectors in the input space and the distance of the vectors in the transformed space, employing the Gaussian kernel function as a distance metric function
. By utilizing the MDS concept [
14,
20], we can roughly discover a pre-image
where the differences in distances between
and each input vector
and between projected vector
and each transformed feature vector
remain constant; see
Figure 1. To achieve this goal, let
represent the distance metrics between input space vectors and their transformed feature space vectors. After manipulating the Gaussian kernel function as
=
, we can establish a relation between input space distance and feature space distance, and also its inverse map, as [
14]
In addition, it can be demonstrated using the kernel matrix that the distance in feature space between the projected vector
and a transformed vector
is
for the matrix
Ultimately, by taking
and utilizing the metric relation (
16) within Equation (
15), we can derive the non-iterative solution formula [
14]
where the distance
in the formula is determined through relation (
17). Furthermore, the
k vectors
mentioned in (
18) are chosen from the
n input space vectors that are closest to the given vector
[
14,
26].