A Data-Driven Parameter Prediction Method for HSS-Type Methods

Jiang, Kai; Su, Jianghao; Zhang, Juan

doi:10.3390/math10203789

by Kai Jiang ¹, Jianghao Su ² and Juan Zhang ³,*

¹ Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan 411105, China

² School of Mathematics and Computational Science, Xiangtan University, Xiangtan 411105, China

³ Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education, Xiangtan University, Xiangtan 411105, China

* Author to whom correspondence should be addressed.

Mathematics 2022, 10(20), 3789; https://doi.org/10.3390/math10203789

Submission received: 2 September 2022 / Revised: 5 October 2022 / Accepted: 10 October 2022 / Published: 14 October 2022

(This article belongs to the Special Issue Matrix Equations and Their Algorithms Analysis)


Abstract:

Some matrix-splitting iterative methods for solving systems of linear equations contain parameters that need to be specified in advance, and the choice of these parameters directly affects the efficiency of the corresponding iterative methods. This paper uses a Bayesian inference-based Gaussian process regression (GPR) method to predict the relatively optimal parameters of some HSS-type iteration methods and provides extensive numerical experiments to compare the prediction performance of the GPR method with other existing methods. Numerical results show that, compared with the currently available methods for finding the parameters of the HSS-type iteration methods, using GPR to predict the parameters of the matrix-splitting iterative methods requires less computational effort, predicts better parameters and is more universal.

Keywords:

Gaussian process regression; matrix-splitting iterative method; data-driven method; HSS iteration method; LHSS iteration method; NHSS iteration method; SHSS iteration method; SHSS-SS iteration method; MHSS iteration method; MSNS iteration method; linear systems

MSC:

15A24; 65F45

1. Introduction

Solving linear equations is one of the most fundamental topics in matrix computation, and with the development of science and technology, many important problems in the natural sciences and engineering can often be reduced to the following linear equation:

A x = b,

(1)

where

x, b \in C^{n}

,

A \in C^{n \times n}

is a large sparse non-Hermitian and positive definite matrix.

There are many powerful matrix-splitting iterative methods for solving systems of linear equations, such as the successive over-relaxation (SOR) method [1], the symmetric SOR (SSOR) method [2], the accelerated over-relaxation (AOR) method [3] and the symmetric AOR (SAOR) method [4]. Many researchers have applied them to different problems and made some improvements [5,6,7,8]. To exploit the structure of specific problems and to solve Equation (1) more efficiently, many new matrix-splitting iterative methods have been proposed. Bai et al. proposed the Hermitian and skew-Hermitian splitting (HSS) method and the inexact HSS method [9]. To improve the efficiency of the HSS method, Bai et al. proposed the preconditioned HSS (PHSS) method [10]. Owing to the promising performance of the HSS method, a number of HSS-type iteration methods have been presented. These methods can be divided mainly into two classes. The first class consists of accelerated HSS-type methods, such as the generalized HSS method [11], the lopsided HSS (LHSS) method [12], the generalized PHSS method [13], the asymmetric HSS method [14] and the new HSS (NHSS) method [15]. In addition, Yang et al. proposed the minimum residual HSS method [16] by applying the minimum residual technique to the HSS method, and Li et al. [17] proposed the single-step HSS (SHSS) method. Based on the shift-splitting method and the SHSS method, Li et al. established the SHSS-SS method [18].

Apart from the accelerated HSS-type methods, other HSS-type methods focus on applications to different kinds of problems, such as saddle-point problems [19,20,21,22,23,24], matrix equations [25,26,27,28,29,30,31,32], complex symmetric linear systems [33,34,35] and nonlinear systems [36,37].

These iteration methods contain splitting parameters that need to be specified in advance. At present, there are three main approaches to selecting the splitting parameters. The first is to obtain relatively optimal parameters by traversing or experimenting within some intervals [26,38,39]. The advantage of this traversal method is that it can obtain relatively accurate optimal parameters, but it requires a large amount of computation and consumes a lot of extra time, especially when the dimension of the coefficient matrix is large. The second is to estimate optimal parameters through theoretical analysis [40,41]. Some researchers find optimal parameters by minimizing the spectral radius of the iteration matrix. However, solving this optimization problem is very difficult both in theoretical analysis and in practical computation. Bai et al. [42] proposed an accurate formula for computing the optimal parameters of the HSS method by directly minimizing the spectral radius of the iteration matrix, but only when the coefficient matrix is a two-by-two matrix or a two-by-two block matrix of a specific form. Some researchers find quasi-optimal parameters by minimizing an upper bound of the spectral radius of the iteration matrix of some iteration methods [9,12,17,18]. Using a reasonable and simple optimization principle, Chen [43] proposed an accurate estimate of the optimal parameter of the HSS iteration method. Huang [44] and Yang [45] estimated the optimal parameters of the HSS method by solving a cubic polynomial equation and a quartic polynomial equation, respectively. Huang [46] proposed variable-parameter HSS methods, in which the parameter is updated at each step of the iteration. These theoretical methods have the following limitations. First, each method is available only case by case, which makes it less universal. Second, the methods need to compute the maximum or the minimum eigenvalues of a matrix, which is time-consuming. The third approach, proposed by Jiang et al. [47], is the Bayesian inference-based Gaussian process regression (GPR) method, which predicts the optimal parameters in some alternating direction iterative methods. This method uses a training set to learn a mapping between the dimension of linear systems and relatively optimal splitting parameters.

The choice of splitting parameters can greatly affect the efficiency of the HSS-type iteration methods [47,48], which makes the parameter selection of great importance. For computing the splitting parameters of the HSS-type iteration methods, to overcome the limitations of the traversal method and the theoretical methods, we use the GPR method to predict the splitting parameters of some HSS-type iteration methods. The main contributions of this work are:

We apply the GPR method for the prediction of optimal splitting parameters of some HSS-type methods, which is a new application.
We provide extensive numerical experiments to compare the prediction performance of the GPR method with the traversal method and the theoretical methods.

The results of the numerical experiments show the following. Compared with the traversal method, the GPR method predicts almost the same parameters but with less computational effort. Compared with the theoretical methods, the GPR method predicts better parameters and is more universal (whereas each theoretical method is available only case by case, the GPR method is suitable for all the HSS-type iteration methods tested). Moreover, the theoretical methods need to compute the maximum or the minimum eigenvalues (or singular values) of a matrix, which is time-consuming when the dimension of the matrix is large; the GPR method overcomes this limitation.

The rest of the paper is organized as follows. In Section 2, we present the Gaussian process regression method based on Bayesian inference. In Section 3, we present the iteration schemes of some HSS-type iteration methods, the corresponding convergence conditions and the theoretical methods for estimating the relatively optimal splitting parameters. In Section 4, we illustrate the efficiency of the GPR method by numerical experiments. Finally, in Section 5, we include some concluding remarks and prospects.

Throughout the paper, the sets of

n \times n

complex and real matrices are denoted by

C^{n \times n}

and

R^{n \times n}

, respectively. If

X \in C^{n \times n}

, let

X^{T}

,

X^{- 1}

,

X^{*}

,

{∥X∥}_{2}

,

{∥X∥}_{F}

denote the transpose, inverse, conjugate transpose, the Euclidean norm and Frobenius norm of X, respectively. The notations

λ (X) = (λ_{1} (X), λ_{2} (X), \dots, λ_{n} (X))

,

σ (X) = (σ_{1} (X), σ_{2} (X), \dots, σ_{n} (X))

,

ρ (X)

denote the eigenvalue set, singular value set and spectral radius of X, respectively. The symbol ⊗ denotes the Kronecker product. I represents the identity matrix.

2. Gaussian Process Regression Method

In this section, we present a Gaussian process regression method based on Bayesian inference. Gaussian process regression is an application of non-parametric Bayesian estimation to regression problems and has a wide range of applications in the field of machine learning.

2.1. Bayesian Inference

Bayesian inference is a method that infers the population distribution or the characteristic number of the population according to the sample information and the prior information. Prior information is information about a statistical problem that is available before sampling. Taking the inference of the distribution of an unknown quantity

θ

as an example, Bayesian inference believes that

θ

can be regarded as a random variable before obtaining the sample information, so it can be described by a probability distribution and this distribution is a prior distribution. After obtaining the samples, the population distribution, samples and the prior distribution are combined by Bayesian formula to obtain a new distribution about the unknown quantity

θ

, which is a posterior distribution. We can find that the process of Bayesian inference is essentially the process of updating prior information through sample information.
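As a reminder of the standard formula behind this update (the symbols π (θ) for the prior density, p (D ∣ θ) for the likelihood of the sample D and p (θ ∣ D) for the posterior density are notation assumed here for illustration only), the Bayesian formula can be written as

p (θ ∣ D) = \frac{p (D ∣ θ) π (θ)}{\int p (D ∣ θ') π (θ') d θ'} \propto p (D ∣ θ) π (θ) .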

The Bayesian inference-based Gaussian process regression method is Bayesian inference with Gaussian process as a prior distribution. The definition of Gaussian process is given below.

Definition 1.

Gaussian process (GP) is a collection of random variables

{X_{t}, t \in T}

and for any finite subset

{t_{1}, t_{2}, \dots, t_{k}}

of T,

(X_{t_{1}}, X_{t_{2}}, \dots, X_{t_{k}})

follows the joint Gaussian distribution.

In our question, we expect to obtain a mapping

f (\cdot)

from the dimension of linear systems to relatively optimal splitting parameters, so that for each given dimension

n_{*}

, the mapping can output the corresponding relatively optimal splitting parameter

α_{*} = f (n_{*})

. To this end, this paper uses the GPR to fit this mapping

f (n)

.

2.2. Model Building

Following the steps of Bayesian inference, firstly, we give prior information. For each given

n \in N^{+}

, the corresponding optimal splitting parameter

α

is a random variable, and we denote

α

as

f (n)

to reflect the correspondence between

α

and n. Considering that, in general, the observed

α

would be polluted by additive noise, the observed

α

may not be exactly equal to

f (n)

. It is

α = f (n) + η,

(2)

where

η

is the additive noise and we assume that

η

follows a Gaussian distribution with zero mean and variance

σ^{2}

, i.e.,

η \sim N (0, σ^{2})

. The desirable range of

σ

is

10^{- 6} \leq σ \leq 10^{- 2}

. In this work, we take

σ = 10^{- 4}

. We also assume that

f (n)

are independent of each other. Obviously, our task is to obtain

f (\cdot)

.

Considering random process

{f (n), n \in N^{+}}

and assuming this process is a GP, we have

f (n) \sim G P (μ (n), k (n, n)),

where

μ (\cdot)

and

k (\cdot, \cdot)

denote mean function and covariance function of the GP

f (n)

, respectively. Once

μ (\cdot)

and

k (\cdot, \cdot)

have been determined, the GP is also determined. The selection of

μ (\cdot)

and

k (\cdot, \cdot)

is shown in Section 2.3.

Next, we obtain sample information. Assume that we have a training set

D = \{(n_{i}, α_{i}) ∣ i = 1, 2, \dots, d\} : = {n, α}

where

(n_{i}, α_{i})

is an input–output pair.

n_{i}

is the dimension of the coefficient matrix and

α_{i}

is the optimal splitting parameter in a matrix-splitting iterative method. Obviously, the training set is the sample information. According to the prior information,

{f (n_{i}), i = 1, 2, \dots, d}

follows the joint Gaussian distribution, i.e.,

f (n) : = (\begin{matrix} f (n_{1}) \\ ⋮ \\ f (n_{d}) \end{matrix}) \sim N ((\begin{matrix} μ (n_{1}) \\ ⋮ \\ μ (n_{d}) \end{matrix}), (\begin{matrix} k (n_{1}, n_{1}) & \dots & k (n_{1}, n_{d}) \\ ⋮ & ⋱ & ⋮ \\ k (n_{d}, n_{1}) & \dots & k (n_{d}, n_{d}) \end{matrix})) .

Or equally,

f (n) \sim N (μ (n), k (n, n)),

where

n = {(n_{1}, n_{2}, \dots, n_{d})}^{T}

. From Equation (2), the distribution of the corresponding observed

α

is

α \sim N (μ (n), k (n, n) + σ^{2} I_{d})

(3)

where

I_{d}

is a d-order identity matrix.

Finally, we can update the prior information by using the sample information. That is to say, to predict new dimensional vector

n_{*} = {(n_{1}, n_{2}, \dots, n_{m})}^{T}

, we can obtain the distribution of the optimal splitting parameters corresponding to

n_{*}

, i.e., the conditional distribution

f_{*} | n, α, n_{*}

, where

f_{*} = {(f (n_{1}), f (n_{2}), \dots, f (n_{m}))}^{T}

. According to the prior information, the sample

α

and predicted value of

n_{*}

follow the joint Gaussian distribution. From Equation (3), we have

(\begin{matrix} α \\ f_{*} \end{matrix}) \sim N ((\begin{matrix} μ (n) \\ μ (n_{*}) \end{matrix}), (\begin{matrix} k (n, n) + σ^{2} I_{d} & k (n, n_{*}) \\ k (n_{*}, n) & k (n_{*}, n_{*}) \end{matrix})) .

(4)

To obtain the conditional distribution

f_{*} | n, α, n_{*}

from Equation (4), we have the following theorem [49].

Theorem 1.

Let

x

and

y

be jointly Gaussian random vectors, i.e.,

(\begin{matrix} x \\ y \end{matrix}) \sim N ((\begin{matrix} μ_{x} \\ μ_{y} \end{matrix}), (\begin{matrix} A & C \\ C^{T} & B \end{matrix})),

then the marginal distribution of

x

and the conditional distribution of

x

given

y

are

\begin{matrix} x \sim N (μ_{x}, A), \\ x ∣ y \sim N (μ_{x} + C B^{- 1} (y - μ_{y}), A - C B^{- 1} C^{T}) . \end{matrix}

From Theorem 1, we have

f_{*} ∣ n, α, n_{*} \sim N (μ_{*}, σ_{*}^{2}),

where

\begin{matrix} μ_{*} = k (n_{*}, n) {[k (n, n) + σ^{2} I_{d}]}^{- 1} (α - μ (n)) + μ (n_{*}), \\ σ_{*}^{2} = k (n_{*}, n_{*}) - k (n_{*}, n) {[k (n, n) + σ^{2} I_{d}]}^{- 1} k (n, n_{*}) . \end{matrix}

For the predicted value

n_{*}

, one can use the mean value of the above Gaussian distribution as its estimated value, i.e.,

f_{*} = μ_{*}

. Now, we have successfully obtained the function

f (\cdot)

, and denoted the independent variable of

f (\cdot)

by

n_{*}

; then,

f (n_{*}) : = μ_{*} (n_{*})

.
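The prediction formulas for μ_{*} and σ_{*}^{2} above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: it assumes the zero mean function and the exponential kernel chosen in Section 2.3, and the training pairs, the hyperparameter values and the names `exp_kernel` and `gpr_predict` are hypothetical.

```python
import numpy as np

def exp_kernel(x, y, length=1.0, sigma_f=1.0):
    """Exponential kernel k(x, y) = sigma_f^2 * exp(-|x - y| / (2 * length^2))."""
    x = np.asarray(x, dtype=float).reshape(-1, 1)
    y = np.asarray(y, dtype=float).reshape(1, -1)
    return sigma_f**2 * np.exp(-np.abs(x - y) / (2.0 * length**2))

def gpr_predict(n_train, alpha_train, n_new, kernel, sigma=1e-4):
    """Posterior mean and covariance of f(n_new) for a zero-mean GP prior."""
    alpha_train = np.asarray(alpha_train, dtype=float)
    K = kernel(n_train, n_train) + sigma**2 * np.eye(len(n_train))
    K_star = kernel(n_new, n_train)                       # k(n_*, n)
    mean = K_star @ np.linalg.solve(K, alpha_train)       # mu_*
    cov = kernel(n_new, n_new) - K_star @ np.linalg.solve(K, K_star.T)
    return mean, cov

# Hypothetical training set: (dimension, relatively optimal parameter) pairs.
n_train = np.array([8.0, 16.0, 24.0, 32.0])
alpha_train = np.array([0.90, 0.62, 0.48, 0.40])

# Hypothetical hyperparameters; Section 2.3 selects them by maximum likelihood.
kernel = lambda a, b: exp_kernel(a, b, length=4.0, sigma_f=1.0)

mean, cov = gpr_predict(n_train, alpha_train, np.array([16.0, 20.0]), kernel)
print("predicted parameters:", mean)
print("predictive variances:", np.diag(cov))
```

Because the noise level σ is small, the prediction at a training dimension (here 16) nearly reproduces the training value, while the prediction at an unseen dimension (here 20) comes with a larger predictive variance.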

2.3. Model Selection

In this section, we determine

μ (\cdot)

and

k (\cdot, \cdot)

. In this work, we let

μ (\cdot) = 0

; other choices are discussed in [49]. For the covariance function, we use the exponential kernel function

k (x, y) = σ_{f}^{2} exp (\frac{- ∥ x - y ∥}{2 ι^{2}}),

where

θ = {ι, σ_{f}}

is the hyperparameter. In this work, maximum likelihood estimation is used to select the values of the hyperparameter

θ

. Specifically, when we have a training set

D = \{(n_{i}, α_{i}) ∣ i = 1, 2, \dots, d\} : = {n, α}

, the log-likelihood function L of

θ

can be derived as

\begin{matrix} L : = log p (α ∣ n, θ) = & - \frac{1}{2} α^{T} {[k_{θ} (n, n) + σ^{2} I_{d}]}^{- 1} α \\ & - \frac{1}{2} log det (k_{θ} (n, n) + σ^{2} I_{d}) . \end{matrix}

The optimal hyperparameter

θ

is

θ^{*} = arg max_{θ} L

.
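The selection of θ can be illustrated as follows. The sketch evaluates the function L above (as written, without any additive constant, which does not change the maximizer) and maximizes it over a small grid. The grid search is a stand-in chosen here for simplicity; the text does not specify which optimizer is used. The training data, the grids and the function names are hypothetical.

```python
import numpy as np

def log_marginal_likelihood(n, alpha, length, sigma_f, sigma=1e-4):
    """L(theta) for the exponential kernel with theta = (length, sigma_f)."""
    n = np.asarray(n, dtype=float)
    alpha = np.asarray(alpha, dtype=float)
    dist = np.abs(n[:, None] - n[None, :])
    K = sigma_f**2 * np.exp(-dist / (2.0 * length**2)) + sigma**2 * np.eye(len(n))
    _, logdet = np.linalg.slogdet(K)          # numerically stable log det
    return -0.5 * alpha @ np.linalg.solve(K, alpha) - 0.5 * logdet

def select_hyperparameters(n, alpha, lengths, sigma_fs):
    """Grid search for the maximizer of L (an assumed, simple optimizer)."""
    best = max((log_marginal_likelihood(n, alpha, l, s), l, s)
               for l in lengths for s in sigma_fs)
    return best[1], best[2], best[0]

# Hypothetical training set and candidate hyperparameter values.
n_train = [8, 16, 24, 32]
alpha_train = [0.90, 0.62, 0.48, 0.40]
lengths = [1.0, 2.0, 4.0, 8.0]
sigma_fs = [0.25, 0.5, 1.0, 2.0]

best_length, best_sigma_f, best_L = select_hyperparameters(
    n_train, alpha_train, lengths, sigma_fs)
print("selected length:", best_length, "selected sigma_f:", best_sigma_f)
```

In practice a gradient-based optimizer would usually replace the grid, but the grid keeps the example short and makes the role of L easy to inspect.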

In practice, to avoid a large amount of calculation, we generally produce the training set from a set of small-scale systems.

3. Matrix-Splitting Iterative Methods

In this section, we recall some matrix-splitting iteration methods, including the HSS method, the NHSS method, the LHSS method, the SHSS method, the SHSS-SS method, the MHSS method and the MSNS method. We mainly focus on their iterative schemes, convergence and theoretical methods involving estimating optimal splitting parameters.

3.1. Matrix-Splitting Methods for Non-Hermitian Positive Definite Linear Systems

Consider again the linear system in Equation (1), i.e.,

A x = b,

(5)

where

x, b \in C^{n}

,

A \in C^{n \times n}

is non-singular. Let

M, N \in C^{n \times n}

be splitting matrices such that

A = M + N .

The HSS, NHSS, LHSS, SHSS and SHSS-SS methods all split A into its Hermitian and skew-Hermitian parts, i.e.,

M = \frac{A + A^{*}}{2}, N = \frac{A - A^{*}}{2} .

(6)

3.1.1. HSS Iteration Method

The scheme of HSS iteration [9] is as follows.

Definition 2.

Given an initial guess

x^{(0)}

, for

k = 0, 1, 2, \dots,

until

x^{(k)}

converges, compute

\begin{matrix} (α I + M) x^{(k + \frac{1}{2})} = (α I - N) x^{(k)} + b, \\ (α I + N) x^{(k + 1)} = (α I - M) x^{(k + \frac{1}{2})} + b, \end{matrix}

where α is a given positive constant.
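The two half-steps of Definition 2 can be sketched with dense direct solves. This is only an illustration under stated assumptions: the function name `hss_solve`, the small test matrix and the value α = 2 are hypothetical, and the stopping rule is the relative residual criterion used later in Section 4.

```python
import numpy as np

def hss_solve(A, b, alpha, tol=1e-6, max_iter=1000):
    """HSS iteration with direct (dense) solves of both half-steps."""
    n = A.shape[0]
    I = np.eye(n)
    M = (A + A.conj().T) / 2          # Hermitian part, Equation (6)
    N = (A - A.conj().T) / 2          # skew-Hermitian part, Equation (6)
    x = np.zeros(n, dtype=complex)    # zero initial guess
    r0 = np.linalg.norm(b - A @ x)
    for k in range(1, max_iter + 1):
        x_half = np.linalg.solve(alpha * I + M, (alpha * I - N) @ x + b)
        x = np.linalg.solve(alpha * I + N, (alpha * I - M) @ x_half + b)
        if np.linalg.norm(b - A @ x) <= tol * r0:
            return x, k
    return x, max_iter

# Hypothetical test problem: a small non-Hermitian positive definite matrix.
n = 10
A = 4.0 * np.eye(n) - 1.5 * np.eye(n, k=-1) - 0.5 * np.eye(n, k=1)
b = np.ones(n)

x, iters = hss_solve(A, b, alpha=2.0)
print("iterations:", iters)
print("relative residual:", np.linalg.norm(b - A @ x) / np.linalg.norm(b))
```

For large sparse systems one would factorize or precondition the two shifted matrices instead of calling a dense solver at every step; the dense version is used here only to keep the scheme visible.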

For the convergence property of the HSS iteration, we have the following theorem [9]:

Theorem 2.

Let A in Equation (5) be a positive definite matrix, the matrices

M, N

be defined in the same way as Equation (6) and let α be a positive constant. Then, the iteration matrix

M (α)

of the HSS iteration is given by

M (α) = {(α I + N)}^{- 1} (α I - M) {(α I + M)}^{- 1} (α I - N),

and its spectral radius

ρ (M (α))

is bounded by

σ (α) \equiv max_{λ_{i} \in λ (M)} |\frac{α - λ_{i}}{α + λ_{i}}|,

(7)

where

λ (M)

is the spectral set of the matrix M. Therefore, it holds that

ρ (M (α)) \leq σ (α) < 1,

i.e., the HSS iteration converges to the unique solution of the Equation (5).

Equation (7) provides a theoretical method to estimate the optimal splitting parameter, that is, by minimizing the upper bound

σ (α)

of the spectral radius of the iteration matrix

M (α)

of the HSS iteration to obtain the quasi-optimal splitting parameter

α^{*}

, and it is shown by the following theorem [9].

Theorem 3.

The conditions are the same as Theorem 2: let

λ_{min}

and

λ_{max}

be the minimum and the maximum eigenvalues of the matrix M, respectively. Then

α^{*} \equiv arg min_{α} \{max_{λ_{min} \leq λ \leq λ_{max}} |\frac{α - λ}{α + λ}|\} = \sqrt{λ_{min} λ_{max}} .
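A small numerical check can make Theorem 3 concrete. The sketch below evaluates the bound σ (α) of Equation (7) on a grid and compares the grid minimizer with the closed-form value. The eigenvalues are hypothetical numbers chosen so that the arithmetic is easy to follow: with λ_{min} = 0.5 and λ_{max} = 8, the formula gives α^{*} = 2 and a bound of 0.6.

```python
import numpy as np

def hss_bound(alpha, eigs):
    """Upper bound sigma(alpha) = max_i |alpha - lambda_i| / (alpha + lambda_i)."""
    eigs = np.asarray(eigs, dtype=float)
    return np.max(np.abs((alpha - eigs) / (alpha + eigs)))

# Hypothetical eigenvalues of the Hermitian part M.
eigs = np.array([0.5, 1.0, 2.0, 8.0])

# Closed-form quasi-optimal parameter from Theorem 3.
alpha_star = np.sqrt(eigs.min() * eigs.max())

# Brute-force minimization of the bound on a grid, for comparison.
grid = np.arange(1, 1001) * 0.01
bounds = np.array([hss_bound(a, eigs) for a in grid])
alpha_grid = grid[np.argmin(bounds)]

print("alpha* from the formula:", alpha_star)
print("alpha from the grid    :", alpha_grid)
print("bound at alpha*        :", hss_bound(alpha_star, eigs))
```

At α^{*} the two extreme eigenvalues give the same value of the ratio, which is why the maximum over the spectrum is minimized there.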

The HSS iteration needs to solve two linear systems with coefficient matrices

α I + M

and

α I + N

exactly at each step, which is costly and often impractical. One way to overcome this disadvantage is to solve the two subproblems iteratively, which results in the inexact HSS (IHSS) iteration method.

Definition 3.

Given an initial guess

x^{(0)}

, for

k = 0, 1, 2, \dots,

until

x^{(k)}

converges,

1. Approximately solve $(α I + M) {\bar{z}}^{(k)} = {\bar{r}}^{(k)}$ (with ${\bar{r}}^{(k)} = b - A {\bar{x}}^{(k)}$) by employing an inner iteration (e.g., the CG method), such that the residual ${\bar{p}}^{(k)} = {\bar{r}}^{(k)} - (α I + M) {\bar{z}}^{(k)}$ satisfies

$∥{\bar{p}}^{(k)}∥ \leq ε_{k} ∥{\bar{r}}^{(k)}∥,$

and then compute ${\bar{x}}^{(k + \frac{1}{2})} = {\bar{x}}^{(k)} + {\bar{z}}^{(k)}$ ;
2. Approximately solve $(α I + N) {\bar{z}}^{(k + \frac{1}{2})} = {\bar{r}}^{(k + \frac{1}{2})}$ (with ${\bar{r}}^{(k + \frac{1}{2})} = b - A {\bar{x}}^{(k + \frac{1}{2})}$) by employing an inner iteration (e.g., some Krylov subspace method), such that the residual ${\bar{q}}^{(k + \frac{1}{2})} = {\bar{r}}^{(k + \frac{1}{2})} - (α I + N) {\bar{z}}^{(k + \frac{1}{2})}$ satisfies

$∥{\bar{q}}^{(k + \frac{1}{2})}∥ \leq η_{k} ∥{\bar{r}}^{(k + \frac{1}{2})}∥,$

and then compute ${\bar{x}}^{(k + 1)} = {\bar{x}}^{(k + \frac{1}{2})} + {\bar{z}}^{(k + \frac{1}{2})}$ , where α is a given positive constant.

For the convergence property and the choice of the tolerances

ε_{k}

and

η_{k}

, we refer to [9].

3.1.2. NHSS Iteration Method

The scheme of NHSS [15] iteration is as follows.

Definition 4.

Given an initial guess

x^{(0)}

, for

k = 0, 1, 2, \dots,

until

x^{(k)}

converges, compute

\begin{matrix} M x^{(k + \frac{1}{2})} = - N x^{(k)} + b, \\ (α I + M) x^{(k + 1)} = (α I - N) x^{(k + \frac{1}{2})} + b, \end{matrix}

where α is a given positive constant.

For the convergence property of the NHSS iteration, we have the following theorem [15]:

Theorem 4.

Let A in Equation (5) be a positive definite and normal matrix; the matrices

M, N

be defined in the same way as Equation (6) and let α be a positive constant. Then, the iteration matrix

M (α)

of the NHSS iteration is

M (α) = {(α I + M)}^{- 1} (α I - N) M^{- 1} (- N) .

The spectral radius

ρ (M (α))

is bounded by

σ (α) \equiv \frac{σ_{max} \sqrt{α^{2} + σ_{max}^{2}}}{λ_{min} (α + λ_{min})},

(8)

where

σ_{max}

is the maximum singular value of the matrix N and

λ_{min}

is the minimum eigenvalue of the matrix M. Moreover, if

σ_{max} \leq λ_{min}

, then

ρ (M (α)) \leq σ (α) < 1,

i.e., the NHSS iteration converges to the unique solution of Equation (5).

Equation (8) provides a theoretical method to estimate the optimal splitting parameter, that is, by minimizing the upper bound

σ (α)

of the spectral radius of the iteration matrix

M (α)

of the NHSS iteration to obtain the quasi-optimal splitting parameter

α^{*}

, and it is given by Theorem 5 [15].

Theorem 5.

The conditions are the same as Theorem 4; then, the quasi-optimal splitting parameter of the NHSS iteration is

α^{*} = \frac{σ_{max}^{2}}{λ_{min}} .
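The NHSS scheme of Definition 4 together with the quasi-optimal parameter of Theorem 5 can be sketched as follows. The test matrix is a hypothetical example constructed to be normal and positive definite with σ_{max} \leq λ_{min}, so that the assumptions of Theorem 4 hold; the function names are illustrative only.

```python
import numpy as np

def nhss_solve(A, b, alpha, tol=1e-6, max_iter=500):
    """NHSS iteration with dense direct solves of both half-steps."""
    n = A.shape[0]
    I = np.eye(n)
    M = (A + A.conj().T) / 2
    N = (A - A.conj().T) / 2
    x = np.zeros(n, dtype=complex)
    r0 = np.linalg.norm(b)
    for k in range(1, max_iter + 1):
        x_half = np.linalg.solve(M, -N @ x + b)
        x = np.linalg.solve(alpha * I + M, (alpha * I - N) @ x_half + b)
        if np.linalg.norm(b - A @ x) <= tol * r0:
            return x, k
    return x, max_iter

def nhss_quasi_optimal_alpha(A):
    """alpha* = sigma_max(N)^2 / lambda_min(M), as in Theorem 5."""
    M = (A + A.conj().T) / 2
    N = (A - A.conj().T) / 2
    lam_min = np.linalg.eigvalsh(M).min()
    sig_max = np.linalg.norm(N, 2)        # largest singular value of N
    return sig_max**2 / lam_min

# Hypothetical normal test matrix: S is real skew-symmetric, and the
# symmetric part 3I - 0.5*S@S commutes with S, so A is normal.
n = 10
S = 0.5 * (np.eye(n, k=1) - np.eye(n, k=-1))
A = 3.0 * np.eye(n) - 0.5 * (S @ S) + S
b = np.ones(n)

alpha_star = nhss_quasi_optimal_alpha(A)
x, iters = nhss_solve(A, b, alpha_star)
print("alpha*:", alpha_star, "iterations:", iters)
```

Computing α^{*} this way requires an extreme eigenvalue and an extreme singular value, which is the cost the data-driven prediction is meant to avoid for large matrices.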

3.1.3. LHSS Iteration Method

The scheme of LHSS iteration [12] is as follows.

Definition 5.

Given an initial guess

x^{(0)}

, for

k = 0, 1, 2, \dots,

until

x^{(k)}

converges, compute

\begin{matrix} M x^{(k + \frac{1}{2})} = - N x^{(k)} + b, \\ (α I + N) x^{(k + 1)} = (α I - M) x^{(k + \frac{1}{2})} + b, \end{matrix}

where α is a given positive constant.

For the convergence property of the LHSS iteration, we have the following theorem [12]:

Theorem 6.

Let A in Equation (5) be non-singular and α be a positive constant. Then, the LHSS iteration converges to the unique solution of Equation (5). Moreover, the spectral radius of the iterative matrix of the LHSS iteration satisfies

\begin{matrix} ρ (M (α)) < \frac{σ_{max}}{\sqrt{α^{2} + σ_{max}^{2}}} max_{λ_{i} \in λ (M)} |\frac{α - λ_{i}}{λ_{i}}|, \end{matrix}

(9)

where

M (α) = {(α I + N)}^{- 1} (α I - M) M^{- 1} (- N) .

By minimizing the upper bound of the spectral radius of the iteration matrix of the LHSS iteration in Equation (9), we can obtain the quasi-optimal splitting parameter

α^{*}

using Theorem 7 [12].

Theorem 7.

The conditions are the same as Theorem 6: let

λ_{min}

and

λ_{max}

be the minimum and the maximum eigenvalues of the matrix M, respectively. Let

σ_{max}

be the maximum singular value of the matrix N. Then

α^{*} = \frac{2 λ_{max} λ_{min}}{λ_{max} + λ_{min}} .
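The LHSS scheme and the bound in Equation (9) can be checked numerically on a small example. In the sketch below, the iteration matrix is obtained by eliminating x^{(k + \frac{1}{2})} from the two half-steps of Definition 5; the test matrix and all function names are hypothetical.

```python
import numpy as np

def lhss_solve(A, b, alpha, tol=1e-6, max_iter=500):
    """LHSS iteration with dense direct solves of both half-steps."""
    n = A.shape[0]
    I = np.eye(n)
    M = (A + A.conj().T) / 2
    N = (A - A.conj().T) / 2
    x = np.zeros(n, dtype=complex)
    r0 = np.linalg.norm(b)
    for k in range(1, max_iter + 1):
        x_half = np.linalg.solve(M, -N @ x + b)
        x = np.linalg.solve(alpha * I + N, (alpha * I - M) @ x_half + b)
        if np.linalg.norm(b - A @ x) <= tol * r0:
            return x, k
    return x, max_iter

# Hypothetical test problem: non-Hermitian positive definite tridiagonal matrix.
n = 10
A = 4.0 * np.eye(n) - 1.5 * np.eye(n, k=-1) - 0.5 * np.eye(n, k=1)
b = np.ones(n)
I = np.eye(n)
M = (A + A.T) / 2
N = (A - A.T) / 2

lam = np.linalg.eigvalsh(M)
sig_max = np.linalg.norm(N, 2)
alpha_star = 2 * lam.max() * lam.min() / (lam.max() + lam.min())   # Theorem 7

# Iteration matrix from eliminating the half-step, and the bound of Equation (9).
T_iter = np.linalg.solve(alpha_star * I + N,
                         (alpha_star * I - M) @ np.linalg.solve(M, -N))
rho = np.max(np.abs(np.linalg.eigvals(T_iter)))
bound = (sig_max / np.sqrt(alpha_star**2 + sig_max**2)
         * np.max(np.abs((alpha_star - lam) / lam)))

x, iters = lhss_solve(A, b, alpha_star)
print("spectral radius:", rho, "bound:", bound, "iterations:", iters)
```

For this example the computed spectral radius lies below the bound, and the bound itself is below one, so the iteration converges.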

To improve the efficiency of the LHSS iteration method, one can use the following inexact LHSS (ILHSS) iteration method.

Definition 6.

Given an initial guess

x^{(0)}

, for

k = 0, 1, 2, \dots,

until

x^{(k)}

converges,

1. Approximately solve $M {\bar{z}}^{(k)} = {\bar{r}}^{(k)}$ (with ${\bar{r}}^{(k)} = b - A {\bar{x}}^{(k)}$) by employing an inner iteration (e.g., the CG method), such that the residual ${\bar{p}}^{(k)} = {\bar{r}}^{(k)} - M {\bar{z}}^{(k)}$ satisfies

$∥{\bar{p}}^{(k)}∥ \leq ε_{k} ∥{\bar{r}}^{(k)}∥,$

and then compute ${\bar{x}}^{(k + \frac{1}{2})} = {\bar{x}}^{(k)} + {\bar{z}}^{(k)}$ ;
2. Approximately solve $(α I + N) {\bar{z}}^{(k + \frac{1}{2})} = {\bar{r}}^{(k + \frac{1}{2})}$ (with ${\bar{r}}^{(k + \frac{1}{2})} = b - A {\bar{x}}^{(k + \frac{1}{2})}$) by employing an inner iteration (e.g., a Krylov subspace method), such that the residual ${\bar{q}}^{(k + \frac{1}{2})} = {\bar{r}}^{(k + \frac{1}{2})} - (α I + N) {\bar{z}}^{(k + \frac{1}{2})}$ satisfies

$∥{\bar{q}}^{(k + \frac{1}{2})}∥ \leq η_{k} ∥{\bar{r}}^{(k + \frac{1}{2})}∥,$

and then compute ${\bar{x}}^{(k + 1)} = {\bar{x}}^{(k + \frac{1}{2})} + {\bar{z}}^{(k + \frac{1}{2})}$ , where α is a given positive constant.

For the convergence property and the choice of the tolerances

ε_{k}

and

η_{k}

, we refer to [12].

3.1.4. SHSS Iteration Method

The scheme of SHSS iteration [17] is as follows.

Definition 7.

Given an initial guess

x^{(0)}

, for

k = 0, 1, 2, \dots,

until

x^{(k)}

converges, compute

\begin{matrix} (α I + M) x^{(k + 1)} = (α I - N) x^{(k)} + b . \end{matrix}

where α is a given positive constant.

For the convergence property of the SHSS iteration, we have the following theorem [17]:

Theorem 8.

Let A in Equation (5) be positive definite. Let

λ_{min}

and

λ_{max}

be the minimum and the maximum eigenvalues of the matrix M, respectively. Let

σ_{max}

be the maximum singular value of the matrix N. The spectral radius of the iteration matrix of the SHSS iteration method is bounded by

σ (α) = \frac{\sqrt{α^{2} + σ_{max}^{2}}}{α + λ_{min}} .

Moreover,

(i) If $λ_{min} \geq σ_{max}$ , then $σ (α) < 1$ for any $α > 0$ , which means that the SHSS iteration method is unconditionally convergent;
(ii) If $λ_{min} < σ_{max}$ , then $σ (α) < 1$ (which means the SHSS iteration method is convergent) if and only if

$α > max \{0, \frac{σ_{max}^{2} - λ_{min}^{2}}{2 λ_{min}}\} .$

The quasi-optimal splitting parameter

α^{*}

of the SHSS iteration method is given by Theorem 9 [17].

Theorem 9.

The conditions are the same as Theorem 8, then

α^{*} = \frac{σ_{max}^{2}}{λ_{min}} .
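The single-step scheme of Definition 7 and the role of α^{*} in Theorem 9 can be sketched as follows. The code runs the SHSS iteration on a hypothetical test matrix and checks on a grid that α^{*} = σ_{max}^{2} / λ_{min} gives the smallest value of the bound σ (α) from Theorem 8. All names are illustrative.

```python
import numpy as np

def shss_solve(A, b, alpha, tol=1e-6, max_iter=500):
    """SHSS iteration: one shifted Hermitian solve per step."""
    n = A.shape[0]
    I = np.eye(n)
    M = (A + A.conj().T) / 2
    N = (A - A.conj().T) / 2
    x = np.zeros(n, dtype=complex)
    r0 = np.linalg.norm(b)
    for k in range(1, max_iter + 1):
        x = np.linalg.solve(alpha * I + M, (alpha * I - N) @ x + b)
        if np.linalg.norm(b - A @ x) <= tol * r0:
            return x, k
    return x, max_iter

def shss_bound(alpha, lam_min, sig_max):
    """sigma(alpha) = sqrt(alpha^2 + sigma_max^2) / (alpha + lambda_min)."""
    return np.sqrt(alpha**2 + sig_max**2) / (alpha + lam_min)

# Hypothetical test problem with lambda_min > sigma_max (case (i) of Theorem 8).
n = 10
A = 4.0 * np.eye(n) - 1.5 * np.eye(n, k=-1) - 0.5 * np.eye(n, k=1)
b = np.ones(n)
lam_min = np.linalg.eigvalsh((A + A.T) / 2).min()
sig_max = np.linalg.norm((A - A.T) / 2, 2)

alpha_star = sig_max**2 / lam_min
grid = np.arange(1, 301) * 0.01
grid_bounds = shss_bound(grid, lam_min, sig_max)

x, iters = shss_solve(A, b, alpha_star)
print("alpha*:", alpha_star, "bound at alpha*:",
      shss_bound(alpha_star, lam_min, sig_max), "iterations:", iters)
```

Setting the derivative of σ (α) to zero gives α λ_{min} = σ_{max}^{2}, which is the value returned by the formula and confirmed by the grid comparison.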

3.1.5. SHSS-SS Iteration Method

The scheme of SHSS-SS iteration [18] is as follows.

Definition 8.

Given an initial guess

x^{(0)}

, for

k = 0, 1, 2, \dots,

until

x^{(k)}

converges, compute

\begin{matrix} (α I + M) x^{(k + \frac{1}{2})} = (α I - N) x^{(k)} + b, \\ (α I + A) x^{(k + 1)} = (α I - A) x^{(k + \frac{1}{2})} + 2 b, \end{matrix}

where α is a given positive constant.

For the convergence property of the SHSS-SS iteration, we have the following theorem [18]:

Theorem 10.

Let A in Equation (5) be non-Hermitian and positive definite. Let

λ_{min}

and

λ_{max}

be the minimum and the maximum eigenvalues of the matrix M, respectively. Let

σ_{max}

be the maximum singular value of the matrix N. If α satisfies

α > max \{0, \frac{σ_{max}^{2} - λ_{min}^{2}}{2 λ_{min}}\},

then the SHSS-SS iteration converges to the unique solution of Equation (5).

The quasi-optimal splitting parameter

α^{*}

of the SHSS-SS iteration method is as follows [18].

Theorem 11.

The conditions are the same as Theorem 10, then

α^{*} = \frac{σ_{max}^{2}}{λ_{min}} .
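A minimal sketch of the SHSS-SS scheme of Definition 8, using the quasi-optimal parameter of Theorem 11, is given below. The test matrix and the function name `shss_ss_solve` are hypothetical, and dense direct solves are used only for brevity.

```python
import numpy as np

def shss_ss_solve(A, b, alpha, tol=1e-6, max_iter=500):
    """SHSS-SS iteration: an SHSS half-step followed by a shift-splitting half-step."""
    n = A.shape[0]
    I = np.eye(n)
    M = (A + A.conj().T) / 2
    N = (A - A.conj().T) / 2
    x = np.zeros(n, dtype=complex)
    r0 = np.linalg.norm(b)
    for k in range(1, max_iter + 1):
        x_half = np.linalg.solve(alpha * I + M, (alpha * I - N) @ x + b)
        x = np.linalg.solve(alpha * I + A, (alpha * I - A) @ x_half + 2 * b)
        if np.linalg.norm(b - A @ x) <= tol * r0:
            return x, k
    return x, max_iter

# Hypothetical non-Hermitian positive definite test matrix.
n = 10
A = 4.0 * np.eye(n) - 1.5 * np.eye(n, k=-1) - 0.5 * np.eye(n, k=1)
b = np.ones(n)
lam_min = np.linalg.eigvalsh((A + A.T) / 2).min()
sig_max = np.linalg.norm((A - A.T) / 2, 2)

# Convergence condition of Theorem 10 and quasi-optimal parameter of Theorem 11.
alpha_lower = max(0.0, (sig_max**2 - lam_min**2) / (2 * lam_min))
alpha_star = sig_max**2 / lam_min

x, iters = shss_ss_solve(A, b, alpha_star)
print("alpha*:", alpha_star, "iterations:", iters)
```

Both half-steps leave the exact solution unchanged, which can be verified by substituting A x = b into the two equations of the scheme.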

3.2. Matrix-Splitting Methods for Complex Symmetric Linear Systems

Considering the linear equation of the form

A x \equiv (W + i T) x = b,

(10)

where

W, T \in R^{n \times n}

are symmetric positive definite matrix and symmetric positive semi-definite matrix, respectively,

b \in R^{n}

and

i = \sqrt{- 1}

. Here, we let

T \neq 0

, so A in Equation (10) is non-Hermitian.

3.2.1. HSS Iteration Method and MHSS Iteration Method

Since the matrix W is positive definite, the matrix A in Equation (10) is non-Hermitian positive definite, and we can straightforwardly use the HSS method to solve Equation (10). The scheme of the HSS iteration is as follows.

Definition 9.

Given an initial guess

x^{(0)}

, for

k = 0, 1, 2, \dots,

until

x^{(k)}

converges, compute

\begin{matrix} (α I + W) x^{(k + \frac{1}{2})} = (α I - i T) x^{(k)} + b, \\ (α I + i T) x^{(k + 1)} = (α I - W) x^{(k + \frac{1}{2})} + b, \end{matrix}

(11)

where α is a given positive constant.

However, solving the linear sub-system whose coefficient matrix is the shifted skew-Hermitian matrix

α I + i T

is very difficult in some cases. To avoid this, Bai et al. [33] proposed the modified HSS (MHSS) method. The scheme of MHSS iteration is as follows.

Definition 10.

Given an initial guess

x^{(0)}

, for

k = 0, 1, 2, \dots,

until

x^{(k)}

converges, compute

\begin{matrix} (α I + W) x^{(k + \frac{1}{2})} = (α I - i T) x^{(k)} + b, \\ (α I + T) x^{(k + 1)} = (α I + i W) x^{(k + \frac{1}{2})} - i b, \end{matrix}

where α is a given positive constant.
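The MHSS scheme of Definition 10 only requires solves with the real symmetric positive definite matrices α I + W and α I + T. The sketch below illustrates this on a hypothetical pair of matrices W and T; the function name, the matrices and the choice α = \sqrt{λ_{min} λ_{max}} (computed from the eigenvalues of W) are used for illustration only.

```python
import numpy as np

def mhss_solve(W, T, b, alpha, tol=1e-6, max_iter=1000):
    """MHSS iteration for (W + iT) x = b with real symmetric W and T."""
    n = W.shape[0]
    I = np.eye(n)
    A = W + 1j * T
    x = np.zeros(n, dtype=complex)
    r0 = np.linalg.norm(b)
    for k in range(1, max_iter + 1):
        x_half = np.linalg.solve(alpha * I + W, (alpha * I - 1j * T) @ x + b)
        x = np.linalg.solve(alpha * I + T, (alpha * I + 1j * W) @ x_half - 1j * b)
        if np.linalg.norm(b - A @ x) <= tol * r0:
            return x, k
    return x, max_iter

# Hypothetical test problem: W symmetric positive definite,
# T symmetric positive semi-definite (here even positive definite).
n = 10
L = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
W = np.eye(n) + L
T = 0.5 * L
b = np.ones(n)

lam = np.linalg.eigvalsh(W)
alpha = np.sqrt(lam.min() * lam.max())

x, iters = mhss_solve(W, T, b, alpha)
print("alpha:", alpha, "iterations:", iters)
print("relative residual:",
      np.linalg.norm(b - (W + 1j * T) @ x) / np.linalg.norm(b))
```

Although the coefficient matrices of the two sub-systems are real, the right-hand sides are complex, so each half-step amounts to two real solves with the same matrix.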

For the convergence property of the MHSS iteration, we have the following theorem [33]:

Theorem 12.

Let W and T satisfy the conditions stated for Equation (10) and let α be a positive constant. Then, the iteration matrix

M (α)

of the MHSS iteration is

M (α) = {(α I + T)}^{- 1} (α I + i W) {(α I + W)}^{- 1} (α I - i T),

and its spectral radius

ρ (M (α))

is bounded by

σ (α) \equiv max_{λ_{j} \in λ (M)} \frac{\sqrt{α^{2} + λ_{j}^{2}}}{α + λ_{j}},

where

λ (M)

is the spectral set of the matrix M. Therefore, we have

ρ (M (α)) \leq σ (α) < 1,

i.e., the MHSS iteration converges to the unique solution of Equation (10).

The quasi-optimal splitting parameter

α^{*}

of the MHSS method is as follows [33].

Theorem 13.

The conditions are the same as Theorem 12, let

λ_{min}

and

λ_{max}

be the minimum and the maximum eigenvalues of the matrix W, respectively. Then

α^{*} = \sqrt{λ_{min} λ_{max}} .

Similar to the HSS iteration method and LHSS iteration method, the MHSS iteration method has its inexact version as well [33].

3.2.2. MSNS Iteration Method

The scheme of MSNS iteration [50] is as follows.

Definition 11.

Given an initial guess

x^{(0)}

, for

k = 0, 1, 2, \dots,

until

x^{(k)}

converges, compute

\begin{matrix} (α I + T) x^{(k + \frac{1}{2})} = (i α W + T^{2}) x^{(k)} + i T b, \\ (i α W I - T^{2}) x^{(k + 1)} = (α I - T) x^{(k + \frac{1}{2})} + i T b, \end{matrix}

where α is a given positive constant.

For the convergence property of the MSNS iteration, we have the following theorem [50]:

Theorem 14.

Let W be a real symmetric indefinite matrix, T be a real symmetric positive definite matrix and α be a positive constant. Then, the spectral radius

ρ (M (α))

of the iteration matrix

M (α)

of the MSNS iteration is bounded by

σ (α) \equiv max_{μ_{j} \in μ (T)} |\frac{α - μ_{j}}{α + μ_{j}}|,

where

μ (T)

is the spectral set of the matrix T. Moreover, it holds that

ρ (M (α)) \leq σ (α) < 1,

i.e., the MSNS iteration converges to the unique solution of Equation (10).

The quasi-optimal splitting parameter

α^{*}

of the MSNS method is as follows [50].

Theorem 15.

The conditions are the same as Theorem 14, let

μ_{min}

and

μ_{max}

be the minimum and the maximum eigenvalues of the matrix T, respectively. Then

α^{*} = \sqrt{μ_{min} μ_{max}} .

4. Numerical Results

In this section, we present extensive numerical examples to show the power of the GPR method compared with the traversal method and the theoretical methods. We take a three-dimensional convection-diffusion equation and the Padé approximation in the time integration of parabolic partial differential equations as examples.

In the following numerical experiments, all tests are started with a zero vector. All iterative methods are terminated if the relative residual error satisfies

{∥r^{(k)}∥}_{2} / {∥r^{(0)}∥}_{2} \leq 10^{- 6}

, where

r^{(k)} = b - A x^{(k)}

is the k-step residual. “IT” and “CPU” denote the required iterations and the CPU time (in seconds), respectively. “Traversal time” denotes the required CPU time (in seconds) to obtain the optimal splitting parameters by the traversal method. “Training time” denotes the required CPU time (in seconds) to produce the training set and train the GPR model. We use

\frac{| Traversal time - Training time |}{Traversal time}

to make a comparison of the calculation amount of the traversal method and the GPR method. Obviously, the larger this quantity is, the longer the traversal time of the traversal method will take compared to the training time of the GPR method.

For the traversal method, the optimal splitting parameter is the one that minimizes the number of iterations of the corresponding iteration method when solving the linear system. It is obtained by traversing the interval (0, 3] with a step size of 0.01.
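The traversal method amounts to a grid search over α. The sketch below shows the search loop under the stated interval and step size. In the experiments each evaluation is a full run of the iterative solver; here `toy_iteration_count` is a hypothetical stand-in that estimates the iteration count from a contraction factor of the form used in Theorem 14, with made-up extreme eigenvalues.

```python
import math

def traversal_search(iteration_count, upper=3.0, step=0.01):
    """Return (alpha, iterations) minimizing iteration_count on the grid
    step, 2*step, ..., upper, i.e. a discretization of (0, upper]."""
    n_steps = round(upper / step)
    best_alpha, best_iterations = None, None
    for k in range(1, n_steps + 1):
        alpha = k * step
        iterations = iteration_count(alpha)
        # Strict inequality keeps the smallest alpha among ties.
        if best_iterations is None or iterations < best_iterations:
            best_alpha, best_iterations = alpha, iterations
    return best_alpha, best_iterations

def toy_iteration_count(alpha, mu_min=0.1, mu_max=4.0, tol=1e-6):
    """Hypothetical stand-in for a solver run: iterations needed for the
    contraction factor max|alpha - mu| / (alpha + mu) to reach tol."""
    factor = max(abs(alpha - mu_min) / (alpha + mu_min),
                 abs(alpha - mu_max) / (alpha + mu_max))
    return math.ceil(math.log(tol) / math.log(factor))

alpha_trav, it_trav = traversal_search(toy_iteration_count)
print(alpha_trav, it_trav)
```

Because every grid point costs one solve, the search needs 300 solves per problem size, which is why the traversal time grows quickly with the dimension.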

For the GPR method, the training set is produced by applying the traversal method to a set of small-scale systems, whose dimensions are listed in the “Training Set” column of the tables.

For the IHSS method and the ILHSS method, we use the CG method to solve the linear systems with the coefficient matrix α I + M and the GMRES method to solve the linear systems with the coefficient matrix α I + N. The inner CG and GMRES iterations are terminated once their residuals satisfy

\frac{{∥p^{(j)}∥}_{2}}{{∥r^{(k)}∥}_{2}} \leq \max \{0.1 \times {0.8}^{k}, 1 \times 10^{- 7}\} and \frac{{∥q^{(j)}∥}_{2}}{{∥r^{(k)}∥}_{2}} \leq \max \{0.1 \times {0.8}^{k}, 1 \times 10^{- 6}\},

where p^{(j)} and q^{(j)} are the residuals of the jth inner CG and GMRES iterations, respectively, and r^{(k)} is the residual of the kth outer iteration.
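The inner tolerances tighten geometrically with the outer iteration index k until they reach a fixed floor. A minimal sketch of this rule follows; the function name is ours.

```python
def inner_tolerances(k):
    """Relative inner stopping tolerances at outer iteration k.

    Returns (cg_tol, gmres_tol), i.e. max{0.1 * 0.8^k, 1e-7} for the inner CG
    solves and max{0.1 * 0.8^k, 1e-6} for the inner GMRES solves.
    """
    decay = 0.1 * 0.8 ** k
    return max(decay, 1e-7), max(decay, 1e-6)

for k in (0, 10, 50, 200):
    print(k, inner_tolerances(k))
```

Early outer iterations therefore use loose inner solves, and the accuracy of the inner solves increases as the outer residual decreases.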

All computations are carried out using MATLAB 2018b on a personal computer with a 1.8 GHz Intel Core i5 CPU and 8 GB of memory.

Example 1.

Consider the following three-dimensional convection-diffusion equation

- (u_{x x} + u_{y y} + u_{z z}) + q (u_{x} + u_{y} + u_{z}) = f (x, y, z),

(12)

on the unit cube Ω : = [0, 1] \times [0, 1] \times [0, 1], with constant coefficient q and subject to Dirichlet-type boundary conditions. When the seven-point finite difference discretization with the equidistant step size h = \frac{1}{n + 1} (n is the number of degrees of freedom along each dimension) is applied to Equation (12) in all three directions, we obtain the linear system with the coefficient matrix

A = T_{x} \otimes I \otimes I + I \otimes T_{y} \otimes I + I \otimes I \otimes T_{z},

(13)

where T_{x}, T_{y} and T_{z} are tridiagonal matrices. If the first-order derivatives are approximated by the centered difference scheme, we have

T_{x} = tridiag (t_{2}, t_{1}, t_{3}), T_{y} = tridiag (t_{2}, 0, t_{3}), T_{z} = tridiag (t_{2}, 0, t_{3}),

with t_{1} = 6, t_{2} = - 1 - r, t_{3} = - 1 + r and r = \frac{q h}{2} (the mesh Reynolds number).

According to [9,51,52], for the centered difference scheme, the extreme eigenvalues of the matrix M and the largest singular value of the matrix N in Equation (6) are

\min_{1 \leq i \leq n^{3}} λ_{i} (M) = 6 (1 - \cos π h), \max_{1 \leq i \leq n^{3}} λ_{i} (M) = 6 (1 + \cos π h), \max_{1 \leq i \leq n^{3}} σ_{i} (N) = 6 r \cos π h .

Therefore, the theoretical optimal splitting parameters of the HSS method, the LHSS method, the NHSS method, the SHSS method and the SHSS-SS method can be easily calculated.
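As a worked instance for the HSS method, the sketch below evaluates the three closed-form quantities above and the parameter \sqrt{λ_{min} (M) λ_{max} (M)}. We assume here that this is the formula of Theorem 3, which is the standard HSS result of [9]; the theorem itself is not restated in this section, and the function name is ours.

```python
import math

def hss_theoretical_alpha(n, q=1.0):
    """Closed-form spectral data for Example 1 and the resulting HSS parameter.

    Assumes alpha = sqrt(lambda_min(M) * lambda_max(M)) (standard HSS choice).
    Returns (alpha, lambda_min, lambda_max, sigma_max).
    """
    h = 1.0 / (n + 1)
    r = q * h / 2.0                      # mesh Reynolds number
    c = math.cos(math.pi * h)
    lambda_min = 6.0 * (1.0 - c)
    lambda_max = 6.0 * (1.0 + c)
    sigma_max = 6.0 * r * c
    alpha = math.sqrt(lambda_min * lambda_max)   # equals 6 * sin(pi * h)
    return alpha, lambda_min, lambda_max, sigma_max

for n in (8, 12, 36):
    print(n, round(hss_theoretical_alpha(n)[0], 4))
```

Under this assumption the computed values agree with the α_{theo} column of Table 5, for example about 2.0521 for n = 8 and about 0.5088 for n = 36.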

Let q = 1 in Equation (12). The discretization of Equation (12) leads to a system of linear equations A x = b, where A \in R^{n^{3} \times n^{3}} is defined by Equation (13). We set the exact solution to x_{e} = {(1, 1, \dots, 1)}^{T} \in R^{n^{3}}, so that b = A x_{e}.
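To make the construction of the right-hand side concrete, the following sketch applies A from Equation (13) to a vector without forming the Kronecker products, using the seven-point stencil directly. The lexicographic ordering of the unknowns and the function names are assumptions of this illustration.

```python
def apply_A(x, n, q=1.0):
    """Compute y = A x for A = Tx (x) I (x) I + I (x) Ty (x) I + I (x) I (x) Tz,
    with Tx = tridiag(t2, t1, t3) and Ty = Tz = tridiag(t2, 0, t3)."""
    h = 1.0 / (n + 1)
    r = q * h / 2.0
    t1, t2, t3 = 6.0, -1.0 - r, -1.0 + r

    def idx(i, j, k):
        # Assumed lexicographic ordering of the grid points.
        return (i * n + j) * n + k

    y = [0.0] * (n ** 3)
    for i in range(n):
        for j in range(n):
            for k in range(n):
                s = t1 * x[idx(i, j, k)]
                # Sub-diagonal entries t2 couple to the lower neighbors.
                for a, b, c in ((i - 1, j, k), (i, j - 1, k), (i, j, k - 1)):
                    if min(a, b, c) >= 0:
                        s += t2 * x[idx(a, b, c)]
                # Super-diagonal entries t3 couple to the upper neighbors.
                for a, b, c in ((i + 1, j, k), (i, j + 1, k), (i, j, k + 1)):
                    if max(a, b, c) < n:
                        s += t3 * x[idx(a, b, c)]
                y[idx(i, j, k)] = s
    return y

n = 3
x_e = [1.0] * (n ** 3)          # exact solution of all ones
b = apply_A(x_e, n)             # right-hand side b = A x_e
print(b[0], b[13], b[26])
```

Since t_{1} + 3 t_{2} + 3 t_{3} = 0, the entries of b vanish at interior grid points and are nonzero only at points adjacent to the boundary.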

In this experiment, we compare the GPR method with the traversal method and with the theoretical method. First, we use the HSS method, the IHSS method, the LHSS method and the ILHSS method to solve the 3D convection-diffusion equation, with the splitting parameters selected by the traversal method and by the GPR method, respectively. The numerical results show that the GPR method predicts almost the same parameters as the traversal method, but with less computation. Then, we use the HSS method, the NHSS method, the LHSS method, the SHSS method and the SHSS-SS method to solve the 3D convection-diffusion equation, with the splitting parameters selected by the theoretical method and by the GPR method, respectively. The numerical results show that the GPR method yields better parameters than the theoretical method, which means that the GPR method can be applied to a wide range of matrix-splitting iterative methods and is highly universal.

The GPR method vs. the traversal method. We first use the HSS method, the IHSS method, the LHSS method and the ILHSS method to solve the 3D convection-diffusion equation and the splitting parameters are selected using the traversal method and the GPR method, respectively. Table 1, Table 2, Table 3 and Table 4 and Figure 1 show the numerical results.

From Table 1, Table 2, Table 3 and Table 4 and Figure 1, we see that the GPR method predicts almost the same parameters as the traversal method. It uses a training set obtained from a set of small-scale systems, and its training time is much shorter than the traversal time of the traversal method. Thus, the GPR method requires less computation than the traversal method.

Figure 2 gives a visual representation of the mapping f (n) that we want to fit.
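To illustrate what fitting such a mapping involves, the sketch below computes a Gaussian process posterior mean in plain Python. Several choices are assumptions of this illustration rather than the configuration used in the experiments: a zero mean function, a squared exponential covariance with fixed length scale, the logarithm of n as input, and synthetic training targets 6 \sin (π / (n + 1)) (the value of \sqrt{λ_{min} (M) λ_{max} (M)} for the eigenvalues quoted above) in place of parameters found by the traversal method.

```python
import math

def rbf_kernel(x1, x2, length_scale=1.0):
    """Squared exponential covariance (an assumed choice)."""
    return math.exp(-0.5 * ((x1 - x2) / length_scale) ** 2)

def solve_linear_system(matrix, rhs):
    """Gaussian elimination with partial pivoting for a small dense system."""
    size = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(size):
        pivot = max(range(col, size), key=lambda row: abs(aug[row][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for row in range(col + 1, size):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, size + 1):
                aug[row][k] -= factor * aug[col][k]
    solution = [0.0] * size
    for i in range(size - 1, -1, -1):
        tail = sum(aug[i][k] * solution[k] for k in range(i + 1, size))
        solution[i] = (aug[i][size] - tail) / aug[i][i]
    return solution

def gpr_posterior_mean(x_train, y_train, x_new, length_scale=1.0, jitter=1e-12):
    """Zero-mean GP posterior mean k_*^T (K + jitter * I)^{-1} y."""
    gram = [[rbf_kernel(a, b, length_scale) + (jitter if i == j else 0.0)
             for j, b in enumerate(x_train)]
            for i, a in enumerate(x_train)]
    weights = solve_linear_system(gram, y_train)
    return sum(w * rbf_kernel(x_new, a, length_scale)
               for w, a in zip(weights, x_train))

# Synthetic stand-in training data on small grids (not traversal results).
train_n = [2, 4, 6, 10]
x_train = [math.log(n) for n in train_n]
y_train = [6.0 * math.sin(math.pi / (n + 1)) for n in train_n]

alpha_pred = gpr_posterior_mean(x_train, y_train, math.log(8))
print(alpha_pred)
```

With a negligible noise term the posterior mean passes through the training points, so the quality of the prediction at a new n depends on the covariance function and its hyperparameters, which in practice would be selected from the data.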

The GPR method vs. the theoretical method. We use the HSS method, the NHSS method, the LHSS method, the SHSS method and the SHSS-SS method to solve the 3D convection-diffusion equation, and the splitting parameters are selected using the theoretical method (given in Theorems 3, 5, 7, 9 and 11) and the GPR method, respectively. Table 5, Table 6, Table 7, Table 8 and Table 9 and Figure 3 and Figure 4 show the numerical results.

From Table 5, Table 6, Table 7, Table 8 and Table 9 and Figure 3 and Figure 4, we see that the GPR method predicts better parameters than the theoretical method: the iteration counts obtained with α_{gpr} are never larger than those obtained with α_{theo}. Unlike the theoretical method, which is only available case by case, the GPR method applies to all five iterative methods, which means that the GPR method is highly universal.

Example 2.

Consider the following complex symmetric linear system:

[(K + \frac{3 - \sqrt{3}}{τ} I) + i (K + \frac{3 + \sqrt{3}}{τ} I)] x = b,

(14)

where K is the five-point centered difference matrix approximating the negative Laplacian operator with homogeneous Dirichlet boundary conditions on a uniform mesh in the unit square [0, 1] \times [0, 1] with mesh size h = \frac{1}{m + 1}. Here K \in R^{m^{2} \times m^{2}} and K = I \otimes V_{m} + V_{m} \otimes I, with V_{m} = h^{- 2} tridiag (- 1, 2, - 1) \in R^{m \times m}. In addition, the jth entry b_{j} of the right-hand side vector b is given by

b_{j} = \frac{(1 - i) j}{τ {(j + 1)}^{2}}, j = 1, 2, \dots, m^{2} .

Let τ = h and normalize the coefficient matrix and the right-hand side by multiplying both by h^{2}. Refer to [53] for more details.
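For the imaginary part T = K + \frac{3 + \sqrt{3}}{τ} I of the coefficient matrix in Equation (14), the extreme eigenvalues after normalization are available in closed form, because the eigenvalues of h^{2} K are 4 - 2 \cos (i π h) - 2 \cos (j π h). The sketch below uses them to evaluate the quasi-optimal MSNS parameter of Theorem 15. It is only an illustration: the closed form replaces the numerical eigenvalue computation used in the experiments, and the function name is ours.

```python
import math

def msns_quasi_optimal_alpha(m):
    """sqrt(mu_min * mu_max) for the normalized matrix h^2 * T with tau = h."""
    h = 1.0 / (m + 1)
    shift = (3.0 + math.sqrt(3.0)) * h   # h^2 * (3 + sqrt(3)) / tau with tau = h
    c = math.cos(math.pi * h)
    mu_min = 4.0 * (1.0 - c) + shift     # smallest eigenvalue of h^2 * T
    mu_max = 4.0 * (1.0 + c) + shift     # largest eigenvalue of h^2 * T
    return math.sqrt(mu_min * mu_max)

for m in (32, 64, 96):
    print(m, round(msns_quasi_optimal_alpha(m), 4))
```

For m = 32 and m = 64 this gives approximately 1.146 and 0.791, consistent with the α_{theo} values 1.1456 and 0.7906 reported for the MSNS method in Table 14.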

In this experiment, we compare the GPR method with the traversal method and with the theoretical method. First, we use the HSS method and the MHSS method to solve Equation (14), with the splitting parameters selected by the traversal method and by the GPR method, respectively. Then, we use the HSS method, the MHSS method and the MSNS method to solve Equation (14), with the splitting parameters selected by the theoretical method and by the GPR method, respectively. Since the extreme eigenvalues of the matrix M and the extreme singular value of the matrix N cannot be obtained explicitly, we use the MATLAB built-in functions “eigs” and “svds”, both with the options 'MaxIterations',500,'Tolerance',1e-5, to calculate them.

The GPR method vs. the traversal method. We first use the HSS method and the MHSS method to solve Equation (14) and the splitting parameters are selected using the traversal method and the GPR method, respectively. Table 10 and Table 11 and Figure 5 show the numerical results.

From Table 10 and Table 11 and Figure 5, we know that the GPR method can predict almost the same parameters as the traversal method does. It uses a training set obtained by using the traversal method for small-scale systems and its training time is much less than the traversal time of the traversal method. Thus, the GPR method requires less calculation than the traversal method.

The GPR method vs. the theoretical method. We use the HSS method, the MHSS method and the MSNS method to solve Equation (14), and the splitting parameters are selected using the theoretical method (given in Theorems 3, 13 and 15) and the GPR method, respectively. Table 12, Table 13 and Table 14 and Figure 6 show the numerical results.

From Table 12, Table 13 and Table 14 and Figure 6, we see that the GPR method predicts better parameters than the theoretical method. Unlike the theoretical method, which is only available case by case, the GPR method applies to all three iterative methods, which means that the GPR method is highly universal.

5. Conclusions

In this paper, we use the Bayesian inference-based Gaussian process regression (GPR) method to predict the relatively optimal parameters of some matrix-splitting iteration methods and provide extensive numerical experiments to compare the prediction performance of the GPR method with other methods. The GPR method learns a mapping between the dimension of the linear systems and the relatively optimal splitting parameters using a small training data set. Numerical results show that the GPR method requires less computation than the traversal method. It is also more universal than the theoretical methods and predicts better parameters.

Much work remains to be done on the proposed method. First, the GPR method could be applied to iteration methods with multiple parameters or to non-HSS-type iteration methods. Second, the predictive performance of the GPR method should be measured when the true optimal parameters are unknown. Third, other mean functions and covariance functions could be chosen to improve the predictive performance of the GPR method.

Author Contributions

K.J.—methodology, review and editing; J.S.—software, visualization, data curation; J.Z.—methodology, review and editing. All authors have read and agreed to the published version of the manuscript.

Funding

The work is supported in part by the National Natural Science Foundation of China (12171412, 11771370), Natural Science Foundation for Distinguished Young Scholars of Hunan Province (2021JJ10037), Hunan Youth Science and Technology Innovation Talents Project (2021RC3110), and the Key Project of Education Department of Hunan Province (19A500, 21A0116).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Young, D. Iterative methods for solving partial difference equations of elliptic type. Trans. Am. Math. Soc. 1954, 76, 92–111.
2. Young, D.M. Convergence properties of the symmetric and unsymmetric successive overrelaxation methods and related methods. Math. Comput. 1970, 24, 793–807.
3. Hadjidimos, A. Accelerated overrelaxation method. Math. Comput. 1978, 32, 149–157.
4. Hadjidimos, A.; Yeyios, A. Symmetric accelerated overrelaxation (SAOR) method. Math. Comput. Simul. 1982, 24, 72–76.
5. Darvishi, M.T.; Hessari, P. Symmetric SOR method for augmented systems. Appl. Math. Comput. 2006, 183, 409–415.
6. Darvishi, M.T.; Hessari, P. A modified symmetric successive overrelaxation method for augmented systems. Comput. Math. Appl. 2011, 61, 3128–3135.
7. Allahviranloo, T. Successive over relaxation iterative method for fuzzy system of linear equations. Appl. Math. Comput. 2005, 162, 189–196.
8. Darvishi, M.T.; Hessari, P. On convergence of the generalized AOR method for linear systems with diagonally dominant coefficient matrices. Appl. Math. Comput. 2006, 176, 128–133.
9. Bai, Z.Z.; Golub, G.H.; Ng, M.K. Hermitian and skew-Hermitian splitting methods for non-Hermitian positive definite linear systems. SIAM J. Matrix Anal. Appl. 2003, 24, 603–626.
10. Bai, Z.Z.; Golub, G.H.; Pan, J.Y. Preconditioned Hermitian and skew-Hermitian splitting methods for non-Hermitian positive semidefinite linear systems. Numer. Math. 2004, 98, 1–32.
11. Benzi, M. A generalization of the Hermitian and skew-Hermitian splitting iteration. SIAM J. Matrix Anal. Appl. 2009, 31, 360–374.
12. Li, L.; Huang, T.Z.; Liu, X.P. Modified Hermitian and skew-Hermitian splitting methods for non-Hermitian positive-definite linear systems. Numer. Linear Algebra Appl. 2007, 14, 217–235.
13. Yang, A.L.; An, J.; Wu, Y.J. A generalized preconditioned HSS method for non-Hermitian positive definite linear systems. Appl. Math. Comput. 2010, 216, 1715–1722.
14. Li, L.; Huang, T.Z.; Liu, X.P. Asymmetric Hermitian and skew-Hermitian splitting methods for positive definite linear systems. Comput. Math. Appl. 2007, 54, 147–159.
15. Noormohammadi Pour, H.; Sadeghi Goughery, H. New Hermitian and skew-Hermitian splitting methods for non-Hermitian positive-definite linear systems. Numer. Algorithms 2015, 69, 207–225.
16. Yang, A.L.; Cao, Y.; Wu, Y.J. Minimum residual Hermitian and skew-Hermitian splitting iteration method for non-Hermitian positive definite linear systems. BIT Numer. Math. 2019, 59, 299–319.
17. Li, C.X.; Wu, S.L. A single-step HSS method for non-Hermitian positive definite linear systems. Appl. Math. Lett. 2015, 44, 26–29.
18. Li, C.X.; Wu, S.L. A SHSS–SS iteration method for non-Hermitian positive definite linear systems. Results Appl. Math. 2022, 13, 100225.
19. Jiang, M.Q.; Cao, Y. On local Hermitian and skew-Hermitian splitting iteration methods for generalized saddle point problems. J. Comput. Appl. Math. 2009, 231, 973–982.
20. Benzi, M.; Gander, M.J.; Golub, G.H. Optimization of the Hermitian and skew-Hermitian splitting iteration for saddle-point problems. BIT Numer. Math. 2003, 43, 881–900.
21. Krukier, L.A.; Krukier, B.L.; Ren, Z.R. Generalized skew-Hermitian triangular splitting iteration methods for saddle-point linear systems. Numer. Linear Algebra Appl. 2014, 21, 152–170.
22. Bai, Z.Z.; Benzi, M. Regularized HSS iteration methods for saddle-point linear systems. BIT Numer. Math. 2017, 57, 287–311.
23. Yang, A.L.; Wu, Y.J. The Uzawa–HSS method for saddle-point problems. Appl. Math. Lett. 2014, 38, 38–42.
24. Li, X.; Yang, A.L.; Wu, Y.J. Parameterized preconditioned Hermitian and skew-Hermitian splitting iteration method for saddle-point problems. Int. J. Comput. Math. 2014, 91, 1224–1238.
25. Wang, X.; Li, Y.; Dai, L. On Hermitian and skew-Hermitian splitting iteration methods for the linear matrix equation AXB = C. Comput. Math. Appl. 2013, 65, 657–664.
26. Bai, Z.Z. On Hermitian and skew-Hermitian splitting iteration methods for continuous Sylvester equations. J. Comput. Math. 2011, 29, 185–198.
27. Wang, X.; Li, W.W.; Mao, L.Z. On positive-definite and skew-Hermitian splitting iteration methods for continuous Sylvester equation AX + XB = C. Comput. Math. Appl. 2013, 66, 2352–2361.
28. Zhou, R.; Wang, X.; Tang, X.B. A generalization of the Hermitian and skew-Hermitian splitting iteration method for solving Sylvester equations. Appl. Math. Comput. 2015, 271, 609–617.
29. Zhou, R.; Wang, X.; Tang, X.B. Preconditioned positive-definite and skew-Hermitian splitting iteration methods for continuous Sylvester equations AX + XB = C. East Asian J. Appl. Math. 2017, 7, 55–69.
30. Dehghan, M.; Shirilord, A. A generalized modified Hermitian and skew-Hermitian splitting (GMHSS) method for solving complex Sylvester matrix equation. Appl. Math. Comput. 2019, 348, 632–651.
31. Bahramizadeh, Z.; Nazari, M.; Zak, M.K.; Yarahmadi, Z. Minimal residual Hermitian and skew-Hermitian splitting iteration method for the continuous Sylvester equation. arXiv 2020, arXiv:2012.00310.
32. Xu, L.; Mingxiang, L. Generalized positive-definite and skew-Hermitian splitting iteration method and its SOR acceleration for continuous Sylvester equations. Math. Numer. Sin. 2021, 43, 354.
33. Bai, Z.Z.; Benzi, M.; Chen, F. Modified HSS iteration methods for a class of complex symmetric linear systems. Computing 2010, 87, 93–111.
34. Li, X.; Yang, A.L.; Wu, Y.J. Lopsided PMHSS iteration method for a class of complex symmetric linear systems. Numer. Algorithms 2014, 66, 555–568.
35. Wu, S.L. Several variants of the Hermitian and skew-Hermitian splitting method for a class of complex symmetric linear systems. Numer. Linear Algebra Appl. 2015, 22, 338–356.
36. Bai, Z.Z.; Guo, X.P. On Newton-HSS methods for systems of nonlinear equations with positive-definite Jacobian matrices. J. Comput. Math. 2010, 28, 235–260.
37. Zhu, M.Z. Modified iteration methods based on the Asymmetric HSS for weakly nonlinear systems. J. Comput. Anal. Appl. 2013, 15, 185–195.
38. Ke, Y.F.; Ma, C.F. A preconditioned nested splitting conjugate gradient iterative method for the large sparse generalized Sylvester equation. Comput. Math. Appl. 2014, 68, 1409–1420.
39. Zheng, Q.Q.; Ma, C.F. On normal and skew-Hermitian splitting iteration methods for large sparse continuous Sylvester equations. J. Comput. Appl. Math. 2014, 268, 145–154.
40. Carre, B. The determination of the optimum accelerating factor for successive over-relaxation. Comput. J. 1961, 4, 73–78.
41. Kulsrud, H.E. A practical technique for the determination of the optimum relaxation factor of the successive over-relaxation method. Commun. ACM 1961, 4, 184–187.
42. Bai, Z.Z.; Golub, G.H.; Li, C.K. Optimal parameter in Hermitian and skew-Hermitian splitting method for certain two-by-two block matrices. SIAM J. Sci. Comput. 2006, 28, 583–603.
43. Chen, F. On choices of iteration parameter in HSS method. Appl. Math. Comput. 2015, 271, 832–837.
44. Huang, Y.M. A practical formula for computing optimal parameters in the HSS iteration methods. J. Comput. Appl. Math. 2014, 255, 142–149.
45. Yang, A.L. Scaled norm minimization method for computing the parameters of the HSS and the two-parameter HSS preconditioners. Numer. Linear Algebra Appl. 2018, 25, e2169.
46. Huang, N. Variable-parameter HSS methods for non-Hermitian positive definite linear systems. Linear Multilinear Algebra 2021, 1–18.
47. Jiang, K.; Su, X.; Zhang, J. A general alternating-direction implicit framework with Gaussian process regression parameter prediction for large sparse linear systems. SIAM J. Sci. Comput. 2022, 44, A1960–A1988.
48. Axelsson, O.; Bai, Z.Z.; Qiu, S.X. A class of nested iteration schemes for linear systems with a coefficient matrix with a dominant positive definite symmetric part. Numer. Algorithms 2004, 35, 351–372.
49. Von Mises, R. Mathematical Theory of Probability and Statistics; Academic Press: Cambridge, MA, USA, 2014.
50. Pourbagher, M.; Salkuyeh, D.K. On the solution of a class of complex symmetric linear systems. Appl. Math. Lett. 2018, 76, 14–20.
51. Greif, C.; Varah, J. Iterative solution of cyclically reduced systems arising from discretization of the three-dimensional convection-diffusion equation. SIAM J. Sci. Comput. 1998, 19, 1918–1940.
52. Greif, C.; Varah, J. Block stationary methods for nonsymmetric cyclically reduced systems arising from three-dimensional elliptic equations. SIAM J. Matrix Anal. Appl. 1999, 20, 1038–1059.
53. Axelsson, O.; Kucherov, A. Real valued iterative methods for solving complex symmetric linear systems. Numer. Linear Algebra Appl. 2000, 7, 197–218.

Figure 1. The IT and CPU of the HSS method and the LHSS method to solve the 3D convection-diffusion equation with splitting parameters selected by the traversal method and the GPR method, respectively.

Figure 2. The splitting parameters of the HSS method, the LHSS method, the IHSS method and the ILHSS method to solve the 3D convection-diffusion equation with splitting parameters selected by the traversal method and the GPR method, respectively.

Figure 3. The IT and CPU of the HSS method and the LHSS method to solve the 3D convection-diffusion equation with splitting parameters selected by the theoretical method and the GPR method, respectively.

Figure 4. The IT and CPU of the NHSS method, the SHSS method and the SHSS-SS method to solve the 3D convection-diffusion equation with splitting parameters selected by the theoretical method and the GPR method, respectively.

Figure 5. The IT and CPU of the HSS method and the MHSS method to solve Equation (14) with splitting parameters selected by the traversal method and the GPR method, respectively.

Figure 6. The IT and CPU of the HSS method, the MHSS method and the MSNS method to solve Equation (14) with splitting parameters selected by the theoretical method and the GPR method, respectively.

Table 1. Results of the HSS method for solving 3D convection-diffusion equation with traversal method and GPR method.

$n^{3}$	HSS (Traversal Method)			HSS (GPR)				$\frac{| Traversal time - Training time |}{Traversal time}$	Traversal Time
$n^{3}$	$α_{trav}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set	$\frac{| Traversal time - Training time |}{Traversal time}$	Traversal Time
$8^{3}$ (512)	1.7500	32	0.2500	1.7533	32	0.2813	$[2^{3}, 4^{3}]$	93.73%	0.12 h
$12^{3}$ (1728)	1.2600	46	0.6719	1.2920	46	0.6706	$[1^{3}, 6^{3}]$	97.02%	0.59 h
$16^{3}$ (4096)	1.0000	59	2.5625	1.0673	60	3.3125	$[1^{3}, 4^{3}]$	99.45%	1.53 h
$20^{3}$ (8000)	0.8400	72	7.7969	0.8424	72	6.0152	$[2^{3}, 7^{3}, 10^{3}]$	88.53%	2.42 h
$24^{3}$ (13,824)	0.7200	85	25.2031	0.7090	86	25.6250	$[1^{3}, 6^{3}, 10^{3}]$	91.83%	5.69 h
$28^{3}$ (21,952)	0.6200	99	54.4063	0.6527	100	55.1875	$[1^{3}, 7^{3}, 10^{3}]$	93.43%	>6 h
$32^{3}$ (32,768)	0.5600	111	115.7344	0.5457	112	117.2031	$[1^{3}, 7^{3}, 10^{3}]$	95.99%	>6 h
$36^{3}$ (46,656)	0.5000	124	193.6406	0.5007	124	193.3594	$[1^{3}, 5^{3}, 7^{3}, 13^{3}]$	97.79%	>6 h

Table 2. Results of the IHSS method for solving 3D convection-diffusion equation with traversal method and GPR method.

$n^{3}$	IHSS (Traversal Method)			IHSS (GPR)				$\frac{| Traversal time - Training time |}{Traversal time}$	Traversal Time
$n^{3}$	$α_{trav}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set	$\frac{| Traversal time - Training time |}{Traversal time}$	Traversal Time
$32^{3}$ (32,768)	0.6200	125	47.7969	0.6389	125	47.7656	$[2^{3}, 6^{3}, 10^{3}]$	98.03%	3.25 h
$48^{3}$ (110,592)	0.4500	187	257.7188	0.4647	187	256.1250	$[2^{3}, 7^{3}, 10^{3}]$	99.37%	>6 h
$64^{3}$ (262,144)	0.3600	246	680.7031	0.3502	249	693.2031	$[1^{3}, 4^{3}, 5^{3}, 10^{3}, 12^{3}]$	99.89%	>6 h

Table 3. Results of the LHSS method for solving 3D convection-diffusion equation with traversal method and GPR method.

$n^{3}$	LHSS (Traversal Method)			LHSS (GPR)				$\frac{| Traversal time - Training time |}{Traversal time}$	Traversal Time
$n^{3}$	$α_{trav}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set	$\frac{| Traversal time - Training time |}{Traversal time}$	Traversal Time
$8^{3}$ (512)	1.5300	5	0.0938	1.7323	5	0.0469	$[2^{3}, 3^{3}, 4^{3}]$	92.09%	0.07 h
$12^{3}$ (1728)	1.1200	5	0.1781	1.3161	5	0.1250	$[2^{3}, 3^{3}, 4^{3}]$	98.90%	0.47 h
$16^{3}$ (4096)	0.8700	5	0.3406	0.9999	5	0.3750	$[2^{3}, 3^{3}, 4^{3}]$	99.50%	1.04 h
$20^{3}$ (8000)	0.7100	5	0.8125	0.7597	5	0.8106	$[2^{3}, 3^{3}, 4^{3}]$	99.65%	1.46 h
$24^{3}$ (13,824)	0.6000	5	1.2969	0.6497	5	1.6719	$[1^{3}, 6^{3}, 10^{3}]$	91.44%	2.50 h
$28^{3}$ (21,952)	0.5100	5	3.9531	0.5340	5	2.9219	$[1^{3}, 6^{3}, 10^{3}]$	92.20%	2.74 h
$32^{3}$ (32,768)	0.4500	5	10.6094	0.4949	5	5.2813	$[1^{3}, 7^{3}, 10^{3}]$	93.27%	3.20 h
$36^{3}$ (46,656)	0.4000	5	10.2188	0.4158	5	8.3281	$[1^{3}, 7^{3}, 10^{3}]$	94.82%	4.15 h

Table 4. Results of the ILHSS method for solving 3D convection-diffusion equation with traversal method and GPR method.

$n^{3}$	ILHSS (Traversal Method)			ILHSS (GPR)				$\frac{| Traversal time - Training time |}{Traversal time}$	Traversal Time
$n^{3}$	$α_{trav}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set	$\frac{| Traversal time - Training time |}{Traversal time}$	Traversal Time
$32^{3}$ (32,768)	1.4100	7	2.3281	1.4789	7	2.3031	$[4^{3}, 8^{3}]$	98.02%	0.37 h
$48^{3}$ (110,592)	1.4300	7	10.2969	1.4394	7	7.4219	$[4^{3}, 8^{3}]$	99.35%	1.11 h
$64^{3}$ (262,144)	1.4100	7	37.7031	1.4155	7	36.8281	$[4^{3}, 7^{3}, 12^{3}]$	99.49%	3.04 h

Table 5. Results of the HSS method for solving 3D convection-diffusion equation with theoretical method and GPR method.

$n^{3}$	HSS (Theoretical Method)			HSS (GPR)
$n^{3}$	$α_{theo}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set
$8^{3}$ (512)	2.0521	35	0.6406	1.7533	32	0.2813	$[2^{3}, 4^{3}]$
$12^{3}$ (1728)	1.4359	49	1.3594	1.2920	46	0.6706	$[1^{3}, 6^{3}]$
$16^{3}$ (4096)	1.1025	62	4.1406	1.0673	60	3.3125	$[1^{3}, 4^{3}]$
$20^{3}$ (8000)	0.8943	75	8.2031	0.8424	72	6.0152	$[2^{3}, 7^{3}, 10^{3}]$
$24^{3}$ (13,824)	0.7520	87	26.4844	0.7090	86	25.6250	$[1^{3}, 6^{3}, 10^{3}]$
$28^{3}$ (21,952)	0.6487	100	55.8438	0.6312	99	54.1875	$[3^{3}, 4^{3}, 16^{3}]$
$32^{3}$ (32,768)	0.5703	112	118.2188	0.5457	112	117.2031	$[1^{3}, 7^{3}, 10^{3}]$
$36^{3}$ (46,656)	0.5088	124	205.2500	0.5007	124	193.3594	$[1^{3}, 5^{3}, 7^{3}, 13^{3}]$

Table 6. Results of the LHSS method for solving 3D convection-diffusion equation with theoretical method and GPR method.

$n^{3}$	LHSS (Theoretical Method)			LHSS (GPR)
$n^{3}$	$α_{theo}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set
$8^{3}$ (512)	0.7091	9	0.0469	1.7323	5	0.0469	$[2^{3}, 3^{3}, 4^{3}]$
$12^{3}$ (1728)	0.3436	12	0.4688	1.3161	5	0.1250	$[2^{3}, 3^{3}, 4^{3}]$
$16^{3}$ (4096)	0.2026	16	1.8750	0.9999	5	0.3750	$[2^{3}, 3^{3}, 4^{3}]$
$20^{3}$ (8000)	0.1333	20	4.5625	0.7597	5	0.8106	$[2^{3}, 3^{3}, 4^{3}]$
$24^{3}$ (13,824)	0.0943	25	19.3438	0.6497	5	1.6719	$[1^{3}, 6^{3}, 10^{3}]$
$28^{3}$ (21,952)	0.0701	31	41.5938	0.5340	5	2.9219	$[1^{3}, 6^{3}, 10^{3}]$
$32^{3}$ (32,768)	0.0542	37	81.1719	0.4949	5	5.2813	$[1^{3}, 7^{3}, 10^{3}]$
$36^{3}$ (46,656)	0.0432	44	150.7188	0.4158	5	8.3281	$[1^{3}, 7^{3}, 10^{3}]$

Table 7. Results of the NHSS method for solving 3D convection-diffusion equation with theoretical method and GPR method.

$n^{3}$	NHSS (Theoretical Method)			NHSS (GPR)
$n^{3}$	$α_{theo}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set
$8^{3}$ (512)	0.2711	4	0.0469	0.0100	4	0.0494	$[2^{3}, 3^{3}, 4^{3}]$
$12^{3}$ (1728)	0.2880	5	0.1869	0.0100	3	0.1406	$[2^{3}, 3^{3}, 4^{3}]$
$16^{3}$ (4096)	0.2945	5	0.4063	0.0100	3	0.1563	$[2^{3}, 3^{3}, 4^{3}]$
$20^{3}$ (8000)	0.2978	5	0.8750	0.0100	3	0.6281	$[2^{3}, 3^{3}, 4^{3}]$
$24^{3}$ (13,824)	0.2996	5	2.0156	0.0100	3	1.4063	$[1^{3}, 6^{3}, 10^{3}]$
$28^{3}$ (21,952)	0.3007	5	5.2656	0.0100	3	4.1156	$[1^{3}, 6^{3}, 10^{3}]$
$32^{3}$ (32,768)	0.3014	5	7.6250	0.0100	3	5.4375	$[1^{3}, 7^{3}, 10^{3}]$
$36^{3}$ (46,656)	0.3020	5	14.1406	0.0100	3	9.6406	$[1^{3}, 7^{3}, 10^{3}]$

Table 8. Results of the SHSS method for solving 3D convection-diffusion equation with theoretical method and GPR method.

$n^{3}$	SHSS (Theoretical Method)			SHSS (GPR)
$n^{3}$	$α_{theo}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set
$8^{3}$ (512)	0.2711	15	0.0781	0.0100	7	0.0281	$[2^{3}, 3^{3}, 4^{3}]$
$12^{3}$ (1728)	0.2880	25	0.3438	0.0100	6	0.0781	$[2^{3}, 3^{3}, 4^{3}]$
$16^{3}$ (4096)	0.2945	39	1.4063	0.0100	6	0.1563	$[2^{3}, 3^{3}, 4^{3}]$
$20^{3}$ (8000)	0.2978	55	4.7500	0.0100	6	0.5625	$[2^{3}, 3^{3}, 4^{3}]$
$24^{3}$ (13,824)	0.2996	75	13.6250	0.0100	7	3.4219	$[1^{3}, 6^{3}, 10^{3}]$
$28^{3}$ (21,952)	0.3007	97	38.9375	0.0100	7	11.7813	$[1^{3}, 6^{3}, 10^{3}]$
$32^{3}$ (32,768)	0.3014	122	100.5469	0.0100	8	15.1094	$[1^{3}, 7^{3}, 10^{3}]$
$36^{3}$ (46,656)	0.3020	150	220.4844	0.0100	9	17.1563	$[1^{3}, 7^{3}, 10^{3}]$

Table 9. Results of the SHSS-SS method for solving 3D convection-diffusion equation with theoretical method and GPR method.

$n^{3}$	SHSS-SS (Theoretical Method)			SHSS-SS (GPR)
$n^{3}$	$α_{theo}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set
$8^{3}$ (512)	0.2711	7	0.0469	0.0100	7	0.0438	$[2^{3}, 3^{3}, 4^{3}]$
$12^{3}$ (1728)	0.2880	7	0.4063	0.0100	6	0.3281	$[2^{3}, 3^{3}, 4^{3}]$
$16^{3}$ (4096)	0.2945	12	1.5625	0.0100	6	0.6563	$[2^{3}, 3^{3}, 4^{3}]$
$20^{3}$ (8000)	0.2978	17	6.0625	0.0100	6	3.3906	$[2^{3}, 3^{3}, 4^{3}]$
$24^{3}$ (13,824)	0.2996	24	16.7969	0.0100	6	9.2656	$[1^{3}, 6^{3}, 10^{3}]$
$28^{3}$ (21,952)	0.3007	31	52.7188	0.0100	6	25.9063	$[1^{3}, 6^{3}, 10^{3}]$
$32^{3}$ (32,768)	0.3014	40	139.7031	0.0100	6	46.2344	$[1^{3}, 7^{3}, 10^{3}]$
$36^{3}$ (46,656)	0.3020	49	309.8125	0.0100	6	60.8906	$[1^{3}, 7^{3}, 10^{3}]$

Table 10. Results of the HSS method for solving Equation (14) with traversal method and GPR method.

$m^{2}$	HSS (Traversal Method)			HSS (GPR)				$\frac{| Traversal time - Training time |}{Traversal time}$	Traversal Time
$m^{2}$	$α_{trav}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set	$\frac{| Traversal time - Training time |}{Traversal time}$	Traversal Time
$32^{2}$ (1024)	0.5800	63	0.6563	0.5935	64	0.6250	$[2^{2}, 4^{2}, 10^{2}]$	96.53%	0.07 h
$64^{2}$ (4096)	0.3900	94	4.9844	0.3831	95	4.8594	$[2^{2}, 4^{2}, 6^{2}, 10^{2}]$	99.36%	0.46 h
$96^{2}$ (9216)	0.3100	117	18.0156	0.3093	118	19.0469	$[5^{2}, 7^{2}, 12^{2}]$	99.76%	1.67 h
$128^{2}$ (16,384)	0.2700	136	40.7969	0.2783	135	35.6094	$[5^{2}, 8^{2}, 12^{2}]$	99.90%	3.78 h
$160^{2}$ (25,600)	0.2500	151	71.4688	0.2514	152	72.1094	$[6^{2}, 10^{2}, 14^{2}]$	99.91%	>6 h
$192^{2}$ (36,864)	0.2200	166	121.9531	0.2226	165	114.6875	$[6^{2}, 8^{2}, 14^{2}]$	99.95%	>6 h
$224^{2}$ (50,176)	0.2100	178	181.4063	0.2089	178	181.0406	$[6^{2}, 10^{2}, 11^{2}]$	99.98%	>6 h
$256^{2}$ (65,536)	0.2000	191	271.4375	0.1912	191	271.2344	$[8^{2}, 11^{2}, 13^{2}]$	99.99%	>6 h

Table 11. Results of the MHSS method for solving Equation (14) with traversal method and GPR method.

$m^{2}$	MHSS (Traversal Method)			MHSS (GPR)				$\frac{| Traversal time - Training time |}{Traversal time}$	Traversal Time
$m^{2}$	$α_{trav}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set	$\frac{| Traversal time - Training time |}{Traversal time}$	Traversal Time
$32^{2}$ (1024)	0.7800	53	0.1094	0.7979	53	0.1006	$[3^{2}, 6^{2}, 14^{2}]$	83.20%	0.01 h
$64^{2}$ (4096)	0.5500	72	0.7344	0.5440	73	0.7813	$[3^{2}, 10^{2}, 14^{2}]$	96.64%	0.08 h
$96^{2}$ (9216)	0.4600	86	6.4063	0.4825	87	6.7813	$[3^{2}, 9^{2}, 12^{2}, 14^{2}]$	99.45%	0.63 h
$128^{2}$ (16,384)	0.4000	98	11.7500	0.4068	98	11.6719	$[4^{2}, 6^{2}, 10^{2}, 13^{2}]$	99.71%	1.14 h
$160^{2}$ (25,600)	0.3600	108	20.7500	0.3593	108	20.6063	$[4^{2}, 7^{2}, 10^{2}, 13^{2}]$	99.81%	1.91 h
$192^{2}$ (36,864)	0.3300	117	35.6094	0.3227	118	36.4844	$[10^{2}, 20^{2}, 40^{2}]$	99.39%	3.46 h
$224^{2}$ (50,176)	0.3100	125	57.0938	0.3114	125	57.0781	$[10^{2}, 20^{2}, 30^{2}, 60^{2}]$	98.88%	5.25 h
$256^{2}$ (65,536)	0.2900	133	81.6563	0.2990	133	81.2500	$[10^{2}, 30^{2}, 40^{2}, 60^{2}]$	99.57%	>6 h

Table 12. Results of the HSS method for solving Equation (14) with theoretical method and GPR method.

$m^{2}$	HSS (Theoretical Method)			HSS (GPR)
$m^{2}$	$α_{theo}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set
$32^{2}$ (1024)	0.6734	71	0.7031	0.5935	64	0.6250	$[2^{2}, 4^{2}, 10^{2}]$
$64^{2}$ (4096)	0.4402	102	5.2180	0.3831	95	4.8594	$[2^{2}, 4^{2}, 6^{2}, 10^{2}]$
$96^{2}$ (9216)	0.3486	124	30.0469	0.3093	118	19.0469	$[5^{2}, 7^{2}, 12^{2}]$
$128^{2}$ (16,384)	0.2970	141	40.9531	0.2783	135	35.6094	$[5^{2}, 8^{2}, 12^{2}]$
$160^{2}$ (25,600)	0.2630	156	74.3750	0.2514	152	72.1094	$[6^{2}, 10^{2}, 14^{2}]$
$192^{2}$ (36,864)	0.2384	170	122.8906	0.2226	165	114.6875	$[6^{2}, 8^{2}, 14^{2}]$
$224^{2}$ (50,176)	0.2196	182	190.9844	0.2089	178	181.6406	$[6^{2}, 10^{2}, 11^{2}]$
$256^{2}$ (65,536)	0.2047	193	273.9531	0.1912	191	271.2344	$[8^{2}, 11^{2}, 13^{2}]$

Table 13. Results of the MHSS method for solving Equation (14) with theoretical method and GPR method.

$m^{2}$	MHSS (Theoretical Method)			MHSS (GPR)
$m^{2}$	$α_{theo}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set
$32^{2}$ (1024)	0.6734	59	0.1719	0.7979	53	0.1006	$[3^{2}, 6^{2}, 14^{2}]$
$64^{2}$ (4096)	0.4402	88	0.9219	0.5440	73	0.7813	$[3^{2}, 10^{2}, 14^{2}]$
$96^{2}$ (9216)	0.3486	109	8.0469	0.4825	87	6.7813	$[3^{2}, 9^{2}, 12^{2}, 14^{2}]$
$128^{2}$ (16,384)	0.2970	127	15.4844	0.4068	98	11.6719	$[4^{2}, 6^{2}, 10^{2}, 13^{2}]$
$160^{2}$ (25,600)	0.2630	142	26.5938	0.3593	108	20.6063	$[4^{2}, 7^{2}, 10^{2}, 13^{2}]$
$192^{2}$ (36,864)	0.2384	156	50.4531	0.3227	118	36.4844	$[10^{2}, 20^{2}, 40^{2}]$
$224^{2}$ (50,176)	0.2196	169	75.5000	0.3114	125	57.0781	$[10^{2}, 20^{2}, 30^{2}, 60^{2}]$
$256^{2}$ (65,536)	0.2047	181	109.7188	0.2990	133	81.2500	$[10^{2}, 30^{2}, 40^{2}, 60^{2}]$
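
The gap between the two parameter choices in Table 13 is easier to judge as a relative saving. The following is a minimal sketch, not part of the original experiments: it transcribes the IT and CPU columns of Table 13 and computes (baseline − GPR)/baseline for each grid size. The names `TABLE_13`, `relative_reduction`, and `summarize` are hypothetical and introduced only for illustration.

```python
# Illustrative sketch: relative savings of the GPR-predicted parameter over the
# theoretical parameter, using the IT and CPU columns transcribed from Table 13.
# All names below are hypothetical helpers, not taken from the paper.

# Each row: (m, IT_theo, CPU_theo, IT_gpr, CPU_gpr), with the grid size given as m^2.
TABLE_13 = [
    (32, 59, 0.1719, 53, 0.1006),
    (64, 88, 0.9219, 73, 0.7813),
    (96, 109, 8.0469, 87, 6.7813),
    (128, 127, 15.4844, 98, 11.6719),
    (160, 142, 26.5938, 108, 20.6063),
    (192, 156, 50.4531, 118, 36.4844),
    (224, 169, 75.5000, 125, 57.0781),
    (256, 181, 109.7188, 133, 81.2500),
]


def relative_reduction(baseline, improved):
    """Return (baseline - improved) / baseline as a fraction of the baseline."""
    return (baseline - improved) / baseline


def summarize(rows):
    """Return (m, IT reduction, CPU reduction) for every table row."""
    return [
        (m, relative_reduction(it_theo, it_gpr), relative_reduction(cpu_theo, cpu_gpr))
        for m, it_theo, cpu_theo, it_gpr, cpu_gpr in rows
    ]


if __name__ == "__main__":
    for m, it_red, cpu_red in summarize(TABLE_13):
        print(f"m = {m:3d}: IT reduced by {it_red:6.2%}, CPU reduced by {cpu_red:6.2%}")
```

The same helper could be applied to Tables 12 and 14 by swapping in their rows. The CPU figures depend on the machine used in the original experiments, so the iteration-count reduction is the more portable of the two quantities.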

Table 14. Results of the MSNS method for solving Equation (14) with theoretical method and GPR method.

$m^{2}$	MSNS (Theoretical Method)			MSNS (GPR)
$m^{2}$	$α_{theo}$	IT	CPU	$α_{gpr}$	IT	CPU	Training Set
$32^{2}$ (1024)	1.1456	42	1.2344	1.0317	38	0.7969	$[4^{2}, 11^{2}]$
$64^{2}$ (4096)	0.7906	57	7.4375	0.7376	54	6.6563	$[3^{2}, 4^{2}, 9^{2}]$
$96^{2}$ (9216)	0.6399	68	26.9531	0.5745	64	25.4531	$[6^{2}, 13^{2}, 20^{2}]$
$128^{2}$ (16,384)	0.5516	77	53.6250	0.4934	74	50.1094	$[7^{2}, 10^{2}, 24^{2}]$
$160^{2}$ (25,600)	0.4920	85	100.4688	0.4365	83	85.4219	$[6^{2}, 10^{2}, 15^{2}, 20^{2}]$
$192^{2}$ (36,864)	0.4483	92	189.4063	0.4217	89	182.1719	$[7^{2}, 10^{2}, 15^{2}, 27^{2}]$
$224^{2}$ (50,176)	0.4145	99	254.5938	0.3855	96	238.6875	$[7^{2}, 10^{2}, 15^{2}, 24^{2}]$
$256^{2}$ (65,536)	0.3873	104	335.6563	0.3628	102	328.0313	$[7^{2}, 10^{2}, 15^{2}, 21^{2}]$

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
