1. Introduction
Solving linear equations is one of the most fundamental topics in matrix computation, and with the development of science and technology, many important problems in the natural sciences and engineering can often be reduced to the following linear equation:
where
,
is a large sparse non-Hermitian and positive definite matrix.
There are many powerful matrix-splitting iterative methods for solving systems of linear equations, such as the successive over-relaxation (SOR) method [
1], the symmetric SOR (SSOR) method [
2], the accelerated over-relaxation (AOR) method [
3] and the symmetric AOR (SAOR) method [
4]. Many researchers have applied them to different problems and made some improvements [
5,
6,
7,
8]. Considering the specificity of the problem and solving Equation (
1) more efficiently, many new matrix-splitting iterative methods have been proposed. Bai et al. offered the Hermitian and skew-Hermitian splitting (HSS) method and the inexact HSS method [
9]. To improve the efficiency of the HSS method, Bai et al. proposed the preconditioned HSS (PHSS) [
10]. Due to the promising performance of the HSS method, some HSS-type iteration methods were presented. These methods can be mainly divided into the following two forms. The first one is accelerated HSS-type methods. Such as the generalized HSS method [
11], the lopsided HSS (LHSS) method [
12], the generalized PHSS method [
13], the asymmetric HSS method [
14] and the new HSS (NHSS) method [
15]. In addition, Yang et al. offered the minimum residual HSS method [
16] by applying the minimum residual technique to the HSS method and Li et al. [
17] proposed the single-step HSS (SHSS) method. Based on the shift-splitting method and the SHSS method, Li et al. established the SHSS-SS method [
18].
Apart from the accelerated HSS-type methods, some other HSS-type methods focused on the applications to different kinds of problems. Such as the saddle-point problems [
19,
20,
21,
22,
23,
24], solving the matrix equation [
25,
26,
27,
28,
29,
30,
31,
32], and solving the complex symmetric linear systems [
33,
34,
35] and the nonlinear systems [
36,
37].
These iteration methods contain splitting parameters that need to be specified in advance. At present, there are three main methods of selecting the splitting parameters. The first is obtaining relatively optimal parameters by traversing or experimenting within some intervals [
26,
38,
39]. The advantage of this traversal method is that it can obtain relatively accurate optimal parameters, but it requires large amount of calculation and consumes a lot of extra time, especially when the dimension of the coefficient matrix is large. The second is estimating optimal parameters through theoretical analysis [
40,
41]. Some researchers find optimal parameters by minimizing the spectral radius of the iterative matrix. However, solving this optimization problem is very difficult in theoretical analysis and practical computation. Bai et al. [
42] proposed an accurate formula for computing the optimal parameters of the HSS method by directly minimizing the spectral radius of the iterative matrix, but the coefficient matrix is a two-by-two matrix or a two-by-two block matrix with specific forms. Some researchers find quasi-optimal parameters by minimizing the upper bound of the spectral radius of the iterative matrix of some iteration methods [
9,
12,
17,
18]. By a reasonable and simple optimization principle, Chen [
43] proposed an accurate estimate to the optimal parameter of the HSS iteration method. Huang [
44] and Yang [
45] estimated the optimal parameters of the HSS method by solving a cubic polynomial equation and a quartic polynomial equation, respectively. Huang [
46] proposed variable-parameter HSS methods and the parameter in it updated at each step of the iteration. The above theoretical methods contain the following limitations. First, the method is only available case by case, which means it is less universal. Second, the method needs to compute the maximum or the minimum eigenvalues of the matrix, but this is time-consuming work. Jiang et al. [
47] proposed the third estimation method, the Bayesian inference-based Gaussian process regression (GPR) method, to predict the optimal parameters in some alternating direction iterative methods. This method uses a training set to learn a mapping between the dimension of linear systems and relatively optimal splitting parameters.
The choice of splitting parameters can greatly affect the efficiency of the HSS-type iteration methods [
47,
48], which makes the parameter selection of great importance. For computing the splitting parameters of the HSS-type iteration methods, to overcome the limitations of the traversal method and the theoretical methods, we use the GPR method to predict the splitting parameters of some HSS-type iteration methods. The main contributions of this work are:
We apply the GPR method for the prediction of optimal splitting parameters of some HSS-type methods, which is a new application.
We provide extensive numerical experiments to compare the prediction performance of the GPR method with the traversal method and the theoretical methods.
The results of numerical experiments show that: comparing to the traversal method, the GPR method can predict almost the same parameters as the traversal method does but with less computational effort; comparing to the theoretical method, the GPR method can predict better optimal parameters than the theoretical method and is more universal (unlike the theoretical method available case by case, the GPR method is suitable for all the HSS-type iteration methods tested). Moreover, the theoretical methods need to compute the maximum or the minimum eigenvalues (or singular values) of the matrix but this is a time-consuming work when the dimension of the matrix is large and the GPR method overcomes this limitation.
The rest of the paper is organized as follows. In
Section 2, we present Gaussian process regression method based on Bayesian inference. In
Section 3, we present the iteration scheme of some HSS-type iteration methods, the corresponding convergence conditions and the theoretical methods for estimating the relatively optimal splitting parameters. In
Section 4, we illustrate the efficiency of the GPR method by numerical experiments. Finally, in
Section 5, we include some concluding remarks and prospects.
Throughout the paper, the sets of complex and real matrices are denoted by and , respectively. If , let ,,,, denote the transpose, inverse, conjugate transpose, the Euclidean norm and Frobenius norm of X, respectively. The notations , , denote the eigenvalue set, singular value set and spectral radius of X, respectively. The symbol ⊗ denotes the Kronecker product. I represents the identity matrix.
4. Numerical Results
In this section, we present extensive numerical examples to show the power of the GPR method compared with the traversal method and theoretical method. We take a three-dimensional convection-diffusion equation and approximation in the time integration of a parabolic partial differential equations as examples.
In the following numerical experiments, all tests are started with a zero vector. All iterative methods are terminated if the relative residual error satisfies
, where
is the
k-step residual. “IT” and “CPU” denote the required iterations and the CPU time (in seconds), respectively. “Traversal time” denotes the required CPU time (in seconds) to obtain the optimal splitting parameters by the traversal method. “Training time” denotes the required CPU time (in seconds) to produce the training set and train the GPR model. We use
to make a comparison of the calculation amount of the traversal method and the GPR method. Obviously, the larger this quantity is, the longer the traversal time of the traversal method will take compared to the training time of the GPR method.
For the traversal method, the optimal splitting parameter minimizes the iterations of the corresponding iteration method when solving linear systems and it is obtained by traversing interval with a step size of .
For the GPR method, the training set is produced by using the traversal method for a set of small-scale systems and their dimensions are shown later.
For the IHSS method and the ILHSS method, we use the CG method to solve the linear systems with the coefficient matrix
and the GMRES method to solve linear systems with the coefficient matrix
. The inner CG and GMRES iterates are terminated if the current residuals of the inner iterations satisfy
where
and
are, respectively, the residuals of the
jth inner CG and GMRES,
is the residual of the
kth outer iteration.
All computations are carried out using MATLAB 2018b on a personal computer with a 1.8 GHz CPU Intel Core i5 and 8G memory.
Example 1. Consider the following three-dimensional convection-diffusion equationon the unit cube , with constant coefficient q and subject to Dirichlet-type boundary conditions. When the seven-point finite difference discretization and the equidistant step-size (n is the degree of freedom along each dimension) is used on all the three directions applied to Equation (12), we obtain the linear system with the coefficient matrixwhere , and are tridiagonal matrices. If the first order derivatives are approximated by the centered difference scheme, we havewith , , and (mesh Reynolds number). According to [
9,
51,
52], for the centred difference scheme, the extreme eigenvalues and singular values of matrices
M and
N in Equation (
6) are
Therefore, the theoretical method of the HSS method, the LHSS method, the NHSS method, the SHSS method and the SHSS-SS method to obtain the optimal splitting parameters can be easily calculated.
Let
in Equation (
12). The discretization of Equation (
12) leads to a system of linear equations
, where
is defined by Equation (
13), and set the exact solution
, then,
.
In this experiment, we apply the GPR method to compare with the traversal method and the theoretical method, respectively. Concretely, first, we use the HSS method, IHSS method, LHSS method and ILHSS method to solve the 3D convection-diffusion equation, and the splitting parameters are selected using the traversal method and the GPR method, respectively. Numerical experiments results show that the GPR can predict almost the same parameters as the traversal method does, but with less calculation. Finally, we use the HSS method, the NHSS method, the LHSS method, the SHSS method and the SHSS-SS method to solve the 3D convection-diffusion equation, and the splitting parameters are selected using the theoretical method and the GPR method, respectively. Numerical results show that the GPR method can compute better optimal parameters than the theoretical method, which means that the GPR method can be applied to a wide range of matrix-splitting iterative methods and is highly universal.
The GPR method vs. the traversal method. We first use the HSS method, the IHSS method, the LHSS method and the ILHSS method to solve the 3D convection-diffusion equation and the splitting parameters are selected using the traversal method and the GPR method, respectively.
Table 1,
Table 2,
Table 3 and
Table 4 and
Figure 1 show the numerical results.
From
Table 1,
Table 2,
Table 3 and
Table 4 and
Figure 1, we know that the GPR method can predict almost the same parameters as the traversal method does. It uses a training set from a set of small-scale systems and its training time is much less than the traversal time of the traversal method. Thus, the GPR method requires less calculation than the traversal method.
From
Figure 2, we can have a visual representation of what the mapping
we want to fit looks like.
The GPR method vs. the theoretical method. We use the HSS method, the NHSS, the LHSS method, the SHSS method and the SHSS-SS method to solve the 3D convection-diffusion equation, and the splitting parameters are selected using the theoretical method (given in Theorems 3, 5, 7, 9 and 11) and the GPR method, respectively.
Table 5,
Table 6,
Table 7,
Table 8 and
Table 9 and
Figure 3 and
Figure 4 show the numerical results.
From
Table 5,
Table 6,
Table 7,
Table 8 and
Table 9 and
Figure 3 and
Figure 4, we know that the GPR method can predict better optimal parameters than the theoretical method. Unlike the theoretical method available case-by-case, the GPR method is suitable for the five iterative methods, which means that the GPR method is highly universal.
Example 2. Consider the following complex symmetric linear systems.where K is the five-point centered difference matrix approximating the negative Laplacian operator with homogeneous Dirichlet boundary conditions, on a uniform mesh in the unit square with mesh-size . and , with . In addition, the right-hand side vector b with its jth entry is given by Let and normalize coefficient matrix and right-hand side by multiplying both by . Refer to [53] for more details. In this experiment, we apply the GPR method to compare with the traversal method and the theoretical method, respectively. Concretely, first, we use the HSS method and the MHSS method to solve Equation (
14), and the splitting parameters are selected using the traversal method and the GPR method, respectively. Then, we use the HSS method, the MHSS method and the MSNS method to solve Equation (
14), and the splitting parameters are selected using the theoretical method and the GPR method, respectively. Since the extreme eigenvalues of matrix
M and extreme singular value of matrix
N cannot be explicitly obtained, we use MATLAB built-in function “eigs(MaxIterations’,500,’Tolerance’,1e-5)” and “svds(MaxIterations’,500,’Tolerance’,1e-5)” to calculate them.
The GPR method vs. the traversal method. We first use the HSS method and the MHSS method to solve Equation (
14) and the splitting parameters are selected using the traversal method and the GPR method, respectively.
Table 10 and
Table 11 and
Figure 5 show the numerical results.
From
Table 10 and
Table 11 and
Figure 5, we know that the GPR method can predict almost the same parameters as the traversal method does. It uses a training set obtained by using the traversal method for small-scale systems and its training time is much less than the traversal time of the traversal method. Thus, the GPR method requires less calculation than the traversal method.
The GPR method vs. the theoretical method. We use the HSS method, the MHSS method and the MSNS method to solve Equation (
14), and the splitting parameters are selected using the theoretical method (given in Theorem 3, 13 and 14) and the GPR method, respectively.
Table 12,
Table 13 and
Table 14 and
Figure 6 show the numerical results.
From
Table 12,
Table 13 and
Table 14 and
Figure 6, we know that the GPR method can predict better optimal parameters than the theoretical method. Unlike the theoretical method only available case-by-case, the GPR method is suitable for the three iterative methods, which means that the GPR method is highly universal.