1. Introduction
This paper is aimed at constructing an analysis of methods for efficiently solving the systems of linear algebraic equations resulting from FEM discretization of second-order elliptic boundary value problems on general bounded domains
. Although the approach is applicable to a wider class of multidimensional problems, we restricted our presentation to the case
with a polygonal boundary
. As a model equation, we considered the Dirichlet problem
with
uniformly positive definite, bounded, and piecewise smooth on
. The weak formulation of (
1) is: Find
such that
for all
where
Then the FEM approximation of (
2) is: Find
such that
for all
. Here,
is a finite element subspace with dimension
n. Thus, the numerical solution of the elliptic problem (
1) is reduced to the discrete problem (
3), which in turn is equivalent to the linear system.
The stiffeners matrix is sparse, symmetric, and positive definite. Without losing the generality, we assumed that is the subspace of piecewise linear functions (linear triangle finite elements) defined on the triangulation .
It is assumed that the system (
4) is large-scale, that is
. For such sparse linear systems, the advantages of preconditioned conjugate gradient (PCG) solution methods increase with increasing the size
n. In particular, we focused on the case of non-overlapping DD preconditioning.
A key question in DD methods is how to couple the solutions in subdomains. The interface equation involves a pseudodifferential operator, commonly called the Steklov-Poincaré operator. Many of the currently used DD methods are implicit. As a general approach, they approximate the non-local interface problem by solving a suitably constructed sequence of local (at the level of subdomains) problems. To this group belongs the original Schwarz methods, as well as: Dirichlet-Neumann, Neumann-Neumann, additive and multiplicative Schwarz, Bramble-Pasciak-Schatz (BPS), finite element tearing, and interconnect (FETI) methods. For this class of problems, independence of the convergence rate (the number of PCG iterations) of discretization and decomposition parameters does not generally hold and further consideration of the problem is needed [
1]. In the spirit of implicit methods, a remarkable set of earlier results has been published in a series of papers on BPS methods, starting with [
2].
An alternative second approach is to address the interface problem directly. In this case, several specific issues need to be noted: (i) the Steklov-Poincaré operator is known for a very limited set of model problems with constant coefficients in simple domains; (ii) the interface problem is non-local, and the matrix obtained after its discretization is dense, which should significantly increase the computational complexity of the DD method. Some earlier results on this approach have been published in [
3,
4,
5,
6]; see also [
7] and references therein. They treat simplified cases where the interface is either one line or several non-intersecting lines. Fast Fourier transform (FFT) is used there to efficiently solve the arising systems with the square root of the matrix
, which approximates the corresponding Schur complement.
In this paper, we propose and analyze a new non-overlapping DD method which uses the second (direct) approach mentioned above. Our results are applicable to a very general domain partitioning, including the case of interface with cross-points. The construction is based on the spectral equivalence of the energy norm associated with the related Steklov-Poincaré operator and the norm of the Sobolev space of index 1/2. More details on the discrete representation of these norms, when FEM is applied to a numerical solution of the boundary value problem, can be found in [
1]. Our approach is purely algebraic and can be successfully applied to preconditioning systems with graph Laplacians. In this context, we refer to the new discrete trace theorems recently published in [
8]. The first step in the considered construction is to replace (to approximate) the Schur complement
corresponding to the interface
with
, while
is the discrete Laplace-Beltrami operator assembled on
or, in short, the discrete Laplacian over
. According to the trace theory, the obtained preconditioner provides optimal convergence rate, regardless of the mesh parameter
h of the FEM discretization. This is confirmed, for example, by the numerical tests in [
1], where the spectral decomposition of
has been used to solve the fractional Laplacian problems at the interface. Obviously, such a method is computationally expensive.
This brings us to the more general problem of efficient preconditioning methods for fractional Sobolev spaces. In this context, the multigrid method developed in [
9] can be considered, where the abstract additive multilevel framework has been adapted. Although the fractional diffusion on the manifold
is nonlocal, the preconditioner has a controlled sparsity, providing optimal convergence rate if the number of coarsening steps is fixed. Another natural approach, briefly discussed in [
1], is based on a truncated spectral decomposition, where a small number of vectors are used in a modified Lanczos algorithm. The presented initial results of numerical experiments feed certain expectations for further theoretical analysis of this problem-specific model basis reduction algorithm.
We propose a new non-overlapping DD preconditioner where the BURA is applied to the block
. The BURA method was originally introduced in [
10]; see also the survey paper [
11]. Here, we built on the results from [
12], integrating the interface preconditioning technique into the DD framework. In the present paper, we use the improved version of BURA from [
13].
In general, it is clear that there are different methods for spectral fractional diffusion problems which can be applied to the block
. Recently, a unified view to these methods was presented in [
14], showing that all of them can be interpreted as generated by some rational approximation. Thus, BURA methods are asymptotically the best. It should also be noted that the simplified structure of the proposed algorithm is favorable for parallel implementation. In this context, it is an advantage that the rational approximation is computed offline. Among the main contributions of this paper, we noted the following: BURA preconditioners are purely algebraic, so that no regularity assumptions are made in the theoretical analysis; under standard assumptions, the proposed DD method has optimal computational complexity; the presented numerical results prove the concept of BURA based DD preconditioning.
The structure of the paper is as follows. The spectral equivalence of the non-overlapping DD preconditioners under consideration is discussed in the next section.
Section 3 is devoted to the construction and basic properties of the BURA method. Then, condition number estimates of the BURA based DD preconditioners are derived in
Section 4. The optimal value of the introduced weight parameter is obtained. Algorithmic issues are discussed in
Section 5, proving at the end optimal computational complexity estimates. The numerical experiments presented in
Section 6 have a double motivation. On the one hand, they confirm the accuracy of the derived theoretical estimates. On the other hand, they provide an additional practical insight for the efficient usage of rational approximations of a smaller degree. Finally, we give some conclusions in
Section 7.
2. Non-Overlapping DD
Let the computational domain
be partitioned into
N non-overlapping subdomains
,
and let the interface
be alighted with the FEM triangulation
,
We write the linear system (
4) in the
block form
where:
;
,
, and
correspond to the interior mesh nodes of the subdomains;
,
, and
correspond to the mesh nodes over the interface
. The blocks
represent the interaction between the nodal basis functions corresponding to the interior mesh nodes of the subdomains and the interface nodes. As typical for non-overlapping DD, the block
has a block-diagonal form as follows:
Here, a subdomain-by-subdomain numbering of the mesh nodes (unknowns) is used. The factorization
holds true where
is the Schur complement. Now we are ready to introduce the DD preconditioner
in the form
is a scaling parameter that will be further specified. Here, denotes the discrete Laplace-Beltrami operator assembled on , which is the weighted discrete Laplacian with weight equal to the trace of the coefficient matrix on .
Lemma 1. There exists positive constants that are independent of the mesh parameter h such that Proof. The spectral equivalence estimates (
8) follow, for example, from the analysis in [
1], where a more general diffusion reaction problem is considered. □
Lemma 2. The following condition number estimate holds for the non-overlapping DD preconditioner (7) Proof. We rewrite (
7) and (
8) in the form
where
Then the minimal eigenvalue of the preconditioned matrix
is estimated as follows:
In the same way, we derive the estimate
with which the proof is completed. □
The results from Lemma 2 are summarized in
Table 1.
Corollary 1. The recommended interval for the scaling parameter of the non-overlapping DD preconditioner is The implementation of the DD preconditioner involves solving linear systems with the block . Until recently, this was a challenge when the interface contained intersecting lines. The rest of the paper is devoted to developing an effective solution to this problem using the BURA method.
3. Best Uniform Rational Approximation Preconditioning
The concept of our DD preconditioner is to replace the
block with an appropriately chosen approximation, explicitly defined by a computationally efficient approximation of its inverse. Thus, the proposed approach directly relies on the numerical methods for spectral space-fractional problems. The survey paper [
11] tracks some of the recent achievements in the numerical solution of the equation
,
, and is published in [
11], where
is a symmetric positive definite (SPD) operator corresponding to second order elliptic boundary value problem in a bounded domain
. The methods discussed there can be classified into the following three groups: (i) based on the Dunford-Taylor integral formula or its equivalent Balakrishnan formula; (ii) based on equivalent representation of the solution using extension to: a second order elliptic problem in
with a local operator; a pseudo-parabolic equation in the cylinder
; (iii) based on spectral representation of the solution and the BURA of
on
. Although different in origin, all these methods can be interpreted as some rational approximation of
. Thus, the unified approach presented and analysed in [
14] provides a solid basis for comparison and evaluation. In this sense, from a mathematical point of view, the BURA methods should be the best, which is supported by various numerical tests. An example of such a comparative study is available in [
11].
The BURA method was originally introduced in [
10]. After that, the method was consecutively improved in several papers (for more details, see the survey in [
11]). Here, we will apply the variant proposed and analyzed in [
13].
Let
be the BURA of
in
in the class of rational functions of degree
k belonging to the Walsh table. This means that
,
and
are polynomials of equal degree
k. By definition,
is the minimizer
for which
The following asymptotically sharp exponential error estimate with respect to the degree
k holds true [
15]:
Following the notation entered, the BURA approximation of
is defined as
where
is the smallest eigenvalue of the SPD matrix
.
Remark 1. The presented construction of BURA is purely algebraic, and thus the method is applicable to arbitrary SPD matrices.
Following [
12], we consider the preconditioner
of
defined by its inverse via (
14) in the form
Let
denote the condition number of
. The following lemma is then proved in [
12].
Lemma 3. Assume that the degree k is large enough so that . Then, the following condition number estimate for the BURA preconditioner , , holds true: Let
be the eigenvalues of
and let
, for all
. In addition to (
15), following the proof of Lemma 3, as presented in [
12], we obtain the following estimates of the maximal and minimal eigenvalues of
:
5. Implementation Issues and Computational Complexity Estimate
An additive representation of the rational function
can be used for implementation of the BURA method:
where
and
[
13]. In this way, the inverse of BURA approximation (
14) reads as
Therefore, the implementation of the preconditioner requires solving k auxiliary linear systems with sparse SPD matrices, which are obtained by positive diagonal perturbations of .
The following assumptions will be used to analyze the computational complexity of the introduced BURA based non-overlapping DD preconditioner:
- (A1)
A quasi uniform mesh with a mesh parameter h is used for FEM approximation of the considered elliptic problem. Thus, the number of unknowns is and the number of unknowns related to the interface is .
- (A2)
Under the assumption A1, for the condition number of the stiffness matrix holds, ). We will assume in particular that .
- (A3)
The stiffness matrix has at most non=zero elements per row (column).
- (A4)
For systems with sparse SPD matrices that appear in the implementation of , a solver with optimal computational complexity is applied. In particular, we will assume that the cost of the solver is bounded from above by arithmetic operations per unknown. Such commonly used solvers are based, for example, on multigrid or multilevel preconditioners.
- (A5)
For the iterative solution of the system (
4), the PCG method with preconditioner
is used.
The computational complexity of one PCG iteration reads as
where
is the cost of solving a system with the preconditioner
, and
is the cost of a matrix vector multiplication with the stiffness matrix
.
Thus, the next step is to estimate the computational complexity of solving a system with the preconditioner
. Following (
18), the complexity of one iteration can be written in the form
where
and
are the costs for solving systems with the blocks
and
, respectively, and
and
are the costs for matrix vector multiplications with
and
. Then, (
26) is applied to the last before term in (
28), thus obtaining
where
is the cost of solving a system with the sparse SPD matrix
.
Combining (
28) and (
29) and using the assumptions (A1)–(A5), we get the estimate
Lemma 6. Let k be the minimal integer such that Then the following estimate is valid for the relative condition number of the non-overlapping DD preconditioner Proof. From assumption (A2) and (
31), we get the inequality
which is equivalent to
The last inequality can be written in the form
and consequently
Combining the last estimate with the definitions (
21) and the estimate (
24) from Corollary 2, we get
which completes the proof. □
The presented analysis is summarized in the following theorem.
Theorem 1. Let the assumptions (A1)–(A5) be valid and let Then, for large scale problems (i.e., n is large enough) the PCG iterative solution of the system (4) with the BURA based non-overlapping DD preconditioner (18) has optimal computational complexity. Proof. From Lemma 6 and the setting (
34), we obtain that the number of PCG iterations with the preconditioner
that are needed to achieve the prescribed accuracy is bounded by a constant that is independent of
n. Thus, the proof of the theorem follows directly from (
30), given that, for sufficiently large
n, the first term dominates in the estimate of
. □
Remark 2. A larger degree of the BURA k can be used while maintaining the optimal computational complexity of the BURA based non-overlapping DD preconditioner. In this way, the estimate (33) can be improved to is a given integer, which reduces the constant in (32) from 2 to . 6. Numerical Tests
The presented numerical tests are aimed at an experimental study of the preconditioner
, introduced and analyzed in the previous sections. The case of Laplacian is considered, i.e.,
in (
1).
Example 1. The numerical results are for a test problem in , , and , , , and . The right-hand side is , corresponding to the solution . All matrices are obtained by FEM discretization of the boundary value problem. Linear finite elements are used. A uniform rectangular mesh with a mesh parameter h is applied, in which each square cell is split into two triangles. The number of PCG iterations with the non-overlapping DD preconditioner are examined, where the parameters k and are varied.
We start with a uniform rectangular mesh with mesh parameter
,
triangle finite elements, and
unknowns. Then,
consecutive uniform refinement steps are applied, thus obtaining the finest mesh with
,
8,388,608 elements, and
4,190,209 unknowns. The non-overlapping DD of
, the initial mash, and the first mesh refinement are shown in
Figure 1.
The first step in our analysis concerns the behaviour of the condition number
, as shown in
Table 3. The condition numbers corresponding to the first four meshes are computed. The results confirm the estimate
. More precisely, we observe that, for smaller
h,
is proportional to
. The last four condition numbers are marked with
, indicating that they were obtained using the estimate.
As shown in Corollary 2, the condition number of the BURA based non-overlapping DD preconditioner tends to the optimal value of
when
tends to 1. In addition to the theoretical estimates, we show in
Table 4 the computed condition numbers of BURA preconditioners (see for more details [
12]), with respect to
, for
.
Combining the experimental data from
Table 3 and
Table 4, we conclude that
should be enough to ensure an optimal relative condition number of the
preconditioner.
The results reported in
Table 5 illustrate the optimal convergence rate of the BURA based non-overlapping DD method. The used degree of BURA is
. In this set of experiments, we have chosen the parameter
. According to the theory, the number of iterations is independent of
h and therefore of
n.
In
Table 6, we examine the influence of the scaling parameter by changing
and keeping the degree of the BURA
. The presented results show a stable behaviour of the number of PCG iterations. The minimal values for
are obtained for
. This picture is in very good agreement with Lemma 5 and Consequence 2. It is also worth noting that, for
, the condition number
is very close to 1 (see the numerical experiments in [
12]), resulting in
.
So far, the degree
has been chosen to ensure that
is sufficiently small.
Table 7 presents a comparative analysis of the number of iterations when the degree of BURA,
, varies. We notice that
is applicable for smaller problems, with up to
unknowns, while in general
leads to an equal number of iterations. It is also worth noting that larger
n such values of the degree
k have little effect on the overall computational complexity.
7. Concluding Remarks
We analyzed the non-overlapping DD method in the general framework of problems which involve coupling at interfaces that are manifolds of lower dimensions. This naturally gives rise to interface conditions formulated in fractional Sobolev spaces. Thus, the construction of the method is based on the spectral equivalence of the energy norm associated with the related Steklov-Poincaré operator and the Sobolev norm of index 1/2. The matrix formulation of the preconditioner uses the spectral equivalence of the Schur complement and the discrete fractional Laplacian over the interface
, denoted by
. For a long time, the direct application of this approach was avoided due to the lack of efficient numerical methods for solving fractional diffusion problems in domains (interfaces) of general geometry. The situation has changed significantly in recent years [
11].
The proposed new non-overlapping DD preconditioner is based on BURA approximation of . A complete theoretical analysis of the method is provided, including, in particular, a characterization of the introduced scaling parameters and . Finally, we have proved the overall optimality of the computational complexity. The presented numerical results illustrate in detail the theoretical estimates, ending with some practical tips for choosing the degree of rational approximation k and the parameter .
Although presented in 2D formulation, the theory can easily be extended to the 3D case. Note that the estimates of the condition number of
are robust, with respect to the spatial dimension and the geometry of
. This argues that we can get optimal preconditioning properties of the BURA based non-overlapping DD methods for a very wide class of problems. This statement is supported by the numerical results published in [
1], where exact solution of the systems with fractional Laplacian over the interface
is used. Although this approach is computationally too expensive, the reported results are indicative for what we can expect in terms of BURA-based DD behavior in cases of general geometry of the interface and heterogeneous diffusion coefficients in 2D and 3D. We also note that a coarse grid solver may be needed to stabilize the DD iterations for a larger number of subdomains.
Possibilities for generalizing the presented results include: theoretical and experimental analysis of the new method for three-dimensional problems; development of robust domain decomposition preconditioners for the case of strongly heterogeneous and anisotropic media; computational complexity analysis for the case of fractal geometry interfaces. The application of the BURA-based non-overlapping domain decomposition approach to couple non-linear problems is a separate challenging direction for obtaining future results with great potential and practical significance.
One of the remarkable recent examples of effective application of domain decomposition approaches to elliptic PDEs is that of the Poisson-Boltzmann equations (see [
16,
17]) widely arising in the colloid science, protein electrostatics, implicit solvent models, and plasma physics. Development of BURA based non-overlapping domain decomposition preconditioners for the Poisson-Boltzmann elliptic boundary value problems gives another good perspective for state-of-the-art real-world applications.