Article

Operator Smith Algorithm for Coupled Stein Equations from Jump Control Systems

School of Science, Hunan University of Technology, Zhuzhou 412007, China
* Author to whom correspondence should be addressed.
Axioms 2024, 13(4), 249; https://doi.org/10.3390/axioms13040249
Submission received: 9 March 2024 / Revised: 3 April 2024 / Accepted: 5 April 2024 / Published: 10 April 2024
(This article belongs to the Special Issue The Numerical Analysis and Its Application)

Abstract

Consider a class of coupled Stein equations arising from jump control systems. An operator Smith algorithm is proposed for calculating the solution of the system. Convergence of the algorithm is established under certain conditions. For large-scale systems, the operator Smith algorithm is extended to a low-rank structured format, and the error of the algorithm is analyzed. Numerical experiments demonstrate that the operator Smith iteration outperforms existing linearly convergent iterative methods in terms of computation time and accuracy. The low-rank structured iterative format is highly effective in approximating the solutions of large-scale structured problems.

1. Introduction

Consider the discrete-time jump control systems given by
$$x_{j+1} = A(t_i)\,x_j + B(t_i)\,u_j,\qquad y_j = C(t_i)\,x_j,\qquad i, j = 1, \ldots, m,$$
where $A(t_i) \in \mathbb{R}^{N \times N}$, $B(t_i) \in \mathbb{R}^{N \times l_b}$, and $C(t_i) \in \mathbb{R}^{l_c \times N}$ with $l_b, l_c \ll N$. Here, N represents the scale of the jump control system. Efficient control in the analysis and design of jump systems involves the observability Gramian $W_o^i = \sum_{k=0}^{\infty} ((A(t_i))^\top)^k C(t_i)^\top C(t_i) A(t_i)^k$ and the controllability Gramian $W_c^i = \sum_{k=0}^{\infty} A(t_i)^k B(t_i) B(t_i)^\top ((A(t_i))^\top)^k$ [1], which are the solutions of the corresponding coupled discrete-time Stein equations (CDSEs):
$$\mathcal{S}_c^i(X) = X_i - Q_i - A_i^\top\,\mathcal{E}_i(X)\,A_i = 0. \qquad (1)$$
Here, for $i = 1, \ldots, m$, $A_i \in \mathbb{R}^{N \times N}$ is the input matrix, $Q_i \in \mathbb{R}^{N \times N}$ is symmetric and positive semi-definite, and $\mathcal{E}_i(X) = \sum_{j=1}^{m} p_{ij} X_j \in \mathbb{R}^{N \times N}$ with probability values $p_{ij}$ satisfying $\sum_{j=1}^{m} p_{ij} = 1$. Numerous methods, ranging from classical to state-of-the-art, have been developed over the past decades to address the single Stein equation (i.e., m = 1 in (1)), particularly for special matrix structures. For example, Betser et al. investigated solutions tailored to cases where the coefficient matrices are in companion forms [2]. Hueso et al. devised a systolic algorithm for the triangular Stein equation [3]. Li et al. introduced an iterative method for handling large-scale (the term "large-scale" refers to the scale N of the corresponding equations being large) Stein and Lyapunov equations with low-ranked structures [4]. Fan et al. discussed generalized Lyapunov and Stein equations, deriving connections from rational Riccati equations [5]. Yu et al. scrutinized large-scale Stein equations featuring high-ranked structures [6].
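For the single-equation case (m = 1 in (1)), dense solvers are readily available. As a quick illustration of the Gramian–Stein connection above, the following snippet (a minimal sketch; the randomly generated d-stable A and C are assumptions made purely for the demo) computes the observability Gramian with SciPy:

```python
import numpy as np
from scipy.linalg import solve_discrete_lyapunov

# Single-mode (m = 1) illustration: the observability Gramian W_o solves the
# Stein equation X = C^T C + A^T X A.  The matrices below are random stand-ins.
rng = np.random.default_rng(1)
N = 50
A = 0.9 * rng.random((N, N)) / N            # scaled so that rho(A) < 1 (d-stable)
C = rng.random((1, N))
Wo = solve_discrete_lyapunov(A.T, C.T @ C)  # solves A^T X A - X + C^T C = 0
assert np.allclose(Wo, C.T @ C + A.T @ Wo @ A)
```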
For CDSEs (1), the parallel iteration [7], essentially a stationary iteration for linear systems, is a commonly used method to compute the desired solution. This approach has been extended to an implicit sequential format [8], which leverages the latest information from the solutions already obtained within a sweep to accelerate the remaining updates. The gradient-based iterative algorithm, introduced for solving CDSEs, explicitly determines the optimal step size to achieve the maximum convergence rate [9]. By utilizing positive operator theory, two iterative algorithms were established for solving CDSEs [10] and later extended to Itô stochastic systems. Continuous-time Lyapunov equations can be transformed into CDSEs via the Cayley transformation [11], although determining the optimal Cayley parameter remains a challenge. For other related iterative methods for continuous-time Lyapunov equations, consult [12,13,14,15,16] and the references therein.
CDSEs also arise as sub-problems when solving coupled discrete-time Riccati equations from optimized control systems. The difference method [17] and the CM method [18] were proposed to tackle these sub-problems. Ivanov [19] developed two Stein iterations that likewise exploit the latest information from previously derived solutions for acceleration. Notably, the iteration schemes provided in [8] are almost identical to these two Stein iterations [19], essentially corresponding to the Gauss–Jacobi and Gauss–Seidel iterations applied to coupled matrix equations. Successive over-relaxation (SOR) iterations and their variants for CDSEs were explored in [20,21], though determining the optimal SOR parameter remains challenging. One limitation of the aforementioned methods is that they converge only linearly, and the potential structures (such as low rank and sparseness) of the matrices are not fully exploited. To enhance the convergence rate, the Smith method was employed to solve the single Stein equation [22] and extended to structured large-scale problems [4,11]. This method has the advantage of being parameter-free and converging quadratically to the desired symmetric solution. A similar idea was extended to some stochastic matrix equations in [5]. In this paper, we adapt the Smith method to an operator version to compute the solution of CDSEs and subsequently construct the corresponding low-ranked format for large-scale problems. The main contributions are as follows:
  • We introduce the operator defined as
    $$(\mathcal{F})_i(X) = A_i^\top\,\mathcal{E}_i\Big(A^\top\mathcal{E}\big(\cdots A^\top\mathcal{E}(X)\,A\cdots\big)\,A\Big)\,A_i. \qquad (2)$$
    This operator formulation enables us to adapt the Smith iteration [4,11,22] to an operator version, denoted as the operator Smith algorithm (OSA). By doing so, the iteration maintains quadratic convergence for computing the symmetric solution of CDSEs (1). Our numerical experiments demonstrate that the OSA outperforms existing linearly convergent iterations in terms of both the CPU time and accuracy.
  • To address large-scale problems, we structure the OSA in a low-ranked format with truncation and compression (TC) applied twice per iteration. One TC applies to the factor in the constructed operator (2), while the other applies to the factor of the approximated solution. This approach effectively reduces the column dimensions of the low-rank factors in the symmetric solutions.
  • We redesign the residual computation to suit large-scale computations. We incorporate practical examples from industries [23] to validate the feasibility and effectiveness of the presented low-ranked OSA. This not only demonstrates its practical utility but also lays the groundwork for exploring various large-scale structured problems.
This paper is structured as follows. Section 2 outlines the iterative scheme of the OSA for CDSEs (1), along with a convergence analysis. Comparative results on small-scale problems highlight the superior performance of the OSA compared to other linearly convergent iterations. Section 3 delves into the development of the low-ranked OSA, providing details on truncation and compression techniques, residual computations, as well as complexity and error analysis. In Section 4, we present numerical experiments from industrial applications to illustrate the effectiveness of the introduced low-ranked OSA in real-world scenarios.
Throughout this paper, $I_m$ (or simply I) is the $m \times m$ identity matrix. For a matrix $A \in \mathbb{R}^{N \times N}$, $\rho(A)$ is the spectral radius of A. Unless stated otherwise, the norm $\|\cdot\|$ is the ∞-norm of a matrix. For matrices $A, B \in \mathbb{R}^{N \times N}$, the direct sum $A \oplus B$ denotes the block diagonal matrix $\begin{pmatrix} A & 0 \\ 0 & B \end{pmatrix}$. For symmetric matrices $A, B \in \mathbb{R}^{N \times N}$, we write $A > B$ ($A \ge B$) if $A - B$ is a positive definite (semi-definite) matrix.

2. Operator Smith Iteration for CDSEs

2.1. Iteration Scheme

The operator Smith algorithm (OSA) represents a generalization of the Smith iteration applied to a single matrix equation [4].
For each $i = 1, \ldots, m$, the operator $\mathcal{F}$ in (2) at the k-th iteration is defined as
$$(\mathcal{F})_{i,k}(\cdot) = A_i^\top\,\mathcal{E}_i\Big(A^\top\mathcal{E}\big(\cdots A^\top\mathcal{E}(\cdot)\,A\cdots\big)\,A\Big)\,A_i := A_i^\top\,\mathcal{E}_i\Big((A^\top\mathcal{E})^{2^k-1}(\cdot)\,A^{2^k-1}\Big)\,A_i. \qquad (3)$$
With the initial $X_{i,0} = Q_i$, the OSA for the CDSEs (1) is given by
$$X_{i,k+1} = X_{i,k} + (\mathcal{F})_{i,k}(X_{\cdot,k}),\qquad k = 0, 1, 2, \ldots, \qquad (4)$$
where $X_{\cdot,k}$ represents the k-th iterate, satisfying $\mathcal{E}_i(X_{\cdot,k}) = \sum_{s=1}^{m} p_{i,s}X_{s,k}$.
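For concreteness, the scheme (3)–(4) can be sketched in a few lines. Here, the operator $(\mathcal{F})_{i,k}$ is realized as $2^k$ applications of the one-step map $T(Y)_i = A_i^\top\mathcal{E}_i(Y)A_i$, which is the doubling property formalized in Remark 1 below. This is a minimal dense sketch, not the structured large-scale variant of Section 3:

```python
import numpy as np

def osa_dense(A, Q, P, tol=1e-14, max_iter=30):
    """Minimal dense sketch of the OSA (4): X_{k+1} = X_k + (F)_{i,k}(X_k),
    with (F)_{i,k} realized as 2^k applications of T(Y)_i = A_i^T E_i(Y) A_i.
    A, Q are length-m lists of N x N arrays; P is the m x m probability matrix."""
    m = len(A)
    T = lambda Y: [A[i].T @ sum(P[i, s] * Y[s] for s in range(m)) @ A[i]
                   for i in range(m)]
    X = [Qi.copy() for Qi in Q]                       # X_{i,0} = Q_i
    for k in range(max_iter):
        F = X
        for _ in range(2 ** k):                       # (F)_{i,k}(X_k) = T^(2^k)(X_k)_i
            F = T(F)
        X = [Xi + Fi for Xi, Fi in zip(X, F)]
        TX = T(X)                                     # residual S_c^i(X) of (1)
        res = max(np.linalg.norm(X[i] - Q[i] - TX[i], np.inf) for i in range(m))
        if res <= tol * max(np.linalg.norm(Qi, np.inf) for Qi in Q):
            return X, k + 1
    return X, max_iter
```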
Remark 1.
By the definition of the operator $(\mathcal{F})_{i,k}$, it is not difficult to see that $(\mathcal{F})_{i,k+1}$ doubles the former operator $(\mathcal{F})_{i,k}$ for $k = 0, 1, 2, \ldots$; that is, applying $(\mathcal{F})_{i,k+1}$ to a matrix is equivalent to applying $(\mathcal{F})_{i,k}$ twice to that matrix. To illustrate, consider m = 2. For $Q = (Q_1, Q_2)$, the operator $(\mathcal{F})_{i,0}$ on Q (i.e., k = 0 in (3)) is
$$(\mathcal{F})_{i,0}(Q) = A_i^\top\,\mathcal{E}_i(Q)\,A_i = A_i^\top\big(p_{i1}Q_1 + p_{i2}Q_2\big)A_i.$$
Similarly, the operator $(\mathcal{F})_{i,1}$ on Q (i.e., k = 1 in (3)) takes the form
$$\begin{aligned} (\mathcal{F})_{i,1}(Q) &= A_i^\top\,\mathcal{E}_i\big(A^\top\mathcal{E}(Q)\,A\big)\,A_i = A_i^\top\big(p_{i1}A_1^\top\mathcal{E}_1(Q)A_1 + p_{i2}A_2^\top\mathcal{E}_2(Q)A_2\big)A_i \\ &= A_i^\top\big(p_{i1}A_1^\top(p_{11}Q_1 + p_{12}Q_2)A_1 + p_{i2}A_2^\top(p_{21}Q_1 + p_{22}Q_2)A_2\big)A_i. \end{aligned}$$
Thus, the effect of $(\mathcal{F})_{i,1}(Q)$ is equivalent to that of $(\mathcal{F})_{i,0}\big((\mathcal{F})_{\cdot,0}(Q)\big)$, demonstrating that $(\mathcal{F})_{i,1}$ effectively doubles $(\mathcal{F})_{i,0}$. This doubling property extends the concept of the Smith iteration for a single equation [4,11]. In this sense, (4) is referred to as the OSA.
The following proposition gives the concrete form of $X_{i,k}$ $(i = 1, \ldots, m)$ in the OSA (4):
Proposition 1.
Let $\mathcal{E}_i(Q) = \sum_{s=1}^{m} p_{i,s}Q_s$. The k-th iterate $X_{i,k}$ generated by (4) admits the representation
$$X_{i,k} = Q_i + A_i^\top\,\mathcal{E}_i\Big(\sum_{j=0}^{2^k-2}(A^\top\mathcal{E})^j(Q)\,A^j\Big)\,A_i \qquad (5)$$
for $i = 1, \ldots, m$.
Proof. 
We prove (5) by induction. It is obvious from the OSA that $X_{i,1} = X_{i,0} + \mathcal{F}_{i,0}(Q) = Q_i + A_i^\top\mathcal{E}_i(Q)A_i$, so (5) holds for k = 1.
Assume that (5) holds for k = l. Then, one has
$$\begin{aligned} X_{i,l+1} &= X_{i,l} + \mathcal{F}_{i,l}(X_{\cdot,l}) = X_{i,l} + A_i^\top\,\mathcal{E}_i\Big((A^\top\mathcal{E})^{2^l-1}(X_{\cdot,l})\,A^{2^l-1}\Big)\,A_i \\ &= X_{i,l} + A_i^\top\,\mathcal{E}_i\Big((A^\top\mathcal{E})^{2^l-1}\Big[\sum_{s=1}^{m}p_{\cdot,s}\Big(Q_s + A_s^\top\mathcal{E}_s\Big(\sum_{j=0}^{2^l-2}(A^\top\mathcal{E})^j(Q)\,A^j\Big)A_s\Big)\Big]\,A^{2^l-1}\Big)\,A_i \\ &= X_{i,l} + A_i^\top\,\mathcal{E}_i\Big(\sum_{j=2^l-1}^{2^{l+1}-2}(A^\top\mathcal{E})^j(Q)\,A^j\Big)\,A_i = Q_i + A_i^\top\,\mathcal{E}_i\Big(\sum_{j=0}^{2^{l+1}-2}(A^\top\mathcal{E})^j(Q)\,A^j\Big)\,A_i, \end{aligned}$$
indicating that (5) is true for k = l + 1. □
To obtain the convergence of the OSA, we further assume that all $A_i$, $i = 1, \ldots, m$, are d-stable, i.e.,
$$\rho(A_i) < 1.$$
The following theorem concludes the convergence of the OSA:
Theorem 1.
Let $\rho := \max_{1 \le i \le m}\rho(A_i)$ and $p := \max_{1 \le i,j \le m} p_{ij}$ be such that $2p\rho^2 < 1$. Then, the sequence $\{X_{i,k}\}$ generated by (4) converges to the solution
$$X_{i,\infty} = Q_i + A_i^\top\,\mathcal{E}_i\Big(\sum_{j=0}^{\infty}(A^\top\mathcal{E})^j(Q)\,A^j\Big)\,A_i \qquad (6)$$
of the CDSEs when each $A_i$ is d-stable. Moreover, one has
$$\|X_{i,k} - X_{i,\infty}\| \le \frac{(2p\rho^2)^{2^k}}{1 - 2p\rho^2}\,\bar{Q},$$
where $\bar{Q} := \max_{1 \le i \le m}\|Q_i\|$.
Proof. 
It follows from Proposition 1 that the solution of the CDSEs has the form (6) when the assumption holds. Subtracting (5) from (6) and taking norms, one then has
$$\|X_{i,k} - X_{i,\infty}\| = \Big\|A_i^\top\,\mathcal{E}_i\Big(\sum_{j=2^k-1}^{\infty}(A^\top\mathcal{E})^j(Q)\,A^j\Big)\,A_i\Big\| \le (2p\rho^2)^{2^k}\big(1 + 2p\rho^2 + (2p\rho^2)^2 + \cdots\big)\,\bar{Q} = \frac{(2p\rho^2)^{2^k}}{1 - 2p\rho^2}\,\bar{Q}.$$
 □
Remark 2.
It is evident from Theorem 1 that the OSA admits a quadratic convergence rate when $2p\rho^2 < 1$. This highlights its superiority over the prevailing linearly convergent iterations [8,19,21,24] in both accuracy and CPU time, as elaborated in the next subsection.

2.2. Examples

In this subsection, we present several examples that highlight the superior performance of the OSA compared to linearly convergent iterations [8,19,21,24]. Notably, the iteration method outlined in [8] is identical to the one in [19]. Additionally, other linearly convergent iterations exhibit similar numerical behaviors. Therefore, for accuracy and CPU time comparisons, we select the iteration method from [8,19], referred to as “FIX” in this subsection. It is important to note that the discrete-time Lyapunov equation in “FIX” was solved using the built-in function “dlyap” in Matlab 2019a [25].
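For reference, one FIX variant can be sketched as below, assuming the Gauss–Seidel splitting described in Section 1: each sweep absorbs the diagonal term $p_{ii}A_i^\top X_iA_i$ into a single discrete-time Stein solve (the role played by dlyap in our runs), here performed with SciPy's solve_discrete_lyapunov. The exact schemes of [8,19] may differ in details:

```python
import numpy as np
from scipy.linalg import solve_discrete_lyapunov

def fix_iteration(A, P, Q, sweeps=100, tol=1e-13):
    """Hedged sketch of the linearly convergent FIX scheme [8,19]: for each mode i,
    solve X_i = Q_i + p_ii A_i^T X_i A_i + A_i^T (sum_{s != i} p_is X_s) A_i,
    reusing the freshly updated X_s within a sweep (Gauss-Seidel flavor)."""
    m = len(A)
    X = [Qi.copy() for Qi in Q]
    for _ in range(sweeps):
        diff = 0.0
        for i in range(m):
            W = sum(P[i, s] * X[s] for s in range(m) if s != i)
            rhs = Q[i] + A[i].T @ W @ A[i]
            # solve_discrete_lyapunov(a, q) solves a X a^T - X + q = 0
            Xi_new = solve_discrete_lyapunov(np.sqrt(P[i, i]) * A[i].T, rhs)
            diff = max(diff, np.linalg.norm(Xi_new - X[i], np.inf))
            X[i] = Xi_new
        if diff <= tol:
            break
    return X
```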
Example 1.
This example is a slight modification of the all-pass SISO system [26], whose controllability and observability Gramians are quasi-inverse to each other, i.e., $W_c^i W_o^i = \sigma_i I$ for some $\sigma_i > 0$. This property indicates that the system has a single Hankel singular value with multiplicity equal to the system's order. The derived system matrices are as follows:
$$A_1 = 0.4\big[(I + G_1)^{-1}\bar{A}_1\big],\quad A_2 = 0.5\big[(I + G_2)^{-1}\bar{A}_2\big],\quad Q_1 = L_1^Q(L_1^Q)^\top,\quad Q_2 = L_2^Q(L_2^Q)^\top,$$
where $G_1$ and $G_2$ are matrices with zero elements except for last rows $0.1g_1$ and $0.3g_2$, respectively (both $g_1$ and $g_2$ are random row vectors with elements from the interval (0, 1)); $\bar{A}_1$ and $\bar{A}_2$ are both the tri-diagonal matrix tridiag(1, 0, 1) but with $\bar{A}_1(1,1) = 0.5$ and $\bar{A}_2(1,1) = 0.8$, respectively; $(L_1^Q)^\top = [1, 0, \ldots, 0, 1]$; and $(L_2^Q)^\top = [0, 1, 0, \ldots, 0, 1, 0]$. We consider m = 2 and select the probability matrix $\Pi = \begin{pmatrix} 0.26 & 0.74 \\ 0.53 & 0.47 \end{pmatrix}$.
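A sketch of this construction (with a hypothetical seed for the random rows $g_1$, $g_2$, introduced only for reproducibility) reads:

```python
import numpy as np

def example1_matrices(N, seed=0):
    """Build A_1, A_2, Q_1, Q_2 of Example 1.  The seed is an assumption;
    tridiag(1, 0, 1) denotes the tri-diagonal matrix with zero diagonal
    and ones on the sub- and super-diagonals."""
    rng = np.random.default_rng(seed)
    Abar1 = np.diag(np.ones(N - 1), 1) + np.diag(np.ones(N - 1), -1)
    Abar2 = Abar1.copy()
    Abar1[0, 0], Abar2[0, 0] = 0.5, 0.8
    G1 = np.zeros((N, N)); G1[-1, :] = 0.1 * rng.random(N)   # last row 0.1*g1
    G2 = np.zeros((N, N)); G2[-1, :] = 0.3 * rng.random(N)   # last row 0.3*g2
    A1 = 0.4 * np.linalg.solve(np.eye(N) + G1, Abar1)
    A2 = 0.5 * np.linalg.solve(np.eye(N) + G2, Abar2)
    L1 = np.zeros((N, 1)); L1[[0, -1], 0] = 1.0              # [1, 0, ..., 0, 1]^T
    L2 = np.zeros((N, 1)); L2[[1, -2], 0] = 1.0              # [0, 1, 0, ..., 0, 1, 0]^T
    return A1, A2, L1 @ L1.T, L2 @ L2.T
```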
We evaluate the OSA and FIX for dimensions N = 400 and N = 800 and present their numerical behaviors in Table 1. Here, $\delta t_k$ and $t_k$ record the CPU time of the current iteration and of the accumulated iterations, respectively. The Rel_Res column exhibits the relative residual of the CDSEs in each iteration. From Table 1, it is evident that the OSA achieves equation residuals of $O(10^{-16})$ within five iterations for both dimensions, and the CPU time required for the OSA is significantly less than that required for FIX. Conversely, FIX only maintains equation residuals at the level of $O(10^{-13})$ even after 11 iterations. The symbol "∗" in the table indicates that resuming the iteration cannot further reduce the equation residuals, at which point FIX is terminated.
To visualize the residuals of each equation (here, m = 2) for the different iteration methods, we plot the history of the equation residuals in Figure 1. Here, "S-Res$i$" and "F-Res$i$" (i = 1, 2) represent the residuals of the equations obtained by the OSA and FIX, respectively. The figure illustrates that the OSA exhibits quadratic convergence. Interestingly, although FIX converges rapidly for the second equation, it maintains linear convergence for the first, resulting in an overall linear convergence for FIX.
We further consider modified system matrices
$$A_1 = 0.48\big[(I + G_1)^{-1}\bar{A}_1\big],\qquad A_2 = 0.98\big[(I + G_2)^{-1}\bar{A}_2\big],$$
where $G_1$ and $G_2$ are matrices with zero elements except for last rows $0.6g_1$ and $0.8g_2$, respectively. In this case, the spectral radii of $A_1$ and $A_2$ are 0.96 and 0.95, respectively. We rerun the OSA and FIX, and the obtained results are recorded in Figure 2. From the plot, it can be observed that for N = 400, the OSA requires nine iterations to achieve equation residuals at the $O(10^{-15})$ level, with a total time of 10.17 s. Conversely, FIX, after consuming 51.31 s over 90 iterations, only achieves equation residuals at the scale of $O(10^{-13})$. Further numerical experiments demonstrate that even by increasing the number of iterations, FIX fails to further reduce the residual level. Similar numerical results are obtained for the scale N = 800.
Example 2.
Consider a slight modification of a chemical reaction model governed by a convection–reaction partial differential equation on the unit square [27], given by
$$\frac{\partial x}{\partial t} = \frac{\partial^2 x}{\partial v^2} + \frac{\partial^2 x}{\partial z^2} + 20\frac{\partial x}{\partial z} - 180\,x + f(v, z)\,x(t),$$
where x is a function of time (t), vertical position (v), and horizontal position (z). The boundaries of interest in this problem lie on a square with opposite corners at (0, 0) and (1, 1), and the function $x(t, v, z)$ is zero on these boundaries. This PDE is discretized using centered difference approximations on a grid of $n_v \times n_z$ points, so the dimension of A is the state-space dimension $N = n_v n_z$, resulting in the sparsity pattern of A as
$$A = \begin{pmatrix} 734 & 171 & & 196 & & \\ 9 & 734 & \ddots & & \ddots & \\ & \ddots & \ddots & 171 & & 196 \\ 196 & & 9 & 734 & \ddots & \\ & \ddots & & \ddots & \ddots & 171 \\ & & 196 & & 9 & 734 \end{pmatrix}.$$
Here we take
$$A_1 = 10^{-3}\,\xi_1 A,\qquad A_2 = 10^{-3}\,\xi_2 A,\qquad Q_1 = L_1^Q(L_1^Q)^\top,\qquad Q_2 = L_2^Q(L_2^Q)^\top,$$
where
$$(L_1^Q)^\top = [\underbrace{1, \ldots, 1}_{7}, 0, \ldots, 0, \underbrace{1, \ldots, 1}_{7}],\qquad (L_2^Q)^\top = [\underbrace{0, \ldots, 0}_{7}, \underbrace{1, \ldots, 1}_{7}, 0, \ldots, 0, \underbrace{1, \ldots, 1}_{7}, \underbrace{0, \ldots, 0}_{7}].$$
The parameter m = 2 and the probability matrix $\Pi = \begin{pmatrix} 0.244 & 0.756 \\ 0.342 & 0.658 \end{pmatrix}$.
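Under the banded reading of the reconstructed pattern above (an assumption: diagonal 734, intra-row couplings 171 and 9, and inter-row couplings 196 at offsets $\pm n_z$), the sparse A could be assembled as:

```python
import numpy as np
import scipy.sparse as sp

def example2_A(nv, nz):
    """Assemble the banded matrix A assumed above: diagonal 734, couplings
    171 (super) and 9 (sub) within each grid row, and 196 at offsets +-nz
    between neighboring grid rows.  Boundary handling is an assumption."""
    N = nv * nz
    upper = 171 * np.ones(N - 1)
    lower = 9 * np.ones(N - 1)
    upper[nz - 1::nz] = 0.0          # no coupling across grid-row boundaries
    lower[nz - 1::nz] = 0.0
    vert = 196 * np.ones(N - nz)
    return sp.diags([vert, lower, 734 * np.ones(N), upper, vert],
                    [-nz, -1, 0, 1, nz], format="csr")
```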
Similarly, we applied the OSA and FIX iterations to the CDSEs with dimensions N = 350 and N = 700, and the results are presented in Table 2. It is evident that the OSA achieves equation residuals of $O(10^{-14})$ within five iterations, with the required CPU time approximately one-ninth of that needed for FIX. However, FIX only attains equation residuals of $O(10^{-13})$ after 10 iterations. We also depict the residuals of the two iteration methods for their respective m equations (here m = 2) in Figure 3. The figure illustrates that the OSA exhibits quadratic convergence, while FIX demonstrates only linear convergence.

3. Structured Algorithm for Large-Scale Problems

In numerous practical scenarios [23], the matrix $A_i$ is often sparse, and $Q_i$ is typically of low-ranked structure. Therefore, in this section, we adapt the OSA to a low-ranked format well-suited for large-scale computations.

3.1. Structured Iteration Scheme

Given the initial matrices $X_{i,0} = Q_i = L_i^Q(L_i^Q)^\top$ for $i = 1, \ldots, m$, we show that the iteration (4) can be organized in the following format:
$$X_{i,k} = L_{i,k}^Q K_{i,k}^Q (L_{i,k}^Q)^\top,\qquad \mathcal{F}_{i,k}(X_{\cdot,k}) = L_{i,k}^{F_{2^k}} K_{i,k}^{F_{2^k}} (L_{i,k}^{F_{2^k}})^\top,\qquad k = 0, 1, 2, \ldots, \qquad (7)$$
where $L_{i,k}^{F_{2^k}}$, $K_{i,k}^{F_{2^k}}$, $L_{i,k}^Q$, and $K_{i,k}^Q$ take the forms given in the following proposition.
Proposition 2.
Let $L_i^Q \in \mathbb{R}^{N \times l_i}$ in $X_{i,0}$ be the initial factor with $l_i \ll N$. The sequences $\mathcal{F}_{i,k}(X_{\cdot,k})$ and $X_{i,k}$ generated by (4) are factorized as in (7), where
$$L_{i,k}^{F_{2^k}} = A_i^\top\big[L_{1,k}^{F_{2^k-1}}, \ldots, L_{m,k}^{F_{2^k-1}}\big],\qquad K_{i,k}^{F_{2^k}} = p_{i,1}K_{1,k}^{F_{2^k-1}} \oplus \cdots \oplus p_{i,m}K_{m,k}^{F_{2^k-1}}, \qquad (8)$$
$$L_{i,k}^Q = \big[L_{i,k-1}^Q,\ L_{i,k-1}^{F_{2^{k-1}}}\big],\qquad K_{i,k}^Q = K_{i,k-1}^Q \oplus K_{i,k-1}^{F_{2^{k-1}}}, \qquad (9)$$
and
$$L_{i,k}^{F_1} = A_i^\top\big[L_{1,k}^Q, \ldots, L_{m,k}^Q\big],\qquad K_{i,k}^{F_1} = p_{i,1}K_{1,k}^Q \oplus \cdots \oplus p_{i,m}K_{m,k}^Q,\qquad K_{i,0}^Q = I,\qquad L_{i,0}^Q = L_i^Q.$$
Proof. 
Given the initial matrix $X_{i,0} = Q_i = L_i^Q(L_i^Q)^\top = L_{i,0}^Q K_{i,0}^Q (L_{i,0}^Q)^\top$, it follows from (3) that
$$\mathcal{F}_{i,0}(X_{\cdot,0}) = A_i^\top\,\mathcal{E}_i(X_{\cdot,0})\,A_i = A_i^\top\Big(\sum_{s=1}^{m}p_{i,s}Q_s\Big)A_i = L_{i,0}^{F_1}K_{i,0}^{F_1}(L_{i,0}^{F_1})^\top.$$
So (7) holds for k = 0. Assume (7) is true for k = l. It follows from the iteration scheme (4) that
$$X_{i,l+1} = L_{i,l}^Q K_{i,l}^Q (L_{i,l}^Q)^\top + L_{i,l}^{F_{2^l}} K_{i,l}^{F_{2^l}} (L_{i,l}^{F_{2^l}})^\top = L_{i,l+1}^Q K_{i,l+1}^Q (L_{i,l+1}^Q)^\top.$$
Recalling (3) again, it follows that
$$\begin{aligned} \mathcal{F}_{i,l+1}(X_{\cdot,l+1}) &= A_i^\top\,\mathcal{E}_i\Big((A^\top\mathcal{E})^{2^{l+1}-1}(X_{\cdot,l+1})\,A^{2^{l+1}-1}\Big)\,A_i \\ &= A_i^\top\,\mathcal{E}_i\Big((A^\top\mathcal{E})^{2^{l+1}-2}\Big(\sum_{s=1}^{m}p_{\cdot,s}\,L_{s,l+1}^{F_1}K_{s,l+1}^{F_1}(L_{s,l+1}^{F_1})^\top\Big)\,A^{2^{l+1}-2}\Big)\,A_i \\ &= A_i^\top\,\mathcal{E}_i\Big((A^\top\mathcal{E})^{2^{l+1}-3}\Big(\sum_{s=1}^{m}p_{\cdot,s}\,L_{s,l+1}^{F_2}K_{s,l+1}^{F_2}(L_{s,l+1}^{F_2})^\top\Big)\,A^{2^{l+1}-3}\Big)\,A_i \\ &= \cdots = A_i^\top\,\mathcal{E}_i\Big(\sum_{s=1}^{m}p_{\cdot,s}\,L_{s,l+1}^{F_{2^{l+1}-1}}K_{s,l+1}^{F_{2^{l+1}-1}}(L_{s,l+1}^{F_{2^{l+1}-1}})^\top\Big)\,A_i = L_{i,l+1}^{F_{2^{l+1}}}K_{i,l+1}^{F_{2^{l+1}}}(L_{i,l+1}^{F_{2^{l+1}}})^\top. \end{aligned}$$
Then (7) holds true for k = l + 1. The proof is complete. □
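A compact sketch of one factored sweep (8)–(9), with an optional hook for the TC of Section 3.2 applied to the intermediate F factors, might read as follows (the function name and the tc hook are illustrative, not from the paper):

```python
import numpy as np
from scipy.linalg import block_diag

def osa_lr_iteration(A, P, LQ, KQ, k, tc=None):
    """One low-rank OSA sweep (Proposition 2): run the recursion (8) 2^k times
    starting from the F_1 factors, optionally truncating/compressing each
    intermediate factor via tc, then append the result as in (9)."""
    m = len(A)
    LF, KF = LQ, KQ
    for _ in range(2 ** k):                       # F_1 -> F_2 -> ... -> F_{2^k}
        LF_new = [A[i].T @ np.hstack(LF) for i in range(m)]
        KF_new = [block_diag(*[P[i, s] * KF[s] for s in range(m)])
                  for i in range(m)]
        if tc is not None:                        # TC on the F factors (Steps 4-5)
            LF_new, KF_new = map(list, zip(*[tc(L, K)
                                             for L, K in zip(LF_new, KF_new)]))
        LF, KF = LF_new, KF_new
    LQ_next = [np.hstack([LQ[i], LF[i]]) for i in range(m)]   # update (9)
    KQ_next = [block_diag(KQ[i], KF[i]) for i in range(m)]
    return LQ_next, KQ_next, LF, KF
```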

3.2. Truncation and Compression

It is evident that the number of columns of $L_{i,k}^{F_{2^k}}$ at the k-th iteration scales approximately as $O(m^{2^k-1}l_i)$, where $l_i$ is the initial column number of the factor $L_i^Q$. Consequently, we implement truncation and compression (TC) to reduce the column numbers of the low-rank factors [11,28]. Notably, our algorithm employs the TC technique twice within one iteration: once for $L_{i,k}^{F_{2^k}}$ and once for $L_{i,k}^Q$.
For simplicity of notation, we omit the subscript i of the low-rank factors. Recalling (8) and (9), we perform TC on $L_k^{F_{2^k}}$ and $L_k^Q$ using QR decompositions with column pivoting. Then, one has
$$L_k^{F_{2^k}}P^{F_{2^k}} = \big[Q^{F_{2^k}}\ \tilde{Q}^{F_{2^k}}\big]\begin{pmatrix} U_1^{F_{2^k}} & U_2^{F_{2^k}} \\ 0 & \tilde{U}^{F_{2^k}} \end{pmatrix},\quad \|\tilde{U}^{F_{2^k}}\| < u_0^f\,\tau;\qquad L_k^Q P_k^Q = \big[Q_k^Q\ \tilde{Q}_k^Q\big]\begin{pmatrix} U_{k,1}^Q & U_{k,2}^Q \\ 0 & \tilde{U}_k^Q \end{pmatrix},\quad \|\tilde{U}_k^Q\| < u_0^q\,\tau, \qquad (10)$$
where $P^{F_{2^k}}$ and $P_k^Q$ are permutation matrices ensuring that the diagonal elements of the decomposed block triangular matrices decrease in absolute value. Additionally, $u_0^f$ and $u_0^q$ are constants, and τ is a small tolerance controlling the TC. Let $m_f^{2^k}$ and $m_k^q$ denote the respective column numbers of $L_k^{F_{2^k}}$ and $L_k^Q$, bounded above by a given $m_{\max}$. Then, their ranks satisfy
$$r_f^{2^k} := \operatorname{rank}\big(L_k^{F_{2^k}}\big) \le m_f^{2^k} \le m_{\max}\qquad\text{and}\qquad r_k^q := \operatorname{rank}\big(L_k^Q\big) \le m_k^q \le m_{\max}$$
with $m_{\max} \ll N$. The truncated factors are still denoted by
$$L_k^{F_{2^k}}P^{F_{2^k}} \approx Q^{F_{2^k}}\big[U_1^{F_{2^k}}\ U_2^{F_{2^k}}\big] := Q^{F_{2^k}}U^{F_{2^k}}, \qquad (11)$$
$$L_k^Q P_k^Q \approx Q_k^Q\big[U_{k,1}^Q\ U_{k,2}^Q\big] := Q_k^Q U_k^Q, \qquad (12)$$
respectively. The compressed kernels are denoted by
$$K_k^{F_{2^k}} := U^{F_{2^k}}\big(P^{F_{2^k}}\big)^\top K_k^{F_{2^k}}\Big(U^{F_{2^k}}\big(P^{F_{2^k}}\big)^\top\Big)^\top, \qquad (13)$$
$$K_k^Q := U_k^Q\big(P_k^Q\big)^\top K_k^Q\Big(U_k^Q\big(P_k^Q\big)^\top\Big)^\top, \qquad (14)$$
where $K_k^{F_{2^k}}$ and $K_k^Q$ represent the kernels in (8) and (9) with the subscript i omitted.
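In code, one TC step can be sketched with SciPy's column-pivoted QR; the relative drop tolerance below plays the role of $u_0\tau$ in (10):

```python
import numpy as np
from scipy.linalg import qr

def truncate_compress(L, K, tol):
    """TC of a factor pair (L, K), keeping L K L^T fixed up to O(tol):
    column-pivoted QR  L P = Q U, drop trailing rows of U whose diagonal
    falls below tol, and fold U P^T into the kernel as in (10)-(14)."""
    Qf, U, piv = qr(L, mode="economic", pivoting=True)   # L[:, piv] = Qf @ U
    d = np.abs(np.diag(U))
    r = max(1, int((d >= tol * max(d[0], np.finfo(float).tiny)).sum()))
    UP = np.zeros_like(U)
    UP[:, piv] = U                                       # UP = U P^T
    return Qf[:, :r], UP[:r] @ K @ UP[:r].T              # truncated factor, kernel
```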

3.3. Computation of Residuals

Given the initial matrix $X_{i,0} = Q_i = L_i^Q(L_i^Q)^\top$ for $i = 1, \ldots, m$, the initial residual of the CDSEs is
$$\mathcal{S}_c^0(X_{\cdot,0}) = X_{i,0} - A_i^\top\,\mathcal{E}_i(X_{\cdot,0})\,A_i - Q_i = L_{i,0}^R K_{i,0}^R (L_{i,0}^R)^\top, \qquad (15)$$
where $L_{i,0}^R = A_i^\top\big[L_{1,0}^Q, \ldots, L_{m,0}^Q\big]$ and $K_{i,0}^R = -\big(p_{i,1}I \oplus \cdots \oplus p_{i,m}I\big)$.
When the k-th iterate $X_{i,k} = L_{i,k}^Q K_{i,k}^Q (L_{i,k}^Q)^\top$ is available, the residual of the CDSEs has the decomposition
$$\mathcal{S}_c^k(X_{\cdot,k}) = X_{i,k} - A_i^\top\,\mathcal{E}_i(X_{\cdot,k})\,A_i - Q_i = L_{i,k}^R K_{i,k}^R (L_{i,k}^R)^\top \qquad (16)$$
with
$$L_{i,k}^R = \big[L_{i,0}^Q,\ L_{i,k}^Q,\ L_{i,k}^{F_{2^k}}\big],\qquad K_{i,k}^R = (-I) \oplus K_{i,k}^Q \oplus \big(-K_{i,k}^{F_{2^k}}\big). \qquad (17)$$
Similarly, we also impose the truncation and compression on $L_{i,k}^R$, $k \ge 0$, i.e., implementing the QR decomposition with pivoting:
$$L_{i,k}^R P_{i,k}^R = \big[Q_{i,k}^R\ \tilde{Q}_{i,k}^R\big]\begin{pmatrix} U_{i,k,1}^R & U_{i,k,2}^R \\ 0 & \tilde{U}_{i,k}^R \end{pmatrix},\qquad \|\tilde{U}_{i,k}^R\| < u_0^r\,\tau, \qquad (18)$$
where $P_{i,k}^R$ is a pivoting matrix and $u_0^r$ is some constant. Let $U_{i,k}^R = \big[U_{i,k,1}^R\ U_{i,k,2}^R\big]$. The corresponding kernel of the residual is denoted by
$$K_{i,k}^R := U_{i,k}^R\big(P_{i,k}^R\big)^\top K_{i,k}^R\Big(U_{i,k}^R\big(P_{i,k}^R\big)^\top\Big)^\top,$$
and the terminating condition of the whole algorithm is chosen to be
$$\mathrm{Rel\_Res} = \max_i \frac{\big\|K_{i,k}^R\big\|}{\big\|K_{i,0}^R\big\|} \le \epsilon \qquad (19)$$
with ϵ being the tolerance.
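The stopping rule (19) can thus be evaluated entirely in factored form. The sketch below factors the residual directly from its definition, using the $F_1$-type factor $A_i^\top[L_{1,k}^Q, \ldots, L_{m,k}^Q]$ for the middle term; the kernel signs are those forced by the minus signs in (16), and the routine reuses truncate_compress from above:

```python
import numpy as np
from scipy.linalg import block_diag

def factored_residual_norms(A, P, LQ0, LQ, KQ, tol=1e-16):
    """Per-equation norms ||K_{i,k}^R|| for (19): factor
    S_c^i(X_k) = X_{i,k} - A_i^T E_i(X_k) A_i - Q_i as L^R K^R (L^R)^T
    and compress it by TC, as organized in (15)-(18)."""
    m = len(A)
    norms = []
    for i in range(m):
        LF1 = A[i].T @ np.hstack(LQ)              # factor of A_i^T E_i(X_k) A_i
        KF1 = block_diag(*[P[i, s] * KQ[s] for s in range(m)])
        LR = np.hstack([LQ0[i], LQ[i], LF1])
        KR = block_diag(-np.eye(LQ0[i].shape[1]), KQ[i], -KF1)
        _, KRc = truncate_compress(LR, KR, tol)   # compress as in (18)
        norms.append(np.linalg.norm(KRc, np.inf))
    return norms
```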

3.4. Large-Scale Algorithm and Complexity

The OSA with a low-rank structure and equipped with TC is summarized in Algorithm 1 (OSA_lr) below.
To show the computational complexity of the algorithm OSA_lr, we assume that all matrices $A_i$, $i = 1, \ldots, m$, are sufficiently sparse. This allows us to consider the costs of both forming the product $A_i^\top B$ and solving the equation $A_iX = B$ to be within $cN$ floating-point operations (flops), where B is an $N \times m_b$ matrix with $m_b \ll N$ and c is a constant. Additionally, the numbers of truncated columns of $L_{i,k}^{F_{2^k-1}}$ and $L_{i,k}^Q$ for all $i = 1, \ldots, m$ are denoted by $m_k^f$ and $m_k^q$, respectively. The flops and memory of the k-th iteration are summarized in Table 3 below.
Algorithm 1: Algorithm OSA_lr. Solve large-scale CDSEs with low-ranked $Q_i$
Inputs: Sparse matrices $A_i$, low-rank factors $L_i^Q$ for $i = 1, \ldots, m$, probability matrix $\Pi \in \mathbb{R}^{m \times m}$, truncation tolerance $\tau$, upper bound $m_{\max}$, and the iteration tolerance $\epsilon$.
Outputs: Low-ranked matrix $L_i^X$ and the kernel matrix $K_i^X$ with the solution $X_i \approx L_i^X K_i^X (L_i^X)^\top$.
1. Set $L_{i,0}^Q = L_i^Q$ and $K_{i,0}^Q = I$ for $i = 1, \ldots, m$.
2. For $k = 1, \ldots,$ until convergence, do
3.   Compute $K_{i,k}^{F_{2^k}}$ and $L_{i,k}^{F_{2^k}}$ as in (8).
4.   Truncate and compress $L_{i,k}^{F_{2^k}}$ as in (10) with accuracy $u_0^f\tau$.
5.   Construct the compressed low-ranked factor $L_{i,k}^{F_{2^k}}$ and kernel $K_{i,k}^{F_{2^k}}$ as in (11) and (13).
6.   Compute $K_{i,k}^Q$ and $L_{i,k}^Q$ as in (9).
7.   Truncate and compress $L_{i,k}^Q$ as in (10) with accuracy $u_0^q\tau$.
8.   Construct the compressed low-ranked factor $L_{i,k}^Q$ and kernel $K_{i,k}^Q$ as in (12) and (14).
9.   Evaluate the relative residual Rel_Res in (19).
10.  If Rel_Res < $\epsilon$, break, End.
11.  $k := k + 1$.
12. End (For)
13. Output $K_i^X := K_{i,k}^Q$, $L_i^X := L_{i,k}^Q$.
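Assembled from the sketches above, a driver for Algorithm 1 might read as follows (again a hedged sketch: the TC tolerances, the $m_{\max}$ cap, and the iteration bookkeeping are simplified):

```python
import numpy as np

def osa_lr(A, P, LQ0, tau=1e-16, eps=1e-13, max_iter=20):
    """Sketch of Algorithm 1 (OSA_lr): low-rank OSA sweeps with TC on both the
    F factors (inside the sweep) and the solution factors, stopping on (19)."""
    tc = lambda L, K: truncate_compress(L, K, tau)
    LQ = [L.copy() for L in LQ0]
    KQ = [np.eye(L.shape[1]) for L in LQ0]
    base = factored_residual_norms(A, P, LQ0, LQ, KQ)     # ||K_{i,0}^R|| in (19)
    for k in range(max_iter):
        LQ, KQ, _, _ = osa_lr_iteration(A, P, LQ, KQ, k, tc=tc)   # Steps 3-6
        pairs = [tc(L, K) for L, K in zip(LQ, KQ)]                # Steps 7-8
        LQ, KQ = [p[0] for p in pairs], [p[1] for p in pairs]
        res = factored_residual_norms(A, P, LQ0, LQ, KQ)          # Step 9
        if max(r / b for r, b in zip(res, base)) <= eps:
            return LQ, KQ, k + 1
    return LQ, KQ, max_iter      # X_i ~ LQ[i] @ KQ[i] @ LQ[i].T
```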

3.5. Error Analysis

In this subsection, we conduct the error analysis of OSA_lr. For $i = 1, \ldots, m$, let
$$\delta A_i = A_i - \hat{A}_i,\qquad \delta X_{i,k} = X_{i,k} - \hat{X}_{i,k} \qquad (20)$$
be the errors caused by roundoff or iteration. Here, $A_i$ and $X_{i,k}$ are the true matrices, while $\hat{A}_i$ and $\hat{X}_{i,k}$ are the matrices in the practical iteration. The following lemma describes the error propagation of the operator.
Lemma 1.
Given errors $\delta X_{i,k}$ and $\delta A_i$ as in (20), the error of the operator satisfies
$$\|\delta\mathcal{F}_{i,l}(X_k)\| \le (mp)^{2^l}\Big(\alpha^{2^{l+1}}\|\delta X_k\| + 2^{l+1}q_k\,\alpha^{2^{l+1}-1}\|\delta A\|\Big)\qquad \text{for}\ l \le k,$$
where $\alpha = \max_i\{\|A_i\|\}$, $q_k = \max_i\{\|X_{i,k}\|\}$, $\|\delta A\| = \max_i\{\|\delta A_i\|\}$, $\|\delta X_k\| = \max_i\{\|\delta X_{i,k}\|\}$, and $p = \max_{i,j}\{p_{i,j}\}$.
Proof. 
By retaining only first-order terms in $\delta A_i$ and $\delta X_{i,k}$, it follows from the definition of $\mathcal{F}_{i,l}(X_k)$ in (3) that the practical operator satisfies
$$\hat{\mathcal{F}}_{i,l}(X_k) = (A_i + \delta A_i)^\top\Big\{m\,p_{\cdot,\cdot}(A_j + \delta A_j)^\top\big\{\cdots(A_s + \delta A_s)^\top(X_{\cdot,k} + \delta X_{\cdot,k})(A_s + \delta A_s)\cdots\big\}(A_j + \delta A_j)\Big\}(A_i + \delta A_i)$$
$$\approx (A_i + \delta A_i)^\top\Big\{\cdots\Big[\mathcal{F}_{i,0}(X_k) + (mp)^2\big\langle A_s^\top\,\delta X_{\cdot,k}\,A_s + (\delta A_s)^\top X_{\cdot,k}\,A_s + A_s^\top X_{\cdot,k}\,\delta A_s\big\rangle\Big]\cdots\Big\}(A_i + \delta A_i) \qquad (21)$$
$$\approx (A_i + \delta A_i)^\top\Big\{\cdots\Big[\mathcal{F}_{i,1}(X_k) + (mp)^3\big\langle (A_sA_t)^\top\delta X_{\cdot,k}\,A_sA_t + (\delta A_sA_t)^\top X_{\cdot,k}\,A_sA_t + (A_sA_t)^\top X_{\cdot,k}\,\delta A_sA_t + (A_s\,\delta A_t)^\top X_{\cdot,k}\,A_sA_t + \cdots\big\rangle\Big]\cdots\Big\}(A_i + \delta A_i). \qquad (22)$$
Note that the error terms in $\langle\cdot\rangle$ in (21) and (22) have the respective upper bounds $(mp)^2\big(\alpha^2\|\delta X_k\| + 2q_k\alpha\|\delta A\|\big)$ and $(mp)^3\big(\alpha^4\|\delta X_k\| + 4q_k\alpha^3\|\delta A\|\big)$. After multiplying by $(A_i + \delta A_i)^\top$ on the left and $A_i + \delta A_i$ on the right in the outermost layer, these bounds become $(mp)^2\big(\alpha^4\|\delta X_k\| + 4q_k\alpha^3\|\delta A\|\big)$ and $(mp)^3\big(\alpha^6\|\delta X_k\| + 6q_k\alpha^5\|\delta A\|\big)$, respectively. Then, by induction, it is not difficult to see that the final error bound is
$$(mp)^{2^l}\Big(\alpha^{2^{l+1}}\|\delta X_k\| + 2^{l+1}q_k\,\alpha^{2^{l+1}-1}\|\delta A\|\Big).$$
 □
We have the following error bound at the $(k+1)$-th iteration.
Theorem 2.
Given errors $\delta X_k$ and $\delta A_i$ as in (20), the error at the $(k+1)$-th iteration has the bound
$$\|\delta X_{k+1}\| \le \big(1 + (mp\alpha^2)^{2^k}\big)\|\delta X_k\| + 2^{k+1}(mp)^{2^k}\alpha^{2^{k+1}-1}q_k\|\delta A\| + O(\tau), \qquad (23)$$
where m, p, α, and $q_k$ are defined as in Lemma 1 and τ is the error of the TC described in Section 3.2.
Proof. 
It follows from the iteration format (4) that the error at the $(k+1)$-th iteration has the upper bound
$$\|\delta X_{i,k+1}\| \le \|\delta X_{i,k}\| + \|\delta\mathcal{F}_{i,k}(X_k)\| + O(\tau),$$
where O(τ) represents the truncation and compression error on $\mathcal{F}_{i,k}(X_k)$ and $X_{i,k}$. By taking l = k in Lemma 1, one has
$$\|\delta X_{i,k+1}\| \le \|\delta X_k\| + (mp)^{2^k}\alpha^{2^{k+1}}\|\delta X_k\| + 2^{k+1}(mp)^{2^k}q_k\,\alpha^{2^{k+1}-1}\|\delta A\| + O(\tau),$$
and (23) holds true. □

4. Numerical Examples

In this section, we illustrate the effectiveness of OSA_lr in computing symmetric solutions of large-scale CDSEs (1) through practical examples [23,26,27,30,31,32,33]. The algorithm OSA_lr was coded in MATLAB 2019a on a 64-bit PC running Windows 10. The computer is equipped with a 3.0 GHz Intel Core i5 processor with six cores and six threads, 32 GB RAM, and a machine unit roundoff of eps = $2.22 \times 10^{-16}$. The maximum allowed column number of the low-ranked factors in OSA_lr is bounded by $m_{\max}$ = 1000, and the tolerance for the TC of columns is set to $\tau = 10^{-16}$. In our experiments, we also attempted using $N \cdot$ eps as the TC tolerance τ but found it had no impact on the computation accuracy. The residuals of the equations are calculated as in (19) with a termination tolerance of $\epsilon = 10^{-13}$. It is noteworthy that we no longer compare with the linearly convergent iterative methods of Section 2, as the computational complexity of those algorithms is $O(N^3)$ per iteration.
Example 3.
We still employ the modification of the all-pass SISO system [26] from Section 2, but here we take N = 12,000. We list the calculated results of OSA_lr in Table 4, where the columns $\delta t_k$ and $t_k$ record the CPU time for each iteration and for the cumulative iterations, respectively. The Res$i$ and Rel_Res$i$ (i = 1, 2) columns provide the absolute residual $\|K_{i,k}^R\|$ and the relative residual $\|K_{i,k}^R\|/\|K_{i,0}^R\|$ computed by OSA_lr at each iteration, respectively. The $m_k^{Q_i}$ (i = 1, 2) columns indicate the column number of the low-ranked factor $L_{i,k}^Q$.
From the table, it is evident that OSA_lr achieves the prescribed equation residual level within five iterations, and the residual history demonstrates the quadratic convergence of the algorithm. The column count $m_k^{Q_i}$ of the low-ranked factor $L_{i,k}^Q$ more than doubles with each iteration, resulting in an exponential increase in the CPU time; particularly significant growth occurs during the third and fourth iterations. In the numerical experiments, we observed that this substantial increase in the CPU time lies primarily in the residual computation, specifically in Step 9 of OSA_lr. Hence, further investigation into the efficient evaluation of the equation residual is a crucial consideration for large-scale computations. We also plot the residual history of OSA_lr in Figure 7, where R-Res_i (i = 1, 2) denotes the relative residual of the i-th equation.
Example 4.
We continue to examine CDSEs from Example 2 [27,30], but with larger scales N = 21,000, 28,000, and 35,000. The derived results of OSA_lr are presented in Table 5.
The symbols $\delta t_k$, $t_k$, Res$i$, Rel_Res$i$, and $m_k^{Q_i}$ (i = 1, 2) are defined as in Example 3. In all experiments, the equation residuals (in $\log_{10}$) reached the predetermined level by the sixth iteration. For equations of different dimensions, the Res$i$ (i = 1, 2) columns indicate that OSA_lr converges nearly quadratically, except in the final two iterations. The $m_k^{Q_i}$ columns reveal that, in the fifth and sixth iterations, the column number of the factor $L_{i,k}^Q$ increased by nearly five times and six times, respectively, resulting in a substantial increase in the CPU time during the last two iterations. A further detailed analysis indicated that this increased time primarily came from the computation of the equation residuals in the final two steps. The performance of OSA_lr on the residual history with N = 35,000 is plotted in Figure 7, where R-Res_i (i = 1, 2) denotes the relative residual of the i-th equation.
Example 5.
Consider the thermal convective flow control systems in [23,30,31]. These problems involve a flow region with a prescribed velocity profile, incorporating convective transport. Achieving solution accuracy with upwind finite element schemes typically requires a considerable number of elements for a physically meaningful simulation. In the illustrated scenario (see the left side of Figure 4), a 3D model of a chip is subjected to forced convection, using the tetrahedral element type SOLID70 described in [34]. Both the Dirichlet boundary conditions and the initial conditions are set to 0.
We consider the case that the fluid speed is zero and the discretization matrices are symmetric. The system matrices are
$$A_1 = (I + r_1BB^\top)^{-1}A,\qquad A_2 = (I + r_2BB^\top)^{-1}A,\qquad Q_1 = Q_2 = C^\top C,$$
where $r_1$ and $r_2$ are random numbers from (0, 1) and the matrices $A \in \mathbb{R}^{N \times N}$, $B \in \mathbb{R}^{N \times 1}$, and $C \in \mathbb{R}^{1 \times N}$ (N = 20,082) can be found at [23]. Since $A_1$ and $A_2$ have almost the same sparse structure, we only plot $A_1$ on the right of Figure 4, where the number of non-zero elements reaches 381,276. In the numerical experiments, we set m = 2 and use the probability matrix $\Pi = \begin{pmatrix} 0.631 & 0.369 \\ 0.143 & 0.857 \end{pmatrix}$. We ran OSA_lr for 10 different $r_1$ and $r_2$ and recorded the averaged CPU time, residuals of the CDSEs, and column dimensions of the low-ranked factors in Table 6.
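Since each $A_i = (I + r_iBB^\top)^{-1}A$ is the product of the inverse of a rank-one correction with a sparse matrix, it need not be formed explicitly. A matrix-free sketch via the Sherman–Morrison identity (with b denoting the benchmark column B as a 1-D array) is:

```python
import numpy as np
from scipy.sparse.linalg import LinearOperator

def mode_matrix(A, b, r, scale=1.0):
    """A_i = scale * (I + r b b^T)^{-1} A, applied matrix-free via
    Sherman-Morrison: (I + r b b^T)^{-1} = I - c b b^T, c = r / (1 + r b^T b).
    Provides matvec (A_i x) and rmatvec (A_i^T x) for the iteration."""
    c = r / (1.0 + r * (b @ b))
    btA = A.T @ b                                 # the row vector b^T A
    N = A.shape[0]
    return LinearOperator(
        (N, N),
        matvec=lambda x: scale * (A @ x - c * b * (btA @ x)),
        rmatvec=lambda x: scale * (A.T @ (x - c * b * (b @ x))),
    )
```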
The computational results in Table 6 reveal that OSA_lr requires only four iterations to achieve the predetermined equation residual accuracy. Moreover, the Res$i$ and Rel_Res$i$ columns indicate that OSA_lr exhibits quadratic convergence. The $m_k^{Q_i}$ column demonstrates that the column count of the low-rank factor $L_{i,k}^Q$ approximately doubles in the first three iterations but experiences a close to three-fold increase in the final iteration. In terms of the CPU time, the final iteration requires significantly more time than the sum of the preceding three. A detailed analysis indicates that the primary reason is similar to the previous examples: the computation of the equation residuals accounts for the majority of the time. The performance of OSA_lr on the residual history is plotted in Figure 7, where R-Res_i (i = 1, 2) denotes the relative residual of the i-th equation.
Example 6.
Consider the structural vertical stand model from machinery control systems, depicted on the left side of Figure 5, representing a segment of a machine tool [30]. In this structural component, a set of guide rails is situated on one of its surfaces. Throughout the machining process, a tool slide traverses various positions along these rails [32]. The model was created and meshed using ANSYS. For spatial discretization, the finite element method with linear Lagrange elements was employed and implemented in FEniCS.
The derived system matrices are
$$A_1 = 0.01\big[(I + r_1BB^\top)^{-1}A\big],\quad A_2 = 0.015\big[(I + r_2BB^\top)^{-1}A\big],\quad Q_1 = L_1^Q(L_1^Q)^\top,\quad Q_2 = L_2^Q(L_2^Q)^\top,$$
where $r_1$ and $r_2$ are random numbers from (0, 1), and $L_1^Q, L_2^Q \in \mathbb{R}^{N \times 1}$ are vectors of zeros except for five ones located in rows (3341, 6743, 8932, 11,324, 16,563) and rows (1046, 2436, 6467, 8423, 12,574), respectively. As the sparse structures of $A_1$ and $A_2$ are analogous, the right of Figure 5 only exhibits the structure of $A_1$, which contains 602,653 non-zero elements. The matrices $A \in \mathbb{R}^{16{,}626 \times 16{,}626}$ and $B \in \mathbb{R}^{16{,}626 \times 1}$ can be found at [23]. In this example, we set m = 2 and use the probability matrix $\Pi = \begin{pmatrix} 0.564 & 0.436 \\ 0.785 & 0.215 \end{pmatrix}$.
We utilized OSA_lr to solve the CDSEs, and the computed results are presented in Table 7. It can be observed from the table that OSA_lr terminates after four iterations, achieving a high-precision solution with equation residuals at the level of $O(10^{-15})$ to $O(10^{-16})$. The iteration history in the Res$i$ and Rel_Res$i$ columns illustrates the quadratic convergence of OSA_lr. The $m_k^{Q_i}$ column indicates that in the second, third, and fourth iterations, the column count of the low-rank factor $L_{i,k}^Q$ approximately doubles, triples, and quadruples, respectively, demonstrating that the truncation and compression techniques have a limited impact on reducing the column count of $L_{i,k}^Q$ in this scenario. Similarly, $\delta t_k$ reveals that the CPU time for the final iteration is significantly greater than the sum of the preceding three, primarily because the algorithm spends substantial time computing the equation residuals in the last iteration.
Example 7.
Consider a semi-discretized heat transfer problem aimed at optimizing the cooling of steel profiles in control systems, as discussed in [23,30]. The order of the models varies with the refinement applied to the computational mesh. For the discretization, the ALBERTA-1.2 fem-toolbox [33] and linear Lagrange elements are utilized. The initial mesh (depicted on the left in Figure 6) is generated using MATLAB's pdetool.
We slightly modify the model matrices as follows:
$$A_1 = r_1\big[(I + BB^\top)^{-1}A\big],\qquad A_2 = r_2\big[(I + BB^\top)^{-1}A\big],\qquad Q_1 = L_1^Q(L_1^Q)^\top,\qquad Q_2 = L_2^Q(L_2^Q)^\top,$$
where $r_1 = 0.009$ and $r_2 = 0.008$ for N = 20,209, and $r_1 = 0.0015$ and $r_2 = 0.0012$ for N = 79,841. In this experiment, we take $L_1^Q = r_3C^\top$ and $L_2^Q = r_4C^\top$, with $r_3$ and $r_4$ being random numbers in (0, 1). The matrices $A \in \mathbb{R}^{N \times N}$, $B \in \mathbb{R}^{N \times 1}$, and $C \in \mathbb{R}^{1 \times N}$ can be found at [23]. The sparse structures of $A_1$ and $A_2$ are nearly identical; for illustration, we only display the structure of $A_1$ with N = 79,841 on the right side of Figure 6. The probability matrix is defined as
$$\Pi = \begin{pmatrix} 0.713 & 0.287 \\ 0.584 & 0.416 \end{pmatrix}.$$
We employed OSA_lr to solve the CDSEs under two different dimensions, and the computed results are presented in Table 8. It is evident from the table that OSA_lr terminates after achieving the predetermined equation residual levels for both dimensions. The Rel_Res$i$ columns indicate a significant decrease in the relative equation residuals to the level of $O(10^{-8})$ by the second iteration, enabling the algorithm to obtain a high-precision solution in only four iterations. The two $m_k^{Q_i}$ columns demonstrate that, for both dimensions, the column count of the low-rank factor $L_{i,k}^Q$ increases by a factor of two per iteration, indicating that the truncation and compression techniques effectively constrain its growth in this scenario. Similarly, the $\delta t_k$ column reveals that in the final iteration, due to the computation of the equation residuals, OSA_lr consumes considerably more CPU time than the sum of the preceding three iterations. The performances of OSA_lr on the residual history with N = 20,209 and 79,841 are plotted in Figure 7, where R-Res_i (i = 1, 2) denotes the relative residual of the i-th equation.
Figure 7. Relative residual histories for OSA_lr in Examples 3–7.

5. Conclusions

This paper introduces an OSA method for coupled Stein equations arising in a class of jump systems. The convergence of the algorithm is established. For large-scale structured problems, the OSA is extended to a low-rank structured iterative format, and an error propagation analysis of the algorithm is conducted. Numerical experiments drawn from practical problems [23] indicate that, in small-scale computations, the OSA outperforms existing linearly convergent iterative methods in terms of both CPU time and accuracy; in large-scale computations, OSA_lr efficiently computes high-precision solutions of CDSEs. Nevertheless, the experiments reveal that the time spent on residual computation in the final iteration is relatively high. Improving the efficiency of the algorithm's termination criterion is therefore a direction for future research.

Author Contributions

Conceptualization, B.Y.; methodology, B.Y.; software, B.H.; validation, N.D.; formal analysis, N.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the NSF of China (11801163), the NSF of Hunan Province (2021JJ50032, 2023JJ50165, 2024JJ7162), and the Degree & Postgraduate Education Reform Project of Hunan University of Technology and Hunan Province (JGYB23009, 2024JGYB210).

Data Availability Statement

All examples and data can be found in [30].

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Chen, C.-T. Linear System Theory and Design, 3rd ed.; Oxford University Press: New York, NY, USA, 1999. [Google Scholar]
  2. Betser, A.; Cohen, N.; Zeheb, E. On solving the Lyapunov and Stein equations for a companion matrix. Syst. Control Lett. 1995, 25, 211–218. [Google Scholar] [CrossRef]
  3. Hueso, J.L.; Martínez, G.; Hernández, V. A systolic algorithm for the triangular Stein equation. J. VLSI Signal Process. Syst. Signal Image Video Technol. 1993, 5, 49–55. [Google Scholar] [CrossRef]
  4. Li, T.-X.; Weng, P.C.-Y.; Chu, E.K.-W.; Lin, W.-W. Large-scale Stein and Lyapunov Equations, Smith Method, and Applications. Numer. Algorithms 2013, 63, 727–752. [Google Scholar] [CrossRef]
  5. Fan, H.-Y.; Weng, P.C.-Y.; Chu, E.K.-W. Numerical solution to generalized Lyapunov/Stein and rational Riccati equations in stochastic control. Numer. Algorithms 2016, 71, 245–272. [Google Scholar] [CrossRef]
  6. Yu, B.; Dong, N.; Tang, Q. Factorized squared Smith method for large-scale Stein equations with high-rank terms. Automatica 2023, 154, 111057. [Google Scholar] [CrossRef]
  7. Borno, I.; Gajic, Z. Parallel algorithm for solving coupled algebraic Lyapunov equations of discrete-time jump linear systems. Comput. Math. Appl. 1995, 30, 1–4. [Google Scholar] [CrossRef]
  8. Wu, A.-G.; Duan, G.-R. New Iterative algorithms for solving coupled Markovian jump Lyapunov equations. IEEE Trans. Auto. Control 2015, 60, 289–294. [Google Scholar]
  9. Zhou, B.; Duan, G.-R.; Li, Z.-Y. Gradient based iterative algorithm for solving coupled matrix equations. Syst. Control. Lett. 2009, 58, 327–333. [Google Scholar] [CrossRef]
  10. Li, Z.-Y.; Zhou, B.; Lam, J.; Wang, Y. Positive operator based iterative algorithms for solving Lyapunov equations for Itô stochastic systems with Markovian jumps. Appl. Math. Comput. 2011, 217, 8179–8195. [Google Scholar] [CrossRef]
  11. Yu, B.; Fan, H.-Y.; Chu, E.K.-W. Smith method for projected Lyapunov and Stein equations. UPB Sci. Bull. Ser. A 2018, 80, 191–204. [Google Scholar]
  12. Sun, H.-J.; Zhang, Y.; Fu, Y.-M. Accelerated smith iterative algorithms for coupled Lyapunov matrix equations. J. Frankl. Inst. 2017, 354, 6877–6893. [Google Scholar] [CrossRef]
  13. Li, T.-Y.; Gajic, Z. Lyapunov iterations for solving coupled algebraic Riccati equations of Nash differential games and algebraic Riccati equations of zero-sum games. In New Trends in Dynamic Games and Applications; Olsder, G.J., Ed.; Annals of the International Society of Dynamic Games; Birkhäuser: Boston, MA, USA, 1995; Volume 3. [Google Scholar]
  14. Wicks, M.; De Carlo, R. Solution of Coupled Lyapunov Equations for the Stabilization of Multimodal Linear Systems. In Proceedings of the 1997 American Control Conference (Cat. No.97CH36041), Albuquerque, NM, USA, 4–6 June 1997; Volume 3, pp. 1709–1713. [Google Scholar]
  15. Ivanov, I.G. An Improved method for solving a system of discrete-time generalized Riccati equations. J. Numer. Math. Stoch. 2011, 3, 57–70. [Google Scholar]
  16. Qian, Y.-Y.; Pang, W.-J. An implicit sequential algorithm for solving coupled Lyapunov equations of continuous-time Markovian jump systems. Automatica 2015, 60, 245–250. [Google Scholar] [CrossRef]
  17. Costa, O.L.V.; Aya, J.C.C. Temporal difference methods for the maximal solution of discrete-time coupled algebraic Riccati equations. J. Optim. Theory Appl. 2001, 109, 289–309. [Google Scholar] [CrossRef]
  18. Costa, O.L.V.; Marques, R.P. Maximal and stabilizing Hermitian solutions for discrete-time coupled algebraic Riccati equations. Math. Control Signals Syst. 1999, 12, 167–195. [Google Scholar] [CrossRef]
  19. Ivanov, I.G. Stein iterations for the coupled discrete-time Riccati equations. Nonlinear Anal. Theory Methods Appl. 2009, 71, 6244–6253. [Google Scholar] [CrossRef]
  20. Bai, L.; Zhang, S.; Wang, S.; Wang, K. Improved SOR iterative method for coupled Lyapunov matrix equations. Afr. Math. 2021, 32, 1457–1463. [Google Scholar] [CrossRef]
  21. Tian, Z.-L.; Xu, T.-Y. An SOR-type algorithm based on IO iteration for solving coupled discrete Markovian jump Lyapunov equations. Filomat 2021, 35, 3781–3799. [Google Scholar] [CrossRef]
  22. Penzl, T. A cyclic low-rank Smith method for large sparse Lyapunov equations. SIAM J. Sci. Comput. 1999, 21, 1401–1408. [Google Scholar] [CrossRef]
  23. Korvink, J.G.; Rudnyi, E.B. Oberwolfach Benchmark Collection. In Dimension Reduction of Large-Scale Systems; Benner, P., Sorensen, D.C., Mehrmann, V., Eds.; Lecture Notes in Computational Science and Engineering; Springer: Berlin/Heidelberg, Germany, 2005; Volume 45. [Google Scholar]
  24. Wang, Q.; Lam, J.; Wei, Y.; Chen, T. Iterative solutions of coupled discrete Markovian jump Lyapunov equations. Comput. Math. Appl. 2008, 55, 843–850. [Google Scholar] [CrossRef]
  25. Mathworks. MATLAB User’s Guide; Mathworks: Natick, MA, USA, 2020; Available online: https://www.mathworks.com/help/pdf_doc/matlab/index.html (accessed on 15 March 2024).
  26. Ober, R.J. Asymptotically Stable All-Pass Transfer Functions: Canonical Form, Parametrization and Realization. IFAC Proc. Vol. 1987, 20, 181–185. [Google Scholar] [CrossRef]
  27. Chahlaoui, Y.; Van Dooren, P. Benchmark examples for model reduction of linear time-invariant dynamical systems. In Dimension Reduction of Large-Scale Systems; Springer: Berlin/Heidelberg, Germany, 2005; Volume 45, pp. 379–392. [Google Scholar]
  28. Chu, E.K.-W.; Weng, P.C.-Y. Large-scale discrete-time algebraic Riccati equations—Doubling algorithm and error analysis. J. Comput. Appl. Math. 2015, 277, 115–126. [Google Scholar] [CrossRef]
  29. Higham, N.J. Functions of Matrices: Theory and Computation; SIAM: Philadelphia, PA, USA, 2008. [Google Scholar]
  30. Chahlaoui, Y.; Van Dooren, P. A collection of benchmark examples for model reduction of linear time invariant dynamical systems. Work. Note 2002. Available online: https://eprints.maths.manchester.ac.uk/1040/1/ChahlaouiV02a.pdf (accessed on 15 March 2024).
  31. Moosmann, C.; Greiner, A. Convective thermal flow problems. In Dimension Reduction of Large-Scale Systems; Lecture Notes in Computational Science and Engineering; Springer: Berlin/Heidelberg, Germany, 2005; Volume 45, pp. 341–343. [Google Scholar]
  32. Lang, N. Numerical Methods for Large-Scale Linear Time-Varying Control Systems and Related Differential Matrix Equations; Logos-Verlag: Berlin, Germany, 2018. [Google Scholar]
  33. Schmidt, A.; Siebert, K. Design of Adaptive Finite Element Software—The Finite Element Toolbox ALBERTA; Lecture Notes in Computational Science and Engineering; Springer: Berlin/Heidelberg, Germany, 2005; Volume 42. [Google Scholar]
  34. Harper, C.A. Electronic Packaging and Interconnection Handbook; McGraw-Hill: New York, NY, USA, 1997. [Google Scholar]
Figure 1. Residual history in each equation for OSA and FIX.
Figure 2. Residual history in each equation for OSA and FIX when $\rho(A_1) = 0.96$ and $\rho(A_2) = 0.95$.
Figure 3. Residual history in each equation for OSA and FIX in Example 2.
Figure 4. The 3D model of the chip and the discretized matrix $A_1$.
Figure 5. The vertical stand model and the discretized matrix $A_1$.
Figure 6. The initial mesh for the cooling of steel and the discretized matrices $A_1$ and $A_2$.
Table 1. History of CPU time and residual for OSA and FIX in Example 1.

| Method | It. | δt_k | t_k | Rel_Res | δt_k | t_k | Rel_Res |
|--------|-----|------|-----|---------|------|-----|---------|
|  |  |  | N = 400 |  |  | N = 800 |  |
| OSA | 1 | 0.045 | 0.045 | 1.38 × 10^-1 | 0.156 | 0.156 | 2.50 × 10^-1 |
|  | 2 | 0.070 | 0.115 | 1.04 × 10^-2 | 0.275 | 0.431 | 2.00 × 10^-2 |
|  | 3 | 0.139 | 0.254 | 8.59 × 10^-5 | 0.434 | 0.865 | 1.88 × 10^-4 |
|  | 4 | 0.173 | 0.427 | 6.15 × 10^-9 | 0.705 | 1.571 | 1.59 × 10^-8 |
|  | 5 | 0.284 | 0.711 | 2.66 × 10^-16 | 1.244 | 2.814 | 2.47 × 10^-16 |
| FIX | 1 | 0.495 | 0.495 | 4.10 × 10^-1 | 2.276 | 2.276 | 7.77 × 10^-1 |
|  | 2 | 0.483 | 0.979 | 1.26 × 10^-1 | 2.021 | 4.297 | 2.51 × 10^-1 |
|  | 3 | 0.471 | 1.450 | 4.68 × 10^-3 | 2.049 | 6.346 | 9.85 × 10^-3 |
|  | 4 | 0.452 | 1.902 | 1.65 × 10^-4 | 2.078 | 8.424 | 3.66 × 10^-4 |
|  | 5 | 0.479 | 2.381 | 6.00 × 10^-6 | 2.051 | 10.503 | 1.40 × 10^-5 |
|  | 6 | 0.606 | 2.986 | 2.27 × 10^-7 | 2.051 | 12.553 | 5.58 × 10^-7 |
|  | 7 | 0.630 | 3.617 | 8.87 × 10^-9 | 2.031 | 14.584 | 2.42 × 10^-8 |
|  | 8 | 0.450 | 4.067 | 4.02 × 10^-10 | 2.055 | 16.639 | 1.21 × 10^-9 |
|  | 9 | 0.462 | 4.529 | 1.90 × 10^-11 | 2.053 | 18.693 | 6.05 × 10^-11 |
|  | 10 | 0.470 | 4.998 | 9.11 × 10^-13 | 2.046 | 20.738 | 3.05 × 10^-12 |
|  | 11 | 0.474 | 5.472 | 1.19 × 10^-13 ∗ | 2.076 | 22.814 | 2.05 × 10^-13 ∗ |
Table 2. History of CPU time and residual for OSA and FIX in Example 2.

| Method | It. | δt_k | t_k | Rel_Res | δt_k | t_k | Rel_Res |
|--------|-----|------|-----|---------|------|-----|---------|
|  |  |  | N = 350 |  |  | N = 700 |  |
| OSA | 1 | 0.001 | 0.001 | 4.37 × 10^-2 | 0.003 | 0.003 | 4.37 × 10^-2 |
|  | 2 | 0.005 | 0.006 | 3.54 × 10^-3 | 0.003 | 0.006 | 3.54 × 10^-3 |
|  | 3 | 0.009 | 0.015 | 3.27 × 10^-5 | 0.015 | 0.865 | 3.27 × 10^-5 |
|  | 4 | 0.032 | 0.047 | 1.85 × 10^-8 | 0.036 | 0.051 | 1.85 × 10^-8 |
|  | 5 | 0.235 | 0.281 | 5.01 × 10^-14 | 0.331 | 0.382 | 5.01 × 10^-14 |
| FIX | 1 | 0.291 | 0.291 | 1.91 × 10^-1 | 1.252 | 1.252 | 1.91 × 10^-1 |
|  | 2 | 0.231 | 0.523 | 1.27 × 10^-2 | 1.246 | 2.499 | 1.27 × 10^-2 |
|  | 3 | 0.234 | 0.757 | 2.45 × 10^-4 | 1.294 | 3.793 | 2.45 × 10^-4 |
|  | 4 | 0.240 | 0.997 | 7.50 × 10^-6 | 1.231 | 5.024 | 7.50 × 10^-6 |
|  | 5 | 0.227 | 1.224 | 2.99 × 10^-7 | 1.242 | 6.266 | 2.99 × 10^-7 |
|  | 6 | 0.254 | 1.478 | 1.62 × 10^-8 | 1.215 | 7.482 | 1.62 × 10^-8 |
|  | 7 | 0.240 | 1.718 | 1.25 × 10^-9 | 1.302 | 8.784 | 1.25 × 10^-9 |
|  | 8 | 0.228 | 1.946 | 1.02 × 10^-10 | 1.255 | 10.039 | 1.02 × 10^-10 |
|  | 9 | 0.231 | 2.177 | 8.55 × 10^-12 | 1.243 | 11.281 | 8.55 × 10^-12 |
|  | 10 | 0.244 | 2.421 | 6.89 × 10^-13 | 1.231 | 12.513 | 6.90 × 10^-13 |
Table 3. Complexity and memory at the k-th iteration of algorithm OSA_lr.

| Items | Flops | Memory |
|-------|-------|--------|
| $L_{i,k}^{F_{2^k-1}}$ | $c\,m_{k-1}^q\,2^{k-1}(m^{2^k-1}+m)N$ | $m^{2^k-1}m_{k-1}^q\,N$ |
| $K_{i,k}^{F_{2^k-1}}$ | $m(m_{k-1}^q)^2(1+m^{2^k})(k+1)/2$ | $(m^{2^k-1}m_{k-1}^q)^2$ |
| $L_{i,k}^{F_{2^k-1}}$ QR * | $2(m^{2^k-1}m_{k-1}^q)^2(N-m^{2^k-1}m_{k-1}^q/3)$ | $(m_k^f)^2$ |
| Compressed $K_{i,k}^{F_{2^k-1}}$ | $4m_k^f(m^{2^k-1}m_{k-1}^q)^2$ | $(m_k^f)^2$ |
| $L_{i,k}^Q$ | — | $(m_{k-1}^q+m_k^f)N$ |
| $K_{i,k}^Q$ | — | $(m_{k-1}^q+m_k^f)^2$ |
| $L_{i,k}^Q$ QR * | $2(m_k^f+m_{k-1}^q)^2(N-(m_k^f+m_{k-1}^q)/3)$ | $(m_k^q)^2$ |
| Compressed $K_{i,k}^Q$ | $4m_k^q(m_k^f+m_{k-1}^q)^2$ | $(m_k^q)^2$ |

* Householder QR decomposition is used [29].
Table 4. CPU time and residual in Example 3.

| It. | δt_k | t_k | Res1 | Rel_Res1 | Res2 | Rel_Res2 | m_k^Q1 | m_k^Q2 |
|-----|------|-----|------|----------|------|----------|--------|--------|
| 1 | 0.012 | 0.012 | 2.06 × 10^0 | 2.06 × 10^0 | 3.06 × 10^0 | 3.06 × 10^0 | 3 | 3 |
| 2 | 0.135 | 0.147 | 1.35 × 10^-1 | 1.35 × 10^-1 | 6.15 × 10^-2 | 6.15 × 10^-2 | 9 | 9 |
| 3 | 0.659 | 0.806 | 1.44 × 10^-3 | 1.44 × 10^-3 | 6.44 × 10^-4 | 6.44 × 10^-4 | 21 | 21 |
| 4 | 35.765 | 36.571 | 1.80 × 10^-7 | 1.80 × 10^-7 | 7.51 × 10^-8 | 7.51 × 10^-8 | 46 | 46 |
| 5 | 488.519 | 525.090 | 4.42 × 10^-14 | 4.42 × 10^-14 | 2.72 × 10^-14 | 2.72 × 10^-14 | 109 | 109 |
Table 5. CPU time and residual in Example 4.

| It. | δt_k | t_k | Res1 | Rel_Res1 | Res2 | Rel_Res2 | m_k^Q1 | m_k^Q2 |
|-----|------|-----|------|----------|------|----------|--------|--------|
| N = 21,000 |  |  |  |  |  |  |  |  |
| 1 | 0.005 | 0.005 | 1.51 × 10^-1 | 1.51 × 10^-1 | 1.55 × 10^-1 | 1.55 × 10^-1 | 3 | 3 |
| 2 | 0.010 | 0.015 | 4.73 × 10^-2 | 4.73 × 10^-2 | 2.70 × 10^-2 | 2.70 × 10^-2 | 7 | 7 |
| 3 | 0.019 | 0.034 | 6.10 × 10^-4 | 6.10 × 10^-4 | 3.83 × 10^-4 | 3.83 × 10^-4 | 15 | 15 |
| 4 | 0.053 | 0.088 | 6.30 × 10^-7 | 6.30 × 10^-7 | 4.04 × 10^-7 | 4.04 × 10^-7 | 31 | 31 |
| 5 | 1.354 | 1.441 | 3.67 × 10^-12 | 3.67 × 10^-12 | 2.37 × 10^-12 | 2.37 × 10^-12 | 157 | 157 |
| 6 | 235.119 | 236.560 | 1.93 × 10^-14 | 1.93 × 10^-14 | 7.13 × 10^-15 | 7.13 × 10^-15 | 908 | 908 |
| N = 28,000 |  |  |  |  |  |  |  |  |
| 1 | 0.006 | 0.006 | 1.51 × 10^-1 | 1.51 × 10^-1 | 1.55 × 10^-1 | 1.55 × 10^-1 | 3 | 3 |
| 2 | 0.017 | 0.023 | 4.73 × 10^-2 | 4.73 × 10^-2 | 2.70 × 10^-2 | 2.70 × 10^-2 | 7 | 7 |
| 3 | 0.029 | 0.052 | 6.10 × 10^-4 | 6.10 × 10^-4 | 3.83 × 10^-4 | 3.83 × 10^-4 | 15 | 15 |
| 4 | 0.078 | 0.130 | 6.30 × 10^-7 | 6.30 × 10^-7 | 4.04 × 10^-7 | 4.04 × 10^-7 | 31 | 31 |
| 5 | 1.373 | 1.503 | 4.09 × 10^-12 | 4.09 × 10^-12 | 2.86 × 10^-12 | 2.86 × 10^-12 | 147 | 147 |
| 6 | 239.690 | 241.193 | 1.59 × 10^-14 | 1.59 × 10^-14 | 8.68 × 10^-15 | 8.68 × 10^-15 | 908 | 908 |
| N = 35,000 |  |  |  |  |  |  |  |  |
| 1 | 0.008 | 0.008 | 1.51 × 10^-1 | 1.51 × 10^-1 | 1.55 × 10^-1 | 1.55 × 10^-1 | 3 | 3 |
| 2 | 0.025 | 0.034 | 4.73 × 10^-2 | 4.73 × 10^-2 | 2.70 × 10^-2 | 2.70 × 10^-2 | 7 | 7 |
| 3 | 0.043 | 0.077 | 6.10 × 10^-4 | 6.10 × 10^-4 | 3.83 × 10^-4 | 3.83 × 10^-4 | 15 | 15 |
| 4 | 0.098 | 0.175 | 6.30 × 10^-7 | 6.30 × 10^-7 | 4.04 × 10^-7 | 4.04 × 10^-7 | 31 | 31 |
| 5 | 1.617 | 1.793 | 4.76 × 10^-12 | 4.76 × 10^-12 | 3.46 × 10^-12 | 3.46 × 10^-12 | 161 | 161 |
| 6 | 248.356 | 250.148 | 1.97 × 10^-14 | 1.97 × 10^-14 | 8.62 × 10^-15 | 8.62 × 10^-15 | 908 | 908 |
Table 6. CPU time and residual in Example 5.

| It. | δt_k | t_k | Res1 | Rel_Res1 | Res2 | Rel_Res2 | m_k^Q1 | m_k^Q2 |
|-----|------|-----|------|----------|------|----------|--------|--------|
| 1 | 0.017 | 0.017 | 2.06 × 10^-1 | 2.22 × 10^0 | 6.13 × 10^-1 | 1.05 × 10^0 | 10 | 10 |
| 2 | 0.083 | 0.100 | 9.73 × 10^-6 | 5.03 × 10^-5 | 9.83 × 10^-6 | 1.69 × 10^-5 | 22 | 22 |
| 3 | 0.665 | 0.766 | 2.74 × 10^-10 | 1.47 × 10^-9 | 2.75 × 10^-10 | 4.72 × 10^-10 | 46 | 46 |
| 4 | 1073.56 | 1074.34 | 2.48 × 10^-17 | 1.33 × 10^-16 | 2.49 × 10^-16 | 4.28 × 10^-16 | 112 | 112 |
Table 7. CPU time and residual in Example 6.

| It. | δt_k | t_k | Res1 | Rel_Res1 | Res2 | Rel_Res2 | m_k^Q1 | m_k^Q2 |
|-----|------|-----|------|----------|------|----------|--------|--------|
| 1 | 0.023 | 0.023 | 5.01 × 10^0 | 5.01 × 10^0 | 5.01 × 10^0 | 5.01 × 10^0 | 3 | 3 |
| 2 | 0.091 | 0.114 | 1.78 × 10^-6 | 1.78 × 10^-6 | 3.77 × 10^-6 | 3.77 × 10^-6 | 7 | 7 |
| 3 | 0.787 | 0.901 | 2.25 × 10^-11 | 2.25 × 10^-11 | 4.52 × 10^-11 | 4.52 × 10^-11 | 25 | 25 |
| 4 | 164.941 | 165.842 | 5.59 × 10^-15 | 5.59 × 10^-15 | 5.96 × 10^-16 | 5.96 × 10^-16 | 119 | 119 |
Table 8. CPU time and residual in Example 7.

| It. | δt_k | t_k | Res1 | Rel_Res1 | Res2 | Rel_Res2 | m_k^Q1 | m_k^Q2 |
|-----|------|-----|------|----------|------|----------|--------|--------|
| N = 20,209 |  |  |  |  |  |  |  |  |
| 1 | 0.011 | 0.011 | 8.30 × 10^0 | 1.34 × 10^0 | 6.82 × 10^0 | 1.34 × 10^0 | 12 | 12 |
| 2 | 0.052 | 0.063 | 2.39 × 10^-7 | 3.84 × 10^-8 | 2.48 × 10^-7 | 4.86 × 10^-8 | 24 | 24 |
| 3 | 0.129 | 0.192 | 7.72 × 10^-13 | 1.24 × 10^-13 | 8.01 × 10^-13 | 1.57 × 10^-13 | 48 | 48 |
| 4 | 489.420 | 489.612 | 5.13 × 10^-15 | 8.25 × 10^-16 | 2.07 × 10^-15 | 4.06 × 10^-16 | 96 | 96 |
| N = 79,841 |  |  |  |  |  |  |  |  |
| 1 | 0.614 | 0.614 | 8.29 × 10^0 | 1.33 × 10^0 | 6.81 × 10^0 | 1.33 × 10^0 | 12 | 12 |
| 2 | 5.140 | 5.754 | 1.77 × 10^-7 | 2.84 × 10^-8 | 1.06 × 10^-7 | 2.09 × 10^-8 | 24 | 24 |
| 3 | 16.946 | 22.700 | 4.74 × 10^-12 | 7.63 × 10^-13 | 2.86 × 10^-12 | 5.60 × 10^-13 | 48 | 48 |
| 4 | 874.870 | 895.570 | 2.73 × 10^-15 | 4.39 × 10^-16 | 3.67 × 10^-15 | 7.19 × 10^-16 | 96 | 96 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.


