Article

The BiCG Algorithm for Solving the Minimal Frobenius Norm Solution of Generalized Sylvester Tensor Equation over the Quaternions

by Mengyan Xie 1,2, Qing-Wen Wang 2,3,* and Yang Zhang 4

1 College of Information Technology, Shanghai Ocean University, Shanghai 201306, China
2 Department of Mathematics and Newtouch Center for Mathematics, Shanghai University, Shanghai 200444, China
3 Collaborative Innovation Center for the Marine Artificial Intelligence, Shanghai 200444, China
4 Department of Mathematics, University of Manitoba, Winnipeg, MB R3T 2N2, Canada
* Author to whom correspondence should be addressed.
Symmetry 2024, 16(9), 1167; https://doi.org/10.3390/sym16091167
Submission received: 5 August 2024 / Revised: 31 August 2024 / Accepted: 3 September 2024 / Published: 6 September 2024
(This article belongs to the Special Issue Feature Papers in Mathematics Section)

Abstract: In this paper, we develop an effective iterative algorithm to solve a generalized Sylvester tensor equation over the quaternions, which includes several well-studied matrix/tensor equations as special cases. We show that, in the absence of round-off errors, the algorithm converges to a solution within a finite number of iterations for any initial tensor. Moreover, we show that the unique minimal Frobenius norm solution can be obtained by selecting appropriate initial tensors. Numerical examples are presented to illustrate the practicality and validity of the proposed algorithm, including its effectiveness in addressing a three-dimensional microscopic heat transport problem and a color video restoration problem.

1. Introduction

An order $N$ tensor $\mathcal{A} = (a_{i_1 \cdots i_N})$, $1 \le i_j \le I_j$ $(j = 1, \ldots, N)$, over a field $\mathbb{F}$ is a multidimensional array with $I_1 I_2 \cdots I_N$ entries in $\mathbb{F}$, where $N$ is a positive integer [1,2]. The set of all such order $N$ tensors is denoted by $\mathbb{F}^{I_1 \times \cdots \times I_N}$. Over the past few decades, there has been extensive research on tensors, driven by their diverse applications in fields such as physics, computer vision, data mining, and more (see, e.g., [1,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23]).
In this paper, we examine the conditions under which certain tensor equations over quaternions have solutions. This is motivated by recent research on tensor equations as well as a long history of solving matrix equations, briefly outlined as follows. It is well known that the Sylvester matrix equation
$$A X + Y B = C \tag{1}$$
and its generalized forms have been widely investigated and have found numerous applications in many areas.
During the past few decades, many methods have been developed for solving Sylvester-type matrix equations over the quaternion algebra. For example, Kyrchei [24] provided explicit determinantal representation formulas for the solutions to Equation (1). Heyouni et al. [25] presented the SGl-CMRH method, and a preconditioned framework of this method, to solve the matrix Equation (1) when $X = Y$. Zhang [26] investigated the general system of generalized Sylvester quaternion matrix equations. Ahmadi-Asl and Beik [27,28,29] developed efficient iterative algorithms for solving various quaternion matrix equations. Song [30] investigated the general solution to a system of quaternion matrix equations by Cramer's rule. Wang et al. [31] investigated the solvability of a system of constrained two-sided coupled generalized Sylvester quaternion matrix equations. Zhang et al. [32] derived specific least-squares solutions of the quaternion matrix equation $AXB + CXD = E$. Meanwhile, Huang et al. [33] applied the modified conjugate gradient method to address the generalized coupled Sylvester conjugate matrix equations.
Quaternions provide more versatility and flexibility than real and complex numbers, especially when dealing with multidimensional problems. This unique property has attracted growing interest among scholars, leading to numerous valuable achievements in quaternion-related research (see, e.g., [34,35,36,37,38,39,40]). The tensor equation is a natural extension of the matrix equation.
In this paper, we examine the following generalized Sylvester tensor equation over $\mathbb{H}$:
$$\mathcal{X} \times_1 A^{(1)} + \mathcal{X} \times_2 A^{(2)} + \cdots + \mathcal{X} \times_N A^{(N)} + \mathcal{Y} \times_1 B^{(1)} + \mathcal{Y} \times_2 B^{(2)} + \cdots + \mathcal{Y} \times_N B^{(N)} = \mathcal{C}, \tag{2}$$
where the matrices $A^{(n)}, B^{(n)} \in \mathbb{H}^{I_n \times I_n}$ $(n = 1, 2, \ldots, N)$ and the tensor $\mathcal{C} \in \mathbb{H}^{I_1 \times \cdots \times I_N}$ are given, and the tensors $\mathcal{X}, \mathcal{Y} \in \mathbb{H}^{I_1 \times \cdots \times I_N}$ are unknown. The $n$-mode product of a tensor $\mathcal{X} \in \mathbb{H}^{I_1 \times \cdots \times I_N}$ with a matrix $A \in \mathbb{H}^{I_n \times I_n}$ is defined as
$$(\mathcal{X} \times_n A)_{i_1 \cdots i_{n-1} j i_{n+1} \cdots i_N} = \sum_{i_n = 1}^{I_n} a_{j i_n} x_{i_1 \cdots i_{n-1} i_n i_{n+1} \cdots i_N}.$$
Observe that the $n$-mode product can also be represented using unfolded quaternion tensors:
$$\mathcal{Y} = \mathcal{X} \times_n A \iff \mathcal{Y}_{[n]} = A \mathcal{X}_{[n]},$$
where $\mathcal{X}_{[n]}$ is the mode-$n$ unfolding of $\mathcal{X}$ [1]. We address the following problems related to (2):
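As a concrete illustration of the unfolding identity above, here is a small NumPy sketch (zero-based mode indices; the helper names `mode_n_product` and `unfold` are ours, not the paper's, and we work over the reals for simplicity):

```python
import numpy as np

def mode_n_product(X, A, n):
    """n-mode product X x_n A: contract the columns of A with mode n of X."""
    # tensordot puts the new mode first; moveaxis returns it to position n
    return np.moveaxis(np.tensordot(A, X, axes=(1, n)), 0, n)

def unfold(X, n):
    """Mode-n unfolding: mode n becomes the rows; the remaining modes are
    flattened column-major, so lower modes vary fastest."""
    return np.reshape(np.moveaxis(X, n, 0), (X.shape[n], -1), order='F')

rng = np.random.default_rng(0)
X = rng.standard_normal((2, 3, 4))
A = rng.standard_normal((3, 3))
Y = mode_n_product(X, A, 1)        # multiply along the second mode
# the defining identity: Y_[n] = A X_[n]
assert np.allclose(unfold(Y, 1), A @ unfold(X, 1))
```

The same contraction pattern is what the paper's definition expresses entrywise.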
Problem 1.1: Given the tensor $\mathcal{C} \in \mathbb{H}^{I_1 \times I_2 \times \cdots \times I_N}$ and the matrices $A^{(n)}, B^{(n)} \in \mathbb{H}^{I_n \times I_n}$ $(n = 1, 2, \ldots, N)$, find the tensors $\widetilde{\mathcal{X}}, \widetilde{\mathcal{Y}} \in \mathbb{H}^{I_1 \times I_2 \times \cdots \times I_N}$ such that
$$\Big\| \sum_{k=1}^{N} \widetilde{\mathcal{X}} \times_k A^{(k)} + \widetilde{\mathcal{Y}} \times_k B^{(k)} - \mathcal{C} \Big\| = \min_{\mathcal{X}, \mathcal{Y}} \Big\| \sum_{k=1}^{N} \mathcal{X} \times_k A^{(k)} + \mathcal{Y} \times_k B^{(k)} - \mathcal{C} \Big\|.$$
Problem 1.2: Let $S_{XY}$ denote the solution set of Problem 1.1. For given tensors $\mathcal{X}_0, \mathcal{Y}_0 \in \mathbb{H}^{I_1 \times I_2 \times \cdots \times I_N}$, find the tensors $\check{\mathcal{X}}, \check{\mathcal{Y}} \in \mathbb{H}^{I_1 \times I_2 \times \cdots \times I_N}$ such that
$$\| \check{\mathcal{X}} - \mathcal{X}_0 \| + \| \check{\mathcal{Y}} - \mathcal{Y}_0 \| = \min_{[\mathcal{X}, \mathcal{Y}] \in S_{XY}} \big( \| \mathcal{X} - \mathcal{X}_0 \| + \| \mathcal{Y} - \mathcal{Y}_0 \| \big).$$
It is worth emphasizing that the tensor Equation (2) includes several well-studied matrix/tensor equations as special cases. For example, if X and Y in (2) are order 2 tensors, i.e., matrices, then Equation (2) can be reduced to the following extended Sylvester matrix equation
$$A^{(1)} X + X (A^{(2)})^T + B^{(1)} Y + Y (B^{(2)})^T = C.$$
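The order-2 reduction can be checked numerically: for matrices, the mode-1 product multiplies on the left and the mode-2 product multiplies on the right by a transpose. A real-valued NumPy sketch (our own illustration, not the paper's code):

```python
import numpy as np

def mode_n(X, A, n):
    """n-mode product (zero-based n)."""
    return np.moveaxis(np.tensordot(A, X, axes=(1, n)), 0, n)

rng = np.random.default_rng(1)
A1, A2 = rng.standard_normal((3, 3)), rng.standard_normal((3, 3))
X = rng.standard_normal((3, 3))       # an order-2 tensor, i.e., a matrix

lhs = mode_n(X, A1, 0) + mode_n(X, A2, 1)   # X x_1 A1 + X x_2 A2
rhs = A1 @ X + X @ A2.T                      # the Sylvester matrix form
assert np.allclose(lhs, rhs)
```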
In the case of $B^{(n)} = 0$ $(n = 1, 2, \ldots, N)$, Equation (2) becomes the following equation
$$\mathcal{X} \times_1 A^{(1)} + \mathcal{X} \times_2 A^{(2)} + \cdots + \mathcal{X} \times_N A^{(N)} = \mathcal{C}, \tag{3}$$
which has been the subject of extensive research in recent years. For instance, Saberi et al. [41,42] investigated the SGMRES-BTF and SGCRO-BTF methods to solve Equation (3) over $\mathbb{R}$. Wang et al. [43] proposed the conjugate gradient least-squares method to solve Equation (3) over $\mathbb{H}$. Zhang and Wang [44] introduced the tensor formulations of the bi-conjugate gradient (BiCG-BTF) and bi-conjugate residual (BiCR-BTF) methods for solving the tensor Equation (3) over the real number field $\mathbb{R}$. Chen and Lu [45] explored a projection method with a Kronecker product preconditioner to solve Equation (3) over $\mathbb{R}$. Karimi and Dehghan [46] introduced the tensor formulation of the global least-squares method for approximating solutions to (3). Additionally, Najafi-Kalyani et al. [47] developed several iterative algorithms based on the global Hessenberg process in tensor form to address Equation (3). Consider Equation (3) over $\mathbb{R}$ with $N = 3$, that is,
$$\mathcal{X} \times_1 A^{(1)} + \mathcal{X} \times_2 A^{(2)} + \mathcal{X} \times_3 A^{(3)} = \mathcal{C}. \tag{4}$$
It has been shown that Equation (4) plays an important role in finite difference methods [48], thermal radiation [11], information retrieval [15], finite element methods [49], and microscopic heat transport problems [18]. Therefore, our study of Equation (2) provides a unified treatment of these matrix/tensor equations.
The remainder of this paper is structured as follows. In Section 2, we review key definitions and notations and prove several lemmas related to transforming Equation (2). In Section 3, we develop the BiCG iterative algorithm for solving the quaternion tensor Equation (2) and prove that our algorithm is correct. We also demonstrate that the minimal Frobenius norm solution can be achieved by selecting specific types of initial tensors. Section 4 provides numerical examples to illustrate the effectiveness and applications of the proposed algorithm. Finally, we summarize our contributions in Section 5.

2. Preliminaries

First, we review some notations and definitions. For two complex matrices $U = (u_{ij}) \in \mathbb{C}^{m \times n}$ and $V = (v_{ij}) \in \mathbb{C}^{p \times q}$, the notation $U \otimes V = (u_{ij} V) \in \mathbb{C}^{mp \times nq}$ represents the Kronecker product of $U$ and $V$.
The operator $\mathrm{vec}(\cdot)$ is defined as follows: for a matrix $A$ and a tensor $\mathcal{X}$,
$$\mathrm{vec}(A) = \big( a_1^T, a_2^T, \ldots, a_n^T \big)^T, \qquad \mathrm{vec}(\mathcal{X}) = \mathrm{vec}(\mathcal{X}_{[1]}),$$
respectively, where $a_k$ is the $k$th column of $A$ and $\mathcal{X}_{[1]}$ is the mode-1 unfolding of the tensor $\mathcal{X}$. The inner product of two tensors $\mathcal{X}, \mathcal{Y} \in \mathbb{H}^{I_1 \times \cdots \times I_N}$ is defined as
$$\langle \mathcal{X}, \mathcal{Y} \rangle = \sum_{i_1 = 1}^{I_1} \sum_{i_2 = 1}^{I_2} \cdots \sum_{i_N = 1}^{I_N} x_{i_1 i_2 \cdots i_N} \bar{y}_{i_1 i_2 \cdots i_N},$$
where $\bar{y}_{i_1 i_2 \cdots i_N}$ denotes the quaternion conjugate of $y_{i_1 i_2 \cdots i_N}$. If $\langle \mathcal{X}, \mathcal{Y} \rangle = 0$, we say that the tensors $\mathcal{X}$ and $\mathcal{Y}$ are orthogonal. The Frobenius norm of a tensor $\mathcal{X}$ is given by $\| \mathcal{X} \| = \sqrt{\langle \mathcal{X}, \mathcal{X} \rangle}$.
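The quaternion inner product and Frobenius norm can be computed directly on the four real parts. A NumPy sketch (our own helper names; quaternion tensors are stored as 4-tuples of real arrays in the order $1, \mathbf{i}, \mathbf{j}, \mathbf{k}$):

```python
import numpy as np

def qmul(p, q):
    """Hamilton product of quaternions stored as 4 real arrays (1, i, j, k)."""
    p1, p2, p3, p4 = p
    q1, q2, q3, q4 = q
    return (p1*q1 - p2*q2 - p3*q3 - p4*q4,
            p1*q2 + p2*q1 + p3*q4 - p4*q3,
            p1*q3 - p2*q4 + p3*q1 + p4*q2,
            p1*q4 + p2*q3 - p3*q2 + p4*q1)

def qinner(X, Y):
    """<X, Y> = sum of x * conj(y) over all entries; quaternion-valued."""
    Yc = (Y[0], -Y[1], -Y[2], -Y[3])          # quaternion conjugate
    return tuple(np.sum(c) for c in qmul(X, Yc))

def qnorm(X):
    """Frobenius norm: sqrt of the (real) value <X, X>."""
    return np.sqrt(sum(np.sum(c**2) for c in X))
```

For any quaternion tensor, $\langle \mathcal{X}, \mathcal{X} \rangle$ is real and equals $\|\mathcal{X}\|^2$, which the helpers above reproduce.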
For any $\mathcal{X} \in \mathbb{H}^{I_1 \times \cdots \times I_N}$, it is well known that $\mathcal{X}$ can be uniquely represented as $\mathcal{X} = \mathcal{X}_1 + \mathcal{X}_2 \mathbf{i} + \mathcal{X}_3 \mathbf{j} + \mathcal{X}_4 \mathbf{k}$, where $\mathcal{X}_i \in \mathbb{R}^{I_1 \times \cdots \times I_N}$, $i = 1, 2, 3, 4$. Next, we define $n$-mode operators for the $\mathcal{X}_i$.
Let $A^{(n)} = A_1^{(n)} + A_2^{(n)} \mathbf{i} + A_3^{(n)} \mathbf{j} + A_4^{(n)} \mathbf{k}$, $B^{(n)} = B_1^{(n)} + B_2^{(n)} \mathbf{i} + B_3^{(n)} \mathbf{j} + B_4^{(n)} \mathbf{k} \in \mathbb{H}^{I_n \times I_n}$, where $A_i^{(n)}, B_i^{(n)} \in \mathbb{R}^{I_n \times I_n}$, $i = 1, 2, 3, 4$. For $\mathcal{W} \in \mathbb{R}^{I_1 \times \cdots \times I_N}$, we define
$$L_{A_i^{(n)}}(\mathcal{W}) = \mathcal{W} \times_1 A_i^{(1)} + \mathcal{W} \times_2 A_i^{(2)} + \cdots + \mathcal{W} \times_N A_i^{(N)}, \quad i = 1, 2, 3, 4,$$
$$L_{B_i^{(n)}}(\mathcal{W}) = \mathcal{W} \times_1 B_i^{(1)} + \mathcal{W} \times_2 B_i^{(2)} + \cdots + \mathcal{W} \times_N B_i^{(N)}, \quad i = 1, 2, 3, 4.$$
Next, replacing $\mathcal{W}$ in the above equations by the $\mathcal{X}_i$'s, we define the following notations:
$$\begin{aligned}
\Gamma_1[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] &= L_{A_1^{(n)}}(\mathcal{X}_1) - L_{A_2^{(n)}}(\mathcal{X}_2) - L_{A_3^{(n)}}(\mathcal{X}_3) - L_{A_4^{(n)}}(\mathcal{X}_4), \\
\Gamma_2[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] &= L_{A_2^{(n)}}(\mathcal{X}_1) + L_{A_1^{(n)}}(\mathcal{X}_2) + L_{A_4^{(n)}}(\mathcal{X}_3) - L_{A_3^{(n)}}(\mathcal{X}_4), \\
\Gamma_3[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] &= L_{A_3^{(n)}}(\mathcal{X}_1) - L_{A_4^{(n)}}(\mathcal{X}_2) + L_{A_1^{(n)}}(\mathcal{X}_3) + L_{A_2^{(n)}}(\mathcal{X}_4), \\
\Gamma_4[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] &= L_{A_4^{(n)}}(\mathcal{X}_1) + L_{A_3^{(n)}}(\mathcal{X}_2) - L_{A_2^{(n)}}(\mathcal{X}_3) + L_{A_1^{(n)}}(\mathcal{X}_4),
\end{aligned} \tag{5}$$
$$\begin{aligned}
\Phi_1[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] &= L_{B_1^{(n)}}(\mathcal{X}_1) - L_{B_2^{(n)}}(\mathcal{X}_2) - L_{B_3^{(n)}}(\mathcal{X}_3) - L_{B_4^{(n)}}(\mathcal{X}_4), \\
\Phi_2[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] &= L_{B_2^{(n)}}(\mathcal{X}_1) + L_{B_1^{(n)}}(\mathcal{X}_2) + L_{B_4^{(n)}}(\mathcal{X}_3) - L_{B_3^{(n)}}(\mathcal{X}_4), \\
\Phi_3[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] &= L_{B_3^{(n)}}(\mathcal{X}_1) - L_{B_4^{(n)}}(\mathcal{X}_2) + L_{B_1^{(n)}}(\mathcal{X}_3) + L_{B_2^{(n)}}(\mathcal{X}_4), \\
\Phi_4[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] &= L_{B_4^{(n)}}(\mathcal{X}_1) + L_{B_3^{(n)}}(\mathcal{X}_2) - L_{B_2^{(n)}}(\mathcal{X}_3) + L_{B_1^{(n)}}(\mathcal{X}_4).
\end{aligned} \tag{6}$$
The following lemma establishes that the quaternion tensor Equation (2) can be reduced to a system of four real tensor equations.
Lemma 1.
In Equation (2), assume that $A^{(n)} = A_1^{(n)} + A_2^{(n)} \mathbf{i} + A_3^{(n)} \mathbf{j} + A_4^{(n)} \mathbf{k}$, $B^{(n)} = B_1^{(n)} + B_2^{(n)} \mathbf{i} + B_3^{(n)} \mathbf{j} + B_4^{(n)} \mathbf{k} \in \mathbb{H}^{I_n \times I_n}$, $n = 1, 2, \ldots, N$, and $\mathcal{C} = \mathcal{C}_1 + \mathcal{C}_2 \mathbf{i} + \mathcal{C}_3 \mathbf{j} + \mathcal{C}_4 \mathbf{k}$, $\mathcal{X} = \mathcal{X}_1 + \mathcal{X}_2 \mathbf{i} + \mathcal{X}_3 \mathbf{j} + \mathcal{X}_4 \mathbf{k}$, $\mathcal{Y} = \mathcal{Y}_1 + \mathcal{Y}_2 \mathbf{i} + \mathcal{Y}_3 \mathbf{j} + \mathcal{Y}_4 \mathbf{k} \in \mathbb{H}^{I_1 \times \cdots \times I_N}$. Then the quaternion Sylvester tensor Equation (2) can be expressed as the following system of real tensor equations:
$$\begin{cases}
\Gamma_1[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] + \Phi_1[\mathcal{Y}_1, \mathcal{Y}_2, \mathcal{Y}_3, \mathcal{Y}_4] = \mathcal{C}_1, \\
\Gamma_2[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] + \Phi_2[\mathcal{Y}_1, \mathcal{Y}_2, \mathcal{Y}_3, \mathcal{Y}_4] = \mathcal{C}_2, \\
\Gamma_3[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] + \Phi_3[\mathcal{Y}_1, \mathcal{Y}_2, \mathcal{Y}_3, \mathcal{Y}_4] = \mathcal{C}_3, \\
\Gamma_4[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] + \Phi_4[\mathcal{Y}_1, \mathcal{Y}_2, \mathcal{Y}_3, \mathcal{Y}_4] = \mathcal{C}_4,
\end{cases} \tag{7}$$
where $\Gamma_i[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4]$ and $\Phi_i[\mathcal{Y}_1, \mathcal{Y}_2, \mathcal{Y}_3, \mathcal{Y}_4]$ $(i = 1, 2, 3, 4)$ are defined by (5) and (6). Furthermore, the system of real tensor Equations (7) is equivalent to the linear system
$$[M_A, M_B]\, z = c, \tag{8}$$
where, writing $K_{A_i} = \mathrm{Kro}(L_{A_i^{(n)}})$ and $K_{B_i} = \mathrm{Kro}(L_{B_i^{(n)}})$,
$$M_A = \begin{pmatrix}
K_{A_1} & -K_{A_2} & -K_{A_3} & -K_{A_4} \\
K_{A_2} & K_{A_1} & K_{A_4} & -K_{A_3} \\
K_{A_3} & -K_{A_4} & K_{A_1} & K_{A_2} \\
K_{A_4} & K_{A_3} & -K_{A_2} & K_{A_1}
\end{pmatrix}, \qquad
M_B = \begin{pmatrix}
K_{B_1} & -K_{B_2} & -K_{B_3} & -K_{B_4} \\
K_{B_2} & K_{B_1} & K_{B_4} & -K_{B_3} \\
K_{B_3} & -K_{B_4} & K_{B_1} & K_{B_2} \\
K_{B_4} & K_{B_3} & -K_{B_2} & K_{B_1}
\end{pmatrix},$$
$$z = \big( \mathrm{vec}(\mathcal{X}_1)^T, \mathrm{vec}(\mathcal{X}_2)^T, \mathrm{vec}(\mathcal{X}_3)^T, \mathrm{vec}(\mathcal{X}_4)^T, \mathrm{vec}(\mathcal{Y}_1)^T, \mathrm{vec}(\mathcal{Y}_2)^T, \mathrm{vec}(\mathcal{Y}_3)^T, \mathrm{vec}(\mathcal{Y}_4)^T \big)^T, \quad c = \big( \mathrm{vec}(\mathcal{C}_1)^T, \mathrm{vec}(\mathcal{C}_2)^T, \mathrm{vec}(\mathcal{C}_3)^T, \mathrm{vec}(\mathcal{C}_4)^T \big)^T,$$
$$\mathrm{Kro}(L_{A_i^{(n)}}) = \sum_{n=1}^{N} I_{I_N} \otimes \cdots \otimes I_{I_{n+1}} \otimes A_i^{(n)} \otimes I_{I_{n-1}} \otimes \cdots \otimes I_{I_1}, \quad i = 1, 2, 3, 4, \tag{9}$$
$$\mathrm{Kro}(L_{B_i^{(n)}}) = \sum_{n=1}^{N} I_{I_N} \otimes \cdots \otimes I_{I_{n+1}} \otimes B_i^{(n)} \otimes I_{I_{n-1}} \otimes \cdots \otimes I_{I_1}, \quad i = 1, 2, 3, 4,$$
and $I_m$ denotes the $m \times m$ identity matrix.
Proof of Lemma 1. 
We apply the definition of the $n$-mode product of quaternion tensors to (2):
$$\begin{aligned}
&\sum_{n=1}^{N} \mathcal{X} \times_n A^{(n)} + \mathcal{Y} \times_n B^{(n)} \\
={}& \sum_{n=1}^{N} (\mathcal{X}_1 + \mathcal{X}_2 \mathbf{i} + \mathcal{X}_3 \mathbf{j} + \mathcal{X}_4 \mathbf{k}) \times_n \big( A_1^{(n)} + A_2^{(n)} \mathbf{i} + A_3^{(n)} \mathbf{j} + A_4^{(n)} \mathbf{k} \big) + (\mathcal{Y}_1 + \mathcal{Y}_2 \mathbf{i} + \mathcal{Y}_3 \mathbf{j} + \mathcal{Y}_4 \mathbf{k}) \times_n \big( B_1^{(n)} + B_2^{(n)} \mathbf{i} + B_3^{(n)} \mathbf{j} + B_4^{(n)} \mathbf{k} \big) \\
={}& \sum_{n=1}^{N} \big( \mathcal{X}_1 \times_n A_1^{(n)} - \mathcal{X}_2 \times_n A_2^{(n)} - \mathcal{X}_3 \times_n A_3^{(n)} - \mathcal{X}_4 \times_n A_4^{(n)} + \mathcal{Y}_1 \times_n B_1^{(n)} - \mathcal{Y}_2 \times_n B_2^{(n)} - \mathcal{Y}_3 \times_n B_3^{(n)} - \mathcal{Y}_4 \times_n B_4^{(n)} \big) \\
&+ \sum_{n=1}^{N} \big( \mathcal{X}_1 \times_n A_2^{(n)} + \mathcal{X}_2 \times_n A_1^{(n)} + \mathcal{X}_3 \times_n A_4^{(n)} - \mathcal{X}_4 \times_n A_3^{(n)} + \mathcal{Y}_1 \times_n B_2^{(n)} + \mathcal{Y}_2 \times_n B_1^{(n)} + \mathcal{Y}_3 \times_n B_4^{(n)} - \mathcal{Y}_4 \times_n B_3^{(n)} \big) \mathbf{i} \\
&+ \sum_{n=1}^{N} \big( \mathcal{X}_1 \times_n A_3^{(n)} - \mathcal{X}_2 \times_n A_4^{(n)} + \mathcal{X}_3 \times_n A_1^{(n)} + \mathcal{X}_4 \times_n A_2^{(n)} + \mathcal{Y}_1 \times_n B_3^{(n)} - \mathcal{Y}_2 \times_n B_4^{(n)} + \mathcal{Y}_3 \times_n B_1^{(n)} + \mathcal{Y}_4 \times_n B_2^{(n)} \big) \mathbf{j} \\
&+ \sum_{n=1}^{N} \big( \mathcal{X}_1 \times_n A_4^{(n)} + \mathcal{X}_2 \times_n A_3^{(n)} - \mathcal{X}_3 \times_n A_2^{(n)} + \mathcal{X}_4 \times_n A_1^{(n)} + \mathcal{Y}_1 \times_n B_4^{(n)} + \mathcal{Y}_2 \times_n B_3^{(n)} - \mathcal{Y}_3 \times_n B_2^{(n)} + \mathcal{Y}_4 \times_n B_1^{(n)} \big) \mathbf{k} \\
={}& \mathcal{C}_1 + \mathcal{C}_2 \mathbf{i} + \mathcal{C}_3 \mathbf{j} + \mathcal{C}_4 \mathbf{k}.
\end{aligned}$$
By the definitions of $\Gamma_i$ and $\Phi_i$, Equation (7) holds. To show (8), we apply the operator $\mathrm{vec}$ to $\Gamma_1[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4]$ and $\Phi_1[\mathcal{Y}_1, \mathcal{Y}_2, \mathcal{Y}_3, \mathcal{Y}_4]$, that is,
$$\begin{aligned}
\mathrm{vec}\big( \Gamma_1[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] \big)
&= \mathrm{vec}\big( L_{A_1^{(n)}}(\mathcal{X}_1) - L_{A_2^{(n)}}(\mathcal{X}_2) - L_{A_3^{(n)}}(\mathcal{X}_3) - L_{A_4^{(n)}}(\mathcal{X}_4) \big) \\
&= \mathrm{vec}\big( L_{A_1^{(n)}}(\mathcal{X}_1) \big) - \mathrm{vec}\big( L_{A_2^{(n)}}(\mathcal{X}_2) \big) - \mathrm{vec}\big( L_{A_3^{(n)}}(\mathcal{X}_3) \big) - \mathrm{vec}\big( L_{A_4^{(n)}}(\mathcal{X}_4) \big) \\
&= \mathrm{Kro}(L_{A_1^{(n)}}) \mathrm{vec}(\mathcal{X}_1) - \mathrm{Kro}(L_{A_2^{(n)}}) \mathrm{vec}(\mathcal{X}_2) - \mathrm{Kro}(L_{A_3^{(n)}}) \mathrm{vec}(\mathcal{X}_3) - \mathrm{Kro}(L_{A_4^{(n)}}) \mathrm{vec}(\mathcal{X}_4),
\end{aligned}$$
$$\begin{aligned}
\mathrm{vec}\big( \Phi_1[\mathcal{Y}_1, \mathcal{Y}_2, \mathcal{Y}_3, \mathcal{Y}_4] \big)
&= \mathrm{vec}\big( L_{B_1^{(n)}}(\mathcal{Y}_1) - L_{B_2^{(n)}}(\mathcal{Y}_2) - L_{B_3^{(n)}}(\mathcal{Y}_3) - L_{B_4^{(n)}}(\mathcal{Y}_4) \big) \\
&= \mathrm{Kro}(L_{B_1^{(n)}}) \mathrm{vec}(\mathcal{Y}_1) - \mathrm{Kro}(L_{B_2^{(n)}}) \mathrm{vec}(\mathcal{Y}_2) - \mathrm{Kro}(L_{B_3^{(n)}}) \mathrm{vec}(\mathcal{Y}_3) - \mathrm{Kro}(L_{B_4^{(n)}}) \mathrm{vec}(\mathcal{Y}_4).
\end{aligned}$$
Similarly, we have the following results for the rest of the $\Gamma_i$'s and $\Phi_i$'s:
$$\mathrm{vec}\big( \Gamma_2[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] \big) = \mathrm{Kro}(L_{A_2^{(n)}}) \mathrm{vec}(\mathcal{X}_1) + \mathrm{Kro}(L_{A_1^{(n)}}) \mathrm{vec}(\mathcal{X}_2) + \mathrm{Kro}(L_{A_4^{(n)}}) \mathrm{vec}(\mathcal{X}_3) - \mathrm{Kro}(L_{A_3^{(n)}}) \mathrm{vec}(\mathcal{X}_4),$$
$$\mathrm{vec}\big( \Phi_2[\mathcal{Y}_1, \mathcal{Y}_2, \mathcal{Y}_3, \mathcal{Y}_4] \big) = \mathrm{Kro}(L_{B_2^{(n)}}) \mathrm{vec}(\mathcal{Y}_1) + \mathrm{Kro}(L_{B_1^{(n)}}) \mathrm{vec}(\mathcal{Y}_2) + \mathrm{Kro}(L_{B_4^{(n)}}) \mathrm{vec}(\mathcal{Y}_3) - \mathrm{Kro}(L_{B_3^{(n)}}) \mathrm{vec}(\mathcal{Y}_4),$$
$$\mathrm{vec}\big( \Gamma_3[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] \big) = \mathrm{Kro}(L_{A_3^{(n)}}) \mathrm{vec}(\mathcal{X}_1) - \mathrm{Kro}(L_{A_4^{(n)}}) \mathrm{vec}(\mathcal{X}_2) + \mathrm{Kro}(L_{A_1^{(n)}}) \mathrm{vec}(\mathcal{X}_3) + \mathrm{Kro}(L_{A_2^{(n)}}) \mathrm{vec}(\mathcal{X}_4),$$
$$\mathrm{vec}\big( \Phi_3[\mathcal{Y}_1, \mathcal{Y}_2, \mathcal{Y}_3, \mathcal{Y}_4] \big) = \mathrm{Kro}(L_{B_3^{(n)}}) \mathrm{vec}(\mathcal{Y}_1) - \mathrm{Kro}(L_{B_4^{(n)}}) \mathrm{vec}(\mathcal{Y}_2) + \mathrm{Kro}(L_{B_1^{(n)}}) \mathrm{vec}(\mathcal{Y}_3) + \mathrm{Kro}(L_{B_2^{(n)}}) \mathrm{vec}(\mathcal{Y}_4),$$
$$\mathrm{vec}\big( \Gamma_4[\mathcal{X}_1, \mathcal{X}_2, \mathcal{X}_3, \mathcal{X}_4] \big) = \mathrm{Kro}(L_{A_4^{(n)}}) \mathrm{vec}(\mathcal{X}_1) + \mathrm{Kro}(L_{A_3^{(n)}}) \mathrm{vec}(\mathcal{X}_2) - \mathrm{Kro}(L_{A_2^{(n)}}) \mathrm{vec}(\mathcal{X}_3) + \mathrm{Kro}(L_{A_1^{(n)}}) \mathrm{vec}(\mathcal{X}_4),$$
$$\mathrm{vec}\big( \Phi_4[\mathcal{Y}_1, \mathcal{Y}_2, \mathcal{Y}_3, \mathcal{Y}_4] \big) = \mathrm{Kro}(L_{B_4^{(n)}}) \mathrm{vec}(\mathcal{Y}_1) + \mathrm{Kro}(L_{B_3^{(n)}}) \mathrm{vec}(\mathcal{Y}_2) - \mathrm{Kro}(L_{B_2^{(n)}}) \mathrm{vec}(\mathcal{Y}_3) + \mathrm{Kro}(L_{B_1^{(n)}}) \mathrm{vec}(\mathcal{Y}_4).$$
Combining the above equations, we obtain the system (8).    □
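The vectorization identity $\mathrm{vec}(L_{A^{(n)}}(\mathcal{X})) = \mathrm{Kro}(L_{A^{(n)}})\,\mathrm{vec}(\mathcal{X})$ underlying this proof can be checked numerically in the real case. A NumPy sketch (our own helper names; `vec` uses column-major ordering, consistent with $\mathrm{vec}(\mathcal{X}) = \mathrm{vec}(\mathcal{X}_{[1]})$):

```python
import numpy as np
from functools import reduce

def mode_n(X, A, n):
    """n-mode product (zero-based n)."""
    return np.moveaxis(np.tensordot(A, X, axes=(1, n)), 0, n)

def kro(As, dims):
    """Kro(L_A) = sum_n I_{I_N} x ... x I_{I_{n+1}} x A^(n) x I_{I_{n-1}} x ... x I_{I_1}."""
    N = len(dims)
    total = 0
    for n in range(N):
        # Kronecker factors listed from mode N down to mode 1
        factors = [As[n] if k == n else np.eye(dims[k]) for k in range(N - 1, -1, -1)]
        total = total + reduce(np.kron, factors)
    return total

rng = np.random.default_rng(2)
dims = (2, 3, 2)
As = [rng.standard_normal((d, d)) for d in dims]
X = rng.standard_normal(dims)
LX = sum(mode_n(X, As[n], n) for n in range(len(dims)))
vec = lambda T: T.reshape(-1, order='F')   # column-major vec, mode 1 fastest
assert np.allclose(vec(LX), kro(As, dims) @ vec(X))
```

This is the real-valued building block; the full system (8) stacks four such blocks with the quaternion sign pattern of (5) and (6).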
Lemma 2
([8,50]). Assume that $A \in \mathbb{R}^{m \times n}$, $b \in \mathbb{R}^m$, and the linear matrix equation $Ax = b$ has a solution $\tilde{x} \in \mathcal{R}(A^T)$, the range of $A^T$. Then $\tilde{x}$ is the unique minimum-norm solution of the equation $Ax = b$.
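Lemma 2 is the familiar fact behind pseudoinverse solves: the minimum-norm solution of a consistent system lies in the row space of $A$. A NumPy sketch (our own illustrative setup, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((3, 6))          # underdetermined: many solutions
x_some = rng.standard_normal(6)          # an arbitrary solution generator
b = A @ x_some

x_min = np.linalg.pinv(A) @ b            # minimum-norm solution
assert np.allclose(A @ x_min, b)         # it solves the system ...
w, *_ = np.linalg.lstsq(A.T, x_min, rcond=None)
assert np.allclose(A.T @ w, x_min)       # ... and lies in R(A^T)
# any other solution is at least as long
assert np.linalg.norm(x_min) <= np.linalg.norm(x_some) + 1e-12
```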
From Lemmas 1 and 2, it is straightforward to observe that the unique minimal Frobenius norm solution of Equation (2) can be characterized as follows:
Theorem 1. 
The tensor Equation (2) has a unique solution with minimal Frobenius norm if and only if the matrix Equation (8) has a solution $\tilde{z} \in \mathcal{R}([M_A, M_B]^T)$. In this case, $\tilde{z}$ is the unique minimum-norm solution of the matrix Equation (8).
Given fixed matrices $A^{(n)} \in \mathbb{R}^{I_n \times I_n}$, $n = 1, 2, \ldots, N$, we define the linear operator
$$L_{A^{(n)}}(\mathcal{X}) = \mathcal{X} \times_1 A^{(1)} + \mathcal{X} \times_2 A^{(2)} + \cdots + \mathcal{X} \times_N A^{(N)} \quad \text{for any } \mathcal{X} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}.$$
Using the property $\langle \mathcal{X}, \mathcal{Y} \times_n A^{(n)} \rangle = \langle \mathcal{X} \times_n (A^{(n)})^T, \mathcal{Y} \rangle$ from [10], the following lemma is easily proven.
Lemma 3.
Let $A^{(n)} \in \mathbb{R}^{I_n \times I_n}$, $n = 1, 2, \ldots, N$, and $\mathcal{X}, \mathcal{Y} \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$. Then
$$\langle L_{A^{(n)}}(\mathcal{X}), \mathcal{Y} \rangle = \langle \mathcal{X}, L_{A^{(n)}}^*(\mathcal{Y}) \rangle, \tag{10}$$
where
$$L_{A^{(n)}}^*(\mathcal{Y}) = \mathcal{Y} \times_1 (A^{(1)})^T + \mathcal{Y} \times_2 (A^{(2)})^T + \cdots + \mathcal{Y} \times_N (A^{(N)})^T.$$
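The adjoint relation of Lemma 3 is easy to verify numerically in the real case. A NumPy sketch (our own helper names `L` and `L_adj`):

```python
import numpy as np

def mode_n(X, A, n):
    """n-mode product (zero-based n)."""
    return np.moveaxis(np.tensordot(A, X, axes=(1, n)), 0, n)

def L(X, As):
    """L_{A^(n)}(X) = sum_n X x_n A^(n)."""
    return sum(mode_n(X, A, n) for n, A in enumerate(As))

def L_adj(Y, As):
    """Adjoint operator: transposes every factor matrix."""
    return sum(mode_n(Y, A.T, n) for n, A in enumerate(As))

rng = np.random.default_rng(4)
dims = (2, 3, 4)
As = [rng.standard_normal((d, d)) for d in dims]
X, Y = rng.standard_normal(dims), rng.standard_normal(dims)
# <L(X), Y> == <X, L*(Y)> with the Frobenius inner product
assert np.isclose(np.sum(L(X, As) * Y), np.sum(X * L_adj(Y, As)))
```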
Clearly, L A ( n ) defined above is a linear mapping. The following lemma provides the uniqueness of the dual mapping for these kinds of linear mappings.
Lemma 4
([43]). Let $\mathcal{N}$ be a linear mapping from the tensor space $\mathbb{R}^{I_1 \times \cdots \times I_N}$ to the tensor space $\mathbb{R}^{J_1 \times \cdots \times J_N}$. Then there exists a unique linear mapping $\mathcal{M}$ from $\mathbb{R}^{J_1 \times \cdots \times J_N}$ to $\mathbb{R}^{I_1 \times \cdots \times I_N}$ such that, for any tensors $\mathcal{X} \in \mathbb{R}^{I_1 \times \cdots \times I_N}$ and $\mathcal{Y} \in \mathbb{R}^{J_1 \times \cdots \times J_N}$,
$$\langle \mathcal{N}(\mathcal{X}), \mathcal{Y} \rangle = \langle \mathcal{X}, \mathcal{M}(\mathcal{Y}) \rangle.$$
Finally, we use linear operators L and L * to describe the inner products involving Γ i and Φ i , which we will use in the next sections.
Lemma 5.
Let $\Gamma_i[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4]$ and $\Phi_i[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4]$ $(i = 1, 2, 3, 4)$ be defined by (5) and (6), and let $\mathcal{W}_i \in \mathbb{R}^{I_1 \times \cdots \times I_N}$ $(i = 1, 2, 3, 4)$. Then
$$\sum_{i=1}^{4} \langle \Gamma_i[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4], \mathcal{W}_i \rangle = \sum_{i=1}^{4} \langle \mathcal{Z}_i, \Gamma_i^*[\mathcal{W}_1, \mathcal{W}_2, \mathcal{W}_3, \mathcal{W}_4] \rangle, \qquad \sum_{i=1}^{4} \langle \Phi_i[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4], \mathcal{W}_i \rangle = \sum_{i=1}^{4} \langle \mathcal{Z}_i, \Phi_i^*[\mathcal{W}_1, \mathcal{W}_2, \mathcal{W}_3, \mathcal{W}_4] \rangle,$$
where
$$\begin{aligned}
\Gamma_1^*[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4] &= L_{A_1^{(n)}}^*(\mathcal{Z}_1) + L_{A_2^{(n)}}^*(\mathcal{Z}_2) + L_{A_3^{(n)}}^*(\mathcal{Z}_3) + L_{A_4^{(n)}}^*(\mathcal{Z}_4), \\
\Gamma_2^*[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4] &= -L_{A_2^{(n)}}^*(\mathcal{Z}_1) + L_{A_1^{(n)}}^*(\mathcal{Z}_2) - L_{A_4^{(n)}}^*(\mathcal{Z}_3) + L_{A_3^{(n)}}^*(\mathcal{Z}_4), \\
\Gamma_3^*[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4] &= -L_{A_3^{(n)}}^*(\mathcal{Z}_1) + L_{A_4^{(n)}}^*(\mathcal{Z}_2) + L_{A_1^{(n)}}^*(\mathcal{Z}_3) - L_{A_2^{(n)}}^*(\mathcal{Z}_4), \\
\Gamma_4^*[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4] &= -L_{A_4^{(n)}}^*(\mathcal{Z}_1) - L_{A_3^{(n)}}^*(\mathcal{Z}_2) + L_{A_2^{(n)}}^*(\mathcal{Z}_3) + L_{A_1^{(n)}}^*(\mathcal{Z}_4),
\end{aligned} \tag{11}$$
$$\begin{aligned}
\Phi_1^*[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4] &= L_{B_1^{(n)}}^*(\mathcal{Z}_1) + L_{B_2^{(n)}}^*(\mathcal{Z}_2) + L_{B_3^{(n)}}^*(\mathcal{Z}_3) + L_{B_4^{(n)}}^*(\mathcal{Z}_4), \\
\Phi_2^*[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4] &= -L_{B_2^{(n)}}^*(\mathcal{Z}_1) + L_{B_1^{(n)}}^*(\mathcal{Z}_2) - L_{B_4^{(n)}}^*(\mathcal{Z}_3) + L_{B_3^{(n)}}^*(\mathcal{Z}_4), \\
\Phi_3^*[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4] &= -L_{B_3^{(n)}}^*(\mathcal{Z}_1) + L_{B_4^{(n)}}^*(\mathcal{Z}_2) + L_{B_1^{(n)}}^*(\mathcal{Z}_3) - L_{B_2^{(n)}}^*(\mathcal{Z}_4), \\
\Phi_4^*[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4] &= -L_{B_4^{(n)}}^*(\mathcal{Z}_1) - L_{B_3^{(n)}}^*(\mathcal{Z}_2) + L_{B_2^{(n)}}^*(\mathcal{Z}_3) + L_{B_1^{(n)}}^*(\mathcal{Z}_4),
\end{aligned} \tag{12}$$
$$L_{A_i^{(n)}}^*(\mathcal{X}) = \mathcal{X} \times_1 (A_i^{(1)})^T + \mathcal{X} \times_2 (A_i^{(2)})^T + \cdots + \mathcal{X} \times_N (A_i^{(N)})^T, \quad i = 1, 2, 3, 4,$$
$$L_{B_i^{(n)}}^*(\mathcal{X}) = \mathcal{X} \times_1 (B_i^{(1)})^T + \mathcal{X} \times_2 (B_i^{(2)})^T + \cdots + \mathcal{X} \times_N (B_i^{(N)})^T, \quad i = 1, 2, 3, 4.$$
Proof of Lemma 5. 
For the first equality, we split $\sum_{i=1}^{4} \langle \Gamma_i[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4], \mathcal{W}_i \rangle$ into four parts by $i$ and apply Lemma 3 to each part, that is,
$$\langle \Gamma_1[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4], \mathcal{W}_1 \rangle = \langle L_{A_1^{(n)}}(\mathcal{Z}_1), \mathcal{W}_1 \rangle - \langle L_{A_2^{(n)}}(\mathcal{Z}_2), \mathcal{W}_1 \rangle - \langle L_{A_3^{(n)}}(\mathcal{Z}_3), \mathcal{W}_1 \rangle - \langle L_{A_4^{(n)}}(\mathcal{Z}_4), \mathcal{W}_1 \rangle = \langle \mathcal{Z}_1, L_{A_1^{(n)}}^*(\mathcal{W}_1) \rangle - \langle \mathcal{Z}_2, L_{A_2^{(n)}}^*(\mathcal{W}_1) \rangle - \langle \mathcal{Z}_3, L_{A_3^{(n)}}^*(\mathcal{W}_1) \rangle - \langle \mathcal{Z}_4, L_{A_4^{(n)}}^*(\mathcal{W}_1) \rangle,$$
$$\langle \Gamma_2[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4], \mathcal{W}_2 \rangle = \langle L_{A_2^{(n)}}(\mathcal{Z}_1), \mathcal{W}_2 \rangle + \langle L_{A_1^{(n)}}(\mathcal{Z}_2), \mathcal{W}_2 \rangle + \langle L_{A_4^{(n)}}(\mathcal{Z}_3), \mathcal{W}_2 \rangle - \langle L_{A_3^{(n)}}(\mathcal{Z}_4), \mathcal{W}_2 \rangle = \langle \mathcal{Z}_1, L_{A_2^{(n)}}^*(\mathcal{W}_2) \rangle + \langle \mathcal{Z}_2, L_{A_1^{(n)}}^*(\mathcal{W}_2) \rangle + \langle \mathcal{Z}_3, L_{A_4^{(n)}}^*(\mathcal{W}_2) \rangle - \langle \mathcal{Z}_4, L_{A_3^{(n)}}^*(\mathcal{W}_2) \rangle,$$
$$\langle \Gamma_3[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4], \mathcal{W}_3 \rangle = \langle L_{A_3^{(n)}}(\mathcal{Z}_1), \mathcal{W}_3 \rangle - \langle L_{A_4^{(n)}}(\mathcal{Z}_2), \mathcal{W}_3 \rangle + \langle L_{A_1^{(n)}}(\mathcal{Z}_3), \mathcal{W}_3 \rangle + \langle L_{A_2^{(n)}}(\mathcal{Z}_4), \mathcal{W}_3 \rangle = \langle \mathcal{Z}_1, L_{A_3^{(n)}}^*(\mathcal{W}_3) \rangle - \langle \mathcal{Z}_2, L_{A_4^{(n)}}^*(\mathcal{W}_3) \rangle + \langle \mathcal{Z}_3, L_{A_1^{(n)}}^*(\mathcal{W}_3) \rangle + \langle \mathcal{Z}_4, L_{A_2^{(n)}}^*(\mathcal{W}_3) \rangle,$$
$$\langle \Gamma_4[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4], \mathcal{W}_4 \rangle = \langle L_{A_4^{(n)}}(\mathcal{Z}_1), \mathcal{W}_4 \rangle + \langle L_{A_3^{(n)}}(\mathcal{Z}_2), \mathcal{W}_4 \rangle - \langle L_{A_2^{(n)}}(\mathcal{Z}_3), \mathcal{W}_4 \rangle + \langle L_{A_1^{(n)}}(\mathcal{Z}_4), \mathcal{W}_4 \rangle = \langle \mathcal{Z}_1, L_{A_4^{(n)}}^*(\mathcal{W}_4) \rangle + \langle \mathcal{Z}_2, L_{A_3^{(n)}}^*(\mathcal{W}_4) \rangle - \langle \mathcal{Z}_3, L_{A_2^{(n)}}^*(\mathcal{W}_4) \rangle + \langle \mathcal{Z}_4, L_{A_1^{(n)}}^*(\mathcal{W}_4) \rangle.$$
By adding up the above four parts, we have
$$\sum_{i=1}^{4} \langle \Gamma_i[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4], \mathcal{W}_i \rangle = \sum_{i=1}^{4} \langle \mathcal{Z}_i, \Gamma_i^*[\mathcal{W}_1, \mathcal{W}_2, \mathcal{W}_3, \mathcal{W}_4] \rangle,$$
where Γ i * [ W 1 , W 2 , W 3 , W 4 ] ’s are defined by (11). Using a similar process, we can obtain the second equality
$$\sum_{i=1}^{4} \langle \Phi_i[\mathcal{Z}_1, \mathcal{Z}_2, \mathcal{Z}_3, \mathcal{Z}_4], \mathcal{W}_i \rangle = \sum_{i=1}^{4} \langle \mathcal{Z}_i, \Phi_i^*[\mathcal{W}_1, \mathcal{W}_2, \mathcal{W}_3, \mathcal{W}_4] \rangle,$$
where Φ i * [ W 1 , W 2 , W 3 , W 4 ] ’s are defined by (12).    □

3. An Iterative Algorithm for Solving Problems 1.1 and 1.2

The purpose of this section is to propose an iterative algorithm for obtaining the solution of the Sylvester tensor Equation (2). It is well known that the classical bi-conjugate gradient (BiCG) method for solving nonsymmetric linear systems is feasible and efficient; one may refer to [44,51,52,53,54]. We extend the BiCG method in tensor format (BTF) to solve Equation (2) and discuss its convergence. By Lemma 1, the tensor Equation (2) and the linear system (8) clearly have the same solution. However, the matrix $[M_A, M_B]$ in Equation (8) is usually so large that forming it explicitly wastes computation time and memory. Moreover, Beik et al. [55] demonstrated that algorithms using tensor formats generally outperform their classical counterparts in efficiency. Motivated by these observations, we propose the following iterative algorithm, formulated in tensor format, for solving the tensor Equation (2):
Note that Γ i , Φ i and Γ i * , Φ i * are defined by (5), (6) and (11), (12), respectively. Next, we discuss some bi-orthogonality properties of Algorithm 1.
Algorithm 1 BiCG-BTF method for solving Equation (2).
Input: $A_i^{(n)}, B_i^{(n)} \in \mathbb{R}^{I_n \times I_n}$, $\mathcal{C}_i \in \mathbb{R}^{I_1 \times \cdots \times I_N}$, $n = 1, 2, \ldots, N$; $i = 1, 2, 3, 4$.
Output: The residual norm $\sum_{i=1}^{4} \| \mathcal{R}_i(\cdot) \|$ and the solutions $\mathcal{X}_i(\cdot), \mathcal{Y}_i(\cdot)$, $i = 1, 2, 3, 4$.
Initialization: $\mathcal{X}_i(1), \mathcal{Y}_i(1) \in \mathbb{R}^{I_1 \times \cdots \times I_N}$, $i = 1, 2, 3, 4$;
(i)
Compute
$\mathcal{R}_i(1) := \mathcal{C}_i - \Gamma_i[\mathcal{X}_1(1), \mathcal{X}_2(1), \mathcal{X}_3(1), \mathcal{X}_4(1)] - \Phi_i[\mathcal{Y}_1(1), \mathcal{Y}_2(1), \mathcal{Y}_3(1), \mathcal{Y}_4(1)]$, $i = 1, 2, 3, 4$;
Set $\mathcal{R}_i^*(1) := \mathcal{R}_i(1)$; $\mathcal{P}_i(1) := \mathcal{R}_i^*(1)$; $\mathcal{P}_i^*(1) := \mathcal{P}_i(1)$, $i = 1, 2, 3, 4$;
Compute the norm $R_{\mathrm{norm}} := \sum_{i=1}^{4} \| \mathcal{R}_i(1) \|$;
Set $k := 1$;
(ii)
If $R_{\mathrm{norm}} = 0$, then stop;
(iii)
Otherwise, compute
$\mathcal{Q}_{ix}(k) := \Gamma_i[\mathcal{P}_1(k), \mathcal{P}_2(k), \mathcal{P}_3(k), \mathcal{P}_4(k)]$, $i = 1, 2, 3, 4$;
$\mathcal{Q}_{iy}(k) := \Phi_i[\mathcal{P}_1(k), \mathcal{P}_2(k), \mathcal{P}_3(k), \mathcal{P}_4(k)]$, $i = 1, 2, 3, 4$;
$\alpha(k) := \sum_{i=1}^{4} \langle \mathcal{R}_i(k), \mathcal{R}_i^*(k) \rangle \big/ \sum_{i=1}^{4} \langle \mathcal{P}_i^*(k), \mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k) \rangle$;
$\mathcal{X}_i(k+1) := \mathcal{X}_i(k) + \alpha(k) \mathcal{P}_i(k)$, $i = 1, 2, 3, 4$;
$\mathcal{Y}_i(k+1) := \mathcal{Y}_i(k) + \alpha(k) \mathcal{P}_i(k)$, $i = 1, 2, 3, 4$;
$\mathcal{R}_i(k+1) := \mathcal{R}_i(k) - \alpha(k)(\mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k))$, $i = 1, 2, 3, 4$;
$\mathcal{Q}_{ix}^*(k) := \Gamma_i^*[\mathcal{P}_1^*(k), \mathcal{P}_2^*(k), \mathcal{P}_3^*(k), \mathcal{P}_4^*(k)]$, $i = 1, 2, 3, 4$;
$\mathcal{Q}_{iy}^*(k) := \Phi_i^*[\mathcal{P}_1^*(k), \mathcal{P}_2^*(k), \mathcal{P}_3^*(k), \mathcal{P}_4^*(k)]$, $i = 1, 2, 3, 4$;
$\mathcal{R}_i^*(k+1) := \mathcal{R}_i^*(k) - \alpha(k)(\mathcal{Q}_{ix}^*(k) + \mathcal{Q}_{iy}^*(k))$, $i = 1, 2, 3, 4$;
$\beta(k) := \sum_{i=1}^{4} \langle \mathcal{R}_i(k+1), \mathcal{R}_i^*(k+1) \rangle \big/ \sum_{i=1}^{4} \langle \mathcal{R}_i(k), \mathcal{R}_i^*(k) \rangle$;
$\mathcal{P}_i(k+1) := \mathcal{R}_i(k+1) + \beta(k) \mathcal{P}_i(k)$, $i = 1, 2, 3, 4$;
$\mathcal{P}_i^*(k+1) := \mathcal{R}_i^*(k+1) + \beta(k) \mathcal{P}_i^*(k)$, $i = 1, 2, 3, 4$;
$R_{\mathrm{norm}} := \sum_{i=1}^{4} \| \mathcal{R}_i(k+1) \|$;
(iv)
Set $k := k + 1$ and go to (ii);
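To make the iteration concrete, here is a simplified real-valued NumPy sketch of BiCG in tensor format for the special case $\sum_n \mathcal{X} \times_n A^{(n)} = \mathcal{C}$ (Equation (3) over $\mathbb{R}$). This is our illustrative reduction of Algorithm 1, not the paper's quaternion implementation; the helper names are ours, and the test problem uses well-conditioned, diagonally dominant factor matrices:

```python
import numpy as np

def mode_n(X, A, n):
    return np.moveaxis(np.tensordot(A, X, axes=(1, n)), 0, n)

def L(X, As):     return sum(mode_n(X, A, n) for n, A in enumerate(As))
def L_adj(Y, As): return sum(mode_n(Y, A.T, n) for n, A in enumerate(As))

def bicg_btf(As, C, tol=1e-12, maxit=200):
    """BiCG in tensor format for sum_n X x_n A^(n) = C (real case)."""
    X = np.zeros_like(C)
    R = C - L(X, As)                      # residual
    Rs, P, Ps = R.copy(), R.copy(), R.copy()   # shadow residual, directions
    for _ in range(maxit):
        if np.linalg.norm(R) <= tol:
            break
        Q, Qs = L(P, As), L_adj(Ps, As)   # operator and adjoint applications
        rho = np.sum(R * Rs)
        alpha = rho / np.sum(Ps * Q)
        X = X + alpha * P
        R = R - alpha * Q
        Rs = Rs - alpha * Qs
        beta = np.sum(R * Rs) / rho
        P = R + beta * P
        Ps = Rs + beta * Ps
    return X

rng = np.random.default_rng(5)
dims = (2, 3, 2)
As = [2.0 * np.eye(d) + 0.3 * rng.standard_normal((d, d)) for d in dims]
X_true = rng.standard_normal(dims)
C = L(X_true, As)
X = bicg_btf(As, C)
```

Algorithm 1 follows the same pattern, with the single operator $L$ replaced by the four coupled pairs $\Gamma_i + \Phi_i$ and their adjoints $\Gamma_i^* + \Phi_i^*$.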
Theorem 2. 
Assume that the iterative sequences $\{\mathcal{R}_i(k)\}$, $\{\mathcal{R}_i^*(k)\}$, $\{\mathcal{P}_i(k)\}$, $\{\mathcal{P}_i^*(k)\}$, $\{\mathcal{Q}_{ix}(k)\}$, and $\{\mathcal{Q}_{iy}(k)\}$ $(i = 1, 2, 3, 4)$ are generated by Algorithm 1. Then
$$\sum_{i=1}^{4} \langle \mathcal{R}_i(l), \mathcal{R}_i^*(m) \rangle = 0, \quad l \neq m, \tag{13}$$
$$\sum_{i=1}^{4} \langle \mathcal{Q}_{ix}(l) + \mathcal{Q}_{iy}(l), \mathcal{P}_i^*(m) \rangle = 0, \quad l \neq m, \tag{14}$$
$$\sum_{i=1}^{4} \langle \mathcal{R}_i(l), \mathcal{P}_i^*(m) \rangle = 0, \quad l > m. \tag{15}$$
Proof of Theorem 2. 
We apply mathematical induction on $k$. First, consider $1 \le m < l \le k$.
When $k = 2$, the conclusion holds, as the following calculations show:
$$\sum_{i=1}^{4} \langle \mathcal{R}_i(2), \mathcal{R}_i^*(1) \rangle = \sum_{i=1}^{4} \big\langle \mathcal{R}_i(1) - \alpha(1)\big(\mathcal{Q}_{ix}(1) + \mathcal{Q}_{iy}(1)\big), \mathcal{R}_i^*(1) \big\rangle = \sum_{i=1}^{4} \langle \mathcal{R}_i(1), \mathcal{R}_i^*(1) \rangle - \frac{\sum_{i=1}^{4} \langle \mathcal{R}_i(1), \mathcal{R}_i^*(1) \rangle}{\sum_{i=1}^{4} \langle \mathcal{P}_i^*(1), \mathcal{Q}_{ix}(1) + \mathcal{Q}_{iy}(1) \rangle} \sum_{i=1}^{4} \langle \mathcal{P}_i^*(1), \mathcal{Q}_{ix}(1) + \mathcal{Q}_{iy}(1) \rangle = 0,$$
and
$$\begin{aligned}
&\sum_{i=1}^{4} \langle \mathcal{Q}_{ix}(2) + \mathcal{Q}_{iy}(2), \mathcal{P}_i^*(1) \rangle \\
={}& \sum_{i=1}^{4} \langle \Gamma_i[\mathcal{P}_1(2), \mathcal{P}_2(2), \mathcal{P}_3(2), \mathcal{P}_4(2)], \mathcal{P}_i^*(1) \rangle + \sum_{i=1}^{4} \langle \Phi_i[\mathcal{P}_1(2), \mathcal{P}_2(2), \mathcal{P}_3(2), \mathcal{P}_4(2)], \mathcal{P}_i^*(1) \rangle \\
={}& \sum_{i=1}^{4} \langle \mathcal{P}_i(2), \Gamma_i^*[\mathcal{P}_1^*(1), \mathcal{P}_2^*(1), \mathcal{P}_3^*(1), \mathcal{P}_4^*(1)] \rangle + \sum_{i=1}^{4} \langle \mathcal{P}_i(2), \Phi_i^*[\mathcal{P}_1^*(1), \mathcal{P}_2^*(1), \mathcal{P}_3^*(1), \mathcal{P}_4^*(1)] \rangle \\
={}& \sum_{i=1}^{4} \big( \langle \mathcal{R}_i(2), \mathcal{Q}_{ix}^*(1) \rangle + \beta(1) \langle \mathcal{P}_i(1), \mathcal{Q}_{ix}^*(1) \rangle \big) + \sum_{i=1}^{4} \big( \langle \mathcal{R}_i(2), \mathcal{Q}_{iy}^*(1) \rangle + \beta(1) \langle \mathcal{P}_i(1), \mathcal{Q}_{iy}^*(1) \rangle \big) \\
={}& \sum_{i=1}^{4} \langle \mathcal{R}_i(2), \mathcal{Q}_{ix}^*(1) \rangle + \frac{\sum_{i=1}^{4} \big\langle \mathcal{R}_i(2), \mathcal{R}_i^*(1) - \alpha(1)\big(\mathcal{Q}_{ix}^*(1) + \mathcal{Q}_{iy}^*(1)\big) \big\rangle}{\sum_{i=1}^{4} \langle \mathcal{R}_i(1), \mathcal{R}_i^*(1) \rangle} \sum_{i=1}^{4} \langle \mathcal{P}_i(1), \mathcal{Q}_{ix}^*(1) \rangle \\
&+ \sum_{i=1}^{4} \langle \mathcal{R}_i(2), \mathcal{Q}_{iy}^*(1) \rangle + \frac{\sum_{i=1}^{4} \big\langle \mathcal{R}_i(2), \mathcal{R}_i^*(1) - \alpha(1)\big(\mathcal{Q}_{ix}^*(1) + \mathcal{Q}_{iy}^*(1)\big) \big\rangle}{\sum_{i=1}^{4} \langle \mathcal{R}_i(1), \mathcal{R}_i^*(1) \rangle} \sum_{i=1}^{4} \langle \mathcal{P}_i(1), \mathcal{Q}_{iy}^*(1) \rangle \\
={}& \sum_{i=1}^{4} \langle \mathcal{R}_i(2), \mathcal{Q}_{ix}^*(1) \rangle - \frac{\sum_{i=1}^{4} \langle \mathcal{R}_i(2), \mathcal{Q}_{ix}^*(1) + \mathcal{Q}_{iy}^*(1) \rangle}{\sum_{i=1}^{4} \langle \mathcal{P}_i^*(1), \mathcal{Q}_{ix}(1) + \mathcal{Q}_{iy}(1) \rangle} \sum_{i=1}^{4} \langle \mathcal{P}_i(1), \mathcal{Q}_{ix}^*(1) \rangle \\
&+ \sum_{i=1}^{4} \langle \mathcal{R}_i(2), \mathcal{Q}_{iy}^*(1) \rangle - \frac{\sum_{i=1}^{4} \langle \mathcal{R}_i(2), \mathcal{Q}_{ix}^*(1) + \mathcal{Q}_{iy}^*(1) \rangle}{\sum_{i=1}^{4} \langle \mathcal{P}_i^*(1), \mathcal{Q}_{ix}(1) + \mathcal{Q}_{iy}(1) \rangle} \sum_{i=1}^{4} \langle \mathcal{P}_i(1), \mathcal{Q}_{iy}^*(1) \rangle \\
={}& \sum_{i=1}^{4} \langle \mathcal{R}_i(2), \mathcal{Q}_{ix}^*(1) + \mathcal{Q}_{iy}^*(1) \rangle - \frac{\sum_{i=1}^{4} \langle \mathcal{R}_i(2), \mathcal{Q}_{ix}^*(1) + \mathcal{Q}_{iy}^*(1) \rangle}{\sum_{i=1}^{4} \langle \mathcal{P}_i^*(1), \mathcal{Q}_{ix}(1) + \mathcal{Q}_{iy}(1) \rangle} \sum_{i=1}^{4} \langle \mathcal{P}_i^*(1), \mathcal{Q}_{ix}(1) + \mathcal{Q}_{iy}(1) \rangle = 0.
\end{aligned}$$
Clearly, $\sum_{i=1}^{4} \langle \mathcal{R}_i(2), \mathcal{P}_i^*(1) \rangle = 0$. Now, assume that (13) and (14) hold for $1 \le m < l \le k$ $(k > 2)$. Then,
$$\sum_{i=1}^{4} \langle \mathcal{R}_i(k+1), \mathcal{P}_i^*(m) \rangle = \sum_{i=1}^{4} \big\langle \mathcal{R}_i(k) - \alpha(k)\big(\mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k)\big), \mathcal{P}_i^*(m) \big\rangle = \sum_{i=1}^{4} \big( \langle \mathcal{R}_i(k), \mathcal{P}_i^*(m) \rangle - \alpha(k) \langle \mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k), \mathcal{P}_i^*(m) \rangle \big) = 0,$$
and
$$\begin{aligned}
\sum_{i=1}^{4} \langle \mathcal{R}_i(k+1), \mathcal{P}_i^*(k) \rangle &= \sum_{i=1}^{4} \big\langle \mathcal{R}_i(k) - \alpha(k)\big(\mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k)\big), \mathcal{P}_i^*(k) \big\rangle = \sum_{i=1}^{4} \big( \langle \mathcal{R}_i(k), \mathcal{P}_i^*(k) \rangle - \alpha(k) \langle \mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k), \mathcal{P}_i^*(k) \rangle \big) \\
&= \sum_{i=1}^{4} \big( \langle \mathcal{R}_i(k), \mathcal{R}_i^*(k) \rangle + \beta(k-1) \langle \mathcal{R}_i(k), \mathcal{P}_i^*(k-1) \rangle \big) - \frac{\sum_{i=1}^{4} \langle \mathcal{R}_i(k), \mathcal{R}_i^*(k) \rangle}{\sum_{i=1}^{4} \langle \mathcal{P}_i^*(k), \mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k) \rangle} \sum_{i=1}^{4} \langle \mathcal{P}_i^*(k), \mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k) \rangle = 0.
\end{aligned}$$
Hence, the equality (15) holds for all $l > m$. Next, we prove that Equations (13) and (14) hold for $l = k + 1$:
$$\sum_{i=1}^{4} \langle \mathcal{R}_i(k+1), \mathcal{R}_i^*(m) \rangle = \sum_{i=1}^{4} \big\langle \mathcal{R}_i(k+1), \mathcal{P}_i^*(m) - \beta(m-1) \mathcal{P}_i^*(m-1) \big\rangle = \sum_{i=1}^{4} \big( \langle \mathcal{R}_i(k+1), \mathcal{P}_i^*(m) \rangle - \beta(m-1) \langle \mathcal{R}_i(k+1), \mathcal{P}_i^*(m-1) \rangle \big) = 0,$$
$$\sum_{i=1}^{4} \langle \mathcal{R}_i(k+1), \mathcal{R}_i^*(k) \rangle = \sum_{i=1}^{4} \big\langle \mathcal{R}_i(k+1), \mathcal{P}_i^*(k) - \beta(k-1) \mathcal{P}_i^*(k-1) \big\rangle = \sum_{i=1}^{4} \big( \langle \mathcal{R}_i(k+1), \mathcal{P}_i^*(k) \rangle - \beta(k-1) \langle \mathcal{R}_i(k+1), \mathcal{P}_i^*(k-1) \rangle \big) = 0,$$
and
$$\begin{aligned}
&\sum_{i=1}^{4} \langle \mathcal{Q}_{ix}(k+1) + \mathcal{Q}_{iy}(k+1), \mathcal{P}_i^*(m) \rangle = \sum_{i=1}^{4} \langle \mathcal{Q}_{ix}(k+1), \mathcal{P}_i^*(m) \rangle + \sum_{i=1}^{4} \langle \mathcal{Q}_{iy}(k+1), \mathcal{P}_i^*(m) \rangle \\
={}& \sum_{i=1}^{4} \big( \langle \mathcal{R}_i(k+1), \Gamma_i^*[\mathcal{P}_1^*(m), \mathcal{P}_2^*(m), \mathcal{P}_3^*(m), \mathcal{P}_4^*(m)] \rangle + \beta(k) \langle \mathcal{P}_i(k), \Gamma_i^*[\mathcal{P}_1^*(m), \mathcal{P}_2^*(m), \mathcal{P}_3^*(m), \mathcal{P}_4^*(m)] \rangle \big) \\
&+ \sum_{i=1}^{4} \big( \langle \mathcal{R}_i(k+1), \Phi_i^*[\mathcal{P}_1^*(m), \mathcal{P}_2^*(m), \mathcal{P}_3^*(m), \mathcal{P}_4^*(m)] \rangle + \beta(k) \langle \mathcal{P}_i(k), \Phi_i^*[\mathcal{P}_1^*(m), \mathcal{P}_2^*(m), \mathcal{P}_3^*(m), \mathcal{P}_4^*(m)] \rangle \big) \\
={}& \sum_{i=1}^{4} \big( \langle \mathcal{R}_i(k+1), \mathcal{Q}_{ix}^*(m) \rangle + \beta(k) \langle \mathcal{P}_i(k), \mathcal{Q}_{ix}^*(m) \rangle + \langle \mathcal{R}_i(k+1), \mathcal{Q}_{iy}^*(m) \rangle + \beta(k) \langle \mathcal{P}_i(k), \mathcal{Q}_{iy}^*(m) \rangle \big) \\
={}& \sum_{i=1}^{4} \Big( \Big\langle \mathcal{R}_i(k+1), \tfrac{1}{\alpha(m)} \big( \mathcal{R}_i^*(m) - \mathcal{R}_i^*(m+1) \big) \Big\rangle + \beta(k) \Big\langle \mathcal{P}_i(k), \tfrac{1}{\alpha(m)} \big( \mathcal{R}_i^*(m) - \mathcal{R}_i^*(m+1) \big) \Big\rangle \Big) = 0,
\end{aligned}$$
and, for the case $m = k$,
$$\begin{aligned}
&\sum_{i=1}^{4} \langle \mathcal{Q}_{ix}(k+1) + \mathcal{Q}_{iy}(k+1), \mathcal{P}_i^*(k) \rangle \\
={}& \sum_{i=1}^{4} \big( \langle \mathcal{R}_i(k+1), \mathcal{Q}_{ix}^*(k) \rangle + \beta(k) \langle \mathcal{P}_i(k), \mathcal{Q}_{ix}^*(k) \rangle + \langle \mathcal{R}_i(k+1), \mathcal{Q}_{iy}^*(k) \rangle + \beta(k) \langle \mathcal{P}_i(k), \mathcal{Q}_{iy}^*(k) \rangle \big) \\
={}& \sum_{i=1}^{4} \langle \mathcal{R}_i(k+1), \mathcal{Q}_{ix}^*(k) + \mathcal{Q}_{iy}^*(k) \rangle - \frac{\sum_{i=1}^{4} \langle \mathcal{R}_i(k+1), \mathcal{Q}_{ix}^*(k) + \mathcal{Q}_{iy}^*(k) \rangle}{\sum_{i=1}^{4} \langle \mathcal{P}_i(k), \mathcal{Q}_{ix}^*(k) + \mathcal{Q}_{iy}^*(k) \rangle} \sum_{i=1}^{4} \langle \mathcal{P}_i(k), \mathcal{Q}_{ix}^*(k) + \mathcal{Q}_{iy}^*(k) \rangle = 0.
\end{aligned}$$
Similarly, Equations (13) and (14) also hold for $l = k + 1$ and $m = k$. Therefore, (13) and (14) are satisfied for all $l \neq m$. □
Corollary 1. 
Assume the conditions in Theorem 2 are satisfied. Then,
$$\sum_{i=1}^{4} \langle \mathcal{R}_i(k), \mathcal{P}_i^*(k) \rangle = \sum_{i=1}^{4} \langle \mathcal{R}_i(k), \mathcal{R}_i^*(k) \rangle, \tag{16}$$
$$\sum_{i=1}^{4} \langle \mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k), \mathcal{R}_i^*(k) \rangle = \sum_{i=1}^{4} \langle \mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k), \mathcal{P}_i^*(k) \rangle. \tag{17}$$
Proof of Corollary 1. 
From Algorithm 1 and Theorem 2, we have
$$\sum_{i=1}^{4} \langle \mathcal{R}_i(k), \mathcal{P}_i^*(k) \rangle = \sum_{i=1}^{4} \big\langle \mathcal{R}_i(k), \mathcal{R}_i^*(k) + \beta(k-1) \mathcal{P}_i^*(k-1) \big\rangle = \sum_{i=1}^{4} \langle \mathcal{R}_i(k), \mathcal{R}_i^*(k) \rangle,$$
and
$$\sum_{i=1}^{4} \langle \mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k), \mathcal{R}_i^*(k) \rangle = \sum_{i=1}^{4} \big\langle \mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k), \mathcal{P}_i^*(k) - \beta(k-1) \mathcal{P}_i^*(k-1) \big\rangle = \sum_{i=1}^{4} \langle \mathcal{Q}_{ix}(k) + \mathcal{Q}_{iy}(k), \mathcal{P}_i^*(k) \rangle. \quad \square$$
Theorem 3. 
Let the tensor sequences $\{\mathcal{X}_i(k)\}, \{\mathcal{Y}_i(k)\}$ $(i = 1, 2, 3, 4)$ be generated by Algorithm 1. If Algorithm 1 does not break down, then the tensor sequences
$$\big\{ [\mathcal{X}(k), \mathcal{Y}(k)] \mid \mathcal{X}(k) = \mathcal{X}_1(k) + \mathcal{X}_2(k) \mathbf{i} + \mathcal{X}_3(k) \mathbf{j} + \mathcal{X}_4(k) \mathbf{k},\ \mathcal{Y}(k) = \mathcal{Y}_1(k) + \mathcal{Y}_2(k) \mathbf{i} + \mathcal{Y}_3(k) \mathbf{j} + \mathcal{Y}_4(k) \mathbf{k},\ k = 1, 2, \ldots \big\}$$
converge to a solution of Equation (2) within a finite number of iteration steps in the absence of round-off errors.
Proof of Theorem 3. 
We prove that there exists a $k \le 4S_N$ such that $\mathcal{R}_i(k) = 0$. By contradiction, assume that $\mathcal{R}_i(k) \neq 0$, $i = 1, 2, 3, 4$, for all $k \le 4S_N$; then we can compute $\mathcal{R}_i(4S_N + 1)$. Suppose that $\mathcal{R}_i(1), \mathcal{R}_i(2), \ldots, \mathcal{R}_i(4S_N)$ is a linearly dependent sequence; then there exist real numbers $\lambda_{i,1}, \ldots, \lambda_{i,4S_N}$, not all zero, such that
$$\lambda_{i,1} \mathcal{R}_i(1) + \cdots + \lambda_{i,4S_N} \mathcal{R}_i(4S_N) = 0$$
for $i = 1, 2, 3, 4$. Then
$$0 = \sum_{i=1}^{4} \langle \mathcal{R}_i(l), \lambda_{i,1} \mathcal{R}_i(1) + \cdots + \lambda_{i,4S_N} \mathcal{R}_i(4S_N) \rangle = \sum_{i=1}^{4} \lambda_{i,l} \langle \mathcal{R}_i(l), \mathcal{R}_i(l) \rangle,$$
which implies $\sum_{i=1}^{4} \langle \mathcal{R}_i(l), \mathcal{R}_i(l) \rangle = 0$. This is a contradiction, since we cannot compute $\mathcal{R}_i(4S_N + 1)$ in this case. Therefore, there must exist a $k \le 4S_N$ such that $\mathcal{R}_i(k) = 0$; that is, the exact solution of the tensor Equation (2) can be obtained by Algorithm 1 within a finite number of iterations in the absence of round-off errors. □
In the following theorem, we show that by choosing special initial tensors, Algorithm 1 yields the unique minimal Frobenius norm solution of the tensor Equation (2).
Theorem 4. 
By selecting the initial tensors as
$$\mathcal{X}_j(1) = \Gamma_j^*[\mathcal{H}_1(1), \mathcal{H}_2(1), \mathcal{H}_3(1), \mathcal{H}_4(1)], \qquad \mathcal{Y}_j(1) = \Phi_j^*[\mathcal{H}_1(1), \mathcal{H}_2(1), \mathcal{H}_3(1), \mathcal{H}_4(1)], \tag{18}$$
where $\Gamma_j^*, \Phi_j^*$ $(j = 1, 2, 3, 4)$ are defined by (11) and (12) and $\mathcal{H}_i(1) \in \mathbb{R}^{I_1 \times I_2 \times \cdots \times I_N}$ $(i = 1, 2, 3, 4)$ are arbitrary tensors (for instance, we may set $\mathcal{X}_j(1) = \mathcal{O}$, $\mathcal{Y}_j(1) = \mathcal{O}$, $j = 1, 2, 3, 4$), the solution group $\widetilde{\mathcal{X}_j}, \widetilde{\mathcal{Y}_j}$ $(j = 1, 2, 3, 4)$ obtained from Algorithm 1 represents the unique minimal Frobenius norm solution of the tensor Equation (2).
Proof of Theorem 4. 
By selecting the initial tensors as specified in (18), it can be easily verified that the tensors X j ˜ and Y j ˜ ( j = 1 , 2 , 3 , 4 ) obtained from Algorithm 1 will have the following form:
X j ˜ = Γ j * [ H 1 , H 2 , H 3 , H 4 ] , Y j ˜ = Φ j * [ H 1 , H 2 , H 3 , H 4 ] ,
where tensors H i R I 1 × I 2 × × I N ( i = 1 , 2 , 3 , 4 ) . Now, we show that X ˜ = X 1 ˜ + X 2 ˜ i + X 3 ˜ j + X 4 ˜ k , Y ˜ = Y 1 ˜ + Y 2 ˜ i + Y 3 ˜ j + Y 4 ˜ k is the unique solution of the tensor equation with the minimal Frobenius norm (2). Let
$$\tilde{z}=\begin{pmatrix}\operatorname{vec}(\tilde{\mathcal{X}}_{1})\\ \operatorname{vec}(\tilde{\mathcal{X}}_{2})\\ \operatorname{vec}(\tilde{\mathcal{X}}_{3})\\ \operatorname{vec}(\tilde{\mathcal{X}}_{4})\\ \operatorname{vec}(\tilde{\mathcal{Y}}_{1})\\ \operatorname{vec}(\tilde{\mathcal{Y}}_{2})\\ \operatorname{vec}(\tilde{\mathcal{Y}}_{3})\\ \operatorname{vec}(\tilde{\mathcal{Y}}_{4})\end{pmatrix}=[M_{A},M_{B}]^{T}\begin{pmatrix}\operatorname{vec}(\tilde{\mathcal{H}}_{1})\\ \operatorname{vec}(\tilde{\mathcal{H}}_{2})\\ \operatorname{vec}(\tilde{\mathcal{H}}_{3})\\ \operatorname{vec}(\tilde{\mathcal{H}}_{4})\end{pmatrix}\in\mathcal{R}\big([M_{A},M_{B}]^{T}\big),$$
where
$$M_{A}=\begin{pmatrix}\operatorname{Kro}\big(L_{A_{1}}^{(n)}\big)&-\operatorname{Kro}\big(L_{A_{2}}^{(n)}\big)&-\operatorname{Kro}\big(L_{A_{3}}^{(n)}\big)&-\operatorname{Kro}\big(L_{A_{4}}^{(n)}\big)\\ \operatorname{Kro}\big(L_{A_{2}}^{(n)}\big)&\operatorname{Kro}\big(L_{A_{1}}^{(n)}\big)&\operatorname{Kro}\big(L_{A_{4}}^{(n)}\big)&-\operatorname{Kro}\big(L_{A_{3}}^{(n)}\big)\\ \operatorname{Kro}\big(L_{A_{3}}^{(n)}\big)&-\operatorname{Kro}\big(L_{A_{4}}^{(n)}\big)&\operatorname{Kro}\big(L_{A_{1}}^{(n)}\big)&\operatorname{Kro}\big(L_{A_{2}}^{(n)}\big)\\ \operatorname{Kro}\big(L_{A_{4}}^{(n)}\big)&\operatorname{Kro}\big(L_{A_{3}}^{(n)}\big)&-\operatorname{Kro}\big(L_{A_{2}}^{(n)}\big)&\operatorname{Kro}\big(L_{A_{1}}^{(n)}\big)\end{pmatrix},$$
and $M_{B}$ is formed in the same way from $\operatorname{Kro}\big(L_{B_{i}}^{(n)}\big)$,
where $\operatorname{Kro}\big(L_{A_i}^{(n)}\big)$ and $\operatorname{Kro}\big(L_{B_i}^{(n)}\big)$ are defined by (9). According to Theorem 1, we conclude that $\tilde{\mathcal{X}}=\tilde{\mathcal{X}}_1+\tilde{\mathcal{X}}_2\mathbf{i}+\tilde{\mathcal{X}}_3\mathbf{j}+\tilde{\mathcal{X}}_4\mathbf{k}$, $\tilde{\mathcal{Y}}=\tilde{\mathcal{Y}}_1+\tilde{\mathcal{Y}}_2\mathbf{i}+\tilde{\mathcal{Y}}_3\mathbf{j}+\tilde{\mathcal{Y}}_4\mathbf{k}$ produced by Algorithm 1 is the unique minimal Frobenius norm solution of the tensor Equation (2). □
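The sign pattern of the four-by-four blocks above is the standard real representation of right quaternion multiplication. The following numpy sketch is an illustration only, on scalar quaternions rather than the paper's tensors; the helper names `qmul` and `real_rep` are our own:

```python
import numpy as np

def qmul(p, q):
    # Hamilton product of quaternions stored as (real, i, j, k) 4-vectors
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return np.array([
        a1*a2 - b1*b2 - c1*c2 - d1*d2,
        a1*b2 + b1*a2 + c1*d2 - d1*c2,
        a1*c2 - b1*d2 + c1*a2 + d1*b2,
        a1*d2 + b1*c2 - c1*b2 + d1*a2,
    ])

def real_rep(q):
    # 4x4 real matrix of the map x -> x * q; this is the sign pattern
    # of the blocks of M_A and M_B in the proof above
    a, b, c, d = q
    return np.array([
        [a, -b, -c, -d],
        [b,  a,  d, -c],
        [c, -d,  a,  b],
        [d,  c, -b,  a],
    ])

p = np.array([1.0, 2.0, -1.0, 0.5])
q = np.array([0.3, -1.0, 2.0, 1.0])
x = np.array([0.2, 1.0, 3.0, -2.0])

# the representation realizes right multiplication on coefficient vectors ...
assert np.allclose(real_rep(q) @ x, qmul(x, q))
# ... and composes as an anti-homomorphism: R(p) R(q) = R(q p)
assert np.allclose(real_rep(p) @ real_rep(q), real_rep(qmul(q, p)))
```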
Now, we solve Problem 1.2. If the tensor Equation (2) is consistent, then the solution pair set $S_{XY}$ of Problem 1.1 is non-empty. For given tensors $\bar{\mathcal{X}},\bar{\mathcal{Y}}\in\mathbb{H}^{I_1\times I_2\times\cdots\times I_N}$, we have
$$\min_{\mathcal{X},\mathcal{Y}\in\mathbb{H}^{I_1\times I_2\times\cdots\times I_N}}\Big\|\sum_{k=1}^{N}\big(\mathcal{X}\times_k A^{(k)}+\mathcal{Y}\times_k B^{(k)}\big)-\mathcal{C}\Big\|=\min_{\mathcal{X},\mathcal{Y}\in\mathbb{H}^{I_1\times I_2\times\cdots\times I_N}}\Big\|\sum_{k=1}^{N}\big((\mathcal{X}-\bar{\mathcal{X}})\times_k A^{(k)}+(\mathcal{Y}-\bar{\mathcal{Y}})\times_k B^{(k)}\big)-\Big(\mathcal{C}-\sum_{k=1}^{N}\big(\bar{\mathcal{X}}\times_k A^{(k)}+\bar{\mathcal{Y}}\times_k B^{(k)}\big)\Big)\Big\|.$$
Let $\tilde{\mathcal{X}}=\mathcal{X}-\bar{\mathcal{X}}$, $\tilde{\mathcal{Y}}=\mathcal{Y}-\bar{\mathcal{Y}}$, and $\tilde{\mathcal{C}}=\mathcal{C}-\sum_{k=1}^{N}\big(\bar{\mathcal{X}}\times_k A^{(k)}+\bar{\mathcal{Y}}\times_k B^{(k)}\big)$. Then solving the tensor nearness Problem 1.2 is equivalent to first finding the minimal Frobenius norm solution of the tensor equation
$$\sum_{k=1}^{N}\big(\tilde{\mathcal{X}}\times_k A^{(k)}+\tilde{\mathcal{Y}}\times_k B^{(k)}\big)=\tilde{\mathcal{C}}. \tag{19}$$
By applying Algorithm 1 with the initial tensors $\mathcal{X}_j^{(1)}=\Gamma_j^{*}\big[\mathcal{H}_1^{(1)},\mathcal{H}_2^{(1)},\mathcal{H}_3^{(1)},\mathcal{H}_4^{(1)}\big]$, $\mathcal{Y}_j^{(1)}=\Phi_j^{*}\big[\mathcal{H}_1^{(1)},\mathcal{H}_2^{(1)},\mathcal{H}_3^{(1)},\mathcal{H}_4^{(1)}\big]$ ($j=1,2,3,4$), where $\mathcal{H}_i^{(1)}\in\mathbb{R}^{I_1\times I_2\times\cdots\times I_N}$ ($i=1,2,3,4$) are arbitrary tensors (in particular, we may take $\mathcal{X}_j^{(1)}=\mathcal{O}$, $\mathcal{Y}_j^{(1)}=\mathcal{O}$, $j=1,2,3,4$), we derive the unique minimal Frobenius norm solution $\tilde{\mathcal{X}}^{*},\tilde{\mathcal{Y}}^{*}$ of Equation (19). Once $\tilde{\mathcal{X}}^{*},\tilde{\mathcal{Y}}^{*}$ are obtained, the unique solution $\check{\mathcal{X}},\check{\mathcal{Y}}$ of Problem 1.2 can be determined, namely $\check{\mathcal{X}}=\tilde{\mathcal{X}}^{*}+\bar{\mathcal{X}}$ and $\check{\mathcal{Y}}=\tilde{\mathcal{Y}}^{*}+\bar{\mathcal{Y}}$.
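The shift-and-solve reduction for the nearness problem can be sketched on a real matrix stand-in for Equation (2); this is an illustrative assumption (plain matrices and `numpy.linalg.pinv` for the minimal-norm least-squares step), not the quaternion tensor algorithm itself:

```python
import numpy as np

rng = np.random.default_rng(0)
m, n = 4, 3
A = rng.standard_normal((m, n))
B = rng.standard_normal((m, n))
Xbar = rng.standard_normal((n, n))
Ybar = rng.standard_normal((n, n))
C = rng.standard_normal((m, n))

# Real stand-in for Eq. (2): A @ X + B @ Y = C, i.e. M @ [X; Y] = C
M = np.hstack([A, B])

# Shift by (Xbar, Ybar), solve the minimal Frobenius norm problem for the
# shifted right-hand side C~, then shift back.
C_tilde = C - A @ Xbar - B @ Ybar
Z_tilde = np.linalg.pinv(M) @ C_tilde      # minimal-norm least-squares solution
X_check = Z_tilde[:n] + Xbar
Y_check = Z_tilde[n:] + Ybar

# Sanity check: among all least-squares solutions, (X_check, Y_check) is the
# one closest to (Xbar, Ybar). Perturbing within the null space of M leaves
# the residual unchanged but can only increase the distance to (Xbar, Ybar).
Z0 = np.vstack([Xbar, Ybar])
Z = np.vstack([X_check, Y_check])
_, s, Vt = np.linalg.svd(M)
null_basis = Vt[len(s):].T                 # basis of the null space of M
W = rng.standard_normal((null_basis.shape[1], n))
Z_pert = Z + null_basis @ W
assert np.allclose(M @ Z_pert, M @ Z)
assert np.linalg.norm(Z - Z0) <= np.linalg.norm(Z_pert - Z0) + 1e-12
```

The inequality holds because `Z - Z0` lies in the row space of `M`, orthogonal to the null-space perturbation.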

4. Numerical Examples

In this section, we give some numerical examples to illustrate the efficiency and applications of Algorithm 1. All computations are carried out in Matlab R2018a on a machine with a 2.3 GHz Intel(R) Core(TM) i5 CPU and 8 GB of memory. All tensor operations are implemented with the tensor toolbox (version 3.2.1) of Bader and Kolda [1]. For all examples, the iterations in Algorithm 1 start from the initial values $\mathcal{X}_i=\mathcal{Y}_i=0$, $i=1,2,3,4$, and are stopped as soon as $\mathrm{Res}\le 10^{-5}$ or the number of iteration steps exceeds 2000. The notation appearing in the examples is summarized in Table 1.
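The stopping rule used in all experiments (Res ≤ 10⁻⁵ or more than 2000 steps) has the usual shape of a Krylov iteration loop. As a hedged illustration, here is a plain conjugate gradient skeleton in Python with the same stopping criterion; it is a stand-in only, since Algorithm 1 itself is a BiCG iteration in tensor format over the quaternions:

```python
import numpy as np

def cg_with_stopping(matvec, b, tol=1e-5, maxit=2000):
    # Generic CG loop with the experiments' stopping rule: terminate when the
    # residual norm drops below tol, or when maxit steps are exceeded.
    x = np.zeros_like(b)
    r = b - matvec(x)
    p = r.copy()
    rs = r @ r
    for k in range(1, maxit + 1):
        Ap = matvec(p)
        alpha = rs / (p @ Ap)
        x += alpha * p
        r -= alpha * Ap
        rs_new = r @ r
        if np.sqrt(rs_new) <= tol:
            return x, k
        p = r + (rs_new / rs) * p
        rs = rs_new
    return x, maxit

# small SPD test problem; CG terminates well within the tolerance
A = np.array([[4.0, 1.0], [1.0, 3.0]])
b = np.array([1.0, 2.0])
x, it = cg_with_stopping(lambda v: A @ v, b)
assert np.linalg.norm(A @ x - b) <= 1e-5
```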
Example 1. 
We consider the tensor Equation (2) under the following conditions:
A ( 1 ) = 2 + 4 i + 4 j + 5 k 1 i + j + 2 k 2 i 2 j + k 2 + j 3 + 2 i + 5 j + k 1 2 j + k 2 + 2 i + j 1 2 i + 2 j k 3 + 4 i + 4 j + k , A ( 2 ) = 3 + 4 j 1 + j 1 2 j 2 + j 4 + 3 j j j 1 2 + j , A ( 3 ) = 2 i + 3 j i 1 + j 2 i j 5 i + 3 j i 2 i j i j 5 i + 4 j , B ( 1 ) = 3 + 3 i 2 + i 2 i 2 + i 4 + 4 i 2 2 i 2 + 2 i 2 6 + 4 i , B ( 2 ) = 3 i + 5 j 2 i 2 i + 2 j 0 2 i + 3 j j 2 i + 2 j 2 i 2 j 6 i + 6 j , B ( 3 ) = 1 + 3 i + 3 k 1 2 i i + 2 k i 2 + 4 i + 2 k 2 + 2 i 2 + i + 2 k 2 + i 2 k 5 + 4 i + 6 k , C ( : , : , 1 ) = 3 11 i + 31 j + 19 k 6 8 i + j + 28 k 2 6 i + 20 j + 24 k 4 + 36 j + 8 k 7 + 3 i + 27 j + 17 k 3 + 5 i + 25 j + 13 k 24 2 i + 52 j + 22 k 27 + i + 43 j + 31 k 23 + 3 i + 41 j + 27 k , C ( : , : , 2 ) = 11 13 i + 31 j + 13 k 14 10 i + 22 j + 22 k 10 8 i + 20 j + 18 k 12 2 i + 36 j + 2 k 15 + i + 27 j + 11 k 11 + 3 i + 25 j + 7 k 32 4 i + 52 j + 16 k 35 i + 43 j + 25 k 31 + i + 41 j + 21 k ,
C ( : , : , 3 ) = 9 9 i + 45 j + 25 k 12 6 i + 36 j + 34 k 8 4 i + 34 j + 30 k 10 + 2 i + 50 j + 14 k 13 + 5 i + 41 j + 23 k 9 + 7 i + 39 j + 19 k 30 + 66 j + 28 k 33 + 3 i + 57 j + 3 k 29 + 5 i + 55 j + 33 k .
Applying Algorithm 1, the IT is 46, the CPU time is 3.9496 s, and the Res is $1.2885\times10^{-6}$. Figure 1 illustrates that Algorithm 1 is feasible.
Example 2 
(Test matrices from [28]). We examine the quaternion tensor Equation (2) under the condition that
A ( m ) = A m 1 + A m 2 i + A m 3 j + A m 4 k , B ( m ) = B m 1 + B m 2 i + B m 3 j + B m 4 k , m = 1 , 2 , 3 ,
where
A_{11} = triu(hilb(n)),  A_{12} = triu(ones(n,n)),  A_{13} = eye(n),  A_{14} = ones(n),
A_{21} = zeros(n),  A_{22} = zeros(n),  A_{23} = tridiag(1,2,1,n),  A_{24} = zeros(n),
A_{31} = zeros(n),  A_{32} = tridiag(0.5,6,0.5,n),  A_{33} = eye(n),  A_{34} = zeros(n),
B_{11} = eye(n),  B_{12} = ones(n),  B_{13} = zeros(n),  B_{14} = zeros(n),
B_{21} = zeros(n),  B_{22} = tridiag(0.5,6,0.5,n),  B_{23} = eye(n),  B_{24} = zeros(n),
B_{31} = tridiag(0.5,6,0.5,n),  B_{32} = eye(n),  B_{33} = zeros(n),  B_{34} = ones(n).
C = tenrand(n,n,n) + tenrand(n,n,n) i + tenrand(n,n,n) j + tenrand(n,n,n) k.
Choosing the initial tensors $\mathcal{X}=\mathcal{Y}=0$, we illustrate the convergence curves of Algorithm 1 for various values of n in Figure 2. In Table 2, for $n=20$, $n=40$, and $n=60$, we report the CPU time, the residual norms after a finite number of steps, and the relative errors of the approximate solutions obtained using Algorithm 1.
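For readers without Matlab, the generators used above translate directly to numpy. The sketch below reproduces a few of the test blocks; the `tridiag` helper is our own, since numpy has no built-in equivalent:

```python
import numpy as np

def hilb(n):
    # Hilbert matrix H[i, j] = 1 / (i + j + 1) with zero-based indices
    i, j = np.indices((n, n))
    return 1.0 / (i + j + 1)

def tridiag(a, b, c, n):
    # n x n tridiagonal matrix: subdiagonal a, diagonal b, superdiagonal c
    return (np.diag(np.full(n - 1, a), -1)
            + np.diag(np.full(n, b))
            + np.diag(np.full(n - 1, c), 1))

n = 4
A11 = np.triu(hilb(n))           # triu(hilb(n))
A12 = np.triu(np.ones((n, n)))   # triu(ones(n,n))
A23 = tridiag(1, 2, 1, n)        # tridiag(1,2,1,n)
B22 = tridiag(0.5, 6, 0.5, n)    # tridiag(0.5,6,0.5,n)

assert A11[0, 0] == 1.0 and A11[1, 0] == 0.0
assert B22[0, 0] == 6 and B22[0, 1] == 0.5
```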
Example 3. 
We consider the solution of the following convection–diffusion equation over the quaternion algebra [56]:
$$\begin{cases}-v\,\Delta u+\mathbf{c}^{T}\nabla u=f & \text{in } \Gamma=[0,1]\times[0,1]\times[0,1],\\ u=0 & \text{on } \partial\Gamma.\end{cases}$$
Based on a standard finite difference discretization on a uniform grid for the diffusion term and a second-order convergent scheme (Fromm's scheme) for the convection term, with mesh size $h=\frac{1}{l+1}$, we solve the quaternion tensor Equation (3) with
A ( n ) = A 1 ( n ) + A 2 ( n ) i + A 3 ( n ) j + A 4 ( n ) k , n = 1 , 2 , 3 ,
where
$$A_i^{(n)}=\frac{v_i^{(n)}}{h^{2}}\begin{pmatrix}2&-1&&\\-1&2&-1&\\&\ddots&\ddots&\ddots\\&&-1&2\end{pmatrix}_{l\times l}+\frac{c_i^{(n)}}{4h}\begin{pmatrix}3&1&&\\-5&3&1&\\1&-5&3&\ddots\\&\ddots&\ddots&\ddots\end{pmatrix}_{l\times l},$$
and $i=1,2,3,4$. The right-hand side tensor C is constructed so that the exact solution of Equation (3) is $\mathcal{X}^{*}=\mathcal{X}_1^{*}+\mathcal{X}_2^{*}\mathbf{i}+\mathcal{X}_3^{*}\mathbf{j}+\mathcal{X}_4^{*}\mathbf{k}$ = tenones(l,l,l) + tenones(l,l,l) i + tenones(l,l,l) j + tenones(l,l,l) k.
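Under our reading of the discretization above (Laplacian stencil $(-1,2,-1)/h^2$ and Fromm convection coefficients $(1,-5,3,1)/(4h)$ — the signs are an assumption, since the printed display is damaged), the coefficient matrices $A_i^{(n)}$ can be assembled as:

```python
import numpy as np

def diffusion_convection_matrix(v, c, l):
    # A = (v/h^2) * tridiag(-1, 2, -1) + (c/4h) * Fromm-stencil matrix,
    # with mesh size h = 1/(l+1); sign pattern is our assumption.
    h = 1.0 / (l + 1)
    lap = (np.diag(np.full(l, 2.0))
           - np.diag(np.ones(l - 1), 1)
           - np.diag(np.ones(l - 1), -1))
    fromm = (3.0 * np.eye(l)
             + np.diag(np.ones(l - 1), 1)
             - 5.0 * np.diag(np.ones(l - 1), -1)
             + np.diag(np.ones(l - 2), -2))
    return (v / h**2) * lap + (c / (4 * h)) * fromm

A = diffusion_convection_matrix(1.0, 1.0, 10)
assert A.shape == (10, 10)
```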
We consider two cases in order to compare Algorithm 1 with the CGLS algorithm in [43]. In case I, we choose different v i ( n ) and c i ( n ) to obtain the results. In Table 3, we set
$$v_i^{(1)}=1,\ c_i^{(1)}=1;\qquad v_i^{(2)}=1,\ c_i^{(2)}=2;\qquad v_i^{(3)}=1,\ c_i^{(3)}=3,\qquad i=1,2,3,4,$$
to obtain A ( n ) ( n = 1 , 2 , 3 ) with the same real part and imaginary part. In Table 4, we set
v i ( 1 ) = 1 , c 1 ( 1 ) = 1 , c 2 ( 1 ) = 1 , c 3 ( 1 ) = 0 , c 4 ( 1 ) = 1 ; v i ( 2 ) = 0.1 , c 1 ( 2 ) = 1 , c 2 ( 2 ) = 1 , c 3 ( 2 ) = 1 , c 4 ( 2 ) = 0 ; v i ( 3 ) = 0.01 , c 1 ( 3 ) = 1 , c 2 ( 3 ) = 1 , c 3 ( 3 ) = 0 , c 4 ( 3 ) = 0 , i = 1 , 2 , 3 , 4
to obtain A ( n ) ( n = 1 , 2 , 3 ) with different real parts and imaginary parts.
In case II, we set $c_i^{(1)}=1$, $c_i^{(2)}=2$, $c_i^{(3)}=3$ ($i=1,2,3,4$), and apply Algorithm 1 and the CGLS algorithm in [43] with $v_i^{(j)}=10^{-3},\,1,\,100$ ($i=1,2,3,4$, $j=1,2,3$) on the $10\times10\times10$ grid. The relative errors $\sum_{i=1}^{4}\|\mathcal{X}_i^{(k)}-\mathcal{X}_i^{*}\|/\|\mathcal{X}_i^{*}\|$ of the approximate solutions computed by these methods are shown in Table 5.
The previous results show that Algorithm 1 has a faster convergence rate than the CGLS algorithm in [43] as the problem size increases.
Example 4. 
We employ Algorithm 1 and compare its performance with the CGLS algorithm in restoring a color video comprising a sequence of RGB images (slices). The video, titled 'Rhinos', originates from Matlab and is saved in AVI format. Each frontal slice of this color video is represented by a pure quaternion matrix of $240\times320$ pixels. For $\mathcal{C}=\hat{\mathcal{C}}+\mathcal{N}=\mathcal{X}\times_1 A$, we consider $\mathcal{X}$ the original color video, $A$ the blurring matrix, and $\mathcal{N}$ a noise tensor. When $\mathcal{N}=0$, $\mathcal{C}$ is referred to as the blurred and noise-free color video. In this scenario, we select the blurring matrix $A=A_1\otimes A_2\in\mathbb{R}^{240\times320}$, where $A_1=\big(a_{ij}^{(1)}\big)_{1\le i,j\le16}$ and $A_2=\big(a_{ij}^{(2)}\big)_{1\le i\le15,\,1\le j\le20}$ are Toeplitz matrices with entries defined as follows:
$$a_{ij}^{(1)}=\begin{cases}\dfrac{1}{\sigma\sqrt{2\pi}}\exp\left(-\dfrac{(i-j)^{2}}{2\sigma^{2}}\right), & |i-j|\le r,\\ 0, & \text{otherwise};\end{cases}\qquad a_{ij}^{(2)}=\begin{cases}\dfrac{1}{2s-1}, & |i-j|\le s,\\ 0, & \text{otherwise}.\end{cases}$$
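Assuming the entries vanish outside the band (the printed piecewise definition is damaged at that point), the two Toeplitz blocks and the blurring operator $A=A_1\otimes A_2$ can be built as:

```python
import numpy as np

def gaussian_toeplitz(rows, cols, sigma, r):
    # a_ij = exp(-(i-j)^2 / (2 sigma^2)) / (sigma sqrt(2 pi)) if |i-j| <= r,
    # else 0 (assumed zero fill outside the band)
    i, j = np.indices((rows, cols))
    g = np.exp(-(i - j) ** 2 / (2 * sigma ** 2)) / (sigma * np.sqrt(2 * np.pi))
    return np.where(np.abs(i - j) <= r, g, 0.0)

def box_toeplitz(rows, cols, s):
    # a_ij = 1 / (2s - 1) if |i-j| <= s, else 0
    i, j = np.indices((rows, cols))
    return np.where(np.abs(i - j) <= s, 1.0 / (2 * s - 1), 0.0)

A1 = gaussian_toeplitz(16, 16, sigma=1.0, r=3)
A2 = box_toeplitz(15, 20, s=3)
A = np.kron(A1, A2)          # blurring operator A = A1 (kron) A2
assert A.shape == (240, 320)
```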
We denote X restored as the resulting restored color video. The performance of the algorithm is evaluated using the peak signal-to-noise ratio (PSNR), which is measured in decibels (dB):
$$\mathrm{PSNR}(\mathcal{X})=10\log_{10}\frac{I_{1}I_{2}d^{2}}{\|\mathcal{X}-\mathcal{X}_{\mathrm{restored}}\|^{2}},$$
where d denotes the maximum possible pixel value of the image. RE ( X ) represents the relative error, which is defined as
$$\mathrm{RE}(\mathcal{X})=\frac{\|\mathcal{X}-\mathcal{X}_{\mathrm{restored}}\|}{\|\mathcal{X}\|}.$$
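Both quality measures translate directly to code. The sketch below treats $\mathcal{X}$ as a real array for illustration, whereas the paper works with quaternion tensors:

```python
import numpy as np

def psnr(x, x_restored, d=255.0):
    # PSNR = 10 log10( I1 * I2 * d^2 / ||X - X_restored||^2 ), in dB
    i1, i2 = x.shape[:2]
    err = np.linalg.norm((x - x_restored).ravel()) ** 2
    return 10.0 * np.log10(i1 * i2 * d ** 2 / err)

def rel_err(x, x_restored):
    # RE = ||X - X_restored|| / ||X|| (Frobenius norms)
    return (np.linalg.norm((x - x_restored).ravel())
            / np.linalg.norm(x.ravel()))

x = np.full((4, 4), 100.0)
noisy = x + 1.0
# each entry off by 1 on a constant-100 image: RE = 4/400 = 0.01
assert abs(rel_err(x, noisy) - 0.01) < 1e-12
```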
In this case, we use $d=255$ and set the variance $\sigma=1$. Table 6 reports the peak signal-to-noise ratio (PSNR) and the relative error (RE) for Algorithm 1 and the CGLS algorithm with various parameters. As indicated, the PSNR and the relative error of our algorithm are significantly superior to those of CGLS. For the case $r=6$, $s=6$ and slice No. 7 of the color video, Figure 3 shows the original image, the blurred image, and the images restored by CGLS and by Algorithm 1. The figure demonstrates that our algorithm can effectively restore blurred and noise-free color video with high quality.

5. Conclusions

The main goal of this paper is to solve the generalized quaternion tensor Equation (2). We develop a BiCG iterative algorithm in tensor format that solves Equation (2) efficiently, and we prove the convergence of the proposed method. Moreover, we show that the minimal Frobenius norm solution can be obtained by choosing suitable initial tensors. Several numerical examples illustrate the effectiveness of the algorithm, and it is successfully applied to the restoration of color videos. This contribution advances the understanding of quaternion tensor equations by introducing a practical iterative approach.

Author Contributions

Conceptualization, M.X. and Q.-W.W.; methodology, M.X.; software, M.X.; validation, M.X., Q.-W.W. and Y.Z.; formal analysis, M.X.; investigation, M.X.; resources, Q.-W.W.; data curation, M.X.; writing—original draft preparation, M.X.; writing—review and editing, Y.Z.; visualization, M.X.; supervision, Q.-W.W.; project administration, M.X., Q.-W.W. and Y.Z.; funding acquisition, M.X., Q.-W.W. and Y.Z. All authors have read and agreed to the published version of the manuscript.

Funding

The first author is supported by the Natural Science Foundation of China under Grant No. 12301028 and Startup Foundation for Young Teachers of Shanghai Ocean University. The second author is supported by the Natural Science Foundation of China under Grant No. 12371023. The third author is supported by the Canada NSERC under Grant No. RGPIN-2020-06746.

Data Availability Statement

Data are contained within the article.

Acknowledgments

The authors would like to thank the editor and reviewers for their valuable suggestions and comments.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Kolda, T.G.; Bader, B.W. Tensor decompositions and applications. SIAM Rev. 2009, 51, 455–500. [Google Scholar] [CrossRef]
  2. Qi, L.; Luo, Z. Tensor Analysis: Spectral Theory and Special Tensors; SIAM: Philadelphia, PA, USA, 2017. [Google Scholar]
  3. Beik, F.P.; Jbilou, K.; Najafi-Kalyani, M.; Reichel, L. Golub–Kahan bidiagonalization for ill-conditioned tensor equations with applications. Numer. Algorithms 2020, 84, 1535–1563. [Google Scholar] [CrossRef]
  4. Duan, X.F.; Zhang, Y.S.; Wang, Q.W. An efficient iterative method for solving a class of constrained tensor least squares problem. Appl. Numer. Math. 2024, 196, 104–117. [Google Scholar] [CrossRef]
  5. Guan, Y.; Chu, D. Numerical computation for orthogonal low-rank approximation of tensors. SIAM J. Matrix Anal. Appl. 2019, 40, 1047–1065. [Google Scholar] [CrossRef]
  6. Guan, Y.; Chu, M.T.; Chu, D. Convergence analysis of an SVD-based algorithm for the best rank-1 tensor approximation. Linear Algebra Appl. 2018, 555, 53–69. [Google Scholar] [CrossRef]
  7. Guan, Y.; Chu, M.T.; Chu, D. SVD-based algorithms for the best rank-1 approximation of a symmetric tensor. SIAM J. Matrix Anal. 2018, 39, 1095–1115. [Google Scholar] [CrossRef]
  8. Hu, J.; Ke, Y.; Ma, C. Efficient iterative method for generalized Sylvester quaternion tensor equation. Comput. Appl. Math. 2023, 42, 237. [Google Scholar] [CrossRef]
  9. Ke, Y. Finite iterative algorithm for the complex generalized Sylvester tensor equations. J. Appl. Anal. Comput. 2020, 10, 972–985. [Google Scholar] [CrossRef]
  10. Kolda, T.G. Multilinear Operators for Higher-Order Decompositions; Sandia National Laboratory (SNL): Albuquerque, NM, USA; Livermore, CA, USA, 2006. [Google Scholar]
  11. Li, B.W.; Tian, S.; Sun, Y.S.; Hu, Z.M. Schur-decomposition for 3D matrix equations and its application in solving radiative discrete ordinates equations discretized by Chebyshev collocation spectral method. J. Comput. Phys. 2010, 229, 1198–1212. [Google Scholar] [CrossRef]
  12. Li, T.; Wang, Q.-W.; Duan, X.-F. Numerical algorithms for solving discrete Lyapunov tensor equation. J. Comput. Appl. Math. 2020, 370, 112676. [Google Scholar] [CrossRef]
  13. Li, T.; Wang, Q.-W.; Zhang, X.-F. Gradient based iterative methods for solving symmetric tensor equations. Numer. Linear Algebra Appl. 2022, 29, e2414. [Google Scholar] [CrossRef]
  14. Li, T.; Wang, Q.-W.; Zhang, X.-F. A Modified conjugate residual method and nearest kronecker product preconditioner for the generalized coupled Sylvester tensor equations. Mathematics 2022, 10, 1730. [Google Scholar] [CrossRef]
  15. Li, X.; Ng, M.K. Solving sparse non-negative tensor equations: Algorithms and applications. Front. Math. China 2015, 10, 649–680. [Google Scholar] [CrossRef]
  16. Liang, Y.; Silva, S.D.; Zhang, Y. The tensor rank problem over the quaternions. Linear Algebra Appl. 2021, 620, 37–60. [Google Scholar] [CrossRef]
  17. Lv, C.; Ma, C. A modified CG algorithm for solving generalized coupled Sylvester tensor equations. Appl. Math. Comput. 2020, 365, 124699. [Google Scholar] [CrossRef]
  18. Malek, A.; Momeni-Masuleh, S.H. A mixed collocation–finite difference method for 3D microscopic heat transport problems. J. Comput. Appl. Math. 2008, 217, 137–147. [Google Scholar] [CrossRef]
  19. Qi, L. Eigenvalues of a real supersymmetric tensor. J. Symb. Comput. 2005, 40, 1302–1324. [Google Scholar] [CrossRef]
  20. Qi, L. Symmetric nonnegative tensors and copositive tensors. Linear Algebra Appl. 2013, 439, 228–238. [Google Scholar] [CrossRef]
  21. Qi, L.; Chen, H.; Chen, Y. Tensor Eigenvalues and Their Applications; Springer: Berlin/Heidelberg, Germany, 2018. [Google Scholar]
  22. Zhang, X.-F.; Li, T.; Ou, Y.-G. Iterative solutions of generalized Sylvester quaternion tensor equations. Linear Multilinear Algebra 2024, 72, 1259–1278. [Google Scholar] [CrossRef]
  23. Zhang, X.-F.; Wang, Q.-W. On RGI Algorithms for Solving Sylvester Tensor Equations. Taiwan. J. Math. 2022, 26, 501–519. [Google Scholar] [CrossRef]
  24. Kyrchei, I. Cramer’s rules for Sylvester quaternion matrix equation and its special cases. Adv. Appl. Clifford Algebras 2018, 28, 1–26. [Google Scholar] [CrossRef]
  25. Heyouni, M.; Saberi-Movahed, F.; Tajaddini, A. On global Hessenberg based methods for solving Sylvester matrix equations. Comput. Math. Appl. 2019, 77, 77–92. [Google Scholar] [CrossRef]
  26. Zhang, X. A system of generalized Sylvester quaternion matrix equations and its applications. Appl. Math. Comput. 2016, 273, 74–81. [Google Scholar] [CrossRef]
  27. Beik, F.; Ahmadi-Asl, S. An iterative algorithm for η-(anti)-Hermitian least-squares solutions of quaternion matrix equations. Electron. J. Linear Algebra 2015, 30, 372–401. [Google Scholar] [CrossRef]
  28. Ahmadi-Asl, S.; Beik, F.P.A. An efficient iterative algorithm for quaternionic least-squares problems over the generalized η-(anti-)bi-Hermitian matrices. Linear Multilinear Algebra 2017, 65, 1743–1769. [Google Scholar] [CrossRef]
  29. Ahmadi-Asl, S.; Beik, F.P.A. Iterative algorithms for least-squares solutions of a quaternion matrix equation. J. Appl. Math. Comput. 2017, 53, 95–127. [Google Scholar] [CrossRef]
  30. Song, G.; Wang, Q.-W.; Yu, S. Cramer’s rule for a system of quaternion matrix equations with applications. Appl. Math. Comput. 2018, 336, 490–499. [Google Scholar] [CrossRef]
  31. Wang, Q.-W.; He, Z.-H.; Zhang, Y. Constrained two-sided coupled Sylvester-type quaternion matrix equations. Automatica 2019, 101, 207–213. [Google Scholar] [CrossRef]
  32. Zhang, F.; Mu, W.; Li, Y.; Zhao, J. Special least squares solutions of the quaternion matrix equation AXB + CXD = E. Comput. Math. Appl. 2016, 72, 1426–1435. [Google Scholar] [CrossRef]
  33. Huang, N.; Ma, C.-F. Modified conjugate gradient method for obtaining the minimum-norm solution of the generalized coupled Sylvester-conjugate matrix equations. Appl. Math. Model. 2016, 40, 1260–1275. [Google Scholar] [CrossRef]
  34. Gao, Z.-H.; Wang, Q.-W.; Xie, L. The (anti-)η-Hermitian solution to a novel system of matrix equations over the split quaternion algebra. Math. Meth. Appl. Sci. 2024, 1–18. [Google Scholar] [CrossRef]
  35. He, Z.-H.; Wang, X.-X.; Zhao, Y.-F. Eigenvalues of quaternion tensors with applications to color video processing. J. Sci. Comput. 2023, 94, 1. [Google Scholar] [CrossRef]
  36. Jia, Z.; Wei, M.; Zhao, M.X.; Chen, Y. A new real structure-preserving quaternion QR algorithm. J. Comput. Appl. Math. 2018, 343, 26–48. [Google Scholar] [CrossRef]
  37. Li, Y.; Wei, M.; Zhang, F.; Zhao, J. Real structure-preserving algorithms of Householder based transformations for quaternion matrices. J. Comput. Appl. Math. 2016, 305, 82–91. [Google Scholar] [CrossRef]
  38. Mehany, M.S.; Wang, Q.-W.; Liu, L. A System of Sylvester-like quaternion tensor equations with an application. Front. Math. 2024, 19, 749–768. [Google Scholar] [CrossRef]
  39. Xie, M.; Wang, Q.-W. Reducible solution to a quaternion tensor equation. Front. Math. China 2020, 15, 1047–1070. [Google Scholar] [CrossRef]
  40. Xie, M.Y.; Wang, Q.W.; He, Z.H.; Saad, M.M. A system of Sylvester-type quaternion matrix equations with ten variables. Acta Math. Sin. (Engl. Ser.) 2022, 38, 1399–1420. [Google Scholar] [CrossRef]
  41. Saberi-Movahed, F.; Tajaddini, A.; Heyouni, M.; Elbouyahyaoui, L. Some iterative approaches for Sylvester tensor equations, Part I: A tensor format of truncated Loose Simpler GMRES. Appl. Numer. Math. 2022, 172, 428–445. [Google Scholar] [CrossRef]
  42. Saberi-Movahed, F.; Tajaddini, A.; Heyouni, M.; Elbouyahyaoui, L. Some iterative approaches for Sylvester tensor equations, Part II: A tensor format of Simpler variant of GCRO-based methods. Appl. Numer. Math. 2022, 172, 413–427. [Google Scholar] [CrossRef]
  43. Wang, Q.-W.; Xu, X.; Duan, X. Least squares solution of the quaternion Sylvester tensor equation. Linear Multilinear Algebra 2021, 69, 104–130. [Google Scholar] [CrossRef]
  44. Zhang, X.-F.; Wang, Q.-W. Developing iterative algorithms to solve Sylvester tensor equations. Appl. Math. Comput. 2021, 409, 126403. [Google Scholar] [CrossRef]
  45. Chen, Z.; Lu, L. A projection method and Kronecker product preconditioner for solving Sylvester tensor equations. Sci. China Math. 2012, 55, 1281–1292. [Google Scholar] [CrossRef]
  46. Karimi, S.; Dehghan, M. Global least squares method based on tensor form to solve linear systems in Kronecker format. Trans. Inst. Measure. Control 2018, 40, 2378–2386. [Google Scholar] [CrossRef]
  47. Najafi-Kalyani, M.; Beik, F.P.A.; Jbilou, K. On global iterative schemes based on Hessenberg process for (ill-posed) Sylvester tensor equations. J. Comput. Appl. Math. 2020, 373, 112216. [Google Scholar] [CrossRef]
  48. Bai, Z.-Z.; Golub, G.H.; Ng, M.K. Hermitian and skew-Hermitian splitting methods for non-Hermitian positive definite linear systems. SIAM J. Matrix Anal. Appl. 2003, 24, 603–626. [Google Scholar] [CrossRef]
  49. Grasedyck, L. Existence and computation of low Kronecker-rank approximations for large linear systems of tensor product structure. Computing 2004, 72, 247–265. [Google Scholar] [CrossRef]
  50. Peng, Y.; Hu, X.; Zhang, L. An iteration method for the symmetric solutions and the optimal approximation solution of the matrix equation AXB = C. Appl. Math. Comput. 2005, 160, 763–777. [Google Scholar] [CrossRef]
  51. Bank, R.E.; Chan, T.-F. An analysis of the composite step biconjugate gradient method. Numer. Math. 1993, 66, 295–319. [Google Scholar] [CrossRef]
  52. Bank, R.E.; Chan, T.-F. A composite step bi-conjugate gradient algorithm for nonsymmetric linear systems. Numer. Algorithms 1994, 7, 1–16. [Google Scholar] [CrossRef]
  53. Freund, R.W.; Golub, G.H.; Nachtigal, N.M. Iterative solution of linear systems. Acta Numer. 1992, 1, 44. [Google Scholar] [CrossRef]
  54. Hajarian, M. Developing Bi-CG and Bi-CR methods to solve generalized Sylvester-transpose matrix equations. Int. J. Auto. Comput. 2014, 11, 25–29. [Google Scholar] [CrossRef]
  55. Beik, F.P.A.; Saberi-Movahed, F.; Ahmadi-Asl, S. On the Krylov subspace methods based on tensor format for positive definite Sylvester tensor equations. Numer. Linear Algebra Appl. 2016, 23, 444–466. [Google Scholar] [CrossRef]
  56. Ballani, J.; Grasedyck, L. A projection method to solve linear systems in tensor format. Numer. Linear Algebra Appl. 2013, 20, 27–43. [Google Scholar] [CrossRef]
Figure 1. Convergence history of Example 1.
Figure 2. Convergence history of Example 2.
Figure 3. The restored color video for the case with r = 6 and s = 6, using slice No. 7.
Table 1. Notation used in the numerical examples.

IT                  The number of iterations
CPU time            The CPU time elapsed, measured in seconds
Res                 $\sum_{i=1}^{4}\|\mathcal{R}_i^{(k)}\|$, where $\mathcal{R}_i^{(k)}$ is the residual at the kth iteration
tenrand(n,n,n)      The third-order $n\times n\times n$ tensor with pseudo-random values sampled from a uniform distribution over the unit interval
triu(hilb(n))       The upper triangular part of the $n\times n$ Hilbert matrix
triu(ones(n,n))     The upper triangular part of the $n\times n$ matrix of all ones
eye(n)              The $n\times n$ identity matrix
zeros(n)            The $n\times n$ zero matrix
tridiag(a,b,c,n)    The $n\times n$ tridiagonal matrix with subdiagonal a, diagonal b, and superdiagonal c
Table 2. Numerical results for Example 2.

n     IT     CPU time    Res
20    82     8.3272      7.7182 × 10^{-6}
40    156    24.8187     5.1233 × 10^{-6}
60    288    91.4387     7.6898 × 10^{-6}
Table 3. CPU time (IT) for Example 3 with the parameter setup in (20).

              l = 10            l = 25             l = 30
Algorithm 1   25.6884 (320)     140.8233 (1270)    202.7709 (1580)
CGLS [43]     15.5685 (219)     144.3166 (1266)    230.7340 (1832)
Table 4. CPU time (IT) for Example 3 with the parameter setup in (21).

              l = 10            l = 15             l = 20
Algorithm 1   45.4157 (576)     104.4370 (1095)    163.6110 (1700)
CGLS [43]     44.0701 (579)     118.2419 (1296)    230.7978 (2298)
Table 5. The relative errors of the solution (IT) for Example 3.

              v_i^{(1)} = 10^{-3}       v_i^{(2)} = 1             v_i^{(3)} = 100
Algorithm 1   5.6210 × 10^{-9} (248)    6.0527 × 10^{-9} (188)    4.1925 × 10^{-9} (248)
CGLS [43]     9.9300 × 10^{-9} (106)    9.7035 × 10^{-9} (178)    8.9552 × 10^{-9} (167)
Table 6. The numerical results for Example 4.

[r, s]    Algorithm 1 PSNR (RE)    CGLS PSNR (RE)
[3, 3]    38.6181 (0.0235)         13.8404 (0.3769)
[6, 6]    37.3721 (0.0300)         14.0529 (0.3694)
[8, 8]    33.8073 (0.0338)         14.7958 (0.3551)

