Componentwise Perturbation Analysis of the QR Decomposition of a Matrix

Petkov, Petko H.

doi:10.3390/math10244687

Open AccessFeature PaperArticle

Componentwise Perturbation Analysis of the QR Decomposition of a Matrix

by

Petko H. Petkov

Department of Engineering Sciences, Bulgarian Academy of Sciences, 1040 Sofia, Bulgaria

Mathematics 2022, 10(24), 4687; https://doi.org/10.3390/math10244687

Submission received: 17 November 2022 / Revised: 4 December 2022 / Accepted: 6 December 2022 / Published: 10 December 2022

(This article belongs to the Special Issue Numerical Analysis and Matrix Computations: Theory and Applications)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

The paper presents a rigorous perturbation analysis of the QR decomposition

A = Q R

of an

n \times m

matrix A using the method of splitting operators. New asymptotic componentwise perturbation bounds are derived for the elements of Q and R and the subspaces spanned by the first

p \leq m

columns of A. The new bounds are less conservative than the known bounds and are significantly better than the normwise bounds. An iterative scheme is proposed to determine global componentwise bounds in the case of perturbations for which such bounds are valid. Several numerical results are given that illustrate the analysis and the quality of the bounds obtained.

Keywords:

QR decomposition; perturbation analysis; componentwise bounds; asymptotic bounds; global bounds

MSC:

65F25; 47A55; 93C73

1. Introduction

The QR decomposition of a matrix

A \in R^{n \times m}

with

n \geq m

as the factorization

A : = Q [\begin{matrix} R \\ 0 \end{matrix}],

(1)

where

Q \in R^{n \times n}

is an orthogonal matrix and

R \in R^{m \times m}

is the upper triangular matrix. The matrices Q and R are referred to as the Q-factor and the R-factor, respectively. Further on, we shall assume that the matrix A has rank m, i.e., it has full column rank. In such a case, the matrix R is nonsingular, and the matrix Q can be represented as

Q = [Q_{1}, Q_{2}], Q_{1} \in R^{n \times m}, Q_{2} \in R^{n \times (n - m)},

where

R (Q_{1}) = R (A)

and the columns of

Q_{2}

form an orthonormal basis for the complementary subspace

R {(A)}^{⊥}

([1], Ch. 1). Thus,

A = Q_{1} R .

(2)

The representation (2) is frequently called QR factorization of A, and it is unique up to the signs of the diagonal elements of R. The matrix

Q_{2}

is not unique but has to obey the orthogonality condition

Q^{T} Q = [\begin{matrix} Q_{1}^{T} Q_{1} & Q_{1}^{T} Q_{2} \\ Q_{2}^{T} Q_{1} & Q_{2}^{T} Q_{2} \end{matrix}] = [\begin{matrix} I_{m} & 0 \\ 0 & I_{n - m} \end{matrix}] .

(3)

In practice, the matrix A is subject to perturbations of different kinds (model inconsistencies, measurement and rounding errors), which leads to the necessity of investigating the sensitivity of the different elements of the QR decomposition to perturbations in the data, i.e., to perform a perturbation analysis of the decomposition [2]. Further on, we assume that the matrix A is subject to an additive perturbation

δ A \in R^{n \times m}

and that there exist another pair of matrix

\tilde{Q}

and upper triangular matrix

\tilde{R}

such that

\tilde{A} = \tilde{Q} [\begin{matrix} \tilde{R} \\ 0 \end{matrix}], \tilde{A} = A + δ A .

(4)

The purpose of the perturbation analysis of the QR decomposition is to find bounds on the sizes of

δ Q = \tilde{Q} - Q

and

δ R = \tilde{R} - R

as functions of the size of

δ A

for sufficiently small perturbations of A [3,4]. Due to the non-uniqueness of the matrix

Q_{2}

, its perturbation is also non-unique. Thus, in the perturbation analysis, one usually considers only the perturbations of the matrix

Q_{1}

, which are uniquely defined by the perturbations of A. However, in the analysis, we shall need to use an arbitrary matrix

Q_{2}

that satisfies the orthogonality condition (3).

The sizes of the perturbations

δ A

,

δ Q_{1}

and

δ R

in the QR factorization are measured by using some of the matrix norms, and, in this case, we call the respective analysis normwise perturbation analysis. Sometimes, however, we are interested in the size of perturbations in individual elements of

δ Q_{1}

and

δ R

, and, in such a case, the analysis is called componentwise perturbation analysis [5]. In the cases when the estimated vector or matrix has components that differ greatly in size, the normwise estimate does not produce reliable results, and it is preferable to use the componentwise perturbation analysis.

The perturbation analysis of the QR decomposition was performed for the first time by Stewart [6], and improved results were presented by Sun [7] and Stewart [8]. Using a different approach, Chang, Paige and Stewart [9] gave new asymptotic perturbation bounds for the R-factor. Additional improvements of the normwise perturbation bounds of the QR-decomposition were proposed by Chang and Stehlé [10] and Li and Wei [11]. Different componentwise estimates of the perturbations of the Q-factor and the R-factor were derived by Sun [12], Zha [13], Chang and Paige [14] and Chang [15].

A general approach, based on the use of the so-called splitting operators, which can be used in the perturbation analysis of several unitary decompositions, was proposed in [16]; for details, see [17]. The method of the splitting operators can be used to determine normwise as well as componentwise perturbation bounds of different unitary decompositions; see [18,19,20,21,22]. This method was implemented by Sun [23], who obtained improved normwise perturbation bounds of the QR decomposition.

This paper presents a rigorous componentwise perturbation analysis of the QR decomposition based on the method of splitting operators. The analysis presented aims at finding normwise and componentwise perturbation bounds for infinitely small perturbations (asymptotic bounds) as well as for finite perturbations (global bounds). The main result is the obtaining of new asymptotic componentwise perturbation bounds that produce less conservative estimates of the QR decomposition perturbations. A particular case of these bounds is the asymptotic normwise bounds of the QR decomposition derived previously.

This is demonstrated by an example that the new componentwise perturbation bounds of the R factor can be several orders of magnitude smaller than the normwise perturbation bound of this factor. An iterative scheme is proposed to determine global componentwise bounds in the case of perturbations for which such bounds exist. The analysis conducted in this paper is unified with the perturbation analysis of the Schur decomposition presented in [20] and can be easily extended to the case of complex matrices.

In Section 2, we introduce the basic scheme of the perturbation analysis. Section 3 is devoted to determining normwise and componentwise perturbation bounds of the matrix

Q_{1}

. In Section 4, we present estimates for the perturbations of the column subspaces of A, and, in Section 5, we derive bounds of the elements of R. An iterative scheme for finding global componentwise perturbation bounds of the QR decomposition is proposed in Section 6. A comparison with some of the known methods for perturbation analysis of the QR decomposition is performed in Section 7, and our conclusions are made in Section 8.

The numerical results presented in the paper were obtained with MATLAB^® R2020b [24] using IEEE double precision arithmetic with roundoff unit

u \approx 1.11 \times 10^{- 16}

.

2. Bounding the Basic Perturbation Parameters

Let

Q : = [q_{1}, q_{2}, \dots, q_{n}], q_{j} \in R^{n}

and the unperturbed and perturbed matrices of the orthogonal factor of the QR decomposition be

\begin{matrix} Q & : = & [q_{1}, q_{2}, \dots, q_{n}], \\ \tilde{Q} & : = & [{\tilde{q}}_{1}, {\tilde{q}}_{2}, \dots, {\tilde{q}}_{n}], \\ {\tilde{q}}_{j} & : = & q_{j} + δ q_{j}, j = 1, 2, \dots n, \end{matrix}

respectively. Define the perturbation matrix

δ Q_{1} : = [δ q_{1}, δ q_{2}, \dots, δ q_{m}], δ q_{j} \in R^{n} .

It follows from (1) and (4) that

δ q_{i}^{T} a_{j} = - {\tilde{q}}_{i}^{T} δ a_{j} = 0, 1 \leq j \leq m, j < i \leq n .

(5)

The column

a_{j}

can be obtained from the QR factorization (2) as

a_{j} = \sum_{k = 1}^{j} r_{k j} q_{k}, 1 \leq j \leq m .

(6)

Substituting (6) in (5) yields

\sum_{k = 1}^{j} r_{k j} δ q_{i}^{T} q_{k} = - {\tilde{q}}_{i}^{T} δ a_{j} .

(7)

Since

{\tilde{Q}}^{T} \tilde{Q} = I_{n}

, it follows that

Q^{T} δ Q = - δ Q^{T} Q - δ Q^{T} δ Q

and

δ q_{i}^{T} q_{j} = - q_{i}^{T} δ q_{j} - δ q_{i}^{T} δ q_{j}, 1 \leq j \leq m, j < i \leq n .

(8)

Using (8), Equation (7) can be written as

\sum_{k = 1}^{j} r_{k j} q_{i}^{T} δ q_{k} + \sum_{k = 1}^{j} r_{k j} δ q_{i}^{T} δ q_{k} = {\tilde{q}}_{i}^{T} δ a_{j} .

(9)

Equation (9) represents a system of

ν = n (n - 1) / 2 - m (m - 1) / 2 = m (2 n - m - 1) / 2

nonlinear algebraic equations for the

ν

unknown quantities

x_{ℓ} : = q_{i}^{T} δ q_{j}, ℓ = i + (j - 1) n - \frac{j (j + 1)}{2}, 1 \leq j \leq m, j < i \leq n .

These quantities, which we call basic perturbation parameters, are elements of the strict lower part of the matrix

δ W = Q^{T} δ Q_{1}

. More precisely, one has that

x = vec (Low (δ W)),

or, equivalently,

x = Ω vec (δ W),

where

\begin{matrix} Ω & : = & [diag (ω_{1}, ω_{2}, \dots, ω_{m})] \in R^{ν \times n m}, \\ ω_{k} & : = & [0_{(n - k) \times k}, I_{n - k}] \in R^{(n - k) \times n}, k = 1, 2, \dots, m, \\ Ω^{T} Ω = I_{ν}, {∥ Ω ∥}_{2} = 1 . \end{matrix}

Define the lower triangular matrix

M : = Ω (R^{T} \otimes I_{m}) Ω^{T} \in R^{ν \times ν}

whose elements are determined entirely from the elements of R. It can be shown that

\sum_{k = i}^{n} t_{i k} q_{k}^{T} δ q_{j} = M x .

The matrix M has the form

M = [\begin{array}{c} r_{11} & 0 & \dots & 0 & 0 & 0 & \dots & 0 & \dots & 0 \\ 0 & r_{11} & \dots & 0 & 0 & 0 & \dots & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & \dots & r_{11} & 0 & 0 & \dots & 0 & \dots & 0 \\ 0 & r_{12} & \dots & 0 & r_{22} & 0 & \dots & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 & 0 & r_{22} & \dots & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & \dots & r_{12} & 0 & 0 & \dots & r_{22} & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & r_{1, m} & 0 & 0 & \dots & r_{2, m} & \dots & r_{m m} \end{array}],

which shows that this matrix is nonsingular if the diagonal elements of R are nonzero. The matrix M is called the perturbation operator matrix.

From (9), we obtain that

M x = f - Δ^{x}

(10)

where

f = vec (Low (F)) = Ω vec (F) \in R^{ν}, F = {\tilde{Q}}^{T} δ A

and the vector

Δ^{x} \in R^{ν}

has components

\begin{matrix} Δ_{ℓ}^{x} & = \sum_{k = 1}^{j} r_{k j} δ q_{i}^{T} δ q_{k}, & ℓ = i + (j - 1) n - \frac{j (j + 1)}{2}, \\ 1 \leq j \leq m, j < i \leq n . \end{matrix}

(11)

containing second-order terms in the perturbations

δ q_{i}, i = 1, 2, \dots, n

.

An asymptotic (linear) approximation of x is obtained from (10) neglecting the second-order term

Δ^{x}

,

x = M^{- 1} f .

(12)

The norm of this approximation obeys

{∥ x ∥}_{2} \leq ∥ M^{- 1} ∥_{2} {∥ f ∥}_{2},

which shows that the size of the linear bound of

{∥ x ∥}_{2}

depends on

1 / σ_{min} (M) = {∥ M^{- 1} ∥}_{2}

. As shown by Sun [23],

∥ M^{- 1} ∥_{2} \leq {∥ A^{†} ∥}_{2} .

Since

{∥ f ∥}_{2} \leq {∥ δ A ∥}_{F},

one obtains the asymptotic normwise bound

{∥ x ∥}_{2} \leq ∥ M^{- 1} ∥_{2} {∥ δ A ∥}_{F} .

Since the matrix M is lower triangular, it is usually inverted with high precision. Using (12), one can obtain asymptotic componentwise bounds on the perturbation vector x. Since

x_{ℓ} = M_{ℓ, 1 : ν}^{- 1} f, ℓ = 1, 2, \dots ν,

(13)

it follows that

| x_{ℓ} | \leq ∥ M_{ℓ, 1 : ν}^{- 1} ∥_{2} {∥ f ∥}_{2}, ℓ = 1, 2, \dots, ν

and using the inequality

{∥ f ∥}_{2} \leq {∥ δ A ∥}_{F}

, one obtains the asymptotic bound

| x_{ℓ} | \leq x_{ℓ}^{l i n} : = ∥ M_{ℓ, 1 : ν}^{- 1} ∥_{2} {∥ δ A ∥}_{F} .

(14)

The quantity

cond (x_{ℓ}) = {∥ M_{ℓ, 1 : ν}^{- 1} ∥}_{2}

can be considered as a componentwise condition number [25] of the element

x_{ℓ}

.

Example 1.

Consider the

4 \times 3

matrix

A = [\begin{matrix} 18 & - 6 & - 18 \\ 6 & - 2 & - 8 \\ - 9 & 3.001 & 7 \\ 9 & - 3 & - 10 \end{matrix}]

and assume that it is perturbed by

\begin{matrix} δ A & = & c \cdot 10^{- k} \cdot A_{0}, \\ A_{0} & = & [\begin{matrix} 7 & - 4 & 1 \\ - 4 & 2 & - 9 \\ 1 & 6 & - 5 \\ - 8 & - 4 & 3 \end{matrix}], \end{matrix}

where c and k are varying parameters. The QR decompositions of matrices A and

A + δ A

are computed by the function qr of MATLAB^®. In the given case, the perturbation operator matrix M is of order

ν = 6

and

∥ M^{- 1} ∥_{2} = 1.71871 \times 10^{3}

.

The exact absolute values of the elements of the vector x and their linear approximations computed according to (12) for three perturbations

δ A = 10^{- 11} A_{0}, 5 \times 10^{- 9} A_{0}

, and

3 \times 10^{- 6}

of different size, are given to five decimal digits in the third and fourth columns of Table 1, respectively. It is seen that the elements of the linear estimate

x_{l i n}

closely follow the corresponding elements of the exact perturbation vector

| x |

.

3. Bounding the Perturbations of the Matrix $Q_{1}$

Consider the matrix

δ W = Q^{T} δ Q_{1} : = [δ w_{1}, δ w_{2}, \dots, δ w_{m}], δ w_{j} \in R^{n} .

The strictly lower part of this matrix contains elements of the form

q_{i}^{T} δ q_{j}, 1 \leq j \leq m, j < i \leq n,

which can be substituted by the corresponding elements

x_{ℓ}, ℓ = i + (j - 1) n - \frac{j (j + 1)}{2}

of the vector x. The elements of the strictly upper part of

δ W

are of the form

q_{i}^{T} δ q_{j}, 1 \leq i < j \leq m,

which, according to the orthogonality condition (8), can be represented as

q_{i}^{T} δ q_{j} = - q_{j}^{T} δ q_{i} - δ q_{i}^{T} δ q_{j} .

(15)

In this way, the matrix

δ W

can be written as

δ W = δ V + δ D - δ Y,

(16)

where the matrix

\begin{matrix} δ V & = & [\begin{matrix} 0 & - x_{1} & - x_{2} & \dots & - x_{m - 1} \\ x_{1} & 0 & - x_{n} & \dots & - x_{n + m - 3} \\ x_{2} & x_{n} & 0 & \dots & - x_{2 n + m - 6} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ x_{m - 1} & x_{n + m - 3} & x_{2 n + m - 6} & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ x_{n - 1} & x_{2 n - 3} & x_{3 n - 6} & \dots & x_{ν} \end{matrix}] \\ : = & [δ v_{1}, δ v_{2}, \dots, δ v_{m}], v_{j} \in R^{n} \end{matrix}

has elements depending only on the basic perturbation parameters,

δ D = [\begin{matrix} q_{1}^{T} δ q_{1} & 0 & \dots & 0 \\ 0 & q_{2}^{T} δ q_{2} & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & q_{m}^{T} δ q_{m} \\ 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & \dots & 0 \end{matrix}] \in R^{n \times m},

and the matrix

δ Y = [\begin{matrix} 0 & δ q_{1}^{T} δ q_{2} & δ q_{1}^{T} δ q_{3} & \dots & δ q_{1}^{T} δ q_{m} \\ 0 & 0 & δ q_{2}^{T} δ q_{3} & \dots & δ q_{2}^{T} δ q_{m} \\ 0 & 0 & 0 & \dots & δ q_{3}^{T} δ q_{m} \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & 0 & \dots & δ q_{m - 1}^{T} δ q_{m} \\ 0 & 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & 0 & \dots & 0 \end{matrix}] \in R^{n \times m},

contains second-order terms in

δ q_{j}, j = 1, 2, \dots, m

.

Consider how to determine the diagonal elements of the matrix W (the nontrivial elements of D) from the elements of x. Denote that

α_{j} = δ q_{j}^{T} q_{j}

. According to (8), one has that

2 δ q_{j}^{T} q_{j} = - δ q_{j}^{T} δ q_{j}, 1 \leq j \leq m,

or

2 α_{j} = - {∥ δ q_{j} ∥}^{2} .

The above expression shows that

α

is always nonnegative.

On the other hand, we have that

δ w_{j} = δ v_{j} + [\begin{matrix} 0 \\ ⋮ \\ α_{j} \\ ⋮ \\ 0 \end{matrix}] \begin{matrix} \leftarrow j, \end{matrix} j = 1, 2, \dots, m

so that

∥ δ w_{j} ∥_{2}^{2} = {∥ δ v_{j} ∥}_{2}^{2} + α_{j}^{2} .

(17)

From

δ w_{j} = Q^{T} δ q_{j},

it follows that

∥ δ w_{j} ∥_{2} = {∥ δ q_{j} ∥}_{2} = - 2 α_{j} .

(18)

From (17) and (18), we obtain the quadratic equation

α_{j}^{2} + 2 α_{j} + {∥ δ v_{j} ∥}_{2}^{2} = 0 .

(19)

The negative solution of this equation is

α_{j}^{n o n l} = - {∥ δ v_{j} ∥}_{2}^{2} / (1 + \sqrt{1 - ∥ δ v_{j} ∥_{2}^{2}}), j = 1, 2, \dots, m .

(20)

For a small perturbation

δ A

(small values of

∥ δ v_{j} ∥_{2}

), one has the estimate

α_{j}^{l i n} = - {∥ δ v_{j} ∥}_{2}^{2} / 2 .

Thus, for small perturbations, the quantities

| α_{j}^{l i n} |, j = 1, 2, \dots, m

depend quadratically on

{∥ δ A ∥}_{F}

.

In Table 2, for the same matrix and perturbations that are given in Example 1, we give the exact values of

α_{j}

and their linear

α_{j}^{l i n}

and nonlinear

α_{j}^{n o n l}

estimates computed using the exact vectors x.

Thus, having the linear approximations of the elements of x, one can compute the linear approximations of the matrices

δ V

and

δ D

. According to (16), the sum

δ V + δ D

is the linear approximation of

δ W

, and

δ Y

contains second-order terms in

{∥ δ A ∥}_{F}

that can be neglected in the asymptotic analysis. As shown below, the determining of an estimate of

δ W

allows one to find a bound on

δ Q_{1}

.

3.1. Normwise Bounds

The estimate of

∥ x^{l i n} ∥_{2}

can be used to find an asymptotic normwise bound of

∥ δ Q_{1} ∥_{F}

. In determining condition numbers, one assumes

{∥ δ A ∥}_{F} \to 0

, so that

{∥ δ W ∥}_{F} \approx {∥ δ V ∥}_{F}

. From Equation (16), it follows that the Frobenius norm of the strictly upper triangular part

Up (δ V)

of the matrix

δ V

is less than (if

m < n

) or equal (if

m = n

) to the norm of the strictly lower part

Low (δ V)

. Since

{∥ Low (δ V) ∥}_{F} = {∥ x^{l i n} ∥}_{2}

, we have that

{∥ δ W ∥}_{F} \leq \sqrt{2} {∥ x^{l i n} ∥}_{2}

, and the change of the matrix

Q_{1}

obeys

∥ δ Q_{1} ∥_{F} = ∥ Q^{T} δ Q_{1} ∥_{F} \leq \sqrt{2} ∥ x^{l i n} ∥_{2} \leq c_{Q} {∥ δ A ∥}_{F},

(21)

where

c_{Q} {∥ δ A ∥}_{F}

is an asymptotic normwise bound on

∥ δ Q_{1} ∥_{F}

and

c_{Q} : = \sqrt{2} {∥ M^{- 1} ∥}_{2}

can be considered as a normwise condition number of the matrix

Q_{1}

with respect to the perturbations of A.

Since, in first-order approximation, it is fulfilled that

δ R = δ Q^{T} A + Q^{T} δ A,

considering (21), one obtains that

{∥ δ R ∥}_{F} \leq c_{R} {∥ δ A ∥}_{F},

(22)

where

c_{R} = 1 + 2 \sqrt{2} ∥ M^{- 1} ∥_{2} {∥ A ∥}_{F}

is the normwise condition number of the matrix R with respect to the perturbation

δ A

.

The asymptotic normwise estimates of

δ Q

and

δ R

thus obtained coincide with the corresponding estimates derived in [17,23].

3.2. Componentwise Bounds

The componentwise bounds of the elements of the matrix

δ Q_{1}

can be found by using the componentwise estimates of the elements of x. An asymptotic bound on the matrix

| δ W = Q^{T} δ Q_{1} |

is given by

| δ W^{l i n} | = | δ V | = [\begin{matrix} | α_{1}^{l i n} | & | x_{1}^{l i n} | & | x_{2}^{l i n} | & \dots & | x_{m - 1}^{l i n} | \\ | x_{1}^{l i n} | & | α_{2}^{l i n} | & | x_{n}^{l i n} | & \dots & | x_{n + m - 3}^{l i n} | \\ | x_{2}^{l i n} | & | x_{n}^{l i n} | & | α_{3}^{l i n} | & \dots & | x_{2 n + m - 6}^{l i n} | \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ | x_{m - 1}^{l i n} | & | x_{n + m - 3}^{l i n} | & | x_{2 n + m - 6}^{l i n} | & \dots & | α_{m}^{l i n} | \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ | x_{n - 1}^{l i n} | & | x_{2 n - 3}^{l i n} | & | x_{3 n - 6}^{l i n} | & \dots & | x_{ν}^{l i n} | \end{matrix}] \in R^{n \times m} .

Considering that

δ Q_{1} = Q δ W

and using (16), a linear approximation of the perturbation

| δ Q_{1} |

is determined as

| δ Q_{1} | ⪯ δ Q_{1}^{l i n} = | Q | | δ W^{l i n} | .

(23)

This equation gives asymptotic bounds of the perturbations in the individual elements

q_{i j}

, i.e., componentwise perturbation bounds of the matrix

Q_{1}

. Since

{∥ | Q | ∥}_{F}

=

{∥ Q ∥}_{F} = \sqrt{n}

, we have that

∥ δ Q_{1}^{l i n} ∥_{F} \leq \sqrt{n} {∥ δ W^{l i n} ∥}_{F},

i.e., the obtaining of the asymptotic componentwise estimate

δ Q_{1}^{l i n}

through (23) may increase the bounds on

| δ q_{i j} |

at most

\sqrt{n}

times.

In Table 3, we give, for the same QR decomposition as the one presented in Example 1, the exact values of

| δ q_{i j} |

and their linear approximations

δ q_{i j}^{l i n}

for

δ A = 3 \times 10^{- 6} A_{0}

. The comparison of the componentwise bounds with the normwise linear bound

B (δ Q^{l i n}) = c_{Q} {∥ δ A ∥}_{F}

shows that the bounds on the individual elements of

δ Q_{1}

are smaller than

B (δ Q^{l i n})

for all

j \leq m, j < i \leq n

. The difference between the componentwise and normwise bounds is particularly significant for the elements in the first column of

δ Q_{1}

whose absolute values are of order

10^{- 7}

, while the normwise bound is of order

10^{- 1}

.

4. Estimating Column Subspace Sensitivity

The determination of bounds on the elements of the matrix

δ Q_{1}

makes it possible to estimate the sensitivity of the column subspaces

X_{p} = R ([a_{1}, a_{2}, \dots, a_{p}]), p = 1, 2, \dots, m

. (Note that, for

p = m

, the corresponding column subspace

X_{m}

coincides with the range

R (A)

of A.) Since we assume that R is of full rank, we have that

R ([a_{1}, a_{2}, \dots, a_{p}]) = R ([q_{1}, q_{2}, \dots, q_{p}]), p = 1, 2, \dots, m

, i.e., the first

p \leq m

columns of Q form an orthonormal basis for the subspace

X_{p}

.

As is known [26], the sensitivity of a subspace of dimension p is measured by the p angles between the perturbed and unperturbed subspace. Let

Q_{X}

and

{\tilde{Q}}_{X}

be the orthonormal bases for

X_{p}

and its perturbed counterpart

{\tilde{X}}_{p}

, respectively. Then, the maximum angle

δ Θ {max}_{p} : = δ Θ max ({\tilde{X}}_{p}, X_{p})

between

{\tilde{X}}_{p}

and

X_{p}

is determined from [26]

sin (δ Θ {max}_{p}) = {∥ {Q_{X}^{⊥}}^{T} {\tilde{Q}}_{X} ∥}_{2},

(24)

where

Q_{X}^{⊥}

is the orthogonal complement of

Q_{X}

,

{Q_{X}^{⊥}}^{T} Q_{X} = 0

. Since

{\tilde{Q}}_{X} = Q_{X} + δ Q_{X},

one has that

sin (δ Θ {max}_{p}) = {∥ {Q_{X}^{⊥}}^{T} δ Q_{X} ∥}_{2} .

(25)

Equation (25) shows that the sensitivity of the column subspace

X_{p}

is related to the values of the basic perturbation parameters

x_{ℓ} = q_{i}^{T} δ q_{j}, ℓ = i + (j - 1) n - \frac{j (j + 1)}{2}, i > p, j = 1, 2, \dots, p

. In particular, for

p = 1

, the sensitivity of the first column of A is determined as

sin (δ Θ max (\tilde{X_{1}}, X_{1})) = {∥ δ W_{2 : n, 1} ∥}_{2},

for

p = 2

, one has

sin (δ Θ max (\tilde{X_{2}}, X_{2})) = {∥ δ W_{3 : n, 1 : 2} ∥}_{2}

and so on (see Figure 1).

In this way, if the basic perturbation parameters are known, it is possible to find the sensitivity estimates of all column subspaces with dimension

p = 1, 2, \dots, m

. More specifically, let

δ W = [\begin{matrix} \times & \times & \times & \dots & \times \\ x_{1} & \times & \times & \dots & \times \\ x_{2} & x_{n} & \times & \dots & \times \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ x_{m - 1} & x_{n + m - 3} & x_{2 n + m - 6} & \dots & \times \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ x_{n - 1} & x_{2 n - 3} & x_{3 n - 6} & \dots & x_{ν} \end{matrix}] \in R^{n \times m} .

Then, we have that the maximum angle between the perturbed and unperturbed column subspace of dimension p is

δ Θ {max}_{p} = arcsin (∥ δ W_{p + 1 : n, 1 : p} ∥_{2}) .

(26)

In particular, for the sensitivity of

R (A)

, we obtain that

sin (δ Θ max ({\tilde{X}}_{m}, X_{m})) = {∥ δ W_{m + 1 : n, 1 : m} ∥}_{2} .

An asymptotic estimate of the maximum angle can be obtained, if, in the expression for the matrix

δ W

, the elements

x_{ℓ}, ℓ = 1, 2, \dots, ν

are replaced by their linear approximations (12). Representing the matrix

M^{- 1}

as

M^{- 1} = [\begin{matrix} M_{1, 1 : ν}^{- 1} \\ M_{2, 1 : ν}^{- 1} \\ M_{3, 1 : ν}^{- 1} \\ ⋮ \\ M_{ν, 1 : ν}^{- 1} \end{matrix}],

the matrix

δ W

can be written as

δ W = [\begin{matrix} \times & \times & \times & \dots & \times \\ M_{1, 1 : ν}^{- 1} f & \times & \times & \dots & \times \\ M_{2, 1 : ν}^{- 1} f & M_{n, 1 : ν}^{- 1} f & \times & \dots & \times \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ M_{n - 1, 1 : ν}^{- 1} f & M_{2 n - 3, 1 : ν}^{- 1} f & M_{3 n - 6, 1 : ν}^{- 1} f & \dots & \times \end{matrix}] = L (I_{n} \otimes f),

where the rows of

M^{- 1}

are highlighted in boxes,

L = [\begin{matrix} \times & \times & \times & \dots & \times \\ M_{1, 1 : ν}^{- 1} & \times & \times & \dots & \times \\ M_{2, 1 : ν}^{- 1} & M_{n, 1 : ν}^{- 1} & \times & \dots & \times \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ M_{n - 1, 1 : ν}^{- 1} & M_{2 n - 3, 1 : ν}^{- 1} & M_{3 n - 6, 1 : ν}^{- 1} & \dots & \times \end{matrix}] \in R^{n \times n ν}

and

I_{n} \otimes f = [\begin{matrix} f \\ f \\ ⋱ \\ f \end{matrix}] \in R^{n ν \times n} .

Using the fact that

∥ I_{n} {\otimes f ∥}_{2} = {∥ f ∥}_{2},

we obtain the following asymptotic estimate,

\begin{matrix} | δ Θ {max}_{p} | & \leq & arcsin (∥ L_{p + 1 : n, 1 : p ν} ∥_{2} ∥ f ∥_{2}) \\ \leq & arcsin (∥ L_{p + 1 : n, 1 : p ν} ∥_{2} ∥ δ A ∥_{F}), \\ p = 1, 2, \dots, m . \end{matrix}

(27)

Thus, an asymptotic bound of

δ Θ max ({\tilde{X}}_{p}, X_{p})

is determined as

| δ Θ {max}_{p} | \leq δ Θ {max}_{p}^{l i n} : = cond (Θ {max}_{p}) {∥ δ A ∥}_{F},

(28)

where the quantity

cond (Θ {max}_{p}) : = {∥ L_{p + 1 : n, 1 : p ν} ∥}_{2}

can be considered as a condition number of the column subspace

X_{p}

. The derivation of

cond (Θ {max}_{p})

is performed such that to find its possible minimum value.

In Table 4, we give the exact values of maximum angle

| δ Θ {max}_{p} |

and its asymptotic bound

δ Θ {max}_{p}^{l i n}

for the perturbation problem considered in Example 1. In all cases, the size of the estimate matches correctly the size of the actual maximum angle between the perturbed and unperturbed subspace.

5. Perturbation Bounds of the Elements of R

It is convenient to first consider the sensitivity of the nontrivial elements of the upper triangular matrix R for the case of the diagonal elements. Due to the nonsingularity of R, these elements are nonzero.

5.1. Sensitivity Estimates of the Diagonal Elements of R

The changes in the elements of the perturbed matrix R satisfy

δ r_{i j} = {\tilde{r}}_{i j} - r_{i j} = {\tilde{q}}_{i}^{T} (a_{j} + δ a_{j}), 1 \leq i \leq j \leq m .

The above equation can be rewritten as

δ r_{i j} = δ q_{i}^{T} a_{j} + {\tilde{q}}_{i}^{T} δ a_{j} .

(29)

Using Equations (7) and (8), one obtains for the perturbations of the diagonal (

i = j

) elements of R, the expressions

δ r_{i i} = - \sum_{k = 1}^{i} r_{k i} q_{i}^{T} δ q_{k} - \sum_{k = 1}^{i} r_{k i} δ q_{i}^{T} δ q_{k} + {\tilde{q}}_{i}^{T} δ a_{i}, i = 1, 2, \dots, m .

(30)

Further on, we shall use the following quantities:

The diagonal elements of the matrix ${\tilde{Q}}^{T} δ A$ ,

$g = {[{\tilde{q}}_{1}^{T} δ a_{1}, {\tilde{q}}_{2}^{T} δ a_{2}, \dots, {\tilde{q}}_{m}^{T} δ a_{m}]}^{T} \in R^{m} .$
The changes of the diagonal elements of R,

$δ r_{d i a g} = {[δ r_{11}, δ r_{22}, \dots, δ r_{m m}]}^{T} \in R^{m} .$
The diagonal elements of W,

$α = {[α_{1}, α_{2}, \dots, α_{m}]}^{T} \in R^{m} .$
The quadratic terms in (30),

$Δ^{d} = {[Δ_{1}^{d}, Δ_{2}^{d}, \dots, Δ_{m}^{d}]}^{T} \in R^{m},$

where

$Δ_{i}^{d} = - \sum_{k = 1}^{i} r_{k i} δ q_{i}^{T} δ q_{k}, i = 1, 2, \dots, m .$

Denote the columns of

I_{n}

as

e_{j}, j = 1, 2, \dots, n

and the columns of

I_{m}

as

η_{j}, j = 1, 2, \dots, m

. Then, the system of Equation (30) can be represented as

δ r_{d i a g} = N_{1} x + N_{2} α + g + Δ^{d},

(31)

where

N_{1} = - Π (R^{T} \otimes I_{n}) Ω^{T} \in R^{m \times ν}, N_{2} = - diag (r_{11}, r_{22}, \dots, r_{m m}) \in R^{m \times m},

Π = [η_{1} e_{1}^{T}, η_{2} e_{2}^{T}, \dots, η_{m} e_{m}^{T}] \in R^{m \times n \cdot m},

and the matrix

Ω

was defined earlier. Neglecting the quadratic terms in (31), one obtains the linear estimate

δ r_{d i a g} = N_{1} M^{- 1} f + g .

(32)

Equation (32) can be represented in the compact form

δ r_{d i a g} = [N_{1} M^{- 1}, I_{m}] [\begin{matrix} f \\ g \end{matrix}] .

(33)

Using (33), one can derive condition numbers of the diagonal elements of R. Let

Z = [N_{1} M^{- 1}, I_{m}] \in R^{m \times (ν + m)} .

Since

{∥[\begin{matrix} f \\ g \end{matrix}]∥}_{2} \leq {∥ δ A ∥}_{F},

it follows from (33) that the asymptotic perturbation

δ r_{i i}

satisfies

| δ r_{i i} | \leq δ r_{i i}^{l i n} : = cond (r_{i i}) {∥ δ A ∥}_{F}, i = 1, 2, \dots, m,

(34)

where

cond (r_{i i}) = {∥ Z_{i, 1 : ν + m} ∥}_{2}

(35)

is considered as a condition number of

r_{i i}

. The derivation of (35) is performed to find the minimum possible value of

cond (r_{i i})

.

In Table 5, for the matrix A and the perturbations given in Example 1, we present the exact perturbations

| δ r_{i i} |

of the diagonal elements of R and their linear and nonlinear estimates. The normwise quantities

B (δ R^{l i n})

and

B (δ R^{n o n l})

are the normwise linear and nonlinear bounds, derived in [17,23]. These bounds are more pessimistic than the bounds

δ r_{i i}^{l i n}

and

δ r_{i i}^{n o n l}

.

5.2. Sensitivity Estimates of the Super Diagonal Elements of R

According to (29), the perturbations of the super diagonal elements of the matrix R can be determined as

\begin{matrix} δ r_{i j} = {\tilde{r}}_{i j} - r_{i j} & = & - \sum_{k = 1}^{j} r_{k j} q_{i}^{T} δ q_{k} - \sum_{k = 1}^{j} r_{k j} δ q_{i}^{T} δ q_{k} + {\tilde{q}}_{i}^{T} δ a_{j}, \\ 1 \leq i < j \leq m . \end{matrix}

(36)

Let us define the vectors (the elements of the corresponding matrices are taken row-wise),

\begin{matrix} δ r_{s u p d} & : = & vec ({(Up (δ R))}^{T}) = Ω_{2} vec (δ R^{T}) \in R^{ν_{2}}, ν_{2} = m (m - 1) / 2, \\ {(δ r_{s u p d})}_{ℓ_{2}} = δ r_{i j}, ℓ_{2} = j + (i - 1) m - \frac{i (i + 1)}{2}, 1 \leq i < j \leq m, \\ y & : = & vec ({(Up (Q_{1}^{T} δ Q_{1}))}^{T}) = Ω_{2} vec ({(Q_{1}^{T} δ Q_{1})}^{T}) \in R^{ν_{2}}, \\ y_{ℓ_{2}} = q_{i}^{T} δ q_{j}, \\ h & : = & vec ({(Up ({\tilde{Q}}_{1}^{T} δ A))}^{T}) = Ω_{2} vec {(({\tilde{Q}}_{1}^{T} δ A))}^{T}) \in R^{ν_{2}}, \\ h_{ℓ_{2}} = {\tilde{q}}_{i}^{T} δ a_{j}, \end{matrix}

and

\begin{matrix} Δ^{r} & = & [\begin{matrix} Δ_{1}^{r} \\ Δ_{2}^{r} \\ ⋮ \\ Δ_{ν_{2}}^{r} \end{matrix}], \begin{matrix} Δ_{ℓ_{2}}^{r} & = & - \sum_{k = 1}^{j} r_{k j} δ q_{i}^{T} δ q_{k}, \\ ℓ_{2} = j + (i - 1) m - \frac{i (i + 1)}{2}, 1 \leq i < j \leq m, \end{matrix} \end{matrix}

(37)

where

\begin{matrix} Ω_{2} & : = & [diag (ω_{1}, ω_{2}, \dots, ω_{m - 1}), 0_{ν_{2} \times m}] \in R^{ν_{2} \times m^{2}}, \\ ω_{k} & : = & [0_{(m - k) \times k}, I_{m - k}] \in R^{(m - k) \times m}, k = 1, 2, \dots, m - 1, \\ Ω_{2}^{T} Ω_{2} = I_{m^{2}}, {∥ Ω_{2} ∥}_{2} = 1 . \end{matrix}

Then, Equation (36) may be represented as the system of

ν_{2}

nonlinear algebraic equations

δ r_{s u p d} = M_{1} y + M_{2} x + M_{3} α + h + Δ^{r}, 1 \leq i < j \leq m,

(38)

where

M_{1}, M_{2}

and

M_{3}

are matrices whose elements are functions of the elements of R. These matrices are determined from

M_{1} = - Ω_{2} P_{v e c} (R^{T} \otimes I_{m}) P_{v e c} Ω_{2}^{T} \in R^{ν_{2} \times ν_{2}},

M_{2} = - Ω_{2} P_{v e c} (R^{T} \otimes I_{m}) Ω_{3}^{T} \in R^{ν_{2} \times ν},

M_{3} = - Ω_{2} (I_{m} \otimes R^{T}) Π^{T} \in R^{ν_{2} \times m},

where

\begin{matrix} Ω_{3} & : = & [\begin{matrix} diag (ω_{1}, ω_{2}, \dots, ω_{m - 1}), 0_{(ν - q) \times m} \\ 0_{q \times m^{2}} \end{matrix}] \in R^{ν \times m^{2}}, q = 2 (n - m), \\ ω_{k} & : = & [0_{(m - k) \times k}, I_{m - k}] \in R^{(m - k) \times m}, k = 1, 2, \dots, m - 1, \\ Ω_{3}^{T} Ω_{3} = I_{m^{2}}, {∥ Ω_{3} ∥}_{2} = 1, \end{matrix}

and

P_{v e c}

is the vec-permutation matrix as determined from ([27], Ch. 4)

vec (A^{T}) = P_{v e c} vec (A) .

According to (15), the components of the vector y satisfy

\begin{matrix} y_{ℓ_{2}} = - x_{ℓ} - δ q_{i}^{T} δ q_{j}, & ℓ = j + (i - 1) n - \frac{i (i + 1)}{2}, \\ ℓ_{2} = j + (i - 1) m - \frac{i (i + 1)}{2}, \\ 1 \leq i < j \leq m . \end{matrix}

(39)

In a linear approximation, one has

y_{ℓ_{2}} = - x_{ℓ},

and it is possible to show that

y = Ω_{4} x,

where

\begin{matrix} Ω_{4} & : = & [diag (ω_{1}, ω_{2}, \dots, ω_{m - 1}), 0_{ν_{2} \times (n - m)}] \in R^{ν_{2} \times ν}, \\ ω_{k} & : = & [I_{m - k}, 0_{(m - k) \times (n - m)}] \in R^{(m - k) \times (n - k)}, k = 1, 2, \dots, m - 1, \\ Ω_{4}^{T} Ω_{4} = I_{ν}, {∥ Ω_{4} ∥}_{2} = 1 . \end{matrix}

Neglecting the second-order terms in Equation (38) and using the linear estimate

x = M^{- 1} f

, one obtains the asymptotic estimate

δ r_{s u p d} = - M_{1} Ω_{4} x + M_{2} x + h = - M_{1} Ω_{4} M^{- 1} f + M_{2} M^{- 1} f + h, 1 \leq i < j \leq m .

Let us denote

Z = [| M_{1} Ω_{4} M^{- 1} | + | M_{2} M^{- 1} |, I_{ν_{2}}] \in R^{ν_{2} \times (ν + ν_{2})} .

Since

{∥[\begin{matrix} f \\ h \end{matrix}]∥}_{2} \leq {∥ δ A ∥}_{F},

one concludes that, in a first-order approximation, the super diagonal elements of

| δ R |

fulfill

| δ r_{i j} | ⪯ δ r_{i j}^{l i n} = cond (r_{i j}) {∥ δ A ∥}_{F}, 1 \leq i < j \leq m,

(40)

where

\begin{matrix} cond (r_{i j}) = {∥ Z_{ℓ_{2}, 1 : ν + ν_{2}} ∥}_{2}, & ℓ_{2} = j + (i - 1) m - \frac{i (i + 1)}{2}, \\ 1 \leq i < j \leq m . \end{matrix}

(41)

Equation (40) gives asymptotic componentwise perturbation bounds for the super diagonal part of R. The quantity

cond (r_{i j})

represents the condition number of

r_{i j}

with respect to the perturbations in A.

In Table 6, for the matrix A and the perturbations given in Example 1, we give the exact perturbations of the super diagonal elements of R and their linear estimates. As in the case of the diagonal elements, the normwise linear and nonlinear bounds

B (δ R^{l i n})

and

B (δ R^{n o n l})

give worse estimates than

δ r_{i j}^{l i n}

.

Hence, the full asymptotic componentwise perturbation analysis of the QR decomposition can be conducted using Equations (12), (23), (28), (34) and (40).

6. Determining Global Perturbation Bounds

Based on the analysis presented above, it is possible to derive an iterative scheme for finding global perturbation bounds of the QR decomposition. The main task of such a scheme is to find a nonlinear estimate of the vector x of the basic perturbation parameters. For this aim, it is necessary to estimate the quadratic term

Δ^{x}

in (10). The analysis of the expression (10) shows that

Δ^{x}

contains terms involving the perturbations

δ q_{i}

for

m < i \leq n

, which are not estimates up to the moment since they are columns of the matrix

δ Q_{2} = {\tilde{Q}}_{2} - Q_{2}

. As mentioned previously, the matrix

Q_{2}

is not unique, and consequently its perturbation

δ Q_{2}

is also non-unique. However, the problem with finding

δ Q_{2}

of the minimum norm for a fixed

Q_{2}

has a unique solution, and our first task in this section is to find an approximation of this perturbation.

6.1. Perturbation Bounds of the Columns of $Q_{2}$

According to (3), the perturbation

δ Q_{2}

should satisfy the conditions:

\begin{matrix} {(Q_{1} + δ Q_{1})}^{T} (Q_{2} + δ Q_{2}) & = & 0, \end{matrix}

(42)

\begin{matrix} {(Q_{2} + δ Q_{2})}^{T} (Q_{2} + δ Q_{2}) & = & I_{n - m} . \end{matrix}

(43)

Equations (42) and (43) can be represented as

\begin{matrix} Q_{1}^{T} δ Q_{2} + δ Q_{1}^{T} Q_{2} & = & - δ Q_{1}^{T} δ Q_{2}, \\ Q_{2}^{T} δ Q_{2} + δ Q_{2}^{T} Q_{2} & = & - δ Q_{2}^{T} δ Q_{2} . \end{matrix}

Setting

X_{1} = Q_{1}^{T} δ Q_{2}, X_{2} = Q_{2}^{T} δ Q_{2}

, we obtain that

\begin{matrix} o r t h_{1} (X_{1}, X_{2}) & : = & (I_{m} + W_{1}^{T}) X_{1} + W_{2}^{T} X_{2} + W_{2}^{T} = 0, \end{matrix}

(44)

\begin{matrix} o r t h_{2} (X_{1}, X_{2}) & : = & X_{2} + X_{2}^{T} + X_{1}^{T} X_{1} + X_{2}^{T} X_{2} = 0, \end{matrix}

(45)

where

W_{1} = Q_{1}^{T} δ Q_{1}, W_{2} = Q_{2}^{T} δ Q_{1}

. (Note that

δ W = {[W_{1}^{T} W_{2}^{T}]}^{T}

is already estimated). For sufficiently small perturbations

δ Q_{1}

, the matrix

I_{m} + W_{1}^{T}

is nonsingular, and we have that

\begin{matrix} X_{1} & = & - {(I_{m} + W_{1}^{T})}^{- 1} W_{2}^{T} (I_{n - m} + X_{2}), \end{matrix}

(46)

\begin{matrix} X_{2} + X_{2}^{T} & = & - X_{1}^{T} X_{1} - X_{2}^{T} X_{2} . \end{matrix}

(47)

In the first-order analysis of (47), the term

X_{2}^{T} X_{2}

can be neglected, and we have the approximation

X_{2} + X_{2}^{T} \approx - X_{1}^{T} X_{1} .

(48)

As shown in Appendix A, the minimum norm solution of the matrix Equation (48) with respect to

X_{2}

is

X_{2}^{a p p r} = - X_{1}^{T} X_{1} / 2 .

(49)

The expression (49) shows that the size of the minimum norm matrix

X_{2}^{a p p r}

is of second order regarding to the size of

X_{1}

, and hence

X_{2}

can be neglected in the asymptotic analysis of (46). Thus, we obtain the first-order approximations

\begin{matrix} X_{1}^{a p p r} = - {(I_{m} + W_{1}^{T})}^{- 1} W_{2}^{T}, \end{matrix}

(50)

\begin{matrix} X_{2}^{a p p r} = - X_{1}^{T} X_{1} / 2 . \end{matrix}

(51)

In this way, the matrix

X = [\begin{matrix} X_{1} \\ X_{2} \end{matrix}] = Q^{T} δ Q_{2}

is approximated as

X^{a p p r} = [\begin{matrix} X_{1}^{a p p r} \\ X_{2}^{a p p r} \end{matrix}],

and an approximation of

δ Q_{2}

is obtained as

δ Q_{2}^{a p p r} = Q X^{a p p r} .

(52)

In Table 7, for the perturbation problem presented in Example 1, we show the quantities related to the approximation of

δ Q_{2}

and the norms of the matrices

\begin{matrix} o r t h_{3} (\tilde{Q}) & = & I_{n} - {\tilde{Q}}^{T} \tilde{Q}, \\ o r t h_{3} ({\tilde{Q}}^{a p p r}) & = & I_{n} - {({\tilde{Q}}^{a p p r})}^{T} {\tilde{Q}}^{a p p r}, \end{matrix}

characterizing the errors in the orthogonal matrices

\tilde{Q}

and

{\tilde{Q}}^{a p p r}

, respectively. The approximation of the perturbed orthogonal factor

{\tilde{Q}}^{a p p r}

is obtained as

{\tilde{Q}}^{a p p r} = [Q_{1} + δ Q_{1}, Q_{2} + δ Q_{2}^{a p p r}],

where

δ Q_{1}

is the exact perturbation of

Q_{1}

. These quantities are computed for the three perturbations

δ A = 10^{- 11} A_{0}, 5 \times 10^{- 9} A_{0}

and

3 \times 10^{- 6} A_{0}

. The results given in the table confirm the assumptions from the perturbation analysis of

Q_{2}

.

For the same example used previously, in Table 8, we give the exact values of the elements of

δ Q_{2}

and their approximations using (52). The exact minimum norm perturbation

δ Q_{2}

is found numerically by solving the minimization problem

δ Q_{2} = min_{U} {∥ U ∥}_{F}

under the constraint

{\tilde{Q}}^{T} \tilde{Q} = I_{n}, \tilde{Q} = [Q_{1} + δ Q_{1}, Q_{2} + U]

. The minimization is performed by the MATLAB^® function fmincon. The results show that, in all cases,

| δ q_{i j}^{a p p r} |

is close to

| δ q_{i j} |

.

6.2. Iterative Procedure for Finding Global Bounds of the Elements of x

Since one has linear estimates of the basic perturbation terms

x_{ℓ} = q_{i}^{T} δ q_{j}

, it is appropriate to substitute the terms containing the perturbations

δ q_{j}

in Equation (16) by the perturbations

δ w_{j} = Q^{T} δ q_{j}, j = 1, 2, \dots, m,

which are of the same size as

δ q_{j}

. Since

δ q_{i}^{T} δ q_{j} = δ q_{i}^{T} Q Q^{T} δ q_{j} = δ w_{i}^{T} δ w_{j},

the absolute value of the matrix

δ W

(16) can be bounded as

\begin{matrix} | δ W | & = & | Q^{T} δ Q_{1} | : = [| δ w_{1} |, | δ w_{2} |, \dots, | δ w_{m} |], \\ ⪯ & δ W^{n o n l} = | δ V | + | δ D | + | δ Y |, \end{matrix}

(53)

where

| δ V | = [\begin{matrix} 0 & | x_{1} | & | x_{2} | & \dots & | x_{m - 1} | \\ | x_{1} | & 0 & | x_{n} | & \dots & | x_{n + m - 3} | \\ | x_{2} | & | x_{n} | & 0 & \dots & | x_{2 n + m - 6} | \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ | x_{m - 1} | & | x_{n + m - 3} | & | x_{2 n + m - 6} | & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ | x_{n - 1} | & | x_{2 n - 3} | & | x_{3 n - 6} | & \dots & | x_{ν} | \end{matrix}] \in R^{n \times m},

| δ D | = [\begin{matrix} | α_{1} | & 0 & 0 & \dots & 0 \\ 0 & | α_{2} | & 0 & \dots & 0 \\ 0 & 0 & | α_{3} | & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & 0 & \dots & | α_{m} | \\ 0 & 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & 0 & \dots & 0 \end{matrix}] \in R^{n \times m},

| δ Y | = [\begin{matrix} 0 & | δ w_{1}^{T} | | δ w_{2} | & | δ w_{1}^{T} | | δ w_{3} | & \dots & | δ w_{1}^{T} | | δ w_{m} | \\ 0 & 0 & | δ w_{2}^{T} | | δ w_{3} | & \dots & | δ w_{2}^{T} | | δ w_{m} | \\ 0 & 0 & 0 & \dots & | δ w_{3}^{T} | | δ w_{m} | \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & 0 & \dots & | δ w_{m - 1}^{T} | | δ w_{m} | \\ 0 & 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & 0 & \dots & 0 \end{matrix}] \in R^{n \times m} .

Since the unknown column estimates

| δ w_{j} |

participate in both sides of (53), it is possible to obtain

| δ w_{j} |

recursively as follows.

Let

| δ w_{1} | = | δ v_{1} | + | δ d_{1} |,

where

| δ v_{1} |

and

| δ d_{1} |

are the first columns of

| δ V |

and

| δ D |

, respectively. Then, the next column estimates

| δ w_{j} |, j = 2, 3, \dots m

can be determined as

| δ w_{j} | ⪯ | S_{j} |^{- 1} | δ w_{j - 1} | = | S_{j} |^{- 1} (| δ v_{j - 1} | + | δ d_{j - 1} |),

(54)

where

| S_{j} | = [\begin{matrix} e_{1}^{T} - | δ w_{1}^{T} | \\ e_{2}^{T} - | δ w_{2}^{T} | \\ ⋮ \\ e_{j - 1}^{T} - | δ w_{j - 1}^{T} | \\ e_{j}^{T} \\ ⋮ \\ e_{n}^{T} \end{matrix}] \in R^{n \times n} .

If

| | δ w_{k} {| |}_{2} < 1, k = 1, 2, \dots, j - 1

, the matrix

| S_{j} |

is strictly diagonally dominant and nonsingular ([28], p. 352) and if

| | δ w_{k} {| |}_{2}

are small, then the condition number of

S_{j}

is close to 1.

The matrix

δ W^{n o n l}

only gives estimates of the first m columns of

| Q^{T} δ Q |

. Using the representation

δ W^{n o n l} = [\begin{matrix} W_{1} \\ W_{2} \end{matrix}], W_{1} \in R^{m \times m}, W_{2} \in R^{(n - m) \times m},

one can find an approximation

X^{a p p r}

of the matrix

Q^{T} Q_{2}

using the Equations (50) and (51). Thus, an approximation of

| Q^{T} δ Q |

is obtained as

Z = [δ W^{n o n l}, | X^{a p p r} |] .

After determining estimates of

| δ w_{j} |, j = 1, 2, \dots, m

, it is possible to bound the absolute values of the quadratic terms

Δ_{ℓ}^{x}

, given in (11), as

\begin{matrix} | Δ_{ℓ}^{x} | & = \sum_{k = 1}^{j} | r_{k j} | z_{i}^{T} z_{k}, & ℓ = i + (j - 1) n - \frac{j (j - 1)}{2}, \\ 1 \leq j \leq m, j < i \leq n . \end{matrix}

(55)

The column

z_{j}, 1 \leq j \leq n

represents an estimate of

| Q^{T} δ q_{j} |

such that

| δ q_{i}^{T} δ q_{k} | \leq | δ q_{i}^{T} Q | | Q^{T} δ q_{k} | = z_{i}^{T} z_{k}

.

In this way, one obtains an iterative scheme involving Equations (11) and (53)–(55). At each step s, the value of the nonlinear estimate of x is determined from

x_{s}^{n o n l} = x^{l i n} + | M^{- 1} | | Δ_{s}^{x} |, s = 0, 1, \dots

with initial condition

x_{0}^{n o n l} = e p s {[1, 1, \dots, 1]}^{T}

, where

e p s

is the MATLAB^® function eps,

e p s = 2^{- 52} = 2 u

. The stopping criterion is taken as

e r r_{s} = ∥ x_{s}^{n o n l} - x_{s - 1}^{n o n l} ∥_{2} / {∥ x_{s - 1}^{n o n l} ∥}_{2} < t o l = 10 e p s .

This scheme converges for perturbations of restricted size. As shown in ([17], Ch. 4), the size of the maximum allowable perturbation for which the nonlinear normwise estimate of x is valid is given by

{∥ δ A ∥}_{F} \leq δ^{0} : = \frac{1}{∥ M^{- 1} ∥_{2} (2 μ_{ν} + \sqrt{2 + 8 μ_{ν}^{2}})},

(56)

where

μ_{ν} = \sqrt{(ν - 1) / (2 ν)}

.

In Table 9, we present the number of iterations necessary to find the nonlinear estimate

x^{n o n l}

for the perturbation problem considered in Example 1, along with

{∥ x ∥}_{2}

and

∥ x^{n o n l} ∥_{2}

. The components of

x^{n o n l}

are shown for three different perturbations in the fifth column of Table 1 along with the vectors

| x |

and

x^{l i n}

.

In Figure 2, we show the convergence of the relative error

e r r_{s}

as a function of s for different perturbations

δ A = 10^{- k} A_{0}

. As is seen from the figure, with the increasing perturbation size, the convergence worsens, and, for

k = - 5

(∥ δ A ∥_{F} = 1.78326 \times 10^{- 4})

, the iterations do not converge since the global bound does not exist. The convergence of the iterations is linear, and this can be improved by using appropriate optimization techniques.

6.3. Global Perturbation Bounds of $Q_{1}$ , Column Subspaces and R

Implementing the obtained nonlinear estimate of x, one may find nonlocal bounds on the perturbations of the column subspaces, diagonal and super diagonal elements of R using Equations (26), (31) and (38).

After determining the nonlinear bounds of x and

| δ W |

, it is possible to find nonlinear bounds on the perturbations of the elements of

Q_{1}

according to the relationship

δ Q_{1}^{n o n l} = | Q | | δ W^{n o n l} | .

(57)

The nonlinear bounds

δ q_{i j}^{n o n l}

of the elements of

Q_{1}

for the QR decomposition given in Example 1 and a perturbation

δ A = 3 \times 10^{- 6} A_{0}

are shown in the last column of Table 3 along with

| δ q_{i j} |

and

δ q_{i j}^{l i n}

.

A global estimate of the maximum angle between the perturbed and unperturbed column subspace of dimension p is obtained from (26). The values of

δ Θ {max}_{p}^{n o n l}

for the matrix A from Example 1 and three different perturbations are given in the last rows of Table 4.

Nonlinear bounds on the diagonal elements of R can be obtained by using the expressions

\begin{matrix} δ r_{d i a g}^{n o n l} & = & δ r_{d i a g}^{l i n} + | Δ^{d} |, \\ | Δ_{i}^{d} | & = & \sum_{k = 1}^{i} | r_{k i} | | δ w_{i}^{T} | | δ w_{k} |, \\ i = 1, 2, \dots, m, \end{matrix}

(58)

and global bounds of the perturbations of the super diagonal elements of R can be found from

\begin{matrix} δ r_{s u p d}^{n o n l} & = & δ r_{s u p d}^{l i n} + | M_{3} | α + | Δ^{r} |, \\ α & = & [| α_{1} |, | α_{2} |, \dots, | α_{m} {|]}^{T}, \\ | α_{j} | & = & ∥ δ w_{j} ∥_{2}^{2} / (1 + \sqrt{1 - ∥ δ w_{j} ∥_{2}^{2}}), j = 1, 2, \dots, m, \\ | Δ_{ℓ_{2}}^{r} | & = & \sum_{k = 1}^{j} | r_{k j} | | δ w_{i}^{T} | | δ w_{k} |, \\ ℓ_{2} = j + (i - 1) m - \frac{i (i + 1)}{2}, 1 \leq i < j \leq m . \end{matrix}

(59)

The nonlinear perturbation bounds

δ r_{i i}^{n o n l}

of the diagonal elements of R for the matrix A from Example 1 and for three perturbations

δ A

are given in Table 5, and the nonlinear bounds

δ r_{i j}^{n o n l}

of the super diagonal elements are presented in Table 6. We note that the global perturbation estimates are slightly larger than the corresponding asymptotic estimates but give guaranteed bounds on the perturbations whenever these estimate exist.

7. Comparison with Other Bounds

In this section, we consider two examples in which we compare the perturbation bounds of the QR decomposition obtained in this paper with the bounds that were previously proposed.

Example 2.

Consider the fifth-order matrix [12],

A = [\begin{matrix} 1 & - 1 & - 1 & - 1 & - 1 \\ 0 & 1 & - 1 & - 1 & - 1 \\ 0 & 0 & 1 & - 1 & - 1 \\ 0 & 0 & 0 & 1 & - 1 \\ 0 & 0 & 0 & 0 & 1 \end{matrix}] .

The matrix A is nonsingular, and its QR factors are

Q = I_{5}

and

R = A

. The perturbation matrix is the

5 \times 5

random matrix

δ_{A} = 10^{- 3} [\begin{matrix} 0.2742 & 0.2944 & - 0.3245 & 0.1483 & 0.9386 \\ 0.1186 & - 0.1669 & 0.9198 & - 0.2358 & 0.9445 \\ 0.6810 & 0.1577 & 0.1804 & 0.1979 & - 0.1045 \\ 0.8284 & - 0.9223 & 0.3286 & 0.7425 & - 0.2188 \\ 0.2091 & - 0.4420 & - 0.2410 & 0.8721 & 0.2947 \end{matrix}] .

Using the function qr of MATLAB^®, we obtain (to four decimal digits) that

| δ Q | = [\begin{matrix} 6.0357 \times 10^{- 7} & 1.1901 \times 10^{- 4} & 6.8154 \times 10^{- 4} & 8.2758 \times 10^{- 4} & 2.0877 \times 10^{- 4} \\ 1.1857 \times 10^{- 4} & 3.9005 \times 10^{- 7} & 8.3831 \times 10^{- 4} & 9.5401 \times 10^{- 5} & 2.3275 \times 10^{- 4} \\ 6.8081 \times 10^{- 4} & 8.3827 \times 10^{- 4} & 1.1808 \times 10^{- 6} & 1.0608 \times 10^{- 3} & 2.6485 \times 10^{- 4} \\ 8.2817 \times 10^{- 4} & 9.4474 \times 10^{- 5} & 1.0605 \times 10^{- 3} & 1.0795 \times 10^{- 6} & 5.8280 \times 10^{- 4} \\ 2.0904 \times 10^{- 4} & 2.3305 \times 10^{- 4} & 2.6418 \times 10^{- 4} & 5.8289 \times 10^{- 4} & 2.5378 \times 10^{- 7} \end{matrix}] .

The nonlinear bound of the perturbation of Q, obtained after 16 iterations, is

δ Q^{n o n l} = [\begin{matrix} 1.4500 \times 10^{- 5} & 2.7027 \times 10^{- 3} & 2.7360 \times 10^{- 3} & 2.7719 \times 10^{- 3} & 2.7723 \times 10^{- 3} \\ 2.6710 \times 10^{- 3} & 2.6773 \times 10^{- 5} & 3.9544 \times 10^{- 3} & 4.0418 \times 10^{- 3} & 4.0428 \times 10^{- 3} \\ 2.6876 \times 10^{- 3} & 3.8918 \times 10^{- 3} & 6.0542 \times 10^{- 5} & 7.1420 \times 10^{- 3} & 7.1444 \times 10^{- 3} \\ 2.7056 \times 10^{- 3} & 3.9535 \times 10^{- 3} & 7.0244 \times 10^{- 3} & 1.2828 \times 10^{- 4} & 1.3645 \times 10^{- 2} \\ 2.7058 \times 10^{- 3} & 3.9541 \times 10^{- 3} & 7.0263 \times 10^{- 3} & 1.3574 \times 10^{- 2} & 1.2829 \times 10^{- 4} \end{matrix}] .

The maximum element of the global estimate

B_{q r, w}

of

δ Q

, obtained in [12], is

3.59687 \times 10^{- 2}

, while the maximum element of

δ Q^{n o n l}

is

1.3645 \times 10^{- 2}

. Furthermore,

∥ B_{q r, w} ∥_{F} = 0.0648

, while

∥ δ Q^{n o n l} ∥_{F} = 0.02693

.

Example 3.

Consider a

20 \times 15

matrix A, taken as

A = P_{0} [\begin{matrix} S_{0} \\ 0 \end{matrix}],

where

S_{0}

is an upper triangular matrix with unit diagonal and super diagonal elements equal to 3, and the matrix

P_{0}

is constructed as proposed in [29],

\begin{matrix} P_{0} & = & H_{2} Σ H_{1}, \\ H_{1} & = & I_{n} - 2 u u^{T} / n, H_{2} = I_{n} - 2 v v^{T} / n, \\ u & = & {[1, 1, 1, \dots, 1]}^{T}, v = {[1, - 1, 1, \dots, {(- 1)}^{n - 1}]}^{T}, \\ Σ & = & diag (1, σ, σ^{2}, \dots, σ^{n - 1}), \end{matrix}

where

H_{1}

and

H_{2}

are elementary reflections that are orthogonal and symmetric matrices [30]. The condition number of

P_{0}

with respect to the inversion is controlled by the variable σ and is equal to

σ^{n - 1}

. In the given case, σ is taken equal to

1.2

, and

cond (P_{0}) = 31.9480

. The minimum singular value of the matrix M satisfies

1 / σ_{min (M)} = 2784.9,

which means that the perturbations of Q and R can be several orders of magnitude larger than the perturbations of A. The perturbation of A is chosen as

δ A = 10^{- c} \cdot A_{0}

, where c is a positive number and

A_{0}

is a matrix with random entries generated by the MATLAB^® function rand.

Several results related to the perturbation problem under consideration for 30 values of c between 13 and 5 are given in Figure 3, Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8. In Figure 3, we display the perturbations of the particular entry

Q_{15, 10}

, which is an element of the matrix

Q_{1}

. The quantities

B (δ Q^{l i n})

and

B (δ Q^{n o n l})

are the normwise linear and nonlinear bounds derived in [17,23].

These bounds are more than 12-times larger than the norms of the linear

δ Q^{l i n}

and nonlinear

δ Q^{n o n l}

componentwise bounds obtained in Section 3. The nonlinear bound is close to the linear one for perturbations of different sizes and increases gradually in the vicinity of the quantity

{∥ δ A ∥}_{F} \leq 6.20078 \times 10^{- 7}

. For perturbations of a larger size, the iterations for

x^{n o n l}

do not converge. In Figure 4, we compare the exact perturbation

δ Q_{15, 16}

of the entry

Q_{15, 16}

(which is also the element

{(δ Q_{2})}_{15, 1}

of

δ Q_{2}

) with the linear approximation

δ Q_{15, 16}^{a p p r}

. Both quantities are close for all perturbations. This is confirmed by the values of the errors

∥ o r t h_{1} (X_{1}^{a p p r}, X_{2}^{a p p r}) ∥_{F}, ∥ o r t h_{2} (X_{1}^{a p p r}, X_{2}^{a p p r}) ∥_{F}, {∥ o r t h_{3} ({\tilde{Q}}^{a p p r}) ∥}_{F},

shown in Figure 5, which are much smaller than the value of

{∥ δ Q ∥}_{F}

for all perturbations.

The bounds of the quantity

δ Θ {max}_{15}

(the maximum angle between the perturbed and unperturbed range of A), shown in Figure 6, are close to the exact value of this angle, with the nonlinear bound being slightly greater than the linear one. The normwise linear

B (δ R^{l i n})

and the nonlinear

B (δ R^{n o n l})

bounds obtained in [17,23], are more than 75,000-times greater than the linear

δ R_{55}^{l i n}

and the nonlinear

δ R_{55}^{n o n l}

bounds of the diagonal element

R_{55}

, shown in Figure 7. Similarly, the normwise bounds

B (δ R^{l i n})

and

B (δ R^{n o n l})

are more than 13,000-times greater than the bounds

δ R_{2, 10}^{l i n}

and

δ R_{2, 10}^{n o n l}

as shown in Figure 8. This large difference between the sizes of the actual component perturbations of R and the normwise bounds is explained by the large condition number of the computed R—equal to

1.5353 \times 10^{6}

. (Note that

cond (R) = cond (A)

).

Note that, while the normwise estimates are valid for perturbations with sizes up to

δ^{0} = 9.31420 \times 10^{- 5}

, the iterations to find

x^{n o n l}

converge for perturbations

{∥ δ A ∥}_{F} \leq 6.20078 \times 10^{- 7}

.

The results obtained show that the asymptotic bounds are valid for much larger perturbations then the global bounds.

8. Conclusions

The method presented in the paper allows us to find, in a unified manner, componentwise asymptotic and global perturbation bounds for all elements of the QR decomposition, thus, providing a complete perturbation analysis of this important matrix factorization. The bounds obtained in the paper are smaller than some known bounds and can be significantly better than the normwise bounds.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated during the current study are available from the author on reasonable request.

Acknowledgments

The author is grateful to the reviewers for their remarks and suggestions that helped to improve the paper.

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

$R$	the set of real numbers;
$R^{n \times m}$	the space of $n \times m$ real matrices ( $R^{n} = R^{n \times 1}$ );
$R (A)$	the range of A;
$X^{⊥}$	the orthogonal complement of the subspace $X$ ;
$\| A \|$	the matrix of absolute values of the elements of A;
$A^{T}$	the transposed of A;
$A^{- 1}$	the inverse of A;
$A^{†}$	the pseudoinverse of A;
$a_{j}$	the jth column of A;
$A_{i, 1 : n}$	the ith row of $m \times n$ matrix A;
$A_{i_{1} : i_{2}, j_{1} : j_{2}}$	the part of matrix A from row $i_{1}$ to $i_{2}$ and from column $j_{1}$ to $j_{2}$ ;
$δ A$	perturbation of A;
$0_{m \times n}$	the zero $m \times n$ matrix;
$I_{n}$	the unit $n \times n$ matrix;
$e_{j}$	the jth column of $I_{n}$ ;
$σ_{min} (A)$	the minimum singular value of A; $: =$ , equal by definition;
⪯	relation of partial order. If $a, b \in R^{n}$ , then $a ⪯ b$ means $a_{i} \leq b_{i}, i = 1, 2, \dots, n$ ;
$Low (A)$	the strictly lower triangular part of A;
$Up (A)$	the strictly upper triangular part of A;
${∥ A ∥}_{2}$	the spectral norm of A;
${∥ A ∥}_{F}$	the Frobenius norm of A;
$A \otimes B$	the Kronecker product of A and B;
$vec (A)$	the vec mapping of $A \in R^{n \times m}$ . If A is partitioned columnwise as
$A = [a_{1}, a_{2}, \dots a_{m}]$	then $vec (A) = {[a_{1}^{T}, a_{2}^{T}, \dots, a_{m}^{T}]}^{T}$ ;
$P_{v e c}$	the vec-permutation matrix. $vec (A^{T}) = P_{v e c} vec (A)$ ;
$Θ \max (X, Y)$	the maximum angle between subspaces $X$ and $Y$ ;
$O (∥ δ A ∥_{F}^{2})$	a quantity of second order with respect to ${∥ δ A ∥}_{F}$ .

Appendix A

Theorem A1.

The minimum Frobenius norm solution of the matrix equation

X + X^{T} = Φ, X \in R^{p \times p}, Φ \in R^{p \times p}, Φ^{T} = Φ

(A1)

is given by

X_{m i n} = Φ / 2 .

(A2)

Proof.

Equation (A1) is represented as

(I_{p^{2}} + P_{v e c}) vec (X) = vec (Φ),

(A3)

where

P_{v e c}

is the vec-permutation matrix satisfying

vec (X^{T}) = P_{v e c} vec (X)

. This matrix is symmetric and orthogonal and has

p (p + 1) / 2

eigenvalues equal to 1 and

p (p - 1) / 2

eigenvalues equal to −1 ([27], p. 265). Hence, for some orthogonal U, it may be represented as

P_{v e c} = U diag (I_{p (p + 1 / 2}, - I_{p (p - 1 / 2}) U^{T},

so that

I_{p^{2}} + P_{v e c} = U diag (2 I_{p (p + 1 / 2}, 0_{p (p - 1 / 2}) U^{T} .

The minimum 2-norm solution of (A3), corresponding to the minimum Frobenius solution of (A1), is given by

vec (X_{m i n}) = {(I_{p^{2}} + P_{v e c})}^{†} vec (Φ),

where

{(I_{p^{2}} + P_{v e c})}^{†} = U diag (I_{p (p + 1 / 2} / 2, 0_{p (p - 1 / 2}) U^{T} .

Thus,

{(I_{p^{2}} + P_{v e c})}^{†} = (I_{p^{2}} + P_{v e c}) / 4

and

vec (X_{m i n}) = (I_{p^{2}} + P_{v e c}) vec (Φ) / 4 .

Since

P_{v e c} vec (Φ) = vec (Φ^{T}) = vec (Φ),

it follows that

X_{m i n} = (Φ + Φ) / 4 = Φ / 2,

q.e.d. □

References

Stewart, G.W. Matrix Algorithms; Vol. I: Basic Decompositions; SIAM: Philadelphia, PA, USA, 1998; ISBN 0-89871-414-1. [Google Scholar]
Stewart, G.W.; Sun, J.-G. Matrix Perturbation Theory; Academic Press: San Diego, CA, USA, 1990; ISBN 978-0126702309. [Google Scholar]
Bhatia, R. Matrix factorizations and their perturbations. Linear Algebra Appl. 1994, 197, 245–276. [Google Scholar] [CrossRef] [Green Version]
Li, R. Matrix perturbation theory. In Handbook of Linear Algebra, 2nd ed.; Hogben, L., Ed.; CRC Press: Boca Raton, FL, USA, 2014; Chapter 21; pp. 1–20. [Google Scholar]
Higham, N. A survey of componentwise perturbation theory in numerical linear algebra. In Mathematics of Computation 1943–1993: A Half Century of Computational Mathematics; Gautchi, W., Ed.; Amer. Mathematical Society: Providence, RI, USA, 1994; pp. 49–77. ISBN 0-8218-0291-7. [Google Scholar]
Stewart, G.W. Perturbation bounds for the QR factorization of a matrix. SIAM J. Numer. Anal. 1977, 14, 509–518. [Google Scholar] [CrossRef]
Sun, J.-G. Perturbation bounds for the Cholesky and QR factorizations. BIT Numer. Math. 1991, 31, 341–352. [Google Scholar] [CrossRef]
Stewart, G.W. On the perturbation of LU, Cholesky, and QR factorizations. SIAM J. Matrix Anal. Appl. 1993, 14, 1141–1145. [Google Scholar] [CrossRef] [Green Version]
Chang, X.-W.; Paige, C.C.; Stewart, G.W. Perturbation analyses for the QR factorization. SIAM J. Matrix Anal. Appl. 1997, 18, 1328–1340. [Google Scholar] [CrossRef] [Green Version]
Chang, X.-W.; Stehlé, D. Rigorous perturbation bounds of some matrix factorizations. SIAM J. Matrix Anal. Appl. 2010, 31, 2841–2859. [Google Scholar] [CrossRef] [Green Version]
Li, H.; Wei, Y. Improved rigorous perturbation bounds for the LU and QR factorizations. Numer. Linear Algebra Appl. 2015, 22, 1115–1130. [Google Scholar] [CrossRef]
Sun, J.-G. Componentwise perturbation bounds for some matrix decompositions. BIT Numer. Math. 1992, 32, 702–714. [Google Scholar] [CrossRef]
Zha, H.Y. A componentwise perturbation analysis of the QR decomposition. SIAM J. Matrix Anal. Appl. 1995, 14, 1124–1131. [Google Scholar] [CrossRef]
Chang, X.-W.; Paige, C.C. Componentwise perturbation analyses for the QR factorization. Numer. Math. 2001, 88, 319–345. [Google Scholar] [CrossRef]
Chang, X.-W. On the perturbation of the Q-factor of the QR factorization. Numer. Linear Algebra Appl. 2012, 19, 607–619. [Google Scholar] [CrossRef]
Konstantinov, M.M.; Petkov, P.H.; Christov, N.D. Nonlocal perturbation analysis of the Schur system of a matrix. SIAM J. Matrix Anal. Appl. 1994, 15, 383–392. [Google Scholar] [CrossRef]
Konstantinov, M.M.; Petkov, P.H. Perturbation Methods in Matrix Analysis and Control; NOVA Science Publishers, Inc.: New York, NY, USA, 2020; ISBN 978-1-53617-470-0. [Google Scholar]
Chen, X.S. Perturbation bounds for the periodic Schur decomposition. BIT Numer. Math. 2010, 50, 41–58. [Google Scholar] [CrossRef]
Chen, X.S.; Li, W.; Ng, M.K. Perturbation analysis for antitriangular Schur decomposition. SIAM J. Matrix Anal. Appl. 2012, 33, 1328–1340. [Google Scholar] [CrossRef]
Petkov, P. Componentwise perturbation analysis of the Schur decomposition of a matrix. SIAM J. Matrix Anal. Appl. 2021, 42, 108–133. [Google Scholar] [CrossRef]
Sun, J.-G. Perturbation bounds for the generalized Schur decomposition. SIAM J. Matrix Anal. Appl. 1995, 16, 1328–1340. [Google Scholar] [CrossRef]
Zhang, G.; Li, H.; Wei, Y. Componentwise perturbation analysis for the generalized Schur decomposition. Calcolo 2022, 59. [Google Scholar] [CrossRef]
Sun, J.-G. On perturbation bounds for the QR factorization. Linear Algebra Appl. 1995, 215, 95–112. [Google Scholar] [CrossRef] [Green Version]
MATLAB Version 9.9.0.1538559 (R2020b) Update 3; The MathWorks, Inc.: Natick, MA, USA, 2020.
Gohberg, I.; Koltracht, I. Mixed, componentwise, and structured condition numbers. SIAM J. Matrix Anal. Appl. 1993, 14, 688–704. [Google Scholar] [CrossRef]
Björck, Å.; Golub, G. Numerical methods for computing angles between linear subspaces. Math. Comp. 1973, 27, 579–594. [Google Scholar] [CrossRef]
Horn, R.A.; Johnson, C.R. Topics in Matrix Analysis; Cambridge University Press: Cambridge, UK, 1991; ISBN 0-521-30587-X. [Google Scholar]
Horn, R.A.; Johnson, C.R. Matrix Analysis, 2nd ed.; Cambridge University Press: Cambridge, UK, 2013; ISBN 978-0-521-83940-2. [Google Scholar]
Bavely, C.A.; Stewart, G.W. An algorithm for computing reducing subspaces by block diagonalization. SIAM J. Numer. Anal. 1979, 16, 359–367. [Google Scholar] [CrossRef]
Stewart, G.W. Matrix Algorithms; Vol. II: Eigensystems; SIAM: Philadelphia, PA, USA, 2001; ISBN 0-89871-503-2. [Google Scholar]

Figure 1. Perturbation estimates of the column subspaces.

Figure 2. Iterations for determining the global bounds for different perturbations.

Figure 3. Exact values of

δ Q_{15, 10}

and its bounds as functions of the perturbation norm.

Figure 3. Exact values of

δ Q_{15, 10}

and its bounds as functions of the perturbation norm.

Figure 4. Exact values of

δ Q_{15, 16}

and its bounds as functions of the perturbation norm.

Figure 4. Exact values of

δ Q_{15, 16}

and its bounds as functions of the perturbation norm.

Figure 5. The errors

∥ o r t h_{1} (X_{1}^{a p p r}, X_{2}^{a p p r}) ∥_{F}, ∥ o r t h_{2} (X_{1}^{a p p r}, X_{2}^{a p p r}) ∥_{F}, {∥ o r t h_{3} ({\tilde{Q}}^{a p p r}) ∥}_{F}

as functions of the perturbation norm.

Figure 5. The errors

∥ o r t h_{1} (X_{1}^{a p p r}, X_{2}^{a p p r}) ∥_{F}, ∥ o r t h_{2} (X_{1}^{a p p r}, X_{2}^{a p p r}) ∥_{F}, {∥ o r t h_{3} ({\tilde{Q}}^{a p p r}) ∥}_{F}

as functions of the perturbation norm.

Figure 6. Exact values of

δ Θ {max}_{15}

and its bounds as functions of the perturbation norm.

Figure 6. Exact values of

δ Θ {max}_{15}

and its bounds as functions of the perturbation norm.

Figure 7. Exact values of

δ R_{55}

and its bounds as functions of the perturbation norm.

Figure 7. Exact values of

δ R_{55}

and its bounds as functions of the perturbation norm.

Figure 8. Exact values of

δ R_{2, 10}

and its bounds as functions of the perturbation norm.

Figure 8. Exact values of

δ R_{2, 10}

and its bounds as functions of the perturbation norm.

Table 1. Exact basic perturbation parameters and their linear and nonlinear estimates.

${∥ δ A ∥}_{F}$	$x_{ℓ} = q_{i}^{T} δ q_{j}$	$\| x_{ℓ} \|$	$x_{ℓ}^{lin}$	$x_{ℓ}^{nonl}$
1	2	3	4	5
$1.78326 \times 10^{- 10}$	$x_{1} = q_{2}^{T} δ q_{1}$	$6.48563 \times 10^{- 13}$	$7.80510 \times 10^{- 12}$	$7.80510 \times 10^{- 12}$
	$x_{2} = q_{3}^{T} δ q_{1}$	$3.81408 \times 10^{- 12}$	$7.80510 \times 10^{- 12}$	$7.80510 \times 10^{- 12}$
	$x_{3} = q_{4}^{T} δ q_{1}$	$3.12632 \times 10^{- 12}$	$7.80510 \times 10^{- 12}$	$7.80510 \times 10^{- 12}$
	$x_{4} = q_{3}^{T} δ q_{2}$	$6.73721 \times 10^{- 9}$	$2.04508 \times 10^{- 7}$	$2.04508 \times 10^{- 7}$
	$x_{5} = q_{4}^{T} δ q_{2}$	$6.00990 \times 10^{- 8}$	$2.04508 \times 10^{- 7}$	$2.04508 \times 10^{- 7}$
	$x_{6} = q_{4}^{T} δ q_{3}$	$6.70820 \times 10^{- 8}$	$2.28281 \times 10^{- 7}$	$2.28281 \times 10^{- 7}$
$8.91628 \times 10^{- 8}$	$x_{1} = q_{2}^{T} δ q_{1}$	$3.24302 \times 10^{- 10}$	$3.90255 \times 10^{- 9}$	$3.90335 \times 10^{- 9}$
	$x_{2} = q_{3}^{T} δ q_{1}$	$1.90707 \times 10^{- 9}$	$3.90255 \times 10^{- 9}$	$3.90340 \times 10^{- 9}$
	$x_{3} = q_{4}^{T} δ q_{1}$	$1.56317 \times 10^{- 9}$	$3.90255 \times 10^{- 9}$	$3.90340 \times 10^{- 9}$
	$x_{4} = q_{3}^{T} δ q_{2}$	$3.36826 \times 10^{- 6}$	$1.02254 \times 10^{- 4}$	$1.02280 \times 10^{- 4}$
	$x_{5} = q_{4}^{T} δ q_{2}$	$3.00486 \times 10^{- 5}$	$1.02254 \times 10^{- 4}$	$1.02280 \times 10^{- 4}$
	$x_{6} = q_{4}^{T} δ q_{3}$	$3.35398 \times 10^{- 5}$	$1.14140 \times 10^{- 4}$	$1.14193 \times 10^{- 4}$
$5.34977 \times 10^{- 5}$	$x_{1} = q_{2}^{T} δ q_{1}$	$1.94581 \times 10^{- 7}$	$2.34153 \times 10^{- 6}$	$2.75590 \times 10^{- 6}$
	$x_{2} = q_{3}^{T} δ q_{1}$	$1.14424 \times 10^{- 6}$	$2.34153 \times 10^{- 6}$	$2.82650 \times 10^{- 6}$
	$x_{3} = q_{4}^{T} δ q_{1}$	$9.37903 \times 10^{- 7}$	$2.34153 \times 10^{- 6}$	$2.81974 \times 10^{- 6}$
	$x_{4} = q_{3}^{T} δ q_{2}$	$1.99332 \times 10^{- 3}$	$6.13524 \times 10^{- 2}$	$7.59140 \times 10^{- 2}$
	$x_{5} = q_{4}^{T} δ q_{2}$	$1.77825 \times 10^{- 2}$	$6.13524 \times 10^{- 2}$	$7.65532 \times 10^{- 2}$
	$x_{6} = q_{4}^{T} δ q_{3}$	$1.97618 \times 10^{- 2}$	$6.84843 \times 10^{- 2}$	$9.92798 \times 10^{- 2}$

Table 2. Approximation of the diagonal elements of matrix W.

${∥ δ A ∥}_{F}$	$1.78325 \times 10^{- 10}$	$8.91627 \times 10^{- 8}$	$5.34976 \times 10^{- 5}$
$\| α_{1} \|$	$1.67646 \times 10^{- 16}$	$1.74935 \times 10^{- 16}$	$1.11378 \times 10^{- 12}$
$\| α_{2} \|$	$1.98416 \times 10^{- 15}$	$4.57132 \times 10^{- 10}$	$1.60108 \times 10^{- 4}$
$\| α_{3} \|$	$2.33940 \times 10^{- 15}$	$5.68134 \times 10^{- 10}$	$1.98034 \times 10^{- 4}$
$\| α_{1}^{l i n} \|$	$1.23709 \times 10^{- 23}$	$3.09280 \times 10^{- 18}$	$1.11341 \times 10^{- 12}$
$\| α_{2}^{l i n} \|$	$1.82864 \times 10^{- 15}$	$4.57132 \times 10^{- 10}$	$1.60095 \times 10^{- 4}$
$\| α_{3}^{l i n} \|$	$2.27269 \times 10^{- 15}$	$5.68131 \times 10^{- 10}$	$1.97252 \times 10^{- 4}$
$\| α_{1}^{n o n l} \|$	$1.23709 \times 10^{- 23}$	$3.09280 \times 10^{- 18}$	$1.11341 \times 10^{- 12}$
$\| α_{2}^{n o n l} \|$	$1.82864 \times 10^{- 15}$	$4.57132 \times 10^{- 10}$	$1.60108 \times 10^{- 4}$
$\| α_{3}^{n o n l} \|$	$2.27269 \times 10^{- 15}$	$5.68131 \times 10^{- 10}$	$1.97271 \times 10^{- 4}$

Table 3. Exact perturbations of the elements of the matrix

Q_{1}

and their linear and nonlinear estimates,

δ A = 3 \times 10^{- 6} A_{0}, {∥ δ A ∥}_{F} = 5.34977 \times 10^{- 5}

,

B (δ Q^{l i n}) = c_{Q} {∥ δ A ∥}_{F} = 0.13003

, and

B (δ Q^{n o n l}) = 0.14519

.

Table 3. Exact perturbations of the elements of the matrix

Q_{1}

and their linear and nonlinear estimates,

δ A = 3 \times 10^{- 6} A_{0}, {∥ δ A ∥}_{F} = 5.34977 \times 10^{- 5}

,

B (δ Q^{l i n}) = c_{Q} {∥ δ A ∥}_{F} = 0.13003

, and

B (δ Q^{n o n l}) = 0.14519

.

$q_{ij}$	$\| δ q_{ij} \|$	$δ q_{ij}^{lin}$	$δ q_{ij}^{nonl}$
$q_{11}$	$8.24060 \times 10^{- 7}$	$2.46313 \times 10^{- 6}$	$2.94752 \times 10^{- 6}$
$q_{21}$	$5.56921 \times 10^{- 7}$	$3.27407 \times 10^{- 6}$	$3.94135 \times 10^{- 6}$
$q_{31}$	$1.78849 \times 10^{- 7}$	$2.15221 \times 10^{- 6}$	$2.53307 \times 10^{- 6}$
$q_{41}$	$1.09799 \times 10^{- 6}$	$3.07134 \times 10^{- 6}$	$3.68975 \times 10^{- 6}$
$q_{12}$	$5.88076 \times 10^{- 3}$	$4.50959 \times 10^{- 2}$	$5.63774 \times 10^{- 2}$
$q_{22}$	$5.89442 \times 10^{- 3}$	$7.93060 \times 10^{- 2}$	$9.85345 \times 10^{- 2}$
$q_{32}$	$1.47078 \times 10^{- 4}$	$3.46070 \times 10^{- 3}$	$5.35863 \times 10^{- 3}$
$q_{42}$	$1.58388 \times 10^{- 2}$	$7.07534 \times 10^{- 2}$	$8.82920 \times 10^{- 2}$
$q_{13}$	$4.76877 \times 10^{- 3}$	$4.20957 \times 10^{- 2}$	$5.95634 \times 10^{- 2}$
$q_{23}$	$8.37491 \times 10^{- 3}$	$3.98794 \times 10^{- 2}$	$5.85481 \times 10^{- 2}$
$q_{33}$	$2.15468 \times 10^{- 3}$	$5.63927 \times 10^{- 2}$	$7.57743 \times 10^{- 2}$
$q_{43}$	$1.72784 \times 10^{- 2}$	$7.02671 \times 10^{- 2}$	$1.01256 \times 10^{- 1}$

Table 4. Exact perturbations of the maximum subspace angles and their linear and nonlinear estimates.

${∥ δ A ∥}_{F}$	$1.78326 \times 10^{- 10}$	$8.91628 \times 10^{- 8}$	$5.34977 \times 10^{- 5}$
$\| δ Θ {max}_{1} \|$	$4.97410 \times 10^{- 12}$	$2.48709 \times 10^{- 9}$	$1.49225 \times 10^{- 6}$
$\| δ Θ {max}_{2} \|$	$6.04754 \times 10^{- 8}$	$3.02368 \times 10^{- 5}$	$1.78948 \times 10^{- 2}$
$\| δ Θ {max}_{3} \|$	$9.00660 \times 10^{- 8}$	$4.50315 \times 10^{- 5}$	$2.65878 \times 10^{- 2}$
$δ Θ {max}_{1}^{l i n}$	$7.80510 \times 10^{- 12}$	$3.90255 \times 10^{- 9}$	$2.34153 \times 10^{- 6}$
$δ Θ {max}_{2}^{l i n}$	$2.04508 \times 10^{- 7}$	$1.02254 \times 10^{- 4}$	$6.13524 \times 10^{- 2}$
$δ Θ {max}_{3}^{l i n}$	$3.06490 \times 10^{- 7}$	$1.53245 \times 10^{- 4}$	$9.19468 \times 10^{- 2}$
$δ Θ {max}_{1}^{n o n l}$	$1.35188 \times 10^{- 11}$	$6.76085 \times 10^{- 9}$	$4.85129 \times 10^{- 6}$
$δ Θ {max}_{2}^{n o n l}$	$2.89218 \times 10^{- 7}$	$1.44645 \times 10^{- 4}$	$1.08022 \times 10^{- 1}$
$δ Θ {max}_{3}^{n o n l}$	$3.06490 \times 10^{- 7}$	$1.53301 \times 10^{- 4}$	$1.25698 \times 10^{- 1}$

Table 5. Exact perturbations of the diagonal elements of R and their linear and nonlinear bounds.

${∥ δ A ∥}_{F}$	$1.78326 \times 10^{- 10}$	$8.91628 \times 10^{- 8}$	$5.34977 \times 10^{- 5}$
$\| δ r_{11} \|$	$9.19442 \times 10^{- 12}$	$4.59573 \times 10^{- 9}$	$2.75746 \times 10^{- 6}$
$\| δ r_{22} \|$	$4.20811 \times 10^{- 11}$	$2.10408 \times 10^{- 8}$	$1.27735 \times 10^{- 5}$
$\| δ r_{33} \|$	$1.51994 \times 10^{- 8}$	$7.60002 \times 10^{- 6}$	$4.88606 \times 10^{- 3}$
$δ r_{11}^{l i n}$	$1.78326 \times 10^{- 10}$	$8.91628 \times 10^{- 8}$	$5.34977 \times 10^{- 5}$
$δ r_{22}^{l i n}$	$1.87973 \times 10^{- 10}$	$9.39863 \times 10^{- 8}$	$5.63918 \times 10^{- 5}$
$δ r_{33}^{l i n}$	$4.56562 \times 10^{- 7}$	$2.28281 \times 10^{- 4}$	$1.36969 \times 10^{- 1}$
$δ r_{11}^{n o n l}$	$1.78618 \times 10^{- 10}$	$1.62255 \times 10^{- 7}$	$4.80568 \times 10^{- 2}$
$δ r_{22}^{n o n l}$	$1.88265 \times 10^{- 10}$	$1.67069 \times 10^{- 7}$	$4.80543 \times 10^{- 2}$
$δ r_{33}^{n o n l}$	$4.56562 \times 10^{- 7}$	$2.28330 \times 10^{- 4}$	$1.69291 \times 10^{- 1}$
$B (δ R^{l i n})$	$1.44561 \times 10^{- 5}$	$7.22804 \times 10^{- 3}$	$4.33683 \times 10^{0}$
$B (δ R^{n o n l})$	$1.44561 \times 10^{- 5}$	$7.22915 \times 10^{- 3}$	$4.84251 \times 10^{0}$

Table 6. Exact perturbations of the super diagonal elements of R and their linear and nonlinear bounds.

${∥ δ A ∥}_{F}$	$1.78326 \times 10^{- 10}$	$8.91628 \times 10^{- 8}$	$5.34977 \times 10^{- 5}$
$\| δ r_{12} \|$	$6.56506 \times 10^{- 11}$	$3.28263 \times 10^{- 8}$	$1.96958 \times 10^{- 5}$
$\| δ r_{13} \|$	$2.19309 \times 10^{- 11}$	$1.09686 \times 10^{- 8}$	$6.58120 \times 10^{- 6}$
$\| δ r_{23} \|$	$1.34417 \times 10^{- 8}$	$6.72117 \times 10^{- 6}$	$4.33437 \times 10^{- 3}$
$δ r_{12}^{l i n}$	$1.78326 \times 10^{- 10}$	$8.91628 \times 10^{- 8}$	$5.34977 \times 10^{- 5}$
$δ r_{13}^{l i n}$	$1.79853 \times 10^{- 10}$	$8.99267 \times 10^{- 8}$	$5.39560 \times 10^{- 5}$
$δ r_{23}^{l i n}$	$4.09016 \times 10^{- 7}$	$2.04508 \times 10^{- 4}$	$1.22705 \times 10^{- 1}$
$δ r_{12}^{n o n l}$	$1.78326 \times 10^{- 10}$	$8.91628 \times 10^{- 8}$	$5.34981 \times 10^{- 5}$
$δ r_{13}^{n o n l}$	$1.79853 \times 10^{- 10}$	$8.99301 \times 10^{- 8}$	$5.58512 \times 10^{- 5}$
$δ r_{23}^{n o n l}$	$4.09016 \times 10^{- 7}$	$2.04555 \times 10^{- 4}$	$1.48774 \times 10^{- 1}$
$B (δ R^{l i n})$	$1.44561 \times 10^{- 5}$	$7.22804 \times 10^{- 3}$	$4.33683 \times 10^{0}$
$B (δ R^{n o n l})$	$1.44561 \times 10^{- 5}$	$7.22915 \times 10^{- 3}$	$4.84251 \times 10^{0}$

Table 7. Quantities related to the approximation of

δ Q_{2}

.

Table 7. Quantities related to the approximation of

δ Q_{2}

.

${∥ δ A ∥}_{F}$	$1.7832554500 \times 10^{- 10}$	$8.9162772500 \times 10^{- 8}$	$5.3497663500 \times 10^{- 5}$
$∥ X_{1} ∥_{F}$	$9.0065954775 \times 10^{- 8}$	$4.5031489846 \times 10^{- 5}$	$2.6584712300 \times 10^{- 2}$
$∥ X_{2} ∥_{F}$	$4.1725109797 \times 10^{- 15}$	$1.0139176039 \times 10^{- 9}$	$3.5343592252 \times 10^{- 4}$
$e r r_{1}$	$1.0034590138 \times 10^{- 16}$	$1.2139643751 \times 10^{- 16}$	$1.0775666870 \times 10^{- 16}$
$e r r_{2}$	$2.3314574995 \times 10^{- 16}$	$1.2904023661 \times 10^{- 16}$	$5.7370600309 \times 10^{- 19}$
$∥ X_{1}^{T} X_{1} ∥_{F}$	$8.1118762095 \times 10^{- 15}$	$2.0278350777 \times 10^{- 9}$	$7.0674692808 \times 10^{- 4}$
$∥ X_{2}^{T} X_{2} ∥_{F}$	$1.7409847876 \times 10^{- 29}$	$1.0280289075 \times 10^{- 18}$	$1.2491695133 \times 10^{- 7}$
$∥ X_{1}^{a p p r} ∥_{F}$	$9.0065954782 \times 10^{- 8}$	$4.5031489891 \times 10^{- 5}$	$2.6594111615 \times 10^{- 2}$
$∥ X_{2}^{a p p r} ∥_{F}$	$4.0559381054 \times 10^{- 15}$	$1.0139175409 \times 10^{- 9}$	$3.5362338628 \times 10^{- 4}$
$e r r_{3}$	$3.7480684521 \times 10^{- 22}$	$4.5658214975 \times 10^{- 14}$	$9.4009759870 \times 10^{- 6}$
$e r r_{4}$	$1.6450633915 \times 10^{- 29}$	$1.0280287798 \times 10^{- 18}$	$1.2504949933 \times 10^{- 7}$
$∥ δ Q_{2} ∥_{F}$	$9.0065954775 \times 10^{- 8}$	$4.5031489857 \times 10^{- 5}$	$2.6587061610 \times 10^{- 2}$
$∥ δ Q_{2}^{a p p r} ∥_{F}$	$9.0065954782 \times 10^{- 8}$	$4.5031489903 \times 10^{- 5}$	$2.6596462586 \times 10^{- 2}$
$e r r_{5}$	$3.1278945183 \times 10^{- 16}$	$7.5350125919 \times 10^{- 16}$	$7.8338852224 \times 10^{- 16}$
$e r r_{6}$	$3.4777636565 \times 10^{- 16}$	$6.4524550180 \times 10^{- 14}$	$1.3295575820 \times 10^{- 5}$
$e r r_{1} = {∥ o r t h_{1} (X_{1}, X_{2}) ∥}_{F}$ ,		$e r r_{2} = {∥ o r t h_{2} (X_{1}, X_{2}) ∥}_{F}$ ,
$e r r_{3} = {∥ o r t h_{1} (X_{1}^{a p p r}, X_{2}^{a p p r}) ∥}_{F}$ ,		$e r r_{4} = {∥ o r t h_{2} (X_{1}^{a p p r}, X_{2}^{a p p r}) ∥}_{F}$ ,
$e r r_{5} = {∥ o r t h_{3} (\tilde{Q}) ∥}_{F}$ ,		$e r r_{6} = {∥ o r t h_{3} ({\tilde{Q}}^{a p p r}) ∥}_{F}$

Table 8. Approximated perturbations of the elements of

Q_{2}

and their approximations.

Table 8. Approximated perturbations of the elements of

Q_{2}

and their approximations.

${∥ δ A ∥}_{F}$	$q_{ij}$	$\| δ q_{ij} \|$	$\| δ q_{ij}^{appr} \|$
$1.7832554500 \times 10^{- 10}$	$q_{14}$	$4.9044000886 \times 10^{- 8}$	$4.9044000819 \times 10^{- 8}$
	$q_{24}$	$5.0733955344 \times 10^{- 8}$	$5.0733955468 \times 10^{- 8}$
	$q_{34}$	$5.5238446041 \times 10^{- 8}$	$5.5238446008 \times 10^{- 8}$
	$q_{44}$	$9.0189822446 \times 10^{- 9}$	$9.0189821929 \times 10^{- 8}$
$8.9162772501 \times 10^{- 8}$	$q_{14}$	$2.4521479462 \times 10^{- 5}$	$2.4521479487 \times 10^{- 5}$
	$q_{24}$	$2.5365705574 \times 10^{- 5}$	$2.5365705600 \times 10^{- 5}$
	$q_{34}$	$2.7618311270 \times 10^{- 5}$	$2.7618311298 \times 10^{- 5}$
	$q_{44}$	$4.5102092035 \times 10^{- 6}$	$4.5102092081 \times 10^{- 5}$
$5.3497663500 \times 10^{- 5}$	$q_{14}$	$1.4577251477 \times 10^{- 2}$	$1.4582423281 \times 10^{- 2}$
	$q_{24}$	$1.4823481491 \times 10^{- 2}$	$1.4828695707 \times 10^{- 2}$
	$q_{34}$	$1.6304869299 \times 10^{- 2}$	$1.6310634063 \times 10^{- 2}$
	$q_{44}$	$2.9649988293 \times 10^{- 3}$	$2.9661007106 \times 10^{- 3}$

Table 9. Convergence of the global bounds.

k	${∥ δ A ∥}_{F}$	${∥ x ∥}_{2}$	Number of Iterations	$∥ x^{nonl} ∥_{2}$
$- 11$	$1.78326 \times 10^{- 10}$	$9.03176 \times 10^{- 8}$	4	$3.68455 \times 10^{- 7}$
$- 10$	$1.78326 \times 10^{- 9}$	$9.03170 \times 10^{- 7}$	4	$3.68458 \times 10^{- 6}$
$- 9$	$1.78326 \times 10^{- 8}$	$9.03165 \times 10^{- 6}$	5	$3.68480 \times 10^{- 5}$
$- 8$	$1.78326 \times 10^{- 7}$	$9.03122 \times 10^{- 5}$	6	$3.68699 \times 10^{- 4}$
$- 7$	$1.78326 \times 10^{- 6}$	$9.02688 \times 10^{- 4}$	9	$3.70916 \times 10^{- 3}$
$- 6$	$1.78326 \times 10^{- 5}$	$8.98346 \times 10^{- 3}$	17	$3.96070 \times 10^{- 2}$
$- 5$	$1.78326 \times 10^{- 4}$	$8.54366 \times 10^{- 2}$	No convergence	-

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Petkov, P.H. Componentwise Perturbation Analysis of the QR Decomposition of a Matrix. Mathematics 2022, 10, 4687. https://doi.org/10.3390/math10244687

AMA Style

Petkov PH. Componentwise Perturbation Analysis of the QR Decomposition of a Matrix. Mathematics. 2022; 10(24):4687. https://doi.org/10.3390/math10244687

Chicago/Turabian Style

Petkov, Petko H. 2022. "Componentwise Perturbation Analysis of the QR Decomposition of a Matrix" Mathematics 10, no. 24: 4687. https://doi.org/10.3390/math10244687

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Componentwise Perturbation Analysis of the QR Decomposition of a Matrix

Abstract

1. Introduction

2. Bounding the Basic Perturbation Parameters

3. Bounding the Perturbations of the Matrix $Q_{1}$

3.1. Normwise Bounds

3.2. Componentwise Bounds

4. Estimating Column Subspace Sensitivity

5. Perturbation Bounds of the Elements of R

5.1. Sensitivity Estimates of the Diagonal Elements of R

5.2. Sensitivity Estimates of the Super Diagonal Elements of R

6. Determining Global Perturbation Bounds

6.1. Perturbation Bounds of the Columns of $Q_{2}$

6.2. Iterative Procedure for Finding Global Bounds of the Elements of x

6.3. Global Perturbation Bounds of $Q_{1}$ , Column Subspaces and R

7. Comparison with Other Bounds

8. Conclusions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Componentwise Perturbation Analysis of the QR Decomposition of a Matrix

Abstract

1. Introduction

2. Bounding the Basic Perturbation Parameters

3. Bounding the Perturbations of the Matrix Q 1

3.1. Normwise Bounds

3.2. Componentwise Bounds

4. Estimating Column Subspace Sensitivity

5. Perturbation Bounds of the Elements of R

5.1. Sensitivity Estimates of the Diagonal Elements of R

5.2. Sensitivity Estimates of the Super Diagonal Elements of R

6. Determining Global Perturbation Bounds

6.1. Perturbation Bounds of the Columns of Q 2

6.2. Iterative Procedure for Finding Global Bounds of the Elements of x

6.3. Global Perturbation Bounds of Q 1 , Column Subspaces and R

7. Comparison with Other Bounds

8. Conclusions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3. Bounding the Perturbations of the Matrix $Q_{1}$

6.1. Perturbation Bounds of the Columns of $Q_{2}$

6.3. Global Perturbation Bounds of $Q_{1}$ , Column Subspaces and R