Next Article in Journal
Fixed-Time Distributed Optimization for Multi-Agent Systems with Input Delays and External Disturbances
Next Article in Special Issue
Schur Complement-Based Infinity Norm Bound for the Inverse of Dashnic-Zusmanovich Type Matrices
Previous Article in Journal
Modeling Multiple-Event Catastrophe Bond Prices Involving the Trigger Event Correlation, Interest, and Inflation Rates
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Componentwise Perturbation Analysis of the QR Decomposition of a Matrix

Department of Engineering Sciences, Bulgarian Academy of Sciences, 1040 Sofia, Bulgaria
Mathematics 2022, 10(24), 4687; https://doi.org/10.3390/math10244687
Submission received: 17 November 2022 / Revised: 4 December 2022 / Accepted: 6 December 2022 / Published: 10 December 2022
(This article belongs to the Special Issue Numerical Analysis and Matrix Computations: Theory and Applications)

Abstract

:
The paper presents a rigorous perturbation analysis of the QR decomposition A = Q R of an n × m matrix A using the method of splitting operators. New asymptotic componentwise perturbation bounds are derived for the elements of Q and R and the subspaces spanned by the first p m columns of A. The new bounds are less conservative than the known bounds and are significantly better than the normwise bounds. An iterative scheme is proposed to determine global componentwise bounds in the case of perturbations for which such bounds are valid. Several numerical results are given that illustrate the analysis and the quality of the bounds obtained.

1. Introduction

The QR decomposition of a matrix A R n × m with n m as the factorization
A : = Q R 0 ,
where Q R n × n is an orthogonal matrix and R R m × m is the upper triangular matrix. The matrices Q and R are referred to as the Q-factor and the R-factor, respectively. Further on, we shall assume that the matrix A has rank m, i.e., it has full column rank. In such a case, the matrix R is nonsingular, and the matrix Q can be represented as
Q = [ Q 1 , Q 2 ] , Q 1 R n × m , Q 2 R n × ( n m ) ,
where R ( Q 1 ) = R ( A ) and the columns of Q 2 form an orthonormal basis for the complementary subspace R ( A ) ([1], Ch. 1). Thus,
A = Q 1 R .
The representation (2) is frequently called QR factorization of A, and it is unique up to the signs of the diagonal elements of R. The matrix Q 2 is not unique but has to obey the orthogonality condition
Q T Q = Q 1 T Q 1 Q 1 T Q 2 Q 2 T Q 1 Q 2 T Q 2 = I m 0 0 I n m .
In practice, the matrix A is subject to perturbations of different kinds (model inconsistencies, measurement and rounding errors), which leads to the necessity of investigating the sensitivity of the different elements of the QR decomposition to perturbations in the data, i.e., to perform a perturbation analysis of the decomposition [2]. Further on, we assume that the matrix A is subject to an additive perturbation δ A R n × m and that there exist another pair of matrix Q ˜ and upper triangular matrix R ˜ such that
A ˜ = Q ˜ R ˜ 0 , A ˜ = A + δ A .
The purpose of the perturbation analysis of the QR decomposition is to find bounds on the sizes of δ Q = Q ˜ Q and δ R = R ˜ R as functions of the size of δ A for sufficiently small perturbations of A [3,4]. Due to the non-uniqueness of the matrix Q 2 , its perturbation is also non-unique. Thus, in the perturbation analysis, one usually considers only the perturbations of the matrix Q 1 , which are uniquely defined by the perturbations of A. However, in the analysis, we shall need to use an arbitrary matrix Q 2 that satisfies the orthogonality condition (3).
The sizes of the perturbations δ A , δ Q 1 and δ R in the QR factorization are measured by using some of the matrix norms, and, in this case, we call the respective analysis normwise perturbation analysis. Sometimes, however, we are interested in the size of perturbations in individual elements of δ Q 1 and δ R , and, in such a case, the analysis is called componentwise perturbation analysis [5]. In the cases when the estimated vector or matrix has components that differ greatly in size, the normwise estimate does not produce reliable results, and it is preferable to use the componentwise perturbation analysis.
The perturbation analysis of the QR decomposition was performed for the first time by Stewart [6], and improved results were presented by Sun [7] and Stewart [8]. Using a different approach, Chang, Paige and Stewart [9] gave new asymptotic perturbation bounds for the R-factor. Additional improvements of the normwise perturbation bounds of the QR-decomposition were proposed by Chang and Stehlé [10] and Li and Wei [11]. Different componentwise estimates of the perturbations of the Q-factor and the R-factor were derived by Sun [12], Zha [13], Chang and Paige [14] and Chang [15].
A general approach, based on the use of the so-called splitting operators, which can be used in the perturbation analysis of several unitary decompositions, was proposed in [16]; for details, see [17]. The method of the splitting operators can be used to determine normwise as well as componentwise perturbation bounds of different unitary decompositions; see [18,19,20,21,22]. This method was implemented by Sun [23], who obtained improved normwise perturbation bounds of the QR decomposition.
This paper presents a rigorous componentwise perturbation analysis of the QR decomposition based on the method of splitting operators. The analysis presented aims at finding normwise and componentwise perturbation bounds for infinitely small perturbations (asymptotic bounds) as well as for finite perturbations (global bounds). The main result is the obtaining of new asymptotic componentwise perturbation bounds that produce less conservative estimates of the QR decomposition perturbations. A particular case of these bounds is the asymptotic normwise bounds of the QR decomposition derived previously.
This is demonstrated by an example that the new componentwise perturbation bounds of the R factor can be several orders of magnitude smaller than the normwise perturbation bound of this factor. An iterative scheme is proposed to determine global componentwise bounds in the case of perturbations for which such bounds exist. The analysis conducted in this paper is unified with the perturbation analysis of the Schur decomposition presented in [20] and can be easily extended to the case of complex matrices.
In Section 2, we introduce the basic scheme of the perturbation analysis. Section 3 is devoted to determining normwise and componentwise perturbation bounds of the matrix Q 1 . In Section 4, we present estimates for the perturbations of the column subspaces of A, and, in Section 5, we derive bounds of the elements of R. An iterative scheme for finding global componentwise perturbation bounds of the QR decomposition is proposed in Section 6. A comparison with some of the known methods for perturbation analysis of the QR decomposition is performed in Section 7, and our conclusions are made in Section 8.
The numerical results presented in the paper were obtained with MATLAB® R2020b [24] using IEEE double precision arithmetic with roundoff unit u 1.11 × 10 16 .

2. Bounding the Basic Perturbation Parameters

Let
Q : = [ q 1 , q 2 , , q n ] , q j R n
and the unperturbed and perturbed matrices of the orthogonal factor of the QR decomposition be
Q : = [ q 1 , q 2 , , q n ] , Q ˜ : = [ q ˜ 1 , q ˜ 2 , , q ˜ n ] , q ˜ j : = q j + δ q j , j = 1 , 2 , n ,
respectively. Define the perturbation matrix
δ Q 1 : = [ δ q 1 , δ q 2 , , δ q m ] , δ q j R n .
It follows from (1) and (4) that
δ q i T a j = q ˜ i T δ a j = 0 , 1 j m , j < i n .
The column a j can be obtained from the QR factorization (2) as
a j = k = 1 j r k j q k , 1 j m .
Substituting (6) in (5) yields
k = 1 j r k j δ q i T q k = q ˜ i T δ a j .
Since Q ˜ T Q ˜ = I n , it follows that
Q T δ Q = δ Q T Q δ Q T δ Q
and
δ q i T q j = q i T δ q j δ q i T δ q j , 1 j m , j < i n .
Using (8), Equation (7) can be written as
k = 1 j r k j q i T δ q k + k = 1 j r k j δ q i T δ q k = q ˜ i T δ a j .
Equation (9) represents a system of
ν = n ( n 1 ) / 2 m ( m 1 ) / 2 = m ( 2 n m 1 ) / 2
nonlinear algebraic equations for the ν unknown quantities
x : = q i T δ q j , = i + ( j 1 ) n j ( j + 1 ) 2 , 1 j m , j < i n .
These quantities, which we call basic perturbation parameters, are elements of the strict lower part of the matrix δ W = Q T δ Q 1 . More precisely, one has that
x = vec ( Low ( δ W ) ) ,
or, equivalently,
x = Ω vec ( δ W ) ,
where
Ω : = diag ( ω 1 , ω 2 , , ω m ) R ν × n m , ω k : = 0 ( n k ) × k , I n k R ( n k ) × n , k = 1 , 2 , , m , Ω T Ω = I ν , Ω 2 = 1 .
Define the lower triangular matrix
M : = Ω ( R T I m ) Ω T R ν × ν
whose elements are determined entirely from the elements of R. It can be shown that
k = i n t i k q k T δ q j = M x .
The matrix M has the form
M = r 11 0 0 0 0 0 0 0 r 11 0 0 0 0 0 0 0 r 11 0 0 0 0 0 r 12 0 r 22 0 0 0 0 0 0 0 r 22 0 0 0 0 r 12 0 0 r 22 0 0 0 r 1 , m 0 0 r 2 , m r m m ,
which shows that this matrix is nonsingular if the diagonal elements of R are nonzero. The matrix M is called the perturbation operator matrix.
From (9), we obtain that
M x = f Δ x
where
f = vec ( Low ( F ) ) = Ω vec ( F ) R ν , F = Q ˜ T δ A
and the vector Δ x R ν has components
Δ x = k = 1 j r k j δ q i T δ q k , = i + ( j 1 ) n j ( j + 1 ) 2 , 1 j m , j < i n .
containing second-order terms in the perturbations δ q i , i = 1 , 2 , , n .
An asymptotic (linear) approximation of x is obtained from (10) neglecting the second-order term Δ x ,
x = M 1 f .
The norm of this approximation obeys
x 2 M 1 2 f 2 ,
which shows that the size of the linear bound of x 2 depends on 1 / σ min ( M ) = M 1 2 . As shown by Sun [23],
M 1 2 A 2 .
Since
f 2 δ A F ,
one obtains the asymptotic normwise bound
x 2 M 1 2 δ A F .
Since the matrix M is lower triangular, it is usually inverted with high precision. Using (12), one can obtain asymptotic componentwise bounds on the perturbation vector x. Since
x = M , 1 : ν 1 f , = 1 , 2 , ν ,
it follows that
| x | M , 1 : ν 1 2 f 2 , = 1 , 2 , , ν
and using the inequality f 2 δ A F , one obtains the asymptotic bound
| x | x l i n : = M , 1 : ν 1 2 δ A F .
The quantity cond ( x ) = M , 1 : ν 1 2 can be considered as a componentwise condition number [25] of the element x .
Example 1.
Consider the 4 × 3 matrix
A = 18 6 18 6 2 8 9 3.001 7 9 3 10
and assume that it is perturbed by
δ A = c · 10 k · A 0 , A 0 = 7 4 1 4 2 9 1 6 5 8 4 3 ,
where c and k are varying parameters. The QR decompositions of matrices A and A + δ A are computed by the function qr of MATLAB®. In the given case, the perturbation operator matrix M is of order ν = 6 and M 1 2 = 1.71871 × 10 3 .
The exact absolute values of the elements of the vector x and their linear approximations computed according to (12) for three perturbations δ A = 10 11 A 0 , 5 × 10 9 A 0 , and 3 × 10 6 of different size, are given to five decimal digits in the third and fourth columns of Table 1, respectively. It is seen that the elements of the linear estimate x l i n closely follow the corresponding elements of the exact perturbation vector | x | .

3. Bounding the Perturbations of the Matrix Q 1

Consider the matrix
δ W = Q T δ Q 1 : = [ δ w 1 , δ w 2 , , δ w m ] , δ w j R n .
The strictly lower part of this matrix contains elements of the form
q i T δ q j , 1 j m , j < i n ,
which can be substituted by the corresponding elements x , = i + ( j 1 ) n j ( j + 1 ) 2 of the vector x. The elements of the strictly upper part of δ W are of the form
q i T δ q j , 1 i < j m ,
which, according to the orthogonality condition (8), can be represented as
q i T δ q j = q j T δ q i δ q i T δ q j .
In this way, the matrix δ W can be written as
δ W = δ V + δ D δ Y ,
where the matrix
δ V = 0 x 1 x 2 x m 1 x 1 0 x n x n + m 3 x 2 x n 0 x 2 n + m 6 x m 1 x n + m 3 x 2 n + m 6 0 x n 1 x 2 n 3 x 3 n 6 x ν : = [ δ v 1 , δ v 2 , , δ v m ] , v j R n
has elements depending only on the basic perturbation parameters,
δ D = q 1 T δ q 1 0 0 0 q 2 T δ q 2 0 0 0 q m T δ q m 0 0 0 0 0 0 R n × m ,
and the matrix
δ Y = 0 δ q 1 T δ q 2 δ q 1 T δ q 3 δ q 1 T δ q m 0 0 δ q 2 T δ q 3 δ q 2 T δ q m 0 0 0 δ q 3 T δ q m 0 0 0 δ q m 1 T δ q m 0 0 0 0 0 0 0 0 R n × m ,
contains second-order terms in δ q j , j = 1 , 2 , , m .
Consider how to determine the diagonal elements of the matrix W (the nontrivial elements of D) from the elements of x. Denote that α j = δ q j T q j . According to (8), one has that
2 δ q j T q j = δ q j T δ q j , 1 j m ,
or
2 α j = δ q j 2 .
The above expression shows that α is always nonnegative.
On the other hand, we have that
δ w j = δ v j + 0 α j 0 j , j = 1 , 2 , , m
so that
δ w j 2 2 = δ v j 2 2 + α j 2 .
From
δ w j = Q T δ q j ,
it follows that
δ w j 2 = δ q j 2 = 2 α j .
From (17) and (18), we obtain the quadratic equation
α j 2 + 2 α j + δ v j 2 2 = 0 .
The negative solution of this equation is
α j n o n l = δ v j 2 2 / ( 1 + 1 δ v j 2 2 ) , j = 1 , 2 , , m .
For a small perturbation δ A (small values of δ v j 2 ), one has the estimate
α j l i n = δ v j 2 2 / 2 .
Thus, for small perturbations, the quantities | α j l i n | , j = 1 , 2 , , m depend quadratically on δ A F .
In Table 2, for the same matrix and perturbations that are given in Example 1, we give the exact values of α j and their linear α j l i n and nonlinear α j n o n l estimates computed using the exact vectors x.
    Thus, having the linear approximations of the elements of x, one can compute the linear approximations of the matrices δ V and δ D . According to (16), the sum δ V + δ D is the linear approximation of δ W , and δ Y contains second-order terms in δ A F that can be neglected in the asymptotic analysis. As shown below, the determining of an estimate of δ W allows one to find a bound on δ Q 1 .

3.1. Normwise Bounds

The estimate of x l i n 2 can be used to find an asymptotic normwise bound of δ Q 1 F . In determining condition numbers, one assumes δ A F 0 , so that δ W F δ V F . From Equation (16), it follows that the Frobenius norm of the strictly upper triangular part Up ( δ V ) of the matrix δ V is less than (if m < n ) or equal (if m = n ) to the norm of the strictly lower part Low ( δ V ) . Since Low ( δ V ) F = x l i n 2 , we have that δ W F 2 x l i n 2 , and the change of the matrix Q 1 obeys
δ Q 1 F = Q T δ Q 1 F 2 x l i n 2 c Q δ A F ,
where c Q δ A F is an asymptotic normwise bound on δ Q 1 F and
c Q : = 2 M 1 2
can be considered as a normwise condition number of the matrix Q 1 with respect to the perturbations of A.
Since, in first-order approximation, it is fulfilled that
δ R = δ Q T A + Q T δ A ,
considering (21), one obtains that
δ R F c R δ A F ,
where  
c R = 1 + 2 2 M 1 2 A F
is the normwise condition number of the matrix R with respect to the perturbation δ A .
The asymptotic normwise estimates of δ Q and δ R thus obtained coincide with the corresponding estimates derived in [17,23].

3.2. Componentwise Bounds

The componentwise bounds of the elements of the matrix δ Q 1 can be found by using the componentwise estimates of the elements of x. An asymptotic bound on the matrix | δ W = Q T δ Q 1 | is given by
| δ W l i n | = | δ V | = | α 1 l i n | | x 1 l i n | | x 2 l i n | | x m 1 l i n | | x 1 l i n | | α 2 l i n | | x n l i n | | x n + m 3 l i n | | x 2 l i n | | x n l i n | | α 3 l i n | | x 2 n + m 6 l i n | | x m 1 l i n | | x n + m 3 l i n | | x 2 n + m 6 l i n | | α m l i n | | x n 1 l i n | | x 2 n 3 l i n | | x 3 n 6 l i n | | x ν l i n | R n × m .
Considering that δ Q 1 = Q δ W and using (16), a linear approximation of the perturbation | δ Q 1 | is determined as
| δ Q 1 | δ Q 1 l i n = | Q | | δ W l i n | .
This equation gives asymptotic bounds of the perturbations in the individual elements q i j , i.e., componentwise perturbation bounds of the matrix Q 1 . Since | Q | F = Q F = n , we have that
δ Q 1 l i n F n δ W l i n F ,
i.e., the obtaining of the asymptotic componentwise estimate δ Q 1 l i n through (23) may increase the bounds on | δ q i j | at most n times.
In Table 3, we give, for the same QR decomposition as the one presented in Example  1, the exact values of | δ q i j | and their linear approximations δ q i j l i n for δ A = 3 × 10 6 A 0 . The comparison of the componentwise bounds with the normwise linear bound B ( δ Q l i n ) = c Q δ A F shows that the bounds on the individual elements of δ Q 1 are smaller than B ( δ Q l i n ) for all j m , j < i n . The difference between the componentwise and normwise bounds is particularly significant for the elements in the first column of δ Q 1 whose absolute values are of order 10 7 , while the normwise bound is of order 10 1 .

4. Estimating Column Subspace Sensitivity

The determination of bounds on the elements of the matrix δ Q 1 makes it possible to estimate the sensitivity of the column subspaces X p = R ( [ a 1 , a 2 , , a p ] ) , p = 1 , 2 , , m . (Note that, for p = m , the corresponding column subspace X m coincides with the range R ( A ) of A.) Since we assume that R is of full rank, we have that R ( [ a 1 , a 2 , , a p ] ) = R ( [ q 1 , q 2 , , q p ] ) , p = 1 , 2 , , m , i.e., the first p m columns of Q form an orthonormal basis for the subspace X p .
As is known [26], the sensitivity of a subspace of dimension p is measured by the p angles between the perturbed and unperturbed subspace. Let Q X and Q ˜ X be the orthonormal bases for X p and its perturbed counterpart X ˜ p , respectively. Then, the maximum angle δ Θ max p : = δ Θ max ( X ˜ p , X p ) between X ˜ p and X p is determined from [26]
sin ( δ Θ max p ) = Q X T Q ˜ X 2 ,
where Q X is the orthogonal complement of Q X , Q X T Q X = 0 . Since
Q ˜ X = Q X + δ Q X ,
one has that
sin ( δ Θ max p ) = Q X T δ Q X 2 .
    Equation (25) shows that the sensitivity of the column subspace X p is related to the values of the basic perturbation parameters x = q i T δ q j , = i + ( j 1 ) n j ( j + 1 ) 2 , i > p , j = 1 , 2 , , p . In particular, for p = 1 , the sensitivity of the first column of A is determined as
sin ( δ Θ max ( X 1 ˜ , X 1 ) ) = δ W 2 : n , 1 2 ,
for p = 2 , one has
sin ( δ Θ max ( X 2 ˜ , X 2 ) ) = δ W 3 : n , 1 : 2 2
and so on (see Figure 1).
In this way, if the basic perturbation parameters are known, it is possible to find the sensitivity estimates of all column subspaces with dimension p = 1 , 2 , , m . More specifically, let  
δ W = × × × × x 1 × × × x 2 x n × × x m 1 x n + m 3 x 2 n + m 6 × x n 1 x 2 n 3 x 3 n 6 x ν R n × m .
Then, we have that the maximum angle between the perturbed and unperturbed column subspace of dimension p is
δ Θ max p = arcsin ( δ W p + 1 : n , 1 : p 2 ) .
In particular, for the sensitivity of R ( A ) , we obtain that
sin ( δ Θ max ( X ˜ m , X m ) ) = δ W m + 1 : n , 1 : m 2 .
An asymptotic estimate of the maximum angle can be obtained, if, in the expression for the matrix δ W , the elements x , = 1 , 2 , , ν are replaced by their linear approximations (12). Representing the matrix M 1 as
M 1 = M 1 , 1 : ν 1 M 2 , 1 : ν 1 M 3 , 1 : ν 1 M ν , 1 : ν 1 ,
the matrix δ W can be written as
δ W = × × × × M 1 , 1 : ν 1 f × × × M 2 , 1 : ν 1 f M n , 1 : ν 1 f × × M n 1 , 1 : ν 1 f M 2 n 3 , 1 : ν 1 f M 3 n 6 , 1 : ν 1 f × = L ( I n f ) ,
where the rows of M 1 are highlighted in boxes,
L = × × × × M 1 , 1 : ν 1 × × × M 2 , 1 : ν 1 M n , 1 : ν 1 × × M n 1 , 1 : ν 1 M 2 n 3 , 1 : ν 1 M 3 n 6 , 1 : ν 1 × R n × n ν
and
I n f = f f f R n ν × n .
Using the fact that
I n f 2 = f 2 ,
we obtain the following asymptotic estimate,
| δ Θ max p | arcsin ( L p + 1 : n , 1 : p ν 2 f 2 ) arcsin ( L p + 1 : n , 1 : p ν 2 δ A F ) , p = 1 , 2 , , m .
Thus, an asymptotic bound of δ Θ max ( X ˜ p , X p ) is determined as
| δ Θ max p | δ Θ max p l i n : = cond ( Θ max p ) δ A F ,
where the quantity
cond ( Θ max p ) : = L p + 1 : n , 1 : p ν 2
can be considered as a condition number of the column subspace X p . The derivation of cond ( Θ max p ) is performed such that to find its possible minimum value.
In Table 4, we give the exact values of maximum angle | δ Θ max p | and its asymptotic bound δ Θ max p l i n for the perturbation problem considered in Example 1. In all cases, the size of the estimate matches correctly the size of the actual maximum angle between the perturbed and unperturbed subspace.

5. Perturbation Bounds of the Elements of R

It is convenient to first consider the sensitivity of the nontrivial elements of the upper triangular matrix R for the case of the diagonal elements. Due to the nonsingularity of R, these elements are nonzero.

5.1. Sensitivity Estimates of the Diagonal Elements of R

The changes in the elements of the perturbed matrix R satisfy
δ r i j = r ˜ i j r i j = q ˜ i T ( a j + δ a j ) , 1 i j m .
The above equation can be rewritten as
δ r i j = δ q i T a j + q ˜ i T δ a j .
Using Equations (7) and (8), one obtains for the perturbations of the diagonal ( i = j ) elements of R, the expressions
δ r i i = k = 1 i r k i q i T δ q k k = 1 i r k i δ q i T δ q k + q ˜ i T δ a i , i = 1 , 2 , , m .
Further on, we shall use the following quantities:
  • The diagonal elements of the matrix Q ˜ T δ A ,
    g = q ˜ 1 T δ a 1 , q ˜ 2 T δ a 2 , , q ˜ m T δ a m T R m .
  • The changes of the diagonal elements of R,
    δ r d i a g = δ r 11 , δ r 22 , , δ r m m T R m .
  • The diagonal elements of W,
    α = α 1 , α 2 , , α m T R m .
  • The quadratic terms in (30),
    Δ d = Δ 1 d , Δ 2 d , , Δ m d T R m ,
    where
    Δ i d = k = 1 i r k i δ q i T δ q k , i = 1 , 2 , , m .
Denote the columns of I n as e j , j = 1 , 2 , , n and the columns of I m as η j , j = 1 , 2 , , m . Then, the system of Equation (30) can be represented as
δ r d i a g = N 1 x + N 2 α + g + Δ d ,
where
N 1 = Π ( R T I n ) Ω T R m × ν , N 2 = diag ( r 11 , r 22 , , r m m ) R m × m ,
Π = η 1 e 1 T , η 2 e 2 T , , η m e m T R m × n · m ,
and the matrix Ω was defined earlier. Neglecting the quadratic terms in (31), one obtains the linear estimate
δ r d i a g = N 1 M 1 f + g .
Equation (32) can be represented in the compact form
δ r d i a g = [ N 1 M 1 , I m ] f g .
Using (33), one can derive condition numbers of the diagonal elements of R. Let
Z = [ N 1 M 1 , I m ] R m × ( ν + m ) .
Since
f g 2 δ A F ,
it follows from (33) that the asymptotic perturbation δ r i i satisfies
| δ r i i | δ r i i l i n : = cond ( r i i ) δ A F , i = 1 , 2 , , m ,
where
cond ( r i i ) = Z i , 1 : ν + m 2
is considered as a condition number of r i i . The derivation of (35) is performed to find the minimum possible value of cond ( r i i ) .
In Table 5, for the matrix A and the perturbations given in Example 1, we present the exact perturbations | δ r i i | of the diagonal elements of R and their linear and nonlinear estimates. The normwise quantities B ( δ R l i n ) and B ( δ R n o n l ) are the normwise linear and nonlinear bounds, derived in [17,23]. These bounds are more pessimistic than the bounds δ r i i l i n and δ r i i n o n l .

5.2. Sensitivity Estimates of the Super Diagonal Elements of R

According to (29), the perturbations of the super diagonal elements of the matrix R can be determined as
δ r i j = r ˜ i j r i j = k = 1 j r k j q i T δ q k k = 1 j r k j δ q i T δ q k + q ˜ i T δ a j , 1 i < j m .
Let us define the vectors (the elements of the corresponding matrices are taken row-wise),
δ r s u p d : = vec ( ( Up ( δ R ) ) T ) = Ω 2 vec ( δ R T ) R ν 2 , ν 2 = m ( m 1 ) / 2 , ( δ r s u p d ) 2 = δ r i j , 2 = j + ( i 1 ) m i ( i + 1 ) 2 , 1 i < j m , y : = vec ( ( Up ( Q 1 T δ Q 1 ) ) T ) = Ω 2 vec ( ( Q 1 T δ Q 1 ) T ) R ν 2 , y 2 = q i T δ q j , h : = vec ( ( Up ( Q ˜ 1 T δ A ) ) T ) = Ω 2 vec ( ( Q ˜ 1 T δ A ) ) T ) R ν 2 , h 2 = q ˜ i T δ a j ,
and
Δ r = Δ 1 r Δ 2 r Δ ν 2 r , Δ 2 r = k = 1 j r k j δ q i T δ q k , 2 = j + ( i 1 ) m i ( i + 1 ) 2 , 1 i < j m ,
where
Ω 2 : = diag ( ω 1 , ω 2 , , ω m 1 ) , 0 ν 2 × m R ν 2 × m 2 , ω k : = 0 ( m k ) × k , I m k R ( m k ) × m , k = 1 , 2 , , m 1 , Ω 2 T Ω 2 = I m 2 , Ω 2 2 = 1 .
Then, Equation (36) may be represented as the system of ν 2 nonlinear algebraic equations
δ r s u p d = M 1 y + M 2 x + M 3 α + h + Δ r , 1 i < j m ,
where M 1 , M 2 and M 3 are matrices whose elements are functions of the elements of R. These matrices are determined from
M 1 = Ω 2 P v e c ( R T I m ) P v e c Ω 2 T R ν 2 × ν 2 ,
M 2 = Ω 2 P v e c ( R T I m ) Ω 3 T R ν 2 × ν ,
M 3 = Ω 2 ( I m R T ) Π T R ν 2 × m ,
where
Ω 3 : = diag ( ω 1 , ω 2 , , ω m 1 ) , 0 ( ν q ) × m 0 q × m 2 R ν × m 2 , q = 2 ( n m ) , ω k : = 0 ( m k ) × k , I m k R ( m k ) × m , k = 1 , 2 , , m 1 , Ω 3 T Ω 3 = I m 2 , Ω 3 2 = 1 ,
and P v e c is the vec-permutation matrix as determined from ([27], Ch. 4)
vec ( A T ) = P v e c vec ( A ) .
According to (15), the components of the vector y satisfy
y 2 = x δ q i T δ q j , = j + ( i 1 ) n i ( i + 1 ) 2 , 2 = j + ( i 1 ) m i ( i + 1 ) 2 , 1 i < j m .
In a linear approximation, one has
y 2 = x ,
and it is possible to show that
y = Ω 4 x ,
where
Ω 4 : = diag ( ω 1 , ω 2 , , ω m 1 ) , 0 ν 2 × ( n m ) R ν 2 × ν , ω k : = I m k , 0 ( m k ) × ( n m ) R ( m k ) × ( n k ) , k = 1 , 2 , , m 1 , Ω 4 T Ω 4 = I ν , Ω 4 2 = 1 .
Neglecting the second-order terms in Equation (38) and using the linear estimate x = M 1 f , one obtains the asymptotic estimate
δ r s u p d = M 1 Ω 4 x + M 2 x + h = M 1 Ω 4 M 1 f + M 2 M 1 f + h , 1 i < j m .
Let us denote
Z = | M 1 Ω 4 M 1 | + | M 2 M 1 | , I ν 2 R ν 2 × ( ν + ν 2 ) .
Since
f h 2 δ A F ,
one concludes that, in a first-order approximation, the super diagonal elements of | δ R | fulfill
| δ r i j | δ r i j l i n = cond ( r i j ) δ A F , 1 i < j m ,
where
cond ( r i j ) = Z 2 , 1 : ν + ν 2 2 , 2 = j + ( i 1 ) m i ( i + 1 ) 2 , 1 i < j m .
Equation (40) gives asymptotic componentwise perturbation bounds for the super diagonal part of R. The quantity cond ( r i j ) represents the condition number of r i j with respect to the perturbations in A.
In Table 6, for the matrix A and the perturbations given in Example 1, we give the exact perturbations of the super diagonal elements of R and their linear estimates. As in the case of the diagonal elements, the normwise linear and nonlinear bounds B ( δ R l i n ) and B ( δ R n o n l ) give worse estimates than δ r i j l i n .
Hence, the full asymptotic componentwise perturbation analysis of the QR decomposition can be conducted using Equations (12), (23), (28), (34) and (40).

6. Determining Global Perturbation Bounds

Based on the analysis presented above, it is possible to derive an iterative scheme for finding global perturbation bounds of the QR decomposition. The main task of such a scheme is to find a nonlinear estimate of the vector x of the basic perturbation parameters. For this aim, it is necessary to estimate the quadratic term Δ x in (10). The analysis of the expression (10) shows that Δ x contains terms involving the perturbations δ q i for m < i n , which are not estimates up to the moment since they are columns of the matrix δ Q 2 = Q ˜ 2 Q 2 . As mentioned previously, the matrix Q 2 is not unique, and consequently its perturbation δ Q 2 is also non-unique. However, the problem with finding δ Q 2 of the minimum norm for a fixed Q 2 has a unique solution, and our first task in this section is to find an approximation of this perturbation.

6.1. Perturbation Bounds of the Columns of Q 2

According to (3), the perturbation δ Q 2 should satisfy the conditions:
( Q 1 + δ Q 1 ) T ( Q 2 + δ Q 2 ) = 0 ,
( Q 2 + δ Q 2 ) T ( Q 2 + δ Q 2 ) = I n m .
Equations (42) and (43) can be represented as
Q 1 T δ Q 2 + δ Q 1 T Q 2 = δ Q 1 T δ Q 2 , Q 2 T δ Q 2 + δ Q 2 T Q 2 = δ Q 2 T δ Q 2 .
Setting X 1 = Q 1 T δ Q 2 , X 2 = Q 2 T δ Q 2 , we obtain that
o r t h 1 ( X 1 , X 2 ) : = ( I m + W 1 T ) X 1 + W 2 T X 2 + W 2 T = 0 ,
o r t h 2 ( X 1 , X 2 ) : = X 2 + X 2 T + X 1 T X 1 + X 2 T X 2 = 0 ,
where W 1 = Q 1 T δ Q 1 , W 2 = Q 2 T δ Q 1 . (Note that δ W = [ W 1 T W 2 T ] T is already estimated). For sufficiently small perturbations δ Q 1 , the matrix I m + W 1 T is nonsingular, and we have that
X 1 = ( I m + W 1 T ) 1 W 2 T ( I n m + X 2 ) ,
X 2 + X 2 T = X 1 T X 1 X 2 T X 2 .
In the first-order analysis of (47), the term X 2 T X 2 can be neglected, and we have the approximation
X 2 + X 2 T X 1 T X 1 .
As shown in Appendix A, the minimum norm solution of the matrix Equation (48) with respect to X 2 is
X 2 a p p r = X 1 T X 1 / 2 .
The expression (49) shows that the size of the minimum norm matrix X 2 a p p r is of second order regarding to the size of X 1 , and hence X 2 can be neglected in the asymptotic analysis of (46). Thus, we obtain the first-order approximations
X 1 a p p r = ( I m + W 1 T ) 1 W 2 T ,
X 2 a p p r = X 1 T X 1 / 2 .
In this way, the matrix
X = X 1 X 2 = Q T δ Q 2
is approximated as
X a p p r = X 1 a p p r X 2 a p p r ,
and an approximation of δ Q 2 is obtained as
δ Q 2 a p p r = Q X a p p r .
In Table 7, for the perturbation problem presented in Example 1, we show the quantities related to the approximation of δ Q 2 and the norms of the matrices
o r t h 3 ( Q ˜ ) = I n Q ˜ T Q ˜ , o r t h 3 ( Q ˜ a p p r ) = I n ( Q ˜ a p p r ) T Q ˜ a p p r ,
characterizing the errors in the orthogonal matrices Q ˜ and Q ˜ a p p r , respectively. The approximation of the perturbed orthogonal factor Q ˜ a p p r is obtained as  
Q ˜ a p p r = [ Q 1 + δ Q 1 , Q 2 + δ Q 2 a p p r ] ,
where δ Q 1 is the exact perturbation of Q 1 . These quantities are computed for the three perturbations δ A = 10 11 A 0 , 5 × 10 9 A 0 and 3 × 10 6 A 0 . The results given in the table confirm the assumptions from the perturbation analysis of Q 2 .
For the same example used previously, in Table 8, we give the exact values of the elements of δ Q 2 and their approximations using (52). The exact minimum norm perturbation δ Q 2 is found numerically by solving the minimization problem
δ Q 2 = min U U F
under the constraint Q ˜ T Q ˜ = I n , Q ˜ = [ Q 1 + δ Q 1 , Q 2 + U ] . The minimization is performed by the MATLAB® function fmincon. The results show that, in all cases, | δ q i j a p p r | is close to  | δ q i j | .

6.2. Iterative Procedure for Finding Global Bounds of the Elements of x

Since one has linear estimates of the basic perturbation terms x = q i T δ q j , it is appropriate to substitute the terms containing the perturbations δ q j in Equation (16) by the perturbations
δ w j = Q T δ q j , j = 1 , 2 , , m ,
which are of the same size as δ q j . Since  
δ q i T δ q j = δ q i T Q Q T δ q j = δ w i T δ w j ,
the absolute value of the matrix δ W (16) can be bounded as
| δ W | = | Q T δ Q 1 | : = [ | δ w 1 | , | δ w 2 | , , | δ w m | ] , δ W n o n l = | δ V | + | δ D | + | δ Y | ,
where
| δ V | = 0 | x 1 | | x 2 | | x m 1 | | x 1 | 0 | x n | | x n + m 3 | | x 2 | | x n | 0 | x 2 n + m 6 | | x m 1 | | x n + m 3 | | x 2 n + m 6 | 0 | x n 1 | | x 2 n 3 | | x 3 n 6 | | x ν | R n × m ,
| δ D | = | α 1 | 0 0 0 0 | α 2 | 0 0 0 0 | α 3 | 0 0 0 0 | α m | 0 0 0 0 0 0 0 0 R n × m ,
| δ Y | = 0 | δ w 1 T | | δ w 2 | | δ w 1 T | | δ w 3 | | δ w 1 T | | δ w m | 0 0 | δ w 2 T | | δ w 3 | | δ w 2 T | | δ w m | 0 0 0 | δ w 3 T | | δ w m | 0 0 0 | δ w m 1 T | | δ w m | 0 0 0 0 0 0 0 0 R n × m .
Since the unknown column estimates | δ w j | participate in both sides of (53), it is possible to obtain | δ w j | recursively as follows.
Let
| δ w 1 | = | δ v 1 | + | δ d 1 | ,
where | δ v 1 | and | δ d 1 | are the first columns of | δ V | and | δ D | , respectively. Then, the next column estimates | δ w j | , j = 2 , 3 , m can be determined as
| δ w j | | S j | 1 | δ w j 1 | = | S j | 1 ( | δ v j 1 | + | δ d j 1 | ) ,
where
| S j | = e 1 T | δ w 1 T | e 2 T | δ w 2 T | e j 1 T | δ w j 1 T | e j T e n T R n × n .
If | | δ w k | | 2 < 1 , k = 1 , 2 , , j 1 , the matrix | S j | is strictly diagonally dominant and nonsingular ([28], p. 352) and if | | δ w k | | 2 are small, then the condition number of S j is close to 1.
The matrix δ W n o n l only gives estimates of the first m columns of | Q T δ Q | . Using the representation
δ W n o n l = W 1 W 2 , W 1 R m × m , W 2 R ( n m ) × m ,
one can find an approximation X a p p r of the matrix Q T Q 2 using the Equations (50) and (51). Thus, an approximation of | Q T δ Q | is obtained as
Z = δ W n o n l , | X a p p r | .
After determining estimates of | δ w j | , j = 1 , 2 , , m , it is possible to bound the absolute values of the quadratic terms Δ x , given in (11), as
| Δ x | = k = 1 j | r k j | z i T z k , = i + ( j 1 ) n j ( j 1 ) 2 , 1 j m , j < i n .
The column z j , 1 j n represents an estimate of | Q T δ q j | such that | δ q i T δ q k | | δ q i T Q | | Q T δ q k | = z i T z k .
In this way, one obtains an iterative scheme involving Equations (11) and (53)–(55). At each step s, the value of the nonlinear estimate of x is determined from
x s n o n l = x l i n + | M 1 | | Δ s x | , s = 0 , 1 ,
with initial condition x 0 n o n l = e p s [ 1 , 1 , , 1 ] T , where e p s is the MATLAB® function eps, e p s = 2 52 = 2 u . The stopping criterion is taken as
e r r s = x s n o n l x s 1 n o n l 2 / x s 1 n o n l 2 < t o l = 10 e p s .
This scheme converges for perturbations of restricted size. As shown in ([17], Ch. 4), the size of the maximum allowable perturbation for which the nonlinear normwise estimate of x is valid is given by
δ A F δ 0 : = 1 M 1 2 ( 2 μ ν + 2 + 8 μ ν 2 ) ,
where μ ν = ( ν 1 ) / ( 2 ν ) .
In Table 9, we present the number of iterations necessary to find the nonlinear estimate x n o n l for the perturbation problem considered in Example 1, along with x 2 and x n o n l 2 . The components of x n o n l are shown for three different perturbations in the fifth column of Table 1 along with the vectors | x | and x l i n .
    In Figure 2, we show the convergence of the relative error e r r s as a function of s for different perturbations δ A = 10 k A 0 . As is seen from the figure, with the increasing perturbation size, the convergence worsens, and, for k = 5 ( δ A F = 1.78326 × 10 4 ) , the iterations do not converge since the global bound does not exist. The convergence of the iterations is linear, and this can be improved by using appropriate optimization techniques.

6.3. Global Perturbation Bounds of Q 1 , Column Subspaces and R

Implementing the obtained nonlinear estimate of x, one may find nonlocal bounds on the perturbations of the column subspaces, diagonal and super diagonal elements of R using Equations (26), (31) and (38).
After determining the nonlinear bounds of x and | δ W | , it is possible to find nonlinear bounds on the perturbations of the elements of Q 1 according to the relationship
δ Q 1 n o n l = | Q | | δ W n o n l | .
The nonlinear bounds δ q i j n o n l of the elements of Q 1 for the QR decomposition given in Example 1 and a perturbation δ A = 3 × 10 6 A 0 are shown in the last column of Table 3 along with | δ q i j | and δ q i j l i n .
A global estimate of the maximum angle between the perturbed and unperturbed column subspace of dimension p is obtained from (26). The values of δ Θ max p n o n l for the matrix A from Example 1 and three different perturbations are given in the last rows of Table 4.
Nonlinear bounds on the diagonal elements of R can be obtained by using the expressions
δ r d i a g n o n l = δ r d i a g l i n + | Δ d | , | Δ i d | = k = 1 i | r k i | | δ w i T | | δ w k | , i = 1 , 2 , , m ,
and global bounds of the perturbations of the super diagonal elements of R can be found from  
δ r s u p d n o n l = δ r s u p d l i n + | M 3 | α + | Δ r | , α = [ | α 1 | , | α 2 | , , | α m | ] T , | α j | = δ w j 2 2 / ( 1 + 1 δ w j 2 2 ) , j = 1 , 2 , , m , | Δ 2 r | = k = 1 j | r k j | | δ w i T | | δ w k | , 2 = j + ( i 1 ) m i ( i + 1 ) 2 , 1 i < j m .
The nonlinear perturbation bounds δ r i i n o n l of the diagonal elements of R for the matrix A from Example 1 and for three perturbations δ A are given in Table 5, and the nonlinear bounds δ r i j n o n l of the super diagonal elements are presented in Table 6. We note that the global perturbation estimates are slightly larger than the corresponding asymptotic estimates but give guaranteed bounds on the perturbations whenever these estimate exist.

7. Comparison with Other Bounds

In this section, we consider two examples in which we compare the perturbation bounds of the QR decomposition obtained in this paper with the bounds that were previously proposed.
Example 2.
Consider the fifth-order matrix [12],
A = 1 1 1 1 1 0 1 1 1 1 0 0 1 1 1 0 0 0 1 1 0 0 0 0 1 .
The matrix A is nonsingular, and its QR factors are Q = I 5 and R = A . The perturbation matrix is the 5 × 5 random matrix
δ A = 10 3 0.2742 0.2944 0.3245 0.1483 0.9386 0.1186 0.1669 0.9198 0.2358 0.9445 0.6810 0.1577 0.1804 0.1979 0.1045 0.8284 0.9223 0.3286 0.7425 0.2188 0.2091 0.4420 0.2410 0.8721 0.2947 .
Using the function qr of MATLAB®, we obtain (to four decimal digits) that
| δ Q | = 6.0357 × 10 7 1.1901 × 10 4 6.8154 × 10 4 8.2758 × 10 4 2.0877 × 10 4 1.1857 × 10 4 3.9005 × 10 7 8.3831 × 10 4 9.5401 × 10 5 2.3275 × 10 4 6.8081 × 10 4 8.3827 × 10 4 1.1808 × 10 6 1.0608 × 10 3 2.6485 × 10 4 8.2817 × 10 4 9.4474 × 10 5 1.0605 × 10 3 1.0795 × 10 6 5.8280 × 10 4 2.0904 × 10 4 2.3305 × 10 4 2.6418 × 10 4 5.8289 × 10 4 2.5378 × 10 7 .
The nonlinear bound of the perturbation of Q, obtained after 16 iterations, is
δ Q n o n l = 1.4500 × 10 5 2.7027 × 10 3 2.7360 × 10 3 2.7719 × 10 3 2.7723 × 10 3 2.6710 × 10 3 2.6773 × 10 5 3.9544 × 10 3 4.0418 × 10 3 4.0428 × 10 3 2.6876 × 10 3 3.8918 × 10 3 6.0542 × 10 5 7.1420 × 10 3 7.1444 × 10 3 2.7056 × 10 3 3.9535 × 10 3 7.0244 × 10 3 1.2828 × 10 4 1.3645 × 10 2 2.7058 × 10 3 3.9541 × 10 3 7.0263 × 10 3 1.3574 × 10 2 1.2829 × 10 4 .
The maximum element of the global estimate B q r , w of δ Q , obtained in [12], is 3.59687 × 10 2 , while the maximum element of δ Q n o n l is 1.3645 × 10 2 . Furthermore, B q r , w F = 0.0648 , while δ Q n o n l F = 0.02693 .
Example 3.
Consider a 20 × 15 matrix A, taken as
A = P 0 S 0 0 ,
where S 0 is an upper triangular matrix with unit diagonal and super diagonal elements equal to 3, and the matrix P 0 is constructed as proposed in [29],
P 0 = H 2 Σ H 1 , H 1 = I n 2 u u T / n , H 2 = I n 2 v v T / n , u = [ 1 , 1 , 1 , , 1 ] T , v = [ 1 , 1 , 1 , , ( 1 ) n 1 ] T , Σ = diag ( 1 , σ , σ 2 , , σ n 1 ) ,
where H 1 and H 2 are elementary reflections that are orthogonal and symmetric matrices [30]. The condition number of P 0 with respect to the inversion is controlled by the variable σ and is equal to σ n 1 . In the given case, σ is taken equal to 1.2 , and cond ( P 0 ) = 31.9480 . The minimum singular value of the matrix M satisfies
1 / σ min ( M ) = 2784.9 ,
which means that the perturbations of Q and R can be several orders of magnitude larger than the perturbations of A. The perturbation of A is chosen as δ A = 10 c · A 0 , where c is a positive number and A 0 is a matrix with random entries generated by the MATLAB® function rand.
Several results related to the perturbation problem under consideration for 30 values of c between 13 and 5 are given in Figure 3, Figure 4, Figure 5, Figure 6, Figure 7 and Figure 8. In Figure 3, we display the perturbations of the particular entry Q 15 , 10 , which is an element of the matrix Q 1 . The quantities B ( δ Q l i n ) and B ( δ Q n o n l ) are the normwise linear and nonlinear bounds derived in [17,23].
These bounds are more than 12-times larger than the norms of the linear δ Q l i n and nonlinear δ Q n o n l componentwise bounds obtained in Section 3. The nonlinear bound is close to the linear one for perturbations of different sizes and increases gradually in the vicinity of the quantity δ A F 6.20078 × 10 7 . For perturbations of a larger size, the iterations for x n o n l do not converge. In Figure 4, we compare the exact perturbation δ Q 15 , 16 of the entry Q 15 , 16 (which is also the element ( δ Q 2 ) 15 , 1 of δ Q 2 ) with the linear approximation δ Q 15 , 16 a p p r . Both quantities are close for all perturbations. This is confirmed by the values of the errors
o r t h 1 ( X 1 a p p r , X 2 a p p r ) F , o r t h 2 ( X 1 a p p r , X 2 a p p r ) F , o r t h 3 ( Q ˜ a p p r ) F ,
shown in Figure 5, which are much smaller than the value of δ Q F for all perturbations.
The bounds of the quantity δ Θ max 15 (the maximum angle between the perturbed and unperturbed range of A), shown in Figure 6, are close to the exact value of this angle, with the nonlinear bound being slightly greater than the linear one. The normwise linear B ( δ R l i n ) and the nonlinear B ( δ R n o n l ) bounds obtained in [17,23], are more than 75,000-times greater than the linear δ R 55 l i n and the nonlinear δ R 55 n o n l bounds of the diagonal element R 55 , shown in Figure 7. Similarly, the normwise bounds B ( δ R l i n ) and B ( δ R n o n l ) are more than 13,000-times greater than the bounds δ R 2 , 10 l i n and δ R 2 , 10 n o n l as shown in Figure 8. This large difference between the sizes of the actual component perturbations of R and the normwise bounds is explained by the large condition number of the computed R—equal to 1.5353 × 10 6 . (Note that cond ( R ) = cond ( A ) ).
Note that, while the normwise estimates are valid for perturbations with sizes up to δ 0 = 9.31420 × 10 5 , the iterations to find x n o n l converge for perturbations δ A F 6.20078 × 10 7 .
The results obtained show that the asymptotic bounds are valid for much larger perturbations then the global bounds.

8. Conclusions

The method presented in the paper allows us to find, in a unified manner, componentwise asymptotic and global perturbation bounds for all elements of the QR decomposition, thus, providing a complete perturbation analysis of this important matrix factorization. The bounds obtained in the paper are smaller than some known bounds and can be significantly better than the normwise bounds.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets generated during the current study are available from the author on reasonable request.

Acknowledgments

The author is grateful to the reviewers for their remarks and suggestions that helped to improve the paper.

Conflicts of Interest

The author declares no conflict of interest.

Abbreviations

R the set of real numbers;
R n × m the space of n × m real matrices ( R n = R n × 1 );
R ( A ) the range of A;
X the orthogonal complement of the subspace X ;
| A | the matrix of absolute values of the elements of A;
A T the transposed of A;
A 1 the inverse of A;
A the pseudoinverse of A;
a j the jth column of A;
A i , 1 : n the ith row of m × n matrix A;
A i 1 : i 2 , j 1 : j 2 the part of matrix A from row i 1 to i 2 and from column j 1 to j 2 ;
δ A perturbation of A;
0 m × n the zero m × n matrix;
I n the unit n × n matrix;
e j the jth column of I n ;
σ min ( A ) the minimum singular value of A; : = , equal by definition;
relation of partial order. If a , b R n , then a b means a i b i , i = 1 , 2 , , n ;
Low ( A ) the strictly lower triangular part of A;
Up ( A ) the strictly upper triangular part of A;
A 2 the spectral norm of A;
A F the Frobenius norm of A;
A B the Kronecker product of A and B;
vec ( A ) the vec mapping of A R n × m . If A is partitioned columnwise as
A = [ a 1 , a 2 , a m ] then vec ( A ) = [ a 1 T , a 2 T , , a m T ] T ;
P v e c the vec-permutation matrix. vec ( A T ) = P v e c vec ( A ) ;
Θ max ( X , Y ) the maximum angle between subspaces X and Y ;
O ( δ A F 2 ) a quantity of second order with respect to δ A F .

Appendix A

Theorem A1.
The minimum Frobenius norm solution of the matrix equation
X + X T = Φ , X R p × p , Φ R p × p , Φ T = Φ
is given by
X m i n = Φ / 2 .
Proof. 
Equation (A1) is represented as
( I p 2 + P v e c ) vec ( X ) = vec ( Φ ) ,
where P v e c is the vec-permutation matrix satisfying vec ( X T ) = P v e c vec ( X ) . This matrix is symmetric and orthogonal and has p ( p + 1 ) / 2 eigenvalues equal to 1 and p ( p 1 ) / 2 eigenvalues equal to −1 ([27], p. 265). Hence, for some orthogonal U, it may be represented as
P v e c = U diag ( I p ( p + 1 / 2 , I p ( p 1 / 2 ) U T ,
so that
I p 2 + P v e c = U diag ( 2 I p ( p + 1 / 2 , 0 p ( p 1 / 2 ) U T .
The minimum 2-norm solution of (A3), corresponding to the minimum Frobenius solution of (A1), is given by
vec ( X m i n ) = ( I p 2 + P v e c ) vec ( Φ ) ,
where
( I p 2 + P v e c ) = U diag ( I p ( p + 1 / 2 / 2 , 0 p ( p 1 / 2 ) U T .
Thus,
( I p 2 + P v e c ) = ( I p 2 + P v e c ) / 4
and
vec ( X m i n ) = ( I p 2 + P v e c ) vec ( Φ ) / 4 .
Since
P v e c vec ( Φ ) = vec ( Φ T ) = vec ( Φ ) ,
it follows that
X m i n = ( Φ + Φ ) / 4 = Φ / 2 ,
q.e.d. □

References

  1. Stewart, G.W. Matrix Algorithms; Vol. I: Basic Decompositions; SIAM: Philadelphia, PA, USA, 1998; ISBN 0-89871-414-1. [Google Scholar]
  2. Stewart, G.W.; Sun, J.-G. Matrix Perturbation Theory; Academic Press: San Diego, CA, USA, 1990; ISBN 978-0126702309. [Google Scholar]
  3. Bhatia, R. Matrix factorizations and their perturbations. Linear Algebra Appl. 1994, 197, 245–276. [Google Scholar] [CrossRef] [Green Version]
  4. Li, R. Matrix perturbation theory. In Handbook of Linear Algebra, 2nd ed.; Hogben, L., Ed.; CRC Press: Boca Raton, FL, USA, 2014; Chapter 21; pp. 1–20. [Google Scholar]
  5. Higham, N. A survey of componentwise perturbation theory in numerical linear algebra. In Mathematics of Computation 1943–1993: A Half Century of Computational Mathematics; Gautchi, W., Ed.; Amer. Mathematical Society: Providence, RI, USA, 1994; pp. 49–77. ISBN 0-8218-0291-7. [Google Scholar]
  6. Stewart, G.W. Perturbation bounds for the QR factorization of a matrix. SIAM J. Numer. Anal. 1977, 14, 509–518. [Google Scholar] [CrossRef]
  7. Sun, J.-G. Perturbation bounds for the Cholesky and QR factorizations. BIT Numer. Math. 1991, 31, 341–352. [Google Scholar] [CrossRef]
  8. Stewart, G.W. On the perturbation of LU, Cholesky, and QR factorizations. SIAM J. Matrix Anal. Appl. 1993, 14, 1141–1145. [Google Scholar] [CrossRef] [Green Version]
  9. Chang, X.-W.; Paige, C.C.; Stewart, G.W. Perturbation analyses for the QR factorization. SIAM J. Matrix Anal. Appl. 1997, 18, 1328–1340. [Google Scholar] [CrossRef] [Green Version]
  10. Chang, X.-W.; Stehlé, D. Rigorous perturbation bounds of some matrix factorizations. SIAM J. Matrix Anal. Appl. 2010, 31, 2841–2859. [Google Scholar] [CrossRef] [Green Version]
  11. Li, H.; Wei, Y. Improved rigorous perturbation bounds for the LU and QR factorizations. Numer. Linear Algebra Appl. 2015, 22, 1115–1130. [Google Scholar] [CrossRef]
  12. Sun, J.-G. Componentwise perturbation bounds for some matrix decompositions. BIT Numer. Math. 1992, 32, 702–714. [Google Scholar] [CrossRef]
  13. Zha, H.Y. A componentwise perturbation analysis of the QR decomposition. SIAM J. Matrix Anal. Appl. 1995, 14, 1124–1131. [Google Scholar] [CrossRef]
  14. Chang, X.-W.; Paige, C.C. Componentwise perturbation analyses for the QR factorization. Numer. Math. 2001, 88, 319–345. [Google Scholar] [CrossRef]
  15. Chang, X.-W. On the perturbation of the Q-factor of the QR factorization. Numer. Linear Algebra Appl. 2012, 19, 607–619. [Google Scholar] [CrossRef]
  16. Konstantinov, M.M.; Petkov, P.H.; Christov, N.D. Nonlocal perturbation analysis of the Schur system of a matrix. SIAM J. Matrix Anal. Appl. 1994, 15, 383–392. [Google Scholar] [CrossRef]
  17. Konstantinov, M.M.; Petkov, P.H. Perturbation Methods in Matrix Analysis and Control; NOVA Science Publishers, Inc.: New York, NY, USA, 2020; ISBN 978-1-53617-470-0. [Google Scholar]
  18. Chen, X.S. Perturbation bounds for the periodic Schur decomposition. BIT Numer. Math. 2010, 50, 41–58. [Google Scholar] [CrossRef]
  19. Chen, X.S.; Li, W.; Ng, M.K. Perturbation analysis for antitriangular Schur decomposition. SIAM J. Matrix Anal. Appl. 2012, 33, 1328–1340. [Google Scholar] [CrossRef]
  20. Petkov, P. Componentwise perturbation analysis of the Schur decomposition of a matrix. SIAM J. Matrix Anal. Appl. 2021, 42, 108–133. [Google Scholar] [CrossRef]
  21. Sun, J.-G. Perturbation bounds for the generalized Schur decomposition. SIAM J. Matrix Anal. Appl. 1995, 16, 1328–1340. [Google Scholar] [CrossRef]
  22. Zhang, G.; Li, H.; Wei, Y. Componentwise perturbation analysis for the generalized Schur decomposition. Calcolo 2022, 59. [Google Scholar] [CrossRef]
  23. Sun, J.-G. On perturbation bounds for the QR factorization. Linear Algebra Appl. 1995, 215, 95–112. [Google Scholar] [CrossRef] [Green Version]
  24. MATLAB Version 9.9.0.1538559 (R2020b) Update 3; The MathWorks, Inc.: Natick, MA, USA, 2020.
  25. Gohberg, I.; Koltracht, I. Mixed, componentwise, and structured condition numbers. SIAM J. Matrix Anal. Appl. 1993, 14, 688–704. [Google Scholar] [CrossRef]
  26. Björck, Å.; Golub, G. Numerical methods for computing angles between linear subspaces. Math. Comp. 1973, 27, 579–594. [Google Scholar] [CrossRef]
  27. Horn, R.A.; Johnson, C.R. Topics in Matrix Analysis; Cambridge University Press: Cambridge, UK, 1991; ISBN 0-521-30587-X. [Google Scholar]
  28. Horn, R.A.; Johnson, C.R. Matrix Analysis, 2nd ed.; Cambridge University Press: Cambridge, UK, 2013; ISBN 978-0-521-83940-2. [Google Scholar]
  29. Bavely, C.A.; Stewart, G.W. An algorithm for computing reducing subspaces by block diagonalization. SIAM J. Numer. Anal. 1979, 16, 359–367. [Google Scholar] [CrossRef]
  30. Stewart, G.W. Matrix Algorithms; Vol. II: Eigensystems; SIAM: Philadelphia, PA, USA, 2001; ISBN 0-89871-503-2. [Google Scholar]
Figure 1. Perturbation estimates of the column subspaces.
Figure 1. Perturbation estimates of the column subspaces.
Mathematics 10 04687 g001
Figure 2. Iterations for determining the global bounds for different perturbations.
Figure 2. Iterations for determining the global bounds for different perturbations.
Mathematics 10 04687 g002
Figure 3. Exact values of δ Q 15 , 10 and its bounds as functions of the perturbation norm.
Figure 3. Exact values of δ Q 15 , 10 and its bounds as functions of the perturbation norm.
Mathematics 10 04687 g003
Figure 4. Exact values of δ Q 15 , 16 and its bounds as functions of the perturbation norm.
Figure 4. Exact values of δ Q 15 , 16 and its bounds as functions of the perturbation norm.
Mathematics 10 04687 g004
Figure 5. The errors o r t h 1 ( X 1 a p p r , X 2 a p p r ) F , o r t h 2 ( X 1 a p p r , X 2 a p p r ) F , o r t h 3 ( Q ˜ a p p r ) F as functions of the perturbation norm.
Figure 5. The errors o r t h 1 ( X 1 a p p r , X 2 a p p r ) F , o r t h 2 ( X 1 a p p r , X 2 a p p r ) F , o r t h 3 ( Q ˜ a p p r ) F as functions of the perturbation norm.
Mathematics 10 04687 g005
Figure 6. Exact values of δ Θ max 15 and its bounds as functions of the perturbation norm.
Figure 6. Exact values of δ Θ max 15 and its bounds as functions of the perturbation norm.
Mathematics 10 04687 g006
Figure 7. Exact values of δ R 55 and its bounds as functions of the perturbation norm.
Figure 7. Exact values of δ R 55 and its bounds as functions of the perturbation norm.
Mathematics 10 04687 g007
Figure 8. Exact values of δ R 2 , 10 and its bounds as functions of the perturbation norm.
Figure 8. Exact values of δ R 2 , 10 and its bounds as functions of the perturbation norm.
Mathematics 10 04687 g008
Table 1. Exact basic perturbation parameters and their linear and nonlinear estimates.
Table 1. Exact basic perturbation parameters and their linear and nonlinear estimates.
δ A F x = q i T δ q j | x | x lin x nonl
12345
1.78326 × 10 10 x 1 = q 2 T δ q 1 6.48563 × 10 13 7.80510 × 10 12 7.80510 × 10 12
x 2 = q 3 T δ q 1 3.81408 × 10 12 7.80510 × 10 12 7.80510 × 10 12
x 3 = q 4 T δ q 1 3.12632 × 10 12 7.80510 × 10 12 7.80510 × 10 12
x 4 = q 3 T δ q 2 6.73721 × 10 9 2.04508 × 10 7 2.04508 × 10 7
x 5 = q 4 T δ q 2 6.00990 × 10 8 2.04508 × 10 7 2.04508 × 10 7
x 6 = q 4 T δ q 3 6.70820 × 10 8 2.28281 × 10 7 2.28281 × 10 7
8.91628 × 10 8 x 1 = q 2 T δ q 1 3.24302 × 10 10 3.90255 × 10 9 3.90335 × 10 9
x 2 = q 3 T δ q 1 1.90707 × 10 9 3.90255 × 10 9 3.90340 × 10 9
x 3 = q 4 T δ q 1 1.56317 × 10 9 3.90255 × 10 9 3.90340 × 10 9
x 4 = q 3 T δ q 2 3.36826 × 10 6 1.02254 × 10 4 1.02280 × 10 4
x 5 = q 4 T δ q 2 3.00486 × 10 5 1.02254 × 10 4 1.02280 × 10 4
x 6 = q 4 T δ q 3 3.35398 × 10 5 1.14140 × 10 4 1.14193 × 10 4
5.34977 × 10 5 x 1 = q 2 T δ q 1 1.94581 × 10 7 2.34153 × 10 6 2.75590 × 10 6
x 2 = q 3 T δ q 1 1.14424 × 10 6 2.34153 × 10 6 2.82650 × 10 6
x 3 = q 4 T δ q 1 9.37903 × 10 7 2.34153 × 10 6 2.81974 × 10 6
x 4 = q 3 T δ q 2 1.99332 × 10 3 6.13524 × 10 2 7.59140 × 10 2
x 5 = q 4 T δ q 2 1.77825 × 10 2 6.13524 × 10 2 7.65532 × 10 2
x 6 = q 4 T δ q 3 1.97618 × 10 2 6.84843 × 10 2 9.92798 × 10 2
Table 2. Approximation of the diagonal elements of matrix W.
Table 2. Approximation of the diagonal elements of matrix W.
δ A F 1.78325 × 10 10 8.91627 × 10 8 5.34976 × 10 5
| α 1 | 1.67646 × 10 16 1.74935 × 10 16 1.11378 × 10 12
| α 2 | 1.98416 × 10 15 4.57132 × 10 10 1.60108 × 10 4
| α 3 | 2.33940 × 10 15 5.68134 × 10 10 1.98034 × 10 4
| α 1 l i n | 1.23709 × 10 23 3.09280 × 10 18 1.11341 × 10 12
| α 2 l i n | 1.82864 × 10 15 4.57132 × 10 10 1.60095 × 10 4
| α 3 l i n | 2.27269 × 10 15 5.68131 × 10 10 1.97252 × 10 4
| α 1 n o n l | 1.23709 × 10 23 3.09280 × 10 18 1.11341 × 10 12
| α 2 n o n l | 1.82864 × 10 15 4.57132 × 10 10 1.60108 × 10 4
| α 3 n o n l | 2.27269 × 10 15 5.68131 × 10 10 1.97271 × 10 4
Table 3. Exact perturbations of the elements of the matrix Q 1 and their linear and nonlinear estimates, δ A = 3 × 10 6 A 0 , δ A F = 5.34977 × 10 5 , B ( δ Q l i n ) = c Q δ A F = 0.13003 , and B ( δ Q n o n l ) = 0.14519 .
Table 3. Exact perturbations of the elements of the matrix Q 1 and their linear and nonlinear estimates, δ A = 3 × 10 6 A 0 , δ A F = 5.34977 × 10 5 , B ( δ Q l i n ) = c Q δ A F = 0.13003 , and B ( δ Q n o n l ) = 0.14519 .
q ij | δ q ij | δ q ij lin δ q ij nonl
q 11 8.24060 × 10 7 2.46313 × 10 6 2.94752 × 10 6
q 21 5.56921 × 10 7 3.27407 × 10 6 3.94135 × 10 6
q 31 1.78849 × 10 7 2.15221 × 10 6 2.53307 × 10 6
q 41 1.09799 × 10 6 3.07134 × 10 6 3.68975 × 10 6
q 12 5.88076 × 10 3 4.50959 × 10 2 5.63774 × 10 2
q 22 5.89442 × 10 3 7.93060 × 10 2 9.85345 × 10 2
q 32 1.47078 × 10 4 3.46070 × 10 3 5.35863 × 10 3
q 42 1.58388 × 10 2 7.07534 × 10 2 8.82920 × 10 2
q 13 4.76877 × 10 3 4.20957 × 10 2 5.95634 × 10 2
q 23 8.37491 × 10 3 3.98794 × 10 2 5.85481 × 10 2
q 33 2.15468 × 10 3 5.63927 × 10 2 7.57743 × 10 2
q 43 1.72784 × 10 2 7.02671 × 10 2 1.01256 × 10 1
Table 4. Exact perturbations of the maximum subspace angles and their linear and nonlinear  estimates.
Table 4. Exact perturbations of the maximum subspace angles and their linear and nonlinear  estimates.
δ A F 1.78326 × 10 10 8.91628 × 10 8 5.34977 × 10 5
| δ Θ max 1 | 4.97410 × 10 12 2.48709 × 10 9 1.49225 × 10 6
| δ Θ max 2 | 6.04754 × 10 8 3.02368 × 10 5 1.78948 × 10 2
| δ Θ max 3 | 9.00660 × 10 8 4.50315 × 10 5 2.65878 × 10 2
δ Θ max 1 l i n 7.80510 × 10 12 3.90255 × 10 9 2.34153 × 10 6
δ Θ max 2 l i n 2.04508 × 10 7 1.02254 × 10 4 6.13524 × 10 2
δ Θ max 3 l i n 3.06490 × 10 7 1.53245 × 10 4 9.19468 × 10 2
δ Θ max 1 n o n l 1.35188 × 10 11 6.76085 × 10 9 4.85129 × 10 6
δ Θ max 2 n o n l 2.89218 × 10 7 1.44645 × 10 4 1.08022 × 10 1
δ Θ max 3 n o n l 3.06490 × 10 7 1.53301 × 10 4 1.25698 × 10 1
Table 5. Exact perturbations of the diagonal elements of R and their linear and nonlinear bounds.
Table 5. Exact perturbations of the diagonal elements of R and their linear and nonlinear bounds.
δ A F 1.78326 × 10 10 8.91628 × 10 8 5.34977 × 10 5
| δ r 11 | 9.19442 × 10 12 4.59573 × 10 9 2.75746 × 10 6
| δ r 22 | 4.20811 × 10 11 2.10408 × 10 8 1.27735 × 10 5
| δ r 33 | 1.51994 × 10 8 7.60002 × 10 6 4.88606 × 10 3
δ r 11 l i n 1.78326 × 10 10 8.91628 × 10 8 5.34977 × 10 5
δ r 22 l i n 1.87973 × 10 10 9.39863 × 10 8 5.63918 × 10 5
δ r 33 l i n 4.56562 × 10 7 2.28281 × 10 4 1.36969 × 10 1
δ r 11 n o n l 1.78618 × 10 10 1.62255 × 10 7 4.80568 × 10 2
δ r 22 n o n l 1.88265 × 10 10 1.67069 × 10 7 4.80543 × 10 2
δ r 33 n o n l 4.56562 × 10 7 2.28330 × 10 4 1.69291 × 10 1
B ( δ R l i n ) 1.44561 × 10 5 7.22804 × 10 3 4.33683 × 10 0
B ( δ R n o n l ) 1.44561 × 10 5 7.22915 × 10 3 4.84251 × 10 0
Table 6. Exact perturbations of the super diagonal elements of R and their linear and nonlinear bounds.
Table 6. Exact perturbations of the super diagonal elements of R and their linear and nonlinear bounds.
δ A F 1.78326 × 10 10 8.91628 × 10 8 5.34977 × 10 5
| δ r 12 | 6.56506 × 10 11 3.28263 × 10 8 1.96958 × 10 5
| δ r 13 | 2.19309 × 10 11 1.09686 × 10 8 6.58120 × 10 6
| δ r 23 | 1.34417 × 10 8 6.72117 × 10 6 4.33437 × 10 3
δ r 12 l i n 1.78326 × 10 10 8.91628 × 10 8 5.34977 × 10 5
δ r 13 l i n 1.79853 × 10 10 8.99267 × 10 8 5.39560 × 10 5
δ r 23 l i n 4.09016 × 10 7 2.04508 × 10 4 1.22705 × 10 1
δ r 12 n o n l 1.78326 × 10 10 8.91628 × 10 8 5.34981 × 10 5
δ r 13 n o n l 1.79853 × 10 10 8.99301 × 10 8 5.58512 × 10 5
δ r 23 n o n l 4.09016 × 10 7 2.04555 × 10 4 1.48774 × 10 1
B ( δ R l i n ) 1.44561 × 10 5 7.22804 × 10 3 4.33683 × 10 0
B ( δ R n o n l ) 1.44561 × 10 5 7.22915 × 10 3 4.84251 × 10 0
Table 7. Quantities related to the approximation of δ Q 2 .
Table 7. Quantities related to the approximation of δ Q 2 .
δ A F 1.7832554500 × 10 10 8.9162772500 × 10 8 5.3497663500 × 10 5
X 1 F 9.0065954775 × 10 8 4.5031489846 × 10 5 2.6584712300 × 10 2
X 2 F 4.1725109797 × 10 15 1.0139176039 × 10 9 3.5343592252 × 10 4
e r r 1 1.0034590138 × 10 16 1.2139643751 × 10 16 1.0775666870 × 10 16
e r r 2 2.3314574995 × 10 16 1.2904023661 × 10 16 5.7370600309 × 10 19
X 1 T X 1 F 8.1118762095 × 10 15 2.0278350777 × 10 9 7.0674692808 × 10 4
X 2 T X 2 F 1.7409847876 × 10 29 1.0280289075 × 10 18 1.2491695133 × 10 7
X 1 a p p r F 9.0065954782 × 10 8 4.5031489891 × 10 5 2.6594111615 × 10 2
X 2 a p p r F 4.0559381054 × 10 15 1.0139175409 × 10 9 3.5362338628 × 10 4
e r r 3 3.7480684521 × 10 22 4.5658214975 × 10 14 9.4009759870 × 10 6
e r r 4 1.6450633915 × 10 29 1.0280287798 × 10 18 1.2504949933 × 10 7
δ Q 2 F 9.0065954775 × 10 8 4.5031489857 × 10 5 2.6587061610 × 10 2
δ Q 2 a p p r F 9.0065954782 × 10 8 4.5031489903 × 10 5 2.6596462586 × 10 2
e r r 5 3.1278945183 × 10 16 7.5350125919 × 10 16 7.8338852224 × 10 16
e r r 6 3.4777636565 × 10 16 6.4524550180 × 10 14 1.3295575820 × 10 5
e r r 1 = o r t h 1 ( X 1 , X 2 ) F , e r r 2 = o r t h 2 ( X 1 , X 2 ) F ,
e r r 3 = o r t h 1 ( X 1 a p p r , X 2 a p p r ) F , e r r 4 = o r t h 2 ( X 1 a p p r , X 2 a p p r ) F ,
e r r 5 = o r t h 3 ( Q ˜ ) F , e r r 6 = o r t h 3 ( Q ˜ a p p r ) F
Table 8. Approximated perturbations of the elements of Q 2 and their approximations.
Table 8. Approximated perturbations of the elements of Q 2 and their approximations.
δ A F q ij | δ q ij | | δ q ij appr |
1.7832554500 × 10 10 q 14 4.9044000886 × 10 8 4.9044000819 × 10 8
q 24 5.0733955344 × 10 8 5.0733955468 × 10 8
q 34 5.5238446041 × 10 8 5.5238446008 × 10 8
q 44 9.0189822446 × 10 9 9.0189821929 × 10 8
8.9162772501 × 10 8 q 14 2.4521479462 × 10 5 2.4521479487 × 10 5
q 24 2.5365705574 × 10 5 2.5365705600 × 10 5
q 34 2.7618311270 × 10 5 2.7618311298 × 10 5
q 44 4.5102092035 × 10 6 4.5102092081 × 10 5
5.3497663500 × 10 5 q 14 1.4577251477 × 10 2 1.4582423281 × 10 2
q 24 1.4823481491 × 10 2 1.4828695707 × 10 2
q 34 1.6304869299 × 10 2 1.6310634063 × 10 2
q 44 2.9649988293 × 10 3 2.9661007106 × 10 3
Table 9. Convergence of the global bounds.
Table 9. Convergence of the global bounds.
k δ A F x 2 Number of
Iterations
x nonl 2
11 1.78326 × 10 10 9.03176 × 10 8 4 3.68455 × 10 7
10 1.78326 × 10 9 9.03170 × 10 7 4 3.68458 × 10 6
9 1.78326 × 10 8 9.03165 × 10 6 5 3.68480 × 10 5
8 1.78326 × 10 7 9.03122 × 10 5 6 3.68699 × 10 4
7 1.78326 × 10 6 9.02688 × 10 4 9 3.70916 × 10 3
6 1.78326 × 10 5 8.98346 × 10 3 17 3.96070 × 10 2
5 1.78326 × 10 4 8.54366 × 10 2 No convergence-
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Petkov, P.H. Componentwise Perturbation Analysis of the QR Decomposition of a Matrix. Mathematics 2022, 10, 4687. https://doi.org/10.3390/math10244687

AMA Style

Petkov PH. Componentwise Perturbation Analysis of the QR Decomposition of a Matrix. Mathematics. 2022; 10(24):4687. https://doi.org/10.3390/math10244687

Chicago/Turabian Style

Petkov, Petko H. 2022. "Componentwise Perturbation Analysis of the QR Decomposition of a Matrix" Mathematics 10, no. 24: 4687. https://doi.org/10.3390/math10244687

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop