1. Introduction
Throughout this paper, $\mathbb{R}^{m\times n}$ denotes the set of real $m\times n$ matrices. For a matrix $A$, $A^{T}$ is the transpose of $A$, $\mathrm{rank}(A)$ denotes the rank of $A$, $\|A\|_{2}$ is the spectral norm of $A$, and $\|A\|_{F}$ is the Frobenius norm of $A$. For a vector $a$, $\|a\|_{\infty}$ is its $\infty$-norm and $\|a\|_{2}$ its 2-norm. The notation $|A|$ denotes the matrix whose components are the absolute values of the corresponding components of $A$. For any matrix $A$, the following four equations uniquely define the Moore–Penrose inverse $X=A^{\dagger}$ of $A$ [1]:
$$AXA=A,\qquad XAX=X,\qquad (AX)^{T}=AX,\qquad (XA)^{T}=XA.$$
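As a quick numerical illustration (a sketch using NumPy's `pinv`; the test matrix is an arbitrary stand-in), the four Penrose equations can be verified directly:

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((5, 3))      # arbitrary test matrix
X = np.linalg.pinv(A)                # Moore-Penrose inverse of A

# X = A^+ is the unique matrix satisfying the four Penrose equations:
ok = [
    np.allclose(A @ X @ A, A),       # (1) A X A = A
    np.allclose(X @ A @ X, X),       # (2) X A X = X
    np.allclose((A @ X).T, A @ X),   # (3) (A X)^T = A X
    np.allclose((X @ A).T, X @ A),   # (4) (X A)^T = X A
]
print(all(ok))                       # True
```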
The generalized inverse studied in this paper is defined via a weight matrix and the orthogonal projection onto the null space of $C$; here the matrix $A$ need not have full rank, and $J$ is a signature matrix, i.e., a diagonal matrix of the form
$$J=\begin{pmatrix} I_{p} & 0\\ 0 & -I_{q}\end{pmatrix},\qquad p+q=m.$$
The generalized inverse originated from the equality constrained indefinite least squares problem (EILS), which is stated as follows [2,3,4,5]:
$$\min_{x}\,(b-Ax)^{T}J(b-Ax)\quad\text{subject to}\quad Cx=d,$$
where $A\in\mathbb{R}^{m\times n}$, $C\in\mathbb{R}^{s\times n}$, $b\in\mathbb{R}^{m}$, and $d\in\mathbb{R}^{s}$. The EILS problem has a unique solution, expressible in terms of the generalized inverse, under the following condition:
$$x^{T}A^{T}JAx>0\quad\text{for all nonzero }x\in\mathcal{N}(C).$$
Under this condition, (5) ensures the existence and uniqueness of the generalized inverse (see [2,6]). The generalized inverse has significant applications in the study of EILS algorithms, the analysis of large-scale structures, error analysis, perturbation theory, and the solution of the EILS problem [2,3,4,5,7,8,9,10]. The EILS problem was first introduced by Bojanczyk et al. [5]. We now review some of the detailed work on the perturbation analysis of this problem. The perturbation theory of the EILS problem was discussed by Wang [11] and extended by Shi and Liu [8] based on the hyperbolic MGS elimination method. Diao and Zhou [12] obtained the linearized estimate of the backward error of this problem. Later, Li et al. [13] investigated the componentwise condition numbers for the EILS problem. Recently, Wang and Meng [14] studied the condition numbers and normwise perturbation analysis of the EILS problem.
Componentwise perturbation analysis has received significant attention in recent years; see, e.g., [15,16,17,18,19]. It is worth studying because, if the perturbation in the input data is measured componentwise rather than by norms, the sensitivity of a function can be measured more accurately [15], which improves the accuracy and effectiveness of the computed EILS solution. Many authors have considered componentwise perturbation analysis, including for the least squares problem [16] and the weighted least squares problem [17]. In this article, we continue this line of research for the EILS problem. As a by-product of an intermediate result, we can also recover the componentwise perturbation bounds of the indefinite least squares problem.
The generalized inverse reduces to the $K$-weighted pseudoinverse when the weights are chosen suitably and $K$ has full row rank. This pseudoinverse was extended to the $MK$-weighted pseudoinverse by Wei and Zhang [6], who described its structure and uniqueness. An algorithm for computing it was developed by Eldén [20]. Wei [21] investigated its expression based on the GSVD. A perturbation identity for it was given by Gulliksson et al. [22]. The condition numbers for the $K$-weighted pseudoinverse and their statistical estimates were recently provided by Mahvish et al. [23].
The condition number is a well-known concept in numerical analysis that measures the worst-case sensitivity of a problem's solution to small perturbations in the input data (see [24,25,26] and the references therein). The normwise condition number [25] has the disadvantage of disregarding the scaling structure of the input and output data. To address this issue, mixed and componentwise condition numbers were introduced [26]. Mixed condition numbers use componentwise error analysis for the input data and normwise error analysis for the output data, whereas componentwise condition numbers use componentwise error analysis for both. In fact, due to rounding errors and data storage constraints, it is often more realistic to measure input errors componentwise rather than normwise. However, the condition numbers of the generalized inverse have not been studied so far. Motivated by this, we present explicit expressions for the normwise, mixed, and componentwise condition numbers of the generalized inverse, as well as their statistical estimates, given their importance in EILS research.
The rest of this manuscript is organized as follows. Section 2 provides some preliminaries that will be helpful in the subsequent discussions. With the intermediate result, i.e., the derivative of the generalized inverse, we recover the explicit expressions of the condition numbers for the solution of the EILS problem in Section 3. Section 4 presents the componentwise perturbation analysis for the EILS problem. In Section 5, we propose two algorithms for the normwise condition number, using the probabilistic spectral norm estimator [27] and the small-sample statistical condition estimation (SSCE) method [28], and a third algorithm for the mixed and componentwise condition numbers, also based on the SSCE method [28]. The efficiency of these algorithms is demonstrated through numerical experiments in Section 6.
2. Preliminaries
In this section, we introduce some definitions and important results that will be used in the sequel.
Firstly, we define the entrywise division between two vectors $v=(v_{1},\dots,v_{n})^{T}$ and $w=(w_{1},\dots,w_{n})^{T}$ by $v/w=(v_{1}/w_{1},\dots,v_{n}/w_{n})^{T}$, with the convention
$$\frac{v_{i}}{w_{i}}=0 \ \text{ if } v_{i}=w_{i}=0,\qquad \frac{v_{i}}{w_{i}}=\infty \ \text{ if } v_{i}\neq 0,\ w_{i}=0.$$
Following [1,26,29], the componentwise distance between $v$ and $w$ is defined by
$$d(v,w)=\left\|\frac{v-w}{w}\right\|_{\infty}=\max_{i}\frac{|v_{i}-w_{i}|}{|w_{i}|}.$$
Note that when all components of $w$ are nonzero, $d(v,w)$ gives the relative distance from $v$ to $w$ with respect to $w$, while it acts as an absolute distance on the zero components. We describe the distance between matrices $A,B\in\mathbb{R}^{m\times n}$ by $d(A,B)=d(\operatorname{vec}(A),\operatorname{vec}(B))$. In order to define the mixed and componentwise condition numbers, we also need, for a given $\delta>0$, the set
$$B(v,\delta)=\{w\in\mathbb{R}^{n} : |w_{i}-v_{i}|\le \delta |v_{i}|,\ i=1,\dots,n\}$$
and its matrix analogue.
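The componentwise distance and its $0/0$ and $x/0$ conventions translate directly into code. The helper below is a hypothetical illustration (the name `comp_distance` is ours); for matrices it can be applied to the flattened (vec'd) arrays:

```python
import numpy as np

def comp_distance(v, w):
    """Componentwise distance d(v, w) = || (v - w) ./ w ||_inf,
    with the convention 0/0 = 0 and x/0 = inf for x != 0."""
    v, w = np.asarray(v, dtype=float), np.asarray(w, dtype=float)
    diff = np.abs(v - w)
    out = np.zeros_like(diff)
    nz = w != 0
    out[nz] = diff[nz] / np.abs(w[nz])      # relative error where w_i != 0
    out[(~nz) & (diff != 0)] = np.inf       # x/0 with x != 0
    return out.max()

print(comp_distance([1.1, 2.0], [1.0, 2.0]))   # ~0.1 (relative distance)
```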
Definition 1 ([29]). Let $\aleph:\mathbb{R}^{p}\to\mathbb{R}^{q}$ be a continuous mapping defined on an open set $D\subseteq\mathbb{R}^{p}$, and let $v\in D$ be such that $\aleph(v)\neq 0$.
- (i) The normwise condition number of ℵ at v is given by
$$\kappa(\aleph,v)=\lim_{\delta\to 0}\ \sup_{0<\|\Delta v\|_{2}\le\delta\|v\|_{2}}\frac{\|\aleph(v+\Delta v)-\aleph(v)\|_{2}}{\|\Delta v\|_{2}}\cdot\frac{\|v\|_{2}}{\|\aleph(v)\|_{2}}.$$
- (ii) The mixed condition number of ℵ at v is given by
$$m(\aleph,v)=\lim_{\delta\to 0}\ \sup_{\substack{v+\Delta v\in B(v,\delta)\\ \Delta v\neq 0}}\frac{\|\aleph(v+\Delta v)-\aleph(v)\|_{\infty}}{\|\aleph(v)\|_{\infty}}\cdot\frac{1}{d(v+\Delta v,v)}.$$
- (iii) The componentwise condition number of ℵ at v is given by
$$c(\aleph,v)=\lim_{\delta\to 0}\ \sup_{\substack{v+\Delta v\in B(v,\delta)\\ \Delta v\neq 0}}\frac{d(\aleph(v+\Delta v),\aleph(v))}{d(v+\Delta v,v)}.$$
When the map ℵ in Definition 1 is Fréchet differentiable, the following lemma from [29] makes the computation of the condition numbers easier.

Lemma 1 ([29]). Under the assumptions of Definition 1, and supposing ℵ is Fréchet differentiable at v, we have
$$\kappa(\aleph,v)=\frac{\|D\aleph(v)\|_{2}\,\|v\|_{2}}{\|\aleph(v)\|_{2}},\qquad
m(\aleph,v)=\frac{\big\|\,|D\aleph(v)|\,|v|\,\big\|_{\infty}}{\|\aleph(v)\|_{\infty}},\qquad
c(\aleph,v)=\left\|\frac{|D\aleph(v)|\,|v|}{|\aleph(v)|}\right\|_{\infty},$$
where $D\aleph(v)$ stands for the Fréchet derivative of ℵ at v. To obtain the explicit expressions of the above condition numbers, we need some properties of the Kronecker product [30] between $X\in\mathbb{R}^{m\times n}$ and $Y$:
$$\operatorname{vec}(XZY)=(Y^{T}\otimes X)\operatorname{vec}(Z),\qquad (X\otimes Y)^{T}=X^{T}\otimes Y^{T},\qquad \operatorname{vec}(X^{T})=\Pi_{mn}\operatorname{vec}(X),$$
where the matrix $Z$ has a suitable dimension, and $\Pi_{mn}$ is the vec-permutation matrix, which depends only on the dimensions $m$ and $n$.
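Given an explicit Jacobian (Fréchet derivative) matrix, the three formulas of Lemma 1 are cheap to evaluate. The sketch below is a hypothetical illustration (the helper name `cond_numbers` is ours, and a linear map is used as a stand-in, since its Fréchet derivative is the matrix itself); it also checks the vec–Kronecker identity numerically:

```python
import numpy as np

def cond_numbers(J, v, fv):
    """Lemma 1: normwise, mixed, and componentwise condition numbers
    of a map with Jacobian J at v, where fv = f(v) (entries nonzero)."""
    kappa = np.linalg.norm(J, 2) * np.linalg.norm(v) / np.linalg.norm(fv)
    Jv = np.abs(J) @ np.abs(v)                       # |Df(v)| |v|
    m = np.linalg.norm(Jv, np.inf) / np.linalg.norm(fv, np.inf)
    c = np.linalg.norm(Jv / np.abs(fv), np.inf)
    return kappa, m, c

# Sanity check on f(v) = S v, whose Jacobian is S:
rng = np.random.default_rng(1)
S = rng.standard_normal((4, 4))
v = rng.standard_normal(4)
print(cond_numbers(S, v, S @ v))

# vec-Kronecker identity: vec(X Z Y) = (Y^T kron X) vec(Z)
vec = lambda M: M.reshape(-1, order="F")             # column-stacking vec
X, Z, Y = (rng.standard_normal(s) for s in [(3, 2), (2, 5), (5, 4)])
assert np.allclose(vec(X @ Z @ Y), np.kron(Y.T, X) @ vec(Z))
```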
Now, we present the following two lemmas, which will be helpful for obtaining condition numbers and their upper bounds.
Lemma 2 ([31], p. 174, Theorem 5). Let $S$ be an open subset of $\mathbb{R}^{m\times n}$, and let $F$ be a matrix function defined and $k$ times (continuously) differentiable on $S$. If $\operatorname{rank}(F(X))$ is constant on $S$, then $F^{\dagger}$ is $k$ times (continuously) differentiable on $S$, and
$$\mathrm{d}F^{\dagger}=-F^{\dagger}(\mathrm{d}F)F^{\dagger}+F^{\dagger}F^{\dagger T}(\mathrm{d}F^{T})(I-FF^{\dagger})+(I-F^{\dagger}F)(\mathrm{d}F^{T})F^{\dagger T}F^{\dagger}.$$

Lemma 3 ([1]). For any matrices $X$ and $Y$ with dimensions making the expressions well defined, we have $|X\otimes Y|=|X|\otimes|Y|$ and $\|X\otimes Y\|_{2}=\|X\|_{2}\|Y\|_{2}$.

3. Condition Numbers
First, we define a mapping $\phi$ from the data to the generalized inverse. Then, using Definition 1, we present the definitions of the normwise, mixed, and componentwise condition numbers for the generalized inverse as given in [32]. With the help of the $\operatorname{vec}$ operator and the Frobenius, spectral, and max norms, these definitions of the normwise, mixed, and componentwise condition numbers can be rewritten accordingly.
In the following, we find the expression of the Fréchet derivative of ϕ at u.

Lemma 4. Let the mapping ϕ be continuous. Then, the Fréchet differential of ϕ at u is as stated below.

Proof. Differentiating both sides of (2), we obtain
From ([
3], Theorem 2.2), we obtain
Thus, substituting (
20) into (
19) and differentiating both sides of the equation, we can deduce
Further, using (
9), we have
Noting (20), (2), and the identity above, the previous equation may be expressed as
Further, by (21) and the facts noted above, the previous equation may be simplified as follows:
Considering this, we obtain
Substituting this fact into (
23) implies
We can rewrite the above equation by using (
2) and (
20) as
By applying the “vec” operator to (
25), and using (
6) and (
7), we obtain
That is,
Thus, we have obtained the required result by using the definition of Fréchet derivative. □
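The differential of the Moore–Penrose inverse from Lemma 2, which underlies derivations like the one above, can be checked numerically by finite differences. This is a sketch with a random full-column-rank matrix (so the constant-rank hypothesis holds locally); the direction `E` and step size `t` are arbitrary choices:

```python
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((5, 3))          # full column rank, so the rank is
E = rng.standard_normal((5, 3))          # locally constant, as Lemma 2 requires
Ap = np.linalg.pinv(A)
I_m, I_n = np.eye(5), np.eye(3)

# Differential of the pseudoinverse in direction E (Lemma 2):
dAp = (-Ap @ E @ Ap
       + Ap @ Ap.T @ E.T @ (I_m - A @ Ap)
       + (I_n - Ap @ A) @ E.T @ Ap.T @ Ap)

t = 1e-7
fd = (np.linalg.pinv(A + t * E) - Ap) / t   # finite-difference estimate
print(np.linalg.norm(fd - dAp) / np.linalg.norm(dAp))  # small, O(t)
```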
Remark 1. Setting the weights appropriately and taking C of full row rank, we obtain the corresponding specialization, where the latter is just the result of ([23], Lemma 3.1), with which we can recover the condition numbers for the K-weighted pseudoinverse [23].

Using the straightforward results of Lemmas 1 and 4, we derive the following condition numbers for the generalized inverse.
Theorem 1. The normwise, mixed, and componentwise condition numbers for the generalized inverse defined in (11)–(13) are as stated below.

Next, we provide more easily computable upper bounds that reduce the cost of computing the above condition numbers. The tightness of these upper bounds will be demonstrated by numerical experiments in
Section 6.
Corollary 1. The upper bounds of the normwise, mixed, and componentwise condition numbers for the generalized inverse are as follows.

Proof. For any two matrices $X$ and $Y$, it is well known that $\|X\otimes Y\|=\|X\|\,\|Y\|$ for the spectral and Frobenius norms. With the help of Theorem 1 and (8), we obtain the first bound. Secondly, by using Lemma 3 and Theorem 1, we obtain the second bound, and the third follows in the same way. □
Remark 2. Consider the GHQR factorization [3] of A and C in (2) and (5), in which one factor is a $J$-orthogonal matrix $U$ (i.e., $U^{T}JU=J$) and the triangular factors are lower triangular and non-singular. Denoting by the corresponding submatrices of U and H those obtained by taking their first r columns, and putting all the above terms into (18), we obtain a computable expression.

Remark 3. We can obtain
using the
expression, where (
4) is the solution of EILS problem (
3). By differentiating (
4), we obtain
Thus, using (
20), we obtain
Substituting (
25) into the above equation and using (
9), we have
which together with (
20)–(
22) give
Noting (
24), the above equation can be rewritten as
Further, by (
20) and (
4), we have
By applying the “vec” operator to (
30), and using (
6) and (
7), we obtain
From the above result, we can recover the condition numbers of the EILS problem provided in [
3,
13,
14]. Applying the same procedure, we can also determine the condition numbers for the residual of the EILS problem.
4. Componentwise Perturbation Analysis
In this section, we derive a componentwise perturbation analysis of the augmented system for the EILS problem.
Let the perturbations of the data $A$, $C$, $b$, and $d$ satisfy componentwise bounds of relative size $\epsilon$ for a small $\epsilon>0$. Suppose that the perturbed augmented system is as stated above. Denote by $S$ and $f$ the coefficient matrix and right-hand side of the augmented system, and by $\Delta S$ and $\Delta f$ their perturbations.
When $A$ has full column rank and $C$ has full row rank, $S$ is invertible. If the spectral radius $\rho\big(|S^{-1}||\Delta S|\big)<1$, then $S+\Delta S$ is invertible. Clearly, the condition (32) implies (31). The following result [24] is important for Theorem 2.
Lemma 5. Consider a linear system $Sv=f$ and its perturbed system $(S+\Delta S)(v+\Delta v)=f+\Delta f$, where $v+\Delta v$ is the solution to the perturbed system. When the perturbations $\Delta S$ and $\Delta f$ are sufficiently small such that $S+\Delta S$ is invertible, the perturbation in the solution v satisfies
$$\Delta v=(S+\Delta S)^{-1}(\Delta f-\Delta S\,v),$$
which implies
$$|\Delta v|\le |S^{-1}|\big(|\Delta f|+|\Delta S||v|\big)+|S^{-1}||\Delta S||\Delta v|.$$
Furthermore, when the spectral radius $\rho\big(|S^{-1}||\Delta S|\big)<1$, we have
$$|\Delta v|\le \big(I-|S^{-1}||\Delta S|\big)^{-1}|S^{-1}|\big(|\Delta f|+|\Delta S||v|\big).$$

Now, we have the following bounds for the perturbations in the equality constrained indefinite least squares solution and residual.
Theorem 2. Under the above assumptions, for any $\epsilon$ satisfying the condition (32), when the componentwise perturbations of the data are of relative size $\epsilon$, the error in the solution and the error in the residual are bounded as in (34) and (35), respectively.

Proof. Since the condition (
32) implies (
31), applying (
33) in Lemma 5, we obtain
Finally, using the componentwise conditions on the perturbations and the explicit form of the inverse of the augmented matrix, the upper bounds (34) and (35) can be obtained. □
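The componentwise bound mechanism of Lemma 5 used in this proof can be illustrated numerically. This is a sketch on a small random linear system standing in for the augmented system; the matrix, right-hand side, and perturbation level are arbitrary choices:

```python
import numpy as np

rng = np.random.default_rng(3)
n = 6
S = rng.standard_normal((n, n)) + 5 * np.eye(n)     # well-conditioned system
f = rng.standard_normal(n)
v = np.linalg.solve(S, f)

eps = 1e-4                                          # componentwise perturbation level
dS = eps * np.abs(S) * rng.uniform(-1, 1, (n, n))   # |dS| <= eps |S|
df = eps * np.abs(f) * rng.uniform(-1, 1, n)        # |df| <= eps |f|
dv = np.linalg.solve(S + dS, f + df) - v

Sinv = np.linalg.inv(S)
G = np.abs(Sinv) @ np.abs(dS)
assert np.max(np.abs(np.linalg.eigvals(G))) < 1     # spectral radius condition
bound = np.linalg.solve(np.eye(n) - G,
                        np.abs(Sinv) @ (np.abs(df) + np.abs(dS) @ np.abs(v)))
print(np.all(np.abs(dv) <= bound))                  # True: the bound holds
```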
Furthermore, we can obtain the componentwise perturbation bounds of the indefinite least squares solution and its residual.
Remark 4. Assume that C is a zero matrix. Using the above notation, for any $\epsilon$ satisfying the spectral radius condition, if the componentwise perturbations of the data are of relative size $\epsilon$, then the errors in the solution and in the residual admit analogous bounds.

5. Statistical Condition Estimates
This section proposes three algorithms for estimating the normwise, mixed, and componentwise condition numbers of the generalized inverse. Algorithm 1 is based on the probabilistic condition estimator method [27], which has been used to estimate the normwise condition number of the K-weighted pseudoinverse [23], the ILS problem [33], constrained and weighted least squares problems [34], and the Tikhonov regularization of the total least squares problem [35]. Based on the SSCE method [28], we develop Algorithm 2 to estimate the normwise condition number; for details, see [23,33,36,37,38].
Algorithm 1: Probabilistic condition estimator for the normwise condition number.
Algorithm 2: Small-sample statistical condition estimation method for the normwise condition number.
1. Generate matrices with each entry drawn from $\mathcal{N}(0,1)$, and orthonormalize them by the modified Gram–Schmidt orthogonalization process; each resulting direction can be converted into the corresponding matrices by applying the unvec operation.
2. For each direction, compute the action of the Fréchet derivative.
3. Compute the absolute condition vector, where the square operation is applied to each entry and the square root is likewise applied componentwise.
4. Estimate the normwise condition number (26) from the absolute condition vector.
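The core of Algorithm 2 can be sketched as follows. This is a simplified illustration, assuming a user-supplied routine `apply_frechet` for the action of the Fréchet derivative (both helper names are ours); it is validated on a linear map, whose Fréchet derivative is the map itself:

```python
import numpy as np

def wallis(p):
    """Wallis factor omega_p ~ sqrt(2 / (pi * (p - 1/2))) used by SSCE."""
    return np.sqrt(2.0 / (np.pi * (p - 0.5)))

def ssce_normwise(apply_frechet, p, k=3, rng=None):
    """Small-sample statistical estimate of the absolute condition data.

    apply_frechet(z): action of the Frechet derivative on a direction
    z in R^p. Returns (norm estimate, absolute condition vector)."""
    if rng is None:
        rng = np.random.default_rng()
    Z = rng.standard_normal((p, k))
    Q, _ = np.linalg.qr(Z)                   # orthonormal directions z_1..z_k
    V = np.column_stack([apply_frechet(Q[:, i]) for i in range(k)])
    kappa_abs = (wallis(k) / wallis(p)) * np.sqrt((V ** 2).sum(axis=1))
    return np.linalg.norm(kappa_abs), kappa_abs

# Sanity check on f(u) = M u, whose derivative is M itself; the estimate
# should be close to ||M||_F (it is a statistical, not exact, quantity):
rng = np.random.default_rng(4)
M = rng.standard_normal((8, 10))
est, _ = ssce_normwise(lambda z: M @ z, p=10, k=3, rng=rng)
print(est / np.linalg.norm(M, "fro"))        # close to 1
```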
To estimate the mixed and componentwise condition numbers, we need the following SSCE method, which is from [
28] and has been applied to many problems (see, e.g., [
23,
32,
33,
34,
35]).