1. Introductory Notes
The matrix sign function (MSF) is a fundamental concept in numerical linear algebra [1], with widespread applications in solving Lyapunov equations, algebraic Riccati equations, and computing spectral decompositions [2]. Given a square matrix $A\in\mathbb{C}^{n\times n}$ without purely imaginary eigenvalues, the MSF, denoted as $\operatorname{sign}(A)$, is formally defined via its Jordan form representation [3]:

$$\operatorname{sign}(A)=Z\begin{pmatrix}-I_{p}&0\\0&I_{q}\end{pmatrix}Z^{-1},\tag{1}$$
where $A=ZJZ^{-1}$ is the Jordan decomposition of $A$, $I_{p}$ is the identity matrix of dimension $p$, $I_{q}$ is the identity matrix of dimension $q$, and the block diagonal matrix $J$ contains the eigenvalues partitioned into those with negative real parts (the first $p$) and those with positive real parts (the remaining $q$). In addition, the MSF satisfies the nonlinear matrix equation [4]

$$M^{2}=I,\tag{2}$$

which provides a fundamental framework for designing iterative methods, where $I$ is the identity matrix of appropriate dimension and $M:=\operatorname{sign}(A)$.
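For reference, definition (1) can be prototyped directly in the Wolfram language. The following minimal sketch assumes a diagonalizable test matrix (genuine Jordan blocks would require the full Jordan form); matrixSign and the 2×2 example are illustrative, not code from this paper.

matrixSign[A_?MatrixQ] := Module[{vals, vecs, Z},
  {vals, vecs} = Eigensystem[N[A]];
  Z = Transpose[vecs];                                 (* columns are eigenvectors *)
  Z . DiagonalMatrix[Sign[Re[vals]]] . Inverse[Z]]     (* eigenvalue lambda -> sign(Re(lambda)) *)
A = {{3., 1.}, {2., -4.}};
S = matrixSign[A];
{Chop[S . S - IdentityMatrix[2]], Chop[S . A - A . S]} (* verifies M^2 = I and commutation with A *)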
Since its appearance, the MSF has been employed across various domains of scientific computing [5], graph theory, control theory [6], and linear algebra algorithms [7]. In the realm of differential equations, particularly those modeling physical phenomena, the MSF assists in decomposing matrices. This decomposition simplifies the solution process, making it easier to analyze and solve complex systems of differential equations.
The numerical computation of $\operatorname{sign}(A)$ has been extensively studied, leading to various iterative techniques, including Newton's scheme, the Newton–Schulz local solver, Halley's scheme, and the Padé family of iterations [8]:

$$X_{k+1}=\frac{1}{2}\left(X_{k}+X_{k}^{-1}\right),\tag{3}$$

$$X_{k+1}=\frac{1}{2}X_{k}\left(3I-X_{k}^{2}\right),\tag{4}$$

$$X_{k+1}=X_{k}\left(X_{k}^{2}+3I\right)\left(3X_{k}^{2}+I\right)^{-1}.\tag{5}$$

While classical schemes such as Newton's iteration (3) exhibit quadratic convergence, recent research has focused on developing higher-order methods to achieve superior efficiency. This paper proposes a novel iterative approach based on high-order rational approximations, improving upon the Padé iterations [9].
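For orientation, the classical steps (3)–(5) translate almost one-for-one into the Wolfram language used later in Section 4. The sketch below is illustrative only: the helper names and the 3×3 test matrix are ours, and since the Newton–Schulz step (4) is only locally convergent, only the Newton step is actually iterated.

newtonStep[X_] := (X + Inverse[X])/2;                            (* Eq. (3), NM2 *)
schulzStep[X_] := X . (3 IdentityMatrix[Length[X]] - X . X)/2;   (* Eq. (4); inversion-free, locally convergent *)
halleyStep[X_] := With[{id = IdentityMatrix[Length[X]]},
   X . (X . X + 3 id) . Inverse[3 X . X + id]];                  (* Eq. (5), HM3 *)
A = {{2., 1., 0.}, {0., -3., 1.}, {0., 0., 5.}};
FixedPoint[newtonStep, A, 50, SameTest -> (Norm[#1 - #2, "Frobenius"] < 10^-12 &)]
(* converges to sign(A): an upper-triangular matrix with diagonal {1, -1, 1} *)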
The iterative computation of the MSF has undergone significant advancements, particularly with the development of rational approximations and multipoint iteration strategies. The foundational Newton iteration provides a straightforward approach with global convergence for matrices without imaginary-axis eigenvalues [4]. However, its quadratic convergence rate limits its efficiency for large-scale applications. Related numerical approaches are discussed in [10,11].
To accelerate convergence, Padé approximations have been incorporated into iterative schemes. Kenney and Laub [12] introduced Padé iterations, which utilize rational approximations of the function $f(\xi)=(1-\xi)^{-1/2}$ to construct the following recurrence relation:

$$X_{k+1}=X_{k}\,P_{\ell m}\!\left(I-X_{k}^{2}\right)\,Q_{\ell m}\!\left(I-X_{k}^{2}\right)^{-1},$$

where $P_{\ell m}$ and $Q_{\ell m}$ denote the numerator and denominator of the $[\ell/m]$-Padé approximants of $f$. The choice of the parameters $\ell$ and $m$ influences convergence behavior, with specific cases such as $\ell=m$ and $\ell=m-1$ exhibiting global convergence.
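The scalar form of this family can be reproduced with the built-in PadeApproximant function; the sketch below is a minimal illustration (the helper padeStep is our name, not the paper's), and for ℓ = m = 1 it recovers Halley's step (5).

padeStep[l_, m_][xk_] := Module[{x, r},
  r = PadeApproximant[(1 - x)^(-1/2), {x, 0, {l, m}}];   (* [l/m] approximant of f *)
  xk (r /. x -> 1 - xk^2)]                               (* step: x -> x P(1 - x^2)/Q(1 - x^2) *)
padeStep[1, 1][2.]   (* 1.07692..., matching Halley's step x (x^2 + 3)/(3 x^2 + 1) at x = 2 *)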
Beyond the Padé iterations, high-order iterative methods have been explored to further enhance computational efficiency. Methods derived from root-finding schemes for scalar nonlinear equations, such as Halley's method, demonstrate cubic convergence.
A key advancement in this direction involves designing high-order iterations using conformal mappings and stability-enhancing transformations. For instance, the recent work of Jung and Chun [13] proposes a three-point iterative framework where weight functions are tuned to optimize the stability region and convergence order. Their approach constructs an iteration scheme conformally equivalent to an eighth-order polynomial, effectively balancing convergence speed and numerical stability.
Despite these advancements, further improvements are needed to construct iterative methods with both high-order convergence and computational robustness [14]. Motivated by these challenges [15], this paper introduces a sixth-order iterative method based on refined rational approximations and weight functions. Unlike classical methods such as Newton's iteration, which exhibits quadratic convergence, the proposed scheme achieves a convergence rate of order six, leading to improved computational efficiency. The methodology is constructed by extending a high-order nonlinear solver for simple roots to the matrix setting, providing a systematic framework for deriving efficient iterative schemes for computing the MSF. The global convergence behavior of the scheme is further supported by an analysis of attraction basins, offering insights into its applicability across a broad spectrum of matrices. These contributions collectively distinguish the approach from existing methods and establish its effectiveness in computing the MSF.
The structure of the remainder of this study is organized as follows. Section 2 explores the benefits of a high-order scheme and introduces a novel approach for solving scalar nonlinear equations, which is subsequently generalized to the matrix setting; an analytical investigation establishes the sixth-order convergence of the proposed solver. Section 3 investigates the global convergence behavior of the proposed method using attraction basins, alongside an assessment of its stability properties. Section 4 presents numerical experiments that substantiate the theoretical findings and further illustrate the effectiveness of the proposed methodology. Finally, Section 5 offers a comprehensive conclusion, summarizing the key contributions of this work.
2. An Accelerated Sixth-Order Procedure
We now turn our attention to the scalar analogue of Equation (2), which is given by

$$f(x)=x^{2}-1=0.$$

In this context, $f(x)=x^{2}-1$ corresponds to the scalar version of (2), whose solutions are $x=\pm 1$.
To enhance most classical approaches, such as (3)–(5) (for more details, see [16]), we propose a refined multi-step iteration scheme based on rational approximations and weight functions, denoted by (6). The structure of (6) has been designed intentionally for two reasons: first, to obtain a new method that does not belong to the Padé family; second, to compute the MSF efficiently with global convergence behavior, as will be discussed later in this work.
Theorem 1. Let $\theta$ be a simple root of the adequately smooth function $f$. If the starting approximation $x_{0}$ is chosen to be sufficiently close to $\theta$, then the procedure (6) tends to $\theta$ with sixth-order accuracy.

Proof. Let $\theta$ be a simple zero of the function $f$. Given that $f$ possesses sufficient smoothness, we perform a Taylor series expansion of $f(x_{k})$ and its derivative $f'(x_{k})$ around $\theta$, yielding the following expressions:

$$f(x_{k})=f'(\theta)\left(e_{k}+c_{2}e_{k}^{2}+c_{3}e_{k}^{3}+c_{4}e_{k}^{4}+c_{5}e_{k}^{5}+c_{6}e_{k}^{6}\right)+O\!\left(e_{k}^{7}\right),\tag{7}$$

as well as

$$f'(x_{k})=f'(\theta)\left(1+2c_{2}e_{k}+3c_{3}e_{k}^{2}+4c_{4}e_{k}^{3}+5c_{5}e_{k}^{4}+6c_{6}e_{k}^{5}\right)+O\!\left(e_{k}^{6}\right).\tag{8}$$

Here, the notation $e_{k}=x_{k}-\theta$ denotes the error at the $k$-th step, and the coefficients $c_{j}$ are given by

$$c_{j}=\frac{f^{(j)}(\theta)}{j!\,f'(\theta)},\qquad j\geq 2.$$
By utilizing Equations (7) and (8), we derive the following expression:

$$\frac{f(x_{k})}{f'(x_{k})}=e_{k}-c_{2}e_{k}^{2}+2\left(c_{2}^{2}-c_{3}\right)e_{k}^{3}+O\!\left(e_{k}^{4}\right).\tag{9}$$
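Expansions such as (7)–(9) are convenient to double-check symbolically; the following short Wolfram computation (a verification aid of ours, not part of the original derivation) reproduces (9) from the truncated series (7) and (8).

fSer  = fp (e + Sum[c[j] e^j, {j, 2, 6}]);           (* truncated Eq. (7), with fp standing for f'(theta) *)
fpSer = fp (1 + Sum[j c[j] e^(j - 1), {j, 2, 6}]);   (* truncated Eq. (8) *)
Series[fSer/fpSer, {e, 0, 3}]
(* result: e - c[2] e^2 + (2 c[2]^2 - 2 c[3]) e^3 + O[e]^4, matching Eq. (9) *)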
Substituting Equation (9) into the first substep defined in (6), we obtain the expansion labeled (10). Utilizing Equations (7)–(10), we then obtain (11). By substituting (11) into Equation (6) and performing a Taylor expansion with some simplifications, we deduce the expansion (12).
Now, by performing a further Taylor expansion about $\theta$ and incorporating Equation (6), we arrive at (13). Finally, using Equations (7) and (13), we obtain the refined expression (14).
Using (14), we can determine that (6) satisfies the final error equation

$$e_{k+1}=C\,e_{k}^{6}+O\!\left(e_{k}^{7}\right),\tag{15}$$

where the asymptotic error constant $C$ depends on the coefficients $c_{j}$. This error equation establishes the sixth order of convergence and completes the proof. □
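Convergence orders of this type can also be checked numerically through the computational order of convergence (COC). The following Wolfram sketch is a generic verification aid of ours, not part of the paper; because scheme (6) is not reproduced here, Halley's step (5) serves as a stand-in (reporting a COC of about 3, where scheme (6) would be expected to report about 6).

halley[x_] := x (x^2 + 3)/(3 x^2 + 1);    (* stand-in map; replace with scheme (6) *)
coc[step_, x0_, n_] := Module[{xs, es},
  xs = NestList[step, x0, n];
  es = Abs[xs - 1];                       (* errors against the root x = 1 *)
  Log[es[[-1]]/es[[-2]]]/Log[es[[-2]]/es[[-3]]]]
coc[halley, N[17/10, 200], 5]             (* approximately 3.0 for Halley *)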
The iterative scheme outlined in (6) can now be applied to solve (2). Proceeding with this approach leads to the matrix iteration (16), with the starting matrix

$$X_{0}=A.\tag{17}$$

In a similar manner, the reciprocal formulation (18) corresponding to Equation (16) is obtained by employing the same systematic approach.
The computational efficiency of iterative methods for determining the MSF depends largely on the order of convergence. Classical methods such as Newton's iteration (3) exhibit quadratic convergence, which, while sufficient for moderate-sized problems, becomes computationally expensive for large-scale complex matrices. High-order methods are designed to achieve faster convergence rates, reducing the number of iterations needed to achieve a given accuracy. This is especially useful when dealing with complex matrices, where each iteration involves matrix–matrix operations such as multiplications and inversions. By employing high-order iterations, the total computational cost can be reduced, making the method more practical for applications in numerical linear algebra.
Another crucial advantage of high-order methods is their enhanced numerical stability. Lower-order methods, such as Newton’s iteration, often require stabilization techniques such as scaling and squaring to maintain numerical accuracy, especially for matrices with eigenvalues close to the imaginary axis. High-order methods naturally mitigate these issues by achieving rapid convergence within fewer iterations, thereby reducing error accumulation and improving robustness. Moreover, the flexibility of high-order schemes allows for the incorporation of adaptive techniques, such as dynamically selecting iteration parameters based on the spectral properties of the matrix. This adaptability makes high-order methods particularly attractive for solving ill-conditioned problems and ensures reliable performance across a broad spectrum of computational tasks.
Theorem 2. Assume $X_{0}$ serves as a suitable initial guess and $A$ is an invertible matrix. Under these assumptions, the iterative scheme described by (18) (or equivalently (16)) converges to $M$, achieving a sixth-order rate of convergence.

Proof. It is recalled that the decomposition of the matrix $A$ is carried out using an invertible matrix $Z$ of identical dimensions, in conjunction with the Jordan block matrix $J$. This leads to the following factorization:

$$A=ZJZ^{-1}.\tag{19}$$

By leveraging this decomposition and conducting a meticulous structural analysis of the solver, an iterative scheme (20) is formulated for calculating the eigenvalues $\lambda_{i}$, transitioning from iteration $k$ to the subsequent iteration $k+1$, with the associated coefficient functions collected in (21).
From a theoretical standpoint, and upon performing appropriate simplifications, the iterative scheme delineated in (20) reveals that the eigenvalues asymptotically converge toward the limiting values $\pm 1$. More precisely, this convergence behavior is mathematically characterized by the following expression:

$$\lim_{k\to\infty}\lambda_{i}^{(k)}=\operatorname{sign}\!\left(\operatorname{Re}\left(\lambda_{i}\right)\right)=\pm 1.\tag{22}$$

Equation (22) encapsulates the asymptotic tendency of the eigenvalues to cluster around $\pm 1$ as the iterative process advances. With each successive iteration, the eigenvalues exhibit an increasingly tight convergence toward these limiting values. Having established the theoretical foundation for the method's convergence, we now shift our focus toward analyzing its rate of convergence. To facilitate this investigation, we proceed via the auxiliary relation (23).
Utilizing Equation (23) and recognizing that each iterate $X_{k}$ represents a rational function of $A$, and therefore maintains commutativity with $M$ in the same manner as $A$, the expression (24) can be formulated. Applying the 2-norm to (24), it is possible to determine that

$$\left\|X_{k+1}-M\right\|_{2}\leq\eta\left\|X_{k}-M\right\|_{2}^{6},\tag{25}$$

for a constant $\eta$ independent of $k$. This bound underscores that the iterative scheme attains a convergence rate of order six, contingent upon the selection of a suitably chosen initial matrix, such as the one specified in (17), as the starting point. □
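The diagonalization argument used above can be checked numerically: when the iteration map is a rational function of the iterate, the matrix step acts on the spectrum exactly as the scalar step acts on each eigenvalue. In the following sketch (a verification aid, not taken from the paper), Newton's map (3) stands in for (16), and the random 4×4 matrix is an arbitrary test case.

gM[X_] := (X + Inverse[X])/2;      (* matrix form of the map, Eq. (3) *)
gS[x_] := (x + 1/x)/2;             (* the same map acting on a scalar eigenvalue *)
SeedRandom[1];
A = RandomReal[{-5, 5}, {4, 4}];
Max[Abs[Sort[Eigenvalues[gM[A]]] - Sort[gS /@ Eigenvalues[A]]]]   (* approximately 0 *)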
Compared to existing rational approximation-based methods, our approach integrates a high-order nonlinear solver into the iterative framework, enhancing both stability and convergence properties. Moreover, while lower-order methods often require stabilization techniques such as scaling and squaring to handle matrices with eigenvalues near the imaginary axis, the proposed method naturally mitigates such issues due to its rapid convergence and inherent numerical robustness.
3. Attraction Basins and Stability
Understanding the convergence behavior of iterative methods is crucial when computing the MSF. One effective way to visualize and analyze this behavior is through attraction basins, which depict the regions in the complex plane where different initial guesses lead to convergence to particular solutions. Attraction basins provide insights into the stability, efficiency, and robustness of numerical methods [17].
Iterative methods for computing the MSF rely on successive approximations, where the choice of the starting value significantly affects the convergence trajectory. By plotting attraction basins, we can do the following:
Identify regions where the method converges rapidly.
Detect unstable zones where divergence or slow convergence occurs.
Compare different iterative schemes in terms of their efficiency.
Numerical stability is a crucial factor when selecting an iterative method. As analyzed in [18], attraction basin plots provide valuable insights into the robustness of a method against perturbations in the initial guess, the presence of fractal-like structures that indicate chaotic behavior in iterative dynamics, and the impact of scaling techniques on improving convergence regions.
A critical advantage of drawing attraction basins is their role in benchmarking different iterative methods. Methods with larger, well-connected basins typically offer superior convergence properties, while those with fragmented or irregular basins may suffer from instability.
Drawing attraction basins is a powerful tool for analyzing iterative methods that can be used to compute the MSF. These visualizations help assess convergence behavior, stability, and efficiency, guiding the selection and refinement of high-order iterative schemes. As we develop new iterative methods, attraction basin analysis remains essential in ensuring their practical effectiveness.
In this work, the techniques presented in (16) and (18) have been introduced with the specific aim of enhancing the attraction regions associated with such schemes in the context of solving the equation $x^{2}-1=0$. To provide a more thorough understanding, we investigate how the presented solvers exhibit global convergence and improved radii of convergence by illustrating their corresponding attraction regions over a square domain of the complex plane while resolving $x^{2}-1=0$. For this purpose, the domain is discretized into a grid of nodes, and the behavior of each point is evaluated by using it as an initial value. This allows us to determine whether the iteration converges or diverges. In cases of convergence, the points are shaded according to the number of iterations required to satisfy the stopping criterion.
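A minimal Wolfram sketch of this scanning procedure is given below; the window $[-2,2]\times[-2,2]$, the tolerance $10^{-2}$, the iteration cap, and the helper name basin are our assumptions for illustration, and Newton's map (3) stands in for (16) and (18).

basin[step_, res_: 256, kmax_: 50, tol_: 10^-2] := ArrayPlot[
  Table[Module[{z = N[x + I y], k = 0},
    While[Abs[z^2 - 1] > tol && k < kmax, z = step[z]; k++]; k],
   {y, -2, 2, 4/(res - 1)}, {x, -2, 2, 4/(res - 1)}],
  ColorFunction -> "SunsetColors", DataReversed -> True]
basin[(# + 1/#)/2 &]   (* basins for Newton's map (3); shade encodes the iteration count *)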
Figure 1 depicts the basins of attraction for (16) and (18). The results reveal that, for both methods, the convergence regions are large and the convergence behavior is global.
A similar line of reasoning to the one already discussed in [19] indicates the stability of the scheme.
Theorem 3. Let $A$ be an invertible matrix. Based on Equation (18), the sequence $\{X_{k}\}_{k\geq 0}$, initialized with $X_{0}$ as in (17), exhibits stability in the asymptotic sense.

Proof. To initiate the analysis, we consider a perturbed iterate $\tilde{X}_{k}=X_{k}+\Delta_{k}$ at the $k$-th iteration within the framework of the numerical solver. Further elaboration on this perturbation approach can be found in [20]. To systematically examine the effect of perturbations, the corresponding perturbed iterative relation is formulated for each computational step. At this stage, we take into account the assumption that $\Delta_{k}^{2}\approx 0$ for all $k\geq 1$, which holds under the framework of a first-order error analysis, provided that $\Delta_{k}$ remains sufficiently small. Based on this premise, and after performing a series of algebraic simplifications, an inequality is derived showing that the perturbation $\Delta_{k+1}$ remains bounded in terms of $\Delta_{k}$. Thus, the sequence $\{X_{k}\}$ generated by the method in (16) is stable. □
4. Numerical Examples
Here, we assess the performance of the proposed iterative solvers by evaluating their effectiveness across a diverse range of problem types. The entire implementation has been carried out in the Wolfram language (see [21,22]). A systematic approach has been adopted to address several computational aspects, including the precise detection of convergence. For the sake of clarity and coherence, the testing procedure is categorized into two distinct groups, following a methodology similar to that employed in [23]: the first category comprises tests involving real matrices, while the second focuses on complex matrices.
The iterative methods considered in the comparative analysis include the method given in Equation (3), referred to as NM2; the approach defined by (5), labeled as HM3; the procedure outlined in (16), indicated as PM61; the iteration specified in (18), designated as PM62; and the fourth-order method of Zaka Ullah et al. [14], designated as ZUM4.
For all Newton-type iterative schemes considered in this comparison, the initial matrix $X_{0}$ is chosen in accordance with the specification given in (17). The computational error at each iteration is evaluated via a residual-based formulation in which $\epsilon$ represents the predefined convergence threshold, serving as the stopping criterion for the iterative process.
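A minimal harness for this kind of comparison might look as follows; it is a sketch under the assumption of a relative-difference stopping rule (the paper's exact residual formulation is not reproduced here), and signSolve, the tolerance, and the 200×200 test size are illustrative choices, with the Newton step (3) as a placeholder method.

signSolve[step_, A_, eps_: 10^-6, kmax_: 200] := Module[{X = N[A], Xn, k = 0},
  While[k < kmax, Xn = step[X];
   If[Norm[Xn - X, "Frobenius"] <= eps Norm[Xn, "Frobenius"], X = Xn; Break[]];
   X = Xn; k++]; {X, k + 1}]
SeedRandom[123];
A = RandomReal[{-10, 10}, {200, 200}];
{sec, {S, its}} = AbsoluteTiming[signSolve[(# + Inverse[#])/2 &, A]];
{its, sec, Norm[S . S - IdentityMatrix[200], "Frobenius"]}   (* iterations, CPU seconds, residual *)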
Example 1. A set of twelve randomly generated real matrices is obtained by fixing the random seed via the SeedRandom command. Following their generation, the corresponding MSFs are computed and examined to facilitate a comparative analysis. These matrices are constructed within a fixed numerical range and encompass a sequence of increasing dimensions. All computations are carried out under the prescribed stopping threshold $\epsilon$.
Table 1 and Table 2 showcase the numerical results corresponding to Example 1, providing substantial evidence of the efficacy of the methods introduced in this study. Of particular note, the method PM61 enhances computational efficiency by reducing the total number of iterations needed to calculate the MSF. This improvement is reflected in a marked decrease in the average CPU time, measured in seconds, across the twelve randomly generated matrices of different sizes, and serves to compare the overall computational cost of the proposed scheme with that of the existing ones.
Example 2. Within this numerical investigation, the MSF is computed for a set of eight randomly generated complex matrices. The evaluation is conducted while adhering to the same stopping threshold $\epsilon$. The construction of these random matrices is illustrated in the Mathematica 13.3 code snippet presented below:
SeedRandom[789];
nu = 8;
Table[A[n1] = RandomComplex[{-100 - 100 I,
100 + 100 I}, {150 n1, 150 n1}];, {n1, nu}];
Table 3 and Table 4 furnish computational comparisons for Example 2, reinforcing the efficacy of the proposed solver in determining the MSF for eight randomly generated complex matrices. Consistent numerical experiments conducted across a diverse set of related test cases further corroborate these findings. Among the evaluated methods, the PM61 algorithm exhibits superior efficiency and robustness, outperforming its counterparts in terms of computational accuracy and convergence behavior.
The numerical experiments conducted here provide substantial evidence in support of the superior performance of the proposed iterative methods, particularly PM61, in computing the MSF. The comparative assessment across multiple test cases demonstrates that PM61 exhibits a reduction in iteration count, as observed in Table 1 and Table 3. For instance, in Example 1, the average number of iterations needed for PM61 to achieve convergence is 8.08, which is markedly lower than NM2 (21.25), HM3 (13.66), and ZUM4 (9.75). This efficiency advantage remains consistent in Example 2, where PM61 requires an average of 8.50 iterations compared to NM2 (22.62) and HM3 (14.37). Such a reduction in iteration count is crucial when dealing with large-scale matrices, as it directly translates to fewer matrix–matrix operations, thereby minimizing computational complexity and memory usage.
Beyond the iteration count, another critical metric in evaluating iterative solvers is their CPU execution time, as shown in Table 2 and Table 4. The PM61 method consistently outperforms NM2, HM3, and ZUM4 in terms of computational speed. In Example 1, PM61 achieves an average execution time of 1.329 s, which is 29.7% faster than NM2 (1.889 s) and 11.2% faster than ZUM4 (1.481 s). Similar trends are observed in Example 2, where PM61 exhibits an average runtime of 4.456 s, making it significantly more efficient than NM2 (6.026 s) and HM3 (5.393 s). The improved efficiency of PM61 is attributed to its higher-order convergence rate, which reduces the number of matrix computations required per iteration. Furthermore, the method demonstrates superior scalability, handling the larger test matrices with competitive execution times compared to alternative schemes.
A key advantage of PM61 lies in its numerical stability and robustness, particularly when applied to real and complex matrices of varying dimensions. Unlike lower-order methods such as NM2, which may require additional stabilization techniques (e.g., scaling and squaring), PM61 converges reliably across different problem instances without exhibiting erratic behavior. This robustness is evident in the consistent performance metrics recorded in both Example 1 and Example 2. Moreover, the method maintains a balanced trade-off between iteration count and per-iteration computational cost, ensuring that the efficiency gains are realized across a broad spectrum of matrix sizes.
In summary, the numerical analysis underscores the effectiveness of PM61 in computing the MSF. The method excels in minimizing the number of iterations, reducing computational time, and enhancing numerical stability, making it a highly favorable choice for large-scale applications.