An Efficient Algorithm for Basic Elementary Matrix Functions with Specified Accuracy and Application

Qin, Huizeng; Lu, Youmin

doi:10.3390/appliedmath4020037

Open AccessArticle

An Efficient Algorithm for Basic Elementary Matrix Functions with Specified Accuracy and Application

by

Huizeng Qin

¹ and

Youmin Lu

^2,*

¹

The School of Mathematics and Statistics, Shandong University of Technology, Zibo 255049, China

²

Department of Mathematical and Digital Sciences, Bloomsburg University, Bloomsburg, PA 17815, USA

^*

Author to whom correspondence should be addressed.

AppliedMath 2024, 4(2), 690-708; https://doi.org/10.3390/appliedmath4020037

Submission received: 26 January 2024 / Revised: 4 March 2024 / Accepted: 19 April 2024 / Published: 5 June 2024

Download

Browse Figures

Versions Notes

Abstract

:

If the matrix function

f (A t)

posses the properties of

f (A t) = g (f ((\frac{t}{k} A))

, then the recurrence formula

f_{i - 1} = g (f_{i}), i = N, N - 1, \dots, 1, f (t A) = f_{0}

, can be established. Here,

f_{N} = f (A_{N}) = \sum_{j = 0}^{m} a_{j} A_{N}^{j}, A_{N} = \frac{t}{k^{N}} A

. This provides an algorithm for computing the matrix function

f (A t)

. By specifying the calculation accuracy p, a method is presented to determine m and N in a way that minimizes the time of the above algorithm, thus providing a fast algorithm for

f (A t)

. It is important to note that m only depends on the calculation accuracy p and is independent of the matrix A and t. Therefore,

f (A_{N})

has a fixed calculation format that is easily computed. On the other hand, N depends not only on A, but also on t. This provides a means to select t such that N is equal to 0, a property of significance. In summary, the algorithm proposed in this article enables users to establish a desired level of accuracy and then utilize it to select the appropriate values for m and N to minimize computation time. This approach ensures that both accuracy and efficiency are addressed concurrently. We develop a general algorithm, then apply it to the exponential, trigonometric, and logarithmic matrix functions, and compare the performance with that of the internal system functions of Mathematica and Pade approximation. In the last section, an example is provided to illustrate the rapid computation of numerical solutions for linear differential equations.

Keywords:

matrix function; efficient algorithms; high accuracy; linear differential equations

MSC:

68W25; 68U99

1. Introduction

Matrix exponent functions plays a crucial role in various fields, including ordinary differential equations and control theory. In particular, they are instrumental in analyzing general structural dynamic systems, represented by the initial value problem:

\begin{matrix} M x ″ + G x' + K x = r (t), x (0) = x_{0}, x' (0) = x_{1}, \end{matrix}

(1)

where

M, G

, and K are n by n square matrices, and

r (t)

denotes the external force applied. The introduction of dual variables of a Hamiltonian system,

\begin{matrix} p = M x' + G x / 2, \end{matrix}

(2)

results in the following system:

\begin{matrix} U' = H U + F (t), U (0) = U_{0}, \end{matrix}

(3)

where

\begin{matrix} U = (\begin{matrix} q \\ p \end{matrix}), H = (\begin{matrix} - M^{- 1} G / 2 & M^{- 1} \\ G M^{- 1} G / 4 - K & - M^{- 1} G / 2 \end{matrix}), \\ F (t) = (\begin{matrix} 0 \\ r (t) \end{matrix}), q = x, U_{0} = (\begin{matrix} x_{0} \\ M x_{1} + G x_{0} / 2 \end{matrix}) . \end{matrix}

(4)

The solution of (3) is provided as follows:

\begin{matrix} U (t) = e^{H t} U_{0} + \int_{0}^{t} e^{H (t - s)} F (s) d s . \end{matrix}

(5)

When a problem is oscillatory, the formal solution often involves both the sine and cosine of a matrix. Famous examples include the Schr

\overset{”}{o}

dinger equation in quantum mechanics:

\begin{matrix} i \frac{d ψ}{d t} = H (t) ψ, ψ (t_{0}) = ψ_{0}, \end{matrix}

(6)

where

H (t)

is a Hermitian operator and

ψ

is a complex wave function. If

H (t) = A

is a real and constant matrix, the solution of (6) is as follows:

\begin{matrix} ψ (t) = e^{- i A t} ψ_{0} = (cos (t A) - i sin (t A)) ψ_{0} . \end{matrix}

(7)

There are other examples where the computation of the sine and cosine of a matrix can be of interest. Wave equations provided by the generic second order system are written as follows:

\begin{matrix} x ″ + K x = F (t), x (0) = x_{0}, x' (0) = x_{1}, \end{matrix}

(8)

where the solution is provided by the variation of the constant method:

\begin{matrix} x (t) = cos (t A) x_{0} + {sin}_{1} (t A) x_{1} + \int_{0}^{t} {sin}_{1} ((t - s) A) F (s) d s, \end{matrix}

(9)

where

A = \sqrt{K}, {sin}_{1} (t A) = \int_{0}^{t} cos (s A) d s .

If A is invertible, then (9) can be written as follows:

\begin{matrix} x (t) = cos (t A) x_{0} + sin (t A) A^{- 1} x_{1} + A^{- 1} \int_{0}^{t} sin ((t - s) A) F (s) d s . \end{matrix}

(10)

For (5), providing

U (k τ) (k = 1, 2, \dots)

as accurately as possible is a concern for engineers and mathematicians, where

τ

is called the step. The specific method for this problem is as follows:

\begin{matrix} U_{k} = T U_{k - 1} + \int_{0}^{τ} e^{H s} F (k τ - s) d s, k = 1, 2, \dots, T = e^{H τ} . \end{matrix}

(11)

Similarly, for (10), providing the algorithm

x (k τ) (k = 1, 2, \dots)

as accurately as possible involves introducing vectors

x' = y

and the following algorithm:

\begin{matrix} x_{k} = cos (τ A) x_{k - 1} + sin (τ A) A^{- 1} y_{k - 1} + A^{- 1} \int_{0}^{τ} sin (s A) F (k τ - s) d s, \\ y_{k} = - A sin (τ A) x_{k - 1} + cos (τ A) y_{k - 1} + \int_{0}^{τ} cos (s A) F (k τ - s) d s, \\ x_{0} = x (0), y_{0} = x' (0) = x_{1} . \end{matrix}

(12)

To improve the accuracy of

U (k τ)

and

x (k τ)

, it is essential to develop an algorithm with high accuracy for computing

T = e^{H τ}, cos (τ A)

and

sin (τ A)

and the integral containing these matrix functions. This paper improves the algorithms developed for the computation of

e^{A}

and

cos A

in the existing literature, see [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15], and also discusses the computation of other trigonometric, hyperbolic, logarithmic, and related inverse triangular and inverse hyperbolic matrices.

2. The General Theorem of the Fast Algorithm for Matrix Functions

In [16], we addressed the challenge of rapidly calculating elementary functions with specified precision. The implementation method involves an analytic function

f (x)

specifying property,

f (x) = g (f (\frac{x}{k})), where k > 1 .

(13)

If

C_{0}

is an approximation as close as possible to

f (\frac{x}{k^{N}}),

with N as a positive integer, for example,

\begin{matrix} C_{0} = \sum_{j = 0}^{m} \frac{f^{(j)} (0) {(Δ x)}^{j}}{j!}, Δ x = \frac{x}{k^{N}}, m = 1, 2, \dots \end{matrix}

(14)

or

C_{0}

being the Padé approximation of

f (\frac{x}{k^{N}}),

then the following calculations are performed:

\begin{matrix} for (i = 0, i < N, i + +) C_{i + 1} = g (C_{i}) . \end{matrix}

(15)

Completing (14) and (15),

f (x) \approx C_{N}

is obtained. This method is also applicable to the calculation of matrix functions, where the variable x is replaced by the n by n square matrix

A .

Now, we have the following:

\begin{matrix} C_{0} = \sum_{j = 0}^{m} \frac{f^{(j)} (0) {(Δ A)}^{j}}{j!}, Δ A = \frac{1}{k^{N}} A, m = 1, 2, \dots \end{matrix}

(16)

or

C_{0}

is the Padé approximation of

f (Δ A) .

Our objective is to determine m and N to minimize the computation time under the condition that the truncation error and computational precision are

10^{- p}

. For the matrix function calculation, the coefficient calculation of (14) can be ignored, mainly focusing on the multiplication calculation of the matrix in (16) and (15). If the number of multiplications to complete

g (A)

is K, then the total number of times to complete (16) and (15) is roughly

\begin{matrix} q (m, N) = (m + K N) M_{n}, \end{matrix}

(17)

where

M_{n}

is the number of times to complete one multiplication of the n by n matrix. Based on the above discussion, we present a fast algorithm for the matrix function as follows.

Theorem 1.

Let

m_{p}

be an integer close to the minimum point of

h (m)

, where

\begin{matrix} h (m) = m + \frac{p K ln 10}{(m + 1) ln k} + \frac{K}{(m + 1) ln k} ln |\frac{f^{(m + 1)} (0)}{(m + 1)!}| \end{matrix}

(18)

and

\begin{matrix} N = [\frac{p ln 10}{(m + 1) ln k} + \frac{1}{(m + 1) ln k} ln |\frac{f^{(m + 1)} (0)}{(m + 1)!}| + \frac{ln ∥A∥}{ln k}] + 1 . \end{matrix}

(19)

(1)

C_{0} = \sum_{j = 0}^{m_{p}} \frac{f^{(j)} (0) {(Δ A)}^{j}}{j!}

, where

Δ A = \frac{1}{k^{N}} A,

or

C_{0}

is a Padé approximation of

f (Δ A) .

(2) for(

i = 0, i < N, i + +

)

C_{i + 1} = g (C_{i})

and

(A) \approx C_{N} .

The above algorithm is a fast algorithm for the general matrix function.

Proof.

With a truncation error of

10^{- p}

, for (16) to be satisfied,

\begin{matrix} ∥\frac{f^{(m + 1)} (0) {(Δ A)}^{m + 1}}{(m + 1)!}∥ = |\frac{f^{(m + 1)} (0)}{(m + 1)! k^{(m + 1) N}}| ∥A^{m + 1}∥ \leq |\frac{f^{(m + 1)} (0)}{(m + 1)! k^{(m + 1) N}}| {∥A∥}^{m + 1} < 10^{- p}, \end{matrix}

(20)

where

∥A∥

is a norm of

A .

So,

\begin{matrix} N > \frac{p ln 10}{(m + 1) ln k} + \frac{1}{(m + 1) ln k} ln |\frac{f^{(m + 1)} (0)}{(m + 1)!}| + \frac{ln ∥A∥}{ln k} . \end{matrix}

(21)

From (17), we obtain the following:

\begin{matrix} q (m, N) > g (m) = (m + \frac{p K ln 10}{(m + 1) ln k} + \frac{K}{(m + 1) ln k} ln |\frac{f^{(m + 1)} (0)}{(m + 1)!}| + \frac{K ln ∥A∥}{ln k}) M_{n} . \end{matrix}

(22)

Note that the minimum point of

g (m)

has nothing to do with

\frac{K ln ∥A∥}{ln k}

. By making

q (m, N)

as small as possible, the algorithm obtained by (1) and (2) is a fast algorithm, thus completing the proof of the theorem. □

Remark 1.

In the specific calculation,

∥A∥

can be replaced by the approximate eigenvalue with the maximum modulus in this paper.

\begin{matrix} |\frac{f^{(m + 1)} (0)}{(m + 1)! k^{(m + 1) N}}| {∥A∥}^{m + 1} < 10^{- p} . \end{matrix}

(23)

3. Fast Algorithm for $e^{A}, \cos A, \sin A, \sinh A,$ and $\cosh A$

Suppose

λ = ∥A∥

is a norm for A. If we let

A = λ A_{0},

then

∥A_{0}∥ \leq 1

. From (19), we can obtain the following:

\begin{matrix} N_{p} = [\frac{p ln 10}{(m + 1) ln k} + \frac{1}{(m + 1) ln k} ln |\frac{f^{(m + 1)} (0)}{(m + 1)!}| + \frac{ln λ}{K ln k}] + 1 . \end{matrix}

(24)

In order to reduce the length of the article, the algorithms for several matrix functions are compiled into a table.

Theorem 2.

For

e^{A}, cos A, sin A, cosh A

, and

sinh A,

we devised the following fast algorithm using Theorem 1.

Remark 2.

It is worth noting that

m_{p}

in Table 1 only depends on the calculation accuracy p and the matrix function, and is independent of matrix A.

Now, we utilize the general fast algorithm to specifically demonstrate the correctness of the algorithms in Table 1. The algorithms in Table 1 are straightforward; they involve placing the specific functions f and g into the general algorithm, and the tricky part lies in deriving the formulas of

m_{p}

and

N_{p} .

We calculate the formulas for

m_{p}

and

N_{p}

for the function

e^{A}

for

12 \leq p \leq 36

here. In fact, the graphing of the function of

h (m)

supports our results for all values of

p .

The derivation of

m_{p}

and

N_{p}

for the other functions is very similar.

(1) For the function

e^{A},

we have

g (u) = u^{2}, k = 2, K = 1,

and

\begin{matrix} h (m) = m + \frac{p ln 10}{(m + 1) ln 2} - \frac{1}{(m + 1) ln 2} ln (m + 1)! . \end{matrix}

(25)

Using Sterling’s approximation formula, we obtain the following:

ln ((m + 1)!) = \frac{1}{2} ln (2 π) + \frac{1}{2} ln (m + 1) + (m + 1) ln (m + 1) - m - 1 + O (\frac{1}{m + 1}) .

Now, implementing it into (25), we obtain the following:

h (m) \approx m + \frac{p ln 10}{(m + 1) ln 2} - \frac{ln (2 π)}{2 (m + 1) ln 2} - \frac{ln (m + 1)}{2 (m + 1) ln 2} + \frac{1}{ln 2} .

Let

q (x) = x + \frac{p ln 10}{(x + 1) ln 2} - \frac{ln (2 π)}{2 (x + 1) ln 2} - \frac{ln (x + 1)}{2 (x + 1) ln 2} + \frac{1}{ln 2} .

Taking its derivative with respect to

x,

one obtains

q' (x) = 1 - \frac{p ln 10}{{(x + 1)}^{2} ln 2} + \frac{ln (2 π)}{2 {(x + 1)}^{2} ln 2} + \frac{ln (x + 1)}{2 {(x + 1)}^{2} ln 2} - \frac{1}{2 {(x + 1)}^{2} ln 2} .

To determine extreme values, we set it to 0 and multiply both sides by

{(m + 1)}^{2} :

{(x + 1)}^{2} - \frac{1}{ln 2} (x + 1) + \frac{ln (x + 1)}{2 ln 2} - \frac{2 p ln 10 - ln (2 π) + 1}{2 ln 2} = 0 .

Using simple algebra, we obtain the following:

{(x + 1)}^{2} > \frac{(m + 1) - ln (m + 1) + (m + 1) - ln (2 π)}{2 ln 2} + \frac{2 p ln 10 + 1}{2 ln 2} > \frac{2 p ln 10 + 1}{2 ln 2} .

Using

p = 12,

we obtain a lower bound for

x :

x > \sqrt{\frac{2 p ln 10 + 1}{2 ln 2}} - 1 = 6.37 .

Thus, we have a lower bound for m:

m \geq 6 .

We also can solve the quadratic equation to get

m + 1 = \frac{1}{ln 2} + \sqrt{\frac{1}{4 {ln}^{2} 2} + \frac{8 p ln 10 - 4 ln (2 π) + 4}{8 ln 2}} .

Now, using the smallest value for m obtained and

p = 36,

we can obtain our first upper bound:

m < - 1 + \frac{1}{ln 2} + \sqrt{\frac{1}{4 {ln}^{2} 2} - \frac{ln 7}{2 ln 2} + \frac{276 ln 10 - 4 ln (2 π) + 4}{8 ln 2}} \approx 11.31 .

We can again use the values of m and

p = 12

to update the lower bound of m as follows:

m > - 1 + \frac{1}{ln 2} + \sqrt{\frac{1}{4 {ln}^{2} 2} + - \frac{ln 12}{2 ln 2} + \frac{96 ln 10 - 4 ln (2 π) + 4}{8 ln 2} \approx 6.61} .

Now, our new lower bound for m is 7. With this small range of values for m, by inspecting the formula for the values of m, we can see that the value of m is very close to

\frac{p ln 10}{ln 2}

, and we can safely claim that m

= [\sqrt{\frac{9 p ln 10}{8 ln 2}}]

. Our following numerical calculation supports this claim completely.

Case (a):

p = 12 .

Using the formula obtained by the quadratic formula and the upper bound

m = 11

, we obtain a new lower bound:

m \geq 6.63 .

Using the formula obtained by the quadratic formula and lower bound

m = 7,

we can also obtain the upper bound:

m \leq 6.63 .

Using our claim

\frac{9 p ln 10}{8 ln 2},

we obtain

m = 6.70 .

This clearly matches the lower and upper bound quite well.

Case (b):

p = 18 .

Using the formula obtained by the quadratic formula and the upper bound

m = 11

, we obtain a new lower bound:

m \geq 8.05 .

Using the formula obtained by the quadratic formula and lower bound

m = 7,

we can also obtain the upper bound:

m \leq 8.07 .

Using our claim

\frac{9 p ln 10}{8 ln 2},

we obtain

m = 8.20 .

These values match very well for taking the closest integer.

Case (c):

p = 24 .

Using the formula obtained by the quadratic formula and the upper bound

m = 11

, we obtain a new lower bound:

m \geq 9.27 .

Using the formula obtained by the quadratic formula and lower bound

m = 7,

we can also obtain the upper bound:

m \leq 9.28 .

Using our claim

\frac{9 p ln 10}{8 ln 2},

we obtain

m = 9.47 .

Case (d):

p = 36 .

Using the formula obtained by the quadratic formula and the upper bound

m = 11

, we obtain a new lower bound:

m \geq 11.29 .

Using the formula obtained by the quadratic formula and lower bound

m = 7,

we can also obtain the upper bound:

m \leq 11.37 .

Using our claim

\frac{9 p ln 10}{8 ln 2},

we obtain

m = 11.60 .

Using

m_{p} = \sqrt{\frac{9 p ln 10}{8 ln 2}},

we can obtain

\begin{matrix} N_{p} = [\frac{p ln 10}{(m_{p} + 1) ln 2} - \frac{ln (m_{p} + 1)!}{(m_{p} + 1) ln 2} + \frac{ln λ}{ln 2}] + 1 . \end{matrix}

(26)

(2) For

\{\begin{matrix} sin A \\ sinh A \end{matrix}\},

we have

g (u) = 3 u \mp 4 u^{3}, k = 3, K = 2

and

\begin{matrix} h (m) = m + \frac{2 p ln 10}{(2 m + 2) ln 3} - \frac{2 ln (2 m + 2)!}{(2 m + 2) ln k} . \end{matrix}

(27)

In this case, for

p = 12, 16, 24,

and 32, the corresponding integer points for

h (m)

to achieve its minimum value are

3, 3, 5,

and 6. Again, it can be estimated that

\begin{matrix} m_{p} = \sqrt{\frac{3 p ln 10}{5 ln 3}}, N_{p} = [\frac{p ln 10}{(2 m_{p} + 2) ln 3} - \frac{ln (2 m_{p} + 2)!}{(2 m_{p} + 2) ln 3} + \frac{ln λ}{ln 3}] + 1 . \end{matrix}

(28)

(3) For

\{\begin{matrix} cos A \\ cosh A \end{matrix}\},

there are two cases:

Case (I):

g (u) = 4 u^{3} - 3 u, k = 3, K = 2,

and (29) hold. Compared to I, the values in the second part of

C = I + \sum_{j = 1}^{m_{p}} \frac{{(- B)}^{j}}{(2 j)!}

are very small, so we may replace N with

N - 1

, and

m_{p}

with

m_{p} + 1

.

Case (II):

g (u) = u^{2} \mp 1, k = 2, K = 1

and

\begin{matrix} h (m) = m + \frac{p ln 10}{(2 m + 1) ln 2} - \frac{1}{(2 m + 1) ln 2} ln (2 m + 1)! . \end{matrix}

(29)

Similarly, for

p = 12, 16, 24,

and 32, the corresponding integer points for

h (m)

to achieve its minimum value are

4, 5, 6,

and 7. Consequently, we can determine the following:

\begin{matrix} m_{p} = \sqrt{\frac{3 p ln 10}{5 ln 2}}, N_{p} = [\frac{p ln 10}{(2 m_{p} + 1) ln 2} - \frac{ln (2 m_{p} + 1)!}{(2 m_{p} + 1) ln 2} + \frac{ln λ}{ln 2}] + 1 . \end{matrix}

(30)

The effectiveness of the algorithms is displayed in Table 1 by implementing numerical computations in Mathematica.

N_{p}

and

m_{p}

are replaced with

N_{p, μ} = μ N_{p}, m_{p, μ}

in Table 1, defined correspondingly by

\begin{matrix} m_{p, μ} = min_{∥B^{j} / j!∥ < 10^{- p}} j, B = ∥\frac{A}{2^{N_{p, μ}}}∥ for e^{A}, \\ m_{p, μ} = min_{∥B^{j + 1 / 2} / (2 j + 1)!)∥ < 10^{- p}} j, B = ∥\frac{A^{2}}{3^{2 N_{p, μ}}}∥ for sin A, \\ (I) m_{p, μ} = min_{∥B^{j} / (2 j)!∥ < 10^{- p}} j, B = ∥\frac{A^{2}}{3^{2 N_{p, μ}}}∥ for cos A, \\ (II) m_{p, μ} = min_{∥B^{j} / (2 j)!∥ < 10^{- p}} j, B = ∥\frac{A^{2}}{2^{2 N_{p, μ}}}∥ for cos A, \end{matrix}

(31)

where one has

N_{p, 1} = N_{p}

and

m_{p} = m_{p, 1} .

For comparison, let A take the following form:

A = U D U^{- 1},

(32)

where U is a randomly generated n by n matrix,

D_{a} = D i a l (d_{1}, d_{2}, \dots, d_{n})

is a randomly generated n by n diagonal matrix with

d_{i} \in (a, b), i = 1, 2, \dots, n

, so

f (A) = U f (D) U^{- 1},

f (D) = D i a l (f (d_{1}), f (d_{2}), \dots, f (d_{n})) .

In all the examples in this paper, we agree on the relative error as follows:

\begin{matrix} e r r (g) (A) = \frac{{∥f (A) - U f (D) U^{- 1}∥}_{\infty}}{∥U f (D) U^{- 1}∥} . \end{matrix}

(33)

Let

A_{i} (i = 1, 2, \dots, M)

be M matrices produced by (32). From Figure 1, it is evident that the calculation accuracy is essentially the same for

μ = 1, \frac{2}{3}, \frac{4}{3}

, but

μ = 1

provides the highest efficiency. The double-angle formula is notably more efficient and accurate than the triple-angle formula for

cos A

. Henceforth, we will exclude the case that utilizes the triple-angle formula for

cos A

.

Now let us juxtapose the algorithms presented in Table 1 with the internal functions and commonly used algorithms found in mathematical software. For ease of reference, let

e r r (g)

denote the calculation error of matrix function

f (A_{i}) (i = 1, 2, \dots, M)

, where a blank g implies the utilization of the algorithms in Table 1, Table 2 and Table 3 of this article. If

g = S y m

,

P a d e

,

\dots,

it signifies calculation according to the internal function, Padé approximation,

\dots,

respectively. When

p = 16,

m_{p} = 7, 4, 3

for

e^{A}, cos A (cosh A), sin A (sinh A),

respectively. We denote the Padé approximation of

sin x

as

r_{66} (x)

and the Padé approximation of

e^{x}

and

cos x

as

r_{88} (x) .

In Table 1, the Padé approximation of

e^{A}, cos A (cosh A), sin A (sinh A)

is as follows:

\begin{matrix} r_{88} (x) = \frac{1 + \frac{x}{2} + \frac{x^{2}}{60} + \frac{x^{3}}{60} + \frac{x^{4}}{624} + \frac{x^{5}}{9360} + \frac{x^{6}}{205920} + \frac{x^{7}}{7207200} + \frac{x^{8}}{518918400}}{1 - \frac{x}{2} + \frac{x^{2}}{60} - \frac{x^{3}}{60} + \frac{x^{4}}{624} - \frac{x^{5}}{9360} + \frac{x^{6}}{205920} - \frac{x^{7}}{7207200} + \frac{x^{8}}{518918400}} for e^{x}, \\ r_{88} (x) = \frac{1 \mp \frac{260735 x^{2}}{545628} + \frac{4375409 x^{4}}{141863280} \mp \frac{7696415 x^{6}}{1310867072} + \frac{80737373 x^{8}}{23594700729600}}{1 \pm \frac{12079 x^{2}}{545628} + \frac{34709 x^{4}}{141863280} \pm \frac{109247 x^{6}}{65540835360} + \frac{11321 x^{8}}{1814976979200}} for \{\begin{matrix} cos x \\ cosh x \end{matrix}\}, \\ r_{66} (x) = \frac{x \mp \frac{7 x^{3}}{60} + \frac{x^{5}}{600}}{1 \pm \frac{x^{2}}{20} + \frac{x^{4}}{600} \pm \frac{x^{6}}{14400}} for \{\begin{matrix} sin x \\ sinh x \end{matrix}\} . \end{matrix}

(34)

For the algorithms in Table 1, the

e r r (g) (A)

of random matrix from order 1 to 800 is as follows.

It is evident from Figure 2 that, for

e^{A}, cos A,

and

sin A

, the accuracy obtained by the three methods is essentially the same. However, when the matrix order is relatively high, the Taylor expansion method presented in Table 1 outperforms the others. In particular, for cos and

sin A

, the calculation time of the internal functions is approximately eight times longer than that of the algorithms in Table 1. Numerical calculations demonstrate that while the calculation speed of

cos A

and

sin A

using the internal functions

R e (e^{i A}) = cos A

and

I m (e^{i A}) = sin A

is faster than that of the internal functions

cos A

and

sin A

, their calculation accuracy is relatively poor.

For

cosh A

and

sinh A

, the internal functions can be calculated by utilizing the following equation:

\begin{matrix} cosh A = \frac{1}{2} (e^{A} + e^{- A}), sinh A = \frac{1}{2} (e^{A} - e^{- A}) . \end{matrix}

(35)

Our experiment provides the following results for

e r r (g) (A)

of random matrices with orders from 1 to

800 .

It is evident from Figure 3 that using (35) not only doubles the calculation time but also makes the calculation error substantial. Furthermore, by observing Figure 2 and Figure 3, it becomes apparent that the algorithms in Table 1 outperform the Padé approximation algorithm.

4. Fast Algorithm for $\tan A, \tanh A, \sec A$ , and $sechA$

Now let us contemplate the swift computation of

tan A, tanh A

,

sec A

, and

s e c h A

. Although these matrix functions can be computed using the following approach, we aim to directly compute them, as demonstrated in the previous section.

\begin{matrix} tan A = sin A {(cos A)}^{- 1}, sec A = {(cos A)}^{- 1}, \\ tanh A = sinh A {(cosh A)}^{- 1} = (e^{A} - e^{- A}) {(e^{A} + e^{- A})}^{- 1}, \\ s e c h A = {(cosh A)}^{- 1} = 2 {(e^{A} + e^{- A})}^{- 1} . \end{matrix}

(36)

While the Padé approximation significantly contributes to matrix function computation, the numerical results in the previous section indicate that, based on the algorithm presented in this paper, neither the calculation accuracy nor the calculation speed is predominant. Therefore, this section does not adopt it.

Theorem 3.

For

tan A, sec A, tanh A

, and

s e c h A,

we have the following fast algorithm based on Theorem 1.

In Table 2, for

σ (j)

and

{\hat{E}}_{j}

, we have the following recursive formula [1]:

\begin{matrix} σ (2 n) = \frac{4}{4 n + 1} \sum_{l = 1}^{n - 1} {σ (l) σ (2 n - l) + \frac{2}{4 n + 1} σ}^{2} (n), n = 1, 2, \dots, \\ σ (2 n - 1) = \frac{4}{4 n - 1} \sum_{l = 1}^{n - 1} σ (l) σ (2 n - 1 - l), n = 2, 3, \dots \\ {\hat{E}}_{n} = - \sum_{k = 0}^{n - 1} \frac{{\hat{E}}_{k}}{(2 n - 2 k)!}, {\hat{E}}_{0} = 1, σ (1) = \frac{1}{6} . \end{matrix}

(37)

Using the algorithms in Table 2, our experiments generated the following data for the

e r r (g) (A)

of random matrices from order 1 to 200.

Here,

S y m

indicates that matrix functions

tan A

,

tanh A

,

sec A

, and sech

A,

expressed by

sin A {(cos A)}^{- 1}

,

e^{A} - e^{- A} {(e^{A} + e^{- A})}^{- 1}

,

{(cos A)}^{- 1}

and

2 {(e^{A} + e^{- A})}^{- 1},

respectively, are computed using the internal functions. As observed in Figure 4, the algorithm’s computation speed in Table 2 surpasses that of the internal function, particularly for

tan A

and

sec A

, where the speed is increased by 8–11 times compared to the internal function, while maintaining consistent accuracy is consistent. When the order n of the matrix is relatively large, for

tanh A

and

sec h A,

the calculation speed is twice that of the internal function, yet the precision remains the same as the internal function.

Let us consider implementing our algorithms for calculating the functions

cot A, cot A

,

csc A

, and cschA. By

\begin{matrix} cot (A t) \approx A^{- 1} - 2 \sum_{j = 0}^{m} σ (j + 1) A^{2 j + 1}; (o < |A| < π, z \neq 0), \\ \coth (A t) \approx A^{- 1} + 2 \sum_{j = 0}^{m} {(- 1)}^{j} σ (j + 1) A^{2 j + 1} (0 < |A| < π), \\ csc (A t) \approx A^{- 1} + 2 \sum_{j = 0}^{m} σ (j + 1) (1 - \frac{2}{4^{j + 1}}) A^{2 j + 1} (0 < |A| < π), \\ csch (A t) \approx A^{- 1} - 2 \sum_{j = 0}^{m} {(- 1)}^{j} σ (j + 1) (1 - \frac{2}{4^{j + 1}}) A^{2 j + 1} (0 < |A| < π), \end{matrix}

(38)

we see that when

∥A∥

is small, the norm of the inverse of A will be large, so the calculation using (38) will produce large errors. Therefore, we must utilize the following method:

\begin{matrix} cot A = A^{- 1} {cot}_{1} A, coth A = A^{- 1} {coth}_{1} A, \\ csc A = A^{- 1} {csc}_{1} A, csch A = A^{- 1} {csch}_{1} A, \end{matrix}

(39)

where

\begin{matrix} {cot}_{1} A \approx I - 2 \sum_{j = 1}^{m + 1} σ (j) A^{2 j}; (|z| < π, z \neq 0), \\ \coth_{1} A \approx I - 2 \sum_{j = 1}^{m + 1} {(- 1)}^{j} σ (j) A^{2 j} (|z| < π, z \neq 0), \\ {csc}_{1} A \approx I + 2 \sum_{j = 1}^{m + 1} σ (j) (1 - \frac{2}{4^{j}}) A^{2 j} (|z| < π, z \neq 0), \\ {csch}_{1} A \approx I + 2 \sum_{j = 1}^{m + 1} {(- 1)}^{j} σ (j) (1 - \frac{2}{4^{j}}) A^{2 j} (|z| < π, z \neq 0) . \end{matrix}

(40)

Theorem 4.

Using Theorem 1, we have the following algorithm for

cot A, coth A, csc A,

and

sec h A

.

Implementation of the algorithms in Table 3 provides the following results for the matrix functions in (39).

Here,

S y m

signifies that matrix functions

cot A

,

coth A

,

csc A

, and

sec h A

are calculated using the internal functions with the regular formulas

cos A {(sin A)}^{- 1}

,

(e^{A} + e^{- A}) {(e^{A} - e^{- A})}^{- 1}

,

{(sin A)}^{- 1}

, and

2 {(e^{A} - e^{- A})}^{- 1},

respectively. The results in Figure 5 are essentially consistent with the corresponding results in Figure 2.

5. Fast Algorithm for Logarithmic Matrix Function $ln A$ and Some Related Functions

People consider various algorithms [17,18,19,20,21,22] for logarithmic matrix function

ln A .

One of the most effective methods we noticed was the following algorithm. The standard way of dealing with this problem is to use the square root operator repeatedly to bring A near the identity:

\begin{matrix} ln A = 2^{k} ln A_{k}, A_{k} = A^{1 / 2^{k}} . \end{matrix}

(41)

As k increases,

A^{1 / 2^{k}}

\to I

, so for sufficiently large k, we can apply a direct method to

A^{1 / 2^{k}} .

This procedure for the logarithm was introduced by Kenney and Laub [19]. The specific method is to choose k such that

\begin{matrix} ∥A^{1 / 2^{k}} - I∥ < α < 1, \end{matrix}

(42)

and usually, k is selected from 8. We slightly optimize the method as follows:

\begin{matrix} ln A = ln λ \cdot I + 2^{k} ln A_{k}, A_{k} = A_{0}^{1 / 2^{k}}, A_{0} = \frac{1}{∥A∥} A, \end{matrix}

(43)

and

\begin{matrix} ∥A_{0}^{1 / 2^{k}} - I∥ < α < 1, \end{matrix}

(44)

where k is usually selected from 4.

For

ln x

, the algorithm that converges faster is

\begin{matrix} ln x = ln \frac{1 - y}{1 + y} \approx - 2 \sum_{j = 1}^{m} \frac{y^{2 j + 1}}{2 j + 1}, \end{matrix}

(45)

where

y = \frac{1 - x}{1 + x} .

For

ln A_{k}

, we have

\begin{matrix} ln A_{k} = ln (I - B) - ln (I + B) \approx - 2 \sum_{j = 1}^{m} \frac{B^{2 j}}{2 j + 1}, \end{matrix}

(46)

where

B = (I - A_{k}) {(I + A_{k})}^{- 1} .

Applying the Euler–Abel transformation [21], we obtain the following accelerated series:

\begin{matrix} ln A_{k} \approx \frac{1}{3} B F^{2} (I - 12 \sum_{j = 1}^{m} \frac{B^{2 j}}{(2 j + 3) (4 j^{2} - 1)}) - F, \end{matrix}

(47)

where

F = 2 B {(I - B^{2})}^{- 2} .

We can use the following Padé approximation of

ln A_{k}

[20].

\begin{matrix} ln A_{k} \approx (B + 2 B^{2} + \frac{47}{36} B^{3} + \frac{11}{36} B^{4} + \frac{137}{7560} B^{5}) F, \\ F = {(I + \frac{5}{2} B + \frac{20}{9} B^{2} + \frac{5}{6} B^{3} + \frac{5}{42} B^{4} + \frac{1}{252} B^{5})}^{- 1}, \end{matrix}

(48)

where

B = A_{k} - I .

We also provide some numerical results for the matrix function

ln A .

Based on (41)–(44) we also provide some numerical results of

ln A

using (46)–(48). It can be seen from Figure 6 that through optimization (43), the calculation accuracy of (46)–(48) is significantly improved, and the calculation speed of (46)–(48) is more than twice as fast as the inner function. The calculation speed advantage of the Padé approximation is obvious.

We used the following identities to implement our general algorithm for the computations of the inverse trigonometric and hyperbolic functions. To control the length of the paper, we will not display the results here.

\begin{matrix} arcsin x = 2 arcsin \frac{x}{\sqrt{2 (1 + \sqrt{1 - x^{2}})}}, \\ arctan x = 2 arctan \frac{x}{1 + \sqrt{1 + x^{2}}}, \\ arcsin x = x + \sum_{l = 1}^{\infty} \frac{(2 l - 1) x^{2 l + 1}}{(2 l)! (2 l + 1)}, arctan x = \sum_{l = 0}^{\infty} \frac{x^{2 l + 1}}{(2 l + 1)}, \end{matrix}

(49)

\begin{matrix} arcsinh x = 2 arcsinh \frac{x}{\sqrt{2 (1 + \sqrt{1 + x^{2})}}}, arctanh x = 2 arctanh \frac{x}{1 + \sqrt{1 - x^{2}}}, \\ arcsinh x = x + \sum_{l = 1}^{\infty} \frac{{(- 1)}^{l} (2 l - 1) x^{2 l + 1}}{(2 l)! (2 l + 1)}, a r c t a n h x = \sum_{l = 0}^{\infty} \frac{{(- 1)}^{l} x^{2 l + 1}}{(2 l + 1)} . \end{matrix}

(50)

For

arcsin A, arctan A,

arcsinh

A, and

arctanh

A,

under certain conditions, we can use the following identity.

\begin{matrix} arcsin A = - i ln (\sqrt{I - A^{2}} + i A), arcsinh A = ln (\sqrt{I + A^{2}} + A), \\ arctan A = \frac{i}{2} ln ((i I + A) {(i I - A)}^{- 1}), \\ arctanh A = \frac{1}{2} ln ((I + A) {(I - A)}^{- 1}) . \end{matrix}

(51)

6. Improved Calculation Formula for Initial Value Problems (3) and (8)

Now, let us consider the calculation of the integral in (11), i.e.,

\begin{matrix} {\hat{F}}_{k} = \int_{0}^{τ} e^{H s} F_{H} (k τ - s) d s . \end{matrix}

(52)

If numerical integration is used to calculate this integral, it is quite time-consuming, and the calculation accuracy cannot be guaranteed. We adopt the following method here. For

n n = 1, 2, \dots

, there is the following relationship:

\begin{matrix} {\hat{F}}_{k} = \sum_{j = 0}^{n n - 1} \int_{\frac{j τ}{n n}}^{\frac{j τ}{n n}} e^{H s} F_{H} (k τ - s) d s = \sum_{j = 0}^{n n - 1} T_{n n, j} \int_{0}^{\frac{τ}{n n}} e^{H s} F_{H} (k τ - \frac{j τ}{n n} - s) d s \\ \approx \sum_{j = 0}^{n n - 1} T_{n n, j} \sum_{l = 0}^{m} \frac{H^{l}}{l!} \int_{0}^{\frac{τ}{n n}} s^{l} F_{H} (k τ - \frac{j τ}{n n} - s) d s, \end{matrix}

(53)

where

\begin{matrix} T_{n n, 0} = I, T_{n n} = e^{H \frac{τ}{n n}}, T_{n n, j} = e^{H \frac{j τ}{n n}} = T_{n n} T_{n n, j - 1}, j = 1, 2, \dots, n n - 1 . \end{matrix}

(54)

T_{n n}

can be calculated by the following method:

\begin{matrix} T_{n n} = I, z z = I; \\ for (i = 1, i \leq m, i + +) (z z = \frac{\frac{τ}{n n}}{i} H \times z z; T_{n n} = T_{n n} + z z) . \end{matrix}

(55)

So, (53) can be written as follows:

\begin{matrix} {\hat{F}}_{k} \approx \sum_{j = 0}^{n n - 1} T_{n n, j} H_{j}, H_{j} = \sum_{l = 0}^{m} \frac{H^{l}}{l!} \int_{0}^{\frac{τ}{n n}} s^{l} F_{H} (k τ - \frac{j τ}{n n} - s) d s, \end{matrix}

(56)

so we can convert the integral of the product of matrix exponential function and vector function into the integral of pure vector function.

The improved calculation formula for (3) is as follows:

\begin{matrix} (55), (56), \\ U_{k} = T U_{k - 1} + \sum_{j = 0}^{n n - 1} T_{n n, j} H_{j}, k = 1, 2, \dots, \end{matrix}

(57)

where

T = e^{H τ}

is calculated by the algorithm in Table 1, and

T_{n n, j} a n d H_{j}

are calculated by (55) and (56).

For

\begin{matrix} \int_{0}^{\frac{τ}{n n}} s^{l} F_{H} (k τ - \frac{j τ}{n n} - s) d s, \end{matrix}

(58)

the following two algorithms can be used.

(1) Integrable function method If

F (t) = {(f_{1} (t), f_{2} (t), \dots, f_{v} (t), \dots, f_{n} (t))}^{T},

f_{v} (t)

is the algebraic sum of functions of the following types:

\begin{matrix} t^{α}, t^{l} e^{- ω t}, t^{l} sin ω t, t^{l} cos ω t, \end{matrix}

(59)

then (58) can be expressed in analytical form, so

H_{j}

can also be expressed in analytical form. In this way, the calculation of (57) can be completed.

(2) Gauss three-point integral formula

Note that, under 16-bit calculation accuracy,

\begin{matrix} \frac{τ}{n n} < \frac{1}{2^{n_{16}} λ_{H}} \approx \frac{0.0376434}{|λ_{H}|} < 0.0376434, \end{matrix}

(60)

and the algebraic accuracy of the Gauss three-point integral formula is 5. Using the Gauss three-point integral formula,

\begin{matrix} \frac{1}{2} {(\frac{τ}{n n})}^{l + 1} \sum_{i = 1}^{3} w_{i} {(\frac{1 + b_{i}}{2})}^{l} F_{H} (k τ - \frac{(j + 1 / 2) τ}{n n} - \frac{b_{i} τ}{2 n n}), \end{matrix}

(61)

where

w_{1} = w_{3} = \frac{5}{9}, w_{2} = \frac{8}{9}, b_{1} = - \frac{\sqrt{15}}{5}, b_{2} = 0, b_{3} = \frac{\sqrt{15}}{5},

to approximate (58) with high accuracy. So, (57) can be replaced by the following algorithm.

Algorithm 1.

Numerical solution algorithm of initial value problem (3): for

n n = 1, 2, \dots

\begin{matrix} U_{k} = T U_{k - 1} + \frac{τ}{2 n n} \sum_{j = 0}^{n n - 1} T_{n n, j} \sum_{i = 1}^{3} w_{i} e^{H \frac{(1 + b_{i}) τ}{2 n n}} F_{H} (k τ - \frac{(2 j + 1 + b_{i}) τ}{2 n n}), k = 1, 2, \dots, \end{matrix}

(62)

where

\begin{matrix} e^{H \frac{(1 + b_{i}) τ}{2 n n}} \approx \frac{w_{i} τ}{2 n n} (I + \sum_{l = 1}^{m} \frac{1}{l!} {(\frac{(1 + b_{i}) τ}{2 n n})}^{l} H^{l}) . \end{matrix}

(63)

In particular, if we let

\begin{matrix} n n = \{\begin{matrix} [2^{n_{p}} λ_{H} τ] + 1 2^{n_{p}} λ_{H} τ > 1 \\ 1 2^{n_{p}} λ_{H} τ \leq 1 \end{matrix}, \end{matrix}

(64)

then

N_{p} = 0, m = m_{p} = [\sqrt{\frac{9 p ln 10}{8 ln 2}}],

\begin{matrix} W_{i, H} \approx \frac{w_{i} τ}{2 n n} (I + \sum_{l = 1}^{m_{p}} \frac{1}{l!} {(\frac{(1 + b_{i}) τ}{2 n n})}^{l} H^{l}) . \end{matrix}

(65)

At this time, the theoretical calculation accuracy of (62) is

p .

Remark 3.

Note that,

W_{i, H} (i = 1, 2, 3)

in (63) and (65) only needs to be calculated once, and is independent of

F (t)

, which is not only efficient but also easy to program.

Now let us consider the numerical solution of the initial value problem (8). If it is converted to the initial value problem (3), then

\begin{matrix} (\begin{matrix} x (t) \\ x' (t) \end{matrix}) = U, H = (\begin{matrix} 0 & I_{3 \times 3} \\ - K & 0 \end{matrix}), F_{H} (t) = (\begin{matrix} 0 \\ F (t) \end{matrix}), U_{0} = (\begin{matrix} x_{0} \\ x_{1} \end{matrix}) . \end{matrix}

(66)

By

\begin{matrix} W_{i, H} = e^{τ H} \approx (\begin{matrix} {cos}_{p} (τ A) & τ {sin}_{1, p} (τ A) \\ - K τ {sin}_{1, p} (τ A) & {cos}_{p} (τ A) \end{matrix}), A = \sqrt{K}, \end{matrix}

(67)

where

\begin{matrix} {sin}_{1} A = \sum_{j = 0}^{\infty} \frac{{(- 1)}^{j}}{(2 j + 1)!} A^{2 j} = \sum_{j = 0}^{\infty} \frac{{(- 1)}^{j}}{(2 j + 1)!} K^{j}, \end{matrix}

(68)

and (68), we obtain the following algorithm.

Algorithm 2.

\begin{matrix} Δ t = τ / 3^{N}; T_{α} = I; z z = I; A_{2} = (A Δ t) \times (A Δ t); \\ for (i = 1, i \leq m, i + +) (z z = - \frac{A_{2}}{(2 i) (2 i + 1)} \times z z; T_{α} = T_{α} + z z), \end{matrix}

(69)

and

\begin{matrix} for (i = 0, i < N, i + +) \\ (z z = T_{α} \times T_{α}, z z = z z \times A_{2}, A_{2} = 9 A_{2}, T_{α} = T_{α} - \frac{4}{3} T_{α} \times z z), \end{matrix}

(70)

where

m_{p}

and

N_{p}

are determined using (31).

Algorithm 3.

Numerical solution algorithm of initial value problem (8):

\begin{matrix} x_{k} = (cos (τ A) x_{k - 1} + τ {sin}_{1} (τ A) y_{k - 1}) \\ + \frac{τ}{2 n n} \sum_{j = 0}^{n n - 1} \sum_{i = 1}^{3} \frac{w_{i} (2 j + 1 + b_{i}) τ}{2 n n} {sin}_{1} (\frac{(2 j + 1 + b_{i}) τ}{2 n n} A) F (k τ - \frac{(2 j + 1 + b_{i}) τ}{2 n n}) \\ y_{k} = - A sin (τ A) x_{k - 1} + cos (τ A) y_{k - 1} \\ + \frac{τ}{2 n n} \sum_{j = 0}^{n n - 1} \sum_{i = 1}^{3} w_{i} cos (\frac{(2 j + 1 + b_{i}) τ}{2 n n} A) F (k τ - \frac{(2 j + 1 + b_{i}) τ}{2 n n}) \end{matrix}

(71)

for

n n = 1, 2, \dots,

where

A = \sqrt{K} .

In particular, if

n n

is determined using (64), the theoretical accuracy of this algorithm is p and

\begin{matrix} x_{k} = cos (τ A) x_{k - 1} + τ {sin}_{1} (τ A) y_{k - 1} + \frac{τ}{2 n n} \sum_{j = 0}^{n n - 1} \sum_{i = 1}^{3} w_{i} T W_{j, i}^{sin} F (k τ - \frac{(2 j + 1 + b_{i}) τ}{2 n n}), \\ y_{k} = cos (τ A) y_{k - 1} - A sin (τ A) x_{k - 1} + \frac{τ}{2 n n} \sum_{j = 0}^{n n - 1} \sum_{i = 1}^{3} w_{i} T W_{j, i}^{cos} F (k τ - \frac{(2 j + 1 + b_{i}) τ}{2 n n}), \\ x_{0} = x (0), y_{0} = x' (0), \end{matrix}

(72)

where

\begin{matrix} T W_{j, i}^{sin} = \frac{(1 + b_{i}) τ}{2 n n} cos (\frac{j τ}{n n} A) {sin}_{1, m_{p}} (\frac{(1 + b_{i}) τ}{2 n n} A) \\ + \frac{j τ}{n n} {sin}_{1} (\frac{j τ}{n n} A) {cos}_{m_{p}} (\frac{(1 + b_{i}) τ}{2 n n} A), \\ T W_{j, i}^{cos} = cos (\frac{j τ}{n n} A) {cos}_{m_{p}} (\frac{(1 + b_{i}) τ}{2 n n} A) \\ - sin (\frac{j τ}{n n} A) {sin}_{m_{p}} (\frac{(1 + b_{i}) τ}{2 n n} A), \end{matrix}

(73)

and

\begin{matrix} {sin}_{m_{p}} (s A) = A \sum_{l = 0}^{m_{p}} \frac{{(- 1)}^{l} s^{2 l + 1} K^{l}}{(2 l + 1)!}, \\ {sin}_{1, m_{p}} (s A) = \sum_{l = 0}^{m_{p}} \frac{{(- 1)}^{l} s^{2 l} K^{l}}{(2 l + 1)!}, \\ {cos}_{m_{p}} (s A) = I + \sum_{l = 1}^{m_{p} + 1} \frac{{(- 1)}^{l} s^{2 l} K^{l}}{(2 l)!} . \end{matrix}

(74)

If A is reversible, then

\begin{matrix} x_{k} = cos (τ A) x_{k - 1} + A^{- 1} sin (τ A) y_{k - 1} \\ + \frac{τ}{2 n n} A^{- 1} \sum_{j = 0}^{n n - 1} \sum_{i = 1}^{3} w_{i} sin (\frac{(2 j + 1 + b_{i}) τ}{2 n n} A) F (k τ - \frac{(2 j + 1 + b_{i}) τ}{2 n n}) \\ y_{k} = - A sin (τ A) x_{k - 1} + cos (τ A) y_{k - 1} \\ + \frac{τ}{2 n n} \sum_{j = 0}^{n n - 1} \sum_{i = 1}^{3} w_{i} cos (\frac{(2 j + 1 + b_{i}) τ}{2 n n} A) F (k τ - \frac{(2 j + 1 + b_{i}) τ}{2 n n}) . \end{matrix}

(75)

Therefore, (75) can be written in the form of complex function as follows:

\begin{matrix} z_{k} = e^{- i τ A} z_{k - 1} + \frac{i τ}{2 n n} A^{- 1} \sum_{j = 0}^{n n - 1} e^{- \frac{j τ i}{n n} A} \sum_{l = 1}^{3} w_{l} e^{- \frac{(1 + b_{l}) τ i}{2 n n} A} F (k τ - \frac{(2 j + 1 + b_{l}) τ}{2 n n}), \end{matrix}

(76)

or

\begin{matrix} Z_{k} = e^{- i τ A} Z_{k - 1} + \frac{τ}{2 n n} \sum_{j = 0}^{n n - 1} e^{- \frac{j τ i}{n n} A} \sum_{l = 1}^{3} w_{l} e^{- \frac{(1 + b_{l}) τ i}{2 n n} A} F (k τ - \frac{(2 j + 1 + b_{l}) τ}{2 n n}), \end{matrix}

(77)

where

i = \sqrt{- 1},

\begin{matrix} z_{k} = x_{k} + i A^{- 1} y_{k}, x_{k} = R e z_{k}, y_{k} = A I m z_{k}, \\ Z_{k} = y_{k} - i A x_{k}, y_{k} = R e Z_{k}, x_{k} = - A^{- 1} I m Z_{k} . \end{matrix}

(78)

We first provide an examples to illustrate the effectiveness of Algorithms 1. Although we skip examples of Algorithm 2, we then include an example for the effectiveness of Algorithm 3.

Example 1.

In (8), let

\begin{matrix} K = (\begin{matrix} 1 & - 1 \\ - 1 & \frac{5}{2} \\ 1 & 4 \end{matrix}), F (t) = (\begin{matrix} sin t \\ \frac{1}{2} sin t \\ cos \frac{\sqrt{2} t}{2} - cos (\sqrt{3} t) + sin t \end{matrix}), \\ x_{0} = {(\begin{matrix} \frac{5}{2} & 0 & - 2 \end{matrix})}^{T}, x_{1} = {(\begin{matrix} 1 & 1 & 2 \end{matrix})}^{T} . \end{matrix}

(79)

The analytic solution of the initial value problem (8) is

\begin{matrix} x (t) = (\begin{matrix} 2 cos \frac{\sqrt{2} t}{2} + \frac{1}{2} cos (\sqrt{3} t) + sin t \\ cos \frac{\sqrt{2} t}{2} - cos (\sqrt{3} t) + sin t \\ sin (2 t) - 2 cos (2 t) \end{matrix}) . \end{matrix}

(80)

Let

e r r_{i, k} = {∥x (k τ) - x_{i, k}∥}_{\infty} i = 1, 2,

where

x_{1, k}, x_{2, k} (k = 0, 1, 2, \dots, N)

are the numerical solutions of the initial value problem (8) determined by Algorithm 1 and Algorithm 3, respectively.

e r r_{s y m, k} = {∥x (k τ) - x_{s y m} (k τ)∥}_{\infty},

where

x_{s y m} (t)

is the numerical solution obtained by calling the function

N D S o l v e

in Mathematica. The numerical results of

e r r_{1, k}, e r r_{2, k}, e r r_{s y m, k} [0, T]

(k = 0, 1, 2, \dots, N τ = T)

are as follows.

It can be seen from Figure 7 that the calculation accuracy of Algorithm 1 is higher than that of Algorithm 3 but the calculation speed of Algorithm 3 is three times that of Algorithm 1. However, the error of numerical results of Mathematica internal functions is far greater than the error of Algorithms 1 and 3. In particular, the collapse is calculated at

t = 667.787

. After this point, the error increases exponentially with time t. However, Algorithms 1 and 3 are quite stable. For example, the calculation error in

[0, 8000]

is still very small.

In Algorithms 1 and 3, let

n n = μ n n_{A}, μ = 0.5, 0.25, 0.125

, where

n n_{A}

is calculated according to (64). The numerical results are as follows.

Compared with Figure 7 and Figure 8, when

μ = 0.5

, the calculation accuracy is not reduced by much, and the calculation speed is almost doubled. For

μ = 0.125

Algorithms 1 and 3, the calculation accuracy is still high, and the calculation speed is 10 times higher than that in Figure 8.

Example 2.

In (8), let

K = {(k_{i, j})}_{n \times n}

be a tridiagonal matrix.

F (t) = {(f_{1} (t), f_{2} (t), \dots, f_{n} (t))}^{T}

,

f_{i} (t) = A_{i} c o s (ω_{i} t + φ_{i}) (i = 1, 2, \dots, n)

,

x_{0} = {(x_{0, 1}, x_{0, 2}, \dots, x_{0, n})}^{T}

,

x_{1} = {(x_{1, 1}, x_{1, 2}, \dots, x_{1, n})}^{T}

, and they are randomly generated and satisfy the following conditions:

\begin{matrix} |k_{i, i} - 9| \leq 1, |k_{i, i + 1} - \frac{3}{2}| \leq \frac{1}{2}, |k_{i + 1, i} - \frac{3}{2}| \leq \frac{1}{2}, \\ |A_{i} - \frac{3}{2}| \leq \frac{1}{2}, |ω_{i} - \frac{5}{2}| \leq \frac{1}{2}, 0 \leq φ_{i} \leq \frac{π}{2}, \\ |x_{0, i}| \leq 1, |x_{1, i}| \leq 1 (i = 1, 2, \dots, n) . \end{matrix}

For

n = 50, 100, 200, and 400,

the following numerical results are provided by Algorithms 1 and 3.

In Figure 9, (1)–(4), the left is the first two components of the numerical solution, and the right is the difference between the numerical solutions obtained by the two algorithms. It is noted that sometimes

ω_{i} (i = 1, 2, \dots, n)

is close to the frequency of the solution, and flutter or resonance may occur. However, the values obtained by the two algorithms are still very close.

7. Conclusions

In this paper, we established algorithms for the computation of matrix exponential functions, matrix trigonometric functions, and matrix hyperbolic functions. Our numerical results show that these algorithms are faster with higher accuracy compared to existing algorithms and the system algorithms in Mathematica. In the series expansion of matrix function, the number of expanded items

m_{p}

is only related to the requirement of calculation accuracy, and has nothing to do with the matrix itself. However, the number of times in the iterative process

N_{p}

is not only related to the matrix itself but also closely related to

t .

When

|3^{n_{p}} λ_{A} t| \leq 1

(

|2^{n_{p}} λ_{A} t| \leq 1

),

N_{p} = 0

, the calculation of the matrix function becomes an algebraic operation, so the integral in (9)–(12) becomes an easy integral to compute. In brief, our algorithm allows users to set a required level of accuracy and then use it to choose appropriate values for parameters m and N to minimize computation time. In this manner, both accuracy and efficiency are addressed simultaneously.

Author Contributions

Conceptulation, H.Q. and Y.L.; mehtodology, H.Q. and Y.L.; software, H.Q.; validation, H.Q. and Y.L.; formal analysis, H.Q. and Y.L.; investigation, H.Q. and Y.L.; visualization, H.Q.; supervision, H.Q.; project administration, Y.L. All authors have read and agreed to the publication version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ruiz, P.; Sastre, J.; Ibáñeza, J.; Defez, E. High performance computing of the matrix exponential. J. Comput. Applied Math. 2015, 291, 370–379. [Google Scholar] [CrossRef]
Molert, C.; Loans, C.V. Nineteen dubious ways to compute the exponential of a matrix. SIAM Rev. Soc. Ind. Applied Math. 1978, 20, 801–836. [Google Scholar]
Higham, N.J. Functions of Matrices: Theory and Computation; SIAM: Philadelphia, PA, USA, 2008. [Google Scholar]
Al-Mohy, A.H.; Higham, N.J. Computing the Action of the Matrix Exponential, with an Application to Exponential Integrators. SIAM J. Sci. Comput. 2011, 33, 488–511. [Google Scholar] [CrossRef]
Higham, N.J. The scaling and squaring method for the matrix exponential resisted. SIAM J. Matrix Anal. Appl. 2005, 26, 1179–1193. [Google Scholar] [CrossRef]
Bader, P.; Blanes, S.; Casas, F. Computing the Matrix Exponential with an Optimized Taylor Polynomial approximation. Mathematics 2019, 7, 1174. [Google Scholar] [CrossRef]
Seydaoglu, K.M.; Bader, P.; Blanes, S.; Casas, F. Computing the matrix sine and cosine simultaneously with a reduced number of products. arXiv 2020, arXiv:2010.00465v1. [Google Scholar] [CrossRef]
Hargreaves, G.I.; Higham, N.J. Efficient algorithms for the matrix cosine and sine. Numer. Algorithms 2005, 40, 383–400. [Google Scholar] [CrossRef]
Alonso, P.; Ibáñez, J.; Sastre, J.; Peinado, J.; Defez, E. Efficient and accurate algorithms for computing matrix trigonometric functions. J. Comput. Appl. Math. 2017, 309, 325–332. [Google Scholar] [CrossRef]
Alonso, P.; Peinado, J.; Ibáñez, J.; Sastre, J.; Defez, E. Computing Matrix Trigonometric Functions with GPUs through Matlab. J. Supercomput. 2019, 75, 1227–1240. [Google Scholar] [CrossRef]
Al-Mohy, A.H.; Higham, N.J.; Relton, S.D. New Algorithms for Computing the Matrix Sine and Cosine Separately or Simultaneously. SIAM J. Sci. Comput. 2015, 37, A456–A487. [Google Scholar] [CrossRef]
Serbin, S.M.; Blalock, S.A. An algorithm for computing the matrix cosine. SIAM J. Sci. Statist. Comput. 1980, 1, 198–204. [Google Scholar] [CrossRef]
Cooper, B.L.; Bingulac, S. Computational improvement in the calculation of the natural log of a square matrix. Electron. Lett. 1990, 26, 861–862. [Google Scholar] [CrossRef]
Wu, F.; Zhu, L.; Shi, Q. Efficient Computational method for matrix function in dynamic problems. Acta Mech. Sin. 2023, 39, 522451. [Google Scholar] [CrossRef]
Cardoso, J.R.; Sadeghi, A. A technique for improving the computation of functions of triangular matrices. Int. J. Comput. Math. 2022, 99, 2449–2465. [Google Scholar] [CrossRef]
Jiang, Y.; He, C.; Qin, H. An accelerated algorithm for high precision and fast calculation of basic elementary functions. Math. Pract. Theory 2017, 47, 238–246. [Google Scholar]
Fasi, M.; Higham, N.J. Multiprecision algorithm for Computing matrix logarithm. SIAM J. Matrixanal. Appl. 2018, 39, 472–491. [Google Scholar] [CrossRef]
Ibáñez, J.; Sastre, J.; Ruiz, P.; Alonso, J.M.; Defez, E. An Improved Taylor Algorithm for Computing the Matrix Logarithm. Mathematics 2021, 9, 2018. [Google Scholar] [CrossRef]
Cheng, S.H.; Higham, N.J.; Kenney, C.S.; Laub, A.J. Approximating the Logarithm of a Matrix to Specified Accuracy. SIAM J. Matrix Anal. Appl. 2001, 22, 1112–1125. [Google Scholar] [CrossRef]
Kenney, C.; Laub, A.J. Condition estimates for matrix functions. SIAM J. Matrix Anal. Appl. 1989, 10, 191–209. [Google Scholar] [CrossRef]
Kenney, C.; Laub, A.J. Padé error estimates for the logarithm of a matrix. Internat. J. Control. 1989, 50, 707–730. [Google Scholar] [CrossRef]
Milovanović, G.V.; Rančić, S.L. Some improvements in calculation of the natural log of a square matrix. In Proceedings of the Third International Conference on Functional Analysis and Approximation Theory, Aquafredda di Maratea, Potenza, Italy, 23–28 September 1996; Issue 52. Volume II. [Google Scholar]

Figure 1. Error comparison of the algorithms in Table 1 for

μ = 1, \frac{2}{3}, \frac{4}{3}

.

Figure 1. Error comparison of the algorithms in Table 1 for

μ = 1, \frac{2}{3}, \frac{4}{3}

.

Figure 2. Error of

e^{A}, cos A, and sin A

for three algorithms.

Figure 2. Error of

e^{A}, cos A, and sin A

for three algorithms.

Figure 3. Error of

sinh A and cosh A

for three algorithms.

Figure 3. Error of

sinh A and cosh A

for three algorithms.

Figure 4. Error of the

tan A, sec A, tanh A, and s e c h A

for two algorithms.

Figure 4. Error of the

tan A, sec A, tanh A, and s e c h A

for two algorithms.

Figure 5. Error of

cot A, cot A, csc A, and c s c h A f o r t w o a l g o r i t h m s

.

Figure 5. Error of

cot A, cot A, csc A, and c s c h A f o r t w o a l g o r i t h m s

.

Figure 6. Error of

ln A

for the several algorithms.

Figure 6. Error of

ln A

for the several algorithms.

Figure 7. Error of numerical solution (8) for several algorithms (

n m

is calculated according to (64)).

Figure 7. Error of numerical solution (8) for several algorithms (

n m

is calculated according to (64)).

Figure 8. Error of numerical solution (8) for several algorithms.

Figure 9. Numerical solutions and errors of two algorithms.

Table 1. The fast algorithm for

e^{A}, cos A, sin A, cosh A, and sinh A

.

Table 1. The fast algorithm for

e^{A}, cos A, sin A, cosh A, and sinh A

.

	$m_{p}, N_{p}$	Algorithm
$e^{A}$	$m_{p} = [\sqrt{\frac{9 p ln 10}{8 ln 2}}], N_{p} = [N_{e}] + 1$	$(1) B = \frac{A}{2^{N_{p}}}, C = I + \sum_{j = 1}^{m_{p}} \frac{B^{j}}{j!}$
		$(2) f o r (i = 0, i < N, i + +) C = C^{2}$
		$(3) e^{A} = C$
$\{\begin{matrix} sin A \\ sinh A \end{matrix}\}$	$m_{p} = [\sqrt{\frac{p ln 10}{2 ln 3}}], N_{p} = [N_{sin}] + 1$	$(1) B = \frac{A^{2}}{3^{2 N_{p}}}, C = \frac{A}{3^{N}} + \frac{A}{3^{N}} \sum_{j = 1}^{m_{p}} \frac{{(\mp B)}^{j}}{(2 j + 1)!}$
		$(2) f o r (i = 0, i < N, i + +) C = 3 C \mp 4 C^{3}$
		$(3) \{\begin{matrix} sin A \\ sinh A \end{matrix}\} = C$
$(I) \{\begin{matrix} cos A \\ cosh A \end{matrix}\}$	$m_{p} = [\sqrt{\frac{p ln 10}{2 ln 3}}] + 1, N_{p} = [N_{sin}]$	$(1) B = \frac{A^{2}}{3^{2 N_{p}}}, C = I + \sum_{j = 1}^{m_{p}} \frac{{(\mp B)}^{j}}{(2 j)!}$
		$(2) f o r (i = 0, i < N, i + +) C = 4 C^{3} - 3 C$
		$(3) \{\begin{matrix} cos A \\ cosh A \end{matrix}\} = C$
$(I I) \{\begin{matrix} cos A \\ cosh A \end{matrix}\}$	$m_{p} = [\sqrt{\frac{p ln 10}{2 ln 3}}] + 1, N_{p} = [N_{sin}]$	$(1) B = \frac{A^{2}}{3^{2 N_{p}}}, C = I + \sum_{j = 1}^{m_{p}} \frac{{(\mp B)}^{j}}{(2 j)!}$
		$(2) f o r (i = 0, i < N, i + +) C = C^{2} - I$
		$(3) \{\begin{matrix} cos A \\ cosh A \end{matrix}\} = C$
$\begin{matrix} N_{e} = \frac{p ln 10}{(m_{p} + 1) ln 2} + \frac{ln λ}{ln 2} - \frac{ln (m_{p} + 1)!}{(m_{p} + 1) ln 2}, N_{cos} = \frac{p ln 10}{(2 m_{p} + 1) ln 2} + \frac{ln λ}{ln 2} - \frac{ln (2 m_{p} + 1)!}{(2 m_{p} + 1) ln 2}, \\ N_{sin} = \frac{p ln 10}{(2 m_{p} + 2) ln 3} + \frac{ln λ}{ln 3} - \frac{ln (2 m_{p} + 2)!}{(2 m_{p} + 2) ln 3} . \end{matrix}$

Table 2. The fast algorithms for

tan A, sec A, tanh A

, and

s e c h A

.

Table 2. The fast algorithms for

tan A, sec A, tanh A

, and

s e c h A

.

	$m_{p}, N$	Algorithm
$\{\begin{matrix} tan A \\ tanh A \end{matrix}\}$	$m_{p} = [\sqrt{\frac{2 p ln 10}{ln 3}}], N = N_{1} + 1$	$B = \frac{A^{2}}{3^{2 N}}$ ,
		$C = \frac{2 A}{3^{N}} \sum_{j = 0}^{m} {(\pm 1)}^{j} σ (j + 1) (4^{j + 1} - 1) B^{j}$
		$f o r (i = 0, i < N, i + +)$
		$C = C (3 I \mp C^{2}) {(I \mp 3 C^{2})}^{- 1}$
		$\{\begin{matrix} tan A \\ tanh A \end{matrix}\} \approx C$
$\{\begin{matrix} sec A \\ sec h A \end{matrix}\}$	$m_{p} = [\sqrt{\frac{2 p ln 10}{ln 3}}], N = N_{2} + 1$	$B = \frac{A^{2}}{3^{2 N}}, I + \sum_{j = 1}^{m} {(\mp 1)}^{j} {\hat{E}}_{j} B^{j}$
		$f o r (i = 0, i < N, i + +)$
		$C = C^{3} {(4 I - 3 C^{2})}^{- 1}$
		$\{\begin{matrix} sec A \\ sec h A \end{matrix}\} \approx C$
$N_{1} = [\frac{p ln 10 + ln (2 σ (m_{p} + 2))}{(2 m_{p} + 3) ln 3} + \frac{ln λ}{ln 3}], N_{2} = [\frac{p ln 10 + ln {\hat{E}}_{m_{p} + 1}}{(2 m_{p} + 3) ln 3} + \frac{ln λ}{ln 3}] .$

Table 3. The fast algorithms for

cot A, coth A, csc A,

and

csch A

.

Table 3. The fast algorithms for

cot A, coth A, csc A,

and

csch A

.

	$m_{p}, N$	Algorithm
$\{\begin{matrix} cot A \\ coth A \end{matrix}\}$	$m_{p} = [\sqrt{\frac{7 p ln 10}{5 ln 3}}], N = N_{1} + 1$	$B = \frac{A^{2}}{3^{2 N}}, C = I - 2 \sum_{j = 1}^{m} {(\pm 1)}^{j} σ (j) B^{j}$
		$f o r (i = 0, i < N, i + +)$
		$(C = 3 (C (3 B \mp C^{2}) {(B \mp 3 C^{2})}^{- 1}; B = 9 B)$
		$\{\begin{matrix} cot A \\ coth A \end{matrix}\} \approx C$
$\{\begin{matrix} csc A \\ csch A \end{matrix}\}$	$m_{p} = [\sqrt{\frac{5 p ln 10}{2 ln 3}}], N = N_{2} + 1$	$B = \frac{A^{2}}{3^{2 N}}, I + 2 \sum_{j = 1}^{m + 1} {(\pm 1)}^{j} σ (j) (1 - \frac{2}{4^{j}}) B^{j}$
		$f o r (i = 0, i < N, i + +)$
		$(C = 3 C^{3} {(3 C^{2} \mp 4 B)}^{- 1}; B = 9 B)$
		$\{\begin{matrix} csc A \\ csch A \end{matrix}\} \approx C$
$N_{1} = [\frac{p ln 10 + ln (2 σ (m_{p} + 2))}{(2 m_{p} + 3) ln 3} + \frac{ln λ}{ln 3}], N_{2} = [\frac{p ln 10 + ln {\hat{E}}_{m_{p} + 1}}{(2 m_{p} + 3) ln 3} + \frac{ln λ}{ln 3}]$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Qin, H.; Lu, Y. An Efficient Algorithm for Basic Elementary Matrix Functions with Specified Accuracy and Application. AppliedMath 2024, 4, 690-708. https://doi.org/10.3390/appliedmath4020037

AMA Style

Qin H, Lu Y. An Efficient Algorithm for Basic Elementary Matrix Functions with Specified Accuracy and Application. AppliedMath. 2024; 4(2):690-708. https://doi.org/10.3390/appliedmath4020037

Chicago/Turabian Style

Qin, Huizeng, and Youmin Lu. 2024. "An Efficient Algorithm for Basic Elementary Matrix Functions with Specified Accuracy and Application" AppliedMath 4, no. 2: 690-708. https://doi.org/10.3390/appliedmath4020037

Article Menu

An Efficient Algorithm for Basic Elementary Matrix Functions with Specified Accuracy and Application

Abstract

1. Introduction

2. The General Theorem of the Fast Algorithm for Matrix Functions

3. Fast Algorithm for $e^{A}, \cos A, \sin A, \sinh A,$ and $\cosh A$

4. Fast Algorithm for $\tan A, \tanh A, \sec A$ , and $sechA$

5. Fast Algorithm for Logarithmic Matrix Function $ln A$ and Some Related Functions

6. Improved Calculation Formula for Initial Value Problems (3) and (8)

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

An Efficient Algorithm for Basic Elementary Matrix Functions with Specified Accuracy and Application

Abstract

1. Introduction

2. The General Theorem of the Fast Algorithm for Matrix Functions

3. Fast Algorithm for e A , cos A , sin A , sinh A , and cosh A

4. Fast Algorithm for tan A , tanh A , sec A , and sechA

5. Fast Algorithm for Logarithmic Matrix Function ln A and Some Related Functions

6. Improved Calculation Formula for Initial Value Problems (3) and (8)

7. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3. Fast Algorithm for $e^{A}, \cos A, \sin A, \sinh A,$ and $\cosh A$

4. Fast Algorithm for $\tan A, \tanh A, \sec A$ , and $sechA$

5. Fast Algorithm for Logarithmic Matrix Function $ln A$ and Some Related Functions