Article

Schröder-Based Inverse Function Approximation

School of Electrical Engineering, Computing and Mathematical Sciences, Curtin University, Perth 6845, Australia
Axioms 2023, 12(11), 1042; https://doi.org/10.3390/axioms12111042
Submission received: 18 July 2023 / Revised: 10 September 2023 / Accepted: 11 October 2023 / Published: 8 November 2023
(This article belongs to the Special Issue Advanced Approximation Techniques and Their Applications)

Abstract

Schröder approximations of the first kind, modified for the inverse function approximation case, are utilized to establish general analytical approximation forms for an inverse function. Such general forms are used to establish arbitrarily accurate analytical approximations, with a set relative error bound, for an inverse function when an initial approximation, typically with low accuracy, is known. Approximations for arcsine, the inverse of x − sin(x), the inverse Langevin function and the Lambert W function are used to illustrate this approach. Several applications are detailed. For the root approximation of a function, Schröder approximations of the first kind, based on the inverse of a function, have an advantage over the corresponding generalization of the standard Newton–Raphson method, as explicit analytical expressions for all orders of approximation can be obtained.

1. Introduction

Function definition and function approximation are fundamental to many areas of mathematics, science and technology. One challenging area of function approximation is the establishment of accurate analytical approximations for the inverse, $f^{-1}$, of a known function $f$ when an explicit analytical expression for $f^{-1}$ is not known. When $f^{-1}$ is not known, a variety of approaches can be used to determine an analytical approximation to $f^{-1}$ with a modest relative error bound over its domain. Systematic approaches can be utilized (e.g., through the use of Taylor series, series reversion, Padé approximants, minimax optimization, geometric considerations, etc.) to yield convergent approximations as the order of approximation is increased. In such cases, the order of convergence is generally modest. Custom ad hoc approaches can be utilized to yield improved results but these, in general, are not generalizable. The evolution of approaches to establish approximations for the inverse Langevin function, e.g., [1,2], is representative of the situation.
In contrast, iterative approaches, such as iteration based on the Newton–Raphson method for finding the root of a function, have significantly higher rates of convergence. With $y = f(x)$, which implies that $f(x) - y = 0$, it is clear that finding the inverse $x = f^{-1}(y)$, with $y$ fixed, is a root problem, and iterative methods can be employed. Potentially, much higher rates of convergence can be achieved. Gdawiec [3] provides a good overview of potential fixed-point iterative methods, which, in general, are associated with the more general problem of finding fixed points. For the sub-case of root approximation, the dominant method is Newton–Raphson iteration, and Ypma [4] provides details of the historical development of this method. Well-known alternatives include the Householder method, Steffensen's method and Halley's method. Newton–Raphson potentially leads to quadratic convergence, and research has led to many higher-order methods with better convergence, e.g., [5,6]. Amat [7] provides an overview of methods with cubic convergence. Abbasbandy [8] and Chun [9] proposed higher-order iteration methods based on Adomian decomposition. Noor [10] details a modified Householder two-step iterative method with fourth-order convergence.
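As a point of reference for the discussion above, the basic Newton–Raphson iteration can be sketched in a few lines. This is an illustrative sketch only; the test function and starting point are arbitrary choices, not taken from the paper:

```python
# Illustrative sketch (not from the paper): classic Newton-Raphson
# iteration x -> x - f(x)/f'(x) for a root of f, exhibiting the
# quadratic convergence discussed above.
def newton_raphson(f, df, x0, iterations=5):
    x = x0
    for _ in range(iterations):
        x -= f(x) / df(x)
    return x

# Example: the root of f(x) = x^2 - 2, i.e. sqrt(2), starting from x0 = 1.
root = newton_raphson(lambda x: x * x - 2.0, lambda x: 2.0 * x, 1.0)
```

With five iterations from $x_0 = 1$, the iterate agrees with $\sqrt{2}$ to machine precision, consistent with quadratic convergence.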
An alternative, but less well known, approach for approximating the root of a function $f$ is to directly utilize the inverse function $f^{-1}$, with the result being Schröder's approximations of the first kind. Petković [11] (Equation (17)), Gdawiec [3] (Equation (20)) and Dubeau [11] (Section 3) provide a perspective, and the original paper by Schröder dates from 1870 [12] (Equation (21)). The focus of this paper is on utilizing Schröder's approximations of the first kind, modified for the inverse function approximation case, to establish general analytical approximation forms for an inverse function whose explicit analytical form is not known. Such general forms can be used to establish arbitrarily accurate analytical approximations, with a set relative error bound, for an unknown inverse function when an initial approximation, typically with low accuracy, is known.
The ability of this approach to define arbitrarily accurate approximations for inverse functions is demonstrated via four examples: the arcsine function, the inverse of x sin x , the inverse Langevin function and the Lambert W function.
In Section 2, the theory underpinning root and inverse function approximation is detailed. The general theoretical results are applied to arcsine, the inverse of x sin x , the inverse Langevin function and the Lambert W function, respectively, in Section 3, Section 4, Section 5, Section 6. New approximations and several applications are noted. Conclusions are detailed in Section 7.

1.1. Background Result

Based on simple geometric considerations, the integral of an inverse function $f^{-1}$ can be shown to be
$$\int_{y_1}^{y} f^{-1}(\lambda)\, d\lambda = y f^{-1}(y) - y_1 f^{-1}(y_1) - \int_{f^{-1}(y_1)}^{f^{-1}(y)} f(\gamma)\, d\gamma,$$
assuming $f^{-1}$ is well defined on the interval $[y_1, y]$ and the integral of $f$, on the associated interval $[f^{-1}(y_1), f^{-1}(y)]$, is also well defined.

1.2. Assumptions and Notation

For an arbitrary function $f$, defined over the interval $[\alpha, \beta]$, an approximating function $f_A$ has a relative error, at a point $x_1$, defined according to $\mathrm{re}(x_1) = 1 - f_A(x_1)/f(x_1)$. The relative error bound for the approximating function, over the interval $[\alpha, \beta]$, is defined according to
$$\mathrm{re}_B = \max\{\, |\mathrm{re}(x_1)| : x_1 \in [\alpha, \beta] \,\}.$$
All functions are assumed to be differentiable up to the order being utilized in the analysis or results. The notation $f^{(k)}$ is used for the $k$th derivative of a function. The differentiation operator, $D$, is also used, with $k$th-order differentiation denoted $D^{(k)}$.
Mathematica® (version 13.1) is used to facilitate analysis and to obtain numerical results. In general, relative error results associated with approximations have been obtained by sampling specified intervals, in either a linear or logarithmic manner, as appropriate, with 1000 points.

2. Schröder’s Approximations of the First Kind

Consider the illustration, shown in Figure 1, of a function $f$ and an initial approximation $x_0$ for the root of $f$, which is denoted $x_o$. The usual approach to finding a better approximation to $x_o$ than $x_0$ is to utilize a first-order Taylor series approximation, denoted $t_1$, for $f$, based on the point $(x_0, f(x_0))$. This leads to the classic Newton–Raphson approximation $x_1$ for the root $x_o$ according to
$$x_1 = x_0 - \frac{f(x_0)}{f^{(1)}(x_0)}.$$
Naturally, and as illustrated in Figure 1, higher-order Taylor series are expected to lead to more accurate approximations. A second-order Taylor series yields the approximation
$$x_2 = x_0 - \frac{f^{(1)}(x_0)}{f^{(2)}(x_0)} \left[ 1 \pm \sqrt{1 - \frac{2 f(x_0) f^{(2)}(x_0)}{f^{(1)}(x_0)^2}} \right].$$
Explicit higher-order approximations are increasingly problematic: the $k$th-order approximation is associated with the dominant root of a $k$th-order polynomial. This problem can be avoided by utilizing, as illustrated in Figure 1, Taylor series approximations, denoted $t_k^I$ ($k$th-order approximation), for the inverse function $f^{-1}$, based on the point $(y_0, x_0)$, $y_0 = f(x_0)$. Whilst this may presuppose that the inverse function is known, the resulting Taylor series can be written solely in terms of $f$ and known parameter values such as $x_0$. Thus, this indirect approach leads to explicit analytical expressions for the root of $f$ for all orders of approximation, a preferable outcome. The details are noted below; the result was proposed by Schröder in 1870 [12].

2.1. Schröder’s Approximations of the First Kind

Consider the $n$th-order Taylor series, denoted $t_n^I$, for $f^{-1}$, based on the point $(y_0, f^{-1}(y_0))$, where $x_0 = f^{-1}(y_0)$:
$$t_n^I(y) = f^{-1}(y_0) + (y - y_0)\, D f^{-1}(y_0) + \frac{(y - y_0)^2}{2}\, D^{(2)} f^{-1}(y_0) + \cdots + \frac{(y - y_0)^n}{n!}\, D^{(n)} f^{-1}(y_0).$$
As $f^{-1}(y_0) = x_0$ and $y_0 = f(x_0)$, it then follows that the $n$th-order approximation to the root $x_o$, as given by $x_n^I = t_n^I(0)$, is
$$x_n^I = x_0 - f(x_0)\, D f^{-1}(y_0) + \frac{f(x_0)^2}{2}\, D^{(2)} f^{-1}(y_0) - \cdots + \frac{(-1)^n f(x_0)^n}{n!}\, D^{(n)} f^{-1}(y_0).$$
This is the basis of Schröder’s approximation of the first kind, e.g., [12] (Equation (21)), [11] (Equation (17)), [11] (Section 3) and [3] (Equation (20)).
Theorem 1.
Schröder's Approximations of the First Kind. Consider a real function $f$ that is strictly monotonic in the interval around a real root $x_o$ and including the initial approximation point $x_0$. An $n$th-order Taylor series for $f^{-1}$, based on the point $(y_0, x_0)$, $y_0 = f(x_0)$, yields the root according to
$$f^{-1}(0) = x_0 + \sum_{k=1}^{n} \frac{(-1)^k f(x_0)^k}{k!}\, D^{(k)} f^{-1}(y_0) + \epsilon_n^I, \quad n \in \{1, 2, \ldots\},$$
$$\epsilon_n^I = \frac{(-1)^{n+1} y_0^{\,n+1}}{(n+1)!}\, D^{(n+1)} f^{-1}(y_k), \quad y_k \in (0, y_0),$$
and the nth-order approximation to the root x o is
$$x_n^I = x_0 + \sum_{k=1}^{n} \frac{(-1)^k f(x_0)^k}{k!}\, D^{(k)} f^{-1}(y_0), \quad n \in \{1, 2, \ldots\}.$$
Evaluation of the derivatives leads to the nth-order approximation defined by Schröder [12] (Equation (21)):
$$\begin{aligned} x_n^I = x_0 &- \frac{f(x_0)}{f^{(1)}(x_0)} - \frac{f(x_0)^2 f^{(2)}(x_0)}{2 f^{(1)}(x_0)^3} - \frac{f(x_0)^3 f^{(3)}(x_0)}{6 f^{(1)}(x_0)^4} \left[ -1 + \frac{3 f^{(2)}(x_0)^2}{f^{(1)}(x_0) f^{(3)}(x_0)} \right] \\ &- \frac{f(x_0)^4 f^{(4)}(x_0)}{24 f^{(1)}(x_0)^5} \left[ 1 - \frac{10 f^{(2)}(x_0) f^{(3)}(x_0)}{f^{(1)}(x_0) f^{(4)}(x_0)} + \frac{15 f^{(2)}(x_0)^3}{f^{(1)}(x_0)^2 f^{(4)}(x_0)} \right] \\ &- \frac{f(x_0)^5 f^{(5)}(x_0)}{120 f^{(1)}(x_0)^6} \left[ -1 + \frac{15 f^{(2)}(x_0) f^{(4)}(x_0)}{f^{(1)}(x_0) f^{(5)}(x_0)} + \frac{10 f^{(3)}(x_0)^2}{f^{(1)}(x_0) f^{(5)}(x_0)} - \frac{105 f^{(2)}(x_0)^2 f^{(3)}(x_0)}{f^{(1)}(x_0)^2 f^{(5)}(x_0)} + \frac{105 f^{(2)}(x_0)^4}{f^{(1)}(x_0)^3 f^{(5)}(x_0)} \right] \\ &- \frac{f(x_0)^6 f^{(6)}(x_0)}{720 f^{(1)}(x_0)^7} \left[ 1 - \frac{21 f^{(2)}(x_0) f^{(5)}(x_0)}{f^{(1)}(x_0) f^{(6)}(x_0)} - \frac{35 f^{(3)}(x_0) f^{(4)}(x_0)}{f^{(1)}(x_0) f^{(6)}(x_0)} + \frac{210 f^{(2)}(x_0)^2 f^{(4)}(x_0)}{f^{(1)}(x_0)^2 f^{(6)}(x_0)} + \frac{280 f^{(2)}(x_0) f^{(3)}(x_0)^2}{f^{(1)}(x_0)^2 f^{(6)}(x_0)} - \frac{1260 f^{(2)}(x_0)^3 f^{(3)}(x_0)}{f^{(1)}(x_0)^3 f^{(6)}(x_0)} + \frac{945 f^{(2)}(x_0)^5}{f^{(1)}(x_0)^4 f^{(6)}(x_0)} \right] \\ &- \cdots + \frac{(-1)^n f(x_0)^n}{n!}\, D^{(n)} f^{-1}(y_0), \end{aligned}$$
where
$$D^{(n)} f^{-1}(y) = D^{(n-1)} \left[ \frac{1}{f^{(1)}(f^{-1}(y))} \right], \quad D f^{-1}(y) = \frac{1}{f^{(1)}(f^{-1}(y))}.$$
Proof. 
The general result for $x_n^I$ follows from the above discussion. The form for the error $\epsilon_n^I$ is consistent with the Lagrange form for the error in an $n$th-order Taylor series approximation, e.g., [13] (p. 880, Equation (25.2.25)). The explicit form for $x_n^I$ follows from the inverse function theorem and, for completeness, the evaluation of $D^{(k)} f^{-1}(y_0)$, $k \in \{1, 2, \ldots, 6\}$, is detailed in Appendix A. □

Notes

The convergence of an nth-order Schröder approximation is consistent with that of an nth-order Taylor series.
The first-order approximation is identical to the standard Newton–Raphson method result of
$$x_1^I = x_0 - \frac{f(x_0)}{f^{(1)}(x_0)}.$$
The second-order approximation is
$$x_2^I = x_0 - \frac{f(x_0)}{f^{(1)}(x_0)} - \frac{f(x_0)^2 f^{(2)}(x_0)}{2 f^{(1)}(x_0)^3}$$
and is a less complicated form than the second-order approximation specified by (4). This approximation is consistent with the second-order Adomian approximation for a root, e.g., [8].
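The first- and second-order approximations above can be checked numerically. The following sketch (our own illustration; the test function and starting point are arbitrary choices, not from the paper) implements $x_1^I$ and $x_2^I$ directly:

```python
import math

# Sketch (our own illustration) of the first- and second-order Schroder
# approximations of the first kind for a root of f: the first order is
# the Newton-Raphson result, the second order adds the correction term
# f(x0)^2 f''(x0) / (2 f'(x0)^3) noted above. Here f1 and f2 denote the
# first and second derivatives of f.
def schroder_root(f, f1, f2, x0):
    fx, d1, d2 = f(x0), f1(x0), f2(x0)
    x1 = x0 - fx / d1                           # first-order (Newton-Raphson)
    x2 = x1 - fx ** 2 * d2 / (2.0 * d1 ** 3)    # second-order correction
    return x1, x2

# Example: root of f(x) = cos(x) - x (the Dottie number) from x0 = 0.7.
x1, x2 = schroder_root(lambda x: math.cos(x) - x,
                       lambda x: -math.sin(x) - 1.0,
                       lambda x: -math.cos(x),
                       0.7)
```

For this example, the second-order approximation reduces the error of the first-order (Newton–Raphson) result by roughly an order of magnitude, consistent with the higher-order convergence noted above.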

2.2. Inverse Function Approximation

Consider the case of a well-defined function $f$ whose inverse, $f^{-1}$, is unknown. For $y_o = f(x_o)$ specified, the goal is to establish an approximation to $x_o = f^{-1}(y_o)$. As illustrated in Figure 2, the equivalent problem is that of finding the root of $f(x) - y_o$ given an initial approximation $x_0$ to the root. This is the basis for Schröder's approximations for an inverse function.
Theorem 2.
Schröder-Based Approximations for an Inverse Function. Consider a real function $f$ that is monotonic in the interval around a point $x_0$ and including the associated root $x_o$ of $g(x) = f(x) - y_o$. An $n$th-order Taylor series for $g^{-1}$, based on the point $(y_0, x_0)$, $y_0 = f(x_0) - y_o$, yields
$$f^{-1}(y_o) = x_0 + \sum_{k=1}^{n} \frac{(-1)^k (f(x_0) - y_o)^k}{k!}\, D^{(k)} f^{-1}(f(x_0)) + \epsilon_n^I(y_o), \quad n \in \{1, 2, \ldots\},$$
$$\epsilon_n^I(y_o) = \frac{(-1)^{n+1} (f(x_0) - y_o)^{n+1}}{(n+1)!}\, D^{(n+1)} f^{-1}(y_o + y_k), \quad y_k \in (0, y_0),$$
and the $n$th-order approximation to $x_o = f^{-1}(y_o)$ is
$$x_n^I = x_0 + \sum_{k=1}^{n} \frac{(-1)^k (f(x_0) - y_o)^k}{k!}\, D^{(k)} f^{-1}(f(x_0)), \quad n \in \{1, 2, \ldots\}.$$
It then follows that the nth-order approximation for  f 1 y o , denoted  f n 1 y o , is
$$\begin{aligned} f_n^{-1}(y_o) = x_0 &- \frac{f(x_0) - y_o}{f^{(1)}(x_0)} - \frac{(f(x_0) - y_o)^2 f^{(2)}(x_0)}{2 f^{(1)}(x_0)^3} - \frac{(f(x_0) - y_o)^3 f^{(3)}(x_0)}{6 f^{(1)}(x_0)^4} \left[ -1 + \frac{3 f^{(2)}(x_0)^2}{f^{(1)}(x_0) f^{(3)}(x_0)} \right] \\ &- \frac{(f(x_0) - y_o)^4 f^{(4)}(x_0)}{24 f^{(1)}(x_0)^5} \left[ 1 - \frac{10 f^{(2)}(x_0) f^{(3)}(x_0)}{f^{(1)}(x_0) f^{(4)}(x_0)} + \frac{15 f^{(2)}(x_0)^3}{f^{(1)}(x_0)^2 f^{(4)}(x_0)} \right] \\ &- \frac{(f(x_0) - y_o)^5 f^{(5)}(x_0)}{120 f^{(1)}(x_0)^6} \left[ -1 + \frac{15 f^{(2)}(x_0) f^{(4)}(x_0)}{f^{(1)}(x_0) f^{(5)}(x_0)} + \frac{10 f^{(3)}(x_0)^2}{f^{(1)}(x_0) f^{(5)}(x_0)} - \frac{105 f^{(2)}(x_0)^2 f^{(3)}(x_0)}{f^{(1)}(x_0)^2 f^{(5)}(x_0)} + \frac{105 f^{(2)}(x_0)^4}{f^{(1)}(x_0)^3 f^{(5)}(x_0)} \right] \\ &- \frac{(f(x_0) - y_o)^6 f^{(6)}(x_0)}{720 f^{(1)}(x_0)^7} \left[ 1 - \frac{21 f^{(2)}(x_0) f^{(5)}(x_0)}{f^{(1)}(x_0) f^{(6)}(x_0)} - \frac{35 f^{(3)}(x_0) f^{(4)}(x_0)}{f^{(1)}(x_0) f^{(6)}(x_0)} + \frac{210 f^{(2)}(x_0)^2 f^{(4)}(x_0)}{f^{(1)}(x_0)^2 f^{(6)}(x_0)} + \frac{280 f^{(2)}(x_0) f^{(3)}(x_0)^2}{f^{(1)}(x_0)^2 f^{(6)}(x_0)} - \frac{1260 f^{(2)}(x_0)^3 f^{(3)}(x_0)}{f^{(1)}(x_0)^3 f^{(6)}(x_0)} + \frac{945 f^{(2)}(x_0)^5}{f^{(1)}(x_0)^4 f^{(6)}(x_0)} \right] \\ &- \cdots + \frac{(-1)^n (f(x_0) - y_o)^n}{n!}\, D^{(n)} f^{-1}(f(x_0)). \end{aligned}$$
Proof. 
Whilst this result follows from Theorem 1 by considering $f(x) - y_o$ rather than $f(x)$, it is informative to provide a direct proof. With $g(x) = f(x) - y_o$, it follows that $g^{-1}(0) = x_o$. Consider an initial approximation $x_0$ to $x_o$. The Taylor series approximation for $g^{-1}$ at the point $(y_0, x_0)$, $y_0 = f(x_0) - y_o$, is
$$t_n^I(y) = g^{-1}(y_0) + (y - y_0)\, D g^{-1}(y_0) + \frac{(y - y_0)^2}{2}\, D^{(2)} g^{-1}(y_0) + \cdots + \frac{(y - y_0)^n}{n!}\, D^{(n)} g^{-1}(y_0).$$
For the case of $y = 0$, the definitions $g^{-1}(y_0) = x_0$ and $y_0 = g(x_0) = f(x_0) - y_o$ yield the $n$th-order approximation, $x_n^I$, to $x_o$ according to
$$x_n^I = t_n^I(0) = x_0 - (f(x_0) - y_o)\, D g^{-1}(y_0) + \frac{(f(x_0) - y_o)^2}{2}\, D^{(2)} g^{-1}(y_0) - \cdots + \frac{(-1)^n (f(x_0) - y_o)^n}{n!}\, D^{(n)} g^{-1}(y_0)$$
and with an error given by
$$\epsilon_n^I(y_o) = x_o - x_n^I = \frac{(-1)^{n+1} (f(x_0) - y_o)^{n+1}}{(n+1)!}\, D^{(n+1)} g^{-1}(y_k), \quad y_k \in (0, y_0).$$
Consider the point $x_0$ and the definition of $y_0$ according to $y_0 = g(x_0) = f(x_0) - y_o$. Thus, $y_o + y_0 = f(x_0)$ and, hence, $x_0 = f^{-1}(y_o + y_0) = g^{-1}(y_0)$. It then follows, by considering the derivative of $g^{-1}$ at the point $y_0$, that
$$\frac{d}{dy}\, g^{-1}(y_0) = \left. \frac{1}{g^{(1)}(x_0)} \right|_{x_0 = g^{-1}(y_0)} = \left. \frac{1}{f^{(1)}(x_0)} \right|_{x_0 = f^{-1}(y_o + y_0)} = \frac{d}{dy}\, f^{-1}(y_o + y_0) = \frac{d}{dy}\, f^{-1}(f(x_0)),$$
and it then follows that
$$D^{(k)} g^{-1}(y_0) = D^{(k)} f^{-1}(f(x_0)), \quad k \in \{1, 2, \ldots\}.$$
The required result, as stated by (14), then follows. □

2.3. Notes

Consider an initial approximation $f_0^{-1}$ for the inverse function $f^{-1}$. For a given value of $y$, the initial approximation $x_0$ to $f^{-1}(y)$ is given by $f_0^{-1}(y)$, and the first-order approximation for $f^{-1}$, consistent with (15), is
$$f_1^{-1}(y) = f_0^{-1}(y) - \frac{f(f_0^{-1}(y)) - y}{f^{(1)}(f_0^{-1}(y))}.$$
This result is identical to the approximation arising from the Newton–Raphson method. The second- and third-order approximations are:
$$f_2^{-1}(y) = f_0^{-1}(y) - \frac{f(f_0^{-1}(y)) - y}{f^{(1)}(f_0^{-1}(y))} - \frac{\left[ f(f_0^{-1}(y)) - y \right]^2 f^{(2)}(f_0^{-1}(y))}{2 f^{(1)}(f_0^{-1}(y))^3},$$
$$f_3^{-1}(y) = f_0^{-1}(y) - \frac{f(f_0^{-1}(y)) - y}{f^{(1)}(f_0^{-1}(y))} - \frac{\left[ f(f_0^{-1}(y)) - y \right]^2 f^{(2)}(f_0^{-1}(y))}{2 f^{(1)}(f_0^{-1}(y))^3} - \frac{\left[ f(f_0^{-1}(y)) - y \right]^3 f^{(3)}(f_0^{-1}(y))}{6 f^{(1)}(f_0^{-1}(y))^4} \left[ -1 + \frac{3 f^{(2)}(f_0^{-1}(y))^2}{f^{(1)}(f_0^{-1}(y)) f^{(3)}(f_0^{-1}(y))} \right].$$
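The first- and second-order inverse approximations above can be expressed generically. The following sketch is our own illustration (the exponential example and the crude initial approximation are arbitrary choices, not from the paper): given an initial estimate of the inverse, it is refined using $f$ and its first two derivatives.

```python
import math

# Sketch (our own illustration) of the first- and second-order
# Schroder-based inverse function approximations above: inv0 is an
# initial approximation to f^{-1}; f1 and f2 are the first and second
# derivatives of f.
def schroder_inverse(f, f1, f2, inv0, y, order=2):
    x0 = inv0(y)
    r = f(x0) - y                      # residual of the initial estimate
    out = x0 - r / f1(x0)              # first-order (Newton-Raphson form)
    if order >= 2:
        out -= r ** 2 * f2(x0) / (2.0 * f1(x0) ** 3)
    return out

# Example: approximate ln(2) as the inverse of exp at y = 2, starting
# from the deliberately crude initial approximation inv0(y) = y - 1.
approx = schroder_inverse(math.exp, math.exp, math.exp,
                          lambda y: y - 1.0, 2.0)
```

Even with a poor initial estimate (error of about 0.3 here), the second-order form reduces the error substantially, illustrating how accuracy improves with the order of the approximation.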

2.4. Notes on Convergence

2.4.1. Convergence of Schröder Approximations

Consider the illustration of f , f 1 , g , g 1 and the initial approximation f 0 1 shown in Figure 2. For fixed y , with a value y o , the goal is for the initial approximation x 0 = f 0 1 ( y o ) to f 1 ( y o ) to yield a value of y 0 = g x 0 = f x 0 y o which is such that the region of convergence of the Taylor series approximation for g 1 , based on the point y 0 , includes the origin. When this is the case, convergence of the Schröder approximations is guaranteed at the point y o . The goal is for the initial approximation f 0 1 to be such that this is the case for all values of y o in the domain of f 1 .
To establish a bound for the region of convergence for a Taylor series for g 1 , consider the Taylor series for g based on the point x 0 and for g 1 based on the point y 0 :
$$y = g(x_0) + (x - x_0)\, g^{(1)}(x_0) + \frac{(x - x_0)^2 g^{(2)}(x_0)}{2} + \cdots + \frac{(x - x_0)^n g^{(n)}(x_0)}{n!} + \cdots$$
$$x = g^{-1}(y_0) + (y - y_0)\, D g^{-1}(y_0) + \frac{(y - y_0)^2 D^{(2)} g^{-1}(y_0)}{2} + \cdots + \frac{(y - y_0)^n D^{(n)} g^{-1}(y_0)}{n!} + \cdots$$
With the definitions
$$\Delta_y = y - g(x_0) = y - y_0, \quad \Delta_x = x - g^{-1}(y_0) = x - x_0, \quad c_k = \frac{g^{(k)}(x_0)}{k!}, \quad d_k = \frac{D^{(k)} g^{-1}(y_0)}{k!}, \quad k \in \{1, 2, \ldots\},$$
it follows that
$$\Delta_y = c_1 \Delta_x + c_2 \Delta_x^2 + \cdots + c_n \Delta_x^n + \cdots, \qquad \Delta_x = d_1 \Delta_y + d_2 \Delta_y^2 + \cdots + d_n \Delta_y^n + \cdots$$
Equality in the second equation depends on $|\Delta_y| < \mathrm{roc}_{g^{-1}}(y_0)$, where $\mathrm{roc}_{g^{-1}}$ is the region of convergence for the Taylor series of $g^{-1}$ at the point $y_0$. The following bound, due to Landau, e.g., [14], is relevant:
$$\mathrm{roc}_{g^{-1}}(y_0) > \frac{\mathrm{roc}_g(x_0)^2\, g^{(1)}(x_0)^2}{6\, g_{\max}(x_0)},$$
where $\mathrm{roc}_g(x_0)$ is the region of convergence for the Taylor series for $g$ at the point $x_0$, $g_{\max}$ is the maximum value of the magnitude of $g$ within the region of convergence and $g^{(1)}(x_0)$ is assumed to be non-zero.
Thus, the requirement for the initial approximation $x_0 = f_0^{-1}(y_o)$ to $f^{-1}(y_o)$ is for the associated value $y_0 = f(x_0) - y_o$ to have a magnitude that is less than the region of convergence for $g^{-1}$ at the point $y_0$. A sufficient condition is
$$|y_0| < \frac{\mathrm{roc}_g(x_0)^2\, g^{(1)}(x_0)^2}{6\, g_{\max}(x_0)}.$$
The goal is for such a bound to hold for all values in the domain of the inverse function. The examples detailed below utilize initial approximations that lead to Schröder approximations with decreasing relative errors, which is indicative of convergence.

2.4.2. Relative Error Bound for First-Order Approximation

With an error $\epsilon_0^I(y)$ in the initial approximation $f_0^{-1}(y)$ to $f^{-1}(y)$, i.e., $f^{-1}(y) = f_0^{-1}(y) + \epsilon_0^I(y)$, it follows that the error, denoted $\epsilon_1^I(y)$, in the first-order approximation specified by (21), is
$$\epsilon_1^I(y) = f^{-1}(y) - f_1^{-1}(y) = -\epsilon_0^I(y) \cdot \frac{\epsilon_0^I(y)\, f^{(2)}(f^{-1}(y))}{2 f^{(1)}(f^{-1}(y))} \cdot \left[ \frac{1 - \dfrac{\epsilon_0^I(y)\, f^{(3)}(f^{-1}(y))}{f^{(2)}(f^{-1}(y))}}{1 - \dfrac{\epsilon_0^I(y)\, f^{(2)}(f^{-1}(y))}{f^{(1)}(f^{-1}(y))} + \dfrac{\epsilon_0^I(y)^2\, f^{(3)}(f^{-1}(y))}{2 f^{(1)}(f^{-1}(y))}} \right].$$
This result arises from the use of second-order Taylor series for $f(f_0^{-1}(y))$ and $f^{(1)}(f_0^{-1}(y))$ that are based on the point $f^{-1}(y)$.
With the bound
$$\left| \frac{\epsilon_0^I(y)\, f^{(2)}(f^{-1}(y))}{2 f^{(1)}(f^{-1}(y))} \right| < \Delta_1, \quad y \in \text{domain of } f^{-1},$$
the error for the first-order Schröder approximation is related to the error associated with the initial approximation f 0 1 according to
$$|\epsilon_1^I(y)| < \Delta_1 |\epsilon_0^I(y)|,$$
assuming the bracketed term in (29) is close to unity. With such approximations, the relationship between the relative error bounds of the original and the first-order Schröder approximations is
$$\mathrm{re}_{B,1} < \Delta_1 \cdot \mathrm{re}_{B,0}.$$
The validity of this relationship depends on the nature of the function being approximated and the initial approximation being used. For example, this relationship is accurate for the approximations noted below for the inverse Langevin function but not for the approximations considered for arcsine.

2.5. Special Case: Ratio of Two Functions

Consider the case where $f(x) = n(x)/d(x)$ is the ratio of two functions and the inverse $f^{-1}$ is to be approximated. The following preliminary result facilitates this.
Lemma 1.
Higher-Order Derivatives of the Ratio of Two Functions. For the case where $f$ is differentiable to all orders and defined according to $f(x) = n(x)/d(x)$, it is the case that
$$f^{(k)}(x) = \frac{n_k(x)}{d^{k+1}(x)}, \quad n_1(x) = d(x)\, n^{(1)}(x) - n(x)\, d^{(1)}(x), \quad n_k(x) = d(x)\, n_{k-1}^{(1)}(x) - k\, n_{k-1}(x)\, d^{(1)}(x).$$
Proof. 
The proof is detailed in Appendix B. □

Approximations for the Inverse of f(x) = n(x)/d(x)

The iterative formula detailed in Lemma 1 is the basis for the explicit results detailed in Theorem 3.
Theorem 3.
Approximation for the inverse of f(x) = n(x)/d(x). For the case where  f  is differentiable, up to the order of approximation being considered, and monotonic in the interval of interest, the first- to fourth-order approximations for the inverse of  f x = n ( x ) / d ( x ) , based on an initial approximating function,  f 0 1 , are:
$$f_1^{-1}(y) = f_0^{-1}(y) - \left[ n(f_0^{-1}(y)) - y\, d(f_0^{-1}(y)) \right] \frac{d(f_0^{-1}(y))}{n_1(f_0^{-1}(y))},$$
$$f_2^{-1}(y) = f_0^{-1}(y) - \left[ n(f_0^{-1}(y)) - y\, d(f_0^{-1}(y)) \right] \frac{d(f_0^{-1}(y))}{n_1(f_0^{-1}(y))} - \left[ n(f_0^{-1}(y)) - y\, d(f_0^{-1}(y)) \right]^2 \frac{n_2(f_0^{-1}(y))\, d(f_0^{-1}(y))}{2\, n_1^3(f_0^{-1}(y))},$$
$$f_3^{-1}(y) = f_2^{-1}(y) - \left[ n(f_0^{-1}(y)) - y\, d(f_0^{-1}(y)) \right]^3 \frac{n_3(f_0^{-1}(y))\, d(f_0^{-1}(y))}{6\, n_1^4(f_0^{-1}(y))} \left[ -1 + \frac{3\, n_2^2(f_0^{-1}(y))}{n_1(f_0^{-1}(y))\, n_3(f_0^{-1}(y))} \right],$$
$$f_4^{-1}(y) = f_3^{-1}(y) - \left[ n(f_0^{-1}(y)) - y\, d(f_0^{-1}(y)) \right]^4 \frac{n_4(f_0^{-1}(y))\, d(f_0^{-1}(y))}{24\, n_1^5(f_0^{-1}(y))} \left[ 1 - \frac{10\, n_2(f_0^{-1}(y))\, n_3(f_0^{-1}(y))}{n_1(f_0^{-1}(y))\, n_4(f_0^{-1}(y))} + \frac{15\, n_2^3(f_0^{-1}(y))}{n_1^2(f_0^{-1}(y))\, n_4(f_0^{-1}(y))} \right].$$
Proof. 
These results follow from Theorem 2 and the derivative results stated in Lemma 1 and Appendix C. □

2.6. Newton–Raphson Iteration

Given an initial approximation $f_0^{-1}$ for $f^{-1}$, Newton–Raphson iteration yields the approximation $f_1^{-1}$, as specified by (21). Newton–Raphson iteration, based on $f_1^{-1}$, yields the second-order approximation
$$f_2^{-1}(y) = f_1^{-1}(y) - \frac{f(f_1^{-1}(y)) - y}{f^{(1)}(f_1^{-1}(y))} = f_0^{-1}(y) - \frac{f(f_0^{-1}(y)) - y}{f^{(1)}(f_0^{-1}(y))} - \frac{f\!\left( f_0^{-1}(y) - \dfrac{f(f_0^{-1}(y)) - y}{f^{(1)}(f_0^{-1}(y))} \right) - y}{f^{(1)}\!\left( f_0^{-1}(y) - \dfrac{f(f_0^{-1}(y)) - y}{f^{(1)}(f_0^{-1}(y))} \right)}.$$
A third iteration yields:
$$f_3^{-1}(y) = f_2^{-1}(y) - \frac{f(f_2^{-1}(y)) - y}{f^{(1)}(f_2^{-1}(y))},$$
with $f_2^{-1}(y)$, as expanded above, substituted throughout; the fully expanded form, written solely in terms of $f_0^{-1}$, is correspondingly more complex,
and similarly for higher-order iteration. Note the complexity associated with functions of functions, which increases with iteration. For the convergent case, Newton–Raphson iteration exhibits quadratic convergence.

2.7. Notes

Whilst the geometry associated with the Newton–Raphson method for establishing an approximation to the root of a function is compelling, its natural generalization via higher-order Taylor series is problematic. In contrast, the indirect approach of utilizing a Taylor series based on the inverse function leads to explicit approximation expressions—Schröder’s approximations of the first kind—for all orders. There is pedagogical value in such an approach.
Figure 3 illustrates the potential interaction between high-order approximations, for example, via a high-order Schröder approximation, and utilizing iteration, for example, via Newton–Raphson iteration, to establish highly accurate analytical approximations for an inverse function given an initial low-accuracy approximation. A combination of a first-order Newton–Raphson iteration based on a modest-order Schröder approximation can lead to a good compromise between accuracy and complexity.
Note that a Schröder approximation is a means to establish a higher-accuracy approximation given an initial approximation with modest accuracy. The new improved approximation can then be used as the base approximation for Newton–Raphson iteration with, potentially, quadratic convergence.
The following four sections detail the establishment of accurate analytical approximations, based on initial approximations with modest relative error bounds, for arcsine, the inverse of x sin ( x ) , the inverse Langevin function and the Lambert W function.
In many instances, the initial approximation for the inverse function to be approximated is defined in a custom manner. Point-based approximations such as Taylor series expansions, for example, often do not lead to suitable initial approximations as such approximations of a fixed order, whilst having a low error at the point of approximation, generally have an increasing error, and potentially an increasing relative error, as the distance from the point of approximation increases. This situation is illustrated in Figure 2 of [15], where the relative errors in Taylor series approximations for arcsine are detailed.

3. Example I: Analytical Approximations for Arcsine

Given an approximation for arcsine, approximations for arccosine and arctangent readily follow from the relationships, e.g., [16] (p. 57, Equations (1.623) and (1.624)):
$$\operatorname{acos}(y) = \frac{\pi}{2} - \operatorname{asin}(y), \qquad \operatorname{acos}(y) = \operatorname{asin}\left( \sqrt{1 - y^2} \right), \quad 0 \leq y \leq 1,$$
$$\operatorname{atan}(y) = \operatorname{asin}\left( \frac{y}{\sqrt{1 + y^2}} \right) = \frac{\pi}{2} - \operatorname{asin}\left( \frac{1}{\sqrt{1 + y^2}} \right), \quad 0 \leq y < \infty.$$
Naturally, there are many approximations for arcsine, and an overview of published approximations and new results for arcsine, arccosine and arctangent is provided in [15]. Graphs of arcsine and arccosine are shown in Figure 4.

3.1. General Schröder-Based Approximations

Consider $y = f(x) = \sin(x)$, $0 \leq x < \pi/2$, and an initial approximation $f_0^{-1}$ for the inverse function $x = f^{-1}(y) = \operatorname{asin}(y)$, $0 \leq y < 1$. Consistent with Theorem 2, the first- to fourth-order general approximations for arcsine are:
$$f_1^{-1}(y) = f_0^{-1}(y) - \frac{\sin(f_0^{-1}(y)) - y}{\cos(f_0^{-1}(y))},$$
$$f_2^{-1}(y) = f_0^{-1}(y) - \frac{\sin(f_0^{-1}(y)) - y}{\cos(f_0^{-1}(y))} + \frac{\sin(f_0^{-1}(y)) \left[ \sin(f_0^{-1}(y)) - y \right]^2}{2 \cos^3(f_0^{-1}(y))},$$
$$f_3^{-1}(y) = f_0^{-1}(y) - \frac{\sin(f_0^{-1}(y)) - y}{\cos(f_0^{-1}(y))} + \frac{\sin(f_0^{-1}(y)) \left[ \sin(f_0^{-1}(y)) - y \right]^2}{2 \cos^3(f_0^{-1}(y))} - \frac{\left[ \sin(f_0^{-1}(y)) - y \right]^3}{6 \cos^3(f_0^{-1}(y))} \left[ 1 + \frac{3 \sin^2(f_0^{-1}(y))}{\cos^2(f_0^{-1}(y))} \right],$$
$$f_4^{-1}(y) = f_3^{-1}(y) + \frac{3 \sin(f_0^{-1}(y)) \left[ \sin(f_0^{-1}(y)) - y \right]^4}{8 \cos^5(f_0^{-1}(y))} \left[ 1 + \frac{5 \sin^2(f_0^{-1}(y))}{3 \cos^2(f_0^{-1}(y))} \right].$$
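The general first-order form above is straightforward to evaluate numerically. In the sketch below, the initial approximation is a crude cubic guess of our own choosing (not one of the published approximations considered next), purely to illustrate the error reduction:

```python
import math

# Sketch of the general first-order Schroder approximation for arcsine
# above: f1(y) = x0 - (sin(x0) - y)/cos(x0) with x0 = f0(y). The cubic
# initial guess f0 is our own illustrative choice, not one of the
# published approximations discussed in the text.
def arcsine_schroder1(f0, y):
    x0 = f0(y)
    return x0 - (math.sin(x0) - y) / math.cos(x0)

f0 = lambda y: y + 0.3 * y ** 3     # crude illustrative initial guess
approx = arcsine_schroder1(f0, 0.5)
```

At $y = 0.5$, the initial guess has an error of roughly $1.4 \times 10^{-2}$, while the first-order Schröder refinement is accurate to better than $10^{-3}$.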

3.1.1. Initial Approximations

Consider the published approximations for arcsine [17], [15] (Equations (10) and (31)) and [13] (p. 81, Equation (4.4.46)):
$$f_{0,1}^{-1}(y) = \frac{\pi y}{2 + \sqrt{1 - y^2}},$$
$$f_{0,2}^{-1}(y) = \alpha_0 \left[ 1 - \sqrt{1 - y} \right] + \alpha_1 y + \alpha_2 y^2, \quad \alpha_0 = \frac{\pi}{2} - \frac{1306}{10000}, \quad \alpha_1 = \frac{10653}{10000} - \frac{\pi}{4}, \quad \alpha_2 = \frac{\pi}{4} - \frac{9347}{10000},$$
$$f_{0,3}^{-1}(y) = \frac{\pi}{2} - \sqrt{\frac{\pi^2}{4} - \pi y + y^2 + c_{2,3} y^3 + c_{2,4} y^4 + c_{2,5} y^5},$$
$$c_{2,3} = \frac{16}{3} + 6\pi - \frac{5\pi^2}{2}, \quad c_{2,4} = -\frac{35}{3} - 8\pi + \frac{15\pi^2}{4}, \quad c_{2,5} = \frac{16}{3} + 3\pi - \frac{3\pi^2}{2},$$
$$f_{0,4}^{-1}(y) = \frac{\pi}{2} - \sqrt{1 - y} \left[ \alpha_0 + \alpha_1 y + \alpha_2 y^2 + \cdots + \alpha_7 y^7 \right],$$
$$\alpha_0 = \frac{\pi}{2}, \quad \alpha_1 = -0.2145988016, \quad \alpha_2 = 0.0889789874, \quad \alpha_3 = -0.0501743046, \quad \alpha_4 = 0.0308918810,$$
$$\alpha_5 = -0.0170881256, \quad \alpha_6 = 0.0066700901, \quad \alpha_7 = -0.0012624911,$$
which have the respective relative error bounds, for the interval $[0, 1]$, of $4.72 \times 10^{-2}$, $3.62 \times 10^{-3}$, $3.64 \times 10^{-4}$ and $3.04 \times 10^{-6}$.

3.1.2. Explicit Approximations

For example, the third approximation given by (48), when used in the general first- and second-order Schröder approximations specified by (42) and (43), yields the following approximations:
$$f_1^{-1}(y) = \frac{\pi}{2} - \sqrt{P(y)} - \frac{\cos\left( \sqrt{P(y)} \right) - y}{\sin\left( \sqrt{P(y)} \right)}, \qquad P(y) = \frac{\pi^2}{4} - \pi y + y^2 + c_{2,3} y^3 + c_{2,4} y^4 + c_{2,5} y^5,$$
$$f_2^{-1}(y) = \frac{\pi}{2} - \sqrt{P(y)} - \frac{\cos\left( \sqrt{P(y)} \right) - y}{\sin\left( \sqrt{P(y)} \right)} + \frac{\cos\left( \sqrt{P(y)} \right) \left[ \cos\left( \sqrt{P(y)} \right) - y \right]^2}{2 \sin^3\left( \sqrt{P(y)} \right)},$$
which have, respectively, relative error bounds of $1.78 \times 10^{-8}$ and $3.68 \times 10^{-12}$ for $0 \leq y < 1$.

3.1.3. Results

The relative error bounds associated with the first- to fourth-order Schröder-based approximations, as specified by (42) to (45), are tabulated in Table 1 for the case of the initial approximations f 0 1 being specified by (46) to (49). The relative errors associated with the second, third and fourth approximations are illustrated in Figure 5.
From the results detailed in Table 1, and for a set initial approximation, the clear improvement achieved by utilizing a higher-order approximation form is evident. Also evident is the improvement, for a set order of approximation, achieved by utilizing an initial approximation with a lower relative error bound.

3.2. Newton–Raphson Iteration

Consider an initial approximation $f_0^{-1}$ for arcsine. Consistent with (21), (38) and (39), Newton–Raphson iteration leads to the following result:
$$\operatorname{asin}(y) = s_0(y) + s_1(y) + s_2(y) + \cdots,$$
$$s_i(y) = -\frac{\sin\left[ s_0(y) + s_1(y) + \cdots + s_{i-1}(y) \right] - y}{\cos\left[ s_0(y) + s_1(y) + \cdots + s_{i-1}(y) \right]}, \quad i \in \{1, 2, \ldots\}, \qquad s_0(y) = f_0^{-1}(y),$$
where
$$s_1(y) = -\frac{\sin(f_0^{-1}(y)) - y}{\cos(f_0^{-1}(y))},$$
$$s_2(y) = -\frac{\sin\left[ f_0^{-1}(y) + s_1(y) \right] - y}{\cos\left[ f_0^{-1}(y) + s_1(y) \right]} = -\frac{\sin\left[ f_0^{-1}(y) - \dfrac{\sin(f_0^{-1}(y)) - y}{\cos(f_0^{-1}(y))} \right] - y}{\cos\left[ f_0^{-1}(y) - \dfrac{\sin(f_0^{-1}(y)) - y}{\cos(f_0^{-1}(y))} \right]},$$
$$s_3(y) = -\frac{\sin\left[ f_0^{-1}(y) + s_1(y) + s_2(y) \right] - y}{\cos\left[ f_0^{-1}(y) + s_1(y) + s_2(y) \right]},$$
$$s_4(y) = -\frac{\sin\left[ f_0^{-1}(y) + s_1(y) + s_2(y) + s_3(y) \right] - y}{\cos\left[ f_0^{-1}(y) + s_1(y) + s_2(y) + s_3(y) \right]}.$$
Explicit general first-, second- and third-order approximations are:
$$f_1^{-1}(y) = f_0^{-1}(y) - \frac{\sin(f_0^{-1}(y)) - y}{\cos(f_0^{-1}(y))},$$
$$f_2^{-1}(y) = f_0^{-1}(y) - \frac{\sin(f_0^{-1}(y)) - y}{\cos(f_0^{-1}(y))} - \frac{\sin\left[ f_0^{-1}(y) - \dfrac{\sin(f_0^{-1}(y)) - y}{\cos(f_0^{-1}(y))} \right] - y}{\cos\left[ f_0^{-1}(y) - \dfrac{\sin(f_0^{-1}(y)) - y}{\cos(f_0^{-1}(y))} \right]},$$
$$f_3^{-1}(y) = f_2^{-1}(y) - \frac{\sin(f_2^{-1}(y)) - y}{\cos(f_2^{-1}(y))},$$
with $f_2^{-1}(y)$, as expanded above, substituted throughout in the third-order case.
With f 0 1 specified by (46) to (49), the relative error bounds associated with these approximations are detailed in Table 1.

3.3. Hybrid Approximation

A first-order Newton–Raphson iteration, based on the second-order Schröder approximation f 2 1 as specified by (43), is
$$\operatorname{asin}(y) \approx f_2^{-1}(y) - \frac{\sin(f_2^{-1}(y)) - y}{\cos(f_2^{-1}(y))}, \qquad f_2^{-1}(y) = f_0^{-1}(y) - \frac{\sin(f_0^{-1}(y)) - y}{\cos(f_0^{-1}(y))} + \frac{\sin(f_0^{-1}(y)) \left[ \sin(f_0^{-1}(y)) - y \right]^2}{2 \cos^3(f_0^{-1}(y))}.$$
For the case where $f_0^{-1}$, as defined by (48), is used in this equation, the relative error bound is $2.69 \times 10^{-24}$. Thus, the result is an analytical approximation of modest complexity but with high accuracy. For comparison, $f_0^{-1}$, as defined by (48), has a relative error bound of $3.64 \times 10^{-4}$, and the associated second-order Schröder approximation (43) has a relative error bound of $3.68 \times 10^{-12}$.
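The hybrid scheme can be sketched as follows. The crude initial guess here is our own illustrative choice rather than the approximation of (48), so the attainable accuracy below only illustrates the mechanism, not the quoted bounds:

```python
import math

# Sketch of the hybrid scheme: one Newton-Raphson step applied on top of
# the second-order Schroder approximation for arcsine. The crude cubic
# initial guess below is our own illustrative choice (an assumption for
# demonstration), not the published initial approximation of the text.
def nr_step(x, y):
    return x - (math.sin(x) - y) / math.cos(x)

def schroder2(x0, y):
    r, c, s = math.sin(x0) - y, math.cos(x0), math.sin(x0)
    return x0 - r / c + r ** 2 * s / (2.0 * c ** 3)

y = 0.5
x0 = y + 0.3 * y ** 3               # crude initial approximation
hybrid = nr_step(schroder2(x0, y), y)
```

Even from this poor starting point, the combination of a second-order (cubically convergent) refinement followed by one quadratically convergent Newton–Raphson step drives the error well below $10^{-10}$, illustrating the compounding of convergence orders described above.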

3.4. Applications

3.4.1. Lower Bound

The approximation $f_{0,3}^{-1}$ given by (48) is a lower bound for arcsine [15] (Equation (112)). Simulation results indicate that the first- to fourth-order approximations, as given by (42) to (45), and based on $f_{0,3}^{-1}$, are lower bounds with improved accuracy and with the relative error bounds detailed in Table 1. Thus, for example:
$$f_2(y) \leq \operatorname{asin}(y),$$
where $f_2$ is the second-order approximation defined by (51) and with a relative error bound of $3.68 \times 10^{-12}$. Upper bounds can be defined based on the lower bounds, as detailed in [18] (Lemma 1).

3.4.2. Integral

Consider the result
$$\int_0^y \operatorname{asin}(t)\, dt = \sqrt{1 - y^2} - 1 + y \operatorname{asin}(y), \quad 0 < y < 1.$$
It then follows, based on the first-order approximation given by (42), that
$$\int_0^y \operatorname{asin}(t)\, dt \approx \sqrt{1 - y^2} - 1 + y \left[ f_0^{-1}(y) - \frac{\sin(f_0^{-1}(y)) - y}{\cos(f_0^{-1}(y))} \right], \quad 0 < y < 1,$$
for any function f 0 1 that is an approximation to arcsine. The use of the approximation f 0 , 3 1 (see (48)) in this equation yields the approximation, for 0 < y < 1 , of
0 y a s i n t d t 1 y 2 1 + π y 2 y π 2 4 π y + y 2 + c 2,3 y 3 + c 2,4 y 4 + c 2,5 y 5 y sin π 2 4 π y + y 2 + c 2,3 y 3 + c 2,4 y 4 + c 2,5 y 5 y 2 sin π 2 4 π y + y 2 + c 2,3 y 3 + c 2,4 y 4 + c 2,5 y 5
which has a relative error bound for the interval ( 0 , 1 ) of 3.66 × 10 8 .

4. Example II: Analytical Approximations for Inverse of x − Sin(x)

Whilst $f(x) = x - \sin(x)$ is a simple elementary function, establishing its inverse is not straightforward, as $f^{(1)}(x) = 0$ for $x \in \{0, 2\pi, 4\pi, \ldots\}$ and derivatives of all orders of $f^{-1}$ are undefined at the origin. Graphs of $f$ and $f^{-1}$ are shown in Figure 6.
As $f(x) = x - \sin(x)$ is the summation of a linear function and a periodic function, and as it is anti-symmetric around the point $(\pi, \pi)$ when considering the interval $(0, 2\pi)$, it is sufficient to find an approximation for $f^{-1}$ over the interval $(0, \pi]$. The proofs for the required results:
$$f^{-1}(y) = f^{-1}(y - 2k\pi) + 2k\pi, \quad 2k\pi \le y < 2k\pi + 2\pi; \qquad f^{-1}(y) = 2\pi - f^{-1}(2\pi - y), \quad y \in (\pi, 2\pi],$$
are detailed in Appendix D.
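These two reduction results translate directly into code. In the sketch below, a bisection-based reference inverse on $[0, \pi]$ stands in for any approximation of $f^{-1}$ on that interval; the wrapper extends it to arbitrary $y \ge 0$ via the periodicity and anti-symmetry relations:

```python
import math

def finv_base(y):
    # Any approximation of f^{-1} on [0, pi] works here; for illustration a
    # high-accuracy bisection inverse of f(x) = x - sin(x) is used.
    lo, hi = 0.0, math.pi
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if mid - math.sin(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def finv(y):
    # Reduce y to [0, 2*pi) by periodicity, then to [0, pi] by anti-symmetry.
    k, r = divmod(y, 2 * math.pi)
    if r <= math.pi:
        return finv_base(r) + 2 * math.pi * k
    return 2 * math.pi - finv_base(2 * math.pi - r) + 2 * math.pi * k

for y in (1.0, 4.0, 9.0):
    x = finv(y)
    print(y, x, x - math.sin(x))   # f(finv(y)) recovers y
```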

4.1. Initial Approximation for f 1

To define an initial approximation with a bounded relative error, consider the Taylor series at the origin for $f(x) = x - \sin(x)$:
$$y = f(x) = \frac{x^3}{6} - \frac{x^5}{5!} + \frac{x^7}{7!} - \cdots$$
By utilizing the first term in this series, an initial approximation for f 1 of
$$f^{-1}(y) \approx 6^{1/3}\, y^{1/3}$$
can be defined that is accurate for $y \ll 1$. An affine component can be added to this approximation to ensure equality of the new approximation to $f^{-1}$ at the end point, $\pi$, of the interval of interest. As $f^{-1}(\pi) = \pi$, the approximation is
$$f^{-1}(y) \approx c_0 y^{1/3} + c_1 y, \qquad c_0 = 6^{1/3}, \quad c_1 = 1 - \frac{6^{1/3}}{\pi^{2/3}},$$
and has a relative error bound for the interval 0 , π of 1.89 × 10 2 . Some optimized generalizations are:
$$f_{0,1}^{-1}(y) = c_0 y^{1/3} + c_1 y + c_{22}\, y^2(\pi - y), \qquad c_{22} = -\frac{133}{10000},$$
$$f_{0,2}^{-1}(y) = c_0 y^{1/3} + c_1 y + c_{32}\, y^2(\pi - y) + c_{33}\, y^3(\pi - y), \qquad c_{32} = -\frac{305}{10000}, \quad c_{33} = \frac{105}{10000},$$
$$f_{0,3}^{-1}(y) = c_0 y^{1/3} + c_1 y + c_2 \sin(y), \qquad c_2 = -\frac{449}{10000},$$
with respective relative error bounds, for the interval 0 , π , of 8.61 × 10 3 , 5.74 × 10 3 and 1.36 × 10 3 .
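The quoted bounds are easy to check numerically. The following sketch evaluates grid maxima (estimates, not true suprema) of the relative error for the two-term approximation and for $f_{0,3}^{-1}$, against a bisection reference inverse; the coefficient $c_2$ is taken with magnitude $449/10000$ and with the sign that reproduces the quoted bound:

```python
import math

def finv(y):
    # Reference inverse of f(x) = x - sin(x) on [0, pi] by bisection.
    lo, hi = 0.0, math.pi
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if mid - math.sin(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

c0 = 6 ** (1 / 3)
c1 = 1 - c0 / math.pi ** (2 / 3)
c2 = -449 / 10000

ys = [math.pi * i / 1000 for i in range(1, 1001)]
err_2term = max(abs(c0 * y ** (1 / 3) + c1 * y - finv(y)) / finv(y) for y in ys)
err_03 = max(abs(c0 * y ** (1 / 3) + c1 * y + c2 * math.sin(y) - finv(y)) / finv(y)
             for y in ys)
print(err_2term, err_03)   # grid maxima near the quoted bounds
```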

4.2. General Schröder-Based Approximations

Consistent with Theorem 2, the first- to fourth-order approximations for $f^{-1}$ over the interval $(0, \pi]$, based on an initial approximation function of the form $f_0^{-1}$ and with the shorthand $u = f_0^{-1}(y)$, are:
$$f_1^{-1}(y) = u - \frac{u - \sin(u) - y}{1 - \cos(u)}$$
$$f_2^{-1}(y) = f_1^{-1}(y) - \frac{\sin(u)\,[u - \sin(u) - y]^2}{2[1 - \cos(u)]^3}$$
$$f_3^{-1}(y) = f_2^{-1}(y) + \frac{\cos(u)\,[u - \sin(u) - y]^3}{6[1 - \cos(u)]^4}\left[1 - \frac{3\sin^2(u)}{\cos(u)[1 - \cos(u)]}\right]$$
$$f_4^{-1}(y) = f_3^{-1}(y) + \frac{\sin(u)\,[u - \sin(u) - y]^4}{24[1 - \cos(u)]^5}\left[1 + \frac{10\cos(u)}{1 - \cos(u)} - \frac{15\sin^2(u)}{[1 - \cos(u)]^2}\right]$$

Examples

Based on the approximation $f_{0,3}^{-1}(y)$ specified in (71), and writing $u = c_0 y^{1/3} + c_1 y + c_2 \sin(y)$, the first- and second-order approximations for the interval $(0, \pi]$, arising from (72) and (73), respectively, are:
$$f^{-1}(y) \approx u - \frac{u - \sin(u) - y}{1 - \cos(u)}$$
$$f^{-1}(y) \approx u - \frac{u - \sin(u) - y}{1 - \cos(u)} - \frac{\sin(u)\,[u - \sin(u) - y]^2}{2[1 - \cos(u)]^3}$$
The respective relative error bounds associated with these approximations are 1.13 × 10 6 and 2.44 × 10 9 .
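A corresponding numerical check of the first- and second-order forms is sketched below (a bisection reference inverse, grid maxima over $(0, \pi]$, and $c_2$ signed so as to reproduce the quoted bounds):

```python
import math

def finv(y):
    # Reference inverse of f(x) = x - sin(x) on [0, pi] by bisection.
    lo, hi = 0.0, math.pi
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if mid - math.sin(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

c0, c1, c2 = 6 ** (1 / 3), 1 - 6 ** (1 / 3) / math.pi ** (2 / 3), -449 / 10000

def schroder(y, order):
    u = c0 * y ** (1 / 3) + c1 * y + c2 * math.sin(y)   # initial approximation
    r = u - math.sin(u) - y                             # residual f(u) - y
    x = u - r / (1 - math.cos(u))                       # first order
    if order >= 2:
        x -= math.sin(u) * r**2 / (2 * (1 - math.cos(u)) ** 3)
    return x

ys = [math.pi * i / 1000 for i in range(1, 1001)]
e1 = max(abs(schroder(y, 1) - finv(y)) / finv(y) for y in ys)
e2 = max(abs(schroder(y, 2) - finv(y)) / finv(y) for y in ys)
print(e1, e2)   # grid maxima consistent with the quoted error bounds
```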

4.3. Newton–Raphson Iteration

Second-order Newton–Raphson iteration, consistent with (38) and based on the approximation f 0 1 ( y ) , yields the general approximation form
$$f_2^{-1}(y) = A - \frac{A - \sin(A) - y}{1 - \cos(A)}, \qquad A = f_0^{-1}(y) - \frac{f_0^{-1}(y) - \sin[f_0^{-1}(y)] - y}{1 - \cos[f_0^{-1}(y)]},$$
where $A$ is the first-order Newton–Raphson iterate.
which has a relative error bound of 7.92 × 10 13 when the approximation specified in (71) is utilized for f 0 1 . The resulting approximation is of comparable complexity to the third-order Schröder approximation detailed in (74), which yields a similar relative error bound of 5.52 × 10 12 when the initial approximation specified in (71) is used.

4.4. Results

The relative error bounds associated with the approximations defined by (69) to (71) are tabulated in Table 2.
The relative error bounds over the intervals $(k\pi, (k+1)\pi)$, $k \in \{1, 2, \ldots\}$, for the inverse of $x - \sin(x)$ are, naturally, lower. This is illustrated in Figure 7, where the relative errors for the approximations are shown over the interval $(0, 4\pi)$.

4.5. Applications

The general integral formula for an inverse function (1) leads to
$$\int_0^y f^{-1}(\lambda)\,d\lambda = y f^{-1}(y) - \frac{[f^{-1}(y)]^2}{2} - \cos[f^{-1}(y)] + 1$$
and approximations arise from utilizing a given approximation for f 1 . For example, the approximation f 0 , 3 1 defined by (71) leads to
$$\int_0^y f^{-1}(\lambda)\,d\lambda \approx y u - \frac{u^2}{2} - \cos(u) + 1, \qquad u = c_0 y^{1/3} + c_1 y + c_2 \sin(y), \quad 0 < y \le \pi,$$
which has a relative error bound of 3.20 × 10 6 for 0 , π . Second, the first-order approximation, as specified by (76), yields, for 0 < y π :
$$\int_0^y f^{-1}(\lambda)\,d\lambda \approx y A - \frac{A^2}{2} - \cos(A) + 1, \qquad A = u - \frac{u - \sin(u) - y}{1 - \cos(u)}, \quad u = c_0 y^{1/3} + c_1 y + c_2 \sin(y),$$
which has a relative error bound of 2.23 × 10 12 for 0 , π .

5. Example III: Analytical Approximations for Inverse Langevin Function

The Langevin function is defined according to
$$y = L(x) = \begin{cases} \coth(x) - \dfrac{1}{x}, & x \in (0, \infty) \\ 0, & x = 0 \end{cases} \qquad L(-x) = -L(x),$$
and its inverse, L 1 , has been the subject of research interest over recent decades, e.g., [1,2]. Graphs of L and L 1 are shown in Figure 8 for the positive real line case. The use of the standard exponential definition for the hyperbolic cotangent function leads to
$$y = L(x) = \frac{x - 1 + (1 + x)e^{-2x}}{x(1 - e^{-2x})} = \frac{n(x)}{d(x)}$$
where $n(x) = x - 1 + (1 + x)e^{-2x}$ and $d(x) = x(1 - e^{-2x})$. This form implies, for fixed $y$, that $x = L^{-1}(y)$ is the solution of
$$e^{-2x} = \frac{1 - x + xy}{1 + x + xy}$$

5.1. Approximations

For x , y small, a Taylor series approach, e.g., [19], yields the approximation
$$L^{-1}(y) \approx 3y + \frac{9y^3}{5} + \frac{297 y^5}{175} + \frac{1539 y^7}{875} + \cdots, \qquad 0 \le y \ll 1.$$
For large x , consistent with y approaching one, the left-hand side in (84) becomes vanishingly small leading to the approximation
$$L^{-1}(y) \approx \frac{1}{1 - y}, \qquad y \approx 1, \; y < 1.$$
The issue, then, is how to incorporate both approximations into a simple expression that is valid for $y \in [0, 1)$. Representative approximations for $L^{-1}$ include:
$$L_{0,1}^{-1}(y) = \frac{3y}{1 - y}\left[1 - \frac{24y}{25} + \frac{22y^2}{75}\right],$$
$$L_{0,2}^{-1}(y) = 3y + \frac{y^2}{5}\sin\!\left(\frac{7y}{2}\right) + \frac{y^3}{1 - y},$$
$$L_{0,3}^{-1}(y) = \frac{y(3 - y^2)}{1 - y^2} - \frac{y^{10/3}}{2} + 3y^5\left(y - \frac{76}{100}\right)(y - 1),$$
and are defined, respectively, in [20,21,22]. Their respective relative error bounds, associated with the interval 0 , 1 ) , are: 9.69 × 10 3 , 1.79 × 10 3 and 7.22 × 10 4 . The papers [1,2,20,23,24], for example, detail alternative approximations.
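These published approximants are straightforward to benchmark. The sketch below checks the second of them (Petrosyan's form) against a bisection-based reference inverse; the grid maximum is an estimate of, not a proof of, the quoted bound:

```python
import math

def langevin(x):
    # L(x) = coth(x) - 1/x, with the removable singularity at 0 handled
    # via the series x/3 - x^3/45 for small x.
    if abs(x) < 1e-4:
        return x / 3 - x**3 / 45
    return 1 / math.tanh(x) - 1 / x

def langevin_inv(y):
    # Reference inverse by bisection on the monotonically increasing L.
    lo, hi = 0.0, 1.0
    while langevin(hi) < y:
        hi *= 2
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if langevin(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def petrosyan(y):
    # The second approximant listed in the text.
    return 3 * y + (y**2 / 5) * math.sin(7 * y / 2) + y**3 / (1 - y)

ys = [i / 1000 for i in range(1, 1000)]
err = max(abs(petrosyan(y) - langevin_inv(y)) / langevin_inv(y) for y in ys)
print(err)   # grid maximum close to the quoted bound
```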

5.2. General Schröder-Based Approximations

The general approximation forms for the inverse Langevin function that are detailed below are based on the form $L(x) = n(x)/d(x)$, as given by (83). The result $f^{(k)}(x) = n_k(x)/d^{k+1}(x)$, stated in Lemma 1, yields the following results:
$$n_1(x) = 1 - 2e^{-2x} - 4x^2 e^{-2x} + e^{-4x}$$
$$n_2(x) = -2 + 6e^{-2x} + 8x^3 e^{-2x} - 6e^{-4x} + 8x^3 e^{-4x} + 2e^{-6x}$$
$$n_3(x) = 6 - 24e^{-2x} - 16x^4 e^{-2x} + 36e^{-4x} - 64x^4 e^{-4x} - 24e^{-6x} - 16x^4 e^{-6x} + 6e^{-8x}$$
$$n_4(x) = -24 + 120e^{-2x} + 32x^5 e^{-2x} - 240e^{-4x} + 352x^5 e^{-4x} + 240e^{-6x} + 352x^5 e^{-6x} - 120e^{-8x} + 32x^5 e^{-8x} + 24e^{-10x}$$
These functions can be used in the general inverse function approximations stated in Theorem 3. With an initial approximation of f 0 1 , the first- and second-order approximations for L 1 are:
$$f_1^{-1}(y) = \left. x - \frac{x(x - 1 - xy) + 2x(1 + xy)e^{-2x} - x(1 + x + xy)e^{-4x}}{1 - 2e^{-2x} - 4x^2 e^{-2x} + e^{-4x}} \right|_{x = f_0^{-1}(y)}$$
$$f_2^{-1}(y) = \left. \frac{2x\,n_1^3(x) - 2n_1^2(x)\,d(x)[n(x) - y\,d(x)] - n_2(x)\,d(x)[n(x) - y\,d(x)]^2}{2 n_1^3(x)} \right|_{x = f_0^{-1}(y)}$$
Higher-order approximations follow in a similar manner.
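The first-order form can be exercised numerically. The sketch below uses Petrosyan's approximant (the second of the published forms above) as the initial approximation — an arbitrary choice for illustration — together with $n$, $d$ and $n_1$ as defined above:

```python
import math

def langevin(x):
    return x / 3 - x**3 / 45 if abs(x) < 1e-4 else 1 / math.tanh(x) - 1 / x

def langevin_inv(y):
    # Reference inverse by bisection.
    lo, hi = 0.0, 1.0
    while langevin(hi) < y:
        hi *= 2
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if langevin(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def n(x):   # numerator of L(x) = n(x)/d(x)
    return x - 1 + (1 + x) * math.exp(-2 * x)

def d(x):   # denominator of L(x)
    return x * (1 - math.exp(-2 * x))

def n1(x):  # n_1(x) = 1 - (2 + 4x^2)e^{-2x} + e^{-4x}
    return 1 - (2 + 4 * x**2) * math.exp(-2 * x) + math.exp(-4 * x)

def x0(y):  # initial approximation (Petrosyan's form)
    return 3 * y + (y**2 / 5) * math.sin(7 * y / 2) + y**3 / (1 - y)

def f1_inv(y):
    # First-order Schroder form: x - d(x)[n(x) - y d(x)]/n1(x) at x = x0(y).
    x = x0(y)
    return x - d(x) * (n(x) - y * d(x)) / n1(x)

ys = [i / 100 for i in range(1, 96)]
e0 = max(abs(x0(y) - langevin_inv(y)) / langevin_inv(y) for y in ys)
e1 = max(abs(f1_inv(y) - langevin_inv(y)) / langevin_inv(y) for y in ys)
print(e0, e1)   # the first-order form improves on the initial approximation
```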

5.3. Results

The relative error bounds, based on (87) to (89), for approximations to the inverse Langevin function are tabulated in Table 3. The relative errors associated with the original approximations, (87) to (89), and the associated first-order approximations (94) are shown in Figure 9.

5.4. Newton–Raphson Iteration

A second-order Newton–Raphson iteration, which is equivalent to a first-order Newton–Raphson iteration based on the first-order approximation $f_1^{-1}$ defined by (94), yields the approximation
$$f_{NR2}^{-1}(y) = f_1^{-1}(y) - \frac{f(f_1^{-1}(y)) - y}{f^{(1)}(f_1^{-1}(y))} = f_1^{-1}(y) - \frac{[n(f_1^{-1}(y)) - y\,d(f_1^{-1}(y))]\,d(f_1^{-1}(y))}{n_1(f_1^{-1}(y))}$$
For the case of initial approximations defined by (87) to (89), i.e.,
$$f_1^{-1}(y) = \left. x - \frac{x(x - 1 - xy) + 2x(1 + xy)e^{-2x} - x(1 + x + xy)e^{-4x}}{1 - 2e^{-2x} - 4x^2 e^{-2x} + e^{-4x}} \right|_{x \in \{L_{0,1}^{-1}(y),\, L_{0,2}^{-1}(y),\, L_{0,3}^{-1}(y)\}}$$
the relative error bounds for the interval 0 , 1 ) , respectively, are 8.80 × 10 9 , 1.03 × 10 11 and 1.09 × 10 13 .

5.5. Applications

As $\int_0^x L(\lambda)\,d\lambda = \ln[\sinh(x)] - \ln(x)$, the general integral result, as given by (1), yields
$$\int_0^y L^{-1}(\lambda)\,d\lambda = y L^{-1}(y) + \ln[L^{-1}(y)] - \ln\{\sinh[L^{-1}(y)]\}, \qquad y \in [0, 1),$$
and approximations then follow. For example, the approximation f 1 1   (see (94)) yields the relative error bounds for the integral of the inverse Langevin function, respectively, of 2.72 × 10 9 , 2.02 × 10 12 and 8.78 × 10 14 for the cases of f 0 1 specified by (87) to (89).
Direct integration of the original approximations, as given by (87) to (89), yields the approximations
$$\int_0^y L^{-1}(\lambda)\,d\lambda \approx -y + y^2 - \frac{22y^3}{75} - \ln(1 - y),$$
$$\int_0^y L^{-1}(\lambda)\,d\lambda \approx -\frac{16}{1715} - y + y^2 - \frac{y^3}{3} - \ln(1 - y) + \frac{16 - 98y^2}{1715}\cos\!\left(\frac{7y}{2}\right) + \frac{8y}{245}\sin\!\left(\frac{7y}{2}\right),$$
$$\int_0^y L^{-1}(\lambda)\,d\lambda \approx \frac{y^2}{2} - \frac{3y^{13/3}}{26} + \frac{19y^6}{50} - \frac{132y^7}{175} + \frac{3y^8}{8} - \ln(1 - y^2),$$
with relative error bounds of 6.43 × 10 3 , 1.14 × 10 3 and 5.34 × 10 4 . Use of the approximations, as given by (87) to (89), in (98) yields the respective relative error bounds of 6.71 × 10 5 , 2.22 × 10 6 and 3.25 × 10 7 .

Inverse Langevin Function as Zero Crossing Time of an Impulse Response

Rearranging (84) implies, for fixed y and 0 < y < 1 , that x = L 1 y is the solution of
$$1 - x + xy - (1 + x + xy)e^{-2x} = 0.$$
The function $h(t) = 1 - kt - [1 + (2 - k)t]e^{-2t}$, $t > 0$, arising from the definition $k = 1 - y$ in this equation, is consistent with the impulse response of a linear system with a transfer function defined according to
$$H(s) = \frac{1}{s} - \frac{k}{s^2} - \frac{1/2}{1 + s/2} - \frac{(1 - k/2)/2}{(1 + s/2)^2}$$
The zero crossing time of the impulse response is $L^{-1}(1 - k)$ for $0 < k < 1$. The impulse response is shown in Figure 10 for the cases of $k \in \{1/4, 1/2, 3/4\}$. The zero crossing times can be approximated via the approximations detailed above.
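The correspondence between the zero crossing time and the inverse Langevin function can be confirmed directly: locating the first positive zero of $h$ by bisection and evaluating $L$ there recovers $1 - k$. A minimal sketch:

```python
import math

def langevin(x):
    return x / 3 - x**3 / 45 if abs(x) < 1e-4 else 1 / math.tanh(x) - 1 / x

def h(t, k):
    # Impulse response h(t) = 1 - k t - [1 + (2 - k) t] e^{-2t}, whose first
    # positive zero is the point where L(t) = 1 - k.
    return 1 - k * t - (1 + (2 - k) * t) * math.exp(-2 * t)

def zero_crossing(k):
    # h is positive for small t > 0 and eventually negative; bisect the
    # sign change.
    lo, hi = 1e-6, 1.0
    while h(hi, k) > 0:
        hi *= 2
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if h(mid, k) > 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

for k in (0.25, 0.5, 0.75):
    t = zero_crossing(k)
    print(k, t, langevin(t))   # L(t) equals 1 - k at the crossing
```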

6. Example IV: Analytical Approximations for Lambert Function

The Lambert W function, denoted $W$ for the principal branch and real-valued case, is a generalization of the logarithm function, and its approximation has received increasing attention in the literature, e.g., [25,26,27]. It is defined as the inverse of $y = f(x) = x e^x$ for the case of $x \ge -1$, $y \ge -1/e$, i.e.,
x = W y = f 1 ( y )
A graph of the Lambert W function is shown in Figure 11.

6.1. Approximations

The Lambert W function has widespread applications, e.g., [28,29], and, accordingly, its approximation has received significant interest, with the following approximations, for example, being proposed:
$$f_{0,1}^{-1}(y) = (1 + \delta)\ln\!\left[\frac{(6/5)y}{\ln\!\left(\dfrac{(12/5)y}{\ln(1 + 12y/5)}\right)}\right] - \delta\ln\!\left[\frac{2y}{\ln(1 + 2y)}\right], \qquad \delta = 0.4586887,$$
f 0 , 2 1 y = 1 + a ln 1 + b 1 + e y 1 + c ln 1 + 1 + e y a = 2.036 , c = e 1 / a 1 2 / a 1 ln ( 2 ) e 1 / a , b = 2 a + c
$$f_{0,3}^{-1}(y) = \ln\!\left[\frac{1 + 3y + \dfrac{y\ln(1 + y)}{1 + \ln(1 + y)}}{1 + \dfrac{\ln(1 + 2y)}{1 + \ln(1 + y)}}\right],$$
f 0 , 4 1 y = ln 1 + 4 y + y ln 1 + 2 y 1 + ln 1 + y + y ln 1 + y 2 + ln 1 + 2 y 1 + ln 1 + y 1 + ln 1 + y 1 + ln 1 + 2 y 1 + ln 1 + y 1 + ln 1 + 3 y + y ln 1 + y 1 + ln 1 + y 1 + ln 1 + 2 y 1 + ln 1 + y        
These approximations, respectively, are defined by [30] (Equation (15)), [31] (Equations (19) and (20)), [26] (Equation (33)) and [26] (Equation (35)). The respective relative error bounds for these approximations, and for the interval 0 , ) , are: 1.96 × 10 3 , 4.53 × 10 3 , 1.33 × 10 3 and 7.22 × 10 7 . Useful overviews of published results can be found in [25,26,27,31,32].

6.2. General Schröder-Based Approximations

Based on the results stated in Theorem 2, and with the shorthand $u = f_0^{-1}(y)$ for the initial approximation, the first- to fourth-order approximations for the Lambert W function are:
$$f_1^{-1}(y) = u - \frac{u - y e^{-u}}{1 + u} = \frac{u^2 + y e^{-u}}{1 + u}$$
$$f_2^{-1}(y) = f_1^{-1}(y) - \frac{(2 + u)(u - y e^{-u})^2}{2(1 + u)^3}$$
$$f_3^{-1}(y) = f_2^{-1}(y) + \frac{(3 + u)(u - y e^{-u})^3}{6(1 + u)^4}\left[1 - \frac{3(2 + u)^2}{(1 + u)(3 + u)}\right]$$
$$f_4^{-1}(y) = f_3^{-1}(y) - \frac{(4 + u)(u - y e^{-u})^4}{24(1 + u)^5}\left[1 - \frac{10(2 + u)(3 + u)}{(1 + u)(4 + u)} + \frac{15(2 + u)^3}{(1 + u)^2(4 + u)}\right]$$
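The first- and second-order forms are easily verified numerically. The sketch below assumes a deliberately crude initial approximation $x_0 = \ln(1 + y)$ (not one of (105) to (108)) so that the order-by-order error reduction is visible:

```python
import math

def schroder_W(y, x0, order=2):
    # First- and second-order Schroder forms for W built on an initial
    # approximation x0, for f(x) = x e^x.
    r = x0 - y * math.exp(-x0)        # residual in the form used in the text
    w = x0 - r / (1 + x0)             # first order
    if order >= 2:
        w -= (2 + x0) * r**2 / (2 * (1 + x0) ** 3)
    return w

def W_true(y):
    # Reference value by bisection on x e^x = y, for y >= 0.
    lo, hi = 0.0, 1.0
    while hi * math.exp(hi) < y:
        hi *= 2
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if mid * math.exp(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

ys = [0.1 * i for i in range(1, 101)]
e0 = max(abs(math.log(1 + y) - W_true(y)) / W_true(y) for y in ys)
e1 = max(abs(schroder_W(y, math.log(1 + y), 1) - W_true(y)) / W_true(y) for y in ys)
e2 = max(abs(schroder_W(y, math.log(1 + y), 2) - W_true(y)) / W_true(y) for y in ys)
print(e0, e1, e2)   # the error shrinks order by order
```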

6.2.1. Special Form

For the case consistent with the approximations stated in (107) and (108), where
$$f_0^{-1}(y) = \ln\!\left[\frac{p(y)}{q(y)}\right],$$
the first- and second-order approximations, respectively, become
$$f_1^{-1}(y) = \frac{\ln^2[p(y)/q(y)] + y\,q(y)/p(y)}{1 + \ln[p(y)/q(y)]}$$
$$f_2^{-1}(y) = f_1^{-1}(y) - \frac{\{2 + \ln[p(y)/q(y)]\}\{\ln[p(y)/q(y)] - y\,q(y)/p(y)\}^2}{2\{1 + \ln[p(y)/q(y)]\}^3}$$

6.2.2. Explicit Approximation

The use of f 0 , 3 1 y (see (107)) in the first-order form, as given by (109) or (114), yields the approximation
With $p(y)$ and $q(y)$ defined, consistent with (107), according to
$$p(y) = 1 + 3y + \frac{y\ln(1 + y)}{1 + \ln(1 + y)}, \qquad q(y) = 1 + \frac{\ln(1 + 2y)}{1 + \ln(1 + y)},$$
the approximation is
$$f_1^{-1}(y) = \frac{\ln^2[p(y)/q(y)] + y\,q(y)/p(y)}{1 + \ln[p(y)/q(y)]},$$
which has a relative error bound for 0 , of 5.12 × 10 6 .

6.3. Hybrid Approximations

Consider a first-order Newton–Raphson iteration based on the second-order approximation f 2 1 specified by (110), with f 0 1 defined by (107). The relative error bound associated with f 0 1 is 1.33 × 10 3 ; the relative error bound associated with f 2 1 is 2.93 × 10 8 . The first-order Newton–Raphson approximation is
$$W(y) \approx f_2^{-1}(y) - \frac{f_2^{-1}(y) - y e^{-f_2^{-1}(y)}}{1 + f_2^{-1}(y)}$$
and has a relative error bound of 3.44 × 10 15 .

6.4. Results

The relative error bounds associated with Schröder and Newton–Raphson approximations are tabulated in Table 4. The relative errors for selected results are shown in Figure 12.

6.5. Applications

The approximations f 0 , 3 1 and f 0 , 4 1 , as given by (107) and (108), are upper bounds for the Lambert W function [26]. Simulation results indicate that the approximations, as given by (109) to (112), and based on these approximations, are also upper bounds with improved accuracy, and the bounds are detailed in Table 4. Lower bounded functions can be defined based on these upper bounds, as detailed in [18] (Lemma 1). Thus, for example, the second-order approximation given by (110) yields the bounds
$$\frac{1}{1 + \varepsilon_B}\left[f_0^{-1}(y) - \frac{f_0^{-1}(y) - y e^{-f_0^{-1}(y)}}{1 + f_0^{-1}(y)} - \frac{[2 + f_0^{-1}(y)][f_0^{-1}(y) - y e^{-f_0^{-1}(y)}]^2}{2[1 + f_0^{-1}(y)]^3}\right] \le W(y) \le f_0^{-1}(y) - \frac{f_0^{-1}(y) - y e^{-f_0^{-1}(y)}}{1 + f_0^{-1}(y)} - \frac{[2 + f_0^{-1}(y)][f_0^{-1}(y) - y e^{-f_0^{-1}(y)}]^2}{2[1 + f_0^{-1}(y)]^3}, \qquad f_0^{-1} \in \{f_{0,3}^{-1}, f_{0,4}^{-1}\},$$
where ε B is the bound associated with the approximation and as given in Table 4. For example, when f 0 1 y is given by f 0 , 3 1 y (see (107)), ε B = 2.93 × 10 8 and the relative error bounds associated with the upper and lower bounded approximations are both 2.93 × 10 8 .
The general integral result given by (1), along with the integral result
$$\int_0^y x e^x\,dx = 1 + (y - 1)e^y$$
yields
$$\int_0^y f^{-1}(\lambda)\,d\lambda = y f^{-1}(y) + [1 - f^{-1}(y)]e^{f^{-1}(y)} - 1, \qquad y > 0,$$
and approximations then follow. For example, the relative error bounds for the interval 0 , associated with directly utilizing the approximations specified by (105) to (108), respectively, are: 1.70 × 10 5 , 2.86 × 10 4 (for the interval 0 , 10 20 ), 6.17 × 10 6 and 1.81 × 10 12 . When the approximation f 1 1 (see (109)) is utilized, the relative error bounds for the integral of the Lambert W function, respectively, are 3.84 × 10 9 , 1.55 × 10 6 (for the interval 0 , 10 20 ), 1.11 × 10 10 and 8.37 × 10 24 for the cases of f 0 1 specified by (105) to (108). The integrals of the original approximations, as given by (105) to (108), are not known.
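The integral identity itself can be validated numerically by comparing quadrature of a bisection-based $W$ with the closed form; a minimal sketch:

```python
import math

def W(y):
    # Principal-branch Lambert W for y >= 0 via bisection on x e^x = y.
    lo, hi = 0.0, 1.0
    while hi * math.exp(hi) < y:
        hi *= 2
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if mid * math.exp(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

y = 3.0
# Left side: composite Simpson quadrature of W over [0, y].
n = 2000
hstep = y / n
quad = W(0.0) + W(y) + sum((4 if i % 2 else 2) * W(i * hstep) for i in range(1, n))
quad *= hstep / 3
# Right side: closed form y*W(y) + [1 - W(y)]*e^{W(y)} - 1.
w = W(y)
closed = y * w + (1 - w) * math.exp(w) - 1
print(quad, closed)   # the two sides agree
```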

7. Conclusions

In this paper, Schröder approximations of the first kind, modified for the inverse function approximation case, were utilized to establish general analytical approximation forms for an inverse function. Such general forms can be used to establish arbitrarily accurate analytical approximations, with a set relative error bound, for an inverse function when an initial approximation, typically with low accuracy, is known. Approximations for arcsine, the inverse of x s i n ( x ) , the inverse Langevin function and the Lambert W function were used to illustrate the approach. Several applications were detailed.
Newton–Raphson iteration can also be used to yield analytical approximations to a given inverse function of arbitrary accuracy given an initial approximation with low to moderate accuracy but, in general, with a more complicated form. The use of a first-order Newton–Raphson iteration based on a Schröder approximation of a set order can lead to approximations that represent a good compromise between accuracy and complexity.
With respect to the root approximation of a function, Schröder approximations of the first kind, based on the inverse of a function, have an advantage over the corresponding generalization of the standard Newton–Raphson method, as explicit solutions for all orders of approximation can be obtained.

Further Research

The four examples considered illustrate the potential for utilizing Schröder approximations to establish accurate analytical approximations for an inverse function. As this approach is general, there is potential to establish useful analytical approximations for other inverse functions. The starting point is to find an initial approximation with a sufficiently low relative error bound over the domain of approximation. In general, custom approaches are used and advances in finding such approximations are of interest.
The relative error bound, as defined by (32), for the first-order Schröder approximation arises from two assumptions and the use of second-order Taylor series approximations that underpin (29). The use of first-order Taylor series leads, in general, to inaccurate results, and the complexity associated with the use of second-order Taylor series approximations complicates analysis. Further research to establish general relative error bounds, in terms of the relative bound of the initial approximation, for first-, second- and higher-order Schröder approximations is warranted.

Funding

This research did not receive external funding.

Acknowledgments

The author is pleased to acknowledge the support of A. Zoubir, SPG, Technische Universität Darmstadt, Darmstadt, Germany, who hosted a visit where part of the research underpinning this paper was completed. The author is appreciative of the feedback provided by the reviewers and the Academic Editor, which has led to an improved paper.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Proof of Theorem 1

Useful references include [33] and the Faà di Bruno formula, e.g., [34]. A direct proof follows from the inverse function theorem, which states, for a real, monotonic and differentiable function, that
$$D f^{-1}(y) = \left. \frac{1}{f^{(1)}(x)} \right|_{x = f^{-1}(y)} = \frac{1}{f^{(1)}(f^{-1}(y))}$$
Successive differentiation and use of the chain rule yield:
$$D^{(2)} f^{-1}(y) = \left. -\frac{f^{(2)}(x)}{[f^{(1)}(x)]^3} \right|_{x = f^{-1}(y)}$$
$$D^{(3)} f^{-1}(y) = \left. -\frac{f^{(3)}(x)}{[f^{(1)}(x)]^4} + \frac{3[f^{(2)}(x)]^2}{[f^{(1)}(x)]^5} \right|_{x = f^{-1}(y)}$$
$$D^{(4)} f^{-1}(y) = \left. -\frac{f^{(4)}(x)}{[f^{(1)}(x)]^5} + \frac{10 f^{(2)}(x) f^{(3)}(x)}{[f^{(1)}(x)]^6} - \frac{15[f^{(2)}(x)]^3}{[f^{(1)}(x)]^7} \right|_{x = f^{-1}(y)}$$
$$D^{(5)} f^{-1}(y) = \left. -\frac{f^{(5)}(x)}{[f^{(1)}(x)]^6} + \frac{15 f^{(2)}(x) f^{(4)}(x)}{[f^{(1)}(x)]^7} + \frac{10[f^{(3)}(x)]^2}{[f^{(1)}(x)]^7} - \frac{105[f^{(2)}(x)]^2 f^{(3)}(x)}{[f^{(1)}(x)]^8} + \frac{105[f^{(2)}(x)]^4}{[f^{(1)}(x)]^9} \right|_{x = f^{-1}(y)}$$
$$D^{(6)} f^{-1}(y) = \left. -\frac{f^{(6)}(x)}{[f^{(1)}(x)]^7} + \frac{21 f^{(2)}(x) f^{(5)}(x)}{[f^{(1)}(x)]^8} + \frac{35 f^{(3)}(x) f^{(4)}(x)}{[f^{(1)}(x)]^8} - \frac{210[f^{(2)}(x)]^2 f^{(4)}(x)}{[f^{(1)}(x)]^9} - \frac{280 f^{(2)}(x)[f^{(3)}(x)]^2}{[f^{(1)}(x)]^9} + \frac{1260[f^{(2)}(x)]^3 f^{(3)}(x)}{[f^{(1)}(x)]^{10}} - \frac{945[f^{(2)}(x)]^5}{[f^{(1)}(x)]^{11}} \right|_{x = f^{-1}(y)}$$

Appendix B. Proof of Lemma 1

A general formula for f ( k ) , where f x = n ( x ) / d ( x ) , can be obtained from Leibniz’s rule for differentiation of the product of two functions, see, for example, [35]. The proof for the stated iterative algorithm follows from the differentiation of f x = n ( x ) / d ( x ) , which yields
$$f^{(1)}(x) = \frac{n^{(1)}(x)\,d(x) - d^{(1)}(x)\,n(x)}{d^2(x)} = \frac{n_1(x)}{d^2(x)}$$
where $n_1(x) = d(x)\,n^{(1)}(x) - n(x)\,d^{(1)}(x)$. Differentiation of $f^{(1)}$ yields
$$f^{(2)}(x) = \frac{n_1^{(1)}(x)\,d(x) - 2 d^{(1)}(x)\,n_1(x)}{d^3(x)} = \frac{n_2(x)}{d^3(x)}$$
where $n_2(x) = d(x)\,n_1^{(1)}(x) - 2 n_1(x)\,d^{(1)}(x)$. Differentiation of $f^{(2)}$ yields
$$f^{(3)}(x) = \frac{n_2^{(1)}(x)\,d(x) - 3 d^{(1)}(x)\,n_2(x)}{d^4(x)} = \frac{n_3(x)}{d^4(x)}$$
where $n_3(x) = d(x)\,n_2^{(1)}(x) - 3 n_2(x)\,d^{(1)}(x)$. The required general relationship of
$$f^{(k)}(x) = \frac{n_k(x)}{d^{k+1}(x)}, \qquad n_k(x) = d(x)\,n_{k-1}^{(1)}(x) - k\,n_{k-1}(x)\,d^{(1)}(x),$$
then follows.

Appendix C. Derivative of $f^{(k)}$ for the Case of $f(x) = n(x)/d(x)$

With $f(x) = n(x)/d(x)$, the result $f^{(k)}(x) = n_k(x)/d^{k+1}(x)$, stated in Lemma 1, yields the following results for the derivatives of $f^{-1}$:
$$D f^{-1}(y) = \left. \frac{1}{f^{(1)}(x)} \right|_{x = f^{-1}(y)} = \left. \frac{d^2(x)}{n_1(x)} \right|_{x = f^{-1}(y)}$$
$$D^{(2)} f^{-1}(y) = \left. -\frac{f^{(2)}(x)}{[f^{(1)}(x)]^3} \right|_{x = f^{-1}(y)} = \left. -\frac{d^3(x)\,n_2(x)}{n_1^3(x)} \right|_{x = f^{-1}(y)}$$
$$D^{(3)} f^{-1}(y) = \left. -\frac{d^4(x)\,n_3(x)}{n_1^4(x)}\left[1 - \frac{3 n_2^2(x)}{n_1(x)\,n_3(x)}\right] \right|_{x = f^{-1}(y)}$$
$$D^{(4)} f^{-1}(y) = \left. -\frac{d^5(x)\,n_4(x)}{n_1^5(x)}\left[1 - \frac{10 n_2(x)\,n_3(x)}{n_1(x)\,n_4(x)} + \frac{15 n_2^3(x)}{n_1^2(x)\,n_4(x)}\right] \right|_{x = f^{-1}(y)}$$
$$D^{(5)} f^{-1}(y) = \left. -\frac{d^6(x)\,n_5(x)}{n_1^6(x)}\left[1 - \frac{15 n_2(x)\,n_4(x)}{n_1(x)\,n_5(x)} - \frac{10 n_3^2(x)}{n_1(x)\,n_5(x)} + \frac{105 n_2^2(x)\,n_3(x)}{n_1^2(x)\,n_5(x)} - \frac{105 n_2^4(x)}{n_1^3(x)\,n_5(x)}\right] \right|_{x = f^{-1}(y)}$$

Appendix D. Inverse of x-Sin(x): Use of Periodicity and Anti-Symmetry

Establishing the inverse of f x = x s i n ( x ) is facilitated by the following two results:
Lemma 2.
Inverse of a Function Comprising a Linear and a Periodic Component. Consider a function  f  that is monotonically increasing from zero and comprises a linear component plus a periodic component, with a period,  x p , such that
$$f(x) = \beta x + f_p(x), \qquad f_p(x) = f_p(x + k x_p), \quad f_p(0) = 0, \qquad k \in \{0, 1, 2, \ldots\}, \; x > 0.$$
For the case of $x_1 = x + k x_p$, $0 \le x < x_p$, $k \in \{0, 1, 2, \ldots\}$, it follows that
$$y_1 = f(x_1) = f(x + k x_p) = k\beta x_p + f(x) = y + k y_p,$$
where $y_p = \beta x_p$ and $y = f(x)$. The inverse function then satisfies the relationship
$$f^{-1}(y + k y_p) = f^{-1}(y) + \frac{k y_p}{\beta}, \qquad 0 \le y < y_p, \quad k \in \{0, 1, 2, \ldots\}.$$
For the case of $f(x) = x - \sin(x)$, consistent with $\beta = 1$, $x_p = 2\pi$ and $y_p = 2\pi$, it follows that
$$f^{-1}(y) = f^{-1}(y - 2k\pi) + 2k\pi, \qquad 2k\pi \le y < 2k\pi + 2\pi.$$
Proof. 
The first result follows very simply:
$$f(x + k x_p) = \beta(x + k x_p) + f_p(x + k x_p) = k\beta x_p + f(x).$$
The second result follows from the definitions $y_1 = y + k y_p$, $x_1 = x + k x_p$, $x_1 = f^{-1}(y_1)$ and $x = f^{-1}(y)$, which imply that
$$x_1 = f^{-1}(y_1) = f^{-1}(y + k y_p), \qquad x_1 = x + k x_p = f^{-1}(y) + \frac{k y_p}{\beta}.$$
Equating these two results yields the required result: $f^{-1}(y + k y_p) = f^{-1}(y) + k y_p/\beta$.
For the case of f x = x s i n ( x ) , consistent with β = 1 , x p = 2 π and y p = 2 π , it follows that
$$f^{-1}(z + 2k\pi) = f^{-1}(z) + 2k\pi, \quad 0 \le z < 2\pi, \qquad f^{-1}(y) = f^{-1}(y - 2k\pi) + 2k\pi, \quad 2k\pi \le y < 2k\pi + 2\pi,$$
assuming $z = y - 2k\pi$.  □
Lemma 3.
Use of Anti-Symmetric Nature of f in Defining f−1. For the case of $f(x) = x - \sin(x)$, which is anti-symmetric over the interval $[0, 2\pi]$ around the point $(\pi, \pi)$, it follows that
$$f(x) = 2\pi - f(2\pi - x), \qquad x \in [\pi, 2\pi],$$
$$f^{-1}(y) = 2\pi - f^{-1}(2\pi - y), \qquad y \in [\pi, 2\pi].$$
Proof. 
Consider the illustration shown in Figure A1. From the definition f x = x s i n ( x ) , it follows that
$$f(\pi + \Delta) = \pi + \Delta + \sin(\Delta), \qquad f(\pi - \Delta) = \pi - \Delta - \sin(\Delta), \qquad \Delta \in [0, \pi].$$
Thus, $f(\pi + \Delta) + f(\pi - \Delta) = 2\pi$, and with $x = \pi + \Delta$, $x \in [\pi, 2\pi]$, the first result, $f(x) = 2\pi - f(2\pi - x)$, $x \in [\pi, 2\pi]$, follows.
In a similar manner, consider $\delta$, $\delta \in [0, \pi]$, defined such that $\pi + \delta = f(\pi + \Delta)$ and $\pi - \delta = f(\pi - \Delta)$, i.e., $\delta = \Delta + \sin(\Delta)$. It then follows that
$$f^{-1}(\pi + \delta) = \pi + \Delta, \qquad f^{-1}(\pi - \delta) = \pi - \Delta, \qquad \Delta, \delta \in [0, \pi].$$
Thus, $f^{-1}(\pi + \delta) + f^{-1}(\pi - \delta) = 2\pi$. With $y = \pi + \delta$, $y \in [\pi, 2\pi]$, the second required result
$$f^{-1}(y) = 2\pi - f^{-1}(2\pi - y)$$
follows. □
Figure A1. Illustration of the definitions Δ and δ and the anti-symmetric nature of f and f 1 around the point π , π .

References

  1. Jedynak, R. New facts concerning the approximation of the inverse Langevin function. J. Non-Newtonian Fluid Mech. 2017, 249, 8–25. [Google Scholar] [CrossRef]
  2. Jedynak, R. A comprehensive study of the mathematical methods used to approximate the inverse Langevin function. Math. Mech. Solids 2018, 24, 1992–2016. [Google Scholar] [CrossRef]
  3. Gdawiec, K.; Kotarski, W.; Lisowska, A. Polynomiography based on the nonstandard Newton-like root finding methods. Abstr. Appl. Anal. 2015, 2015, 797594. [Google Scholar] [CrossRef]
  4. Ypma, T.J. Historical development of the Newton–Raphson method. SIAM Rev. 1995, 37, 531–551. [Google Scholar] [CrossRef]
  5. Kalantari, B.; Kalantari, I.; Zaare-Nahandi, R. A basic family of iteration functions for polynomial root finding and its characterizations. J. Comput. Appl. Math. 1997, 80, 209–226. [Google Scholar] [CrossRef]
  6. Petković, M.; Herceg, D. On rediscovered iteration methods for solving equations. J. Comput. Appl. Math. 1999, 107, 275–284. [Google Scholar] [CrossRef]
  7. Amat, S.; Busquier, S.; Gutiérrez, J.M. Geometric constructions of iterative functions to solve nonlinear equations. J. Comput. Appl. Math. 2003, 157, 197–205. [Google Scholar] [CrossRef]
  8. Abbasbandy, S. Improving Newton–Raphson method for nonlinear equations by modified Adomian decomposition method. Appl. Math. Comput. 2003, 145, 887–893. [Google Scholar] [CrossRef]
  9. Chun, C. Iterative methods improving Newton’s method by the decomposition method. Comput. Math. Appl. 2005, 50, 1559–1568. [Google Scholar] [CrossRef]
  10. Noor, M.A.; Gupta, V. Modified Householder iterative method free from second derivatives for nonlinear equations. Appl. Math. Comput. 2007, 190, 1701–1706. [Google Scholar] [CrossRef]
  11. Dubeau, F. Polynomial and rational approximations and the link between Schröder’s processes of the first and second kind. Abstr. Appl. Anal. 2014, 2014, 719846. [Google Scholar] [CrossRef]
  12. Schröder, E. Über unendlich viele Algorithmen zur Auflösung der Gleichungen. Math. Ann. 1870, 2, 317–365. [Google Scholar] [CrossRef]
  13. Abramowitz, M.; Stegun, I.A. (Eds.) Handbook of Mathematical Functions with Formulas, Graphs and Mathematical Tables; Dover: Mineola, NY, USA, 1964. [Google Scholar]
  14. Copson, E.T. An Introduction to the Theory of Functions of a Complex Variable; Oxford University Press: Oxford, UK, 1935; pp. 121–123. [Google Scholar]
  15. Howard, R.M. Radial Based Approximations for Arcsine, Arccosine, Arctangent and Applications. AppliedMath 2023, 3, 343–394. [Google Scholar] [CrossRef]
  16. Gradshteyn, I.S.; Ryzhik, I.M. Tables of Integrals, Series and Products, 7th ed.; Jeffery, A., Zwillinger, D., Eds.; Academic Press: Cambridge, MA, USA, 2007. [Google Scholar]
  17. Fink, A.M. Two inequalities. Univ. Beograd. Publ. Elektrotehn. Fak. Ser. Mat. 1995, 6, 49–50. [Google Scholar]
  18. Howard, R.M. Arbitrarily accurate analytical approximations for the Error function. Math. Comput. Appl. 2022, 27, 14. [Google Scholar] [CrossRef]
  19. Itskov, M.; Dargazany, R.; Hornes, K. Taylor expansion of the inverse function with application to the Langevin function. Math. Mech. Solids 2011, 17, 693–701. [Google Scholar] [CrossRef]
  20. Howard, R.M. Analytical approximations for the inverse Langevin function via linearization, error approximation and iteration. Rheol. Acta 2020, 59, 521–544. [Google Scholar] [CrossRef]
  21. Petrosyan, R. Improved approximations for some polymer extension models. Rheol. Acta 2017, 56, 21–26. [Google Scholar] [CrossRef]
  22. Nguessong, A.N.; Beda, T.; Peyraut, F. A new based error approach to approximate the inverse Langevin function. Rheol. Acta 2014, 53, 585–591. [Google Scholar] [CrossRef]
  23. Kröger, M. Simple, admissible, and accurate approximants of the inverse Langevin and Brillouin functions, relevant for strong polymer deformations and flows. J. Non-Newton. Fluid Mech. 2015, 223, 77–87. [Google Scholar] [CrossRef]
  24. Marchi, B.C.; Arruda, E.M. Generalized error-minimizing, rational inverse Langevin approximations. Math. Mech. Solids 2019, 24, 1630–1647. [Google Scholar] [CrossRef]
  25. Veberič, D. Lambert W function for applications in physics. Comput. Phys. Commun. 2012, 183, 2622–2628. [Google Scholar] [CrossRef]
  26. Howard, R.M. Analytical approximations for the principal branch of the Lambert W function. Eur. J. Math. Anal. 2022, 2, 14. [Google Scholar] [CrossRef]
  27. Lóczi, L. Guaranteed-and high-precision evaluation of the Lambert W function. Appl. Math. Comput. 2022, 433, 127406. [Google Scholar] [CrossRef]
  28. Banwell, T.C. Bipolar transistor circuit analysis using the Lambert W-function. IEEE Trans. Circuits Syst. I Fundam. Theory Appl. 2000, 47, 1621–1633. [Google Scholar] [CrossRef]
  29. Visser, M. Primes and the Lambert W function. Mathematics 2018, 6, 56. [Google Scholar] [CrossRef]
  30. Barry, D.A.; Parlange, J.Y.; Li, L.; Prommer, H.; Cunningham, C.J.; Stagnitti, F. Analytical approximations for real values of the Lambert W-function. Math. Comput. Simul. 2000, 53, 95–103. [Google Scholar] [CrossRef]
  31. Iacono, R.; Boyd, J.P. New approximations to the principal real-valued branch of the Lambert W-function. Adv. Comput. Math. 2017, 43, 1403–1436. [Google Scholar] [CrossRef]
  32. Goličnik, M. On the Lambert W function and its utility in biochemical kinetics. Biochem. Eng. J. 2012, 63, 116–123. [Google Scholar] [CrossRef]
  33. Dargazany, R.; Hörnes, K.; Itskov, M. A simple algorithm for the fast calculation of higher order derivatives of the inverse function. Appl. Math. Comput. 2013, 221, 833–838. [Google Scholar] [CrossRef]
  34. Craik, A.D. Prehistory of Faà di Bruno’s formula. Am. Math. Mon. 2005, 112, 119–130. [Google Scholar]
  35. Leslie, R.A. How not to repeatedly differentiate a reciprocal. Am. Math. Mon. 1991, 98, 732–735. [Google Scholar] [CrossRef]
Figure 1. Illustration of the functions y = f x and x = f 1 y and Taylor series approximations to these functions based on the points x 0 , f ( x 0 ) and y 0 , f 1 y 0 . The root of the Taylor series, denoted, respectively, x 1 , x 2 , , x n and x 1 I , x 2 I , , x n I , are approximations for the roots of f .
Figure 2. Illustration of the root of f(x) − yₒ, denoted xₒ and given by f⁻¹(yₒ), and an initial approximation x₀ to xₒ. The illustration is for the monotonically increasing function case. The function f₀⁻¹ is an initial approximation to f⁻¹.
Figure 3. Illustration of the interaction between direct high-order approximation and iteration to obtain accurate analytical inverse function approximations.
Figure 4. Graph of y = f(x) = sin(x), x = f⁻¹(y) = asin(y), y = g(x) = cos(x) and x = g⁻¹(y) = acos(y) for 0 ≤ x < π/2, 0 ≤ y < 1.
Figure 5. Graph of the relative errors in approximations to asin(y).
Figure 6. Graphs of f(x) = x − sin(x) and its inverse f⁻¹(y).
Figure 7. Graph of the relative error in approximations to the inverse of x − sin(x). Upper three curves: original approximations defined by (69) to (71). Lower three curves: first-order approximations as defined by (72).
Figure 8. Graph of the Langevin and inverse Langevin functions.
Figure 9. Graph of the relative error in approximations to the inverse Langevin function. Upper three curves: original approximations as given by (87) to (89). Lower three curves: associated first-order approximations as specified by (94).
Figure 10. Graph of the impulse response of the transfer function defined by (103).
Figure 11. Graph of f(x) = x eˣ and its inverse, the Lambert W function, denoted W, for the principal branch and real case.
Figure 12. Graphs of the relative error in approximations to the Lambert W function.
Table 1. Relative error bounds, over the interval [0, 1], for approximations to arcsine based on the original approximations f₀,₁⁻¹, f₀,₂⁻¹, f₀,₃⁻¹ and f₀,₄⁻¹, as specified by (46) to (49).

| Approximation | f₀,₁⁻¹ | f₀,₂⁻¹ | f₀,₃⁻¹ | f₀,₄⁻¹ |
| --- | --- | --- | --- | --- |
| Original approximation | 4.72 × 10⁻² | 3.62 × 10⁻³ | 3.64 × 10⁻⁴ | 3.04 × 10⁻⁶ |
| 1st order: (42) | 1.96 × 10⁻³ | 1.84 × 10⁻⁵ | 1.78 × 10⁻⁸ | 9.18 × 10⁻¹⁶ |
| 2nd order: (43) | 3.39 × 10⁻⁴ | 2.46 × 10⁻⁷ | 3.68 × 10⁻¹² | 4.43 × 10⁻²² |
| 3rd order: (44) | 8.93 × 10⁻⁵ | 4.43 × 10⁻⁹ | 7.22 × 10⁻¹⁶ | 1.27 × 10⁻³⁰ |
| 4th order: (45) | 2.91 × 10⁻⁵ | 9.37 × 10⁻¹¹ | 1.71 × 10⁻¹⁹ | 3.44 × 10⁻³⁷ |
| 5th order | 1.08 × 10⁻⁵ | 2.18 × 10⁻¹² | 4.26 × 10⁻²³ | 1.95 × 10⁻⁴⁵ |
| NR—1st iteration: (57) | 1.96 × 10⁻³ | 1.84 × 10⁻⁵ | 1.78 × 10⁻⁸ | 9.18 × 10⁻¹⁶ |
| NR—2nd iteration: (58) | 1.28 × 10⁻⁵ | 8.87 × 10⁻¹⁰ | 6.52 × 10⁻¹⁷ | 4.94 × 10⁻³² |
| NR—3rd iteration: (59) | 1.29 × 10⁻⁹ | 3.54 × 10⁻¹⁸ | 1.05 × 10⁻³³ | 1.12 × 10⁻⁶³ |
| NR—4th iteration | 2.76 × 10⁻¹⁷ | 9.56 × 10⁻³⁵ | 2.92 × 10⁻⁶⁷ | 2.70 × 10⁻¹²⁶ |
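The Newton–Raphson rows of Table 1 can be illustrated with a short numerical sketch. The function name `asin_nr` and the crude starting guess x₀ = y are illustrative assumptions, not the paper's analytical approximations f₀,ₖ⁻¹; the point is the roughly quadratic error reduction per iteration that the table exhibits.

```python
import math

def asin_nr(y, iterations=5):
    """Approximate asin(y) by Newton-Raphson applied to f(x) = sin(x) - y.

    The crude initial guess x = y (exact at y = 0) stands in for a
    low-accuracy starting approximation; each iteration applies
    x <- x - (sin(x) - y)/cos(x), roughly squaring the error.
    """
    x = y
    for _ in range(iterations):
        x -= (math.sin(x) - y) / math.cos(x)
    return x
```

Doubling the iteration count roughly squares the achievable error bound, which is the pattern visible down the NR rows of the table.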
Table 2. Relative error bounds, over the interval [0, π], for approximations to the inverse of x − sin(x), based on the original approximations f₀,₁⁻¹, f₀,₂⁻¹ and f₀,₃⁻¹ as defined by (69) to (71).

| Approximation | f₀,₁⁻¹ | f₀,₂⁻¹ | f₀,₃⁻¹ |
| --- | --- | --- | --- |
| Original approximation | 8.61 × 10⁻³ | 5.74 × 10⁻³ | 1.36 × 10⁻³ |
| 1st order: (72) | 6.13 × 10⁻⁵ | 2.93 × 10⁻⁵ | 1.13 × 10⁻⁶ |
| 2nd order: (73) | 8.24 × 10⁻⁷ | 2.67 × 10⁻⁷ | 2.44 × 10⁻⁹ |
| 3rd order: (74) | 1.31 × 10⁻⁸ | 2.91 × 10⁻⁹ | 5.52 × 10⁻¹² |
| 4th order: (75) | 2.28 × 10⁻¹⁰ | 3.49 × 10⁻¹¹ | 1.42 × 10⁻¹⁴ |
| 5th order | 4.23 × 10⁻¹² | 4.43 × 10⁻¹³ | 3.83 × 10⁻¹⁷ |
| NR—1st iteration: (72) | 6.13 × 10⁻⁵ | 2.93 × 10⁻⁵ | 1.13 × 10⁻⁶ |
| NR—2nd iteration: (78) | 3.18 × 10⁻⁹ | 7.69 × 10⁻¹⁰ | 7.92 × 10⁻¹³ |
| NR—3rd iteration | 8.61 × 10⁻¹⁸ | 5.31 × 10⁻¹⁹ | 3.95 × 10⁻²⁵ |
| NR—4th iteration | 6.31 × 10⁻³⁵ | 2.54 × 10⁻³⁷ | 9.89 × 10⁻⁵⁰ |
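A corresponding numerical sketch for the inverse of f(x) = x − sin(x) is below. The name `inv_x_minus_sin`, the mid-interval starting point and the iteration count are illustrative choices, not the paper's f₀,ₖ⁻¹ approximations. Since f′(x) = 1 − cos(x) vanishes at x = 0, the iteration starts away from that endpoint.

```python
import math

def inv_x_minus_sin(y, iterations=20):
    """Numerically invert f(x) = x - sin(x) for y in [0, pi] via Newton-Raphson.

    f'(x) = 1 - cos(x) vanishes at x = 0, so the iteration starts at the
    mid-interval point x = pi/2 rather than at an endpoint.
    """
    x = math.pi / 2
    for _ in range(iterations):
        fp = 1.0 - math.cos(x)
        if fp == 0.0:  # guard against a vanishing derivative
            break
        x -= (x - math.sin(x) - y) / fp
    return x
```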
Table 3. Relative error bounds, over the interval [0, 1), for approximations to the inverse Langevin function based on the original approximations L₀,₁⁻¹, L₀,₂⁻¹ and L₀,₃⁻¹, as given by (87) to (89).

| Approximation | L₀,₁⁻¹ | L₀,₂⁻¹ | L₀,₃⁻¹ |
| --- | --- | --- | --- |
| Original approximation | 9.69 × 10⁻³ | 1.79 × 10⁻³ | 7.22 × 10⁻⁴ |
| 1st order: (94) | 9.39 × 10⁻⁵ | 3.20 × 10⁻⁶ | 3.81 × 10⁻⁷ |
| 2nd order: (95) | 9.11 × 10⁻⁷ | 5.73 × 10⁻⁹ | 2.59 × 10⁻¹⁰ |
| 3rd order | 8.80 × 10⁻⁹ | 1.03 × 10⁻¹¹ | 1.73 × 10⁻¹³ |
| 4th order | 8.55 × 10⁻¹¹ | 1.84 × 10⁻¹⁴ | 1.14 × 10⁻¹⁶ |
| 5th order | 8.30 × 10⁻¹³ | 3.28 × 10⁻¹⁷ | 7.35 × 10⁻²⁰ |
| NR—1st iteration | 9.39 × 10⁻⁵ | 3.20 × 10⁻⁶ | 3.81 × 10⁻⁷ |
| NR—2nd iteration | 8.80 × 10⁻⁹ | 1.03 × 10⁻¹¹ | 1.09 × 10⁻¹³ |
| NR—3rd iteration | 7.74 × 10⁻¹⁷ | 1.05 × 10⁻²² | 8.91 × 10⁻²⁷ |
| NR—4th iteration | 5.98 × 10⁻³³ | 1.11 × 10⁻⁴⁴ | 6.03 × 10⁻⁵³ |
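A numerical counterpart for the inverse Langevin case can be sketched as follows. The starting value y(3 − y²)/(1 − y²) is a well-known low-order inverse Langevin estimate, used here purely as an illustrative stand-in for the paper's L₀,ₖ⁻¹ approximations; the function names are assumptions.

```python
import math

def langevin(x):
    """Langevin function L(x) = coth(x) - 1/x for x != 0 (L(0) = 0)."""
    return 1.0 / math.tanh(x) - 1.0 / x

def inv_langevin(y, iterations=30):
    """Invert the Langevin function for y in (0, 1) via Newton-Raphson.

    Starts from the classical low-order estimate y*(3 - y^2)/(1 - y^2)
    and iterates using L'(x) = 1/x^2 - 1/sinh(x)^2.
    """
    x = y * (3.0 - y * y) / (1.0 - y * y)
    for _ in range(iterations):
        dL = 1.0 / (x * x) - 1.0 / math.sinh(x) ** 2  # L'(x)
        x -= (langevin(x) - y) / dL
    return x
```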
Table 4. Relative error bounds, over the interval [0, ∞), for approximations to the Lambert W function based on the original approximations f₀,₁⁻¹(y), f₀,₂⁻¹(y), f₀,₃⁻¹(y) and f₀,₄⁻¹(y) as defined by (105) to (108). The relative error bounds for f₀,₁⁻¹(y) occur at increasingly high argument values as the order of approximation increases. The bounds for the second- and higher-order approximations are given for the interval [0, 10²⁰]. The relative error associated with f₀,₂⁻¹(y) increases for argument values greater than 10³⁰, and the stated bounds are for the interval [0, 10²⁰].

| Approximation | f₀,₁⁻¹ | f₀,₂⁻¹ | f₀,₃⁻¹ | f₀,₄⁻¹ |
| --- | --- | --- | --- | --- |
| Original approximation | 1.96 × 10⁻³ | 4.53 × 10⁻³ | 1.33 × 10⁻³ | 7.23 × 10⁻⁷ |
| 1st order: (109) or (114) | 1.60 × 10⁻⁵ | 3.02 × 10⁻⁴ | 5.12 × 10⁻⁶ | 1.49 × 10⁻¹² |
| 2nd order: (110) or (115) | 2.96 × 10⁻⁷ | 2.92 × 10⁻⁵ | 2.93 × 10⁻⁸ | 4.31 × 10⁻¹⁸ |
| 3rd order: (111) | 7.45 × 10⁻⁹ | 3.23 × 10⁻⁶ | 1.94 × 10⁻¹⁰ | 1.43 × 10⁻²⁵ |
| 4th order: (112) | 2.02 × 10⁻¹⁰ | 3.86 × 10⁻⁷ | 1.39 × 10⁻¹² | 5.06 × 10⁻²⁹ |
| 5th order | 5.70 × 10⁻¹² | 4.82 × 10⁻⁸ | 1.05 × 10⁻¹⁴ | 1.88 × 10⁻³⁴ |
| NR—1st iteration: (109) | 1.60 × 10⁻⁵ | 3.02 × 10⁻⁴ | 5.12 × 10⁻⁶ | 1.49 × 10⁻¹² |
| NR—2nd iteration | 3.66 × 10⁻⁹ | 1.49 × 10⁻⁶ | 9.61 × 10⁻¹¹ | 6.98 × 10⁻²⁴ |
| NR—3rd iteration | 2.89 × 10⁻¹⁶ | 3.92 × 10⁻¹¹ | 3.91 × 10⁻²⁰ | 1.62 × 10⁻⁴⁶ |
| NR—4th iteration | 1.81 × 10⁻³⁰ | 2.79 × 10⁻²⁰ | 7.08 × 10⁻³⁹ | 9.04 × 10⁻⁹² |
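The Newton–Raphson rows of Table 4 can likewise be illustrated numerically for the principal branch on [0, ∞). The function name `lambert_w` and the rough but safe starting guess x = ln(1 + y) are illustrative assumptions, not the paper's f₀,ₖ⁻¹ approximations.

```python
import math

def lambert_w(y, iterations=30):
    """Principal-branch Lambert W for y >= 0 via Newton-Raphson on x*e^x = y.

    The update x <- x - (x*e^x - y)/(e^x*(1 + x)) starts from the rough
    but safe guess x = log(1 + y), which is exact at y = 0.
    """
    x = math.log1p(y)
    for _ in range(iterations):
        ex = math.exp(x)
        x -= (x * ex - y) / (ex * (1.0 + x))
    return x
```

For y ≥ 0 the iterate stays on the principal branch (x ≥ 0), so the derivative factor eˣ(1 + x) never vanishes and the iteration is well defined.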
Howard, R.M. Schröder-Based Inverse Function Approximation. Axioms 2023, 12, 1042. https://doi.org/10.3390/axioms12111042
