1. Introduction
We begin this review by briefly considering various strategies for approximating continuous functions. In numerical analysis such functions are usually approximated by a least squares approach or by interpolation. In either case a basis is chosen from an underlying approximation space; depending on the problem this may be polynomial, rational or sinusoidal. Interpolation in particular plays a key role in approximating the unknown solution of differential equations. In that case a polynomial basis is chosen, which serves as a trial function for the solution. For efficiency and stability a low order polynomial basis is used on a discretized domain. This approach is called the collocation method, and the trial function is referred to as the collocation polynomial, or the interpolating polynomial. We review the background of collocation by looking at the criterion of the method. We then review different methods that are used to discretize both ODEs and PDEs. To do this, we go into detail about various collocation methods, as well as how they have been used over the years.
In particular we review the method of orthogonal collocation on finite elements (OCFE), where the domain is split into elements, each element is mapped to the standard interval $[0, 1]$, and the Gauss points in $[0, 1]$ are then used as collocation points. This approach is proved to yield optimal error bounds, whereas spline approximation yields only sub-optimal bounds. For example, quintic spline approximation for third order ODEs gives only third order convergence, whereas OCFE produces order six convergence. We solved the resulting DAEs using two approaches. Firstly, we used DASSL to solve the DAEs resulting from OCFE in their original form. Secondly, we converted the DAEs to ODEs and used a stiff integrator to solve the resulting system of IVPs. These approaches are clearly superior to previous ones, in which quasilinearization together with the second order Crank–Nicolson method is used for the time integration. In Section 2 we show how the idea of collocation arises naturally from an approximation to the Galerkin method. Section 3 contains a literature review and the history of collocation. In Section 4 we derive the quintic Hermite basis functions and discuss their properties. Section 5 describes OCFE with respect to a quintic Hermite basis, and derives a simplified form of the trial functions using the continuity conditions. In Section 6 we present a detailed error analysis for the linear ODE case using a quintic Hermite basis; in particular we show that the error is optimal when the collocation points are chosen as the Gauss points, and present nodal and global error bounds. Finally, in Section 7 we present numerical results for several linear and non-linear differential equations, and validate the theorem of Section 6 by examining the global and nodal convergence orders.
2. Mathematical Setting
Here we review how collocation arises. We consider a function $y \in C(X)$, where $C(X)$ is the vector space of continuous functions on the domain X. Let L be a bounded linear operator on $C(X)$, and let $\{\phi_j\}_{j=1}^{N}$ be a basis for a subspace $S_N$ of $C(X)$ of dimension N. Consider solving the linear equation
$$L y = f.$$
If $\tilde{y}$ is an approximation to y, then it follows that
$$\tilde{y} = \sum_{j=1}^{N} a_j \phi_j,$$
where the $a_j$ are real coefficients. We also assume that $L^{-1}$ exists and is bounded. The quantity
$$r = L\tilde{y} - f$$
is defined as the residual, and it can be shown that
$$\frac{\| y - \tilde{y} \|}{\| y \|} \le \kappa(L)\, \frac{\| r \|}{\| f \|},$$
where $\kappa(L)$ is the condition number of L and $\| \cdot \|$ denotes compatible operator and function norms on $C(X)$, with $\kappa(L) = \| L \|\, \| L^{-1} \|$. For well conditioned problems a small residual norm $\| r \|$ implies that $\tilde{y}$ is close to y. Galerkin's method attempts to minimize the norm of the residual by requiring that r be orthogonal to the subspace $S_N$, which is equivalent to
$$\int_X r\, \phi_j \, dx = 0, \qquad j = 1, \dots, N. \qquad (1)$$
This geometry in the underlying Hilbert space is illustrated in Figure 1.
But these integrals may be expensive to evaluate. The collocation method attempts to avoid this expensive calculation and is a simplification of the Galerkin method.
Integrals such as those mentioned in (1) can be evaluated using numerical quadrature, for instance,
$$\int_X r\, \phi_j \, dx \approx \sum_{k=1}^{N} w_k\, r(x_k)\, \phi_j(x_k),$$
where the $x_k$ are the quadrature points and the $w_k$ are the corresponding weights. If $\{\phi_j\}$ is a set of orthogonal polynomials, with $\phi_j$ of degree $j - 1$, then we may choose the $x_k$ as the zeros of $\phi_{N+1}$. This ensures that the quadrature rule is the associated Gauss rule. We note that the zeros of an orthogonal set of polynomials are real, simple and confined to the interval of orthogonality. Now the sum $\sum_{k} w_k\, r(x_k)\, \phi_j(x_k)$ would not, in general, approximate zero, since $\phi_j(x_k) \neq 0$ for $j = 1, \dots, N$. Hence, in order that this integral is approximately zero, we set
$$r(x_k) = 0, \qquad k = 1, \dots, N.$$
This is the criterion for the collocation method.
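As a concrete illustration of this criterion, the following Julia sketch (a toy example of our own, not taken from the sections above) applies collocation to the two-point boundary value problem $-y'' = \pi^2 \sin(\pi x)$, $y(0) = y(1) = 0$, using a small polynomial basis that satisfies the boundary conditions and equally spaced collocation points.

```julia
# A minimal sketch (not from the paper) of the collocation criterion:
# choose a trial function from a finite-dimensional space and force the
# residual L*ytilde - f to vanish at N chosen points.
# Here L*y = -y'' with y(0) = y(1) = 0, exact solution y = sin(pi*x),
# so f = pi^2 * sin(pi*x).  Basis: phi_j(x) = x^j * (1 - x), j = 1..N,
# which satisfies the boundary conditions by construction.

using LinearAlgebra

f(x) = pi^2 * sin(pi * x)

# phi_j and its second derivative, for phi_j(x) = x^j - x^(j+1)
phi(j, x)   = x^j - x^(j + 1)
d2phi(j, x) = j*(j - 1)*x^(j - 2) - (j + 1)*j*x^(j - 1)

N  = 6
xc = [(k - 0.5) / N for k in 1:N]              # collocation points (equally spaced here)

A = [-d2phi(j, xc[k]) for k in 1:N, j in 1:N]  # L applied to phi_j, evaluated at x_k
b = [f(xc[k]) for k in 1:N]
a = A \ b                                      # collocation coefficients

ytilde(x) = sum(a[j] * phi(j, x) for j in 1:N)
err = maximum(abs(ytilde(x) - sin(pi * x)) for x in 0:0.01:1)
println("max error ≈ ", err)
```

The residual is exactly zero at the chosen points and, as the printed error shows, small everywhere else.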
3. Literature Review
There are three broad classes of methods for discretizing ODEs and PDEs in space.
- (a)
Finite difference methods (FDM) are local methods which yield the numerical solution at mesh points in the spatial domain. They are easy to implement and yield sparse matrices that are computationally cheap to work with. Their main drawback is that they are not very accurate. The Crank–Nicolson method is the most popular FDM for solving PDEs. We refer to the books [1,2], and references therein, for a further exposition of these methods.
- (b)
Finite element methods are derived using a variational/integral formulation and are global methods, which yield the numerical solution on intervals/sub-intervals of the spatial domain. Variants include the method of weighted residuals, the Galerkin and Petrov–Galerkin methods, the Rayleigh–Ritz method, the Tau method, etc. These methods are mathematically robust, but from a numerical point of view they are not very practical, due to the need to evaluate integrals exactly. We refer to the classical text [3] for a more detailed description of these methods; many other texts can be found in the literature.
- (c)
Collocation methods have been developed to circumvent this shortcoming. The collocation method is used to solve ordinary and partial differential equations, as well as integral equations. The approximate solution is determined by requiring the equation to be satisfied at specific points, referred to as collocation points. These are chosen points lying within the domain of the equation, at which the residual of a candidate solution is required to be zero.
Frazer et al. [4] were the first to use this method, in 1937. It was then used in 1941 by Bickley [5] to solve an unsteady heat conduction problem. A low-order collocation method was used for a number of boundary-layer problems by Schetz [6], in 1963.
There are many variants of collocation methods, the choice depending mainly on the application. Spectral/pseudo-spectral methods [7] are highly accurate and are mainly used for problems with very smooth solutions. Fourier spectral methods, whose basis functions are sinusoidal, are limited to problems posed on a periodic domain. If the basis functions are polynomials, the method is referred to as a polynomial collocation method. Chebyshev spectral methods have been developed to solve problems posed on a bounded domain. These methods yield very fast convergence, known as spectral or exponential convergence. In most cases the matrices generated by these methods are dense, and one has to resort to the Fast Fourier Transform (FFT) to handle them efficiently.
Sinc collocation methods [8] are another example of collocation methods. By definition, the basis functions used in these methods are the Sinc functions, which are naturally posed on an unbounded domain; this is one of the drawbacks of the method. B-splines are by far the most popular choice of basis used in collocation methods for problems on a bounded domain. They have been successfully applied to many important problems, including the Burgers, KdV and Schrödinger equations [9,10]. The Hermite collocation method closely resembles the B-spline collocation method, but is computationally more efficient and, when used in conjunction with orthogonal collocation on finite elements, yields superconvergence properties similar to spectral convergence.
Orthogonal collocation arises from the polynomial collocation method, the collocation points being chosen as the roots of orthogonal polynomials. It can also be viewed as an extension of the method of weighted residuals.
Villadsen and Stewart were the first to use the orthogonal collocation method, in 1967, after discovering that it yielded very good results, owing to the interesting properties of the roots of orthogonal polynomials [11]. Finlayson then used it for various applications in chemical engineering, in 1972 [12], and again for non-linear problems [13], in 1980. It was used by Fan, Chen and Erikson to solve problems arising from chemical reactions [14]. Orthogonal collocation has been widely used for problems in chemical engineering [15,16,17,18,19,20] over the years. More recently, the method of orthogonal collocation using a cubic Hermite basis has been developed and applied successfully to parabolic partial differential equations and to models appearing in chemical engineering [21,22]. Traditionally a monomial basis $\{1, x, x^2, \dots\}$ was used to represent the trial functions or, more efficiently, the Lagrange basis $\{L_k(x)\}$, where
$$L_k(x) = \prod_{j \ne k} \frac{x - x_j}{x_k - x_j}.$$
The main focus in this review is to illustrate the use of orthogonal collocation using a quintic Hermite basis.
4. Quintic Hermite Basis Functions
We seek a basis for the vector space of polynomials of degree at most five on the interval $[x_i, x_{i+1}]$. There are six such basis functions and we denote them by $H_1, \dots, H_6$. We further stipulate their function, first derivative and second derivative values at the end points $x_i$ and $x_{i+1}$ by the interpolatory conditions (3), in which $\delta_{ij}$ denotes the well-known Kronecker delta symbol.
It is convenient to transform to the variable z defined by
$$z = \frac{x - x_i}{h},$$
where h is the uniform interval length. As x varies from $x_i$ to $x_{i+1}$, z varies from 0 to 1. The interpolatory conditions in (3) transform naturally into corresponding conditions at $z = 0$ and $z = 1$.
These conditions enable the unique derivation of the six basis functions; however, it is only necessary to derive a few of them directly, as the others may be obtained by symmetry/anti-symmetry. These polynomials are displayed in (5)–(10).
In terms of the variable z, each basis function associated with the right end point is related to its counterpart at the left end point in a simple way: the function-value basis function at the right end point is a reflection of the left end point one about the vertical axis together with a shift of one unit to the right, and the second-derivative basis function is related to its counterpart in a similar manner. The first-derivative basis function at the right end point may be interpreted as its counterpart rotated by 180° anticlockwise and then shifted one unit to the right.
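The construction of the basis can also be carried out numerically. The following Julia sketch solves the six interpolatory conditions for the monomial coefficients of each basis function; the ordering of the conditions below is our own convention and need not match that of (3), but the resulting polynomials are, up to ordering, those displayed in (5)–(10).

```julia
# A sketch of how the six quintic Hermite basis functions on [0, 1] can be
# generated from the interpolatory conditions of Section 4: each H_j has
# exactly one of {value, first derivative, second derivative} equal to 1 at
# one end point (z = 0 or z = 1), and all other such quantities equal to 0.
# The monomial coefficients are obtained by solving a 6x6 linear system.

using LinearAlgebra

# Row k of M applies the k-th condition to the monomials 1, z, ..., z^5.
val(z)  = [z^p for p in 0:5]
der1(z) = [p == 0 ? 0.0 : p * z^(p - 1) for p in 0:5]
der2(z) = [p <= 1 ? 0.0 : p * (p - 1) * z^(p - 2) for p in 0:5]

M = vcat(val(0.0)', der1(0.0)', der2(0.0)', val(1.0)', der1(1.0)', der2(1.0)')

# Column j of C holds the monomial coefficients of H_j.
C = M \ Matrix{Float64}(I, 6, 6)

H(j, z) = sum(C[p + 1, j] * z^p for p in 0:5)

# Quick check of the interpolatory conditions, e.g. H_1(0) = 1, H_1(1) = 0:
println(H(1, 0.0), "  ", H(1, 1.0))
```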
5. Orthogonal Collocation on Finite Elements
Consider solving an ordinary differential equation in one spatial variable, x, on a domain $[a, b]$. Firstly the domain is divided into N sub-intervals or elements of spacing h, by placing the dividing points or nodes $x_i$, as illustrated in Figure 2. We shall refer to this discretization as the mesh Δ. Here the first and last nodes coincide with the left and right hand boundaries, respectively. This differs from global orthogonal collocation, where the domain is not subdivided and higher order polynomials are instead used to achieve greater accuracy.
The ith element $[x_i, x_{i+1}]$ is mapped to $[0, 1]$ by a transformation of the form (4). We assume that the approximate solution in the ith element is given by
$$\tilde{y}^{(i)}(z) = \sum_{j=1}^{6} a_j^{(i)} H_j(z),$$
with a corresponding representation, with its own coefficients, in the $(i+1)$th element. The basis functions are plotted across both the ith and $(i+1)$th elements and illustrated in Figure 3. We observe that the basis functions are continuous at the shared node $x_{i+1}$, as are their first and second derivatives; however, the third derivative is discontinuous there. This continuity has some interesting consequences for the coefficients of the solution in successive elements.
In order to obtain a smooth solution that is $C^2$ continuous, we enforce continuity of the trial solution at the node $x_{i+1}$ shared by elements i and i+1, which is equivalent, in the variable z, to equating the value of the ith representation at $z = 1$ with that of the $(i+1)$th representation at $z = 0$. By the interpolatory conditions, this equates a single coefficient of element i with a single coefficient of element i+1. The continuity of the first derivative at $x_{i+1}$ yields a second such identity, and similarly the continuity of the second derivative yields a third. Hence the first three coefficients in the $(i+1)$th interval coincide with the last three coefficients in the ith interval. This repetitive pattern continues as we proceed through successive elements. Thus, we may represent the solution in the ith element by (12), in which the coefficients carry a single global index, bearing in mind that this index is a function of i and that the superscript i has been dropped. With this labelling of the coefficients we automatically ensure that the solution and its first and second derivatives are continuous at the nodes. There are $3N + 3$ unknowns resulting from (12). Had we used a quintic Lagrange basis, we would have $6N$ unknowns, as it is not possible to build the continuity conditions into a simple form like (12).
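The following short Julia sketch (using our own global indexing convention, which may differ in detail from that of (12)) makes the coefficient-sharing pattern explicit.

```julia
# A sketch of the coefficient sharing implied by Equation (12): with a quintic
# Hermite basis and C^2 continuity built in, element i uses six consecutive
# global coefficients, and neighbouring elements overlap in three of them,
# giving 3N + 3 unknowns in total.

coeff_indices(i) = (3i - 2):(3i + 3)    # global coefficients used by element i

N = 4                                   # number of elements (illustrative)
for i in 1:N
    println("element ", i, " uses coefficients ", collect(coeff_indices(i)))
end
println("total unknowns: ", 3N + 3)     # versus 6N for a quintic Lagrange basis
```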
6. Error Analysis
The following is an adaptation of the work done by de Boor and Swartz [
23] to quintic Hermite collocation. A third order differential equation, defined on
, can be written in the form
where the operator
and
D denotes the derivative operator. Then, the error is given by
where
is the residual. Hence,
where
is the Green’s function, or integral kernel, associated with the linear problem.
Now,
where
and
Consider one of the intervals
of
. Denote the collocation points by
Approximate the residual
on
by a quadratic interpolant
such that
where
Since
vanishes at the collocation points, we have
and hence (
16) can be written as
where
and
denotes the third divided difference of
Hence the integrand in (
13) can be written in the form
where
.
Interpolate
by a polynomial
of degree
, in the variable
at any points
in
to obtain
where
Equation (
17) then becomes
Substituting (
19) into (
15) results in
If the collocation points are chosen so that
is orthogonal to
that is
for all polynomials
of degree
, then
Equation (
20) follows since
and we have assumed that
can be bounded independently of
If we now sum
in (14), we obtain
If
, we forgo the interpolation of
and assume that
can be bounded on
such that
We follow this idea to obtain higher order convergence in the case that
cannot be bounded independent of
. In order to achieve this we prove the following theorem.
Theorem 1. Let be constants independent of Δ and assume that there exists a constant such that where we consider to be an element of If the coefficients of L are smooth enough, specifically and then there exists a constant such that for each j, where Proof. Since
, it follows that
on
is a combination of the derivatives of the coefficients of
L, up to the order
, with the derivatives of
up to the order
(recall
L is third order). It is therefore sufficient to prove the existence of a constant
such that
The following result, namely.
is proved by de Boor and Swartz [
23].
We now consider the case
. Expand
y at some point
in a Taylor series
of order
to obtain
where
from which it is simple to show that
From (
25), we deduce that
If X is a non-negative discrete random variable and t is a positive real number then, by Markov's inequality,
$$P(X \ge t) \le \frac{E(X)}{t},$$
where $E(X)$ denotes the expectation of X and P the probability. With
and
, we obtain
and with
and
we obtain
which allows us to write
for
where
We also have that
where we have used both (
24) and (
26). Substituting (
28) into (
27) yields
Hence, for
we have that
where we have used (
26) and (
29).
For
so that
Hence (
24), (
30) and (
31) proves (
23). □
The following theorem establishes the order of convergence.
Theorem 2. Let be constants independent of Δ. Assume that the coefficients of L satisfy for all i, that Equation (21) holds and that If the collocation points are chosen such that is orthogonal to for every then there exists a constant such that and a constant such that Proof. We write
with
the Taylor series for
, with terms of order
(see (
17) and (
18)). Choosing the collocation points so that
for every polynomial
We have
Now
We let
and
From
we infer that
Hence
We now consider three cases:
- (i)
If
then from (
21)
while
from (
22). Hence from (
38), we have
and (37) gives
- (ii)
If
then
Now consider say
. Equation (
34) is then rewritten as
where
is the Taylor series for
of order one. Once again, we choose the collocation points so that
Hence the equivalent equation to (
36) is
The equivalent equation to (
38) is then
and
Hence (
40) reduces to
Similarly, by considering
we may show that
and
- (iii)
If
so that
for all
j then, from summing (
39) (recall (14)) we obtain
for
x coinciding with the nodes
Otherwise
for exactly one of the sub-intervals of
. So, by (
39) and (
41) and summing over
j we obtain
□
The following theorem establishes the optimal choice for the collocation points.
Theorem 3. The optimal choice of collocation points is the set of Gauss points.
Proof. The orthogonality of
and
from (
35) is equivalent to
Change the integral in (42) to the interval $[-1, 1]$ by using a linear transformation, to yield Equation (43). Equation (43) is equivalent to the following three equations, (44)–(46).
Integrating (44)–(46), we obtain Equations (47)–(49). Equations (47) and (49) imply that both the sum and the product of the three points vanish; hence we choose the middle point to be zero, without loss of generality. This implies that the two outer points are symmetric about the origin, and (48) then fixes their product, so that the collocation points are $-\sqrt{3/5}$, $0$ and $\sqrt{3/5}$. These are the Gauss points, or zeros, of the third degree Legendre polynomial. □
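The following Julia sketch verifies this result numerically and shows how the reference points are mapped into an element; the element end points used are purely illustrative.

```julia
# A sketch verifying Theorem 3: the three collocation points per element are
# the zeros of the degree-3 Legendre polynomial P3(t) = (5t^3 - 3t)/2 on
# [-1, 1], i.e. t = -sqrt(3/5), 0, sqrt(3/5), mapped into each element.

P3(t) = (5t^3 - 3t) / 2

gauss_ref = [-sqrt(3/5), 0.0, sqrt(3/5)]
@assert all(abs(P3(t)) < 1e-14 for t in gauss_ref)

# Map the reference points from [-1, 1] to an element [xl, xr].
to_element(t, xl, xr) = xl + (t + 1) * (xr - xl) / 2

xl, xr = 0.25, 0.50                      # an illustrative element
println(to_element.(gauss_ref, xl, xr))
```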
7. Numerical Examples
Example 1. We now consider a third order linear ODE with three boundary conditions. Substituting (12) in the given ODE results in the system of equations (50). The boundary conditions, including the derivative condition, each yield one further equation for the coefficients. There are $3N + 3$ unknowns. Given that we have three boundary conditions, we thus require a further $3N$ conditions in order to solve the problem uniquely. We therefore choose three collocation points in each element. These are chosen as the zeros of the third degree Legendre polynomial, shifted to the element in question; these were shown above to be the optimal choice of collocation points. The collocation points are then substituted into Equation (50) to give the $3N$ linear equations. The resulting matrix–vector system (51), of size $(3N + 3) \times (3N + 3)$, has the form $A\mathbf{a} = \mathbf{b}$, where A has the block form shown (illustrated for a small number of elements). The non-zero blocks of matrix A are shifted three places to the right from one element to the next, which accounts for the repetition of the coefficients, and the position of the ones accounts for the boundary conditions.
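The following Julia sketch reproduces the sparsity pattern just described. It shows structure only: the actual entries depend on the ODE, on the Hermite basis values at the Gauss points, and on the particular boundary conditions, and the placement of the boundary rows below is our own illustrative choice.

```julia
# Sparsity structure of the collocation matrix A: each element contributes
# 3 collocation rows acting on 6 consecutive coefficients, and successive
# 3x6 blocks are shifted 3 columns to the right.

using SparseArrays

function collocation_structure(N)
    n = 3N + 3
    A = spzeros(Bool, n, n)
    for i in 1:N, r in 1:3, c in 1:6
        A[3(i - 1) + r, 3(i - 1) + c] = true   # 3x6 block for element i
    end
    # Three boundary-condition rows (placed last here; placement is a choice).
    A[3N + 1, 1] = true
    A[3N + 2, 2] = true
    A[3N + 3, n] = true
    return A
end

println(collocation_structure(3))
```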
After solving (51), the solution is constructed on each sub-interval using the appropriate coefficients and can then be plotted. Since there is very good agreement between the approximate solution and the exact solution, we show instead the error plot in Figure 4. If the discrete error at the nodes is computed on two successive meshes, as in (52) and (53), then taking the ratio of (52) to (53) yields the observed order of convergence. The conditions of Theorem 2 are satisfied. We have solved the problem with two different mesh sizes and summarise in Table 1 the convergence orders at the common nodes, to be compared with Equation (32). The orders agree remarkably well with Equation (32), the difference being attributed to numerical error. The global order is shown in Table 2 and illustrates the validity of Equation (33).
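For reference, the convergence orders in Tables 1 and 2 are obtained in the usual way from errors on successively halved meshes. The following Julia sketch uses hypothetical error values, not those of the paper, purely to illustrate the calculation.

```julia
# Observed order of convergence: if e_N is the error on a mesh with N elements
# and the mesh is repeatedly halved, the observed order is log2(e_N / e_{2N}).
# The error values below are placeholders, not values from the paper.

errors = Dict(8 => 1.2e-6, 16 => 1.9e-8, 32 => 3.0e-10)   # hypothetical
for N in (8, 16)
    p = log2(errors[N] / errors[2N])
    println("observed order from N = ", N, " to ", 2N, ": ", round(p, digits = 2))
end
```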
Example 2. Consider a boundary value problem with a known exact solution. As there are only two boundary conditions, we choose an additional collocation point in the first element. Hence the collocation points in the first element are the zeros of the fourth degree Legendre polynomial, shifted to that element, whilst in the remaining elements they are the zeros of the third degree Legendre polynomial. The solution is smooth, but one of the coefficients of the differential equation is unbounded at an end point of the domain. This, however, is not a problem, since the collocation points are chosen in the interior of each element and thus avoid the end points. The errors at the common nodes, as well as the convergence order, are displayed in Table 3; the observed convergence order is evident from the table. Theorem 2 does not apply here, as there is a coefficient with a singularity at the end point. In the next example we consider the application of the method to partial differential equations.
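The extra collocation points can be checked directly. The following Julia sketch computes the zeros of the degree-4 Legendre polynomial and maps them to $[0, 1]$; the particular shift used is our own choice of mapping for illustration.

```julia
# Zeros of the degree-4 Legendre polynomial P4(t) = (35t^4 - 30t^2 + 3)/8,
# shifted from [-1, 1] to [0, 1] via z = (t + 1)/2.

P4(t) = (35t^4 - 30t^2 + 3) / 8

roots = [-sqrt((15 + 2sqrt(30)) / 35), -sqrt((15 - 2sqrt(30)) / 35),
          sqrt((15 - 2sqrt(30)) / 35),  sqrt((15 + 2sqrt(30)) / 35)]
@assert all(abs(P4(t)) < 1e-12 for t in roots)

println((roots .+ 1) ./ 2)              # the four points in [0, 1]
```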
Example 3. We illustrate the full spatial discretization process involved in the collocation procedure on a PDE (54), with boundary conditions, an initial condition and a known exact solution. In the variable z, Equation (54) becomes Equation (55). Analogous to Equation (12), the trial solution in the ith element is written as (56); here the time dependence is carried by the coefficients a. Substituting (56) into (55), we obtain (57). We then substitute three collocation points from each sub-interval into Equation (57) to obtain $3N$ conditions. Using these, along with the boundary conditions, we get a linear differential algebraic system; in particular, the boundary conditions supply the algebraic equations of the system. Differential algebraic systems are solved using the classical DASSL solver [24]. DASSL is designed for the numerical solution of implicit systems of differential/algebraic equations written in the form $F(t, y, y') = 0$, where y and $y'$ are vectors, and initial values for y and $y'$ are given. It is a classic DAE solver, released as open source in 1982, and it has recently been ported to Julia. Julia (https://github.com/JuliaLang/julia, accessed on 1 November 2022) was released in 2012 at MIT. It is an open-source, multi-paradigm, high-level, high-performance dynamic language. For scientific computing it resembles Matlab in syntax, while coming close to C in speed. It allows interaction with other scientific languages such as Fortran, Python and R, and there are over 2000 packages that can be added to it. One can interface with many plotting packages, but we have elected to use pyplot from matplotlib.
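As an illustration of this solution strategy, the following Julia sketch sets up and solves a toy implicit DAE in the form $F(t, y, y') = 0$; it is not the system (57), and it uses the DifferentialEquations.jl DAEProblem interface with the Sundials IDA solver as a stand-in for DASSL.

```julia
# A minimal DAE sketch: one differential equation plus one algebraic equation,
# the latter playing the role of a boundary-condition row in the collocation
# system.  Toy system: y1' = -y1 with the constraint y1 + y2 = 1.

using DifferentialEquations, Sundials

function residual!(res, dy, y, p, t)
    res[1] = dy[1] + y[1]          # differential equation
    res[2] = y[1] + y[2] - 1.0     # algebraic equation
end

y0   = [1.0, 0.0]
dy0  = [-1.0, 1.0]                 # consistent initial derivatives
prob = DAEProblem(residual!, dy0, y0, (0.0, 2.0), differential_vars = [true, false])
sol  = solve(prob, IDA())
println(sol.u[end])                # ≈ [exp(-2), 1 - exp(-2)]
```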
The graphs of the solution and the error are depicted in Figure 5 and Figure 6, respectively. The numerical results indicate that the present method is successful in capturing the oscillatory nature of the solution in both space and time with reasonable accuracy. The drop in accuracy compared with the ODE case can be attributed to the error in the time integration process. We now solve the Korteweg–de Vries (KdV) equation [10]. The semi-discretization in space results in a system of DAEs, which can be converted into a system of IVPs and solved along similar lines to the ODE case discussed in Section 5. For the time integration we use a suitable high order stiff integrator. In contrast, most methods developed in previous studies used quasi-linearization together with the second order Crank–Nicolson method for the time integration.
Example 4. We consider a form of the KdV equation (58), with boundary conditions, an initial condition and a known exact solution (see [10]), and illustrate the spatial discretization process involved in the collocation procedure. We use exactly the same method as before, switching to the variable z so that (58) becomes (59). We then substitute (56) into (59) to get (60). We substitute three collocation points per element into Equation (60) to obtain $3N$ conditions. Together with the boundary conditions, we obtain a non-linear differential algebraic system; in particular, the boundary conditions supply the algebraic equations of the system. In order to obtain the initial conditions for the coefficients, we require the trial solution at the initial time to match the initial condition, as in (61). Substituting the collocation points into (61) gives a matrix–vector system whose solution yields the values of the coefficients at the initial time. Equations (60) and (61) are easily converted to a system of IVPs, which can be efficiently solved using the Julia QNDF solver (the analogue of ode15s in Matlab). Thereafter the PDE solution over the domain is obtained from Equation (56).
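This last step can be sketched in Julia as follows; the right-hand side below is a hypothetical stiff system standing in for the actual collocation ODEs obtained from (60), not the KdV discretization itself.

```julia
# Once the algebraic boundary relations have been eliminated, the remaining
# coefficient equations da/dt = g(a, t) form a stiff IVP system, solved here
# with Julia's QNDF (the analogue of MATLAB's ode15s).

using DifferentialEquations

function g!(da, a, p, t)
    # Hypothetical stiff linear system standing in for the collocation ODEs.
    da[1] = -1000.0 * a[1] + a[2]
    da[2] = a[1] - 2.0 * a[2]
end

a0   = [1.0, 0.5]
prob = ODEProblem(g!, a0, (0.0, 1.0))
sol  = solve(prob, QNDF(); abstol = 1e-8, reltol = 1e-8)
println(sol.u[end])
```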
Figure 7 depicts the solution profiles for various t values. This example describes a wave propagating in the positive x direction. In this figure the solid line represents the exact solution and the circles the approximate solution. It is clear that the approximate solution closely matches the exact solution. The 3D solution and error are shown in Figure 8 and Figure 9, respectively. We see that applying the quintic OCFE method in space, combined with a powerful stiff ODE solver in time, is capable of producing a very smooth and accurate solution.
8. Conclusions
We have reviewed the numerical approximation of the solution of differential equations using collocation methods, with a bias towards a quintic Hermite basis. In Lagrange collocation, the continuity conditions have to be imposed explicitly, in addition to setting the residual to zero at the collocation points; this results in a much larger system of equations to solve. With a Hermite basis the continuity conditions are automatically satisfied, as reflected in the coefficients of the trial solution, so that only the collocation equations have to be solved. This is a huge computational advantage. In addition we have provided a useful error analysis, which is also applicable to the non-linear case. Collocation on finite elements is essentially equivalent to minimizing the residual over the domain of the problem. It is a powerful and versatile tool that rivals finite difference approximations, especially when the solution is oscillatory in nature.