Article

Abstract Univariate Neural Network Approximation Using a q-Deformed and λ-Parametrized Hyperbolic Tangent Activation Function

by
George A. Anastassiou
Department of Mathematical Sciences, University of Memphis, Memphis, TN 38152, USA
Fractal Fract. 2023, 7(3), 208; https://doi.org/10.3390/fractalfract7030208
Submission received: 23 January 2023 / Revised: 8 February 2023 / Accepted: 16 February 2023 / Published: 21 February 2023

Abstract: In this work, we perform univariate approximation with rates, basic and fractional, of continuous functions that take values in an arbitrary Banach space and are defined on a closed interval or on all of the reals, by quasi-interpolation neural network operators. These approximations are achieved by deriving Jackson-type inequalities via the first modulus of continuity of the function at hand or of its abstract integer-order derivative or Caputo fractional derivatives. Our operators are expressed via a density function based on a q-deformed and λ-parametrized hyperbolic tangent activation sigmoid function. The convergences are pointwise and uniform. The associated feed-forward neural networks have one hidden layer.

1. Introduction

In [1,2] (see Chapters 2–5 there), the author pioneered quantitative neural network approximation of continuous functions by precisely defined neural network operators of Cardaliaguet–Euvrard and "Squashing" types, using the modulus of continuity of the given function or of its high-order derivative, and deriving nearly sharp Jackson-type inequalities. He dealt with both the univariate and multivariate cases. The "bell-shaped" and "squashing" functions defining these operators were taken to have compact support. Furthermore, in [2] he gives the Nth-order asymptotic expansion for the error of weak approximation of these two operators to a particular natural class of smooth functions; for more, see Chapters 4–5 therein.
Motivated by [3], the author then continued his research on neural network approximation by employing the appropriate quasi-interpolation operators of sigmoidal and hyperbolic tangent type, which resulted in [4,5,6,7,8], treating both the univariate and multivariate cases. He also completed the corresponding fractional cases [9,10,11].
Let h be a general sigmoid function with h(0) = 0 and horizontal asymptotes y = ±1; of course, h is strictly increasing over ℝ. Let the parameter 0 < r < 1 and x > 0. Then clearly −x < x and −x < −rx < rx < x, and furthermore h(−x) < h(−rx) < h(rx) < h(x). Consequently, the graph of the sigmoid y = h(rx) lies inside the graph of y = h(x), of course with the same asymptotes y = ±1. Therefore, h(rx) has derivatives (gradients) that are different from zero, or not as close to zero, at more points x than h(x) does, thus killing a smaller number of neurons! Furthermore, h(rx) stays farther from y = ±1 than h(x) does, a highly desirable fact in neural network theory.
Brain asymmetry has been clearly documented in animals and humans in terms of structure, function and behavior. This observation reflects evolutionary, hereditary, developmental, experiential and pathological factors. It is therefore natural to consider deformed neural network activation functions and operators in our study. This paper is a specific study under this philosophy of approaching reality as closely as possible.
Consequently, the author here performs neural network approximations, activated by a q-deformed and λ-parametrized hyperbolic tangent function, of continuous functions over closed intervals of the reals or over the whole ℝ, with values in an arbitrary Banach space (X, ‖·‖). Finally, he deals with the related X-valued fractional approximation. All convergences here are quantitative, expressed via the first modulus of continuity of the function at hand or of its X-valued high-order derivative or X-valued fractional derivatives, and they are given by nearly attained Jackson-type inequalities.
Our closed intervals are not necessarily symmetric with respect to the origin. Some of our upper bounds on the error quantity are very flexible and general. In preparation for deriving our results, we describe important properties of the basic density function defining our operators, which is induced by a q-deformed and λ-parametrized hyperbolic tangent sigmoid function.
Feed-forward X-valued neural networks (FNNs) with one hidden layer, the only type of network we use in this work, are mathematically expressed by
$$S_n(x) = \sum_{j=0}^{n} d_j\, k\left(c_j \cdot x + f_j\right), \quad x \in \mathbb{R}^s,\ s \in \mathbb{N},$$
where for 0 ≤ j ≤ n, $f_j \in \mathbb{R}$ are the thresholds, $c_j \in \mathbb{R}^s$ are the connection weights, $d_j \in X$ are the coefficients, $c_j \cdot x$ is the inner product of $c_j$ and $x$, and $k$ is the activation function of the network. For more on neural networks, see [12,13,14].
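For orientation only (this sketch is not part of the paper), the formula above can be coded directly for the real-valued case X = ℝ, with the activation k taken to be the q-deformed, λ-parametrized hyperbolic tangent studied in the next section; the weights, thresholds and coefficients below are hypothetical placeholders.
```python
import numpy as np

def g(x, q=2.0, lam=1.0):
    # q-deformed, lambda-parametrized hyperbolic tangent (defined in Section 2)
    return (np.exp(lam * x) - q * np.exp(-lam * x)) / (np.exp(lam * x) + q * np.exp(-lam * x))

def S_n(x, c, f, d, activation=g):
    # One-hidden-layer FNN: S_n(x) = sum_j d_j * k(c_j . x + f_j), x in R^s
    hidden = activation(c @ x + f)   # shape (n+1,): one value per hidden neuron
    return hidden @ d                # linear combination with coefficients d_j

# Hypothetical example: s = 3 inputs, n + 1 = 5 hidden neurons, X = R
rng = np.random.default_rng(0)
c = rng.normal(size=(5, 3))          # connection weights c_j in R^3
f = rng.normal(size=5)               # thresholds f_j
d = rng.normal(size=5)               # coefficients d_j
print(S_n(np.array([0.1, -0.2, 0.3]), c, f, d))
```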

2. About the q-Deformed and λ-Parametrized Hyperbolic Tangent Function $g_{q,\lambda}$

All of the background in this section comes from [15,16].
We use $g_{q,\lambda}$, see (1); we exhibit that it is a sigmoid function and present several of its properties related to the approximation by neural network operators. It will act as our activation function.
So, let us consider the function
$$g_{q,\lambda}(x) := \frac{e^{\lambda x} - q\, e^{-\lambda x}}{e^{\lambda x} + q\, e^{-\lambda x}}, \qquad \lambda, q > 0,\ x \in \mathbb{R}. \tag{1}$$
We have that
$$g_{q,\lambda}(0) = \frac{1-q}{1+q}.$$
We notice also that
$$g_{q,\lambda}(-x) = \frac{e^{-\lambda x} - q\, e^{\lambda x}}{e^{-\lambda x} + q\, e^{\lambda x}} = \frac{\frac{1}{q}\, e^{-\lambda x} - e^{\lambda x}}{\frac{1}{q}\, e^{-\lambda x} + e^{\lambda x}} = -\,\frac{e^{\lambda x} - \frac{1}{q}\, e^{-\lambda x}}{e^{\lambda x} + \frac{1}{q}\, e^{-\lambda x}} = -\,g_{\frac{1}{q},\lambda}(x).$$
That is,
$$g_{q,\lambda}(-x) = -\,g_{\frac{1}{q},\lambda}(x), \quad \forall\, x \in \mathbb{R},$$
and
$$g_{\frac{1}{q},\lambda}(-x) = -\,g_{q,\lambda}(x),$$
hence
$$-\,g_{\frac{1}{q},\lambda}(-x) = g_{q,\lambda}(x).$$
It is
$$g_{q,\lambda}(x) = \frac{e^{2\lambda x} - q}{e^{2\lambda x} + q} = \frac{1 - q\, e^{-2\lambda x}}{1 + q\, e^{-2\lambda x}} \;\xrightarrow[x \to +\infty]{}\; 1,$$
i.e.,
$$g_{q,\lambda}(+\infty) = 1.$$
Furthermore,
$$g_{q,\lambda}(x) = \frac{e^{2\lambda x} - q}{e^{2\lambda x} + q} \;\xrightarrow[x \to -\infty]{}\; \frac{-q}{q} = -1,$$
i.e.,
$$g_{q,\lambda}(-\infty) = -1.$$
We find that
$$g'_{q,\lambda}(x) = \frac{4\, q\, \lambda\, e^{2\lambda x}}{\left(e^{2\lambda x} + q\right)^2} > 0,$$
therefore $g_{q,\lambda}$ is strictly increasing.
Next we obtain, for x ∈ ℝ,
$$g''_{q,\lambda}(x) = \frac{8\, q\, \lambda^2\, e^{2\lambda x}\left(q - e^{2\lambda x}\right)}{\left(e^{2\lambda x} + q\right)^3} \in C(\mathbb{R}).$$
We observe that
$$q - e^{2\lambda x} \ge 0 \;\Longleftrightarrow\; q \ge e^{2\lambda x} \;\Longleftrightarrow\; \ln q \ge 2\lambda x \;\Longleftrightarrow\; x \le \frac{\ln q}{2\lambda}.$$
So, in case of $x < \frac{\ln q}{2\lambda}$, $g_{q,\lambda}$ is strictly concave up, with $g''_{q,\lambda}\left(\frac{\ln q}{2\lambda}\right) = 0$.
Furthermore, in case of $x > \frac{\ln q}{2\lambda}$, $g_{q,\lambda}$ is strictly concave down.
Clearly, $g_{q,\lambda}$ is a shifted sigmoid function with $g_{q,\lambda}(0) = \frac{1-q}{1+q}$ and $g_{q,\lambda}(-x) = -\,g_{q^{-1},\lambda}(x)$ (a semi-odd function); see also [17].
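The following small numerical check (an illustration only, not part of the paper) confirms the properties of $g_{q,\lambda}$ listed above: the value at zero, the ±1 asymptotes, the deformed symmetry, and the change of concavity at $x = \frac{\ln q}{2\lambda}$.
```python
import numpy as np

def g(x, q, lam):
    return (np.exp(lam * x) - q * np.exp(-lam * x)) / (np.exp(lam * x) + q * np.exp(-lam * x))

q, lam = 3.0, 0.7
print(g(0.0, q, lam), (1 - q) / (1 + q))            # g(0) = (1-q)/(1+q)
print(g(50.0, q, lam), g(-50.0, q, lam))            # approaches +1 and -1
x = 1.234
print(g(-x, q, lam), -g(x, 1 / q, lam))             # deformed symmetry g_q(-x) = -g_{1/q}(x)

x0 = np.log(q) / (2 * lam)                          # inflection point ln(q)/(2*lambda)
h = 1e-3
gxx = lambda t: (g(t + h, q, lam) - 2 * g(t, q, lam) + g(t - h, q, lam)) / h**2
print(gxx(x0 - 0.5) > 0, gxx(x0 + 0.5) < 0)         # concave up before, concave down after
```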
Since 1 > −1, we have x + 1 > x − 1, and since $g_{q,\lambda}$ is strictly increasing, we may consider the function
$$M_{q,\lambda}(x) := \frac{1}{4}\left(g_{q,\lambda}(x+1) - g_{q,\lambda}(x-1)\right) > 0,$$
∀ x ∈ ℝ; q, λ > 0. Notice that $M_{q,\lambda}(\pm\infty) = 0$, so the x-axis is a horizontal asymptote.
We have that
$$M_{q,\lambda}(-x) = \frac{1}{4}\left(g_{q,\lambda}(-x+1) - g_{q,\lambda}(-x-1)\right) =$$
$$\frac{1}{4}\left(g_{q,\lambda}(-(x-1)) - g_{q,\lambda}(-(x+1))\right) =$$
$$\frac{1}{4}\left(-\,g_{\frac{1}{q},\lambda}(x-1) + g_{\frac{1}{q},\lambda}(x+1)\right) =$$
$$\frac{1}{4}\left(g_{\frac{1}{q},\lambda}(x+1) - g_{\frac{1}{q},\lambda}(x-1)\right) = M_{\frac{1}{q},\lambda}(x), \quad \forall\, x \in \mathbb{R}.$$
Thus,
$$M_{q,\lambda}(-x) = M_{\frac{1}{q},\lambda}(x), \quad \forall\, x \in \mathbb{R};\ q, \lambda > 0,$$
a deformed symmetry.
Next, we have that
$$M'_{q,\lambda}(x) = \frac{1}{4}\left(g'_{q,\lambda}(x+1) - g'_{q,\lambda}(x-1)\right), \quad x \in \mathbb{R}.$$
Let $x < \frac{\ln q}{2\lambda} - 1$; then $x - 1 < x + 1 < \frac{\ln q}{2\lambda}$ and $g'_{q,\lambda}(x+1) > g'_{q,\lambda}(x-1)$ (by $g_{q,\lambda}$ being strictly concave up for $x < \frac{\ln q}{2\lambda}$), that is $M'_{q,\lambda}(x) > 0$. Hence, $M_{q,\lambda}$ is strictly increasing over $\left(-\infty, \frac{\ln q}{2\lambda} - 1\right)$.
Let now $x - 1 > \frac{\ln q}{2\lambda}$; then $x + 1 > x - 1 > \frac{\ln q}{2\lambda}$ and $g'_{q,\lambda}(x+1) < g'_{q,\lambda}(x-1)$, that is $M'_{q,\lambda}(x) < 0$.
Therefore $M_{q,\lambda}$ is strictly decreasing over $\left(\frac{\ln q}{2\lambda} + 1, +\infty\right)$.
Let us next consider $\frac{\ln q}{2\lambda} - 1 \le x \le \frac{\ln q}{2\lambda} + 1$. We have that
$$M''_{q,\lambda}(x) = \frac{1}{4}\left(g''_{q,\lambda}(x+1) - g''_{q,\lambda}(x-1)\right) =$$
$$2\, q\, \lambda^2\left[\frac{e^{2\lambda(x+1)}\left(q - e^{2\lambda(x+1)}\right)}{\left(e^{2\lambda(x+1)} + q\right)^3} - \frac{e^{2\lambda(x-1)}\left(q - e^{2\lambda(x-1)}\right)}{\left(e^{2\lambda(x-1)} + q\right)^3}\right].$$
By $\frac{\ln q}{2\lambda} - 1 \le x \Rightarrow x + 1 \ge \frac{\ln q}{2\lambda} \Rightarrow 2\lambda(x+1) \ge \ln q \Rightarrow e^{2\lambda(x+1)} \ge q \Rightarrow q - e^{2\lambda(x+1)} \le 0$.
By $x \le \frac{\ln q}{2\lambda} + 1 \Rightarrow x - 1 \le \frac{\ln q}{2\lambda} \Rightarrow 2\lambda(x-1) \le \ln q \Rightarrow e^{2\lambda(x-1)} \le q \Rightarrow q - e^{2\lambda(x-1)} \ge 0$.
Clearly, by the last expression for $M''_{q,\lambda}$, we obtain that $M''_{q,\lambda}(x) \le 0$ for $x \in \left[\frac{\ln q}{2\lambda} - 1, \frac{\ln q}{2\lambda} + 1\right]$.
More precisely, $M_{q,\lambda}$ is concave down over $\left[\frac{\ln q}{2\lambda} - 1, \frac{\ln q}{2\lambda} + 1\right]$, and strictly concave down over $\left(\frac{\ln q}{2\lambda} - 1, \frac{\ln q}{2\lambda} + 1\right)$.
Consequently, $M_{q,\lambda}$ has a bell-type shape over ℝ.
Of course, it holds $M''_{q,\lambda}\left(\frac{\ln q}{2\lambda}\right) < 0$.
At $x = \frac{\ln q}{2\lambda}$, we have
$$M'_{q,\lambda}(x) = \frac{1}{4}\left(g'_{q,\lambda}(x+1) - g'_{q,\lambda}(x-1)\right) =$$
$$q\, \lambda\left[\frac{e^{2\lambda(x+1)}}{\left(e^{2\lambda(x+1)} + q\right)^2} - \frac{e^{2\lambda(x-1)}}{\left(e^{2\lambda(x-1)} + q\right)^2}\right].$$
Thus,
$$M'_{q,\lambda}\left(\frac{\ln q}{2\lambda}\right) = q\, \lambda\left[\frac{e^{2\lambda\left(\frac{\ln q}{2\lambda}+1\right)}}{\left(e^{2\lambda\left(\frac{\ln q}{2\lambda}+1\right)} + q\right)^2} - \frac{e^{2\lambda\left(\frac{\ln q}{2\lambda}-1\right)}}{\left(e^{2\lambda\left(\frac{\ln q}{2\lambda}-1\right)} + q\right)^2}\right] =$$
$$q\, \lambda\left[\frac{q\, e^{2\lambda}}{\left(q\, e^{2\lambda} + q\right)^2} - \frac{q\, e^{-2\lambda}}{\left(q\, e^{-2\lambda} + q\right)^2}\right] =$$
$$\lambda\left[\frac{e^{2\lambda}}{\left(e^{2\lambda} + 1\right)^2} - \frac{e^{-2\lambda}}{\left(e^{-2\lambda} + 1\right)^2}\right] =$$
$$\lambda\left[\frac{e^{2\lambda}}{\left(e^{2\lambda} + 1\right)^2} - \frac{e^{-2\lambda}\, e^{4\lambda}}{\left(e^{-2\lambda} + 1\right)^2 e^{4\lambda}}\right] = \lambda\left[\frac{e^{2\lambda}}{\left(e^{2\lambda} + 1\right)^2} - \frac{e^{2\lambda}}{\left(e^{2\lambda} + 1\right)^2}\right] = 0.$$
That is, $\frac{\ln q}{2\lambda}$ is the only critical number of $M_{q,\lambda}$ over ℝ. Hence at $x = \frac{\ln q}{2\lambda}$, $M_{q,\lambda}$ attains its global maximum, which is
$$M_{q,\lambda}\left(\frac{\ln q}{2\lambda}\right) = \frac{1}{4}\left[g_{q,\lambda}\left(\frac{\ln q}{2\lambda}+1\right) - g_{q,\lambda}\left(\frac{\ln q}{2\lambda}-1\right)\right] =$$
$$\frac{1}{4}\left[\frac{e^{\lambda\left(\frac{\ln q}{2\lambda}+1\right)} - q\, e^{-\lambda\left(\frac{\ln q}{2\lambda}+1\right)}}{e^{\lambda\left(\frac{\ln q}{2\lambda}+1\right)} + q\, e^{-\lambda\left(\frac{\ln q}{2\lambda}+1\right)}} - \frac{e^{\lambda\left(\frac{\ln q}{2\lambda}-1\right)} - q\, e^{-\lambda\left(\frac{\ln q}{2\lambda}-1\right)}}{e^{\lambda\left(\frac{\ln q}{2\lambda}-1\right)} + q\, e^{-\lambda\left(\frac{\ln q}{2\lambda}-1\right)}}\right] =$$
$$\frac{1}{4}\left[\frac{\sqrt{q}\, e^{\lambda} - q\, q^{-\frac{1}{2}}\, e^{-\lambda}}{\sqrt{q}\, e^{\lambda} + q\, q^{-\frac{1}{2}}\, e^{-\lambda}} - \frac{\sqrt{q}\, e^{-\lambda} - q\, q^{-\frac{1}{2}}\, e^{\lambda}}{\sqrt{q}\, e^{-\lambda} + q\, q^{-\frac{1}{2}}\, e^{\lambda}}\right] =$$
$$\frac{1}{4}\left[\frac{e^{\lambda} - e^{-\lambda}}{e^{\lambda} + e^{-\lambda}} - \frac{e^{-\lambda} - e^{\lambda}}{e^{-\lambda} + e^{\lambda}}\right] =$$
$$\frac{1}{4}\cdot\frac{2\left(e^{\lambda} - e^{-\lambda}\right)}{e^{\lambda} + e^{-\lambda}} = \frac{1}{2}\cdot\frac{e^{\lambda} - e^{-\lambda}}{e^{\lambda} + e^{-\lambda}} = \frac{\tanh \lambda}{2}.$$
Conclusion: the maximum value of $M_{q,\lambda}$ is
$$M_{q,\lambda}\left(\frac{\ln q}{2\lambda}\right) = \frac{\tanh \lambda}{2}, \qquad \lambda > 0.$$
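A short numerical confirmation (a sketch for illustration, not part of the paper) of the bell shape and of the maximum value just computed:
```python
import numpy as np

def g(x, q, lam):
    return (np.exp(lam * x) - q * np.exp(-lam * x)) / (np.exp(lam * x) + q * np.exp(-lam * x))

def M(x, q, lam):
    # density M_{q,lambda}(x) = (1/4)(g_{q,lambda}(x+1) - g_{q,lambda}(x-1))
    return 0.25 * (g(x + 1, q, lam) - g(x - 1, q, lam))

q, lam = 4.0, 0.9
xs = np.linspace(-20.0, 20.0, 40001)
vals = M(xs, q, lam)
x_star = np.log(q) / (2 * lam)
print(xs[np.argmax(vals)], x_star)           # argmax is near ln(q)/(2*lambda)
print(vals.max(), np.tanh(lam) / 2)          # maximum equals tanh(lambda)/2
print(M(-1.7, q, lam), M(1.7, 1 / q, lam))   # deformed symmetry M_q(-x) = M_{1/q}(x)
```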
We mention
Theorem 1
([16]). We have that
$$\sum_{i=-\infty}^{\infty} M_{q,\lambda}(x - i) = 1, \quad \forall\, x \in \mathbb{R};\ \lambda, q > 0.$$
Thus,
$$\sum_{i=-\infty}^{\infty} M_{q,\lambda}(n x - i) = 1, \quad \forall\, n \in \mathbb{N},\ \forall\, x \in \mathbb{R}.$$
Similarly, it holds
$$\sum_{i=-\infty}^{\infty} M_{\frac{1}{q},\lambda}(x - i) = 1, \quad \forall\, x \in \mathbb{R}.$$
However, $M_{\frac{1}{q},\lambda}(x - i) = M_{q,\lambda}(i - x)$, ∀ x ∈ ℝ, by the deformed symmetry established above.
Hence,
$$\sum_{i=-\infty}^{\infty} M_{q,\lambda}(i - x) = 1, \quad \forall\, x \in \mathbb{R},$$
and
$$\sum_{i=-\infty}^{\infty} M_{q,\lambda}(i + x) = 1, \quad \forall\, x \in \mathbb{R}.$$
It follows
Theorem 2
([16]). It holds
$$\int_{-\infty}^{\infty} M_{q,\lambda}(x)\, dx = 1, \qquad \lambda, q > 0.$$
So $M_{q,\lambda}$ is a density function on ℝ; λ, q > 0.
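The partition of unity of Theorem 1 and the normalization of Theorem 2 are easy to check numerically (a sketch only; the infinite sum and the integral are truncated, which is harmless because of the exponential decay of $M_{q,\lambda}$):
```python
import numpy as np

def g(x, q, lam):
    return (np.exp(lam * x) - q * np.exp(-lam * x)) / (np.exp(lam * x) + q * np.exp(-lam * x))

def M(x, q, lam):
    return 0.25 * (g(x + 1, q, lam) - g(x - 1, q, lam))

q, lam = 2.5, 1.1

# Theorem 1 (truncated): sum_i M(x - i) = 1 for any real x
x = 0.37
i = np.arange(-200, 201)
print(np.sum(M(x - i, q, lam)))               # ~ 1.0

# Theorem 2 (truncated Riemann sum): integral of M over R equals 1
t = np.linspace(-50.0, 50.0, 400001)
print(np.sum(M(t, q, lam)) * (t[1] - t[0]))   # ~ 1.0
```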
We need the following result
Theorem 3
([16]). Let 0 < α < 1, and n ∈ ℕ with $n^{1-\alpha} > 2$; q, λ > 0. Then,
$$\sum_{\substack{k=-\infty \\ |n x - k| \ge n^{1-\alpha}}}^{\infty} M_{q,\lambda}(n x - k) < \max\left\{q, \frac{1}{q}\right\} e^{4\lambda}\, e^{-2\lambda\, n^{1-\alpha}} = T\, e^{-2\lambda\, n^{1-\alpha}},$$
where $T := \max\left\{q, \frac{1}{q}\right\} e^{4\lambda}$.
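As a quick sanity check (an illustration, not part of the paper), the exponential tail bound of Theorem 3 can be verified numerically; here $g_{q,\lambda}$ is evaluated through the algebraically equivalent form $\tanh\left(\lambda x - \frac{\ln q}{2}\right)$ to avoid floating-point overflow for large arguments.
```python
import numpy as np

def g(x, q, lam):
    # equals (e^{lam x} - q e^{-lam x}) / (e^{lam x} + q e^{-lam x}), rewritten for stability
    return np.tanh(lam * x - 0.5 * np.log(q))

def M(x, q, lam):
    return 0.25 * (g(x + 1, q, lam) - g(x - 1, q, lam))

q, lam, alpha = 3.0, 0.8, 0.5
T = max(q, 1 / q) * np.exp(4 * lam)
x = 0.3
k = np.arange(-5000, 5001)
for n in (10, 50, 200):
    mask = np.abs(n * x - k) >= n ** (1 - alpha)
    tail = np.sum(M(n * x - k, q, lam)[mask])
    bound = T * np.exp(-2 * lam * n ** (1 - alpha))
    print(n, tail, bound, tail < bound)
```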
Let ⌈·⌉ denote the ceiling of a number and ⌊·⌋ its integral part.
Theorem 4
([16]). Let x ∈ [a, b] ⊂ ℝ and n ∈ ℕ so that ⌈na⌉ ≤ ⌊nb⌋. For q > 0, λ > 0, we consider the number $\lambda_q > z_0 > 0$ with $M_{q,\lambda}(z_0) = M_{q,\lambda}(0)$, and $\lambda_q > 1$. Then,
$$\frac{1}{\sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} M_{q,\lambda}(n x - k)} < \max\left\{\frac{1}{M_{q,\lambda}(\lambda_q)}, \frac{1}{M_{\frac{1}{q},\lambda}\left(\lambda_{\frac{1}{q}}\right)}\right\} =: \Delta(q).$$
We also mention
Remark 1
([16]). (i) We have that
$$\lim_{n \to +\infty} \sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} M_{q,\lambda}(n x - k) \ne 1, \quad \text{for at least some } x \in [a, b],$$
where λ, q > 0.
(ii) Let [a, b] ⊂ ℝ. For large enough n, we always have ⌈na⌉ ≤ ⌊nb⌋. Furthermore, a ≤ k/n ≤ b iff ⌈na⌉ ≤ k ≤ ⌊nb⌋. In general, it holds
$$\sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} M_{q,\lambda}(n x - k) \le 1.$$
Let (X, ‖·‖) be a Banach space.
Definition 1.
Let f ∈ C([a, b], X) and n ∈ ℕ : ⌈na⌉ ≤ ⌊nb⌋. We introduce and define the X-valued linear neural network operators
$$H_n(f, x) := \frac{\sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} f\left(\frac{k}{n}\right) M_{q,\lambda}(n x - k)}{\sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} M_{q,\lambda}(n x - k)}, \quad x \in [a, b];\ q > 0,\ q \ne 1.$$
For large enough n, we always obtain ⌈na⌉ ≤ ⌊nb⌋. Furthermore, a ≤ k/n ≤ b iff ⌈na⌉ ≤ k ≤ ⌊nb⌋. The same $H_n$ is used for real-valued functions. We study here the pointwise and uniform convergence of $H_n(f, x)$ to $f(x)$ with rates.
For convenience, we also call
$$H_n^*(f, x) := \sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} f\left(\frac{k}{n}\right) M_{q,\lambda}(n x - k)$$
(the same $H_n^*$ can be defined for real-valued functions), that is,
$$H_n(f, x) = \frac{H_n^*(f, x)}{\sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} M_{q,\lambda}(n x - k)}.$$
So that
$$H_n(f, x) - f(x) = \frac{H_n^*(f, x)}{\sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} M_{q,\lambda}(n x - k)} - f(x) =$$
$$\frac{H_n^*(f, x) - f(x) \sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} M_{q,\lambda}(n x - k)}{\sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} M_{q,\lambda}(n x - k)}.$$
Consequently, we derive that
$$\left\|H_n(f, x) - f(x)\right\| \le \Delta(q) \left\|H_n^*(f, x) - f(x) \sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} M_{q,\lambda}(n x - k)\right\| =$$
$$\Delta(q) \left\|\sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} \left(f\left(\frac{k}{n}\right) - f(x)\right) M_{q,\lambda}(n x - k)\right\|,$$
where $\Delta(q)$ is as in Theorem 4.
We will estimate the right hand side of the last quantity.
For that we need, for f ∈ C([a, b], X), the first modulus of continuity
$$\omega_1(f, \delta) := \sup_{\substack{x, y \in [a, b] \\ |x - y| \le \delta}} \left\|f(x) - f(y)\right\|, \quad \delta > 0.$$
Similarly, $\omega_1$ is defined for $f \in C_{uB}(\mathbb{R}, X)$ (uniformly continuous and bounded functions from ℝ into X), for $f \in C_B(\mathbb{R}, X)$ (continuous and bounded X-valued functions), and for $f \in C_u(\mathbb{R}, X)$ (uniformly continuous functions).
The fact that f ∈ C([a, b], X) or $f \in C_u(\mathbb{R}, X)$ is equivalent to $\lim_{\delta \to 0} \omega_1(f, \delta) = 0$; see [18].
We make
Definition 2.
When $f \in C_{uB}(\mathbb{R}, X)$ or $f \in C_B(\mathbb{R}, X)$, we define
$$\overline{H}_n(f, x) := \sum_{k=-\infty}^{\infty} f\left(\frac{k}{n}\right) M_{q,\lambda}(n x - k),$$
n ∈ ℕ, ∀ x ∈ ℝ, the X-valued quasi-interpolation neural network operator.
We give
Remark 2.
We have that
$$\left\|f\left(\frac{k}{n}\right)\right\| \le \|f\|_{\infty,\mathbb{R}} < +\infty,$$
and
$$\left\|f\left(\frac{k}{n}\right)\right\| M_{q,\lambda}(n x - k) \le \|f\|_{\infty,\mathbb{R}}\, M_{q,\lambda}(n x - k),$$
and
$$\sum_{k=-\lambda}^{\lambda} \left\|f\left(\frac{k}{n}\right)\right\| M_{q,\lambda}(n x - k) \le \|f\|_{\infty,\mathbb{R}} \sum_{k=-\lambda}^{\lambda} M_{q,\lambda}(n x - k),$$
and, finally,
$$\sum_{k=-\infty}^{\infty} \left\|f\left(\frac{k}{n}\right)\right\| M_{q,\lambda}(n x - k) \le \|f\|_{\infty,\mathbb{R}},$$
a convergent series in ℝ.
So, the series $\sum_{k=-\infty}^{\infty} f\left(\frac{k}{n}\right) M_{q,\lambda}(n x - k)$ is absolutely convergent in X, hence it is convergent in X, and $\overline{H}_n(f, x) \in X$. We denote $\|f\|_{\infty} := \sup_{x \in [a, b]} \|f(x)\|$ for f ∈ C([a, b], X); it is defined similarly for $f \in C_B(\mathbb{R}, X)$.
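Before turning to the main results, the following sketch (illustration only; real-valued case X = ℝ, with a hypothetical sample function and parameters) implements the operator $H_n$ of Definition 1 and shows its error shrinking as n grows; $g_{q,\lambda}$ is again evaluated through the equivalent stable form $\tanh\left(\lambda x - \frac{\ln q}{2}\right)$.
```python
import numpy as np
from math import ceil, floor

def g(x, q, lam):
    return np.tanh(lam * x - 0.5 * np.log(q))   # = (e^{lam x} - q e^{-lam x})/(e^{lam x} + q e^{-lam x})

def M(x, q, lam):
    return 0.25 * (g(x + 1, q, lam) - g(x - 1, q, lam))

def H_n(f, x, n, a, b, q=2.0, lam=1.0):
    # Definition 1: H_n(f, x) = sum_k f(k/n) M(nx - k) / sum_k M(nx - k),
    # with k ranging over ceil(n a), ..., floor(n b).
    k = np.arange(ceil(n * a), floor(n * b) + 1)
    w = M(n * x - k, q, lam)
    return np.dot(f(k / n), w) / np.sum(w)

f = lambda t: np.sin(3 * t) + t ** 2            # a sample continuous function on [0, 1]
a, b, x = 0.0, 1.0, 0.4
for n in (10, 100, 1000):
    print(n, abs(H_n(f, x, n, a, b) - f(x)))    # pointwise error decreases with n
```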

3. Main Results

We present a collection of X-valued neural network approximations, with rates, to a given function.
Theorem 5.
Let f ∈ C([a, b], X), 0 < α < 1, n ∈ ℕ : $n^{1-\alpha} > 2$, q > 0, q ≠ 1, x ∈ [a, b]. Then,
(i)
$$\left\|H_n(f, x) - f(x)\right\| \le \Delta(q)\left[\omega_1\left(f, \frac{1}{n^{\alpha}}\right) + 2\, \|f\|_{\infty}\, T\, e^{-2\lambda\, n^{1-\alpha}}\right] =: \tau,$$
where T is as in Theorem 3,
and
(ii)
$$\left\|H_n f - f\right\|_{\infty} \le \tau.$$
We obtain that $\lim_{n \to \infty} H_n f = f$, pointwise and uniformly.
Proof. 
We see that
$$\left\|\sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} \left(f\left(\frac{k}{n}\right) - f(x)\right) M_{q,\lambda}(n x - k)\right\|$$
$$\le \sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} \left\|f\left(\frac{k}{n}\right) - f(x)\right\| M_{q,\lambda}(n x - k) =$$
$$\sum_{\substack{k=\lceil na\rceil \\ \left|\frac{k}{n} - x\right| \le \frac{1}{n^{\alpha}}}}^{\lfloor nb\rfloor} \left\|f\left(\frac{k}{n}\right) - f(x)\right\| M_{q,\lambda}(n x - k) \;+\; \sum_{\substack{k=\lceil na\rceil \\ \left|\frac{k}{n} - x\right| > \frac{1}{n^{\alpha}}}}^{\lfloor nb\rfloor} \left\|f\left(\frac{k}{n}\right) - f(x)\right\| M_{q,\lambda}(n x - k)$$
$$\le \sum_{\substack{k=\lceil na\rceil \\ \left|\frac{k}{n} - x\right| \le \frac{1}{n^{\alpha}}}}^{\lfloor nb\rfloor} \omega_1\left(f, \left|\frac{k}{n} - x\right|\right) M_{q,\lambda}(n x - k) \;+\; 2\, \|f\|_{\infty} \sum_{\substack{k=\lceil na\rceil \\ |n x - k| > n^{1-\alpha}}}^{\lfloor nb\rfloor} M_{q,\lambda}(n x - k)$$
$$\le \omega_1\left(f, \frac{1}{n^{\alpha}}\right) \sum_{\substack{k=-\infty \\ \left|\frac{k}{n} - x\right| \le \frac{1}{n^{\alpha}}}}^{\infty} M_{q,\lambda}(n x - k) \;+\; 2\, \|f\|_{\infty} \sum_{\substack{k=-\infty \\ |n x - k| > n^{1-\alpha}}}^{\infty} M_{q,\lambda}(n x - k) \quad (\text{by Theorem 3})$$
$$\le \omega_1\left(f, \frac{1}{n^{\alpha}}\right) + 2\, \|f\|_{\infty}\, T\, e^{-2\lambda\, n^{1-\alpha}}.$$
That is,
$$\left\|\sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} \left(f\left(\frac{k}{n}\right) - f(x)\right) M_{q,\lambda}(n x - k)\right\| \le \omega_1\left(f, \frac{1}{n^{\alpha}}\right) + 2\, \|f\|_{\infty}\, T\, e^{-2\lambda\, n^{1-\alpha}}.$$
Combining the last inequality with the estimate $\left\|H_n(f, x) - f(x)\right\| \le \Delta(q) \left\|\sum_{k=\lceil na\rceil}^{\lfloor nb\rfloor} \left(f\left(\frac{k}{n}\right) - f(x)\right) M_{q,\lambda}(n x - k)\right\|$ derived in Section 2, we obtain (i). □
Next we give
Theorem 6.
Let $f \in C_B(\mathbb{R}, X)$, 0 < α < 1, q > 0, q ≠ 1, n ∈ ℕ : $n^{1-\alpha} > 2$, x ∈ ℝ. Then
(i)
$$\left\|\overline{H}_n(f, x) - f(x)\right\| \le \omega_1\left(f, \frac{1}{n^{\alpha}}\right) + 2\, \|f\|_{\infty}\, T\, e^{-2\lambda\, n^{1-\alpha}} =: \gamma,$$
and
(ii)
$$\left\|\overline{H}_n f - f\right\|_{\infty} \le \gamma.$$
For $f \in C_{uB}(\mathbb{R}, X)$, we obtain $\lim_{n \to \infty} \overline{H}_n f = f$, pointwise and uniformly.
Proof. 
We observe that
$$\left\|\overline{H}_n(f, x) - f(x)\right\| = \left\|\sum_{k=-\infty}^{\infty} f\left(\frac{k}{n}\right) M_{q,\lambda}(n x - k) - f(x) \sum_{k=-\infty}^{\infty} M_{q,\lambda}(n x - k)\right\| =$$
(using the partition of unity of Theorem 1)
$$\left\|\sum_{k=-\infty}^{\infty} \left(f\left(\frac{k}{n}\right) - f(x)\right) M_{q,\lambda}(n x - k)\right\|$$
$$\le \sum_{k=-\infty}^{\infty} \left\|f\left(\frac{k}{n}\right) - f(x)\right\| M_{q,\lambda}(n x - k) =$$
$$\sum_{\substack{k=-\infty \\ \left|\frac{k}{n} - x\right| \le \frac{1}{n^{\alpha}}}}^{\infty} \left\|f\left(\frac{k}{n}\right) - f(x)\right\| M_{q,\lambda}(n x - k) \;+\; \sum_{\substack{k=-\infty \\ \left|\frac{k}{n} - x\right| > \frac{1}{n^{\alpha}}}}^{\infty} \left\|f\left(\frac{k}{n}\right) - f(x)\right\| M_{q,\lambda}(n x - k)$$
$$\le \sum_{\substack{k=-\infty \\ \left|\frac{k}{n} - x\right| \le \frac{1}{n^{\alpha}}}}^{\infty} \omega_1\left(f, \left|\frac{k}{n} - x\right|\right) M_{q,\lambda}(n x - k) \;+\; 2\, \|f\|_{\infty} \sum_{\substack{k=-\infty \\ \left|\frac{k}{n} - x\right| > \frac{1}{n^{\alpha}}}}^{\infty} M_{q,\lambda}(n x - k)$$
$$\le \omega_1\left(f, \frac{1}{n^{\alpha}}\right) \sum_{\substack{k=-\infty \\ \left|\frac{k}{n} - x\right| \le \frac{1}{n^{\alpha}}}}^{\infty} M_{q,\lambda}(n x - k) + 2\, \|f\|_{\infty}\, T\, e^{-2\lambda\, n^{1-\alpha}}$$
$$\le \omega_1\left(f, \frac{1}{n^{\alpha}}\right) + 2\, \|f\|_{\infty}\, T\, e^{-2\lambda\, n^{1-\alpha}},$$
proving the claim. □
We need the X-valued Taylor’s formula in an appropriate form:
Theorem 7
([19,20]). Let N ∈ ℕ and $f \in C^N([a, b], X)$, where [a, b] ⊂ ℝ and X is a Banach space. Let any x, y ∈ [a, b]. Then,
$$f(x) = \sum_{i=0}^{N} \frac{(x - y)^i}{i!}\, f^{(i)}(y) + \frac{1}{(N-1)!} \int_y^x (x - t)^{N-1} \left(f^{(N)}(t) - f^{(N)}(y)\right) dt.$$
The derivatives $f^{(i)}$, i ∈ ℕ, are defined like the numerical ones; see [21], p. 83. The integral $\int_y^x$ in the formula above is of the Bochner type; see [22].
By [20,23] we have that: if f ∈ C([a, b], X), then $f \in L_{\infty}([a, b], X)$ and $f \in L_1([a, b], X)$.
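A quick numerical check (a sketch only, not part of the paper) of the vector-valued Taylor formula of Theorem 7, taking X = ℝ² and the hypothetical test function f(t) = (sin t, cos t), whose derivatives are known in closed form; the Bochner integral reduces here to a componentwise integral, approximated by a Riemann sum.
```python
import numpy as np
from math import factorial

def deriv(t, i):
    # i-th derivative of f(t) = (sin t, cos t)
    return np.array([np.sin(t + i * np.pi / 2), np.cos(t + i * np.pi / 2)])

N, y, x = 3, 0.2, 1.1
taylor = sum((x - y) ** i / factorial(i) * deriv(y, i) for i in range(N + 1))

# remainder (1/(N-1)!) * int_y^x (x - t)^(N-1) (f^(N)(t) - f^(N)(y)) dt, componentwise
t = np.linspace(y, x, 20001)
dt = t[1] - t[0]
integrand = (x - t)[:, None] ** (N - 1) * (deriv(t, N).T - deriv(y, N))
remainder = integrand[:-1].sum(axis=0) * dt / factorial(N - 1)

f_x = np.array([np.sin(x), np.cos(x)])
print(np.linalg.norm(f_x - (taylor + remainder)))   # small (discretization error only)
```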
Next, we discuss high-order neural network X-valued approximation using the smoothness of f.
Theorem 8.
Let $f \in C^N([a, b], X)$, n, N ∈ ℕ, q > 0, q ≠ 1, 0 < α < 1, x ∈ [a, b], and $n^{1-\alpha} > 2$. Then,
(i)
$$\left\|H_n(f, x) - f(x)\right\| \le \Delta(q)\left\{\sum_{j=1}^{N} \frac{\left\|f^{(j)}(x)\right\|}{j!}\left[\frac{1}{n^{\alpha j}} + (b - a)^j\, T\, e^{-2\lambda\, n^{1-\alpha}}\right] + \omega_1\left(f^{(N)}, \frac{1}{n^{\alpha}}\right) \frac{1}{n^{\alpha N}\, N!} + \frac{2 \left\|f^{(N)}\right\|_{\infty} (b - a)^N}{N!}\, T\, e^{-2\lambda\, n^{1-\alpha}}\right\},$$
(ii) assume further that $f^{(j)}(x_0) = 0$, j = 1, …, N, for some $x_0 \in [a, b]$; it holds
$$\left\|H_n(f, x_0) - f(x_0)\right\| \le \Delta(q)\left[\omega_1\left(f^{(N)}, \frac{1}{n^{\alpha}}\right) \frac{1}{n^{\alpha N}\, N!} + \frac{2 \left\|f^{(N)}\right\|_{\infty} (b - a)^N}{N!}\, T\, e^{-2\lambda\, n^{1-\alpha}}\right],$$
and
(iii)
$$\left\|H_n f - f\right\|_{\infty} \le \Delta(q)\left\{\sum_{j=1}^{N} \frac{\left\|f^{(j)}\right\|_{\infty}}{j!}\left[\frac{1}{n^{\alpha j}} + (b - a)^j\, T\, e^{-2\lambda\, n^{1-\alpha}}\right] + \omega_1\left(f^{(N)}, \frac{1}{n^{\alpha}}\right) \frac{1}{n^{\alpha N}\, N!} + \frac{2 \left\|f^{(N)}\right\|_{\infty} (b - a)^N}{N!}\, T\, e^{-2\lambda\, n^{1-\alpha}}\right\}.$$
Again we obtain $\lim_{n \to \infty} H_n f = f$, pointwise and uniformly.
Proof. 
It is lengthy and, being similar to [24], it is omitted. □
All integrals from now on are of Bochner type [22].
We need
Definition 3
([20]). Let [a, b] ⊂ ℝ, X be a Banach space, α > 0; m = ⌈α⌉ ∈ ℕ (⌈·⌉ is the ceiling of the number), f : [a, b] → X. We assume that $f^{(m)} \in L_1([a, b], X)$. We call the Caputo–Bochner left fractional derivative of order α:
$$\left(D_{*a}^{\alpha} f\right)(x) := \frac{1}{\Gamma(m - \alpha)} \int_a^x (x - t)^{m - \alpha - 1} f^{(m)}(t)\, dt, \quad \forall\, x \in [a, b].$$
If α ∈ ℕ, we set $D_{*a}^{\alpha} f := f^{(m)}$, the ordinary X-valued derivative (defined similarly to the numerical one; see [21], p. 83), and we also set $D_{*a}^{0} f := f$.
By [19], $\left(D_{*a}^{\alpha} f\right)(x)$ exists almost everywhere in x ∈ [a, b], and $D_{*a}^{\alpha} f \in L_1([a, b], X)$.
If $\left\|f^{(m)}\right\|_{L_{\infty}([a, b], X)} < \infty$, then by [23], $D_{*a}^{\alpha} f \in C([a, b], X)$, hence $\left\|D_{*a}^{\alpha} f\right\| \in C([a, b])$.
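For illustration only (not part of the paper), the Caputo–Bochner left fractional derivative above can be approximated numerically in the scalar case X = ℝ; the hypothetical test function $f(t) = (t - a)^2$ with α = 1/2 has the known closed form $D_{*a}^{1/2} f(x) = \frac{\Gamma(3)}{\Gamma(5/2)} (x - a)^{3/2}$, which the sketch reproduces up to discretization error.
```python
import numpy as np
from math import gamma

def caputo_left(f_m, a, x, alpha, m, num=200001):
    # (D_{*a}^alpha f)(x) = 1/Gamma(m - alpha) * int_a^x (x - t)^(m - alpha - 1) f^(m)(t) dt
    # Midpoint rule; the endpoint singularity at t = x is integrable since m - alpha - 1 > -1.
    t = np.linspace(a, x, num)
    mid = 0.5 * (t[:-1] + t[1:])
    dt = t[1] - t[0]
    return np.sum((x - mid) ** (m - alpha - 1) * f_m(mid)) * dt / gamma(m - alpha)

a, x, alpha, m = 0.0, 0.8, 0.5, 1
approx = caputo_left(lambda t: 2 * (t - a), a, x, alpha, m)   # f(t) = (t - a)^2, f'(t) = 2(t - a)
exact = gamma(3) / gamma(3 - alpha) * (x - a) ** (2 - alpha)
print(approx, exact)    # close agreement
```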
We mention
Definition 4
([19]). Let [a, b] ⊂ ℝ, X be a Banach space, α > 0, m := ⌈α⌉. We assume that $f^{(m)} \in L_1([a, b], X)$, where f : [a, b] → X. We call the Caputo–Bochner right fractional derivative of order α:
$$\left(D_{b-}^{\alpha} f\right)(x) := \frac{(-1)^m}{\Gamma(m - \alpha)} \int_x^b (z - x)^{m - \alpha - 1} f^{(m)}(z)\, dz, \quad \forall\, x \in [a, b].$$
We observe that $\left(D_{b-}^{m} f\right)(x) = (-1)^m f^{(m)}(x)$, for m ∈ ℕ, and $\left(D_{b-}^{0} f\right)(x) = f(x)$.
By [19], $\left(D_{b-}^{\alpha} f\right)(x)$ exists almost everywhere on [a, b], and $D_{b-}^{\alpha} f \in L_1([a, b], X)$.
If $\left\|f^{(m)}\right\|_{L_{\infty}([a, b], X)} < \infty$, and α ∉ ℕ, by [19], $D_{b-}^{\alpha} f \in C([a, b], X)$, hence $\left\|D_{b-}^{\alpha} f\right\| \in C([a, b])$.
We make
Remark 3
([18]). Let $f \in C^{n-1}([a, b], X)$, $f^{(n)} \in L_{\infty}([a, b], X)$, n = ⌈ν⌉, ν > 0, ν ∉ ℕ. Then,
$$\left\|\left(D_{*a}^{\nu} f\right)(x)\right\| \le \frac{\left\|f^{(n)}\right\|_{L_{\infty}([a, b], X)}}{\Gamma(n - \nu + 1)} (x - a)^{n - \nu}, \quad \forall\, x \in [a, b].$$
Thus, we observe
$$\omega_1\left(D_{*a}^{\nu} f, \delta\right) = \sup_{\substack{x, y \in [a, b] \\ |x - y| \le \delta}} \left\|\left(D_{*a}^{\nu} f\right)(x) - \left(D_{*a}^{\nu} f\right)(y)\right\|$$
$$\le \sup_{\substack{x, y \in [a, b] \\ |x - y| \le \delta}} \left[\frac{\left\|f^{(n)}\right\|_{L_{\infty}([a, b], X)}}{\Gamma(n - \nu + 1)} (x - a)^{n - \nu} + \frac{\left\|f^{(n)}\right\|_{L_{\infty}([a, b], X)}}{\Gamma(n - \nu + 1)} (y - a)^{n - \nu}\right]$$
$$\le \frac{2 \left\|f^{(n)}\right\|_{L_{\infty}([a, b], X)}}{\Gamma(n - \nu + 1)} (b - a)^{n - \nu}.$$
Consequently,
$$\omega_1\left(D_{*a}^{\nu} f, \delta\right) \le \frac{2 \left\|f^{(n)}\right\|_{L_{\infty}([a, b], X)}}{\Gamma(n - \nu + 1)} (b - a)^{n - \nu}.$$
Similarly, let $f \in C^{m-1}([a, b], X)$, $f^{(m)} \in L_{\infty}([a, b], X)$, m = ⌈α⌉, α > 0, α ∉ ℕ; then
$$\omega_1\left(D_{b-}^{\alpha} f, \delta\right) \le \frac{2 \left\|f^{(m)}\right\|_{L_{\infty}([a, b], X)}}{\Gamma(m - \alpha + 1)} (b - a)^{m - \alpha}.$$
So, for $f \in C^{m-1}([a, b], X)$, $f^{(m)} \in L_{\infty}([a, b], X)$, m = ⌈α⌉, α > 0, α ∉ ℕ, we find
$$\sup_{x_0 \in [a, b]} \omega_1\left(D_{*x_0}^{\alpha} f, \delta\right)_{[x_0, b]} \le \frac{2 \left\|f^{(m)}\right\|_{L_{\infty}([a, b], X)}}{\Gamma(m - \alpha + 1)} (b - a)^{m - \alpha},$$
and
$$\sup_{x_0 \in [a, b]} \omega_1\left(D_{x_0-}^{\alpha} f, \delta\right)_{[a, x_0]} \le \frac{2 \left\|f^{(m)}\right\|_{L_{\infty}([a, b], X)}}{\Gamma(m - \alpha + 1)} (b - a)^{m - \alpha}.$$
By [20] we obtain that $D_{*x_0}^{\alpha} f \in C([x_0, b], X)$, and by [19] we obtain that $D_{x_0-}^{\alpha} f \in C([a, x_0], X)$.
We present the following X-valued fractional approximation result by neural networks.
Theorem 9.
Let α > 0, q > 0, q ≠ 1, N = ⌈α⌉, α ∉ ℕ, $f \in C^N([a, b], X)$, 0 < β < 1, x ∈ [a, b], n ∈ ℕ : $n^{1-\beta} > 2$. Then,
(i)
$$\left\|H_n(f, x) - \sum_{j=1}^{N-1} \frac{f^{(j)}(x)}{j!}\, H_n\left((\cdot - x)^j\right)(x) - f(x)\right\| \le \frac{\Delta(q)}{\Gamma(\alpha + 1)}\left\{\frac{\omega_1\left(D_{x-}^{\alpha} f, \frac{1}{n^{\beta}}\right)_{[a, x]} + \omega_1\left(D_{*x}^{\alpha} f, \frac{1}{n^{\beta}}\right)_{[x, b]}}{n^{\alpha\beta}} + T\, e^{-2\lambda\, n^{1-\beta}}\left[\left\|D_{x-}^{\alpha} f\right\|_{\infty, [a, x]} (x - a)^{\alpha} + \left\|D_{*x}^{\alpha} f\right\|_{\infty, [x, b]} (b - x)^{\alpha}\right]\right\},$$
(ii) if $f^{(j)}(x) = 0$, for j = 1, …, N − 1, we have
$$\left\|H_n(f, x) - f(x)\right\| \le \frac{\Delta(q)}{\Gamma(\alpha + 1)}\left\{\frac{\omega_1\left(D_{x-}^{\alpha} f, \frac{1}{n^{\beta}}\right)_{[a, x]} + \omega_1\left(D_{*x}^{\alpha} f, \frac{1}{n^{\beta}}\right)_{[x, b]}}{n^{\alpha\beta}} + T\, e^{-2\lambda\, n^{1-\beta}}\left[\left\|D_{x-}^{\alpha} f\right\|_{\infty, [a, x]} (x - a)^{\alpha} + \left\|D_{*x}^{\alpha} f\right\|_{\infty, [x, b]} (b - x)^{\alpha}\right]\right\},$$
(iii)
$$\left\|H_n(f, x) - f(x)\right\| \le \Delta(q)\left\{\sum_{j=1}^{N-1} \frac{\left\|f^{(j)}(x)\right\|}{j!}\left[\frac{1}{n^{\beta j}} + (b - a)^j\, T\, e^{-2\lambda\, n^{1-\beta}}\right] + \frac{1}{\Gamma(\alpha + 1)}\left[\frac{\omega_1\left(D_{x-}^{\alpha} f, \frac{1}{n^{\beta}}\right)_{[a, x]} + \omega_1\left(D_{*x}^{\alpha} f, \frac{1}{n^{\beta}}\right)_{[x, b]}}{n^{\alpha\beta}} + T\, e^{-2\lambda\, n^{1-\beta}}\left(\left\|D_{x-}^{\alpha} f\right\|_{\infty, [a, x]} (x - a)^{\alpha} + \left\|D_{*x}^{\alpha} f\right\|_{\infty, [x, b]} (b - x)^{\alpha}\right)\right]\right\},$$
∀ x ∈ [a, b],
and
(iv)
$$\left\|H_n f - f\right\|_{\infty} \le \Delta(q)\left\{\sum_{j=1}^{N-1} \frac{\left\|f^{(j)}\right\|_{\infty}}{j!}\left[\frac{1}{n^{\beta j}} + (b - a)^j\, T\, e^{-2\lambda\, n^{1-\beta}}\right] + \frac{1}{\Gamma(\alpha + 1)}\left[\frac{\sup_{x \in [a, b]} \omega_1\left(D_{x-}^{\alpha} f, \frac{1}{n^{\beta}}\right)_{[a, x]} + \sup_{x \in [a, b]} \omega_1\left(D_{*x}^{\alpha} f, \frac{1}{n^{\beta}}\right)_{[x, b]}}{n^{\alpha\beta}} + T\, e^{-2\lambda\, n^{1-\beta}} (b - a)^{\alpha}\left(\sup_{x \in [a, b]} \left\|D_{x-}^{\alpha} f\right\|_{\infty, [a, x]} + \sup_{x \in [a, b]} \left\|D_{*x}^{\alpha} f\right\|_{\infty, [x, b]}\right)\right]\right\}.$$
Above, when N = 1, the sum $\sum_{j=1}^{N-1} (\cdot) = 0$.
As we see, here we obtain X-valued fractional-type pointwise and uniform convergence with rates of $H_n \to I$, the unit operator, as n → ∞.
Proof. 
The proof is very lengthy and similar to [24]; therefore, it is omitted. □
Next we apply Theorem 9 for N = 1 .
Theorem 10.
Let 0 < α, β < 1, q > 0, q ≠ 1, $f \in C^1([a, b], X)$, x ∈ [a, b], n ∈ ℕ : $n^{1-\beta} > 2$. Then
(i)
$$\left\|H_n(f, x) - f(x)\right\| \le \frac{\Delta(q)}{\Gamma(\alpha + 1)}\left\{\frac{\omega_1\left(D_{x-}^{\alpha} f, \frac{1}{n^{\beta}}\right)_{[a, x]} + \omega_1\left(D_{*x}^{\alpha} f, \frac{1}{n^{\beta}}\right)_{[x, b]}}{n^{\alpha\beta}} + T\, e^{-2\lambda\, n^{1-\beta}}\left[\left\|D_{x-}^{\alpha} f\right\|_{\infty, [a, x]} (x - a)^{\alpha} + \left\|D_{*x}^{\alpha} f\right\|_{\infty, [x, b]} (b - x)^{\alpha}\right]\right\},$$
and
(ii)
$$\left\|H_n f - f\right\|_{\infty} \le \frac{\Delta(q)}{\Gamma(\alpha + 1)}\left\{\frac{\sup_{x \in [a, b]} \omega_1\left(D_{x-}^{\alpha} f, \frac{1}{n^{\beta}}\right)_{[a, x]} + \sup_{x \in [a, b]} \omega_1\left(D_{*x}^{\alpha} f, \frac{1}{n^{\beta}}\right)_{[x, b]}}{n^{\alpha\beta}} + (b - a)^{\alpha}\, T\, e^{-2\lambda\, n^{1-\beta}}\left(\sup_{x \in [a, b]} \left\|D_{x-}^{\alpha} f\right\|_{\infty, [a, x]} + \sup_{x \in [a, b]} \left\|D_{*x}^{\alpha} f\right\|_{\infty, [x, b]}\right)\right\}.$$
When α = 1 2 we derive
Corollary 1.
Let 0 < β < 1, q > 0, q ≠ 1, $f \in C^1([a, b], X)$, x ∈ [a, b], n ∈ ℕ : $n^{1-\beta} > 2$. Then
(i)
$$\left\|H_n(f, x) - f(x)\right\| \le \frac{2\, \Delta(q)}{\sqrt{\pi}}\left\{\frac{\omega_1\left(D_{x-}^{\frac{1}{2}} f, \frac{1}{n^{\beta}}\right)_{[a, x]} + \omega_1\left(D_{*x}^{\frac{1}{2}} f, \frac{1}{n^{\beta}}\right)_{[x, b]}}{n^{\frac{\beta}{2}}} + T\, e^{-2\lambda\, n^{1-\beta}}\left[\left\|D_{x-}^{\frac{1}{2}} f\right\|_{\infty, [a, x]} \sqrt{x - a} + \left\|D_{*x}^{\frac{1}{2}} f\right\|_{\infty, [x, b]} \sqrt{b - x}\right]\right\},$$
and
(ii)
$$\left\|H_n f - f\right\|_{\infty} \le \frac{2\, \Delta(q)}{\sqrt{\pi}}\left\{\frac{\sup_{x \in [a, b]} \omega_1\left(D_{x-}^{\frac{1}{2}} f, \frac{1}{n^{\beta}}\right)_{[a, x]} + \sup_{x \in [a, b]} \omega_1\left(D_{*x}^{\frac{1}{2}} f, \frac{1}{n^{\beta}}\right)_{[x, b]}}{n^{\frac{\beta}{2}}} + \sqrt{b - a}\, T\, e^{-2\lambda\, n^{1-\beta}}\left(\sup_{x \in [a, b]} \left\|D_{x-}^{\frac{1}{2}} f\right\|_{\infty, [a, x]} + \sup_{x \in [a, b]} \left\|D_{*x}^{\frac{1}{2}} f\right\|_{\infty, [x, b]}\right)\right\} < \infty.$$
We make
Remark 4.
Some convergence analysis follows based on Corollary 1.
Let 0 < β < 1, λ > 0, $f \in C^1([a, b], X)$, x ∈ [a, b], n ∈ ℕ : $n^{1-\beta} > 2$. We elaborate on Corollary 1(ii). Assume that
$$\omega_1\left(D_{x-}^{\frac{1}{2}} f, \frac{1}{n^{\beta}}\right)_{[a, x]} \le \frac{R_1}{n^{\beta}},$$
and
$$\omega_1\left(D_{*x}^{\frac{1}{2}} f, \frac{1}{n^{\beta}}\right)_{[x, b]} \le \frac{R_2}{n^{\beta}},$$
∀ x ∈ [a, b], ∀ n ∈ ℕ, where $R_1, R_2 > 0$.
Then it holds
$$\frac{\sup_{x \in [a, b]} \omega_1\left(D_{x-}^{\frac{1}{2}} f, \frac{1}{n^{\beta}}\right)_{[a, x]} + \sup_{x \in [a, b]} \omega_1\left(D_{*x}^{\frac{1}{2}} f, \frac{1}{n^{\beta}}\right)_{[x, b]}}{n^{\frac{\beta}{2}}} \le \frac{R_1 + R_2}{n^{\beta}\, n^{\frac{\beta}{2}}} = \frac{R_1 + R_2}{n^{\frac{3\beta}{2}}} = \frac{R}{n^{\frac{3\beta}{2}}},$$
where $R := R_1 + R_2 > 0$.
The other summand of the right-hand side of Corollary 1(ii) converges to zero, for large enough n, at the speed $e^{-2\lambda\, n^{1-\beta}}$, so it is of the order $A\, e^{-2\lambda\, n^{1-\beta}}$, where A > 0 is a constant.
Then, for large enough n ∈ ℕ, by Corollary 1(ii), the estimate above and the preceding comment, we obtain that
$$\left\|H_n f - f\right\|_{\infty} \le \frac{B}{n^{\frac{3\beta}{2}}},$$
where B > 0, converging to zero at the high speed of $\frac{1}{n^{\frac{3\beta}{2}}}$.
In Theorem 5, for f ∈ C([a, b], X) and for large enough n ∈ ℕ, the speed is $\frac{1}{n^{\beta}}$. So, by the last estimate, $\|H_n f - f\|_{\infty}$ converges to zero much faster. This comes from the assumed differentiability of f. Notice that in Corollary 1 no initial condition is assumed.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Anastassiou, G.A. Rate of convergence of some neural network operators to the unit-univariate case. J. Math. Anal. Appl. 1997, 212, 237–262.
  2. Anastassiou, G.A. Quantitative Approximations; Chapman & Hall: Boca Raton, FL, USA; CRC: New York, NY, USA, 2001.
  3. Chen, Z.; Cao, F. The approximation operators with sigmoidal functions. Comput. Math. Appl. 2009, 58, 758–765.
  4. Anastassiou, G.A. Univariate hyperbolic tangent neural network approximation. Math. Comput. Model. 2011, 53, 1111–1132.
  5. Anastassiou, G.A. Multivariate hyperbolic tangent neural network approximation. Comput. Math. Appl. 2011, 61, 809–821.
  6. Anastassiou, G.A. Multivariate sigmoidal neural network approximation. Neural Netw. 2011, 24, 378–386.
  7. Anastassiou, G.A. Intelligent Systems: Approximation by Artificial Neural Networks; Intelligent Systems Reference Library; Springer: Berlin/Heidelberg, Germany, 2011; Volume 19.
  8. Anastassiou, G.A. Univariate sigmoidal neural network approximation. J. Comput. Anal. Appl. 2012, 14, 659–690.
  9. Anastassiou, G.A. Fractional neural network approximation. Comput. Math. Appl. 2012, 64, 1655–1676.
  10. Anastassiou, G.A. Intelligent Systems II: Complete Approximation by Neural Network Operators; Springer: Berlin/Heidelberg, Germany; New York, NY, USA, 2016.
  11. Anastassiou, G.A. Nonlinearity: Ordinary and Fractional Approximations by Sublinear and Max-Product Operators; Springer: Berlin/Heidelberg, Germany; New York, NY, USA, 2018.
  12. Haykin, S. Neural Networks: A Comprehensive Foundation, 2nd ed.; Prentice Hall: New York, NY, USA, 1998.
  13. McCulloch, W.; Pitts, W. A logical calculus of the ideas immanent in nervous activity. Bull. Math. Biophys. 1943, 7, 115–133.
  14. Mitchell, T.M. Machine Learning; McGraw-Hill: New York, NY, USA, 1997.
  15. Anastassiou, G.A. q-Deformed and lambda-parametrized hyperbolic tangent function based Banach space valued multivariate multi layer neural network approximations. Ann. Univ. Sci. Bp. Sect. Comp. 2023, in press.
  16. El-Shehawy, S.A.; Abdel-Salam, E.A.-B. The q-deformed hyperbolic Secant family. Intern. J. Appl. Math. Stat. 2012, 29, 51–62.
  17. Anastassiou, G.A. General sigmoid based Banach space valued neural network approximation. J. Comput. Anal. Appl. 2023, 31, 520–534.
  18. Anastassiou, G.A. Vector fractional Korovkin type approximations. Dyn. Syst. Appl. 2017, 26, 81–104.
  19. Anastassiou, G.A. Strong right fractional calculus for Banach space valued functions. Rev. Proyecc. 2017, 36, 149–186.
  20. Anastassiou, G.A. A strong fractional calculus theory for Banach space valued functions. Nonlinear Funct. Anal. Appl. 2017, 22, 495–524.
  21. Shilov, G.E. Elementary Functional Analysis; Dover Publications, Inc.: New York, NY, USA, 1996.
  22. Mikusinski, J. The Bochner Integral; Academic Press: New York, NY, USA, 1978.
  23. Kreuter, M. Sobolev Spaces of Vector-Valued Functions. Master's Thesis, Ulm University, Ulm, Germany, 2015.
  24. Anastassiou, G.A.; Karateke, S. Parametrized hyperbolic tangent induced Banach space valued ordinary and fractional neural network approximation. Progr. Fract. Differ. Appl. 2023, in press.
