Generalized Approach to Differentiability

Koceić-Bilan, Nikola; Braić, Snježana

doi:10.3390/math10173085

Open AccessEditor’s ChoiceArticle

Generalized Approach to Differentiability

by

Nikola Koceić-Bilan

^*

and

Snježana Braić

Faculty of Science, University of Split, 21000 Split, Croatia

^*

Author to whom correspondence should be addressed.

Mathematics 2022, 10(17), 3085; https://doi.org/10.3390/math10173085

Submission received: 21 July 2022 / Revised: 17 August 2022 / Accepted: 22 August 2022 / Published: 27 August 2022

Download Versions Notes

Abstract

:

In the traditional approach to differentiability, found in almost all university textbooks, this notion is considered only for interior points of the domain of function or for functions with an open domain. This approach leads to the fact that differentiability has usually been considered only for functions with an open domain in

R^{n}

, which severely limits the possibility of applying the potential techniques and tools of differential calculus to a broader class of functions. Although there is a great need for generalization of the notion of differentiability of a function in various problems of mathematical analysis and other mathematical branches, the notion of differentiability of a function at the non-interior points of its domain has almost not been considered or successfully defined. In this paper, we have generalized the differentiability of scalar and vector functions of several variables by defining it at non-interior points of the domain of the function, which include not only boundary points but also all points at which the notion of linearization is meaningful (points admitting nbd rays). This generalization allows applications in all areas where standard differentiability can be applied. With this generalized approach to differentiability, some unexpected phenomena may occur, such as a function discontinuity at a point where a function is differentiable, the non-uniqueness of differentials… However, if one reduces this theory only to points with some special properties (points admitting a linearization space with dimension equal to the dimension of the ambient Euclidean space of the domain and admitting a raylike neighborhood, which includes the interior points of a domain), then all properties and theorems belonging to the known theory of differentiability remain valid in this extended theory. For generalized differentiability, the corresponding calculus (differentiation techniques) is also provided by matrices—representatives of differentials at points. In this calculus the role of partial derivatives (which in general cannot exist for differentiable functions at some points) is taken by directional derivatives.

Keywords:

differentiability; partial derivatives; derivatives in the direction; set of linear contribution; linearization space; neighbourhood ray; raylike neighbourhood

MSC:

26B05; 26B12

1. Introduction and Motivation

One of the basic ideas of differential calculus is to better approximate a given function

f : X \to R^{m}, X \subseteq R^{n},

locally by an affine function, i.e., to linearize it at a point

P_{0} \in X

. For this to be possible, the function must be differentiable at this point which means that there exists a linear operator

A : R^{n} \to R^{m}

such that the limit

\lim_{H \to 0} \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||}

(1)

exists and is equal to

0 \in R^{m} .

For practical reasons, differentiability in mathematical analysis has been defined and considered almost only for functions

f : Ω \to R^{m}

with an open domain

Ω \subseteq R^{n}

[1,2,3,4]. Since every point of an open set

Ω \subseteq R^{n}

is an accumulation point of

Ω

[1] then for every point

P_{0} \in Ω

it holds that

0 \in R^{n}

is the accumulation point of the domain D of the function

H \mapsto \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||}

for a linear operator

A : R^{n} \to R^{m} .

Indeed, there exists

r > 0

such that the open ball

B (P_{0}, r)

is contained in

Ω

and consequently

B (0, r) \subseteq D

and the limit from the definition of differentiability (1) is reasonable to consider. (Recall that the limit of the function can be considered only at an accumulation point of the domain.)

However, reducing differentiability only to an open domain, i.e., to the interior points of a domain, has, in addition to many successful applications and advantages, some obvious deficiencies. For example, for the function

f : [0, \infty〉 \to R

f (x) = \sqrt{x^{3}}

it holds

\lim_{h \to 0^{+}} \frac{\sqrt{{(0 + h)}^{3}} - \sqrt{0^{3}} - 0 h}{|h|} = 0,

so this function can be well approximated by the zero operator, i.e., it could be linearized at the point

0 \in R

inside the natural domain of f, but due to the conditions from the definition of differentiability (that a point must belong to the interior of the domain [2]), the differentiability of the function at this point is usually not considered at all. Even though this issue can be overcome by extending the definition of differentiability (derivability) of a real function of a real variable to the endpoints of the given domain using one side limits [5], for a function of several variables the problem of differentiability at non-interior points of the domain remains current. For example, the differentiability of the function

f : D \to R f (x, y) = \sqrt{y - x^{3}}, D = \{(x, y) \in R^{2} ∣ y \geq x^{3}\}

cannot be considered in all boundary points

(x, x^{3}), x \in R

, although it can be well linearized locally by the zero operator in those points. Similarly, because of the reduction to open sets, the question of the existence of tangents [6] and tangent planes [4] of a function

f : Cl Ω \to R

,

Ω \subseteq R

or

R^{2}

, at points

(x, f (x)),

x \in Fr Ω,

remains open. For example, due to this reduction we cannot obtain the tangent of the function

x \mapsto \sqrt{x^{3}}

at the point

O = (0, 0)

although it is obvious that for points

T_{x} = (x, f (x)),

x \in 〈0, \infty〉,

the secants

O T_{x}

tend to the line

y = 0

as x tends to 0, and the line

y = 0

should be the tangent of this function at the point O. Moreover, the study of the local conditional extreme of a scalar function is reduced to the study of a function whose domain is not necessarily an open set, so that the problem of finding a conditional extreme cannot be clarified or fully studied if differentiability is studied only on open sets. Furthermore, a differentiable function would lose the property of differentiability at many points if differentiability at boundary points is not considered when switching from one Cartesian coordinate system to other non-affine coordinate systems (or vice versa).

These are some of the reasons that indicate that the notion of differentiability should be generalized by observing differentiability not only at interior points of sets, but much more broadly, at points of any domain

X \subseteq R^{n}

of a function

f : X \to R^{m}

in which the notion of differentiability and linearization is meaningful. John W. Milnor mentioned this problem in his famous series of lectures on differential topology which dates back to 1965 [7]. We will show that this extension is meaningful for all points

P_{0} \in X

for which there is at least one point

Q \in X \ \{P_{0}\}

such that the line segment

\bar{P_{0} Q}

is contained in X. Indeed, this is the most general case in which a linear operator can linearize a function at a point (at least on a line segment to which this point belongs). The linearization space is then a one-dimensional vector subspace of

R^{n}

which is also the smallest vector subspace on which it is interesting to consider and specify a linear operator.

In the history of modern mathematics one can find some other issues or (overlooked) problems of mathematical analysis like this one [8], where we take for granted some traditional approaches, common requirements and (sometimes wrong) conclusions. Concerning differentiability, one can find in the literature some generalizations of differentiability (derivability) such as the fractional derivative [9] or the derivative at the endpoints of a segment [5]. In this paper, we provide a natural generalization of differentiability of a function by defining it at some non-interior points of the domain of function. These points include not only the boundary points of the domain, but also all points in which the notion of differentiability and linearization is meaningful. For this generalized case, a corresponding calculus (techniques of differentiation) is also provided.

2. Preliminaries

In this paper we use the notation

(∣)

for the Euclidean scalar product on

R^{n}

, the notation

∣ ∣ ∣ ∣

for the Euclidean norm and the notation d for the Euclidean metric. We use the notation O for the point

(0, \dots, 0) \in R^{n}

or we simply write

0 \in R^{n} .

Let

Ω \subseteq R^{n}

be an open set,

f : Ω \to R^{m}

a function, and

P_{0} = (x_{1}^{0}, \dots, x_{n}^{0})

an arbitrary point in

Ω

. To approximate the function f on the open ball

B (P_{0}, r) \subseteq Ω,

r > 0,

at the point

P_{0}

with the special affine function

α : R^{n} \to R^{m}

α (H) = f (P_{0}) + A (H)

means to find a linear operator

A : R^{n} \to R^{m}

[10] such that

f (P_{0} + H) \sim α (H)

for any

H \in B (O, r) \subseteq R^{n}

. Geometrically interpreted, in the case of

m = 1

this means that we want to replace the part of the graph of the function f at the point

(P_{0}, f (P_{0}))

by the part of the graph of the affine function

α (x_{1}, \dots, x_{n}) = f (P_{0}) + a_{1} x_{1} + \dots + a_{n} x_{n}, a_{i} \in R, i = 1, \dots n,

i.e., the part of the hyperplane in

R^{n + 1}

. The desirable property of such an approximation is that it is as accurate as possible at points closer to the point

P_{0}

, i.e., that the error

r (H) : = f (P_{0} + H) - α (H) = f (P_{0} + H) - f (P_{0}) - A (H)

tends to zero as H tends to zero. However, if f is a continuous function, then the error

r (H)

always tends to zero as H tends to zero (because every linear operator acting between finite-dimensional vector spaces is continuous). This would mean that there is an adequate local replacement by the affine function of any continuous mapping, which is not the case. For example, if we consider the function

f : R^{2} \to R

f (x, y) = \sqrt{x^{2} + y^{2}}

, it is easy to see that on the open ball

B ((0, 0), ε)

we cannot approximate this function by an affine function, i.e., we cannot replace its graph well enough by a part of the plane passing through the origin

O \in R^{3}

, although this is perfectly possible on all rays starting in O. Thus, it is not only necessary that the error

r (H)

can be made arbitrarily small (because every continuous function has this property), but even more so that the relative error

\frac{r (H)}{||H||}

can be made arbitrarily small, which leads us to the definition of differentiability of the function f at the point

P_{0}

, which is as follows [3]:

Let

Ω \subseteq R^{n}

be an open set. A function

f : Ω \to R^{m}

is differentiable at a point

P_{0} \in Ω

if there exists a linear operator

A : R^{n} \to R^{m}

such that the limit

\lim_{H \to 0} \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||}

exists and is equal to

0 \in R^{m} .

We then call the linear operator A the differential of the function f at the point

P_{0}

, it is unique and we denote it by

d f (P_{0}) .

A linear operator A is the differential of the function f at the point

P_{0}

if and only if

f (P_{0} + H) - f (P_{0}) = A (H) + r (H)

where

r : B (0, ε) \to R^{m}

is the error function with the property

\lim_{H \to 0} \frac{r (H)}{||H||} = 0 and B (P_{0}, ε) \subseteq Ω .

3. Linearization of Function

Definition 1.

Let

X \subseteq R^{n}

be a set and

P_{0} \in X .

We say that the point

P_{0}

admits a neighborhood ray (or simply admits a nbd ray) in X if there exists

H \in R^{n} \ \{0\}

such that the line segment

\bar{P_{0} P_{0} + H}

is contained in X.

This notion is of particular importance to us because we will consider the linearization of a function exactly at points in a domain that admit at least one nbd ray in the domain (and not, as before, only at points from its interior).

Example 1.

(a): No point of a sphere admits a nbd ray in it.
(b): Every point of a non-trivial convex set admits a nbd ray in that set.

Since every line segment is a convex set and every nontrivial convex set contains the line segment between any two of its points, a point

P_{0} \in X \subseteq R^{n}

admits a nbd ray in X if and only if there exists a nontrivial convex set

K \subseteq R^{n}

such that

P_{0} \in K \subseteq X

.

Definition 2.

Let

X \subseteq R^{n}

and

P_{0} \in X

be a point admitting nbd ray in X. The set

Δ_{X, P_{0}} : = \{H \in R^{n} \ \{0\} ∣ \bar{P_{0} P_{0} + H} \subseteq X\}

is called the set of linear contributions at

P_{0}

in X, and its linear hull [10]

Σ_{X, P_{0}} : = [Δ_{X, P_{0}}]

is said to be the linearization space at

P_{0}

with respect to X.

For a function

f : X \to R^{m}

and a point

P_{0} \in X

we say that

Σ_{X, P_{0}}

is the linearization space of the function f at the point

P_{0}

.

Example 2.

(a): Let the points $P, Q, R \in R^{n}$ be in general position, i.e., let them be the three non-collinear points. Then it holds

$\begin{matrix} Δ_{\bar{P Q}, P} & = \{t (Q - P) ∣ t \in 〈0, 1]\} and \\ Σ_{\bar{P Q}, P} & = \{t (Q - P) ∣ t \in R\}, \end{matrix}$

$\begin{matrix} Δ_{\bar{P Q} \cup \bar{P R}, P} & = (\bar{O (Q - P)} \cup \bar{O (R - P)}) \ \{O\} and \\ Σ_{\bar{P Q} \cup \bar{P R}, P} & = \{α (Q - P) + β (R - P) ∣ α, β \in R\} . \end{matrix}$
(b): Let $D = \{P \in R^{n} ∣ ||P - P_{0}|| \leq r\}$ and $Q \in D$ . Then it holds

$Δ_{D, Q} = \{P - Q ∣ P \in D \ \{Q\}\} a n d$

$Σ_{D, Q} = R^{n} .$

Let us now generalize the notion of differentiability of a function to points admitting nbd ray in the domain. This will allow us to consider differentiability at points where this was not possible so far.

Definition 3.

Let

X \subseteq R^{n}

and

P_{0} \in X

be a point admitting nbd ray in X. We say that a function

f : X \to R^{m}

is differentiable at

P_{0}

if there exists a linear operator

A : R^{n} \to R^{m}

such that the limit

\lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||}

(2)

exists and is equal to

0 \in R^{m}

. If such a linear operator exists, we call it the differential of the function f at the point

P_{0} .

The function f is differentiable on X if f is differentiable at every point of X.

Notice that if a point

P_{0} \in X

admits nbd ray in

X,

then

P_{0}

is an accumulation point of the set X and then

0 \in R^{n}

is an accumulation point of the set

Δ_{X, P_{0}}

. Indeed, if

\bar{P_{0} P_{0} + H} \subseteq X

then every nbd of

P_{0}

contains some points of the line segment

\bar{P_{0} P_{0} + H} \subseteq X

and consequently every nbd of 0 intersects

Δ_{X, P_{0}}

. Therefore, the limit from the previous definition makes sense to consider. Furthermore, the natural domain

D \subseteq R^{n}

of the function

H \mapsto \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||}

(3)

could be in general a superset of

Δ_{X, P_{0}}

, so it is necessary to emphasize that the limit (2) is considered only on the set

Δ_{X, P_{0}}

(this is the limit of the restriction of the function (3) to the set

Δ_{X, P_{0}}

at 0). Otherwise, the values of the above function at points that do not belong to

Δ_{X, P_{0}}

but are in D and near 0 may affect the existence of the limit of the function (3) at 0, which we do not want to allow. But, if

P_{0} \in Int X

then it holds

\lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||} = \lim_{H \to 0} \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||},

which is a consequence of the following theorem:

Theorem 1.

Let

f : X \to R^{m},

X \subseteq R^{n},

Y \subseteq X

and

P_{0}

be an accumulation point of the set

Y .

Let U be an open neighborhood of the point

P_{0}

in

R^{n}

such that

(U \ \{P_{0}\}) \cap X \subseteq Y .

If the restriction

{f |}_{Y} : Y \to R^{m}

has the limit at the point

P_{0}

, then f has the limit at

P_{0}

and they are equal, i.e.,

\lim_{P_{0}} (f |_{Y}) = \lim_{P_{0}} f

.

Proof.

Let

Q_{0} : = \lim_{P_{0}} (f |_{Y})

and let

B (Q_{0}, ε)

be an open ball in

R^{m}

. Then there exists an open neighborhood V of

P_{0}

in

R^{n}

such that

V \subseteq U

and

{f |}_{Y} (V \cap (Y \ \{P_{0}\})) \subseteq B (Q_{0}, ε)

. Hence,

f (V \cap (X \ \{P_{0}\})) = f (V \cap (Y \ \{P_{0}\})) {= f |}_{Y} (V \cap (Y \ \{P_{0}\})) \subseteq B (Q_{0}, ε)

which implies that

\lim_{P_{0}} f = Q_{0} .

□

Therefore, in the above definition of differentiability of a function at a point, we can omit the notation of the restriction in the limit if this point belongs to the interior of the domain. In this case, the above definition coincides with the previously known definition of this notion. Thus, the Definition 3 is a natural generalization of the notion of differentiability and this generalization brings many advantages and solves many contentious issues and problems (e.g., at the boundary points of a domain…), which we will explain hereinafter with several various examples.

From the definition of differentiability, it follows that the linear operator

A : R^{n} \to R^{m}

is the differential of a function

f : X \to R^{m}

at a point

P_{0} \in X \subseteq R^{n}

if and only if

f (P_{0} + H) - f (P_{0}) = A (H) + r (H),

where

r : Δ_{X, P_{0}} \to R^{m}

is the error function with the property

\lim_{H \to 0} \frac{r (H)}{||H||} = 0

. Notice that the above equation makes sense only on

Δ_{X, P_{0}}

, i.e., only for a sufficiently small neighborhood U of the point

0 \in R^{n}

[2] we can write

f (P_{0} + H) - f (P_{0}) \sim A (H)

for

H \in U \cap Δ_{X, P_{0}}

. Likewise, the linear operator A, although defined on

R^{n}

, has its true meaning from the point of view of approximating the function f only on the linearization space

Σ_{X, P_{0}} \subseteq R^{n}

.

It is important to notice that the differential of a function at a point need not be unique (which could not be the case so far). Indeed, if the linearization space

Σ_{X, P_{0}}

is a proper subset of

R^{n}

and if there exists a differential

A : R^{n} \to R^{m}

of the function f at

P_{0},

then every linear operator

B : R^{n} \to R^{m}

that coincides with A on the subspace

Σ_{X, P_{0}}

(and there are infinitely many of them) is the differential of the function f at

P_{0}

because it satisfies the conditions of the definition of differentiability. Let us formalize this consideration by the following statement:

Proposition 1.

Let

f : X \to R^{m}

,

X \subseteq R^{n}

and

P_{0} \in X

be a point admitting a nbd ray in X. If the function f is differentiable at

P_{0}

and

A : R^{n} \to R^{m}

is the differential of the function f at the point

P_{0}

then every linear operator

B : R^{n} \to R^{m}

which agrees with A on the vector space

Σ_{X, P_{0}}

is also the differential of the function f at the point

P_{0}

.

Proof.

Using equality

{A |}_{Σ_{X, P_{0}}} {= B |}_{Σ_{X, P_{0}}}

it is easy to check that

\lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||} = 0 = \lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - B (H)}{||H||}

holds. □

Now, we will show that all differentials of f at

P_{0}

are equal on the linearization space

Σ_{X, P_{0}} .

Theorem 2.

Let

f : X \to R^{m}

,

X \subseteq R^{n}

and

P_{0} \in X

be a point admitting a nbd ray in X. If the differential of the function f exists at the point

P_{0}

then it is unique on the vector space

Σ_{X, P_{0}}

.

Proof.

Suppose that

A, B : R^{n} \to R^{m}

are two linear operators for which

\lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||} = 0 = \lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - B (H)}{||H||} .

Then it holds

\lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{B (H) - A (H)}{||H||} = 0 .

Every vector

H \in Σ_{X, P_{0}}

can be written as a linear combination of vectors from

Δ_{X, P_{0}}

. Therefore,

{A |}_{Σ_{X, P_{0}}} {= B |}_{Σ_{X, P_{0}}}

if and only if

A (H) = B (H)

for every

H \in Δ_{X, P_{0}} .

If

H \in Δ_{X, P_{0}}

then

t H \in Δ_{X, P_{0}}

for every

t \in 〈0, 1]

and

\lim_{t \to 0^{+}} \frac{B (t H) - A (t H)}{||t H||} = 0,

so it follows

0 = \lim_{t \to 0^{+}} \frac{(B - A) (t H)}{||t H||} = \lim_{t \to 0^{+}} \frac{t (B - A) (H)}{|t| ||H||} = \frac{(B - A) (H)}{||H||} .

Therefore,

A (H) = B (H)

for every

H \in Δ_{X, P_{0}}

and it holds

{A |}_{Σ_{X, P_{0}}} {= B |}_{Σ_{X, P_{0}}} .

□

Remark 1.

One might think that the cases where the linearization space

Σ_{X, P_{0}}

is a proper subset of

R^{n}

and the differential of the function f exists at the point

P_{0}

cause certain difficulties because the differential is not unique, but it is unique where it should be, i.e., on the linearization space

Σ_{X, P_{0}}

. According to the previous theorem, all differentials of the function f at the point

P_{0}

coincide in the space

Σ_{X, P_{0}}

and this is the only thing that is important for us because only in this space we can use the differential to approximate the function f at the point

P_{0}

.

Corollary 1.

Let

f : X \to R^{m},

X \subseteq R^{n},

P_{0} \in X

be a point admitting nbd ray in X and

Σ_{X, P_{0}} = R^{n} .

If the differential of the function f exists at the point

P_{0}

, then it is unique.

Proof.

This follows from Theorem 2 and Proposition 1. □

If the differential of a function

f : X \to R^{m}

,

X \subseteq R^{n}

, exists at a point

P_{0} \in X

and is unique, we denote it by

d f (P_{0}) .

If

f : R^{n} \to R^{m}

is a linear operator then f is differentiable on

R^{n}

and

d f (P) = f

at any point

P \in R^{n}

. In particular, the projection map

p_{i} : R^{n} \to R

,

i = 1, \dots, n

, is a linear operator and

d p_{i} (P) = p_{i}

for every

P \in R^{n}

. Usually

d p_{i} (P)

is denoted by

d x_{i} .

An affine mapping

f : R^{n} \to R^{m}

f (P) = P_{0} + A (P),

where

P_{0} \in R^{m}

and

A : R^{n} \to R^{m}

is a linear operator, is differentiable on

R^{n}

and

d f (P) = A

for every point

P \in R^{n}

.

Example 3.

Let

H = (1, 0) \in R^{2}

and

f : \bar{O H} \to R

f (x, y) = 3 x

. Since f is the restriction of the linear operator

A : R^{2} \to R

A (x, y) = 3 x

on the convex set

\bar{O H},

f is differentiable at any point of the domain and the differential at any point is equal to A. The linearization space of the function f at any point of the domain is

Σ = R \times \{0\}

and since it is a 1-dimensional subspace of

R^{2},

the differential of f is not unique. Moreover, all linear operators

R^{2} \to R

, represented by a matrix

[\begin{matrix} 3 & p \end{matrix}]

,

p \in R,

are all its different differentials. However, according to the previous theorem, the restriction of all these differentials on Σ is the same.

Notice that according to the traditional definition of differentiability, this function would not be differentiable at any point in its domain. On the other hand, the function f is perfectly linearized since its graph is

\bar{O T} \subseteq R^{3}

,

T = (1, 0, 3),

and it would be incorrect to say that it cannot be linearized (since its graph is perfectly linearized by the part of the line

O T

). However, since for functions whose domain is a subset of

R^{2}

the graph is linearized by part of the plane, we can do that in infinitely many ways, since the entire pencil of planes passes through the line

O T

, so its linearization is not unique. However, if we take the set

\bar{O H} \cup \bar{O H_{1}}

,

H_{1} = (0, 1) \in R^{2}

for the domain of the function f, then

Σ_{\bar{O H} \cup \bar{O H_{1}}, O} = R^{2}

, and by the previous theorem the differential of the function f at

O \in R^{2}

is unique, i.e., its linearization is the part of the unique plane passing through the line

O T

and

O T_{1}

,

T_{1} = (0, 1, 0)

,

O \in R^{3}

(the graph of the function f is the set

\bar{O T} \cup \bar{O T_{1}}

, which is a part of this plane).

Corollary 2.

Let

f : X \to R^{m},

X \subseteq R

and

x_{0} \in X

be a point admitting nbd ray in X. If a differential of the function f exists at the point

x_{0},

then it is unique.

Proof.

Since

Σ_{X, x_{0}} = R

, the statement follows from the previous corollary. □

Example 4.

Let us consider the function

f : D \to R

f (x) = \sqrt{y - x^{3}}

,

D = \{(x, y) \in R^{2} ∣ y \geq x^{3}\}

from the introduction. Since for

(0, 0) \in R^{2}

it holds

Δ_{D, (0, 0)} = (\{(x, y) \in R^{2} ∣ x \leq 0, y \geq 0\} \cup \{(x, y) \in R^{2} ∣ y \geq x^{3}, x > 0\}) \ \{(0, 0)\}

and

\lim_{\begin{matrix} (h_{1}, h_{2}) \to (0, 0) \\ (h_{1}, h_{2}) \in Δ_{D, (0, 0)} \end{matrix}} \frac{\sqrt{h_{2} - h_{1}^{3}} - \sqrt{0} - O (h_{1}, h_{2})}{∥(h_{1}, h_{2})∥} = 0,

where

O : R^{2} \to R

denotes the zero operator, f is differentiable at the point

0 .

Moreover, it follows from

Σ_{_{D, 0}} = R^{2}

that the zero operator is the unique differential of f at the point

(0, 0)

, i.e.,

d f (0, 0) = 0 .

Definition 4.

Let

P_{0} \in X \subseteq R^{n}

and

V \in R^{n} \ \{0\}

. We say that a point

P_{0}

admits a neighborhood ray in X in the direction of V if there exists

λ_{0} \in R^{+}

such that

\bar{P_{0} P_{0} + λ_{0} V} \subseteq X .

Proposition 2.

Let

Ω \subseteq R^{n}

be an open set. Every point

P_{0} \in Ω

admits a nbd ray in Ω in the direction of all vectors

H \in R^{n} \ \{0\}

and

Σ_{Ω, P_{0}} = R^{n} .

Proof.

Let

P_{0} \in Ω

and let

H \in R^{n} \ \{0\}

be arbitrary. Since

Ω

is open, there exists a ball

B (P_{0}, r) \subseteq Ω,

and since a ball is a convex set,

\bar{P_{0} P_{0} + \frac{r}{2} \frac{H}{||H||}} \subseteq B (P_{0}, r)

holds. Therefore,

P_{0}

admits nbd ray in

B (P_{0}, r)

in the direction of H and then admits it in

Ω

. Furthermore, from

Δ_{B (P_{0}, r), P_{0}} \subseteq Δ_{Ω, P_{0}}

it follows that

Σ_{B (P_{0}, r), P_{0}} \subseteq Σ_{Ω, P_{0}}

and

\frac{r}{2} \frac{H}{||H||} \in Δ_{B (P_{0}, r), P_{0}}

implies

H \in Σ_{B (P_{0}, r), P_{0}}

for every

H \in R^{n} \ \{0\} .

So,

Σ_{B (P_{0}, r), P_{0}} = R^{n}

and then

Σ_{Ω, P_{0}} = R^{n} .

□

Corollary 3.

If

Ω \subseteq R^{n}

is an open set and

f : Ω \to R^{m}

is differentiable at a point

P_{0} \in Ω,

then the differential of the function f at the point P is unique.

Proof.

This follows from the previous proposition and corollary 1. □

Remark 2.

We have already mentioned that the new definition of differentiability (Definition 3) coincides with the well-known definition of this notion when the domain of a function is an open set. It follows that in this particular case all previously known properties of differentials hold, including the property of uniqueness. However, the new theory induced by the extended definition of differentiability provides the proof of the uniqueness of the differential of an open domain function without relying on prior general knowledge of it.

Proposition 3.

Let

f : X \to R^{m},

X \subseteq R^{n},

Y \subseteq X

and

P_{0} \in Y

be a point admitting nbd ray in Y. If f is differentiable at

P_{0}

then

{f |}_{Y}

is differentiable at

P_{0}

and the differentials of the functions f and

{f |}_{Y}

at

P_{0}

coincide on

Σ_{Y, P_{0}}

.

Proof.

Since the function f is differentiable at

P_{0}

, there exists a linear operator

A : R^{n} \to R^{m}

such that

\lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||} = 0 .

Now,

Δ_{Y, P_{0}} \subseteq Δ_{X, P_{0}}

implies

0 = \lim_{\begin{matrix} H \to 0 \\ H \in Δ_{Y, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||} = \lim_{\begin{matrix} H \to 0 \\ H \in Δ_{Y, P_{0}} \end{matrix}} \frac{{f |}_{Y} (P_{0} + H) {- f |}_{Y} (P_{0}) - A (H)}{||H||}

from which it follows that the function

{f |}_{Y}

is differentiable at

P_{0}

and that the linear operator A is its differential at

P_{0}

. Now, by the Theorem 2, we conclude that every other differential at

P_{0}

coincides with A on

Σ_{Y, P_{0}}

. □

The converse does not hold, i.e., if the restriction of a function

f : X \to R^{m},

X \subseteq R^{n}

, to a subset

Y \subseteq X

is differentiable at a point

P_{0} \in Y,

then in general the function f need not be differentiable at that point. This will be shown by the following counterexample. But we will prove that if Y is open in

X,

then differentiability on Y implies differentiability on X.

Example 5.

Let

f : R^{2} \to R

f (x, y) = \sqrt{x^{2} + y^{2}} .

Let us consider the restrictions of the function f to sets

X_{1} = \{(0, y) ∣ y \in [0, \infty〉\} and X_{2} = \{(0, y) ∣ y \in 〈- \infty, 0]\} .

For functions

f_{1} : = f {|_{X_{1}} f_{1} (0, y) = y and f_{2} : = f |}_{X_{2}} f_{2} (0, y) = - y,

the linearization space at each point in their domains is

Σ = \{(0, y) ∣ y \in R\}

. The function

f_{1}

is the restriction of the linear operator

p_{2}

to the convex set

X_{1}

, so

p_{2}

is the differential of the function

f_{1}

at any point in

X_{1}

(not unique because the dimension of Σ is less than

2,

but they all coincide on Σ). Similarly, the differential of the function

f_{2}

at every point in

X_{2}

is

- p_{2}

. If the function f were differentiable at the point

0 \in R^{2},

then, by Proposition 3 and Theorem 2, the differentials of the functions

f_{1}

and

f_{2}

at the point 0 would coincide on Σ which is obviously not the case.

Theorem 3.

Let

f : X \to R^{m}

,

X \subseteq R^{n}

,

P_{0} \in X

and U be a neighborhood of the point

P_{0}

in

R^{n}

. If

P_{0}

admits a nbd ray in

U \cap X

and if

{f |}_{U \cap X}

is differentiable at

P_{0}

then f is also differentiable at

P_{0}

.

Proof.

Since U is a neighborhood of the point

P_{0}

in

R^{n}

, there exists

r \in R^{+}

such that

B (P_{0}, r) \subseteq U

and then

B (P_{0}, r) \cap X \subseteq U \cap X

. Therefore,

B (0, r) \cap Δ_{X, P_{0}} \subseteq Δ_{U \cap X, P_{0}} .

Due to differentiability of the function

{f |}_{U \cap X}

, there exists a linear operator

A : R^{n} \to R^{m}

such that

0 = \lim_{\begin{matrix} H \to 0 \\ H \in Δ_{U \cap X, P_{0}} \end{matrix}} \frac{{f |}_{U \cap X} (P_{0} + H) {- f |}_{U \cap X} (P_{0}) - A (H)}{||H||} = \lim_{\begin{matrix} H \to 0 \\ H \in Δ_{U \cap X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||} .

Since

Δ_{U \cap X, P_{0}} \subseteq Δ_{X, P_{0}}

and

B (0, r) \cap Δ_{X, P_{0}} \subseteq Δ_{X, P_{0}}

, by Theorem 1, it follows

\lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - A (H)}{||H||} = 0,

which implies that f is differentiable at

P_{0} .

□

The following statement follows from the previous theorem.

Corollary 4.

Let

f : X \to R^{m}

,

X \subseteq R^{n}

,

Ω \subseteq X

be an open set in

R^{n}

and

P_{0} \in Ω

. If

{f |}_{Ω}

is differentiable at

P_{0}

then f is also differentiable at

P_{0}

and

d f (P_{0}) {= d f |}_{Ω} (P_{0})

.

We will show now that differentiability does not imply continuity in general (which cannot be the case for a function with an open domain).

Example 6.

Let

P_{n} = (0, \frac{1}{n}),

Q_{n} = (1, \frac{1}{n}) \in R^{2}

,

n \in N

, and

X = ⋃_{n \in N} \bar{P_{n} Q_{n}} \cup \bar{(0, 0) (1, 0)} \subseteq R^{2} .

Let us consider the function

f : X \to R

f (P) = \{\begin{matrix} n, & P \in \bar{P_{n} Q_{n}} \\ 0, & P \in \bar{(0, 0) (1, 0)} . \end{matrix}

For the point

O = (0, 0) \in X

it holds

Δ_{X, O} = 〈0, 1] \times \{0\} and Σ_{X, O} = R \times \{0\} .

The function f is differentiable at the point O (it is differentiable at every point of its domain and the zero operator is one of its differentials), but f is discontinuous at all points of the line segment

\bar{(0, 0) (1, 0)}

, so it is discontinuous at O.

In this example, the dimension of the linearization space

Σ_{X, O}

is less than the dimension of the whole space

R^{2}

. However, even if the dimension of a linearization space is equal to the dimension of the whole space

R^{n}

, a function need not be continuous. This is shown by the following counterexample.

Example 7.

Let

S^{1} \subseteq R^{2}

be the 1-sphere and

P_{0}, Q, R \in S^{1}

three distinct points on it. Consider the union of two circular arcs and their corresponding chords

X = \hat{P_{0} Q} \cup \hat{P_{0} R} \cup \bar{P_{0} Q} \cup \bar{P_{0} R}

. The function

f : X \to R

f (x, y) = \{\begin{matrix} 0, (x, y) \in \bar{P_{0} Q} \cup \bar{P_{0} R} \\ 1, (x, y) \in (\hat{P_{0} Q} \cup \hat{P_{0} R}) \ \{Q, R, P_{0}\} \end{matrix}

is differentiable at

P_{0} .

Namely,

Δ_{X, P_{0}} = (\bar{O (Q - P_{0})} \cup \bar{O (R - P_{0})}) \ \{O\}

and

\lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0})}{||H||} = 0 .

Since

Σ_{X, P_{0}} = R^{2}

, the differential is unique and

d f (P_{0})

is the zero operator. But the function f is discontinuous at

P_{0}

because

\lim_{\begin{matrix} P \to P_{0} \\ P \in \bar{P_{0} Q} \cup \bar{P_{0} R} \end{matrix}} f (P) = 0

and

\lim_{\begin{matrix} P \to P_{0} \\ P \in \hat{P_{0} Q} \cup \hat{P_{0} R} \end{matrix}} f (P) = 1

hold.

To ensure that differentiability of a function at a point implies continuity at that point, we need an additional condition, which is introduced in the following definition.

Definition 5.

Let

X \subseteq R^{n}

and

P_{0} \in X

be a point admitting nbd ray in X. A neighborhood U of the point

P_{0}

in X is said to be raylike neighborhood of the point

P_{0}

in X provided

\bar{P_{0} P} \subseteq U

holds for every

P \in U

. If there exists at least one raylike nbd in X of the point

P_{0},

we say that the point

P_{0}

admits raylike nbd in X.

It is easy to see that every point of a non-trivial convex set admits raylike nbd in that set, and then every point of the open set

Ω \subseteq R^{n}

admits raylike nbd in

Ω

.

Theorem 4.

Let a point

P_{0} \in X \subseteq R^{n}

admits raylike nbd in X. If

f : X \to R^{m}

is differentiable at

P_{0}

then it is also continuous at

P_{0}

.

Proof.

Let U be a raylike nbd of the point

P_{0}

in X. Then

U - P_{0} = \{P - P_{0} : P \in U\}

is a neighborhood of the point

O \in R^{n}

in

Δ_{X, P_{0}} \cup \{O\}

. To prove that f is continuous at

P_{0}

it suffices to prove that

{f |}_{U}

is continuous at

P_{0}

, i.e., that

\lim_{\begin{matrix} H \to 0 \\ H \in U - P_{0} \end{matrix}} f (P_{0} + H) = f (P_{0}) .

By the assumed differentiability, there exists a linear operator

A : R^{n} \to R^{m}

such that

f (P_{0} + H) - f (P_{0}) = A (H) + r (H)

(4)

where

r : Δ_{X, P_{0}} \to R^{m}

is an error function with the property

\lim_{H \to 0} \frac{r (H)}{||H||} = 0

. Then

\lim_{H \to 0} r (H) = 0 .

Since

U - P_{0} \subseteq Δ_{X, P_{0}} \cup \{0\}

,

\lim_{\begin{matrix} H \to 0 \\ H \in U - P_{0} \end{matrix}} r (H) = \lim_{H \to 0} r (H) = 0 .

Every linear operator operating between finite dimensional vectorial spaces is continuous, therefore

0 = A (0) = \lim_{H \to 0} A (H) = \lim_{\begin{matrix} H \to 0 \\ H \in U - P_{0} \end{matrix}} A (H) .

Hence, by (4), it follows

\lim_{\begin{matrix} H \to 0 \\ H \in U - P_{0} \end{matrix}} f (P_{0} + H) = f (P_{0}) .

□

Obviously, if

Ω \subseteq R^{n}

is an open set,

f : Ω \to R^{m}

,

P_{0} \in Ω

and f is differentiable at

P_{0},

then f is also continuous at

P_{0}

. Thus, if the domain of a function is an open set, differentiability implies continuity. The same is true for any convex domain.

4. Partial and Directional Derivatives

Definition 6.

Let

X \subseteq R^{n}

,

n \geq 2

,

V \in R^{n} \ \{0\}

and

P_{0} \in X

be a point admitting nbd ray in X in the direction of the vector V. The set

Δ_{V (X, P_{0})} : = \{t V ∣ t \in R\} \cap Δ_{X, P_{0}}

is called the set of linear contributions at

P_{0}

in the direction of V into X.

Let us denote

\begin{matrix} h_{1} & : = \inf \{t \in R ∣ t V \in Δ_{X, P_{0}}\}, \\ h_{2} & : = \sup \{t \in R ∣ t V \in Δ_{X, P_{0}}\}, \end{matrix}

(5)

where

h_{1} = - \infty

(h_{2} = \infty)

, provided the set

\{t \in R ∣ t V \in Δ_{X, P_{0}}\}

is not bounded from below (above). Let

X_{V, P_{0}}

denotes the largest convex subset of the set

X \cap P_{0} P_{0} + V

containing the point

P_{0}

(

P_{0} P_{0} + V

denotes the line passing through the points

P_{0}

and

P_{0} + V

). If

h_{1}, h_{2} \in R

then

X_{V, P_{0}} = X \cap \bar{(P_{0} + h_{1} V) (P_{0} + h_{2} V)}

(it is the line segment with or without boundary points), otherwise

X_{V, P_{0}}

is the half-line or the line

P_{0} P_{0} + V

.

Definition 7.

Let a point

P_{0} \in X \subseteq R^{n}

admit nbd ray in X in the direction of

V \in R^{n} \ \{0\}

. We say that a function

f : X \to R

has the derivative at

P_{0}

in the direction of V if there exists

\lim_{\begin{matrix} h \to 0 \\ h V \in Δ_{V (X, P_{0})} \end{matrix}} \frac{f (P_{0} + h V) - f (P_{0})}{h} .

This limit, if it exists, is denoted by

\partial_{V} f (P_{0})

and is called the derivative of f at

P_{0}

in the direction of

V .

The derivative at

P_{0}

in the direction of

e_{i}

(

e_{i}

is the i-th basis vector of the standard ordered basis for

R^{n}

) is called i-th partial derivative of f at

P_{0}

and is denoted by

\partial_{i} f (P_{0}) .

Notice that, for

P_{0} = (x_{1}^{0}, \dots, x_{n}^{0})

,

\partial_{i} f (P_{0}) = \lim_{\begin{matrix} h \to 0 \\ h \in 〈h_{1}, h_{2}〉 \ \{0\} \end{matrix}} \frac{f (x_{1}^{0}, \dots, x_{i - 1}^{0}, x_{i}^{0} + h, x_{i + 1}^{0}, \dots, x_{n}^{0}) - f (x_{1}^{0}, \dots, x_{i}^{0}, \dots, x_{n}^{0})}{h}

holds. If

h_{1} < 0

, by Theorem 1, it follows

\partial_{i} f (P_{0}) = \lim_{h \to 0} \frac{f (x_{1}^{0}, \dots, x_{i - 1}^{0}, x_{i}^{0} + h, x_{i + 1}^{0}, \dots, x_{n}^{0}) - f (x_{1}^{0}, \dots, x_{i}^{0}, \dots, x_{n}^{0})}{h} .

Theorem 5.

Let

f : X \to R

,

X \subseteq R^{n}

,

V \in R^{n} \ \{0\}

and

P_{0} \in X

be a point admitting nbd ray in X in the direction of V. The function f has the derivative at

P_{0}

in the direction of V if and only if its restriction

{f |}_{X_{V, P_{0}}}

is differentiable at

P_{0} .

The value of each differential of the function

{f |}_{X_{V, P_{0}}}

at

P_{0},

at V is

\partial_{V} f (P_{0}) .

Proof.

Suppose f has the derivative at

P_{0}

in the direction of

V .

If

\{V, v_{1}, \dots, v_{n - 1}\}

is some basis for

R^{n},

then any linear operator

A : R^{n} \to R

, given by

A (V) = \partial_{V} f (P_{0}), A (v_{i}) = a_{i}

for some real numbers

a_{i}

,

i = 1, \dots, n - 1

, is a differential of the function

{f |}_{X_{V, P_{0}}} .

Namely,

\lim_{\begin{matrix} h \to 0^{+} \\ h \in 〈h_{1}, h_{2}〉 \ \{0\} \end{matrix}} \frac{{f |}_{X_{V, P_{0}}} (P_{0} + h V) {- f |}_{X_{V, P_{0}}} (P_{0}) - h \partial_{V} f (P_{0})}{|h|}

= \lim_{\begin{matrix} h \to 0^{+} \\ h \in 〈h_{1}, h_{2}〉 \ \{0\} \end{matrix}} \frac{f (P_{0} + h V) - f (P_{0}) - \partial_{V} f (P_{0}) h}{h} =

\lim_{\begin{matrix} h \to 0^{+} \\ h \in 〈h_{1}, h_{2}〉 \ \{0\} \end{matrix}} \frac{f (P_{0} + h V) - f (P_{0})}{h} - \partial_{V} f (P_{0}) =

\lim_{\begin{matrix} h \to 0 \\ h \in 〈h_{1}, h_{2}〉 \ \{0\} \end{matrix}} \frac{f (P_{0} + h V) - f (P_{0})}{h} - \partial_{V} f (P_{0}) = 0 .

Similarly,

\lim_{\begin{matrix} h \to 0^{-} \\ h \in 〈h_{1}, h_{2}〉 \ \{0\} \end{matrix}} \frac{{f |}_{X_{V, P_{0}}} (P_{0} + h V) {- f |}_{X_{V, P_{0}}} (P_{0}) - h \partial_{V} f (P_{0})}{|h|} = 0

which implies

\lim_{\begin{matrix} h \to 0 \\ h \in 〈h_{1}, h_{2}〉 \ \{0\} \end{matrix}} \frac{{f |}_{X_{V, P_{0}}} (P_{0} + h V) {- f |}_{X_{V, P_{0}}} (P_{0}) - h \partial_{V} f (P_{0})}{|h|} = 0 .

Therefore,

\lim_{\begin{matrix} H \to 0 \\ H \in Δ_{V (X, P_{0})} \end{matrix}} \frac{{f |}_{X_{V, P_{0}}} (P_{0} + H) {- f |}_{X_{V, P_{0}}} (P_{0}) - A (H)}{||H||} =

= \lim_{\begin{matrix} h \to 0 \\ h \in 〈h_{1}, h_{2}〉 \ \{0\} \end{matrix}} \frac{{f |}_{X_{V, P_{0}}} (P_{0} + h V) {- f |}_{X_{V, P_{0}}} (P_{0}) - A (h V)}{|h| ||V||} =

\frac{1}{||V||} \lim_{\begin{matrix} h \to 0 \\ h \in 〈h_{1}, h_{2}〉 \ \{0\} \end{matrix}} \frac{{f |}_{X_{V, P_{0}}} (P_{0} + h V) {- f |}_{X_{V, P_{0}}} (P_{0}) - h \partial_{V} f (P_{0})}{|h|} = 0 .

Hence,

{f |}_{X_{V, P_{0}}}

is differentiable at the point

P_{0} .

Now assume that

{f |}_{X_{V, P_{0}}}

is differentiable at the point

P_{0},

i.e., that there exists a linear operator

A : R^{n} \to R

such that

\lim_{_{\begin{matrix} H \to 0 \\ H \in Δ_{V (X, P_{0})} \end{matrix}}} \frac{{f |}_{X_{V, P_{0}}} (P_{0} + H) {- f |}_{X_{V, P_{0}}} (P_{0}) - A (H)}{||H||} = 0 .

Then

0 = \lim_{\begin{matrix} h \to 0 \\ h \in 〈h_{1}, h_{2}〉 \ \{0\} \end{matrix}} \frac{{f |}_{X_{V, P_{0}}} (P_{0} + h V) {- f |}_{X_{V, P_{0}}} (P_{0}) - h A (V)}{|h| ||V||}

from which follows

\begin{matrix} A (V) & = \lim_{\begin{matrix} h \to 0 \\ h \in 〈h_{1}, h_{2}〉 \ \{0\} \end{matrix}} \frac{{f |}_{X_{V, P_{0}}} (P_{0} + h V) {- f |}_{X_{V, P_{0}}} (P_{0})}{h} \\ = \lim_{\begin{matrix} h \to 0 \\ h \in 〈h_{1}, h_{2}〉 \ \{0\} \end{matrix}} \frac{f (P_{0} + h V) - f (P_{0})}{h} = \partial_{V} f (P_{0}) . \end{matrix}

Thus, f has the derivative at

P_{0}

in the direction of V and it is equal to

A (V)

. □

Corollary 5.

Let

f : X \to R

,

X \subseteq R^{n}

,

V \in R^{n} \ \{0\}

and

P_{0} \in X

be a point admitting nbd ray in X in the direction of V and let f be differentiable at

P_{0} .

Then f has the derivative at

P_{0}

in the direction of V and the value of each differential of the function f at

P_{0},

at V is equal to

\partial_{V} f (P_{0}) .

If

Σ_{X, P_{0}} = R^{n}

then

\partial_{V} f (P_{0}) = d f (P_{0}) (V) .

Proof.

The statement follows from the previous theorem and Proposition 3. □

The converse of this corollary does not hold, i.e., a function f at a point

P_{0}

can have directional derivatives and need not be differentiable at

P_{0}

, as shown in the following example.

Example 8.

The function

f : R^{2} \to R f (x, y) = \{\begin{matrix} \frac{x y}{x^{2} + y^{2}}, (x, y) \neq (0, 0) \\ 0, (x, y) = (0, 0) \end{matrix}

has both partial derivatives at the point

(0, 0)

but f is not continuous at this point, so by Theorem 4 f is not differentiable at

(0, 0) .

It is interesting to note that this function has the derivative at the point

(0, 0)

in the direction of any

V = (v_{1}, v_{2}) \in R^{2} \ \{0\},

and

\partial_{V} f (0, 0) = \lim_{h \to 0} \frac{h v_{1} h v_{2}}{h^{2} v_{1}^{2} + h^{2} v_{2}^{2}} = \frac{v_{1} v_{2}}{v_{1}^{2} + v_{2}^{2}} .

5. Differentiable Functions

5.1. Properties of Differentials

We will now prove the following important results which hold for differentiable functions.

Proposition 4.

Let

X \subseteq R^{n}

,

P_{0} \in X

be a point admitting nbd ray in X,

f, g : X \to R^{m}

be differentiable functions at

P_{0}

, and

A, B : R^{n} \to R^{m}

be differentials of the functions f and g at

P_{0},

respectively. Then the function

λ f + μ g : X \to R^{m}

is differentiable at

P_{0}

for any

λ, μ \in R

and

λ A + μ B

is its differential at

P_{0} .

Proof.

By the differentiability of the functions f and g at

P_{0}

it holds

f (P_{0} + H) - f (P_{0}) = A (H) + r_{1} (H)

and

g (P_{0} + H) - g (P_{0}) = B (H) + r_{2} (H),

for every

H \in Δ_{X, P_{0}} \subseteq R^{n}

, where

r_{1}, r_{2} : Δ_{X, P_{0}} \to R^{m}

are the functions with the property

\lim_{H \to 0} \frac{r_{1} (H)}{||H||} = 0 = \lim_{H \to 0} \frac{r_{2} (H)}{||H||} .

Now, it follows

(λ f + μ g) (P_{0} + H) - (λ f + μ g) (P_{0}) = λ (f (P_{0} + H) - f (P_{0})) + μ (g (P_{0} + H) - g (P_{0}))

= λ (A (H) + r_{1} (H)) + μ (B (H) + r_{2} (H)) =

= λ A (H) + μ B (H) + λ r_{1} (H) + μ r_{2} (H) =

= (λ A + μ B) (H) + λ r_{1} (H) + μ r_{2} (H),

for every

H \in Δ_{X, P_{0}}

. Since for the function

r (H) = λ r_{1} (H) + μ r_{2} (H)

it holds

\lim_{H \to 0} \frac{r (H)}{||H||} = 0

, the function

λ f + μ g

is differentiable at

P_{0}

and the linear operator

λ A + μ B

is its differential at this point. □

Proposition 5.

Let

X \subseteq R^{n}

,

P_{0} \in X

be a point admitting nbd ray in X,

α : X \to R

and

f : X \to R^{m}

be differentiable functions at

P_{0},

and

A : R^{n} \to R

and

B : R^{n} \to R^{m}

be differentials of the functions α and f at

P_{0},

respectively. Then the function

α \cdot f : X \to R^{m}

(α \cdot f) (P) = α (P) f (P)

is differentiable at

P_{0}

and

α (P_{0}) B + f (P_{0}) A : R^{n} \to R^{m}

(α (P_{0}) B + f (P_{0}) A) (H) = α (P_{0}) B (H) + A (H) f (P_{0})

is its differential at

P_{0}

.

Proof.

By the differentiability of the functions

α

and f at

P_{0},

it holds

α (P_{0} + H) - α (P_{0}) = A (H) + r_{1} (H)

and

f (P_{0} + H) - f (P_{0}) = B (H) + r_{2} (H),

for every

H \in Δ_{X, P_{0}} \subseteq R^{n}

, where

r_{1} : Δ_{X, P_{0}} \to R

and

r_{2} : Δ_{X, P_{0}} \to R^{m}

are the functions with the properties

\lim_{H \to 0} \frac{r_{1} (H)}{||H||} = 0 and \lim_{H \to 0} \frac{r_{2} (H)}{||H||} = 0 .

Hence

\lim_{H \to 0} r_{1} (H) = 0

and

\lim_{H \to 0} r_{2} (H) = 0 .

Now we infer that

(α \cdot f) (P_{0} + H) - (α \cdot f) (P_{0}) = α (P_{0}) B (H) + f (P_{0}) A (H) +

α (P_{0}) r_{2} (H) + r_{1} (H) f (P_{0}) + (A (H) + r_{1} (H)) (B (H) + r_{2} (H)),

holds, for every

H \in Δ_{X, P_{0}}

. Therefore, it is sufficient to prove that

\lim_{H \to 0} \frac{r (H)}{||H||} = 0

for the function

r : Δ_{X, P_{0}} \to R^{m}

r (H) = α (P_{0}) r_{2} (H) + r_{1} (H) f (P_{0}) + (A (H) + r_{1} (H)) (B (H) + r_{2} (H)) .

From the properties of the functions

r_{1}

and

r_{2}

it follows that

\lim_{H \to 0} \frac{α (P_{0}) r_{2} (H)}{||H||} = 0 = \lim_{H \to 0} \frac{r_{1} (H) f (P_{0})}{||H||} .

Furthermore, by the boundedness of a linear operator, there exists

λ > 0

such that

||A (H)|| \leq λ ||H||,

for every

H \in R^{n}

[11,12]. Therefore, since the linear operator is continuos and its value at zero is equal to zero, by the properties of the errors functions

r_{1}

and

r_{2}

, it follows

\begin{matrix} 0 & \leq & ∥\lim_{H \to 0} \frac{(A (H) + r_{1} (H)) (B (H) + r_{2} (H))}{||H||}∥ \\ = & ∥\lim_{H \to 0} (\frac{A (H)}{||H||} + \frac{r_{1} (H)}{||H||}) (B (H) + r_{2} (H))∥ \\ \leq & ∥\lim_{H \to 0} (λ + \frac{r_{1} (H)}{||H||}) (B (H) + r_{2} (H))∥ = 0 . \end{matrix}

This implies

\lim_{H \to 0} \frac{(A (H) + r_{1} (H)) (B (H) + r_{2} (H))}{||H||} = 0 .

Therefore, the function

α f

is differentiable at

P_{0}

and the linear operator

α (P_{0}) B + f (P_{0}) A

is its differential at this point. □

Proposition 6.

Let

X \subseteq R^{n}

,

P_{0} \in X

be a point admitting nbd ray in X,

f : X \to R

be a differentiable function at

P_{0},

and

A : R^{n} \to R

be the differential of the function f. If f is continuous at

P_{0}

and

f (P_{0}) \neq 0,

then the function

\frac{1}{f}

is differentiable at

P_{0}

and

- \frac{1}{{(f (P_{0}))}^{2}} A : R^{n} \to R

is its differential at

P_{0}

.

Proof.

By the continuity of the function f at

P_{0}

, there exists an open neighborhood O of the point

P_{0}

in X such that

f (P) \neq 0

for every

P \in O

. Since

O = U \cap X

for some neighborhood U of

P_{0}

in

R^{n}

, it suffices to prove that the restriction function

\frac{1}{f} |_{O}

is differentiable at

P_{0}

(Theorem 3). By the assumption,

f (P_{0} + H) - f (P_{0}) = A (H) + r (H)

holds for every

H \in Δ_{O, P_{0}} \subseteq R^{n}

, where

r : Δ_{O, P_{0}} \to R

is the function with the property

\lim_{H \to 0} \frac{r (H)}{||H||} = 0 .

It follows

\frac{1}{f (P_{0} + H)} - \frac{1}{f (P_{0})} = - \frac{f (P_{0} + H) - f (P_{0})}{f (P_{0}) f (P_{0} + H)} = - \frac{A (H) + r (H)}{f (P_{0}) f (P_{0} + H)} =

= - \frac{A (H)}{{(f (P_{0}))}^{2}} + \frac{A (H) (f (P_{0} + H) - f (P_{0})) - f (P_{0}) r (H)}{{(f (P_{0}))}^{2} f (P_{0} + H)},

for every

H \in Δ_{O, P_{0}}

. Thus, it is sufficient to prove the equality

\lim_{H \to 0} \frac{r_{1} (H)}{||H||} = 0

for the function

r_{1} : Δ_{O, P_{0}} \to R

r_{1} (H) = \frac{A (H) (f (P_{0} + H) - f (P_{0})) - f (P_{0}) r (H)}{{(f (P_{0}))}^{2} f (P_{0} + H)} .

Notice that

\lim_{H \to 0} (\frac{r (H)}{||H||} \frac{f (P_{0})}{{(f (P_{0}))}^{2} f (P_{0} + H)}) = 0

holds. By the boundedness of a linear operator, there exists

λ > 0

such that

||A (H)|| \leq λ ||H||,

for every

H \in R^{n}

[11,12]. It implies

0 \leq ||\frac{A (H) (f (P_{0} + H) - f (P_{0}))}{||H|| {(f (P_{0}))}^{2} f (P_{0} + H)}|| \leq λ ||\frac{f (P_{0} + H) - f (P_{0})}{{(f (P_{0}))}^{2} f (P_{0} + H)}|| .

Since f is continuous at

P_{0}

it holds

\lim_{H \to 0} ||\frac{f (P_{0} + H) - f (P_{0})}{{(f (P_{0}))}^{2} f (P_{0} + H)}|| = 0 .

Now, we infer

\lim_{H \to 0} ||\frac{A (H) (f (P_{0} + H) - f (P_{0}))}{||H|| {(f (P_{0}))}^{2} f (P_{0} + H)}|| = 0

and finally

\lim_{H \to 0} \frac{A (H) (f (P_{0} + H) - f (P_{0}))}{||H|| {(f (P_{0}))}^{2} f (P_{0} + H)} = 0 .

This proves that the function

- \frac{A}{{(f (P_{0}))}^{2}}

is the differential of the function

\frac{1}{f}

at the point

P_{0}

. □

Corollary 6.

Let

X \subseteq R^{n}

,

P_{0} \in X

be a point admitting nbd ray in X,

α : X \to R

and

f : X \to R^{m}

be differentiable functions at

P_{0},

and

A : R^{n} \to R

and

B : R^{n} \to R^{m}

be differentials of the functions α and f at

P_{0},

respectively. If α is continuous at

P_{0}

and

α (P_{0}) \neq 0,

then the function

\frac{1}{α} f

is differentiable at

P_{0}

and

\frac{α (P_{0}) B - f (P_{0}) A}{{(α (P_{0}))}^{2}} : R^{n} \to R^{m}

Proof.

This follows from the two previous propositions. □

Let us now prove that the composition of differentiable functions is differentiable.

Theorem 6.

Let

X \subseteq R^{n}

,

Y \subseteq R^{m}

,

f : X \to R^{m}

,

f (X) \subseteq Y,

and

g : Y \to R^{p}

. Let

P_{0} \in X

be a point admitting raylike nbd in X, and let

Q_{0} = f (P_{0})

be the point admitting raylike nbd in Y. If f is differentiable at

P_{0}

and g is differentiable at

Q_{0}

, then the composition

g \circ f : X \to R^{p}

is differentiable at

P_{0}

and

B \circ A

is its differential at the point

P_{0}

, where

A : R^{n} \to R^{m}

and

B : R^{m} \to R^{p}

are differentials at the points

P_{0}

and

Q_{0}

of the functions f and g, respectively.

Proof.

Since f is differentiable at the point

P_{0}

, there exists a linear operator

A : R^{n} \to R^{m}

such that

f (P) - f (P_{0}) = A (P - P_{0}) + r_{1} (P - P_{0})

for each

P \in X

such that

P - P_{0} \in Δ_{X, P_{0}},

and for each

r_{1} : Δ_{X, P_{0}} \to R^{m}

such that

\lim_{H \to 0} \frac{r_{1} (H)}{||H||} = \lim_{P \to P_{0}} \frac{r_{1} (P - P_{0})}{||P - P_{0}||} = 0 .

(6)

Similarly, there exists a linear operator

B : R^{m} \to R^{p}

such that

g (Q) - g (Q_{0}) = B (Q - Q_{0}) + r_{2} (Q - Q_{0})

for each

Q \in Y

such that

Q - Q_{0} \in Δ_{Y, Q_{0}}

, and for each

r_{2} : Δ_{Y, Q_{0}} \to R^{p}

such that

\lim_{H \to 0} \frac{r_{2} (H)}{||H||} = \lim_{Q \to Q_{0}} \frac{r_{2} (Q - Q_{0})}{||Q - Q_{0}||} = 0 .

By the assumption there exists a raylike nbd O of

P_{0}

in X. Since

O = U \cap X

for some neighborhood U of the point

P_{0}

in

R^{n}

, by Theorem 3, it suffices to prove that

{g \circ f |}_{O}

is differentiable at

P_{0}

. Now,

\begin{matrix} (g \circ f) (P) - (g \circ f) (P_{0}) & = B (f (P) - f (P_{0})) + r_{2} (f (P) - f (P_{0})) \\ = B (A (P - P_{0}) + r_{1} (P - P_{0})) + r_{2} (f (P) - f (P_{0})) \\ = B \circ A (P - P_{0}) + B (r_{1} (P - P_{0})) + r_{2} (f (P) - f (P_{0})) \end{matrix}

for every

P \in O

. Therefore, we have to prove that

\lim_{H \to 0} \frac{r (H)}{||H||} = \lim_{P \to P_{0}} \frac{r (P - P_{0})}{||P - P_{0}||} = 0

(7)

for the function

r : (O - P_{0}) \ \{0\} \to R^{p}

defined by

r (H) = B (r_{1} (H)) + r_{2} (f (P_{0} + H) - f (P_{0})) .

By the continuity of a linear operator, it follows that

\lim_{P \to P_{0}} \frac{B (r_{1} (P - P_{0}))}{||P - P_{0}||} = \lim_{P \to P_{0}} B (\frac{r_{1} (P - P_{0})}{||P - P_{0}||}) = B (\lim_{P \to P_{0}} \frac{r_{1} (P - P_{0})}{||P - P_{0}||}) = B (0) = 0 .

It remains to prove

\lim_{P \to P_{0}} \frac{r_{2} (f (P) - f (P_{0}))}{||P - P_{0}||} = 0 .

(8)

Since a linear operator is linearly bounded [11,12], there exists

λ > 0

such that

||A (P - P_{0})|| \leq λ ||P - P_{0}||

for every

P \in R^{n}

. Let

ε > 0

. By the equality (7) we infer that there exists

δ^{'} > 0

such that

B (Q_{0}, δ^{'}) \cap Y

is a raylike nbd of

Q_{0}

in Y and

||r_{2} (Q - Q_{0})|| \leq \frac{ε}{2 λ} ||Q - Q_{0}||

holds for every

Q \in B (Q_{0}, δ^{'}) \cap Y

. Furthermore, by the condition (6) and the continuity of the function f at the point

P_{0}

, there exists

δ > 0

such that

B (P_{0}, δ) \cap X \subseteq O

,

f (B (P_{0}, δ) \cap X) \subseteq B (Q_{0}, δ^{'}) \cap Y

and

\frac{||r_{1} (P - P_{0})||}{||P - P_{0}||} < λ

for every

P \in (B (P_{0}, δ) \ \{P_{0}\}) \cap X

. Hence,

d (\frac{r_{2} (f (P) - f (P_{0}))}{||P - P_{0}||}, 0) = \frac{||r_{2} (f (P) - f (P_{0}))||}{||P - P_{0}||} \leq

\leq \frac{\frac{ε}{2 λ} ||f (P) - f (P_{0})||}{||P - P_{0}||} \leq \frac{ε}{2 λ} \frac{||A (P - P_{0})|| + ||r_{1} (P - P_{0})||}{||P - P_{0}||} \leq \frac{ε}{2 λ} (λ + λ) = ε

for every

P \in (B (P_{0}, δ) \ \{P_{0}\}) \cap X .

Thus we have proved (8). □

Corollary 7.

Let

Ω_{1} \subseteq R^{n}

and

Ω_{2} \subseteq R^{m}

be open sets, and

f : Ω_{1} \to R^{m}

and

g : Ω_{2} \to R^{p}

such that

f (Ω_{1}) \subseteq Ω_{2}

. If f is differentiable at

P_{0} \in Ω_{1}

and g is differentiable at

f (P_{0}) \in Ω_{2}

, then the composition

g \circ f : Ω_{1} \to R^{p}

is differentiable at

P_{0}

and

d (g \circ f) (P_{0}) = d g (f (P_{0})) \circ d f (P_{0})

.

Proof.

Since every point of an open set admits raylike nbd in that set, the statement follows from the previous theorem, the Proposition 2 and the Corollary 1. □

Proposition 7.

Let

X \subseteq R^{n}

,

Y \subseteq R^{m}

and

f : X \to Y

be a bijection. Let the points

P_{0} \in X

and

Q_{0} = f (P_{0}) \in Y

admit a raylike nbd in X and Y, respectively, and let

Σ_{X, P_{0}} = R^{n}

and

Σ_{Y, Q_{0}} = R^{m}

. If the function f is differentiable at

P_{0}

and if its inverse

f^{- 1} : Y \to X

is differentiable at

Q_{0} \in Y

, then

m = n

, the differential

d f (P_{0}) : R^{n} \to R^{n}

is a regular operator and

d (f^{- 1}) (Q_{0}) = {(d f (P_{0}))}^{- 1}

.

Proof.

Since

f^{- 1} \circ f = 1_{X}

and

f \circ f^{- 1} = 1_{Y},

by the previous theorem, it follows that

d (f^{- 1}) (Q_{0}) \circ d f (P_{0}) = d (1_{X}) (P_{0}) and d f (P_{0}) \circ d (f^{- 1}) (Q_{0}) = d (1_{Y}) (Q_{0}) .

Furthermore, an identity is a linear operator, so that the differentials of all restrictions of the identity are equal to that identity. Now from

d (f^{- 1}) (Q_{0}) \circ d f (P_{0}) = 1_{R^{n}} and d f (P_{0}) \circ d (f^{- 1}) (Q_{0}) = 1_{R^{m}}

it follows that the linear operators

d f (P_{0}) : R^{n} \to R^{m}

and

d (f^{- 1}) (Q_{0}) : R^{m} \to R^{n}

are bijections, i.e., isomorphisms, so that

n = m

[10]. Then

d (f^{- 1}) (Q_{0}) = {(d f (P_{0}))}^{- 1}

. □

5.2. Differentiability of Real Functions of One Variable

Let

f : X \to R,

X \subseteq R,

and

x_{0} \in X

be a point admitting nbd ray in X. Then

Σ_{X, x_{0}} = R

, and if f is differentiable at

x_{0},

then the differential of f at

x_{0}

is unique and

\lim_{\begin{matrix} h \to 0 \\ h \in Δ_{X, x_{0}} \end{matrix}} \frac{f (x_{0} + h) - f (x_{0}) - d f (x_{0}) (h)}{|h|} = 0 .

Moreover, since

d f (x_{0})

is a linear operator, there exists a unique

a \in R

such that

\lim_{\begin{matrix} h \to 0 \\ h \in Δ_{X, x_{0}} \end{matrix}} \frac{f (x_{0} + h) - f (x_{0}) - a h}{|h|} = 0 .

Therefore, it follows that

a = \lim_{\begin{matrix} h \to 0 \\ h \in Δ_{X, x_{0}} \end{matrix}} \frac{f (x_{0} + h) - f (x_{0})}{h} .

This limit is denoted by

f^{'} (x_{0})

and is called the derivative of the function f at the point

x_{0} .

In general, if

f^{'} (x_{0})

exists for a function

f : X \to R,

X \subseteq R,

at a point

x_{0} \in X

, then the function f is said to be derivable at

x_{0}

. If the function f is derivable at every point of X, then we say that f is derivable. If X is an open set then

f^{'} (x_{0}) = \lim_{h \to 0} \frac{f (x_{0} + h) - f (x_{0})}{h} .

Notice that differentiability of the function f at

x_{0}

implies derivability and

d f (x_{0}) (h) = f^{'} (x_{0}) h .

Also, derivability of the function f at

x_{0}

(existence of the number

f^{'} (x_{0})

) implies differentiability at

x_{0}

, i.e., it holds following theorem:

Theorem 7.

Let

f : X \to R

and

x_{0} \in X \subseteq R

be a point admitting nbd ray in

X .

The function f is differentiable at

x_{0}

if and only if it is derivable at

x_{0}

.

Generalizing the notion of derivability (differentiability) of a real function of a real variable to points admitting an nbd ray in the domain of this function and not belonging to the interior of this domain allows the phenomenon of derivable (differentiable) but discontinuous functions at a given point, as shown in the following example.

Example 9.

Let

X = ⋃_{n \in N} [- \frac{1}{2 n}, - \frac{1}{2 n + 1}] \cup [0, 1]

and

f : X \to R

be the function defined by

f (x) = \{\begin{matrix} n, & x \in [- \frac{1}{2 n}, - \frac{1}{2 n + 1}] \\ 0, & x \in [0, 1] \end{matrix} .

The function f is derivable (differentiable) at every point of its domain X, and

f^{'} (x) = 0

for every

x \in X

, but f is discontinuous at 0.

However, if a point

x_{0} \in X \subseteq R

of a function

f : X \to R

admits raylike nbd in X and if f is derivable at

x_{0},

then f is continuous at

x_{0}

, which follows from Theorem 4 and the previous theorem.

Let us now consider the question of the tangent to the graph of a function

f : X \to R

,

X \subseteq R

. Since the number

\frac{f (x_{0} + h) - f (x_{0})}{h}

is the slope of the secant passing through the points

(x_{0}, f (x_{0}))

and

(x_{0} + h, f (x_{0} + h))

, the number

\lim_{h \to 0} \frac{f (x_{0} + h) - f (x_{0})}{h} = f^{'} (x_{0})

, if it exists, is the slope of the tangent line (the limiting position of secant) to the graph of the function f at the point

(x_{0}, f (x_{0})) .

Hence, we distinguish two cases:

(a): If f is derivable at the point $x_{0},$ then $f^{'} (x_{0})$ exists and we define the tangent to the graph of the function f at the point $(x_{0}, f (x_{0}))$ as the line passing through the point $(x_{0}, f (x_{0}))$ whose the slope is $f^{'} (x_{0}),$ so that its equation is

$y - f (x_{0}) = f^{'} (x_{0}) (x - x_{0}) .$
(b): If the function f at $x_{0}$ is not derivable, but is continuous and

$\lim_{h \to 0} \frac{f (x_{0} + h) - f (x_{0})}{h} = \infty or \lim_{h \to 0} \frac{f (x_{0} + h) - f (x_{0})}{h} = - \infty,$

then the line $x = x_{0}$ is the limiting position of the secant and we call it tangent at the point $(x_{0}, f (x_{0}))$ to the graph of the function f. For example, since $\lim_{h \to 0} \frac{\sqrt[3]{h} - 0}{h} = \infty$ , the line $x = 0$ is tangent to the graph of the function $x \mapsto \sqrt[3]{x}$ at the point $(0, 0)$ .

Likewise, the line

x = x_{0}

is called the tangent to the graph of the function f at

(x_{0}, f (x_{0}))

provided that

\lim_{h \to 0^{+}} \frac{f (x_{0} + h) - f (x_{0})}{h} = \infty (- \infty) and \lim_{h \to 0^{-}} \frac{f (x_{0} + h) - f (x_{0})}{h} = - \infty (\infty) .

Thus, for the function

x \mapsto \sqrt[3]{x^{2}}

, since

\lim_{h \to 0^{+}} \frac{\sqrt[3]{h^{2}} - 0}{h} = \infty

and

\lim_{h \to 0^{-}} \frac{\sqrt[3]{h^{2}} - 0}{h} = - \infty

, it follows that the line

x = 0

is the tangent to the graph of this function at the point

(0, 0)

.

5.3. Differentiability of Functions of Several Real Variables

Let

X \subseteq R^{n}

,

f : X \to R

be differentiable at a point

P_{0} \in X

that admits nbd ray in X in the direction of k linearly independent vectors

V_{1}, \dots, V_{k}

,

k \leq n

, and let these vectors form the basis of the space

Σ_{X, P_{0}}

. The differential

A : R^{n} \to R

of the function f at

P_{0}

is uniquely determined on

Σ_{X, P_{0}}

(Theorem 2) by values

\partial_{V_{i}} f (P_{0}) = A (V_{i}),

i = 1, \dots, k

(Corollary 5), so that, for every

H \in Σ_{X, P_{0}}

, there exist numbers

h_{i},

i = 1, \dots, k

, such that

H = h_{1} V_{1} + \dots + h_{k} V_{k}

and

A (H) = \sum_{i = 1}^{k} \partial_{V_{i}} f (P_{0}) h_{i} .

If

k = n,

then the vectors

V_{1}, \dots, V_{n}

form the basis of the space

Σ_{X, P_{0}} = R^{n},

therefore, the linear operator

d f (P_{0})

is unique (Corollary 1) and is uniquely determined by the values

\partial_{V_{i}} f (P_{0}) = d f (P_{0}) (V_{i}),

i = 1, \dots, n .

For each

H \in R^{n}

there exist numbers

h_{i},

i = 1, \dots, n

, such that

H = h_{1} V_{1} + \dots + h_{n} V_{n}

and

d f (P_{0}) (H) = \sum_{i = 1}^{n} \partial_{V_{i}} f (P_{0}) h_{i} .

This proves the following theorem.

Theorem 8.

Let

X \subseteq R^{n}

and

P_{0} \in X

be a point admitting nbd ray in X in the direction of n linearly independent vectors

V_{1}, \dots, V_{n} .

If

f : X \to R

is differentiable at

P_{0}

, then f has the derivatives at

P_{0}

in the direction of

V_{i}, \dots, V_{n},

and for any choice of vector

H = h_{1} V_{1} + \dots + h_{n} V_{n} \in R^{n}

holds

d f (P_{0}) (H) = \sum_{i = 1}^{n} \partial_{V_{i}} f (P_{0}) h_{i} .

If

Ω \subseteq R^{n}

is an open set and a function

f : Ω \to R

is differentiable at

P_{0} \in Ω

then all of its partial derivatives at

P_{0}

exist and

d f (P_{0}) (H) = \sum_{i = 1}^{n} \partial_{i} f (P_{0}) h_{i} = (grad f (P_{0}) | H)

for every

H = (h_{1}, \dots, h_{n}) \in R^{n}

.

Notice that the existence of the differential of a function

f : X \to R

,

X \subseteq R^{n}

, at

P_{0} \in X

now no longer depends on the existence of the partial derivatives at this point (which was the case so far). If the function f is differentiable at

P_{0}

and if the point

P_{0}

does not admit nbd ray in the direction of

e_{i}

for some

i \in \{1, \dots, n\},

but admits nbd rays in X in the direction of n linearly independent vectors

V_{1}, \dots, V_{n}

, then the role of the partial derivatives of f at

P_{0}

is taken over in the differential

d f (P_{0})

by derivatives in the direction of

V_{1}, \dots, V_{n}

and

d f (P_{0})

is represented by the matrix

[\begin{matrix} \partial_{V_{1}} f (P_{0}) & \dots & \partial_{V_{n}} f (P_{0}) \end{matrix}]

in the pair of ordered bases

(V_{1}, \dots, V_{n})

and

(e_{1}),

e_{1} = 1

. The converse of the previous theorem is not true. Namely, at some point derivatives of f in the direction of all vectors in

R^{n} \ \{0\}

can exist and the function f need not be differentiable at that point, as we have shown in the Example 8.

5.4. Differentiability of Vector Functions

The question of differentiability of a vector function

f = (f_{1}, \dots, f_{m}) : X \to R^{m}

is equivalent to the question of differentiability of its coordinate functions

f_{i} = p_{i} \circ f

,

i = 1, \dots, m

.

Theorem 9.

Let

X \subseteq R^{n},

f = (f_{1}, \dots, f_{m}) : X \to R^{m}

, and

P_{0} \in X

be a point admitting nbd ray in

X .

The function f is differentiable at

P_{0}

if and only if

f_{i}

is differentiable at

P_{0}

for every

i = 1, \dots, m

. A linear operator

A : R^{n} \to R^{m}

is the differential of the function f at

P_{0}

if and only if

p_{i} \circ A : R^{n} \to R

is the differential of the function

f_{i}

at

P_{0},

for

i = 1, \dots, m

.

Proof.

The differential of the function f at

P_{0}

is a linear operator

A : R^{n} \to R^{m}

with the property

f (P_{0} + H) - f (P_{0}) = A (H) + r (H)

for every

H \in Δ_{X, P_{0}}

, where

r = (r_{1}, \dots, r_{m}) : Δ_{X, P_{0}} \to R^{m}

is the function such that

\lim_{H \to 0} \frac{r (H)}{||H||} = 0

. It follows that

f_{i} (P_{0} + H) - f_{i} (P_{0}) = A_{i} (H) + r_{i} (H),

for every

H \in Δ_{X, P_{0}}

, and

\lim_{H \to 0} \frac{r_{i} (H)}{||H||} = 0

, for every

i \in {1, \dots, m}

, where

A_{i}

denotes the i-th coordinate function

p_{i} \circ A : R^{n} \to R

, which is also a linear operator as A is. Thus, the function

f_{i} : X \to R

is differentiable at

P_{0}

for every

i = 1, \dots, m

, i.e., the i-th coordinate function of the differential of the function f at

P_{0}

is the differential of the i-th coordinate function

f_{i}

of f.

In the same way it can be shown that the converse statement is also valid, i.e., that the differentiability of the coordinate functions

f_{i}

at

P_{0}

implies the differentiability of the function

f = (f_{1}, \dots, f_{m}) .

□

Corollary 8.

Let

X \subseteq R^{n}

and

P_{0} \in X

be a point that admits nbd rays in X in the direction of n linearly independent vectors

V_{1}, \dots, V_{n} .

If

f = (f_{1}, \dots, f_{m}) : X \to R^{m}

is differentiable at

P_{0}

then there exists

\partial_{V_{i}} f (P_{0}) : = (\partial_{V_{i}} f_{1} (P_{0}), \dots, \partial_{V_{i}} f_{m} (P_{0})) \in R^{m}

for

i = 1, \dots, n

and

d f (P_{0}) (H) = \sum_{i = 1}^{n} \partial_{V_{i}} f (P_{0}) h_{i}, H = h_{1} V_{1} + \dots + h_{n} V_{n} \in R^{n} .

Proof.

The statement follows from the previous theorem, Corollary 1 and Theorem 8. □

In particular, if a function

f = (f_{1}, \dots, f_{m}) : X \to R^{m}

,

X \subseteq R^{n}

, is differentiable at a point

P_{0} \in X

and if the point

P_{0}

admits nbd ray in X in the direction of

e_{1}, \dots, e_{n},

then the numbers

\partial_{j} f_{i} (P_{0}),

i = 1, \dots, m

,

j = 1, \dots, n

, uniquely determine the differential

d f (P_{0}) : R^{n} \to R^{m}

of f at

P_{0} .

This linear operator in the pair of standard bases is represented by the well-known Jacobi matrix. But if the point

P_{0}

does not admit nbd ray in X in the direction of

e_{i}

for some

i \in \{1, \dots, n\}

, but admits nbd ray in X in the direction of n linearly independent vectors

V_{1}, \dots, V_{n}

, then the role of partial derivatives of the functions

f_{i}

,

i = 1, \dots, m

, at

P_{0}

is taken over in the differential

d f (P_{0})

by derivatives in the direction of

V_{1}, \dots, V_{n}

and

d f (P_{0})

is represented by the matrix

[\begin{matrix} \partial_{V_{1}} f_{1} (P_{0}) & \dots & \partial_{V_{n}} f_{1} (P_{0}) \\ ⋮ \\ \partial_{V_{1}} f_{m} (P_{0}) & \dots & \partial_{V_{n}} f_{m} (P_{0}) \end{matrix}]

in the pair of ordered bases

(V_{1}, \dots, V_{n})

and

(e_{1}, \dots, e_{m})

. Let us show an application of this generalized calculus by the following simple example.

Example 10.

Let

V_{1}

,

V_{2} \in R^{2} \ \{0\}

be two linear independent vectors and let

P_{0} \in X \subseteq R^{2}

be a point admitting raylike nbd in X and admitting nbd ray in X in the direction of

V_{1}

and

V_{2}

. Let

f (x, y) = (u, v, w) : X \to R^{3}

and

g (u, v, w) : R^{3} \to R

be differentiable functions at

P_{0} = (x_{0}, y_{0})

and

Q_{0} = f (P_{0}) = (u_{0}, v_{0}, w_{0})

, respectively. By Theorem 6 the function

g \circ f

is differentiable at

P_{0}

and by Corollary 5 there exist directional derivatives

\partial_{V_{1}} (g \circ f) (P_{0})

,

\partial_{V_{2}} (g \circ f) (P_{0})

,

\partial_{V_{1}} f (P_{0}) = (\partial_{V_{1}} u (P_{0}), \partial_{V_{1}} v (P_{0}), \partial_{V_{1}} w (P_{0}))

and

\partial_{V_{2}} f (P_{0}) = (\partial_{V_{2}} u (P_{0}), \partial_{V_{2}} v (P_{0}), \partial_{V_{2}} w (P_{0})) .

The differential

d f (P_{0})

is represented by the matrix

[\begin{matrix} \partial_{V_{1}} u (P_{0}) & \partial_{V_{2}} u (P_{0}) \\ \partial_{V_{1}} v (P_{0}) & \partial_{V_{2}} v (P_{0}) \\ \partial_{V_{1}} w (P_{0}) & \partial_{V_{2}} w (P_{0}) \end{matrix}]

in the pair of ordered bases

(V_{1}, V_{2})

and

(e_{1}, e_{2}, e_{3})

. The differential

d (g \circ f) (P_{0})

is represented by the matrix

[\begin{matrix} \partial_{V_{1}} (g \circ f) (P_{0}) & \partial_{V_{2}} (g \circ f) (P_{0}) \end{matrix}]

in the pair of ordered basis

(V_{1}, V_{2})

and

(e_{1})

,

e_{1} = 1

. Now, the equality

d (g \circ f) (P_{0}) = d g (Q_{0}) \circ d f (P_{0})

induces the matrix equation

[\begin{matrix} \partial_{V_{1}} (g \circ f) (P_{0}) & \partial_{V_{2}} (g \circ f) (P_{0}) \end{matrix}] =

= [\begin{matrix} \partial_{1} g (Q_{0}) & \partial_{2} g (Q_{0}) & \partial_{3} g (Q_{0}) \end{matrix}] [\begin{matrix} \partial_{V_{1}} u (P_{0}) & \partial_{V_{2}} u (P_{0}) \\ \partial_{V_{1}} v (P_{0}) & \partial_{V_{2}} v (P_{0}) \\ \partial_{V_{1}} w (P_{0}) & \partial_{V_{2}} w (P_{0}) \end{matrix}]

which implies the following formulas:

\partial_{V_{i}} (g \circ f) (P_{0}) = \partial_{V_{i}} u (P_{0}) \cdot \partial_{1} g (Q_{0}) + \partial_{V_{i}} v (P_{0}) \cdot \partial_{2} g (Q_{0}) + \partial_{V_{i}} w (P_{0}) \cdot \partial_{3} g (Q_{0}),

i = 1, 2

.

5.5. Differentiability of Vector Functions of One Variable

Let

f = (f_{1}, \dots, f_{m}) : X \to R^{m},

X \subseteq R

, be a vector function of one variable. If f is differentiable at a point

x_{0} \in X

then the differential is unique (Corollary 2). Moreover, by Theorem 9, functions

f_{i}

are differentiable at

x_{0}

for

i = 1, \dots, n,

and

d f (x_{0}) (h) = (f_{1}^{'} (x_{0}) h, \dots, f_{m}^{'} (x_{0}) h) .

The vector

(f_{1}^{'} (x_{0}), \dots, f_{m}^{'} (x_{0}))

is called the derivative of the vector function f at the point

x_{0}

and is denoted by

f^{'} (x_{0})

. Obviously,

d f (x_{0}) (h) = h f^{'} (x_{0}) .

From Theorems 7 and 9 it follows that the differentiability of f at

x_{0}

is equivalent to the derivability of its coordinate functions

f_{1}, \dots, f_{m}

at

x_{0}

, consequently instead of differentiability of f we often use the term derivability of f.

Definition 8.

Let

x_{0} \in X \subseteq R

admit raylike nbd in X. We say that

f = (f_{1}, \dots, f_{m}) : X \to R^{m}

is regular at

x_{0}

if f is derivable at

x_{0}

and

f^{'} (x_{0}) \neq 0

.

Definition 9.

Let

f = (f_{1}, \dots, f_{m}) : X \to R^{m}

be a regular at

x_{0} \in X \subseteq R

and let any

x \in f^{- 1} (\{f (x_{0})\})

admits raylike nbd in X. We say that f is geometrically smooth at

x_{0}

provided f is derivable on

f^{- 1} (\{f (x_{0})\})

and the vectors

f^{'} (x_{0})

and

f^{'} (x)

are collinear for every

x \in f^{- 1} (\{f (x_{0})\})

. If f is geometrically smooth at all points in its domain, then we say that f is geometrically smooth.

Let

m > 1

and

f = (f_{1}, \dots, f_{m}) : X \to R^{m}

be a geometrically smooth at a point

x_{0} \in X

. The vector

\frac{f (x_{0} + h) - f (x_{0})}{h}

is the direction vector of the secant passing through the points

f (x_{0})

and

f (x_{0} + h),

and when

h \to 0

we obtain the vector

f^{'} (x_{0})

which is the direction vector of the tangent to the image

f (X) \subseteq R^{m}

of the function f at the point

f (x_{0})

, i.e., the tangent to the image of the function f at the point

f (x_{0})

is the line passing through the point

f (x_{0})

and its direction vector is

f^{'} (x_{0})

. Therefore its equation is

\frac{y_{1} - f_{1} (x_{0})}{f_{1}^{'} (x_{0})} = \dots = \frac{y_{m} - f_{m} (x_{0})}{f_{m}^{'} (x_{0})} .

Notice that at the point

f (x_{0}) = (x_{0}, g (x_{0}))

the terms tangent to the image of the function

f = (1_{X}, g) : X \to R^{2}

,

X \subseteq R

and the tangent to the graph of or a function

g : X \to R

coincide.

The tangent to the image of a vector function f of one variable makes sense only at points where the function f is geometrically smooth. This means, first of all, that we consider only points at which the function f is derivable. In addition to derivability, the regularity of f is also required, since the direction vector of each line is different from the zero vector. Furthermore, the tangent to the image of the function f at the point

f (x_{0})

only makes sense if there exist derivatives of f at all points

x \in f^{- 1} (\{f (x_{0})\})

and these derivatives are collinear vectors. Otherwise, we would get different tangents at

f (x_{0})

depending on which point

x \in f^{- 1} (\{f (x_{0})\})

we choose. Therefore, this condition is also included in the definition of geometrically smooth function f at a point

x_{0}

, since only at these points the tangent is uniquely determined. For example the function

f : R \to R^{2} f (t) = (\cos t, \sin t)

is a geometrically smooth function, since it is derivable and regular at every point in the domain. The image of f is the circle

S^{1}

and for an arbitrary point

f (t_{0})

the derivatives of f at all points in the set

f^{- 1} (\{f (t_{0})\}) = \{t_{0} + 2 k π, k \in Z\}

are equal. On the other hand, the function

g : [0, \frac{π}{3}] \to R^{2} g (t) = (\sin 3 t \cos t, \sin 3 t \sin t)

is not geometrically smooth at the points 0 and

\frac{π}{3} .

Namely,

g (0) = (0, 0) = g (\frac{π}{3}),

the function g is derivable and

g^{'} (t) = (3 \cos 3 t \cos t - \sin 3 t \sin t, 3 \cos 3 t \sin t + \sin 3 t \cos t),

but vectors

g^{'} (0) = (3, 0) and g^{'} (\frac{π}{3}) = (- \frac{3}{2}, - 3 \frac{\sqrt{3}}{2})

are not collinear. Therefore, the tangent to the image of the function g at the point

(0, 0)

is not defined.

Notation 1.

Although the notion of a geometrically smooth function allows the correct definition of a tangent to the image of the function at a point (a tangent cannot be a line without direction and must be unique at the point in the image of the function), the problem of the dependence of the tangent on the observed function remains, i.e., a tangent to the image of a function depends directly on the observed function and not only on its image. Indeed, two functions f and g may have the same image Γ and for a point

P \in Γ

the function f need not be geometrically smooth at x and g geometrically smooth at y for every

x \in f^{- 1} (\{P\})

and every

y \in g^{- 1} (\{P\})

, nor need the vectors

f^{'} (x)

and

g^{'} (y)

, if they exist, be collinear. For example, the image of the functions

f, g : [- 1, 1] \to R^{2} f (t) = (t, t), g (t) = (t^{3}, t^{3})

is the line segment

\bar{(- 1, - 1) (1, 1)}

. The function f is geometrically smooth at the point 0,

f^{'} (0) = (1, 1)

and the tangent to the image of f at

(0, 0)

is the line

y = x

. On the other hand, the function g is not regular at the point 0, so the tangent to its image (the same line segment) is not defined. If we want to define a tangent to a set which is the image of a function of one variable, but does not depend on the function itself, we should consider curves, i.e., smooth 1-parameterizable sets, which is beyond the scope of this paper.

6. Tangent Plane

Let

X \subseteq R^{n}

,

n \geq 2,

and

f : X \to R

. Let

S = {(x_{1}, \dots, x_{n}) \in X ∣ f (x_{1}, \dots, x_{n}) = 0}

be a nonempty set, and

P_{0} = (x_{1}^{0}, \dots, x_{n}^{0}) \in S

be a point admitting nbd ray in X in the direction of

e_{1}, \dots, e_{n}

and admitting raylike nbd in X. Let f be differentiable at

P_{0}

and

\nabla f (P_{0}) \neq 0

. For a continuous function

r = (r_{1}, \dots, r_{n}) : [a, b] \to S

,

[a, b] \subseteq R

, which is differentiable and geometrically smooth at a point

t_{0} \in [a, b]

and for which

r (t_{0}) = P_{0}

, the direction vector of the tangent to the image

r ([a, b])

of r at the point

P_{0}

is

r^{'} (t_{0})

. Since

f \circ r = 0

then

d (f \circ r) (t_{0}) = 0

and, by Theorem 6,

0 = d (f \circ r) (t_{0}) = d f (P_{0}) \circ d r (t_{0}),

i.e.,

0 = [\begin{matrix} \partial_{_{1}} f (P_{0}) & \dots & \partial_{_{n}} f (P_{0}) \end{matrix}] [\begin{matrix} r_{1}^{'} (t_{0}) \\ ⋮ \\ r_{n}^{'} (t_{0}) \end{matrix}] = (\nabla f (P_{0}) | r^{'} (t_{0}))

(9)

therefore the vectors

\nabla f (P_{0})

and

r^{'} (t_{0})

are orthogonal. Thus, the direction vector of the tangent to the image of any function

r : [a, b] \to S

with the above properties at the point

P_{0}

is orthogonal to

\nabla f (P_{0})

which implies that all these tangents lie in the same plane; we call this hyperplane the tangent plane to the set S at the point

P_{0}

. Since

\nabla f (P_{0})

is its normal vector, its equation is

\partial_{1} f (P_{0}) (x_{1} - x_{1}^{0}) + \dots + \partial_{n} f (P_{0}) (x_{n} - x_{n}^{0}) = 0 .

Let F be defined by

F (x_{1}, \dots, x_{n + 1}) = x_{n + 1} - f (x_{1}, \dots, x_{n})

for

(x_{1}, \dots, x_{n}) \in X .

The tangent plane to the set

\begin{matrix} Γ & = {(x_{1}, \dots, x_{n + 1}) \in R^{n + 1} ∣ F (x_{1}, \dots, x_{n + 1}) = 0, (x_{1}, \dots, x_{n}) \in X} \\ = \{(x_{1}, \dots, x_{n}, f (x_{1}, \dots, x_{n})) \in R^{n + 1} ∣ (x_{1}, \dots, x_{n}) \in X\} \end{matrix}

at the point

(P_{0}, f (P_{0}))

is called tangent plane to the graph of the function f at the point

(P_{0}, f (P_{0}))

. The vector

(- \partial_{1} f (P_{0}), \dots, - \partial_{n} f (P_{0}), 1)

is a normal vector of this plane, so its equation is

x_{n + 1} - f (P_{0}) = \sum_{i = 1}^{n} \partial_{i} f (P_{0}) (x_{i} - x_{i}^{0}) = d f (P_{0}) (x_{1} - x_{1}^{0}, \dots, x_{n} - x_{n}^{0}) .

Let us now define the tangent plane to the graph of a scalar function in an even more general case, i.e., at points which do not admit nbd ray in X in the direction of some of the vectors

e_{1}, \dots, e_{n}

but in the direction of some n linearly independent vectors.

Definition 10.

Let

f : X \to R,

X \subseteq R^{n},

be differentiable at a point

P_{0} = (x_{1}^{0}, \dots, x_{n}^{0}) \in X

that admits raylike nbd in

X,

and let

Σ_{X, P_{0}} = R^{n} .

The hyperplane

x_{n + 1} - f (P_{0}) = d f (P_{0}) (x_{1} - x_{1}^{0}, \dots, x_{n} - x_{n}^{0})

is called the tangent plane to the graph of the function f at the point

(P_{0}, f (P_{0}))

.

Remark 3.

Since the coordinates of the vector

P - P_{0}

are given in the standard basis of

R^{n},

it is assumed that the operator

d f (P_{0})

is defined in the pair of ordered basis

(e_{1}, \dots, e_{n})

and

(e_{1} = 1) .

The numbers

d f (P_{0}) (e_{i}),

i = 1, \dots, n

, are partial derivatives of the function f only if

P_{0}

admits nbd ray in the direction of

e_{i},

for

i = 1, \dots, n

.

Let us show the justification of the previously defined notion. Let

f : X \to R

fulfill conditions of the previous definition and let

P_{0}

admits nbd ray in the direction of a vector

V = (v_{1}, \dots, v_{n}) \in R^{n} \ \{0\}

in X such that

\bar{P_{0} P_{0} + V} \subseteq X

. Let us consider the parametrization of the line segment

\bar{P_{0} P_{0} + V},

i.e., the function

r = (r_{1}, \dots, r_{n}) : [0, 1] \to R^{n}

r (t) = P_{0} + t V

. Due to the assumed differentiability of the function f at

P_{0}

, there exists the derivative

\partial_{V} f (P_{0})

in the direction of V (Corollary 5) and then the function

f \circ r

is derivable at 0 because

\begin{matrix} {(f \circ r)}^{'} (0) & = \lim_{\begin{matrix} h \to 0 \\ h \in 〈0, 1〉 \end{matrix}} \frac{f (P_{0} + h V) - f (P_{0})}{h} \\ = \lim_{\begin{matrix} h \to 0 \\ h V \in Δ_{V (X, P_{0})} \end{matrix}} \frac{f (P_{0} + h V) - f (P_{0})}{h} = \partial_{V} f (P_{0}) . \end{matrix}

The image of the function

(r_{1}, \dots, r_{n}, f \circ r) : [0, 1] \to R^{n + 1}

belongs to the graph of f, passes through the point

(P_{0}, f (P_{0}))

and the direction vector of the tangent to the image of the function

(r_{1}, \dots, r_{n}, f \circ r)

at the point

(P_{0}, f (P_{0}))

is

(v_{1}, \dots, v_{n}, \partial_{V} f (P_{0}))

. We will now show that this tangent lies in the tangent plane to the graph of the function f at

(P_{0}, f (P_{0}))

, i.e., that the point

(x_{1}^{0} + v_{1}, \dots, x_{n}^{0} + v_{n}, f (P_{0}) + \partial f_{V} (P_{0}))

belongs to this plane. According to Corollary 5,

d f (P_{0}) (V) = \partial_{V} f (P_{0})

holds. This implies

(f (P_{0}) + \partial_{V} f (P_{0})) - f (P_{0}) = d f (P_{0}) (x_{1}^{0} + v_{1} - x_{1}^{0}, \dots, x_{n}^{0} + v_{n} - x_{n}^{0})

which proves that all points of the tangent belong to the tangent plane. This means that all previously described tangents lie in the same hyperplane and the normal vector of this hyperplane is orthogonal to the vector

(v_{1}, \dots, v_{n}, \partial_{V} f (P_{0}))

where which justifies the previous definition.

Remark 4.

For a scalar function

f : D \to R

of two variables which fulfils conditions of the previous definition at a point

P_{0} = (x_{0}, y_{0}) \in D

admitting nbd rays in the direction of two non-collinear vectors

V = (v_{1}, v_{2})

and

V^{'} = (v_{1}^{'}, v_{2}^{'})

in D, the vector of the direction of the normal line of the tangent plane to the graph of the function f at

(x_{0}, y_{0}, f (P_{0}))

is

(v_{1}, v_{2}, \partial_{V} f (P_{0})) \times (v_{1}^{'}, v_{2}^{'}, \partial_{V^{'}} f (P_{0}))

. This vector is orthogonal to the vector

(\bar{v_{1}}, \bar{v_{2}}, \partial_{(\bar{v_{1}}, \bar{v_{2}})} f (P_{0}))

where

(\bar{v_{1}}, \bar{v_{2}})

is an arbitrary vector such that the point

P_{0}

admits nbd ray in D in the direction of

(\bar{v_{1}}, \bar{v_{2}})

. In particular, at a point

P = (x, y) \in Int D,

i.e., at a point that admits nbd rays in D in the direction of vectors

e_{1}

and

e_{2}

, the normal vector of the tangent plane in

(x, y, f (P))

can be calculated as

(1, 0, \partial_{x} f (P)) \times (0, 1, \partial_{y} f (P)) = (- \partial_{x} f (P), - \partial_{y} f (P), 1) .

7. Differentiability in Different Coordinate Systems

7.1. Affine Coordinate Systems

Let

(O^{'}, (e^{'}))

be an affine coordinate system [10] where

O^{'} \in R^{n}

is its origin and

(e^{'}) = (e_{1}^{'}, \dots, e_{n}^{'})

is an ordered basis for the vector space

R^{n}

. For any

P \in R^{n}

the numbers

x_{1}^{'}, \dots, x_{n}^{'} \in R

for which

P - O^{'} = x_{1}^{'} e_{1}^{'} + \dots + x_{n}^{'} e_{n}^{'}

are the coordinates of the point P in that affine system and we write

P = (x_{1}^{'}, \dots, x_{n}^{'})

. Obviously,

O^{'} = (0, \dots, 0) .

If

(e) = (e_{1}, \dots, e_{n})

is the standard ordered basis and

O \in R^{n}

,

(O, (e))

is said to be the standard affine coordinate system.

Let

P \in R^{n}

has notation

(x_{1}, \dots, x_{n})

in the standard affine coordinate system

(O, (e))

and let

(x_{1}^{'}, \dots, x_{n}^{'})

be its notation in the affine coordinate system

(O^{'}, (e^{'}))

. Then the linear operator

A_{(e) (e^{'})} : R^{n} \to R^{n}

A (e_{i}) = e_{i}^{'}

,

i = 1, \dots, n

, is an isomorphism and its corresponding affine isomorphism

σ : R^{n} \to R^{n}

give us an analytical relation between the coordinates in the both coordinate systems according to the rule

(x_{1}, \dots, x_{n}) = σ (x_{1}^{'}, \dots, x_{n}^{'}) : = (x_{1}^{0}, \dots, x_{n}^{0}) + A_{(e) (e^{'})} (x_{1}^{'}, \dots, x_{n}^{'}),

(10)

where

(x_{1}^{0}, \dots, x_{n}^{0})

is the notation of the point

O^{'}

in the standard affine coordinate system

(O, (e))

. The affine isomorphism

σ

is called the function of the transition from the standard affine coordinate system to the affine coordinate system $(O^{'}, (e^{'}))$ . As an affine isomorphism, the function

σ

is differentiable, as well as its inverse

σ^{- 1} .

Definition 11.

Let

X \subseteq R^{n}

,

f : X \to R^{m}

,

σ : R^{n} \to R^{n}

be the transition function (10) and

X^{'} : = σ^{- 1} (X)

. The function

f_{σ} : = f \circ σ : X^{'} \to R^{m}

is called the representation of the function f in the affine coordinate system

(O^{'}, (e^{'}))

.

We will now prove the following important statements which hold for affine coordinate systems:

Theorem 10.

Let

X \subseteq R^{n}

,

σ : R^{n} \to R^{n}

be the function of the transition from the standard affine coordinate system to an affine coordinate system

(O^{'}, (e^{'}))

and let

P_{0} \in X

be a point admitting nbd ray in

X .

Let

f : X \to R^{m}

be a function,

X^{'} : = σ^{- 1} (X)

and

f_{σ} : X^{'} \to R^{m}

be its representation in the affine coordinate system

(O^{'}, (e^{'}))

. Then it holds:

(i): The point $P_{0}^{'} = σ^{- 1} (P_{0})$ admits nbd ray in $X^{'}$ and $Δ_{X^{'}, P_{0}^{'}} = A_{(e) (e^{'})}^{- 1} (Δ_{X, P_{0}})$ .
(ii): If f is differentiable at $P_{0},$ then the function $f_{σ}$ is differentiable at $σ^{- 1} (P_{0}) .$
(iii): If $Σ_{X, P_{0}} = R^{n}$ and f is differentiable at $P_{0},$ then the function $f_{σ}$ is also differentiable at $P_{0}^{'} = σ^{- 1} (P_{0})$ and $d f_{σ} (P_{0}^{'}) = d f (P_{0}) \circ A_{(e) (e^{'})}$ .

Proof.

(i): Since an affine function maps a line segment to the line segment whose endpoints are the image of the endpoints of the original line segment, the transition function $σ^{- 1}$ maps the line segment $\bar{P_{0} P_{0} + H}$ to the line segment $\bar{P_{0}^{'} P_{0}^{'} + A_{(e), (e^{'})}^{- 1} (H)}$ , from which follows the statement.
(ii): Let $Δ_{X, P_{0}}^{'} = A^{- 1} (Δ_{X, P_{0}})$ . By assumption, there exists a linear operator $B : R^{n} \to R^{m}$ such that

$\lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - B (H)}{||H||} = 0 .$

Let us show that the linear operator $B \circ A : R^{n} \to R^{m}$ is the differential of the function $f_{σ}$ at the point $P_{0}^{'} = σ^{- 1} (P_{0})$ . It holds

$\lim_{\begin{matrix} H^{'} \to 0 \\ H^{'} \in Δ_{X, P_{0}}^{'} \end{matrix}} \frac{f \circ σ (P_{0}^{'} + H^{'}) - f \circ σ (P_{0}^{'}) - B \circ A (H^{'})}{||H^{'}||} =$

$\lim_{\begin{matrix} H^{'} \to 0 \\ H^{'} \in Δ_{X, P_{0}}^{'} \end{matrix}} \frac{f (O^{'} + A (P_{0}^{'}) + A (H^{'})) - f (P_{0}) - B (A (H^{'}))}{||H^{'}||} =$

$= \lim_{\begin{matrix} H^{'} \to 0 \\ H^{'} \in Δ_{X, P_{0}}^{'} \end{matrix}} \frac{f (P_{0} + A (H^{'})) - f (P_{0}) - B (A (H^{'}))}{||A^{- 1} \circ A (H^{'})||} =$

$\lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - B (H)}{||A^{- 1} (H)||} =$

$= \lim_{\begin{matrix} H \to 0 \\ H \in Δ_{X, P_{0}} \end{matrix}} \frac{f (P_{0} + H) - f (P_{0}) - B (H)}{||H)|| \cdot \frac{||A^{- 1} (H)||}{||H)||}}$

By the Lemma 1, there exists $m \in R^{+}$ such that

$0 \leq \frac{||f (P_{0} + H) - f (P_{0}) - B (H)||}{||H|| \cdot \frac{||A^{- 1} (H)||}{||H)||}} \leq \frac{||f (P_{0} + H) - f (P_{0}) - B (H)||}{||H|| \cdot m},$

for every $H \in Δ_{X, P_{0}}$ , thus from the provious identities it follows

$\lim_{\begin{matrix} H^{'} \to 0 \\ H^{'} \in Δ_{X, P_{0}}^{'} \end{matrix}} \frac{f_{σ} (P_{0}^{'} + H^{'}) - f_{σ} (P_{0}^{'}) - B \circ A (H^{'})}{||H^{'}||} = 0 .$
(iii): It follows from the previous two statements.

□

Lemma 1.

Let

A : R^{n} \to R^{m}

be an isomorphism. Then there exists

m \in R^{+}

such that

\frac{||A (H)||}{||H||} \geq m, for every H \in R^{n} .

Proof.

Assume the contrary that there exists a sequence

(H_{k})

in

R^{n}

and a real sequence

(m_{k})

converging to 0 such that

\frac{||A (H_{k})||}{||H_{k}||} < m_{k}, for every k \in N .

Consider a sequence of points

(P_{k}),

P_{k} = \frac{H_{k}}{||H_{k}||},

k \in N

, in

R^{n}

. Since the sphere

S^{n - 1}

is a compact set, the sequence

(P_{k})

, contained in it, has a certain convergent subsequence

(P_{k_{l}})

whose limit

P_{0}

belongs to the sphere [3]. Therefore,

||P_{0}|| = 1 .

From

(P_{k_{l}}) \to P_{0}

follows

(||A (P_{k_{l}})||) \to ||A (P_{0})|| .

By the property of a norm and a linear operator, it holds

||A (P_{k_{l}})|| = ||A (\frac{H_{k}}{||H_{k}||})|| = \frac{||A (H_{k})||}{||H_{k}||} < m_{k}, for every k \in N .

This implies

\lim (||A (P_{k_{l}})||) = 0,

from which it follows

||A (P_{0})|| = 0

and consequently

A (P_{0}) = 0 .

But since A is an isomorphism, this implies

P_{0} = 0

which contradicts the equality

||P_{0}|| = 1

. □

This theorem shows us that the notion of differentiability of a function does not depend on the chosen affine coordinate system. We will now show that it is not true for some other coordinate systems.

7.2. Polar, Elliptical, Cylindrical and Spherical Coordinate Systems

For each point

T = (x, y) \in R^{2}

in the standard affine coordinate system we define the coordinates

r, ϕ

in the standard polar coordinate system (so-called polar coordinates) by the following formulas

r = \sqrt{x^{2} + y^{2}},

ϕ : = \arg (T) = \{\begin{matrix} \arctan (\frac{y}{x}), & x > 0, y \geq 0 \\ \arctan (\frac{y}{x}) + π, & x < 0, y \leq 0 or x < 0, y \geq 0 \\ \arctan (\frac{y}{x}) + 2 π, & x > 0, y \leq 0 \\ \frac{π}{2}, & x = 0, y > 0 \\ \frac{3 π}{2}, & x = 0, y < 0 \\ 0, & x = 0, y = 0 . \end{matrix}

This defines bijection

ρ : Θ_{0} \to R^{2}, Θ_{0} : = 〈0, \infty〉 \times [0, 2 π〉 \cup \{(0, 0)\}

(x, y) = ρ (r, ϕ) = (r \cos ϕ, r \sin ϕ) .

which we call the transition function from the standard affine coordinate system to the standard polar coordinate system. The polar coordinate grid consists of “lines”

ϕ = ϕ_{0}

and “lines”

r = r_{0}

for

(r_{0}, ϕ_{0}) \in Θ_{0}

. This polar coordinate grid is mapped to a “spider web” in an affine coordinate system consisting of all concentric circles with center 0 and all half lines with 0 as endpoint.

Notice that the function

ρ

is not a homeomorphism [2] since its inverse has discontinuity at all points

(x, 0),

x \in [0, \infty〉 .

For this reason, in applications and transitions from the polar to the affine coordinate system, the restriction of the transition function is used

{ρ |}_{〈0, \infty〉 \times 〈0, 2 π〉} : 〈0, \infty〉 \times 〈0, 2 π〉 \to R^{2} \ p_{0}, p_{0} : = \{(x, 0) ∣ x \geq 0\},

which is a homeomorphism. With the transition function

ρ

, the rectangle

\{(r, ϕ) ∣ r_{1} \leq r \leq r_{2}, ϕ_{1} \leq ϕ \leq ϕ_{2}\}, 0 < r_{1} < r_{2}, 0 \leq ϕ_{1} < ϕ_{2} < 2 π,

is mapped to an area bounded by corresponding circles and half-lines, therefore the polar coordinate system is more suitable to consider such sections than any other system in the plane. Every point of the set

Θ_{0}

admits nbd ray in it and the function

ρ

is differentiable at every point

(r, ϕ) \in Θ_{0}

, and the differential is represented by a matrix

[\begin{matrix} \cos ϕ & - r \sin ϕ \\ \sin ϕ & r \cos ϕ \end{matrix}] .

Let

X \subseteq R^{2}

,

f : X \to R^{m}

be a function and

X^{'} = ρ^{- 1} (X) \subseteq Θ_{0}

. Then, for the function

f_{ρ} : = f \circ ρ : X^{'} \to R^{m},

we say that the representation of the function f is in the polar coordinate system or in polar coordinates.

Example 11.

The representation of a function

f : R^{2} \to R

f (x, y) = \sqrt{x^{2} + y^{2}}

in the polar coordinate system is

f_{ρ} : Θ_{0} \to R

f_{ρ} (r, ϕ) = r .

Notice that the function f is not differentiable at the point O (Example 5) but the function

f_{ρ}

is differentiable at the point

(0, 0) = ρ^{- 1} (O)

and it holds

d f_{ρ} (0, 0) = p_{1}

.

Therefore, we conclude that the notion of differentiability of a function depends on the chosen coordinate system (affine or polar) in which it is represented. Such a phenomenon is not possible in the transition from one affine coordinate system to another (Theorem 10). Thus, in the transition from polar coordinates to affine coordinates (or vice versa), the differentiability of the function need not be preserved, nor does the notion of admissibility of nbd rays. To avoid such undesirable phenomena, we will consider only functions f (in affine coordinates) whose domains are open sets in

R^{2} \ p_{0}

, and functions

f_{ρ}

(in polar coordinates) whose domains are open sets in

〈0, \infty〉 \times 〈0, 2 π〉

. Since

{ρ |}_{〈0, \infty〉 \times 〈0, 2 π〉}

is a diffeomorphism (differentiable bijection which inverse is also differentiable), the following holds according to the Theorem 6.

Corollary 9.

Let

Ω \subseteq R^{2} \ p_{0}

be an open set,

f : Ω \to R^{m}

a function and

f_{ρ} : Ω^{'} \to R^{m}

representation of the function f in polar coordinates, where

Ω^{'} = ρ^{- 1} (Ω)

. If f is differentiable at a point

(x, y) \in Ω

then the function

f_{ρ} : Ω^{'} \to R^{m}

, is differentiable at

(r, ϕ) = ρ^{- 1} (x, y)

and it holds

d (f_{ρ}) (r, ϕ) = d f (x, y) \circ d ρ (r, ϕ)

.

Notation 2.

Differentiability of a function represented in polar coordinates should be considered only on a formal level. Indeed, the idea of linearization of a function, i.e., its local approximation by an affine function, makes sense only in affine coordinates. For example, for the scalar function

z = f_{ρ} (r, ϕ)

in polar coordinates, the differential at the point

(r_{0}, ϕ_{0}) \neq (0, 0)

is the linear operator A,

A (r, ϕ) = \partial_{r} f_{ρ} (r_{0}, ϕ_{0}) r + \partial_{ϕ} f_{ρ} (r_{0}, ϕ_{0}) ϕ

, which is no longer a linear operator in affine coordinates. Differentiability of a function represented in polar coordinates in the context of an affine coordinate system not be understood as a possibility of local approximation by an affine function, but by a linear combination of the functions

z = \sqrt{x^{2} + y^{2}}

and

z = \arg (x, y)

since these can be considered special in polar coordinates. For functions with a conic graph (like those in the previous example), such an approximation is more appropriate than linearization, and differentiation of functions in polar coordinates has exactly this meaning. Thus, the function from the previous example becomes a linear operator in polar coordinates and its differential is equal to itself at every point and in this context it is a perfect approximation.

8. Conclusions

In this work we have presented some results obtained in the research conducted during COVID epidemic. It was motivated by some issues and shortcomings which occur in some applications of the traditional approach to differentiability. These problems were noticed by the first author, who has many years of experience in giving classes of various courses in mathematical analysis to students from University of Split, Croatia. In the traditional approach to differentiability, which is featured in almost all university textbooks, this notion is considered only for interior points of domain of function or for functions with an open domain. We have generalized differentiability of scalar and vector functions of several variables by defining it at non-interior points of the domain of function, which include not only boundary points but also all points where a notion of linearization is meaningful (points admitting nbd rays). This generalization allows applications in many fields of mathematics and engineering or, in short, in all areas where standard differentiability can be applied. With this generalized approach to differentiability, some unexpected phenomena may occur, such as the non-uniqueness of the differential in some special cases, a function discontinuity at a point where a function is differentiable, which is possible only for points that do not admit raylike nbd in a domain of function. But if one reduces this theory only to points with some special properties (points admitting a linearization space with dimension equal to the dimension of the ambient Euclidean space of the domain and admitting a raylike neighborhood, which includes the interior points of a domain), then all properties and theorems belonging to the known theory of differentiability remain valid in this extended theory. For generalized differentiability, the corresponding calculus (differentiation techniques) is also provided by matrices—representatives of differentials at points. In this calculus, the role of partial derivatives (which generally may not exist for differentiable functions at some points) is taken by directional derivatives. The results presented open the possibility for further research and examination of known theorems on standard differentiability in a new context.

Author Contributions

Conceptualization, N.K.-B.; investigation, N.K.-B.; writing—original draft preparation, N.K.-B. and S.B.; writing—review and editing, N.K.-B. and S.B.; supervision, N.K.-B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Aksoy, A.; Khamsi, M. A Problem Book in Real Analysis; Springer: New York, NY, USA, 2010; pp. 197–198. [Google Scholar]
Dugundji, J. Topology, 12th ed.; Allyn and Bacon, Inc.: Boston, MA, USA, 1978; pp. 62–97. [Google Scholar]
Rudin, W. Principles of Mathematical Analysis, 3rd ed.; McGraw-Hill, Inc.: New York, NY, USA, 1976; pp. 51–52, 212–213. [Google Scholar]
Zorich, V. Mathematical Analysis I, 4th ed.; Springer: Berlin, Germany, 2004; Chapters 5 and 8. [Google Scholar]
Wade, W. An Introduction to Analysis, 4th ed.; Pearson Education, Inc.: Hoboken, NJ, USA, 2010; Chapters 3 and 4. [Google Scholar]
Wrede, R.; Spiegel, M. Theory and Problems of Advanced Calculus, 2nd ed.; McGraw-Hill, Inc.: New York, NY, USA, 1976; pp. 65–89. [Google Scholar]
Differencial Topology with Prof. John W. Milnor. Available online: https://www.youtube.com/watch?v=u5C0GKmMHQ4&list=PLS8dWbmb9L0fPQclKNhgYddkNr8QWMGzX (accessed on 20 June 2022).
Koceić Bilan, N.; Jelić, I. On intersetions of exponential and logarithmic curves. Ann. Math. Inform. 2014, 43, 159–170. [Google Scholar]
Kukushkin, M. Abstract Fractional Calculus for m-Accretive Operators. Int. J. Appl. Math. 2021, 34, 1–41. [Google Scholar] [CrossRef]
Hogben, L. Handbook of Linear Algebra, 2nd ed.; Chapman and Hall/CRC: Boca Raton, FL, USA, 2017. [Google Scholar]
Bühler, T.; Salamon, D. Functional Analysis; American Mathematical Society: Providence, RI, USA, 2018; pp. 22–26. [Google Scholar]
Appendix Functional Analysis and Operators. Available online: https://link.springer.com/content/pdf/bbm%3A978-3-030-34949-3%2F1 (accessed on 20 July 2022).

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Koceić-Bilan, N.; Braić, S. Generalized Approach to Differentiability. Mathematics 2022, 10, 3085. https://doi.org/10.3390/math10173085

AMA Style

Koceić-Bilan N, Braić S. Generalized Approach to Differentiability. Mathematics. 2022; 10(17):3085. https://doi.org/10.3390/math10173085

Chicago/Turabian Style

Koceić-Bilan, Nikola, and Snježana Braić. 2022. "Generalized Approach to Differentiability" Mathematics 10, no. 17: 3085. https://doi.org/10.3390/math10173085

APA Style

Koceić-Bilan, N., & Braić, S. (2022). Generalized Approach to Differentiability. Mathematics, 10(17), 3085. https://doi.org/10.3390/math10173085

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Generalized Approach to Differentiability

Abstract

1. Introduction and Motivation

2. Preliminaries

3. Linearization of Function

4. Partial and Directional Derivatives

5. Differentiable Functions

5.1. Properties of Differentials

5.2. Differentiability of Real Functions of One Variable

5.3. Differentiability of Functions of Several Real Variables

5.4. Differentiability of Vector Functions

5.5. Differentiability of Vector Functions of One Variable

6. Tangent Plane

7. Differentiability in Different Coordinate Systems

7.1. Affine Coordinate Systems

7.2. Polar, Elliptical, Cylindrical and Spherical Coordinate Systems

8. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI