1. Introduction and Motivation
One of the basic ideas of differential calculus is to better approximate a given function
locally by an affine function, i.e., to linearize it at a point
. For this to be possible, the function must be differentiable at this point which means that there exists a linear operator
such that the limit
exists and is equal to
For practical reasons, differentiability in mathematical analysis has been defined and considered almost only for functions
with an open domain
[
1,
2,
3,
4]. Since every point of an open set
is an accumulation point of
[
1] then for every point
it holds that
is the accumulation point of the domain
D of the function
for a linear operator
Indeed, there exists
such that the open ball
is contained in
and consequently
and the limit from the definition of differentiability (
1) is reasonable to consider. (Recall that the limit of the function can be considered only at an accumulation point of the domain.)
However, reducing differentiability only to an open domain, i.e., to the interior points of a domain, has, in addition to many successful applications and advantages, some obvious deficiencies. For example, for the function
it holds
so this function can be well approximated by the zero operator, i.e., it could be linearized at the point
inside the natural domain of
f, but due to the conditions from the definition of differentiability (that a point must belong to the interior of the domain [
2]), the differentiability of the function at this point is usually not considered at all. Even though this issue can be overcome by extending the definition of differentiability (derivability) of a real function of a real variable to the endpoints of the given domain using one side limits [
5], for a function of several variables the problem of differentiability at non-interior points of the domain remains current. For example, the differentiability of the function
cannot be considered in all boundary points
, although it can be well linearized locally by the zero operator in those points. Similarly, because of the reduction to open sets, the question of the existence of tangents [
6] and tangent planes [
4] of a function
,
or
, at points
remains open. For example, due to this reduction we cannot obtain the tangent of the function
at the point
although it is obvious that for points
the secants
tend to the line
as
x tends to 0, and the line
should be the tangent of this function at the point
O. Moreover, the study of the local conditional extreme of a scalar function is reduced to the study of a function whose domain is not necessarily an open set, so that the problem of finding a conditional extreme cannot be clarified or fully studied if differentiability is studied only on open sets. Furthermore, a differentiable function would lose the property of differentiability at many points if differentiability at boundary points is not considered when switching from one Cartesian coordinate system to other non-affine coordinate systems (or vice versa).
These are some of the reasons that indicate that the notion of differentiability should be generalized by observing differentiability not only at interior points of sets, but much more broadly, at points of any domain
of a function
in which the notion of differentiability and linearization is meaningful. John W. Milnor mentioned this problem in his famous series of lectures on differential topology which dates back to 1965 [
7]. We will show that this extension is meaningful for all points
for which there is at least one point
such that the line segment
is contained in
X. Indeed, this is the most general case in which a linear operator can linearize a function at a point (at least on a line segment to which this point belongs). The linearization space is then a one-dimensional vector subspace of
which is also the smallest vector subspace on which it is interesting to consider and specify a linear operator.
In the history of modern mathematics one can find some other issues or (overlooked) problems of mathematical analysis like this one [
8], where we take for granted some traditional approaches, common requirements and (sometimes wrong) conclusions. Concerning differentiability, one can find in the literature some generalizations of differentiability (derivability) such as the fractional derivative [
9] or the derivative at the endpoints of a segment [
5]. In this paper, we provide a natural generalization of differentiability of a function by defining it at some non-interior points of the domain of function. These points include not only the boundary points of the domain, but also all points in which the notion of differentiability and linearization is meaningful. For this generalized case, a corresponding calculus (techniques of differentiation) is also provided.
2. Preliminaries
In this paper we use the notation for the Euclidean scalar product on , the notation for the Euclidean norm and the notation d for the Euclidean metric. We use the notation O for the point or we simply write
Let
be an open set,
a function, and
an arbitrary point in
. To approximate the function
f on the open ball
at the point
with the special affine function
means to find a linear operator
[
10] such that
for any
. Geometrically interpreted, in the case of
this means that we want to replace the part of the graph of the function
f at the point
by the part of the graph of the affine function
i.e., the part of the hyperplane in
. The desirable property of such an approximation is that it is as accurate as possible at points closer to the point
, i.e., that the error
tends to zero as
H tends to zero. However, if
f is a continuous function, then the error
always tends to zero as
H tends to zero (because every linear operator acting between finite-dimensional vector spaces is continuous). This would mean that there is an adequate local replacement by the affine function of any continuous mapping, which is not the case. For example, if we consider the function
, it is easy to see that on the open ball
we cannot approximate this function by an affine function, i.e., we cannot replace its graph well enough by a part of the plane passing through the origin
, although this is perfectly possible on all rays starting in
O. Thus, it is not only necessary that the error
can be made arbitrarily small (because every continuous function has this property), but even more so that the relative error
can be made arbitrarily small, which leads us to the definition of differentiability of the function
f at the point
, which is as follows [
3]:
Let
be an open set. A function
is differentiable at a point
if there exists a linear operator
such that the limit
exists and is equal to
We then call the linear operator
A the differential of the function
f at the point
, it is unique and we denote it by
A linear operator
A is the differential of the function
f at the point
if and only if
where
is the error function with the property
3. Linearization of Function
Definition 1. Let be a set and We say that the point admits a neighborhood ray (or simply admits a nbd ray) in X if there exists such that the line segment is contained in X.
This notion is of particular importance to us because we will consider the linearization of a function exactly at points in a domain that admit at least one nbd ray in the domain (and not, as before, only at points from its interior).
Example 1. - (a)
No point of a sphere admits a nbd ray in it.
- (b)
Every point of a non-trivial convex set admits a nbd ray in that set.
Since every line segment is a convex set and every nontrivial convex set contains the line segment between any two of its points, a point admits a nbd ray in X if and only if there exists a nontrivial convex set such that .
Definition 2. Let and be a point admitting nbd ray in X. The setis called the set of linear contributions at in X, and its linear hull [10] is said to be the linearization space at with respect to X. For a function and a point we say that is the linearization space of the function
f at the point
.
Example 2. - (a)
Let the points be in general position, i.e., let them be the three non-collinear points. Then it holds - (b)
Let and . Then it holds
Let us now generalize the notion of differentiability of a function to points admitting nbd ray in the domain. This will allow us to consider differentiability at points where this was not possible so far.
Definition 3. Let and be a point admitting nbd ray in X. We say that a function is differentiable at if there exists a linear operator such that the limit
exists and is equal to . If such a linear operator exists, we call it the differential of the function f at the point The function f is differentiable on X if f is differentiable at every point of X.
Notice that if a point
admits nbd ray in
then
is an accumulation point of the set
X and then
is an accumulation point of the set
. Indeed, if
then every nbd of
contains some points of the line segment
and consequently every nbd of 0 intersects
. Therefore, the limit from the previous definition makes sense to consider. Furthermore, the natural domain
of the function
could be in general a superset of
, so it is necessary to emphasize that the limit (
2) is considered only on the set
(this is the limit of the restriction of the function (
3) to the set
at 0). Otherwise, the values of the above function at points that do not belong to
but are in
D and near 0 may affect the existence of the limit of the function
(3) at 0, which we do not want to allow. But, if
then it holds
which is a consequence of the following theorem:
Theorem 1. Let and be an accumulation point of the set Let U be an open neighborhood of the point in such that If the restriction has the limit at the point , then f has the limit at and they are equal, i.e., .
Proof. Let
and let
be an open ball in
. Then there exists an open neighborhood
V of
in
such that
and
. Hence,
which implies that
□
Therefore, in the above definition of differentiability of a function at a point, we can omit the notation of the restriction in the limit if this point belongs to the interior of the domain. In this case, the above definition coincides with the previously known definition of this notion. Thus, the Definition 3 is a natural generalization of the notion of differentiability and this generalization brings many advantages and solves many contentious issues and problems (e.g., at the boundary points of a domain…), which we will explain hereinafter with several various examples.
From the definition of differentiability, it follows that the linear operator
is the differential of a function
at a point
if and only if
where
is the error function with the property
. Notice that the above equation makes sense only on
, i.e., only for a sufficiently small neighborhood
U of the point
[
2] we can write
for
. Likewise, the linear operator
A, although defined on
, has its true meaning from the point of view of approximating the function
f only on the linearization space
.
It is important to notice that the differential of a function at a point need not be unique (which could not be the case so far). Indeed, if the linearization space is a proper subset of and if there exists a differential of the function f at then every linear operator that coincides with A on the subspace (and there are infinitely many of them) is the differential of the function f at because it satisfies the conditions of the definition of differentiability. Let us formalize this consideration by the following statement:
Proposition 1. Let , and be a point admitting a nbd ray in X. If the function f is differentiable at and is the differential of the function f at the point then every linear operator which agrees with A on the vector space is also the differential of the function f at the point .
Proof. Using equality
it is easy to check that
holds. □
Now, we will show that all differentials of f at are equal on the linearization space
Theorem 2. Let , and be a point admitting a nbd ray in X. If the differential of the function f exists at the point then it is unique on the vector space .
Proof. Suppose that
are two linear operators for which
Then it holds
Every vector
can be written as a linear combination of vectors from
. Therefore,
if and only if
for every
If
then
for every
and
so it follows
Therefore,
for every
and it holds
□
Remark 1. One might think that the cases where the linearization space is a proper subset of and the differential of the function f exists at the point cause certain difficulties because the differential is not unique, but it is unique where it should be, i.e., on the linearization space . According to the previous theorem, all differentials of the function f at the point coincide in the space and this is the only thing that is important for us because only in this space we can use the differential to approximate the function f at the point .
Corollary 1. Let be a point admitting nbd ray in X and If the differential of the function f exists at the point , then it is unique.
Proof. This follows from Theorem 2 and Proposition 1. □
If the differential of a function , , exists at a point and is unique, we denote it by
If is a linear operator then f is differentiable on and at any point . In particular, the projection map , , is a linear operator and for every . Usually is denoted by
An affine mapping where and is a linear operator, is differentiable on and for every point .
Example 3. Let and . Since f is the restriction of the linear operator on the convex set f is differentiable at any point of the domain and the differential at any point is equal to A. The linearization space of the function f at any point of the domain is and since it is a 1-dimensional subspace of the differential of f is not unique. Moreover, all linear operators , represented by a matrix , are all its different differentials. However, according to the previous theorem, the restriction of all these differentials on Σ is the same.
Notice that according to the traditional definition of differentiability, this function would not be differentiable at any point in its domain. On the other hand, the function f is perfectly linearized since its graph is , and it would be incorrect to say that it cannot be linearized (since its graph is perfectly linearized by the part of the line ). However, since for functions whose domain is a subset of the graph is linearized by part of the plane, we can do that in infinitely many ways, since the entire pencil of planes passes through the line , so its linearization is not unique. However, if we take the set , for the domain of the function f, then , and by the previous theorem the differential of the function f at is unique, i.e., its linearization is the part of the unique plane passing through the line and , , (the graph of the function f is the set , which is a part of this plane).
Corollary 2. Let and be a point admitting nbd ray in X. If a differential of the function f exists at the point then it is unique.
Proof. Since , the statement follows from the previous corollary. □
Example 4. Let us consider the function , from the introduction. Since for it holdsandwhere denotes the zero operator, f is differentiable at the point Moreover, it follows from that the zero operator is the unique differential of f at the point , i.e., Definition 4. Let and . We say that a point admits a neighborhood ray in X in the direction of V if there exists such that
Proposition 2. Let be an open set. Every point admits a nbd ray in Ω in the direction of all vectors and
Proof. Let
and let
be arbitrary. Since
is open, there exists a ball
and since a ball is a convex set,
holds. Therefore,
admits nbd ray in
in the direction of
H and then admits it in
. Furthermore, from
it follows that
and
implies
for every
So,
and then
□
Corollary 3. If is an open set and is differentiable at a point then the differential of the function f at the point P is unique.
Proof. This follows from the previous proposition and corollary 1. □
Remark 2. We have already mentioned that the new definition of differentiability (Definition 3) coincides with the well-known definition of this notion when the domain of a function is an open set. It follows that in this particular case all previously known properties of differentials hold, including the property of uniqueness. However, the new theory induced by the extended definition of differentiability provides the proof of the uniqueness of the differential of an open domain function without relying on prior general knowledge of it.
Proposition 3. Let and be a point admitting nbd ray in Y. If f is differentiable at then is differentiable at and the differentials of the functions f and at coincide on .
Proof. Since the function
f is differentiable at
, there exists a linear operator
such that
Now,
implies
from which it follows that the function
is differentiable at
and that the linear operator
A is its differential at
. Now, by the Theorem 2, we conclude that every other differential at
coincides with
A on
. □
The converse does not hold, i.e., if the restriction of a function , to a subset is differentiable at a point then in general the function f need not be differentiable at that point. This will be shown by the following counterexample. But we will prove that if Y is open in then differentiability on Y implies differentiability on X.
Example 5. Let Let us consider the restrictions of the function f to setsFor functionsthe linearization space at each point in their domains is . The function is the restriction of the linear operator to the convex set , so is the differential of the function at any point in (not unique because the dimension of Σ is less than but they all coincide on Σ). Similarly, the differential of the function at every point in is . If the function f were differentiable at the point then, by Proposition 3 and Theorem 2, the differentials of the functions and at the point 0 would coincide on Σ which is obviously not the case. Theorem 3. Let , , and U be a neighborhood of the point in . If admits a nbd ray in and if is differentiable at then f is also differentiable at .
Proof. Since
U is a neighborhood of the point
in
, there exists
such that
and then
. Therefore,
Due to differentiability of the function
, there exists a linear operator
such that
Since
and
, by Theorem 1, it follows
which implies that
f is differentiable at
□
The following statement follows from the previous theorem.
Corollary 4. Let , , be an open set in and . If is differentiable at then f is also differentiable at and .
We will show now that differentiability does not imply continuity in general (which cannot be the case for a function with an open domain).
Example 6. Let , , andLet us consider the function For the point it holdsThe function f is differentiable at the point O (it is differentiable at every point of its domain and the zero operator is one of its differentials), but f is discontinuous at all points of the line segment , so it is discontinuous at O. In this example, the dimension of the linearization space is less than the dimension of the whole space . However, even if the dimension of a linearization space is equal to the dimension of the whole space , a function need not be continuous. This is shown by the following counterexample.
Example 7. Let be the 1-sphere and three distinct points on it. Consider the union of two circular arcs and their corresponding chords . The function is differentiable at Namely, andSince , the differential is unique and is the zero operator. But the function f is discontinuous at because and hold. To ensure that differentiability of a function at a point implies continuity at that point, we need an additional condition, which is introduced in the following definition.
Definition 5. Let and be a point admitting nbd ray in X. A neighborhood U of the point in X is said to be raylike neighborhood of the point in X provided holds for every . If there exists at least one raylike nbd in X of the point we say that the point admits raylike nbd in X.
It is easy to see that every point of a non-trivial convex set admits raylike nbd in that set, and then every point of the open set admits raylike nbd in .
Theorem 4. Let a point admits raylike nbd in X. If is differentiable at then it is also continuous at .
Proof. Let
U be a raylike nbd of the point
in
X. Then
is a neighborhood of the point
in
. To prove that
f is continuous at
it suffices to prove that
is continuous at
, i.e., that
By the assumed differentiability, there exists a linear operator
such that
where
is an error function with the property
. Then
Since
,
Every linear operator operating between finite dimensional vectorial spaces is continuous, therefore
Hence, by (
4), it follows
□
Obviously, if is an open set, , and f is differentiable at then f is also continuous at . Thus, if the domain of a function is an open set, differentiability implies continuity. The same is true for any convex domain.
6. Tangent Plane
Let
,
and
. Let
be a nonempty set, and
be a point admitting nbd ray in
X in the direction of
and admitting raylike nbd in
X. Let
f be differentiable at
and
. For a continuous function
,
, which is differentiable and geometrically smooth at a point
and for which
, the direction vector of the tangent to the image
of
r at the point
is
. Since
then
and, by Theorem 6,
i.e.,
therefore the vectors
and
are orthogonal. Thus, the direction vector of the tangent to the image of any function
with the above properties at the point
is orthogonal to
which implies that all these tangents lie in the same plane; we call this hyperplane the
tangent plane to the set S at the point . Since
is its normal vector, its equation is
Let
F be defined by
for
The tangent plane to the set
at the point
is called
tangent plane to the graph of the function f at the point . The vector
is a normal vector of this plane, so its equation is
Let us now define the tangent plane to the graph of a scalar function in an even more general case, i.e., at points which do not admit nbd ray in
X in the direction of some of the vectors
but in the direction of some
n linearly independent vectors.
Definition 10. Let be differentiable at a point that admits raylike nbd in and let The hyperplaneis called the tangent plane to the graph of the function f at the point . Remark 3. Since the coordinates of the vector are given in the standard basis of it is assumed that the operator is defined in the pair of ordered basis and The numbers , are partial derivatives of the function f only if admits nbd ray in the direction of for .
Let us show the justification of the previously defined notion. Let
fulfill conditions of the previous definition and let
admits nbd ray in the direction of a vector
in
X such that
. Let us consider the parametrization of the line segment
i.e., the function
. Due to the assumed differentiability of the function
f at
, there exists the derivative
in the direction of
V (Corollary 5) and then the function
is derivable at 0 because
The image of the function
belongs to the graph of
f, passes through the point
and the direction vector of the tangent to the image of the function
at the point
is
. We will now show that this tangent lies in the tangent plane to the graph of the function
f at
, i.e., that the point
belongs to this plane. According to Corollary 5,
holds. This implies
which proves that all points of the tangent belong to the tangent plane. This means that all previously described tangents lie in the same hyperplane and the normal vector of this hyperplane is orthogonal to the vector
where which justifies the previous definition.
Remark 4. For a scalar function of two variables which fulfils conditions of the previous definition at a point admitting nbd rays in the direction of two non-collinear vectors and in D, the vector of the direction of the normal line of the tangent plane to the graph of the function f at is . This vector is orthogonal to the vector where is an arbitrary vector such that the point admits nbd ray in D in the direction of . In particular, at a point i.e., at a point that admits nbd rays in D in the direction of vectors and , the normal vector of the tangent plane in can be calculated as