1. Introduction and Motivation
Given a non-empty set E, let L be a real linear space of some real-valued functions x defined on E between others; the function , constantly equaling 1, belongs to L, and let A be a linear mean on L, i.e., any linear positive functional on L with .
For any convex functions
defined on a bounded interval
I, inequalities of the form
where
are classical.
The fundamental examples of linear means are
or
, where
are positive weights summable to 1 and
is a fixed non-negative function integrable to 1, while
or
is integrable. For these functionals, inequalities (
1) were established in 1906 by Jensen [
1], one of the founders of the convex functions theory. Presently, they are just called Jensen’s inequalities; moreover, a convex function itself is defined by (
1) in case of the mentioned discrete functional.
Twenty-five years later, Jessen [
2] proved (
1) in a general case for abstract linear means. The case of multi-variable convex function
was developed by McShane [
3]. The inequality remains true for the wider class of functionals compared to linear means, namely sublinear isotonic functionals
A preserving constants; i.e.,
Results of that type come from Pečarić and Raşa [
4] and Dragomir, Pearce, and Pečarić [
5] (see also ([
6] Th. 12.18)). The latest results on this topic can be found in [
7,
8,
9,
10,
11,
12]. For example, in [
12], Otachel obtained (
1) with weaker assumptions about functionals than those made in [
4] or in [
5] and generalized Jessen’s and McShane’s inequalities. It complements the results obtained in [
4,
5].
In this article, we prove a vectorial version of inequality (
1), where convex functions are replaced by convex maps
with values in ordered linear spaces and functionals
A being linear means are altered to
vectorial means. This is a body of Theorem 5 in
Section 5; the notion of vectorial mean is introduced therein as well. Moreover, we present there a counterpart of McShane’s result (Theorem 6) for our vectorial means.
The key tool for proving inequalities of type (
1) includes the supports of convex functions. In
Section 4, in Theorem 4, we show that certain convex maps admit analogous characterizations.
They are consequences of versions of Hahn–Banach theorems on extensions of linear or affine maps dominated by sublinear or convex maps taking values in ordered linear spaces. These theorems are presented in
Section 3, Theorems 2–3.
Section 2 includes the main settings of the paper, used notations, and the basic notions relevant to our considerations, along with illustrative examples and elementary properties.
2. Notation and Preliminaries
A binary relation ⪯ on a set Z that is reflexive ( for every x) and transitive (if and , then ) is called a preordering on Z. Additionally, if it is antisymmetric (both inequalities and , providing ), we speak about (partial) ordering.
The Kuratowski–Zorn lemma states that a partially ordered set containing upper bounds for every chain (that is, every totally ordered subset, where any two elements are comparable) necessarily contains at least one maximal element (that is, there are no other succeeding elements in the set).
In the following, we will be particularly interested in partial orderings in real linear spaces. A subset C of a real linear space is a convex cone if for all non-negative scalars . Then, defines a preordering in the space. Additionally, if C is a pointed cone (), then the relation is antisymmetric, so ⪯ is a partial ordering, called a cone ordering or an ordering generated by the convex cone or a cone-generated ordering.
In this case, one can add the inequalities side by side and multiply them by positive scalars.
The componentwise ordering in () and the natural ordering of real functions defined on a certain set E (that is, ) are basic examples of cone orderings. The cones generating above orderings are the convex cone of vectors with non-negative components and the convex cone of non-negative functions, respectively. Both of them are pointed. Example 3 includes a more sophisticated instance.
Throughout the paper, every ordering on a linear real space is a cone ordering.
Let be ordered real linear spaces. The notions of convexity, sublinearity, isotonicity, positivity, and negativity assigned to real functionals can be naturally extended to maps defined on and taking values in ordered real linear spaces.
A map is said to be
convex if for all and all with
sublinear if for all and all
isotonic if
positive if
negative if
The above maps can be defined on some dedicated subsets of space, e.g., convex on convex, sublinear on convex cones, or isotonic on increasing subsets of the space. This sometimes complicates the assumptions, and, if it is not essential for the considerations, we will limit ourselves to the situation when the domain of the map is the entire space. For considering sublinear or convex (concave) maps, no ordering of the space is needed.
Clearly, every sublinear map is convex. The epigraph of any map is denoted and defined as
Theorem 1. Let and be a convex cone that induces the partial order in .
- (i)
F is convex ⇔ is a convex subset of
- (ii)
F is sublinear ⇔ is a convex cone not including
Proof. (ii). Let be a sublinear map and , that is and Given arbitrary Since F is sublinear, where the last inequality is implied by our assumptions. Thus, ; i.e., is a convex cone.
Moreover, if , then . It is known that sublinearity of F ensures , so . It contradicts the basic setting that
Conversely, for arbitrary , . If is a convex cone, then ; equivalently, where are arbitrary. Therefore, F is a sublinear map.
(i). The proof is analogous. □
It is known, in case of a real valued function
f,
f is convex if and only if
is convex;
f is a sublinear functional if and only if
is a convex cone with
(see [
13] (Th. 2.1.1, Prop. 2.1.2)).
The following further properties of any sublinear map
A are easy to verify
Sublinearity of a map A is equivalent to non-negative homogeneity, for all real and all , and subadditivity, for all Indeed, if A is sublinear, then, for and , . On the other hand, ; hence, , and, finally, The remaining details of the mentioned equivalence are obvious.
It is evident in (
4) that any sublinear negative map is isotonic and therefore positive, but not conversely. For linear maps, isotonicity is equivalent to positiveness. Note, properties of isotonicity or positivity are independent of sublinearity or linearity. In case of maps with values in
, we will rather speak about functionals. A sublinear functional is called also a Banach functional. Examples of such functionals are found in [
12].
It might seem that sublinearity can be defined for any real scalar. The following example shows that the sublinear maps are simply linear.
Example 1. Let be a map fulfilling the following sublinearity condition for all and all (sic!).
Then, . Hence, and . Thus, . Substituting , we obtain and The first identity (for ) yields . Linking it with the second one for leads to
Now, according to the assumed sublinearity, for any , we have or, equivalently, . Furthermore, . Thus,
Therefore, a map A with for all and all is necessarily linear.
In [14,15], one can find suggestions that the condition of sublinearity, in the above sense, is apparently weaker than linearity in case of functionals. Example 2. Here, we will additionally assume that, for all , there exist and ; i.e., is a vector lattice. Set , and , where . It is known that and ; moreover, and and maps are sublinear.
For arbitrary , the following inequalities are obviously equivalent:Now, let be linear positive maps with . Then,or, equivalently,It proves subadditivity of the map Negativity, positivity, and non-negative homogeneity of A are obvious. Thus, A is sublinear and isotonic. Example 3. Let be the space of all extended real-valued and measurable functions that are integrable on the unit interval provided with Lebesgue measure μ. For each , let be its decreasing rearrangement defined as the right-hand continuous inverse of the distribution function i.e., Now, we are in position to recall a continuous version of well-known order originated by Hardy, Littlewood, and Pólya in [16,17]. For , we will write whenever It is easy to see that the operation is non-negatively homogeneous; i.e., . Moreover, in light of [18] (Th. 8.1), it is subadditive in the following sense . Thus, the operation of taking decreasing rearrangement of a function from is sublinear regarding the above defined order. The subset is a convex cone. Moreover, if and , then Hence, by the Lebesgue differentiation theorem, , a.e.
Note that the HLP-order restricted to the convex cone of all decreasing μ a.e. functions in is a cone order generated by C. Details of properties of HLP-orders are present in [19], cf. also [20]. 3. Versions of the Hahn–Banach Theorem with Sublinear or Convex Maps
Let and be certain maps defined on the vector space L with values in the ordered real vector space , where the partial order ⪯ is generated by the pointed convex cone C. In this case, we do not need any ordering in
Additionally, it will be assumed that, for any subset bounded from above, i.e., for any and a fixed , there exists , and, for any subset bounded from below, i.e., for any and a fixed , there exists .
In particular, if for any and for any , then there exist and and, moreover, .
We say the map F is dominated by the map S if .
In the literature (see e.g., [
21] (Chap. VI, §3, Th. 1)), one can find the following version of Hahn–Banach theorem on extension of linear maps dominated by sublinear maps.
Theorem 2. Let be a linear map defined on a subspace and dominated by a sublinear map S defined on the space L, both with values in the ordered space M. Then, there exists at least one linear extension of to whole L, dominated by S; i.e., there exists a linear map such that and .
Below, we show the similar fact on extension of affine maps dominated by convex ones. A particular case of the result for real convex functions is found in [
22] (Th. 43A).
Theorem 3. Let be an affine map defined on a subspace and dominated by a convex map Φ defined on the space L, both with values in the ordered space M. Then, there exists at least one affine extension of to whole L, dominated by Φ; i.e., there exists an affine map such that and .
Proof. We will adopt a modified version of the proof of the Hahn–Banach theorem based on the Kuratowski–Zorn lemma, which uses the preliminary version of the theorem obtained by E. Helly (1912). More on this can be found in [
23].
Let
be the collection of all pairs
where
is a linear subspace of
L including
and
B is an affine map on
, which is an extension of
dominated by the convex map
. The set
is non-empty because
, so we can define the partial order relation ≺ in
as follows
For any
totally ordered subset of
, let
be the sum of these
such that
, and for
we define
as equal to
if only
. Clearly,
is a subspace of
L including
and
is an affine map on
, being an extension of
dominated by
. Thus,
and
for any
; i.e.,
is an upper bound for
. By the Kuratowski–Zorn lemma, there is a maximal element in
, e.g.,
.
If we show that
, then
will be the affine extension of
dominated by the convex map
as required. To accomplish this, let us assume, on the contrary, that
, so there exists
. Then, for any
and
, we have
Hence,
Define subsets of the space
M as follows
under our settings, there exist
; selecting
leads us to the estimates
Let
Y be a subspace of
L defined as follows:
Obviously,
, but
. Since
, the representation
is unique, so we can define the affine map by
Note that
B is an extension of
A and also of
. Now, we shall show that
B is dominated by
; i.e.,
The case
is evident. If
, utilizing the second estimate of (
6), we obtain
In case
, we use the first estimate of (
6)
Finally, the pair
,
and
denies that
is a maximal element in
. □
5. A Vectorial Version of Jensen–Jessen Inequality
Given a non-empty set E and the ordered real vector space , where the partial order ⪯ is defined by the pointed convex cone C, as the real linear space L, we consider here a function space that consists of some vector-valued functions and meeting the requirements
- (L1):
the constant functions where is fixed, belonging to L;
- (L2):
for any linear map and .
Moreover, we assume that L is endowed with the preorder defined by the convex pointed cone of all positive functions from L onto M; that is, by , but there is no need to use an explicit symbol to denote it.
We are interested in linear maps having the properties
- (A1):
, meaning that for all being constant functions;
- (A2):
; i.e., A is positive, or, equivalently, isotonic;
- (A3):
for any linear map .
Then, we say that
is a
vectorial mean defined on
L with values in
M. Let us recall, in case of
, we speak about linear means [
24] (p. 47).
It is evident that the collection of vectorial means defined on L with values in M constitutes a convex subset in the space L, and there exist examples of such means.
Example 4. According to our previous assumptions and notations, set with the componentwise order and let L be a linear space of maps meeting (L1)–(L2). Then, any has the form , where
Let a be a fixed linear mean acting on the linear space consisting of all real functions , where ; i.e., a is a positive linear functional preserving real constants. We define .
The reader will easily notice A is linear and fulfils (A1)–(A2). Since any linear map is represented by matrix , by linearity of a, we haveThus, (A3) also holds, so A is vectorial mean on L with values in . In the below theorem, we extend Jessen’s inequality [
24] (Th. 2.4) from linear means to vectorial ones.
Theorem 5. Let L satisfy properties (L1)–(L2) on a non-empty set E and let A be a vectorial mean defined on L with values in M.
If is a convex map, then, for all such that , the following inequality holds Proof. Fix
such that
and denote
. According to Theorem 4, let
be a support map of
at
, where
and
is linear. Thus,
and
. Setting
), we obtain
Note,
, by (L2). The assumption (A2) of isotonicity for
A and further (A1) and (A3) provide the estimates
which finishes the proof. □
The next theorem provides a counterpart of McShane’s result [
24] (Th. 2.5) for vectorial means.
Theorem 6. Let M be, in addition, a locally convex topological vector space and be a closed and convex subset.
If A is a vectorial mean on L with values in M, then, for any , Proof. Assume . Then, there exists a functional that separates and K; i.e., and . Fix and define . Observe that T is linear and . By our hypotheses, Since (A1)–(A3), Hence, , which contradicts our earlier assumption and finishes the proof. □
Example 5. Let the spaces , and the vectorial mean A be as in Example 4. Fix any convex function and let Note, Φ is a convex map regarding the componentwise order on .
Now, for such that , we obtain (7) according to Theorem 5. It is equivalent to the inequalitywhere a is a linear mean on and This is McShane’s result; see [24] (Th. 2.6). If is closed convex and , thenby Theorem 6. This is McShane’s result; as well, see [24] (Th. 2.5). 6. Summary
Research in applied sciences, such as natural sciences, health sciences, economics, physics, and engineering, is based on experiments. Let us agree that the goal of an experiment is to obtain a measurement or a series of measurements, depending on whether we observe one or many features. Knowledge about phenomena is obtained from multiple often planned repetitions of such experiments. The numerical data obtained in this way are elements of a certain vector space—the sample space—composed of numerical sequences, matrices, or, more generally, real functions.
The aim of the research is to develop and adapt a mathematical model to the data. To achieve this, it is necessary to obtain some information about the location, spatial extent, symmetry, and other geometric properties of the data set in that space. The researcher accomplishes this using various characteristics and measures. The basic such characteristics are means, which index the location of a data set in a space, indicating one specific point in that space.
They meet various requirements. A mean calculated for generally larger data is greater with respect to some fixed orderings (isotonicity). If we transform the data (by scaling or shifting), the mean is transformed in the same way (linearity). In the case of data that do not exhibit variability and take a common constant value, the mean value also takes that constant value. Roughly speaking, the mean is the point contained in the convex hull of all the measurements. The mathematical abstraction of such characteristics can be linear means in the case of studying a single feature or vectorial means in the case of many features.
The formal definition of vectorial mean and the main results of the article are included in
Section 5 in the following theorems: Theorem 5, which presents a version of inequality (
1) for vectorial means and generalizes classic inequalities by Jensen [
1] and Jessen [
2]; and Theorem 6, which includes a counterpart of McShane’s result [
24] (Th. 2.5) for vectorial means and provides their important geometric property, stating that the average operation provides a point in the convex hull of all the averaged points.
There are other important results in the article without which proving the above theorems would not be possible: Theorem 3 on the extension of affine maps dominated by convex ones, being a multidimensional version of the Hahn–Banach theorem; and Theorem 4 on supports of convex maps. This last one is a key tool for proving inequality (
1) for vectorial means.
Ultimately, the author hopes that further research on vectorial means and Jensen–Jessen inequalities can be used to construct new characteristics of numerical data that will also describe other geometric aspects of the data set in the sample space.