1. Introduction
Fuzzy measures [
1], also known as capacities [
2], non-additive measures or monotone measures [
3], have shown to be a valuable tool for representing uncertainty, since they are able to cope with more general scenarios than probability measures do. Even though fuzzy measures have been successfully applied in a wide range of applications [
4], no theory analogous to mathematical statistics has emerged around them in the general case, due to the difficulty of defining statistical parameters with a clear interpretation when additivity is replaced by monotonicity.
A remarkable exception is the case of the so-called imprecise probabilities [
5,
6], characterized by upper and lower expectations that provide rich semantics and interpretability. Dempster–Shafer belief functions [
7,
8], for instance, can be formulated as special cases of imprecise probabilities.
The field of fuzzy probability and statistics [
9,
10,
11,
12,
13,
14] has received significant attention during the last two decades. The contributions in this field can be classified into two basic groups according to the underlying approach they follow [
15]. One of the groups include the methods that deal with the analysis of classical (non-fuzzy) data using methods based on fuzzy set theory, while the other group focuses on analyzing fuzzy data using statistical methods. In this context, fuzzy data refers to data in which the values correspond to fuzzy numbers [
16], characterized by a membership function that returns a value between 0 and 1 indicating to which extent a given real number matches a given fuzzy number.
Examples within the first group include fuzzy clustering [
17], fuzzy linear regression [
18], testing fuzzy hypothesis from non-fuzzy data [
19], fuzzy statistical quality control [
20], time series forecasting based on fuzzy logic [
21] and making statistical decisions with fuzzy utilities [
22].
The second group includes methods for maximum likelihood estimation from fuzzy data [
23], classification when data are labeled with Dempster–Shafer belief functions [
24], distance-based statistical analysis [
25], statistical hypothesis testing from fuzzy data [
26], principal component analysis [
27], discriminant analysis [
28] and clustering [
29].
In this paper, we are interested in the definition of statistical parameters when the uncertainty is represented by a general fuzzy measure. More precisely, our starting point is a measurable space and a measurable real-valued function defined on the reference set of the space. We also assume that the measurable space is endowed with a fuzzy measure, and we will study the definition of statistical parameters over the measurable function, in a similar way as statistical parameters over a random variable can be defined from a probability measure. In this way, we attempt to handle more general scenarios than the ones covered by probability measures. To achieve this, we rely on the concept of monotone expectation [
30]. We consider the case of marginal spaces as well as product spaces, and take advantage of recent advances in the construction of fuzzy measures over product spaces [
31]. Our study is restricted to discrete reference sets.
The rest of the paper is organized as follows.
Section 2 establishes the basic notation and definitions, and highlights the fundamental properties of product measures that are used throughout the paper.
Section 3 contains the original contributions in this paper, in what concerns the definition of parameters in a marginal measurable space, while
Section 4 describes our proposals for product spaces. The paper ends with conclusions in
Section 5.
2. Preliminaries and Notation
Definition 1. [1] Let X be a set and be a non-empty class of subsets of X so that and . A function is a fuzzy measure
if: - 1.
.
- 2.
.
- 3.
such thatit holds that.
- 4.
If such that and , then
- 5.
If such that and , then
The triplet
is a
measurable space, and
X is called the
reference set. We will only work with finite reference sets [
4] in this paper. By default, we will assume that
is the power set of
X.
Example 1 (Modified from [
32]).
Imagine there is a vehicle covering the connection between the harbor and the railway station in a city. This vehicle has four compartments: one for a car, one for a van, one for a motor-bike and another one for a bike. Assume that the gas tank of this vehicle has exactly the capacity necessary to carry the vehicle, with the four compartments busy, from the harbor to the railway station. Then we can regard this capacity to be equal to 1 unit. In this example, , where c stands for car compartment busy
, v for van compartment busy
, m for motor-bike compartment busy
and b for bike compartment busy
. Assume also that the vehicle does not start the trip unless at least one of the compartments is busy. All the possible transportation situations are then the elements in ( stands for the power set of (X). In these conditions, for every , can be interpreted as the proportion of gas consumed if A happens. A possible specification of a fuzzy measure for this problem is as follows. Note how the fuzzy measure in Example 1 is non-additive. Therefore, the same information cannot be represented by a single probability distribution.
Every fuzzy measure over a reference set of cardinality
n can be characterized by
probability functions (not necessarily different) [
33], each one of them corresponding to one possible permutation of the reference set. Given a permutation
of the set of indices
, we will denote by
the ordering of the elements of
X according to permutation
, i.e.,
. When it is clear from the context, we will drop
from the subscripts and write
.
Definition 2. [33] Let be a measurable space. The probability function associated with
μ and is defined as the set such thatwhere . Definition 3. [33] Let be a measurable space and let be the probability function associated with μ and . The probability measure generated by
μ and is We will use for both the probability function and the probability measure when it is clear from the context.
We will consider measures over marginal spaces as well as product spaces resulting from composing the marginal spaces and , with , which is not the same as .
Of particular interest are the elements of a product class that can be obtained from sets in the marginal space. They are called rectangles and are formally defined as follows:
Definition 4. Let and be two spaces where and are classes defined on and , respectively. The class of rectangles
of is Our proposals in this paper will be based on the product measures described in [
31], which make use of the concept of triangular norm and conorm.
Definition 5. [34] An operator is a triangular norm
or t-norm
for short, if it satisfies the following conditions: - 1.
, for all . (Boundary conditions)
- 2.
. (Commutativity)
- 3.
If and , then . (Monotonicity)
- 4.
. (Associativity)
Definition 6. [34] An operator is a triangular conorm
or t-conorm
for short, if it satisfies the following properties: - 1.
, for all . (Boundary conditions)
- 2.
. (Commutativity)
- 3.
If and , then . (Monotonicity)
- 4.
. (Assocciativity)
The usual way of integrating real functions with respect to a fuzzy measure is by means of the so-called Choquet integral, which is a generalization of Lebesgue integral to monotone measures.
Definition 7. [2] Let be a measurable space, and let h be a measurable real function of X. The Choquet integral
of h with respect to μ iswhere and are the -cuts
of h, defined as If the reference set is finite, the integral can be expressed as
where
is an ordering such that
and the sets
are of the form
. Furthermore, if
h is non-negative, it can be computed as
where
is the probability function associated with the ordering
induced by
h (see Definition 2).
Given two measurable spaces and , the concept of product fuzzy measure is defined as follows.
Definition 8. [31] Aproduct fuzzy measure
of and is a function satisfying: - 1.
, .
- 2.
For all such that it holds that .
- 3.
For all , it holds that .
- 4.
For all , it holds that .
The next definitions particularize the concept of a fuzzy measure product, so that it is guaranteed to be compatible with the intuitive idea of independence, in the sense that if two fuzzy measures are independent, their fuzzy measure product should be possible to be obtained using exclusively the two original fuzzy measures.
Definition 9. [31] Let and be measurable spaces. and are⊙-independent fuzzy measures
if there exists a product fuzzy measure such that for any ,where and ⊙
is a t-norm. is called the ⊙-
independent product of and . Definition 10. [31] Let and be measurable spaces. The ⊙-exterior product measure
for any is defined aswhere ⊙
is a t-norm. Definition 11. [31] Let and be measurable spaces. The ⊙-interior product measure
for any is defined aswhere ⊙
is a t-norm. Both measures conform to lower and upper bounds for any ⊙-independent product fuzzy measure.
Proposition 1. [31] Let and be measurable spaces. Given any ⊙-
independent product of and , it holds that for all , Note that, for the particular case of the class
, both measures coincide [
31], i.e., for all
,
Product fuzzy measures can also be defined in terms of the associated probability measures [
31].
Definition 12. [31] Let and be measurable spaces and and be the probability functions associated with and , respectively. The lower product p-measure
is defined asfor all , where ⊗ is the standard probabilistic product, i.e., . Definition 13. [31] Given the conditions in Definition 12, the upper product p-measure
is defined aswhere ⊗
is the standard probabilistic product. 3. Parameters over One Measurable Space
In this section we propose statistical parameters aimed to characterize the behavior of functions defined on a measurable space endowed with a fuzzy measure. We will separately address the case of analyzing a single function and the case of simultaneously analyzing two functions.
3.1. The Case of Only One Function
Our proposals rely on the extension of the concept of mathematical expectation associated with probability measures, to the more general case of fuzzy measures. Consider a measurable space
where
is a fuzzy measure, and the class
of all the additive measures over
X. One way to extend the concept of mathematical expectation [
5,
35] is based on defining the set
of all the probability measures that dominate the fuzzy measure
.
Since all the elements in
are additive measures, the expectation of a function
h with respect to a fuzzy measure
can be defined as
where
is the mathematical expectation of
h with respect to the probability measure
P.
The problem of this definition is that it is not always well defined, since there can exist a fuzzy measure for which . It happens, for instance, when the sum of the fuzzy measure over the unitary subsets of X is greater than 1, as it is not possible to find a probability measure bounding from above.
A class of fuzzy measures that are compatible with the definition of expectation in Equation (
16) are those that conform a lower envelope of a set of probability measures [
6], i.e.,
, because in that case
.
A more general definition of expectation, based on Choquet integral [
2], was given in [
30] with the aim of extending the probabilistic concept of expectation to non-additive settings.
Definition 14. [30] Let be a measurable space and let h be a non-negative, real valued measurable function of X. The monotone expectation of h with respect to the fuzzy measure μ is defined as Since a fuzzy measure can always be characterized by a set of probability measures, it is clear from Definition 14 and Equation (
7) that the monotone expectation is equal to the mathematical expectation obtained with the probability function associated with the fuzzy measure
and the ordering induced by the function
h (see Definition 2), i.e.,
where
denotes the probability function associated with
and the ordering induced by
h. In the particular case of considering a finite reference set, the monotone expectation can be expressed as
The relation between the monotone expectation and the mathematical expectation is also illustrated in Proposition 2.
Proposition 2. [30] Let be a measurable space and let be the set of all the probability functions associated with the fuzzy measure μ. Then, for any non-negative real valued, measurable function h of X it holds that 3.1.1. Monotone Variance
In the same way as the monotone expectation extends in a natural way the concept of mathematical expectation to non-additive measures, we will pursue the extension of other statistical parameters in a similar way.
We will start off considering the extension of the concept of variance to a non-monotone context. A direct approach is to define an extension of the variance using Choquet integral, as in the case of the monotone expectation, which yields
However, the definition of variance in Equation (
21) is problematic, since the distribution associated with
and the ordering induced by
h is not, in general, the same as the one induced by
. The reason is that functions
h and
are not comonotone, and therefore they may induce different orderings of the reference set. Hence, the monotone variance defined in this way could not be considered as a measure of dispersion with respect to the monotone expectation, as the underlying probability distribution can be different (see Definition 2).
Taking this into account, we propose a definition of monotone variance that preserves the underlying probability measure associated with and the ordering induced by h.
Definition 15. Let be a measurable space and let h be a non-negative real valued measurable function of X. We define the monotone variance of h with respect to the fuzzy measure μ aswhere is the probability function associated with μ and the ordering induced by h. It is clear from the definition that and that it is equal to the traditional variance when is a probability measure.
Example 2. Consider the fuzzy measure over the reference set and its associated probability distributions in Table 1, and the function h defined as and . The ordering of X induced by h is thus , i.e., the ordering induced by permutation , which corresponds to the probability distribution . Therefore, according to Equation (22), the monotone variance of h is just the variance of h computed using probability distribution , resulting in Our definition of monotone variance preserves some properties of the traditional variance, likewise the monotone expectation preserves some properties of the mathematical expectation. In particular, the result in Theorem 1 is of practical value as it simplifies the calculation, and it is also of interest because it links the concepts of monotone variance and monotone expectation.
Theorem 1. Let be a measurable space and let h be a non-negative real valued measurable function of X, then it holds that Proof. According to Equation (
22),
i.e., the variance of
h computed according to probability distribution
, which can be calculated as
and thus
The functions
h and
are comonotone, and therefore they induce the same ordering of the reference set and hence yield the same associated probability distribution (see Definition 2). Thus, it holds that
and therefore,
In addition, according to Equation (
18),
and
. Now, replacing
by
and
by
in Equation (
24) we obtain Equation (
23). ☐
Example 3. As a continuation of Example 2, we will compute using Equation (23). The next result shows that the monotone variance behaves in a similar way as traditional variance in relation to affine transformations.
Proposition 3. Assume the conditions in Theorem 1 and let t be a function defined as with and . It holds that Proof. First, we have to show that
t and
h are comonotone, i.e., that for all
,
and
have the same sign:
since
. Therefore, the probability distribution associated with the measure
is the same for both functions, i.e.,
and thus
□
The next results analyze when the monotone variance is equal to 0.
Theorem 2. Let be a measurable space and let h be a non-negative real valued measurable function of X. Let be the probability function associated with μ and h. Then, the following three conditions are equivalent:
- 1.
.
- 2.
such that and , , with .
- 3.
such that
where .
Proof. Let us assume without loss of generality that
Since , , and , there must be at least one such that .
Suppose that
and there exist two different
,
, such that
and
. Then it holds that
which means that
and
. However, according to the assumption in Equation (
26), it holds that
. Hence,
which means that
which is a contradiction with the assumption that
. Thus, there is only one
and furthermore,
.
Assume
such that
. Then,
and therefore
and
On the other hand, since , it follows that if and if .
It is straightforward from the definition of monotone variance. □
Corollary 1. If h is constant, then , for any fuzzy measure μ.
Example 4. Assume a function h defined on as , and , and an associated probability distribution such that , and . We will see how the monotone variance is equal to 0. However, first we need to calculate the monotone expectation. Now we will calculate the value of the measure μ over the sets , i.e., , and .
We can obtain the values of μ from using Definition 2. The result is Therefore, , and .
3.1.2. Monotone Moments
Following the same idea underlying the definition of monotone variance, we can extend the concepts of central and non-central moments from a probabilistic setting to a monotone one.
Definition 16. Let be a measurable space and let h be a non-negative real valued measurable function of X. We define the k-th non-central monotone moment of h with respect to μ as Note that Equation (
27) is well defined, since
h and
are comonotone, and therefore the corresponding probability function is the same for both of them, regardless of the value of
k.
The definition of central monotone moments is, however, more problematic. If we follow the same idea as in Definition 16, and define the central monotone moment as , we find the problem that functions h and are not comonotone, and that would mean that different underlying probability distributions would be used to compute and . We will therefore generalize the definition of monotone variance to values of , utilizing the probability function associated with and h.
Definition 17. Let be a measurable space and let h be a non-negative real valued measurable function of X. We define the k-th central monotone moment of h with respect to μ aswhere is the probability function associated with μ and h. The following result establishes the relation between central and non-central monotone moments.
Proposition 4. Let be a measurable space and let h be a non-negative real valued measurable function of X. It holds that Proof. Assume
.
□
3.2. The Case of Two Functions
In this section we approach the simultaneous analysis of two functions and over the same reference set, X. Our goal is to model the information that both functions have in common, or the way in which they interact with one another.
Generalizing the concept of covariance, for instance, by using , raises the problem that the underlying probability distribution used to compute the monotone expectation is not the one induced by nor by for the same fuzzy measure , and therefore it is not clear that this monotone covariance in fact measures the relationship between both functions at all. We will therefore explore a different approach, in which we will model the degree of similarity between and , by measuring the common region determined by both functions.
Definition 18. Let be a measurable space and let and be non-negative real valued measurable functions of X. We define the common expectation of and with respect to μ as The concept of common expectation is illustrated in
Figure 1. More precisely, the value of the common expectation of
and
is the measure, according to
, of the function under which the shaded area is.
Example 5. We want to obtain the global grade for two students out of the individual grades they obtained in four different courses . In the final grade we want to reflect if a student shows a good performance in the two scientific courses, , the humanistic ones, , or in the combination , corresponding to a social sciences profile. These criteria are encoded in the fuzzy measure in Table 2, while the grades obtained by both students (between 0 and 1) in each of the courses are shown in Table 3. The calculation of the respective monotone expectations and variances result inwhich are quite similar, while the common expectation is . The next proposition states the basic properties of the common expectation.
Proposition 5. Let be a measurable space and let and be non-negative real valued measurable functions of X. Then, satisfies the following properties:
- 1.
.
- 2.
.
- 3.
If , , then for any non-negative real valued measurable function h of X, .
- 4.
If , , then .
- 5.
.
Proof. It is straightforward from Equation (
30).
It follows from the facts that is a monotone functional and that is bounded from above by both and .
It is a direct consequence of the monotonicity of operator min and functional .
If , then , and therefore both expectations are the same.
If
then the minimum of both functions is the identically null function, which is known to be the only one that has null monotone expectation [
30].
□
The common expectation is not normalized, and therefore its value alone is not enough to determine if it can be regarded as high or low. For instance, in Example 5 we obtained , but that value does not tell us if it is high or low. However, it is clear that the common expectation can be bounded from above, since it is known that for any positive real numbers a and b, it holds that and the equality is reached only when . Hence, we can normalize the common expectation using these bounds, which yields three possible definitions of coefficients of concordance between and .
Definition 19. Let be a measurable space and let and be non-negative real valued measurable functions of X. We define the coefficients of concordance and between and with respect to μ as The next proposition shows the basic properties of the three concordance coefficients (when it is clear from the context, we will drop the measure and the functions, thus denoting by ).
Proposition 6. Assume the conditions in Definition 19. The coefficients of concordance satisfy the following conditions:
- 1.
, .
- 2.
.
- 3.
.
- 4.
iff and have empty intersection, i.e., .
- 5.
- 6.
If , with , then
Proof. Furthermore, since
is a monotone functional,
. According to property 2 in Proposition 5,
, and thus
On the other hand, since and are non-negative, so it is , which means that , and .
If , then and ; hence, the three coefficients are equal to 1.
From the proof of property 1, we know that
iff which, according to property 5 in Proposition 5, can only happen if .
If
, then
,
and
. Therefore,
If
with
, then
,
and
. Therefore,
□
Example 6. As a continuation of Example 5, we can use the data in Table 2 and Table 3 to compute the coefficients of concordance, obtaining Note how the three coefficients have low values, which is consistent with the data in the example, as in spite of the similar values for the monotone expectation and variance corresponding to both students, they have a clearly different profile, scientific in the case of and humanistic in the case of .
4. Parameters Defined over Product Spaces
In this section we explore scenarios where we have two measurable spaces each of them equipped with a different fuzzy measure. We will consider the definition of statistical parameters on the product space.
Likewise, in
Section 3, we will separately study the case of one or two real functions. In both cases, it is necessary to obtain a fuzzy measure over the product space. We will rely on the proposals in [
31] to obtain the product measures.
4.1. The Case of One Function
The methods proposed in [
31] for constructing fuzzy measures over product spaces, rather than single measures, usually yield a set of them, bounded by an upper and lower measure. Similarly, our proposals here will consist of intervals of parameters rather than a single one.
We will start defining the concept of joint expectation making use of the interior and exterior product measures (see Definitions 10 and 11).
Definition 20. Let and be measurable spaces, and , the ⊙-interior and ⊙-exterior product measures. We define the joint lower and upper ⊙-expectations as Proposition 7. Let and be measurable spaces, and , the ⊙-interior and ⊙-exterior product measures. It holds that Furthermore, if is any ⊙-independent product measure of and (see Definition 9), it also holds thatwhere . Proof. Note that
and
are monotone expectations, namely,
,
and
respectively. Therefore, Equations (
36) and (
37) are a direct consequence of the monotonicity of the monotone expectation and Proposition 1. □
The concept of joint ⊙-expectations is analogous to the concept of monotone expectation in a marginal space, with the difference that, in the case of the product space, the underlying fuzzy measure is not known, but instead we have an interval of measures bounded by the interior and exterior ⊙-product measures.
We can define joint expectations using other product measures, as the p-measures given in Definitions 12 and 13.
Definition 21. Let and be measurable spaces, and , the lower and upper product p-measures respectively. We define the lower and upper joint probabilistic expectations as Since we have a function defined over the product space and fuzzy measures defined over the marginal spaces, it is natural to define marginal expectations. We will utilize the concept of ⊕-marginal of a function [
31].
Definition 22. [31] Let h be a function defined on and taking values on . We define the ⊕-marginals
of h aswhere ⊕
is a t-conorm (see Definition 6), n is the cardinality of and m is the cardinality of . Definition 23. Let and be measurable spaces and let h be a function defined on and taking values on . We define the marginal ⊕-expectations aswhere are the ⊕-marginals of h. 4.2. The Case of Two Functions
We will now assume that we have two different functions, one for each marginal space, and define parameters that combine the information provided by the marginal spaces.
Definition 24. Let and be measurable spaces, and let , be functions defined on and respectively, taking values on . We define the upper and lower global expectation of and aswhere ☆ and ⊙ are arbitrary t-norms (see Definition 5), and and are the interior and exterior product measures of and respectively. The next proposition shows that both expectations coincide when ☆ is the min t-norm.
Proposition 8. Assume the conditions in Definition 24. If ☆
is the min t-norm, it holds that Proof. According to ([
31], Proposition 8), the
-cuts generated by
belong to
when ☆ is the min t-norm. Furthermore, Equation (
12) establishes that
for the elements of
, which proves the result. □
As a consequence of Proposition 8, when using the min t-norm we will just write for both and .
The global expectation is in fact an extension of the monotone expectation in the sense expressed by the next theorem.
Theorem 3. Let be a measurable space and let h be a function defined on X and taking values on . Consider the product space and let both ☆ and ⊙ be the min t-norm. Then, Proof. Assume
. Then
Without loss of generality, we can assume that
in which case the
-cuts generated by
are of the form
which are elements of the class
with their two projections being identical.
Since we are using the min t-norm to construct the product measure, it turns out that the measure in each
-cut of the product space is equal to the measure assigned by
to the
-cuts in the marginal space, and thus
□
Example 7. Consider a reference set and the function defined as . The function is displayed in Table 4. It can be seen how the diagonal contains the original values of h, and its α-cuts areand therefore Likewise for the common expectation, the global expectation is not normalized, but it can be easily normalized in the same way as we did for the common expectation case, as stated in the next definition.
Definition 25. Let and be measurable spaces and let and be functions defined on and respectively and taking values on . We define the global coefficients of concordance of and as Example 8. (Continuation of Example 5)
Using the data in Table 3 we can obtain the function , the values of which are given in Table 5. Using the fuzzy measure in Table 2, we find that and the global concordance coefficients are and . The value of the global expectation () is very close to the values of the monotone expectations for each student in Example 5 (0.61 and 0.65 respectively). It can be interpreted as the fact that the grades of both students are acceptable individually and also globally, which is reflected in high values of the global coefficients of concordance. Note how the global expectation is not detecting the fact that both students have different profiles (scientific and humanistic), while the common expectation detected this fact yielding a much lower value (0.25) resulting in lower values of the coefficients of concordance as well.
5. Conclusions
With the introduction of the concept of monotone variance, we have complemented the already known concept of monotone expectation. It can be regarded as a measure of dispersion with respect to a central position measure. We have also introduced the concepts of central and non-central monotone moments, that can serve as a vehicle to define further statistical parameters based on fuzzy measures as, for instance, shape measures. The potential application scope is certainly wide, as it covers non-additive scenarios like the ones described in the examples in this paper, and just to mention some of them, such scenarios can be found in Engineering and Social Sciences applications.
The common expectation and concordance coefficients can be interpreted as measures of match between the functions, and in that sense can provide information about to which extent one function explains the other one. A possible application of these concepts is the development of prediction models when the measures are not additive.
Thanks to the developments in [
31] we have been able to extend the concept of monotone expectation to product spaces, where, in addition, we have shown how to marginalize the information provided by a function over a product space using the marginal ⊕-expectations.
All the developments in this paper are restricted to finite reference sets. Even though it covers a wide variety of practical applications, it is worth exploring the formulation of the results obtained here to uncountable reference sets, which seems to be a promising research line. The first step in this direction would be the extension of the results in [
31] to continuous domains.