Statistical Parameters Based on Fuzzy Measures

Reche, Fernando; Morales, María; Salmerón, Antonio

doi:10.3390/math8112015

Open AccessArticle

Statistical Parameters Based on Fuzzy Measures

by

Fernando Reche

,

María Morales

^*

and

Antonio Salmerón

Department of Mathematics and Center for the Development and Transfer of Mathematical Research to Industry (CDTIME), University of Almería, 04120 Almería, Spain

^*

Author to whom correspondence should be addressed.

Mathematics 2020, 8(11), 2015; https://doi.org/10.3390/math8112015

Submission received: 21 October 2020 / Revised: 5 November 2020 / Accepted: 9 November 2020 / Published: 12 November 2020

(This article belongs to the Special Issue Fuzzy Sets, Fuzzy Logic and Their Applications 2020)

Download

Browse Figure

Versions Notes

Abstract

:

In this paper, we study the problem of defining statistical parameters when the uncertainty is expressed using a fuzzy measure. We extend the concept of monotone expectation in order to define a monotone variance and monotone moments. We also study parameters that allow the joint analysis of two functions defined over the same reference set. Finally, we propose some parameters over product spaces, considering the case in which a function over the product space is available and also the case in which such function is obtained by combining those in the marginal spaces.

Keywords:

monotone statistical parameters; fuzzy measures; monotone measures; product spaces; fuzzy statistics

1. Introduction

Fuzzy measures [1], also known as capacities [2], non-additive measures or monotone measures [3], have shown to be a valuable tool for representing uncertainty, since they are able to cope with more general scenarios than probability measures do. Even though fuzzy measures have been successfully applied in a wide range of applications [4], no theory analogous to mathematical statistics has emerged around them in the general case, due to the difficulty of defining statistical parameters with a clear interpretation when additivity is replaced by monotonicity.

A remarkable exception is the case of the so-called imprecise probabilities [5,6], characterized by upper and lower expectations that provide rich semantics and interpretability. Dempster–Shafer belief functions [7,8], for instance, can be formulated as special cases of imprecise probabilities.

The field of fuzzy probability and statistics [9,10,11,12,13,14] has received significant attention during the last two decades. The contributions in this field can be classified into two basic groups according to the underlying approach they follow [15]. One of the groups include the methods that deal with the analysis of classical (non-fuzzy) data using methods based on fuzzy set theory, while the other group focuses on analyzing fuzzy data using statistical methods. In this context, fuzzy data refers to data in which the values correspond to fuzzy numbers [16], characterized by a membership function that returns a value between 0 and 1 indicating to which extent a given real number matches a given fuzzy number.

Examples within the first group include fuzzy clustering [17], fuzzy linear regression [18], testing fuzzy hypothesis from non-fuzzy data [19], fuzzy statistical quality control [20], time series forecasting based on fuzzy logic [21] and making statistical decisions with fuzzy utilities [22].

The second group includes methods for maximum likelihood estimation from fuzzy data [23], classification when data are labeled with Dempster–Shafer belief functions [24], distance-based statistical analysis [25], statistical hypothesis testing from fuzzy data [26], principal component analysis [27], discriminant analysis [28] and clustering [29].

In this paper, we are interested in the definition of statistical parameters when the uncertainty is represented by a general fuzzy measure. More precisely, our starting point is a measurable space and a measurable real-valued function defined on the reference set of the space. We also assume that the measurable space is endowed with a fuzzy measure, and we will study the definition of statistical parameters over the measurable function, in a similar way as statistical parameters over a random variable can be defined from a probability measure. In this way, we attempt to handle more general scenarios than the ones covered by probability measures. To achieve this, we rely on the concept of monotone expectation [30]. We consider the case of marginal spaces as well as product spaces, and take advantage of recent advances in the construction of fuzzy measures over product spaces [31]. Our study is restricted to discrete reference sets.

The rest of the paper is organized as follows. Section 2 establishes the basic notation and definitions, and highlights the fundamental properties of product measures that are used throughout the paper. Section 3 contains the original contributions in this paper, in what concerns the definition of parameters in a marginal measurable space, while Section 4 describes our proposals for product spaces. The paper ends with conclusions in Section 5.

2. Preliminaries and Notation

Definition 1.

[1] Let X be a set and

A

be a non-empty class of subsets of X so that

X \subset A

and

\emptyset \subset A

. A function

μ : A ⟶ [0, 1]

is a fuzzy measure if:

1.: $μ (\emptyset) = 0$ .
2.: $μ (X) = 1$ .
3.: $\forall A, B \in A$ such that $A \subseteq B$ it holds that $μ (A) \leq μ (B)$ .
4.: If ${A_{n}}_{n \in N} \in A$ such that $A_{1} \subseteq A_{2} \subseteq \dots$ and $⋃_{n = 1}^{\infty} A_{n} \in A$ , then ${lim}_{n} μ (A_{n}) = μ (⋃_{n = 1}^{\infty} A_{n}) .$
5.: If ${A_{n}}_{n \in N} \in A$ such that $A_{1} \supseteq A_{2} \supseteq \dots$ and $⋂_{n = 1}^{\infty} A_{n} \in A$ , then ${lim}_{n} μ (A_{n}) = μ (⋂_{n = 1}^{\infty} A_{n}) .$

The triplet

(X, A, μ)

is a measurable space, and X is called the reference set. We will only work with finite reference sets [4] in this paper. By default, we will assume that

A

is the power set of X.

Example 1

(Modified from [32]). Imagine there is a vehicle covering the connection between the harbor and the railway station in a city. This vehicle has four compartments: one for a car, one for a van, one for a motor-bike and another one for a bike. Assume that the gas tank of this vehicle has exactly the capacity necessary to carry the vehicle, with the four compartments busy, from the harbor to the railway station. Then we can regard this capacity to be equal to 1 unit. In this example,

X = {c, v, m, b}

, where c stands for car compartment busy , v for van compartment busy, m for motor-bike compartment busy and b for bike compartment busy. Assume also that the vehicle does not start the trip unless at least one of the compartments is busy. All the possible transportation situations are then the elements in

A = P (X)

(

P (X)

stands for the power set of (X). In these conditions, for every

A \subseteq X

,

μ (A)

can be interpreted as the proportion of gas consumed if A happens. A possible specification of a fuzzy measure for this problem is as follows.

μ ({b}) = 0.1, μ ({v}) = 0.4, μ ({c}) = 0.3, μ ({m}) = 0.2,

μ ({c, v}) = 0.6, μ ({c, b}) = 0.35, μ ({c, m}) = 0.45,

μ ({b, v}) = 0.42, μ ({b, m}) = 0.21, μ ({v, m}) = 0.68,

μ ({c, v, b}) = 0.7, μ ({c, v, m}) = 0.75, μ ({c, b, m}) = 0.5, μ ({v, b, m}) = 0.69 .

Note how the fuzzy measure in Example 1 is non-additive. Therefore, the same information cannot be represented by a single probability distribution.

Every fuzzy measure over a reference set of cardinality n can be characterized by

n!

probability functions (not necessarily different) [33], each one of them corresponding to one possible permutation of the reference set. Given a permutation

σ

of the set of indices

{1, \dots, n}

, we will denote by

X^{σ}

the ordering of the elements of X according to permutation

σ

, i.e.,

X^{σ} = {x_{σ (1)}, \dots, x_{σ (n)}}

. When it is clear from the context, we will drop

σ

from the subscripts and write

X^{σ} = {x_{(1)}, \dots, x_{(n)}}

.

Definition 2.

[33] Let

(X, A, μ)

be a measurable space. The probability function associated with μ and

X^{σ}

is defined as the set

P_{σ} = {p_{σ} (x_{(1)}), \dots, p_{σ} (x_{(n)})}

such that

p_{σ} (x_{(i)}) = \{\begin{matrix} μ (A_{(i)}) - μ (A_{(i + 1)}) & i f i < n, \\ μ (x_{(n)}) & i f i = n, \end{matrix}

(1)

where

A_{(i)} = {x_{(i)}, \dots, x_{(n)}}

.

Definition 3.

[33] Let

(X, A, μ)

be a measurable space and let

P_{σ}

be the probability function associated with μ and

X^{σ}

. The probability measure generated by μ and

X^{σ}

is

P_{σ} (A) = \sum_{x \in A} p_{σ} (x), \forall A \in A .

(2)

We will use

P_{σ}

for both the probability function and the probability measure when it is clear from the context.

We will consider measures over marginal spaces

(X, A)

as well as product spaces

(X_{1} \times X_{2}, A_{X_{1} \times X_{2}})

resulting from composing the marginal spaces

(X_{1}, A_{X_{1}})

and

(X_{2}, A_{X_{2}})

, with

A_{X_{1} \times X_{2}} = P (X_{1} \times X_{2})

, which is not the same as

P (X_{1}) \times P (X_{2})

.

Of particular interest are the elements of a product class that can be obtained from sets in the marginal space. They are called rectangles and are formally defined as follows:

Definition 4.

Let

(X_{1}, A_{X_{1}})

and

(X_{2}, A_{X_{2}})

be two spaces where

A_{X_{1}}

and

A_{X_{2}}

are classes defined on

X_{1}

and

X_{2}

, respectively. The class of rectangles of

A_{X_{1} \times X_{2}}

is

R = {H \in A_{X_{1} \times X_{2}} | H = A \times B, w h e r e A \in A_{X_{1}}, B \in A_{X_{2}}} .

(3)

Our proposals in this paper will be based on the product measures described in [31], which make use of the concept of triangular norm and conorm.

Definition 5.

[34] An operator

T : {[0, 1]}^{2} ⟶ [0, 1]

is a triangular norm or t-norm for short, if it satisfies the following conditions:

1.: $T (0, a) = 0$ , $T (a, 1) = a$ for all $a \in [0, 1]$ . (Boundary conditions)
2.: $T (a, b) = T (b, a)$ . (Commutativity)
3.: If $a \leq c$ and $b \leq d$ , then $T (a, b) \leq T (c, d)$ . (Monotonicity)
4.: $T (T (a, b), c) = T (a, T (b, c))$ . (Associativity)

Definition 6.

[34] An operator

T : {[0, 1]}^{2} ⟶ [0, 1]

is a triangular conorm or t-conorm for short, if it satisfies the following properties:

1.: $S (1, a) = 1$ , $S (a, 0) = a$ for all $a \in [0, 1]$ . (Boundary conditions)
2.: $S (a, b) = S (b, a)$ . (Commutativity)
3.: If $a \leq c$ and $b \leq d$ , then $S (a, b) \leq S (c, d)$ . (Monotonicity)
4.: $S (S (a, b), c) = S (a, S (b, c))$ . (Assocciativity)

The usual way of integrating real functions with respect to a fuzzy measure is by means of the so-called Choquet integral, which is a generalization of Lebesgue integral to monotone measures.

Definition 7.

[2] Let

(X, A, μ)

be a measurable space, and let h be a measurable real function of X. The Choquet integral of h with respect to μ is

\begin{matrix} C \int_{A} h \circ μ = \int_{- \infty}^{0} (μ (H_{α} \cap A) - 1) d α + \int_{0}^{\infty} μ (H_{α} \cap A) d α \end{matrix}

(4)

where

A \in A

and

H_{α}

are the

α

-cuts of h, defined as

\begin{matrix} H_{α} = {x \in X / h (x) \geq α} . \end{matrix}

(5)

If the reference set is finite, the integral can be expressed as

C \int_{} h \circ μ = h (x_{(1)}) μ (A_{(1)}) + \sum_{i = 2}^{n} μ (A_{(i)}) [h (x_{(i)}) - h (x_{(i - 1)})],

(6)

where

X^{σ}

is an ordering such that

h (x_{(1)}) \leq h (x_{(2)}) \leq \dots \leq h (x_{(n)})

and the sets

A_{(i)}

are of the form

{x_{(i)}, x_{(i + 1)}, \dots, x_{(n)}}

. Furthermore, if h is non-negative, it can be computed as

C \int_{} h \circ μ = \sum_{i = 1}^{n} h (x_{(i)}) p_{σ} (x_{(i)}), p_{σ} \in P_{h},

(7)

where

P_{h}

is the probability function associated with the ordering

X^{σ}

induced by h (see Definition 2).

Given two measurable spaces

(X_{1}, A_{X_{1}}, μ_{1})

and

(X_{2}, A_{X_{2}}, μ_{2})

, the concept of product fuzzy measure is defined as follows.

Definition 8.

[31] Aproduct fuzzy measureof

μ_{1}

and

μ_{2}

is a function

μ_{12} : A_{X_{1} \times X_{2}} ⟶ [0, 1]

satisfying:

1.: $μ_{12} (\emptyset) = 0$ , $μ_{12} (X_{1} \times X_{2}) = 1$ .
2.: For all $A, B \in A_{X_{1} \times X_{2}}$ such that $A \subseteq B$ it holds that $μ_{12} (A) \leq μ_{12} (B)$ .
3.: For all $A \in A_{X_{1}}$ , it holds that $μ_{12} (A \times X_{2}) = μ_{1} (A)$ .
4.: For all $B \in A_{X_{2}}$ , it holds that $μ_{12} (X_{1} \times B) = μ_{2} (B)$ .

The next definitions particularize the concept of a fuzzy measure product, so that it is guaranteed to be compatible with the intuitive idea of independence, in the sense that if two fuzzy measures are independent, their fuzzy measure product should be possible to be obtained using exclusively the two original fuzzy measures.

Definition 9.

[31] Let

(X_{1}, A_{X_{1}}, μ_{1})

and

(X_{2}, A_{X_{2}}, μ_{2})

be measurable spaces.

μ_{1}

and

μ_{2}

are⊙-independent fuzzy measuresif there exists a product fuzzy measure

μ_{12}^{⊙}

such that for any

H \in R

,

μ_{12}^{⊙} (H) = μ_{1} (A) ⊙ μ_{2} (B),

(8)

where

H = A \times B

and ⊙ is a t-norm.

μ_{12}^{⊙}

is called the ⊙-independent product of

μ_{1}

and

μ_{2}

.

Definition 10.

[31] Let

(X_{1}, A_{X_{1}}, μ_{1})

and

(X_{2}, A_{X_{2}}, μ_{2})

be measurable spaces. The ⊙-exterior product measure for any

H \in A_{X_{1} \times X_{2}}

is defined as

{\bar{μ}}_{12}^{⊙} (H) = min_{A \times B \supseteq H} μ_{1} (A) ⊙ μ_{2} (B),

(9)

where ⊙ is a t-norm.

Definition 11.

[31] Let

(X_{1}, A_{X_{1}}, μ_{1})

and

(X_{2}, A_{X_{2}}, μ_{2})

be measurable spaces. The ⊙-interior product measure for any

H \in A_{X_{1} \times X_{2}}

is defined as

{\underset{̲}{μ}}_{12}^{⊙} (H) = max_{A \times B \subseteq H} μ_{1} (A) ⊙ μ_{2} (B),

(10)

where ⊙ is a t-norm.

Both measures conform to lower and upper bounds for any ⊙-independent product fuzzy measure.

Proposition 1.

[31] Let

(X_{1}, A_{X_{1}}, μ_{1})

and

(X_{2}, A_{X_{2}}, μ_{2})

be measurable spaces. Given any ⊙-independent product of

μ_{1}

and

μ_{2}

, it holds that for all

C \in A_{X_{1} \times X_{2}}

,

{\underset{̲}{μ}}_{12}^{⊙} (C) \leq μ_{12}^{⊙} (C) \leq {\bar{μ}}_{12}^{⊙} (C) .

(11)

Note that, for the particular case of the class

R

, both measures coincide [31], i.e., for all

H \in R

,

{\underset{̲}{μ}}_{12}^{⊙} (H) = μ_{12}^{⊙} (H) = {\bar{μ}}_{12}^{⊙} (H) .

(12)

Product fuzzy measures can also be defined in terms of the associated probability measures [31].

Definition 12.

[31] Let

(X_{1}, A_{X_{1}}, μ_{1})

and

(X_{2}, A_{X_{2}}, μ_{2})

be measurable spaces and

P_{σ_{1}}^{μ_{1}}

and

P_{σ_{2}}^{μ_{2}}

be the probability functions associated with

X_{1}^{σ_{1}}

and

X_{2}^{σ_{2}}

, respectively. The lower product p-measure is defined as

{\underset{̲}{m}}_{12} (C) = min_{σ_{1}, σ_{2}} [P_{σ_{1}}^{μ_{1}} \otimes P_{σ_{2}}^{μ_{2}} (C)],

(13)

for all

C \in A_{X_{1} \times X_{2}}

, where ⊗ is the standard probabilistic product, i.e.,

P_{σ_{1}}^{μ_{1}} \otimes P_{σ_{2}}^{μ_{2}} (C) = P_{σ_{1}}^{μ_{1}} (C) P_{σ_{2}}^{μ_{2}} (C)

.

Definition 13.

[31] Given the conditions in Definition 12, the upper product p-measure is defined as

{\bar{m}}_{12} (C) = max_{σ_{1}, σ_{2}} [P_{σ_{1}}^{μ_{1}} \otimes P_{σ_{2}}^{μ_{2}} (C)],

(14)

where ⊗ is the standard probabilistic product.

3. Parameters over One Measurable Space

In this section we propose statistical parameters aimed to characterize the behavior of functions defined on a measurable space endowed with a fuzzy measure. We will separately address the case of analyzing a single function and the case of simultaneously analyzing two functions.

3.1. The Case of Only One Function

Our proposals rely on the extension of the concept of mathematical expectation associated with probability measures, to the more general case of fuzzy measures. Consider a measurable space

(X, A, μ)

where

μ

is a fuzzy measure, and the class

P

of all the additive measures over X. One way to extend the concept of mathematical expectation [5,35] is based on defining the set

M_{P} (μ) = {P \in P | P (A) \geq μ (A), \forall A \in A}

(15)

of all the probability measures that dominate the fuzzy measure

μ

.

Since all the elements in

M_{P} (μ)

are additive measures, the expectation of a function h with respect to a fuzzy measure

μ

can be defined as

E_{μ} (h) = min_{P \in M_{P} (μ)} E_{P} (h),

(16)

where

E_{P} (h)

is the mathematical expectation of h with respect to the probability measure P.

The problem of this definition is that it is not always well defined, since there can exist a fuzzy measure

μ

for which

M_{P} (μ) = \emptyset

. It happens, for instance, when the sum of the fuzzy measure

μ

over the unitary subsets of X is greater than 1, as it is not possible to find a probability measure bounding

μ

from above.

A class of fuzzy measures that are compatible with the definition of expectation in Equation (16) are those that conform a lower envelope of a set of probability measures [6], i.e.,

μ (A) = min {P (A) | P \in M \subseteq P}

, because in that case

M_{P} (μ) \neq \emptyset

.

A more general definition of expectation, based on Choquet integral [2], was given in [30] with the aim of extending the probabilistic concept of expectation to non-additive settings.

Definition 14.

[30] Let

(X, A, μ)

be a measurable space and let h be a non-negative, real valued measurable function of X. The monotone expectation of h with respect to the fuzzy measure μ is defined as

E_{μ} (h) = C \int_{} h \circ μ .

(17)

Since a fuzzy measure can always be characterized by a set of probability measures, it is clear from Definition 14 and Equation (7) that the monotone expectation is equal to the mathematical expectation obtained with the probability function associated with the fuzzy measure

μ

and the ordering induced by the function h (see Definition 2), i.e.,

E_{μ} (h) = E_{P_{μ, h}} (h),

(18)

where

P_{μ, h}

denotes the probability function associated with

μ

and the ordering induced by h. In the particular case of considering a finite reference set, the monotone expectation can be expressed as

E_{μ} (h) = \sum_{i = 1}^{n} h (x_{(i)}) p_{σ} (x_{(i)}), p_{σ} \in P_{μ, h} .

(19)

The relation between the monotone expectation and the mathematical expectation is also illustrated in Proposition 2.

Proposition 2.

[30] Let

(X, A, μ)

be a measurable space and let

{P_{σ}, σ \in S_{n}}

be the set of all the probability functions associated with the fuzzy measure μ. Then, for any non-negative real valued, measurable function h of X it holds that

min_{σ} E_{P_{σ}} (h) \leq E_{μ} (h) \leq max_{σ} E_{P_{σ}} (h) .

(20)

3.1.1. Monotone Variance

In the same way as the monotone expectation extends in a natural way the concept of mathematical expectation to non-additive measures, we will pursue the extension of other statistical parameters in a similar way.

We will start off considering the extension of the concept of variance to a non-monotone context. A direct approach is to define an extension of the variance using Choquet integral, as in the case of the monotone expectation, which yields

{Var}_{μ} (h) = E_{μ} [{(h - E_{μ} (h))}^{2}] .

(21)

However, the definition of variance in Equation (21) is problematic, since the distribution associated with

μ

and the ordering induced by h is not, in general, the same as the one induced by

{(h - E_{μ} (h))}^{2}

. The reason is that functions h and

{(h - E_{μ} (h))}^{2}

are not comonotone, and therefore they may induce different orderings of the reference set. Hence, the monotone variance defined in this way could not be considered as a measure of dispersion with respect to the monotone expectation, as the underlying probability distribution can be different (see Definition 2).

Taking this into account, we propose a definition of monotone variance that preserves the underlying probability measure associated with

μ

and the ordering induced by h.

Definition 15.

Let

(X, A, μ)

be a measurable space and let h be a non-negative real valued measurable function of X. We define the monotone variance of h with respect to the fuzzy measure μ as

{V a r}_{μ} (h) = {V a r}_{P_{μ, h}} (h),

(22)

where

P_{μ, h}

is the probability function associated with μ and the ordering induced by h.

It is clear from the definition that

{Var}_{μ} (h) \geq 0

and that it is equal to the traditional variance when

μ

is a probability measure.

Example 2.

Consider the fuzzy measure over the reference set

X = {x_{1}, x_{2}, x_{3}}

and its associated probability distributions in Table 1, and the function h defined as

h (x_{1}) = 0.4, h (x_{2}) = 0.1

and

h (x_{3}) = 0.7

. The ordering of X induced by h is thus

(x_{2}, x_{1}, x_{3})

, i.e., the ordering induced by permutation

σ = (2, 1, 3)

, which corresponds to the probability distribution

P_{(2, 1, 3)}

. Therefore, according to Equation (22), the monotone variance of h is just the variance of h computed using probability distribution

P_{(2, 1, 3)}

, resulting in

{V a r}_{μ} (h) = 0.0621 .

Our definition of monotone variance preserves some properties of the traditional variance, likewise the monotone expectation preserves some properties of the mathematical expectation. In particular, the result in Theorem 1 is of practical value as it simplifies the calculation, and it is also of interest because it links the concepts of monotone variance and monotone expectation.

Theorem 1.

Let

(X, A, μ)

be a measurable space and let h be a non-negative real valued measurable function of X, then it holds that

{V a r}_{μ} (h) = E_{μ} (h^{2}) - E_{μ}^{2} (h) .

(23)

Proof.

According to Equation (22),

\begin{matrix} {Var}_{μ} (h) & = & {Var}_{P_{μ, h}} (h), \end{matrix}

i.e., the variance of h computed according to probability distribution

P_{μ, h}

, which can be calculated as

\begin{matrix} {Var}_{P_{μ, h}} (h) & = & E_{P_{μ, h}} (h^{2}) - {[E_{P_{μ, h}} (h)]}^{2}, \end{matrix}

and thus

\begin{matrix} {Var}_{μ} (h) & = & E_{P_{μ, h}} (h^{2}) - {[E_{P_{μ, h}} (h)]}^{2} . \end{matrix}

(24)

The functions h and

h^{2}

are comonotone, and therefore they induce the same ordering of the reference set and hence yield the same associated probability distribution (see Definition 2). Thus, it holds that

P_{μ, h} = P_{μ, h^{2}}

and therefore,

E_{P_{μ, h}} (h^{2}) = E_{P_{μ, h^{2}}} (h^{2}) .

In addition, according to Equation (18),

E_{μ} (h^{2}) = E_{P_{μ, h^{2}}} (h^{2}) = E_{P_{μ, h}} (h^{2})

and

E_{μ} (h) = E_{P_{μ, h}} (h)

. Now, replacing

E_{P_{μ, h}} (h^{2})

by

E_{μ} (h^{2})

and

E_{P_{μ, h}} (h)

by

E_{μ} (h)

in Equation (24) we obtain Equation (23). ☐

Example 3.

As a continuation of Example 2, we will compute

{V a r}_{μ} (h)

using Equation (23).

\begin{matrix} E_{μ} (h^{2}) & = & E_{P_{(2, 1, 3)}} (h^{2}) \\ = & 0.3 \cdot 0 . 4^{2} + 0.4 \cdot 0 . 1^{2} + 0.3 \cdot 0 . 7^{2} = 0.199 . \end{matrix}

\begin{matrix} E_{μ} (h) & = & E_{P_{(2, 1, 3)}} (h) \\ = & 0.3 \cdot 0.4 + 0.4 \cdot 0.1 + 0.3 \cdot 0.7 = 0.37 . \end{matrix}

Hence,

{V a r}_{μ} (h) = 0.199 - 0 . 37^{2} = 0.0621 .

The next result shows that the monotone variance behaves in a similar way as traditional variance in relation to affine transformations.

Proposition 3.

Assume the conditions in Theorem 1 and let t be a function defined as

t = a h + b

with

a \in R_{0}^{+}

and

b \in R

. It holds that

{V a r}_{μ} (t) = a^{2} {V a r}_{μ} (h) .

(25)

Proof.

First, we have to show that t and h are comonotone, i.e., that for all

x, y \in X

,

(h (x) - h (y))

and

(t (x) - t (y))

have the same sign:

\begin{matrix} (h (x) - h (y)) (t (x) - t (y)) = (h (x) - h (y)) (a h (x) + b - a h (y) - b) = a {(h (x) - h (y))}^{2} \geq 0, \end{matrix}

since

a \in R_{0}^{+}

. Therefore, the probability distribution associated with the measure

μ

is the same for both functions, i.e.,

P_{μ, t} = P_{μ, h}

and thus

\begin{matrix} {Var}_{μ} (t) = {Var}_{P_{μ, t}} (t) = {Var}_{P_{μ, h}} (t) = a^{2} {Var}_{P_{μ, h}} (h) = a^{2} {Var}_{μ} (h) . \end{matrix}

□

The next results analyze when the monotone variance is equal to 0.

Theorem 2.

Let

(X, A, μ)

be a measurable space and let h be a non-negative real valued measurable function of X. Let

P_{μ, h}

be the probability function associated with μ and h. Then, the following three conditions are equivalent:

1.: ${V a r}_{μ} (h) = 0$ .
2.: $\exists! i$ $(1 \leq i \leq n)$ such that $p_{σ} (x_{i}) = 1$ and $p_{σ} (x_{j}) = 0$ , $\forall j \neq i$ , with $p_{σ} \in P_{μ, h}$ .
3.: $\exists i$ $(1 \leq i \leq n)$ such that

$μ (H_{α_{j}}) = 1, \forall j \leq i and μ (H_{α_{j}}) = 0, \forall j > i$

where

H_{α_{i}} = {x \in X | h (x) \geq h (x_{i})}, i = 1, \dots, n

.

Proof.

Let us assume without loss of generality that

h (x_{1}) \leq h (x_{2}) \leq \dots \leq h (x_{n}) .

(26)

(1) ⟹ (2)

Since

p_{σ} (x_{i}) \geq 0

,

i = 1, \dots, n

, and

\sum_{i = 1}^{n} p_{σ} (x_{i}) = 1

, there must be at least one

i \in {1, \dots, n}

such that

p_{σ} (x_{i}) \neq 0

.

Suppose that

{Var}_{μ} (h) = 0

and there exist two different

j, k \in {1, \dots, n}

,

j < k

, such that

p_{σ} (x_{j}) \neq 0

and

p_{σ} (x_{k}) \neq 0

. Then it holds that

{Var}_{μ} (h) = {Var}_{p_{σ}} (h) = p_{σ} (x_{j}) {(h (x_{j}) - E_{μ} (h))}^{2} + p_{σ} (x_{k}) {(h (x_{k}) - E_{μ} (h))}^{2} = 0,

which means that

h (x_{j}) = E_{μ} (h)

and

h (x_{k}) = E_{μ} (h)

. However, according to the assumption in Equation (26), it holds that

h (x_{j}) \leq h (x_{j + 1}) \leq \dots \leq h (x_{k - 1}) \leq h (x_{k})

. Hence,

E_{μ} (h) = h (x_{j}) \leq h (x_{j + 1}) \leq \dots \leq h (x_{k - 1}) \leq h (x_{k}) = E_{μ} (h),

which means that

\begin{matrix} h (x_{j}) = h (x_{j + 1}) = \dots = h (x_{k}) = E_{μ} (h) & \Rightarrow & H_{α_{j}} = H_{α_{j + 1}} = \dots = H_{α_{k}} \\ \Rightarrow & p_{σ} (x_{j}) = p_{σ} (x_{j + 1}) = \dots = p_{σ} (x_{k - 1}) = 0, \end{matrix}

which is a contradiction with the assumption that

p_{σ} (x_{j}) \neq 0

. Thus, there is only one

p_{σ} (x_{i}) \neq 0

and furthermore,

p_{σ} (x_{i}) = 1

.

(2) ⟹ (3)

Assume

\exists! i

such that

p_{σ} (x_{i}) \neq 0

. Then,

p_{σ} (x_{1}) = p_{σ} (x_{2}) = \dots = p_{σ} (x_{i - 1}) = p_{σ} (x_{i + 1}) = \dots = p_{σ} (x_{n}) = 0

and therefore

μ (H_{α_{1}}) = μ (H_{α_{2}}) = \dots = μ (H_{α_{i}})

and

μ (H_{α_{i + 1}}) = μ (H_{α_{i + 2}}) = \dots = μ (H_{α_{n}}) .

On the other hand, since

μ (H_{α_{i}}) - μ (H_{α_{i + 1}}) = 1

, it follows that

μ (H_{α_{j}}) = 1

if

j \leq i

and

μ (H_{α_{j}}) = 0

if

j > i

.

(3) ⟹ (1)

It is straightforward from the definition of monotone variance. □

Corollary 1.

If h is constant, then

{V a r}_{μ} (h) = 0

, for any fuzzy measure μ.

Example 4.

Assume a function h defined on

X = {x_{1}, x_{2}, x_{3}}

as

h (x_{1}) = 0.4

,

h (x_{2}) = 0.1

and

h (x_{3}) = 0.7

, and an associated probability distribution

p_{σ}

such that

p_{σ} (x_{1}) = 1

,

p_{σ} (x_{2}) = 0

and

p_{σ} (x_{3}) = 0

. We will see how the monotone variance is equal to 0. However, first we need to calculate the monotone expectation.

E_{μ} (h) = 1 \cdot 0.4 + 0 \cdot 0.1 + 0 \cdot 0.7 = 0.4 .

Thus,

{V a r}_{μ} (h) = 1 \cdot {(0.4 - 0.4)}^{2} + 0 \cdot {(0.1 - 0.4)}^{2} + 0 \cdot {(0.7 - 0.4)}^{2} = 0 .

Now we will calculate the value of the measure μ over the sets

H_{α_{i}} = {x \in X | h (x) \geq h (x_{i})}, i = 1, 2, 3

, i.e.,

H_{α_{1}} = {x_{1}, x_{3}}

,

H_{α_{2}} = {x_{1}, x_{2}, x_{3}}

and

H_{α_{3}} = {x_{3}}

.

We can obtain the values of μ from

p_{σ}

using Definition 2. The result is

\begin{matrix} μ ({x_{3}}) & = & p_{σ} (x_{3}) = 0, \\ μ ({x_{1}, x_{3}}) & = & p_{σ} (x_{1}) + μ ({x_{3}}) = 1 + 0 = 1, \\ μ ({x_{1}, x_{2}, x_{3}}) & = & p_{σ} (x_{2}) + μ ({x_{1}, x_{3}}) = 0 + 1 = 1 . \end{matrix}

Therefore,

μ (H_{α_{1}}) = 1

,

μ (H_{α_{2}}) = 1

and

μ (H_{α_{3}}) = 0

.

3.1.2. Monotone Moments

Following the same idea underlying the definition of monotone variance, we can extend the concepts of central and non-central moments from a probabilistic setting to a monotone one.

Definition 16.

Let

(X, A, μ)

be a measurable space and let h be a non-negative real valued measurable function of X. We define the k-th non-central monotone moment of h with respect to μ as

g_{μ}^{k} (h) = E_{μ} (h^{k}) .

(27)

Note that Equation (27) is well defined, since h and

h^{k}

are comonotone, and therefore the corresponding probability function is the same for both of them, regardless of the value of k.

The definition of central monotone moments is, however, more problematic. If we follow the same idea as in Definition 16, and define the central monotone moment as

E_{μ} {(h - E_{μ} (h))}^{k}

, we find the problem that functions h and

{(h - E_{μ} (h))}^{k}

are not comonotone, and that would mean that different underlying probability distributions would be used to compute

E_{μ} (h)

and

E_{μ} {(h - E_{μ} (h))}^{k}

. We will therefore generalize the definition of monotone variance to values of

k \neq 2

, utilizing the probability function associated with

μ

and h.

Definition 17.

Let

(X, A, μ)

be a measurable space and let h be a non-negative real valued measurable function of X. We define the k-th central monotone moment of h with respect to μ as

γ_{μ}^{k} (h) = E_{P_{μ, h}} [{(h - E_{μ} (h))}^{k}],

(28)

where

P_{μ, h}

is the probability function associated with μ and h.

The following result establishes the relation between central and non-central monotone moments.

Proposition 4.

Let

(X, A, μ)

be a measurable space and let h be a non-negative real valued measurable function of X. It holds that

γ_{μ}^{k} (h) = \sum_{j = 0}^{k} {(- 1)}^{j} (\binom{k}{j}) {[g_{μ} (h)]}^{j} g_{μ}^{k - j} (h) .

(29)

Proof.

Assume

X = {x_{1}, \dots, x_{n}}

.

\begin{matrix} γ_{μ}^{k} (h) & = & E_{P_{μ, h}} [{(h - E_{μ} (h))}^{k}] = \sum_{i = 1}^{n} {(h (x_{i}) - E_{μ} (h))}^{k} P_{μ, h} (x_{i}) \\ = & \sum_{i = 1}^{n} \sum_{j = 0}^{k} {(- 1)}^{j} (\binom{k}{j}) E_{μ}^{j} (h) h^{k - j} (x_{i}) P_{μ, h} (x_{i}) \\ = & \sum_{j = 0}^{k} {(- 1)}^{j} (\binom{k}{j}) E_{μ}^{j} (h) \sum_{i = 1}^{n} h^{k - j} (x_{i}) P_{μ, h} (x_{i}) \\ = & \sum_{j = 0}^{k} {(- 1)}^{j} (\binom{k}{j}) E_{μ}^{j} (h) E_{μ} (h^{k - j}) = \sum_{j = 0}^{k} {(- 1)}^{j} (\binom{k}{j}) {[g_{μ} (h)]}^{j} g_{μ}^{k - j} (h) . \end{matrix}

□

3.2. The Case of Two Functions

In this section we approach the simultaneous analysis of two functions

h_{1}

and

h_{2}

over the same reference set, X. Our goal is to model the information that both functions have in common, or the way in which they interact with one another.

Generalizing the concept of covariance, for instance, by using

E_{μ} [(h_{1} - E_{μ} (h_{1})) (h_{2} - E_{μ} (h_{2}))]

, raises the problem that the underlying probability distribution used to compute the monotone expectation is not the one induced by

h_{1}

nor by

h_{2}

for the same fuzzy measure

μ

, and therefore it is not clear that this monotone covariance in fact measures the relationship between both functions at all. We will therefore explore a different approach, in which we will model the degree of similarity between

h_{1}

and

h_{2}

, by measuring the common region determined by both functions.

Definition 18.

Let

(X, A, μ)

be a measurable space and let

h_{1}

and

h_{2}

be non-negative real valued measurable functions of X. We define the common expectation of

h_{1}

and

h_{2}

with respect to μ as

ψ_{μ} (h_{1}, h_{2}) = E_{μ} [min {h_{1}, h_{2}}] .

(30)

The concept of common expectation is illustrated in Figure 1. More precisely, the value of the common expectation of

h_{1}

and

h_{2}

is the measure, according to

μ

, of the function under which the shaded area is.

Example 5.

We want to obtain the global grade for two students out of the individual grades they obtained in four different courses

{x_{1}, x_{2}, x_{3}, x_{4}}

. In the final grade we want to reflect if a student shows a good performance in the two scientific courses,

{x_{1}, x_{2}}

, the humanistic ones,

{x_{3}, x_{4}}

, or in the combination

{x_{2}, x_{3}}

, corresponding to a social sciences profile. These criteria are encoded in the fuzzy measure in Table 2, while the grades obtained by both students (between 0 and 1) in each of the courses are shown in Table 3.

The calculation of the respective monotone expectations and variances result in

\begin{matrix} E_{μ} (h_{1}) & = 0.61, & E_{μ} (h_{2}) & = 0.65, \\ {V a r}_{μ} (h_{1}) & = 0.0769, & {V a r}_{μ} (h_{2}) & = 0.0765, \end{matrix}

which are quite similar, while the common expectation is

ψ_{μ} (h_{1}, h_{2}) = 0.25

.

The next proposition states the basic properties of the common expectation.

Proposition 5.

Let

(X, A, μ)

be a measurable space and let

h_{1}

and

h_{2}

be non-negative real valued measurable functions of X. Then,

ψ_{μ}

satisfies the following properties:

1.: $ψ_{μ} (h_{1}, h_{2}) = ψ_{μ} (h_{2}, h_{1})$ .
2.: $ψ_{μ} (h_{1}, h_{2}) \leq min {E_{μ} (h_{1}), E_{μ} (h_{2})}$ .
3.: If $\forall x \in X$ , $h_{1} (x) \leq h_{2} (x)$ , then for any non-negative real valued measurable function h of X, $ψ_{μ} (h_{1}, h) \leq ψ_{μ} (h_{2}, h)$ .
4.: If $\forall x \in X$ , $h_{1} (x) \leq h_{2} (x)$ , then $ψ_{μ} (h_{1}, h_{2}) = E_{μ} (h_{1})$ .
5.: $ψ_{μ} (h_{1}, h_{2}) = 0 \Leftrightarrow {x \in X | h_{1} (x) > 0} \cap {x \in X | h_{2} (x) > 0} = \emptyset$ .

Proof.

It is straightforward from Equation (30).
It follows from the facts that $E_{μ}$ is a monotone functional and that $min {h_{1}, h_{2}}$ is bounded from above by both $h_{1}$ and $h_{2}$ .
It is a direct consequence of the monotonicity of operator min and functional $E_{μ}$ .
If $h_{1} \leq h_{2}$ , then $min {h_{1}, h_{2}} = h_{1}$ , and therefore both expectations are the same.
If ${x \in X | h_{1} (x) > 0} \cap {x \in X | h_{2} (x) > 0} = \emptyset$ then the minimum of both functions is the identically null function, which is known to be the only one that has null monotone expectation [30].

□

The common expectation is not normalized, and therefore its value alone is not enough to determine if it can be regarded as high or low. For instance, in Example 5 we obtained

ψ_{μ} (h_{1}, h_{2}) = 0.25

, but that value does not tell us if it is high or low. However, it is clear that the common expectation can be bounded from above, since it is known that for any positive real numbers a and b, it holds that

min {a, b} \leq \sqrt{a \cdot b} \leq max {a, b}

and the equality is reached only when

a = b

. Hence, we can normalize the common expectation using these bounds, which yields three possible definitions of coefficients of concordance between

h_{1}

and

h_{2}

.

Definition 19.

Let

(X, A, μ)

be a measurable space and let

h_{1}

and

h_{2}

be non-negative real valued measurable functions of X. We define the coefficients of concordance

ρ_{1}, ρ_{2}

and

ρ_{3}

between

h_{1}

and

h_{2}

with respect to μ as

\begin{matrix} ρ_{1}^{μ} (h_{1}, h_{2}) & = & \frac{ψ_{μ} (h_{1}, h_{2})}{\sqrt{E_{μ} (h_{1}) E_{μ} (h_{2})}}, \end{matrix}

(31)

\begin{matrix} ρ_{2}^{μ} (h_{1}, h_{2}) & = & \frac{ψ_{μ} (h_{1}, h_{2})}{E_{μ} (max {h_{1}, h_{2}})}, \end{matrix}

(32)

\begin{matrix} ρ_{3}^{μ} (h_{1}, h_{2}) & = & \frac{ψ_{μ} (h_{1}, h_{2})}{min {E_{μ} (h_{1}), E_{μ} (h_{2})}} . \end{matrix}

(33)

The next proposition shows the basic properties of the three concordance coefficients (when it is clear from the context, we will drop the measure and the functions, thus denoting

ρ_{i}^{μ} (h_{1}, h_{2})

by

ρ_{i}

).

Proposition 6.

Assume the conditions in Definition 19. The coefficients of concordance satisfy the following conditions:

1.: $0 \leq ρ_{i} \leq 1$ , $i = 1, 2, 3$ .
2.: $h_{1} = h_{2} \Rightarrow ρ_{1} = ρ_{2} = ρ_{3} = 1$ .
3.: $ρ_{2} \leq ρ_{1} \leq ρ_{3}$ .
4.: $ρ_{1} = ρ_{2} = ρ_{3} = 0$ iff $h_{1}$ and $h_{2}$ have empty intersection, i.e., ${x \in X / h_{1} (x) > 0} \cap {x \in X / h_{2} (x) > 0} = \emptyset$ .
5.: If $h_{1} \leq h_{2}$ , then

$\begin{matrix} ρ_{1} & = & \sqrt{\frac{E_{μ} (h_{1})}{E_{μ} (h_{2})}}, \\ ρ_{2} & = & \frac{E_{μ} (h_{1})}{E_{μ} (h_{2})}, \\ ρ_{3} & = & 1 . \end{matrix}$
6.: If $h_{1} = k h_{2}$ , with $k > 1$ , then

$\begin{matrix} ρ_{1} & = & \frac{1}{\sqrt{k}}, \\ ρ_{2} & = & \frac{1}{k}, \\ ρ_{3} & = & 1 . \end{matrix}$

Proof.

It is clear that

$min {E_{μ} (h_{1}), E_{μ} (h_{2})} \leq \sqrt{E_{μ} (h_{1}) E_{μ} (h_{2})} \leq max {E_{μ} (h_{1}), E_{μ} (h_{2})} .$

Furthermore, since $E_{μ}$ is a monotone functional, $max {E_{μ} (h_{1}), E_{μ} (h_{2})} \leq E_{μ} (max {h_{1}, h_{2}})$ . According to property 2 in Proposition 5, $ψ_{μ} (h_{1}, h_{2}) \leq min {E_{μ} (h_{1}), E_{μ} (h_{2})}$ , and thus

$ψ_{μ} (h_{1}, h_{2}) \leq min {E_{μ} (h_{1}), E_{μ} (h_{2})} \leq \sqrt{E_{μ} (h_{1}) E_{μ} (h_{2})} \leq max {E_{μ} (h_{1}), E_{μ} (h_{2})} .$

Therefore

$\begin{matrix} ρ_{1}^{μ} (h_{1}, h_{2}) & = & \frac{ψ_{μ} (h_{1}, h_{2})}{\sqrt{E_{μ} (h_{1}) E_{μ} (h_{2})}} \leq 1, \\ ρ_{2}^{μ} (h_{1}, h_{2}) & = & \frac{ψ_{μ} (h_{1}, h_{2})}{E_{μ} (max {h_{1}, h_{2}})} \leq 1, \\ ρ_{3}^{μ} (h_{1}, h_{2}) & = & \frac{ψ_{μ} (h_{1}, h_{2})}{min {E_{μ} (h_{1}), E_{μ} (h_{2})}} \leq 1 . \end{matrix}$

On the other hand, since $h_{1}$ and $h_{2}$ are non-negative, so it is $ψ_{μ} (h_{1}, h_{2})$ , which means that $ρ_{1}^{μ} (h_{1}, h_{2}) \geq 0$ , $ρ_{2}^{μ} (h_{1}, h_{2}) \geq 0$ and $ρ_{3}^{μ} (h_{1}, h_{2}) \geq 0$ .
If $h_{1} = h_{2} = h$ , then $min {h_{1}, h_{2}} = max {h_{1}, h_{2}} = h$ and $ψ_{μ} (h_{1}, h_{2}) = E_{μ} (h)$ ; hence, the three coefficients are equal to 1.
From the proof of property 1, we know that

$min {E_{μ} (h_{1}), E_{μ} (h_{2})} \leq \sqrt{E_{μ} (h_{1}) E_{μ} (h_{2})} \leq E_{μ} (max {h_{1}, h_{2}}) \Rightarrow$

$\frac{ψ_{μ} (h_{1}, h_{2})}{E_{μ} (max {h_{1}, h_{2}})} \leq \frac{ψ_{μ} (h_{1}, h_{2})}{\sqrt{E_{μ} (h_{1}) E_{μ} (h_{2})}} \leq \frac{ψ_{μ} (h_{1}, h_{2})}{min {E_{μ} (h_{1}), E_{μ} (h_{2})}} \Rightarrow$

$ρ_{2} \leq ρ_{1} \leq ρ_{3} .$
$ρ_{1} = ρ_{2} = ρ_{3} = 0$ iff $ψ_{μ} (h_{1}, h_{2})$ which, according to property 5 in Proposition 5, can only happen if ${x \in X / h_{1} (x) > 0} \cap {x \in X / h_{2} (x) > 0} = \emptyset$ .
If $h_{1} \leq h_{2}$ , then $min {h_{1}, h_{2}} = h_{1}$ , $max {h_{1}, h_{2}} = h_{2}$ and $E_{μ} (h_{1}) \leq E_{μ} (h_{2})$ . Therefore,

$ρ_{1} = \frac{E_{μ} (h_{1})}{\sqrt{E_{μ} (h_{1}) E_{μ} (h_{2})}} = \sqrt{\frac{E_{μ} (h_{1})}{E_{μ} (h_{2})}}, ρ_{2} = \frac{E_{μ} (h_{1})}{E_{μ} (h_{2})} and ρ_{3} = \frac{E_{μ} (h_{1})}{min {E_{μ} (h_{1}), E_{μ} (h_{2})}} = 1 .$
If $h_{1} = k h_{2}$ with $k > 0$ , then $min {h_{1}, h_{2}} = h_{2}$ , $max {h_{1}, h_{2}} = k h_{2}$ and $E_{μ} (h_{1}) = k E_{μ} (h_{2})$ . Therefore,

$ρ_{1} = \frac{E_{μ} (h_{2})}{\sqrt{k E_{μ} (h_{2}) E_{μ} (h_{2})}} = \frac{1}{\sqrt{k}}, ρ_{2} = \frac{E_{μ} (h_{2})}{E_{μ} (k h_{2})} = \frac{1}{k} and ρ_{3} = \frac{E_{μ} (h_{2})}{min {k E_{μ} (h_{2}), E_{μ} (h_{2})}} = 1 .$

□

Example 6.

As a continuation of Example 5, we can use the data in Table 2 and Table 3 to compute the coefficients of concordance, obtaining

ρ_{1} (h_{1}, h_{2}) = 0.397, ρ_{2} (h_{1}, h_{2}) = 0.301, ρ_{3} (h_{1}, h_{2}) = 0.410 .

Note how the three coefficients have low values, which is consistent with the data in the example, as in spite of the similar values for the monotone expectation and variance corresponding to both students, they have a clearly different profile, scientific in the case of

h_{1}

and humanistic in the case of

h_{2}

.

4. Parameters Defined over Product Spaces

In this section we explore scenarios where we have two measurable spaces each of them equipped with a different fuzzy measure. We will consider the definition of statistical parameters on the product space.

Likewise, in Section 3, we will separately study the case of one or two real functions. In both cases, it is necessary to obtain a fuzzy measure over the product space. We will rely on the proposals in [31] to obtain the product measures.

4.1. The Case of One Function

The methods proposed in [31] for constructing fuzzy measures over product spaces, rather than single measures, usually yield a set of them, bounded by an upper and lower measure. Similarly, our proposals here will consist of intervals of parameters rather than a single one.

We will start defining the concept of joint expectation making use of the interior and exterior product measures (see Definitions 10 and 11).

Definition 20.

Let

(X_{1}, A_{X_{1}}, μ_{1})

and

(X_{2}, A_{X_{2}}, μ_{2})

be measurable spaces,

h : X_{1} \times X_{2} \to [0, 1]

and

{\underset{̲}{μ}}_{12}^{⊙}

,

{\bar{μ}}_{12}^{⊙}

the ⊙-interior and ⊙-exterior product measures. We define the joint lower and upper ⊙-expectations as

\begin{matrix} {\underset{̲}{E}}_{12}^{⊙} (h) & = & C \int h \circ {\underset{̲}{μ}}_{12}^{⊙} (l o w e r), \end{matrix}

(34)

\begin{matrix} {\bar{E}}_{12}^{⊙} (h) & = & C \int h \circ {\bar{μ}}_{12}^{⊙} (u p p e r) . \end{matrix}

(35)

Proposition 7.

Let

(X_{1}, A_{X_{1}}, μ_{1})

and

(X_{2}, A_{X_{2}}, μ_{2})

be measurable spaces,

h : X_{1} \times X_{2} \to [0, 1]

and

{\underset{̲}{μ}}_{12}^{⊙}

,

{\bar{μ}}_{12}^{⊙}

the ⊙-interior and ⊙-exterior product measures. It holds that

{\underset{̲}{E}}_{12}^{⊙} (h) \leq {\bar{E}}_{12}^{⊙} (h) .

(36)

Furthermore, if

μ_{12}^{⊙}

is any ⊙-independent product measure of

μ_{1}

and

μ_{2}

(see Definition 9), it also holds that

{\underset{̲}{E}}_{12}^{⊙} (h) \leq E_{12}^{⊙} (h) \leq {\bar{E}}_{12}^{⊙} (h),

(37)

where

E_{12}^{⊙} (h) = C \int h \circ μ_{12}^{⊙}

.

Proof.

Note that

{\underset{̲}{E}}_{12}^{⊙}, {\bar{E}}_{12}^{⊙}

and

E_{12}^{⊙}

are monotone expectations, namely,

E_{{\underset{̲}{μ}}_{12}^{⊙}}

,

E_{{\bar{μ}}_{12}^{⊙}}

and

E_{μ_{12}^{⊙}}

respectively. Therefore, Equations (36) and (37) are a direct consequence of the monotonicity of the monotone expectation and Proposition 1. □

The concept of joint ⊙-expectations is analogous to the concept of monotone expectation in a marginal space, with the difference that, in the case of the product space, the underlying fuzzy measure is not known, but instead we have an interval of measures bounded by the interior and exterior ⊙-product measures.

We can define joint expectations using other product measures, as the p-measures given in Definitions 12 and 13.

Definition 21.

Let

(X_{1}, A_{X_{1}}, μ_{1})

and

(X_{2}, A_{X_{2}}, μ_{2})

be measurable spaces,

h : X_{1} \times X_{2} \to [0, 1]

and

{\underset{̲}{m}}_{12}

,

{\bar{m}}_{12}

the lower and upper product p-measures respectively. We define the lower and upper joint probabilistic expectations as

\begin{matrix} E_{{\underset{̲}{m}}_{12}} (h) & = & C \int h \circ {\underset{̲}{m}}_{12} (l o w e r), \end{matrix}

(38)

\begin{matrix} E_{{\bar{m}}_{12}} (h) & = & C \int h \circ {\bar{m}}_{12} (u p p e r) . \end{matrix}

(39)

Since we have a function defined over the product space and fuzzy measures defined over the marginal spaces, it is natural to define marginal expectations. We will utilize the concept of ⊕-marginal of a function [31].

Definition 22.

[31] Let h be a function defined on

X_{1} \times X_{2}

and taking values on

[0, 1]

. We define the ⊕-marginals of h as

\begin{matrix} h_{X_{1}}^{\oplus} (x_{1 i}) & = & ⨁_{x_{2 j} \in X_{2}} h (x_{1 i}, x_{2 j}) = h (x_{1 i}, x_{21}) \oplus h (x_{1 i}, x_{22}) \oplus \dots \oplus h (x_{1 i}, x_{2 m}), \end{matrix}

(40)

\begin{matrix} h_{X_{2}}^{\oplus} (x_{2 j}) & = & ⨁_{x_{1 i} \in X_{1}} h (x_{1 i}, x_{2 j}) = h (x_{11}, x_{2 j}) \oplus h (x_{12}, x_{2 j}) \oplus \dots \oplus h (x_{1 n}, x_{2 j}), \end{matrix}

(41)

where ⊕ is a t-conorm (see Definition 6), n is the cardinality of

X_{1}

and m is the cardinality of

X_{2}

.

Definition 23.

Let

(X_{1}, A_{X_{1}}, μ_{1})

and

(X_{2}, A_{X_{2}}, μ_{2})

be measurable spaces and let h be a function defined on

X_{1} \times X_{2}

and taking values on

[0, 1]

. We define the marginal ⊕-expectations as

E_{X_{i}}^{\oplus} (h) = C \int h_{X_{i}}^{\oplus} \circ μ_{i}, i = 1, 2,

(42)

where

h_{X_{i}}^{\oplus}

are the ⊕-marginals of h.

4.2. The Case of Two Functions

We will now assume that we have two different functions, one for each marginal space, and define parameters that combine the information provided by the marginal spaces.

Definition 24.

Let

(X_{1}, A_{X_{1}}, μ_{1})

and

(X_{2}, A_{X_{2}}, μ_{2})

be measurable spaces, and let

h_{1}

,

h_{2}

be functions defined on

X_{1}

and

X_{2}

respectively, taking values on

[0, 1]

. We define the upper and lower global expectation of

h_{1}

and

h_{2}

as

\begin{matrix} {\underset{̲}{ϕ}}_{⊙}^{★} (h_{1}, h_{2}) & = & C \int h_{12}^{★} \circ {\underset{̲}{μ}}_{12}^{⊙} (l o w e r), \end{matrix}

(43)

\begin{matrix} {\bar{ϕ}}_{⊙}^{★} (h_{1}, h_{2}) & = & C \int h_{12}^{★} \circ {\bar{μ}}_{12}^{⊙} (u p p e r) . \end{matrix}

(44)

where ☆ and ⊙ are arbitrary t-norms (see Definition 5),

h_{12}^{★} (x_{1}, x_{2}) = h_{1} (x_{1}) ★ h_{2} (x_{2}), \forall (x_{1}, x_{2}) \in X_{1} \times X_{2}

and

{\underset{̲}{μ}}_{12}^{⊙}

and

{\bar{μ}}_{12}^{⊙}

are the interior and exterior product measures of

μ_{1}

and

μ_{2}

respectively.

The next proposition shows that both expectations coincide when ☆ is the min t-norm.

Proposition 8.

Assume the conditions in Definition 24. If ☆ is the min t-norm, it holds that

{\underset{̲}{ϕ}}_{⊙}^{★} (h_{1}, h_{2}) = {\bar{ϕ}}_{⊙}^{★} (h_{1}, h_{2}) .

(45)

Proof.

According to ([31], Proposition 8), the

α

-cuts generated by

h^{★}

belong to

R

when ☆ is the min t-norm. Furthermore, Equation (12) establishes that

{\underset{̲}{μ}}_{12}^{⊙} = {\bar{μ}}_{12}^{⊙}

for the elements of

R

, which proves the result. □

As a consequence of Proposition 8, when using the min t-norm we will just write

ϕ_{⊙}^{★}

for both

{\underset{̲}{ϕ}}_{⊙}^{★}

and

{\bar{ϕ}}_{⊙}^{★}

.

The global expectation is in fact an extension of the monotone expectation in the sense expressed by the next theorem.

Theorem 3.

Let

(X, A, μ)

be a measurable space and let h be a function defined on X and taking values on

[0, 1]

. Consider the product space

X \times X

and let both ☆ and ⊙ be the min t-norm. Then,

ϕ_{⊙}^{★} (h, h) = ϕ_{min}^{min} (h, h) = E_{μ} (h) .

(46)

Proof.

Assume

X = {x_{1}, x_{2}, \dots, x_{n}}

. Then

\begin{matrix} \forall (x_{1}, x_{2}) \in X \times X & \Rightarrow & h^{min} (x, y) = min {h (x_{1}), h (x_{2})} . \end{matrix}

Without loss of generality, we can assume that

h (x_{1}) < h (x_{2}) < \dots < h (x_{n}),

in which case the

α

-cuts generated by

h^{min}

are of the form

H_{α_{i}} = {(x_{k}, x_{l}) \in X \times X | k, l \geq i},

which are elements of the class

R

with their two projections being identical.

Since we are using the min t-norm to construct the product measure, it turns out that the measure in each

α

-cut of the product space is equal to the measure assigned by

μ

to the

α

-cuts in the marginal space, and thus

\begin{matrix} ϕ_{min}^{min} (h, h) & = & C \int h^{min} \circ μ_{12}^{min} = \sum_{i = 1}^{n \times n} μ_{12}^{min} (H_{α_{i}}) (α_{i} - α_{i - 1}) \\ = & \sum_{i = 1}^{n} [μ (H_{α_{i}}^{↓ X})] (α_{i} - α_{i - 1}) = E_{μ} (h) . \end{matrix}

□

Example 7.

Consider a reference set

X = {x_{1}, x_{2}, x_{3}}

and the function defined as

h (x_{1}) = 0.1, h (x_{2}) = 0, 4, h (x_{3}) = 0.7

. The function

h^{min}

is displayed in Table 4.

It can be seen how the diagonal contains the original values of h, and its α-cuts are

\begin{matrix} H_{0.1} & = X \times X, & μ_{12}^{min} (H_{0.1}) & = min {μ (X), μ (X)}, \\ H_{0.4} & = {x_{2}, x_{3}} \times {x_{2}, x_{3}}, & μ_{12}^{min} (H_{0.4}) & = min {μ ({x_{2}, x_{3}}), μ ({x_{2}, x_{3}})}, \\ H_{0.7} & = {x_{3}} \times {x_{3}}, & μ_{12}^{min} (H_{0.1}) & = min {μ ({x_{3}}), μ ({x_{3}})}, \end{matrix}

and therefore

\begin{matrix} ϕ_{min}^{min} (h, h) & = & μ_{12}^{min} (H_{0.1}) (0.1 - 0) + μ_{12}^{min} (H_{0.4}) (0.4 - 0.1) + μ_{12}^{min} (H_{0.7}) (0.7 - 0.4) \\ = & μ (X) (0.1 - 0) + μ ({x_{2}, x_{3}}) (0.4 - 0.1) + μ ({x_{3}}) (0.7 - 0.4) \\ = & E_{μ} (h) . \end{matrix}

Likewise for the common expectation, the global expectation is not normalized, but it can be easily normalized in the same way as we did for the common expectation case, as stated in the next definition.

Definition 25.

Let

(X_{1}, A_{X_{1}}, μ_{1})

and

(X_{2}, A_{X_{2}}, μ_{2})

be measurable spaces and let

h_{1}

and

h_{2}

be functions defined on

X_{1}

and

X_{2}

respectively and taking values on

[0, 1]

. We define the global coefficients of concordance of

h_{1}

and

h_{2}

as

\begin{matrix} Φ_{1} (h_{1}, h_{2}) & = & \frac{ϕ_{min}^{min} (h_{1}, h_{2})}{min {E_{μ_{1}} (h_{1}), E_{μ_{2}} (h_{2})}}, \end{matrix}

(47)

\begin{matrix} Φ_{2} (h_{1}, h_{2}) & = & \frac{ϕ_{min}^{min} (h_{1}, h_{2})}{\sqrt{E_{μ_{1}} (h_{1}) E_{μ_{2}} (h_{2})}} . \end{matrix}

(48)

Example 8.

(Continuation of Example 5)

Using the data in Table 3 we can obtain the function

h_{12}^{min}

, the values of which are given in Table 5.

Using the fuzzy measure in Table 2, we find that

ϕ_{min}^{min} = 0.6

and the global concordance coefficients are

Φ_{1} (h_{1}, h_{2}) = 0.953

and

Φ_{2} (h_{1}, h_{2}) = 0.984

.

The value of the global expectation (

0.6

) is very close to the values of the monotone expectations for each student in Example 5 (0.61 and 0.65 respectively). It can be interpreted as the fact that the grades of both students are acceptable individually and also globally, which is reflected in high values of the global coefficients of concordance. Note how the global expectation is not detecting the fact that both students have different profiles (scientific and humanistic), while the common expectation detected this fact yielding a much lower value (0.25) resulting in lower values of the coefficients of concordance as well.

5. Conclusions

With the introduction of the concept of monotone variance, we have complemented the already known concept of monotone expectation. It can be regarded as a measure of dispersion with respect to a central position measure. We have also introduced the concepts of central and non-central monotone moments, that can serve as a vehicle to define further statistical parameters based on fuzzy measures as, for instance, shape measures. The potential application scope is certainly wide, as it covers non-additive scenarios like the ones described in the examples in this paper, and just to mention some of them, such scenarios can be found in Engineering and Social Sciences applications.

The common expectation and concordance coefficients can be interpreted as measures of match between the functions, and in that sense can provide information about to which extent one function explains the other one. A possible application of these concepts is the development of prediction models when the measures are not additive.

Thanks to the developments in [31] we have been able to extend the concept of monotone expectation to product spaces, where, in addition, we have shown how to marginalize the information provided by a function over a product space using the marginal ⊕-expectations.

All the developments in this paper are restricted to finite reference sets. Even though it covers a wide variety of practical applications, it is worth exploring the formulation of the results obtained here to uncountable reference sets, which seems to be a promising research line. The first step in this direction would be the extension of the results in [31] to continuous domains.

Author Contributions

Investigation, F.R., M.M. and A.S.; writing—original draft, F.R., M.M. and A.S.; writing—review and editing, F.R., M.M. and A.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Spanish Ministry of Science and Innovation through grants TIN2016-77902-C3-3-P, PID2019-106758GB-C32 and by ERDF-FEDER funds.

Conflicts of Interest

The authors declare no conflict of interest.

References

Sugeno, M. Theory of Fuzzy Integrals and Its Applications. Ph.D. Thesis, Tokyo Institute of Technology, Tokyo, Japan, 1974. [Google Scholar]
Choquet, G. Theory of capacities. Annales de l′Institut Fourier 1954, 5, 131–295. [Google Scholar] [CrossRef] [Green Version]
Li, J. On null-continuity of monotone measures. Mathematics 2020, 8, 205. [Google Scholar] [CrossRef] [Green Version]
Beliakov, G.; James, S.; Wu, J. Discrete Fuzzy Measures; Studies in Fuzziness and Soft Computing; Springer: Berlin, Germany, 2020; Volume 382. [Google Scholar]
Walley, P. Statistical Reasoning with Imprecise Probabilities; Chapman and Hall: London, UK, 1991. [Google Scholar]
Walley, P. BI Statistical Methods. Volume I: Foundations; Prescience Press: New York, NY, USA, 2015. [Google Scholar]
Dempster, A.P. Upper and lower probabilities induced by a multivalued mapping. Ann. Math. Stat. 1967, 38, 325–339. [Google Scholar] [CrossRef]
Shafer, G. A Mathematical Theory of Evidence; Princeton University Press: Princeton, NJ, USA, 1976. [Google Scholar]
Buckley, J. Fuzzy Probability and Statistics; Studies in Fuzziness and Soft Computing; Springer: Berlin, Germany, 2006; Volume 196. [Google Scholar]
Coppi, R.; Gil, M.A.; Kiers, H.A.L. The fuzzy approach to statistical analysis. Comput. Stat. Data Anal. 2006, 51, 1–14. [Google Scholar] [CrossRef]
D’Urso, P.; Gil, M.A. Fuzzy data analysis and classification. Adv. Data Anal. Classif. 2017, 11, 645–657. [Google Scholar] [CrossRef] [Green Version]
Intan, R.; Mukaidono, M. Fuzzy conditional probability relations and their applications in fuzzy information systems. Knowl. Inf. Syst. 2004, 6, 345–365. [Google Scholar] [CrossRef]
Nguyen, H.; Wu, B. Fundamentals of Statistics with Fuzzy Data; Studies in Fuzziness and Soft Computing; Springer: Berlin, Germany, 2006; Volume 198. [Google Scholar]
Vierti, R. Statistical Methods for Fuzzy Data; Wiley: Chichester, UK, 2011. [Google Scholar]
Kruse, R.; Held, P.; Moewes, C. On fuzzy data analysis. On Fuzziness—A Homage to Lotfi A. Zadeh, Volume 1; Studies in Fuzziness and Soft Computing; Springer: Heidelberg, Germany, 2013; Volume 298, pp. 343–347. [Google Scholar]
Dijkman, J.G.; van Haeringen, H.; Lange, S.J. Fuzzy numbers. J. Math. Anal. Appl. 1983, 92, 301–341. [Google Scholar] [CrossRef] [Green Version]
D’Urso, P. Informational paradigm, management of uncertainty and theoretical formalisms in the clustering framework: A review. Inf. Sci. 2017, 400–401, 30–62. [Google Scholar] [CrossRef] [Green Version]
Tanaka, H.; Uejima, S.; Asai, K. Linear regression analysis with fuzzy model. IEEE Trans. Syst. Man Cybern. 1982, 12, 903–907. [Google Scholar]
Parchami, A.; Taheri, S.M.; Mashinchi, M. Fuzzy p-value in testing fuzzy hypotheses with crisp data. Stat. Pap. 2010, 51, 209. [Google Scholar] [CrossRef]
Grzegorzewski, P.; Hryniewicz, O. Soft methods in statistical quality control. Control Cybern. 2000, 29, 119–140. [Google Scholar]
Zhang, R.; Ashuri, B.; Deng, Y. A novel method for forecasting time series based on fuzzy logic and visibility graph. Adv. Data Anal. Classif. 2017, 11, 759–783. [Google Scholar] [CrossRef]
Gil, M.A.; López-Díaz, M. Fundamentals and Bayesian analyses of decision problems with fuzzy-valued utilities. Int. J. Approx. Reason. 1996, 15, 95–115. [Google Scholar] [CrossRef] [Green Version]
Denoeux, T. Maximum likelihood estimation from fuzzy data using the EM algorithm. Fuzzy Sets Syst. 2011, 183, 72–91. [Google Scholar] [CrossRef] [Green Version]
Quost, B.; Denoeux, T.; Li, S. Parametric classification with soft labels using the evidential EM algorithm: Linear discriminant analysis versus logistic regression. Adv. Data Anal. Classif. 2017, 11, 659–690. [Google Scholar] [CrossRef] [Green Version]
Blanco-Fernández, A.; Casals, M.R.; Colubi, A.; Corral, N.; García-Bárzana, M.; Gil, M.A.; González-Rodríguez, G.; López, M.T.; Lubiano, M.A.; Montenegro, M.; et al. A distance-based statistical analysis of fuzzy number-valued data. Int. J. Approx. Reason. 2014, 55, 1487–1501. [Google Scholar] [CrossRef]
Wu, H.C. Statistical hypotheses testing for fuzzy data. Inf. Sci. 2005, 279, 446–459. [Google Scholar] [CrossRef]
Calcagnì, A.; Lombardi, L.; Pascali, E. A dimension reduction technique for two-mode non-convex fuzzy data. Soft Comput. 2016, 20, 749–762. [Google Scholar] [CrossRef]
Colubi, A.; González-Rodríguez, G.; Gil, M.A.; Trutschnig, W. Nonparametric criteria for supervised classification of fuzzy data. Int. J. Approx. Reason. 2011, 52, 1272–1282. [Google Scholar] [CrossRef] [Green Version]
Coppi, R.; D’Urso, P.; Giordani, P. Fuzzy and possibilistic clustering for fuzzy data. Comput. Stat. Data Anal. 2012, 56, 915–927. [Google Scholar] [CrossRef]
Bolaños, M.J.; De Campos, L.M.; González, A. Convergence properties on monotone expectation and its applications to the extension of fuzzy measures. Fuzzy Sets Syst. 1989, 33, 201–212. [Google Scholar] [CrossRef]
Reche, F.; Morales, M.; Salmerón, A. Construction of fuzzy measures over product spaces. Mathematics 2020, 8, 1605. [Google Scholar] [CrossRef]
Reche, F.; Salmerón, A. Operational approach to general fuzzy measures. Int. J. Uncertain. Fuzziness -Knowl.-Based Syst. 2000, 8, 369–382. [Google Scholar] [CrossRef]
De Campos, L.M.; Bolaños, M.J. Representation of fuzzy measures through probabilities. Fuzzy Sets Syst. 1989, 31, 23–36. [Google Scholar] [CrossRef] [Green Version]
Schweizer, B.; Sklar, A. Probability Metric Spaces; Elsevier North Holland: New York, NY, USA, 1983. [Google Scholar]
Huber, P.H. Robust Statistics; Wiley Series in Probability and Mathematical Statistics; John Wiley and Sons: Hoboken, NJ, USA, 1981. [Google Scholar]

Figure 1. An illustration of the concept of common expectation of

h_{1}

and

h_{2}

.

Figure 1. An illustration of the concept of common expectation of

h_{1}

and

h_{2}

.

Table 1. A fuzzy measure and the associated probability distributions corresponding to all the possible permutations of the indices

(1, 2, 3)

.

Table 1. A fuzzy measure and the associated probability distributions corresponding to all the possible permutations of the indices

(1, 2, 3)

.

$A$	$μ$	$P_{(1, 2, 3)}$	$P_{(1, 3, 2)}$	$P_{(2, 1, 3)}$	$P_{(2, 3, 1)}$	$P_{(3, 1, 2)}$	$P_{(3, 2, 1)}$
$x_{1}$	0.2	0.6	0.6	0.3	0.2	0.4	0.2
$x_{2}$	0.1	0.1	0.1	0.4	0.4	0.1	0.3
$x_{3}$	0.3	0.3	0.3	0.3	0.4	0.5	0.5
$x_{1}, x_{2}$	0.5	0.7	0.7	0.7	0.6	0.5	0.5
$x_{1}, x_{3}$	0.6	0.9	0.9	0.6	0.6	0.9	0.7
$x_{2}, x_{3}$	0.4	0.4	0.4	0.7	0.8	0.6	0.8

Table 2. A fuzzy measure matching the criteria in Example 5.

Reference Subsets	Measure
${x_{1}}$ , ${x_{2}}$ , ${x_{3}}$ , ${x_{4}}$	0.2
${x_{1}, x_{2}}$	0.6
${x_{1}, x_{3}}$ , ${x_{1}, x_{4}}$	0.3
${x_{2}, x_{3}}$	0.5
${x_{2}, x_{4}}$	0.4
${x_{3}, x_{4}}$	0.7
${x_{1}, x_{2}, x_{3}}$	0.9
${x_{1}, x_{2}, x_{4}}$	0.6
${x_{1}, x_{3}, x_{4}}$	0.7
${x_{2}, x_{3}, x_{4}}$	0.8

Table 3. Grades obtained by the students in Example 5 in the individual courses.

Student	$x_{1}$	$x_{2}$	$x_{3}$	$x_{4}$
$h_{1}$	0.9	0.8	0.3	0.2
$h_{2}$	0.2	0.3	0.8	0.9

Table 4. Values of the function

h^{min}

.

Table 4. Values of the function

h^{min}

.

	$x_{1}$	$x_{2}$	$x_{3}$
$x_{1}$	$0.1$	$0.1$	$0.1$
$x_{2}$	$0.1$	$0.4$	$0.4$
$x_{3}$	$0.1$	$0.4$	$0.7$

Table 5. Values of the function

h_{12}^{min}

.

Table 5. Values of the function

h_{12}^{min}

.

Course	$x_{1}$	$x_{2}$	$x_{3}$	$x_{4}$
$x_{1}$	0.2	0.3	0.8	0.9
$x_{2}$	0.2	0.3	0.8	0.8
$x_{3}$	0.2	0.3	0.3	0.3
$x_{4}$	0.2	0.2	0.2	0.2

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Reche, F.; Morales, M.; Salmerón, A. Statistical Parameters Based on Fuzzy Measures. Mathematics 2020, 8, 2015. https://doi.org/10.3390/math8112015

AMA Style

Reche F, Morales M, Salmerón A. Statistical Parameters Based on Fuzzy Measures. Mathematics. 2020; 8(11):2015. https://doi.org/10.3390/math8112015

Chicago/Turabian Style

Reche, Fernando, María Morales, and Antonio Salmerón. 2020. "Statistical Parameters Based on Fuzzy Measures" Mathematics 8, no. 11: 2015. https://doi.org/10.3390/math8112015

APA Style

Reche, F., Morales, M., & Salmerón, A. (2020). Statistical Parameters Based on Fuzzy Measures. Mathematics, 8(11), 2015. https://doi.org/10.3390/math8112015

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Statistical Parameters Based on Fuzzy Measures

Abstract

1. Introduction

2. Preliminaries and Notation

3. Parameters over One Measurable Space

3.1. The Case of Only One Function

3.1.1. Monotone Variance

3.1.2. Monotone Moments

3.2. The Case of Two Functions

4. Parameters Defined over Product Spaces

4.1. The Case of One Function

4.2. The Case of Two Functions

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI