1. Introduction
The communication for omniscience (CO) problem is formulated in [
1]. It is assumed that there are a finite number of users in a system that are indexed by the set
V. Each user
observes a distinct component
of a discrete multiple random source
in private. The users are allowed to exchange their observations over public authenticated broadcast channels so as to attain omniscience, the state where each user recovers the observation sequence of the entire source
. Originally, the CO problem was studied in [
1] due to its dual relationship with the multi-terminal secret capacity (Theorem 1 in [
1]). The interactive data exchange process was also studied in other source coding scenarios, e.g., the interactive function computation problem [
2,
3,
4] (Refs. [
2,
3,
4] studied the two-terminal
lossy source coding problem, where each user communicates with the other to compute a function. The aim is to characterize the sum-rate-distortion region. This paper considers lossless source coding, which, for
, corresponds to zero-distortion case (see Theorem 2 in [
3], Section IV-B in [
4]), but the focus is to find how to distribute the sum-rate evenly among more than two users). More recently, CO was also cast into the
coded cooperative data exchange (CCDE) problem [
5,
6,
7], in which the users are mobile clients broadcasting linear combinations of packets over noiseless peer-to-peer (P2P) wireless channels and the communication rates are restricted to being integral.
One main optimization problem that arises in CO is how to minimize the overall source coding rate to attain omniscience. We call it the
minimum sum-rate problem and denote the value of the minimum sum-rate by
. By utilizing submodular function minimization (SFM) techniques, the value of
, along with an optimal rate vector, are determined in
time in [
8] for the asymptotic model, where the communication rates are real-valued (In an asymptotic model, the observation sequence is assumed to be infinitely long. The CCDE corresponds to the finite linear source model, an example of the non-asymptotic model. In the non-asymptotic model, each user only obtains a finite length of observations, and the broadcasts are integer numbers of linear combinations of observations (Section II in [
8])), and in [
9,
10] for CCDE. Here,
is the complexity of a SFM algorithm and is polynomial (Chapter VI in [
11]). A more efficient algorithm can be found in [
12] by simulating the communications based on the random linear network coding scheme [
13,
14]. The complexity was further reduced to
in [
15].
While [
8,
9,
10,
12] only determined one optimal rate vector, it is shown in (Section III-B in [
8]) that the optimal rate region is not a singleton in general. Thus, it is natural to consider how to choose an optimal rate vector that also attains fairness, particularly when the intention is to promote the mobile clients’ cooperation in CCDE or even out the battery usage in a wireless sensor network (WSN). The problem of how to attain fairness has been previously considered in [
16,
17] for CCDE. In [
17], a multi-layer acyclic graph is proposed, based on which, a constrained quadratic programming was formulated to determine the Jain’s fairness solution [
18]. The algorithm proposed in [
16] is a greedy approach, where, in each iteration, a unit rate is assigned to the user that optimizes a fairness measure, so that the resulting solution converges on a fair and integer-valued optimal rate vector (The fair solutions in [
16,
17] coincide with the egalitarian solution [
19] in coalitional game theory due to the equivalence between the submodular base polyhedron and the optimal rate region (Section III-B in [
8]), both of which, as will be shown in
Section 3 in this paper, coincide with the core of a coalitional game).
However, neither of them applies to systems where the communication rates are non-integral, e.g., the asymptotic model, or where packet splitting (and hence fractional transmission rates) is allowed in CCDE. The main purpose of this paper is to study how to attain fairness in the optimal rate region for the CO problem, where the broadcast rates are not constrained to be integer-valued. We start the study by showing the equivalence between the optimal rate region and the core (the solution set) of a coalitional game. We consider two fair solutions proposed in coalitional game theory: the Shapley value [
20] and the egalitarian solution [
19]. We propose a steepest descent algorithm (SDA) for searching a fractional egalitarian solution that can be implemented by packet splitting in CCDE. Finally, we show that the game can be decomposed by the fundamental partition
into subgames (The fundamental partition
is an optimizer that determines the minimum sum-rate
[
8]. See also
Section 2.1), each of which can attain fairness, either being the Shapley value or the egalitarian solution, on its own. This decomposition leads to a distributed computation method for fairness and reduces the complexity.
1.1. Summary of Main Results
Our main results are summarized as follows:
(1) We formulate the problem of attaining omniscience with the minimum sum-rate
by a coalitional game model, where the characteristic cost function, denoted by
for all user groups
, quantifies the remaining randomness in
given the common randomness
shared by all users in
V (The game model is closely related to the dual relationship (Theorem 1 in [
1,
21]):
, where
is the entropy of
and
is the common randomness that is shared by all the users in
V [
22,
23]. The interpretation is that attaining omniscience by the minimum sum-rate
is equivalent to determining how to let the users encode the remaining randomness in
given the common randomness
). That is,
specifies the source coding rate/cost upper bound to each user subset
. We show that (a) the core of the game coincides with the optimal rate region containing all the solutions to the minimum sum-rate problem, and (b) the game describes the users’ cooperation when they jointly encode the remaining randomness in
to reach omniscience with the sum-rate exactly equal to
.
(2) We show that the Shapley value [
20] assigns each user the expected marginal cost
over all
. This value is fair in that it penalizes each user according to the source coding rate he/she incurred in CO. While the complexity of obtaining the exact Shapley value is exponentially growing in the number of users
, we show that the Shapley value is the mean over all extreme points in the core. By randomly generating an extreme point set of a desired size, an approximation of the Shapley value can be obtained in polynomial time.
(3) The egalitarian solution [
19] aims to equalize the rate/cost allocation in the optimal rate region regardless of the marginal costs. We show that this solution is more suitable for those systems with equally privileged users, e.g., CCDE and WSN. We propose a steepest descent algorithm (SDA) for searching a fractional egalitarian solution that can be implemented in CCDE by splitting each packet into
chunks, where
denotes the number of user subsets in the fundamental partition
. Based on an optimality criterion for the egalitarian solution stating that the local optimum implies the global optimum, we show that the estimation sequence generated by the SDA converges in the fractional egalitarian solution in
time, where
is the maximum
-norm over all pairs of points in the optimal rate region. In addition, the steepest direction in each iteration of SDA can be computed in a distributed manner.
(4) The game is decomposable such that the users in each subset form a subgame with the characteristic cost function for all . This is due to the mutual independence between and for any two distinct subsets given the common randomness . To attain fairness, it suffices to let the users within each subgame decide how to allocate the source coding rates fairly, which not only reduces complexity, but also allows parallel computation.
1.2. Organization
The rest of the paper is organized as follows. The system model is described in
Section 2, where we also review existing results on the minimum sum-rate problem. In
Section 3, we formulate the coalitional game model and show that it can be decomposed by the fundamental partition
. In
Section 4, we show how to attain fairness in the optimal rate region by the Shapley value and discuss how to approximate it to avoid the exponentially growing complexity. In
Section 5, we propose the SDA algorithm for searching the fractional egalitarian solution. In both
Section 4 and
Section 5, we also present methods to obtain the Shapley value and egalitarian solution by the decomposition method.
3. Decomposable Coalitional Game
We formulate a coalitional game model (The coalitional game was first formulated to propose the Shapley value as a fair rate allocation for CO. This paper introduces the decomposition property and focuses on the mutual dependence, cooperation among the users and distributed computation of fair solutions) in this section and show the equivalence of the optimal rate region
and the core of this game. The purpose is to introduce two game-theoretic solutions, the Shapley value and egalitarian solution in
Section 4 and
Section 5, respectively, for attaining fairness in
. We also show the decomposition of this game model, a property that will be utilized in
Section 4 and
Section 5 to propose a decomposition method for obtaining the Shapley value and egalitarian solution, respectively.
3.1. Coalition Game Model
Let the users in
V be self-autonomous decision makers that take part in the CO, and assume that, instead of being selfish, they may cooperate with others to form groups. We call
a
coalition and
V the
grand coalition. Consider the function
. Here,
equals the common randomness
in
that is shared by all users in
V due to the dual relationship (Theorem 1 in [
1]) [
21]
Here,
is called the multivariate mutual information in [
23], or shared information in [
22]. Assume that
is obtained by a random variable
U, which does not need to be broadcast over the public channels. Then, the problem is how to encode the remaining randomness in
given
U for all
that is measured by the Dilworth truncation [
23]
We call the characteristic cost function in that specifies the upper bound on the (source) coding cost when the users in X form a coalition so as to jointly encode the randomness in given U. The coalitional game model is characterized by the user set V and the characteristic cost function . We denote it by . In this sense, the game formulates a multi-terminal data compression problem where the users jointly encode the remaining randomness in that is specified by the set function .
Example 2. For the 5-user system in Example 1, the common randomness is obtained by the random variable U. For users 1 and 2, we havebeing the remaining randomness in given U. The interpretation is that, in order to attain omniscience with sum-rate , the rate for users 1 and 2 to jointly encode their observations is no more than 2 bits. Alternatively, the maximum cost incurred by users 1 and 2 cooperating with each other is 2 bits of coding rate. One can show that (7) holds for all (An explanation of (7) can be found in (Section IV-B in [23])). 3.2. Core
While
quantifies the maximum coding cost in each coalition, each
denotes a cost allocation method, with each
being the source coding rate assigned to user
. The solution set of the game
is called the
core [
30,
31] which contains all
s distributing exactly the total cost
to individual users such that
holds for all coalitions
. It is not difficult to see from (
4) that the core coincides with the optimal rate region
, which is nonempty (Theorem 4 in [
8]) (The nonemptiness of the core
can also be explained by the submodularity of
. See
Appendix B). In the rest of the paper, we will refer to
as the core or the optimal rate region interchangeably.
The inequality
in the core
also has an interpretation in coalitional game theory. If a cost allocation method
results in
for some
X, the users in
X may break the coalition
X and seek another
such that
. This means the coalition
X is not stable (This can also be explained by the definition of stability (Section 4.3 in [
32]) and the fact that the core is a stable set in (Theorem 8 in [
32])). On the other hand, if
holds for all
, then no user has the incentive to break the coalition
V and form a smaller one, i.e., the grand coalition
V forms. In this sense, the core contains all cost allocation methods
that exactly distribute the sum-cost
to all users in a way such that all of them would like to cooperate with others for the purpose of attaining omniscience (Chapter 12 in [
30]).
3.3. Decomposition
For any such that , let ⊔ denote the disjoint union and be the direct sum of and . For example, for and , . For , let be the characteristic vector of the subset X such that if and if .
For the fundamental partition
, each
defines a subgame
with the characteristic cost function
for all
. The core of the subgame
is
where the polyhedron
is a
reduction/projection of
on to
C. The following lemma shows the decomposition property of the game
.
Lemma 1 (Theorem 38 and Lemma 39 in [
8])
. The game can be decomposed by the fundamental partition so that- (a)
the dimension of is and - (b)
The following holds for any :
- (i)
For any such that , , for all , and ;
- (ii)
For all , for some and .
The decomposition of the core
in Lemma 1(a) interprets the decomposition of the solution set of
and the fact that it makes no difference for the users to cooperate in the grand coalition
V or in subgames
(This fact can be seen more clearly via the definition of the decomposable game in
Appendix B). Lemma 1(b) states that the costs, or source coding rates, can be exchanged within a subgame, but not between subgames, which can be explained by the dependence relationship in the remaining randomness as follows.
Interpretation
Recall that
. Due to the fact that
is the finest minimizer of (
5), we have ((
8a) holds because
is the minimizer of the Dilworth truncation; the strict inequality (
8a) holds because otherwise
is not the finest minimizer)
Here, (
8a) means that given the common randomness, any two distinct coalitions
C and
in
have
and
mutually independent. That is, to attain omniscience with the minimum sum-rate
, the users in
C and
must encode the exact randomness
and
, respectively. In other words, the costs or the source coding rates cannot transfer between any two users
and
. This is the interpretation of Lemma 1(b)-(i) and we call it
zero exchange rate between
i and
j. On the other hand, (8b) states that, given the common randomness, any two users
i and
j in the same coalition
C are mutually dependent. In this case, the information amount
that is mutual to
X and
can be encoded by either
or
, i.e., the costs or source coding rates can be transferred between users
i and
j: they have
nonzero exchange rate.
Example 3. For the 5-user system in Example 1, we have the fundamental partition . The core has the dimension of and is decomposed aswhere , as shown in Figure 2, is a 2-dimensional plane and and are singletons containing single points and , respectively. Given the common randomness that is obtained by U, any two distinct are independent, e.g.,for any , any two disjoint such that are mutually dependent; e.g.,i.e., in the fundamental partition , we have zero exchange rate between coalitions and nonzero exchange rate within a coalition. The decomposition property in Lemma 1 is useful when considering the fairness. Since there is no freedom for the users who belong to distinct coalitions in
to negotiate how to allocate coding costs fairly, it suffices to just study how to attain fairness within each
. This will be further summarized in Theorem 1 in
Section 4 and Theorem 3 in
Section 5 that allow distributed computation for attaining the two fair solutions, the Shapley value and egalitarian solution, in the optimal rate region
.
4. Shapley Value
For an omniscience-achievable rate vector , it is worth discussing how fairly it can distribute the source coding rates. In the game model , fairness is also an important performance metric of a cost allocation method in that it promotes the users incentives to cooperate with each other. In this section, we discuss how to attain fairness by searching the Shapley value in the optimal rate region .
The Shapley value
is defined in (Theorem 7 in [
20]) as a unique solution in the core
, with each dimension being
Here, is the remaining uniqueness in given the and the common randomness in U. The interpretation is that, to attain the omniscience by the minimum sum-rate , if the users in X encode at the rate first, user i needs to encode at the rate .
In the game model
,
is the marginal coding cost incurred by user
i when he/she joins the coalition
X. Let
such that
and
for all
be a
permutation of
V. Here, each
denotes the order that the users join the grand coalition
V, for which, the total cost
can be assigned to individual users by the Edmond greedy algorithm [
33]: For
i increasing from 1 to
, we assign each user the marginal cost
where
and
for all
. The resulting
satisfies
. The Shapley value
is based on the assumption that all the permutations are equiprobable. For each
, user
i will be assigned the marginal coding cost
for
out of
times. Then,
assigns each user the expected marginal coding cost he/she incurs over all permutations.
4.1. Decomposition
The fairness of
can also be explained by its relationship with the extreme points in the core
. Let
be the
extreme point set containing all vertices of the core
. For a particular permutation
, the optimal rate vector returned by the Edmond greedy algorithm is an extreme point of
and
can be constructed by applying the Edmond greedy algorithm for all
permutations of
V (Section 3.2 in [
11]). Based on the definition (
9), the Shapley value is the mean value of
[
20] (In this sense, the Shapley value is the gravity center of
[
20]):
Since the core is decomposed by the fundamental partition (Lemma 1(a)), we have the extreme point set also decomposed as , which leads to the decomposition of the Shapley value in Theorem 1 below (Theorem 1 is a special case of Theorem 1 when the minimum sum-rate ).
Theorem 1. For the Shapley value in the core , we have where is the Shapley value in the core of the subgame .
Proof. For the fundamental partition
, since
, we have
Theorem holds. □
Example 4. In the core of the 5-user system in Example 1, the Shapley value by the definition (9) is . We have four extreme points insuch that . Recall that we have the fundamental partition that decomposes the game as in Example 2. According to Theorem 1, we havewhere is the Shapley value of the subgame as shown in Figure 3, and . 4.2. Complexity and Approximation
The complexity of computing the Shapley value is exponentially large in the problem size
, since the values of
for all
are required to be calculated to get
in (
9). What makes the situation worse is that determining the value of the Dilworth truncation
for a given
X requires calling SFM algorithms and their complexity is
. Therefore, it is impractical to obtain the exact value of
in large systems.
One alternative approach is to utilize the decomposition property in Theorem 1 to allow distributed and parallel computation. For each coalition C in the fundamental partition , let the users in C obtain the Shapley value in the subgame by themselves; All are combined to form the Shapley value of the entire game . By doing so, the complexity is determined by the subgame of maximum size . However, the complexity to obtain the Shapley value in the subgame is again exponentially growing in .
While the high computational complexity is an intrinsic problem of the Shapley value, there are various approximation algorithms proposed in the literature to alleviate this complexity problem. For example, the random permutation method in [
34] utilizes the fact that the Shapley value is the mean value over the extreme point set in (
10). The idea is to randomly generate a set of permutations of
V of a desired size, e.g.,
or
permutations, and apply the Edmond greedy algorithm to determine the corresponding extreme points, the mean of which is an approximation of the Shapley value
. This approximation method can also be used in combination with the decomposition method in Theorem 1.
Example 5. For the 5-user system in Example 1, we first decompose the game into subgames , and . For the subgame , we randomly select permutations. For example, for , and , we can generate three extreme points:respectively, so that the mean value is an approximation of the Shapley value in . Note, different permutations might result in different approximations. For example, if we choose three permutations , and , we would have the approximation . See the two approximations in Figure 3. By combining the approximation of with the ones obtained in other subgames, we have the approximation of the Shapley value of the game . For example, the above two approximations generate and , which are the two approximations to .
In Example 5, we chose no more than
permutations for each subgame
, where the extreme point corresponding to each permutation can be determined by Algorithm 3 in [
8] (The algorithm (Algorithm 3 in [
8]) can be considered as a modified Edmond greedy algorithm. See Appendix B in [
8] for the explanation. In
Figure 3, the path towards the extreme point
is generated by Algorithm 3 in [
8] for the permutation
) in
time. Therefore, the overall complexity for approximating the Shapley value
is determined by the subgame
of maximum size as polynomial time
. Accordingly, if we choose
permutations for each subgame
, the complexity would be
. We also remark that the approximation algorithm is not unique. In fact, there are many other existing methods, e.g., [
34,
35,
36], that can be implemented to approximate the Shapley value
.
5. Egalitarian Solution
The Shapely value
is fair in that it penalizes each user based on the expected marginal cost he/she incurs in game
. For example, in the 5-user system in
Figure 1, user 4 incurs the most expected marginal cost
over all
, the Shapley value
in Example 4 assigns him/her the most coding cost.
However, this fairness suggested by the Shapley value might not be the desired one in some practical systems. For example, in CCDE where mobile clients are considered as equally privileged peers, it is desirable to find a
that allocates the source coding rate as evenly as possible without considering users’ prior knowledge of the source
. Another example is a WSN containing a large number of battery-powered sensors with equal initial energy budget, where the even allocation of the source coding rate prolongs the overall lifetime of the WSN. In these cases, it might be more suitable to consider the
egalitarian solution [
19,
28], the minimizer of
. In this section, we consider a more general quadratic programming [
37,
38]:
where
and
is a positive weight vector which could have some interpretations in practical scenarios. For example,
could denote the quality of the wireless transmission of user
i in CCDE or the remaining battery energy of sensor node
i in a WSN.
It is shown [
39,
40] that, if the function value of
can be obtained directly, problem (
11) can be solved in
time (In the case when
, the minimizer of (
11) is also called the minimum-norm point in
, which can be searched by the algorithm in [
41] by polynomial time calls of the Dilworth truncation
). However, determining the Dilworth truncation
for a given
X has the complexity
. In addition, the minimizer of (
11) may not be fractional or, if it is fractional, may require splitting each packet into more than
chunks in CCDE. Since
and it is shown in (Corollary 28 in [
8]) that there exists an optimal rate vector in
with an LCM
, it would be of interest to see if we can find a fair optimal rate vector in
still with LCM
.
Example 6. Consider the minimizer of (11) for the 5-user system in Example 1, we have for and for . While the former can be implemented by 2-packet-splitting, the latter requires dividing each packets into 10 chunks. In fact, not only the minimizer of (
11), but also the Shapley value have the problem of incurring more than
-packet-splitting. For example, the Shapley value
in Example 4 requires 4-packet-splitting, where
, and its approximation
in Example 5 even requires 30-packet-splitting. Such dividing and reconstructing of packets could be cumbersome or even very impractical. In the next subsection, we consider how to search for an egalitarian solution in
that can be implemented by
-packet-splitting.
5.1. Steepest Descent Algorithm
For
, let
be the set containing all rational numbers that are divisible by
K. Consider the problem
The purpose is to search for a fractional egalitarian solution
with an LCM
. The objective function in (
12) is a separable convex function, for which local optimality w.r.t. the
elementary exchange implies the global optimality. See Lemma 2 below. Here,
denotes the cost/rate exchange between users
i and
j in the game
(The optimization criterion in Lemma 2 is related to the discrete convexity: The problem in (
11) exhibits
M-convexity on the real number set (Section 1.4.2 in [
42]), which also leads to the
M-convexity on the fractional number set of (
12). This is essentially due to the
M-convexity of a submodular base polyhedron (Theorem 4.12 and Proposition 4.13 in [
42]). See also
Appendix A for the definition of the submodular base polyhedron).
Lemma 2. In CCDE, is the minimizer of (12) if and only if, for all and positive integer such that ,where . Proof. The proof is based on a necessary and sufficient condition for the minimizer of (
11) for any convex function
g in (Theorem 20.3 in [
11]):
is the minimizer of (
11) if and only if, for all
and positive integer
such that
,
. In CCDE, the entropy function
H is integer-valued and
is fractional with denominator
so that the value of
has the denominator
for all
. Furthermore, all extreme points in
have the LCM
(Corollary 10 in [
8]). Therefore, for any
, if
, then
. So, Lemma 2 is the result of Theorem 20.3 in [
11] on the set
. □
Algorithm 1: Steepest descent algorithm (SDA). |
input: a positive integer and an initial point output: , the minimizer of ( 12) |
|
Lemma 2 directly suggests the steepest descent algorithm (SDA) in Algorithm 1 (The SDA algorithm is also based on a discrete convex minimization algorithm in (Section 10.1.1 in [
42]), which has been adopted in (Algorithm 1) for determining an integer-valued egalitarian solution for CCDE. The difference is that we use a dependence function
to search the steepest descent direction, which is more efficient than the brute-force search in (Algorithm 1). Furthermore, note that (Algorithm 1) only determines a real-valued egalitarian solution, which may be unable to be implemented in some practical systems, e.g., CCDE).
Furthermore, note that, as an input to the SDA, the initial point
can be searched by the MDA algorithm at the same time when the minimum sum-rate problem is solved (Corollary 28(a) in [
8]). The optimality of the SDA algorithm is stated below.
Theorem 2. For CCDE, the SDA algorithm generates an estimation sequence that converges on the minimizer of (12). Proof. Consider the recursive process
where
. This is a steepest descent approach: in each iteration
n, we move from the current estimation
in the steepest elementary exchange
by a constant step size
. Based on Lemma 2, starting with any initial
, the minimum of (
12) is reached when this recursion converges, i.e., when
.
For
, consider the
dependence function (Sections 2.2 and 2.3, Equations (2.14), (2.15), (2.18) and (2.19) in [
11])
The last equality (
14) states that
is the minimal minimizer of
(The last equality (
14) is shown in (Equations (2.14) and (2.15) in [
11]) due to the min-max theorem (Corollary 3.4 in [
11]). The minimizers of
form a set lattice and the smallest/minimal is the intersection of all minimizers. See Sections 2.2 and 2.3 in [
11] for details). A trivial case is that
. Based on (
14), we have
for all
. So, for all iterations
n of the recursion above,
and
Therefore, theorem holds. □
Remark 1. According to the proofs of Lemma 2 and Theorem 2, if , we could have for some iteration n in the SDA algorithm, or the estimation sequence converges on, but may not reach exactly, the minimizer of (12), i.e., the output vector can be a suboptimal solution of (12). Example 7. For the 5-user system in Example 1, we first apply the MDA algorithm in [8] and get the minimum sum-rate , the fundamental partition and an extreme point in the core . By setting and , we start the SDA algorithm with the initial point . At the first iteration , we have Then, . For and , we have and, therefore, . Since , we assign and continue the iteration.
By repeating the same procedure in each iteration, we get the estimation sequence that results in the update path The recursion converges at , where we have , which is the minimizer of (12) for and . Here, is a fractional egalitarian solution, a fair optimal rate vector in , that can be implemented by 2-packet-splitting in CCDE. 5.2. Dependence Function
Based on (
14), Lemma 1(b) and the discussion in
Section 3.3, it is not difficult to see that, for all
, if
for any
, then
and
are mutually dependent given the common randomness
in
U, i.e.,
, hence the name dependence function. Moreover, due to the fact that
, we can transfer arbitrarily small, but nonzero, coding cost from user
j to user
i for encoding the mutually shared information between users
i and
j, which is consistent with the nonzero exchange rate in
Section 3.3.
In addition, we must have
for the coalition
such that
, e.g., (
15). This is because
for all
such that
and
for all
, i.e., given the common randomness in
U, any
is only mutually dependent on any other
in the same coalition
. This will be formally stated as the decomposition of
in Theorem 3.
5.3. Complexity and Distributed Implementation
The SDA algorithm requires oracle calls of
, instead of
, which is equivalent to the entry of the entropy function
H and avoids the complexity of calculating the Dilworth truncation. We derive the worst-case complexity of SDA as follows. For any initial point
, the total number of iterations of the SDA algorithm is
. Let
denote the
-size of the core
. The maximum number of iterations of the SDA algorithm is
. The minimization problem (13) in step 5 in the SDA algorithm is a SFM due to the intersecting submodularity of
(Lemma 3 in [
8]). Thus, each iteration of the SDA algorithm completes in
time and the overall complexity is
(The reason that the
-size determines the upper bound on the number of iterations is explained in detail in (Section 10.1.1 in [
42])).
Example 8. For the estimation sequence generated in Example 7 by the SDA algorithm, we show the error of the estimation in terms of the -norm in Figure 4. Since in each iteration of the SDA algorithm, the estimation is updated along the steepest elementary exchange by step size toward the optimizer , we necessarily have decreased by each time. As in Figure 4, we have the error , a linearly decreasing curve. In this case, there are iterations in the SDA algorithm so that we incur calls of . In general, since the -size of is , the worst-case complexity of the SDA algorithm when applied to the 5-user system in Figure 1, is calls of . The SDA algorithm can also be implemented in a decentralized manner: let each user i obtain the dependence function , a set of mutually dependent users given the common randomness in U, by him/herself in steps 4 to 6; the steps 7 to 13 can be completed by users’ communications over the broadcast channels. By doing so, the computational complexity incurred at each user is .
5.4. Decomposition
Similar to the decomposition of the Shapley value in Theorem 1, we also have the decomposition property of the egalitarian solution in Theorem 3. We omit the proof since it is a direct result of Corollary 42 in [
8], Lemma 1 and Lemma 2(b).
Theorem 3. With as the egalitarian solution, the minimizer of (11), or the fractional egalitarian solution, the minimizer of (12), where is the egalitarian solution or fractional egalitarian solution, respectively, in the core of the subgame . Theorem 3 states that the egalitarian solution
can be determined by allowing the subgames
for all
to obtain their own
. This decomposition method can be used in combination with the SDA algorithm so that the complexity is reduced to
, where
is the
-size of the core
of the subgame
of maximum size. In addition, the users in each subgame can run the SDA algorithm in a distributed manner as discussed in
Section 5.3 and therefore the complexity incurred at each user is
.
Remark 2. Theorems 1 and 3 justify the exchange rate resulted from the mutual dependence in Section 3.3 when the game is decomposed by the fundamental partition into the subgames for all : since the exchange rate, or mutual dependence, is only nonzero inside each subgame , we just need to let the users cooperating in the same decide how to attain fairness. Example 9. For the 5-user system in Example 1, consider searching the fractional egalitarian solution w.r.t. in Example 7 by the decomposition method in Theorem 3. We first decompose into subgames , and . For the subgames and , we can directly assign and , respectively. For the subgame , we apply the SDA algorithm and get the following update path to the fractional egalitarian solution : See Figure 5. Then, we get , the fractional egalitarian solution w.r.t. in . In this case, we still have 5 iterations in the SDA algorithm and the convergence performance is exactly the same as in Figure 4. However, the complexity reduces to calls of . In general, since , the complexity of the SDA algorithm when applied to the subgame , is calls of . 6. Conclusions
We established the equivalence between the optimal rate region of CO and the core of a coalitional game with the characteristic cost function being the Dilworth truncation measuring the remaining information in for all subsets given the common randomness in U. For attaining fairness in the optimal rate region, we considered the Shapley value and the egalitarian solution. The Shapley value differs from the egalitarian solution in that the fairness is attained if each user i is penalized by the expected marginal cost or source coding rate he/she incurs if in coalition X. By utilizing the fact that the Shapley value is the average over all extreme points in the core, we showed that an approximation, instead of the exact Shapley value, can be obtained by taking the mean over a desired number of randomly generated extreme points. We also proposed the SDA algorithm for obtaining the egalitarian solution in the core that can be implemented in CCDE by -packet-splitting. We showed that the game is itself decomposable by the fundamental partition so that, given the common randomness, and for any two distinct are mutually independent, while and for all are mutually dependent. This dependence relationship leads to a decomposition method for obtaining the fair solutions: the Shapley value and the egalitarian solution can be obtained independently within each subgame.
The methods for searching the Shapley value and the egalitarian solution in this paper require the solutions to the minimum sum-rate problem, the value of and and also an optimal rate vector in to initiate the SDA. To further improve the efficiency of attaining fairness in CO, it is worth studying whether we can directly attain the fairness in the optimal rate region without solving the minimum sum-rate problem first. On the other hand, apart from the fact that the egalitarian solution is more suitable to CCDE and WSN, it is worth understanding to which scenarios the fairness suggested by the Shapley value applies. Finally, the fractional egalitarian solution only determines a fair rate assigned to each user in CCDE. We still need a complete network coding scheme that also specifies the coefficients in the linear combination of chunks in each transmission.