Abstract
Arrow contraction applied to a tropical diagram of probability spaces is a modification of the diagram, replacing one of the morphisms with an isomorphism while preserving other parts of the diagram. It is related to the rate regions introduced by Ahlswede and Körner. In a companion article, we use arrow contraction to derive information about the shape of the entropic cone. Arrow expansion is the inverse operation to the arrow contraction.
1. Introduction
In [1], we have initiated the theory of tropical probability spaces for the systematic study of information optimization problems in information theory and artificial intelligence, such as those arising in robotics [2], neuroscience [3], artificial intelligence [4], variational autoencoders [5], information decomposition [6], and causal inference [7]. In [8], we applied the techniques to derive a dimension-reduction result for the entropic cone of four random variables.
Two of the main tools used for the latter are what we call arrow contraction and arrow expansion. They are formulated for tropical commutative diagrams of probability spaces. Tropical diagrams are points in the asymptotic cone of the metric space of commutative diagrams of probability spaces endowed with the asymptotic entropy distance. Arrows in diagrams of probability spaces are (equivalence classes of) measure-preserving maps.
Arrow contraction and expansion take a commutative diagram of probability spaces as input, modify it, but preserve important properties of the diagram. The precise results are formulated as Theorems 3 and 4 in the main text. Their formulation requires language, notation, and definitions that we review in Section 2.
However, to give an idea of the results in this paper, we now present two examples. For basic terminology and notations used in these examples below, the reader unfamiliar with them is referred either to Section 2 of the present article or in the introductory material in the article [9].
1.1. Two Examples
1.1.1. Arrow Contraction and Expansion in a Two-Fan
Suppose we are given a fan , and we would like to complete it to a diamond
such that the entropy of V, denoted by , equals the mutual information between X and Y, i.e., we would like to realize the mutual information between X and Y by a pair of reductions and . This is not always possible, not even approximately. The Gacs-Körner Theorem [10] describes when such exact realization of mutual information is possible.
Arrow contraction instead produces another fan , such that the reduction is an isomorphism and the relative entropy of X given V equals . By collapsing this reduction, we obtain as a diagram just the reduction . If necessary, we can keep the original spaces Z and Y in the modified diagram obtaining the “broken diamond” diagram
such that . Of course, no special technique is necessary to achieve this result since it is easy to find a reduction from a tropical space to another tropical probability space with the prespecified entropy, as long as the Shannon inequalities are not violated.
However, a similar operation becomes non-trivial and in fact impossible without passing to the tropical limit, if instead of a single space X, there is a more complex sub-diagram as in the example in the next subsection.
To explain how arrow expansion works, we start with the chain of reductions . Can we extend it to a diamond, as in (1), so that ? This is again not possible, in general. However, if we pass to tropical diagrams, then such an extension always exists.
1.1.2. One More Example of Arrow Expansion and Contraction
Consider a diagram presented in Figure 1. Such a diagram is called a -diagram. We would like to find a reduction so that . It is not possible to achieve this within the realm of diagrams of classical probability spaces. But once we pass to the tropical limit, the reduction can be found by contracting and then collapsing the arrow , as shown in Figure 1.
Figure 1.
Arrow contraction and expansion in a -diagram. The fan (shown in red in the Figure) is admissible. Spaces , and belong to the co-ideal . After the operation the part of the diagram shown in blue in the Figure is left unmodified.
Arrow contraction is closely related to the Shannon channel coding theorem. This is perhaps most obvious from the proof. Furthermore, arrow contraction has connections with rate regions, as introduced by Ahlswede and Körner, see [11,12]. These results by Ahlswede and Körner were applied by [13], resulting in a new non-Shannon information inequality. Moreover, in [13], a new proof was given of the results; this new proof is similar to the proof of the arrow contraction result in the present paper.
The main contribution of our work lies in the fact that we prove a much stronger preservation of properties of the diagram under arrow contraction.
2. Preliminaries
2.1. Probability Spaces and Their Diagrams
Our main objects of study will be commutative diagrams of probability spaces. A finite probability space X is a set with a probability measure on it, supported on a finite set. We denote by the cardinality of the support of the measure. The statement means that point x is an atom with positive weight in X. For details see [1,9,14].
Examples of commutative diagrams of probability spaces are shown in Figure 2. The objects in such diagrams are finite probability spaces and morphisms are equivalence classes of measure-preserving maps. Two such maps are considered to be equivalent if they coincide on a set of full measurements. To record the combinatorial structure of a commutative diagram, i.e., the arrangement of spaces and morphisms, we use indexing categories, which are finite poset categories satisfying an additional property, which we describe below.
Figure 2.
Examples of diagrams of probability spaces.
2.1.1. Indexing Categories
A poset category is a finite category such that there is at most one morphism between any two objects either way.
For a pair of objects in a poset category , such that there is a morphism in , we call k an ancestor of l and l a descendant of k. The set of all ancestors of an object k together with all the morphisms between them is itself a poset category and will be called a co-ideal generated by k and denoted by . Co-ideals are also sometimes called filters. Similarly, a poset category consisting of all descendants of and morphisms between them will be called an ideal generated by k and denoted .
An indexing category used for indexing diagrams is a poset category satisfying the following additional property: for any pair of objects the the intersection of co-ideals is also a co-ideal generated by some object ,
In other words, for any pair of objects there exists a least common ancestor , i.e., is an ancestor to both and and any other common ancestor is also an ancestor of . Any indexing category is initial, i.e., there is a (necessarily unique) initial object in it, which is the ancestor of any other object in , in other words .
A fan in a category is a pair of morphisms with the same domain. Such a diagram is also called a span in some literature on Category Theory. A fan is called minimal, if for any other fan included in a commutative diagram
the vertical morphism must be an isomorphism. Any indexing category also satisfies the property that, for any pair of objects in it, there exists a unique minimal fan with target objects of the given ones.
This terminology will also be applied to diagrams of probability spaces indexed by . Thus, given a space X in a -diagram, we can talk about its ancestors, descendants, co-ideal , and ideal . We use square brackets to denote tropical diagrams and spaces in them. For the (co-)ideals in tropical diagrams, in order to unclutter notations, we will write
2.1.2. Diagrams
For an indexing category and a category , a commutative -diagram is a functor . A diagram is called minimal if it maps minimal fans in to minimal fans in .
A constant -diagram denoted is a diagram where all the objects equal to X, and all morphisms are identities.
Important examples of indexing categories are a two-fan, a diamond category, a full category on n spaces, chains . For detailed descriptions and more examples, the reader is referred to the articles cited at the beginning of this section.
2.2. Tropical Diagrams
2.2.1. Intrinsic Entropy Distance
For a fixed indexing category , the space of commutative -diagrams will be denoted by . Evaluating entropy on every space in a diagram gives a map
where the target space is the space of real-valued functions on objects of . We endow this space with the -norm. For a fan of -diagrams we define the entropy distance between its terminal objects by
and the intrinsic entropy distance between two arbitrary -diagrams by
This intrinsic version of the entropy distance was introduced in [15,16]. The triangle inequality for and various other properties are discussed in [1].
In the same article, a useful estimate for the intrinsic entropy distance called the Slicing Lemma is also proven. The following corollary ([1], Corollary 3.10(1)) of the Slicing Lemma will be used in the next section.
Proposition 1.
Let be an indexing category, and included in a pair of two fans
Then
2.2.2. Tropical Diagrams
Points in the asymptotic cone of are called tropical -diagrams and the space of all tropical -diagrams, denoted , is endowed with the asymptotic entropy distance. We explain this now in more detail, and a more extensive description can be found in [14].
To describe points in we consider quasi-linear sequences of -diagrams. That is, we fix a “slowly growing” increasing function satisfying
for some constant and any . We call a sequence -quasi-linear if it satisfies the bound for all
We have shown in [14] that the space does not depend on the choice of function as long as it is not zero. The space of all such sequences is endowed with the asymptotic entropy distance defined by
A tropical diagram is defined to be an equivalence class of such sequences, where two sequences and are equivalent if . The space carries the asymptotic entropy distance and has the structure of a -semi-module—one can take linear combinations with non-negative coefficients of tropical diagrams. The linear entropy functional is defined by
A detailed discussion about tropical diagrams can be found in [14]. In the cited article, we show that the space is metrically complete and isometrically isomorphic to a closed convex cone in some Banach space.
For a chain category, containing k objects and unique morphism for every pair , we have shown in [14] that the space is isomorphic to the following cone in
The isomorphism is given by the entropy function. Thus, we can identify tropical probability spaces (elements in ) with non-negative numbers via entropy. We will simply write to mean the entropy of the space . Along these lines, we also adopt the notations , and for the conditional entropy and mutual information for the tropical spaces included in some diagrams.
2.3. Asymptotic Equipartition Property for Diagrams
2.3.1. Homogeneous Diagrams
A -diagram is called homogeneous if the automorphism group acts transitively on every space in . Homogeneous probability spaces are uniform. For more complex indexing categories, this simple description is not sufficient.
2.3.2. Tropical Homogeneous Diagrams
The subcategory of all homogeneous -diagrams will be denoted and we write for the category of minimal homogeneous -diagrams. These spaces are invariant under the tensor product. Thus, they are metric Abelian monoids.
Passing to the tropical limit, we obtain spaces of tropical (minimal) homogeneous diagrams that we denote and .
2.3.3. Asymptotic Equipartition Property
In [1] the following theorem is proven
Theorem 1.
Suppose is a -diagram of probability spaces for some fixed indexing category . Then, there exists a sequence of homogeneous -diagrams such that
where is a constant only depending on and .
The approximating sequence of homogeneous diagrams is evidently quasi-linear with the defect bounded by the admissible function
Thus, Theorem 1 above states that . On the other hand, we have shown in [14] that the space of linear sequences is dense in . Combining the two statements, we obtain the following theorem.
Theorem 2.
For any indexing category , the space is dense in . Similarly, the space is dense in .
It is possible that the spaces and coincide. At this time, we have neither a proof nor a counterexample to this conjecture.
2.4. Conditioning in Tropical Diagrams
For a tropical -diagram containing a space we defined a conditioned diagram . It can be understood as the tropical limit of the sequence , where is the homogeneous approximation of , is the space in that corresponds to under combinatorial isomorphism and is any atom in .
We have shown in [9] that operation of conditioning is Lipschitz-continuous with respect to the asymptotic entropy distance.
3. Arrow Contraction
3.1. Arrow Collapse, Arrow Contraction, and Arrow Expansion
3.1.1. Prime Morphisms
A morphism in an indexing category will be called prime if it cannot be factored into a composition of two non-identity morphisms in . A morphism in a -diagram indexed by a prime morphism in will also be called prime.
3.1.2. Arrow Collapse
Suppose is a -diagram such that for some pair , the prime morphism is an isomorphism. Arrow collapse applied to results in a new diagram obtained from by identifying and via the isomorphism . The combinatorial type of is different from that of . The spaces and are replaced by a single space, and the new space will inherit all the morphisms in with targets and domains and .
3.1.3. Arrow Contraction and Expansion
Arrow contraction and expansion are two operations on tropical -diagrams. Roughly speaking, arrow contraction applied to a tropical -diagram results in another tropical -diagram such that one of the arrows becomes an isomorphism, while some parts of the diagram are not modified. Arrow expansion is an inverse operation to arrow contraction.
3.1.4. Admissible and Reduced Sub-Fans
An admissible fan in a -diagram is a minimal fan , such that Z is the initial space of and any space in belongs either to the co-ideal or ideal . For example, in the left-most diagram of Figure 1, the fan is admissible, while or are not.
An admissible fan in a diagram will be called reduced if the morphism is an isomorphism.
3.2. The Contraction Theorem
Our aim is to prove the following theorem.
Theorem 3.
Let be an admissible fan in some tropical -diagram . Then for every there exists a -diagram containing an admissible fan , corresponding to the original admissible fan through the combinatorial isomorphism, such that, with the notations and , the diagram satisfies
- (i)
- (ii)
- (iii)
It is not clear that constructing diagrams as in the theorem above for a sequence of values of parameter decreasing to 0, we can obtain a convergent sequence in with the limiting diagram satisfying conclusions of the theorem with . If were a locally compact space, which is an open question at the moment. The convergence would be guaranteed, and then in the theorem above could be replaced by 0.
The proof of Theorem 3 is based on the following proposition, which will be proven in Section 5.
Proposition 2.
Let be an admissible fan in some homogeneous -diagram of probability spaces . Then there exists a -diagram containing the admissible fan such that, with the notations and , it holds that
- (1)
- for any and .
- (2)
- (3)
Proof of Theorem 3.
First, we assume that is a homogeneous tropical diagram. It means that it can be represented by a quasi-linear sequence of homogeneous diagrams, with defect of the sequence bounded by the function for some . This means that for any
where is some constant depending on , see [14].
Fix a number and apply Proposition 2 to the homogeneous diagram , containing the admissible fan and sub-diagram . As a result, we obtain a diagram containing the fan and the sub-diagram , such that
Recall that for a diagram of probability spaces, we denote by the tropical diagram represented by the linear sequence . As an element of a closed convex cone , it can be scaled by an arbitrary non-negative real number; see, for instance, Section 2.3.5 in [14]. For example, is represented by the sequence .
Define the two tropical diagrams
Since does not depend on and does not depend on u we have and . From (3), we obtain
The distance between and can be bounded as follows
This also implies
Since conditioning is a Lipschitz-continuous operation with Lipschitz constant 2, we also have
Combining the estimates in (4)–(7) we obtain
Please note that grows at most exponentially (it is bounded by for some C) and is a strictly sub-linear function. Thus, by choosing sufficiently large n depending on the given , we obtain , satisfying conclusions of the theorem for homogeneous .
To prove the theorem in full generality, observe that all the quantities on the right-hand side of the inequalities are Lipschitz-continuous. Since is dense in the theorem extends to any by first approximating it with any precision by a homogeneous configuration and applying the argument above. □
3.3. The Expansion Theorem
The following theorem is complementary to Theorem 3. The expansion applied to a diagram containing a reduced admissible fan produces a diagram with an admissible fan, such that the contraction of it is the original diagram. Thus, arrow expansion is a right inverse of the arrow contraction operation.
In general, contraction erases some information stored in the diagram, so there are many right inverses. We prove the theorem below by providing a simple construction of one such right inverse.
Theorem 4.
Let be a reduced admissible fan in some tropical -diagram and . Let . Then there exists a -diagram containing the copy of , such that the corresponding admissible fan has and .
Proof.
Let be a tropical probability space with entropy equal to . For any reduction of tropical spaces , there are natural reductions
We construct the diagram by replacing every space in the ideal with . Every morphism within is replaced by
And any morphism from in to a space in is replaced by a composition
Clearly, the resulting diagram satisfies the conclusion of the theorem. □
The rest of the article is devoted to the development of the necessary tools and the proof of Proposition 2.
4. Local Estimate
In this section, we derive a bound, very similar to Fano’s inequality, on the intrinsic entropic distance between two diagrams of probability spaces with the same underlying diagram of sets. The bound will be in terms of the total variation distance between two distributions corresponding to the diagrams of probability spaces. It will be used in the next section to prove Proposition 2.
4.1. Distributions
4.1.1. Distributions on Sets
For a finite set S we denote by the collection of all probability distributions on S and by we denote the total variation distance between .
4.1.2. Distributions on Diagrams of Sets
Let denote the category of finite sets and surjective maps. For an indexing category , we denote by the category of -diagrams in . That is, objects in are commutative diagrams of sets indexed by the category , the spaces in such a diagram are finite sets, and arrows represent surjective maps, subject to commutativity relations.
For a diagram of sets we define the space of distributions on the diagram by
where is the affine map induced by a surjective map . If is the initial space of , then there is an isomorphism
Using the isomorphism (8) we define total variation distance between two distributions as
Given a -diagram of sets and an element we can construct a -diagram of probability spaces .
Below, we give the estimate of the entropy distance between two -diagrams of probability spaces and in terms of the total variation distance between distributions.
4.2. The Estimate
The upper bound on the entropy distance, which we derive below, has two summands. One is linear in the total variation distance with the slope proportional to the log-cardinality of . The second one is super-linear in the total variation distance, but it does not depend on . So, we have the following interesting observation: of course, the super-linear summand always dominates the linear one locally. However, as the cardinality of becomes large, it is the linear summand that starts playing the main role. This will be the case when we apply the bound in the next section.
For consider a binary probability space with the weight of one of the atoms equal to
Proposition 3.
For an indexing category , consider a -diagram of sets . Let be two probability distributions on . Denote , and . Then
Proof.
To prove the local estimate, we decompose both and into a convex combination of a common part and rests and . The coupling between the common parts gives no contribution to the distance and the worst possible estimate on the other parts is still enough to obtain the bound in the lemma, using Proposition 1.
Let be the initial set in the diagram . We will need the following obvious rough estimate of the entropy distance that holds for any :
It can be obtained by taking a tensor product for the coupling between and .
Our goal now is to write and as the convex combination of three other distributions , and as in
with the smallest possible .
We could do it the following way. Let and be the distributions on that correspond to and under isomorphisms (8). Let . If then the proposition follows from the rough estimate (9), so from now on, we assume that . Define three probability distributions , and on by setting for every
Denote by the distributions corresponding to under isomorphism (8). Thus, we have
Now, we construct two fans of -diagrams
by setting
and
The reduction in the fans in (10) is given by coordinate projections. Note that the following isomorphisms hold
Now we apply Proposition 1 along with the rough estimate in (9) to obtain the desired inequality
□
5. Proof of Proposition 2
In this section, we prove Proposition 2, which is shown below verbatim. The proof consists of the construction in Section 5.1 and estimates in Propositions 5 and 6.
5.1. The Construction
In this section, we fix an indexing category , a minimal -diagram of probability spaces with an admissible sub-fan . We denote and by we denote the combinatorial type of .
Instead of diagram , we consider an extended diagram, which is a two-fan of -diagrams
where consists of those spaces in , which are initial spaces of two fans with feet in U and in some space in . That is for every the space is defined to be the initial space in the minimal fan in . It may happen that for some pair of indices the initial spaces of the fans with one feet U and the other and coincide in . In , however, they will be treated as separate spaces so that the combinatorial type of is . Starting with the diagram in (11) one can recover by collapsing all the isomorphism arrows. The initial space of will be denoted .
We would like to construct a new fan , such that
Once this goal is achieved, we collapse all the isomorphisms to obtain -diagram satisfying conditions in the conclusion of Proposition 2.
We start with a general description of the idea behind the construction, followed by a detailed argument. To introduce the new space V we take its points to be N atoms in . Ideally, we would like to choose the atoms in such a way that are disjoint and cover the whole of . It is not always possible to achieve this exactly. However, when is large, N is taken slightly larger than , and are chosen at random, then with high probability the spaces will overlap only little and will cover most of . The details of the construction follow.
We fix and construct several new diagrams. For each of the new diagrams, we provide a verbal and formal description.
- The space . Points in it are independent samples of length N of points in U.
- The space . A point should be interpreted as a choice of index in a sample .
- The -diagram , whereA point in corresponds to the choice of a sample , an independent choice of a member of the sample and a point . Recall that the original diagram was assumed to be homogeneous and, in particular, the distribution on is uniform. Due to the assumption on homogeneity of , the space does not depend on . Since is also equipped with the uniform distribution, it follows that the distribution on will also be uniform.
- The -diagram , whereA point is the choice of a sample and a point x in one of the fibers , . The distribution on is chosen so that the natural projection is the reduction of probability spaces. Given a sample , if the fibers are not disjoint, then the distribution on need not to be uniform. Below, we will give an explicit description of and study the dependence of on the sample .
These diagrams can be organized into a minimal diamond diagram of -diagrams, where reductions are obvious projections.
To describe the probability distribution on , first we define several relevant quantities:
Recall that the distribution is completely determined by the distribution on the initial space of via isomorphism (8). From homogeneity of it follows that distributions on both and are uniform. Therefore
The desired fan mentioned in the beginning of the section is obtained from the top fan in the diagram in (13) by conditioning on . We will show later that for an appropriate choice of N and for most choices of , the fan we obtain in this way has the required properties.
First, we would like to make the following observations. Fix an arbitrary . Then:
- (1)
- The underlying set of the probability space is .
- (2)
- The diagramsare included in a two-fan of -diagramswhich is obtained by conditioning the top fan in the diagram in (13).
The very important observation is that diagrams and are isomorphic for any choice of and . The isomorphism is the composition of the following sequence of isomorphismswhere the first isomorphism follows from the definition of , the second—from minimality of the fan , the third—from the definition of and the fourth—from the homogeneity of .
5.2. The Estimates
We now claim and prove that one could choose a number N and in such that
- (1)
- .
- (2)
- , where and are initial spaces in and , respectively.
5.2.1. Total Variation and Entropic Distance Estimates
If we fix some , then is a scaled binomially distributed random variable with parameters N and , which means that .
First, we state the following bounds on the tails of a binomial distribution.
Lemma 1.
Let ν be a scaled binomial random variable with parameters N and ρ, then
- (i)
- for any holds
- (ii)
- for any holds
The proof of Lemma 1 can be found at the end of this section.
Below we use the notation for the probability distribution on . For a pair of complete diagrams , with the same underlying diagram of sets and with initial spaces , , we will write for the halved total variation distance between their distributions
Proposition 4.
In the settings above, for , the following inequality holds
Proof.
Recall that by definition . We use Equation (15) to expand the left-hand side of the inequality as follows
Since by homogeneity of the original diagram, all the summands are the same, we can fix some and estimate further:
Applying Lemma 1(i), we obtain the required inequality. □
In the propositions below we assume that is sufficiently large (larger than ).
Proposition 5.
In the settings above and for any holds:
Proof.
We will use local estimate, Proposition 3, to bound the entropy distance and then apply Proposition 4. To simplify notations, we will write simply for .
Please note that in the chosen regime, , the first summand in the right-hand side of the inequality is larger than the second, i.e., and therefore we can write
□
5.2.2. The “Height” Estimate
Recall that for given and we have constructed a two-fan of -diagrams
We will now estimate the length of the arrow .
Proposition 6.
In the settings above and for
Proof.
First, we observe that the fiber of the reduction over a point is a homogeneous probability space of cardinality equal to , therefore its entropy is .
The last inequality above follows from Lemma 1 (ii). □
5.3. Proof of Proposition 2
Let be the fan constructed in Section 5.1. The construction is parameterized by number N and atom . Below, we will choose a particular value for N and apply estimates in Propositions 5 and 6 with particular choice of parameter t to show that there is , so that the fan satisfies the conclusions of Proposition 2.
Let
With these choices of N and t, Proposition 5 implies
while Proposition 6 gives
Therefore, there is a choice of such that the fan
satisfies conditions in (12). As we have explained at the beginning of Section 5.1, by collapsing isomorphism arrows, we obtain -diagram satisfying conclusions of Proposition 2.
5.4. Proof of Lemma 1
The Chernoff bound for the tail of a binomially distributed random variable asserts that for any holds
Applying the bound for the upper and lower tail for the binomially distributed random variable , we obtain the inequality in (i).
The second assertion follows from the following estimate
For we have , therefore
By the Chernoff bound, we have
Author Contributions
Investigation, R.M. and J.W.P. All authors have read and agreed to the published version of the manuscript.
Funding
Open Access funding was provided by the Max Planck Society.
Institutional Review Board Statement
Not applicable.
Data Availability Statement
No new data were created or analyzed in this study. Data sharing is not applicable to this article.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Matveev, R.; Portegies, J.W. Asymptotic dependency structure of multiple signals. Inf. Geom. 2018, 1, 237–285. [Google Scholar] [CrossRef]
- Ay, N.; Bertschinger, N.; Der, R.; Güttler, F.; Olbrich, E. Predictive information and explorative behavior of autonomous robots. Eur. Phys. J. B 2008, 63, 329–339. [Google Scholar] [CrossRef]
- Friston, K. The free-energy principle: A rough guide to the brain? Trends Cogn. Sci. 2009, 13, 293–301. [Google Scholar] [CrossRef] [PubMed]
- Van Dijk, S.G.; Polani, D. Informational constraints-driven organization in goal-directed behavior. Adv. Complex Syst. 2013, 16, 1350016. [Google Scholar] [CrossRef]
- Kingma, D.P.; Welling, M. Auto-encoding variational Bayes. arXiv 2013, arXiv:1312.6114. [Google Scholar]
- Bertschinger, N.; Rauh, J.; Olbrich, E.; Jost, J.; Ay, N. Quantifying unique information. Entropy 2014, 16, 2161–2183. [Google Scholar] [CrossRef]
- Steudel, B.; Ay, N. Information-theoretic inference of common ancestors. Entropy 2015, 17, 2304–2327. [Google Scholar] [CrossRef]
- Matveev, R.; Portegies, J.W. Tropical probability theory and an application to the entropic cone. Kybernetika 2020, 56, 1133–1153. [Google Scholar] [CrossRef]
- Matveev, R.; Portegies, J.W. Conditioning in tropical probability theory. Entropy 2023, 25, 1641. [Google Scholar]
- Gács, P.; Körner, J. Common information is far less than mutual information. Probl. Control Inf. Theory 1973, 2, 149–162. [Google Scholar]
- Ahlswede, R.; Körner, J. On the connection between the entropies of input and output distributions of discrete memoryless channels. In Proceedings of the Fifth Conference on Probability Theory, Brasov, Romania, 1–6 September 1974. [Google Scholar]
- Ahlswede, R.; Körner, J. Appendix: On Common Information and Related Characteristics of Correlated Information Sources; Springer: Berlin/Heidelberg, Germany, 2006; pp. 664–677. [Google Scholar]
- Makarychev, K.; Makarychev, Y.; Romashchenko, A.; Vereshchagin, N. A new class of non-Shannon-type inequalities for entropies. Commun. Inf. Syst. 2002, 2, 147–166. [Google Scholar] [CrossRef]
- Matveev, R.; Portegies, J.W. Tropical diagrams of probability spaces. Inf. Geom. 2020, 3, 61–88. [Google Scholar] [CrossRef]
- Kovačević, M.; Stanojević, I.; Šenk, V. On the hardness of entropy minimization and related problems. In Proceedings of the 2012 IEEE Information Theory Workshop, Lausanne, Switzerland, 3–7 September 2012; pp. 512–516. [Google Scholar]
- Vidyasagar, M. A metric between probability distributions on finite sets of different cardinalities and applications to order reduction. IEEE Trans. Autom. Control 2012, 57, 2464–2477. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

