Received: 8 November 2012 / Revised: 15 February 2013 / Accepted: 19 February 2013 / Published: 25 February 2013

**Abstract**

The Minimum Mutual Information (MinMI) Principle provides the least committed, maximum-joint-entropy (ME) inferential law that is compatible with prescribed marginal distributions and empirical cross constraints. Here, we estimate MI bounds (the MinMI values) generated by constraining sets **T**_{cr}, comprehended by *m*_{cr} linear and/or nonlinear joint expectations computed from samples of *N* iid outcomes. Marginals (and their entropy) are imposed by single morphisms of the original random variables. *N*-asymptotic formulas are given for the distribution of the cross expectations' estimation errors, as well as for the MinMI estimation bias, its variance and its distribution. A growing **T**_{cr} leads to an increasing MinMI, converging eventually to the total MI. Under *N*-sized samples, the MinMI increment relative to two encapsulated sets **T**_{cr1} ⊂ **T**_{cr2} (with numbers of constraints *m*_{cr1} < *m*_{cr2}) is the test difference *δH* = *H*_{max1,N} − *H*_{max2,N} ≥ 0 between the two respective estimated MEs. Asymptotically, *δH* follows a scaled Chi-Squared distribution, (1/2*N*) *χ*²(*m*_{cr2} − *m*_{cr1}), whose upper quantiles determine whether the constraints in **T**_{cr2}/**T**_{cr1} explain significant extra MI. As an example, we set the marginals to be normally distributed (Gaussian) and build a sequence of MI bounds associated with successive nonlinear correlations due to joint non-Gaussianity. Noting that in real-world situations the available sample sizes can be rather low, the relationship between MinMI bias, probability-density over-fitting and outliers is put in evidence for under-sampled data.
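The asymptotic significance test summarized above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: the function names are ours, and the chi-squared upper-tail probability is computed with a pure-stdlib closed form for integer degrees of freedom.

```python
import math

def chi2_sf(x, k):
    """Survival function P(Chi2_k > x) for integer degrees of freedom k,
    using the closed-form series for the regularized upper incomplete gamma."""
    if k % 2 == 0:
        # Even dof: Q(x) = exp(-x/2) * sum_{i=0}^{k/2-1} (x/2)^i / i!
        term, total = 1.0, 1.0
        for i in range(1, k // 2):
            term *= (x / 2.0) / i
            total += term
        return math.exp(-x / 2.0) * total
    # Odd dof: Q(x) = erfc(sqrt(x/2)) + exp(-x/2) * sum of half-integer terms
    total = math.erfc(math.sqrt(x / 2.0))
    for i in range(1, (k + 1) // 2):
        total += math.exp(-x / 2.0) * (x / 2.0) ** (i - 0.5) / math.gamma(i + 0.5)
    return total

def extra_mi_significant(delta_H, N, m_cr1, m_cr2, alpha=0.05):
    """Test whether the MinMI increment delta_H = H_max1,N - H_max2,N exceeds
    sampling noise. Under the null that the extra constraints in T_cr2/T_cr1
    carry no additional MI, 2*N*delta_H is asymptotically Chi2(m_cr2 - m_cr1).
    Returns (p_value, reject_null)."""
    dof = m_cr2 - m_cr1
    statistic = 2.0 * N * delta_H
    p_value = chi2_sf(statistic, dof)
    return p_value, p_value < alpha
```

For instance, in the Gaussian-marginals setting mentioned above, the first bound of such a sequence (a single linear correlation constraint) is the familiar Gaussian MI, −½ ln(1 − *ρ*²); testing an enlarged constraint set against it with this statistic asks whether the nonlinear (non-Gaussian) constraints add significant MI.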
(This article belongs to the Special Issue Estimating Information-Theoretic Quantities from Data)
