Information Properties of a Random Variable Decomposition through Lattices

Fábio C. C. Meneghetti; Henrique K. Miyamoto; Sueli I. R. Costa

doi:10.3390/psf2022005019

Abstract

A full-rank lattice in the Euclidean space is a discrete set formed by all integer linear combinations of a basis. Given a probability distribution on

R^{n}

, two operations can be induced by considering the quotient of the space by such a lattice: wrapping and quantization. For a lattice

Λ

, and a fundamental domain

D

, which tiles

R^{n}

through

Λ

, the wrapped distribution over the quotient is obtained by summing the density over each coset, while the quantized distribution over the lattice is defined by integrating over each fundamental domain translation. These operations define wrapped and quantized random variables over

D

and

Λ

, respectively, which sum up to the original random variable. We investigate information-theoretic properties of this decomposition, such as entropy, mutual information and the Fisher information matrix, and show that it naturally generalizes to the more abstract context of locally compact topological groups.

Keywords:

Fisher information; information geometry; lattices; mutual information; quantization; topological groups; wrapped distributions

1. Introduction

Lattices are discrete sets in

R^{n}

formed by all integer linear combinations of a set of independent vectors, and have found different applications, such as in information theory and communications [1,2,3]. Given a probability distribution in

R^{n}

, two operations can be induced by considering the quotient of the space by a lattice: wrapping and quantization.

The wrapped distribution over the quotient is obtained by summing the probability density over each coset. It is used to define parameters for lattice coset coding, particularly for AWGN and wiretap channels, such as the flatness factor, which is, up to a constant, the

L^{\infty}

distance from a wrapped probability distribution to a uniform one [4,5]. This factor is equivalent to the smoothing parameter, used in post-quantum lattice-based cryptography [6]. In the context of directional statistics, wrapping has been used as a standard way to construct distributions on a circle and on a torus [7].

The quantized distribution over the lattice can be defined by integrating over each fundamental domain translation, thus corresponding to the distribution of the fundamental domains after lattice-based quantization is applied. Lattice quantization has different uses in signal processing and coding: for instance, it can achieve the optimal rate-distortion trade-off and can be used for shaping in channel coding [2]. A special case of interest is when the distribution on the fundamental region is uniform, which amounts to high-resolution quantization or dithered quantization [8,9].

In this work, we relate these two operations by remarking that the random variables induced by wrapping and quantization sum up to the original one. We study information properties of this decomposition, both from classical information theory [10] and from information geometry [11], and provide some examples for the exponential and Gaussian distributions. We also propose a generalization of these ideas to locally compact groups. Probability distributions on these groups have been studied in [12], and some information-theoretic properties have been investigated in [13,14,15]. In addition to probability measures, one can also define the notions of lattice and fundamental domains on them, thereby generalizing the Euclidean case. We show that wrapping and quantization are also well defined, and provide some illustrative examples.

2. Lattices, Wrapping and Quantization

2.1. Lattices and Fundamental Domains

A lattice

Λ

in

R^{n}

is a discrete additive subgroup of

R^{n}

, or, equivalently, the set

Λ = {α_{1} b_{1} + \dots + α_{k} b_{k} | α_{1}, \dots, α_{k} \in Z}

formed by all integer linear combinations of a set of linearly independent vector

{b_{1}, \dots, b_{k}} \subset R^{n}

, called a basis of

Λ

. A matrix B whose column vectors form a basis is called a generator matrix of

Λ

, and we have

Λ = B Z^{k}

. The lattice dimension is k, and, if

k = n

, the lattice is said to be full-rank; we henceforth consider full-rank lattices. A lattice

Λ

defines an equivalence relation in

R^{n}

:

x \sim y \Leftrightarrow x - y \in Λ

. The associated equivalence classes are denoted by

\bar{x}

or

x + Λ

. The set of all equivalence classes is the lattice quotient

R^{n} / Λ

, and we denote the standard projection

π : R^{n} \to R^{n} / Λ, π (x) = \bar{x}

.

Let

D

be a Lebesgue-measurable set of

R^{n}

and

Λ

a lattice. We say that

D

is a fundamental domain or a fundamental region of

Λ

, or that

D

tiles

R^{n}

by

Λ

, if (1)

⋃_{λ \in Λ} (λ + D) = R^{n}

, and (2)

(λ + D) \cap (\tilde{λ} + D) = \emptyset

, for all

λ \neq \tilde{λ}

in

Λ

(it is often only asked that this intersection has Lebesgue measure zero, but we require it to be empty). Given a fundamental domain

D

, each coset

\bar{x} \in R^{n} / Λ

has a unique representative in

D

, i.e., the measurable map

{π |}_{D} : D \to R^{n} / Λ

is a bijection. This fact suggests using a fundamental domain to represent the quotient. Each fundamental domain contains exactly one lattice point, which may be chosen as the origin. An example of a fundamental domain is the fundamental parallelotope with respect to a basis

{b_{1}, \dots, b_{n}}

, namely

P (Λ) := {x = α_{1} b_{1} + \dots + α_{n} b_{n} | α_{1}, \dots, α_{n} \in [0, 1[}

. Another one is the Voronoi region

V (Λ)

of the origin, given by the points that are closer to the origin than to any other lattice point, with an appropriate choice for ties. It is a well-known fact that every fundamental domain has the same volume, denoted by

covol Λ := vol D = | det B |

, for any generator matrix B of

Λ

.

2.2. Wrapping and Quantization

Consider

R^{n}

with the Lebesgue measure

μ

, and P a probability measure such that

P ≪ μ

. Then the probability density function (pdf) of P is

p = \frac{d P}{d μ}

, the Radon–Nikodym derivative. For fixed full-rank lattice

Λ

and fundamental domain

D

, the wrapping of P by

Λ

is the distribution

P_{π} := π_{*} P

on

R^{n} / Λ

, given by

P_{π} (A) = P (π^{- 1} A)

. For simplicity, we identify

R^{n} / Λ

with

D

to regard

P_{π}

as a distribution over

D

, and then we have

π : R^{n} \to D

given by

(y + λ) \mapsto y

, for all

y \in D, λ \in Λ

. Using this identification, the wrapping has density

p_{π} = \frac{d P_{π}}{d μ}

given by

p_{π} (y) = \sum_{λ \in Λ} p (y + λ) .

(1)

A construction that is, in some sense, dual to wrapping is quantization. Note that each fundamental domain

D

partitions the space as

R^{n} = ⨆_{λ \in Λ} (λ + D)

. The quantization function is the measurable map

Q : R^{n} \to Λ

, given by

(y + λ) \mapsto λ

, for

y \in D

and

λ \in Λ

. The quantized probability distribution of P on the discrete set

Λ

is

P_{Q} := Q_{*} P

, given by

P_{Q} (A) := P (Q^{- 1} A)

. The probability mass function of the quantized distribution is then

p_{Q} (λ) = \int_{D} p (y + λ) d y .

(2)

Letting X be a vector random variable in

R^{n}

with distribution p, we define

X_{π} := π (X)

and

X_{Q} := Q (X)

the wrapped and quantized random variables, respectively. By definition, they are distributed according to

p_{π}

and

p_{Q}

. Interestingly, they sum up to the original one:

X = X_{π} + X_{Q},

(3)

since

π + Q = {id}_{R^{n}}

. Note also that

X_{π} + X_{Q}

has the same distribution as

(X_{π}, X_{Q})

, by the bimeasurable bijection

y + λ \mapsto (y, λ)

. These factors, however, are not independent, since, in general,

p (y + λ) \neq p_{π} (y) p_{Q} (λ)

. The difference between

p (x)

and

(p_{π} \otimes p_{Q}) (x) := p_{π} (π (x)) p_{Q} (Q (x))

shall be illustrated in the following examples. Note that the expression for the quantized distribution depends on the choice of fundamental domain, while the wrapped distribution does not, up to a lattice translation.

We say a random variable X over

[0, \infty)

is memoryless if

\bar{C} (t) = \bar{C} (t + s) / \bar{C} (s)

for all

t, s

, where

\bar{C} (t) := P [X > t]

is the tail distribution function. In particular, a memoryless distribution satisfies

\bar{C} (y + λ) = \bar{C} (y) \bar{C} (λ)

for all

y \in D

,

λ \in Λ

, which implies

p = p_{π} \otimes p_{Q}

. The converse, however, is not true; for example, independence holds whenever p is constant on each region

λ + D

, for

λ \in Λ

.

Example 1.

The exponential distribution, parametrized by

ν > 0

, is defined as

p (x) = ν e^{- ν x} 1_{[0, + \infty[} (x),

where

1_{A} (x)

takes value 1 if

x \in A

, and 0 otherwise. Choosing the lattice

Λ = α Z

,

α \in R_{+}

, and the fundamental domain

D = [0, α[

, one can write closed-form expressions for the wrapped and quantized distributions:

p_{π} (y) = \frac{ν e^{- ν y}}{1 - e^{- ν α}}, y \in D and p_{Q} (λ) = e^{- ν λ} (1 - e^{- ν α}), λ \in Λ \cap R_{+} .

(4)

Note that, in this special case,

p = p_{π} \otimes p_{Q}

, as a consequence of memorylessness. The wrapped distribution with

α = 2 π

, which amounts to a distribution on the unitary circle, is well studied in [16].

Example 2.

Consider the univariate Gaussian distribution

p (x) = \frac{1}{\sqrt{2 π σ^{2}}} exp (- \frac{{(x - μ)}^{2}}{2 σ^{2}})

and the lattice

Λ = α Z

, with fundamental domain

D = [- \frac{α}{2}, \frac{α}{2}[

,

α \in R_{+}

. The wrapped and quantized distributions are given respectively by

p_{π} (y) = \frac{1}{\sqrt{2 π σ^{2}}} \sum_{i \in Z} e^{- \frac{{(y - μ + α i)}^{2}}{2 σ^{2}}}, y \in D and p_{Q} (λ) = \frac{1}{\sqrt{2 π σ^{2}}} \int_{λ - \frac{α}{2}}^{λ + \frac{α}{2}} e^{- \frac{{(x - μ)}^{2}}{2 σ^{2}}} d x, λ \in Λ .

The value

α = 2 π

for the wrapped distribution on a unitary circle is usually considered in directional statistics [7]. Figure 1 illustrates the original, wrapped, quantized and product distributions for different zero-mean Gaussian distributions. As can be seen in the figure, in this case,

p (x) \neq p_{π} (y) p_{Q} (λ)

.

Figure 1. Example of zero-mean Gaussian distributions and their corresponding wrapped, quantized and product distributions, with

Λ = Z

and

D = [- \frac{1}{2}, \frac{1}{2} [

for different variances:

σ^{2} = 0.25

(blue),

σ^{2} = 1

(orange),

σ^{2} = 4

(green).

A straightforward consequence of the decomposition (3) is

$E [X] = E [X_{π}] + E [X_{Q}]$ ;
$Var [X] = Var [X_{π}] + Var [X_{Q}] + Cov [X_{π}, X_{Q}] + Cov [X_{Q}, X_{π}]$ ,

where

E [\cdot]

,

Var [\cdot]

and

Cov [\cdot, \cdot]

denote, respectively, the expectation, the variance and the cross-covariance operators.

We note that different types of discretization have also been studied, other then integrating over a fundamental domain [17]. For instance, in [4,18,19], the discretized distribution is defined by restricting the original pdf

p (x)

to the lattice

Λ

, and then normalizing:

D_{Λ, c} (λ) := \frac{p (c + λ)}{\sum_{\tilde{λ} \in Λ} p (c + \tilde{λ})},

(5)

for a fixed

c \in D

. This discretization is nothing other than the conditional distribution of

X_{Q}

given that

X_{π} = c

, expressed as

p_{Q | π} (λ | c) = p (c + λ) / p_{π} (c)

. Moreover, when

p = p_{π} \otimes p_{Q}

, such as in the exponential distribution, cf. example 1, then

D_{Λ, c} (λ) = p_{Q} (λ)

.

3. Information Properties

3.1. Information-Theoretic Measures

Let us consider a random variable X with distribution p and the induced wrapped and quantized ones, respectively,

X_{π} \sim p_{π}

and

X_{Q} \sim p_{Q}

. The mutual information between

X_{π}

and

X_{Q}

is defined as the Kullback–Leibler divergence

I (X_{π}; X_{Q}) := D_{KL} (p ∥ p_{π} \otimes p_{Q})

, and is a measure of how non-independent the marginal distributions

p_{π}

and

p_{Q}

are [10]. Using the theorem of change of variables, we have

\begin{matrix} I (X_{π}; X_{Q}) & = E_{X} [log \frac{p (X)}{p_{π} \otimes p_{Q} (X)}] \\ = E_{X} [log p (X)] - E_{X} [log p_{π} (X_{π})] - E_{X} [log p_{Q} (X_{Q})] \\ = E_{X} [log p (X)] - E_{X_{π}} [log p_{π} (X_{π})] - E_{X_{Q}} [log p_{Q} (X_{Q})] \\ = h (X_{π}) + H (X_{Q}) - h (X) . \end{matrix}

(6)

Note that, from this decomposition, we have

h (X) \leq h (X_{π}) + H (X_{Q})

.

Proposition 1.

Let X be a real random variable, and

X_{π}

and

X_{Q}

the respective wrapped and quantized random variables, using the lattice

α Z

. Denote

μ_{Q} := E [X_{Q}]

and

σ_{Q}^{2} := Var [X_{Q}]

. If X has support

[0, \infty)

, then the mutual information

I (X_{π}; X_{Q})

between

X_{π}

and

X_{Q}

is upper-bounded by

I (X_{π}; X_{Q}) < log (e (μ_{Q} + α / 2)) - h (X) .

(7)

If X has support

R

, then

I (X_{π}; X_{Q})

is upper-bounded by

I (X_{π}; X_{Q}) < \frac{1}{2} log (2 π e σ_{Q}^{2}) + \frac{2 log e}{exp (2 π^{2} α^{- 2} σ_{Q}^{2}) - 1} - h (X) .

(8)

Proof.

First,

h (X_{π}) \leq log α

, since the uniform distribution maximizes entropy on a bounded support. Then, note that the mean and variance of the integer-valued random variable

α^{- 1} X_{Q}

are

α^{- 1} μ_{Q}

and

α^{- 2} σ_{Q}^{2}

, respectively. For (7), use that, for positive integer random variables,

H (X_{Q}) < log (e (μ_{Q} / α + 1 / 2))

, as in [20] (Theorem 8); for (8), the upper-bound for integer-valued random variables from [20] (Theorem 10) gives us

H (X_{Q}) < \frac{1}{2} log (2 π e α^{- 2} σ_{Q}^{2}) + \frac{2 log e}{exp (2 π^{2} α^{- 2} σ_{Q}^{2}) - 1}

. Replacing the corresponding inequalities in (6) yields the desired results. □

The following lemma can be found in [2] (Appendix 3).

Lemma 1.

h (X_{π}) \leq h (X)

.

Proof.

h (X) = h (X_{π}) + H (X_{Q} | X_{π})

, and

H (X_{Q} | X_{π}) \geq 0

, since it is a discrete entropy. □

Proposition 2.

Let

Λ_{α} := α Λ

,

α > 0

, be a family of lattices, with fundamental domains

D_{α} := α D

.

1.: If $D$ is connected, and p is continuous and Riemann-integrable, then ${lim}_{α \to 0} I (X_{π}; X_{Q}) = 0$ .
2.: If 0 is an interior point of $D$ , then ${lim}_{α \to + \infty} I (X_{π}; X_{Q}) = 0$ .

Proof.

For

α \to 0

, the proof is an adaptation of [10] (Theorem 8.3.1). Since

D

is connected and p is continuous, we can use the mean value theorem: for every

λ \in Λ

there exists an

x_{λ, α} \in (λ + D_{α})

such that

p (x_{λ, α}) vol D_{α} = p_{Q} (λ)

. Therefore, we can write

H (X_{Q}) = - \sum_{λ \in Λ_{α}} p (x_{λ, α}) log (p (x_{λ, α})) vol D_{α} - log (vol D_{α})

, using that

\sum_{λ \in Λ_{α}} p (x_{λ, α}) vol D_{α} = 1

. The first term is an n-dimensional Riemann sum, and converges to

h (X)

when

α \to 0

, while the second term becomes arbitrarily small. Therefore,

0 \leq I (X_{π}; X_{Q}) \leq H (X_{Q}) + log (vol D_{α}) - h (X) \to 0

, so

I (X_{π}; X_{Q}) \to 0

.

For

α \to + \infty

, note that, from Lemma 1,

I (X_{π}; X_{Q}) \leq H (X_{Q})

. However, by choosing

α

sufficiently large, we can make

p_{Q} (0) = \int_{D_{α}} p (x) d x

arbitrarily close to 1, since 0 is in the interior of

D_{α}

. Therefore,

H (X_{Q})

can be made arbitrarily small. □

Example 3.

In the case of the exponential distributions, as in Example 1, the distributions of

X_{π}

and

X_{Q}

are independent, i.e.,

p = p_{π} \otimes p_{Q}

, therefore

I (X_{π}; X_{Q}) = 0

. The mutual information and the corresponding upper bound (7) are plotted in Figure 2a, as function of the parameter ν.

Figure 2. Mutual information

I (X_{π}; X_{Q})

and its upper bound.

Example 4.

In the case of the univariate zero-mean Gaussian distributions, as in Example 2, one can use (6) to numerically compute the mutual information

I (X_{π}; X_{Q})

, as a function of the standard deviation σ, and compare it with the upper bound (8) (Figure 2b). Interestingly,

I (X_{π}; X_{Q})

vanishes as

σ \to 0

or

σ \to + \infty

, which is equivalent to choosing a lattice

Λ = α Z

with

α \to 0

or

α \to + \infty

, cf. Proposition 2. The mutual information attains a maximum in

σ \approx 0.38

, showing this is the value for which

X_{π}

and

X_{Q}

are the least independent.

3.2. Fisher Information

Let

M = {p_{θ} : θ \in Θ}

be a family of probability densities

p_{θ} : R^{n} \to R_{+}

smoothly parametrized by

θ

in an open set

Θ \subset R^{d}

. The Fisher information matrix is defined as the positive semi-definite matrix

G (θ)

with coefficients

g_{i j} (θ) = E_{p_{θ}} [\partial_{i} ℓ_{θ} \partial_{j} ℓ_{θ}]

, where

ℓ_{θ} (x) := log p_{θ} (x)

. When M is a manifold satisfying certain regularity conditions [11], and G is positive definite, it becomes a Riemannian manifold with the metric given by

g_{i j} (θ)

, called a statistical manifold. Let ⪯ denote the Loewner partial order for matrices, given by

A ⪯ B

if, and only if,

B - A

is positive semi-definite. The following results justify the name information matrix given to this quantity.

Proposition 3

([11,21]). Let X be a random variable distributed according to a distribution parametrized by θ, and

G (θ)

its information matrix. The following hold.

1.: Monotonicity:if $F : X \to Y$ is a measurable function (i.e., a statistic) and $G_{F} (θ)$ is the information matrix of $F (X)$ , then $G_{F} (θ) ⪯ G (θ)$ , with equality if, and only if, F is a sufficient statistic for θ.
2.: Additivity:if $X, Y$ are independent random variables, then the joint information matrix satisfies $G_{(X, Y)} (θ) = G_{X} (θ) + G_{Y} (θ)$ .

Let X be a random variable on

R^{n}

, and

X_{π}

and

X_{Q}

its wrapped and quantized factors, respectively. We denote their respective Fisher information matrices by

G (θ)

,

G_{π} (θ)

and

G_{Q} (θ)

. By additivity, the Fisher information of

p_{π} \otimes p_{Q}

is

\tilde{G} (θ) := G_{π} (θ) + G_{Q} (θ)

, and, by monotonicity, we have both

G_{π} (θ) ⪯ G (θ)

and

G_{Q} (θ) ⪯ G (θ)

. It follows immediately that

\frac{\tilde{G} (θ)}{2} = \frac{G_{π} (θ) + G_{Q} (θ)}{2} ⪯ G (θ) .

(9)

Example 5.

In the family of exponential distributions, as in Example 1, the independence of

X_{π}

and

X_{Q}

implies that the Fisher information matrix is additive. Indeed, for

Λ = α Z

:

G (ν) = \frac{1}{ν^{2}}, G_{π} (ν) = \frac{1}{ν^{2}} + \frac{α^{2}}{2 (1 - cosh (α ν))}, and G_{Q} (ν) = \frac{α^{2}}{2 (cosh (α ν) - 1)} .

4. A Generalization to Topological Groups

A topological group is a topological space

(G, τ_{G})

that is also a group with respect to some operation · called product, and such that the inverse

g^{- 1}

and product

g \cdot h

are continuous. As additional requisites, we ask G to be locally compact, Hausdorff and second-countable (i.e., has a countable basis) [22]. Let

B_{G}

be the the Borel

σ

-algebra of G. Haar’s theorem says there is a unique (up to a constant) Radon measure on G that is invariant by left translations—we will suppose a fixed normalization, and denote both the measure and integration with respect to it by dg. The group G is said to be unimodular if dg is also invariant by right translations. Since G is

σ

-compact, the Haar measure is

σ

-finite [12].

Let

Γ

be a discrete subgroup of G, which is necessarily closed, since G is Hausdorff, and countable, since G is second-countable. Let us also consider the quotient space of left cosets

G / Γ = {\bar{g} = g Γ | g \in G},

which has a natural projection

π : G \to G / Γ

, given by

π (g) = \bar{g}

. We call

Γ

a lattice if the induced Haar measure on

G / Γ

is finite and bi-invariant. A particular case is when the quotient

G / Γ

is compact; then

Γ

is said to be a uniform lattice. A cross-section is defined as a set

D \subset G

of representatives of

G / Γ

such that all cosets are uniquely represented. A fundamental domain is a measurable cross-section. It can be shown that

Γ

is a lattice if, and only if, it admits a fundamental domain. Furthermore, every fundamental domain has the same measure [23,24].

Let P be a probability measure on the space

(G, B_{G})

that is absolutely continuous with respect to the Haar measure dg. By the Radon–Nikodym theorem, we can define a density function

p = \frac{d P}{d g} \in L^{1} (G)

, such that

p \geq 0

and

P (A) = \int_{A} p (g) d g

, for all

A \in B_{G}

. The original measure can be represented as

P = p d g

, and we consider the family of all such densities

{P (G) = p \in L^{1} (G) | p \geq 0 d g - a . e ., \int p d g = 1} .

Probability distributions on locally compact groups have been studied in [12], and some information-theoretic properties have been investigated in [13,14,15]. The result that allows us to consider wrapped distributions in this context is the Weil formula, taken as a particular case of [24] (Theorem 3.4.6):

Theorem 1.

For any

f \in L^{1} (G)

, the wrapping

f_{π} \in L^{1} (G / Γ)

,

f_{π} (\bar{g}) := \sum_{λ \in Γ} f (g λ)

is well defined

d \bar{g}

-almost everywhere, belongs to

L^{1} (G / Γ)

, and

\int_{G} f (g) d g = \int_{G / Γ} \sum_{λ \in Γ} f (g λ) d \bar{g} .

(10)

As a consequence, for every probability density

p \in P (G)

, we can consider its wrapping

p_{π} (\bar{g}) = \sum_{λ \in Γ} p (g λ)

, which is

L^{1} (G / Γ)

, non-negative and is also a probability density:

\int_{G / Γ} p_{π} d \bar{g} = 1

. The associated probability measure over

(G / Γ, B_{G / Γ})

is

P_{π} = p_{π} d \bar{g}

. This notation, suggesting

P_{π}

as the push-forward measure by

π

, is not a coincidence, since, from Theorem 1,

\begin{matrix} π_{*} P (A) & = \int_{G} 1_{A} (π (g)) p (g) d g \\ = \int_{G / Γ} \sum_{λ \in Γ} 1_{A} (π (g)) p (g λ) d \bar{g} \\ = \int_{G / Γ} 1_{A} (\bar{g}) p_{π} (\bar{g}) d \bar{g} = P_{π} (A) . \end{matrix}

Analogously, given a fundamental domain

D

, it is possible to define a quantization map

Q : G \to Λ

by

Q (g λ) = λ

, for every

g \in D, λ \in Γ

, which is unique since

G = ⨆_{g \in D} g Γ

. The quantized probability distribution is the discrete probability measure

P_{Q}

over

Λ

, defined by the mass function

p_{Q} (λ) = \int_{D} p (g λ) d g

, or as the push-forward measure

Q_{*} P

.

If X is distributed according to p, and

X_{π} = π (X) \sim p_{π}

,

X_{Q} = Q (X) \sim p_{Q}

, then

X = X_{π} \cdot X_{Q}

, again, as a consequence of

g \mapsto (π (g), Q (g))

being a measurable bijection whose inverse is the product

π (g) \cdot Q (g)

. Despite being an abstract definition, this framework expands the scope of the previous approach, cf. examples below. In the following, let

Λ \subset R^{n}

be a full-rank lattice, and

Λ_{s} \subset Λ

be a full-rank sublattice, as defined in Section 2.

Example 6.

Let

G = R^{n}

and

Γ = Λ

. This recovers the approach from Section 2 as a particular case.

Example 7.

Let

G = Λ

and

Γ = Λ_{s}

. A fundamental domain is a choice

D = {d_{1}, \dots, d_{k}}

of

k = | Λ / Λ_{s} |

points, where each point corresponds to a coset

\bar{λ} = (λ + Λ_{s}) \in Λ / Λ_{s}

. Of particular interest are Voronoi constellations [25,26] where the coset leaders are selected, with some choice made for ties. Since Λ is discrete, the Haar measure is the counting measure

μ (A) = | A |

, and

p : Λ \to [0, 1]

. The wrapped and quantized distributions are

p_{π} (\bar{λ}) = \sum_{λ_{s} \in Λ_{s}} p (λ + λ_{s})

, and

p_{Q} (λ_{s}) = \sum_{i = 1}^{k} p (d_{i} + λ_{s})

.

Example 8.

Let

G = R^{n} / Λ_{s}

(a torus) and

Γ = π_{s} (Λ)

(the projection of Λ to G). Then

π_{s} (Λ)

consists of a finite family of cosets

{\bar{λ}}_{1}, \dots, {\bar{λ}}_{k}

, for

k = | Λ / Λ_{s} |

, and a choice of fundamental domain

\bar{D}

is the projection of a fundamental domain

D

of Λ. There are some standard choices for the distribution on G, such as a wrapping from the Euclidean space and the bivariate von Mises distribution [7] (Section 11.4). Then,

p_{π} (\bar{x}) = \sum_{i = 1}^{k} p (\bar{x} + {\bar{λ}}_{i})

and

p_{Q} ({\bar{λ}}_{i}) = \int_{\bar{D}} p (\bar{x} + {\bar{λ}}_{k}) d \bar{x}

, and, in the particular case where

p (\bar{x}) = \sum_{λ_{s} \in Λ_{s}} p (x + λ_{s})

is a

Λ_{s}

-wrapped distribution, they become

p_{π} (\bar{x}) = \sum_{i = 1}^{k} \sum_{λ_{s} \in Λ_{s}} p (x + λ_{s} + λ_{i})

and

p_{Q} ({\bar{λ}}_{i}) = \sum_{λ_{s} \in Λ_{s}} \int_{D} p (x + λ_{s} + λ_{i}) d x

.

Example 9.

Let

G = F_{q}^{n}

(a finite field) or

G = Z_{q}^{n}

, and

Γ = C

(any linear block code). A fundamental domain can be a finite set of points that tiles the space by

C

. The distributions then become finite sums, such as in Example 7.

Example 10.

Let

G = SL (n, R)

, the Lie group of square matrices with determinant 1, and

Γ = SL (n, Z)

(the subgroup of integer matrices). This is, in fact, a lattice, since for

n = 2

,

vol (G / Γ) = \sqrt{2} ζ (2)

where ζ is the Riemann zeta function, and for

n > 2

the finite covolume is calculated in [27], where descriptions of fundamental domains are also given.

5. Conclusions

In this work, we have studied the decomposition of a random variable through lattices into its wrapping and quantization terms. Generalization of examples and of Proposition 1 to higher dimensions constitutes a work in progress. We have also proposed a generalization of this decomposition to topological groups; in particular, this allows one to study information theory on such abstract spaces, which is another perspective for future work.

Author Contributions

Conceptualization, all authors; investigation, F.C.C.M. and H.K.M.; writing—original draft preparation, F.C.C.M. and H.K.M.; writing—review and editing, all authors; supervision, S.I.R.C. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partly supported by Brazilian National Council for Scientific and Technological Development (CNPq) grants 141407/2020-4 and 314441/2021-2, and by São Paulo Research Foundation (FAPESP) grant 2021/04516-8.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors are grateful to Max Costa for fruitful discussions, and to the reviewer for the relevant suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

Conway, J.H.; Sloane, N.J.A. Sphere Packings, Lattices and Groups; Springer: New York, NY, USA, 1999. [Google Scholar]
Zamir, R. Lattice Coding for Signals and Networks: A Structured Coding Approach to Quantization, Modulation and Multiuser Information Theory; Cambridge University Press: Cambridge, UK, 2014. [Google Scholar]
Costa, S.I.R.; Oggier, F.; Campello, A.; Belfiore, J.C.; Viterbo, E. Lattices Applied to Coding for Reliable and Secure Communications; Springer: Cham, Switzerland, 2017. [Google Scholar]
Ling, C.; Belfiore, J.C. Achieving AWGN channel capacity with lattice Gaussian coding. IEEE Trans. Inf. Theory 2014, 60, 5918–5929. [Google Scholar] [CrossRef][Green Version]
Damir, M.T.; Karrila, A.; Amorós, L.; Gnilke, O.W.; Karpuk, D.; Hollanti, C. Well-rounded lattices: Towards optimal coset codes for Gaussian and fading wiretap channels. IEEE Trans. Inf. Theory 2021, 67, 3645–3663. [Google Scholar] [CrossRef]
Chung, K.M.; Dadush, D.; Liu, F.H.; Peikert, C. On the lattice smoothing parameter problem. In Proceedings of the 2013 IEEE Conference on Computational Complexity, Stanford, CA, USA, 5–7 June 2013; pp. 230–241. [Google Scholar]
Mardia, K.V.; Jupp, P.E. Directional Statistics; Wiley: New York, NY, USA, 2000. [Google Scholar]
Zamir, R.; Feder, M. On lattice quantization noise. IEEE Trans. Inf. Theory 1996, 42, 1152–1159. [Google Scholar] [CrossRef]
Ling, C.; Gan, L. Lattice quantization noise revisited. In Proceedings of the 2013 IEEE Information Theory Workshop (ITW), Sevilla, Spain, 9–13 September 2013; pp. 1–5. [Google Scholar]
Cover, T.M.; Thomas, J.A. Elements of Information Theory, 2nd ed.; Wiley: Hoboken, NJ, USA, 2006. [Google Scholar]
Amari, S.; Nagaoka, H. Methods of Information Geometry; American Mathematical Society: Providence, RI, USA, 2000. [Google Scholar]
Heyer, H. Probability Measures on Locally Compact Groups; Springer: Berlin/Heidelberg, Germany, 1977. [Google Scholar]
Chirikjian, G.S. Stochastic Models, Information Theory, and Lie Groups; Birkhäuser: Boston, MA, USA, 2009. [Google Scholar]
Johnson, O.; Suhov, Y. Entropy and convergence on compact groups. J. Theor. Probab. 2000, 13, 843–857. [Google Scholar] [CrossRef]
Chirikjian, G.S. Information-theoretic inequalities on unimodular Lie groups. J. Geom. Mech. 2010, 2, 119–158. [Google Scholar] [CrossRef] [PubMed]
Jammalamadaka, S.R.; Kozubowski, T.J. New families of wrapped distributions for modeling skew circular data. Commun. Stat.–Theory Methods 2004, 33, 2059–2074. [Google Scholar] [CrossRef]
Chakraborty, S. Generating discrete analogues of continuous probability distributions-A survey of methods and constructions. J. Stat. Distrib. Appl. 2015, 2, 6. [Google Scholar] [CrossRef]
Nielsen, F. The Kullback–Leibler divergence between lattice Gaussian distributions. J. Indian Inst. Sci. 2022. [Google Scholar] [CrossRef]
Luzzi, L.; Vehkalahti, R.; Ling, C. Almost universal codes for MIMO wiretap channels. IEEE Trans. Inf. Theory 2018, 64, 7218–7241. [Google Scholar] [CrossRef]
Rioul, O. Variations on a Theme by Massey. IEEE Trans. Inf. Theory 2022, 68, 2813–2828. [Google Scholar] [CrossRef]
Kagan, A.; Smith, P.J. Multivariate normal distributions, Fisher information and matrix inequalities. Int. J. Math. Educ. Sci. Technol. 2001, 32, 91–96. [Google Scholar] [CrossRef]
Pontryagin, L.S. Topological Groups, 3rd ed.; Gordon and Breach Science Publishers: Montreux, Switzerland, 1986. [Google Scholar]
Raghunathan, M.S. Discrete Subgroups of Lie Groups; Springer: New York, NY, USA, 1972. [Google Scholar]
Reiter, H.; Stegeman, J.D. Classical Harmonic Analysis and Locally Compact Groups, 2nd ed.; Clarendon Press: Oxford, UK, 2000. [Google Scholar]
Forney, G.D. Multidimensional constellations. II. Voronoi constellations. IEEE J. Sel. Areas Commun. 1989, 7, 941–958. [Google Scholar] [CrossRef]
Boutros, J.J.; Jardel, F.; Méasson, C. Probabilistic shaping and non-binary codes. In Proceedings of the 2017 IEEE International Symposium on Information Theory (ISIT), Aachen, Germany, 25–30 June 2017; pp. 2308–2312. [Google Scholar]
Paula, G.T. Comparison of volumes of Siegel sets and fundamental domains for SL_n(ℤ). Geom. Dedicata 2019, 199, 291–306. [Google Scholar] [CrossRef]

Figure 1. Example of zero-mean Gaussian distributions and their corresponding wrapped, quantized and product distributions, with

Λ = Z

and

D = [- \frac{1}{2}, \frac{1}{2} [

for different variances:

σ^{2} = 0.25

(blue),

σ^{2} = 1

(orange),

σ^{2} = 4

(green).

Figure 1. Example of zero-mean Gaussian distributions and their corresponding wrapped, quantized and product distributions, with

Λ = Z

and

D = [- \frac{1}{2}, \frac{1}{2} [

for different variances:

σ^{2} = 0.25

(blue),

σ^{2} = 1

(orange),

σ^{2} = 4

(green).

Figure 2. Mutual information

I (X_{π}; X_{Q})

and its upper bound.

Figure 2. Mutual information

I (X_{π}; X_{Q})

and its upper bound.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.

Information Properties of a Random Variable Decomposition through Lattices †

Abstract

1. Introduction

2. Lattices, Wrapping and Quantization

2.1. Lattices and Fundamental Domains

2.2. Wrapping and Quantization

3. Information Properties

3.1. Information-Theoretic Measures

3.2. Fisher Information

4. A Generalization to Topological Groups

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Article Access Statistics

Information Properties of a Random Variable Decomposition through Lattices^†