Article

A Constrained Talagrand Transportation Inequality with Applications to Rate-Distortion-Perception Theory

Li Xie, Liangyan Li, Jun Chen, Lei Yu and Zhongshan Zhang

1. School of Information and Electronics, Beijing Institute of Technology, Beijing 100081, China
2. Department of Electrical and Computer Engineering, McMaster University, Hamilton, ON L8S 4K1, Canada
3. School of Statistics and Data Science, LPMC, KLMDASR, and LEBPS, Nankai University, Tianjin 300071, China
4. School of Cyberspace Science and Technology, Beijing Institute of Technology, Beijing 100081, China
* Author to whom correspondence should be addressed.
Entropy 2025, 27(4), 441; https://doi.org/10.3390/e27040441
Submission received: 13 March 2025 / Revised: 18 April 2025 / Accepted: 18 April 2025 / Published: 19 April 2025
(This article belongs to the Special Issue Advances in Information and Coding Theory, the Third Edition)

Abstract: A constrained version of Talagrand's transportation inequality is established, which reveals an intrinsic connection between the Gaussian distortion-rate-perception functions with limited common randomness under the Kullback–Leibler divergence-based and squared Wasserstein-2 distance-based perception measures. This connection provides an organizational framework for assessing existing bounds on these functions. In particular, we show that the best-known bounds of Xie et al. are nonredundant when examined through this connection.

1. Introduction

Traditional rate-distortion theory [1] seeks to determine the minimum rate required to encode a source while ensuring that the expected distortion remains below a given threshold. However, minimizing distortion alone does not always align with human perception, particularly in applications like image and audio compression, where perceptual quality plays a crucial role. Rate-distortion-perception theory addresses this by introducing a perception constraint [2], measured by a divergence between the source and reconstruction distributions, to ensure that the reconstructed signal remains perceptually similar to the original. This framework enables a more nuanced tradeoff between compression efficiency, signal fidelity, and perceptual quality, making it particularly relevant in modern machine learning and generative model-based compression techniques. The origin of rate-distortion-perception theory can be traced back to the foundational work of Klejsa et al. [3,4] and Saldi et al. [5,6] on distribution-preserving quantization. However, it was arguably the influential paper by Blau and Michaeli [7] (see also [8,9]) that brought the theory to the forefront of the research community’s attention. Since then, the field has developed rapidly, offering insights into architectural design principles [10,11,12,13], the role of randomness [14,15,16,17], and fundamental performance limits [18,19,20,21,22]. These advances have also catalyzed a variety of new research directions and applications [23,24,25,26,27,28,29,30,31,32].
Kullback–Leibler divergence and squared Wasserstein-2 distance are among the most widely adopted perception measures. When the source distribution is Gaussian, these two measures are intrinsically linked through Talagrand’s transportation inequality [33]. Exploring the implications of this connection in rate-distortion-perception theory is of significant interest. The availability of partial knowledge of the reconstruction distribution in perception-aware lossy source coding, in turn, motivates the study of constrained versions of Talagrand’s transportation inequality. Such inequalities will further strengthen the link between the information-theoretic performance limits under these two perception measures.
The main contributions of this paper are as follows:
1. We prove a variant of Talagrand's transportation inequality, where the reference distribution is Gaussian and the other distribution is subject to constraints on its first- and second-order statistics.
2. This inequality is then used to establish a connection between the Gaussian distortion-rate-perception functions with limited common randomness under the Kullback–Leibler divergence-based and squared Wasserstein-2 distance-based perception measures. We leverage this connection as an organizational framework to assess existing bounds on these functions. In particular, it is shown that the best-known bounds of Xie et al. [22] are nonredundant when examined through this connection.
The rest of this paper is organized as follows. Section 2 presents a constrained version of Talagrand's transportation inequality. Its application to rate-distortion-perception theory is explored in Section 3. We conclude this paper in Section 4.
We adopt standard notation for information measures, e.g., $h(\cdot)$ for differential entropy and $I(\cdot\,;\cdot)$ for mutual information. For a given random variable $X$, its distribution, mean, and variance are written as $p_X$, $\mu_X$, and $\sigma_X^2$, respectively. A Gaussian distribution with mean $\mu$ and variance $\sigma^2$ is denoted by $\mathcal{N}(\mu, \sigma^2)$. We use $\Pi(p_X, p_{\hat{X}})$ to represent the set of all possible joint distributions with marginals $p_X$ and $p_{\hat{X}}$. The cardinality of a set $\mathcal{S}$ is expressed as $|\mathcal{S}|$. For a real number $a$, define $(a)^+ := \max\{a, 0\}$. Throughout this paper, the logarithm function is assumed to have base $e$.

2. A Constrained Talagrand Transportation Inequality

For $p_X = \mathcal{N}(\mu_X, \sigma_X^2)$, Talagrand's transportation inequality [33] states that
$$W_2^2(p_X, p_{\hat{X}}) \le 2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X), \quad (1)$$
where
$$\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) := \mathbb{E}\left[\log \frac{p_{\hat{X}}(\hat{X})}{p_X(\hat{X})}\right] \quad (2)$$
is the Kullback–Leibler divergence and
$$W_2^2(p_X, p_{\hat{X}}) := \inf_{p_{X\hat{X}} \in \Pi(p_X, p_{\hat{X}})} \mathbb{E}[(X - \hat{X})^2] \quad (3)$$
is the squared Wasserstein-2 distance. Note that Talagrand's transportation inequality does not impose any assumptions on $p_{\hat{X}}$. In practice, however, we often have partial knowledge of $p_{\hat{X}}$, which can be exploited to strengthen the inequality. In this paper, we focus on the case where $p_{\hat{X}}$ satisfies $\mu_{\hat{X}} = \mu_X$ and $\sigma_{\hat{X}} \le \sigma_X$. Under these constraints on $p_{\hat{X}}$, we establish the following variant of Talagrand's transportation inequality:
Theorem 1.
For $p_X = \mathcal{N}(\mu_X, \sigma_X^2)$ and $p_{\hat{X}}$ with $\mu_{\hat{X}} = \mu_X$ and $\sigma_{\hat{X}} \le \sigma_X$,
$$W_2^2(p_X, p_{\hat{X}}) \le 2\sigma_X^2\left(1 - e^{-\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X)}\right). \quad (4)$$
It is clear that (4) is stronger than (1) since $1 + z \le e^z$ for $z \in \mathbb{R}$. To prove Theorem 1, we need the following well-known result (see, e.g., Propositions 1 and 2 of [22]) concerning the Gaussian extremal property of the Kullback–Leibler divergence and the squared Wasserstein-2 distance.
Lemma 1.
For $p_X = \mathcal{N}(\mu_X, \sigma_X^2)$ and $p_{\hat{X}}$ with $\mathbb{E}[\hat{X}^2] < \infty$,
$$\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) \ge \phi_{\mathrm{KL}}(p_{\hat{X}}^G \,\|\, p_X) = \log\frac{\sigma_X}{\sigma_{\hat{X}}} + \frac{(\mu_X - \mu_{\hat{X}})^2 + \sigma_{\hat{X}}^2 - \sigma_X^2}{2\sigma_X^2} \quad (5)$$
and
$$W_2^2(p_X, p_{\hat{X}}) \ge W_2^2(p_X, p_{\hat{X}}^G) = (\mu_X - \mu_{\hat{X}})^2 + (\sigma_X - \sigma_{\hat{X}})^2, \quad (6)$$
where $p_{\hat{X}}^G := \mathcal{N}(\mu_{\hat{X}}, \sigma_{\hat{X}}^2)$.
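The following minimal numerical sketch (Python, with hypothetical parameter values) uses the closed-form Gaussian expressions (5) and (6) to illustrate the comparison just discussed: for Gaussian $p_{\hat{X}}$ with $\mu_{\hat{X}} = \mu_X$ and $\sigma_{\hat{X}} \le \sigma_X$, it checks numerically that $W_2^2 \le 2\sigma_X^2(1 - e^{-\phi_{\mathrm{KL}}}) \le 2\sigma_X^2\,\phi_{\mathrm{KL}}$, i.e., that the constrained bound (4) tightens (1).

```python
import numpy as np

def kl_gaussian(mu_hat, sig_hat, mu_x, sig_x):
    # phi_KL(p_xhat^G || p_x) between two Gaussians, cf. Eq. (5)
    return (np.log(sig_x / sig_hat)
            + ((mu_x - mu_hat)**2 + sig_hat**2 - sig_x**2) / (2 * sig_x**2))

def w2sq_gaussian(mu_x, sig_x, mu_hat, sig_hat):
    # W_2^2(p_x, p_xhat^G) between two Gaussians, cf. Eq. (6)
    return (mu_x - mu_hat)**2 + (sig_x - sig_hat)**2

mu_x, sig_x = 0.0, 1.0
for sig_hat in [0.2, 0.5, 0.8, 1.0]:     # matched mean, reduced standard deviation
    kl = kl_gaussian(mu_x, sig_hat, mu_x, sig_x)
    w2 = w2sq_gaussian(mu_x, sig_x, mu_x, sig_hat)
    bound_4 = 2 * sig_x**2 * (1 - np.exp(-kl))   # constrained bound, Eq. (4)
    bound_1 = 2 * sig_x**2 * kl                  # Talagrand's bound, Eq. (1)
    assert w2 <= bound_4 + 1e-12 <= bound_1 + 1e-12
    print(f"sigma_hat={sig_hat:.1f}: W2^2={w2:.4f} <= (4)={bound_4:.4f} <= (1)={bound_1:.4f}")
```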
Lemma 1 indicates that when the reference distribution is Gaussian, replacing the other distribution with its Gaussian counterpart leads to reductions in both the Kullback–Leibler divergence and the squared Wasserstein-2 distance. These reductions turn out to be quantitatively related, as shown by the next result.
Lemma 2.
For $p_X = \mathcal{N}(\mu_X, \sigma_X^2)$ and $p_{\hat{X}}$ with $\mathbb{E}[\hat{X}^2] < \infty$,
$$W_2^2(p_X, p_{\hat{X}}) - W_2^2(p_X, p_{\hat{X}}^G) \le 2\sigma_X\sigma_{\hat{X}}\left(1 - e^{-\left(\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) - \phi_{\mathrm{KL}}(p_{\hat{X}}^G \,\|\, p_X)\right)}\right). \quad (7)$$
Proof of Lemma 2.
Note that
$$\begin{aligned} W_2^2(p_X, p_{\hat{X}}) &= (\mu_X - \mu_{\hat{X}})^2 + W_2^2\big(p_{X - \mu_X}, p_{\hat{X} - \mu_{\hat{X}}}\big) \\ &= (\mu_X - \mu_{\hat{X}})^2 + \sigma_X^2\, W_2^2\big(p_{\sigma_X^{-1}(X - \mu_X)}, p_{\sigma_X^{-1}(\hat{X} - \mu_{\hat{X}})}\big) \\ &\overset{(a)}{\le} (\mu_X - \mu_{\hat{X}})^2 + \sigma_X^2 + \sigma_{\hat{X}}^2 - 2\sigma_X^2\sqrt{\frac{1}{2\pi e}\, e^{2h(\sigma_X^{-1}\hat{X})}} \\ &\overset{(b)}{=} W_2^2(p_X, p_{\hat{X}}^G) + 2\sigma_X\sigma_{\hat{X}} - 2\sigma_X^2\sqrt{\frac{1}{2\pi e}\, e^{2h(\sigma_X^{-1}\hat{X})}}, \end{aligned} \quad (8)$$
where (a) is due to (Equation (8), [34]) and (b) is due to Lemma 1. Moreover,
$$h(\sigma_X^{-1}\hat{X}) = h(\hat{X}) - \log\sigma_X = \frac{1}{2}\log\frac{2\pi e\,\sigma_{\hat{X}}^2}{\sigma_X^2} - \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) + \phi_{\mathrm{KL}}(p_{\hat{X}}^G \,\|\, p_X). \quad (9)$$
Substituting (9) into (8) proves Lemma 2. □
Proof of Theorem 1.
In view of Lemmas 1 and 2,
$$W_2^2(p_X, p_{\hat{X}}) \le \max_{\mu, \sigma}\ \eta(\mu, \sigma) \quad (10)$$
$$\text{subject to}\quad \mu = \mu_X, \quad (11)$$
$$\sigma \le \sigma_X, \quad (12)$$
$$\frac{(\mu_X - \mu)^2}{2\sigma_X^2} + \psi(\sigma) \le \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X), \quad (13)$$
where
$$\eta(\mu, \sigma) := -2\sigma_X^2\, e^{\frac{(\mu_X - \mu)^2 + \sigma^2 - \sigma_X^2}{2\sigma_X^2} - \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X)} + (\mu_X - \mu)^2 + \sigma_X^2 + \sigma^2 \quad (14)$$
and
$$\psi(\sigma) := \log\frac{\sigma_X}{\sigma} + \frac{\sigma^2 - \sigma_X^2}{2\sigma_X^2}. \quad (15)$$
Indeed, by Lemma 2 together with the Gaussian expressions in Lemma 1, $W_2^2(p_X, p_{\hat{X}}) \le \eta(\mu_{\hat{X}}, \sigma_{\hat{X}})$, while the assumptions $\mu_{\hat{X}} = \mu_X$ and $\sigma_{\hat{X}} \le \sigma_X$ together with the lower bound in (5) yield Constraints (11)–(13).
Since $\psi(\sigma)$ decreases monotonically from $\infty$ to $0$ as $\sigma$ varies from $0$ to $\sigma_X$ and increases monotonically from $0$ to $\infty$ as $\sigma$ varies from $\sigma_X$ to $\infty$, there must exist $\underline{\sigma} \le \sigma_X$ and $\overline{\sigma} \ge \sigma_X$ satisfying
$$\psi(\underline{\sigma}) = \psi(\overline{\sigma}) = \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X). \quad (16)$$
Note that (10)–(13) can be written compactly as
$$W_2^2(p_X, p_{\hat{X}}) \le \max_{\sigma \in [\underline{\sigma}, \sigma_X]}\ \eta(\mu_X, \sigma). \quad (17)$$
For $\sigma \in [\underline{\sigma}, \sigma_X]$, we have
$$\frac{\sigma^2 - \sigma_X^2}{2\sigma_X^2} - \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) \le 0 \quad (18)$$
and, consequently,
$$\frac{\partial}{\partial\sigma}\,\eta(\mu_X, \sigma) = -2\sigma\, e^{\frac{\sigma^2 - \sigma_X^2}{2\sigma_X^2} - \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X)} + 2\sigma \ge 0, \quad (19)$$
which implies that the maximum in (17) is attained at $\sigma = \sigma_X$. Thus,
$$W_2^2(p_X, p_{\hat{X}}) \le \eta(\mu_X, \sigma_X) = 2\sigma_X^2\left(1 - e^{-\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X)}\right). \quad (20)$$
This proves Theorem 1. □
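The monotonicity argument in this proof can also be checked numerically. The sketch below (Python with SciPy root-finding; $\sigma_X$ and the KL budget $\phi$ are hypothetical values of our choosing) evaluates $\eta(\mu_X, \cdot)$ from (14) over $[\underline{\sigma}, \sigma_X]$ and confirms that it is nondecreasing with maximum $2\sigma_X^2(1 - e^{-\phi})$, as in (19) and (20).

```python
import numpy as np
from scipy.optimize import brentq

sig_x, phi = 1.0, 0.4      # hypothetical sigma_X and KL budget phi_KL(p_xhat || p_x)

def psi(s):                # Eq. (15)
    return np.log(sig_x / s) + (s**2 - sig_x**2) / (2 * sig_x**2)

def eta(s):                # Eq. (14) evaluated at mu = mu_X
    return (-2 * sig_x**2 * np.exp((s**2 - sig_x**2) / (2 * sig_x**2) - phi)
            + sig_x**2 + s**2)

sig_lo = brentq(lambda s: psi(s) - phi, 1e-9, sig_x)   # sigma_underline from Eq. (16)
grid = np.linspace(sig_lo, sig_x, 1000)
vals = eta(grid)
assert np.all(np.diff(vals) >= -1e-12)                 # eta(mu_X, .) nondecreasing, cf. (19)
assert np.isclose(vals[-1], 2 * sig_x**2 * (1 - np.exp(-phi)))   # maximum equals (20)
print(f"max eta = {vals[-1]:.6f}, bound (4) = {2 * sig_x**2 * (1 - np.exp(-phi)):.6f}")
```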
The following result shows that Talagrand’s transportation inequality (1) corresponds to a relaxed version of (10), obtained by removing Constraints (11) and (12).
Theorem 2.
For $p_X = \mathcal{N}(\mu_X, \sigma_X^2)$ and $p_{\hat{X}}$ with $\mathbb{E}[\hat{X}^2] < \infty$,
$$2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) = \max_{\mu, \sigma}\ \eta(\mu, \sigma) \quad (21)$$
$$\text{subject to}\quad \frac{(\mu_X - \mu)^2}{2\sigma_X^2} + \psi(\sigma) \le \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X). \quad (22)$$
Proof of Theorem 2.
First, recall the definitions of $\underline{\sigma}$ and $\overline{\sigma}$ from (16). It can be verified that
$$\frac{\partial}{\partial (\mu_X - \mu)^2}\,\eta(\mu, \sigma) = -e^{\frac{(\mu_X - \mu)^2 + \sigma^2 - \sigma_X^2}{2\sigma_X^2} - \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X)} + 1. \quad (23)$$
Given $\sigma < \underline{\sigma}$, there is no $\mu$ satisfying (22). Given $\sigma \in [\underline{\sigma}, \sigma_X]$, for $\mu$ satisfying (22), we have
$$\frac{(\mu_X - \mu)^2 + \sigma^2 - \sigma_X^2}{2\sigma_X^2} - \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) = \frac{(\mu_X - \mu)^2}{2\sigma_X^2} + \psi(\sigma) - \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) - \log\frac{\sigma_X}{\sigma} \le 0 \quad (24)$$
and, consequently,
$$\frac{\partial}{\partial (\mu_X - \mu)^2}\,\eta(\mu, \sigma) \ge 0, \quad (25)$$
which implies that the maximum value of $\eta(\mu, \sigma)$ over $\mu$ satisfying (22) is attained when
$$\log\frac{\sigma_X}{\sigma} + \frac{(\mu_X - \mu)^2 + \sigma^2 - \sigma_X^2}{2\sigma_X^2} = \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X). \quad (26)$$
Therefore, for $\sigma \in [\underline{\sigma}, \sigma_X]$,
$$\max_{\mu:\, (22)}\ \eta(\mu, \sigma) = \kappa(\sigma), \quad (27)$$
where
$$\kappa(\sigma) := 2\sigma_X^2\left(\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) - \log\frac{\sigma_X}{\sigma} + 1\right) - 2\sigma_X\sigma. \quad (28)$$
Since the maximum value of $\kappa(\sigma)$ over $\sigma \in [\underline{\sigma}, \sigma_X]$ is attained at $\sigma = \sigma_X$, it follows that
$$\max_{\sigma \in [\underline{\sigma}, \sigma_X]}\ \max_{\mu:\, (22)}\ \eta(\mu, \sigma) = 2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X). \quad (29)$$
Given $\sigma \in \big(\sigma_X, \sqrt{2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) + \sigma_X^2}\,\big)$, for $\mu$ satisfying (22), we have
$$\frac{\partial}{\partial (\mu_X - \mu)^2}\,\eta(\mu, \sigma)\ \begin{cases} \ge 0 & \text{if}\ \ \frac{(\mu_X - \mu)^2 + \sigma^2 - \sigma_X^2}{2\sigma_X^2} \le \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X), \\[4pt] < 0 & \text{if}\ \ \frac{(\mu_X - \mu)^2 + \sigma^2 - \sigma_X^2}{2\sigma_X^2} > \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X), \end{cases} \quad (30)$$
which implies that the maximum value of $\eta(\mu, \sigma)$ over $\mu$ satisfying (22) is attained when
$$\frac{(\mu_X - \mu)^2 + \sigma^2 - \sigma_X^2}{2\sigma_X^2} = \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X). \quad (31)$$
Therefore, for $\sigma \in \big(\sigma_X, \sqrt{2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) + \sigma_X^2}\,\big)$,
$$\max_{\mu:\, (22)}\ \eta(\mu, \sigma) = 2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X). \quad (32)$$
As a consequence,
$$\max_{\sigma \in (\sigma_X, \sqrt{2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) + \sigma_X^2})}\ \max_{\mu:\, (22)}\ \eta(\mu, \sigma) = 2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X). \quad (33)$$
Given $\sigma \in \big[\sqrt{2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) + \sigma_X^2},\ \overline{\sigma}\,\big]$, for $\mu$ satisfying (22), we have
$$\frac{\partial}{\partial (\mu_X - \mu)^2}\,\eta(\mu, \sigma) \le 0, \quad (34)$$
which implies that the maximum value of $\eta(\mu, \sigma)$ over $\mu$ satisfying (22) is attained when
$$(\mu_X - \mu)^2 = 0,\quad \text{i.e.,}\ \mu = \mu_X. \quad (35)$$
Therefore, for $\sigma \in \big[\sqrt{2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) + \sigma_X^2},\ \overline{\sigma}\,\big]$,
$$\max_{\mu:\, (22)}\ \eta(\mu, \sigma) = \tilde{\kappa}(\sigma), \quad (36)$$
where
$$\tilde{\kappa}(\sigma) := -2\sigma_X^2\, e^{\frac{\sigma^2 - \sigma_X^2}{2\sigma_X^2} - \phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X)} + \sigma_X^2 + \sigma^2. \quad (37)$$
Since the maximum value of $\tilde{\kappa}(\sigma)$ over $\sigma \in \big[\sqrt{2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) + \sigma_X^2},\ \overline{\sigma}\,\big]$ is attained at $\sigma = \sqrt{2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) + \sigma_X^2}$, it follows that
$$\max_{\sigma \in [\sqrt{2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X) + \sigma_X^2},\ \overline{\sigma}]}\ \max_{\mu:\, (22)}\ \eta(\mu, \sigma) = 2\sigma_X^2\,\phi_{\mathrm{KL}}(p_{\hat{X}} \,\|\, p_X). \quad (38)$$
Given $\sigma > \overline{\sigma}$, there is no $\mu$ satisfying (22). Combining (29), (33), and (38) proves Theorem 2. □

3. Application to Rate-Distortion-Perception Theory

A length-$n$ perception-aware lossy source coding system consists of an encoder $f^{(n)}: \mathbb{R}^n \times \mathcal{K} \to \mathcal{J}$, a decoder $g^{(n)}: \mathcal{J} \times \mathcal{K} \to \mathbb{R}^n$, and a random seed $K$. It takes an i.i.d. source sequence $X^n$ as input and produces an i.i.d. reconstruction sequence $\hat{X}^n$. Specifically, the encoder maps $X^n$ and $K$ to a codeword $J$ in codebook $\mathcal{J}$ according to some conditional distribution $p_{J|X^nK}$, while the decoder generates $\hat{X}^n$ based on $J$ and $K$ according to some conditional distribution $p_{\hat{X}^n|JK}$. Here, $K$ is assumed to be uniformly distributed over the alphabet $\mathcal{K}$ and independent of $X^n$. End-to-end distortion is quantified by $\frac{1}{n}\sum_{t=1}^n \mathbb{E}[(X_t - \hat{X}_t)^2]$ and perceptual quality by $\frac{1}{n}\sum_{t=1}^n \phi(p_{X_t}, p_{\hat{X}_t})$ for some divergence $\phi$. Since $X^n$ and $\hat{X}^n$ are i.i.d., it is clear that $\frac{1}{n}\sum_{t=1}^n \phi(p_{X_t}, p_{\hat{X}_t}) = \phi(p_X, p_{\hat{X}})$, where $p_X$ and $p_{\hat{X}}$ are the marginal distributions of $X^n$ and $\hat{X}^n$, respectively.
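As a toy illustration of these operational quantities (our own construction, not the coding schemes of [6] or [22]), the following Python sketch implements a length-1 dithered scalar quantizer in which the shared dither plays the role of the common randomness $K$, and then estimates the distortion and a Gaussian-fit proxy for the Kullback–Leibler perception loss; all parameter values are hypothetical.

```python
import numpy as np

# Toy length-1 illustration of the quantities in Definition 1 below: a dithered
# scalar quantizer whose shared dither acts as the common randomness K.
rng = np.random.default_rng(0)
n_samples, step = 100_000, 0.5
x = rng.normal(0.0, 1.0, n_samples)               # i.i.d. source, p_X = N(0, 1)
k = rng.uniform(-step / 2, step / 2, n_samples)   # seed K shared by encoder and decoder
j = np.round((x + k) / step)                      # encoder output: codeword index J
xhat = j * step - k                               # decoder output: Xhat = X + uniform noise
distortion = np.mean((x - xhat)**2)               # empirical distortion, approx step^2 / 12
# Gaussian-fit proxy for the perception loss: phi_KL(N(mu, var) || N(0, 1)), cf. Eq. (5);
# by Lemma 1 this lower-bounds the true Kullback-Leibler divergence.
mu_h, var_h = xhat.mean(), xhat.var()
kl_proxy = np.log(1.0 / np.sqrt(var_h)) + (mu_h**2 + var_h - 1.0) / 2.0
print(f"distortion = {distortion:.4f} (step^2/12 = {step**2 / 12:.4f}), "
      f"KL proxy = {kl_proxy:.4f}")
```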
Definition 1.
For an i.i.d. source $\{X_t\}_{t=1}^\infty$, distortion level $D$ is said to be achievable subject to the compression rate constraint $R$, the common randomness rate constraint $R_c$, and the perception constraint $P$ if there exists a length-$n$ perception-aware lossy source coding system such that
$$\frac{1}{n}\log|\mathcal{J}| \le R, \quad (39)$$
$$\frac{1}{n}\log|\mathcal{K}| \le R_c, \quad (40)$$
$$\frac{1}{n}\sum_{t=1}^n \mathbb{E}[(X_t - \hat{X}_t)^2] \le D, \quad (41)$$
$$\frac{1}{n}\sum_{t=1}^n \phi(p_{X_t}, p_{\hat{X}_t}) \le P. \quad (42)$$
Moreover, the reconstruction sequence $\hat{X}^n$ is required to be i.i.d. The infimum of such achievable distortion levels $D$ is denoted by $D(R, R_c, P \,|\, \phi)$.
The following result, which is built upon (Theorem 1, [6]) (see also (Theorem 2, [15])), provides a single-letter characterization of $D(R, R_c, P \,|\, \phi)$.
Theorem 3
(Theorem 1, [22]). For $p_X$ with $\mathbb{E}[X^2] < \infty$,
$$D(R, R_c, P \,|\, \phi) = \inf_{p_{U\hat{X}|X}}\ \mathbb{E}[(X - \hat{X})^2] \quad (43)$$
$$\text{subject to}\quad X \leftrightarrow U \leftrightarrow \hat{X}\ \text{form a Markov chain}, \quad (44)$$
$$I(X; U) \le R, \quad (45)$$
$$I(\hat{X}; U) \le R + R_c, \quad (46)$$
$$\phi(p_X, p_{\hat{X}}) \le P. \quad (47)$$
According to (Lemmas 1 and 3, [22]), for $p_X = \mathcal{N}(\mu_X, \sigma_X^2)$, there is no loss of generality in focusing on $p_{\hat{X}}$ with $\mu_{\hat{X}} = \mu_X$ and $\sigma_{\hat{X}} \le \sigma_X$ as far as $D(R, R_c, P \,|\, \phi_{\mathrm{KL}})$ and $D(R, R_c, P \,|\, W_2^2)$ are concerned; therefore, it follows from Theorem 1 that
$$D\big(R, R_c, 2\sigma_X^2(1 - e^{-P}) \,\big|\, W_2^2\big) \le D(R, R_c, P \,|\, \phi_{\mathrm{KL}}). \quad (48)$$
This reveals an intrinsic connection between the Gaussian distortion-rate-perception functions with limited common randomness under the Kullback–Leibler divergence-based and squared Wasserstein-2 distance-based perception measures. Since $D(R, R_c, P \,|\, \phi_{\mathrm{KL}})$ and $D(R, R_c, P \,|\, W_2^2)$ do not appear to admit explicit expressions, recent research efforts have been devoted to deriving bounds on these functions. Note that via (48), every lower bound on $D(R, R_c, \cdot \,|\, W_2^2)$ induces a corresponding lower bound on $D(R, R_c, \cdot \,|\, \phi_{\mathrm{KL}})$, and every upper bound on $D(R, R_c, \cdot \,|\, \phi_{\mathrm{KL}})$ induces a corresponding upper bound on $D(R, R_c, \cdot \,|\, W_2^2)$. As a consequence, a lower bound on $D(R, R_c, \cdot \,|\, \phi_{\mathrm{KL}})$ (or an upper bound on $D(R, R_c, \cdot \,|\, W_2^2)$) can be considered redundant if it is implied by a lower bound on $D(R, R_c, \cdot \,|\, W_2^2)$ (or an upper bound on $D(R, R_c, \cdot \,|\, \phi_{\mathrm{KL}})$) through this connection. This provides an organizational framework for assessing existing bounds on these functions. As an illustrative example, we examine from this perspective the best-known bounds due to Xie et al. [22], summarized in the following two theorems.
Let
$$\xi(R, R_c) := \sqrt{(1 - e^{-2R})(1 - e^{-2(R + R_c)})}. \quad (49)$$
Moreover, let $\sigma(P)$ be the unique number $\sigma \in [0, \sigma_X]$ satisfying $\psi(\sigma) = P$, where $\psi(\sigma)$ is defined in (15).
Theorem 4
(Theorem 3, [22]). For $p_X = \mathcal{N}(\mu_X, \sigma_X^2)$,
$$\underline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}}) \le D(R, R_c, P \,|\, \phi_{\mathrm{KL}}) \le \overline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}}), \quad (50)$$
where
$$\underline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}}) := \min_{\sigma_{\hat{X}} \in [\sigma(P), \sigma_X]}\ \sigma_X^2 + \sigma_{\hat{X}}^2 - 2\sigma_X\sigma_{\hat{X}}\sqrt{(1 - e^{-2R})\big(1 - e^{-2(R + R_c + P - \psi(\sigma_{\hat{X}}))}\big)} \quad (51)$$
and
$$\overline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}}) := \sigma_X^2 - \sigma_X^2\,\xi^2(R, R_c) + \Big(\big(\sigma(P) - \sigma_X\,\xi(R, R_c)\big)^+\Big)^2. \quad (52)$$
Theorem 5
(Theorem 4, [22]). For $p_X = \mathcal{N}(\mu_X, \sigma_X^2)$,
$$\underline{D}(R, R_c, P \,|\, W_2^2) \le D(R, R_c, P \,|\, W_2^2) \le \overline{D}(R, R_c, P \,|\, W_2^2), \quad (53)$$
where
$$\underline{D}(R, R_c, P \,|\, W_2^2) := \min_{\sigma_{\hat{X}} \in [(\sigma_X - \sqrt{P})^+, \sigma_X]}\ \sigma_X^2 + \sigma_{\hat{X}}^2 - 2\sigma_X\sqrt{(1 - e^{-2R})\Big(\sigma_{\hat{X}}^2 - \big(\big(\sigma_X e^{-(R + R_c)} - \sqrt{P}\big)^+\big)^2\Big)} \quad (54)$$
and
$$\overline{D}(R, R_c, P \,|\, W_2^2) := \sigma_X^2 - \sigma_X^2\,\xi^2(R, R_c) + \Big(\big(\sigma_X - \sqrt{P} - \sigma_X\,\xi(R, R_c)\big)^+\Big)^2. \quad (55)$$
In view of (48), Theorems 4 and 5 imply that
$$D(R, R_c, P \,|\, \phi_{\mathrm{KL}}) \ge \underline{D}\big(R, R_c, 2\sigma_X^2(1 - e^{-P}) \,\big|\, W_2^2\big) \quad (56)$$
and
$$D(R, R_c, P \,|\, W_2^2) \le \overline{D}\big(R, R_c, \nu(P) \,\big|\, \phi_{\mathrm{KL}}\big), \quad (57)$$
where
$$\nu(P) := \log\frac{2\sigma_X^2}{(2\sigma_X^2 - P)^+}. \quad (58)$$
It is thus of considerable interest to see how these induced bounds compare to their counterparts in Theorems 4 and 5, namely
$$D(R, R_c, P \,|\, \phi_{\mathrm{KL}}) \ge \underline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}}) \quad (59)$$
and
$$D(R, R_c, P \,|\, W_2^2) \le \overline{D}(R, R_c, P \,|\, W_2^2). \quad (60)$$
The following result indicates that (56) and (57) are, in general, looser. In this sense, (59) and (60) are nonredundant.
Theorem 6.
For $p_X = \mathcal{N}(\mu_X, \sigma_X^2)$,
$$\underline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}}) \ge \underline{D}\big(R, R_c, 2\sigma_X^2(1 - e^{-P}) \,\big|\, W_2^2\big) \quad (61)$$
and
$$\overline{D}(R, R_c, P \,|\, W_2^2) \le \overline{D}\big(R, R_c, \nu(P) \,\big|\, \phi_{\mathrm{KL}}\big). \quad (62)$$
Proof of Theorem 6.
In view of the definitions of $\underline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}})$ and $\underline{D}(R, R_c, 2\sigma_X^2(1 - e^{-P}) \,|\, W_2^2)$, for the purpose of proving (61), it suffices to show
$$[\sigma(P), \sigma_X] \subseteq \Big[\Big(\sigma_X - \sqrt{2\sigma_X^2(1 - e^{-P})}\Big)^+,\ \sigma_X\Big] \quad (63)$$
and
$$\sigma_{\hat{X}}^2 - \Big(\Big(\sigma_X e^{-(R + R_c)} - \sqrt{2\sigma_X^2(1 - e^{-P})}\Big)^+\Big)^2 \ge \sigma_{\hat{X}}^2 - \sigma_{\hat{X}}^2\, e^{-2(R + R_c + P - \psi(\sigma_{\hat{X}}))} \quad (64)$$
for $\sigma_{\hat{X}} \in [\sigma(P), \sigma_X]$. Invoking (4) with $p_{\hat{X}} = \mathcal{N}(\mu_X, \sigma^2(P))$ (see also Lemma 1 for the expressions of the Kullback–Leibler divergence and the squared Wasserstein-2 distance between two Gaussian distributions) yields
$$(\sigma_X - \sigma(P))^2 \le 2\sigma_X^2(1 - e^{-P}), \quad (65)$$
from which (63) follows immediately. Note that (64) is trivially true when $e^{-(R + R_c)} \le \sqrt{2(1 - e^{-P})}$. When $e^{-(R + R_c)} > \sqrt{2(1 - e^{-P})}$, it can be written equivalently as
$$\sqrt{2(1 - e^{-P})} \ge e^{-(R + R_c)}\Big(1 - e^{-\big(P + \frac{\sigma_X^2 - \sigma_{\hat{X}}^2}{2\sigma_X^2}\big)}\Big). \quad (66)$$
Since $e^{-(R + R_c)} \le 1$ and
$$1 - e^{-\big(P + \frac{\sigma_X^2 - \sigma_{\hat{X}}^2}{2\sigma_X^2}\big)} \le 1 - e^{-\big(P + \frac{\sigma_X^2 - \sigma^2(P)}{2\sigma_X^2}\big)} \quad (67)$$
for $\sigma_{\hat{X}} \in [\sigma(P), \sigma_X]$, it suffices to show
$$\sqrt{2(1 - e^{-P})} \ge 1 - e^{-\big(P + \frac{\sigma_X^2 - \sigma^2(P)}{2\sigma_X^2}\big)}. \quad (68)$$
According to the definition of $\sigma(P)$,
$$P = \log\frac{\sigma_X}{\sigma(P)} + \frac{\sigma^2(P) - \sigma_X^2}{2\sigma_X^2}. \quad (69)$$
Substituting (69) into (68) gives
$$\sqrt{2\Big(1 - e^{\log\frac{\sigma(P)}{\sigma_X} - \frac{\sigma^2(P)}{2\sigma_X^2} + \frac{1}{2}}\Big)} \ge 1 - \frac{\sigma(P)}{\sigma_X}. \quad (70)$$
We can rewrite (70) as
$$\tau(\beta) \ge 0, \quad (71)$$
where
$$\tau(\beta) := 1 - 2\beta\, e^{-\frac{\beta^2}{2} + \frac{1}{2}} + 2\beta - \beta^2 \quad (72)$$
with $\beta := \sigma(P)/\sigma_X$. Note that $\beta \in [0, 1]$. We have
$$\frac{d\tau(\beta)}{d\beta} = -2e^{-\frac{\beta^2}{2} + \frac{1}{2}} + 2\beta^2 e^{-\frac{\beta^2}{2} + \frac{1}{2}} + 2 - 2\beta \le -2(1 - \beta^2) + 2 - 2\beta = -2(1 - \beta)\beta \le 0. \quad (73)$$
Since $\tau(\beta)$ is nonincreasing on $[0, 1]$ and $\tau(1) = 0$, it follows that $\tau(\beta) \ge 0$ for $\beta \in [0, 1]$, which verifies (71) and consequently proves (64).
Now, we proceed to prove (62), which is equivalent to
$$\overline{D}\big(R, R_c, 2\sigma_X^2(1 - e^{-P}) \,\big|\, W_2^2\big) \le \overline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}}). \quad (74)$$
Since $\overline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}}) = \overline{D}(R, R_c, (\sigma_X - \sigma(P))^2 \,|\, W_2^2)$ and $\overline{D}(R, R_c, \cdot \,|\, W_2^2)$ is nonincreasing in the perception level, it suffices to show
$$(\sigma_X - \sigma(P))^2 \le 2\sigma_X^2(1 - e^{-P}), \quad (75)$$
i.e.,
$$P \ge \log\frac{2\sigma_X^2}{\sigma_X^2 - \sigma^2(P) + 2\sigma_X\sigma(P)}. \quad (76)$$
Substituting (69) into (76) and rearranging the inequality yields
$$\log\frac{\sigma_X^2 - \sigma^2(P) + 2\sigma_X\sigma(P)}{2\sigma_X\sigma(P)} \ge \frac{\sigma_X^2 - \sigma^2(P)}{2\sigma_X^2}, \quad (77)$$
which is indeed true since
$$\log\frac{\sigma_X^2 - \sigma^2(P) + 2\sigma_X\sigma(P)}{2\sigma_X\sigma(P)} \overset{(a)}{\ge} 1 - \frac{2\sigma_X\sigma(P)}{\sigma_X^2 - \sigma^2(P) + 2\sigma_X\sigma(P)} = \frac{\sigma_X^2 - \sigma^2(P)}{\sigma_X^2 - \sigma^2(P) + 2\sigma_X\sigma(P)} \ge \frac{\sigma_X^2 - \sigma^2(P)}{2\sigma_X^2}, \quad (78)$$
where (a) is due to $\log z \ge 1 - \frac{1}{z}$ for $z > 0$. This completes the proof of (62). □
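As a quick sanity check of the key step (71)–(73), the following short sketch (Python; the grid resolution and tolerances are hypothetical choices of ours) confirms numerically that $\tau$ is nonincreasing on $[0, 1]$ with $\tau(1) = 0$, and hence nonnegative throughout.

```python
import numpy as np

# Numerical sanity check of Eqs. (71)-(73): tau is nonincreasing on [0, 1] and
# tau(1) = 0, hence tau(beta) >= 0 (small tolerances absorb floating-point error).
beta = np.linspace(0.0, 1.0, 10001)
tau = 1 - 2 * beta * np.exp(-beta**2 / 2 + 0.5) + 2 * beta - beta**2   # Eq. (72)
assert np.all(np.diff(tau) <= 1e-12), "tau should be nonincreasing"
assert np.isclose(tau[-1], 0.0) and np.all(tau >= -1e-12)
```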
It can be seen from Figure 1 that $\underline{D}(R, R_c, 2\sigma_X^2(1 - e^{-P}) \,|\, W_2^2)$ is indeed a looser lower bound on $D(R, R_c, P \,|\, \phi_{\mathrm{KL}})$ as compared to $\underline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}})$, and the latter almost meets the upper bound $\overline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}})$. Similarly, Figure 2 shows that $\overline{D}(R, R_c, \nu(P) \,|\, \phi_{\mathrm{KL}})$ is indeed a looser upper bound on $D(R, R_c, P \,|\, W_2^2)$ as compared to $\overline{D}(R, R_c, P \,|\, W_2^2)$, especially in the low-rate regime, where the latter has a diminishing gap from the lower bound $\underline{D}(R, R_c, P \,|\, W_2^2)$.
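For readers who wish to reproduce this comparison, the following self-contained sketch (Python with NumPy/SciPy) evaluates the bounds of Theorems 4 and 5 on a grid and checks the inequalities of Theorem 6; the helper names are our own, and the parameter values match the setting of the figures ($p_X = \mathcal{N}(0, 1)$, $R_c = 0$, $P = 0.1$).

```python
import numpy as np
from scipy.optimize import brentq

sig_x, Rc, P = 1.0, 0.0, 0.1             # setting of Figures 1 and 2

def psi(s):                              # Eq. (15)
    return np.log(sig_x / s) + (s**2 - sig_x**2) / (2 * sig_x**2)

def sigma_of(p):                         # sigma(P): root of psi(sigma) = p in (0, sigma_X]
    return brentq(lambda s: psi(s) - p, 1e-12, sig_x)

def xi(R):                               # Eq. (49)
    return np.sqrt((1 - np.exp(-2 * R)) * (1 - np.exp(-2 * (R + Rc))))

def D_lo_kl(R, p):                       # lower bound (51), minimized on a grid
    sg = np.linspace(sigma_of(p), sig_x, 2001)
    return np.min(sig_x**2 + sg**2 - 2 * sig_x * sg * np.sqrt(
        (1 - np.exp(-2 * R)) * (1 - np.exp(-2 * (R + Rc + p - psi(sg))))))

def D_lo_w2(R, p):                       # lower bound (54), minimized on a grid
    sg = np.linspace(max(sig_x - np.sqrt(p), 0.0), sig_x, 2001)
    inner = sg**2 - np.maximum(sig_x * np.exp(-(R + Rc)) - np.sqrt(p), 0.0)**2
    return np.min(sig_x**2 + sg**2 - 2 * sig_x * np.sqrt((1 - np.exp(-2 * R)) * inner))

def D_hi_kl(R, p):                       # upper bound (52)
    return sig_x**2 * (1 - xi(R)**2) + max(sigma_of(p) - sig_x * xi(R), 0.0)**2

def D_hi_w2(R, p):                       # upper bound (55)
    return sig_x**2 * (1 - xi(R)**2) + max(sig_x - np.sqrt(p) - sig_x * xi(R), 0.0)**2

nu = lambda p: np.log(2 * sig_x**2 / max(2 * sig_x**2 - p, 0.0))    # Eq. (58)

for R in [0.25, 0.5, 1.0, 2.0]:
    assert D_lo_kl(R, P) >= D_lo_w2(R, 2 * sig_x**2 * (1 - np.exp(-P))) - 1e-6  # Eq. (61)
    assert D_hi_w2(R, P) <= D_hi_kl(R, nu(P)) + 1e-9                            # Eq. (62)
    print(f"R={R}: lower bounds ({D_lo_kl(R, P):.4f} vs "
          f"{D_lo_w2(R, 2 * sig_x**2 * (1 - np.exp(-P))):.4f}), "
          f"upper bounds ({D_hi_w2(R, P):.4f} vs {D_hi_kl(R, nu(P)):.4f})")
```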
The fact that the bounds in Theorems 4 and 5 are nonredundant when examined through the connection in (48) serves as evidence of their non-triviality. Consequently, further improvements will likely require exploring deeper properties of the Kullback–Leibler divergence and squared Wasserstein-2 distance.

4. Conclusions

In this work, we have established a constrained variant of Talagrand’s transportation inequality. This result reveals a fundamental link between the information-theoretic performance limits of perception-aware lossy source coding under the Kullback–Leibler divergence-based and squared Wasserstein-2 distance-based perception measures. Moreover, it provides an organizational framework for assessing existing bounds in this setting. We believe that similar approaches could be applied to other perception measures. More broadly, the interplay between transportation inequalities and rate-distortion-perception theory presents a rich avenue for further exploration, with promising implications for both theoretical advancements and practical applications.

Author Contributions

Conceptualization, L.X. and J.C.; methodology, J.C. and L.Y.; validation, L.L.; formal analysis, L.X. and J.C.; writing—original draft preparation, L.X. and J.C.; writing—review and editing, J.C.; visualization, L.L.; supervision, Z.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Cover, T.M.; Thomas, J.A. Elements of Information Theory; Wiley: New York, NY, USA, 1991. [Google Scholar]
  2. Blau, Y.; Michaeli, T. The perception-distortion tradeoff. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 6228–6237. [Google Scholar]
  3. Li, M.; Klejsa, J.; Kleijn, W.B. Distribution preserving quantization with dithering and transformation. IEEE Signal Process. Lett. 2010, 17, 1014–1017. [Google Scholar] [CrossRef]
  4. Klejsa, J.; Zhang, G.; Li, M.; Kleijn, W.B. Multiple description distribution preserving quantization. IEEE Trans. Signal Process. 2013, 61, 6410–6422. [Google Scholar] [CrossRef]
  5. Saldi, N.; Linder, T.; Yüksel, S. Randomized quantization and source coding with constrained output distribution. IEEE Trans. Inf. Theory 2015, 61, 91–106. [Google Scholar] [CrossRef]
  6. Saldi, N.; Linder, T.; Yüksel, S. Output constrained lossy source coding with limited common randomness. IEEE Trans. Inf. Theory 2015, 61, 4984–4998. [Google Scholar] [CrossRef]
  7. Blau, Y.; Michaeli, T. Rethinking lossy compression: The rate-distortion-perception tradeoff. Proc. Mach. Learn. Res. 2019, 97, 675–685. [Google Scholar]
  8. Matsumoto, R. Introducing the perception-distortion tradeoff into the rate-distortion theory of general information sources. IEICE Comm. Express 2018, 7, 427–431. [Google Scholar] [CrossRef]
  9. Matsumoto, R. Rate-distortion-perception tradeoff of variable-length source coding for general information sources. IEICE Comm. Express 2019, 8, 38–42. [Google Scholar] [CrossRef]
  10. Yan, Z.; Wen, F.; Ying, R.; Ma, C.; Liu, P. On perceptual lossy compression: The cost of perceptual reconstruction and an optimal training framework. Proc. Mach. Learn. Res. 2021, 139, 11682–11692. [Google Scholar]
  11. Zhang, G.; Qian, J.; Chen, J.; Khisti, A. Universal rate-distortion-perception representations for lossy compression. In Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Online, 6–14 December 2021; pp. 11517–11529. [Google Scholar]
  12. Freirich, D.; Michaeli, T.; Meir, R. A theory of the distortion-perception tradeoff in Wasserstein space. In Proceedings of the 35th Conference on Neural Information Processing Systems (NeurIPS 2021), Online, 6–14 December 2021; pp. 25661–25672. [Google Scholar]
  13. Yan, Z.; Wen, F.; Liu, P. Optimally controllable perceptual lossy compression. In Proceedings of the ICMLC 2022: 2022 14th International Conference on Machine Learning and Computing, Guangzhou, China, 18–21 February 2022; pp. 24911–24928. [Google Scholar]
  14. Theis, L.; Agustsson, E. On the advantages of stochastic encoders. In Proceedings of the 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, 3–7 May 2021; pp. 1–8. [Google Scholar]
  15. Wagner, A.B. The rate-distortion-perception tradeoff: The role of common randomness. arXiv 2022, arXiv:2202.04147. [Google Scholar]
  16. Chen, J.; Yu, L.; Wang, J.; Shi, W.; Ge, Y.; Tong, W. On the rate-distortion-perception function. IEEE J. Sel. Areas Inf. Theory 2022, 3, 664–673. [Google Scholar] [CrossRef]
  17. Hamdi, Y.; Wagner, A.B.; Gündüz, D. The rate-distortion-perception trade-off: The role of private randomness. In Proceedings of the 2024 IEEE International Symposium on Information Theory (ISIT 2024), Athens, Greece, 7–12 July 2024; pp. 1083–1088. [Google Scholar]
  18. Theis, L.; Wagner, A.B. A coding theorem for the rate-distortion-perception function. In Proceedings of the 9th International Conference on Learning Representations, ICLR 2021, Virtual Event, 3–7 May 2021; pp. 1–5. [Google Scholar]
  19. Freirich, D.; Weinberger, N.; Meir, R. Characterization of the distortion-perception tradeoff for finite channels with arbitrary metrics. In Proceedings of the 2024 IEEE International Symposium on Information Theory (ISIT 2024), Athens, Greece, 7–12 July 2024; pp. 238–243. [Google Scholar]
  20. Serra, G.; Stavrou, P.A.; Kountouris, M. On the computation of the Gaussian rate–distortion–perception function. IEEE J. Sel. Areas Inf. Theory 2024, 5, 314–330. [Google Scholar] [CrossRef]
  21. Qian, J.; Salehkalaibar, S.; Chen, J.; Khisti, A.; Yu, W.; Shi, W.; Ge, Y.; Tong, W. Rate-distortion-perception tradeoff for vector Gaussian sources. IEEE J. Sel. Areas Inf. Theory 2025, 6, 1–17. [Google Scholar] [CrossRef]
  22. Xie, L.; Li, L.; Chen, J.; Zhang, Z. Output-constrained lossy source coding with application to rate-distortion-perception theory. IEEE Trans. Commun. 2025, 73, 1801–1815. [Google Scholar] [CrossRef]
  23. Xu, T.; Zhang, Q.; Li, Y.; He, D.; Wang, Z.; Wang, Y.; Qin, H.; Wang, Y.; Liu, J.; Zhang, Y.-Q. Conditional perceptual quality preserving image compression. arXiv 2023, arXiv:2308.08154. [Google Scholar]
  24. Niu, X.; Gündüz, D.; Bai, B.; Han, W. Conditional rate-distortion-perception trade-off. In Proceedings of the 2023 IEEE International Symposium on Information Theory (ISIT), Taipei, Taiwan, 25–30 June 2023; pp. 1068–1073. [Google Scholar]
  25. Qiu, Y.; Wagner, A.B.; Ballé, J.; Theis, L. Wasserstein distortion: Unifying fidelity and realism. In Proceedings of the 2024 58th Annual Conference on Information Sciences and Systems (CISS), Princeton, NJ, USA, 11–13 March 2024; pp. 1–6. [Google Scholar]
  26. Qiu, Y.; Wagner, A.B. Low-rate, low-distortion compression with Wasserstein distortion. In Proceedings of the 2024 IEEE International Symposium on Information Theory (ISIT 2024), Athens, Greece, 7–12 July 2024; pp. 855–860. [Google Scholar]
  27. Salehkalaibar, S.; Chen, J.; Khisti, A.; Yu, W. Rate-distortion-perception tradeoff based on the conditional-distribution perception measure. IEEE Trans. Inf. Theory 2024, 70, 8432–8454. [Google Scholar] [CrossRef]
  28. Zhou, C.; Lu, G.; Li, J.; Chen, X.; Cheng, Z.; Song, L.; Zhang, W. Controllable distortion-perception tradeoff through latent diffusion for neural image compression. In Proceedings of the 2025 AAAI Conference on Artificial Intelligence, Dubai, United Arab Emirates, 20–22 May 2025; pp. 10725–10733. [Google Scholar]
  29. Niu, X.; Bai, B.; Guo, N.; Zhang, W.; Han, W. Rate–distortion–perception trade-off in information theory, generative models, and intelligent communications. Entropy 2025, 27, 373. [Google Scholar] [CrossRef]
  30. Gunlu, O.; Skorski, M.; Poor, H.V. Low-latency rate-distortion-perception trade-off: A randomized distributed function computation application. Cryptology ePrint Archive, Paper 2025/613, 2025. Available online: https://eprint.iacr.org/2025/613 (accessed on 12 March 2025).
  31. Tan, K.; Dai, J.; Liu, Z.; Wang, S.; Qin, X.; Xu, W.; Niu, K.; Zhang, P. Rate-distortion-perception controllable joint source-channel coding for high-fidelity generative semantic communications. IEEE Trans. Cogn. Commun. Netw. 2025, 11, 672–686. [Google Scholar] [CrossRef]
  32. Lei, E.; Hassani, H.; Bidokhti, S.S. Optimal neural compressors for the rate-distortion-perception tradeoff. arXiv 2025, arXiv:2503.17558. [Google Scholar]
  33. Talagrand, M. Transportation cost for Gaussian and other product measures. Geom. Funct. Anal. 1996, 6, 587–600. [Google Scholar] [CrossRef]
  34. Bai, Y.; Wu, X.; Özgür, A. Information constrained optimal transport: From Talagrand, to Marton, to Cover. IEEE Trans. Inf. Theory 2023, 69, 2059–2073. [Google Scholar] [CrossRef]
Figure 1. Illustrations of $\overline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}})$, $\underline{D}(R, R_c, P \,|\, \phi_{\mathrm{KL}})$, and $\underline{D}(R, R_c, 2\sigma_X^2(1 - e^{-P}) \,|\, W_2^2)$ for $p_X = \mathcal{N}(0, 1)$, $R_c = 0$, and $P = 0.1$.
Figure 2. Illustrations of $\overline{D}(R, R_c, \nu(P) \,|\, \phi_{\mathrm{KL}})$, $\overline{D}(R, R_c, P \,|\, W_2^2)$, and $\underline{D}(R, R_c, P \,|\, W_2^2)$ for $p_X = \mathcal{N}(0, 1)$, $R_c = 0$, and $P = 0.1$.
