Article

Robust Biometric Authentication from an Information Theoretic Perspective †

1 Chair of Theoretical Information Technology, Technical University of Munich, Munich 80290, Germany
2 Information Theory and Applications Chair, Technische Universität Berlin, Berlin 10587, Germany
* Author to whom correspondence should be addressed.
† This paper is an extended version of our paper published in the 7th IEEE International Workshop on Information Forensics and Security, Rome, Italy, 16–19 November 2015.
Entropy 2017, 19(9), 480; https://doi.org/10.3390/e19090480
Submission received: 22 June 2017 / Revised: 28 August 2017 / Accepted: 7 September 2017 / Published: 9 September 2017
(This article belongs to the Special Issue Information-Theoretic Security)

Abstract

Robust biometric authentication is studied from an information theoretic perspective. Compound sources are used to account for uncertainty in the knowledge of the source statistics and are further used to model certain attack classes. It is shown that authentication is robust against source uncertainty and a special class of attacks under the strong secrecy condition. A single-letter characterization of the privacy secrecy capacity region is derived for the generated and chosen secret key models. Furthermore, the question of whether small variations of the compound source lead to large losses of the privacy secrecy capacity region is studied. It is shown that biometric authentication is robust in the sense that its privacy secrecy capacity region depends continuously on the compound source.

1. Introduction

Biometric identifiers, such as fingerprints and iris and retina scans, are becoming increasingly attractive for use in security systems, for example in authentication and identification systems, because of their uniqueness and time-invariant characteristics. Conventional personal authentication systems usually use secret passwords or physical tokens to guarantee the legitimacy of a person. Biometric authentication systems, on the other hand, use the physical characteristics of a person to guarantee the legitimacy of the person to be authenticated.
Biometric authentication systems are decomposed into two phases: the enrollment phase and the authentication phase. A simple authentication approach is to gather biometric measurements in the enrollment phase, apply a one-way function and then store the result in a public database. In the authentication phase, new biometric measurements are gathered, the same one-way function is applied, and the outcome is compared to the one stored in the database. Unfortunately, biometric measurements might be affected by noise. To deal with noisy data, error correction is needed. Therefore, helper data is also generated during the enrollment phase based on the biometric measurements and stored in the public database; it is then used in the authentication phase to correct the noisy imperfections of the measurements.
Since the database containing the helper data is public, an eavesdropper can access the data at will. How can we prevent an eavesdropper from gaining information about the biometric data from the publicly stored helper data? One is interested in encoding the biometric data into helper data and a secret key such that the helper data does not reveal any information about the secret key. Cryptographic techniques are one approach to keeping the key secret. However, security on higher layers is usually based on the assumption of insufficient computational capabilities of eavesdroppers. Information theoretic security, on the contrary, uses the physical properties of the source to guarantee security independently of the computational capabilities of the adversary. This line of research was initiated by Shannon in [1] and has attracted considerable interest recently—cf., for example, the recent textbooks [2,3,4] and references therein. In particular, Ahlswede and Csiszár in [5] and Maurer in [6] introduced a secret key sharing model. It consists of two terminals that observe the correlated sequences of a joint source. Both terminals generate a common key based on their observations and using public communication. The message transmitted over the public channel should not leak any amount of information about the common key.
Both works mentioned above use the weak secrecy condition as a measure of secrecy. Given a code of a certain blocklength, the weak secrecy condition is fulfilled if the mutual information between the key and the available information at the eavesdropper normalized by the code blocklength is arbitrarily small for large blocklengths. On the other hand, the strong secrecy condition is fulfilled if the un-normalized mutual information between the key and the available information at the eavesdropper is arbitrarily small for large blocklengths, i.e., the total amount of information leaked to the eavesdropper is negligible. The secret key sharing model satisfying the strong secrecy condition has been studied in [7].
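For concreteness, the two conditions can be stated as follows (a standard formulation, written here with the notation used later in Section 2, where $K$ is the key, $M$ is the public message, and $n$ is the blocklength):
$$\text{weak secrecy:}\ \ \frac{1}{n} I(K;M) \le \epsilon_n, \qquad \text{strong secrecy:}\ \ I(K;M) \le \epsilon_n, \qquad \text{with } \epsilon_n \to 0 \text{ as } n \to \infty.$$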
One could model biometric authentication similar to this secret key generation source model; however, this model does not take into account the amount of information that the public data (the helper data in the biometric scenario) leaks about the biometric measurement. The goal of biometric authentication is to perform a secret and successful authentication procedure without compromising the information about the user (privacy leakage). Biometric information is unique and cannot be replaced, so once it is compromised, it is compromised forever, which might lead to identity theft (see [8,9,10] for more information on privacy concerns). Since the helper data we use to deal with noisy data is a function of the biometric measurements, it contains information about the biometric measurement. Thus, if attackers break into the database, they may be able to extract information about the biometric measurement from the stored helper data. Hence, we aim to control the privacy leakage as well. An information theoretic approach to secure biometric authentication controlling the privacy leakage was studied in [11,12] under ideal conditions, i.e., with perfect source state information (SSI) and without the presence of active attackers.
In both references [11,12], the capacity results were derived under the weak secrecy condition. In [13], the capacity result for sequential key-distillation with rate-limited one-way public communication was shown under the strong secrecy condition.
For reliable authentication, SSI is needed; however, in practical systems, it is never perfectly available. Compound sources model a simple and realistic SSI scenario in which the legitimate users are not aware of the actual source realization. Nevertheless, they know that it belongs to a known uncertainty set of sources and that it remains constant during the entire observation. This model was first introduced and studied in [14,15] in a channel coding context. Compound sources can also model the presence of an active attacker who is able to control the state of the source. We are interested in performing an authentication process that is robust against such uncertainties and attacks. Secret key generation under source uncertainty was studied in [16,17,18,19]. In [16], secret key generation using compound joint sources was studied and the key-capacity was established.
In [20], the achievability result of the privacy secrecy capacity region for generated secret keys for compound sources was derived under the weak secrecy condition. In this work, we study robust biometric authentication in detail and extend this result in several directions. First, we consider a model where the legitimate users suffer from source uncertainty and/or attacks and derive achievability results under the strong secrecy condition for both generated and chosen secret key authentication. We then provide matching converses to obtain single-letter characterizations of the corresponding privacy secrecy capacity regions.
We further address the following question: can small changes of the compound source cause large changes in the privacy secrecy capacity region? Such a question was first studied in [21] for arbitrarily varying quantum channels (AVQCs), showing that the deterministic capacity has discontinuity points, while the randomness-assisted capacity is a continuous function of the AVQC. This line of research is continued in [22,23], in which the classical compound wiretap channel, the arbitrarily varying wiretap channel (AVWC), and the compound broadcast channel with confidential messages (BCC) are studied. We study this question for the biometric authentication problem at hand and show that the corresponding privacy secrecy capacity regions are continuous functions of the underlying uncertainty sets. Thus, small changes in the compound set lead only to small changes in the capacity region.
The rest of this paper is organized as follows. In Section 2, we introduce the biometric authentication model for perfect SSI and present the corresponding capacity results. In Section 3, we introduce the biometric authentication model for compound sources and show that reliable authentication that is secure under the strong secrecy condition is possible at positive rates despite source uncertainty, deriving a single-letter characterization of the privacy secrecy capacity region for the chosen and generated secret key models. In Section 4, we show that the privacy secrecy capacity region for compound sources is a continuous function of the uncertainty set. Finally, the paper ends with a conclusion in Section 5.
Notation: Discrete random variables are denoted by capital letters and their realizations and ranges by lower case and script letters. $\mathcal{P}(\mathcal{X})$ denotes the set of all probability distributions on $\mathcal{X}$; $\mathbb{E}(\cdot)$ denotes the expectation of a random variable; $\Pr\{\cdot\}$, $H(\cdot)$ and $I(\cdot\,;\cdot)$ indicate the probability, the entropy of a random variable, and the mutual information between two random variables; $D(\cdot\|\cdot)$ is the information divergence; $\|p-q\|_{TV}$ is the total variation distance between $p$ and $q$ on $\mathcal{X}$, defined as $\|p-q\|_{TV} \triangleq \sum_{x\in\mathcal{X}} |p(x)-q(x)|$. The set $T_{p,\delta}^n$ denotes the set of $\delta$-typical sequences of length $n$ with respect to the distribution $p$; the set $T_{W,\delta}^n(x^n)$ denotes the set of $\delta$-conditionally typical sequences with respect to the conditional distribution $W: \mathcal{X} \to \mathcal{P}(\mathcal{Y})$ and sequence $x^n \in \mathcal{X}^n$; $p_{x^n}$ denotes the empirical distribution (type) of the sequence $x^n$.

2. Information Theoretic Model for Biometric Authentication

Let $\mathcal{X}$ and $\mathcal{Y}$ be two finite alphabets. Let $(x^n, y^n) \in \mathcal{X}^n \times \mathcal{Y}^n$ be a pair of biometric sequences of length $n \in \mathbb{N}$; then, the discrete memoryless joint-source is given by the joint probability distribution $Q^n(x^n, y^n) \triangleq \prod_{i=1}^n Q(x_i, y_i)$. This models perfect SSI, i.e., all possible measurements are generated by the discrete memoryless joint-source $Q$, which is perfectly known at both the enrollment and the authentication terminal.
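As a minimal numeric illustration (the $2\times 2$ joint distribution below is hypothetical and not taken from the paper), the following sketch draws $n$ i.i.d. pairs from such a discrete memoryless joint-source $Q$:

```python
import numpy as np
rng = np.random.default_rng(0)

# Hypothetical joint distribution Q(x, y) on a 2 x 2 alphabet
Q = np.array([[0.4, 0.1],
              [0.1, 0.4]])

n = 10
# Draw n i.i.d. pairs (x_i, y_i) ~ Q, i.e., (x^n, y^n) ~ Q^n
flat = rng.choice(Q.size, size=n, p=Q.ravel())
x, y = np.unravel_index(flat, Q.shape)
print(list(zip(x, y)))
```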

2.1. Generated Secret Key Model

The information theoretic authentication model consists of a discrete memoryless joint-source $Q$, which represents the biometric measurement source, and two terminals: the enrollment terminal and the authentication terminal, as shown in Figure 1. At the enrollment terminal, the enrollment sequence $X^n$ is observed and the secret key $K$ and helper data $M$ are generated. At the authentication terminal, the authentication sequence $Y^n$ is observed. An estimate $\hat{K}$ of the secret key is made based on the authentication sequence $Y^n$ and the helper data $M$. Since the helper data is stored in a public database, it should reveal nothing about the secret key $K$ and as little as possible about the enrollment measurement $X^n$. The distribution of the key must be close to uniform.
We consider a block-processing of arbitrary but fixed length $n$. Let $\mathcal{M} \triangleq \{1, \ldots, M_n\}$ be the helper data set and $\mathcal{K} \triangleq \{1, \ldots, K_n\}$ the secret key set.
Definition 1.
An $(n, M_n, K_n)$-code for generated secret key authentication for joint-source $Q \in \mathcal{P}(\mathcal{X} \times \mathcal{Y})$ consists of an encoder $f$ at the enrollment terminal with
$$f: \mathcal{X}^n \to \mathcal{K} \times \mathcal{M}$$
and a decoder $\varphi$ at the authentication terminal
$$\varphi: \mathcal{Y}^n \times \mathcal{M} \to \mathcal{K}.$$
Remark 1.
Note that the function $f$ maps every $x^n$ into a pair $(k, m) \in \mathcal{K} \times \mathcal{M}$, which implies that $K_n M_n \le |\mathcal{X}^n|$.
Definition 2.
A privacy secrecy rate pair $(R_{PL}, R_K) \in \mathbb{R}_+^2$ is called achievable for generated secret key authentication for a joint-source $Q$ if, for any $\delta > 0$, there exist an $n(\delta) \in \mathbb{N}$ and a sequence of $(n, M_n, K_n)$-codes such that, for all $n \ge n(\delta)$, we have
$$\Pr\{\hat{K} \neq K\} \le \delta, \tag{1a}$$
$$\frac{1}{n} H(K) + \delta \ge \frac{1}{n} \log K_n \ge R_K - \delta, \tag{1b}$$
$$\frac{1}{n} I(K; M) \le \delta, \tag{1c}$$
$$\frac{1}{n} I(X^n; M) \le R_{PL} + \delta. \tag{1d}$$
Remark 2.
Condition (1b) requires the key distribution $p_K$ to be close to the uniform distribution $p_{\tilde{K}}$, where $\tilde{K}$ is a random variable uniformly distributed over the key set $\mathcal{K}$. By (1b), we have $\frac{1}{n} \log K_n - \frac{1}{n} H(K) = \frac{1}{n} D(p_K \| p_{\tilde{K}}) \le \delta$; combined with Pinsker's inequality, we have $\|p_K - p_{\tilde{K}}\|_{TV} \le \sqrt{2 \ln 2 \, \delta}$. For small $\delta$, the two distributions are thus close to each other.
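A quick numeric sanity check of this Pinsker-type bound (a minimal sketch with an arbitrary four-letter key distribution; all values are illustrative):

```python
import numpy as np

def kl_bits(p, q):
    """KL divergence D(p||q) in bits (assumes supp(p) is contained in supp(q))."""
    mask = p > 0
    return np.sum(p[mask] * np.log2(p[mask] / q[mask]))

def tv(p, q):
    """Total variation distance as defined in the paper: sum_x |p(x) - q(x)|."""
    return np.abs(p - q).sum()

# A slightly non-uniform key distribution vs. the uniform distribution
p_K = np.array([0.28, 0.26, 0.24, 0.22])
p_unif = np.full(4, 0.25)

D = kl_bits(p_K, p_unif)
# Pinsker's inequality: ||p - q||_TV <= sqrt(2 ln 2 * D(p||q))
assert tv(p_K, p_unif) <= np.sqrt(2 * np.log(2) * D)
print(tv(p_K, p_unif), np.sqrt(2 * np.log(2) * D))
```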
Remark 3.
Condition (1a) stands for reliable authentication; the information about the key leaked by the helper data is negligible by (1c); and the information rate $\frac{1}{n} I(X^n; M)$ about the biometric measurements leaked by the helper data is bounded by $R_{PL} + \delta$ by (1d).
Definition 3.
The set of all achievable privacy secrecy rate pairs for generated key authentication is called the privacy secrecy capacity region and is denoted by $\mathcal{C}_G(Q)$.
We next present the privacy secrecy capacity region for the generated key authentication for the joint-source Q, which was first established in [11,12].
To do so, for some $U$ with alphabet size $|\mathcal{U}| \le |\mathcal{X}| + 1$ and $V: \mathcal{X} \to \mathcal{P}(\mathcal{U})$, we define the region $\mathcal{R}(Q, V)$ as the set of all $(R_{PL}, R_K) \in \mathbb{R}_+^2$ satisfying
$$R_K \le I(U; Y), \qquad R_{PL} \ge I(U; X) - I(U; Y),$$
with $P_{UXY}(u, x, y) = V(u|x) Q(x, y)$.
Theorem 1
([11,12]). The privacy secrecy capacity region for generated key authentication is given by
$$\mathcal{C}_G(Q) = \bigcup_{V: \mathcal{X} \to \mathcal{P}(\mathcal{U})} \mathcal{R}(Q, V).$$

2.2. Chosen Secret Key Model

In this section, we study the authentication model for systems in which the secret key is chosen beforehand. At the enrollment terminal, a secret key $K$ is chosen uniformly and independently of the biometric measurements. The secret key $K$ is bound to the biometric measurements $X^n$, and, based on this, the helper data $M$ is generated, as shown in Figure 2. At the authentication terminal, the authentication measurement $Y^n$ is observed. An estimate $\hat{K}$ of the secret key is made based on the authentication sequence $Y^n$ and the helper data $M$. Since the helper data is stored in a public database, it should reveal nothing about the secret key and as little as possible about the enrollment sequence $X^n$. However, we should be able to reconstruct $K$. To achieve this, a masking layer based on the one-time pad principle is used.
The masking layer, i.e., another uniformly distributed chosen secret key $K$, is added on top of the generated secret key authentication scheme. At the enrollment terminal, a secret key $K_g$ and helper data $M'$ are generated. The generated secret key is added modulo-$|\mathcal{K}|$ to the masking layer $K$ and sent together with the helper data as additional helper data, i.e., $M = (M', K \oplus K_g)$. At the authentication terminal, an estimate $\hat{K}_g$ of the generated secret key is made based on $Y^n$ and $M'$, and the masking layer is estimated as $\hat{K} = (K \oplus K_g) \ominus \hat{K}_g$, where $\oplus$ and $\ominus$ denote addition and subtraction modulo $|\mathcal{K}|$.
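A minimal sketch of this masking layer (all parameters are illustrative, and K_g below is a stand-in uniform value rather than a key actually distilled from biometric measurements):

```python
import secrets

K_SIZE = 2**16  # |K|: size of the key set (illustrative)

# Enrollment: bind the chosen key K to the generated key K_g via a one-time pad.
K = secrets.randbelow(K_SIZE)        # chosen secret key, uniform on {0, ..., |K|-1}
K_g = secrets.randbelow(K_SIZE)      # stand-in for the key generated from X^n
pad = (K + K_g) % K_SIZE             # K (+) K_g, stored as additional helper data

# Authentication: recover K from the estimate of the generated key.
K_g_hat = K_g                        # assume the generated key was decoded correctly
K_hat = (pad - K_g_hat) % K_SIZE     # (K (+) K_g) (-) K_g_hat
assert K_hat == K
```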
We consider a block-processing of arbitrary but fixed length $n$. Let $\mathcal{M} \triangleq \{1, \ldots, M_n\}$ be the helper data set and $\mathcal{K} \triangleq \{1, \ldots, K_n\}$ the secret key set.
Definition 4.
An $(n, M_n, K_n)$-code for chosen secret key authentication for joint-source $Q \in \mathcal{P}(\mathcal{X} \times \mathcal{Y})$ consists of an encoder $f$ at the enrollment terminal with
$$f: \mathcal{K} \times \mathcal{X}^n \to \mathcal{M}$$
and a decoder $\varphi$ at the authentication terminal
$$\varphi: \mathcal{Y}^n \times \mathcal{M} \to \mathcal{K}.$$
Definition 5.
A privacy secrecy rate pair $(R_{PL}, R_K) \in \mathbb{R}_+^2$ for chosen secret key authentication is called achievable for a joint-source $Q$ if, for any $\delta > 0$, there exist an $n(\delta) \in \mathbb{N}$ and a sequence of $(n, M_n, K_n)$-codes such that, for all $n \ge n(\delta)$, we have
$$\Pr\{\hat{K} \neq K\} \le \delta,$$
$$\frac{1}{n} \log K_n \ge R_K - \delta,$$
$$\frac{1}{n} I(K; M) \le \delta,$$
$$\frac{1}{n} I(X^n; M) \le R_{PL} + \delta.$$
Remark 4.
The difference between Definitions 5 and 2 is that here the uniformity of the key is already guaranteed.
Definition 6.
The set of all achievable privacy secrecy rate pairs for chosen secret key authentication for the joint-source $Q \in \mathcal{P}(\mathcal{X} \times \mathcal{Y})$ is called the privacy secrecy capacity region and is denoted by $\mathcal{C}_C(Q)$.
We next present the privacy secrecy capacity region for chosen secret key authentication for the joint-source $Q$, as shown in [11].
Theorem 2
([11]). The privacy secrecy capacity region for chosen secret key authentication is given by
$$\mathcal{C}_C(Q) = \bigcup_{V: \mathcal{X} \to \mathcal{P}(\mathcal{U})} \mathcal{R}(Q, V).$$

3. Authentication for Compound Sources

Let $\mathcal{X}$ and $\mathcal{Y}$ be two finite sets and $\mathcal{S}$ a finite state set. Let $(x^n, y^n) \in \mathcal{X}^n \times \mathcal{Y}^n$ be a sequence pair of length $n \in \mathbb{N}$. For every $s \in \mathcal{S}$, the discrete memoryless joint-source is given by the joint probability distribution $Q_s^n(x^n, y^n) \triangleq \prod_{i=1}^n Q_s(x_i, y_i) = \prod_{i=1}^n p_s(x_i) W_s(y_i|x_i)$, with $p_s \in \mathcal{P}(\mathcal{X})$ a marginal distribution on $\mathcal{X}$ and $W_s: \mathcal{X} \to \mathcal{P}(\mathcal{Y})$ a stochastic matrix.
Definition 7.
The discrete memoryless compound joint-source $\mathcal{Q}_{XY}$ is given by the family of joint probability distributions on $\mathcal{X} \times \mathcal{Y}$ as
$$\mathcal{Q}_{XY} \triangleq \{Q_s \in \mathcal{P}(\mathcal{X} \times \mathcal{Y}) : s \in \mathcal{S}\}.$$
We define the finite set of marginal distributions $\mathcal{Q}_X$ over the alphabet $\mathcal{X}$ induced by the compound joint-source $\mathcal{Q}_{XY}$ as
$$\mathcal{Q}_X \triangleq \Big\{p_s \in \mathcal{P}(\mathcal{X}) : s \in \mathcal{S},\ p_s(x) = \sum_{y \in \mathcal{Y}} Q_s(x, y) \text{ for every } x \in \mathcal{X} \text{ and } Q_s \in \mathcal{Q}_{XY}\Big\}.$$
We define $\mathcal{L}$ as the index set of $\mathcal{Q}_X$. Note that $|\mathcal{L}| = |\mathcal{Q}_X| \le |\mathcal{Q}_{XY}|$.
For every $\ell \in \mathcal{L}$, we define the subset of the compound joint-source $\mathcal{Q}_{XY}$ with the same marginal distribution $p_\ell$ as
$$\mathcal{Q}_{XY,\ell} \triangleq \{Q_s \in \mathcal{Q}_{XY} : Q_s(x, y) = p_\ell(x) W_s(y|x) \text{ for every } (x, y) \in \mathcal{X} \times \mathcal{Y}\}.$$
For every $\ell \in \mathcal{L}$, we define the index set $\mathcal{S}_\ell$ of $\mathcal{Q}_{XY,\ell}$ as
$$\mathcal{S}_\ell \triangleq \{s \in \mathcal{S} : Q_s \in \mathcal{Q}_{XY,\ell}\}.$$
Remark 5.
Note that, for every $\ell, \ell' \in \mathcal{L}$ with $\ell \neq \ell'$, it holds that $\mathcal{Q}_{XY,\ell} \cap \mathcal{Q}_{XY,\ell'} = \emptyset$, $\mathcal{S}_\ell \cap \mathcal{S}_{\ell'} = \emptyset$, $\mathcal{S} = \bigcup_{\ell \in \mathcal{L}} \mathcal{S}_\ell$ and $\mathcal{Q}_{XY} = \bigcup_{\ell \in \mathcal{L}} \mathcal{Q}_{XY,\ell}$.

3.1. Compound Generated Secret Key Model

In this section, we study generated secret key authentication for finite compound joint-sources, a special class of sources that models limited SSI, as shown in Figure 3.
We consider a block-processing of arbitrary but fixed length $n$. Let $\mathcal{M} \triangleq \{1, \ldots, M_n\}$ be the helper data set and $\mathcal{K} \triangleq \{1, \ldots, K_n\}$ the secret key set.
Definition 8.
An $(n, M_n, K_n)$-code for generated secret key authentication for the compound joint-source $\mathcal{Q}_{XY} \subseteq \mathcal{P}(\mathcal{X} \times \mathcal{Y})$ consists of an encoder $f$ at the enrollment terminal with
$$f: \mathcal{X}^n \to \mathcal{K} \times \mathcal{M}$$
and a decoder $\varphi$ at the authentication terminal
$$\varphi: \mathcal{Y}^n \times \mathcal{M} \to \mathcal{K}.$$
Definition 9.
A privacy secrecy rate pair $(R_{PL}, R_K) \in \mathbb{R}_+^2$ is called achievable for generated secret key authentication for the compound joint-source $\mathcal{Q}_{XY}$ if, for any $\delta > 0$, there exist an $n(\delta) \in \mathbb{N}$ and a sequence of $(n, M_n, K_n)$-codes such that, for all $n \ge n(\delta)$ and for every $s \in \mathcal{S}$, we have
$$\Pr\{\hat{K} \neq K\} \le \delta, \qquad \frac{1}{n} H(K) + \delta \ge \frac{1}{n} \log K_n \ge R_K - \delta, \qquad I(K; M) \le \delta, \qquad \frac{1}{n} I(X_s^n; M) \le R_{PL} + \delta.$$
Consider the compound joint-source $\mathcal{Q}_{XY}$. For a fixed $\ell \in \mathcal{L}$, $V: \mathcal{X} \to \mathcal{P}(\mathcal{U})$ and for every $s \in \mathcal{S}_\ell$, we define the region $\mathcal{R}(V, \ell, s)$ as the set of all $(R_{PL}, R_K) \in \mathbb{R}_+^2$ that satisfy
$$R_K \le I(U; Y_s), \qquad R_{PL} \ge I(U; X_\ell) - I(U; Y_s),$$
with $P_{UXY,s}(u, x, y) = V(u|x) Q_s(x, y)$.
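Since $\mathcal{R}(V, \ell, s)$ is determined by its corner point $\big(I(U;X_\ell) - I(U;Y_s),\, I(U;Y_s)\big)$, it can be evaluated numerically for given distributions. The following minimal sketch does this for hypothetical binary alphabets and an arbitrary test channel $V$ (all numbers are illustrative, not taken from the paper):

```python
import numpy as np

def mutual_info(PAB):
    """I(A;B) in bits from a joint pmf matrix PAB[a, b]."""
    PA, PB = PAB.sum(1, keepdims=True), PAB.sum(0, keepdims=True)
    mask = PAB > 0
    return float((PAB[mask] * np.log2(PAB[mask] / (PA @ PB)[mask])).sum())

# Hypothetical ingredients: marginal p_l, test channel V(u|x), source channel W_s(y|x)
p = np.array([0.5, 0.5])
V = np.array([[0.9, 0.1], [0.2, 0.8]])   # rows indexed by x, columns by u
W = np.array([[0.8, 0.2], [0.3, 0.7]])   # rows indexed by x, columns by y

P_UX = (p[:, None] * V).T                # P(u, x) = p(x) V(u|x)
P_UY = P_UX @ W                          # P(u, y) = sum_x P(u, x) W(y|x)

R_K_max = mutual_info(P_UY)                        # corner point I(U; Y_s)
R_PL_min = mutual_info(P_UX) - mutual_info(P_UY)   # corner point I(U; X_l) - I(U; Y_s)
print(R_K_max, R_PL_min)
```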
Theorem 3.
The privacy secrecy capacity region for generated secret key authentication for the compound joint-source $\mathcal{Q}_{XY}$ is given by
$$\mathcal{C}_G(\mathcal{Q}_{XY}) = \bigcap_{\ell \in \mathcal{L}} \ \bigcup_{\substack{V: \mathcal{X} \to \mathcal{P}(\mathcal{U}) \\ |\mathcal{U}| \le |\mathcal{X}| + |\mathcal{S}_\ell|}} \ \bigcap_{s \in \mathcal{S}_\ell} \mathcal{R}(V, \ell, s).$$
Proof. 
The proof of Theorem 3 consists of two parts: achievability and converse. The achievability scheme uses the following protocol:
  • Estimate the marginal distribution $\hat{p} \in \mathcal{Q}_X$ from the observed sequence $X^n$ at the enrollment terminal via hypothesis testing.
  • Compute the key $K$ and helper data $M'$ based on $X^n$, a common sequence $T = U^n$ shared by the enrollment and authentication terminals, and an extractor function $g: \{0,1\}^N \times \{0,1\}^d \to \{0,1\}^k$ with $N, d, k \in \mathbb{N}$, whose inputs are the shared sequence $T$ and a sequence $U^d$ of $d$ uniformly distributed bits. The helper data $M'$ is equivalent to the helper data for the case with perfect SSI. The extended helper data in this case also contains the index of the estimated marginal distribution and the uniformly distributed bit sequence, i.e., $M = (M', \hat{L}, U^d)$.
  • Store the extended helper data $M$ in the public database.
  • Estimate the key $\hat{K}$ at the authentication terminal based on the observations $M$ and $Y^n$, the latter of which can be seen as the output of one of the channels in $\mathcal{W}_{\hat{\ell}} \triangleq \{W_s: \mathcal{X} \to \mathcal{P}(\mathcal{Y}) : s \in \mathcal{S}_{\hat{\ell}}\}$.
A detailed proof can be found in Appendix A. □
Remark 6.
Note that authentication for the compound source model is a generalization of the models studied in [11,12], which correspond to $|\mathcal{S}| = 1$. Furthermore, one can see that, for $|\mathcal{S}| = 1$, the capacity region under the strong secrecy condition equals the capacity region under the weak secrecy condition shown in [11,12].
Remark 7.
As already mentioned, we aim for strong secrecy, i.e., in contrast to the weak secrecy constraint in (1c), we now require the un-normalized mutual information between the key and the helper data to be negligibly small. It would be ideal to show perfect secrecy and a perfectly uniform key, i.e., $I(K; M) = 0$ and $H(K) = \log K_n$. It would be interesting to see how this constraint affects the achievable rate region. We suspect that the achievable rate region under perfect secrecy and a perfectly uniform key remains the same as in Theorem 3.
Remark 8.
From the protocol, note that, once we have estimated the marginal distribution $\hat{p} \in \mathcal{Q}_X$, we deal with a compound channel model without channel state information (CSI) at the transmitter (see [24]).
Remark 9.
The order of the set operations in the capacity region reflects the fact that the marginal distribution is estimated first. This can be seen as partial state information, where the marginal distribution over $\mathcal{X}$ is known.

3.2. Compound Chosen Secret Key Model

In this section, we study chosen secret key authentication for finite compound joint-sources (see Figure 4).
We consider an $(n, M_n, K_n)$-code of arbitrary but fixed length $n$.
Definition 10.
A privacy secrecy rate pair $(R_{PL}, R_K) \in \mathbb{R}_+^2$ is called achievable for chosen secret key authentication for the compound joint-source $\mathcal{Q}_{XY}$ if, for any $\delta > 0$, there exist an $n(\delta) \in \mathbb{N}$ and a sequence of $(n, M_n, K_n)$-codes such that, for all $n \ge n(\delta)$ and for every $s \in \mathcal{S}$, we have
$$\Pr\{\hat{K} \neq K\} \le \delta,$$
$$\frac{1}{n} \log K_n \ge R_K - \delta,$$
$$I(K; M) \le \delta,$$
$$\frac{1}{n} I(X_s^n; M) \le R_{PL} + \delta.$$
Consider the compound joint-source $\mathcal{Q}_{XY}$. For a fixed $\ell \in \mathcal{L}$, $V: \mathcal{X} \to \mathcal{P}(\mathcal{U})$ and for every $s \in \mathcal{S}_\ell$, we define the region $\mathcal{R}(V, \ell, s)$ as the set of all $(R_{PL}, R_K) \in \mathbb{R}_+^2$ that satisfy
$$R_K \le I(U; Y_s), \qquad R_{PL} \ge I(U; X_\ell) - I(U; Y_s),$$
with $P_{UXY,s}(u, x, y) = V(u|x) Q_s(x, y)$.
Theorem 4.
The privacy secrecy capacity region for chosen secret key authentication for the compound joint-source $\mathcal{Q}_{XY}$ is given by
$$\mathcal{C}_C(\mathcal{Q}_{XY}) = \bigcap_{\ell \in \mathcal{L}} \ \bigcup_{\substack{V: \mathcal{X} \to \mathcal{P}(\mathcal{U}) \\ |\mathcal{U}| \le |\mathcal{X}| + |\mathcal{S}_\ell|}} \ \bigcap_{s \in \mathcal{S}_\ell} \mathcal{R}(V, \ell, s).$$
Proof. 
The proof can be found in Appendix B. □
Remark 10.
Note that, as for generated secret key authentication for compound sources, chosen secret key authentication for compound sources is a generalization of the model studied in [11]. Furthermore, for perfect SSI, one can see that the capacity region under the strong secrecy condition equals the capacity region under the weak secrecy condition shown in [11].
Remark 11.
Note that the privacy secrecy capacity region for the generated key model equals the privacy secrecy capacity region for chosen secret key authentication, i.e., $\mathcal{C}_G(\mathcal{Q}_{XY}) = \mathcal{C}_C(\mathcal{Q}_{XY})$.

4. Continuity of the Privacy Secrecy Capacity Region for Compound Sources

We are interested in studying how small variations in the compound source affect the privacy secrecy capacity region. Whether the capacity or capacity region is a continuous function of a source or channel is not always clear, especially if the source or channel is complicated. In [22], one can find an example of an AVWC whose uncertainty set consists of only two channels and whose unassisted secrecy capacity already exhibits discontinuity points. For a detailed discussion, see [25]. In this section, we study the continuity of the privacy secrecy capacity region for compound sources. For this purpose, we introduce distances between compound sources and between capacity regions, respectively.

4.1. Distance between Compound Sources

Definition 11.
Let $\mathcal{Q}_{XY,1}$ and $\mathcal{Q}_{XY,2}$ be two compound sources. We define
$$d_1(\mathcal{Q}_{XY,1}, \mathcal{Q}_{XY,2}) = \max_{s_2 \in \mathcal{S}_2} \min_{s_1 \in \mathcal{S}_1} \|Q_{s_1} - Q_{s_2}\|_{TV}, \qquad d_2(\mathcal{Q}_{XY,1}, \mathcal{Q}_{XY,2}) = \max_{s_1 \in \mathcal{S}_1} \min_{s_2 \in \mathcal{S}_2} \|Q_{s_1} - Q_{s_2}\|_{TV}.$$
The Hausdorff distance $D_H(\mathcal{Q}_{XY,1}, \mathcal{Q}_{XY,2})$ between $\mathcal{Q}_{XY,1}$ and $\mathcal{Q}_{XY,2}$ is defined as
$$D_H(\mathcal{Q}_{XY,1}, \mathcal{Q}_{XY,2}) = \max\big\{d_1(\mathcal{Q}_{XY,1}, \mathcal{Q}_{XY,2}),\, d_2(\mathcal{Q}_{XY,1}, \mathcal{Q}_{XY,2})\big\}.$$
Definition 12.
Let $\mathcal{R}_1$ and $\mathcal{R}_2$ be two non-empty subsets of the metric space $(\mathbb{R}^2, d)$ with $d(x, y) = \sqrt{\sum_{i=1}^2 |x_i - y_i|^2}$ for all $x, y \in \mathbb{R}^2$. We define the distance between two sets as
$$D_R(\mathcal{R}_1, \mathcal{R}_2) = \max\Big\{\max_{r_1 \in \mathcal{R}_1} \min_{r_2 \in \mathcal{R}_2} d(r_1, r_2),\ \max_{r_2 \in \mathcal{R}_2} \min_{r_1 \in \mathcal{R}_1} d(r_1, r_2)\Big\}.$$
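Both $D_H$ and $D_R$ are Hausdorff distances, differing only in the underlying point distance. A minimal sketch computing them for toy finite sets (the compound sources and rate points below are hypothetical; for continuous regions, the finite point sets serve only as an approximation):

```python
import numpy as np

def hausdorff(A, B, dist):
    """Hausdorff distance between two finite non-empty sets under 'dist'."""
    d1 = max(min(dist(a, b) for a in A) for b in B)
    d2 = max(min(dist(a, b) for b in B) for a in A)
    return max(d1, d2)

tv = lambda P, Q: np.abs(P - Q).sum()                            # TV distance on joint pmfs
euc = lambda r1, r2: float(np.linalg.norm(np.subtract(r1, r2)))  # metric d on R^2

# D_H between two toy compound sources (sets of 2x2 joint pmfs)
Q1 = [np.array([[0.4, 0.1], [0.1, 0.4]]), np.array([[0.3, 0.2], [0.2, 0.3]])]
Q2 = [np.array([[0.38, 0.12], [0.12, 0.38]])]
print(hausdorff(Q1, Q2, tv))

# D_R between two finite sets of rate pairs (R_PL, R_K)
R1 = [(0.2, 0.5), (0.3, 0.4)]
R2 = [(0.25, 0.45)]
print(hausdorff(R1, R2, euc))
```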

4.2. Continuity of the Privacy Secrecy Capacity Region

Theorem 5.
Let $\epsilon \in (0, 1)$ and $n \in \mathbb{N}$. Let $\mathcal{Q}_{XY,1}$ and $\mathcal{Q}_{XY,2}$ be two compound sources. If
$$D_H(\mathcal{Q}_{XY,1}, \mathcal{Q}_{XY,2}) \le \epsilon,$$
then it holds that
$$D_R\big(\mathcal{C}_G(\mathcal{Q}_{XY,1}), \mathcal{C}_G(\mathcal{Q}_{XY,2})\big) \le \delta(\epsilon, |\mathcal{X}|, |\mathcal{Y}|)$$
with $\delta(\epsilon) = \sqrt{\delta_1(\epsilon)^2 + \delta_2(\epsilon)^2}$, where $\delta_1(\epsilon) = 2\epsilon \log |\mathcal{Y}| + 2 H_2(\epsilon) - \epsilon \log \frac{\epsilon}{|\mathcal{U}|}$ and $\delta_2(\epsilon) = 2\epsilon \log |\mathcal{Y}||\mathcal{X}| + 4 H_2(\epsilon) - 2\epsilon \log \frac{\epsilon}{|\mathcal{U}|}$.
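Assuming the form of $\delta_1(\epsilon)$ and $\delta_2(\epsilon)$ as reconstructed above, one can check numerically that $\delta(\epsilon) \to 0$ as $\epsilon \to 0$ (all alphabet sizes below are illustrative):

```python
import numpy as np

def delta_bound(eps, X, Y, U):
    """delta(eps) from Theorem 5 (logs in bits; alphabet sizes are placeholders)."""
    H2 = -eps * np.log2(eps) - (1 - eps) * np.log2(1 - eps)  # binary entropy H_2(eps)
    d1 = 2 * eps * np.log2(Y) + 2 * H2 - eps * np.log2(eps / U)
    d2 = 2 * eps * np.log2(Y * X) + 4 * H2 - 2 * eps * np.log2(eps / U)
    return np.hypot(d1, d2)  # sqrt(d1^2 + d2^2)

for eps in (0.1, 0.01, 0.001):
    print(eps, delta_bound(eps, X=4, Y=4, U=6))  # decreases towards 0 as eps -> 0
```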
Remark 12.
Note that, since the privacy secrecy capacity region for the generated secret key equals the privacy secrecy capacity region for the chosen secret key, the continuity behaviour also holds for the chosen secret key privacy secrecy capacity region.
Remark 13.
This theorem shows that the privacy secrecy capacity region is a continuous function of the uncertainty set. In other words, small variations of the uncertainty set lead to small variations in the capacity region.
Proof. 
A detailed proof can be found in Appendix C. □
Remark 14.
A complete characterisation of the discontinuity behaviour of the AVC capacity under list decoding can be found in [26]. Note that, by Theorem 5, this behaviour cannot occur here.

5. Conclusions

In this paper, we considered a biometric authentication model in the presence of source uncertainty. In particular, we studied a model where the actual source realization is not known; however, it belongs to a known source set: this is the finite compound source model. We have shown that biometric authentication is robust against source uncertainty and certain classes of attacks. In other words, reliable and secure authentication is possible at positive key rates. We further characterized the minimum privacy leakage rate under source uncertainty. For future work, perfect secrecy for the biometric authentication model and compound sources with infinitely many states are of great interest.

Acknowledgments

The authors would like to thank Sebastian Baur for insightful discussions. This work was supported by the Gottfried Wilhelm Leibniz Programme of the German Research Foundation (DFG) under Grant BO 1734/20-1, Grant BO 1734/24-1 and Grant BO 1734/25-1.

Author Contributions

Andrea Grigorescu, Holger Boche and Rafael Schaefer conceived this study and derived the results. Andrea Grigorescu wrote the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Theorem 3

Appendix A.1. Achievability of Theorem 3

Appendix A.1.1. State Estimation

We first show that we can estimate the marginal distribution $\hat{p} \in \mathcal{Q}_X$ correctly with probability approaching one. Then, for every $\ell = \hat{\ell} \in \mathcal{L}$, we use the random coding argument to show that all rate pairs $(R_{PL}, R_K) \in \mathcal{R}(V, \ell, s)$ are achievable.
To estimate the actual source realization, we perform hypothesis testing. The set of hypotheses is the finite set of marginal distributions $\mathcal{Q}_X$. For every $\ell \in \mathcal{L}$, we define
$$\delta_\ell = \frac{1}{2} \min_{\ell' \in \mathcal{L}, \ell' \neq \ell} \|p_\ell - p_{\ell'}\|_{TV}.$$
We choose $0 < \delta' < \min_{\ell \in \mathcal{L}} \delta_\ell$ and consider the test set (set of typical sequences) $T_{p_\ell, \delta'}^n \triangleq \{x^n \in \mathcal{X}^n : \|p_{x^n} - p_\ell\|_{TV} \le \delta'\}$. Note that, for every $\ell, \ell' \in \mathcal{L}$ with $\ell \neq \ell'$, we have $T_{p_\ell, \delta'}^n \cap T_{p_{\ell'}, \delta'}^n = \emptyset$. We show this by arbitrarily choosing a sequence $x^n \in T_{p_\ell, \delta'}^n$ of type $p_{x^n}$ and showing that $\|p_{\ell'} - p_{x^n}\|_{TV} > \delta'$ for $\ell' \neq \ell$. By the triangle inequality, we have
$$\|p_\ell - p_{\ell'}\|_{TV} = \|p_\ell - p_{\ell'} + p_{x^n} - p_{x^n}\|_{TV} \le \|p_\ell - p_{x^n}\|_{TV} + \|p_{x^n} - p_{\ell'}\|_{TV}.$$
Hence,
$$\|p_{x^n} - p_{\ell'}\|_{TV} \ge \|p_\ell - p_{\ell'}\|_{TV} - \|p_\ell - p_{x^n}\|_{TV} \ge 2\delta_\ell - \delta' > \delta',$$
proving the disjointness of the sets.
The test function is the indicator function $\mathbb{1}[x^n \in T_{p_\ell, \delta'}^n]$, i.e., after observing $x^n$, the test looks for the hypothesis $\hat{p} = p_\ell$ for which $\mathbb{1}[x^n \in T_{p_\ell, \delta'}^n] = 1$.
An error occurs if the sequence $x^n$ was generated by the source $p_\ell$ for some $\ell \in \mathcal{L}$ but $x^n \notin T_{p_\ell, \delta'}^n$. This implies that either $x^n \notin \bigcup_{\ell' \in \mathcal{L}} T_{p_{\ell'}, \delta'}^n$ or $x^n \in T_{p_{\ell'}, \delta'}^n$ with $\ell' \neq \ell$. Using Lemma 2.12 in [27], we upper bound the probability of this error event by
$$p_\ell^n\big((T_{p_\ell, \delta'}^n)^c\big) \le \epsilon_{\delta'}(n, |\mathcal{X}|), \tag{A1}$$
where $\epsilon_{\delta'}(n, |\mathcal{X}|) = (n+1)^{|\mathcal{X}|}\, 2^{-n c \delta'^2}$. Letting $n \to \infty$, the right-hand side of (A1) tends to zero.
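A minimal sketch of this typicality-based hypothesis test (the two candidate marginals and all parameters are hypothetical; with the separation condition on $\delta'$, the typical sets are disjoint and the test picks the unique matching hypothesis with probability approaching one):

```python
import numpy as np
rng = np.random.default_rng(0)

def empirical(xs, alphabet_size):
    """Empirical distribution (type) of the sequence xs."""
    return np.bincount(xs, minlength=alphabet_size) / len(xs)

# Hypothesis set Q_X: two candidate marginals on a 3-letter alphabet
marginals = [np.array([0.6, 0.3, 0.1]), np.array([0.2, 0.3, 0.5])]
min_tv = min(np.abs(p - q).sum() for i, p in enumerate(marginals)
             for q in marginals[i + 1:])
delta_prime = 0.25 * min_tv   # any value in (0, min_tv / 2) works

n = 2000
x = rng.choice(3, size=n, p=marginals[0])  # true source: index 0
p_hat = empirical(x, 3)

# Accept the (unique) hypothesis whose typical set contains x^n;
# .index(True) would fail only in the unlikely atypical event.
decisions = [np.abs(p_hat - p).sum() <= delta_prime for p in marginals]
print(decisions.index(True))  # -> 0 with probability -> 1 as n grows
```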

Appendix A.1.2. Code Construction

For each L , we consider the auxiliary random variable U and the channel V and construct a code for which we analyze the decoding error, secrecy and privacy condition.
Generate $2^{n(R_K + R_M)}$ codewords $U_{k,m}^n$ with $k \in \mathcal{K} \triangleq \{1, \ldots, 2^{nR_K}\}$ and $m \in \mathcal{M} \triangleq \{1, \ldots, 2^{nR_M}\}$ by choosing each symbol $U_i^{k,m}$ of the codebook independently at random according to $p_{u_\ell} \in \mathcal{P}(\mathcal{U})$, computed from $p_\ell(x) V(u|x)$ for every $(x, u) \in \mathcal{X} \times \mathcal{U}$. We denote the codebook by $\tilde{\mathcal{U}} = \{U_{k,m}^n\}_{(k,m) \in \mathcal{K} \times \mathcal{M}}$.
For every $\ell \in \mathcal{L}$ and every $s \in \mathcal{S}_\ell$, we define the following channels $\Sigma_\ell^X: \mathcal{U} \to \mathcal{P}(\mathcal{X})$, $\Sigma_s^Y: \mathcal{U} \to \mathcal{P}(\mathcal{Y})$ and $\Sigma_s^{XY}: \mathcal{U} \to \mathcal{P}(\mathcal{X} \times \mathcal{Y})$ that satisfy
$$\Sigma_\ell^X(x|u) = \frac{p_\ell(x) V(u|x)}{\sum_{x' \in \mathcal{X}} p_\ell(x') V(u|x')}, \qquad \Sigma_s^Y(y|u) = \frac{\sum_{x \in \mathcal{X}} V(u|x) Q_s(x, y)}{\sum_{(x', y') \in \mathcal{X} \times \mathcal{Y}} V(u|x') Q_s(x', y')}, \qquad \Sigma_s^{XY}(x, y|u) = \frac{V(u|x) Q_s(x, y)}{\sum_{(x', y') \in \mathcal{X} \times \mathcal{Y}} V(u|x') Q_s(x', y')},$$
for every $(u, x, y) \in \mathcal{U} \times \mathcal{X} \times \mathcal{Y}$.
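These reverse channels are just Bayes' rule applied to the effective joint distribution $p_\ell(x) V(u|x)$. A minimal sketch for $\Sigma_\ell^X$ with hypothetical small alphabets (all numbers illustrative):

```python
import numpy as np

# Hypothetical alphabets: |X| = 2, |U| = 3
p = np.array([0.7, 0.3])                 # marginal p_l on X
V = np.array([[0.5, 0.3, 0.2],           # V(u|x): rows indexed by x, columns by u
              [0.1, 0.4, 0.5]])

p_u = p @ V                              # p_{u_l}(u) = sum_x p_l(x) V(u|x)
# Reverse channel Sigma_X(x|u) = p_l(x) V(u|x) / p_{u_l}(u)  (Bayes' rule)
Sigma_X = (p[:, None] * V) / p_u[None, :]
assert np.allclose(Sigma_X.sum(axis=0), 1.0)  # each column is a pmf on X
```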

Appendix A.1.3. Encoding Sets

For every $(k, m, \ell) \in \mathcal{K} \times \mathcal{M} \times \mathcal{L}$, we define the encoding sets $E_{k,m,\ell}(\tilde{\mathcal{U}}) \subseteq \mathcal{X}^n$ as follows:
$$E_{k,m,\ell}(\tilde{\mathcal{U}}) = T_{\Sigma_\ell^X, \delta''}^n(U_{k,m}^n),$$
with $\delta'' > \delta' |\mathcal{U}|$.
Remark 15.
Note that, by the definition of $\delta''$ and Lemma 2.10 in [27], if $U_{k,m}^n \in T_{p_{u_\ell}, \delta'''}^n$ with $\delta''' = \delta'' |\mathcal{U}|$ and $x^n \in T_{\Sigma_\ell^X, \delta''}^n(U_{k,m}^n)$, then $x^n \in T_{p_\ell, \delta'}^n$.

Appendix A.1.4. Decoding Sets

For every $(k, m, \ell) \in \mathcal{K} \times \mathcal{M} \times \mathcal{L}$, we define the decoding sets $D_k(m(\tilde{\mathcal{U}}), \ell) \subseteq \mathcal{Y}^n$ as follows:
$$D'_k(m(\tilde{\mathcal{U}}), \ell) \triangleq \bigcup_{s \in \mathcal{S}_{\hat{\ell}}} T_{\Sigma_s^Y, \delta''}^n(U_{k,m}^n), \qquad D_k(m(\tilde{\mathcal{U}}), \ell) \triangleq D'_k(m(\tilde{\mathcal{U}}), \ell) \cap \bigcap_{\substack{k' \in \mathcal{K} \\ k' \neq k}} D'_{k'}(m(\tilde{\mathcal{U}}), \ell)^c,$$
with $\delta'' > \delta' |\mathcal{U}|$.
Remark 16.
One could consider sending some bits of the sequence $X^n$ over the public channel so that the user at the authentication terminal can estimate the actual source realization and thus avoid the complicated decoding strategy. However, this approach would violate the strong secrecy condition.

Appendix A.1.5. Encoder–Decoder Pair Sets

For every $(k, m) \in \mathcal{K} \times \mathcal{M}$, we define the encoder–decoder pair set $C_{k,m,\ell}(\tilde{\mathcal{U}}) \subseteq \mathcal{X}^n \times \mathcal{Y}^n$ as follows:
$$C_{k,m,\ell}(\tilde{\mathcal{U}}) = \big(E_{k,m,\ell}(\tilde{\mathcal{U}}) \times D_k(m(\tilde{\mathcal{U}}), \ell)\big) \cap \bigcup_{s \in \mathcal{S}_{\hat{\ell}}} T_{\Sigma_s^{XY}, \tilde{\delta}}^n(U_{k,m}^n),$$
with $\tilde{\delta} > 0$.

Appendix A.1.6. Error Analysis

For every $\ell \in \mathcal{L}$, assume that the marginal distribution was estimated correctly, i.e., $\hat{\ell} = \ell$. We analyze the probability of each error event separately. We denote the error at the enrollment terminal given the codebook $\tilde{\mathcal{U}}$ by $\epsilon_{E,n}(\tilde{\mathcal{U}})$. An error occurs at the enrollment terminal if the observed sequence $x^n$ does not belong to $E_{k,m,\ell}(\tilde{\mathcal{U}})$ for any $(k, m) \in \mathcal{K} \times \mathcal{M}$, i.e.,
$$\epsilon_{E,n}(\tilde{\mathcal{U}}) = p_\ell^n\Big(\bigcap_{(k,m) \in \mathcal{K} \times \mathcal{M}} E_{k,m,\ell}(\tilde{\mathcal{U}})^c\Big) = \prod_{(k,m) \in \mathcal{K} \times \mathcal{M}} \big(1 - p_\ell^n(E_{k,m,\ell}(\tilde{\mathcal{U}}))\big).$$
Averaging over all codebooks, from the independence of the random variables involved and from Lemma 2.13 in [27], we have
$$\mathbb{E}_{\tilde{\mathcal{U}}}\big(\epsilon_{E,n}(\tilde{\mathcal{U}})\big) = \prod_{(k,m) \in \mathcal{K} \times \mathcal{M}} \mathbb{E}_{U_{k,m}^n}\big(1 - p_\ell^n(T_{\Sigma_\ell^X, \delta''}^n(U_{k,m}^n))\big) \le \Big[1 - (n+1)^{-|\mathcal{U}||\mathcal{X}|}\, 2^{-n(I(U; X_\ell) + \psi(\delta'', |\mathcal{U}||\mathcal{X}|))}\Big]^{2^{n(R_K + R_M)}} \le \exp\Big(-(n+1)^{-|\mathcal{U}||\mathcal{X}|}\, 2^{n(R_K + R_M - I(U; X_\ell) - \psi(\delta'', |\mathcal{U}||\mathcal{X}|))}\Big). \tag{A2}$$
The inequality (A2) follows from $(1-x)^r \le \exp(-rx)$, which holds for every $x \in [0,1]$ and $r > 0$. Letting $n \to \infty$ and choosing
$$R_K + R_M > I(U; X_\ell) + \psi(\delta'', |\mathcal{U}||\mathcal{X}|), \tag{A3}$$
the right-hand side of (A2) goes to zero doubly exponentially fast. An error occurs at the authentication terminal when $(k, m)$ was encoded at the enrollment terminal but $k' \neq k$ was decoded at the authentication terminal. The set of joint observations describing this event is given by
$$CE_{k,m,\ell}(\tilde{\mathcal{U}})^c = C_{k,m,\ell}(\tilde{\mathcal{U}})^c \cap \big(E_{k,m,\ell}(\tilde{\mathcal{U}}) \times \mathcal{Y}^n\big) \subseteq \big(E_{k,m,\ell}(\tilde{\mathcal{U}}) \times D_k(m(\tilde{\mathcal{U}}), \ell)^c\big) \cup \bigcap_{s \in \mathcal{S}_\ell} T_{\Sigma_s^{XY}, \tilde{\delta}}^n(U_{k,m}^n)^c.$$
We denote the error probability of this event given the codebook $\tilde{\mathcal{U}}$ for each correlated source $Q_t$ with $t \in \mathcal{S}_\ell$ by $\epsilon_{n,k}^t(\tilde{\mathcal{U}})$. We have
$$\begin{aligned}
\epsilon_{n,k}^t(\tilde{\mathcal{U}}) &= \Sigma_t^{XY,n}\big(CE_{k,m,\ell}(\tilde{\mathcal{U}})^c \mid U_{k,m}^n\big) \\
&\le \Sigma_t^{XY,n}\big(E_{k,m,\ell}(\tilde{\mathcal{U}}) \times D_k(m(\tilde{\mathcal{U}}), \ell)^c \mid U_{k,m}^n\big) + \Sigma_t^{XY,n}\Big(\bigcap_{s \in \mathcal{S}_\ell} T_{\Sigma_s^{XY}, \tilde{\delta}}^n(U_{k,m}^n)^c \mid U_{k,m}^n\Big) \\
&\le \Sigma_t^{Y,n}\big(D'_k(m(\tilde{\mathcal{U}}), \ell)^c \mid U_{k,m}^n\big) + \Sigma_t^{Y,n}\Big(\bigcup_{\substack{k' \in \mathcal{K} \\ k' \neq k}} D'_{k'}(m(\tilde{\mathcal{U}}), \ell) \mid U_{k,m}^n\Big) + \Sigma_t^{XY,n}\big(T_{\Sigma_t^{XY}, \tilde{\delta}}^n(U_{k,m}^n)^c \mid U_{k,m}^n\big) \\
&\le \Sigma_t^{Y,n}\big(T_{\Sigma_t^Y, \delta''}^n(U_{k,m}^n)^c \mid U_{k,m}^n\big) + \sum_{s \in \mathcal{S}_\ell} \sum_{\substack{k' \in \mathcal{K} \\ k' \neq k}} \Sigma_t^{Y,n}\big(T_{\Sigma_s^Y, \delta''}^n(U_{k',m}^n) \mid U_{k,m}^n\big) + \Sigma_t^{XY,n}\big(T_{\Sigma_t^{XY}, \tilde{\delta}}^n(U_{k,m}^n)^c \mid U_{k,m}^n\big).
\end{aligned}$$
Averaging over all codebooks and applying Lemma 2.12 in [27], we have
$$\mathbb{E}_{\tilde{\mathcal{U}}}\big(\epsilon_{n,k}^t(\tilde{\mathcal{U}})\big) \le \epsilon_{\delta''}(n, |\mathcal{U}||\mathcal{Y}|) + \epsilon_{\tilde{\delta}}(n, |\mathcal{U}||\mathcal{X}||\mathcal{Y}|) + \sum_{s \in \mathcal{S}_\ell} \sum_{\substack{k' \in \mathcal{K} \\ k' \neq k}} \mathbb{E}_{U_{k,m}^n} \mathbb{E}_{U_{k',m}^n}\, \Sigma_t^{Y,n}\big(T_{\Sigma_s^Y, \delta''}^n(U_{k',m}^n) \mid U_{k,m}^n\big),$$
with $\epsilon_{\delta''}(n, |\mathcal{U}||\mathcal{Y}|) = (n+1)^{|\mathcal{U}||\mathcal{Y}|}\, 2^{-n c \delta''^2}$ and $\epsilon_{\tilde{\delta}}(n, |\mathcal{U}||\mathcal{X}||\mathcal{Y}|) = (n+1)^{|\mathcal{U}||\mathcal{X}||\mathcal{Y}|}\, 2^{-n c \tilde{\delta}^2}$.
For $k' \neq k$, applying Lemma 3.3 in [28], we can bound the inner expectation by
$$\mathbb{E}_{U_{k',m}^n}\, \Sigma_t^{Y,n}\big(T_{\Sigma_s^Y, \delta''}^n(U_{k',m}^n) \mid U_{k,m}^n\big) \le \frac{p_{Y,t}^n\big(T_{\Sigma_s^Y, \delta''}^n(U_{k',m}^n)\big)}{p_{u_\ell}^n\big(T_{p_{u_\ell}, \delta'''}^n\big)},$$
with $\delta''' = \delta'' |\mathcal{U}|$, since $U_{k,m}^n \in T_{p_{u_\ell}, \delta'''}^n$ with probability one. For any $t, s \in \mathcal{S}_\ell$, we have
$$\mathbb{E}_{U_{k',m}^n}\, \Sigma_t^{Y,n}\big(T_{\Sigma_s^Y, \delta''}^n(U_{k',m}^n) \mid U_{k,m}^n\big) \le \frac{(n+1)^{|\mathcal{U}||\mathcal{Y}|}}{1 - \epsilon_{\delta'''}(n, |\mathcal{U}|)}\, 2^{-n(I(U; Y_s) - \phi(\delta'', |\mathcal{U}|, |\mathcal{Y}|))}.$$
For every $t, s \in \mathcal{S}_\ell$ and every $k \in \mathcal{K}$, we have
$$\mathbb{E}_{\tilde{\mathcal{U}}}\big(\epsilon_{n,k}^t(\tilde{\mathcal{U}}) \mid U_{k,m}^n\big) \le \epsilon_{\delta''}(n, |\mathcal{U}||\mathcal{Y}|) + \epsilon_{\tilde{\delta}}(n, |\mathcal{U}||\mathcal{X}||\mathcal{Y}|) + \frac{(n+1)^{|\mathcal{U}||\mathcal{Y}|}}{1 - \epsilon_{\delta'''}(n, |\mathcal{U}|)}\, |\mathcal{S}_\ell|\, 2^{-n(\min_{s \in \mathcal{S}_\ell} I(U; Y_s) - R_K - \phi(\delta'', |\mathcal{U}|, |\mathcal{Y}|))}.$$
There is an $n(\delta'', \delta''', \tilde{\delta}, |\mathcal{U}|, |\mathcal{X}|, |\mathcal{Y}|)$ such that, for all $n > n(\delta'', \delta''', \tilde{\delta}, |\mathcal{U}|, |\mathcal{X}|, |\mathcal{Y}|)$, we have
$$\mathbb{E}_{\tilde{\mathcal{U}}}\big(\epsilon_{n,k}^t(\tilde{\mathcal{U}}) \mid U_{k,m}^n\big) \le |\mathcal{S}_\ell|\, 2^{-n(\min_{s \in \mathcal{S}_\ell} I(U; Y_s) - R_K - \phi(\delta'', |\mathcal{U}|, |\mathcal{Y}|))} \tag{A4}$$
for all $k \in \mathcal{K}$. By choosing
$$R_K < \min_{s \in \mathcal{S}_\ell} I(U; Y_s) - \phi(\delta'', |\mathcal{U}|, |\mathcal{Y}|) \tag{A5}$$
and letting $n \to \infty$, the right-hand side of (A4) tends to zero. Considering (A5) and (A3), the helper data rate is lower bounded by
$$R_M > I(U; X_\ell) - \min_{s \in \mathcal{S}_\ell} I(U; Y_s) + \phi(\delta'', |\mathcal{U}|, |\mathcal{Y}|) + \psi(\delta'', |\mathcal{U}|, |\mathcal{X}|). \tag{A6}$$

Appendix A.1.7. Key Distribution

Besides reliability, a privacy secrecy rate pair has to fulfill three other conditions. One of them is that the secret key distribution must be close to the uniform distribution. Here, we show that this is indeed satisfied, using the proof of [13]. For completeness, we sketch the proof given in [13] for sequential key distillation, which consists of two phases: reconciliation and privacy amplification. The reconciliation step is equivalent to the reliability analysis above. The privacy amplification step consists of the construction of the key $K$ from a common shared sequence $T = U^n$ using an extractor function $g: \{0,1\}^N \times \{0,1\}^d \to \{0,1\}^k$ with $d, k, N \in \mathbb{N}$, whose inputs are the shared sequence $T$ and a sequence $U^d$ of $d$ uniformly distributed bits, and whose output is a nearly uniformly distributed sequence of $k$ bits.
Lemma 1
([7]). Let $T \in \{0,1\}^n$ be the random variable that represents the common sequence shared by both terminals and let $E$ be the random variable that represents the total knowledge about $T$ available to the eavesdropper. Let $e$ be a particular realization of $E$. If both terminals know the conditional min-entropy $H_\infty(T \mid E = e) \ge \gamma n$ for some $\gamma \in (0, 1)$, then there exists an extractor
$$g: \{0,1\}^n \times \{0,1\}^d \to \{0,1\}^k$$
with
$$d \le n \delta(n) \quad \text{and} \quad k \ge n(\gamma - \delta(n)),$$
where $\lim_{n \to \infty} \delta(n) = 0$, such that, if $U^d$ is a random variable with uniform distribution on $\{0,1\}^d$ and both terminals choose $K = g(T, U^d)$ as their secret key, then
$$H(K \mid U^d, E = e) \ge k - \delta(n).$$
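A minimal sketch of seeded privacy amplification, using Toeplitz hashing (a standard 2-universal family) as a stand-in for the extractor $g$; this is not necessarily the construction used in [7] or [13], and all parameters below are illustrative:

```python
import numpy as np
rng = np.random.default_rng(1)

def toeplitz_hash(t_bits, seed_bits, k):
    """2-universal hash via a k x n Toeplitz matrix built from the public seed.
    A stand-in for the seeded extractor g(T, U^d)."""
    n = len(t_bits)
    col, row = seed_bits[:k], seed_bits[k - 1:k - 1 + n]  # first column / first row
    H = np.array([[col[i - j] if i >= j else row[j - i] for j in range(n)]
                  for i in range(k)])
    return H.dot(t_bits) % 2  # k nearly uniform output bits

n, k = 64, 16
d = n + k - 1                          # seed length for the Toeplitz construction
T = rng.integers(0, 2, n)              # shared reconciled sequence
U_d = rng.integers(0, 2, d)            # public uniform seed (part of the helper data)
K = toeplitz_hash(T, U_d, k)           # both terminals compute the same key
print(K)
```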
Sequential key distillation protocol: For every source realization $s \in \mathcal{S}$, there is an $\ell = \ell(s) \in \mathcal{L}$ such that $Q_s \in \mathcal{Q}_{XY,\ell}$. For every $\ell \in \mathcal{L}$, we perform the following protocol:
  • Repeat the reconciliation protocol $i \in \mathbb{N}$ times, creating $i$ shared sequences $T_1, T_2, \ldots, T_i$ of length $n$.
  • Perform the privacy amplification phase based on an extractor with output size $k$, i.e., $K = g(T_1, T_2, \ldots, T_i, U^d) = g(U_1^n, U_2^n, \ldots, U_i^n, U^d) = g(U^N, U^d)$ with $N = i \cdot n$. $U^d$ has to be transmitted over the public channel together with the public message $M^i$.
  • The total information available to the eavesdropper is $E = (M^i, U^d, \Theta)$, with $\Theta$ a binary random variable, introduced for calculation purposes, indicating whether $T_i \in T_{p_T, \delta}^n$.
In [13], it was shown that
$$H_\infty(T^i \mid M^i = m^i, \hat{L} = \ell, \Theta = 1, U^d) \ge N I(U; Y_s) - N \phi(\delta'', |\mathcal{U}|, |\mathcal{Y}|) - H(X|U) - 2i - i \phi(\delta'', |\mathcal{U}|, |\mathcal{Y}|) - \delta_\epsilon(i) \ge N(\gamma - \delta(N)), \tag{A7}$$
with $\lim_{n \to \infty} \delta_\epsilon(n) = 0$ (see Lemma 1 in [13]). Using Lemma 1, we have
$$H(K \mid M^i = m^i, \hat{L} = \ell, \Theta = 1, U^d) \ge k - \delta(N), \tag{A8}$$
which implies that
$$H(K \mid \hat{L} = \ell) \ge H(K \mid M^i, \Theta, U^d, \hat{L} = \ell) \ge k - \delta(N). \tag{A9}$$
Since this holds for every $\ell \in \mathcal{L}$, we have that
$$\log |\mathcal{K}| = k \ge H(K) \ge H(K \mid \hat{L}).$$
Furthermore, we have
$$\begin{aligned}
H(K \mid \hat{L}) &= \sum_{\tilde{\ell} \in \mathcal{L}} \Pr\{\hat{L} = \tilde{\ell}\}\, H(K \mid \hat{L} = \tilde{\ell}) = \Pr\{\hat{L} = \ell\}\, H(K \mid \hat{L} = \ell) + \sum_{\substack{\tilde{\ell} \in \mathcal{L} \\ \tilde{\ell} \neq \ell}} \Pr\{\hat{L} = \tilde{\ell}\}\, H(K \mid \hat{L} = \tilde{\ell}) \\
&\le \Pr\{\hat{L} = \ell\}\, H(K \mid \hat{L} = \ell) + \Pr\{\hat{L} \neq \ell\} \max_{\ell' \in \mathcal{L}} H(K \mid \hat{L} = \ell') \\
&\le H(K \mid \hat{L} = \ell) + \epsilon_{\delta'}(n, |\mathcal{X}|)\, i N \log |\mathcal{X}| \tag{A10} \\
&\le H(K \mid \hat{L} = \ell) + (n+1)^{|\mathcal{X}|}\, i N \log |\mathcal{X}|\, 2^{-n c \delta'^2} = H(K \mid \hat{L} = \ell) + \epsilon'_{\delta'}(n, i, |\mathcal{X}|), \tag{A11}
\end{aligned}$$
where $\lim_{i,n \to \infty} \epsilon'_{\delta'}(n, i, |\mathcal{X}|) = 0$ and (A10) follows from (A1). We then have that
$$\big|H(K \mid \hat{L}) - H(K \mid \hat{L} = \ell)\big| \le \epsilon'_{\delta'}(n, i, |\mathcal{X}|), \tag{A12}$$
showing that $H(K \mid \hat{L})$ approaches $H(K \mid \hat{L} = \ell)$ as $n$ or $i$ (or both) grow.
Combining (A8), (A9) and (A12), we get
$$H(K) \ge H(K \mid \hat{L} = \ell) - \epsilon'_{\delta'}(n, i, |\mathcal{X}|) \ge k - \delta(N) - \epsilon'_{\delta'}(n, i, |\mathcal{X}|) = \log |\mathcal{K}| - \delta(N) - \epsilon'_{\delta'}(n, i, |\mathcal{X}|). \tag{A13}$$

Appendix A.1.8. Privacy Leakage

Another condition that has to be fulfilled by an achievable privacy secrecy rate pair is that the information rate provided by the helper data about the sequence X n is bounded. We show here that this condition is fulfilled.
For every source realization $s \in \mathcal{S}$, there is an $\ell = \ell(s) \in \mathcal{L}$ such that $Q_s \in \mathcal{Q}_{XY,\ell}$. For every $\ell \in \mathcal{L}$, we have
$$\frac{1}{N} I(X^N; M^i, \Theta, U^d, \hat{L}) = \frac{1}{N} I(X^N; \hat{L}) + \frac{1}{N} I(X^N; M^i, \Theta, U^d \mid \hat{L}) \le \frac{\log |\mathcal{L}|}{N} + \frac{1}{N} I(X^N; M^i, \Theta, U^d \mid \hat{L}). \tag{A14}$$
We analyze the second term on the right-hand side of (A14):
$$\frac{1}{N} I(X^N; M^i, \Theta, U^d \mid \hat{L}) \le \Pr\{\hat{L} = \ell\}\, \frac{1}{N} I(X^N; M^i, \Theta, U^d \mid \hat{L} = \ell) + \Pr\{\hat{L} \neq \ell\}\, \log |\mathcal{X}|. \tag{A15}$$
Similar to (A11), we have
$$\frac{1}{N} I(X^N; M^i, \Theta, U^d \mid \hat{L}) \le \frac{1}{N} I(X^N; M^i, \Theta, U^d \mid \hat{L} = \ell) + \epsilon_{\delta'}(n, |\mathcal{X}|)\, i \log |\mathcal{X}|. \tag{A16}$$
For every $\ell \in \mathcal{L}$, from (A6) it holds that
$$\frac{1}{N} I(X^N; M^i, \Theta, U^d \mid \hat{L} = \ell) \le \frac{i \log |\mathcal{M}|}{N} + \frac{d+1}{N} \le I(U; X \mid \hat{L} = \ell) - I(U; Y_s) + \phi(\delta'', |\mathcal{U}|, |\mathcal{Y}|) + \psi(\delta'', |\mathcal{U}|, |\mathcal{X}|) + \frac{d+1}{N}, \tag{A17}$$
with $\phi(\delta'', |\mathcal{U}|, |\mathcal{Y}|) > 0$ and $\psi(\delta'', |\mathcal{U}|, |\mathcal{X}|) > 0$. Combining (A14), (A15) and (A17), it follows that
$$\frac{1}{N} I(X^N; M^i, \Theta, U^d, \hat{L}) \le I(U; X \mid \hat{L} = \ell) - I(U; Y_s) + \frac{\log |\mathcal{L}|}{N} + \phi(\delta'', |\mathcal{U}|, |\mathcal{Y}|) + \psi(\delta'', |\mathcal{U}|, |\mathcal{X}|) + \epsilon_{\delta'}(n, |\mathcal{X}|)\, i \log |\mathcal{X}|,$$
where the last three terms on the right-hand side go to zero for $n$ and $i$ large enough.

Appendix A.1.9. Secrecy Leakage

The last condition that has to be fulfilled by an achievable privacy secrecy rate pair is that the information rate provided by the helper data about the secret key is negligibly small. For every source realization $s \in \mathcal{S}$, there is an $\ell = \ell(s) \in \mathcal{L}$ such that $Q_s \in \mathcal{Q}_{XY,\ell}$. For every $\ell \in \mathcal{L}$, we have
$$I(K; M^i, \Theta, U^d, \hat{L}) = I(K; \hat{L}) + I(K; M^i, \Theta, U^d \mid \hat{L}). \tag{A18}$$
We first consider the first term of (A18). Using (A8) and (A12), we get
$$I(K; \hat{L}) = H(K) - H(K \mid \hat{L}) \le \delta(N) + \epsilon'_{\delta'}(n, i, |\mathcal{X}|). \tag{A19}$$
We now consider the second term of (A18). Using (A13), we get
$$\begin{aligned}
I(K; M^i, \Theta, U^d \mid \hat{L}) &\le \Pr\{\hat{L} = \ell\}\, I(K; M^i, \Theta, U^d \mid \hat{L} = \ell) + \Pr\{\hat{L} \neq \ell\}\, N \log |\mathcal{X}| \\
&\le H(K \mid \hat{L} = \ell) - H(K \mid M^i, \Theta, U^d, \hat{L} = \ell) + \epsilon_{\delta'}(n, |\mathcal{X}|)\, i N \log |\mathcal{X}| \\
&\le \log |\mathcal{K}| - \log |\mathcal{K}| + \delta(N) + \epsilon'_{\delta'}(n, i, |\mathcal{X}|) = \delta(N) + \epsilon'_{\delta'}(n, i, |\mathcal{X}|).
\end{aligned}$$
Hence,
$$I(K; M^i, \Theta, U^d, \hat{L}) \le 2\delta(N) + 2\epsilon'_{\delta'}(n, i, |\mathcal{X}|).$$
Note that the right-hand side of the inequality goes to zero for $N$ large enough, showing that, for every source realization $s \in \mathcal{S}$, the secret key information rate leaked by the helper data is negligibly small.
Note that we have shown that the rate pair can be achieved for large $N = i \cdot n$, i.e., not yet for all $N \in \mathbb{N}$. To show achievability for all blocklengths $N \in \mathbb{N}$, we define the sequence $N_i$, $i \in \mathbb{N}$, with $N_i = i^2$. We showed that, for the sequence $N_i$ of blocklengths with $i \in \mathbb{N}$, there exists a blocklength $N_{i_0}$ such that, for all blocklengths $N_i > N_{i_0}$, we can find a code sequence that fulfills the achievability conditions. For every $N_i < N < N_{i+1}$, one can write $N = N_i + r_i$ with $r_i < N_{i+1} - N_i$. We use only the first $N_i$ symbols to generate the key and discard the remaining $r_i$. One can easily see that there is an $\epsilon(N)$ such that, for $\delta = \epsilon(N)$, all conditions are fulfilled. This completes the proof of achievability.

Appendix A.2. Converse of Theorem 3

For the converse, we consider a genie-aided enrollment and authentication terminal, i.e., the users at the enrollment and authentication terminals have partial knowledge of the source: they know the actual state $\ell$ of the marginal distribution but not the complete source state. The converse follows from the corresponding result for a joint-source with perfect SSI shown in [11]. For a fixed $\ell \in \mathcal{L}$, $s \in \mathcal{S}_\ell$ and $V: \mathcal{X} \to \mathcal{P}(\mathcal{U})$, we define the region $\mathcal{R}(V, \ell, s)$ as the set of all $(R_{PL}, R_K) \in \mathbb{R}_+^2$ that satisfy
$$R_K \le I(U; Y_s), \qquad R_{PL} \ge I(U; X \mid L = \ell) - I(U; Y_s).$$
We start by analyzing the secret key rate. For a fixed $\ell \in \mathcal{L}$ and $s \in \mathcal{S}_\ell$, we have
$$H(K) = H(K \mid L = \ell) = I(K; M Y_s^n \mid L = \ell) + H(K \mid M Y_s^n \hat{K}, L = \ell),$$
where $\hat{K}$ is a deterministic function of $M$, $Y^n$ and $L = \ell$, i.e., $\hat{K} = f(M, Y^n, L = \ell)$. Hence,
$$\begin{aligned}
H(K) &\le I(K; M Y_s^n \mid L = \ell) + H(K \mid \hat{K}) \le I(K; M Y_s^n \mid L = \ell) + \epsilon_n \tag{A21} \\
&= I(K; M \mid L = \ell) + I(K M; Y_s^n \mid L = \ell) + \epsilon_n \\
&= I(K; M \mid L = \ell) + \sum_{i=1}^n I(K M; Y_{s,i} \mid Y_s^{i-1}, L = \ell) + \epsilon_n \\
&\le I(K; M \mid L = \ell) + \sum_{i=1}^n I(K M Y_s^{i-1}; Y_{s,i} \mid L = \ell) + \epsilon_n \\
&\le I(K; M \mid L = \ell) + \sum_{i=1}^n I(K M X^{i-1}; Y_{s,i} \mid L = \ell) + \epsilon_n \tag{A22} \\
&= I(K; M \mid L = \ell) + n I(U; Y_s \mid L = \ell) + \epsilon_n, \tag{A23}
\end{aligned}$$
where (A21) holds for $\epsilon_n = 1 + \Pr\{\hat{K} \neq K\} \log K_n$ and follows from Fano's inequality, and (A22) from the fact that $Y^{i-1} - K M X^{i-1} - Y_i$ forms a Markov chain. The latter comes from
$$P_{K M Y^{i-1} X^{i-1} Y_i}(k, m, y^{i-1}, x^{i-1}, y_i) = \sum_{x_i} \sum_{x_{i+1}^n} p(x^{i-1})\, p(x_i)\, p(x_{i+1}^n)\, P_{K M \mid X^n}(k, m \mid x^n)\, W_s(y_i \mid x_i)\, W_s^{i-1}(y^{i-1} \mid x^{i-1}) = P_{X^{i-1} K M Y_i}(x^{i-1}, k, m, y_i)\, W_s^{i-1}(y^{i-1} \mid x^{i-1}) = p(x^{i-1}) \Pr(k, m, y_i \mid x^{i-1})\, W_s^{i-1}(y^{i-1} \mid x^{i-1}).$$
We define $U_{\ell,i} \triangleq (K, M, X^{i-1})$. The equality (A23) is obtained using a time-sharing variable $T$ uniformly distributed over $\{1, \ldots, n\}$ and independent of all other variables. Setting $U_\ell = (U_{\ell,T}, T)$, $X_\ell = X_{\ell,T}$ and $Y_s = Y_{s,T}$, we obtain
$$\sum_{i=1}^n I(K M X^{i-1}; Y_{s,i} \mid L = \ell) = \sum_{i=1}^n I(U_{\ell,i}; Y_{s,i} \mid L = \ell) = n I(U_{\ell,T}; Y_{s,T} \mid T, L = \ell) = n I\big((U_{\ell,T}, T); Y_{s,T} \mid L = \ell\big) = n I(U_\ell; Y_s \mid L = \ell).$$
Dividing by $n$, we get
$$\frac{1}{n} H(K) \le \frac{1}{n} I(K; M \mid L = \ell) + I(U_\ell; Y_s \mid L = \ell) + \frac{\epsilon_n}{n} \le I(U_\ell; Y_s \mid L = \ell) + \lambda_{n,\ell} + \frac{1}{n} + \frac{\epsilon_n}{n},$$
where the last inequality holds with $\lambda_{n,\ell} \to 0$ for $n \to \infty$ (see [11]).
Assuming the rate pair $(R_{PL}, R_K)$ is achievable, we have $\epsilon_n \le 1 + \delta \log K_n$ and obtain
$$R_K - \delta \le I(U_\ell; Y_s \mid L = \ell) + \lambda_{n,\ell} + \frac{1}{n} + \frac{1 + \delta \log K_n}{n}.$$
We continue with the privacy leakage. For a fixed $s \in \mathcal{S}_\ell$, we have
$$\begin{aligned}
I(X^n; M) &= I(X^n; M \mid L = \ell) = H(M \mid L = \ell) - H(M \mid X^n, L = \ell) \\
&\ge H(M \mid Y_s^n, L = \ell) - H(K M \mid X^n, L = \ell) \\
&= H(K M \mid Y_s^n, L = \ell) - H(K \mid M Y^n \hat{K}, L = \ell) - H(K M \mid X^n, L = \ell) \\
&\ge H(K M \mid Y_s^n, L = \ell) - H(K \mid \hat{K}) - H(K M \mid X^n, L = \ell) \\
&\ge H(K M \mid Y_s^n, L = \ell) - \epsilon_n - H(K M \mid X^n, L = \ell) \\
&= I(K M; X^n \mid L = \ell) - I(K M; Y_s^n \mid L = \ell) - \epsilon_n \\
&= \sum_{i=1}^n I(K M; X_{\ell,i} \mid X^{i-1}, L = \ell) - \sum_{i=1}^n I(K M; Y_{s,i} \mid Y_s^{i-1}, L = \ell) - \epsilon_n \\
&= \sum_{i=1}^n I(K M X^{i-1}; X_{\ell,i} \mid L = \ell) - \sum_{i=1}^n I(K M Y_s^{i-1}; Y_{s,i} \mid L = \ell) - \epsilon_n \\
&\ge \sum_{i=1}^n I(K M X^{i-1}; X_{\ell,i} \mid L = \ell) - \sum_{i=1}^n I(K M X^{i-1}; Y_{s,i} \mid L = \ell) - \epsilon_n \\
&= n I(U_\ell; X_\ell \mid L = \ell) - n I(U_\ell; Y_s \mid L = \ell) - \epsilon_n.
\end{aligned}$$
Dividing by $n$, we get
$$\frac{1}{n} I(X^n; M) \ge I(U_\ell; X_\ell \mid L = \ell) - I(U_\ell; Y_s \mid L = \ell) - \frac{\epsilon_n}{n}.$$
Assuming $(R_{PL}, R_K)$ is achievable, we have $\epsilon_n \le 1 + \delta \log K_n$ and obtain
$$R_{PL} + \delta \ge I(U_\ell; X_\ell \mid L = \ell) - I(U_\ell; Y_s \mid L = \ell) - \frac{1 + \delta \log K_n}{n}.$$
We have shown that $\mathcal{C}_G(\mathcal{Q}_{XY}) \subseteq \bigcap_{\ell \in \mathcal{L}} \mathcal{C}_\ell$. This means that, if $(R_{PL}, R_K) \in \mathcal{C}_G(\mathcal{Q}_{XY})$ holds, then we have that $(R_{PL}, R_K) \in \bigcap_{\ell \in \mathcal{L}} \mathcal{C}_\ell$. Equivalently, if $(R_{PL}, R_K) \notin \bigcap_{\ell \in \mathcal{L}} \mathcal{C}_\ell$, then $(R_{PL}, R_K) \notin \mathcal{C}_G(\mathcal{Q}_{XY})$. Assume $(R_{PL}^*, R_K^*) \notin \bigcap_{\ell \in \mathcal{L}} \mathcal{C}_\ell$. This implies that there exists an $\ell \in \mathcal{L}$ such that, for all auxiliary channels $V$, we have that $(R_{PL}^*, R_K^*) \notin \mathcal{R}(V, \ell)$, which implies that $(R_{PL}^*, R_K^*) \notin \mathcal{C}_G(\mathcal{Q}_{XY})$. This completes the converse and therewith proves the desired result.
It remains to derive the bound on the cardinality of the auxiliary random variable $U$. Let $\ell \in \mathcal{L}$ be arbitrary but fixed and let $U$ be a random variable fulfilling $P_{UXY,s}(u, x, y) = V(u|x) Q_s(x, y)$ for all $s \in \mathcal{S}_\ell$. We show that there is a random variable $\bar{U}$ with range $|\bar{\mathcal{U}}| = |\mathcal{X}| + |\mathcal{S}_\ell|$ such that
$$I(\bar{U}; Y_s) = I(U; Y_s), \qquad I(\bar{U}; X_\ell) - I(\bar{U}; Y_s) = I(U; X_\ell) - I(U; Y_s), \tag{A26}$$
for all $s \in \mathcal{S}_\ell$. We consider the following $|\mathcal{X}| + |\mathcal{S}_\ell|$ real-valued continuous functions on $\mathcal{P}(\mathcal{X})$:
$$f_x(p) = p(x) \ \text{for all } x \in \mathcal{X} \text{ but one}, \qquad g_s(p) = H(p W_s), \qquad h(p) = H(p),$$
for all $s \in \mathcal{S}_\ell$. We have that $\Sigma_\ell^X(\cdot \mid u) \in \mathcal{P}(\mathcal{X})$ has $\mu$-measure $p_{u_\ell}$. Then, it holds that
$$\sum_u p_{u_\ell}(u)\, f_x\big(\Sigma_\ell^X(\cdot \mid u)\big) = p_\ell(x), \qquad \sum_u p_{u_\ell}(u)\, g_s\big(\Sigma_\ell^X(\cdot \mid u)\big) = H(Y_s \mid U), \qquad \sum_u p_{u_\ell}(u)\, h\big(\Sigma_\ell^X(\cdot \mid u)\big) = H(X_\ell \mid U),$$
for all $s \in \mathcal{S}_\ell$. According to Lemma 15.4 in [27], there exists a random variable $\bar{U}$ with values in $\bar{\mathcal{U}} = \{1, \ldots, |\mathcal{X}| + |\mathcal{S}_\ell|\}$ fulfilling the Markov condition such that (A26) holds (see also Lemma 15.5 in [27]). □

Appendix B. Proof of Theorem 4

Appendix B.1. Achievability of Theorem 4

The achievability proof of Theorem 4 is very similar to the achievability proof of Theorem 3, where first the index of the marginal distribution over $\mathcal{X}$ is estimated. The difference is that, in this model, we use a generated secret key $K_{\ell,g}$ in a one-time pad system to conceal the uniformly distributed chosen key $K$ over the set $\mathcal{K}$; as in [11], it is additionally sent together with the generated helper message $M_{\ell,g}$ and the index of the estimated marginal distribution $\hat{L}$ over the public message, i.e., the helper data is $M = (M_{\ell,g}, K \oplus K_{\ell,g}, \hat{L})$. The error analysis is similar to the error analysis for Theorem 3, and the key is already uniformly distributed; however, we should take a closer look at the privacy leakage and the secrecy leakage. We perform the privacy amplification step as in Appendix A to show that strong secrecy is fulfilled.

Appendix B.1.1. Privacy Leakage

Another condition that has to be fulfilled by an achievable privacy secrecy rate pair is that the information rate provided by the helper data about the sequence X n is bounded. We show here that this condition is fulfilled.
For every source realization $s \in \mathcal{S}$, there is an $\ell = \ell(s)$ such that $Q_s \in \mathcal{Q}_{XY,\ell}$. We have
$$\frac{1}{N} I(X^N; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d, \hat{L}) \le \frac{\log |\mathcal{L}|}{N} + \frac{1}{N} I(X^N; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d \mid \hat{L}). \tag{A27}$$
We analyze the second term on the right-hand side of (A27):
$$\frac{1}{N} I(X^N; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d \mid \hat{L}) \le \Pr\{\hat{L} = \ell\}\, \frac{1}{N} I(X^N; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d \mid \hat{L} = \ell) + \Pr\{\hat{L} \neq \ell\}\, \log |\mathcal{X}|. \tag{A28}$$
Similar to (A11), we have
$$\frac{1}{N} I(X^N; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d \mid \hat{L}) \le \frac{1}{N} I(X^N; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d \mid \hat{L} = \ell) + \epsilon_{\delta'}(n, i, |\mathcal{X}|)\, \log |\mathcal{X}|. \tag{A29}$$
In [11], the authors show that, for every $\ell \in \mathcal{L}$ (with $Q_s \in \mathcal{Q}_{XY,\ell}$), it holds that
$$\begin{aligned}
\frac{1}{N} I(X^N; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d \mid \hat{L} = \ell) &\le \frac{1}{N} I(X^N; M_{\ell,g}^i, \Theta, U^d \mid \hat{L} = \ell) + \frac{1}{N} H(K \oplus K_{\ell,g} \mid \hat{L} = \ell) - \frac{1}{N} H(K \oplus K_{\ell,g} \mid X^n, M_{\ell,g}^i, \Theta, U^d, K_{\ell,g}, \hat{L} = \ell) \\
&\le \frac{1}{N} I(X^N; M_{\ell,g}^i, \Theta, U^d \mid \hat{L} = \ell) + \frac{1}{N} \log K_N - \frac{1}{N} \log K_N \\
&\le I(U; X \mid \hat{L} = \ell) - I(U; Y_s) + \phi(\delta'', |\mathcal{U}|, |\mathcal{Y}|) + \psi(\delta'', |\mathcal{U}|, |\mathcal{X}|) + \frac{d+1}{N},
\end{aligned}$$
which proves the bound on the privacy leakage.

Appendix B.1.2. Secrecy Leakage

For every source realization $s \in \mathcal{S}$, there is an $\ell = \ell(s)$ such that $Q_s \in \mathcal{Q}_{XY,\ell}$. Following similar steps as for the privacy leakage, the secrecy leakage can be upper bounded. We have
$$I(K; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d, \hat{L}) = I(K; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d \mid \hat{L}). \tag{A30}$$
We analyze the right-hand side of (A30):
$$I(K; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d \mid \hat{L}) = \sum_{\tilde{\ell} \in \mathcal{L}} \Pr\{\hat{L} = \tilde{\ell}\}\, I(K; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d \mid \hat{L} = \tilde{\ell}) = \Pr\{\hat{L} = \ell\}\, I(K; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d \mid \hat{L} = \ell) + \Pr\{\hat{L} \neq \ell\}\, I(K; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d \mid \hat{L} \neq \ell). \tag{A31}$$
For every $\ell \in \mathcal{L}$, it holds that
$$I(K; M_{\ell,g}^i, K \oplus K_{\ell,g}, \Theta, U^d \mid \hat{L} = \ell) \le \log |\mathcal{K}| - H(K_{\ell,g} \mid \hat{L} = \ell) + I(K_{\ell,g}; M_{\ell,g}^i, \Theta, U^d \mid \hat{L} = \ell) \le \delta(N) + \epsilon'_{\delta'}(n, i, |\mathcal{X}|) + I(K_{\ell,g}; M_{\ell,g}^i, \Theta, U^d \mid \hat{L} = \ell). \tag{A32}$$
The last inequality follows from (A13). Substituting $K_{\ell,g}$ for $K$, combining (A31) with (A19), and letting $i, n \to \infty$, we obtain the desired result.

Appendix B.2. Converse of Theorem 4

The converse of Theorem 4 can be shown using the same lines of arguments as for the converse of Theorem 3. □

Appendix C. Proof of Theorem 5

For every channel $V: \mathcal{X} \to \mathcal{P}(\mathcal{U})$, every $s_1 \in \mathcal{S}_1$ and every $s_2 \in \mathcal{S}_2$, we have the following effective sources:
$$P_{UXY,s_1}(u, x, y) = V(u|x) Q_{s_1}(x, y), \qquad P_{UXY,s_2}(u, x, y) = V(u|x) Q_{s_2}(x, y).$$
Let $D_H(\mathcal{Q}_{XY,1}, \mathcal{Q}_{XY,2}) \le \epsilon$ and let $(\bar{V}, \bar{s}_1, \bar{s}_2)$, with $\bar{s}_1 \in \mathcal{S}_1$ and $\bar{s}_2 \in \mathcal{S}_2$, be the channel and state pair attaining the maximum in the Hausdorff distance. Then, we have that
$$\|P_{UXY,\bar{s}_1} - P_{UXY,\bar{s}_2}\|_{TV} = \sum_{u \in \mathcal{U}} \sum_{(x,y) \in \mathcal{X} \times \mathcal{Y}} \big|V(u|x) Q_{\bar{s}_1}(x, y) - V(u|x) Q_{\bar{s}_2}(x, y)\big| = \sum_{(x,y) \in \mathcal{X} \times \mathcal{Y}} \big|Q_{\bar{s}_1}(x, y) - Q_{\bar{s}_2}(x, y)\big| \sum_{u \in \mathcal{U}} V(u|x) = \sum_{(x,y) \in \mathcal{X} \times \mathcal{Y}} \big|Q_{\bar{s}_1}(x, y) - Q_{\bar{s}_2}(x, y)\big| \le \epsilon,$$
and
$$\|P_{U,\bar{s}_1} - P_{U,\bar{s}_2}\|_{TV} = \sum_{u \in \mathcal{U}} \Big|\sum_{(x,y) \in \mathcal{X} \times \mathcal{Y}} V(u|x) \big(Q_{\bar{s}_1}(x, y) - Q_{\bar{s}_2}(x, y)\big)\Big| \le \sum_{u \in \mathcal{U}} \sum_{(x,y) \in \mathcal{X} \times \mathcal{Y}} V(u|x) \big|Q_{\bar{s}_1}(x, y) - Q_{\bar{s}_2}(x, y)\big| \le \epsilon.$$
For every channel $V: \mathcal{X} \to \mathcal{P}(\mathcal{U})$ and for every $s_1 \in \mathcal{S}_1$ and $s_2 \in \mathcal{S}_2$, there are $\ell_1 = \ell_1(s_1)$ and $\ell_2 = \ell_2(s_2)$, and the region $\mathcal{R}(V, \ell_i, s_i)$ with $i \in \{1, 2\}$ is rectangular. Therefore, to calculate the Hausdorff distance between regions, we are only interested in the corner points:
$$R_{K,s_i} = I(U_{\ell_i}; Y_{s_i}), \qquad R_{PL,s_i} = I(U_{\ell_i}; X_{\ell_i}) - I(U_{\ell_i}; Y_{s_i}).$$
Let $V$ be arbitrary but fixed. Then, for every $s_1 \in \mathcal{S}_1$ and $s_2 \in \mathcal{S}_2$, we have
$$\big|I(U_{\ell_1}; Y_{s_1}) - I(U_{\ell_2}; Y_{s_2})\big| = \big|H(Y_{s_1}) - H(Y_{s_2}) + H(Y_{s_2} \mid U_{\ell_2}) - H(Y_{s_1} \mid U_{\ell_1})\big| \le \big|H(Y_{s_1}) - H(Y_{s_2})\big| + \big|H(Y_{s_2} \mid U_{\ell_2}) - H(Y_{s_1} \mid U_{\ell_1})\big|.$$
For $\bar{V}$, $\bar{s}_1$ and $\bar{s}_2$, there are $\bar{\ell}_1 = \bar{\ell}_1(\bar{s}_1)$ and $\bar{\ell}_2 = \bar{\ell}_2(\bar{s}_2)$. Using Lemma 2.12 in [27] and Lemma 1 in [22], we get
$$\big|I(U_{\bar{\ell}_1}; Y_{\bar{s}_1}) - I(U_{\bar{\ell}_2}; Y_{\bar{s}_2})\big| \le 2\epsilon \log |\mathcal{Y}| + 2 H_2(\epsilon) - \epsilon \log \frac{\epsilon}{|\mathcal{U}|}. \tag{A34}$$
Following the same line of arguments as for (A34), we get
$$\big|I(U_{\bar{\ell}_1}; X_{\bar{\ell}_1}) - I(U_{\bar{\ell}_2}; X_{\bar{\ell}_2})\big| \le 2\epsilon \log |\mathcal{X}| + 2 H_2(\epsilon) - \epsilon \log \frac{\epsilon}{|\mathcal{U}|}.$$
Hence, for every channel $V: \mathcal{X} \to \mathcal{P}(\mathcal{U})$, $\bar{s}_1$ and $\bar{s}_2$, we have
$$D_H\big(\mathcal{R}(V, \bar{\ell}_1, \bar{s}_1), \mathcal{R}(V, \bar{\ell}_2, \bar{s}_2)\big) \le \delta(\epsilon), \tag{A35}$$
with $\delta(\epsilon) = \sqrt{\delta_1(\epsilon)^2 + \delta_2(\epsilon)^2}$, where $\delta_1(\epsilon) = 2\epsilon \log |\mathcal{Y}| + 2 H_2(\epsilon) - \epsilon \log \frac{\epsilon}{|\mathcal{U}|}$ and $\delta_2(\epsilon) = 2\epsilon \log |\mathcal{Y}||\mathcal{X}| + 4 H_2(\epsilon) - 2\epsilon \log \frac{\epsilon}{|\mathcal{U}|}$.
For fixed $\ell_1$ and $\ell_2$, we denote
$$\mathcal{R}(V, \ell_1) = \bigcap_{s_1 \in \mathcal{S}_{\ell_1}} \mathcal{R}(V, \ell_1, s_1), \qquad \mathcal{R}(V, \ell_2) = \bigcap_{s_2 \in \mathcal{S}_{\ell_2}} \mathcal{R}(V, \ell_2, s_2), \qquad \mathcal{R}(\ell_1) = \bigcup_{\substack{V: \mathcal{X} \to \mathcal{P}(\mathcal{U}) \\ |\mathcal{U}| \le |\mathcal{X}| + |\mathcal{S}_{\ell_1}|}} \mathcal{R}(V, \ell_1), \qquad \mathcal{R}(\ell_2) = \bigcup_{\substack{V: \mathcal{X} \to \mathcal{P}(\mathcal{U}) \\ |\mathcal{U}| \le |\mathcal{X}| + |\mathcal{S}_{\ell_2}|}} \mathcal{R}(V, \ell_2).$$
We have
$$D_H\big(\mathcal{R}(V, \ell_1), \mathcal{R}(V, \ell_2)\big) = D_H\Big(\bigcap_{s_1 \in \mathcal{S}_{\ell_1}} \mathcal{R}(V, \ell_1, s_1), \bigcap_{s_2 \in \mathcal{S}_{\ell_2}} \mathcal{R}(V, \ell_2, s_2)\Big) \tag{A36}$$
$$= D_H\Big(\bigcup_{s_1 \in \mathcal{S}_{\ell_1}} \mathcal{R}(V, \ell_1, s_1)^c, \bigcup_{s_2 \in \mathcal{S}_{\ell_2}} \mathcal{R}(V, \ell_2, s_2)^c\Big) \le D_H\big(\mathcal{R}(\bar{V}, \bar{\ell}_1, \bar{s}_1)^c, \mathcal{R}(\bar{V}, \bar{\ell}_2, \bar{s}_2)^c\big) \le \delta(\epsilon). \tag{A37}$$
The equality in (A37) holds since the Hausdorff distance between two sets equals the Hausdorff distance between the complements of the sets. The inequality in (A37) holds since $(\bar{V}, \bar{s}_1, \bar{s}_2)$ indexes the sets that maximize the Hausdorff distance. It also holds that
$$D_H\big(\mathcal{R}(\ell_1), \mathcal{R}(\ell_2)\big) = D_H\Big(\bigcup_{\substack{V: \mathcal{X} \to \mathcal{P}(\mathcal{U}) \\ |\mathcal{U}| \le |\mathcal{X}| + |\mathcal{S}_{\ell_1}|}} \mathcal{R}(V, \ell_1), \bigcup_{\substack{V: \mathcal{X} \to \mathcal{P}(\mathcal{U}) \\ |\mathcal{U}| \le |\mathcal{X}| + |\mathcal{S}_{\ell_2}|}} \mathcal{R}(V, \ell_2)\Big) \le D_H\big(\mathcal{R}(\bar{V}, \bar{\ell}_1, \bar{s}_1)^c, \mathcal{R}(\bar{V}, \bar{\ell}_2, \bar{s}_2)^c\big) \le \delta(\epsilon),$$
and
$$D_H\big(\mathcal{C}_G(\mathcal{Q}_{XY,1}), \mathcal{C}_G(\mathcal{Q}_{XY,2})\big) = D_H\Big(\bigcap_{\ell_1 \in \mathcal{L}_1} \mathcal{R}(\ell_1), \bigcap_{\ell_2 \in \mathcal{L}_2} \mathcal{R}(\ell_2)\Big) = D_H\Big(\bigcup_{\ell_1 \in \mathcal{L}_1} \mathcal{R}(\ell_1)^c, \bigcup_{\ell_2 \in \mathcal{L}_2} \mathcal{R}(\ell_2)^c\Big) \le D_H\big(\mathcal{R}(\bar{V}, \bar{\ell}_1, \bar{s}_1)^c, \mathcal{R}(\bar{V}, \bar{\ell}_2, \bar{s}_2)^c\big) \le \delta(\epsilon). \qquad \square$$

References

1. Shannon, C.E. Communication theory of secrecy systems. Bell Syst. Tech. J. 1949, 28, 656–715.
2. Liang, Y.; Poor, H.V.; Shamai, S. Information theoretic security. Found. Trends Commun. Inf. Theor. 2009, 5, 355–580.
3. Bloch, M.; Barros, J. Physical-Layer Security; Cambridge University Press: Cambridge, UK, 2011.
4. Schaefer, R.F.; Boche, H.; Khisti, A.; Poor, H.V. Information Theoretic Security and Privacy of Information Systems; Cambridge University Press: Cambridge, UK, 2017.
5. Ahlswede, R.; Csiszár, I. Common randomness in information theory and cryptography—Part I: Secret sharing. IEEE Trans. Inf. Theor. 1993, 39, 1121–1132.
6. Maurer, U.M. Secret key agreement by public discussion from common information. IEEE Trans. Inf. Theor. 1993, 39, 733–742.
7. Maurer, U.; Wolf, S. Information-theoretic key agreement: From weak to strong secrecy for free. Adv. Crypt. EUROCRYPT 2000, 1807, 351–368.
8. Schneier, B. Inside risks: The uses and abuses of biometrics. Commun. ACM 1999, 42, 136.
9. Ratha, N.K.; Connell, J.H.; Bolle, R.M. Enhancing security and privacy in biometrics-based authentication systems. IBM Syst. J. 2001, 40, 614–634.
10. Prabhakar, S.; Pankanti, S.; Jain, A.K. Biometric recognition: Security and privacy concerns. IEEE Secur. Priv. 2003, 1, 33–42.
11. Ignatenko, T.; Willems, F.M. Biometric systems: Privacy and secrecy aspects. IEEE Trans. Inf. Forensics Secur. 2009, 4, 956–973.
12. Lai, L.; Ho, S.W.; Poor, H.V. Privacy–security trade-offs in biometric security systems—Part I: Single use case. IEEE Trans. Inf. Forensics Secur. 2011, 6, 122–139.
13. Chou, R.A.; Bloch, M.R. One-way rate-limited sequential key-distillation. In Proceedings of the IEEE International Symposium on Information Theory, Cambridge, MA, USA, 1–6 July 2012; pp. 1777–1781.
14. Wolfowitz, J. Simultaneous channels. Arch. Ration. Mech. Anal. 1959, 4, 371–386.
15. Blackwell, D.; Breiman, L.; Thomasian, A. The capacity of a class of channels. Ann. Math. Stat. 1959, 30, 1229–1241.
16. Boche, H.; Wyrembelski, R.F. Secret key generation using compound sources-optimal key-rates and communication costs. In Proceedings of the 9th International ITG Conference on Systems, Communication and Coding, München, Germany, 21–24 January 2013; pp. 1–6.
17. Bloch, M. Channel intrinsic randomness. In Proceedings of the IEEE International Symposium on Information Theory, Austin, TX, USA, 13–18 June 2010; pp. 2607–2611.
18. Chou, R.; Bloch, M.R. Secret-key generation with arbitrarily varying eavesdropper's channel. In Proceedings of the IEEE Global Conference on Signal and Information Processing, Austin, TX, USA, 3–5 December 2013; pp. 277–280.
19. Tavangaran, N.; Boche, H.; Schaefer, R.F. Secret-key generation using compound sources and one-way public communication. IEEE Trans. Inf. Forensics Secur. 2017, 12, 227–241.
20. Grigorescu, A.; Boche, H.; Schaefer, R.F. Robust PUF based authentication. In Proceedings of the IEEE International Workshop on Information Forensics and Security, Rome, Italy, 16–19 November 2015; pp. 1–6.
21. Boche, H.; Nötzel, J. Positivity, discontinuity, finite resources, and nonzero error for arbitrarily varying quantum channels. J. Math. Phys. 2014, 55, 122201.
22. Boche, H.; Schaefer, R.F.; Poor, H.V. On the continuity of the secrecy capacity of compound and arbitrarily varying wiretap channels. IEEE Trans. Inf. Forensics Secur. 2015, 10, 2531–2546.
23. Grigorescu, A.; Boche, H.; Schaefer, R.F.; Poor, H.V. Capacity region continuity of the compound broadcast channel with confidential messages. In Proceedings of the IEEE Information Theory Workshop, Jerusalem, Israel, 24 April–1 May 2015; pp. 1–6.
24. Wolfowitz, J. Coding Theorems of Information Theory; Springer: New York, NY, USA, 1978.
25. Schaefer, R.F.; Boche, H.; Poor, H.V. Super-activation as a unique feature of secure communication in malicious environments. Information 2016, 7, 24.
26. Boche, H.; Schaefer, R.F.; Poor, H.V. Characterization of Super-Additivity and Discontinuity Behavior of the Capacity of Arbitrarily Varying Channels Under List Decoding. Available online: http://ieeexplore.ieee.org/abstract/document/8007044/ (accessed on 7 September 2017).
27. Csiszár, I.; Körner, J. Information Theory: Coding Theorems for Discrete Memoryless Systems; Cambridge University Press: Cambridge, UK, 2011.
28. Bjelaković, I.; Boche, H.; Sommerfeld, J. Secrecy results for compound wiretap channels. Probl. Inf. Transm. 2013, 49, 73–98.
Figure 1. The biometric measurements $X^n$ and $Y^n$ are observed at the enrollment and authentication terminals, respectively. In the enrollment terminal, the key $K$ and the helper data $M$ are generated. The helper data is public; hence, the eavesdropper also has access to it. In the authentication terminal, an estimate $\hat{K}$ of the key is made based on the observed biometric measurements $Y^n$ and the helper data $M$.
Figure 2. The biometric sequences $X^n$ and $Y^n$ are observed at the enrollment and authentication terminals, respectively. In the enrollment terminal, the helper data $M$ is generated for a given secret key $K$. The helper data is public; hence, the eavesdropper also has access to it. In the authentication terminal, an estimate $\hat{K}$ of the key is made based on the observed biometric authentication sequence $Y^n$ and the helper data $M$.
Figure 3. The attacker controls the state of the source $s \in \mathcal{S}$. The biometric sequences $X^n$ and $Y^n$ are observed at the enrollment and authentication terminals, respectively. In the enrollment terminal, the key $K$ and the helper data $M$ are generated. The helper data is public; hence, the attacker also has access to it. In the authentication terminal, an estimate $\hat{K}$ of the key is made based on the observed authentication sequence $Y^n$ and the helper data $M$.
Figure 4. The attacker controls the state of the source $s \in \mathcal{S}$. The biometric sequences $X^n$ and $Y^n$ are observed at the enrollment and authentication terminals, respectively. In the enrollment terminal, the key $K$ is predefined and the helper data $M$ is generated. The helper data is public; hence, the attacker also has access to it. In the authentication terminal, an estimate $\hat{K}$ of the key is made based on the observed authentication sequence $Y^n$ and the helper data $M$.
