Next Article in Journal
Hybrid Beamforming Design for Self-Interference Cancellation in Full-Duplex Millimeter-Wave MIMO Systems with Dynamic Subarrays
Next Article in Special Issue
Quantum Information Entropy for a Hyperbolic Double Well Potential in the Fractional Schrödinger Equation
Previous Article in Journal
Variational Quantum Algorithm Applied to Collision Avoidance of Unmanned Aerial Vehicles
Previous Article in Special Issue
Universal Framework for Quantum Error-Correcting Codes
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Entropy of Quantum Measurements

Department of Mathematics, University of Denver, Denver, CO 80208, USA
Entropy 2022, 24(11), 1686; https://doi.org/10.3390/e24111686
Submission received: 25 October 2022 / Revised: 12 November 2022 / Accepted: 14 November 2022 / Published: 18 November 2022
(This article belongs to the Special Issue Entropy in Quantum Systems and Quantum Field Theory (QFT) II)

Abstract

:
If a is a quantum effect and ρ is a state, we define the ρ -entropy S a ( ρ ) which gives the amount of uncertainty that a measurement of a provides about ρ . The smaller S a ( ρ ) is, the more information a measurement of a gives about ρ . In Entropy for Effects, we provide bounds on S a ( ρ ) and show that if a + b is an effect, then S a + b ( ρ ) S a ( ρ ) + S b ( ρ ) . We then prove a result concerning convex mixtures of effects. We also consider sequential products of effects and their ρ -entropies. In Entropy of Observables and Instruments, we employ S a ( ρ ) to define the ρ -entropy S A ( ρ ) for an observable A. We show that S A ( ρ ) directly provides the ρ -entropy S I ( ρ ) for an instrument I . We establish bounds for S A ( ρ ) and prove characterizations for when these bounds are obtained. These give simplified proofs of results given in the literature. We also consider ρ -entropies for measurement models, sequential products of observables and coarse-graining of observables. Various examples that illustrate the theory are provided.

1. Introduction

In an interesting article, D. Šafránek and J. Thingna introduce the concept of entropy for quantum instruments [1]. Various important theorems are proved and applications are given. In quantum computation and information theory one of the most important problems is to determine an unknown state by applying measurements on the system [2,3,4,5]. Entropy provides a quantification for the amount of information given to solve this so-called state discrimination problem [6,7,8]. In this article, we first define the entropy for the most basic measurement, namely a quantum effect a [2,3,9,10]. If ρ is a state, we define the ρ -entropy S a ( ρ ) which gives the amount of uncertainty (or randomness) that a measurement of a provides about ρ . The smaller S a ( ρ ) is, the more information a measurement of a provides about ρ . In Section 2, we give bounds on S a ( ρ ) and show that if a + b is an effect then S a + b ( ρ ) S a ( ρ ) + S b ( ρ ) . We then prove a result concerning convex mixtures of effects. We also consider sequential products of effects and their ρ -entropies.
In Section 3, we employ S a ( ρ ) to define the entropy S A ( ρ ) for an observable A. Then S A ( ρ ) gives the uncertainty that a measurement of A provides about ρ . We show that S A ( ρ ) directly gives the ρ -entropy S I ( ρ ) for an instrument I . We establish bounds for S A ( ρ ) and characterize when these bounds are obtained. These give simplified proofs of results given in [1,5,11]. We also consider ρ -entropies for measurement models, sequential products of observables and coarse-graining of observables. Various examples that illustrate the theory are provided. In this work, all Hilbert spaces are assumed to be finite dimensional. Although this is a restriction, the work applies for quantum computation and information theory [2,3,9,10].

2. Entropy for Effects

Let H be a finite dimensional complex Hilbert space with dimension n. We denote the set of linear operators on H by L ( H ) and the set of states on H by S ( H ) . If ρ S ( H ) with nonzero eigenvalues λ 1 , λ 2 , , λ m including multiplicities, the von Neumann entropy of ρ is [4,6,7,8].
S ( ρ ) = i = 1 m λ i ln ( λ i ) = tr ρ ln ( ρ )
We consider S ( ρ ) as a measure of the randomness or uncertainty of ρ and smaller values of S ( ρ ) indicate more information content. For example, ρ is the completely random state I / n , where I is the identity operator, if and only if S ( ρ ) = ln ( n ) and ρ is a pure state if and only if S ( ρ ) = 0 . Moreover, it is well-known that 0 S ( ρ ) ln ( n ) for all ρ S ( H ) . The following properties of S are well-known [4,6,8]:
S ( U ρ U * ) = S ( ρ ) when U is unitary S ( ρ 1 ρ 2 ) = S ( ρ 1 ) + S ( ρ 2 ) μ i S ( ρ i ) S μ i ρ i μ i S ( ρ i ) μ i ln ( μ i )
where 0 μ i = 1 with μ i = 1 .
An operator a L ( H ) that satisfies 0 a I is called an effect [2,3,9,10]. We think of an effect a as a two-outcome yes-no measurement. If a measurement of a results in outcome yes we say that a occurs and if it results in outcome no then a does not occur. The effect a = I a is the complement of a and a occurs if and only if a does not occur. We denote the set of effects by E ( H ) . If a E ( H ) and ρ S ( H ) then 0 tr ( ρ a ) 1 and we interpret tr ( ρ a ) as the probability that a occurs when the system is in state ρ . If a 0 we define the ρ -entropy of a to be
S a ( ρ ) = tr ( ρ a ) ln tr ( ρ a ) tr ( a )
We interpret S a ( ρ ) as the amount of uncertainty that the system is in state ρ resulting from a measurement of a. The smaller S a ( ρ ) is, the more information a measurement of a gives about ρ . Such information is useful for state discrimination problems [2,3,4,5].
If ρ is the completely random state I / n then (1) becomes
S a ( I / n ) = tr ( I a / n ) ln tr ( I a / n ) tr ( a ) = 1 n tr ( a ) ln 1 n = tr ( a ) n ln ( n )
Since tr ( a ) n we conclude that S a ( I / n ) S ( I / n ) for all a E ( H ) . Another extreme case is when a = λ I for 0 < λ 1 . We then have for any ρ S ( H ) that
S λ I ( ρ ) = tr ( ρ λ I ) ln tr ( ρ λ I ) tr ( λ I ) = λ ln λ λ tr ( I ) = λ ln ( n )
Thus, as λ gets smaller, the more information we gain.
A real-valued function with domain D ( f ) , an interval in R , is strictly convex if for any x 1 , x 2 D ( f ) with x 1 x 2 and 0 < λ < 1 we have
f λ x 1 + ( 1 λ ) x 2 < λ f ( x 1 ) + ( 1 λ ) f ( x 2 )
If the opposite inequality holds, then f is strictly concave. It is clear that f is strictly convex if and only if f is strictly concave. Of special importance in this work are the strictly convex functions ln x and x ln x . We shall frequently employ Jensen’s theorem which says: if f is strictly convex and 0 μ i 1 with i = 1 m μ i = 1 , then
f i = 1 m μ i x i i = 1 m μ i f ( x i )
Moreover, we have equality if and only if x i = x j for all i , j = 1 , 2 , , m [1].
Theorem 1. 
If ρ S ( H ) with nonzero eigenvalues λ i , i = 1 , 2 , , m , and a E ( H ) with tr ( ρ a ) 0 , then
i tr ( P i a ) λ i ln ( λ i ) S a ( ρ ) ln tr ( a ) tr ( ρ a )
where ρ = i λ i P i is the spectral decomposition of ρ. Moreover, S a ( ρ ) = ln tr ( a ) / tr ( ρ a ) if and only if tr ( ρ a ) = 1 in which case S a ( ρ ) = l n tr ( a ) and if
S a ( ρ ) = i tr ( P i a ) λ i ln ( λ i )
then tr ( P i a ) = tr ( P j a ) for all i , j = 1 , 2 , , m and S a ( ρ ) = ( tr ( a ) / m ) S ( ρ ) while if tr ( P i a ) = tr ( P j a ) for all i , j = 1 , 2 , m then S a ( ρ ) = ( tr ( a ) / m ) ln ( m ) .
Proof. 
Letting μ j = tr ( P j a ) / tr ( a ) , j = 1 , 2 , , m , we have that 0 μ j 1 and j μ j = 1 . Since x ln ( x ) is strictly concave we obtain
S a ( ρ ) = tr ( ρ a ) ln tr ( ρ a ) tr ( a ) = tr i λ i P i a ln tr j λ j P j a tr ( a ) = λ i tr ( P i a ) ln j λ j μ j = tr ( a ) i λ i μ i j λ j μ j tr ( a ) i μ i λ i ln ( λ i ) = tr ( a ) i tr ( P i a ) tr ( a ) λ i ln ( λ i ) = i tr ( P i a ) λ i ln ( λ i )
Since
tr ( ρ a ) = tr ( a 1 / 2 ρ a 1 / 2 ) tr ( ρ ) = 1
we have that
S a ( ρ ) = tr ( ρ a ) ln tr ( a ) tr ( ρ a ) ln tr ( a ) tr ( ρ a )
If tr ( ρ a ) = 1 , then
S a ( ρ ) = tr ( ρ a ) ln tr ( ρ a ) tr ( ρ a ) = ln 1 tr ( a ) = ln tr ( a )
Conversely, if S a ( ρ ) = ln tr ( a ) / tr ( ρ a ) , then clearly tr ( ρ a ) = 1 . If (2) holds, then we have equality for Jensen’s inequality. Hence, tr ( P i a ) = tr ( P j a ) for all i , j = 1 , 2 , , m . Since
tr ( a ) = i tr ( P i a ) = m tr ( P i a )
we conclude that
S a ( ρ ) = tr ( P 1 a ) i λ i ln ( λ i ) = tr ( a ) m S ( ρ )
Finally, suppose tr ( P i a ) = tr ( P j a ) for all i , j = 1 , 2 , , m . Then
tr ( a ) = i tr ( P i a ) = m tr ( P 1 a )
We conclude that
S a ( ρ ) = tr ( P 1 a ) i λ i ln j λ j tr ( P 1 a ) tr ( a ) = tr ( P 1 a ) i λ i ln j λ j 1 m = tr ( P 1 a ) i λ i ln 1 m = tr ( a ) m ln ( m ) e
For a , b E ( H ) we write a b if a + b E ( H ) .
Theorem 2. 
If a b , then S a + b ( ρ ) S a ( ρ ) + S b ( ρ ) for all ρ S ( H ) . Moreover, S a + b ( ρ ) = S a ( ρ ) + S b ( ρ ) if and only if tr ( b ) tr ( ρ a ) = tr ( a ) tr ( ρ b ) .
Proof. 
Since x ln x is concave, letting λ 1 = tr ( a ) / tr ( a ) + tr ( b ) , λ 2 = tr ( b ) / tr ( a ) + tr ( b ) , x 1 = tr ( ρ a ) / tr ( a ) , x 2 = tr ( ρ b ) / tr ( b ) we obtain
S a + b ( ρ ) = tr ρ ( a + b ) ln tr ρ ( a + b ) tr ( a + b ) = tr ( a + b ) tr ( ρ a ) + tr ( ρ b ) tr ( a + b ) ln tr ( ρ a ) + tr ( ρ b ) tr ( a + b ) = tr ( a + b ) ( λ 1 x 1 + λ 2 x 2 ) ln ( λ 1 x 1 + λ 2 x 2 ) tr ( a + b ) λ 1 x 1 ln ( x 1 ) + λ 2 x 2 ln ( x 2 ) = tr ( ρ a ) ln tr ( ρ a ) tr ( a ) tr ( ρ b ) ln tr ( ρ b ) tr ( b ) = S a ( ρ ) + S b ( ρ )
We have equality if and only if x 1 = x 2 which is equivalent to tr ( b ) tr ( ρ a ) = tr ( a ) tr ( ρ b ) . □
Corollary 1. 
S a ( ρ ) + S a ( ρ ) ln ( n ) and S a ( ρ ) + S a ( ρ ) = ln ( n ) if and only if tr ( a ) = n tr ( ρ a ) .
Proof. 
Applying Theorem 2 we obtain
S a ( ρ ) + S a ( ρ ) S a + a ( ρ ) = S I ( ρ ) = ln ( n )
We have equality tr ( a ) tr ( ρ a ) = tr ( a ) tr ( ρ a ) n tr ( a ) tr ( ρ a ) = tr ( a ) 1 tr ( ρ a ) tr ( a ) = n tr ( ρ a ) e
Corollary 2. 
S a + b ( ρ ) S a ( ρ ) , S b ( ρ ) .
Corollary 3. 
If a b , then S a ( ρ ) S b ( ρ ) for all ρ S ( H ) .
Proof. 
If a b , then b = a + c for c = b a E ( H ) . Hence,
S b ( ρ ) = S a + c ( ρ ) S a ( ρ ) + S c ( ρ ) S a ( ρ )
for every ρ S ( H ) . □
Applying Theorem 2 and induction we obtain the following.
Corollary 4. 
If a 1 + a 2 + + a m I , then S a i ( ρ ) S a i ( ρ ) . Moreover, we have equality if and only if tr ( a j ) tr ( ρ a i ) = tr ( a i ) tr ( ρ a j ) for all i , j = 1 , 2 , , m .
Notice that E ( H ) is a convex set in the sense that if a i E ( H ) and 0 λ i 1 with i = 1 m λ i = 1 , then λ i a i E ( H ) .
Corollary 5. 
(i) If 0 < λ 1 and a E ( H ) , then S λ a ( ρ ) = λ S a ( ρ ) for all ρ S ( H ) . (ii) If 0 < λ i 1 , a i E ( H ) , with i = 1 m λ i = 1 , then S λ i a i ( ρ ) λ i S a i ( ρ ) for all ρ S ( H ) . We have equality if and only if tr ( a j ) tr ( ρ a i ) = tr ( a i ) tr ( ρ a j ) for all i , j = 1 , 2 , , m .
Proof. 
(i) We have that
S λ a ( ρ ) = tr ( ρ λ a ) ln tr ( ρ λ a ) tr ( λ a ) = tr ( ρ a ) ln λ tr ( ρ a ) λ tr ( a ) = λ S a ( ρ )
(ii) Applying (i) and Corollary 4 gives
S λ i a i ( ρ ) S λ i a i ( ρ ) = λ i S a i ( ρ )
together with the equality condition. □
As with E ( H ) , S ( H ) is a convex set and we have the following.
Theorem 3. 
If 0 < λ i 1   ρ i S ( H ) , i = 1 , 2 , , m , with i = 1 m λ i = 1 , then
S a λ i ρ i λ i S a ( ρ i )
for all a E ( H ) . We have equality if and only if tr ( ρ i a ) = tr ( ρ j a ) for all i , j = 1 , 2 , , m .
Proof. 
Letting x i = tr ( ρ i a ) / tr ( a ) , since x ln x is concave, we obtain
S a λ i ρ i = tr λ i ρ i a ln tr λ i ρ i a tr ( a ) = tr ( a ) λ i tr ( ρ i a ) tr ( a ) ln λ i tr ( ρ i a ) tr ( a ) = tr ( a ) λ i x i ln λ j x j tr ( a ) λ i x i ln ( x i ) = tr ( a ) λ i tr ( ρ i a ) tr ( a ) ln tr ( ρ i a ) tr ( a ) = λ i tr ( ρ i a ) ln tr ( ρ i a ) tr ( a ) = λ i S a ( ρ i )
We have equality if and only if x i = x j which is equivalent to tr ( ρ i a ) = tr ( ρ j a ) for all i , j = 1 , 2 , , m . □
Theorem 4. 
If a i E ( H i ) , ρ i S ( H i ) , i = 1 , 2 , then
S a 1 a 2 ( ρ 1 ρ 2 ) = tr ( ρ 2 a 2 ) S a 1 ( ρ 1 ) + tr ( ρ 1 a 1 ) S a 2 ( ρ 2 ) S a 1 ( ρ 1 ) + S a 2 ( ρ 2 ) .
Proof. 
This follows from
S a 1 a 2 ( ρ 1 ρ 2 ) = tr ( ρ 1 ρ 2 a 1 a 2 ) ln tr ( ρ 1 ρ 2 a 1 a 2 ) tr ( a 1 a 2 ) = tr ( ρ 1 a 1 ) tr ( ρ 2 a 2 ) ln tr ( ρ 1 a 1 ) tr ( ρ 2 a 2 ) tr ( a 1 ) tr ( a 2 ) = tr ( ρ 1 a 1 ) tr ( ρ 2 a 2 ) ln tr ( ρ 1 a 1 ) tr ( a 1 ) + ln tr ( ρ 2 a 2 ) tr ( a 2 ) = tr ( ρ 2 a 2 ) S a 1 ( ρ 1 ) + tr ( ρ 1 a 1 ) S a 2 ( ρ 2 ) S a 1 ( ρ 1 ) + S a 2 ( ρ 2 ) e
An operation on H is a completely positive linear map I : L ( H ) L ( H ) such that tr I ( A ) tr ( A ) for all A L ( H ) [2,3,6,9,10]. If I is an operation we define the dual of I to be the unique linear map I * : L ( H ) L ( H ) that satisfies tr I ( A ) B = tr A I * ( B ) for all A , B L ( H ) . If a E ( H ) then for any ρ S ( H ) we have 0 tr I ( ρ ) a 1 and it follows that I * ( a ) E ( H ) . We say that I measures a E ( H ) if tr I ( ρ ) = tr ( ρ a ) for all ρ S ( H ) . If I measures a we define the I -sequential product a b = I * ( b ) for all b E ( H ) [12,13]. Although a b depends on the operation used to measure a we do not include I in the notation for simplicity. We interpret a b as the effect that results from first measuring a using I and then measuring b.
Theorem 5. 
(i) If b c , then a ( b + c ) = a b + a c . (ii) a I = a . (iii) a b a for all b E ( H ) . (iv) S a b ( ρ ) S a ( ρ ) for all ρ S ( H ) .
Proof. 
(i) For every ρ S ( H ) we obtain
tr ρ a ( b + c ) = tr ρ I * ( b + c ) = tr I ( ρ ) ( b + c ) = tr I ( ρ ) b + tr I ( ρ ) c = tr ρ I * ( b ) + tr ρ I * ( c ) = tr ρ a b + tr ρ a c = tr ρ ( a b + a c )
Hence, a ( b + c ) = a b + a c . (ii) For all ρ S ( H ) we have
tr ( ρ a I ) = tr ρ I * ( I ) = tr I ( ρ ) I = tr I ( ρ ) = tr ( ρ a )
Hence, a I = a . (iii) By (i) and (ii) we have
a b + a b = a ( b + b ) = a I = a
It follows that a b a . (iv) Since a b a , by Corollary 3 we obtain S a b ( ρ ) S a ( ρ ) for all ρ S ( H ) . □
Theorem 5(iv) shows that a b gives more information than a about ρ . We can continue this process and make more measurements as follows. If I i measures a i , i = 1 , 2 , , m , we have
a 1 a 2 a m = ( I 1 ) * ( I 2 ) * ( I m 1 ) * ( a m )
and it follows from Theorem 5(iv) that
S a 1 a 2 a m ( ρ ) S a 1 a 2 a m 1 ( ρ )
Notice that the probability of occurrence of the effect a 1 a 2 · a m in state ρ is
tr ( ρ a 1 a 2 a m ) = tr ρ ( I 1 ) * ( I 2 ) * ( I m 1 ) * ( a m ) = tr I m 1 I m 2 I 1 ( ρ ) a m
Thus, we begin with the input state ρ , then measure a 1 using I 1 , then measure a 2 using I 2 , and finally measuring a m .
Example 1. 
1 For a E ( H ) we define the Lüders operation L a ( A ) = a 1 / 2 A a 1 / 2 [14]. Since
tr A ( L a ) * ( B ) = L a ( A ) B = tr a 1 / 2 A a 1 / 2 B = tr ( A a 1 / 2 B a 1 / 2 )
we have ( L a ) * ( B ) = a 1 / 2 B a 1 / 2 so ( L a ) * = L a . We have that L a measures a because
tr L a ( ρ ) = tr ( a 1 / 2 ρ a 1 / 2 ) = tr ( ρ a )
for every ρ S ( H ) . We conclude that the L a sequential product is
a b = ( L a ) * ( b ) = a 1 / 2 b a 1 / 2
We also have that
S a b ( ρ ) = tr ( ρ a b ) ln tr ( ρ a b ) tr ( a b ) = tr ( ρ a 1 / 2 b a 1 / 2 ) ln tr ( ρ a 1 / 2 b a 1 / 2 ) tr ( a 1 / 2 b a 1 / 2 ) = tr ( a ρ b ) ln tr ( a ρ b ) tr ( a b ) .
Example 2. 
2 For a E ( H ) , α S ( H ) we define the Holevo operation [15] H ( a , α ) ( A ) = tr ( A a ) α . Since
tr A H ( a , α ) * ( B ) = tr H ( a , α ) ( A ) B = tr tr ( A a ) α B = tr ( A a ) tr ( α B ) = tr A tr ( α B ) a
we have H ( a , α ) * ( B ) = tr ( α B ) a . We have H ( a , α ) measures a because
tr H ( a , α ) ( ρ ) = tr ( ρ a )
for every ρ S ( H ) . We conclude that the H ( a , α ) sequential product is
a b = H ( a , α ) * ( b ) = tr ( α b ) a
We also have that
S a b ( ρ ) = tr ( α b ) tr ( ρ a ) ln tr ( ρ a ) tr ( a ) = tr ( α b ) S a ( ρ )
If a i E ( H ) , i = 1 , 2 , , m , and we measure a i with operations H ( a i , α i ) , i = 1 , 2 , , m 1 , then
a 1 a 2 a m = a 1 ( a 2 a m ) = tr ( α 1 a 2 a m ) a 1 = tr α 1 tr ( α 2 a 3 a m ) a 2 a 1 = tr ( α 2 a 3 a m ) tr ( α 1 a 2 ) a 1 = tr ( α m 1 a m ) tr ( α m 2 a m 1 ) tr ( α 1 a 2 ) a 1
Moreover, it follows from Corollary 5(i) that
S a 1 a m ( ρ ) = tr ( α m 1 a m ) tr ( α m 2 a m 1 ) tr ( α 1 a 2 ) S a 1 ( ρ )
for all ρ S ( H ) .

3. Entropy of Observables and Instruments

We now extend our work on entropy of effects to entropy of observables and instruments. An observable on H is a finite collection of effects A = A x : x Ω A , A x 0 , where x Ω A A x = I [2,3,9]. The set Ω A is called the outcome space of A. The effect A x occurs when a measurement of A results in the outcome x. If ρ S ( H ) , then tr ( ρ A x ) is the probability that outcome x results from a measurement of A when the system is in state ρ . If Δ Ω A , then
Φ ρ A ( Δ ) = x Δ tr ( ρ A x )
is the probability that A has an outcome in Δ when the system is in state ρ and Φ ρ A is called the distribution of A. We also use the notation A ( Δ ) = A x : x Δ so Φ ρ A ( Δ ) = tr ρ A ( Δ ) for all Δ Ω A . In this way, an observable is a positive operation-valued measure (POVM). We say that an observable A is sharp if A x is a projection on H for all x Ω A and A is atomic if A x is a one-dimensional projection for all x Ω A .
If A is an observable and ρ S ( H ) the ρ -entropy of A is S A ( ρ ) = S A x ( ρ ) where the sum is over the x Ω A such that tr ( ρ A x ) 0 . Then S A ( ρ ) is a measure of the information that a measurement of A gives about ρ . The smaller S A ( ρ ) is, the more information given. Notice that if A is sharp, then tr ( A x ) = dim ( A x ) and if A is atomic, then
S A ( ρ ) = x tr ( ρ A x ) ln tr ( ρ A x )
There are two interesting extremes for S A ( ρ ) . If ρ has spectral decomposition ρ = i = 1 m λ i P i and A is the observable A = P i : i = 1 , 2 , , m , then
S A ( ρ ) = i tr ( ρ P i ) ln tr ( ρ P i ) = λ i ln ( λ i ) = S ( ρ )
As we shall see, this gives the minimum entropy (most information). For the completely random state I / n and any observable A we obtain
S A ( I / n ) = x tr ( A x ) n ln tr ( A x ) / n tr ( A x ) = 1 n x tr ( A x ) ln 1 n = ln ( n ) n x tr ( A x ) = ln ( n ) n tr ( I ) = ln ( n )
We shall also see that this gives the maximum entropy (least information).
Theorem 6. 
For any observable A and ρ S ( H ) we have
S ( ρ ) S A ( ρ ) ln ( n )
Proof. 
Applying Theorem 1 we obtain
S A ( ρ ) = x Ω A S A x ( ρ ) x Ω A i tr ( P i A x ) λ i ln ( λ i ) = i tr P i x Ω A A x λ i ln ( λ i ) = i tr ( P i ) λ i ln ( λ i ) = i λ i ln ( λ i ) = S ( ρ )
Since ln ( x ) is concave and tr ( ρ A x ) > 0 , x tr ( ρ A x ) = 1 we have by Jensen’s inequality
S A ( ρ ) = x tr ( ρ A x ) ln tr ( A x ) tr ( ρ A x ) ln x tr ( ρ A x ) tr ( A x ) tr ( ρ A x ) = ln x tr ( A x ) = ln tr ( I ) = ln ( n ) e
An observable A is trivial if A x = λ x I , 0 < λ x 1 , λ x = 1 .
Corollary 6. 
(i) S A ( ρ ) = ln ( n ) if and only if tr ( A x ) tr ( ρ A y ) = tr ( A y ) tr ( ρ A x ) for all x , y Ω A . (ii) A is trivial if and only if S A ( ρ ) = ln ( n ) for all ρ S ( H ) . (iii) ρ = I / n if and only if S A ( ρ ) = ln ( n ) for all observables A. (iv) S ( ρ ) = ln ( n ) if and only if ρ = I / n .
Proof. 
(i) This follows from the proof of Theorem 6 because this is the condition for equality in Jensen’s inequality. (ii) Suppose A is trivial with A x = λ x I . Then for every ρ S ( H ) we have
S A ( ρ ) = x tr ( ρ λ x I ) ln tr ( ρ λ x I ) tr ( λ x I ) = x λ x ln λ x n λ x = ln ( n ) x λ x = ln ( n )
Conversely, suppose S A ( ρ ) = ln ( n ) for all ρ S ( H ) . By (i) we have that tr ( A x ) tr ( ρ A y ) = tr ( A y ) tr ( ρ A x ) for all ρ S ( H ) . It follows that
ϕ , A y ϕ = ϕ , A x ϕ tr ( A y ) tr ( A x )
for every ϕ H , ϕ 0 . Hence, A y = ( tr ( A y ) ) / ( tr ( A x ) ) A x so that
I = y A y = y tr ( A y ) tr ( A x ) A x = n tr ( A x ) A x
We conclude that A x = ( tr ( A x ) ) / n I for all x Ω A so A is trivial. (iii) If ρ = I / n , we have shown in (3) that S A ( ρ ) = ln ( n ) for all observables A. Conversely, if S A ( ρ ) = ln ( n ) for every observable A, as before, we have tr ( A x ) tr ( ρ A y ) = tr ( A y ) tr ( ρ A x ) for every observable A. Letting A x be the observable given by the spectral decomposition ρ = λ x A x where A is atomic, we conclude that λ x = λ y for all x , y Ω A . Hence, λ x = 1 / n and ρ = ( 1 / n ) A x = I / n . (iv)If S ( ρ ) = ln ( n ) , by Theorem 6, S A ( ρ ) = ln ( n ) for every observable A. Applying (iii), ρ = I / n . Conversely, if ρ = I / n , then
S ( ρ ) = i = 1 n 1 n ln 1 n = ln 1 n = ln ( n ) e
We now extend Corollary 5(ii) and Theorem 3 to observables. If A i = A x i : x Ω are observables with the same outcome space Ω , i = 1 , 2 , , m , and 0 < λ i 1 with i = 1 m λ i = 1 , then the observable A = A x : x Ω where A x = i = 1 m λ i A x i is called a convex combination of the A i [12].
Theorem 7. 
(i) If A is a convex combination of A i , i = 1 , 2 , , m , then for all ρ S ( H ) we have
S A ( ρ ) i = 1 m λ i S A i ( ρ )
(ii) If 0 < λ i 1 with i = 1 m λ i = 1 , ρ i S ( H ) , i = 1 , 2 , , m , and A is an observable, then
S A i λ i ρ i i λ i S A ( ρ i )
Proof. 
(i) Applying Corollary 5(ii) gives
S A ( ρ ) = x S A x ( ρ ) = x S λ i A x i ( ρ ) x i λ i S A x i ( ρ ) = i λ i x S A x i ( ρ ) = i λ i S A i ( ρ )
(ii) Applying Theorem 3 gives
S A i λ i ρ i = x S A x i λ i ρ i x i λ i S A x ( ρ i ) = i λ i x S A x ( ρ i ) = i λ i S A ( ρ i ) e
We say that an observable B is a coarse-graining of an observable A if there exists a surjection f : Ω A Ω B such that
B y = A x : f ( x ) = y = A f 1 ( y )
for every y Ω B [2,12,16].
Theorem 8. 
If B is a coarse-graining of A, then S B ( ρ ) S A ( ρ ) for all ρ S ( H ) .
Proof. 
Let B y = A f 1 ( y ) for all y Ω B and let p y = tr ( ρ B y ) , p x = tr ( ρ A x ) for all y Ω b , x Ω A . Then
p y = tr ρ f ( x ) = y A x = f ( x ) = y tr ( ρ A x ) = f ( x ) = y p x
Let V y = tr ( B y ) , V x = tr ( A x ) so that
V y = tr f ( x ) = y A x = f ( x ) = y tr ( A x ) = f ( x ) = y V x
Since x ln ( x ) is concave, we conclude that
S B ( ρ ) = y p y ln p y V y = y f ( x ) = y p x ln f ( x ) = y p x V y = y V y f ( x ) = y p x V x V x V y ln f ( x ) = y p x V x V x V y y V y f ( x ) = y V x V y p x V x ln p x V x = y f ( x ) = y p x ln p x V x = x p x ln p x V x = S A ( ρ ) e
The equality condition for Jensen’s inequality gives the following.
Corollary 7. 
An observable A possesses a coarse-graining B y = A f 1 ( y ) with S B ( ρ ) = S A ( ρ ) for all ρ S ( H ) if and only if for every x 1 , x 2 Ω A with f ( x 1 ) = f ( x 2 ) we have
tr ( A x 2 ) tr ( ρ A x 1 ) = tr ( A x 1 ) tr ( ρ A x 2 )
A trace preserving operation is called a channel. An instrument on H is a finite collection of operations I = I x : x Ω such that x Ω I I x is a channel [2,3,9]. We call Ω I the outcome space for I . If I is an instrument, there exists a unique observable A such that tr ( ρ A x ) = tr I x ( ρ ) for all x Ω A = Ω I , ρ S ( H ) and we say that I  measures A. Although an instrument measures a unique observable, an observable is measured by many instruments For example, if A is an observable, the corresponding Łüders instrument [14] is defined by
L x A ( B ) = A x 1 / 2 B A x 1 / 2
for all B L ( H ) . Then L A is an instrument because
tr x L x A ( B ) = x tr L x A ( B ) = x tr ( A x 1 / 2 B A x 1 / 2 ) = x tr ( A x B ) = tr x A x B = tr ( I B ) = tr ( B )
for all B L ( H ) . Moreover, L A measures A because
tr L x A ( ρ ) = tr ( A x 1 / 2 ρ A x 1 / 2 ) = tr ( ρ A x )
for all ρ S ( H ) . Of course, this is related to Example 1. Corresponding to Example 2, we have a Holevo instrument  H ( A , α ) where α x S ( H ) , x Ω A and
H x ( A , α ) ( B ) = tr ( B A x ) α x
for all B L ( H ) [15]. To show that H ( A , α ) is an instrument we have
tr x H x ( A , α ) ( B ) = x tr H x ( A , α ) ( B ) = x tr tr ( B A x ) α x = x tr ( B A x ) = tr B x A x = tr ( B )
Moreover, H ( A , α ) measures A because
tr H x A , α ( ρ ) = tr ( ρ A x ) α x = tr ( ρ A x ) tr ( α x ) = tr ( ρ A x )
Let A , B be observables and let I be an instrument that measures A. We define the I -sequential product A B [12,13] by Ω A B = Ω A × Ω B and
A B ( x , y ) = I x * ( B y ) = A x B y
Defining f : Ω A B Ω A by f ( x , y ) = x ,we obtain
A B f 1 ( x ) = f ( x , y ) = x A x B y = y Ω B I x * ( B y ) = I α * ( I ) = A x
We conclude that A is a coarse-graining of A B . Applying Theorem 8 we obtain the following.
Corollary 8. 
If A , B are observables, the S A B ( ρ ) S A ( ρ ) for all ρ S ( H ) . Equality S A B ( ρ ) = S A ( ρ ) holds if and only if for every x Ω A , y 1 , y 2 Ω B we have
tr ( ρ A x B y 1 ) tr ( A x B y 1 ) ln tr ( ρ A x B y 1 ) tr ( A x B y 1 ) = tr ( ρ A x B y 2 ) tr ( A x B y 2 ) ln tr ( ρ A x B y 2 ) tr ( A x B y 2 )
Extending this work to more than two observables, let I 1 , I 2 , , I m 1 be instruments that measure the observables A 1 , A 2 , , A m 1 , respectively. If A m is another observable, we have that
( A 1 A 2 A m ) ( x 1 , x 2 , , x m ) = ( I x 1 1 ) * ( I x 2 2 ) * ( I x m 1 m 1 ) * ( A x m m )
The next result follows from Corollary 8.
Corollary 9. 
If A 1 , A 2 , , A m are observables, then
S A 1 A 2 A m ( ρ ) S A 1 A 2 A m 1 ( ρ )
for all ρ S ( H ) .
If I is an instrument, let A be the unique observable that I measures so tr I x ( ρ ) = tr ( ρ A x ) for all x Ω I and ρ S ( H ) . We define the ρ -entropy of I as S I ( ρ ) = S A ( ρ ) . Since A x = I x * ( I ) we have
tr ( A x ) = tr I x * ( I ) = tr I x ( I )
Hence,
S I ( ρ ) = S A ( ρ ) = x tr ( ρ A x ) ln tr ( ρ A x ) tr ( A x ) = x tr I x ( ρ ) ln tr I x ( ρ ) tr I x ( I )
Now let I 1 , I 2 , , I m be instruments and let A 1 , A 2 , , A m be the unique observables they measure, respectively. Denoting the composition of two instruments I , J by I J we have
tr I x m m I x m 1 m 1 I x 1 1 ( ρ ) = tr ρ ( I x 1 1 ) * ( I x 2 1 ) * ( I x m m ) * ( I ) = tr ( ρ A x 1 1 A x 2 2 A x m m )
Hence, the observable measured by I m I m 1 I 1 is A 1 A 2 A m . It follows that
S I m I m 1 I 1 ( ρ ) = S A 1 A 2 A m ( ρ )
We conclude that Theorems 1, 2 and 3 [1] follow from our results. Moreover, our proofs are simpler since they come from the more basic concept of ρ -entropy for effects.
Let A , B be observables on H and let I be an instrument that measures A. The corresponding sequential product becomes
( A B ) ( x , y ) = I x * ( B y ) = A x B y
The ρ -entropy of A B has the form
S A B ( ρ ) = x , y tr ρ ( A B ) ( x , y ) ln tr ρ ( A B ) ( x , y ) tr ( A B ) ( x , y ) = x , y tr ρ I x * ( B y ) ln tr ρ I x * ( B y ) tr I x * ( B y ) = x , y tr I x ( ρ ) B y ln I x ( ρ ) B y tr I x ( I ) B y
If L A is the Lüders instrument I x A ( ρ ) = A x 1 / 2 ρ A x 1 / 2 we have ( A B ) ( x , y ) = A x 1 / 2 B y A x 1 / 2 and
S A B ( ρ ) = x , y tr ( A x 1 / 2 ρ A x 1 / 2 B y ) ln tr ( A x 1 / 2 ρ A x 1 / 2 B y ) tr ( A x B y )
If H ( A , α ) is the Holevo instrument H x ( A , α ) ( ρ ) = tr ( ρ A x ) α x , α x S ( H ) we obtain
S A B ( ρ ) = x , y tr ( ρ A x ) tr ( α x B y ) ln tr ( ρ A x ) tr ( α x B y ) tr ( A x ) tr ( α x B y ) = x , y tr ( ρ A x ) tr ( α x B y ) ln tr ( ρ A x ) tr ( A x ) = x tr ( ρ A x ) ln tr ( ρ A x ) tr ( A x ) = S A ( ρ )
This also follows from Corollary 8 because
tr ( ρ A x B y ) tr ( A x B y ) = tr ( α x B y ) tr ( ρ A x ) tr ( α x B y ) tr ( A x ) = ( ρ A x ) tr ( A x )
If A is an observable on H and B is an observable on K we form the tensor product observable A B on H K given by ( A B ) ( x , y ) = A x B y where Ω A B = Ω A × Ω B [12].
Lemma 1. 
If ρ 1 S ( H ) , ρ 2 S ( K ) , then
S A B ( ρ 1 ρ 2 ) = S A ( ρ 1 ) + S B ( ρ 2 )
Proof. 
From the definition of A B we obtain
S A B ( ρ 1 ρ 2 ) = x , y tr ( ρ 1 ρ 2 A x B y ) ln tr ( ρ 1 ρ 2 A x B y ) tr ( A x B y ) = x , y tr ( ρ 1 A x ) tr ( ρ 2 B y ) ln tr ( ρ 1 A x ) tr ( ρ 2 B y ) tr ( A x ) tr ( B y ) = x , y tr ( ρ 1 A x ) tr ( ρ 2 B y ) ln tr ( ρ 1 A x ) tr ( A x ) x , y tr ( ρ 1 A x ) tr ( ρ 2 B y ) ln tr ( ρ 2 B y ) tr ( B y ) = x tr ( ρ 1 A x ) ln tr ( ρ 1 A x ) tr ( A x ) y tr ( ρ 2 B y ) ln tr ( ρ 2 B y ) tr ( B y ) = S A ( ρ 1 ) + S B ( ρ 2 ) e
We conclude that A gives more information about ρ 1 than A and B give about ρ 1 ρ 2 and similarly for B.
A measurement model [2,3,9] is a 5-tuple M = ( H , K , ν , σ , P ) where H is the system Hilbert space, K is the probe Hilbert space, ν is the interaction channel, σ S ( K ) is the initial probe state and P is the probe observable on K. We interpret M as an apparatus that is employed to measure an instrument and hence an observable. In fact, M measures the unique instrument I on H given by
I x ( ρ ) = tr K ν ( ρ σ ) ( I P x )
In this way, a state ρ S ( H ) is input into the apparatus and combined with the initial state σ of the probe system. The channel ν interacts the two states and a measurement of the probe P is performed resulting in outcome x. The outcome state is reduced to H by applying the partial trace over K. Now I measures an unique observable A on H that satisfies
tr ( ρ A x ) = tr I x ( ρ ) = tr ν ( ρ σ ) ( I P x )
The ρ -entropy of I becomes
S I ( ρ ) = S A ( ρ ) = x tr ( ρ A x ) ln tr ( ρ A x ) tr ( A x )
where tr ( ρ A x ) is given by (4). Of course, S I ( ρ ) = S A ( ρ ) gives the amount of information that a measurement by M provides about ρ . A closely related concept is the observable I P and S I P ν ( ρ σ ) also provides the amount of information that a measurement M provides about ρ . It follows from (4) that the distribution of A in the state ρ equals the distribution of I P in the state ν ( ρ σ ) . We now compare S A ( ρ ) and S I P ν ( ρ σ ) . Applying (4) gives
S I P ν ( ρ σ ) = x tr ν ( ρ σ ) ( I P x ) ln tr ν ( ρ σ ) ( I P x ) tr ( I P x ) = x tr ( ρ A x ) ln tr ( ρ A x ) n tr ( P x ) = x tr ( ρ A x ) ln tr ( A x ) n tr ( P x ) tr ( ρ A x ) tr ( A x ) = x tr ( ρ A x ) ln tr ( ρ A x ) tr ( A x ) tr ( ρ A x ) ln tr ( A x ) n tr ( P x ) = S A ( ρ ) x tr ( ρ A x ) ln tr ( A x ) n tr ( P x )
It follows that S A ( ρ ) S I P ν ( ρ σ ) if and only if
x tr ( ρ A x ) ln tr ( A x ) n tr ( P x ) 0
Now (5) may or may not hold depending on A, ρ and P. In many cases, P is atomic [2,9] and then
ln tr ( A x ) n tr ( P x ) = ln tr ( A x ) n < 0
so S A ( ρ ) S I P ν ( ρ σ ) for all ρ S ( H ) . Also, (5) holds if P is sharp.

Funding

This research received no external funding.

Conflicts of Interest

The author declare no conflict of interest.

References

  1. Šafránek, D.; Thingna, J. Quantifying information extraction using generalized quantum measurements. arXiv 2022, arXiv:2007.07246. [Google Scholar]
  2. Heinosaari, T.; Ziman, M. The Mathematical Language of Quantum Theory; Cambridge University Press: Cambridge, UK, 2012. [Google Scholar]
  3. Nielson, M.; Chuang, I. Quantum Computation and Quantum Information; Cambridge University Press: Cambridge, UK, 2000. [Google Scholar]
  4. Ohya, M.; Petz, D. Quantum Entropy and It’s Uses; Springer: Berlin/Heidelberg, Germany, 2004. [Google Scholar]
  5. Šafránek, D.; Aguirre, A.; Schindler, J.; Deutsch, J. A brief introduction to observational entropy. Found. Phys. 2021, 51, 101. [Google Scholar] [CrossRef]
  6. Lindblad, G. Completely positive maps and entropy inequalities. Comm. Math. Phys. 1975, 40, 147–151. [Google Scholar] [CrossRef]
  7. von Neumann, J. Mathematical Foundations of Quantum Mechanics; Princeton University Press: Princeton, NJ, USA, 1955. [Google Scholar]
  8. Wehrl, A. General properties of entropy. Rev. Mod. Phys. 1978, 50, 221. [Google Scholar] [CrossRef]
  9. Busch, P.; Lahti, P.; Mittlestaedt, P. The Quantum Theory of Measurement; Springer: Berlin/Heidelberg, Germany, 1996. [Google Scholar]
  10. Kraus, K. States, Effects and Operations; Springer: Berlin/Heidelberg, Germany, 1983. [Google Scholar]
  11. Šafránek, D.; Deutsch, J.; Aguirre, A. Quantum coarse-grained entropy and thermodynamics. Phys. Rev. A 2019, 99, 010101. [Google Scholar] [CrossRef] [Green Version]
  12. Gudder, S. Combinations of quantum observables and instruments. J. Phys. A Math. Theor. 2021, 54, 364002. [Google Scholar] [CrossRef]
  13. Gudder, S. Sequential products of Quantum measurements. arXiv 2021, arXiv:2108.07925. [Google Scholar] [CrossRef]
  14. Lüders, G. Über due Zustandsänderung durch den Messprozess. Ann. Physik 1951, 6, 322–328. [Google Scholar]
  15. Holevo, A. Probabilistic and Statistical Aspects of Quantum Theory; North-Holland: Amsterdam, The Netherlands, 1982. [Google Scholar]
  16. Gudder, S. Coarse-graining of observables. Quant. Rep. 2022, 4, 401–417. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Gudder, S. Entropy of Quantum Measurements. Entropy 2022, 24, 1686. https://doi.org/10.3390/e24111686

AMA Style

Gudder S. Entropy of Quantum Measurements. Entropy. 2022; 24(11):1686. https://doi.org/10.3390/e24111686

Chicago/Turabian Style

Gudder, Stanley. 2022. "Entropy of Quantum Measurements" Entropy 24, no. 11: 1686. https://doi.org/10.3390/e24111686

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop