
A Novel Perspective of the Kalman Filter from the Rényi Entropy

1 Global Navigation Satellite System Research Center, Wuhan University, Wuhan 430079, China
2 Artificial Intelligence Institute, Wuhan University, Wuhan 430079, China
* Author to whom correspondence should be addressed.
Entropy 2020, 22(9), 982; https://doi.org/10.3390/e22090982
Submission received: 21 July 2020 / Revised: 31 August 2020 / Accepted: 31 August 2020 / Published: 3 September 2020
(This article belongs to the Special Issue Data Science: Measuring Uncertainties)

Abstract

Rényi entropy, as a generalization of the Shannon entropy, allows for different averaging of probabilities through a control parameter α. This paper gives a new perspective on the Kalman filter from the Rényi entropy. First, the Rényi entropy is employed to measure the uncertainty of the multivariate Gaussian probability density function. Then, we calculate the temporal derivative of the Rényi entropy of the Kalman filter's mean square error matrix, which is minimized to obtain the Kalman filter's gain. Moreover, the continuous Kalman filter approaches a steady state when the temporal derivative of the Rényi entropy is equal to zero, which means that the Rényi entropy remains stable. As the temporal derivative of the Rényi entropy is independent of the parameter α and coincides with the temporal derivative of the Shannon entropy, the same conclusion holds for the Shannon entropy. Finally, an experiment of falling body tracking by radar using an unscented Kalman filter (UKF) in noisy conditions and a loosely coupled navigation experiment are performed to demonstrate the effectiveness of the conclusion.

1. Introduction

In the late 1940s, Shannon introduced a logarithmic measure of information [1] and a theory that included information entropy (the literature shows that it is related to the Boltzmann entropy of statistical mechanics). The more stochastic and unpredictable a variable is, the larger its entropy. As a measure of information, entropy has been used in various fields, such as information theory, signal processing, and information-theoretic learning [2,3]. As a generalization of the Shannon entropy, the Rényi entropy, named after Alfréd Rényi [4], allows for different averaging of probabilities through a control parameter α and is usually used to quantify the diversity, uncertainty, or randomness of random variables. Liang [5] presented evolutionary entropy equations and uncertainty estimation for the Shannon entropy and the relative entropy, also called the Kullback–Leibler divergence [6], within the framework of dynamical systems. Moreover, by a suitable choice of the control parameter α, the Rényi entropy has better properties than the Shannon entropy in most cases.
The Kalman filter [7] and its variants have been widely used in navigation, control, tracking, etc. Many works focus on combining different entropy and entropy-like quantities with the original Kalman filter to improve its performance. When the state-space equation is nonlinear, the Rényi entropy can be used to measure the nonlinearity [8,9]. Shannon entropy was used to estimate the weight of each particle from the weights of different measurement models in the fusion algorithm of [10]. The quadratic Rényi entropy [11] of the innovation has been used as a minimum entropy criterion under nonlinear and non-Gaussian conditions [12] in the unscented Kalman filter (UKF) [13] and for finite mixtures [14]. A generalized density evolution equation [15] and polynomial-based nonlinear compensation [16] were used to improve minimum entropy filtering [17]. Relative entropy has been used to measure the similarity between probability density functions during the recursive processes of a nonlinear filter [18,19]. For a nonlinear measurement equation with additive Gaussian noise, the relative entropy can be used to measure the nonlinearity of the measurement [20], as well as the approximation error of the i-th measurement element in the partitioned update Kalman filter [21]. When the state and measurement variables do not follow strictly Gaussian distributions, as in the seamless indoor/outdoor multi-source fusion positioning problem [22], the estimation error can be measured by the relative entropy. Relative entropy can also be used to choose the number of particles in the unscented particle filter for mobile robot self-localization [23] and to choose the sample window size in the cubature Kalman filter (CKF) [24] for attitude estimation [25]. Moreover, it has been verified that the original Kalman filter can be derived by maximizing the relative entropy [26]. Meanwhile, the robust maximum correntropy criterion has been adopted as the optimality criterion to derive the maximum correntropy Kalman filter [27,28]. However, until now there has been no work on the direct connection between the Rényi entropy and Kalman filter theory.
In this paper, we propose a new perspective on the Kalman filter from the Rényi entropy for the first time, bridging the gap between the Kalman filter and the Rényi entropy. We calculate the temporal derivative of the Rényi entropy of the Kalman filter's mean square error matrix; minimizing it yields the Kalman filter gain and provides the optimal recursive solution mathematically. Moreover, from the physical point of view, the continuous Kalman filter approaches a steady state when the temporal derivative of the Rényi entropy is equal to zero, which also means that the Rényi entropy remains stable. A numerical experiment of falling body tracking by radar in noisy conditions using the UKF and a practical experiment of loosely coupled integration are provided to demonstrate the effectiveness of the above conclusion.
The structure of this paper is as follows. In Section 2, the definitions and properties of the Shannon entropy and the Rényi entropy are presented, the Kalman filter is derived from the perspective of minimizing the temporal derivative of the Rényi entropy, and the connection between the Rényi entropy and the algebraic Riccati equation is explained. In Section 3, experimental results and analysis are given for the UKF simulation and the real integrated navigation data. We conclude the paper and provide an outlook on future work in Section 4.

2. The Connection between the Kalman Filter and the Temporal Derivative of the Rényi Entropy

2.1. Rényi Entropy

To calculate the Rényi entropy of the continuous probability density function (PDF), it is necessary to extend the definition of the Rényi entropy to the continuous form. The Rényi entropy of order α for a continuous random variable with a multivariate Gaussian PDF p ( x ) is defined [4] and calculated [9] as:
$$H_R^{(\alpha)}(x) = \frac{1}{1-\alpha}\ln\int_S p^{\alpha}(x)\,dx = \frac{N}{2}\ln\left(2\pi\alpha^{\frac{1}{\alpha-1}}\right) + \frac{1}{2}\ln(\det\Sigma),$$ (1)
where $\alpha > 0$ and $\alpha \neq 1$; α is a parameter providing a family of entropy functions; $N$ is the dimension of the random variable $x$; $S$ is the support of $p(x)$; and $\Sigma$ is the covariance matrix of $p(x)$.
It is straightforward to show that the temporal derivative of the Rényi entropy is given by [9]:
$$\dot{H}_R^{(\alpha)}(x) = \frac{1}{2}\mathrm{Tr}\{\Sigma^{-1}\dot{\Sigma}\},$$ (2)
where $\dot{\Sigma}$ is the temporal derivative of the covariance matrix and $\mathrm{Tr}(\cdot)$ is the trace operator.
The Shannon entropy of the multivariate Gaussian PDF follows by taking the limit of Equation (1) as α approaches 1: $H(x) = \frac{N}{2}\ln(2\pi e) + \frac{1}{2}\ln(\det\Sigma)$, and its temporal derivative is $\dot{H}(x) = \frac{1}{2}\mathrm{Tr}\{\Sigma^{-1}\dot{\Sigma}\}$. The temporal derivative of the Shannon entropy is thus identical to that of the Rényi entropy, so the conclusions below can also be derived from the temporal derivative of the Shannon entropy. However, in most cases it is the Rényi entropy itself, rather than its temporal derivative, that is used, with the free parameter α adjusted for different uncertainty measurements; since the filtering problem must account for nonlinearity and non-Gaussian noise, we adopt the Rényi entropy as the measure of uncertainty.
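As a concrete check of Equation (1), the following Python sketch (our illustration, not part of the original paper; the function names are ours) evaluates the closed form for a sample covariance and verifies that it approaches the Shannon entropy as α → 1:

```python
import numpy as np

def renyi_entropy_gaussian(Sigma, alpha):
    """Closed form of Eq. (1) for an N-dimensional Gaussian with covariance Sigma."""
    N = Sigma.shape[0]
    sign, logdet = np.linalg.slogdet(Sigma)
    if sign <= 0:
        raise ValueError("covariance must be positive definite")
    return 0.5 * N * np.log(2.0 * np.pi * alpha ** (1.0 / (alpha - 1.0))) + 0.5 * logdet

def shannon_entropy_gaussian(Sigma):
    """The alpha -> 1 limit of Eq. (1): H = (N/2) ln(2*pi*e) + (1/2) ln(det Sigma)."""
    N = Sigma.shape[0]
    _, logdet = np.linalg.slogdet(Sigma)
    return 0.5 * N * np.log(2.0 * np.pi * np.e) + 0.5 * logdet

Sigma = np.array([[2.0, 0.3],
                  [0.3, 1.0]])
# The two values agree to roughly 1e-6 as alpha approaches 1.
print(renyi_entropy_gaussian(Sigma, alpha=1.0 + 1e-6))
print(shannon_entropy_gaussian(Sigma))
```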

2.2. Kalman Filter

Given the continuous-time linear system [29]:
$$\dot{X}(t) = F(t)X(t) + G(t)w(t)$$ (3)
$$Z(t) = H(t)X(t) + v(t),$$ (4)
where $X(t)$ is the state vector, $F(t)$ is the state transition matrix, $G(t)$ is the system noise driving matrix, $Z(t)$ is the measurement vector, $H(t)$ is the measurement matrix, and $w(t)$ and $v(t)$ are independent zero-mean white Gaussian noises whose covariance matrices are $Q(t)$ and $R(t)$, respectively:
$$E[w(t)] = 0,\quad E[w(t)w^T(\tau)] = Q(t)\,\delta(t-\tau)$$ (5)
$$E[v(t)] = 0,\quad E[v(t)v^T(\tau)] = R(t)\,\delta(t-\tau)$$ (6)
$$E[w(t)v^T(\tau)] = 0,$$ (7)
where $\delta(t)$ is the Dirac delta function, $Q(t)$ is a symmetric non-negative definite matrix, and $R(t)$ is a symmetric positive definite matrix.
The continuous Kalman filter can be deduced by taking the limit of the discrete Kalman filter. The discrete-time state-space model is arranged as follows [29]:
$$X_k = \Phi_{k|k-1}X_{k-1} + \Gamma_{k|k-1}W_{k-1}$$ (8)
$$Z_k = H_kX_k + V_k,$$ (9)
where $X_k$ is the n-dimensional state vector; $Z_k$ is the m-dimensional measurement vector; $\Phi_{k|k-1}$, $\Gamma_{k|k-1}$, and $H_k$ are the known system structure parameters, called the $n \times n$ one-step state transition matrix, the $n \times l$ system noise distribution matrix, and the $m \times n$ measurement matrix, respectively; $W_{k-1}$ is the l-dimensional system noise vector, and $V_k$ is the m-dimensional measurement noise vector. Both are zero-mean Gaussian noise sequences and are independent of each other:
$$E[W_k] = 0,\quad E[W_kW_j^T] = Q_k\,\delta_{kj}$$ (10)
$$E[V_k] = 0,\quad E[V_kV_j^T] = R_k\,\delta_{kj}$$ (11)
$$E[W_kV_j^T] = 0.$$ (12)
The above equations are the basic noise assumptions in the Kalman filter state-space model, where $Q_k$ is a symmetric non-negative definite matrix, $R_k$ is a symmetric positive definite matrix, and $\delta_{kj}$ is the Kronecker delta.
The covariance parameters Q k and R k play roles similar to those of Q and R in the continuous filter, but they do not have the same numerical values. Next, the relationship between the corresponding continuous and discrete filter parameters will be derived.
To achieve the transformation from the continuous form to the discrete form, the relations between $Q$, $R$ and the corresponding $Q_k$, $R_k$ for a small step size $T_s$ are needed. According to linear system theory, the transition from the continuous model, Equations (3)–(7), to the discrete model, Equations (8)–(12), is given by:
$$\Phi_{k|k-1} = \Phi(t_k, t_{k-1}) = e^{\int_{t_{k-1}}^{t_k}F(\tau)\,d\tau}$$ (13)
$$\Gamma_{k|k-1}W_{k-1} = \int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)w(\tau)\,d\tau.$$ (14)
Denote the discrete-time interval as $T_s = t_k - t_{k-1}$, and assume that $F(t)$ does not change dramatically within the short integration interval $[t_{k-1}, t_k]$. Taking the Taylor expansion of $e^{F(t_{k-1})T_s}$ and assuming $F(t_{k-1})T_s \ll I$, the higher-order terms are negligible, and the one-step transition matrix, Equation (13), can be approximated as:
$$\Phi_{k|k-1} \approx e^{F(t_{k-1})T_s} = I + F(t_{k-1})T_s + \frac{F^2(t_{k-1})T_s^2}{2!} + \frac{F^3(t_{k-1})T_s^3}{3!} + \cdots \approx I + F(t_{k-1})T_s.$$ (15)
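A quick numerical sanity check of the first-order truncation in Equation (15) can be made with the matrix exponential; the sketch below (our illustration, with an arbitrary example F and step size) compares the exact and truncated transition matrices:

```python
import numpy as np
from scipy.linalg import expm

F = np.array([[0.0, 1.0],
              [-2.0, -0.5]])  # arbitrary example dynamics matrix
Ts = 0.01                      # small step so that F*Ts << I

Phi_exact = expm(F * Ts)            # exact one-step transition matrix
Phi_first_order = np.eye(2) + F * Ts  # truncation used in Eq. (15)
# The discrepancy is O(Ts^2); here it is on the order of 1e-4.
print(np.max(np.abs(Phi_exact - Phi_first_order)))
```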
Equation (14) shows that $\Gamma_{k|k-1}W_{k-1}$ is a linear transform of the Gaussian white noise $w(\tau)$, so the result remains a normally distributed random vector. Therefore, the first- and second-order statistical characteristics fully describe $\Gamma_{k|k-1}W_{k-1}$. Referring to Equation (5), the mean of $\Gamma_{k|k-1}W_{k-1}$ is given as follows:
$$E[\Gamma_{k|k-1}W_{k-1}] = E\left[\int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)w(\tau)\,d\tau\right] = \int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)E[w(\tau)]\,d\tau = 0.$$ (16)
For the second-order statistical characteristics, when $k \neq j$, the noise over the two disjoint time intervals is independent, so $\Gamma_{k|k-1}W_{k-1}$ and $\Gamma_{j|j-1}W_{j-1}$ are uncorrelated:
$$E[(\Gamma_{k|k-1}W_{k-1})(\Gamma_{j|j-1}W_{j-1})^T] = 0 \quad (k \neq j).$$ (17)
When $k = j$, we have:
$$\begin{aligned} E[(\Gamma_{k|k-1}W_{k-1})(\Gamma_{k|k-1}W_{k-1})^T] &= E\left\{\left[\int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)w(\tau)\,d\tau\right]\left[\int_{t_{k-1}}^{t_k}\Phi(t_k,s)G(s)w(s)\,ds\right]^T\right\} \\ &= \int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)\int_{t_{k-1}}^{t_k}E[w(\tau)w^T(s)]\,G^T(s)\Phi^T(t_k,s)\,ds\,d\tau. \end{aligned}$$ (18)
Substituting Equation (5) into the above equation:
$$E[(\Gamma_{k|k-1}W_{k-1})(\Gamma_{k|k-1}W_{k-1})^T] = \int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)\int_{t_{k-1}}^{t_k}Q(\tau)\,\delta(\tau-s)\,G^T(s)\Phi^T(t_k,s)\,ds\,d\tau = \int_{t_{k-1}}^{t_k}\Phi(t_k,\tau)G(\tau)Q(\tau)G^T(\tau)\Phi^T(t_k,\tau)\,d\tau.$$ (19)
When the noise driving matrix $G(\tau)$ changes slowly during the time interval $[t_{k-1}, t_k]$, Equation (19) becomes:
$$\begin{aligned} E[(\Gamma_{k|k-1}W_{k-1})(\Gamma_{k|k-1}W_{k-1})^T] &\approx \int_{t_{k-1}}^{t_k}[I + F(t_{k-1})(t_k-\tau)]\,G(t_{k-1})Q(\tau)G^T(t_{k-1})\,[I + F(t_{k-1})(t_k-\tau)]^T\,d\tau \\ &= \left[I + \tfrac{1}{2}F(t_{k-1})T_s\right]\left[G(t_{k-1})Q(t_{k-1})G^T(t_{k-1})T_s\right]\left[I + \tfrac{1}{2}F(t_{k-1})T_s\right]^T + \tfrac{1}{12}F(t_{k-1})G(t_{k-1})Q(t_{k-1})G^T(t_{k-1})F^T(t_{k-1})T_s^3 \\ &\approx \left\{[I + F(t_{k-1})T_s]G(t_{k-1})\right\}\left[Q(t_{k-1})T_s\right]\left\{[I + F(t_{k-1})T_s]G(t_{k-1})\right\}^T. \end{aligned}$$ (20)
When $F(t_{k-1})T_s \ll I$ is satisfied, the above equation can be further approximated as:
$$E[(\Gamma_{k|k-1}W_{k-1})(\Gamma_{k|k-1}W_{k-1})^T] \approx G(t_{k-1})\left[Q(t_{k-1})T_s\right]G^T(t_{k-1}).$$ (21)
Comparing the result with Equation (10):
$$\Gamma_{k|k-1} \approx [I + F(t_{k-1})T_s]\,G(t_{k-1}) \approx G(t_{k-1})$$ (22)
$$E[W_kW_j^T] = Q_k\,\delta_{kj} = [Q(t_k)T_s]\,\delta_{kj}.$$ (23)
Notice that [29]:
$$Q_k = Q(t_k)\,T_s.$$ (24)
The derivation of the relation between $R_k$ and $R$ is more subtle. In the continuous model, $v(t)$ is white, so directly sampling $Z(t)$ would lead to measurement noise with infinite variance. Hence, in the sampling process, we imagine averaging the continuous measurement over the interval $T_s$ to obtain an equivalent discrete sample. This is justified because $x$ is not Gaussian white noise and can be regarded as approximately constant within the interval:
$$Z_k = \frac{1}{T_s}\int_{t_{k-1}}^{t_k}Z(t)\,dt = \frac{1}{T_s}\int_{t_{k-1}}^{t_k}[H(t)x(t) + v(t)]\,dt \approx H(t_k)x_k + \frac{1}{T_s}\int_{t_{k-1}}^{t_k}v(t)\,dt.$$ (25)
The equivalent discrete measurement noise is then:
$$V_k = \frac{1}{T_s}\int_{t_{k-1}}^{t_k}v(t)\,dt.$$ (26)
From Equations (6) and (26), we have:
$$E[V_kV_j^T] = R_k\,\delta_{kj} = \frac{1}{T_s^2}\int_{t_{k-1}}^{t_k}\int_{t_{j-1}}^{t_j}E[v(\tau)v^T(s)]\,d\tau\,ds = \frac{1}{T_s^2}\int_{t_{k-1}}^{t_k}\int_{t_{j-1}}^{t_j}R(\tau)\,\delta(s-\tau)\,d\tau\,ds = \frac{1}{T_s^2}\int_{t_{k-1}}^{t_k}R(\tau)\,\delta_{kj}\,d\tau \approx \frac{R(t_k)}{T_s}\,\delta_{kj}.$$ (27)
Comparing this with Equation (11), we have [29]:
$$R_k = \frac{R(t_k)}{T_s}.$$ (28)
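The three discretization relations, Equations (15), (24), and (28), together with $\Gamma_{k|k-1} \approx G$ from Equation (22), can be collected into a small helper. This is an illustrative sketch only, with all matrices supplied by the caller:

```python
import numpy as np

def c2d_filter_params(F, G, Q, R, Ts):
    """Map continuous filter parameters to discrete ones, to first order in Ts:
    Phi ~ I + F*Ts   (Eq. 15)
    Gamma ~ G        (Eq. 22)
    Qk = Q * Ts      (Eq. 24): process noise covariance grows with the step
    Rk = R / Ts      (Eq. 28): averaged measurement noise shrinks with the step
    """
    Phi = np.eye(F.shape[0]) + F * Ts
    Gamma = G.copy()
    Qk = Q * Ts
    Rk = R / Ts
    return Phi, Gamma, Qk, Rk

# Example usage with placeholder matrices of our choosing:
F = np.array([[0.0, 1.0], [-1.0, -0.3]])
G = np.eye(2)
Q = 0.1 * np.eye(2)
R = np.array([[0.05]])
Phi, Gamma, Qk, Rk = c2d_filter_params(F, G, Q, R, Ts=0.01)
```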

2.3. Derivation of the Kalman Filter

Assuming that the optimal state estimate at $t_{k-1}$ is $\hat{X}_{k-1}$, the state estimation error is $\tilde{X}_{k-1}$, and the state estimation covariance matrix is $\Sigma_{k-1}$:
$$\tilde{X}_{k-1} = X_{k-1} - \hat{X}_{k-1}$$ (29)
and
$$\Sigma_{k-1} = E[\tilde{X}_{k-1}\tilde{X}_{k-1}^T] = E[(X_{k-1} - \hat{X}_{k-1})(X_{k-1} - \hat{X}_{k-1})^T].$$ (30)
Taking the expectation of both sides of Equation (8) gives the state one-step prediction and the corresponding one-step prediction error:
$$X_{k|k-1} = E[X_k] = E[\Phi_{k|k-1}X_{k-1} + \Gamma_{k|k-1}W_{k-1}] = \Phi_{k|k-1}E[X_{k-1}] = \Phi_{k|k-1}\hat{X}_{k-1},$$ (31)
$$\tilde{X}_{k|k-1} = X_k - X_{k|k-1}.$$ (32)
Substituting Equations (8) and (31) into Equation (32) leads to:
$$\tilde{X}_{k|k-1} = (\Phi_{k|k-1}X_{k-1} + \Gamma_{k|k-1}W_{k-1}) - \Phi_{k|k-1}\hat{X}_{k-1} = \Phi_{k|k-1}(X_{k-1} - \hat{X}_{k-1}) + \Gamma_{k|k-1}W_{k-1} = \Phi_{k|k-1}\tilde{X}_{k-1} + \Gamma_{k|k-1}W_{k-1}.$$ (33)
Since $\tilde{X}_{k-1}$ is uncorrelated with $W_{k-1}$, the covariance of the state one-step prediction error $\tilde{X}_{k|k-1}$ follows as:
$$\Sigma_{k|k-1} = E[\tilde{X}_{k|k-1}\tilde{X}_{k|k-1}^T] = E[(\Phi_{k|k-1}\tilde{X}_{k-1} + \Gamma_{k|k-1}W_{k-1})(\Phi_{k|k-1}\tilde{X}_{k-1} + \Gamma_{k|k-1}W_{k-1})^T] = \Phi_{k|k-1}E[\tilde{X}_{k-1}\tilde{X}_{k-1}^T]\Phi_{k|k-1}^T + \Gamma_{k|k-1}E[W_{k-1}W_{k-1}^T]\Gamma_{k|k-1}^T = \Phi_{k|k-1}\Sigma_{k-1}\Phi_{k|k-1}^T + \Gamma_{k|k-1}Q_{k-1}\Gamma_{k|k-1}^T.$$ (34)
In a similar way, the measurement at $t_k$ can be predicted from the state one-step prediction $X_{k|k-1}$ and the system measurement equation, Equation (9), as follows:
$$Z_{k|k-1} = E[H_kX_{k|k-1} + V_k] = H_kX_{k|k-1}.$$ (35)
In fact, there is a difference between the measurement one-step prediction $Z_{k|k-1}$ and the actual measurement $Z_k$. This difference is denoted as the measurement one-step prediction error:
$$\tilde{Z}_{k|k-1} = Z_k - Z_{k|k-1}.$$ (36)
Substituting the measurement Equations (9) and (35) into Equation (36) yields:
$$\tilde{Z}_{k|k-1} = Z_k - H_kX_{k|k-1} = H_kX_k + V_k - H_kX_{k|k-1} = H_k\tilde{X}_{k|k-1} + V_k.$$ (37)
In classical Kalman filter theory, the measurement one-step prediction error $\tilde{Z}_{k|k-1}$ is called the innovation; it carries the new information about the state estimate brought in by the measurement.
On the one hand, if the estimate of $X_k$ only includes the state one-step prediction $X_{k|k-1}$ from the system state equation, the estimation accuracy will be low, as no information from the measurement equation has been used. On the other hand, according to Equation (37), the measurement one-step prediction error calculated from the system measurement equation contains information about the state one-step prediction $X_{k|k-1}$. Consequently, it is natural to use all the state information coming from both the system state equation and the measurement equation, and to correct the state one-step prediction $X_{k|k-1}$ with the measurement one-step prediction error $\tilde{Z}_{k|k-1}$. Thereby, the optimal estimate of $X_k$ can be calculated as a combination of $X_{k|k-1}$ and $\tilde{Z}_{k|k-1}$ as follows:
$$\hat{X}_k = X_{k|k-1} + K_k\tilde{Z}_{k|k-1},$$ (38)
where $K_k$ is the undetermined correction factor matrix.
Substituting Equations (31) and (37) into Equation (38) gives:
$$\hat{X}_k = X_{k|k-1} + K_k(Z_k - H_kX_{k|k-1}) = (I - K_kH_k)X_{k|k-1} + K_kZ_k = (I - K_kH_k)\Phi_{k|k-1}\hat{X}_{k-1} + K_kZ_k.$$ (39)
From Equation (39), the current state estimate $\hat{X}_k$ is a linear combination of the last state estimate $\hat{X}_{k-1}$ and the current measurement $Z_k$, which accounts for the structural parameter $\Phi_{k|k-1}$ of the state equation and the structural parameter $H_k$ of the measurement equation.
The state estimation error at the current time $t_k$ is denoted as:
$$\tilde{X}_k = X_k - \hat{X}_k,$$ (40)
where $X_k$ is the true value and $\hat{X}_k$ is the posterior estimate of $X_k$.
Substituting Equation (39) into Equation (40) gives:
$$\tilde{X}_k = X_k - [X_{k|k-1} + K_k(Z_k - H_kX_{k|k-1})] = (X_k - X_{k|k-1}) - K_k(H_kX_k + V_k - H_kX_{k|k-1}) = \tilde{X}_{k|k-1} - K_k(H_k\tilde{X}_{k|k-1} + V_k) = (I - K_kH_k)\tilde{X}_{k|k-1} - K_kV_k.$$ (41)
Then, the mean square error matrix of the state estimate $\hat{X}_k$ is given by:
$$\Sigma_k = E[\tilde{X}_k\tilde{X}_k^T] = E\{[(I - K_kH_k)\tilde{X}_{k|k-1} - K_kV_k][(I - K_kH_k)\tilde{X}_{k|k-1} - K_kV_k]^T\} = (I - K_kH_k)\Sigma_{k|k-1}(I - K_kH_k)^T + K_kE[V_kV_k^T]K_k^T = (I - K_kH_k)\Sigma_{k|k-1}(I - K_kH_k)^T + K_kR_kK_k^T.$$ (42)
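Before passing to the continuous limit, it may help to see Equations (31), (34), (36)–(38), and (42) assembled into one discrete predict/update cycle. The sketch below is our illustration, not code from the paper; the gain is written in the standard innovation form, which is algebraically equivalent to the form $K_k = \Sigma_kH_k^TR_k^{-1}$ derived later in this paper:

```python
import numpy as np

def kf_step(x_hat, Sigma, z, Phi, Gamma, H, Qk, Rk):
    """One discrete Kalman filter cycle built from Eqs. (31), (34), (36)-(38), (42)."""
    # Prediction, Eqs. (31) and (34)
    x_pred = Phi @ x_hat
    Sigma_pred = Phi @ Sigma @ Phi.T + Gamma @ Qk @ Gamma.T
    # Innovation and its covariance, Eqs. (36)-(37)
    innov = z - H @ x_pred
    S = H @ Sigma_pred @ H.T + Rk
    # Standard innovation-form gain (equivalent to Sigma_k H^T Rk^{-1})
    K = Sigma_pred @ H.T @ np.linalg.inv(S)
    # State update, Eq. (38), and Joseph-form covariance update, Eq. (42)
    x_hat = x_pred + K @ innov
    I_KH = np.eye(len(x_hat)) - K @ H
    Sigma = I_KH @ Sigma_pred @ I_KH.T + K @ Rk @ K.T
    return x_hat, Sigma
```

The Joseph form of Equation (42) is used because it remains valid for any gain $K_k$, not only the optimal one, which is exactly the property the derivation below exploits.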
Substituting Equation (34) into Equation (42) gives:
$$\begin{aligned} \Sigma_k &= (I - K_kH_k)[\Phi_{k|k-1}\Sigma_{k-1}\Phi_{k|k-1}^T + \Gamma_{k|k-1}Q_{k-1}\Gamma_{k|k-1}^T](I - K_kH_k)^T + K_kR_kK_k^T \\ &= \Phi_{k|k-1}\Sigma_{k-1}\Phi_{k|k-1}^T + K_kH_k\Phi_{k|k-1}\Sigma_{k-1}\Phi_{k|k-1}^TH_k^TK_k^T - \Phi_{k|k-1}\Sigma_{k-1}\Phi_{k|k-1}^TH_k^TK_k^T - K_kH_k\Phi_{k|k-1}\Sigma_{k-1}\Phi_{k|k-1}^T \\ &\quad + \Gamma_{k|k-1}Q_{k-1}\Gamma_{k|k-1}^T - K_kH_k\Gamma_{k|k-1}Q_{k-1}\Gamma_{k|k-1}^T - \Gamma_{k|k-1}Q_{k-1}\Gamma_{k|k-1}^TH_k^TK_k^T + K_kH_k\Gamma_{k|k-1}Q_{k-1}\Gamma_{k|k-1}^TH_k^TK_k^T + K_kR_kK_k^T. \end{aligned}$$ (43)
We now use the approximation $\Phi_{k|k-1} \approx I + F(t_{k-1})T_s$ from Equation (15) and $\Gamma_{k|k-1} \approx G(t_{k-1})$ from Equation (22), which give:
$$\begin{aligned} \Sigma_k &= [I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^T + K_kH_k[I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^TH_k^TK_k^T \\ &\quad - [I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^TH_k^TK_k^T - K_kH_k[I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^T \\ &\quad + G(t_{k-1})Q_{k-1}G^T(t_{k-1}) - K_kH_kG(t_{k-1})Q_{k-1}G^T(t_{k-1}) - G(t_{k-1})Q_{k-1}G^T(t_{k-1})H_k^TK_k^T \\ &\quad + K_kH_kG(t_{k-1})Q_{k-1}G^T(t_{k-1})H_k^TK_k^T + K_kR_kK_k^T. \end{aligned}$$ (44)
Note from Equation (24) that $Q_k$ is of the order of $T_s$ and from Equation (28) that $R_k = R(t_k)/T_s$; then, Equation (44) becomes:
$$\begin{aligned} \Sigma_k &= [I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^T + K_kH_k[I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^TH_k^TK_k^T \\ &\quad - [I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^TH_k^TK_k^T - K_kH_k[I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^T \\ &\quad + G(t_{k-1})Q(t_k)T_sG^T(t_{k-1}) - K_kH_kG(t_{k-1})Q(t_k)T_sG^T(t_{k-1}) - G(t_{k-1})Q(t_k)T_sG^T(t_{k-1})H_k^TK_k^T \\ &\quad + K_kH_kG(t_{k-1})Q(t_k)T_sG^T(t_{k-1})H_k^TK_k^T + K_k\frac{R(t_k)}{T_s}K_k^T. \end{aligned}$$ (45)

2.4. The Temporal Derivative of the Rényi Entropy and the Kalman Filter Gain

To obtain the continuous form of the covariance matrix Σ, the limit $T_s \to 0$ will be taken. However, the relation between the undetermined correction factor matrix $K_k$ and its continuous counterpart remains unknown. Therefore, we make the following assumption.
Assumption 1.
$K_k$ is of the order of $T_s$; that is:
$$K(t_k) = \frac{K_k}{T_s}.$$ (46)
From the conclusion, this assumption can also be derived conversely. We now state the conclusion as a theorem under this assumption, as follows:
Theorem 1.
The discrete form of the undetermined correction factor matrix has the same form as the continuous one when the temporal derivative of the Rényi entropy is minimized. This can be presented in mathematical form as follows:
$$\left\{K_k = \Sigma_kH_k^TR_k^{-1},\; K = \Sigma H^TR^{-1} \;\middle|\; K^* = \arg\min_K \dot{H}_R^{(\alpha)}(K)\right\}.$$ (47)
Proof of Theorem 1.
We substitute $K_k = K(t_k)T_s$ into Equation (45) and neglect higher-order terms in $T_s$; Equation (45) becomes:
$$\begin{aligned} \Sigma_k &= [I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^T + T_sK(t_k)H_k[I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^TH_k^TT_sK^T(t_k) \\ &\quad - [I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^TH_k^TT_sK^T(t_k) - T_sK(t_k)H_k[I + F(t_{k-1})T_s]\Sigma_{k-1}[I + F(t_{k-1})T_s]^T \\ &\quad + G(t_{k-1})Q(t_k)T_sG^T(t_{k-1}) - T_sK(t_k)H_kG(t_{k-1})Q(t_k)T_sG^T(t_{k-1}) - G(t_{k-1})Q(t_k)T_sG^T(t_{k-1})H_k^TT_sK^T(t_k) \\ &\quad + T_sK(t_k)H_kG(t_{k-1})Q(t_k)T_sG^T(t_{k-1})H_k^TT_sK^T(t_k) + T_sK(t_k)\frac{R(t_k)}{T_s}T_sK^T(t_k) \\ &= \Sigma_{k-1} + T_sF(t_{k-1})\Sigma_{k-1} + T_s\Sigma_{k-1}F^T(t_{k-1}) - \Sigma_{k-1}H_k^TT_sK^T(t_k) - T_sK(t_k)H_k\Sigma_{k-1} + G(t_{k-1})Q(t_k)T_sG^T(t_{k-1}) + T_sK(t_k)R(t_k)K^T(t_k). \end{aligned}$$ (48)
Moving the first term on the right-hand side of Equation (48) to the left and dividing both sides by $T_s$ forms the finite-difference expression:
$$\frac{\Sigma_k - \Sigma_{k-1}}{T_s} = F(t_{k-1})\Sigma_{k-1} + \Sigma_{k-1}F^T(t_{k-1}) - \Sigma_{k-1}H_k^TK^T(t_k) - K(t_k)H_k\Sigma_{k-1} + G(t_{k-1})Q(t_k)G^T(t_{k-1}) + K(t_k)R(t_k)K^T(t_k).$$ (49)
Finally, passing to the limit $T_s \to 0$ and dropping the subscripts leads to the matrix differential equation:
$$\dot{\Sigma} = F\Sigma + \Sigma F^T - \Sigma H^TK^T - KH\Sigma + GQG^T + KRK^T.$$ (50)
Σ is invertible, as it is positive definite. Multiplying Equation (50) by $\Sigma^{-1}$, we can consider the temporal derivative of the Rényi entropy of the mean square error matrix Σ using Equation (2):
$$\begin{aligned} \dot{H}_R^{(\alpha)} &= \frac{1}{2}\mathrm{Tr}\{\Sigma^{-1}\dot{\Sigma}\} = \frac{1}{2}\mathrm{Tr}\{\Sigma^{-1}F\Sigma + F^T - H^TK^T - \Sigma^{-1}KH\Sigma + \Sigma^{-1}GQG^T + \Sigma^{-1}KRK^T\} \\ &= \frac{1}{2}\mathrm{Tr}\{F + F^T - H^TK^T - KH + \Sigma^{-1}GQG^T + \Sigma^{-1}KRK^T\} \\ &= \frac{1}{2}\mathrm{Tr}\{2F - 2KH + \Sigma^{-1}GQG^T + \Sigma^{-1}KRK^T\}, \end{aligned}$$ (51)
where the invariance of the trace under cyclic permutations has been used to cancel $\Sigma^{-1}$ and $\Sigma$, and the facts that $\mathrm{Tr}(F) = \mathrm{Tr}(F^T)$ and $\mathrm{Tr}(H^TK^T) = \mathrm{Tr}(KH)$ have been used to simplify the formula.
Equation (51) is clearly a quadratic function of the undetermined correction factor matrix $K$, so $\dot{H}_R^{(\alpha)}(x)$ must have a minimum. Taking the derivative of both sides of Equation (51) with respect to the matrix $K$ gives:
$$\frac{\partial\dot{H}_R^{(\alpha)}}{\partial K} = \frac{1}{2}\left[-2\frac{\partial\,\mathrm{Tr}(KH)}{\partial K} + \frac{\partial\,\mathrm{Tr}(\Sigma^{-1}KRK^T)}{\partial K}\right] = \frac{1}{2}\left[-2H^T + \Sigma^{-1}KR + (RK^T\Sigma^{-1})^T\right].$$ (52)
In addition, since $\Sigma^{-1}$ and $R$ are symmetric matrices, the result is:
$$\frac{\partial\dot{H}_R^{(\alpha)}}{\partial K} = -H^T + \Sigma^{-1}KR.$$ (53)
$R$ is invertible, as it is positive definite. According to the extreme value principle, setting the above derivative to zero gives:
$$K = \Sigma H^TR^{-1}.$$ (54)
So far, we have found the analytic solution for the undetermined correction factor matrix $K$, which is exactly the continuous-time Kalman filter gain of the classical Kalman filter. The recursive formulation of the Kalman filter can then be established through the Kalman filter gain $K$. Most importantly, this reveals the connection between the temporal derivative of the Rényi entropy and the classical Kalman filter: the temporal derivative of the Rényi entropy is minimized when the Kalman filter gain satisfies Equation (54).
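Theorem 1 is easy to verify numerically: evaluate the entropy rate of Equation (51) at $K^* = \Sigma H^TR^{-1}$ and at random perturbations of it. The sketch below is our illustration, with randomly chosen system matrices of our own:

```python
import numpy as np

rng = np.random.default_rng(0)
n, m = 3, 2
F = rng.standard_normal((n, n))
G = np.eye(n)
H = rng.standard_normal((m, n))
Q = np.eye(n)
R = np.eye(m)
A = rng.standard_normal((n, n))
Sigma = A @ A.T + n * np.eye(n)       # a positive definite covariance
Sigma_inv = np.linalg.inv(Sigma)

def entropy_rate(K):
    """Eq. (51): H_dot = 0.5 * Tr{2F - 2KH + Sigma^-1 G Q G^T + Sigma^-1 K R K^T}."""
    return 0.5 * np.trace(2 * F - 2 * K @ H
                          + Sigma_inv @ G @ Q @ G.T
                          + Sigma_inv @ K @ R @ K.T)

K_star = Sigma @ H.T @ np.linalg.inv(R)   # Eq. (54)
for _ in range(5):
    K_pert = K_star + 0.1 * rng.standard_normal((n, m))
    # Eq. (51) is quadratic in K with a positive definite Hessian,
    # so any perturbed gain gives a larger entropy rate.
    assert entropy_rate(K_pert) >= entropy_rate(K_star)
print("K* = Sigma H^T R^{-1} attains the minimum entropy rate")
```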
Looking back at Assumption 1 and substituting Equation (28) into Equation (54), we obtain:
$$K(t_k) = \frac{K_k}{T_s} = K = \Sigma H^TR^{-1} = \Sigma_kH_k^T(R_kT_s)^{-1} = \frac{\Sigma_kH_k^TR_k^{-1}}{T_s}.$$ (55)
Therefore, the discrete-time Kalman filter gain can be expressed as follows:
$$K_k = \Sigma_kH_k^TR_k^{-1}.$$ (56)
Remark 1.
The discrete-time Kalman filter gain has the same form as the continuous-time filter gain shown in Equation (54). This is consistent with intuition and, in turn, confirms the correctness and rationality of Assumption 1.
Remark 2.
Minimizing the temporal derivative of the Rényi entropy yields the same Kalman filter gain as the original Kalman filter, which is deduced under the minimum mean square error criterion.
Substituting Equation (54) into Equation (50), we have:
$$\dot{\Sigma} = F\Sigma + \Sigma F^T - \Sigma H^TK^T - \Sigma H^TR^{-1}H\Sigma + GQG^T + \Sigma H^TR^{-1}RK^T = F\Sigma + \Sigma F^T - \Sigma H^TR^{-1}H\Sigma + GQG^T.$$ (57)
This is a first-order matrix differential equation that is quadratic in the mean square error matrix Σ, commonly called the Riccati equation. It is the same result as that of the Kalman–Bucy filter [7].
If the system equation, Equation (3), and the measurement equation, Equation (4), form a linear time-invariant system with constant noise covariance, the mean square error matrix Σ may reach a steady-state value, and Σ ˙ may eventually reach zero. So, we have the continuous algebraic Riccati equation as follows:
$$\dot{\Sigma} = F\Sigma + \Sigma F^T - \Sigma H^TR^{-1}H\Sigma + GQG^T = 0.$$ (58)
As we can see, the time derivative of covariance at the steady state is zero; then, the temporal derivative of the Rényi entropy should also be zero:
$$\dot{H}_R^{(\alpha)} = 0.$$ (59)
This implies that when the system approaches a stable state, the Rényi entropy approaches a steady value, so that its temporal derivative is zero. This is reasonable: a steady system has constant Rényi entropy because its uncertainty is stable, which follows our intuitive understanding. Consequently, whether the value of the Rényi entropy is stable can serve as a valid indicator of whether the system is approaching the steady state.
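To illustrate Equations (58) and (59), one can solve the continuous algebraic Riccati equation numerically and check that the entropy rate vanishes at the steady-state covariance. The sketch below (our illustration, using arbitrary example matrices and SciPy's CARE solver) does exactly this:

```python
import numpy as np
from scipy.linalg import solve_continuous_are

F = np.array([[0.0, 1.0], [-1.0, -0.3]])
G = np.eye(2)
H = np.array([[1.0, 0.0]])
Q = 0.1 * np.eye(2)
R = np.array([[0.05]])

# Filter CARE, Eq. (58): F S + S F^T - S H^T R^-1 H S + G Q G^T = 0.
# solve_continuous_are solves the dual control form, hence the transposes.
Sigma_ss = solve_continuous_are(F.T, H.T, G @ Q @ G.T, R)

Sigma_dot = (F @ Sigma_ss + Sigma_ss @ F.T
             - Sigma_ss @ H.T @ np.linalg.inv(R) @ H @ Sigma_ss
             + G @ Q @ G.T)
H_dot = 0.5 * np.trace(np.linalg.inv(Sigma_ss) @ Sigma_dot)  # Eq. (59)
print(H_dot)  # ~ 0 up to numerical precision
```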

3. Simulations and Analysis

In this section, we give two experiments to show that when the nonlinear filter system approaches the steady state, the Rényi entropy of the system approaches stability. The first experiment is a numerical example of a falling body in noisy conditions, tracked by radar [30] using the UKF. The second experiment is a practical experiment of loosely coupled integration [29]. The simulations were carried out in MATLAB 2018a on a computer with an i5-5200U 2.20 GHz CPU, and the graphs were plotted in MATLAB.

3.1. Falling Body Tracking

In the example of a falling body being tracked by radar, the body falls vertically. The radar is placed at a horizontal distance L from the body and measures the distance y from the radar to the body. The state-space equation of the body is given by:
$$\begin{aligned} \dot{x}_1 &= x_2 \\ \dot{x}_2 &= d + g \\ \dot{x}_3 &= 0, \end{aligned}$$ (60)
where $x_1$ is the height, $x_2$ is the velocity, $x_3$ is the ballistic coefficient, $g = 9.81$ m/s² is the gravitational acceleration, and $d$ is the air drag, which can be approximated as:
$$d = \frac{\rho x_2^2}{2x_3} = \frac{\rho_0\exp(-x_1/k)\,x_2^2}{2x_3},$$ (61)
where $\rho = \rho_0\exp(-x_1/k)$ is the air density; $\rho_0 = 1.225$ and $k = 6705.6$ are constants.
The measurement equation is:
$$y = \sqrt{L^2 + x_1^2}.$$ (62)
It is worth noting that the drag and the square root cause severe nonlinearity in the state-space function and the measurement function, respectively.
The discrete-time nonlinear system can be obtained by the Euler discretization method. Adding Gaussian white process and measurement noises, we obtain:
$$\begin{aligned} x_1(n+1) &= x_1(n) + x_2(n)\,T + w_1(n) \\ x_2(n+1) &= x_2(n) + (d + g)\,T + w_2(n) \\ x_3(n+1) &= x_3(n) + w_3(n) \end{aligned}$$ (63)
$$y(n) = \sqrt{L^2 + x_1^2(n)} + v(n).$$ (64)
In the UKF numerical experiment, we set the sampling period to $T = 0.4$ s, the horizontal distance to $L = 100$ m, the maximum number of samples to $N = 100$, the process noise covariance to $S_w = \mathrm{diag}(10^5, 10^3, 10^2)$, the measurement noise covariance to $S_v = 10^6$, and the initial state to $x = [10^5;\ 5000;\ 400]$.
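For reference, a Python sketch of the discrete model in Equations (63) and (64) follows (our illustration of the stated setup; any standard UKF implementation can then be run on these two functions):

```python
import numpy as np

T, L = 0.4, 100.0               # sampling period [s] and radar offset [m]
rho0, k_atm, g = 1.225, 6705.6, 9.81

def f(x):
    """Euler-discretized dynamics, Eq. (63); x = [height, velocity, ballistic coef]."""
    drag = rho0 * np.exp(-x[0] / k_atm) * x[1] ** 2 / (2.0 * x[2])  # Eq. (61)
    return np.array([x[0] + x[1] * T,
                     x[1] + (drag + g) * T,
                     x[2]])

def h(x):
    """Radar range measurement, Eq. (64), without the additive noise term."""
    return np.sqrt(L ** 2 + x[0] ** 2)
```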
The results are shown as follows. Figure 1 shows the evolution of the covariance matrix Σ. Figure 2 and Figure 3 show the Rényi entropy of the covariance matrix Σ and its change between adjacent epochs, respectively. Notice that the uncertainty increases near the middle of the plots, coincident with the drag peak. However, the Rényi entropy fluctuates around 15 even though the fourth element of Σ changes dramatically. The entropy changes closely accompany the drag peak, which means that the change in the entropy of the covariance reflects the evolution of the matrix Σ. Consequently, the Rényi entropy can be viewed as an indicator of whether the system is approaching the steady state.

3.2. Practical Integrated Navigation

In the loosely coupled integrated navigation system, the system state vector x is composed of the inertial navigation system (INS) error states in the North–East–Down (NED) local-level navigation frame, and can be expressed as follows:
$$x = \left[(\delta r^n)^T \;\; (\delta v^n)^T \;\; \psi^T \;\; b_g^T \;\; b_a^T\right]^T,$$ (65)
where $\delta r^n$, $\delta v^n$, and $\psi$ represent the position error, the velocity error, and the attitude error, respectively; $b_g$ and $b_a$ are the gyroscope bias and the accelerometer bias, each modeled as a first-order Gauss–Markov process.
The discrete-time state update equation is used to update state parameters as follows:
$$x_k = \Phi_{k|k-1}x_{k-1} + G_{k|k-1}w_{k-1},$$ (66)
where $G_{k|k-1}$ is the system noise matrix, $w_{k-1}$ is the system noise, and $\Phi_{k|k-1}$ is the state transition matrix from $t_{k-1}$ to $t_k$, determined by the dynamic model of the state parameters.
In the loosely coupled integration, the measurement equation can be simply expressed as:
$$\delta z_k = H_kx_k + v_k,$$ (67)
where $v_k$ is the measurement noise, $H_k$ is the measurement matrix, and $\delta z_k$ is the measurement vector calculated by differencing the global navigation satellite system (GNSS) observation with the inertial navigation system (INS) mechanization output.
The experiments reported in this section were carried out by processing data from an unmanned ground vehicle test. The gyroscope angular random walk was set to 0.03 deg/√h and the velocity random walk was set to 0.05 m/s/√h. The sampling rates of the inertial measurement unit (IMU) and the GNSS were 200 Hz and 1 Hz, respectively. The test lasted 48 min.
The position, velocity, and attitude error curves of the loosely coupled integration are shown in Figure 4, Figure 5 and Figure 6. The root mean squares (RMSs) of the position errors in the north, east, and down directions are 0.0057 m, 0.0024 m, and 0.0134 m, respectively. The RMSs of the velocity errors in the north, east, and down directions are 0.0023 m/s, 0.0021 m/s, and 0.0038 m/s, respectively. The RMSs of the attitude errors in the roll, pitch, and yaw directions are 0.0034 deg, 0.0030 deg, and 0.0178 deg, respectively.
The Rényi entropy of the covariance Σ is shown in Figure 7. As we can see, the Rényi entropy fluctuates around 100 once the filter converges, which is consistent with the conclusion drawn from the entropy perspective.
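This observation suggests a simple convergence monitor: track the Rényi entropy of the filter covariance over time and declare steady state once it stops drifting. The sketch below is our illustration only; the window length and tolerance are arbitrary choices, not values from the paper:

```python
import numpy as np

def renyi_entropy(Sigma, alpha=2.0):
    """Eq. (1) evaluated for the filter covariance."""
    N = Sigma.shape[0]
    _, logdet = np.linalg.slogdet(Sigma)
    return 0.5 * N * np.log(2.0 * np.pi * alpha ** (1.0 / (alpha - 1.0))) + 0.5 * logdet

def is_steady(entropy_history, window=20, tol=1e-3):
    """Declare steady state when the entropy range over the last `window` epochs
    is below `tol`, i.e., the Rényi entropy has stopped drifting."""
    if len(entropy_history) < window:
        return False
    recent = np.asarray(entropy_history[-window:])
    return np.max(recent) - np.min(recent) < tol

# Usage: append renyi_entropy(Sigma_k) after each filter epoch and test is_steady(...).
```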

4. Conclusions and Final Remarks

We have reconsidered the original Kalman filter through the minimization of the temporal derivative of the Rényi entropy. In particular, we showed that the temporal derivative of the Rényi entropy is equal to zero when the Kalman filter system approaches the steady state, which means that the Rényi entropy approaches a stable value. Finally, simulation and practical experiments showed that the Rényi entropy indeed stays stable when the system becomes steady.
Future work includes calculating the Rényi entropy of the innovation term when the measurements and the noise are non-Gaussian [14], in order to evaluate the effectiveness of measurements and adjust the noise covariance matrix. We can also calculate the Rényi entropy of the nonlinear dynamical equation to measure the nonlinearity in the propagation step.

Author Contributions

Conceptualization, Y.L. and C.G.; Funding acquisition, C.G. and J.L.; Investigation, Y.L.; Methodology, Y.L., C.G., and S.Y.; Project administration, J.L.; Resources, C.G.; Software, Y.L. and S.Y.; Supervision, J.L.; Validation, S.Y.; Visualization, S.Y.; Writing—original draft, Y.L.; Writing—review and editing, C.G., S.Y., and J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by a grant from the National Key Research and Development Program of China (2018YFB1305001).


Conflicts of Interest

The authors declare no conflict of interest.

References

1. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423.
2. Principe, J.C. Information Theoretic Learning: Renyi's Entropy and Kernel Perspectives; Springer Science & Business Media: Berlin, Germany, 2010.
3. He, R.; Hu, B.; Yuan, X.; Wang, L. Robust Recognition via Information Theoretic Learning; Springer International Publishing: Berlin, Germany, 2014.
4. Rényi, A. On measures of entropy and information. In Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA, USA, 20 June–30 July 1961.
5. Liang, X.S. Entropy evolution and uncertainty estimation with dynamical systems. Entropy 2014, 16, 3605–3634.
6. Kullback, S.; Leibler, R.A. On information and sufficiency. Ann. Math. Stat. 1951, 22, 79–86.
7. Kalman, R.E.; Bucy, R.S. New results in linear filtering and prediction theory. J. Basic Eng. 1961, 83, 95–108.
8. DeMars, K.J. Nonlinear Orbit Uncertainty Prediction and Rectification for Space Situational Awareness. Ph.D. Thesis, The University of Texas at Austin, Austin, TX, USA, 2010.
9. DeMars, K.J.; Bishop, R.H.; Jah, M.K. Entropy-based approach for uncertainty propagation of nonlinear dynamical systems. J. Guid. Control Dyn. 2013, 36, 1047–1057.
10. Kim, H.; Liu, B.; Goh, C.Y.; Lee, S.; Myung, H. Robust vehicle localization using entropy-weighted particle filter-based data fusion of vertical and road intensity information for a large scale urban area. IEEE Robot. Autom. Lett. 2017, 2, 1518–1524.
11. Zhang, J.; Du, L.; Ren, M.; Hou, G. Minimum error entropy filter for fault detection of networked control systems. Entropy 2012, 14, 505–516.
12. Liu, Y.; Wang, H.; Hou, C. UKF based nonlinear filtering using minimum entropy criterion. IEEE Trans. Signal Process. 2013, 61, 4988–4999.
13. Julier, S.; Uhlmann, J.; Durrant-Whyte, H.F. A new method for the nonlinear transformation of means and covariances in filters and estimators. IEEE Trans. Autom. Control 2000, 45, 477–482.
14. Contreras-Reyes, J.E.; Cortés, D.D. Bounds on Rényi and Shannon entropies for finite mixtures of multivariate skew-normal distributions: Application to swordfish (Xiphias gladius Linnaeus). Entropy 2016, 18, 382.
15. Ren, M.; Zhang, J.; Fang, F.; Hou, G.; Xu, J. Improved minimum entropy filtering for continuous nonlinear non-Gaussian systems using a generalized density evolution equation. Entropy 2013, 15, 2510–2523.
16. Zhang, Q. Performance enhanced Kalman filter design for non-Gaussian stochastic systems with data-based minimum entropy optimisation. AIMS Electron. Electr. Eng. 2019, 3, 382.
17. Chen, B.; Dang, L.; Gu, Y.; Zheng, N.; Príncipe, J.C. Minimum error entropy Kalman filter. IEEE Trans. Syst. Man Cybern. Syst. 2019.
18. Gultekin, S.; Paisley, J. Nonlinear Kalman filtering with divergence minimization. IEEE Trans. Signal Process. 2017, 65, 6319–6331.
19. Darling, J.E.; DeMars, K.J. Minimization of the Kullback–Leibler divergence for nonlinear estimation. J. Guid. Control Dyn. 2017, 40, 1739–1748.
20. Morelande, M.R.; Garcia-Fernandez, A.F. Analysis of Kalman filter approximations for nonlinear measurements. IEEE Trans. Signal Process. 2013, 61, 5477–5484.
21. Raitoharju, M.; García-Fernández, Á.F.; Piché, R. Kullback–Leibler divergence approach to partitioned update Kalman filter. Signal Process. 2017, 130, 289–298.
22. Hu, E.; Deng, Z.; Xu, Q.; Yin, L.; Liu, W. Relative entropy-based Kalman filter for seamless indoor/outdoor multi-source fusion positioning with INS/TC-OFDM/GNSS. Clust. Comput. 2019, 22, 8351–8361.
23. Yu, W.; Peng, J.; Zhang, X.; Li, S.; Liu, W. An adaptive unscented particle filter algorithm through relative entropy for mobile robot self-localization. Math. Probl. Eng. 2013.
24. Arasaratnam, I.; Haykin, S. Cubature Kalman filters. IEEE Trans. Autom. Control 2009, 54, 1254–1269.
25. Kiani, M.; Barzegar, A.; Pourtakdoust, S.H. Entropy-based adaptive attitude estimation. Acta Astronaut. 2018, 144, 271–282.
26. Giffin, A.; Urniezius, R. The Kalman filter revisited using maximum relative entropy. Entropy 2014, 16, 1047–1069.
27. Chen, B.; Liu, X.; Zhao, H.; Principe, J.C. Maximum correntropy Kalman filter. Automatica 2017, 76, 70–77.
28. Chen, B.; Xing, L.; Liang, J.; Zheng, N.; Principe, J.C. Steady-state mean-square error analysis for adaptive filtering under the maximum correntropy criterion. IEEE Signal Process. Lett. 2014, 21, 880–884.
29. Gongmin, Y.; Jun, W. Lectures on Strapdown Inertial Navigation Algorithm and Integrated Navigation Principles; Northwestern Polytechnical University Press: Xi'an, China, 2019.
30. Kumari, L.; Padma Raju, K. Application of Extended Kalman filter for a free falling body towards Earth. IJACSA 2011, 2, 4.
Figure 1. Evolution of matrix Σ.
Figure 2. Simulation results for the entropy.
Figure 3. Simulation results for the change of entropy.
Figure 4. Position error of the loosely coupled integration.
Figure 5. Velocity error of the loosely coupled integration.
Figure 6. Attitude error of the loosely coupled integration.
Figure 7. Rényi entropy of the covariance Σ.
