1. Introduction
Under the minimum mean square error criterion, the Kalman filter (KF) is the optimal estimator for the linear Gaussian state-space model [1,2], and it has been widely employed in a variety of applications [3,4,5]. Unfortunately, in many practical applications, when a sensor suffers intermittent faults, the actual sensor measurements may not be accurately represented by the KF measurement model [6,7]. When a random measurement loss occurs, the sensor measurement contains only pure noise; in this situation, the estimation accuracy of a typical KF drops significantly or the filter may even diverge. Various filtering methods have been developed to address the measurement-loss filtering issue, such as the intermittent KF (IKF) [8,9]. However, the IKF relies on an important assumption: the measurement loss probability is known. In practical applications, the measurement loss probability is usually unknown, and the IKF is no longer applicable in this case [7].
In order to address the filtering issue of an unknown measurement loss probability in a linear system, the first Bayesian Kalman filter and the second Bayesian Kalman filter were designed by estimating the process and posterior distributions of the measurement loss, respectively [10]. These two filters, however, are no longer valid if the unknown measurement loss probability is time-varying. Recently, the variational Bayesian-based adaptive KF (VBAKF) was derived for a linear system with an unknown time-varying measurement loss probability (UTVMLP), in which both the system state vector and the UTVMLP are jointly estimated by introducing the variational Bayesian technique [7]. The VBAKF shows excellent performance in the context of white Gaussian measurement noise with known statistical characteristics. Unfortunately, in realistic engineering applications, measurement outliers may occur at various periods due to environmental changes and unreliable sensors, resulting in non-stationary heavy-tailed measurement noise (NSHTMN); that is, when the system runs healthily, the measurement noise is Gaussian-distributed, and when time-varying measurement outliers corrupt the system, the measurement noise is heavy-tail-distributed [11,12]. In the scenario of NSHTMN, the estimation accuracy of the VBAKF drops sharply.
Recently, some mixture distribution-based algorithms have been presented to address NSHTMN, such as the Gaussian-Student’s t-mixture distribution-based KF (GSTKF) [13,14]. However, the filtering problem with UTVMLP and NSHTMN cannot be directly solved by employing a mixture distribution; that is, under the scenario of UTVMLP and NSHTMN, the current likelihood function is a weighted sum of double-mixture distributions, which is an unclosed and unconjugated distribution and makes Bayesian inference difficult to apply directly.
In this paper, a new variational Bayesian-based KF is presented to address the filtering issue for linear discrete-time systems with UTVMLP and NSHTMN. Firstly, a Gaussian-Student’s t-mixture distribution with a mixing Bernoulli variable (BM) is employed to model the NSHTMN. Secondly, the likelihood function is converted into an exponential-product form, and a new hierarchical Gaussian state-space model is constructed by utilizing the measurement-loss Bernoulli variable (BL). Thirdly, the variational Bayesian method is introduced to simultaneously estimate the system state vector, the BM, the BL, the intermediate random variable, the mixing probability, and the UTVMLP. Finally, a numerical simulation experiment reveals that the proposed filter has better estimation accuracy, but is more time-consuming, than the existing filtering algorithms in the scenarios of NSHTMN and UTVMLP.
The contributions of this paper are as follows:
- (a) By employing a Bernoulli-distributed variable, the NSHTMN is modelled as a Gaussian-Student’s t-mixture distribution;
- (b) The measurement likelihood function is converted from the weighted sum of two mixture distributions into an exponential product, and a new hierarchical Gaussian state-space model is thereby derived;
- (c) The system state vector, the UTVMLP, and the remaining unknown variables are simultaneously estimated by utilizing the variational Bayesian technique;
- (d) Numerical simulation results indicate that the proposed filter performs better than the existing algorithms in the scenarios of NSHTMN and UTVMLP.
2. Problem Formulation
Consider the linear stochastic system with the following state and measurement equations:

$x_k = F_{k-1} x_{k-1} + w_{k-1}$, (1)

$z_k = \gamma_k H_k x_k + v_k$, (2)

where $x_k$ denotes the system state vector; $F_{k-1}$ denotes the state transition matrix; $w_{k-1}$ represents the Gaussian-distributed white process noise vector with a zero mean value and covariance matrix $Q_{k-1}$; $z_k$ represents the measurement vector; $H_k$ is the measurement matrix; $v_k$ is the white NSHTMN vector; and $k$ represents the index of discrete time. The phenomenon of measurement loss is described by introducing the identically distributed and mutually uncorrelated measurement loss, defined as the Bernoulli random variable (BL) $\gamma_k$, which is expressed by the following equations:

$\Pr\{\gamma_k = 1\} = 1 - p_k$, (3)

$\Pr\{\gamma_k = 0\} = p_k$, (4)

where $p_k$ denotes the time-varying measurement loss probability. Note that the value of $p_k$ is unknown in this paper. The initial Gaussian-distributed system state vector $x_0$ is a random vector with mean $\hat{x}_{0|0}$ and covariance matrix $P_{0|0}$. Additionally, it is assumed that the initial system state vector $x_0$, the noise vectors $w_k$ and $v_k$, and the Bernoulli random variable $\gamma_k$ are mutually independent.
It can be seen from Equations (1)–(4) that the ideal measurement is received by the sensor when $\gamma_k = 1$ and that a measurement loss with UTVMLP occurs when $\gamma_k = 0$. Meanwhile, the measurement noise is NSHTMN due to measurement outliers; that is, when the system runs healthily, the measurement noise is Gaussian-distributed, and when measurement outliers corrupt the system, the measurement noise is heavy-tail-distributed. The NSHTMN and UTVMLP can result in large estimation errors or even filtering divergence. Therefore, a new variational Bayesian-based Kalman filter for systems with NSHTMN and UTVMLP will be proposed.
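For concreteness, the following minimal Python sketch simulates such a system. The constant-velocity model, the loss probability p_loss, the Gaussian/outlier mixing probability p_gauss, and all numerical values are illustrative assumptions, not parameters taken from this paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative two-state constant-velocity model (assumed, not from the paper).
F = np.array([[1.0, 1.0], [0.0, 1.0]])   # state transition matrix
H = np.array([[1.0, 0.0]])               # measurement matrix
Q = 0.01 * np.eye(2)                     # process noise covariance
R = np.eye(1)                            # nominal measurement noise covariance
nu = 5.0                                 # Student's t dof for the outlier regime

def simulate(N=100, p_loss=0.2, p_gauss=0.9):
    """Simulate Equations (1)-(4): Bernoulli measurement loss plus NSHTMN."""
    x = np.array([0.0, 1.0])
    xs, zs, gammas = [], [], []
    for _ in range(N):
        x = F @ x + rng.multivariate_normal(np.zeros(2), Q)    # Eq. (1)
        gamma = float(rng.random() > p_loss)                   # BL: 1 = measurement received
        if rng.random() < p_gauss:                             # nominal Gaussian noise
            v = rng.multivariate_normal(np.zeros(1), R)
        else:                                                  # heavy-tailed Student's t noise,
            lam = rng.gamma(nu / 2.0, 2.0 / nu)                # drawn via its Gamma hierarchy
            v = rng.multivariate_normal(np.zeros(1), R / lam)
        z = gamma * (H @ x) + v                                # Eq. (2): pure noise when lost
        xs.append(x.copy()); zs.append(z); gammas.append(gamma)
    return np.array(xs), np.array(zs), np.array(gammas)
```

Running simulate() yields a trajectory in which roughly 20% of the measurements contain only noise and roughly 10% are outlier-corrupted, which is the regime the proposed filter targets.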
3. Proposed Variational Bayesian-Based Kalman Filter
In this section, a new variational Bayesian-based Kalman filter is proposed to address the filtering issue for a linear system with NSHTMN and UTVMLP. Firstly, the Gaussian-Student’s t-mixture distribution is utilized to model the NSHTMN and the hierarchical form is derived. Secondly, by converting the measurement likelihood function into an exponential multiplication, a new hierarchical Gaussian state-space model is established. Thirdly, by using the variational Bayesian method, the system state and unknown variables are simultaneously estimated. Finally, the required mathematical expectations are given.
3.1. Gaussian-Student’s t-Mixture Distribution
The NSHTMN vector can be modeled as a Gaussian-Student’s t-mixture distribution by employing a second Bernoulli random variable used for mixing (BM), and the corresponding probability density function (PDF) is given as
where
represents the Gaussian PDF with a zero mean vector and covariance matrix
, and
represents the Student’s
t-PDF with a zero mean vector, covariance matrix
, and degree of freedom (dof) parameter
.
represents the covariance matrix of the nominal measurement noise. The PDF of the mixing probability
and the probability mass function (PMF) of
are defined as follows, respectively.
where
represents the Beta PDF with shape parameters
and
.
Due to the hierarchical properties of the Student’s t-distribution, Equation (5) can be rewritten as follows:
where
represents the Gamma PDF with shape parameter
and rate parameter
, and
represents the intermediate random variable.
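For reference, the standard hierarchical (conditionally Gaussian) representation of the Student’s t component that underlies this rewriting is sketched below, where $\lambda_k$ is used as an assumed symbol for the intermediate random variable, $\nu$ for the dof parameter, and $R_k$ for the nominal measurement noise covariance matrix:

$\mathrm{St}(v_k; 0, R_k, \nu) = \int_0^{\infty} \mathrm{N}(v_k; 0, R_k/\lambda_k)\, \mathrm{G}(\lambda_k; \nu/2, \nu/2)\, \mathrm{d}\lambda_k ,$

so that, conditioned on $\lambda_k$, the heavy-tailed component behaves as a Gaussian with the inflated covariance $R_k/\lambda_k$.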
3.2. New Hierarchical Gaussian State-Space Model (HGSSM)
According to Equations (2)–(4), the measurement likelihood PDF is derived as follows. Based on Equation (2), the following equation can be obtained.
where
represents the measurement noise PDF. Substituting Equations (11) and (12) into Equation (10) results in
Remark 1. The measurement likelihood PDF in Equation (13) is a weighted sum, which is an unclosed and unconjugated form, so the system state vector and the unknown parameters cannot be inferred directly by utilizing the variational Bayesian method. The weighted sum is therefore converted into an exponential-multiplication form to address this problem.
The PMF of BL
is given as
Exploiting Equations (13) and (14), the measurement likelihood PDF is reformulated as
According to Equation (15), the exponential multiplication form of the likelihood PDF
is given as follows.
Remark 2. The variational Bayesian method requires suitable conjugate prior distributions for the unknown variables. Therefore, appropriate prior PDFs are selected to construct a new HGSSM.
The one-step predicted PDF
of system state vector
is assumed to be Gaussian distributed as follows.
where
represents the mean vector and
represents the covariance matrix. Both
and
can be updated by the typical Kalman filter, which is given as
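Under the notation introduced in Section 2, a sketch of the standard Kalman one-step prediction referred to here is

$\hat{x}_{k|k-1} = F_{k-1}\hat{x}_{k-1|k-1}, \qquad P_{k|k-1} = F_{k-1}P_{k-1|k-1}F_{k-1}^{T} + Q_{k-1},$

where $\hat{x}_{k-1|k-1}$ and $P_{k-1|k-1}$ denote the posterior mean and covariance of the previous time step.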
By employing Equations (8), (9) and (16), the conditional likelihood PDF
is derived as
It can be seen from Equations (6)–(9), (13) and (20) that the measurement vector
depends on system state vector
, intermediate random variable
, BM
, BL
, mixing probability
, and measurement loss probability
. The following joint-prior PDF must be calculated, i.e.,
where the definitions of
,
,
,
, and
are given in Equations (6), (7), (9), (14) and (17), respectively. Additionally,
denotes the prior PDF of the time-varying measurement loss probability, which is assumed to follow the Beta distribution below.
where the shape parameters
and
can be calculated by introducing the forgetting factor
as follows.
where
and
represent posterior shape parameters.
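In the related variational Bayesian literature on measurement loss, this forgetting-factor propagation of the Beta shape parameters typically takes the form sketched below, where $\rho \in (0,1]$ denotes the forgetting factor and $\alpha_{k-1|k-1}$, $\beta_{k-1|k-1}$ denote the posterior shape parameters of the previous time step (the notation is assumed here):

$\alpha_{k|k-1} = \rho\,\alpha_{k-1|k-1}, \qquad \beta_{k|k-1} = \rho\,\beta_{k-1|k-1}.$

A value of $\rho$ close to 1 spreads the previous posterior only slightly, which allows the estimated loss probability to track slow variations over time.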
The proposed new HGSSM comprises Equations (14) and (17)–(24). The system state vector, intermediate random variable, BM, BL, mixing probability, and measurement loss probability will be simultaneously estimated by utilizing the variational Bayesian method.
3.3. Variational Bayesian Approximation of the Joint Posterior PDFs
To estimate the unknown variables of the new HGSSM, the joint posterior PDF
with
is required to be solved. However, the analytical solution of
is not accessible. The variational Bayesian approach is therefore employed to determine an approximate PDF for
and to compute an approximate solution [15,16,17], i.e.,
where
represents an arbitrary element of
and
denotes the approximate PDF or PMF. By minimizing the Kullback–Leibler divergence (KLD) between
and
,
can be obtained as follows.
where
represents the KLD between
and
, and the optimal solution of Equation (26) can be calculated via the following formula [15,17].
where
denotes the mathematical expectation operation,
signifies a grouping of all the components in
apart from
, and the constant with regard to
is denoted by
. Additionally, the fixed-point iteration technique is utilized to derive the approximate forms of the factored PDFs, because the estimated parameters are mutually coupled.
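For reference, the standard mean-field formula on which this kind of update is based can be sketched as

$\log q^{(i+1)}(\theta) = \mathrm{E}^{(i)}_{\Theta^{(-\theta)}}\left[\log p(\Theta, z_{1:k})\right] + c_{\theta},$

where $\theta$ is an arbitrary element of the set of unknown variables $\Theta$, the expectation is taken over all other elements using their current approximations, and $c_{\theta}$ is a constant with respect to $\theta$; the symbols here are assumed for illustration.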
The joint PDF
in Equation (26) can be derived as
Proposition 1. Let
and by using Equation (29) in Equation (28),
can be updated as Gaussian, i.e.,
where
represents the approximate PDF in the
iteration, while the mean vector
and the covariance matrix
are assumed to be updated in accordance with the traditional Kalman filter as follows.
where
represents the Kalman gain matrix. The modified measurement noise covariance matrix at
iteration
is formulated as
where
represents the mathematical expectation of variables in the
iteration.
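A minimal Python sketch of the Gaussian state update in Proposition 1 is given below, assuming that it takes the standard Kalman correction form with a modified measurement noise covariance (called R_tilde here); how R_tilde is built from the expectations of the other factors is omitted, so this is an illustration rather than the paper's exact expressions.

```python
import numpy as np

def gaussian_state_update(x_pred, P_pred, z, H, R_tilde):
    """One fixed-point state update: a standard KF correction in which the
    nominal noise covariance is replaced by the modified covariance R_tilde."""
    S = H @ P_pred @ H.T + R_tilde           # innovation covariance
    K = P_pred @ H.T @ np.linalg.inv(S)      # Kalman gain matrix
    x_upd = x_pred + K @ (z - H @ x_pred)    # updated mean vector
    P_upd = P_pred - K @ H @ P_pred          # updated covariance matrix
    return x_upd, P_upd, K
```

In the full filter, this function would be called once per fixed-point iteration, with R_tilde recomputed from the latest expectations of the intermediate variable, the BM, and the BL.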
Proposition 2. Let
and by using Equation (29) in Equation (28),
can be updated as Gamma, i.e.,
where the shape parameter
and rate parameter
are formulated as
where
represents the dimension of the measurement vector,
represents the trace operation, and
is defined as
Proposition 3. Let
and by using Equation (29) in Equation (28),
can be updated as the Bernoulli distribution. The posterior probabilities
and
of BL
are given as
where
represents the normalizing constant and the parameters
and
are, respectively, defined as
Proposition 4. Let
and by using Equation (29) in Equation (28),
can be also updated as the Bernoulli distribution. The posterior probabilities
and
of BM
are given as
where
also represents the normalizing constant and the definitions of parameters
and
are, respectively, given as
Proposition 5. Let
and by using Equation (29) in Equation (28),
can be updated as the Beta distribution, i.e.,
where the shape parameters
and
are formulated as
Proposition 6. Let
and by using Equation (29) in Equation (28),
can be also updated as the Beta distribution, i.e.,
where the definitions of shape parameters
and
are given as
3.4. Calculation of the Required Mathematical Expectations
The required mathematical expectations
,
,
,
,
,
,
,
,
and
in Section 3.3 are defined, respectively, as follows:
where
represents the digamma function [18].
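As an illustration of the kind of moments involved, the standard expectations of Beta and Gamma factors used in such variational updates are, in assumed notation with $\psi(\cdot)$ the digamma function,

$\mathrm{E}[\log p_k] = \psi(\alpha_{k|k}) - \psi(\alpha_{k|k} + \beta_{k|k}), \qquad \mathrm{E}[\log(1-p_k)] = \psi(\beta_{k|k}) - \psi(\alpha_{k|k} + \beta_{k|k}),$

and $\mathrm{E}[\lambda_k] = a_k / b_k$ for a Gamma factor with shape parameter $a_k$ and rate parameter $b_k$.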
The presented variational Bayesian-based Kalman filter with UTVMLP and NSHTMN consists of Equations (18), (19) and (30)–(62).
Table 1 describes the implementation of the proposed new KF.
4. Simulations
To demonstrate the superiority of the presented filter in the scenario with UTVMLP and NSHTMN, a numerical example is simulated. The process and measurement equations of the stochastic system are, respectively, given as [7]
where the Gaussian process noise
and the NSHTMN
are given as [12]
where
represents “with probability”. The true process noise covariance matrix
with parameter
and the nominal measurement noise covariance matrix
with parameter
are set as
The real UTVMLP is set as
From Equations (66)–(69), it can be seen that the measurement noise and UTVMLP are divided into four stages, as shown in Table 2. The remaining system parameters are as follows: the sampling interval
and the total simulation time
. The proposed filter is compared with the typical Kalman filter (KF) [2]; the existing variational Bayesian-based adaptive KF with UTVMLP (VBAKF) [7]; the existing Gaussian-Student’s t-mixture distribution-based KF (GSTKF) with Gaussian process noise [14]; and the existing IKF with the known real measurement loss probability [8]. The parameters of the VBAKF are selected as
,
,
,
, and
. The parameters of GSTKF are selected as
and
. The parameters of the proposed filter are given as
,
,
,
,
, and
,
. All filters are programmed with MATLAB R2018a and run on a computer with Intel Core i5-6300HQ CPU at 2.30 GHz and 8 GB of RAM.
To evaluate the system state estimation performance of all the algorithms, the root mean square error (RMSE) and the averaged root mean square error (AGRMSE) are utilized as performance indicators. The definitions of the RMSE and AGRMSE of the system state are given as
where $x_k^i$ and $\hat{x}_k^i$ denote the actual and estimated system states at the $i$th Monte Carlo run and discrete time $k$, respectively, and $M$ represents the total number of Monte Carlo runs.
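A minimal Python sketch of these two indicators is given below, under the assumption that the error is accumulated over the full state vector (papers often report the RMSE per component instead, e.g., for the position only).

```python
import numpy as np

def rmse_curves(x_true, x_est):
    """Compute the RMSE over Monte Carlo runs and its time average (AGRMSE).

    x_true, x_est: arrays of shape (M, N, n) = (runs, time steps, state dim).
    """
    err2 = np.sum((x_true - x_est) ** 2, axis=2)   # squared error per run and step
    rmse = np.sqrt(np.mean(err2, axis=0))          # RMSE at each time step
    agrmse = float(np.mean(rmse))                  # time-averaged RMSE
    return rmse, agrmse
```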
Unlike the proposed algorithm and the VBAKF, the KF, IKF, and GSTKF do not estimate the UTVMLP. Although the IKF can also address the filtering problem with measurement loss, it is based on the assumption that the measurement loss probability is known. Therefore, only the VBAKF and the proposed algorithm participate in the comparison of the UTVMLP estimation performance.
Figure 1 shows the RMSEs of the proposed filter and the existing filters over 500 Monte Carlo runs. Additionally, the AGRMSEs and single-step run times (SSRTs) of the different filters are listed in Table 3. It can be seen from Figure 1 and Table 3 that, in the contexts of UTVMLP and NSHTMN, when the measurement noise is Gaussian and the loss probability is slight, as in stages 1 and 4, the estimation accuracy of the proposed filter is close to that of the IKF with the true loss probability, and the proposed algorithm performs better than the other algorithms. We can also find that the proposed algorithm still performs better than the existing algorithms when the measurement noise is heavy-tailed and the measurement loss probability is larger, as in stages 2 and 4. In addition, the proposed algorithm has a longer SSRT and higher computational complexity than the existing filters, as can be observed from Table 3.
Figure 2 shows the curves of the true and estimated UTVMLPs of the VBAKF and the proposed filter over 500 Monte Carlo runs. Obviously, the NSHTMN has a great influence on the filtering performance of the VBAKF, and the proposed filter has better UTVMLP estimation accuracy than the VBAKF in the scenario of NSHTMN.
Figure 3 and Figure 4 show the RMSEs and the estimated UTVMLPs of the proposed filter with different dof parameters over the 500 Monte Carlo runs, respectively. The corresponding SSRTs of the proposed filter with these dof parameters are 0.2991, 0.2983, 0.2993, and 0.2989. It can be seen that the proposed filter with different dof parameters performs better than the current algorithms in the system state and UTVMLP estimations. Moreover, the dof parameter has little influence on the estimation accuracy and time complexity of the proposed algorithm, and its recommended value is therefore set as 5.
Figure 5 and Figure 6 show the RMSEs and the estimated UTVMLPs of the proposed filter with different forgetting factors over the 500 Monte Carlo runs, respectively. The corresponding SSRT of the proposed filter with each forgetting factor is approximately equal to 0.2990. We can find that the proposed filter with a forgetting factor of 0.99 has the best performance in the system state and UTVMLP estimations, and the value of the forgetting factor has little effect on the computational complexity. Therefore, the recommended value of the forgetting factor is 0.99.
Figure 7 shows the RMSEs of the proposed filter and the current algorithms with different iteration numbers. We can see from Figure 7 that the proposed filter achieves a smaller RMSE than the existing filters once a sufficient number of iterations is used, and that the proposed filter converges faster than the existing filters. However, Table 4 shows that the setting of the iteration number has a considerable impact on the time consumption of the proposed filter, and the SSRT increases as the iteration number increases. Therefore, considering both time consumption and estimation accuracy, the recommended value of the iteration number ranges from 4 to 10.