Article

Distributed Consensus Multi-Distribution Filter for Heavy-Tailed Noise

by Guan-Nan Chang 1,2, Wen-Xing Fu 1, Tao Cui 3, Ling-Yun Song 3 and Peng Dong 3,*

1 Unmanned System Research Institute, Northwestern Polytechnical University, Xi’an 710072, China
2 Xi’an Modern Control Technology Research Institute, Xi’an 710065, China
3 School of Aeronautics and Astronautics, Shanghai Jiao Tong University, Shanghai 200240, China
* Author to whom correspondence should be addressed.
J. Sens. Actuator Netw. 2024, 13(4), 38; https://doi.org/10.3390/jsan13040038
Submission received: 8 April 2024 / Revised: 26 June 2024 / Accepted: 26 June 2024 / Published: 28 June 2024

Abstract

Distributed state estimation is one of the critical technologies in the field of target tracking, where the process noise and measurement noise may have a heavy-tailed distribution. Traditionally, heavy-tailed distributions such as the student-t distribution are employed, but our observation reveals that Gaussian noise predominates in many instances, with occasional outliers. This sporadic reliance on heavy-tailed distributions can degrade performance or necessitate frequent parameter adjustments. To overcome this, we introduce a novel distributed consensus multi-distribution state estimation method that combines Gaussian and student-t filters. Our approach establishes a system model using both Gaussian and student-t distributions. We derive a multi-distribution filter for a single sensor, assigning probabilities to the Gaussian and student-t noise models. Parallel estimation under both distributions, using Gaussian and student-t filters, allows us to calculate the likelihood of each distribution. The fusion of these results yields a mixed state estimate and the corresponding error matrix. Recognizing that the degrees of freedom of the student-t distribution increase over time, we provide an effective approximation. An information consensus strategy for multi-distribution filters is introduced, achieving global estimation through consensus on the fused local filter results via interaction with neighboring nodes. This methodology is extended to the distributed case, and the recursive process of the distributed multi-distribution consensus state estimation method is presented. Simulation results demonstrate that the estimation accuracy of the proposed algorithm improved by at least 20% compared to that of traditional algorithms in scenarios involving both Gaussian and heavy-tailed distributions.

1. Introduction

1.1. Background

Sensor networks have garnered significant attention in recent years in distributed applications such as environmental monitoring, power grid systems and traffic management [1,2,3]. Distributed target tracking, or state estimation, is one of the classical applications; its core is to obtain a state estimate for the entire system using only local measurements and information exchanged between neighboring nodes. However, interference may lead to heavy-tailed measurement noise, as shown in Figure 1, which can degrade the estimation performance. Thus, how to improve the estimation performance in distributed scenarios with heavy-tailed noise is a challenging problem.

1.2. Related Work

There are three main types of distributed state estimation methods: consensus methods, gossip methods and diffusion methods. In gossip-based methods [4,5], each sensor randomly selects one or several connected neighbors to send information to. Although they have a low communication cost, gossip-based methods show slow convergence. The diffusion methods use a one-step convex combination of the received data [6,7], so they cannot converge to a consensus result. In contrast, the consensus method can ensure global convergence in many cases [8,9] and is thus the most widely used. Combining consensus with Kalman filtering is the most direct approach, and there are four main variants: the consensus on estimation (CE) method, the consensus on measurement (CM) method, the consensus on information (CI) method and the hybrid CM-CI (HCMCI) method [10,11]. Since the CE method does not consider the error covariance matrix, its estimation results are conservative [12]. The CM method was proposed to solve this problem; however, it achieves a bounded estimation error only when the number of consensus steps is large enough [8]. The CI method reaches consensus on the information vector and the information matrix, which ensures a bounded estimation error for any number of consensus steps, although the new information is inevitably under-weighted [13,14,15]. The HCMCI method handles consensus on both the prior and the likelihood, combining the advantages of the CM and CI methods. In addition, there are H∞ consensus algorithms [16] and consensus algorithms based on the Luenberger observer [17].
However, the above methods assume that the system noise obeys a Gaussian distribution, an assumption that is easily violated in practice. Due to unmodeled anomalies, sudden environmental disturbances, unreliable sensors, target maneuvers, system failures or attacks on the system, the system will suffer from outliers [18]. In such scenarios, relying solely on the Gaussian noise assumption for state estimation leads to severe performance degradation and potential divergence [19]. Outliers typically produce heavy-tailed distributions, prompting the use of heavy-tailed distributions to accurately characterize system states and measurements [20]. The student-t distribution serves as an effective model for heavy-tailed distributions, especially in handling non-Gaussian outliers, rendering it a widely adopted solution in this context [21]. In [22], the sensor measurement noise was modeled as a multivariate student-t distribution using the CI consensus strategy, where the state and noise parameters are estimated simultaneously; combined with the variational Bayesian (VB) method, the joint posterior distribution is processed.
However, the above method can only deal with the situation where the measurement noise follows a heavy-tailed distribution. To handle the case where both process noise and measurement noise follow heavy-tailed distributions, ref. [23] modeled both as student-t distributions and proposed a distributed state estimation method based on a student-t filter and the CI consensus strategy. In practical settings, however, noise predominantly adheres to a Gaussian distribution, with outliers occurring only sporadically. Given this, an intuitive approach is to model the system with a mixture of Gaussian and student-t distributions that captures the dynamic nature of the noise. This method capitalizes on the strengths of both distributions, aiming to attain precise and consistent state estimates. Grounded in this framework, a resilient filtering algorithm using a blend of Gaussian and student-t distributions was introduced for single-sensor state estimation [24]. Nevertheless, the iterative calculation of its parameters poses a challenge, particularly in distributed systems characterized by lightweight architectures and limited communication bandwidth.
Many studies have thus focused on robustness. However, in practical scenarios, heavy-tailed noise is a low-probability event, and the model noise still follows a Gaussian distribution in most cases. Although the robust filters mentioned above perform well in the presence of outliers, they lose estimation accuracy when the noise is normal. Therefore, how to balance robustness and estimation accuracy while obtaining consensus results is a significant problem.
Motivated by these, this paper proposes a distributed consensus state estimation method based on the Gaussian distribution and the student-t distribution. A comparison of related studies is shown in Table 1.
The main contribution of this paper can be highlighted as follows:
(1)
The derivation of a method for local sensors that models process and measurement noises in both Gaussian and student-t distributions. Each distribution is assigned a probability, enabling parallel state estimation using Gaussian and student-t filters. The fusion of the results from these distributions provides a mixed-state estimation and corresponding error matrix.
(2)
Addressing the challenge of the increasing degrees of freedom in the student-t distribution over time by providing an effective approximate solution, ensuring stability and accuracy in the estimation process.
(3)
The introduction of an information consensus strategy for multi-distribution filters, enabling global estimation by achieving consensus on fused local filter results through interaction with neighboring nodes. The results are extended to the distributed case, and the recursive process presented further validates its efficacy, as supported by simulation results showcasing its performance in scenarios involving both Gaussian and heavy-tailed distributions.
The organization of this paper is shown in Figure 2. Section 2 presents the problem, Section 3 is the proposed distributed multiple-consensus-state estimation algorithm, the simulation results are presented in Section 4, and the conclusions are presented in Section 5.

2. Preliminaries and Problem Formulation

We consider a sensor network with node set $\mathcal{N}$ and edge set $\mathcal{A}$, so that the network can be represented as the two-tuple $(\mathcal{N}, \mathcal{A})$. The set of neighbor nodes of node $i \in \mathcal{N}$ (including node $i$ itself) is $\mathcal{N}_i$. The process equation of the system is assumed to be discrete-time linear:

$$x_k = F_{k-1} x_{k-1} + w_{k-1}$$

where $x_k$ is the $n_x$-dimensional state vector at time $k$, $F_k$ is the state transition matrix, and $w_k$ is the process noise at time $k$.

The measurement equation of each node $i \in \mathcal{N}$ is also linear:

$$z_k^i = H_k^i x_k + v_k^i, \quad i \in \mathcal{N}$$

where $z_k^i$ is the $n_z^i$-dimensional measurement vector of node $i$ at time $k$, $H_k^i$ is the measurement matrix of node $i$, and $v_k^i$ is the measurement noise of node $i$ at time $k$.
In general, the noises can be supposed to follow a Gaussian distribution. However, some unknown perturbations may lead to outliers characterized by heavy-tailed noise. In order to deal with the problem, many studies have used distributions that are insensitive to outliers to model the noise.
The student-t distribution is one of them. A random variable $x$ obeys the student-t distribution if its probability density satisfies the following [20]:

$$\mathrm{St}(x; m, P, \eta) = M \left( 1 + \frac{1}{\eta} (x - m)^T P^{-1} (x - m) \right)^{-\frac{\eta + n_x}{2}}$$

where

$$M = \frac{\Gamma\!\left(\frac{\eta + n_x}{2}\right)}{\Gamma\!\left(\frac{\eta}{2}\right)} \frac{1}{(\eta \pi)^{n_x/2}} \frac{1}{\sqrt{\det(P)}}$$

$m$ is the mean, $\eta$ is the degrees of freedom (dof), $P$ is a positive definite symmetric matrix, $\Gamma$ denotes the Gamma function, and $n_x$ is the dimension of $x$. The variables of the system model are summarized in Table 2.
It should be noted that the variance of the student-t distribution is $\frac{\eta}{\eta - 2} P$ for dof $\eta$ greater than 2; otherwise, the variance is undefined. When $\eta = 1$, the student-t distribution reduces to the Cauchy distribution, and as $\eta$ tends to infinity, it approaches the Gaussian distribution. Compared with the Gaussian distribution, the student-t distribution shows heavy-tailed characteristics: when moving away from the mean, the density does not decrease as quickly as the Gaussian density. Figure 3 presents logarithmic plots of the Gaussian and student-t distributions. It can be seen that the log-density of the Gaussian distribution quickly drops below −16, while that of the student-t distribution decays much more slowly.
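To make the tail comparison concrete, the following minimal Python sketch (the dof value and the evaluation grid are assumptions for illustration, not values from the paper) evaluates both log-densities at increasing distances from the mean:

```python
# Compare the log-densities of a 1-D standard Gaussian and a student-t
# distribution (assumed dof = 3) away from the mean.
import numpy as np
from scipy.stats import norm, t

x = np.linspace(0.0, 10.0, 6)                        # distances from the mean
log_gauss = norm.logpdf(x, loc=0.0, scale=1.0)
log_student = t.logpdf(x, df=3.0, loc=0.0, scale=1.0)

for xi, lg, ls in zip(x, log_gauss, log_student):
    print(f"x = {xi:4.1f}   log N = {lg:8.2f}   log St = {ls:8.2f}")
# The Gaussian log-density decays quadratically (it passes -16 near x = 5.5),
# while the student-t log-density decays far more slowly in the tail.
```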
Due to its heavy-tailed property, the student-t distribution is effective in dealing with heavy-tailed noise. However, it may not describe normal noise well. In practical scenarios, Gaussian noise fits most situations, while heavy-tailed noise is a low-probability event. Thus, modeling with the student-t distribution alone can improve the robustness of the algorithm, but it loses estimation accuracy when the noise is normal.
The problem is therefore how to balance robustness and estimation accuracy in the consensus estimation results of a sensor network in the presence of possible heavy-tailed noise.

3. Main Results

First, the proposed multi-distribution filter, based on Gaussian and student-t distributions, is introduced for a single local sensor operating without information interaction, together with an approximate method that mitigates its inherent limitation of ever-growing degrees of freedom. Subsequently, a CI-based consensus strategy is proposed for scenarios involving mixed Gaussian and student-t distributions. Finally, building upon this consensus strategy and the single-sensor multi-distribution filter, an algorithm for distributed multi-distribution filtering is proposed.

3.1. The Multi-Distribution Filter Based on Gaussian Distribution and Student-t Distribution

In this subsection, we present a multi-distribution filter for a single sensor exposed to both a heavy-tailed process and measurement noises. The exclusive reliance on a student-t distribution filter often results in prolonged performance degradation or necessitates frequent parameter readjustments during normal system operations. Conversely, relying solely on Gaussian filters tends to cause divergence in scenarios where system outliers manifest. To harness the strengths of both filters, we present the following two hypotheses for a single sensor node (omitting the superscript representing the sensor node in this subsection):
$H_0$: Suppose that the process and measurement noises obey the Gaussian distribution as follows:

$$p(w_k) = \mathcal{N}(w_k; 0, Q_k^0)$$

$$p(v_k) = \mathcal{N}(v_k; 0, R_k^0)$$

where $\mathcal{N}(x; m, P)$ denotes that $x$ obeys the Gaussian distribution with mean $m$ and covariance $P$.

$H_1$: Suppose that the process and measurement noises obey the student-t distribution as follows:

$$p(w_k) = \mathrm{St}(w_k; 0, Q_k^1, \eta_k)$$

$$p(v_k) = \mathrm{St}(v_k; 0, R_k^1, \eta_k)$$

The probability of $H_0$ is $\mu_k^0$, the probability of $H_1$ is $\mu_k^1$, $\sum_{r \in H} \mu_k^r = 1$, and $H = \{0, 1\}$.
Now, we need to assign a filter to each distribution. For the Gaussian-distribution hypothesis, since the system model is linear, the standard Kalman filter can be used. The steps are as follows: given the initial values $\hat{x}_0^0$ and $P_0^0$, for each time step $k \geq 1$, the following recursive process is performed:
(1) Time update
$$\hat{x}_{k|k-1}^0 = F_k \hat{x}_{k-1}^0$$

$$P_{k|k-1}^0 = F_k P_{k-1}^0 F_k^T + Q_{k-1}^0$$
(2) Measurement update
$$\tilde{z}_k^0 = z_k - H_k \hat{x}_{k|k-1}^0$$

$$S_k^0 = H_k P_{k|k-1}^0 H_k^T + R_k^0$$

$$K_k^0 = P_{k|k-1}^0 H_k^T (S_k^0)^{-1}$$

$$\hat{x}_k^0 = \hat{x}_{k|k-1}^0 + K_k^0 \tilde{z}_k^0$$

$$P_k^0 = P_{k|k-1}^0 - K_k^0 S_k^0 (K_k^0)^T$$
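As a concrete illustration, a minimal sketch of this recursion in Python might look as follows (variable and function names are our own, not the authors' code):

```python
# One recursion of the standard Kalman filter, Equations (9)-(15).
import numpy as np

def kalman_step(x_prev, P_prev, z, F, H, Q, R):
    # Time update
    x_pred = F @ x_prev
    P_pred = F @ P_prev @ F.T + Q
    # Measurement update
    z_res = z - H @ x_pred                       # innovation
    S = H @ P_pred @ H.T + R                     # innovation covariance
    K = P_pred @ H.T @ np.linalg.inv(S)          # Kalman gain
    x_est = x_pred + K @ z_res
    P_est = P_pred - K @ S @ K.T
    return x_est, P_est, z_res, S
```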
For the student-t distribution hypothesis, we use the student-t filter described as follows: given the initial values $\hat{x}_0^1$, $P_0^1$ and $\eta_0$, for each time step $k \geq 1$, the following recursive process is performed:
(1) Time update
$$\hat{x}_{k|k-1}^1 = F_k \hat{x}_{k-1}^1$$

$$P_{k|k-1}^1 = F_k P_{k-1}^1 F_k^T + Q_{k-1}^1$$
(2) Measurement update
$$\tilde{z}_k^1 = z_k - H_k \hat{x}_{k|k-1}^1$$

$$S_k^1 = H_k P_{k|k-1}^1 H_k^T + R_k^1$$

$$K_k^1 = P_{k|k-1}^1 H_k^T (S_k^1)^{-1}$$

$$\hat{x}_k^1 = \hat{x}_{k|k-1}^1 + K_k^1 \tilde{z}_k^1$$

$$P_k^1 = \frac{\eta_{k-1} + \Delta_{z,k}^1}{\eta_{k-1} + n_z} \left( P_{k|k-1}^1 - K_k^1 S_k^1 (K_k^1)^T \right)$$

$$\Delta_{z,k}^1 = (\tilde{z}_k^1)^T (S_k^1)^{-1} \tilde{z}_k^1$$

$$\eta_k = \eta_{k-1} + n_z$$
The detailed derivation of the student-t filter can be seen in [25].
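Under the same naming assumptions, the student-t recursion of Equations (16)-(24) can be sketched as follows; note that Q and R here are the scale matrices of the student-t noise models, not covariances:

```python
# One recursion of the student-t filter, Equations (16)-(24), after [25].
import numpy as np

def student_t_step(x_prev, P_prev, eta_prev, z, F, H, Q, R):
    n_z = z.shape[0]
    # Time update
    x_pred = F @ x_prev
    P_pred = F @ P_prev @ F.T + Q
    # Measurement update
    z_res = z - H @ x_pred
    S = H @ P_pred @ H.T + R
    S_inv = np.linalg.inv(S)
    K = P_pred @ H.T @ S_inv
    x_est = x_pred + K @ z_res
    delta2 = float(z_res @ S_inv @ z_res)        # squared Mahalanobis distance
    P_est = (eta_prev + delta2) / (eta_prev + n_z) * (P_pred - K @ S @ K.T)
    eta = eta_prev + n_z                         # dof grows with every update
    return x_est, P_est, eta, z_res, S
```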
Assume that the state posterior obeys a mixture of the Gaussian and student-t distributions, with probabilities $\mu_k^0$ and $\mu_k^1$, respectively. According to the total probability theorem,

$$p(x_k | Z_k) = \sum_{r \in H} \mu_k^r \, p(x_k | Z_k, H_r) = \mu_k^0 \, \mathcal{N}(x_k; \hat{x}_k^0, P_k^0) + \mu_k^1 \, \mathrm{St}(x_k; \hat{x}_k^1, P_k^1, \eta_k)$$
where $Z_k = \{z_1, z_2, \ldots, z_k\}$ represents the measurement set up to time $k$. In order to obtain the posterior probability density function (PDF), the required parameters are the probability $\mu_k^r$ of each distribution, the state estimate $\hat{x}_k^r$ and the matrix $P_k^r$; for the student-t distribution, the dof $\eta_k$ is also required. Apart from the distribution probabilities $\mu_k^r$, these are obtained by the two parallel filters. Given the distribution probability $\mu_{k-1}^r$ at the previous time, the probability that hypothesis $H_r$ is correct is given by
$$\mu_k^r \triangleq p(H_r | Z_k) = p(H_r | z_k, Z_{k-1}) = \frac{p(z_k | Z_{k-1}, H_r)\, p(H_r | Z_{k-1})}{p(z_k | Z_{k-1})} = \frac{p(z_k | Z_{k-1}, H_r)\, \mu_{k-1}^r}{\sum_{j \in H} p(z_k | Z_{k-1}, H_j)\, \mu_{k-1}^j}$$
where $p(z_k | Z_{k-1}, H_r)$ is the measurement likelihood of $H_r$ at time $k$, that is, $\Lambda_k^r \triangleq p(z_k | Z_{k-1}, H_r) = p(\tilde{z}_k^r)$. For the Gaussian filter,
$$\Lambda_k^0 = \frac{1}{\sqrt{\det(2\pi S_k^0)}} \exp\left( -\frac{1}{2} (\tilde{z}_k^0)^T (S_k^0)^{-1} \tilde{z}_k^0 \right)$$
For the student-t distribution filter,
$$\Lambda_k^1 = \frac{\Gamma\!\left(\frac{\eta_{k-1} + n_z}{2}\right)}{\Gamma\!\left(\frac{\eta_{k-1}}{2}\right)} \frac{1}{(\eta_{k-1} \pi)^{n_z/2}} \frac{1}{\sqrt{\det(S_k^1)}} \left( 1 + \frac{1}{\eta_{k-1}} (\tilde{z}_k^1)^T (S_k^1)^{-1} \tilde{z}_k^1 \right)^{-\frac{\eta_{k-1} + n_z}{2}}$$
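A hedged sketch of the two likelihood evaluations and the Bayes update of Equations (26)-(28) (function and variable names are assumptions):

```python
# Measurement likelihoods under the two hypotheses and the resulting
# update of the hypothesis probabilities.
import numpy as np
from scipy.stats import multivariate_normal
from scipy.special import gammaln

def gaussian_likelihood(z_res, S):
    # Equation (27): Gaussian innovation likelihood
    return multivariate_normal.pdf(z_res, mean=np.zeros(len(z_res)), cov=S)

def student_t_likelihood(z_res, S, eta):
    # Equation (28): student-t innovation likelihood, evaluated in log space
    n_z = len(z_res)
    maha = float(z_res @ np.linalg.solve(S, z_res))
    log_lik = (gammaln((eta + n_z) / 2) - gammaln(eta / 2)
               - 0.5 * n_z * np.log(eta * np.pi)
               - 0.5 * np.log(np.linalg.det(S))
               - 0.5 * (eta + n_z) * np.log1p(maha / eta))
    return np.exp(log_lik)

def update_probabilities(mu_prev, likelihoods):
    # Equation (26): Bayes update over the hypotheses r in {0, 1}
    unnorm = np.asarray(likelihoods) * np.asarray(mu_prev)
    return unnorm / unnorm.sum()
```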
The mean of the mixed posterior distribution is
$$\hat{x}_k = E[x_k | Z_k] = \int x_k \left[ \mu_k^0 \, \mathcal{N}(x_k; \hat{x}_k^0, P_k^0) + \mu_k^1 \, \mathrm{St}(x_k; \hat{x}_k^1, P_k^1, \eta_k) \right] dx_k = \mu_k^0 E[x_k | Z_k, H_0] + \mu_k^1 E[x_k | Z_k, H_1] = \sum_{r \in H} \mu_k^r \hat{x}_k^r$$
The covariance corresponding to the mixed posterior distribution is
$$P_k = E[(x_k - \hat{x}_k)(x_k - \hat{x}_k)^T | Z_k] = \sum_{r \in H} \mu_k^r E[(x_k - \hat{x}_k)(x_k - \hat{x}_k)^T | Z_k, H_r] = \sum_{r \in H} \mu_k^r E[(x_k - \hat{x}_k^r)(x_k - \hat{x}_k^r)^T | Z_k, H_r] + \sum_{r \in H} \mu_k^r (\hat{x}_k^r - \hat{x}_k)(\hat{x}_k^r - \hat{x}_k)^T$$
For the Gaussian distribution, the variance is
$$P_k^0 = E[(x_k - \hat{x}_k^0)(x_k - \hat{x}_k^0)^T | Z_k, H_0]$$
and for the student-t distribution, the variance is
$$\bar{P}_k^1 = E[(x_k - \hat{x}_k^1)(x_k - \hat{x}_k^1)^T | Z_k, H_1] = \frac{\eta_k}{\eta_k - 2} P_k^1$$
Therefore, the expression of the covariance of the mixed posterior distribution can be obtained by
$$P_k = \mu_k^0 P_k^0 + \mu_k^1 \bar{P}_k^1 + \sum_{r \in H} \mu_k^r (\hat{x}_k^r - \hat{x}_k)(\hat{x}_k^r - \hat{x}_k)^T$$
Thus, we obtain a complete recursive step for the multi-distribution filter.
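The moment fusion of Equations (29)-(33) can be sketched in a few lines (a sketch under the same naming assumptions, not the authors' code):

```python
# Fuse the Gaussian and student-t posteriors into one mean and covariance.
import numpy as np

def fuse_mixture(mu, x_g, P_g, x_t, P_t, eta):
    P_t_bar = eta / (eta - 2.0) * P_t            # student-t scale -> covariance
    x_mix = mu[0] * x_g + mu[1] * x_t            # Equation (29)
    spread = (mu[0] * np.outer(x_g - x_mix, x_g - x_mix)
              + mu[1] * np.outer(x_t - x_mix, x_t - x_mix))
    P_mix = mu[0] * P_g + mu[1] * P_t_bar + spread   # Equation (33)
    return x_mix, P_mix
```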
With each measurement update, the dof increases according to Equation (24). In turn, this also requires increasing the dof of the noise, making the model more and more Gaussian. In fact, the algorithm converges to the Kalman filter after several time steps. Therefore, an approximate method is necessary.
One of the simplest ways is to enforce $\eta_k = \eta$ in Equation (24), where $\eta$ is a constant, so that the dof does not keep increasing. However, the actual posterior density is then $p(x_k) = \mathrm{St}(x_k; \hat{x}_k^1, P_k^1, \eta + n_z)$ instead of $\mathrm{St}(x_k; \hat{x}_k^1, P_k^1, \eta)$ (the conditioning is omitted here). We therefore need to find a posterior density $q(x_k) = \mathrm{St}(x_k; \hat{x}_k^1, \tilde{P}_k^1, \eta)$ to approximate $p(x_k)$. The qualitative characteristics should be retained, so the adjusted matrix parameter should be a scaled version of the original matrix. The general form of this problem is to find a probability density $q(x) = \mathrm{St}(x; \hat{x}, cP, \tilde{\eta})$ that approximates $p(x) = \mathrm{St}(x; \hat{x}, P, \eta)$ as closely as possible, where the density is controlled by a scaling factor $c > 0$. A suggested method is moment matching, that is, making the variances of $p(x)$ and $q(x)$ equal; it is simple to apply and requires no parameter tuning. We obtain the following condition:
$$\frac{\eta}{\eta - 2} P = \frac{\tilde{\eta}}{\tilde{\eta} - 2}\, c P$$

where $\eta > 2$ and $\tilde{\eta} > 2$. Then, the scale factor $c$ can be obtained:

$$c = \frac{\eta (\tilde{\eta} - 2)}{(\eta - 2)\, \tilde{\eta}}$$
The PDF of process and measurement noises can be approximated in the same way.
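As a quick numeric check of Equation (35) (with assumed values $\tilde{\eta} = 10$ and $\eta = \tilde{\eta} + n_z = 12$, i.e., $n_z = 2$), the scale factor is $c = 0.96$ and the matched variances agree:

```python
# Moment-matching scale factor, Equation (35), for assumed dof values.
eta, eta_tilde = 12.0, 10.0
c = eta * (eta_tilde - 2.0) / ((eta - 2.0) * eta_tilde)          # = 0.96
# Check: eta/(eta-2) * P  equals  eta_tilde/(eta_tilde-2) * c * P
assert abs(eta / (eta - 2.0) - eta_tilde / (eta_tilde - 2.0) * c) < 1e-12
```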

3.2. Consensus on Mixed Density

The basic idea of consensus is to compute an aggregate over the whole network by iteratively performing the same type of local computation at each node, using only information from its adjacent nodes. Here, consensus is used to average the PDF of the state at each node with the PDFs of the state received from its neighbors. Given PDFs $p(\cdot)$ and $q(\cdot)$ and a weight $\pi$, define the following information fusion and weighting operations:
$$p(x) \oplus q(x) = \frac{p(x)\, q(x)}{\int p(x)\, q(x)\, dx}$$

$$\pi \odot p(x) = \frac{p(x)^{\pi}}{\int p(x)^{\pi}\, dx}$$
The Kullback–Leibler average (KLA), which is based on relative entropy, defines the average information of a set of PDFs. The weighted KLA of the PDFs $\{p^i(\cdot)\}$ is defined as

$$\bar{p} = \arg \inf_{p(\cdot)} \sum_{i \in \mathcal{N}} \pi^i \, \mathrm{KL}(p \,\|\, p^i)$$
where $\pi^i > 0$ and $\sum_{i \in \mathcal{N}} \pi^i = 1$ are the relative weights, and $\mathrm{KL}(p \,\|\, p^i)$ is the Kullback–Leibler divergence (KLD) between $p(\cdot)$ and $p^i(\cdot)$. The problem of the consensus algorithm can be described as finding a way to make
$$\lim_{\ell \to +\infty} p_\ell^i = \bar{p}(x), \quad \forall i \in \mathcal{N}$$
where the asymptotic PDF $\bar{p}(\cdot)$ represents the KLA with equal weights. The solution is the collective KLA: the weighted KLA is given by the normalized weighted geometric mean of the PDFs
$$\bar{p}(x) = \frac{\prod_{i \in \mathcal{N}} p^i(x)^{\pi^i}}{\int \prod_{i \in \mathcal{N}} p^i(x)^{\pi^i}\, dx} = \bigoplus_{i \in \mathcal{N}} \left( \pi^i \odot p^i(x) \right)$$
It can be calculated by updating local data in a distributed manner using a convex combination with data from neighbors
$$p_\ell^i(x) = \bigoplus_{j \in \mathcal{N}_i} \left( \pi^{i,j} \odot p_{\ell-1}^j(x) \right), \quad i \in \mathcal{N}$$
where $\ell$ is the consensus iteration step, and $\pi^{i,j}$ is the consensus weight satisfying $\pi^{i,j} \geq 0$ and $\sum_{j \in \mathcal{N}_i} \pi^{i,j} = 1$; it is also the $(i,j)$ component of the consensus matrix $\Pi$ (with $\pi^{i,j} = 0$ if $j \notin \mathcal{N}_i$). The initial value of the iteration is $p_0^i(\cdot) = p^i(\cdot)$. Let $\pi_\ell^{i,j}$ be the $(i,j)$ component of the matrix $\Pi^\ell$. Then,
$$p_\ell^i(x) = \bigoplus_{j \in \mathcal{N}} \left( \pi_\ell^{i,j} \odot p^j(x) \right), \quad i \in \mathcal{N}$$
When π i , j is chosen so that matrix Π is primitive and doubly stochastic,
$$\lim_{\ell \to +\infty} \pi_\ell^{i,j} = \frac{1}{|\mathcal{N}|}, \quad \forall i \in \mathcal{N}$$
Therefore, as the number of consensus steps increases, each local PDF tends to the unweighted KLA.
For a Gaussian PDF $p^i(x) = \mathcal{N}(x; m^i, P^i)$, it can be proved that the PDF consensus algorithm simplifies to algebraic expressions involving only the information vector $q^i = (P^i)^{-1} m^i$ and the information matrix $\Omega^i = (P^i)^{-1}$:
$$\Omega_\ell^i = \sum_{j \in \mathcal{N}_i} \pi^{i,j}\, \Omega_{\ell-1}^j, \quad i \in \mathcal{N}, \quad \ell = 1, 2, \ldots$$

$$q_\ell^i = \sum_{j \in \mathcal{N}_i} \pi^{i,j}\, q_{\ell-1}^j, \quad i \in \mathcal{N}, \quad \ell = 1, 2, \ldots$$
This is the so-called CI consensus method.
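A minimal sketch of this CI recursion over a generic network (the dictionary-based network representation and names are our own assumptions):

```python
# L steps of CI consensus, Equations (44)-(45), on information pairs.
import numpy as np

def ci_consensus(Omega0, q0, neighbors, weights, L):
    """Omega0, q0: dicts node -> information matrix / vector.
    neighbors[i] must include i itself; weights[(i, j)] are the pi^{i,j}."""
    Omega, q = dict(Omega0), dict(q0)
    for _ in range(L):
        Omega_new, q_new = {}, {}
        for i in neighbors:
            Omega_new[i] = sum(weights[(i, j)] * Omega[j] for j in neighbors[i])
            q_new[i] = sum(weights[(i, j)] * q[j] for j in neighbors[i])
        Omega, q = Omega_new, q_new
    return Omega, q
```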
In the above section, the multi-distribution filter for a single sensor was given. To extend it to the distributed case, two important problems need to be solved: (1) for the mixed posterior distribution, which information should be transferred between adjacent nodes, and what strategy should be adopted to realize the collective KLA; and (2) since the mixed distribution is non-Gaussian, whether the CI consensus strategy for the Gaussian distribution can be applied directly to the mixture of Gaussian and student-t distributions.
For the first problem, the following consensus strategy is given: consensus is first reached on the distribution probabilities $\mu_k^r$; then the fused PDF $p^i(x) = \sum_{r=0}^{1} \mu_k^r p^{i,r}(x)$ is formed as the initial value, and consensus is run on the fused PDF. This strategy only requires transmitting the distribution probabilities and the fused PDF.
It should be noted that the above consensus method applies to continuous PDFs. The distribution probability is discrete, so its distribution is a probability mass function (PMF). Given PMFs $\mu^i$ ($i \in \mathcal{N}$), the weighted KLA is defined as

$$\bar{\mu} = \arg \inf_{\mu} \sum_{i \in \mathcal{N}} \pi^i \, \mathrm{KL}(\mu \,\|\, \mu^i)$$
For PMFs $\mu$, $\nu$ and weight $\pi > 0$, the following information fusion and weighting operations are defined:
$$\mu \oplus \nu = \eta = \mathrm{col}(\eta^j)_{j \in M} \quad \text{with} \quad \eta^j = \frac{\mu^j \nu^j}{\sum_{h \in M} \mu^h \nu^h} \triangleq \mu^j \oplus \nu^j, \quad j \in M$$

$$\pi \odot \mu = \eta = \mathrm{col}(\eta^j)_{j \in M} \quad \text{with} \quad \eta^j = \frac{(\mu^j)^{\pi}}{\sum_{h \in M} (\mu^h)^{\pi}} \triangleq \pi \odot \mu^j, \quad j \in M$$
Then, the KLA probability mass function can be expressed as
$$\bar{\mu} = \bigoplus_{i \in \mathcal{N}} \left( \pi^i \odot \mu^i \right)$$
The collective fusion of the PMF can be obtained in a distributed manner:
$$\mu_\ell^i = \bigoplus_{j \in \mathcal{N}_i} \left( \pi^{i,j} \odot \mu_{\ell-1}^j \right)$$
where $\pi^{i,j} > 0$ is the consensus weight, and $\sum_{j \in \mathcal{N}_i} \pi^{i,j} = 1$.
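In log space, one step of the PMF consensus of Equation (50) at node $i$ reduces to a weighted arithmetic mean followed by renormalization (a sketch; names are assumptions):

```python
# One PMF consensus step: weighted geometric mean of neighbors' PMFs.
import numpy as np

def pmf_consensus_step(mu, neighbors, weights, i):
    """mu: dict node -> probability vector over the hypotheses."""
    log_mu = sum(weights[(i, j)] * np.log(mu[j]) for j in neighbors[i])
    out = np.exp(log_mu)
    return out / out.sum()                       # renormalize to a valid PMF
```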
Before answering the second question, let us first consider how to approximate a student-t distribution by a Gaussian distribution, that is, how to find a scalar $c$ that minimizes the difference between $q(x) = \mathcal{N}(x; \hat{x}, cP)$ and $p(x) = \mathrm{St}(x; \hat{x}, P, \eta)$ under a certain criterion. We know that as the dof of the student-t distribution tends to infinity, the distribution tends to a Gaussian. Thus, we can write $q(x) = \mathcal{N}(x; \hat{x}, cP) \approx \mathrm{St}(x; \hat{x}, cP, \tilde{\eta})$ for large $\tilde{\eta}$. In this way, the problem becomes an approximation between two student-t distributions, so we can use the moment matching method to obtain the value of $c$, that is,
$$c = \lim_{\tilde{\eta} \to \infty} \frac{\eta (\tilde{\eta} - 2)}{(\eta - 2)\, \tilde{\eta}} = \frac{\eta}{\eta - 2}$$
Therefore, $q(x) = \mathcal{N}\!\left(x; \hat{x}, \frac{\eta}{\eta - 2} P\right)$ can be used to approximate $p(x) = \mathrm{St}(x; \hat{x}, P, \eta)$. The fusion process can then be viewed as the fusion of two Gaussian distributions, so the CI consensus method based on the Gaussian distribution can be applied directly.
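This approximation amounts to a one-line transformation (a sketch, not the authors' code):

```python
# Moment-matched Gaussian approximation of St(x; m, P, eta), Equation (51).
def student_t_to_gaussian(m, P, eta):
    """Return the mean and covariance of the matched Gaussian (requires eta > 2)."""
    return m, eta / (eta - 2.0) * P
```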

3.3. The Distributed Multi-Distribution Filter

We have previously obtained the algorithm for a single sensor and the consensus strategy for multi-sensor mixed density. Now, we need to extend the results to the distributed case.
For local node $i$, the initial values $\hat{x}_0^{i,r}$, $P_0^{i,r}$ and $\mu_0^r$ are given for $r \in H$. For the student-t filter, the common dof $\eta$ is also given. When $k > 0$, the following recursive process starts.
Step 1 
Parallel filtering
For the Gaussian filter, we use Equations (9)–(15). For the student-t distribution filter, we make an approximation so that
$$\tilde{\eta}_{k-1}^i = \eta, \quad c_k^i = \frac{(\eta + n_z)(\eta - 2)}{(\eta + n_z - 2)\, \eta}$$

$$\tilde{P}_{k-1}^{i,1} = c_k^i P_{k-1}^{i,1}, \quad \tilde{Q}_{k-1}^{i,1} = c_k^i Q_{k-1}^{i,1}, \quad \tilde{R}_k^{i,1} = c_k^i R_k^{i,1}$$
Then, Equations (17), (19) and (24) become
$$P_{k|k-1}^{i,1} = F_k \tilde{P}_{k-1}^{i,1} F_k^T + \tilde{Q}_{k-1}^{i,1}$$

$$S_k^{i,1} = H_k P_{k|k-1}^{i,1} H_k^T + \tilde{R}_k^{i,1}$$

$$\eta_k = \tilde{\eta}_{k-1} + n_z$$
Therefore, we can use Equations (16), (54), (18), (55), (20)–(23) and (56) for the student-t distribution filter. After that, we obtain $\hat{x}_k^{i,r}$, $P_k^{i,r}$, $\tilde{z}_k^{i,r}$ and $S_k^{i,r}$ for $r \in H$, as well as $\eta_k^i$.
Step 2 
Calculate distribution probability
Update probability
$$\mu_{k,0}^{i,r} = \frac{\Lambda_k^{i,r}\, \mu_{k-1}^{i,r}}{\sum_{j \in H} \Lambda_k^{i,j}\, \mu_{k-1}^{i,j}}$$
where the likelihoods $\Lambda_k^{i,0}$ and $\Lambda_k^{i,1}$ of the Gaussian and student-t filters can be obtained by Equations (27) and (28).
Step 3 
Consensus on distribution probability
For L-step consensus,
$$\mu_{k,\ell}^{i,r} = \bigoplus_{j \in \mathcal{N}_i} \left( \pi^{i,j} \odot \mu_{k,\ell-1}^{j,r} \right), \quad \ell = 1, \ldots, L, \quad r \in H$$

$$\mu_k^{i,r} = \mu_{k,L}^{i,r}$$

where $\ell = 1, 2, \ldots, L$ indexes the consensus steps, and $\pi^{i,j}$ is the consensus weight with $\pi^{i,j} \geq 0$ and $\sum_{j \in \mathcal{N}_i} \pi^{i,j} = 1$.
Step 4 
Fuse the mixed PDF
$$\hat{x}_{k,0}^i = \sum_{r \in H} \mu_k^{i,r}\, \hat{x}_k^{i,r}$$

$$\bar{P}_k^{i,1} = \frac{\eta_k}{\eta_k - 2}\, P_k^{i,1}$$

$$P_{k,0}^i = \mu_k^{i,0} P_k^{i,0} + \mu_k^{i,1} \bar{P}_k^{i,1} + \sum_{r \in H} \mu_k^{i,r} (\hat{x}_k^{i,r} - \hat{x}_{k,0}^i)(\hat{x}_k^{i,r} - \hat{x}_{k,0}^i)^T$$
Step 5 
Consensus on fused PDF
$$\Omega_{k,0}^i = (P_{k,0}^i)^{-1}, \quad q_{k,0}^i = \Omega_{k,0}^i\, \hat{x}_{k,0}^i$$
For L-step consensus,
$$\Omega_{k,\ell}^i = \sum_{j \in \mathcal{N}_i} \pi^{i,j}\, \Omega_{k,\ell-1}^j, \quad \ell = 1, \ldots, L$$

$$q_{k,\ell}^i = \sum_{j \in \mathcal{N}_i} \pi^{i,j}\, q_{k,\ell-1}^j, \quad \ell = 1, \ldots, L$$
Step 6 
Reinitialization
$$\hat{x}_k^{i,0} = (\Omega_{k,L}^i)^{-1} q_{k,L}^i, \quad P_k^{i,0} = (\Omega_{k,L}^i)^{-1}$$

$$\hat{x}_k^{i,1} = (\Omega_{k,L}^i)^{-1} q_{k,L}^i, \quad P_k^{i,1} = \frac{\eta_k - 2}{\eta_k} (\Omega_{k,L}^i)^{-1}$$
The workflow of the proposed algorithm is shown in Figure 4, and the pseudocode is summarized in Algorithm 1.
Algorithm 1: Distributed consensus multi-distribution filter (DCMDF)
Given the initial values $\hat{x}_0^{i,r}$, $P_0^{i,r}$ and $\mu_0^r$ for $r \in H$ and the common dof $\eta$, for each time step $k$ at every node $i$, run the following recursive process:
Step1 Parallel filtering:
    Gaussian filtering: calculate $\hat{x}_k^{i,0}$, $P_k^{i,0}$, $S_k^{i,0}$ by Equations (9)–(15)
    Student-t filtering: calculate $\hat{x}_k^{i,1}$, $P_k^{i,1}$, $S_k^{i,1}$ by Equations (16)–(22), (54) and (55)
    $\eta_k = \tilde{\eta}_{k-1} + n_z$
Step2 Calculate distribution probability:
    Calculate $\mu_{k,0}^{i,r}$ according to Equation (57)
Step3 Consensus on distribution probability:
    for $\ell = 1$ to $L$
        Calculate $\mu_{k,\ell}^{i,r}$ according to Equation (58)
    end for
    $\mu_k^{i,r} = \mu_{k,L}^{i,r}$
Step4 Fuse the mixed PDF:
    Calculate $\hat{x}_{k,0}^i$ and $P_{k,0}^i$ according to Equations (60)–(62)
Step5 Consensus on fused PDF:
    $\Omega_{k,0}^i = (P_{k,0}^i)^{-1}$, $q_{k,0}^i = \Omega_{k,0}^i \hat{x}_{k,0}^i$
    for $\ell = 1$ to $L$
        Calculate $\Omega_{k,\ell}^i$ and $q_{k,\ell}^i$ according to Equations (63) and (64)
    end for
Step6 Reinitialization:
    $\hat{x}_k^{i,0} = (\Omega_{k,L}^i)^{-1} q_{k,L}^i$, $P_k^{i,0} = (\Omega_{k,L}^i)^{-1}$
    $\hat{x}_k^{i,1} = (\Omega_{k,L}^i)^{-1} q_{k,L}^i$, $P_k^{i,1} = \frac{\eta_k - 2}{\eta_k} (\Omega_{k,L}^i)^{-1}$
Return: $\hat{x}_k^{i,0}$, $P_k^{i,0}$, $\hat{x}_k^{i,1}$ and $P_k^{i,1}$

4. Numerical Simulation

Consider a linear dynamic system whose state is $x = [p_x, \dot{p}_x, p_y, \dot{p}_y]^T$:

$$F_k = \begin{bmatrix} 1 & T & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & T \\ 0 & 0 & 0 & 1 \end{bmatrix}, \quad Q_k = G_k \Delta G_k^T$$

$$G_k = \begin{bmatrix} T^2/2 & 0 \\ T & 0 \\ 0 & T^2/2 \\ 0 & T \end{bmatrix}$$
where the sampling time is $T = 1$ s, $\Delta = \mathrm{diag}([w_x^2, w_y^2])$, and $w_x^2 = w_y^2 = 0.1$.
There are 20 sensor nodes in the sensor network, and their graphical topology is shown in Figure 5. The measurement model is
$$H_k^i = \begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 \end{bmatrix}$$

The standard measurement noise variance is $R = \mathrm{diag}([(15\ \mathrm{m})^2, (15\ \mathrm{m})^2])$.
Assume that the initial true state is

$$x_0 = [2600\ \mathrm{m},\ 20\ \mathrm{m/s},\ 3800\ \mathrm{m},\ 10\ \mathrm{m/s}]^T$$

In the simulation, the initial state of each filter is randomly drawn from $\mathcal{N}(x_0, P_0)$, where

$$P_0 = \mathrm{diag}([50^2\ \mathrm{m}^2,\ 5^2\ \mathrm{m}^2/\mathrm{s}^2,\ 50^2\ \mathrm{m}^2,\ 5^2\ \mathrm{m}^2/\mathrm{s}^2])$$
Process noise and measurement noise with outliers are generated by the following model [20,23]:
$$w_k \sim \begin{cases} \mathcal{N}(0, Q) & \text{with probability } 1 - p_o \\ \mathcal{N}(0, 100Q) & \text{with probability } p_o \end{cases}$$

$$v_k^i \sim \begin{cases} \mathcal{N}(0, R) & \text{with probability } 1 - p_o \\ \mathcal{N}(0, 100R) & \text{with probability } p_o \end{cases}$$

where $p_o$ is the probability of outliers occurring. It should be noted that this model for heavy-tailed noise is widely used.
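Under the stated parameters, the simulation model and the outlier-noise generator can be reproduced as follows (a hedged sketch; the random seed and the helper function are our own assumptions):

```python
# Constant-velocity model, position-only measurements, and the
# two-component Gaussian mixture that generates heavy-tailed noise.
import numpy as np

T = 1.0
F = np.array([[1, T, 0, 0],
              [0, 1, 0, 0],
              [0, 0, 1, T],
              [0, 0, 0, 1]])
G = np.array([[T**2 / 2, 0],
              [T,        0],
              [0, T**2 / 2],
              [0,        T]])
Delta = np.diag([0.1, 0.1])                      # w_x^2 = w_y^2 = 0.1
Q = G @ Delta @ G.T
H = np.array([[1, 0, 0, 0],
              [0, 0, 1, 0]])
R = np.diag([15.0**2, 15.0**2])                  # (15 m)^2 per axis

def sample_outlier_noise(cov, p_o, rng):
    """Draw from N(0, cov) w.p. 1 - p_o and from N(0, 100*cov) w.p. p_o."""
    scale = 100.0 if rng.random() < p_o else 1.0
    return rng.multivariate_normal(np.zeros(cov.shape[0]), scale * cov)

rng = np.random.default_rng(0)                   # assumed seed
w_k = sample_outlier_noise(Q, p_o=0.3, rng=rng)
v_k = sample_outlier_noise(R, p_o=0.3, rng=rng)
```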
In this section, the following four methods are compared: (1) the distributed consensus Kalman filter [14], referred to as DCKF; (2) the distributed consensus student-t filter proposed in [23], referred to as DCSTF; (3) the proposed algorithm, referred to as DCMDF; and (4) the multiple-model method with two Gaussian distributions, referred to as DCKFIMM.
The number of consensus steps was set to $L = 3$, and the consensus weights were set to $\pi^{i,j} = 1/|\mathcal{N}_i|$ if $j \in \mathcal{N}_i$ and $\pi^{i,j} = 0$ otherwise. The dof of the student-t distribution in DCSTF and DCMDF was $\eta = 10$. One model of DCKFIMM is the same as the DCKF, and the other model has the following parameters: the measurement noise variance is $100R$, and the process noise variance is $100Q$. The simulation results were obtained from 100 Monte Carlo runs and evaluated using the root mean square errors (RMSEs) of the position and velocity.
First, the occurrence probability of outliers was set to $p_o = 0$, meaning that the noises are Gaussian. The RMSEs of the position and velocity of the four algorithms are shown in Figure 6 and Figure 7. Under the Gaussian condition, the performance of the DCMDF algorithm is almost the same as that of DCKF, while the error of DCSTF is relatively large. The performance of the DCKFIMM algorithm is better than that of DCSTF and slightly worse than that of DCMDF. This result shows that Gaussian models are better than a single student-t model at handling estimation problems in normal noise environments. At the same time, it is worth noting that DCMDF adopts a mixed model of Gaussian and student-t distributions; therefore, it can obtain higher estimation accuracy through model adaptation.
Next, the occurrence probability of outliers was set to $p_o = 0.3$. Figure 8 presents the true and estimated trajectories for one simulation run. Figure 9 and Figure 10 show the RMSEs of the position and velocity of the four algorithms. DCKF has the worst performance, which is caused by the natural limitation of the Gaussian distribution: although it achieves good performance in a normal noise environment, it struggles to deal with estimation in the presence of outliers. DCKFIMM performs better than DCKF; however, it is worse than DCSTF and DCMDF, because the latter two adopt the student-t distribution, which better matches the noise distribution when outliers occur. It is worth noting that DCMDF has the best performance because it uses a hybrid distribution model that adapts during the estimation process, making the algorithm more robust and accurate.
Table 3 and Table 4 show the simulation results of the four algorithms under different outlier probabilities. The RMSEs of all four algorithms increase with the outlier probability. However, DCMDF consistently shows better estimation performance, and the estimation accuracy of the proposed algorithm improved by at least 20% compared to the other algorithms when $p_o \neq 0$, which further proves that DCMDF performs well in terms of both robustness and estimation accuracy.
To further test the performance of the algorithm, we reduced the number of nodes and the density of the network. The topology of a sensor network with 15 nodes is shown in Figure 11. Since the number of nodes and the network density were reduced, we increased the number of consensus steps to 6. The RMSEs of the position and velocity are shown in Table 5 and Table 6. It can be seen that the proposed algorithm still achieves better estimation accuracy.
To handle the estimation problem with heavy-tailed noise, in this study, we made two hypotheses for the model noise, based on the Gaussian distribution and the student-t distribution. The reason why the student-t distribution was chosen to represent the heavy-tailed noise distribution is discussed in detail in [23] and is briefly reviewed here: by adjusting the dof, the student-t distribution can tend toward either a Gaussian or a Cauchy distribution, which gives it good flexibility. Similar to the traditional multiple-model algorithm, we calculate state estimates under the two hypothesized noise models and then fuse the two results with different weights. Therefore, the proposed algorithm can balance robustness and estimation accuracy.
The computation and communication burdens are important issues in distributed state estimation. Compared with the traditional CI algorithm, the additional computation of the proposed algorithm lies mainly in the local estimation step: the amount of computation is doubled, but a parallel computing strategy can be adopted. The other additional step is the consensus on the distribution probabilities, which adds a small amount of computation; the increase in communication also comes from this step. It should be noted that the quantities of interest are the state and its error matrix, so this step can be omitted in applications. The increases in computation and communication from the distribution probability calculation and consensus would then be eliminated, further improving computational efficiency and reducing the communication burden.

5. Conclusions and Outlook

For the problem of distributed state estimation in which both the process noise and measurement noise are heavy-tailed, a distributed consensus multi-distribution state estimation algorithm based on parallel Gaussian and student-t filters is proposed in this paper, with the following steps:
(1)
The system model is established as two models based on the Gaussian distribution and the student-t distribution, and each model is assigned a filter matched to its distribution. An algorithm based on single-sensor observations is derived, and the combined estimate based on the mixed posterior distribution is presented. To prevent the filter from quickly converging to a Gaussian, the moment matching method is used to approximate the filter and retain its heavy-tailed characteristics.
(2)
Aiming at the consensus problem for the mixed probability density, a mixed strategy based on CI consensus is proposed: first, consensus on the discrete distribution probability (a PMF) is carried out to obtain the KLA PMF; then, the posteriors of the multiple distributions are combined using the distribution probabilities to obtain the combined posterior; finally, the CI method is used to reach consensus on the combined posterior. Based on this hybrid probability consensus strategy and the single-sensor algorithm, the recursive steps of distributed consensus multi-distribution filtering are presented.
(3)
The simulation results show that the proposed algorithm can achieve good results in both Gaussian noise and heavy-tailed noise scenarios.
Although the method proposed in this paper deals with state estimation under heavy-tailed noise, it is still a relatively coarse processing method. The algorithm can guarantee a lower bound on estimation accuracy, but there is still potential for improvement.
For example, in target tracking, the process model is time-varying. The two types of noise hypotheses in this paper can model the uncertainty of the process transition, but this modeling is still not precise enough. In a broad sense, the algorithm proposed in this paper can be regarded as a special case of the multiple-model method. Therefore, the concepts of probability transfer and input interaction from multiple-model methods can also be applied to the proposed algorithm. How to integrate these techniques is one of the problems to be addressed in future work.

Author Contributions

Conceptualization, G.-N.C. and W.-X.F.; methodology, G.-N.C. and P.D.; software, G.-N.C., T.C. and L.-Y.S.; validation, G.-N.C., W.-X.F. and T.C.; formal analysis, G.-N.C. and W.-X.F.; investigation, W.-X.F.; resources, P.D.; data curation, W.-X.F.; writing—original draft preparation, G.-N.C. and L.-Y.S.; writing—review and editing, G.-N.C. and T.C.; visualization, G.-N.C. and L.-Y.S.; supervision, P.D.; project administration, W.-X.F.; funding acquisition, P.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work was jointly supported by the National Natural Science Foundation of China (No. 61803260).

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

Acronyms and Nomenclatures

Notation | Definition
CE | consensus on estimation
CM | consensus on measurement
CI | consensus on information
HCMCI | hybrid CM-CI
dof | degrees of freedom
PDF | probability density function
KLA | Kullback–Leibler average
KLD | Kullback–Leibler divergence
DCKF | distributed consensus Kalman filter
DCSTF | distributed consensus student-t filter
DCMDF | distributed consensus multiple-distribution filter
DCKFIMM | distributed multiple-model method with two Gaussian distributions
RMSE | root mean square error
PMF | probability mass function

References

1. Ding, D.; Han, Q.L.; Wang, Z.; Ge, X. A survey on model-based distributed control and filtering for industrial cyber-physical systems. IEEE Trans. Ind. Inform. 2019, 15, 2483–2499.
2. He, S.; Shin, H.; Xu, S.; Tsourdos, A. Distributed estimation over a low-cost sensor network: A review of state-of-the-art. Inform. Fusion 2020, 54, 21–43.
3. Huang, S.; Li, Y.; Wu, J. Distributed state estimation for linear time-invariant dynamical systems: A review of theories and algorithms. Chin. J. Aeronaut. 2022, 35, 1–17.
4. Sirocchi, C.; Bogliolo, A. Community-Based Gossip Algorithm for Distributed Averaging. In Proceedings of the IFIP International Conference on Distributed Applications and Interoperable Systems, Lisbon, Portugal, 19–23 June 2023.
5. Loizou, N.; Richtarik, P. Revisiting randomized gossip algorithms: General framework, convergence rates and novel block and accelerated protocols. IEEE Trans. Ind. Inform. 2019, 15, 2483–2499.
6. Cai, P.; Wang, S.; Chen, Y. Diffusion Mixture Minimum Total Error Entropy Adaptive Filtering Algorithm and Its Performance Analysis. IEEE Trans. Signal Inf. Process. Over Netw. 2023, 9, 397–411.
7. Liang, Y.; Li, Y.; Chen, Y.; Sheng, A. Event-triggered diffusion nonlinear estimation for sensor networks with unknown cross-correlations. Syst. Control Lett. 2023, 175, 105506.
8. Olfati-Saber, R. Distributed Kalman filter with embedded consensus filters. In Proceedings of the 44th IEEE Conference on Decision and Control, Seville, Spain, 15 December 2005.
9. Olfati-Saber, R.; Murray, R.M. Consensus problems in networks of agents with switching topology and time-delays. IEEE Trans. Autom. Control 2004, 49, 1520–1533.
10. Battistelli, G.; Chisci, L.; Fantacci, C. Parallel consensus on likelihoods and priors for networked nonlinear filtering. IEEE Signal Process. Lett. 2014, 21, 787–791.
11. Battistelli, G.; Chisci, L.; Mugnai, G.; Farina, A.; Graziano, A. Consensus-based linear and nonlinear filtering. IEEE Trans. Autom. Control 2015, 60, 1410–1415.
12. Olfati-Saber, R. Distributed Kalman filtering for sensor networks. In Proceedings of the 2007 46th IEEE Conference on Decision and Control, New Orleans, LA, USA, 12–14 December 2007.
13. Battistelli, G.; Chisci, L.; Morrocchi, S.; Papi, F. An information-theoretic approach to distributed state estimation. IFAC Proc. Vol. 2011, 44, 12477–12482.
14. Battistelli, G.; Chisci, L. Kullback–Leibler average, consensus on probability densities, and distributed state estimation with guaranteed stability. Automatica 2014, 50, 707–718.
15. He, X.; Xue, W.; Fang, H. Consistent distributed state estimation with global observability over sensor network. Automatica 2018, 92, 162–172.
16. Qu, H.; Yang, F.; Han, Q.; Zhang, Y. Distributed H∞ consensus filtering for attitude tracking using ground-based radars. IEEE Trans. Cybern. 2021, 51, 3767–3778.
17. Rego, F.C.; Pu, Y.; Alessandretti, A.; Aguiar, A.P.; Pascoal, A.M.; Jones, C.N. A distributed Luenberger observer for linear state feedback systems with quantized and rate-limited communications. IEEE Trans. Autom. Control 2021, 66, 3922–3937.
18. Ting, J.A.; Theodorou, E.; Schaal, S. A Kalman filter for robust outlier detection. In Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, San Diego, CA, USA, 29 October–2 November 2007.
19. Fu, H.; Cheng, Y. A Novel Robust Kalman Filter Based on Switching Gaussian-Heavy-Tailed Distribution. IEEE Trans. Circuits Syst. II Express Briefs 2022, 69, 3012–3016.
20. Huang, Y.; Zhang, Y.; Li, N.; Wu, Z.; Chambers, J.A. A novel robust student’s t based Kalman filter. IEEE Trans. Aerosp. Electron. Syst. 2019, 55, 1545–1554.
21. Huang, Y.; Zhang, Y.; Xu, B.; Wu, Z.; Chambers, J. A New Outlier-Robust Student’s t Based Gaussian Approximate Filter for Cooperative Localization. IEEE-ASME Trans. Mechatron. 2017, 22, 2380–2386.
22. Dong, P.; Jing, Z.; Shen, K.; Li, M. A distributed consensus filter for sensor networks with heavy-tailed measurement noise. Sci. China Inform. Sci. 2018, 61, 119201.
23. Wang, J.; Dong, P.; Shen, K.; Song, X.; Wang, X. Distributed consensus student-t filter for sensor networks with heavy-tailed process and measurement noises. IEEE Access 2020, 8, 167865–167874.
24. Huang, Y.; Zhang, Y.; Zhao, Y.; Chambers, J.A. A novel robust Gaussian–student’s t mixture distribution based Kalman filter. IEEE Trans. Signal Process. 2019, 67, 3606–3620.
25. Roth, M.; Ozkan, E.; Gustafsson, F. A student’s t filter for heavy tailed process and measurement noise. In Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada, 26–31 May 2013.
Figure 1. Research background and problem.
Figure 2. The organization of this paper.
Figure 3. Logarithmic heat maps of the Gaussian distribution (left) and student-t distribution (right).
Figure 4. The workflow of the proposed algorithm.
Figure 5. Topology of the sensor network.
Figure 6. Position root mean square errors of different algorithms.
Figure 7. Velocity root mean square errors of different algorithms.
Figure 8. The true and estimated trajectories for one simulation.
Figure 9. Position root mean square errors of different algorithms.
Figure 10. Velocity root mean square errors of different algorithms.
Figure 11. Topology of the sensor network with 15 nodes.
Table 1. Comparison with related studies.

Reference | Consensus | Gaussian Distribution | Heavy-Tailed Distribution
[10,11,12,16,17] | ✓ | ✓ |
[19,20,21] |  |  | ✓
[22,23] | ✓ |  | ✓
[24] |  | ✓ | ✓
the proposed algorithm | ✓ | ✓ | ✓
Table 2. Variables of the system model.

Variable | Meaning
x | State
z | Measurement
m | Mean value
P | Covariance
η | dof
Γ | Gamma function
Table 3. Root mean square errors of position under different outlier probabilities.

p_o | DCKF | DCSTF | DCMDF | DCKFIMM
0   | 3.8339  | 5.5194  | 3.7899  | 4.4447
0.1 | 11.4991 | 8.9549  | 7.0405  | 9.0710
0.2 | 15.3950 | 9.4373  | 8.1685  | 11.5212
0.3 | 18.2466 | 11.4850 | 9.6818  | 14.0321
0.4 | 20.9295 | 14.8262 | 12.2020 | 18.3049
Table 4. Root mean square errors of velocity under different outlier probabilities.

p_o | DCKF | DCSTF | DCMDF | DCKFIMM
0   | 0.9405 | 1.0602 | 0.9376 | 1.0908
0.1 | 2.4378 | 2.2842 | 2.1637 | 2.2934
0.2 | 3.0931 | 2.6195 | 2.5565 | 2.3951
0.3 | 3.5050 | 2.9143 | 2.7751 | 3.0377
0.4 | 3.9130 | 3.3194 | 3.0398 | 3.4926
Table 5. Root mean square errors of position under different outlier probabilities.

p_o | DCKF | DCSTF | DCMDF | DCKFIMM
0   | 3.7362  | 4.7315  | 3.7021  | 4.1563
0.1 | 11.5420 | 8.4957  | 6.9921  | 9.8612
0.2 | 15.2641 | 9.5924  | 8.0813  | 11.6715
0.3 | 18.8536 | 11.4042 | 9.7624  | 15.6121
0.4 | 19.7912 | 13.8126 | 11.4620 | 17.9736
Table 6. Root mean square errors of velocity under different outlier probabilities.

p_o | DCKF | DCSTF | DCMDF | DCKFIMM
0   | 0.7623 | 0.8325 | 0.7546 | 0.8216
0.1 | 1.9820 | 1.7741 | 2.0831 | 1.8742
0.2 | 3.0362 | 2.6135 | 2.5272 | 2.6718
0.3 | 3.4931 | 2.8635 | 2.7452 | 3.1803
0.4 | 3.8842 | 3.3146 | 3.0653 | 3.6943