1. Introduction
The sensor control problem is often considered under the umbrella of “Sensor Management”, which deals with a wide range of sensor-related issues in applications such as sensor scheduling, communication protocols, and sensor kinematics. Sensor control in multi-object tracking systems, on the other hand, is a more specific domain that pays exclusive attention to applications requiring sensor control, in which all deployed sensors are efficiently utilized to gain a comprehensive understanding of the tracking scenario.
The term ‘Sensor Control’, in the context of this paper, refers to the dynamic deployment of one or more sensors in various multi-object tracking scenarios, by seeking the best control command for achieving optimum tracking results. The control command refers to the control of the freedom of movement of mobile sensors in a sensing system, to efficiently track the objects within the operational constraints of the system. The sensor control strategies are devised using stochastic methods, given the highly probabilistic scenario posed by the unpredictability of the noise, clutter and the object behaviors.
This review focuses on stochastic control solutions for sensor movements in multi-object systems that are modeled in the Random Finite Set (RFS) framework. Such solutions are usually formulated based on the widely followed Partially Observable Markov Decision Process (POMDP) [1] approach for Bayesian multi-object filtering, in which each action is perceived as a result of the previous measurement.
Although the broader area of sensor management has seen tremendous growth in the past decade and has been reviewed in several papers in the literature [2,3,4,5,6,7,8,9,10,11,12,13,14,15,16], there seems to be no comprehensive review of the specific area of stochastic sensor control that provides detailed insight into the contributions and progress made in this domain. As such, in this work, we present the general sensor control problem and the key components of typical solutions introduced in the literature. This is followed by a categorization of the existing methods that includes a new generation of solutions called selective sensor control.
The solutions developed for conventional multi-object filters have also been reviewed in several papers in the literature [1,2,3,5,17,18,19,20,21,22,23]. Therefore, although we consider those methods in our categorization, we do not include a review of them in this article, and concentrate on the more recent literature that is focused on sensor control for RFS-based filters.
The paper is organized as follows. Section 2 presents an overview of the sensor control problem formulation and the key concepts. Section 3 provides a categorization of sensor control approaches. This is followed by Section 4, in which a review of the major contributions in each category is provided. Some comparative simulation experiments and their results are presented and discussed in Section 5, and a discussion on performance comparison is given in Section 6, followed by concluding remarks and future directions presented in Section 7.
2. Sensor Control Problem
The multi-object sensor control problem is a nonlinear stochastic control problem that aims to assign sensors the right sensor state at the right time [3]. The control problem is particularly challenging due to the high complexity and uncertainties involved in multi-object systems. Uncertainty is introduced into the system by unknown variations in the number of objects, as well as by the false alarms and misdetections inherent to the sensing process.
The sensor control decision is made in the presence of uncertainties in the object state and measurement spaces, usually with the assumption that the previous observation is available when making the next decision. Such stochastic control problems can be effectively handled in the POMDP framework [1], where the multi-object state is modeled as a Markov process whose true state is unknown, and knowledge about it is captured by the posterior probability density function (pdf) of the multi-object state conditioned on the past measurements. The solution to sensor control problems generally depends on the definition of a measure of goodness, usually quantified in terms of either the accuracy of the resulting multi-object state estimation or the information content of the resulting multi-object posterior distribution. The decision-making is performed by optimizing an objective function.
Consider a general multi-object system. As shown in the schematic diagram of Figure 1a, it receives measurements from sensor(s) and outputs multi-object state information at any time. The sensor control solution takes that information as input and outputs control command(s) for the sensor(s), which are then actuated accordingly before the next measurements are acquired and sent to the multi-object system for processing at the next iteration.
In many applications, the multi-object system is implemented as a Bayesian multi-object filter that mainly comprises three steps: prediction, update, and estimation. In such scenarios, the sensor control solution is devised as a stochastic control scheme embedded in the filtering recursion.
Figure 1b shows a schematic diagram of the complete closed-loop system, exhibiting the usual approach in which the sensor control module constructs the control commands from the predicted multi-object density.
Figure 2 presents how this information is processed inside a generic sensor control algorithm to generate a control command decision in the single-sensor case. Note that the outputs of the “Estimation” block are only determined after the sensor control commands have been chosen, the sensors actuated accordingly, and the measurements acquired and used to execute the update step. Hence, the outputs of the “Estimation” block cannot be directly used for sensor control. The prediction step, however, does not need the sensor measurements and can be executed before any decision is made on the sensor control commands. Therefore, the most informative (and most recently updated) input that can be given to the sensor control block in Figure 1b is the “Predicted multi-object density”. Let us denote the predicted multi-object density at time k by $\pi_{k|k-1}$ and the set of all admissible sensor control commands by $\mathbb{U}$. The predicted density is used first to extract a predicted set of object state estimates, denoted by $\hat{X}_{k|k-1}$. For each possible control command $u\in\mathbb{U}$, the following steps are taken:
First, a set of ideal noise-free and clutter-free measurements, called the predicted ideal measurement set (PIMS) [6] and denoted by $Z_k(u)$, is constructed as follows:
$$ Z_k(u) = \left\{ \operatorname*{arg\,max}_{z}\; g\!\left(z\,|\,\hat{x};u\right) \;:\; \hat{x}\in\hat{X}_{k|k-1} \right\}, $$
in which $g(z\,|\,x;u)$ is the single-object likelihood function (sensor model), which varies with the control command $u$.
Example 1. Consider a multi-object tracking application in 2D space, where each single-object state contains the object location coordinates (e.g., its x and y if the object is moving in a 2D space) and may include other components such as velocity, acceleration, and bearing angle. We also consider a sensor whose location is controllable, i.e., the control command is the sensor location vector u. The sensor measures range. That is, if it detects the object, it returns
$$ z = \|p - u\| + n, $$
where $p$ denotes the object location and the measurement noise n is a zero-mean Gaussian random variable with a range-dependent variance $\sigma^{2}(\|p-u\|)$. The single-object likelihood function is then given by:
$$ g(z\,|\,x;u) = \mathcal{N}\!\left(z;\, \|p-u\|,\, \sigma^{2}(\|p-u\|)\right). $$
Hence, for a given sensor location u and object location p, the PIMS measurement is given by $z^{*} = \|p-u\|$, which achieves the maximum likelihood value of $1/\sqrt{2\pi\,\sigma^{2}(\|p-u\|)}$.

After the PIMS is calculated, the update step of the multi-object filter is run. Since this update step utilizes the PIMS as a measurement set, we call it a pseudo-update, and its output is called a pseudo-posterior, denoted by $\pi_{k}(\cdot\,|u)$ (as shown in Figure 2). An essential component of the algorithm is an objective function that takes this pseudo-posterior as input and outputs a cost value $C(u)$ or a reward $R(u)$. An optimization search over the costs or rewards of all admissible control commands returns the best control command $u^{*}$. This command can be one of various actions such as spinning in positive or negative directions or making a step displacement in different directions.
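As an illustration of the closed loop in Figure 2, the following minimal Python sketch implements the generic single-sensor control cycle for the range-only sensor of Example 1. The functions pseudo_update and reward are placeholders for the filter's pseudo-update step and the chosen objective function; all names here are illustrative and are not taken from any of the reviewed papers.

```python
import numpy as np

def pims_range_only(pred_estimates, sensor_pos):
    """Predicted ideal measurement set (PIMS) for the range-only sensor of
    Example 1: each predicted state estimate generates its noise-free,
    clutter-free range to the candidate sensor position."""
    return [np.linalg.norm(np.asarray(x[:2]) - sensor_pos) for x in pred_estimates]

def select_command(pred_density, pred_estimates, commands, pseudo_update, reward):
    """Generic single-sensor control cycle of Figure 2: for each admissible
    command, build the PIMS, run a pseudo-update, score the pseudo-posterior
    with the chosen objective function, and keep the best command."""
    best_u, best_r = None, -np.inf
    for u in commands:
        Z = pims_range_only(pred_estimates, np.asarray(u))
        pseudo_post = pseudo_update(pred_density, Z, u)
        r = reward(pseudo_post)
        if r > best_r:
            best_u, best_r = u, r
    return best_u
```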
4. Survey of Recent Literature
In this section, we explore some of the significant methods that have been recently proposed in the signal processing literature for single and multi-sensor control for multi-target tracking in a stochastic POMDP framework.
Table 1 lists how each of the methods reviewed in this article is classified. In all these methods, the sensor control solution is primarily proposed to be used with RFS-based multi-target tracking filters. However, Mahler’s method called Posterior Expected Number of Targets (PENT) [5] can be used with both the RFS-based multi-target filters (such as the probability hypothesis density—PHD—filter) and the conventional ones (such as the MHT filter). This section provides the reader with a snapshot of the significant steps in the evolution of sensor control methods up to the current state-of-the-art.
4.1. Information-Driven Sensor Control
The purpose of sensor control is to effectively use the sensor(s) by allowing them to interact with the tracking environment to gain useful information that reduces uncertainty. Every movement of the sensor is aimed at an information gain on the targets’ states and cardinality, i.e., information on the accuracy of the state/location of the targets being tracked, and/or information on the presence or absence of targets. The amount of information gain is usually measured by the change in entropy between the probability densities prior to and after the sensor measurements. The earliest reference to the application of information-driven control for sensor management and state estimation, or, more precisely, to measures of information gain, is found in the work of Hintz and McVey [46], where Shannon entropy was employed with Kalman filters for target tracking.
A number of information theoretic divergence measures followed, for sensor control in single and multi-target tracking scenarios. These included divergence measures such as the Kullback–Leibler (KL) divergence [17] and the Rényi divergence [47] between the prior and posterior target densities. The expected divergence or information gain (which measures the difference between the information content of random variables) is thus used as the basis for selecting the optimal control action. Similarity between random variables can also be measured using the distance between their probability distributions, as in the total variation, Bhattacharyya, Hellinger–Matusita, and Wasserstein distances. These measures generally cannot be computed analytically and hence require expensive approximations such as Monte Carlo methods, except for some special cases. In this section, we briefly review information-driven sensor control methods due to their popular usage.
4.1.1. Rényi Divergence-Based Sensor Control
The Rényi or alpha divergence is a commonly used objective function in information-driven sensor control. It is defined as a measure of information gain from a prior to a posterior multi-target density. Referring to Figure 2, let us assume that, at time k, for a given sensor control command u, the predicted multi-object density $\pi_{k|k-1}$ evolves to a pseudo-posterior $\pi_{k}(\cdot\,|u)$ after going through a pseudo-update step using the PIMS $Z_k(u)$. The information gain from the prior to the pseudo-posterior (which is the information gained from the PIMS measurements) is quantified by the Rényi divergence between the two densities, defined as follows:
$$ D_{\alpha}\!\left(\pi_{k}(\cdot\,|u)\,\big\|\,\pi_{k|k-1}\right) = \frac{1}{\alpha-1}\,\log \int \pi_{k}(X|u)^{\alpha}\,\pi_{k|k-1}(X)^{1-\alpha}\,\delta X, $$
where $\alpha>0$ is an adjustable parameter. The Rényi divergence turns into the Kullback–Leibler divergence when $\alpha\to 1$, or the Hellinger affinity with $\alpha=0.5$. Set integration is generally defined as follows [42]:
$$ \int f(X)\,\delta X = f(\emptyset) + \sum_{n=1}^{\infty}\frac{1}{n!}\int f(\{x_1,\ldots,x_n\})\,dx_1\cdots dx_n. $$
Rényi Divergence-Based Sensor Control with a General RFS Filter
In 2010, Ristic and Vo [11] proposed an information-driven sensor control method built on the particle implementation of the Bayes multi-object filter using random finite sets. They employed the Rényi divergence as the reward function in their implementation. As there is no general closed-form solution for the Rényi divergence, a numerical approximation using a Sequential Monte Carlo (SMC) method is provided in the paper. Assume that the prior distribution is approximated by N particles,
$$ \pi_{k|k-1}(X) \approx \sum_{i=1}^{N} w^{(i)}\,\delta_{X^{(i)}}(X), $$
where each pair $\left(X^{(i)}, w^{(i)}\right)$ represents a multi-object set particle and its weight. Through application of Bayes’ rule (which relates the posterior to the prior through the multi-object likelihood function), the Rényi divergence is shown to be approximated by:
$$ D_{\alpha} \approx \frac{1}{\alpha-1}\,\log \frac{\sum_{i=1}^{N} w^{(i)}\left[g\!\left(Z_k(u)\,|\,X^{(i)};u\right)\right]^{\alpha}}{\left[\sum_{i=1}^{N} w^{(i)}\, g\!\left(Z_k(u)\,|\,X^{(i)};u\right)\right]^{\alpha}}, $$
where $g(Z\,|\,X;u)$ is the multi-object likelihood function (sensor model). The optimal control vector is chosen by:
$$ u^{*} = \operatorname*{arg\,max}_{u\in\mathbb{U}}\; D_{\alpha}\!\left(\pi_{k}(\cdot\,|u)\,\big\|\,\pi_{k|k-1}\right). $$
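A minimal sketch of the above SMC approximation is given below, assuming that the prior particle weights and the multi-object likelihood values of the PIMS under each particle (for the candidate command) have already been computed; the variable names are illustrative only. The control command is then selected by evaluating this divergence for every admissible command and keeping the maximizer.

```python
import numpy as np

def renyi_divergence_smc(prior_weights, pims_likelihoods, alpha=0.5):
    """SMC approximation of the Renyi divergence between the pseudo-posterior
    and the predicted prior. prior_weights[i] is the weight of the i-th
    multi-object particle; pims_likelihoods[i] is the multi-object likelihood
    of the PIMS under that particle and the candidate command."""
    w = np.asarray(prior_weights, dtype=float)
    g = np.asarray(pims_likelihoods, dtype=float)
    return np.log(np.sum(w * g**alpha) / np.sum(w * g)**alpha) / (alpha - 1.0)
```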
Rényi Divergence-Based Sensor Control with a PHD/CPHD Filter
Approximating a multi-object density by particle sets is evidently very expensive in terms of computation. Indeed, the original method proposed in [11] involved sampling in the measurement space as well, which can be simplified by using the PIMS instead. Even so, the computational cost can be too heavy to implement in the presence of more than five objects in the scene. Later, in 2011, Ristic and Vo [12] proposed Poisson and i.i.d. cluster approximations of the Rényi divergence for sensor control with multi-object tracking using the PHD and Cardinalized PHD (CPHD) filters.
Assume that both the predicted and updated multi-object densities are of i.i.d. cluster type, given by:
$$ \pi(X) = |X|!\;\rho(|X|)\prod_{x\in X} p(x), $$
where $\rho(\cdot)$ and $p(\cdot)$ are the cardinality distribution and the spatial single-object density, respectively. Denoting the predicted prior and the pseudo-posterior by the subscripts 0 and 1, the Rényi divergence for i.i.d. cluster pdfs applicable to the CPHD filter recursion is derived as [12]
$$ D_{\alpha} = \frac{1}{\alpha-1}\,\log \sum_{n\geq 0} \rho_{1}(n)^{\alpha}\,\rho_{0}(n)^{1-\alpha}\left(\int p_{1}(x)^{\alpha}\,p_{0}(x)^{1-\alpha}\,dx\right)^{\!n}. $$
In the PHD filter recursion, where both multi-object densities are Poisson RFS densities, the Rényi divergence becomes simpler, as follows:
$$ D_{\alpha} = \frac{1}{1-\alpha}\left[\alpha\,\mu_{1} + (1-\alpha)\,\mu_{0} - \mu_{1}^{\alpha}\,\mu_{0}^{1-\alpha}\int p_{1}(x)^{\alpha}\,p_{0}(x)^{1-\alpha}\,dx\right], $$
where $\mu$ is the average number of objects in the Poisson RFS and $p(\cdot)$ is the spatial single-object density. In their works, Ristic and Vo [11,12] examine the reward function in a scenario involving multiple moving objects and a controllable moving range-only sensor. The sensor’s detection accuracy (misdetection, noise and false alarm rate) improves for shorter sensor-object distances. Hence, in this case, the sensor control algorithm is expected to drive the sensor as close as possible to all objects. The Rényi divergence in this case is seen to rapidly drive the sensor towards the objects. Consequently, the optimal sub-pattern assignment (OSPA) metric, which is used for performance evaluation, decreases when the sensor control algorithm is in place.
Although several solutions for sensor control using the Rényi divergence have been proposed, the obvious problem with all of them is their computational expense, primarily due to the absence of a general closed-form expression for the Rényi divergence.
4.1.2. Cauchy–Schwarz Divergence
Another information theoretic reward function that has attracted attention recently is the Cauchy–Schwarz (CS) divergence. The CS divergence is based on the Cauchy–Schwarz inequality for inner products and, for two random vectors with probability densities f and g, it is defined as [29]
$$ D_{CS}(f,g) = -\log \frac{\langle f, g\rangle}{\|f\|\;\|g\|}, $$
where $\langle f,g\rangle = \int f(x)\,g(x)\,dx$ is the inner product of the two functions, and $\|f\| = \sqrt{\langle f,f\rangle}$. The CS divergence is a measure of the distance between the two densities. It is to be noted that, when $f=g$, $D_{CS}(f,g)=0$; otherwise, it is positive, and it is a symmetric function. The argument of the logarithm in Equation (15) does not exceed one (according to the Cauchy–Schwarz inequality) and is positive, as probability densities are non-negative. Geometrically, in the CS divergence, the logarithm argument is the cosine of the angle between the two densities in the space of density functions, hence representing the “difference” in information content of the two densities. The Cauchy–Schwarz divergence can also be interpreted as an approximation to the Kullback–Leibler divergence [17].
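For intuition, the definition above can be evaluated numerically for two ordinary densities discretized on a common grid. The sketch below is a generic illustration of the definition (Riemann-sum integration), not an RFS computation.

```python
import numpy as np

def cauchy_schwarz_divergence(f, g, dx=1.0):
    """CS divergence -log(<f,g>/(||f|| ||g||)) between two densities sampled
    on a common grid with cell size dx."""
    f = np.asarray(f, dtype=float)
    g = np.asarray(g, dtype=float)
    inner = np.sum(f * g) * dx
    norm_f = np.sqrt(np.sum(f * f) * dx)
    norm_g = np.sqrt(np.sum(g * g) * dx)
    return -np.log(inner / (norm_f * norm_g))
```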
CS Divergence-Based Sensor Control with a PHD Filter
Consider the predicted and pseudo-posterior multi-object RFS densities, $\pi_{k|k-1}$ and $\pi_{k}(\cdot\,|u)$. Hoang et al. [31] showed that the CS divergence between two RFS densities is given by:
$$ D_{CS}\!\left(\pi_{k|k-1},\,\pi_{k}(\cdot\,|u)\right) = -\log \frac{\int K^{|X|}\,\pi_{k|k-1}(X)\,\pi_{k}(X|u)\,\delta X}{\sqrt{\int K^{|X|}\,\pi_{k|k-1}(X)^{2}\,\delta X}\;\sqrt{\int K^{|X|}\,\pi_{k}(X|u)^{2}\,\delta X}}, $$
where K is the unit of hyper-volume in the single-object state space.
Hoang et al. [31] derived a closed-form solution for the CS divergence between two Poisson RFS densities and proposed how it could be used for sensor control with the PHD filter [4,7]. Assume that the predicted prior and pseudo-posterior densities are Poisson densities in a PHD filter, with their intensity functions denoted by $v_{k|k-1}$ and $v_{k}(\cdot\,|u)$. Note that, with the intensity function $v$ given, the average cardinality $\mu$ and the spatial single-object density $p(\cdot)$ can be computed as follows:
$$ \mu = \int v(x)\,dx, \qquad p(x) = \frac{v(x)}{\mu}. $$
Hoang et al. [31] proved that the CS divergence between the two Poisson RFS densities is simply given by:
$$ D_{CS}\!\left(\pi_{k|k-1},\,\pi_{k}(\cdot\,|u)\right) = \frac{K}{2}\int \left(v_{k|k-1}(x) - v_{k}(x|u)\right)^{2}dx, $$
where K is the unit of hyper-volume in the single-object state space. In simple terms, the Cauchy–Schwarz divergence between any two Poisson point processes is half the squared distance between their intensity functions. Sensor control takes place by choosing the control command that maximizes the above divergence as the reward function:
$$ u^{*} = \operatorname*{arg\,max}_{u\in\mathbb{U}}\; D_{CS}\!\left(\pi_{k|k-1},\,\pi_{k}(\cdot\,|u)\right). $$
Calculation of the reward function depends on how the PHD filter is implemented. In an SMC implementation, suppose that the two intensity functions are approximated by the same particles but with different weights. Note that this assumption is based on what actually happens in the filter because, through the update step, only the particle weights are changed, not the locations of the particles. The sensor control policy (19) is then implemented by evaluating the squared difference between the two sets of particle weights.
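A minimal sketch of the corresponding SMC computation is given below, under the stated assumption that both intensities live on the same particles and differ only in their weights. Constant factors (the hyper-volume unit K and the particle discretization) are deliberately dropped in this sketch, since they do not change which command maximizes the reward.

```python
import numpy as np

def cs_reward_poisson_smc(prior_weights, post_weights):
    """Approximate CS-divergence reward between two Poisson RFS densities whose
    intensities are supported on the same particles and differ only in their
    weights: half the squared distance between the two weight sequences
    (constant scale factors omitted)."""
    w0 = np.asarray(prior_weights, dtype=float)
    w1 = np.asarray(post_weights, dtype=float)
    return 0.5 * np.sum((w0 - w1) ** 2)
```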
For applications that require target tracks, this approach does not fit well, as PHD filters do not provide tracks. In addition, a drawback of the PHD filter for sensor control is that it involves a poor approximation of the multi-target posterior, leading to highly uncertain cardinality estimates [8,9].
CS Divergence-Based Sensor Control with a Multi-Bernoulli Filter
Later, in 2016, based on (18), Gostar et al. [35] derived an approximation for the CS divergence between two multi-Bernoulli densities and used it for sensor control with the cardinality-balanced multi-Bernoulli (CB-MeMBer) filter. Consider the multi-Bernoulli (MB) predicted multi-object prior and pseudo-posterior being parameterized as $\{(r^{(i)},\,p^{(i)})\}_{i=1}^{M}$ and $\{(r^{(i)}(u),\,p^{(i)}(\cdot\,|u))\}_{i=1}^{M}$, respectively, where each pair denotes a possible object (a Bernoulli component) with its probability of existence being $r^{(i)}$, and its state density function conditioned on existence being $p^{(i)}(\cdot)$. Gostar et al. [35] approximate each MB density with its closest Poisson RFS density, which is the Poisson with matching intensity function. The intensity function of an RFS density is its first moment and, for the above two MB densities, the intensities are given by:
$$ v(x) = \sum_{i=1}^{M} r^{(i)}\,p^{(i)}(x), \qquad v(x|u) = \sum_{i=1}^{M} r^{(i)}(u)\,p^{(i)}(x|u). $$
Substituting these intensity functions into Equation (22) results in the sensor control policy of maximizing the CS divergence between the two Poisson approximations, i.e., half the squared distance between the above intensity functions. Again, to calculate the integral in an SMC implementation, suppose that the single-object density of each Bernoulli component is approximated by particles; the sensor control policy (27) is then implemented by evaluating the squared intensity difference directly from the existence probabilities and particle weights. It is important to note that, through the process of computing the PIMS and running the pseudo-update step, each Bernoulli component is assigned a measurement, i.e., the data association is known. Hence, for each particle weight in the prior MB, finding its associated weight in the updated MB is straightforward. See [35] for more details.
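The sketch below illustrates how the Poisson-approximated CS reward for an MB pseudo-posterior can be accumulated component-wise on shared particles, relying on the known per-component association created by the PIMS pseudo-update. It is an illustration under those assumptions, not the exact expression of [35]; constant scale factors are again omitted.

```python
import numpy as np

def cs_reward_mb_poisson(prior_r, prior_w, post_r, post_w):
    """Poisson-approximated CS reward for a multi-Bernoulli filter: half the
    squared distance between the two intensities, accumulated component-wise
    over shared particles.
    prior_r[i], post_r[i]: existence probabilities of Bernoulli component i;
    prior_w[i], post_w[i]: normalized particle weights of component i."""
    reward = 0.0
    for r0, w0, r1, w1 in zip(prior_r, prior_w, post_r, post_w):
        reward += 0.5 * np.sum((r0 * np.asarray(w0) - r1 * np.asarray(w1)) ** 2)
    return reward
```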
CS Divergence-Based Sensor Control with a Labeled Multi-Bernoulli Filter
The above investigation was followed by another approximate solution for the CS divergence between Poisson approximations of the predicted and updated Sequential Monte Carlo (SMC) Labeled Multi-Bernoulli (LMB) posteriors [45]. Consider the LMB predicted multi-object prior and pseudo-posterior being parameterized as $\{(r^{(\ell)},\,\mathbf{p}^{(\ell)})\}_{\ell\in\mathbb{L}}$ and $\{(r^{(\ell)}(u),\,\mathbf{p}^{(\ell)}(\cdot\,|u))\}_{\ell\in\mathbb{L}}$, where each pair denotes a possible target track (with label ℓ), with its probability of existence denoted by $r^{(\ell)}$ and its state density function conditioned on existence denoted by $\mathbf{p}^{(\ell)}(\cdot)$. Note the change of density symbols to boldface, which is customary in the literature when denoting labeled densities. Similarly, labeled single-object and multi-object states are denoted by the bold-face symbols $\mathbf{x}$ and $\mathbf{X}$, respectively.
In a similar fashion to their previous work [35], Gostar et al. [45] showed that, with the above assumptions, the sensor control policy can be written directly in terms of the weights of the particles that approximate the predicted and pseudo-updated single-object densities $\mathbf{p}^{(\ell)}$ and $\mathbf{p}^{(\ell)}(\cdot\,|u)$ in an SMC implementation of the LMB filter.
CS Divergence-Based Sensor Control with a Generalized Labeled Multi-Bernoulli Filter
Beard et al. [33] derived the exact closed-form solution for the CS divergence between two generalized labeled multi-Bernoulli (GLMB) densities, for sensor control in applications where a GLMB filter [37,41], also known as the Vo–Vo filter, is being used to track multiple targets. Let $\mathbf{x}=(x,\ell)$ be a labeled single-object state and $\mathcal{L}(\mathbf{x})=\ell$ be a projection from the labeled state space to the label space, a function that takes the labeled state and returns the label part only. Denoting a labeled set by $\mathbf{X}$, its label set is similarly defined as $\mathcal{L}(\mathbf{X})=\{\mathcal{L}(\mathbf{x}):\mathbf{x}\in\mathbf{X}\}$. For a labeled RFS, each element must have a distinct label, i.e., the cardinality of the set itself must equal the cardinality of its label set: $|\mathbf{X}|=|\mathcal{L}(\mathbf{X})|$. The function
$$ \Delta(\mathbf{X}) = \delta_{|\mathbf{X}|}\!\left(|\mathcal{L}(\mathbf{X})|\right) $$
is called the distinct label indicator. A GLMB is a labeled RFS with a density of the form:
$$ \boldsymbol{\pi}(\mathbf{X}) = \Delta(\mathbf{X})\sum_{\xi\in\Xi} w^{(\xi)}\!\left(\mathcal{L}(\mathbf{X})\right)\prod_{\mathbf{x}\in\mathbf{X}} p^{(\xi)}(\mathbf{x}), $$
where $\Xi$ denotes a discrete and finite index set, each $p^{(\xi)}(\cdot,\ell)$ is a density in the single-object state space, i.e., $\int p^{(\xi)}(x,\ell)\,dx = 1$, and the label set weights $w^{(\xi)}(L)$ are normalized over the space of labels and indexes, i.e.,
$$ \sum_{\xi\in\Xi}\sum_{L\subseteq\mathbb{L}} w^{(\xi)}(L) = 1. $$
The above GLMB density is completely characterized by the set of parameters $\left\{\left(w^{(\xi)}(L),\,p^{(\xi)}\right):\xi\in\Xi,\,L\subseteq\mathbb{L}\right\}$. For a detailed description of the prediction and update steps of the GLMB filter and its particular form called the delta-GLMB, refer to [37,41].
Assume that the predicted GLMB prior and the pseudo-posterior associated with control command u are each characterized by their own sets of label-set weights and single-object densities. Beard et al. [33] showed that the CS divergence between the above GLMB densities has an exact closed form, expressed in terms of the label-set weights of the two densities and the inner products between their corresponding single-object densities. SMC implementation of this expression is straightforward: if each density is approximated with the same particles but different weights, each inner product of densities is calculated by summing all the products of corresponding weights in the two sets of particles.
CS Divergence-Based Sensor Control with Constraints
An important consideration that has been taken into account by Beard et al. [33] and Gostar et al. [45] is the practical constraints that may apply when choosing the optimal sensor control command. Imagine an application in which there is a region of the state space into which the sensor must not be driven. For instance, in a defense application where the objects are enemy targets, the sensor cannot be allowed to end up close to any of the targets. For each control command u, such a region can be denoted by $S(u)$. In such a constrained sensor control problem, we want the region to be void (empty of any objects) with a very high probability, called the “void probability”. For a given multi-object density $\pi$, the void probability for a region S is defined as
$$ q_{\pi}(S) = \int \left[\prod_{x\in X}\big(1-\mathbf{1}_{S}(x)\big)\right]\pi(X)\,\delta X = \Pr\{X\cap S=\emptyset\}, $$
where $\mathbf{1}_{S}(\cdot)$ denotes the indicator function of S.
For the purpose of sensor control, the void probability is computed with the pseudo-posterior density. The work in [45] suggests that, in multi-target tracking with LMB filters, implementation of sensor control with a void probability constraint enables the sensors to be moved in desirable directions for maximizing the information gain, while keeping a safe distance from the targets. When the pseudo-posterior is of LMB form with parameters $\{(r^{(\ell)}(u),\,\mathbf{p}^{(\ell)}(\cdot\,|u))\}_{\ell\in\mathbb{L}}$, and each density $\mathbf{p}^{(\ell)}(\cdot\,|u)$ is approximated with particles $x^{(\ell)}_{j}$ and weights $w^{(\ell)}_{j}(u)$, the void probability is derived as follows [45]:
$$ q_{u}\!\left(S(u)\right) = \prod_{\ell\in\mathbb{L}}\left[1 - r^{(\ell)}(u)\sum_{j} w^{(\ell)}_{j}(u)\,\mathbf{1}_{S(u)}\!\left(x^{(\ell)}_{j}\right)\right]. $$
Taking the void probability constraint into account, the sensor control policy is turned into:
$$ u^{*} = \operatorname*{arg\,max}_{u\in\mathbb{U}:\;q_{u}(S(u))\,\geq\,P_{\min}} R(u), $$
where $R(u)$ is the CS divergence reward and $P_{\min}$ is the minimum void probability, a user-defined threshold that is very close to 1.
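A minimal sketch of the LMB void probability and the constrained command selection is shown below; in_region, reward and void_prob are placeholders supplied by the caller, and p_min corresponds to the user-defined minimum void probability.

```python
def lmb_void_probability(existence_probs, particles, weights, in_region):
    """Void probability of a region under an LMB density: the product over
    tracks of (1 - r_l * P_l(region)), with P_l(region) estimated from the
    track's particles using the predicate in_region(x)."""
    q = 1.0
    for r_l, xs, ws in zip(existence_probs, particles, weights):
        p_in = sum(w for x, w in zip(xs, ws) if in_region(x))
        q *= 1.0 - r_l * p_in
    return q

def constrained_best_command(commands, reward, void_prob, p_min=0.99):
    """Maximize the reward over the commands whose void probability exceeds
    the threshold p_min; returns None if no command is admissible."""
    admissible = [u for u in commands if void_prob(u) >= p_min]
    return max(admissible, key=reward) if admissible else None
```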
When a GLMB filter is being used for multi-object tracking, the void probability is formulated differently. Consider the pseudo-posterior associated with control command u, which is a GLMB characterized by its label-set weights and single-object densities. Beard et al. [33] show that the void probability in this case is a weighted sum, over the GLMB components, of the products of the probabilities that each track of the component lies outside the region. In the case of an SMC implementation, where each single-object density is approximated by particles and weights, these probabilities are simply computed by summing the weights of the particles that fall outside the region. In this case, the control policy maximizes the reward given by Equation (34) over the set of all control commands u for which the void probability calculated by (41) exceeds the user-defined minimum threshold $P_{\min}$.
4.2. Task-Driven Sensor Control
The RFS-based sensor control methods in general, and task-driven sensor control approaches specifically, saw their beginnings in the works published by Mahler in 2003 [3] and 2004 [5], which provided the early investigations devising a foundational basis for sensor management based on the RFS filtering framework. In the task-driven approach, the objective function is formulated as a cost function that directly depends on the tracking performance of the system, quantified by metrics such as estimation error and cardinality variance.
4.2.1. Posterior Expected Number of Targets (PENT)
In 2003, Mahler [3] showed that Csiszár information-theoretic objective functions and geometric functionals lead to tractable sensor management algorithms when used in conjunction with multi-hypothesis correlator (MHC) filtering algorithms. In 2004, Mahler et al. [5] proposed a “probabilistically natural” sensor management objective function called the posterior expected number of targets (PENT), constructed using an optimization strategy called “maxi-PIMS”. PENT was introduced for the control of sensors with a finite field-of-view (FoV), for the purpose of selecting the action that maximizes the posterior expected number of objects (cardinality) returned after updating the multi-object density. Hence, in the general sensor control framework shown in Figure 2, PENT is implemented by maximizing the reward function:
$$ R(u) = \mathbb{E}\!\left[\,|X|\;\middle|\;\pi_{k}(\cdot\,|u)\right]. $$
Note that this term is indeed the Expected A Posteriori (EAP) estimate of the posterior cardinality, and could be replaced with the Maximum A Posteriori (MAP) estimate, turning the sensor control policy into the general form of:
$$ u^{*} = \operatorname*{arg\,max}_{u\in\mathbb{U}}\; \hat{n}\!\left(\pi_{k}(\cdot\,|u)\right), $$
where $\hat{n}(\cdot)$ denotes the chosen (EAP or MAP) cardinality estimate.
Although PENT was originally introduced and its objective function was further formulated for PHD and MHC filters [3,5], the above general control policy is applicable with any multi-object filter in place, including the JPDA, MHT, MB, LMB and GLMB filters. Here are some examples. When a PHD filter with an SMC implementation is used as the multi-object system, the expected posterior cardinality equals the sum of the pseudo-updated particle weights, so the control policy maximizes that sum over the admissible commands. When a multi-Bernoulli filter or an LMB filter with an SMC implementation is in place, the reward is the sum of the pseudo-updated probabilities of existence of the Bernoulli components.
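Both special cases just mentioned reduce to one-line computations; the sketch below is only an illustration of those two sums.

```python
import numpy as np

def pent_reward_smc_phd(pseudo_updated_weights):
    """PENT reward with an SMC-PHD filter: the expected posterior cardinality
    is the sum of the pseudo-updated particle weights."""
    return float(np.sum(pseudo_updated_weights))

def pent_reward_mb(pseudo_updated_existence_probs):
    """PENT reward with an MB/LMB filter: the expected posterior cardinality
    is the sum of the pseudo-updated existence probabilities."""
    return float(np.sum(pseudo_updated_existence_probs))
```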
4.2.2. Cardinality Variance-Based Sensor Control
The PENT method is particularly useful when sensors have a limited FoV. An alternative task-driven approach towards sensor control is to prioritize the accuracy of the resulting cardinality estimate rather than its mean. In this approach, the objective function is the variance of the pseudo-posterior cardinality, a cost to be minimized:
$$ C(u) = \mathbb{E}\!\left[\,|X|^{2}\;\middle|\;\pi_{k}(\cdot\,|u)\right] - \mathbb{E}\!\left[\,|X|\;\middle|\;\pi_{k}(\cdot\,|u)\right]^{2}. $$
Gostar et al. [14,15] derived this cost function for the applications where a multi-Bernoulli filter is used for multi-object tracking, as follows:
$$ C(u) = \sum_{i} r^{(i)}(u)\left(1 - r^{(i)}(u)\right). $$
Similarly, with an LMB filter, the cost function is given by:
$$ C(u) = \sum_{\ell\in\mathbb{L}} r^{(\ell)}(u)\left(1 - r^{(\ell)}(u)\right). $$
Hoang et al. [13] proposed to use the maximum a posteriori (MAP) estimate of cardinality for the variance calculation, i.e., the following cost function:
$$ C(u) = \mathbb{E}\!\left[\left(|X| - \hat{n}_{\mathrm{MAP}}(u)\right)^{2}\;\middle|\;\pi_{k}(\cdot\,|u)\right], $$
where $\hat{n}_{\mathrm{MAP}}(u)$ is the MAP estimate of cardinality. With an LMB filter, they showed that this cost can be computed directly from the cardinality distribution of the pseudo-updated LMB density, where the MAP cardinality estimate is calculated as the mode of that distribution.
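The two cardinality-variance costs for an MB/LMB pseudo-posterior can be sketched as below. The brute-force cardinality distribution is only meant for illustration with a small number of Bernoulli components; the papers cited above provide efficient closed forms.

```python
import numpy as np
from itertools import combinations

def cardinality_variance_mb(r):
    """Statistical cardinality variance of an MB/LMB density: sum_i r_i(1-r_i)."""
    r = np.asarray(r, dtype=float)
    return float(np.sum(r * (1.0 - r)))

def cardinality_pmf_mb(r):
    """Cardinality distribution of an MB density (Poisson-binomial), computed
    by brute force over all subsets of components."""
    r = list(r)
    pmf = np.zeros(len(r) + 1)
    for n in range(len(r) + 1):
        for idx in combinations(range(len(r)), n):
            p = 1.0
            for i in range(len(r)):
                p *= r[i] if i in idx else (1.0 - r[i])
            pmf[n] += p
    return pmf

def map_cardinality_variance_mb(r):
    """Variance of cardinality around its MAP estimate (sketch of the
    MAP-variance cost)."""
    pmf = cardinality_pmf_mb(r)
    n_map = int(np.argmax(pmf))
    n = np.arange(len(pmf))
    return float(np.sum(pmf * (n - n_map) ** 2))
```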
4.2.3. Posterior Expected Error of Cardinality and States (PEECS)
In 2015, Gostar et al. [26] proposed a new cost function called Posterior Expected Error of Cardinality and States (PEECS), in which a linear combination of the normalized errors of the number of objects and their estimated states is considered as the cost function for sensor control,
$$ C(u) = \eta\,\bar{\sigma}^{2}_{\mathrm{card}}(u) + (1-\eta)\,\bar{\sigma}^{2}_{\mathrm{state}}(u), $$
where $\bar{\sigma}^{2}_{\mathrm{card}}(u)$ and $\bar{\sigma}^{2}_{\mathrm{state}}(u)$ denote the normalized variances of the cardinality and state estimates, respectively, and $\eta\in[0,1]$ is a user-defined constant that determines the emphasis on the accuracy of cardinality estimates versus the state estimates. The normalized cardinality variance is given by (50) but divided by $M/4$ (with M the number of Bernoulli components), which is the maximum variance corresponding to the worst case where all probabilities of existence are 0.5.
The normalized variance of the state estimates depends on the application and on the variables included in the single-object state. Consider an application where the principal interest is in the localization error in a 2D space, and let us assume that the state vector includes the x- and y-coordinates of the object. Consider a multi-Bernoulli filter being used as the MOT and implemented using the SMC method, in which each Bernoulli component of the pseudo-posterior is parameterized by its probability of existence and by a density approximated by particles and weights, the location components of each particle being its x- and y-coordinates. Then, the error variance associated with a Bernoulli component is the sum of the variances of the x-coordinate and y-coordinate estimates of the possible object associated with that component, each computed as a weighted variance over the particles. The maximum of these variances occurs when all particle weights are equal, and the normalized state variance of a Bernoulli component is calculated by dividing its variance by this maximum. The total state variance (which remains normalized) is then obtained by combining the normalized variances of all Bernoulli components.
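A minimal sketch of the PEECS cost for an SMC multi-Bernoulli pseudo-posterior, following the description above, is given below. The normalization of the state variance by its equal-weight value and the averaging over components are assumptions of this sketch rather than the paper's exact expressions.

```python
import numpy as np

def normalized_cardinality_variance(r):
    """Cardinality variance of an MB density divided by its worst case M/4."""
    r = np.asarray(r, dtype=float)
    return float(np.sum(r * (1.0 - r)) / (0.25 * len(r)))

def normalized_state_variance(particles_xy, weights):
    """Per-component weighted variance of the x and y location estimates,
    divided by the equal-weight variance of the same particles, then averaged
    over the Bernoulli components."""
    total = 0.0
    for xy, w in zip(particles_xy, weights):
        xy = np.asarray(xy, dtype=float)
        w = np.asarray(w, dtype=float)
        mean = np.average(xy, axis=0, weights=w)
        var = np.sum(w[:, None] * (xy - mean) ** 2)
        var_max = np.sum((xy - xy.mean(axis=0)) ** 2) / len(w)
        total += var / var_max if var_max > 0 else 0.0
    return total / len(weights)

def peecs_cost(r, particles_xy, weights, eta=0.5):
    """PEECS cost: linear combination of normalized cardinality and state variances."""
    return eta * normalized_cardinality_variance(r) + \
        (1.0 - eta) * normalized_state_variance(particles_xy, weights)
```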
4.3. Selective Sensor Control
All the methods discussed so far deal with the sensor control problem for multi-target tracking of a group of targets or a target ensemble. The labeled random finite set filters, such as the LMB and GLMB filters, allow for tracking the target trajectories along with estimating the number of targets and their states within the stochastic filtering scheme. Therefore, with such a filter used as the MOT, one can control the sensor(s) towards acquiring the most useful measurements for the purpose of tracking the targets of interest (ToIs), which are those with specific labels of interest.
4.3.1. Maximum Confidence in Existence
Panicker et al. [16] investigated how the target label information returned by an LMB filter can be effectively used for sensor control focused on some ToIs. An intuitive solution is also proposed in [16] for scenarios in which one or more of the ToIs temporarily disappear from the tracking scene. Their method is a task-driven sensor control routine with a cost function that depends on the pseudo-updated states of only the ToIs.
This approach is based on maximizing the expected confidence of the filter’s inference on the existence of the ToIs. After the pseudo-update of the LMB density, the filter returns an estimate of the total number of existing ToIs, given by the cardinality mean
$$ \hat{n}_{\mathrm{ToI}}(u) = \sum_{\ell\in\mathbb{L}_{I}} r^{(\ell)}(u), $$
where $\mathbb{L}_{I}$ is the set of labels of interest and $r^{(\ell)}(u)$ is the pseudo-updated probability of existence for the object with label ℓ. The confidence in this estimate is inversely related to the variance of the cardinality. Therefore, to maximize the confidence, the following cost function is minimized:
$$ C(u) = \sum_{\ell\in\mathbb{L}_{I}} r^{(\ell)}(u)\left(1 - r^{(\ell)}(u)\right). $$
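This cost simply reuses the multi-Bernoulli cardinality-variance expression, restricted to the labels of interest; a one-line sketch:

```python
def max_confidence_cost(existence_prob_by_label, labels_of_interest):
    """Selective cost: cardinality variance restricted to the ToIs; minimizing
    it maximizes confidence in the filter's inference on their existence."""
    return sum(existence_prob_by_label[l] * (1.0 - existence_prob_by_label[l])
               for l in labels_of_interest)
```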
4.3.2. Selective-PEECS
As an alternative selective sensor control solution, the Selective-PEECS approach [16] employs the PEECS cost function [26]. The selective-PEECS cost function is similar to PEECS in Equation (55), with the difference that the normalized variances of the cardinality and state terms are computed by summing over the labels of interest only.
4.4. Extension to Multi-Sensor Control
Most of the work discussed earlier uses a single-sensor control strategy for tracking multiple targets. In a multi-sensor scenario, the control solution is straightforward if the sensors form a centralized network. Consider a number of sensor nodes that are all connected to a central node, as shown in Figure 4. At each time k, the sensor in each node receives a command from the central node and changes its state (e.g., moves, spins, changes its gain and so on) accordingly. Then, it generates a measurement set of detections that is locally used to run the update step of a multi-object filter. All of the local posteriors are then communicated back to the central node.
In the central node, the received local posteriors are fused and the resulting multi-object density is used to run the prediction step of the multi-object filter centrally. The predicted prior is then used for two purposes: (i) it is communicated back to each sensor node for the local update to run at the next time step, and (ii) it is input to a multi-sensor control routine that generates a tuple of control commands (one per sensor) and sends each command to the corresponding sensor node.
Algorithm 1 shows the most straightforward process that can be implemented for the “Multi-Sensor Control” block in Figure 4. In the first step, a multi-object estimate is computed from the prior. It is then used to calculate the PIMS for each sensor node, after the node is hypothetically actuated according to each possible control command. The PIMS is then used to run a pseudo-update step in the central node, which results in a pseudo-posterior for each node and each control command.
After all the possible pseudo-posteriors are computed for all sensor nodes and all control commands, for each tuple of multi-sensor control commands, the corresponding pseudo-posteriors of the different sensor nodes are fused, and the resulting pseudo-posterior is used to compute the objective function. In Algorithm 1, this is a reward function that is maximized to return the optimal multi-sensor control command.
Note: The reward (or cost) function used in line 11 of Algorithm 1 can be any of the functions discussed in this section: divergence-based, task-driven, or selective.
It is also important to note that the information fusion operation used in line 10 of Algorithm 1 must be the same operation that is used for fusion of the real posteriors in the central node (the “Information Fusion” block in Figure 4). One of the widely used methods for fusion of multiple posteriors is the Generalized Covariance Intersection (GCI) rule, which is employed for consensus-based fusion of multiple multi-object densities of various forms. It has been used for fusion of Poisson multi-object posteriors of multiple local PHD filters [48], multi-Bernoulli densities of local multi-Bernoulli filters [49], i.i.d. cluster densities of several CPHD filters [50], and GLMB densities of several local Vo–Vo filters or LMB densities of several LMB filters [51].
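For the Poisson case, the GCI rule has a particularly simple form: the fused density is again Poisson, with intensity equal to the weighted geometric mean of the local intensities. A minimal sketch on a common grid is given below; the grid-based evaluation and the choice of fusion weights are assumptions of this illustration.

```python
import numpy as np

def gci_fuse_poisson_intensities(local_intensities, fusion_weights):
    """GCI fusion of Poisson multi-object densities represented by their
    intensity functions evaluated on a common grid: the fused intensity is the
    weighted geometric mean, with fusion weights summing to one."""
    w = np.asarray(fusion_weights, dtype=float)
    assert np.isclose(w.sum(), 1.0), "fusion weights must sum to one"
    log_v = sum(wi * np.log(np.maximum(np.asarray(vi, dtype=float), 1e-300))
                for vi, wi in zip(local_intensities, w))
    return np.exp(log_v)
```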
Algorithm 1 Step-by-step operations that run inside the multi-sensor control block in Figure 4.
1: function Multi_Sensor_Control(predicted multi-object density π)
2:   X̂ ← Estimate(π)
3:   for each sensor node s = 1, …, N_s do
4:     for each admissible command u ∈ 𝕌 do
5:       Z(s, u) ← PIMS(X̂, s, u)
6:       π(s, u) ← Update(π, Z(s, u))
7:     end for
8:   end for
9:   for each command tuple (u_1, …, u_{N_s}) ∈ 𝕌^{N_s} do
10:    π̄(u_1, …, u_{N_s}) ← Fuse(π(1, u_1), …, π(N_s, u_{N_s}))
11:    R(u_1, …, u_{N_s}) ← Reward(π̄(u_1, …, u_{N_s}))
12:  end for
13:  (u_1*, …, u_{N_s}*) ← arg max R(u_1, …, u_{N_s})
14:  return (u_1*, …, u_{N_s}*)
15: end function
ARAPP (Accelerated Ratio of Absence to Presence Probability) Method
As a multi-sensor extension to the selective sensor control solutions discussed earlier, the ARAPP approach [52] employs the RAPP (Ratio of Absence to Presence Probability) cost function [52]. This method optimizes a closed-form objective function, RAPP, which can be calculated directly after the prediction step in the central node of the sensor network. Because the cost function is computed before the pseudo-update operations, a large amount of computation is saved. Numerical experiments in the paper indicate that the proposed method leads to significant improvements in the tracking accuracy of the objects of interest compared to the common non-selective sensor control methods. When compared to selective-PEECS, the state-of-the-art method for selective sensor control, it performs similarly in terms of the mean-square error (MSE) of tracking the targets of interest, but is significantly (eight times) faster, which suggests a substantial reduction in the computational overhead.
5. Comparative Simulation Results
This section presents a few samples of recent works in which the performance of various sensor control algorithms is compared through simulations that involve multiple manoeuvring targets. In all the simulations, the single-target state includes the location coordinates and speeds of the target, i.e., $x_k = [\,p_{x,k}\;\;\dot{p}_{x,k}\;\;p_{y,k}\;\;\dot{p}_{y,k}\,]^{\top}$. Each target randomly moves from time $k-1$ to $k$ according to the linear Gaussian constant-velocity model
$$ x_k = F\,x_{k-1} + w_k, \qquad F = I_2 \otimes \begin{bmatrix} 1 & T \\ 0 & 1 \end{bmatrix}, $$
in which $T$ is the sampling time and $w_k$ is a zero-mean Gaussian process noise whose covariance is scaled by the noise power $\sigma_w^2$.
A controllable sensor is used to detect the targets and return their range and bearing angles in a surveillance area. The probability of the sensor detecting an object is a decreasing function of the distance between the object and sensor locations, denoted by $p$ and $s$, respectively. Conditional on detection, the measurement returned by the sensor is noisy, formulated by:
$$ z = \begin{bmatrix} \|p - s\| \\ \angle(p - s) \end{bmatrix} + \begin{bmatrix} n_r \\ n_\theta \end{bmatrix}, $$
where $\angle(p-s)$ denotes the bearing angle of the object with respect to the sensor, and $n_r$ and $n_\theta$ are the range and bearing measurement noises, both normally distributed with zero mean but with different and varying variances. Similar to the probability of detection, the noise power terms also depend on the sensor-object distance: both variances increase as the sensor-object distance grows.
In a radar sensor, the measured range is derived from the pulse transit time between the sensor and target (the delay of the received echo), which is proportional to the distance between them. In a recent work, Yong et al. [53] state that, although the radar system errors are impacted by various factors such as the altitude of the sensor (radar) and the range and elevation between the sensor and target, the main factors of influence are the range and elevation. Importantly, in the model above, the probability of detection decreases with the sensor-object distance (see Figure 5), and the power of the measurement noise increases with the sensor-object distance. Hence, the sensor returns more accurate measurements (with less noise and fewer misdetections) the closer it is to the objects, and the sensor control solution is expected to drive the sensor towards the centre of all objects as they manoeuvre.
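To illustrate this kind of measurement model, the sketch below generates a range–bearing detection with distance-dependent detection probability and noise power. The specific functional forms and parameter values here are illustrative placeholders, not those of Equation (67).

```python
import numpy as np

def simulate_detection(p_obj, p_sensor, pd_at_zero=0.99, pd_scale=1000.0,
                       sigma_r0=1.0, sigma_b0=np.pi / 180, noise_scale=1000.0,
                       rng=None):
    """Range-bearing measurement with distance-dependent detection probability
    and noise power (placeholder forms). Returns None on misdetection."""
    rng = rng or np.random.default_rng()
    p_obj, p_sensor = np.asarray(p_obj, float), np.asarray(p_sensor, float)
    d = np.linalg.norm(p_obj - p_sensor)
    p_d = pd_at_zero * np.exp(-d / pd_scale)        # detection prob. decays with distance
    if rng.random() > p_d:
        return None                                 # misdetection
    sigma_r = sigma_r0 * (1.0 + d / noise_scale)    # noise power grows with distance
    sigma_b = sigma_b0 * (1.0 + d / noise_scale)
    rng_meas = d + rng.normal(0.0, sigma_r)
    brg_meas = np.arctan2(p_obj[1] - p_sensor[1],
                          p_obj[0] - p_sensor[0]) + rng.normal(0.0, sigma_b)
    return np.array([rng_meas, brg_meas])
```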
The above scenario and models for target motion and measurement uncertainty are among the most common setups considered for performance evaluation in the sensor control literature. Table 2 shows a list of such methods and the scenarios considered in them.
Figure 6 illustrates the typical resultant sensor trajectories for the Rényi divergence-based and the MAP cardinality variance-based sensor control strategies, as reported by Hoang et al. [32]. In their simulation, five targets manoeuvre in a 1000 m × 1000 m surveillance area. The sensor starts from the origin and, at each time k, its previous location moves to one of the admissible locations in a finite control command space of radial and angular step displacements around the current location. In the experiment shown in Figure 6, we have borrowed the corresponding step-size and angular parameters from [32]. All the admissible sensor movements from the current location are shown as solid black circles in Figure 7. The parameters of the detection profile in Equation (67) and of the measurement noise are chosen to match this scenario.
The task-driven PEECS method [26] is compared with the MAP cardinality variance method [32], and the resulting sensor locations are shown in Figure 8. Figure 9 compares the OSPA errors of PEECS and the Rényi divergence-based method suggested in [32]. As both cardinality and localization errors are considered in the cost function of PEECS, this method improves the sensor control performance for multi-target scenarios with high clutter. In Figure 10, the estimation errors of the PEECS sensor control method are compared to those of the PHD-based methods [26]. Similarly, the OSPA errors returned by the sensor control method based on the Cauchy–Schwarz divergence [35] are compared to those of the Rényi divergence-based sensor control [11], as presented in Figure 11.
Figure 12 and Figure 13 show the simulation results borrowed from [16] for a selective single-sensor multi-target scenario. While Figure 12a demonstrates the sensor path returned by the maximum confidence method, Figure 12b provides the sensor trajectories returned by the selective and non-selective PEECS sensor control methods. Comparing the trajectories, it can be inferred that the selective sensor control methods serve selective tracking applications better than the traditional non-selective methods. Figure 13a provides the mean squared error (MSE) of tracking the ToIs, in order to compare the performance of the selective method with PEECS [26], the non-selective sensor control routine. Figure 13b demonstrates the results obtained for the case of a disappearing target. It is shown in [16] that the tracking error for the ToIs is reduced and the speed of computation is considerably increased when selective sensor control is employed.
6. Discussion on Performance Comparison
In this section, we provide a performance comparison of the various methods, considering the contributions they have made towards sensor control. In the sensor control literature, performance enhancement is usually demonstrated in terms of a reduction in tracking errors; computational speed has not been a major focus of comparison in the recent literature. Hence, this review covers the performance comparisons in terms of tracking accuracy in a comprehensive manner. However, acknowledging the importance of computation, we also provide a table comparing computational speed (Table 3), as presented in the recent literature on selective sensor control methods [52]. It compares the labeled RFS-based selective sensor control methods with a state-of-the-art non-selective sensor control method.
A comparative study of the Rényi divergence (information-theoretic) and cardinality variance (task-driven) approaches, discussed by Hoang and Vo [32], implies that, although both control methods perform well in cases with high target observability, their performance degrades in specific cases where the observability is low. In such scenarios, the Rényi divergence method performs better than the cardinality variance method, but the cardinality variance method results in a smoother sensor trajectory, as observed from Figure 6. On comparing the OSPA performance of these methods with PEECS sensor control, as discussed in [26] and observed from Figure 9, it is evident that PEECS exhibits better performance, primarily due to its structural emphasis on error terms that are similar to the terms forming the OSPA metric. PEECS performs similarly to the MAP cardinality variance cost function, differing in its use of the variance of cardinality around the mean (statistical variance) instead of the variance around the MAP estimate. In addition, errors in both the cardinality and state estimates impact the PEECS cost. Figure 10a,b indicate that the OSPA errors of PEECS converge to a minimum faster than those of PENT and the PHD-based Rényi divergence methods.
Comparing these non-selective sensor control methods with the selective control methods returns promising results for the latter, both in terms of computational speed and in terms of the performance metric (the mean-squared error (MSE) of the state estimates of the targets of interest). This is evident from Figure 13, which clearly shows significant improvements in the state estimation errors for the ToIs with selective methods such as maximum confidence and selective-PEECS, compared to the non-selective PEECS method.
Considering computational speed, due to the lack of a generic closed-form solution, Rényi divergence-based sensor control is usually implemented with set-valued particle approximations and is hence substantially more computationally expensive than task-driven solutions such as PEECS, whose cost functions are computationally cheap. Indeed, PEECS sensor control is shown to be at least four times faster than the PHD-based sensor control methods [12]. The closed-form solution for the Cauchy–Schwarz divergence proposed by Gostar et al. [35] (in a multi-Bernoulli filtering scheme) also outperforms the information-theoretic sensor control methods in terms of computational speed, while maintaining comparable tracking accuracy. In Figure 11, the OSPA errors returned by the sensor control method based on the Cauchy–Schwarz divergence [35] are shown in comparison to those of the Rényi divergence-based sensor control [11], as reported in [35]. It can be inferred from Table 3 that, in a multi-sensor scenario, the ARAPP sensor control method is very efficient: it is 36 times faster than the non-selective PEECS and eight times faster than the selective-PEECS method.
7. Conclusions and Future Directions
One of the most significant challenges with multi-sensor control is its computational cost, due to the need to search the multi-dimensional multi-sensor command space. Indeed, looking at lines 9–12 of Algorithm 1, for each tuple of sensor commands, a fusion operation followed by a reward calculation needs to be conducted. Algorithm 1 presents the most straightforward approach, which involves an exhaustive search for the optimal multi-sensor control command. For practical feasibility, especially in real-time applications, there is significant interest in alternative accelerated search routines.
Wang et al. [36] have recently introduced a guided search method to solve the multi-dimensional optimization problem inherent to multi-sensor control, using an accelerated scheme inspired by the Coordinate Descent Method (CDM) [36]. This results in a significant improvement in the runtime of the algorithm and in its real-time feasibility in the presence of multiple sensors. However, there is still a large room for improvement, as the CDM does not exploit the multi-object density and PIMS information at its core and is merely an accelerated search over the multi-dimensional command space.
The computational cost can also be reduced by devising new ways to calculate the objective function without going through the pseudo-update steps of the filter, which can be very expensive. Sensor control is effective mainly because the sensor’s detection profile depends on the sensor state. This dependency might therefore be formulated directly into a closed-form approximate derivation of the objective function, so that it can be calculated without running the pseudo-update step for each admissible (multi-)sensor control command.