1. Introduction
Wireless positioning is the most popular approach for indoor location-based services. It finds applications in emergency rescue, smart home, security monitoring and many other areas. In contrast to outdoor positioning, whose dominant approach is based on the global positioning system (GPS), indoor positioning is hindered by multiple sources of interference and multipath effects in time-varying environments. In view of this, multiple techniques for indoor positioning are currently being investigated. The most popular ones are based on radio frequency identification [
1,
2], Bluetooth signals [
3,
4], ultra-wideband signals [
5,
6] and WiFi signals [
7,
8]. Due to their large coverage, low implementation cost and availability in existing mobile devices, WiFi-based indoor positioning is the most promising technique.
A broad classification of indoor localization methods can be done in two categories, namely, geometric mapping and fingerprinting [
9]. In geometric mapping, the target’s position is estimated using geometric information like distances and angles from the target to a set of access points (APs) [
10]. Typical techniques for obtaining geometric information are time of flight (ToF) [
11] and angle of arrival (AoA) [
12]. This approach typically requires obtaining information from more than three APs. the resulting methods are based on line of sight (LOS) signals, and therefore are easily influenced by obstacles. Strict synchronization is also needed between different devices. The above limitations are overcome by the fingerprinting method [
13]. It consists of building a database containing the values of certain received signal features at a given set of known locations. Localization is then done by comparing the features obtained at the target’s location with those of the database. Localization based on fingerprinting is more immune to distortions caused by the environment, since the latter are coded in the feature database. However, it suffers from poor positioning accuracy when the target is in a position far from anyone in the database.
The most commonly WiFi signal feature used for indoor localization is its received signal strength (RSS) [
14]. It measures the signal power strength at the receiver side. Although RSS measurements are easy to obtain, only a very rough estimation of the receiver’s position can be obtained from it. This is due to the fading and distortion suffered by signals propagating through multipaths. To tackle this problem, in recent years channel state information (CSI) has been used instead of RSS [
9]. CSI contains information of the phase and amplitude value of each subcarrier of an orthogonal frequency division multiplexing (OFDM) channel. It can be readily obtained from any OFDM system based on the 802.11n protocol. Since multipath and fading information is represented in the CSI, its use for indoor localization leads to more accurate and stable results in comparison to RSS.
To improve the accuracy of any given localization method, extra information can be obtained from a motion model of the target. The common solution for fusing the information from the motion model with that from measurements consists of using some Bayesian tracking technique. We refer to resulting techniques as dynamic localization methods [
15]. One such techniques is the Kalman filter [
16,
17,
18]. While being optimal and having a closed-form, it only applies to linear Gaussian model. This assumption can be relaxed using the extended Kalman filter [
19]. However, since this method is simply based on linearizing a non-linear model, it often leads to inaccurate results. This can be overcome by using particle filtering [
20,
21,
22,
23]. This essentially consists of approximating non-Gaussian probability distributions using a number of samples (particles) multiplied by their associated weights.
As it is known, a Bayesian tracking method consists of alternating two steps called prediction and update. In a particle filter, a set of new particles is drawn during each prediction step, whose weighs are determined during the subsequent update step. A drawback of this approach is that this weighting results in little use of those particles whose weights are very small. The number of wasted particles can be very large in the case where measurements already provide enough information to determine the target’s location with a reasonable accuracy, without using the information provided by the motion model. This is often the case in localization setups, for otherwise, algorithms not using a motion model would not work with reasonable accuracy. Consequently, a very large number of particles are required to avoid particle depletion after the update step, which would result in an inaccurate result. The vast number of required particles renders the approach numerically intractable in many applications.
In order to tackle the above shortcoming, a number of methods are available to reduce the estimation error resulting from using particle filtering with a restricted number of particles. In [
21], a machine learning classifier is added to decide whether particles are in the correct room. In [
22], a context variable, with values in the set {‘free space’, ‘constrained space’, ‘static space’}, is added to the state to account for the movement constraints at the target’s current position. Also, the authors of [
23] divide the environment into reachable and unreachable areas, so as to discard particles falling into an unreachable area.
Generally speaking, the above methods aim to reduce the particle depletion phenomenon described above by complementing the motion model with environment- dependent information to decide where to draw particles. The aim of this work is to make the most of this principle, i.e., ideally to totally avoid particle depletion. To this end, we propose a novel particle filtering scheme which combines the prediction and update steps into a single one. In other words, we avoid drawing unnecessary particles during the prediction step that will afterwards disappear during the subsequent update step. This drastically reduces the required number of particles, making the approach numerically feasible in virtually any application. In deriving our method, we exploit the principle underlying a Bayesian tracking method called maximum likelihood Kalman filtering (MLKF) [
24]. We then call the proposed method the maximum likelihood particle filter (MLPF).
The authors used the MLKF in [
25] to propose a dynamic localization method which only applies to the case in which the motion model is linear and Gaussian. In this sense, the method proposed in this work can be thought of as the generalization of the method we proposed in [
25] to the case of non-linear motion models. This kind of models are typically used in robotics [
26], and as we show with experiments using real data, leads to great accuracy improvements.
The rest of the paper is organized as follows: In
Section 2, we state the dynamic localization problem. In
Section 3, we describe the particle filtering approach for solving it. In
Section 4, we derive the proposed MLPF method. In
Section 5, we present experimental evidence, showing the numerical advantages offered by the proposed method. Concluding remarks are given in
Section 6. For ease of readability, all proofs appear in
Appendix C.
3. Available Solution Using Particle Filtering
As described in
Section 1, the dynamic localization problem can be solved using the well-known particle filtering method [
27]. In this section, we describe this approach.
The Bayesian filtering equations are initialized by
, and proceed as follows
Equations (
5) and (6) are called the prediction and update steps, respectively. These equations do not have close form expressions, in general. Hence, a numeric method is needed to approximately compute them. This can be done using particle filtering. More precisely, we start with
Then, at time
, the prediction step is computed by
with
obtained by drawing it from the following distribution
Also, the update step is computed by
with
normalized so that
.
The use of particle filtering for solving the localization problem permits accommodating the accuracy by increasing the number of particles. As mentioned in
Section 1, a problem that often occurs is that the desired accuracy requires a very large number of particles. This can result in a numerically very expensive algorithm. To address this issue, in the next section we propose a variant of the particle filtering algorithm, which drastically reduces the number of particles required to achieve a given accuracy.
4. Proposed Algorithm
In the particle filtering algorithm described in
Section 3, at time
t, we use the approximation (
9) of the updated distribution
, together with the state-transition Equation (
1), to generate the approximation (
8) of the predicted distribution
. The latter is then used in combination with the likelihood function
to approximate the new updated distribution
. This is done by weighting the
i-th particle
by a weight
proportional to the particle’s likelihood
. As mentioned in
Section 1, a drawback of this approach is that this weighting wastes many particles. In order to avoid this, we combine the prediction and update steps into a single one. More precisely, particles for approximating the updated distribution
are directly drawn using the knowledge of the particle approximation of the previous updated distribution
, without drawing particles to represent the predicted distribution
. We describe below how this is done.
Notation 1. In order to simplify the notation we remove the dependence of Q on , i.e., we use Q instead of .
Notation 2. Letand, for , We assume that at time
t we know the following approximation
Our goal is to use the above to build an approximation of the form
Our first step is to derive a convenient decomposition of . This is done in the following lemma.
Lemma 1. The following equality holds in the system given by (1), (2) and (4)where It follows from Lemma 1 that, in order to find an expression of
we need expressions of
and
. Under the approximation (
10), the desired expressions are given in the following lemma.
Lemma 2. If (10) holds, thenwhere to simplify the notation we use , and denotes the approximately proportional sign. Alsowhere denoted the Jacobian operator,and In order to build an approximation of the form (
11) we need a method for drawing samples from
. In view of (
12), this can be done by drawing samples from
and
. It follows from (16) that
is a Gaussian mixture distribution. Hence, we can readily draw samples from it. On the other hand, from (
13) and (
15) we obtain
from where it is not clear how to draw samples. Our strategy to go around this consists finding a convenient Gaussian approximation
of
, from where we can draw samples, and then weight these samples to account for the difference between
and
.
To derive the Gaussian approximation
we use (
13) and proceed in three steps. In Lemma 3 we do a Gaussian approximation of
, in Lemma 4 we do the same with
, and in Lemma 5 we combine these two approximations to obtain
.
Lemma 3. If (10) holds, thenwhere with and denoting the identity matrix. Lemma 4. The following approximation holdswherewith Combining Lemmas 3 and 4 we obtain the Gaussian approximation of given in the following lemma.
Lemma 5. The following approximation holdswherewith Combining the results in Lemmas 1, 2 and 5 we can build the desired approximation (
11) following the procedure described in Algorithm 1.
Algorithm 1 Proposed localization algorithm. |
At , initialize using ( 7). At each : - 1.
Compute and using ( 18) and (19). - 2.
For each , do:
- (a)
Pick an index by drawing it from the discrete distribution such that i has probability proportional to . - (b)
Draw from . - (c)
Compute the particle weight using . - (d)
Pick an index by drawing it from the discrete distribution such that i has probability proportional to . - (e)
Draw from . - (f)
Put .
- 3.
Normalize the weights so that . - 4.
Estimate the position using .
|
5. Experimental Validation
In this section, we evaluate the performance of our proposed indoor localization method. To this end, we compare it with that of two methods based on particle filtering (PF). The first one is a newly proposed RSS-based method described in [
20], which we refer to as PF-RSS. In order to assess the advantage resulting from using CSI measurements instead of RSS ones, the second method is the CSI-based version the PF-RSS one. It is obtained by using the PF method, described in
Section 3, instead of the proposed MLPF method. We refer to it as PF-CSI. We also consider the dynamic localization method from [
25], which we call MLKF, and only applies to linear Gaussian motion models like the one given in
Appendix B. We also consider the localization performance obtained via static localization. To this end, we use the static localization method proposed in [
25], which, as shown in that work, outperforms other available static positioning methods like FILA [
28], DeepFi [
29], and PhaseFi [
30].
As APs we used TP-Link TL-WDR4310 routers with the OpenWrt platform installed. To acquire CSI values we use the Atheros CSI tool [
31]. It provides CSI values of 56 subcarriers, and for each one, two 10 bit values are used to represent its phase and amplitude. Motion capture cameras with millimeter-level accuracy are used to obtain the ground truth position.
For the target we use a two-wheel robot controlled by a NVIDIA Jetson Nano developer board, equipped with an Atheros AR9590 network interface controller. We consider a target whose motion is described by the velocity model given in
Appendix A.1, with
,
and
. To generate the measurements we pass the acquired CSI phase values through the linear calibration stage proposed in [
30] (
§ II.B), then unwrap the resulting phases and use the resulting phase differences as fingerprints. The whole procedure for generating fingerprints is described in [
25]. The measurement model is given by the following Gaussian kernel expansion
where
,
, denote the positions used to build the fingerprint database. Following [
32] (Chapter 4.3.2) we choose
. Also,
is computed so that, for each
,
matches the CSI measured at
.
As performance metric we use the mean squared localization error
, i.e., if the estimated pose at time
t is
and the ground truth is
, then
For the PF method, following [
23] we use
particles. Also, for the MLKF method, we use the linear Gaussian motion model described in
Appendix B, with
and
.
In the first experiment, we aim to evaluate the performance in large environments. To this end we use an empty room of
m
. Only one AP is placed at the corner of the room as shown in
Figure 2. To build the fingerprinting database, we measured CSI’s at
positions, one meter apart from each other, on a grid as also shown in the same figure. As the target moves within the room, we estimate its position every
seconds.
As we can see from
Table 1, the proposed MLPF method, with only 10 particles, clearly outperforms the outcomes of static positioning and particle filtering. Its performance is comparable to that of the MLKF. However, it outperforms the latter if we increase the number of particles to 50. This is due to the use of a non-linear motion model which yields more accurate statistical knowledge of the target’s position at each sample time. Also, although particle filtering should yield the theoretically optimal result if the number of particles is large enough, the very large number of required particles result in that, even with 1000 particles, its accuracy is significantly inferior than that of the proposed MLPF method with 10 particles.
Figure 3 shows how the positioning accuracy of the proposed method improves with the number of particles. In
Figure 4 we show the error cumulative distribution (ECD) of each method, i.e., for each error (in meters), we show the proportion of positions whose localization error is smaller than that error. The estimated trajectory and ground truth of the experiment are shown in
Figure 5.
The second experiment aims to show the performance of our method in living and working environments. To this end we use two adjacent offices of
m
connected by a corridor. We build the fingerprint database using
positions, as shown in
Figure 6, together with the AP’s location.
We see from
Table 2 that, while in the empty room the proposed method with 10 particles performed similarly to the MLKF method, in this second experiment it largely outperforms the latter. The table also shows the clear advantage of the proposed method over its rivals. The reason for this advantage is as follows. Due to the signal distortion and attenuation caused by multipaths, walls and furnitures, the localization information provided by measurements is not enough to yield accurate estimates at certain locations. This can be seen in the ECD shown in
Figure 7. At those locations, dynamic localization methods resort to motion model information to produce an estimate.
Figure 7 shows that only the proposed MLPF method can produce accurate estimates at those locations. This is because it is the only method that is able to make full use of the rich information provided by a non-linear motion model. While methods based on PF potentially have the same possibility, the extremely large number of required particles make this unfeasible in practice. To conclude the experiment, we show in
Figure 8 the ground truth and estimated trajectories.