1. Introduction
There are different methods for indoor localization based on RF signals. In the Time of Arrival (TOA), Time Difference of Arrival (TDOA), and Angle of Arrival (AoA) methods [
1], a target is estimated by calculating the time differences, distances, and angles, respectively, between a target and transmitters using the received signals from deployed beacons. On the other hand, the WiFi fingerprinting approach estimates a target location based on pattern recognition through machine learning methods. This paper focuses on WiFi fingerprinting localization.
Figure 1 illustrates multiple fingerprinting indoor localization using a WiFi router and its measurements, with different colors representing different users’ trajectories.
The Wi-Fi indoor positioning method requires a database of signal strength fingerprints collected from surrounding areas for each predefined location. In an indoor environment with many obstacles, such as walls blocking radio waves, more fingerprint data are required for higher positioning accuracy. In considering the resolution of the Wi-Fi signal strength according to the distance, fingerprint data are usually collected at intervals of 1 to 2 m to form a training database. When providing the positioning service, the current location is predicted by comparing the currently received signal strength set with the previously built database. Because this is the same as the training and test phases of machine learning approaches,
k-nearest neighbors, support vector machine, Gaussian processes, and deep neural networks can be used for fingerprinting localization [
2,
3,
4,
5].
When applying a deep neural network for WiFi indoor positioning, a wireless signal strength set is configured as an input of the neural network, a location is set as an output terminal, then a hidden network is learned as a positioning model. In the learning phase, the model is trained using the signal strength fingerprint data collected over the entire positioning region, and in the test phase the predicted position is obtained by inputting the query signal strength set to the learned neural network model.
Most indoor localization focuses on estimating the location of a single target [
6,
7,
8,
9,
10,
11,
12], with only a few addressing multi-target localization. In [
13,
14], multi-targeting was achieved using UHF RFID and microphone devices, respectively. These methods require a number of devices to be located in a room, which is not feasible for general localization purposes. In [
15,
16], a WiFi fingerprint was used for multiple targets. In [
15], an extraction method of signal reflection corresponding to multiple users was proposed. This technique requires a controlled environment in which measurements of near-static WiFi devices are available. In [
16], graph-based optimization was used to fuse WiFi and dead reckoning for a single target. When this filtration method is expanded to multi-user scenarios, the number of users and measurements strongly affects the time complexity.
Existing neural network-based positioning technology is inefficient in terms of real-time processing speed because it has no choice but to perform estimation calculations as many times as requested when a large number of users request positioning service. When indoor positioning service is provided from a server to local smart-devices in a centralized way, the service quality depends on the time required to estimate the location, not on the learning processing time. Therefore, a new positioning method for simultaneous multiple users is needed.
As it becomes necessary to infer many locations at the same time, the dimensionality of the machine learning input data increases. Because a deep neural network has excellent reasoning ability for learning high-dimensional data, it is suitable for multi-location estimation problems. This paper handles prepocessing and standardization of WiFi received signal strength indicator (RSSI) measurements, which are significant for obtaining good learning performance.
To evaluate the proposed multi-target estimation approach based on an artificial neural network for predicting 3D information, including both location and floor, experimental data were collected in a five-story building. We analyzed the number of targets that could be estimated without loss of significant accuracy. The experimental results show that fifteen targets can be accurately estimated with similar performance in the single localization.
The rest of this paper is organized as follows.
Section 2 describes the proposed multi-target WiFi fingerprinting localization approach.
Section 3 reports our experimental results.
Section 4 presents our conclusions.
2. WiFi Fingerprinting Indoor Localization
Suppose that there are a total of
d WiFi routers in which the indoor positioning service is to be established. In a multi-story building, the location is expressed as a three-dimensional vector such as (
x coordinate,
y coordinate, floor). By storing RSSI measurements received from
d routers at collecting locations, a fingerprint database consisting of (location,
d-dimensional RSSI set) pairs is constructed, in which router locations are not necessarily known. To include environmental variations in the data, RSSIs are measured several times at a location.
Table 1 shows the fingerprint database format.
Because certain signals from distantly located routers are not observable, the fingerprint database has many elements filled with empty spaces (null). Because null spaces cannot be processed in machine learning, these are intentionally converted to a dB value.
The following
Section 2.1 and
Section 2.2 present feature extraction for WiFi RSSI measurements and standardization for labelled location data, respectively.
Section 2.3 presents deep learning-based multi-target localization.
2.1. WiFi RSSI Feature Extraction
Principal component analysis (PCA) [
17] is a data transformation method for reducing the dimensionality of data and extracting the features data by minimizing information loss. It can be understood as an unsupervised learning method, as it does not require label information such as location; thus, in this paper, it is used to extract the features of WiFi fingerprint measurement data.
Suppose
are fingerprint training datasets, where
is a vector of RSSI measurements at a location
provided by
where
is a scalar indicating the RSSI value at the
ith location received from the
jth WiFi router over a total of
d routers,
and
are the 2D location, and
is floor level.
The original data
x are transformed to the feature dataset
by the transformation matrix
Q, as follows:
In this paper, finding an optimal
Q is achieved by the PCA algorithm, which solves a generalized eigenvalue problem. Let
be the generalized eigenvectors associated with the generalized eigenvalues
of the following generalized eigenvalue problem:
where
I is an identity matrix and
A is a scatter matrix provided by
with the mean of all the RSSI samples
:
Then, the optimal matrix
is calculated by solving the optimization problem presented by
where the condition of the eigenvalue problem is that the generalized eigenvectors are orthogonal:
and the generalized eigenvectors are normalized:
Finally, when the eigenvalues are sorted in descending order such that
the transformation matrix
Q is provided by
It should be noted that the last small values of the eigenvalues and eigenvectors do not affect the transformation. Thus, we choose a proper l to cut down the original dimensionality from d to l.
2.2. Label Standardization
In a WiFi fingerprint localization system, the two-dimensional coordinate values in the position vector
are generally expressed in meters and the floor values are expressed as integers, ensuring that the scale of the data values between position and floor is different. In addition, the location data are not evenly distributed, as the two-dimensional location is concentrated in the hallway due to the nature of the indoor space. This characteristic can have a negative effect on the accuracy of all general machine learning algorithms, including artificial neural networks. In this paper, standardization is used to convert the original location label data
to
, as follows:
where
and
are the mean and standard deviation of
, respectively.
As a summary of the data preprecessing, the PCA transforms the RSSI measurement data from x to , and standardization transforms the location data from y to . As a result, the new feature database is obtained.
2.3. Multi-Target Estimation Based on Artificial Neural Network
An artificial neural network consists of several layers. In a layer, neural nodes have a linear combination composed of learning parameters. When the combined signal propagates to the next layer, a nonlinear model such as a rectified linear unit (ReLU) activation function decides the output of the layer. As the signal propagates to multiple layers, the nonlinearity increases. The success of deep learning in finding optimal node parameters depends on determining the appropriate number of hidden layers and neural nodes.
Figure 2 shows the structure of a deep neural network for general single-location learning. In this case, the input layer of the neural network uses the
l-dimensional RSSI vector,
in (
3).
Figure 3 is a neural network structure for multi-location recognition. It is a parallel extension of the single-location neural network shown in
Figure 2. As shown in
Figure 3, as the number of positions to be simultaneously estimated increases, more neural network nodes and a higher number of learning parameters are required. When the input data and output data are expanded, as shown in
Figure 3, the dimensionality of the data increases.
General machine learning techniques including deep neural network require large amounts of training data for high dimensionality data [
18]. The datasets
have undergone preprocessing for single positioning. Given a fixed amount of data
, in order to generate sufficient data for multi-location learning (that is, to make up the input/output dataset represented in
Figure 3), an additional rendering operation is required.
Let us suppose that the number of multiple locations to be estimated is
K; then, the composition of input data and output data of the neural network is as follows:
In (
14) and (
15), the
jth component
and
are sampled from the original
. It should be noted here that according to the sorting order of the pairs
, data
and
can be heterogeneous. For example,
and
are different sets unless all elements are equal. Therefore, an appropriate amount of new training data to be used for multi-location learning can be generated by repeating and aligning random extractions at
while keeping the condition of
for any
. As a result, new training datasets for the multiple targets
are made, in which parameter
c is defined by a developer. Algorithm 1 summarizes the algorithm used to produce multi-target training data
F from
.
Algorithm 1 Rendering multi-target fingerprint training dataset |
Input:, single target fingerprint training data
Output:, multi-target fingerprint training data
Initialization:
Set the desired number of multi-target training data c and the multi-target K, and define as empty sets.
- 1:
Initialize generator with , discriminator with and classifier with . - 2:
fordo - 3:
for do - 4:
Sample randomly from subject to and . - 5:
Add the samples by and - 6:
end for - 7:
(Here, the ith data in Equations ( 14) and ( 15) are made up.) - 8:
end for
|