Wi-Fi Fingerprint Indoor Localization by Semi-Supervised Generative Adversarial Network

Yoo, Jaehyun

doi:10.3390/s24175698

Open AccessArticle

Wi-Fi Fingerprint Indoor Localization by Semi-Supervised Generative Adversarial Network

by

Jaehyun Yoo

School of AI Convergence, Sungshin Women’s University, 34 da-gil 2, Bomun-ro, Seongbuk-gu, Seoul 02844, Republic of Korea

Sensors 2024, 24(17), 5698; https://doi.org/10.3390/s24175698 (registering DOI)

Submission received: 18 June 2024 / Revised: 21 August 2024 / Accepted: 26 August 2024 / Published: 1 September 2024

(This article belongs to the Special Issue Sensors and Techniques for Indoor Positioning and Localization)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Wi-Fi fingerprint indoor localization uses Wi-Fi signal strength measurements obtained from a number of access points. This method needs manual data collection across a positioning area and an annotation process to label locations to the measurement sets. To reduce the cost and effort, this paper proposes a Wi-Fi Semi-Supervised Generative Adversarial Network (SSGAN), which produces artificial but realistic trainable fingerprint data. The Wi-Fi SSGAN is based on a deep learning, which is extended from GAN in a semi-supervised learning manner. It is designed to create location-labeled Wi-Fi fingerprint data, which is different to unlabeled data generation by a normal GAN. Also, the proposed Wi-Fi SSGAN network includes a positioning model, so it does not need a external positioning method. When the Wi-Fi SSGAN is applied to a multi-story landmark localization, the experimental results demonstrate a 35% more accurate performance in comparison to a standard supervised deep neural network.

Keywords:

generative adversarial network; indoor localization; semi-supervised learning; Wi-Fi fingerprint

1. Introduction

Indoor localization has attracted increasing attention for location awareness where the Global Navigation Satellite System (GNSS) does not work in indoor buildings. Many different methods have been developed by using methods such as Pedestrian Dead Reckoning (PDR) [1], hardware-based localizations such as Angle of Arrival (AoA) [2] and Time Difference of Arrival (TDoA) [3], and distance estimations [4]. Additionally, Wi-Fi Received Signal Strength Indicator (RSSI) fingerprint localization has become popular due to its advantage of utilizing complementary Wi-Fi RSSIs obtained from a large number of existing Access Points (APs) built into the structure [5]. A Wi-Fi fingerprint is defined as a labeled data point that is a pair of RSSIs and their measuring location. To estimate a location, given a set of RSSIs, a machine learning approach aims to find a mapping, such that

\begin{matrix} Positioning function : a set of RSSIs \to a location . \end{matrix}

A major issue with Wi-Fi RSSI fingerprint localization is that data collection is costly. It needs manual collection across all positioning areas, and an annotation process to label locations to RSSI sets. To reduce these effort, a Generative Adversarial Network (GAN) might be one of the promising solutions. The purpose of the GAN is to produce artificial data samples similar to real ones [6]. A typical GAN has two independent deep neural networks, i.e, a generator and a discriminator. An adversarial learning approach via a min–max game is used for training; the generator is learned to fool the discriminator by making realistic fake data, whereas the discriminator is learned to distinguish fake and real data.

In the scenario where GANs are assumed to be applied for fingerprint indoor localization, a generator might be modeled to produce artificial RSSI data, such that

\begin{matrix} GAN generator : noise \to a set of RSSIs . \end{matrix}

It is expected that the GAN improves the localization accuracy by supplying a large amount of training data. However, most of the existing GAN-based fingerprint methods [7,8,9,10,11] can only produce unlabeled RSSI samples. By this restriction, their methods have reported using only a few visible Wi-Fi APs whose locations are known preliminarily. This limitation is different to a general fingerprint localization environment in which a large number of location-unknown APs are used.

The main contribution of this paper is to develop a Wi-Fi Semi-Supervised GAN (SSGAN) for fingerprint localization, which produces synthetic labeled RSSI data, such that

\begin{matrix} Wi-Fi SSGAN generator : a location \to a set of RSSIs . \end{matrix}

One of main differences from a normal GAN is the input configuration. By inputting a specific query location, the SSGAN generates a corresponding labeled RSSI fingerprint, whereas a normal GAN produces only unlabeled RSSI values irrelevant to locations. Moreover, the proposed Wi-Fi SSGAN includes an additional classification network, which can be utilized as a positioning model, so it does not need to employ an extra positioning method. The produced labeled data can help improve learning performance of the positioning network model because the fingerprint localization mainly uses labeled data. While GAN is an unsupervised learning method, SSGAN can be seen as a type of semi-supervised learning whose main purpose is to learn a classifier, as illustrated in Figure 1.

Because raw RSSI fingerprints are sparse due to the AP range limit, feature extraction from RSSIs [12] is mandatory for fingerprint localization. This paper applies an Auto-Encoder (AE) to convert them to trainable data, where an AE is an unsupervised deep representational neural network to recover original input data [13]. A learned AE model extracts feature values of neural nodes from the middle layer. The resultant feature set has far lower dimensionality than the original data set. As a result, the high dimensional and sparse raw RSSI measurements are transformed to feature sets by the AE, and then the feature sets are used as input data for the Wi-Fi SSGAN to learn an indoor localization model.

For the experimental study, we collected Wi-Fi fingerprints through corridors at a five-story office building, in which 508 different AP devices are scanned across five floors. To evaluate the proposed Wi-Fi SSGAN algorithm, we define a multi-classification problem as landmark localization. From the experiments, Wi-Fi SSGAN achieved 35% better accuracy compared to a supervised deep neural network when a small amount of training data was used.

The rest of this paper is organized as follows. Section 2 overviews the related works. Section 3 describes the Wi-Fi fingerprint data preprocessing. Section 4 mainly introduces the Wi-Fi SSGAN algorithm. Section 5 and Section 6 report the experimental results and conclusion, respectively.

2. Related Works

2.1. Data Generation for Wi-Fi Fingerprint Localization

To improve the cost efficiency of fingerprint indoor localization, various approaches have been proposed. Unlabeled data that include only RSSI measurements without ground-truth locations can be simply collected by crowdsourcing [12,14]. To exploit the unlabeled data, a semi-supervised learning method has been applied for indoor localization [15,16]. It first decides pseudo-labels (e.g., locations) of the unlabeled data based on a graphical representation, and then it learns a localization model with a penalty balance for the pseudo-labeled data.

The GAN can produce qualified fake data samples if a generative model and discriminative model are successfully trained to outplay it [10,17]. GAN learns its model by support of the generated data, as well as real data. It is easy to confirm the trustworthiness of fake data by visually comparing them to real ones. An accurately taught generator can produce fake data infinitely.

Because the standard GAN method has a weakness of convergence problem [18], the Wasserstein GAN (WGAN) was developed, in which the objective loss function is defined as the Wasserstein distance. Later, the WGAN-gradient penalty (GP) enhanced learning performance more by adding a gradient penalty to the loss objective [19], and it has recently been used for various applications such as images, text and digital signals.

2.2. Semi-Supervised GAN (SSGAN)

Compared to the original GAN, which uses only unlabeled data, the SSGAN [20] might exhibit a better learning result by utilizing labeled data as well. In [11,21], location information is used as the generator’s input to produce fingerprint data samples to improve the positioning accuracy. In [22,23], the SSGAN methods are validated to produce accurate synthetic data for applications of cross-modal hashing and clinical decision, respectively.

A more advanced strategy to apply to SSGAN learning is to involve a part of prediction network into a unified model. By combining a prediction model into a discriminator, it does not need to employ an extra prediction model. In [24], the SSGAN was used to generate labeled electroencephalography (EEG) signal samples, and had a validated better performance than a standard GAN.

Figure 1 compares the GAN and SSGAN. In the SSGAN, the label prediction is computed by a classifier that shares its network model with a discriminator, while the original GAN concentrates on producing unlabeled data without a prediction model.

3. Wi-Fi Fingerprint Preprocessing

This section overviews Wi-Fi fingerprint data localization in Section 3.1 and feature extraction by the AE in Section 3.2. The feature data will be used as input of the main learning algorithm described in Section 4.

3.1. Wi-Fi RSSI Fingerprint for Landmark Localization

For landmark localization, the Wi-Fi RSSI fingerprint data are collected by placing a receiver, e.g., a smartphone, at different landmark locations. A receiver measures RSSIs obtained from near APs that are broadcasting their signals periodically. The Wi-Fi signal conveys unique information of a AP transmitter by means of Media Access Control (MAC) address. This enables the receiver to recognize which AP sent the Wi-Fi signal. A general fingerprint localization method utilizes all RSSIs of APs scanned across the positioning area.

Suppose that N number of Wi-Fi RSSI fingerprints at N locations are obtained from total d number of APs, given by

\begin{matrix} D = {(r_{i}, y_{i})}_{i = 1}^{N}, \end{matrix}

(1)

where

r_{i} \in R^{d}

is am RSSI fingerprint set and

y_{i} \in R^{l}

is a landmark index. The landmark label index

y_{i}

is defined as a l-by-1 one-hot vector, where l is the number of landmarks that are predesignated by a developer. The RSSI set at the i-th landmark is given by

\begin{matrix} r_{i} = {[r s s i_{i}^{1}, r s s i_{i}^{2}, \dots, r s s i_{i}^{d}]}^{T}, \end{matrix}

(2)

where

r s s i_{i}^{j}

is a RSSI measurement obtained from the j-th AP. The dimensionality of an RSSI set

r_{i}

equals the number of APs, d.

Given the training data D, the objective of machine learning in the training phase is to build a classifier:

R^{d} \to R^{l}

, which represents a relationship between a set of Wi-Fi RSSI measurements and a landmark. In the test phase, when a query

r^{*}

is given, the positioning model infers on which landmark a receiver is located.

3.2. Feature Extraction by Auto-Encoder (AE)

The outstanding property of raw RSSI fingerprint data is its sparsity as shown in Figure 2a. By restriction of the Wi-Fi signal propagation range, there are many empty elements in a raw RSSI set, so that

r_{i}

in (2) has many empty values. Typically, these empty components are filled with a possible minimum value, such as −100 dBm. Because this prompts inaccurate learning performance, feature extraction for the sparse RSSI data is mandatory. The objective of feature extraction is to obtain a model H that converts a raw d-dimensional Wi-Fi RSSI fingerprint set

r \in R^{d}

into a s-dimensional feature set

x \in R^{s}

(

d ≫ s

), where s is dimensionality of feature data.

AE is an unsupervised deep representational neural network. It trains a meaningful feature space among input data in a layer-wise manner by learning a neural network model to replicate the original input data to output. Hidden layers from the input layer to the feature layer are called encoders, and the rest of the layers for the restoration are called decoders, as described in Figure 2b.

Given the Wi-Fi RSSI data set

{r_{i}}_{i = 1}^{N}

from (1), the encoder converts raw data

r \in R^{d}

to low-dimensional feature data

x \in R^{s}

. The decoder reconstructs the feature

x

back to the original data

r

by estimating its prediction

\hat{r}

. More detail of mathematical explanation of the AE can be found in [13].

After the AE model is learned, the encoder part is used as the feature extraction method to obtain the following feature database,

\begin{matrix} O = {(x_{i}, y_{i})}_{i = 1}^{N} . \end{matrix}

(3)

The newly made database (3) replaces the original dataset (1) to learn the proposed Wi-Fi SSGAN model in the next section.

4. Wi-Fi SSGAN

The Wi-Fi SSGAN consists of multiple neural networks, and they are learned by complementary optimizations. Variables

(x^{f}, y^{f})

and

(x^{r}, y^{r})

are fake and real data, respectively. Figure 3 shows network models of Wi-Fi SSGAN in which the generative model produces fake labeled RSSI feature data, and the combined model of discriminator and classification predicts a location and distinguishes fake data, simultaneously.

4.1. Generator

The SSGAN generator produces fake samples

x^{f} \sim P_{f}

with respect to a specific location query

y^{f}

by learning the network with parameter set

θ_{G}

. Using an artificial label

y^{f}

as the generator’s input is one of the major differences between the SSGAN and a normal GAN that only allows random noise as input. A simulated RSSI set generated by the SSGAN generator mimics a real fingerprint sample in relation to an actual location. Consequently, the accurately created fingerprint data can effectively support the learning of a positioning model.

4.2. Discriminator

When real data

x^{r} \sim P_{r}

are initially given and fake data

x^{f} \sim P_{f}

are made from the generator, the discriminator aims to recognize whether the produced samples are fake or not. The learning process lets the generator and discriminator outplay each other. The learning process is repeated until the generator finally produces realistic samples, so that the discriminator with

θ_{D}

does not exactly distinguish if the generated samples are fake.

4.3. Classifier

Classifier involvement into the discriminator network is another major difference between the SSGAN and a normal GAN in which a prediction model does not exist. The classifier with

θ_{C}

and discriminator with

θ_{D}

share a network model, except for the output layer. It is noted that the fake data

x^{f}

also have ground-truths

y^{f}

, so that the prediction errors of the fake data can be calculated. Therefore, training both the actual and the fake data can improve the learning performance of the classifier.

4.4. SSGAN Formulation

In WGAN-GP [19], the discriminator D and generator G play a min–max game based on the Wasserstein distance

V (D, G)

, which is a distance between distributions D and G, such that

\begin{matrix} min_{G} max_{D} V (D, G) & = - E_{x^{r} \sim P_{r}} [D (x^{r})] + E_{z \sim P_{z}} [D (G (z))], \end{matrix}

(4)

where

x^{r} \sim P_{r}

are real data,

z \sim P_{z}

are noise vectors, and

P_{z}

is uniform distribution with bound [0, 1]. To achieve a solution to (4), separate optimizations are performed to derive each generator and discriminator. The discriminator is learned by minimizing the following loss:

\begin{matrix} L_{D} (θ_{D}, θ_{G}^{*}) = V (D_{θ_{D}} (x^{r}), G_{θ_{G}^{*}} (z)) + ρ E_{\bar{x} \sim P_{\bar{x}}} [(∥ \nabla_{\bar{x}} D (\bar{x}) {∥_{2} - 1)}^{2}] . \end{matrix}

(5)

In (5), the first term is the Wasserstein distance, and the second term is a gradient penalty controller to improve the stability of the learning convergence, where

ρ

is a tuning parameter and

\bar{x}

are samples lying on a straight line between

P_{r}

and

P_{f}

[19]. On the other hand, the generator is learned to fool the discriminator by reducing the following loss:

\begin{matrix} L_{G} (θ_{G}, θ_{D}^{*}) = E_{x^{f} \sim P_{f}} [D_{θ_{D}^{*}} (x^{f})] . \end{matrix}

(6)

In (5) and (6),

θ_{G}^{*}

and

θ_{D}^{*}

are the fixed weight parameters, respectively, so that each network focuses on learning own model.

The proposed Wi-Fi SSGAN is extended from the WGAN-GP in a semi-supervised learning manner. One of the main differences between SSGAN and WGAN-GP is the input configuration of the generator. SSGAN generator’s input

\bar{z} \sim P_{\bar{z}}

is defined as a concatenation of noise and label, given by

\begin{matrix} \bar{z} = {[z, y_{f}]}^{T} . \end{matrix}

(7)

Accordingly, fake data

x^{f}

are made by the generator given by

\begin{matrix} x^{f} = G_{θ_{G}} (\bar{z}), \end{matrix}

(8)

and the Wasserstein distance is defined as

\begin{matrix} V (D, G) = - E_{x^{r} \sim P_{r}} [D (x^{r})] + E_{\bar{z} \sim P_{\bar{z}}} [D (G (\bar{z}))] . \end{matrix}

(9)

The second difference is the prediction model involvement into the discriminator network for the purpose of increasing the accuracy of label prediction. The combined discriminator and classifier (CDC) model minimizes the following loss:

\begin{matrix} L_{C D C} (θ_{D}, θ_{C}, θ_{G}^{*}) = L_{D} (θ_{D}, θ_{G}^{*}) + L_{C} (θ_{C}, θ_{G}^{*}), \end{matrix}

(10)

where

L_{D}

is defined in (5), and

L_{C}

is as follows:

\begin{matrix} L_{C} (θ_{C}, θ_{G}^{*}) = E [log P (y^{r} | x^{r})] + E [log P (y^{f} | G_{θ_{G}^{*}} (\bar{z}))] . \end{matrix}

(11)

Finally, the generator loss of SSGAN with the fixed

θ_{D}^{*}

and

θ_{C}^{*}

is given by

\begin{matrix} L_{G} (θ_{G}, θ_{D}^{*}, θ_{C}^{*}) = E_{\bar{z} \sim P_{\bar{z}}} [D_{θ_{D}^{*}, θ_{C}^{*}} (G_{θ_{G}} (\bar{z}))] . \end{matrix}

(12)

The overall training algorithm of the proposed Wi-Fi SSGAN is summarized in Algorithm 1. After the training process, the AE model and the classifier model are obtained. For the test when a query Wi-Fi RSSI set is given, the AE model first converts it to a feature set. Then, the classifier performs the probabilistic inference to localization.

Algorithm 1 Wi-Fi SSGAN

Input: Training dataset

{r_{i}, y_{i}}_{i = 1}^{N}

with RSSI set

r_{i} \subset R^{d}

and one-hot landmark label

y_{i} \subset R^{l}

.

Output: An auto-encoder

: R^{d} \to R^{s}

, and a classifier

: R^{s} \to R^{l}

.

Feature extraction:

1:: Given the training dataset, learn the auto-encoder model and build the new feature dataset ${{(x_{i}, y_{i}}}_{i = 1}^{N}$ with $x_{i} \subset R^{s}$ .

SSGAN:

2:: Initialize a generator with $θ_{G}$ , a discriminator with $θ_{D}$ and a classifier with $θ_{C}$ .
3:: repeat
4:: for $j = 1, 2, \dots, k$ do
5:: Sample a batch of real data ${x_{i}^{r}, y_{i}^{r}}_{i = 1}^{M}$ from the feature dataset.
6:: Produce a batch of fake data ${x_{i}^{f}, y_{i}^{f}}_{i = 1}^{M}$ by the generator in (8).
7:: Do optimization to reduce loss $L_{C D C}$ in (10) and update $θ_{D}, θ_{C}$ .
8:: end for
9:: Produce a batch of fake data ${x_{i}^{f}, y_{i}^{f}}_{i = 1}^{M}$ by the generator in (8).
10:: Do optimization to reduce loss $L_{G}$ in (6) and update $θ_{G}$ .
11:: until end of learning
12:: Obtain the classifier with $θ_{C}^{*}$ .

5. Experiments

5.1. Setup

We collect the Wi-Fi RSSI data through the corridors in a five-story office building, where each floor has different shape, as shown in Figure 4. In total, 508 different AP devices whose locations are unknown are scanned, so the dimensionality of the raw Wi-Fi RSSI set is 508. Seventeen different landmarks shown in Figure 4 are defined at particular landmarks, and the machine learning aims to solve a multi-classification problem (17 classifications in this paper). We collect the RSSI measurements within a radius of 5 m from each landmark center to hold the diversity of the data and to avoid overfitting of the learning. A total of 100 data points for each landmark are prepared for each training and test set. To assess the usefulness of the fake data for fingerprint localization, we divide the training data points into subsets ranging from 10% to 100%. We then compare the positioning and learning performances for each data ratio.

In Wi-Fi SSGAN, the generator network has three layers, and all hidden layers have 50 neural nodes. The generator input

\bar{z}

in (7) is combination of 10-dimensional uniform noise and 17-dimensional landmark one-hot

y_{f}

. Because the dimensionality of the feature set produced by the AE is 20, the generator output

x^{f}

is also 20-dimensional. The CDC network whose loss function is defined in (10) is defined to share only the input layer. The discriminator has one hidden layer with 200 nodes. The classifier has three hidden layers with 200 nodes. The CDC network results in an 18-dimensional probabilistic vector, whose first element indicates the prediction score and the rest is the landmark prediction. To learn the network, an Adam optimizer and Xavier initializer are used. The size of batch M in Steps 6, 7, and 10 in Algorithm 1, which indicates the each sample size of real and fake data to learn each generator and CDC, is set as

M = 50

. Additionally, iteration number k at Step 5 in Algorithm 1, and parameter

ρ

in (5), are set as

k = 5

and

ρ = 0.01

.

As the baseline to evaluate the proposed Wi-Fi SSGAN, a supervised deep neural network (DNN) having five layers and 200 nodes is used. Because the main contribution of the Wi-Fi SSGAN is to generate labeled fake data to improve the learning performance, the outstanding difference from the DNN is expected when a small amount of training data are available. An unsupervised normal GAN, such as WGAN-GP, is not suitable for comparison in the same setup because it produces only unlabeled data, making fingerprint localization unfeasible.

Through the same AE model, the same feature data as the input of the DNN and the Wi-Fi SSGAN are fairly applied. The AE is designed to have five layers, including input and output layers. The first and second hidden layers have 200 and 20 neural nodes, respectively, so that dimensionality of a feature set is 20. The rectified linear unit activation function and Adam optimizer are used to learn the AE network.

5.2. Landmark Localization by Wi-Fi SSGAN

For performance evaluation, we change the number of labeled training data ratio from 10% to 100% to learn both the Wi-Fi SSGAN and DNN models, and the portion of the training data are randomly picked out of entire training data. Because test is performed 10 times, the accuracy is represented by the mean and standard deviation. Figure 5 shows the test accuracy of the proposed Wi-Fi SSGAN classifier and the compared DNN, according to change in the amount of used training data. The Wi-Fi SSGAN entirely outperforms the supervised DNN, and a noticeable difference is found when a small amount of labeled data are used. For example, when only 10% samples are used, the Wi-Fi SSGAN achieves 35% higher accuracy. Moreover, the supervised DNN shows large variations in accuracy due to an insufficient amount of training data, whereas the Wi-Fi SSGAN deviations are consistent regardless of the data ratio. Additionally, across all data ratio cases, the proposed algorithm outperforms the DNN due to the support of accurately produced synthetic data.

Figure 6 shows the loss value graphs of Wi-Fi SSGAN during learning iterations when 10% samples are used. We choose the 10% ratio samples because it is common for GAN models to fail when using a small amount of actual data. Successful learning with the 10% ratio implies that the other cases will also be successful. In Figure 6, the losses of real and fake data in (11) are shown in the top figure, and the CDC loss in (10) and the generator loss in (6) are are in the bottom figure. As the fake data and the classifier are updated at each iteration, the loss decreases and eventually converges, indicating successful learning termination.

It is possible to visually confirm the accuracy of the generator by comparing the fake and real data. Figure 7 shows the produced RSSI feature data (red line) and the real data (blue-dash line) according to variations in the amount of used training samples.

We present the results for two groups: (i) when a small amount of actual data (10%) is used, and (ii) when a large amount of actual data (100%) is used. The first group results are shown in Figure 7a–c, and the second group results are shown in Figure 7d, Figure 7e and Figure 7f, respectively.

In this paper, the first case is emphasized to effectively determine if the generated fingerprint data are useful for positioning. From Figure 7a–c, we observe that the generated data are not only diverse, but also closely align with the actual data’s distribution. Diverse data are more beneficial than overfitted data for learning a fingerprint positioning model for two reasons. First, natural RSSI values at a location are inconsistent due to external factors, such as noise and multi-path problems. Second, machine learning algorithms require diverse data rather than overfitted data. In the second case, when sufficient data are used for learning, as shown in Figure 7d–f, the generator also produces high-quality synthetic data.

Another way to evaluate the usefulness of the generated data are by comparing the shape of the fake data to the actual data that have the same location labels. Figure 7a,d are the RSSI feature values whose labels are annotated from the second landmark, Figure 7b,e are from the fifth landmark, and Figure 7c,f are from the eleventh landmark. The results indicate that the fake data and the actual data with the same labels are visually similar, which helps maintain localization accuracy, even when only a small amount of actual data are used.

6. Conclusions

The Wi-Fi SSGAN, which is a new semi-supervised learning version of a generative adversarial network for Wi-Fi indoor localization, was presented. The proposed method aims to produce artificial fingerprint data to support a lack of actual fingerprint data. From the experiments, the similarity of the fake data to real data was demonstrated, and localization accuracy was improved, especially when small amounts of actual training data were used. Many RSSI-based indoor localization algorithms have been reported, and these methods typically depend on manually collected fingerprint-labeled data. Therefore, the accurately produced fingerprint data presented in this paper can significantly support RSSI-based positioning algorithms. For instance, the synthetic data can be used as an alternative for predicting locations in areas that are otherwise inaccessible. As a future work, we plan to further analyze the effectiveness of the generated data in enhancing other RSSI-based localization methods, and to explore its limitations in various scenarios where it may not be useful for positioning.

Funding

This work was supported by the Sungshin Women’s University Research Grant of H20210081.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The author declares no conflicts of interest.

References

Chen, X.; Xie, Y.; Zhou, Z.; He, Y.; Wang, Q.; Chen, Z. An Indoor 3D Positioning Method Using Terrain Feature Matching for PDR Error Calibration. Electronics 2024, 13, 1468. [Google Scholar] [CrossRef]
Florio, A.; Avitabile, G.; Talarico, C.; Coviello, G. A Reconfigurable Full-Digital Architecture for Angle of Arrival Estimation. IEEE Trans. Circuits Syst. I 2023, 71, 1443–1455. [Google Scholar] [CrossRef]
Ivanov, S.; Kuptsov, V.; Badenko, V.; Fedotov, A. RSS/TDoA-based source localization in microwave UWB sensors networks using two anchor nodes. Sensors 2022, 22, 3018. [Google Scholar] [CrossRef] [PubMed]
Milano, F.; da Rocha, H.; Laracca, M.; Ferrigno, L.; Espírito Santo, A.; Salvado, J.; Paciello, V. BLE-Based Indoor Localization: Analysis of Some Solutions for Performance Improvement. Sensors 2024, 24, 376. [Google Scholar] [CrossRef] [PubMed]
Yoo, J. Multiple fingerprinting localization by an artificial neural network. Sensors 2022, 22, 7505. [Google Scholar] [CrossRef] [PubMed]
Goodfellow, I.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. Adv. Neural Inf. Process. Syst. 2014, 27, 2672–2680. [Google Scholar]
Yean, S.; Goh, W.; Lee, B.S.; Oh, H.L. extendGAN+: Transferable Data Augmentation Framework Using WGAN-GP for Data-Driven Indoor Localisation Model. Sensors 2023, 23, 4402. [Google Scholar] [CrossRef]
Nabati, M.; Navidan, H.; Shahbazian, R.; Ghorashi, S.A.; Windridge, D. Using synthetic data to enhance the accuracy of fingerprint-based localization: A deep learning approach. IEEE Sensors Lett. 2020, 4, 6000204. [Google Scholar] [CrossRef]
Li, Q.; Qu, H.; Liu, Z.; Zhou, N.; Sun, W.; Sigg, S.; Li, J. AF-DCGAN: Amplitude feature deep convolutional GAN for fingerprint construction in indoor localization systems. IEEE Trans. Emerg. Top. Comput. Intell. 2019, 5, 468–480. [Google Scholar] [CrossRef]
Seong, J.H.; Seo, D.H. Selective unsupervised learning-based Wi-Fi fingerprint system using autoencoder and GAN. IEEE Internet Things J. 2019, 7, 1898–1909. [Google Scholar] [CrossRef]
Chen, K.M.; Chang, R.Y. Semi-supervised learning with GANs for device-free fingerprinting indoor localization. In Proceedings of the Global Communications Conference, Taipei, Taiwan, 8–10 December 2020. [Google Scholar]
Yoo, J.; Johansson, K.H.; Kim, H.J. Indoor localization without a prior map by trajectory learning from crowdsourced measurements. IEEE Trans. Instrum. Meas. 2017, 66, 2825–2835. [Google Scholar] [CrossRef]
Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016. [Google Scholar]
Zhuang, Y.; Syed, Z.; Georgy, J.; El-Sheimy, N. Autonomous smartphone-based Wi-Fi positioning system by using access points localization and crowdsourcing. Pervasive Mob. Comput. 2015, 18, 118–136. [Google Scholar] [CrossRef]
Zhou, M.; Tang, Y.; Nie, W.; Xie, L.; Yang, X. GrassMA: Graph-based semi-supervised manifold alignment for indoor WLAN localization. IEEE Sensors J. 2017, 17, 7086–7095. [Google Scholar] [CrossRef]
Yoo, J.; Johansson, K.H. Semi-supervised learning for mobile robot localization using wireless signal strengths. In Proceedings of the International Conference on Indoor Positioning and Indoor Navigation, Sapporo, Japan, 18–21 September 2017. [Google Scholar]
Zou, H.; Chen, C.L.; Li, M.; Yang, J.; Zhou, Y.; Xie, L.; Spanos, C.J. Adversarial learning-enabled automatic Wi-Fi indoor radio map construction and adaptation with mobile robot. IEEE Internet Things J. 2020, 7, 6946–6954. [Google Scholar] [CrossRef]
Arjovsky, M.; Chintala, S.; Bottou, L. Wasserstein generative adversarial networks. In Proceedings of the International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017. [Google Scholar]
Gulrajani, I.; Ahmed, F.; Arjovsky, M.; Dumoulin, V.; Courville, A.C. Improved training of Wasserstein GANS. In Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
Odena, A. Semi-supervised learning with generative adversarial networks. arXiv 2016, arXiv:1606.01583. [Google Scholar]
Belmonte-Hernandez, A.; Hernandez-Penaloza, G.; Gutiérrez, D.M.; Alvarez, F. Recurrent model for wireless indoor tracking and positioning recovering using generative networks. IEEE Sensors J. 2019, 20, 3356–3365. [Google Scholar] [CrossRef]
Zhang, J.; Peng, Y.; Yuan, M. SCH-GAN: Semi-supervised cross-modal hashing by generative adversarial network. IEEE Trans. Cybern. 2018, 50, 489–502. [Google Scholar] [CrossRef] [PubMed]
Yang, Y.; Nan, F.; Yang, P.; Meng, Q.; Xie, Y.; Zhang, D.; Muhammad, K. GAN-based semi-supervised learning approach for clinical decision support in health-IoT platform. IEEE Access 2019, 7, 8048–8057. [Google Scholar] [CrossRef]
Panwar, S.; Rad, P.; Jung, T.P.; Huang, Y. Modeling EEG data distribution with a Wasserstein generative adversarial network to predict RSVP events. IEEE Trans. Neural Syst. Rehabil. Eng. 2020, 28, 1720–1730. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Comparison of GAN and SSGAN.

Figure 2. Wi-Fi RSSI fingerprint collection and feature extraction via auto-encoder (AE) in (a) and neural network structure of AE in (b).

Figure 3. Wi-Fi SSGAN models: the generator (a), and the combined discriminator and classifier (b).

Figure 4. Fingerprint distribution from the experimental office building.

Figure 5. Comparison of classification performance according to changes in the amount of labeled data.

Figure 6. Learning curve of the proposed Wi-Fi SSGAN; classifier loss (a), and CDC and generator losses (b).

Figure 7. Synthetic labeled data by Wi-Fi SSGAN vs. actual labeled data; (a) landmark index = 2, sample ratio = 10%, (b) landmark index = 5, sample ratio = 10%, (c) landmark index = 11, sample ratio = 10%, (d) landmark index = 2, sample ratio = 100%, (e) landmark index = 5, sample ratio = 100%, (f) landmark index = 11, sample ratio = 100%.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Yoo, J. Wi-Fi Fingerprint Indoor Localization by Semi-Supervised Generative Adversarial Network. Sensors 2024, 24, 5698. https://doi.org/10.3390/s24175698

AMA Style

Yoo J. Wi-Fi Fingerprint Indoor Localization by Semi-Supervised Generative Adversarial Network. Sensors. 2024; 24(17):5698. https://doi.org/10.3390/s24175698

Chicago/Turabian Style

Yoo, Jaehyun. 2024. "Wi-Fi Fingerprint Indoor Localization by Semi-Supervised Generative Adversarial Network" Sensors 24, no. 17: 5698. https://doi.org/10.3390/s24175698

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Wi-Fi Fingerprint Indoor Localization by Semi-Supervised Generative Adversarial Network

Abstract

1. Introduction

2. Related Works

2.1. Data Generation for Wi-Fi Fingerprint Localization

2.2. Semi-Supervised GAN (SSGAN)

3. Wi-Fi Fingerprint Preprocessing

3.1. Wi-Fi RSSI Fingerprint for Landmark Localization

3.2. Feature Extraction by Auto-Encoder (AE)

4. Wi-Fi SSGAN

4.1. Generator

4.2. Discriminator

4.3. Classifier

4.4. SSGAN Formulation

5. Experiments

5.1. Setup

5.2. Landmark Localization by Wi-Fi SSGAN

6. Conclusions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI