Semantic VPS for Smartphone Localization in Challenging Urban Environments
Abstract
:1. Introduction
- First, we take advantage of building materials as visual aids for precise self-localization, overcoming inaccuracies due to a non-distinctive or obscured skyline, which are common in urban environments.
- Second, the semantic VPS uses building information modeling (BIM), which is widely available in smart cities, due to its existing use in construction, thus eliminating the need for pre-surveyed images. Hence, it is highly scalable and low cost.
- Third, unlike storing feature data as 3D point clouds in a searchable index, the semantics of materials are stored as the properties of the objects in the BIM, enabling simple and accurate updates to be undertaken.
- Finally, the proposed method identifies and considers dynamic objects in its scoring system, which have usually been neglected in previous studies.
2. Overview of the Proposed Method
3. Proposed Method in Detail
3.1. Textured and Segmented BIM
3.2. Cubic Projection Generation
3.3. Equirectangular Projection Generation
3.4. Smartphone Image Acquistion and Format
3.5. Candidate Position Distribution
3.6. Hand Labelled Material Segmentation
3.7. Material Matching
3.7.1. Dice Metric
3.7.2. Jaccard Metric
3.7.3. Boundary F1 Metric
3.8. Combined Material Matching
3.9. Position Solution
4. Experimental Results
4.1. Image and Test Location Setting
4.2. Positioning Results Using Ideal Segmentation
- Proposed semantic VPS (Combination of Dice, Jaccard and BF Metrics)
- Proposed semantic VPS (Dice only)
- Proposed semantic VPS (Jaccard only)
- Proposed semantic VPS (BF only)
- Skyline Matching: Matching using sky and building class only [21].
- 3DMA: Integrated solution by 3DMA GNSS algorithm on shadow matching, skymask 3DMA and likelihood based ranging GNSS [33].
- WLS: Weighted Least Squares [34].
- NMEA: Low-cost GNSS solution by Galaxy S20 Ultra, Broadcom BCM47755.
4.3. Rotational Results Using Ideal Segmenatation
4.4. Segmentation Accuracy vs. Localization Results
4.5. Discussion on Validity and Limitation
5. Conclusion and Future Work
5.1. Conclusions
- The formulation of positioning as a semantic-based problem enables us to apply the existing wide variety of advanced optimization/shape matching metrics to the problem.
- Materials are diverse, distinctive, and widely distributed; hence, the semantic information in an image can be easily recognized.
- The utilization of building materials for positioning eliminates the need for skyline and building boundary reliance.
- Foliage and dynamic objects are considered for positioning.
- The semantics of buildings stored as vector maps can be simply and accurately updated and labeled.
5.2. Future Work
- Research has shown it is possible to identify a wide variety of materials in images in the indoor environment [36]. Therefore, it is suggested to develop and train a deep learning neural network to identify materials in smartphone images in the outdoor environment for real-time use. Improvement in the deep learning neural network may also aid automatic segmentation of 3D building models, reducing the offline preparation time.
- By adding the common building material classes and dynamic objects to aid differentiation (including concrete, stone, glass, metal, wood, bricks, pedestrians, cars, etc.), given a large and high-quality dataset, the proposed method can be adapted to a variety of different uses.
- It is possible to provide computation of depth based on the BIM and the virtual camera, which can then be stored as additional information in the generated images. This depth information can allow precise AR after image matching.
- To maximize all available visual information, the semantic VPS can also make use of objects in addition to materials, or the combination of a semantic VPS and a feature-based VPS, to yield better positioning performance.
- To reduce storage and computational load, the images can be stored as contour coordinates rather than pixels.
- The semantic VPS may also be further improved by extending the functionality to work in different weather, time, and brightness conditions.
- One difficulty encountered in this experiment was the discrepancy between reality and the BIM; hence, it is suggested to use a crowdsourcing map to continuously update the model.
- For dynamic positioning, a multiresolution framework can be used, where the search starts from a big and sparse grid and is then successively refined on smaller and denser grids. Thus, the position of the chosen candidate is used to refine a smaller search area.
Author Contributions
Funding
Conflicts of Interest
References
- Li, W.; Chen, Z.; Gao, X.; Liu, W.; Wang, J. Multimodel Framework for Indoor Localization Under Mobile Edge Computing Environment. IEEE Internet Things J. 2019, 6, 4844–4853. [Google Scholar] [CrossRef]
- Zou, Y.; Liu, H.; Wan, Q. Joint Synchronization and Localization in Wireless Sensor Networks Using Semidefinite Programming. IEEE Internet Things J. 2017, 5, 199–205. [Google Scholar] [CrossRef]
- Chen, C.; Chen, Y.; Han, Y.; Lai, H.-Q.; Zhang, F.; Liu, K.J.R. Achieving Centimeter Accuracy Indoor Localization on WiFi Platforms: An multi-antenna approach. IEEE Internet Things J. 2016, 4, 111–121. [Google Scholar] [CrossRef]
- Guo, X.; Zhu, S.; Li, L.; Hu, F.; Ansari, N. Accurate WiFi Localization by Unsupervised Fusion of Extended Candidate Location Set. IEEE Internet Things J. 2019, 6, 2476–2485. [Google Scholar] [CrossRef]
- Huang, B.; Xu, Z.; Jia, B.; Mao, G. An Online Radio Map Update Scheme for WiFi Fingerprint-Based Localization. IEEE Internet Things J. 2019, 6, 6909–6918. [Google Scholar] [CrossRef]
- Li, B.; Quader, I.J.; Dempster, A.G. On outdoor positioning with Wi-Fi. J. Glob. Position. Syst. 2008, 7, 18–26. [Google Scholar] [CrossRef] [Green Version]
- Huang, Y.; Hsu, L.-T.; Gu, Y.; Wang, H.; Kamijo, S. Database Calibration for Outdoor Wi-Fi Positioning System. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 2016, 99, 1683–1690. [Google Scholar] [CrossRef]
- Sun, R.; Wang, G.; Cheng, Q.; Fu, L.; Chiang, K.-W.; Hsu, L.-T.; Ochieng, W.Y. Improving GPS Code Phase Positioning Accuracy in Urban Environments Using Machine Learning. IEEE Internet Things J. 2021, 8, 7065–7078. [Google Scholar] [CrossRef]
- Hsu, L.-T. Analysis and modeling GPS NLOS effect in highly urbanized area. GPS Solut. 2017, 22, 7. [Google Scholar] [CrossRef] [Green Version]
- Guvenc, I.; Chong, C.-C. A Survey on TOA Based Wireless Localization and NLOS Mitigation Techniques. IEEE Commun. Surv. Tutorials 2009, 11, 107–124. [Google Scholar] [CrossRef]
- Kuutti, S.; Fallah, S.; Katsaros, K.; Dianati, M.; McCullough, F.; Mouzakitis, A. A Survey of the State-of-the-Art Localization Techniques and Their Potentials for Autonomous Vehicle Applications. IEEE Internet Things J. 2018, 5, 829–846. [Google Scholar] [CrossRef]
- Huang, G.; Hu, Z.; Wu, J.; Xiao, H.; Zhang, F. WiFi and Vision-Integrated Fingerprint for Smartphone-Based Self-Localization in Public Indoor Scenes. IEEE Internet Things J. 2020, 7, 6748–6761. [Google Scholar] [CrossRef]
- Biljecki, F.; Stoter, J.; Ledoux, H.; Zlatanova, S.; Coltekin, A. Applications of 3D City Models: State of the Art Review. ISPRS Int. J. Geo Inf. 2015, 4, 2842–2889. [Google Scholar] [CrossRef] [Green Version]
- Groves, P.D. Shadow Matching: A New GNSS Positioning Technique for Urban Canyons. J. Navig. 2011, 64, 417–430. [Google Scholar] [CrossRef]
- Miura, S.; Hsu, L.-T.; Chen, F.; Kamijo, S. GPS Error Correction with pseudorange evaluation using three-dimensional maps. IEEE Trans. Intell. Transp. Syst. 2015, 16, 3104–3115. [Google Scholar] [CrossRef]
- Groves, P.; Zhong, Q.; Faragher, R.; Esteves, P. Combining Inertially-aided Extended Coherent Integration (Supercorrelation) with 3D-Mapping-Aided GNSS. Presented at the ION GNSS+ 2020, St. Louis, MO, USA, 20–24 September 2020. [Google Scholar]
- Wada, Y.; Hsu, L.-T.; Gu, Y.; Kamijo, S. Optimization of 3D building models by GPS measurements. GPS Solutions 2017, 21, 65–78. [Google Scholar] [CrossRef]
- Lee, M.J.L.; Lee, S.; Ng, H.-F.; Hsu, L.-T. Skymask matching aided positioning using sky-pointing fisheye camera and 3D City models in urban canyons. Sensors 2020, 20, 4728. [Google Scholar] [CrossRef] [PubMed]
- Google. Using Global Localization to Improve Navigation. Google LLC. Available online: https://ai.googleblog.com/2019/02/using-global-localization-to-improve.html (accessed on 20 May 2021).
- Xiao, A.; Chen, R.; Li, D.; Chen, Y.; Wu, D. An Indoor Positioning System Based on Static Objects in Large Indoor Scenes by Using Smartphone Cameras. Sensors 2018, 18, 2229. [Google Scholar] [CrossRef] [Green Version]
- Armagan, A.; Hirzer, M.; Lepetit, V. Semantic segmentation for 3D localization in urban environments. In Proceedings of the 2017 Joint Urban Remote Sensing Event (JURSE), Dubai, United Arab Emirates, 6–8 March 2017; pp. 1–4. [Google Scholar]
- Ramalingam, S.; Bouaziz, S.; Sturm, P.; Brand, M. SKYLINE2GPS: Localization in urban canyons using omni-skylines. In Proceedings of the 2010 IEEE/RSJ International Conference on Intelligent Robots and Systems, Taipei, Taiwan, 18–22 October 2010; pp. 3816–3823. [Google Scholar]
- Delmerico, J.A.; David, P.; Corso, J.J. Building facade detection, segmentation, and parameter estimation for mobile robot localization and guidance. In Proceedings of the 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems, San Francisco, CA, USA, 25–30 September 2011; pp. 1632–1639. [Google Scholar]
- Suzuki, T.; Kubo, N. GNS S Photo Matching: Positioning using GNSS and Camera in Urban Canyon. In Proceedings of the 28th International Technical Meeting of the Satellite Division of The Institute of Navigation, ION GNSS, Tampa, FL, USA, 8–12 September 2015; Volume 4, pp. 2470–2480. [Google Scholar]
- Tao, A.; Sapra, K.; Catanzaro, B. Hierarchical multi-scale attention for semantic segmentation. arXiv 2020, arXiv:2005.10821. [Google Scholar]
- Lands Department. The Government of the Hong Kong Special Administrative Region. Available online: https://www.landsd.gov.hk/ (accessed on 20 May 2021).
- Explanatory notes on geodetic datums in Hong Kong. Available online: https://www.geodetic.gov.hk/common/data/pdf/explanatorynotes.pdf (accessed on 20 May 2021).
- Bourke, P. Miscellaneous Transformations and Projections. Available online: http://paulbourke.net/geometry/transformationprojection/ (accessed on 20 May 2021).
- Sorensen, T.A. A method of establishing groups of equal amplitude in plant sociology based on similarity of species content and its application to analyses of the vegetation on Danish commons. Biol. Skar. 1948, 5, 1–34. [Google Scholar]
- Jaccard, P. The distribution of the flora in the alpine zone. New Phytol. 1912, 11, 37–50. [Google Scholar] [CrossRef]
- Csurka, G.; Larlus, D.; Perronnin, F. What is a good evaluation measure for semantic segmentation? In Proceedings of the British Machine Vision Conference, Bristol, UK, 9–13 September 2013. [Google Scholar]
- Zhang, G.; Ng, H.-F.; Wen, W.; Hsu, L.-T. 3D Mapping Database Aided GNSS Based Collaborative Positioning Using Factor Graph Optimization. IEEE Trans. Intell. Transp. Syst. 2020, 1–13. [Google Scholar] [CrossRef]
- Ng, H.-F.; Zhang, G.; Hsu, L.-T. A Computation Effective Range-Based 3D Mapping Aided GNSS with NLOS Correction Method. J. Navig. 2020, 73, 1202–1222. [Google Scholar] [CrossRef]
- Realini, E.; Reguzzoni, M. goGPS: Open source software for enhancing the accuracy of low-cost receivers by single-frequency relative kinematic positioning. Meas. Sci. Technol. 2013, 24, 115010. [Google Scholar] [CrossRef]
- Bloice, M.D.; Roth, P.M.; Holzinger, A. Biomedical image augmentation using Augmentor. Bioinformatics 2019, 35, 4522–4524. [Google Scholar] [CrossRef]
- Bell, S.; Upchurch, P.; Snavely, N.; Bala, K. Material recognition in the wild with the Materials in Context Database. In Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015; pp. 3479–3487. [Google Scholar]
Method | ||
---|---|---|
Dice | 0.1813 | 0.6686 |
Jaccard | 0.1567 | 0.5399 |
BF | 0.1387 | 0.4275 |
Loc. | Experimental Images | |||
---|---|---|---|---|
1 | The Hong Kong Polytechnic University, Hung Hom | |||
Overview | 1.1 | 1.2 | 1.3 | |
Overview | Obscured | Concealed | Obscured | |
2 | Isquare, Tsim Sha Tsui | |||
Overview | 2.1 | 2.2 | 2.3 | |
Overview | Distinctive | Distinctive | Distinctive | |
3 | East Tsim Sha Tsui | |||
Overview | 3.1 | 3.2 | 3.3 | |
Overview | Symmetrical | Insufficient | Insufficient |
Loc. | Deviation from Ground Truth Error. Unit: Meter. | ||||
---|---|---|---|---|---|
Semantic VPS (Combined) | Skyline Matching | 3DMA | WLS | NMEA | |
1.1 | 7.07 | 22.92 | 7.96 | 17.66 | 36.24 |
1.2 | 4.34 | 22.62 | |||
1.3 | 5.28 | 7.14 | |||
1. Avg. | 5.56 | 17.56 | |||
2.1 | 0.66 | 14.80 | 6.87 | 23.29 | 7.94 |
2.2 | 1.83 | 1.58 | |||
2.3 | 3.43 | 2.89 | |||
2. Avg. | 1.97 | 6.42 | |||
3.1 | 29.89 | 13.57 | 18.80 | 46.58 | 18.89 |
3.2 | 6.61 | 25.53 | |||
3.3 | 10.53 | 24.80 | |||
3. Avg. | 15.68 | 21.30 | |||
All Avg. | 7.74 | 15.09 | 11.21 | 29.18 | 21.02 |
Loc. | ||||
1 | 1.1 | 1.2 | 1.3 | |
2 | 2.1 | 2.2 | 2.3 | |
3 | 3.1 | 3.2 | 3.3 | |
Reality | BIM | |
---|---|---|
Textured | ||
Labelled |
Loc. | Deviation from Ground Truth. Unit: Degrees. | |||||
---|---|---|---|---|---|---|
Semantic VPS | Smartphone IMU | |||||
1.1 | −4 | 0 | −1 | −27 | −2.0 | 1.0 |
1.2 | 3 | 2 | −2 | 7 | 0.5 | −0.5 |
1.3 | 3 | 2 | -1 | 18 | −0.5 | 0.5 |
1. Avg. | 3.3 | 1.3 | 1.3 | 17.3 | 1.0 | 0.6 |
2.1 | 5 | 1 | −2 | 11 | 0.5 | −1.0 |
2.2 | −3 | −1 | 0 | 18 | 2.0 | 0.0 |
2.3 | 1 | 2 | −2 | 19 | −2.0 | 0.5 |
2. Avg. | 3 | 1.3 | 1.3 | 16 | 1.5 | 0.5 |
3.1 | 2 | 2 | −2 | 31 | 1.0 | −1.5 |
3.2 | 0 | 1 | 0 | 28 | 0.5 | −0.2 |
3.3 | 0 | −2 | −2 | 27 | −0.5 | −0.2 |
3. Avg. | 0.6 | 1.7 | 1.3 | 28.6 | 0.6 | 1.8 |
All Avg. | 2.3 | 1.4 | 1.3 | 20.6 | 1.0 | 1.0 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Lee, M.J.L.; Hsu, L.-T.; Ng, H.-F. Semantic VPS for Smartphone Localization in Challenging Urban Environments. Sensors 2021, 21, 6137. https://doi.org/10.3390/s21186137
Lee MJL, Hsu L-T, Ng H-F. Semantic VPS for Smartphone Localization in Challenging Urban Environments. Sensors. 2021; 21(18):6137. https://doi.org/10.3390/s21186137
Chicago/Turabian StyleLee, Max Jwo Lem, Li-Ta Hsu, and Hoi-Fung Ng. 2021. "Semantic VPS for Smartphone Localization in Challenging Urban Environments" Sensors 21, no. 18: 6137. https://doi.org/10.3390/s21186137
APA StyleLee, M. J. L., Hsu, L. -T., & Ng, H. -F. (2021). Semantic VPS for Smartphone Localization in Challenging Urban Environments. Sensors, 21(18), 6137. https://doi.org/10.3390/s21186137