A Bidirectional Scoring Strategy-Based Transformation Matrix Estimation of Dynamic Factors in Environmental Sensing
Abstract
1. Introduction
2. Overall Structure
2.1. Bidirectional Scoring Strategy for the Filtering of Dynamic Feature Points
2.2. Transformation Matrix Estimation
- All values of are calculated.
- The mean w of all is found.
- is judged: If , and are correct similarities, and they are retained; otherwise, they are deleted.
- The filtered point pair with the correct similarity is taken as the initial iterative feature point pair for the RANSAC algorithm.
- The point pair with the correct similarity is used as a candidate matching feature set. Four groups are randomly selected to establish equations and calculate the unknowns in the transformation matrix M for the estimation of the transformation matrix.
- The distances between other feature points and the candidate matching points are calculated by using the transformation model, and the threshold r is set. When the distance is less than this threshold, the feature point is determined to be an inlier; otherwise, it is an outlier.
- The inliers are used to re-estimate the transformation matrix for N iterations.
3. Experiments and Analysis
3.1. Experimental Environment and Datasets
3.2. Feature Point Matching Based on the Bidirectional Scoring Strategy
3.3. Feature Point Filtering Based on the Estimation of a Transformation Matrix
3.4. Three-Dimensional Pose Tracking Accuracy
4. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
SLAM | Simultaneous localization and mapping |
TUM | Technical University of Munich |
RGB-D | Red–green–blue-depth |
ORB | Oriented FAST and rotated BRIEF |
ORB-SLAM2 | Oriented FAST and rotated BRIEF SLAM II |
ORB-SLAM3 | Oriented FAST and rotated BRIEF SLAM III |
RANSAC | Random sample consensus |
SegNet | Semantic segmentation network |
Mask R-CNN | Mask region-based convolutional neural network |
YOLO | You only look once |
DS-SLAM | Dynamic semantic SLAM |
DynaSLAM | Dynamic SLAM |
CNN | Convolutional neural network |
APE | Absolute pose error |
RMSE | Root-mean-squared error |
SSEs | Sum of squared errors |
References
- Montemerlo, M.; Thrun, S.; Koller, D.; Wegbreit, B. FastSLAM: A factored solution to the simultaneous localization and mapping problem. In Proceedings of the AAAI-02: Eighteenth National Conference on Artificial Intelligence, Edmonton, AL, Canada, 28 July–1 August 2002; Volume 593598. [Google Scholar]
- Chen, H.; Guo, P.; Li, P.; Lee, G.H.; Chirikjian, G. Multi-person 3D pose estimation in crowded scenes based on multi-view geometry. In Proceedings of the Computer Vision–ECCV 2020: 16th European Conference, Proceedings, Part III, Glasgow, UK, 23–28 August 2020; Springer: Berlin/Heidelberg, Germany, 2020; pp. 541–557. [Google Scholar]
- Henein, M.; Zhang, J.; Mahony, R.; Ila, V. Dynamic SLAM: The need for speed. In Proceedings of the 2020 IEEE International Conference on Robotics and Automation (ICRA), Paris, France, 31 May–31 August 2020; pp. 2123–2129. [Google Scholar]
- Cheng, Z.Q.; Chen, Y.; Martin, R.R.; Lai, Y.K.; Wang, A. SuperMatching: Feature Matching Using Supersymmetric Geometric Constraints. IEEE Trans. Vis. Comput. Graph. 2013, 19, 1885–1894. [Google Scholar] [CrossRef]
- Lu, X.; Manduchi, R. Wide baseline feature matching using the cross-epipolar ordering constraint. In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004, Washington, DC, USA, 27 June–2 July 2004; Volume 1, p. I. [Google Scholar] [CrossRef]
- Yu, C.; Liu, Z.; Liu, X.-J.; Xie, F.; Yang, Y.; Wei, Q.; Fei, Q. DS-SLAM: A semantic visual SLAM towards dynamic environments. In Proceedings of the 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, 1–5 October 2018; pp. 1168–1174. [Google Scholar]
- Mur-Artal, R.; Tardós, J.D. ORB-SLAM2: An open-source SLAM system for monocular, stereo, and RGB-D cameras. IEEE Trans. Robot. 2017, 33, 1255–1262. [Google Scholar] [CrossRef]
- Lopez-Molina, C.; Baets, B.D.; Bustince, H. Quantitative error measures for edge detection. Pattern Recognit. 2013, 46, 1125–1139. [Google Scholar] [CrossRef]
- Derpanis, K.G. Overview of the RANSAC Algorithm. Image Rochester NY 2010, 4, 2–3. [Google Scholar]
- Cashbaugh, J.; Kitts, C. Automatic Calculation of a Transformation Matrix Between Two Frames. IEEE Access 2018, 6, 9614–9622. [Google Scholar] [CrossRef]
- Badrinarayanan, V.; Kendall, A.; Cipolla, R. SegNet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 2481–2495. [Google Scholar] [CrossRef]
- He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask R-CNN. In Proceedings of the IEEE International Conference on Computer Vision, Venice, Italy, 22–29 October 2017; pp. 2961–2969. [Google Scholar]
- Tian, Y.; Yang, G.; Wang, Z.; Wang, H.; Li, E.; Liang, Z. Apple detection during different growth stages in orchards using the improved YOLO-V3 model. Comput. Electron. Agric. 2019, 157, 417–426. [Google Scholar] [CrossRef]
- Bescos, B.; Fácil, J.M.; Civera, J.; Neira, J. DynaSLAM: Tracking, mapping, and inpainting in dynamic scenes. IEEE Robot. Autom. Lett. 2018, 3, 4076–4083. [Google Scholar] [CrossRef]
- Li, S.; Lee, D. RGB-D SLAM in dynamic environments using static point weighting. IEEE Robot. Autom. Lett. 2017, 2, 2263–2270. [Google Scholar] [CrossRef]
- Zhang, Y.; Jin, R.; Zhou, Z. Understanding bag-of-words model: A statistical framework. Int. J. Mach. Learn. Cybern. 2010, 1, 43–52. [Google Scholar] [CrossRef]
- Wang, L.; Zhang, Y.; Feng, J. On the Euclidean distance of images. IEEE Trans. Pattern Anal. Mach. Intell. 2005, 27, 1334–1339. [Google Scholar] [CrossRef]
- Roy, P.; Dutta, S.; Dey, N.; Dey, G.; Ray, S. Adaptive thresholding: A comparative study. In Proceedings of the 2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT), Kanyakumari, India, 10–11 July 2014. [Google Scholar]
- Campos, C.; Elvira, R.; Rodríguez, J.J.G.; Montiel, J.M.; Tardós, J.D. ORB-SLAM3: An accurate open-source library for visual, visual-inertial, and multimap SLAM. IEEE Trans. Robot. 2021, 37, 1874–1890. [Google Scholar] [CrossRef]
Mean | RMSE | |
---|---|---|
60% | 0.0652 | 0.0693 |
70% | 0.0447 | 0.0481 |
80% | 0.0281 | 0.0235 |
90% | 0.0637 | 0.0693 |
Comparative Experiments | ORB-SLAM3 | DynaSLAM | Proposed | Improvement on DynaSLAM | |||
---|---|---|---|---|---|---|---|
Feature Matching and APE | Matches | RMSE | Matches | RMSE | Matches | RMSE | RMSE |
freiburg3_sitting_halfsphere | 462 | 0.0430 | 198 | 0.0245 | 146 | 0.0240 | 2.91% |
freiburg3_sitting_rpy | 396 | 0.9839 | 176 | 0.9460 | 122 | 0.2591 | 72.60% |
freiburg3_sitting_static | 411 | 0.5759 | 171 | 0.2147 | 133 | 0.1270 | 40.87% |
freiburg3_sitting_xyz | 389 | 0.0270 | 185 | 0.0210 | 145 | 0.0175 | 16.77% |
freiburg3_walking_halfsphere | 367 | 0.7125 | 163 | 0.0230 | 159 | 0.0225 | 2.19% |
freiburg3_walking_rpy | 335 | 0.7551 | 153 | 0.1319 | 110 | 0.1125 | 14.69% |
freiburg3_walking_static | 459 | 2.7401 | 203 | 0.1710 | 172 | 0.1331 | 22.63% |
freiburg3_walking_xyz | 312 | 1.7032 | 189 | 0.0255 | 132 | 0.0235 | 7.68% |
Comparative Experiments | Improvement on ORB-SLAM3 | Improvement on DynaSLAM | ||||||
---|---|---|---|---|---|---|---|---|
APE | Mean | Median | RMSE | See | Mean | Median | RMSE | See |
freiburg3_sitting_halfsphere | 47.43% | 44.94% | 43.07% | 64.64% | 3.59% | 2.28% | 2.91% | 0.34% |
freiburg3_sitting_rpy | 73.69% | 72.76% | 73.66% | 92.97% | 72.63% | 72.92% | 72.60% | 64.42% |
freiburg3_sitting_static | 77.97% | 78.33% | 77.95% | 97.22% | 40.91% | 42.02% | 40.87% | 69.23% |
freiburg3_sitting_xyz | 37.00% | 39.83% | 35.26% | 50.46% | 16.89% | 21.26% | 16.77% | 32.46% |
freiburg3_walking_halfsphere | 97.14% | 97.52% | 96.84% | 99.97% | 2.87% | 1.82% | 2.19% | 2.03% |
freiburg3_walking_rpy | 84.64% | 85.21% | 85.10% | 99.36% | 15.31% | 16.52% | 14.69% | 27.90% |
freiburg3_walking_static | 95.14% | 95.14% | 95.14% | 99.86% | 22.19% | 21.85% | 22.63% | 58.92% |
freiburg3_walking_xyz | 98.71% | 98.78% | 98.62% | 99.99% | 10.63% | 12.03% | 7.68% | 22.65% |
Comparative Experiments | Improvement on ORB-SLAM3 | Improvement on DynaSLAM | ||||||
---|---|---|---|---|---|---|---|---|
RPE | Mean | Median | RMSE | See | Mean | Median | RMSE | See |
rgbd_bonn_balloon | 68.37% | 82.37% | 54.98% | 24.01% | 62.28% | 61.03% | 49.86% | 27.07% |
rgbd_bonn_crowd3 | 63.57% | 68.81% | 52.98% | 3.95% | 39.89% | 43.82% | 31.10% | 3.52% |
rgbd_bonn_person_tracking2 | 37.23% | 31.52% | 30.26% | 22.58% | 37.61% | 50.10% | 23.73% | 4.05% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, B.; Cheng, X.; Wang, J.; Jiao, L. A Bidirectional Scoring Strategy-Based Transformation Matrix Estimation of Dynamic Factors in Environmental Sensing. Remote Sens. 2024, 16, 723. https://doi.org/10.3390/rs16040723
Wang B, Cheng X, Wang J, Jiao L. A Bidirectional Scoring Strategy-Based Transformation Matrix Estimation of Dynamic Factors in Environmental Sensing. Remote Sensing. 2024; 16(4):723. https://doi.org/10.3390/rs16040723
Chicago/Turabian StyleWang, Bo, Xina Cheng, Jialiang Wang, and Licheng Jiao. 2024. "A Bidirectional Scoring Strategy-Based Transformation Matrix Estimation of Dynamic Factors in Environmental Sensing" Remote Sensing 16, no. 4: 723. https://doi.org/10.3390/rs16040723
APA StyleWang, B., Cheng, X., Wang, J., & Jiao, L. (2024). A Bidirectional Scoring Strategy-Based Transformation Matrix Estimation of Dynamic Factors in Environmental Sensing. Remote Sensing, 16(4), 723. https://doi.org/10.3390/rs16040723