A Multiscale Multi-Feature Deep Learning Model for Airborne Point-Cloud Semantic Segmentation
Abstract
:1. Introduction
- (1)
- Multi-view-based methods. The main idea of this method is to project 3D point clouds into 2D images from multiple views, perform semantic segmentation processing using 2D convolutional neural networks, and finally restore the images to the original 3D point cloud for representation. In 3D radar point-cloud processing, Xu et al. [6] designed the spatial adaptive convolution (SAC) method and constructed SqueezeSegV3 for LiDAR point-cloud segmentation accordingly, which solved the common problem of different image locations affecting the feature distribution of the image, which in turn affected the network’s performance [7]. Moreover, 3D-MiniNet was designed on the basis of MiniNet to achieve higher efficiency [8,9]. Milioto et al. designed RangeNet++ with modifications and optimization on DarkNet to enable semantic segmentation; however, these methods have limited application and inevitable information loss during projection [10].
- (2)
- (3)
- On the basis of the original point0cloud approach, in the study of irregular point clouds, Qi et al. [13] designed a new deep neural network, PointNet, which can be used to learn point cloud features point by point. PointNet does not convert the point cloud into any other data representation, but directly uses it as an input network. It retains the spatial features of point clouds to the maximum extent and has strong testing performance. The subsequent deep learning networks based on original point clouds can be broadly classified into several categories, such as multilayer perceptrons, graph convolutions, and RNNs. Xiang et al. [14] proposed a neighborhood search method that selects the appropriate search method according to the characteristics of each point, thus avoiding the shortcomings of manual selecting search methods. Thomas et al. and Boulch et al. [15,16] used different convolution weights to process point clouds, an approach which also solved the alignment invariance problem. Hou et al. and Zeng et al. [17,18] used graph convolution to process point clouds. Graph convolution has a powerful local feature extraction ability and can solve structural problems in point clouds, but the structure of the network is relatively complex and computationally inefficient. Huang et al. [19] combined PointNet with an RNN and introduced the contextual information of the point cloud, which enabled the network to achieve more accurate classification. However, the PointNet network simply connects all the points and only considers global features and individual point characteristics without local information, and the results of multi-instance, multi-classification problems are not good, which limits its ability to capture local structure and identify fine-grained models. Charles et al. [20] proposed an improved PointNet++ architecture to address PointNet’s limitations. While PointNet++ can extract point cloud features at multiple scales, it is still computationally intensive in multiscale neighborhoods due to its complex structure and architecture. In addition, like the original PointNet, PointNet++’s performance in semantic segmentation of complex language environments is unknown.
2. Materials and Methods
- (1)
- Selection of multi-neighborhood features
- (2)
- Expression of multiscale features
- (3)
- The construction of the MSMF-PointNet network
2.1. Selection of Multi-Neighborhood Features
2.2. Expression of Multiscale Features
2.3. Construction of the MSMF-PointNet Model
2.3.1. MSMF-PointNet Deep Neural Network
2.3.2. Parameter Setting
3. Experimental Data
3.1. Experimental Data Information
3.2. Classification Results for Point Clouds
4. Discussion
4.1. Classification Accuracy
4.2. Comparison with Other Methods
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Conflicts of Interest
References
- Zhu, Z.; Zhou, Y.; Seto, K.C.; Stokes, E.C.; Deng, C.; Pickett, S.T.A.; Taubenböc, H.K. Understanding an Urbanizing Planet: Strategic Directions for Remote Sensing. Remote Sens. Environ. 2019, 228, 164–182. [Google Scholar] [CrossRef]
- Wang, R.; Peethambaran, J.; Chen, D. LiDAR Point Clouds to 3-D Urban Models: A Review. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 11, 606–627. [Google Scholar] [CrossRef]
- Liu, X. Airborne LiDAR for DEM generation: Some critical issues. Prog. Phys. Geogr. 2008, 32, 31–49. [Google Scholar] [CrossRef]
- Chen, J.; Zhao, Y.; Meng, C.; Liu, Y. Multi-Feature Aggregation for Semantic Segmentation of an Urban Scene Point Cloud. Remote Sen. 2022, 14, 5134. [Google Scholar] [CrossRef]
- Yu, C.; Gao, C.; Wang, J.; Yu, G.; Shen, C.; Sang, N. BiSeNet V2: Bilateral Network with Guided Aggregation for Real-Time Semantic Segmentation. Int. J. Comput. Vis. 2021, 129, 3051–3068. [Google Scholar] [CrossRef]
- Xu, C.; Wu, B.; Wang, Z.; Zhan, W.; Vajda, P.; Keutzer, K.; Tomizuka, M. SqueezeSegV3: Spatially-Adaptive Convolution for Efficient Point-Cloud Segmentation. arXiv 2020, arXiv:2004.01803. [Google Scholar]
- Wu, B.; Zhou, X.; Zhao, S.; Yue, X.; Keutzer, K. SqueezeSegV2: Improved Model Structure and Unsupervised Domain Adaptation for Road-Object Segmentation from a LiDAR Point Cloud. In Proceedings of the 2019 International Conference on Robotics and Automation (ICRA), Montreal, QC, Canada, 20–24 May 2019; pp. 4376–4382. [Google Scholar]
- Alonso, I.; Riazuelo, L.; Montesano, L.; Murillo, A.C. 3D-MiniNet: Learning a 2D Representation from Point Clouds for Fast and Efficient 3D LIDAR Semantic Segmentation. IEEE Robot. Autom. Lett. 2020, 5, 5432–5439. [Google Scholar] [CrossRef]
- Alonso, I.; Riazuelo, L.; Murillo, A.C. MiniNet: An Efficient Semantic Segmentation ConvNet for Real-Time Robotic Applications. IEEE Trans. Robot. 2020, 36, 1340–1347. [Google Scholar] [CrossRef]
- Lei, J.; Song, J.; Peng, B.; Li, W.; Pan, Z.; Huang, Q. C2FNet: A Coarse-to-Fine Network for Multi-View 3D Point Cloud Generation. IEEE Trans. Image Process. 2022, 31, 6707–6718. [Google Scholar] [CrossRef] [PubMed]
- Alkadri, M.F.; Luca, F.D.; Turrin, M.; Sariyildiz, S. A Computational Workflow for Generating A Voxel-Based Design Approach Based on Subtractive Shading Envelopes and Attribute Information of Point Cloud Data. Remote Sens. 2020, 12, 2561. [Google Scholar] [CrossRef]
- Zhao, L.; Xu, S.; Liu, L.; Ming, D.; Tao, W. SVASeg: Sparse Voxel-Based Attention for 3D LiDAR Point Cloud Semantic Segmentation. Remote Sens. 2022, 14, 4471. [Google Scholar] [CrossRef]
- Charles, R.Q.; Su, H.; Kaichun, M.; Guibas, L.J. PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 77–85. [Google Scholar] [CrossRef] [Green Version]
- Xiang, Q.; He, Y.; Wen, D. Adaptive deep learning-based neighborhood search method for point cloud. Sci. Rep. 2022, 12, 2098. [Google Scholar] [CrossRef]
- Thomas, H.; Qi, C.R.; Deschaud, J.E.; Marcotegui, B.; Goulette, F.; Guibas, L. KPConv: Flexible and Deformable Convolution for Point Clouds. In Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Republic of Korea, 27 October–2 November 2019; pp. 6410–6419. [Google Scholar] [CrossRef] [Green Version]
- Boulch, A. ConvPoint: Continuous convolutions for point cloud processing. Comput. Graph. 2020, 88, 24–34. [Google Scholar] [CrossRef] [Green Version]
- Hou, X.; Yu, X.; Liu, H. 3D Point Cloud Classification and Segmentation Model Based on Graph Convolutional Network. Laser Optoelectron. Prog. 2020, 57, 181019. [Google Scholar] [CrossRef]
- Zeng, Z.; Xu, Y.; Xie, Z.; Wan, J.; Wu, W.; Dai, W. RG-GCN: A Random Graph Based on Graph Convolution Network for Point Cloud Semantic Segmentation. Remote Sens. 2022, 14, 4055. [Google Scholar] [CrossRef]
- Huang, Q.; Wang, W.; Neumann, U. Recurrent Slice Networks for 3D Segmentation of Point Clouds. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 2626–2635. [Google Scholar] [CrossRef] [Green Version]
- Qi, C.R.; Li, Y.; Hao, S.; Guibas, L.J. PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 5105–5114. [Google Scholar]
- Li, Y.; Cai, J. Point cloud classification network based on self-attention mechanism. Comput. Electr. Eng. 2022, 104, 108451. [Google Scholar] [CrossRef]
- Yang, J.; Zhang, X.; Huang, Y. Graph Attention Feature Fusion Network for ALS Point Cloud Classification. Sensors 2021, 21, 6193. [Google Scholar] [CrossRef]
- Luo, B.; Yang, J.; Song, S.; Shi, S.; Gong, W.; Wang, A.; Du, L. Target Classification of Similar Spatial Characteristics in Complex Urban Areas by Using Multispectral LiDAR. Remote Sens. 2022, 14, 238. [Google Scholar] [CrossRef]
- Li, D.; Shen, X.; Guan, H.; Yu, Y.; Wang, H.; Zhang, G.; Li, J.; Li, D. AGF-Net: Attentive geometric feature pyramid network for land cover classification using airborne multispectral LiDAR data. Int. J. Appl. Earth Obs. Geoinf. 2022, 108, 102723. [Google Scholar] [CrossRef]
- Yao, M.M.; Li, X.M.; Wang, W.X.; Xie, L.F.; Tang, S.J. Semantic Segmentation of Indoor 3d Point Clouds by Joint Optimization of Geometric Features and Neural Networks. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2022, X-4/W2-2022, 305–310. [Google Scholar] [CrossRef]
- Meng, F.; Wang, X.; Shao, F.; Wang, D.; Hua, X. Energy-Efficient Gabor Kernels in Neural Networks with Genetic Algorithm Training Method. Electronics 2019, 8, 105. [Google Scholar] [CrossRef] [Green Version]
- Lai, X.; Yang, J.; Li, Y.; Wang, M. A Building Extraction Approach Based on the Fusion of LiDAR Point Cloud and Elevation Map Texture Features. Remote Sens. 2019, 11, 1636. [Google Scholar] [CrossRef] [Green Version]
- Yue, C.; Liu, C.; Wang, X. Classification Algorithm for Laser Point Clouds of High-steep Slopes Based on Multi-scale Dimensionality Features and SVM. Geomat. Inf. Sci. Wuhan Univ. 2016, 41, 882–888. [Google Scholar] [CrossRef]
- Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2015, arXiv:1412.6980. [Google Scholar]
- Shang, W.; Sohn, K.; Almeida, D.; Lee, H. Understanding and Improving Convolutional Neural Networks via Concatenated Rectified Linear Units. In Proceedings of the 33rd International Conference on Machine Learning, New York, NY, USA, 19–24 June 2016; pp. 2217–2225. [Google Scholar]
- Yousefhussien, M.; Kelbe, D.J.; Ientilucci, E.J.; Salvaggio, C. A Fully Convolutional Network for Semantic Labeling of 3D Point Clouds. ISPRS J. Photogramm. Remote Sens. 2017, 143, 191–204. [Google Scholar] [CrossRef]
- Garcia-Garcia, A.; Orts-Escolano, S.; Oprea, S.; Villena-Martinez, V.; Martinez-Gonzalez, P.; Garcia-Rodriguez, J. A survey on deep learning techniques for image and video semantic segmentation. Appl. Soft Comput. 2018, 70, 41–65. [Google Scholar] [CrossRef]
- Yang, Z.; Tan, B.; Pei, H.; Jiang, W. Segmentation and Multi-Scale Convolutional Neural Network-Based Classification of Airborne Laser Scanner Data. Sensors 2018, 18, 3347. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Zhao, R.; Pang, M.; Wang, J. Classifying airborne LiDAR point clouds via deep features learned by a multi-scale convolutional neural network. Int. J. Geogr. Inf. Sci. 2018, 32, 960–979. [Google Scholar] [CrossRef]
Feature Type | Name | Formula | Explanation |
---|---|---|---|
Basic features | XYZ | / | The basic features of point cloud are obtained through fusion of the point cloud and image. |
RGB | / | ||
Common features | Roughness | / | The roughness is the ratio between the surface area of a given region and its projected area. |
Eigenvalue-based features | Omnivariance | Omnivariance describes the surface undulation of the point cloud. | |
Planarity | Planarity denotes the evenness of the fitted surface in the neighborhood at this point. | ||
Linearity | The linearity of the point cloud. | ||
Verticality | Verticality describes the fitting at a point according to the relationship between the plane and horizontal plane, where Z is the unit vector in the vertical direction, and N is the normal vector of this point. |
(a) | |
Hardware | Configuration Status |
CPU CPU frequency | Intel® Core (TM) i7-8700K CPU 2.20 GHz |
RAM | 64 GB RAM |
Hard disk | 128 GB SSD + 1 TB HDD |
GPU | NVIDIA GeForce RTX 2080 |
Video memory | 12 GB |
Computing platform | CUDA10.0 cudnn7.5 |
(b) | |
Software | Configuration Status |
Operating system | Windows 10 |
Deep learning framework | Tensorflow-gpu 1.13 |
Development language | Python 3.7.2 |
Manager | Anaconda |
IDE | Pycharm |
Parameters Model | PointNet | PointNet++ | MSMF-PointNet |
---|---|---|---|
Optimizer | Adam | Adam | Adam [29] |
Activation function | Relu | Relu | Relu [30] |
Learning rate | 0.001 | 0.001 | 0.001 |
Dropout | 0.7 | 0.5 | 0.5 |
Batch Size | 32 | 32 | 32 |
Low_veg (%) | Car (%) | Imp_Surfacce (%) | Shrub (%) | Tree (%) | Roof (%) | OA (%) | |
---|---|---|---|---|---|---|---|
SS | 27.4 | 15.5 | 70.4 | 30.8 | 55.4 | 60.1 | 60.5 |
SM | 39.5 | 37.7 | 87.7 | 49.4 | 66.7 | 89.3 | 79.7 |
MM | 45.5 | 32.4 | 90.1 | 54.7 | 85.4 | 91.4 | 88.1 |
Category | Roof (%) | Tree (%) | Imp_Surface (%) | Low_Veg (%) | Shrub (%) | Average IoU (%) |
---|---|---|---|---|---|---|
IoU | 90.7 | 78.5 | 88.4 | 35.7 | 48.9 | 68.4 |
Algorithm | Power Line | Car | Low_Veg | Imp_Surfaces | Roof | Hedge | Facade | Shrub | Tree |
---|---|---|---|---|---|---|---|---|---|
PointNet [13] | 0.8 | 23.2 | 32.1 | 47.6 | 84.7 | 2.3 | 5.7 | 15.4 | 76.2 |
Ours | 0.5 | 58.7 | 65.4 | 78.4 | 95.8 | 15.8 | 16.9 | 27.5 | 92.4 |
F1 | OA | ||||||
---|---|---|---|---|---|---|---|
Low_Veg | Imp_Surface | Car | Roof | Shrub | Tree | ||
Ours | 88.1 | 94.8 | 55.8 | 97.5 | 51.1 | 93.6 | 89.1 |
WhuY4 | 82.7 | 91.4 | 74.7 | 94.3 | 47.9 | 82.8 | 84.9 |
UM | 79.0 | 89.1 | 47.7 | 92.0 | 40.9 | 77.9 | 80.8 |
NANJ2 | 88.8 | 91.2 | 66.7 | 93.6 | 55.9 | 82.6 | 85.2 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
He, P.; Ma, Z.; Fei, M.; Liu, W.; Guo, G.; Wang, M. A Multiscale Multi-Feature Deep Learning Model for Airborne Point-Cloud Semantic Segmentation. Appl. Sci. 2022, 12, 11801. https://doi.org/10.3390/app122211801
He P, Ma Z, Fei M, Liu W, Guo G, Wang M. A Multiscale Multi-Feature Deep Learning Model for Airborne Point-Cloud Semantic Segmentation. Applied Sciences. 2022; 12(22):11801. https://doi.org/10.3390/app122211801
Chicago/Turabian StyleHe, Peipei, Zheng Ma, Meiqi Fei, Wenkai Liu, Guihai Guo, and Mingwei Wang. 2022. "A Multiscale Multi-Feature Deep Learning Model for Airborne Point-Cloud Semantic Segmentation" Applied Sciences 12, no. 22: 11801. https://doi.org/10.3390/app122211801
APA StyleHe, P., Ma, Z., Fei, M., Liu, W., Guo, G., & Wang, M. (2022). A Multiscale Multi-Feature Deep Learning Model for Airborne Point-Cloud Semantic Segmentation. Applied Sciences, 12(22), 11801. https://doi.org/10.3390/app122211801