VEPL Dataset: A Vegetation Encroachment in Power Line Corridors Dataset for Semantic Segmentation of Drone Aerial Orthomosaics
Abstract
:1. Summary
- The power line corridor class has a very low pixel percentage versus the vegetation and the background classes, and also, due to the embedding vector similarity of objects in neural networks, and the difference in geometry between the vegetation class (polygon) and power line corridors (lines), it becomes a challenge to train [3];
- The vegetation class is only one, but an extensive variety exists, with different colors, shapes, and heights;
- There are many cases where power lines overlap vegetation, making semantic segmentation more complicated.
- Providing a dataset of optic images. This was acquired with a consumer-grade UAV, which is an accessible and low-price tool for companies and governments, making it simple to replicate and use Deep Learning techniques [17];
- Showing a way to obtain a multi-label orthorectified paired dataset, in which image-masks pairs allow the training of Deep Learning models for the semantic segmentation of vegetation encroachment phenomena.
2. Data Description
3. Methods
3.1. Data Acquisition and Processing
3.2. Data Labelling
- To create a union between polygons of the vegetation class and the power line corridor class;
- To use an Identity GIS tool between the previous result and the area of interest. The output is a polygon layer that contains all the 3 classes. Rasterize the previous layer using the “Class” field;
- To export the previous rater to a .tiff format with 8-bit radiometric resolution. The whole process should guarantee the use of the same spatial reference system. Figure 6 shows the labeling steps and the final raster output.
3.3. Data Tessellation and Imbalance Check
3.4. Dataset Bias
Data Augmentation
- Traditional: Images are transformed using a combination of affine transformations (rotation, reflection, scaling, and shearing) and color or spectral modification;
- Synthetic: Based on Deep Learning models like Generative Adversarial Networks (GANs), where an unsupervised generation of images is performed. One of the networks generates images (the generator) which tries to fool another network (the discriminator) that is trained to distinguish fake from real images. Other strategies are CNN-based methods, like random erasing [26].
- Geometric augmentation: alters the image’s geometry to make neural networks agnostic to object changes in position and orientation [27]. It may perform random rotations, grid distortion, horizontal flips, shift scale, and elastic deformation. We implemented the Albumentations library for the geometric augmentation of the images and corresponding masks of the dataset [23]. The total pairs obtained through this process were 3724;
- Spectral augmentation: pursues to make models robust to image changes in lighting and color [27]. The spectral augmentation techniques used in the original VEPL dataset were Random Brightness and Contrast, Hue Saturation Changes, Gaussian Blur Filter, Gamma Correction, and CLAHE (Contrast Limited Adaptive Histogram Equalization) [23]. The total image pairs obtained through spectral augmentation were 3192.
4. Conclusions
5. Future Work
- To add other classes to the dataset, like power poles and conductors, among others. Also, to increase the dataset size in terms of a larger number of orthomosaics, including other types of vegetation;
- To include DSM information for making inferences about height and obtain a more precise segmentation. Differences in height between the vegetation and the power lines can be determined, creating, in this way, an alert of possible contact in a 3D space;
- To use other techniques of data augmentation, such as synthetic augmentation;
- To implement different Deep Learning models on the proposed dataset and evaluate their performance.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Ahmad, J.; Malik, A.S.; Xia, L.; Ashikin, N. Vegetation encroachment monitoring for transmission lines right-of-ways: A survey. Electr. Power Syst. Res. 2013, 95, 339–352. [Google Scholar] [CrossRef]
- Nguyen, V.N.; Jenssen, R.; Roverso, D. Automatic autonomous vision-based power line inspection: A review of current status and the potential role of deep learning. Int. J. Electr. Power Energy Syst. 2018, 99, 107–120. [Google Scholar] [CrossRef] [Green Version]
- Wang, J.; Yang, Y.; Mao, J.; Huang, Z.; Huang, C.; Xu, W. CNN-RNN: A unified framework for multi-label image classification. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 2285–2294. [Google Scholar]
- Azevedo, F.; Dias, A.; Almeida, J.; Oliveira, A.; Ferreira, A.; Santos, T.; Martins, A.; Silva, E. Real-Time LiDAR-based Power Lines Detection for Unmanned Aerial Vehicles. In Proceedings of the 2019 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC), Porto, Portugal, 24–26 April 2019; pp. 1–8. [Google Scholar] [CrossRef] [Green Version]
- Ahmad, J.; Malik, A.S.; Abdullah, M.F.; Kamel, N.; Xia, L. A novel method for vegetation encroachment monitoring of transmission lines using a single 2D camera. Pattern Anal. Appl. 2015, 18, 419–440. [Google Scholar] [CrossRef]
- Odo, A.; McKenna, S.; Flynn, D.; Vorstius, J.B. Aerial Image Analysis Using Deep Learning for Electrical Overhead Line Network Asset Management. IEEE Access 2021, 9, 146281–146295. [Google Scholar] [CrossRef]
- Gazzea, M.; Pacevicius, M.; Dammann, D.O.; Sapronova, A.; Lunde, T.M.; Arghandeh, R. Automated Power Lines Vegetation Monitoring Using High-Resolution Satellite Imagery. IEEE Trans. Power Deliv. 2022, 37, 308–316. [Google Scholar] [CrossRef]
- Haroun, F.M.E.; Deros, S.N.M.; Bin Baharuddin, M.Z.; Din, N.M. Detection of Vegetation Encroachment in Power Transmission Line Corridor from Satellite Imagery Using Support Vector Machine: A Features Analysis Approach. Energies 2021, 14, 3393. [Google Scholar] [CrossRef]
- Han, Y.; Han, J.; Ni, Z.; Wang, W.; Jiang, H. Instance Segmentation of Transmission Line Images Based on an Improved D-SOLO Network. In Proceedings of the 2021 IEEE 3rd International Conference on Power Data Science (ICPDS), Harbin, China, 26 December 2021; pp. 40–46. [Google Scholar] [CrossRef]
- Nguyen, V.N.; Jenssen, R.; Roverso, D. LS-Net: Fast single-shot line-segment detector. Mach. Vis. Appl. 2020, 32, 12. [Google Scholar] [CrossRef]
- Zhang, R.; Yang, B.; Xiao, W.; Liang, F.; Liu, Y.; Wang, Z. Automatic Extraction of High-Voltage Power Transmission Objects from UAV Lidar Point Clouds. Remote Sens. 2019, 11, 2600. [Google Scholar] [CrossRef] [Green Version]
- Liu, X.; Miao, X.; Jiang, H.; Chen, J. Data analysis in visual power line inspection: An in-depth review of deep learning for component detection and fault diagnosis. Annu. Rev. Control 2020, 50, 253–277. [Google Scholar] [CrossRef]
- Shelhamer, E.; Long, J.; Darrell, T. Fully convolutional networks for semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 640–651. [Google Scholar] [CrossRef]
- Farabet, C.; Couprie, C.; Najman, L.; LeCun, Y. Learning Hierarchical Features for Scene Labeling. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 35, 1915–1929. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Thoma, M. A Survey of Semantic Segmentation. arXiv 2016, arXiv:1602.06541. [Google Scholar]
- Ballesteros, J.R.; Sanchez-Torres, G.; Branch-Bedoya, J.W. HAGDAVS: Height-Augmented Geo-Located Dataset for Detection and Semantic Segmentation of Vehicles in Drone Aerial Orthomosaics. Data 2022, 7, 50. [Google Scholar] [CrossRef]
- Ballesteros, J.R.; Sanchez-Torres, G.; Branch, J.W. Automatic road extraction in small urban areas of developing countries using drone imagery and Image Translation. In Proceedings of the 2021 2nd Sustainable Cities Latin America Conference (SCLA), Medellin, Colombia, 25–27 August 2021; pp. 1–6. [Google Scholar] [CrossRef]
- Restrepo, Z.; Botero, S.; González-Caro, S.; Ortiz-Yusty, C.; Alvarez-Davila, E. Guía de Flora y Fauna del Sistema Local de Área Protegidas de Envigado (Colombia); Editorial Jardín Botánico de Medellín: Medellín, Colombia, 2018. [Google Scholar]
- Ballesteros, J.R.; Sanchez-Torres, G.; Branch-Bedoya, J.W. A GIS Pipeline to Produce GeoAI Datasets from Drone Overhead Imagery. ISPRS Int. J. Geo-Inf. 2022, 11, 508. [Google Scholar] [CrossRef]
- Shorten, C.; Khoshgoftaar, T.M. A survey on Image Data Augmentation for Deep Learning. J. Big Data 2019, 6, 60. [Google Scholar] [CrossRef] [Green Version]
- Agisoft Metashape Professional (Version 1.6.3) (Software). Available online: https://www.agisoft.com (accessed on 2 April 2023).
- Torralba, A.; Efros, A.A. Unbiased Look at Dataset Bias. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Colorado Springs, CO, USA, 20–25 June 2011; pp. 1521–1528. [Google Scholar]
- Buslaev, A.; Iglovikov, V.I.; Khvedchenya, E.; Parinov, A.; Druzhinin, M.; Kalinin, A.A. Albumentations: Fast and Flexible Image Augmentations. Information 2020, 11, 125. [Google Scholar] [CrossRef] [Green Version]
- Selvaraju, R.R.; Cogswell, M.; Das, A.; Vedantam, R.; Parikh, D.; Batra, D. Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 618–626. [Google Scholar] [CrossRef] [Green Version]
- Cubuk, E.D.; Zoph, B.; Mane, D.; Vasudevan, V.; Le, Q.V. AutoAugment: Learning Augmentation Strategies From Data. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 113–123. [Google Scholar] [CrossRef]
- Mikołajczyk, A.; Grochowski, M. Data augmentation for improving deep learning in image classification problem. In Proceedings of the International Interdisciplinary PhD Workshop (IIPhDW), Swinoujscie, Poland, 9–12 May 2018; pp. 117–122. [Google Scholar] [CrossRef]
- Taylor, L.; Nitschke, G. Improving Deep Learning with Generic Data Augmentation. In Proceedings of the 2018 IEEE Symposium Series on Computational Intelligence (SSCI), Bangalore, India, 18–21 November 2018; pp. 1542–1547. [Google Scholar] [CrossRef]
Orthomosaics | Class 0 (Background) | Class 1 (Vegetation) | Class 2 (Power Lines) | Total |
---|---|---|---|---|
Cols, Rows: 26,898; 35,093 GSD 1.26, Size: 2.70 GB Format: TIFF, Bands: 3 Pixel Depth: 8 Bit Spatial Reference: GCS_WGS1984 | 103 examples | 115 examples | 115 examples | 333 examples |
Cols, Rows: 42,591; 20,637 GSD 1.06, Size: 2.46 GB Format: TIFF, Bands: 3 Pixel Depth: 8 Bit Spatial Reference: GCS_WGS19842 | 155 examples | 166 examples | 166 examples | 487 examples |
Cols, Rows: 27,885; 16,436 GSD 0.91, Size: 1.28 GB Format: TIFF, Bands: 3 Pixel Depth: 8 Bit Spatial Reference: GCS_WGS19842 | 82 examples | 94 examples | 94 examples | 270 examples |
Cols, Rows: 47,780; 41,390 GSD 1.35, Size: 5.53 GB Format: TIFF, Bands: 3 Pixel Depth: 8 Bit Spatial Reference: GCS_WGS19842 | 149 examples | 157 examples | 157 examples | 463 examples |
Item | Description |
---|---|
Field of application | Vegetation encroachment in power lines corridors |
Collected data | Aerial images |
Method for data acquisition | Drone flights |
Used drone | DJI Mavic 2 Pro |
Camera resolution and sensor size | 20 Mpx, 1 inch CMOS |
Software for processing collected data and products | Agisoft Metashape |
GSD of obtained orthomosaics and DSM | 0.9 to 1.35 cm/px |
Method of annotation | Semi-automatically in GIS |
Dataset production | Scripts in Jupyter Notebook |
Language for scripts | Python 3.8 |
Used GIS software | ArcGIS 10.8 |
Number of classes and objects | Three: vegetation, power lines, background |
Number of orthomosaics | Four |
Data collected by | Authors of this paper |
Year of collection | 2023 |
Segmentation dataset | Multi-class color |
Additional information | RGB |
Dataset size | 3.88 Gb compressed |
Image format | .jpg |
Image quantity | The dataset is divided into 12 TIF images (4 orthomosaics, DSM, and mask), 532 images (after tessellation), 3192 images (with spectral augmentation), and 3724 (with geometric augmentation) |
Image-mask size | 256 × 256 px |
RGB Image average memory size | 16 Mb |
RGB, Mask, Image spectral resolution | 3 bands |
RGB, Mask Image radiometric resolution | 8 bits |
RGB, RG-DSM, Mask Image Coordinate System | WGS1984 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cano-Solis, M.; Ballesteros, J.R.; Branch-Bedoya, J.W. VEPL Dataset: A Vegetation Encroachment in Power Line Corridors Dataset for Semantic Segmentation of Drone Aerial Orthomosaics. Data 2023, 8, 128. https://doi.org/10.3390/data8080128
Cano-Solis M, Ballesteros JR, Branch-Bedoya JW. VEPL Dataset: A Vegetation Encroachment in Power Line Corridors Dataset for Semantic Segmentation of Drone Aerial Orthomosaics. Data. 2023; 8(8):128. https://doi.org/10.3390/data8080128
Chicago/Turabian StyleCano-Solis, Mateo, John R. Ballesteros, and John W. Branch-Bedoya. 2023. "VEPL Dataset: A Vegetation Encroachment in Power Line Corridors Dataset for Semantic Segmentation of Drone Aerial Orthomosaics" Data 8, no. 8: 128. https://doi.org/10.3390/data8080128