1. Introduction
Ground-penetrating radar (GPR) is a non-destructive detection method for the subsurface environment widely used in the fields of tunnel health conditions [
1], underground pipelines [
2], and highway structural distress detection [
3]. The use of GPR in highway structural distress detection generates a huge amount of data. Even experienced practitioners spend significant time and effort interpreting GPR data. Manual interpretation is costly, inefficient, and lacks standardized reading criteria, making accuracy hard to guarantee, and, thus, greatly limiting its application in the field [
4].
Deep learning [
5] is a subclass of machine learning (ML) that uses techniques such as convolutional neural networks (CNNs) to learn features from images. CNNs, with multiple hidden layers, leverage learning from previous layers during model training [
6], exhibiting a strong capability in recognizing computer vision features and high robustness in detection tasks. In recent years, deep learning technology has been developed vigorously, and many scholars have tried to apply deep learning to the interpretation of GPR data. Li et al. [
7] used the deep learning model YOLO to detect hidden cracks in GPR images to realize the automatic identification and localization of hidden cracks in asphalt pavement. Three versions, namely YOLOv3, YOLOv4, and YOLOv5, were trained and compared, and finally, YOLOv5 was obtained. The three versions of YOLOv3, YOLOv4, and YOLOv5 were trained and compared, and finally, the conclusion that the YOLOv4 model is the most balanced deep learning model in terms of speed and actual performance in detecting hidden cracks was obtained. Qiu et al. [
8] improved the problem of false and missed detection in real-time GPR detection by improving the network structure of YOLOv5, adding the attention mechanism and data augmentation, and then the regression equation for the location information of the ground-penetrating radar’s position information to establish a regression equation, which realized the accurate location of foreign objects in underground soil. Liu et al. [
9] denoised the original GPR road detection image and then tested the accuracy of the original and denoised images using YOLOv3, and the final detection accuracy was improved by 30%. Liu et al. [
10] established a dataset of asphalt pavement distresses including settlement, hidden cracks, and loosening, and then proposed a two-way object detection model for targets of different scales based on the distribution of hidden distresses in the GPR image data. The components of the two-way model were optimized and improved based on the basic model, and the final model’s average precision (AP) for small targets was improved by 17.9%, and the combined index AP (0.5:0.95) was improved by 9.9%. Gao et al. [
11] proposed a Faster R-ConvNets object detection model, enabling intelligent detection of water damage, cracks, and uneven settlement damage in GPR images, with model precision and recall reaching 0.89.
However, training a reliable deep learning model requires a large amount of GPR data labeled with underground targets, which are often difficult to obtain due to the high cost of data collection and field validation. With less data, the scarcity of samples often leads to low recognition accuracy when performing model training. Therefore, data augmentation of GPR data remains an urgent problem. Generative adversarial networks (GANs) [
12] are widely used to enrich the number and diversity of datasets, providing a new avenue for GPR data enhancement. Yue et al. [
13] proposed an improved least squares generative adversarial network (LSGAN) model, which can generate high-precision GPR data and solve the scarcity of labeled GPR data. Zhao et al. [
14] proposed a Wasserstein GAN (WAEGAN) based on generative adversarial networks (GANs) and verified that the method is effective in simultaneously generating multiple target classes and generating realistic GPR data.
Slicing-aided hyper inference (SAHI) is a recently developed technique for small object detection on high-resolution images, which can be integrated with various types of object detection methods. It achieves further enhancement of small object detection by automatically overlapping and segmenting original images during inference, significantly improving small object detection capabilities [
15]. Wang et al. [
16] proposed an improved object detection algorithm, YOLOX_w, based on YOLOX-X, which was developed for UAV aerial images with complex backgrounds and a large number of small targets. They preprocessed images using SAHI to slice them according to set overlap rates, allowing small targets to occupy larger pixel areas on slices. Combining SAHI with data augmentation strategies effectively enhances small object detection performance in UAV aerial images. Duan et al. [
17] proposed a deep learning model-based, automatic detection method for submarine pipelines and introduced SAHI to solve the challenge of detecting small targets with inconspicuous features in large-size, low-resolution images. Muzammul et al. [
18] utilized the VisDrone-DET dataset to combine the real-time detection and recognition model RT-DETR-X with the SAHI methodology, which was used to significantly improve the model detection accuracy, especially for small targets. Currently, there is no case of SAHI being used in GPR image recognition. In a GPR image, the structural loose distress only occupies a small portion of the pixels in the original image. After using SAHI, the original image is sliced according to the set size and overlap rate, so that the structural loose distress can occupy more pixels in the slice, and has a relatively greater visibility compared to the original image, which can avoid the leakage of detection to a large extent. The result is a larger visibility compared to the original image, which largely avoids missed detections.
In this paper, we first intelligently expand the GPR images of structural loose distresses based on DCGAN. Then we use YOLOv5 to detect the structural loose distresses in the GPR images. In the training stage, SAHI is used for data preprocessing, which slices the original image into the input size of the model to eliminate the loss of the target details of the image in the compression process. This allows the model to better learn distress features. In the inference stage, SAHI is combined for detection. Through experiments, it is verified that adding DCGAN-generated images and using SAHI in the training and inference stages can significantly improve the recognition accuracy of structural loose distress.
4. Conclusions
In this paper, to solve the problem of insufficient structural loose distress data, we propose to generate images based on DCGAN for data augmentation, which solves the problem of poor training effect due to insufficient data volume, and to input the generated images together with the original images into YOLOv5 for training. The recognition accuracy then improves by 5.3%. SAHI is introduced in the training stage, and the original images are sliced according to the set slice size and overlap rate before the original image is fed into YOLOv5 model for training. SAHI is also used in the inference stage and the final recognition accuracy reaches 74.1%, which is improved by 10.8%. The experimental results show that after expanding the data, the detection accuracy can be improved. Then, after introducing the SAHI framework in the training and inference stages, the detection accuracy can be improved more obviously, and most of the missed structural loose distress can be effectively detected, thus reducing the rate of missed detection.