Multi-Classification of Complex Microseismic Waveforms Using Convolutional Neural Network: A Case Study in Tunnel Engineering

Zhang, Hang; Zeng, Jun; Ma, Chunchi; Li, Tianbin; Deng, Yelin; Song, Tao

doi:10.3390/s21206762

¹ State Key Laboratory of Geohazard Prevention and Geoenvironment Protection, Chengdu University of Technology, Chengdu 610059, China

² Chongqing City Construction Investment (Group) Co., Ltd., Chongqing 400023, China

³ Key Laboratory of Transportation Tunnel Engineering, Ministry of Education, Southwest Jiaotong University, Chengdu 610031, China

* Author to whom correspondence should be addressed.

Sensors 2021, 21(20), 6762; https://doi.org/10.3390/s21206762

Submission received: 26 July 2021 / Revised: 8 September 2021 / Accepted: 21 September 2021 / Published: 12 October 2021

(This article belongs to the Special Issue New Technologies and Data Analysis Methods for Seismic Monitoring)

Abstract:

The waveforms in microseismic data are varied and complex, which places high demands on automatic multi-classification; accurate classification is conducive to further signal processing and to the stability analysis of surrounding rock masses. In this study, a microseismic multi-classification (MMC) model is proposed based on the short-time Fourier transform (STFT) and a convolutional neural network (CNN). The real and imaginary parts of the time-frequency coefficients of microseismic data are input to the proposed model, which assigns each record to one of three target classes. Compared with existing methods, the MMC achieves the best performance in the multi-classification of microseismic data in terms of Precision, Recall, and F1-score, even when the waveform of a microseismic signal is similar to that of some special noise. Moreover, semisynthetic data constructed from clean microseismic data and noise are used to demonstrate the low sensitivity of the MMC to noise. Microseismic data recorded under different geological conditions are also tested to demonstrate the generality of the model, and microseismic signals with M_w ≥ 0.2 can be detected with high accuracy. The proposed method has great potential to be extended to exploration seismology and earthquake studies.

Keywords:

microseismic waveforms; multi-classification; convolutional neural network; similarity

1. Introduction

As a new real-time technology for monitoring rock mass stability, microseismic monitoring has been extensively applied in tunnels, mines, slopes, and other projects that require early warning of dynamic disasters [1,2,3,4,5,6,7,8,9]. By analyzing the microseismic data recorded during monitoring, this method helps evaluate the current fracture status of the surrounding rock and then assess and predict potential risk areas in the rock mass, which supports disaster early warning and construction. Given the complex construction environment, lengthy construction period, and continuous real-time data acquisition in tunnel projects, the recorded data are of various types and are often contaminated by background noise; they include micro-fracture signals (MS), generated by the fracture and movement of the surrounding rock, as well as blast, mechanical, and other unknown noise. Hence, effectively detecting the MS is challenging. Manual MS detection depends on the experience and seismic knowledge of personnel; the process is time-consuming and inefficient, and its accuracy cannot be ensured. Moreover, some special noise is similar to MS in the time domain, which makes MS detection even more difficult. Finally, inaccurate MS detection may corrupt the microseismic catalog and affect further analyses.

Recently, various automatic algorithms for microseismic/seismic signal detection have been proposed to resolve the above issues, such as the short-term average/long-term average (STA/LTA) [10], waveform autocorrelation, cross-correlation, and fingerprint and similarity thresholding (FAST) methods. Each method has advantages and disadvantages. The STA/LTA method easily misses target signals with a low signal-to-noise ratio (SNR) [11,12]. Waveform autocorrelation, known as template matching, requires a tremendous amount of computation when the number of templates increases [13]. Although the FAST method performs well in terms of detection sensitivity and applicability, it has considerable overhead in memory and computation [14]. With the rapid development of computing, artificial intelligence technology has been widely used in seismic/microseismic processing and disaster prediction [15,16,17,18]. Xin et al. (2021) [19] proposed an explainable time-frequency convolutional neural network (CNN) that provides excellent classification performance and explainability. Liang et al. (2021) [20] combined multiple base learners and classifiers to estimate the probability of short-term rockburst risks and achieved good performance. Saad and Chen (2020) [21] extracted waveforms from continuous microseismic data using an automatic unsupervised method, which outperformed the simple k-means and STA/LTA methods. Tang et al. (2020) [22] proposed a modified CNN with an attention mechanism to detect microseismic events.

In this study, a CNN is established for the multi-classification of microseismic waveforms in the frequency domain. The short-time Fourier transform (STFT) is used to transform the microseismic data from the time domain to the time-frequency domain, and a combination of the time-frequency coefficients is used as the input to the microseismic multi-classification (MMC) model. Microseismic data are divided into three target categories: MS, blast, and noise. The microseismic data recorded in the Grand Canyon tunnel of the Lehan Expressway (China) are used for network training, validation, and testing. The performance of the MMC is compared with that of existing methods using three metrics: Precision, Recall, and F1-score. Semisynthetic data are used to evaluate the noise sensitivity of the model. The proposed method is also tested on some special noise whose waveform is similar to that of MS with a low amplitude, and it has been applied in other projects under different geological conditions and engineering situations.

2. Method and Data Preparation

The STFT technology, also known as the windowed Fourier transform, is an effective time-frequency analysis method, whereby the time-frequency information of different time windows can be obtained by a moving window function and by performing Fourier transform in this window [23,24,25,26]. The nonstationary signal is regarded as the superposition of a series of short-time stationary signals.

\mathrm{TFT}(f, k) = \sum_{n = 0}^{N - 1} S(n) \, W(n - k) \, e^{-\frac{j 2 \pi f n}{N}}

(1)

where N is the number of time points in the recorded signal and n is the time-point index. S(n) represents the microseismic data in the time domain, and W is the moving window function. k and f are the indices of the time window and the frequency, respectively. The length of the time window was set to 256 time points, and the ‘hann’ window function was selected in this study [27].
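As a rough illustration of Equation (1), the sketch below computes windowed DFT coefficients in plain Python and splits them into real and imaginary channels. Several choices are assumptions rather than details from the paper, which states only the 256-point ‘hann’ window: the hop of half a window, the use of the window length as the DFT length, the toy signal, and the helper names `hann`, `stft`, and `padded_frame_count`. The frame-count helper follows one common padding convention (the defaults of SciPy's `scipy.signal.stft`), under which a 30,000-point record gives 236 frames and 256/2 + 1 = 129 frequency bins, consistent with the 129 × 236 input size reported in Section 3.

```python
import cmath
import math


def hann(length):
    """Periodic Hann window of the given length."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * n / length) for n in range(length)]


def stft(signal, win_len=256, hop=128):
    """Windowed DFT of each frame; returns frames[k][f] as complex numbers.

    Assumptions: the DFT length equals the window length, and the phase is
    referenced to the start of each frame rather than to the whole record.
    """
    window = hann(win_len)
    n_bins = win_len // 2 + 1  # one-sided spectrum of a real signal
    frames = []
    for start in range(0, len(signal) - win_len + 1, hop):
        seg = [signal[start + n] * window[n] for n in range(win_len)]
        frames.append([
            sum(seg[n] * cmath.exp(-2j * math.pi * f * n / win_len)
                for n in range(win_len))
            for f in range(n_bins)
        ])
    return frames


def padded_frame_count(n_samples, win_len=256, hop=128):
    """Frame count when the record is extended by half a window at both ends
    and zero-padded at the tail to a whole number of hops (assumed convention)."""
    extended = n_samples + win_len
    tail_pad = (-(extended - win_len)) % hop
    return (extended + tail_pad - win_len) // hop + 1


# Toy signal (hypothetical): a sinusoid sitting exactly on bin 4 of a 16-point window.
toy = [math.cos(2.0 * math.pi * 4 * n / 16) for n in range(64)]
coeffs = stft(toy, win_len=16, hop=8)

# Two-channel representation: real and imaginary parts of every coefficient.
real_part = [[c.real for c in frame] for frame in coeffs]
imag_part = [[c.imag for c in frame] for frame in coeffs]

# Frequency bin with the largest magnitude in the first frame.
peak_bin = max(range(len(coeffs[0])), key=lambda f: abs(coeffs[0][f]))
print(len(coeffs), len(coeffs[0]), peak_bin)
print(256 // 2 + 1, padded_frame_count(30000))
```

Keeping the real and imaginary parts as separate channels preserves the phase information that a magnitude-only spectrogram would discard.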

Microseismic data are typically collected and recorded by sensors (accelerometers or velocity sensors) in the microseismic monitoring system. Each sensor represents a channel for recording a waveform. In this work, different types of recorded data were obtained from the microseismic monitoring system installed in the Grand Canyon tunnel of the Lehan Expressway (China), which is currently the deepest buried expressway tunnel in the world. The system comprises six mono-axial accelerometers with a sensitivity of 28 V/g and a response frequency ranging from 50 Hz to 5 kHz, one data acquisition station with a sampling frequency of 20 kHz, and a data processing station. Each recorded waveform consists of 30,000 time points of voltage. Figure 1 shows the different types of microseismic data, including MS, noise, blast, mechanical, and unknown signals. Differences in propagation media, sensor arrays, and noise contamination can lead to different amplitudes in each channel of the microseismic data. In addition, some channels may fail to record the signal because of technical issues.

Generally, microseismic data can be broadly classified into different types in the time domain (Figure 1 and Figure 2). In particular, some noise waveforms (defined as similar noise) are highly similar to those of MS with a low amplitude, which makes it difficult to distinguish these two types of microseismic data in the time domain (Figure 1b,c). Therefore, the time-frequency characteristics of the microseismic data, including the real and imaginary parts of the time-frequency coefficients, are analyzed using the STFT (Figure 1 and Figure 2). The frequency range and amplitude spectra differ significantly between the different types of microseismic data. Figure 2a shows that the blast signal covers a wide range of frequencies and has the highest intensity and amplitude spectra; its peak amplitude is mostly over 4000 mV. The intensity and frequency of the MS are lower than those of the blast signal, and the waveform attenuates faster (Figure 1a,b). The similar noise has a low frequency range and low amplitude spectra, which distinguishes it clearly from the MS with a low amplitude (Figure 1c). Mechanical signals typically show regular and repeated vibrations (Figure 2b). In addition, recorded data may contain some unknown signals with unapparent features and patterns, and their amplitude spectra are the lowest (Figure 2c). Thus, different types of microseismic data can be effectively distinguished in the frequency domain by the STFT.

The MS is the signal of interest for rockburst early warning, and it must be detected. The onset time of the blast signal can be picked accurately, and the wave velocity model of the surrounding rock can be improved based on the measurable initial blast point and a regression method (such as the least squares method). The improved velocity model, combined with the microseismic sensor array, is conducive to accurate source localization. The other types of signals are of no use for these purposes. Therefore, the microseismic data are divided into three types in this study: MS, blast signal, and noise. Too few samples cannot cover the varied and complex characteristics of all categories, which leads to overfitting and poor model performance. For the experiment in this study, 1600 MS samples, 1200 blast samples, and 1500 noise samples (including 500 similar noise, 400 mechanical, and 600 unknown samples) were selected and randomly split into two parts: training (80%) and test (20%) datasets. Each sample includes six waveforms, one per sensor of the microseismic monitoring system. Moreover, k-fold cross validation was introduced to avoid overfitting and to find the optimal model. The training dataset was divided into k parts (i.e., folds), and each fold was used as a validation dataset in turn; the remaining k-1 folds were taken as the training dataset. The model was trained k times, and the optimal model was obtained based on the training results. In this study, the k value was set to 5 to ensure that the number of microseismic waveforms in each fold was greater than 4000. The test set was mainly used to record the network performance.
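The sample counts above can be checked with a short sketch. It assumes that the 80/20 split is applied per class and that folds are formed by a simple shuffled partition; the helper `k_fold_indices` and the random seed are illustrative and not taken from the paper.

```python
import random

WAVEFORMS_PER_SAMPLE = 6  # six sensor channels per sample
K_FOLDS = 5
class_counts = {"MS": 1600, "blast": 1200, "noise": 1500}

# 80/20 split of each class (per-class splitting is an assumption).
train_counts = {name: count * 4 // 5 for name, count in class_counts.items()}
test_counts = {name: class_counts[name] - train_counts[name] for name in class_counts}
n_train = sum(train_counts.values())


def k_fold_indices(n_items, k, seed=0):
    """Yield (train_indices, validation_indices) for each of the k folds."""
    indices = list(range(n_items))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        validation = folds[i]
        training = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield training, validation


fold_sizes = [len(val) for _, val in k_fold_indices(n_train, K_FOLDS)]
waveforms_per_fold = [size * WAVEFORMS_PER_SAMPLE for size in fold_sizes]

print(test_counts)         # samples held out for testing, per class
print(n_train, fold_sizes) # training samples and validation-fold sizes
print(waveforms_per_fold)  # waveforms in each validation fold
```

With these assumptions each fold holds 688 samples, i.e., 4128 waveforms, which is consistent with the stated requirement of more than 4000 waveforms per fold.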

3. Network Architecture and Training

Figure 3 shows the architecture of the proposed neural network, which includes the Input, convolutional layers, maximum pooling layers, a flatten layer, fully connected layers, and the Output. Applying the STFT to the time-domain signal and combining the real and imaginary parts of the time-frequency coefficients gives the network input, with dimensions of 129 × 236 × 2. A series of convolution and pooling operations is used to extract and compress the input features. The kernel and stride sizes of the convolutional layers were set to 3 × 3 and 1 × 1, respectively, to extract the features of the real and imaginary parts of the time-frequency coefficients. A maximum pooling layer with a kernel size of 2 × 2 and a stride of 1 × 1 was selected to compress the extracted features, which helps remove redundant information and retain the key features. In addition, a batch normalization (BN) operation and a ReLU activation function were used to process the features after each convolution operation. The BN operation normalizes the input of each layer to accelerate convergence and reduce overfitting [28]. The ReLU activation function was proposed by Glorot et al. (2011) [29]:

f(x) = \begin{cases} x, & x > 0 \\ 0, & x \leq 0 \end{cases}

(2)

The zero outputs of some neurons in Equation (2) enhance the sparsity and nonlinearity of the neural network and further alleviate overfitting. The Dropout operation is used to improve the generalization ability of the neural network and prevent overfitting by deactivating some neurons with a given probability [30]. The deeper the network, the greater the number of features extracted. After multiple 2D convolution and maximum pooling layers, the Flatten layer converts the features into 1D vectors. Next, fully connected layers perform high-level reasoning and map the learned features to the probabilities of the required output classes. A SoftMax activation function is used in the last layer of the network to output a vector of the predicted probabilities of each class. Moreover, the Adam optimizer [31] is used for weight updates together with a cross-entropy loss function. Early stopping also helps avoid overfitting. The learning rate is set to 0.005 and the batch size to 32.

Table 1 shows the parameters of the MMC model, including the layer output, activation function, kernel size, stride size, weight, and bias. Overall, the network comprises 13 layers and has 5.79 × 10⁶ trainable parameters. In this study, one-hot encoding was used for the three desired classes in the training process, and the number of epochs was set to 300.
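The output stage described above (the ReLU of Equation (2), a SoftMax over three classes, one-hot targets, and a cross-entropy loss) can be sketched in a few lines of plain Python. The class order, the numeric values, and the function names are hypothetical and serve only to show how the pieces fit together; this is not the trained network.

```python
import math

CLASSES = ["MS", "blast", "noise"]  # assumed class order


def relu(x):
    """Equation (2): pass positive values through, zero out the rest."""
    return x if x > 0 else 0.0


def softmax(logits):
    """Convert raw scores to probabilities that sum to 1."""
    shift = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(v - shift) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def one_hot(label):
    """One-hot target vector for a class name."""
    return [1.0 if name == label else 0.0 for name in CLASSES]


def cross_entropy(target, probs, eps=1e-12):
    """Cross-entropy between a one-hot target and predicted probabilities."""
    return -sum(t * math.log(p + eps) for t, p in zip(target, probs))


activations = [relu(v) for v in [-1.5, 0.0, 2.0]]  # hypothetical feature values
logits = [2.0, -1.0, 0.5]                          # hypothetical last-layer scores
probs = softmax(logits)
loss = cross_entropy(one_hot("MS"), probs)

print(activations)
print(CLASSES[probs.index(max(probs))], loss)
```

With a one-hot target, the loss reduces to the negative log of the probability assigned to the true class, so it approaches zero as that probability approaches 1.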

4. Results

4.1. Model Evaluation

The structure of the MMC, with 13 neural layers, is similar to that of the baseline neural network VGG13, which is commonly used in image classification tasks. Therefore, the standard VGG13 and VGG16 networks were selected for a comparative analysis with the MMC. For a fair comparison, the parameters of the fully connected layers in VGG13 and VGG16 were set to be the same as those of the MMC, and the same training datasets were used to train VGG13 and VGG16. Accuracy and loss are typically used to monitor the training performance of a model; a high accuracy and a low loss indicate that a model has been trained well. Table 2 shows the comparison results of VGG13, VGG16, and MMC using k-fold cross validation. As the neural network deepens, the performance of the model improves; however, the number of parameters and the computational cost (i.e., GFLOPs) also increase. In addition, the comparison between VGG13 and MMC shows that the complexity of the neural network affects the model performance even when the networks have the same number of layers. Therefore, the MMC was selected for the multi-classification of complex microseismic waveforms after weighing computing consumption, memory footprint, and model performance.

Figure 4 shows the optimal accuracy and loss values during the training of the MMC using k-fold cross validation. The accuracy and loss do not change significantly in the last 90 epochs, indicating that the model has converged and is well trained. Finally, the training and validation accuracies are 99.8% and 99.5%, and the corresponding loss values are 0.009 and 0.018, respectively. These results show that the MMC trains well.

The test dataset (including 320 MS, 240 blast, and 300 noise samples) is also used to compare the MMC with existing methods (correlation [32] and AlexNet [30]) in terms of their performance in the multi-classification of microseismic signals. Precision, Recall, and F1-score are introduced to evaluate the performance of these methods:

\mathrm{Precision}_{i} = \frac{TP_{i}}{TP_{i} + FP_{i}}

(3)

\mathrm{Recall}_{i} = \frac{TP_{i}}{TP_{i} + FN_{i}}

(4)

\mathrm{Micro\ F1\text{-}score}_{i} = 2 \times \frac{\mathrm{Precision}_{i} \times \mathrm{Recall}_{i}}{\mathrm{Precision}_{i} + \mathrm{Recall}_{i}}

(5)

\mathrm{Macro\ F1\text{-}score} = \frac{\sum_{i = 1}^{n} \mathrm{Micro\ F1\text{-}score}_{i}}{n}

(6)

where i represents the category of the target and n is the number of categories. TP, FP, and FN are the true positives, false positives, and false negatives, respectively. Precision is defined as the proportion of correct predictions among all positive predictions (TP and FP), and Recall is defined as the proportion of correct predictions among the actual positive samples (TP and FN). The F1-score is used to evaluate the comprehensive performance of the models and reduce the impact of sample size imbalance [33]. The Micro F1-score represents the performance of the method on each category, whereas the Macro F1-score represents the comprehensive performance on all categories. Generally, the higher the F1-score, the better the performance of the model. For the correlation method, a large number of waveform templates are used to provide maximum coverage of the feature information and thus ensure the classification accuracy. Table 3 and Table 4 show the experimental results of the different methods. The MMC detects 1924 MS waveforms, 1886 of which are true positives, thus outperforming the Correlation and AlexNet methods. Moreover, the MMC detects all the blast waveforms completely and accurately. The results also show that the Correlation method takes more time on the test dataset than AlexNet and the MMC. A well-trained model can efficiently deal with high-volume data and reach sufficient accuracy. In conclusion, the best performance in terms of Precision, Recall, and F1-score indicates that the MMC can effectively extract the features of microseismic data in the frequency domain.
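Equations (3) to (6) can be applied to the MS counts quoted above, as in the following sketch. It assumes that all 320 × 6 = 1920 MS waveforms of the test set are scored, which the text does not state explicitly, so the derived values are illustrative and are not taken from Table 3. The function names are hypothetical.

```python
def precision(tp, fp):
    """Equation (3): share of positive predictions that are correct."""
    return tp / (tp + fp)


def recall(tp, fn):
    """Equation (4): share of actual positives that are found."""
    return tp / (tp + fn)


def f1_score(p, r):
    """Equation (5): harmonic mean of Precision and Recall for one class."""
    return 2.0 * p * r / (p + r)


def macro_f1(per_class_f1):
    """Equation (6): unweighted mean of the per-class F1-scores."""
    return sum(per_class_f1) / len(per_class_f1)


# MS class on the test set: 1924 waveforms flagged as MS, 1886 of them correct.
tp_ms = 1886
fp_ms = 1924 - tp_ms
# Assumption: 320 test samples x 6 channels = 1920 MS waveforms in total.
fn_ms = 320 * 6 - tp_ms

p_ms = precision(tp_ms, fp_ms)
r_ms = recall(tp_ms, fn_ms)
f1_ms = f1_score(p_ms, r_ms)
print(round(p_ms, 4), round(r_ms, 4), round(f1_ms, 4))
```

Because the Macro F1-score averages the classes without weighting by sample count, a weak result on a small class lowers it as much as a weak result on a large one.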

The receiver operating characteristic (ROC) curve is introduced for the model evaluation; it represents the relationship between the true positive rate (TPR) and the false positive rate (FPR) of the classifier. The area under curve (AUC) is defined as the area enclosed by the coordinate axis under the ROC curve, and the AUC value of an ideal classifier is 1. The closer the AUC value is to 1, the better the performance of the classifier. Figure 5 shows the ROC curve of the three target classes (MS, blast, and noise). Each class is set to positive and the rest to negative. Thus, the multi-classification is transformed into binary classification, and the ROC curve and AUC value of each class can be calculated. A high AUC value of each class means that the MMC has good performance for the multiple classification of the microseismic waveforms.
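The one-vs-rest procedure can be illustrated with a small sketch. It uses the fact that the AUC equals the probability that a randomly chosen positive sample receives a higher score than a randomly chosen negative one, so no explicit ROC curve is needed. The labels and scores below are hypothetical, and `auc_one_vs_rest` is an illustrative helper rather than the evaluation code used in the study.

```python
def auc_one_vs_rest(scores, labels, positive):
    """AUC for one class treated as positive and all others as negative.

    Computed as the fraction of (positive, negative) pairs in which the
    positive sample has the higher score; ties count as one half.
    """
    pos = [s for s, y in zip(scores, labels) if y == positive]
    neg = [s for s, y in zip(scores, labels) if y != positive]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical true labels and predicted probabilities of the MS class.
labels = ["MS", "MS", "blast", "noise", "noise", "blast"]
ms_scores = [0.9, 0.6, 0.2, 0.7, 0.1, 0.3]

auc_ms = auc_one_vs_rest(ms_scores, labels, "MS")
print(auc_ms)
```

Repeating the call with the predicted probabilities of the blast and noise classes gives one AUC value per class, as plotted in Figure 5.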

To further evaluate the noise sensitivity of the model, semisynthetic data were constructed from clean data and noise (including background and Gaussian noise). Noisy signals with different SNR values were generated by scaling the noise amplitude (Figure 6). The SNR is calculated as follows [33]:

SNR = 20 \times \log_{10} (S_{Amax} / N_{Amax})

(7)

where S_{Amax} and N_{Amax} are the peak amplitudes of the signal and noise, respectively.
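A minimal sketch of how noise can be scaled to reach a target SNR under Equation (7) is given below. The decaying sinusoid standing in for a clean signal, the Gaussian noise, the random seed, and the helper names are all assumptions made for illustration; the semisynthetic data in the study were built from recorded signals and noise.

```python
import math
import random


def peak(values):
    """Peak absolute amplitude of a waveform."""
    return max(abs(v) for v in values)


def snr_db(signal, noise):
    """Equation (7): SNR in dB from the peak amplitudes of signal and noise."""
    return 20.0 * math.log10(peak(signal) / peak(noise))


def scale_noise_to_snr(signal, noise, target_db):
    """Rescale the noise so that Equation (7) yields the target SNR."""
    target_peak = peak(signal) / (10.0 ** (target_db / 20.0))
    gain = target_peak / peak(noise)
    return [gain * v for v in noise]


rng = random.Random(0)
# Hypothetical clean signal: an exponentially decaying sinusoid.
clean = [math.exp(-n / 200.0) * math.sin(2.0 * math.pi * n / 40.0) for n in range(1000)]
# Hypothetical Gaussian noise with unit standard deviation.
noise = [rng.gauss(0.0, 1.0) for _ in range(1000)]

scaled = scale_noise_to_snr(clean, noise, target_db=2.0)
noisy = [s + v for s, v in zip(clean, scaled)]  # semisynthetic noisy record
print(round(snr_db(clean, scaled), 6))
```

Because Equation (7) is based on peak amplitudes rather than signal power, a single noise spike can change the SNR noticeably, which is worth keeping in mind when comparing these values with power-based SNR definitions.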

By adding different levels of background and Gaussian noise to the clean signal, 14 types of noisy signals with SNRs ranging from −2 to 22 dB were formed. The MMC, AlexNet, and Correlation methods were applied to these semisynthetic data. Regardless of the method used, the detection rate increases as the SNR improves, and the MMC exhibits the best detection performance among these methods (Figure 7). When the SNR is close to 0 dB, the detection rate of the MMC exceeds 80%, while those of AlexNet and Correlation are 63.3% and 0, respectively. Moreover, the MMC can completely and accurately detect the microseismic signals with an SNR higher than 2 dB. Therefore, the MMC is less sensitive to background and Gaussian noise than the other methods.

4.2. Application and Discussion

The training, validation, and testing results show that the MMC can effectively classify the various types of complex microseismic data. Generally, a successful model should generalize well to different situations. Hence, the proposed method was applied to microseismic data recorded under different geological conditions (Micang Mountain tunnel). The results show that 699 MSs and 756 noise signals were detected, of which 620 MSs and 689 noise signals were already present in the previous manual detection catalog. Visual inspection shows that 35 of the remaining MSs and 41 of the remaining noise signals are new correct detections, resulting in a Precision value of 0.937 for MS, i.e., (620 + 35)/699, and 0.966 for noise, i.e., (689 + 41)/756. In addition, the moment magnitude (M_w) of the detected MSs ranges from −0.6 to 1.4 (Figure 8) [34]. MSs with M_w ≥ 0.2 are detected more reliably by the MMC; however, detecting MSs with low M_w remains a challenge (Figure 9). For the blast signal, all the samples were correctly detected, which confirms the high performance of the proposed method in blast signal detection.

Figure 1 shows that the MS with a low amplitude resembles the similar noise (defined in Section 2). To further evaluate the performance of the MMC on this issue, 50 MSs with low amplitudes and 50 similar noise samples were used. Precision, Recall, and F1-score were again used to measure and compare the performance of the different methods. The results show that the MMC outperforms the Correlation and AlexNet methods, indicating that it is more suitable for classifying complex microseismic waveforms (Table 5). Therefore, the MMC has good application prospects for the multi-classification of microseismic data in tunnels, even when some special noise is similar to MS.

The MMC can classify microseismic data well into three types (MS, blast, and noise) in the frequency domain, even when the waveform of the MS with a low amplitude is similar to that of some noise. Although the proposed method performs well for MS detection in the field, it has some limitations. Microfracture events with low M_w and MS heavily polluted by noise are difficult for the proposed method to detect accurately. An insufficient number of samples, or samples that are too specific, cannot cover the general characteristics of the target category, which can easily cause model overfitting and ineffective training. Complex monitoring environments and waveform propagation produce various types of waveforms, including natural earthquakes, rock mass ruptures, collapses, blast, mechanical, and artificial noise, so dividing the microseismic data into three types is insufficient in some cases. In addition, many models have a ‘performance bottleneck’ in actual application, which is reflected in the difficulty of improving metrics such as Precision, Recall, and F1-score. Whether this issue is due to the uneven distribution of training data or the drawbacks of the model itself remains unclear. Future research topics include adding more types of microseismic data recorded under different geological conditions and in different regions, and increasing the depth and complexity of the neural network, in a bid to obtain a trained model with high generality and accuracy. Moreover, an effective multi-classification of microseismic data can improve the further analysis of focal mechanisms, source location, and disaster warning.

5. Conclusions

This study developed a CNN-based signal processing method for the multi-classification of microseismic data. Considering the time-domain similarity between MSs with low amplitudes and some special noise, the STFT was used to enhance the characteristics of the various microseismic data and facilitate classification in the frequency domain. Compared with the Correlation and AlexNet methods, the MMC exhibited better performance in microseismic multi-classification in model training, validation, and testing. Tests on semisynthetic data showed that the model has a low sensitivity to noise. Moreover, the MMC was applied to microseismic data recorded in different tunnels, and the results suggest that the model generalizes well and performs well for MS detection in different geological settings. The proposed method largely overcomes the difficulty in distinguishing between low-amplitude MS and similar noise. Although this study is motivated by the need for efficient and automated microseismic signal processing, the proposed method could also be extended to signal analysis for disaster estimation in geophysical and geotechnical fields, such as hydraulic fracturing, mining, shale-gas exploitation, and earthquakes.

Author Contributions

Writing—original draft preparation, H.Z.; data processing and investigation, J.Z.; writing—review and editing, C.M.; supervision, T.L.; data processing, Y.D.; investigation, T.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (grant numbers 41807255 and 42177173); State Key Laboratory of Geohazard Prevention and Geoenvironment Protection Independent Research Project (grant number SKLGP2020Z010); and Sichuan Science and Technology Project (grant number 2019YJ0465).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw/processed data required to reproduce these findings cannot be shared at this time as the data also forms part of an ongoing study.

Acknowledgments

The authors would like to thank the editor, assistant editor, and anonymous reviewers for their careful reviews and insightful remarks.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Xu, N.W.; Tang, C.A.; Li, L.C.; Zhou, Z.; Sha, C.; Liang, Z.Z.; Yang, J.Y. Microseismic monitoring and stability analysis of the left bank slope in Jinping first stage hydropower station in southwestern China. Int. J. Rock Mech. Min. 2011, 48, 950–963.
2. Zhao, Y.; Yang, T.H.; Zhang, P.H.; Zhou, J.R.; Yu, Q.L.; Deng, W.X. The analysis of rock damage process based on the microseismic monitoring and numerical simulations. Tunn. Undergr. Space Technol. 2017, 49, 1–17.
3. Ma, C.C.; Li, T.B.; Zhang, H.; Wang, J.F. An evaluation and early warning method for rockburst based on EMS microseismic source parameters. Rock Soil Mech. 2018, 39, 765–774. (In Chinese)
4. Xu, N.W.; Wu, J.Y.; Dai, F.; Fan, Y.L.; Li, T.; Li, B. Comprehensive evaluation of the stability of the left-bank slope at the Baihetan hydropower station in southwest China. Bull. Eng. Geol. Environ. 2018, 77, 1567–1588.
5. Feng, G.L.; Feng, X.T.; Chen, B.R.; Xiao, Y.X.; Zhao, Z.N. Effects of structural planes on the microseismicity associated with rockburst development processes in deep tunnels of the Jinping-II Hydropower Station, China. Tunn. Undergr. Space Technol. 2019, 84, 273–280.
6. Feng, L.; Pazzi, V.; Intrieri, E.; Gracchi, T.; Gigli, G. Rockfall seismic features analysis based on in situ tests: Frequency, amplitude, and duration. J. Mt. Sci.-Engl. 2019, 16, 955–970.
7. Zhang, H.; Ma, C.C.; Li, T.B. Quantitative Evaluation of the “Non-Enclosed” Microseismic Array: A Case Study in a Deeply Buried Twin-Tube Tunnel. Energies 2019, 12, 2006.
8. Feng, G.L.; Lin, M.Q.; Yu, Y.; Fu, Y. A Microseismicity-Based Method of Rockburst Intensity Warning in Deep Tunnels in the Initial Period of Microseismic Monitoring. Energies 2020, 13, 2698.
9. Zhang, Z.; Arosio, D.; Hojat, A.; Zanzi, L. Tomographic experiments for defining the 3D velocity model of an unstable rock slope to support microseismic event interpretation. Geosciences 2020, 10, 327.
10. Allen, R.V. Automatic earthquake recognition and timing from single trace. Bull. Seismol. Soc. Am. 1978, 68, 1521–1532.
11. Withers, M.; Aster, R.; Young, C.; Beiriger, J.; Trujillo, J. A Comparison of select trigger algorithms for automated global seismic phase and event detection. Bull. Seismol. Soc. Am. 1998, 88, 95–106.
12. Trnkoczy, A. Understanding and Parameter Settings of STA/LTA Trigger Algorithm. In New Manual of Seismological Observatory Practice 2 (NMSOP-2); Deutsches GeoForschungsZentrum GFZ: Potsdam, Germany, 2012.
13. Yoon, C.E.; O’Reilly, O.; Bergen, K.J.; Beroza, G.C. Earthquake detection through computationally efficient similarity search. Sci. Adv. 2015, 1, e1501057.
14. Skoumal, R.J.; Brudzinski, M.R.; Currie, B.S.; Levy, J. Optimizing multi-station earthquake template matching through re-examination of the Youngstown, Ohio, sequence. Earth. Planet. Sci. Lett. 2014, 405, 274–280.
15. Meier, M.A.; Ross, Z.E.; Ramachandran, A.; Balakrishna, A.; Nair, S.; Kundzica, P.; Li, Z.F.; Andrews, J.; Hauksson, E.; Yue, Y.S. Reliable real-time seismic signal/noise discrimination with machine learning. J. Geophys. Res.-Solid Earth 2018, 124, 788–800.
16. Zhang, H.; Ma, C.C.; Pazzi, V.; Zou, Y.L.; Casagli, N. Microseismic Signal Denoising and Separation Based on Fully Convolutional Encoder–Decoder Network. Appl. Sci. 2020, 10, 6621.
17. Giudicepietro, F.; Esposito, A.M.; Ricciolino, P. Fast discrimination of local earthquakes using a neural approach. Seismol. Res. Lett. 2017, 88, 1089–1096.
18. Lin, B.; Wei, X.; Zhao, J.J.; Zhao, H. Automatic classification of multi-channel microseismic waveform based on DCNN-SPP. J. Appl. Geophys. 2018, 159, 446–452.
19. Xin, B.A.; Chao, Z.B.; Yao, H.C.; Zhao, X.G.; Sun, Y.J.; Ma, Y.L. Explainable time–frequency convolutional neural network for microseismic waveform classification. Inform. Sci. 2021, 546, 883–896.
20. Liang, W.; Sari, Y.A.; Zhao, G.; Mckinnon, S.D.; Wu, H. Probability Estimates of Short-Term Rockburst Risk with Ensemble Classifiers. Rock Mech. Rock Eng. 2021, 54, 1799–1814.
21. Saad, O.M.; Chen, Y. Automatic waveform-based source-location imaging using deep learning extracted microseismic signals. Geophysics 2020, 85, KS171–KS183.
22. Tang, S.B.; Wang, J.X.; Tang, C.A. Identification of Microseismic Events in Rock Engineering by a Convolutional Neural Network Combined with an Attention Mechanism. Rock Mech. Rock Eng. 2020, 54, 47–69.
23. Griffin, D.W.; Lim, J.S. Signal estimation from modified short-time Fourier transform. IEEE Trans. Acoust. Speech Signal Process. 1984, 32, 236–243.
24. Wongsaroj, W.; Hamdani, A.; Thong-Un, N.; Takahashi, H.; Kikura, H. Extended Short-Time Fourier Transform for Ultrasonic Velocity Profiler on Two-Phase Bubbly Flow Using a Single Resonant Frequency. Appl. Sci. 2018, 9, 50.
25. Khan, A.; Ko, D.K.; Lim, S.C.; Kim, H.S. Structural vibration-based classification and prediction of delamination in smart composite laminates using deep learning neural network. Compos. Part B Eng. 2019, 161, 586–594.
26. Pan, X.; Cheng, Z.F.; Zheng, Z.; Zhang, Y.H. Sparse Bayesian learning beamforming combined with short-time Fourier transform for fault detection of wind turbine blades. J. Acoust. Soc. Am. 2019, 145, 1802.
27. Zhu, W.Q.; Mousavi, S.M.; Beroza, G.C. Seismic Signal Denoising and Decomposition Using Deep Neural Networks. IEEE Trans. Geosci. Remote Sens. 2019, 57, 9476–9488.
28. Ioffe, S.; Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv 2015, arXiv:1502.03167.
29. Glorot, X.; Bordes, A.; Bengio, Y. Deep sparse rectifier neural networks. In Proceedings of the 14th International Conference on Artificial Intelligence and Statistics, Fort Lauderdale, FL, USA; 2011; pp. 315–323.
30. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Commun. ACM 2017, 60, 84–90.
31. Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. Comput. Sci. 2015. Available online: https://www.semanticscholar.org/paper/Adam%3A-A-Method-for-Stochastic-Optimization-Kingma-Ba/a6cb366736791bcccc5c8639de5a8f9636bf87e8 (accessed on 24 July 2021).
32. Sun, H.M.; Jia, R.S.; Du, Q.Q.; Fu, Y. Cross-correlation analysis and time delay estimation of a homologous micro-seismic signal based on the Hilbert–Huang transform. Comput. Geosci.-UK 2016, 91, 98–104.
33. Mousavi, S.M.; Zhu, W.Q.; Sheng, Y.X.; Beroza, G.C. Cred: A deep residual network of convolutional and recurrent units for earthquake signal detection. Sci. Rep.-UK 2019, 9, 10267.
34. Hanks, T.C.; Kanamori, H. A moment magnitude scale. J. Geophys. Res.-Atmos. 1979, 84, 2348–2350.

Figure 1. MS and similar noise data and their amplitude spectra in the Grand Canyon Tunnel. (a)–(c) MS at high and low amplitudes, and similar noise. Their amplitude spectra corresponding to the sensor are 0.1, 0.5, and 0.01 for (a)–(c), respectively. (a) MS with high amplitude; (b) MS with low amplitude; (c) Similar noise.

Figure 2. Other various types of noise data and their amplitude spectra in the Grand Canyon Tunnel. (a)–(c) Mechanical, blast, and unknown signals. Their amplitude spectra corresponding to the sensor are 0.05, 1.0, and 0.005 for (a)–(c), respectively. (a) Mechanical signal; (b) Blast signal; (c) Unknown signal.

Figure 3. Architecture of the proposed network. It includes three parts: (1) combination of real and imaginary parts of the time-frequency coefficients as input; (2) feature extraction. The rectangular block and arrow represent the convolutional layer and maximum pooling layer, respectively; (3) classification. The output is the probability of the target category. The blue region and circles represent the fully connected layer and neurons, respectively. The other different colors of the rectangles represent different operations, including ReLU activation, batch normalization, and Dropout. Conv, Maxp, and FC represent the 2D convolution, Max pooling, and fully connected layers, respectively.

Figure 4. Optimal results of model training. (a) Accuracy versus epoch; (b) Loss versus epoch. The dark blue and red lines correspond to the training and validation datasets, respectively. Accuracy increases and loss decreases as the number of epochs grows.

Figure 5. ROC curve of different classes obtained by applying the MMC to the test dataset. The AUC values of the three classes (MS, blast, and noise) are 0.995, 0.998, and 0.994, respectively.

Figure 6. Semisynthetic data with different SNR values. (a) Clean microfracture waveform; (b,c) Background and Gaussian noises, respectively; (d,e) Semisynthetic data with SNR values of 35.2 and −1.9 based on background noise; (f,g) Semisynthetic data with SNR values of 10.8 and 3.9 based on Gaussian noise.
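One way to build semisynthetic traces like those described in Figure 6 is to rescale a noise record so that its power sits at a chosen ratio to the clean waveform, then add the two. The sketch below is only illustrative: it assumes the common power-ratio definition SNR = 10·log10(P_signal / P_noise) in decibels, which may differ from the definition used in the study, and it uses a toy decaying sinusoid in place of a real microfracture waveform. The names `mean_power`, `snr_db`, and `mix_at_snr` are hypothetical helpers, not part of the MMC code.

```python
import math
import random


def mean_power(trace):
    """Average squared amplitude of a trace."""
    return sum(v * v for v in trace) / len(trace)


def snr_db(signal, noise):
    """SNR in dB, assuming the power-ratio definition 10*log10(Ps/Pn)."""
    return 10.0 * math.log10(mean_power(signal) / mean_power(noise))


def mix_at_snr(signal, noise, target_snr_db):
    """Scale the noise to the target SNR and add it to the signal.

    Returns the mixed trace and the scaled noise that was added.
    """
    wanted_noise_power = mean_power(signal) / (10.0 ** (target_snr_db / 10.0))
    scale = math.sqrt(wanted_noise_power / mean_power(noise))
    scaled_noise = [scale * n for n in noise]
    mixed = [s + n for s, n in zip(signal, scaled_noise)]
    return mixed, scaled_noise


random.seed(0)
n_samples = 1000

# Toy stand-in for a clean waveform: an exponentially decaying sinusoid.
clean = [math.exp(-t / 200.0) * math.sin(2.0 * math.pi * 0.05 * t)
         for t in range(n_samples)]

# Gaussian noise, as in panels (c), (f), and (g) of Figure 6.
gaussian = [random.gauss(0.0, 1.0) for _ in range(n_samples)]

# Target value taken from the caption; the unit is assumed to be dB.
mixed, scaled_noise = mix_at_snr(clean, gaussian, 3.9)
achieved_snr = snr_db(clean, scaled_noise)
print(f"achieved SNR: {achieved_snr:.2f} dB")
```

The same helper could be applied to a recorded background-noise segment instead of Gaussian noise, provided the noise segment has the same length as the clean waveform.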

Figure 7. Noise sensitivity evaluation of MMC, AlexNet, and Correlation methods. (a) TPR (True positive rate); (b) detection results on semisynthetic data with different SNR values.

Figure 8. MS with different M_w in Micang Mountain tunnel, China. (a)–(c) Waveforms of the channel with the highest amplitude. (a) M_w = −0.6; (b) M_w = 0.6; (c) M_w = 1.8.

Figure 9. Detection results of MMC on MS with different M_w in Micang Mountain tunnel.

Table 1. Parameters of the MMC model.

Layer	Output	Activation Function	Kernel/Stride	Parameters
Conv1	129 × 236 × 32	ReLU	3 × 3/1 × 1	608
Conv2	129 × 236 × 32	ReLU	3 × 3/1 × 1	9248
Maxp1	64 × 118 × 32		2 × 2/1 × 1	0
Conv3	64 × 118 × 64	ReLU	3 × 3/1 × 1	18,496
Conv4	64 × 118 × 64	ReLU	3 × 3/1 × 1	36,928
Maxp2	32 × 59 × 64		2 × 2/1 × 1	0
Conv5	32 × 59 × 128	ReLU	3 × 3/1 × 1	73,856
Conv6	32 × 59 × 128	ReLU	3 × 3/1 × 1	147,584
Maxp3	16 × 29 × 128		2 × 2/1 × 1	0
Conv7	16 × 29 × 256	ReLU	3 × 3/1 × 1	295,168
Conv8	16 × 29 × 256	ReLU	3 × 3/1 × 1	590,080
Maxp4	8 × 17 × 256		2 × 2/1 × 1	0
Conv9	8 × 17 × 512	ReLU	3 × 3/1 × 1	1,180,160
Conv10	8 × 17 × 512	ReLU	3 × 3/1 × 1	2,359,808
Maxp5	4 × 8 × 512		2 × 2/1 × 1	0
Flatten				0
FC1	256	ReLU		1,048,832
FC2	128	ReLU		32,896
FC3	3	Softmax		387
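The parameter counts in Table 1 can be reproduced with the usual formulas for convolutional and fully connected layers (weights plus one bias per output unit). The sketch below assumes a two-channel input (the real and imaginary parts of the time-frequency coefficients), which is what the Conv1 count of 608 implies, and it infers the FC1 input size of 4096 from the listed FC1 count rather than from the listed Maxp5 output shape. The function names are hypothetical.

```python
def conv2d_params(kernel_h, kernel_w, in_channels, out_channels):
    """Weights of a 2D convolution plus one bias per output channel."""
    return kernel_h * kernel_w * in_channels * out_channels + out_channels


def dense_params(in_features, out_features):
    """Weights of a fully connected layer plus one bias per neuron."""
    return in_features * out_features + out_features


# (name, input channels, output channels) for the ten 3 x 3 convolutions.
# Assumption: the network input has 2 channels (real and imaginary parts).
conv_layers = [
    ("Conv1", 2, 32), ("Conv2", 32, 32),
    ("Conv3", 32, 64), ("Conv4", 64, 64),
    ("Conv5", 64, 128), ("Conv6", 128, 128),
    ("Conv7", 128, 256), ("Conv8", 256, 256),
    ("Conv9", 256, 512), ("Conv10", 512, 512),
]

# Assumption: FC1 receives 4096 features, inferred from
# 1,048,832 = 4096 * 256 + 256 in Table 1.
dense_layers = [("FC1", 4096, 256), ("FC2", 256, 128), ("FC3", 128, 3)]

counts = {name: conv2d_params(3, 3, c_in, c_out)
          for name, c_in, c_out in conv_layers}
counts.update({name: dense_params(f_in, f_out)
               for name, f_in, f_out in dense_layers})
total_params = sum(counts.values())

for name, n in counts.items():
    print(f"{name:<7}{n:>12,d}")
print(f"{'Total':<7}{total_params:>12,d}")
```

Under these assumptions the per-layer values match Table 1 and the total comes to 5,794,051, in line with the 5.79 × 10⁶ parameters reported for the MMC in Table 2. Pooling, flatten, and any batch-normalization parameters are not counted here, mirroring the table. Note that 4096 differs from the 4 × 8 × 512 = 16,384 elements implied by the Maxp5 row, so the flattened size used in the sketch should be read as an inference from the parameter count only.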

Table 2. Comparison of the networks under k-fold cross validation. GFLOPs indicates the computational cost of each network. Val_Loss and Val_Accuracy are the loss and accuracy on the validation dataset, respectively.

Model	Parameters (×10⁶)	GFLOPs	Val_Loss	Val_Accuracy (%)
VGG13	10.52	0.025	0.018 ± 0.004	99.2 ± 0.5
VGG16	15.79	0.032	0.009 ± 0.005	99.5 ± 0.3
MMC	5.79	0.017	0.023 ± 0.005	99.0 ± 0.5

Table 3. Confusion matrices of the different methods on the test dataset. Rows give the predicted class and columns the true class, which is the orientation consistent with the TP, FP, and FN counts in Table 4. The overall accuracies of the Correlation, AlexNet, and MMC methods are 78.33%, 98.58%, and 99.54%, respectively.

Correlation (overall accuracy: 78.33%)
Predicted \ True class	MS	Blast	Noise
MS	1377	102	396
Blast	59	1289	28
Noise	484	49	1376
AlexNet (overall accuracy: 98.58%)
Predicted \ True class	MS	Blast	Noise
MS	1876	4	20
Blast	11	1434	3
Noise	33	2	1777
MMC (overall accuracy: 99.54%)
Predicted \ True class	MS	Blast	Noise
MS	1909	0	9
Blast	1	1440	4
Noise	10	0	1787

Table 4. Comparison between the Correlation, AlexNet, and MMC methods on the test dataset containing MS, blast, and noise data. Reported runtimes exclude overhead (model training for MMC and AlexNet took 42 and 53 min, respectively).

Correlation
Classes	Precision	Recall	Micro F1-Score	Macro F1-Score	TP	FP	FN	Reported Runtime
MS	0.734	0.717	0.726	0.794	1377	498	543	1 h
Blast	0.937	0.895	0.915		1289	87	151
Noise	0.721	0.764	0.742		1376	533	424
AlexNet
Classes	Precision	Recall	Micro F1-Score	Macro F1-Score	TP	FP	FN	Reported Runtime
MS	0.987	0.977	0.982	0.986	1876	24	44	16 s
Blast	0.990	0.996	0.993		1434	14	6
Noise	0.981	0.987	0.984		1777	35	23
MMC
Classes	Precision	Recall	Micro F1-Score	Macro F1-Score	TP	FP	FN	Reported Runtime
MS	0.995	0.994	0.995	0.996	1909	9	11	12 s
Blast	0.997	1.000	0.998		1440	5	0
Noise	0.994	0.993	0.994		1787	10	13
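The per-class values in Table 4 follow from the confusion matrices in Table 3. As a minimal sketch, the code below recomputes precision, recall, per-class F1, and the macro-averaged F1 for the MMC matrix. It assumes that rows are predicted classes and columns are true classes, since that orientation reproduces the FP and FN counts listed in Table 4. The function names are hypothetical and the code is not taken from the study.

```python
CLASSES = ["MS", "Blast", "Noise"]

# MMC confusion matrix from Table 3.
# Assumed orientation: rows = predicted class, columns = true class.
MMC_CONFUSION = [
    [1909, 0, 9],
    [1, 1440, 4],
    [10, 0, 1787],
]


def per_class_metrics(matrix):
    """Return one dict per class with TP, FP, FN, precision, recall, F1."""
    n = len(matrix)
    results = []
    for k in range(n):
        tp = matrix[k][k]
        fp = sum(matrix[k]) - tp                       # predicted k, truly another class
        fn = sum(matrix[i][k] for i in range(n)) - tp  # truly k, predicted as another class
        precision = tp / (tp + fp)
        recall = tp / (tp + fn)
        f1 = 2.0 * precision * recall / (precision + recall)
        results.append({"tp": tp, "fp": fp, "fn": fn,
                        "precision": precision, "recall": recall, "f1": f1})
    return results


def overall_accuracy(matrix):
    """Fraction of samples on the diagonal of the confusion matrix."""
    correct = sum(matrix[k][k] for k in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total


metrics = per_class_metrics(MMC_CONFUSION)
macro_f1 = sum(m["f1"] for m in metrics) / len(metrics)
accuracy = overall_accuracy(MMC_CONFUSION)

for name, m in zip(CLASSES, metrics):
    print(f"{name:<6} P={m['precision']:.3f} R={m['recall']:.3f} "
          f"F1={m['f1']:.3f} TP={m['tp']} FP={m['fp']} FN={m['fn']}")
print(f"macro F1 = {macro_f1:.3f}, accuracy = {accuracy:.4f}")
```

Swapping in the Correlation or AlexNet matrix from Table 3 gives the corresponding rows of Table 4 in the same way. The macro F1 here is the unweighted mean of the three per-class F1 values.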

Table 5. Comparison between the Correlation, AlexNet and MMC methods on the test dataset containing MS and similar noise.

Correlation
	Precision	Recall	Micro F1-Score	Macro F1-Score	TP	FP	FN
MS	0.668	0.683	0.675	0.672	205	102	95
Similar noise	0.676	0.660	0.668	0.672	198	95	102
AlexNet
	Precision	Recall	Micro F1-Score	Macro F1-Score	TP	FP	FN
MS	0.904	0.937	0.920	0.918	281	30	19
Similar noise	0.934	0.900	0.917	0.918	270	19	30
MMC
	Precision	Recall	Micro F1-Score	Macro F1-Score	TP	FP	FN
MS	0.945	0.970	0.957	0.957	291	17	9
Similar noise	0.969	0.943	0.956	0.957	283	9	17

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

