1. Introduction
Before mining engineering construction activities, such as roadway excavation and support design, the geological survey is indispensable due to its significant role in identifying the sudden instability of various rock masses caused by the randomness and concealment of local geological changes in strata [
1,
2]. However, traditional geological detection methods like core drilling and drilling peep have limitations, including poor real-time performance, high cost, limited detection range, and extensive manual involvement [
3]. Since the 1980s, scholars have introduced MWD technology into coal mines for geological investigation [
4]. Numerous studies have demonstrated the effectiveness of MWD technology in conducting real-time and efficient surveys of geology [
5,
6,
7]. However, existing MWD technology commonly suffers from low identification accuracy. To address this issue, numerous researchers have focused on enhancing the precision of identification models through building new indicators and optimizing algorithms [
8,
9].
Regarding building new indicators, the drilling specific energy (SED) was defined as the energy required to break a unit volume of rock, and there existed a remarkable correlation between SED and rock properties [
10]; the scholars established a functional relationship between SED and rock strength [
11], while the SED was also utilized to predict discontinuities, such as joints and fractures within rock masses [
12]. Leung et al. [
13] identified the limitations of SED in terms of low specificity and high variance; therefore, they processed the SED using a logistic function to obtain modulated specific energy, which effectively distinguished between coal and non-coal strata. Based on dimensional analysis theory, Zhang et al. [
14,
15] developed a rock drillability index (RDA) and provided corresponding ranges of RDA for different lithologies; a linear regression model was established between the RDA and rock strength. In addition, some scholars use Savitzky–Golay filtering [
16], Kalman filtering [
17], ensemble empirical mode decomposition [
18], and other methods to reduce the noise of drilling parameters, so as to improve the response sensitivity of drilling parameters to rock geology.
In terms of model algorithms, Burak et al. [
19] investigated the issue of lithology identification delays caused by sensor offset in logging while drilling, comparing the effectiveness of petrophysical methods, a combination of unsupervised and supervised machine learning approaches, and the random forest (RF) classification algorithm for lithology identification. He found that the RF model significantly improved lithology prediction accuracy by optimizing input parameters. Furthermore, based on the RF algorithm’s prediction of compressional wave velocity and lithology, he proposed a hybrid rate of penetration (ROP) prediction model and demonstrated that parameters such as compressive strength and weight on bit have a significant influence on ROP [
20]. Sun et al. [
21] developed a decoupling model between control parameters and comprehensive indicators; they predicted rock strength using support vector machines (SVMs) and decoupled comprehensive indicators, and the accuracy of the prediction model rose to 84%, while the relationships between the drilling rate index and physicomechanical rock properties was also confirmed [
22,
23]. Wang developed a lithology identification model using neural networks based on time-domain and frequency-domain features of drilling tool vibrations. This model enables the recognition of formation changes through the analysis of vibration signals, achieving an average accuracy of 89.57% in lithology classification [
24]. Liu et al. [
25] compared the identification accuracy of rock properties using BP neural networks and SVM and found that both models achieved an accuracy exceeding 80%. Furthermore, three artificial intelligence models were developed using artificial intelligence tools, such as artificial neural networks (ANNs), adaptive neuro-fuzzy inference system (ANFIS), and support vector machine (SVM), to predict the UCS of the downhole formations while drilling [
26]. Li et al. [
27] proposed a data cleaning method to correct drilling parameters and enhance their correlation with formation information. By employing a soft voting strategy to integrate classifiers, including SVM, DT, KNN, and neural network, the lithology identification model achieved an accuracy rate of 98.79% in field tests.
The aforementioned analysis indicates that numerous scholars have conducted extensive and beneficial research on the identification of strata strength and structure through MWD technology, significantly advancing the development of MWD technology in coal mining. However, external factors such as drill rig vibration [
28], drill rod buckling [
29], and rock heterogeneity [
30] contribute to significant fluctuations in drilling parameters. Consequently, there is a high overlap rate among drilling parameters, which obscures their response characteristics to differences in rock strength. This issue is particularly prominent in field drilling operations. Furthermore, control parameters (such as penetration rate) exhibit a strong coupling relationship with drilling parameters, where the influence of penetration rate can even surpass that of rock strength [
31]. As a result, this becomes a significant factor in reducing the sensitivity of drilling parameters. When various external factors cause low sensitivity between drilling parameters and rock strength, merely building new indicators and optimizing algorithms are insufficient to effectively improve the accuracy of MWD technology. Therefore, it is undoubtedly crucial to find methods for reducing the overlap rate of drilling parameters for different strength strata, while ensuring the reliability of drilling parameters to enhance the accuracy of MWD technology.
From the perspective of drilling parameters throughout the entire borehole, different strata exhibit varying degrees of overlap in their drilling parameters, with a higher overlap rate observed for strata with smaller differences in strength. However, specific concentrated distribution ranges exist for the drilling parameters of each stratum, within which significant differences are observed [
32]. In other words, the overlap rate of drilling parameters between different rock layers is significantly reduced within these ranges. By utilizing the drilling parameters within these ranges as a basis for predicting strata strength and structure, the prediction accuracy of strata information can be effectively improved. This study presents an innovative approach to identifying strata information through MWD technology by considering the frequency distribution of drilling parameters, thereby contributing significantly to improving accuracy in identifying strata information.
5. Multimodal Distribution Characteristics of Torque Frequency for Composite Rocks
The roadways in coal mines commonly feature composite strata, and the MWD technology involves drilling through various lithological strata. Each stratum is associated with a specific characteristic interval of frequency distribution for torque. Consequently, the frequency distribution of the overall borehole’s torque data will exhibit multimodal characteristics. By analyzing factors such as the number of peaks, characteristic intervals, and torque frequency, we can make predictions regarding the lithology, strength, and thickness of the strata.
5.1. Concentrated Distribution Behavior of Torque Frequency for Composite Rocks
The roof strata of coal mine roadways are often composed of mixed rock layers, and the drilling parameters exhibit corresponding frequency distributions due to the different rock layers. The overall drilling parameters for each borehole display a multimodal characteristic. By analyzing the number of peaks, characteristic intervals, and frequency counts, one can infer the lithology, strength, and thickness of the roof strata.
Figure 13 presents the torque frequency distributions for four groups of samples, revealing that the torque data for different samples exhibit a bimodal distribution. Each peak corresponds to a specific type of rock, with lower peak torques indicating lower rock strength. A Gaussian function is employed for multimodal fitting to obtain the cumulative curve, and the peak torques are statistically compiled in
Table 7.
To validate the feasibility of identifying composite rock layers using the multimodal frequency feature, the torque data from four groups of samples were mixed, resulting in the distribution shown in
Figure 14. As observed, the torque frequency exhibits five distinct peaks, corresponding to five types of rock. However, the actual number of rock types is seven, with the missing rock types being M35 and M40. This phenomenon is attributed to the overlapping frequency peak values of M25/M35 and M40/M45. Additionally, the peak torque values are 3.78, 4.76, 5.39, 6.19, and 7.26 N·m. As the strength difference of the composite samples increases, the average distance between the peaks rises from 0.9 N·m to 2.44 N·m, indicating that greater differences in rock layer strength lead to more pronounced multimodal characteristics, thus facilitating the identification of lithology and strength.
The relationship between peak torque and rock strength is illustrated in
Figure 15. As depicted, the fitting equation for rock strength and peak torque can be expressed as
Tp = 0.1
Rc + 2.0, exhibiting a determination coefficient of 0.93, indicating an incremental relationship of 0.1 N·m/MPa. Consequently, when the strength difference exceeds 5 MPa, peak torque demonstrates a commendable lithology identification effect and can serve as an effective indicator for predicting rock strength.
5.2. Quantitative Method of the Characteristic Interval of Multimodal Frequency Distribution
Unlike the single-peak distribution of frequency quantification, multimodal frequency distributions exhibit multiple extreme points, making it impossible to directly determine the peak torque using the first derivative. Therefore, this study proposes using the extremum method to solve for the frequency multimodal characteristic intervals, the principle of which is illustrated in
Figure 16.
The calculation steps for the frequency multimodal characteristic intervals based on the extremum method are as follows: First, the cumulative peak fitting equation of the torque frequency distribution of composite rock layers is obtained using Gaussian fitting [
41]. Next, the first derivative of this equation is solved to determine the locations of the extreme points, and the type of each extreme point (maximum or minimum) is identified based on the sign of the second derivative. Then, the extreme points
xi are categorized into a sequence of maxima
El and a sequence of minima
Es. After arranging them in ascending order, the distances
∆Ti between each maximum and its adjacent minimum are calculated in order. Finally, based on the peak torque and the distances of the extreme points, the characteristic intervals CIM corresponding to each frequency peak are determined, as demonstrated in Equation (7):
5.3. Thickness of Strata Calculation Model Based on Torque Frequency
Previous studies have focused on the relationship between peak torque and average values within characteristic intervals and rock strength but have not fully explored the rock layer information contained within torque frequency. Theoretically, at a fixed sampling frequency, the torque frequency is closely related to rock strength and thickness: the rock layer thickness and penetration rate together determine the frequency value, while the on-site penetration rate is primarily controlled by the rock strength. Therefore, this study aims to statistically analyze the torque frequency within the characteristic intervals, eliminate frequency differences caused by strength variations, and invert the thickness of each rock layer in conjunction with borehole depth. The specific steps are as follows.
Assuming the sampling frequency of the data monitoring system is
f, the difference in torque frequency
QR caused by varying rock strength within the
i-th characteristic interval can be expressed as shown in Equation (8):
Numerous studies have indicated that there is an inherent functional relationship between rock strength and penetration rate [
42]. Let us denote this relationship as
k. By substituting this into Equation (8), we can derive the relationship between rock strength and the difference in torque frequency, as given in Equation (9):
The torque frequency QHi within the i-th characteristic interval, which is primarily influenced by the thickness of strata, can be mathematically expressed as Equation (10).
When the torque frequency within the
i-th characteristic interval is
Qi, the torque frequency
QHi mainly related to the rock layer thickness within that interval can be expressed as in Equation (10):
Finally, based on
QHi and borehole depth
H, we obtain the thickness estimation formula for different rock layers, as shown in Equation (11):
5.4. Identification Strategy of Strata Information Based on Frequency Distribution Characteristics
In this study, the distribution characteristics of torque frequency are utilized to identify rock strength during drilling. However, the purpose of this study is to take torque data as an example to reveal the general law of drilling parameters. Therefore, for other drilling parameters such as thrust, rotational speed, and penetration rate, it also has frequency distribution characteristics similar to torque data. To facilitate its application, a process for identifying strata information based on the frequency distribution of drilling parameters is presented in
Figure 17.
For single-strength rock layers, the segmental mean method is used to process the drilling parameters. Then, the characteristic intervals are determined through frequency distribution and fitting equations. Based on the drilling parameters within these characteristic intervals, either a support vector machine (SVM) or a backpropagation (BP) neural network is established to predict rock strength. For composite rock layers, the identification of rock types and quantities is achieved using the extremum method based on multimodal frequency fitting. The peak torque is utilized to predict strength, and the frequency of parameters within the characteristic intervals is further employed to estimate rock layer thickness.
This paper proposes a method to identify rock strength based on the frequency distribution characteristics of torque data, but there are the following limitations:
- (1)
The current study focuses solely on torque data and does not incorporate other drilling parameters (such as thrust, rate of penetration, and vibration). In future work, we plan to introduce integrated multi-sensor data analysis methods to improve the accuracy and robustness of rock strength identification.
- (2)
The laboratory tests adopted a “constant rotational speed–constant penetration rate” control mode. However, in field drilling, rotational speed and penetration rate are often adjusted dynamically according to formation changes, which in turn affects the response characteristics of parameters such as torque and thrust. Therefore, our next step will focus on investigating the coupling mechanisms between control parameters and drilling parameters.
- (3)
Due to limitations in field-testing conditions, the robustness of the proposed method when applied beyond the tested strength range has not yet been fully validated. We will actively promote industrial field trials in the next phase, compare differences between laboratory and field drilling data, and refine the existing models and methods to make them more suitable for practical engineering environments.