DS-DW-TimesNet-Driven Early Warning for Downhole Near-Bit Torque Vibrations

Zhang, Tao; Li, Hao; Meng, Zhuoran; Yuan, Zongling; Wang, Mengfan; Li, Jun

doi:10.3390/pr13092700


by Tao Zhang ¹, Hao Li ¹,*, Zhuoran Meng ¹, Zongling Yuan ², Mengfan Wang ¹ and Jun Li ³

¹ Beijing Key Laboratory of High Dynamic Navigation Technology, Beijing University of Information Science and Technology, Beijing 100101, China

² No. 4 Drilling Engineering Branch, Bohai Drilling Engineering Co., Ltd., Cangzhou 062552, China

³ School of Petroleum Engineering, China University of Petroleum, Beijing 102249, China

* Author to whom correspondence should be addressed.

Processes 2025, 13(9), 2700; https://doi.org/10.3390/pr13092700

Submission received: 4 July 2025 / Revised: 29 July 2025 / Accepted: 7 August 2025 / Published: 25 August 2025

(This article belongs to the Topic Advances and Application in Intelligent Oil and Gas Field Development Technology)


Abstract

Downhole torsional vibrations, especially high-frequency torsional oscillations (HFTOs) and stick–slip phenomena, pose a serious threat to drilling operations, often resulting in tool damage, prolonged non-productive time, and significant cost increases. Traditional monitoring methods cannot promptly capture complex vibration patterns, so there is an urgent need for advanced early warning systems. This study proposes the DS-DW-TimesNet model, which improves the TimesNet framework by incorporating downsampling technology for efficient data compression, dilated convolution that can expand the temporal receptive field, and a learnable weight normalization method that can stabilize the training process, thereby enhancing the capabilities of feature extraction and long-sequence modeling. Verified using field data from the Fuman Oilfield, the results show that in terms of the mean absolute error (MAE) for 210 s predictions, this model is 77.2% and 21.8% lower than LSTM and Informer, respectively, and the inference speed is increased by 78.5% (reaching 48 milliseconds). It can provide reliable 210 s early warning windows for high-frequency torsional oscillations and 150 s early warning windows for stick–slip, exceeding industry standards and helping to improve the safety and efficiency of drilling operations.

Keywords: downhole torsional vibrations; early warning; DS-DW-TimesNet; HFTOs; stick–slip

1. Introduction

Drilling systems are used for the exploration and harvesting of oil, gas, and geothermal energy [1]. Typically, the drilling process is affected by unwanted vibrations, including axial, lateral, and torsional vibrations [2,3]. More specifically, axial vibrations manifest as bit bounce; lateral vibrations manifest as buckling; and torsional vibrations manifest as stick–slip. Industry data show that these vibrations can cause serious damage to drill strings and drill bits, such as premature failure of drill string components, excessive wear of the bit, and many other negative impacts. Among all these abnormal vibrations, torsional vibration is the most destructive, affecting approximately 40% of annual drilling depth and leading to extended non-productive time (NPT) [2,3,4,5]. It is worth noting that these vibration signals not only contain key information for predicting downhole lithology [6,7,8,9,10,11], but their early identification and warning can also effectively reduce abnormal vibrations, thereby significantly reducing risks such as bit wear. Most downhole acceleration sensors collect data at a frequency of up to 100 Hz, which is sufficient to identify traditional stick–slip [12,13,14]. Stick–slip is considered a rotational constraint that may lead to the drill string twisting due to this movement constraint. This vibration mode causes the drill bit to stop rotating (stick) for a period, then releases the restriction, and the drill bit rotates (slip) again under the action of increasing torque [12,13,14,15,16]. With advances in sensor technology enabling sampling frequencies up to 1000 Hz, HFTOs have been identified. The fundamental cause of HFTOs is usually considered to be torsional resonance at the natural frequency of the lower bottom-hole assembly (BHA), which is excited by the interaction between the drill bit and the rock. Both types of torsional vibration modes can cause BHA connection fatigue, drill bit wear, and electronic component damage [2]. 
These risks underscore the critical need for real-time torsional vibration monitoring and early warning systems, necessitating advanced monitoring solutions beyond traditional approaches [12].

Extensive research on drill string vibration modeling has been conducted worldwide, primarily focusing on two core components [16,17,18]: drill string dynamics modeling, which has evolved from low-degree-of-freedom torsional pendulum models to high-fidelity distributed parameter and finite element models [18,19,20], and bit–rock interaction modeling, which mainly includes torsional friction models and axial–torsional coupled rock/bit interaction models [21,22]. Besselink et al. pioneered the coupling of axial and torsional dynamics, proposing a rate-independent bit–rock interaction law and employing semi-analytical methods to reveal axial vibration as the root cause of torsional stick–slip, while clarifying how bit bluntness parameters regulate stability [23]. Kamel and Yigit advanced drill string dynamics modeling by integrating hoisting system dynamics with a three-phase polycrystalline diamond compact (PDC) bit cutting model, developing a “smoothness index” optimization function to quantify operational parameter effects on stick–slip and bit–bounce coupling, thus providing guidance for deep well parameter optimization [24]. Most recently, Sharma et al. focused on geothermal hard rock scenarios, comparing velocity decaying friction (VDF) and state-dependent delayed friction (SDDF) models to validate the superior field data matching of VDF, identifying critical sensitive parameters, and establishing stick–slip operational windows [25]. Although these methods have made significant contributions, they face major challenges. Vibration modeling and monitoring are complex processes whose underlying models assume ideal conditions. In actual drilling operations, these conditions cannot be consistently met due to variations in BHA, reservoirs, geology, and formations [26,27,28,29].

With the development of downhole measurement tools and intelligent algorithms, various machine learning (ML) methods have been successfully applied to drill string vibration identification [30]. Traditional approaches include Okoli et al.’s model for classifying axial/lateral vibrations using torque, rotational speed, and weight on bit (WOB) (with an accuracy of 50–85%) [31], and Saadeldin et al.’s multi-model method for predicting axial, torsional, and lateral vibration modes using surface drilling parameters (with an accuracy of 90–99%) [12]. For stick–slip vibrations, Hegde et al. pioneered the development of an ML-driven model for grading stick–slip severity [32]; Gupta et al. proposed a data scale-based model selection strategy: random forest/gradient boosting for small datasets and convolutional neural network–long short-term memory (CNN-LSTM) for large datasets [33]. Recent breakthroughs include Zha and Pham’s deep neural network (DNN), which achieved 99% accuracy in stick–slip classification using generalized data on torque, rotational speed, WOB, and triaxial acceleration [34]; Yahia et al., on the other hand, explored LSTM data-driven models, transfer learning, and hybrid models integrating physical features, evaluating the impact of normalization methods on generalization ability [35]. Kulke et al. developed a stability map algorithm based on downhole high-frequency data, suppressing HFTOs by inducing controlled stick–slip [36].

Real-time monitoring is crucial for avoiding operational risks [37,38]. Michael Yi et al. developed a Bayesian network model that predicts downhole failures (stick–slip/bit–bounce/whirl) using only surface data [39]; Millan et al. combined surface measurements with machine learning to achieve real-time vibration detection [40]. Zhao et al. developed an ML event-triggered model to capture abnormal vibration signals in real time based on drilling data [41]. Zhang et al. confirmed that HFTOs induce drilling tool failures through high-frequency downhole measurements in North America and proposed a real-time parameter adjustment scheme [42]. Elahifar et al. innovatively combined Bayesian optimized extra trees (BO_ET) with model-agnostic meta-learning to realize real-time prediction of stick–slip events using downhole data [43]. de Souza et al. integrated pre-job BHA modeling with real-time HFTO monitoring, dynamically adjusting parameters to reduce tool failure rates by 35% [44]. However, surface monitoring systems are prone to high false alarm rates due to signal attenuation and transmission delays. To overcome these limitations, hybrid approaches have emerged: Sheth et al. integrated physics-based and data-driven methods [45]; Hutahaean et al. used Bayesian-optimized Random Undersampling Boosting (RUSBoost) decision trees to predict the risk of drilling tool failures involving stick–slip [46]; and Huang et al. innovatively introduced deep reinforcement learning to autonomously optimize drilling parameters through a comprehensive reward function (incorporating stick–slip, HFTOs, and bit wear) [47]. For severity quantification, Zhang et al. combined the downhole rotational speed range predicted by eXtreme Gradient Boosting (XGBoost) with continuous wavelet transform (CWT) analysis to achieve real-time assessment of stick–slip severity [48]. 
Challenges remain, however: the LSTM vibration prediction model by Vishnumolakala et al. suffers from accuracy degradation on sequences longer than 30 s due to vanishing gradients and convergence issues [49].

The emergence of high-frequency downhole sensors and high-speed data transmission systems has enabled big data-driven drilling analytics. To address the limitations in prediction accuracy caused by data transmission delays and the inadequacies of downhole vibration data in long-sequence forecasting, this study conducted time-domain and frequency-domain characteristic analyses on five high-frequency downhole engineering parameters (weight on bit, torque, and triaxial acceleration) collected near the drill bit using a self-developed downhole engineering parameter measurement tool (with a sampling frequency of 400 Hz and a cumulative operation time of 23 h). Two characteristic operating conditions of torsional vibration were identified. This study proposes DS-DW-TimesNet, which integrates downsampling modules for computational efficiency, dilated convolutional structures to expand temporal receptive fields, and weight normalization for stable training, collectively forming a lightweight solution for torsional vibration prediction and early warning. Compared with Informer, LSTM, TimesNet-V1, and TimesNet-V2, DS-DW-TimesNet demonstrates superior effectiveness and performance.

2. Methodology

Figure 1 illustrates the workflow of the proposed DS-DW-TimesNet early warning model. The framework achieves precise torsional vibration prediction through three core stages: (a) data preprocessing, (b) model training and optimization, and (c) evaluation of prediction performance. The detailed implementation process is described as follows:

(a): Data Preprocessing:

Downhole sensors collect raw high-frequency signals. Due to the limitations of measurement-while-drilling (MWD) transmission bandwidth, the raw data undergo two-step real-time preprocessing downhole: calculating the mean and root mean square (RMS) values of acceleration/torque using a fixed 1 s window, and uploading the calculated results to the surface system, with the raw data stored in downhole tools.

After receiving the data, the surface system processes them through the model’s built-in downsampling evaluation module. First, a downsampling operation is performed. Subsequently, a threefold evaluation using Kullback–Leibler divergence, relative mean variation, and relative variance variation (Section 2.3) is conducted. Only the downsampled data that pass the verification are input into the prediction model.
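The downhole windowing step described in stage (a) can be illustrated with a short sketch. The function name `window_mean_rms`, the synthetic torque signal, and the decision to drop a trailing partial window are assumptions made for illustration; only the fixed 1 s window and the 400 Hz sampling frequency come from the text.

```python
import numpy as np

def window_mean_rms(signal, fs_hz, window_s=1.0):
    """Reduce a raw signal to one mean and one RMS value per fixed window."""
    samples_per_window = int(round(fs_hz * window_s))
    n_windows = len(signal) // samples_per_window
    # Drop the trailing partial window so every window has the same length.
    trimmed = np.asarray(signal, dtype=float)[: n_windows * samples_per_window]
    windows = trimmed.reshape(n_windows, samples_per_window)
    mean = windows.mean(axis=1)
    rms = np.sqrt((windows ** 2).mean(axis=1))
    return mean, rms

fs = 400                              # sampling frequency stated in the text (Hz)
t = np.arange(3 * fs) / fs            # 3 s of synthetic data
torque = 5.0 + 2.0 * np.sin(2 * np.pi * 10.0 * t)  # synthetic, not field data
mean, rms = window_mean_rms(torque, fs)
print(mean, rms)
```

Each 1 s window of 400 raw samples is reduced to two numbers, which is what makes the upload fit within the MWD bandwidth. The mean tracks the slow trend, while the RMS retains the energy of the oscillation that the mean averages out.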

(b): Model Training and Optimization:

The preprocessed data first undergo layer normalization for standardization before being input into the DW-TimesNet architecture. The model extracts spectral features of the input sequence through Fast Fourier Transform (FFT), identifies k periods, and calculates the corresponding amplitude weights. Based on the identified period lengths, the model reshapes the one-dimensional time series into a two-dimensional tensor representation, capturing both intra-period and inter-period trends simultaneously. In the feature extraction stage, dynamically weight-normalized dilated convolutions are applied to process the two-dimensional representations. The final output is generated through residual summation of multiple TimesBlock modules. Model optimization is achieved through performance evaluation on the validation set.

(c): Prediction Result Evaluation:

The finalized model is deployed on the test dataset, with quantitative assessment conducted using three key metrics: R², MAE, and RMSE.

Historical data from offset wells are used for model training. Drilling engineers can view prediction curves through the real-time monitoring interface, and when a risk trend of abnormal torsional vibration is identified, they can intervene in advance to reduce the occurrence of drilling accidents.

2.1. TimesNet

The TimesNet model employs TimesBlock as its backbone for time-series analysis, transforming 1D time series into 2D tensors to enhance representational capacity while simultaneously achieving unified modeling of intra-period and inter-period variations [50]. The overall architecture of the model is illustrated in Figure 2.

2.1.1. Dimensional Expansion from 1D to 2D

To uniformly represent the temporal variations within and between periods, it is first necessary to explore the periodicity of the time series. For a one-dimensional time series $X_{1D} \in \mathbb{R}^{T \times C}$ with a time length of $T$ and a channel dimension of $C$, its periodicity can be calculated via FFT on the time dimension, specifically as follows:

$$A = \mathrm{Avg}(\mathrm{Amp}(\mathrm{FFT}(X_{1D}))) \tag{1}$$

$$f_{1}, \dots, f_{k} = \underset{f_{*} \in \{1, \dots, [\frac{T}{2}]\}}{\mathrm{arg\,Topk}}(A) \tag{2}$$

$$p_{1}, \dots, p_{k} = \left[\frac{T}{f_{1}}\right], \dots, \left[\frac{T}{f_{k}}\right] \tag{3}$$

Here, $A \in \mathbb{R}^{T}$ represents the intensity of each frequency component in $X_{1D}$. The $k$ frequencies with the highest intensity, $\{f_{1}, \dots, f_{k}\}$, correspond to the most significant period lengths $\{p_{1}, \dots, p_{k}\}$. The above process is abbreviated as follows:

$$A, \{f_{1}, \dots, f_{k}\}, \{p_{1}, \dots, p_{k}\} = \mathrm{Period}(X_{1D}) \tag{4}$$

For the selected period $p_{i}$, the original one-dimensional time series $X_{1D}$ is folded. Zero-padding, denoted $\mathrm{Padding}(\cdot)$, is performed at the end of the sequence to ensure that the sequence length is divisible by the period $p_{i}$.

$$X_{2D}^{i} = \mathrm{Reshape}_{p_{i}, f_{i}}(\mathrm{Padding}(X_{1D})), \quad i \in \{1, \dots, k\} \tag{5}$$
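Equations (1)–(5) can be sketched in a few lines of NumPy. This is a minimal illustration rather than the authors' implementation: the function names `detect_periods` and `fold_to_2d` are hypothetical, the rounding in Equation (3) is assumed to be a ceiling so that the padded length $p_i \times f_i$ is never shorter than $T$, and the test signal is synthetic.

```python
import numpy as np

def detect_periods(x, k=2):
    """Eqs. (1)-(3): top-k frequencies and period lengths for x of shape (T, C)."""
    T = x.shape[0]
    # Amplitude spectrum along time, averaged over channels (Eq. 1).
    amp = np.abs(np.fft.rfft(x, axis=0)).mean(axis=1)
    amp[0] = 0.0  # exclude the zero-frequency term; Eq. (2) starts at f = 1
    freqs = np.argsort(amp)[::-1][:k]                # Eq. (2)
    periods = [int(np.ceil(T / f)) for f in freqs]   # Eq. (3), ceiling assumed
    return amp, freqs, periods

def fold_to_2d(x, period, freq):
    """Eq. (5): zero-pad at the end, then fold into (freq, period, C)."""
    T, C = x.shape
    pad = np.zeros((period * freq - T, C))
    return np.concatenate([x, pad], axis=0).reshape(freq, period, C)

# Synthetic two-channel series: a strong 24-step cycle plus a weak 8-step cycle.
T = 96
t = np.arange(T)
base = np.sin(2 * np.pi * t / 24) + 0.3 * np.sin(2 * np.pi * t / 8)
x = np.stack([base, 0.5 * base], axis=1)             # shape (96, 2)

amp, freqs, periods = detect_periods(x, k=2)
x2d = fold_to_2d(x, periods[0], int(freqs[0]))
x2d_padded = fold_to_2d(x[:90], 24, 4)               # 6 zeros appended
print(freqs, periods, x2d.shape)
```

After folding, each row holds one full period, so neighbouring columns carry intra-period variation and neighbouring rows carry inter-period variation. That layout is what lets a 2D convolution see both at once.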

2.1.2. 2D Temporal Feature Extraction

The architecture comprises multiple stacked TimesBlock modules. The input sequence first passes through an embedding layer to obtain the deep feature representation $X_{2D}^{0}$. Each subsequent TimesBlock then progressively processes the hierarchical features $X_{2D}^{m-1}$ from its preceding layer, employing 2D convolutions to extract temporal patterns through learned 2D representations.

$$\hat{X}_{2D}^{m,i} = \mathrm{Inception}(X_{2D}^{m,i}) \tag{6}$$

2.1.3. Dimensionality Reduction and Adaptive Fusion

The model first projects the 2D temporal features back to 1D space while preserving their original period lengths. It then generates the final output through an amplitude-weighted summation of all 1D sequences, where the weights correspond to the spectral amplitudes of their associated frequency components.

$$\hat{X}_{1D}^{m,i} = \mathrm{Trunc}(\mathrm{Reshape}_{1,(p_{i} \times f_{i})}(\hat{X}_{2D}^{m,i})), \quad i \in \{1, \dots, k\} \tag{7}$$

$$\hat{A}_{f_{1}}^{m-1}, \dots, \hat{A}_{f_{k}}^{m-1} = \mathrm{Softmax}(A_{f_{1}}^{m-1}, \dots, A_{f_{k}}^{m-1}) \tag{8}$$

$$X_{1D}^{m} = \sum_{i=1}^{k} \hat{A}_{f_{i}}^{m-1} \times \hat{X}_{1D}^{m,i} \tag{9}$$

In $\hat{X}_{1D}^{m,i}$, $\mathrm{Trunc}(\cdot)$ represents the removal of the zero-padding added by the $\mathrm{Padding}(\cdot)$ operation in Step 1.
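The truncation and amplitude-weighted fusion of Equations (7)–(9) can be sketched as follows. The function name `aggregate_branches` and the toy inputs are assumptions for illustration, and each branch is assumed to have already been reshaped from 2D back to a padded 1D sequence.

```python
import numpy as np

def aggregate_branches(branches_padded, amplitudes, T):
    """Eqs. (7)-(9): truncate each branch to T steps, then sum with softmax weights."""
    a = np.asarray(amplitudes, dtype=float)
    weights = np.exp(a - a.max())       # subtract the max for numerical stability
    weights /= weights.sum()            # Eq. (8)
    # Trunc(.) removes the zero-padding appended before folding (Eq. 7).
    truncated = np.stack([b[:T] for b in branches_padded], axis=0)  # (k, T, C)
    fused = np.tensordot(weights, truncated, axes=1)                # Eq. (9)
    return fused, weights

T, C = 5, 1
branch_a = np.ones((6, C))              # toy branch, padded to length 6
branch_b = 3.0 * np.ones((8, C))        # toy branch, padded to length 8
fused, weights = aggregate_branches([branch_a, branch_b], [2.0, 2.0], T)
print(weights, fused.ravel())
```

With equal amplitudes the two branches receive equal weight, so the fused output is their plain average. A branch whose frequency has a larger spectral amplitude would contribute proportionally more.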

2.1.4. Residual Architecture

As illustrated in Figure 3, adjacent TimesBlock modules are interconnected through residual connections. Specifically, the output from the previous layer is element-wise added to the output of the current layer, forming a residual learning framework that guarantees performance stability when increasing model depth.

2.2. Dilated Convolution with Weight Normalization

The initial TimesNet architecture adopted Inception-v1 modules for 2D convolution operations when transforming temporal sequences into two-dimensional representations. However, the inherent computational overhead of Inception’s multi-branch parallel convolutions substantially hinders training efficiency, particularly when processing high-resolution 2D temporal maps [51]. To address this, this study has implemented dilated convolutions with exponentially increasing dilation rates, which systematically expand the effective receptive field while maintaining computational efficiency.

$$D_{\mathrm{kernel}} = \mathrm{dilation} \times (\mathrm{kernel\_size} - 1) + 1 \tag{10}$$

Here, $D_{\mathrm{kernel}}$ represents the size of the dilated convolution kernel, $\mathrm{dilation}$ denotes the dilation factor, and $\mathrm{kernel\_size}$ refers to the size of the original convolution kernel.
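A short worked example makes Equation (10) concrete. The kernel size of 3 and the dilation rates 1, 2, and 4 are illustrative assumptions, not values reported in the text, and `receptive_field` uses the standard result for stacked stride-1 convolutions.

```python
def dilated_kernel_size(kernel_size, dilation):
    """Eq. (10): span covered by one dilated kernel."""
    return dilation * (kernel_size - 1) + 1

def receptive_field(kernel_size, dilations):
    """Receptive field of stacked stride-1 dilated convolutions."""
    field = 1
    for d in dilations:
        # Each layer widens the field by its dilated span minus one.
        field += dilated_kernel_size(kernel_size, d) - 1
    return field

sizes = [dilated_kernel_size(3, d) for d in (1, 2, 4)]
stacked = receptive_field(3, (1, 2, 4))
plain = receptive_field(3, (1, 1, 1))
print(sizes, stacked, plain)
```

Three layers with exponentially increasing dilation cover 15 positions, against 7 for three undilated layers with the same number of weights. This is the sense in which dilation expands the receptive field without extra parameters.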

Weight normalization (WN) is integrated into the convolutional network to reparameterize each weight vector as a unit-norm direction multiplied by a learnable magnitude. This offers three key benefits: (1) faster convergence through stable gradient updates, (2) better generalization via implicit regularization, and (3) improved training stability by mitigating exploding gradients. Because the magnitude of each weight vector is controlled by a single scalar, WN keeps weight scales well conditioned across layers, which aids optimization in deep networks.

Representing the weight $w$ using the parameter vector $v$ and the scalar $g$, the new parameter formulation is given by

$$w = g \frac{v}{\|v\|}, \tag{11}$$

where $v$ is the unnormalized weight vector, $g$ is a learnable scaling parameter, and $\|v\|$ represents the Euclidean norm of $v$.
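Equation (11) can be checked numerically with a small sketch. The function name `weight_norm` and the example values are assumptions for illustration; in a deep learning framework, $v$ and $g$ would be trainable parameters rather than fixed arrays.

```python
import numpy as np

def weight_norm(v, g):
    """Eq. (11): w = g * v / ||v||, with ||.|| the Euclidean norm."""
    return g * v / np.linalg.norm(v)

v = np.array([3.0, 4.0])        # unnormalized direction, ||v|| = 5
g = 2.0                         # learnable magnitude
w = weight_norm(v, g)
w_rescaled = weight_norm(10.0 * v, g)   # rescaling v leaves w unchanged
print(w, np.linalg.norm(w))
```

The norm of $w$ always equals $g$, and rescaling $v$ has no effect on $w$. The magnitude and the direction of the weight are therefore learned independently.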

2.3. Downsampling Performance Evaluation

The bandwidth limitation in downhole data transmission significantly affects the accuracy of torsional vibration prediction. To address this issue, an optimized data compression method is proposed, aiming to achieve two critical objectives: (1) substantially reducing the volume of transmitted data and (2) effectively retaining key dynamic features essential for prediction reliability. The performance of the proposed method is evaluated using the following quantitative metrics: the Kullback–Leibler divergence ($D_{KL}(P \| Q)$) to measure the distributional differences in transmitted data before and after compression [52], and the relative mean variation ($R_{\mu}$) and relative variance variation ($R_{\sigma^{2}}$) to quantify the preservation of transmission data distribution characteristics [53,54].

$$D_{KL}(P \| Q) = \sum_{x} P(x) \log \frac{P(x)}{Q(x)} \tag{12}$$

$$R_{\mu} = \frac{\mu_{n} - \mu_{1}}{\mu_{1}} \times 100\% \tag{13}$$

$$R_{\sigma^{2}} = \frac{\sigma_{n}^{2} - \sigma_{1}^{2}}{\sigma_{1}^{2}} \times 100\% \tag{14}$$

Here, the subscript $n$ refers to the data processed with an $n$-second window, and the subscript 1 refers to the 1 s transmitted data, which serve as the baseline.
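The three checks in Equations (12)–(14) can be sketched as below. Several choices here are assumptions, since the text does not specify them: the distributions are estimated with shared-bin histograms, a small constant `eps` avoids division by zero in empty bins, the baseline is taken as $P$, and the $n$-second processing is stood in for by a non-overlapping block average over synthetic data.

```python
import numpy as np

def kl_divergence(p_samples, q_samples, bins=30, eps=1e-12):
    """Eq. (12) on histogram estimates of P and Q over shared bins."""
    lo = min(np.min(p_samples), np.min(q_samples))
    hi = max(np.max(p_samples), np.max(q_samples))
    p, _ = np.histogram(p_samples, bins=bins, range=(lo, hi))
    q, _ = np.histogram(q_samples, bins=bins, range=(lo, hi))
    p = (p + eps) / (p + eps).sum()
    q = (q + eps) / (q + eps).sum()
    return float(np.sum(p * np.log(p / q)))

def relative_change_percent(new, ref):
    """Eqs. (13)-(14): relative change of a statistic, in percent."""
    return (new - ref) / ref * 100.0

def block_average(x, n):
    """Average non-overlapping blocks of n samples (assumed stand-in)."""
    m = len(x) // n
    return np.asarray(x[: m * n]).reshape(m, n).mean(axis=1)

rng = np.random.default_rng(0)
baseline = 5.0 + rng.normal(0.0, 1.0, size=3000)   # synthetic 1 s data
smoothed = block_average(baseline, n=3)

d_kl = kl_divergence(baseline, smoothed)
r_mu = relative_change_percent(smoothed.mean(), baseline.mean())
r_var = relative_change_percent(smoothed.var(), baseline.var())
print(d_kl, r_mu, r_var)
```

In this synthetic case the mean is preserved while the variance drops, which shows why the three measures are used together. A compression step can leave the mean untouched and still distort the distribution.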

3. Data Description and Analysis

3.1. Data Source

The data used in this study were obtained from field measurements during actual drilling operations in the Fuman Oilfield, specifically from the third spud section between 5449 m and 5635 m, which yielded a total drilled footage of 186 m with 23 h of pure drilling time. Detailed statistical characteristics of the dataset are summarized in Table 1. The experimental setup incorporated a self-monitoring system that combined a torsional impact tool with a near-bit measurement tool assembly, with the detailed configuration of the BHA presented in Figure 4. The near-bit measurement tool was equipped with multiple sensors including a triaxial accelerometer, gyroscope, and temperature sensor, enabling continuous recording of downhole time-series data such as triaxial vibration (±40 g, where g represents gravitational acceleration), rotational speed (±333 rpm), weight on bit (±300 kN), torque (±30 kN·m), and temperature (150 °C). Given the measurement tool’s immediate proximity to the drill bit, the acquired data effectively approximate the bit’s operational status, thereby supporting subsequent downhole data analysis for drilling condition identification. Notably, the triaxial accelerometer was eccentrically mounted, and based on its specific installation configuration, the accelerometer’s measurement output can be expressed as follows:

$$X = a_{x} + r \dot{\omega} \tag{15}$$

$$Y = a_{y} + r \omega^{2} \tag{16}$$

$$Z = a_{z} \tag{17}$$

where $X$, $Y$, and $Z$ denote the tangential, normal, and axial accelerations, respectively; $a_{x}$ and $a_{y}$ represent the lateral acceleration components at the measurement tool’s center; $a_{z}$ indicates the axial acceleration at the measurement tool’s center; $r$ stands for the eccentricity distance; $\omega$ signifies the angular velocity; and $\dot{\omega}$ is its time derivative, the angular acceleration.
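A brief numerical sketch of Equations (15) and (16) shows how rotation enters the eccentric accelerometer readings. The eccentricity of 0.03 m and the rotational speed of 120 rpm are hypothetical values chosen for illustration, and the lateral components at the tool center are set to zero.

```python
import math

def tangential_acceleration(a_x, r, omega_dot):
    """Eq. (15): X = a_x + r * d(omega)/dt."""
    return a_x + r * omega_dot

def normal_acceleration(a_y, r, omega):
    """Eq. (16): Y = a_y + r * omega^2."""
    return a_y + r * omega ** 2

def rpm_to_rad_s(rpm):
    return rpm * 2.0 * math.pi / 60.0

r = 0.03                          # hypothetical eccentricity distance (m)
omega = rpm_to_rad_s(120.0)       # steady rotation, so omega_dot = 0
X = tangential_acceleration(0.0, r, 0.0)
Y = normal_acceleration(0.0, r, omega)
omega_recovered = math.sqrt(Y / r)   # invert Eq. (16) when a_y = 0
print(X, Y, omega_recovered)
```

Under steady rotation the tangential channel reads zero and the normal channel reads the centripetal term alone. Angular acceleration appears in $X$, while changes in rotational speed appear in $Y$.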

3.2. Data Labeling

The drilling process induces torsional vibrations due to bit–rock and drill string–wellbore interactions. These vibrations are classified into two distinct categories: low-frequency stick–slip (typically <1 Hz) and HFTOs [2,55,56,57].

3.2.1. Stick–Slip

During stick–slip, downhole drill strings experience significant torque fluctuations, which adversely affect both drilling efficiency and operational safety. This vibrational phenomenon consists of two distinct phases: the sticking phase and the slipping phase. Frictional interactions at the drill string–wellbore interface and bit–rock contact have been identified as the primary excitation mechanisms for stick–slip [1,2,3,4,5,6]. To characterize stick–slip patterns and develop effective mitigation strategies, triaxial acceleration measurements near the drill bit are essential. As clearly illustrated in Figure 5, the mean normal acceleration consistently displays markedly higher amplitudes compared to tangential and axial components during stick–slip occurrences. This distinctive behavior results from the abrupt liberation of accumulated radial elastic energy during slip events, coupled with the restrictive influence exerted by the wellbore on drill string lateral movement, thus establishing normal acceleration as a robust and quantifiable diagnostic parameter for stick–slip identification and severity assessment.

3.2.2. High-Frequency Torsional Oscillations

HFTOs are a self-excited torsional vibration mode of the drill string, analogous to stick–slip phenomena but distinguishable through modal characteristics and frequency-domain features [2,55,56,57]. During HFTOs, tangential acceleration exhibits extreme fluctuations ranging from −40 g to +40 g, markedly exceeding the peak amplitudes of axial and normal accelerations. This disparity arises from the pronounced coupling effect between HFTOs and tangential vibration modes. FFT spectral analysis of the X-axis acceleration data shows that vibrational energy is concentrated in the high-frequency band, with a dominant spectral peak at approximately 177 Hz (177.4 Hz). This peak is caused by the local modal excitation of the specific BHA in this well and is not a universal feature, whereas the fundamental frequency band above 50 Hz conforms to the general definition of high-frequency torsional oscillations. This conclusion is based on storage-mode data, that is, downhole records with high sampling frequencies. Because of the bandwidth limitation of real-time transmission, the downsampled real-time data cannot resolve vibration information at frequencies above 50 Hz, so the 177.4 Hz peak can only be detected in storage-mode records.
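The spectral analysis described above can be illustrated on a synthetic signal. The function name `dominant_frequency`, the signal amplitudes, and the 10 s record length are assumptions; the 400 Hz sampling frequency is the value stated for the measurement tool, and the 177 Hz tone stands in for the observed HFTO peak.

```python
import numpy as np

def dominant_frequency(signal, fs_hz):
    """Return the frequency (Hz) of the largest non-DC spectral peak."""
    x = np.asarray(signal, dtype=float)
    x = x - x.mean()                        # remove the DC offset
    amp = np.abs(np.fft.rfft(x))
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs_hz)
    return float(freqs[np.argmax(amp)])

fs = 400                                    # storage-mode sampling frequency (Hz)
n = 10 * fs                                 # 10 s record, 0.1 Hz resolution
t = np.arange(n) / fs
accel = 20.0 * np.sin(2 * np.pi * 177.0 * t) + 2.0 * np.sin(2 * np.pi * 0.5 * t)

f_dom = dominant_frequency(accel, fs)
nyquist = fs / 2                            # highest resolvable frequency
print(f_dom, nyquist)
```

At 400 Hz the Nyquist limit is 200 Hz, so a peak near 177 Hz is resolvable in the stored records. Any stream whose effective sampling rate puts the Nyquist limit at or below 50 Hz cannot represent it.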

In Figure 6, the mean triaxial acceleration values lack significant periodicity, while a pronounced divergence exists between tangential and normal acceleration means. This phenomenon strongly indicates sustained downhole torsional vibration. Furthermore, tangential acceleration exhibits the highest RMS values among all axes, revealing both intensified fluctuation amplitudes and elevated vibrational energy density. Its dominant variability establishes tangential acceleration as the primary diagnostic parameter for HFTO detection.

4. Model Training

This study utilizes near-bit measurements including WOB, torque, and triaxial acceleration to construct two preprocessed datasets: Case One applies n-second moving average filtering (n ∈ {1,2,3,4,5}) to smooth transient fluctuations, while Case Two employs RMS processing within identical time windows to preserve vibration energy characteristics. The datasets contain labeled samples of both normal vibrations and torsional oscillations, with n = 1 (representing original downhole data transmission resolution) serving as the baseline for comparative validation. For time-series data, both datasets are partitioned into training, validation, and test sets in a 7:1:2 ratio, strictly following the chronological order.
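A chronological 7:1:2 partition can be sketched as below. The function name `chronological_split` and the integer rounding rule are assumptions; the essential point, taken from the text, is that the series is cut in time order without shuffling.

```python
def chronological_split(series, ratios=(7, 1, 2)):
    """Split a time-ordered sequence into train/validation/test without shuffling."""
    n = len(series)
    total = sum(ratios)
    train_end = n * ratios[0] // total
    val_end = n * (ratios[0] + ratios[1]) // total
    return series[:train_end], series[train_end:val_end], series[val_end:]

samples = list(range(100))            # stand-in for 100 time-ordered samples
train, val, test = chronological_split(samples)
print(len(train), len(val), len(test))
```

Keeping the test set strictly later than the training set prevents future information from leaking into training, which matters for a forecasting task.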

For the multi-step prediction task, five comparative experimental models were established: (1) DW-TimesNet; (2) TimesNet-V1, referring to the original TimesNet architecture; (3) TimesNet-V2, which uses the InceptionV2 structure to extract two-dimensional temporal features; (4) Informer; and (5) LSTM. To reduce random initialization bias, each experiment was repeated 10 times and the results were averaged. The predictive performance was evaluated using standard regression metrics: the coefficient of determination (R²), MAE, and root mean square error (RMSE). In this context, $y_{i}$ denotes the observed value, $\hat{y}_{i}$ denotes the predicted value, $\bar{y}$ denotes the mean of the observed values, and $n$ is the number of samples.

$$R^{2} = 1 - \frac{\sum_{i=1}^{n} (y_{i} - \hat{y}_{i})^{2}}{\sum_{i=1}^{n} (y_{i} - \bar{y})^{2}} \tag{18}$$

$$\mathrm{MAE} = \frac{1}{n} \sum_{i=1}^{n} |y_{i} - \hat{y}_{i}| \tag{19}$$

$$\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (y_{i} - \hat{y}_{i})^{2}} \tag{20}$$
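The three metrics can be computed with a few lines of NumPy, using their standard definitions. The function names and the four-point example are assumptions for illustration.

```python
import numpy as np

def r2_score(y, y_hat):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y, y_hat = np.asarray(y, dtype=float), np.asarray(y_hat, dtype=float)
    ss_res = np.sum((y - y_hat) ** 2)
    ss_tot = np.sum((y - y.mean()) ** 2)
    return 1.0 - ss_res / ss_tot

def mae(y, y_hat):
    """Mean absolute error."""
    return float(np.mean(np.abs(np.asarray(y) - np.asarray(y_hat))))

def rmse(y, y_hat):
    """Root mean square error."""
    return float(np.sqrt(np.mean((np.asarray(y) - np.asarray(y_hat)) ** 2)))

y = [1.0, 2.0, 3.0, 4.0]          # toy observations
y_hat = [1.5, 2.0, 2.5, 4.5]      # toy predictions
print(r2_score(y, y_hat), mae(y, y_hat), rmse(y, y_hat))
```

MAE must use absolute errors: without them, positive and negative errors cancel and a biased model could score close to zero.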

This study adopts a fully unsupervised learning paradigm, where the original time-series data contain no labels generated manually or by algorithms. The model hyperparameters are set as follows: the learning rate is 0.01, the batch size is 32, patience for early stopping is 3, the dropout rate is 0.1, and the number of training epochs is 10; other hyperparameters are detailed in Table 2 below.

The models were employed to predict drilling data from both datasets. All experiments were conducted using the PyTorch 2.2.2+cu121 framework on a hardware platform equipped with an AMD Ryzen 7 7735H CPU, 60 GB RAM, and an NVIDIA RTX 4050D GPU.

5. Experimental Results and Analysis

To achieve comprehensive wellbore data utilization and real-time early warning, the selected model must possess low computational complexity while maintaining stable performance in long-sequence prediction. These characteristics are essential to adapt to field conditions, mitigate transmission error impacts, and ultimately enhance both the accuracy and timeliness of early warnings.

5.1. Case One

As shown in Table 3, when n = 3, KL divergence in the normal acceleration dimension is significantly lower than that for n = 4 and n = 5 relative to the transmitted data. This indicates limited variation in the normal dimension—lower volatility and distribution closer to the transmitted data. Additionally, mean and variance changes across n values show negligible deviation from the transmitted data’s mean, with no significant fluctuations in variance. Figure 7 confirms this: comparing n = 3 with the transmitted data reveals virtually invariant non-stationary distribution across all dimensions, validating robust data integrity preservation.

This experiment evaluates multi-step prediction performance across varying time horizons (10–70 steps). All predictions were iteratively generated based on a fixed 10-step historical observation window. The selected prediction horizons were empirically determined through preliminary experimental analysis. Given the 3 s sampling interval of the dataset, these prediction steps correspond to drilling operation forecasts spanning 30 to 210 s in increments of 30 s.
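The relation between prediction steps and forecast durations, and the construction of history/target pairs, can be sketched as follows. The function name `make_windows` and the toy series are assumptions, and the 10-step increment is inferred from the stated 30 s increments at a 3 s interval. The sketch only builds the windows and does not reproduce the iterative prediction procedure.

```python
import numpy as np

def make_windows(series, history, horizon):
    """Slide over a series and return (history, target) pairs."""
    series = np.asarray(series)
    n = len(series) - history - horizon + 1
    X = np.stack([series[i : i + history] for i in range(n)])
    Y = np.stack([series[i + history : i + history + horizon] for i in range(n)])
    return X, Y

sampling_interval_s = 3
horizons_steps = list(range(10, 71, 10))                 # 10, 20, ..., 70 steps
horizons_s = [h * sampling_interval_s for h in horizons_steps]

series = np.arange(100.0)                                # toy series
X, Y = make_windows(series, history=10, horizon=70)
print(horizons_s, X.shape, Y.shape)
```

A 10-step history at a 3 s interval corresponds to 30 s of past data, and the longest 70-step horizon corresponds to a 210 s forecast.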

Compared to the original dataset, the model’s runtime decreased from 108.4 s to 47.8 s after downsampling, representing a 55.9% reduction. This demonstrates a significant improvement in computational efficiency.

The proposed model demonstrates superior performance compared to other deep learning architectures, and its long-term forecasting capability is most evident at the 210 s prediction step. Compared with LSTM, TimesNet-V1 achieves an average reduction of 78.8% in RMSE, which confirms its accuracy in trend capture and reliability in multi-step forecasting, and a 77.2% reduction in MAE, indicating smaller prediction deviations and higher overall precision. Detailed performance metrics are presented in Figure 8.

Figure 9 compares the triaxial vibration prediction performance of different models at the 30 s and 210 s prediction horizons. All models demonstrate accuracy in short-term predictions at 30 s, with the predicted curves closely matching the actual values. However, at the 210 s prediction step, only TimesNet maintains prediction accuracy, while other models show significant performance degradation.

The experimental results demonstrate that while DW-TimesNet shows marginal accuracy gains over TimesNet-V1/V2, its optimized architecture—featuring weight-normalized two-layer dilated convolutions replacing Inception-V1—achieves substantial computational improvements. In continuous 34,353 s predictions, DW-TimesNet reduces runtime by 56.1% at 210 s horizons compared to TimesNet-V1. The simplified structure decreases floating-point operations (FLOPs) by two orders of magnitude (30 s input/210 s prediction) while maintaining prediction accuracy during incremental training; the specific data are shown in Table 4.

Figure 10 compares the 30 s and 210 s predictions based on 30 s of historical data; the TimesNet model does not exhibit a sharp increase in error as the prediction step increases. The model was tested using data from the same well that were not included in the dataset. Verified against these actual drilling data, the model accurately captures the periodic characteristics of HFTOs and achieves precise prediction of peak values. For the measured HFTO periodicity of approximately 130 s in this well, the model can effectively predict the subsequent 210 s vibration trend using 30 s of historical data. This design achieves two goals:

(1): Complete cycle coverage: The prediction duration covers the full HFTO cycle of ≥130 s, ensuring accurate assessment of the evolution trend of oscillation energy;
(2): Key decision redundancy: An additional 20–50 s of buffer time is provided, compatible with downhole command transmission delays (5–10 s) and operator response time (10–30 s).

5.2. Case Two

Table 5 shows that, for all values of n, the relative mean changes are close to zero and the relative variance changes remain consistent with the transmitted data, indicating high stability in both measures. However, already at n = 2 the KL divergence reveals significant distributional differences from the transmitted data in the tangential (2.0525) and normal (1.2856) dimensions. The comparison between n = 2 and the transmitted data (n = 1) in Figure 11 likewise indicates compromised data integrity along the tangential and normal axes.
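The three downsampling statistics discussed above can be sketched as follows. This is a hedged illustration rather than the paper's exact procedure: it assumes that downsampling by n keeps every n-th sample, that the KL divergence is estimated from smoothed histograms in the direction KL(original ‖ downsampled), and that the relative changes are expressed as fractions of the original value. The function names and the synthetic signal are hypothetical.

```python
import math
import random
from statistics import fmean, pvariance

def histogram(values, lo, hi, bins):
    """Normalized histogram with add-one smoothing so that no bin is empty."""
    counts = [1.0] * bins
    width = (hi - lo) / bins
    for v in values:
        idx = min(int((v - lo) / width), bins - 1)  # clamp v == hi into last bin
        counts[idx] += 1.0
    total = sum(counts)
    return [c / total for c in counts]

def kl_divergence(p, q):
    """KL(P || Q) = sum_i p_i * log(p_i / q_i) for discrete distributions."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))

def downsampling_report(original, n, bins=20):
    """Compare a series with its n-fold downsampled version."""
    reduced = original[::n]  # assumption: keep every n-th sample
    lo, hi = min(original), max(original)
    p = histogram(original, lo, hi, bins)
    q = histogram(reduced, lo, hi, bins)
    mean_o, var_o = fmean(original), pvariance(original)
    return {
        "kl": kl_divergence(p, q),
        "rel_mean": (fmean(reduced) - mean_o) / abs(mean_o),
        "rel_var": (pvariance(reduced) - var_o) / var_o,
    }

# Synthetic stand-in for one transmitted channel (not field data).
random.seed(0)
signal = [5.0 + random.gauss(0.0, 1.0) for _ in range(6000)]

for n in (1, 2, 3, 4, 5):
    r = downsampling_report(signal, n)
    print(f"n={n}: KL={r['kl']:.4f}, rel_mean={r['rel_mean']:+.4f}, rel_var={r['rel_var']:+.4f}")
```

On a stationary synthetic signal like this one, all three statistics stay small for every n. The point made in the text is that field data can keep a near-zero relative mean change while the KL divergence grows, which is why the distribution-level check is needed in addition to the moment-level ones.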

The experiment conducts multi-step predictions (30–150 steps at 30-step increments) using a 30-step historical window. Given a 1 s sampling interval, these steps translate to 30–150 s drilling operation forecasts.
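The windowing scheme described above (a 30-step history predicting 30 to 150 steps ahead) can be sketched as below. The helper name `make_windows`, the stride of one step, and the toy series are assumptions for illustration; the source does not state how the training pairs were actually sliced.

```python
def make_windows(series, history, horizon, stride=1):
    """Return (past, future) pairs: `history` steps in, next `horizon` steps out."""
    pairs = []
    last_start = len(series) - history - horizon
    for start in range(0, last_start + 1, stride):
        past = series[start:start + history]
        future = series[start + history:start + history + horizon]
        pairs.append((past, future))
    return pairs

SAMPLING_INTERVAL_S = 1        # from the text: one step corresponds to 1 s
HISTORY_STEPS = 30
HORIZONS = range(30, 151, 30)  # 30, 60, 90, 120, 150 steps

series = list(range(600))      # toy stand-in for one vibration channel
counts = {}
for horizon in HORIZONS:
    pairs = make_windows(series, HISTORY_STEPS, horizon)
    counts[horizon] = len(pairs)
    print(f"horizon {horizon * SAMPLING_INTERVAL_S:>3} s: {len(pairs)} windows")
```

Longer horizons leave fewer complete windows for a series of fixed length, which is worth keeping in mind when comparing error metrics across horizons.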

Comparative analysis shows that although TimesNet has slightly lower R² values, it outperforms other models across different forecasting horizons. Specifically, compared with LSTM, TimesNet-V1 achieves an average reduction of 11.19% in RMSE and a 7.91% decrease in MAE. As shown in Figure 12, the model’s predicted curves are highly consistent with the actual measurements, which further validates its superiority.

While all models demonstrate excellent short-term forecasting capabilities, only TimesNet maintains high accuracy in long-term predictions. DW-TimesNet, as an optimized variant, achieves a 48.17% reduction in runtime compared to TimesNet-V1 at the 150 s prediction step; detailed indicators are provided in Table 6.
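The runtime reductions quoted for the two cases can be re-derived from the rounded values in Table 4 and Table 6. The short sketch below does that arithmetic; the dictionary layout and function name are illustrative only, and because the table entries are rounded, the recomputed percentages may differ from the quoted ones in the last digit.

```python
def percent_reduction(baseline, improved):
    """Reduction of `improved` relative to `baseline`, in percent."""
    return 100.0 * (baseline - improved) / baseline

# Values copied from Table 4 (RMS dataset) and Table 6 (mean dataset).
runtime_s = {
    "RMS": {"DW-TimesNet": 47.8, "TimesNet-V1": 108.9},
    "mean": {"DW-TimesNet": 102.4, "TimesNet-V1": 197.6},
}
flops_m = {
    "RMS": {"DW-TimesNet": 472.4, "TimesNet-V1": 14999.6},
    "mean": {"DW-TimesNet": 1072.5, "TimesNet-V1": 33753.1},
}

summary = {}
for dataset in runtime_s:
    saved = percent_reduction(runtime_s[dataset]["TimesNet-V1"],
                              runtime_s[dataset]["DW-TimesNet"])
    ratio = flops_m[dataset]["TimesNet-V1"] / flops_m[dataset]["DW-TimesNet"]
    summary[dataset] = (saved, ratio)
    print(f"{dataset}: runtime -{saved:.1f}%, FLOPs ratio {ratio:.1f}x")
```

With the tabulated values, the FLOPs ratio between TimesNet-V1 and DW-TimesNet comes out at a little over 30 for both datasets.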

Figure 13 shows enlarged 1800 s segments of the DW-TimesNet predictions at the 30 s and 150 s horizons. Although the overall trend is consistent with the actual signal, there are large local deviations, which explains the low R²: R² reflects point-wise fitting accuracy as well as global trend alignment. In torque vibration early warning applications, where the main goal is to detect macroscopic trend anomalies, the practical impact of these local errors is minor.
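The effect described above, a low R² despite a well-tracked trend, can be reproduced on synthetic data. In the sketch below the "actual" signal is a slow trend plus a fast local fluctuation, and the "prediction" is the trend alone; the signal shapes, amplitudes, and the 7-sample smoothing window are arbitrary assumptions, not values from the paper.

```python
import math

def mae(actual, predicted):
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def rmse(actual, predicted):
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))

def r2(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_a = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_a) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot

def moving_average(values, width):
    """Centered moving average; returns len(values) - width + 1 points."""
    return [sum(values[i:i + width]) / width for i in range(len(values) - width + 1)]

N = 1800  # same length as the enlarged segment, in 1 s steps
trend = [math.sin(2 * math.pi * t / 600) for t in range(N)]      # slow component
local = [1.5 * math.sin(2 * math.pi * t / 7) for t in range(N)]  # fast component
actual = [a + b for a, b in zip(trend, local)]
predicted = trend  # a forecast that follows the trend but misses local detail

pointwise_r2 = r2(actual, predicted)

half = 3  # smoothing window of 2 * half + 1 = 7 samples
smoothed = moving_average(actual, 2 * half + 1)
trend_r2 = r2(smoothed, predicted[half:N - half])

print(f"MAE={mae(actual, predicted):.3f}, RMSE={rmse(actual, predicted):.3f}")
print(f"point-wise R2={pointwise_r2:.3f}, R2 after smoothing={trend_r2:.3f}")
```

The point-wise R² is low because the unexplained fast component dominates the variance, while the R² computed against the smoothed signal is close to 1. Whether a comparable smoothing step is appropriate for a given warning system is a design choice that the source does not address.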

As shown in Figure 14, the predicted results are consistent with the characteristics of stick–slip and have been verified against actual drilling data. The predictions not only capture typical stick–slip behavior but also reveal an interesting phenomenon: regular periodic fluctuations in normal acceleration may serve as an early precursor to stick–slip. This provides a new perspective for the early warning of stick–slip events. In practical drilling operations, the timely and accurate detection of these signs is crucial. Once these early warning indicators are identified, mitigation strategies such as increasing torque and adjusting rotational speed can be implemented promptly.

Based on an analysis of the stick–slip cycle characteristics, using 30 s of historical data to predict vibrations for the subsequent 150 s was found to be the most suitable configuration. This conclusion is derived from testing the trained model on non-dataset data from the same well, with the results illustrated in Figure 14. This interval covers the entire stick–slip cycle while allowing for the inevitable delays in downhole data transmission and giving operators sufficient response time.

5.3. Adjacent Well Test

To verify the generalization performance of the early warning model, this study selected Manshen Well in Shaya County, Aksu Prefecture, Xinjiang Uygur Autonomous Region, China, for testing. This well is approximately 80 km in straight-line distance from Yueman Well, which makes it a meaningful test of the model's adaptability to a different location. The tested well section is 3501–3863 m, with a total working time of 83 h, and its BHA is consistent with that of Yueman Well, which ensures the comparability of the test results to a certain extent.

In the specific testing process, this study adopted the 210 s root mean square model and 150 s mean model for data processing and analysis of Manshen Well. These two models, with different time window settings, can comprehensively capture the characteristics of the well data from multiple perspectives, thereby more accurately evaluating the generalization performance of the early warning model. The statistical data of Manshen Well processed by the two models are shown in Table 7 below.
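As a rough illustration of why mean-based and RMS-based series give complementary views of the same signal, the sketch below computes both statistics over non-overlapping windows. It assumes that the two datasets are windowed statistics of the raw acceleration samples; the window length, the function name `window_features`, and the toy samples are hypothetical and are not taken from the paper.

```python
import math

def window_features(samples, window):
    """Per-window mean and root mean square over non-overlapping windows."""
    means, rms_values = [], []
    for start in range(0, len(samples) - window + 1, window):
        chunk = samples[start:start + window]
        means.append(sum(chunk) / window)
        rms_values.append(math.sqrt(sum(x * x for x in chunk) / window))
    return means, rms_values

# Toy signal: a zero-mean oscillation followed by a constant offset.
samples = [1.0, -1.0, 1.0, -1.0, 2.0, 2.0, 2.0, 2.0]
means, rms_values = window_features(samples, window=4)
print("mean:", means)
print("RMS: ", rms_values)
```

In the first window the oscillation cancels in the mean but not in the RMS, whereas in the second window both statistics agree. Under the stated assumption, this is one way to see how an RMS series retains oscillation energy while a mean series tracks slower shifts in level.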

Figure 15 and Figure 16 show comparisons of the prediction results based on the mean dataset and RMS dataset, respectively. In terms of curve trends, the model’s prediction results are generally consistent with the measured data. However, since the test well and the training well belong to different geological blocks and there are differences in drilling depth, the prediction accuracy at some peak positions has decreased. It is expected that better fitting results can be obtained if adjacent well data from the same block and with similar depths are used for testing.

From the vibration data in Figure 15, the tangential vibration shows an obvious peak at around 150,000 s, while the axial vibration has a large fluctuation range with significant negative values. These characteristics are consistent with the periodic, large-amplitude fluctuation features caused by the “stick–slip” alternation in torsional vibrations. In particular, the severe fluctuations in axial vibrations conform to the force mutation characteristics during energy storage and release of the drill string in stick–slip, so it can be judged that stick–slip vibrations exist. Combined with the further verification results of downhole stored data, it can be clearly determined that stick–slip is present.

From the vibration data in Figure 16, it can be seen that the vibration signals do not exhibit the typical time-domain characteristics of HFTOs. Combined with the further verification results of downhole stored data, it can be clearly determined that HFTOs are absent.

5.4. Limitations and Future Work

While the proposed DS-DW-TimesNet demonstrates superior performance, several limitations should be acknowledged:

Geographical Generalization: All experiments were conducted with data from the Fuman Oilfield. The model’s performance in other geological formations requires further validation.
Noise Robustness: Downhole sensor noise may degrade prediction accuracy, which was not explicitly tested.
Data Gaps: The model assumes continuous data streams. Its resilience to missing data scenarios needs evaluation.
Physical Interpretability: Unlike physics-based models, the black-box nature of deep learning may limit operational trust.

Future work will center on adjacent well testing under the same block, depth, and formation conditions, while focusing on the following directions: (a) conducting multi-basin validation; (b) developing hybrid modeling combined with adaptive filters; and (c) researching real-time noise suppression technologies.

6. Conclusions

This study addresses the real-time and accuracy requirements for early warning of downhole torsional vibrations by proposing the DS-DW-TimesNet model based on an improved TimesNet framework. By integrating downsampling, dilated convolution, and learnable weight normalization, the model substantially improves feature extraction efficiency and reduces computational cost, effectively overcoming the prediction lag caused by data transmission delays in traditional methods. The key innovations and conclusions are as follows:

(1): DS-DW-TimesNet demonstrates stable performance in both short- and long-term predictions, with an MAE as low as 0.41 for 210 s predictions, representing improvements of 77.2% and 21.8% over LSTM and Informer, respectively.
(2): Through structural optimization, the model’s runtime is reduced by 78.5% to 48 s, and the computational load (FLOPs) is lowered by more than an order of magnitude relative to TimesNet-V1 (Tables 4 and 6), significantly improving its suitability for real-time downhole warning scenarios.
(3): Using field data from the Fuman Oilfield, the model successfully achieves early warnings for high-frequency torsional oscillations and stick–slip events 150–210 s in advance, providing critical decision-making windows for drilling safety.

This study offers an efficient solution for intelligent early warning in complex downhole conditions, combining theoretical innovation with practical engineering value.

Author Contributions

Conceptualization, T.Z. and H.L.; methodology, H.L.; software, Z.M.; validation, T.Z., H.L. and M.W.; formal analysis, M.W.; investigation, H.L.; resources, T.Z.; data curation, T.Z.; writing—original draft preparation, H.L.; writing—review and editing, Z.Y.; visualization, H.L.; supervision, J.L.; project administration, T.Z. and J.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China-Major Scientific Research Instrument Project, grant number 52227804, and the General Program of National Natural Science Foundation of China, grant number 52274003.

Data Availability Statement

The datasets presented in this article are not readily available due to laboratory confidentiality.

Conflicts of Interest

Author Zongling Yuan was employed by the Bohai Drilling Engineering Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figure 1. Flowchart of DS-DW-TimesNet early warning model.

Figure 2. Architecture of TimesNet.

Figure 3. Residual structure.

Figure 4. Field application diagram of near-bit measuring tool (MPU: Microprocessor Unit; PDC: Polycrystalline Diamond Compact).

Figure 5. Vibration characteristics during stick–slip: (a) triaxial time-domain, (b) mean triaxial, and (c) RMS triaxial.

Figure 6. Vibration characteristics during HFTOs: (a) triaxial time-domain; (b) tangential time–frequency; (c) RMS triaxial; (d) mean triaxial.

Figure 7. Drilling parameters in RMS time series (comparison between n = 1 and n = 3): (a) WOB; (b) torque; (c) tangential vibration; (d) normal vibration; (e) axial vibration.

Figure 8. Performance metrics of different models for the RMS dataset with varying step sizes: (a) MAE; (b) R²; (c) RMSE.

Figure 9. Predicted RMS profile of downhole vibration test set at 30 s, (a) tangential; (b) normal; (c) axial, and at 210 s, (d) tangential; (e) normal; (f) axial.

Figure 10. Predicted RMS profile of downhole vibrations at 30 s, (a) tangential; (b) normal; (c) axial; (d) triaxial vibration, with the circled region indicating the characteristics of HFTOs; and at 210 s, (e) tangential; (f) normal; (g) axial; (h) triaxial vibration, with the circled region indicating the characteristics of HFTOs.

Figure 11. Drilling parameters of mean time series (comparison between n = 1 and n = 2): (a) WOB; (b) torque; (c) tangential vibration; (d) normal vibration; (e) axial vibration.

Figure 12. Performance metrics of different models for the mean dataset with varying step sizes: (a) MAE; (b) R²; (c) RMSE.

Figure 13. Zoomed-in view of fitting results (0–1800 s). For 30 s: (a) tangential; (b) normal; (c) axial. For 150 s: (d) tangential; (e) normal; (f) axial.

Figure 14. Predicted mean profile of downhole vibrations. For 30 s: (a) tangential; (b) normal; (c) axial; (d) TimesNet triaxial vibration, with the circled region indicating the characteristics of stick-slip. For 150 s: (e) tangential; (f) normal; (g) axial; (h) TimesNet triaxial vibration, with the circled region indicating the characteristics of stick-slip.

Figure 15. Stick–slip early warning tests on adjacent wells: (a) tangential; (b) normal; (c) axial.

Figure 16. HFTO early warning tests on adjacent wells: (a) tangential; (b) normal; (c) axial.

Table 1. Fuman Data statistics.

	WOB	Torque	Tangential	Axial	Normal
minimum	22.5	4.5	−40.894	−40.894	−40.894
maximum	89.0	13.86	40.894	40.894	40.894
mean	58.6	11.2	0.0045	0.0043	0.0015
standard deviation	15.3	2.4	2.7733	0.5415	1.0386

Table 2. Hyperparameters of the models.

Model	d_model	d_ff	hidden_size	Layers	top_k	n_heads
TimesNet-V1	16	32	0.01	3	5	-
TimesNet-V2	16	32	0.01	3	5	-
DW-TimesNet	16	32	0.01	3	5	-
LSTM	64	128	128	3	-	-
Informer	64	128	0.01	3	-	8

Table 3. Statistics of KL divergence and relative changes for the RMS dataset: (a) KL divergence; (b) relative mean change; (c) relative variance change.

(a) KL divergence
n	WOB	Torque	Tangential	Axial	Normal
2	0.0132	0.0011	0.0005	0.0022	0.077
3	0.0191	0.0036	0.0023	0.0071	0.1267
4	0.0261	0.0074	0.0057	0.0327	0.3412
5	0.0421	0.0268	0.0072	0.0245	0.3694

(b) Relative mean change
n	WOB	Torque	Tangential	Axial	Normal
2	0	0.02	0.43	0.44	0.55
3	0	0.04	0.84	0.88	1.03
4	0	0.06	1.17	1.24	1.38
5	0	0.07	1.47	1.56	1.69

(c) Relative variance change
n	WOB	Torque	Tangential	Axial	Normal
2	−0.026	−0.107	−0.641	−1.282	−0.993
3	−0.042	−0.192	−1.262	−2.544	−1.848
4	−0.056	−0.275	−1.761	−3.576	−2.465
5	−0.068	−0.351	−2.209	−4.516	−3.02

Table 4. Computational metrics of TimesNet variants for the RMS dataset: (a) runtime; (b) FLOPs.

(a) Runtime
Model	DW-TimesNet	TimesNet-V1	TimesNet-V2
Running time (s)	47.8	108.9	100.4

(b) FLOPs
Model	DW-TimesNet	TimesNet-V1	TimesNet-V2
FLOPs (M)	472.4	14,999.6	1630.3

Table 5. Statistics of KL divergence and relative changes for the mean dataset: (a) KL divergence; (b) relative mean change; (c) relative variance change.

(a) KL divergence
n	WOB	Torque	Tangential	Axial	Normal
2	0.0141	0.3998	2.0525	0.0217	1.2856
3	0.0197	0.4085	0.0662	0.0055	1.7304
4	0.0267	0.3758	5.2086	4.1326	4.7199
5	0.0429	0.3533	0.1615	4.6325	6.101

(b) Relative mean change
n	WOB	Torque	Tangential	Axial	Normal
2	0	0	0	0	0
3	0	0	0	−0.06	0
4	0	0	0	0	0
5	0	0.01	0	0.05	0

(c) Relative variance change
n	WOB	Torque	Tangential	Axial	Normal
2	−0.026	−0.112	−24.725	−19.253	−16.41
3	−0.039	−0.205	−41.883	−35.436	−35.342
4	−0.055	−0.283	−54.087	48.451	−54.749
5	−0.063	−0.375	−60.433	−56.738	−69.557

Table 6. Computational metrics of TimesNet variants for the mean dataset: (a) runtime; (b) FLOPs.

(a) Runtime
Model	DW-TimesNet	TimesNet-V1	TimesNet-V2
Running time (s)	102.4	197.6	192.3

(b) FLOPs
Model	DW-TimesNet	TimesNet-V1	TimesNet-V2
FLOPs (M)	1072.5	33,753.1	3676.1

Table 7. Manshen Data statistics.

	WOB	Torque	Tangential	Axial	Normal
minimum	0	0	−40.894	−40.894	−40.894
maximum	193.40	19.28	40.894	40.894	40.894
mean	4.82	0.39	−0.0001	0.0043	0.0024
standard deviation	22.57	1.80	0.9459	0.5415	1.2141

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
