1. Introduction
A spread mooring system is crucial for the safe operation of a floating production storage and offloading (FPSO) platform without power drive [
1,
2]. The windlasses and winches are commonly used as chain retraction equipment in traditional mooring systems. Due to low automation and excessive bending stress, the chain is seriously worn. Therefore, the hydraulic synchronized CJ is widely used in the FPSO mooring system. Wudtke [
3] developed a linear vertical lifting anchor chain hoist to replace the traditional FPSO mooring system, reducing the investment cost. Grindheim [
4] took the GOLIAT project of ENI Group and the P58/P62 project of PETROBRAS as examples and introduced in detail the difference between the movable anchor system and the movable CJ. Wang et al. [
5] used the CJ to keep the relative static between the two yoke arms during the operation of the rotating soft yoke mooring system and the FPSO, which was convenient for docking installation. Biggerstaff et al. [
6] discussed the design, manufacture and installation of 1000 tons of CJs. The offshore installation of Lucius Spar export risers was completed by using the installed CJ. Sablok et al. [
7] proposed a high-integrity variable-position mooring system using electric and hydraulically actuated CJs to enable the chain to enter or exit to achieve the given tension or displacement. Wu et al. [
8] compared CJs with six other tensioning systems in the mooring system of a deep water platform and found that the CJ is still the most popular choice for semis. Johnson et al. [
9] designed a smaller semi-submersible FPSO, which uses the CJ to tighten 12 taut chain-polyester-chain mooring lines to complete the mooring of the system, eliminating the traditional mechanically complex and space-demanding winch system. The CJ-integrated push–pull hydraulic cylinder synchronously provides linear tension for mooring chains of different sizes and locates the required mooring chain segments. Compared with the windlasses and winches with the same load capacity, a CJ not only greatly reduces the space left by the deck for the mooring tensioning system but also effectively solves the problem of chain wear and excessive bending stress during the retracting process.
However, FPSO platforms operate for long periods in complex and harsh marine environments, and their mooring operations require extremely high reliability, controllability and safety from the CJs. For large and complex engineering hydraulic systems, traditional methods of predicting the exact location and magnitude of the failure within the failed part cannot be relied upon entirely. Recent advances in fault diagnosis have achieved remarkable performance through deep learning architectures. Research on fault diagnosis predominantly focuses on sensor data, employing multi-sensor fusion technology to enhance diagnostic accuracy and reliability by integrating multiple features such as vibration, temperature, pressure and flow [
10,
11,
12]. Wang et al. [
13] used 2D time modeling and attention mechanism fusion to decouple composite faults. Features are extracted from multi-sampling rate sensor data to achieve high-precision diagnosis of hydraulic system faults under noise conditions. Chen et al. [
14] proposed a two-stage fault diagnosis method using flow data to address the issue of significant data incompleteness in hydraulic systems. Zhu et al. [
15] fused vibration, pressure and acoustic sensor signals, mined the time-frequency characteristics of faults from the signals through continuous wavelet transform and used a CNN to establish an adaptive diagnosis model to identify typical faults of axial piston pumps. Xia et al. [
16] simultaneously considered spatiotemporal characteristics of raw data from different sensors, avoiding manual feature extraction or selection. Gong et al. [
17] directly integrated multiple sensor raw data to ensure the integrity of information for traditional diagnostic methods that rely on existing experience and manual feature extraction. However, this fusion occurs at the lowest level of information. To address the issue of diagnostic model inflexibility, Wu et al. [
18] suggested a fault diagnosis model using a 1D CNN to extracts features from the vibration signal, enabling rapid end-to-end composite fault diagnosis. Building on this, Xue et al. [
19] proposed a parallel multichannel structure based on a 1D CNN and 2D CNN for deep feature extraction. By using feature fusion, the method achieved more reliable fault diagnosis with fewer iterations and lower computational cost. Hao et al. [
20] introduced an end-to-end solution using an LSTM network, effectively diagnosing bearing faults by fusing deep spatiotemporal features from multiple sensors. Ma et al. [
21] analyzed the operating characteristics of the lifting hydraulic system of the Very Large Crude Carrier, extracted the fault sensitivity characteristics of the pressure sensor signal of the hydraulic system through WPT and classified the leakage fault based on the SVM multi-classification model. Liu et al. [
22] developed a fault diagnosis method based on multi-sensor fusion, extracting spatial features using a CNN and temporal features using LSTM to identify the severity of the fault. Yang et al. [
23] established a 1D-SENets model for the fault diagnosis of axial piston pumps with a small number of training samples available. The model diagnosis and verification are carried out by the fault vibration signal of the actual working condition, and the over-fitting problem in the diagnosis process is solved. Tao et al. [
24] proposed a real-time fault diagnosis method for hydraulic systems based on an MS-CNN. The model reduces the computational complexity and improves the performance of the diagnosis model by mining higher-level fault correlation features.
The sensor measurement data often contain errors, and the information obtained by different sensors can be imprecise, ambiguous, unclear or even contradictory, ultimately resulting in unreliable fusion outcomes. The concept of the attention mechanism was first proposed by cognitive neuroscientists who conducted research in this field [
25]. The model employs a weighting system that assigns varying degrees of importance to different input components, thereby facilitating the extraction of crucial information while simultaneously reducing the computational and storage overhead associated with the model [
26]. The attention mechanism is integrated into the DNN structure, enabling its learning alongside other network parameters [
27]. Wang et al. [
28] introduced an attention mechanism module to enhance fault-related feature learning, combining it with a 1D CNN to adaptively recalibrate features at each level, significantly improving discriminative feature representation. Shao et al. [
29] developed an autonomous ship navigation model using multiple sensors to obtain state information. They enhanced the modelling capability by extending the deformable convolution and introducing a self-designed attention mechanism. Wu et al. [
30] proposed an IES short-term load forecasting method based on an attention-based CNN combined with LSTM and BiLSTM models. Gao [
31] proposed a detection method that integrates a CNN and BiLSTM network, where the CNN captures local features, and BiLSTM extracts long-range dependencies. An attention mechanism is incorporated to enhance classification accuracy. Long et al. [
32] developed an improved AdaBoost-based motor fault diagnosis method utilizing an attention mechanism and multi-sensor data.
As can be seen from the above, although many scholars have conducted in-depth research on the fault diagnosis technology of the hydraulic system, the real-time fault diagnosis technology of the hydraulic system of the CJ has not been widely studied. The CJ equipment developed in this paper has a strong coupling effect and high nonlinearity. The hydraulic lifting cylinder is the main actuator, cooperating with the clamping and limiting hydraulic cylinder group. Due to the interference of actual complex sea conditions, the failure rate is significantly higher than that of land-based equipment, and the fault is difficult to detect. Especially under diverse operating conditions, the sensitivity to different degrees of fault characteristics is different, and the early fault of a low load is very difficult to be detected in time. In order to adaptively perform real-time fault diagnosis of the CJ under complex loads, this study proposes a deep neural network model based on a GAM, which enhances the ability of a model to capture the previous and subsequent information. The model ensures robustness across diverse load conditions while identifying key sensor features to optimize sensor layout. The innovations of this study are as follows:
This study investigated typical fault mechanisms in CJ operations under varying loads and designed a CNN-BiLSTM-GAM hierarchical model.
The model integrates multi-scale local and global features, capturing fault patterns in short intervals and persistent behaviors across time scales, ensuring accurate and robust fault diagnosis under dynamic load conditions.
A multi-scale GAM dynamically adjusts feature importance across time steps and sensors, enabling key sensor selection and layout optimization.
2. Methods and Models
During the operation of the CJ, fault occurrences can lead to significant variations in critical physical parameters. The CJ system prototype developed in this study accurately replicates the dynamic characteristics and functional properties of an actual spread mooring system. Through the design of a non-destructive fault injection test platform, various typical fault conditions were precisely simulated. To address the uncertainty in time-series signals caused by sensor measurement errors, the research team deployed a pressure sensor array (FS1–FS11), temperature sensor network (TS1–TS12) and flow monitoring units (FS1–FS5) at key nodes of the hydraulic actuators, establishing a synchronized multi-physical quantity acquisition system. However, while this distributed sensor network enhances system observability, it inevitably introduces challenges, such as increased computational complexity and sensor data conflicts. Hence, this paper innovatively proposes a deep network diagnostic framework based on a GAM. The proposed model achieves precise identification of faults with varying severity, including internal pump leakage (I0–I2), proportional solenoid valve conditions (V0–V3) and circuit pipe blockages (P0–P3). By incorporating the GAM, the model not only accomplishes accurate fault classification but also generates a sensor importance evaluation matrix, providing quantitative guidance for optimizing the configuration of the sensor network.
2.1. CNN-GAM Model Architecture
This paper discusses the application of the CNN-GAM and CNN-BiLSTM-GAM model in fault diagnosis and sensor optimization. The model is trained and validated by fusing multi-scale time-series data with different fault severities under full-load conditions. The performance of the model is tested under half-load conditions.
Initially, this study employs a CNN integrated with a GAM to assess the severity of hydraulic system faults under varying load conditions, as illustrated in
Figure 1. The convolutional layer extracts spatial local features from the multivariate time-series data, which are then weighted by the GAM to emphasize the most critical features and evaluate each sensor’s importance.
The primary function of CNNs in fault identification is to extract local features from input data via convolution operations. For sensor or time-series data, local features often reveal critical patterns or anomalies within the signal, providing valuable insights into system behavior. By utilizing convolutional layers, the model can automatically learn these features, enhancing its ability to comprehend local data structures and facilitating the detection of potential fault signals. The data from 28 multivariate sensors are input into a 1D convolutional layer with a batch size of 32, allowing for hierarchical feature extraction through stacked convolutional layers. The ReLU activation function is employed to mitigate the issue of gradient disappearance. Following each convolutional layer, a max pooling layer performs sampling aggregation to complete local feature extraction. The output of the convolutional network serves as input for the self-attention mechanism. The CrossEntropy loss function iteratively updates weights using the error backpropagation algorithm, facilitating the optimal convergence of the loss function. A fully connected (FC) layer calculates the weight of each feature, acting as a classifier where its input is the convolutional layer output, and its output represents the feature weights. By multiplying these weights with the convolutional output, we obtain the weighted features, which are then passed to the FC layer to generate the final classification result using the Softmax activation function. The Adam optimizer is employed during training with a learning rate of 0.001, and the model undergoes training for 50 epochs. In each epoch, the training set undergoes forward and backward propagation to update the weights, using a mini-batch gradient descent strategy with a batch size of 32. After each epoch, hyperparameter tuning is performed on the validation set to evaluate model performance and select the optimal configuration.
This approach optimizes sensor placement, reducing the model’s computational costs while accurately identifying fault severity. Assume that
,
and
represent the input data from the 11 pressure sensors, 5 flow sensors and 12 temperature sensors, respectively. The input of the model can be expressed as follows:
The CNN layer extracts local spatial features from multivariate time-series data. Convolutional filters use a sliding window to scan the data, highlighting critical fault-related features in short time intervals. Equation (2) demonstrates how the CNN extracts local features at different scales.
where
,
represent the number of convolutional kernels at different scales;
denotes the weights of the convolutional kernel at the
s-th scale; and
is the bias term. The GAM dynamically assigns weights to different features through weighted summation, enabling the selection and fusion of features extracted by the CNN while suppressing redundant and irrelevant features.
2.2. CNN-BiLSTM-GAM Model Architecture
While the CNN-GAM fault diagnosis model effectively extracts fine-grained features through local convolution and captures global features via the attention mechanism, its performance may be suboptimal under actual operating conditions. This decline arises from the dynamic progression of hydraulic system fault severity over time, where critical fault information may be obscured by long-term dependencies in the time-series data, ultimately reducing diagnostic accuracy. Hence, this paper proposes an innovative CNN-BiLSTM-GAM hybrid architecture based on the fusion of multi-scale time-series data features under variable load conditions, as illustrated in
Figure 2.
The proposed model establishes a hierarchical feature learning framework that synergistically integrates the spatial pattern recognition capability of a CNN with the temporal dynamics modeling strength of BiLSTM, further enhanced by a GAM. The CNN component extracts spatial features for preliminary fault type identification, while the BiLSTM network precisely captures the grained evolution of fault severity from initial stages to complete failure, thereby enabling early fault detection essential for predictive maintenance. This addresses the limitations of conventional CNN-BiLSTM networks in preserving subtle fault signatures, particularly under half-load conditions where fault characteristics become significantly attenuated. The core innovation of this research lies in the implementation of the GAM as an intelligent feature regulator that performs 3D attention weighting. Firstly, the spatial-dimension weighting across sensor channels emphasizes critical measurement points while simultaneously filtering redundant sensors through weight screening, thereby reducing computational overhead. Secondly, temporal-dimension weighting along the time series reinforces key fault progression patterns, providing crucial support for the real-time condition monitoring of CJ systems. Finally, cross-layer hierarchical weighting preserves diagnostically relevant yet weak features during feature fusion, enhancing the model’s adaptability across diverse operating conditions. This multi-scale attention mechanism dynamically recalibrates feature importance at different abstraction levels, effectively overcoming the signal-to-noise ratio challenge in half-load fault detection while maintaining high sensitivity to incipient fault symptoms. The proposed architecture demonstrates exceptional performance in characterizing the nonlinear interactions between load variations and fault progression characteristics, establishing a new benchmark for condition monitoring in hydraulic systems. The methodology advances the field by providing a robust framework for early fault identification and system health management under complex operational scenarios.
The proposed CNN-BiLSTM-GAM model employs carefully configured parameters to optimize its temporal modeling and attention mechanisms. For the BiLSTM component, we set the hidden dimension to 16 units with a single recurrent layer, adopting a bidirectional configuration to capture both forward and backward temporal dependencies in the hydraulic system data. This compact yet effective design yields a total of 2112 trainable parameters for the BiLSTM layer, balancing model complexity with computational efficiency. The GAM is implemented with 2 parallel attention heads operating on the 32-dimensional feature space. This configuration creates multiple representation subspaces that jointly attend to different temporal patterns in the fault evolution process. The attention layer introduces 4160 parameters. The CrossEntropy loss function is chosen to suit the multi-category classification task. During model training, the Adam optimization algorithm is utilized with a learning rate of 0.001, enabling efficient parameter updates and promoting rapid convergence. A batch size of 32 is selected to balance memory stability and model learning efficiency. The model undergoes 50 training cycles to capture the underlying data patterns adequately. The BiLSTM output at each scale can be expressed as follows:
where
and
represent the time-dependent features extracted at each scale
;
denotes the concatenated features from the forward and backward directions.
The GAM is used to fuse features from different time steps, sensors and scales, dynamically adjusting the importance of these features. Through the GAM, the model assigns different attention weights based on the contribution of each sensor and time point to fault severity. The multi-scale properties enable the model to focus on faults occurring at various time scales, ensuring it captures both immediate fault behaviors and persistent fault patterns. The features after global fusion are shown in Equation (4).
where
,
and
represent the features generated by the GAM layer;
is the attention matrix across time steps; and
and
represent the weights at the
s-th scale and the attention weights from multi-scale fusion, respectively.
By integrating an importance weight mechanism, the overall architecture enables the model to adapt effectively to multi-dimensional features, improving robustness and classification accuracy for complex data. The final global features are computed by dynamically weighting the sensors and time steps, as shown in Equation (5).
where
and
denote the weights of the
t-th time step and the
j-th sensor dimension, respectively.
Following feature extraction, classification is performed using the fully connected (FC) layer. The FC1 layer utilizes batch normalization to expedite the training process and improve the stability of the model. The final output layer maps the extracted features to identify varying degrees of the three fault types, as shown in Equation (6). This structural design maximizes the potential information within time-series data, offering an effective solution for fault diagnosis tasks.
3. Experimental Work
The hydraulic fault simulation test bench presented in this study is a sophisticated, multi-functional testing device that can be categorized into two primary components. The first component is the hydraulic mechanical structure, which encompasses the limit, clamping and lifting systems. The second component consists of the measurement and control system, including an upper computer (PC), a lower computer (PLC) and sensors for gathering the system’s physical parameters. These elements collaborate effectively to achieve automated control of fault simulation tests and ensure precise collection of system parameters.
3.1. Experimental Bench
The principle and function of the CJ prototype of the test bench are the same as those of the actual spread mooring system. It mainly uses the lifting hydraulic cylinder group to realize the segmented lifting of the loaded anchor chain, and uses the limit and clamping hydraulic cylinder group to realize the limit and clamping of the load anchor chain, so as to prevent the chain from slipping under load. The main structure and physical diagram of the prototype of the hydraulic fault simulation test bench for the CJ are shown in
Figure 3.
The test bench realizes the segmented retracting and releasing of the loaded chain under the cooperation of lifting, clamping and limiting hydraulic cylinder groups. To lift the chain, the piston of the cylinder group extends outward, releasing the clamp’s restriction on the chain. As the pivot of the clamping cylinder group moves inward, the chain link is clamped by the clamping clamp. The lifting hydraulic cylinder group promotes the lifting beam to move upward. A piston moves inward when it moves to the top dead center to reach the stroke limit of the lifting hydraulic cylinder group. Limiting clamps restrict chain links, clamping hydraulic cylinders move outward and clamping clamps release the clamping effect on chain links. At this time, the lifting hydraulic cylinder group drives its own mechanism to move downward to the stroke limit of the bottom dead center. The test bench repeats the above working cycle, and the chain is lifted in sections to raise the load to the working position. The need to reset the chain operation and the lifting of the loaded chain step is opposite.
3.2. Experimental Descripition
Based on the working principle of the CJ, the hydraulic system of the test bench comprises three subsystems: the hydraulic clamping system (HCS), hydraulic lifting actuator (HLA) and hydraulic limiting system (HLS). The HLS controls the hydraulic cylinder group to restrict the mooring chain. Firstly, the HCS controls the opening and closing of the mooring chain clamping mechanism for chain clamping. Subsequently, the HLA is used to deploy and retrieve the chain for mooring tension regulation. Finally, the limit clamp mechanism is driven by HLS to constrain the position of the mooring chain to prevent the chain from falling off. The three subsystems cooperate with each other to complete the work cycle under the same hydraulic tank and hydraulic pump drive. The fault injection principle and sensor arrangement of the CJ hydraulic system simulation are shown in
Figure 4.
Through the normal operation of the lifting system, the analog fault injection module controls the adjustable throttle valves 11.1, 11.2 and 11.3 to be closed. The motor 1 drives the hydraulic pump 2 to work and output the pressure oil, which is transmitted to the high-pressure filter 4, the check valve 5.1, the ball valve 10.1, the electromagnetic directional valves 6.1 and 6.2 and the hydraulic cylinder group 7, and then changes the flow direction through the electromagnetic directional valve group, flows through the check valve 5.2, the ball valve 10.6 and finally flows back to the oil tank to complete a working cycle.
The hydraulic fault simulation test bench of the CJ allows for reversible changes in each component to achieve fault simulation injection. The hydraulic system may have multiple faults at the same time, and the duration, severity and type of faults are different. Non-destructive simulation fault injections were conducted using the lifting hydraulic system of the test bench. By adjusting the switches of the relevant valve components and coordinating their settings, specific simulation fault injections were designed for three hydraulic system components: the power component (hydraulic pump leakage), the executive component (proportional solenoid valve conditions) and the control component (circuit pipe blockage). The methods for simulating different fault types are as follows:
Circuit pipe blockages: Check valve 5.1 as a simulation of the blockage object. In the lifting system work, in turn, open the hydraulic circuit of the ball valve 10.2, adjustable throttle valve 11.2 and the ball valve 10.3 and then close the ball valve 10.1, by adjusting the opening of the throttle valve 11.2 to simulate the different levels of blockage of the check valve. The safety valve 12.1 plays a role in overload protection.
Internal pump leakage: The leakage of hydraulic pump 2 is realized by connecting an adjustable throttle valve 11.1 in series between the outlet of the hydraulic pump and the fuel tank. The different leakage levels of the hydraulic pump are simulated by adjusting the opening of the throttle valve. In order to prevent the backflow of hydraulic oil during the leakage test of the simulated hydraulic pump, a check valve 13 is installed at the outlet of the hydraulic pump to ensure the flow direction of the oil circuit.
Proportional solenoid valve conditions: The proportional solenoid reversing valve is a hydraulic control component that plays a conversion role between the hydraulic control system and the electrical control system. It can not only change the direction of the hydraulic oil flow but also control the reversing speed and flow. Valve spool stagnation can cause improper operation, leading to abnormal pressure and flow fluctuations before and after the proportional solenoid directional valve in the hydraulic system. By manually adjusting the current settings of proportional solenoid directional valves 6.1 and 6.2 via the PLC control panel, different spool movements are controlled, altering the flow rate through the directional valve to simulate varying degrees of spool stall.
3.3. Data Acquisition
Developing a fully operational hydraulic failure simulation test bench necessitates the integration of a robust measurement and control system. This system must precisely regulate hydraulic actuators, execute specified action functions and evaluate system parameters under diverse environmental conditions. In this study, the measurement and control system for the CJ hydraulic failure simulation test bench employs a programmable logic controller (PLC) for the lower computer, utilizes a C++ environment for the upper computer software and establishes communication between the two systems via the Modbus-TCP protocol to facilitate data interaction and control.
To fulfil data acquisition requirements, the measurement and control system of the test bench must encompass the following functionalities:
The sensors used in the test bench should have a fast frequency response, which can accurately and sensitively collect physical quantities. The test bench allows for simultaneous sampling of the pressure, flow rate and temperature sensors at varying rates, with the maximum sampling frequency exceeding 100 Hz. At the same time, the sampling rate of each sensor is self-adjustable, and the collected analogue quantity is delayed in the PLC after processing. After processing, it is displayed and stored on the PC.
The PC manages and adjusts the test parameters, while operators can manually configure the console and touch screen on the lower-level PLC. The control program supports adjustments to accommodate varying test requirements, including different load conditions, simulated fault types and fault severity.
In addition to the inherent sampling channels of the test bench, additional channels are reserved to accommodate future needs. Both the lower-level PLC and upper-level control software are designed using a modular architecture, enabling flexible modifications and facilitating the secondary development of the test bench.
The test bench console and touch screen are configured with select switches, function buttons, operation indicator lights, etc. The lower-level PLC is responsible for managing the input and output states of both analog and digital signals, ensuring precise control and data acquisition. When the test is abnormal, the relevant indicator lights will issue an early warning signal.
It ensures seamless communication and interactive control between devices by providing the required network and USB interfaces.
Equipped with a UPS power failure protection function, the system prevents test data loss in the event of an accidental power outage.
According to the principles of fault simulation and injection, three fault types are simulated under variable load conditions in the lifting hydraulic system: internal pump leakage, circuit pipe blockage and proportional solenoid valve conditions. To comprehensively simulate various fault scenarios, different severities are defined for each fault condition. The physical parameters of the hydraulic system under diverse operating conditions are collected, yielding a substantial dataset for subsequent fault diagnosis research. Notably, the varying severity of each fault requires threshold adjustments, which can be performed either manually or automatically. To maintain system safety, these thresholds must be meticulously calibrated based on comprehensive testing and debugging. The prior study [
22] outlines the detailed analysis process. As shown in
Figure 5, FS2 is used as an example to analyze the real-time data of multivariate time series with severities of fault under variable load conditions.
Figure 5a presents the real-time dataset for circuit pipe blockage severity under varying load conditions, categorized into four levels: no blockage (P0), weak blockage (P1), severe blockage (P2) and close to total failure (P3).
Figure 5b illustrates the dataset for internal pump leakage severity, classified into three levels: no leakage (I0), weak leakage (I1) and severe leakage (I2).
Figure 5c displays the dataset for valve condition faults, also under different load conditions, divided into four levels: optimal switching behavior (V0), small lag (V1), severe lag (V2) and close to total failure (V3).
This paper employs a deep neural network architecture based on a self-attention mechanism to diagnose and identify varying degrees of three fault types. The fault time-series data collected in
Section 3 serve as inputs to the network, with 80% of the samples under full load utilized as the training dataset and 10% reserved for model validation. Given that the hydraulic system experiences variable loads during actual operations, data samples from 10% half-load conditions are further used as test datasets to evaluate the model’s diagnostic performance across different load scenarios, as illustrated in
Table 1.