Dynamic Multi-Fault Diagnosis-Based Root Cause Tracing for Assembly Production Lines of Liquid Storage Tanks

Teng, You; Li, Donghui; Xue, Hongkai; Zhou, Yunkai; Wang, Kefu; Wu, Qi

doi:10.3390/electronics14081546


by You Teng ^1,2,3, Donghui Li ^1, Hongkai Xue ^2, Yunkai Zhou ^2, Kefu Wang ^3 and Qi Wu ^2,*

^1 School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
^2 College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
^3 Zhejiang Qiaoshi Intelligent Industry Co., Ltd., Ningbo 315470, China
^* Author to whom correspondence should be addressed.

Electronics 2025, 14(8), 1546; https://doi.org/10.3390/electronics14081546

Submission received: 24 February 2025 / Revised: 1 April 2025 / Accepted: 9 April 2025 / Published: 10 April 2025


Abstract

Tracing the root cause of defective products in liquid storage tank (LST) production is challenging because of the complex dependencies between production and inspection processes. Multiple production processes are coupled, and the correspondence between faults in production processes and inspection links is non-unique, so these faults are usually difficult to locate directly with a single inspection process. This paper studies the problem of tracing the root cause of defective LST products caused by process parameter deviations or human operation errors during production. A root cause tracing method based on the dynamic multi-fault diagnosis (DMFD) framework is proposed. First, a factorial hidden Markov model (FHMM) is established to describe the state transition process of the LST product, whose status changes over time and across production processes. The product state at each production process is treated as a hidden state, and the outcome of each inspection process as an observation state. Then, the Viterbi algorithm is employed to solve the hidden state transition matrix and diagnostic matrix within the FHMM framework. Finally, experimental verification is carried out on a real LST assembly production line, and the influence of imperfect testing on model accuracy is also considered. The assembly line encompasses three discrete links: welding of the upper and lower bodies, installation of check valves, and installation of sensors. Experimental results demonstrate that the proposed method significantly outperforms existing algorithms.

Keywords:

liquid storage tanks (LSTs); automotive component manufacturing; defective products; root cause tracing; dynamic multi-fault diagnosis

1. Introduction

In discrete manufacturing systems, root cause tracing of defective products plays a pivotal role in quality control. This is because it not only precisely pinpoints the underlying cause of defective products but also clearly uncovers the fault propagation path within the production process. Such insights are essential for optimizing processes and enhancing product quality [1,2]. Take the LST assembly line, a quintessential complex discrete manufacturing system, as an example. It encompasses processes like welding, sensor assembly, and filter mesh installation. These processes are intricately interlinked. A single production process failure can potentially cascade through the entire process [3]. This leads to the generation of defective products with complex causes, and causes other processes to deviate from normal levels. Consequently, these challenges pose significant hurdles to the root cause tracing of defective products [4,5].

In recent years, root cause tracing for defective products has garnered increasing attention in the industry, with many effective methods emerging in engineering practice. These methods mainly include model-based [6,7], logic inference-based [8,9], and artificial intelligence-based [10,11] approaches. Logic inference-based methods analyze and identify the underlying cause by applying logical rules and reasoning. For example, Yan Feng Li et al. [12] proposed a fault tree-based root cause tracing method, and its effectiveness was experimentally verified through the CNC machining center’s hydraulic system platform. However, logic inference-based methods often rely heavily on expert experience, and they struggle to capture the dynamic characteristics of the system in real-time. Additionally, artificial intelligence methods have gained increasing attention in recent years due to their powerful data processing and pattern recognition capabilities. Qiuping Ma et al. [13] proposed a KNN clustering and MLP-driven root cause identification method for product quality inspection, aimed at automatically predicting the root cause of various quality issues. However, the black-box problem severely affects the interpretability of artificial intelligence methods, limiting their further application in actual industrial production. In contrast, model-based methods grounded in the white-box concept can construct dynamic models of the production system, clearly revealing each process’s operation logic, their interconnections, and the mechanisms linking these processes to product quality. Ruan Sui [14] proposed a DMFD framework for complex systems. It can infer the most likely set of faults in real-time, reveal the fault propagation path, and accurately identify the root cause of defective products. Shakeri et al. [15] developed a modeling method for a two-level coordinated solution framework, where dynamic programming techniques were used to solve the original DMFD problem. 
On this basis, Anuradha et al. [16] conducted validation experiments on an automotive power generation and storage system, thereby demonstrating the effectiveness of the two-level coordinated framework. Model-based methods have also been applied successfully in diverse fields, including electronics [17], mechatronic systems [18], mechanical systems [19,20], and chemical engineering [21]. However, in specific scenarios such as the LST assembly production line, existing model-based methods exhibit certain limitations. The production processes of LST assembly lines are complex and interconnected, while the test results of LST products are often imperfect, for example because of missing data or inadequate testing precision. Modeling approaches that address these issues effectively are lacking, which makes it difficult to meet the practical needs of precise analysis, fault diagnosis, and quality control in the production process.

Hidden Markov models (HMMs) are an ideal choice for modeling DMFD problems. In discrete manufacturing systems, HMM can use observed states to represent product inspection results related to quality and hidden states to describe the actual states associated with quality. Ying et al. [22] were the first to use HMMs to formalize dynamic fault diagnosis problems. In addition, Q. Suxiang et al. [23] introduced the theory of HMMs into the field of power transformer fault diagnosis. Qiu et al. [24] integrated multi-feature fusion technology with Gaussian mixture hidden Markov models to conduct fault diagnosis on a multi-axis engraving machine platform. However, the HMM typically assumes that the system has only a single-component state at most, which restricts its capacity to comprehensively model multiple faults [18]. As an extension of the HMM, the FHMM supposes the system to be composed of multiple independent Markov chains, which endows it with the capability to handle multiple related factors simultaneously. For instance, Satnam Singh et al. [19] proposed a fault diagnosis method based on the FHMM, which provides important theoretical support for the modeling and analysis of dynamic multi-faults. Inspired by the aforementioned methods, a dynamic multi-fault diagnosis modeling method based on the FHMM is proposed, and it is applied to trace the root causes of defective products on the LST assembly line. First, the problem of tracing the root causes of defective products is mathematically modeled and an FHMM within the dynamic multi-fault diagnosis framework is constructed. Then, the model parameters are iteratively optimized by applying the EM algorithm. Finally, the hidden state transition matrix and the diagnostic matrix are solved using the Viterbi algorithm so that the optimal root cause tracing path for the defective LSTs can be obtained.

The contributions of this paper are as follows:

A DMFD-based framework is proposed to locate the root cause of defective products in the LST assembly line. An FHMM is established by utilizing key factors such as production, inspection processes, and inspection results to describe the changes in product quality. This transformation turns the problem of root cause analysis into a solvable DMFD problem.
The impact of imperfect testing on the root cause tracing of defective products is taken into account, and a model that is closely aligned with the actual scenario is constructed. Through formula derivation, the missing detection results are incorporated into the model. Moreover, experiments are designed to quantify the influence of incorrect results on the accuracy of root cause tracing. Consequently, the reliability of root cause tracing for defective products in practical production is enhanced.
Experimental verification has been carried out on a real LST assembly production line. The experimental results show that the proposed method can achieve a 100% accuracy rate for root cause tracing of three typical quality issues, namely welding misalignment, missing installation of the valve body, and sensor offset.

The structure of this paper is as follows. The related work is introduced in Section 2. In Section 3, we provide a system description and mathematical modeling of the LST assembly line. Section 4 presents the dynamic inference algorithm based on the FHMM. Section 5 presents the results of the computational experiments to evaluate the performance of the inference algorithm. In Section 6, we discuss the application scenarios of the proposed method in this paper. Section 7 provides the conclusion of this paper.

2. Related Work

Multi-fault diagnosis methods can be broadly classified into two categories: data-driven methods and model-driven methods. Data-driven methods use statistical analysis and machine learning to detect fault patterns from data without explicit system modeling [25,26]. For example, the Transformer model has made significant progress in mechanical equipment fault diagnosis [27], benefiting from its self-attention mechanism and parallel computing capabilities. Muhammad Samiullah et al. [28] proposed a Decision Tree algorithm for motor fault classification, which efficiently handles large-scale datasets and offers a certain degree of interpretability. However, it relies heavily on data and is highly dependent on labeled samples, which are often scarce in real-world scenarios. At the same time, it exhibits certain “black-box” characteristics, making its diagnostic results less interpretable.

Model-driven methods rely on the system’s mathematical modeling and physical laws [29]. For example, Nan C et al. [30] proposed a fault diagnosis model based on prior knowledge to address challenges in abnormal operating conditions in complex environments, demonstrating good interpretability through systematic modeling. Christoph Wehner et al. [31] introduced an interactive intelligent RCA tool that significantly reduces the learning time of causal Bayesian networks and decreases the number of false causal relationships, thereby improving the efficiency of fault cause analysis in electric vehicle manufacturing. Yiming Xu et al. [32] proposed a model-based fault diagnosis method for application in the battery management system (BMS) of lithium-ion batteries (LIBs). However, in practical applications, faults typically arise from the interactions of multiple factors, which makes them difficult to model precisely, and the aforementioned methods fail to capture these complex dependencies effectively. The FHMM models multiple components through factored hidden states and can therefore capture the complex dependencies within the system, providing a feasible solution.

3. System Description and Mathematical Modeling

The LST shown in Figure 1 was produced by a company. It is typically made of transparent material and mainly consists of the tank cover, upper and lower tank bodies, liquid level alarm, float, and filter screen. The functional description of each of these components is presented in Table 1, which provides a detailed understanding of how each part contributes to the overall function of the LST. The production of the LST is mainly accomplished through the cooperation of three workshops, namely the injection molding workshop, the small parts workshop, and the assembly workshop. The injection molding workshop manufactures the main components of the LST, including the upper and lower parts of the tank body. The small parts workshop produces the accessories for the liquid storage tank. The semi-finished products produced by the injection molding workshop and the small parts workshop are transferred to the assembly workshop. In the assembly workshop, the upper and lower parts of the LST are welded into a sealed tank body, and the accessories are assembled with the tank body to complete the final assembly of the LST. Specifically, only the production process of the assembly workshop is focused on in this paper, with the assumption that the semi-finished products provided by the injection molding workshop and the small parts workshop are of qualified quality.

The schematic diagram of the LST assembly line in the assembly shop is shown in Figure 2. The automotive braking LST undergoes a series of processes from raw materials to finished products. Key production processes include corresponding detection steps. Due to the high scrap cost caused by quality issues, the production quality indices of both semi-finished and finished products are tested at the early, middle, and late stages of the production process. Figure 3 shows the production installation and testing steps in the core of the assembly plant, including upper and lower body welding, check valve installation, air-tightness test, mechanical performance test, check valve plus shell air-tightness test, sensor installation, and sensor pull-out force test.

First, the upper and lower bodies of the LST with the float are welded using a servo welding machine. Then, a check valve is installed on the welded tank. The LST next undergoes an air-tightness test, whose results are recorded in the air-tightness test table. The mechanical performance test is recorded in the mechanical performance test table, and the check valve test is recorded in the check valve shell air-tightness test table. After these three tests are completed, the sensor is installed, followed by the sensor pull-out test, which checks how firmly the sensor is installed. Once the pull-out test is completed, the product is finished, and marking and packaging begin.

In this article, the task of root cause tracing of defective LST products is defined as a DMFD problem.

As shown in Figure 4, the problem can be represented as an FHMM, as discussed in [19,26]. Here, the FHMM state is factored into multiple state variables and represented in a distributed manner. Specifically, the state transitions between components are stable. Formally, a DMFD problem can be defined as

DMFD = \{S, κ, T, O, D, P, Π\}

(1)

where S = \{s_{1}, \dots, s_{m}\} is a finite set of m components associated with the system; κ = \{0, 1, \dots, k, \dots, K\} is the set of discretized observation epochs; T = \{t_{1}, t_{2}, \dots, t_{n}\} is a finite set of n available binary tests, with passed tests T_{p} and failed tests T_{f}; O is a finite set of test outcomes up to and including epoch K; D = [d_{i j}] is the D-matrix; P = \{P_{d}, P_{f}\} is a set of probabilities of detection and false alarm; and Π = \{P_{a_{i}} (k), P_{v_{i}} (k)\} denotes the set of fault appearance probabilities P_{a_{i}} (k) and fault disappearance probabilities P_{v_{i}} (k).

The state of the m production components at the k-th epoch is \hat{x} (k) = \{x_{1} (k), x_{2} (k), \dots, x_{m} (k)\}, assuming that the initial state \hat{x} (0) is known (or its distribution is known). Here, for each i = 1, 2, \dots, m, the value of x_{i} (k) is determined by

x_{i} (k) = \begin{cases} 1, & \text{if component } s_{i} \text{ is faulty at epoch } k \\ 0, & \text{otherwise} \end{cases}

(2)

In practice, DMFD tasks are divided into perfect and imperfect situations. In the perfect situation, every test result is available, i.e., O^{K} = \{O (k) = (O_{p} (k), O_{f} (k))\}_{k = 1}^{K}, where O_{f} (k) is the set of failed tests at epoch k, and O_{p} (k) is the set of passed tests at epoch k. In the imperfect situation, due to human- or equipment-related factors, the detection results may not be completely recorded, i.e., (O_{f} (k) \cup O_{p} (k)) \subseteq O. The observation sequence is then defined by

o_{j} (k) = \begin{cases} 0, & \text{if } t_{j} (k) \in T_{p} \\ 1, & \text{if } t_{j} (k) \in T_{f} \\ 2, & \text{otherwise} \end{cases}

(3)

where o_{j} (k) represents the outcome of the j-th test at epoch k. When o_{j} (k) = 2, the test result is missing.
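The three-valued encoding of Equation (3) can be sketched in a few lines of Python. This is only an illustration: the function name `encode_observations`, the zero-based test indices, and the constants `PASS`, `FAIL`, and `MISSING` are assumptions introduced here, not part of the original method description.

```python
from typing import Iterable, List

# Outcome codes following Eq. (3): 0 = passed, 1 = failed, 2 = result missing.
PASS, FAIL, MISSING = 0, 1, 2


def encode_observations(n_tests: int,
                        passed: Iterable[int],
                        failed: Iterable[int]) -> List[int]:
    """Return [o_0(k), ..., o_{n-1}(k)] for one epoch (hypothetical helper).

    `passed` and `failed` hold zero-based indices of the tests recorded as
    passed or failed at this epoch; every other test is treated as missing.
    """
    passed_set, failed_set = set(passed), set(failed)
    if passed_set & failed_set:
        raise ValueError("a test cannot both pass and fail in one epoch")
    outcomes = []
    for j in range(n_tests):
        if j in passed_set:
            outcomes.append(PASS)
        elif j in failed_set:
            outcomes.append(FAIL)
        else:
            outcomes.append(MISSING)
    return outcomes


# Example: four tests, the first and third passed, the second failed,
# and the fourth was not recorded.
example = encode_observations(4, passed=[0, 2], failed=[1])
```

Keeping an explicit code for missing results lets later likelihood computations skip those tests instead of silently treating them as passes.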

Under the assumption of conditional independence, the likelihood function Pr (O^{K} ∣ X^{K}, \hat{x} (0)), which describes the probability of the observed test results O^{K} given the fault states X^{K} and the initial state \hat{x} (0), is calculated as

Pr (O^{K} ∣ X^{K}, \hat{x} (0)) = \prod_{k = 1}^{K} Pr (O_{p} (k) ∣ \hat{x} (k)) \cdot Pr (O_{f} (k) ∣ \hat{x} (k)),

(4)

where X^{K} = \{\hat{x} (1), \dots, \hat{x} (K)\} and Pr (O (k) ∣ \hat{x} (k)) is the probability of the test results O (k) given the fault state \hat{x} (k) at epoch k.

Then, we define the matrix D = [d_{i j}] as the diagnostic matrix, which represents the dependencies between the fault-related production processes s_{i} and the detection processes t_{j}. This matrix captures the causality between the failed component (or root cause failure) of the system and the corresponding test. We introduce the collection P = \{P_{d}, P_{f}\}, which includes the fault detection and false alarm probabilities. Specifically, the fault detection probability P_{d_{i j}} = Pr (o_{j} (k) = 1 ∣ x_{i} (k) = 1) is the probability that the j-th test detects the failure of the i-th component, and the false alarm probability P_{f_{i j}} = Pr (o_{j} (k) = 1 ∣ x_{i} (k) = 0) is the probability that the j-th test falsely indicates a failure when the i-th component is functioning. The state of each fault is modeled as a non-homogeneous Markov chain. For each fault state, we define Π = \{P_{a_{i}} (k), P_{v_{i}} (k)\}, where P_{a_{i}} (k) = Pr (x_{i} (k) = 1 ∣ x_{i} (k - 1) = 0) is the probability that the fault appears at epoch k, given that it was not present at epoch k - 1, and P_{v_{i}} (k) = Pr (x_{i} (k) = 0 ∣ x_{i} (k - 1) = 1) is the probability that the fault disappears at epoch k, given that it was present at epoch k - 1.
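As a small illustration of the per-fault Markov chain, the appearance and disappearance probabilities can be arranged into a 2x2 transition matrix. The sketch below assumes, for simplicity, that P_{a_{i}} and P_{v_{i}} do not change with k (the text allows them to vary); the function names and the numeric values are hypothetical.

```python
from typing import List


def transition_matrix(p_a: float, p_v: float) -> List[List[float]]:
    """Return A with A[r][c] = Pr(x_i(k) = c | x_i(k-1) = r).

    p_a is the fault appearance probability and p_v the fault disappearance
    probability, both assumed constant over epochs in this sketch.
    """
    for name, p in (("p_a", p_a), ("p_v", p_v)):
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"{name} must lie in [0, 1], got {p}")
    return [[1.0 - p_a, p_a],
            [p_v, 1.0 - p_v]]


def fault_probability_after(steps: int, p_a: float, p_v: float,
                            p_fault0: float = 0.0) -> float:
    """Propagate Pr(x_i = 1) forward for `steps` epochs without observations."""
    a = transition_matrix(p_a, p_v)
    p = p_fault0
    for _ in range(steps):
        # Either the fault appears from the healthy state or it persists.
        p = (1.0 - p) * a[0][1] + p * a[1][1]
    return p


# Hypothetical numbers: a fault appears with probability 0.1 per epoch and,
# once present, never disappears on its own (p_v = 0).
A = transition_matrix(0.1, 0.0)
p_two_epochs = fault_probability_after(2, 0.1, 0.0)
```

Each row of the matrix sums to one, which is a quick sanity check when the probabilities are estimated from data.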

4. Inference Algorithm for Fault Localization and Diagnosis

The fault diagnosis task in this paper is defined as a maximum a posteriori (MAP) estimation problem: find the fault state sequence that best explains how the faults evolve over the time steps.

The solution {\hat{X}}^{K} = \{\hat{x} (1), \hat{x} (2), \dots, \hat{x} (K)\} is the state sequence that best explains the observed test results:

{\hat{X}}^{K} = \arg\max_{X^{K}} Pr (X^{K} ∣ O^{K}, \hat{x} (0)),

(5)

where K is the total number of epochs; when K = 1, the problem reduces to a static fusion problem. Using Bayes' rule, and noting that Pr (O^{K} ∣ \hat{x} (0)) does not depend on X^{K}, the objective function is equivalent to

{\hat{X}}^{K} = \arg\max_{X^{K}} Pr (O^{K} ∣ X^{K}, \hat{x} (0)) Pr (X^{K} ∣ \hat{x} (0)) .

(6)

Because the fault state evolution has the Markov property and, given the fault state, the passed and failed test results are conditionally independent, the objective function is equivalent to

{\hat{X}}^{K} = \arg\max_{X^{K}} \prod_{k = 1}^{K} [\{Pr (O_{p} (k) ∣ \hat{x} (k)) \cdot Pr (O_{f} (k) ∣ \hat{x} (k))\} \cdot Pr (\hat{x} (k) ∣ \hat{x} (k - 1))],

(7)

where O_{p} (k) \subseteq O and O_{f} (k) \subseteq O represent the sets of passed and failed tests at epoch k, respectively. A new function f_{k} (\hat{x} (k), \hat{x} (k - 1)) is defined as follows:

f_{k} (\hat{x} (k), \hat{x} (k - 1)) = \ln \{Pr (O_{p} (k) ∣ \hat{x} (k)) Pr (O_{f} (k) ∣ \hat{x} (k)) Pr (\hat{x} (k) ∣ \hat{x} (k - 1))\} .

(8)

Given the fault state \hat{x} (k), the test results are independent. Therefore,

Pr (O_{p} (k) ∣ \hat{x} (k)) = \prod_{o_{j} (k) \in O_{p} (k)} Pr (o_{j} (k) = pass ∣ \hat{x} (k)),
Pr (O_{f} (k) ∣ \hat{x} (k)) = \prod_{o_{j} (k) \in O_{f} (k)} Pr (o_{j} (k) = fail ∣ \hat{x} (k)) .

(9)

For the test result o_{j} to be a pass, the test must pass with respect to every fault state associated with it; therefore,

Pr (o_{j} (k) = pass ∣ \hat{x} (k)) = \prod_{i = 1}^{m} Pr (o_{j} (k) = pass ∣ x_{i} (k)),

(10)

where

Pr (o_{j} (k) = pass ∣ x_{i} (k)) = {(1 - P_{d_{i j}})}^{x_{i} (k)} {(1 - P_{f_{i j}})}^{1 - x_{i} (k)} = \begin{cases} 1 - P_{d_{i j}}, & x_{i} (k) = 1 \\ 1 - P_{f_{i j}}, & x_{i} (k) = 0 . \end{cases}

(11)

Consequently,

Pr (o_{j} (k) = fail ∣ \hat{x} (k)) = 1 - Pr (o_{j} (k) = pass ∣ \hat{x} (k)) .

(12)
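The pass and fail probabilities of a single test can be computed directly from the product form above. The following Python sketch is illustrative only: the function names and the numeric detection and false alarm probabilities are made up for the example, and each list holds the values for one fixed test j across the components i.

```python
from typing import Sequence


def pass_probability(x: Sequence[int],
                     p_d_col: Sequence[float],
                     p_f_col: Sequence[float]) -> float:
    """Pr(o_j(k) = pass | x(k)) as a product over components.

    A faulty component (x_i = 1) contributes 1 - Pd_ij, a healthy one
    (x_i = 0) contributes 1 - Pf_ij.
    """
    prob = 1.0
    for x_i, p_d, p_f in zip(x, p_d_col, p_f_col):
        prob *= (1.0 - p_d) if x_i == 1 else (1.0 - p_f)
    return prob


def fail_probability(x: Sequence[int],
                     p_d_col: Sequence[float],
                     p_f_col: Sequence[float]) -> float:
    """Pr(o_j(k) = fail | x(k)) as the complement of the pass probability."""
    return 1.0 - pass_probability(x, p_d_col, p_f_col)


# Hypothetical values for one test and three components: the first component
# is faulty and is detected by this test with probability 0.9; the second is
# healthy but raises a false alarm with probability 0.05.
x_k = [1, 0, 0]
p_pass = pass_probability(x_k, [0.9, 0.8, 0.0], [0.0, 0.05, 0.0])
p_fail = fail_probability(x_k, [0.9, 0.8, 0.0], [0.0, 0.05, 0.0])
```

Components that a test does not depend on simply carry zero detection and false alarm probabilities, so they contribute a factor of one.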

Similarly, under the assumption that the faults evolve independently of one another,

Pr (\hat{x} (k) ∣ \hat{x} (k - 1)) = \prod_{i = 1}^{m} Pr (x_{i} (k) ∣ x_{i} (k - 1)),

(13)

where

Pr (x_{i} (k) ∣ x_{i} (k - 1)) = \begin{cases} 1 - P_{a_{i}} (k), & x_{i} (k - 1) = 0, x_{i} (k) = 0 \\ P_{a_{i}} (k), & x_{i} (k - 1) = 0, x_{i} (k) = 1 \\ P_{v_{i}} (k), & x_{i} (k - 1) = 1, x_{i} (k) = 0 \\ 1 - P_{v_{i}} (k), & x_{i} (k - 1) = 1, x_{i} (k) = 1 \end{cases}
= {(1 - P_{a_{i}} (k))}^{(1 - x_{i} (k - 1)) (1 - x_{i} (k))} \cdot {P_{a_{i}} (k)}^{(1 - x_{i} (k - 1)) x_{i} (k)} \cdot {P_{v_{i}} (k)}^{x_{i} (k - 1) (1 - x_{i} (k))} \cdot {(1 - P_{v_{i}} (k))}^{x_{i} (k - 1) x_{i} (k)}; \quad x_{i} (k - 1), x_{i} (k) \in \{0, 1\} .

(14)

Therefore, taking the logarithm, the objective function of Formula (7) is equivalent to

{\hat{X}}^{K} = \arg\max_{X^{K}} \sum_{k = 1}^{K} f_{k} (\hat{x} (k), \hat{x} (k - 1)),

(15)

where

f_{k} (\hat{x} (k), \hat{x} (k - 1), y (k)) = \sum_{o_{j} \in O_{p} (k)} \sum_{i = 1}^{m} c_{i j} x_{i} (k) + \sum_{i = 1}^{m} μ_{i} (k) x_{i} (k) + \sum_{o_{j} \in O_{f} (k)} \ln (1 - y_{j} (k)) + \sum_{i = 1}^{m} σ_{i} (k) x_{i} (k - 1) + \sum_{i = 1}^{m} h_{i} (k) x_{i} (k) x_{i} (k - 1) + γ (k) + g (k),
c_{i j} = \ln (\frac{1 - P_{d_{i j}}}{1 - P_{f_{i j}}}),
y_{j} (k) = \prod_{i = 1}^{m} {(1 - P_{d_{i j}})}^{x_{i} (k)} {(1 - P_{f_{i j}})}^{1 - x_{i} (k)},
η_{j} = \sum_{i = 1}^{m} \ln (1 - P_{f_{i j}}),
γ (k) = \sum_{o_{j} \in O_{p} (k)} η_{j},
μ_{i} (k) = \ln (\frac{P_{a_{i}} (k)}{1 - P_{a_{i}} (k)}),
σ_{i} (k) = \ln (\frac{P_{v_{i}} (k)}{1 - P_{a_{i}} (k)}),
h_{i} (k) = \ln (\frac{(1 - P_{v_{i}} (k)) (1 - P_{a_{i}} (k))}{P_{a_{i}} (k) P_{v_{i}} (k)}),
g (k) = \sum_{i = 1}^{m} \ln (1 - P_{a_{i}} (k)),
\hat{x} (k), \hat{x} (k - 1) \in \{0, 1\}^{m} .

(16)
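The coefficients μ, σ, h, and g in Equation (16) come from expanding the logarithm of the transition probability in Equation (14). As a sanity check, the short Python sketch below compares the direct logarithm with the closed form for a single fault and all four state pairs. The function names and the probability values are hypothetical, and the closed form is only defined when both probabilities lie strictly between 0 and 1.

```python
import math
from itertools import product


def log_transition_direct(x_prev: int, x_now: int,
                          p_a: float, p_v: float) -> float:
    """ln Pr(x_i(k) | x_i(k-1)) read off the four cases of Eq. (14)."""
    table = {(0, 0): 1.0 - p_a, (0, 1): p_a,
             (1, 0): p_v, (1, 1): 1.0 - p_v}
    return math.log(table[(x_prev, x_now)])


def log_transition_closed_form(x_prev: int, x_now: int,
                               p_a: float, p_v: float) -> float:
    """Same quantity via mu*x(k) + sigma*x(k-1) + h*x(k)*x(k-1) + g, one fault."""
    mu = math.log(p_a / (1.0 - p_a))
    sigma = math.log(p_v / (1.0 - p_a))
    h = math.log((1.0 - p_v) * (1.0 - p_a) / (p_a * p_v))
    g = math.log(1.0 - p_a)
    return mu * x_now + sigma * x_prev + h * x_now * x_prev + g


# Hypothetical probabilities; compare both forms on every state pair.
P_A, P_V = 0.05, 0.2
max_gap = max(
    abs(log_transition_direct(a, b, P_A, P_V)
        - log_transition_closed_form(a, b, P_A, P_V))
    for a, b in product((0, 1), repeat=2)
)
```

If a fault never disappears on its own (P_{v_{i}} = 0), σ and h are undefined, so an implementation would need to work with the case form of Equation (14) instead.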

The EM algorithm estimates the model parameters θ by maximizing the log-likelihood function of the observed sequence O^{K}:

θ^{*} = \arg\max_{θ} \log Pr (O^{K} ∣ θ) .

(17)

The log-likelihood function involves a sum over the hidden state sequences,

\log Pr (O^{K} ∣ θ) = \log \sum_{X^{K}} Pr (O^{K}, X^{K} ∣ θ) .

Because the logarithm of a sum over hidden variables is difficult to optimize directly, the E-step computes the expectation of the complete-data log-likelihood under the posterior distribution given the current parameters,

Q (θ ∣ θ^{(t)}) = E_{X^{K} \sim Pr (X^{K} ∣ O^{K}, θ^{(t)})} [\log Pr (O^{K}, X^{K} ∣ θ)] .

The joint probability inside the sum factorizes as

Pr (O^{K}, X^{K} ∣ θ) = Pr (X^{K} ∣ θ) Pr (O^{K} ∣ X^{K}, θ),

(18)

and the posterior follows from Bayes' rule:

Pr (X^{K} ∣ O^{K}, \hat{x} (0), θ) = \frac{Pr (O^{K}, X^{K} ∣ \hat{x} (0), θ)}{Pr (O^{K} ∣ \hat{x} (0), θ)} .

(19)

Thus, Q (θ ∣ θ^{(t)}) becomes

Q (θ ∣ θ^{(t)}) = \sum_{X^{K}} Pr (X^{K} ∣ O^{K}, θ^{(t)}) \log Pr (O^{K}, X^{K} ∣ θ) .

(20)

The M-step updates the parameters by maximizing Q (θ ∣ θ^{(t)}):

θ^{(t + 1)} = \arg\max_{θ} \sum_{X^{K}} Pr (X^{K} ∣ O^{K}, \hat{x} (0), θ^{(t)}) [\log Pr (X^{K} ∣ \hat{x} (0), θ) + \log Pr (O^{K} ∣ X^{K}, θ)] .

(21)

Convergence is checked with a threshold on the change in the log-likelihood between the current iteration and the previous one:

| L (θ^{(t)}) - L (θ^{(t - 1)}) | < ε,

(22)

where L (θ^{(t)}) represents the log-likelihood value in the t-th iteration, and ε is a small positive number.
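The stopping rule of Equation (22) can be wrapped around any combined E/M update. The Python sketch below shows only that control loop. `run_em` and its `em_step` callback are assumed names, and `toy_step` is a stand-in update with a known fixed point, not the FHMM update itself.

```python
from typing import Callable, Tuple, TypeVar

Theta = TypeVar("Theta")


def run_em(theta0: Theta,
           em_step: Callable[[Theta], Tuple[Theta, float]],
           eps: float = 1e-6,
           max_iter: int = 200) -> Tuple[Theta, float, int]:
    """Iterate until |L(t) - L(t-1)| < eps or max_iter is reached.

    em_step(theta) must perform one E-step and one M-step and return the
    updated parameters together with their log-likelihood.
    """
    theta, prev_ll, ll = theta0, None, float("-inf")
    for iteration in range(1, max_iter + 1):
        theta, ll = em_step(theta)
        if prev_ll is not None and abs(ll - prev_ll) < eps:
            return theta, ll, iteration
        prev_ll = ll
    return theta, ll, max_iter


def toy_step(theta: float) -> Tuple[float, float]:
    """Hypothetical stand-in for an EM update: moves theta halfway to 1.0."""
    new_theta = 0.5 * (theta + 1.0)
    return new_theta, -(new_theta - 1.0) ** 2


theta_hat, final_ll, n_iter = run_em(0.0, toy_step, eps=1e-6)
```

A cap on the number of iterations is included because the threshold alone does not guarantee termination when the log-likelihood plateaus slowly.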

For fault sequences, the inference formula can be expressed as

\{x_{i} (k)\}_{k = 1 : K} = \arg\max_{x_{i} (k)} \sum_{k = 1}^{K} ζ_{i} (x_{i} (k), x_{i} (k - 1)),

(23)

where

ζ_{i} (x_{i} (k), x_{i} (k - 1)) = \sum_{o_{j} \in O_{p} (k)} \sum_{i = 1}^{m} c_{i j} x_{i} (k) + \sum_{i = 1}^{m} μ_{i} (k) x_{i} (k) + \sum_{o_{j} \in O_{f} (k)} \ln (1 - \exp [\sum_{i = 1}^{m} c_{i j} x_{i} (k) + η_{j}]) + \sum_{i = 1}^{m} σ_{i} (k) x_{i} (k - 1) + \sum_{i = 1}^{m} h_{i} (k) x_{i} (k) x_{i} (k - 1) + γ (k) + g (k) .

(24)

Next, we use the Viterbi algorithm to find the optimal x_{i} (k), where each path corresponds to a state sequence.

Initialization step: Assume that the initial state \hat{x} (0) is known for all fault states. Let δ_{k} (x_{i} (k)) denote the maximum accumulated value of the function ζ_{i} (x_{i} (k), x_{i} (k - 1)) over all paths that end in state x_{i} (k) at epoch k, and let ψ_{k} (x_{i} (k)) record the predecessor state x_{i} (k - 1) that attains this maximum.

When k = 1,

δ_{1} (x_{i} (1)) = ζ_{i} (x_{i} (1), x_{i} (0)) = \sum_{o_{j} \in O_{p} (1)} \sum_{i = 1}^{m} c_{i j} x_{i} (1) + \sum_{o_{j} \in O_{f} (1)} \ln (1 - \exp [\sum_{i = 1}^{m} c_{i j} x_{i} (1) + η_{j}]) + \sum_{i = 1}^{m} σ_{i} (1) x_{i} (0) + \sum_{i = 1}^{m} μ_{i} (1) x_{i} (1) + \sum_{i = 1}^{m} h_{i} (1) x_{i} (1) x_{i} (0) + γ (1) + g (1),

(25)

where x_{i} (0) \in \{0, 1\} and x_{l} (1) \in [0, 1] for all l \neq i.

Recursive step: This step maximizes the target function at each epoch k.

δ_{k} (x_{i} (k)) = \sum_{o_{j} \in O_{p} (k)} \sum_{i = 1}^{m} c_{i j} x_{i} (k) + \sum_{o_{j} \in O_{f} (k)} \ln (1 - \exp [\sum_{i = 1}^{m} c_{i j} x_{i} (k) + η_{j}]) + γ (k) + \max_{x_{i} (k - 1) \in \{0, 1\}} [δ_{k - 1} (x_{i} (k - 1)) + g (k) + \sum_{i = 1}^{m} σ_{i} (k) x_{i} (k - 1) + \sum_{i = 1}^{m} μ_{i} (k) x_{i} (k) + \sum_{i = 1}^{m} h_{i} (k) x_{i} (k) x_{i} (k - 1)],

(26)

where 2 \leq k \leq K; x_{i} (k) \in \{0, 1\}; x_{l} (k) \in [0, 1], \forall l \neq i, and

ψ_{k} (x_{i} (k)) = \arg\max_{x_{i} (k - 1) \in \{0, 1\}} [δ_{k - 1} (x_{i} (k - 1)) + g (k) + \sum_{i = 1}^{m} σ_{i} (k) x_{i} (k - 1) + \sum_{i = 1}^{m} μ_{i} (k) x_{i} (k) + \sum_{i = 1}^{m} h_{i} (k) x_{i} (k) x_{i} (k - 1)] .

(27)

Termination step: This step evaluates the objective function at the final epoch k = K.

F^{*} = \max_{x_{i} (K) \in \{0, 1\}} [δ_{K} (x_{i} (K))],
x_{i} {(K)}^{*} = \arg\max_{x_{i} (K) \in \{0, 1\}} [δ_{K} (x_{i} (K))] .

(28)

Optimal state sequence backtracking: The backtracking step recovers the optimal state sequence by following the stored predecessor states backwards from epoch K. The optimal state x_{i} {(k)}^{*} of the i-th fault at epoch k is derived from the following formula:

x_{i} {(k)}^{*} = ψ_{k + 1} (x_{i} {(k + 1)}^{*}), \quad k = K - 1, \dots, 1 .

(29)
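The initialization, recursion, termination, and backtracking steps can be illustrated with a compact Python sketch. To stay short it departs from the text in two labeled ways: it enumerates the joint fault state in {0, 1}^m directly instead of decomposing per fault, which is only practical for small m, and it assumes appearance and disappearance probabilities that are constant over epochs. All function names and numeric values are hypothetical.

```python
import math
from itertools import product

NEG_INF = float("-inf")


def safe_log(p):
    """Natural log that maps probability 0 to -inf instead of raising."""
    return math.log(p) if p > 0.0 else NEG_INF


def log_emission(x, obs, p_d, p_f):
    """ln Pr(O_p(k) | x) + ln Pr(O_f(k) | x); outcomes coded 2 are skipped."""
    total = 0.0
    for j, o in enumerate(obs):
        if o == 2:  # missing test result
            continue
        p_pass = 1.0
        for i, x_i in enumerate(x):
            p_pass *= (1.0 - p_d[i][j]) if x_i else (1.0 - p_f[i][j])
        total += safe_log(p_pass if o == 0 else 1.0 - p_pass)
    return total


def log_transition(x_prev, x, p_a, p_v):
    """ln Pr(x(k) | x(k-1)) with independently evolving faults."""
    total = 0.0
    for i, (a, b) in enumerate(zip(x_prev, x)):
        if a == 0:
            total += safe_log(p_a[i] if b else 1.0 - p_a[i])
        else:
            total += safe_log(1.0 - p_v[i] if b else p_v[i])
    return total


def viterbi(obs_seq, x0, p_d, p_f, p_a, p_v):
    """Return the most likely joint fault-state path and its log score."""
    states = list(product((0, 1), repeat=len(x0)))
    # Initialization: score every state at the first epoch from the known x(0).
    delta = {s: log_transition(tuple(x0), s, p_a, p_v)
                + log_emission(s, obs_seq[0], p_d, p_f) for s in states}
    back = []
    # Recursion: keep the best predecessor (psi) of every state.
    for obs in obs_seq[1:]:
        new_delta, psi = {}, {}
        for s in states:
            best = max(states,
                       key=lambda r: delta[r] + log_transition(r, s, p_a, p_v))
            psi[s] = best
            new_delta[s] = (delta[best] + log_transition(best, s, p_a, p_v)
                            + log_emission(s, obs, p_d, p_f))
        back.append(psi)
        delta = new_delta
    # Termination and backtracking.
    last = max(states, key=lambda s: delta[s])
    path = [last]
    for psi in reversed(back):
        path.append(psi[path[-1]])
    path.reverse()
    return path, delta[last]


# Hypothetical two-fault, two-test example: test j mainly detects fault j.
P_D = [[0.9, 0.0], [0.0, 0.9]]
P_F = [[0.02, 0.0], [0.0, 0.02]]
path, score = viterbi([[0, 0], [1, 0], [1, 0]], (0, 0),
                      P_D, P_F, p_a=[0.1, 0.1], p_v=[0.05, 0.05])
```

In this example the first test starts failing at the second epoch and keeps failing, so the decoded path switches the first fault on at that epoch and keeps it on.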

Assumption 1. Within the system, only one fault occurs at a time.

Assumption 2. When a component malfunctions, the entire system is regarded as being faulty.

Assumption 3. The faulty state persists until it is repaired manually.

Remark 1. Assumption 1, which limits the system to a single fault at a time, reduces the complexity of the problem. Building on Assumption 1, Assumption 2 implies that no further faults occur while the system is in the faulty state. As stated in Assumption 3, this state persists until it is cleared manually by the staff, which ensures that the LST assembly line then resumes normal operation.

5. Experiment

Based on data from the LST assembly line of a rubber and plastic enterprise, multi-coupling faults in the production process were analyzed. As an essential liquid storage component in automotive braking systems, the LST has a complex production process involving many key steps. Due to equipment aging, process errors, or improper operations, various types of faults may occur during production, and strong coupling exists among these faults, posing significant challenges for fault diagnosis. Figure 5 shows the production steps of the LST assembly line and their inspection results under the DMFD framework. The process begins with three main component-related steps: S1 for upper and lower body welding, S2 for check valve installation, and S3 for sensor installation. S1 is associated with air-tightness testing (result o_{1} (k) under T3) and with mechanical and valve testing (result o_{2} (k) under T2/T4). S2 is related to the same two groups of tests: the air-tightness testing under T3 and the mechanical and valve testing under T2/T4. Meanwhile, S3 is related only to the pull test, with the result o_{3} (k) under T1. The figure thus presents the production and inspection flow and the relationships between the steps and test outcomes in the LST assembly line within the DMFD framework.
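The dependencies just described can be written down as a binary D-matrix. The Python sketch below builds it from the stated fault-to-test relations and groups faults that share the same test signature. The helper names are assumptions, and the matrix encodes only the qualitative dependencies in the text, not the learned detection or false alarm probabilities.

```python
from typing import Dict, List, Set, Tuple

FAULTS = ["S1", "S2", "S3"]
TESTS = ["T1", "T2", "T3", "T4"]

# Fault-to-test dependencies as described above: S1 and S2 both affect the
# air-tightness test (T3) and the mechanical and valve tests (T2/T4), while
# S3 affects only the pull test (T1).
DEPENDS: Dict[str, Set[str]] = {
    "S1": {"T2", "T3", "T4"},
    "S2": {"T2", "T3", "T4"},
    "S3": {"T1"},
}


def build_d_matrix(faults: List[str], tests: List[str],
                   depends: Dict[str, Set[str]]) -> List[List[int]]:
    """Return D with D[i][j] = 1 if test j depends on fault i, else 0."""
    return [[1 if t in depends[f] else 0 for t in tests] for f in faults]


def ambiguity_groups(faults: List[str],
                     d_matrix: List[List[int]]) -> List[List[str]]:
    """Group faults whose rows in D are identical (same test signature)."""
    groups: Dict[Tuple[int, ...], List[str]] = {}
    for fault, row in zip(faults, d_matrix):
        groups.setdefault(tuple(row), []).append(fault)
    return list(groups.values())


D = build_d_matrix(FAULTS, TESTS, DEPENDS)
groups = ambiguity_groups(FAULTS, D)
```

Here S1 and S2 share the same row, so a single snapshot of pass/fail outcomes cannot separate them. This is the non-unique correspondence between faults and inspections that motivates using detection probabilities and the temporal model.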

As shown in Table 2, the LST assembly line primarily consists of the following key processes:

Welding misalignment (S1): This process is used to weld and secure the upper and lower parts of the LST. It is a fundamental step in the production process, but defects during welding may lead to tank leakage or breakage during pressure testing or actual use.
Missing installation of the valve body (S2): The check valve ensures the unidirectional flow of liquid within the tank. Deviations in its installation location, insecure installation, or inherent defects in the valve itself may prevent the liquid from flowing in one direction or cause leakage, resulting in failure during the production process.
Sensor offset (S3): The LST sensor monitors the operational state of the tank. If the sensor is improperly installed or experiences signal transmission issues, the monitoring data may become inaccurate, and it may fail the pull-out test, leading to suboptimal performance of the tank.

Table 2. List of failures.

Fault	Fault Number
Welding misalignment	S1
Missing installation of the valve body	S2
Sensor offset	S3

As shown in Table 3, the assembly line also includes several critical testing procedures to evaluate the quality and reliability of key processes:

Drawing Test (T1): This test assesses the stability of the sensor by applying a drawing (pull-out) force. If the sensor is improperly installed, excessive displacement may occur, affecting the tank’s stability and performance.
Performance Testing (T2): This test evaluates the mechanical properties of the LST, particularly the strength of the welded structure and the integrity of the check valve installation. Defects in either may cause failure during this test.
Air-Tight Test (T3): This procedure checks the overall sealing performance of the LST by pressurizing it to ensure that the tank does not leak under high or negative pressure conditions. Defects such as holes in welds, cracks, or voids in the check valve may result in test failure.
Check Valve Air-Tightness Test with Shell (T4): This test verifies the air-tightness of the check valve together with its shell. It ensures the valve’s unidirectional flow function and sealing performance after installation. A poorly installed valve or a valve with manufacturing defects may produce substandard results in this test.

Table 3. Test list.

Test	Test Number
Pull results	T1
Mechanical performance test results	T2
Air-tightness test results	T3
Check valve plus shell air-tightness test	T4

These tests and processes provide valuable insights into the production quality of LSTs, enabling identification and diagnosis of faults during production.

Experimental Procedure:

(1): Data Pre-processing: Data pre-processing is performed based on prior knowledge by collecting data from various processes and tests of the LST assembly line. The result states of the processes are categorized. The related data are further divided into a training set and a test set, which will be used for subsequent model training and testing.
(2): Model Training: The FHMM is constructed, and the data from the pre-processed training set are fed into the model. The probability distribution learned by the hidden state chain after model training is used to analyze the coupling relationships between the process and the tests on the test set data.
(3): Result Analysis: The correct isolation rate (CI) and false isolation rate (FI) are calculated. Additionally, the detection probability/false alarm probability matrix is described.

The following formulas are used to compute the rates:

\begin{matrix} \overline{CI} & = \frac{\sum_{k = 1}^{K} \frac{| \hat{x} (k) \cap r (k) |}{| r (k) |}}{K}, \end{matrix}

(30)

\begin{matrix} \overline{FI} & = \frac{\sum_{k = 1}^{K} \frac{| \hat{x} (k) \cap \neg r (k) |}{| S | - | r (k) |}}{K} . \end{matrix}

(31)
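As a minimal sketch of Equations (30) and (31), the two rates can be computed from per-epoch fault sets. It assumes that $\hat{x}(k)$ is the set of faults diagnosed at epoch $k$, that $r(k)$ is the set of faults actually present, and that $S$ is the full fault set. The names `isolation_rates`, `estimated`, and `actual` are illustrative and do not come from the paper. Epochs whose denominator is zero contribute 0 to the corresponding sum, which is a convention chosen here rather than one stated in the text.

```python
from typing import Sequence, Set, Tuple


def isolation_rates(
    estimated: Sequence[Set[str]],
    actual: Sequence[Set[str]],
    all_faults: Set[str],
) -> Tuple[float, float]:
    """Return (CI, FI) averaged over K epochs, following Eqs. (30)-(31)."""
    if len(estimated) != len(actual) or not estimated:
        raise ValueError("need two non-empty sequences of equal length")
    K = len(estimated)
    ci_sum = 0.0
    fi_sum = 0.0
    for x_hat, r in zip(estimated, actual):
        # |x_hat & r| / |r|: share of the true faults that were isolated.
        if r:
            ci_sum += len(x_hat & r) / len(r)
        # |x_hat & not r| / (|S| - |r|): share of healthy states wrongly flagged.
        healthy = all_faults - r
        if healthy:
            fi_sum += len(x_hat & healthy) / len(healthy)
    return ci_sum / K, fi_sum / K


# Hypothetical three-epoch example (not data from the paper).
faults = {"S1", "S2", "S3"}
estimated = [{"S1"}, {"S2", "S3"}, {"S3"}]
actual = [{"S1"}, {"S2"}, {"S1", "S3"}]
ci, fi = isolation_rates(estimated, actual, faults)
print(f"CI = {ci:.4f}, FI = {fi:.4f}")
```

In this toy input the second epoch flags S3 although only S2 is present, which raises FI, and the third epoch misses S1, which lowers CI.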

5.1. Analysis of the Results

Table 4 gives the dependency matrix of the model, which records which tests are related to which fault states: an entry of 1 means that the fault state in that row can affect the outcome of the test in that column, and an entry of 0 means that it cannot. The upper and lower body welding, check valve installation, and sensor installation are denoted as S1, S2, and S3, respectively. The drawing test, performance test, airtight test, and check valve shell airtight test are represented as T1, T2, T3, and T4, respectively.

As shown in Table 5, the detection probability is the probability that a test (T1, T2, T3, T4) correctly identifies a fault state (S1, S2, S3) when it actually occurs. The false positive probability is the probability that a test incorrectly indicates a fault state (S1, S2, S3) when that state has not occurred.
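To make Tables 4 and 5 concrete, the sketch below encodes the dependency matrix and computes, for a given set of active faults, the probability that each test fails. It assumes a noisy-OR observation model in which a test passes only if no dependent active fault is detected and no dependent inactive fault raises a false alarm. That model, and the names `fail_probability` and `signature`, are assumptions made for illustration rather than details confirmed by the text.

```python
# Dependency matrix from Table 4: rows are faults S1-S3, columns are tests T1-T4.
FAULTS = ["S1", "S2", "S3"]
TESTS = ["T1", "T2", "T3", "T4"]
D = [
    [0, 1, 1, 0],  # S1
    [0, 1, 1, 1],  # S2
    [1, 0, 0, 0],  # S3
]
# Table 5 lists detection probability 1 and false positive probability 0
# for every dependent (fault, test) pair.
P_DETECT = 1.0
P_FALSE = 0.0


def fail_probability(active, test):
    """Probability that `test` fails given the set of active faults.

    Assumed noisy-OR model: the test passes only if every dependent active
    fault goes undetected and no dependent inactive fault raises a false alarm.
    """
    j = TESTS.index(test)
    p_pass = 1.0
    for i, fault in enumerate(FAULTS):
        if D[i][j] == 0:
            continue  # this test does not observe this fault
        p_pass *= (1.0 - P_DETECT) if fault in active else (1.0 - P_FALSE)
    return 1.0 - p_pass


def signature(active):
    """Failure probability of every test for one set of active faults."""
    return {t: fail_probability(active, t) for t in TESTS}


for fault in FAULTS:
    print(fault, signature({fault}))
```

Under these assumptions S1 and S2 produce the same outcomes on T2 and T3 and differ only on T4, which illustrates why a single inspection result is not enough to separate the two faults.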

Table 6 presents the performance of each model on the different fault states, reporting the correct isolation rate and the false isolation rate, each as a 95% confidence interval. It compares the FHMM with a Decision Tree, a Fully Convolutional Neural Network (FCNN), and a Support Vector Machine (SVM) under the same evaluation metrics. The FHMM achieves a correct isolation rate of 1.0 and a false isolation rate of 0 for all fault states, indicating superior diagnostic capability.

For the Decision Tree, the 95% confidence intervals of the correct isolation rate for S1, S2, and S3 are (0.6829, 0.9024), (0.8293, 1), and (0.8165, 0.9756), and the false isolation rates for S1 and S2 are comparatively high, at (0.0726, 0.3058) and (0.0304, 0.1918). These results are inferior to those of the FHMM, likely because Decision Trees do not account for feature interdependencies. The FCNN also shows lower correct isolation rates, such as (0.7805, 0.9756) for S1, possibly due to insufficient training data or inadequate feature extraction. The SVM shows similar limitations, especially for S1 and S3, where the correct isolation rates are (0.6829, 0.9152) and (0.8049, 1); its false isolation rates for S1 and S2 are also poor, indicating limited generalization ability in the high-dimensional feature space.
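The text does not state how the 95% intervals in Table 6 were obtained. One common way to attach an interval to a rate measured on a small test set is a percentile bootstrap, sketched below under that assumption. The function name `bootstrap_ci`, the resample count, and the example outcomes are all hypothetical.

```python
import random


def bootstrap_ci(outcomes, n_resamples=2000, level=0.95, seed=0):
    """Percentile bootstrap interval for the mean of 0/1 outcomes.

    `outcomes` holds one entry per test sample: 1 if the fault was
    isolated correctly, 0 otherwise.
    """
    if not outcomes:
        raise ValueError("outcomes must be non-empty")
    rng = random.Random(seed)
    n = len(outcomes)
    # Resample with replacement and record the mean of each resample.
    means = sorted(
        sum(rng.choices(outcomes, k=n)) / n for _ in range(n_resamples)
    )
    alpha = (1.0 - level) / 2.0
    lower = means[int(alpha * (n_resamples - 1))]
    upper = means[int((1.0 - alpha) * (n_resamples - 1))]
    return lower, upper


# Hypothetical outcomes: 36 correct isolations out of 40 test samples.
outcomes = [1] * 36 + [0] * 4
lower, upper = bootstrap_ci(outcomes)
print(f"95% interval: ({lower:.4f}, {upper:.4f})")
```

With this procedure, a method that is correct on every test sample yields a degenerate interval, because every resample has mean 1.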

The computational complexity of FHMM is $O(NMK^{2})$, compared to $O(ND\log N)$ for Decision Trees, $O(NLd^{2})$ for FCNN, and $O(N^{3})$ for SVM. When comparing these complexities based on the highest-order terms, FHMM’s complexity is dominated by $NMK^{2}$, which is higher than the $ND\log N$ of Decision Trees but significantly lower than the $N^{3}$ of SVM. Therefore, FHMM strikes a balance between computational cost and model complexity.

These comparisons emphasize FHMM’s robustness and reliability in fault diagnosis, outperforming traditional machine learning models in accuracy and fault isolation precision.

5.2. Sensitivity Experiments

The sensitivity experiments were conducted to evaluate the impact of different initialization strategies on system performance. Specifically, variations in the initialization parameters, including the transition matrix and initial hidden state distribution, were explored to assess their influence on diagnostic results. To ensure a fair comparison, all other experimental settings were kept consistent with previous experiments. The parameters were initialized using two different distributions: the uniform distribution and the Dirichlet distribution.

As shown in Table 7, the results indicate that stable diagnostic accuracy was maintained under both the CI and FI metrics, regardless of the initialization strategy. This demonstrates the model’s robustness to changes in initialization conditions.
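The two initialization strategies compared above can be sketched as follows. The example assumes that each fault is modeled by its own binary hidden chain (0 = normal, 1 = faulty) and uses a symmetric Dirichlet distribution with concentration 1. Both choices, and the helper names `make_row` and `init_chain`, are illustrative assumptions rather than the authors' settings.

```python
import random


def make_row(n, strategy, rng):
    """Return one probability vector of length n."""
    if strategy == "uniform":
        # Uniform initialization: every state is equally likely.
        return [1.0 / n] * n
    if strategy == "dirichlet":
        # A symmetric Dirichlet(1, ..., 1) draw is obtained by normalizing
        # independent Gamma(1, 1) samples.
        draws = [rng.gammavariate(1.0, 1.0) for _ in range(n)]
        total = sum(draws)
        return [d / total for d in draws]
    raise ValueError(f"unknown strategy: {strategy}")


def init_chain(n_states, strategy, rng):
    """Return (initial distribution, transition matrix) for one hidden chain."""
    pi = make_row(n_states, strategy, rng)
    A = [make_row(n_states, strategy, rng) for _ in range(n_states)]
    return pi, A


rng = random.Random(42)
for strategy in ("uniform", "dirichlet"):
    pi, A = init_chain(2, strategy, rng)
    print(strategy, pi, A)
```

Every row is a valid probability vector under either strategy, so the same training routine can be started from both and the resulting CI and FI values compared.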

On this basis, we note that samples may be mislabeled on a real assembly line; here, mislabeling refers to a fault cause being incorrectly attributed to a product. To simulate this, we added a small number of negative samples to the training dataset. Evaluating on the same test set, we obtained the model performance metrics shown in Table 8.
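The text does not specify how the mislabeled samples were generated. One simple way to emulate mislabeling, shown here purely as an assumption, is to replace the recorded root cause of a small fraction of training records with a different fault. The record layout, the 10% fraction, and the function name `inject_label_noise` are hypothetical.

```python
import random

FAULTS = ("S1", "S2", "S3")


def inject_label_noise(samples, fraction, seed=0):
    """Return a copy of `samples` in which the given fraction of records
    has its root-cause label replaced by a different, wrong label."""
    if not 0.0 <= fraction <= 1.0:
        raise ValueError("fraction must lie in [0, 1]")
    rng = random.Random(seed)
    noisy = [dict(s) for s in samples]  # shallow copies keep the input intact
    n_flip = round(fraction * len(noisy))
    for idx in rng.sample(range(len(noisy)), n_flip):
        wrong = [f for f in FAULTS if f != noisy[idx]["cause"]]
        noisy[idx]["cause"] = rng.choice(wrong)
    return noisy


# Hypothetical training records: test outcomes (1 = fail) and recorded cause.
train = [{"tests": (0, 1, 1, 0), "cause": "S1"} for _ in range(20)]
noisy_train = inject_label_noise(train, fraction=0.1)
changed = sum(a["cause"] != b["cause"] for a, b in zip(train, noisy_train))
print(f"{changed} of {len(train)} labels changed")
```

Training on `noisy_train` and evaluating on an untouched test set would then mirror the robustness check summarized in Table 8.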

6. Discussion

The proposed method, under the DMFD framework, offers unique advantages for fault tracing in the LST assembly line. This method provides an important reference for fault diagnosis in discrete manufacturing industries. The following discusses the potential application scenarios and implementation guidelines:

(1) Potential application scenarios. The LST assembly process involves several key steps, such as upper and lower body welding, valve installation, and sensor installation. Applying the DMFD method helps to identify faults at their root, enabling a quick response and reducing the risk of defective products entering the market. The main advantage of the proposed method lies in modeling the LST assembly line with the DMFD framework, which is particularly suitable for discrete industrial assembly lines. By establishing the relationship between production and testing processes and integrating the FHMM, the method is expected to effectively predict and diagnose potential issues during production.

(2) Guidelines for implementation. In practice, the effective implementation of a fault diagnosis system requires a comprehensive and robust data acquisition infrastructure. High-quality data must be collected at every stage of the production process, including sensor data for monitoring key parameters such as temperature, pressure, and pull-out displacement. However, the proposed method does not require perfect fault data, which greatly reduces the difficulty of obtaining data in real industrial scenarios. Different assembly lines often have different processes and testing procedures, so general fault diagnosis methods must be adapted to the specific needs of each assembly line during implementation. To achieve this cross-line adaptability, the proposed method must be flexible. For instance, by optimizing testing processes and modifying data analysis strategies, the components of the method can be adjusted to different production environments.

Overall, the proposed framework shows strong potential for the LST assembly line: by establishing the relationship between production and testing processes and integrating the FHMM, it is expected to effectively predict and diagnose potential issues in the production process. Although further experimental validation is required, this method could play an important role in improving production efficiency and product quality. This research direction has broad application prospects and warrants further exploration through carefully designed experiments to verify its feasibility and effectiveness in complex industrial environments.

7. Conclusions

In this paper, a root cause tracing method for defective products based on the DMFD framework is proposed. The framework uses the FHMM to model a real LST assembly production line. It takes into account the production and inspection processes of the automotive braking LST assembly line, as well as the dependency relationships between the actual states related to product quality and the inspection results. Experimental results show that the proposed algorithm can effectively locate and trace the root cause of defective products.

Future research will further extend this work. It will investigate the complex cascading effects between fault processes in the case of multiple coupled simultaneous faults. Additionally, non-homogeneous transition mechanisms will be explored to capture the dynamic evolution of failure rates. Moreover, the algorithm is expected to be extended for broader industrial use, supporting fault prediction, quality control, and process optimization. These efforts will provide more effective support for fault diagnosis and root cause analysis in complex systems.

Author Contributions

Methodology, Q.W.; Software, Y.Z.; Validation, Y.Z.; Investigation, D.L.; Resources, Y.T. and D.L.; Writing—original draft, H.X. and Y.Z.; Writing—review & editing, H.X. and Q.W.; Project administration, Y.T., K.W. and Q.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China under Grant No. 62203390.

Data Availability Statement

The datasets presented in this article are not available because they involve corporate secrets.

Conflicts of Interest

The authors You Teng and Kefu Wang were employed by the company Zhejiang Qiaoshi Intelligent Industry Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Nomenclature

$k$	Epoch
$x_{i} (k)$	State of the i-th component
${\hat{x}}_{i} (k)$	Estimated state of the i-th component
$o_{j} (k)$	Result of the j-th test
${\hat{X}}^{K}$	State set
$O^{K}$	Set of observation results
$O_{p} (k)$	The pass result sequence
$O_{f} (k)$	The fail result sequence
$D$	Diagnostic matrix
$P_{d_{i j}}$	Probability of correct detection
$P_{f_{i j}}$	Probability of false detection
$P_{a_{i}} (k)$	Fault occurrence probability
$P_{v_{i}} (k)$	Fault disappearance probability
$\theta^{(t)}$	Estimated model parameters


Figure 1. Schematic diagram of LST structure.

Figure 2. Schematic diagram of liquid storage tank assembly line.

Figure 3. Flow chart of liquid storage tank assembly.

Figure 4. DMFD problem viewed as an FHMM.

Figure 5. Production steps of the LST assembly line and their inspection results under the DMFD framework.

Table 1. LST components and their roles.

Components	Role
Tank caps	The tank caps in the tank cover balance the air pressure in the tank so that the liquid keeps flowing smoothly
Upper and lower tanks	The upper tank is usually used to store and supply brake liquid, while the lower tank is used to collect and discharge brake liquid. The two tanks are joined by a welding process to form the tank assembly
Floater	The floater is a liquid level monitoring element with a certain density and a built-in magnetic structure. It detects the height of the brake liquid level and, through its floating movement, converts the liquid level information into mechanical or electrical signals for monitoring and control of the system
Liquid level alarm	The liquid level alarm is a liquid level monitoring device that works together with the floater. It detects the height of the brake liquid level and sends a signal to the system

Table 4. Dependency matrix of the model.

	T1	T2	T3	T4
S1	0	1	1	0
S2	0	1	1	1
S3	1	0	0	0

Table 5. Reliability index (detection probability/false positive probability).

	T1	T2	T3	T4
S1	0	1/0	1/0	0
S2	0	1/0	1/0	1/0
S3	1/0	0	0	0

Table 6. Model performance indicators.

Method	Fault	Correct Isolation Rate with 95% Confidence Interval	False Isolation Rate with 95% Confidence Interval
FHMM	S1	(1, 1)	(0, 0)
	S2	(1, 1)	(0, 0)
	S3	(1, 1)	(0, 0)
Decision Tree	S1	(0.6829, 0.9024)	(0.0726, 0.3058)
	S2	(0.8293, 1)	(0.0304, 0.1918)
	S3	(0.8165, 0.9756)	(0, 0.0878)
FCNN	S1	(0.7805, 0.9756)	(0, 0)
	S2	(0.7073, 0.9512)	(0, 0)
	S3	(0.8896, 1)	(0, 0.0878)
SVM	S1	(0.6829, 0.9152)	(0.0726, 0.3058)
	S2	(0.8537, 1)	(0.0304, 0.1918)
	S3	(0.8049, 1)	(0, 0.0878)

Table 7. Sensitivity analysis results.

Fault	Correct Isolation Rate with 95% Confidence Interval	False Isolation Rate with 95% Confidence Interval
S1	1 ± 0.0	0 ± 0.0
S2	1 ± 0.0	0 ± 0.0
S3	1 ± 0.0	0 ± 0.0

Table 8. Model performance indicators.

Fault	Correct Isolation Rate with 95% Confidence Interval	False Isolation Rate with 95% Confidence Interval
S1	(0.8966, 0.9741)	(0.0227, 0.1153)
S2	(0.8231, 0.9224)	(0.0738, 0.1965)
S3	(0.9265, 0.9828)	(0.0139, 0.0881)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
