1. Introduction
Efficient fault detection and isolation systems for launchers are a major requirement. However, the development of such systems is an inherently complex task due to the unique conditions under which these vehicles operate. The environment includes extremely high and low temperatures, intense vibrations, and rapid changes in operating conditions, which makes the accurate measurement of those variables difficult. Unlike other applications, weight and space constraints limit the instrumentation used to determine the health state of the system. The development of Health Monitoring Systems (HMSs), which includes the Fault Detection and Isolation (FDI) task, has been a subject of research since the early 1970s [
1]. The first strategy consisted of examining if the measurement-critical parameters were within a safe limit. If the parameters were outside safe limits, the HMS would trigger alerts or automatic shutdown.
The importance of an FDI system was brought to another level with the development of new Reusable Launch Vehicles (RLVs) in the mid-2010s. In contrast to expendable launch vehicles, RLVs are designed to be launched multiple times. In order to ensure the vehicle’s reusability, a high-performance FDI system is necessary. Furthermore, sophisticated technologies are incorporated into RLVs to enable its recovery, which allows for the implementation of powerful FDI systems. RLVs have new requirements for thrust control compared to traditional vehicles, particularly during vertical landings, when only a portion of the thrust is needed [
2]. One possible solution is the use of a multi-engine cluster [
3]. In addition to the large range of thrust modulation, a multi-engine cluster also offers higher mission reliability when compared with a single engine [
3]. If a fault occurs, e.g., loss of thrust or extinction, in one engine, the remaining healthy engines can be used to attenuate the effect of this fault [
4]. In order to attenuate the effect of a fault, an FDI system is necessary.
There are many methods for developing an FDI system. In [
5], the methods were classified into three main categories: quantitative model-based, qualitative model-based, and data-based methods. Recent studies addressing the FDI problem in launchers have mostly relied on quantitative model-based and data-based methods. The quantitative methods are mainly linked to state observers [
6,
7,
8,
9], especially variations of Kalman filters [
10,
11,
12,
13,
14]. The main drawback of these methods is that the launcher’s model is very complex, with numerous nonlinear equations. Computing a state observer for the entire launcher ca,n thus, be very challenging. Concerning data-based methods, machine learning techniques [
15,
16,
17], particularly neural networks [
18,
19,
20,
21,
22,
23,
24,
25], have dominated recent contributions to FDI systems in the context of launchers. However, implementing these machine learning techniques requires a significant amount of training data, which are extremely difficult to obtain because launch vehicles are strategic assets for industries and nations and are not available during the early development phase. A significant gap is observed in the open literature: the qualitative model-based approach has rarely been used in the development of FDI systems for launch vehicles.
This paper addresses the development of a model-based FDI system for a reusable launcher using a qualitative model-based approach based on Structural Analysis (SA). Even with the recent development of SA for FDI systems [
26,
27,
28,
29], to the best of our knowledge, only one paper [
30] has employed SA for fault diagnosis in rocket engines. In [
30], a specific subsystem of the rocket engine was considered: the LOX turbopump. The paper addressed the problem of estimating the fault magnitude. The considered faults were parametric deviations in the efficiency of the turbopump. They affected the dynamic equation that describes the evolution of the pump shaft speed. The model has six equations, and SA was used to find independent subsystems. Fault diagnosis filters were calculated for each subsystem and provided an accurate estimation of the faults. However, the study considered a rocket’s engine subsystem composed of six equations. In contrast, a multi-engine cluster model can contain over 100 equations. The number of equations directly impacts the complexity of an FDI system when using structural analysis because, in large-scale systems, the number of subsystems derived from SA increases exponentially. Thus, it becomes necessary to establish selection criteria to determine which subsystems will be utilized.
A scheme summarizing the FDI system proposed here is illustrated in
Figure 1. It can be divided into five main steps:
- 1.
Analytical model development: The analytical model of the system is derived from differential and algebraic equations.
- 2.
Structural model acquisition: The structural model is obtained from the analytical model.
- 3.
Structural analysis: The structural model is used to compute residual generator candidates through SA.
- 4.
Residual selection: Subsets of residuals with minimal cardinality, capable of detecting and isolating all considered faults, are formed using a residual selection algorithm.
- 5.
Subset analysis: The subset that offers the best FDI performance is selected
Contributions concerning Steps 1, 2, and 3 were addressed in our previous works in a less complex scenario, where only a small range of faults was considered. In [
31], SA was applied in a multi-engine cluster to investigate the possibility of fault detection and isolation, considering different measurement scenarios.
The residual selection algorithm was first introduced in [
32]. The residual selection problem has been addressed by other authors [
33,
34]. However, in both cases, the isolation constraint was too restrictive, and the cardinality of the subsets was not minimal. The selection algorithm we propose relies on a weaker isolation constraint based on the fault signature. It returns subsets of residual generators with minimal cardinality capable of producing a unique fault signature.
To achieve efficient fault detection and isolation of faults, a possible approach is to search for the subset of residuals with the highest fault sensitivity. To quantify the sensitivity of a subset to faults, the Subset Sensitivity Index (SSI) is introduced. The subsets with the highest SSI values are considered the most effective for fault detection and isolation. The SSI is calculated based on the Residual Sensitivity Index (RSI), which measures the sensitivity of a residual to a specific fault. The RSI is calculated using the signal-to-noise ratio (SNR), and the residual signal in the absence of faults is used as the baseline for noise.
Fault isolation is performed by analyzing the fault signature. If a perfect match between the theoretical fault signature and the observed fault signature is not observed, a new method is proposed to isolate the most probable fault present in the system. One common method is to use the closest fault signature, as described in [
35] and applied in [
36] for fault isolation in wind turbines. The main drawback is that, depending on the observed fault signature, it may return a list of probable faults. The problem of finding the most probable fault was also addressed in [
37], where a ‘consistency index’ was used. The index is calculated using the probability distribution of the residual signal by quantifying how confident the signal is in its nominal (or fault-free) condition. However, it does not account for the different fault sensitivities of the residuals. A method that considers the numerical fault sensitivity was proposed in [
38], where the sensitivity of the residuals was calculated considering the partial derivative of the residual with respect to the fault. While powerful, this solution requires the analytical expression of the residuals. Inspired by the solution proposed in [
38], we also propose the use of the RSI, which was first calculated to select the most sensitive subsets, for fault isolation to identify the most probable fault present in the system.
The main contributions of this paper include a comprehensive FDI technique based on structural analysis, which spans from the qualitative analysis of the analytical model to fault isolation. The technique includes a new method to evaluate subsets of residuals and determines which subsets offer the highest FDI performance. In addition, a new fault isolation technique is proposed, based on the predefined sensitivity of the residuals to faults. Finally, these methods are applied to a complete model of a multi-engine cluster for the first time. The findings discussed here were originally developed as part of the author’s thesis manuscript [
39].
The remainder of this paper is organized as follows. In
Section 2, the multi-engine cluster model under study is presented. Structural analysis of the model is performed in
Section 3. The residual selection algorithm is recalled in
Section 4. The numerical residual sensitivity is calculated in
Section 5.
Section 6 presents the fault isolation methods. Finally, the proposed methods are applied to the multi-engine cluster model in
Section 7. Conclusions and perspectives are presented in
Section 8 and
Section 9, respectively.
3. Structural Analysis
In this section structural analysis is performed for the multi-engine cluster model. The analysis returns a large number of residuals that would be infeasible to implement. An algorithm to select the best residuals is presented in
Section 4; it performs a qualitative analysis of the residuals and returns subsets of residuals with minimal cardinality capable of isolating all faults. In order to select the subsets that use the most sensitive residuals, a method that performs a quantitative analysis of the residuals is presented in
Section 5.
Section 6 presents a fault isolation algorithm that considers residual sensitivity. Finally, the combination of these methods is applied in
Section 7 to a multi-engine cluster case study, where simulation results are presented.
To perform the structural analysis of the model presented in
Section 2, the structural model must be constructed. This model can be extracted from a system described by nonlinear differential–algebraic equations of the following form:
where
X is the vector of unknown variables in
and its derivatives (
),
Z is the vector of known variables in
,
F is the fault vector in
,
t is the time, and
V is the vector of
algebraic equations. One requirement to build the structural model is that each fault (
) must affect only one equation from
V. This requirement is easily fulfilled with intermediate variables if one fault simultaneously impacts more than one equation.
A large range of systems can be written in the form described in (
19). The multi-engine model can be written in this form by defining the known vector (
Z) as the concatenation of the input vector (
u) and output measurement vector (
y), which yields
. Only variables related to rocket engine 1 are considered for further analysis, since all results obtained for engine 1 are valid for the other two engines due to the symmetry of the engine cluster. The vector of unknown variables (
X), input variables (
u), output measurements (
y), and the fault vector (
F) are defined as follows:
Using the vectors defined in Equations (
20)–(
22), the equations presented in
Section 2 are used to build the structural model. The equations from the LOX tank presented in
Section 2.3 were not used in the structural analysis because they are affected by the faults from engines 2 and 3, which are not addressed here.
The structural model is a bipartite graph that abstracts the form of the equations and retains only information for which variables are present in the equation. Let
W represent all the variables from (
19), i.e.,
, and the structural model of (
19) is defined as follows [
45]:
Definition 1.
Structural model: The structural model of the system is a bipartite graph where is the set of edges defined as follows:
if the variable appears in equation .
3.1. Dulmage–Mendelsohn Decomposition
The structural model determines which faults can be detected and isolated using the measurement assumptions from
Section 2. To do so, the Dulmage–Mendelsohn (DM) decomposition [
46] of the incidence matrix is calculated using the Fault Diagnosis Toolbox [
28]. Decomposition rearranges the system’s equations (
V) into three parts: the underdetermined part (
), where the number of equations is smaller than the number of unknown variables; the exactly determined part (
), where the number of equations is equal to the number of unknown variables; and the overdetermined part (
), where the number of equations is larger than the number of unknown variables. Only the faults that affect the equations localized at the overdetermined part of the model can be detected, since it is the part of the system that can generate residuals.
DM decomposition is depicted in
Figure 3. The equations in the overdetermined part are confined within the large blue rectangle. All considered faults impact equations localized in the overdetermined part and can, thus, be detected. The structural isolability of the faults is defined by the gray rectangles called equivalent classes. A group of equations in the same equivalent class is necessary to build an overdetermined set, i.e., two faults affecting the same equivalent class cannot be structurally isolatable. From
Figure 3, the selected faults can be isolated, with each one affecting different equivalent classes.
3.2. Residual Generator Candidates
Structural analysis allows for the systematic computation of residual generator candidates. An ideal residual generator is defined as follows.
Definition 2 (Ideal residual generator)
. Consider a model (V) such as (19). A system (R) with input Z and output r is a residual generator for V and r is a residual if implies for all . In practice, residuals are not ideal due to measurement noise and model uncertainties, but they have a value close to zero. Each residual is affected by a predefined group of faults, which determines its sensitivity. The fault sensitivity of a residual generator () is defined below.
Definition 3 (Fault sensitivity)
. Let be a residual generator for the model (V) (19). Then, is sensitive to fault if implies . From a structural analysis perspective, Minimally Structurally Overdetermined (MSO) setsare residual generators. The computation of MSO sets is possible if the degree of redundancy of a system is larger than one. The degree of redundancy of the system (
19) is defined as follows:
where
and
are the numbers of equations and unknown variables in the overdetermined part of the system, respectively. The degree of redundancy is directly related to the number of measurements. Each added measurement increases the number of equations while the number of unknown variables remains constant, raising the degree of redundancy.
An efficient method to find MSO sets [
47] is implemented by the fault diagnosis toolbox. The formal definition of the MSO set obtained in [
48] is recalled below.
Definition 4.
A non-empty set of equations () is a Proper Structurally Overdetermined (PSO) set if , where is the set of equations of F in the overdetermined part.
Definition 5.
A PSO set is a minimally structurally overdetermined set if no proper subset is a PSO set.
According to Definition 5, every MSO set has one more equation than unknown variables. An MSO set is considered a candidate for a residual generator because it is a testable submodel that can be solved independently. In addition, it has one extra equation that can be used to generate a residual signal.
The multi-engine cluster has a degree of redundancy of 8, with 53 unknown variables and 61 equations in the overdetermined part of the system. The number of MSO sets increases exponentially with the degree of redundancy of the system. In total, 16441 MSO sets are computed for the engine cluster. To illustrate the form of an MSO, all equations used by MSO 172 (denoted
) are listed below.
The analytical expressions of Equation (
24) are given in Equations (
1)–(
3), (
6) and (
10). Analyzing the set of equations and known variables used by
, it is possible to notice how this group of equations can be solved independently. For instance, the LOX mass flow in the gas generator (
) depends on the pressure in the gas generator (
). Instead of using the expression of
defined in Equation (
8), the direct measurement of
(
) is employed. The same happens with the
combustion chamber mass flow (
), which is estimated directly from the measurement of
. In addition, using measurements to replace the analytical expression of unknown variables explains the fault sensitivity defined in Equation (
26). For example, using
to estimate
makes the
insensitive to fault
.
4. Residual Selection
A large number of residual generator candidates were found in the previous section using structural analysis. However, implementing all candidates is intractable and unnecessary, assuming that only a subset of 10 faults is considered. The high number of residual generator candidates leads to a residual selection problem that is not observed in FDI systems that are not based on structural methods. This problem is addressed in the following to select the most efficient residual generator candidates in terms of fault sensitivity and isolability so as to detect and isolate the predefined faults.
In order to formally build the residual selection problem, some basic model-based diagnosis notions [
33] related to fault detection and isolation are recalled. The outputs of a group of residual generators (
) generates a fault signature (
S) when a fault is injected according to the fault sensitivity of
R. Only the residuals that were sensitive to this fault deviated from zero.
Definition 6 (Fault signature). For a set of residual generators (R), the fault signature () of a fault (f) contains all the residuals () sensitive to f.
Fault isolation is achieved if the set of residual generators (R) generates a unique set of fault signatures for each fault:
Definition 7 (Fault signature isolability). A fault (f) is isolable using a set of residual generators (R) if its fault signature () is unique when compared to the other fault signatures.
The objective is to find the minimal subset of residuals capable of detecting and isolating the predefined group of faults. The group of residuals must generate a unique fault signature for each fault. Considering
as the group of all residual generator candidates, the residual selection problem can be formally defined as the following optimization problem:
where
is the cardinality of subset
R and
is the number of faults. The same optimization problem is defined was [
49]. However, the solution to this problem was not addressed, and the paper’s main goal was to find a subset of residuals that would provide the most unique fault signatures.
A method to solve the optimization problem (
28) was first proposed in our previous work [
32] but only applied to an illustrative scenario. The proposed residual selection algorithm is applied here to the full rocket engine case study, with ten faults to be detected and isolated.
Minimal Residual Selection Algorithm
The minimal number of residuals (
) needed to isolate
faults based on their fault signatures must agree with the following inequality:
The selection algorithm takes the group of all residual generator candidates (
) and computes all possible subsets of
elements. Then, the fault-signature isolability of each fault for each subset is tested. The subsets that generate unique fault signatures for each fault are the solution to the optimization problem (
28). The number of all possible subsets of
elements is defined as
where
is the number of residual generator candidates.
If the number of residual generator candidates () is too large, a combinatorial explosion results. To avoid this problem, pre-selection of the residual generator candidates is performed before computing and testing all possible subsets. First, the residuals are separated into two classes based on their fault sensitivity:
Detectability class: For each fault (), list every residual sensitive to this fault ();
Undetectability class: For each fault (), list every residual that is not sensitive to this fault ().
Subsequently, the most suitable residual generator of each class is selected. The selection criteria are based on the method of residual generation to be employed. Two model-based methods for residual generation are considered here: the sequential residual generation method [
50] and observer-based residual generation [
51]. The aim is to select the residual generators composed of equations that are suitable for the selected residual generation methods. In addition, the number of equations is taken into account, assuming that each equation has a level of uncertainty. The selection criteria are defined as follows:
- 1.
Choose the residual generators composed of Ordinary Differential Equations (ODEs) or the Differential Algebraic system of Equations (DAE) of index 1.
- 2.
Select the residual generators with minimal “state cardinality”, which means the residuals with a minimal number of differential equations, which are equivalent to the state dimension of the corresponding observer.
The first criterion is to select candidates that can be easily written as ODEs. DAE systems of index 1 can be transformed into ODEs by using the first-order derivative of their algebraic equations [
52]. The second criterion selects candidates with minimal cardinality to minimize the level of uncertainty brought about by the equations. The residuals that fit both criteria of each class are then used to search for a solution to the optimization problem (
28).
A procedure to find the minimal subset of residuals capable of detecting and isolating a predefined group of faults is described by Algorithm 1. The procedure has three inputs: a list of all residual generators (), the list of faults (F), and the minimal number of residuals (). The output is a list of subsets of with elements capable of detecting and isolating all faults in F.
The algorithm first computes the detectability and undetectability classes; then, a first loop is used to filter the residuals based on both criteria. If the isolability requirements of the pre-selected group of residuals () are not met, the selection criteria are relaxed (). Then, all possible subsets of the pre-selected residuals containing elements are computed (). The isolability of each subset is verified, and only the subsets that can isolate the faults (F) are considered the solution (). The other procedures used in Algorithm 1 are described below.
The DetectabilityClass() for each fault () lists all residuals from R that are sensitive to this fault and returns subsets of residuals corresponding to each fault.
The UndetectabilityClass() for each fault () lists all residuals from R that are not sensitive to this fault and returns subsets of residuals corresponding to each fault.
For each detectability class (d) and undetectability class (u), FilterResiduals() filters the residuals considering cardinality and equation structure criteria. If the flag () is activated, the cardinality criteria are relaxed, and the list of residuals () that fit all filtering criteria is returned.
CheckIsolability() checks if a group of residuals (R) generates unique fault signatures for each fault (); 1 is returned if true and 0 if false.
ComputeSubsets() computes all possible combinations of residuals from R separated into groups of residuals and returns a list containing all possible combinations.
Algorithm 1: Residual Selection Algorithm |
Inputs: Set of residual generators , List of faults F, minimum number of residuals |
Output: Subsets of with minimal cardinality |
procedure ResidualSelection(R,F) |
detectabilityClass(R,F) |
undetectabilityClass(R,F) |
|
|
while do |
filterResiduals() |
if checkIsolability(,F) then |
|
else |
|
computeSubsets(,) |
|
for all do |
if checkIsolability() then |
|
|
if then |
|
return |
Having the set of residual generators that will be used for fault detection and isolation, the theoretical Fault Signature Matrix (FSM) of the subset can be defined. The FSM binary codifies all fault signatures of a set of faults (
) on a set of residual generators (
) [
48]. The FSM has
columns and
lines. If a residual generator (
) is sensitive to a fault (
), the
i-th line cell of the
jth column is marked 1 and 0 otherwise. Considering that Algorithm 1 guarantees that all fault signatures are unique, each column of the FSM is different.
The residual selection algorithm is illustrated for a simple example. Consider a set of three faults (
) and three residual generators (
). The fault sensitivity of each residual generator is defined by the fault signature matrix in
Table 2.
Three faults generate six detectability and undetectability classes. Consider that the
FilterResiduals( )procedure returns all three residuals (
), i.e., all residual generators from
R are composed of ODEs or DAEs of index 1 and have the same cardinality. The minimal number of residuals required to isolate three faults is
. The
ComputeSubsets( )function returns three possible subsets of two residuals generators:
The
CheckIsolability( )procedure confirms that all three subsets expressed in Equation (
31) are capable of detecting and isolating the three considered faults (
). It is clear from the fault signature matrix illustrated in
Table 2 that all three subsets of Equation (
31) generate a unique fault signature for each fault.
5. Numerical Residual Sensitivity
In practice, the residual sensitivity may vary according to various factors, such as the equations and measurements used to compute the residuals. To illustrate the different sensitivities that residuals may have in practice,
Figure 4 displays 28 residual signals when 10 different faults are injected. Each fault is active in turn in the gray-shaded areas. The numerical fault sensitivity varies according to the residual generator. For example, when the seventh fault is injected at 36 s, some residuals are much more affected than others.
To achieve the best performance in detecting and isolating faults, it is suitable to implement the subset of residuals with the highest fault sensitivity. A new method to compute the residual numerical fault sensitivity is presented. The method calculates the residuals’ sensitivity in a practical way on a simulated scenario, where the residual signals without faults are compared with the residuals when faults are injected. This comparison involves calculating the signal-to-noise ratio, using the signal without faults as the baseline for noise. Each residual has one residual sensitivity index (RSI) for each fault. The subsets that include residuals with the highest RSIs are then considered the most effective for fault detection and isolation.
The theoretical fault signature matrix contains information about the fault sensitivity of each residual. In order to quantify this sensitivity and compare the different sensitivities between residuals, a Numerical Residual Sensitivity Matrix (NRSM) is constructed. This matrix has the same dimensions as the FSM, but the Boolean values are replaced by an RSI, varying within the range of . The RSI is defined as follows:
Definition 8.
Residual Sensitivity Index (RSI): Consider a set of residual signals (), all of which are sensitive to one fault (). Assuming that is the residual signal when all faults that affect are zero and is the residual signal when fault is injected, the RSI of residual concerning fault is defined as follows:where is the signal-to-noise ratio between the signal () and the noise ():where is the number of samples. In other words, the RSI of residual signal of fault , denoted as , is a max normalization of the signal-to-noise ratio considering all residual signals that are sensitive to fault . For instance, consider two residual signals ( and ), both of which are sensitive to fault . If , then signal-to-noise ratio of residual is two times greater than the signal-to-noise ratio of residual when is injected. The number of RSIs that can be computed from a residual depends on the sensitivity of the residual. If residual is sensitive to three faults, for example (, and and ), three RSIs can be computed (, , and ).
The numerical residual sensitivity matrix is constructed by calculating the RSI between each residual and fault (Equation (
32)) and defining
. Two other indices can be calculated from the RSI: the Overall Residual Sensitivity Index (ORSI) and the Subset Sensitivity Index (SSI).
Definition 9.
Overall Residual Sensitivity Index (ORSI): Consider a residual signal () sensitive to a group of faults (); the ORSI is the geometric mean of all that can be calculated from : Each residual has only one ORSI. From the ORSI, the SSI is obtained:
Definition 10.
Subset Sensitivity Index (SSI): Consider a subset (S) composed of residual. The of subset S is defined as follows: The definition of and using the geometric mean may lead to values equal to zero if one residual does not present in the same theoretical sensitivity in simulation. If residual is, in theory, sensitive to but the same sensitivity is not observed in simulation (), it follows that and that the of all subsets that use will also be zero.
The reduced-order example from
Table 2 is now used to illustrate how the RSI, ORSI, and SSI are calculated. Consider that three residual generators (
) generate three different residuals in the fault-free scenario (
). In addition, three different residuals are generated when each fault is injected (
,
). Consider that when
is injected,
and
have the same SNR. When
is injected,
has an SNR ten times greater than that of
, and when
is injected,
has an SNR two times greater than that of
. From (
32), the following RSI is obtained, which yields the NRSM defined in
Table 3:
From (
36), the
of each residual generator can be calculated:
The results from the residual selection problem show that any combination of two residuals from
Table 3 is sufficient to detect and isolate the three faults. By using the
obtained in (
37), the following
are derived:
where
,
, and
. In (
38), subset
is the highest SSI and, therefore, achieves the best performance in detecting and isolating faults.
7. Application to the Multi-Engine Cluster
The multi-engine cluster model presented in
Section 2 was implemented in
. Experimental results are not yet possible, as reusable launch vehicles are still under development in Europe [
53].
To replicate the behavior of the cluster when a fault is injected, a MIMO control system with three PIDs for each engine was designed. The control system is designed to keep each engine around a nominal operating point of 1000
Each PID uses a classical configuration [
40] where three outputs are controlled by three inputs. The outputs are the mixture ratio in the combustion chamber and gas generator chamber and the combustion chamber pressure (
). The control inputs are the directional valves and the gas generator
and LOX valves (
), where the index
denotes the i-th engine. The control system is configured to have a settling time to the step response of 2
without overshooting.
7.1. Measurement Noise and Sampling Frequency
Measurement noise consistent with realistic measurement assumptions is added to the simulation. The noise information is presented in
Table 4. The sensors are configured to measure at a frequency of 10
7.2. Residual Selection for the Multi-Engine Cluster
The structural analysis from
Section 3 returned 16,441 residual generator candidates to detect and isolate a group of 10 faults. The residual selection algorithm presented in
Section 4 was applied to find the subsets of residuals capable of isolating all faults. This led to the extraction of 20 detectability and undetectability classes from ten faults (
). The first pre-selection of residual generator candidates performed by the
filterResiduals function returned 28 different residual generator candidates (
) after the analysis of 16,441 candidates. These 28 selected candidates can isolate all faults. According to Equation (
30), at least 4 residuals are necessary to isolate 10 faults (
).
The residual selection Algorithm 1 was run two times to investigate if the size of the subsets influences fault detection and isolation performance. In the first run, the minimal number of residuals was defined as . There were 23,741 subsets of four residuals, and 6 subsets met the detectability and isolability requirements (). The second time, the minimum number of residuals was defined as . There were 118,755 subsets of five residuals, and 594 subsets met the detectability and isolability requirements ().
The results obtained from the selection Algorithm 1 can be compared with the selection algorithm presented in [
33], where a more restrictive isolability constraint is used. When applying the selection algorithm from [
33] on the 28 residuals, the algorithm did not find any solution capable of isolating all faults. The algorithm requires that 90 constraints be respected in order to isolate the same 10 faults. The 28 residuals were not able to respect 3 out of 90 constraints.
To illustrate some results returned by the residual selection algorithm, three subsets of residual generators that theoretically can detect and isolate all faults are defined below.
The fault signature matrix of the residual generators used by the subsets from (
40) are illustrated in
Figure 5. It is possible to see that all three subsets generate a unique fault signature for each fault.
7.3. Residual Generation
The sequential residual generation method [
50] is used for residual generation. It is a fundamental method for residual generation that relies on the model equations to verify the consistency between them and the known variables. The process of calculating the residual is described in [
54]. Let
be the subset of equations of a residual generator (
) with unknown variables (
). By choosing one equation from
to check the consistency (
), the remaining subset of equations, which initially had a degree of redundancy equal to one, now forms
as a just-determined set of equations. The generation method first solves the just-determined problem to estimate all unknown variables (
). Then, the consistency equation (
) is used to generate a residual. The only configuration needed to implement this method is the definition of the equation that will be used to check the consistency (
). This generation method is automated by the fault diagnosis toolbox [
28].
Another residual generation technique is based on state observers [
51]. The main advantage of state observers over the sequential residual generation method is the use of measurements (
y) to adjust the state estimation (
), which makes it more robust to model uncertainty and unmodeled dynamics. Under some specific constraints, the residuals generated by the two methods have a very similar form.
Figure 6 illustrates the signal generated by residual
(
24) using the sequential residual generation method and a method based on the Unscented Kalman Filter (UKF) state observer [
55], where the UKF process noise covariance matrix (
Q) is defined as one hundred times smaller than the measurement noise covariance matrix (
R).
The rest of the analysis is performed using the residuals generated by the sequential residual generation method due to its simplicity of implementation. However, the results could be extended to observer-based residual generation as illustrated in
Figure 6.
7.4. RSI, ORSI, and SSI Computation
To compute the SSI for each of the 600 subsets, it is necessary to compute the RSI and ORSI of the 28 residual generators selected in
Section 7.2. To illustrate how those indices can be calculated, two residual generators are used as examples:
and
. The sensitivity of those residuals is defined in
Figure 5. Residual generators
and
are sensitive to eight and six faults, respectively, which leads to eight RSI values for
and six RSI values for
.
The first step is to use the residual generator (
R) to generate a residual signal (
r). The sequential residual generation method is used with the aid of the fault diagnosis toolbox [
28]. The residual signal in a fault-free condition generated by both residual generators is depicted in
Figure 7a. The residual signal when all ten considered faults are injected is illustrated in
Figure 7b, where the faults are active in the gray-shaded areas.
Using Equation (
33) to compute the signal-to-noise ratio between the noise signal depicted in
Figure 7a and the signal from
Figure 7b, the values presented in
Table 5 are obtained. Based on the signal-to-noise ratio, the residual signal of
is more sensitive than
considering faults
,
, and
, while residual
is more sensitive than
to faults
and
. The largest difference is concerning fault
, where
has a signal-to-noise ratio 54 times higher than that of
.
In order to obtain the RSI values, the signal-to-noise ratio must be normalized by dividing the signal-to-noise ratio value by the maximum value considering all 28 residual signals. Repeating the process of generating residual signals using the SQR and computing the signal-to-noise ratio of each residual for the remaining 26 residual generators and performing the max normalization, the RSI values for the used residuals form the subsets defined in Equation (
40) in
Table 6.
From (
34), the ORSI is obtained by computing the geometric mean of the RSI values for each residual signal. For
and
, using the RSI values defined in
Table 6, the following ORSIs are obtained:
Having defined the ORSI for all residuals, the SSI of the subsets can be defined. The SSI of subsets
and
, for example, is computed by taking the geometric mean of the ORSI values:
From (
44), we can infer that
, which indicates that subset
achieves better FDI performance than
. In addition, subset
has the highest SSI among the 594 subsets of five residuals.
For the SSI values of the other subsets, the six subsets of four residuals have a mean SSI of
, and the 594 subsets of five residuals have a mean SSI of
. The subset sensitivity cannot be compared between the two families of subsets (with four and five residuals) because it uses a different value to normalize the RSI. The precise value of each subset sensitivity index is analyzed using Monte Carlo simulation results in
Section 7.6.
7.5. Residual Evaluation
The residual evaluation was carried out by applying Wald’s Sequential Probability Ratio Test (SPRT) [
56] to detect a change in mean
for a given residual (
r). The SPRT is based on a variable horizon; if a decision cannot be taken for a given risk, the decision is reported, and another sample is used. Usually, SPRT detects faults faster than classical ratio tests with fixed observation horizons.
The SPRT equation used to detect any mean change is defined in [
57]:
where
and
are the nominal variance and mean of the residual
r, respectively;
is the mean threshold;
k is the number of samples; and constants
A and
B are configured according to the probability of false alarm (
) and non detection (
), respectively:
The test (
45) has two outputs:
if residual
r has a nominal mean of
or
if the residual
r has a absolute mean greater than
considering the risk defined by
A and
B.
The sequential probability ratio test was implemented for each residual generator. To configure the test, a fault-free simulation was run for 300
and the expected mean (
) and standard deviation (
) of each residual in a fault-free condition were computed. The hypothesis test is configured to detect a fault if the residual mean (
) exceeds the expected mean by more than 1.8 times the expected standard deviation:
To avoid false alarms, the SPRT is configured to take a decision after 100 measurements, which corresponds to minimum detection time of 0.1 The probability of non-detection and the false-alarm were set at and , respectively.
7.6. Monte Carlo Simulations
To analyze the performance of the selected subsets in detecting and isolating the predefined faults and the relation between the subset sensitivity index and its detection and isolation rates, 500 Monte Carlo simulations were carried out. All ten faults were injected during the simulation remained active for 2
after injection. The fault injection time and magnitude varied according to a predefined range shown in
Table 7. All fault magnitudes are expressed in terms of the percentage of their corresponding value when the multi-engine cluster is at its nominal operating point, i.e.,
represents a blocking that varies from
to
of the nominal value of
. The range of magnitudes was intentionally selected to test the sensitivity of the FDI systems with small values.
The results obtained with the six subsets of four residual generators are presented in
Table 8. For the 595 subsets of five residuals, the simulation results are illustrated in
Figure 8, where the detectability and isolability rate of each subset is represented by a circle.
The subsets with four residuals all had a similar sensitivity index, with detection and isolability rates varying within a narrow range. The sensitivity index appears to be more closely correlated with the isolability rate than the detection rate. Due to the limited number of subsets and the small difference between subsets, it is not possible to draw definitive conclusions.
The most interesting results were obtained with the subsets composed of the five residuals shown in
Figure 8. The sensitivity index of the subsets varies within a large range, from 0 to
. From a detection rate perspective, the sensitivity index seems to have a small correlation. Where one sort of orange line is formed, some residual generators are much more sensitive to faults, and if these residuals are part of the subset, the subset present a high detection rate, regardless of the subset sensitivity index. However, a strong correlation can be noticed between the sensitivity index and the isolability rate. It is clear that if a subset has a high sensitivity index, the isolability rate is also high.
The results from
Figure 8 also show the utility of the fault signature analysis discussed in
Section 6. Subsets with a sensitivity index equal to zero mean that one residual of the subset does not present, in practice, the expected theoretical fault sensitivity. The residual selection problems consider only the theoretical fault sensitivity, so it is not guaranteed that a subset with a sensitivity index of zero can isolate all faults. The strategy of taking into account the residual sensitivity to predict an active fault in the system explains subsets with a zero sensitivity index and non-negligible isolability rates.
The subsets that presented the highest isolability rates are
and
, with 79.66% and 79.72%, respectively. According to (
43) and (
5),
had a higher SSI than
, but in simulation, the performance appears to be similar. The difference between the two subsets is clear when the performance is analyzed in more detail. Subset
has a higher sensitivity index because its performance in isolating the ten faults is more even when compared with subset
, which is more effective in isolating specific faults. The performance of both subsets in isolating each fault is presented in
Table 9.
The difference between subsets
and
is more prominent when the fault magnitude is taken into account, as illustrated in
Figure 9, where the mean isolability rate of all faults is separated by different groups of fault magnitudes. For small fault magnitudes, the isolability rate of
is superior. However, when the magnitude of the faults is increased, the isolability performance of
overtakes that of
.
The spread of the isolability rates obtained for all ten faults separated by fault magnitude is illustrated by the box-and-whisker diagram in
Figure 10. It is possible to see that for subset
, the isolability performance increases for all faults when the magnitude is more significant. On the other hand, for subset
, global isolability performance stays similar for all fault magnitudes. In addition, the performance of
has outliers with an isolability rate close to
, even for the highest fault magnitudes, which corresponds to the isolability of fault
defined in
Table 9. The difference between the two subsets is clear, and the choice of the best subset depends on the FDI constraints—for instance, if it is preferable to detect all faults, even with a low isolability rate for small magnitudes (
), or if the global isolability performance should be higher for all fault magnitudes, with some faults not being isolated (
).
8. Conclusions
A model-based fault detection and isolation system was developed for ten different actuator and sensor faults for a multi-engine cluster. A model of a multi-engine cluster with three engines was presented; it includes the dynamics of the liquid oxygen tank, feeding lines, and liquid-propellant rocket engines. For the first time, structural analysis was used in a complete multi-engine cluster model. The analysis computed 16,441 independent subsystems that could be used as residual generators for FDI purposes. A residual selection algorithm was proposed to find subsets of candidates capable of detecting and isolating all faults with minimal carnality. The selection algorithm returned 6 subsets of 4 residuals and 594 subsets of 5 residuals theoretically capable of detecting and isolating the faults.
In order to find the subsets with the highest fault sensitivity, a subset sensitivity index (SSI) is introduced. The SSI is calculated based on the residual signal. Two residual generation methods were considered: the sequential residual generation method and an unscented Kalman filter (UKF). The computation of the SSI relies on the residuals generated by the sequential method. The subsets with the highest SSI are, in theory, more sensitive to the faults. The similarities between the residual signals generated by the two methods indicated that the RSI is also valid for residuals generated using the estimations of the UKF. Fault detection was performed by analyzing the residual signal using the sequential probability ratio test for the mean. Fault isolation is performed in two steps. First, the fault is isolated if a perfect match between a theoretical and an observed fault signature is detected. Otherwise, if the observed fault signature does not match any theoretical fault signature, an algorithm that uses information about the numerical residual sensitivity predicts which is the most probable fault to be present in the system.
Monte Carlo simulations were performed under realistic measurement noise assumptions. They showed a high correlation between the SSI and the isolability rate, indicating that the SSI can be used to find the subset that yields the highest FDI performance. The best subsets achieved a detection rate of more than 95% and an isolability rate of 79%.