1. Introduction
With the development of science and technology, the systems involved in people’s production and life are developing in the direction of complexity, large-scale and modularization. This change in the system poses a challenge for the fast and accurate detection and isolation of faults in the system and, so, how to improve the fault diagnosis capability of the system, so as to improve the reliability and utilization of the system, has become a hot spot for researchers to study.
The concept of diagnosability has been introduced in order to improve the fault diagnosis capability of the system. Diagnosability is a property that indicates the extent to which system faults can be accurately and efficiently identified [
1]. The diagnosability concept was originally established to uncover deeper information about fault diagnosis, such as how difficult or costly it is to detect and isolate a fault, from a system perspective. The current research concluded that the diagnosis capability of the system is suitable to be improved by designing fault diagnosis methods only when the fault of the system can be diagnosed. If the system fault itself is difficult to diagnose or even undiagnosable, it is meaningless to improve the performance of the diagnosis algorithm for this kind of fault [
1]. Therefore, taking the difficulty of fault diagnosis into the scope of fault diagnosis, that is, considering the diagnosability of the system, can fundamentally improve the diagnosis capability of the system.
Research work on diagnosability can be divided into two aspects: diagnosability evaluation [
2,
3,
4] and diagnosability design [
5,
6]. Diagnosability evaluation refers to the process of measuring the degree to which system faults can be identified deterministically and efficiently, mainly revealing how difficult the fault can be diagnosed. High diagnosability means that the system is more capable of fault diagnosis, while low diagnosability means that the system is less capable of fault diagnosis. Diagnosability design refers to a design technique that takes diagnosability as an optimization index in the design stage of system and diagnosis algorithm [
1]. It can be seen that the diagnosability design is a key step in improving the system’s ability to diagnose faults. Depending on the object of diagnosability design, diagnosability design can be further categorized into the design of system structure and the design of diagnosis methods.
- (1)
Design of the system structure. Researchers in this direction focus on optimizing the design of the system’s structure to improve the system’s ability to respond to faults and, thus, improve the system’s diagnosis capability, while satisfying the premise of system diagnosability. Since the location and number of sensors can affect the effectiveness of fault identification and isolation [
7], the current research direction focuses on changing the structure of the system by optimizing the configuration of the sensors in order to improve the diagnosis capability of the system. The optimal configuration of sensors is essentially a multi-objective decision-making (MODM) problem in the multi-criteria decision-making (MCDM) problem [
8]. Various objectives such as system cost [
9], reliability, number of sensors, redundancy between sensor information [
10], system diagnosability [
5,
11,
12,
13], and the degree of hazard of missed faults are considered in different sensor configuration schemes to select one of them that is suitable for the system by balancing the conflicts between multiple objectives and reducing information redundancy [
14], making the system inherently more capable of acquiring fault information.
- (2)
Design of the diagnosis method of the system. Ding [
6] evaluated the false alarm rate, fault detection rate and fault detection time based on a stochastic algorithm, and based on this, proposed a design method for observer-based fault detection system, and [
15] proposed an adaptive application strategy that combines system diagnosability analysis with actual fault diagnosis. For different degrees of diagnosability, different diagnosis methods are used to detect and isolate faults, so as to achieve a balance between time efficiency and diagnosis accuracy; [
16,
17] defined K-gap and used it to give performance indexes of fault detection and isolation for use in the analysis of fault diagnosis capability. Literature [
18] proposes a strategy for verifying the diagnosability of discrete-time systems using conjunctive regular expressions, and illustrates the construction of diagnoser and the verification of diagnosability in a real-world case. The diagnosability design of the diagnosis method is mainly to take the diagnosability of the system into account, combined with the fault pattern recognition method [
19,
20], relying on the model or signal of the system, so that the fault of the system can be exposed as much as possible, in order to achieve the purpose of improving the diagnosability of the system.
From the above analysis, it can be seen that the diagnosability design method starts from the system structure design and diagnosis method design to improve the fault diagnosis capability of the system, in which the system structure design is mainly for the design of the system’s sensors, which mainly solves the problem of extracting information from the system; after all, only by extracting high-quality information from the system can reflect the change of the system fault and the system fault be quickly diagnosed. The diagnosis method design mainly solves the problem of how to utilize the information, by transforming the sensor information, extracting the features and other operations, the fault information can be exposed and the diagnosis of the fault can be realized.
Thus, both aspects can have an impact on the diagnosability of a system, but the two factors are currently considered separately when designing for diagnosability. The current process for diagnosability design is shown in
Figure 1a. In this process, the design of system structure and diagnosis method are two successive steps, which do not affect each other. Diagnosis method and system structure are separated. However, it is clear from the above analysis that the design of the system structure determines what diagnosis algorithm is used, and the choice of diagnosis algorithm affects the design of the system structure. If both factors are taken into account in the diagnosability design, the degree of fit between the fault diagnosis method and the system structure can be greatly improved, and, thus, the diagnosis capability of the system can be enhanced.
Based on the above ideas, an integrated design process was utilized in this paper when performing diagnosability design to improve system fault diagnosis, and the flow of this design method is shown in
Figure 1b, in which the mutual influence between the diagnosis method and the system structure was considered, and the degree of fit between the two factors was improved by considering the specificity of each measurement point for fault diagnosis. Thus, the fault diagnosis performance of system is greatly improved [
21]. Relying on this process, a diagnosability integration design method based on graph theory was proposed in this paper. In the current literature, it is rare to see an approach that enhances the diagnostic capability of a system by considering both system architecture design and diagnostic algorithms at the same time. The method was divided into three main steps: 1. Measurement point determination. The diagnosis plot of measurement point was drawn based on the diagnosability evaluation results, the maximal clique in the plot was extracted, and the maximal clique set was obtained according to the edge covering theorem, so that the set could cover the fault mode of the system. 2. Diagnosis method determination. The method of setting a cascade classifier on a maximal clique set was studied, and the posterior probability output problem of the classifier was studied. The output results of classifier and Shannon entropy were used to evaluate the fault identification results of the diagnosis method. 3. Measuring point updating mechanism. The measurement points can be replaced according to the evaluation results of the diagnosis method in order to obtain better diagnosis results.
The rest of this paper can be summarized as follows:
Section 2 describes the framework of the algorithm proposed in this paper;
Section 3 describes the specific details of the algorithm based on
Section 2;
Section 4 gives the flow block diagram of the whole algorithm; and
Section 5 illustrates the practicality and effectiveness of the method in this paper by taking the filter amplifier circuit as an example.
3. Diagnosability Integrated Design Method Details
In this section, the details of the design framework proposed in
Section 2 are developed in detail.
In order to facilitate the method description, the notation of this paper is explained.
Let the set of fault modes of the system be
, where
denotes the number of fault modes,
denotes the normal state, The set of tests is
, where
denotes the number of test. The results of the diagnosability evaluation for this system are
where
is the fault detectability evaluation matrix,
indicates the detection difficulty when using test
to detect the fault
, and a higher value means less difficulty (cost) to detect the fault.
is the fault isolability evaluation matrix,
indicates the isolation difficulty when using test
to isolate the system fault pair
, and a larger value means that the difficulty (cost) of isolating the fault pair is smaller.
3.1. Measurement Point Determination
3.1.1. Diagnosis Plot of Measurement Point Determination
The column vectors of Equations (1) and (2) indicate the difficulty of the fault or fault pair to be detected or isolated by the tests in the system. The tests were grouped according to the contents of Equations (1) and (2) and using the K-means clustering algorithm. Use to denote the set of tests that can detect faults at a smaller cost, and use to denote the set of tests that can distinguish fault pair at a smaller cost.
According to the results of and , the detection and isolation capability of the measurement point for different faults and fault pairs can be further revealed. In order to more graphically represent the diagnosis capability of measurement points for different faults, the diagnosis plot of measurement point was introduced in this paper.
The following algorithm was used to draw the diagnosis plot of measurement point .
Based on Algorithm 1 and Definition 1, it can be inferred that
Algorithm 1 the diagnosis plot of measurement point determination |
Input: , , , Output: 01. initialize: , 02. for to do 03. if 04. 05. 06. end if 07. for do 08. if do 09. 10. end if 11. end for 12. end for 13. Return |
Definition 1. Diagnosis plot of measurement point. The diagnosis plot of measurement point is used to qualitatively represent the diagnosis capability of the signal from measurement point for different faults, denoted as , where denotes the set of vertices of the graph , indicating that the faults corresponding to the vertices can be detected by test . The set of edges of the graph is denoted as , indicates that the fault pair can be isolated by test .
Corollary 1. The diagnosis plot of measurement point for any measurement point
- (1)
,
- (2)
3.1.2. Maximal Clique Extraction
According to Corollary 1, for a system with three states (
), all forms of the diagnosis plot of measurement point on this system are shown in
Figure 3.
Among the four cases shown in
Figure 3, (a) indicates that the measurement point is insensitive to both faults and cannot detect the occurrence of faults; (b) indicates that the measurement point can only detect one of the faults but not the other; (c) indicates that the measurement point can detect both faults but cannot distinguish between them; (d) indicates that the measurement point can detect and isolate both faults. In the diagnosability design of this test point, the information reflected by the test point in case (a) was too little, so it was not considered in this paper. The remaining three cases were discussed in this paper, and the distribution of samples in the feature space corresponding to these three cases is shown in
Figure 4.
As can be seen in
Figure 4, the different forms of diagnosis plot of measurement points corresponded to different situations that distinguish three system states. For the third case, the three states of the system were completely distinguishable, while for both the first and second cases, the presence of the fault
caused some fault states to be indistinguishable, with the difference that for the second case, the fault state
and
were mixed and it was difficult to distinguish between the two fault states. In other words, the fault can be detected, but the fault cannot be isolated and identified, while for the first case, the fault state
and
were mixed. The reason for this situation may be that the fault
was difficult to detect, or it may be that due to the existence of the system structure, the relevant information was not transmitted to the measurement point when the fault occurred, making the signal of the measurement point in the fault state not different from that in the normal state, causing the fault
to be undetectable.
According to the above analysis, when the diagnosis plot of measurement point presented the third case, as shown in
Figure 4, the states of the system were best discriminated at this time, and the states in the space did not appear to be mixed. However, in practice, it is not always possible to achieve this; therefore, the diagnosis plot of measurement point
should contain as many states as possible, and these states should satisfy the structure shown in
Figure 3d as much as possible, i.e., the diagnosis plot of measurement point should be as close as possible to the structure of the maximal clique.
In order to extract the maximal clique from the diagnosis plot of measurement point , we can refer to the Bron–Kerbosch algorithm in the literature, the details of which are not described in this paper due to the length of the paper.
It should be noted that for each diagnosis plot of measurement point, the maximal cliques contained in it may not be unique (e.g., in (c) of
Figure 3, both
and
are maximal cliques), and the p-th maximal clique of the diagnosis plot of measurement point
is noted as
.
3.1.3. Maximal Clique Set Selection Algorithm Based on Hypergraph
Through the above analysis, it can be seen that the structure of the maximal clique found in the diagnosis plot of measurement point represents the system states that can be most easily distinguished using the information of the measurement point; so, maximal clique structure is the basis of designing fault diagnosis classifier. However, it was obvious that the maximal clique of measurement points did not necessarily contain all the states of the system, and so, a single maximal clique cannot correctly identify the state of the system. Therefore, multiple maximal cliques are needed to jointly identify the fault state of the system. Therefore, it is the main problem addressed in this section to select a suitable set of maximal cliques from among many maximal cliques to facilitate the detection and isolation of system faults.
- 1.
Problem Description
The problem in this section can be described formally as follows:
For the aforementioned system, suppose the set of maximal cliques of the system is , . Now, it is necessary to choose the optimal set of maximal cliques for the maximal cliques in such that , while ensuring that .
- 2.
Problem solving
In this section, the above problem is solved based on the concept of hypergraph (Definitions 2 and 3).
Definition 2 (Hypergraph). Given the undirected hypergraph , the hypergraph vertex set is defined as , which contains hypergraph vertices. The set of hypergraph edges is defined as a set containing hypergraph edges, where for any hypergraph edge in the set of hypergraph edges , , it means that the hypergraph edge has hypergraph vertices.
Definition 3 (Edge Coverage). Given the hypergraph , . is said to be an edge covering of if each vertex is associated with at least one edge in .
The problem in this section can be converted to solving the edge covering problem in hypergraph: where the state of the system is equivalent to the vertex in hypergraph, and the maximal clique is the edge in hypergraph. For the edge-covering problem of hypergraph, the most classical method is to use a greedy algorithm to solve this problem, and the steps of this method are shown as follows (Algorithm 2).
Algorithm 2 the set of maximal cliques determination |
Input: , Output: 01. initialize: , 02. while do 03. selecet , so that it can cover most fault in 04. 05. 06. end while 07. Return |
3.2. Diagnosis Method Determination
3.2.1. Design of Maximal Clique Classifier
The set of maximal cliques determined through
Section 3.1.3 is denoted as
where
represents the
j-th maximal clique in the set.
According to the previous section, the system faults contained in the maximal clique are easily distinguishable under the measurement signal of the measurement point, and so, a classifier can be set for any maximal clique in the set , and the classifier can identify the system fault states by the signal of the current measurement point.
In classification problems, the difficulty of the classification problem is closely related to the kind of classification. In general, the more kinds of classification, the more detailed features are required and the more resources are consumed.
In order to reduce the difficulty of classification, researchers try to use a cascade approach to solve the classification problem in the current classification problem [
22], that is, the problems are first roughly classified by using features, and then the characteristics are finely classified for each rough classification, so as to achieve the purpose of correctly classifying the target samples. Studies have shown that the cascade classification approach has good classification results [
23]. In this section, cascade classifiers of each maximal clique in maximal clique set
were designed to reduce classification categories as far as possible, so as to select fewer features in feature extraction and identify the state of the system.
Based on the structure shown in Equation (3), construct the matrix
where
, if
denotes the
j-th maximal clique
in the maximal cluster clique set
can recognize the fault state
, and vice versa,
.
In this paper, the method of setting up classifiers for each maximal clique can be followed as follows.
The following is a concrete example to illustrate the calculation process of Algorithm 3.
Algorithm 3 Design of cascade classifier on the maximal cliques set |
Input: Output: classifier design results 1. is determined by according to Formula (4) 2. while do 3. adds up the rows to get the vector 4. if then 5. Set the classifier on , the number of categories is 6. 7. else 8. 9. 10. if and then 11. , s.t 12. the binary classifier is set on , the positive class is , the negative class is all the remaining faults 13. 14. 15. end if 16. if and then 17. Select so that it can cover the most faults in , 18. 19. 20. end if 21. if then 22. the binary classifier is set on , the positive class is , the negative class is all the remaining faults 23. 24. 25. end if 26. end if 27. end while |
Suppose the fault mode of a system is
, the set of maximal clique is
, and the matrix is determined according to Equation (4) as
According to the approach of Algorithm 3, at this time,
is accumulated according to the rows to obtain
. Then, choose maximal clique
and set the binary classifier, where the positive class label is
and the negative class label is
, and update
as
according to the approach of Algorithm 3, we can obtain
, and then choose a maximal clique
and set the binary classifier, where the positive class label is
and the negative class label is
, and update
as
is obtained, then the maximal clique
is chosen and a triple classifier is set up, in which three system states
,
and
are distinguished, respectively.
3.2.2. Feature Cacullation and Feature Selection
Section 3.2.1 sets the cascade classifier on maximal clique, but the features of the signal need to be extracted and selected before the classification, assuming that the set of features of the signal is
, where
denotes the number of features. The signal features here can use a variety of signal features such as time domain signal characteristics, frequency domain signal characteristics, and time-frequency domain signal characteristics.
In the feature selection session, the signal features that are beneficial for fault diagnosis are selected based on the Relief_F algorithm. The specific algorithm for feature selection is shown in Algorithm 4.
Algorithm 4 Signal feature screening algorithm based on Relief_F |
Input: feature set samples , , Output: 01. initialize: 02. for to do 03. select sample 04. for to do 05. find neighbors of the same class for 06. for every class 07. find neighbors of different classes of 08. end for 09. 10. end for 11. end for 12. 13. 14. while do 15. 16. 17. 18. 19. end while 20. Return |
3.2.3. Posterior Probability Output of Classifier
According to the approach shown in
Section 3.2.1, a classifier can be determined for each maximal clique. In the design of classifiers, constructing a classifier to generate posterior probabilities is a better approach in the discrimination case. Combining multiple types of posterior probabilities is useful in the final decision, even if the discrimination result is chosen based on the maximum posterior probability, which is the most Bayesian decision for the equal loss case. The output of the posterior probabilities allows not only to determine the classification of the classifier but also to measure the classification performance of the classifier by the posterior probabilities.
In this paper, we took the support vector machine (SVM) as an example to illustrate the output method of its posterior probability.
Assume that the training sample data set is
. For a sample
in it, the output of the support vector machine takes the form
where
denotes the symbolic function acting as a decision function.
is a quantity proportional to the distance of the interface, and it is also believed that the closer the point is to the interface, the less likely it is to be correctly classified; the farther it is from the interface, the more likely it is to be correctly classified.
For a binary classification problem, based on the Bayesian formula, we can get
By fitting the Equation (9), we can get
where
and
are the parameters to be estimated.
Equation (10) was got from dataset using maximum likelihood estimation.
Construct the maximum likelihood function according to Equation (10)
Turning Equation (11) into a log-likelihood function
3.3. Performance Evaluation Method of Diagnosis Scheme Based on Information Entropy
A diagnosis scheme for the system can be designed according to the methods shown in
Section 3.1 and
Section 3.2. In order to study the pros and cons among the schemes, this section examines the evaluation of the diagnosis scheme.
It is assumed that the signal set of each measurement point obtained by a certain operation of the system is
, the diagnosis results for this sample can be expressed in terms of the posterior probability vector of the output of the diagnosis scheme according to the previous section
suppose the posterior probability of the diagnosis scheme for the current state of the system is
and
, then it can be considered that the current state of the system is
; while when its diagnosis posterior probability is
and
, although it can also be considered that the system is in the fault state
, However, in this case, it is reasonable to assume that the reliability of the diagnosis scheme for the current system state identification results is low. Therefore, from the posterior probability results shown in Equation (13), its distribution indicates confidence of the diagnosis scheme to the diagnosis result, when
are more concentrated, it means that the diagnosis scheme has more confidence for the diagnosis results, and the diagnosis scheme has better diagnosis performance at this time, while when
are more dispersed, it means that the diagnosis method has lower confidence for the diagnosis results. Therefore, by calculating the entropy of Equation (13) to quantify the confidence
of the classifier on the sample, the smaller the entropy, the lower the uncertainty in the classification, and the higher the confidence of the classifier output, i.e., the confidence and entropy are inversely proportional. In the selection of entropy, the more commonly used Shannon entropy is chosen as a measure of the diagnosis effect of a single sample in this paper. If the diagnosis of the current sample is correct, the Shannon entropy of Equation (13) is the loss of the diagnostic scheme to the sample diagnosis, and if the diagnosis of the current sample is wrong, the maximum Shannon entropy in this classification case is used as the loss of the diagnostic scheme to the sample diagnosis, that is
3.4. Measurement Point Update Mechanism
The diagnosis result of the current fault diagnosis scheme can be evaluated by Equation (14), and when the evaluation result is not good, it means that the current set of maximal cliques , i.e., the selected measurement points, does not fully reflect the fault information of the system, and additional information needs to be obtained by adding new measurement points and using this new information to diagnose the system.
For the existing set of maximal cliques, it is necessary to find new maximal cliques from the set to supplement the fault information of the system, and the following principles should be followed when selecting new maximal cliques:
- (1)
The new maximal clique should cover as many faults as possible in order to provide more information for the fault diagnosis of the system;
- (2)
When there are multiple maximal cliques in that can cover the same number of faults, the most economical one is selected based on the principle of economy.
5. Instance Verification
5.1. Experiment Preparation
In this paper, a filter amplifier circuit was used to verify the proposed method. The circuit diagram of the experimental circuit is shown in
Figure 6, and 20 dB of noise was added to the measured signal in it. The simulated faults of the experimental circuit are shown in
Table 1. The circuit diagram was run 50 times in different states, where 30 sets of data for each state were used for classifier training, 10 sets were used for diagnosis method evaluation, and 10 sets were used for final testing. The set of all classifier training data is denoted as
, the set of all diagnosis method evaluation data is denoted as
, and the set of test data is denoted as
. The signal features selected in this paper is shown in
Table 2.
5.2. Experimental Results
Based on the results of the diagnosability evaluation of the system shown in
Figure 6, the diagnosis plot of measurement point is drawn in conjunction with Algorithm 1, as shown in
Figure 7.
Using the Bron–Kerbosch algorithm to analyze the diagnosis plot of measurement point and extract maximal cliques, a total of four maximal cliques can be extracted from the diagnosis plot of measurement point shown in
Figure 7, which is recorded as
. We can obtain
from
using Algorithm 2, and according to Algorithm 3, the design result of cascade classifier based on the maximal clique can be obtained: using the data of measurement point 6, the five states of the system were identified. Algorithm 4 was used to calculate the contribution of each feature to the diagnosis, which is shown in
Figure 8. Finally, feature 12 and 13 were selected to distinguish the system state, and then, a complete fault diagnosis scheme can be obtained.
Data set was used to evaluate the diagnosis results of this scheme, and the average Shannon entropy of samples is 0.4815.
The measurement point information was updated to . Repeating the above steps, the average Shannon entropy of the evaluation using the sample in this condition was 0.5369. This result was not as good as the last diagnosis, indicating that the diagnosis method obtained using is better.
Therefore, the diagnosis scheme obtained by the graph-theory-based diagnosability-integrated design method in this paper was as follows: the signal of measuring point 6 was collected, and the two features of gravity frequency and mean square frequency can be used to identify the fault of the system.
Using this diagnosis scheme, the state of
was identified, and the results are shown in
Figure 9.
5.3. Diagnosis Results of Other Methods in the Literature
The method in this paper was compared with four common fault recognition methods in the literature, namely, K-nearest neighbor (KNN) classifiers [
24], random forests (RF) [
25], discriminative classifiers, and support vector machines (SVM) [
26], to illustrate the degree of superiority of the different methods by comparing the error rate of recognizing systematic fault patterns under the different methods.
Firstly, the data of measurement points 5 and 6 were extracted according to the signal features in
Table 2, and the feature vectors of multiple measurement points were then pieced together to form a new vector or matrix for diagnosis [
27], and the schematic diagram of the piecing method is shown in
Figure 10.
In this paper, the feature vectors were recombined by means of horizontal splicing of the data from multiple measurement points. In performing the diagnosis, the data set
(a total of 5 × 30 groups of samples) was used for training, and the data sets
and
(a total of 5 × 20 groups of samples) were used for testing. The fault diagnosis results are shown in
Table 3, and the table shows the number of samples that were incorrectly diagnosed.
The four methods in the literature and the method in this paper were subjected to 50 simulation experiments, and the number of samples that were misdiagnosed in each experiment was recorded, and the statistical indexes such as the mean and quartile of the experimental results of each method were counted and plotted as a box-and-line diagram, as shown in
Figure 11.
5.4. Experimental Analysis
Table 3 and
Figure 9 show the diagnosis of different fault modes in one experiment. From the results, the fault diagnosis scheme designed by this paper’s method can identify the faults of the system well, and the methods in the literature can only identify some of the fault modes, while there were difficulties in identifying other faults. For example, the KNN and random forest methods had poorer results in the identification of fault mode 4 compared to other fault modes. In terms of diagnosis correctness, the correctness of this paper’s method was improved by 3.25 percentage points compared to the average correctness of the four methods in the literature.
Figure 11 shows the statistical results of the diagnosis of different fault modes in 50 repeated experiments. From the statistical results, the median number of incorrectly diagnosed samples of this paper’s method was 0, and the maximum value was 2, which indicates that at least in half of the experiments, this paper’s method was able to correctly diagnose all the samples, and the number of incorrectly diagnosed samples of the worst diagnosis result was 2. This diagnosis result was already better than that of the KNN and the random forest (the minimum value of the number of incorrectly diagnosed samples of the two methods was 5 and 2, respectively), while for the use of discriminative classifier and SVM, both methods performed stably in 50 repetitions, and their number of incorrectly diagnosed samples was 1.
In the comparison method, the design of the fault diagnosis method can only be based on the measurement points that have been selected, which does not guarantee that the information provided by the selected measurement points is best adapted to the current fault diagnosis method. Since the information provided by two groups of measurement points, measurement point 5 and measurement point 6, was used in the experiments to diagnose the faults of the system, in order to handle multiple groups of measurement points, the method adopted in the literature was to splice the signal features of multiple measurement points horizontally to form new feature vectors to be used, which seems to be a fast and easy way, but the implied premise of doing so is that the distinguishing ability of each signal for the faults is the same, and the same features on different signals for the faults, which makes the implied premise obviously over-idealized. In the actual classifier, the classification is based on the relative position of the sample features in the feature space, in this case, when the features used are not sensitive to faults, the addition of these features will make the originally clear relative position become blurred, and, thus, the phenomenon of misclassification occurs. This leads to the situation in the experiment: although the information of two measurement points was used in the comparison experiment, which provided more diagnosis information, the diagnosis effect was not satisfactory. Therefore, it is not the case that the more measurement point information is used, the better the diagnosis effect is, but only by combining the information of the measurement points with the specific fault diagnosis method can the best fault diagnosis effect be produced.
In the method of this paper, the inclusion of the measurement point updating link makes the diagnosability design work to form a feedback, and the existence of the feedback makes the selection of the measurement point and the fault pattern recognition method no longer a back-and-forth step, and the two can be balanced to realize the optimal fit. In this paper, people can not only choose the fault diagnosis method based on the selected test points but also the existence of feedback allows people to “reflect” on whether the current measurement data are appropriate for the fault diagnosis method, and improve the data through the measurement-point-updating mechanism. Thus, in a cycle, the selected test points and the diagnosis methods are adapted to each other, thus obtaining better diagnosis results and improving the system’s diagnostic capability.
6. Conclusions
To address the problem of diagnosability design, in order to take into account both system structural factors and diagnostic schemes, and to improve the degree of fit between diagnostic methods and system structure, a diagnosability-integrated design method based on graph theory was proposed in this paper, which can generate specific fault diagnosis methods for specific systems, and can be applied to the diagnosis of faults on electronic, mechanical, and other systems. The method draws a diagnosis plot of measurement point through the diagnosability evaluation results, extracts the maximal cliques from the diagnosis plot of measurement point, and sets cascading classifiers on the based on maximal cliques to achieve the design of diagnosis methods, and calculates the Shannon entropy based on the posterior probability of the classifier output as the diagnosis capability measure of each diagnosis scheme. The diagnosis scheme with low diagnosis capability is reconfigured through the measurement point update mechanism to achieve the purpose of diagnostic scheme optimization. The results of simulation experiments showed that compared with the existing methods in the literature, the method in this paper improved the average diagnosis correct rate by 3.25 percentage points, while in many simulation experiments, the error rate of most of the methods in this paper was 0, which indicates that the method in this paper had a high degree of accuracy and stability.
The ultimate aim of the method purpose of this paper was to design the diagnostic scheme of the system to improve the fault diagnosis capability of the system through the diagnosability design, and the operation of the method was based on the rich operation data of the system. In the future, how to improve the fault diagnosis capability of the system through the diagnosability design can be considered, especially in the case of the system signal data being difficult to obtain or the data being contaminated.