The fault propagation behaviour analysis method proposed in this paper has a certain general applicability and can provide guidance for the analysis of fault propagation behaviour of other complex systems. This paper selects the machining centre commonly used by various manufacturing companies as an example to illustrate the specific application.
4.1. Implementation
In this section, we take MDH series horizontal machining centres as the research object that are mainly used for processing rotary parts. We collected 108 on-site fault information details of 36 machining centres of this series during the course of one year. After fault analysis, we can determine whether each component fault is an independent fault or a related fault. If it is a related fault, the antecedent component that caused the component fault will be determined through fault analysis. For example, when the workpiece cannot be clamped, the tool magazine system (T) is the direct fault location but the root cause is that the workpiece cannot be clamped due to insufficient pressure in the hydraulic system (H). At this time, the faulty component is the tool magazine system (T) and its antecedent component is the hydraulic system (H). Considering the existence of this propagation fault, there is likely to be a directed arrow pointing from the hydraulic system (H) to the tool magazine system (T). Similarly, when the servo motor fails, the fault location is the feed system (F) but the root cause is the abnormality of the spindle system (B). Therefore, there is likely to be a directed arrow from the spindle system (B) to the feed system (F). In this manner, we can identify other related faults. The statistical analysis results of the related faults are shown in
Table 1.
As expressed in
Table 1, combined with the knowledge of graph theory, the fault propagation relationship of a machining centre is modelled. We do not consider components with uncorrelated faults such as the workbench (
U). Thus, we can obtain the fault propagation digraph of a machining centre as shown in
Figure 5.
As expressed in the fault propagation digraph of a machining centre in
Figure 5, the relationship matrix (
A) and the design structure matrix (
M) can be obtained as follows:
The element 1 in the matrix M indicates that the fault of the system component impacts . Contrarily, 0 indicates that no impact exists.
Thus, on the basis of the modelling process of the DSM fault propagation hierarchy structure model, the fault propagation DSM model of a machining centre can be obtained as shown in
Figure 6. The diagonal elements in
Figure 6 are represented by black squares.
On the basis of
Figure 6, we can derive the result of module division after the DSM modeling processing: the first-layer system components are (T, F), the second-layer system components are (B, W, K), the third-layer system components are (NC, H, Q, R), and the fourth-layer system component is (D). The system components of the first layer, which is the appearance layer, are classified as the fault absorption layer in the fault propagation model. The system components of the fourth layer, the root layer, are classified as the fault initiation layer and the remaining layers are classified as the fault propagation layers. Thus, the fault propagation hierarchy structure model of a machining centre can be obtained as shown in
Figure 7.
On the basis of obtaining the fault information of a machining centre, we calculated the fault probability function of each component according to the calculation method presented in
Section 2.2.1. Considering the impact of the timing truncation test and the fault time truncation on the sequence of the fault data, we use the Johnson method to modify it. The parameters of the fault probability model are then estimated and the distribution hypothesis test is passed. Finally, the fault probability function of each component can be obtained as shown in
Table 2 and the function curve is illustrated in
Figure 8.
Similarly, the fault probability function of a whole machining centre is .
According to
Table 2 and Equations (9) and (10), the probability importance and criticality importance of each system component node can be obtained at any time. The function curves of probability importance and criticality importance of each system component node are illustrated in
Figure 9 and
Figure 10, respectively.
Given that the values of probability and the criticality importance of system components are varied, this paper takes t = 1500 h as an example to illustrate this concept. By substituting the fault probability values of system components obtained from
Table 2 into Equations (9) and (10), the probability importance and criticality importance of each system component node at t = 1500 h can be obtained as shown in
Table 3.
On the basis of Equations (10) and (11), combined with
Figure 7, the edge fault influence value of the same level at 1500 h can be calculated as shown in
Table 4.
In reference to Equation (12),
Table 3, and
Figure 7, the edge fault influence value between different levels in the fault propagation hierarchy structure model of a machining centre at 1500 h is calculated. The results are detailed in
Table 5.
On the basis of
Table 4 and
Table 5, the 1500 h fault propagation hierarchy model of a machining centre based on fault influence degree can be obtained as shown in
Figure 11.
By substituting the data in
Table 2,
Table 4 and
Table 5 into Equation (13), the fault propagation probability of each system component at any time can be obtained. In addition, the calculation results of the 1500 h example are presented in
Table 6.
On the basis of the fault propagation hierarchy model of a machining centre in
Figure 7, all the paths between any two nodes in the fault propagation digraph can be obtained and the paths are listed in
Table 7. In reference to Equation (14) and
Table 7, the edge betweenness of the fault propagation hierarchy model is calculated and the results are presented in
Table 8.
Table 8 reveals that the edge betweenness of different directed edges differs. Therefore, the influence of the structural characteristics of the model on the fault propagation also must be considered. The larger the edge betweenness value is, the more important the edge is in the fault propagation of a whole machine and greater consideration is required when the probability of being selected as the fault propagation path increases.
Therefore, on the basis of Equation (15),
Table 6 and
Table 8, the fault propagation intensity of each directed edge at 1500 h can be obtained as shown in
Table 9.
In reference to
Figure 11 and
Table 9, a hierarchy structure model of the fault propagation intensity of a machining centre can be obtained as shown in
Figure 12.
In reference to Equation (16) and
Figure 11, the fault propagation probability values of each path in the fault propagation hierarchy model of a machining centre at 1500 h can be calculated as shown in
Table 10.
As expressed in
Table 10, the fault propagation probability of each path is greater than the threshold value of 10
−8; thus, a fault propagation phenomenon exists in the model.
Based on
Table 9 and the hierarchy structure model of fault propagation in a machining centre, for the fault initiation layer there is only the electrical system component
D. Hence, the critical node of the fault initiation layer is
D. For the fault propagation layer at 1500 h,
, thus the critical node currently is
NC. This indicates that the fault is more likely to be transferred from the electrical system to the numerical control system. Given that
, the critical node is
B. Similarly, for the fault absorption layer system component node,
, thus the feed system
F is the critical component of this layer. Therefore at 1500 h, the critical nodes are D, NC, B, and F, and the path composed of critical nodes is the critical fault propagation path. At 1500 h, the most likely propagation path of the fault is D→NC→B→F.
Similarly, the fault propagation intensity value of each directed edge of a machining centre at any time can be determined; in turn, the critical fault nodes and paths of a machining centre at any time can be obtained. This paper arbitrarily chooses the running time of 200 h and 5000 h as examples and draws the comparison diagram of the fault propagation intensity value of each directed edge as shown in
Figure 13.
In reference to
Figure 13, the fault propagation intensity values of each directed edge are varied at different times and the critical fault propagation paths of the machining centre at different times are distinct. The critical fault propagation path of 200 h is D→NC→B→T, at 1500 h the critical fault propagation path is D→NC→B→F, and at 5000 h the critical fault propagation path is D→NC→B→T.
4.2. Comparison Analysis
The proposed method in this paper is compared with the importance evaluation method proposed in Reference [
49]. When evaluating the importance of machine tool system components, the method in Reference [
49] only evaluated the importance of components from the perspective of the fault propagation mechanism and did not consider the structural characteristics of the model. Combining the application examples in this paper, when the model structure characteristics are not considered and only the fault propagation mechanism is considered, the fault propagation probability value of each directed edge at 1500 h can be calculated according to Equations (10)–(13). Combined with the fault propagation hierarchy structure model of the machining centre in
Figure 6, the fault propagation hierarchy structure model of the machining centre based on the fault propagation probability is drawn as shown in
Figure 14. The model only considered the fault propagation mechanism of the machining centre.
As expressed in
Figure 14 at 1500 h, the fault source of the machining centre is component D. The fault will propagate along the components with a high probability of fault propagation. As
, the fault will preferentially propagate along D→B and because
, the fault is more likely to propagate along B→F. The critical fault propagation path at this time is D→B→F and the critical nodes are D, B, and F. That is, at 1500 h, the fault is most likely to be transmitted as such: electrical system→spindle system→feeding system. However, according to the method proposed in this paper, the result of calculation and analysis is D→NC→B→F. There is a certain difference in the critical fault propagation path obtained by the two methods that is mainly because the method based on the fault propagation probability does not consider the structural characteristics of the model. The component
NC plays an important role in the fault propagation structure model; when it fails, it will have a greater impact on the entire system, thus requiring attention. The influence of the structural characteristics of the model on the propagation of faults cannot be ignored. Therefore, the method proposed in this paper is more conducive to the analysis of the fault propagation behaviour of a machining centre and the analysis result is more reasonable.