3.2. The HMM of Our System
In general, HMM can be expressed as: λ
HMM = (TM, EM, π). It consists of [
29]:
- -
A set of states ‘S’: S = {S0, S1, S2, …, Sn} where ‘S0’ is the initial state of the system and Si, i ϵ {1, 2, …, n} is the faulty state (n is the number of treated states).
- -
The observables ‘O’: O = {O0, O1, O2, …, Om}. This is the apparent input of the model.
- -
An initial probability value (π) for each state. This consists of an estimate expressing the probability that initially the system is at a definite state
- -
A transition probability matrix (TM). This matrix represents the probability that the system moves from one state to another. The size of matrix (TM) is n by n.
- -
An output probability distribution matrix or emission matrix (EM). This expresses the likelihood that a certain measured sequence of values corresponds to a specific sequence of states. The size of matrix (EM) is n by m, where m is the number of observables.
In our system, the elements of the HMM are as follows [
30]:
- 53 main states, which encounter the healthy state, 12 states where there is a 1 mm crack in one piece of the 12 magnets located on the machine’s rotor, 36 states where there is a turn-to-turn short circuit in one slot of the 36 slots and 4 states where there is a 10% eccentricity fault on the left, right, upward and downward.
We mention that in this analysis, one type of fault is considered at a time; hence, there is no interrelation between the different types of faults.
Each of the above mentioned main states can propagate and reveal a new state. For example, let’s consider a scenario where the machine moves from the healthy state to the state of a turn-to-turn short circuit in one slot. In this case, the set of states that may face the machine can be expressed as
where S
H refers to the healthy state, S
F1-1 refers to the faulty state with one turn short circuited, S
F1-2 refers to the faulty state where two turns are short circuited, S
F1-13 refers to the faulty state where all the thirteen turns of the coil are circuited.
- Three distinct observation sequences per state. This expresses the observed data. In our case, the observation will be the measured values from the torque, temperature or vibration sensors. We mention that the raw data set, coming from the three sensors, is investigated over one machine revolution, and the number of sampling points is 400.
Considering the above example of turn-to-turn short circuit, the sequence of observation can be expressed as
where {SO
1-1} is a vector containing torque, temperature and vibration data in the case of one turn short circuited, {SO
1-2} is a vector containing torque, temperature and vibration data in the case of two turns short circuited, …
The trellis diagram representing this example is illustrated in
Figure 19 [
11,
31].
In
Figure 19, ‘Temp’ is for temperature and ‘Vib’ is for vibration. The trellis diagram in this figure illustrates the life cycle of the coil from the small-scale fault until the complete deterioration. However, the prognostic model that we are going to build will detect the presence of the fault at its early stage, and this is highlighted in red in the diagram.
Each type of fault has its own trellis diagram. If we consider the case of a 1 mm crack in one piece of the magnet, the states of the trellis diagram, other than the initial healthy state, will be 5: the first state refers to 1 mm crack in magnet 1, state 2 refers to the faulty state where the 1 mm crack deepens and becomes a 2 mm crack, state 3 refers to the faulty state where the 2 mm crack deepens and becomes a 3 mm crack, state 4 refers to the faulty state where the 3 mm crack deepens and becomes a 4 mm crack and state 5 refers to the faulty state where the crack becomes a complete fracture. Hence, in our case, we have a multilevel trellis diagram that can be schematically represented as in
Figure 20.
In our case, the considered states are the healthy state and the faulty states with primitive fault. The observations are a continuous signals function of time; the extracted features from those signals (average for torque and vibration signal and spectral amplitude for temperature) will be the input of the model.
A global HMM will be built where all the states and observables highlighted in red are grouped in a single model. The observed features {{SOH}, {SO1-1}, {SO2-1}, {SO3-1} …} will be linked to the discrete states of HMM (SH, SF1-1, SF2-1, SF3-1…) where each number designates a pre-settled state. For example, state ‘0’ designates the machine in the healthy case, state ‘1’ designates the machine in the case of turn-to-turn short circuit in slot 1, …, state ‘37’ designates the machine with crack in magnet 1, …, ‘49’ designates the machine with eccentricity fault… In the same context, observation ‘0’ is the range of average vibration detected by the vibration sensor when the machine is healthy, observation ‘1’ is the range of average vibration detected by the vibration sensor when the machine has a turn-to-turn short circuit fault in slot 1, …, observation ‘53’ is the range of average torque detected by the torque sensor when the machine is healthy…
In other words, the constructed model will encounter the first layers highlighted in red in
Figure 19 because these layers represent the system with the small scale faults we are interested in.
As we stated previously, the observations of the HMM will come from three observers: torque, vibration and temperature sensors. For each state of the machine there is data coming from the three sensors. However, some states share the same observations. For example, the temperature of the machine remains almost the same in the healthy case, case of eccentricity fault or case of crack in one magnet. Torque in the case of turn-to-turn short circuit remains the same wherever the short circuit is. The same is applied for the case of crack in one magnet or eccentricity fault. Hence, the total number of observations will be 94 encountering 53 vibration ranges, 4 torque ranges and 37 temperature ranges.
Accordingly, the size of the transition matrix is (53 × 53), the size of the emission matrix is (53 × 94), and the initial state of the machine is considered to be healthy.
The below ‘TM’ matrix is a schematic presentation of the transmission matrix.
The percentage that the machine remains healthy is 60%. The percentage that the machine becomes faulty is 40%. This 40% was distributed among the different considered faulty states according to the percentage of fault occurrence.
This self-correlation of the states is very common in hidden Markov models, and it is always observed as a strong diagonal in the transition matrix. We note also that faults of a similar nature have a similar probability of transition to crack in magnet 1 and crack in magnet 2 and crack in magnet 3 …
The below ‘EM’ matrix is a schematic presentation of the emission matrix.
The probabilities in the emission matrix can be arbitrary. However, in our HMM model, the numbers expressing the probabilities of emission between the states and the observations are inspired from the percentage of observations’ deviation between that of the healthy case and the faulty case [
15].
The TM and the EM constitutes the HMM of our model. The input sequence of observations will be features of vibration, torque and temperature data sensors. The decoding of the observation sequence will be executed using the Viterbi algorithm. Viterbi will elaborate the appropriate sequence of states by calculating the likelihood probability. A schematic representation of the combination between the HMM and the Viterbi algorithm, where the states and the observations are shown, is presented in
Figure 21.
For illustration, let’s consider a simple example where the sequence of observations is {SO
H, SO
H, SO
F-1} that corresponds to the sequence of states {S
H, S
H, S
F1-1}.
Figure 22 shows how this sequence will be decoded through the Viterbi algorithm and the corresponding sequence of states is detected.
At the start, the probability that the system is healthy is ‘0.6’, and the probability that the system is faulty is ‘0.4’. The probability that the system state is SH if the observation is SOH is ‘0.6’, and the probability that the system state is SF1-1 if the observation is SOH is ‘0’.
Hence, the weight probability from ‘Start’ to ‘SH’ is 0.36 (0.6*0.6), and the weight probability from ‘Start’ to ‘SF1-1’ is 0 (0.4*0). Viterbi will choose the path having the highest probability, which is in this case 0.36, and the selected path is highlighted in red. For simplicity, we will consider the observation from one sensor; the selected one is the vibration sensor.
The second observation is also SOH. The probability of remaining in state SH is ‘0.6’ (from the transition matrix). The probability of being in state SH if the observation is SOH is ‘0.6’. The probability from the previous state is 0.36. Hence, the weight probability of remaining in state SH when the second observation is SOH will be 0.1296 (0.36×0.6×0.6). Following the same logic calculation, we got the weighted probabilities of all the paths. The path of higher probability, at each observation time, is highlighted in red. Accordingly, the adequate sequence of states is {SH, SH, SF1-1}. This path, selected by the Viterbi algorithm, is called the ‘survivor path’.
After the elaboration of the machine’s condition state, the next and final step will be the calculation of the remaining useful life (RUL).