Next Article in Journal
Innovation of the Component GARCH Model: Simulation Evidence and Application on the Chinese Stock Market
Previous Article in Journal
An Integrated Two-Level Integer Linear Program (ILP) Model for Elective Surgery Scheduling: A Case Study in an Italian Hospital
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Analysis of k-Out-of-N-Systems with Different Units under Simultaneous Failures: A Matrix-Analytic Approach

by
Delia Montoro-Cazorla
1 and
Rafael Pérez-Ocón
2,*
1
Department of Statistics and Operational Research, University of Jaén, 23071 Jaén, Spain
2
Department of Statistics and Operational Research, University of Granada, 18071 Granada, Spain
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(11), 1902; https://doi.org/10.3390/math10111902
Submission received: 4 May 2022 / Revised: 27 May 2022 / Accepted: 30 May 2022 / Published: 2 June 2022
(This article belongs to the Section Engineering Mathematics)

Abstract

:
An N-system with different units submitted to shock and wear is studied. The shocks cause damage and, eventually, simultaneous failures of several units. The units can also fail due to internal failures. At random times, the system is inspected, and the down units are simultaneously replaced by identical ones. The arrival of shocks is governed by a Markovian arrival process. The operational times and the interarrival times between inspections follow phase-type distributions. The generator of the multidimensional Markov process modeling the system is constructed. This is performed introducing indicator functions for the different transition rates among the units using the algorithm of Kronecker. This is a general Markov process that can be applied for modeling different reliability systems depending on the structure of the units and how the systems operate. The general model is applied to the study of k-out-of-N systems, calculating the main performance measures. A practical example is presented showing the approximation of the model to a system with units following different Weibull distributions.

1. Introduction

The use of wireless sensor networks (WSN) is broadly extended in high technology and communications’ systems. A survey of the use, properties, characteristics and applications is given in [1,2]. The WSN frequently operates in remote areas under hard environmental conditions. The elements are formed by nodes operating unattended and the miniaturization implies that the elements in the nodes are so close that the arrival of a shock can produce the failure of several elements simultaneously. Other structures of frequent use are the high performance computing (HPC) systems, composed by aggregate computing power in order to reach higher performance than particular computers or workstations, solving problems in science, industry, and business. These two structures are formed by a large number of components and systems, generally including redundancy and the possibility of simultaneous failures. In the WSN, the maintenance is usually performed by inspections at certain times for reducing costs and a higher effectivity.
In references [3,4], systems under simultaneous failures have been studied in the discrete case. In the first one, series-parallel and parallel-series systems of order 3 with lifetimes of the components not depending on the time are studied, and the reliability is calculated by using combinatorial methods. In the second one, the reliability of a network is calculated by using computational calculations, and the lifetime of the components are discrete random variables.
The study of simultaneous failures in continuous time is not frequent in the literature. In [5], an HPC system with multiple failures is studied. The failure time of the nodes is assumed to follow Weibull distributions, and the authors develop a reliability model for a system with k nodes and simultaneous failures. The reliability, the failure rate, and the mean time to failure of the system are calculated. Dependence between the reliability measures and the number of nodes is detected. The analytical structure proposed by the authors is a multivariate Weibull model based on the Marshall–Olkin multivariate exponential distribution. The Markov processes have been an appropriate model in reliability, see [6,7], among many others. A limitation of the Markov processes in modeling systems is that the staying times in states are exponentially distributed, or, equivalently, that the instantaneous transition rates do not depend on the time; they are constant. This is a severe restriction and in many cases is not satisfied. We present an extension of this last paper to the case in which the staying times follow phase-type distributions. The class of the phase-type distributions is dense, in the sense of the distributions, in the family of the distribution functions defined on the positive real half-line. Thus, this assumption supposes that the model can be applied to systems in which the staying times follow distributions as Weibull, Gamma, Erlang, and others; see [8]. Moreover, it can also be applied to datasets coming from lifetimes of components of systems. In both cases, a model governing the system by approximating the lifetimes of the components by Phase-type distributions can be constructed. Some models using Phase-type distributions in reliability and other domains are [9,10], among others. In [9], a reliability redundancy allocation problem is developed on a non-repairable system with heterogenous components where lifetimes follow PH distributions. In [10], these distributions are involved in a queuing model applied to nodes in a WSN.
We propose a model based in the matrix-analytic methods (MAMs), extending other previous papers. The MAMs have proven to be a flexible structure for modeling arrival of events and contain many familiar processes as very special cases, see [11,12,13], among others. A survey of these MAMs applied to different domains is in [14]. The introduction of the MAMs in the analysis of the systems implies the use of the following elements: the Markovian arrival processes, the Phase-type distributions, and the Kronecker operations; they play a central role in the study and calculations of the performance measures, and extend many other particular systems given the generality of these structures. The Kronecker operations are studied in [15,16]. In [17], an updated revision of the Markovian arrival process, an essential element in the MAMs, is presented. A survey of the techniques through the state-space models are investigated and compared, in order to facilitate the appropriate model for calculating the reliability and the availability in a dynamic environment to practitioners [18]. When the model is constructed by using these methods, its applicability is extended, and the stochastic process governing the system is a multidimensional Markov process analytically tractable.
The system we present is submitted to shocks, and the shock models have been studied in the literature under different methodologies and applied to different systems. The initial systematic study for shock models is [19]; from then, many extensions have been following different ways. In [20], the behavior of different populations under shock models is studied. In [21], a complete study of the different types of failures and models is performed. In [22], different shock models following non-homogeneous Poisson processes are applied for calculating the reliability of the systems. In [23], an analysis of the availability is made in a system affected by attacks. One of the first papers applying the phase-type distributions as a model for arrival of shocks is [24], extending previous papers. The shock models and the continuous Markov processes have been considered suitable models for analyzing the ageing of components and systems [25]. The MAP as a model for the arrival of failures is introduced in [26], and it is applied in [27,28,29,30], for studying different reliability systems. In [31], a system under shocks, damages, and other elements is modeled by a Markovian arrival process with marked arrivals.
We study an N-system with different units. It is submitted to internal failures affecting single units, and shocks that can produce damage and, eventually, the failure of one or several units. The maintenance is performed by inspections, replacing all the down units (if any) with new and identical ones. The case under exponential units has been studied in [6]. In the present paper, we extend it to a more general system: the lifetimes of the units follow PH-distributions, extending the exponential lifetimes; the arrival of shocks is governed by a Markovian arrival processs (MAP), extending the Poisson process of the previous paper; the time between inspections follows a PH-distribution, extending the exponential case; the size of the damage produced by the shocks is introduced in the study, and it follows a discrete PH-distribution, extending the exponential case, where the number of failures follows a Geometric distribution. Clearly, the applicability of the present paper is greater than the previous ones. The study we carry out does not follow straightaway from the exponential version; the matricial structure in the present one has to be constructed—it is not directly derived from the exponential case. A Markov process is not a suitable structure for being applied to systems under simultaneous failures; it must be treated in an appropriate way, and this is solved by introducing the Kronecker operations involving indicator functions allowing for the preservation of the Markovian structure for systems whose units are not exponentially distributed.
For this system, a multidimensional Markov process governing the system under the conditions above given is constructed. The main quantities of interest associated with the system can be calculated in an well-structured form in transient and stationary regimes. In return, the dimensions of the involved matrices are very large even in systems with a few units. Formally, the construction of the generator is not affected by the orders of the involved matrices, though it is complex due to the simultaneity of the events. We illustrate the general procedure applying it to a k-out-of-N system, of frequent use in engineering; an application for N = 3 is carried out; in it, the performance measures usual in reliability are calculated, and it is shown how the model can be applied taking into account the structure of the system and how it operates. In [32], a computational method for determining the reliability of a k-out-of-n system with different components is constructed; in the present paper, we propose an alternative procedure applying the MAMs, and calculating other performance measures using algebraic calculations with matrices. A numerical application is added illustrating how the model can be constructed when the units follow Weibull distributions.
The paper contributes to the study of general reliability systems in several ways: (1) The units are different and they are not exponentially distributed; the lifetimes of the units follow PH-distributions, this allows the approximation of systems with units following general distributions; (2) the arrival of shocks is governed by an MAP, this allows dependence among the interarrival times. In addition, they comprise the usual arrival distributions; (3) the shocks produce damage and eventually the failure; (4) there are simultaneous failures of units and simultaneous replacements are carried out after an inspection; (5) the generator construction and the performed reliability analysis can be applied to n-systems with dissimilar components arranged in different structures. The general hypotheses established for this model and the compact expressions achieved by using the MAM methodology expand its application possibilities.
The paper is organized as follows: in Section 1, we give the definitions of Phase-type distributions and Markovian arrival processes and comment on how these operate. In Section 2, the assumptions of the system are established and the multidimensional Markov process governing the system is constructed. In Section 3, the model is applied to a k-out-of-N system, obtaining explicit expressions for the performance measures. Section 4 is a methodological one, and a detailed numerical application to a 2-out-of-3 system is performed, showing the steps for applying the calculations to other systems.

Preliminaries

The present section introduces the basic definitions used in the paper and some comments about how the MAMs operate.
Definition 1.
If A and B are rectangular matrices of orders m 1 × m 2 a n d n 1 × n 2 , respectively, their Kronecker product A B is the matrix of order m 1 n 1 × m 2 n 2 , written in compact form as ( a i j B ) .
The Kronecker sum of the square matrices C and D of orders p and q, respectively, is defined by C D = C I q + I p D , where I k denotes the identity matrix of order k.
For more details about these operations, see references [12,15].
Definition 2.
The distribution function H ( · ) on [ 0 , ) of a phase-type distribution is
H ( x ) = 1 α exp ( T x ) e , x 0
It is associated with a finite Markov process with one absorbent state and initial vector α . Matrix T is the submatrix of the generator of the process restricted to the transient states, and it is non-singular. Vector e is a column vector of 1’s. The absorption column vector is denoted by T 0 , and it satisfies Te + T 0 = 0 . The order of matrix T and vectors T 0 , e are the same. It is said that the distribution has representation ( α , T ) , and it is written P H ( α , T ) .
H ( x ) is the distribution function of the first passage time of the Markov process for the absorbent state given the initial vector α . The order of the distribution is the order of matrix T . It is denoted a PH-distribution. A PH-renewal process is a renewal process whose distribution function is given by a PH-distribution.
The discrete version of a PH-distribution is defined in [12], and it is denoted by PH d -distribution.
Definition 3.
Let D be an irreducible infinitesimal generator of a Markov process. Let a sequence of matrices D k , k = 1 , 2 , , n be non-negative and matrix D 0 have non-negative off-diagonal entries. The diagonal entries of D 0 are strictly negative, and it is non-singular. The following is satisfied:
D = D 0 + k = 1 n D k .
Associated with this Markov process D , there is a renewal Markov process performing an arrival process to real line operating as follows. It is assumed that there are n different types of arrival. Matrix D 0 governs the interarrival times, and matrix D k , k = 1 , 2 , , n , governs the occurrence of the arrival of type k . This is the Markovian arrival process (MAP) associated with the initial Markov process, with parameter matrices D k , k = 1 , 2 , , n . It is denoted by M A P ( D 0 , D k ) , k = 1 , , n . The order of the MAP is the order of the involved matrices. The states of the Markov process D are denominated virtual phases of the MAP, and they follow exponential times.
A P H ( α , T ) -distribution can be interpreted as follows. While operating, the process occupies exponential states (phases) randomly, and it finishes when it reaches the absorbent one. The PH-renewal process associated with this PH-distribution is a renewal one that reinitiates with matrix rate T 0 α . A M A P ( D 0 , D 1 ) with two parameter matrices extends the concept of PH-renewal process when, after a renewal, the following period initiates according to a rate depending on the phase occupied when the renewal occurs; the matrix including the different initial rates after a renewal is D 1 . Matrix D 0 represents the change of virtual phases not corresponding to renewals. The renewals will be interpreted as shocks arriving to the system.

2. The System

We consider a system with N independent units under the conditions given in the Introduction. The system is renewed randomly, replacing the non-operational units, if any, with new and identical ones after inspections. The internal failures, the arrival of shocks, the damage caused by the shocks, and the number of units affected by the shocks are random, and they occur according to the following assumptions.
Assumption 1.
The time of the internal failure of unit i follows a P H ( α i , T i ) -distribution of order m i , i = 1 , 2 , , N .
Assumption 2.
The shocks arrive following an M A P ( C 0 , C 1 ) with initial vector c and order a.
Assumption 3.
The shocks cause damage or failure. The size of the damage is governed by a discrete P H d ( β , D ) -distribution with initial probability vector β and order r.
Assumption 4.
The time between consecutive inspections follows a P H ( γ , L ) -distribution of order b. The times of inspection are negligible.
The damage caused by a shock can be interpreted as a counter before the failure, in such a way that, after a shock, the counter of the units changes and the absorption in the discrete PH-distribution governing the damages would indicate the failure of the corresponding unit.
First, we define the internal state of the units. The internal state of unit i is denoted by ε i . These states are characterized by the internal phase and the phase of damage. Unit i is operational if it occupies any operational phase of P H ( α i , T i ) and the same for P H d ( β , D ) ; the corresponding absorption vectors for these distributions are T i 0 , and D 0 , respectively. The set associated with this case is { ( j , k ) , 1 j m i , 1 k r } . In other cases, it will be non-operational, the set associated is { m i + 1 } { r + 1 } , and it is formed by the corresponding absorbent phases of the previous PH-distributions. We denote by S i the set formed by the union of these sets:
S i = { ( j , k ) , 1 j m i , 1 k r } { m i + 1 } { r + 1 }
Vector ε = ( ε 1 , ε 2 , , ε N ) , ε i S i , denotes the set of the internal states of the units. The macro states of the system are determined by vectors ε i , the virtual phases ( ν ) of the MAP governing the arrival of shocks and the phases (l) of the PH-distribution governing the time between inspections. They are denoted by
S = { ( ε 1 , ε 2 , , ε N , v , l ) : ε i S i , 1 i N , 1 v a , 1 l b }
We denote by X ( t ) the number of non-operational units at time t. We will construct the multidimensional Markov process { X ( t ) , t 0 } with space of macro states S.
If x i = 0 , unit i is operational and x i = 1 if not, i = 1 , 2 , , N . Vector x = ( x 1 , x 2 , . , x N ) is a state-vector of the system. Given a state-vector x , the occurrence of a failure (internal or shock) or inspection causes a transition to another state-vector y , and the changes in the components of vector x are indicated by ω i x , y = 1 if x i y i and ω i x , y = 0 in other case (there is no change). This is an indicator of the change of a unit, and vector ω x , y = ( ω 1 x , y , ω 2 x , y , . , ω N x , y ) registers the changes of the units in a transition x y . The number of units changing in such transition is the sum over i of the components of the vector.
The number of non-operational units in vector x is s x = i = 1 N x i . The possible values for s x are { 0 , 1 , , N } . The set { x : s x = m } is formed by the state-vectors having m non operational units (non-null components). These sets form a partition of the set of state-vectors. Let two state-vectors x , y and the Kronecker indicator δ i , j = 1 if i = j and δ i , j = 0 if i j ; expression δ s x s y , 0 indicates that vectors x , y have the same number of operational units.
We calculate a general Markov process governing the system considering that the system fails when the last operational unit fails; this is equivalent to a system organized in parallel. We will show that the case of a 1-out-of-N system is obtained from the present model in a straight way. First, the transition rate matrices among the state-vectors are constructed and then the generator Q including the transition rate matrices among the states. Once Q is constructed, the transition probability matrix P ( t ) = ( P i j ( t ) ) with P i j ( t ) = P { X ( t ) = j / X ( 0 ) = i } is calculated from expression P ( t ) = exp ( Q t ) , t 0 , P ( 0 ) = I , by using computational calculations.
The transition among the state-vectors occurs by the arrival of one of the events: the arrival of a shock, an internal failure, or an inspection. The construction of the Markov process governing the system follows the line in [6], but this does not mean that they are calculated straightaway; new matrices are necessary and the operations of Kronecker play a central role. Going forward, the subindices of the identity matrices indicate their order, and it is the same for vectors e . The transition among the state-vectors are determined by the state of the units (up or down) and the indicator of change ω .
If the system occupies state-vector x and a shock arrives, there is a possible transition to state-vector y (that can be the same vector x) according to the rates among the phases of the state-vectors; the rate of change of unit i due to a shock is given by
φ i x , y = e m i D 0 i f ω i x , y = 1 I m i D i f ω i x , y = 0 , x i = 0 1 i f ω i x , y = 0 , x i = 1
The first line indicates that there is a change in unit i, the failure of the unit is governed by D 0 , and it can occur from any phase ( e m i ). The Kronecker product expresses the changes in terms of the corresponding matrices and vectors. The second line indicates that there is no change (given by I m i ), and a shock arrives (counter D ) producing no change. In the third line, the unit is not operational.
The expression of the change for all units in vector x to the arrival of a shock is calculated using the Kronecker product, and it is
φ x , y = φ 1 x , y φ 2 x , y φ N x , y
If an internal failure arrives, the rate among the phases of the state-vectors is given by reasoning similarly as above for the arrival of internal failures, and we have for unit i:
Ψ i x , y = T i 0 e r i f ω i x , y = 1 T i I r δ s y s x , 0 + I m i I r ( 1 δ s y s x , 0 ) i f ω i x , y = 0 , x i = 0 1 i f ω i x , y = 0 , x i = 1
The first line indicates that the failure of the unit is governed by T i 0 , and it occurs in any phase of damage ( e r ). The second line indicates that there is a change in an operational phase without damage ( T i ) or there is a change in another unit; in the second summand, a change is produced in one unit different from i, and this is represented by 1 δ s x s y , 0 . For the total of units, we have
Ψ x , y = Ψ 1 x , y Ψ 2 x , y Ψ N x , y
Note that expressions φ x , y , Ψ x , y are similar, though in the previous one the shocks can be simultaneous and in the last the internal failures cannot. The difference is that, in expression Ψ i x , y , simultaneous failures are not allowed; this is performed incorporating the Kronecker’s delta.
After the occurrence of an inspection, the unit i is affected in two ways:
H i x , y = α i β i f ω i x , y = 1 I m i I r i f ω i x , y = 0 , x i = 0
The first line indicates a replacement of the down unit that reinitiates ( α i ) and the counter reinitiates ( β ). The second line indicates that nothing occurs. For the total of units, we have
H x , y = H 1 x , y H 2 x , y H N x , y
The matrix transition rates among the state-vectors depend on the number of units failing, which is s y s x , and the special case s x = N . They are
Λ x , y = φ x , y C 1 I b + ( Ψ x , y I a I b ) δ s y s x , 1 i f s y s x 1 ( a t l e a s t o n e u n i t f a i l s )
Λ x , y = i u p ( T i I r ) I a I b + + i u p ( I m i I r ) C 0 I b + + i u p ( I m i I r ) I a ( L + L 0 γ δ s x , 0 ) + + i u p ( I m i D ) C 1 I b i f s y s x = 0 , s x N
Λ x , y = ( C 0 + C 1 ) I b + I a L i f s y s x = 0 , s x = N
Λ x , y = H x , y I a L 0 γ i f s y s x 1 , s x = N ( a r e p l a c e m e n t )
The matrix grouping the transition rates between states i j in terms of the state-vectors is
Q ij = Λ xy { x : s x = i } { y : s y = j }
and, finally, the generator of the Markov process in terms of the macro states is
Q = ( Q ij ) 0 j N 0 k N
The order of Q is
k = 0 k = N 1 i 1 i k N m i 1 m i 2 m i k · r N k a b
If X ( 0 ) = 0 , the initial block vector of the Markov process is
ρ = ( ρ 0 , 0 , , 0 ( N ) ) w i t h ρ 0 = ( ( α 1 β ) ( α N β ) c γ )
Vector ρ 0 is of order
i = 1 N m i r N a b
The number of 0’s in the expression of ρ is the difference between the order of Q and the order of the first component.
The stationary distribution is obtained as in the exponential case, changing the corresponding matrices [6].

3. k-Out-of-N Systems

The model we have constructed is organized as in parallel, since the system fails when the last operational unit fails. One of the advantages of the use of MAMs is that they are versatile and can be applied to systems under other assumptions with the appropriate changes. We dedicate this section to studying the k-out-of-N systems under the assumptions and notation in Section 2. This class of systems is of frequent use in the applications.
The system is operational if at least k units are. The set of macro-states is S = { 0 , 1 , , N } ; this set can be partitioned in S U = { 0 , 1 , , N k } and S D = { N k + 1 , , N } , operational and non-operational macro-states. Grouping the set of non-operational macro-states in one, denoted by 0 ¯ , the set of macro-states of the system is S k / N = { 0 , 1 , , N k , 0 ¯ } . The initial probability vector (3) and generator Q in (2), are, respectively,
ρ k / N = ( ρ 0 , 0 , , 0 ( N k ) , 0 ) Q k / N = ( Q ij ) i , j = 0 , 1 , , N k , 0 ¯
The occupancy probabilities of the operational macro states are ρ 0 P 0 j ( t ) e , j S U . The reliability R ( t ) is the time of the first failure of the system; it is a P H ( ρ k / N * , Q k / N * ) -distribution, where ρ k / N * and Q k / N * are derived from ρ k / N and Q k / N , respectively, eliminating the row and column associated with 0 ¯ . The availability at time t is given by
A ( t ) = ρ 0 j S U P 0 j ( t ) e
Particular systems are the series and parallel ones. As particular cases, we have the 1-out-of-N system (parallel) and the N-out-of-N system (series). The corresponding renewal processes associated with the first time of failure can be studied straightaway as can be seen in references from the authors.
Let a k-out-of-3 system be N = 3 . For k = 1 , it is a parallel system. The set of operational and non-operational states are S U = { 0 , 1 , 2 } , S D = { 3 } , respectively. The failure of the system follows a P H ( ρ 1 / 3 * , Q 1 / 3 * ) -distribution with
ρ 1 / 3 * = ( ρ 0 , 0 , 0 ) Q 1 / 3 * = Q 00 Q 01 Q 02 Q 10 Q 11 Q 12 Q 20 0 Q 22
The mean time of failure is ρ 1 / 3 * ( Q 1 / 3 * ) 1 e . The renewal of the system occurs when an inspection arrives while the system occupies state 3. The time until the renewal follows a P H ( g 1 / 3 , G 1 / 3 ) -distribution with
g 1 / 3 = ( ρ 0 , 0 , 0 , 0 ) G 1 / 3 = Q 00 Q 01 Q 02 Q 03 Q 10 Q 11 Q 12 Q 13 Q 20 0 Q 22 Q 23 0 0 0 Q 33
For k = 2 , S U = { 0 , 1 } , S D = { 2 , 3 } . The failure of the system follows a P H ( ρ 2 / 3 * , Q 2 / 3 * ) -distribution with
ρ 2 / 3 * = ( ρ 0 , 0 ) Q 2 / 3 * = Q 00 Q 01 Q 10 Q 11
The mean time of failure is ρ 2 / 3 * ( Q 2 / 3 * ) 1 e . The renewal of the system occurs when an inspection arrives and the system is down; it follows a P H ( g 2 / 3 , G 2 / 3 ) -distribution with
g 2 / 3 = ( ρ 0 , 0 , 0 , 0 ) G 2 / 3 = Q 00 Q 01 Q 02 Q 03 Q 10 Q 11 Q 12 Q 13 0 0 Q 22 Q 23 0 0 0 Q 33
For k = 3 , S U = { 0 } , states 1 , 2 , 3 , are non-operational. The first time of failure follows a P H ( ρ 3 / 3 * , Q 3 / 3 * ) with
ρ 3 / 3 * = ρ 0
and
Q 3 / 3 * = Q 00
The mean time of failure is ρ 0 Q 00 1 e .
The renewal of the system occurs when an inspection arrives while the system occupies a non-operational state. The time until the renewal of the system follows a P H ( g 3 / 3 , G 3 / 3 ) with vector
g 3 / 3 = ( ρ 0 , 0 , 0 , 0 )
and
G 3 / 3 = Q 00 Q 01 Q 02 Q 03 0 Q 11 Q 12 Q 13 0 0 Q 22 Q 23 0 0 0 Q 33
These are the fundamental matrices for calculating the performance measures associated with the system.

4. Numerical Application

The expressions above can be directly applied when the lifetimes of the units are governed by PH-distributions. In engineering, the Weibull and the lognormal distributions, among others, are frequently used. The Weibull distribution plays a central role in reliability, a text presenting in an integrated form different types of Weibull models is [33]; it is a complete survey, classifying different models, including an inferential study and applying to several domains with special dedication to reliability. We illustrate the numerical application to k-out-of-3 systems with units governed by Weibull distributions. The first step is to fit a PH-distribution, and then to apply the results in the previous section.
The reliability function of a two parametric ( λ , β ) Weibull distribution is
W ( t ) = exp [ ( λ t ) β ] , t 0
with mean and variance given, respectively, by
m = ( 1 / λ ) Γ [ 1 + ( 1 / β ) ] σ 2 = ( 1 / λ ) 2 { Γ [ 1 + ( 2 / β ) ] Γ 2 [ 1 + ( 1 / β ) ] }

4.1. Fitting PH-Distributions

We consider three different Weibull distributions corresponding to the three units with the parameters given below. For every one, the mean and the variance are calculated. A fit of a PH-distribution is performed applying the EM algorithm [8]. The number of data are n = 400 , and 1000 iterations are realized. In every case, the first PH-distribution selected for fitting is of order 2; if the fitting is rejected, then a PH-distribution of order 3 is fitted, and so on. The goodness of the fit is calculated using the Kolmogorov–Smirnov test. In the cases we present, two of them are well fitted to PH-distributions of order 2, and the other to a PH-distribution of order 3. In all cases, the statistic calculated Z from the sample is less than the statistics Z 0.05 = 0.0675 , and all the fittings are not rejected.
We assume a prefixed unit time. The goodness of the fits guarantees a good approximation of the model and consequently also for the performance measures to the initial system. The calculations using the MAMs are more tractable than the ones with the Weibull distributions. In the following figures, when the first significant decimal figure is after a ten thousandth, it approaches zero.
Unit 1 is assumed to follow a Weibull distribution with λ 1 = 1 , β 1 = 1.2 . Then, m 1 = 0.9407 u . t . , σ 1 = 0.7872 u . t . The representation ( α 1 , T 1 ) of the PH-distribution fitted is
α 1 = ( 0.6774 , 0.3226 ) T 1 = 1.8597 1.7830 0.0043 1.7070
Unit 2 is assumed to follow a Weibull distribution with λ 1 = 1 , β 1 = 1.5 . Then, m 2 = 0.9027 u . t . , σ 2 = 0.6129 u . t . The representation ( α 2 , T 2 ) of the PH-distribution fitted is
α 2 = ( 0.9571 , 0.0429 ) T 2 = 2.1664 2.1632 0 2.1664
Unit 3 is assumed to follow a Weibull distribution with λ 1 = 1 , β 1 = 2 . Then, m 3 = 0.9407 u . t . , σ 3 = 0.7872 u . t . The representation ( α 3 , T 3 ) of the PH-distribution fitted is
α 3 = ( 0.0107 , 0.9893 , 0 ) T 3 = 3.3649 0 0 0 3.3554 3.3512 3.3542 0 3.3542

4.2. System 2-Out-of-3

We apply the results to a 2-out-of-3 system. The matrices of the MAP governing the arrival of shocks are assumed to be
c = ( 1 , 0 ) C 0 = 3.30 1.2 1.2 3.5 C 1 = 1.10 1 1.3 1
The probability of a fatal shock is p = 0.15 , and q = 0.85 is the probability that it only causes damage. For simplifying the calculations, the occurrence of a fatal shock after k 1 non fatal shocks follows a geometric distribution G ( p ) defined by p k = ( 1 p ) k p ; k = 1 , 2 , ; it is a P H d ( β , D ) -distribution with β = 1 , D = 1 p = 0.85 . The time between inspections is assumed to be exponentially distributed with mean 1 / 2 ; then, γ = 1 and L = 2 .
The rate of failure of the system is
r ( t ) = ρ 2 / 3 * exp ( Q 2 / 3 * t ) ( Q 2 / 3 * e ) ρ 2 / 3 * exp ( Q 2 / 3 * t ) e , t 0
The availability A ( t ) , the reliability R ( t ) , and the rate of system failure r ( t ) for different values of time are obtained in Table 1. The last column represents the stationary values of the performance measures.
The availability decreases and approaches the stationary value for t 10 , the failure rate increases with time, and the reliability decreases quickly.
These are performance measures for the system. We consider the cycles associated with the system. A cycle is the timespan of two consecutive operational and non-operational periods, the length of it is denoted by T C . The mean number of renewals is ν R = 1 / E [ T C ] . The study of these measures in the cycles is relevant, since it allows us to know the behavior of the system in a transient regime. Denoting by T U , T D the up and down times in a cycle, respectively, we have T C = T U + T D . The fraction of time that the system is operational in a cycle is
f C = E [ T U ] E [ T C ] = ρ 2 / 3 * ( Q 2 / 3 * ) 1 e g 2 / 3 ( G 2 / 3 ) 1 e
The values of these measures are E [ T U ] = 0.7187 , E [ T C ] = 1.2187 , v R = 0.8206 , f C = 0.5817 . It is deduced that the system is operational close to 58 % of the time in a cycle, and the failure rate is less than the one in the transient case for t > 10 1 .
In order to improve the previous performance measures, the only quantity that can be controlled by the researcher in the evolution of the system is the inspection. We take into account this to study how the performance measures change in terms of the inspection time. We consider the stationary regime. In Table 2, the numerical values of the availability (A), the mean time operational in a cycle ( E [ T U ] ), and the fraction of time that the system is operational in a cycle f C are calculated for different values of μ I = 1 / L .
The variation of these performance measures is observed; availability, the mean time operational in a cycle, and the fraction of time that the system is operational in a cycle decrease when the mean time of the inspections increases.
Finally, we give a graphical comparison among the studied 2-out-of-3 system, and series and parallel ones. In Figure 1, the availability of these three systems is plotted.
As expected, the availability is significantly higher when units are arranged in parallel. In a similar way, other performance measures could be compared, and other numerical analysis can be performed according to the interest of the researcher in the practice.

5. Conclusions

The major novelty in this paper is the construction of a Markovian model for the study of systems under simultaneous failures with non-exponential units. Another contribution is the applicability to systems whose units have different lifetimes or only observed values of the lifetimes are available. The Markovian structure governing the system is maintained though there are simultaneous failures, and the analytical structure of this type of process is calculated, obtaining explicit expressions for the performance measures of the system in a well-structured form. The system we consider is submitted to shocks, damages, internal failures, inspections, replacements, and multiple failures; some particular cases can be directly deduced, eliminating some of them. The versatility of the MAMs allows the application of the model to other systems, such as the k-out-of-N ones, deduced from the general model directly. The study allows for extending methods previously proposed by other authors to systems under multiple failures and incorporating damages, such as in the wireless sensor networks (WSN) and in the high performance computing (HPC) systems, supplying a more powerful model. The possibility of the applications depends on the order of the involved matrices, and the use of computational programs is essential for solving the systems under the conditions given.

Author Contributions

Formal analysis, D.M.-C. and R.P.-O.; Investigation, R.P.-O.; Methodology, D.M.-C. and R.P.-O.; Software, D.M.-C.; Supervision, R.P.-O.; Writing—original draft, D.M.-C.; Writing—review & editing, R.P.-O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Notation

P H ( α i , T i ) distribution of the lifetime of unit i.
M A P ( C 0 , C 1 ) shock arrival process.
P H d ( β , D ) distribution of the size of the damage caused by shocks.
P H ( γ , L ) distribution of inter inspections time.
I r identity matrix of order r.
e r column vector of 1’s of order r.
ε i internal state of unit i.
ε = ( ε 1 , ε 2 , , ε N ) vector collecting the internal states of all units.
x i indicator of operational (0) or non-operational state (1) in unit i.
x = ( x 1 , x 2 , . , x N ) vector collecting the operational or non-operational states of all units.
s x non-operational units in vector x
ω i x , y indicator of a change of unit i in the transition from x to y .
ω x , y = ( ω 1 x , y , ω 2 x , y , . , ω N x , y ) vector collecting the indicators of change of all units, in the transition from x to y .
φ i x , y rate of change of unit i due to a shock, in the transition from x to y .
φ x , y = φ 1 x , y φ 2 x , y φ N x , y vector collecting the rate of change of all units i due to a shock, in the transition
from x to y .
Ψ i x , y rate of change of unit i due to internal failure, in the transition from x to y .
Ψ x , y = Ψ 1 x , y Ψ 2 x , y Ψ N x , y vector collecting the rate of change of all units i due to an internal failure, in the
transition from x to y .
H i x , y the rate of change of unit i due to the arrival of an inspection, in the transition
from x to y .
H x , y = H 1 x , y H 2 x , y H N x , y vector collecting the rate of change of all units i due to the arrival of an inspection,
in the transition from x to y .
Λ x , y block rate transition from x to y .
Q generator of the model.
ρ k / N , Q k / N initial probability vector and generator, respectively, for a K-out-of N system.
P H ( ρ k / N * , Q k / N * ) distribution of the lifetime of a K-out-of N system.
P H ( g k / N , G k / N ) distribution of the renewal time in a K-out-of N system.
E [ T U ] expected length of the up period of the system.
E [ T D ] expected length of the down period of the system.
E [ T C ] expected length of a cycle of the system.
f C fraction of time that the system is operational in a cycle.
ν R the mean number of renewals.
A ( t ) Availability of the system in time t, the stationary value is A.
R ( t ) Reliability of the system in time t.
r ( t ) Rate of failure of the system in time t.

References

  1. Akyildiz, I.F.; Su, W.; Sankarasubramaniam, Y.; Cayirci, E. Wireless sensor networks: A survey. Comput. Netw. 2002, 38, 393–422. [Google Scholar] [CrossRef] [Green Version]
  2. Lee, S.; Younis, M. Recovery from multiple simultaneous failures in wireless sensor networks using minimum Steiner tree. J. Parallel Distrib. Comput. 2010, 70, 525–536. [Google Scholar] [CrossRef]
  3. Satoh, M.; Sasaki, M.; Yuge, T.; Yanagi, S. Reliability of 3-State Device Systems with Simultaneous Failures. IEEE Trans. Reliab. 1993, 42, 470–477. [Google Scholar] [CrossRef]
  4. Song, W.T.; Cui, P.L.; Chen, J.; Wu, B. System reliability of stochastic networks with multiple reworks. Reliab. Eng. Syst. Saf. 2018, 169, 257–268. [Google Scholar] [CrossRef]
  5. Thanakornworaki, T.; Nassar, R.; Leangsuksun, C.B.; Paun, M. Reliability model of a system of k nodes with simultaneous failures for high-performance computing applications. High Perform. Comput. Appl. 2013, 27, 474–482. [Google Scholar] [CrossRef]
  6. Montoro-Cazorla, D.; Pérez-Ocón, R. Constructing a Markov process for modelling a reliability system under multiple failures and replacements. Reliab. Eng. Syst. Saf. 2018, 173, 34–74. [Google Scholar] [CrossRef]
  7. Fa, H.; Xu, Z.; Chen, S. Optimally maintaining a multi-state system with limited imperfet preventive repairs. Int. J. Syst. Sci. 2015, 46, 1729–1740. [Google Scholar]
  8. Asmussen, S.; Nerman, O.; Olsson, M. Fitting phase-type distribution via the EM algorithm. Scand. J. Stat. 1996, 23, 419–441. [Google Scholar]
  9. Kim, H.; Kim, P. Reliability models for a nonrepairable system with heterogeneous components having a phase-type time-to-failure distribution. Reliab. Eng. Syst. Saf. 2017, 159, 37–46. [Google Scholar] [CrossRef]
  10. Dey, S.; Deepak, T.G. A matrix analytic approach to study the queuing characteristics of nodes in a wireless network. OPSEARCH 2019, 56, 477–496. [Google Scholar] [CrossRef]
  11. Asmussen, S. Matrix-analytic Models and their Analysis. Scand. J. Stat. 2000, 27, 193–226. [Google Scholar] [CrossRef]
  12. Neuts, M.F. Matrix-Geometric Solutions in Stochastic Models—An Algorithmic Approach; John Hopkins University Press: Baltimore, MD, USA, 1981. [Google Scholar]
  13. Latouche, G.; Ramaswami, V. Introduction to Matrix Analytic Methods in Stochastic Modeling; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 1999. [Google Scholar]
  14. He, Q.-M. Fundamentals of Matrix-Analytic Methods; Springer: New York, NY, USA, 2014. [Google Scholar]
  15. Bellman, R. Introduction to Matrix Analysis; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 1997. [Google Scholar]
  16. Graham, A. Kronecker Products and Matrix Calculus with Applications; Dover Publications Incorporation: New York, NY, USA, 2018. [Google Scholar]
  17. Artalejo, J.R.; Gómez-Corral, A. Markovian arrivals in stochastic modelling: A survey and some new results. SORT-Stat. Oper. Res. Trans. 2010, 34, 101–144. [Google Scholar]
  18. Distefano, D.; Longo, F.; Trivedi, S. Investigating dynamic reliability and availability through state-space models. Comput. Math. Appl. 2012, 64, 3701–3716. [Google Scholar] [CrossRef] [Green Version]
  19. Esary, J.D.; Marshall, A.W.; Proschan, R. Shock models and wear processes. Ann. Probab. 1973, 1, 627–649. [Google Scholar] [CrossRef]
  20. Finkelstein, M. Shocks in homogeneous and heterogeneous populations. Reliab. Eng. Syst. Saf. 2007, 92, 569–574. [Google Scholar] [CrossRef] [Green Version]
  21. Gut, A.; Hürsler, J. Realistic variation of shock models. Stat. Probab. Lett. 2005, 74, 187–204. [Google Scholar] [CrossRef]
  22. Cha, J.H.; Finkelstein, M. Stochastic survival models with events triggered by external shocks. Probab. Eng. Informational Sci. 2012, 26, 183–195. [Google Scholar] [CrossRef]
  23. Wu, B.; Maya, B.I.G.; Limnios, N. Using Semi-Markov Chains to Solve Semi-Markov Processes. Methodol. Comput. Appl. Probab. 2021, 23, 1419–1431. [Google Scholar] [CrossRef]
  24. Neuts, M.F.; Bhattacharjee, M.C. Shock models with phase type survival and shock resistance. Nav. Res. Logist. 1981, 28, 213–219. [Google Scholar] [CrossRef]
  25. Ye, Z.-S.; Xie, M. Stochastic modelling and analysis of degradation for highly reliable products. Appl. Stoch. Model. Bus. Ind. 2015, 31, 16–32. [Google Scholar] [CrossRef]
  26. Montoro-Cazorla, D.; Pérez-Ocón, R. Reliability of a system under two types of failures using a Markovian arrival process. Oper. Res. Lett. 2006, 34, 525–530. [Google Scholar] [CrossRef]
  27. Montoro-Cazorla, D.; Pérez-Ocón, R. A maintenance model with failures and inspection following Markovian arrival processes and two repair modes. Eur. J. Oper. Res. 2008, 186, 694–707. [Google Scholar]
  28. Pérez-Ocón, R.; Segovia, M.C. Shock models under a Markovian arrival process. Math. Comput. Model. 2009, 50, 879–884. [Google Scholar] [CrossRef]
  29. Montoro-Cazorla, D.; Pérez-Ocón, R. Replacement policy in a system under shocks following a Markovian arrival process. Reliab. Eng. Syst. Saf. 2009, 94, 497–502. [Google Scholar] [CrossRef]
  30. Montoro-Cazorla, D.; Pérez-Ocón, R. A reliability system under cumulative shocks governed by a BMAP. Appl. Math. Model. 2015, 39, 7620–7629. [Google Scholar] [CrossRef]
  31. Ruiz-Castro, J.E. A complex multi-state k-out-of-n: G system with preventive maintenance and loss of units. Reliab. Eng. Syst. Saf. 2020, 2020 197, 106797. [Google Scholar] [CrossRef]
  32. Arulmozhi, G. Direct method for reliability computation of k-out-of-n: G systems. Appl. Math. Comput. 2003, 143, 421–429. [Google Scholar] [CrossRef]
  33. Prabhakar Murthy, D.N.; Xie, M.; Jiang, R. Weibull Models; Wiley: Hoboken, NJ, USA, 2004. [Google Scholar]
Figure 1. Comparison of the availability in series, parallel, and 2-out-of-3 systems.
Figure 1. Comparison of the availability in series, parallel, and 2-out-of-3 systems.
Mathematics 10 01902 g001
Table 1. Availability, reliability, and rate of failure of the system for different values of t.
Table 1. Availability, reliability, and rate of failure of the system for different values of t.
t · 10 1 0246810
A ( t ) 1 0.9159 0.7923 0.6947 0.6371 0.6099 0.5986
R ( t ) 1 0.9024 0.7245 0.5393 0.3826 0.2664 0
r ( t ) 0.1276 0.8486 1.3155 1.6116 1.7775 1.8577 1.9171
Table 2. Some performance measures for different values of inspection mean time.
Table 2. Some performance measures for different values of inspection mean time.
μ I 01/213/2235
A1 0.5986 0.4050 0.3059 0.2459 0.1766 0.1131
E [ T U ] 7.5129 0.7781 0.7042 0.6797 0.6676 0.6554 0.6457
f C 1 0.6008 0.4132 0.3118 0.2503 0.1793 0.1144
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Montoro-Cazorla, D.; Pérez-Ocón, R. Analysis of k-Out-of-N-Systems with Different Units under Simultaneous Failures: A Matrix-Analytic Approach. Mathematics 2022, 10, 1902. https://doi.org/10.3390/math10111902

AMA Style

Montoro-Cazorla D, Pérez-Ocón R. Analysis of k-Out-of-N-Systems with Different Units under Simultaneous Failures: A Matrix-Analytic Approach. Mathematics. 2022; 10(11):1902. https://doi.org/10.3390/math10111902

Chicago/Turabian Style

Montoro-Cazorla, Delia, and Rafael Pérez-Ocón. 2022. "Analysis of k-Out-of-N-Systems with Different Units under Simultaneous Failures: A Matrix-Analytic Approach" Mathematics 10, no. 11: 1902. https://doi.org/10.3390/math10111902

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop