Next Article in Journal
Task Scheduling Approach in Cloud Computing Environment Using Hybrid Differential Evolution
Next Article in Special Issue
Reliability Analysis and Redundancy Optimization of a Command Post Phased-Mission System
Previous Article in Journal
Modeling Non-Point Source Nutrient Loads with Different Cropping Systems in an Agricultural Lake Watershed in Southwestern China: From Field to Watershed Scale
Previous Article in Special Issue
Optimal Triggering Policy of Protective Devices Considering Self-Exciting Mechanism of Shocks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Resilience-Based Surrogate Robustness Measure and Optimization Method for Robust Job-Shop Scheduling

1
School of Logistics Engineering, Shanghai Maritime University, Shanghai 201306, China
2
Department of Mechanical Engineering, North China Electric Power University, Baoding 071003, China
3
School of Management Engineering, Zhengzhou University, Zhengzhou 450001, China
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(21), 4048; https://doi.org/10.3390/math10214048
Submission received: 28 September 2022 / Revised: 23 October 2022 / Accepted: 26 October 2022 / Published: 31 October 2022

Abstract

:
This paper addresses the robust job-shop scheduling problems (RJSSP) with stochastic deteriorating processing times by considering the resilience of the production schedule. To deal with the disturbances caused by the processing time variations, the expected deviation between the realized makespan and the initial makespan is adopted to measure the robustness of a schedule. A surrogate model for robust scheduling is proposed, which can optimize both the schedule performance and robustness of RJSSP. Specifically, the computational burden of simulation is considered a deficiency for robustness evaluation under the disturbance of stochastic processing times. Therefore, a resilience-based surrogate robustness measure (SRM-R) is provided for the robustness estimation in the surrogate model. The proposed SRM-R considers the production resilience and can utilize the available information on stochastic deteriorating processing times and slack times in the schedule structure by analyzing the disturbance propagation of the correlated operations in the schedule. Finally, a multi-objective hybrid estimation of distribution algorithm is employed to obtain the Pareto optimal solutions of RJSSP. The simulation experiment results show that the presented SRM-R is effective and can provide the Pareto solutions with a lower computational burden. Furthermore, an RJSSP case derived from the manufacturing environment demonstrates that the proposed approach can generate satisfactory robust solutions with significantly improved computational efficiency.

1. Introduction

Scheduling problems always play a key role in improving the production efficiency of a manufacturing system, such as flow shop scheduling [1,2], job-shop scheduling [3,4], and flexible job-shop scheduling [5,6,7]. The job shop is a typical workshop organization form to model the manufacturing environment. In recent years, tremendous research work has been conducted by considering a deterministic environment [8]. However, in a real-life job shop, the processing times would be deteriorated by many uncertainties, such as random machine breakdowns, uncertain setup times, and the unavailability of fixtures [9,10,11]. For example, in the aero-engine parts workshop, machine failures will cause processing interruption, and thus extend the processing time, or the tool wear will increase the actual processing time. Since the random deterioration of the processing time may decrease the production performance of a real-life manufacturing system, how to effectively deal with the job-shop scheduling problem (JSSP) with it has become a critical and interesting issue. There are four major approaches for dealing with such problems: a stochastic approach, a fuzzy approach, a stability approach, and a robust approach [12]. The stochastic approach [13] assumed that the job processing times are random variables with certain probability distributions determined before scheduling. A fuzzy approach [14] allows a scheduler to find the best schedules, with respect to the fuzzy processing times of the jobs to be processed. The stability approach was developed to construct a minimal dominant set of schedules, which optimally covers all feasible scenarios. For example, Sotskov et al. [15] addressed a two-machine job-shop scheduling problem with fixed lower and upper bounds on the job processing times and provided a minimal dominant set of schedules to make the online scheduling decision. A robust approach considers the robustness to generate a schedule that can preserve a specified level of solution quality in the presence of uncertainties [16]. Generally, robustness has two definitions: quality robustness and solution robustness [17]. Quality robustness is used to indicate the insensitivity of the schedule performance under uncertainties, such as makespan or tardiness. The solution robustness refers to the insensitivity of activity start times to uncertainty [7,18]. In this study, we will focus on the quality robustness to deal with the robust job-shop scheduling problems.
For quality robustness, some studies used the minimized maximum regret value to evaluate it [8] and some regard the expected actual performance as the robustness [4,19]. For example, Wu et al. [20] proposed a decomposition method based on graph theory for JSSP, by using expected average weighted tardiness as a robustness measure. However, taking the expected performance as the robustness evaluation measure cannot evaluate the degree of influence of disturbances on the scheduling performance. To address this, the Pareto optimization method can be applied to optimize scheduling performance and robustness at the same time, just like when Xiong et al. [7] studied the flexible job-shop dual-objective robust scheduling problem with random machine failures. However, a new problem is that the robustness measure cannot be calculated analytically for the job-shop scheduling.
Robust scheduling based on simulation evaluation is an effective method [8], which can use the Monte Carlo simulation to simulate stochastic processing times to evaluate the robustness approximately. However, for a large number of processing time scenarios that are required to get a reasonable approximation, this type of method often bears an unacceptable computational burden [6,21]. By comparison, using the surrogate robustness measure (SRM) is an effective method to improve the computational efficiency of robustness evaluation [22,23]. In essence, the SRM is an approximate estimation measure that uses the partially known information of random disturbances and the structure of the schedule to decrease the computational burden of the robustness evaluation [13,24].
In recent years, researchers have begun to apply the surrogate model based on a surrogate measure to scheduling problems under the influence of uncertainties. Zahid et al. [25] investigated the surrogate measure of robustness for project scheduling problems. The authors stated that to avoid large tardiness due to disturbances, managers might opt for a more robust schedule rather than an optimal schedule. Therefore, they emphasized the importance of the SRMs that aid in identifying proactive schedules, which can provide flexible yet efficient plans with minimum redundancy. Wu et al. [26] address the optimization of risks in performance and stability (solution robustness) for the job-shop scheduling under random machine breakdowns. They considered three objectives: makespan, makespan risk, and stability risk. Then, they used the buffering approach under the limited predictive makespan to generate predictive schedules, which allows inserting additional idle time to control the risks. Zheng et al. [27] studied the production scheduling problem in an assembly manufacturing system with uncertain processing time and random machine breakdown. They considered the objectives of minimizing makespan and the performance deviation of the actual schedule from the baseline schedule. Xiong et al. [7] suggested two surrogate measures for the flexible job-shop scheduling problem with random machine breakdowns, considering both the available uncertainty information and the schedule structure. Yang et al. [28] studied a flexible job-shop schedule problem (FJSP) with random machine breakdown. They considered the makespan and robustness as the scheduling objectives. Then, they developed one surrogate measure named RMc to evaluate robustness using the heuristics of maximizing the workload and float time of each operation and the machine breakdowns.
From the above literature, although great efforts have been conducted for the robust scheduling problems in the field of production scheduling and project scheduling, a further study is urgently needed to consider the resilience of the manufacturing system to construct a more accurate surrogate robustness measure. System resilience is the ability of a system to respond and recover quickly from an external disruptive event [29]. In recent years, studies have been conducted considering system resilience. Mao et al. [30] formulates the supply chain networks problem as a network maximum-resilience decision, and thus developed the resilience of cumulative performance loss and the resilience of restoration rapidity to measure the resilience of the supply chain networks. Dui et al. [29] studied data-driven maintenance priority and resilience evaluation of performance loss in the main coolant system, in which the resilience of the MCS is represented by the loss and recovery of system performance. To effectively reduce and prevent losses, a resilience importance measure for performance loss is proposed to evaluate the performance of the main coolant system. Qi et al. [31] proposed a metro network’s robustness and vulnerability measurement method under node interruption and edge failure based on the complex network modeling and topological characteristics analysis of metro systems. Robustness is an important connotation of system resilience; therefore, it has the potential to design better surrogate robustness measures based on resilience.
In the previous study, the authors made some contributions in the field of job-shop scheduling under uncertainties. The authors have considered the methods of robust scheduling and risk control to deal with the disturbances to improve the robustness and reduce the scheduling risk of the job shop. Xiao et al. [22] proposed an effective robustness surrogate measure for the job-shop scheduling problem with stochastic processing time and a hybrid estimation of the distribution algorithm. Wu et al. [23,24,26] studied the job-shop scheduling problem under random machine failures and proposed robust and/or stability optimization methods based on risk indicators. To further solve the JSSP under machine–worker dual resource constraints, Xiao et al. [32] proposed a multi-objective hybrid distribution estimation algorithm (MO-HEDA). At the same time, Dui et al. [29] carried out extended research in the field of system resilience and reliability. Although the existing research has made some progress, the robustness evaluation accuracy and the solution efficiency for solving the RJSSP are still restricted by uncertainties. This study focuses on the RJSSP under stochastic deteriorating processing times, which are assumed to follow the truncated normal distribution. The main contributions can be attributed as follows. First, we constructed a surrogate model of robust scheduling, considering the performance and the robustness at the same time, in which the robustness is evaluated by a surrogate robustness measure. Second, we proposed a new effective resilience-based surrogate robustness measure (SRM-R) that considers the disturbance propagation process. Moreover, we adopt a multi-objective hybrid estimation of the distribution algorithm (MO-HEDA) based on the research of Xiao et al. [18], which can provide the Pareto optimal solutions with a satisfactory performance by minimizing the SRM-R and the performance measure simultaneously.
The organization of this paper is as follows. Section 2 provides the problem description and the mathematical modeling of RJSSP. Section 3 investigates the existing SRMs and then presents an SRM based on the resilience and disturbance propagation among the operations. Section 4 provided the MO-HEDA for the dual-objective RJSSP. Section 5 verifies the effectiveness and performance of the presented SRM-R for bi-objective robust scheduling by three groups of computational experiments. Moreover, we show the effectiveness of the proposed surrogate model by using a scheduling problem instance derived from a real-life manufacturing workshop. Finally, we draw some conclusions and provide directions for future research in Section 6.

2. Problem Description and Surrogate Model

2.1. Problem Description

The notations that are used in the formulation of the RJSSP are defined in Table 1.
In the literature on stochastic scheduling, various types of probability distributions for stochastic processing times have been studied, such as exponential distribution, normal distribution, and uniform distribution. One method is to treat both the stochastic processing time and the scheduling objective as the stochastic variables, which aims at providing the processing ordering of the tasks on the machines with an expected optimization model. The historical processing data can be collected and used to get the probability distribution. To model the uncertain processing time, the actual job processing time is usually generated from a normal distribution with its expected value and the standard deviation, such as Gu et al. [19], Horng et al. [4], Wang et al. [13,33]. Therefore, the normal distribution is a representative type to study the stochastic processing times in the field of job-shop scheduling with uncertainty.
The RJSSP based on a classic job-shop scheduling problem is described as follows: n jobs processed on m machines; each job has m operations. The probability distribution of the stochastic processing times is derived from the historical manufacturing data. In this study, all the stochastic processing times, T i j = μ i j + ε i j , i , j , are assumed to follow the normal distribution, i.e., T i j ~ N ( μ i j , σ i j 2 ) , where ε i j denotes the value of deteriorating disturbances on the expected processing time scenario. The processing information of any operation O i j is denoted as a vector ( M j , μ i j , σ i j 2 ) . Let Ω denotes the feasible solution set that meets the process constraints. A feasible schedule is denoted by S .
In an arbitrary schedule, the expected processing time μ i j is a nominal value, then the total start time deviation of each operation is affected by two factors. The first factor is the internal disruption ε i j that occurs in the operation O i j . The second factor is the unabsorbed deviation caused by the preceding job J i on the same machine M j or by the preceding machine M j of the job J i . We use Figure 1 to explain how the deviation in operatn O i j . is caused.
Figure 1 shows the deviation of the start time of the operation O i j , caused by the disruption of the machine M j . Before the disruption occurs, the start time and completion time of O i j are denoted by S T i j e and C T i j e , respectively. After the disruption occurs, the estimated start time and completion time are denoted by ξ S T i j and ξ C T i j , respectively. Thus, the total deviation of start time is indicated by D e v i j = δ + ε i j , where δ denotes the disruption caused by the preceding operation of the same job or the preceding job on the same machine.

2.2. Mathematical Formulation of the Surrogate Model

We use the Pareto optimization method to perform robust scheduling, thus, providing decision makers with a set of Pareto solutions that meet different risk preferences. The surrogate model of RJSSP is described as J m | S P T | ( f 1 , f 2 ) by a triplet proposed by Graham et al. [34], where J m denotes the JSSP, SPT denotes the stochastic deteriorating processing times, and f 1 and f 2 are the objectives that need to be optimized. Specifically, the SRM is used to evaluate the robustness of a schedule. The mathematical model of RJSSP is shown as follows:
min S Ω   { f 1 ( S ) ,   f 2 ( S ) }
f 1 ( S ) = C m a x e ( S ) = max { C i j e ,   i , j   }
f 2 ( S ) = S R M ( S )
s.t.
ξ C i k ξ T i k + M 0 ( 1 a i j k ) ξ C i j i = 1 , 2 , , n ; j , k = 1 , 2 , , m ,   j k ;
ξ C h j ξ T h j + M 0 ( 1 a i h j ) ξ C i j i , h = 1 , 2 , , n ,   i h ; j = 1 , 2 , , m ;
  a i j k = 1 , i , j , k , j k ;
  x i h k = 1 , i , j , k , i h ;
a i j k { 0 , 1 } , i , j , k , j k ;
x i h k { 0 , 1 } , i , h , j , i h ;
ξ C i j 0 , i , h , j , i h .
The objective functions are shown in Equation (1). The predictive makespan of a schedule is obtained by using the expected values of the processing times by function f 1 ( S ) . Function f 2 ( S ) is used to evaluate the robustness of a schedule solution. Inequality (4) represents the process constraints, which means the job J i must be processed on the machine before being processed on the machine M k . M 0 is a large enough positive number. The machine constraints are denoted by inequality (5), which means the job J h must be processed on the machine M j after job J i . Constraints (6) and (7) denote one machine can only process one job at one time and one job can only be processed on one machine at the same time. Constraints (8) and (9) represent the value of and can only be the integers 1 or 0. Inequality (10) denotes that none of the completion times of the operations can be negative.

3. The SRM-R for the Surrogate Model

Robust scheduling aims to generate the initial schedule with a satisfactory makespan and be capable of absorbing the disturbances caused by the deteriorating processing times when a right-shift reaction policy is adopted [35]. Generally, the robustness is estimated through Monte Carlo simulation by generating a certain number of processing times scenarios, which can be calculated by the following function:
R M sim ( S ) = 1 L i = 1 L [ C m a x l ( S ) C m a x e ( S ) ]
where l = 1 , 2 , L denotes the number of simulation replications, R M sim denotes the simulation-based robustness, C m a x l ( S ) denotes the makespan of schedule S under the l th processing time scenario. In this paper, the processing time scenario indicates a group of potentially realized stochastic deteriorating processing times of all the operations.
The simulation-based robustness evaluation is time- and resource-intensive. Thus, the robustness of a schedule is usually evaluated by designing an SRM to avoid the computational burden caused by simulations. However, in the previous research, seldom effective SRM for RJSSP have been developed. We analyze five typical existing SRMs and then propose an SRM for RJSSP by using a statistical method and a disturbance propagation formula.

3.1. The Existing Surrogate Robustness Measures

In the research of Leon et al. [35], an SRM for evaluating the absorption capability of a schedule was proposed based on the average total slacks. The measure is rewritten as SRM1, where T S i j denotes the total slack time of the operation O i j .
S R M 1 = C m a x e ( S ) i = 1 n j = 1 m T S i j m n
The SRMs that aim at providing an estimation of the schedule robustness for multi-mode project scheduling problems were investigated by Hazir et al. [36]. In one robustness measure, they define the activities that have the total slacks less than ξ , ξ [ 0 , 1 ] of the activity duration as the potentially critical activities (PCA), i.e., PCA = { O i j : T S i j μ i j ξ } . The delays in PCA are likely to result in delays in the project completion time. Thus, those schedules with fewer critical activities are preferred on the robustness. The measure of PCA used in RJSSP is reasonable because of the similarity of the two problems, which is renamed potential critical operations (PCO). Considering the variance of RJSSP is assumed known, the definition of PCA is revised as the ratio of total slacks and the sum of planned processing time and standard deviation, i.e., PCO = { O i j : T S i j μ i j + σ i j ξ } . Therefore, the SRM is rewritten as follows:
S R M 2 = { N PCO m n , O i j O PCO }
where N PCO denotes the number of PCOs, O PCO denotes the set of PCO, which includes all the operations that satisfy the inequality T S i j μ i j + σ i j ξ .
Goren et al. [37] developed a surrogate stability measure to generate efficient and stable schedules for a job-shop subject to processing time variability and random machine breakdowns. The SRM is based on decreasing the maximum variance on the mean critical path, where the mean critical path is equivalent to the critical path of RJSSP when expected processing times are used. The SRM is written as follows:
S R M 3 = m a x {   V A R ( O i j ) , O i j O CP }
where the objective of SRM3 is to evaluate and select the critical path with the maximum sum variances, in which VAR denotes the variance of the processing times of the operations in the critical operation O CP .
Xiao et al. [22] provided two SRMs for the robust scheduling of JSSP with stochastic processing times. They utilize the probability distribution information of stochastic processing times and the structure of the schedule solution determined by the slack times to estimate the robustness.
S R M 4 = S M cp + S M ncp
In Equation (15), S M cp and S M ncp denote the estimated robustness degradation of the critical path and the non-critical path, respectively. The S R M 4 aims to minimize the total robustness degradation of the baseline schedule.
S R M 5 = m a x { S M cp , S M ncp }
In Equation (16), S R M 5 is provided to estimate the possible robustness degradation by minimizing the maximal robustness degradation on the critical path and non-critical path. In the following section, a disturbance propagation-based SRM is proposed, which is inspired by the disturbance absorption ability of a baseline schedule.

3.2. The Resilience-Based Surrogate Robustness Measures

One of the core connotations of system resilience is the ability to absorb disturbances, which is reflected by the degree of system performance degradation under disturbance events. From this point of view, a schedule with good resilience may also have strong robustness. Therefore, we constructed a resilience-based surrogate robustness measure by analyzing the influence and propagation mechanism of stochastic deteriorating processing times to the makespan.
To describe the realized processing time scenario, the stochastic deteriorating disturbances were represented by ξ T i j = μ i j + ε i j when disruption was introduced. In this study, the processing times of each operation are considered to follow a normal distribution, i.e., ξ T i j ~ N ( μ i j , σ i j 2 ) . Then, map ξ T i j into Z i j = ( ξ T i j μ i j ) σ i j , yielding Z i j ( 0 , 1 ) . Furthermore, 1 α was denoted as the confidence level that is specified by the decision maker, considering the degree of risk aversion on the stochastic deteriorating processing times. We used the maximum possible value of the disruption with a certain confidence level to convert the disruption ε i j into a confidence interval. Then, the worst case of the disruption was utilized for designing the SRM for robustness estimation. Since we are concerned about the upper limit of the realized processing time, the right-sided confidence interval of the normal distribution function was used. The probability is represented by Equation (17),
P r { Z i j = ( ξ T i j μ i j ) σ i j Z α } = 1 α ,
where the value of realized processing time ξ T i j μ i j + Z α σ i j , Z α is the critical value determined by the predefined value of α . Therefore, the possible disturbance of operation O i j follows the equation ε i j = ξ T i j μ i j Z α σ i j .
Since the disruptions among the operations propagate downstream under the process constraints, the recursion formula for calculating the D e v i j of operation O i j should be provided. Since the maximum value of the internal disruption ε i j is set by using a predefined confidence level ( 1 α ) , consideration is required for the worst case of all the processing time scenarios. The scenarios denote the realization of the stochastic processing time of the operations. The main difficulty, thereby, is to analyze the disturbances caused by the preceding operations. We use a problem instance with two consecutive jobs and two machines to analyze how to estimate the total deviation of an arbitrary operation, as shown in Figure 2.
We used Figure 2 to illustrate the schedule before and after the disruptions occur; ε denoted the possible maximum of the disruption of each operation; F s k j and F s i h denoted the free slacks of operation O k j and O k j , respectively, where F s i h = 0 . Generally, the absorption ability of the schedule is determined by the free slacks that exist in the schedule. To estimate the robustness of a schedule, we analyzed the formula of each operation first and then extended the formula into a generic form. Furthermore, the generalized formula for estimating the deviation D e v i j was used to develop the SRM for estimating the robustness of RJSSP.
The operation O k j and O k j are in the first position on the machine M j and M h . The deviation D e v k j and D e v i h are represented by D e v k j = ε i j and D e v i h = ε i h , respectively. The deviation of operation O i j is possibly affected by its preceding operation O i h or O k j beside the disruption ε i j . It was also found that there are some units of slack times before the operation O i j , which can be regarded as the free slacks belonging to the operation O k j . If D e v k j = ε k j > F s k j , the free slacks cannot absorb the disruption of operation O k j , otherwise, the deviation of operation O k j will not influence the starting time of operation O i j . In addition, the deviation of operation O i h may also affect the deviation of operation O i j , since its free slack is F s i h = 0 . Therefore, the total deviation D e v i j was determined by the maximum deviation of its preceding job or operation. The available free slack is calculated by using the start time of O i j and the maximum of the end time of operation O i h and O k j . From the above analysis, the D e v i j is formulated as follows:
D e v i j = max { [ D e v i h ( S T i j e C T i h e ) ] , [ D e v k j ( S T i j e C T k j e ) ] } + ε i j
where S T e and C T e denoted the start time and completion time of the initial schedule, which are obtained by decoding the schedule using the nominal processing times μ i j . The reasonability lies in that the start times and completion times of the succeeding operation are shifted backward due to the adoption of the right-shift rescheduling method when disruption occurs. Therefore, the deviation D e v k h of operation O k h can be estimated by the following equation:
D e v k h = max { [ D e v k j ( S T k h e C T k j e ) ] , [ D e v i j ( S T k h e C T i h e ) ] } + ε i j
To generalize the formula for estimating the deviation on the arbitrary operation O i j , we define O i j as the preceding operation of the operation O i j on the same machine M j and define O i j as the preceding operation of the same job J i . There are two cases in which operation O i j has no preceding operation or has no preceding job. The first case is that the operation O i j is the first operation on a machine M j while the second one is that O i j is the first operation of the job J i . To create a generic formula, four cases were discussed by considering whether the operation O i j is the first operation on a machine M j and whether operation O i j is the first operation of the job J i . We used M j ( p ) | p = 1 , , n ; j = 1 , , m to represent the p th job on the machine M j and used J i ( q ) | q = 1 , , m ; i = 1 , , n to denote the q th operation of job J i . Therefore, the deviation of the operation that considers the disturbance propagation was calculated by the following equation:
D e v i j = { ε i j , M j ( p = 1 ) = O i j ,   J i ( q = 1 ) = O i j max { [ D e v i j ' ( S T i j e C T i j ' e ) ] , 0 } + ε i j ,   M j ( p = 1 ) = O i j , J i ( q = 1 ) O i j max { [ D e v i ' j ( S T i j e C T i ' j e ) ] , 0 } + ε i j ,   M j ( p = 1 ) O i j , J i ( q = 1 ) = O i j max { [ D e v i j ' ( S T i j e C T i j ' e ) ] , [ D e v i ' j ( S T i j e C T i ' j e ) ] , 0 } + ε i j ,   otherwise
where M j ( p = 1 ) = O i j indicates that the first operation on the machine M j is O i j while J i ( q = 1 ) = O i j indicates that O i j is the first operation of Job J i . Due to the adoption of the railway execution strategy, the start time of O i j cannot move forward, then the deviation of any operation cannot be negative. Moreover, the difference between the deviation and the free slack should be greater or equal to zero in the case that the disruption is fully absorbed. To further simplify the formula for estimating D e v i j , we provide the following Proposition 1.
Proposition 1.
For O i j , i = 1 , , n ; j = 1 , , m in the schedule S Ω , if the operation O i j has at least one preceding job or one preceding operation, it holds that at least one of the two inequalities between D e v i j ( S T i j e C T i j e ) 0 and D e v i j ( S T i j e C T i j e ) 0 are satisfied in any case.
Proof of Proposition 1.
For O i j , i = 1 , , n ; j = 1 , , m , the following four cases cover all the possibilities:
  • Case 1, if M j ( p = 1 ) = O i j , J i ( q = 1 ) = O i j , there is no preceding operation and preceding job before the operation O i j , it is easy to find that S T i j e = 0 ;
  • Case 2, Else if M j ( p = 1 ) = O i j , J i ( q = 1 ) O i j , there is at least one operation that precedes O i j of job J i , then S T i j e C T i j e = 0 ;
  • Case 3, Else if M j ( p = 1 ) O i j ,   J i ( q = 1 ) = O i j , there is at least one operation precedes O i j on the machine M j , then S T i j e C T i j e = 0 ;
  • Case 4, Otherwise, M j ( p = 1 ) O i j , J i ( q = 1 ) O i j , there exists at least one operation that precedes O i j of job J i and at least one operation precedes O i j on the machine M j , then S T i j e max { C T i j e , C T i j e } = 0 . It can be seen from the four cases that if an operation has no preceding jobs and preceding operations, its start time is equal to zero.
In other cases, there are at least one of the two equations of S T i j e C T i j e = 0 and S T i j e C T i j e = 0 holds. Since it also holds that D i j 0   and D i j 0 , therefore, at least one inequality in D e v i j ( S T i j e C T i j e ) 0 and D e v i j ( S T i j e C T i j e ) 0 is satisfied in any case.□
According to Proposition 1, the formula in Equation (20) is rewritten as Equation (21)
D e v i j = { ε i j , M j ( p = 1 ) = O i j ,   J i ( q = 1 ) = O i j [ D e v i j ' ( S T i j e C T i j ' e ) ] + ε i j ,   M j ( p = 1 ) = O i j , J i ( q = 1 ) O i j [ D e v i ' j ( S T i j e C T i ' j e ) ] + ε i j ,   M j ( p = 1 ) O i j , J i ( q = 1 ) = O i j max { [ D e v i j ' ( S T i j e C T i j ' e ) ] ,   [ D e v i ' j ( S T i j e C T i ' j e ) ] } + ε i j ,   otherwise
By Equation (21), the completion time of each operation after being disrupted is estimated by Equation (22).
ξ C i j = S T i j e + D e v i j + μ i j
According to Equation (22), the makespan is calculated by Equation (23).
ξ C m a x = max ξ C i j = max { S T i j e + D e v i j + μ i j }
In Equation (23), S T i j e , μ i j and the estimated disruption ε i j in the formula of D e v i j can be deemed as constants. Thereby, the only unknown parameter is the disturbance caused by the adjacent preceding operations and the available free slack times. For each schedule solution S Ω , the estimated deviation of the makespan is calculated by Equation (24):
E D e v ( S ) = max { S T i j e + D e v i j + μ i j , i , j } C m a x e ( S )
where E D e v ( S ) denotes the estimated deviation between the makespan after the disruptions and the initial makespan, considering the nominal processing times.
To keep the higher robustness of a schedule S , the E D e v ( S ) should be minimized. Therefore, the presented SRM is written as Equation (25):
S R M - R = min S Ω E D e v ( S )
where SRM-R denotes the proposed SRM. Therefore, the schedule with a higher disruption absorption capability, which keeps a lower deviation of the makespan, was selected.

4. A Dual-Objective Optimization Algorithm

The estimation of distribution algorithm (EDA) is an evolutionary algorithm that explores the space of candidate solutions by using an explicit probabilistic model [38]. In this study, the procedures of HEDA presented by Xiao et al. [22,32,39] are adopted as the basis of MO-HEDA for solving the surrogate model of RJSSP.
In this section, an improved multi-objective optimization algorithm based on the HEDA is adopted [32]. Before describing the MO-HEDA, the definitions are described:
Definition 1.
Pareto optimal solution: A schedule solution S * Ω is Pareto optimal solution if there is no other solution S Ω that dominates S * .
Definition 2.
Pareto front: The Pareto front denotes the Pareto boundary constructed by the Pareto optimal solutions.
Definition 3.
Dominance: For S 1 , S 2 Ω , there exist M objectives [ f 1 ( S i ) , , f k ( S i ) , ,   f M ( S i ) ] for each solution. It is said that S 1 dominates S 2 , if and only if i { 1 , , M } : f i ( S 1 ) f i ( S 2 ) , and there exists at least one h { 1 , , m } , h i satisfied f h ( S 1 ) f h ( S 2 ) .
The MO-HEDA is developed by combining the HEDA and NSGA-II [40,41]. The framework of the MO-HEDA is shown in Figure 3.
In the first three steps, the algorithm is used to get the Pareto optimal solution by running given generations. When RMsim is used as the robustness measure, simulation-based evaluation is adopted in the evolutionary process, then the Pareto front can be obtained directly. Then, the Pareto front will be further evaluated by Step 4 when SRM is used as the robustness measure. After being evaluated by simulation, the values of SRMs are turned into simulation-based robustness, which can be compared with the robustness obtained by simulation-based robustness. The pseudocode of MO-HEDA is shown in Algorithm 1.
Algorithm 1. The pseudocode of MO-HEDA.
 Step 1. Set the parameters: population size P s i z e , the number of parents and offsprings μ , λ , the number of superior solutions B s i z e , evolutionary generation I t e r a t i o n , recombination probability P c , inheriting rate θ , learning rate α .
 Step 2. Initialize the algorithm.
 Step 2.1 Generate P s i z e = N , initial schedule solutions, let P t = { S i | i = 1 , , N ; t = 0 } denote the initial solution set, t is the index of iteration.
 Step 2.2 Decode the solution to obtain the fitness value of each objective [ f 1 ( S i ) , f 2 ( S i ) , , f M ( S i ) ] ; M denotes the number of objectives.
 Step 2.3 Calculate the probability matrix P ( 0 ) .
 Step 3. Set t = 1
 While t I t e r a t i o n Do
 Step 3.1 1 Generate P o p s i z e = N new solutions by using the probability matrix P ( t 1 ) . Let Q t = { S i | i = 1 , , N } denote the newly generated solution set.
 Step 3.2 Merge the solution set P t and Q t to get a new solution set P t = P t Q t .
Step 3.2.1 Execute the improved non-dominated sort algorithm to obtain the non-dominated levels of the Pareto solutions. Let F = { F 1 , F 2 , } denote the solution set in all the non-dominated levels, where F 1 denotes the first level.
Step 3.2.2 Set i = 1
 While P t < N Do
If   | P t | > | P t | + | F i |  
P t = P t   F i  
i = i + 1
 Else if | P t | | P t | + | F i |
P t = P t   { s e l e c t   N | P t |   s o l u t i o n s   i n   F i } , the solutions are selected by using the crowded-comparison operator;
 End While
Step 3.2.3 Sort the solutions in the new solution set P t .
 Step 3.3 Select the first μ = N solutions in P t as the parent solutions.
 Step 3.4 Generate λ = N offspring solutions by using the recombination method with the recombination probability P c and then sort the ( μ + λ ) solutions by using the same procedure as Step 3.2 to obtain an updated solution set P t .
 Step 3.5 Select B size superior solution to update the probability matrix P ( t ) of HEDA and save N elites in the solution set P t   or the next iteration by using the crowded-comparison operator.
t = t + 1 ;
 End While
 Step 4. Robustness evaluation by Monte Carlo simulation method.
 Step 4.1 Conduct the simulation evaluation on the obtained Pareto optimal solutions in the solution set P t   .
 Step 4.2 Sort the solutions in F 1 according to the simulation results by using the improved non-dominated sort algorithm.
 Step 4.3 Save the Pareto frontier of the simulated solutions.
In the MO-HEDA, we used an operation-based encoding scheme [22]. Each in the population denotes a scheduling solution of the RJSSP. The operations of the job J i are represented by [ ( i 1 ) m + 1 , , ( i 1 ) m + j , , ( i 1 ) m + m ] . Let π : = ( π 1 , , π k , , π n × m ) , where k : = 1 , 2 , , n × m denotes the n × m positions in each solution. Under the encoding scheme, ( i 1 ) m + j in the position of π k in π denotes that the j th operation of the job J i is located at the k th position of π . The individual will never produce infeasible solutions in both the population initialization process and the individual sampling process, since it is encoded under precedence constraints.
The probability model is a vital procedure for the learning process of HEDA. The offspring solutions in each generation are produced by sampling the probability model. A probability matrix P ( t ) = { p i j , π k t | i , j , k } was defined considering the encoding scheme of RJSSP, where the element p i j , π k t represents the probability of the operation on the machine M j of a job J i located in the position π k . The probability function of the superior population is written as Equation (26):
p i j , π k t = s = 1 B s i z e I i j , π k t ( t ) B s i z e ,
where B s i z e denotes the quantity of the superior population, I i j , π k t ( t ) denotes the statistical function, according to Equation (27):
I i j , π k t ( t ) = { 1 , if   operation   O i j   appears   in   position   π k   of   the   schedule   0 ,   otherwise
To update the probability matrix of each generation, an incremental learning rule in the field of machine learning was adopted. The learning function is shown in Equation (28).
p i j , π k t + 1 = ( 1 α ) p i j , π k t + α 1 B s i z e s = 1 B s i z e I i j , π k t ( t )
The recombination method of HEDA is designed considering the encoding scheme and the requirement of inheriting the excellent structure of the parent solutions [13,22]. With the recombination scheme, feasible solutions are generated directly and do not need a repairing process. As described by Xiao et al. [39], HEDA adopts an operation-based individual recombination method. This operator relies on the parent’s operation inheritance rate, which represents the number of jobs that need to be inherited from the relative position and is used to adjust the inheritance degree of the offspring to the parent’s structural characteristics. To perform the recombination process, we first set the value of the parent process inheritance rate and the recombination probability; second, we selected μ parent individuals and N positioning jobs, and found the positioning operations of jobs and the relative positions of the parent individuals, named P1 and P2, respectively. At last, we extracted all the operations of the remaining jobs, inserted the remaining operations of P1 into the vacancy of the offspring S2 in sequence, and inserted the remaining operations of P2 into the vacancy of the offspring S1 in sequence.
The non-dominated sort algorithm is an innovative algorithm for sorting a population into different non-domination levels. However, the algorithm can be further improved by reducing the storage space complexity of the solutions and the number of comparisons in the sorting process. The computational complexity of the non-dominated sort algorithm is O ( M N 2 ) , where M denotes the number of objectives and N denotes the size of the population. As stated by Deb et al. [40], although the computational time complexity has reduced to O ( M N 2 ) , the storage requirement has increased to O ( N 2 ) . Therefore, there are two aspects of the previous algorithm that can be further improved. First, the comparison of each solution is conducted twice. Second, the solution set is dominated by the arbitrary solution S i P t , i will be stored in the cache, while the dominated set will not be outputted. Therefore, the improved non-dominated sort algorithm is adopted. The procedures are shown in Algorithm 2.
Algorithm 2. The pseudocode of the improved non-dominated sort algorithm.
 Step 1. Set the index of the solutions in the solution set P t = { S i | i = 1 , , N } , for each solution, set n i = 1 as the number that is dominated.
 Step 2. Determine the number that is dominated for each solution.
 For i = 1 , , N 1  
 For j = 1 , , N i
 If S i dominates S i + j
n i + j = n i + j + 1 ;
 Else if S i + j dominates S i
n i = n i + 1 ;
 End
 End
 Step 3. Sorting and storing the solutions in each front by using the times to be dominated.
 Step 3.1 Sorting the solutions based on the n i values of S i P t in ascending order.
 Step 3.2 For i : = 1 : N 1
 Set k : = 1 , k denotes the number of solutions in each level.
 Set f r o n t = 1 , f r o n t is the index of the non-dominated level.
 If n i = n i + 1
k : = k + 1 ;
F f r o n t : = F f r o n t   S i + 1 ;
 Else if n i n i + 1
f r o n t : = f r o n t + 1 ;
F f r o n t : = F f r o n t   S i + 1 ;
 End
 End
Step 4. Return the solution sets F : = { F f r o n t | f r o n t = 1 , 2 , } .
The overall complexity of the NSGA-II is O ( M N 2 ) , which is governed by the non-dominated sorting algorithm. The improved non-dominated sorting algorithm reduces the storage space complexity from O ( N 2 ) to O ( N ) by adopting the above well-designed algorithm structure. Meanwhile, the time complexity of comparisons is reduced to O ( M N ( N + 1 ) / 2 ) from O ( M N 2 ) due to the solution comparison being reduced to once for each solution [32].
In the improved non-dominated sort algorithm, the niche selection technology is provided to maintain the diversity of Pareto solutions. In this way, the algorithm can converge to the uniformly distributed Pareto front. In this study, the crowding-distance assignment proposed by Deb et al. [40] is used. They defined density estimation to get an estimation of the density of solutions surrounding a particular solution in the solution set. The crowding distance i d i s is represented by the parameter of the cuboid formed by using the nearest neighbors as the vertices. For the arbitrary non-dominated solution set F i , the crowding distance of each solution S i can be calculated by the algorithm in Algorithm 3.
Algorithm 3. The crowding distance assignment.
Step 1. Let l = | F f r o n t |
Step 2. For i = 1 : l
 Set S [ i ] distance = 0
 For j = 1 : M
F f r o n t = s o r t ( F f r o n t ,   j ) , sort the j t h objective in an ascending order
 Set S [ 1 ] distance = S [ l ] distance =  
 For i = 2 : l 1
S [ i ] distance = S [ i ] distance + ( S [ i + 1 , j ] distance S [ i 1 , j ] distance ) / ( f j m a x f j m i n )
 End
 End
 End
Step 3. Save the S [ i ] distance for the crowded-comparison operator.
In the algorithm of crowding distance assignment, S [ i , j ] d i s denotes the crowding distance of the j t h objective of the solution S i , f i m a x and f i m i n represents the maximum and minimum of the j t h objective in solution set F . By using the algorithm 2 and algorithm 3, we obtained two attributes of the solution, i.e., the non-domination rank n i and crowding distance S [ i ] d i s .

5. Simulation and Analysis

5.1. Experiment Setting

The experimental analysis approach and the MO-HEDA are both coded by Matlab 2016 on a computer of Intel(R) Xeon(R) CPU E31230 @ 3.2 GHz with RAM of 8.00 GB, produced by Dell, Round Rock, Texas. The parameters of MO-HEDA set the same values for both RMsim and SRM, which are listed in Table 2.
The benchmark instances are modified from the classic JSSP benchmark instances by introducing the variance into the nominal processing times. The benchmark instances of FT06, FT10, and FT20 provided by Muth and Thompson [42] and LA06, LA16, LA21, LA26, and LA32 provided by Lawrence [43] are selected.

5.2. The Effectiveness Analysis of SRM-R

This section focuses on the correlation analysis between the SRMs and the RMsim by using the coefficient of determination ( R 2 value). To analyze the R 2 values under different degrees of uncertainty, we define the terminology of uncertainty level.
Definition 4.
Uncertainty level (UL): The uncertain level denotes the corresponding degrees of uncertaintyin aRJSSPproblem instance, which is measured by the percentage of uncertain operationsrelative toall the operations.
The uncertain operations are selected randomly by a uniform distribution according to the specified UL. We choose five ULs (ULx, x = 2, 4, 6, 8, 10) to compare the correlation between the SRMs and the corresponding simulation-based robustness (RMsim). The value of RMsim is also calculated using Equation (11).
The compared SRMs are optimized without considering the makespan. The critical value for possible maximal processing time disruption is set at Z α = 1.96 with a confidence level of 97.5%. The R 2 values are obtained from the regression of the values of SRM and the corresponding SRMsim in the optimization process of HEDA with 200 generations. For each benchmark, five existing SRMs [22] and the presented SRM-R are tested. The average R 2 values obtained by running HEDA ten times are shown in Table 3. Moreover, the average R2 values for all the selected ULs are provided to show the overall correlation of the comparison SRMs.

5.3. The Performance Analysis of the Pareto Optimal Solutions

In this section, two different degrees of uncertainties are investigated by defining the benchmarks with 50% and 100% uncertain jobs as UL-M and UL-H, respectively, where UL-M denotes the medium uncertainty level and UL-H denotes the highest UL. To keep the benchmarks consistent when conducting robust scheduling by using SRM4 and RMsim, we select all the operations of the first [ 0.5 n ] jobs in each benchmark as the uncertain operations under UL-M, while all the operations of the n jobs will be selected as the uncertain operations under UL-H.
The simulation-based robustness measure is always regarded as the best one while with the largest computational burden [6,7]. Generally, the comparison is based on the set coverage between the Pareto front obtained by RMsim and that obtained by the proposed SRMs. In this study, we compare the performance of the presented SRM4 and RMsim directly, since the correlation values of the existing SRMs are apparently worse than those of the presented SRM4. To obtain the Pareto optimal solutions, the objective combinations of [makespan, RMsim] and [makespan, SRM-R] are used as the scheduling objectives. Two performance criteria are selected to conduct the performance comparison of RMsim and the presented SRM-R refers to the literature [7,44].
Set Coverage (SC): Let A and B be two Pareto solution sets, the set coverage SC (A, B) is defined as the percentage of the solutions in set B and is dominated by at least one solution in Pareto solution set A, as shown in Equation (29).
S C ( A , B ) = | { x B | y A : y     x } | | B |
A higher value of indicates a better performance of set A. Referring to the study of Xiong et al. (2013) [7], we use SC1 and SC2 to indicate the set coverage of the two non-dominated sets obtained by the robustness measures RMsim and SRM-R. For the convenience of comparing the performance of SRM-R and RMsim, we define S C 1 = S C ( R M s i m , S R M - R ) , S C 2 = S C ( S R M - R , R M s i m ) .
Computational time (CT): The computational time of running the MO-HEDA on the same benchmark instance by using RMsim and SRM-R.
Moreover, to show that a certain number of Pareto solutions can be obtained by using the proposed surrogate robustness measure, the number of Pareto optimal solutions is defined. Since RJSSP is an NP-hard problem and the exact optimal solution can hardly be obtained, the mentioned Pareto optimal solution is not the exact Pareto optimal solution. Given this, we use the optimal solutions obtained by MO-HEDA as the Pareto optimal solutions.
The number of Pareto optimal solutions (NPS): the number of Pareto optimal solutions on the Pareto front in the last generation.
For each benchmark, the solution performance obtained by SRM-R and RMsim will be compared using the above performance metrics; those with better performance are marked in bold. The RJSSP is a complex NP-hard combinatorial optimization problem. Moreover, the stochastic processing times also increase the randomness of the Pareto optimal solutions. Therefore, the MO-HEDA usually cannot get the same Pareto front over different runs of MO-HEDA. In the simulation, we find that the Pareto optimal solutions over different runs of MO-HEDA converge to nearly the same Pareto front under small-size benchmarks. However, under large-size benchmark instances, the Pareto optimal solutions are hard to converge to the same Pareto front due to the randomness of the optimal solutions of MO-HEDA and the effects of stochastic processing times.
From Table 4, we find that the average NPS obtained by RMsim is slightly greater than that of SRM-R under both UL-M and UL-H. The reason lies in that the SRM-R, which is an estimator of RMsim, would be slightly worse than RMsim. Moreover, the robustness evaluation in Step 4 of the MO-HEDA will also decrease the NPS in the first Pareto front. Meanwhile, the average, as well as the difference of the average under UL-H, are less than that under UL-M, which demonstrates under higher UL, the optimal Pareto solutions are harder to be obtained due to the variability of the objective values under uncertain processing times. When compared with the average NPS over all the benchmarks, the NPS of SRM-R is smaller than that of RMsim, although the difference between the average NPS values under UL-H is slight.
The performance of the Pareto optimal solutions obtained by SRM-R and RMsim is represented by set coverage, shown in Table 5. Under the UL-M for the nine benchmarks, the values of SC2 range from 0.66 to 0.98; the SRM-R shows better optimal performance in two benchmarks. The average values of SC1 and SC2 over all the benchmarks are 0.78 and 0.9, respectively. It means there is about a 12% performance deviation of the Pareto solutions obtained by SRM-R and those obtained by RMsim under UL-M from an overall perspective. As seen from the results under UL-H, there are four benchmarks that show the better performance of SRM-R, which is 44.4% of the total number of the tested benchmarks. Moreover, the average value of SC2 is 0.79, which is very close to the value of SC1, namely, 0.84. In this study, the direct comparison of the set coverage between RMsim and SRM-R shows that a higher SC2 is obtained, which is ranging from 0.62 to 1.0. This result is very satisfying when compared with the performance of the SRM proposed for flexible JSSP by Xiong et al. [7], which is rarely greater than 0.2 over the tested problems. Therefore, it is concluded that Pareto solutions can be obtained by using SRM-R as an estimator of robustness.
The main advantage of the SRM-R is the computational efficiency. The percentage of the decreasing of the CT of SRM-R, compared with that of RMsim is calculated by Equation (30).
P D = 100 × ( C T _ R M s i m C T _ S R M D P ) C T _ R M s i m %
The average of the CT of ten replications, as well as the relative percentage deviation under both the UL-M and UL-H over the nine benchmarks are provided in Table 6.
It is easily found that the CT of SRM-R is significantly lower than that of RMsim. The decreasing percentage of CT ranges from 65.45% to 77.51% under UL-M and ranges from 66.82% to 78.46% under UL-H. The results are obtained with simulation replications of 50.
According to the analysis, it is verified that the presented SRM-R can estimate robustness accurately. Therefore, the decision makers can obtain the Pareto optimal solutions using SRM-R as the robustness measure with a tremendously lower computational time. Meanwhile, the RMsim is preferred when there is enough simulation time or more Pareto solutions are needed, since it can provide more Pareto solutions. However, if the decision makers want to obtain the Pareto solutions with high computational efficiency, especially for large-size problem instances, the presented SRM-R is a better choice.

5.4. Case Study

A typical problem instance derived from a job-shop manufacturing environment is selected [13]. The problem instance is extracted from the manufacturing enterprise and then modified to verify the effectiveness of the surrogate model for RJSSP. There are eight parts to be processed, which are typical aircraft engine parts. Each part requires six major processes, including turning, milling, planning, grinding, drilling, and boring. After the setup process of the machine tool is finished, it will then be machined by the machine tool automatically. The machine tools are numbered, respectively, including lathe (M1), milling machine (M2), planer (M3), grinding machine (M4), drilling machine (M5), and boring machine (M6). The sample mean and variance of the processing time of each process are obtained, according to the historical data. The processing times approximately follow the normal distribution, then the processing time scenarios are assumed to follow the trunked normal distribution. The corresponding machine information and processing time information (units/hour) are shown in the form of (the machine number, the expected value, and the variance) [13]. We solve the problem instance using the surrogate model and the simulation-based model of RJSSP, respectively. In each computational simulation, we use MO-HEDA to solve the problem instance 10 times, the Pareto front is selected to compare the performance of the SRM-R and RMsim, shown in Table 7, the unit of Makespan and RMsim is the hour.
The average computation time of SRM-R is 103.57 s. The average computation time of RMsim is 563.55 s. The decreasing percentage of the computation time is up to 81.62%. To further show the robust scheduling results of the SRM-R, the Pareto front of the SRM-R is provided in Figure 4.
The variation range of RMsim is 8.18–12.93; (12.93–8.18)/12.93 = 31.9%. The range of Makespan is 54–64; (64–54)/64 = 15.6%, which means that the decision maker can choose any solution sacrificing at most 15.6% performance of the makespan while obtaining 31.9% improvements in the robustness. Comparing the Pareto optimal solutions obtained by SRM-R and RMsim in ten computational experiments, the set coverage value of the Pareto optimal fronts is 40% (RMsim dominates SRM-R, SC1) and 37.5% (SRM-R dominates RMsim, SC2), respectively. As for the Pareto front, the set coverage value is 50% (SC1) and 37.5% (SC2), respectively. The results denote that the SRM-R is effective to obtain robust schedule solutions with a significantly lower computational burden.

6. Conclusions

This paper addressed the RJSSP under stochastic deteriorating processing times, considering the resilience of production schedule, by formulating a surrogate model. To develop an effective SRM for robustness estimation, we provide a statistical approach that transforms the disturbances of stochastic deteriorating processing time to an estimated upper bound with a specified confidence level. Thereby, the formula of the disruption propagation is provided by analyzing the structure of the schedule. Additionally, then, we utilize the disruption propagation formula and the available processing time information to formulate the SRM-R. Moreover, we provide the MO-HEDA for solving the multi-objective RJSSP, in which the improved non-dominated sorting algorithm can increase computational efficiency by reducing the storage complexity and time complexity of the non-dominated sorting process.
Three groups of simulation experiments verified the effectiveness and the performance of the presented SRM-R. First, we investigated the correlation between the SRMs and the RMsim. The results demonstrated that the correlation of the presented SRM-R is higher than that of the existing SRMs, which indicates a better estimation accuracy of robustness. The second experiment compared the performance in obtaining the Pareto optimal solution obtained by using the presented SRM-R and the RMsim. According to the results, we concluded that the SRM-R can be used to estimate the robustness of RJSSP and obtain satisfying Pareto optimal solutions, with a tremendously lower computational burden. In addition, a case study further verified that the proposed surrogate model is effective for the studied RJSSP and production resilience can be effectively improved. Therefore, the presented SRM-R is a good choice for the decision makers, when computational efficiency is emphasized.
The limitation of the proposed SRM-R is the assumption that the processing time obeys the normal random separation. When the distribution type changes, the conditions for the disturbance propagation will change. Therefore, future research will further study the surrogate robustness measure under different probability distribution types, such as uniform distribution, lognormal distribution, and exponential distribution. Moreover, to further improve the production reliability, the surrogate measure of stability by using the disruption propagation formula should also be studied. Thereby, the multi-objective scheduling optimization approach for RJSSP will be provided, considering the makespan, the robustness, and the stability by using the presented surrogate measures. At the same time, this research may provide a theoretical basis for data-driven robust scheduling of job shops with the support of new technologies, such as digital twins and cloud manufacturing.

Author Contributions

Conceptualization, S.X. and Z.W.; methodology, S.X. and H.D.; software, S.X.; validation, S.X., Z.W. and H.D.; formal analysis, S.X.; investigation, S.X.; resources, Z.W.; data curation, Z.W.; writing—original draft preparation, S.X.; writing—review and editing, S.X., Z.W. and H.D.; visualization, S.X.; supervision, S.X. and H.D.; project administration, S.X.; funding acquisition, S.X. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Soft Science Key Project of Shanghai Science and Technology Innovation Action Plan (No. 20692193300]; the National Natural Science Foundation of China (No. U1904211); the Fundamental Research Funds for the Central Universities (No. 9160421005); Shanghai University Young Teacher Training Grant Program: (No. ZZSH20010).

Data Availability Statement

The authors confirm that the data supporting the findings of this study are available within the article.

Acknowledgments

The authors are thankful for the valuable comments and suggestions provided by the anonymous reviewers on the drafts of our paper. We are also grateful to the Shanghai Frontiers Science Center of “Full penetration” Far-Reaching Offshore Ocean Energy and Power.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Abdel-Basset, M.; Mohamed, R.; Abouhawwash, M.; Chang, V.; Askar, S. A Local Search-Based Generalized Normal Distribution Algorithm for Permutation Flow Shop Scheduling. Appl. Sci. 2021, 11, 4837. [Google Scholar] [CrossRef]
  2. Liu, F.; Wang, S.; Hong, Y.; Yue, X. On the Robust and Stable Flowshop Scheduling Under Stochastic and Dynamic Disruptions. In IEEE Transactions on Engineering Management; IEEE: Manhattan, NY, USA, 2017; pp. 1–15. [Google Scholar]
  3. Zhang, R.; Wu, C. An Artificial Bee Colony Algorithm for the Job Shop Scheduling Problem with Random Processing Times. Entropy 2011, 13, 1708–1729. [Google Scholar] [CrossRef]
  4. Horng, S.-C.; Lin, S.-S.; Yang, F.-Y. Evolutionary algorithm for stochastic job shop scheduling with random processing time. Expert Syst. Appl. 2012, 39, 3603–3610. [Google Scholar] [CrossRef]
  5. Wang, S.; Wang, L.; Xu, Y.; Liu, M. An effective estimation of distribution algorithm for the flexible job-shop scheduling problem with fuzzy processing time. Int. J. Prod. Res. 2013, 51, 3778–3793. [Google Scholar] [CrossRef]
  6. Al-Hinai, N.; ElMekkawy, T.Y. Robust and stable flexible job shop scheduling with random machine breakdowns using a hybrid genetic algorithm. Int. J. Prod. Econ. 2011, 132, 279–291. [Google Scholar] [CrossRef]
  7. Xiong, J.; Xing, L.-N.; Chen, Y.-W. Robust scheduling for multi-objective flexible job-shop problems with random machine breakdowns. Int. J. Prod. Econ. 2013, 141, 112–126. [Google Scholar] [CrossRef]
  8. Xiong, H.; Shi, S.; Ren, D.; Hu, J. A survey of job shop scheduling problem: The types and models. Comput. Oper. Res. 2022, 142, 105731. [Google Scholar] [CrossRef]
  9. Aytug, H.; Lawley, M.A.; McKay, K.; Mohan, S.; Uzsoy, R. Executing production schedules in the face of uncertainties: A review and some future directions. Eur. J. Oper. Res. 2005, 161, 86–110. [Google Scholar] [CrossRef]
  10. Ouelhadj, D.; Petrovic, S. A survey of dynamic scheduling in manufacturing systems. J. Sched. 2009, 12, 417–431. [Google Scholar] [CrossRef] [Green Version]
  11. Sabuncuoglu, I.; Goren, S. Hedging production schedules against uncertainty in manufacturing environment with a review of robustness and stability research. Int. J. Comput. Integr. Manuf. 2009, 22, 138–157. [Google Scholar] [CrossRef]
  12. Sotskov, Y.N.; Werner, F. Sequencing and Scheduling with Inaccurate Data; Nova Science Publishers, Inc.: New York, NY, USA, 2014. [Google Scholar]
  13. Xiao, S.; Wu, Z.; Yu, S. A two-stage assignment strategy for the robust scheduling of dual-resource constrained stochastic job shop scheduling problems. IFAC-PapersOnLine 2019, 52, 421–426. [Google Scholar] [CrossRef]
  14. Palacios, J.J.; González-Rodríguez, I.; Vela, C.R.; Puente, J. Robust multiobjective optimisation for fuzzy job shop problems. Appl. Soft Comput. 2016, 56, 604–616. [Google Scholar] [CrossRef] [Green Version]
  15. Sotskov, Y.N.; Matsveichuk, N.M.; Hatsura, V.D. Two-Machine Job-Shop Scheduling Problem to Minimize the Makespan with Uncertain Job Durations. Algorithms 2019, 13, 4. [Google Scholar] [CrossRef] [Green Version]
  16. Zhang, J.; Yang, J.; Zhou, Y. Robust scheduling for multi-objective flexible job-shop problems with flexible workdays. Eng. Optim. 2016, 48, 1973–1989. [Google Scholar] [CrossRef]
  17. Herroelen, W.; Leus, R. Robust and reactive project scheduling: A review and classification of procedures. Int. J. Prod. Res. 2004, 42, 1599–1620. [Google Scholar] [CrossRef]
  18. Goren, S.; Sabuncuoglu, I. Optimization of schedule robustness and stability under random machine breakdowns and processing time variability. IIE Trans. 2009, 42, 203–220. [Google Scholar] [CrossRef] [Green Version]
  19. Gu, J.; Gu, M.; Cao, C.; Gu, X. A novel competitive co-evolutionary quantum genetic algorithm for stochastic job shop scheduling problem. Comput. Oper. Res. 2010, 37, 927–937. [Google Scholar] [CrossRef]
  20. Wu, S.D.; Byeon, E.-S.; Storer, R.H. A Graph-Theoretic Decomposition of the Job Shop Scheduling Problem to Achieve Scheduling Robustness. Oper. Res. 1999, 47, 113–124. [Google Scholar] [CrossRef]
  21. Xiao, S.; Sun, S.; Yang, H. Proactive Scheduling Research on Job Shop with Stochastically Controllable Processing Times. J. Northwest. Polytech. Univ. 2014, 6, 019. [Google Scholar]
  22. Xiao, S.; Sun, S.; Jin, J. Surrogate Measures for the Robust Scheduling of Stochastic Job Shop Scheduling Problems. Energies 2017, 10, 543. [Google Scholar] [CrossRef] [Green Version]
  23. Wu, Z.; Sun, S.; Xiao, S. Risk measure of job shop scheduling with random machine breakdowns. Comput. Oper. Res. 2018, 99, 1–12. [Google Scholar] [CrossRef]
  24. Wu, Z.; Yu, S.; Li, T. A Meta-Model-Based Multi-Objective Evolutionary Approach to Robust Job Shop Scheduling. Mathematics 2019, 7, 529. [Google Scholar] [CrossRef] [Green Version]
  25. Zahid, T.; Agha, M.H.; Schmidt, T. Investigation of surrogate measures of robustness for project scheduling problems. Comput. Ind. Eng. 2019, 129, 220–227. [Google Scholar] [CrossRef]
  26. Wu, Z.; Sun, S.; Yu, S. Optimizing makespan and stability risks in job shop scheduling. Comput. Oper. Res. 2020, 122, 104963. [Google Scholar] [CrossRef]
  27. Zheng, P.; Zhang, P.; Wang, J.; Zhang, J.; Yang, C.; Jin, Y. A data-driven robust optimization method for the assembly job-shop scheduling problem under uncertainty. Int. J. Comput. Integr. Manuf. 2020, 33, 1–16. [Google Scholar] [CrossRef]
  28. Yang, Y.; Huang, M.; Wang, Z.Y.; Zhu, Q.B. Robust scheduling based on extreme learning machine for bi-objective flexible job-shop problems with machine breakdowns. Expert Syst. Appl. 2020, 158, 113545. [Google Scholar] [CrossRef]
  29. Dui, H.; Xu, Z.; Chen, L.; Xing, L.; Liu, B. Data-Driven Maintenance Priority and Resilience Evaluation of Performance Loss in a Main Coolant System. Mathematics 2022, 10, 563. [Google Scholar] [CrossRef]
  30. Mao, X.; Lou, X.; Yuan, C.; Zhou, J. Resilience-Based Restoration Model for Supply Chain Networks. Mathematics 2020, 8, 163. [Google Scholar] [CrossRef] [Green Version]
  31. Qi, Q.; Meng, Y.; Zhao, X.; Liu, J. Resilience Assessment of an Urban Metro Complex Network: A Case Study of the Zhengzhou Metro. Sustainability 2022, 14, 11555. [Google Scholar] [CrossRef]
  32. Xiao, S.; Wu, Z.; Sun, S.; Jin, M. Research on the Dual-resource Constrained Robust Job Shop Scheduling Problems. J. Mech. Eng. 2021, 57, 227–239. [Google Scholar]
  33. Wang, K.; Choi, S.H.; Lu, H. A hybrid estimation of distribution algorithm for simulation-based scheduling in a stochastic permutation flowshop. Comput. Ind. Eng. 2015, 90, 186–196. [Google Scholar] [CrossRef] [Green Version]
  34. Graham, R.L.; Lawler, E.L.; Lenstra, J.K.; Kan, A.R. Optimization and approximation in deterministic sequencing and scheduling: A survey. Ann. Discret. Math. 1979, 5, 287–326. [Google Scholar]
  35. Leon, V.J.; Wu, S.D.; Storer, R.H. Robustness measures and robust scheduling for job shops. IIE Trans. 1994, 26, 32–43. [Google Scholar] [CrossRef]
  36. Hazır, Ö.; Haouari, M.; Erel, E. Robust scheduling and robustness measures for the discrete time/cost trade-off problem. Eur. J. Oper. Res. 2010, 207, 633–643. [Google Scholar] [CrossRef]
  37. Goren, S.; Sabuncuoglu, I.; Koc, U. Optimization of schedule stability and efficiency under processing time variability and random machine breakdowns in a job shop environment. Nav. Res. Logist. 2012, 59, 26–38. [Google Scholar] [CrossRef] [Green Version]
  38. Larranaga, P.; Lozano, J.A. Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation; Springer: New York, NY, USA, 2002. [Google Scholar]
  39. Xiao, S.; Sun, S.; Guo, H.; Jin, M.; Yang, H. Hybrid Estimation of Distribution Algorithm for Solving the Stochastic Job Shop Scheduling Problem. J. Mech. Eng. 2015, 51, 27–35. [Google Scholar] [CrossRef] [Green Version]
  40. Deb, K.; Pratap, A.; Agarwal, S.; Meyarivan, T. A Fast and Elitist Multiobjective Genetic Algorithm: NSGA-II. In IEEE Transactions on Evolutionary Computation; IEEE: Manhattan, NY, USA, 2002; Volume 6, p. 15. [Google Scholar]
  41. Valledor, P.; Gomez, A.; Puente, J.; Fernandez, I. Solving Rescheduling Problems in Dynamic Permutation Flow Shop Environments with Multiple Objectives Using the Hybrid Dynamic Non-Dominated Sorting Genetic II Algorithm. Mathematics 2022, 10, 2395. [Google Scholar] [CrossRef]
  42. Muth, J.F.; Thompson, G.L. Industrial Scheduling; Prentice-Hall: Hoboken, NJ, USA, 1963. [Google Scholar]
  43. Lawrence, S. Resource constrained project scheduling: An experimental investigation of heuristic scheduling techniques (Supplement); Carnegie-Mellon University: Pennsylvania, PA, USA, 1984. [Google Scholar]
  44. Ahmadi, E.; Zandieh, M.; Farrokh, M.; Emami, S.M. A multi objective optimization approach for flexible job shop scheduling problem under random machine breakdown by evolutionary algorithms. Comput. Oper. Res. 2016, 73, 56–66. [Google Scholar] [CrossRef]
Figure 1. The start time deviation after being disrupted.
Figure 1. The start time deviation after being disrupted.
Mathematics 10 04048 g001
Figure 2. An instance of two consecutive jobs processed on two machines.
Figure 2. An instance of two consecutive jobs processed on two machines.
Mathematics 10 04048 g002
Figure 3. The framework of the MO-HEDA.
Figure 3. The framework of the MO-HEDA.
Mathematics 10 04048 g003
Figure 4. The Pareto optimal front of the comparison robustness measures.
Figure 4. The Pareto optimal front of the comparison robustness measures.
Mathematics 10 04048 g004
Table 1. The notations of the mathematical model.
Table 1. The notations of the mathematical model.
NotationsDefinitions
ξ The index of random variables
n The total number of jobs
m The total number of machines
J i The job J i ,   i = 1 , 2 , , n
M j The machine M j ,   j = 1 , 2 , , m
O i j The operation of the job J i processed on machine M j
μ i j The expected processing time of operation O i j
σ i j 2 The processing time variance of operation O i j
M 0 A positive number that is large enough
ξ C i j The stochastic completion time of the job J i on machine M j
ξ T i j The stochastic processing time of the job J i of on machine M j
C m a x e The makespan when the expected processing times are used
a i j k Job J i must be processed on a machine M j then on machine M k , if it is satisfied, a i j k = 1 , otherwise, a i j k = 0
x i h j Job J h must be processed after J i on machine M j , if it is satisfied, x i h j = 1 , otherwise, x i h j = 0
Table 2. The parameters setting for MO-HEDA.
Table 2. The parameters setting for MO-HEDA.
Parameters of HEDAValues
Population size200
Evolutionary generations200
Recombination probability0.8
Number of positioning jobs[ 0.5 n ]
Learning rate0.3
Number of superior sub-population40
Value of μ and λ200
Number of simulation replications50 or 200
Table 3. The R 2 value of the comparison SRMs.
Table 3. The R 2 value of the comparison SRMs.
Benchmark InstancesSRMsUncertainty Level
UL2UL4UL6UL8UL10Average
FT06SRM10.440.460.290.550.530.45
SRM20.270.090.140.480.530.30
SRM30.150.130.260.230.140.18
SRM40.690.720.830.690.870.76
SRM50.710.770.740.730.760.74
SRM-R0.840.810.780.820.780.81
FT10SRM10.130.190.390.690.770.43
SRM20.140.160.440.560.720.40
SRM30.530.570.590.480.700.57
SRM40.820.900.890.890.850.87
SRM50.780.830.830.860.850.83
SRM-R0.860.920.960.890.910.91
FT20SRM10.030.120.350.480.760.35
SRM20.140.310.350.400.730.39
SRM30.440.670.620.590.680.60
SRM40.840.830.900.840.810.84
SRM50.840.760.870.790.820.82
SRM-R0.920.960.940.960.920.94
LA06SRM10.530.610.310.180.580.44
SRM20.460.490.450.680.870.59
SRM30.510.560.290.520.400.46
SRM40.840.780.670.790.680.75
SRM50.870.810.840.730.690.79
SRM-R0.920.930.930.970.980.95
LA16SRM10.110.290.460.450.810.42
SRM20.200.390.440.550.790.47
SRM30.200.200.260.470.770.38
SRM40.900.920.830.870.790.86
SRM50.810.750.730.810.800.78
SRM-R0.920.890.930.920.920.92
LA21SRM10.200.200.260.470.670.46
SRM20.240.220.450.470.740.40
SRM30.550.590.570.710.680.66
SRM40.790.800.820.780.800.80
SRM50.840.780.710.780.800.78
SRM-R0.900.880.920.900.880.90
LA26SRM10.100.150.370.640.710.39
SRM20.160.390.500.580.780.48
SRM30.640.700.740.650.710.69
SRM40.830.850.820.870.770.83
SRM50.810.830.800.840.820.82
SRM-R0.900.920.940.910.940.92
LA32SRM10.190.240.170.570.670.37
SRM20.120.220.260.450.580.33
SRM30.430.740.820.660.720.67
SRM40.880.940.920.880.870.90
SRM50.850.910.920.920.900.90
SRM-R0.920.930.940.910.930.93
Table 4. The value of NPS.
Table 4. The value of NPS.
ULBenchmark InstancesFT06FT10FT20LA06LA11LA16LA21LA26LA32Average
UL-MRMsim10.417.814.818.217.817109.410.414.0
SRM-R7.28.410.811.410.210.28.69.47.89.3
UL-HRMsim6.010.69.38.58.57.19.37.46.08.1
SRM-R4.26.36.26.07.94.28.77.66.16.4
Table 5. The SC of the Pareto optimal solutions obtained by RMsim and SRM-R.
Table 5. The SC of the Pareto optimal solutions obtained by RMsim and SRM-R.
ULBenchmark InstancesFT06FT10FT20LA06LA16LA16LA21LA26Average
UL-MSC11.000.980.780.900.890.901.000.740.90
SC20.660.670.920.750.780.850.740.980.78
UL-HSC10.710.840.960.830.950.860.770.750.84
SC20.950.720.730.620.660.890.971.000.79
Table 6. The comparison of average computational time (seconds).
Table 6. The comparison of average computational time (seconds).
ULCTFT06FT10FT20LA06LA16LA16LA21LA26Average
UL-MSRM-R71.31137.94146.30110.44137.62183.15261.58448.6371.31
RMsim317.10546.58499.50382.56527.04680.27844.291298.57317.10
PD (%)77.5174.7670.7171.1373.8973.0869.0265.4577.51
UL-HSRM-R69.29130.49134.48107.32127.84185.42257.03439.8369.29
RMsim308.59542.97516.87387.12593.40708.96862.711325.46308.59
PD (%)77.5575.9773.9872.2878.4673.8570.2166.8277.55
Table 7. The Pareto front obtained by SRM-R and RMsim.
Table 7. The Pareto front obtained by SRM-R and RMsim.
Pareto SolutionsSRM-RRMsim
MakespanRMsimMakespanRMsim
15711.25609.36
25412.825810.56
35611.265611.50
45810.665710.79
5599.57599.82
6619.225412.93
75511.90638.18
8638.285511.79
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Xiao, S.; Wu, Z.; Dui, H. Resilience-Based Surrogate Robustness Measure and Optimization Method for Robust Job-Shop Scheduling. Mathematics 2022, 10, 4048. https://doi.org/10.3390/math10214048

AMA Style

Xiao S, Wu Z, Dui H. Resilience-Based Surrogate Robustness Measure and Optimization Method for Robust Job-Shop Scheduling. Mathematics. 2022; 10(21):4048. https://doi.org/10.3390/math10214048

Chicago/Turabian Style

Xiao, Shichang, Zigao Wu, and Hongyan Dui. 2022. "Resilience-Based Surrogate Robustness Measure and Optimization Method for Robust Job-Shop Scheduling" Mathematics 10, no. 21: 4048. https://doi.org/10.3390/math10214048

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop