Meta-Heuristic Solver with Parallel Genetic Algorithm Framework in Airline Crew Scheduling

Ouyang, Weihao; Zhu, Xiaohong

doi:10.3390/su15021506

Open AccessArticle

Meta-Heuristic Solver with Parallel Genetic Algorithm Framework in Airline Crew Scheduling

by

Weihao Ouyang

and

Xiaohong Zhu

^*

College of Information and Science Technology, Jinan University, Guangzhou 510632, China

^*

Author to whom correspondence should be addressed.

Sustainability 2023, 15(2), 1506; https://doi.org/10.3390/su15021506

Submission received: 30 November 2022 / Revised: 31 December 2022 / Accepted: 11 January 2023 / Published: 12 January 2023

(This article belongs to the Special Issue Sustainable Development in Air Transport Management)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Airline crew scheduling is a very important part of the operational planning of commercial airlines, but it is a linear integer programming problem with multi-constraints. Traditionally, the airline crew scheduling problem is determined by solving the crew pairing problem (CPP) and the crew rostering problem (CRP), sequentially. In this paper, we propose a new heuristic solver based on the parallel genetic algorithm and an innovative crew scheduling algorithm, which improves traditional crew scheduling by integrating CPP and CRP into a single problem. The innovative scheduling method includes a global heuristic search and an adjustment for flights and crew so as to realize crew scheduling. The parallel genetic algorithm is used to divide the population into multiple threads for parallel calculation and to optimize the randomly generated flight sequence to maximize the number of flights that meet the crew configuration. Compared with the genetic algorithm, CPLEX and Gurobi, it shows high optimization efficiency, with a time reduction of 16.57–85.82%. The experiment shows that our crew utilization ratio is higher than that for traditional solvers, achieving almost 44 flights per month, with good scalability and stability in both 206 and 13,954 flight datasets, and can better manage airline crew scheduling in times of crew scarcity.

Keywords:

airline crew scheduling; parallel genetic algorithm; randomly generated flight sequence

1. Introduction

Air traffic planning plays an important role in the planning and the management of an airline [1,2,3]. With the globalization of economies and the rapid development of information and technology, the choice to travel by air and the use of airplanes as a means of transportation are becoming increasingly common today [2,4,5,6]. Before the coronavirus pandemic, air transport demand had been growing consistently. In 2019, the number of flights boarded by passenger airlines in mainland China reached 4.611 million, representing an increase of 6.1% over the previous year [7,8]. In the same year, the Boeing Company estimated that the civil aviation industry will need 790,000 new pilots in the next 20 years, with a scarcity of pilots, meaning that the number of monthly flights is projected to far outweigh the number of crew members. Therefore, it is likely that some flights will not have crew matches, which is an issue that cannot be solved in a short time because it takes an average of seven years for a general pilot to become a captain and costs between 0.2 and 1 million dollars to train an individual to this level [9,10]. Although the coronavirus pandemic has had a huge impact on the air transport industry, it is expected that air traffic will recover quickly [7,11]. The rapid increase in the demand for airline transportation and air traffic has led airline companies to use their resources more effectively, and the need for operation optimal planning has arisen [12,13,14,15,16]. Airline operation plans include a route plan [17,18], flight plan [19,20], and crew scheduling plan [21,22,23], among which crew scheduling is an important part. The aircraft and crew are two fundamental factors to consider in the operating costs of the airline industry; the flight is the direct means of income of the airline, and crew expenditures represent their second-highest source of spending after fuel costs [21,24]. Airline crew scheduling is related to the flight and crew, and it is significant in the management and decisions of commercial airlines. Therefore, how to develop a high-quality crew scheduling methodology with a limited number of crew members but a large number of flights is an important issue [25,26,27,28]. High-quality crew scheduling means a more rational crew schedule that minimizes total costs and maximizes total profitability, which is particularly important for commercial airlines to optimize their operation costs. For example, a 1% increase in flights meeting crew configurations or 1% decrease in crew scheduling equates to billions of dollars in additional profits in large commercial airlines [26,29,30,31].

Airline crew scheduling is the process of assigning flights to pilots and flight attendants within a flight plan in accordance with the relevant Civil Aviation Administration (CAA) airworthiness documents and a set of regulations set by the specific airline. The crew scheduling problem concerns the development of a crew schedule for a specific period (generally a month) under the condition that the flight information and the crew information are known; schedules should make clear when and where each crew member will perform what task in which flight so as to minimize the total cost and maximize the total profit of the flight [32]. In view of crew scarcity, the number of planned monthly flights exceeds the availability of crew members, which could lead to some flights not having a crew match, meaning that crew configuration of the flight cannot be met. In this paper, a high-quality crew scheduling method is developed to ensure as many flights as possible are made to meet the crew configuration. The airline crew scheduling problem is a constraint integer linear programming problem, which is NP-hard [33,34]. Because the flight and the crew that satisfy the constraints can be matched, the possible matching combination increases exponentially as the number of flights and crew increases. Finding a viable crew scheduling solution manually is almost impossible, leading to difficulty in finding a cost-effective solution [25,35].

In the definition of airline crew scheduling, a flight is considered as the take-off and landing of an aircraft. The tasks performed by the crew are divided into fly and deadhead. Fly refers to the crew that includes the captain or co-captain of the flight, while deadhead refers to the crew who board as passengers to take a flight to another airport to perform fly tasks. Duty consists of a series of flights (fly or deadhead) and intervals of rest. Pairing defines a series of duties and layovers that begin and end at the same base (the airport where the crew is stationed) [25]. Traditionally, the airline crew scheduling problem is divided into two sub-problems that need to be solved sequentially, namely CPP and CRP [21,36,37]. The goal of CPP is to find a feasible set of pairings that cover a given set of flights at the lowest cost [26]. CRP is the process of assigning all necessary staff members by using the pairing generated by CPP to create a work schedule for each crew member [38,39]. Alternatively, the CPP and the CRP could be integrated into a single problem [29,40,41,42].

However, traditional crew scheduling models and solvers face the following three problems:

A higher time complexity. An NP-hard large-scale integer programming problem is decomposed into two large-scale integer programming sub-problems, resulting in almost double the time complexity [43,44,45].
Poor generalization and reliance on the feasible sub-solutions generated by initialization. Once faced with the new flight dataset, it is necessary to create a large number of feasible sub-solutions that meet the constraints, which does not facilitate parallel calculation and consumes nearly 80% of the computation time of the total program [29,31,46].
Due to the need to store the large number of feasible sub-solutions and branch-cut tree searches, they usually need to run on servers equipped with professional multi-core and multi-threaded CPUs, such as Intel Xeon processors [21,26].

In order to alleviate the problem of time-consuming initialization, we propose an integrated crew scheduling solver based on the parallel genetic algorithm. The randomly generated flight sequence is defined to guarantee that each flight is represented as a 0–1 independent variable. The value of each position in the flight sequence corresponding to this flight successfully matched with the crew is equal to 1; otherwise, it is 0. After four optimization processes of matching the flight with the crew, the operation of removing ‘0’, returning to base, and pairing adjustment to the flight sequence, we obtain the maximum number of flights successfully matched with a crew as the fitness value and the qualified flight sequence. The genetic algorithm [9,27,47] performs evaluation, selection, crossover and mutation for the flight sequence to calculate the optimal flight sequence. MPI is used to divide the population into multiple threads for parallel calculation [48,49]. Drift strategy performs information exchange between subpopulations in parallel calculation. We integrate the above algorithms and develop the computational framework PGAF. Based on the parallel genetic algorithm and innovative crew scheduling algorithm, our heuristic solver achieves global random search iterative optimization and parallel calculation to match and optimize flights and crew directly. Our solver avoids spending multiple hours on initialization to generate feasible sub-solutions and improves the generalization with lower time complexity and model complexity. Meanwhile, the parallel optimization efficiency depends on the number of processor cores and running memory, but it can be run on ordinary personal computers and maximizes the use of processing cores and running memory to improve optimization efficiency. Its advantages can be exploited to a greater extent in formal laboratory clusters.

The integrated crew scheduling solver with the computational framework PGAF performs a parallel calculation of innovative the crew scheduling algorithm and genetic algorithm to ensure that the number of flights that meet the crew configuration can be maximized and then improves optimization efficiency. It avoids high time complexity and model complexity, which has high generalization and is suitable for research in common experimental environment. Compared with the traditional genetic algorithm and commercial solvers, such as CPLEX and Gurobi, which are at the core of the traditional crew scheduling model, our solver reduces the computation time by 16.57–85.82% on a small-scale dataset with 206 flights. Meanwhile, a more than 45.75% reduction in calculation time is achieved on a large-scale dataset with 13,954 flights, requiring only 284.59 min. More importantly, it achieves a near−optimal flight matched percentage of 97% and higher crew utilization ratio of nearly 44 flights than traditional models with a DCA [40] and GENCOL column generation library with the commercial solver CPLEX [26], which is more suitable in solving crew scarcity and reduces crew costs.

The remainder of this paper is structured as follows. Section 2 is the literature review on airline crew scheduling. Section 3 describes the construction of the integrated airline crew scheduling model, including its assumptions, objectives and constraints. Section 4 introduces the parallel genetic algorithm in crew scheduling, including the innovative crew scheduling algorithm, genetic algorithm, MPI multi-thread parallel calculation, and drift strategy. Section 5 demonstrates the calculation results, and we provide the conclusions in Section 6.

2. Literature Review

2.1. Description of CPP and Research Results

Let F be a set of flights that are likely to be operated in a given period (usually a week or a month) and

Ω

be the set of all feasible pairings that can be used to cover this set of flights. Let

a_{f}_{p}

,

f \in F

,

p \in Ω

be a constant equal to 1 if pairing p contains flight f, and 0 otherwise. Let

c_{p}

be the cost of this pairing. For each

p \in Ω

, we define

x_{p}

as a binary variable with a value of 1 if pairing p is selected, and 0 otherwise. Therefore, the solution of the crew pairing problem (CPP) is usually transformed into the solution of the set-partitioning problem as follows [26,30,39,45]:

\sum_{p \in Ω} c_{p} x_{p}

(1)

s . t . \sum_{p \in Ω} a_{f}_{p} x_{p} = 1 \forall f \in F

(2)

x_{p} \in \{0, 1\} \forall p \in Ω

(3)

where (1) is the objective function, (2) ensures each flight is only covered once by the pairing set exactly, and (3) enforces a binary value on whether the pairing is selected.

Frédéric Quesnel et al. [25] proposed an acceleration technique to compute high-quality solutions at the root node of the search tree. Desaulniers et al. [30] combined dynamic constraint aggregation (DCA) with column generation (CG) [45], and DCA in rolling horizon (RH) [23], which provided a wider time window and produced a better solution. In CPP instances with up to about 50,000 flights per month, the calculation time is reduced by 33% compared to traditional CG/RH, of just 60 h and 20 min. Yaakoubi et al. [50] improved the solver of Desaulniers et al. [30] by adding a dynamic control strategy to obtain an efficient solver for large-scale CPP, named Commercial-GENCOL-DCA, and used machine learning to generate high-probability consecutive executions by the same crew in the flight cluster.

2.2. Description of CRP and Research Results

Let M be the set of crew members, let B be the set of bases of the crew members, and

M_{b}

be the set of crew members that belongs to base

b \in B

. Let

Γ

be a set of feasible pairings such that

\sum_{p \in Γ} a_{f}_{p} = 1, \forall f \in F

(i.e., each flight is exactly covered by one pairing in

Γ

).

For the CRP, Gamache et al. [31] introduced a column-generating method capable of solving paired instances with up to 3000 task rings in less than 4 h. Similar results were obtained by Kasirzadeh et al. [26] for paired instances containing up to 1648 mission loops and 305 crew members. Frédéric Quesnel et al. [25] developed a branch-and-price heuristic model and solved an instance with language constraints in only 6.5 h using the dataset.

2.3. The Integrated Crew Scheduling and Research Results

Traditional airline crew scheduling is divided into two sub-problems, namely CPP and CRP, and solved sequentially. In general, fully integrating these subproblems into a single operation to find a more economical plan remains a difficult issue. Therefore, improving cost-effectiveness by integrating airline crew scheduling into a single problem has been the goal of some scholars in recent years [41,51]. Papadakos first proposed several integration models for optimizing airline scheduling and solved them by applying enhanced Benders decomposition methods combined with accelerated column generation [41]. Wu considered the quality and efficiency of pairing formation in crew scheduling, and optimizes the pairing formation in terms of the airline operating cost, production schedule stability, production schedule quality, and flight time balance by using Genetic Algorithm [42]. Chen et al. proposed a multi-objective combinatorial optimization model by integrating aircraft scheduling and crew scheduling problems, and used the NSGA-II variant method combined with a repair strategy to solve the problem [24]. Liu et al. proposed an optimal solution to airline crew scheduling based on satisfiability modulo theories (SMTs), which convert various constraints in airline crew scheduling into a first-order logical formulation and solve it using the SMT solver Z3 [51].

3. Airline Crew Scheduling

3.1. Model Assumptions

In order to make airline crew scheduling more applicable and to ensure it is in compliance with the real common needs of different commercial airlines, we make the following assumptions for our crew scheduling model:

Only pilots are considered for the crew of our model. Pilots, flight attendants, flight police, etc., are not interchangeable. However, due to the constraints and commonality of their scheduling, our crew scheduling model is equally applicable.
Crew scheduling only considers the human resources planning, without considering airline hardware resources, such as fleet resources, ground facilities, communication and navigation facilities.
Flight scheduling is determined without considering flight delays in our model. That is, every flight departs and arrives on time according to its schedule. All crew members work according to schedules and scheduling plans, without personal preferences, contingencies, etc.

3.2. Problem Analysis

Based on the feasible path method,

P^{k}

is denoted as the flight number sequence of pilot k performing fly and deadhead.

Q^{k}

is the flight number sequence of pilot k only performing deadhead.

P^{k} (m)

is denoted as the flight number at the mth position in

P^{k}

. If pilot k performs deadhead on the flight with flight number

P^{k} (m)

, then

P^{k} (m) \in Q^{k}

, otherwise

P^{k} (m) \cap Q^{k} = Ø

. Assuming that each flight only needs one captain and one co-captain to satisfy the crew configuration, there are

\underset{k \in N C}{\cap} P^{k} \ Q^{k} = Ø

, where NC is the set of pilot.

For duty, the sequence

B^{k}

is introduced to record the position, where each duty starts in

P^{k}

.

B^{k} (r)

is denoted as the number at the rth position in

B^{k}

, which is the position where the rth duty starts in

P^{k}

; therefore,

B^{k} (r) \subseteq m

. We note that

B^{k} (r) = b_{r}^{k}

and we know that the flight with the flight number from

P^{k} (b_{r}^{k})

to

P^{k} (b_{r + 1}^{k} - 1)

in

P^{k}

is a duty of pilot k; therefore,

b_{1}^{k} = 1

.

When dividing the pairing for crew scheduling, the sequence

D^{k}

is introduced to represent the starting position of each pairing in the duty sequence

B^{k}

.

D^{k} (s)

is denoted as the number at the sth position in

D^{k}

, which is the position where the sth pairing starts in

B^{k}

; therefore,

D^{k} (s) \subseteq r

and we define

D^{k} (s) = d_{s}^{k}

, so we find that the flight with the flight number from

P^{k} (B^{k} (d_{s}^{k}))

to

P^{k} (B^{k} (d_{s + 1}^{k} - 1))

in

P^{k}

is a pairing of pilot k, which means that

d_{1}^{k} = 1

.

3.3. Model Description

With the changing attitude toward COVID-19 in various countries and regions, the air transport industry will gradually return to normal levels [7,8,9,10,11]. In view of crew scarcity, the number of planned monthly flights exceeds the availability of crew members, which means that it is possible for some flights to not have crew matching; therefore, the necessary crew configuration of the flight cannot be met. In this paper, a high-quality crew scheduling method is developed to ensure that as many flights as possible meet the necessary crew configuration. The crew scheduling problem is transformed into the following optimization problem with constraints:

m a x n_{F}

(4)

According to the financial revenue of commercial airlines, the primary goal of airline crew scheduling is to maximize the number of flights that can satisfy the crew configuration in crew scarcity [52,53].

n_{F}

is the total number of flights meeting the crew configuration; therefore, our objective function is to maximize

n_{F}

:

{D T}^{P^{k} (m + 1)} - {A T}^{P^{k} (m)} ⩾ M i n C T \forall k, m

(5)

The connection time between two adjacent flights of pilot k to have interval rest not less than

M i n C T

:

\sum_{m} \sum_{i} F_{i}^{P^{k} (m)} = \sum_{m} \sum_{i} {\bar{F}}_{i}^{P^{k} (m)} \forall k, m, i

(6)

The constraint condition of flow conservation.

\sum_{m} \sum_{i} F_{i}^{P^{k} (m)}

is used to measure the indegree of airport i in

P^{k}

of pilot k;

\sum_{m} \sum_{i} {\bar{F}}_{i}^{P^{k} (m)}

is used to measure the outdegree of airport i in

P^{k}

of pilot k. According to the requirements, no aircraft will stay at airports without base attribute and they should come back to their base, so the airport’s outdegree is equal to the indegree. This ensures all aircraft and pilots eventually return to their bases:

[{D T}^{P^{k} (b_{r}^{k})}] < [{D T}^{P^{k} (b_{r + 1}^{k})}] \forall r, k

(7)

Comparing the start time of two adjacent duties with their dates means they cannot be on the same day.

{D T}^{P^{k} (b_{r + 1}^{k})} - {A T}^{P^{k} (b_{r + 1}^{k} - 1)} ⩾ M i n R T b A D \forall r, k

(8)

The rest time between two adjacent duties of pilot k is not less than

M i n R T b A D

:

{A T}^{P^{k} (b_{r + 1}^{k} - 1)} - {D T}^{P^{k} (b_{r}^{k})} ⩽ M a x D T \forall r, k

(9)

The maximum time of each duty of pilot k is not more than

M a x D T

:

\sum_{b_{r}^{k} ⩽ m < b_{r + 1}^{k}} ({A T}^{P^{k} (m)} - {D T}^{P^{k} (m)}) - ({A T}^{Q^{k} (m)} - {D T}^{Q^{k} (m)}) ⩽ M a x D F T \forall r, k

(10)

The maximum flight time of each duty, because the time when the pilot performs deadhead belongs to the time of duty, but not the fly time of duty, has to subtract the deadhead time

({A T}^{Q^{k} (m)} - {D T}^{Q^{k} (m)})

on duty:

{A T}^{P^{k} (B^{k} (d_{s + 1}^{k} - 1))} - {D T}^{P^{k} (B^{k} (d_{s}^{k}))} ⩽ M a x P T \forall s, k

(11)

The maximum total time of pairing is not more than

M a x P T

in the crew scheduling:

{D T}^{P^{k} (B^{k} (d_{s + 1}^{k}))} - {A T}^{P^{k} (B^{k} (d_{s + 1}^{k} - 1))} ⩾ M i n R T b A P \forall s, k

(12)

The interval rest time between two adjacent pairings of pilot k is not less than

M i n R T b A P

:

[{D T}^{P^{k} (b_{r + M a x S u c c O n}^{k})}] - [{D T}^{P^{k} (b_{r}^{k})}] \geq M a x C D D \forall r, k

(13)

For the upper limitation of continuous duty days, the continuous duty can only belong to the same pairing due to the limitation of the minimum rest time between pairings. For

\forall r

, if continuous duty days more than

M a x C D D

, the crew scheduling plan of this pilot does not meet the requirements.

Table 1 is a summary of the variables and their explanations mentioned in the above mathematical model:

4. The Proposed Airline Crew Scheduling with Parallel Genetic Algorithm

The mathematical model of airline crew scheduling proposed in the Section 3 is solved in this section. An innovative crew scheduling algorithm is proposed, which contains four optimization processes of matching the flight with the crew, the operation of removing ‘0’, returning to base, and pairing adjustment, to adjust the randomly generated flight sequence. After adjustment, we obtain the maximum number of the flight matched with the crew as the fitness value and the qualified flight sequence. The genetic algorithm performs evaluation, selection, crossover and mutation for the flight sequence to calculate the optimal flight sequence and maximum number of flights that meet the crew configuration. MPI is used for multi-thread parallel calculation to improve the calculation efficiency of the genetic algorithm. The drift strategy leads to the effect of information exchange between subpopulations after mutation in genetic algorithm parallel calculations. Algorithm 1 is the mathematical model for airline crew scheduling using the parallel genetic algorithm.

Algorithm 1 Heuristic crew scheduling algorithm.

1:: Process data to meet input standards
2:: Generate the initial population
3:: Divide the subpopulation of thread
4:: Match crew and flight according to flight sequence
5:: Adjust the flight sequence
6:: for $i = 0$ … do
7:: if all flights are matched or $i \geq m a x_i t e r a t i o n$ then
8:: return flight sequence with the highest number
9:: break;
10:: else
11:: Calculate the number of flights after adjustment
12:: Perform selection, crossover, mutation
13:: Execute Drift Strategy on subpopulation
14:: Update flight sequence as new population
15:: end if
16:: $i = i + 1$
17:: end for

The following section details the process of solving the model. Section 4.1 describes the preprocessing of flight data, which standardizes the flight data and crew data. Section 4.2 provides an introduction to generating the initial randomly generated flight sequence (generating initial populations) and the input of the genetic algorithm. Section 4.3 explains the process of matching the crew member and the planned flight based on the randomly generated flight sequence. Section 4.4 shows the three detailed steps used to adjust the randomly generated flight sequence, which ensures that the maximum number of flights meet the crew configuration under each flight sequence. Section 4.5 provides a description of the parallel genetic algorithm, with the construction of the genetic algorithm, MPI multi-thread parallel calculation, and drift strategy.

4.1. Data Preprocessing

Sort the planned flight data from smallest to largest by departure time. (Note: in this paper, flight rank number

r n u m

refers to the position of the sorted flight in the randomly generated flight sequence

L i s t

and the planned flight data

F l i g h t

, and flight number

n u m

refers to the number of each flight in

F l i g h t

.)

Assume that there are m crew members in

C r e w

, and each crew member is recorded as

c r e w_{i}

. For the crew information data

C r e w

, we add the following six kinds of information:

$F D$ : The list of $r n u m$ performing fly and deadhead.
H: The list of $r n u m$ only performing deadhead.
$D P$ : Departure time.
$A R$ : Arrival time.
$A I R$ : The passing airport for each flight performing fly and deadhead (the first passing airport of each flight is its base).
$D T$ : Duty list.

4.2. Generate the Initial Randomly Generated Flight Sequence—Initial Population

Suppose there are n flight in

F l i g h t

, and

L i s t

is the list that records whether each flight takes off, so

L i s t

is an n-dimensional vector. We write down the take-off situation of the ith flight in

F l i g h t

as

l i s t_{i}

, and its value is as follows:

l i s t_{i} = \{\begin{matrix} 1, & the i t h flight take off \\ 0, & the i t h does not flight take off \end{matrix}

(14)

So

L i s t = {[l i s t_{1}, l i s t_{2}, \dots \dots, l i s t_{n - 1}, l i s t_{n}]}^{T} = {[1, 0, 0, \dots \dots, 0, 1, 1, 0]}^{T}

, and

L i s t

is the chromosome in the genetic algorithm. If 10

L i s t

is required as the initial population

L I S T

in the genetic algorithm, then

L I S T = (L i s t_{1}, L i s t_{2}, \dots \dots, L i s t_{10})

; its population size is

N P

= 10. Figure 1 shows the population of the genetic algorithm.

4.3. Match the Crew and the Flight

We searched

L i s t

from the beginning. When the ith position in

L i s t

is retrieved, if

l i s t_{i}

= 0, the flight does not take off, and crew matching is not required. If

l i s t_{i}

= 1, we will verify whether the flight with

r n u m

= i can satisfy the crew configuration, and we also verify whether it is possible to find qualified crew members. The matching is based on the fact that the currently located airport of the crew member is the same as the departure airport of the flight and the crew is allowed to perform the fly task at the departure time point of the flight. If the flight cannot meet the crew configuration requirements, it is not allowed to take off and

l i s t_{i}

is changed to 0. If successfully matched, i.e., if the crew can be found to meet the crew configuration requirements, we select the crew with the lowest work cost from the crew members who meet the requirements, and update the information of the selected crew in

C r e w

. The pseudo code is shown in Algorithm 2.

Algorithm 2 Match the crew and the flight.

1:: $len (F l i g h t) = n$
2:: for $i = 1 \dots \dots n$ do
3:: if $L i s t [i] = 0$ then
4:: $i = i + 1$
5:: else
6:: if (the set of crew meet requirements) $C = Ø$ then
7:: $L i s t [i] = 0$
8:: else
9:: Select the crew with lowest cost in C
10:: Update these crews’ information in $C r e w$
11:: end if
12:: $i = i + 1$
13:: end if
14:: end for

4.4. Adjust the Randomly Generated Flight Sequence

4.4.1. Operation of Removing ‘0’

We traversed

L i s t

, as adjusted in Section 4.3. Suppose

l i s t_{i}

= 0, that is, the flight with

r n u m

= i does not take off. First of all, we find that the crew can match this flight from

C r e w

, which is adjusted in Section 4.3. If we find out the crew requirements, we update their information.

If we cannot find out the crew match for this flight, traverse forward from the

i - 1

position of

F l i g h t

to find the flight from the airport with base attribute to the departure airport of this flight that meets the time requirement. Suppose that the flight with

r n u m

= j meets these requirements. Find the crew who can perform fly in the flight with

r n u m

= j and perform deadhead in the flight with

r n u m

= i from

C r e w

adjusted in Section 4.3. If the flight and the crew meeting above requirements are found, update

C r e w

and change

l i s t_{i}

to 1. If it cannot be found after the traversal is completed, the flight cannot take off.

4.4.2. Handling of Crew Returning to Base

Adjust

C r e w

for each crew member finally needs to return to their base. After successfully matching

F l i g h t

with

C r e w

through above operations, we also need to meet the constraint for each crew to initially depart from base and eventual return to base. We find that the final airport of some crew members is not the base they belong to in

C r e w

. So we will return the unqualified crew members to their base by performing fly or deadhead tasks. Assuming that

c r e w_{i}

does not meet this requirement, the following is how to deal with it.

Solution 1: Find the flight with the closest departure time to the time point when

c r e w_{i}

is allowed to perform the task and meet the requirement that the flight that departs from the airport

c r e w_{i}

is located to his base in

L i s t

adjusted above. If the sign of this flight is

l i s t_{i}

= 0,

c r e w_{i}

performs the fly task on this flight and we change the sign of the flight to

l i s t_{i}

= 1. If the sign of this flight is

l i s t_{i}

= 1, then

c r e w_{i}

performs the deadhead task. Finally, update the information of

c r e w_{i}

according to the above situation.

Solution 2: If the flight in the above situation does not exist, the crew will not be able to return to his base by performing fly or deadhead. We find the last

r n u m

of

c r e w_{i}

in his

F D

and delete all information about this flight from

c r e w_{i}

, so that this flight does not meet the crew configuration. We modify

l i s t_{r n u m}

to 0 in

L i s t

. The information of

c r e w_{i}

is used for performing the previous flight. If the final airport of the crew at this state is the base he belongs to, the requirements are met; otherwise, we apply Solution 1.

When using Solution 2, we still cannot solve the problem of

c r e w_{i}

eventually returning to his base; repeat the solution and the worst result is that

c r e w_{i}

does not match any flight. The pseudo code of Section 4.4 is shown in Algorithm 3.

Algorithm 3 Adjust the randomly generated flight sequence.

1:: $len (F l i g h t) = n$
2:: $len (C r e w) = m$
3:: for $i = 1 \dots \dots n$ do
4:: if $L i s t [i] = 1$ then
5:: $i = i + 1$
6:: else
7:: if (the set of crew meets requirements) $C \neq Ø$ then
8:: Select the crew with lowest cost in
9:: Update these crew information in $C r e w$
10:: else
11:: $j = i - 1$
12:: while $j ⩾ 1$ do
13:: if $F [j] a r_a p$ = $F [i] . d e_a p$ and $F [j] . d e_a p in b a s e$ then
14:: Update crew information in $C r e w$
15:: continue;
16:: else
17:: $j = j - 1$
18:: end if
19:: end while
20:: end if
21:: end if
22:: end for
23:: for $i = 1 \dots \dots m$ do
24:: while $C r e w [i] . A I R [l a s t] \neq C r e w [i] . b a s e$ do
25:: for $r n = C r e w [i] . F D [l a s t] \dots \dots n$ do
26:: if $F [r n] . d e_a p = C [i] . A I R [l a s t]$ and $F [r n] . a r_a p = C r e w [i] . b a s e$ then
27:: Update these crews’ information in $C r e w$
28:: break;
29:: end if
30:: if $r n u m n$ then
31:: Delete the last flight information of $C r e w [i]$
32:: end if
33:: end for
34:: end while
35:: $i = i + 1$
36:: end for

4.4.3. Pairing Adjustment

We adjust the crew that does not meet the constraints of pairing in

C r e w

after the above operations. Each crew has a duty list

D T

, which is used to record the information of each duty. According to the constraint that both the departure and arrival airport belong to the base of the crew,

D T

is divided into pairings, and each pairing is independent of the other. After dividing

D T

into pairings, judge whether each pairing meets the constraints, find out whether the pairings do not meet the constraints and find the other crew members who can complete the pairings. If there are no other crew members who can complete them, this input is illegal.

4.5. Construction of Parallel Genetic Algorithm

T h r e a d D I m

threads are allocated in CPU by MPI; each thread is responsible for calculating

E a c h T h r e a d

chromosomes, that is, the total population is divided into

T h r e a d D I m

subpopulations. Each subpopulation contains

E a c h T h r e a d

solution. On the premise that the number of cores occupied by each thread is an integer, each thread performs parallel computations on allocated cores and memory.

The fitness function of the Genetic Algorithm is

f (x) = n e w_s u m (x)

, and x is the randomly generated flight sequence before processing

L i s t

. We remember the randomly generated flight sequence after it is adjusted as

L i s t_p r

, and the fitness function can be expressed as follows:

F (L i s t) = n e w_s u m (L i s t) = s u m (L i s t_p r)

(15)

It is the number of 1 in

L i s t_p r

.

Calculate the fitness function

F_{i} = F (L i s t_{i})

and sort from large to small according to the fitness value for all chromosomes in each subpopulation

s u b p

. Record each maximum value

F_{B e s t_{s u b p}}

and its chromosome

B e s t_{s u b p}

. Note the global maximum fitness value

F_{B e s t} = m a x F_{B e s t_{s u b p}}

and the corresponding chromosome

B e s t

.

Figure 2 demonstrates the iterative process of fitness function calculation, selection, crossover and mutation in the genetic algorithm. The selection strategy of the genetic algorithm is roulette wheel selection. The probability of each chromosome in the population being selected for genetic operations is proportional to its fitness value. In crossover, we use a single-point crossover method, which obtains two different sub-chromosomes by selecting two chromosomes, dividing them at a randomly selected position, and exchanging their right part. The traditional method of single gene mutation is used in mutation.

After the above four operations are completed, we need to exchange information between each subpopulation. The drift strategy is to replace the first-to-last, second-to-last and

E x c h a n g e

-to-last solution in the subsequent

E x c h a g e

subpopulations with its optimal solution, respectively. Figure 3 shows the drift strategy, supposing there are a total of four subpopulations and

E x c h a n g e = 3

.

For each new subpopulation obtained, if the maximum fitness value of the subpopulation

s u b p

is greater than

F_{B e s t_{s u b p}}

, update

B e s t_{s u b p}

and

F_{B e s t_{s u b p}}

; otherwise, there is no need to update

B e s t_{s u b p}

. It is the same for

F_{B e s t}

and

B e s t

.

5. Experiments

The algorithm proposed in this paper is coded using PYTHON 3.7.6. Microsoft MPI v10.0. CPLEX 12.8.0. Gurobi 9.1.1. All experiments were executed on a computer equipped with 14-core Intel(R) Core(TM) i7-12700H CPU @ 2.3 GHz and 16 GB memory. Section 5.1 provides a description of our dataset. Section 5.2 shows the experimental results. Section 5.3 provides a comparison of experiments according to different solvers and datasets.

5.1. Dataset Description

The experimental data were derived from “Huawei Cup”, The 18th China Post-Graduate Mathematical Contest, which contains two datasets, as shown in Table 2. Table 3 describes the values of parameters for the airline crew scheduling.

5.2. Experimental Results

For two datasets, random independent experiments are performed 20 times for 10 cases in which the value of 10–100% of positions is 1 in

L i s t

; then, we take the average value of each case.

5.2.1. Dataset A Result

Figure 4 displays the experimental results of Dataset A. The left y-axis is the number of flights after adjustment and the right y-axis is the ratio of the number of flights after adjustment to the initial number of flights. When the initial number of flights taking off is increased, the number of flights after adjustment also increases, and their ratio also increases and tends to 1. The maximum number of flights after adjustment is 199, which accounts for 96.60% of the total number of flights in Dataset A.

Because the above experiments are only randomized independent experiments for 10 cases, we did not experiment with each possible initial value. We propose an assumption, which is that there is a kind of

L i s t

whose initial values are not all 1, but the result of

L i s t_p r

is better than a case where the initial values are all 1 (assuming that all flights take off at the beginning). The number of flights satisfying the crew configuration in

L i s t_p r

is greater than 199.

We use the algorithm proposed in Section 4 to retrieve and optimize

L i s t

and use the parallel genetic algorithm to complete the optimal value iteration. According to the changing trend of Figure 4, we set the value of 80% random position in

L i s t

to 1 as the chromosome of the initial population, which can accelerate the iteration and make it difficult to reach the local optimal value. After comparative experiments, we take the most stable and optimal results, in which the population is set as 40, the maximum iteration is 100, and the number of genes on the chromosome is the total number of flights, i.e., 206.

To reduce the chance of experimental results, make them more convincing and reflect the stability of our algorithm, 20 repeated experiments were performed under these parameters. We chose the average optimal value instead of the maximum optimal value as the more representative result. In order to show the superiority of the drift strategy with the subpopulation proposed in Section 4.5, we selected the fourteen-thread parallel genetic algorithm with drift strategy for iterative optimization and compared it with the traditional single-thread serial genetic algorithm without drift strategy based on the results of the following parallel calculations. Figure 5 shows the change in the optimal values in the single-thread and fourteen-thread methods. The red dots represent the change in the maximum optimal value in each iteration of the 20 repeated experiments, and the blue line represents the change in the average optimal value. (a) shows that the maximum optimal value is equal to the average optimal value, which is 206; the improvement with the single-thread serial genetic algorithm is 3.40%. The maximum optimal value converges at iteration number 69, and the average optimal value converges at iteration number 73. In contrast to (b), which uses the fourteen-thread parallel genetic algorithm with drift strategy, the optimal value for both is 206 and the improvement is also 3.40%, but the maximum optimal value converges at iteration number 52, and the average optimal value converges at iteration number 57. Clearly, the multiple-thread with drift strategy has faster convergence, which benefits from the information exchange effect between subpopulations.

We integrated the genetic algorithm and the MPI tool package in Python mpi4py into a Python tool package named PGAF (a parallel genetic algorithm framework in Python). Consequently, PGAF is a computational framework that integrates an innovative crew scheduling algorithm, traditional genetic algorithm, drift strategy, and a tool package in Python named mpi4py that acts on MPI for computer multi-thread communication. On the premise that the number of cores occupied by each thread is an integer, we have four divisions of threads, namely single-thread, double-thread, seven-thread, and fourteen-thread. Figure 6 shows the calculation time and optimization ratio under the parallel calculation of different threads.

The optimization ratio of

T h r e a d D I m

-thread is defined as

\frac{s i n g l e - t h r e a d c a l c u l a t i o n t i m e}{T h r e a d D I m - t h r e a d c a l c u l a t i o n t i m e}

. Using MPI for the parallel genetic algorithm greatly reduces the calculation time of the traditional genetic algorithm by 1/5 and improves the efficiency of our solver in the fourteen-thread method. With the increase in the number of threads, the calculation time gradually decreases; however, due to the priority of computer memory, the ratio of optimization to the number of threads also gradually decreases.

5.2.2. Dataset B Result

Figure 7 shows the experimental results of Dataset B. When the initial number of flights that take off increases, the number of flights after adjustment also increases, but their percentage first decreases and then increases, and tends to 1. The maximum number of flights after adjustment is 13,039, which is 93.44% of the total number of flights in Dataset B.

We also make the above assumption for Dataset B. To verify this, we also use the parallel genetic algorithm to search and optimize

L i s t

. In Figure 7, the number of flights after adjustment increases with the increase in the number of flights that take off; however, the ratio decreases first and then increases, and reaches the minimum at 0.3. It is not until ratio = 0.7 that it begins to perform better than 0.1. Therefore, a ratio below 0.7 is not ideal. So, we set the value of 70% random position in

L i s t

to 1 as the genetics of the initial population. Similarly, after several comparative experiments, the population is set as 60, the maximum iteration is 200, and the number of genes on the chromosome is the total number of flights, which is 13,954.

Similarly, a total of 20 repeated experiments were performed for Dataset B under the above parameters. Based on the effect of the following parallel calculations, we selected the seven-thread parallel genetic algorithm with drift strategy for iterative optimization and compared it with the traditional single-thread serial genetic algorithm without drift strategy. Figure 8 shows the change in the optimal values in single-thread and seven-thread. Similarly, the red dots represent the change in the maximum optimal value in each iteration of 20 repeated experiments, and the blue line is the change in the average optimal value. (a) shows that the maximum optimal value is 13,548 and the average optimal value is 13,480, and the error rate between the average optimal value and the maximum optimal value is 0.502%. We choose the average optimal value as the more representative result by iteration, and the improvement with the single-thread serial genetic algorithm is 3.16%. The maximum optimal value converges at iteration number 146, and the average optimal value converges at iteration number 151. Compared with (b) using the seven-thread parallel genetic algorithm with drift strategy, the maximum optimal value is 13,532, and the average optimal value is 13,496, where the improvement is 3.28% and the error rate is 0.266%. The maximum optimal value converges at iteration number 104 and the average optimal value converges at iteration number 113. Apparently, the seven-thread parallel genetic algorithm with drift strategy has higher efficiency, stability and faster convergence of airline crew scheduling optimization, which benefits from the information exchange effect between subpopulations.

Similarly, we use the PGAF encapsulated with mpi4py to perform the parallel genetic algorithm in Dataset B. The calculation time and optimization ratio are shown in Figure 9. When the number of threads is not greater than 7, the optimization ratio increases, and the calculation time decreases gradually with the increase in the number of threads. However, when the number of threads is 14, the optimization ratio is not as good as it is when fewer threads are implemented. Because of the multi-thread parallel calculation, MPI divides the CPU and running memory according to the preset thread, and the CPU and running memory of each thread are independently calculated. Since the running memory occupied by each thread is almost equal, and the total running memory of the computer is determined, the more threads there are, the less CPU and running memory they occupy. We assume there is a certain threshold that makes it impossible to support the running memory required by the program, and the running efficiency is greatly slowed down.

Although there is an inflection point of the optimization ratio in Dataset B with the increase in the number of threads, this inflection point does not appear in Dataset A. As for the mechanism of computer CPU and running memory allocation described above, the size of Dataset B is about 67.38 times that of Dataset A. According to the innovative crew scheduling algorithm in Section 4, the model complexity is

n \times {(l o g n)}^{2}

, and the required running memory is positively correlated with the complexity. Therefore, there are no inflection points in Dataset A; however, inflection points appear in Dataset B that are largely caused by the size of the dataset.

According to the experimental results of Dataset A and Dataset B, the maximum number of flights after adjustment is significantly improved by 3.40% and 3.28%, respectively, after the iterative calculation using the single-thread genetic algorithm, which is further improved when the flight matched percentage is more than 93%. More importantly, we use PGAF, a Python tool package that encapsulates the genetic algorithm and MPI, to perform the parallel genetic algorithm iterative calculation, which greatly shortens the calculation time and improves our model execution efficiency by 2.5 to 5 times. Meanwhile, the iterative convergence of parallel computation is faster, and the error rate between the average optimal value and maximum optimal value is lower and more stable.

5.3. Experiment Comparison

5.3.1. Comparison of Parallel Genetic Algorithm, CPLEX and Gurobi

We use the nonlinear integer programming solver CPLEX, large-scale mathematical programming optimizer Gurobi and the parallel genetic algorithm to solve Dataset A and Dataset B. The experimental comparison results are shown in Table 4. There are more flights that meet the crew configuration in the shortest time when using the parallel genetic algorithm, while CPLEX cannot solve large-scale planning problems directly. The time complexity of Gurobi is positively related to the data size, and it does not facilitate parallel calculation. Our solver has a relative reduction of 16.57–85.82% in time consumption, and the larger the dataset, the more obvious the optimization; therefore, our solver has good scalability. The difference in the number of flights matched in different threads is caused by drift strategy, and its improvement is 0.119%.

5.3.2. Comparison with Current Research Results

The current method is to use the commercial GENCOL Column Generation Library to find the initial cluster of the pairing and then use CPLEX as the solver to optimize the pairing. Saddoune et al. [40] solved CPP and CRP with DCA. Kasirzadeh et al. [26] solved CPP and CRP with a simple preferences of up to 7765 flights and 305 crew for a large dataset. Table 5 shows the comparison of results between ours and theirs.

It can be seen in Table 5 that, regardless of smaller or larger datasets, our solver can achieve almost the same flight matched percentage, so it has good stability. Meanwhile, there are fewer crew members required, and our crew utilization ratio is higher, which is more favorable when faced with crew scarcity. The higher the crew utilization ratio, the less the crew is used, reducing the cost of the crew. Our model and algorithm are traditional with few parameters, no need to use some commercial solvers and computing machines with large running memory, and lower requirements on software and hardware.

6. Conclusions

We improve the traditional airline crew scheduling model by integrating the different problems into a single problem, and match the flight and crew that meet the requirements by performing local-scope optimal search and parallel calculation. A series of operations are innovatively proposed to ensure the number of flights that meet the crew configuration can be maximized. The genetic algorithm calculates the maximum number of flights that satisfy the crew configurations in each

L i s t

by iterating through the global search. The drift strategy provides higher efficiency, stability and faster convergence of airline crew scheduling optimization with the information exchange effect between subpopulations. Using multi-thread parallel calculation with MPI greatly reduces the calculation time, improves our solver efficiency and optimizes the high time complexity of generating initial feasible clusters in traditional methods. We integrate the innovative crew scheduling algorithm, genetic algorithm, drift strategy, and MPI into a computational framework PGAF, which is the synthesis of all algorithms and innovations in this paper. In the experimental results of Dataset A and Dataset B, the maximum number of flights after adjustment is significantly improved by 3.40% and 3.28%, respectively, with the genetic algorithm. The parallel genetic algorithm greatly shortens the calculation time and improves our model calculation efficiency by 2.5 to 5 times. Furthermore, the iterative convergence of parallel computation is faster, and the error rate between the average optimal value and maximum optimal value is lower and more stable with drift strategy. Compared with the traditional genetic algorithm, commercial solver CPLEX and Gurobi, the parallel genetic algorithm has significant optimization efficiency in terms of calculation time due to a reduction of 16.57–85.82% for time complexity and the maximum number of flights that meet the crew configuration being much higher. A comparison with the methods of the traditional model shows that our solver has good scalability and stability and that the flights-matched percentage is high and stable, of up to nearly more than 97% both in small and large datasets. Furthermore, fewer crew members are required and the crew utilization ratio is higher by our solver, achieving almost 44 flights per month; therefore, it could be universally applicable in times of crew scarcity. It has lower requirements for the software and hardware, which is more suitable for a general experimental research environment and has a strong generalization ability.

7. Discussion

Traditional airline crew scheduling is divided into two sub-problems and solved sequentially. We improve the time-effectiveness by integrating airline crew scheduling into a single problem, which has been the goal of some scholars in recent years. The experiments show that it alleviates the problem of time-consuming initialization and has a lower time and model complexity, resulting in better generalizability.

The airline crew scheduling problem is a type of resource scheduling problem. Using the solver based on the parallel genetic algorithm for an integrated solution means that the constraints of time costs of CPLEX and the commercial solver Gurobi are overcome so as to obtain a large number of feasible solutions for comparative optimization. Compared with the traditional intelligent optimization method, the genetic algorithm has the characteristics of multivariate coding processing and random optimization iteration, and parallel computing greatly shortens the total calculation time and improves the efficiency of our solver.

For MPI used in parallel calculation, if there is a laboratory cluster with multiple high-configuration hosts sharing a large running memory server, the optimization efficiency will be greatly improved.

We assume that if an airline’s scheduling is relatively stable, then we can use the machine learning method to train a scheduling prediction model that can predict the next most likely flight based on the information of the previous flight, using the scheduling of previous years as training data.

Author Contributions

Conceptualization, W.O.; methodology, W.O.; software, W.O.; validation, W.O.; formal analysis, W.O.; investigation, W.O.; resources, W.O.; data curation, W.O.; writing—original draft preparation, W.O.; writing—review and editing, X.Z.; visualization, W.O.; supervision, X.Z.; project administration, W.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets used in this paper to obtain the experimental results are publicly available at GitHub (https://github.com/OuyangWeihao/Airline-Crew-Scheduling accessed on 9 June 2020).

Conflicts of Interest

The authors declare no conflict of interest.

References

Nai, W.; Liu, L.; Wang, S.; Dong, D. An EMD–SARIMA-based modeling approach for air traffic forecasting. Algorithms 2017, 10, 139. [Google Scholar] [CrossRef] [Green Version]
Etschmaier, M.M.; Mathaisel, D.F. Airline scheduling: An overview. Transp. Sci. 1985, 19, 127–138. [Google Scholar] [CrossRef]
Ahmed, A.H.; Poojari, C.A. An overview of the issues in the airline industry and the role of optimization models and algorithms. J. Oper. Res. Soc. 2008, 59, 267–277. [Google Scholar] [CrossRef]
Bazargan, M. Airline Operations and Scheduling; Bazargan, M., Ed.; Routledge: London, UK, 2016. [Google Scholar]
Grosche, T. Airline scheduling process. In Computational Intelligence in Integrated Airline Scheduling; Grosche, T., Ed.; Springer: Berlin/Heidelberg, Germany, 2009; pp. 7–46. [Google Scholar]
Graf, V.; Teichmann, D.; Dorda, M.; Kontrikova, L. Dynamic Model of Contingency Flight Crew Planning Extending to Crew Formation. Mathematics 2021, 9, 2138. [Google Scholar] [CrossRef]
Wang, Z.; Liao, C.; Hang, X.; Li, L.; Delahaye, D.; Hansen, M. Distribution Prediction of Strategic Flight Delays via Machine Learning Methods. Sustainability 2022, 14, 15180. [Google Scholar] [CrossRef]
Civil Aviation Administration of China. 2019 Civil Aviation Industry Development Statistical Bulletin. 2020. Available online: http://www.caac.gov.cn/XXGK/XXGK/TJSJ/202006/t20200605_202977.html (accessed on 6 June 2022).
Ozdemir, H.T.; Mohan, C.K. Flight graph based genetic algorithm for crew scheduling in airlines. Inf. Sci. 2001, 133, 165–173. [Google Scholar] [CrossRef]
Boeing. Pilot & Technician Outlook 2018–2037. Available online: www.useit.com.cn/thread-21629-1-1.html (accessed on 9 June 2020).
Sun, X.; Wandelt, S.; Fricke, H.; Rosenow, J. The Impact of COVID-19 on Air Transportation Network in the United States, Europe, and China. Sustainability 2021, 13, 9656. [Google Scholar] [CrossRef]
Özkır, V.; Özgür, M.S. Two-Phase Heuristic Algorithm for Integrated Airline Fleet Assignment and Routing Problem. Energies 2021, 14, 3327. [Google Scholar] [CrossRef]
Cook, A.; Blom, H.; Lillo, F.; Mantegna, R.N.; Micciche, S.; Rivas, D.; Zanin, M. Applying complexity science to air traffic management. J. Air Transp. Manag. 2015, 42, 149–158. [Google Scholar] [CrossRef] [Green Version]
Evans, A.; Vaze, V.; Barnhart, C. Airline-driven performance-based air traffic management: Game theoretic models and multicriteria evaluation. Transp. Sci. 2016, 50, 180–203. [Google Scholar] [CrossRef]
Vossen, T.W.; Hoffman, R.; Mukherjee, A. Air traffic flow management. In Quantitative Problem Solving Methods in the Airline Industry; Vossen, T.W., Ed.; Springer Publishing House: Boston, MA, USA, 2012; pp. 385–453. [Google Scholar]
Smurov, M.Y.; Gubenko, A.V.; Ksenofontova, T.Y. Interrelation of the problems of the aircraft fleet development and the improvement of the air traffic control system. J. Internet Bank. Commer. 2016, 21, 1–15. [Google Scholar]
Belobaba, P. The airline planning process. In The Global Airline Industry; Belobaba, P., Odoni, A., Barnhart, C., Eds.; John Wiley & Sons: New York, NY, USA, 2009; pp. 153–181. [Google Scholar]
Sha, Z.; Moolchandani, K.A.; Maheshwari, A.; Thekinen, J.; Panchal, J.; DeLaurentis, D.A. Modeling airline decisions on route planning using discrete choice models. In Proceedings of the 15th AIAA Aviation Technology, Integration, and Operations Conference, Dallas, TX, USA, 22–26 June 2015; p. 2438. [Google Scholar]
Bolić, T.; Castelli, L.; Corolli, L.; Rigonat, D. Reducing ATFM delays through strategic flight planning. Transp. Res. E Logist. Transp. Rev. 2017, 98, 42–59. [Google Scholar] [CrossRef] [Green Version]
Jensen, C.K.; Chiarandini, M.; Larsen, K.S. Flight planning in free route airspaces. In Proceedings of the 17th Workshop on Algorithmic Approaches for Transportation Modelling, Optimization, and Systems (ATMOS 2017), Vienna, Austria, 4–8 September 2017. [Google Scholar]
Deveci, M.; Demirel, N.Ç. A survey of the literature on airline crew scheduling. Eng. Appl. Artif. Intell. 2018, 74, 54–69. [Google Scholar] [CrossRef]
Barnhart, C.; Cohn, A.M.; Johnson, E.L.; Klabjan, D.; Nemhauser, G.L.; Vance, P.H. Airline crew scheduling. In Handbook of Transportation Science; Hall, R.W., Ed.; Springer: Boston, MA, USA, 2003; pp. 517–560. [Google Scholar]
Schaefer, A.J.; Johnson, E.L.; Kleywegt, A.J.; Nemhauser, G.L. Airline crew scheduling under uncertainty. Transp. Sci. 2005, 39, 340–348. [Google Scholar] [CrossRef] [Green Version]
Liu, X.; Chou, F.I.; Chou, J.H. Multiobjective evolutionary scheduling and rescheduling of integrated aircraft routing and crew pairing problems. IEEE Access 2020, 8, 35018–35030. [Google Scholar]
Quesnel, F.; Desaulniers, G.; Soumis, F. A branch-and-price heuristic for the crew pairing problem with language constraints. Eur. J. Oper. Res. 2020, 283, 1040–1054. [Google Scholar] [CrossRef]
Kasirzadeh, A.; Saddoune, M.; Soumis, F. Airline crew scheduling: Models, algorithms, and data sets. EURO J. Transp. Logist. 2017, 6, 111–137. [Google Scholar] [CrossRef]
Souai, N.; Teghem, J. Genetic algorithm based approach for the integrated airline crew-pairing and rostering problem. Eur. J. Oper. Res. 2009, 199, 674–683. [Google Scholar] [CrossRef]
Cacchiani, V.; Salazar-González, J.J. Optimal solutions to a real-world integrated airline scheduling problem. Transp. Sci. 2017, 51, 250–268. [Google Scholar] [CrossRef]
Salazar-González, J.J. Approaches to solve the fleet-assignment, aircraft-routing, crew-pairing and crew-rostering problems of a regional carrier. Omega 2014, 43, 71–82. [Google Scholar] [CrossRef]
Desaulniers, G.; Lessard, F.; Saddoune, M.; Soumis, F. Dynamic Constraint Aggregation for solving very large-scale airline crew pairing problems. SN Oper. Res. Forum. 2020, 1, 1–23. [Google Scholar] [CrossRef]
Gamache, M.; Soumis, F.; Marquis, G.; Desrosiers, J. A column generation approach for large-scale aircrew rostering problems. Oper Res. 1999, 47, 247–263. [Google Scholar] [CrossRef]
Zhou, L.; Liang, Z.; Chou, C.A.; Chaovalitwongse, W.A. Airline planning and scheduling: Models and solution methodologies. FEM 2020, 7, 1–26. [Google Scholar] [CrossRef]
McCarver, M. Airline Crew Scheduling Problem. In Proceedings of the National Conference On Undergraduate Research (NCUR) 2019, Kennesaw, GE, USA, 11–13 April 2019; pp. 801–809. [Google Scholar]
Yan, S.; Tu, Y.P. A network model for airline cabin crew scheduling. Eur. J. Oper. Res. 2002, 140, 531–540. [Google Scholar] [CrossRef]
Desaulniers, G.; Desrosiers, J.; Solomon, M.M. Accelerating strategies in column generation methods for vehicle routing and crew scheduling problems. In Essays and Surveys in Metaheuristics; Desaulniers, G., Ed.; Springer: Boston, MA, USA, 2002; pp. 309–324. [Google Scholar]
Saemi, S.; Komijan, A.R.; Tavakkoli-Moghaddam, R.; Fallah, M. A new mathematical model to cover crew pairing and rostering problems simultaneously. J. Eng. Res. 2021, 9, 2. [Google Scholar] [CrossRef]
Guo, Y.; Mellouli, T.; Suhl, L.; Thiel, M.P. A partially integrated airline crew scheduling approach with time-dependent crew capacities and multiple home bases. Eur. J. Oper. Res. 2006, 171, 1169–1181. [Google Scholar] [CrossRef]
Shiau, J.Y.; Huang, M.K.; Huang, C.Y. A hybrid personnel scheduling model for staff rostering problems. Mathematics 2020, 8, 1702. [Google Scholar] [CrossRef]
Parmentier, A.; Meunier, F. Aircraft routing and crew pairing: Updated algorithms at Air France. Omega 2020, 93, 102073. [Google Scholar] [CrossRef] [Green Version]
Saddoune, M.; Desaulniers, G.; Elhallaoui, I.; Soumis, F. Integrated airline crew pairing and crew assignment by dynamic constraint aggregation. Transp. Sci. 2012, 46, 39–55. [Google Scholar] [CrossRef]
Papadakos, N. Integrated airline scheduling. Comput. Oper. Res. 2009, 36, 176–195. [Google Scholar] [CrossRef]
Wu, S.Y. Research and Implementation on Optimal Algorithm for Automatic Ring Setting in Field of Airlines Flight Mission. Master’s Thesis, Fudan University, Shanghai, China, 2014. [Google Scholar]
Haouari, M.; Zeghal Mansour, F.; Sherali, H.D. A new compact formulation for the daily crew pairing problem, Transportation Science. Transp. Sci. 2019, 53, 811–828. [Google Scholar]
Barnhart, C.; Hatay, L.; Johnson, E.L. Deadhead selection for the long-haul crew pairing problem. Oper. Res. 1995, 43, 491–499. [Google Scholar] [CrossRef]
Zeren, B.; Özkol, I. A novel column generation strategy for large scale airline crew pairing problems. Expert Syst. Appl. 2016, 55, 133–134. [Google Scholar] [CrossRef]
Saddoune, M.; Desaulniers, G.; Soumis, F. A rolling horizon solution approach for the airline crew pairing problem. In Proceedings of the International Conference on Computers and Industrial Engineering, Troyes, France, 6–9 July 2009; pp. 344–347. [Google Scholar]
Levine, D. Application of a hybrid genetic algorithm to airline crew scheduling. Comput. Oper. Res. 1996, 23, 547–558. [Google Scholar] [CrossRef]
Zhang, J.; Wu, G.Q.; Hu, X.; Li, S.; Hao, S. A Parallel Clustering Algorithm with MPI-MKmeans. J. Comput. 2013, 8, 10–17. [Google Scholar] [CrossRef] [Green Version]
Xiong, Q.; Zhang, X.; Wang, W.F.; Gu, Y. A parallel algorithm framework for feature extraction of EEG signals on MPI. Comput. Math. Methods Med. 2020, 2020, 9812019. [Google Scholar] [CrossRef]
Yaakoubi, Y.; Soumis, F.; Lacoste-Julien, S. Machine learning in airline crew pairing to construct initial clusters for dynamic constraint aggregation. EURO J. Transp. Logist. 2020, 9, 100020. [Google Scholar] [CrossRef]
Liu, X.P.; Chen, Y. Optimal Solution to Crew Scheduling Problem Based on SMT. Comput. Syst. Appl. 2021, 30, 279–287. [Google Scholar]
Feng, C.M.; Wang, R.T. Performance evaluation for airlines including the consideration of financial ratios. J. Air Transp. Manag. 2000, 6, 133–142. [Google Scholar] [CrossRef]
Feng, C.M.; Wang, R.T. Applying FMCDM to evaluate financial performance of domestic airlines in Taiwan. Expert Syst. Appl. 2008, 34, 1837–1845. [Google Scholar]

Figure 1. Population of genetic algorithm.

Figure 2. Iterative process of genetic algorithm.

Figure 3. Drift strategy.

Figure 4. Results of Dataset A in 10 cases.

Figure 5. The change in dataset A’s optimal value. (a) Traditional single-thread serial genetic algorithm without drift strategy. (b) Fourteen-thread parallel genetic algorithm with drift strategy.

Figure 6. Calculation time and optimization ratio of different thread with dataset A.

Figure 7. Results of Dataset B in 10 cases.

Figure 8. The change in Dataset B’s optimal value. (a) Traditional single-thread serial genetic algorithm without drift strategy. (b) Seven-thread parallel genetic algorithm with drift strategy.

Figure 9. Calculation time and optimization ratio of different thread counts with Dataset B.

Table 1. Variable and its explanation.

Variable	Explanation
$P^{k}$	the flight number sequence of pilot k performing fly and deadhead
$Q^{k}$	the flight number sequence of pilot k only performing deadhead
$P^{k} (m)$	the flight number at the mth position in $P^{k}$
$B^{k}$	the sequence of recording the position where each duty starts in $P^{k}$
$b_{r}^{k}$	the position of the rth duty starts in $P^{k}$
$D^{k}$	the sequence of recording the starting position of each pairing in $B^{k}$
$d_{s}^{k}$	the position of the sth pairing starts in $B^{k}$
${D T}^{n u m}$	departure time of the flight with flight number $n u m$
${A T}^{n u m}$	arrival time of the flight with flight number $n u m$
$M i n R T b A D$	minimum connection time between adjacent flights
$[a]$	select the date of a
$M i n C T$	minimum rest time between adjacent duties
$M a x D T$	maximum duty time
$M a x D F T$	maximum duty flight time
$M a x P T$	maximum pairing time
$M i n R T b A P$	minimum rest time between adjacent pairings
$M a x C D D$	maximum continuous duty days

Table 2. Dataset description.

Dataset	Base	Duration of Schedule	Flight	Crew Member	Airport
Dataset A	1	14 days	206	21	7
Dataset B	2	31 day	13,954	465	39

Table 3. Parameter value of the crew scheduling.

MinCT	MaxDFT	MaxDT	MinRTbAD	MaxPT	MinRTbAP	MaxCDD
40 min	600 min	720 min	660 min	14,400 min	2 day	4 day

Table 4. Parallel genetic algorithm, CPLEX and Gurobi.

Dataset	Solver	Time (Min)	Per. of Time Reduction	Num. of Flight Matched
Dataset A	$C P L E X$	147.26	85.82%	198
	$P G A_{1}$	103.57	79.84%	206
	$G u r o b i$	24.34	16.57%	206
	$P G A_{14}$	20.88	-	206
Dataset B	$C P L E X$	7200+	∞	-
	$P G A_{1}$	722.86	60.63%	13,480
	$G u r o b i$	524.63	45.75%	13,225
	$P G A_{7}$	284.59	-	13,496

Table 5. Results comparison.

Dataset	No. of Flight	No. of Airport	No. of Pilot	Per. of Flight Matched	Crew Utilization Ratio
$S_{1}$	1854	41	44	96.71%	40.75
$S_{2}$	5613	49	141	97.38%	38.77
$K_{1}$	1013	26	33	100%	30.69
$K_{2}$	7765	54	305	99.52%	25.45
Dataset A	206	7	8	100%	25.12 × 2
Dataset B	13,954	39	308	96.72%	43.82

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ouyang, W.; Zhu, X. Meta-Heuristic Solver with Parallel Genetic Algorithm Framework in Airline Crew Scheduling. Sustainability 2023, 15, 1506. https://doi.org/10.3390/su15021506

AMA Style

Ouyang W, Zhu X. Meta-Heuristic Solver with Parallel Genetic Algorithm Framework in Airline Crew Scheduling. Sustainability. 2023; 15(2):1506. https://doi.org/10.3390/su15021506

Chicago/Turabian Style

Ouyang, Weihao, and Xiaohong Zhu. 2023. "Meta-Heuristic Solver with Parallel Genetic Algorithm Framework in Airline Crew Scheduling" Sustainability 15, no. 2: 1506. https://doi.org/10.3390/su15021506

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Meta-Heuristic Solver with Parallel Genetic Algorithm Framework in Airline Crew Scheduling

Abstract

1. Introduction

2. Literature Review

2.1. Description of CPP and Research Results

2.2. Description of CRP and Research Results

2.3. The Integrated Crew Scheduling and Research Results

3. Airline Crew Scheduling

3.1. Model Assumptions

3.2. Problem Analysis

3.3. Model Description

4. The Proposed Airline Crew Scheduling with Parallel Genetic Algorithm

4.1. Data Preprocessing

4.2. Generate the Initial Randomly Generated Flight Sequence—Initial Population

4.3. Match the Crew and the Flight

4.4. Adjust the Randomly Generated Flight Sequence

4.4.1. Operation of Removing ‘0’

4.4.2. Handling of Crew Returning to Base

4.4.3. Pairing Adjustment

4.5. Construction of Parallel Genetic Algorithm

5. Experiments

5.1. Dataset Description

5.2. Experimental Results

5.2.1. Dataset A Result

5.2.2. Dataset B Result

5.3. Experiment Comparison

5.3.1. Comparison of Parallel Genetic Algorithm, CPLEX and Gurobi

5.3.2. Comparison with Current Research Results

6. Conclusions

7. Discussion

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI