Quantitative and Qualitative Analysis of Aircraft Round-Trip Times Using Phase Type Distributions

Chakravarthy, Srinivas R.

doi:10.3390/math12172795


Departments of Industrial and Manufacturing Engineering & Mathematics, Kettering University, Flint, MI 48504, USA

Mathematics 2024, 12(17), 2795; https://doi.org/10.3390/math12172795

Submission received: 4 July 2024 / Revised: 5 September 2024 / Accepted: 9 September 2024 / Published: 9 September 2024


Abstract:

One of the major issues facing commercial airlines is the time that it takes to board passengers. Further, most airlines wish to increase the number of trips that an aircraft can make between two or more cities. Thus, reducing the overall boarding times by a few minutes will have a significant impact on the number of trips made by an aircraft, as well as enabling improvements in key measures such as the median and 75th and 95th percentiles. Looking at such measures other than the mean is critical as it is well known that the mean can under- or overestimate the performance of any model. While there is considerable literature on the study of strategies to decrease boarding times, the same cannot be said about the study of the boarding time given a particular strategy for boarding. Thus, the focus of this paper is to study analytically (using suitable stochastic models) and numerically the impact of reducing the average time on the key measures to help the system to plan accordingly. This is achieved using a well-known probability distribution, namely the phase type distribution, to model various events involved in the boarding process. Illustrative numerical results show a reduction in the percentile values when the average boarding times are decreased. Understanding the percentiles of the boarding times, as opposed to relying only on the average boarding times, will help management to adopt a better boarding strategy that in turn will lead to an increase in the number of trips that an aircraft can make.

Keywords:

phase type distribution; passenger load factor; computational probability; transportation

MSC:

60J28; 90B06

1. Introduction

The motivation for this work arose from a recent article in the Wall Street Journal [1], as well as personal experience in traveling to many cities, both within the USA and abroad, over the years. In the article, which has the subtitle “Southwest Airlines is studying ways to squeeze more flights per plane—a big focus is on passenger boarding bottlenecks”, there is a quote from the airline’s COO that reads, “If you can collect up enough of these minutes in each turn, then you can start to squeeze out some more flying”. Further, the article mentions that boarding times are key to enabling more trips to be performed by the aircraft. This is probably due to the significant variations present in boarding as compared to other aspects, such as cleaning the aircraft before boarding the passengers and the flying time.

A number of articles and research works on this topic have mentioned the bottlenecks involved in the boarding process. For example, in [2], the author mentions that American Airlines uses a single aircraft to make about six or seven trips in a single day. Further, the author lists the activities that occur during the turnaround time, which is the time between the aircraft pulling into the gate with the passengers onboard and the aircraft taking off with a new set of passengers. According to [3], the most significant delays could occur in the boarding process, and the article points out that the boarding time has increased by more than 30% since the 1970s. Moreover, the distribution of the boarding time tends to have a longer tail due to events such as late arrivals of passengers, passengers needing assistance, and a few last-minute occurrences. Thus, a clear understanding of the tail probabilities, namely percentiles, will help the system manager to provide the needed resources.

A number of publications (see, e.g., [4,5,6,7,8,9,10,11,12,13,14]) discuss boarding strategies, such as front-to-back, back-to-front, WILMA (window seats board first, followed by middles and aisles), outside-in, reverse pyramid, first-come-first-served, random, or other methods in which boarding is performed by calling each passenger group individually, such as the Steffen method [12] and its variations. Such strategies may provide insights into how to speed up the boarding process. We refer the reader to [11] for the different boarding methods adopted by a few airlines. In recent times, most airlines have adopted a strategy that ensures that their most loyal customers are treated better than others. Loyal customers are the ones who travel frequently and thus accumulate significant award miles to achieve premier status, such as silver, gold, or platinum (the category name depends on the airline). Whatever strategy an airline adopts, there is always randomness involved in the actual boarding process.

Thus, airlines can reduce the time involved in carrying passengers from one city to another by considering various boarding strategies, as well as the boarding times. While there is research (as pointed out earlier) on boarding strategies, to our knowledge, there is no literature on the use of stochastic models to study the effect of a reduction in the average boarding time on the overall performance of the boarding process. The study of such stochastic models is very timely in view of the remarks by the Southwest COO quoted in the recent Wall Street Journal article [1]. By building such stochastic models, in this paper, we try to answer questions such as “by how much will a reduction in the boarding time increase the average number of trips made by an aircraft?”. With such quantitative descriptions of the reduction in the boarding times’ percentiles, management can adopt an appropriate boarding strategy to arrive at a reduction in the average boarding time.

Hence, our aim in this paper is to model the de-boarding–cleaning–boarding–flying sequence, from point A to point B to point A (for circular travel, as seen in many airlines in both local and international flights), using phase type (PH) distributions and show the impact of eliminating one minute or more from the average boarding time on key measures involving percentiles. It also seeks to quantify the guaranteed boarding time with a certain level of confidence. We also point out how such modeling can be generalized to include a travel path consisting of more than two cities, which does not always need to be circular. Overall, airline companies are interested in scheduling their flights to maximize the number of trips that an aircraft can make (under ideal conditions), and having a stochastic model to analyze the time will contribute to such planning.

PH-distributions were introduced by Neuts [15] and have been extensively studied in the literature (see, e.g., [16,17,18,19,20]). Recall that a PH-distribution is obtained as the time until absorption in a finite-state Markov chain with one absorbing state. In other words, consider a continuous-time Markov chain (CTMC) with m transient states and one absorbing state, with the generator $\tilde{Q}$ of the form

\tilde{Q} = \begin{pmatrix} D & d^{0} \\ 0 & 0 \end{pmatrix},

(1)

where D of dimension m governs the transitions among the transient states and the column vector $d^{0}$ governs the rates of absorption into the absorbing state. Because every row of the generator of a CTMC sums to zero, the row sums of D and the vector $d^{0}$ add up to the zero vector. Suppose that the initial probability vector of this CTMC is taken as $α$ of dimension m. If X denotes the time until absorption in the CTMC starting in one of the m transient states, then the probability distribution of X is said to be a PH-distribution with representation $(α, D)$ of dimension m. We denote this by writing $X \sim PH (α, D)$ of dimension m.

The use of PH-distributions in stochastic modeling has been amply demonstrated in numerous publications since the seminal paper by Neuts. Very briefly, PH-distributions are obtained as the time until absorption in a finite Markov chain with one absorbing state. These distributions are defined in both discrete and continuous time, and, here, our focus is on the continuous-time version. To completely describe a PH-distribution, one needs an initial probability vector and a finite-dimensional matrix that governs the transitions among the transient states. For more details, including properties, examples, and computational aspects, we refer the reader to the above-mentioned references. In particular, we refer to the recent book by Chakravarthy [18] for detailed descriptions with a number of illustrative examples.

This paper is organized as follows. In Section 2, the basic model under study is described. The analysis of the model, both in steady state and in the transient regime, is presented in Section 3. Illustrative numerical examples of the basic model are discussed in Section 4 and concluding remarks are presented in Section 5.

2. Model Description

In this paper, we study boarding times by looking at only two cities, say $C_{1}$ and $C_{2}$, such that the same aircraft with a capacity to carry N passengers will shuttle back and forth between these two cities. If one is interested in extending this to include multiple-city trips, such as from city $C_{1}$ to city $C_{2}$ and to city $C_{3}$, or to reflect circular paths such as $C_{1}$ to city $C_{2}$ to city $C_{3}$ to city $C_{1}$, the model studied here can easily be generalized but with more states to describe the system, and the details are left to the reader.

Generally speaking, the process involved in flying a commercial aircraft is as follows. After landing in a city, the aircraft pulls into the gate and the passengers de-board. The cleaning of the aircraft occurs before a new set of passengers boards the plane. Once the gate closes, the flight is ready to take off, and, after landing in a new city, the process continues. It should be pointed out that while the de-boarding, cleaning, and boarding times depend on the number of passengers, the time from the gate closing to landing in another city does not depend on the number of passengers.

We assume that the vector of the probability mass function (PMF) of the number of passengers boarding in city $C_{1}$ is given by $p_{1} = (p_{1, 1}, \dots, p_{1, N})$ and that of the one in city $C_{2}$ is $p_{2} = (p_{2, 1}, \dots, p_{2, N})$. Thus, with probability $p_{1, j}$, the aircraft with a capacity N will leave city $C_{1}$ with j passengers onboard. Similarly, with probability $p_{2, j}$, the same aircraft will leave city $C_{2}$ with j passengers onboard. In this paper, we place no restriction on the nature of these two PMFs. They can be arbitrary (e.g., binomial, truncated geometric, or truncated Poisson) with support on the set $\{1, 2, \dots, N\}$.

We use PH-distributions to model the eight sets of random variables defined below. Generally, the de-boarding, cleaning, and boarding times depend on the number of passengers on the plane; hence, we model these dependencies by letting the underlying PH-distribution include an additional set of rate parameters. Our analysis here can be modified to model the dependencies under a more general setup using different PH-distributions. However, this will increase the number of input distributions significantly. To this end, we define the following (column) vectors of rates.

θ_{r} = (\begin{matrix} θ_{r, 1} \\ ⋮ \\ θ_{r, N} \end{matrix}), 1 \leq r \leq 6 .

(2)

Once again, we place no restriction on the nature of the rate vectors, $θ_{r}, 1 \leq r \leq 6,$ for our modeling purposes. The following sets of random variables are needed to study the model.

Define the random variables, $X_{1} (j), 1 \leq j \leq N$ , for de-boarding in city $C_{1}$ and assume that $X_{1} (j) \sim P H (β_{1}, θ_{1, j} S_{1})$ of dimension $m_{1}$ .
Define the random variables, $X_{2} (j), 1 \leq j \leq N$ , for the cleaning of the aircraft while in city $C_{1}$ and assume that $X_{2} (j) \sim P H (β_{2}, θ_{2, j} S_{2})$ of dimension $m_{2}$ .
Define the random variables, $X_{3} (j), 1 \leq j \leq N$ , for boarding in city $C_{1}$ and assume that $X_{3} (j) \sim P H (β_{3}, θ_{3, j} S_{3})$ of dimension $m_{3}$ .
Define the random variable, $X_{4}$ , to represent the time required for the aircraft to leave the gate in city $C_{1}$ and arrive at the gate for de-boarding in city $C_{2}$ . Let $X_{4} \sim P H (β_{4}, S_{4})$ of dimension $m_{4}$ .
Define the random variables, $X_{5} (j), 1 \leq j \leq N$ , for de-boarding in city $C_{2}$ and assume that $X_{5} (j) \sim P H (β_{5}, θ_{4, j} S_{5})$ of dimension $m_{5}$ .
Define the random variables, $X_{6} (j), 1 \leq j \leq N$ , for the cleaning of the aircraft while in city $C_{2}$ and assume that $X_{6} (j) \sim P H (β_{6}, θ_{5, j} S_{6})$ of dimension $m_{6}$ .
Define the random variables, $X_{7} (j), 1 \leq j \leq N$ , for boarding in city $C_{2}$ and assume that $X_{7} (j) \sim P H (β_{7}, θ_{6, j} S_{7})$ of dimension $m_{7}$ .
Define the random variable, $X_{8}$ , to represent the time taken for the aircraft to leave the gate in city $C_{2}$ and arrive at the gate for de-boarding in city $C_{1}$ . Let $X_{8} \sim P H (β_{8}, S_{8})$ of dimension $m_{8}$ .

In the following, we need the terms below.

$e$ is a column vector of 1s with appropriate dimensions, which should be clear from the context. Where clarity is needed, the dimensions will be displayed.
I is an identity matrix of appropriate dimensions. Again, when clarity is needed, the dimensions will be displayed.
Suppose that a is a vector such that $a = (a_{1}, \dots, a_{n})$ . Then, $Δ (a)$ denotes a diagonal matrix of dimension n whose ith diagonal element is given by $a_{i}$ . The inverse, when it exists, of this diagonal matrix will be denoted as $Δ^{- 1} (a)$ . In other words, $Δ^{- 1} (a) = {[Δ (a)]}^{- 1}$ .
The symbols ⊗ and ⊕, respectively, define the Kronecker product and sum of matrices. A few key works on these can be found in [21,22,23].
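
The notation above can be illustrated with a minimal sketch in plain Python. The helper names (`diag`, `identity`, `kron`, `kron_sum`) and the numerical values are hypothetical and chosen only for illustration; the block $Δ (θ) \otimes S$ has the same form as the diagonal blocks that appear later in the generator.

```python
def diag(a):
    """Delta(a): diagonal matrix whose i-th diagonal element is a[i]."""
    n = len(a)
    return [[a[i] if i == j else 0.0 for j in range(n)] for i in range(n)]

def identity(n):
    """Identity matrix of dimension n."""
    return diag([1.0] * n)

def kron(A, B):
    """Kronecker product of two matrices given as lists of rows."""
    return [[a * b for a in row_a for b in row_b] for row_a in A for row_b in B]

def kron_sum(A, B):
    """Kronecker sum of square matrices: kron(A, I) + kron(I, B)."""
    left = kron(A, identity(len(B)))
    right = kron(identity(len(A)), B)
    return [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(left, right)]

# Hypothetical data: two rate multipliers and an Erlang-2 phase matrix.
theta = [2.0, 3.0]
S = [[-1.0, 1.0], [0.0, -1.0]]
block = kron(diag(theta), S)  # one scaled copy of S per passenger count
```

Here `block` is block diagonal, with `2.0 * S` and `3.0 * S` on its diagonal, which is how a passenger-count-dependent rate scales a common phase structure.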

We define a column vector $S_{i}^{0}$ of dimension $m_{i}$ to be such that

S_{i} e + S_{i}^{0} = 0, 1 \leq i \leq 8 .

(3)

For use in this work, we define the mean ($μ_{i}^{'}$), the variance ($σ_{i}^{2}$), and the invariant vector ($δ_{i}$) of the PH-renewal process, namely $S_{i} + S_{i}^{0} β_{i}$, associated with the PH-distribution $PH (β_{i}, S_{i})$. These quantities are as given below (see, e.g., [18,20]).

μ_{i}^{'} = β_{i} {(- S_{i})}^{- 1} e, σ_{i}^{2} = 2 β_{i} {(- S_{i})}^{- 2} e - {(μ_{i}^{'})}^{2}, δ_{i} = \frac{1}{μ_{i}^{'}} β_{i} {(- S_{i})}^{- 1}, 1 \leq i \leq 8 .

(4)
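
As a hedged illustration of Equation (4), the sketch below evaluates the mean, the variance, and the invariant vector for a small PH representation in plain Python. The linear solver `solve`, the function `ph_moments`, and the Erlang-2 example with rate 2 are assumptions made for illustration and are not taken from the paper.

```python
def solve(M, rhs):
    """Solve M x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [list(M[i]) + [rhs[i]] for i in range(n)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[piv] = aug[piv], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= f * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = aug[i][n] - sum(aug[i][c] * x[c] for c in range(i + 1, n))
        x[i] = s / aug[i][i]
    return x

def ph_moments(beta, S):
    """Return (mean, variance, invariant vector) of PH(beta, S) as in Equation (4)."""
    m = len(beta)
    neg_s = [[-v for v in row] for row in S]
    x1 = solve(neg_s, [1.0] * m)            # (-S)^{-1} e
    x2 = solve(neg_s, x1)                   # (-S)^{-2} e
    mean = sum(b * v for b, v in zip(beta, x1))
    second = 2.0 * sum(b * v for b, v in zip(beta, x2))
    neg_s_t = [list(col) for col in zip(*neg_s)]
    y = solve(neg_s_t, list(beta))          # row vector beta (-S)^{-1}
    delta = [v / mean for v in y]
    return mean, second - mean ** 2, delta

# Hypothetical example: Erlang distribution with 2 phases, each of rate 2.
mean, var, delta = ph_moments([1.0, 0.0], [[-2.0, 2.0], [0.0, -2.0]])
```

For this Erlang-2 example, the closed forms are a mean of 1 and a variance of 0.5, which gives a simple check on the numerical output; the entries of `delta` sum to 1, as expected of an invariant probability vector.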

3. Analysis of the Model

In this section, we analyze the model both in the time-dependent (transient) regime and in steady state. The transient analysis will be focused on the boarding event. The reason that we perform the steady-state analysis is that some airlines operate regular flights from city to city (due to constant demand in these sectors) and hence they would be interested in determining how many trips can be made in the long run. With regard to boarding, the transient analysis (i.e., time-dependent study) will shed light on the performance of the boarding process by looking at key measures such as mean, median, and some selected percentiles. Although we focus mainly on the boarding time and the total time for the transient analysis, it is easy to consider other events involved in this process.

In order to study the model using the continuous-time Markov chain (CTMC), we need to keep track of the state of the system under study. Before we display the state space of the CTMC, we first define a few terms. We define the sets of states, labeled $1, 2, 3, 4, 5, 6, 7,$ and $8$, as

\begin{matrix} 1 = \{({\hat{j}}_{C_{1}}, k) : 1 \leq j \leq N, 1 \leq k \leq m_{1}\}, 2 = \{({\bar{j}}_{C_{1}}, k) : 1 \leq j \leq N, 1 \leq k \leq m_{2}\}, \\ 3 = \{({\tilde{j}}_{C_{1}}, k) : 1 \leq j \leq N, 1 \leq k \leq m_{3}\}, 4 = \{(j_{C_{1}}, k) : 1 \leq j \leq N, 1 \leq k \leq m_{4}\}, \\ 5 = \{({\hat{j}}_{C_{2}}, k) : 1 \leq j \leq N, 1 \leq k \leq m_{5}\}, 6 = \{({\bar{j}}_{C_{2}}, k) : 1 \leq j \leq N, 1 \leq k \leq m_{6}\}, \\ 7 = \{({\tilde{j}}_{C_{2}}, k) : 1 \leq j \leq N, 1 \leq k \leq m_{7}\}, 8 = \{(j_{C_{2}}, k) : 1 \leq j \leq N, 1 \leq k \leq m_{8}\}, \end{matrix}

where, for $1 \leq j \leq N$,

$({\hat{j}}_{C_{1}}, k)$ corresponds to the de-boarding process in city $C_{1}$ (after the aircraft has landed with j passengers onboard) and the phase is k;
$({\bar{j}}_{C_{1}}, k)$ corresponds to the aircraft (which originally had j passengers onboard) being cleaned while in city $C_{1}$ and the phase is in k;
$({\tilde{j}}_{C_{1}}, k)$ corresponds to the boarding process in city $C_{1}$ when the aircraft is to leave with j passengers onboard and the phase is in k;
$(j_{C_{1}}, k)$ corresponds to the state in which the aircraft is on its way to city $C_{2}$ and the phase is in k; note that we need to keep track of the number of passengers onboard even though the traveling time is not dependent on the number onboard since the de-boarding time in city $C_{2}$ depends on this number;
The other states, $({\hat{j}}_{C_{2}}, k), ({\bar{j}}_{C_{2}}, k), ({\tilde{j}}_{C_{2}}, k),$ and $(j_{C_{2}}, k)$ , are similarly defined by replacing city $C_{1}$ with $C_{2}$ and vice versa in the above definitions.

Noting that the system, at any given time, can only be in one of eight sets of states (corresponding to the state of the aircraft, such as de-boarding/cleaning/boarding/leaving city $C_{1}$ or de-boarding/cleaning/boarding/leaving city $C_{2}$), the state space, $Ω$, is given by

Ω = \{1, 2, 3, 4, 5, 6, 7, 8\} .

(5)

The generator, Q, of the CTMC, with its block rows and block columns ordered as the sets of states $1, 2, \dots, 8$, is given by

Q = \begin{pmatrix} A_{1} & A_{1, 1} & & & & & & \\ & A_{2} & A_{2, 1} & & & & & \\ & & A_{3} & A_{3, 1} & & & & \\ & & & A_{4} & A_{4, 1} & & & \\ & & & & A_{5} & A_{5, 1} & & \\ & & & & & A_{6} & A_{6, 1} & \\ & & & & & & A_{7} & A_{7, 1} \\ A_{8, 1} & & & & & & & A_{8} \end{pmatrix},

(6)

where the entries of Q are as given below:

\begin{matrix} A_{1} = Δ (θ_{1}) \otimes S_{1}, A_{1, 1} = Δ (θ_{1}) \otimes S_{1}^{0} β_{2}, A_{2} = Δ (θ_{2}) \otimes S_{2}, A_{2, 1} = θ_{2} p_{1} \otimes S_{2}^{0} β_{3}, \\ A_{3} = Δ (θ_{3}) \otimes S_{3}, A_{3, 1} = Δ (θ_{3}) \otimes S_{3}^{0} β_{4}, A_{4} = I \otimes S_{4}, A_{4, 1} = I \otimes S_{4}^{0} β_{5}, \\ A_{5} = Δ (θ_{4}) \otimes S_{5}, A_{5, 1} = Δ (θ_{4}) \otimes S_{5}^{0} β_{6}, A_{6} = Δ (θ_{5}) \otimes S_{6}, A_{6, 1} = θ_{5} p_{2} \otimes S_{6}^{0} β_{7}, \\ A_{7} = Δ (θ_{6}) \otimes S_{7}, A_{7, 1} = Δ (θ_{6}) \otimes S_{7}^{0} β_{8}, A_{8} = I \otimes S_{8}, A_{8, 1} = I \otimes S_{8}^{0} β_{1} . \end{matrix}

(7)
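
As a quick consistency check on Equation (7), the following sketch verifies that the second block row of Q sums to zero, as required of a generator. It uses only the mixed-product property $(A \otimes B)(x \otimes y) = A x \otimes B y$, the identities $Δ (θ_{2}) e = θ_{2}$ and $p_{1} e = β_{3} e = 1$, and Equation (3). The subscripted vectors $e_{N}$, $e_{m_{2}}$, and $e_{m_{3}}$ are introduced here only to make the dimensions explicit.

```latex
\begin{aligned}
A_{2} e + A_{2, 1} e
  &= (Δ (θ_{2}) \otimes S_{2}) (e_{N} \otimes e_{m_{2}})
   + (θ_{2} p_{1} \otimes S_{2}^{0} β_{3}) (e_{N} \otimes e_{m_{3}}) \\
  &= θ_{2} \otimes S_{2} e_{m_{2}}
   + θ_{2} (p_{1} e_{N}) \otimes S_{2}^{0} (β_{3} e_{m_{3}}) \\
  &= θ_{2} \otimes (S_{2} e_{m_{2}} + S_{2}^{0}) = 0 .
\end{aligned}
```

The remaining block rows can be checked in the same way; for the rows with $I$ or $Δ (θ_{r})$ in both blocks, the argument is identical with $θ_{2} p_{1}$ replaced by the corresponding diagonal matrix.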

3.1. Steady-State Analysis

Let $u = (u_{1}, \dots, u_{8})$ denote the steady-state probability vector of Q. In particular, u satisfies

u Q = 0, u e = 1 .

(8)

For later use, we further partition $u_{i}$, for $1 \leq i \leq 8$, as $u_{i} = (u_{i, 1}, \dots, u_{i, N})$. Note that the vector $u_{i, j}$ of dimension $m_{i}$, for $1 \leq j \leq N$, gives the steady-state probabilities that the underlying process is in state $(i, j)$, one entry for each phase. The following theorem gives an explicit expression for the vector u.

Theorem 1.

The vector u is explicitly given by

\begin{matrix} u_{1} = d μ_{1}^{'} (p_{2} Δ^{- 1} (θ_{1}) \otimes δ_{1}), u_{2} = d μ_{2}^{'} (p_{2} Δ^{- 1} (θ_{2}) \otimes δ_{2}), u_{3} = d μ_{3}^{'} (p_{1} Δ^{- 1} (θ_{3}) \otimes δ_{3}), \\ u_{4} = d μ_{4}^{'} (p_{1} \otimes δ_{4}), u_{5} = d μ_{5}^{'} (p_{1} Δ^{- 1} (θ_{4}) \otimes δ_{5}), u_{6} = d μ_{6}^{'} (p_{1} Δ^{- 1} (θ_{5}) \otimes δ_{6}), \\ u_{7} = d μ_{7}^{'} (p_{2} Δ^{- 1} (θ_{6}) \otimes δ_{7}), u_{8} = d μ_{8}^{'} (p_{2} \otimes δ_{8}), \end{matrix}

(9)

where the invariant vectors $δ_{i}, 1 \leq i \leq 8,$ are as given in Equation (4) and the constant d is given by

\begin{matrix} d = [μ_{1}^{'} p_{2} Δ^{- 1} (θ_{1}) e + μ_{2}^{'} p_{2} Δ^{- 1} (θ_{2}) e + μ_{3}^{'} p_{1} Δ^{- 1} (θ_{3}) e + μ_{4}^{'} \\ + μ_{5}^{'} p_{1} Δ^{- 1} (θ_{4}) e + μ_{6}^{'} p_{1} Δ^{- 1} (θ_{5}) e + μ_{7}^{'} p_{2} Δ^{- 1} (θ_{6}) e + μ_{8}^{'}]^{- 1} . \end{matrix}

(10)

Proof.

Using the properties of the Kronecker product (see, e.g., [18,21,23]) and the invariant vectors given in Equation (4), the steady-state equations given in Equation (8) can be written as

\begin{matrix} u_{1} = μ_{1}^{'} u_{8} (Δ^{- 1} (θ_{1}) \otimes S_{8}^{0} δ_{1}), u_{2} = μ_{2}^{'} u_{1} (Δ (θ_{1}) Δ^{- 1} (θ_{2}) \otimes S_{1}^{0} δ_{2}), \\ u_{3} = μ_{3}^{'} u_{2} (θ_{2} p_{1} Δ^{- 1} (θ_{3}) \otimes S_{2}^{0} δ_{3}), u_{4} = μ_{4}^{'} u_{3} (Δ (θ_{3}) \otimes S_{3}^{0} δ_{4}), \\ u_{5} = μ_{5}^{'} u_{4} (Δ^{- 1} (θ_{4}) \otimes S_{4}^{0} δ_{5}), u_{6} = μ_{6}^{'} u_{5} (Δ (θ_{4}) Δ^{- 1} (θ_{5}) \otimes S_{5}^{0} δ_{6}), \\ u_{7} = μ_{7}^{'} u_{6} (θ_{5} p_{2} Δ^{- 1} (θ_{6}) \otimes S_{6}^{0} δ_{7}), u_{8} = μ_{8}^{'} u_{7} (Δ (θ_{6}) \otimes S_{7}^{0} δ_{8}), \sum_{i = 1}^{8} u_{i} e = 1 . \end{matrix}

(11)

From the equations given in (11), it is easy to verify, for $1 \leq j \leq N$, that

\begin{matrix} u_{1, j} = \frac{μ_{1}^{'}}{θ_{1, j}} u_{8, j} S_{8}^{0} δ_{1}, u_{2, j} = \frac{μ_{2}^{'} θ_{1, j}}{θ_{2, j}} u_{1, j} S_{1}^{0} δ_{2}, u_{3, j} = \frac{μ_{3}^{'} p_{1, j}}{θ_{3, j}} \sum_{k = 1}^{N} θ_{2, k} u_{2, k} S_{2}^{0} δ_{3}, \\ u_{4, j} = μ_{4}^{'} θ_{3, j} u_{3, j} S_{3}^{0} δ_{4}, u_{5, j} = \frac{μ_{5}^{'}}{θ_{4, j}} u_{4, j} S_{4}^{0} δ_{5}, u_{6, j} = \frac{μ_{6}^{'} θ_{4, j}}{θ_{5, j}} u_{5, j} S_{5}^{0} δ_{6}, \\ u_{7, j} = \frac{μ_{7}^{'} p_{2, j}}{θ_{6, j}} \sum_{k = 1}^{N} θ_{5, k} u_{6, k} S_{6}^{0} δ_{7}, u_{8, j} = μ_{8}^{'} θ_{6, j} u_{7, j} S_{7}^{0} δ_{8}, \end{matrix}

(12)

from which we obtain

\begin{matrix} θ_{1, j} u_{1, j} S_{1}^{0} = θ_{2, j} u_{2, j} S_{2}^{0} = u_{8, j} S_{8}^{0}, \\ θ_{3, j} u_{3, j} S_{3}^{0} = u_{4, j} S_{4}^{0} = θ_{4, j} u_{5, j} S_{5}^{0} = θ_{5, j} u_{6, j} S_{6}^{0} = p_{1, j} \sum_{k = 1}^{N} u_{8, k} S_{8}^{0}, \\ θ_{6, j} u_{7, j} S_{7}^{0} = u_{8, j} S_{8}^{0} = p_{2, j} \sum_{k = 1}^{N} u_{8, k} S_{8}^{0} . \end{matrix}

(13)

The stated result follows from Equations (12) and (13) and the normalizing condition given in (11). □
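
To make the normalizing constant in Equation (10) concrete, here is a minimal sketch that evaluates d from the stage means and the passenger-count PMFs, using the fact that $p Δ^{- 1} (θ) e = \sum_{j} p_{j} / θ_{j}$. The function names and all numerical inputs (means in minutes, two possible passenger counts) are hypothetical and serve only as an illustration.

```python
def stage_mean(mu, p, theta):
    """mu * p * Delta^{-1}(theta) * e, i.e. mu * sum_j p_j / theta_j."""
    return mu * sum(pj / tj for pj, tj in zip(p, theta))

def normalizing_constant(mu, p1, p2, theta):
    """Return (d, terms) for Equation (10).

    mu    : the eight means mu'_1, ..., mu'_8
    p1,p2 : PMFs of the passenger counts leaving cities C1 and C2
    theta : the six rate vectors theta_1, ..., theta_6
    """
    terms = [
        stage_mean(mu[0], p2, theta[0]),  # de-boarding in C1
        stage_mean(mu[1], p2, theta[1]),  # cleaning in C1
        stage_mean(mu[2], p1, theta[2]),  # boarding in C1
        mu[3],                            # gate in C1 to gate in C2
        stage_mean(mu[4], p1, theta[3]),  # de-boarding in C2
        stage_mean(mu[5], p1, theta[4]),  # cleaning in C2
        stage_mean(mu[6], p2, theta[5]),  # boarding in C2
        mu[7],                            # gate in C2 to gate in C1
    ]
    return 1.0 / sum(terms), terms

# Hypothetical inputs (minutes); a smaller theta means a slower stage.
mu = [10.0, 15.0, 25.0, 90.0, 10.0, 15.0, 25.0, 90.0]
p1 = [0.4, 0.6]
p2 = [0.5, 0.5]
theta = [[1.2, 0.9]] * 6
d, terms = normalizing_constant(mu, p1, p2, theta)
mean_round_trip = 1.0 / d
```

The reciprocal of d is the sum of the eight bracketed terms in Equation (10), so lowering the mean boarding time lowers `mean_round_trip` directly, which is the lever discussed in the Introduction.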

Suppose that $W_{1}, W_{2}, W_{3},$ and $W_{4}$, respectively, denote the times spent in de-boarding the plane, cleaning the plane, boarding the plane, and leaving city $C_{1}$ given that the aircraft starts in this particular mode. In other words, $W_{1}$ is the random variable keeping track of the time that it takes to de-board the plane in city $C_{1}$ given that de-boarding has started. Similarly, let $W_{5}, W_{6}, W_{7},$ and $W_{8}$, respectively, denote the times spent in de-boarding the plane, cleaning the plane, boarding the plane, and leaving city $C_{2}$. Then, the following result shows that these random variables are all of the PH type.

Theorem 2.

The random variables, $W_{i}, 1 \leq i \leq 8,$ follow PH-distributions with representations given as

\begin{matrix} W_{1} \sim (p_{2} \otimes β_{1}, A_{1}), W_{2} \sim (p_{2} \otimes β_{2}, A_{2}), W_{3} \sim (p_{1} \otimes β_{3}, A_{3}), W_{4} \sim (β_{4}, S_{4}), \\ W_{5} \sim (p_{1} \otimes β_{5}, A_{5}), W_{6} \sim (p_{1} \otimes β_{6}, A_{6}), W_{7} \sim (p_{2} \otimes β_{7}, A_{7}), W_{8} \sim (β_{8}, S_{8}) . \end{matrix}

(14)

Proof.

Let $γ$ denote the conditional probability that de-boarding starts in city $C_{1}$. This conditional probability is obtained by looking at the sequence starting with the cleaning of the aircraft at city $C_{2}$, which is then followed by the boarding of the aircraft and then the plane leaving the city at an arbitrary time. It is easy to verify that

γ = d_{1} u_{6} {(- A_{6})}^{- 1} A_{6, 1} {(- A_{7})}^{- 1} A_{7, 1} {(- A_{8})}^{- 1} A_{8, 1} = d_{1} d μ_{6}^{'} (p_{1} Δ^{- 1} (θ_{5}) e) (p_{2} \otimes β_{1}),

(15)

where $d_{1}$ is the normalizing constant. Thus, the time spent in the de-boarding state in city $C_{1}$ is of the PH type with the representation given in Equation (14). In a similar manner, we obtain the other representations. It is worth pointing out that the smaller-dimension representations for $W_{4}$ and $W_{8}$ are due to the fact that the flying time does not depend on the number of passengers onboard the aircraft, unlike the other times. □

Corollary 1.

The mean ($μ_{W_{i}}^{'}$) and the variance ($σ_{W_{i}}^{2}$) of $W_{i}, 1 \leq i \leq 8$, are obtained explicitly as follows; the standard deviation is the square root of the variance. In the following, we use the notation $ξ_{i}$ for the second moment of $X_{i}, 1 \leq i \leq 8$. In other words,

ξ_{i} = 2 β_{i} {(- S_{i})}^{- 2} e, 1 \leq i \leq 8 .

\begin{matrix} μ_{W_{1}}^{'} = μ_{1}^{'} p_{2} Δ^{- 1} (θ_{1}) e, σ_{W_{1}}^{2} = ξ_{1} p_{2} Δ^{- 2} (θ_{1}) e - {(μ_{W_{1}}^{'})}^{2}, \\ μ_{W_{2}}^{'} = μ_{2}^{'} p_{2} Δ^{- 1} (θ_{2}) e, σ_{W_{2}}^{2} = ξ_{2} p_{2} Δ^{- 2} (θ_{2}) e - {(μ_{W_{2}}^{'})}^{2}, \\ μ_{W_{3}}^{'} = μ_{3}^{'} p_{1} Δ^{- 1} (θ_{3}) e, σ_{W_{3}}^{2} = ξ_{3} p_{1} Δ^{- 2} (θ_{3}) e - {(μ_{W_{3}}^{'})}^{2}, μ_{W_{4}}^{'} = μ_{4}^{'}, σ_{W_{4}}^{2} = σ_{4}^{2}, \\ μ_{W_{5}}^{'} = μ_{5}^{'} p_{1} Δ^{- 1} (θ_{4}) e, σ_{W_{5}}^{2} = ξ_{5} p_{1} Δ^{- 2} (θ_{4}) e - {(μ_{W_{5}}^{'})}^{2}, \\ μ_{W_{6}}^{'} = μ_{6}^{'} p_{1} Δ^{- 1} (θ_{5}) e, σ_{W_{6}}^{2} = ξ_{6} p_{1} Δ^{- 2} (θ_{5}) e - {(μ_{W_{6}}^{'})}^{2}, \\ μ_{W_{7}}^{'} = μ_{7}^{'} p_{2} Δ^{- 1} (θ_{6}) e, σ_{W_{7}}^{2} = ξ_{7} p_{2} Δ^{- 2} (θ_{6}) e - {(μ_{W_{7}}^{'})}^{2}, μ_{W_{8}}^{'} = μ_{8}^{'}, σ_{W_{8}}^{2} = σ_{8}^{2} . \end{matrix}

(16)

Note 1.

It is worth pointing out that the means of $W_{i}$ are related to the probabilities $u_{i}$, for $1 \leq i \leq 8,$ as $u_{i} e = d μ_{W_{i}}^{'}$, where d is as given in Equation (10). This follows from Equation (9) on noting that $δ_{i} e = 1$.

Suppose that T is the total time starting from city $C_{1}$ until returning to city $C_{1}$. Then, we have the following result.

Theorem 3.

The random variable, T, follows a PH-distribution with representation $((p_{2} \otimes β_{1}, 0), A)$ of order $m_{8} + N \sum_{i = 1}^{7} m_{i}$, where A is given by

A = \begin{pmatrix} A_{1} & A_{1, 1} & & & & & & \\ & A_{2} & A_{2, 1} & & & & & \\ & & A_{3} & A_{3, 1} & & & & \\ & & & A_{4} & A_{4, 1} & & & \\ & & & & A_{5} & A_{5, 1} & & \\ & & & & & A_{6} & A_{6, 1} & \\ & & & & & & A_{7} & θ_{6} \otimes S_{7}^{0} β_{8} \\ & & & & & & & S_{8} \end{pmatrix},

(17)

and the matrices appearing in A are as given in Equation (7). Further, the mean and the variance of T are given by

\begin{matrix} μ_{T}^{'} = \frac{1}{d}, \\ σ_{T}^{2} = μ_{1}^{'} ξ_{1} p_{2} Δ^{- 2} (θ_{1}) e + μ_{2}^{'} [ξ_{2} p_{2} Δ^{- 2} (θ_{2}) e + μ_{1}^{'} Δ^{- 1} (θ_{1}) Δ^{- 1} (θ_{2}) e] \\ + μ_{3}^{'} [ξ_{3} p_{1} Δ^{- 2} (θ_{3}) e + p_{1} Δ^{- 1} (θ_{3}) e (μ_{W_{1}}^{'} + μ_{W_{2}}^{'})] + μ_{4}^{'} [ξ_{4} + \sum_{r = 1}^{3} μ_{W_{r}}^{'}] \\ + μ_{5}^{'} [ξ_{5} p_{1} Δ^{- 2} (θ_{4}) e + μ_{4}^{'} p_{1} Δ^{- 1} (θ_{4}) e + μ_{3}^{'} Δ^{- 1} (θ_{3}) Δ^{- 1} (θ_{4}) e] \\ + μ_{5}^{'} (μ_{W_{1}}^{'} + μ_{W_{2}}^{'}) p_{1} Δ^{- 1} (θ_{4}) e + μ_{6}^{'} [ξ_{6} p_{1} Δ^{- 2} (θ_{5}) e + μ_{5}^{'} p_{1} Δ^{- 1} (θ_{4}) Δ^{- 1} (θ_{5}) e] \\ + μ_{6}^{'} [μ_{4}^{'} p_{1} Δ^{- 1} (θ_{5}) e + μ_{3}^{'} p_{1} Δ^{- 1} (θ_{3}) Δ^{- 1} (θ_{5}) e + p_{1} Δ^{- 1} (θ_{5}) e (μ_{W_{1}}^{'} + μ_{W_{2}}^{'})] \\ + μ_{7}^{'} [ξ_{7} p_{2} Δ^{- 2} (θ_{6}) e + p_{2} Δ^{- 1} (θ_{6}) e \sum_{i = 1}^{6} μ_{W_{i}}^{'}] + μ_{8}^{'} [ξ_{8} + \sum_{i = 1}^{7} μ_{W_{i}}^{'}] - {(\frac{1}{d})}^{2} . \end{matrix}

(18)

Proof.

First, we define two vectors, a and b, as

a = (p_{2} \otimes β_{1}, 0) {(- A)}^{- 1}, b = a {(- A)}^{- 1},

(19)

so that

μ_{T}^{'} = a e, σ_{T}^{2} = 2 b e - {(μ_{T}^{'})}^{2} .

(20)

It should be pointed out that while we can obtain $μ_{T}^{'}$ as $μ_{T}^{'} = \sum_{i = 1}^{8} μ_{W_{i}}^{'}$, one needs to exploit the structure of A to obtain $σ_{T}^{2}$. The latter is due to the fact that the variance of T cannot be obtained as the sum of the variances of $W_{i}, 1 \leq i \leq 8,$ because these random variables may be dependent. In order to obtain this variance, we need to obtain the vector b, which depends on a. Moreover, one can use the vector a as part of an internal accuracy check. Partitioning

a = (a_{1}, \dots, a_{8}), b = (b_{1}, \dots, b_{8}),

(21)

and rewriting the equation $a = (p_{2} \otimes β_{1}, 0) {(- A)}^{- 1}$ as

a (- A) = (p_{2} \otimes β_{1}, 0),

(22)

it is easy to verify, by exploiting the sparsity of the coefficient matrices, the following expressions for the vector a.

\begin{matrix} a_{1} = μ_{1}^{'} (p_{2} Δ^{- 1} (θ_{1}) \otimes δ_{1}), a_{2} = μ_{2}^{'} (p_{2} Δ^{- 1} (θ_{2}) \otimes δ_{2}), a_{3} = μ_{3}^{'} (p_{1} Δ^{- 1} (θ_{3}) \otimes δ_{3}), \\ a_{4} = μ_{4}^{'} (p_{1} \otimes δ_{4}), a_{5} = μ_{5}^{'} (p_{1} Δ^{- 1} (θ_{4}) \otimes δ_{5}), a_{6} = μ_{6}^{'} (p_{1} Δ^{- 1} (θ_{5}) \otimes δ_{6}), \\ a_{7} = μ_{7}^{'} (p_{2} Δ^{- 1} (θ_{6}) \otimes δ_{7}), a_{8} = μ_{8}^{'} δ_{8}, \end{matrix}

(23)

where the invariant vectors $δ_{i}, 1 \leq i \leq 8,$ are as given in Equation (4). Thus, we see

μ_{T}^{'} = \sum_{i = 1}^{8} a_{i} e,

(24)

and, upon using Equations (23) and (10), we obtain the stated result for $μ_{T}^{'}$.

Having obtained the expressions for a, we once again exploit the structure of the matrix A to obtain the expressions for b. It is easy to verify that, for $1 \leq j \leq N$, we have

\begin{matrix} b_{1, j} = \frac{1}{θ_{1, j}} a_{1, j} {(- S_{1})}^{- 1}, b_{2, j} = \frac{1}{θ_{2, j}} [a_{2, j} + a_{1, j} e β_{2}] {(- S_{2})}^{- 1}, \\ b_{3, j} = \frac{1}{θ_{3, j}} [a_{3, j} + p_{1, j} (a_{1} e + a_{2} e) β_{3}] {(- S_{3})}^{- 1}, \\ b_{4, j} = [a_{4, j} + (a_{3, j} e + p_{1, j} [a_{1} e + a_{2} e]) β_{4}] {(- S_{4})}^{- 1}, \\ b_{5, j} = \frac{1}{θ_{4, j}} [a_{5, j} + (a_{4, j} e + a_{3, j} e + p_{1, j} [a_{1} e + a_{2} e]) β_{5}] {(- S_{5})}^{- 1}, \\ b_{6, j} = \frac{1}{θ_{5, j}} [a_{6, j} + (a_{5, j} e + a_{4, j} e + a_{3, j} e + p_{1, j} [a_{1} e + a_{2} e]) β_{6}] {(- S_{6})}^{- 1}, \\ b_{7, j} = \frac{1}{θ_{6, j}} [a_{7, j} + p_{2, j} (\sum_{i = 1}^{6} a_{i} e) β_{7}] {(- S_{7})}^{- 1}, b_{8} = [a_{8} + (\sum_{i = 1}^{7} a_{i} e) β_{8}] {(- S_{8})}^{- 1}, \end{matrix}

(25)

from which the stated result follows immediately. □

Note 2.

Suppose that one is interested in looking at the time, V, that it takes to travel from city $C_{1}$ (from the instant that de-boarding starts) to city $C_{2}$ (reaching the gate). Similarly to Theorem 3, it is easy to see that V follows a PH-distribution with representation $((p_{2} \otimes β_{1}, 0), B)$ of order $m_{4} + N \sum_{i = 1}^{3} m_{i}$, where B is given by

B = \begin{pmatrix} A_{1} & A_{1, 1} & & \\ & A_{2} & A_{2, 1} & \\ & & A_{3} & (θ_{3} \otimes S_{3}^{0}) β_{4} \\ & & & S_{4} \end{pmatrix},

(26)

and the entries of B are as given in Equation (7). The mean and the standard deviation of V can be obtained similarly and the details are omitted.

3.2. Transient Analysis

In this section, we perform a transient analysis with the main focus on the boarding time. However, we will briefly outline the analysis of the other events for the sake of completeness.

From Theorem 2, it is easy to verify (see, e.g., [18,19,20]) that the probability density function (PDF), say $f_{i} (t)$, and the cumulative distribution function (CDF), say $F_{i} (t)$, for the random variable $W_{i}$, for $1 \leq i \leq 8,$ are given as

\begin{matrix} f_{1} (t) = \sum_{j = 1}^{N} p_{2, j} θ_{1, j} β_{1} e^{θ_{1, j} S_{1} t} S_{1}^{0}, F_{1} (t) = 1 - \sum_{j = 1}^{N} p_{2, j} β_{1} e^{θ_{1, j} S_{1} t} e, t \geq 0, \\ f_{2} (t) = \sum_{j = 1}^{N} p_{2, j} θ_{2, j} β_{2} e^{θ_{2, j} S_{2} t} S_{2}^{0}, F_{2} (t) = 1 - \sum_{j = 1}^{N} p_{2, j} β_{2} e^{θ_{2, j} S_{2} t} e, t \geq 0, \\ f_{3} (t) = \sum_{j = 1}^{N} p_{1, j} θ_{3, j} β_{3} e^{θ_{3, j} S_{3} t} S_{3}^{0}, F_{3} (t) = 1 - \sum_{j = 1}^{N} p_{1, j} β_{3} e^{θ_{3, j} S_{3} t} e, t \geq 0, \\ f_{4} (t) = β_{4} e^{S_{4} t} S_{4}^{0}, F_{4} (t) = 1 - β_{4} e^{S_{4} t} e, t \geq 0, \\ f_{5} (t) = \sum_{j = 1}^{N} p_{1, j} θ_{4, j} β_{5} e^{θ_{4, j} S_{5} t} S_{5}^{0}, F_{5} (t) = 1 - \sum_{j = 1}^{N} p_{1, j} β_{5} e^{θ_{4, j} S_{5} t} e, t \geq 0, \\ f_{6} (t) = \sum_{j = 1}^{N} p_{1, j} θ_{5, j} β_{6} e^{θ_{5, j} S_{6} t} S_{6}^{0}, F_{6} (t) = 1 - \sum_{j = 1}^{N} p_{1, j} β_{6} e^{θ_{5, j} S_{6} t} e, t \geq 0, \\ f_{7} (t) = \sum_{j = 1}^{N} p_{2, j} θ_{6, j} β_{7} e^{θ_{6, j} S_{7} t} S_{7}^{0}, F_{7} (t) = 1 - \sum_{j = 1}^{N} p_{2, j} β_{7} e^{θ_{6, j} S_{7} t} e, t \geq 0, \\ f_{8} (t) = β_{8} e^{S_{8} t} S_{8}^{0}, F_{8} (t) = 1 - β_{8} e^{S_{8} t} e, t \geq 0 . \end{matrix}

(27)

The above functions can be computed easily using the algorithmic procedures published in the literature (see, e.g., [18,19]). Moreover, for special cases of PH-distributions, such as Erlang or hyperexponential, the calculations can be simplified further.

While implementing the algorithm to compute the CDF, we can also compute various percentiles to supplement the mean and the standard deviation of the random variables $W_{i}, 1 \leq i \leq 8$. Specifically, we discuss these measures for the boarding times in Section 4.
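Once a CDF can be evaluated at any point, a percentile can be obtained by a simple root search. The following is a minimal sketch, assuming a continuous, nondecreasing CDF on $[0, \infty)$; the function name `percentile_from_cdf` and the exponential test case (rate 3 per hour) are illustrative choices, not part of the model.

```python
import math

def percentile_from_cdf(cdf, q, upper=1.0, tol=1e-10):
    """Return t with cdf(t) close to q, for a continuous nondecreasing CDF on [0, inf)."""
    if not 0.0 < q < 1.0:
        raise ValueError("q must lie strictly between 0 and 1")
    # Double the upper end of the bracket until it covers the target probability.
    while cdf(upper) < q:
        upper *= 2.0
    lower = 0.0
    # Plain bisection on the bracket [lower, upper].
    while upper - lower > tol:
        mid = 0.5 * (lower + upper)
        if cdf(mid) < q:
            lower = mid
        else:
            upper = mid
    return 0.5 * (lower + upper)

# Illustration with an exponential CDF; the rate is an arbitrary assumption.
rate = 3.0
exp_cdf = lambda t: 1.0 - math.exp(-rate * t)
median = percentile_from_cdf(exp_cdf, 0.50)
p95 = percentile_from_cdf(exp_cdf, 0.95)
```

Bisection is used here only because it needs nothing beyond CDF evaluations; any other bracketing root finder would serve equally well.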

Using Theorem 3, we can compute the PDF and CDF of T and V as

f_{T} (t) = (p_{2} \otimes β_{1}, 0) e^{A t} A^{0}, F_{T} (t) = 1 - (p_{2} \otimes β_{1}, 0) e^{A t} e, t \geq 0,

(28)

and

f_{V} (t) = (p_{2} \otimes β_{1}, 0) e^{V t} V^{0}, F_{V} (t) = 1 - (p_{2} \otimes β_{1}, 0) e^{V t} e, t \geq 0,

(29)

where $A^{0}$ and $V^{0}$ are such that $A e + A^{0} = 0$ and $V e + V^{0} = 0$.

Note that, for a general PH-distribution, the matrix exponential needed in the PDF and CDF can be computed using the uniformization method (see, e.g., [18]). Here, we briefly list the steps involved in the computation of the PDF and CDF of V (Algorithm 1).

Algorithm 1: Algorithmic Steps to Compute $f_{V} (t)$ and $F_{V} (t)$ Using Uniformization Method.

Step 0: Compute $η = \max_{i} | V_{i, i} |$. That is, $η$ is the largest absolute value among the diagonal elements of the matrix V. Note that $η$ does not depend on t, so it is computed only once when the PDF and CDF are required over a specified set of values, e.g., $\{t_{1}, \dots, t_{n}\}$. Let $P = I + \frac{1}{η} V$. Let $ϵ$ be a small positive number.
Step 1: For a given t, compute $ζ_{r} = e^{- η t} \frac{{(η t)}^{r}}{r!}, 0 \leq r \leq r^{*}$, where $r^{*} = r^{*} (t)$ is the smallest integer such that $\sum_{r = 0}^{r^{*}} ζ_{r} > 1 - ϵ$. Let $i = 0$, $h^{(0)} = (p_{2} \otimes β_{1}, 0)$, $φ^{(0)} = 1 - ζ_{0}$, and $g^{(0)} = ζ_{0} h^{(0)} V^{0}$.
Step 2: $i \leftarrow i + 1$, $h^{(i)} = h^{(i - 1)} P$, $φ^{(i)} = φ^{(i - 1)} - ζ_{i} h^{(i)} e$, and $g^{(i)} = g^{(i - 1)} + ζ_{i} h^{(i)} V^{0}$.
Step 3: If $i < r^{*}$, go to Step 2.
Step 4: $f_{V} (t) = g^{(r^{*})}$ and $F_{V} (t) = φ^{(r^{*})}$.
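The steps of Algorithm 1 can be sketched in a few lines for a generic PH-representation $(α, T)$. This is a minimal illustration in plain Python, not the structured implementation recommended for the matrix V: the function name `ph_pdf_cdf`, the dense list-of-lists storage, and the Erlang test case are assumptions made here. The Poisson weights are computed without scaling, so the sketch refuses large values of $η t$.

```python
import math

def ph_pdf_cdf(alpha, T, t, eps=1e-12):
    """PDF and CDF at time t of PH(alpha, T) by uniformization (dense, unscaled sketch)."""
    n = len(alpha)
    exit_rates = [-sum(row) for row in T]              # T^0 = -T e
    eta = max(abs(T[i][i]) for i in range(n))          # Step 0
    if eta * t > 700.0:
        raise ValueError("eta * t is too large for this unscaled sketch")
    P = [[(1.0 if i == j else 0.0) + T[i][j] / eta for j in range(n)]
         for i in range(n)]
    zeta = math.exp(-eta * t)                          # Step 1: zeta_0
    weight_sum = zeta
    h = list(alpha)
    phi = 1.0 - zeta * sum(h)
    g = zeta * sum(h[i] * exit_rates[i] for i in range(n))
    r = 0
    # Steps 2 and 3: add terms until the Poisson weights sum to more than 1 - eps.
    while weight_sum <= 1.0 - eps and zeta > 0.0:
        r += 1
        zeta *= eta * t / r                            # zeta_r from zeta_{r-1}
        weight_sum += zeta
        h = [sum(h[i] * P[i][j] for i in range(n)) for j in range(n)]
        phi -= zeta * sum(h)
        g += zeta * sum(h[i] * exit_rates[i] for i in range(n))
    return g, phi                                      # Step 4

# Check against an Erlang of order 2 with rate 3, whose PDF and CDF are known in closed form.
alpha = [1.0, 0.0]
T = [[-3.0, 3.0], [0.0, -3.0]]
pdf, cdf = ph_pdf_cdf(alpha, T, 0.5)
```

All quantities added in the loop are nonnegative, which is the usual reason for preferring uniformization over a direct power series of the matrix exponential.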

Note 3.

It is worth pointing out that the special structure of V, and hence of P, should be exploited in the steps mentioned above. This is very important, especially when N is large, as well as when the orders of the underlying PH-representations are large. The key steps are outlined below.

Observing that $h^{(0)} = (p_{2} \otimes β_{1}, 0)$ and partitioning $h^{(i)}$ as

h^{(i)} = (h_{1}^{(i)}, h_{2}^{(i)}, h_{3}^{(i)}, h_{4}^{(i)}), h_{r}^{(i)} = (h_{r, 1}^{(i)}, \dots, h_{r, N}^{(i)}), 1 \leq r \leq 3, i \geq 0,

(30)

we proceed as follows. Given the current iterate $h^{(k)}$ for $k = i - 1$, the next iterate for $k = i$ is obtained as

h_{1, j}^{(i)} = h_{1, j}^{(i - 1)} + \frac{1}{η} θ_{1, j} h_{1, j}^{(i - 1)} S_{1}, 1 \leq j \leq N,

(31)

h_{2, j}^{(i)} = h_{2, j}^{(i - 1)} + \frac{1}{η} [θ_{1, j} h_{1, j}^{(i - 1)} S_{1}^{0} β_{2} + θ_{2, j} h_{2, j}^{(i - 1)} S_{2}], 1 \leq j \leq N,

(32)

h_{3, j}^{(i)} = h_{3, j}^{(i - 1)} + \frac{1}{η} [p_{1, j} \sum_{k = 1}^{N} θ_{2, k} h_{2, k}^{(i - 1)} S_{2}^{0} β_{3} + θ_{3, j} h_{3, j}^{(i - 1)} S_{3}], 1 \leq j \leq N,

(33)

h_{4}^{(i)} = h_{4}^{(i - 1)} + \frac{1}{η} [\sum_{k = 1}^{N} θ_{3, k} h_{3, k}^{(i - 1)} S_{3}^{0} β_{4} + h_{4}^{(i - 1)} S_{4}] .

(34)

One can further exploit any special structure of the matrices $S_{i}, 1 \leq i \leq 4$. For example, when dealing with a hyperexponential distribution, the corresponding matrix in the PH-representation is a diagonal matrix, which simplifies the vector-matrix products even further. The details are omitted.

3.3. Extension to More than Two Cities

The approach taken for two cities can easily be extended to more than two cities. However, the dimensions of the problem increase. For example, if there are K cities involved, then, instead of using 8 random variables to describe the process for the two-city case, we need $4 K$ random variables. The results that are true for the two-city case will also hold but with larger-dimension representations for the underlying random variables.

4. Illustrative Numerical Examples

In this section, we illustrate the key concepts with two sets of numerical examples. The time units, unless otherwise specified, are hours. The input parameters for the illustrative examples are chosen as follows. In the airline industry, the passenger load factor (PLF) [24] is defined as the ratio of the number of actual passengers to the number of available seats. We will denote this fraction as p in the following. Based on the data provided in [24], we note that this fraction (for domestic flights) ranges from 0.55 to 0.85 approximately. Thus, we conduct our analysis by taking PLF to be in this range. However, here, we discuss the examples by fixing p at 0.55 and 0.85.

It would be ideal to perform analyses with all practical data for the model studied here. However, except for PLF, to our knowledge, there are no data on boarding times available to use here. It is possible that these data are protected by each airline and are not available to the public. When these data are made available or when an airline wishes to explore the use of the model proposed here, one can fit a PH-distribution to the data (see, e.g., [25,26,27,28,29,30,31,32]). For the PMF of the number of passengers, we consider truncated binomial, truncated (reverse) geometric, and truncated Poisson forms. In particular, we take $p_{1}$ and $p_{2}$ to be one of the following three discrete distributions.

Truncated binomial ( $B$ ): This is a binomial distribution that is truncated so that the mass is within $\{1, \dots, N\}$. Specifically, the PMF is of the form

\frac{1}{[1 - {(1 - p_{b})}^{N}]} \binom{N}{n} p_{b}^{n} {(1 - p_{b})}^{N - n}, n = 1, \dots, N .

(35)

Truncated geometric ( $G$ ): This is a (reversed) geometric distribution that is truncated so that the mass is within $\{1, \dots, N\}$. Specifically, the PMF is of the form

\frac{1}{[1 - {(1 - p_{g})}^{N}]} p_{g} {(1 - p_{g})}^{N - n}, n = 1, \dots, N .

(36)

Truncated Poisson ( $P$ ): This is a Poisson distribution that is truncated so that the mass is within $\{1, \dots, N\}$. Specifically, the PMF is of the form

c e^{- λ} \frac{λ^{n}}{n!}, n = 1, \dots, N,

(37)

where c is the normalizing constant to ensure a legitimate PMF.
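The three PMFs in Equations (35)-(37) can be written down directly. The sketch below is one possible transcription; the function names are illustrative, and the direct use of `math.comb` is an assumption that is adequate for the aircraft sizes considered here (N up to 400) but not for very large N.

```python
import math

def truncated_binomial(N, p_b):
    """Equation (35): probabilities for n = 1..N (list index 0 holds n = 1)."""
    norm = 1.0 - (1.0 - p_b) ** N
    return [math.comb(N, n) * p_b ** n * (1.0 - p_b) ** (N - n) / norm
            for n in range(1, N + 1)]

def truncated_reverse_geometric(N, p_g):
    """Equation (36): mass increases geometrically towards n = N."""
    norm = 1.0 - (1.0 - p_g) ** N
    return [p_g * (1.0 - p_g) ** (N - n) / norm for n in range(1, N + 1)]

def truncated_poisson(N, lam):
    """Equation (37): the constant c is obtained by normalizing over n = 1..N."""
    # Logarithms avoid overflow in lam**n and n! for large n.
    weights = [math.exp(-lam + n * math.log(lam) - math.lgamma(n + 1))
               for n in range(1, N + 1)]
    c = 1.0 / sum(weights)
    return [c * w for w in weights]

def pmf_mean(pmf):
    """Mean number of passengers for a PMF stored over n = 1..N."""
    return sum(n * q for n, q in enumerate(pmf, start=1))

# Means for N = 100 with the parameter values quoted in the text for p = 0.55.
means = (pmf_mean(truncated_binomial(100, 0.55)),
         pmf_mean(truncated_reverse_geometric(100, 0.0054)),
         pmf_mean(truncated_poisson(100, 55.0)))
```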

In order to compare the various scenarios (when varying the type of PMF), we have to choose the parameters of the above-mentioned PMFs in such a way that the mean number of passengers will always be the same. For example, if $N = 100$ and $p = 0.55$, then the mean number of passengers onboard will be 55. Thus, to arrive at this mean, one has to choose $p_{b} = p$ for the truncated binomial, $p_{g} = 0.0054$ for the truncated geometric, and $λ = 55$ for the truncated Poisson. It is worth mentioning that, due to the size of N and the values of p considered for the illustrative examples, the truncated binomial reduces to the binomial one since ${(1 - p_{b})}^{N} ≃ 0$.
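The geometric parameter that matches a prescribed mean has no simple closed form, but it can be found numerically. The following sketch uses bisection, relying on the fact that the mean of the reversed truncated geometric increases with $p_{g}$ from $(N + 1)/2$ towards N; the function names and the tolerance are assumptions made for illustration.

```python
def reverse_geometric_mean(N, p_g):
    """Mean of the PMF proportional to (1 - p_g)**(N - n), n = 1..N."""
    weights = [(1.0 - p_g) ** (N - n) for n in range(1, N + 1)]
    total = sum(weights)
    return sum(n * w for n, w in enumerate(weights, start=1)) / total

def solve_p_g(N, target_mean, tol=1e-12):
    """Bisection for p_g; requires (N + 1) / 2 < target_mean < N."""
    if not (N + 1) / 2.0 < target_mean < N:
        raise ValueError("target mean is not attainable")
    lo, hi = 1e-9, 1.0 - 1e-9
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if reverse_geometric_mean(N, mid) < target_mean:
            lo = mid          # mean too small: a larger p_g is needed
        else:
            hi = mid
    return 0.5 * (lo + hi)

# N = 100 and a target mean of 55 passengers, as in the example in the text.
p_g = solve_p_g(100, 55.0)
```

For these inputs the solver returns a value that rounds to the 0.0054 quoted above.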

It should be pointed out that when computing the binomial probabilities, one can encounter overflow or underflow issues, especially when N is large. To avoid this, one should find the mode of the binomial distribution and then compute the rest of the probabilities recursively. To facilitate such computation, we provide the mode values for the binomial case. For the set of parameters considered in this section, Table 1 lists the corresponding parameter values.
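One way to realize the mode-based recursion is sketched below. It is a variant chosen for simplicity rather than the exact procedure behind Table 1: the weight at the mode is set to 1, the remaining weights are obtained from the ratio of successive binomial probabilities, and the result is normalized at the end. The function name is illustrative, and $0 < p < 1$ is assumed.

```python
import math

def binomial_pmf_from_mode(N, p):
    """Binomial(N, p) probabilities for n = 0..N, built outward from the mode."""
    mode = min(N, int(math.floor((N + 1) * p)))
    ratio = p / (1.0 - p)
    w = [0.0] * (N + 1)
    w[mode] = 1.0                        # unnormalized weight at the mode
    for n in range(mode, N):             # upward: b(n+1)/b(n) = (N-n)/(n+1) * p/(1-p)
        w[n + 1] = w[n] * (N - n) / (n + 1) * ratio
    for n in range(mode, 0, -1):         # downward: b(n-1)/b(n) = n/(N-n+1) * (1-p)/p
        w[n - 1] = w[n] * n / (N - n + 1) / ratio
    total = sum(w)
    return [x / total for x in w]

# A case where direct evaluation of p**n * (1 - p)**(N - n) would underflow in the tails.
big = binomial_pmf_from_mode(5000, 0.85)
big_mean = sum(n * q for n, q in enumerate(big))
```

Because every weight is at most 1 and the recursion moves away from the largest term, tail values may underflow harmlessly to zero but never overflow.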

For the truncated binomial and the truncated Poisson, the chosen values of the parameters N and p guarantee that the probability of having fewer than, e.g., 10 passengers onboard is insignificant (i.e., close to zero). This is not the case with the truncated geometric, owing to the choice of the geometric parameter required to guarantee a given mean. However, it is easy to modify this by ensuring that the mass of this geometric distribution has a positive value only beyond a specific number, such as 10.

For the rate vector $θ_{r}, 1 \leq r \leq 6$, given the starting value, say $ϑ_{1}$ (at 1), and the ending value, say $ϑ_{N}$ (at N), we consider four possible scenarios consisting of (a) linearly decreasing rates (LD); (b) quadratically decreasing rates (QD); (c) decreasing rates in a square root manner (SD); and (d) decreasing rates in a logarithmic manner (LG). Note that $ϑ_{1}$ and $ϑ_{N}$ are, respectively, the rates when one passenger and N passengers are onboard. Naturally, we impose the restriction $ϑ_{1} > ϑ_{N}$. Thus, we have the following.

LD: Here, we have (note that we suppress the suffix r in $θ_{r}$ )

$θ_{j} = ϑ_{1} - \frac{ϑ_{1} - ϑ_{N}}{N - 1} (j - 1), j = 1, \dots, N .$

(38)
QD: Here, we have

$θ_{j} = ϑ_{1} - \frac{ϑ_{1} - ϑ_{N}}{{(N - 1)}^{2}} {(j - 1)}^{2}, j = 1, \dots, N .$

(39)
SD: Here, we have

$θ_{j} = ϑ_{1} - \frac{ϑ_{1} - ϑ_{N}}{\sqrt{N - 1}} \sqrt{j - 1}, j = 1, \dots, N .$

(40)
LG: Here, we have

$θ_{j} = ϑ_{1} - \frac{ϑ_{1} - ϑ_{N}}{\log (N)} \log (j), j = 1, \dots, N .$

(41)
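Equations (38)-(41) share the form $θ_{j} = ϑ_{1} - (ϑ_{1} - ϑ_{N}) g(j)$, where g increases from 0 at $j = 1$ to 1 at $j = N$. The sketch below uses that observation; the function name `rate_vector` and the string labels are illustrative, and $N \geq 2$ is assumed.

```python
import math

def rate_vector(kind, v1, vN, N):
    """Rates theta_1..theta_N for the LD, QD, SD and LG schemes of Equations (38)-(41)."""
    if kind == "LD":
        shape = lambda j: (j - 1) / (N - 1)
    elif kind == "QD":
        shape = lambda j: ((j - 1) / (N - 1)) ** 2
    elif kind == "SD":
        shape = lambda j: math.sqrt((j - 1) / (N - 1))
    elif kind == "LG":
        shape = lambda j: math.log(j) / math.log(N)
    else:
        raise ValueError("kind must be one of LD, QD, SD, LG")
    return [v1 - (v1 - vN) * shape(j) for j in range(1, N + 1)]

# Starting and ending rates (12, 3) with N = 100, one of the settings used in this section.
vectors = {kind: rate_vector(kind, 12.0, 3.0, 100) for kind in ("LD", "QD", "SD", "LG")}
```

Since $x^{2} < x < \sqrt{x}$ on (0, 1), the QD rates lie above the LD rates, which in turn lie above the SD rates, at every interior index.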

Since the main goal of this work is to determine the impact of reducing the average boarding time on the other measures, we fix the average values of the times spent in various events, such as de-boarding, cleaning, boarding, and leaving a particular city for another city. In order to do this, we need to accordingly fix the means of the eight PH-distributions. To this end, we use Equation (16), which relates the means $μ_{W_{i}}^{'}$ to $μ_{i}^{'}$, for $1 \leq i \leq 8$. Thus, given a specific probability vector $p_{r}$, the rate vector $θ_{r}$, and the mean $μ_{W_{r}}^{'}$, we can find the value of $μ_{r}^{'}$ that will give the set value for $μ_{W_{r}}^{'}$.
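Equation (16) is not reproduced in this section, but the relation it expresses can be recovered from Equation (27). The short derivation below is a sketch under the assumption that $μ_{r}^{'}$ denotes the mean $β_{r} {(- S_{r})}^{-1} e$ of the PH-distribution with representation $(β_{r}, S_{r})$; the suffixes on the probability vector and the rate vector are suppressed.

```latex
\mu'_{W_r} = \int_0^\infty \bigl[ 1 - F_r(t) \bigr] \, dt
           = \sum_{j=1}^{N} p_j \, \beta_r \, (-\theta_j S_r)^{-1} e
           = \mu'_r \sum_{j=1}^{N} \frac{p_j}{\theta_j},
\qquad \text{so that} \qquad
\mu'_r = \frac{\mu'_{W_r}}{\sum_{j=1}^{N} p_j / \theta_j} .
```

For the two flying times, which involve no mixing over the number of passengers, the same reasoning gives $μ_{W_{r}}^{'} = μ_{r}^{'}$.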

For the following two examples, we fix the input parameters as follows. The unit is the number of hours, unless otherwise indicated.

μ_{W_{1}}^{'} = μ_{W_{2}}^{'} = μ_{W_{5}}^{'} = μ_{W_{6}}^{'} = \frac{1}{3}, μ_{W_{4}}^{'} = μ_{W_{8}}^{'} = 2 .

We take the probability vectors $p_{1}$ and $p_{2}$ to be identical, but we vary the common vector to be one of the three, namely B, G, and P, as listed above. Moreover, for the rate vectors, we take $θ_{1} = θ_{2} = θ_{4} = θ_{5}$ and $θ_{3} = θ_{6}$. The parameter values of $(ϑ_{1}, ϑ_{N})$ for these two sets, namely for $θ_{1}$ and $θ_{3}$, will be $(12, 3)$ and $(12, 1.5)$, respectively. However, we vary the type of decrease to be one of the four listed above. In Figure 1, we display a sample plot of the values of $θ_{j}$ under the four scenarios: LD, QD, SD, and LG.

The means $μ_{W_{3}}^{'}$ and $μ_{W_{7}}^{'}$ are varied from $\frac{25}{60}$ to $\frac{30}{60}$ in increments of 1 min, i.e., in increments of $\frac{1}{60}$ h. The capacity of the aircraft, N, is varied from 50 to 400 in increments of 50. The plots in Figure 2, Figure 3 and Figure 4 show how, under the values chosen, the mean of the underlying random variable $X_{3}$ (and hence of the others), which can be controlled by the system providing the needed resources, behaves as we vary N, the type of probability vector, and the type of rate vector. Before we consider these (spider) figures, a few details are provided for explanation. The two-tuple values displayed at the perimeter of the outermost circle correspond to N and the mean boarding times (in minutes). Thus, the two-tuple value 50 25 corresponds to $N = 50$ and a mean boarding time of 25 min. The legend containing B, G, and P indicates the type of distribution used to model the PMF of the number of passengers.

One can clearly notice the patterns in these plots, indicating significant changes to the mean as N and PLF are varied, as well as the type of probability vector (either truncated binomial or truncated geometric). However, we do not see a significant difference between the truncated binomial and truncated Poisson. As is to be expected, the mean increases as the average boarding time is increased under both values of PLF considered. This behavior indicates that an increase in the rate calls for additional resources or additional strategies to quicken the process of boarding. For example, this can be achieved by increasing the number of gate attendants (based on the value of PLF, which should be known ahead of time). Among the four types of rate vector considered, it appears that a quadratically decreasing rate gives the smallest mean (for $X_{3}$ and $X_{7}$), indicating that the system can dynamically provide gate attendants to help the passengers to board the aircraft.

Illustrative Example 1: In this example, we use Erlang distributions for all underlying random variables. Recall that an Erlang of order m with parameter $γ$, denoted as $E (m, γ)$, has the PDF given by

f (t) = \frac{γ^{m}}{(m - 1)!} t^{m - 1} e^{- γ t}, t \geq 0 .

(42)

Note that the mean and the variance are, respectively, given by $\frac{m}{γ}$ and $\frac{m}{γ^{2}}$. One advantage of using this distribution is that, by choosing the order to be a large positive integer, we can model a random variable that has very little variation.

Using the notation $μ_{i} = \frac{1}{μ_{i}^{'}}, 1 \leq i \leq 8$, the order and the parameter values for this example are as follows.

\begin{matrix} X_{1} \sim E (5, 5 μ_{1}), X_{2} \sim E (50, 50 μ_{2}), X_{3} \sim E (5, 5 μ_{3}), X_{4} \sim E (100, 100 μ_{4}), \\ X_{5} \sim E (5, 5 μ_{5}), X_{6} \sim E (50, 50 μ_{6}), X_{7} \sim E (5, 5 μ_{7}), X_{8} \sim E (100, 100 μ_{8}) . \end{matrix}

(43)
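The effect of the order on the variability can be made explicit through the coefficient of variation (standard deviation divided by mean), which follows from the mean and variance of the Erlang distribution stated above. The numerical values are evaluated for the three orders appearing in Equation (43).

```latex
\mathrm{CV}\bigl[ E(m, \gamma) \bigr]
  = \frac{\sqrt{m / \gamma^{2}}}{m / \gamma}
  = \frac{1}{\sqrt{m}},
\qquad
m = 5: \ \approx 0.447, \quad
m = 50: \ \approx 0.141, \quad
m = 100: \ 0.1 .
```

The coefficient of variation does not depend on $γ$, so changing the mean of an Erlang random variable of fixed order leaves its relative variability unchanged.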

The parameters N and $μ_{W_{3}}^{'} = μ_{W_{7}}^{'}$ are varied, respectively, from 50 to 400 and from 25 to 30 min in increments of 1 min. In Figure 5 and Figure 6, respectively, we display the key measures for probability vectors labeled B and G under a linearly decreasing (LD) rate vector. Since we saw similar behavior for the other types of theta vectors, namely QD, SD, and LG, we display the results here only for the LD case.

It is clear by looking at these figures (as well as the ones not provided here due to the similarity of the plots) that the following occurs.

Only the type of probability vector (whether it is truncated binomial or truncated geometric) and the type of theta vector (LD through LG) appear to have an impact on the percentiles.
Comparing the truncated binomial probability (B) and the truncated geometric (G) schemes, we notice that (a) scheme G gives smaller values for the 50th percentile for both values of PLF; (b) for the 75th percentile, while scheme G gives smaller values when $PLF = 0.55$, the values are similar for both schemes when $PLF = 0.85$; (c) scheme B gives smaller values for the 95th percentile for both values of PLF. This indicates that scheme G starts with small values for the percentiles (as compared to scheme B) and then yields progressively larger values for higher percentiles.

Finally, we look at the percentage reduction in the percentiles when decreasing the average boarding time. We denote by $P_{(j)}^{(i)}$ the value of the ith percentile when the average boarding time is j. Thus, $P_{(25)}^{(50)}$ stands for the 50th percentile when the average boarding time is 25 min. The reduction percentage, for a given ith percentile and a given average boarding time j, is calculated as

100 [\frac{P_{(30)}^{(i)} - P_{(j)}^{(i)}}{P_{(30)}^{(i)}}] .

(44)
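Equation (44) is a one-line computation. In the sketch below, the function name and the two percentile values (60 and 50 min) are hypothetical and are not results from this study; they only illustrate the arithmetic.

```python
def reduction_percentage(percentile_at_30, percentile_at_j):
    """Equation (44): percentage reduction relative to the 30 min baseline."""
    return 100.0 * (percentile_at_30 - percentile_at_j) / percentile_at_30

# Hypothetical percentile values in minutes, chosen only to show the calculation.
example = reduction_percentage(60.0, 50.0)
```

As a point of reference, if a percentile scaled exactly in proportion to the average boarding time, reducing the average from 30 to 25 min would give a reduction of $100 \times 5 / 30 \approx 16.67\%$.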

The reduction percentages do not appear to be significant when the probability vectors, the rate vectors, or PLF are changed. Hence, in Figure 7, we display the reduction percentages for the case of a truncated binomial and LD rate.

It is very clear from this figure that the reduction percentages are almost identical for all three percentiles. Further, a 5 min reduction in the average boarding time results in a more than 16% reduction in the percentile value. This translates into a guarantee that, for at least 95% of the time, the boarding time will not exceed a value between 45 and 69 min, depending on the value of N. The average 95% guarantee time across the board is about 50 min. If one were to give a guarantee at a 50% level, the boarding time will not exceed a value between 21 and 26 min, depending on the value of N. The average 50% guarantee time across the board is about 25 min.

Illustrative Example 2: In this example, we use Erlang as well as hyperexponential distributions for the underlying random variables. Recall that a hyperexponential of order m with parameters $γ_{1}, \dots, γ_{m}$, with the corresponding mixing probabilities $q_{1}, \dots, q_{m}$, has the PDF given by

f (t) = \sum_{k = 1}^{m} q_{k} γ_{k} e^{- γ_{k} t}, t \geq 0 .

(45)

Note that the mean and the variance are, respectively, given by $\sum_{k = 1}^{m} \frac{q_{k}}{γ_{k}}$ and

2 \sum_{k = 1}^{m} \frac{q_{k}}{γ_{k}^{2}} - {(\sum_{k = 1}^{m} \frac{q_{k}}{γ_{k}})}^{2} .

This distribution can be used when there is large variability in the underlying random variable. We will denote this hyperexponential by $HE \{(q_{1}, \dots, q_{m}), (γ_{1}, \dots, γ_{m})\}$. The order and the parameter values for this example are as follows.

\begin{matrix} X_{1} \sim E (5, 5 μ_{1}), X_{2} \sim E (50, 50 μ_{2}), \\ X_{3} \sim HE \{(0.50, 0.30, 0.15, 0.04, 0.01), μ_{3} (10, 5, 2.5, 1.25, 0.625)\}, \\ X_{4} \sim E (100, 100 μ_{4}), X_{5} \sim E (5, 5 μ_{5}), X_{6} \sim E (50, 50 μ_{6}), X_{8} \sim E (100, 100 μ_{8}), \\ X_{7} \sim HE \{(0.50, 0.30, 0.15, 0.04, 0.01), μ_{7} (10, 5, 2.5, 1.25, 0.625)\} . \end{matrix}
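The variability implied by these hyperexponential parameters can be checked with the mean and variance formulas given above. The sketch below takes the scale factor multiplying the rates to be 1 purely for illustration (the coefficient of variation does not depend on it); the function name is an assumption.

```python
def hyperexp_mean_var(q, gamma):
    """Mean and variance of a hyperexponential with mixing probabilities q and rates gamma."""
    m1 = sum(qk / gk for qk, gk in zip(q, gamma))
    m2 = 2.0 * sum(qk / gk ** 2 for qk, gk in zip(q, gamma))
    return m1, m2 - m1 ** 2

q = (0.50, 0.30, 0.15, 0.04, 0.01)
base_rates = (10.0, 5.0, 2.5, 1.25, 0.625)   # rates with the scale factor set to 1

mean, var = hyperexp_mean_var(q, base_rates)
cv = var ** 0.5 / mean                       # coefficient of variation

# Rescaling all rates by a common factor changes the mean but not the CV.
scaled_mean, scaled_var = hyperexp_mean_var(q, [2.0 * g for g in base_rates])
scaled_cv = scaled_var ** 0.5 / scaled_mean
```

With these inputs the coefficient of variation is roughly 1.7, compared with $1 / \sqrt{5} \approx 0.447$ for an Erlang of order 5, which is the sense in which this example represents boarding times with large variability.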

As in the previous example, we vary the parameters N and $μ_{W_{3}}^{'} = μ_{W_{7}}^{'}$, respectively, from 50 to 400 and from 25 to 30 min in increments of 1 min.

In Figure 8 and Figure 9, respectively, we display the key measures for probability vectors labeled B and G under a linearly decreasing (LD) rate vector.

It is clear by looking at these figures (as well as the ones not provided here due to the similarity of the plots) that the following occur.

Only the type of probability vector (whether it is truncated binomial or truncated geometric) and the type of theta vector (LD through LG) appear to have an impact on the percentiles. This observation is similar to the one seen in the previous example.
Comparing the truncated binomial probability (B) and the truncated geometric (G) schemes, we notice that (a) scheme G gives smaller values for the 50th percentile for both values of PLF; (b) for the 75th and the 95th percentiles, while scheme G gives smaller values when $PLF = 0.55$, the values are rather similar for both schemes when $PLF = 0.85$. This indicates that the large variability in the boarding times appears to nullify any significant differences in the PLF value, especially when the percentiles increase.

The plot of the reduction percentage for this example is almost identical to the one seen in the previous example and hence the figure is not displayed here. However, the guarantee times differ and the details are as follows. A 5 min reduction in the average boarding time results in a more than 16% reduction in the percentile value. This translates to a guarantee that, for at least 95% of the time, the boarding time will not exceed a value between 73 and 79 min, depending on the value of N. The average 95% guarantee time across the board is about 75 min. If one were to provide a guarantee at a 50% level, then the boarding time will not exceed a value between 9 and 11 min, depending on the value of N. The average 50% guarantee time across the board is about 10 min.

It is worth pointing out that, comparing the two illustrative examples, while the guarantee times at the 50% level are much smaller for the boarding times with large variability, at a 95% level, the guarantee times are smaller for the boarding times with small variability. This is probably due to the long tail of the probability distribution associated with the boarding time event.

Illustrative Example 3: Here, we provide a brief discussion of the PDF of the time to travel by looking at the case when $N = 200$ and $PLF = 0.85$, with the truncated binomial probabilities and LD rates, for the above-mentioned two illustrative examples. In Figure 10, we plot the representative PDF of the time to travel from city $C_{1}$ to city $C_{2}$ and then back to city $C_{1}$; moreover, the PDF of the travel time from city $C_{1}$ to city $C_{2}$ under three scenarios is plotted in Figure 11.

It is evident from the above plots that, for all Erlang cases (see Illustrative Example 1), the PDF of the total travel time from city $C_{1}$ to city $C_{2}$ and back is a bell-shaped curve. Thus, one can try to fit a normal PDF with mean $\frac{19}{3}$ h and standard deviation 0.488975 h. For the PDF of the total travel time from city $C_{1}$ to city $C_{2}$ in the Erlang case (see Illustrative Example 1), the normal curves for the three schemes, B, G, and P, have the same mean of $\frac{19}{6}$ h but the standard deviations are, respectively, 0.343545 h, 0.430563 h, and 0.374347 h. This normal approximation will be helpful to managers seeking a quick solution using Excel or Excel-type worksheets in workplaces. Hence, we point out the possibility to implement the model proposed here.
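With a normal approximation in hand, percentiles of the travel time reduce to a single inverse-CDF evaluation. The sketch below uses the standard library class `statistics.NormalDist` with the means and standard deviations quoted above; the variable names are illustrative, and the results are approximations of the normal fit, not of the exact PH-based model.

```python
from statistics import NormalDist

# Round trip (city C1 to C2 and back), all-Erlang case: mean 19/3 h, sd 0.488975 h.
round_trip = NormalDist(mu=19 / 3, sigma=0.488975)
round_trip_p95 = round_trip.inv_cdf(0.95)

# One-way trip (C1 to C2) under schemes B, G and P: common mean 19/6 h.
one_way_sd = {"B": 0.343545, "G": 0.430563, "P": 0.374347}
one_way_p95 = {scheme: NormalDist(mu=19 / 6, sigma=sd).inv_cdf(0.95)
               for scheme, sd in one_way_sd.items()}
```

Under this approximation the 95th percentile of the round-trip time is about 7.14 h, and the ordering of the one-way 95th percentiles across the three schemes follows the ordering of their standard deviations.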

In the case of Illustrative Example 2, wherein we used a hyperexponential distribution to model the boarding times, we notice a long-tailed distribution for the PDF. One can try to fit a three-parameter gamma, a three-parameter Weibull, or even a three-parameter lognormal distribution to approximate the PDF when dealing with a long-tailed distribution such as the one seen in the PDF plots. For example, we fitted a three-parameter gamma distribution with the shape, scale, and threshold parameters, respectively, of 29.0, 0.153, and 1.5 for the total travel time from city $C_{1}$ and back to city $C_{1}$.

5. Concluding Remarks

In this paper, we sought to address the boarding time, one of the major issues facing commercial airlines, using phase type distributions. While there is research on the study of boarding strategies that minimize the average boarding time, in this paper, our aim was to study the boarding time (as part of the process of traveling from one city to another) given a boarding strategy already in place. We used well-known probability distributions for the number of passengers boarding, as well as different schemes for the rates of processing of the passengers. We used the passenger load factor information from the Bureau of Transportation Statistics [24]. However, there were no data available on the time to board, clean, or de-board or the actual flying time. Hence, we used hypothetical values based on personal experience in traveling, and this is a limitation of this model. However, when actual data are made available, one can fit a PH-distribution to the data and then apply the methodology suggested here. Through the qualitative analysis of the modeling of aircraft boarding, we found that a 5 min reduction in the average boarding time resulted in a more than 16% reduction in the values of the percentiles. This translates to a guarantee that, for at least 95% of the time, the boarding time will not exceed a value between 73 and 79 min, depending on the capacity of the aircraft. The average 95% guarantee time across the board is about 75 min. If one were to give a guarantee at a 50% level, then the boarding time will not exceed a value between 9 and 11 min, depending on the capacity of the aircraft. The average 50% guarantee time across the board is about 10 min. Understanding the percentiles of the boarding times, as opposed to relying only on the average boarding times, will help management to adopt a better boarding strategy, which in turn will lead to an increase in the number of trips that an aircraft can make. For example, when looking at the 95th percentile of the boarding time, there is a good understanding of how many trips an aircraft can make, as opposed to looking only at the average boarding time. Management can only incur a 5% error regarding the times in their estimations, as opposed to the error rate being unknown when using the mean, unless the boarding times are symmetric, in which case the error rate will be 50%. Thus, the use of percentiles to estimate and schedule the number of trips for an aircraft to make is more beneficial.

The model studied can be extended to include a variety of other aspects, such as (a) adding another random variable to model the time between the gate closing and the plane taking off (currently, we include this as part of the flying time); (b) the incorporation of catastrophic events (due to weather, a lack of aircraft personnel, or the mechanical failure of the aircraft), leading to the cancellation of a flight from a particular city; (c) extending from two cities to more than two cities; and (d) using practical data to fit the probability function for (i) the number of passengers boarding; (ii) the time to clean the aircraft; and (iii) the time taken from the moment that the gate closes to the aircraft’s landing in another city. Another task of interest for future work is to compare different boarding processes and identify which one will contribute to reducing the boarding times; then, this boarding process can be used in the model studied here to estimate the increase in the average number of trips that can be made.

Funding

This research received no external funding.

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The author declares no conflicts of interest.

References

1. Available online: https://www.wsj.com/articles/southwest-airlines-boarding-seating-turn-ee12d3f2 (accessed on 1 June 2024).
2. Yeager, M. What Does It Take to Get a Plane Ready Between Flights? American Airlines Shows Us. The Arizona Republic, 14 May 2019.
3. Available online: https://www.nytimes.com/2011/11/01/business/airlines-are-trying-to-cut-boarding-times-on-planes.html (accessed on 1 June 2024).
4. Bachmat, E.; Berend, D.; Sapir, L.; Skiena, S.; Stolyarov, N. Analysis of Airplane Boarding Times. Oper. Res. 2009, 57, 499–513.
5. Bachmat, E.; Khachaturov, V.; Kuperman, R. Optimal back-to-front airplane boarding. Phys. Rev. E 2013, 87, 062805.
6. Bachmat, E. Airplane boarding, disk scheduling, and Lorentzian geometry. In Mathematical Adventures in Performance Analysis: From Storage Systems, through Airplane Boarding, to Express Line Queues; Springer International Publishing: Cham, Switzerland, 2014; pp. 51–129.
7. Erland, S.; Bachmat, E.; Steiner, A. Let the fast passengers wait: Boarding an airplane takes shorter time when passengers with the most bin luggage enter first. Eur. J. Oper. Res. 2022, 317, 748–761.
8. Hutter, L.; Jaehn, F.; Neumann, S. Influencing factors on airplane boarding times. Omega 2019, 87, 177–190.
9. Available online: https://simpleflying.com/fastest-boarding-type-guide/ (accessed on 1 June 2024).
10. Available online: https://www.azcentral.com/story/travel/airlines/2019/05/14/how-long-it-takes-to-get-a-plane-ready-between-flights-airplane-turnaround-time/1123694001/ (accessed on 1 June 2024).
11. Available online: https://www.skyparksecure.com/blog/fastest-plane-boarding-methods/ (accessed on 1 June 2024).
12. Steffen, J. Optimal boarding method for airline passengers. J. Air Transp. Manag. 2008, 14, 146–150.
13. Available online: https://www.usatoday.com/story/travel/columnist/hobica/2017/11/21/airline-boarding-order/881548001/ (accessed on 1 June 2024).
14. Willamowski, F.; Tillmann, A.M. Minimizing Airplane Boarding Time. Transp. Sci. 2022, 56, 1196–1218.
15. Neuts, M.F. Probability distributions of phase type. In Liber Amicorum Prof. Emeritus H. Florin; Department of Mathematics, University of Louvain: Ottignies-Louvain-la-Neuve, Belgium, 1975; pp. 173–206.
16. Bladt, M.; Nielsen, B.F. Matrix-Exponential Distributions in Applied Probability; Probability Theory and Stochastic Modelling; Springer: Boston, MA, USA, 2017; Volume 81.
17. Buchholz, P.; Kriege, J.; Felko, I. Input Modeling with Phase-Type Distributions and Markov Models: Theory and Applications; Springer: Heidelberg, Germany, 2014.
18. Chakravarthy, S.R. Introduction to Matrix-Analytic Methods in Queues; John Wiley & Sons, Inc.: London, UK, 2022; Volume 1.
19. Latouche, G.; Ramaswami, V. Introduction to Matrix Analytic Methods in Stochastic Modeling; SIAM: Philadelphia, PA, USA, 1999.
20. Neuts, M.F. Matrix-Geometric Solutions in Stochastic Models: An Algorithmic Approach; Johns Hopkins University Press: Baltimore, MD, USA, 1981.
21. Graham, A. Kronecker Products and Matrix Calculus with Applications; Ellis Horwood: Chichester, UK, 1981.
22. Marcus, M.; Minc, H. A Survey of Matrix Theory and Matrix Inequalities; Allyn and Bacon: Boston, MA, USA, 1964.
23. Steeb, W.H.; Hardy, Y. Matrix Calculus and Kronecker Product; World Scientific Publishing: Singapore, 2011.
24. Available online: https://www.transtats.bts.gov/Data_Elements.aspx?Data=1 (accessed on 15 March 2023).
25. Asmussen, S.; Nerman, O.; Olsson, M. Fitting phase-type distributions via the EM algorithm. Scand. J. Stat. 1996, 23, 419–441.
26. Bobbio, A.; Cumani, A. ML estimation of the parameters of a PH distribution in triangular canonical form. In Proceedings of the 5th International Conference on Modelling Techniques and Tools for Computer Performance Evaluation (TOOLS), Torino, Italy, 13–15 February 1991.
27. Bobbio, A.; Telek, M. A benchmark for PH estimation algorithms: Results for acyclic-PH. Commun. Stat. Stoch. Model. 1994, 10, 661–677.
28. Esparza, L.J.R. Maximum Likelihood Estimation of Phase-Type Distributions. Ph.D. Dissertation, Technical University of Denmark, Kongens Lyngby, Denmark, 2011.
29. Feldmann, A.; Whitt, W. Fitting mixtures of exponentials to long-tail distributions to analyze network performance models. Perform. Eval. 1998, 31, 245–279.
30. Okamura, H.; Dohi, T. Fitting phase-type distributions and Markovian arrival processes: Algorithms and tools. In Principles of Performance and Reliability Modeling and Evaluation: Essays in Honor of Kishor Trivedi on his 70th Birthday; Springer: Heidelberg, Germany, April 2016.
31. Reinecke, P.; Krauss, T.; Wolter, K. Cluster-based fitting of phase-type distributions to empirical data. Comput. Math. Appl. 2012, 64, 3840–3851.
32. Thummler, A.; Buchholz, P.; Telek, M. A novel approach for phase-type fitting with the EM algorithm. IEEE Trans. Depend. Secur. 2006, 3, 245–258.

Figure 1. A sample plot of the rate vector under four scenarios.

Figure 2. Plot of the mean of the boarding event with the LD rate under various scenarios.

Figure 3. Plot of the mean of the boarding event using truncated binomial probabilities under various scenarios.

Figure 4. Plot of the mean of the boarding event using truncated geometric probabilities under various scenarios.