1. Introduction
Retrial queues are characterized by the feature that there is no place for waiting by arriving customers in the case of the unavailability of service facilities at the arrival moment. However, a rejected customer may relocate to a virtual location known as “orbit” and attempt to access the service at random intervals. For instance, the monograph [
1] contains examples of the theory of retrial queues applied to analyze real-world systems. As an early work in the theory of retrial queues, [
2] has to be mentioned. Surveys of the research on retrial queues are given in [
3,
4,
5,
6,
7,
8,
9,
10]. Single-server retrial queues are investigated (see the excellent monograph [
4]) to a much better extent than queues with many servers due to the essentially higher dimension of the stochastic processes describing the multi-server retrial queues. In this paper, we consider a multi-server retrial queue.
The predominant body of study in queueing theory, particularly concerning retrial queues, focuses on systems with a stationary Poisson arrival process defined by a constant customer arrival rate. It is already well recognized that an arrival rate can fluctuate in many real-world systems. As a good model of arrivals with varying instantaneous arrival rates, the Markov arrival process (
) and its generalization, such as the batch Markov arrival process (
), are introduced in, e.g., [
11,
12]. For more information about the
, its properties, usefulness for modeling real-world flows, and literature surveys, see [
13,
14,
15,
16,
17]. The modeling of real-world flows by the
allows one to catch the inherent features of such flows as the fluctuating arrival rate, dependence between successive inter-arrival times, and possible high variance of these times. The issue of matching real-world flows with the
has been thoroughly examined in the current literature; refer to, for instance, [
17,
18,
19] and the citations therein.
The initial investigation of a multi-server retrial queue characterized by a
and phase-type (
) service time distribution is presented in [
20]. Phase-type distribution is an essential generalization of the most popular distribution among researchers, exponential distribution. In contrast to this distribution, which allows for the fitting of only the expectations of service times,
distribution allows for the fitting of their variance and higher moments. For more information about
distribution, see [
21,
22]. An approximation of any arbitrary distribution of a non-negative random variable can be accomplished through the utilization of the
distribution (see [
23]). The condition for the ergodicity of the
retrial queue is established in paper [
20]. A very important problem of the computation of this distribution is not touched upon there due to its complexity. This problem was later solved in [
24]. In [
24], sufficient ergodicity conditions and algorithms for calculating the stationary distribution of system states are given for the cases of a constant and infinitely increasing total retrial rates from the orbit.
The power of queueing theory consists in its ability to provide a tool not only for the probabilistic analysis of real-world systems under the fixed parameters of arrival, service, retrial, etc., processes but also for the elaboration of proposals on how these parameters might be modified to enhance the operation of the systems. Besides the traditional static optimization problems, e.g., finding the minimal number of servers sufficient to guarantee the fixed requirements of the quality of service indicators, which were solved in the beginning of queueing theory by its founder A.K. Erlang, currently, the problems of dynamic optimization are also actual. The swift advancement of electronic and communication technologies now enables the dynamic monitoring of queueing systems (e.g., the count of occupied servers, queue length, or the number of customers in the orbit) and facilitates immediate responses to unfavorable state changes. For instance, if the customer volume in the system escalates excessively, one may augment the number of servers delivering service, enhance the service rates of existing servers, or, alternatively, reduce the arrival rate, etc.
The standard retrial queueing models assume equal opportunities to occupy the idle servers for the primary and retrying customers. However, in various potential real-world applications, it is likely reasonable differentiate these opportunities different due to economic reasons. An analogous situation takes place in modeling so-called cognitive radio systems in which all types of customers may require the same number of system resources, but one of the types, the primary customers, have priority access as the owners of the channels. Another type, so-called cognitive customers, have opportunistic access and may obtain access only if a sufficient number of existing channels are idle.
By analogy, it seems to be reasonable to give some priority access to the primary customers in multi-server retrial queues over the retrying customers. The motivation for this is twofold. The first reason is that the main characteristic of quality of service in many systems is, namely, the probability of immediate access to service of an arbitrary arriving customer. Providing priority to the primary customers may increase this probability. The second economical reason is the following. If the primary customer is rejected, this customer may decide to abandon service in the system and permanently leave it without bringing any revenue to the system owner. If a customer decided to become a retrial customer, maybe he/she has a serious need to receive service in this system and will not leave the system without service and bringing the profit.
Priority access for the primary customers can be managed via the reservation of a few servers exclusively for the access of the primary customers when the number of idle servers is small, similar to the existing schemes of channel reservation in cognitive radio systems. The reservation makes sense even in the case when the rate of customer arrival is constant.
The use of the
as the model of a real-world arrival flow can ensure a more effective implementation of control based not solely on the present quantity of customers in service or within the system but also on the state of the fundamental process of the
. Specifically, in the context of multi-server retrial queues, it is prudent to regulate the retrial rate based on the number of occupied servers and the state of the arrival process. An interesting specific instance of the
is the Markov modulated Poisson process (
). In the
, during some exponentially distributed time interval, arrivals occur like, in the stationary Poisson process with some fixed rate. When this interval finishes, the rate admits another value. In the simplest case, we can speak about the intervals with high and low arrival rates. If primary customers can be lost when all servers are busy at their arrival moment, it makes sense to restrict the speed of retrials during the periods when the quantity of occupied servers is large and allow for a higher retrial rate when the number of busy servers is relatively smaller and a quick occupation of idle servers by the customers from orbit is desirable. The correctness of these intuitive considerations is confirmed in [
25], in which the
retrial queue is studied. The threshold-type strategy is employed to implement the admission of retrying customers for service. A retrying customer is acceptable only if the count of occupied servers does not surpass the threshold associated with the present state of the fundamental process of the
Alternatively, the customer reverts to the orbit. The system’s behavior, given a fixed set of thresholds, is characterized by a three-dimensional Markov chain whose stationary characteristics are analyzed in [
25]. The possibility of optimally choosing the thresholds is numerically demonstrated.
We classify [
25] as an opening of a new direction in dynamic control by queueing systems, including retrial queues, depending not only on the state of the orbit or buffer and servers but also on the state of the fundamental process of arrivals. Traditionally, control is executed solely based on the state of the orbit or the buffer and servers, as fluctuations in the arrival process are seldom considered. While these fluctuations are quite typical for many real-world systems, fluctuations in arrival rate can be caused by time intervals during the day or night, as well as the day of the week, weather conditions, and other random events and factors. Our paper investigates this new direction and essentially extends the results of [
25].
Two assumptions, which are quite restrictive from the point of view of potential applications, were imposed in the model analyzed in [
25]: service times have exponential distribution and customers can arrive only one by one. In this paper, we essentially relax both these assumptions. We suppose a much more general
distribution of service times, which allows for a better fit for the real distribution of the service time, or at least the variance of this time. Customers may arrive both singly and in random-sized batches, with their arrival characterized by the
. Batch arrival is characteristic of numerous real-world systems, e.g., telecommunication networks, where the arriving message represents a group of packets, or emergency services, where a group arrival of patients may occur as the result of an accident. It is clear that the possibility of batch arrivals of customers and their rejection when the number of available servers is insufficient for service of the arrived batch makes the reservation of servers (and access denial to retrials) for the primary customer service even more important than in the model considered in [
25]. The numerical experiments, including those provided in this study, conclusively demonstrate that the ignorance of batch arrivals (by replacing the
by the
with the same mean arrival rate) leads to too optimistic an evaluation of the performance of the system and an incorrect estimation of the amount of the resource required to provide the required quality of service level.
Thus, in this paper, we examine the
retrial queue. Analysis of this system is essentially more complicated than the analysis implemented in [
25] due to two reasons.
The first one is that the Markov chain describing the dynamics of the system has a higher dimension of a state space. In [
25], this chain has three components: the number of customers in orbit, the number of customers in service, and the fundamental process of the
. Here, several additional components, which characterize the service process in busy servers, are required, and the cardinality of the state space of these components can be pretty large.
The second reason is that customers may arrive in groups, resulting in the generator of the chain lacking a block tridiagonal structure and the chain not being classified as level-dependent quasi-birth-and-death processes (see [
21]) due to the potential for retrials and the impatience of orbiting customers.
This paper’s content is succinctly outlined as follows. The mathematical model delineating the system’s behavior is presented in
Section 2.
Section 3 is devoted to the description of a Markov chain that indicates the dynamics of the examined system. This section presents and elucidates the explicit form of the generator in detail. The affiliation of this chain with the category of asymptotically quasi-Toeplitz Markov chains is confirmed and the ergodicity of this chain for any configuration of the system parameters is demonstrated. The issue of calculating the stationary distribution of the chain is briefly addressed.
Section 4 presents formulas for calculating the key performance characteristics of the system.
Section 5 delineates the outcomes of two numerical experiments. In the first experiment, the system with the
, the fundamental process of which has two states and, therefore, control by retrials admission is defined by two thresholds, is considered, and the dependence of several performance characteristics of the system on these thresholds is numerically highlighted. An example of solving an optimization problem is presented. In the second experiment, the following question is answered: whether or not the generalization of the results from [
25] to the case of batch arrivals is essential. The answer is that the batch arrivals do matter.
Section 6 summarizes this paper and lists possible generalizations of the considered model.
2. Mathematical Model Description
We examine a retrial multi-server queuing system, the configuration of which is illustrated in
Figure 1.
The flow of primary customers enters the system. This arrival process is characterized by the fundamental process which is an irreducible continuous-time Markov chain with a finite state space and the matrices which represent the jump intensities of the process associated with the arrival of a batch of k customers. Here, K is the maximum feasible batch size. The notation indicates that the parameter k can take on values from the set . The matrix denotes the generator of the process
The average arrival rate (fundamental rate) of the is determined by the equation , where represents the invariant vector of the stationary distribution of the process The vector is determined as the single solution to the system . Here and below, denotes a column vector comprised of ones, while represents a zero row vector. If the dimension of the vectors and is not clear from the context, it may be indicated by the subscript, e.g., means a zero vector of size
Formulas for calculating the coefficients of variation and correlation of inter-arrival times are available in sources such as [
12,
13,
14,
16].
The system does not have a buffer and has N identical independent servers that provide service to customers. The service time for any customer by any server has a phase-type distribution characterized by the irreducible representation The distribution is defined by the Markov chain which has a state space comprising transient states and an absorbing state The duration exhibiting this distribution can be regarded as the walking time of a particle within a network including M nodes. The stochastic row vector of dimension M defines the starting position of the particle in the network at the walking beginning instant. The square matrix S of dimension M is the subgenerator that delineates the transition rates of the chain within the set . The transition to the absorbing state signifies the conclusion of the walking process. The transition rates to the absorbing states are represented by the column vector The average service time of a customer is defined as
Customer processing is described as follows. If a group of customers arrives and discovers the necessary number of idle servers, it begins processing on these servers. If the quantity of idle servers is fewer than the size of the incoming batch, then this batch is rejected and leaves the system permanently with probability p or, with the complementary probability, part of the batch occupies all idle servers, if any, while the remainder of the batch joins the orbit. The orbit is a virtual place from which customers individually and independently of each other make repeated attempts (retrials) to enter the service. The orbit capacity is assumed to be infinite. Inter-retrial times have an exponential distribution with the parameter When i customers stay in the orbit, the total retrial rate is
In the standard retrial queues, if a retrial is successful, the customer leaves the orbit and starts service if there is at least one idle server. We make the following assumption about a more strict restriction on customers’ access from the orbit.
We fix the set of integer parameters (thresholds) assuming that , . If, during the retrial epoch, the status of the arrival process is and the number of occupied servers does not exceed the threshold then this customer starts service on one of the idle servers. Otherwise, the customer remains in the orbit awaiting service.
Customers in the orbit may exhibit impatience and depart from the orbit and system without service independently after an exponentially distributed time characterized by the parameter
The further analysis aims to find the steady-state distribution of the considered system under any fixed set of the thresholds and examine how these thresholds affect the system’s performance characteristics.
3. The Process of System States and Its Analysis
The system’s behavior can be characterized by the regular irreducible continuous-time Markov chain
where, at time instant
t,
denotes the quantity of orbital customers, ;
denotes the state of the fundamental process of the , ;
denotes the number of occupied servers, ;
denotes the quantity of servers in the l-th phase of service,
It is important to acknowledge that the assumption about the
distribution of service time rather than its specific instance, the exponential distribution, which is assumed in [
25], essentially complicates the form of the Markov chain that should be analyzed. In [
25], three components,
completely characterize the dynamics of the queueing system. In the case of the
distribution of the service time, this three-dimensional process is not Markovian. To receive the Markovian process, it is necessary to supplement these components with the auxiliary process with components describing the state of the service process in busy servers.
Two popular ways for receiving the Markov process are as follows. The initial method is referred to as
(track phase for server); see [
26], which presumes the continuous re-enumeration of occupied servers according to their use and the monitoring of the service phase for each occupied server.
Consequently, when the number of customers receiving service equals n, the auxiliary process must consist of the components where represents the current phase of service for the rth active server, with for The cardinality of the state space of the process when the number of customers receiving service equals n is equal to .
The second method, referred to as
(count server for phase) in the formulation of the service process, entails monitoring the quantity of servers delivering service throughout various stages. Building upon the methodology established in [
27], we ascertain that when the quantity of customers receiving service is
n, the auxiliary process is delineated as
where the component
represents the number of servers in the
kth service phase, with
The cardinality
of the state space of the process
when the number of customers receiving service equals
n is given by
By default, we suppose that
As follows from the definition of the process
given above, we chose the second way to construct the Markov process. This choice can be explained by the fact that although the second way is characterized by the less transparent formulas for transition intensities, the cardinality of the state space of the process
can be drastically smaller, especially in the case of small values of
M, than the cardinality of the state space of the process
; see, e.g., [
26]. For example, if
and
then the cardinality of the state space of the process
is equal to
The cardinality of the state space of the process
is
The cardinality of the state space of the process describing the service of customers is critically significant during the computer implementation phase of the devised technique for calculating the stationary distribution of the Markov chain, particularly when dealing with a substantial number of servers This motivated our choice of the approach in this study.
We list the states of the Markov chain in direct lexicographic order based on the components and in reverse lexicographic order based on the components . We designate all states of the chain that possess the value i of the component as level i of the Markov chain .
We shall denote the infinitesimal generator of this chain as Q. The matrix Q encompasses the intensities of all potential transitions of the examined chain throughout an infinitesimally short period.
To formulate the expression for the generator we require the subsequent auxiliary result that delineates the behavior of the M-dimensional vector random process given a fixed value n of the component of the Markov chain
Lemma 1. Let the number of customers in service be denoted as n, where n ranges from 1 to N. The matrix delineates the transition intensities of the process upon the completion of service at one of the servers. The square matrix of size specifies the transition intensities of the process when service at one of the servers shifts to another phase. The matrix characterizes the transition probabilities of the process at the instant a new customer arrives and begins service. The diagonal entries of the diagonal matrix specify the exit rates of the process from the respective state.
The matrices and are calculated using the procedures for matrices of the same designation outlined in [28], pages 106938–106939. For use in the subsequent sections, the following denotations are used:
means the diagonal or block-diagonal matrix of an appropriate dimension with the diagonal entries or blocks listed in the brackets;
is the diagonal matrix of size
d defined as
O is the zero matrix of the proper dimension. If the dimension is not clear from the context, it may be indicated by the subscript;
is the matrix defined as
is a block upper-Hessenberg matrix of size
with non-zero blocks:
is a block matrix of size
with the non-zero blocks
is a block matrix of size with the non-zero blocks
The following statement is true.
Theorem 1. The infinitesimal generator Q of the Markov chain has an upper-Hessenberg structure: The non-zero blocks are the square matrices of dimension which are defined as follows:
where where where the blocks have the form
Proof. The theorem is established by examining the intensities of all potential transitions of the Markov chain during an infinitesimal time interval. Since during such a period, customers may enter the orbit in groups of up to K customers and leave it one at a time, the matrices are zero matrices for all such that
The matrices consist of the blocks
The diagonal elements of the diagonal blocks are negative and their absolute values determine the exit rates of the Markov chain from its states. The Markov chain may transition from its current state under the four following circumstances:
- (a)
The fundamental process of arrivals transitions from the present state , with transition rates determined by the absolute values of the diagonal entries of .
Note that the exit of the Markov chain from its state does not occur when the fundamental process makes a transition from the state to the state with the generation of a batch of customers, but this batch is lost due to a lack of a sufficient number of idle servers. This explains the presence of matrices which define such intensities, in the expression for the summand in the formula for the block .
- (b)
A customer departs the orbit due to impatience (the transition rates are represented by the diagonal elements of the matrix ).
- (c)
A customer departs the orbit due to a successful retry attempt. The transition rates are represented by the diagonal elements of the matrix . Note that the exit of the Markov chain from its state does not happen if a customer attempts to access service, provided that the number of occupied servers exceeds the threshold . In this case, the attempt is not successful, and a customer returns to orbit. Consequently, this explains the presence of the supplementary term , which defines the intensities of unsuccessful attempts in the diagonal blocks of the generator.
- (d)
the fundamental process of service departures from its present state. The transition rates in this case are specified by the matrix . This explains the presence of the diagonal matrices in summand
The non-diagonal elements of the matrices dictate the transition rates of the Markov chain while preserving the values of the components i and . The transitions are characterized by the non-diagonal elements of the matrix The non-diagonal elements of the matrix represent the intensities of the following transitions:
- (a)
The fundamental process of service makes a transition without completing the service in one of the occupied servers (transition rates are specified by the elements of the matrix ).
- (b)
The fundamental process transitions from state to state while generating a batch of customers, with the batch size being less than or equal to the number of idle servers. In this case, all customers from the batch start service. The relevant intensities are included in the blocks , which depend on the number of busy servers and the size of the incoming batch.
- (c)
The service is finished in one of the busy servers (transition rates are defined by the elements of the matrix , where is a block sub-diagonal matrix).
The matrices , where define the transition rates of the Markov chain that change the state of the arrival process but maintain the number of customers in the orbit. These matrices represent the summation of two matrices and The matrix delineates the rates of the fundamental process transition from state to state with the following:
- (i)
The generation of a batch, the size k of which does not exceed the number of idle servers. All arriving customers begin service. The transition probabilities of the process at this time are represented by the matrix .
- (ii)
The generation of a batch, the size of which exceeds the number of idle servers. This batch abandons the system.
The matrix defines the fundamental process transition rates from the state to the state without batch generation.
Therefore, we obtain the view of the blocks
The Markov chain can transition from level i to level only if (a) one of the i customers from the orbit is lost due to impatience (the number of busy servers remains unchanged) or (b) one of the i customers in the orbit successfully enters service (the number of busy servers increases by one). The matrices have only diagonal blocks because the fundamental process of the cannot make a transition during the moment of retrials or a customer departure due to impatience. Consequently, in scenario (a), the intensities are represented by the matrix , whereas, in scenario (b), the relevant intensities correspond to the elements of the matrix
Transitions of the Markov chain from level i to level occur exclusively due to the transitions of the fundamental process of the between states and under the following conditions: (a) the number of occupied servers equals N and the incoming batch of k customers remains in the system, joining the orbit, or (b) the number of available servers is insufficient for the incoming batch, resulting in part of the batch occupying the idle servers while the remaining k customers proceed to the orbit. The matrix delineates the transition intensities for case (a), while the matrix encompasses the transition intensities for case (b). Taking these reasonings into account, we obtain the form of the blocks
We define the view of all blocks of the generator; thus, the theorem is proven. □
Given that the blocks
and
of the generator are explicitly contingent on the level
i, the Markov chain
clearly does not fall within the category of the
-type Markov chain; see [
29]. This makes the analysis of this chain difficult.
It can be confirmed that the following matrices
exist:
where
is the so-called Kronecker delta:
if
and
otherwise, and the matrix
is defined by the formula
, where ∘ is the Hadamard product of the matrices symbol; see, e.g., [
30]. The matrix
is the stochastic one. This indicates that the Markov chain
is classified as an Asymptotically Quasi-Toeplitz Markov Chain (AQTMC), as introduced in [
31] and further discussed in [
16]. In [
31], sufficient requirements for the ergodicity of such chains and a numerically stable algorithm for calculating their stationary distribution are presented.
The direct application of the results of [
31] requires the derivation of the explicit form of the matrices
However, it is possible to avoid this work because we assume that customers residing in the orbit are impatient (
). Applying the result from [
32], we conclude that the following assertion is true.
Lemma 2. Due to the impatience of customers staying in the orbit (), the Markov chain is ergodic for every set of system parameters.
Therefore, there exist the stationary probabilities of these Markov chain states:
Utilizing the aforementioned enumeration of the states of the Markov chain we indicate by the row vectors representing the stationary probability of the states of the Markov chain corresponding to level
The vectors can be partitioned as , where
The vectors
are established to fulfill the system of linear algebraic equations:
where
Q is the generator of the
.
Due to the upper-Hessenberg structure of the matrix
it is possible to obtain from this system an expression for the unknown vectors
in the form
where the matrices
are computed recursively. However, the recursion for the matrices
lacks numerical stability, and it remains ambiguous how to determine the unknown vector
The problem of calculating this vector is quite difficult even in the case when the upper-Hessenberg structure of the matrix
Q is additionally quasi-Toeplitz, i.e., the value of the blocks
depends only on the difference between
i and
but not on
i and
j individually. In this case, certain additional probabilistic considerations can be used; see [
29]. Alternatively, the equation for the vector
is derived using the reasonings of the analyticity of the vector-generating function of the vectors
in the unit disk of the complex plane; see, e.g., [
33,
34,
35]. In the case of the absence of the quasi-Toeplitz property of the generator, the technique exploiting the analyticity does not work. Therefore, in [
31], system (1) for vectors
is replaced with another system that is derived via the construction of the infinite sequence of so-called censoring Markov chains; see, e.g., [
36] with different censoring levels. The corresponding algorithm for solving this alternative system, the solution of which coincides with the solution of system (1), is presented in [
31]. Another algorithm that does not need analytical derivations to obtain the explicit form of the matrices
is described in [
37]. This algorithm shows a higher convergence rate compared to [
31] in the technique for calculating the matrices that delineate transition probabilities of the finite components of the chain
during the initial passage time to the level
i starting from the level
4. System Performance Characteristics
Having calculated the vectors of stationary probabilities of the system states, we can determine the values of various probabilistic characteristics of the system.
The mean quantity
of customers in the system (including staying in the orbit and receiving service in servers) is
The average quantity
of customers present at a given moment in the orbit is
The mean quantity
of busy servers is computed as
The presented formulas for and are evidently derived as the expectations of the corresponding discrete random variables.
The probability that all servers are idle at an arbitrary moment is
The probability that the orbit is empty at an arbitrary moment is
The probability that the system is empty at an arbitrary moment is
The average intensity
of the output flow of serviced customers is
This formula evidently follows from the formula of total probability, taking into account that the output rate of serviced customers is equal to when the number of customers in the orbit is equal to the state of the fundamental process of arrivals is equal to and the number of busy servers is equal to
The probability
that an arbitrary arriving customer joins the orbit is
This probability is calculated as the ratio of the rate of arrival of customers who join the orbit to the average arrival rate The former rate is calculated as follows. Let the number of customers in the orbit be i and the number of busy servers be The instantaneous rate of arrival of customers in batches of size k is equal to when the fundamental process of arrivals makes a jump from the state to the state An arriving customer has a chance to join the orbit if the size k of the batch exceeds the number of idle servers. For an arbitrary customer arriving in a batch of size k, it is natural to assume that the customers are enumerated in a random manner, the customer receives the lth position in the batch with the probability for all l from 1 to k, and only the suitable number of customers from the head of a batch can be admitted to the orbit (with the probability ). As the result of these considerations, it is easy to see that the rate of arrival of customers who join the orbit is calculated as Thus, we obtain the expression under the proof for computation of the probability .
The proof of several subsequent formulas is similar.
The probability
that an arbitrary customer is lost upon arrival due to all servers being busy is computed as
The probability
that an arbitrary customer is lost upon arrival due to an insufficient number of idle servers, conditional on the fundamental process
of the
being in the state
, is computed as
The probability
that an arbitrary customer immediately starts service upon arrival is
The probability
that an arbitrary customer is lost from the orbit due to impatience is
This formula is clear because the probability of an arbitrary customer loss from the orbit due to impatience is the ratio of the rate of the lost due to impatience customers to the customer arrival rate.
The probability
that an arbitrary customer is lost from the orbit due to impatience when the fundamental process
of the
is in the state
is computed as
The loss probability
of an arbitrary customer is computed as
This formula, which delineates two distinct methods for calculating , is crucial in ensuring the accuracy of the computations for the generator Q and the stationary distribution of the system under examination.
5. Numerical Examples
In this section, we present the results of two numerical experiments.
Experiment 1. The purpose of Experiment 1 is to numerically highlight the dependence of the main performance measures of the system on the parameters
of the control strategy and show the possibility of obtaining the optimal values of these thresholds. To this end, let us assume that customers enter the system in the
defined by the matrices
In this arrival flow, the customers arrive in batches of the maximal size , with the average arrival rate of groups being The mean customer arrival rate in the is , the correlation coefficient of two consecutive periods between batch arrivals is , and the coefficient of variation is . The vector of the stationary distribution of the fundamental process of the is . The mean customer arrival rate at state k () of the fundamental process of the is the k-th element of the vector . The conditional mean arrival rates given states 1 and 2 of the fundamental process of the are equal to 14.23171 and 5.73185, correspondingly.
We assume that the service time distribution in each server is of phase type and is defined by the vector and the matrix The average service time of a customer is The squared coefficient of variation of service time is 4.
We assume that the number of servers is the retrial intensity is the intensity of impatience is and the probability of loss of a batch in the case of lack of enough idle servers to accommodate all arriving customers is
To construct the 3-D representations depicting the dynamics of several performance metrics of the system, we will adjust the thresholds and throughout the range in increments of 1.
Figure 2 and
Figure 3 illustrate the dependencies of the mean quantity
of customers in the orbit, the mean quantity
of busy servers, and loss probabilities
and
due to impatience on the thresholds
and
.
Figure 4 shows the dependencies of the probabilities
and
that an arbitrary customer is lost upon arrival due to the lack of available servers on the thresholds
and
.
Figure 5 and
Figure 6 illustrate the relationship between the loss probabilities
,
, and
and the varying values of
and
.
As seen from
Figure 2, the mean quantity of customers in orbit
takes its maximum value of 52.875 when
. In this case, the customers from orbit have a chance to start service only if, during the retrial epoch, all servers are free. Also,
decreases with the increase in the parameters
and
The minimal value of
is 4.838 when
i.e., the admission control for retrial customers is not applied.
The mean quantity of busy servers
sharply increases with growth in the parameters
and
when
and
are small, e.g., less than 6. This can be explained by the fact that with such growth, the mean quantity of customers in orbit
decreases, which causes the decrease in the loss probabilities of a customer due to impatience
, and
; see
Figure 3 and
Figure 5. The fewer customers are lost, the more customers receive service and the more servers are busy. With further growth in
and
, the value of
grows less essential and then starts to slowly decrease. This is caused by the fact that the growth in
and
causes the increase in the loss probabilities of customers upon arrival
, and
; see
Figure 4 and
Figure 5.
In contrast to the probabilities , and of customer loss due to impatience, which decrease with a decrease in and , the loss probabilities , and behave not so predictably. For example, one can anticipate that the loss probability takes its maximal value in the case when retrial customers can enter service without any restriction, but this is not true.
In this example, the maximal value of is 0.0569 for and while if . This can be explained as follows. Since the average arrival rate is essentially less under state 2 than under state 1, the load of the system is also less under state 2. If we fix , then when the arrival flow is in state 2, the customers from the orbit can start service only if all servers are idle. Thus, a lot of customers accumulate in the orbit, and when the fundamental process of transits to state 1, a lot of customers from orbit start to compete with primary customers. If no admission control is applied (), a lot of primary customers are lost. This also explains the non-monotonic behavior of some other loss probabilities of a customer upon arrival.
For example, if we fix , for , for , and for . When the admission control under state 2 of the is not applied (), many arriving primary customers are lost due to a lack of available servers. But, if the admission control is the strictest (), during state 2, retrial customers have a very low chance of entering service (while the arrival rate of primary customers is relatively low and, therefore, a scarcity of servers is possible), and a lot of customers wait in the orbit. After the return of the fundamental process of the to state 1, customers from the orbit occupy all servers quickly, and it is very likely that a lot of quickly arriving primary customers are rejected.
The total loss probability for a customer is the sum of the probabilities and . In this experiment, the probability is more sensitive to the parameters and , resulting in exhibiting behavior akin to that of . However, in contrast to , the probability behaves non-monotonically. It takes its minimal value of 0.05752 for and The fact that the optimal value of is less than the optimal value of i.e., the restriction of retrying customer access is more strict under state 1 of the fundamental process, is intuitively clear. Since the arrival rate of primary customers during state 1 is essentially higher, it makes sense to restrict the access of retrying customers more strongly.
Thus, if the purpose of the system optimization is to minimize the loss probability of customers, the values and should be considered as the optimal ones in this example. However, in real-world systems, cost criteria may be more complicated.
We will suppose that the efficacy of the system’s operation is assessed using the subsequent cost criterion:
where
a is the profit gained by the system for one customer service,
represents the cost associated with the loss of one customer at the system’s entrance, and
is the charge for one customer loss due to impatience.
The cost criterion delineates the average profit of the system per unit of time. The goal is to find the values of and that provide the maximum value to this criterion.
The following cost coefficients are set up:
Figure 7 illustrates the relationship between the cost criterion
J and the parameters
and
.
The optimal value of the cost criterion is , attained at and .
Experiment 2. This experiment aims to demonstrate the significance of considering the potential of customer arrivals in groups. Consequently, rather than the aforementioned
, we examine the
arrival flow with equivalent average intensity, coefficients of variation, and correlation. This
is characterized by the subsequent matrices:
The average customer arrival rate is equal to 9. The vectors and are the same as the corresponding vectors characterizing the considered in the first experiment.
Other parameters of the queueing system are assumed to be the same as in the previous experiment.
The dependencies of the loss probabilities
,
, and
and the cost criterion
J on the different values of
and
are shown in
Figure 8 and
Figure 9.
One can see from
Figure 8 and
Figure 9 that the consideration of
instead of
with an identical average arrival rate leads to overoptimistic results. If the batch arrivals are not taken into account, the loss probability of an arriving customer
will be more than two times less than its real value. The minimal value of
is 0.04807 in the case of
, while, in the case of
, this probability does not exceed 0.02 for any values of
and
The probabilities
and
are also essentially less in the case of
The optimal value of the cost criterion in the case of arrival flow is and is achieved for and , about 40 percent higher than in the case of .
It is worth noting that in Experiment 1, we considered the
with an average batch size of 1.7333. Thus, most of the batches were of size 1. It is evident that if someone considers a
with a higher average batch size, the difference between the characteristics of queues with
and
arrival models will be even more essential. The conclusions from Experiment 2 prove the importance of the extension of the research, the results of which are presented in [
25].