Next Article in Journal
New Conditions for Testing the Asymptotic Behavior of Solutions of Odd-Order Neutral Differential Equations with Multiple Delays
Previous Article in Journal
Forecasting Water Consumption in the Yangtze River Delta Based on Deformable Cumulative Multivariable Grey Model
Previous Article in Special Issue
Queueing System with Two Phases of Service and Service Rate Degradation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Optimal Input Rate in Queues with Batch Service

by
Michele Pagano
1,*,†,
Igor Tananko
2,*,† and
Elena Stankevich
2,*,†
1
Department of Information Engineering, University of Pisa, Via G. Caruso 16, 56122 Pisa, Italy
2
Department of System Analysis and Automatic Control, Faculty of Computer Science and Information Technologies, Saratov State University, 83 Astrakhanskaya St., 410012 Saratov, Russia
*
Authors to whom correspondence should be addressed.
These authors contributed equally to this work.
Axioms 2023, 12(7), 656; https://doi.org/10.3390/axioms12070656
Submission received: 22 May 2023 / Revised: 24 June 2023 / Accepted: 29 June 2023 / Published: 1 July 2023
(This article belongs to the Special Issue Queueing Theory and Network Applications)

Abstract

:
In recent years, queuing systems with batch service are emerging as powerful and flexible mathematical models in different frameworks. In this paper, we consider a single server queuing system with Poissonian arrivals, infinite buffers, and a constant batch size b. This paper addresses a little-studied optimization problem, namely the existence of an optimal arrival rate that minimizes the average sojourn time. Unlike the classical M / M / 1 queue, for any batch size b, the problem admits a non-trivial solution that can be found by solving a polynomial equation of degree b + 1 . Since, in general, only numerical solutions are available, a simple first-order approximation is also derived and the corresponding deviations (in terms of input rate and sojourn time) are calculated. In more detail, it is shown that the approximation improves as the batch size increases and, in any case, the relative error for the average sojourn time is less than 0.34%. Finally, the paper provides new theoretical results about the asymptotic service rate in the equivalent birth–death process, highlighting how it depends on all queue parameters.

1. Introduction

Queues and queuing networks are often used as powerful mathematical models of many real systems, in which a given service is shared by multiple customers. Starting from the pioneering works by Erlang [1], queuing theory has emerged as a powerful analysis tool in a wide range of cargo and passenger transport systems (including river ports, seaports, airports, and multimodal transport hubs) [2], in healthcare [3], in IoT–Fog–Cloud cooperation systems [4], and, in general, in telecommunication systems with Quality of Service guarantees [5], just to mention a few recent works.
Traditionally, it has been assumed that the service discipline is work conserving, i.e., a server (or, more in general, servers) cannot be free if the waiting line is not empty. Although such an assumption seems to be quite natural, there is currently a growing interest in developing new analysis methods for queuing systems that violate this, namely queues with batch service (also known in the literature as bulk service queues), since they represent suitable models in different real-life scenarios, ranging from production systems [6,7] to cloud computing and computer systems in general [8,9]. The widespread practical use of batch services makes the study of this kind of queue relevant in the 21st century [10,11].
The first analysis of queuing systems with batch service was published in [12], where the equilibrium distribution of the queue length was obtained using the embedded Markov chain method for a queuing system with one server that serves a batch of customers of a fixed maximum size at intervals with a given distribution. Then, in [13], the Laplace transform of the waiting time distribution was derived from the probability generating function of the queue length distribution. Explicit formulas for the mean and variance of the waiting time were shown and higher moments, if necessary, could be easily found. Later, the study of systems with a batch service was extended by Neuts, Bagchi, and Serfozo [14,15,16], considering different strategies of service, as highlighted below. Moreover, the fundamental results for batch service systems can be found in [17,18,19].
In more detail, several types of batch service have been considered in the literature. The general case, introduced in [14], assumes that the batch size has lower and upper thresholds, denoted in the following as a and b, respectively. This means that three situations are possible:
  • If the number of customers in the system is less than a, then the server is idle;
  • If the number of customers in the waiting line is between the two thresholds a and b, then they are all served as soon as the previous batch leaves the queue;
  • Otherwise, a group of b customers is selected for service.
Such systems are known in the literature as general bulk service (GBS) queues [20,21], and for a = 1 , they include the original model proposed in [12,13].
As another special case of GBS, the customers might be serviced in batches of a given fixed size [10,11,22], i.e., a = b , with the underlying assumption that the server remains idle when the number of customers in the queue is less then the batch size.
It is worth mentioning that batching based only on the number of customers in the queue has several drawbacks, for instance, when clients are perishable, i.e., the customer may rot before the service is completed. Recently, to overcome this issue, [23] introducd the flexible general bulk service (FGBS) rule, which modifies the GBS rule by taking into account the average waiting time. In FGBS, the server starts bulk service when:
  • There are at least a customers in the queue;
  • There are less than a customers in the queue but their measured average waiting time is greater than or equal to the waiting time tolerance T.
In other words, the server becomes idle only if at the service completion epochs, the number of customers is less than a and the average waiting time of a customer in the queue is less than T.
Another interesting variant of the GBS discipline has been proposed in [24], where the service process comprises of k stages (with distinct input queues) and customers can join the service batch (provided its size is less than b) at any stage depending on their service requirements. Moreover, the service rate at each stage depends on the stage as well as on the current batch size m (with a m b ). Finally, customers cannot leave the system after completing service in an intermediate stage.
In general, systems with batch service are very difficult to analyze, but, as in other queuing models, the assumptions that arrivals are Poissonian and service times for batches of customers have an exponential distribution permit to obtain some closed-form results. For example, in [25], it was proposed to use the supplementary variable technique and the theory of difference equations in the analysis of a single server queue with batch arrivals and services. In more detail, the authors obtained explicit expressions of the steady-state system–content distribution at pre-arrival and arbitrary epochs in terms of the roots of the associated characteristic equation. The authors of [10] obtained the closed-form steady-state distribution for a class of queues that serve batches of customers with different configurations of service rates that may depend on the load of the server, with special attention to the conditions that must be satisfied to make the system quasi-reversible.
In [11], using the probability generating function method, the authors derived the closed-form expression for a steady-state queue length distribution, the expected queue length, and the expected waiting time of a customer. In [26], a multiplicative form of the stationary distribution for the M / M b / 1 queue (i.e., with a fixed batch size b) was obtained on the basis of an equivalent birth–death process with (adequately chosen) state-dependent service rates, which can be derived by solving a polynomial equation of degree b + 1 .
Among all the works on queuing systems with batch services, only a few of them deal with various optimization problems. For example, in [27], the optimal number of servers that minimizes the total expected cost per unit time was found for both infinite and finite capacity systems. In [16], the authors analyzed the problem of finding policies which minimize the expected discounted cost, or the expected cost per unit time, over an infinite time horizon. The strategic customer behavior is considered in [28]; the customers decide whether or not to join the queue taking into account a linear reward–cost structure.
In this paper, we extend our previous work [29] on the M / M b / 1 queue, focusing on the derivation of the optimal input rate that minimizes the average sojourn time for fixed values of the other system parameters (namely, batch service rate μ and batch size b).
To the best of our knowledge, the existence of an optimal input rate has never been proven in the literature; indeed, as highlighted above, optimization techniques have been applied to other characteristics. Moreover, in our previous work [29], the batch size was considered as an optimization parameter, while this paper completes the investigation of the batch service mechanism, focusing the dependence of the average waiting time on the arrival and service rates. In more detail, the optimal input rate is determined as a unique solution (under the system stability condition) of a polynomial equation of degree b + 1 , which, in general, can be solved only numerically. Moreover, the first order approximation is provided and the relative errors for the optimal input rate and the corresponding sojourn time are investigated for different values of b.
Finally, as a side contribution, the equivalent birth–death process is revised, focusing on the mathematical properties of the characteristic equation and deriving auxiliary expressions, used in the following.
The rest of the paper is organized as follows. Section 2 deals with the analysis method based on the equivalent birth–death process we proposed in [26], which permitted us to find an explicit formula for the sojourn time. Then, the main contributions are reported in Section 3, which includes the general theoretical results on the existence of an optimal arrival rate λ opt as well as the first order approximation λ appr and the corresponding values of the minimum sojourn time as a function of the batch size. Finally, Section 4 discusses the goodness of the proposed approximation and highlights future research directions.

2. Materials and Methods

The aim of this section is twofold. On one hand, it briefly recalls the main results for the M / M b / 1 queue (see Section 2.1), focusing on the analysis method proposed in [26] and on the general expression of the sojourn time (more details can be found in [29] and the references therein). On the other hand, in Section 2.2, the characteristic equation is deeply investigated, highlighting the dependence of the asymptotic equivalent death rate M on the queue parameters; apart from the theoretical relevance of these results, some of the expressions will be used in Section 3 to derive the main contribution of the paper. Finally, Section 2.3 comes back to the equivalent birth–death process, highlighting that only the chosen value of M corresponds to a well-defined Markov chain.

2.1. The M / M b / 1 Queue

Let us consider a single server queue with infinite buffers and assume that the input process is Poissonian with rate λ . Customers are served in batches of fixed size b ( b 2 ) and the service time is exponentially distributed with parameter μ . Such a system is not work conserving since, after a batch departure, the server stays idle until b customers accumulate in the waiting line.
The existence of a non degenerate steady-state behavior requires that the utilization factor satisfies the stability condition, i.e.,
ρ = λ b μ < 1 .
Product form state probabilities of the underlying Markov chain can be derived, as originally proposed by the authors (see [26] for details), by introducing an equivalent birth–death process, with constant birth rate λ and state-dependent death rates μ ˜ ( n ) , determined by the following system of equations:
μ ˜ ( n ) = λ μ λ b μ ˜ ( n + 1 ) × × μ ˜ ( b + n ) , 1 n b 1 , μ ˜ ( n ) = λ + μ μ λ b μ ˜ ( n + 1 ) × × μ ˜ ( b + n ) , n b .
Let M = lim n μ ˜ ( n ) ; if the limit exists, for sufficiently high values of n we can assume that μ ˜ ( n ) = M and the second expression in (2), valid for n b , reduces to
M b + 1 ( λ + μ ) M b + λ b μ = 0 ,
and the existence of the equivalent birth–death process depends upon the existence of a solution to the previous equation, henceforth referred to as the characteristic equation of the system, satisfying stability condition (1).
Although (3) provides an implicit relation between the asymptotic service rate M and all the queuing characteristics ( λ , μ and b), i.e.,
f ( M , λ , μ , b ) = M b + 1 ( λ + μ ) M b + λ b μ = 0 ,
in the following, for the sake of clarity, we will highlight just the dependence on the relevant variables and omit the others, taking constant values in the considered settings (see, for instance, the calculation of the total derivatives of M with respect to λ , μ , and b in Section 2.2 as well as the derivation of the optimal arrival rate in Section 3).
As it is shown in [26], for any consistent set of queue parameters, the characteristic equation has only one root satisfying the stability condition, which will be denoted by M in the following (in analogy with the notation already used in [29]). Some interesting mathematical properties of M will be further discussed in Section 2.2, where new material is presented, while this subsection just summarizes previous results and can be skipped by readers familiar with [29].
Taking into account that M is a solution of (3), from the second set of equations in (2), it follows that
μ ˜ ( b ) = μ ˜ ( b + 1 ) = μ ˜ ( b + 2 ) = = M ,
and the equivalent service rates for the remaining states can be easily derived recursively (as also shown in Section 2.3). Moreover, the state probabilities can be written as (see [29] for details)
π ( n ) = π ( 0 ) i = 0 n x i , 1 n b 1 , π ( n ) = π ( 0 ) x n b λ μ , n b ,
where x = λ M and the probability of an empty system π ( 0 ) is
π ( 0 ) = 1 x b .
Then, taking into account Little’s law [19] and the definition of the average number of customers in the system, the average sojourn time is given by
u ¯ = 1 λ n = 0 n π ( n ) ,
which, after non trivial manipulations (see [29] for details), leads to the following compact expression:
u ¯ = b 1 2 λ + 1 M λ .

2.2. A Deeper Insight into the Roots of the Characteristic Equation

Let us consider the characteristic Equation (3). As shown in [26], f ( M ) = 0 has two positive roots, the largest of which belongs to the open interval
( M , M ) = b ( λ + μ ) b + 1 , ( λ + μ ) b + 1 λ b μ ( λ + μ ) b
and satisfies the stability condition. Note that
M = b ( λ + μ ) b + 1
is the only positive root of f ( M ) = 0 and ρ < 1 (i.e., λ < b μ ) implies that M > M > λ .
Let us consider how this root depends on λ . Indeed, for fixed values of μ and b, Equation (3) defines an implicit function of M and λ
f ( M , λ ) = 0
and, taking into account the differentiation rules for implicit functions (see, for instance, [30]), we get:
d M d λ = f λ ( M , λ ) f M ( M , λ ) ,
where
f λ ( M , λ ) = f ( M , λ ) λ = b λ b 1 μ M b
and
f M ( M , λ ) = f ( M , λ ) M = ( b + 1 ) M b b ( λ + μ ) M b 1 .
As a direct implication of the inequality M > M in (5), it is easy to see that f M ( M , λ ) > 0 when M is the root of the characteristic equation satisfying the stability condition. Moreover, taking into account (3), f M ( M , λ ) can be rewritten in a more compact form, i.e.,
f M ( M , λ ) = M b + b M b ( λ + μ ) M b 1 = M b b λ b μ M ,
which will be used in the next section.
Taking advantage of the same equality (3) that can be rewritten as
M b M ( λ + μ ) = λ b μ ,
the expression of f λ ( M , λ ) becomes
f λ ( M , λ ) = b λ b 1 μ M b = λ b 1 μ M ( λ + μ ) λ + b ( M ( λ + μ ) ) ,
and it is easy to see that for the value of M of interest
f λ ( M , λ ) < 0 M > M 0 = λ + μ λ / b .
To prove the last inequality, let us consider the function
f ( x ) = x b + 1 ( λ + μ ) x b + λ b μ ,
which is an increasing function for x > M , since M is the only positive root of f ( x ) = 0 , as stated above. Moreover, by definition f ( M ) = 0 ; hence, it is enough to show that f ( M 0 ) < 0 , where the latter inequality can be verified by direct calculation, taking into account that λ < b μ and truncating the binomial expansion
λ + ( μ λ / b ) b > λ b + b λ b 1 ( μ λ / b ) = b μ λ b 1 .
Indeed, for x = M 0 = λ + μ λ / b , we have:
f ( M 0 ) = ( λ + μ λ / b ) b λ + μ λ / b ( λ + μ ) + λ b μ = λ b λ + ( μ λ / b ) b + λ b μ < λ b b μ λ b 1 + λ b μ = 0 .
As a side result, we have found another lower bound M 0 for the largest root of (3), with M 0 = M + Δ , where
Δ = M 0 M = 1 b + 1 ( λ + μ ) λ b = 1 b ( b + 1 ) ( b μ λ ) > 0 .
In other words, we improved the relation (5), with a tighter lower bound
( M 0 , M ) = μ + λ b 1 b , ( λ + μ ) b + 1 λ b μ ( λ + μ ) b ( M , M ) .
Finally, if M is the largest positive root of the characteristic equation (and hence M > M 0 > M ), it is easy to see from Equation (6) that
d M d λ = f λ ( M , λ ) f M ( M , λ ) > 0
and so M is an increasing function of λ .
In a similar way, it is easy to prove that M is also an increasing function of μ and b, considering the batch size as a continuous parameter in order to take the derivative (note that the latter property has already been proven in [29] considering discrete values of b). Indeed, it is straightforward to verify that the partial derivatives with respect to M are still given by (8),
f μ ( M , μ ) = f ( M , μ ) μ = λ b M b < 0
and
f b ( M , b ) = f ( M , b ) b = M b + 1 ( λ + μ ) M b log M + μ λ b log λ ,
which can be rewritten as (again taking advantage of the characteristic Equation (3))
f b ( M , b ) = λ b μ ( log λ log M ) < 0 .
Hence, the corresponding total derivatives
d M d μ = f μ ( M , μ ) f M ( M , μ ) and d M d b = f b ( M , b ) f M ( M , b )
are positive and the root M (i.e., the asymptotic equivalent service rate) is a monotone increasing function of all the queue parameters ( λ , μ , and b).
Finally, it is worth mentioning another property of the roots of the characteristic equation: for a fixed value of the ratio η = λ / μ , the roots are linear in μ . Indeed, dividing all terms by μ b + 1 , the characteristic equation becomes
ξ b + 1 ( 1 + η ) ξ b + η b = 0 ,
where ξ = M / μ . The change in variable highlights that the roots of the previous equation depend only on the ratio η between λ and μ , so for a fixed value of η (and a constant utilization ρ if the batch size is also fixed), M = ξ μ is linear in μ (and in λ , since η is assumed to be a constant).

2.3. Analysis of the Equivalent Birth–Death Process

Let us consider the death rates of the equivalent birth–death process and verify that all the rates are positive. Since
μ ˜ ( b ) = μ ˜ ( b + 1 ) = μ ˜ ( b + 2 ) = = M ,
we need to verify that μ ˜ ( n ) > 0 for 1 n b 1 . To this aim, let us focus at first on the expression of μ ˜ ( b 1 ) :
μ ˜ ( b 1 ) = λ μ λ b μ ˜ ( b ) × × μ ˜ ( 2 b 1 ) = λ μ λ b M b .
Taking into account (3) and the lower bound M 0 in (10), it is easy to verify that
μ ˜ ( b 1 ) = M μ > M 0 μ > 0 .
Next, let us consider the expression of
μ ˜ ( b 2 ) = λ μ λ b ( M μ ) M b 1 ,
from which by direct comparison it follows that μ ˜ ( b 1 ) > μ ˜ ( b 2 ) . In a similar way, it is easy to see that
μ ˜ ( b 1 ) > μ ˜ ( b 2 ) > > μ ˜ ( 1 ) .
Therefore, to complete the proof, it is enough to show that μ ˜ ( 1 ) > 0 , where
μ ˜ ( 1 ) = λ μ λ b μ ˜ ( 2 ) × × μ ˜ ( b + 1 ) .
Taking advantage of the following equality, derived in [29],
λ μ = λ b μ ˜ ( 1 ) μ ˜ ( 2 ) × × μ ˜ ( b ) ,
we get
μ ˜ ( 1 ) = λ μ μ ˜ ( 1 ) λ b μ ˜ ( 1 ) μ ˜ ( 2 ) × × μ ˜ ( b + 1 ) = λ μ λ μ μ ˜ ( 1 ) M .
Therefore,
μ ˜ ( 1 ) = λ M M + λ > 0
and
μ ˜ ( b 1 ) > μ ˜ ( b 2 ) > > μ ˜ ( 1 ) > 0 ,
completing the proof.

3. Optimal Input Rate for Batch Services

Let us consider the general expression of the average sojourn time in M / M b / 1 queuing systems, given by (4), with the constraint (3) that provides the implicit relation between M and the queue parameters, i.e.,
u ¯ b ( λ , M ) = b 1 2 λ + 1 M λ , f ( M , λ ) = M b + 1 ( λ + μ ) M b + λ b μ = 0 ,
where we highlighted the dependence of the sojourn time on λ (directly and through M), while μ and b are fixed parameters, determined by the specific queuing system under study, since we are interested in the optimization with respect to λ .
It is well known that in the classical M / M / 1 queue, the average sojourn time
u ¯ 1 = 1 μ λ ,
which can be easily derived from (4) noticing that b = 1 and the largest root of the characteristic equation is M = μ , is a monotone increasing function of λ and u ¯ 1 as the utilization ρ 1 . Instead, in the case of batch services (i.e., b 2 ), there is an additional delay due to the completion of a batch, represented by the first term in (4). Indeed, u ¯ b also as λ 0 ; hence, it is reasonable to expect the existence of some intermediate values of λ between 0 and b μ (stability condition), for which the sojourn time is minimal.
This consideration is confirmed by Figure 1, in which the behavior of u ¯ b is plotted as a function of λ for b = 1 , 2 , 3 . Without loss of generality, as further discussed at the beginning of Section 4.2, μ = 1 is assumed in the plots.
In other words, we are looking for the optimal arrival rate λ (for fixed values of b 2 and μ ), i.e.,
u ¯ b λ opt = min λ u ¯ b ( λ ) ,
that corresponds to the search for a conditional minimum [30] of a function of several variables, which can be obtained through the total derivative
d u ¯ b d λ = 0 .
In the next Section 3.1, we derive the mathematical condition for the existence of a unique minimum of the sojourn time, while in Section 3.2 and Section 3.3, we determine the mathematical expression of the optimal arrival rate and provide its approximation, respectively. Finally, the goodness of the approximation will be discussed in Section 4.

3.1. Condition for the Existence of an Optimal Arrival Rate

In the calculation of the total derivative (12), it is important to take into account that the variables λ and M in (11) are not independent. The general solution is provided by the method of Lagrange multipliers [30], but in our case, the same condition can be achieved through direct calculation. Indeed, omitting the arguments of the functions, we can write:
d u ¯ d λ = u ¯ λ + u ¯ M d M d λ = b 1 2 λ 2 + 1 ( M λ ) 2 1 ( M λ ) 2 d M d λ = b 1 2 λ 2 + 1 ( M λ ) 2 1 d M d λ
and, taking into account the expressions in Section 2.2,
d u ¯ d λ = b 1 2 λ 2 + 1 ( M λ ) 2 1 + f λ ( M , λ ) f M ( M , λ ) = b 1 2 λ 2 + 1 ( M λ ) 2 b λ b 1 μ ( M λ ) M b + 1 b λ b μ = ( b 1 ) ( M λ ) ( M b + 1 b λ b μ ) 2 b b 1 λ b 1 μ 2 λ 2 ( M λ ) ( M b + 1 b λ b μ ) = μ ( b 1 ) φ ( M , λ ) 2 λ 2 ( M λ ) ( M b + 1 b λ b μ ) ,
where
μ φ ( M , λ ) = ( M λ ) ( M b + 1 b λ b μ ) 2 b b 1 λ b 1 μ .
Since M > λ and, according to Equation (9),
f M ( M , λ ) = M b b λ b μ M > 0 ,
it is easy to see that
d u ¯ d λ 0 φ ( M , λ ) 0 .
Taking into account the characteristic Equation (3), μ φ ( M , λ ) can be rewritten as follows:
μ φ ( M , λ ) = ( M λ ) ( M b + 1 b λ b μ ) 2 b b 1 λ b + 1 μ = ( M λ ) ( λ + μ ) M b ( b + 1 ) λ b μ 2 b b 1 λ b + 1 μ = μ M b + 1 + λ M b + 1 ( λ + μ ) M b ( b + 1 ) M λ b μ + ( b 1 ) λ b + 1 μ 2 b b 1 λ b + 1 μ = μ M b + 1 ( b + 1 ) λ b M + b λ b + 1 2 b b 1 λ b + 1 μ = μ M b + 1 ( b + 1 ) λ b M + b b 3 b 1 λ b + 1 ,
i.e.,
φ ( M , λ ) = M b + 1 ( b + 1 ) λ b M + b b 3 b 1 λ b + 1 .
Noticing that φ ( M , λ ) is a homogeneous polynomial of degree b + 1 , we can rewrite φ ( M , λ ) in terms of a new function ψ of the ratio M / λ :
φ ( M , λ ) = λ b + 1 M λ b + 1 ( b + 1 ) M λ + b b 3 b 1 = λ b + 1 ψ M λ
with
ψ ( x ) = x b + 1 ( b + 1 ) x + b b 3 b 1 .
Since λ represents the arrival rate and cannot be negative, (13) is equivalent to
d u ¯ d λ 0 ψ ( x ) = x b + 1 ( b + 1 ) x + b b 3 b + 1 0 , x > 1 ,
where the condition x > 1 is derived from the tighter lower bound in (10) and the stability condition λ < b μ . Indeed,
M > M 0 = μ + λ λ b > λ b + λ λ b = λ .
In this way, the existence of a conditional minimum for the average sojourn time is determined by the behavior of ψ ( x ) for x > 1 .
It is easy to see that ψ ( x ) is monotone increasing for x > 1 , since
ψ ( x ) = ( b + 1 ) x b 1 .
Moreover, ψ ( x ) + as x + and
ψ ( 1 ) = 2 b b 1 < 0 ,
so there exists a unique root x 0 > 1 of the equation ψ ( x ) = 0 , which corresponds to the unique extremum of u ¯ b ( λ , M ) , i.e.,
x 0 = M λ opt .
Figure 2 provides a visual illustration of the behavior of ψ ( x ) for different values of b, highlighting the existence and uniqueness of x 0 .
Finally, taking into account the general shape of u ¯ b that diverges to + for λ 0 and λ b μ , the extremum corresponds to the global minimum of the sojourn time.

3.2. Calculation of the Optimal Rate

The optimal input rate λ opt can be easily derived from the value of x 0 taking into account the characteristic equation. To this aim, let us divide (3) by λ b + 1 , taking into account (15) for λ = λ opt , we get
x 0 b + 1 1 + μ λ opt x 0 b + μ λ opt = 0
and, rearranging the terms:
λ opt = μ x 0 b 1 x 0 b + 1 x 0 b = μ x 0 b 1 x 0 b ( x 0 1 ) ,
i.e., the optimal input rate is proportional to μ . The corresponding minimum sojourn time can be simply derived by direct substitution in (4):
u ¯ min = b 1 2 λ opt + 1 x 0 λ opt λ opt = 1 λ opt b 1 2 + 1 x 0 1 .
Unfortunately, there is no general closed-form expression for x 0 = x 0 ( b ) , so the previous equation can be used to analytically derive λ opt just for b = 2 and b = 3 . In more detail, for b = 2 :
ψ 2 ( x ) = x 3 3 x 2 = ( x 2 ) ( x + 1 ) 2
and the unique solution of ψ 2 ( x ) = 0 satisfying the condition x > 1 is x 0 = 2 . Hence, from (16):
λ 2 , opt = μ x 0 2 1 x 0 2 ( x 0 1 ) = 3 4 μ .
In a similar way, for b = 3 :
ψ 3 ( x ) = x 4 4 x = x ( x 3 4 ) x 0 = 4 3
and
λ 3 , opt = μ x 0 3 1 x 0 3 ( x 0 1 ) = 3 4 μ 4 3 1 .
Although for higher values of b x 0 can be easily determined using numerical methods, in the next subsection, we provide an approximate formula, based on the assumption that x 0 1 as b , as suggested by Figure 2.

3.3. Approximate Expression for the Optimal Input Rate

Assume that
x 0 = 1 + α b + o 1 b ,
where the coefficient α > 0 is determined from the condition ψ ( x 0 ) = 0 . For b ,
b 3 b 1 b 1 2 b + o 1 b
and
1 + α b + o 1 b b + 1 b e α + o ( 1 ) ,
so we can rewrite the condition ψ ( x 0 ) = 0 as follows:
e α ( b + 1 ) 1 + α b + b 2 + o ( 1 ) = 0
and, neglecting the infinitesimal terms:
e α ( α + 3 ) = 0 .
As shown in Figure 3, the latter equation has two roots, α 1 2.947531 and α 2 1.505241 . Since x > 1 , and hence α > 0 , the following approximation
x 0 1 + 1.505241 b
can be used to estimate the optimal input rate
λ appr = μ 1 1 + α / b b α / b
as well as the corresponding minimum value of the average sojourn time from (17):
u ¯ appr = 1 λ appr b 1 2 + b α = b ( 2 + α ) α 2 α λ appr .
In the next section, at first we will verify the correctness of the approximation for x 0 , and then calculate the relative error for both the optimal input rate and the corresponding average sojourn time.

4. Discussion

The goodness of the first order approximation and the behavior of the minimum sojourn time (including its asymptotic value as b ) are illustrated in the next two subsections, while some remarks about optimization with respect to the other queue parameters are presented in Section 4.3. Finally, extensions to this work are briefly investigated in Section 4.4.

4.1. Numerical Validation of the First Order Approximation

As already stated above, the function ψ ( x ) has a unique solution satisfying the stability condition ( x 0 > 1 ), which can be found by numerical methods for b 4 (exact values are available only for b = 2 and b = 3 ). However, for a better understanding of the service mechanism as a function of the batch size, it would be useful to obtain an analytical expression (or, at least, some kind of approximation) of x 0 as a function of b. Hence, for the sake of simplicity, in Section 3.3, we proposed a first order approximation, given by (19).
In order to assess the goodness of (19), we compare our approximation with the numerical solutions for b 4 , which can be easily found, for instance, with the bisection method applied to the interval [ 1 , 2 ] , since ψ ( 2 ) > 0 for b > 3 as can be straightforwardly checked by direct substitution. As highlighted in Table 1, the root x 0 converges to 1 for high values of b; moreover, the relative error decreases monotonically and is already below 1% for b = 8 . Hence, our assumption seems to be adequate and it is meaningful to check the validity of the corresponding approximations (20) and (21) for the optimal arrival rate and the minimum sojourn time.

4.2. Evaluation of the Optimal Rate and Corresponding Sojourn Time

At first, let us consider the general expression of the sojourn time, focusing on its dependence on the batch service rate μ for a fixed value of η = λ / μ . Although (4) does not depend explicitly on μ , taking advantage of the notation introduced in Section 2.2, i.e., M = μ ξ and λ = η μ , the average sojourn time can be rewritten as
u ¯ = 1 μ b 1 2 η + 1 ξ η .
Since ξ is independent of μ , for a fixed value of η , the quantity in brackets does not depend on μ ; therefore, the sojourn time u ¯ is inversely proportional to μ , and its minimum is achieved for the same value of η = η ( b ) , in agreement with Equation (16). Hence, without loss of generality, in the numerical analysis we will consider the case μ = 1 .
Table 2 highlights the effect of the previous approximation of x 0 on the values of the optimal input rate and the corresponding minimum sojourn time. For the sake of completeness we also report the cases b = 2 and b = 3 in italics, for which exact analytical values are available and hence it is meaningless to use the approximate value of x 0 . As expected, both relative errors are decreasing functions of b (in agreement with Table 1 since all the values are based on x 0 and x appr ).
The numerical values in Table 2 highlight two relevant properties of the sojourn time:
  • The shape of the curve u ¯ opt ( λ ) is quite flat around its minimum; indeed, even significant deviations in λ lead to small variations in the sojourn time, as shown in the first lines of the table. In practice, this implies a high stability of the system even in case of variations in the input rate;
  • In spite of the fact that M λ as b + (i.e., x 0 1 ), the increments in u ¯ min become increasingly smaller as b goes to + .
To investigate the existence of an upper bound for u ¯ min , let us estimate the optimal arrival rate λ as b , according to Equation (20):
λ appr = μ 1 1 + α / b b α / b b μ b α 1 e α = μ b α α + 2 α + 3 ,
where we made use of equality (18), which defines α . The corresponding minimum value of the average sojourn time from (17) becomes
u ¯ appr = b ( 2 + α ) α 2 α λ appr b 1 μ α ( α + 3 ) b ( α + 2 ) b ( 2 + α ) α 2 α = α + 3 2 μ ,
i.e.,
u ¯ 1 μ α + 3 2 2.25262 μ
in perfect agreement with the last lines in Table 2. Moreover, as a side result, we obtained the following approximation of λ opt for high values of b:
λ b μ α + 2 α ( α + 3 ) b 0.51688 μ b ,
i.e., asymptotically λ opt grows linearly with the batch size. This final result also justifies the finite values of the sojourn time in spite of the fact that both terms inside the brackets in (4) diverge when b .

4.3. Optimal Sojourn Time with Respect to Other Queue Parameters

As already stated, the average sojourn time depends (directly and/or through M) on all the queue parameters, namely b, λ , and μ . To complete the analysis of u ¯ , we briefly consider the optimization with respect to b and μ , in both cases assuming that the other two parameters are fixed.
The optimization with respect to b (for fixed values of the input and service rates) has been deeply investigated in [29]; in a nutshell, u ¯ b has a unique minimum that can be attained for the minimal batch size (i.e., b = 2 ) or some intermediate value of b, depending on the specific values of λ and μ .
Finally, the optimization with respect to μ is trivial; the faster the server, the lower the sojourn time. This conclusion can be easily verified taking the total derivative of u ¯ with respect to μ , as we did in Section 3.1, but with a simpler calculation. Indeed, we can write:
d u ¯ d μ = u ¯ μ + u ¯ M d M d μ = 1 ( M λ ) 2 d M d μ < 0 ,
since
d M d μ > 0
as shown in Section 2.2; hence, u ¯ is a decreasing function of μ and attains its minimum as μ , with
u ¯ min ( μ ) = b 1 2 λ .

4.4. Final Remarks and Future Work

This paper presented a first order approximation for the root x 0 of the auxiliary equation ψ ( x ) = 0 which, together with the characteristic equation, permits to determine the optimal arrival rate λ opt , which minimizes the average sojourn time in the queue. It is shown that the goodness of both approximations increases with the batch size b and, even for b = 4 (for b = 2 and b = 3 the exact values are available analytically), the relative errors for λ opt and u ¯ min are less than 6.70% and 0.34%, respectively. This result can be exploited for the optimization of a wide variety of real-life systems, including collective transportation, batch request processes in cloud systems, and databases employing multi-query optimization.
As a future extension of this work, higher order approximations of x 0 deserve consideration, mainly from a theoretical point of view (as highlighted above, the first order expansion is good enough for practical purposes), to get a deeper insight into the asymptotic convergence of the sojourn time to its maximum value.
The existence of an optimal arrival rate can be employed for the minimization of the average response time of open queuing networks with batch services, where the response time is defined as the weighted sum (according to the node arrival rate) of the node sojourn times. In this case, the routing matrix is the optimization parameter, with the additional constraint that the number of possible destinations from each node should be significantly larger than the batch size; otherwise, the formulas for a queue in isolation based on the equivalent birth–death process cannot be extended to an open queuing network, as highlighted in [29], where we addressed the optimization problem with respect to the batch size.

Author Contributions

Conceptualization, M.P., I.T. and E.S.; methodology, M.P. and I.T.; software, M.P.; validation, M.P., I.T. and E.S.; formal analysis, M.P.; investigation, M.P., I.T. and E.S.; data curation, M.P.; writing—original draft preparation, M.P.; writing—review and editing, M.P., I.T. and E.S.; visualization, M.P.; supervision, M.P.; project administration, M.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially funded by the Italian Ministry of Education and Research (MIUR) in the framework of the FoReLab project (Departments of Excellence) and by the University of Pisa in the framework of the PRA_2022_64 “hOlistic Sustainable Management of distributed softWARE systems (OSMWARE)” project.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Brockmeyer, E.; Halstrøm, H.; Erlang, A.; Jensen, A. The Life and Works of A.K. Erlang; Academy of Technical Sciences, Copenhagen Telephone Comp.: Copenhagen, Denmark, 1948. [Google Scholar]
  2. Bychkov, I.; Kazakov, A.; Lempert, A.; Zharkov, M. Modeling of Railway Stations Based on Queuing Networks. Appl. Sci. 2021, 11, 2425. [Google Scholar] [CrossRef]
  3. Lee, H.; Choi, E.K.; Min, K.A.; Bae, E.; Lee, H.; Lee, J. Physician-Customized Strategies for Reducing Outpatient Waiting Time in South Korea Using Queueing Theory and Probabilistic Metamodels. Int. J. Environ. Res. Public Health 2022, 19, 2073. [Google Scholar] [CrossRef]
  4. Li, L.; Guo, M.; Ma, L.; Mao, H.; Guan, Q. Online Workload Allocation via Fog-Fog-Cloud Cooperation to Reduce IoT Task Service Delay. Sensors 2019, 19, 3830. [Google Scholar] [CrossRef]
  5. Andonov, V.; Poryazov, S.; Otsetova, A.; Saranova, E. A Queue in Overall Telecommunication System with Quality of Service Guarantees. In Future Access Enablers for Ubiquitous and Intelligent Infrastructures; Springer International Publishing: Berlin/Heidelberg, Germany, 2019; pp. 243–262. [Google Scholar] [CrossRef]
  6. Gold, H.; Tran-Gia, P. Performance analysis of a batch service queue arising out of manufacturing system modelling. Queueing Syst. 1993, 14, 413–426. [Google Scholar] [CrossRef]
  7. Wu, K. Classifications for batch service queues in production systems. In Proceedings of the 2014 IEEE International Conference on Automation Science and Engineering (CASE), New Taipei, Taiwan, 18–22 August 2014; pp. 698–703. [Google Scholar]
  8. Chang, W. Bulk queue model for computer system analysis. IBM J. Res. Dev. 1974, 18, 370–372. [Google Scholar] [CrossRef]
  9. Santhi, K.; Saravanan, R.D. Performance analysis of cloud computing bulk service using queueing models. Int. J. Appl. Eng. Res. 2017, 12, 6487–6492. [Google Scholar]
  10. Balbo, G.; Vigliotti, M.G. On the analysis of a M/M/1 queue with bulk services. Comput. J. 2015, 58, 57–74. [Google Scholar] [CrossRef]
  11. Gupta, G.K.; Banerjee, A. Steady state analysis of system size-based balking in M/Mb/1 queue. Int. J. Math. Oper. Res. 2019, 14, 319–337. [Google Scholar] [CrossRef]
  12. Bailey, N.T.J. On queueing processes with bulk service. J. R. Stat. Soc. Ser. B Methodol. 1954, 16, 80–87. [Google Scholar] [CrossRef]
  13. Downton, F. Waiting time in bulk service queues. J. R. Stat. Soc. Ser. B Methodol. 1955, 17, 256–261. [Google Scholar] [CrossRef]
  14. Neuts, M.F. A general class of bulk queues with Poisson input. Ann. Stat. 1967, 38, 759–770. [Google Scholar] [CrossRef]
  15. Bagchi, T.P.; Templeton, J.G.C. Finite waiting space bulk queueing systems. J. Eng. Math. 1973, 7, 313–317. [Google Scholar] [CrossRef]
  16. Deb, R.K.; Serfozo, R.F. Optimal control of batch service queues. Adv. Appl. Probab. 1973, 5, 340–361. [Google Scholar] [CrossRef]
  17. Chaudhry, M.L.; Templeton, J.G.C. A First Course in Bulk Queues; John Wiley & Sons: New York, NY, USA, 1983. [Google Scholar]
  18. Gross, D.; Shortle, J.F.; Thompson, J.M.; Harris, C.M. Fundamentals of Queueing Theory; John Wiley & Sons: Hoboken, NJ, USA, 2008. [Google Scholar]
  19. Kleinrock, L. Queueing Systems: Volume I-Theory; John Wiley & Sons: New York, NY, USA, 1975. [Google Scholar]
  20. Krishna Reddy, G.V.; Anitha, R. Markovian bulk service queue with delayed vacations. Comput. Oper. Res. 1998, 25, 1159–1166. [Google Scholar] [CrossRef]
  21. Holman, D.F.; Chaudhry, M.L.; Ghosal, A. Some results for the general bulk service queueing system. Bull. Aust. Math. Soc. 1981, 23, 161–179. [Google Scholar] [CrossRef]
  22. Economou, A. An alternative model for queueing systems with single arrivals, batch services and customer coalescence. Queueing Syst 2002, 40, 407–432. [Google Scholar] [CrossRef]
  23. Deena Merit, C.K.; Haridass, M. Analysis of flexible batch service queueing system to constrict waiting time of customers. Math. Probl. Eng. 2021, 2021, 16. [Google Scholar]
  24. Krishnamoorthy, A.; Joshua, A.N.; Vishnevsky, V. Analysis of a k-Stage Bulk Service Queuing System with Accessible Batches for Service. Mathematics 2021, 9, 559. [Google Scholar] [CrossRef]
  25. Barbhuiya, F.P.; Gupta, U.C. A difference equation approach for analysing a batch service queue with the batch renewal arrival process. J. Differ. Equ. Appl. 2019, 25, 233–242. [Google Scholar] [CrossRef]
  26. Stankevich, E.; Tananko, I.; Pagano, M. Analysis of Open Queueing Networks with Batch Services. In Information Technologies and Mathematical Modelling. Queueing Theory and Applications; Dudin, A., Nazarov, A., Moiseev, A., Eds.; Springer International Publishing: Cham, Switzerland, 2022; pp. 40–51. [Google Scholar]
  27. Krishnamoorthy, A.; Ushakumari, P.V. A queueing system with single arrival bulk service and single departure. Math. Comput. Modelling 2000, 31, 99–108. [Google Scholar] [CrossRef]
  28. Bountali, O.; Economou, A. Equilibrium joining strategies in batch service queueing systems. Eur. J. Oper. Res. 2017, 260, 1142–1151. [Google Scholar] [CrossRef]
  29. Stankevich, E.; Tananko, I.; Pagano, M. Optimization of Open Queuing Networks with Batch Services. Mathematics 2022, 10, 3027. [Google Scholar] [CrossRef]
  30. Smirnov, V.I. A Course of Higher Mathematics; Volume I Elementary Calculus; International Series of Monographs in Pure and Applied Mathematics; Pergamon Press: Oxford, UK, 1964. [Google Scholar]
Figure 1. Average sojourn time for b = 1 , 2 , 3 .
Figure 1. Average sojourn time for b = 1 , 2 , 3 .
Axioms 12 00656 g001
Figure 2. Behavior of ψ b ( x ) for b = 4 , 5 , 6 .
Figure 2. Behavior of ψ b ( x ) for b = 4 , 5 , 6 .
Axioms 12 00656 g002
Figure 3. Graphical solution of the equation e α ( α + 3 ) = 0 .
Figure 3. Graphical solution of the equation e α ( α + 3 ) = 0 .
Axioms 12 00656 g003
Table 1. Comparison between exact and approximate roots of ψ ( x ) .
Table 1. Comparison between exact and approximate roots of ψ ( x ) .
b x 0 x appr RE (%)
41.4195521.3763103.04613
51.3270851.3010481.96194
61.2682581.2508741.37074
71.2274611.2150341.01239
81.1974791.1881550.77859
91.1745021.1672490.617527
101.1563271.1505240.501826
201.0766361.0752620.127612
301.0507751.0501750.057092
401.0379661.0376310.0322264
601.0252351.0250870.0143748
801.0188981.0188160.00810094
1001.0151051.0150520.00519053
2001.0075391.0075260.00130072
3001.0050231.0050170.0005786
4001.0037661.0037630.000325619
5001.0030131.0030100.000208465
6001.0025101.0025090.000144804
7001.0021511.0021500.000106408
8001.0018821.001882 8.148 × 10 5
9001.0016731.001672 6.439 × 10 5
10001.0015061.001505 5.217 × 10 5
Table 2. Numerical results— λ opt and sojourn time ( μ = 1 ).
Table 2. Numerical results— λ opt and sojourn time ( μ = 1 ).
b λ opt λ appr RE (%) u ¯ min u ¯ appr RE (%)
20.75000.896119.48392.00002.04072.0328
31.27681.404610.00582.11652.13090.6803
41.79651.91686.69282.16172.16890.3372
52.31462.43075.01752.18502.18940.2008
62.83202.94554.0092.19912.20200.1332
73.34913.46093.33712.20842.21050.0947
83.86623.97662.8572.21512.21660.0708
94.38314.49262.49762.22002.22120.0549
104.90015.00882.21822.22382.22480.0439
2010.068910.17421.04602.23942.23970.0103
3015.237615.34190.68412.24412.24420.0045
4020.406420.51010.50832.24632.24640.0025
6030.744030.84720.33572.24852.24850.0011
8041.081741.18460.25062.249552.24957 6.15 × 10 4
10051.419451.52230.19992.250182.25019 3.92 × 10 4
200103.108103.2100.09942.251412.25141 9.76 × 10 5
300154.796154.8990.06612.251822.25182 4.33 × 10 5
400206.485206.5870.04962.252022.25202 2.43 × 10 5
500258.173258.2750.03962.252142.25214 1.56 × 10 5
600309.862309.9640.03302.252222.25222 1.08 × 10 5
700361.550361.6520.02832.252282.25228 7.94 × 10 6
800413.234413.3410.02482.252322.25232 6.08 × 10 6
900464.927465.0290.02202.252352.25235 4.80 × 10 6
1000516.616516.7180.01982.252382.25238 3.89 × 10 6
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Pagano, M.; Tananko, I.; Stankevich, E. On the Optimal Input Rate in Queues with Batch Service. Axioms 2023, 12, 656. https://doi.org/10.3390/axioms12070656

AMA Style

Pagano M, Tananko I, Stankevich E. On the Optimal Input Rate in Queues with Batch Service. Axioms. 2023; 12(7):656. https://doi.org/10.3390/axioms12070656

Chicago/Turabian Style

Pagano, Michele, Igor Tananko, and Elena Stankevich. 2023. "On the Optimal Input Rate in Queues with Batch Service" Axioms 12, no. 7: 656. https://doi.org/10.3390/axioms12070656

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop