The analytical model proposed in the article makes it possible to determine the loss (the ratio of the number of rejected requests to create a VM in the cloud system to all requests of this type that occur) for different classes of requests in the cloud system. These requests differ in the number of system resources requested and the intensity of the influx, as well as the number of resources available in each physical machine. To determine the probability of loss, the authors used models known from the literature, i.e., a limited-availability system and an Erlang’s ideal system.
3.1. Limited-Availability System
A limited-availability system (LAS) is a system built of
k identical subsystems with full availability. A diagram of such a system is shown in
Figure 2. The capacity of each subsystem is equal to
=
, where
is the capacity of the entire system. The system is offered
m classes of requests, each of which requires
AUs
to be handled.
Request class
i appearing at the input to the system can be handled in the system if and only if at least one of the subsystems has enough free resources to handle it, i.e.,
. Consequently, the LAS excludes the possibility of splitting the
AUS request required by the class
i request among different subsystems. Analytical models for such systems have been proposed in [
24]. According to these models, the distribution of occupancy in LAS can be determined as follows:
where
is the offered traffic intensity of class i;
is the occupancy probability of n AUs in LAS with a total capacity of allocation units;
is the so-called conditional transition coefficient between neighboring occupancy states in LAS:
where
is the number of possible distributions of
x free (unoccupied) AUs in
k subsystems, where each of the subsystems has a capacity of
AUs:
By knowing the occupancy distribution, it becomes feasible to determine the loss probability for each request class serviced by the system under the formula
In a simplified manner, the results of LAS modeling can be symbolically represented as follows:
where
P represents the occupancy distribution in LAS, obtained based on (
1), and
E is a set of loss probabilities (
4), while
A and
c are sets of offered traffic and its demands (Formulas (
6) and (
7)).
The LAS model approximates the external structure of a cloud system, consisting of k physical machines with equal capacities. These are offered a set of virtual machines with different requirements, expressed in a dedicated class number of AUs.
3.2. Erlang Ideal System
An Erlang’s ideal system (EIS) is a certain abstract system in which requests of class
arriving at its input do not have access to all of the
of the system, but only a limited portion of it, called
availability (
Figure 3). The concept and analytical model for single-service traffic was proposed by Erlang back in the early 20th century [
25]. This model for multi-service traffic was generalized in the works [
26,
27]. In [
26], it was assumed that all classes of traffic have identical availability. In [
27], on the other hand, the assumption was that requests from different classes have access to resources with different values, i.e., they have different accessibility.
Requests that access the same system resources are called a load group. The number of load groups in an EIS system
, for request class
i (
), can be calculated using the following formula:
The availability parameter is defined for each class of requests in the system. Its value depends on the structure of the system, the mechanism for controlling access to resources, or specific requirements for the resources themselves. It is worth noting that within a single class of requests, understood as requests of the same size (i.e., the required number of
AUs (
) and value of availability parameter
), not all requests necessarily receive access to the same system AUs. Consequently, the demands of a class can be offered in different load groups.
Figure 3 illustrates a very simple example of such a situation—a single-service system (occurring in a system of 1 class of requests and requiring a single resource for service) with a capacity of 3 AUs and an availability of
AUs. In the situation shown, according to the Formula (
8), 3 load groups can be distinguished, having access to the 1st and the 2nd resource in the system, the 2nd and the 3rd resource in the system, and the 1st and the 3rd resource in the system, respectively. The figure shows the blocked state for the third load group, because even though there is one resource free in the system (the 2nd place in the system), this request does not have access to it and must be rejected.
Analogically to LAS, sufficient resources throughout the system do not guarantee that a request will be accepted for service. An incoming request of class
i will be accepted for service if and only if a sufficient number of resources
are free within the load group specified for that request. Given this assumption, the conditional transition coefficient in the EIS model can be determined as follows:
where
, if ,
, if .
After determining the conditional transition coefficient between states, the occupancy distribution can be determined using the equation
Once the conditional probability of transition between adjacent states is determined, the occupancy distribution can be determined by the formula
In this article, the EIS model was used to approximate the internal structure of the cloud system, in which the upcoming requests (to create MVs of a given class) at the input of a single physical machine have access to a uniquely defined part of the resources of this machine.
To simplify the notation, the results of the EIS model can symbolically be represented as follows:
where
represents the occupancy distribution in the EIS, obtained from an equation analogous to that obtained with Equation (
10), while E is the set of loss probabilities obtained using Equation (
11). While
,
and
are sets of offered traffic and its requests and availability:
The analytical models presented in
Section 3.1 and
Section 3.2 are well known in the literature of the subject and have been successfully used for years for the analysis, design and optimization of ICT systems. These models are used to analyze switching fields [
28], physical cloud computing infrastructure [
19] or prioritized systems [
29]. Regardless of the current work on the systems under consideration, the books [
30,
31] provide a coherent summary of knowledge on these models and their applications.
3.3. Model L-E-CBP
In articles [
22,
23], two analytical models of the cloud system were proposed, based on a product-based approach to determine the conditional transition coefficient in systems in which state dependence is influenced by several independent factors [
32]. In the model, the cloud system was treated as a unified system (i.e., a system without division into external and internal structures) with a productively determined conditional transition probability. The disadvantage of this approach is the necessity to scale the occupancy state in a single physical machine to the occupancy state in the entire system, which consequently negatively affects the accuracy of the adopted solution. In order to improve the accuracy of analytical modeling of cloud systems, the paper proposes a new model L-E-CBP (LAS-EIS-Cloud Blocking Probability). The authors describe model using the symbols and acronyms listed in
Table 2.
The model assumes that the external structure of the cloud system (a system of physical machines) is modeled based on LAS, while the internal structure (a single virtual machine) is modeled based on EIS. First, the L-E-CPB model determines the loss probability resulting from the external structure of the cloud system, i.e., a system consisting of
k independent physical machines with identical parameters. In these physical machines, a request for a virtual machine creation can be serviced only if there are free resources in at least one physical machine of the system to support the virtual machine completely, i.e., without participating in the support of other physical machines. Then, assuming that the algorithms for allocating resources to virtual machines strive to load all physical machines equally, it can be assumed that the traffic offered to individual physical machines is identical and results from the division of traffic flowing down from the external structure (i.e., traffic that has not been lost in the external structure) into
k traffic streams of identical intensity. In the article, such traffic is called equivalent traffic and is the basis for modeling the internal structure of the cloud system (a single physical machine) based on the EIS model. In the next step, the actual loss probability of the cloud system is determined, taking into account both the availability of a single physical machine and the number of physical machines in the system.
Figure 4 shows the concept of traffic distribution in a cloud system, according to the proposed L-E-CBP model.
In the proposed L-E-CBP model, the probability of loss of class i, resulting from the external structure of the cloud system, is approximated by a LAS model consisting of
k subsystems, which in turn are EIS systems with a capacity of
AUs. Thus, the probability of loss in the external structure of the system, expressed by Equation (
4), can be rewritten in the adopted notation as follows:
Simultaneously, the probability of loss due to the internal configuration of the system is approximated by the EIS model, consisting of
AUs, where requests of class
i have access to a certain portion of the physical machine’s resources determined by the value of the availability parameter
. This probability of loss is given by the Equation (
11).
Let us consider a cloud system with a total capacity of
providing access to
k PMs. Note that the external structure’s probability of loss
is determined by the traffic offered to the entire cloud system, i.e., the
traffic. On the other hand, the probability of internal structure loss
is determined from the equivalent traffic sets
, where each element of
is defined.
Thus defined equivalent traffic—offered to a single internal structure—is traffic that is not lost in the external structure. The total probability of loss in the cloud system
, taking into account losses in the external structure and
k internal structures in the L-E-CBP model, is approximated by the probability of traffic loss. The probability of traffic loss is defined as the ratio of traffic lost to traffic offered to a given system. Traffic lost of class
i in an external structure has a value of
. Traffic lost in a single internal structure (physical machine) is equal to
. Thus, the total class and lost traffic in an external structure and
k internal structures is equal to
. Now, based on the definition of traffic loss probability, the total loss probability of class
i requests in the cloud system can be approximated as follows:
The values of are the elements of the sets covering loss probabilities for all classes of traffic offered to the cloud system.
The determination of cloud system loss probabilities based on the L–E–CBP model can be written in the form of the L–E–CBP METHOD:
L–E–CBP METHOD:
Determine, according to the LAS model, the occupancy distribution and the loss probability in the external structure of the system according to the formula
.
Determine the equivalent traffic–based on Formula (
17)–appearing at the input of a single physical machine:
.
Determine the occupancy distribution and the loss probability in a single physical machine of capacity at the input of which the equivalent traffic offered determined in Step 2 appears. Based on the EIS model, it can be written as follows:
.
Determine the total losses in the cloud system based on the Formula (
18):
.