1. Introduction
With the emergence of fifth generation (5G) mobile communication and the Internet of Things, a variety of applications, such as augmented reality, facial recognition, mobile games, smart cities, and smart buildings, have been developed [1,2,3,4,5,6,7,8,9,10]. Many of these applications require high processing performance and low processing latency, and each task for these applications must be processed within an acceptable delay. However, it is difficult to process tasks for mobile applications within acceptable delays on mobile terminals [11,12,13]. This is because the processing capability of mobile terminals is low, and it takes a long time to process the tasks on these terminals.
By using task offloading, tasks for mobile applications can be processed on cloud servers, which are external servers with higher processing performance than mobile terminals [14]. The tasks can be processed on cloud servers in a short time [15]; however, the transmission delay is large due to the long distance between the mobile terminal and the cloud servers [16]. Task offloading is a complex process that is affected by a number of different factors [17], and it requires application partitioning, offloading decision making, and distributed task execution [18].
Mobile edge computing (MEC) has attracted attention for processing tasks for applications that require low processing delay [19,20]. MEC was defined by the European Telecommunications Standards Institute [21], and it has recently also been called multi-access edge computing. MEC is one type of edge computing, which supports several characteristics, including mobility support, location awareness, low latency, and heterogeneity [22,23,24]. In general, edge computing has more limited resources and more limited computation and storage capabilities than fog computing, but it is located closer to end devices [25].
In a MEC platform in which MEC servers can be used, tasks can be processed on the MEC servers using task offloading, and the transmission delay for the task processing can be significantly reduced compared with cloud servers. However, the amount of available computing resources in a MEC server is limited, and the processing performance of a MEC server is lower than that of cloud servers. Thus, the number of tasks processed on a MEC server affects the performance of the MEC server. To process each task within an acceptable delay, some tasks should not be processed on the MEC server to avoid reducing the processing performance.
In some MEC platforms, both MEC servers and cloud servers can be utilized for processing tasks [26]. MEC servers are classified into the following two groups based on their location and users: dedicated MEC servers and shared MEC servers. Dedicated MEC servers process tasks that are sent from the closest access point, while shared MEC servers process tasks that are sent from any access point. Each task should be processed on an appropriate server among the dedicated MEC servers, shared MEC servers, and cloud servers to satisfy its acceptable delay. Moreover, each task should be processed with low latency even when the acceptable delay is satisfied. Therefore, the total delay between sending a task and receiving a response, summed over all tasks, can be reduced by using MEC servers and cloud servers appropriately. However, the latency of each task is significantly affected by other task processes; therefore, it is difficult to perform task allocation for these servers. In addition, tasks transmitted from multiple access points are allocated to one of multiple MEC servers. Task allocation must be performed for tasks transmitted from multiple access points; however, it is difficult to account for the bottleneck node in a MEC platform. To the best of the authors' knowledge, task allocation has not been studied for a MEC platform in which MEC servers and cloud servers are utilized from multiple access points and there is a bottleneck node.
In this paper, we propose a task allocation method for reducing the total latency in a MEC platform. In this platform, there are three types of servers: a dedicated MEC server, a shared MEC server, and a cloud server. For this platform, we first calculate the delay between sending a task and receiving a response for the dedicated MEC server, shared MEC server, and cloud server by considering the processing time and transmission delay. Here, the bottleneck node is modeled as an M/M/1 queueing model, and the transmission delay for the shared MEC server is derived using queueing theory. Then, we formulate an optimization problem for task allocation to minimize the total latency for all tasks. By solving this optimization problem, tasks can be allocated to the MEC servers and cloud server appropriately. However, the calculation time is very long even if a meta-heuristic algorithm, such as the genetic algorithm [27], is used. Therefore, we also propose a heuristic algorithm to obtain an approximate optimal solution in a shorter time. This heuristic algorithm consists of four algorithms: a main algorithm and three additional algorithms. In this algorithm, tasks are divided into two groups, and task allocation is executed for each group. We compare the performance of our proposed heuristic algorithm with the solution obtained by the genetic algorithm and other methods and investigate the effectiveness of our algorithm.
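Since the bottleneck node is modeled as an M/M/1 queue, the extra delay it contributes can be sketched with the standard M/M/1 mean sojourn time. The following Python helper is illustrative, not the paper's exact formulation; the function and parameter names are our own.

```python
def mm1_sojourn_time(arrival_rate: float, service_rate: float) -> float:
    """Mean time a task spends at an M/M/1 node (waiting plus service).

    For arrival rate lambda and service rate mu, the mean sojourn time is
    W = 1 / (mu - lambda), valid only while the queue is stable (lambda < mu).
    """
    if arrival_rate >= service_rate:
        raise ValueError("unstable queue: arrival rate must be below service rate")
    return 1.0 / (service_rate - arrival_rate)
```

For example, with an arrival rate of 2.0 and a service rate of 100 (values used later in the numerical examples), the bottleneck adds roughly 1/98 ≈ 0.0102 to the latency of each task routed through it.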
Various studies have been conducted on task allocation methods for the MEC platform [20,26,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59], which are described in Section 2. In comparison with these studies, we offer the following contributions and benefits:
This paper considers task allocation for a MEC platform in which two types of MEC servers and a cloud server can be utilized.
Three different equations are formulated to calculate the latency for each server.
Our proposed heuristic algorithm can quickly derive the approximate optimal solution for the optimization problem in a situation in which three different servers are utilized.
Our proposed heuristic algorithm can be implemented in a MEC platform and a mobile application, such as our developed application and system [60,61], because this algorithm is not complex to implement.
Task allocation may fall into a local minimum when our proposed heuristic algorithm is used because the task allocation processes are kept simple so that they can be implemented in a MEC platform. However, we can avoid falling into a local minimum by adding a random search technique (ARSET) and heuristic random optimization (HRO) [62]. It should be noted that this paper is an extension of our previous work [63].
The remainder of this paper is organized as follows. Section 2 presents related work on task allocation in a MEC platform. Section 3 describes our system model, and Section 4 formulates an optimization problem to reduce the total latency in the MEC platform. Section 5 proposes a heuristic algorithm for solving the optimization problem, and Section 6 calculates the computational complexity of the heuristic algorithm. Section 7 presents numerical examples, and Section 8 concludes the paper.
2. Related Work
In this section, we introduce related work on task allocation in a MEC platform. In [20], an offloading algorithm was proposed for multiple users to perform computation offloading in a MEC environment. In this environment, multi-channel radio interference was utilized for offloading, and the algorithm used game theory for task offloading. In [26], the authors studied resource allocation for a multi-user MEC offloading system based on time-division multiple access and orthogonal frequency-division multiple access. In [28], the authors proposed a task allocation method for a hybrid non-orthogonal multiple access (NOMA) MEC system to reduce the processing delay and save energy consumption. The proposed method formulates an optimization problem and utilizes a matching algorithm to obtain a better solution. In [29], the authors proposed a cooperative task allocation method to minimize the power consumption of mobile terminals in an environment with a MEC server and a cloud server. In this environment, task processing can be performed on the MEC server near the base station via wireless communication. This method can also use cloud servers via optical line terminals or the Internet.
In [30], the authors defined a mathematical model of a MEC environment in which traffic flows can be managed. The proposed permissive underestimation system, which selects the destination server with the lowest latency, provides an effective solution for a MEC platform. In addition, in [31], the authors discussed how a MEC server can be used to realize serverless edge computing. Following the European Telecommunications Standards Institute (ETSI) MEC standard, two alternative design approaches were proposed to handle rapid changes in mobility and load conditions. Numerical examples demonstrated that the proposed approaches were effective in keeping the response time low under system changes.
In [32], the authors proposed an optimization framework for computation offloading and resource allocation in a MEC environment with multiple servers. This framework can be used to minimize the total computational overhead. The individual computation decisions, the transmit power of the users, and the computation resources were optimized. MEC servers were utilized in this environment, but cloud servers were not. In addition, this paper adopted a suboptimal approach by splitting the original problem into a computation offloading decision problem and a joint resource allocation problem.
In [33], the authors investigated a two-tier offloading method for multiple MEC servers in heterogeneous networks. In this method, the total computation overhead was minimized by solving a formulated optimization problem that was a mixed-integer nonlinear programming problem. The original problem was also divided into a resource allocation problem and a computation offloading problem. In [34], the authors focused on a MEC platform in which there were two types of MEC servers: a near server and a far server. In this platform, delay-sensitive tasks were allocated to the near server, while computationally intensive tasks were allocated to the far server. However, this task allocation did not consider the utilization of cloud servers. In [35], a resource management technique based on game theory was proposed for a MEC platform and small-scale data centers. This technique can minimize energy consumption and costs while ensuring applications' performance using a semi-cooperative game.
In [36], the authors proposed a heuristic offloading algorithm to maximize the reliability performance of computation offloading. The method can be used in an Internet of Vehicles environment in which fixed edge computing nodes and MEC nodes are used, but cloud servers are not. For a similar Internet of Vehicles environment, in [37], the authors modeled the data redundancy and proposed a collaborative task computing scheme. The proposed scheme can reduce the redundant data and utilize the idle resources in nearby MEC servers. In [38], the authors proposed an optimization framework for offloading from a single mobile device to multiple edge devices. This framework is based on semi-definite relaxation (SDR), and tasks are allocated considering the central processing unit (CPU) to improve energy consumption and processing latency. In [39], for the industrial Internet of Things, the authors proposed a MEC-enabled architecture that considers tasks' priority constraints. This architecture can minimize the response time using a task allocation strategy based on a Bayesian-network-based evolutionary algorithm. In [40], for latency- and reliability-sensitive computing tasks processed in a swarm of drones, the authors proposed a task allocation method based on an optimization problem. In the swarm of drones, nearby drones are used as MEC servers for processing the tasks. This algorithm can minimize the energy consumption of the swarm of drones while the latency and reliability requirements are satisfied.
For cloud computing environments without MEC servers, in [41,42], the authors proposed resource management methods for cloud computing environments and cloud data centers. These methods can manage resources to improve energy consumption, service performance, and costs. In [43], the authors studied the combination of two virtualization technologies: virtual machines and containers. The authors presented the advantages of running containers on virtual machines.
For an environment where MEC servers and cloud servers are available, [44,45] proposed algorithms that allocate tasks to a MEC server or cloud servers to minimize the total latency. Optimization problems were formulated for latency reduction and were solved using a genetic algorithm. In both problems, there was only one MEC server, and no heuristic algorithms were proposed. In [46], a task allocation method to increase user satisfaction was proposed. The minimization of power consumption was also considered [47,48,49,50,51].
MEC is widely expected to be utilized by future Internet applications. Therefore, various uses of MEC have been proposed [52,53,54,55]. In particular, machine learning and artificial intelligence are effective in a MEC platform [56,57,58,59]. However, utilizing machine learning and artificial intelligence requires a large number of data sets obtained from the real environment and a long training time to determine an appropriate task allocation.
4. Optimization Problem Formulation for Total Latency Reduction
In this section, we formulate an optimization problem for allocating tasks to three servers to minimize the total latency for the system model described in Section 3. For this optimization problem, we define an allocation variable for each task and each server:
The above variables indicate the server on which each task is processed. For example, a value of one for a variable indicates that the corresponding task is allocated to the corresponding server.
Given the acceptable latency of each task, we formulate the following optimization problem for minimizing the total latency for all tasks:
In this optimization problem, the objective function (5) signifies that tasks are allocated to servers to minimize the total latency. The constraint condition (6) indicates that the latency of each task must be equal to or lower than its acceptable latency. Moreover, (7) signifies that each task is allocated to exactly one of the three servers. This optimization problem can be solved simply using meta-heuristic algorithms, such as the genetic algorithm.
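To make the structure of the problem concrete, the following sketch solves a tiny instance by exhaustive search: each task is assigned to one of the three servers, assignments that violate the acceptable latency are discarded, and the total latency is minimized. For simplicity, the latency of each (task, server) pair is fixed here, whereas in the paper's model the latency on a MEC server also depends on how many other tasks it receives; all names are illustrative.

```python
from itertools import product

def best_allocation(latency, acceptable):
    """Exhaustively solve a tiny task-allocation instance.

    latency[i][s]: latency of task i on server s (0 = dedicated MEC,
    1 = shared MEC, 2 = cloud); acceptable[i]: acceptable latency of
    task i. Returns (assignment tuple, total latency), or None when no
    assignment satisfies every acceptable latency.
    """
    n = len(latency)
    best = None
    for assign in product(range(3), repeat=n):
        # Constraint (6): every task must meet its acceptable latency.
        if any(latency[i][assign[i]] > acceptable[i] for i in range(n)):
            continue
        total = sum(latency[i][assign[i]] for i in range(n))  # objective (5)
        if best is None or total < best[1]:
            best = (assign, total)
    return best
```

This brute force enumerates 3^N assignments, which is exactly why a heuristic is needed for realistic task counts.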
5. Proposed Heuristic Algorithm
In this section, we propose a heuristic algorithm for solving the formulated optimization problem. Our proposed heuristic algorithm consists of four algorithms, denoted as Algorithms 1–4. Algorithm 1 is the main algorithm, while the remaining algorithms are used as functions in the main algorithm. Figure 4 presents an overview of our proposed heuristic algorithm. In our algorithm, tasks whose acceptable latency is low are allocated preferentially so that the acceptable latency of all tasks is satisfied. In Algorithm 1, all tasks are first divided into two sets in line 1. This division is performed based on the acceptable latency in Algorithm 2.
Algorithm 1 Main algorithm.
Input: All parameters for our optimization problem
Output: , ,
1: Task division ( , ) /*Algorithm 2*/
2: MEC allocation /*Algorithm 3*/
3: MEC cloud allocation /*Algorithm 4*/
In Algorithm 2, tasks are divided into two sets: one containing the tasks that can be processed on the cloud server S and one containing the tasks that can never be processed on S. If the acceptable latency of a task is smaller than the latency for S, the task can never be processed on S and is placed in the latter set in lines 3 and 4. Otherwise, the task is placed in the former set in lines 5 and 6.
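A minimal sketch of this division step, assuming the only criterion is whether a task's acceptable latency is below the latency achievable via the cloud server (the names are ours, not the paper's notation):

```python
def divide_tasks(tasks, cloud_latency):
    """Split tasks into (mec_only, any_server) based on acceptable latency.

    A task whose acceptable latency is below the latency via the cloud
    server can never be processed on it, so it must go to a MEC server.
    tasks: list of (task_id, acceptable_latency) pairs.
    """
    mec_only, any_server = [], []
    for task_id, deadline in tasks:
        (mec_only if deadline < cloud_latency else any_server).append(task_id)
    return mec_only, any_server
```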
Then, each task in this set is allocated to the dedicated MEC server or the shared MEC server in line 2 of Algorithm 1, and this allocation is performed in Algorithm 3. In Algorithm 3, we compute the latency that each task would incur if it were processed on the dedicated MEC server and on the shared MEC server, respectively. Furthermore, we keep track of the minimum value of the acceptable latency among the tasks allocated to each of the two MEC servers. As explained in the previous paragraph, a task in this set must be allocated to one of the two MEC servers because its acceptable latency is smaller than the latency for the cloud server. In addition, a task whose acceptable latency is low should be allocated to the dedicated MEC server because its transmission delay is zero. Therefore, the allocation of the tasks in this set is decided in ascending order of acceptable latency, and the set is sorted in ascending order of acceptable latency in line 1. It should be noted that, after line 1, the first task has the minimum acceptable latency while the last task has the maximum. In this task allocation, the candidate latencies are compared with the tracked minimum acceptable latencies in lines 6, 18, and 23. When all tasks satisfy their acceptable latency regardless of which MEC server the current task is allocated to (line 6), the task is allocated to the server that yields the lower latency by comparing the two candidate latencies in line 7 or 12. After the task is allocated to a server, the tracked minimum acceptable latency of that server may be updated in line 10 or 15. When all tasks satisfy the acceptable latency only if the current task is allocated to one particular MEC server but not the other (lines 18 and 23), the task is allocated to the feasible server, and the tracked minimum acceptable latency of that server may be updated in line 21 or 26.
Algorithm 2 Task division function.
Input:
Output:
Initialization:
1: LOOP Process:
2: while  do
3:   if  then
4:
5:   else
6:
7:   end if
8:
9: end while
Algorithm 3 MEC allocation function.
Input:
Output: , , for
Initialization:
1:  in  is sorted in ascending order of
2:
3:
4: LOOP Process:
5: while  do
6:   if  and  then
7:     if  then
8:
9:       if  then
10:
11:      end if
12:    else
13:
14:      if  then
15:
16:      end if
17:    end if
18:  else if  and  then
19:
20:    if  then
21:
22:    end if
23:  else
24:
25:    if  then
26:
27:    end if
28:  end if
29: end while
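The allocation logic of Algorithm 3 can be sketched greedily as follows. This is a simplification under assumed names: both MEC servers share one load-dependent latency function, and only the incoming task's own deadline is checked, whereas the paper's algorithm also re-checks the acceptable latency of previously allocated tasks.

```python
def mec_allocate(tasks, proc_time):
    """Greedy sketch of the MEC allocation step (the paper's Algorithm 3).

    tasks: list of (task_id, acceptable_latency) for tasks that must run
    on a MEC server. proc_time(load): latency on a MEC server holding
    `load` tasks; both servers are assumed identical here, which
    simplifies the paper's model (where the dedicated server has zero
    transmission delay and the shared server sits behind a bottleneck).
    Tasks are handled in ascending order of acceptable latency; each is
    placed on the currently less loaded server if its deadline holds,
    otherwise on the other server, otherwise allocation fails.
    Returns {task_id: "dedicated" | "shared"} or None if infeasible.
    """
    loads = {"dedicated": 0, "shared": 0}
    plan = {}
    for task_id, deadline in sorted(tasks, key=lambda t: t[1]):
        for server in sorted(loads, key=loads.get):
            if proc_time(loads[server] + 1) <= deadline:
                plan[task_id] = server
                loads[server] += 1
                break
        else:
            return None  # neither MEC server can meet this deadline
    return plan
```

Handling the tightest deadlines first mirrors the paper's design choice of allocating deadline-critical tasks before flexible ones.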
In Algorithm 4, the remaining tasks are allocated to the dedicated MEC server, the shared MEC server, or the cloud server S, because each of these tasks can also be allocated to the cloud server. Here, this task allocation can easily satisfy the acceptable latency of a task by allocating the task to S. This is because the processing time on the two MEC servers does not change when a task is allocated to S. Therefore, in this algorithm, the remaining tasks are sorted in descending order of acceptable latency in line 1 to reduce the total latency. It should be noted that, after line 1, the first task has the maximum acceptable latency and the last task has the minimum. Here, we consider the total latency that would result if the current task were processed on the dedicated MEC server, the shared MEC server, or S, respectively. In this task allocation, the candidate latencies are compared with the tracked minimum acceptable latencies in lines 6, 20, 29, and 38. When all tasks satisfy their acceptable latency regardless of which MEC server the current task is allocated to (line 6), the task is allocated to the server for which the total latency becomes the smallest in line 7, 12, or 17. After the task is allocated to one of the MEC servers, the tracked minimum acceptable latency of that server may be updated in line 10 or 15. When all tasks satisfy the acceptable latency only if the current task is allocated to one particular MEC server but not the other (lines 20 and 29), the task is allocated to the feasible MEC server or to S, whichever makes the total latency smaller (lines 21, 26, 30, and 35). Finally, when no MEC allocation of the current task can satisfy the acceptable latency of all tasks, the task is allocated to S.
Algorithm 4 MEC cloud allocation function.
Input:
Output: , , for
Initialization:
1:  in  is sorted in decreasing order of
2:
3:
4: LOOP Process:
5: while  do
6:   if  and  then
7:     if  and  then
8:
9:       if  then
10:
11:      end if
12:    else if  and  then
13:
14:      if  then
15:
16:      end if
17:    else
18:
19:    end if
20:  else if  and  then
21:    if  then
22:
23:      if  then
24:
25:      end if
26:    else
27:
28:    end if
29:  else if  and  then
30:    if  then
31:
32:      if  then
33:
34:      end if
35:    else
36:
37:    end if
38:  else
39:
40:  end if
41:
42: end while
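A simplified sketch of this MEC-cloud step: the remaining tasks (whose acceptable latency is assumed to be at least the cloud latency, as guaranteed by the earlier division) are handled in descending order of deadline and placed on whichever server adds the least latency while meeting the deadline. The paper's Algorithm 4 compares full total latencies and maintains additional bookkeeping; the names and the shared MEC latency model below are assumptions.

```python
def mec_cloud_allocate(tasks, mec_latency, cloud_latency, loads):
    """Sketch of the MEC-cloud allocation step (the paper's Algorithm 4).

    tasks: list of (task_id, acceptable_latency); every acceptable
    latency is assumed to be at least cloud_latency. mec_latency(load):
    latency on a MEC server holding `load` tasks (one shared model for
    both MEC servers, a simplification). loads: current task counts per
    MEC server, updated in place. Each task is placed on the feasible
    server with the smallest resulting latency; the cloud is always
    feasible by assumption.
    """
    plan = {}
    for task_id, deadline in sorted(tasks, key=lambda t: -t[1]):
        options = {s: mec_latency(loads[s] + 1) for s in ("dedicated", "shared")}
        options["cloud"] = cloud_latency
        feasible = {s: d for s, d in options.items() if d <= deadline}
        server = min(feasible, key=feasible.get)
        plan[task_id] = server
        if server != "cloud":
            loads[server] += 1
    return plan
```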
6. Computational Complexity
In order to investigate the scalability of our proposed algorithm, we derive the computational complexity of our proposed heuristic algorithm. First, there is no loop process in Algorithm 1, which is the main algorithm; therefore, the computational complexity of this algorithm can be derived from Algorithm 2, Algorithm 3, or Algorithm 4.
In Algorithm 2, there is a loop process from line 2 to line 9, and the order of this loop process is O(N), in which N is the number of tasks. In Algorithms 3 and 4, there are also loop processes, from line 5 to line 29 and from line 5 to line 42, respectively. From line 5 of Algorithm 3, the order of this loop process is O(N) because the number of tasks handled in Algorithm 3 is equal to or smaller than N. Moreover, from line 5 of Algorithm 4, the order of this loop process is also O(N) because the number of tasks handled in Algorithm 4 is equal to or smaller than N.
As a result, the computational complexity of our proposed algorithm is O(N). This signifies that the computational complexity of this algorithm does not depend on the parameters of a MEC platform and is affected only by the number of tasks. Therefore, our proposed algorithm is scalable to a large-scale MEC platform.
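The linear growth can be illustrated by counting loop iterations: the division pass visits each of the N tasks once, and the two allocation passes together visit each task once more because they operate on disjoint subsets. A toy counter under these assumptions:

```python
def allocation_iterations(n_tasks: int) -> int:
    """Upper bound on loop iterations across Algorithms 2-4.

    The division loop runs n_tasks times; the two allocation loops
    together run at most n_tasks times because they handle disjoint
    task subsets. (The sorting inside Algorithms 3 and 4 would add an
    O(N log N) term, which the complexity analysis above leaves aside.)
    """
    division_pass = n_tasks
    allocation_passes = n_tasks
    return division_pass + allocation_passes
```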
7. Numerical Examples
In this section, we evaluate the performance of our proposed heuristic algorithm described in Section 5 through comparison with other methods, such as near-optimal task allocation with the genetic algorithm.
In the MEC platform for the performance evaluation, the number of tasks N is 10, 20, 30, 40, or 50, and the number of tasks is equal to 20. The processing efficiency for is , and the processing efficiency for is . In addition, the transmission delay of tasks for S is , 0.2, 0.3, 0.4, or 0.5. For task , is determined according to a uniform distribution of , and is determined according to a uniform distribution of . For the bottleneck node, we assume that the arrival rate is equal to 1.25, 1.5, 1.75, 2.0, or 2.25, and the service rate is equal to 100. Table 1 presents a list of the parameter settings in the simulation. These parameter settings were decided according to our MEC platform and application [62].
For this MEC platform, we evaluate the performance of the proposed heuristic algorithm, denoted as Proposed, and the performance of near-optimal task allocation, denoted as GA. In the near-optimal task allocation, the number of chromosomes in each generation is 1000 and the mutation probability is 0.005. The GA stops if there is no improvement in the best objective value for 1000 generations. It should be noted that we confirmed that the result of GA is the same as the optimal value obtained by the CPLEX optimizer [64] when the number of tasks is small. Therefore, the result of GA is used as the optimal one, and the performance of our proposed heuristic algorithm is investigated by comparing it with the result of GA.
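As a reference point, a genetic algorithm for this assignment problem can be sketched as below. This toy version uses fixed per-(task, server) latencies, a penalty for deadline violations, elitist selection, one-point crossover, and the paper's mutation probability of 0.005; the population and generation counts are scaled down from the paper's settings, and all names are ours.

```python
import random

def ga_allocate(latency, acceptable, pop=200, gens=100, mut=0.005, seed=0):
    """Toy GA baseline: chromosome = one server index (0/1/2) per task.

    latency[i][s]: latency of task i on server s; acceptable[i]: the
    acceptable latency of task i. Fitness is total latency plus a large
    penalty per deadline violation. Returns (best chromosome, its cost).
    """
    rng = random.Random(seed)
    n = len(latency)

    def cost(ch):
        total = 0.0
        for i, s in enumerate(ch):
            total += latency[i][s]
            if latency[i][s] > acceptable[i]:
                total += 1e6  # penalty: constraint (6) violated
        return total

    population = [[rng.randrange(3) for _ in range(n)] for _ in range(pop)]
    for _ in range(gens):
        population.sort(key=cost)
        elite = population[: pop // 2]          # elitist selection
        children = []
        while len(children) < pop - len(elite):
            a, b = rng.sample(elite, 2)
            cut = rng.randrange(1, n) if n > 1 else 0
            child = a[:cut] + b[cut:]           # one-point crossover
            children.append([rng.randrange(3) if rng.random() < mut else g
                             for g in child])   # per-gene mutation
        population = elite + children
    best = min(population, key=cost)
    return best, cost(best)
```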
We also evaluate another heuristic algorithm in which the total-latency terms are replaced by the per-task latencies in Algorithm 4. This signifies that the latency of each task is considered, but the total latency is not considered in Algorithm 4. The performance evaluation of this method is useful for investigating the validity of our proposed heuristic algorithm, in which the total latency is considered; the result is denoted as Comp. Finally, as one of the simplest methods, we evaluate the performance of a random method, denoted as Random, in which tasks are allocated to the three servers at random. For this method, we derive the average value over 10 simulations. By comparing Proposed with Random, the effectiveness of Proposed can be investigated.
In the following performance evaluation, there are four performance metrics:
Total latency: the value of the objective function (5).
Minimum latency: the minimum latency among all tasks.
Maximum latency: the maximum latency among all tasks.
Calculation time: the time to perform task allocation by solving the optimization problem.
These metrics are derived by solving the optimization problem (5) using the four methods.
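Given an allocation, the first three metrics can be computed directly; a small sketch with assumed names:

```python
def latency_metrics(latency, assignment):
    """Compute total, minimum, and maximum latency for an allocation.

    latency[i][s]: latency of task i on server s (assumed fixed here);
    assignment[i]: server index chosen for task i.
    """
    per_task = [latency[i][s] for i, s in enumerate(assignment)]
    return {"total": sum(per_task), "min": min(per_task), "max": max(per_task)}
```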
7.1. Impact of Number of Tasks
First, we investigate the impact of the number of tasks N on the performance of each method when the transmission delay via the Internet is 0.2 and the arrival rate is 2.0.
Figure 5 presents the total latency versus the number of tasks N. This figure indicates that the total latency increases as the number of tasks increases for all methods. This is because the total number of CPU cycles required for processing the tasks increases. Among the four methods, the total latency of GA is the lowest, as expected. Furthermore, the latency of Random is much higher than that of GA, which demonstrates that tasks should not be allocated to servers at random. For our proposed method (Proposed), the obtained latency is close to the near-optimal result of GA. This is because our proposed method is constructed to obtain an appropriate solution for the optimization problem (5)–(7). Moreover, the latency of Comp is almost the same as that of GA when the number of tasks is small; however, it increases as the number of tasks increases. When the number of tasks is 50, the total latency of Comp is much higher than that of Random. Therefore, Figure 5 demonstrates that our proposed heuristic algorithm can effectively reduce the total latency compared to Random and Comp.
Next, we evaluate the minimum latency and maximum latency for each task allocation in Figure 6 and Figure 7, respectively. In these figures, the number of tasks N is 50, the transmission delay is 0.2, and the arrival rate is 2.0. Figure 6 demonstrates that the minimum latency of Proposed is larger than that of GA, which means that our proposed method cannot obtain the optimal solution for the optimization problem. However, the minimum latency of Proposed is much lower than that of Comp owing to the use of appropriate parameters in Algorithm 4. Here, the minimum latency of Random is the lowest among the four methods because Random ignores the total latency reduction. In terms of the maximum latency, Figure 7 demonstrates that the latency of Proposed is almost the same as that of GA. This result signifies that Proposed can allocate tasks to appropriate servers so as not to increase the maximum latency while reducing the total latency. Here, the maximum latency for GA, Proposed, and Random is 0.4, which is the latency for tasks offloaded to the cloud servers. Therefore, GA, Proposed, and Random can utilize MEC servers appropriately, whereas MEC servers are overused in Comp. These results indicate that our proposed heuristic algorithm is effective in solving the optimization problem to reduce the total latency.
Figure 8 and Figure 9 illustrate how tasks are allocated to each server when N is equal to 30 and 50, respectively. In both figures, the transmission delay is 0.2 and the arrival rate is 2.0. In these figures, almost the same number of tasks are allocated to each server for Random because the task allocation is determined at random. By comparing Random with the other methods, we observe that a large number of tasks are allocated to the cloud server S. As a result, the minimum latency and maximum latency are low in Figure 6 and Figure 7; however, the total latency is high in Figure 5. In our proposed method, the number of tasks for one of the MEC servers is almost the same as that for GA, but the numbers of tasks for the other MEC server and S are somewhat different from those for GA. In our proposed method, the number of tasks offloaded to each server depends on the processing order that is predetermined in line 1 of Algorithms 3 and 4. Therefore, it is hard to obtain the optimal task offloading with our proposed method. In Comp, the number of tasks for S is the smallest in both cases because the total latency cannot be considered in Algorithm 4.
7.2. Impact of Transmission Delay for Cloud Server
Next, we investigate the impact of the transmission delay for the cloud server on the performance of each method. Figure 10 presents the total latency versus the transmission delay in the case of N = 50 and an arrival rate of 2.0. This figure indicates that the total latency increases as the transmission delay increases for GA, Proposed, and Random. This is because the latency for tasks that are allocated to the cloud server S increases. From this figure, we find that the total latency for Proposed is close to that for GA. This means that our heuristic algorithm is effective regardless of the transmission delay. In contrast, the total latency of Comp does not increase when the transmission delay is larger than 0.2. In Comp, many tasks are allocated to S when the transmission delay is small because only the latency of each task is considered. However, the latency of each task allocated to S increases as the transmission delay increases. Therefore, the number of tasks for S decreases, and the total latency does not increase even if the transmission delay increases.
We also evaluate the minimum latency and maximum latency for each task allocation versus the transmission delay. In Figure 11 and Figure 12, N is set to 50 and the arrival rate is set to 2.0. Figure 11 demonstrates that the minimum latency of Comp is the largest because many tasks are allocated to S even if the MEC servers are available. On the other hand, the minimum latency of Proposed is higher than that of GA and Random. The difference between Proposed and these two methods increases as the transmission delay increases. This signifies that many tasks are allocated to MEC servers using our proposed method, and the minimum latency of our method increases. However, in Figure 12, the maximum latency of Proposed is equal to that of GA in most cases. In contrast, the maximum latency of Comp is very different from that of the other methods. This is because many tasks are allocated to MEC servers even if the processing time on these servers increases. Although the task allocation of Proposed is somewhat different from that of GA, our heuristic algorithm is more effective than Comp and Random.
7.3. Impact of Arrival Rate in a MEC Platform
In this subsection, we investigate the impact of the arrival rate on the performance of each method in the case of N = 50 and a transmission delay of 0.2. A change in the arrival rate corresponds to a change in the number of tasks, so the scalability of our heuristic algorithm and the heterogeneity of the system model can be investigated. It should be noted that there are no results for Random because the random task allocations obtained could not satisfy the constraint conditions.
Figure 13 presents the total latency versus the arrival rate. This figure indicates that the total latency increases as the arrival rate increases for GA and Proposed. This is because the latency increases from (1) and (3). In contrast, when the arrival rate increases from 2.0 to 2.25, the total latency of Comp decreases. This is because many tasks are allocated to the cloud server S and the processing time on the MEC servers decreases. Figure 13 demonstrates that our proposed heuristic algorithm can effectively reduce the total latency compared to Comp regardless of the arrival rate.
In addition, we evaluate the minimum latency and maximum latency for each task allocation versus the arrival rate. Figure 14 demonstrates that the minimum latency of Proposed is higher than that of GA but lower than that of Comp. In Figure 15, the maximum latency of Proposed is equal to that of GA regardless of the arrival rate. These results show that our heuristic algorithm is more effective than Comp, although the task allocation of Proposed is somewhat different from that of GA. Here, the proposed method uses the total latency for each candidate server in Algorithm 4, whereas Comp uses the per-task latency for each server. This means that our proposed algorithm is effective because it considers the total latency instead of the latency for a single server. This tendency can also be seen in the previous subsections.
7.4. Calculation Time
Finally, we investigate the calculation time of our proposed method using a computer running macOS Mojave 10.14.6 with a 2.3 GHz Intel Core i5 and 8 GB of memory. It should be noted that the calculation time varies each time it is measured and is not constant.
Table 2 presents the calculation time of GA, Proposed, and Comp in the case of N = 10, 20, 30, 40, and 50. Here, the transmission delay is equal to 0.2 and the arrival rate is equal to 2.0. This table indicates that the calculation time of our proposed method (Proposed) is much lower than that of near-optimal task allocation (GA). As the number of tasks increases, the difference between Proposed and GA becomes large. This is because, as is widely known, meta-heuristic algorithms including GA take a longer processing time than heuristic algorithms. This means that the heuristic algorithm is effective for task allocation in real environments, and GA is not appropriate for deciding task offloading in real time. Moreover, the calculation time of our proposed method is almost the same as that of the other heuristic algorithm (Comp). This is because the two algorithms are almost identical, although our proposed algorithm is more effective. From these results, we can conclude that our proposed heuristic algorithm is effective for task allocation in a MEC platform with multiple types of MEC servers.
8. Conclusions and Future Work
For 5G and the future Internet, in this paper, we proposed a task allocation method for reducing the total latency in a MEC platform with three types of servers: a dedicated MEC server, a shared MEC server, and a cloud server. The proposed method can perform approximately optimal task allocation in a shorter time than meta-heuristic algorithms. This heuristic algorithm consists of four algorithms: a main algorithm and three additional algorithms. In this algorithm, tasks are divided into two groups, and task allocation is executed for each group. The computational complexity of our proposed algorithm depends only on the number of tasks. We compared the performance of our proposed heuristic algorithm with the solution obtained by GA and evaluated the effectiveness of our algorithm. From the numerical examples, we observed that the results of our proposed method were similar to the results of near-optimal task allocation with GA. When the number of tasks changed, the difference between our proposed method and GA did not change significantly. In addition, the proposed algorithm reduced the total latency compared with the other methods. In terms of the transmission delay, the proposed method remained highly effective even when the transmission delay increased. This is because our proposed method can utilize MEC servers in the same way as GA. On the other hand, as the arrival rate became large, the difference between the proposed method and GA increased. This is because the impact of incorrect task allocation became larger as the arrival rate increased. Nevertheless, the proposed method was more effective than the other methods. Finally, the calculation time of our proposed method was much lower than that of near-optimal task allocation with GA.
These results indicated that our proposed heuristic algorithm was effective in solving the optimization problem to reduce the total latency and for task allocation in a MEC platform with multiple types of MEC servers.
For a large-scale MEC platform, our system model and proposed algorithm can be utilized by modeling multiple MEC servers and multiple access points as a shared MEC server and an access point group, respectively. If the impact of each individual MEC server and access point must be evaluated, our proposed method has to be extended; this extension is one of our future works. In addition, we have developed an open MEC platform and a mobile augmented reality application. In future work, we will implement the proposed algorithm in our MEC platform and mobile application and experimentally evaluate its performance. Moreover, in order to improve the performance, a deep learning algorithm may be employed in the future.