Article

Enhanced Link Prediction and Traffic Load Balancing in Unmanned Aerial Vehicle-Based Cloud-Edge-Local Networks

by Hao Long, Feng Hu * and Lingjun Kong *
College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
* Authors to whom correspondence should be addressed.
Drones 2024, 8(10), 528; https://doi.org/10.3390/drones8100528
Submission received: 7 August 2024 / Revised: 21 September 2024 / Accepted: 25 September 2024 / Published: 27 September 2024

Abstract
With the advancement of cloud-edge-local computing, Unmanned Aerial Vehicles (UAVs), as flexible mobile nodes, offer novel solutions for dynamic network deployment. However, existing research on UAV networks faces substantial challenges in accurately predicting link dynamics and efficiently managing traffic loads, particularly in highly distributed and rapidly changing environments. These limitations result in inefficient resource allocation and suboptimal network performance. To address these challenges, this paper proposes a UAV-based cloud-edge-local network resource elastic scheduling architecture, which integrates the Graph-Autoencoder–GAN-LSTM (GA–GLU) algorithm for accurate link prediction and the FlowBender-Enhanced Reinforcement Learning for Load Balancing (FERL-LB) algorithm for dynamic traffic load balancing. GA–GLU accurately predicts dynamic changes in UAV network topologies, enabling adaptive and efficient scheduling of network resources. FERL-LB leverages these predictions to optimize traffic load balancing within the architecture, enhancing both performance and resource utilization. To validate the effectiveness of GA–GLU, comparisons are made with classical methods such as CN and Katz, as well as modern approaches like Node2vec and GAE–LSTM, which are commonly used for link prediction. Experimental results demonstrate that GA–GLU consistently outperforms these competitors in metrics such as AUC, MAP, and error rate. The integration of GA–GLU and FERL-LB within the proposed architecture significantly improves network performance in highly dynamic environments.

1. Introduction

Cloud-edge-local computing [1] is a computing paradigm that combines cloud, edge, and local resources to process data efficiently by distributing workloads across centralized cloud systems, edge devices, and local networks. This paradigm addresses challenges such as data transmission latency, bandwidth constraints, and privacy concerns by processing data closer to its source, thereby reducing the load on centralized cloud resources and enabling faster response times.
UAV networks are communication networks formed by Unmanned Aerial Vehicles (UAVs) that enable data exchange and coordination between drones and ground stations for various applications. UAVs have become essential for dynamic network deployment due to their rapid mobility and ability to extend network coverage in remote or inaccessible areas. These networks also play a vital role in mobile edge computing, offering the potential for task offloading and improving network flexibility.
Despite these advantages, existing research on UAV networks faces significant challenges in predicting the dynamic nature of communication links and managing network traffic in real time, particularly in highly distributed and evolving environments. These challenges often lead to suboptimal resource allocation and inefficient network performance, limiting the full potential of UAV-based architectures. As UAVs become more prevalent in cloud-edge-local computing, addressing these challenges is crucial for maintaining network reliability and performance in scenarios that demand quick responses and seamless connectivity.
To address these limitations, this paper proposes an integrated cloud-edge-local network resource scheduling architecture, specifically designed for UAV networks. The architecture is grounded in the fat-tree topology, a structure commonly used in data center networks due to its scalability and fault tolerance. By leveraging this topology, the proposed architecture provides a robust foundation for elastic resource scheduling, allowing for efficient network management in dynamic UAV environments. This architecture effectively allocates UAV nodes across hierarchical layers, enhancing resource flexibility and ensuring consistent performance, even in fluctuating network conditions.
We introduce the Graph-Autoencoder–GAN-LSTM (GA–GLU) method for UAV link prediction, which combines the advantages of Graph Autoencoders (GAEs) and Generative Adversarial Networks (GANs) with LSTM predictive networks for link prediction and dynamic resource scheduling in UAV networks. The GA–GLU method is essential to this architecture, as it addresses the challenge of predicting link dynamics in highly dynamic UAV topologies. By incorporating a GAE for feature extraction and GANs for optimizing these features, GA–GLU provides accurate link predictions, while the LSTM component enhances its temporal prediction capabilities, making it highly suitable for evolving UAV networks. This method’s ability to foresee link changes enables more adaptive and efficient resource management across the network.
This study significantly enhances the resource utilization of cloud-edge-local networks by introducing a UAV-based elastic scheduling architecture and the GA–GLU graph representation learning method. To complement the link prediction capabilities of GA–GLU, we propose the FlowBender-Enhanced Reinforcement Learning for Load Balancing (FERL-LB) algorithm, which dynamically adjusts network traffic distribution based on real-time link predictions. This approach achieves more efficient traffic management and service quality assurance by improving load balancing in highly dynamic network environments. By integrating FERL-LB within the architecture, traffic loads are dynamically balanced, thereby optimizing resource allocation and preventing congestion, even in volatile UAV network scenarios. Together, GA–GLU and FERL-LB provide a comprehensive solution for real-time network scheduling, improving both performance and resource efficiency across cloud-edge-local networks.
The innovations of this paper are as follows:
  • We propose a UAV-based elastic scheduling architecture for cloud-edge-local network resources, effectively utilizing UAV flexibility to enhance network deployment and employing the fat-tree topology structure to improve network scalability and management efficiency.
  • We introduce the GA–GLU graph representation learning method to predict UAV links, accurately forecasting UAV network topology changes for dynamic resource scheduling.
  • We propose the FERL-LB algorithm, which integrates reinforcement learning to optimize traffic load balancing based on link prediction, significantly improving network performance and resource utilization.

2. Related Works

2.1. UAV-Based Cloud-Edge-Local Resource Scheduling

Mobile edge computing (MEC) enables the deployment of innovative applications and services at the network edge, providing computational capabilities close to a large number of IoT devices  [2]. With MEC deployment, devices can offload computationally intensive tasks to nearby powerful edge servers, thereby reducing latency and saving energy [3]. Recent research efforts have shifted from fixed edge servers to mobile edge servers, which offer greater flexibility, cost-effectiveness, and efficiency in challenging environments. Some recent studies have proposed using UAVs to improve connectivity for ground IoT devices [4]. UAV-assisted wireless communication offers advantages in flexible deployment, fully controllable mobility, and enhanced network performance, garnering increasing research interest. Hovering UAV-based FSO relay systems, as explored in recent studies under the effects of pointing errors and atmospheric turbulence, provide valuable insights into improving performance in UAV networks [5]. Efficient routing protocols have been proposed to enhance UAV communication, as demonstrated in recent work on hybrid routing schemes for 3D UAV networks [6]. Consequently, UAV-assisted edge computing networks are a natural and promising paradigm, where optimizing UAV flight paths to meet the communication and computing demands of numerous devices is a crucial and challenging issue.
Existing research primarily focuses on optimizing edge computing with fixed ground base stations. Some of the latest studies propose utilizing UAVs and satellite-assisted Space–Air–Ground Integrated Networks (SAGINs) to meet the service needs of remote IoT applications. Some studies [7,8] have employed SAGINs for online offloading decisions of ground IoT tasks and have used linear programming methods to solve stochastic strategies aimed at minimizing the expected long-term average latency. Meanwhile, machine learning techniques, such as deep learning [9], have been introduced into SAGIN-assisted MEC to optimize UAV positioning and resource scheduling, effectively reducing energy consumption. In addition to these machine learning approaches, [10] introduced deep reinforcement learning methods for resource allocation and UAV position optimization, achieving significant results by balancing system efficiency with energy savings. Moreover, applications of reinforcement learning in optimizing resource allocation in dynamic environments can benefit from considering mobility aspects in mobile distributed computing, as explored in [11,12]. However, current research based on reinforcement learning is often limited to single UAV resource allocation or multi-UAV position deployment, lacking exploration of multi-UAV cooperation for comprehensive resource scheduling and system optimization.
Although current research has made progress in the collaborative optimization of cloud-edge-local systems, further exploration is needed regarding the cooperative path design and system optimization among multiple UAVs to meet the requirements of cloud-edge-local network resource scheduling and traffic load balancing.

2.2. UAV Link Topology Prediction

In UAV networks, link topology prediction is a crucial task, and the application of machine learning algorithms in complex networks has made significant progress [13]. However, directly applying machine learning algorithms to network analysis has certain limitations, as the complex structure of networks needs to be transformed into a low-dimensional vector space through graph embedding techniques for more flexible and effective computation [4]. To address these limitations, various graph embedding techniques have been introduced.
To overcome the challenge of obtaining node pair representations in networks, Rahman et al. [14] proposed the DyLink2Vec algorithm, which models the network embedding task as an optimization encoding problem, minimizing reconstruction errors via gradient descent. Goyal et al. [15] introduced the DynGEM algorithm, which maps dynamic graph data into a highly nonlinear latent space using dynamically extended deep autoencoders and captures connectivity trends of graph snapshots at any given time period through a decoder to enhance the precision of link predictions.
To better capture the evolution patterns of links in temporal networks, Li et al. [16] proposed the DDNE framework based on autoencoders. It utilizes historical information from past temporal network snapshots to learn the latent representation of future networks. Additionally, this framework employs Gated Recurrent Units (GRUs) to adaptively capture the dependencies of networks at different scales, improving the framework’s ability to model dynamic link changes. In parallel, with advancements in Graph Neural Networks (GNNs), Pareja et al. [17] proposed the EvolveGCN algorithm based on GNNs. They built an encoder using Graph Convolutional Networks (GCNs) [18] and Recurrent Neural Networks (RNNs) to transform the adjacency matrices and node features of historical network snapshots into corresponding network embeddings. A Multi-Layer Perceptron (MLP) serves as the decoder to obtain the predicted network structure with enhanced accuracy in evolving network conditions.
Further extending these approaches, Min et al. [19] proposed STGSN from temporal and spatial perspectives, modeling the network from both dimensions and using a temporal attention mechanism to capture the network’s temporal features, thus enhancing the algorithm’s interpretability and enabling a more nuanced understanding of network evolution.
The advantages and disadvantages of the proposed GA–GLU algorithm compared to current popular link prediction methods are shown in Table 1.

2.3. Traffic Load Balancing in Cloud-Edge-Local Networks

In cloud-edge-local networks, traffic is categorized into elephant flows and mice flows, with mice flows accounting for roughly 90% of all data flows. A few elephant flows can account for the majority of the traffic volume, which may lead to significant changes in the network environment and ultimately result in link congestion. Additionally, a large number of elephant-flow packets can occupy the switch's buffer queue, resulting in head-of-line blocking and increased latency for mice flows. Cloud-edge-local networks typically use the Equal-Cost Multi-Path (ECMP) algorithm [20], which distributes each flow evenly across paths using a hash function. However, ECMP cannot detect congestion in the network: when elephant flows are present, it may assign multiple elephant flows to the same link or path, causing or further exacerbating congestion and significantly reducing throughput. Moreover, rapidly changing network environments make traditional load-balancing schemes insufficiently responsive to real-time conditions, which often leads to network overload.
The Hedera algorithm [21] selects the optimal forwarding path using simulated annealing and global first-fit matching, effectively avoiding elephant-flow congestion and improving bandwidth utilization and network performance. However, it only addresses elephant-flow conflicts, neglecting the impact of mice flows on the network. Mao et al. [22] used deep learning to train convolutional neural networks (CNNs) for each path combination under different network scenarios, determining whether the generated paths are congested based on input traffic patterns. The dynamic traffic scheduling strategy proposed by Liu et al. routes elephant flows based on bandwidth allocation to improve network quality. Sun et al. [23] applied control theory and deep reinforcement learning to reduce network transmission delays, using TIDE as the DRL mechanism for routing optimization; however, because TIDE relies on modifying link weights, the optimal routing path can only be obtained indirectly through the shortest-path algorithm [23]. Other researchers have proposed ANN-based routing optimization methods for SD-DCN topologies, improving network performance [24] and enhancing network quality [25].
Our algorithm also focuses on the impact of mice flows on network congestion. By combining link prediction results, it employs reinforcement learning algorithms to achieve load balancing in cloud-edge-local networks.

3. Methodology

3.1. Design of Cloud-Edge-Local Architecture

This section explores the design scheme for an elastic resource scheduling architecture based on UAVs in cloud-edge-local networks. The core design concept integrates UAVs into the cloud, edge, and local layers, creating a comprehensive and layered computing and communication framework. This design aims to leverage the unique advantages of UAVs, including dynamic mobility and high flexibility, to achieve efficient and flexible network resource scheduling, thereby ensuring low latency and high reliability of services. Additionally, we use the fat-tree topology for the architecture. The fat-tree topology is highly scalable and excels in handling large-scale network traffic and meeting high-density computing needs.
In the cloud layer, UAVs are assigned significant data processing and storage responsibilities, partially replacing traditional data center functions. UAVs in this layer focus on performing large-scale data analysis tasks, leveraging their rapid deployment capabilities to provide instant computing resources for specific areas or tasks. This also allows UAVs to integrate with existing cloud infrastructure, providing a seamless extension of cloud services.
In the edge layer, UAVs are closer to ground users or sensor networks, primarily undertaking data preprocessing and real-time analysis tasks. At this level, UAVs are flexibly scheduled to adjust their positions and assigned tasks based on actual network conditions and data processing needs, ensuring low latency and high efficiency in data processing. Multi-UAV cooperation at this layer is particularly important for achieving coordinated and efficient data processing, as multiple UAVs work together to balance computational loads and improve service reliability.
In the local layer, UAV deployment is further decentralized and specialized. These UAVs interact directly with ground devices or users and are responsible for collecting data and performing simple edge computing tasks. The deployment of local-layer UAVs significantly enhances network coverage and the real-time collection of data. This decentralized deployment ensures greater system robustness and scalability, allowing the architecture to be integrated with existing sensor networks and communication systems.
To ensure the efficient operation of this layered architecture, this paper proposes a comprehensive management and scheduling mechanism covering key aspects such as link prediction and traffic load balancing. In contrast to a traditional ground-based station, the scheduling entity acts as a network node that coordinates tasks among UAVs at all layers and dynamically adjusts their flight paths, ensuring the efficient utilization of network resources and the smooth completion of UAV tasks. Coordination among UAVs at different layers facilitates better resource allocation and management, particularly in scenarios that require rapid responses and real-time data processing. The architecture is shown in Figure 1.

3.2. GA–GLU Link Prediction

3.2.1. Architecture

In this study, we designed the GA–GLU algorithm for link prediction in UAV-based cloud-edge-local networks. This model integrates a Graph Autoencoder (GAE), Long Short-Term Memory (LSTM) networks, and Generative Adversarial Networks (GANs) to capture the topological features and temporal evolution patterns of UAV dynamic networks, generating high-quality predictive graph snapshots.
The GA–GLU model consists of three key components, as illustrated in Figure 2.
First, we utilize a Graph Autoencoder (GAE) to learn the topological features of each individual graph snapshot in the dynamic network. The GAE maps the graph snapshot to a low-dimensional representation through an encoder. Then, we introduce Long Short-Term Memory (LSTM) networks to capture the temporal evolution patterns of the dynamic network. LSTM networks effectively remember and learn long-term dependencies in time-series data, helping us better understand the evolution of dynamic networks.
In the model combining GAE and LSTM for graph prediction, the decoder layer of the GAE is primarily used during the training phase to verify whether the generated node embeddings retain the graph’s structural information. The reconstruction ability of the GAE decoder is mainly used to validate the quality of the embeddings by optimizing both the reconstruction loss and the time series prediction loss during training.
We apply Generative Adversarial Networks (GANs) to generate high-quality predictive graph snapshots. In the GANs, we combine a GAE and LSTM networks as the generator to produce predictive graph snapshots. Additionally, we introduce a fully connected discriminator network to distinguish between generated graph snapshots and real records. Through the adversarial process, the generator is trained to produce high-quality predictive results, while the discriminator is trained to effectively differentiate between generated graph snapshots and real records.
In UAV networks, snapshots are represented as discrete time slices $G_{t_k} = \{V_{t_k}, E_{t_k}\}$, where $V_{t_k}$ and $E_{t_k}$ denote the sets of nodes and edges at time $t_k$, respectively. The weight $w_{ij}$ of a link between nodes $v_i$ and $v_j$ is calculated using the following equation:

$$
w_{ij} =
\begin{cases}
\dfrac{D_i \cdot \log_2\left(1 + \gamma_{ij}\right)}{d_{ij}}, & d_{ij} \le D_i \\[4pt]
0, & d_{ij} > D_i
\end{cases}
$$

where $d_{ij}$ is the distance between the nodes, $D_i$ is the maximum communication radius, and $\gamma_{ij}$ is the signal-to-noise ratio.
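As a concrete illustration, the following minimal Python sketch evaluates this piecewise weight; the scalar inputs are assumed to come from the simulation environment, and distinct nodes are assumed to satisfy $d_{ij} > 0$.

```python
import math

def link_weight(d_ij: float, D_i: float, gamma_ij: float) -> float:
    """Piecewise link weight w_ij between UAV nodes v_i and v_j.

    d_ij: inter-node distance (assumed > 0 for distinct nodes),
    D_i: maximum communication radius of node v_i,
    gamma_ij: signal-to-noise ratio of the link.
    """
    if d_ij > D_i:
        return 0.0  # beyond the communication radius: no link
    return D_i * math.log2(1.0 + gamma_ij) / d_ij
```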
By integrating a GAE, LSTM networks, and GANs, the GA–GLU model comprehensively captures the topological features and temporal evolution patterns of UAV dynamic networks, generating high-quality predictive graph snapshots and thereby achieving effective temporal link predictions. The algorithm is shown in Algorithm 1.
Algorithm 1 GA–GLU Prediction Algorithm
Input: Graph snapshots {G_1, G_2, …, G_T}
Output: Predicted future graph snapshot Ĝ_{T+1}
1: // Initialize GAE, LSTM, and GAN networks
2: Initialize GAE encoder Enc and decoder Dec
3: Initialize LSTM network LSTM
4: Initialize GAN generator G and discriminator D
5: // Process each graph snapshot
6: for t = 1 to T do
7:    z_t ← Enc(G_t) ▹ Encode the current graph snapshot
8:    Ĝ_t ← Dec(z_t) ▹ Decode to reconstruct the graph
9:    Feed z_t into LSTM ▹ Update the LSTM hidden state
10: end for
11: // Use the LSTM output to predict the next state
12: z_{T+1} ← LSTM(last hidden state)
13: Ĝ_{T+1} ← Dec(z_{T+1}) ▹ Decode to predict the next graph snapshot
14: // Train GAN
15: repeat
16:    Sample mini-batch of real snapshots {G_i}
17:    Generate fake snapshots {Ĝ_i} from G
18:    Train discriminator D to distinguish real from fake
19:    Train generator G to fool D
20: until convergence
21: return Ĝ_{T+1} ▹ Return the predicted future graph snapshot
The GA–GLU prediction algorithm leverages a combination of a Graph Autoencoder (GAE), Long Short-Term Memory (LSTM) networks, and Generative Adversarial Networks (GANs) to predict future UAV network topologies. The algorithm starts by encoding each graph snapshot $G_t$ into a low-dimensional latent representation $z_t$ using the GAE encoder $Enc$, followed by reconstructing the snapshot $\hat{G}_t$ through the decoder $Dec$. The latent representations $z_t$ are sequentially fed into the LSTM network, which captures the temporal dependencies across snapshots and uses its final hidden state to predict the latent representation $z_{T+1}$ for the future graph. This predicted latent representation is then decoded into the future graph $\hat{G}_{T+1}$ using $Dec$. To further refine the predicted graph, a GAN consisting of a generator G and a discriminator D is employed: the generator produces fake graph snapshots, while the discriminator is trained to distinguish real graphs from fake ones. Through iterative training, the generator learns to produce more realistic graphs, ensuring the final prediction closely resembles the actual network structure.

3.2.2. GAE Layer

We utilize a Graph Autoencoder (GAE) to model the local topological structure of each individual graph snapshot in dynamic networks. The GAE consists of a Graph Convolutional Network (GCN) encoder layer and an inner-product decoder layer. The GCN is an efficient variant of convolutional neural networks that operates directly on graphs. Formally, assume a static graph with N nodes, where each node has M-dimensional features (or attributes). The topological structure and node attributes can be represented by an adjacency matrix $A \in \mathbb{R}^{N \times N}$ and a feature matrix $Z \in \mathbb{R}^{N \times M}$ (where the i-th row of $Z$ corresponds to the feature vector of node i), respectively. A typical GCN unit takes the feature matrix $Z$ as input and performs spectral graph convolution (a local first-order approximation with respect to $A$). The final output is generated in the same way as in a standard fully connected layer. The overall operation of a specific GCN unit can be succinctly defined as follows:
$$X = \mathrm{GCN}(Z, A) = f\left(\hat{D}^{-1/2} \hat{A} \hat{D}^{-1/2} Z W\right)$$
In the above equation, $\hat{D}^{-1/2} \hat{A} \hat{D}^{-1/2}$ is the approximate graph convolution filter, where $\hat{A} = A + I_N$ ($I_N$ is the N-dimensional identity matrix) and $\hat{D}_{ii} = \sum_{j=1}^{N} \hat{A}_{ij}$; $W$ represents the weight matrix; $f(\cdot)$ is the activation function; and $X$ is the representation output given by the GCN unit.
For the task of temporal link prediction over multiple static graphs, the GA–GLU model maintains a GCN unit $\mathrm{GCN}(Z, A_t) = X_t$ for each input graph snapshot $A_t$ ($t \in \{\tau-l, \dots, \tau\}$). In our model, the feature matrix $Z$ is set as the noise input to the generative network, where the values of $Z$ are drawn from a certain probability distribution (e.g., a uniform distribution). Based on the input $Z$ and $A_{\tau-l}^{\tau} = \{A_{\tau-l}, \dots, A_{\tau}\}$, the GCN layer of the generative network outputs a series of representations, denoted as $X_{\tau-l}^{\tau} = \{X_{\tau-l}, \dots, X_{\tau}\}$. The output of the encoder is then fed into the LSTM layer.
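A minimal PyTorch sketch of such a GCN unit, assuming dense tensors and the sigmoid activation stated in Section 3.2.4, is given below; tensor shapes are illustrative.

```python
import torch

def gcn_unit(Z: torch.Tensor, A: torch.Tensor, W: torch.Tensor) -> torch.Tensor:
    """Compute X = f(D_hat^{-1/2} A_hat D_hat^{-1/2} Z W) with sigmoid f.

    Z: (N, M) feature (noise) matrix, A: (N, N) adjacency matrix,
    W: (M, F) trainable weight matrix.
    """
    N = A.size(0)
    A_hat = A + torch.eye(N)              # A_hat = A + I_N (add self-loops)
    d = A_hat.sum(dim=1)                  # D_hat_ii = sum_j A_hat_ij
    D_inv_sqrt = torch.diag(d.pow(-0.5))  # D_hat^{-1/2}
    return torch.sigmoid(D_inv_sqrt @ A_hat @ D_inv_sqrt @ Z @ W)
```

Applying this unit to each snapshot $A_t$ with the shared noise input $Z$ yields the representation $X_t$ that is passed on to the LSTM layer.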

3.2.3. LSTM Layer

In the GA–GLU model, the learned comprehensive network representations are fed into an LSTM layer. The LSTM layer has a strong capability to learn long-term dependencies in sequential data, capturing the evolution patterns of weighted dynamic networks. The standard LSTM architecture can be described as an encapsulated unit with multiple multiplicative gate units. For a specific time step t, the LSTM unit receives the current input vector $x_t$ and the state vector $h_{t-1}$ from the previous time step as inputs and then outputs the state vector $h_t$ for the current time step. The LSTM–decoder structure is shown in Figure 3.
$$i_t = \sigma\left(W_{xi} x_t + W_{hi} h_{t-1} + b_i\right)$$

$$f_t = \sigma\left(W_{xf} x_t + W_{hf} h_{t-1} + b_f\right)$$

$$o_t = \sigma\left(W_{xo} x_t + W_{ho} h_{t-1} + b_o\right)$$

$$s_t = f_t \odot s_{t-1} + i_t \odot \hat{s}_t$$

$$\hat{s}_t = \sigma\left(W_{xs} x_t + W_{hs} h_{t-1} + b_s\right)$$

$$h_t = o_t \odot \tanh\left(s_t\right)$$
In the above equations, $i_t$, $f_t$, $o_t$, and $s_t$ represent the input gate, forget gate, output gate, and memory cell, respectively. The parameters $W_{x\cdot}$, $W_{h\cdot}$, and $b$ correspond to the weights and biases of the units, which control how information flows through the network. The activation function $\sigma$ is a sigmoid, ensuring the gates operate within the range [0, 1]. The operator $\odot$ denotes element-wise multiplication, enabling selective updates to the cell state. Finally, we take the last hidden state as the distributed representation of the historical snapshots and feed it into a fully connected layer to generate the prediction result $\tilde{A}_{\tau+1}$.
Since the LSTM framework has the ability to learn temporal information from sequential data, it can be directly used to handle temporal link prediction tasks with a multiple-input, single-output setup. However, there are still some limitations in predicting weighted dynamic networks. Specifically, to learn the temporal information of dynamic networks, the LSTM is usually trained using the mean squared error (MSE) loss function.
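The gate equations above map directly onto a hand-rolled LSTM cell. The sketch below follows the text's notation, including the sigmoid candidate state $\hat{s}_t$; the dimensions are illustrative assumptions.

```python
import torch
import torch.nn as nn

class LSTMCellSketch(nn.Module):
    """LSTM cell mirroring the gate equations of Section 3.2.3."""

    def __init__(self, in_dim: int, hid_dim: int):
        super().__init__()
        # One affine map W_x* x_t + W_h* h_{t-1} + b_* per gate
        self.gates = nn.ModuleDict(
            {g: nn.Linear(in_dim + hid_dim, hid_dim) for g in ("i", "f", "o", "s")})

    def forward(self, x_t, h_prev, s_prev):
        z = torch.cat([x_t, h_prev], dim=-1)
        i_t = torch.sigmoid(self.gates["i"](z))    # input gate
        f_t = torch.sigmoid(self.gates["f"](z))    # forget gate
        o_t = torch.sigmoid(self.gates["o"](z))    # output gate
        s_hat = torch.sigmoid(self.gates["s"](z))  # candidate state (sigma, as in the text)
        s_t = f_t * s_prev + i_t * s_hat           # memory cell update
        h_t = o_t * torch.tanh(s_t)                # hidden state output
        return h_t, s_t
```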

3.2.4. GAN Layer

To address the sparsity and wide range of edge weights in dynamic networks, we enhance the generation capability of the LSTM using a GAN framework. Generally, a GAN consists of a generator G and a discriminator D, which compete in a minimax game: D attempts to distinguish between real data from the training set and data generated by G, while G aims to deceive D by generating high-quality samples. Formally, this process can be described as follows (with two alternating optimization steps):
$$\min_G \max_D \; \mathbb{E}_{x \sim P_{\mathrm{data}}(x)}\left[\log D(x)\right] + \mathbb{E}_{z \sim P(z)}\left[\log\left(1 - D(G(z))\right)\right]$$
The input data $x$ are drawn from the training set, and $z$ represents noise generated from a certain probability distribution $P(z)$ (e.g., a uniform distribution). As in the standard GAN framework, our model optimizes two neural networks through a minimax two-player game, namely the generator G and the discriminator D. In this model, D attempts to distinguish real graph snapshots from the training data from snapshots generated by G, while G maximizes the probability of D making an error. This adversarial process is expected to eventually refine G to produce realistic, high-quality weighted links. The two neural networks are described as follows:
(a) Discriminator Network D: We implement the discriminator model D as a fully connected feedforward neural network with one hidden layer and one output layer. During training, D alternately takes the output of G, $\tilde{A}_{\tau+1}$, or the ground truth, $A_{\tau+1}$, as input. Since each input data point of a fully connected neural network is typically represented as a vector (rather than a matrix), we reshape the matrix inputs (i.e., $\tilde{A}_{\tau+1}$ or $A_{\tau+1}$) into corresponding row-major vectors before feeding them into D. We train the model within the Wasserstein GAN (WGAN) [26] framework, setting the output layer as a linear layer that directly generates outputs without a nonlinear activation function. In summary, the discriminator network D is defined as follows:

$$D(A') = \sigma\left(A' W_h^D + b_h^D\right) W_o^D + b_o^D$$

where $A'$ represents the reshaped row vector of $\tilde{A}_{\tau+1}$ or $A_{\tau+1}$; $\{W_h^D, b_h^D\}$ and $\{W_o^D, b_o^D\}$ are the parameters of the hidden and output layers, respectively; and $\sigma(\cdot)$ denotes the sigmoid activation function of the hidden layer.

Given that the edge weights of a network snapshot can vary widely (e.g., [0, 2000]), we normalize the values of $A_{\tau+1}$ to the [0, 1] range when using $A_{\tau+1}$ as the input for D. The raw prediction results from G, defined within the [0, 1] range, can thus be used directly as inputs for D.
(b) Generator Network: The generator model G comprises a GCN layer, an LSTM layer, and a fully connected output layer. The GCN layer takes the sequence of graph snapshots $A_{\tau-l}^{\tau}$ and the noise $Z$ as inputs and outputs a representation sequence $X_{\tau-l}^{\tau}$, which is subsequently fed into the LSTM layer.

Note that each adjacency matrix input $A_t$ ($t \in \{\tau-l, \dots, \tau\}$) should be normalized to the [0, 1] range before being fed into the GCN layer. We use the sigmoid function as the activation for all GCN units, and the noise input $Z$ follows a uniform distribution over [0, 1] (denoted $Z \sim U(0, 1)$).

The LSTM layer takes the representation sequence $X_{\tau-l}^{\tau}$ from the GCN layer as input and outputs the hidden states $h_{\tau-l}^{\tau} = \{h_{\tau-l}, \dots, h_{\tau}\}$. It is important to note that each matrix input $X_t$ ($t \in \{\tau-l, \dots, \tau\}$) should be reshaped into a row-major vector when fed into the LSTM, since each LSTM unit treats its input as a vector. Finally, the last hidden state $h_{\tau}$ is fed into the output layer to generate the graph snapshot for the next time step (in the form of corresponding row vectors). Each element of the generated result lies within the [0, 1] range; by applying the inverse of the normalization, we obtain the final predicted snapshot $\tilde{A}_{\tau+1}$ with the correct value range for the dynamic network.
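Putting these pieces together, the sketch below outlines the generator and the WGAN critic described above. The layer widths, the single GCN weight matrix, and the noise dimension are our assumptions rather than the paper's exact configuration.

```python
import torch
import torch.nn as nn

class Discriminator(nn.Module):
    """Critic D(A') = sigmoid(A' W_h + b_h) W_o + b_o; linear output for WGAN."""

    def __init__(self, n_nodes: int, hidden: int = 256):
        super().__init__()
        self.hidden = nn.Linear(n_nodes * n_nodes, hidden)
        self.out = nn.Linear(hidden, 1)  # no activation on the output layer

    def forward(self, A: torch.Tensor) -> torch.Tensor:
        a_vec = A.reshape(-1)            # row-major vector of the (normalized) snapshot
        return self.out(torch.sigmoid(self.hidden(a_vec)))


class Generator(nn.Module):
    """GCN layer + LSTM layer + fully connected output layer."""

    def __init__(self, n_nodes: int, noise_dim: int, emb_dim: int, hid_dim: int):
        super().__init__()
        self.W = nn.Parameter(0.1 * torch.randn(noise_dim, emb_dim))  # shared GCN weights
        self.lstm = nn.LSTM(n_nodes * emb_dim, hid_dim, batch_first=True)
        self.out = nn.Linear(hid_dim, n_nodes * n_nodes)
        self.n, self.noise_dim = n_nodes, noise_dim

    def _gcn(self, Z, A):                # X = sigmoid(D^{-1/2} A_hat D^{-1/2} Z W)
        A_hat = A + torch.eye(self.n)
        D_is = torch.diag(A_hat.sum(1).pow(-0.5))
        return torch.sigmoid(D_is @ A_hat @ D_is @ Z @ self.W)

    def forward(self, A_seq):            # A_seq: snapshots normalized to [0, 1]
        Z = torch.rand(self.n, self.noise_dim)            # noise Z ~ U(0, 1)
        X = [self._gcn(Z, A).reshape(-1) for A in A_seq]  # row-major vectors per step
        h, _ = self.lstm(torch.stack(X).unsqueeze(0))     # temporal dependencies
        pred = torch.sigmoid(self.out(h[0, -1]))          # last hidden state -> [0, 1]
        return pred.view(self.n, self.n)                  # prediction before inverse normalization
```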

3.3. FERL-LB Algorithm

3.3.1. Architecture

With the rapid advancement of UAV technology, UAV networks are playing an increasingly important role in cloud-edge-local architectures. Due to the highly dynamic nature and environmental uncertainties of UAV networks, traditional network resource scheduling and traffic load-balancing strategies often fail to meet their demands. This section aims to propose a reinforcement learning-based FERL-LB algorithm that combines the fat-tree network topology of UAVs and the GA–GLU link prediction method to optimize elastic resource scheduling and traffic load balancing in cloud-edge-local networks. The core of the FERL-LB algorithm is a reinforcement learning-based decision engine that learns and adapts in real time to changes in network conditions to optimize traffic routing decisions.
To enhance the performance of traffic load balancing in UAV cloud-edge-local networks under dynamic and uncertain environments, this study introduces a traffic management strategy that incorporates reinforcement learning, namely the FERL-LB algorithm, using a Double Deep Q-Network (DDQN). This algorithm intelligently predicts and adjusts traffic routing to adapt to rapid changes in network conditions, thereby optimizing network resource utilization and improving data transmission efficiency.
The framework of the FERL-LB algorithm is shown in Figure 4.
The FERL-LB algorithm is based on a Double Deep Q-Network (DDQN) to optimize routing decisions in network environments. Initially, a primary network $Q$ and a target network $\hat{Q}$ are initialized with random weights $\theta$ and $\theta^- = \theta$, respectively. The replay memory $D$ is also initialized to store transitions for experience replay. The algorithm operates in an iterative loop until convergence.

At each iteration, the current network state is observed, and future link states are predicted using the link prediction model. Based on the $\epsilon$-greedy policy, an action $a$ is selected from the Q-network, which dictates the rerouting of network traffic. The algorithm then observes the new state $s'$ and computes the reward $r$, storing the transition $(s, a, r, s')$ in memory $D$.

A mini-batch of transitions is sampled from $D$ to train the DDQN. The target value $y_j$ is computed using the target network $\hat{Q}$, while the $Q$ network is updated through gradient descent. The target network $\hat{Q}$ is periodically reset to the weights of the primary network $Q$. Additionally, in the event of predicted congestion, traffic flows are reallocated using the FlowBender mechanism to minimize congestion. The algorithm is shown in Algorithm 2.
Algorithm 2 FERL-LB Algorithm Using DDQN
Input: Network state, link prediction model
Output: Optimized routing decisions
1: // Initialize the network and DDQN model
2: Initialize primary network Q with random weights θ
3: Initialize target network Q̂ with weights θ⁻ = θ
4: Initialize replay memory D
5: // Main loop
6: while not converged do
7:    s ← observe current network state
8:    predictedLinks ← predict links using the link prediction model
9:    a ← select action using ε-greedy policy from Q
10:   Execute action a, reroute traffic accordingly
11:   s′ ← observe new network state
12:   r ← compute reward from (s, a, s′)
13:   Store transition (s, a, r, s′) in D
14:   // Sample a mini-batch from replay memory for learning
15:   Sample mini-batch of transitions (s_j, a_j, r_j, s′_j) from D
16:   // DDQN update
17:   Set y_j ← r_j + γ Q̂(s′_j, arg max_{a′} Q(s′_j, a′; θ); θ⁻)
18:   Perform a gradient descent step on (y_j − Q(s_j, a_j; θ))² with respect to θ
19:   Every C steps, reset Q̂ = Q
20:   s ← s′
21:   // Adjust traffic based on link prediction and network status
22:   if congestion detected in predictedLinks then
23:      Reallocate flows to minimize congestion based on FlowBender
24:   end if
25: end while

3.3.2. Deep Q-Networks and Double Q-Learning

The Deep Q-Network (DQN) algorithm combines deep learning and reinforcement learning techniques, using a single Q-network to approximate the action-value function. This Q-network, denoted $Q(s, a; \theta)$, where $s$ represents the state, $a$ the action, and $\theta$ the network parameters, is trained via gradient descent to minimize the difference between predicted and target Q-values. The DQN algorithm leverages experience replay to enhance learning efficiency and stability. However, it suffers from overestimation bias, which can degrade performance during training.
To address this limitation, the Double Deep Q-Network (DDQN) algorithm extends the DQN by incorporating two separate Q-networks: one for action selection and the other for action evaluation. This separation reduces overestimation by decoupling the selection of the best action from the evaluation of its value. Specifically, the first network selects the action, while the second network evaluates the Q-value of this selected action. This dual-network approach significantly improves stability and reduces bias.
The loss function in the DDQN is given by
$$L(\theta) = \left( R + \gamma\, Q\left(s', \arg\max_{a'} Q(s', a'; \theta); \theta^{-}\right) - Q(s, a; \theta) \right)^2$$

where $L(\theta)$ is the loss function, $R$ is the reward, $\gamma$ is the discount factor, $s'$ is the next state, $a'$ is the next action, $\theta^{-}$ denotes the parameters of the target Q-network, and $Q(s, a; \theta)$ is the estimated Q-value. The target network parameters $\theta^{-}$ are periodically updated to improve the stability of training.
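A minimal PyTorch sketch of one DDQN gradient step under this loss is shown below; the network architectures and batch layout are assumptions, and only the decoupled action selection/evaluation is essential.

```python
import torch
import torch.nn as nn

def ddqn_update(q_net: nn.Module, target_net: nn.Module,
                optimizer: torch.optim.Optimizer, batch, gamma: float = 0.99):
    """One gradient step on L(theta); batch = (states, actions, rewards, next_states)."""
    states, actions, rewards, next_states = batch
    with torch.no_grad():
        # Online network selects the next action: arg max_a' Q(s', a'; theta)
        next_actions = q_net(next_states).argmax(dim=1, keepdim=True)
        # Target network evaluates it: Q(s', a*; theta^-)
        next_q = target_net(next_states).gather(1, next_actions).squeeze(1)
        targets = rewards + gamma * next_q
    q_vals = q_net(states).gather(1, actions.unsqueeze(1)).squeeze(1)  # Q(s, a; theta)
    loss = nn.functional.mse_loss(q_vals, targets)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

Periodically copying the online weights into the target network (`target_net.load_state_dict(q_net.state_dict())`) completes the update scheme of Algorithm 2.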

3.3.3. Traffic Forwarding Design

In this subsection, we delve into the intricacies of traffic forwarding design within a network environment. Our focus is on comprehensively understanding the network state, devising effective actions for data flow routing, formulating appropriate reward functions for optimizing different types of traffic, and employing advanced link adjustment techniques to mitigate congestion. By leveraging these components, we aim to enhance the overall efficiency and reliability of network traffic management. The critical components of traffic forwarding design are as follows:
(1) State
In the current network environment, the port load of the switches and the bandwidth of the links are defined as the network state. The network state space can be represented as $S = \{[P_{S_i,t}, U_{S_i,t}]\},\ i \in [1, n]$, where $S$ is the network state space, $P_{S_i,t}$ is the port load of the i-th switch at time t, $U_{S_i,t}$ is the link bandwidth utilization of the i-th switch at time t (with values ranging from 0 to 1), and n is the number of switches.
(2) Action
An action a in the data flow implies selecting a subset of nodes $a_{p_n}$ from the network, constructing a list of nodes and their corresponding links to form one or more paths, thus transmitting traffic from the source to the destination. Each action corresponds to a specific end-to-end path, and the action space A is expressed as

$$A = [a_{p_1}, a_{p_2}, \dots, a_{p_n}]$$

Here, $a_{p_n}$ refers to a specific subset of nodes involved in constructing one of the possible paths in the network. The action space A encompasses all potential paths that can be selected, where each path consists of a sequence of nodes facilitating the transmission of traffic.
(3) Reward
Different reward functions are designed for large flows and small flows using CNNs. For large flows, which require high throughput and low packet loss, the reward function is given by
$$R_L = \alpha_L H_{L,A} + \beta_L \left(1 - L_{L,A}\right)$$

where $R_L$ is the reward function for large flows; $H_{L,A}$ is the average throughput of large flows; $L_{L,A}$ is the average packet loss rate of large flows; and $\alpha_L$ and $\beta_L$ are weight coefficients with $\alpha_L + \beta_L = 1$.
For small flows, which require low latency and low packet loss, the reward function is expressed as
$$R_N = \alpha_N \left(1 - t_{N,A}\right) + \beta_N \left(1 - L_{N,A}\right)$$

where $R_N$ is the reward function for small flows; $t_{N,A}$ is the average round-trip time of small flows; $L_{N,A}$ is the average packet loss rate of small flows; and $\alpha_N$ and $\beta_N$ are weight coefficients with $\alpha_N + \beta_N = 1$.
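As a small illustration, both reward functions reduce to weighted sums. In the sketch below, the default weights and the assumption that throughput, round-trip time, and loss rate are normalized to [0, 1] are ours.

```python
def reward_large(throughput_avg: float, loss_avg: float, alpha_l: float = 0.7) -> float:
    """R_L = alpha_L * H_{L,A} + beta_L * (1 - L_{L,A}), with beta_L = 1 - alpha_L."""
    return alpha_l * throughput_avg + (1.0 - alpha_l) * (1.0 - loss_avg)


def reward_small(rtt_avg: float, loss_avg: float, alpha_n: float = 0.5) -> float:
    """R_N = alpha_N * (1 - t_{N,A}) + beta_N * (1 - L_{N,A}), with beta_N = 1 - alpha_N."""
    return alpha_n * (1.0 - rtt_avg) + (1.0 - alpha_n) * (1.0 - loss_avg)
```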
(4) Link Adjustment
When the link prediction model detects congestion on a predicted link, the FERL-LB algorithm employs a FlowBender-based traffic redistribution mechanism to alleviate the congestion. It predicts future link states using a link prediction model and, in conjunction with the current network status, identifies links that may experience congestion. When the predicted traffic for a link exceeds its capacity threshold, it is considered to be on the verge of congestion. Upon detecting a congested link, the algorithm makes traffic redistribution decisions based on the FlowBender mechanism [27]. The core idea of FlowBender is to optimize traffic path selection to minimize congestion within the network. To perform traffic redistribution, the algorithm first determines alternative paths. The selection of these alternative paths is based on their available bandwidth and ability to handle the redistributed traffic, thereby avoiding further congestion on the new paths. Once the alternative paths are identified, the algorithm shifts a portion of the traffic from the congested link to these alternative paths. This process can be implemented by adjusting the routing table to ensure that data packets are forwarded along the new paths.
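To make the redistribution logic concrete, here is a schematic sketch; the data structures (per-link predicted load and capacity, candidate paths annotated with available bandwidth) are hypothetical stand-ins for the controller's state, not FlowBender's actual interface.

```python
def reallocate_flows(predicted_load, capacity, flow_paths, candidates, threshold=0.9):
    """Shift flows away from links predicted to congest (FlowBender-style sketch).

    predicted_load, capacity: dicts keyed by link id;
    flow_paths: flow id -> current path (iterable of link ids);
    candidates: flow id -> list of (path, available_bandwidth) alternatives.
    """
    congested = {link for link, load in predicted_load.items()
                 if load > threshold * capacity[link]}  # links on the verge of congestion
    for flow, path in flow_paths.items():
        if not congested & set(path):
            continue  # flow does not traverse a congested link
        # Choose the alternative path with the most available bandwidth
        best_path, best_bw = max(candidates[flow], key=lambda pb: pb[1])
        if best_bw > 0:
            flow_paths[flow] = best_path  # update the routing table entry
    return flow_paths
```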

4. Experiment

4.1. Link Prediction

This study uses the AirSim simulation platform to simulate complex network topologies and UAV flight environments. The model is validated using classic link prediction evaluation metrics: AUC, MAP (mean average precision), and error rate [20].
As shown in Table 2, a total of 36 UAVs are deployed in a custom airspace, each with a maximum speed of 15 m/s. The UAVs start at an altitude of 100 m and have a communication range of 500 m. The simulation is divided into two phases: from 0 to 500 s, the UAV swarm is in the scanning phase, spreading outwards from the starting point. After 500 s, the UAVs enter the hovering phase, maintaining coverage of the target airspace. A distributed control algorithm is used to manage the UAVs.
To validate the effectiveness of the proposed GA–GLU link prediction algorithm, we conduct a comparison with several representative methods. These include the classical CN and Katz algorithms; the Node2vec method, which relies on node embeddings [28]; and the GAE–LSTM method, which is based on graph embeddings. Unlike GA–GLU, GAE–LSTM does not utilize a GAN component. The comparison is designed to highlight the specific improvements introduced by the GA–GLU algorithm.
In similarity-based link prediction methods, the probability of a connection between nodes is based on the similarity between those nodes. The AUC is calculated as follows:
$$\mathrm{AUC} = \frac{M' + 0.5\,M''}{M}$$

where M is the total number of comparisons, M′ is the number of times the score of an edge in the positive set is greater than that of an edge in the negative set, and M″ is the number of times the scores are equal. The larger M′ and M″ are, the higher the probability that positive-set scores exceed negative-set scores, indicating higher prediction accuracy.
The MAP is the area under the precision–recall (P–R) curve:
$$\mathrm{MAP} = \frac{\sum_{t=1}^{N} \mathrm{AP}(t)}{N}$$

$$\mathrm{AP} = \frac{1}{n} \sum_{i=1}^{n} \frac{\left|\left\{ j : a_{i,j} = 1 \ \wedge\ \hat{a}_{i,j} = 1 \right\}\right|}{\left|\left\{ j : a_{i,j} = 1 \right\}\right|}$$
where MAP measures the accuracy of the prediction results. Specifically, MAP is computed as the mean of the average precision (AP) scores across all queries or tasks. The AP score itself quantifies the precision of the predicted results relative to the number of relevant items, averaged over all possible ranks.
The error rate is the ratio of incorrectly predicted links $LN_{\mathrm{false}}$ to all actual links $LN_{\mathrm{true}}$. It is calculated as

$$\mathrm{Error\ Rate} = \frac{LN_{\mathrm{false}}}{LN_{\mathrm{true}}}$$
The error rate measures the proportion of links that are incorrectly identified relative to the total number of actual links. Specifically, L N false represents the number of false links (incorrectly identified links), while L N true represents the number of true links (actual links). A higher error rate indicates a greater proportion of misidentified links.
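These metrics can be computed from sampled edge scores as in the sketch below; the positive/negative score lists are assumed to come from held-out test edges.

```python
import random

def auc(pos_scores, neg_scores, n_comparisons=10_000):
    """AUC = (M' + 0.5 * M'') / M via random positive-vs-negative comparisons."""
    better = equal = 0
    for _ in range(n_comparisons):
        p, q = random.choice(pos_scores), random.choice(neg_scores)
        if p > q:
            better += 1
        elif p == q:
            equal += 1
    return (better + 0.5 * equal) / n_comparisons


def error_rate(n_false_links: int, n_true_links: int) -> float:
    """Ratio of incorrectly identified links to actual links."""
    return n_false_links / n_true_links
```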
To ensure the validity of the experiments, all comparative experiments were conducted using data samples with the optimal model parameters proposed in this study. Each set of experiments was sampled 20 times to calculate the corresponding AUC, MAP, and error rate. The static network link prediction methods CN, Katz, and Node2vec were used to analyze the network status in the current snapshot to predict the network links at the next time point.
Figure 5 illustrates the prediction performance of different link prediction models under the same environment, as reflected by the AUC metric. It can be observed that deep learning-based link prediction methods, such as Node2vec-LSTM, GAE–LSTM, and our proposed GA–GLU, are relatively more stable compared to other methods. Overall, GA–GLU consistently achieves higher AUC values across 20 test samples than other methods, with CN and Katz exhibiting the poorest performance.
Figure 6 presents the MAP values obtained by different link prediction methods. It is evident that, except for the three deep learning-based link prediction methods, the MAP values of other methods remain below 0.5. Even though Node2vec-LSTM reaches 0.5 in the first case, its generalization performance is lower. The error rate metric similarly reflects that, aside from the three deep learning-based prediction methods, other methods fail to achieve satisfactory prediction results. Node2vec-LSTM exhibits more stable performance compared to Node2vec-LRCV, demonstrating that deep learning models possess better generalization than traditional machine learning models.
The experimental results presented in Figure 7 indicate that the GA–GLU algorithm consistently outperforms the other methods in terms of error rate, achieving the lowest values. In Table 3, the GA–GLU method proposed in this paper demonstrates clear advantages in UAV network link prediction. The reported results are averages over 20 sampling iterations, ensuring the robustness and reliability of the comparison. GA–GLU outperforms traditional methods such as CN and Katz across all key metrics, particularly AUC and MAP, achieving values of 0.946060 and 0.756796, respectively. This indicates that GA–GLU has superior predictive accuracy and is more effective at identifying correct links in the network. Additionally, the error rate of GA–GLU is the lowest among all methods at 1.015201, further highlighting its robustness in minimizing prediction errors. These results validate the potential of GA–GLU to enhance link prediction performance in complex and dynamic UAV networks.

4.2. Load Balancing

This study utilizes a flow-level traffic scheduling simulator based on the fat-tree topology, with a link bandwidth of 1 Gbps. Traffic is generated using the DCTCP model, and the throughput and latency of various load-balancing algorithms are compared to assess their performance.
Three sets of traffic were generated, each with similar sizes and rates to ensure reproducibility of the simulation results. To apply a significant load on the network links and accurately reflect the algorithms’ effectiveness, approximately 25,000 back flows and 200,000 query flows were included. The total size of the back flows was around 30 billion bytes, while the query flows totaled 7 billion bytes, for a combined total of 220,000 flows. The average link bandwidth exceeded 0.8 Gbps, demonstrating the load-balancing algorithms’ scheduling capabilities.
The performance of the load-balancing algorithms was evaluated using five main metrics: average throughput, small-flow completion time (FCT), packet loss rate (PLR), the number of flows not completed within the expected time, and end-to-end latency.
The average throughput was measured across multiple simulation runs. Assuming there are n servers and m simulation runs, the average throughput is calculated as shown below. The unit is typically bits per second (bps) or bytes per second (Bps); in this experiment, Mbps (megabits per second) is used. The higher the average throughput, the better the network bandwidth utilization and the performance of the load-balancing algorithm.
$$TPS = \frac{\sum_{j=1}^{m} TPS_j}{m}$$

$TPS_j$ represents the throughput of the j-th simulation run, computed over the expected completion time as follows:

$$TPS_j = \frac{\sum_{i=1}^{n} T_i}{n}$$

where $T_i$ is the throughput of the i-th server.
The small-flow completion time (FCT) mainly reflects the transmission conditions of small flows in the experiment. Since the generated traffic includes query-flow data with a fixed size of 20,480 bytes (20 KB), this metric helps evaluate congestion in which large flows crowd out small flows; the transmission time of small flows is therefore an effective indicator of network congestion. Assuming the simulation is conducted m times, the FCT is calculated as follows:

$$FCT = \frac{\sum_{j=1}^{m} FCT_j}{m}$$

where $FCT_j$ represents the average small-flow transmission time in the j-th simulation run. Assuming that in a single run the average transmission delay of small flows received by the i-th server is $t_i$ and there are n servers in total, the formula for $FCT_j$ is

$$FCT_j = \frac{\sum_{i=1}^{n} t_i}{n}$$
The smaller the value of the small-flow FCT, the better the performance of the load-balancing algorithm.
Since the minimum expected completion time for the generated traffic is 10 s, the number of flows that remain unscheduled after 10 s can partially reflect the performance of various load-balancing algorithms. The faster the number of unscheduled flows converges over time, the better the performance of the load-balancing algorithm.
Since the simulator is task-driven, a smaller end-to-end delay of task traffic indicates better transmission performance. Here, the average delay of all traffic is considered. The smaller the end-to-end delay of the total task traffic, the better the performance of the load-balancing algorithm.
The packet loss rate (PLR) refers to the percentage of packets that are lost in the network during transmission. A lower packet loss rate indicates better performance of the load-balancing algorithm, as it reflects more reliable transmission of data. The PLR is calculated as follows:
$$PLR = \frac{P_{\mathrm{lost}}}{P_{\mathrm{sent}}} \times 100\%$$
where P lost is the number of packets lost during transmission and P sent is the total number of packets sent. The smaller the PLR, the better the network’s performance in terms of reliability and efficiency.
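For completeness, aggregating these per-run metrics reduces to simple averaging; the nested-list layout below (one list of per-server values per simulation run) is an assumption about how the simulator reports results.

```python
def avg_throughput(runs_mbps):
    """TPS: mean over runs of the per-run mean server throughput (Mbps)."""
    return sum(sum(run) / len(run) for run in runs_mbps) / len(runs_mbps)


def avg_fct(runs_delays):
    """FCT: mean over runs of the per-run mean small-flow delay."""
    return sum(sum(run) / len(run) for run in runs_delays) / len(runs_delays)


def packet_loss_rate(p_lost: int, p_sent: int) -> float:
    """PLR: percentage of packets lost during transmission."""
    return 100.0 * p_lost / p_sent
```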
Using the traffic generation and experimental performance metrics described in the previous sections, three sets of experiments were conducted. The subjects of the experiments were four load-balancing algorithms—ECMP, WCMP, FlowBender, and FERL-LB—along with a control group that used random path selection without load balancing. Some experimental results are shown in the figures below.
After data processing, comparative experiments were conducted on the small-flow task transmission delay (FCT), packet loss rate (PLR), average throughput, and end-to-end task transmission delay. The experimental results are shown below.
From the experimental results (Figure 8, Figure 9, Figure 10 and Figure 11), the following key conclusion can be drawn: load-balancing algorithms significantly outperform non-load-balancing path selection algorithms under heavy traffic loads. Among these algorithms, ECMP performs the worst, while WCMP and FlowBender exhibit similar performance. The newly designed FERL-LB algorithm excels in throughput, matching the performance of the best algorithm, FlowBender, and exhibits better performance in small-flow transmission latency, improving on the original FlowBender.
The packet loss rates for the random, ECMP, WCMP, FlowBender, and FERL-LB algorithms are shown in Figure 12. At lower network loads (0.1 and 0.2), congestion is minimal, resulting in low packet loss for all algorithms. As the load increases, congestion causes the packet loss to rise sharply. Among the algorithms, FERL-LB performs the best by efficiently allocating forwarding paths for both large and small flows, leading to the slowest increase in the packet loss rate.
Analyzing the metrics, WCMP and FlowBender both achieve average throughputs close to 800 Mbps. However, in terms of the number of flows left uncompleted within the scheduled transmission time, the random path selection algorithm and ECMP leave many flows unallocated, while FlowBender converges faster. Regarding transmission delay, FlowBender and WCMP perform similarly, but FlowBender underperforms on the average transmission delay of small flows. The FERL-LB algorithm improves on this, indicating a positive impact on small-flow transmission.

5. Summary

This study proposes a novel UAV-based elastic scheduling architecture for cloud-edge-local network resources, innovatively combining the fat-tree topology with the flexibility of UAVs to enhance the flexibility and efficiency of network resource management. By deploying UAV nodes at different levels, we achieved dynamic allocation and optimization of network resources, considerably improving the scalability and management efficiency of the network.
For UAV network link prediction, we introduced the GA–GLU graph representation learning method. By integrating a Graph Autoencoder (GAE), an LSTM network, and Generative Adversarial Networks (GANs), we successfully predicted changes in UAV network links, enabling dynamic resource scheduling. This approach effectively captures network dynamics and structural changes through deep learning techniques, providing real-time adjustments that improve the accuracy and efficiency of resource scheduling.
For UAV network load balancing, we introduced the FERL-LB algorithm, which integrates reinforcement learning to optimize traffic load balancing based on link predictions. This significantly improves network performance and resource utilization. By efficiently redistributing traffic based on predicted link changes, the algorithm adapts to evolving network conditions, performing excellently in handling highly dynamic network environments and providing a more efficient solution for traffic management and quality-of-service assurance.
Future research directions include further optimization of scheduling algorithms to adapt to more complex network environments and a wider range of application scenarios. Furthermore, it is essential to explore strategies for enhancing the security and stability of UAV networks to effectively address potential security threats and attacks.

Author Contributions

Conceptualization, H.L.; Methodology, H.L.; Software, H.L.; Formal analysis, F.H.; Investigation, F.H.; Resources, L.K.; Data curation, H.L.; Writing—review & editing, F.H. and L.K. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China under Grants 62203219 and 62176122.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Gholami, A.; Baras, J.S. Collaborative cloud-edge-local computation offloading for multi-component applications. In Proceedings of the 2021 IEEE/ACM Symposium on Edge Computing (SEC), San Jose, CA, USA, 14–17 December 2021; pp. 361–365.
2. Abbas, N.; Zhang, Y.; Taherkordi, A.; Skeie, T. Mobile edge computing: A survey. IEEE Internet Things J. 2017, 5, 450–465.
3. You, C.; Huang, K.; Chae, H.; Kim, B.H. Energy-efficient resource allocation for mobile-edge computation offloading. IEEE Trans. Wirel. Commun. 2016, 16, 1397–1411.
4. Li, X.; Chen, H. Recommendation as link prediction in bipartite graphs: A graph kernel-based machine learning approach. Decis. Support Syst. 2013, 54, 880–890.
5. Hayal, M.R.; Elsayed, E.E.; Kakati, D.; Singh, M.; Elfikky, A.; Boghdady, A.I.; Grover, A.; Mehta, S.; Mohsan, S.A.H.; Nurhidayat, I. Modeling and investigation on the performance enhancement of hovering UAV-based FSO relay optical wireless communication systems under pointing errors and atmospheric turbulence effects. Opt. Quantum Electron. 2023, 55, 625.
6. Ullah, S.; Mohammadani, K.H.; Khan, M.A.; Ren, Z.; Alkanhel, R.; Muthanna, A.; Tariq, U. Position-monitoring-based hybrid routing protocol for 3D UAV-based networks. Drones 2022, 6, 327.
7. Zhou, C.; Wu, W.; He, H.; Yang, P.; Lyu, F.; Cheng, N.; Shen, X. Delay-aware IoT task scheduling in space-air-ground integrated network. In Proceedings of the 2019 IEEE Global Communications Conference (GLOBECOM), Waikoloa, HI, USA, 9–13 December 2019; pp. 1–6.
8. Kim, K.; Park, Y.M.; Hong, C.S. Machine learning based edge-assisted UAV computation offloading for data analyzing. In Proceedings of the 2020 International Conference on Information Networking (ICOIN), Barcelona, Spain, 7–10 January 2020; pp. 117–120.
9. Jiang, F.; Wang, K.; Dong, L.; Pan, C.; Xu, W.; Yang, K. Deep-learning-based joint resource scheduling algorithms for hybrid MEC networks. IEEE Internet Things J. 2019, 7, 6252–6265.
10. Wang, L.; Wang, K.; Pan, C.; Xu, W.; Aslam, N.; Nallanathan, A. Deep reinforcement learning based dynamic trajectory control for UAV-assisted mobile edge computing. IEEE Trans. Mob. Comput. 2021, 21, 3536–3550.
11. Sanabria, P.; Montoya, S.; Neyem, A.; Toro Icarte, R.; Hirsch, M.; Mateos, C. Connection-aware heuristics for scheduling and distributing jobs under dynamic dew computing environments. Appl. Sci. 2024, 14, 3206.
12. Bi, X.; Zhao, L. Two-layer edge intelligence for task offloading and computing capacity allocation with UAV assistance in vehicular networks. Sensors 2024, 24, 1863.
13. Julian, K.; Lu, W. Application of machine learning to link prediction. arXiv 2016.
14. Rahman, M.; Saha, T.K.; Hasan, M.A.; Xu, K.S.; Reddy, C.K. Dylink2vec: Effective feature representation for link prediction in dynamic networks. arXiv 2018, arXiv:1804.05755.
15. Goyal, P.; Kamra, N.; He, X.; Liu, Y. Dyngem: Deep embedding method for dynamic graphs. arXiv 2018, arXiv:1805.11273.
16. Li, T.; Zhang, J.; Philip, S.Y.; Zhang, Y.; Yan, Y. Deep dynamic network embedding for link prediction. IEEE Access 2018, 6, 29219–29230.
17. Chen, J.; Pareja, A.; Domeniconi, G.; Ma, T.; Suzumura, T.; Kaler, T.; Schardl, T.B.; Leiserson, C.E. Evolving Graph Convolutional Networks for Dynamic Graphs. US Patent 11,537,852, 7 December 2022.
18. Lei, K.; Qin, M.; Bai, B.; Zhang, G.; Yang, M. GCN-GAN: A non-linear temporal link prediction model for weighted dynamic networks. In Proceedings of the IEEE INFOCOM 2019—IEEE Conference on Computer Communications, Paris, France, 29 April–2 May 2019; pp. 388–396.
19. Min, S.; Gao, Z.; Peng, J.; Wang, L.; Qin, K.; Fang, B. STGSN—A spatial–temporal graph neural network framework for time-evolving social networks. Knowl. Based Syst. 2021, 214, 106746.
20. Zhang, H.; Guo, X.; Yan, J.; Liu, B.; Shuai, Q. SDN-based ECMP algorithm for data center networks. In Proceedings of the 2014 IEEE Computers, Communications and IT Applications Conference, Beijing, China, 20–22 October 2014; pp. 13–18.
21. Al-Fares, M.; Radhakrishnan, S.; Raghavan, B.; Huang, N.; Vahdat, A. Hedera: Dynamic flow scheduling for data center networks. In Proceedings of the USENIX Symposium on Networked Systems Design and Implementation, San Jose, CA, USA, 7–11 June 2010; Volume 10, pp. 89–92.
22. Mao, B.; Tang, F.; Fadlullah, Z.M.; Kato, N. An intelligent route computation approach based on real-time deep learning strategy for software defined communication systems. IEEE Trans. Emerg. Top. Comput. 2019, 9, 1554–1565.
23. Sun, P.; Lan, J.; Li, J.; Zhang, J.; Hu, Y.; Guo, Z. A scalable deep reinforcement learning approach for traffic engineering based on link control. IEEE Commun. Lett. 2020, 25, 171–175.
24. Ruelas, A.M.; Rothenberg, C.E. A load balancing method based on artificial neural networks for knowledge-defined data center networking. In Proceedings of the 10th Latin America Networking Conference, New York, NY, USA, 3–5 October 2018; pp. 106–109.
25. Babayigit, B.; Ulu, B. Deep learning for load balancing of SDN-based data center networks. Int. J. Commun. Syst. 2021, 34, e4760.
26. Gulrajani, I.; Ahmed, F.; Arjovsky, M.; Dumoulin, V.; Courville, A.C. Improved training of Wasserstein GANs. Adv. Neural Inf. Process. Syst. 2017, 30, 5767–5777.
27. Kabbani, A.; Vamanan, B.; Hasan, J.; Duchene, F. FlowBender: Flow-level adaptive routing for improved latency and throughput in datacenter networks. In Proceedings of the 10th ACM International Conference on Emerging Networking Experiments and Technologies, Sydney, Australia, 2–5 December 2014; pp. 149–160.
28. Sun, P.; Hu, Y.; Lan, J.; Tian, L.; Chen, M. TIDE: Time-relevant deep reinforcement learning for routing optimization. Future Gener. Comput. Syst. 2019, 99, 401–409.
Figure 1. UAV-based cloud-edge-local network architecture.
Figure 2. GA–GLU link prediction algorithm.
Figure 3. LSTM–decoder structure.
Figure 4. FERL-LB algorithm structure.
Figure 5. Link prediction evaluation metric: AUC.
Figure 6. Link prediction evaluation metric: MAP.
Figure 7. Link prediction evaluation metric: error rate.
Figure 8. Throughput comparison of different load-balancing strategies.
Figure 9. Small-flow transmission delay.
Figure 10. The number of remaining flows that have not been successfully transmitted within the scheduled time for a transmission task.
Figure 11. End-to-end average latency for different load-balancing strategies.
Figure 12. Packet loss rates under different load-balancing strategies.
Table 1. Comparison of UAV link prediction algorithms.

Algorithm | Advantages | Disadvantages
DyLink2Vec [14] | Efficient at capturing dynamic network changes; reduces reconstruction errors using gradient descent | Does not fully capture nonlinear patterns in temporal networks; struggles with scalability in large networks
DGEM [15] | Maps dynamic graph data to a nonlinear latent space; effectively handles dynamic graph evolution | High computational cost; requires fine-tuning for optimal performance
DDNE [16] | Leverages historical data with GRUs to predict future link changes; effectively models temporal dependencies | Struggles to adapt in highly dynamic environments; performance relies heavily on temporal data quality
EvolveGCN [17] | Combines GCNs and RNNs to capture spatial and temporal dependencies; improves accuracy for evolving graphs | High model complexity; difficult to interpret results
STGSN [19] | Models both spatial and temporal features; utilizes attention mechanisms for better interpretability | High computational overhead; sensitive to noise in the data
GA–GLU (proposed) | Combines a GAE, GANs, and LSTM networks for more accurate link prediction in UAV networks; captures both spatial and temporal dynamics; improves performance in highly dynamic environments | Requires careful tuning of GAN and LSTM components; slightly higher computational cost compared to simpler models
Table 2. AirSim configuration for UAV swarm simulation.

Configuration Parameter | Value
Number of UAVs | 36
Simulation Duration (s) | 1000
Max Speed (m/s) | 15
Communication Range (m) | 500
Starting Altitude (m) | 100
Control Algorithm | Distributed Control
Scanning Phase Duration (s) | 0–500
Hovering Phase Duration (s) | 500–1000
Data Collection Frequency (Hz) | 10
Network Type | Mesh
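For reproducibility, a swarm of this size is typically declared in AirSim's settings.json. The sketch below is a hypothetical Python helper that lays the 36 UAVs out on a 6 × 6 grid and writes such a file, assuming AirSim's standard Multirotor/SimpleFlight settings schema; the grid spacing is an assumption, and parameters such as maximum speed, phase durations, and the distributed control logic would be enforced by the mission script rather than by this file.

```python
import json

def make_settings(n_side=6, spacing_m=50):
    """Hypothetical helper: emit an AirSim settings dict for a UAV grid."""
    vehicles = {}
    for i in range(n_side * n_side):
        vehicles[f"UAV{i}"] = {
            "VehicleType": "SimpleFlight",
            "X": (i % n_side) * spacing_m,   # grid offset (m), assumed spacing
            "Y": (i // n_side) * spacing_m,  # grid offset (m)
            "Z": 0,                          # spawn on ground; climb in mission
        }
    return {"SettingsVersion": 1.2, "SimMode": "Multirotor",
            "Vehicles": vehicles}

with open("settings.json", "w") as f:
    json.dump(make_settings(), f, indent=2)
```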
Table 3. Comparison of link prediction results.

Method | AUC | MAP | Error Rate
CN | 0.883659 | 0.107073 | 4.038485
Katz | 0.897741 | 0.118954 | 3.811238
Node2vec-LRCV | 0.915489 | 0.157554 | 3.157558
Node2vec-LSTM | 0.901738 | 0.409950 | 3.462399
GAE–LSTM | 0.984325 | 0.777592 | 1.324715
GA–GLU | 0.946060 | 0.756796 | 1.015201
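For reference, the sketch below shows one plausible way to compute such metrics for a predicted snapshot, using scikit-learn's roc_auc_score and average_precision_score (the latter as a common stand-in for MAP). The error-rate formula is an assumption for illustration, since the exact definition behind Table 3 is not restated here.

```python
import numpy as np
from sklearn.metrics import roc_auc_score, average_precision_score

def evaluate_link_prediction(true_adj, pred_scores):
    """Compare predicted link scores against a ground-truth snapshot.

    true_adj: (N, N) binary adjacency matrix of the next snapshot.
    pred_scores: (N, N) predicted link probabilities in [0, 1].
    """
    iu = np.triu_indices_from(true_adj, k=1)  # undirected: upper triangle only
    y_true, y_score = true_adj[iu], pred_scores[iu]
    auc = roc_auc_score(y_true, y_score)
    map_ = average_precision_score(y_true, y_score)
    # Assumed error measure: total absolute error normalized by true link count.
    err = np.abs(y_true - y_score).sum() / max(y_true.sum(), 1)
    return auc, map_, err

# Toy check with a random symmetric ground truth and random scores.
rng = np.random.default_rng(0)
A = np.triu((rng.random((6, 6)) > 0.5).astype(float), 1)
print(evaluate_link_prediction(A + A.T, rng.random((6, 6))))
```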
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
