Article

Latency Analysis of UAV-Assisted Vehicular Communications Using Personalized Federated Learning with Attention Mechanism

by
Abhishek Gupta
1,2,*,† and
Xavier Fernando
1,†
1
Department of Electrical and Computer Engineering, Toronto Metropolitan University, Toronto, ON M5B 2K3, Canada
2
Department of Computer Science, Dhirubhai Ambani University, Gandhinagar 382007, Gujarat, India
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Drones 2025, 9(7), 497; https://doi.org/10.3390/drones9070497
Submission received: 14 June 2025 / Revised: 9 July 2025 / Accepted: 14 July 2025 / Published: 15 July 2025

Abstract

In this paper, unmanned aerial vehicle (UAV)-assisted vehicular communications are investigated to minimize latency and maximize the utilization of available UAV battery power. As communication and cooperation between the UAV and vehicles are frequently required, a viable approach is to reduce the transmission of redundant messages. However, this becomes challenging when the sensor data captured by a varying number of vehicles are not independent and identically distributed (non-i.i.d.). Hence, in order to group the vehicles with similar data distributions into clusters, we utilize federated learning (FL) based on an attention mechanism. We jointly maximize the UAV’s available battery power in each transmission window and minimize communication latency. The simulation experiments reveal that the proposed personalized FL approach achieves performance improvements compared with baseline FL approaches. Our model, trained on the V2X-Sim dataset, outperforms existing methods on key performance indicators. The proposed FL approach with an attention mechanism reduces communication latency by up to 35% and substantially reduces computational complexity without degrading performance. Specifically, we achieve an improvement of approximately 40% in UAV energy efficiency, a 20% reduction in communication overhead, and a 15% reduction in sojourn time.

1. Introduction

Unmanned aerial vehicles (UAVs) find diverse applications in vehicular communications in sixth-generation (6G) networks. UAVs can serve as aerial computing equipment or aerial base stations (BSs) to provide a wide range of services to vehicular networks [1]. Moreover, UAVs have many degrees of freedom in their trajectory and can achieve line of sight (LoS) with vehicles [2]. However, optimizing the available UAV battery power during massive data transfers and multi-link communications originating from multiple vehicles is a critical issue [3]. The operational height and elevation angle of a UAV also impact its link reliability and coverage range. The optimal trajectory of a UAV is subject to interference constraints while maximizing the coverage area. The mobility of the vehicles and the UAV, and the associated Doppler spread, further limit the precision of channel estimation methods [4].
In 6G communications, the volume of data traffic has surged as sensors generate, offload, and process huge amounts of data [5]. These applications require substantial computing resources and need energy-efficient mechanisms to store and process the incoming packets at the UAV’s server. A combination of federated learning (FL) and mobile edge computing (MEC) facilitates collaborative intelligence in vehicular edge computing [6]. Mobile cloud computing (MCC) and MEC have been used as viable solutions to process computationally intensive tasks at vehicular edge servers [7]. In some approaches, Lyapunov optimization has been used to reduce latency, enhance energy efficiency, and achieve convergence of the underlying machine learning models. When MEC is combined with FL, partial offloading improves the quality of service (QoS) in low-latency and delay-sensitive vehicular communications [8]. Edge intelligence for FL-driven 6G communications was discussed in [9], while emerging wireless networks equipped with machine learning were discussed in [10]. An example scenario with integration of a UAV in vehicular communications is depicted in Figure 1. Here, a conventional road side unit (RSU) communicates with vehicles. To facilitate this communication, the UAV can extend communication coverage to all vehicles [11]. With the UAV serving as a mobile base station (BS), the handoff delay can be kept minimal. Figure 1 illustrates that with a traditional terrestrial BS, the signal strength is low when the vehicles are distant from the BS. Figure 1 also illustrates that the accumulation of handoff delay can be reduced, even in the presence of some no-coverage zones, by deploying a UAV as a mobile BS. However, a UAV has limited battery power and needs to remain in a vehicle’s LoS for a prolonged time.
The paradigm of FL addresses challenges such as the limited scalability of a centralized computing server, centralized storage and management of vehicular data, computational complexity at the vehicular edge, and inconsistencies due to the straggler effect [12]. Excluding stragglers reduces the amount of training data, leading to sparse gradients and model aggregation errors [13]. Furthermore, federated stochastic gradient descent (FedSGD) and federated averaging (FedAvg) limit the scalability of FL approaches [14]. Some recent applications of distributed FL approaches for intelligent transportation systems (ITSs) have been discussed in [15]. Furthermore, the transmission of messages from multiple vehicles wastes communication resources when the messages at successive transmission time intervals (TTIs) have similar content [16]. The relevance of a global model depends on the data distributions among vehicles. Specifically, a global model works well when the vehicles’ data is independent and identically distributed (i.i.d.). Hence, this paper explores the application of personalized federated learning (PFL) in UAV-assisted C-V2X communications to optimize the available UAV transmit power and address the resource demands of vehicles [17]. The multi-agent PFL is further enhanced using an attention mechanism, which considers each vehicle and the UAV to determine the transmission, scheduling, and queuing strategy. Based on an adaptive asynchronous optimal policy, the vehicles maximize a global objective function [18]. Asynchronous FL-based task offloading schemes minimize delay and power consumption. Transformer personalization methods have also been explored to model the dependency, priority, and associations between tasks [19].
For trajectory prediction, hidden Markov models (HMMs) and graph theory-based approaches have been applied and explored in existing works such as [20,21,22]. By considering state transitions between an agent’s trajectories as Markovian dynamics, a likelihood matrix is generated to predict the trajectory. Memoryless Markov models, however, assume that an agent’s present state separates its future trajectory from its past locations [23]. In UAV-assisted communications, the UAV’s state transitions may not only depend on the current state, but also on the past trajectories, covering different vehicles [24]. Consequently, HMM-based models do not explain the heterogeneity of UAVs’ trajectories, resulting from their different past locations and random trajectories. The attention mechanism determines the degree of dependence of future UAV trajectories and vehicle states on a UAV’s available battery power [25]. Using model aggregation based on PFL, we predict UAV trajectory coordinates [26]. To maximize learning efficiency, our method allows local model training with subsequent parameter transmission to the UAV [27]. The simulations indicate that our proposed approach significantly improves the computation rate and maintains queue stability compared to several baseline methods.

1.1. Related Work

Machine learning and game-theoretic approaches continue to find novel applications in performance optimization of vehicular communication networks. The recent advances in UAV-assisted 6G communications that are driving the developments in ITSs have been discussed in [28] and UAV-assisted satellite–aerial–ground integrated networks have been analyzed in [29]. Based on Stackelberg game theory, a novel approach for coordinated UAV-assisted vehicular communication networks was proposed in [30]. The performance of vehicular communications was enhanced using multiple candidates for single-subframe resources in [31].

1.1.1. Personalized Federated Learning Approaches and Their Applications in UAV-Assisted Communications

A variant of FL where the vehicles train personalized local models and make predictions jointly with the UAV’s global model is known as PFL. The complexity of the central shared model can be minimized while maintaining the benefits of joint training [32]. In recent applications of PFL, a cloud-based indoor positioning model aggregates parameters from multiple environments to build a cloud-based encoder to create personalized models based on channel state information (CSI). The authors of [33] proposed weight anonymized factorization-based FL for performance improvement and fairness [33]. In addition, a clustered FL algorithm maintains multiple global models to explore latent relationships between clients. The model losses and learning errors are minimized to achieve the optimal number of clusters [34]. Compared to FedAvg and FedSGD, the federated Gaussian process regression (FGPR) framework improves personalization [35]. Dynamic reward shaping improves the selection of actions as state parameters dynamically change, while maintaining reward balance [36].
The authors of [37] proposed an approach that captures common priors from distributed data and generates customized models for heterogeneous clients. Here, a layer-wise PFL algorithm generates and learns different parameters. In a recent work, PFL and deep reinforcement learning (DRL) maximize the network throughput while adapting to time-varying network states [38]. The authors of [39] proposed communication-efficient FL through gradient compression and adaptive aggregation. The proposed approach reduced communication overhead between clients and the server [39]. However, a large number of hyper-parameters leads to congestion. To improve communication efficiency, dynamic sampling and selective masking were used to select fewer clients dynamically in [40]. For efficient wireless FL using layers for personalization, the proposed approach reduced congestion and derived an upper bound for non-convex loss functions. The method also revealed that dynamic network topologies in FL help new participants to reach higher accuracy [41].
The authors of [42] proposed a communication-efficient PFL algorithm and introduced a personalization parameter. The proposed approach improved the model’s accuracy and accelerated model convergence by varying the personalization parameter [42]. The authors of [43] proposed a hierarchical attention-enhanced meta-learning approach that analyzes similarities in clients’ data. The method achieves a tradeoff between the clients’ personalized models and a common model [43]. In a recent approach, collaborative edge caching was investigated, where fog access points make caching decisions when communicating with a macro base station (MBS) [44]. The authors of [45] applied variational inequality, an incentive-based FL approach, and the block coordinate descent algorithm to edge-based FL [45]. The authors of [46] explored mechanisms to avoid duplicate computations and adapt to data distribution shifts. The authors of [17] considered non-convex Gaussian mixture model (GMM) optimization in edge computing environments and proposed adaptive solutions to the GMM problem. By using an attention mechanism, the priority of services was determined, and resource allocation mechanisms were developed to maintain QoS. The results demonstrated a decrease in task processing delays, and effectively delivered QoS [14].

1.1.2. Personalized Federated Learning Approaches for Performance Enhancement in Aerial–Ground Integrated Networks

Existing FL schemes do not adequately account for performance complexity when clients conduct a varying number of training iterations [47]. When clients have low computational or communication capabilities, global model aggregation is slow, causing higher latency. The authors of [48] proposed time-sensitive FL and investigated whether FL leads to an improvement in the convergence rate. The authors proposed a DRL mechanism to develop an optimal deterministic algorithm for assigning local iterations based on the computational capabilities of clients [48]. In addition, to minimize training time, the authors in [49] implemented a framework that proposes a spatial–temporal learning optimization based on the resources available at the UAV. The solution approach in [49] utilizes dual fitting to create an online algorithm with a competitive ratio and reduces average flow time by 19% [49]. The authors of [50] proposed collaborative neural filtering for communication efficiency. To reduce model complexity, the authors constructed a standard neural collaborative filtering model to select smaller payloads. FL models from selected nodes resulted in a payload-efficient global model. The authors of [51] integrate FL with multi-agent DRL in the vehicular environment to enhance energy efficiency and task completion rates. The task-offloading mechanism establishes an FL framework to determine resource allocation proportions [51]. Conventionally, communication and information fusion models have been designed based on belief maps, and multi-objective optimization problems have been formulated as Markov decision processes (MDPs) [52].
A novel method examines the criterion for selecting participating devices under energy constraints and their impact on device selection. To improve training efficiency, the method uses the age-of-information (AoI) metric to determine how quickly participants update their gradients [53]. The authors of [54] propose an innovative Stackelberg game-based framework that explores global loss minimization and latency minimization for faster convergence. To reduce communication rounds, global loss minimization is formulated at the leader, while latency minimization is formulated at the followers. Utilizing monotonic optimization and matching theory, they achieve an optimal follower strategy through subproblems pertaining to resource allocation and sub-channel assignment [54]. The authors of [55] obtained an upper bound for the convergence rate and proposed a new algorithm for selecting devices based on the age of update. It was shown that AoI-based device selection accelerates convergence rates and makes efficient use of sub-channels [55].
The authors of [56] investigate an actor–critic framework for heterogeneous asynchronous federated reinforcement learning (FRL). A three-layer MEC system involves entities acting both as actors, interacting with the environment, and as critics, learning optimal policies, with independent training for each agent. The method addresses non-stationary environments and enables cooperation among entities [57]. To minimize delay, the authors in [58] formulate an MDP. A solution approach based on deep deterministic policy gradient (DDPG) and k-nearest neighbors is used [58]. In a novel approach in [59], an actor network generates user scheduling, UAV trajectory, and power allocation policies, updating dual variables. The authors of [60] proposed an air-slicing framework in which multiple UAVs serve as aerial BSs. With network slicing, the network efficiency is maximized while achieving a simultaneous improvement in latency. The authors formulate a stochastic game in MDP environments where multiple adaptive agents compete simultaneously [60].
The authors of [24] formulated uncertainty minimization as a reward maximization problem and proposed a deep Q-network to optimize task offloading and flying orientation, where different parameters affect the search performance [24]. Another approach transferred wireless power from towers to UAVs and minimized computation overheads; however, the reduced amount of data led to overfitting. Providing joint power charging between towers and UAVs, a solution approach based on an MDP was proposed to handle workload-aware scheduling decisions [23]. For a dynamic wireless sensor network that employs orthogonal frequency division multiplexing (OFDM), a UAV trajectory design problem is investigated in [61]. A similar approach reduced delay in edge computing networks by reducing dynamic variability in task arrivals [62]. In [63], a reinforcement learning (RL)-based cooperative navigation algorithm is proposed to plan efficient paths and avoid collisions.
For multi-UAV cooperative navigation, a prioritized experience replay strategy was designed by considering both temporal difference errors and node connectivity [63]. By optimizing its location and velocity at each time slot, a UAV was trained to navigate faster [64]. Multi-agent communication explores relationships between message entropy and bandwidth and introduces mutual information to approximate message entropy [64]. Using hierarchical federated learning (HFL), clusters of UAVs were organized under cluster heads [65]. With content caching, vehicles’ context information can be used to allow UAVs to cache popular content, where clusters of UAVs are formed based on vehicles’ mobility patterns [66]. In another approach, using adaptive FRL, the UAV energy consumption is calibrated to determine the local update frequency in [67]. Other work aimed to optimize network environments and fluctuations in data traffic, and developed an algorithm to select a UAV as a central server [68].
The authors of [69] developed an approach with multiple objectives for UAVs with power constraints [69]. An optimization problem for dynamic data caching and computation offloading to reduce the average processing delay and maximize cache hits for UAVs was formulated in [70]. An optimization problem was constructed to minimize FL convergence time considering constraints such as uplink latency, selection strategy, local model training latency, and energy consumption in [71]. The authors of [72] optimized the position of UAVs based on the instantaneous signal-to-interference noise ratio (SINR). The resource allocation strategy improved the FL execution by 66.67% and attained lower latency and energy consumed compared to other resource allocation techniques [72]. The authors of [73] enhance the generalization capability of routing decisions using FL where domain controllers and servers communicate. The proposed method facilitates knowledge transfer, accelerates training, and reduces communication costs [73]. The authors of [74] integrate FL with imitation learning to coordinate UAVs’ maneuvers. The authors remove biased estimations of imitation parameters using generative adversarial imitation learning. The work aims to regularize the federated gradient updates to achieve efficient parameter interactions and coordinated swarm policies using self-imitated learning [74].

1.2. Contributions

This paper proposes PFL with an attention mechanism. As each vehicle gathers diverse data with varying statistical characteristics, challenges arise in effectively generalizing the global model at the UAV. The global model may yield suboptimal performance due to differences in data distributions and transmission patterns. This degradation in the global model becomes significant as the diversity among local data from different vehicles continues to increase [14]. The attention weights capture the relationship between the hidden states and the past trajectories of the UAV. By leveraging the collective data from all participating vehicles, the proposed approach integrates an attention mechanism to enhance collaboration among vehicles with similar data distributions and mitigates data scarcity and straggler issues [26]. Instead of using a single global model to aggregate local updates, the UAV stores several personalized models, which encapsulate common data from multiple vehicles [75]. The UAV aggregates several global models based on the data distribution parameters calculated by each vehicle [76]. The principal contributions are listed as follows:
  • For PFL model convergence, optimal policy evaluation, and improvement, we consider the multi-objective MDP as a sequence of parameterized optimization problems. The optimal solutions correspond to the optimal actions in the proposed bin-packing problem (BPP). This leads to an improvement of state evaluation and action evaluation in a reduced action space.
  • We investigate the vehicle selection losses and communication errors in the PFL model convergence rate. We identify the factors that impact the global optimum and minimize training loss [77].
  • We analyze the importance of past UAV trajectories in the proposed system using the V2X-Sim dataset from [78] and the LTE I/Q dataset from [79]. Consequently, we develop a sequential prediction model for future UAV trajectories.
  • We extend our previous works in [80,81] by incorporating FL with an attention mechanism and predict the UAV trajectories that lead to maximum utilization of available battery power [82].

1.3. Organization

The remainder of this paper is structured as follows: Section 2 outlines our system model. In Section 3, we formulate the proposed optimization problem as a Markov decision process (MDP). Section 4 presents our proposed solution based on PFL with an attention mechanism to minimize the UAV’s power consumption. Section 5 discusses the results and investigates the variations in UAV power consumption and optimal UAV trajectory prediction. Lastly, Section 6 concludes the paper.

2. System Model

Figure 2 illustrates the system model, a UAV–vehicle communication architecture comprising the vehicles $\{V_1, \ldots, V_n\}$. The communication channel uses orthogonal time–frequency space (OTFS) modulation, which introduces frequency and spatial selectivity in the fading channel. It considers both LoS and NLoS components to estimate the channel characteristics. The UAV trajectory is a set of sequential coordinates denoted by $\Delta(x, y)$. The trajectory coordinates comprise hovering and turning points, represented as $\Delta(x, y) \in [\Delta(0,0), \Delta(0,1), \ldots, \Delta(1,0), \Delta(1,1), \ldots, \Delta(m,m)]$.
The past UAV coordinates are stored and used to predict the future UAV path. We model the changes in the UAV trajectory as incremental steps based on its velocity and direction. We assume a system with slot duration $\tau$ that follows a block-fading model. The downlink channel gain from the UAV ($u$) to a vehicle ($v$) is given by Equation (1).
$$\bar{g}_{uv}(t) = |h_{uv}(t)|^2\,\sigma_{uv}(t), \qquad t = 1, 2, \ldots, \tau \tag{1}$$
where $\sigma_{uv}(t)$ represents the large-scale fading component that varies as vehicles change their position. The location of vehicle $v$ at slot $t$ is represented as $x_v(t)$. The large-scale fading in dB is given by Equation (2).
$$\sigma_{\mathrm{dB},uv}(t) = \mathrm{PL}\big(x_u, x_v(t)\big) + \vartheta_{uv}(t) \tag{2}$$
where $\mathrm{PL}$ is the path loss and follows log-normal shadowing from $x_u$ to $x_v(t)$. The standard deviation in log-normal shadowing is denoted by $\vartheta_{uv}(t)$, which varies with the displacement of the vehicle during the last slot. The SINR $\gamma_v(t)$ at a vehicle at time slot $t$ is defined as a function of the communication bandwidth $b_w(t)$ and power $p_w(t)$, given by Equation (3) as
$$\gamma_v(t)\big(b_w(t), p_w(t)\big) = \frac{\bar{g}_{b_u(t)v}(t)\, p_v(t)}{\sum_{v' \in V,\, v' \neq v} \bar{g}_{b_u(t)v'}(t)\, p_{v'}(t) + N^2} \tag{3}$$
where $N^2$ is the power spectral density of the additive white Gaussian noise. The downlink spectral efficiency $C_v(t)$ of vehicle $v$ at time $t$ is given by Equation (4).
$$C_v(t) = \log\Big(1 + \gamma_v(t)\big(b_w(t), p_w(t)\big)\Big) \tag{4}$$
The channel coefficient is modeled as follows:
$$h_{u,v} = \sqrt{\frac{\kappa_u}{\kappa_u + 1}}\; \bar{h}_{u,v} + \sqrt{\frac{1}{\kappa_u + 1}}\; \tilde{h}_{u,v} \tag{5}$$
The symbols considered in this paper are described in Table 1.
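To make the channel model concrete, the following is a minimal numerical sketch of Equations (1)–(5) in Python. The path-loss exponent, shadowing standard deviation, Rician factor, transmit power, and noise level are illustrative assumptions rather than values taken from this paper.

```python
import numpy as np

rng = np.random.default_rng(0)

def large_scale_fading_db(distance_m, pl_exponent=2.7, shadow_std_db=4.0):
    """Eq. (2): path loss plus log-normal shadowing (illustrative constants)."""
    path_loss_db = 10.0 * pl_exponent * np.log10(max(distance_m, 1.0))
    return path_loss_db + rng.normal(0.0, shadow_std_db)

def rician_coefficient(kappa=10.0):
    """Eq. (5): small-scale coefficient with LoS factor kappa_u."""
    h_los = 1.0 + 0.0j                                          # deterministic LoS part
    h_nlos = (rng.normal() + 1j * rng.normal()) / np.sqrt(2.0)  # scattered NLoS part
    return np.sqrt(kappa / (kappa + 1.0)) * h_los + np.sqrt(1.0 / (kappa + 1.0)) * h_nlos

def downlink_metrics(distances_m, p_tx_w=1.0, noise_w=1e-10):
    """Eqs. (1), (3), (4): per-vehicle channel gain, SINR, and spectral efficiency."""
    gains = np.array([
        np.abs(rician_coefficient()) ** 2 * 10 ** (-large_scale_fading_db(d) / 10.0)
        for d in distances_m
    ])
    rx_power = gains * p_tx_w
    sinr = rx_power / (rx_power.sum() - rx_power + noise_w)     # other vehicles act as interference
    return sinr, np.log2(1.0 + sinr)

sinr, spectral_eff = downlink_metrics([50.0, 120.0, 300.0])
print(sinr, spectral_eff)
```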

3. Problem Formulation

In this paper, an optimization problem is formulated to jointly optimize the UAV’s trajectory, the UAV’s available battery power, the vehicles’ packet transmission scheduling, and the queue length at the UAV. The vehicles compute the local stochastic gradient and communicate with the UAV to seek a global optimum. Subsequently, we minimize the queue length at the UAV by reducing the transmission rate from multiple vehicles [83]. The data distribution experiences unpredictable changes, which requires periodic updates of the training set. The queue at the UAV follows the standard $M/M/k$ model and is a bin set of capacity $c_q$, comprising a packet set $a_\Psi$, where the size of each packet is $\Psi_i$. To optimize the queue length at the UAV, we formulate a bin-packing problem (BPP) as an MDP to pack all the packets into queues such that the total number of queues is minimized for queue capacity $c_q$. The variable $\zeta_{ik}$ denotes whether packet $i$ is assigned to queue $k$, and the variable $\phi_k$ denotes whether bin $k$ is utilized. The problem $(P_1)$ is formulated as follows in Equation (6):
$$P_1: \min_{\psi(t),\, D(t)} \Big\{ \underbrace{w_1 \Big[ \sum_{t=1}^{T} \min\big(\max_{v \in V}(d_v^t),\, \tau_d\big) + T\,\tau_u \Big]}_{SP_1} + \underbrace{w_2 \sum_{t=1}^{T} \sum_{v \in V} D_v(t)\, \min\big(d_v(t),\, \tau_d\big)}_{SP_2} + \underbrace{w_3 \sum_{t=1}^{T} \sum_{v \in V} \sum_{j \in V} \sum_{k \in K} t_i\, p_{i,j}^{(k)}\, \varsigma_{i,j}^{(k)}(t)}_{SP_3} \Big\} \tag{6}$$
subject to
$$C_1: \min_{v \in V} \phi_k, \quad \phi_k \in \{0, 1\}, \quad \forall v \in V$$
$$C_2: \sum_{i \in a_\Psi} \zeta_{ik}\, \Psi_i \le c_q\, \phi_k, \quad \forall k \in K, \ \forall v \in V$$
$$C_3: \sum_{k \in K} \zeta_{ik} = 1, \ \forall i \in a_\Psi; \qquad \zeta_{ik} \in \{0, 1\}, \ \forall i \in a_\Psi, \ \forall k \in K$$
$$C_4: \|\Delta_i(t+1) - \Delta_i(t)\| \le v_{max}(t)\, t_{i,i+1}$$
$$C_5: \|\Delta_i(t+1) - \Delta_i(t)\| \ge d_{min}$$
$$C_6: \Delta(x, y) = \Delta(x_0, y_0) \cdot e^{\alpha\left(\frac{x^2}{a^2} + \frac{y^2}{b^2}\right)} + H$$
$$C_7: w_1 + w_2 + w_3 = 1$$
where problem $(P_1)$ is a multi-objective and multi-variable non-convex optimization problem. In problem $(P_1)$ in Equation (6), the parameter $\psi(t)$ is a vector consisting of different packets comprising PFL model hyper-parameters. Rather than transmitting the local model hyper-parameters from all the vehicles to the UAV, PFL aims to select a subset of vehicles that communicate their local model hyper-parameters to the UAV in a TTI. The variable $\tau_d$ is the delay experienced by a vehicle while downloading the global model hyper-parameters from the UAV. The term $\tau_u$ denotes the delay experienced by a vehicle while uploading its local model hyper-parameters to the UAV. The variable $t_i$ is the time duration of the TTI under consideration, and $p_{i,j}$ is the power consumption of the UAV in a TTI.
The constraint C 1 emphasizes the decision variable ϕ k that denotes whether bin k is utilized in a transmission window or is idle. The constraint C 2 limits packet size ( ψ ). This ensures that the queue does not exceed the capacity and limits the utilization of the UAV’s resources. The constraint C 3 introduces a control parameter for the transmitted data. The constraints C 4 and C 5 restrict the UAV’s trajectory within the vehicle’s coverage range such that the UAV traverses a minimum distance ( d m i n ) and receives packets from at least one vehicle in a TTI. The maximum velocity of the UAV is restricted to ( v m a x ). In constraint C 6 , the UAV’s trajectory is constrained to an elliptical path. In constraint C 6 , the variables a and b indicate the major axis and minor axis of the UAV’s elliptical path. We introduce a variable α as a tuning factor to expand or contract the elliptical trajectory. The selection of an elliptical trajectory over a circular trajectory was motivated by the results demonstrated in [84,85,86]. The authors in these works postulate that, for UAVs, an elliptical path is a common trajectory for surveillance, target tracking, and other applications. Through extensive experiments and analysis, the authors have illustrated that an elliptical path provides optimal obstacle avoidance and motion stability compared to a circular path [84]. Moreover, a rectangular or a square path has abrupt discontinuities and may lead to a UAV becoming stuck in a specific segment of the path. The variable α is a tuning factor between 0 and 1. When α = 0, the UAV is stationary at height ( H ) and does not move along the x and y axes. When α = 1, the largest possible elliptical trajectory is realized. For other values of α between 0 and 1, the area of the elliptical path gradually increases as α is increased from 0 to 1. The constraint C 7 implies that the weighting coefficients in Equation (6) should add to unity. In this paper, we use block coordinate descent (BCD) to decompose the optimization problem P 1 into three subproblems. BCD optimizes a subset of variables in a subproblem while keeping others constant. This process is repeated until convergence by optimizing smaller blocks of variables in each TTI. To account for the underlying hardware’s computing limitations, it is important to achieve a positive semi-definite matrix and to have the rank of the variables’ matrices as less than or equal to one. This allows the non-convex optimization problem to be transformed into a convex optimization problem. If the weighting coefficients w 1 , w 2 , and w 3 in Equation (6) do not add to unity or are large integers, it takes a comparatively longer time to solve all the subproblems.
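As a concrete illustration of the queue-packing constraints $C_1$–$C_3$, the sketch below packs packets into queues with a simple first-fit heuristic. This is only one of many possible heuristics for the BPP and is not the MDP-based policy developed in Section 4; the packet sizes and queue capacity are illustrative.

```python
def pack_packets(packet_sizes, queue_capacity):
    """First-fit packing of packets into UAV queues (bins).

    Each packet goes into exactly one queue (C3), no queue exceeds its
    capacity (C2), and keeping the number of opened queues small mirrors
    the objective expressed through C1.
    """
    queues = []                                       # each entry is a list of packet sizes
    for size in packet_sizes:
        for queue in queues:
            if sum(queue) + size <= queue_capacity:   # respect capacity (C2)
                queue.append(size)                    # assign packet to this queue (C3)
                break
        else:
            queues.append([size])                     # open a new queue (phi_k = 1)
    return queues

# Illustrative packet sizes (arbitrary units) and queue capacity
print(pack_packets([4, 8, 1, 4, 2, 1], queue_capacity=10))
```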

4. Solution Approach

Using BCD, problem $P_1$ in Equation (6) is divided into three subproblems. Subproblem $SP_1$ pertains to optimizing the UAV trajectory; subproblem $SP_2$ pertains to optimizing the available battery power; and $SP_3$ aims to optimize the queue occupancy at the UAV. We also utilize personalized federated learning (PFL) with an attention mechanism to reduce the number of communication rounds between the vehicles and the UAV. The reduction in communication overhead is realized by selecting a subset of vehicles for communicating local model hyper-parameters in each TTI. When a subset of vehicles, instead of all the vehicles, communicates its local model hyper-parameters to the UAV, fewer communication rounds are needed to transmit the relevant information to the UAV for further processing. Figure 3 illustrates the PFL approach.

4.1. Training Vehicles with Data

We consider a dataset $D_n$ with $n$ samples that comprises a $d$-dimensional feature space bounded and closed in $\mathbb{R}^d$. Vehicle selection is performed by the UAV according to a policy that varies with the UAV’s battery power and trajectory coordinates. The non-optimal actions are iteratively removed from the action set. We introduce a trajectory storage vector $T_r(x)$ that stores the difference between two adjacent trajectory coordinates. We identify the trajectory predictors of $T_r(x)$ using samples from $D_n$. To build estimators of $T_r(x)$, we calculate an optimal minimax rate to estimate $T_r(x)$ using $D_n$. The optimal actions are updated as $V(s) = \max_a \{ Q(s, a) : a \in A(s),\, A(s) \subseteq A \}$, where $A$ denotes the available actions for the vehicles and the UAV. The state value $V(s_t)$ is the total reward a vehicle can accumulate from state $s_t$, and the action value $Q(s_t, a_t)$ is the cumulative reward obtained by taking action $a_t$ in state $s_t$. The Bellman equations are given by Equations (14) and (15) as
$$V(s_t) = \max_{a_t} \big\{ R(s_t, a_t) + V(s_{t+1}) \big\}, \qquad \forall t \in T \tag{14}$$
where $R(s_t, a_t)$ is the accumulated reward for a state–action pair. The value $Q_{\bar{\theta}}$ is updated as in Equation (15):
$$Q_{\bar{\theta}}(s_t, a_{t+1}) = \min\big( Q_{\bar{\theta}_1}(s_t, a_t),\, Q_{\bar{\theta}_2}(s_t, a_t) \big) + \gamma_d\, \mathbb{E}_{a_t \sim \pi}\big[ Q_{\bar{\theta}}(s_t, a_t) \big] + \varphi\, H_{\pi_\phi} \tag{15}$$
where $H_{\pi_\phi}$ is the policy learned by the UAV at height $H$. At different heights, the available battery power is different, leading to different policies; $\gamma_d$ is the discount factor; $a_{t+1}$ is the action at the next time step; and $\varphi$ is the personalization parameter. The expectation $\mathbb{E}_{a_t \sim \pi}$ is taken over the states $s$. The policy aims to maximize the cumulative reward, and hence we parameterize the policy using past interactions. For UAV trajectory prediction, we propose a multidimensional time series model. The UAV’s action space is continuous, and an actor–critic algorithm is used to train the trajectory predictor. This extracts the temporal dependencies among vehicles’ data to forecast the data traffic of each vehicle when spatial correlations exist. This enables PFL to learn the long-term dependency among the local models of different clusters. As illustrated in Figure 4, the local model hyper-parameters are stored and communicated to the UAV. Using PFL, the redundant local model hyper-parameters are discarded to reduce the overall computational complexity of the proposed solution approach.
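The sketch below illustrates, under simplifying assumptions, the two updates in Equations (14) and (15): a tabular Bellman backup over a toy state space and a clipped double-critic target with a personalization term. The toy states, rewards, and transition table are invented for illustration and do not come from the paper.

```python
def clipped_target(q1, q2, expected_next_q, gamma_d=0.99, phi=0.01, h_policy=0.0):
    """Target in the spirit of Eq. (15): the minimum of two critic estimates,
    a discounted expected action value, and a personalization term phi * H."""
    return min(q1, q2) + gamma_d * expected_next_q + phi * h_policy

def bellman_update(values, rewards, transitions):
    """Eq. (14): V(s_t) = max_a { R(s_t, a_t) + V(s_{t+1}) } over available actions."""
    updated = {}
    for state, actions in transitions.items():
        updated[state] = max(
            rewards[(state, action)] + values[next_state]
            for action, next_state in actions.items()
        )
    return updated

# Toy example: two states ("near", "far") and two actions ("tx", "wait")
values = {"near": 0.0, "far": 0.0}
rewards = {("near", "tx"): 1.0, ("near", "wait"): 0.2,
           ("far", "tx"): 0.5, ("far", "wait"): 0.0}
transitions = {"near": {"tx": "far", "wait": "near"},
               "far": {"tx": "near", "wait": "far"}}
print(bellman_update(values, rewards, transitions))
print(clipped_target(q1=0.8, q2=0.9, expected_next_q=0.7))
```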

4.2. Action Set for Vehicles and UAV

The action set comprises five vectors $[a_1, a_2, a_3, a_4, a_5]$. Here, $a_1$ represents the characteristics of a packet, its current state, type of packet, and processing time. The vector $a_2$ indicates the priority of a packet. The vector $a_3$ captures a UAV’s processor utilization, queue occupancy status, queuing delay, and processing delay for a packet. The vector $a_4$ indicates the status of the UAV queue, queue length, waiting time, and the number of processed packets. The vector $a_5$ represents the local model hyper-parameters of a vehicle’s clusters and the rejected redundant local models in each TTI. The rewards accumulated by the vehicles are stored in a memory buffer used to train the global model. At slot $t$, each vehicle observes its local state and decides its next action. We quantify the estimator performance by its squared $L_2(p)$ error, which reflects the precision in estimating the UAV’s trajectory as
$$\psi(\hat{T}) \triangleq \mathbb{E}\Big[ \big\| \hat{T}_r(x) - T_r(x) \big\|^2_{L_2(p)} \Big] \tag{16}$$
where $L_2(p)$ is the $L_2$ norm. The magnitude of Equation (16) varies with all possible local model transmissions from individual vehicles and clusters to the UAV and identifies a configuration $p_\theta$ as
$$p_\theta \in \arg\max_{p_\theta \in p_\Theta} \frac{1}{N} \sum_{i=1}^{V} \mathcal{L}\big( p_\theta;\, D_{train}^{(i)},\, D_{valid}^{(i)} \big), \tag{17}$$
where $\mathcal{L}$ is the loss function, and $D_{train}^{(i)}$ and $D_{valid}^{(i)}$ are the training dataset and the validation dataset, respectively. The posterior distribution for an action selected by a vehicle pertaining to a UAV’s state is over $a \in A_v$, $v \in \{a_1, a_2, a_3, a_4, a_5\}$, and $\mathcal{H}_t$ is the history of the past heights and trajectories of the UAV up to iteration $t$. The batch selection scheme encourages diverse exploration and vehicle selection via a Poisson point process. The method has an exponential complexity in the number of samples $n$.

4.3. Personalized Federated Learning with Attention Mechanism

At the UAV, we fit the global model, denoted as $\gamma_T$, which is a set of sequential data collected and stored during repeated transmissions. A UAV’s global model hyper-parameter vector is
$$\gamma_T = \{ D_t \}_{t=1}^{T}, \tag{18}$$
where $D_t$ is the data gathered during the $t$-th TTI, and $T$ denotes the total number of coordinate changes traversed by the UAV. At each time step $t$, the UAV’s location is characterized over the hovering time $T$. The progression of states corresponds to different trajectory coordinates $\Delta(x, y)$. We model the joint distribution of the UAV’s states and past observations as
$$p_\theta(\gamma_T, \Psi_T) = \prod_{t=1}^{T} \underbrace{p_\theta(x_t \mid \psi_t)}_{\text{past trajectory}}\; \underbrace{p_\theta(\psi_t \mid \gamma_{t-1}, \Psi_{t-1})}_{\text{UAV's future trajectories}} \tag{19}$$
where $\Psi_t$ is the set of local model hyper-parameters. The transition probability in Equation (19) assumes that the UAV’s state at time $t$ depends on the past observations $(\gamma_{t-1}, \Psi_{t-1})$. This is different from the Markovian assumption, where $p_\theta(\psi_t \mid \gamma_{t-1}, \Psi_{t-1}) = p_\theta(\psi_t \mid \psi_{t-1})$, i.e., future states depend only on the current state. To capture non-Markovian dynamics using PFL and an attention mechanism, the state transitions are modeled as
$$p_\theta(\psi_t \mid \gamma_{t-1}, \Psi_{t-1}) = p_\theta(\psi_t \mid \boldsymbol{\psi}^t, \Psi_{t-1}) \tag{20}$$
where $\boldsymbol{\psi}^t = \{\psi_1^t, \ldots, \psi_{t-1}^t\}$ is a set of attention weights used to infer the statistical characteristics of the future states. The attention weights determine the influence of a UAV’s past states on the future state transitions through a non-linear mapping based on the attention mechanism [87,88,89,90,91]. The attention mechanism is commonly used in Transformer architectures and utilizes query, key, and value vectors to calculate the attention weights [87]. In PFL, a vehicle is designated to communicate with the UAV in a TTI. In that TTI, the other vehicles’ model parameter transmissions will be redundant until the next TTI. An adaptive weighting approach compares the statistical similarity in model weight parameters between different vehicles, as well as between vehicles and the UAV. If two vehicles have a high similarity, they are considered to belong to a single subset, so the UAV can communicate with either one of the two. If there are three such vehicles in a subset, the UAV can communicate with any one of these three vehicles because all three are communicating similar data. Moreover, in a TTI, more than one such subset can exist. Hence, to ensure that the UAV does not miss different types of information transmitted by different vehicles, the UAV should communicate with at least one vehicle from each subset. The attention mechanism trains the vehicles and the UAV to learn the state transitions from a sequence of past data [88]. Note that the vehicles generate local models from a limited number of data samples in each TTI. The previously learned local models are augmented with the arrival of new data, allowing the agents to adapt the current PFL model to new model updates. The attention function $A_t$ is introduced to utilize the features from the previous time steps. As the UAV’s trajectory coordinates are determined, the corresponding attention weights cluster the vehicles with similar local models within a selected TTI. This approach utilizes the past knowledge of optimal trajectory coordinates, where the UAV can serve a maximum number of vehicles while adapting to fresh local model hyper-parameters received from the vehicles.
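To illustrate this subset selection, the following sketch groups vehicles whose flattened local model weights exceed a cosine-similarity threshold and keeps one representative per group. The threshold value and the use of cosine similarity are illustrative assumptions; in the proposed approach, the adaptive weighting is realized through the learned attention weights rather than this fixed rule.

```python
import numpy as np

def select_representatives(local_weights, similarity_threshold=0.95):
    """Group vehicles with highly similar local model weights (cosine similarity
    above a threshold) and keep one representative vehicle per group, so the UAV
    still receives at least one update from every distinct data distribution."""
    unassigned = list(range(len(local_weights)))
    representatives, clusters = [], []
    while unassigned:
        head = unassigned.pop(0)
        cluster = [head]
        for v in list(unassigned):
            a, b = local_weights[head], local_weights[v]
            cosine = a @ b / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12)
            if cosine >= similarity_threshold:
                cluster.append(v)
                unassigned.remove(v)
        clusters.append(cluster)
        representatives.append(head)   # only this vehicle transmits in the current TTI
    return representatives, clusters

weights = [np.array([1.0, 2.0, 3.0]),    # vehicle 0
           np.array([1.1, 2.1, 2.9]),    # vehicle 1: similar to vehicle 0
           np.array([-3.0, 0.5, 1.0])]   # vehicle 2: distinct distribution
print(select_representatives(weights))
```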
The attention weights $\boldsymbol{\psi}^t$ over past states at time $t$ are gathered from the UAV’s context $\gamma_t$ using an attention mechanism $A_t$ as $\boldsymbol{\psi}^t = A_t(\gamma_t)$, where $A$ is a deterministic attention mechanism that leads to a sequence of functions $\{A_t\}_t$, with $A_t$ mapping the context $\gamma_t$ to attention weights in $[0, 1]^t$. Here, $A$ is a set of actions that maps a UAV’s context $\gamma_t$ to a set of attention weights $\boldsymbol{\psi}^t$. The model output is a sequence of attention weights at each time step, which enables learning the model hyper-parameter $\theta$ and inferring a UAV’s current state from the posterior distribution $p_\theta(\Psi_t \mid \gamma_t)$. Here, we utilize variational learning to jointly learn the model hyper-parameter $\theta$, and an inference network assists in approximating the posterior distribution $p_\theta(\Psi_t \mid \gamma_t)$. Using variational learning, we maximize the expected lower bound as
$$\log p_\theta(\gamma_T) \ge \mathbb{E}_{m_\phi}\big[ \log p_\theta(\gamma_T, \Psi_T) - \log m_\phi(\Psi_T \mid \gamma_T) \big] \tag{21}$$
where $m_\phi(\Psi_T \mid \gamma_T)$ is an intermediate variational distribution that approximates the posterior distribution $p_\theta(\Psi_T \mid \gamma_T)$. This intermediate variational distribution is modeled using a jointly trained inference network as
$$\phi^{*}, \theta^{*} = \arg\max_{\phi, \theta}\; \mathbb{E}_{m_\phi}\big[ \log p_\theta(\gamma_T, \Psi_T) - \log m_\phi(\Psi_T \mid \gamma_T) \big] \tag{22}$$
Hence, by estimating $\theta$ and $\phi$ from the training data, we calculate $p_\theta(\gamma_T, \Psi_T)$ to extract the environmental knowledge, and the inference network $m_\phi(\Psi_T \mid \gamma_T)$ to infer the trajectory of the UAV. The inference network $m_\phi(\Psi_T \mid \gamma_T)$ infers the structure of the posterior distribution
$$p_\theta(\Psi_T \mid \gamma_T) = p_\theta(\psi_1 \mid \gamma_T) \prod_{t=1}^{T} p_\theta(\psi_t \mid \psi_{t-1}, \Psi_{t-1}, \gamma_{t:T}) \tag{23}$$
For the inference network, for all TTIs,
$$m_\phi(\Psi_T \mid \gamma_T) = m_\phi(\psi_1 \mid \gamma_T) \prod_{t=1}^{T} m_\phi(\psi_t \mid \psi_{t-1}, \Psi_{t-1}, \gamma_{t:T}). \tag{24}$$
The attention weights generated in $A$ are used to infer the statistical characteristics of $\psi_t$. We calculate an estimate of the distribution $p_\theta(\psi_t \mid \gamma_t)$ to consider the impact of past states with their relevant contributions, which are indicated by the attention weights. We estimate the gradients via stochastic backpropagation and update the parameters of the attention mechanism from their maximum likelihood estimates. The attention function enables sharing of the parameters between the vehicles’ clusters to accelerate learning. The attention function $m_\phi(\psi_t \mid \gamma_T)$ stores the attention weights with $\Psi_{t-1}$ and $\gamma_{t:T}$ for state transitions. The inference network leads the posterior network to learn the relevant trajectory coordinates and the variance of gradient estimates. This enables us to precisely identify the UAV’s location and the vehicles to which the resources should be allocated in a TTI.
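As a concrete, simplified view of how attention weights over past UAV states can be formed and used, the following sketch computes softmax attention scores from dot products between a current-context query and past-state keys, then combines the past states. The feature dimensions, scoring rule, and the direct use of the weighted sum as a one-step trajectory estimate are illustrative assumptions; the proposed approach instead learns these weights jointly with the variational inference network.

```python
import numpy as np

def attention_weights(past_states, current_context):
    """Softmax attention over past UAV states: the weights indicate how much each
    past state contributes to inferring the next state transition."""
    keys = np.asarray(past_states, dtype=float)        # shape (t-1, d)
    query = np.asarray(current_context, dtype=float)   # shape (d,)
    scores = keys @ query / np.sqrt(query.size)        # scaled dot-product scores
    scores -= scores.max()                             # numerical stability
    weights = np.exp(scores)
    return weights / weights.sum()

def weighted_state_estimate(past_states, current_context):
    """Attention-weighted combination of past states as a one-step estimate."""
    w = attention_weights(past_states, current_context)
    return w @ np.asarray(past_states, dtype=float)

past_coordinates = [[0.0, 0.0], [1.0, 0.5], [2.0, 1.5]]   # illustrative UAV coordinates
print(weighted_state_estimate(past_coordinates, current_context=[2.0, 1.5]))
```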

5. Simulation Results and Discussion

We processed the datasets on an Amazon Elastic Compute Cloud (EC2) instance, and the PFL collaborators were constructed using the TensorFlow and TensorFlow Federated frameworks. For PFL model training, we split the V2X-Sim dataset into 60% training and 40% testing data, and for UAV trajectory prediction, we split the LTE I/Q dataset into 60% training and 40% testing data. We train the agents on approximately 25–30 randomly generated sets of training instances. Each training set consists of 50 instances, comprising model hyper-parameter transmissions from the vehicles to the UAV. The packet size was varied between 1 byte and 10 megabytes, and the number of packets was varied from 100 to 4000. The inter-arrival time interval for packets at the UAV was varied between 100 ms and 1000 ms. In each instance, the capacity of the queue was set at up to 10,000 packets in each bin. Table 2 lists the main parameters used in the simulations.
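The sketch below reproduces the 60/40 train–test split described above under stated assumptions: a random feature matrix stands in for the V2X-Sim or LTE I/Q samples, and the seed and feature dimension are arbitrary.

```python
import numpy as np

def split_train_test(samples, train_fraction=0.6, seed=42):
    """Shuffle the samples and split them 60/40 into training and testing sets,
    as used for both the V2X-Sim and LTE I/Q datasets in this section."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(samples))
    cut = int(train_fraction * len(samples))
    return samples[order[:cut]], samples[order[cut:]]

# Placeholder feature matrix standing in for the actual dataset samples
data = np.random.default_rng(0).normal(size=(1000, 16))
train, test = split_train_test(data)
print(train.shape, test.shape)   # (600, 16) (400, 16)
```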

5.1. Performance Comparison of PFL with Traditional FL Methods

In this section, we compare the performance of our proposed PFL approach with FedAvg and FedSGD for a varying number of vehicles ($V$) and varying road lengths ($R_L$). We assume a scenario with 100 vehicles. Our algorithm converged after 30 iterations and processed 100% of the local model hyper-parameter updates from all the vehicles in the FedAvg and FedSGD scenarios. However, as all the vehicles communicated their local model hyper-parameter updates to the UAV, there was a notable delay in receiving the global model hyper-parameters from the UAV. Using PFL, only a subset of vehicles in each TTI communicated their local model hyper-parameter updates to the UAV, resulting in lower latency and sojourn time.
The parameter percentage of latency violations represents the percentage of instances where the round-trip latency exceeds a predefined threshold. The benchmark threshold values are specified and periodically updated in the 3rd Generation Partnership Project (3GPP) specifications [1,92,93]. However, because of the limited computing power in our simulated environment, we did not set the threshold to a lower value between 2 ms and 5 ms, as appears in some 3GPP specifications. In our simulations, the maximum delay was around 30 ms for the FL, FedAvg, and PFL approaches. Hence, we set the latency threshold ($L_{max}$) to 30 ms, and for any instance where the delay, comprising queuing delay plus processing delay, exceeded 30 ms, we assumed the packet was either dropped or retransmitted. In an iteration, if the delay exceeds the $L_{max}$ of 30 ms, we consider that an instance of latency violation. As the number of vehicles was increased, a larger number of packets exceeded the latency threshold $L_{max}$ of 30 ms for the queuing delay and processing delay metrics; this is plotted as the percentage of latency violations in Figure 5 and Figure 6. It was noted that the percentage of latency violations was around 30% for PFL and 15% for PFL with attention mechanism. The 50% reduction in the percentage of latency violations for PFL with attention mechanism validates the advantage of using attention weights and utilizing dominant states for UAV trajectory prediction. In PFL, a minor increase in delay is observed compared to PFL with attention mechanism to ensure that the UAV’s processing capacity is not exceeded and the queue does not overflow. The proposed PFL approach ensures that whenever the latency threshold constraint is breached, the next set of arriving packets is dropped. If the UAV’s queue or buffer becomes available in this TTI, the dropped packets are retransmitted. Otherwise, the next set of packets is transmitted in the next TTI. Some other recent works that have utilized a threshold latency as a metric for performance evaluation are [94,95].
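A minimal sketch of how this latency-violation metric can be computed is given below; the per-packet queuing and processing delays are illustrative values, and $L_{max}$ = 30 ms follows the setting described above.

```python
import numpy as np

def latency_violation_percentage(queuing_ms, processing_ms, l_max_ms=30.0):
    """Percentage of packets whose queuing-plus-processing delay exceeds L_max.
    Packets above the threshold are treated as dropped or retransmitted."""
    total_delay = np.asarray(queuing_ms, dtype=float) + np.asarray(processing_ms, dtype=float)
    return 100.0 * float(np.mean(total_delay > l_max_ms))

# Illustrative per-packet delays (ms)
queuing = [12.0, 20.0, 25.0, 18.0]
processing = [5.0, 15.0, 10.0, 4.0]
print(latency_violation_percentage(queuing, processing))   # 50.0
```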
The percentage of latency violations in PFL and conventional FL is illustrated in Figure 5 and Figure 6. Here, we vary the number of vehicles (V) from 1 to 100; the vehicle velocity is set as 60 km/h and 80 km/h; and R L is set as 2 km and 4 km. The local learning rate for each vehicle is initially set to 0.005 and increased gradually with periodic increments. The number of global model and local model update iterations is set to 2000 and 1000, respectively. In each training round in PFL, 40–70% of vehicles are selected to participate in the training of the global model at the UAV. The subset of vehicles selected by the UAV is based on the similarity of data distributions in the model hyper-parameters. Out of two vehicles with high similarity in their data distributions, only one vehicle is selected by the UAV. This saves redundant computations and lowers latency.
As noted from Figure 5 and Figure 6, the percentage of latency violations lies between 25 and 30% for FedAvg and between 23 and 31% for FedSGD. For the PFL approach, for a similar number of vehicles, the percentage of latency violations lies between 18 and 28%, indicating a gain in performance. PFL uses an adaptive weighting approach that compares the statistical similarity in model weight parameters between different vehicles, as well as between vehicles and the UAV. If two vehicles have a high similarity, they are treated as a single cluster, so the UAV can communicate with either of the two vehicles. If there are three such vehicles in a cluster, the UAV can communicate with any one of these three vehicles, as all three are communicating similar data. Moreover, in a TTI, more than one such cluster can exist. Hence, to avoid missing different types of information transmitted by different vehicles, the UAV should communicate with at least one vehicle from each cluster. The lower latency in packet transmissions is further noted from Figure 7. The average delay experienced by a packet in PFL and conventional FL is illustrated in Figure 7.

5.2. Performance Comparison of Attention-Mechanism-Based PFL with Traditional FL Methods

In this section, we compare the performance of the proposed attention mechanism-based PFL approach with the traditional FL methods, particularly FedAvg and FedSGD, when varying the number of vehicles (V) and varying road length ( R L ). The percentage of latency violations in PFL with attention mechanism and conventional FL and the average delay experienced by a packet are illustrated in Figure 8. During the local model hyper-parameter updates, each local model weight is normalized to a value between 0.20 and 0.85 to measure the variance in the data distribution during the dataset partitioning. The FedAvg and FedSGD algorithms emphasize vehicle-specific model hyper-parameter transmissions to the UAV. As illustrated, PFL with attention mechanism also significantly outperforms FedAvg and FedSGD as well as PFL, which implies that the attention mechanism leverages the similarity of data distributions in clusters of vehicles to improve the performance over heterogeneous data.
It is also observed that a vehicle that accumulates a higher reward maintains a higher probability of remaining in a cluster during all TTIs. The higher reward of a vehicle reflects its consistent presence in a PFL cluster and model transmission to the UAV, which improves the global model’s convergence rate. The consistency of local model updates and transmission to the UAV increases the accumulated reward and leads to quicker local model convergence. However, the trend is adversely impacted as the number of vehicles increases, as more vehicles complete their local model hyper-parameter updates and compete to gain access to the UAV’s resources, i.e., bandwidth and available battery power, over state–action pairs. Compared to PFL, where the percentage of latency violations lies between 18 and 28%, the percentage of latency violations for PFL with attention mechanism decreases considerably. As noted from Figure 8, the percentage of latency violations for PFL with attention mechanism lies between 12 and 20%, marking a steep decline of 15% compared to PFL.
The variation in average latency (ms) in the FedSGD and PFL with attention mechanism scenarios is illustrated in Figure 9. Figure 10 illustrates the variation in average packet delay (ms) in PFL for a varying number of vehicles (V) for different road lengths ( R L ). As illustrated in Figure 10, when the road length ( R L ) is increased to 4 km, the delay exceeds the latency threshold ( L m a x ) of 30 ms. From Figure 9 and Figure 10, we observe that there is an approximate 20% reduction in latency for the same number of vehicles that are served by the UAV in a given TTI when utilizing the trajectory predictions of the PFL model with attention mechanism. The attention mechanism leads to improved trajectory prediction compared to conventional FL as well as PFL. Also, PFL with attention mechanism handles the data heterogeneity effectively. Figure 11 illustrates the variations in average latency (ms) in the FedAvg, FedSGD, PFL, and PFL with attention mechanism scenarios with a varying number of vehicles (V) for road length ( R L ) = 2 km. Figure 11 illustrates the outputs of Figure 8 and Figure 9 on a single plot for comparative analysis. It is observed from Figure 11 that the average latency achieved in the case of FedAvg and FedSGD over a road length ( R L ) of 2 km for varying vehicle velocities is between 16 ms and 24 ms. For the same vehicle velocities and road length, the average latency for PFL and PFL with attention mechanism is between 10 ms and 18 ms. Hence, PFL and PFL with attention mechanism lead to an approximately 30% reduction in latency, that comprises queuing delay and processing delay, compared to conventional FL approaches such as FedAvg and FedSGD. However, a significant discernible difference in the latency of PFL and PFL with attention mechanism is not observed in Figure 11. This is attributed to the delay accumulated in calculating the attention weights and to identifying the dominant local model hyper-parameters to be transmitted from vehicles to the UAV. Although this difference in the two approaches is negligible over a smaller road length, this difference may be larger and significant in real-world traffic scenarios.

5.3. Variation in UAV’s Transmit Power (dBm) with Number of Vehicles (V)

The variation in the UAV’s transmit power (dBm) vs. the number of vehicles ($V$) for FedAvg and PFL is illustrated in Figure 12. The variation in the UAV’s transmit power (dBm) vs. the number of vehicles ($V$) for FedSGD and PFL with attention mechanism is illustrated in Figure 13 and Figure 14 for $V$ = 1–100. We use the Adam optimizer to update the inference network. We set the UAV’s target region to an elliptical area where 100 vehicles are randomly and uniformly distributed on the road, with a direct LoS to the UAV. We set the initial coordinates of the UAV to [15, 15], [80, 80], [15, 80], and [80, 15] m. The UAV’s coordinates along the x-axis and y-axis are bounded by an elliptical path, and the altitude along the z-axis is varied between 100 m and 2 km. Within the elliptical path, the UAV can traverse the trajectory coordinates from a theoretically infinite number of possible paths, distributed as a Poisson point process.
It is noted from Figure 12 that for FedAvg, for 500 ms of training iterations, the UAV power consumption is approximately 14 dBm. For a similar duration of training iterations, the UAV power consumption is approximately 8 dBm in the PFL scenario, implying a 40% reduction in battery power consumption. This reduction in the UAV’s battery power consumption is due to the fact that a reduced number of vehicles compete for the UAV’s communication and computation resources in PFL. When the vehicle velocity is increased to 80 km/h and the road length ($R_L$) is increased to 2 km, it can be noted that PFL with attention mechanism leads to a UAV battery power consumption similar to that of PFL at lower vehicle velocity. It is evident from Figure 12 that the UAV power consumption is higher in the conventional FL scenario since all the vehicles transmit their local model hyper-parameters to the UAV. Conversely, in the PFL scenario, the UAV power consumption is marginally lower as only a select number of vehicles from a subset are required to transmit their local model hyper-parameters to the UAV.
Additionally, each vehicle generates multiple packets in each TTI. The UAV moves around certain areas, since its coverage range is limited to an elliptical path and it should serve a maximum number of vehicles to increase the fairness index and maximize its reward. With target trajectory prediction, the UAV can arrive at the next set of coordinates from where it can be in the LoS of the maximum number of vehicles. From the trend in Figure 13 and Figure 14, when using PFL with attention mechanism, we observe a reduction of approximately 20% in the UAV’s transmit power and an approximate 15% time reduction in model convergence. For a similar environment, PFL offers a reduction of 15% in the UAV’s transmit power and energy or a 10.5% time reduction in model convergence. Hence, using PFL, improved prediction accuracy in the UAV’s trajectory leads to an approximate reduction of 15% in the UAV’s transmit power, with a 20% reduction in communication overhead from vehicles. Combining PFL with attention mechanism leads to a further reduction in the UAV’s transmit power as well as a reduction in the communication overhead from vehicles.
Figure 15 illustrates the variation in the UAV’s transmit power (dBm) with $V$ for varying vehicle speed and road length ($R_L$) for the FedAvg, FedSGD, PFL, and PFL with attention mechanism scenarios over model training time slots. Figure 15 illustrates the outputs of Figure 12 and Figure 13 on a single plot for comparative analysis. It is observed from Figure 15 that the UAV’s average transmit power (dBm) is approximately 14 dBm for FedSGD and 8 dBm for the PFL mechanism. This implies an approximate 40% reduction in the UAV’s transmit power when using PFL. For PFL with attention mechanism, the transmit power is approximately 10 dBm. This slight increase in the UAV’s transmit power compared to PFL is due to the fact that a portion of the UAV’s power is dissipated in selecting a vehicle for local model hyper-parameter transmission from a subset of vehicles. This selection is based on the attention weights that allow the UAV to identify a vehicle with dominant local model hyper-parameters to be transmitted to the UAV. Although this difference between the two approaches, i.e., PFL and PFL with attention mechanism, is negligible over a smaller road length, the difference may be larger and significant in real-world traffic scenarios.

5.4. Variation in Average Mean Square Error (MSE) Losses with Personalization Parameter ( φ ) Using PFL and PFL with Attention Mechanism

The variation in normalized MSE losses with number of training episodes for the PFL and PFL with attention mechanism scenarios is illustrated in Figure 16 and Figure 17 for V = 1–100. We evaluate the effectiveness of the proposed PFL model aggregation approach.
The UAV’s transmit power and bandwidth limits are considered along with the vehicles’ wireless resource limitations. As the selected vehicles upload their local model hyper-parameters for aggregation, PFL requires more computation and bandwidth resources than the PFL approach with attention mechanism. It is observed from the model convergence characteristics that for the available battery power at the UAV, the MSE of PFL is reduced by approximately 6.75% for V = 50 , by 8.5% for V = 100 , and by 12.5% for PFL with attention mechanism, in comparison with the conventional FL approaches. For the UAV’s optimal trajectory prediction with PFL and PFL with attention mechanism on the LTE I/Q dataset, the MSE is reduced by 9.5% for V = 50 , 5.5% for V = 100 , and 3.75% for PFL with attention mechanism, in comparison with the conventional FL approaches. The above results demonstrate the superior performance of our proposed PFL method. Here, we set the learning rate to 0.001 and utilize the Adam optimizer to update the model in subsequent iterations.
To verify the advantages of PFL with attention mechanism, we compare its performance with plain PFL, in which each local loss function is regularized to address data heterogeneity. Moreover, in PFL, the selected vehicles upload the entire model to the UAV for aggregation in each round, and in each round the selected vehicles sequentially train the feature extractor and the predictor. As a result, the PFL algorithm converged after approximately 50 iterations, whereas PFL with attention mechanism converged after approximately 25 iterations. We selected 60% of the vehicles such that at least one vehicle was chosen from each cluster, and the personalization parameter was varied for the remaining vehicles in a cluster. Each vehicle's personalization parameter was tuned to select among the lower-dimensional cluster-wise embedding vectors and to train the model in fewer iterations. We also verified that this tunable parameter, which balances training performance against the UAV's power consumption for a varying number of vehicles, affects the average delay.
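The cluster-constrained selection mentioned above (roughly 60% of the vehicles, with at least one vehicle from every cluster) could be realized along the following lines; the function name and the uniform random fill of the remaining slots are illustrative assumptions.

```python
import random
from collections import defaultdict

def select_vehicles(cluster_of, fraction=0.6, seed=0):
    """Pick approximately `fraction` of all vehicles while guaranteeing that
    every cluster contributes at least one vehicle.

    cluster_of : dict mapping vehicle id -> cluster id
    """
    rng = random.Random(seed)
    by_cluster = defaultdict(list)
    for v, c in cluster_of.items():
        by_cluster[c].append(v)

    # One mandatory representative per cluster
    selected = {rng.choice(members) for members in by_cluster.values()}

    # Fill the remaining slots uniformly at random
    target = max(len(selected), int(round(fraction * len(cluster_of))))
    remaining = [v for v in cluster_of if v not in selected]
    rng.shuffle(remaining)
    selected.update(remaining[: target - len(selected)])
    return sorted(selected)

# Example: 10 vehicles spread over 3 clusters
clusters = {v: v % 3 for v in range(10)}
print(select_vehicles(clusters))  # 6 of 10 vehicles, with every cluster covered
```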
As illustrated in Figure 18, as the number of vehicles and the vehicle velocity increase, the vehicles consume more energy, resulting in more data transmissions being scheduled to the UAV. On the V2X-Sim dataset, PFL yields 4.50% and 3.50% improvements in delay for φ = 0.001 and φ = 0.01, respectively, and a slight improvement in delay for φ = 0.1, compared with the traditional FL algorithms. The simulation results reveal that the proposed approach can schedule fewer data samples and still achieve higher learning accuracy. This performance gain is derived from the joint optimization of vehicle selection, bandwidth allocation, communication and computation time, and TTI allocation policies. It is noted that if φ is too large, there is less emphasis on the UAV's power consumption and more vehicles are clustered together, which increases both the UAV's power consumption and the delay. Thus, the value of the personalization parameter (φ) should be adjusted to optimize the training performance while satisfying the UAV's transmit power constraints and the overall delay budget. As we increased φ, the model convergence time improved gradually. Figure 18 also implies that the attention mechanism maintains personalization through an adaptive approach. PFL also outperforms the traditional FL methods in terms of average delay for a varying number of vehicles and road lengths.

Figure 19 illustrates the variation in normalized MSE losses with the number of training episodes for varying values of the personalization parameter φ. As φ is gradually increased from 0.0000001 to 0.01 in multiples of 10, the MSE losses tend to decrease. For all values of φ, the PFL model tends to converge after approximately 200 training episodes. However, for the larger values φ = 0.001 and φ = 0.01, the initial MSE losses are slightly lower than for the smaller values φ = 0.0001 and φ = 0.00001. This implies a reduction in model convergence time and hence in the UAV's power consumption when computing global model hyper-parameters from local model hyper-parameters. Furthermore, the local model hyper-parameters generated using the attention mechanism add approximately 10% more attention-weight parameters compared with the hyper-parameters of the plain PFL model. Although PFL uses a slightly smaller ratio of personalized parameters, it significantly outperforms FedAvg and FedSGD on latency and the UAV's power consumption metrics. Hence, PFL leads to a significant improvement in the models' ability to overcome data heterogeneity. The decrease in the MSE losses for PFL and PFL with attention mechanism indicates that the attention mechanism further improves the performance of PFL.
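One compact way to express the role of the personalization parameter φ is as the weight of a proximal term that pulls each vehicle's local model toward its cluster or global reference, swept over the values used in Figure 19. This follows a common PFL formulation; the exact local objective used in the paper may differ, so the snippet below is only a sketch under that assumption.

```python
import torch

def personalized_loss(pred, target, local_params, ref_params, phi):
    """Task loss plus a phi-weighted proximal term pulling the local model
    toward the cluster/global reference (a common PFL formulation; the exact
    regularizer used in the paper may differ)."""
    task = torch.nn.functional.mse_loss(pred, target)
    prox = sum(((p - r) ** 2).sum() for p, r in zip(local_params, ref_params))
    return task + 0.5 * phi * prox

# Toy usage: sweep phi over the values reported in Figure 19
pred, target = torch.randn(8, 1), torch.randn(8, 1)
local_p = [torch.randn(4, requires_grad=True)]
ref_p = [torch.zeros(4)]
for phi in [1e-7, 1e-6, 1e-5, 1e-4, 1e-3, 1e-2]:
    loss = personalized_loss(pred, target, local_p, ref_p, phi)
    print(f"phi = {phi:g}, loss = {loss.item():.4f}")
```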

5.5. Discussion and Comparison of the Proposed PFL Approach with Existing Works

In Table 3, we compare our proposed PFL-based solution with previous FL and DRL approaches. Whereas earlier state-of-the-art FL and DRL methods achieved an approximate 50% reduction in model convergence time, our PFL-based method achieved an approximate 40% reduction in the number of local model hyper-parameters transmitted from the vehicles to the UAV for a similar convergence time and a similar number of training episodes. The proposed PFL approach also reduced the communication overhead between the vehicles and the UAV. Moreover, unlike existing FL approaches, where joint learning from multiple datasets leads to a higher model convergence time, the proposed PFL approach reduces the UAV's power consumption in a TTI. This is attributed to the fact that, in PFL, rather than communicating with every vehicle, the UAV only communicates with selected vehicles in a subset in each TTI. Another observation for PFL with attention mechanism was that the average latency and the model convergence time were lower by approximately 20% compared with FedAvg and FedSGD. This also implies that PFL is suited to scenarios with non-i.i.d. training data. To fine-tune the performance of our model, we introduced a personalization parameter and noted that PFL consistently improved the latency and the UAV's power consumption by 10–15% compared with FedAvg and FedSGD.

6. Conclusions

This paper proposed a PFL approach to maximize the available UAV battery power and minimize the sojourn time in UAV-assisted C-V2X communications. The proposed PFL approach utilizes an attention mechanism to model aggregation errors and to ensure adaptive, personalized vehicle selection in an uncertain and time-varying environment, subject to bandwidth, UAV power, and vehicle QoS constraints. Furthermore, using the attention mechanism, our PFL model achieved a reduction in model training time as the number of vehicles increased, compared with existing FL methods. The PFL mechanism addresses the straggler effect by quantifying the contribution of the vehicles participating in model computation and training, which enables a balance between model accuracy and straggler impact by dynamically managing the straggler rate. To handle packets of varying size, arrival rate, and service rate, we proposed automating the UAV's selection of vehicles through attentive state-space modeling in PFL. This is achieved by clustering the vehicles to model the similarities between the datasets in a cluster. The experimental findings reveal a greater than 25% reduction in training time and a 20% increase in model accuracy across varying non-i.i.d. data scenarios. We propose using PFL with attention mechanism to collaborate with vehicles with similar data distributions through attention-based grouping. PFL achieves approximately 20% lower latency than conventional FL methods such as FedAvg and FedSGD. In the future, we aim to account for the complexity of assessing changes in the channel state when multi-modal datasets are used. The proposed approach can also be extended to predict outcomes such as the recurrence of similar data transmissions and to incorporate channel state-based predictive information. In addition, future research includes extending the proposed PFL framework to adaptive optimization methods or group-based training methods.
Moreover, in future work, the authors aim to conduct an in-depth computational complexity analysis of the proposed approach, since model training complexity can act as a severe bottleneck for real-world deployment of PFL approaches. Furthermore, introducing a channel state estimator into the system model is expected to help reduce the model convergence time. Other challenges in applying PFL and the attention mechanism to UAV-assisted vehicular communications include tracking changes in the UAV and vehicle states as the instantaneous channel characteristics vary. The feasibility of utilizing reward functions based on channel estimation is another avenue for further work.

Author Contributions

Conceptualization, A.G. and X.F.; methodology, A.G.; writing—original draft preparation, A.G.; writing—review and editing, X.F.; supervision, X.F.; funding acquisition, X.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
3GPP: 3rd Generation Partnership Project
6G: Sixth-generation (communication networks)
A3C: Asynchronous advantage actor-critic
AoI: Age of information
BCD: Block coordinate descent
BPP: Bin-packing problem
BS: Base station
CSI: Channel state information
DDPG: Deep deterministic policy gradient
DRL: Deep reinforcement learning
FedAvg: Federated averaging
FedSGD: Federated stochastic gradient descent
FGPR: Federated Gaussian process regression
FL: Federated learning
FRL: Federated reinforcement learning
G-FML: Group-based federated meta-learning framework
GMM: Gaussian mixture model
HFL: Hierarchical federated learning
HMM: Hidden Markov model
i.i.d.: Independent and identically distributed
ITS: Intelligent transportation system
LoS: Line of sight
MBS: Macro base station
MCC: Mobile cloud computing
MDP: Markov decision process
MEC: Mobile edge computing
Non-i.i.d.: Non-independent and identically distributed
NLoS: Non-line-of-sight
OFDM: Orthogonal frequency division multiplexing
OTFS: Orthogonal time frequency space
PFL: Personalized federated learning
QoS: Quality of service
RL: Reinforcement learning
SINR: Signal-to-interference noise ratio
TTI: Transmission time interval
UAV: Unmanned aerial vehicle

References

  1. Bazzi, A.; Cecchini, G.; Zanella, A.; Masini, B.M. Study of the Impact of PHY and MAC Parameters in 3GPP C-V2X Mode 4. IEEE Access 2018, 6, 71685–71698. [Google Scholar] [CrossRef]
  2. Zhu, J.; Song, Y.; Tang, Y.; Jin, T.; Liu, W. Performance Trade-off in Waveform Design for Dual-function Radar and Communication System. IEEE Wirel. Commun. Lett. 2024, 13, 74–78. [Google Scholar] [CrossRef]
  3. Wang, W.; Fei, Z.; Guo, J.; Durrani, S.; Yanikomeroglu, H. Outage Performance of Multi-Tier UAV Communication with Random Beam Misalignment. IEEE Internet Things J. 2023, 11, 4163–4178. [Google Scholar] [CrossRef]
  4. Wang, T.; Du, W.; Jiang, C.; Li, Y.; Zhang, H. Safety Constrained Trajectory Optimization for Completion Time Minimization for UAV Communications. IEEE Internet Things J. 2024, 11, 34482–34491. [Google Scholar] [CrossRef]
  5. Du, Y.; Liu, Y.; Han, K.; Jiang, J.; Wang, W.; Chen, L. Multi-User and Multi-Target Dual-Function Radar-Communication Waveform Design: Multi-Fold Performance Tradeoffs. IEEE Trans. Green Commun. Netw. 2023, 7, 483–496. [Google Scholar] [CrossRef]
  6. Duan, Q.; Huang, J.; Hu, S.; Deng, R.; Lu, Z.; Yu, S. Combining Federated Learning and Edge Computing Toward Ubiquitous Intelligence in 6G Network: Challenges, Recent Advances, and Future Directions. IEEE Commun. Surv. Tutor. 2023, 25, 2892–2950. [Google Scholar] [CrossRef]
  7. Yuan, J.; Chen, G.; Wen, M.; Tafazolli, R.; Panayirci, E. Secure Transmission for THz-Empowered RIS-Assisted Non-Terrestrial Networks. IEEE Trans. Veh. Technol. 2023, 72, 5989–6000. [Google Scholar] [CrossRef]
  8. Wu, W.; Yang, Q.; Li, B.; Kwak, K.S. Adaptive Resource Allocation Algorithm of Lyapunov Optimization for Time-Varying Wireless Networks. IEEE Commun. Lett. 2016, 20, 934–937. [Google Scholar] [CrossRef]
  9. Al-Quraan, M.; Mohjazi, L.; Bariah, L.; Centeno, A.; Zoha, A.; Arshad, K.; Assaleh, K.; Muhaidat, S.; Debbah, M.; Imran, M.A. Edge-Native Intelligence for 6G Communications Driven by Federated Learning: A Survey of Trends and Challenges. IEEE Trans. Emerg. Top. Comput. Intell. 2023, 7, 957–979. [Google Scholar] [CrossRef]
  10. Noman, H.M.F.; Hanafi, E.; Noordin, K.A.; Dimyati, K.; Hindia, M.N.; Abdrabou, A.; Qamar, F. Machine Learning Empowered Emerging Wireless Networks in 6G: Recent Advancements, Challenges & Future Trends. IEEE Access 2023, 11, 83017–83051. [Google Scholar]
  11. Liu, S.; Yu, J.; Deng, X.; Wan, S. FedCPF: An Efficient Communication Federated Learning Approach for Vehicular Edge Computing in 6G Communication Networks. IEEE Trans. Intell. Transp. Syst. 2022, 23, 1616–1629. [Google Scholar] [CrossRef]
  12. Manias, D.M.; Shami, A. Making a Case for Federated Learning in the Internet of Vehicles and Intelligent Transportation Systems. IEEE Netw. 2021, 35, 88–94. [Google Scholar] [CrossRef]
  13. Li, X.; Cheng, L.; Sun, C.; Lam, K.Y.; Wang, X.; Li, F. Federated-Learning-Empowered Collaborative Data Sharing for Vehicular Edge Networks. IEEE Netw. 2021, 35, 116–124. [Google Scholar] [CrossRef]
  14. Sun, Z.; Yang, H.; Li, C.; Yao, Q.; Teng, Y.; Zhang, J.; Liu, S.; Li, Y.; Vasilakos, A.V. A Resource Allocation Scheme for Edge Computing Network in Smart City Based on Attention Mechanism. ACM Trans. Sens. Netw. 2024. [CrossRef]
  15. Shinde, S.S.; Tarchi, D. Joint Air-Ground Distributed Federated Learning for Intelligent Transportation Systems. IEEE Trans. Intell. Transp. Syst. 2023, 24, 1–16. [Google Scholar] [CrossRef]
  16. Huang, Y.; Cui, M.; Zhang, G.; Chen, W. Bandwidth, Power and Trajectory Optimization for UAV Base Station Networks with Backhaul and User QoS Constraints. IEEE Access 2020, 8, 67625–67634. [Google Scholar] [CrossRef]
  17. Qiao, D.; Liu, G.; Guo, S.; He, J. Adaptive Federated Learning for Non-Convex Optimization Problems in Edge Computing Environment. IEEE Trans. Netw. Sci. Eng. 2022, 9, 3478–3491. [Google Scholar] [CrossRef]
  18. Ma, G.; Bian, Y.; Qin, H.; Yin, C.; Chen, C.; Li, S.E.; Li, K. Advance-FL: A3C-based Adaptive Asynchronous Online Federated Learning for Vehicular Edge Cloud Computing Networks. IEEE Trans. Intell. Veh. 2024, 9, 6971–6989. [Google Scholar] [CrossRef]
  19. Shen, S.; Shen, G.; Dai, Z.; Zhang, K.; Kong, X.; Li, J. Asynchronous Federated Deep-Reinforcement-Learning-Based Dependency Task Offloading for UAV-Assisted Vehicular Networks. IEEE Internet Things J. 2024, 11, 31561–31574. [Google Scholar] [CrossRef]
  20. Liu, Y.; Ge, Q.; Luo, W.; Huang, Q.; Zou, L.; Wang, H.; Li, X.; Liu, C. GraphMM: Graph-Based Vehicular Map Matching by Leveraging Trajectory and Road Correlations. IEEE Trans. Knowl. Data Eng. 2024, 36, 184–198. [Google Scholar] [CrossRef]
  21. Wang, S.; Wang, W.; Huang, S.; Han, Y.; Wei, F.; Yin, B. Nowcasting the Vehicular Control Delay From Low-Ping Frequency Trajectories via Incremental Hypergraph Learning. IEEE Trans. Veh. Technol. 2024, 73, 185–199. [Google Scholar] [CrossRef]
  22. Xue, H.; Zhang, D.; Wu, C.; Ji, Y.; Long, S.; Wang, C.; Sato, T. Parameter Estimation-Aided Edge Server Selection Mechanism for Edge Task Offloading. IEEE Trans. Veh. Technol. 2024, 73, 2506–2519. [Google Scholar] [CrossRef]
  23. Park, S.; Park, C.; Jung, S.; Kim, J.H.; Kim, J. Workload-Aware Scheduling using Markov Decision Process for Infrastructure-Assisted Learning-Based Multi-UAV Surveillance Networks. IEEE Access 2023, 11, 1. [Google Scholar] [CrossRef]
  24. Luo, Q.; Luan, T.H.; Shi, W.; Fan, P. Deep Reinforcement Learning Based Computation Offloading and Trajectory Planning for Multi-UAV Cooperative Target Search. IEEE J. Sel. Areas Commun. 2023, 41, 504–520. [Google Scholar] [CrossRef]
  25. Chen, S.; Jin, T.; Xia, Y.; Li, X. Metadata and Image Features Co-Aware Semi-Supervised Vertical Federated Learning with Attention Mechanism. IEEE Trans. Veh. Technol. 2024, 73, 2520–2532. [Google Scholar] [CrossRef]
  26. Shaik, T.; Tao, X.; Li, L.; Xie, H.; Cai, T.; Zhu, X.; Li, Q. FRAMU: Attention-Based Machine Unlearning Using Federated Reinforcement Learning. IEEE Trans. Knowl. Data Eng. 2024, 36, 5153–5167. [Google Scholar] [CrossRef]
  27. Qureshi, K.I.; Wang, L.; Xiong, X.; Lodhi, M.A. Asynchronous Federated Learning for Resource Allocation in Software-Defined Internet of UAVs. IEEE Internet Things J. 2024, 11, 20899–20911. [Google Scholar] [CrossRef]
  28. Liu, R.; Liu, A.; Qu, Z.; Xiong, N.N. An UAV-Enabled Intelligent Connected Transportation System with 6G Communications for Internet of Vehicles. IEEE Trans. Intell. Transp. Syst. 2023, 24, 2045–2059. [Google Scholar] [CrossRef]
  29. Nguyen, T.V.; Le, H.D.; Pham, A.T. On the Design of RIS-UAV Relay-Assisted Hybrid FSO/RF Satellite-Aerial-Ground Integrated Network. IEEE Trans. Aerosp. Electron. Syst. 2023, 59, 1–15. [Google Scholar] [CrossRef]
  30. Hosseini, M.; Ghazizadeh, R. Stackelberg Game-Based Deployment Design and Radio Resource Allocation in Coordinated UAVs-Assisted Vehicular Communication Networks. IEEE Trans. Veh. Technol. 2023, 72, 1196–1210. [Google Scholar] [CrossRef]
  31. A., G.P.W.N.B.; Samarasinghe, T.; Haapola, J. Performance Enhancement of C-V2X Mode 4 Utilizing Multiple Candidate Single-Subframe Resources. IEEE Trans. Intell. Transp. Syst. 2023, 24, 15328–15333. [Google Scholar]
  32. Song, R.; Zhou, L.; Lyu, L.; Festag, A.; Knoll, A. ResFed: Communication-Efficient Federated Learning With Deep Compressed Residuals. IEEE Internet Things J. 2024, 11, 9458–9472. [Google Scholar] [CrossRef]
  33. Hao, W.; Mehta, N.; Liang, K.J.; Cheng, P.; El-Khamy, M.; Carin, L. WAFFLe: Weight Anonymized Factorization for Federated Learning. IEEE Access 2022, 10, 49207–49218. [Google Scholar] [CrossRef]
  34. Li, Z.; Lu, J.; Luo, S.; Zhu, D.; Shao, Y.; Li, Y.; Zhang, Z.; Wang, Y.; Wu, C. Towards Effective Clustered Federated Learning: A Peer-to-peer Framework with Adaptive Neighbor Matching. IEEE Trans. Big Data 2024, 10, 812–826. [Google Scholar] [CrossRef]
  35. Yue, X.; Kontar, R. Federated Gaussian Process: Convergence, Automatic Personalization and Multi-Fidelity Modeling. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 4246–4261. [Google Scholar] [CrossRef]
  36. Chafii, M.; Bariah, L.; Muhaidat, S.; Debbah, M. Twelve Scientific Challenges for 6G: Rethinking the Foundations of Communications Theory. IEEE Commun. Surv. Tutor. 2023, 25, 868–904. [Google Scholar] [CrossRef]
  37. Li, H.; Cai, Z.; Wang, J.; Tang, J.; Ding, W.; Lin, C.T.; Shi, Y. FedTP: Federated Learning by Transformer Personalization. IEEE Trans. Neural Netw. Learn. Syst. 2023, PP, 1–15. [Google Scholar] [CrossRef]
  38. Xu, X.; Feng, G.; Qin, S.; Liu, Y.; Sun, Y. Joint UAV Deployment and Resource Allocation: A Personalized Federated Deep Reinforcement Learning Approach. IEEE Trans. Veh. Technol. 2024, 73, 1–14. [Google Scholar] [CrossRef]
  39. Gao, F.; Zhang, C.; Qiao, J.; Li, K.; Cao, Y. Communication-Efficient Wireless Traffic Prediction with Federated Learning. Mathematics 2024, 12, 2539. [Google Scholar] [CrossRef]
  40. Ji, S.; Jiang, W.; Walid, A.; Li, X. Dynamic Sampling and Selective Masking for Communication-Efficient Federated Learning. IEEE Intell. Syst. 2022, 37, 27–34. [Google Scholar] [CrossRef]
  41. Gupta, V.; Luqman, A.; Chattopadhyay, N.; Chattopadhyay, A.; Niyato, D. TravellingFL: Communication Efficient Peer-to-Peer Federated Learning. IEEE Trans. Veh. Technol. 2024, 73, 5005–5019. [Google Scholar] [CrossRef]
  42. Yu, F.; Lin, H.; Wang, X.; Garg, S.; Kaddoum, G.; Singh, S.; Hassan, M.M. Communication-Efficient Personalized Federated Meta-Learning in Edge Networks. IEEE Trans. Netw. Serv. Manag. 2023, 20, 1558–1571. [Google Scholar] [CrossRef]
  43. Gao, Y.; Wang, P.; Liu, L.; Zhang, C.; Ma, H. Configure Your Federation: Hierarchical Attention-enhanced Meta-Learning Network for Personalized Federated Learning. ACM Trans. Intell. Syst. Technol. 2023, 14, 1–24. [Google Scholar] [CrossRef]
  44. Liu, S.; Li, Q.; Pandharipande, A.; Ge, X. Personalized Collaborative Edge Caching with Federated Transfer Deep Reinforcement Learning. IEEE Commun. Lett. 2024, 28, 2096–2100. [Google Scholar] [CrossRef]
  45. Yu, Y.; Chen, D.; Tang, X.; Song, T.; Hong, C.S.; Han, Z. Incentive Framework for Cross-Device Federated Learning and Analytics With Multiple Tasks Based on a Multi-Leader-Follower Game. IEEE Trans. Netw. Sci. Eng. 2022, 9, 3749–3761. [Google Scholar] [CrossRef]
  46. Vettoruzzo, A.; Bouguelia, M.R.; Vanschoren, J.; Rognvaldsson, T.; Santosh, K. Advances and Challenges in Meta-Learning: A Technical Review. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 4763–4779. [Google Scholar] [CrossRef] [PubMed]
  47. Yang, P.; Cao, X.; Quek, T.Q.S.; Wu, D.O. Networking of Internet of UAVs: Challenges and Intelligent Approaches. IEEE Wirel. Commun. 2022, 31, 156–163. [Google Scholar] [CrossRef]
  48. Pan, W.; Wang, X.; Zhou, P.; Lin, W. Time-Sensitive Federated Learning with Heterogeneous Training Intensity: A Deep Reinforcement Learning Approach. IEEE Trans. Emerg. Top. Comput. Intell. 2024, 8, 1402–1415. [Google Scholar] [CrossRef]
  49. Xu, Z.; Qiao, H.; Liang, W.; Xu, Z.; Xia, Q.; Zhou, P.; Rana, O.F.; Xu, W. Flow-Time Minimization for Timely Data Stream Processing in UAV-Aided Mobile Edge Computing. ACM Trans. Sens. Netw. 2024, 20, 1–28. [Google Scholar] [CrossRef]
  50. Ali, W.; Ammad-ud din, M.; Zhou, X.; Zhang, Y.; Shao, J. Communication-Efficient Federated Neural Collaborative Filtering with Multi-Armed Bandits. ACM Trans. Recomm. Syst. 2024. [Google Scholar] [CrossRef]
  51. Wu, H.; Gu, A.; Liang, Y. Federated Reinforcement Learning-Empowered Task Offloading for Large Models in Vehicular Edge Computing. IEEE Trans. Veh. Technol. 2024, 74, 1979–1991. [Google Scholar] [CrossRef]
  52. Moreira, I.; Pimentel, C.; Barros, F.P.; Chaves, D.P.B. Modeling Fading Channels With Binary Erasure Finite-State Markov Channels. IEEE Trans. Veh. Technol. 2017, 66, 4429–4434. [Google Scholar] [CrossRef]
  53. Abou El Houda, Z.; Brik, B.; Ksentini, A.; Khoukhi, L.; Guizani, M. When Federated Learning Meets Game Theory: A Cooperative Framework to secure IIoT Applications on Edge Computing. IEEE Trans. Ind. Inform. 2022, 18, 7988–7997. [Google Scholar] [CrossRef]
  54. Oubbati, O.S.; Chaib, N.; Lakas, A.; Lorenz, P.; Rachedi, A. UAV-Assisted Supporting Services Connectivity in Urban VANETs. IEEE Trans. Veh. Technol. 2019, 68, 3944–3951. [Google Scholar] [CrossRef]
  55. Wang, K.; Ma, Y.; Mashhadi, M.B.; Foh, C.H.; Tafazolli, R.; Ding, Z. Convergence Acceleration in Wireless Federated Learning: A Stackelberg Game Approach. IEEE Trans. Veh. Technol. 2024, 74, 714–729. [Google Scholar] [CrossRef]
  56. Asad, A.; Fouda, M.M.; Fadlullah, Z.M.; Ibrahem, M.I.; Nasser, N. Moreau Envelopes-Based Personalized Asynchronous Federated Learning: Improving Practicality in Network Edge Intelligence. In Proceedings of the GLOBECOM 2023–2023 IEEE Global Communications Conference, Kuala Lumpur, Malaysia, 4–8 December 2023; pp. 2033–2038. [Google Scholar]
  57. Li, H.; Zhang, J.; Zhao, H.; Ni, Y.; Xiong, J.; Wei, J. Joint Optimization on Trajectory, Computation and Communication Resources in Information Freshness Sensitive MEC System. IEEE Trans. Veh. Technol. 2024, 73, 4162–4177. [Google Scholar] [CrossRef]
  58. Karabulut Kurt, G.; Khoshkholgh, M.G.; Alfattani, S.; Ibrahim, A.; Darwish, T.S.J.; Alam, M.S.; Yanikomeroglu, H.; Yongacoglu, A. A Vision and Framework for the High Altitude Platform Station (HAPS) Networks of the Future. IEEE Commun. Surv. Tutor. 2021, 23, 729–779. [Google Scholar] [CrossRef]
  59. Qin, Y.; Xing, Z.; Li, X.; Zhang, Z.; Zhang, H. Primal-Dual Deep Reinforcement Learning for Periodic Coverage-Assisted UAV Secure Communications. IEEE Trans. Veh. Technol. 2024, 73, 19641–19652. [Google Scholar] [CrossRef]
  60. Xu, Y.H.; Li, J.H.; Zhou, W.; Chen, C. Learning-Empowered Resource Allocation for Air Slicing in UAV-Assisted Cellular V2X Communications. IEEE Syst. J. 2023, 17, 1008–1011. [Google Scholar] [CrossRef]
  61. Yan, M.; Xiong, R.; Wang, Y.; Li, C. Edge Computing Task Offloading Optimization for a UAV-assisted Internet of Vehicles via Deep Reinforcement Learning. IEEE Trans. Veh. Technol. 2024, 73, 1–12. [Google Scholar] [CrossRef]
  62. Mahajan, P.; Palanisamy, B.; Kumar, A.; Chalapathi, G.S.S.; Chamola, V.; Khabbaz, M. Multi-Objective MDP-Based Routing in UAV Networks for Search-Based Operations. IEEE Trans. Veh. Technol. 2024, 73, 13777–13789. [Google Scholar] [CrossRef]
  63. Zhang, L.; Yi, W.; Lin, H.; Peng, J.; Gao, P. An Efficient Reinforcement Learning-Based Cooperative Navigation Algorithm for Multiple UAVs in Complex Environments. IEEE Trans. Ind. Inform. 2024, 20, 12396–12406. [Google Scholar] [CrossRef]
  64. Zhang, S.; Liu, W.; Ansari, N. Completion Time Minimization for Data Collection in a UAV-enabled IoT Network: A Deep Reinforcement Learning Approach. IEEE Trans. Veh. Technol. 2023, 72, 1–10. [Google Scholar] [CrossRef]
  65. Samarakoon, S.; Bennis, M.; Saad, W.; Debbah, M. Distributed Federated Learning for Ultra-Reliable Low-Latency Vehicular Communications. IEEE Trans. Commun. 2020, 68, 1146–1159. [Google Scholar] [CrossRef]
  66. Maale, G.T.; Sun, G.; Kuadey, N.A.E.; Kwantwi, T.; Ou, R.; Liu, G. DeepFESL: Deep Federated Echo State Learning-Based Proactive Content Caching in UAV-Assisted Networks. IEEE Trans. Veh. Technol. 2023, 72, 1–13. [Google Scholar] [CrossRef]
  67. Hao, J.; Naja, R.; Zeghlache, D. Adaptive federated reinforcement learning for critical realtime communications in UAV assisted vehicular networks. Comput. Netw. 2024, 247, 110456. [Google Scholar] [CrossRef]
  68. Li, F.; Zhang, K.; Wang, J.; Li, Y.; Xu, F.; Wang, Y.; Tong, N. Multi-UAV Hierarchical Intelligent Traffic Offloading Network Optimization Based on Deep Federated Learning. IEEE Internet Things J. 2024, 11, 21312–21324. [Google Scholar] [CrossRef]
  69. Yahya, M.; Maghsudi, S.; Stanczak, S. Federated Learning in UAV-Enhanced Networks: Joint Coverage and Convergence Time Optimization. IEEE Trans. Wirel. Commun. 2024, 23, 6077–6092. [Google Scholar] [CrossRef]
  70. Huang, J.; Zhang, M.; Wan, J.; Chen, Y.; Zhang, N. Joint Data Caching and Computation Offloading in UAV-assisted Internet of Vehicles via Federated Deep Reinforcement Learning. IEEE Trans. Veh. Technol. 2024, 73, 17644–17656. [Google Scholar] [CrossRef]
  71. Ren, P.; Wang, J.; Tong, Z.; Chen, J.; Pan, P.; Jiang, C. Federated Learning Via Nonorthogonal Multiple Access for UAV-Assisted Internet of Things. IEEE Internet Things J. 2024, 11, 27994–28006. [Google Scholar] [CrossRef]
  72. Raja, K.; Kottursamy, K.; Ravichandran, V.; Balaganesh, S.; Dev, K.; Nkenyereye, L.; Raja, G. An Efficient 6G Federated Learning-enabled Energy-Efficient Scheme for UAV Deployment. IEEE Trans. Veh. Technol. 2024, 74, 2057–2066. [Google Scholar] [CrossRef]
  73. Li, J.; Liu, A.; Han, G.; Cao, S.; Wang, F.; Wang, X. FedRDR: Federated Reinforcement Distillation-Based Routing Algorithm in UAV-Assisted Networks for Communication Infrastructure Failures. Drones 2024, 8, 49. [Google Scholar] [CrossRef]
  74. Yang, B.; Shi, H.; Xia, X. Federated Imitation Learning for UAV Swarm Coordination in Urban Traffic Monitoring. IEEE Trans. Ind. Inform. 2023, 19, 6037–6046. [Google Scholar] [CrossRef]
  75. Li, X.C.; Song, S.; Li, Y.; Li, B.; Shao, Y.; Yang, Y.; Zhan, D.C. MAP: Model Aggregation and Personalization in Federated Learning With Incomplete Classes. IEEE Trans. Knowl. Data Eng. 2024, 36, 6560–6573. [Google Scholar] [CrossRef]
  76. Ma, X.; Ma, G.; Liu, Y.; Qi, S. APCSMA: Adaptive Personalized Client-Selection and Model-Aggregation Algorithm for Federated Learning in Edge Computing Scenarios. Entropy 2024, 26, 712. [Google Scholar] [CrossRef]
  77. Gupta, A.; Fernando, X. Latency Analysis of Drone-Assisted C-V2X Communications for Basic Safety and Co-Operative Perception Messages. Drones 2024, 8, 600. [Google Scholar] [CrossRef]
  78. Li, Y.; Ma, D.; An, Z.; Wang, Z.; Zhong, Y.; Chen, S.; Feng, C. V2X-Sim: Multi-Agent Collaborative Perception Dataset and Benchmark for Autonomous Driving. IEEE Robot. Autom. Lett. 2022, 7, 10914–10921. [Google Scholar] [CrossRef]
  79. Maeng, S.J.; Ozdemir, O.; Guvenc, I.; Sichitiu, M.L.; Mushi, M.; Dutta, R. LTE I/Q Data Set for UAV Propagation Modeling, Communication, and Navigation Research. IEEE Commun. Mag. 2023, 61, 90–96. [Google Scholar] [CrossRef]
  80. Gupta, A.; Fernando, X. Federated Reinforcement Learning for Collaborative Intelligence in UAV-assisted C-V2X Communications. Drones 2024, 8, 321. [Google Scholar] [CrossRef]
  81. Fernando, X.; Gupta, A. Analysis of Unmanned Aerial Vehicle-Assisted Cellular Vehicle-to-Everything Communication Using Markovian Game in a Federated Learning Environment. Drones 2024, 8, 238. [Google Scholar] [CrossRef]
  82. Fernando, X.; Gupta, A. UAV Trajectory Control and Power Optimization for Low-Latency C-V2X Communications in a Federated Learning Environment. Sensors 2024, 24, 8186. [Google Scholar] [CrossRef] [PubMed]
  83. Plaisted, D. Some Polynomial and Integer Divisibility Problems are NP-Hard. SIAM J. Comput. 1978, 7, 458–464. [Google Scholar] [CrossRef]
  84. Liao, Y.; Wu, Y.; Zhao, S.; Zhang, D. Unmanned Aerial Vehicle Obstacle Avoidance Based Custom Elliptic Domain. Drones 2024, 8, 397. [Google Scholar] [CrossRef]
  85. Xia, Y.; Shao, X.; Ding, T.; Liu, J. Prescribed intelligent elliptical pursuing by UAVs: A reinforcement learning policy. Expert Syst. Appl. 2024, 249, 123547. [Google Scholar] [CrossRef]
  86. Fu, J.; Yao, W.; Sun, G.; Tian, H.; Wu, L. An FTSA Trajectory Elliptical Homotopy for Unmanned Vehicles Path Planning with Multi-Objective Constraints. IEEE Trans. Intell. Veh. 2023, 8, 1–11. [Google Scholar] [CrossRef]
  87. Gupta, A.; Fernando, X. Automatic Modulation Classification for Cognitive Radio Systems using CNN with Probabilistic Attention Mechanism. In Proceedings of the IEEE 95th Vehicular Technology Conference: (VTC2022-Spring), Helsinki, Finland, 19–22 June 2022; pp. 1–6. [Google Scholar]
  88. Gupta, A.; Fernando, X.; Illanko, K. Object Detection for Connected and Autonomous Vehicles using CNN with Attention Mechanism. In Proceedings of the IEEE 95th Vehicular Technology Conference: (VTC2022-Spring), Helsinki, Finland, 19–22 June 2022; pp. 1–6. [Google Scholar]
  89. Chu, Y.W.; Han, D.J.; Brinton, C.G. Only Send What You Need: Learning to Communicate Efficiently in Federated Multilingual Machine Translation. IEEE Trans. Audio, Speech Lang. Process. 2025, 33, 1907–1921. [Google Scholar] [CrossRef]
  90. Liu, Y.; Wang, H.; Wang, Z.; Zhu, X.; Liu, J.; Sun, P.; Tang, R.; Du, J.; Leung, V.C.M.; Song, L. CRCL: Causal Representation Consistency Learning for Anomaly Detection in Surveillance Videos. IEEE Trans. Image Process. 2025, 34, 2351–2366. [Google Scholar] [CrossRef]
  91. Pan, Z.; Yu, X.; Gao, Y. Session-Guided Attention in Continuous Learning With Few Samples. IEEE Trans. Image Process. 2025, 34, 2654–2666. [Google Scholar] [CrossRef]
  92. Amadeo, M.; Campolo, C.; Molinaro, A.; Harri, J.; Rothenberg, C.E.; Vinel, A. Enhancing the 3GPP V2X Architecture with Information-Centric Networking. Future Internet 2019, 11, 199. [Google Scholar] [CrossRef]
  93. Garcia-Roger, D.; Gonzalez, E.E.; Martin-Sacristan, D.; Monserrat, J.F. V2X Support in 3GPP Specifications: From 4G to 5G and Beyond. IEEE Access 2020, 8, 190946–190963. [Google Scholar] [CrossRef]
  94. Papadakis, A.; Mamatas, L.; Petridou, S. Investigating the Latency Dynamics in Vehicular Platooning Networks. In Proceedings of the 2024 IEEE International Mediterranean Conference on Communications and Networking (MeditCom), Madrid, Spain, 8–11 July 2024; pp. 371–376. [Google Scholar]
  95. Khamari, S.; Ahmed, T.; Mosbah, M. Efficient Edge Server Placement under Latency and Load Balancing Constraints for Vehicular Networks. In Proceedings of the GLOBECOM 2022–2022 IEEE Global Communications Conference, Piscataway, NJ, USA, 4–8 December 2022; pp. 4437–4442. [Google Scholar]
  96. Roshdi, M.; Bhadauria, S.; Hassan, K.; Fischer, G. Deep Reinforcement Learning based Congestion Control for V2X Communication. In Proceedings of the 2021 IEEE 32nd Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), Helsinki, Finland, 13–16 September 2021; pp. 1–6. [Google Scholar]
  97. Kumar, A.S.; Zhao, L.; Fernando, X. Multi-Agent Deep Reinforcement Learning-Empowered Channel Allocation in Vehicular Networks. IEEE Trans. Veh. Technol. 2022, 71, 1726–1736. [Google Scholar] [CrossRef]
  98. Kumar, A.S.; Zhao, L.; Fernando, X. Task Offloading and Resource Allocation in Vehicular Networks: A Lyapunov-based Deep Reinforcement Learning Approach. IEEE Trans. Veh. Technol. 2023, 72, 1–14. [Google Scholar] [CrossRef]
  99. Chen, M.; Poor, H.V.; Saad, W.; Cui, S. Convergence Time Optimization for Federated Learning Over Wireless Networks. IEEE Trans. Wirel. Commun. 2021, 20, 2457–2471. [Google Scholar] [CrossRef]
  100. Arani, A.H.; Hu, P.; Zhu, Y. Re-envisioning Space-Air-Ground Integrated Networks: Reinforcement Learning for Link Optimization. In Proceedings of the ICC 2021 - IEEE International Conference on Communications, Virtual, 14–23 June 2021; pp. 1–7. [Google Scholar]
Figure 1. An illustrative scenario where continuous and uninterrupted coverage is provided to vehicles using a UAV as local model hyper-parameters are updated and aggregated. (a) Handoff delay accumulation is minimal when coverage zones are continuous. However, the signal strength is low when the vehicles are distant from the base station. (b) In 6G communication, handoff delay accumulation can be reduced to minimal even in the presence of some no-coverage zones by deploying a UAV as a mobile base station.
Figure 2. System model: The vehicles sense environmental data and process them locally at the in-vehicle edge servers. The resulting local model hyper-parameters are communicated to the UAV. The UAV aggregates various local model hyper-parameters in a TTI and transmits the global model hyper-parameters to the vehicles.
Figure 3. A block diagram representing the PFL-based solution approach for UAV-assisted vehicular communications.
Figure 4. The local model storage and communication from vehicles to UAV. Using PFL, the redundant local model hyper-parameters are discarded to reduce communication overhead and latency.
Figure 5. Variation in percentage of latency violations with number of vehicles (V) in FedAvg and PFL scenarios.
Figure 6. Variation in percentage of latency violations with number of vehicles (V) in FedSGD and PFL scenarios.
Figure 7. Variation in average latency (ms) in FedAvg and PFL scenarios with V for different road lengths ( R L ).
Figure 8. Variation in percentage of latency violations with number of vehicles (V) in PFL and PFL with attention mechanism.
Figure 9. Variation in average latency (ms) in FedSGD and PFL with attention mechanism scenarios with a varying number of vehicles (V) for road length ( R L ) = 2 km.
Figure 10. Variation in average packet delay (ms) in PFL for a varying number of vehicles (V) for different road lengths ( R L ).
Figure 11. Variation in average latency (ms) in FedAvg, FedSGD, PFL, and PFL with attention mechanism scenarios with varying number of vehicles (V) for road length ( R L ) = 2 km.
Figure 12. Variation in UAV’s transmit power (dBm) with number of vehicles (V) for varying vehicle speeds and road lengths ( R L ) for FedAvg and PFL scenarios over model training time slots.
Figure 13. Variation in UAV’s transmit power (dBm) with V for varying vehicle speed and road length ( R L ) for FedSGD and PFL with attention mechanism scenarios over model training time slots.
Figure 14. Variation in UAV’s transmit power (dBm) with V for varying vehicle speed and road length ( R L ) for PFL and PFL with attention mechanism scenarios over model training time slots.
Figure 15. Variation in UAV’s transmit power (dBm) with V for varying vehicle speed and road length ( R L ) for FedAvg, FedSGD, PFL, and PFL with attention mechanism scenarios over model training time slots.
Figure 16. Variation in normalized MSE losses with number of training episodes for PFL and PFL with attention mechanism scenarios. Here, number of vehicles is set to 50.
Figure 17. Variation in normalized MSE losses with number of training episodes for PFL and PFL with attention mechanism scenarios. Here, number of vehicles is set to 100.
Figure 18. Variation in average packet delay (ms) in FedAvg, FedSGD, PFL, and PFL with attention mechanism scenarios for varying number of vehicles (V) for different road lengths ( R L ).
Figure 19. Variation in normalized MSE losses with number of training episodes for varying personalization parameter φ .
Table 1. Definition of symbols used in this paper.
Symbol: Definition
Δ: UAV trajectory coordinates, denoted by [Δ_1, Δ_2, ..., Δ_n]
H: UAV’s altitude in meters (m)
V: Number of vehicles {V_1, ..., V_n}
Δ(x, y): UAV trajectory; a set of sequential coordinates
T: UAV’s flying time
τ: Slot duration for communication between vehicles and UAV
ḡ_uv(t): Downlink channel gain from UAV to a vehicle
h_uv(t)^2: Channel coefficient
σ_uv(t): Large-scale fading component
x_v(t): Location of vehicle v at slot t
ϑ_uv(t): Standard deviation in log-normal shadowing
γ_v(t): SINR at the receiver at time slot t
b_w(t): Communication bandwidth available in a transmission window
p_w(t): UAV’s available power in a TTI
N_2: Power spectral density of noise
C_v(t): Downlink spectral efficiency of vehicle v at time t
κ_u: Rician K-factor
c_q: Bin set capacity
a_Ψ: Packet set transmitted from a vehicle
Ψ_i: Size of each packet comprising PFL model hyper-parameters
ζ_ik: Decision variable denoting whether packet i is assigned to queue k
ϕ_k: Decision variable denoting whether bin k is utilized
D(t): Total delay encountered by a packet
d_v^t: Delay experienced by a packet communicated from vehicle v
ς_{i,j}^{(k)}(t): Binary variable identifying whether a vehicle is scheduled for transmission or in a waiting state
ψ(t): Vector consisting of different packets comprising PFL model hyper-parameters
τ_d: Delay experienced by a vehicle while downloading global model hyper-parameters
τ_u: Delay experienced by a vehicle while uploading local model hyper-parameters
t_i: Time duration of the TTI under consideration
p_{i,j}: Power consumption of the UAV in a TTI
a, b: Major and minor axes of the UAV’s elliptical path
α: Tuning factor to expand or contract the elliptical trajectory
D_n: Dataset with n samples
T_r(x): Trajectory storage vector
A: Set of available actions for the UAV
V(s_t): State value
Q(s_t, a_t): Action value, implying the expected cumulative reward
R(s_t, a_t): Accumulated reward for a state–action pair
π_ϕ^H: Policy learned by the UAV at height H
γ_d: Discount coefficient
φ: Personalization parameter
E_{a_t∼π}: Expectation over state s
γ_T: Global model; collection of sequential data gathered during repeated transmissions
p_θ: Probability of state transitions of the UAV
θ: Set of parameters of the local models
A_t: Attention mechanism to capture dominant past states’ influence on future state transitions
m_ϕ(Ψ_T | γ_T): Variational distribution that approximates the posterior p_θ(Ψ_T | γ_T)
Table 2. Simulation parameters.
Parameter: Value
Vehicle mobility: Manhattan Mobility
Number of vehicles (V): 1–100
Number of PS in UAV: 1
UAV deployment altitude: 100 m–2 km
Elliptical path’s major axis: 300 m–1000 m
Elliptical path’s minor axis: 150 m–550 m
Edge server location: In vehicle
Communication frequency: 5.9 GHz
Modulation technique: OTFS
Distance between vehicles: 10–100 m
Road length (R_L): 1–4 km
Vehicle speed: 0–100 km/h
Mean speed of vehicles: 50 km/h
Standard deviation in vehicle speed: 10 km/h
Packet size for gross data offloading: 1 byte–3 megabytes
Packet size of FL models: 1 byte–10 megabytes
Datasets used: V2X-Sim, LTE I/Q
Inter-arrival time for packets at UAV: 100 ms–1000 ms
OTFS base station transmit power: 40 dBm (10 W)
UAV transmission power: 20 dBm (100 mW)
UAV receiving threshold: −80 dBm
Vehicle transmission power: 25 dBm (316.2 mW)
Noise power: −50 dBm (10^−8 W)
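As a quick check of the power entries above, the dBm values can be converted to watts with the standard relation P[W] = 10^(P[dBm]/10) / 1000; the short helper below reproduces the bracketed wattages listed in Table 2.

```python
def dbm_to_watts(p_dbm):
    """Convert transmit power from dBm to watts: P[W] = 10**(p_dbm / 10) / 1000."""
    return 10 ** (p_dbm / 10) / 1000.0

# Cross-check against Table 2
for label, p in [("OTFS base station", 40), ("UAV", 20), ("Vehicle", 25), ("Noise", -50)]:
    print(f"{label}: {p} dBm = {dbm_to_watts(p):.4g} W")
# 40 dBm = 10 W, 20 dBm = 0.1 W, 25 dBm = 0.3162 W, -50 dBm = 1e-08 W
```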
Table 3. Performance comparison of the proposed PFL approach with recent applications of different deep learning and federated learning methods in vehicular communications.
Reference | Proposed Method | Objectives | Cost Function | Reported Results
[96]
  • Alternative scheduling mechanisms
  • Low packet drop ratio
  • Packet retransmission
  • Percentage of dropped messages
  • Network latency
  • Achieved lower network latency
[97,98]
  • Proposed deep Q-networks
  • Used actor–critic mechanism
  • Enhanced agent mobility
  • Improved decision making on centralized server
  • System’s energy consumption
  • Model complexity
  • Task completion time
  • Improved system performance
[99]
  • FL for network performance optimization
  • Lowering model convergence time for FL
  • Lowering model training loss for FL
  • Optimal user selection for task association
  • Joint learning for task association
  • Lowered FL model convergence time by 56%
  • Improvement in FL model accuracy by 3%
[65]
  • Power allocation
  • Resource allocation
  • Lower latency
  • Less power consumption
  • Less queuing delay
  • Round-trip delay
  • Lesser data exchange by 79%
  • Smaller queue length by 60%
  • Lower power consumption
[100]
  • Satisfaction-based multi-armed bandit approach
  • Optimal UAV trajectories
  • Efficient resource allocation
  • Transmit power
  • Received signal strength
  • Achieved improvement in throughput
  • Lower system outage probability
[80]
  • Q-learning and FRL
  • Minimize local model hyper-parameters
  • Minimize communication rounds
  • Number of communication rounds
  • FRL model convergence time
  • Reduced communication rounds
  • Quicker model convergence for FRL
Our Work
  • PFL
  • PFL with attention mechanism
  • Minimize average latency and communication overhead
  • Minimize UAV’s power consumption
  • Average latency (ms)
  • UAV’s power consumption (dBm)
  • Reduced average latency and communication overhead
  • Reduced UAV’s power consumption
  • UAV’s optimal trajectory prediction
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
