1. Introduction
Unmanned aerial vehicles (UAVs), also commonly known as drones, have attracted significant attention in recent years for various applications, such as disaster rescue, aerial inspection, and Internet of Things (IoT). In wireless IoT networks, UAVs are frequently deployed as mobile base stations, wireless relays, or data collectors to improve communication reliability and ensure timely coverage [
1,
2]. UAV-assisted IoT networks offer superior environmental adaptability and communication quality compared to terrestrial networks, making them a promising solution to support the rapidly growing and highly dynamic wireless data traffic in future 6G networks.
Recently, UAV-assisted IoT networks have been widely adopted in various time-sensitive scenarios [
3,
4,
5], in which data generated by IoT devices often have strict freshness requirements to avoid errors or catastrophic consequences caused by outdated information. The freshness of the received information is thus a critical concern in time-sensitive systems. To quantify the freshness of information, a new performance metric named Age of Information (AoI) was introduced in [
6]. AoI represents the time elapsed since the latest update was generated [
7], which captures the freshness of information from the destination’s perspective.
Due to the increasing popularity of UAVs in IoT systems and the significance of AoI in time-sensitive systems, AoI-oriented UAV-assisted IoT network designs have been extensively studied. For example, in [
8], the authors aimed to minimize both the maximum AoI (Max-AoI) and average AoI (Avg-AoI) of all sensor nodes (SNs) by optimizing the UAV’s trajectory under the ideal fly–hover–collect strategy. In [
9], the authors divided all SNs into non-overlapping clusters and assigned each cluster a data collection point (DCP), then optimized the UAV’s trajectory and DCP-IoT association to minimize the Max-AoI and Avg-AoI. In [
10], the authors derived the peak AoI (PAoI) and Avg-AoI for UAV-assisted data collection and dissemination on a graph, then minimized both by optimizing the UAV’s random flight trajectory. In [
11], the authors proposed a UAV-aided ground nodes (GNs) localization and communication framework that jointly optimizes localization accuracy, beamwidth of GNs, bandwidth, and the UAV’s trajectory to minimize the Avg-AoI. Inspired by advances in artificial intelligence, the authors of [
12] utilized an end-to-end framework that integrates a clustering module and an Enhanced Pointer Neural Network (EPNet) to minimize the Max-AoI and Avg-AoI. In [
13], the authors jointly optimized the selection of hovering positions and their visiting sequence using a transformer and the weighted
algorithm to minimize the total AoI of the collected data. Collectively, these studies demonstrate that AoI-oriented network designs ensure information freshness in time-sensitive systems, in contrast to traditional energy efficiency-oriented and throughput-oriented network designs [
14,
15,
16,
17,
18].
However, the above research predominantly addresses single-UAV trajectory optimization to improve AoI performance. Given the size, weight, and power (SWAP) limitations of UAVs [
2], a single UAV is insufficient to perform data collection missions for large-scale IoT networks within stringent time requirements. This limitation has spurred interest in AoI-oriented multi-UAV collaborative data collection for IoT networks. In [
19], the authors proposed an energy-constrained multi-UAV cooperative data collection framework for time-sensitive dense IoT networks to minimize the Max-AoI. In [
20], the authors aimed to minimize the PAoI and Avg-AoI of all SNs by optimizing the number of DCPs, the SN-DCP and DCP-UAV associations, and the trajectories of UAVs, subject to the endurance of the UAV. The heuristic algorithms were used to find the optimal trajectory in [
19,
20], whereas in [
21,
22], deep reinforcement learning-based (DRL) algorithms were introduced to solve the trajectory-planning problem. In [
21], the authors proposed a QMIX-based algorithm to jointly optimize UAV trajectories and UAV-SN scheduling to minimize the expected total AoI over time. In [
22], the authors proposed a Deep Q-network (DQN)-based algorithm to jointly minimize the weighted Avg-AoI and transmission power of the IoT devices.
Despite these advancements, prior studies often neglect energy constraints or assume ample energy availability in IoT devices. In fact, most IoT devices are powered by batteries with limited capacities, which requires periodic recharging or replacement. Manual intervention for battery maintenance is highly inefficient or even impractical, especially in remote areas such as deserts or dense forests. Motivated by the success of integrating radio frequency (RF)-based Wireless Power Transfer (WPT) into UAV-assisted IoT networks [
23,
24,
25,
26], the research focus has progressively shifted to AoI-oriented UAV-assisted wireless-powered IoT network designs. For example, in [
27], the authors adopted dynamic programming (DP) and ant colony (AC) algorithms to jointly optimize UAVs’ trajectory, energy harvesting (EH) time, and data collection time with the objective of minimizing the Avg-AoI of all SNs. In [
28], the authors formulated the Avg-AoI minimization problem as a Markov problem with large state spaces and proposed a DQN-based algorithm to jointly optimize the UAV’s trajectory, information transmission, and EH scheduling of the SNs. In [
29], the authors derived a lower bound on covertness constraint based on the Kullback–Leibler divergence and jointly optimized the WPT duration, the data transmission duration, and the WPT transmission power to minimize the average covert AoI. For multi-UAV-assisted wireless-powered IoT networks, the authors of [
30] proposed a DRL-based algorithm under time-varying channel conditions to minimize the sum of AoI of all the IoT devices. In [
31], the authors aimed to minimize the Avg-AoI by jointly optimizing the selection of hovering positions, hovering time, and flight paths of UAVs, and relay pairing relationships among UAVs. In [
32], the authors jointly optimized the number and selection of hovering positions, the UAV-IoT association, the flight speed and trajectories of UAVs, the computing resources, and the EH and data transmission time of IoT devices to minimize the Avg-AoI.
AoI-oriented multi-UAV-assisted energy transfer and data collection have emerged as a promising solution, providing sustainable energy support for IoT devices, addressing the energy storage limitations of each UAV, and ensuring data freshness [
31,
32]. However, the large-scale deployment of IoT devices and the cooperative operation of multiple UAVs present several challenges in multi-UAV-assisted wireless-powered IoT networks: (i) how to optimize the trajectories of multi-UAVs to cooperatively cover all IoT devices with the fewest DCPs, reduce the UAVs’ energy consumption, and minimize the Avg-AoI; (ii) the determination and assignment of DCP tasks to each UAV are complicated by factors such as the energy capacity of each UAV, the distribution of IoT devices, and the number of UAVs and IoT devices; (iii) in order to obtain more accurate measurements of Avg-AoI and energy consumption, it is necessary to consider models that closely align with real-world systems; (iv) how to design an algorithm that balances accuracy and efficiency for the Avg-AoI minimization problem, particularly in large-scale scenarios.
To address the above challenges, in this paper, we delve into the problem of AoI-oriented multi-UAV cooperative energy transfer and data collection, in which multiple energy-limited rotary-wing UAVs are dispatched to collect update packets from all IoT devices. Different from [
27,
30], we account for the number and distribution of IoT devices, as well as the number and energy capacities of UAVs. Our research aims to minimize the Avg-AoI of all IoT devices and reduce UAV energy consumption. The main contributions are summarized as follows.
We formulate the multi-UAV-assisted energy transfer and data collection problem to minimize the Avg-AoI in wireless-powered IoT networks by jointly optimizing the IoT devices’ uploading sequence, the locations of DCPs, the DCP-IoT associations, task assignment, the EH time, the data collection time, flight speed, and UAVs’ trajectories.
To tackle this NP-hard optimization problem, we decouple the original problem into three subproblems through theoretical derivation: hovering time allocation, DCP-IoT association, and multi-UAV cooperative task assignment and trajectory optimization.
The first subproblem, proven to be convex, is solved using the Karush–Kuhn–Tucker (KKT) conditions. The second subproblem, fundamentally a clustering problem, is addressed with an optimal hovering time allocation-based affinity propagation (OHTAP) clustering algorithm. The final subproblem is modeled as a stage-weighted multiple-traveling-salesmen problem (MTSP) and solved using an improved partheno-genetic algorithm with enhancement mechanisms (EIPGA) or a hybrid algorithm that combines improved MinMax k-means (IMinMax k-means) clustering with EIPGA.
Our proposed scheme significantly outperforms benchmark schemes in terms of Avg-AoI optimization and solution stability, as evidenced by extensive experiments. Furthermore, we analyze the impacts of various factors on Avg-AoI, including the number of IoT devices and UAVs, the selection of EH models, and the energy storage, flight speed, and acceleration of UAVs.
The remainder of this paper is organized as follows.
Section 2 describes the system model for multi-UAV-assisted wireless-powered IoT networks.
Section 3 formulates the problem of Avg-AoI-minimal energy transfer and data collection.
Section 4 details the implementation steps of our iterative three-step scheme.
Section 5 presents simulation experiments to validate the effectiveness of the proposed scheme. Finally, we briefly conclude the paper in
Section 6.
2. System Model
2.1. Network Description
We consider a multi-UAV-assisted wireless-powered IoT network that consists of one data center (DC)
,
M rotary-wing UAVs equipped with half-duplex hybrid access points (HAPs), denoted by
, and
K ground IoT devices, denoted by
. The IoT devices are assumed to be remotely distributed over a wide area to perform environmental sensing tasks. Without loss of generality, we adopt a three-dimensional (3D) Cartesian coordinate system, where the horizontal locations of the DC
and each IoT device
are fixed and represented by the vector
. Throughout the mission, all UAVs execute the fly–hover–collect strategy [
27] and the stop–wait strategy [
7] to ensure stable transmissions and collision avoidance. The acceleration of each UAV for switching between hovering and flying states is assumed to be constant, denoted by
. Additionally, the set of data collection points (DCPs) at which the UAVs hover to transfer energy and collect data is denoted by
. The location of each DCP
is given by
, where
H is a constant that specifies the vertical flight altitude of UAVs relative to the DC and IoT devices. Overall,
Figure 1 depicts the process of multi-UAV-assisted energy transfer and data collection.
Through the DCPs, the connections between UAVs and IoT devices can be indirectly established. For clarity, we use binary indicators
and
to represent association between UAV
and DCP
, and between DCP
and IoT device
, respectively. If UAV
hovers at DCP
to service IoT devices, then
; otherwise,
. Similarly, if DCP
is associated with IoT device
, then
; otherwise,
. Accordingly, the set of DCPs visited by UAV
and the set of IoT devices associated with DCP
are denoted by
and
, respectively. It is assumed that each DCP is visited by only one UAV and each IoT device is associated with only one DCP, resulting in the following constraints:
Additionally, the total number of IoT devices visited by the UAVs through all the DCPs must satisfy
When the UAVs hover at the DCPs, they can establish communication links with associated IoT devices using various multiple access schemes. In this paper, we assume the Time Division Multiple Access (TDMA) scheme, whereby IoT devices associated with each DCP sequentially upload data to the UAVs. If is the j-th DCP in the trajectory of UAV , then , , and are relabeled as , , and , respectively. Similarly, if is the i-th IoT device that uploads data to UAV hovering at DCP , then , , , and are relabeled as , , , and , respectively. Thus, given and , the trajectory of UAV can be represented by a permutation , where is the cardinality of the set .
2.2. Channel Modeling
The communication links between UAVs and IoT devices mainly depend on the propagation environment, which can be either line-of-sight (LoS) or non-line-of-sight (NLoS). According to [
33], the path loss for both LoS and NLoS links between IoT device
and UAV
hovering at DCP
can be expressed in decibels (dB) as follows:
where
denotes the distance between IoT device
and UAV
hovering at DCP
,
is the path loss exponent,
is the carrier frequency,
c is the speed of light, and
and
are the excessive path losses for LoS and NLoS links, respectively. The probability of an LoS link between IoT device
and UAV
hovering at DCP
is given by [
34]
where
a and
b are constants determined by environmental conditions and the carrier frequency, respectively.
is the elevation angle between IoT device
and UAV
hovering at DCP
. Given
, the corresponding probability of an NLoS link is
. Based on Equations (4)–(6), the average path loss for the UAV-IoT communication can be expressed as
Accordingly, the average channel gain is
. When UAV
returns to DC
, we assume that it offloads the data at a constant power
immediately. The data offloading rate
and data offloading time
can be calculated by
where
B and
are the channel bandwidth and the noise power at the UAV receiver, respectively.
is the average channel gain when UAVs hover over
.
represents the total number of IoT devices serviced by UAV
.
2.3. Energy Harvesting Model and Data Collection Model
The IoT devices utilize the harvested energy to transmit data to the UAVs. To account for various non-linear elements in practical RF-based EH circuits, such as diodes, diode-connected transistors, and circuit saturation characteristics, we introduce a non-linear EH model [
35,
36] based on actual EH circuit measurement data. The power harvested at IoT device
can be described as
where
represents the maximum output direct current power when the EH circuit reaches saturation, and
denotes the fixed transmission power of the UAVs. The constants
C and
D are related to specific circuit characteristics, with values dependent on resistance, capacitance, and circuit sensitivity. Thus, the energy harvested at IoT device
during the EH time interval
is given by
In the data collection stage, we consider only the dominant transmission power of IoT devices and neglect their circuit power, as described in [
14]. Additionally, it is assumed that all UAVs operate on orthogonal frequency channels, allowing multiple UAVs to transfer energy or collect data simultaneously without interference. The data uploading rate between IoT device
and UAV
hovering at DCP
during the data collection period
can be calculated by
where
is allocated uniformly over
. Let
represent the updated perceptual data volume at IoT device
within the
. To ensure the data uploading requirement of each IoT device
, we must satisfy
where
.
2.4. Energy Consumption Model of UAVs
In general, the total energy consumption of a UAV mainly consists of communication energy consumption and propulsion energy consumption. The communication energy consumption of a UAV mainly occurs during the process of communication circuitry operations, signal processing, signal radiation/reception, etc. [
7]. Assume that the communication-related power of UAVs remains constant
, and the communication energy consumption of UAV
can be calculated by
Throughout the mission, we disregard energy consumption during the take-off and landing stages. As derived in [
37], we adopt a power consumption model (in watts) to represent the UAV’s propulsion power consumption, which is approximated by
where
and
are two constants representing the profile power and induced power of UAV in hovering state,
is the tip speed of the rotor blade,
denotes the mean rotor-induced speed in hovering, and
,
,
s, and
A are the fuselage drag ratio, the air density, rotor solidity, and rotor disc area, respectively. More detailed information about the parameters and the power consumption model can be found in [
37]. Then, the total propulsion energy consumption of UAV
can be calculated by
where
and
denote the acceleration time of UAV
and the time that UAV
flies at the speed
V, respectively.
represents linear acceleration or deceleration, and
is the initial speed. Since acceleration and deceleration in this paper are symmetric, the energy consumed during deceleration is equal to that during acceleration. On the above basis, the total energy consumption of UAV
is
.
4. Methodology
4.1. Solution Architecture
As analyzed above, the Avg-AoI of all IoT devices is represented as the weighted sum of the EH time, the data collection time, the flight time, and the data offloading time. Additionally, the EH time and the data collection time for each IoT device is independent of the UAVs’ trajectories throughout the energy transfer and data collection mission. Therefore, we propose a novel DCP association and trajectory-planning scheme that utilizes an iterative three-step process to address the Avg-AoI-optimal multi-UAV-assisted energy transfer and data collection problem
. As illustrated in
Figure 3, the scheme consists of two main modules: (i)
DCP-IoT association and optimization, intended to minimize the EH and data collection time of IoT devices, and (ii)
task assignment and trajectory optimization, designed to reduce the UAVs’ flight time. The DCP-IoT association and optimization module is further divided into two submodules: hovering time allocation and DCP selection.
In a nutshell, the coordinates of all IoT devices are first sent to the hovering time allocation module, which determines the optimal EH and data collection time allocation for one UAV hovering at DCP to service IoT device . Then, the DCP selection module determines the set of DCPs and the binary DCP-IoT association matrix . Meanwhile, the optimal EH and data collection time for IoT devices and the data uploading order can be obtained. After that, the task assignment and trajectory optimization module finds the set of DCPs visited by UAV and the energy-constrained optimal trajectories to achieve the optimal flight time and data offloading time. Lastly, the minimum Avg-AoI of all IoT devices is calculated based on the data collection time, the EH time, the flight time, and the data offloading time.
4.2. DCP-IoT Association and Optimization
For the DCP-IoT association and optimization problem, the goal is to minimize the total hovering time of UAVs and the number of DCPs by jointly optimizing the DCP locations, the DCP-IoT association, and the minimum EH and data collection time for each IoT device.
where
denotes the index set of the neighboring nodes of IoT device
within a coverage radius
r, and
. The parameter
represents the ability of an IoT device to act as a cluster centroid, which is a non-negative weight factor. When
, each IoT device serves as its own DCP. Noticeably, an appropriate
is helpful to balance the hovering time and flight time of the UAVs, thereby further reducing the Avg-AoI.
Clearly, the optimization variables
for the DCP-IoT association are binary; thus, integer constraints are introduced in constraints (22a)–(22c). Furthermore, even with a fixed binary DCP-IoT association matrix
, the variables
in the objective function depend on the solutions for optimal hovering time allocation, which must satisfy the energy and data causality constraints. As a result, problem
is hard to solve directly. To improve the tractability of problem
, we decompose it into two subproblems: the hovering time allocation subproblem
and the DCP selection subproblem
. For problem
, the goal is to achieve the optimal EH time and the data collection time allocation
for each pair of DCP
and IoT device
, while ensuring that each IoT device
can upload the required amount of data using the harvested energy.
With any given optimal hovering time allocation solution
, the goal of problem
is to minimize both the total hovering time of UAVs and the number of DCPs by determining the optimal DCP positions and establishing DCP-IoT associations.
Based on the obtained DCP set and DCP-IoT association, the optimal EH and data collection time for each IoT device can be derived.
4.2.1. Optimization of Hovering Time Allocation
When one UAV hovers at DCP to service IoT device , the minimum EH time and data collection time depend solely on the data amount generated by IoT device . Additionally, constraints (23a) and (23b) are independent for different IoT devices. Consequently, problem can be viewed as independent minimization problems, each aiming to minimize the sum of EH time and data collection time for an individual IoT device.
Lemma 1. Problem is a convex optimization problem.
Proof. First, the objective function of
is an affine function with respect to
. Second, constraint (23a) is evidently convex. Finally, for constraint (23b), given
,
and
, the terms
,
and
can be viewed as constants. By calculating the Hessian matrix [
39] of the function
, it can be known that
is a convex function. Consequently, constraint (23b) is also convex. Based on the definition of the convex optimization problem [
39], Lemma 1 is proved. □
Suppose that the optimal solution to
is
. Through contradiction, we can easily derive that in the optimal solution to
, the constraints (23b) for all IoT devices should be active. In other words, the optimal EH time and data collection time must satisfy
Otherwise, we can always adjust
to satisfy the equality without decreasing the objective value.
According to Lemma 1 and Equation (25), we adopt the Lagrange multiplier method [
39] to solve problem
. The Lagrangian function of
is
where
is the non-negative Lagrangian dual variable associated with the constraint (23b).
Applying KKT conditions [
39] and Equation (25), we can obtain the system of equations
By solving the system of Equation (27), the optimal hovering time allocation
for one UAV hovering at DCP
to service IoT device
can be obtained by
where
is the Lambert function,
.
4.2.2. Optimization of DCP Selection
Problem is essentially a clustering process that aims to classify all IoT devices into L exclusive clusters and determine the clustering centroid. To reduce the search space for the optimal DCP locations, we propose an optimal hovering time allocation-based affinity propagation (OHTAP) clustering algorithm to determine the optimal DCP locations and establish appropriate DCP-IoT associations.
The OHTAP algorithm mainly relies on a “message passing” mechanism. It utilizes the similarity between pairs of IoT devices to partition
K IoT devices into
L exclusive clusters and subsequently designates some IoT devices as DCPs at which UAVs hover. Formally, let
represent the similarity matrix. The similarity
between IoT devices
and
is defined as the hovering time that one UAV hovers over candidate IoT device
to service IoT device
, which can be expressed as
In the OHTAP clustering process, two types of bi-directional messages, namely responsibility and availability, are passed and iterated between adjacent IoT devices to produce clustering results. Let
and
represent the responsibility matrix and availability matrix, respectively. The iterative messages
and
are updated according to the following rules until convergence:
Here,
denotes the suitability of IoT device
to serve as a cluster centroid for IoT device
, whereas
denotes the suitability of IoT device
to select
as its cluster centroid. A damping factor
is introduced at each iteration to mitigate numerical oscillations during the message update process.
When convergence is reached, the DCP-IoT association matrix
can be obtained by
where
denotes the indices of the initial cluster centroids. Subsequently, two DCP candidate sets
and
along with their respective positions
and
can be determined by solving the mean problem and the 1-center problem separately for each cluster of IoT devices:
Based on the DCP candidate sets, the optimal set of DCPs
and the corresponding positions
can be obtained by calculating and comparing the hovering time. The procedure is detailed in Algorithm 1.
Algorithm 1 The OHTAP clustering algorithm with parameter |
Input: The system parameters for all IoT devices and UAVs, the optimal hovering time allocation that one UAV hovers at DCP to service IoT device , and a sufficiently large constant .
- 1:
Calculate optimal hovering time allocation that one UAV hovers at DCP to service IoT device according to Equation (28); - 2:
Initialize the parameter: , and ; - 3:
while do - 4:
Update messages by Equations (30) and (31); - 5:
if converge then - 6:
break - 7:
end if - 8:
; - 9:
end while - 10:
Obtain the DCP-IoT association matrix by Equation (32); - 11:
Find two DCP candidate sets by Equation (33), and then calculate and compare the hovering time to determine the set of DCPs and the corresponding positions ; Output: The set of DCP , the location of each DCP , and the DCP-IoT matrix .
|
4.3. Task Assignment and UAV Trajectory Optimization
For the task assignment and trajectory-planning problem, the goal is to find the optimal UAV-DCP association and the age-optimal trajectories of energy-constrained UAVs, using the given optimal solution
to problem
Problem
can essentially be viewed as a stage-weighted MTSP process, which is known to be NP-hard. As discussed in [
8,
9], the optimal Avg-AoI trajectory of the UAV is a stage-weighted shortest Hamiltonian path when no energy constraint is imposed. This trajectory can be found using a genetic algorithm (GA) [
40] or other appropriate methods.
Let and represent the age-optimal trajectory of UAV without energy constraints and its corresponding traveling salesman problem (TSP) trajectory, respectively. It is well established that the TSP trajectory of UAV is the shortest and consumes the least energy at a given MR speed. Given that , we analyze the energy-constrained age-optimal trajectory under the following three cases. Case 1: When , the age-optimal solution is . Case 2: If the UAV’s energy capacity satisfies , an age-optimal trajectory for each UAV can be found using intelligent search methods, such as GA. Case 3: If the UAV’s energy capacity satisfies , no feasible trajectories exist for all UAVs under the given energy constraints. In this scenario, a UAV-expansion strategy is executed, i.e., one more UAV is added, and then the processes of task assignment and trajectory planning are repeated.
The key to solving problem lies in determining . To achieve this, we develop two algorithms. One is an improved partheno-genetic algorithm with enhancement mechanisms (EIPGA), which incorporates a “mutation-before-selection” mechanism and a hybrid selection operator to enhance global optimization capabilities. The other is a hybrid algorithm that combines IMinMax k-means clustering with EIPGA, where the former achieves an equilibrium-based task assignment, and the latter optimizes the trajectory for each UAV.
4.3.1. EIPGA-Based Algorithm
The improved partheno-genetic algorithm (IPGA) [
41] replaces the crossover operation in traditional GAs with multiple mutation operations, making it well suited for solving the TSP and other combinatorial optimization problems [
41,
42]. Although the IPGA offers advantages such as simple structure, high efficiency, and fast convergence, it often neglects valuable information from slightly inferior individuals, leading to limited global search capability. To overcome this limitation, we propose an improved partheno-genetic algorithm with enhancement mechanisms (EIPGA), which incorporates the concepts of “mutation-before-selection” and a hybrid selection operator. For clarity, the proposed EIPGA-based algorithm is summarized in Algorithm 2, with key details outlined below:
Algorithm 2 EIPGA-based task assignment and trajectory optimization algorithm |
Input: The system parameters for all IoT devices and UAVs, the DCP set , the DCP-IoT association matrix , the optimal hovering time allocation , and the EIPGA related parameters .
- 1:
Generate an initial population of chromosomes, and set ; - 2:
while do - 3:
Generate offspring chromosomes of chromosomes according to mutation operators and breakpoint update operator; - 4:
Calculate the Avg-AoI of chromosomes according to Equation (19); - 5:
Use a hybrid selection operator on chromosomes to obtain a new population of chromosomes and calculate relevant ; - 6:
Add n by one: ; - 7:
end while - 8:
Find the optimal task assignment and UAV trajectories . Output: The UAV-DCP association matrix , the optimal flight trajectories of UAVs .
|
Chromosome structure: The Path-breakpoint sequence encoding method [
41] is adopted to represent the chromosome structure of the solution. It ensures that except for the DC, each DCP is visited by one UAV only once. As shown in
Figure 4, the first part of the chromosome encodes a permutation of integers from 1 to
L, indicating the
L DCPs visited by all UAVs. The second part of the chromosome delineates the breakpoints that divide the DCP permutation into
M segments. Each segment corresponds to the path assigned to a particular UAV.
Mutation operators: The mutation operator randomly selects individuals and modifies specific entries within their chromosomes. As illustrated in
Figure 5, the algorithm implements four types of mutation operations: Flip, Swap, Left Slide (LSlide), and Right Slide (RSlide). A breakpoint update operator is also included to adjust the partition points within the chromosome. This operator distinguishes the task set and trajectory for each UAV, facilitating equitable distribution of workloads among UAVs.
Hybrid selection operator: The selection operation identifies individuals with higher fitness based on the principle of “survival of the fittest”. To preserve genetic diversity and prevent high-fitness individuals from dominating the population, a hybrid selection operator that combines elite selection, roulette selection, and original population generation is introduced. During the i-th population update, the new population is generated by sequentially selecting , and individuals using elite selection, roulette selection, and original population generation operators, respectively.
4.3.2. Hybrid Algorithm
In this part, we propose a hybrid algorithm that solves the task assignment and trajectory-planning problem by combining IMinMax
k-means clustering and EIPGA, as summarized in Algorithm 3. The IMinMax
k-means clustering [
43] minimizes the maximum intra-cluster variance, rather than the sum of intra-cluster variances in traditional
k-means clustering [
44]. This approach ensures an equilibrium-based task assignment among the UAVs, thereby reducing the Avg-AoI. Below are the detailed steps of the IMinMax
k-means clustering algorithm.
Step 1: Initialize the locations of cluster centroids for each using random equilibrium space partition with , and set , and .
Step 2: Construct a weighted formulation
of the sum of the intra-cluster variances to mimic the behavior of the maximum variance criterion
where
is a weight factor that controls cluster variances, and
regulates the sensitivity of weight updates to relative differences in cluster variances. The higher the values of
p, the larger the relative differences in variances among clusters, and vice versa.
Algorithm 3 Hybrid algorithm for UAV task assignment and trajectory optimization |
Input: The system parameters for all IoT devices and UAVs, the DCP set , the DCP-IoT association matrix , the optimal hovering time allocation , the EIPGA related parameters , and the clustering parameters .
- 1:
Initialize the locations of cluster centroids for every based on random equilibrium space partition with , and set ; - 2:
Construct a weighted formulation of the sum of the intra-cluster variances according to Equation (35); - 3:
while or do - 4:
for all
, to M, to L do - 5:
Update the cluster assignments using Equation (36); - 6:
end for - 7:
if empty or singleton clusters have emerged then - 8:
Reduce p by : ; - 9:
if then - 10:
return NULL; - 11:
end if - 12:
; - 13:
end if - 14:
for all , to M do - 15:
Update the cluster centroids according to Equation (37); - 16:
end for - 17:
if then - 18:
; - 19:
end if - 20:
for all , to M do - 21:
Update the weights based on Equations (37) and (38); - 22:
end for - 23:
Add t by one: ; - 24:
end while - 25:
Obtain the UAV-DCP association matrix , and find trajectories of UAVs using EIPGA. Output: The UAV-DCP association matrix , the optimal flight trajectories of UAVs .
|
Step 3: Update the cluster assignments
for each DCP
to the nearest cluster centroid
. The cluster assignments
at
t-th iteration are given by
Step 4: Verify the cluster results. If an empty or singleton cluster appears, reduce
p by
and revert to the previous assignments and weights. The cluster centroid
at the
t-th iteration is updated as
Step 5: Update the weights. If
, save the cluster assignments and the weights
. Also, update
. Afterward, incorporate the weight constraints
and
into
using the Lagrange multiplier and set the derivatives with respect to
to zero. Consequently, the closed-form solution of
is determined by
where
is the cluster variance. As
for
, larger cluster variances
lead to higher weights
. To improve the stability of the algorithm, a memory factor
is introduced into the weights, i.e.,
.
Step 6: Check the convergence. If or , terminate the algorithm. Otherwise, update and return to Step 3. Here, is the predetermined maximum number of iterations, and is a tiny convergence threshold.
After completing the task assignment, a relative equilibrium in task assignment is achieved. The EIPGA-based algorithm is then used to find the age-optimal trajectory for each UAV.
4.4. Impact of the AoI Performance
In our proposed scheme, the OHTAP clustering algorithm is designed to determine the locations of DCPs and establish the DCP-IoT association. The optimization objective of this algorithm is related to the EH time and data collection time of IoT devices. The EH time and data collection time are parts of the IoT devices’ Avg-AoI. Consequently, the Avg-AoI is influenced by the results of the OHTAP clustering algorithm. Based on the results of the OHTAP clustering algorithm, we perform EIPGA-based task assignment and use the trajectory-planning algorithm to establish the UAV-DCP association and find age-optimal trajectories through all the DCPs. The EIPGA algorithm incorporates a “mutation-before-selection” mechanism and a hybrid selection operator to enhance global optimization capabilities. Through iteration and recursion, it determines the DCP sequence and age-optimal trajectory for each UAV. However, as the number of IoT devices K increases, the computational efficiency and precision of the algorithm may decrease. To address this issue, we develop a hybrid-based task assignment and trajectory-planning algorithm that combines the task assignment capability of IMinMax k-means clustering with the trajectory optimization capability of EIPGA. The clustering component ensures equilibrium-based task assignment among UAVs, while the EIPGA component optimizes trajectories, resulting in a reduced Avg-AoI and improved computational efficiency.
4.5. Complexity Analysis
To sum up, the complete solution scheme for problem is outlined in Algorithm 4. It consists of three main steps: (1) determine the optimal hovering time allocation using KKT conditions; (2) run Algorithm 1 with parameter to determine DCP sets and establish the DCP-IoT association; (3) apply Algorithm 2 or Algorithm 3 to find the age-optimal trajectories through all the DCPs. The last two steps are performed alternately until the convergence condition is met.
We analyze the complexity of Algorithm 1. According to [
45], the messages
and
are calculated in
time per iteration, where
K is the number of IoT devices. The algorithm terminates after at most
iterations, resulting in a total time complexity of
. The space complexity is
, since the memory used to store the messages
and
is reused.
Algorithm 4 The age-optimal multi-UAV-assisted energy transfer and data collection algorithm |
Input: The system parameters for all IoT devices and UAVs, a small constant , and a sufficiently large constant .
- 1:
Initialize the parameter: ; - 2:
while do - 3:
Run Algorithm 1 with parameter to obtain the DCP-IoT association matrix , the set of DCPs , and the location of each DCP ; - 4:
Calculate the optimal EH and data collection time for IoT devices by Equation (28), and determine data uploading order based on Equation (20); - 5:
Generate initial waypoints set of UAVs ; - 6:
Run Algorithm 2 or Algorithm 3 to obtain UAV-DCP association matrix and UAVs’ trajectories , and then determine the energy consumption ; - 7:
if then - 8:
The Aol-optimal solution ; - 9:
else if - 10:
Find a trajectory for each UAV by EIPGA; - 11:
else if - 12:
No feasible solution exists and the processes of task assignment and trajectory planning are repeated with ; - 13:
end if - 14:
Calculate the Avg-AoI using Equation (19). - 15:
Update the parameter ; - 16:
end while - 17:
Find the minimum Avg-AoI with optimal parameter ; Output: The set of DCP positions , the binary DCP-IoT and UAV-DCP association matrix , the hovering time allocation , the flight trajectories of UAVs , and the minimum Avg-AoI.
|
We also examine the time and space complexity of Algorithm 2, which finds near age-optimal trajectories for Avg-AoI through recursion and iteration. The time complexity is approximately , and the space complexity is , where and are the maximum number of generations and the number of offspring produced per iteration, respectively.
For Algorithm 3, the complexity analysis follows. The IMinMax k-means clustering algorithm has time and space complexities of approximately and , respectively. When combined with the complexity analysis of Algorithm 2, the total time and space complexities of Algorithm 3 are and , where and are the maximal generation and the number of offspring produced per iteration, respectively.
Therefore, the complexity of Algorithm 4 is obtained. The time and space complexities of the OHTAP-based EIPGA scheme are and , respectively. Similarly, the time and space complexities of the OHTAP-based hybrid scheme are and , respectively.
6. Conclusions
This paper studied AoI-oriented task assignment and trajectory planning for multi-UAV-assisted wireless-powered IoT networks. Taking into account the strict energy and data causality constraints of IoT devices, as well as the limited endurance capability of UAVs, we formulated an optimization problem to minimize the Avg-AoI of all IoT devices by co-optimizing the EH and data collection time for IoT devices, the selection of DCPs, DCP-IoT association, and task assignment, flight speed, and trajectories of UAVs. Due to the nonconvex nature of this optimization problem, we decoupled it into three subproblems through theoretical derivation, and then proposed a novel DCP association and trajectory-planning scheme based on an iterative three-step process using KKT conditions, the OHTAP clustering algorithm, and the EIPGA/hybrid-based algorithm. Simulation results demonstrate that the proposed scheme consistently outperforms benchmark schemes in Avg-AoI performance and solution stability across various scenarios. In particular, our scheme is scalable up to 2000 IoT devices while maintaining significant advantages in accuracy and efficiency. Furthermore, by incorporating acceleration and speed optimization into age-optimal trajectory planning, UAVs achieve more accurate measurements of Avg-AoI and energy consumption, striking a balance between the two metrics. Additionally, we analyzed several factors that impact the Avg-AoI, including the number of IoT devices and UAVs, the selection of EH models, and the energy storage of UAVs. Proper adjustment of these parameters can strike a balance between the Avg-AoI of IoT devices and the energy consumption of UAVs, thereby further improving the overall performance of the system. In future research, we intend to extend our work to scenarios with dynamically mobile IoT devices and optimize UAV flight altitudes. We also want to integrate more advanced EH models, such as adaptive EH models, and explore emerging methodologies, such as DRL-based approaches.