UAV Trajectory Design and Power Optimization for Terahertz Band-Integrated Sensing and Communications

Gao, Ying; Xue, Hongmei; Zhang, Long; Sun, Enchang

doi:10.3390/s23063005

Open AccessArticle

UAV Trajectory Design and Power Optimization for Terahertz Band-Integrated Sensing and Communications

by

Ying Gao

¹,

Hongmei Xue

^1,2,

Long Zhang

^1,*

and

Enchang Sun

^3,4

¹

School of Information and Electrical Engineering, Hebei University of Engineering, Handan 056038, China

²

Chongqing Engineering Research Center of Intelligent Sensing Technology and Microsystem, Chongqing 400065, China

³

Beijing Advanced Innovation Center for Future Internet Technology, Beijing University of Technology, Beijing 100124, China

⁴

Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China

^*

Author to whom correspondence should be addressed.

Sensors 2023, 23(6), 3005; https://doi.org/10.3390/s23063005

Submission received: 14 February 2023 / Revised: 4 March 2023 / Accepted: 6 March 2023 / Published: 10 March 2023

(This article belongs to the Special Issue Unmanned Aerial Vehicle (UAV)-Enabled Wireless Communications and Networking II)

Download

Browse Figures

Versions Notes

Abstract

Sixth generation (6G) wireless networks require very low latency and an ultra-high data rate, which have become the main challenges for future wireless communications. To effectively balance the requirements of 6G and the extreme shortage of capacity within the existing wireless networks, sensing-assisted communications in the terahertz (THz) band with unmanned aerial vehicles (UAVs) is proposed. In this scenario, the THz-UAV acts as an aerial base station to provide information on users and sensing signals and detect the THz channel to assist UAV communication. However, communication and sensing signals that use the same resources can cause interference with each other. Therefore, we research a cooperative method of co-existence between sensing and communication signals in the same frequency and time allocation to reduce the interference. We then formulate an optimization problem to minimize the total delay by jointly optimizing the UAV trajectory, frequency association, and transmission power of each user. The resulting problem is a non-convex and mixed integer optimization problem, which is challenging to solve. By resorting to the Lagrange multiplier and proximal policy optimization (PPO) method, we propose an overall alternating optimization algorithm to solve this problem in an iterative way. Specifically, given the UAV location and frequency, the sub-problem of the sensing and communication transmission powers is transformed into a convex problem, which is solved by the Lagrange multiplier method. Second, in each iteration, for given sensing and communication transmission powers, we relax the discrete variable to a continuous variable and use the PPO algorithm to tackle the sub-problem of joint optimization of the UAV location and frequency. The results show that the proposed algorithm reduces the delay and improves the transmission rate when compared with the conventional greedy algorithm.

Keywords:

terahertz band; UAV; integrated sensing and communications; power optimization; trajectory design

1. Introduction

Following the birth of various emerging applications, such as holographic communication, sensory interconnection, three-dimension immersive experiences, and the metaverse, terahertz (THz) band communication is envisioned as one of the key enabling technologies to satisfy the needs of emerging applications [1,2]. Specifically, the ultra-wide THz band that ranges from 0.1 THz to 10 THz promises to support applications with a high quality of service and terabits per second data rates [3]. The THz frequency band will provide new applications for future ultra-high data rate communication because of the ultra-wide THz band [4]. In addition to communication applications, the THz frequency will also enable high resolution and accuracy sensing, such as radar, augmented human senses, and other scenarios [5]. Furthermore, THz networks can realize massive communication connectivity with plenty of available spectrum resources, as more than 10 billion devices are expected to be connected in the coming years [6].

However, the realization of ultra-bandwidth terahertz communication faces three major technical challenges: the first is to technically reach a high-speed terahertz signal of over 100 Gbps, the second is to be able to process high-speed terahertz signals in real time, and the third is to overcome the high channel loss characteristics of terahertz signals. We are going to focus on the third case.

On the one hand, the THz frequency has the characteristic of a high path loss; thus, with the increase in carrier frequencies and communication distances, THz wave propagation suffers from higher spreading losses and stronger non-line-of-sight path losses attributed to scattering, reflection, diffraction, and shadowing [7]. Thus, non-line-of-sight transmission in the THz spectrum is rarely received at the receiving end due to these phenomena. However, line-of-sight transmission is almost nonexistent because of the amount of cover in cities. The deployment of UAVs has been regarded as a complementary alternative to existing cellular systems to achieve higher transmission efficiency and capacity [8]. Therefore, UAVs are needed as aerial base stations to provide line-of-sight transmission links for THz frequencies.

On the other hand, THz communications are highly affected by molecular absorption loss caused by water molecules in the atmosphere. The atmospheric water molecule content varies during the day, and traditionally, the relevant papers, such as [9], usually just give a constant value, which causes errors when choosing channels. Error deviation from the traditional estimation method of path loss is unacceptable in the THz frequency. However, THz-UAVs can detect real-time environmental changes through sensing signals, thus measuring terahertz channel parameters. Therefore, the performance of sensing systems should be taken into account when optimizing THz communication resources [10,11]. In short, integrated sensing and communications endows THz-UAV communication networks with new abilities to interact to perceive the physical world and then improve user information rates. Thus, this topic has importance. We included Table 1 to clearly demonstrate the novelty of our paper and this will be discussed in Section 2. Thus, it is highly necessary to achieve integrated sensing and communications for THz transmission.

Against this background, in this paper, we propose a THz band sensing-assisted UAV communication network to provide wireless communication for users. Particularly, we focus on downlink communications while jointly optimizing the UAV trajectory, frequency association, and power association. The main contributions of our work include:

•: Designing a new sensing and communication power optimization method that considers interference between sensing and communication signals in a THz sensing-assisted UAV communication network.
•: Formulating an optimization problem (Figure 1) and proposing an efficient alternative optimization to solve this problem. First, we use the Lagrangian dual decomposition method to obtain the power of sensing and communication with a fixed trajectory. Second, we use the policy optimization (PPO) algorithm for joint optimization of the UAV location and frequency association with a fixed power of sensing and communication.
•: Designing a PPO algorithm for optimizing the UAV trajectory and frequency association. The PPO algorithm uses the critic network with global information and the actor network with local information to achieve cooperation to explore the angle of UAV and the frequency association.

The rest of this paper is organized as follows: The prior works are described in Section 2. In Section 3, the system model is described. In Section 4, the decomposition problem and the joint optimization design are presented. In Section 5, the simulation results are provided and discussed. Finally, this paper is concluded in Section 6.

Figure 1. Alternating optimization algorithm.

2. Prior Works

The study of UAVs is considered as a new frontier field [12,13,14,15,16]. In [12], the authors designed an optimization problem to maximize the sum rate of a satellite and aerial integrated network. In [13], the authors aimed to maximize the energy efficiency of UAV-enabled communication by optimizing its trajectory. In [14], the authors designed a limited storage space and energy for a UAV-assisted wireless communication system to realize the multi-user communication. In [15], the authors proposed a new protocol for UAV-to-UAV and UAV-to-GCS communication. In [16], the authors give a short overview of the possible threats, attacks, and countermeasures related to UAV communications.

To exploit THz band UAV wireless communication, some initial works have considered THz-enabled aerial communications [17,18,19]. In [17], the authors proposed a UAV-to-user THz sub-band association scheme to eliminate interference in the THz frequency transmission. They proved that terahertz frequencies could be used for communication, and extensions of the wireless charging window and THz-transmitting window are derived. In [18], the authors minimized the total delays of the uplink and downlink transmissions between the UAV and the users by jointly optimizing the location of the operating UAV and the bandwidth of the users, as well as minimizing the transmitting power of the users. They optimized the performance of the drones to communicate using terahertz frequencies. In [19], the authors studied how UAVs support THz communications and an IRS was deployed to help the transmission. Yijin Pan’s aim is to maximize the minimum average rates of all users. They optimized and evaluated the resource optimization problem for terahertz UAVs.

Many works have been dedicated to integrated sensing and communications [20,21,22]. In [20], the authors provided a brief explanation of communication rate maximization theory. Their goals were to research the basic communications phenomenology and to study dealing with systems in an information theory context. In [21], the authors aimed to further investigate the achievable performance of spectrally overlapping radar and communication systems by conjugating the detection. In [22], the authors developed a new approach for producing joint radar communications performance bounds. The authors studied the boundary question of combined communication and sensing.

There are growing research interests in power optimization [23,24,25,26]. In [23], the authors’ design objective was to minimize the total transmission power of both the satellite and BS with a limited onboard power resource. In [24], the authors designed an objective function to maximize the system secrecy energy efficiency under the constraint of the total transmission power budget. In [25], the authors investigated the energy minimization problem of a UAV-assisted data collection sensor network. In [26], the authors designed a function that maximized the sum rate in a satellite–terrestrial integrated network, aiming to satisfy the constraints of per-antenna transmission power and quality-of-service requirements of both satellite and cellular users.

Although there are many papers on UAV communications, most of these existing works [20,21,22,23,24,25,26] do not focus on integrated sensing and communication. Therefore, this area is well worth studying.

We have summarized the relevant work in Table 1.

3. System Model and Problem Formulation

3.1. System Model

Let us now consider a downlink from a THz UAV to N users during time horizon T, shown in Figure 2. We suppose that the user equipment is taken as a two-dimensional (2D) homogeneous Poisson point process (PPP)

Φ_{u}

with intensity

λ_{u}

. For ease of calculation, the time horizon of T is equally divided into

K + 1

time slots with length

\frac{T}{K + 1}

. THz-UAVs use integrated sensing and communication to improve the performance of system. As a result of shared spectrum resources in sensing and communication signals, it is challenging to achieve the critical trade-off between these two integrated functionalities. In order to reduce the interference of communication and sensing signals of the same frequency, at time slot 0, the UAV sends sensing signals and users receive sensing signals. During the time slot of 1 to

K + 1

, the UAV sends communication and sensing signals and users receive communication and sensing signals.

Therefore, for N targets, the user signal received at time slot k can be expressed as:

z_{k} = \sum_{n = 1}^{N} z_{k}^{n} = \sum_{n = 1}^{N} [h_{k}^{n} {(P_{k}^{S, n})}^{\frac{1}{2}} S_{k} + h_{k}^{n} {(P_{k}^{C, n})}^{\frac{1}{2}} C_{k}] + n_{k},

(1)

where

S_{k}

is the sensing signal,

C_{k}

is the communication signal,

P_{k}^{S, n}

and

P_{k}^{C, n}

are the transmitting power of sensing and communication signals at time slot k, respecitvely, and

h_{k}^{n}

is the THz channel gain from the UAV to the user n.

Without loss of generality, we assume that the UAV is moving with a constant speed denoted by V, and the location of the UAV is denoted by

L_{k} = (x_{k}, y_{k}, H)

at time slot k. Here, the altitude, H, of the UAV is assumed to be constant. Therefore, the following coordinates of the UAV at time slot k should be satisfied

\begin{matrix} x_{k} & = x_{k - 1} + V k c o s ψ_{k - 1}^{n}, \\ y_{k} & = y_{k - 1} + V k s i n ψ_{k - 1}^{n}, \end{matrix}

(2)

where

ψ_{k - 1}^{n} \in [0, 4 π]

is the direction of the UAV at time slot

k - 1

from the UAV to the user n.

The following trajectory constraints of the UAV should be satisfied [27]

\begin{matrix} {(L_{1} - L_{0})}^{2} \leq {(V \frac{T}{K + 1})}^{2}, \\ {(L_{k} - L_{k - 1})}^{2} \leq {(V \frac{T}{K + 1})}^{2}, \\ {(L_{K} - L_{K - 1})}^{2} \leq {(V \frac{T}{K + 1})}^{2}, \end{matrix}

(3)

where

L_{0}

and

L_{K}

are the initial location and finial location, respectively.

Considering the LoS transmission, the path loss between the UAV and the user, n, can be written as [28]:

h_{k}^{n} (f_{k, i}^{n}, ε_{n} (f_{k, i}^{n}, ϵ_{k})) = H_{k}^{S p r} (f_{k}^{n}) H_{k}^{A b s} (f_{k}^{n}, ε_{n} (f_{k, i}^{n}, ϵ_{k})) e^{- j 2 π f_{k}^{n}},

(4)

where

f_{k, i}^{n}, i \in {f_{1}, f_{2}, . . ., f_{I}}

is the carrier frequency adpoted by the UAV for communicating with user n and

ε_{n} (f_{k, i}^{n}, ϵ_{k})

is the absorption coefficient parameter related to the carrier frequency

f_{k, i}^{n}

and the number of water molecules in the atmosphere,

ϵ_{k}

, at time slot k.

The free space direct ray or LoS channel transfer function,

H_{L o S}

, consists of the spreading loss function,

H_{S p r}

, and the molecular absorption loss function,

H_{A b s}

. The transfer function due to the spreading loss is given by:

H_{k}^{S p r} (f_{k, i}^{n}) = \frac{c}{4 π f_{k, i}^{n} d_{k}^{u, n}} .

(5)

The transfer function of the molecular absorption loss can be expressed as:

H_{A b s} (f_{k, i}^{n}, ε_{n} (f_{k, i}^{n}, ϵ_{k})) = e^{- ε_{n} (f_{k, i}^{n}, ϵ_{k}) d_{k}^{u, n}},

(6)

where the accuracy of

ε_{n} (f_{k, i}^{n}, ϵ_{k})

is positively correlated with the sensing power. For the specific formula, please refer to [22].

The environmental parameters change slowly; therefore, we can use time slot

k - 1

to represent the sensing estimate value at time slot k. The communication signal of the user at time slot k is the total signal received at time slot k,

z_{k}^{n}

, minus the sensing estimated signal at time slot k. Thus, user n receives communication signals at time slot k, which can be determined by:

C_{k}^{n} = z_{k}^{n} - {\tilde{h}}_{k - 1}^{n} (f_{k - 1, i}^{n}, ε_{n} (f_{k - 1, i}^{n}, ϵ_{k - 1})) {(P_{k - 1}^{S, n})}^{\frac{1}{2}} S_{k - 1},

(7)

where

{\tilde{h}}_{k - 1}^{n} (f_{k - 1, i}^{n}, ε_{n} (f_{k - 1, i}^{n}, ϵ_{k - 1}))

is the THz channel gain at frequency

f_{k, i}^{n}

, which is obtained by sensing signals.

The THz-UAV needs to extract sensing signals to estimate

ε_{n} (f_{k, i}^{n}, ϵ_{k})

and to assign a THz carrier to users. The accuracy of

ε_{n} (f_{k, i}^{n}, ϵ_{k})

affects the THz carrier distribution. Similarly, at time slot k, the sensing signals received by the THz-UAV can be expressed as:

S_{k}^{n} = z_{k}^{n} - {\tilde{h}}_{k - 1}^{n} (f_{k - 1, i}^{n}, ε_{n} (f_{k - 1, i}^{n}, ϵ_{k - 1})) {(P_{k}^{C, n})}^{\frac{1}{2}} C_{k} .

(8)

As a result of sensing and communication signals sharing spectrum resources, the error between the real sensing signal at time k and the estimated sensing signal will interfere with communication signals. In addition, other users using the same THz carrier will also interfere with user n. Therefore, the SINR received at user n can be expressed as:

γ_{k}^{n} = \frac{p_{k}^{C, n} {\tilde{h}}_{k - 1}^{n} (\cdot)}{N_{0} + p_{k}^{S, n} {\tilde{h}}_{k - 1}^{n} (\cdot) - p_{k - 1}^{S, n} {\tilde{h}}_{k - 1}^{n} (\cdot) + \sum_{j = 1}^{j / n} (p_{k}^{C, j} {\tilde{h}}_{k - 1}^{j} (\cdot) + p_{k}^{S, j} {\tilde{h}}_{k - 1}^{j} (\cdot) - p_{k - 1}^{S, j} {\tilde{h}}_{k - 1}^{j} (\cdot))},

(9)

where

N_{0}

is the additive white gaussian noise power at user n using the ith carrier frequency of the THz band.

Correspondingly, the achievable downlink rate of the UAV to user n can be written as [29]:

r_{k}^{n} = B l o g (1 + γ_{k}^{n}),

(10)

where B is the bandwidth of the UAV to user n, which is assumed to be equal for each user.

Thus, the delay of all the users at time slot k can be written as follows:

Φ_{k} = \sum_{n = 1}^{N} \frac{D_{n}}{B_{u} l o g (1 + γ_{k}^{n})},

(11)

where

D_{n}

is the amount of data required by user n.

3.2. Problem Formulation

Using the above setup, we aim to minimize the delay over time slots

K + 1

by jointly optimizing the UAV trajectory, frequency association, and transmission power. This optimization problem is mathematically formulated as:

min_{f_{k}^{n}, L_{k}, p_{k}^{C, n}, p_{k}^{S, n}} \sum_{k = 1}^{K} \sum_{n = 1}^{N} \frac{D_{n}}{B_{u} l o g (1 + γ_{k}^{n})}

(12)

so that

\begin{matrix} C 1 : & \sum_{i = 1}^{I} f_{k, i}^{n} \leq 1, \\ C 2 : & {(L_{1} - L_{0})}^{2} \leq {(V \frac{T}{K})}^{2}, \\ C 3 : & {(L_{k} - L_{k - 1})}^{2} \leq {(V \frac{T}{K})}^{2}, \\ C 4 : & {(L_{K} - L_{k - 1})}^{2} \leq {(V \frac{T}{K})}^{2}, \\ C 5 : & \sum_{n = 1}^{N} p_{k}^{C, n} + p_{k}^{S, n} \leq P_{k}^{m a x}, \end{matrix}

where constraint C1 ensures each user can be associated with one carrier frequency at each time slot k. C2–C4 ensure that the UAV cannot exceed the maximum speed at the time horizon T. C5 limits the maximum transmission power of sensing signals and communication signals.

4. Problem Decomposition and Joint Optimizing Design

4.1. Problem Decomposition

We note that the challenges of solving problem (12) lie in the following reasons. First, the optimization variable

f_{k, i}^{n}

for user n at time slot k is binary, and thereby the feasible set of problem (12) is non-convex. Second, the variables

L_{k}

and

f_{k, i}^{n}

are strongly coupled with the sensing power and communication power. Hence, problem (12) is a mixed integer non-convex optimization problem and in general there is no standard method for solving it efficiently.

To tackle the above challenges, we decompose the original problem (12) into two sub-problems by separating the power allocation optimization (P1) and the trajectory and frequency variables (P2).

We first consider the power variables

p_{k}^{C, n}

and

p_{k}^{S, n}

in (P1) by fixing the trajectory variable

L_{k}^{n}

and the frequency variable

f_{k, i}^{n}

. Therefore, subproblem (P1) can be expressed as:

\begin{matrix} (P 1) : & min_{p_{k}^{C, n}, p_{k}^{S, n}} \frac{D_{n}}{B_{u} l o g (1 + γ_{k}^{n})} \\ s . t . C 5 \end{matrix}

(13)

We next consider the trajectory variable in (14) by fixing the UAV power allocation variables

p_{k}^{C, n}

and

p_{k}^{S, n}

. Therefore, subproblem (P2) can be formulated by:

\begin{matrix} (P 2) : & min_{L_{k}, f_{k, i}^{n}} \sum_{n = 1}^{N} \frac{D_{n}}{B_{u} l o g (1 + γ_{k}^{n})} \\ s . t . C 1 - C 4 \end{matrix}

(14)

The two subproblems are separately optimized with multiple iterations. In the

j + 1

-th iteration

(j = 0, 1, 2, \cdot \cdot \cdot, j_{m a x})

, we first optimize

p_{k}^{C, n}

and

p_{k}^{S, n}

using the Lagrange multiplier method in (P1) with fixed trajectory variable

L_{k}^{n}

and frequency variable

f_{k, i}^{n}

, and find that the solution can be expressed by

p_{k}^{* C, n}, p_{k}^{* S, n}

. We then optimize the variables

L_{k}

and

f_{k, i}^{n}

in (P2) using the PPO algorithm, and find that the solution can be expressed by

L_{k}^{j + 1}, f_{k, i}^{n, j}

. After the solution converges or a the maximum number of iterations or

j_{m a x}

is reached, the solution of (14) can be obtained.

4.2. Joint Optimization Design

In this section, we will present the solution to the above two subproblems, and then propose a joint algorithm via separately optimizing the subproblems in an iterative way.

4.2.1. Joint Sensing and Communication Power

Before solving (12), we first demonstrate the convexity of this problem in Theorem 1 shown below.

Theorem 1.

Problem (P1) is convex. Please refer to Appendix A.

As a result of sub-problem (13) being a convex problem, we chose the Lagrangian dual decomposition method to solve it and obtain the optimal solution of

p_{k}^{* S, n}

and

p_{k}^{* C, n}

. The Lagrangian function of (P1) can be given by:

L (p_{k}^{S, n}, p_{k}^{C, n}, f_{k, i}^{n}, χ, η, ϑ) = Φ^{1} + \sum_{k = 1}^{K} η_{k} (\sum_{n = 1}^{N} p_{k}^{C, n} + p_{k}^{S, n} - P^{m a x}),

(15)

where

η_{k}

is the Lagrange multiplier associated with constraint C5.

Since (P1) is convex, it satisfies the Karush–Kuhn–Tucker (KKT) conditions, which can be specifically derived as:

η_{k} (\sum_{n = 1}^{N} p_{k}^{* C, n} + p_{k}^{* S, n} - P_{k}^{m a x}) = 0,

(16)

\frac{\partial L (\cdot \cdot \cdot)}{\partial p_{k}^{C, n}} = - \frac{B_{u}}{l n 2} \frac{1}{l o g (1 + r_{k}^{n})} \frac{1}{1 + r_{k}^{n}} λ_{1} + \sum_{k = 1}^{K} η_{k} = 0,

(17)

\begin{matrix} \frac{\partial L (\cdot \cdot \cdot)}{\partial p_{k}^{S, n}} = & \frac{B_{u}}{l n 2} \sum_{k = 1}^{K} \frac{p_{k}^{C, n} {\tilde{h}}_{k - 1}^{n} (\cdot) + N_{0} + p_{k}^{S, n} {\tilde{h}}_{k - 1}^{n} (\cdot) - p_{k - 1}^{S, n} {\tilde{h}}_{k - 1}^{n} (\cdot) + \sum_{j = 1}^{j / n} p_{k}^{C, j} {\tilde{h}}_{k - 1}^{j} (\cdot) + p_{k}^{S, j} {\tilde{h}}_{k - 1}^{j} (\cdot) - p_{k - 1}^{S, j} {\tilde{h}}_{k - 1}^{j} (\cdot)}{{(N_{0} + p_{k}^{S, n} {\tilde{h}}_{k - 1}^{n} (\cdot) - p_{k - 1}^{S, n} {\tilde{h}}_{k - 1}^{n} (\cdot) + \sum_{j = 1}^{j / n} p_{k}^{C, j} {\tilde{h}}_{k - 1}^{j} (\cdot) + p_{k}^{S, j} {\tilde{h}}_{k - 1}^{j} (\cdot) - p_{k - 1}^{S, j} {\tilde{h}}_{k - 1}^{j} (\cdot))}^{2}} \\ \times \frac{1}{l o g (1 + r_{k}^{n})} + \sum_{k = 1}^{K} η_{k} = 0 . \end{matrix}

(18)

Case 1. If

η_{k} \neq 0

, the KKT conditions (16) can be written as:

η_{k} (\sum_{n = 1}^{N} P_{k}^{S, m a x} + P_{k}^{C, m a x} - p_{k}^{C, n} - p_{k}^{C, n}) = 0,

(19)

where

P_{k}^{C, m a x}

and

P_{k}^{S, m a x}

indicate the maximum sensing and communication powers for time slot k, respectively.

η_{k} \neq 0

; therefore, the solution of

{\overset{´}{p}}_{k}^{S, n}

and

{\overset{´}{p}}_{k}^{S, n}

in (P1) can be denoted in closed-form as

{\overset{´}{p}}_{k}^{C, n} = P_{k}^{C, m a x}

and

{\overset{´}{p}}_{k}^{S, n} = P_{k}^{S, m a x}

.

Case 2. If

η_{k} = 0

, combining

η_{k} = 0

and (17) and (18), the solution of

{\overset{´}{p}}_{k}^{S, n}

and

{\overset{´}{p}}_{k}^{S, n}

in (P1) can be denoted in closed-form as:

{\overset{‘}{p}}_{k}^{S, n} = \frac{\sum_{j = 1}^{j / n} p_{k}^{C, j} h_{k}^{j} (f_{k, i}^{j})}{h_{k}^{n} (f_{k, i}^{j})},

(20)

{\overset{‘}{p}}_{k}^{C, n} = \frac{- p_{k - 1}^{S, n} {\tilde{h}}_{k - 1}^{n} (\cdot) + \sum_{j = 1}^{j / n} p_{k}^{C, j} . {\tilde{h}}_{k - 1}^{j} (\cdot) + p_{k}^{S, j} {\tilde{h}}_{k - 1}^{j} (\cdot) - p_{k - 1}^{S, j} {\tilde{h}}_{k - 1}^{j} (\cdot)}{{\tilde{h}}_{k - 1}^{n}} .

(21)

In summary, the optimal solutions of

{\overset{´}{p}}_{k}^{S, n}

and

{\overset{´}{p}}_{k}^{S, n}

in (P1) can be denoted in closed-form as:

a r c m i n_{p_{k}^{* C, n}, p_{k}^{* S, n}} \{Φ_{k} (p_{k}^{C, n} = {\overset{´}{p}}_{k}^{C, n}, p_{k}^{S, n} = {\overset{´}{p}}_{k}^{S, n}), Φ_{k} (p_{k}^{C, n} = {\overset{‘}{p}}_{k}^{C, n}, p_{k}^{S, n} = {\overset{‘}{p}}_{k}^{S, n})\}

(22)

4.2.2. Joint UAV Trajectory and Frequency Association

As shown in Figure 3, we pursue an intelligent UAV trajectory optimization aided by the PPO algorithm for improving the system’s delay. The proposed PPO algorithm framework considers the UAV as a learning agent. The learning process of the PPO algorithm for the UAV by interacting with the THz environment can be expressed as:

(S, A, R, γ),

(23)

where

S

is the state space,

A

is the action space, and

R = S \times A \to R

is the infinite set of rewards that contain the set of immediate rewards when moving from one state to next state resulting from the actions taken by the agents. The state, action, and reward are defined as follows:

•: State: The states observed by an agent are determined by a combination of the transmission powers of sensing and communication. Thus, we define the state of a UAV at time step t as follows:

$S_{k}^{(t)} = (\sum_{n = 1}^{N} p_{k}^{S, n, (t)}, \sum_{n = 1}^{N} p_{k}^{C, n, (t)}) .$

(24)
•: Action: The action is to choose proper flight direction and proper frequency association to obtain better rewards. Furthermore, we define the action performed in time-step t as $a_{k}^{(t)}$ . Let us suppose the possibility of state $s_{k}$ taking action $a_{k}$ at time-step t is $P_{θ} (a_{k}^{(t)} | s_{k}^{(t)})$ , where $θ$ is the probability density function with parameter $θ$ . The action is denoted by the all possible actions at time step t, i.e., $A_{k}^{(t)} = {0 \sim 4 π} \times f_{k, i}^{n, (t)}$ .
•: Reward: The agent receives an immediate reward, denoted as $T_{k}^{(t)} ≜ T {s_{k}^{(t)}, a_{k}^{(t)}} \in R$ , which describes its benefit from taking action $a_{k}^{(t)}$ . Thus, the function of reward can be written as:

$T_{n} = \sum_{k^{'} = k}^{K} η^{k^{'} - k} Φ_{k},$

(25)

where $η^{k^{'} - k} \in [0, 1]$ is the discount rate, which determines the effect of future rewards on the current action. $η^{k^{'} - k} \to 1$ means that the reward value of the future state has a great influence on the action state function, while $η^{k^{'} - k} \to 0$ means that the reward value of the future state has little influence on the action state function.

In the policy gradient algorithm, shown in Algorithm 1, the agent updates the policy by gradient augmentation. In PPO, the old actor modifies its parameters by duplicating the actor’s parameters. In order not to incur too much error, we introduce

r a t i o_{k}

to limit the magnitude of rewards. In other words, when calculating the rewards, by limiting the ratio of the new policy and the old policy, the amplitudes of the state can be limited. As a result, it not only improves the stability of the PPO algorithm, but also reduces its complexity and improves the efficiency of the calculation. In this paper, the ratio of the old to new policy of each agent is calculated as follows:

r a t i o_{k} = \frac{θ_{k}}{θ_{k - 1}}, k \in {1, 2, . . ., K} .

(26)

Figure 3 describes the operation of the PPO algorithm. During training, a set of samples are chosen from the storage system to update the THz network parameters. The value of the network determines the choice of action through the rewards value of these sampled values. The rewards value in turn affects the sampling probability density functions. When the agent explores the THz network parameters, it will select an action at random, targeting a higher long-term reward. Furthermore, it selects the action that gains the most rewards immediately. In order to improve the sampling efficiency, PPO adopts an important sampling method to change the policy gradient algorithm from the on policy to the off policy. At this time, the update formula of the actor network is:

\begin{matrix} min_{π_{θ_{k}}} Φ_{k}^{2} (θ_{k}) = E_{s_{t} \sim P_{θ_{k}} (τ)} [J_{θ_{k}} (θ_{k})], \end{matrix}

(27)

where

τ = {s_{1} . a_{1}, s_{2}, a_{2}, . . . ., s_{K}, a_{K}}

represents the trajectory of the agent in the entire episode.

PPO uses a clip function to directly limit the update range to

[1 - ε, 1 + ε]

. From Figure 4, this function of PPO can be written as follows:

\begin{matrix} J_{θ_{k}} (θ_{k}) \approx \sum_{(s_{t}, a_{t})} m i n { & \frac{P_{θ_{k}} (a_{k} | s_{k})}{P_{θ} (a_{k} | s_{k})} A^{θ_{k}} (s_{t}, a_{t}), \\ c l i p (\frac{P_{θ_{k}} (a_{k} | s_{k})}{P_{θ} (a_{k} | s_{k})}, 1 - ε, 1 + ε) A^{θ_{k}} (s_{t}, a_{t})}, \end{matrix}

(28)

where

ε

is a hyperparameter that represents the maximum difference between

P_{θ_{k}}

and

P_{θ}

.

P_{θ_{k}} (τ)

interacts with the environment and

P_{θ} (τ)

has already interacted with the environment. Furthermore,

A_{θ^{k}} (s_{t}, a_{t})

represents the estimation of the advantage function at time step t and can be written as:

\begin{matrix} A_{θ_{k}} (s_{k}, a_{k}) \approx \frac{1}{J} \sum_{j = 1}^{J} (Φ_{k}^{2} - E [Φ_{k}^{2}]), \end{matrix}

(29)

where J is the number of points to sample with the probability of

P_{θ} (a_{k} | s_{k})

.

P_{θ_{k}} (a_{k} | s_{k})

is the modified probability density function parameters (

θ_{k}

). Furthermore, the function of

c l i p

can be written as:

\begin{matrix} c l i p (x, x_{m i n}, x_{m a x}) = \{\begin{matrix} x, & if x_{m i n} \leq x \leq x_{m a x} . \\ x_{m i n}, & if x < x_{m i n} . \\ x_{m a x}, & if x > x_{m a x} . \end{matrix} \end{matrix}

The formula for updating the action of possibility,

P_{θ_{k}} (τ)

, can be written as:

\begin{matrix} θ_{k + 1} ⟵ θ_{k} + η^{k^{'} - k} ▽ Φ_{k} (θ_{k}) . \end{matrix}

(30)

4.3. Computational Complexity

Theorem 2.

The complexity of Algorithm 2 is given by

O (N + j_{m a x} \cdot K N)

.

Proof.

In Algorithm 2, the computationally most expensive part is solving the sub-problems in (P1) (line 2) and (P2) (line 3).

In line 2 of Algorithm 2, sub-problem (P1) is solved. Every user needs to calculate function (22). Since there are N users, the computational complexity using method Lagrangian function is

O (N)

.

In line 3 of Algorithm 2, sub-problem (P2) is solved by Algorithm 1. The computationally most expensive part is lines 3 and 4 of Algorithm 1. In lines 3 and 4 of Algorithm 1, we need to calculate the probability density function parameter

θ_{k}

and calculate the rewards function. Thus, the computational complexity is

O (K N)

. We assume that the maximum number of iterations of Algorithm 1 is

j_{m a x}

. Therefore, the total computational complexity of Algorithm 1 can be written as

O (j_{m a x} \cdot K N)

.

To summarize, the overall computational complexity of Algorithm 2 is calculated as

O (N + j_{m a x} \cdot K N)

. This concludes the proof. □

Algorithm 1 The Proximal Policy Optimization Algorithm

1:: for iteration = 1,2,.... $j_{m a x}$ do
2:: for action = 1,2,....K do
3:: Run policy $θ_{k}$ in environment for K time steps according to (30)
4:: Compute advantage estimates $A_{θ_{k}}$ according to (29)
5:: end for
6:: Optimize surrogate $θ_{k}$
7:: $θ_{k + 1} \leftarrow θ_{k}$
8:: Calculate $J_{θ_{k}} (θ_{k})$ according to (28)
9:: end for

Algorithm 2 The Proposed Alternating Optimization Algorithm to Solve Problem (12)

1:: for iteration = 1,2,.... $j_{m a x}$ do
2:: Solve problem (P1) for given $L_{k}, f_{k, i}^{n}$ and denote the optimal solution as $P_{k}^{* C, n}, P_{k}^{* S, n}$ .
3:: Solve problem (P2) for given $P_{k, j}^{* C, n}, P_{k, j}^{* S, n}$ and denote the suboptimal solution as $L *_{k, j + 1}, f_{k, j + 1}^{* n}$ .
4:: j = j + 1;
5:: end for

5. Simulation Results

In this section, we numerically evaluate the performance of the overall alternating optimization algorithm of intelligent trajectory planning by implementing simulations in MATLAB. The radius of the UAV coverage area was set to 50 m. We set the bandwidth which is allocated to the UAV as 10 GHz. We adopted THz carrier frequencies of 300 GHz, 310 GHz, 320 GHz, 330 GHz, 340 GHz, and 350 GHz. The details of the relevant parameters are listed in Table 2.

To investigate the convergence behavior of the proposed algorithm, we start with illustrating the accumulation of the UAV communication rate versus the number of iterations when the user Poisson distribution parameter is

λ_{u} = 0.2, 0.3

, or

0.4

persons per meter, Figure 5. It is observed that the proposed algorithm provides a higher sum rate of the system than that of the greedy sampling algorithm. This is because the PPO algorithm considers the rewards from the time of

k + 1

to

K + 1

. The greedy algorithm is the result of the k-time obtained by mass sampling. Without considering other possible cases in general, the local optimal solution is selected each time and no backtracking is carried out, so the optimal solution is rarely obtained. This highlights the importance of the PPO algorithm, and how it theoretically gives the better sum rate for the system.

In Figure 6, we show a comparison of the system’s sum rate in the THz and Sub-6G frequency ranges, respectively, between the proposed algorithm and the greedy algorithm under varying user distribution functions. It is discovered that the proposed algorithm provides a higher sum rate of the system than that of the greedy algorithm, because in the greedy algorithm, there is a large number of random sampling at time k, while the PPO algorithm not only considers the system performance at time k, but also considers the system performance from time k to time

K + 1

. It is also discovered that the THz frequency provides a higher sum rate of the system than that of the Sub-6G. That is because the signal-to-noise ratio is much higher at the terahertz frequency than at the sub-6G frequency due to the high pathloss characteristic of THz channel resulting in low interference between users. This highlights the importance of an appropriate algorithm for the THz frequency.

In Figure 7, the relationship between the maximum communication power and sensing power is shown. It is observed that as the maximum communication power increases, the transmitting sensing power increases, but once the maximum value is reached, the sensing and communication powers start to decrease to maintain the same communication rate. This is because the communication and sensing signals share a spectrum. When the value of the maximum transmitted communication power increases, the THz-UAV increases the power of communication in order to obtain a higher information rate. As a result of the C5 constraint, the sensing power becomes smaller. The precision of sensing the terahertz channel will be affected by the decrease in sensing power. This will affect the allocation of the THz-UAV channel and cause the information rate to decrease. Therefore, there must be a maximum value of the sensing power to obtain the minimum delay.

In Figure 8, we show the relationship between frequency efficiency in the THz and Sub-6G frequencies, respectively. The numbers of users under the parameter of user density function are

λ_{u} = 0.2

and

λ_{u} = 0.3

. We can see that as the number of users increases, the frequency efficiency increases. This is due to the fact that as the number of users increases, the information rate has been greatly improved. As can be seen from the figure, with the same number of users, the higher the user density function parameter, the lower the spectrum density. This is because when the user density function parameter is higher, the interference between users is stronger, resulting in a reduction in the information rate, so the spectral efficiency is lower. Therefore, the frequency spectrum efficiency of THz wireless communication is easily affected by the user density.

6. Conclusions

This paper investigated the problem of joint UAV trajectory, frequency association, and power optimization, aiming to minimize the sum delay in the terahertz band. The sum delay minimization was formulated as a convex optimization problem. This problem was transformed into the Lagrange multiplier method and a PPO problem. A Lagrange sub-problem was devised, aiming to obtain the sensing and communication powers. A PPO algorithm was devised to obtain the UAV trajectory and frequency association. Our results showed that the proposed algorithm achieved a good performance with a significant increase in the sum delay compared with the greedy algorithm and the Sub-6G frequency scenario, indicating its potential in a practical design. However, the method used in this paper has not used in a real UAV. Thus, there is a certain gap between theory and practice, which provides a direction for future research.

Author Contributions

Conceptualization, Y.G.; methodology, Y.G.; software, L.Z.; validation, H.X., L.Z. and E.S.; formal analysis, L.Z.; investigation, L.Z.; resources, L.Z.; data curation, L.Z.; writing—original draft preparation, Y.G.; writing—review and editing, Y.G. and L.Z.; visualization, L.Z.; supervision, L.Z., H.X. and E.S.; project administration, L.Z.; funding acquisition, L.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China under grant 61971032, in part by the Hebei Natural Science Foundation under grant F2022402001 and grant A2020402013, and in part by the Open Fund of Chongqing Engineering Research Center of Intelligent Sensing Technology and Microsystem under grant D2021337.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to legal restrictions.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof.

The second-order derivative of objective function (13) with respect to

p_{k}^{C, n}

and

p_{k}^{S, n}

can be, respectively, obtained by

\frac{\partial^{2} Φ_{k}^{1}}{\partial^{2} p_{k}^{C, n}} = \sum_{n = 1}^{N} 2 \frac{D_{n}}{B_{u} l n 2} λ_{1} \frac{1}{{(1 + γ_{k}^{n})}^{2}} (λ_{1} - \frac{1}{l o g (1 + γ_{k}^{n})}) \geq 0

(A1)

where

λ_{1} = \frac{h_{k}^{n} (f_{k, i}^{n})}{N_{0} + p_{k}^{S, n} h_{k}^{n} (f_{k, i}^{n}) - p_{k - 1}^{S, n} h_{k - 1}^{n} (f_{k, i}^{n}) + \sum_{j = 1}^{j / n} p_{k}^{C, j} h_{k}^{j} (f_{k, i}^{j}) + p_{k}^{S, j} h_{k}^{j} (f_{k, i}^{n}) - p_{k - 1}^{S, j} h_{k - 1}^{j} (f_{k, i}^{n})}

.

\frac{\partial^{2} Φ_{k}^{1}}{\partial^{2} p_{k}^{S, n}} = \sum_{n = 1}^{N} 2 \frac{D_{n}}{B_{u} l n 2} λ_{2} \frac{1}{{(1 + γ_{k}^{n})}^{2}} {(\frac{h_{n}^{n} (f_{k, i}^{n}) + λ_{2}}{p_{k}^{C, n} h_{n}^{n} (f_{k, i}^{n})})}^{2} (2 + r_{k}^{n} + \frac{h_{n}^{n} (f_{k, i}^{n}) + λ_{2}}{p_{k}^{C, n} h_{n}^{n} (f_{k, i}^{n})}) \geq 0

(A2)

where

λ_{2} = N_{0} - p_{k - 1}^{S, n} h_{k - 1}^{n} (f_{k, i}^{n}) + \sum_{j = 1}^{j / n} p_{k}^{C, j} h_{k}^{j} (f_{k, i}^{j}) + p_{k}^{S, j} h_{k}^{j} (f_{k, i}^{n}) - p_{k - 1}^{S, j} h_{k - 1}^{j} (f_{k, i}^{n})

.

Therefore, the problem (P1) is convex. This concludes the proof. □

References

Zhang, L.; Zhao, H.; Hou, S.; Zhao, Z.; Xu, H.; Zhang, R. A Survey on 5G Millimeter Wave Communications for UAV-Assisted Wireless Networks. IEEE Access 2019, 7, 117460–117504. [Google Scholar] [CrossRef]
Andrews, J.G.; Buzzi, S.; Choi, W.; Hanly, S.V.; Lozano, A.; Soong, A.C.K.; Zhang, J.C. What will 5G be? IEEE J. Sel. Area in Comm. 2014, 32, 1065–1082. [Google Scholar] [CrossRef]
Sarieddeen, H.; Saeed, N.; Al-Naffouri, T.Y.; Alouini, M.-S. Next generation terahertz communications: A rendezvous of sensing, imaging, and localization. IEEE Commun. Mag. 2020, 58, 69–75. [Google Scholar] [CrossRef]
Zhang, Z.; Xiao, Y.; Ma, Z.; Xiao, M.; Ding, Z.; Lei, X.; Karagiannidis, G.K.; Fan, P. 6G wireless networks: Vision, requirements, architecture, and key technologies. IEEE Veh. Technol. Mag. 2019, 14, 28–41. [Google Scholar] [CrossRef]
Liu, A.; Huang, Z.; Li, M.; Wan, Y.; Li, W.; Han, T.X.; Liu, C.; Du, R.; Tan, D.K.P.; Lu, J.; et al. A Survey on Fundamental Limits of Integrated Sensing and Communication. IEEE Commun. Surv. Tutor. 2022, 24, 994–1034. [Google Scholar] [CrossRef]
Liu, F.; Cui, Y.; Masouros, C.; Xu, J.; Han, T.X.; Eldar, Y.C.; Buzzi, S. Integrated Sensing and Communications: Toward Dual-Functional Wireless Networks for 6G and Beyond. IEEE J. Sel. Areas Commun. 2022, 40, 1728–1767. [Google Scholar] [CrossRef]
Zhang, J.; Fei, Z.; Wang, X.; Liu, P.; Huang, J.; Zheng, Z. Integrated Scheduling of Sensing, Communication, and Control for mmWave/THz Communications in Cellular Connected UAV Networks. IEEE J. Sel. Areas Comm. 2022, 40, 2103–2113. [Google Scholar]
Zhang, L.; Wang, Y.; Min, M.; Guo, C.; Sharma, V.; Han, Z. Privacy-Aware Laser Wireless Power Transfer for Aerial Multi-Access Edge Computing: A Colonel Blotto Game Approach. IEEE Internet Things 2022, 15, 2327–4662. [Google Scholar] [CrossRef]
Wang, X.; Wang, P.; Ding, M.; Lin, Z.; Lin, F.; Vucetic, B.; Hanzo, L. Performance Analysis of Terahertz Unmanned Aerial Vehicular Networks. IEEE Trans. Veh. Technol. 2020, 69, 16330–16335. [Google Scholar] [CrossRef]
Griffiths, H.; Cohen, L.; Watts, S.; Mokole, E.; Baker, C.; Wicks, M.; Blunt, S. Radar spectrum engineering and management: Technical and regulatory issues. Proc. IEEE 2015, 103, 85–102. [Google Scholar] [CrossRef]
Roberton, M.; Brown, E.R. Integrated radar and communications based on chirped spread-spectrum techniques. IEEE MTT-S Int. Microw. Symp. Dig. 2013, 1, 611–614. [Google Scholar]
Lin, Z.; Lin, M.; de Cola, T.; Wang, J.; Zhu, W.; Cheng, J. Supporting IoT With Rate-Splitting Multiple Access in Satellite and Aerial-Integrated Networks. IEEE Internet Things J. 2021, 8, 11123–11134. [Google Scholar] [CrossRef]
Yuan, Z.; Yang, Y.; Wang, D.; Ma, X. Energy-Efficient Trajectory Optimization for UAV-Enabled Cellular Communications Based on Physical-Layer Security. Aerospace 2022, 9, 50. [Google Scholar] [CrossRef]
Lan, T.; Qin, D.; Sun, G. Joint Optimization on Trajectory, Cache Placement, and Transmission Power for Minimum Mission Time in UAV-Aided Wireless Networks. ISPRS Int. J. Geo-Inf. 2021, 10, 426. [Google Scholar] [CrossRef]
Ko, Y.; Kim, J.; Duguma, D.G.; Astillo, P.V.; You, I.; Pau, G. Drone Secure Communication Protocol for Future Sensitive Applications in Military Zone. Sensors 2021, 21, 2057. [Google Scholar] [CrossRef]
Krichen, M.; Adoni, W.Y.H.; Mihoub, A.; Alzahrani, M.Y.; Nahhal, T. Security Challenges for Drone Communications: Possible Threats, Attacks and Countermeasures; SMARTTECH: Riyadh, Saudi Arabia, 2022; pp. 184–189. [Google Scholar]
Li, Q.; Nayak, A.; Zhang, Y.; Yu, F.R. A Cooperative Recharging-Transmission Strategy In Powered UAV-Aided Terahertz Downlink Networks. IEEE Trans. Veh. Technol. 2022, 1939–9359. [Google Scholar] [CrossRef]
Xu, L.; Chen, M.; Chen, M.; Yang, Z.; Chaccour, C.; Saad, W.; Hong, C.S. Joint Location, Bandwidth and Power Optimization for THz-enabled UAV Communications. IEEE Commun. Lett. 2021, 25, 1984–1988. [Google Scholar] [CrossRef]
Raza, A.; Ijaz, U.; Ishfaq, M.K.; Ahmad, S.; Liaqat, M.; Anwar, F.; Iqbal, A.; Sharif, M.S. Intelligent reflecting surface-assisted terahertz communication towards B5G and 6G: State-of-the-art. Microw. Opt. Technol. Lett. 2022, 64, 858–866. [Google Scholar] [CrossRef]
Chiriyath, A.R.; Paul, B.; Bliss, D.W. Radar-Communications Convergence: Coexistence, Cooperation, and Co-Design. IEEE Trans. Cogn. Commun. Netw. 2017, 3, 1–12. [Google Scholar] [CrossRef]
Zheng, L.; Lops, M.; Wang, X.; Grossi, E. Joint Design of Overlaid Communication Systems and Pulsed Radars. IEEE Trans. Signal Process. 2018, 66, 139–154. [Google Scholar] [CrossRef]
Chiriyath, A.R.; Paul, B.; Jacyna, G.M.; Bliss, D.W. Inner Bounds on Performance of Radar and Communications Co-Existence. IEEE Trans. Signal Process. 2015, 64, 464–474. [Google Scholar] [CrossRef]
Lin, Z.; Niu, H.; An, K.; Wang, Y.; Zheng, G.; Chatzinotas, S.; Hu, Y. Refracting RIS-Aided Hybrid Satellite-Terrestrial Relay Networks: Joint Beamforming Design and Optimization. IEEE Trans. Aerosp. Electron. Syst. 2022, 58, 3717–3724. [Google Scholar] [CrossRef]
Lin, Z.; An, K.; Niu, H.; Hum, Y.; Hu, Y.; Chatzinotas, S.; Zheng, G.; Wang, J. SLNR-based Secure Energy Efficient Beamforming in Multibeam Satellite Systems. IEEE Trans. Aerosp. Electron. Syst. 2022, in press. [CrossRef]
Wang, Y.; Chen, M.; Pan, C.; Wang, K.; Pan, Y. Joint Optimization of UAV Trajectory and Sensor Uploading Powers for UAV-Assisted Data Collection in Wireless Sensor Networks. IEEE Interent Things 2022, 9, 11214–11226. [Google Scholar] [CrossRef]
Lin, Z.; Lin, M.; Wang, J.; de Cola, T.; Wang, J. Joint Beamforming and Power Allocation for Satellite-Terrestrial Integrated Networks With Non-Orthogonal Multiple Access. IEEE J.-STSP 2019, 13, 657–670. [Google Scholar] [CrossRef]
Zhang, L.; Ma, X.; Zhuang, Z.; Xu, H.; Sharma, V.; Han, Z. Q-Learning Aided Intelligent Routing with Maximum Utility in Cognitive UAV Swarm for Emergency Communications. IEEE Trans. Veh. Technol. 2022, in press. [Google Scholar] [CrossRef]
Han, C.; Bicen, A.O.; Akyildiz, I.F. Multi-Ray Channel Modeling and Wideband Characterization for Wireless Communications in the Terahertz Band. IEEE Trans. Wirel. Commun. 2015, 14, 2402–2412. [Google Scholar] [CrossRef]
Zhang, L.; Zhang, H.; Guo, C.; Xu, H.; Song, L.; Han, Z. Satellite-Aerial Integrated Computing in Disasters: User Association and Offloading Decision. In Proceedings of the 2020 IEEE International Conference on Communications (ICC), Dublin, Ireland, 7–11 June 2020; pp. 554–559. [Google Scholar]

Figure 2. Illustration of a terahertz band integrated sensing and communications network.

Figure 3. PPO algorithm.

Figure 4. The value of

J_{θ_{k}} (θ_{k})

.

Figure 4. The value of

J_{θ_{k}} (θ_{k})

.

Figure 5. Number of iterations of the algorithms.

Figure 6. Relationship between rate and number of users.

Figure 7. Relationship between sensing and communication powers.

Figure 8. Relationship between frequency efficiency and number of users.

Table 1. Our novel contribution contrasted to the state-of-the-art in UAV communication research.

	[12,13,14,15,16]	[17,18,19]	[20,21,22]	[23,24,25,26]	Our Work
THz Frequency	×	✔	×	✔	✔
UAVs Communication	✔	✔	×	×	✔
Power Optimization	✔	×	×	✔	✔
Integrated Sensing and Communication	×	×	✔	×	✔
UAV Trajectory Design	✔	✔	×	✔	✔

Table 2. Simulation parameters.

Parameter	Value	Parameter	Value
Time, T	20 s	A-BS Height, H	5 m
Time slot, $K + 1$	1 ms	ABS Speed, V	[0,3] m/s
Noise power, $N_{0}$	−20 bBm	Reference pressure, $p_{0}$	101.325 kPa
Reference temperature, $T_{S T P}$	20	Maximum sensing transmission power, $p_{k}^{S, n}$	30 dBm
Maximum communication transmission power, $p_{k}^{C, n}$	30 dBm	Discount rate, $η$	0.1

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Gao, Y.; Xue, H.; Zhang, L.; Sun, E. UAV Trajectory Design and Power Optimization for Terahertz Band-Integrated Sensing and Communications. Sensors 2023, 23, 3005. https://doi.org/10.3390/s23063005

AMA Style

Gao Y, Xue H, Zhang L, Sun E. UAV Trajectory Design and Power Optimization for Terahertz Band-Integrated Sensing and Communications. Sensors. 2023; 23(6):3005. https://doi.org/10.3390/s23063005

Chicago/Turabian Style

Gao, Ying, Hongmei Xue, Long Zhang, and Enchang Sun. 2023. "UAV Trajectory Design and Power Optimization for Terahertz Band-Integrated Sensing and Communications" Sensors 23, no. 6: 3005. https://doi.org/10.3390/s23063005

APA Style

Gao, Y., Xue, H., Zhang, L., & Sun, E. (2023). UAV Trajectory Design and Power Optimization for Terahertz Band-Integrated Sensing and Communications. Sensors, 23(6), 3005. https://doi.org/10.3390/s23063005

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

UAV Trajectory Design and Power Optimization for Terahertz Band-Integrated Sensing and Communications

Abstract

1. Introduction

2. Prior Works

3. System Model and Problem Formulation

3.1. System Model

3.2. Problem Formulation

4. Problem Decomposition and Joint Optimizing Design

4.1. Problem Decomposition

4.2. Joint Optimization Design

4.2.1. Joint Sensing and Communication Power

4.2.2. Joint UAV Trajectory and Frequency Association

4.3. Computational Complexity

5. Simulation Results

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI