1. Introduction
Classical guidance laws, such as proportional navigation, do not give enough consideration to impact time and angle, which results in poor cooperative performance. For example, an impact angle constraint should be imposed for the UAVs to approach the target from a specified direction in some cases [
1,
2,
3,
4]. When multiple UAVs need to be gathered in a formation, the impact time should be coordinated [
5,
6,
7,
8]. Three performance indicators, namely miss distance, impact time, and impact angle, should be satisfied simultaneously during cluster flight. Therefore, impact time and angle control guidance (ITACG) laws have attracted extensive attention in both the academic and industrial worlds [
9,
10,
11,
12,
13,
14,
15,
16,
17,
18].
Scholars have conducted a series of studies based on different theories and methods, including optimal control theory, sliding mode control theory, computational geometry, and those methods based on proportional navigation. Lee et al. [
9] proposed the first ITACG law based on optimal control theory, which consists of a feedback loop and an additional control command. In [
10], a two-phase optimal ITACG law based on the virtual target method is proposed, which is suitable for all-around control in planar formations under admissible initial conditions and predetermined terminal constraints. The guidance problem is described as an optimal control model with discontinuities in [
11], and an iterative guidance method is used to realize multi-constrained control. In [
12], a new robust second-order sliding mode control law was developed by incorporating backstepping control, the line-of-sight rate shaping method, and second-order sliding mode control. Hou et al. [
13] designed two different terminal sliding surfaces based on nonsingular terminal sliding mode control theory. This sliding mode control-based guidance law enables the UAV to satisfy both time and angle constraints while reaching the target. Reference [
14], which combined the impact angle control guidance law based on backstepping control and the impact time control guidance law based on proportional navigation, achieved simultaneous control of both impact angle and time. A new ITACG law with adjustable coefficients was proposed in [
15]. The bias proportional guidance with impact angle constraint is extended with feedback control of impact time error. However, the UAVs’ flight trajectories under the aforementioned guidance laws are implicit, making it difficult to accurately estimate the time-to-go in advance. Zhang et al. [
16] proposed a trajectory planning method based on three circular arcs to control impact time and angle. An explicit two-phase flight trajectory based on second-order Bezier curves was proposed in [
17], allowing for precise impact time and angle requirements. In [
18], the geometric ITACG law was extended to three-dimensional space using log-aesthetic space curves (LASCs). However, the offline trajectory design methods in [
16,
17,
18] cannot be computed quickly enough to achieve an optimal solution and do not adequately consider time-varying velocity.
Besides the classical theories and methods mentioned above, data-driven methods [
19,
20,
21] have gained significant attention in the field of guidance law design in recent years. Data-driven methods focus on the relationships between input and output data in a system and approximate the behavior of the system without analytical modeling. After collecting large amounts of input and output data, a data-driven model can be established using machine learning techniques. In [
22], a deep neural network (DNN) was trained to learn the mapping relationship between flight states and distances. Based on this DNN, a multi-constrained prediction–correction guidance algorithm was proposed. Guo et al. [
23] proposed a guidance law that combines data-driven methods and proportional navigation. The data-driven methods were used to compensate for the impact time error of proportional navigation guidance. Based on this, a biased proportional navigation guidance law was designed to control the impact time. In [
24], another impact time control guidance law was designed based on proportional navigation and data-driven methods, which do not rely on time-to-go information. Reference [
25] designed a two-phase guidance law based on data-driven methods to control both impact time and angle.
However, the aforementioned guidance laws are developed based on the assumption of constant velocity and two-dimensional kinematics. In practical applications, the UAV’s velocity varies with engine thrust and air drag in three dimensions. As a result, it is challenging to control impact time and angle precisely under time-varying velocity. This manuscript addresses the guidance problem of impact time and angle control in three dimensions. A guidance law based on data-driven methods and computational geometry is proposed. The main contributions of this manuscript are summarized as follows:
The proposed three-dimensional, multi-constrained guidance method develops a geometric trajectory planning framework. It uses data-driven methods to solve the trajectory parameters, taking impact time, impact angle, maximum overload, and time-varying velocity into account.
Different from references [
9,
10,
11,
12,
13,
14,
15], the proposed guidance method overcomes the reliance on accurate time-to-go estimation, which is quite difficult under time-varying velocities. The proposed method achieves better performance in terms of impact time and angle under time-varying velocity.
Compared to references [
16,
17,
18], the proposed guidance law adopts data-driven methods to solve trajectory parameters, which greatly speeds up the trajectory generation process. This improvement benefits the update of trajectories due to different mission requirements.
The remainder of this manuscript is organized as follows:
Section 2 introduces foundational concepts, including the LASC and guidance model. In
Section 3, a novel ITACG law is proposed based on geometric planning and data-driven methods. Numeric simulations and results are given in
Section 4.
Section 5 provides the conclusions.
3. Guidance Law Design
In this section, the design concept of a two-phase trajectory is initially introduced (
Section 3.1). Then, the impact angle, impact time, and acceleration limitation are analyzed (
Section 3.2). Afterward, a trajectory generation data-driven method is proposed to satisfy multiple constraints (
Section 3.3). Lastly, a pure pursuit and line-of-sight (PLOS) tracking algorithm is employed to generate the guidance commands (
Section 3.4).
Figure 3 shows the block diagram of the guidance method.
3.1. Trajectory Design Concept
The impact time and angle control are realized by adjusting the LASC trajectory’s length, endpoint position, and tangent vector. Denote the initial launch angle of the UAV as , and the desired angle at impact as . Near the moment that the UAV approaches the target, the terminal-phase flight trajectory can be approximated as a straight line. If the angle formed by this straight line with respect to the target is equal to the desired impact angle, the impact angle constraint is satisfied. To ensure a smooth transition between the initial and terminal trajectories, these two trajectories must be tangent at the phase switching point. Moreover, by adjusting the position of this phase switching point along the straight-line trajectory, the length of the two-phase trajectory can be adjusted along with the impact time.
The parameters of the multi-constrained trajectory remain to be determined, including LASC parameters and phase switching point position. However, the complexity of the trajectory generation process makes it very difficult and even impractical to find an analytical solution. Consequently, a data-driven method is used to design the two-phase trajectory. The design framework is illustrated in
Figure 4. The initial phase trajectory is a LASC that coincides with the launch angle at the origin, while the terminal phase trajectory is a straight-line segment that fulfills the impact angle constraint. The phase switching point is denoted as
.
is the local coordinate system established for solving the LASC parameters under given conditions.
3.2. Multi-Constrained Analysis
In order to satisfy the impact angle constraint, it is necessary to make the initial phase trajectory tangent to the starting line at the origin point
and tangent to the collision line at the phase switching point
. Equations (5) and (6) provide insight that the LASC involves five trajectory parameters, namely
,
,
,
, and
. With the values of
,
, and
fixed, adjusting the values of
and
is adequate to satisfy the terminal positional and tangential constraints [
30]. Thus, we set
in this manuscript.
For simplicity, the LASC can be normalized into a standard form by scaling and rotating transformations [
18,
30]. The coordinates of the two endpoints are transformed to
and
, with the scaling factor of
. The initial condition is determined by the starting position
and the launch angle
. The terminal condition is determined by the endpoint position
and the impact angle
. Thus, the LASC in the standard form can be obtained by integrating Equation (9). The generated LASC conforms to the following mapping relation
:
where
is the length of the LASC in the standard form.
To apply the standard-form trajectory to the inertial frame, the size of the LASC needs to be scaled as
The length
of the entire two-phase trajectory is
To satisfy the impact time constraint, the trajectory length needs to be adjusted by tuning the position of the phase switching point. The position of the phase switching point is determined by the length of the line segment
. Therefore, the desired two-phase trajectory satisfies the following mapping relation
:
The following geometric relationship can be obtained from
Figure 4.
Then, Equation (18) can be rewritten as
Although the trajectory length and the trajectory parameters have been obtained, the impact time still needs to be calculated due to time-varying velocity. The following relationship between the trajectory length and the impact time needs to be satisfied.
After obtaining the two-phase trajectory that satisfies the constraints of impact time and angle, it is necessary to evaluate the maximum acceleration demand during trajectory tracking. Because the acceleration demand imposes a constraint on the curvature of the trajectory as follows:
According to Equation (7), the curvature of the LASC is monotonically decreasing. Hence, we have
where
and
are the curvature in the inertial frame and the standard form, respectively.
Equation (20) can be supplemented as
While the UAV velocity varies with time, it is important to determine the exact maximum acceleration during trajectory tracking. To ensure the feasibility of flying along the trajectory, the maximum acceleration is evaluated using the maximum velocity and maximum curvature as
3.3. Trajectory Generation
3.3.1. Analysis of Mapping Relationship
The LASC plays a crucial role in ensuring a smooth transition between the two phases. To satisfy the impact angle constraint while maintaining the adjustability of the trajectory length, a predefined collision line is set. The trajectory length is adjusted by tuning the phase switching point, which further realizes control over the impact time. Consequently, the trajectory parameters that need to be determined include LASC parameters and , initial relative distance , and straight-line trajectory length .
If the trajectory scaling factor is
, then the initial relative distance
, straight-line trajectory length
, and trajectory length
are all scaled in the same proportion. Moreover, the maximum curvature becomes
, while the LASC parameters
and
remain unchanged. For the sake of simplicity, we assume the initial distance between the objects is a definite value
for further analysis. Thus, the Equation (24) can be rewritten as
It can be inferred from Equation (26) that the trajectory parameters are determined by the initial angle, impact angle, and trajectory length. There is also a maximum curvature associated with these trajectory parameters. The corresponding mapping relation
can be expressed as
It is worth pointing out that when a curve rotates around the
X-axis, its geometric features, such as relative distances and angles, remain unchanged. As shown in
Figure 5 and
Figure 6,
and
are the projection vectors of the initial unit vector
and terminal unit vector
in the
plane, respectively.
is the angle between
and the
plane, while
is the angle between
and the
plane.
is the angle between
and the
Z-axis, while
is the angle between
and the
Z-axis. The related relation expressions are as follows:
,
, and
represent the relative geometry of the initial vector and terminal vector in space. The corresponding values remain unchanged in spite of rotating around the
X-axis. Therefore, the Equation (27) can be further simplified as follows:
where
.
Compared with Equation (27), Equation (29) reduces the dimensions of the input variables from 5 to 4, which significantly reduces the computational burden of constructing the database. Additionally, the proposed trajectory design method gains better flexibility by using fewer parameters in trajectory generation.
3.3.2. Acquisition of Trajectory Parameters
The LASC that satisfies the terminal constraints can be obtained by solving the LASC parameters
and
with the improved simplex method [
30,
34]. However, the existing trajectory generation problem involves time-varying velocity, which makes it difficult to solve analytically and rapidly. Local optimization algorithms like the improved simplex method would struggle to obtain the global optimal solution, while global optimization algorithms would suffer from time-consuming computation. Hence, a data-driven method is introduced here to solve the trajectory parameters efficiently. Instead of relying on complex analytical equations, the data-driven method focuses on establishing the relationship between input and output variables, which benefits solving the trajectory parameters accurately and efficiently.
Due to the strong capability of nonlinear fitting, DNN-based modeling is a widely adopted data-driven method and has gained more and more popularity in various fields [
22,
35,
36]. DNN is composed of multiple layers and massive neurons, which allows it to model complex relationships and patterns through training data. Through learning from large datasets, DNN can perform nonlinear mappings between input and output variables, which makes it highly effective in handling complex and high-dimensional problems.
The general training process has three main steps [
37]: forward propagation, loss function calculation, and backpropagation. In the step of forward propagation, input data are passed through the network’s input layer to the output layer:
where
and
are the weight coefficients and biases of the
th layer, respectively.
represents the output value of the
th layer. To achieve better performance in nonlinear mapping, the activation function
is applied.
The predicted output values are obtained through forward propagation. Then, these predicted values are compared with the actual output values to further assess the DNN’s mapping performance. The prediction error is obtained by the loss function, while the weights and biases of each neuron are tuned according to the gradient of the loss function. The commonly used loss function is the mean squared error (MSE), which is defined as follows:
where
is the number of samples.
and
represent the predicted output and actual output values of the
th sample, respectively.
In the step of backpropagation, the updates for weights and biases are performed as follows:
where
represents the weights of the
th layer at time
.
denotes the biases of the
th layer at time
.
is the learning rate that determines the step size for searching weight values.
represents the gradient sign. The “
” sign indicates that the weight updates are made to reduce the loss function.
Therefore, a deep neural network, denoted as
and shown in
Figure 7, can be trained to establish the mapping in Equation (29).
is used to solve the trajectory parameters
,
,
and the maximum curvature
. Then, the multiple constraints can be checked according to Equations (9) and (25).
3.4. Trajectory Tracking
It is crucial to guide the UAV along the designed trajectory in the presence of disturbances. Otherwise, the actual flight trajectory may deviate from the designed trajectory. Therefore, a closed-loop trajectory tracking method is needed. Four commonly used three-dimensional trajectory tracking algorithms are compared in [
38], namely lookahead, nonlinear guidance law (NLGL), pure pursuit and line-of-sight (PLOS), and vector field. Among these algorithms, the PLOS algorithm performs the best in both accuracy and computational efficiency. Therefore, the PLOS trajectory tracking method depicted as Algorithm 1 is used to track the designed multi-constrained trajectory.
Algorithm 1 Pure Pursuit and Line-of-Sight algorithm. |
- 1:
for each time step do - 2:
Obtain the current waypoint , next waypoint , current UAV position and corresponding pitch and yaw angles , in the inertial frame. - 3:
Calculate . - 4:
Calculate . - 5:
Calculate , , . - 6:
Calculate . - 7:
Calculate . - 8:
Calculate . - 9:
Calculate - 10:
Calculate and commands from Equation (10) - 11:
end for
|
Where and are the pitch and yaw acceleration commands, respectively. and are the rotation angles used to transform the inertial frame into the local frame with respect to UAV velocity. These rotations can be represented by rotation matrices and . and are the vertical and cross-track errors, respectively. and are the proportional gains of the pure pursuit strategy. and are the proportional gains of the LOS strategy.
4. Numerical Simulation
4.1. Simulation Setup
To assess the performance of the proposed guidance method, it is compared with the guidance method in [
14]. The simulation experiment involves four UAVs:
,
,
, and
. Their simulation settings are depicted in
Table 1. The UAV engine thrust
Ten = 10,000 N, maximum working time of the engine
ten = 10 s, UAV’s initial mass
m0 = 200 kg, fuel mass
mfu = 50 kg, fuel consumption rate
µ = 5 kg/s, induced drag coefficient
, and maximum acceleration limit
amax = 100 m/s
2. The four proportional gains of the PLOS algorithm are set as
and
. The position of the target is (15,000, 0, 0) m. The autopilot is assumed to have a first-order lag with a time constant of 0.2 s. Thus, the flight velocity, distance, and time can be obtained through simulation.
The simulation was performed on a hardware platform with a 2.5 GHz six-core processor and 16 GB RAM. It takes approximately 50 ms to generate a flight trajectory that satisfies the constraints of impact time, impact angle, and maximum acceleration limit.
4.2. Establishment of Mapping Network
Before training network
, trajectory dataset samples need to be obtained first. Here, the optimal Latin hypercube method is employed to sample the trajectory data within the design space [
39]. This sampling technique ensures that the samples are evenly distributed across the entire design space. The value ranges of the trajectory parameters are set to
,
and
. Other trajectory parameters are set as
and
R0 = 15,000 m. Based on the trajectory parameters, the flight trajectories are obtained by integrating the Frenet–Serret Equation (9). Then the corresponding parameters
,
,
,
and
can be obtained from the generated trajectories. During the generation process of LASC, an undesired condition may occur in which the cumulative angular variation in tangent or binormal vectors is greater than 2π. Such conditions should be excluded because of the excessive loss of UAV velocity in practical applications. Then, a dataset of 1000 samples is obtained, as shown in
Table 2. The input data
are composed of parameters
,
,
, and
. The output data
are composed of parameters
,
,
, and
. The mapping network
is then trained using
and
. All the data are normalized before training and split into two parts. A total of 70% of the data is used for training, while the remaining 30% is used for testing.
Then, a feed-forward fully connected DNN is developed to learn the nonlinear relationship between
and
. The appropriate selection of the hyperparameters is critical to achieving optimal performance. Here, the optimized hyperparameters are given in
Table 3. The corresponding optimization process is similar to that in [
36]. The training loss curve in
Figure 8 shows that the DNN model keeps approaching the real model during the training process. The coefficient of determination
is adopted to assess the approximation error [
40].
Figure 9 illustrates the learning effect, which visualizes the prediction performance of DNN. The results demonstrate that the coefficients of determination of all four output parameters are above 0.99. Therefore, the prediction accuracy of the mapping network
is high.
4.3. Case 1: UAV Guidance under Constant Velocity
In Case 1, the velocity is set to be constant and equal to . The desired impact time is set to . According to Equation (28), we have , , , L1 = 19,500 m, and , , , L2 = 18,900 m. Based on mapping network , we obtain , , , , and , , , . Both trajectories of the UAVs and satisfy the maximum acceleration constraint according to Equation (25).
The simulation results are shown in
Figure 10 and
Table 4. It can be observed that all the UAVs have successfully reached the target position. The impact time errors for UAVs
and
are both within
, while the impact angle errors are both within
. Obviously, compared with UAVs
and
, UAVs
and
achieve better accuracy in both impact time and impact angle. This is because the guidance method in [
14] adopts a linear estimation method to calculate the time-to-go, which would degenerate under nonlinear kinematics.
4.4. Case 2: UAV Guidance under Time-Varying Velocity
In this case, the UAV velocities are time-varying according to Equation (11). The desired impact time is set to . All other simulation conditions are the same as in Case 1. According to Equation (28), we have , , , L1 = 19,433.92 m, and , , , L2 = 18,786.81 m. Based on mapping network , we obtain , , , , and , , , . Both trajectories of the UAVs and satisfy the maximum acceleration constraint according to Equation (25).
The simulation results are shown in
Figure 11 and
Table 5. It can be revealed that although all the UAVs have reached the target point, the guidance performances of UAVs
and
have significantly decreased. Their impact time errors are close to
and the impact pitch angle errors exceed
. This is mainly because the guidance law in [
14] adopts a linear estimation method for time-to-go prediction, which significantly degenerates under time-varying velocity. By contrast, the impact time errors of UAVs
and
are both less than
, and the impact angle errors are both less than
. It can be seen that the time-varying velocity has little influence on the performance of the proposed guidance method. This is because the proposed method has taken the influence of variable velocity into account during neural network training.