1. Introduction
In certain evasive situations, a larger miss distance is not always preferable. An excessive miss distance may result in unnecessary energy waste and hinder course recovery. For instance, when an unmanned aerial vehicle (UAV) primarily tasked with engaging a target is evading an interceptor, completing the evasion with a large miss distance may result in insufficient energy to engage the subsequent target. Critical safe miss distance evasion refers to evasive action that ensures the miss distance exceeds a critical safe value while minimizing energy consumption. Consequently, critical safe miss distance evasion represents a more efficient maneuvering strategy [
1].
Traditional maneuver evasion techniques primarily consist of programmatic maneuvers such as sinusoidal maneuvers [
2] and spiral maneuvers [
3], which exhibit relatively low levels of proactivity. With the advancements in optimal control theory, methods utilizing detected pursuer information for active evasion have become mainstream. These methods involve the application of Pontryagin’s Maximum Principle and differential game theory to derive analytical solutions for optimal guidance laws [
4,
5,
6]. Recently, scholars have expanded and refined research on optimal maneuver evasion strategies by incorporating high-order complex models, conditions of incomplete information, and multi-agent cooperative evasion strategies. For instance, reference [
7] investigated optimal maneuver evasion strategies by establishing a high-order guidance system state-space model for interceptors under proportional guidance control, and utilizing a formula for miss distance series. Reference [
8] combined guidance laws derived from complete and incomplete information models using mixed-strategy game theory to propose a new adaptive weighted differential game guidance law. Reference [
9] integrated the covariance matrix analysis of Kalman filtering into differential game theory to propose an orientation-driven guidance law. Additionally, references [
10,
11,
12] developed maneuver evasion guidance laws for evaders in active defense scenarios using differential game theory. In the study of the one-evader two-pursuer game problem, the work in reference [
13] is based on ideal dynamic characteristics, whereas reference [
14] adopted first-order dynamic assumptions and bounded control. Furthermore, with the development of machine learning, intelligent evasion methods have been proposed, such as acquiring evasion guidance law through deep reinforcement learning [
15] and employing machine learning to identify saddle-point solutions in nonlinear optimal control problems [
16,
17].
The aforementioned methods provide robust guidance laws for evasion in various scenarios. However, the objective function, which aims to maximize the miss distance, often results in excessive energy consumption and over-maneuvering issues. To address this, traditional solutions propose treating the control variables as process costs, which are subsequently weighted with terminal costs to form a quadratic differential game [
18]. Nevertheless, quadratic differential games yield suboptimal solutions, as they do not consider the boundaries of control variables when solving for co-state variables, and the weighting parameters for the objective function necessitate manual tuning. Consequently, building on previous optimal control theories, several studies have explored optimal guidance laws under critical safe miss distance constraints. Reference [
19] derived a solution for the critical safe miss distance evasion problem in a two-dimensional plane under bounded control. However, the miss distance constraint remains a soft constraint weighted within the objective function. Reference [
1] employed a similar objective function to investigate three-dimensional critical safe miss distance guidance laws and optimized the selection of the miss distance setting in the objective function using a neural network surrogate model. Reference [
20] simplified the evasion maneuver problem to the timing of maneuver selection and utilized LSTM networks to predict the intercept miss distance in real-time during flight, thereby determining the optimal timing for evasion. However, this method does not analyze its optimality.
As discussed above, the current research on critical safe miss distance evasion remains incomplete, particularly regarding the optimality of energy consumption. To address this gap, this study focuses on the maneuvering evasion problem in three-dimensional space, formulating a guidance law for critical safe miss distance evasion grounded in optimal control theory. Initially, the state equation for the zero-effort miss (ZEM) in three-dimensional space is established by treating the pursuer’s maneuvers as disturbances and estimating their values. Subsequently, an optimal control problem is formulated with the control cost defined as the objective function and the ZEM at the terminal time specified as the terminal constraint. The problem is subsequently solved using Pontryagin’s Maximum Principle and iterative algorithms. Finally, the effectiveness of the proposed method is validated through comparative simulations.
This paper presents three substantial and innovative contributions. First, the consideration of bounded control enhances the realism of the modeling process. The introduction of bounded control also complicates the optimal control problem, necessitating an iterative solution approach. Second, it introduces a maneuver estimation method for the pursuer, effectively transforming the game problem into a unilateral optimal control problem, thereby significantly simplifying the derivation of the optimal solution. Third, the miss distance constraint is no longer weighted within the objective function, filling a gap in the research on critical safe miss distance evasion regarding terminal constraints. The terminal constraints are more aligned with the actual requirements of critical miss distance. This research can enrich the evasion strategies and provide valuable insights for optimal maneuvering strategies under other complex model constraints.
4. Simulation and Analysis
4.1. Simulation Setup
Continuous updates to the guidance law during flight are essential due to the simplifications and assumptions inherent in the ZEM model. Given that the simulation time step is frequently shorter than the duration required for an iterative solution, the evader continues to employ the previous guidance law until the updates are finalized. Furthermore, as the distance diminishes, the angle between the evader’s velocity vector and the line-of-sight increases, leading to a heightened linearization error. Consequently, it is mandated that updates to the guidance law and control commands cease when the distance falls below . When is too large, it may cause the guidance law to stop updating prematurely. Conversely, if is too small, significant linearization errors may lead to an inaccurate guidance law. According to simulation results, a value of provides stable guidance and control performance for the evader.
The iteration initial values for the first computation of the guidance law are derived from
Section 3.3, whereas the initial values for each subsequent update adopt the calculated result of the preceding update. A concise overview of the simulation process is illustrated in
Figure 6, and the general parameters are detailed in
Table 1. As indicated in
Section 2.3, the critical safe miss distance synthesized in two directions is denoted as
. The computer hardware configuration comprises an Intel Core i7-13620H processor operating at 4.90 GHz. All results in this study are based on computational simulations and do not include experimental data.
In the simulation, the pursuer’s guidance law employs proportional navigation with bounded control:
where
is the effective navigation ratio.
4.2. Simulation of Critical Safe Miss Distance Evasion
First, simulations were conducted under two typical operating conditions, shown in
Table 2, with the effective navigation ratio of the pursuer set to 3. The simulation results are illustrated in
Figure 7,
Figure 8,
Figure 9 and
Figure 10. In
Figure 7 and
Figure 9, points
and
denote the initial positions of the evader and the pursuer, respectively. The miss distances for two conditions were 70.68 m and 71.37 m, with errors of −0.04% and 0.93% relative to the critical value, respectively. The energy consumption of the evader under two conditions is
and
, respectively.
Under both conditions, the miss distances are near the critical safe value, with Condition 2 demonstrating a greater miss distance and reduced energy consumption. This phenomenon is attributed to the initial ZEM of zero in Condition 1, while Condition 2 features a significant initial ZEM of 70.80 m. The absolute values of the evasion control for both conditions display a trend of initially increasing and subsequently decreasing. In Condition 1, when the pursuer’s maximum control commands are insufficient to reduce the ZEM to the critical safe value, the evasion control command becomes zero. In Condition 2, as the time-to-go decreases, the co-state variable in the pitch direction does not converge to zero, leading to a non-zero evasion control command. When the distance between the two parties is below the established threshold of 500 m, the control command remains constant. The average duration for the guidance law updates in two conditions is , with a maximum duration of 0.0096 s, which satisfies the requirement for rapid updates.
Monte Carlo simulations were conducted. The initial states were generated uniformly at random within the range specified in
Table 3. The random variables for the initial states include the relative positions of two parties and the direction of the evader’s velocity. The evader’s initial position was always located at the origin of the coordinate system, while the direction of pursuer’s initial velocity was oriented toward the evader, aligning with the initial line of sight. To evaluate the adaptability of the guidance law to different effective navigation ratios of pursuer, simulations were conducted 500 times for
and
, respectively. The simulation trajectories and statistical results are presented in
Figure 11. Due to the influence of gravity, the average positions at the interception moment for both parties are significantly lower than the origin. The statistics on the miss distance indicate that the average miss distances for
and
are 136.51 m and 116.37 m, respectively, with 12.4% and 32.0% of them falling below the critical value. The minimum miss distances are 70.45 m and 70.37 m, with relative errors to the critical value of −0.37% and −0.48%, respectively. The average energy consumption for the evader is 9.847 × 10
2 m
2·s
−3 and 2244 × 10
3 m
2·s
−3, while for the pursuer, it is
and
, respectively. In 1000 simulations, the average update time for the guidance law is 1.17 × 10
−5 s, with a maximum update time of 0.0088 s, which meets the requirements for rapid updates.
It can be observed that under random initial states, the average miss distance is greater when compared to . The average energy consumption for the evader is higher at ; however, the energy consumption for the pursuer is also greater. When , Case 1 of occurs more frequently, leading to more instances in which no iteration is required, resulting in a shorter average update time. Due to simplifications in the optimal control model, the proportion of miss distances below the critical safe value is relatively high. Nevertheless, for both effective navigation ratios, the relative errors of the minimum miss distances compared to the critical safe value are less than 1%, indicating that the guidance law for critical safe miss distance evasion is effective.
4.3. Comparison with Other Methods
To illustrate the effectiveness of the guidance law for critical safe miss distance evasion in reducing energy consumption, comparisons were made with the sinusoidal maneuver, spiral maneuver, and norm differential game maneuver [
22]. The sinusoidal and differential game maneuvers are applied in the yaw plane, with the control commands for the three methods detailed in
Table 4. Considering that some of the miss distances in
Section 4.2 fall below the critical value while the relative error does not exceed 1%, this section modifies the original
in the guidance law to
. Initial states were uniformly and randomly generated within the range specified in
Table 3, and 500 simulations were conducted for each of the four methods with
.
The simulation results are presented in
Table 5 and
Figure 12. In
Figure 12, the results of the critical safe miss distance maneuver exhibit a significant difference compared to other maneuvering methods. The primary reason for this difference is that the proposed guidance law has an objective function specifically aimed at achieving the critical safe miss distance. In contrast, the programmatic oscillatory maneuvers lack the objective functions, while the norm differential game maneuver is designed to maximize the miss distance.
Table 5 provides the statistics of the miss distances and energy consumption, including the mean value, relative standard deviation (RSD), and other relevant statistical metrics. In the context of anti-interception operations, evasion is deemed unsuccessful when the miss distance is less than the critical safe miss distance. In the results of the critical safe miss distance evasion, occurrences of miss distance below the critical value are no longer observed, indicating that the 1% adjustment to
is effective. Comparatively, among the four evaluated methods, the critical safe miss distance evasion exhibits the smallest average miss distance. Moreover, the critical safe miss distance evasion has no instances of failure, which is comparable to the differential game maneuver. The energy consumption of the critical safe miss distance evasion is also lower and significantly less than that of the other methods. Although the average miss distances for the sinusoidal and spiral maneuvers are larger, they exhibit relatively high RSDs and greater proportions of failure. This indicates that the miss distances for the programmatic maneuvers are more dependent on initial state values, exhibiting lower proactivity. When flight times are similar, the energy consumption of sinusoidal and spiral maneuvers shows a small variation after fixing the frequency and phase, resulting in lower RSDs in energy consumption. Since the control magnitude of the differential game maneuver is set at
, the differences in flight times are also minimal, leading to a lower RSD for energy consumption.
In summary, the proposed guidance law exhibits a greater probability of successful evasion compared to programmatic oscillatory maneuvers while incurring lower energy costs. Furthermore, in comparison to the norm differential game maneuver, it additionally achieves a reduction in energy consumption and effectively satisfies the requirements for critical safe miss distance evasion.
4.4. Evaluation of the Adaptability to Unpowered Aerial Vehicle
To reduce the mathematical and computational complexity, the proposed guidance law is formulated for the constant-speed vehicle. However, when the engagement time is short and the vehicle’s speed variation is relatively small, the guidance law could be adapted for the unpowered aerial vehicle. The aerodynamic characteristics and mass data for the CAV-H model are provided in reference [
23]. In the subsequent simulation, we consider a scenario in which both the pursuer and evader are CAV-H models, aiming to evaluate the adaptability of the guidance law to the more complex model. The specific procedure for applying the guidance law under the CAV-H model is outlined as follows:
Step 1: Determine the maximum control and based on the current speeds, maximum attack angles, and aerodynamic characteristics of the evader and pursuer.
Step 2: If the guidance law needs to be updated, substitute the current speeds of the evader and pursuer into and , respectively, and compute the required acceleration command for the evader.
Step 3: Based on the aerodynamic characteristics of CAV-H, calculate the attack angle and the bank angle from the acceleration command. Substitute the attack angle and bank angle into the CAV-H dynamic model and perform the simulation through numerical integration.
The effective navigation ratio of the pursuer is set to 4, and the initial altitude of the evader is set to 25 km. Under Condition 1 and Condition 2 provided in
Table 2, the four guidance laws in
Section 4.3 are applied in the simulations. The resulting flight trajectories and speed curves are shown in
Figure 13 and
Figure 14, while the miss distances and the evader’s terminal speeds are presented in
Table 6. For the unpowered vehicle, a larger acceleration in the objective function leads to a larger attack angle, which results in greater speed loss. Therefore, in this section, energy consumption is represented by the evader’s speed loss, defined as the difference between the initial and the terminal speeds. Under both conditions, the miss distance of the critical safe miss distance maneuver exceeds the critical value. The speed curves for the spiral maneuver and the differential game maneuver are similar, as both use maximum overload, resulting in a comparable aerodynamic drag. In Condition 1, the speed loss of the critical safe miss distance maneuver is only slightly lower than that of the sinusoidal maneuver. However, the miss distance of the sinusoidal maneuver is smaller than
, whereas the critical safe miss distance maneuver ensures safe evasion. In Condition 2, all the guidance laws achieve safe evasion, with the critical safe miss distance maneuver resulting in the smallest speed loss.
The results indicate that the critical safe miss distance maneuver can be effectively applied to the CAV-H model. In comparison with the programmatic maneuver and the differential game maneuver, the critical safe miss distance guidance law achieves a safe evasion with smaller speed losses in head-on intercepts.
5. Conclusions
This paper addresses the pursuit–evasion problem for constant-speed vehicles in three-dimensional space and proposes a guidance law for critical safe miss distance evasion. Simulation results validate the effectiveness of the guidance law. The main conclusions are as follows:
Through the state equation of the ZEM and the approximation of the disturbances experienced by the pursuer, an optimal control problem for critical safe miss distance evasion was established. Using the Maximum Principle, the optimal guidance law under bounded control and terminal constraints was derived. Furthermore, an iterative method was designed for solving co-state variables with the homotopy method and Newton iteration.
The simulation results demonstrate that, within a certain range of initial conditions, the proposed iterative method meets real-time requirements. Under head-on intercept conditions, the guidance law effectively achieves critical safe miss distance evasion and adapts to different effective navigation ratios of the pursuer. Compared to other methods, the critical safe miss distance evasion results in a lower probability of the miss distance being below the critical value and incurs a smaller energy cost. In head-on intercept scenarios, the proposed guidance law is applicable to unpowered vehicle models.
Future research will focus on comparing various guidance methods across a broader range of engagement configurations, such as non-head-on intercepts, and improving the feasibility of the proposed guidance law in practical engineering applications.