Optimal Guidance Law for Critical Safe Miss Distance Evasion

Wang, Chengze; Yan, Jiamin; Lyu, Rui; Liang, Zhuo; Chen, Yang

doi:10.3390/aerospace11121041

Open AccessArticle

Optimal Guidance Law for Critical Safe Miss Distance Evasion

by

Chengze Wang

^*,

Jiamin Yan

,

Rui Lyu

,

Zhuo Liang

and

Yang Chen

China Academy of Launch Vehicle Technology, Beijing 100076, China

^*

Author to whom correspondence should be addressed.

Aerospace 2024, 11(12), 1041; https://doi.org/10.3390/aerospace11121041

Submission received: 2 November 2024 / Revised: 11 December 2024 / Accepted: 18 December 2024 / Published: 19 December 2024

(This article belongs to the Section Aeronautics)

Download

Browse Figures

Versions Notes

Abstract

:

In pursuit–evasion scenarios, the pursuer typically possesses a lethal zone. If the evader effectively utilizes perceptual information, they can narrowly escape the lethal zone while minimizing energy consumption, thereby avoiding excessive and unnecessary maneuvers. Based on optimal control theory, we propose a guidance law for achieving critical safe miss distance evasion under bounded control. First, we establish the zero-effort miss (ZEM) state equation for the evader, while approximating disturbances from the pursuer. Next, we formulate an optimal control problem with energy consumption as the objective function and the ZEM at the terminal time as the terminal constraint. Subsequently, we design an iterative algorithm that combines the homotopy method and Newton’s iteration to solve the optimal control problem, applying Pontryagin’s Maximum Principle. The simulation results indicate that the designed iterative method converges effectively; through online updates, the proposed guidance law can successfully achieve critical safe miss distance evasion. Compared to programmatic maneuvering and norm differential game guidance law, this approach not only stabilizes the evader’s evasion capabilities but also significantly reduces energy consumption.

Keywords:

evasion maneuver; critical safe miss distance; optimal guidance law; Pontryagin’s maximum principle

1. Introduction

In certain evasive situations, a larger miss distance is not always preferable. An excessive miss distance may result in unnecessary energy waste and hinder course recovery. For instance, when an unmanned aerial vehicle (UAV) primarily tasked with engaging a target is evading an interceptor, completing the evasion with a large miss distance may result in insufficient energy to engage the subsequent target. Critical safe miss distance evasion refers to evasive action that ensures the miss distance exceeds a critical safe value while minimizing energy consumption. Consequently, critical safe miss distance evasion represents a more efficient maneuvering strategy [1].

Traditional maneuver evasion techniques primarily consist of programmatic maneuvers such as sinusoidal maneuvers [2] and spiral maneuvers [3], which exhibit relatively low levels of proactivity. With the advancements in optimal control theory, methods utilizing detected pursuer information for active evasion have become mainstream. These methods involve the application of Pontryagin’s Maximum Principle and differential game theory to derive analytical solutions for optimal guidance laws [4,5,6]. Recently, scholars have expanded and refined research on optimal maneuver evasion strategies by incorporating high-order complex models, conditions of incomplete information, and multi-agent cooperative evasion strategies. For instance, reference [7] investigated optimal maneuver evasion strategies by establishing a high-order guidance system state-space model for interceptors under proportional guidance control, and utilizing a formula for miss distance series. Reference [8] combined guidance laws derived from complete and incomplete information models using mixed-strategy game theory to propose a new adaptive weighted differential game guidance law. Reference [9] integrated the covariance matrix analysis of Kalman filtering into differential game theory to propose an orientation-driven guidance law. Additionally, references [10,11,12] developed maneuver evasion guidance laws for evaders in active defense scenarios using differential game theory. In the study of the one-evader two-pursuer game problem, the work in reference [13] is based on ideal dynamic characteristics, whereas reference [14] adopted first-order dynamic assumptions and bounded control. Furthermore, with the development of machine learning, intelligent evasion methods have been proposed, such as acquiring evasion guidance law through deep reinforcement learning [15] and employing machine learning to identify saddle-point solutions in nonlinear optimal control problems [16,17].

The aforementioned methods provide robust guidance laws for evasion in various scenarios. However, the objective function, which aims to maximize the miss distance, often results in excessive energy consumption and over-maneuvering issues. To address this, traditional solutions propose treating the control variables as process costs, which are subsequently weighted with terminal costs to form a quadratic differential game [18]. Nevertheless, quadratic differential games yield suboptimal solutions, as they do not consider the boundaries of control variables when solving for co-state variables, and the weighting parameters for the objective function necessitate manual tuning. Consequently, building on previous optimal control theories, several studies have explored optimal guidance laws under critical safe miss distance constraints. Reference [19] derived a solution for the critical safe miss distance evasion problem in a two-dimensional plane under bounded control. However, the miss distance constraint remains a soft constraint weighted within the objective function. Reference [1] employed a similar objective function to investigate three-dimensional critical safe miss distance guidance laws and optimized the selection of the miss distance setting in the objective function using a neural network surrogate model. Reference [20] simplified the evasion maneuver problem to the timing of maneuver selection and utilized LSTM networks to predict the intercept miss distance in real-time during flight, thereby determining the optimal timing for evasion. However, this method does not analyze its optimality.

As discussed above, the current research on critical safe miss distance evasion remains incomplete, particularly regarding the optimality of energy consumption. To address this gap, this study focuses on the maneuvering evasion problem in three-dimensional space, formulating a guidance law for critical safe miss distance evasion grounded in optimal control theory. Initially, the state equation for the zero-effort miss (ZEM) in three-dimensional space is established by treating the pursuer’s maneuvers as disturbances and estimating their values. Subsequently, an optimal control problem is formulated with the control cost defined as the objective function and the ZEM at the terminal time specified as the terminal constraint. The problem is subsequently solved using Pontryagin’s Maximum Principle and iterative algorithms. Finally, the effectiveness of the proposed method is validated through comparative simulations.

This paper presents three substantial and innovative contributions. First, the consideration of bounded control enhances the realism of the modeling process. The introduction of bounded control also complicates the optimal control problem, necessitating an iterative solution approach. Second, it introduces a maneuver estimation method for the pursuer, effectively transforming the game problem into a unilateral optimal control problem, thereby significantly simplifying the derivation of the optimal solution. Third, the miss distance constraint is no longer weighted within the objective function, filling a gap in the research on critical safe miss distance evasion regarding terminal constraints. The terminal constraints are more aligned with the actual requirements of critical miss distance. This research can enrich the evasion strategies and provide valuable insights for optimal maneuvering strategies under other complex model constraints.

2. Optimal Control Problem of Critical Safe Miss Distance Evasion

2.1. Dynamics of Pursuer and Evader

In this paper, both the pursuer and the evader are modeled with constant-speed, three-degree-of-freedom vehicle dynamics. As illustrated in Figure 1, the evader’s initial position is defined as the origin O, and the OX axis is oriented along the evader’s target direction within the horizontal plane. The OY axis is vertical and points upward, and the OZ axis completes the right-handed coordinate system. Thus, we have the following:

\{\begin{cases} {\dot{x}}_{i} = V_{i} \cos θ_{i} \cos σ_{i} \\ {\dot{y}}_{i} = V_{i} \sin θ_{i} \\ {\dot{z}}_{i} = - V_{i} \cos θ_{i} \sin σ_{i} \\ {\dot{θ}}_{i} = (a_{y i} - g \cos θ_{i}) / V_{i} \\ {\dot{σ}}_{i} = - a_{z i} / (V_{i} \cos θ_{i}) \end{cases}

(1)

where the subscripts

i = E, P

denote the evader and pursuer, respectively;

(x_{i}, y_{i}, z_{i})

represents the position;

V_{i}

is the speed;

θ_{i}

is the trajectory inclination angle;

σ_{i}

is the trajectory deflection angle; and

g

is the gravitational acceleration.

a_{y i}

and

a_{z i}

are the normal control accelerations that are perpendicular to the velocity vector, with

a_{y i}

lying in the plane containing the velocity vector and

a_{z i}

lying in the horizontal plane. Let

a_{i} = {[\begin{matrix} a_{y i} & a_{z i} \end{matrix}]}^{T}

, and the upper limit of control is

a_{i, \max}

. Equation (1) is based on the aircraft dynamic model presented in reference [1], with the gravitational acceleration and control acceleration separated to emphasize their independent effects on the dynamics.

In Figure 1,

q_{y}

and

q_{z}

are the line-of-sight angles in the pitch and yaw directions of the evader, respectively. In the figure, the directions of

q_{y}

,

q_{z}

,

θ_{E}

, and

σ_{P}

are positive, while the directions of

σ_{E}

and

θ_{P}

are negative. In this study, the following assumptions are established:

Assumption 1.

The initial relative motion between the pursuer and evader is characterized by a head-on interception, with the line-of-sight angle exhibiting minimal variation throughout most of the engagement process.

Assumption 2.

There is no control delay for either the pursuer or the evader, and both have ideal dynamic characteristics.

Assumption 3.

The evader is aware of the pursuer’s maximum control input and can obtain real-time information on both parties’ velocities and positions.

The head-on interception refers to the pursuer approaching the evader along the direction opposite to the velocity vector of the evader. This interception method facilitates the stable tracking of the evader by the pursuer and enables the interception of a higher-speed evader. Assumptions 1 and 2 simplify the analytical process of the model, ensuring that it can be effectively solved within the constraints of limited derivations and computational methods. Assumption 3 posits that, in engagement scenarios, both parties’ aircrafts are equipped with radar, a velocity detection system, and other relevant equipment.

2.2. State Equation for Zero-Effort Miss

As shown in Figure 2, the normal control acceleration is projected onto the initial line-of-sight coordinate system

{O - X}_{L} Y_{L} Z_{L}

, resulting in

[u_{x i} u_{y i} u_{z i}], i = E, P

. Let

r

denote the distance between the two entities, and

r = r_{E} - r_{P}

represent the distance vector. Let

y_{d}

and

z_{d}

denote the components of the distance vector along the

{OY}_{L}

and

{OZ}_{L}

axes, respectively. Then, we have the following:

\{\begin{cases} {\ddot{y}}_{d} = u_{y E} - u_{y P} \\ {\ddot{z}}_{d} = u_{z E} - u_{z P} \end{cases}

(2)

In calculating the time-to-go, it is assumed that the impact of the second derivative of distance on the result is relatively small, and the following typical approach is adopted for the estimation [10]:

t_{g} = - \frac{r}{\dot{r}}

(3)

In the

{OY}_{L}

direction, with

{[\begin{matrix} y_{d} & {\dot{y}}_{d} \end{matrix}]}^{T}

as the state variable, the state equation is established as follows:

[\begin{matrix} {\dot{y}}_{d} \\ {\ddot{y}}_{d} \end{matrix}] = [\begin{matrix} 0 & 1 \\ 0 & 0 \end{matrix}] [\begin{matrix} y_{d} \\ {\dot{y}}_{d} \end{matrix}] + [\begin{matrix} 0 \\ 1 \end{matrix}] u_{y E} - [\begin{matrix} 0 \\ 1 \end{matrix}] u_{y P}

(4)

At time

t

, the zero-input state transition matrix is introduced:

Φ (t_{g}, t) = \exp ([\begin{matrix} 0 & 1 \\ 0 & 0 \end{matrix}] (t_{g} - t)) = [\begin{matrix} 1 & t_{g} - t \\ 0 & 1 \end{matrix}]

(5)

Thus, the zero-effort miss in the

{OY}_{L}

direction at time

t

is obtained as follows:

Z_{y} (t) = [\begin{matrix} 1 & 0 \end{matrix}] Φ (t_{g}, t) [\begin{matrix} y_{d} \\ {\dot{y}}_{d} \end{matrix}] = y_{d} + {\dot{y}}_{d} (t_{g} - t)

(6)

Similarly, the zero-effort miss in the

O Z_{L}

direction is as follows:

Z_{z} (t) = [\begin{matrix} 1 & 0 \end{matrix}] Φ (t_{g}, t) [\begin{matrix} z_{d} \\ {\dot{z}}_{d} \end{matrix}] = z_{d} + {\dot{z}}_{d} (t_{g} - t)

(7)

Taking the derivatives of

Z_{y}

and

Z_{z}

, and combining the result with Equation (2), we obtain the following:

\{\begin{cases} {\dot{Z}}_{y} (t) = {\dot{y}}_{d} + {\ddot{y}}_{d} (t_{g} - t) - {\dot{y}}_{d} = (t_{g} - t) (u_{y E} - u_{y P}) \\ {\dot{Z}}_{z} (t) = {\dot{z}}_{d} + {\ddot{z}}_{d} (t_{g} - t) - {\dot{z}}_{d} = (t_{g} - t) (u_{z E} - u_{z P}) \end{cases}

(8)

According to the transformation relationship between the coordinate systems

O - XYZ

and

{O - X}_{L} Y_{L} Z_{L}

, we have the following:

[\begin{matrix} u_{y i} \\ u_{z i} \end{matrix}] = [\begin{matrix} 0 & 1 & 0 \\ 0 & 0 & 1 \end{matrix}] M (q_{y}, q_{z}) M^{- 1} (θ_{i}, σ_{i}) [\begin{matrix} 0 \\ a_{y i} \\ a_{z i} \end{matrix}], i = E, P

(9)

where the transformation matrix function

M

is given by the following:

M (ψ, ζ) = [\begin{matrix} \cos ψ \cos ζ & \sin ψ & - \cos ψ \sin ζ \\ - \sin ψ \cos ζ & \cos ψ & \sin ψ \sin ζ \\ \sin ζ & 0 & \cos ζ \end{matrix}]

(10)

Substituting the above expression into Equation (8), we obtain the following:

\{\begin{cases} {\dot{Z}}_{y} (t) = (t_{g} - t) (α_{11} a_{y E} + α_{12} a_{z E} + u_{y P}) \\ {\dot{Z}}_{z} (t) = (t_{g} - t) (α_{21} a_{y E} + α_{22} a_{z E} + u_{z P}) \end{cases}

(11)

where the coefficients of the state equation are as follows:

\{\begin{cases} α_{11} = \cos q_{y} \cos θ_{E} + \sin q_{y} \sin θ_{E} \cos (q_{z} - σ_{E}) \\ α_{12} = \sin q_{y} \sin (q_{z} - σ_{E}) \\ α_{21} = \sin θ_{E} \sin (σ_{E} - q_{z}) \\ α_{22} = \cos (q_{z} - σ_{E}) \end{cases}

(12)

Thus, we have obtained the state equation for the zero-effort miss.

2.3. Objective Function and Constraints

To transform the critical safe miss distance evasion problem into a unilateral optimal control problem, the ZEM caused by the maneuvers of the pursuer and evader is addressed separately. By integrating Equation (11), the miss distance at time

t_{g}

is obtained:

\{\begin{cases} Z_{y} (t_{g}) = Z_{y, 0} + \int_{0}^{t_{g}} (t_{g} - t) (α_{11} a_{y E} + α_{12} a_{z E}) d t + \int_{0}^{t_{g}} - (t_{g} - t) u_{y P} d t \\ Z_{z} (t_{g}) = Z_{z, 0} + \int_{0}^{t_{g}} (t_{g} - t) (α_{21} a_{y E} + α_{22} a_{z E}) d t + \int_{0}^{t_{g}} - (t_{g} - t) u_{z P} d t \end{cases}

(13)

The integral terms of the pursuer are considered as disturbances and are approximated using the following method. At

t = 0

, the pursuer’s strategy is assumed to minimize the absolute value of the initial ZEM by the terminal time. When

0.5 a_{P, \max} t_{g}^{2} \geq \sqrt{Z_{y, 0}^{2} + Z_{z, 0}^{2}}

, if

u_{y p} = 2 Z_{y, 0} / t_{g}^{2}

and

u_{z p} = 2 Z_{z, 0} / t_{g}^{2}

, according to Equation (13), it can be ensured that

\int_{0}^{t_{g}} - (t_{g} - t) u_{y P} d t = - Z_{y, 0}

and

\int_{0}^{t_{g}} - (t_{g} - t) u_{z P} d t = - Z_{z, 0}

, meaning that the pursuer can successfully eliminate the initial ZEM at the terminal time. When

0.5 a_{P, \max} t_{g}^{2} < \sqrt{Z_{y, 0}^{2} + Z_{z, 0}^{2}}

, the pursuer is unable to fully eliminate the initial ZEM. In this case, the maximum control

a_{P, \max}

is distributed proportionally between the two directions, i.e., setting

u_{y p} = a_{P, \max} Z_{y, 0} / \sqrt{Z_{y, 0}^{2} + Z_{z, 0}^{2}}

and

u_{z p} = a_{P, \max} Z_{z, 0} / \sqrt{Z_{y, 0}^{2} + Z_{z, 0}^{2}}

. Based on the above approximation method, the disturbance terms are calculated as follows:

I_{d y} = \int_{0}^{t_{g}} - (t_{g} - t) u_{y P} d t \approx - Z_{y, 0} sat (\frac{0.5 a_{P, \max} t_{g}^{2}}{\sqrt{Z_{y, 0}^{2} + Z_{z, 0}^{2}}})

(14)

I_{d z} = \int_{0}^{t_{g}} - (t_{g} - t) u_{z P} d t \approx - Z_{z, 0} sat (\frac{0.5 a_{P, \max} t_{g}^{2}}{\sqrt{Z_{y, 0}^{2} + Z_{z, 0}^{2}}})

(15)

where the saturation function sat is given by the following:

sat (\frac{a}{b}) ≜ \{\begin{array}{l} a / b & , |a| < |b| \\ sgn (a / b) & , |a| \geq |b| \end{array}

(16)

The basis for the above approximation is that, at

t = 0

, the pursuer’s strategy is assumed to minimize the absolute value of the initial ZEM by the terminal time. This approximation considers, on one hand, the limitations of the pursuer’s control and, on the other hand, the trend of the disturbance. Additionally, the approximation can be updated in real time during the guidance process.

Let the ZEM caused by the evader be treated as the new state variables in the optimal control problem, and the new state equation is as follows:

\{\begin{cases} {\dot{Z}}_{y E} (t) = (t_{g} - t) (α_{11} a_{y E} + α_{12} a_{z E}) \\ {\dot{Z}}_{z E} (t) = (t_{g} - t) (α_{21} a_{y E} + α_{22} a_{z E}) \end{cases}

(17)

To minimize energy consumption in control, the objective function is defined as follows:

J = \int_{0}^{t_{g}} \frac{1}{2} (a_{y E}^{2} + a_{z E}^{2}) d t

(18)

The energy is expressed using the energy concept commonly employed in optimal control theory [12], which is the integral of the square of the control acceleration with respect to time. Therefore, the unit of energy in this paper is

m^{2} \cdot s^{- 3}

.

By decoupling the constraints on the terminal ZEM in two directions, the evader’s requirements for the magnitudes of the terminal ZEM in both directions are established to be no less than

Z_{s e t}

. Consequently, the constraints are defined as follows:

\{\begin{cases} Z_{s e t} - |Z_{y} (t_{g})| \leq 0 \\ Z_{s e t} - |Z_{z} (t_{g})| \leq 0 \end{cases}

(19)

In the aforementioned approximate model, the constraints on

Z_{y} (t_{g})

and

Z_{z} (t_{g})

utilize absolute values. However, in practical applications, to increase the energy consumption of the pursuer, it is necessary for the signs of

Z_{y} (t_{g})

and

Z_{z} (t_{g})

to match those of

Z_{y, 0}

and

Z_{z, 0}

at the initial time, respectively. Then, we have the following:

\{\begin{cases} Z_{s e t} - sgn (Z_{y, 0}) Z_{y} (t_{g}) \leq 0 \\ Z_{s e t} - sgn (Z_{z, 0}) Z_{z} (t_{g}) \leq 0 \end{cases}

(20)

It is evident that when the initial zero-effort miss is zero, the sign function should not equal zero. Therefore, the sign function is defined as follows:

sgn (a) ≜ \{\begin{array}{l} 1 & , a \geq 0 \\ - 1 & , a < 0 \end{array}

(21)

Moreover, the control is bounded, with the constraints given by the following:

{‖a_{E}‖}_{2} \leq a_{E, \max}

(22)

The preceding discussion outlines the optimal control problem concerning critical safe miss distance evasion, where the constraint on the miss distance is not incorporated into the objective function but is instead treated as a hard constraint. The distinction between the maneuver for a critical safe miss and the traditional maneuver aimed at maximizing the miss distance is illustrated in Figure 3, where

Z_{\max}

represents the maximized miss distance and

Z_{s a f e}

denotes the critical safe miss distance.

3. Solution Method of the Optimal Guidance Law

3.1. Optimal Guidance Law

The aforementioned optimal control problem is solved using the Pontryagin’s Maximum Principle. The Hamiltonian is as follows:

H (a_{t y}, a_{t z}, λ_{1}, λ_{2}, t) = \frac{1}{2} (a_{t y}^{2} + a_{t z}^{2}) + λ_{1} (t_{g} - t) (α_{11} a_{t y} + α_{12} a_{z}) + λ_{2} (t_{g} - t) (α_{21} a_{t y} + α_{22} a_{z})

(23)

where

λ_{1}

and

λ_{2}

are co-state variables. According to the Maximum Principle, the adjoint equations are as follows:

\{\begin{cases} {\dot{λ}}_{1} = - \frac{\partial H}{\partial Z_{y}} = 0 \\ {\dot{λ}}_{2} = - \frac{\partial H}{\partial Z_{z}} = 0 \end{cases}

(24)

\{\begin{cases} λ_{1} (t_{g}) = - γ_{1} sgn (Z_{y, 0}) \\ λ_{2} (t_{g}) = - γ_{2} sgn (Z_{z, 0}) \end{cases}

(25)

where

γ_{1}

and

γ_{2}

are Lagrange multipliers. The transversality condition is as follows:

\{\begin{cases} γ_{1} [Z_{s e t} - sgn (Z_{y, 0}) Z_{y} (t_{g})] + γ_{2} [Z_{s e t} - sgn (Z_{z, 0}) Z_{z} (t_{g})] = 0 \\ Z_{s e t} - sgn (Z_{y, 0}) Z_{y} (t_{g}) \leq 0 \\ Z_{s e t} - sgn (Z_{z, 0}) Z_{z} (t_{g}) \leq 0 \\ γ_{1} \geq 0, γ_{2} \geq 0 \end{cases}

(26)

The extremum condition states that for any permissible

{a^{'}}_{y E}

and

{a^{'}}_{z E}

, the following is held:

H (a_{y E}, a_{z E}, λ_{1}, λ_{2}, t) \leq H ({a^{'}}_{y E}, {a^{'}}_{z E}, λ_{1}, λ_{2}, t)

(27)

By combining the quadratic form in Equation (23) with the control boundary constraint in Equation (22), the optimal control is derived as follows:

a_{E} = sat (\frac{a_{E, \max}}{(t_{g} - t) \sqrt{{(λ_{1} α_{11} + λ_{2} α_{21})}^{2} + {(λ_{1} α_{12} + λ_{2} α_{22})}^{2}}}) [\begin{matrix} (- λ_{1} α_{11} - λ_{2} α_{21}) (t_{g} - t) \\ (- λ_{1} α_{12} - λ_{2} α_{22}) (t_{g} - t) \end{matrix}]

(28)

Consequently, once the co-state variables are determined, the optimal control can be obtained, which necessitates jointly solving Equations (23)–(28).

3.2. Method for Solving Co-State Variables

As the co-state variables do not possess an analytical solution, a numerical solution method has been designed, consisting of the following steps.

Step 1: Assume

λ_{1} = λ_{2} = 0

and determine whether Equation (26) is satisfied.

According to Equation (25),

γ_{1} = γ_{2} = 0

, and the Hamiltonian is as follows:

H = \frac{1}{2} (a_{y E}^{2} + a_{z E}^{2})

(29)

Furthermore, according to Equation (28),

a_{y E} = a_{z E} = 0

; then, the following is obtained:

\{\begin{cases} Z_{y} (t_{g}) = Z_{y, 0} + I_{d y} \\ Z_{z} (t_{g}) = Z_{z, 0} + I_{d z} \end{cases}

(30)

Given that

γ_{1} = γ_{2} = 0

, the assumption holds true only if the following is true:

\{\begin{cases} Z_{s e t} - sgn (Z_{y, 0}) (Z_{y, 0} + I_{d y}) \leq 0 \\ Z_{s e t} - sgn (Z_{z, 0}) (Z_{z, 0} + I_{d z}) \leq 0 \end{cases}

(31)

Then, the co-states are obtained as

[\begin{matrix} λ_{1} & λ_{2} \end{matrix}] = [\begin{matrix} 0 & 0 \end{matrix}]

.

Step 2: If the assumption in Step 1 is not satisfied, we can further assume that

λ_{1} = 0

,

λ_{2} \neq 0

and determine whether Equation (26) is satisfied.

Based on the assumption, it follows that

γ_{1} = 0

,

γ_{2} \neq 0

. The first task is to determine whether

Z_{s e t} - sgn (Z_{z, 0}) (Z_{z, 0} + I_{d z}) < 0

is valid. If this condition is satisfied, then

γ_{2} = 0

, and the assumption

λ_{2} \neq 0

is not valid. Otherwise, the following equality constraint must be met:

Z_{s e t} - sgn (Z_{z, 0}) Z_{z} (t_{g}) = 0

(32)

For convenience in subsequent use, the expression above can be rewritten as follows:

Z_{z} (t_{g}) - sgn (Z_{z, 0}) Z_{s e t} = 0

(33)

After solving for

λ_{2}

from Equation (33), substitute

λ_{1} = 0

and the solution

λ_{2}^{*}

into the inequality:

Z_{s e t} - sgn (Z_{y, 0}) Z_{y} (t_{g}) \leq 0

(34)

If inequality (34) is satisfied, then the assumption is validated, resulting in the co-states

[\begin{matrix} λ_{1} & λ_{2} \end{matrix}] = [\begin{matrix} 0 & λ_{2}^{*} \end{matrix}]

.

Step 3: If the assumption in Step 2 is not satisfied, we can further assume that

λ_{2} = 0

,

λ_{1} \neq 0

and determine whether Equation (26) is satisfied.

Similarly, based on the assumption, it follows that

γ_{1} \neq 0

,

γ_{2} = 0

. The first task is to determine whether

Z_{s e t} - sgn (Z_{y, 0}) (Z_{y, 0} + I_{d z}) \leq 0

is valid. If this condition is met, then

γ_{1} = 0

, and the assumption

λ_{1} \neq 0

is not valid. Otherwise, the following equality constraint must be satisfied:

Z_{y} (t_{g}) - sgn (Z_{y, 0}) Z_{s e t} = 0

(35)

After solving for

λ_{1}

from Equation (35), substitute

λ_{2} = 0

and the solution

λ_{1}^{*}

into the inequality:

Z_{s e t} - sgn (Z_{z, 0}) Z_{z} (t_{g}) \leq 0

(36)

If inequality (36) is satisfied, then the assumption is validated, resulting in the co-states

[\begin{matrix} λ_{1} & λ_{2} \end{matrix}] = [\begin{matrix} λ_{1}^{*} & 0 \end{matrix}]

.

Step 4: If the assumption in Step 3 is not valid, then

λ_{1} \neq 0

,

λ_{2} \neq 0

. According to Equation (26), the following equality constraint must be satisfied:

\{\begin{cases} Z_{y} (t_{g}) - sgn (Z_{y, 0}) Z_{s e t} = 0 \\ Z_{z} (t_{g}) - sgn (Z_{z, 0}) Z_{s e t} = 0 \end{cases}

(37)

Values of

λ_{1}

and

λ_{2}

can be derived from Equation (37).

The steps outlined above detail the process for solving the co-state variables, as illustrated in Figure 4.

3.3. Iteration Method for Solving Nonlinear Equations

Equations (33), (35), and (37) are all nonlinear equations that necessitate iterative solutions. According to Equation (28),

{‖a_{E}‖}_{2}

exhibits two types of variation curves, with

t_{x}

in Figure 5 representing the inflection point of the control variable.

When solving Equation (33), the assumptions

λ_{1} = 0

and

λ_{2} \neq 0

are made. By combining Equations (13) and (28), the variation of

{‖a_{E}‖}_{2}

corresponds to Case 1 when the following is obtained:

|\frac{3 α_{22} [sgn (Z_{z, 0}) Z_{s e t} - Z_{z, 0} - I_{d z}]}{t_{g}^{2} (- α_{21}^{2} - α_{22}^{2})}| \leq a_{t, \max}

(38)

At this point,

λ_{2}

does not require iterative solving and is expressed as follows:

λ_{2} = \frac{3 [sgn (Z_{z, 0}) Z_{s e t} - Z_{z, 0} - I_{d z}]}{t_{g}^{3} (- α_{21}^{2} - α_{22}^{2})}

(39)

For Case 2, the nonlinear equation required for the iteration process in Equation (33) is given by the following:

G_{2} (λ_{2}) = Z_{z, 0} + I_{d z} + I_{h} (α_{21}, λ_{2}) + I_{h} (α_{22}, λ_{2}) - sgn (Z_{z, 0}) Z_{s e t} = 0

(40)

where

I_{h} (α, λ)

is defined as follows:

I_{h} (α, λ) ≜ \{\begin{array}{l} - t_{g}^{3} α^{2} λ / 3 & |- λ α t_{g}| \leq a_{E, \max} \\ sgn (- λ α) [α (2 t_{g} t_{x} - t_{x}^{2}) a_{E, \max} / 2] - {(t_{g} - t_{x})}^{3} α^{2} λ / 3 & |- λ α t_{g}| > a_{E, \max} \end{array}

(41)

and

t_{x}

can be calculated by the following:

t_{x} = t_{g} - \frac{a_{E, \max}}{|- λ α|}

(42)

Similarly, for Equation (35), the variation of

{‖a_{E}‖}_{2}

corresponds to Case 1 when the following is obtained:

|\frac{3 α_{11} [sgn (Z_{y, 0}) Z_{s e t} - Z_{y, 0} - I_{d y}]}{t_{g}^{2} (- α_{11}^{2} - α_{12}^{2})}| \leq a_{E, \max}

(43)

At this point,

λ_{1}

does not require iterative solving, leading to the following:

λ_{1} = \frac{3 [sgn (Z_{y, 0}) Z_{s e t} - Z_{y, 0} - I_{d y}]}{t_{g}^{3} (- α_{11}^{2} - α_{12}^{2})}

(44)

For Case 2, the nonlinear equation required for the iteration process in Equation (35) is given by the following:

G_{1} (λ_{1}) = Z_{y, 0} + I_{d y} + I_{h} (α_{11}, λ_{1}) + I_{h} (α_{12}, λ_{1}) - sgn (Z_{y, 0}) Z_{s e t} = 0

(45)

Combining with Equation (28), the system of nonlinear equations required for the iteration process in Equation (37) is given by:

F (λ_{1}, λ_{2}) = [\begin{matrix} Z_{y, 0} + I_{d y} + I_{y} (λ_{1}, λ_{2}) - sgn (Z_{y, 0}) Z_{s e t} \\ Z_{z, 0} + I_{d z} + I_{z} (λ_{1}, λ_{2}) - sgn (Z_{z, 0}) Z_{s e t} \end{matrix}] = [\begin{matrix} 0 \\ 0 \end{matrix}]

(46)

where

\{\begin{cases} I_{y} (λ_{1}, λ_{2}) = [(- α_{11}^{2} - α_{12}^{2}) λ_{1} + (- α_{11} α_{21} - α_{12} α_{22}) λ_{2}] W (λ_{1}, λ_{2}) \\ I_{z} (λ_{1}, λ_{2}) = [(- α_{11} α_{21} - α_{12} α_{22}) λ_{1} + (- α_{21}^{2} - α_{22}^{2}) λ_{2}] W (λ_{1}, λ_{2}) \end{cases}

(47)

For Case 1, when

t_{g} \sqrt{{(λ_{1} α_{11} + λ_{2} α_{21})}^{2} + {(λ_{1} α_{12} + λ_{2} α_{22})}^{2}} \leq a_{E, \max}

, the following is obtained:

W (λ_{1}, λ_{2}) = t_{g}^{3} / 3

(48)

In Case 1, Equation (46) is a linear equation in terms of

λ = {[\begin{matrix} λ_{1} & λ_{2} \end{matrix}]}^{T}

, and it can be solved analytically to yield the following:

λ = {[\begin{array}{l} α_{11}^{2} + α_{12}^{2} & α_{11} α_{21} + α_{12} α_{22} \\ α_{11} α_{21} + α_{12} α_{22} & α_{21}^{2} + α_{22}^{2} \end{array}]}^{- 1} [\begin{matrix} Z_{y, 0} + I_{d y} - sgn (Z_{y, 0}) Z_{s e t} \\ Z_{z, 0} + I_{d z} - sgn (Z_{z, 0}) Z_{s e t} \end{matrix}]

(49)

For Case 2, when

t_{g} \sqrt{{(λ_{1} α_{11} + λ_{2} α_{21})}^{2} + {(λ_{1} α_{12} + λ_{2} α_{22})}^{2}} \leq a_{E, \max}

, the following is obtained:

W (λ_{1}, λ_{2}) = \frac{a_{E, \max} (2 t_{g} t_{x} - t_{x}^{2})}{2 \sqrt{{(λ_{1} α_{11} + λ_{2} α_{21})}^{2} + {(λ_{1} α_{12} + λ_{2} α_{22})}^{2}}} + \frac{{(t_{g} - t_{x})}^{3}}{3}

(50)

where

t_{x} = t_{g} - \frac{a_{E, \max}}{\sqrt{{(λ_{1} α_{11} + λ_{2} α_{21})}^{2} + {(λ_{1} α_{12} + λ_{2} α_{22})}^{2}}}

(51)

Equations (40) and (45) each possess a single independent variable and are relatively straightforward in form, facilitating the identification of appropriate initial values. Consequently, the Newton iteration method is employed as follows:

λ_{i}^{(k + 1)} = λ_{i}^{(k)} - \frac{G_{i} (λ_{i}^{(k)})}{{\dot{G}}_{i} (λ_{i}^{(k)})}, k = 0, 1, …, N_{n} - 1; i = 1, 2

(52)

where

N_{n}

denotes the upper limit of the Newton iteration. Let the error tolerance be

ε

, and the convergence criterion for iteration process is as follows:

|λ_{i}^{(k + 1)} - λ_{i}^{(k)}| \leq ε, i = 1, 2

(53)

To ensure a suitable initial value for the Newton iteration, we assume that

q_{z}

and

σ_{E}

differ only slightly. Consequently,

α_{12}

and

α_{21}

are small quantities that can be disregarded, enabling an analytical approximation of

λ

:

λ_{1}^{(0)} = - sgn (Z_{y, 0}) {[\frac{a_{t, \max}^{3} sgn (Z_{y, 0})}{6 α_{11} (Z_{y, 0} + I_{d y} + sgn (Z_{y, 0}) (α_{11} a_{t, \max} t_{g}^{2} / 2 - Z_{s e t}))}]}^{\frac{1}{2}}

(54)

λ_{2}^{(0)} = - sgn (Z_{z, 0}) {[\frac{a_{t, \max}^{3} sgn (Z_{z, 0})}{6 α_{22} (Z_{z, 0} + I_{d z} + sgn (Z_{z, 0}) (α_{22} a_{t, \max} t_{g}^{2} / 2 - Z_{s e t}))}]}^{\frac{1}{2}}

(55)

λ^{(0)} = {[λ_{1}^{(0)} λ_{2}^{(0)}]}^{T}

represent the initial value for the iteration of Equation (52).

To solve Equation (46), a preliminary value is initially calculated based on Equation (49). If the preliminary value satisfies the inequality for Case 1, then

λ = λ^{(0)}

. If the preliminary value does not satisfy Case 1, the iteration solution is required. Due to the coupling between

λ_{1}

and

λ_{2}

, the nonlinear equations become more complex, necessitating the adoption of the Newton–homotopy iteration method [21] for the solution:

\{\begin{cases} λ^{(k + 1)} = λ^{(k)} - {[\nabla F (λ^{(k)})]}^{- 1} [F (λ^{(k)}) + (\frac{k}{N_{h}} - 1) F (λ^{(0)})], k = 0, 1, …, N_{h} - 1 \\ λ^{(k + 1)} = λ^{(k)} - {[\nabla F (λ^{(k)})]}^{- 1} F (λ^{(k)}), k = N_{h}, N_{h} + 1, …, N_{h} + N_{n} - 1 \end{cases}

(56)

where

N_{h}

is the number of homotopy iterations and

N_{n}

is the upper limit for the Newton iteration. The homotopy iteration method demonstrates strong adaptability to initial values, so the purpose of

N_{h}

iterations is to obtain a suitable initial value for the subsequent Newton iteration. The computed

λ^{(0)}

serves as the initial value for homotopy iteration, and the convergence criterion for Newton iteration is given by the following:

{‖λ^{(k + 1)} - λ^{(k)}‖}_{2} \leq ε

(57)

In summary, once the solution for

λ

is complete, the optimal evasion guidance law can be derived from Equation (28).

4. Simulation and Analysis

4.1. Simulation Setup

Continuous updates to the guidance law during flight are essential due to the simplifications and assumptions inherent in the ZEM model. Given that the simulation time step is frequently shorter than the duration required for an iterative solution, the evader continues to employ the previous guidance law until the updates are finalized. Furthermore, as the distance diminishes, the angle between the evader’s velocity vector and the line-of-sight increases, leading to a heightened linearization error. Consequently, it is mandated that updates to the guidance law and control commands cease when the distance falls below

r_{C}

. When

r_{C}

is too large, it may cause the guidance law to stop updating prematurely. Conversely, if

r_{C}

is too small, significant linearization errors may lead to an inaccurate guidance law. According to simulation results, a value of

r_{C} = 500 m

provides stable guidance and control performance for the evader.

The iteration initial values for the first computation of the guidance law are derived from Section 3.3, whereas the initial values for each subsequent update adopt the calculated result of the preceding update. A concise overview of the simulation process is illustrated in Figure 6, and the general parameters are detailed in Table 1. As indicated in Section 2.3, the critical safe miss distance synthesized in two directions is denoted as

Z_{s a f e} = \sqrt{2} Z_{s e t}

. The computer hardware configuration comprises an Intel Core i7-13620H processor operating at 4.90 GHz. All results in this study are based on computational simulations and do not include experimental data.

In the simulation, the pursuer’s guidance law employs proportional navigation with bounded control:

[\begin{matrix} a_{y P} \\ a_{z P} \end{matrix}] = [\begin{matrix} k_{P} V_{P} {\dot{q}}_{y} \\ k_{P} V_{P} {\dot{q}}_{z} \end{matrix}] sat (\frac{a_{P, \max}}{k_{P} V_{P} \sqrt{{\dot{q}}_{y}^{2} + {\dot{q}}_{z}^{2}}})

(58)

where

k_{P}

is the effective navigation ratio.

4.2. Simulation of Critical Safe Miss Distance Evasion

First, simulations were conducted under two typical operating conditions, shown in Table 2, with the effective navigation ratio of the pursuer set to 3. The simulation results are illustrated in Figure 7, Figure 8, Figure 9 and Figure 10. In Figure 7 and Figure 9, points

E_{0}

and

P_{0}

denote the initial positions of the evader and the pursuer, respectively. The miss distances for two conditions were 70.68 m and 71.37 m, with errors of −0.04% and 0.93% relative to the critical value, respectively. The energy consumption of the evader under two conditions is

4.787 \times 10^{3} m^{2} \cdot s^{- 3}

and

8.467 \times 10^{2} m^{2} \cdot s^{- 3}

, respectively.

Under both conditions, the miss distances are near the critical safe value, with Condition 2 demonstrating a greater miss distance and reduced energy consumption. This phenomenon is attributed to the initial ZEM of zero in Condition 1, while Condition 2 features a significant initial ZEM of 70.80 m. The absolute values of the evasion control for both conditions display a trend of initially increasing and subsequently decreasing. In Condition 1, when the pursuer’s maximum control commands are insufficient to reduce the ZEM to the critical safe value, the evasion control command becomes zero. In Condition 2, as the time-to-go decreases, the co-state variable in the pitch direction does not converge to zero, leading to a non-zero evasion control command. When the distance between the two parties is below the established threshold of 500 m, the control command remains constant. The average duration for the guidance law updates in two conditions is

3.68 \times 10^{- 5} s

, with a maximum duration of 0.0096 s, which satisfies the requirement for rapid updates.

Monte Carlo simulations were conducted. The initial states were generated uniformly at random within the range specified in Table 3. The random variables for the initial states include the relative positions of two parties and the direction of the evader’s velocity. The evader’s initial position was always located at the origin of the coordinate system, while the direction of pursuer’s initial velocity was oriented toward the evader, aligning with the initial line of sight. To evaluate the adaptability of the guidance law to different effective navigation ratios of pursuer, simulations were conducted 500 times for

k_{p} = 3

and

k_{p} = 4

, respectively. The simulation trajectories and statistical results are presented in Figure 11. Due to the influence of gravity, the average positions at the interception moment for both parties are significantly lower than the origin. The statistics on the miss distance indicate that the average miss distances for

k_{p} = 3

and

k_{p} = 4

are 136.51 m and 116.37 m, respectively, with 12.4% and 32.0% of them falling below the critical value. The minimum miss distances are 70.45 m and 70.37 m, with relative errors to the critical value of −0.37% and −0.48%, respectively. The average energy consumption for the evader is 9.847 × 10² m²·s⁻³ and 2244 × 10³ m²·s⁻³, while for the pursuer, it is

2.435 \times 10^{4} m^{2} \cdot s^{- 3}

and

2.627 \times 10^{4} m^{2} \cdot s^{- 3}

, respectively. In 1000 simulations, the average update time for the guidance law is 1.17 × 10⁻⁵ s, with a maximum update time of 0.0088 s, which meets the requirements for rapid updates.

It can be observed that under random initial states, the average miss distance is greater when

k_{p} = 3

compared to

k_{p} = 4

. The average energy consumption for the evader is higher at

k_{p} = 4

; however, the energy consumption for the pursuer is also greater. When

k_{p} = 3

, Case 1 of

{‖a_{E}‖}_{2}

occurs more frequently, leading to more instances in which no iteration is required, resulting in a shorter average update time. Due to simplifications in the optimal control model, the proportion of miss distances below the critical safe value is relatively high. Nevertheless, for both effective navigation ratios, the relative errors of the minimum miss distances compared to the critical safe value are less than 1%, indicating that the guidance law for critical safe miss distance evasion is effective.

4.3. Comparison with Other Methods

To illustrate the effectiveness of the guidance law for critical safe miss distance evasion in reducing energy consumption, comparisons were made with the sinusoidal maneuver, spiral maneuver, and norm differential game maneuver [22]. The sinusoidal and differential game maneuvers are applied in the yaw plane, with the control commands for the three methods detailed in Table 4. Considering that some of the miss distances in Section 4.2 fall below the critical value while the relative error does not exceed 1%, this section modifies the original

Z_{s e t} = 50 m

in the guidance law to

{Z^{'}}_{s e t} = 1.01 \times Z_{s e t}

. Initial states were uniformly and randomly generated within the range specified in Table 3, and 500 simulations were conducted for each of the four methods with

k_{p} = 3

.

a_{y E}

a_{E, \max} \cos (π t / 4)

a_{z E}

a_{E, \max} \sin (π t / 4)

a_{E, \max} \sin (π t / 4)

a_{E, \max} sgn (Z_{z} (t))

The simulation results are presented in Table 5 and Figure 12. In Figure 12, the results of the critical safe miss distance maneuver exhibit a significant difference compared to other maneuvering methods. The primary reason for this difference is that the proposed guidance law has an objective function specifically aimed at achieving the critical safe miss distance. In contrast, the programmatic oscillatory maneuvers lack the objective functions, while the norm differential game maneuver is designed to maximize the miss distance. Table 5 provides the statistics of the miss distances and energy consumption, including the mean value, relative standard deviation (RSD), and other relevant statistical metrics. In the context of anti-interception operations, evasion is deemed unsuccessful when the miss distance is less than the critical safe miss distance. In the results of the critical safe miss distance evasion, occurrences of miss distance below the critical value are no longer observed, indicating that the 1% adjustment to

Z_{s e t}

is effective. Comparatively, among the four evaluated methods, the critical safe miss distance evasion exhibits the smallest average miss distance. Moreover, the critical safe miss distance evasion has no instances of failure, which is comparable to the differential game maneuver. The energy consumption of the critical safe miss distance evasion is also lower and significantly less than that of the other methods. Although the average miss distances for the sinusoidal and spiral maneuvers are larger, they exhibit relatively high RSDs and greater proportions of failure. This indicates that the miss distances for the programmatic maneuvers are more dependent on initial state values, exhibiting lower proactivity. When flight times are similar, the energy consumption of sinusoidal and spiral maneuvers shows a small variation after fixing the frequency and phase, resulting in lower RSDs in energy consumption. Since the control magnitude of the differential game maneuver is set at

a_{E, \max}

, the differences in flight times are also minimal, leading to a lower RSD for energy consumption.

In summary, the proposed guidance law exhibits a greater probability of successful evasion compared to programmatic oscillatory maneuvers while incurring lower energy costs. Furthermore, in comparison to the norm differential game maneuver, it additionally achieves a reduction in energy consumption and effectively satisfies the requirements for critical safe miss distance evasion.

4.4. Evaluation of the Adaptability to Unpowered Aerial Vehicle

To reduce the mathematical and computational complexity, the proposed guidance law is formulated for the constant-speed vehicle. However, when the engagement time is short and the vehicle’s speed variation is relatively small, the guidance law could be adapted for the unpowered aerial vehicle. The aerodynamic characteristics and mass data for the CAV-H model are provided in reference [23]. In the subsequent simulation, we consider a scenario in which both the pursuer and evader are CAV-H models, aiming to evaluate the adaptability of the guidance law to the more complex model. The specific procedure for applying the guidance law under the CAV-H model is outlined as follows:

Step 1: Determine the maximum control

a_{E, \max}

and

a_{P, \max}

based on the current speeds, maximum attack angles, and aerodynamic characteristics of the evader and pursuer.

Step 2: If the guidance law needs to be updated, substitute the current speeds of the evader and pursuer into

V_{E}

and

V_{P}

, respectively, and compute the required acceleration command for the evader.

Step 3: Based on the aerodynamic characteristics of CAV-H, calculate the attack angle

α_{E}

and the bank angle

ν_{E}

from the acceleration command. Substitute the attack angle and bank angle into the CAV-H dynamic model and perform the simulation through numerical integration.

The effective navigation ratio of the pursuer is set to 4, and the initial altitude of the evader is set to 25 km. Under Condition 1 and Condition 2 provided in Table 2, the four guidance laws in Section 4.3 are applied in the simulations. The resulting flight trajectories and speed curves are shown in Figure 13 and Figure 14, while the miss distances and the evader’s terminal speeds are presented in Table 6. For the unpowered vehicle, a larger acceleration in the objective function leads to a larger attack angle, which results in greater speed loss. Therefore, in this section, energy consumption is represented by the evader’s speed loss, defined as the difference between the initial and the terminal speeds. Under both conditions, the miss distance of the critical safe miss distance maneuver exceeds the critical value. The speed curves for the spiral maneuver and the differential game maneuver are similar, as both use maximum overload, resulting in a comparable aerodynamic drag. In Condition 1, the speed loss of the critical safe miss distance maneuver is only slightly lower than that of the sinusoidal maneuver. However, the miss distance of the sinusoidal maneuver is smaller than

Z_{s a f e}

, whereas the critical safe miss distance maneuver ensures safe evasion. In Condition 2, all the guidance laws achieve safe evasion, with the critical safe miss distance maneuver resulting in the smallest speed loss.

The results indicate that the critical safe miss distance maneuver can be effectively applied to the CAV-H model. In comparison with the programmatic maneuver and the differential game maneuver, the critical safe miss distance guidance law achieves a safe evasion with smaller speed losses in head-on intercepts.

5. Conclusions

This paper addresses the pursuit–evasion problem for constant-speed vehicles in three-dimensional space and proposes a guidance law for critical safe miss distance evasion. Simulation results validate the effectiveness of the guidance law. The main conclusions are as follows:

Through the state equation of the ZEM and the approximation of the disturbances experienced by the pursuer, an optimal control problem for critical safe miss distance evasion was established. Using the Maximum Principle, the optimal guidance law under bounded control and terminal constraints was derived. Furthermore, an iterative method was designed for solving co-state variables with the homotopy method and Newton iteration.

The simulation results demonstrate that, within a certain range of initial conditions, the proposed iterative method meets real-time requirements. Under head-on intercept conditions, the guidance law effectively achieves critical safe miss distance evasion and adapts to different effective navigation ratios of the pursuer. Compared to other methods, the critical safe miss distance evasion results in a lower probability of the miss distance being below the critical value and incurs a smaller energy cost. In head-on intercept scenarios, the proposed guidance law is applicable to unpowered vehicle models.

Future research will focus on comparing various guidance methods across a broader range of engagement configurations, such as non-head-on intercepts, and improving the feasibility of the proposed guidance law in practical engineering applications.

Author Contributions

Conceptualization, C.W., J.Y. and R.L.; methodology, C.W., J.Y. and R.L.; software, C.W.; validation, C.W. and J.Y.; formal analysis, C.W. and R.L.; investigation, R.L.; resources, J.Y.; data curation, Z.L.; writing—original draft preparation, C.W.; writing—review and editing, Y.C.; visualization, Y.C.; supervision, Z.L.; project administration, Z.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in the study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

Yu, X.; Wang, X.; Lin, H. Optimal penetration guidance law with controllable missile escape distance. J. Astronaut. 2023, 44, 1053–1063. [Google Scholar]
Zarchan, P. Proportional Navigation and Weaving Targets. J. Guid. Control Dyn. 1995, 18, 969–974. [Google Scholar] [CrossRef]
Luo, W.; Lei, G.; Lai, C.; Wang, H. Research of Integrated Spiral Maneuvering and Guidance Based on Virtual Target. J. Eng. Res. 2024. Advanced online publication. [Google Scholar] [CrossRef]
Shinar, J.; Rotsztein, Y.; Bezner, E. Analysis of Three-Dimensional Optimal Evasion with Linearized Kinematics. J. Guid. Control 1979, 2, 353–360. [Google Scholar] [CrossRef]
Gutman, S.; Esh, M.; Gefen, M. Simple Linear Pursuit-Evasion Games. Comput. Math. Appl. 1987, 13, 83–95. [Google Scholar] [CrossRef]
Anderson, G.M. Comparison of Optimal Control and Differential Game Intercept Missile Guidance Laws. J. Guid. Control 1981, 4, 109–115. [Google Scholar] [CrossRef]
Wang, Y.; Zhou, T.; Chen, W.; He, T. Optimal maneuver penetration strategy based on power series solution of miss distance. J. Beijing Univ. Aeronaut. Astronaut. 2020, 46, 159–169. [Google Scholar]
Zhang, P.; Fang, Y.; Zhang, F.; Xiao, B.; Hu, S.; Zong, S. An Adaptive Weighted Differential Game Guidance Law. Chin. J. Aeronaut. 2012, 25, 739–746. [Google Scholar] [CrossRef]
Battistini, S.; Shima, T. Differential Games Missile Guidance with Bearings-Only Measurements. IEEE Trans. Aerosp. Electron. Syst. 2014, 50, 2906–2915. [Google Scholar] [CrossRef]
Sun, Q.; Zhang, C.; Liu, N.; Zhou, W.; Qi, N. Guidance Laws for Attacking Defended Target. Chin. J. Aeronaut. 2019, 32, 2337–2353. [Google Scholar] [CrossRef]
Liu, F.; Dong, X.; Li, Q.; Ren, Z. Cooperative Differential Games Guidance Laws for Multiple Attackers against an Active Defense Target. Chin. J. Aeronaut. 2022, 35, 374–389. [Google Scholar] [CrossRef]
Liang, H.; Li, Z.; Wu, J.; Zheng, Y.; Chu, H.; Wang, J. Optimal Guidance Laws for a Hypersonic Multiplayer Pursuit-Evasion Game Based on a Differential Game Strategy. Aerospace 2022, 9, 97. [Google Scholar] [CrossRef]
Zhao, S.; Zhang, H.; Lyu, R.; Yang, J.; Xue, C. Optimal avoidance strategy based on nonlinear approximate analytic solution of non-cooperative differential game. Aeronaut. J. 2024, 128, 2906–2923. [Google Scholar] [CrossRef]
Hayoun, S.Y.; Shima, T. A Two-on-One Linear Pursuit–Evasion Game with Bounded Controls. J. Optim. Theory Appl. 2017, 174, 837–857. [Google Scholar] [CrossRef]
Gao, M.; Yan, T.; Li, Q.; Fu, W.; Zhang, J. Intelligent Pursuit–Evasion Game Based on Deep Reinforcement Learning for Hypersonic Vehicles. Aerospace 2023, 10, 86. [Google Scholar] [CrossRef]
Peng, C.; Ma, J.; Liu, X. An Online Data Driven Actor-Critic-Disturbance Guidance Law for Missile-Target Interception with Input Constraints. Chin. J. Aeronaut. 2022, 35, 144–156. [Google Scholar] [CrossRef]
Kartal, Y.; Subbarao, K.; Dogan, A.; Lewis, F. Optimal Game Theoretic Solution of the Pursuit-evasion Intercept Problem Using On-policy Reinforcement Learning. Int. J. Robust Nonlinear Control 2021, 31, 7886–7903. [Google Scholar] [CrossRef]
Turetsky, V.; Shinar, J. Missile Guidance Laws Based on Pursuit–Evasion Game Formulations. Automatica 2003, 39, 607–618. [Google Scholar] [CrossRef]
Yan, T.; Cai, Y.; Xu, B. Evasion Guidance Algorithms for Air-Breathing Hypersonic Vehicles in Three-Player Pursuit-Evasion Games. Chin. J. Aeronaut. 2020, 33, 3423–3436. [Google Scholar] [CrossRef]
Chen, S.; Yan, J.; Pu, K. Anti-intercept maneuver method of vehicle based on prediction of miss distance. Syst. Eng. Electron. 2023, 45, 2922–2930. [Google Scholar]
Deuflhard, P. Newton Methods for Nonlinear Problems; Spinger: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
Gutman, S. On Optimal Guidance for Homing Missiles. J. Guid. Control 1979, 2, 296–300. [Google Scholar] [CrossRef]
Richie, G. The Common Aero Vehicle–Space delivery system of the future. In Proceedings of the Space Technology Conference and Exposition, Albuquerque, NM, USA, 28–30 September 1999. [Google Scholar]

Figure 1. Relative motion diagram of the pursuer and evader.

Figure 2. Projection of control acceleration in initial line-of-sight coordinate system.

Figure 3. Schematic diagram of critical safe miss distance evasion.

Figure 4. Process for solving co-state variables.

Figure 5. Two variation curves of

{‖a_{E}‖}_{2}

.

Figure 5. Two variation curves of

{‖a_{E}‖}_{2}

.

Figure 6. Simulation flow chart.

Figure 7. Trajectory and co-state variables in Condition 1. (a) The trajectory; (b) the co-state variables.

Figure 8. Evasion and pursuit control commands in Condition 1. (a) a_y; (b) a_z.

Figure 9. Trajectory and co-state variables in Condition 2. (a) The trajectory; (b) the co-state variables.

Figure 10. Evasion and pursuit control commands in Condition 2. (a) a_y; (b) a_z.

Figure 11. Trajectories and statistical results of guidance law with critical safe miss distance. (a) The trajectories; (b) the statistical results.

Figure 12. Statistics of miss distances and energy consumption for different methods. (a) The miss distances; (b) the energy consumption.

Figure 13. Simulation results of the CAV-H model under Condition 1. (a) The trajectories; (b) the speed curves.

Figure 14. Simulation results of the CAV-H model under Condition 2. (a) The trajectories; (b) the speed curves.

Table 1. General parameters used in simulation.

Time Step	$Z_{s e t}$	$N_{n}$	$N_{h}$	$ε$	$V_{E}$	$V_{P}$	$a_{E, \max}$	$a_{P, \max}$
0.001 s	50 m	50	100	0.001	2 km/s	2 km/s	10 g	12 g

Table 2. Initial states under typical conditions.

	Condition 1	Condition 2
$[x_{E} (m) y_{E} (m) z_{E} (m) θ_{E} (rad) σ_{E} (rad)]$	$[0 0 0 0 0]$	$[0 0 0 0 0]$
$[x_{P} (m) y_{P} (m) z_{P} (m) θ_{P} (rad) σ_{P} (rad)]$	$[20000 0 0 0 π]$	$[20000 1000 1000 - 0.0499 3.0916]$

Table 3. Range of initial states distributions in Monte Carlo simulations.

	θ_E(rad)	σ_E(rad)	x_P(km)	y_P(km)	z_P(km)
Range	[−0.1,0.1]	[−0.1,0.1]	[18,22]	[−2,2]	[−2,2]

Table 4. Control commands for three methods.

Control Variable	Sinusoidal Maneuver	Spiral Maneuver	Norm Differential Game
$a_{y E}$	0	$a_{E, \max} \cos (π t / 4)$	0
$a_{z E}$	$a_{E, \max} \sin (π t / 4)$	$a_{E, \max} \sin (π t / 4)$	$a_{E, \max} sgn (Z_{z} (t))$

Table 5. Statistics of miss distance and energy consumption for different methods.

Maneuvering Method	Average Miss (m)	Minimum Miss (m)	RSD of Miss	Failure Proportion	Average Energy Consumption (m² s⁻³)	RSD of Energy Consumption
Critical safe miss maneuver	129.70	71.14	82.2%	0	$9.368 \times 10^{2}$	108.2%
Sinusoidal maneuver	295.52	0.36	90.8%	22.8%	$1.081 \times 10^{4}$	7.3%
Spiral maneuver	316.76	4.63	85.2%	15.2%	$2.419 \times 10^{4}$	5.8%
Norm differential	870.74	383.02	35.3%	0	$2.443 \times 10^{4}$	5.9%

Table 6. Simulation results based on the CAV-H model.

Maneuvering Method	Miss in Condition 1 (m)	Miss in Condition 2 (m)	Speed Loss in Condition 1 (m/s)	Speed Loss in Condition 2 (m/s)
Critical safe miss maneuver	71.23	268.99	42.91	23.97
Sinusoidal maneuver	60.02	123.97	43.98	44.31
Spiral maneuver	72.85	72.45	74.96	75.32
Norm differential	178.24	675.29	75.27	75.77

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, C.; Yan, J.; Lyu, R.; Liang, Z.; Chen, Y. Optimal Guidance Law for Critical Safe Miss Distance Evasion. Aerospace 2024, 11, 1041. https://doi.org/10.3390/aerospace11121041

AMA Style

Wang C, Yan J, Lyu R, Liang Z, Chen Y. Optimal Guidance Law for Critical Safe Miss Distance Evasion. Aerospace. 2024; 11(12):1041. https://doi.org/10.3390/aerospace11121041

Chicago/Turabian Style

Wang, Chengze, Jiamin Yan, Rui Lyu, Zhuo Liang, and Yang Chen. 2024. "Optimal Guidance Law for Critical Safe Miss Distance Evasion" Aerospace 11, no. 12: 1041. https://doi.org/10.3390/aerospace11121041

APA Style

Wang, C., Yan, J., Lyu, R., Liang, Z., & Chen, Y. (2024). Optimal Guidance Law for Critical Safe Miss Distance Evasion. Aerospace, 11(12), 1041. https://doi.org/10.3390/aerospace11121041

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Optimal Guidance Law for Critical Safe Miss Distance Evasion

Abstract

1. Introduction

2. Optimal Control Problem of Critical Safe Miss Distance Evasion

2.1. Dynamics of Pursuer and Evader

2.2. State Equation for Zero-Effort Miss

2.3. Objective Function and Constraints

3. Solution Method of the Optimal Guidance Law

3.1. Optimal Guidance Law

3.2. Method for Solving Co-State Variables

3.3. Iteration Method for Solving Nonlinear Equations

4. Simulation and Analysis

4.1. Simulation Setup

4.2. Simulation of Critical Safe Miss Distance Evasion

4.3. Comparison with Other Methods

4.4. Evaluation of the Adaptability to Unpowered Aerial Vehicle

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI