1. Introduction
When a re-entry hypersonic vehicle is launched, it reaches a specific altitude and then re-enters the atmosphere from near-Earth orbit. The lifting body configuration is favored for re-entry hypersonic vehicles (RHVs) due to the advantages of high maneuverability and great longitudinal and lateral ranges [
1,
2]. There are many cutting edge techniques for RHVs and one of them is guidance, navigation and control (GNC). Authors have conducted research on high-accuracy integrated navigation technologies for RHVs [
3,
4] and this paper focuses on integrated guidance and control. The mathematical foundation of re-entry guidance and control is the optimal solution for tightly coupled, high-rank nonlinear problems with complicated constraints under disturbances and uncertainties. Therefore, the design of guidance and control system for RHVs is a multiobjective, highly constrained and strongly nonlinear control problem. For lifting body RHVs, the states of the guidance and control nonlinear systems include position, velocity, attitude and flow angles while the actuators (commands) include moments imposed on the RHV through aerodynamic forces. For re-entry flight, the guidance and control are tightly coupled and their mutual influences are non-negligible, so integrated guidance and control (IGC) is necessary. Additionally, during re-entry the IGC systems are challenged by many uncertainties such as high-altitude wind speeds up to 170 m/s [
1], mathematical model errors under large angle of attack (AOA) and even insufficient failure of actuators. Under these scenarios, inaccuracy of aerodynamic coefficients may happen. Additionally, in cases of actuator failure and target change, a new trajectory needs to be generated online for guidance and control. Capabilities of online trajectory generation and online coefficient estimation will improve the IGC performance of RHVs, widening their mission range and reinforcing reliability. Research has been conducted related to IGC, trajectory generation and aerodynamic coefficient estimation. The Gauss pseudospectral method (GPM) is widely used for trajectory generation [
5,
6,
7]. A novel time-optimal memetic whale optimization algorithm integrating the GPM is proposed in [
5] for the hypersonic vehicle re-entry trajectory optimization problem with no-fly zones, proving that the GPM possesses a rapid convergence speed around the optimum and higher accuracy in the trajectory construction field. Mao et al. also applied the GPM for re-entry trajectory optimization because of its high efficiency and accuracy [
6]. The GPM is also adopted to generate trajectory offline for guidance and control systems of re-entry vehicles in the presence of control constraints and multiple disturbances based on unified enhanced trajectory linearization control (TLC) [
7]. Other methods of trajectory generation are studied [
8,
9,
10] but the solutions are suboptimal due to online application. Generally, in the above research on GPM, the trajectory is generated with commands of flow angles—namely, AOA, bank angle (BA) and sideslip angle. However, the real commands of RHV guidance are moments and forces provided by control surfaces or a Reaction Control System (RCS). The constraints of angular rates and accelerations of the flow angles should also be considered along with the magnitude of the flow angles when generating the trajectory, otherwise the required flow angles of commands may not be accessible.
Various IGC schemes have been proposed by researchers. An IGC scheme is developed for a reusable launch vehicle (RLV) in [
11] with the aim of improving the flexibility, safety and autonomy. An outer loop re-entry guidance law with online trajectory reshaping capability is designed and a novel re-entry attitude control strategy is proposed based on a sliding mode controller (SMC) and disturbance observer. A three-dimensional partial IGC law is proposed in [
12] for cooperative flight of multiple hypersonic re-entry missiles. An IGC scheme combining with the virtual target and the filter for fixed-trim RHV is proposed in [
13]. The IGC is based on a back-stepping method and an Extended State Observer (ESO) is designed for estimating the acceleration of the virtual target. Additionally, a small-gain method for IGC in the terminal phase of re-entry is designed using an SMC [
14]. In order to reduce control chattering, the virtual control is introduced into the design of third-order SMC [
15]. Shao et al. propose a novel composite IGC law for a missile intercepting against an unknown maneuvering target with multiple uncertainties based on the back-stepping technique. [
16]. More research on guidance and control is detailed in [
17,
18,
19]. It can be summarized that many efforts have been made to improve the capacity of resisting disturbances and overcoming coefficient uncertainty in the field of re-entry IGC.
In the field of online coefficient estimation, Seo et al. developed two Kalman filter based techniques to estimate aerodynamic coefficients for fixed-wing aircraft in upset conditions [
20]. The proposed methods are tested with real ordinary flight data sets but the re-entry scenario is not discussed. Additionally, Ferreres et al. conducted research on the adaptive Linear Fractional Transformation (LFT) control of a civil aircraft with online frequency-domain parameter estimation [
21] and Subrahmanyam et al. also developed an online identification system in the frequency domain for mini cropped delta Unmanned Aerial Vehicles (UAVs) [
22]. Chowdhary et al. compare the performance of three online parameter estimation algorithms with real flight data from two aircrafts—a fixed-wing aircraft (HFB-320) and a rotary wing UAV (ARTIS). An Extended Kalman Filter (EKF) and Unscented Kalman Filter (UKF) are compared and results show that the UKF performs better [
23]. Similarly to all the work above, only aircrafts in normal conditions are researched, showing a lack of high-maneuver and large AOA scenarios of re-entry flight. A novel robust adaptive gliding guidance strategy based on online identification of aerodynamic coefficients for hypersonic vehicles is proposed in [
24]. Aerodynamic models are constructed in the form of quadratic polynomial functions and an Extended Kalman Filter (EKF) is employed to estimate the unknown parameters. Additionally, online aerodynamic coefficient identification based on EKF is tested with real data from two hypersonic vehicles, X-31 and HARV [
25], showing the effectiveness of coefficient estimation. However, considering the one-order approximation accuracy of EKF, it can be improved by applying UKF whose approximation accuracy is at least second-order. Additionally, the computation complexity is relieved for online applications in the case of UKFs since a Jacobian matrix is not necessary.
In the case of RHV actuator failure, the control capability is crippled, leading to the original target or the landing site not being reachable. Firstly, the changed aerodynamic coefficients need to be estimated online for trajectory regeneration and IGC. Secondly, a new trajectory needs to be generated online to guide the RHV to a feasible landing site. Additionally, online trajectory generation is also required for a changing target scenario; online aerodynamic coefficient estimation is also meaningful in terms of improving the IGC performance under obvious disturbances and uncertainty.
In this paper, a novel IGC scheme for RHV is proposed with the capabilities of online aerodynamic coefficient estimation based on UKF and online trajectory generation based on the GPM. A Linear Quadratic Regulator (LQR) is adopted for trajectory tracking guidance and a second-layer SMC is designed for attitude control. The variation of lift and drag coefficients are modeled and estimated online through a UKF. In the procedure of online trajectory generation, commands are set as moments of actuators and their extremums impose constraints on the rate and acceleration of flow angles such as AOA and BA. Comprehensive simulations are conducted to prove the effectiveness and advantages of the proposed IGC scheme.
Compared with other known re-entry IGC schemes, the following contributions are made by our research from a scientific view: (1) Constraints of the rate and acceleration of flow angles are considered in trajectory generation based on the GPM. (2) Dynamic models with 6 degrees of freedom (DOFs) are applied in the guidance laws of LQR, which takes the RHV’s control constraints into account and can provide expected flow angles and their rates for attitude control system. (3) During guidance lift and drag, coefficients of RHV are estimated online for re-entry vehicles through UKF for robust IGC and possible online trajectory construction.
The remainder of this paper is organized as follows. In
Section 2, the dynamics models and multi-constraints during re-entry are presented.
Section 3 introduces the novel IGC scheme, including the online trajectory generation algorithm and coefficient estimation algorithm. The simulations are shown in
Section 4 and conclusions are drawn in
Section 5.
3. The Novel IGC Schemes
The guidance and control systems of RHVs are usually designed separately and their mutual influences are ignored, so the synergistic relationships between the two subsystems are not fully exploited. In order to improve the guidance and control performance, a novel IGC scheme is proposed with the capabilities of online aerodynamic coefficient estimation based on a UKF and online trajectory generation based on the GPM. The whole diagram of IGC is illustrated in
Figure 1.
3.1. Online Trajectory Generation Based on the GPM
Essentially, re-entry trajectory generation is a nonlinear optimization problem under various constraints. There are two main types of methods to solve the problem—a direct method and indirect method. The former turns continuous nonlinear problems into discrete nonlinear programing for solution. The GPM is one of the direct methods at a global scale that has been widely used in re-entry trajectory generation for its high accuracy and efficiency. The GPM discretizes the continuous states and commands, approximates them in the form of polynomials and satisfies the constraints at certain match points. As the procedure is conducted at a global scale, the parameters that need to be determined are less than traditional local approximation methods while the accuracy is higher.
In this paper, both point-mass and attitude dynamic models, namely Equations (1) and (2), are applied in trajectory generation based on the GPM. As stated previously, the controls (commands) are the moments of the actuators:
where the subscript
x,
y and
z refer to the moment along the front side, right side and down side of the RHV body frame. The states are expanded to 12 dimensions:
Each state has its range limitation.
Two cost functions are designed in this paper: one is the maximum lateral range and the other is the least final point error at a specific position:
where
tf is the time at terminal and
rT,
θT and
φT represent the desired final location.
Kθ and
Kφ are constant scale factors with units of meters.
The GPM will provide the numerical states and controls at each match point while their values between match points can be achieved by polynomial fitting with respect to time. The number of match points affects the computation load and accuracy at the same time. It is always true that the more match points the higher computation load, but this is not always true for accuracy. Therefore, the appropriate number of match points should be determined, especially for online trajectory generation which demands more for computation efficiency. Discussions will be scheduled in the simulation section.
Moreover, unlike the offline scenario, the online trajectory is being generated when the RHV is guided according to the offline trajectory. The initial motion information used by the GPM is at time ttg0 and it takes ttgf to finish online trajectory generation, so the new trajectory is referenced by IGC from its time of ttgf. This is also detailed in the simulations.
3.2. Online Aerodynamic Coefficient Estimation Based on UKF
As mentioned previously, the maximum and average errors of the fitting model for aerodynamic coefficients CL and CD are 12% and 3.8%, respectively. Additionally, in the case of actuator failure, the aerodynamic coefficients may change greatly. To deal with the failure problem and improve the IGC performance, the aerodynamic coefficients should be estimated online.
Because both the coefficient model and motion model are strongly nonlinear, a UKF is adopted in our scheme due to its better accuracy and lower complexity for online application, since the Jacobian matrices are not needed.
(1) State functions
The state of a UKF has 12 elements:
The first six elements are the same with
while the last six elements are the aerodynamic coefficient factors ranging from 0 to 1, which indicate the extent that the aerodynamic coefficients have changed. According to Equation (4):
In fact, the coefficients
CL3 and
DL3 have much lower impacts on
CL and
CD, which is detailed in
Section 4, so their changing factors are not included in the states.
Assuming the factors are random constants or slowly changing values, they are modeled as follows:
The input vector consists of AOA
α and BA
σ with the assumption that the sideslip angle is zero:
α affects the
CL and
CD together with velocity and aerodynamic coefficients. Therefore, they determine the lift and drag forces
L and
D.
σ directly affects the velocity heading angle
ψ and flight path angle
γ according to Equation (1). So,
α and
σ are chosen as inputs.
The three-DOF point-mass dynamic function and the coefficient factor function together constitute the nonlinear system state function. After discretization, the state transition function
FUKF yields:
where
k is the discretized time while
WUKF is the zero-mean unrelated Gaussian process noise vector with a variation matrix of
QUKF.
(2) Measurement functions
The measurements are the radial distance from the center of the Earth to the RHV,
r; the longitude and the latitude,
θ and
φ; the Earth-relative velocity
V:
These measurements can be obtained through navigation systems such as an inertial navigation system, Global Satellite Navigation System, etc. The measurement function is linear so the measurement matrix can be shown as:
where eye(4) is four-dimensional unit matrix. The measurement function can then be expressed as:
where
VUKF is the zero-mean unrelated Gaussian measurement noise vector with a variation matrix of
RUKF.
The coefficient estimation can then be realized through the UKF algorithm. The estimated coefficients will be used for online trajectory generation and IGC. Parameters of UKF will be introduced in
Section 4.
3.3. IGC Laws Based on LQR and SMC
Traditionally, the guidance system and attitude control system are designed separately. The guidance outputs command according to the current motion information, generally in the form of flow angles such as AOA and BA. The attitude control system just follows the commands as best as possible and no feedback is provided to the guidance system. However, as guidance and control systems are highly interactive, their mutual influences should be considered and the design should be integrated.
In our research, the principle of spectrum separation is applied for the IGC design so that the attitude control system is the inner loop with a small time constant while the guidance system is the outer loop with a large time constant. The trajectory tracking guidance laws based on the LQR and attitude control method based on two-layer SMC are designed. The guidance laws generate commands under the control capability constraints and attitude tracking errors will be fed back to the guidance system for command updates.
(1) Guidance laws based on LQR
LQRs can be applied to multi-input and multioutput control systems with the advantages of high accuracy, high efficiency and easy realization online. For re-entry trajectory tracking guidance, the dynamic models need to be linearized for the LQR. Generally, only the three-DOF dynamic model is applied and AOA and BA are the commands or inputs. In our IGC scheme, with the same thought of trajectory generation based on the GPM, we take the ability of the attitude control system into consideration by applying the six-DOF dynamic model and setting moments as commands. Therefore, the LQR based guidance system can also provide expected flow angles and their rates for an attitude control system based on an SMC.
The state of LQR has 12 elements that are the differences between the reference trajectory and the actual one.
The commands of the LQR are the moment differences that need to be updated:
At any point of the trajectory, the linearized system function can be written as:
where
and
are the state matrix and command matrix, yielding:
Fi (i = 1, 2…12) are the expressions at the right side of the equal sign in Equations (1) and (2).
The LQR calculated
to tune
so that the minimum of the cost function can be reached (subscript
LQR of
and
is omitted):
where
and
are non-negative scale matrices that can be determined by Bryson’s rule.
The subscript max means the maximum of each element. In other words, each element is expected within these setting ranges.
The updated command can be calculated as:
where
is derived from the Riccati equation as follows (all
LQR subscripts are omitted):
Thus, guidance LQR updates the moments to control RHV and tracks the reference trajectory. Moreover, as the LQR state contains the expected flow angle and angular rate information, they can be provided for the attitude control system.
(2) Attitude control method based on SMC
A second-layer SMC is designed for RHV attitude control in the novel IGC scheme. Based on the expected angular rates in body frame and flow angles, ignoring the velocity-related terms in the first three equations in Equation (2), the rate of flow angles can be derived:
The guidance outer loop provides the expected flow angles and their angular rates and they become the commands for the attitude control inner loop.
The first-layer slide mode surface of the attitude SMC is designed based on the flow angle commands:
Exponent approaching is adopted as follows:
where
and
are positive feedback factors and sign() is sign function. Using the set virtual angular rate in the body frame as commands and according to Equations (27)–(29), it can be shown that:
The command of angular rates of flow angles can be derived from guidance system according to Equation (27).
Based on virtual angular rates, the second-layer sliding surface is designed as follows:
Similarly, exponent approaching is adopted:
where
and
are positive feedback factors. Therefore, the moment commands for attitude control can be derived as:
The virtual command of the second layer does not affect the control performance as much and they can be set to zero.
Finally, a transient process is considered where the actuators are modeled as second-order inertial element with the response frequency of 10π and damping ratio of 0.707.
4. Simulations and Verifications
Comprehensive simulations were conducted to verify the effectiveness and advantages of the proposed IGC scheme. Firstly, a reference trajectory was generated offline with the GPM and its cost function was JGPM1—namely, the largest lateral range. If the dynamic coefficients are accurate, the IGC guarantees that RHV follow the trajectory with small errors. However, if the coefficients have errors or the actuator failure happens, it can be seen the performance of IGC obviously degrades for the offline reference trajectory, even if coefficient estimation online is applied. Finally, based on the online coefficient estimation, a new trajectory was generated to a specific final position. The IGC performance is comparable with that of accurate coefficients for offline trajectory.
4.1. Configurations of the Simulations
Some constants and aerodynamic coefficients in the simulations are listed as follows. The parameters of CAV-H, developed by Lockheed Martin Space Systems Company of USA, are adopted in the simulations and related parameters are listed in
Table 1 [
26].
4.2. Simulation in Normal Conditions
A trajectory of the largest lateral range with a cost function of
JGPM1, as Equation (10) shows, is generated offline, as
Figure 2 shows.
Figure 2a shows the three-dimension position.
Figure 2b shows the profiles of longitude, latitude, height and speed with respect to time.
Figure 2c shows the profiles of flight-path angle, velocity heading angle, AOA and BA with respect to time.
The initial height of RHV is 60 km and position is (0, 0). The initial velocity is 6400 m/s and its heading is 90 degree—namely, flying towards the East. The maximum lateral range reaches about 45.4° N in 2110 s. Terminal constraints are that the final height is 24 km and final velocity is 760 m/s with the path angle of −3 degree, which are requirements from the horizontal landing phase of RHV.
As for the parameters of IGC system, the diagonal elements of QLQR are (1/1000)2, (Re/1000)2, (Re/1000)2, (1/30)2, (1800/π)2, (1800/π)2, (18/π)2, (18/π)2, (18/π)2, (18/5π)2, (18/5π)2, and (18/5π)2, respectively, while the diagonal elements of RLQR are (45/π)2, (180/π)2, and (90/π)2, respectively. The GMC parameters K01, K02, K11, and K12 are 0.012, 0.0015, 13, and 0.001, respectively.
In normal conditions, the dynamic coefficients are all accurate and disturbances are small, the IGC based on LQR and GMC is capable of leading the vehicle to the target location with height and position error within 120 m while velocity error within 0.4 m/s, as
Figure 3 shows. The simulation verifies the effectiveness of IGC algorithms based on LQR and GMC under normal conditions.
4.3. Simulation with Large Dynamic Coefficients Errors
However, when dynamic coefficients have large errors or actuator failure happens, the performance of same IGC system degrades obviously. In this simulation, 10% of
CD0 error and −20% of
CL0 error are added and the IGC results are shown in
Figure 4.
According to the offline reference trajectory, the final height error is 2700 m, with a position error of up to 5500 m and velocity error of 5 m/s, which are much larger than those in normal conditions. There are two reasons for this:
- (1)
The offline reference trajectory is generated based on accurate coefficients and it may not be reachable for the degraded flight ability;
- (2)
The guidance system also uses aerodynamic coefficients to update commands, which are also erroneous if coefficients are not accurate.
The results of the dynamic coefficient estimation based on UKF are provided in
Figure 5.
The estimation begins from the very start of the flight and the accuracy of all coefficients is within 1% after about 70 s except from the CL2 factor because it has lower impact on the lift coefficient and its observability is lower. For example, when the AOA is 10 degrees and the velocity is 6000 m/s, the CL = 0.3170 while the part of is only 0.038—namely, 12% of the whole efficient. Thus, the error of 2.5% of CL2 in this simulation leads to only 0.3% error of the whole efficient. In total, the estimation accuracies of CL and CD after 100 s are 0.86% and 0.75%, respectively. Although the estimation based on UKF can provide accurate dynamic coefficients, the final position and velocity errors are still large due to reason (1), detailed previously. Therefore, once the coefficient change detected or mission profile changed, the online trajectory is required.
4.4. Simulation with Online Trajectory Generation
The number of GPM match points affects the computation load and accuracy at the same time. For the online scenario, more match points can be tolerated as time limitation is weak. For the online scenario, the new trajectory must be generated within seconds otherwise RHV may be in a dangerous situation with the potential of actuator failure. In our research, maximum match points for offline is 200 while for online the number is limited to 100. Indeed, in the simulations, the number of match points for offline is 186 and for online is 89. The GPM is complemented by Matlab script on our computer with Intel Core i5-2520M CPU @2.5 GHz and a Windows 7 operating system and it takes 5.8 s to generate the online trajectory with cost function of JGMP2, which guides the RHV to the specific location of (50 E, 25 N, 30 km) for emergency.
As the coefficient estimation settles down at about 100 s, the online trajectory generation is scheduled to begin at the 120th s of the whole flight. Ten seconds are left for online trajectory generation. From the very beginning of the flight to the 130th s, the RHV was guided according to the offline trajectory. The initial motion information at the 120th s was used for online trajectory generation, but the reference trajectory was switched to the online one at the 130th s. In practice, the ten seconds are left for online GPM based trajectory computation.
Figure 6a shows the comparisons of the 3D positions of the online trajectory and the offline one. The IGC led the RHV according to the online trajectory for a new target after the 130th s.
Figure 6c shows the 3D positions, velocity, path angle, heading, AOA and BA profiles. AOA ranges from 10 to 30 degrees while its limitations are −5 to 35 degrees. BA ranges from −72 to 35 degrees while its limitations are −89 to 89 degrees. The angular velocity of AOA and BA are mostly within 1 deg/s apart from the time of switched reference trajectory.
Figure 6d shows the angular velocities and accelerations of AOA and BA. Since moment constraints are considered, it can be seen that the angular acceleration of AOA and BA are within 0.4 and 2 deg/s
2, respectively, which ensures that the flow angle commands are smooth and achievable.
Figure 7 shows that the online trajectory satisfied all the path constraints during the whole flight. The RHV passed through the velocity-height corridor perfectly.
The IGC errors at the final point under large coefficient errors according to online trajectory are similar with those in normal conditions according to offline trajectory—namely, the results described in
Section 4.2. The height and position errors are within 200 m while velocity error is within 0.5 m/s, which is depicted in
Figure 8.
Figure 9a shows that the AOA and BA provided by guidance system are basically the same with reference trajectory. The difference comes from the LQR based guidance system for the purpose of eliminating motion error, especially when the online trajectory is initially referenced. The enlarged
Figure 9b shows that the attitude control system keeps the AOA and BA in accordance with guidance system states. The standard deviation of attitude error is within 0.05 deg, which verifies the effectiveness of attitude control system based on GMC.