Convex Optimization for Rendezvous and Proximity Operation via Birkhoff Pseudospectral Method

Zhiwei Zhang; Dangjun Zhao; Xianbin Li; Chunyang Kong; Ming Su

doi:10.3390/aerospace9090505

Abstract

Rapid and accurate rendezvous and proximity operations for spacecraft are crucial to the success of most space missions. In this paper, a sequential convex programming method, combined with the first-order and second-order Birkhoff pseudospectral methods, is proposed for the autonomous rendezvous and proximity operations of spacecraft. The original nonlinear and nonconvex close-range rendezvous problem with thrust constraints and no-fly zone constraints is converted into its convex version by using the sequential convexification techniques; then, the Birkhoff pseudospectral method is used to transcribe the dynamic constraints into a series of linear algebraic equality constraints, in other words, a convex second-order conic programming problem with a relatively small condition number. Thus, the resulting problem can be accurately and efficiently solved by a convex solver. The simulation results indicate that the proposed methods, especially the second-order Birkhoff pseudospectral method, have obvious advantages over other methods in computational efficiency and sensitivity.

Keywords:

close-range rendezvous; Birkhoff pseudospectral method; convexification; computational efficiency; sensitivity analysis

1. Introduction

The rendezvous and proximity operation (RPO), a key technology in on-orbit space missions such as the assembly, supply, maintenance, or astronaut rotation, usually refers to the process in which the chaser spacecraft approaches the target spacecraft to complete docking or accompany a flight. In the last century, the early space programs carried out by the United States and Russia have accumulated rich RPO technological experiences, hence, some famous projects have naturally become the main research objects of scholars. Fehse [1] presented the background information, related concepts, and control strategies of early RPO tasks, such as the Gemini [2], Apollo [3], and Space Shuttle of the US, as well as the Soyuz/Progress [4] of the former Soviet Union, or Russia. Goodman [5] surveyed the overall development and flight history of the Space Shuttle from 1983 to 2005 and gave a detailed introduction to the procedural limitations and technical challenges encountered in the early space shuttle RPO missions. Based on the characteristics of different technical routes between the United States and Russia, Woffinden and Geller [6] comprehensively summarized the orbital rendezvous standards, tasks, and technologies, and concluded that autonomous technology would become the mainstream trend of RPO tasks in the future.

Generally, an RPO can be divided into five periods: the launch, phasing, close-range rendezvous, final approach, and docking [7,8]. The spacecraft’s maneuver requires comprehensive navigation and control support from the ground during the phasing period. As a result, researchers are concentrating their efforts on the close-range rendezvous and final approach to prepare for final docking or berthing using autonomous rendezvous technology [9]. Autonomous rendezvous technology means that the entire rendezvous procedure relies solely on onboard equipment to create a full set of maneuver solutions, without the need for ground-based assistance. In the decades before and after the beginning of the 21st century, to achieve this goal, the researchers of the Engineering Test Satellite VII (ETS-VII) [10], Experimental Satellite System-11 (XSS-11) [11], Demonstration of Autonomous Rendezvous Technology (DART) [12], and Orbital Express [13], have actively investigated and made significant progress in the autonomous rendezvous technologies. In recent years, China has also achieved several new technological breakthroughs [14,15], among which the Shenzhou manned spacecraft, as well as the rapid autonomous rendezvous and docking process of the Tianzhou cargo spacecraft, which has been reduced to around 6.5 h. SpaceX, a private US aerospace company, also has matured autonomous rendezvous and docking technology through its Dragon spacecraft and Crew Dragon spacecraft projects [16]. However, owing to the limited computing power of onboard equipment, it is challenging to achieve the online and real-time trajectory generating requirements when dealing with complicated conditions, such as power loss and safety obstacle avoidance.

The process of close-range rendezvous is a typical trajectory optimization problem with various constraints, which can be treated as an optimal control problem. Numerical methods for solving optimal control problems (OCPs) are mainly divided into indirect methods and direct methods [17,18,19]. The indirect method converts the OCPs into extrema for multi-point boundary value problems based on the calculus of variations and the first-order optimality condition [20]. Despite having a globally optimal solution with high accuracy, the indirect method is not suitable for solving complicated OCPs because of the tedious derivation process and sensitive initial values. The direct method discretizes the state and control, transforms the continuous-time OCPs into nonlinear programming (NLP) problems, and then employs the optimization techniques [21] to directly solve the index function [22,23]. The convex optimization method has swiftly drawn the interest of scholars because of its polynomial complexity, great computational efficiency, and theoretical guarantee of a solution for optimal control issues [20,24,25]. In the convex optimization framework, an original OCP is often converted into a convex programming problem by using convexification techniques for the non-convex constraints and discretization techniques for the dynamical constraints. Malyuta et al. [26] comprehensively reviewed the success experience and latest progress of convex optimization in the fields of planetary landing, RPOs, and asteroid landing.

Discretization methods are crucial to an NLP problem or a convex programming problem because the converted problem can be numerically solved using a well-known NLP-solver [27,28]. Actually, the discretization approach necessitates a trade-off between the computing efficiency and solution accuracy. The early discretization methods [29] (e.g., the Euler, Trapezoidal, or Runge–Kutta methods) were developed based on discretized uniform grids, where the state or control in each grid interval was approximated by the same fixed-order piecewise polynomials. This simple construction means that only a large number of grid points achieves sufficient accuracy. The applications are constrained, since an increase in the variables typically results in significant computing overhead [30]. In recent years, the pseudospectral discretization methods have made great progress in theory [31,32,33,34,35] and engineering practice [36,37,38]. The pseudospectral discretization methods, based on non-uniform grid points, have spectral convergence accuracy with an increase in the number of grid points, but hence bring a sharp deterioration in the condition number [39]. The modified pseudospectral knots method [40] suppresses the growth of the condition number at the expense of the computational efficiency and convergence rate. Some preconditioning techniques have made effective attempts [41,42]. At present, the most attractive work is the Birkhoff pseudospectral method framework for solving OCPs, proposed by Ross et al. [39,43]. They investigated the first-order Birkhoff interpolation on arbitrary grids and transcribed the dynamic constraints into algebraic constraints using the idea of pseudospectral methods. The condition number of the original system has decreased significantly, from

O (N^{2})

to

O (\sqrt{N})

, and, in some cases, it has even reached

O (1)

, thanks to the advantage of the Birkhoff interpolation in dealing with higher-order differential equations [44,45,46].

In light of the progress of the Birkhoff interpolation [39,43,44,45], we extend the paradigm of the Birkhoff pseudospectral method and supplement the research findings in computational efficiency and stability. Three main contributions can be summarized in our work. First, we provide a novel sequential convex programming method that combines convex techniques and the Birkhoff pseudospectral method, focusing on the rendezvous and proximity operation of the spacecraft. Second, we provide a modified pseudospectral discretization technique based on the second Birkhoff interpolation to deal with the dynamic constraints. According to the simulation results, the revised technique performs better than the first-order Birkhoff pseudospectral method. Finally, we discover that the Birkhoff pseudospectral methods have clear advantages in computing efficiency and sensitivity, in addition to the improvement in the condition number.

The content of this paper is arranged as follows. The close-range rendezvous problem is discussed in the second section, along with several convexification techniques. Section 3 presents the discretization of the optimal control problem by using the classic pseudospectral method. In Section 4, two well-conditioned pseudospectral discretization approaches, based on the Birkhoff interpolation, are provided for the rendezvous trajectory optimization. The computational efficiency and sensitivity of the proposed algorithm are described through simulation experiments and results analysis in Section 5. Finally, Section 6 provides a conclusion and evaluates the work of this paper.

2. Problem Formulation

2.1. Relative Equations of Motion

A close-range rendezvous maneuver scenario is discussed in this paper, in which the deputy spacecraft (called the Chaser) passes by the chief spacecraft (called the Target) without colliding. The motion of the Chaser relative to the Target is normally used to illustrate the above proximity. As shown in Figure 1, a local-vertical, local-horizontal (LVLH) coordinate system is defined at the center of the Target

O_{T}

and moves with it. The

x

axis, or R-bar, is radially outward. The

y

axis, or V-bar, is perpendicular to the

x

axis and points in the direction of the Target’s motion. The

z

axis, or H-bar, completes the right-hand rule. The position and velocity of the Chaser are indicated by the vectors

r = {[x, y, z]}^{T}

and

v = {[\dot{x}, \dot{y}, \dot{z}]}^{T}

, respectively. The superscript

T

represents the transpose operation of a vector or matrix.

Figure 1. A close-range rendezvous maneuver scenario.

Figure 1 also contains the geocentric equatorial inertial (GEI) coordinate system, which sets its origin at the center of the Earth

O_{E}

, the X-axis point in the vernal equinox direction, and the Z-axis point in the direction of the North Pole. The Y-axis is in the equatorial plane and completes the right-hand rule. The vector positions of the Chaser and Target relative to the center of the Earth

O_{E}

are denoted by

r_{T}

and

r_{C}

, accordingly. The angular velocity of the Target is given by the vector

ω

, and the scalar

ω

indicates its magnitude. Therefore, the vector position of the Chaser satisfies

r_{C} = r_{T} + r .

(1)

Differentiating both sides of the Equation (1) twice with respect to time for the GEI coordinate system yields the Chaser’s motion equation as

{\ddot{r}}_{C} = {\ddot{r}}_{T} + \ddot{r} + 2 (ω \times \dot{r}) + \dot{ω} \times r + ω \times (ω \times r),

(2)

where the vectors,

{\ddot{r}}_{C}

and

{\ddot{r}}_{T}

, denote the inertial acceleration of the Chaser and the Target. The vector

\ddot{r}

represents the acceleration of the Chaser relative to the Target. The remaining three terms on the right side of Equation (2) are the Coriolis acceleration, the tangential acceleration, and the centripetal acceleration, respectively. The dot notation

[\cdot]

on signals is defined as the derivative

d / d t

. The double dot [⋅⋅] on signals represents the second-order derivative

d^{2} / d t^{2}

. The cross-product of the two vectors is indicated by the operator

[\times]

. The following are the operation rules.

ω \times \dot{r} = \hat{ω} \dot{r}, \dot{ω} \times r = \hat{\dot{ω}} r, ω \times (ω \times r) = \hat{ω} (\hat{ω} r)

(3)

where

\hat{ω}

and

\hat{\dot{ω}}

are the skew-symmetric matrices corresponding to

ω

and

\dot{ω}

, respectively.

The following essential assumptions are made to simplify the model of the close-range rendezvous motion. Assume that

the Target is in a circular orbit near the Earth, and $\dot{ω} = 0$ ;
the orbit radius of the Target $r_{T}$ . is much larger than the distance between the two spacecraft $r$ ;
both spacecraft are regarded as mass points, and all the orbit perturbations are ignored.

The motion of the Chaser, relative to the Target, can be described by the Clohessy–Wiltshire(CW) equations, based on Equations (2) and (3) and the above assumptions. The CW equations are a set of linear, time-invariant differential equations [47]. The detailed derivation is shown in Chapter 7, reference [48], and the results are given here, directly.

{\begin{array}{r} \ddot{x} - 2 ω \dot{y} - 3 ω^{2} x = \frac{T_{x}}{m} \\ \ddot{y} + 2 ω \dot{x} = \frac{T_{y}}{m} \\ \ddot{z} + ω^{2} z = \frac{T_{z}}{m} \end{array}

(4)

where scalar

T_{i} (i = x, y, z)

represents a thrust component of the Chaser’s engine. The mass of the Chaser is given by

m

. Therefore, the relative equations of motion or dynamics equations can be written as a first-order system of variables

(r, v)

[\begin{matrix} \dot{r} \\ \dot{v} \end{matrix}] = [\begin{matrix} 0_{3 \times 3} & I_{3 \times 3} \\ C_{1} & C_{2} \end{matrix}] [\begin{matrix} r \\ v \end{matrix}] + [\begin{matrix} 0_{3 \times 3} \\ I_{3 \times 3} \end{matrix}] \frac{T}{m},

(5)

or a second-order system of

r

\ddot{r} = C_{1} r + C_{2} \dot{r} + \frac{T}{m},

(6)

where

C_{1} = [\begin{matrix} 3 ω^{2} & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & - ω^{2} \end{matrix}], C_{2} = [\begin{matrix} 0 & 2 ω & 0 \\ - 2 ω & 0 & 0 \\ 0 & 0 & 0 \end{matrix}], T = [\begin{matrix} T_{x} \\ T_{y} \\ T_{z} \end{matrix}] .

(7)

The matrix

0_{3 \times 3}

and

I_{3 \times 3}

denote the

[3 \times 3]

-dimensional zero matrix and the identity matrix, respectively.

The analysis of the relative motion of two bodies using the CW equations, which can offer analytical solutions, is widespread and effective [49]. However, the dynamical constraints in the close-range rendezvous process must be balanced against the events, path constraints, and other factors. The analytical solution of the CW equations becomes invalid in this instance. The RPO problem is treated as an optimal control problem (OCP), with various nonlinear and nonconvex constraints in the later subsection.

2.2. Rendezvous Trajectory Optimization

The Chaser completes the objective of approaching the Target with the shortest maneuvering time, or the least fuel consumption, under complicated constraints using the law of relative motion equations. This process is called rendezvous trajectory optimization (RTO). Let

[r, v, m] \in ℝ^{7 \times 1}

be the state variables and

T \in ℝ^{3 \times 1}

be the control variable. Therefore, the OCP version of the RTO in the LVLH coordinate system can be defined as follows:

P r o b l e m 1 (Nonconvex RTO) {\begin{cases} minimize J = \int_{0}^{t_{f}} ‖ T ‖ d t . \\ \begin{matrix} subject to & \begin{array}{l} Eq . (4), \\ \dot{m} = - \frac{‖ T ‖}{I_{s p} g_{0}}, \\ ‖ T ‖ \leq T_{\max}, \\ ‖ r ‖ = \sqrt{x^{2} + y^{2} + z^{2}} \geq R_{s}, \\ {[r, v, m]}^{T} |_{t = 0} = {[r_{0}, v_{0}, m_{0}]}^{T}, \\ {[r, v]}^{T} |_{t = t_{f}} = {[r_{f}, v_{f}]}^{T} . \end{array} \end{matrix} \end{cases}

(8)

where scalar

t_{f}

signifies the terminal time, which is set as a constant, i.e., the time of RTO is fixed. Scalar

I_{s p}

is the engine-specific impulse, and

g_{0}

is called the sea level gravitational acceleration. The operator

‖ \cdot ‖

here denotes the modulus of a vector. The magnitude of the thrust vector,

‖ T ‖

, must not be greater than the maximum thrust amplitude

T_{\max}

. The objective function

J

is the integral of thrust

‖ T ‖

to time. The initial condition

[r_{0}; v_{0}]

and the terminal condition

[r_{f}; v_{f}]

for the Chaser are generally determined according to the mission requirements and serve as reference indexes for the accuracy of the subsequent algorithms. The radius of the no-fly zone, denoted by

R_{s}

, is the minimum safe distance at which the Chaser will not collide with the Target when approaching it.

The existence of nonlinear or nonconvex terms, such as

‖ T ‖

,

T / m

and

‖ r ‖

, makes it challenging to obtain the optimal solution to Problem 1. Convexification techniques will be utilized in the following section to convert the original problem into a SOCP problem with a linear objective function and convex constraints.

2.3. Convexification Techniques Applied to RTO

Convexification techniques are used to handle the dynamic constraints and no-fly zone constraints in Problem 1, which was inspired by the study of Aciknese et al. [50,51]. In this study, the key convexification techniques are equivalent transformation, change of variables, and successive approximation, as stated in reference [52].

2.3.1. Equivalent Transformation of the Objective Function

The objective function in Problem 1 cannot become a linear function after discretization, because of the nonlinear factor

‖ T ‖

. Equivalent transformation is used to deal with this situation, as is done in [20]. A slack variable,

ζ

, is introduced to replace the norm,

‖ T ‖

. The new objective function,

J_{1}

, is as follows:

minimize J_{1} = \int_{0}^{t_{f}} ζ d t

(9)

subject to

‖ T ‖ \leq ζ,

(10)

0 \leq ζ \leq T_{\max} .

(11)

The transformation from

J

to

J_{1}

is equivalent and satisfies the following property at the optimal solution of the problem.

‖ T^{*} (t) ‖ = ζ^{*} (t) .

(12)

Proof of the equivalence and property is given in Proposition 1, reference [53].

2.3.2. Change of Variables for Dynamic Constraints

The nonlinear terms in dynamic constraints are replaced by linear terms, using a change of variables. The following variables are defined:

γ = \frac{T}{m}, ρ = \ln m, σ = \frac{ζ}{m}

The dynamics equations can be rewritten as

\dot{r} = v

(13)

\dot{v} = C_{1} r + C_{2} v + γ

(14)

\dot{ρ} = - μ σ

(15)

where the constant

1 / I_{s p} g_{0}

is replaced by

μ

for brevity.

Let

x = {[r, v, ρ]}^{T} \in ℝ^{7 \times 1}

be the state vector and

u = {[γ, σ]}^{T} \in ℝ^{4 \times 1}

be the control vector. Therefore, the new description form of the dynamic constraints is as follows:

\dot{x} = A x + B u

(16)

where

A = [\begin{matrix} 0_{3 \times 3} & I_{3 \times 3} & 0_{3 \times 1} \\ C_{1} & C_{2} & 0_{3 \times 1} \\ 0_{1 \times 3} & 0_{1 \times 3} & 0 \end{matrix}], B = [\begin{matrix} 0_{3 \times 3} & 0_{3 \times 1} \\ I_{3 \times 3} & 0_{3 \times 1} \\ 0_{1 \times 3} & - μ \end{matrix}]

(17)

The following is an update to Equation (10)

‖ γ ‖ \leq σ,

(18)

0 \leq σ \leq T_{\max} e^{- ρ} .

(19)

Then, using the first-order Taylor expansion, the nonlinear term

e^{- ρ}

in Equation (19) is approximated at the time,

t

, yielding:

\begin{array}{l} e^{- ρ (t)} & \approx e^{- ρ_{0} (t)} - e^{- ρ_{0} (t)} (ρ (t) - ρ_{0} (t)) \\ = e^{- ρ_{0} (t)} [1 - (ρ (t) - ρ_{0} (t))] \end{array}

(20)

where

ρ = \ln m

and the lower bound

ρ_{0}

is

\ln (m_{0} - μ T_{\max} t)

.

Now, we can replace Equation (9) with

minimize J_{2} = \int_{t_{0}}^{t_{f}} σ (t) d t

(21)

derived from Equation (15),

m (t) = m_{0} e^{- μ J_{2}} .

(22)

From Equation (22), minimizing

J_{2}

is equivalent to maximizing

m (t)

, which also implies the optimal fuel consumption. Therefore, replacing

J_{1}

with

J_{2}

as the objective function is identical.

2.3.3. Linearization of the No-Fly Zone

Setting a no-fly zone is a crucial step in ensuring the safety of the proximity process. The no-fly zone constraint in Problem 1 is defined in the LVLH coordinate system, as shown in Figure 2a. The magnitude of the position vector

‖ r ‖

also represents the distance between the two spacecraft, as the origin is set at the Target centroid. The distance must at least meet the minimum safe distance,

R_{s}

, to avoid collisions.

Figure 2. No-fly zone (red area) and safety zone (white area): (a) Nonconvex no-fly zone; (b) Linear no-fly zone.

The no-fly zone constraint can also be expressed as follows:

r^{T} r \geq R_{s}^{2} .

(23)

The affine function of the position vector

r

is denoted by

g (r)

and approximated using the first-order Taylor expansion at a given reference point

r^{r e f}

. The left side of Equation (23) becomes a linear term.

\begin{matrix} g (r) & = r^{T} r \approx g (r^{r e f}) + g^{'} (r^{r e f}) (r - r^{r e f}) \\ = {(r^{r e f})}^{T} r^{r e f} + 2 {(r^{r e f})}^{T} (r - r^{r e f}) \\ = 2 {(r^{r e f})}^{T} r - {(r^{r e f})}^{T} r^{r e f} \geq R_{s}^{2} \end{matrix}

(24)

The trust region constraint is added to guarantee a reasonable approximation of the original value.

‖ r - r^{r e f} ‖ \leq ε,

(25)

where the radius of trust-region ε is positive and small enough. The reference trajectory

r_{ref}

commonly takes the solution of the last iteration using the SC algorithm, as mentioned in Section 4.4. Inspired by the separating hyperplane theorem [24], we define a separating hyperplane, as shown in Figure 2b, and the describing equation is

r_{P}^{T} r = R_{s}^{2},

(26)

where the vector

r_{P}

is in the same direction as

r^{r e f}

and the endpoint P is on the boundary of the no-fly zone. The relationship can be denoted as

r_{P} = \frac{r^{r e f}}{‖ r^{r e f} ‖} R_{s}

(27)

Based on Equations (24)–(27), we can derive a more intuitive and linear expression of the no-fly zone constraint.

r_{P}^{T} r \geq R_{s}^{2}

(28)

Therefore, the original Problem 1 is converted into a convex Problem 2 through the above convexification techniques.

P r o b l e m 2 (ConvexRTO) {\begin{cases} minimize J_{2} = \int_{t_{0}}^{t_{f}} σ (t) d t \\ \begin{matrix} subject to & \begin{array}{l} \dot{x} = A x + Β u, \\ ‖ γ ‖ \leq σ, \\ 0 \leq σ \leq T_{\max} e^{- ρ_{0} (t)} [1 - (ρ (t) - ρ_{0} (t))], \\ r_{P}^{T} r \geq R_{s}^{2}, ‖ r - r^{r e f} ‖ \leq ε, \\ x (t_{0}) = x_{0}, x (t_{f}) = x_{f} . \end{array} \end{matrix} \end{cases}

(29)

For Problem 2, the linear time-varying system (LTV) in Equation (29) can be converted from a continuous infinite dimension to a discrete finite dimension by some discretization methods, including the zero-order hold (ZOH), first-order hold (FOH), the Runge–Kutta (RK) method, and the global pseudospectral (PS) method [26]. After discretization, Problem 2 with the convex constraints can obtain the optimal solution by solving a convex programming problem in each iteration until the convergence criterion is satisfied, via the sequential convex (SC) method. In this study, the PS method based on the Birkhoff interpolation is combined with the SC method to solve the RTO problem.

3. Discretization Based on Classical PS Method

3.1. Time-Domain Transformation

In the standard PS method, the time domain is transformed from the physical time t₀∈[t₀,t_f] to the PS time

τ \in [- 1, 1]

. The mapping function

Γ (τ)

is given as:

t = Γ (τ) = \frac{t_{f} - t_{0}}{2} τ + \frac{t_{f} + t_{0}}{2},

(30)

where the time mapping factor is defined as

κ = d Γ / d τ = (t_{f} - t_{0}) / 2

.

The discretization method of dynamic constraints, or the transformation of differential equations into algebraic equations, is the focus of this section. For the convenience of explanation, we describe the convex RTO problem as a distilled optimal control problem [35], which simplifies the expression of the boundary constraints and path constraints in Problem 2. For multivariable differential systems, the dynamic constraints follow the vector representation. The time-domain transformation of dynamic constraints is defined as

x^{'} (τ) = \frac{d x}{d τ} = \frac{d x}{d t} \frac{d t}{d τ} = κ \dot{x}

(31)

where the notation ‘′’ denotes the derivative

d / d τ

. On the one hand, the boundary constraints were simplified into unified equality constraints

G (x (- 1), x (1)) = 0_{e q}

(32)

where the dimension of the zero vector

0_{e q}

was determined by the number of equality constraints. On the other hand, the path constraints included the thrust constraints Equations (18)–(20) and the no-fly zone constraints, Equations (25)–(28), and were simplified into unified inequality constraints.

H (x (τ), u (τ)) \leq 0_{i n e q}

(33)

where the dimension of the zero vector

0_{i n e q}

depended on the number of inequality constraints.

As a result, a distilled Problem 3 [39,54] is rephrased in the PS time domain as follows:

\begin{array}{l} Varibles : x \in ℝ^{7}, u \in ℝ^{4}, τ \in [- 1, 1] \\ Problem 3 (Distilled) & {\begin{matrix} minimize & J = κ \int_{- 1}^{1} σ (τ) d τ \\ x^{'} (τ) = κ (Ax (τ) + Bu (τ)), \\ subject to & G (x (- 1), x (1)) = 0_{eq}, \\ H (x (τ), u (τ)) \leq 0_{ineq} . \end{matrix} \end{array}

(34)

3.2. Classical PS Method for RTO

Firstly, an arbitrary grid of distinct nodes is defined in the interval

[- 1, 1]

. These nodes are chosen to be the roots of orthogonal polynomials, such as the Legendre polynomials and the Chebyshev polynomials [55]

\begin{matrix} π^{N} ≜ [τ_{0}, τ_{1}, \dots, τ_{N}], \\ - 1 \leq τ_{0} < τ_{1} < \dots < τ_{N} \leq 1 \end{matrix}

(35)

where the scalar

N

represents the order of the orthogonal polynomials and the number of grid points is

N + 1

. The well-conditioned Gauss–Lobatto (GL) points are chosen as the discrete grid points [35].

Let

p (τ)

denote a component of the states

x = {[r, v, ρ]}^{T}

on a grid

π^{N}

. In the PS theory, the state function

p (τ)

can be approximated by the approximation function

p^{N} (τ)

with the interpolation basis polynomial at these collocation points. For the GL points, the collocation points are the same as the discrete grid points [35]. Therefore, on the grid

π^{N}

, we define

p (τ) ≅ p^{N} (τ) = \sum_{k = 0}^{N} p (τ_{k}) L_{k} (τ)

(36)

where the Lagrange basis functions

L_{k} (τ) (k = 0, 1, 2, \dots, N)

satisfy the Kronecker delta condition.

L_{k} (τ_{j}) = δ_{k j} = {\begin{matrix} 1, k = j \\ 0, k \neq j \end{matrix}, j = 0, 1, 2, \dots N .

(37)

The symbol ‘

≅

’ can be replaced by the symbol ‘

=

’ when the order

N

tends toward positive infinity.

By introducing Equation (37) into Equation (36), we can get

p (τ_{k}) = p^{N} (τ_{k}), k = 0, 1, 2, \dots N

(38)

To simplify the representation, we use

p_{k}

instead of

p (τ_{k})

, and Equation (38) can be written as

p_{k} = p^{N} (τ_{k})

.

One of the features of the PS methods, or any Galerkin method, is that the differentiation of the approximation function can be converted into the differentiation of the interpolation polynomial [56,57,58,59]. Consequently, taking the derivative of the time

τ

on both sides of Equation (36), we get

p^{'} (τ) = \frac{d}{d τ} \sum_{k = 0}^{N} p_{k} L_{k} (τ) = \sum_{k = 0}^{N} p_{k} \frac{d}{d τ} L_{k} (τ)

(39)

Define

D = {(D_{kj})}_{0 \leq k, j \leq N}

as the first-order PS differential matrix (PSDM), which is represented as

D_{kj} = {\frac{d L_{k} (τ)}{d τ} |}_{τ - τ_{j}}, k, j = 0, 1, 2, \dots N,

(40)

where

D_{in} = {(D_{kj})}_{1 \leq k, j \leq N - 1}

is part of

D

. Matrix

D

is also referred to as the differential operator, and it possesses a key characteristic proposed in [14].

D^{(λ)} = D • D • \dots • D = D^{λ}, λ \geq 1

(41)

where

D^{(λ)}

is called the

λ

-th order PSDM.

In addition,

p_{k} (k = 0, 1, 2 \dots N)

can also be denoted by the vector

p

.

p = {[p_{0}, p_{1}, p_{2}, \dots, p_{N}]}^{T}

(42)

Equation (39) can be simplified as a matrix-vector product by combining (40) and (42).

p^{'} = D p

(43)

Equation (43) is extended to a higher-order form by applying Equation (41) [44]

D^{(λ)} p = D^{λ} p = p^{(λ)}

(44)

where

p (λ)

denotes the λ-th derivative of

p

with respect to the PS time

τ

.

For example, for a second-order system, Equation (44) can be written as

p^{(2)} = D^{(2)} p .

(45)

The discretized decision variables are expressed as follows:

X : = [x_{0}, x_{1}, \dots, x_{N}] \in ℝ^{7 \times (N + 1)}, U : = [u_{0}, u_{1}, \dots, u_{N}] \in ℝ^{4 \times (N + 1)}

(46)

where

x_{k} = x (τ_{k}), u_{k} = u (τ_{k}), k = 0, 1, \dots, N .

(47)

Therefore, the dynamic constraints in Problem 3 can be discretized as

\frac{d X_{k}}{d τ} = κ (A X_{k} + B U_{k}), k = 0, 1, \dots, N .

(48)

Defining an overloaded function

F (X, U)

, Equation (48) can be written as

X^{'} = κ F (X, U) = κ [\begin{matrix} A X_{0} + B U_{0} & A X_{1} + B U_{1} & \dots & A X_{N} + B U_{N} \end{matrix}] .

(49)

Based on Equation (43), a new matrix form of differential transformation is given by:

{D X}^{T} = {\dot{X}}^{T}

(50)

Note that the transpose operation here is because the decision variable

X

is a combination of seven variables, and, after discretization, it is a matrix of

7 \times (N + 1)

, as stated in Equation (46).

Substituting Equation (50) into Equation (49), we get

{D X}^{T} = k F {(X, U)}^{T}

(51)

The dynamic constraints have been turned into a set of algebraic equations, using the differential transformation of the PS method. This means that the RTO problem has become an NLP problem that can be solved by specific solvers [27,28].

For the objective function Equation (21), we make an affine transformation on the time domain, according to Equation (30).

\int_{t_{0}}^{t_{f}} σ (t) d t = \frac{t_{f} - t_{0}}{2} \int_{- 1}^{1} σ (τ) d τ

(52)

The integral term in the objective function is calculated using the Gauss quadrature formula, as described in [44].

\int_{- 1}^{1} σ (τ) d τ = \sum_{i = 0}^{N} w_{i} σ_{i} = w^{T} σ

(53)

where

σ_{i} = σ (τ_{i})

and

w = {[w_{0}, w_{1}, \dots, w_{N}]}^{T}, σ = {[σ_{0}, σ_{1}, \dots, σ_{N}]}^{T}

(54)

The weight, w_i(i = 0,1,2,⋯,N), and the first-order differential matrix,

D

, as mentioned before, have many representations based on different grid

π^{N}

values. Readers can refer to [39,44] for further information, owing to the article’s length and major topics.

Due to the length of the paper, the specific derivation process is omitted and the results are directly used in Section 4.4. Readers can refer to Sagliano’s research [58] for a detailed processing framework. Thus, the PS transformation of the boundary constraints and the path constraints on the grid

π^{N}

remains as in Problem 3. As a result, Problem 3 on the discretized points

τ \in π^{N}

can be rewritten, using the first-order differential PS method (FDPSM).

\begin{array}{l} Varibles : X \in ℝ^{7 \times N + 1}, U \in ℝ^{4 \times N + 1}, τ \in π^{N} \\ Problem 4 (EDPSM) & {\begin{array}{l} minimize & w^{T} σ \\ {D X}^{T} = κ F {(X, U)}^{T} \\ subject to & G (X (- 1), X (1)) = 0, \\ H (X (τ), U (τ)) \leq 0 . \end{array} \end{array}

(55)

4. Well-Conditioned Birkhoff PS Methods for Multiple Dynamic Systems

4.1. Basic Properties of Birkhoff Interpolation

As the basis of the Birkhoff PS method, the Birkhoff interpolation is regarded as a new extension of the Lagrange interpolation and Hermite interpolation [60,61]. Unlike the Lagrange interpolation in Equation (36), the Birkhoff interpolation function is a mix of various orders of the derivatives of a function [60]. Two general forms of the Birkhoff interpolation, based on the first-order or second-order boundary value problems (BVPs), were proposed in [44]. The dynamics system in the RTO can be a first-order system about

(r, v)

, as indicated in Equation (5), or a second-order system about

r

, as shown in Equation (6).

Let

ϕ (τ)

be a first-order, continuously bounded function on the grid points

τ \in π^{N}

. Therefore, a first-order Birkhoff interpolation polynomial

ϕ^{N} (τ)

is defined as [39],

ϕ (τ) ≅ ϕ^{N} (τ) = ϕ (τ_{0}) B_{0}^{1} (τ) + \sum_{k = 1}^{N} ϕ^{'} (τ_{k}) B_{k}^{1} (τ),

(56)

where the Birkhoff interpolation basis polynomials,

B_{k}^{1} (τ), k = 0, 1, \dots, N

, can be regarded as the counterpart of the Lagrange polynomials [39] and must satisfy

\begin{array}{l} B_{0}^{1} (τ_{0}) = 1, B_{k}^{1} (τ_{0}) = 0, k = 1, 2, \dots N . \\ {\dot{B}}_{0}^{1} (τ_{j}) = 0, {\dot{B}}_{k}^{1} (τ_{j}) = δ_{k j}, j = 1, 2, \dots N . \end{array}

(57)

The interpolation polynomial

ϕ^{N} (τ)

needs to meet the interpolation conditions, which can be regarded as an extension of Equation (38)

ϕ^{N} (τ_{0}) = ϕ (τ_{0}), \frac{d ϕ^{N} (τ_{k})}{d τ} = ϕ^{'} (τ_{k}), k = 1, 2, \dots, N .

(58)

Let

B_{j k}^{1} = B_{k}^{1} (τ_{j})

, thus a first-order Birkhoff PS integral matrix (FBPSIM)

B^{1}

is defined as

B^{1} = {[B_{jk}^{1}]}_{0 \leq j, k \leq N}, B_{in}^{1} = {[B_{jk}^{1}]}_{1 \leq j, k \leq N}

(59)

and satisfies that

\tilde{D} B^{1} = I_{N + 1}, D_{in} B_{in}^{1} = I_{N},

(60)

where the matrix

\tilde{D}

is generated by substituting the initial row of

D

with

e_{1} = (1, 0, \dots, 0)

. The calculation of

B^{1}

and the property of

B^{1}

and

B_{i n}^{1}

in Equation (60) are omitted here, and the detailed derivation can be found in Section 4.1, reference [44]. Therefore, the Equation (56) can be rewritten in vector-matrix form:

Φ = B^{1} {\tilde{Φ}}^{'}

(61)

where

Φ = {[\begin{matrix} ϕ (τ_{0}) & ϕ (τ_{1}) & \dots & ϕ (τ_{N}) \end{matrix}]}^{T}, {\tilde{Φ}}^{'} = {[\begin{matrix} ϕ (τ_{0}) & ϕ^{'} (τ_{1}) & \dots & ϕ^{'} (τ_{N}) \end{matrix}]}^{T} .

(62)

Let

ψ (τ)

be a second-order, continuously bounded function on the grid points

τ \in π^{N}

. Similarly, a second-order Birkhoff interpolation polynomial

ψ^{N} (τ)

can be written as

ψ (τ) ≅ ψ^{N} (τ) = ψ (τ_{0}) B_{0}^{2} (τ) + \sum_{k = 1}^{N - 1} ψ^{(2)} (τ_{k}) B_{k}^{2} (τ) + ψ (τ_{N}) B_{N}^{2} (τ),

(63)

the interpolation conditions Equations (57) and (63) can be rewritten as

\begin{array}{l} B_{0}^{2} (τ_{0}) = 1, B_{k}^{2} (τ_{0}) = 0, B_{N}^{2} (τ_{0}) = 1, k = 1, 2, \dots N - 1 \\ B_{0}^{2} (τ_{N}) = 0, B_{k}^{2} (τ_{N}) = 0, B_{N}^{2} (τ_{N}) = 1, k = 1, 2, \dots N - 1 \\ {\ddot{B}}_{0}^{2} (τ_{j}) = 0, {\ddot{B}}_{N}^{2} (τ_{j}) = 0, {\ddot{B}}_{k}^{2} (τ_{j}) = δ_{k j}, j = 1, 2, \dots N - 1 \end{array}

(64)

and

ψ^{N} (τ_{0}) = ψ (τ_{0}), ψ^{N} (τ_{N}) = ψ (τ_{N}), \frac{d^{2} ψ^{N} (τ_{k})}{d τ^{2}} = ψ^{(2)} (τ_{k}), k = 1, 2, \dots, N - 1 .

(65)

Let

B_{j k}^{2} = B_{k}^{2} (τ_{j})

, thus a second-order Birkhoff PS integral matrix (SBPSIM)

B^{2}

is defined as

B^{2} = {[B_{jk}^{2}]}_{0 \leq j, k \leq N}, B_{in}^{2} = {[B_{jk}^{2}]}_{1 \leq j, k \leq N}

(66)

and satisfies that

{\tilde{D}}^{(2)} B^{2} = I_{N + 1}, D_{in}^{(2)} B_{in}^{2} = I_{N}

(67)

where the matrix

{\tilde{D}}^{(2)}

can be produced by replacing the initial and last rows of the matrix

{\tilde{D}}^{(2)}

with

e_{1} = (1, 0, \dots, 0)

and

e_{N} = (0, 0, \dots, 1)

, respectively. The detailed derivation can be found in Sections 3.1 and 3.2, reference [44]. The vector-matrix form of Equation (63) is given as

Ψ = B^{2} {\tilde{Ψ}}^{(2)}

(68)

where

Ψ = {[\begin{matrix} ψ (τ_{0}) & ψ (τ_{1}) & \dots & ψ (τ_{N}) \end{matrix}]}^{T}, {\tilde{Ψ}}^{(2)} = {[\begin{matrix} ψ (τ_{0}) & ψ^{(2)} (τ_{1}) & \dots & ψ^{(2)} (τ_{N - 1}) & ψ (τ_{N}) \end{matrix}]}^{T} .

(69)

4.2. The First-Order Birkhoff PS Method for RTO

The first-order Birkhoff PS method (FBPSM) is based on the first-order Birkhoff interpolation in Equation (56). For the first-order system, we approximate the state function

p (τ)

by applying Equation (56) to Problem 3. The approximation function

p^{N} (τ)

on the grid

π^{N}

can be rewritten as

p (τ) ≅ p^{N} (τ) = p (τ_{0}) B_{0}^{1} (τ) + \sum_{k = 1}^{N} p^{'} (τ_{k}) B_{k}^{1} (τ) .

(70)

where the vector

{\tilde{p}}^{'} = {[\begin{matrix} p_{0} & {p^{'}}_{1} & \dots & {p^{'}}_{N} \end{matrix}]}^{T}

is regarded as the unknown optimization variable and

B_{k}^{1}, k = 0, 1, \dots, N

satisfies Equation (57). Taking the derivative of the time

τ

on both sides of Equation (70), we get

p^{'} (τ) ≅ \frac{d p^{N} (τ)}{d τ} = p (τ_{0}) {\dot{B}}_{0}^{1} (τ) + \sum_{k = 1}^{N} p^{'} (τ_{k}) {\dot{B}}_{k}^{1} (τ) .

(71)

As in Equation (61), the vector-matrix form of Equation (70) is given by

p = B^{1} {\tilde{p}}^{'} .

(72)

Therefore, the vector-matrix form of Equation (71) can be given by

p^{'} = I^{1} {\tilde{p}}^{'} .

(73)

where

I^{1} = {[I_{k}^{1} (τ_{j})]}_{0 \leq j, k \leq N} = \frac{d B^{1}}{d τ} = {[{\dot{B}}_{j k}^{1}]}_{0 \leq j, k \leq N}

represents the derivative of

B^{1}

over

τ

.

According to Equations (43) and (70), it can be deduced that

I^{1} = D B^{1}, I_{in}^{1} = D_{in} B_{in}^{1} = I_{N}

(74)

where

I_{in}^{1} = {[{\dot{B}}_{jk}^{1}]}_{1 \leq j, k \leq N}

(75)

Applying the property of the FBPSM to Problem 3, the discretized unknown variables can be denoted by

P : = [x (τ_{0}), x^{'} (τ_{1}), \dots, x^{'} (τ_{N})] \in ℝ^{7 \times (N + 1)}

(76)

Combining the decision variables defined in Equation (46), the dynamic constraint in Equation (51) is rewritten as

I_{0}^{1} P^{T} = κ F_{0} (X_{0}, U_{0})

(77)

I_{i n}^{1} P_{i n}^{T} = P_{i n}^{T} = κ F_{i n} {(X_{i n}, U_{i n})}^{T}

(78)

where

\begin{array}{l} I_{0}^{1} = [\begin{matrix} I_{0}^{1} (τ_{0}) & I_{1}^{1} (τ_{0}) & \dots & I_{N}^{1} (τ_{0}) \end{matrix}], P_{i n} = [x^{'} (τ_{1}), x^{'} (τ_{2}), \dots, x^{'} (τ_{N})], \\ F_{0} (X_{0}, U_{0}) = F (X_{0}, U_{0}) = A X_{0} + B U_{0}, \\ F_{i n} (X_{i n}, U_{i n}) = F (X_{i n}, U_{i n}) = [\begin{matrix} A X_{1} + B U_{1} & A X_{2} + B U_{2} & \dots & A X_{N} + B U_{N} \end{matrix}] . \end{array}

(79)

Therefore, Problem 3 on the discretized points

τ \in π^{N}

is given by using the first-order Birkhoff PS method (FBPSM).

\begin{array}{l} Variables : P \in ℝ^{7 \times (N + 1)}, U \in ℝ^{4 \times (N + 1)}, \\ Intermediate Variable : X = {(B^{1} P^{T})}^{T} \in ℝ^{7 \times (N + 1)} \\ P r o b l e m 5 (FBPSM) {\begin{cases} minimize w^{T} σ \\ \begin{matrix} subject to & \begin{array}{l} I_{0}^{1} P^{T} = κ F_{0} (X_{0}, U_{0}) \\ P_{i n}^{T} = k F_{i n} {(X_{i n}, U_{i n})}^{T}, \\ G (X (- 1), X (1)) = 0, \\ ℋ (X (τ), U (τ)) \leq 0 \end{array} \end{matrix} \end{cases} \end{array}

(80)

.

4.3. The Second-Order Birkhoff PS Method for RTO

The second-order Birkhoff PS method (SBPSM) is based on the second-order Birkhoff interpolation in Equation (63). The Equations (13) and (14) in the dynamic constraint can be expressed in a second-order form.

\ddot{r} = C_{1} r + C_{2} \dot{r} + γ .

(81)

Let set

q (τ)

be a component of

r

. Applying Equation (63) to Equation (81) on the grid

π^{N}

, we get

q (τ) ≅ q^{N} (τ) = q (τ_{0}) B_{0}^{2} (τ) + \sum_{k = 1}^{N - 1} q^{(2)} (τ_{k}) B_{k}^{2} (τ) + q (τ_{N}) B_{N}^{2} (τ) .

(82)

where the vector

{\tilde{q}}^{(2)} = {[\begin{matrix} q (τ_{0}) & q^{(2)} (τ_{1}) & \dots & q^{(2)} (τ_{N - 1}) & q (τ_{N}) \end{matrix}]}^{T}

is the unknown optimization variable,

q^{N} (τ)

is the approximation function, and

B_{k}^{2} (τ), k = 0, 1, \dots, N

satisfies the condition in Equation (64). Differentiating twice on both sides of Equation (82), we get

{\ddot{q}}^{N} (τ) = q (τ_{0}) {\ddot{B}}_{0}^{2} (τ) + \sum_{k = 1}^{N - 1} q^{(2)} (τ_{k}) {\ddot{B}}_{k}^{2} (τ) + q (τ) {B ¨}_{N}^{2} (τ) .

(83)

The vector-matrix form of Equation (82) can be written as

q = B^{2} {\tilde{q}}^{(2)} .

(84)

where

q = {[\begin{matrix} q (τ_{0}) & q (τ_{1}) & \dots & q (τ_{N}) \end{matrix}]}^{T}

and

B^{2}

is the SBPSIM defined in Equation (66).

The vector-matrix form of Equation (83) is given as

q^{(2)} = I^{2} {\tilde{q}}^{(2)} .

(85)

where

I^{2} = {[I_{k}^{2} (τ_{j})]}_{0 \leq j, k \leq N} = \frac{d^{2} B^{2}}{d τ^{2}} = {[{\dot{B}}_{j k}^{2}]}_{0 \leq j, k \leq N}

represents the derivative of

B^{2}

over

τ

.

Combining the properties of Equations (45) and (67), we have

I^{2} = D^{(2)} B^{2}, I_{in}^{2} = D_{in}^{(2)} B_{in}^{2} = I_{N - 1}

(86)

where

I_{in}^{2} = {[{\overset{˙ ˙}{B}}_{jk}^{2}]}_{1 \leq j, k \leq N - 1}

(87)

Applying the SBPSM to Problem 3, the unknown optimization variables are defined as

Q = {[\begin{matrix} r (τ_{0}) & r^{(2)} (τ_{1}) & \dots & r^{(2)} (τ_{N - 1}) & r (τ_{N}) \end{matrix}]}^{T} \in ℝ^{3 \times (N + 1)}

(88)

The new discretized representation of the decision variables is given as

X^{r} {(B^{2} Q^{T})}^{T} \in ℝ^{3 \times (N + 1)}, {X^{'}}^{r} = \frac{d X^{r}}{d τ} = {(D B^{2} Q^{T} / k)}^{T} \in ℝ^{3 \times (N + 1)}

(89)

Thus, Equation (81) is rewritten as

\begin{array}{l} I_{0}^{2} Q^{T} = κ^{2} F_{0} (X_{0}^{r}, X_{0}^{' r}, U_{0}), \\ I_{i n}^{2} Q_{i n}^{T} = Q_{i n}^{T} = κ^{2} F_{i n} {(X_{i n}^{r}, {X^{'}}_{i n}^{r}, U_{i n})}^{T}, \\ I_{N}^{2} Q^{T} = κ^{2} F_{N} (X_{N}^{r}, {X^{'}}_{N}^{r}, U_{N}) \end{array}

(90)

where

\begin{array}{l} I_{0}^{2} = [\begin{matrix} I_{0}^{2} (τ_{0}) & I_{1}^{2} (τ_{0}) & \dots & I_{N}^{2} (τ_{0}) \end{matrix}], I_{N}^{2} = [\begin{matrix} I_{0}^{2} (τ_{N}) & I_{1}^{2} (τ_{N}) & \dots & I_{N}^{2} (τ_{N}) \end{matrix}], \\ Q_{i n} = [\begin{matrix} r^{(2)} (τ_{1}) & r^{(2)} (τ_{2}) & \dots & r^{(2)} (τ_{N - 1}) \end{matrix}], X_{i n}^{r} = [\begin{matrix} X_{1}^{r} & X_{2}^{r} & \dots & X_{N - 1}^{r} \end{matrix}], \\ {X^{'}}_{i n}^{r} = [\begin{matrix} {X^{'}}_{1}^{r} & {X^{'}}_{2}^{r} & \dots & {X^{'}}_{N - 1}^{r} \end{matrix}], U_{i n} = [\begin{matrix} U_{1} & U_{2} & \dots & U_{N - 1} \end{matrix}], C_{3} = [\begin{matrix} I_{3 \times 3} & 0 \\ 0_{1 \times 3} & 0 \end{matrix}] \\ F_{0} (X_{0}^{r}, {X^{'}}_{0}^{r}, U_{0}) = C_{1} X_{0}^{r} + C_{2} {X^{'}}_{0}^{r} + C_{3} U_{0}, F_{N} (X_{N}^{r}, {X^{'}}_{N}^{r}, U_{N}) = C_{1} X_{N}^{r} + C_{2} {X^{'}}_{N}^{r} + C_{3} U_{N}, \\ F_{i n} (X_{i n}^{r}, {X^{'}}_{i n}^{r}, U_{i n}) = [\begin{matrix} F_{1} (X_{1}^{r}, {X^{'}}_{1}^{r}, U_{1}) & F_{2} (X_{2}^{r}, {X^{'}}_{2}^{r}, U_{2}) & \dots & F_{N - 1} (X_{N - 1}^{r}, {X^{'}}_{N - 1}^{r}, U_{N - 1}) \end{matrix}] . \end{array}

In addition, the FBPSM is used to deal with the remaining first-order equation, Equation (15). We define the unknown variable as

Z = [\begin{matrix} ρ (τ_{0}) & ρ^{'} (τ_{1}) & \dots & ρ^{'} (τ_{N}) \end{matrix}] \in ℝ^{1 \times (N + 1)}

(91)

The new discretized representation of the decision variables

ρ

is given as

X^{ρ} = {(B^{1} Z^{T})}^{T} \in ℝ^{1 \times (N + 1)}

(92)

Thus, Equation (15) can be rewritten as

\begin{array}{l} I_{0}^{1} Z_{0} = M_{0} (X_{0}^{ρ}, U_{0}), \\ I_{i n}^{1} Z_{i n}^{T} = M_{i n} (X_{i n}^{ρ}, U_{i n}^{ρ}), \end{array}

(93)

where

\begin{array}{l} Z_{i n} = [\begin{matrix} ρ^{'} (τ_{1}) & ρ^{'} (τ_{2}) & \dots & ρ^{'} (τ_{N}) \end{matrix}], C_{4} = [\begin{matrix} 0 & 0 & 0 & 1 \end{matrix}], U_{i n}^{ρ} = [\begin{matrix} U_{i n} & U_{N} \end{matrix}] \\ M (Z_{0}, U_{0}) = μ C_{4} U_{0}, M_{i n} (X_{i n}^{ρ}, U_{i n}^{ρ}) = {[\begin{matrix} μ C_{4} U_{1} & μ C_{4} U_{2} & \dots & μ C_{4} U_{N} \end{matrix}]}^{T} . \end{array}

Therefore, Problem 3 on the discretized points

τ \in π^{N}

is given by using the second-order Birkhoff PS method (SBPSM).

\begin{array}{l} Varibles : Q \in ℝ^{3 \times N + 1}, Z = \in ℝ^{1 \times N + 1}, U \in ℝ^{4 \times N + 1}, τ \in π^{N} \\ Intermediate Varibles : X^{r} = {(B^{2} Q^{T})}^{T} \in ℝ^{3 \times N + 1}, {X^{'}}^{r} = {(D B^{2} Q^{T} / κ)}^{T} \in ℝ^{3 \times N + 1}, \\ \begin{array}{l} X^{ρ} = B^{1} Z^{T} \in ℝ^{1 \times N + 1} \\ Problem 6 (SBPSM) & {\begin{matrix} minimize & w^{T} σ \\ I_{0}^{2} Q^{T} = κ^{2} F_{0} (X_{0}^{r}, {X^{'}}_{0}^{r}, U_{0}), \\ Q_{in}^{T} = κ^{2} F_{in} {(X_{in}^{r}, {X^{'}}_{in}^{r}, U_{in})}^{T}, \\ I_{N}^{2} Q^{T} = κ^{2} F_{N} (X_{N}^{r}, {X^{'}}_{N}^{r}, U_{N}), \\ subject to & I_{0}^{1} Z_{0} = M_{0} (X_{0}^{ρ}, U_{0}), \\ I_{in}^{1} Z_{in}^{T} = M_{in} (X_{in}^{ρ}, U_{in}^{ρ}), \\ G (X (- 1), X (1)) = 0, \\ H (X (τ), U (τ)) \leq 0 . \end{matrix} \end{array} \end{array}

(94)

4.4. Sequential Convex Algorithm for RTO

In this section, the sequential convex (SC) algorithm is used to solve the finite convex problem after the convexification and discretization. The main idea is to solve a series of convex subproblems to converge to the optimal solution

{x^{*}, u^{*}}

. The iteration process via the SC algorithm is described in Algorithm 1.

Algorithm 1 The iteration process of the SC algorithm

SC Algorithm for RTO

Input: The initial state x₀ = [r₀,v₀] and the terminal state x_f = [r_f,v_f]; the initial mass of the Chaser m₀; the initial and terminal time [t₀,t_f] .

Step 1: Calculate the initial trajectory r⁰ without the no-fly constraint and set it as the reference trajectory r^ref of the first iteration. Go to step 2.

Step 2: Start the loop until meeting the trust-region constraint.

1. For k = 1:Maxiter;

2. Calculate the current trajectory {r^k,u^k};

3. Calculate the maximum distance Δd_max between r^k and r^ref;

4. Check the trust-region constraint:

Δ d_{\max} = ‖ r^{k} - r^{r e f} ‖ \leq ε .

If Δd_max ≤ ε established, exit the loop and go to step 3; otherwise, set r^ref = r^k and continue.

5. End.

Step 3: Evaluate the accuracy of the solution e. Go to step 4.

Step 4: Output the result. The optimal trajectory and optimal control are {r*,u*} ; the optimal objective value is j*.

Remark 1.

The convergence of the SC algorithm is significantly influenced by a good initial trajectory. We propose to cancel the no-fly zone constraint to achieve this goal, which is also given by the subsequent simulation. Therefore, the first subproblem is given as

\begin{matrix} Variables : x^{0} \in ℝ^{7}, u^{0} \in ℝ^{4}, τ \in [- 1, 1] \\ Subproblem 1 {\begin{matrix} minimize & J = k \int_{- 1}^{1} σ^{0} (τ) d τ \\ {\dot{x}}^{0} (τ) = κ (A x^{0} (τ) + {Bu}^{0} (τ)), \\ ‖ γ^{0} ‖ \leq σ^{0}, \\ subject to & 0 \leq σ^{0} \leq T_{\max} e^{- ρ_{0} (τ)} [1 - (ρ^{0} (τ) - ρ_{0} (τ))] \\ x^{0} (τ_{0}) = x_{0}, x^{0} (τ_{f}) = x_{f} . \end{matrix} \end{matrix}

(95)

Remark 2.

In step 2, the subproblem required to be solved in the

k

-th iteration with all constraints is as follows:

\begin{matrix} Variables : x^{k} \in ℝ^{7}, u^{k} \in ℝ^{4}, τ \in [- 1, 1] \\ Subproblem 2 {\begin{matrix} minimize & J = κ \int_{- 1}^{1} σ^{k} (τ) d τ \\ {\dot{x}}^{k} (τ) = k (A x^{k} (τ) + {Bu}^{k} (τ)), \\ ‖ γ^{k} ‖ \leq σ^{k}, \\ subject to & 0 \leq σ^{k} \leq T_{\max} e^{- ρ_{0} (τ)} [1 - (ρ^{k} (τ) - ρ_{0} (τ))] \\ r_{p}^{T} r^{k} \geq R_{s}^{2}, \\ x^{k} (τ_{0}) = x_{0}, x^{k} (τ_{f}) = x_{f} . \end{matrix} \end{matrix}

(96)

The parameter maxiter in step 2 represents the SC algorithm’s maximum number of iterations. The SC algorithm does not converge to the optimal solution, fulfilling the trust-region condition, if

k

takes the value of maxiter. It’s feasible to replace more relaxed convergence conditions or add the mesh points to improve the convergence of the SC algorithm.

Remark 3.

In step 3, we linearly interpolate the optimal control

u^{k}

of the iterative result of the SC algorithm, integrate the original dynamic equation with fourth-order Runge–Kutta to obtain the terminal state

{\hat{x}}^{k}

, and then compare the terminal value

{\hat{r}}^{k} (τ_{f})

to the given accurate value

r (τ_{f})

to calculate the terminal error

e

, which is used as the algorithm’s accuracy index.

e = | {\hat{r}}^{k} (τ_{f}) - r (τ_{f}) | .

(97)

The stepsize of the fourth-order Runge–Kutta integral was 0.001 in the PS time domain.

Remark 4.

To simplify the problem, only one no-fly zone constraint is considered in step 2, and the geometric shape is a sphere or an ellipsoid. Readers can learn more about the excellent work on multiple no-fly zones and complex geometries through [62,63].

For the ease of comprehension, we provide the complete process framework of the SC algorithm, using the FBPSM discretization method as an illustration, in Figure 3.

Figure 3. The complete process framework of the SC algorithm takes the FBPSM discretization method as an example.

5. Numerical Simulation

In this section, two cases are given to illustrate the feasibility and advantages of the proposed SC algorithm, based on the first-order and second-order Birkhoff pseudospectral discretization methods (FBPSM and SBPSM). As comparison methods, other discretization techniques, such as the zero-order hold (ZOH) [26] and the classic pseudospectral method, as mentioned in Section 3 (FDPSM), are chosen.

All numerical simulations were performed on a desktop with an Intel (R) Core (TM) i5-10400 (CPU 2.90 GHz) and with MATLAB version 2020a. The modeling tool is Yalmip [64] and the solver tool is Mosek [21]. Additionally, we found through research that the preprocessing option of the MOSEK solver significantly affects the results of the solution. Therefore, we changed the value of the option “MSK_IPAR_PRESOLVE_USE” to OFF and the reason is given in the following section.

Nondimensionalization is an effective method to simplify the calculation and improve accuracy. Table A1 in Appendix A provides a list of the principal dimensionless units. The radius of the earth,

R_{e}

, is 6378.14 km, and the gravitational acceleration,

g_{0}

, is 9.807 m/s. The value of the geocentric gravitational constant,

G_{M}

, is

3.986012 \times 10^{14} m^{3} {/ s}^{2}

. The satellite completes the maneuvering and rendezvous at an orbital altitude,

H_{0}

, of 600 km. The initial mass of the Chaser,

M_{0}

, is 1000 kg. The initial parameters of the simulations are dimensionless, as shown in Table A2, Appendix A. Note that the variables contained in this section tables and figures are defined in the LVLH coordinate system.

5.1. Case 1: Excellent Computational Efficiency and Solution Accuracy under Different Grid Points

We set up several numerical tests with different grid points to evaluate the accuracy of the solution and the computational efficiency of the proposed methods. In general, the grid points of the ZOH method are evenly distributed. The Runge phenomenon [54] becomes increasingly prevalent as the number of grid points rises, which reduces the precision of the solution. Although the FDPSM improves the situation to a certain extent, it destroys the sparsity of the system, which is manifested in the exponential growth of the condition number of the coefficient matrix and leads to a longer solution time. This situation can be improved by using the proposed methods, based on the Birkhoff interpolation. Given that reference [39] contains the outcomes of the first-order system, we just provide the change law of the condition number of the coefficient matrix under the second-order system, as depicted in Figure A1, Appendix A. The definitions of three types of coefficient matrices can be found in Section 5 of [39]. The condition number, based on the second-order Birkhoff PSIM

B^{2}

, shows an obvious decrease from

O (N^{4})

to

O (\sqrt{N})

, even with an invariable constant

O (1)

, which indicates that the Birkhoff PS method also has a significant effect on improving the condition number of the second-order system.

First, we divided the tests into seven groups, each with a number of grid points ranging from 30 to 210. The total time,

T_{t o t a l}

, required to arrive at the optimal solution is utilized as the evaluation criterion for computational efficiency, and it primarily involves the two stages of problem-modeling and solver-solving. The expression is as follows:

T_{t o t a l} = (T_{m o d e l}^{0} + T_{s o l v e r}^{0}) + \sum_{k = 1}^{S_{i t e r}} (T_{m o d e l}^{k} + T_{s o l v e r}^{k}) + T_{o t h e r},

(98)

where the term

(T_{m o d e l}^{0} + T_{s o l v e r}^{0})

represents the time of solving Subproblem 1 in Equation (95), the intermediate-term denotes the time of solving Subproblem 2 in Equation (96), and

S_{i t e r}

is the number of iterations. The term

T_{o t h e r}

is the time of the remaining parts in Figure 3 and is equal, in principle, for the above four methods. The terminal state error,

e

, which is stated in Equation (97) of remark 3, is used to determine the accuracy of the solution. The value of the performance index,

J_{o b j}

, is related to the optimality of the solution. The statistical results are shown in Table 1.

Table 1. The statistical results under different grid points N.

The results in Table 1 demonstrate that the number of grid points,

N

, does have an impact on the computational efficiency and some new findings are given here. The indicators of the final three methods show a minimal difference, whereas the ZOH method’s total time,

T_{total}

, has nearly tripled and has a significant terminal error, according to the findings of the first four groups. However, when

N

is selected as 150 or more, the total solution time of the FDPSM rises sharply, which is roughly three times that of the Birkhoff PS methods.

According to our analysis, the Birkhoff PS methods decrease the condition number of the coefficient matrix, enhance the dynamics equations, and thus shorten the modeling time of the optimization problems. Additionally, we discover that the total solution time of the SBPSM is less than the FBPSM. In comparison to the FDPSM and FBPSM, the SBPSM reduces the number of unknown variables, which, in turn, reduces the dimension of the problem and ultimately speeds up the solving process. Figure 4 illustrates the model time and solver time at various grid points and supports the validity of the conclusion. For example, the model time of the FDPSM in Figure 4a is greater than that of the FBPSM and SBPSM, when

N

is set as 150 or more. In contrast, the two first-order approaches take longer to solve the problem than the SBPSM in Figure 4b.

Figure 4. The curve of model time and solver time under the different number of grid points.

A perfect discretization method has a modest terminal state error, in addition to a brief total solution time. Figure 5 demonstrates that, although the terminal error of ZOH steadily lowers as the number of grid points increases, it is still higher than that of other discretization approaches. In terms of the solution accuracy, this result demonstrates that the ZOH is inferior to the pseudospectral discretization method. The outcomes also indicate that the new discrete approach, which is based on the Birkhoff interpolation, inherits the superior qualities of the conventional pseudospectral method. Figure 6 displays the position errors as measured by the difference between the optimal trajectory and the propagated trajectory in both the x-axis and y-axis directions. The solution accuracy of the SBPSM is higher because, during the entire maneuver, the error of the SBPSM is smaller than the result of the ZOH in both directions.

Figure 5. The terminal position and velocity error under the different number of grid points.

Figure 6. The position errors, as measured by the difference between the optimal trajectory and the propagated trajectory. (N = 90).

Furthermore, we investigated the impact of the grid points count on the no-fly zone and thrust constraints. The findings indicate that increasing the grid points was necessary to improving the practicality of the ideal solution and the maneuverability. Readers can refer to Appendix B for details.

In summary, to ensure that the constrained problem has an optimal solution, we need to set the number of grid points,

N

, to be large enough, which will reduce the computational efficiency and solution accuracy. The proposed methods based on the first-order and second-order Birkhoff interpolation can significantly improve the efficiency of the solution, while maintaining high accuracy.

5.2. Case 2: Sensitivity Analysis under Different Conditions

In this section, the performance of the proposed algorithm was verified by a sensitivity analysis under different test conditions. The main measurement indicators were the total solution time, the number of iterations, and the terminal state error. Firstly, two conditions were considered: the type of grid points, and the shape of the no-fly zone. By adding the perturbation

Δ y

to the initial position in the y-axis direction, a Monte Carlo analysis was conducted. The grid points were chosen between LGL and CGL [26] for the fuel optimization problem over a fixed time interval. In addition to the sphere, the ellipsoid was also selected for the shape of the no-fly zone. Table A2 in Appendix A displays the shape characteristics of no-fly zones. The value of the perturbation

Δ y (m)

was randomly generated within the range of [−1,1] m and the number of simulation groups was set as 300. The different simulation settings are denoted by the titles of the subgraphs in Figure 7 and Figure 8, and three pseudospectral methods are described in the legend. The ZOH technique is not considered in this section, due to the uniform distribution of the grid points and the difficulty in meeting the accuracy requirements with the maximum number of iterations, due to the huge terminal position error.

Figure 7. The curves of the total solution time T_total under different types of grid points and no-fly zones.

Figure 8. The curves of the number of iterations S_iter under different types of grid points and no-fly zones.

It is clear from Figure 7 that the FBPSM and SBPSM have more stable total solution times than the FDPSM, as evidenced by the roughly linear growth and square growth trends once the grid point exceeds 100, respectively. At the same time, the SBPSM performs marginally better than the FBPSM, which is in line with the analysis in the preceding section. The number of iterations of the three methods in Figure 8 displays a “steady” fluctuation, but the interpretation is different. The fact that the number of iterations fluctuates between five and eight for the FBPSM and SBPSM indicates that the two approaches can steadily converge to the optimal solution. The approach of the FDPSM cannot ensure a stable convergence to the optimal solution since the number of iterations varies between the maximum and the finite number of iterations (

M a x i t e r = 20

). The findings demonstrate that the SC algorithm, based on the FBPSM and SBPSM, is suitable for the four circumstances listed and can maintain a faster solution and more stable convergence when there are more grid points.

The purpose of the Monte Carlo simulation tests is to evaluate the sensitivity of these three methods in dealing with the initial state perturbations. The tests are divided into two scenarios:

N = 90

and

N = 150

. Each test only changes the initial position in the

y

-axis direction, and the perturbations are randomly generated. The total solution time,

T_{t o t a l}

, and the terminal position error,

e_{y}

, are chosen as the evaluation indices. The results are presented in Appendix A.

A visual evaluation model is created in Figure A2 by using the total solution time as the vertical axis and the terminal position error as the horizontal axis. The overall performance of this method is better the closer the results are to the lower left corner. In scenario 1, as depicted in Figure A2a, the output from the three algorithms reveals two identical clustering zones, with an error line of 0.4 m serving as the dividing line. The increase in the terminal position accuracy is accompanied by a jump in the total solution time, which rises from the right side of the dividing line to the left side of the dividing line, with an increase of roughly two seconds. The clustering regions’ findings indicate that, while the solution accuracy may increase in some ranges, the total solution time may remain unchanged. Figure A2b depicts the differential distribution between the FDPSM and the Birkhoff pseudospectral methods results in scenario 2. Although the majority of the FDPSM results are clustered around

T_{t o t a l} = 60 s

, there are some scattered points in a wider range in both the horizontal and vertical directions. The results of the FBPSM and the SBPSM are more concentrated, and the advantage in the total solution time can be perceived. To some extent, the approach of the FDPSM is more sensitive to the initial position disturbance.

Figure A3 illustrates the findings of our quantitative analysis of the Monte Carlo simulation data, which is an addition to the results of the qualitative study discussed above. The terminal position errors of the three methods are primarily spread in the range of [0,1.2] m depicted in Figure A3a, with a median value of roughly 0.5 m. The highest limit of the FBPSM is closer to 15 s in terms of total solution time, whereas the majority of the SBPSM outcomes (about 75%) are 1.5 s faster than the other two approaches. When the number of grid points is set to 150, in Figure A3b, the terminal position error of the FDPSM spreads to 1.5 m, the median value of the total solution time rises from 10 s to 60 s, and the properties of the centralized distribution deteriorate. The original error level is maintained by the FBPSM and SBPSM, and the median value of the total solution time is 21 s and 19 s, respectively. In terms of both the solution accuracy and computational efficiency, the Birkhoff pseudospectral approaches are more stable than the FDPSM.

Through sensitivity analysis, it is found that, compared with the FDPSM, the Birkhoff pseudospectral methods have the following advantages: (i) they can converge to the optimal solution steadily at a faster speed and are suited for two different types of grid points and no-fly zones. (ii) In the case of the initial position disturbance, they can maintain a more stable terminal position error distribution and higher computational efficiency. (iii) Under the same conditions, the second-order Birkhoff pseudospectral method (SBPSM) is better than the first-order Birkhoff pseudospectral method (FBPSM), both in solution time and the robustness against the initial position disturbances.

6. Conclusions

With the increase in the series of space missions, such as the space station construction, on-orbit maintenance, material supply, and astronaut rotation, in the future, the RPOs will play a more important role. The popularization of automatic rendezvous technology will be of great significance to save human and material resources. The method proposed in this paper combines convex optimization technology with the Birkhoff pseudospectral method. The advantages of convex optimization techniques, such as polynomial complexity and global optimal solution, are inherited. At the same time, the pseudospectral method, based on the Birkhoff interpolation, overcomes the shortcomings of the original, traditional differential operator pseudospectral method. With the increase of the polynomial order,

N

, the proposed methods can not only maintain the solution accuracy but also greatly improve the computational efficiency. Especially for the dynamic system, the second-order Birkhoff pseudospectral method will have a more obvious improvement effect. Under certain conditions, the improvement of the computational efficiency can be expanded by up to three times. This work discovery will provide a possible basis for realizing spacecraft autonomous online real-time trajectory planning. Further exploring the computational performance of the proposed method and striving for practical onboard applications will be the goal of our future work.

Author Contributions

Conceptualization, Z.Z., D.Z. and X.L.; methodology, Z.Z. and D.Z.; software, Z.Z.; validation, Z.Z., D.Z. and X.L.; formal analysis, Z.Z.; investigation, Z.Z.; resources, Z.Z.; data curation, D.Z. and X.L.; writing—original draft preparation, Z.Z.; writing—review and editing, D.Z. and X.L.; visualization, C.K. and M.S.; supervision, D.Z.; project administration, D.Z.; funding acquisition, D.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key Research and Development Program of China, grant number 2021YFC3090401.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We thank the anonymous reviewers for providing many detailed comments. Their comments greatly improved the quality of this paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A contains some of the tables and result charts that are needed for Chapter 5.

Table A1. Dimensionless units and values for Section 5.1.

Dimensionless Unit	Value
Length $L_{u}$	$R_{e}$
Speed $V_{u}$	$\sqrt{G_{M} / R_{e}}$
Time $T_{u}$	$\sqrt{{(R_{e})}^{3} / G_{M}}$
Acceleration $a_{u}$	${G_{M} / (R_{e})}^{2}$
Mass $M_{u}$	$M_{0}$
Force $F_{u}$	${M_{0} G_{M} / (R_{e})}^{2}$

Table A2. Initialization parameters of the RTO problem for Section 5.1.

Initialization Parameters	Value
Initial Mass	$m_{0} = 1000 / M_{u}$
Initial and Terminal Time	$t_{0} = 0, t_{f} = 500 / T_{u}$
Initial and Terminal Position	$r_{0} = [0, - 100, 0] / L_{s}; r_{f} = [0, 20, 0] / L_{u}$
Initial and Terminal Speed	$v_{0} = [0, 0, 0] / V_{u}; v_{f} = [0, 0, 0] / V_{u}$
No-fly Zone Radius	$\begin{matrix} R_{s} = 10 / L_{u} (S p h e r i c a l); \\ R_{x} = 10 / L_{u}; R_{y} = 15 / L_{u}; R_{z} = 10 / L_{u} (E l l i p s o i d) \end{matrix}$
Maximum Thrust Limit	$[\begin{matrix} T_{\min} & T_{\max} \end{matrix}] = [\begin{matrix} 0 & 10 / F_{u} \end{matrix}]$
Reciprocal of effective exhaust velocity	$μ = V_{u} / 2000$

Figure A1. The change law of condition number of the coefficient matrix under the second-order system.

Figure A2. 6 × 300 groups of Monte Carlo simulation results based on initial position perturbation

Δ y

.

Figure A2. 6 × 300 groups of Monte Carlo simulation results based on initial position perturbation

Δ y

.

Figure A3. Distribution statistics data of 6 × 300 groups of Monte Carlo simulation results.

Appendix B

Figure A4a illustrates that the no-fly zone constraint fails when N is 30 because the Chaser’s optimal trajectory, as determined by the FDPSM, passes across the no-fly zone. The optimal trajectory, as seen in Figure A4b, flies around the boundary of the no-fly zone and finally reaches the desired position safely when there are 120 grid points. This demonstrates that the effectiveness of the no-fly zone constraint is correlated with the number of grid points.

Figure A4. The two-dimensional trajectory of the Chaser satellite approaches the spherical no-fly zone.

Generally, the distribution characteristics of the Chebyshev–Gauss–Lobatto (CGL) grid points cluster at both ends and are sparse in the middle. As a result, when N is too small, there are few interpolation points near the no-fly zone, causing the trajectory to directly cross this area, as illustrated in Figure A5. Finally, the optimal trajectory fails to meet the no-fly zone constraint, and the two satellites may collide in the close-range rendezvous mission. There are enough interpolation nodes close to the no-fly zone’s boundary when

N

is a suitable value. Through the successive iterative optimization process, the Chaser satellite avoids the no-fly zone along the optimal trajectory and reaches the endpoint, as shown in Figure A6. The results and conclusions obtained by using the FBPSM and SBPSM are consistent with the above findings using the FDPSM. Therefore, after assessing the computational efficiency and precision of the solution, the Birkhoff pseudospectral method is unquestionably a better option when the number of grid points must be raised to satisfy the requirements.

Figure A5. Distribution of trajectory grid points (red circles) near the no-fly zone. (N = 30).

Figure A6. Distribution of trajectory grid points near the no-fly zone. (N = 120).

Figure A7 depicts the variation curve of the component and the magnitude of the Chaser’s thrust when the number,

N

, is taken as 30 and 120, respectively. First, from the two subgraphs on the right, it can be found that the slack variable,

ζ

, is completely consistent with the change curve of the thrust magnitude,

‖ T ‖

. This illustrates that the lossless convexity operation is effective and verifies that the solution of the relaxed Problem 2 is equivalent to the original Problem 1. From the subgraphs of the thrust component, it can be found that the thrust mainly acts on the

x

-axis and the

y

-axis, which is also in line with the characteristics of the maneuvering transfer in the

x O y

plane, with minimum fuel consumption. Moreover, when the number of grid points is small, the change in thrust is relatively stable and slow, especially when passing through the no-fly zone, when the working time of the whole engine is about 80 s. In sharp contrast, when

N

is 120, the satellite makes a rapid maneuver response when approaching the no-fly zone, which lasts only 15 s. From the previous result analysis, it is known that, in the end, the former mission failed, while the latter perfectly avoided the no-fly zone and reached the destination.

Figure A7. The curve of the thrust components and magnitude for the spherical no-fly zone when N is 30 and 120,respectively.

References

Fehse, W. Automated Rendezvous and Docking of Spacecraft; Cambridge University Press: Cambridge, UK, 2003. [Google Scholar] [CrossRef]
Chamberlin, J.A.; Rose, J.T. Gemini rendezvous program. J. Spacecr. Rocket. 1964, 1, 13–18. [Google Scholar] [CrossRef]
Young, K.A.; Alexander, J.D. Apollo lunar rendezvous. J. Spacecr. Rocket. 1970, 7, 1083–1086. [Google Scholar] [CrossRef]
Murtazin, R.F.; Budylov, S.G. Short rendezvous missions for advanced Russian human spacecraft. Acta Astronaut. 2010, 67, 900–909. [Google Scholar] [CrossRef]
Goodman, J.L. History of space shuttle rendezvous and proximity operations. J. Spacecr. Rocket. 2006, 43, 944–959. [Google Scholar] [CrossRef]
Woffinden, D.C.; Geller, D.K. Navigating the road to autonomous orbital rendezvous. J. Spacecr. Rocket. 2007, 44, 898–909. [Google Scholar] [CrossRef]
Luo, Y.; Zhang, J.; Tang, G. Survey of orbital dynamics and control of space rendezvous. Chin. J. Aeronaut. 2014, 27, 1–11. [Google Scholar] [CrossRef]
Morante, D.; Sanjurjo Rivo, M.; Soler, M. A Survey on Low-Thrust Trajectory Optimization Approaches. Aerospace 2021, 8, 88. [Google Scholar] [CrossRef]
Bucchioni, G.; Innocenti, M. Rendezvous in Cis-Lunar Space near Rectilinear Halo Orbit: Dynamics and Control Issues. Aerospace 2021, 8, 68. [Google Scholar] [CrossRef]
Kawano, I.; Mokuno, M.; Kasai, T.; Suzuki, T. Result and evaluation of autonomous rendezvous docking experiment of ETS-VII. Guid. Navig. Control Conf. Exhib. 1999, 38, 105–111. [Google Scholar] [CrossRef]
Thomas, M.D.; David, M. XSS-10 microsatellite flight demonstration program results. In Spacecraft Platforms and Infrastructure; SPIE: Orlando, FL, USA, 2004; Volume 5419, pp. 16–25. [Google Scholar]
Timothy, E.R. Demonstration of autonomous rendezvous technology (DART) project summary. In Space Systems Technology and Operations; SPIE: Orlando, FL, USA, 2003; Volume 5088, pp. 10–19. [Google Scholar]
Manny, R.L.; Chih-Tsai, C.; Michael, W.B.; Thomas, P.W.; David, L.C.; William, B.G.; Peter, W.S.; Peter, A.S.; Mark, A.L. Orbital express autonomous rendezvous and capture sensor system (ARCSS) flight test results. In Sensors and Systems for Space Applications II; SPIE: Orlando, FL, USA, 2008. [Google Scholar]
Zhou, J. Tiangong-1/Shenzhou-8 rendezvous and docking process. In Space Rendezvous and Docking Technology; National Defense Industry Press: Beijing, China, 2013; pp. 56–62. [Google Scholar]
Yang Cheng, Z.X. Tianzhou-3 Completes Autonomous Rapid Rendezvous and Docking. Available online: https://www.chinadefenseobservation.com/?p=8682 (accessed on 20 September 2021).
Trent, J.; Perrotto, J.B. Spacex Dragon Attached to Space Station in Spaceflight First. Available online: https://www.nasa.gov/home/hqnews/2012/may/HQ_12-172_SpaceX_Dragon_Berth.html (accessed on 25 May 2012).
Betts, J.T. Survey of numerical methods for trajectory optimization. J. Guid. Control Dyn. 1998, 21, 193–207. [Google Scholar] [CrossRef]
Rao, A.V. A survey of numerical methods for optimal control. Adv. Astronaut. Sci. 2010, 135, 1–32. [Google Scholar]
Trélat, E. Optimal control and applications to aerospace: Some results and challenges. J. Optim. Theory Appl. 2012, 154, 713–758. [Google Scholar] [CrossRef]
Tang, G.; Jiang, F.; Li, J. Fuel-Optimal Low-Thrust Trajectory Optimization Using Indirect Method and Successive Convex Programming. IEEE Trans. Aerosp. Electron. Syst. 2018, 54, 2053–2066. [Google Scholar] [CrossRef]
Andersen, E.D.; Roos, C.; Terlaky, T. On implementing a primal-dual interior-point method for conic quadratic optimization. Math. Program. 2003, 95, 249–277. [Google Scholar] [CrossRef]
Guo, X.; Zhu, M. Direct trajectory optimization based on a mapped Chebyshev pseudospectral method. Chin. J. Aeronaut 2013, 26, 401–412. [Google Scholar] [CrossRef]
Benson, D.A.; Huntington, G.T.; Thorvaldsen, T.P.; Rao, A.V. Direct Trajectory Optimization and Costate Estimation via an Orthogonal Collocation Method. J. Guid. Control Dyn. 2006, 29, 1435–1440. [Google Scholar] [CrossRef]
Boyd, S.; Boyd, S.P.; Vandenberghe, L. Convex Optimization; Cambridge University Press: Cambridge, UK, 2004. [Google Scholar]
Oumer, A.M.; Kim, D.-K. Real-Time Fuel Optimization and Guidance for Spacecraft Rendezvous and Docking. Aerospace 2022, 9, 276. [Google Scholar] [CrossRef]
Malyuta, D.; Reynolds, T.; Szmuk, M.; Mesbahi, M.; Acikmese, B.; Carson, J.M. Discretization Performance and Accuracy Analysis for the Rocket Powered Descent Guidance Problem. In Proceedings of the AIAA Scitech 2019 Forum, San Diego, CA, USA, 7–11 January 2019. [Google Scholar] [CrossRef]
Wächter, A.; Biegler, L.T. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming. Math. Program. 2006, 106, 25–57. [Google Scholar] [CrossRef]
Gill, P.E.; Murray, W.; Saunders, M.A. SNOPT: An SQP Algorithm for Large-Scale Constrained Optimization. SIAM Rev. 2005, 47, 99–131. [Google Scholar] [CrossRef]
Betts, J.T. Practical Methods for Optimal Control and Estimation Using Nonlinear Programming; Cambridge University Press: Cambridge, UK, 2009. [Google Scholar]
Betts, J.T.; Huffman, W.P. Mesh refinement in direct transcription methods for optimal control. Optim. Control Appl. Methods 1998, 19, 1–21. [Google Scholar] [CrossRef]
Elnagar, G.; Kazemi, M.A.; Razzaghi, M. The pseudospectral Legendre method for discretizing optimal control problems. IEEE Trans. Autom. Control 1995, 40, 1793–1796. [Google Scholar] [CrossRef]
Elnagar, G.; Kazemi, M.A. Pseudospectral Chebyshev Optimal Control of Constrained Nonlinear Dynamical Systems. Comput. Optim. Appl. 1998, 11, 195–217. [Google Scholar] [CrossRef]
Fahroo, F.; Ross, I.M. Costate Estimation by a Legendre Pseudospectral Method. J. Guid. Control Dyn. 2001, 24, 270–277. [Google Scholar] [CrossRef]
Benson, D. A Gauss Pseudospectral Transcription for Optimal Control. Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, 2005. [Google Scholar]
Fahroo, F.; Ross, I.M. Advances in pseudospectral methods for optimal control. In Proceedings of the AIAA Guidance, Navigation and Control Conference and Exhibit, Honolulu, HI, USA, 18 August–21 August 2008. [Google Scholar] [CrossRef]
Rea, J. Launch Vehicle Trajectory Optimization Using a Legendre Pseudospectral Method. In Proceedings of the AIAA Guidance, Navigation, and Control Conference and Exhibit, Austin, TX, USA, 11–14 August 2003. [Google Scholar] [CrossRef]
Bedrossian, N.; Kang, W. Pseudospectral Optimal Control Theory Makes Debut Flight, Saves NASA $1M in Under Three Hours. SIAM News 2007, 40, 1–3. [Google Scholar]
Sagliano, M. Generalized hp Pseudospectral Convex Programming for Powered Descent and Landing. J. Guid. Control. Dyn. 2018, 42, 1562–1570. [Google Scholar] [CrossRef]
Koeppen, N.; Ross, I.M.; Wilcox, L.C.; Proulx, R.J. Fast Mesh Refinement in Pseudospectral Optimal Control. J. Guid. Control Dyn. 2019, 42, 711–722. [Google Scholar] [CrossRef]
Gong, Q.; Fahroo, F.; Ross, I.M. Spectral Algorithm for Pseudospectral Methods in Optimal Control. J. Guid. Control Dyn. 2008, 31, 460–471. [Google Scholar] [CrossRef]
Hesthaven, J.S. Integration Preconditioning of Pseudospectral Operators. I. Basic Linear Operators. SIAM J. Numer. Anal. 1998, 35, 1571–1593. [Google Scholar] [CrossRef]
Elbarbary, E.M.E. Integration Preconditioning Matrix for Ultraspherical Pseudospectral Operators. SIAM J. Sci. Comput. 2006, 28, 1186–1201. [Google Scholar] [CrossRef]
Ross, I.M.; Proulx, R.J. Further Results on Fast Birkhoff Pseudospectral Optimal Control Programming. J. Guid. Control Dyn. 2019, 42, 2086–2092. [Google Scholar] [CrossRef]
Wang, L.-L.; Samson, M.D.; Zhao, X. A well-conditioned collocation method using a pseudospectral integration matrix. SIAM J. Sci. Comput. 2014, 36, A907–A929. [Google Scholar] [CrossRef]
Du, K. On Well-Conditioned Spectral Collocation and Spectral Methods by the Integral Reformulation. SIAM J. Sci. Comput. 2016, 38, A3247–A3263. [Google Scholar] [CrossRef]
McCoid, C.; Trummer, M.R. Preconditioning of spectral methods via Birkhoff interpolation. Numer. Algorithm 2017, 79, 555–573. [Google Scholar] [CrossRef]
Clohessy, W.H.; Wiltshire, R.S. Terminal guidance system for satellite rendezvous. J. Aerosp. Sci. 1960, 27, 653–658. [Google Scholar] [CrossRef]
Chobotov, V.A. (Ed.) Orbital Mechanics; AIAA, Inc.: Reston, WV, USA, 2002. [Google Scholar]
Luo, Y.-Z.; Tang, G.-J.; Lei, Y.-J. Optimal multi-objective linearized impulsive rendezvous. J. Guid. Control Dyn. 2007, 30, 383–389. [Google Scholar] [CrossRef]
Acikmese, B.; Ploen, S.R. Convex programming approach to powered descent guidance for Mars landing. J. Guid Control Dyn. 2007, 30, 1353–1366. [Google Scholar] [CrossRef]
Açıkmeşe, B.; Carson, J.M.; Blackmore, L. Lossless convexification of nonconvex control bound and pointing constraints of the soft landing optimal control problem. IEEE Trans. Control Syst. Technol. 2013, 21, 2104–2113. [Google Scholar] [CrossRef]
Liu, X.; Lu, P.; Pan, B. Survey of convex optimization for aerospace applications. Astrodynamics 2017, 1, 23–40. [Google Scholar] [CrossRef]
Lu, P.; Liu, X. Autonomous Trajectory Planning for Rendezvous and Proximity Operations by Conic Optimization. J. Guid. Control Dyn. 2013, 36, 375–390. [Google Scholar] [CrossRef]
Gong, Q.; Ross, I.M.; Fahroo, F. Spectral and Pseudospectral Optimal Control Over Arbitrary Grids. J. Optim. Theory Appl. 2016, 169, 759–783. [Google Scholar] [CrossRef]
Fornberg, B. A Practical Guide to Pseudospectral Methods; Cambridge University Press: Cambridge, UK, 1996. [Google Scholar] [CrossRef]
Fahroo, F.; Ross, I.M. Pseudospectral methods for infinite-horizon optimal control problems. J. Guid Control Dyn. 2008, 31, 927–936. [Google Scholar] [CrossRef]
Garg, D.; Hager, W.W.; Rao, A.V. Pseudospectral Methods for Solving Infnite-Horizon Optimal Control Problems. Automatica 2011, 47, 829–837. [Google Scholar] [CrossRef]
Sagliano, M. Pseudospectral Convex Optimization for Powered Descent and Landing. J. Guid. Control Dyn. 2017, 41, 320–334. [Google Scholar] [CrossRef]
Fletcher, C.A.J. Computational Galerkin Methods. In Computational Galerkin Methods; Springer: Berlin/Heidelberg, Germany, 1984; pp. 72–85. [Google Scholar] [CrossRef]
Lorentz, G.G.; Zeller, K.L. Birkhoff Interpolation. SIAM J. Numer. Anal. 1971, 8, 43–48. [Google Scholar] [CrossRef]
Schoenberg, I.J. On Hermite-Birkhoff interpolation. J. Math. Anal. Appl. 1966, 16, 538–543. [Google Scholar] [CrossRef]
Zhao, D.-J.; Song, Z.-Y. Reentry trajectory optimization with waypoint and no-fly zone constraints using multiphase convex programming. ACTA Astronaut. 2017, 137, 60–69. [Google Scholar] [CrossRef]
Hu, Q.; Xie, J.; Liu, X. Trajectory optimization for accompanying satellite obstacle avoidance. Aerosp. Sci. Technol. 2018, 82, 220–233. [Google Scholar] [CrossRef]
Löfberg, J. YALMIP: A Toolbox for Modeling and Optimization in MATLAB2004. In Proceedings of the 2004 IEEE International Conference on Robotics and Automation, CACSD Conference, Taipei, Taiwan, 2–4 September 2004. [Google Scholar]

Figure 1. A close-range rendezvous maneuver scenario.

Figure 2. No-fly zone (red area) and safety zone (white area): (a) Nonconvex no-fly zone; (b) Linear no-fly zone.

Figure 3. The complete process framework of the SC algorithm takes the FBPSM discretization method as an example.

Figure 4. The curve of model time and solver time under the different number of grid points.

Figure 5. The terminal position and velocity error under the different number of grid points.

Figure 6. The position errors, as measured by the difference between the optimal trajectory and the propagated trajectory. (N = 90).

Figure 7. The curves of the total solution time T_total under different types of grid points and no-fly zones.

Figure 8. The curves of the number of iterations S_iter under different types of grid points and no-fly zones.

Table 1. The statistical results under different grid points N.

N	ZOH			FDPSM ¹		FBPSM ²					SBPSM ³
N	T_total (s)	e(m) ⁴	J_obj (kg)	T_total (s)	e(m)	J_obj (kg)	T_total (s)	e(m)	J_obj (kg)	T_total (s)	e(m)	J_obj (kg)
30	19.50	40.471	0.5547	5.35	0.296	0.5524	5.01	0.296	0.5524	4.92	0.296	0.5524
60	24.88	20.196	0.5538	8.78	0.328	0.5536	9.69	0.328	0.5536	8.33	0.328	0.5536
90	31.46	13.398	0.5525	10.39	0.177	0.5531	10.62	0.177	0.5531	10.42	0.177	0.5531
120	40.82	10.038	0.5502	18.97	0.100	0.5532	17.19	0.100	0.5532	18.03	0.100	0.5532
150	51.33	8.042	0.5523	64.11	0.131	0.5515	24.33	0.002	0.5529	20.98	0.002	0.5529
180	61.63	6.819	0.5508	76.44	0.034	0.5524	26.85	0.021	0.5529	23.78	0.021	0.5529
210	70.91	5.744	0.5522	94.61	0.326	0.5501	35.76	0.034	0.5529	29.19	0.034	0.5528

¹ First-order differential pseudospectral method in Section 3.2, FDPSM; ² first-order Birkhoff pseudospectral method in Section 4.2, FBPSM; ³ second-order Birkhoff pseudospectral method in Section 4.3, SBPSM (the same below). ⁴ Error here refers to the terminal position error in the y-axis direction.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.