Article

Consistency of Approximation of Bernstein Polynomial-Based Direct Methods for Optimal Control

1 Department of Mechanical Engineering, University of Iowa, Iowa City, IA 52242, USA
2 Department of Mechanical and Aerospace Engineering, Naval Postgraduate School, Monterey, CA 93940, USA
3 Department of Electrical and Computer Engineering, University of Texas at San Antonio, San Antonio, TX 78249, USA
4 Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
5 Institute for Systems and Robotics (ISR), Instituto Superior Técnico (IST), University of Lisbon, 1049-001 Lisbon, Portugal
* Author to whom correspondence should be addressed.
Machines 2022, 10(12), 1132; https://doi.org/10.3390/machines10121132
Submission received: 28 September 2022 / Revised: 14 November 2022 / Accepted: 23 November 2022 / Published: 28 November 2022

Abstract: Bernstein polynomial approximation of a continuous function has a slower rate of convergence than other approximation methods. “The fact seems to have precluded any numerical application of Bernstein polynomials from having been made. Perhaps they will find application when the properties of the approximant in the large are of more importance than the closeness of the approximation.”—remarked P.J. Davis in his 1963 book, Interpolation and Approximation. This paper presents a direct approximation method for nonlinear optimal control problems with mixed input and state constraints based on Bernstein polynomial approximation. We provide a rigorous analysis showing that the proposed method yields consistent approximations of time-continuous optimal control problems and can be used for costate estimation of the optimal control problems. This result leads to the formulation of the Covector Mapping Theorem for Bernstein polynomial approximation. Finally, we explore the numerical and geometric properties of Bernstein polynomials, and illustrate the advantages of the proposed approximation method through several numerical examples.

1. Introduction

Motion planning plays an important role in enabling robotic systems to accomplish tasks assigned to them autonomously, safely and reliably. Over the past decades, many approaches to generating trajectories have been proposed. Examples include bug algorithms, artificial potential functions, roadmap path planners, cell decomposition methods, and optimal control-based trajectory generation. The reader is referred to [1,2,3,4,5,6,7,8] and references therein for detailed discussions and comparisons of these methods. Each technique has different advantages and disadvantages, and is best suited to certain types of problems. Motion planning based on optimal control, i.e., optimal motion planning, is particularly suitable for applications that require the trajectory to optimize some costs while guaranteeing satisfaction of a complex set of vehicle and problem constraints. These applications include multi-robot road search [9], coordinated tracking [10], optimal and constrained formation control [11], and adversarial swarm defense [12].
Optimal control problems that arise from robotics and motion-planning applications are, in general, very complex. Finding a closed-form solution to these problems can be difficult or even impossible, and therefore they must be solved numerically. Numerical methods include indirect and direct methods [13]. Indirect methods solve the problems by converting them into boundary value problems; the solutions are then found by solving systems of differential equations. Direct methods, on the other hand, are based on transcribing optimal control problems into nonlinear programming problems (NLPs) using a discretization scheme [6,13,14,15]. These NLPs can be solved using off-the-shelf NLP solvers (e.g., MATLAB's fmincon or SNOPT) and do not require calculation of costate and adjoint variables, as indirect methods do.
A wide range of direct methods that use different discretization schemes have been developed, including direct single shooting, direct multiple shooting and direct collocation methods [6,13,14,15]. The software packages that implement some of these methods (e.g., PSOPT [16], NLOptControl [17], GPOPS II [18], PROPT [19], DIDO [20] and CasADi [21]) are particularly relevant; some of these have been applied successfully to solve a wide range of real-world problems [22,23,24,25,26,27,28]. Theoretical results in the literature on direct methods include those related to the consistency of approximation theory; see [29], which provides a framework to assess the convergence properties of Euler and Runge–Kutta discretization schemes. Motivated by the consistency of approximation theory, direct methods that use different discretization schemes have been developed, including pseudospectral methods based on Legendre, Chebyshev and Lagrange polynomials [28]. One drawback of direct methods is that the costate of the original optimal control problem cannot be readily obtained from the approximated solution. Nevertheless, in several applications—such as motion planning and control for safety-critical robotic systems—knowledge of the costate is important because it allows for the evaluation of the fulfillment of necessary conditions of optimality. This evaluation, in turn, provides important insights into the validity and optimality of the solution. Therefore, approaches for obtaining estimates of the costate from direct methods have been proposed in the literature on direct collocation [30,31,32] and direct shooting [33].
In [34] we presented a direct method based on Bernstein polynomials. We showed that the geometric properties of these polynomials allow for the implementation of efficient algorithms for the computation of state and input constraints, which are particularly useful for motion planning and trajectory generation applications [35,36]. Additional works that exploit the properties of Bernstein polynomials for nonlinear optimal control can be found in [37,38,39,40,41]. Furthermore, in [42] we used the approximation properties of Bernstein polynomials to derive consistency and convergence results for the proposed direct method. In the present paper, we propose an approximation scheme for primal and dual optimal control problems based on Bernstein polynomials. In particular, we propose an approach to approximate the costate of a general non-linear optimal control problem of Bolza type using the Lagrange multipliers of the Bernstein polynomial-based discrete approximation. We derive transformations that relate the Lagrange multipliers of the nonlinear programming problem to the costate of the original optimal control problem. These transformations are often referred to as covector mapping in the literature on direct methods for optimal control [28,29,43]. Finally, we demonstrate uniform convergence properties of the method.
The paper is structured as follows: Section 2 presents the notation and the mathematical results that will be used later in the paper. Section 3 introduces the optimal control problem of interest and some related assumptions, and presents the Bernstein approximation method that transcribes the optimal control problem into an NLP. In Section 4 we derive the Karush–Kuhn–Tucker (KKT) conditions associated with the NLP. Section 5 compares these conditions to the first-order optimality conditions of the original optimal control problem and states the Covector Mapping Theorem for Bernstein approximation. Numerical examples are discussed in Section 6, while Section 7 highlights the significance of the theoretical findings in a specific multi-robot simulation scenario, namely optimal defense against swarm attacks. The paper ends with conclusions in Section 8.

2. Notation and Mathematical Background

Vector-valued functions are denoted by bold letters, $\mathbf{x}(t) = [x_1(t), \ldots, x_n(t)]$, while vectors are denoted by bold letters with an upper bar, $\bar{\mathbf{x}} = [x_1, \ldots, x_n] \in \mathbb{R}^n$. The symbol $C^r$ denotes the space of functions with $r$ continuous derivatives; $C_n^r$ denotes the space of $n$-vector valued functions in $C^r$. $\|\cdot\|$ denotes the Euclidean norm, $\|\bar{\mathbf{x}}\| = \sqrt{x_1^2 + \cdots + x_n^2}$.
The Bernstein basis polynomials of degree $N$ are defined as
$$b_{j,N}(t) = \binom{N}{j} t^j (1-t)^{N-j}, \qquad t \in [0,1],$$
for $j = 0, \ldots, N$, with $\binom{N}{j} = \frac{N!}{j!(N-j)!}$. An $N$th-order Bernstein polynomial $x_N : [0,1] \to \mathbb{R}$ is a linear combination of $N+1$ Bernstein basis polynomials of order $N$, i.e.,
$$x_N(t) = \sum_{j=0}^{N} \bar{x}_j\, b_{j,N}(t), \qquad t \in [0,1],$$
where $\bar{x}_j \in \mathbb{R}$, $j = 0, \ldots, N$, are referred to as Bernstein coefficients. For the sake of generality, and with a slight abuse of terminology, in this paper we extend the definition of a Bernstein polynomial given above to a vector of $N$th-order polynomials $\mathbf{x}_N : [0,1] \to \mathbb{R}^n$ expressed in the following form:
$$\mathbf{x}_N(t) = \sum_{j=0}^{N} \bar{\mathbf{x}}_{j,N}\, b_{j,N}(t), \qquad t \in [0,1], \qquad (1)$$
where $\bar{\mathbf{x}}_{0,N}, \ldots, \bar{\mathbf{x}}_{N,N} \in \mathbb{R}^n$.
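The definitions above map directly onto code. The following is a minimal sketch (not part of the original paper) of how a Bernstein basis polynomial and a vector-valued Bernstein polynomial can be evaluated from their coefficients; Python with NumPy/SciPy is assumed, and the helper names are illustrative.

```python
import numpy as np
from scipy.special import comb

def bernstein_basis(j, N, t):
    """Evaluate b_{j,N}(t) = C(N,j) t^j (1-t)^(N-j) for t in [0,1]."""
    t = np.asarray(t, dtype=float)
    return comb(N, j) * t**j * (1.0 - t)**(N - j)

def bernstein_poly(coeffs, t):
    """Evaluate x_N(t) = sum_j xbar_{j,N} b_{j,N}(t).

    coeffs has shape (N+1, n): one n-dimensional Bernstein coefficient per row.
    Returns an array of shape (len(t), n).
    """
    coeffs = np.atleast_2d(coeffs)
    N = coeffs.shape[0] - 1
    B = np.stack([bernstein_basis(j, N, t) for j in range(N + 1)], axis=-1)
    return B @ coeffs
```

By the end point values property recalled below (Property 1), bernstein_poly(coeffs, [0.0, 1.0]) simply returns the first and last coefficients.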
In what follows, we provide a review of numerical properties of Bernstein polynomials that are used throughout this paper. The derivative and integral of a Bernstein polynomial $\mathbf{x}_N(t)$ can be easily computed as
$$\dot{\mathbf{x}}_N(t) = N \sum_{j=0}^{N-1} (\bar{\mathbf{x}}_{j+1,N} - \bar{\mathbf{x}}_{j,N})\, b_{j,N-1}(t)$$
and
$$\int_0^1 \mathbf{x}_N(t)\, dt = w \sum_{j=0}^{N} \bar{\mathbf{x}}_{j,N}, \qquad w = \frac{1}{N+1}, \qquad (2)$$
respectively.
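Continuing the sketch above (again an illustration, not code from the paper), the derivative and integral formulas translate into two short helpers:

```python
def bernstein_derivative_coeffs(coeffs):
    """Coefficients of xdot_N: xdot_N(t) = N * sum_j (xbar_{j+1,N} - xbar_{j,N}) b_{j,N-1}(t)."""
    coeffs = np.atleast_2d(coeffs)
    N = coeffs.shape[0] - 1
    return N * (coeffs[1:] - coeffs[:-1])      # shape (N, n): a degree N-1 Bernstein polynomial

def bernstein_integral(coeffs):
    """Integral over [0,1]: w * (sum of the coefficients), with w = 1/(N+1)."""
    coeffs = np.atleast_2d(coeffs)
    N = coeffs.shape[0] - 1
    return coeffs.sum(axis=0) / (N + 1)
```

The derivative can then be evaluated at any time by passing the returned coefficients back to bernstein_poly, which infers the (reduced) degree from their number.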
Bernstein polynomials can be used to approximate smooth functions. Consider an $n$-vector valued function $\mathbf{x} : [0,1] \to \mathbb{R}^n$. The $N$th order Bernstein approximation of $\mathbf{x}(t)$ is a vector of Bernstein polynomials $\mathbf{x}_N(t)$ computed as in (1) with $\bar{\mathbf{x}}_{j,N} = \mathbf{x}(t_j)$ and $t_j = \frac{j}{N}$ for all $j = 0, \ldots, N$. Namely,
$$\mathbf{x}(t) \approx \mathbf{x}_N(t) = \sum_{j=0}^{N} \mathbf{x}(t_j)\, b_{j,N}(t), \qquad t_j = \frac{j}{N}. \qquad (3)$$
The following results hold for Bernstein approximations.
Lemma 1
(Uniform convergence of Bernstein approximation). Let $\mathbf{x}(t) \in C_n^0$ on $[0,1]$, and let $\mathbf{x}_N(t)$ be computed as in Equation (3). Then, for an arbitrary order of approximation $N \in \mathbb{Z}^+$, the Bernstein approximation $\mathbf{x}_N(t)$ satisfies
$$\|\mathbf{x}_N(t) - \mathbf{x}(t)\| \le C_0\, W_{\mathbf{x}}(N^{-\frac{1}{2}}),$$
where $C_0$ is a positive constant satisfying $C_0 < 5n/4$, and $W_{\mathbf{x}}(\cdot)$ is the modulus of continuity of $\mathbf{x}(t)$ in $[0,1]$ [44,45,46]. Moreover, if $\mathbf{x}(t) \in C_n^1$, then
$$\|\dot{\mathbf{x}}_N(t) - \dot{\mathbf{x}}(t)\| \le C_1\, W_{\dot{\mathbf{x}}}(N^{-\frac{1}{2}}),$$
where $C_1$ is a positive constant satisfying $C_1 < 9n/4$ and $W_{\dot{\mathbf{x}}}(\cdot)$ is the modulus of continuity of $\dot{\mathbf{x}}(t)$ in $[0,1]$ [47].
Lemma 2
([48]). Assume $\mathbf{x}(t) \in C_n^{r+2}$, $r \ge 0$, and let $\mathbf{x}_N(t)$ be computed as in Equation (3). Let $\mathbf{x}^{(r)}(t)$ denote the $r$th derivative of $\mathbf{x}(t)$. Then, the following inequalities hold for all $t \in [0,1]$:
$$\|\mathbf{x}_N(t) - \mathbf{x}(t)\| \le \frac{C_0}{N}, \qquad \|\mathbf{x}_N^{(r)}(t) - \mathbf{x}^{(r)}(t)\| \le \frac{C_r}{N},$$
where $C_0, \ldots, C_r$ are independent of $N$.
Lemma 3.
If $\mathbf{x}(t) \in C_n^0$ on $[0,1]$, then we have
$$\Big\|\int_0^1 \mathbf{x}(t)\, dt - w \sum_{j=0}^{N} \mathbf{x}\Big(\frac{j}{N}\Big)\Big\| \le C_I\, W_{\mathbf{x}}(N^{-\frac{1}{2}}),$$
with $w = \frac{1}{N+1}$, where $C_I > 0$ is independent of $N$. Moreover, if $\mathbf{x}(t) \in C_n^2$, then
$$\Big\|\int_0^1 \mathbf{x}(t)\, dt - w \sum_{j=0}^{N} \mathbf{x}\Big(\frac{j}{N}\Big)\Big\| \le \frac{C_I}{N}.$$
The Lemma above follows directly from Lemmas 1 and 2 and Equation (2).
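As a quick, hypothetical numerical check of Lemmas 1–3 (using the helpers sketched above and a test function chosen here purely for illustration), one can sample $x(t) = \sin(2\pi t)$ at the nodes $t_j = j/N$ and watch both the uniform approximation error and the quadrature error decay as $N$ grows:

```python
x = lambda t: np.sin(2 * np.pi * t)
t_fine = np.linspace(0.0, 1.0, 1001)
exact_integral = 0.0                    # integral of sin(2*pi*t) over [0,1]

for N in (10, 20, 40, 80):
    nodes = np.arange(N + 1) / N        # t_j = j/N
    coeffs = x(nodes)[:, None]          # xbar_{j,N} = x(t_j), Equation (3)
    sup_err = np.max(np.abs(bernstein_poly(coeffs, t_fine)[:, 0] - x(t_fine)))
    int_err = abs(bernstein_integral(coeffs)[0] - exact_integral)
    print(f"N={N:3d}   sup error = {sup_err:.3e}   quadrature error = {int_err:.3e}")
```

For this smooth test function both errors shrink roughly like $1/N$, consistent with Lemma 2 and the second bound of Lemma 3.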
The following property of Bernstein polynomials is relevant to this paper.
Property 1
(End point values). The Bernstein polynomial given by Equation (1) satisfies $\mathbf{x}_N(0) = \bar{\mathbf{x}}_{0,N}$ and $\mathbf{x}_N(1) = \bar{\mathbf{x}}_{N,N}$.

3. Problem Formulation

This paper considers the following optimal control problem:
Problem 1
(Problem P). Determine $\mathbf{x} : [0,1] \to \mathbb{R}^{n_x}$ and $\mathbf{u} : [0,1] \to \mathbb{R}^{n_u}$ that minimize
$$I(\mathbf{x}(t), \mathbf{u}(t)) = E(\mathbf{x}(0), \mathbf{x}(1)) + \int_0^1 F(\mathbf{x}(t), \mathbf{u}(t))\, dt, \qquad (4)$$
subject to
$$\dot{\mathbf{x}}(t) = \mathbf{f}(\mathbf{x}(t), \mathbf{u}(t)), \qquad t \in [0,1], \qquad (5)$$
$$\mathbf{e}(\mathbf{x}(0), \mathbf{x}(1)) = \mathbf{0}, \qquad (6)$$
$$\mathbf{h}(\mathbf{x}(t), \mathbf{u}(t)) \le \mathbf{0}, \qquad t \in [0,1], \qquad (7)$$
where $E : \mathbb{R}^{n_x} \times \mathbb{R}^{n_x} \to \mathbb{R}$ and $F : \mathbb{R}^{n_x} \times \mathbb{R}^{n_u} \to \mathbb{R}$ are the terminal and running costs, respectively, $\mathbf{f} : \mathbb{R}^{n_x} \times \mathbb{R}^{n_u} \to \mathbb{R}^{n_x}$ describes the system dynamics, $\mathbf{e} : \mathbb{R}^{n_x} \times \mathbb{R}^{n_x} \to \mathbb{R}^{n_e}$ is the vector of boundary conditions, and $\mathbf{h} : \mathbb{R}^{n_x} \times \mathbb{R}^{n_u} \to \mathbb{R}^{n_h}$ is the vector of state and input constraints.
Next, we formulate a discretized version of Problem P, here referred to as Problem P N , where N denotes the order of approximation. This requires that we approximate the input and state functions, the cost function, the system dynamics and the equality and inequality constraints in Problem P. First, consider the following Nth-order vectors of Bernstein polynomials:
$$\mathbf{x}_N(t) = \sum_{j=0}^{N} \bar{\mathbf{x}}_{j,N}\, b_{j,N}(t), \qquad \mathbf{u}_N(t) = \sum_{j=0}^{N} \bar{\mathbf{u}}_{j,N}\, b_{j,N}(t),$$
with $\mathbf{x}_N : [0,1] \to \mathbb{R}^{n_x}$, $\mathbf{u}_N : [0,1] \to \mathbb{R}^{n_u}$, $\bar{\mathbf{x}}_{j,N} \in \mathbb{R}^{n_x}$ and $\bar{\mathbf{u}}_{j,N} \in \mathbb{R}^{n_u}$. Let $\bar{\mathbf{x}}_N \in \mathbb{R}^{n_x \times (N+1)}$ and $\bar{\mathbf{u}}_N \in \mathbb{R}^{n_u \times (N+1)}$ be defined as
$$\bar{\mathbf{x}}_N = [\bar{\mathbf{x}}_{0,N}, \ldots, \bar{\mathbf{x}}_{N,N}], \qquad \bar{\mathbf{u}}_N = [\bar{\mathbf{u}}_{0,N}, \ldots, \bar{\mathbf{u}}_{N,N}].$$
Let $0 = t_0 < t_1 < \cdots < t_N = 1$ be a set of equidistant time nodes, i.e., $t_j = \frac{j}{N}$. Then, Problem $P_N$ can be stated as follows:
Problem 2
(Problem $P_N$). Determine $\bar{\mathbf{x}}_N$ and $\bar{\mathbf{u}}_N$ that minimize
$$I_N(\bar{\mathbf{x}}_N, \bar{\mathbf{u}}_N) = E(\mathbf{x}_N(0), \mathbf{x}_N(t_N)) + w \sum_{j=0}^{N} F(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j)),$$
subject to
$$\|\dot{\mathbf{x}}_N(t_j) - \mathbf{f}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\| \le \delta_P^N, \qquad j = 0, \ldots, N, \qquad (10)$$
$$\mathbf{e}(\mathbf{x}_N(0), \mathbf{x}_N(t_N)) = \mathbf{0}, \qquad (11)$$
$$\mathbf{h}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j)) \le \delta_P^N \mathbf{1}, \qquad j = 0, \ldots, N, \qquad (12)$$
where $w = \frac{1}{N+1}$, and $\delta_P^N$ is a small positive number that depends on $N$ and converges uniformly to 0, i.e., $\lim_{N \to \infty} \delta_P^N = 0$.
Remark 1.
Compared to the constraints of Problem P, the dynamic and inequality constraints given by Equations (10) and (12) are relaxed. Motivated by previous work on the consistency of approximation theory [29], the bound $\delta_P^N$, referred to as the relaxation bound, is introduced to guarantee that Problem $P_N$ has a feasible solution. As will become clear later, the relaxation bound can be made arbitrarily small by choosing a sufficiently large order of approximation $N$. Furthermore, note that when $N \to \infty$, the right-hand sides of Equations (10) and (12) are equal to zero, i.e., the difference between the constraints imposed by Problems P and $P_N$ vanishes.
Remark 2.
The outcome of Problem $P_N$ is a set of optimal Bernstein coefficients $\bar{\mathbf{x}}_N^*$ and $\bar{\mathbf{u}}_N^*$ that determine the vectors of Bernstein polynomials $\mathbf{x}_N^*(t)$ and $\mathbf{u}_N^*(t)$, i.e.,
$$\mathbf{x}_N^*(t) = \sum_{j=0}^{N} \bar{\mathbf{x}}_{j,N}^*\, b_{j,N}(t), \qquad \mathbf{u}_N^*(t) = \sum_{j=0}^{N} \bar{\mathbf{u}}_{j,N}^*\, b_{j,N}(t).$$
In our previous work [42], we provide theoretical results demonstrating: (i) the existence of a feasible solution to Problem $P_N$, and (ii) the convergence of the pair $(\mathbf{x}_N^*(t), \mathbf{u}_N^*(t))$ to the optimal solution of Problem P, given by $(\mathbf{x}^*(t), \mathbf{u}^*(t))$. Nevertheless, the present paper focuses on the existence and convergence of the estimates of the costates of Problem P, which are introduced next.
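Before moving to the costates, it may help to see how Problem $P_N$ is assembled in practice. The sketch below (an illustration under the notation of this section, not code from the paper) evaluates the discrete running cost and the residuals of the relaxed constraints (10) and (12) at the nodes $t_j = j/N$, reusing the Bernstein helpers sketched in Section 2; the terminal cost $E$, the boundary condition $\mathbf{e}$, the relaxation bound $\delta_P^N$ and the choice of NLP solver are left to the caller.

```python
def problem_PN_ingredients(F, f, h, xbar, ubar):
    """Discrete cost and constraint values of Problem P_N at the nodes t_j = j/N.

    xbar: (N+1, n_x) Bernstein coefficients of x_N; ubar: (N+1, n_u) coefficients of u_N.
    F, f, h are callbacks with the signatures of the running cost, dynamics and path constraints.
    """
    N = xbar.shape[0] - 1
    w = 1.0 / (N + 1)
    nodes = np.arange(N + 1) / N

    x_nodes = bernstein_poly(xbar, nodes)                                   # x_N(t_j)
    u_nodes = bernstein_poly(ubar, nodes)                                   # u_N(t_j)
    xdot_nodes = bernstein_poly(bernstein_derivative_coeffs(xbar), nodes)   # xdot_N(t_j)

    running_cost = w * sum(F(xj, uj) for xj, uj in zip(x_nodes, u_nodes))
    dyn_residuals = np.array([xd - f(xj, uj)                                # compare with delta_P^N, Eq. (10)
                              for xd, xj, uj in zip(xdot_nodes, x_nodes, u_nodes)])
    path_values = np.array([h(xj, uj) for xj, uj in zip(x_nodes, u_nodes)]) # must be <= delta_P^N, Eq. (12)
    return running_cost, dyn_residuals, path_values
```

An NLP solver then minimizes the sum of $E$ and the running cost over the stacked coefficients subject to these residuals; a concrete, self-contained instance is sketched in Example 1 below.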

4. Costate Estimation for Problem P

4.1. First-Order Optimality Conditions of Problem P

We start by deriving the first-order necessary conditions for Problem P. Let $\boldsymbol{\lambda} : [0,1] \to \mathbb{R}^{n_x}$ be the costate trajectory, and let $\boldsymbol{\mu} : [0,1] \to \mathbb{R}^{n_h}$ and $\boldsymbol{\nu} \in \mathbb{R}^{n_e}$ be the multipliers. By defining the Lagrangian of the Hamiltonian (also known as the D-form [49]) as
$$L(\mathbf{x}(t), \mathbf{u}(t), \boldsymbol{\lambda}(t), \boldsymbol{\mu}(t)) = H(\mathbf{x}(t), \mathbf{u}(t), \boldsymbol{\lambda}(t)) + \boldsymbol{\mu}^\top(t)\, \mathbf{h}(\mathbf{x}(t), \mathbf{u}(t)),$$
where the Hamiltonian $H$ is given by
$$H(\mathbf{x}(t), \mathbf{u}(t), \boldsymbol{\lambda}(t)) = F(\mathbf{x}(t), \mathbf{u}(t)) + \boldsymbol{\lambda}^\top(t)\, \mathbf{f}(\mathbf{x}(t), \mathbf{u}(t)),$$
the dual of Problem P can be formulated as follows [49].
Problem 3
(Problem $P^{\lambda}$). Determine $\mathbf{x}(t)$, $\mathbf{u}(t)$, $\boldsymbol{\lambda}(t)$, $\boldsymbol{\mu}(t)$ and $\boldsymbol{\nu}$ that for all $t \in [0,1]$ satisfy Equations (5)–(7) and
$$\boldsymbol{\mu}^\top(t)\, \mathbf{h}(\mathbf{x}(t), \mathbf{u}(t)) = 0, \qquad \boldsymbol{\mu}(t) \ge \mathbf{0}, \qquad (14)$$
$$\dot{\boldsymbol{\lambda}}(t) + L_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t), \boldsymbol{\lambda}(t), \boldsymbol{\mu}(t)) = \mathbf{0}, \qquad (15)$$
$$\boldsymbol{\lambda}(0) = -\boldsymbol{\nu}^\top \mathbf{e}_{\mathbf{x}(0)}(\mathbf{x}(0), \mathbf{x}(1)) - E_{\mathbf{x}(0)}(\mathbf{x}(0), \mathbf{x}(1)), \qquad (16)$$
$$\boldsymbol{\lambda}(1) = \boldsymbol{\nu}^\top \mathbf{e}_{\mathbf{x}(1)}(\mathbf{x}(0), \mathbf{x}(1)) + E_{\mathbf{x}(1)}(\mathbf{x}(0), \mathbf{x}(1)), \qquad (17)$$
$$L_{\mathbf{u}}(\mathbf{x}(t), \mathbf{u}(t), \boldsymbol{\lambda}(t), \boldsymbol{\mu}(t)) = \mathbf{0}. \qquad (18)$$
In the above problem, subscripts are used to denote partial derivatives, e.g., $F_{\mathbf{x}}(\mathbf{x}, \mathbf{u}) = \frac{\partial}{\partial \mathbf{x}} F(\mathbf{x}, \mathbf{u})$.
The following assumptions are imposed onto Problem P λ .
Assumption 1.
E , F , f , e and h are continuously differentiable with respect to their arguments, and their gradients are Lipschitz continuous over the domain.
Assumption 2.
Solutions $\mathbf{x}^*(t)$, $\mathbf{u}^*(t)$, $\boldsymbol{\lambda}^*(t)$, $\boldsymbol{\mu}^*(t)$ and $\boldsymbol{\nu}^*$ of Problem $P^{\lambda}$ exist and satisfy $\mathbf{x}^*(t) \in C_{n_x}^1$, $\mathbf{u}^*(t) \in C_{n_u}^0$, $\boldsymbol{\lambda}^*(t) \in C_{n_x}^1$ and $\boldsymbol{\mu}^*(t) \in C_{n_h}^0$ on $[0,1]$.
Remark 3.
Notice that Problem $P^{\lambda}$ implicitly assumes the absence of pure state constraints in Problem P. If the inequality constraint in Equation (7) is independent of $\mathbf{u}(t)$, then the costate $\boldsymbol{\lambda}(t)$ must also satisfy the following jump condition [49]:
$$\boldsymbol{\lambda}(t_e^-) = \boldsymbol{\lambda}(t_e^+) + \mathbf{h}_{\mathbf{x}}^\top(t_e)\, \boldsymbol{\eta},$$
where $t_e$ is the entry or exit time of a constrained arc on which the inequality constraint is active, $t_e^-$ and $t_e^+$ denote the left-hand side and right-hand side limits of the trajectory, respectively, and $\boldsymbol{\eta}$ is a constant covector. For simplicity, the theoretical results presented in Section 5 do not consider the jump conditions above, i.e., the inequality constraints are assumed to depend on $\mathbf{u}(t)$. Nevertheless, numerical examples will be presented in Section 6, showing the applicability of the discretization method to pure state-constrained problems.

4.2. KKT Conditions of Problem $P_N$

Now, we derive the necessary conditions of Problem $P_N$. Let us introduce the following $N$th-order Bernstein polynomials:
$$\boldsymbol{\lambda}_N(t) = \sum_{j=0}^{N} \bar{\boldsymbol{\lambda}}_{j,N}\, b_{j,N}(t), \qquad \boldsymbol{\mu}_N(t) = \sum_{j=0}^{N} \bar{\boldsymbol{\mu}}_{j,N}\, b_{j,N}(t),$$
with $\boldsymbol{\lambda}_N : [0,1] \to \mathbb{R}^{n_x}$, $\boldsymbol{\mu}_N : [0,1] \to \mathbb{R}^{n_h}$, $\bar{\boldsymbol{\lambda}}_{j,N} \in \mathbb{R}^{n_x}$ and $\bar{\boldsymbol{\mu}}_{j,N} \in \mathbb{R}^{n_h}$, and the vector $\bar{\boldsymbol{\nu}} \in \mathbb{R}^{n_e}$. Finally, let $\bar{\boldsymbol{\lambda}}_N \in \mathbb{R}^{n_x \times (N+1)}$ and $\bar{\boldsymbol{\mu}}_N \in \mathbb{R}^{n_h \times (N+1)}$ be defined as
$$\bar{\boldsymbol{\lambda}}_N = [\bar{\boldsymbol{\lambda}}_{0,N}, \ldots, \bar{\boldsymbol{\lambda}}_{N,N}], \qquad \bar{\boldsymbol{\mu}}_N = [\bar{\boldsymbol{\mu}}_{0,N}, \ldots, \bar{\boldsymbol{\mu}}_{N,N}].$$
With the above notation, the Lagrangian for Problem $P_N$ can be written as
$$L^N = E(\mathbf{x}_N(0), \mathbf{x}_N(t_N)) + w \sum_{j=0}^{N} F(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j)) + \sum_{j=0}^{N} \boldsymbol{\lambda}_N^\top(t_j)\big(-\dot{\mathbf{x}}_N(t_j) + \mathbf{f}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\big) + \sum_{j=0}^{N} \boldsymbol{\mu}_N^\top(t_j)\, \mathbf{h}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j)) + \bar{\boldsymbol{\nu}}^\top \mathbf{e}(\mathbf{x}_N(0), \mathbf{x}_N(t_N)).$$
Then, the dual of Problem $P_N$ can be stated as follows:
Problem 4
(Problem $P_N^{\lambda}$). Determine $\bar{\mathbf{x}}_N$, $\bar{\mathbf{u}}_N$, $\bar{\boldsymbol{\lambda}}_N$, $\bar{\boldsymbol{\mu}}_N$ and $\bar{\boldsymbol{\nu}}$ that satisfy the primal feasibility conditions, namely Equations (10)–(12), the complementary slackness and dual feasibility conditions
$$\|\boldsymbol{\mu}_N^\top(t_k)\, \mathbf{h}(\mathbf{x}_N(t_k), \mathbf{u}_N(t_k))\| \le N^{-1} \delta_D^N, \qquad -\boldsymbol{\mu}_N(t_k) \le N^{-1} \delta_D^N \mathbf{1}, \qquad k = 0, \ldots, N, \qquad (20)$$
and the stationarity conditions
$$\Big\|\frac{\partial L^N}{\partial \bar{\mathbf{x}}_{k,N}}\Big\| \le \delta_D^N, \qquad \Big\|\frac{\partial L^N}{\partial \bar{\mathbf{u}}_{k,N}}\Big\| \le \delta_D^N, \qquad k = 0, \ldots, N, \qquad (21)$$
where $\delta_D^N$ is a small positive number that depends on $N$ and satisfies $\lim_{N \to \infty} \delta_D^N = 0$.
At this point, similarly to most results on costate estimation [50,51,52], we introduce additional conditions that must be added to Equations (10)–(12), (20) and (21) in order to obtain consistent approximations of the solutions of Problem P λ . These conditions, often referred to as closure conditions in the literature, are given as follows:
$$\Big\|\frac{\boldsymbol{\lambda}_N(0)}{w} + \bar{\boldsymbol{\nu}}^\top \mathbf{e}_{\mathbf{x}(0)}(\mathbf{x}_N(0), \mathbf{x}_N(t_N)) + E_{\mathbf{x}(0)}(\mathbf{x}_N(0), \mathbf{x}_N(t_N))\Big\| \le \delta_D^N, \qquad (22)$$
$$\Big\|\frac{\boldsymbol{\lambda}_N(t_N)}{w} - \bar{\boldsymbol{\nu}}^\top \mathbf{e}_{\mathbf{x}(1)}(\mathbf{x}_N(0), \mathbf{x}_N(t_N)) - E_{\mathbf{x}(1)}(\mathbf{x}_N(0), \mathbf{x}_N(t_N))\Big\| \le \delta_D^N. \qquad (23)$$
In other words, the closure conditions are constraints that must be added to Problem P N λ so that the solution of this problem approximates the solution of Problem P λ . We notice that the conditions given above are discrete approximations of the conditions given by Equations (16) and (17). With this setup, we define the following problem:
Problem 5
(Problem $P_N^{\lambda,\mathrm{clos}}$). Determine $\bar{\mathbf{x}}_N$, $\bar{\mathbf{u}}_N$, $\bar{\boldsymbol{\lambda}}_N$, $\bar{\boldsymbol{\mu}}_N$ and $\bar{\boldsymbol{\nu}}$ that satisfy the primal feasibility conditions, namely Equations (10)–(12), the complementary slackness and dual feasibility conditions (20), the stationarity conditions (21), and the closure conditions (22) and (23).
The solution of Problem $P_N^{\lambda,\mathrm{clos}}$ consists of a set of optimal Bernstein coefficients $\bar{\mathbf{x}}_N^*$, $\bar{\mathbf{u}}_N^*$, $\bar{\boldsymbol{\lambda}}_N^*$, $\bar{\boldsymbol{\mu}}_N^*$ (which determine the Bernstein polynomials $\mathbf{x}_N^*(t)$, $\mathbf{u}_N^*(t)$, $\boldsymbol{\lambda}_N^*(t)$ and $\boldsymbol{\mu}_N^*(t)$) and a vector $\bar{\boldsymbol{\nu}}^*$.

5. Feasibility and Consistency of Problem $P_N^{\lambda,\mathrm{clos}}$

The objective of this section is to investigate the ability of the solutions of Problem $P_N^{\lambda,\mathrm{clos}}$ to approximate the solutions of Problem $P^{\lambda}$. In what follows, we first show the existence of a solution to Problem $P_N^{\lambda,\mathrm{clos}}$ (feasibility). Second, we investigate the convergence properties of this solution as $N \to \infty$ (consistency). Third, by combining these two results, we finally formulate the Covector Mapping Theorem for Bernstein approximations, which provides a map between the solution of Problem $P_N^{\lambda,\mathrm{clos}}$ and the solution of Problem $P^{\lambda}$. The main results of this section are reported in the three theorems below and summarized in Figure 1.
Theorem 1
(Feasibility). Let
$$\delta_D^N = C_D \max\big\{W_{\dot{\mathbf{x}}}(N^{-\frac{1}{2}}),\, W_{\mathbf{x}}(N^{-\frac{1}{2}}),\, W_{\mathbf{u}}(N^{-\frac{1}{2}}),\, W_{\dot{\boldsymbol{\lambda}}}(N^{-\frac{1}{2}}),\, W_{\boldsymbol{\lambda}}(N^{-\frac{1}{2}}),\, W_{\boldsymbol{\mu}}(N^{-\frac{1}{2}})\big\}, \qquad (24)$$
$$\delta_P^N = C_P \max\big\{W_{\dot{\mathbf{x}}}(N^{-\frac{1}{2}}),\, W_{\mathbf{x}}(N^{-\frac{1}{2}}),\, W_{\mathbf{u}}(N^{-\frac{1}{2}})\big\}, \qquad (25)$$
where $C_D$ and $C_P$ are positive constants independent of $N$, and $W_{\dot{\mathbf{x}}}(\cdot)$, $W_{\mathbf{x}}(\cdot)$, $W_{\mathbf{u}}(\cdot)$, $W_{\dot{\boldsymbol{\lambda}}}(\cdot)$, $W_{\boldsymbol{\lambda}}(\cdot)$ and $W_{\boldsymbol{\mu}}(\cdot)$ are the moduli of continuity of $\dot{\mathbf{x}}(t)$, $\mathbf{x}(t)$, $\mathbf{u}(t)$, $\dot{\boldsymbol{\lambda}}(t)$, $\boldsymbol{\lambda}(t)$ and $\boldsymbol{\mu}(t)$, respectively. Then Problem $P_N^{\lambda,\mathrm{clos}}$ is feasible for an arbitrary order of approximation $N \in \mathbb{Z}^+$.
Proof. 
This proof follows by constructing a solution for Problem $P_N^{\lambda,\mathrm{clos}}$, with $\delta_D^N$ given by Equation (24). To this end, let $\mathbf{x}(t)$, $\mathbf{u}(t)$, $\boldsymbol{\lambda}(t)$, $\boldsymbol{\mu}(t)$ and $\boldsymbol{\nu}$ be a solution of Problem $P^{\lambda}$, which exists by Assumption 2, and define
$$\bar{\mathbf{x}}_{j,N} = \mathbf{x}(t_j), \qquad \bar{\mathbf{u}}_{j,N} = \mathbf{u}(t_j), \qquad (26)$$
$$\bar{\boldsymbol{\lambda}}_{j,N} = w\, \boldsymbol{\lambda}(t_j), \qquad \bar{\boldsymbol{\mu}}_{j,N} = w\, \boldsymbol{\mu}(t_j), \qquad \bar{\boldsymbol{\nu}} = \boldsymbol{\nu}, \qquad (27)$$
for all $j = 0, \ldots, N$, with $t_j = \frac{j}{N}$ and $w = \frac{1}{N+1}$, and with corresponding Bernstein polynomials given by
$$\mathbf{x}_N(t) = \sum_{j=0}^{N} \bar{\mathbf{x}}_{j,N}\, b_{j,N}(t), \quad \mathbf{u}_N(t) = \sum_{j=0}^{N} \bar{\mathbf{u}}_{j,N}\, b_{j,N}(t), \quad \boldsymbol{\lambda}_N(t) = \sum_{j=0}^{N} \bar{\boldsymbol{\lambda}}_{j,N}\, b_{j,N}(t), \quad \boldsymbol{\mu}_N(t) = \sum_{j=0}^{N} \bar{\boldsymbol{\mu}}_{j,N}\, b_{j,N}(t). \qquad (28)$$
The remainder of this proof shows that $\mathbf{x}_N(t)$, $\mathbf{u}_N(t)$, $\boldsymbol{\lambda}_N(t)$, $\boldsymbol{\mu}_N(t)$ and $\bar{\boldsymbol{\nu}}$ given above satisfy Equations (20)–(23). The satisfaction of Equations (10)–(12) can be demonstrated using a proof similar to that of [42], and is thus omitted. We start by defining the Bernstein coefficients $\tilde{\bar{\boldsymbol{\lambda}}}_{j,N}$ and $\tilde{\bar{\boldsymbol{\mu}}}_{j,N}$ as follows:
$$\tilde{\bar{\boldsymbol{\lambda}}}_{j,N} = \frac{\bar{\boldsymbol{\lambda}}_{j,N}}{w}, \qquad \tilde{\bar{\boldsymbol{\mu}}}_{j,N} = \frac{\bar{\boldsymbol{\mu}}_{j,N}}{w}, \qquad (29)$$
with corresponding Bernstein polynomials given by
$$\tilde{\boldsymbol{\lambda}}_N(t) = \sum_{j=0}^{N} \tilde{\bar{\boldsymbol{\lambda}}}_{j,N}\, b_{j,N}(t), \qquad \tilde{\boldsymbol{\mu}}_N(t) = \sum_{j=0}^{N} \tilde{\bar{\boldsymbol{\mu}}}_{j,N}\, b_{j,N}(t).$$
Notice that
$$\tilde{\boldsymbol{\lambda}}_N(t) = \frac{\boldsymbol{\lambda}_N(t)}{w}, \qquad \tilde{\boldsymbol{\mu}}_N(t) = \frac{\boldsymbol{\mu}_N(t)}{w}. \qquad (30)$$
Combining Equations (26), (27) and (29) and using Assumption 2 and Lemma 1, we get
$$\|\mathbf{x}_N(t) - \mathbf{x}(t)\| \le C_x W_{\mathbf{x}}(N^{-\frac{1}{2}}), \qquad \|\mathbf{u}_N(t) - \mathbf{u}(t)\| \le C_u W_{\mathbf{u}}(N^{-\frac{1}{2}}), \qquad \|\dot{\mathbf{x}}_N(t) - \dot{\mathbf{x}}(t)\| \le C_{\dot{x}} W_{\dot{\mathbf{x}}}(N^{-\frac{1}{2}}),$$
$$\|\tilde{\boldsymbol{\lambda}}_N(t) - \boldsymbol{\lambda}(t)\| \le C_{\lambda} W_{\boldsymbol{\lambda}}(N^{-\frac{1}{2}}), \qquad \|\tilde{\boldsymbol{\mu}}_N(t) - \boldsymbol{\mu}(t)\| \le C_{\mu} W_{\boldsymbol{\mu}}(N^{-\frac{1}{2}}), \qquad \|\dot{\tilde{\boldsymbol{\lambda}}}_N(t) - \dot{\boldsymbol{\lambda}}(t)\| \le C_{\dot{\lambda}} W_{\dot{\boldsymbol{\lambda}}}(N^{-\frac{1}{2}}), \qquad (31)$$
where $C_{\lambda} < 5 n_x/4$, $C_{\mu} < 5 n_h/4$, $C_{\dot{\lambda}} < 9 n_x/4$, and $W_{\boldsymbol{\lambda}}(\cdot)$, $W_{\boldsymbol{\mu}}(\cdot)$ and $W_{\dot{\boldsymbol{\lambda}}}(\cdot)$ are the moduli of continuity of $\boldsymbol{\lambda}(t)$, $\boldsymbol{\mu}(t)$ and $\dot{\boldsymbol{\lambda}}(t)$, respectively.
Now, we show that the bound in Equation (20) is satisfied. Using Equation (30), and adding and subtracting $w\big(\boldsymbol{\mu}^\top(t_k)\,\mathbf{h}(\mathbf{x}_N(t_k), \mathbf{u}_N(t_k)) + \boldsymbol{\mu}^\top(t_k)\,\mathbf{h}(\mathbf{x}(t_k), \mathbf{u}(t_k))\big)$, we get
$$\begin{aligned}
\|\boldsymbol{\mu}_N^\top(t_k)\, \mathbf{h}(\mathbf{x}_N(t_k), \mathbf{u}_N(t_k))\| &= w\, \|\tilde{\boldsymbol{\mu}}_N^\top(t_k)\, \mathbf{h}(\mathbf{x}_N(t_k), \mathbf{u}_N(t_k))\| \\
&\le \|w\,(\tilde{\boldsymbol{\mu}}_N(t_k) - \boldsymbol{\mu}(t_k))^\top \mathbf{h}(\mathbf{x}_N(t_k), \mathbf{u}_N(t_k))\| + \|w\, \boldsymbol{\mu}^\top(t_k)\, \mathbf{h}(\mathbf{x}(t_k), \mathbf{u}(t_k))\| \\
&\quad + \|w\, \boldsymbol{\mu}^\top(t_k)\, \big(\mathbf{h}(\mathbf{x}_N(t_k), \mathbf{u}_N(t_k)) - \mathbf{h}(\mathbf{x}(t_k), \mathbf{u}(t_k))\big)\|.
\end{aligned}$$
Using Equation (14), the above inequality reduces to
$$\begin{aligned}
\|\boldsymbol{\mu}_N^\top(t_k)\, \mathbf{h}(\mathbf{x}_N(t_k), \mathbf{u}_N(t_k))\| &\le \|w\,(\tilde{\boldsymbol{\mu}}_N(t_k) - \boldsymbol{\mu}(t_k))^\top \mathbf{h}(\mathbf{x}_N(t_k), \mathbf{u}_N(t_k))\| + \|w\, \boldsymbol{\mu}^\top(t_k)\, \big(\mathbf{h}(\mathbf{x}_N(t_k), \mathbf{u}_N(t_k)) - \mathbf{h}(\mathbf{x}(t_k), \mathbf{u}(t_k))\big)\| \\
&\le w\, \|\mathbf{h}(\mathbf{x}_N(t_k), \mathbf{u}_N(t_k))\|\, C_{\mu} W_{\boldsymbol{\mu}}(N^{-\frac{1}{2}}) + w\, \|\boldsymbol{\mu}(t_k)\|\, L_{\mathbf{h}}\big(C_x W_{\mathbf{x}}(N^{-\frac{1}{2}}) + C_u W_{\mathbf{u}}(N^{-\frac{1}{2}})\big),
\end{aligned}$$
where we used the bounds in Equation (31) together with the Lipschitz assumption on $\mathbf{h}$ (see Assumption 1). Finally, using Assumptions 1 and 2, it follows that $\mathbf{h}$ and $\boldsymbol{\mu}$ are bounded on $[0,1]$ with bounds $h_{\max}$ and $\mu_{\max}$, respectively. Therefore, we get
$$\|\boldsymbol{\mu}_N^\top(t_k)\, \mathbf{h}(\mathbf{x}_N(t_k), \mathbf{u}_N(t_k))\| \le w\big[h_{\max} C_{\mu} W_{\boldsymbol{\mu}}(N^{-\frac{1}{2}}) + \mu_{\max} L_{\mathbf{h}}\big(C_x W_{\mathbf{x}}(N^{-\frac{1}{2}}) + C_u W_{\mathbf{u}}(N^{-\frac{1}{2}})\big)\big],$$
which implies that the bound in Equation (20) is satisfied with $\delta_D^N$ given by Equation (24) and $C_D > h_{\max} C_{\mu} + \mu_{\max} L_{\mathbf{h}}(C_x + C_u)$. Similarly,
$$-\boldsymbol{\mu}_N(t_k) = -w\, \tilde{\boldsymbol{\mu}}_N(t_k) \le -w\, \boldsymbol{\mu}(t_k) + w\, \|\boldsymbol{\mu}(t_k) - \tilde{\boldsymbol{\mu}}_N(t_k)\|\, \mathbf{1} \le N^{-1} C_{\mu} W_{\boldsymbol{\mu}}(N^{-\frac{1}{2}})\, \mathbf{1},$$
where the last step uses $\boldsymbol{\mu}(t_k) \ge \mathbf{0}$ (Equation (14)) and $w \le N^{-1}$, which proves that Equation (20) holds.
Now, consider the left inequality in Equation (21). For $k = 0$, we have
$$\frac{\partial L^N}{\partial \bar{\mathbf{x}}_{0,N}} = E_{\mathbf{x}(0)}(\mathbf{x}_N(0), \mathbf{x}_N(t_N)) + w \sum_{j=0}^{N} F_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) + \sum_{j=0}^{N} \boldsymbol{\lambda}_N^\top(t_j)\big(\mathbf{f}_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) - \dot{b}_{0,N}(t_j)\, \mathbf{I}\big) + \sum_{j=0}^{N} \boldsymbol{\mu}_N^\top(t_j)\, \mathbf{h}_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) + \bar{\boldsymbol{\nu}}^\top \mathbf{e}_{\mathbf{x}(0)}(\mathbf{x}_N(0), \mathbf{x}_N(t_N)).$$
Substituting $w \tilde{\boldsymbol{\lambda}}_N(t_j) = \boldsymbol{\lambda}_N(t_j)$ and $w \tilde{\boldsymbol{\mu}}_N(t_j) = \boldsymbol{\mu}_N(t_j)$, the equation above can be written as
$$\frac{\partial L^N}{\partial \bar{\mathbf{x}}_{0,N}} = E_{\mathbf{x}(0)}(\mathbf{x}_N(0), \mathbf{x}_N(t_N)) + w \sum_{j=0}^{N} F_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) + w \sum_{j=0}^{N} \tilde{\boldsymbol{\lambda}}_N^\top(t_j)\big(\mathbf{f}_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) - \dot{b}_{0,N}(t_j)\, \mathbf{I}\big) + w \sum_{j=0}^{N} \tilde{\boldsymbol{\mu}}_N^\top(t_j)\, \mathbf{h}_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) + \bar{\boldsymbol{\nu}}^\top \mathbf{e}_{\mathbf{x}(0)}(\mathbf{x}_N(0), \mathbf{x}_N(t_N)). \qquad (33)$$
Notice that the following inequalities are satisfied:
$$\Big\|w \sum_{j=0}^{N} F_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) - \int_0^1 F_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\, b_{0,N}(t)\, dt\Big\| \le \bar{C}_1\big(N^{-\frac{1}{2}} + W_{\mathbf{x}}(N^{-\frac{1}{2}}) + W_{\mathbf{u}}(N^{-\frac{1}{2}})\big), \qquad \mathrm{(34a)}$$
$$\Big\|w \sum_{j=0}^{N} \tilde{\boldsymbol{\lambda}}_N^\top(t_j)\, \dot{b}_{0,N}(t_j) - \int_0^1 \boldsymbol{\lambda}^\top(t)\, \dot{b}_{0,N}(t)\, dt\Big\| \le \bar{C}_2\big(N^{-\frac{1}{2}} + W_{\boldsymbol{\lambda}}(N^{-\frac{1}{2}})\big), \qquad \mathrm{(34b)}$$
$$\Big\|w \sum_{j=0}^{N} \tilde{\boldsymbol{\lambda}}_N^\top(t_j)\, \mathbf{f}_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) - \int_0^1 \boldsymbol{\lambda}^\top(t)\, \mathbf{f}_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\, b_{0,N}(t)\, dt\Big\| \le \bar{C}_3\big(N^{-\frac{1}{2}} + W_{\boldsymbol{\lambda}}(N^{-\frac{1}{2}}) + W_{\mathbf{x}}(N^{-\frac{1}{2}}) + W_{\mathbf{u}}(N^{-\frac{1}{2}})\big), \qquad \mathrm{(34c)}$$
$$\Big\|w \sum_{j=0}^{N} \tilde{\boldsymbol{\mu}}_N^\top(t_j)\, \mathbf{h}_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) - \int_0^1 \boldsymbol{\mu}^\top(t)\, \mathbf{h}_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\, b_{0,N}(t)\, dt\Big\| \le \bar{C}_4\big(N^{-\frac{1}{2}} + W_{\boldsymbol{\mu}}(N^{-\frac{1}{2}}) + W_{\mathbf{x}}(N^{-\frac{1}{2}}) + W_{\mathbf{u}}(N^{-\frac{1}{2}})\big), \qquad \mathrm{(34d)}$$
for some positive constants $\bar{C}_1$, $\bar{C}_2$, $\bar{C}_3$ and $\bar{C}_4$ independent of $N$. Proof of the above inequalities is given in Appendix A. Then, the combination of Equations (33) and (34) yields the following inequality
$$\Big\|\frac{\partial L^N}{\partial \bar{\mathbf{x}}_{0,N}}\Big\| \le \Big\|E_{\mathbf{x}(0)}(\mathbf{x}(0), \mathbf{x}(1)) + \int_0^1 F_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\, b_{0,N}(t)\, dt - \int_0^1 \boldsymbol{\lambda}^\top(t)\, \dot{b}_{0,N}(t)\, dt + \int_0^1 \boldsymbol{\lambda}^\top(t)\, \mathbf{f}_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\, b_{0,N}(t)\, dt + \int_0^1 \boldsymbol{\mu}^\top(t)\, \mathbf{h}_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\, b_{0,N}(t)\, dt + \bar{\boldsymbol{\nu}}^\top \mathbf{e}_{\mathbf{x}(0)}(\mathbf{x}(0), \mathbf{x}(1))\Big\| + \bar{C} \max\big\{N^{-\frac{1}{2}}, W_{\mathbf{x}}(N^{-\frac{1}{2}}), W_{\mathbf{u}}(N^{-\frac{1}{2}}), W_{\boldsymbol{\lambda}}(N^{-\frac{1}{2}}), W_{\boldsymbol{\mu}}(N^{-\frac{1}{2}})\big\},$$
with $\bar{C} \ge 4 \max\{\bar{C}_1, \bar{C}_2, \bar{C}_3, \bar{C}_4\}$. Using integration by parts, we have $\int_0^1 \boldsymbol{\lambda}^\top(t)\, \dot{b}_{0,N}(t)\, dt = -\int_0^1 \dot{\boldsymbol{\lambda}}^\top(t)\, b_{0,N}(t)\, dt + \big[\boldsymbol{\lambda}^\top(t)\, b_{0,N}(t)\big]_0^1$. Thus, since $b_{0,N}(0) = 1$ and $b_{0,N}(1) = 0$, the above inequality becomes
$$\Big\|\frac{\partial L^N}{\partial \bar{\mathbf{x}}_{0,N}}\Big\| \le \Big\|E_{\mathbf{x}(0)}(\mathbf{x}(0), \mathbf{x}(1)) + \boldsymbol{\lambda}(0) + \boldsymbol{\nu}^\top \mathbf{e}_{\mathbf{x}(0)}(\mathbf{x}(0), \mathbf{x}(1))\Big\| + \Big\|\int_0^1 \big(\dot{\boldsymbol{\lambda}}(t) + F_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t)) + \boldsymbol{\lambda}^\top(t)\, \mathbf{f}_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t)) + \boldsymbol{\mu}^\top(t)\, \mathbf{h}_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\big)\, b_{0,N}(t)\, dt\Big\| + \bar{C} \max\big\{N^{-\frac{1}{2}}, W_{\mathbf{x}}(N^{-\frac{1}{2}}), W_{\mathbf{u}}(N^{-\frac{1}{2}}), W_{\boldsymbol{\lambda}}(N^{-\frac{1}{2}}), W_{\boldsymbol{\mu}}(N^{-\frac{1}{2}})\big\}.$$
Finally, using Equations (15) and (16), the above inequality reduces to the left condition in Equation (21) for $k = 0$, with $\delta_D^N$ given by Equation (24) and $C_D \ge \bar{C}$. The same condition for $k = 1, \ldots, N$ can be shown to be satisfied using an identical argument. The stationarity condition on the right of Equation (21) can also be verified similarly, and the computations are thus omitted. To show that the closure condition (22) is satisfied, we use the definitions in Equations (26) and (27) together with the end point values property of Bernstein polynomials (Property 1 in Section 2), which gives
$$\Big\|\frac{\boldsymbol{\lambda}_N(0)}{w} + \bar{\boldsymbol{\nu}}^\top \mathbf{e}_{\mathbf{x}(0)}(\mathbf{x}_N(0), \mathbf{x}_N(t_N)) + E_{\mathbf{x}(0)}(\mathbf{x}_N(0), \mathbf{x}_N(t_N))\Big\| = \Big\|\boldsymbol{\lambda}(0) + \boldsymbol{\nu}^\top \mathbf{e}_{\mathbf{x}(0)}(\mathbf{x}(0), \mathbf{x}(1)) + E_{\mathbf{x}(0)}(\mathbf{x}(0), \mathbf{x}(1))\Big\| = 0,$$
where the last equality follows from Equation (16). An identical argument can be used to show that the closure condition (23) holds, thus completing the proof of Theorem 1. □
Corollary 1.
If solutions $\mathbf{x}^*(t)$, $\mathbf{u}^*(t)$, $\boldsymbol{\lambda}^*(t)$, $\boldsymbol{\mu}^*(t)$ and $\boldsymbol{\nu}^*$ of Problem $P^{\lambda}$ exist and satisfy $\dot{\mathbf{x}}^*(t) \in C_{n_x}^2$, $\mathbf{u}^*(t) \in C_{n_u}^2$, $\dot{\boldsymbol{\lambda}}^*(t) \in C_{n_x}^2$ and $\boldsymbol{\mu}^*(t) \in C_{n_h}^2$ on $[0,1]$, then Theorem 1 holds with $\delta_P^N = C_P N^{-1}$ and $\delta_D^N = C_D N^{-1}$, where $C_P$ and $C_D$ are positive constants independent of the order of approximation $N$.
Proof. 
The proof of Corollary 1 follows easily by applying Lemma 2 to the proof of Theorem 1. □
Remark 4.
We notice that for an arbitrarily small scalar $\epsilon_D > 0$, there exists $N_1$ such that, for all $N \ge N_1$, we have $\delta_D^N \le \epsilon_D$; i.e., the relaxation bound in Problem $P_N^{\lambda,\mathrm{clos}}$ can be made arbitrarily small by choosing a sufficiently large $N$.
Theorem 2
(Consistency). Let $\{(\bar{\mathbf{x}}_N^*, \bar{\mathbf{u}}_N^*, \bar{\boldsymbol{\lambda}}_N^*, \bar{\boldsymbol{\mu}}_N^*, \bar{\boldsymbol{\nu}}^*)\}_{N=N_1}^{\infty}$ be a sequence of solutions of Problem $P_N^{\lambda,\mathrm{clos}}$. Consider the sequence of transformed solutions $\{(\bar{\mathbf{x}}_N^*, \bar{\mathbf{u}}_N^*, \tilde{\bar{\boldsymbol{\lambda}}}_N^*, \tilde{\bar{\boldsymbol{\mu}}}_N^*, \bar{\boldsymbol{\nu}}^*)\}_{N=N_1}^{\infty}$, with
$$\tilde{\bar{\boldsymbol{\lambda}}}_{j,N}^* = \frac{\bar{\boldsymbol{\lambda}}_{j,N}^*}{w}, \qquad \tilde{\bar{\boldsymbol{\mu}}}_{j,N}^* = \frac{\bar{\boldsymbol{\mu}}_{j,N}^*}{w}, \qquad (37)$$
and the corresponding polynomial approximation $\{(\mathbf{x}_N^*(t), \mathbf{u}_N^*(t), \tilde{\boldsymbol{\lambda}}_N^*(t), \tilde{\boldsymbol{\mu}}_N^*(t), \bar{\boldsymbol{\nu}}^*)\}_{N=N_1}^{\infty}$. Assume that the latter has a uniform accumulation point, i.e.,
$$\lim_{N \to \infty} \big(\mathbf{x}_N^*(t), \mathbf{u}_N^*(t), \tilde{\boldsymbol{\lambda}}_N^*(t), \tilde{\boldsymbol{\mu}}_N^*(t), \bar{\boldsymbol{\nu}}^*\big) = \big(\mathbf{x}(t), \mathbf{u}(t), \tilde{\boldsymbol{\lambda}}(t), \tilde{\boldsymbol{\mu}}(t), \bar{\boldsymbol{\nu}}\big), \qquad \forall\, t \in [0,1],$$
and assume that $\dot{\mathbf{x}}(t)$, $\mathbf{u}(t)$, $\dot{\tilde{\boldsymbol{\lambda}}}(t)$ and $\tilde{\boldsymbol{\mu}}(t)$ are continuous on $[0,1]$. Then,
$$\big(\mathbf{x}(t), \mathbf{u}(t), \tilde{\boldsymbol{\lambda}}(t), \tilde{\boldsymbol{\mu}}(t), \bar{\boldsymbol{\nu}}\big)$$
is a solution of Problem $P^{\lambda}$.
Proof. 
The objective is to show that $\mathbf{x}(t)$, $\mathbf{u}(t)$, $\tilde{\boldsymbol{\lambda}}(t)$, $\tilde{\boldsymbol{\mu}}(t)$ and $\bar{\boldsymbol{\nu}}$ satisfy Equations (5)–(7) and (14)–(18). The satisfaction of Equations (5)–(7) has been demonstrated in ([42], Proof of Theorem 2). We start by showing Equation (14), and we do so using a proof by contradiction. Assume that $\mathbf{x}(t)$, $\mathbf{u}(t)$ and $\tilde{\boldsymbol{\mu}}(t)$ do not satisfy Equation (14). Then, there exists $t \in [0,1]$ such that
$$\|\tilde{\boldsymbol{\mu}}^\top(t)\, \mathbf{h}(\mathbf{x}(t), \mathbf{u}(t))\| > 0. \qquad (38)$$
Since the nodes $\{t_k\}_{k=0}^{N}$ are dense in $[0,1]$ as $N \to \infty$, there exists a sequence of indices $\{k_N\}_{N=0}^{\infty}$ such that
$$\lim_{N \to \infty} t_{k_N} = t,$$
which implies
$$\lim_{N \to \infty} \|\tilde{\boldsymbol{\mu}}(t) - \tilde{\boldsymbol{\mu}}(t_{k_N})\| = 0, \qquad \lim_{N \to \infty} \|\mathbf{x}(t) - \mathbf{x}(t_{k_N})\| = 0, \qquad \lim_{N \to \infty} \|\mathbf{u}(t) - \mathbf{u}(t_{k_N})\| = 0.$$
Then, we have
$$\begin{aligned}
\|\tilde{\boldsymbol{\mu}}^\top(t)\, \mathbf{h}(\mathbf{x}(t), \mathbf{u}(t))\| &\le \lim_{N \to \infty} \|(\tilde{\boldsymbol{\mu}}_N^*(t) - \tilde{\boldsymbol{\mu}}_N^*(t_{k_N}))^\top \mathbf{h}(\mathbf{x}_N^*(t), \mathbf{u}_N^*(t))\| \\
&\quad + \lim_{N \to \infty} \|\tilde{\boldsymbol{\mu}}_N^{*\top}(t_{k_N})\big(\mathbf{h}(\mathbf{x}_N^*(t), \mathbf{u}_N^*(t)) - \mathbf{h}(\mathbf{x}_N^*(t_{k_N}), \mathbf{u}_N^*(t_{k_N}))\big)\| \\
&\quad + \lim_{N \to \infty} \|\tilde{\boldsymbol{\mu}}_N^{*\top}(t_{k_N})\, \mathbf{h}(\mathbf{x}_N^*(t_{k_N}), \mathbf{u}_N^*(t_{k_N}))\| \\
&= \lim_{N \to \infty} \frac{1}{w}\, \|\boldsymbol{\mu}_N^{*\top}(t_{k_N})\, \mathbf{h}(\mathbf{x}_N^*(t_{k_N}), \mathbf{u}_N^*(t_{k_N}))\| = 0,
\end{aligned}$$
where we used Equation (20). This contradicts Equation (38). Similarly, we can show that $\tilde{\boldsymbol{\mu}}(t) \ge \mathbf{0}$, thus proving that $\mathbf{x}(t)$, $\mathbf{u}(t)$ and $\tilde{\boldsymbol{\mu}}(t)$ satisfy Equation (14).
Furthermore, we notice that if $\mathbf{x}(t)$, $\mathbf{u}(t)$, $\tilde{\boldsymbol{\lambda}}(t)$, $\tilde{\boldsymbol{\mu}}(t)$ and $\bar{\boldsymbol{\nu}}$ satisfy the limits of Equations (21)–(23) (with $\delta_D^N \to 0$), then the following holds for all $k = 0, \ldots, N$:
$$\tilde{\boldsymbol{\lambda}}(0) + \bar{\boldsymbol{\nu}}^\top \mathbf{e}_{\mathbf{x}(0)}(\mathbf{x}(0), \mathbf{x}(1)) + E_{\mathbf{x}(0)}(\mathbf{x}(0), \mathbf{x}(1)) = \mathbf{0},$$
$$\tilde{\boldsymbol{\lambda}}(1) - \bar{\boldsymbol{\nu}}^\top \mathbf{e}_{\mathbf{x}(1)}(\mathbf{x}(0), \mathbf{x}(1)) - E_{\mathbf{x}(1)}(\mathbf{x}(0), \mathbf{x}(1)) = \mathbf{0},$$
$$\int_0^1 \big[\dot{\tilde{\boldsymbol{\lambda}}}(t) + F_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t)) + \tilde{\boldsymbol{\lambda}}^\top(t)\, \mathbf{f}_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t)) + \tilde{\boldsymbol{\mu}}^\top(t)\, \mathbf{h}_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\big]\, b_{k,N}(t)\, dt = \mathbf{0},$$
$$\int_0^1 \big[F_{\mathbf{u}}(\mathbf{x}(t), \mathbf{u}(t)) + \tilde{\boldsymbol{\lambda}}^\top(t)\, \mathbf{f}_{\mathbf{u}}(\mathbf{x}(t), \mathbf{u}(t)) + \tilde{\boldsymbol{\mu}}^\top(t)\, \mathbf{h}_{\mathbf{u}}(\mathbf{x}(t), \mathbf{u}(t))\big]\, b_{k,N}(t)\, dt = \mathbf{0}.$$
Since $\{b_{k,N}(t)\}_{k=0}^{N}$ is a linearly independent basis set, the last two equations above imply
$$\dot{\tilde{\boldsymbol{\lambda}}}(t) + F_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t)) + \tilde{\boldsymbol{\lambda}}^\top(t)\, \mathbf{f}_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t)) + \tilde{\boldsymbol{\mu}}^\top(t)\, \mathbf{h}_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t)) = \mathbf{0},$$
$$F_{\mathbf{u}}(\mathbf{x}(t), \mathbf{u}(t)) + \tilde{\boldsymbol{\lambda}}^\top(t)\, \mathbf{f}_{\mathbf{u}}(\mathbf{x}(t), \mathbf{u}(t)) + \tilde{\boldsymbol{\mu}}^\top(t)\, \mathbf{h}_{\mathbf{u}}(\mathbf{x}(t), \mathbf{u}(t)) = \mathbf{0},$$
for all $t \in [0,1]$. This proves that $\mathbf{x}(t)$, $\mathbf{u}(t)$, $\tilde{\boldsymbol{\lambda}}(t)$, $\tilde{\boldsymbol{\mu}}(t)$ and $\bar{\boldsymbol{\nu}}$ satisfy Equations (15)–(18). □
Theorem 3
(Covector Mapping Theorem). Under the same assumptions as in Theorems 1 and 2, when $N \to \infty$, the covector mapping
$$\mathbf{x}_N^*(t) \to \mathbf{x}^*(t), \qquad \mathbf{u}_N^*(t) \to \mathbf{u}^*(t), \qquad \frac{\boldsymbol{\lambda}_N^*(t)}{w} \to \boldsymbol{\lambda}^*(t), \qquad \frac{\boldsymbol{\mu}_N^*(t)}{w} \to \boldsymbol{\mu}^*(t), \qquad \bar{\boldsymbol{\nu}}^* \to \boldsymbol{\nu}^*$$
is a bijective mapping between the solution of Problem $P_N^{\lambda,\mathrm{clos}}$ and the solution of Problem $P^{\lambda}$.
Proof. 
The above result follows directly from Theorems 1 and 2. In fact, if
$$\{\mathbf{x}^*(t), \mathbf{u}^*(t), \boldsymbol{\lambda}^*(t), \boldsymbol{\mu}^*(t), \boldsymbol{\nu}^*\}$$
is a solution to Problem $P^{\lambda}$, which exists by Assumption 2, then from Theorem 1 it follows that $\{\mathbf{x}^*(t), \mathbf{u}^*(t), w\boldsymbol{\lambda}^*(t), w\boldsymbol{\mu}^*(t), \boldsymbol{\nu}^*\}$ is a solution to Problem $P_N^{\lambda,\mathrm{clos}}$ (see Equations (26)–(28)). Conversely, by using Equation (37), a solution
$$\big(\mathbf{x}_N^*(t), \mathbf{u}_N^*(t), \boldsymbol{\lambda}_N^*(t), \boldsymbol{\mu}_N^*(t), \bar{\boldsymbol{\nu}}^*\big)$$
that solves Problem $P_N^{\lambda,\mathrm{clos}}$ provides a solution
$$\Big(\mathbf{x}_N^*(t), \mathbf{u}_N^*(t), \frac{\boldsymbol{\lambda}_N^*(t)}{w}, \frac{\boldsymbol{\mu}_N^*(t)}{w}, \bar{\boldsymbol{\nu}}^*\Big) = \big(\mathbf{x}_N^*(t), \mathbf{u}_N^*(t), \tilde{\boldsymbol{\lambda}}_N^*(t), \tilde{\boldsymbol{\mu}}_N^*(t), \bar{\boldsymbol{\nu}}^*\big)$$
that converges to a solution to Problem $P^{\lambda}$ (see Theorem 2). □
Remark 5.
Define the Hamiltonian approximation
$$H_N(t) = F(\mathbf{x}_N^*(t), \mathbf{u}_N^*(t)) + \frac{\boldsymbol{\lambda}_N^{*\top}(t)}{w}\, \mathbf{f}(\mathbf{x}_N^*(t), \mathbf{u}_N^*(t));$$
then, Theorem 3 implies $\lim_{N \to \infty} H_N(t) = H(t)$.
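In implementation terms, Theorem 3 and Remark 5 say that costate estimates are obtained by simply rescaling the KKT multipliers returned by the NLP solver by $1/w$. The sketch below is illustrative only (it reuses the Bernstein helpers sketched in Section 2, and `lam_bar`/`mu_bar` stand for the multiplier arrays associated with the relaxed dynamics and path constraints, which are assumed to be extracted from the solver); it performs this post-processing and evaluates the Hamiltonian approximation $H_N$.

```python
def covector_mapping(lam_bar, mu_bar=None):
    """Map NLP multipliers to costate estimates: lambda_tilde_N = lambda_N / w, w = 1/(N+1)."""
    lam_bar = np.atleast_2d(lam_bar)
    N = lam_bar.shape[0] - 1
    w = 1.0 / (N + 1)
    mu_tilde = None if mu_bar is None else np.atleast_2d(mu_bar) / w
    return lam_bar / w, mu_tilde

def hamiltonian_estimate(F, f, xbar, ubar, lam_tilde, t):
    """H_N(t) = F(x_N(t), u_N(t)) + lambda_tilde_N(t)^T f(x_N(t), u_N(t))  (Remark 5)."""
    xs, us, ls = (bernstein_poly(c, t) for c in (xbar, ubar, lam_tilde))
    return np.array([F(x, u) + l @ f(x, u) for x, u, l in zip(xs, us, ls)])
```

Plotting hamiltonian_estimate over a fine time grid for increasing $N$ gives plots analogous to Figures 3, 6 and 9.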

6. Numerical Examples

6.1. Example 1: 1D Minimum Time Problem

The first example we consider is the classical minimum time problem for a double integrator plant.
$$\min_{u}\; J = t_f,$$
subject to
$$\dot{x}_1 = x_2, \quad \dot{x}_2 = u, \quad x_1(0) = 1, \quad x_2(0) = 1, \quad x_1(t_f) = x_2(t_f) = 0, \quad |u(t)| \le 1, \quad t \in [0, t_f].$$
The analytical solution to this problem is well known: the optimal control input is bang-bang and the Hamiltonian along the optimal trajectories is equal to −1 [53]. Figure 2 includes plots of the state and control trajectories for this problem for $N = 45$, as well as a graph of the analytical solution. It is clear that the Bernstein polynomial solution captures the switching time precisely and closely approximates the optimal control input. Figure 3 includes the plot of the Hamiltonian approximation $H_N$ computed using the Covector Mapping Theorem. As expected, it is equal to −1, with the exception of a small bump around the switching time.
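For reference, a self-contained sketch of this example with the Bernstein transcription and a generic NLP solver is given below (SciPy's SLSQP is assumed here purely for illustration; the figures in this section were not necessarily produced with this code). The free final time is handled by rescaling time to $[0,1]$, so the dynamics are multiplied by $t_f$, and the bound $|u| \le 1$ is enforced by bounding the Bernstein coefficients of $u_N$, which is sufficient by the convex hull property of Bernstein polynomials.

```python
import numpy as np
from scipy.optimize import minimize
from scipy.special import comb

N = 16                                     # order of approximation (kept small for speed)
nodes = np.arange(N + 1) / N

def basis(deg, t):                         # rows: time nodes, columns: basis polynomials
    return np.stack([comb(deg, j) * t**j * (1 - t)**(deg - j) for j in range(deg + 1)], axis=-1)

B, Bd = basis(N, nodes), basis(N - 1, nodes)

def unpack(z):                             # z = [x1 coeffs, x2 coeffs, u coeffs, tf]
    return z[:N+1], z[N+1:2*(N+1)], z[2*(N+1):3*(N+1)], z[-1]

def ddt(c):                                # Bernstein derivative coefficients (degree N-1)
    return N * (c[1:] - c[:-1])

def dynamics(z):                           # dynamics imposed at the nodes (cf. Equation (10))
    x1, x2, u, tf = unpack(z)
    return np.concatenate([Bd @ ddt(x1) - tf * (B @ x2),    # x1' = tf * x2 at the nodes
                           Bd @ ddt(x2) - tf * (B @ u)])    # x2' = tf * u  at the nodes

def boundary(z):                           # end point values property: first/last coefficients
    x1, x2, _, _ = unpack(z)
    return np.array([x1[0] - 1.0, x2[0] - 1.0, x1[-1], x2[-1]])

z0 = np.concatenate([np.linspace(1, 0, N+1), np.linspace(1, 0, N+1), np.zeros(N+1), [3.0]])
bnds = [(None, None)] * (2*(N+1)) + [(-1.0, 1.0)] * (N+1) + [(0.1, 10.0)]
res = minimize(lambda z: z[-1], z0, method="SLSQP", bounds=bnds,
               constraints=[{"type": "eq", "fun": dynamics}, {"type": "eq", "fun": boundary}],
               options={"maxiter": 500, "ftol": 1e-8})
print("estimated minimum time t_f =", res.x[-1])
```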

6.2. Example 2: 3D Minimum Time Problem

In this example, we consider a minimum-time problem for a simplified 3D model of a multi-rotor drone. The vehicle is asked to reach the origin in minimum time from a given initial condition with all the control inputs bounded by ± 1 . Unlike Example 1, there is no known analytical solution to this problem. However, we know that the Hamiltonian along the optimal trajectories is equal to −1 [53].
$$\min_{\mathbf{u}}\; J = t_f,$$
subject to
$$\begin{gathered}
\dot{x}_1 = x_4, \quad \dot{x}_2 = x_5, \quad \dot{x}_3 = x_6, \quad \dot{x}_4 = u_1, \quad \dot{x}_5 = u_2, \quad \dot{x}_6 = g + u_3, \\
x_1(0) = 1, \quad x_2(0) = 2, \quad x_3(0) = 3, \quad x_4(0) = 1, \quad x_5(0) = 1, \quad x_6(0) = 1, \\
x_1(t_f) = x_2(t_f) = x_3(t_f) = 0, \quad x_4(t_f) = x_5(t_f) = x_6(t_f) = 0, \\
|u_1(t)| \le 1, \quad |u_2(t)| \le 1, \quad |g + u_3(t)| \le 1, \quad t \in [0, t_f].
\end{gathered}$$
Figure 4 shows the 3D plot of the position of the vehicle. The vehicle clearly reaches the origin from the given initial condition. Figure 5 includes graphs of the control inputs; they satisfy the ±1 bound imposed in the problem formulation. Finally, Figure 6 shows the plot of the Hamiltonian approximation $H_N$ for $N = 45$. It is equal to −1, as predicted by the theory, thus indicating that the solution obtained is indeed close to optimal.

7. Defense against a Swarm Attack

The numerical analysis presented here involves a scenario in which an enemy swarm attempts to destroy a high-value unit (HVU). The HVU is defended by a number of defending agents whose trajectories are optimized to maximize the probability of HVU survival. The attacking agents' dynamics are defined using the Leonard swarm dynamics model [54]. A virtual leader guides the attacking swarm towards the HVU's position. The attacking and defending agents are equipped with weapons systems that allow them to inflict damage on each other: the attacking agents inflict damage on the defending agents and try to destroy the HVU, while the defending agents inflict damage on the attacking agents and attempt to destroy them or herd them away.
Attacking agent $i \in \{1, \ldots, N_A\}$ has position $\mathbf{x}_i(t) \in \mathbb{R}^3$ and defending agent $k \in \{1, \ldots, N_D\}$ has position $\mathbf{s}_k(t) \in \mathbb{R}^3$. The equations of motion for attacker $i$ are
$$\ddot{\mathbf{x}}_i = \sum_{\substack{j=1 \\ j \ne i}}^{N_A} f_I(\|\mathbf{x}_{ij}\|)\, \frac{\mathbf{x}_{ij}}{\|\mathbf{x}_{ij}\|} + \sum_{k=1}^{N_D} f_d(\|\mathbf{s}_{ik}\|)\, \frac{\mathbf{s}_{ik}}{\|\mathbf{s}_{ik}\|} + K\, \frac{\mathbf{h}_i}{\|\mathbf{h}_i\|} - b\, \dot{\mathbf{x}}_i, \qquad (40)$$
for $i = 1, \ldots, N_A$. There are four terms in this equation, representing: (1) attractive and repulsive forces $f_I(\|\mathbf{x}_{ij}\|)$ exerted by the other attacking agents $j$, where $\mathbf{x}_{ij} = \mathbf{x}_i - \mathbf{x}_j$ is the relative position of attackers $i$ and $j$; (2) a constant "virtual leader" force of magnitude $K$ pulling the attackers toward the HVU's position, where $\mathbf{h}_i = \mathbf{h} - \mathbf{x}_i$ and $\mathbf{h}$ is the position of the HVU; (3) purely repulsive forces $f_d(\|\mathbf{s}_{ik}\|)$ due to the defending agents, where $\mathbf{s}_{ik} = \mathbf{x}_i - \mathbf{s}_k$ is the relative position of attacker $i$ and defender $k$; and (4) a damping force proportional to $\dot{\mathbf{x}}_i$.
For the mathematical forms of $f_I$ and $f_d$, we have chosen the Leonard model [54], i.e., $f_I$ and $f_d$ can be written as gradients of scalar potential functions that depend only on $\|\mathbf{x}_{ij}\|$ and $\|\mathbf{s}_{ik}\|$, respectively. The force $f_I$ is repulsive when $\|\mathbf{x}_{ij}\| \le d_0$, attractive when $d_0 < \|\mathbf{x}_{ij}\| \le d_1$, and zero when $\|\mathbf{x}_{ij}\| > d_1$. For $f_d$, we only keep the repulsive term (since attackers should not be attracted to defenders), i.e., $f_d$ is repulsive when $\|\mathbf{s}_{ik}\| \le s_0$ and zero when $\|\mathbf{s}_{ik}\| > s_0$.
Defending agent $i$'s dynamics are given by
$$\ddot{\mathbf{s}}_i = \mathbf{u}_i, \qquad i = 1, \ldots, N_D, \qquad \mathbf{s}_i(t), \mathbf{u}_i(t) \in \mathbb{R}^3, \qquad (41)$$
where the absolute value of each element of $\mathbf{u}_i$ ($|u_{ij}|$, $j = 1, 2, 3$) is bounded by $u_{\max} = 1$.
Mutual attrition model: for hostile swarm engagements, agents are equipped with some weapons systems. The likelihood of destruction of an agent depends on its position (how close it has come to enemies) as well as the positions of those enemy agents, since each agent’s ability to inflict damage is contingent on its own survival. To model this mutual attrition, we use a damage function to track the probability that defender k is destroyed by a shot from attacker i, and vice versa. We choose a cumulative normal distribution function, Φ , to model the damage function [55]. Next, we define (i) the attrition rate at which attacker i is destroyed due to defender k, d i k att , (ii) the attrition rate of defender k due to attacker i, d k i def , and (iii) the attrition rate of the HVU, d i hvu , as follows:
$$d_{ik}^{\mathrm{att}} = \lambda_d\, \Phi\!\left(-\frac{\|\mathbf{s}_{ik}\|^2}{\sigma_d}\right), \qquad d_{ki}^{\mathrm{def}} = \lambda_a\, \Phi\!\left(-\frac{\|\mathbf{s}_{ik}\|^2}{\sigma_a}\right), \qquad d_i^{\mathrm{hvu}} = \lambda_a\, \Phi\!\left(-\frac{\|\mathbf{h}_i\|^2}{\sigma_a}\right). \qquad (42)$$
In the above equation, $\sigma_d$ is a defender Poisson parameter that corresponds to the range of the defenders' weapons, $\lambda_d$ is a defender Poisson parameter that corresponds to the defenders' rate of fire, $\sigma_a$ is an attacker Poisson parameter that corresponds to the attackers' range, and $\lambda_a$ is an attacker Poisson parameter that corresponds to the attackers' rate of fire. The probability of defender $k$ destroying attacker $i$ during a time interval of duration $\Delta t$ is weighted by the current survival probability of defender $k$, $P_k^d(t)$. Thus, the probability that defender $k$ destroys attacker $i$ during a given time interval $[t, t+\Delta t]$ is $P_k^d(t)\, d_{ik}^{\mathrm{att}}\, \Delta t$. Assuming independence (i.e., defenders do not coordinate their fire), the expression $\prod_{k=1}^{N_D}\big(1 - d_{ik}^{\mathrm{att}} P_k^d(t)\, \Delta t\big)$ represents the probability that the $i$th attacker survives the time interval $[t, t+\Delta t]$ despite attrition from all defenders. Therefore, the survival probability $Q_i(t+\Delta t)$ of attacker $i$ is governed by
$$Q_i(t+\Delta t) = Q_i(t) \prod_{k=1}^{N_D} \big(1 - d_{ik}^{\mathrm{att}}\, P_k^d(t)\, \Delta t\big), \qquad (43)$$
where we assumed that the survival probabilities of attacker $i$, $Q_i(t_1)$ and $Q_i(t_2)$, are independent for any $t_1, t_2$. Similarly, the survival probability $P_k^d(t)$ of defender $k$ and the survival probability $P(t)$ of the HVU are governed by
$$P_k^d(t+\Delta t) = P_k^d(t) \prod_{i=1}^{N_A} \big(1 - d_{ki}^{\mathrm{def}}\, Q_i(t)\, \Delta t\big), \qquad P(t+\Delta t) = P(t) \prod_{k=1}^{N_A} \big(1 - d_k^{\mathrm{hvu}}\, Q_k(t)\, \Delta t\big). \qquad (44)$$
Initial conditions are set to $Q_i(0) = P_k^d(0) = P(0) = 1$ for all agents and the HVU.
Further rearranging Equations (43) and (44) and letting $\Delta t \to 0$, as derived in [56], we obtain
$$\dot{Q}_i(t) = -Q_i(t) \sum_{k=1}^{N_D} d_{ik}^{\mathrm{att}}\, P_k^d(t), \qquad \dot{P}_k^d(t) = -P_k^d(t) \sum_{i=1}^{N_A} d_{ki}^{\mathrm{def}}\, Q_i(t), \qquad (45)$$
$$\dot{P}(t) = -P(t) \sum_{k=1}^{N_A} d_k^{\mathrm{hvu}}\, Q_k(t). \qquad (46)$$
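A sketch of Equations (42) and (45)–(46) in code is given below (illustrative only; the normal CDF Φ is taken from SciPy, and the damage functions are assumed to decay with the squared distance, consistent with the reconstruction of Equation (42) above):

```python
import numpy as np
from scipy.stats import norm

def attrition_rates(x_att, s_def, h_pos, lam_d, sig_d, lam_a, sig_a):
    """Distance-dependent attrition rates, Equation (42) (sketch)."""
    dist2 = ((x_att[:, None, :] - s_def[None, :, :])**2).sum(-1)     # ||s_ik||^2, shape (N_A, N_D)
    d_att = lam_d * norm.cdf(-dist2 / sig_d)     # rate at which attacker i is destroyed by defender k
    d_def = lam_a * norm.cdf(-dist2 / sig_a)     # rate at which defender k is destroyed by attacker i
    d_hvu = lam_a * norm.cdf(-((x_att - h_pos)**2).sum(-1) / sig_a)  # rate of HVU damage by attacker i
    return d_att, d_def, d_hvu

def survival_rhs(Q, Pd, P, d_att, d_def, d_hvu):
    """Right-hand sides of Equations (45)-(46) for the survival probabilities."""
    Q_dot  = -Q  * (d_att * Pd[None, :]).sum(axis=1)    # attackers
    Pd_dot = -Pd * (d_def * Q[:, None]).sum(axis=0)     # defenders
    P_dot  = -P  * (d_hvu * Q).sum()                    # high-value unit
    return Q_dot, Pd_dot, P_dot
```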
The optimal control problem at hand can be expressed as Problem P by properly rescaling the time variable, i.e., $\tau = t/t_f$; see Equations (4)–(7). In particular, we seek to maximize the probability of HVU survival at the terminal time $t = t_f$ ($\tau = 1$), i.e., to minimize $I = 1 - P(t_f)$. The system's state $\mathbf{x}(t)$ includes the attacker and defender positions and velocities, as well as the probabilities of attacker and defender survival and the probability of HVU survival:
$$\mathbf{x} = \big[\mathbf{x}_1^\top, \ldots, \mathbf{x}_{N_A}^\top,\; (\mathbf{v}_1^x)^\top, \ldots, (\mathbf{v}_{N_A}^x)^\top,\; \mathbf{s}_1^\top, \ldots, \mathbf{s}_{N_D}^\top,\; (\mathbf{v}_1^s)^\top, \ldots, (\mathbf{v}_{N_D}^s)^\top,\; Q_1, \ldots, Q_{N_A},\; P_1^d, \ldots, P_{N_D}^d,\; P\big]^\top,$$
where $\mathbf{v}_i^x$ is the velocity of the $i$th attacker and $\mathbf{v}_k^s$ is the velocity of the $k$th defender. The control input vector is defined by stacking the accelerations of the defenders, $\mathbf{u} = \big[\mathbf{u}_1^\top, \ldots, \mathbf{u}_{N_D}^\top\big]^\top$. With these definitions of the system's state and control input, the system dynamics function $\mathbf{f}(\cdot,\cdot)$ is given by the concatenation of Equations (40), (41), (45) and (46). Finally, the function $\mathbf{h}(\cdot,\cdot)$ in this case depends on the defender control inputs only and encodes the bounds $\pm u_{\max}$.
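One possible way to organize the stacked state and the concatenated dynamics in code is sketched below (an illustration under the assumptions above; the attacker interaction forces of Equation (40) are delegated to a caller-supplied function `attacker_accel`, and the parameter dictionary keys are hypothetical):

```python
def unpack_state(x, n_att, n_def):
    """Split the stacked state of the swarm problem into its blocks."""
    sizes = [3*n_att, 3*n_att, 3*n_def, 3*n_def, n_att, n_def, 1]
    blocks, i = [], 0
    for s in sizes:
        blocks.append(x[i:i + s]); i += s
    p_att, v_att, p_def, v_def, Q, Pd, P = blocks
    return (p_att.reshape(n_att, 3), v_att.reshape(n_att, 3),
            p_def.reshape(n_def, 3), v_def.reshape(n_def, 3), Q, Pd, P[0])

def swarm_dynamics(x, u, h_pos, params, attacker_accel):
    """Concatenation of Equations (40), (41) and (45)-(46) for the stacked state."""
    n_att, n_def = params["n_att"], params["n_def"]
    p_att, v_att, p_def, v_def, Q, Pd, P = unpack_state(x, n_att, n_def)
    a_att = attacker_accel(p_att, v_att, p_def, h_pos, params)   # Equation (40), supplied by caller
    a_def = u.reshape(n_def, 3)                                  # Equation (41)
    d_att, d_def, d_hvu = attrition_rates(p_att, p_def, h_pos, params["lam_d"],
                                          params["sig_d"], params["lam_a"], params["sig_a"])
    Q_dot, Pd_dot, P_dot = survival_rhs(Q, Pd, P, d_att, d_def, d_hvu)
    return np.concatenate([v_att.ravel(), a_att.ravel(), v_def.ravel(), a_def.ravel(),
                           Q_dot, Pd_dot, [P_dot]])
```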
Figure 7 shows the results of an optimization with one defender protecting the HVU against a swarm of five attackers, for $N = 45$. The HVU is at the origin and the defender trajectory is shown in purple. The defender has a 50% larger weapons range than the attackers ($\sigma_d/\sigma_a = 1.5$), as well as double their rate of fire ($\lambda_d/\lambda_a = 2$). The defender initially herds the attackers on its right away from the HVU, then approaches the HVU and similarly herds the attackers on its left away from it. Figure 8 illustrates the control inputs that drive the motion of the defender. Unlike in the minimum time problems of the previous examples, in this case the inequality constraints on the control input are never active. Figure 9 shows a sequence of the Hamiltonian approximations $H_N$, $N = 5, \ldots, 45$. The sequence clearly converges to zero, indicating that the final numerical solution for $N = 45$ is indeed a close approximation of the true optimal solution.
The reader is referred to [57,58] for additional numerical examples.

8. Conclusions

This paper proposed a numerical method for costate estimation in nonlinear constrained optimal control problems using Bernstein polynomials. A rigorous analysis was provided showing convergence of the costate estimates to the dual variables of the continuous-time problem. To this end, a set of conditions was derived under which the Karush–Kuhn–Tucker multipliers of the NLP converge to the costates of the optimal control problem. This led to the formulation of the Covector Mapping Theorem for Bernstein approximation. The theoretical findings were validated through several numerical examples.

Author Contributions

Conceptualization, V.C., I.K., A.P., C.W.; methodology, V.C., I.K., A.P., C.W.; software, V.C., I.K.; validation, V.C., I.K., C.W.; investigation, V.C., I.K., A.P., C.W.; resources, V.C., I.K., A.P., N.H.; writing—original draft preparation, V.C., I.K.; writing—review and editing, V.C., I.K.; supervision, V.C., I.K., A.P., C.W., N.H.; project administration, V.C.; funding acquisition, V.C., I.K., A.P., N.H. All authors have read and agreed to the published version of the manuscript.

Funding

Venanzio Cichella was supported by Amazon, by the Office of Naval Research (grants N000141912106 and N000142112091), and by the National Science Foundation (grant 2136298). Antonio Pascoal was supported by H2020-EU.1.2.2—FET Proactive RAMONES (grant GA 101017808) and LARSyS-FCT (grant UIDB/50009/2020). Isaac Kaminer was supported by the Office of Naval Research (grants N0001421WX01974, N0001419WX00155, N0001422WX01906) and NPS CRUSER. Naira Hovakimyan was supported by NASA ULI (grant #80NSSC22M0070) and NSF RI (grant 2133656).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Equation (34)

Let us focus on Equation (34a). Adding and subtracting $\int_0^1 F_{\mathbf{x}}(\mathbf{x}_N(t), \mathbf{u}_N(t))\, b_{0,N}(t)\, dt$, we have
$$\begin{aligned}
&\Big\|w \sum_{j=0}^{N} F_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) - \int_0^1 F_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\, b_{0,N}(t)\, dt\Big\| \\
&\quad = \Big\|w \sum_{j=0}^{N} F_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) - \int_0^1 F_{\mathbf{x}}(\mathbf{x}_N(t), \mathbf{u}_N(t))\, b_{0,N}(t)\, dt + \int_0^1 F_{\mathbf{x}}(\mathbf{x}_N(t), \mathbf{u}_N(t))\, b_{0,N}(t)\, dt - \int_0^1 F_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\, b_{0,N}(t)\, dt\Big\| \\
&\quad \le \Big\|w \sum_{j=0}^{N} F_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) - \int_0^1 F_{\mathbf{x}}(\mathbf{x}_N(t), \mathbf{u}_N(t))\, b_{0,N}(t)\, dt\Big\| + \Big\|\int_0^1 F_{\mathbf{x}}(\mathbf{x}_N(t), \mathbf{u}_N(t))\, b_{0,N}(t)\, dt - \int_0^1 F_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\, b_{0,N}(t)\, dt\Big\|. \qquad \mathrm{(A1)}
\end{aligned}$$
Using Lemma 3 and the continuity of $F_{\mathbf{x}}(\mathbf{x}_N(t), \mathbf{u}_N(t))$ and $b_{0,N}(t)$, the first term on the right-hand side of the inequality above satisfies
$$\Big\|w \sum_{j=0}^{N} F_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) - \int_0^1 F_{\mathbf{x}}(\mathbf{x}_N(t), \mathbf{u}_N(t))\, b_{0,N}(t)\, dt\Big\| \le C_I\, W_{F_{\mathbf{x}} b_{0,N}}(N^{-\frac{1}{2}}),$$
where $W_{F_{\mathbf{x}} b_{0,N}}(\cdot)$ denotes the modulus of continuity of the product
$$F_{\mathbf{x}}(\mathbf{x}_N(t), \mathbf{u}_N(t))\, b_{0,N}(t),$$
with $F_{\mathbf{x}}(\mathbf{x}_N(t), \mathbf{u}_N(t))$ being a bounded function due to its continuity over a bounded domain; denote its bound by $F_{x,\max}$. Notice that $b_{0,N}(t)$ is also bounded, since $\max_{t \in [0,1]} b_{0,N}(t) \le 1$. Then, using the properties of the modulus of continuity, we get
$$\Big\|w \sum_{j=0}^{N} F_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) - \int_0^1 F_{\mathbf{x}}(\mathbf{x}_N(t), \mathbf{u}_N(t))\, b_{0,N}(t)\, dt\Big\| \le C_I F_{x,\max}\, W_{b_{0,N}}(N^{-\frac{1}{2}}) + C_I\, W_{F_{\mathbf{x}}}(N^{-\frac{1}{2}}) \le C_I F_{x,\max}\, N^{-\frac{1}{2}} + C_I\, W_{F_{\mathbf{x}}}(N^{-\frac{1}{2}}), \qquad \mathrm{(A2)}$$
where $W_{F_{\mathbf{x}}}(\cdot)$ is the modulus of continuity of $F_{\mathbf{x}}$, and $C_I$ is a positive constant independent of $N$. Furthermore, we have
$$\Big\|\int_0^1 F_{\mathbf{x}}(\mathbf{x}_N(t), \mathbf{u}_N(t))\, b_{0,N}(t)\, dt - \int_0^1 F_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\, b_{0,N}(t)\, dt\Big\| \le \int_0^1 \big\|F_{\mathbf{x}}(\mathbf{x}_N(t), \mathbf{u}_N(t))\, b_{0,N}(t) - F_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\, b_{0,N}(t)\big\|\, dt \le L_{F_{\mathbf{x}}}\big(C_x W_{\mathbf{x}}(N^{-\frac{1}{2}}) + C_u W_{\mathbf{u}}(N^{-\frac{1}{2}})\big), \qquad \mathrm{(A3)}$$
where $L_{F_{\mathbf{x}}}$ is the Lipschitz constant of $F_{\mathbf{x}}$, $C_x < 5 n_x/4$, $C_u < 5 n_u/4$, and $W_{\mathbf{x}}(\cdot)$ and $W_{\mathbf{u}}(\cdot)$ are the moduli of continuity of $\mathbf{x}$ and $\mathbf{u}$, respectively. Combining Equations (A2) and (A3) with Equation (A1) yields
$$\Big\|w \sum_{j=0}^{N} F_{\mathbf{x}}(\mathbf{x}_N(t_j), \mathbf{u}_N(t_j))\, b_{0,N}(t_j) - \int_0^1 F_{\mathbf{x}}(\mathbf{x}(t), \mathbf{u}(t))\, b_{0,N}(t)\, dt\Big\| \le C_I F_{x,\max}\, N^{-\frac{1}{2}} + C_I\, W_{F_{\mathbf{x}}}(N^{-\frac{1}{2}}) + L_{F_{\mathbf{x}}}\big(C_x W_{\mathbf{x}}(N^{-\frac{1}{2}}) + C_u W_{\mathbf{u}}(N^{-\frac{1}{2}})\big),$$
which proves the bound in Equation (34a). The bounds in Equations (34b)–(34d) follow using an identical argument.

References

  1. Ng, J.; Bräunl, T. Performance comparison of bug navigation algorithms. J. Intell. Robot. Syst. 2007, 50, 73–84. [Google Scholar] [CrossRef]
  2. Khatib, O. Real-time obstacle avoidance for manipulators and mobile robots. In Autonomous Robot Vehicles; Springer: Berlin/ Heidelberg, Germany, 1986; pp. 396–404. [Google Scholar]
  3. Siegwart, R.; Nourbakhsh, I.R.; Scaramuzza, D. Introduction to Autonomous Mobile Robots; MIT Press: Cambridge, MA, USA, 2011. [Google Scholar]
  4. Choset, H.M. Principles of Robot Motion: Theory, Algorithms, and Implementation; MIT Press: Cambridge, MA, USA, 2005. [Google Scholar]
  5. Latombe, J.C. Robot Motion Planning; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2012; Volume 124. [Google Scholar]
  6. Betts, J.T. Survey of numerical methods for trajectory optimization. J. Guid. Control Dyn. 1998, 21, 193–207. [Google Scholar] [CrossRef] [Green Version]
  7. Von Stryk, O.; Bulirsch, R. Direct and indirect methods for trajectory optimization. Ann. Oper. Res. 1992, 37, 357–373. [Google Scholar] [CrossRef]
  8. LaValle, S.M. Planning Algorithms; Cambridge University Press: Cambridge, UK, 2006. [Google Scholar]
  9. Cichella, V. Cooperative Autonomous Systems: Motion Planning and Coordinated Tracking Control for Multi-Vehicle Missions. Ph.D. Thesis, University of Illinois at Urbana-Champaign, Champaign, IL, USA, 2018. [Google Scholar]
  10. Cichella, V.; Kaminer, I.; Dobrokhodov, V.; Xargay, E.; Choe, R.; Hovakimyan, N.; Aguiar, A.P.; Pascoal, A.M. Cooperative Path-Following of Multiple Multirotors over Time-Varying Networks. IEEE Trans. Autom. Sci. Eng. 2015, 12, 945–957. [Google Scholar] [CrossRef]
  11. Sun, X.; Cassandras, C.G. Optimal dynamic formation control of multi-agent systems in constrained environments. Automatica 2016, 73, 169–179. [Google Scholar] [CrossRef] [Green Version]
  12. Walton, C.; Kaminer, I.; Gong, Q.; Clark, A.H.; Tsatsanifos, T. Defense against adversarial swarms with parameter uncertainty. Sensors 2022, 22, 4773. [Google Scholar] [CrossRef]
  13. Rao, A.V. A survey of numerical methods for optimal control. Adv. Astronaut. Sci. 2009, 135, 497–528. [Google Scholar]
  14. Betts, J.T. Practical Methods for Optimal Control and Estimation Using Nonlinear Programming; SIAM: Philadelphia, PA, USA, 2010. [Google Scholar]
  15. Conway, B.A. A survey of methods available for the numerical optimization of continuous dynamic systems. J. Optim. Theory Appl. 2012, 152, 271–306. [Google Scholar] [CrossRef]
  16. Becerra, V.M. Solving complex optimal control problems at no cost with PSOPT. In Proceedings of the 2010 IEEE International Symposium on Computer-Aided Control System Design, Yokohama, Japan, 8–10 September 2010; pp. 1391–1396. [Google Scholar]
  17. Febbo, H.; Jayakumar, P.; Stein, J.L.; Ersal, T. NLOptControl: A modeling language for solving optimal control problems. arXiv 2020, arXiv:2003.00142. [Google Scholar]
  18. Patterson, M.A.; Rao, A.V. GPOPS-II: A MATLAB software for solving multiple-phase optimal control problems using hp-adaptive Gaussian quadrature collocation methods and sparse nonlinear programming. ACM Trans. Math. Softw. (TOMS) 2014, 41, 1–37. [Google Scholar] [CrossRef] [Green Version]
  19. Rutquist, P.E.; Edvall, M.M. Propt—Matlab Optimal Control Software; Tomlab Optimization Inc.: Washington, DC, USA, 2010. [Google Scholar]
  20. Ross, I.M. Enhancements to the DIDO Optimal Control Toolbox. arXiv 2020, arXiv:2004.13112. [Google Scholar]
  21. Andersson, J.A.; Gillis, J.; Horn, G.; Rawlings, J.B.; Diehl, M. CasADi: A software framework for nonlinear optimization and optimal control. Math. Program. Comput. 2019, 11, 1–36. [Google Scholar] [CrossRef]
  22. Fahroo, F.; Ross, I.M. On discrete-time optimality conditions for pseudospectral methods. In Proceedings of the AIAA/AAS Astrodynamics Specialist Conference and Exhibit, Keystone, CO, USA, 21–24 August 2006; p. 6304. [Google Scholar]
  23. Bollino, K.; Lewis, L.R.; Sekhavat, P.; Ross, I.M. Pseudospectral optimal control: A clear road for autonomous intelligent path planning. In Proceedings of the AIAA Infotech@ Aerospace 2007 Conference and Exhibit, Rohnert Park, CA, USA, 7–10 May 2007; p. 2831. [Google Scholar]
  24. Gong, Q.; Lewis, R.; Ross, M. Pseudospectral motion planning for autonomous vehicles. J. Guid. Control. Dyn. 2009, 32, 1039–1045. [Google Scholar] [CrossRef]
  25. Bedrossian, N.S.; Bhatt, S.; Kang, W.; Ross, I.M. Zero-propellant maneuver guidance. IEEE Control. Syst. 2009, 29. [Google Scholar]
  26. Bollino, K.; Lewis, L.R. Collision-free multi-UAV optimal path planning and cooperative control for tactical applications. In Proceedings of the AIAA Guidance, Navigation and Control Conference and Exhibit, Honolulu, HI, USA, 18–21 August 2008; p. 7134. [Google Scholar]
  27. Bedrossian, N.; Bhatt, S.; Lammers, M.; Nguyen, L.; Zhang, Y. First Ever Flight Demonstration of Zero Propellant Maneuver (TM) Attitude Control Concept. In Proceedings of the AIAA Guidance, Navigation and Control Conference and Exhibit, Hilton Head, SC, USA, 20–23 August 2007; p. 76734. [Google Scholar]
  28. Ross, I.M.; Karpenko, M. A review of pseudospectral optimal control: From theory to flight. Annu. Rev. Control. 2012, 36, 182–197. [Google Scholar] [CrossRef]
  29. Polak, E. Optimization: Algorithms and Consistent Approximations; Springer: Berlin, Germany, 1997. [Google Scholar]
  30. Fahroo, F.; Ross, I.M. Costate estimation by a Legendre pseudospectral method. J. Guid. Control Dyn. 2001, 24, 270–277. [Google Scholar] [CrossRef]
  31. Darby, C.L.; Garg, D.; Rao, A.V. Costate estimation using multiple-interval pseudospectral methods. J. Spacecr. Rocket. 2011, 48, 856–866. [Google Scholar] [CrossRef]
  32. Hager, W.W. Runge-Kutta methods in optimal control and the transformed adjoint system. Numer. Math. 2000, 87, 247–282. [Google Scholar] [CrossRef]
  33. Grimm, W.; Markl, A. Adjoint estimation from a direct multiple shooting method. J. Optim. Theory Appl. 1997, 92, 263–283. [Google Scholar] [CrossRef]
  34. Cichella, V.; Kaminer, I.; Walton, C.; Hovakimyan, N. Optimal Motion Planning for Differentially Flat Systems Using Bernstein Approximation. IEEE Control Syst. Lett. 2018, 2, 181–186. [Google Scholar] [CrossRef]
  35. Kielas-Jensen, C.; Cichella, V. BeBOT: Bernstein polynomial toolkit for trajectory generation. In Proceedings of the 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Venetian Macao, Macau, 3–8 November 2019; pp. 3288–3293. [Google Scholar]
  36. Kielas-Jensen, C.; Cichella, V.; Berry, T.; Kaminer, I.; Walton, C.; Pascoal, A. Bernstein Polynomial-Based Method for Solving Optimal Trajectory Generation Problems. Sensors 2022, 22, 1869. [Google Scholar] [CrossRef] [PubMed]
  37. Ricciardi, L.A.; Vasile, M. Direct transcription of optimal control problems with finite elements on Bernstein basis. J. Guid. Control Dyn. 2018. [Google Scholar] [CrossRef]
  38. Choe, R.; Puig-Navarro, J.; Cichella, V.; Xargay, E.; Hovakimyan, N. Cooperative Trajectory Generation Using Pythagorean Hodograph Bézier Curves. J. Guid. Control Dyn. 2016, 39, 1744–1763. [Google Scholar] [CrossRef]
  39. Ghomanjani, F.; Farahi, M.H. Optimal control of switched systems based on Bezier control points. Int. J. Intell. Syst. Appl. 2012, 4, 16. [Google Scholar] [CrossRef]
  40. Huo, M.; Yang, L.; Peng, N.; Zhao, C.; Feng, W.; Yu, Z.; Qi, N. Fast costate estimation for indirect trajectory optimization using Bezier-curve-based shaping approach. Aerosp. Sci. Technol. 2022, 126, 107582. [Google Scholar] [CrossRef]
  41. Zhao, Z.; Kumar, M. Split-bernstein approach to chance-constrained optimal control. J. Guid. Control Dyn. 2017, 40, 2782–2795. [Google Scholar] [CrossRef]
  42. Cichella, V.; Kaminer, I.; Walton, C.; Hovakimyan, N.; Pascoal, A.M. Optimal Multi-Vehicle Motion Planning using Bernstein Approximants. IEEE Trans. Autom. Control 2020. [Google Scholar] [CrossRef]
  43. Schwartz, A.; Polak, E. Consistent approximations for optimal control problems based on Runge–Kutta integration. SIAM J. Control Optim. 1996, 34, 1235–1269. [Google Scholar] [CrossRef]
  44. Bojanic, R.; Cheng, F. Rate of convergence of Bernstein polynomials for functions with derivatives of bounded variation. J. Math. Anal. Appl. 1989, 141, 136–151. [Google Scholar] [CrossRef]
  45. Popoviciu, T. Sur l’approximation des fonctions convexes d’ordre supérieur. Mathematica 1935, 10, 49–54. [Google Scholar]
  46. Sikkema, P. Der wert einiger konstanten in der theorie der approximation mit Bernstein-Polynomen. Numer. Math. 1961, 3, 107–116. [Google Scholar] [CrossRef]
  47. Powell, M.J.D. Approximation Theory and Methods; Cambridge University Press: Cambridge, UK, 1981. [Google Scholar]
  48. Floater, M.S. On the convergence of derivatives of Bernstein approximation. J. Approx. Theory 2005, 134, 130–135. [Google Scholar] [CrossRef]
  49. Hartl, R.F.; Sethi, S.P.; Vickson, R.G. A survey of the maximum principles for optimal control problems with state constraints. SIAM Rev. 1995, 37, 181–218. [Google Scholar] [CrossRef]
  50. Garg, D.; Patterson, M.A.; Francolin, C.; Darby, C.L.; Huntington, G.T.; Hager, W.W.; Rao, A.V. Direct trajectory optimization and costate estimation of finite-horizon and infinite-horizon optimal control problems using a Radau pseudospectral method. Comput. Optim. Appl. 2011, 49, 335–358. [Google Scholar] [CrossRef] [Green Version]
  51. Gong, Q.; Ross, I.M.; Kang, W.; Fahroo, F. Connections between the covector mapping theorem and convergence of pseudospectral methods for optimal control. Comput. Optim. Appl. 2008, 41, 307–335. [Google Scholar] [CrossRef]
  52. Singh, B.; Bhattacharya, R.; Vadali, S.R. Verification of optimality and costate estimation using Hilbert space projection. J. Guid. Control Dyn. 2009, 32, 1345–1355. [Google Scholar] [CrossRef]
  53. Kirk, D.E. Optimal Control Theory: An Introduction; Prentice-Hall: Hoboken, NJ, USA, 1970. [Google Scholar]
  54. Ogren, P.; Fiorelli, E.; Leonard, N.E. Cooperative control of mobile sensor networks: Adaptive gradient climbing in a distributed environment. IEEE Trans. Autom. Control 2004, 49, 1292–1302. [Google Scholar] [CrossRef] [Green Version]
  55. Washburn, A.; Kress, M. Combat Modeling; Springer: Berlin/Heidelberg, Germany, 2009. [Google Scholar]
  56. Walton, C.; Lambrianides, P.; Kaminer, I.; Royset, J.; Gong, Q. Optimal motion planning in rapid-fire combat situations with attacker uncertainty. Naval Res. Logist. 2018, 65, 101–119. [Google Scholar] [CrossRef]
  57. Cichella, V.; Kaminer, I.; Walton, C.; Hovakimyan, N.; Pascoal, A. Bernstein approximation of optimal control problems. arXiv 2018, arXiv:1812.06132. [Google Scholar]
  58. Cichella, V.; Kaminer, I.; Walton, C.; Hovakimyan, N.; Pascoal, A.M. Consistent approximation of optimal control problems using Bernstein polynomials. In Proceedings of the 2019 IEEE 58th Conference on Decision and Control (CDC), Nice, France, 11–13 December 2019; pp. 4292–4297. [Google Scholar]
Figure 1. Diagram of the covector mapping principle for Bernstein approximation. The solution to Problem $P_N^{\lambda,\mathrm{clos}}$ converges to that of Problem $P^{\lambda}$ as $N \to \infty$.
Figure 2. Example 1: state trajectories $x_1(t)$ and $x_2(t)$, and control input $u(t)$. The control input approximates the optimal (bang-bang) solution.
Figure 3. Example 1: the approximated Hamiltonian converges to the Hamiltonian of the problem, i.e., −1. See Theorem 3 and Remark 5.
Figure 4. Example 2: 3D position plot, i.e., $p = [x_1, x_2, x_3]$. The solid line represents the solution obtained with $N = 45$, while the dashed line depicts the (near optimal) solution obtained with $N = 250$.
Figure 5. Example 2: control inputs, i.e., the vehicle's acceleration along the three axes. The solid lines represent the solution obtained with $N = 45$, while the dashed lines depict the (near optimal) solution obtained with $N = 250$.
Figure 6. Example 2: the approximated Hamiltonian converges to the Hamiltonian of the problem, i.e., −1. See Theorem 3 and Remark 5.
Figure 7. Defense against swarm attack. The plot shows the optimal trajectory of one defender (purple) protecting a high-value unit (positioned at the origin) against five attackers.
Figure 8. Defense against swarm attack. The plot shows the time history of the control input.
Figure 9. Hamiltonian convergence. The plot shows a sequence of the Hamiltonian approximations $H_N$, $N = 5, \ldots, 45$, indicating that the numerical solution converges to the true optimal solution.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
