1. Introduction
Robotic systems generally have nonlinear dynamics and are subject to model uncertainties and disturbances. Moreover, many robotic systems are underactuated, i.e., they have fewer independent control inputs than degrees of freedom; examples include fixed-wing aircraft, quadrotors and dynamic walking robots. The design of tracking controllers for underactuated robotic systems is much more challenging than for fully-actuated systems. Recently, the concept of a control contraction metric (CCM) was introduced in [1] to synthesize trajectory tracking controllers for general nonlinear systems, including underactuated ones. The CCM extends contraction theory [2] from analysis to constructive control design; contraction theory analyzes nonlinear systems in a differential framework by studying the convergence of pairs of state trajectories toward each other. It was shown in [3] that CCM reduces to conventional sliding and energy-based designs for fully-actuated systems. For underactuated systems, compared to prior approaches based on local linearization [4], the CCM approach leads to a convex optimization problem for controller synthesis and generates controllers that stabilize every feasible trajectory in a region, instead of just a single target trajectory that must be known a priori [3].
Control design methods to deal with dynamic uncertainties in the deterministic setting can be roughly classified into adaptive and robust approaches. Robust approaches, such as $\mathcal{H}_\infty$ control [5], $\mu$-synthesis [6] and robust/tube model predictive control (MPC) [7,8], usually consider parametric uncertainties or bounded disturbances and aim to find controllers with performance guarantees for the worst case of such uncertainties. The consideration of worst-case scenarios often leads to conservative nominal performance. Disturbance-observer (DOB)-based control and related methods, such as active disturbance rejection control (ADRC) [9], lump all uncertainties, which may include parametric uncertainties, unmodeled dynamics and external disturbances, together as a "total disturbance", estimate it via an observer and then compute control actions to compensate for the estimated disturbance [10] in order to recover the nominal performance. However, for state-dependent uncertainties, DOB-based control methods usually ignore the dependence of the "disturbance" on system states and rely on assumptions on the derivative of the "disturbance" that are difficult to verify for theoretical guarantees [10,11]. Alternatively, adaptive control methods such as model reference adaptive control (MRAC) [12] usually need a parametric structure for the uncertainties, rely on online estimation of the parameters for control law construction and, in most cases, provide only asymptotic performance guarantees. One of the exceptions is $\mathcal{L}_1$ adaptive control [13], which does not need a parameterization of the uncertainties (similar to DOB-based control) and focuses on transient performance guarantees in terms of a uniformly bounded error between the ideal and uncertain systems.
Both robust and adaptive control approaches have been explored in the context of CCM-based control in the presence of uncertainties and disturbances. In particular, adaptive control was combined with CCM to handle nonlinear control-affine systems with both parametric [14] and non-parametric uncertainties [15]. The case of bounded disturbances in CCM-based control was addressed by leveraging input-to-state stability analysis [16] or robust CCM [17,18]. CCM for stochastic systems was developed in [19] to minimize the mean squared tracking error in the presence of stochastic disturbances. Closely relevant to this paper, [15] designed an $\mathcal{L}_1$ adaptive controller to augment a baseline CCM-based controller to compensate for matched nonlinear non-parametric uncertainties that can depend on both time and states. The authors of [15] proved that transient tracking performance was guaranteed in the sense that the actual state trajectory exponentially converges to a neighborhood, or a tube, around the desired one. Compared to [15], our approach relies on a disturbance observer that yields an estimation error bound and a robust Riemannian energy condition, and it ensures that the actual state trajectory exponentially converges to the nominal one.
Statement of Contributions: We present a tracking controller, based on contraction metrics and disturbance estimation, for nonlinear systems subject to matched uncertainties that can depend on both time and states. Our controller leverages a disturbance estimator to estimate the pointwise value of the uncertainties with a pre-computable estimation error bound. The estimated disturbance and the estimation error bound are then incorporated into a robust Riemannian energy condition to compute a control law that guarantees exponential convergence of actual state trajectories to nominal ones. We validate the efficacy of our controller on two simulation examples and demonstrate its advantages over existing controllers.
The idea presented in this paper is leveraged in [20] for safe learning of uncertain dynamics using deep neural networks. Compared to [20], this paper does not involve learning and allows the uncertainty to depend on both time and states, as opposed to the dependence on states only in [20]. Additionally, this paper includes an additional aircraft example for performance illustration and conducts extensive comparisons with existing adaptive approaches in simulations that are not available in [20].
Notations: Let $\mathbb{R}^n$, $\mathbb{R}_{\ge 0}$ and $\mathbb{R}^{m \times n}$ denote the $n$-dimensional real vector space, the set of non-negative real numbers and the set of real $m$ by $n$ matrices, respectively. $I$ and $0$ denote an identity matrix and a zero matrix of compatible dimensions, respectively; $\|\cdot\|$ denotes the 2-norm of a vector or a matrix. For a vector $y$, $y_i$ denotes its $i$th element. For a matrix-valued function $M(x)$ and a vector $y$, $\partial_y M$ denotes the directional derivative of $M$ along $y$. For symmetric matrices $P$ and $Q$, $P \succ Q$ ($P \succeq Q$) means $P - Q$ is positive definite (semidefinite). $\operatorname{sym}(A)$ is the shorthand notation of $A + A^\top$. Finally, $\ominus$ denotes the Minkowski set difference.
2. Problem Statement and Preliminaries
Consider a nonlinear control-affine system with uncertainties
$$\dot{x} = f(x) + B(x)\big(u + d(t, x)\big), \qquad x(0) = x_0, \tag{1}$$
where $x \in \mathcal{X} \subset \mathbb{R}^n$ is the state vector, $u \in \mathcal{U} \subset \mathbb{R}^m$ is the control input vector, $f : \mathbb{R}^n \to \mathbb{R}^n$ and $B : \mathbb{R}^n \to \mathbb{R}^{n \times m}$ are known and locally Lipschitz continuous functions, and $d(t, x)$ represents the matched model uncertainty that can depend on both time and states. We assume that $B(x)$ has full column rank for any $x \in \mathcal{X}$. Suppose $\mathcal{X}$ is a compact set that contains the origin, and the control constraint set is defined as $\mathcal{U} \triangleq \{u \in \mathbb{R}^m : \underline{u} \le u \le \overline{u}\}$, where $\underline{u}$ and $\overline{u}$ denote the lower and upper bounds of all control channels, respectively. Furthermore, we make the following assumptions on the uncertainty $d(t, x)$.
Assumption 1. There exist known positive constants $L_t$, $L_x$ and $b_d$ such that for any $t, t_1, t_2 \ge 0$ and $x, y \in \mathcal{X}$, the following inequalities hold:
$$\|d(t_1, x) - d(t_2, x)\| \le L_t |t_1 - t_2|, \tag{2}$$
$$\|d(t, x) - d(t, y)\| \le L_x \|x - y\|, \tag{3}$$
$$\|d(t, x)\| \le b_d. \tag{4}$$
Remark 1. Assumption 1 indicates that the uncertain function $d(t, x)$ is locally Lipschitz in both $t$ and $x$ with known Lipschitz constants and is uniformly bounded by a known constant in the compact set $\mathcal{X}$.
In fact, given the local Lipschitz constants $L_t$ and $L_x$, a uniform bound on $d(t, x)$ in $\mathcal{X}$ can always be derived from the Lipschitz continuity properties if a bound on $d(t, x_0)$ for an arbitrary $x_0$ in $\mathcal{X}$ and any $t \ge 0$ is known. For instance, assuming $\|d(t, x_0)\| \le b_0$, from (3), we have $\|d(t, x)\| \le b_0 + L_x \max_{x \in \mathcal{X}} \|x - x_0\|$ for any $x \in \mathcal{X}$ and $t \ge 0$. In practice, some prior knowledge about the actual system and the uncertainty may be leveraged to obtain a tighter bound than the one based on Lipschitz continuity, which is why we directly make an assumption on the uniform bound. With Assumption 1, we will show (in Section 3.3) that the pointwise value of $d(t, x)$ at any time $t$ can be estimated with a pre-computable estimation error bound.
For the system in (1), assume we have a nominal state and input trajectory, $x^\star(t)$ and $u^\star(t)$, which satisfy the nominal, i.e., uncertainty-free, dynamics:
$$\dot{x}^\star = f(x^\star) + B(x^\star)\, u^\star. \tag{5}$$
We would like to design a state-feedback controller such that the actual state trajectory $x(t)$ exponentially converges to the nominal one $x^\star(t)$. Our solution is based on CCM and disturbance estimation. Next, we briefly review CCM for uncertainty-free systems.
Control Contraction Metrics (CCMs)
We first introduce some notations related to Riemannian geometry, most of which are from [1]. A Riemannian metric on $\mathbb{R}^n$ is a symmetric positive-definite matrix function $M(x)$, smooth in $x$, which defines a "local Euclidean" structure for any two tangent vectors $\delta_1$ and $\delta_2$ through the inner product $\langle \delta_1, \delta_2 \rangle_x \triangleq \delta_1^\top M(x)\, \delta_2$ and the norm $\|\delta_1\|_x \triangleq \sqrt{\langle \delta_1, \delta_1 \rangle_x}$. A metric is called uniformly bounded if $\underline{m} I \preceq M(x) \preceq \overline{m} I$ holds for all $x$ and for some scalars $\overline{m} \ge \underline{m} > 0$. Let $\Gamma(a, b)$ be the set of smooth paths connecting two points $a$ and $b$ in $\mathbb{R}^n$, where each $\gamma \in \Gamma(a, b)$ is a piecewise smooth mapping, $\gamma : [0, 1] \to \mathbb{R}^n$, satisfying $\gamma(0) = a$ and $\gamma(1) = b$. We use the notation $\gamma_s(s) \triangleq \frac{\partial \gamma}{\partial s}(s)$. Given a metric $M(x)$ and a curve $\gamma$, we define the Riemannian energy of $\gamma$ as $E(\gamma) \triangleq \int_0^1 \gamma_s(s)^\top M(\gamma(s))\, \gamma_s(s)\, ds$. The Riemannian energy between $a$ and $b$ is defined as $E(a, b) \triangleq \inf_{\gamma \in \Gamma(a, b)} E(\gamma)$.
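To make the energy definition concrete, the following numerical sketch (ours, not from [1]) approximates $E(\gamma)$ on a uniformly discretized path via midpoint quadrature; the metric and the path are illustrative placeholders:

```python
import numpy as np

def riemannian_energy(path, metric):
    """Approximate E(gamma) = int_0^1 gamma_s^T M(gamma) gamma_s ds for a path
    given as an (N+1) x n array of samples on a uniform grid over [0, 1]."""
    N = path.shape[0] - 1
    ds = 1.0 / N
    E = 0.0
    for k in range(N):
        gs = (path[k + 1] - path[k]) / ds      # finite-difference gamma_s
        xm = 0.5 * (path[k] + path[k + 1])     # midpoint of the segment
        E += gs @ metric(xm) @ gs * ds         # midpoint quadrature
    return E

# Example: with M(x) = I, the energy of the straight line from a to b
# equals ||b - a||^2 (the squared Euclidean distance).
a, b = np.zeros(2), np.array([1.0, 2.0])
line = np.linspace(a, b, 21)
print(riemannian_energy(line, lambda x: np.eye(2)))  # ~5.0 = ||b - a||^2
```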
Contraction theory [2] draws conclusions on the convergence between pairs of state trajectories toward each other by studying the evolution of the distance between any two infinitesimally close neighbouring trajectories. CCM generalizes contraction analysis to the controlled dynamics setting, in which the analysis jointly searches for a controller and a metric that describes the contraction properties of the resulting closed-loop system. Following [1,14], we now briefly review CCMs by considering the nominal, i.e., uncertainty-free, system:
$$\dot{x} = f(x) + B(x)\, u, \tag{7}$$
where $x \in \mathbb{R}^n$ and $u \in \mathbb{R}^m$. The differential form of (7) is given by
$$\dot{\delta}_x = A(x, u)\, \delta_x + B(x)\, \delta_u,$$
where $A(x, u) \triangleq \frac{\partial f}{\partial x} + \sum_{i=1}^{m} u_i \frac{\partial b_i}{\partial x}$ with $b_i$ denoting the $i$th column of $B(x)$. Consider a function $V(x, \delta_x) = \delta_x^\top M(x)\, \delta_x$ for some positive-definite metric $M(x)$, which can be viewed as the Riemannian squared differential length at point $x$. Differentiating $V$ and imposing that the squared length decreases exponentially with rate $2\lambda$, one obtains
$$\delta_x^\top \big( \partial_{f + Bu} M + \operatorname{sym}\!\big( M A(x, u) \big) + 2\lambda M \big)\, \delta_x + 2\, \delta_x^\top M B\, \delta_u \le 0, \tag{8}$$
where $\lambda > 0$ is the contraction rate. We first recall some basic results related to CCM.
Definition 1 ([1]). The system (7) is said to be universally exponentially stabilizable if, for any feasible desired trajectory $(x^\star(t), u^\star(t))$ and any initial condition $x(0)$, a feedback controller can be constructed such that a unique solution to (7) exists and satisfies
$$\|x(t) - x^\star(t)\| \le R\, e^{-\lambda t}\, \|x(0) - x^\star(0)\|,$$
where $\lambda$ and $R$ are the convergence rate and overshoot, respectively, independent of the initial conditions.

Lemma 1 ([1]). If there exists a uniformly bounded metric $M(x)$, i.e., $\underline{m} I \preceq M(x) \preceq \overline{m} I$ for some positive constants $\underline{m}$ and $\overline{m}$, such that for all $x$ and all $\delta_x \ne 0$ satisfying $\delta_x^\top M(x) B(x) = 0$,
$$\delta_x^\top \big( \partial_f M + \operatorname{sym}\!\big( M \tfrac{\partial f}{\partial x} \big) + 2\lambda M \big)\, \delta_x < 0, \tag{9a}$$
$$\delta_x^\top \big( \partial_{b_i} M + \operatorname{sym}\!\big( M \tfrac{\partial b_i}{\partial x} \big) \big)\, \delta_x = 0, \quad i = 1, \dots, m, \tag{9b}$$
hold, then the system (7) is universally exponentially stabilizable in the sense of Definition 1 via continuous feedback defined almost everywhere, and everywhere in a neighborhood of the target trajectory, with convergence rate $\lambda$ and overshoot $R = \sqrt{\overline{m}/\underline{m}}$.

The condition (9) ensures that the dynamics orthogonal to the input are contracting, i.e., (8) holds whenever $\delta_x^\top M B = 0$, and is often termed the strong CCM condition [1]. In particular, the condition (9b) can be satisfied by enforcing that each column of $B(x)$ forms a Killing vector field for the metric $M(x)$, i.e., $\partial_{b_i} M + \operatorname{sym}\!\big( M \tfrac{\partial b_i}{\partial x} \big) = 0$ for all $i = 1, \dots, m$.
The CCM condition (9) can be transformed into a convex constructive condition for the metric $M(x)$ by a change of variables. Let $W(x) \triangleq M(x)^{-1}$ (commonly referred to as the dual metric), and let $B_\perp(x)$ be a matrix whose columns span the null space of the input matrix $B$ (i.e., $B^\top B_\perp = 0$). Then, condition (9) can be cast as convex constructive conditions for $W$:
$$B_\perp^\top \big( -\partial_f W + \operatorname{sym}\!\big( \tfrac{\partial f}{\partial x} W \big) + 2\lambda W \big)\, B_\perp \prec 0, \tag{10a}$$
$$B_\perp^\top \big( \partial_{b_i} W - \operatorname{sym}\!\big( \tfrac{\partial b_i}{\partial x} W \big) \big)\, B_\perp = 0, \quad i = 1, \dots, m. \tag{10b}$$
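In practice, (10) is typically enforced by sum-of-squares or LMI programming over a parameterized $W(x)$. As a lightweight complement, the condition can be spot-checked on sampled states. The sketch below checks (10a) at a single state by finite differences for a hypothetical system $(f, B)$ with a constant dual metric $W$; it illustrates the inequality and is not the synthesis procedure of [1]:

```python
import numpy as np
from scipy.linalg import null_space

def check_dual_ccm(f, B, W, x, lam, eps=1e-6):
    """Spot-check condition (10a) at state x: the matrix
    B_perp^T (-d_f W + sym(df/dx W) + 2 lam W) B_perp must be negative definite.
    W is a callable x -> n x n dual metric; derivatives via finite differences."""
    n = x.size
    dfdx = np.column_stack([(f(x + eps * e) - f(x - eps * e)) / (2 * eps)
                            for e in np.eye(n)])
    fx = f(x)  # directional derivative of W along f: d_f W = sum_i f_i dW/dx_i
    dW_f = sum(fx[i] * (W(x + eps * np.eye(n)[i]) - W(x - eps * np.eye(n)[i]))
               / (2 * eps) for i in range(n))
    Wx = W(x)
    G = -dW_f + dfdx @ Wx + Wx @ dfdx.T + 2 * lam * Wx
    Bp = null_space(B(x).T)          # columns span the null space of B(x)^T
    H = Bp.T @ G @ Bp
    return np.max(np.linalg.eigvalsh(0.5 * (H + H.T))) < 0

# Demo with a hypothetical stable linear system and W = I (our assumptions):
f = lambda x: np.array([-x[0] + x[1], -x[1]])
B = lambda x: np.array([[0.0], [1.0]])
print(check_dual_ccm(f, B, lambda x: np.eye(2), np.zeros(2), lam=0.1))  # True
```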
The existence of a contraction metric $M(x)$ is sufficient for stabilizability via Lemma 1. What remains is constructing a feedback controller that achieves the universal exponential stabilizability (UES). As mentioned in [1,16], one way to derive the controller is to interpret the Riemannian energy, $E(x^\star(t), x(t))$, as an incremental control Lyapunov function and use it to construct a min-norm controller that renders
$$\dot{E}(x^\star, x) \le -2\lambda E(x^\star, x) \tag{11}$$
for any time $t$. Specifically, at any time $t$, given the metric $M(x)$ and a desired/actual state pair $(x^\star(t), x(t))$, a minimum-energy path, i.e., a geodesic, $\gamma$ connecting these two states (i.e., $\gamma(0) = x^\star(t)$ and $\gamma(1) = x(t)$) can be computed (e.g., using the pseudospectral method in [21] to solve a nonlinear programming problem). Consequently, the Riemannian energy of the geodesic, $E(x^\star, x) = \int_0^1 \gamma_s^\top M(\gamma)\, \gamma_s\, ds$, where $\gamma_s \triangleq \frac{\partial \gamma}{\partial s}$, can be calculated. As noted in [16], from the formula for the first variation of energy [22], $\dot{E}(x^\star, x) = 2\gamma_s(1)^\top M(x)\, \dot{x} - 2\gamma_s(0)^\top M(x^\star)\, \dot{x}^\star$. Therefore, (11) can be rewritten as
$$2\gamma_s(1)^\top M(x)\big( f(x) + B(x) u \big) - 2\gamma_s(0)^\top M(x^\star)\big( f(x^\star) + B(x^\star) u^\star \big) \le -2\lambda E(x^\star, x), \tag{12}$$
where $\dot{x}$ and $\dot{x}^\star$ have been replaced by the dynamics (7) and (5), respectively. The control signal with a minimum norm satisfying (12) can then be obtained by solving the following quadratic programming (QP) problem:
$$u(t) = \operatorname*{arg\,min}_{u \in \mathbb{R}^m} \|u\|^2 \quad \text{subject to (12)} \tag{13}$$
at each time $t$, which is guaranteed to be feasible under condition (9) [1]. The minimization problem (13) is often termed the pointwise min-norm control problem and has an analytic solution [23].
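For illustration, the analytic solution of a pointwise min-norm problem with a single affine constraint has the familiar clipped form [23]. The sketch below solves $\min_u \|u\|^2$ subject to $\phi_0 + \phi_1^\top u \le 0$, which is the structure of (13) (and of (24) later) once the energy condition is written in terms of the geodesic; $\phi_0, \phi_1$ are our shorthand, and the exact objective convention should be taken from (13):

```python
import numpy as np

def min_norm_control(phi0, phi1):
    """Analytic solution of  min_u ||u||^2  s.t.  phi0 + phi1^T u <= 0.
    If the constraint already holds at u = 0, no control effort is needed;
    otherwise, project onto the constraint boundary along phi1."""
    if phi0 <= 0.0:
        return np.zeros_like(phi1)
    return -(phi0 / (phi1 @ phi1)) * phi1

# Example: phi0 = 1 > 0 forces a nonzero u onto the hyperplane phi0 + phi1^T u = 0.
u = min_norm_control(1.0, np.array([1.0, 1.0]))
print(u, 1.0 + np.array([1.0, 1.0]) @ u)  # [-0.5 -0.5] 0.0
```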
The above discussions can be summarized in the following theorem. The proof follows directly from Lemma 1 and the subsequent discussions and is thus omitted.
Theorem 1 ([1]). Given a nominal system (7), assume that there exists a uniformly bounded metric $M(x)$ that satisfies (10) for all $x$. Then, the control law constructed by solving (13) universally exponentially stabilizes the system (7) in the sense of Definition 1 with $R = \sqrt{\overline{m}/\underline{m}}$, where $\underline{m}$ and $\overline{m}$ are two positive constants satisfying $\underline{m} I \preceq M(x) \preceq \overline{m} I$.

Remark 2. According to Definition 1 and Theorem 1, under the conditions of Theorem 1, given any feasible trajectory $(x^\star(t), u^\star(t))$ of (7), a controller can always be constructed to ensure that the actual state trajectory $x(t)$ exponentially converges to $x^\star(t)$.
3. Robust Trajectory Tracking Using CCM and Disturbance Estimation
In Section 2, we showed that the existence of a CCM for a nominal (i.e., uncertainty-free) system can be used to construct a feedback control law that guarantees the universal exponential stabilizability (UES) of the system. In this section, we present a controller based on CCM and disturbance estimation to ensure the UES of the uncertain system (1), whose architecture is depicted in Figure 1.
3.1. CCMs for the Actual System
To apply the contraction method to design a controller that guarantees the UES of the uncertain system (1), we first need to search for a valid CCM for it. Following Section 2, we can derive the counterparts of the strong CCM condition (9) or (10). Due to the particular structure of (1) attributed to the matched uncertainty assumption, we have the following lemma. A similar observation has been made in [14] for the case of matched parametric uncertainties. The proof is straightforward and thus omitted; one can refer to [14] for more details.

Lemma 2. The strong (dual) CCM condition for the uncertain system (1) is the same as the strong (dual) CCM condition for the nominal system, i.e., (9) and (10).

Remark 3. As a result of Lemma 2, a metric $M(x)$ (dual metric $W(x)$) satisfying the condition (9) ((10)) for the nominal system (7) is always a CCM (dual CCM) for the true system (1).
Define $\mathcal{D} \triangleq \{d \in \mathbb{R}^m : \|d\| \le b_d\}$, where $b_d$ is introduced in Assumption 1. Assumption 1 indicates $d(t, x) \in \mathcal{D}$ for any $t \ge 0$ and $x \in \mathcal{X}$. As mentioned in Section 2, given a CCM and a desired trajectory $x^\star(t)$ and $u^\star(t)$ for a nominal system, a control law can be constructed to ensure exponential convergence of the actual state trajectory $x(t)$ to the desired state trajectory $x^\star(t)$. In practice, we have access to only the nominal dynamics (5) instead of the true dynamics to plan a trajectory $x^\star(t)$ and $u^\star(t)$. The following lemma gives the condition under which $x^\star(t)$, planned using the nominal dynamics (5), is also a feasible state trajectory for the true system.
Lemma 3. Given a desired trajectory $x^\star(t)$ and $u^\star(t)$ satisfying the nominal dynamics (5) with
$$u^\star(t) \in \mathcal{U} \ominus \mathcal{D}, \quad \forall t \ge 0, \tag{14}$$
then $x^\star(t)$ is also a feasible state trajectory for the true system (1).

Proof. Define $\tilde{u}(t) \triangleq u^\star(t) - d(t, x^\star(t))$. Since $u^\star(t) \in \mathcal{U} \ominus \mathcal{D}$ and $d(t, x^\star(t)) \in \mathcal{D}$, which is due to $x^\star(t) \in \mathcal{X}$ and Assumption 1, we have $\tilde{u}(t) \in \mathcal{U}$. By comparing the dynamics in (1) and (5), we conclude that $x^\star(t)$ and $\tilde{u}(t)$ satisfy the true dynamics (1) and are thus a feasible state and input trajectory for the true system. □
Lemma 3 provides a way to verify whether a trajectory planned using the nominal dynamics is a feasible trajectory for the true system in the presence of actuator limits. In the absence of such limits, any feasible trajectory for the nominal dynamics is also a feasible trajectory for the true dynamics due to the particular structure of (1) associated with the matched uncertainty assumption.
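Under the reconstruction above, condition (14) requires the nominal input to keep a margin of at least the disturbance bound from the actuator limits. For a box $\mathcal{U}$ and a 2-norm ball $\mathcal{D}$ of radius $b_d$, the Minkowski difference shrinks every channel bound by $b_d$, which the sketch below checks along a sampled nominal input trajectory (symbol names are ours):

```python
import numpy as np

def nominal_input_feasible(u_star_traj, u_lo, u_hi, b_d):
    """Check condition (14): u*(t) in U (Minkowski-minus) D at all samples,
    where U = {u : u_lo <= u <= u_hi} (box) and D = {d : ||d|| <= b_d} (ball).
    For this geometry, U minus D is the box shrunk by b_d on every channel."""
    return np.all(u_star_traj >= u_lo + b_d) and np.all(u_star_traj <= u_hi - b_d)

# Example: a scalar-input trajectory staying 0.3 away from limits passes for b_d = 0.2.
u_traj = np.linspace(-0.7, 0.7, 100).reshape(-1, 1)
print(nominal_input_feasible(u_traj, u_lo=-1.0, u_hi=1.0, b_d=0.2))  # True
```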
3.2. Robust Riemannian Energy Condition
Section 2 shows that, given a nominal system and a CCM for such a system, a control law can be constructed by solving a QP problem (13) with a condition constraining the decrease rate of the Riemannian energy, i.e., condition (12). When considering the uncertain dynamics in (1), the condition (12) becomes
$$2\gamma_s(1)^\top M(x)\big( f(x) + B(x)(u + d(t, x)) \big) - 2\gamma_s(0)^\top M(x^\star)\, \dot{x}^\star \le -2\lambda E(x^\star, x), \tag{15}$$
where $f(x) + B(x)(u + d(t, x))$ represents the true dynamics evaluated at $(t, x, u)$, and $\dot{x}^\star$ is as defined in (5). Several observations follow immediately. First, it is clear that (15) is not implementable due to its dependence on the true uncertainty $d(t, x)$. Second, if we had access to the pointwise value of $d(t, x(t))$ at each time $t$, (15) would become implementable even without knowing the exact functional representation of $d(t, x)$. Third, if we could estimate the pointwise value of $d(t, x(t))$ at each time $t$ with a bound quantifying the estimation error, then we could derive a robust condition for (15). Specifically, assume $d(t, x(t))$ is estimated as $\hat{d}(t)$ at each time $t$ with a uniform estimation error bound (EEB) $\rho(t)$, i.e.,
$$\|\hat{d}(t) - d(t, x(t))\| \le \rho(t).$$
Then, we immediately obtain the following sufficient condition for (15):
$$2\gamma_s(1)^\top M(x)\big( f(x) + B(x)(u + \hat{d}(t)) \big) - 2\gamma_s(0)^\top M(x^\star)\, \dot{x}^\star + 2\big\| B(x)^\top M(x)\, \gamma_s(1) \big\|\, \rho(t) \le -2\lambda E(x^\star, x), \tag{16}$$
where the third term on the left-hand side bounds the worst-case effect of the estimation error, since
$$2\gamma_s(1)^\top M(x) B(x)\big( d(t, x) - \hat{d}(t) \big) \le 2\big\| B(x)^\top M(x)\, \gamma_s(1) \big\|\, \rho(t). \tag{17}$$
Moreover, since $M(x)$ satisfies the CCM condition (9), a control input $u$ that satisfies (16) is guaranteed to exist for any $x$ and $x^\star$, regardless of the size of $\rho(t)$, if the input constraint set $\mathcal{U}$ is sufficiently large. We term condition (16) the robust Riemannian energy (RRE) condition.
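Concretely, with the notation reconstructed above, the RRE condition (16) is affine in $u$ and can be assembled from the geodesic endpoints, the estimate $\hat{d}$ and the EEB. The sketch below collects it into the form $\phi_0 + \phi_1^\top u \le 0$, anticipating the $\phi_0, \phi_1$ shorthand of Section 3.4; all names are our reconstruction of (16) and (17):

```python
import numpy as np

def rre_coefficients(gs0, gs1, M_x, M_xs, f_x, B_x, xdot_star, d_hat, rho, lam, E):
    """Assemble the robust Riemannian energy condition (16) as phi0 + phi1^T u <= 0.
    gs0, gs1: geodesic speed gamma_s at s = 0 (nominal end) and s = 1 (actual end);
    M_x, M_xs: metric at the actual and nominal states; d_hat: disturbance estimate;
    rho: estimation error bound; E: Riemannian energy between x* and x."""
    phi1 = 2.0 * B_x.T @ M_x @ gs1
    phi0 = (2.0 * gs1 @ M_x @ (f_x + B_x @ d_hat)   # estimated true dynamics
            - 2.0 * gs0 @ M_xs @ xdot_star          # nominal end moves with (5)
            + np.linalg.norm(phi1) * rho            # worst case of the EEB, cf. (17)
            + 2.0 * lam * E)                        # required decrease rate
    return phi0, phi1
```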
3.3. Disturbance Estimation with a Computable EEB
We now introduce a disturbance estimation scheme to estimate the pointwise value of the uncertainty $d(t, x(t))$ with a pre-computable EEB, which can be systematically improved by tuning a parameter in the estimation law. The estimation scheme is based on the piecewise-constant estimation (PWCE) law in [24], which originated in [25]. The PWCE law consists of two elements, namely, a state predictor and a piecewise-constant update law. The state predictor is defined as
$$\dot{\hat{x}} = f(x) + B(x)\, u + \hat{\sigma}(t) - a\tilde{x}, \qquad \hat{x}(0) = \hat{x}_0, \tag{18}$$
where $\tilde{x} \triangleq \hat{x} - x$ is the prediction error, and $a$ is an arbitrary positive constant. The estimation, $\hat{\sigma}(t)$, is updated in a piecewise-constant way:
$$\hat{\sigma}(t) = \hat{\sigma}(iT), \quad t \in [iT, (i+1)T), \qquad \hat{\sigma}(iT) = -\frac{a}{e^{aT} - 1}\, \tilde{x}(iT), \tag{19}$$
where $T$ is the estimation sampling time, and $i = 0, 1, 2, \dots$. Finally, the pointwise value of $d(t, x(t))$ at time $t$ is estimated as
$$\hat{d}(t) = B^{+}(x)\, \hat{\sigma}(t), \tag{20}$$
where $B^{+}(x)$ is the pseudoinverse of $B(x)$.
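A minimal discrete-time sketch of the PWCE scheme (18)-(20) is given below. The update gain shown is one common form from the literature behind [24,25], obtained by zeroing the predictor error that would accumulate over one sampling interval if the disturbance were absent; the exact expressions should be taken from (19). The Euler integration and all names are our choices:

```python
import numpy as np

class PWCEstimator:
    """Piecewise-constant estimation (PWCE) of the matched uncertainty d(t, x).
    Predictor (18):  xhat_dot = f(x) + B(x) u + sigma_hat - a (xhat - x).
    Update (19), every T seconds: sigma_hat = -a / (exp(a T) - 1) * (xhat - x).
    Estimate (20):  d_hat = pinv(B(x)) @ sigma_hat."""

    def __init__(self, f, B, x0, a=10.0, T=0.01):
        self.f, self.B, self.a, self.T = f, B, a, T
        self.xhat = np.copy(x0)              # predictor initialized at x(0)
        self.sigma_hat = np.zeros_like(x0)
        self.t_since_update = 0.0

    def step(self, x, u, dt):
        """Advance the predictor by dt (forward Euler) and return d_hat."""
        xtilde = self.xhat - x
        self.t_since_update += dt
        if self.t_since_update >= self.T:    # piecewise-constant update instant
            self.sigma_hat = -self.a / (np.exp(self.a * self.T) - 1.0) * xtilde
            self.t_since_update = 0.0
        xhat_dot = self.f(x) + self.B(x) @ u + self.sigma_hat - self.a * xtilde
        self.xhat = self.xhat + dt * xhat_dot
        return np.linalg.pinv(self.B(x)) @ self.sigma_hat
```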
The following lemma establishes the EEB associated with the estimation scheme in (18) and (19). The proof is similar to that in [24]; for completeness, it is given in Appendix A.
Lemma 4. Given the dynamics (1) subject to Assumption 1 and the estimation law in (18) and (19), if $x(\tau) \in \mathcal{X}$ and $u(\tau) \in \mathcal{U}$ for any $\tau \in [0, t]$, the estimation error can be bounded as $\|\hat{d}(t) - d(t, x(t))\| \le \rho(t)$, where $\rho(t)$ is the pre-computable bound defined in (21), which depends on the constants $L_t$, $L_x$ and $b_d$ from Assumption 1 and on the constant $\phi$ defined in (23). Moreover, $\lim_{T \to 0} \rho(t) = 0$ for any $t \ge T$.

Remark 4. Lemma 4 implies that, theoretically, for $t \ge T$, the disturbance estimation after a single sampling interval can be made arbitrarily accurate by reducing $T$, which further indicates that the conservatism of the RRE condition can be arbitrarily reduced after a sampling interval.
In practice, the value of $T$ is subject to limitations related to computational hardware and sensor noise. Additionally, using a very small $T$ tends to introduce high-frequency components into the control loop, potentially harming the robustness of the closed-loop system, e.g., against time delays. This is similar to the use of a high adaptation rate in model reference adaptive control schemes, as discussed in [13]. Therefore, one should avoid the use of a very small $T$ for the sake of robustness, unless a low-pass filter is used to filter the estimated disturbance before it is fed into (16), as suggested by $\mathcal{L}_1$ adaptive control theory [13].
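As the text suggests, a simple first-order low-pass filter can temper high-frequency content in $\hat{d}(t)$ before it enters (16). A minimal sketch follows; the bandwidth $\omega$ is a tuning choice of ours, not a value from the paper:

```python
import numpy as np

class LowPassFilter:
    """First-order low-pass filter  d_f_dot = omega * (d_hat - d_f),
    discretized with forward Euler; smooths the PWCE output before the RRE
    condition when a small sampling time T is used."""

    def __init__(self, omega, dim):
        self.omega = omega
        self.d_f = np.zeros(dim)

    def step(self, d_hat, dt):
        self.d_f = self.d_f + dt * self.omega * (d_hat - self.d_f)
        return self.d_f
```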
Remark 5. The estimation in $[0, T)$ cannot be arbitrarily accurate. This is because the estimation in $[0, T)$ depends on $\tilde{x}(0)$ according to (19). Considering that $\tilde{x}(0)$ is purely determined by the initial state of the system, $x_0$, and the initial state of the predictor, $\hat{x}_0$, it does not contain any information about the uncertainty. Since $T$ is usually very small in practice, the lack of a tight estimation error bound for the interval $[0, T)$ will not cause an issue from a practical point of view. Additionally, the estimation of $\phi$ defined in (23) could be quite conservative. Further considering the frequent use of Lipschitz continuity and inequalities related to matrix/vector norms in deriving the bound in (21), $\rho(t)$ can be overly conservative. Therefore, for practical implementation, one should leverage some empirical study, e.g., performing simulations under a few user-selected functions of $d(t, x)$, to determine a bound for the estimation error. In our experiments, we found the theoretical bound computed according to (21) was usually at least 10 times more conservative, and could be even more so.

3.4. Exponentially Convergent Trajectory Tracking
Based on the review of contraction control in Section 2 and the discussions in Section 3.2 and Section 3.3, the control law can be obtained by solving the following QP problem at each time $t$:
$$u(t) = \operatorname*{arg\,min}_{u \in \mathbb{R}^m} \|u\|^2 \tag{24}$$
subject to the RRE condition (16), restated as
$$2\gamma_s(1)^\top M(x)\big( f(x) + B(x)(u + \hat{d}(t)) \big) - 2\gamma_s(0)^\top M(x^\star)\, \dot{x}^\star + 2\big\| B(x)^\top M(x)\, \gamma_s(1) \big\|\, \rho(t) \le -2\lambda E(x^\star, x), \tag{25}$$
where the left-hand side of (25), according to (17), depends on $\hat{d}(t)$, which comes from the disturbance estimation law defined by (18)-(20), on $\rho(t)$ as defined in (21), and on $\dot{x}^\star$ as defined in (5). Similar to (13), problem (24) is a pointwise min-norm control problem and has an analytic solution [23]. Specifically, denoting
$$\phi_1 \triangleq 2 B(x)^\top M(x)\, \gamma_s(1), \qquad \phi_0 \triangleq 2\gamma_s(1)^\top M(x)\big( f(x) + B(x)\hat{d}(t) \big) - 2\gamma_s(0)^\top M(x^\star)\, \dot{x}^\star + \|\phi_1\|\, \rho(t) + 2\lambda E(x^\star, x),$$
(25) can be written as $\phi_0 + \phi_1^\top u \le 0$, and the solution of (24) is given by
$$u(t) = \begin{cases} 0, & \text{if } \phi_0 \le 0, \\ -\dfrac{\phi_0}{\phi_1^\top \phi_1}\, \phi_1, & \text{otherwise}. \end{cases} \tag{26}$$
To move forward with the analysis, we need to verify that when $x(t) \in \mathcal{X}$, the control signal resulting from solving the QP problem (24) satisfies $u(t) \in \mathcal{U}$. Deriving verifiable conditions to ensure this set bound is outside the scope of this paper and will be addressed in future work. We are now ready to state the main result of the paper.
Theorem 2. Given an uncertain system represented by (1) satisfying Assumption 1, assume that there exists a metric $M(x)$ such that, for all $x \in \mathcal{X}$, (10) holds and $\underline{m} I \preceq M(x) \preceq \overline{m} I$ holds for positive constants $\underline{m}$ and $\overline{m}$. Furthermore, suppose that a nominal trajectory $(x^\star(t), u^\star(t))$ planned using the nominal dynamics (5) and the initial actual state $x_0$ satisfy (14) and
$$\big\{ y : \|y - x^\star(t)\| \le R\, e^{-\lambda t}\, \|x_0 - x^\star(0)\| \big\} \subseteq \operatorname{int}(\mathcal{X}) \tag{27}$$
for any $t \ge 0$, where $R \triangleq \sqrt{\overline{m}/\underline{m}}$. Then, if $u(t)$ from solving (24) satisfies $u(t) \in \mathcal{U}$ for any $t \ge 0$, the control law constructed by solving (24) ensures $x(t) \in \mathcal{X}$ for any $t \ge 0$ and, furthermore, universally exponentially stabilizes the uncertain system (1) in the sense of Definition 1 with $R = \sqrt{\overline{m}/\underline{m}}$, i.e.,
$$\|x(t) - x^\star(t)\| \le R\, e^{-\lambda t}\, \|x_0 - x^\star(0)\|, \quad \forall t \ge 0. \tag{28}$$

Proof. We use contradiction to show $x(t) \in \mathcal{X}$ for all $t \ge 0$. Assume this is not true. According to (27), $x_0$ lies in the interior of $\mathcal{X}$. Since $x(t)$ is continuous, there must exist a time $\tau > 0$ such that
$$x(t) \in \mathcal{X} \ \text{for all } t \in [0, \tau], \quad \text{and} \quad x(\tau) \in \partial\mathcal{X}. \tag{29}$$
Now let us consider the system evolution in $[0, \tau]$. Since $u(t) \in \mathcal{U}$ by assumption and $x(t) \in \mathcal{X}$ for any $t$ in $[0, \tau]$, the EEB in (21) holds in $[0, \tau]$. As a result, the control law obtained from solving (24) ensures satisfaction of the RRE condition (16), and thus satisfaction of the Riemannian energy condition (15), for the uncertain system (1), and thereby universally exponentially stabilizes the uncertain system (1) in $[0, \tau]$ in the sense of Definition 1 with $R = \sqrt{\overline{m}/\underline{m}}$, according to Theorem 1. On the other hand, satisfaction of (14) implies that $x^\star(t)$ is a feasible state trajectory for the uncertain system (1), according to Lemma 3. Further considering Theorem 1, we have $\|x(t) - x^\star(t)\| \le R\, e^{-\lambda t}\, \|x_0 - x^\star(0)\|$ for any $t$ in $[0, \tau]$. Due to (27), the preceding inequality indicates that $x(t)$ remains in the interior of $\mathcal{X}$ for $t$ in $[0, \tau]$. This, together with the continuity of $x(t)$, implies that $x(\tau)$ lies in the interior of $\mathcal{X}$, which contradicts (29). Therefore, we conclude that $x(t) \in \mathcal{X}$ for all $t \ge 0$. From the development of the proof, it is clear that with the control law given by the solution of (24), the UES of the closed-loop system in the sense of Definition 1 with $R = \sqrt{\overline{m}/\underline{m}}$ for all $t \ge 0$ is achieved, which is mathematically represented by (28). The proof is complete. □
3.5. Discussion
Theorem 2 essentially states that, under certain assumptions, the proposed controller guarantees exponential convergence of the actual state trajectory $x(t)$ to a desired one $x^\star(t)$. With the exponential guarantee, if the actual trajectory meets the desired trajectory at a certain time, then the two trajectories stay together afterward. While the exponential convergence guarantee is stronger than the performance guarantees provided by existing adaptive CCM-based approaches [14,15] that deal with similar settings (i.e., matched uncertainties), the proposed method requires knowledge of the Lipschitz bounds of the uncertainty $d(t, x)$ and of the input matrix function $B(x)$, requires the states to be in a compact set known a priori (see Assumption 1), and requires the actual control inputs to stay in a compact set known a priori, which cannot be verified at this moment due to the lack of a bound on the control inputs. These requirements are not needed in [14,15]. The approach presented here is related to robust control Lyapunov function based approaches [23], which provide robust stabilization around an equilibrium point (as opposed to a trajectory, as considered in this paper) in the presence of uncertainties.
Remark 6. The exponential convergence guarantee stated in Theorem 2 is based on a continuous-time implementation of the controller. In practice, a controller is normally implemented on a digital processor or controller with a fixed sampling time. As a result, the property of exponential convergence may be slightly violated.
Computational cost: As can be seen from Section 2 and Sections 3.2-3.4, computation of the control signal at each time $t$ includes three steps: (i) updating the estimated disturbance $\hat{d}(t)$ via (18)-(20), (ii) computing the geodesic $\gamma$ connecting the actual and nominal states (see the discussion below (11)), and (iii) computing the control signal $u(t)$ via (26). The computational costs of steps (i) and (iii) are quite low, as they involve only integration and algebraic calculation. In comparison, step (ii) has a relatively high computational cost, as it necessitates solving a nonlinear programming (NLP) problem. However, since the NLP problem does not involve dynamic constraints, it is much easier to solve than a nonlinear model predictive control (MPC) problem [21]. Following [21], such a problem can be efficiently solved by applying a pseudospectral method.
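As a rough illustration of step (ii), the geodesic computation can be posed as a small NLP over a discretized path with fixed endpoints and the Riemannian energy as the objective. Reference [21] uses a pseudospectral parameterization; the sketch below instead uses a plain uniform grid with scipy, which is simpler but less efficient:

```python
import numpy as np
from scipy.optimize import minimize

def geodesic(metric, a, b, N=10):
    """Approximate the geodesic between a and b by minimizing the discretized
    Riemannian energy over the N - 1 interior points of a uniform grid."""
    n, ds = a.size, 1.0 / N

    def energy(z):
        pts = np.vstack([a, z.reshape(N - 1, n), b])   # fixed endpoints
        E = 0.0
        for k in range(N):
            gs = (pts[k + 1] - pts[k]) / ds            # finite-difference gamma_s
            xm = 0.5 * (pts[k] + pts[k + 1])
            E += gs @ metric(xm) @ gs * ds
        return E

    z0 = np.linspace(a, b, N + 1)[1:-1].ravel()        # straight-line initial guess
    res = minimize(energy, z0, method="BFGS")
    return np.vstack([a, res.x.reshape(N - 1, n), b]), res.fun

# With a constant metric the geodesic is the straight line; E = (b-a)^T M (b-a).
M = np.diag([1.0, 4.0])
path, E = geodesic(lambda x: M, np.zeros(2), np.ones(2), N=8)
print(np.round(E, 3))  # ~5.0
```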