Combinatorial Integral Approximation Decompositions for Mixed-Integer Optimal Control

Zeile, Clemens; Weber, Tobias; Sager, Sebastian

doi:10.3390/a15040121

Open AccessArticle

Combinatorial Integral Approximation Decompositions for Mixed-Integer Optimal Control

by

Clemens Zeile

^*

,

Tobias Weber

and

Sebastian Sager

Department of Mathematics, Otto-von-Guericke-Universität Magdeburg, 39106 Magdeburg, Germany

^*

Author to whom correspondence should be addressed.

Algorithms 2022, 15(4), 121; https://doi.org/10.3390/a15040121

Submission received: 17 February 2022 / Revised: 24 March 2022 / Accepted: 30 March 2022 / Published: 31 March 2022

(This article belongs to the Special Issue Simulation-Based Optimization: Methods and Applications in Engineering Design)

Download

Browse Figures

Versions Notes

Abstract

:

Solving mixed-integer nonlinear programs (MINLPs) is hard from both a theoretical and practical perspective. Decomposing the nonlinear and the integer part is promising from a computational point of view. In general, however, no bounds on the objective value gap can be established and iterative procedures with potentially many subproblems are necessary. The situation is different for mixed-integer optimal control problems with binary variables that switch over time. Here, a priori bounds were derived for a decomposition into one continuous nonlinear control problem and one mixed-integer linear program, the combinatorial integral approximation (CIA) problem. In this article, we generalize and extend the decomposition idea. First, we derive different decompositions and analyze the implied a priori bounds. Second, we propose several strategies to recombine promising candidate solutions for the binary control functions in the original problem. We present the extensions for ordinary differential equations-constrained problems. These extensions are transferable in a straightforward way, though, to recently suggested variants for certain partial differential equations, for algebraic equations, for additional combinatorial constraints, and for discrete time problems. We implemented all algorithms and subproblems in AMPL for a proof-of-concept study. Numerical results show the improvement compared to the standard CIA decomposition with respect to objective function value and compared to general-purpose MINLP solvers with respect to runtime.

Keywords:

optimal control; switched dynamic systems; mixed-integer nonlinear programming; mixed-integer linear programming; ordinary differential equations; approximation methods

1. Introduction

1.1. General Context

The goal of optimal control is to find control functions and state trajectories that are feasible and optimal. State trajectories are solutions of systems of differential equations for given control functions and boundary conditions. Feasibility refers to constraints on control functions and differential states; optimality refers to an objective functional of controls and states. Mixed-integer optimal control problems (MIOCPs) do have additional integrality constraints on some of the control functions. Hence, the differential equations of the underlying system depend on the value of integer control functions, or equivalently, one can switch instantly between several differential equations [1,2,3,4]. This problem class is ubiquitous in various application areas, e.g., water, gas, traffic, and supply chain networks [5,6,7,8,9,10,11], distributed autonomous systems [12], processes in chemical engineering that involve valves [13,14], cardiac assist devices [15], or the choice of gears in automotive control [16,17]. We choose problems from a web-based MIOCP benchmark collection [18] to evaluate our algorithms.

Several families of methods have been proposed for solving MIOCPs. This comprehends indirect or first-optimize-then-discretize approaches [19], dynamic programming or Hamilton–Jacobi–Bellman equations [20], switching time optimization [16,21,22,23,24,25], and direct or first-discretize-then-optimize approaches [3,26,27]. Surveys, references, and comparisons can be found in, e.g., [19,28,29,30].

1.2. Motivation

Our approach is based on previous ideas [27,31,32] to decompose the MIOCP into a relaxed continuous optimal control problem (OCP) and a mixed-integer linear program (MILP). In contrast to general MINLP decompositions (see [33] for recent results on error bounds for rounding approaches and [34] for a MINLP survey), the particular setting with dependent (states) and independent (controls) variables allows the derivation of a priori bounds [35]. As specified in Section 4, the difference between state trajectories

x (\cdot)

and

y (\cdot)

, which are the unique solutions of the initial value problems for given control functions

ω

and

α

, respectively, is bounded for a constant C and all

t \in T

by:

\begin{matrix} ||x (t) - y (t)|| \leq C max_{t \in T} ||\int_{t_{0}}^{t} α (τ) - ω (τ) d τ|| . \end{matrix}

(1)

This motivates us to solve the relaxed problem (OCP) to obtain

α (\cdot)

, to calculate an integer control function

ω

in a second step such that the maximum on the right-hand side of (1) is as small as possible, and to obtain

y (\cdot)

by solving the initial value problem (IVP). The continuity of constraint and objective functions implies “good” behavior with respect to feasibility and objective function value in the original MIOCP. The combinatorial integral approximation (CIA) problem in the second step can be formulated as an MILP [32] and can be solved either with generic MILP methods, with specifically tailored branch-and-bound methods [32], and in some cases even in linear time, with the sum-up rounding (SUR) method [27].

1.3. Review of the State of the Art

The idea of CIA decomposition has also been used in the context of hyperbolic partial differential equations [36,37], differential-algebraic equation systems [19], with additional combinatorial constraints [10,32], and based on discrete time formulations [10]. Recently, algorithms to solve CIA problems have been implemented into the software package pycombina [38].

In this publication, we generalize all of these approaches by presenting alternative ways to calculate

ω

based on

α

, using different MILPs. We formulate them for the case of initial value problems with ordinary differential equations, although they are applicable to all mentioned variants in a straightforward way.

Our results are also independent of the method that is applied to solve the relaxed problem (OCP). For the numerical results, we are going to use a first-discretize-then-optimize approach with Radau collocation. First-discretize refers (a) to approximating the control functions with parameterized basis functions, such as finitely many piecewise constant functions, and (b) to relaxing path and control constraints from the domain of a time horizon to finitely many time points. Then-optimize refers to solving the resulting finite dimensional optimization problem numerically to optimality. An overview of direct methods for continuous optimal control problems can be found in, e.g., [39,40]. For comparison, we also apply this approach directly to the MIOCP and solve the resulting mixed-integer nonlinear program (MINLP) with the general purpose solver Bonmin.

For a general problem class of MIOCPs, the integer gap depends linearly on the control discretization [41]. For many applications, this control discretization is not fixed, but can be refined. Thus, the integer gap can be driven to zero (usually at the expense of frequent switching). For practical reasons, good or even optimal solutions for a fixed control discretization are of interest, which is our main focus.

Recently, the MIOCP class has been increasingly studied in the context of combinatorial constraints that couple over time. For example, as part of the CIA decomposition, the switching-cost aware rounding problem (SCARP) has been proposed to solve it [42,43]. In addition, the CIA decomposition has been studied in the context of multibang and total variation regularization [44,45]. Switching constraints have been also investigated for mixed-integer partial differential equation optimal control problems [46,47]. Switching costs can also be included into the switching time optimization approach based on cardinality constraints [48].

1.4. Contributions

We derive different versions of the CIA problem, leading to multiple MILPs. Noting that the computational effort to solve one MILP is usually small compared to the solution of the relaxed nonlinear problem, and even smaller compared to the deterministic solution of the original MIOCP, we propose solving several approximation problems. These solutions are candidate solutions themselves, and can be recombined into new switching sequences. We derive theoretical a priori bounds for them, discuss computational (dis)advantages, and show numerically the improvement to existing decomposition approaches with respect to objective function value, and to general-purpose MINLP solvers with respect to runtime.

1.5. Outline of the Article

We define the considered MIOCP class and propose a general decomposition framework in Section 2. The algorithm consists of several MILP formulations, which are discussed in Section 3. We discuss theoretical properties in Section 4. In Section 5, we look at strategies that combine and improve existing binary control functions. In Section 6, we provide and discuss numerical results for the MIOCP benchmark library [18]. Finally, we conclude the article in Section 7, where we also summarize our findings.

2. Problem Class, Definitions, and Main Algorithm

We denote the considered time horizon by

T : = [t_{0}, t_{f}] \subset R

, and write “for a.a.

t \in T

” for all

t \in T

, except on a set of measure zero. Let

[n] : = {1, \dots, n}, {[n]}_{0} : = 0 \cup [n],

for

n \in N

. The null vector is written as

o

. We are interested in the following class of mixed-integer optimal control problems.

Definition 1.

(MIOCP) We refer to the following control problem (2) as (MIOCP).

\begin{matrix} inf_{x, ω} & Φ (x (t_{f})) \end{matrix}

(2a)

\begin{matrix} s . t . & \dot{x} (t) = f_{0} (t, x (t)) + \sum_{i = 1}^{n_{ω}} ω_{i} (t) f_{i} (t, x (t)), & f o r a . a . t \in T, \end{matrix}

(2b)

\begin{matrix} x (t_{0}) = x_{0}, \end{matrix}

(2c)

\begin{matrix} 1 = \sum_{i = 1}^{n_{ω}} ω_{i} (t) & f o r a . a . t \in T, \end{matrix}

(2d)

\begin{matrix} ω (t) \in {0, 1}^{n_{ω}} & f o r a . a . t \in T, \end{matrix}

(2e)

\begin{matrix} o \leq c (t, x (t)) & f o r a . a . t \in T . \end{matrix}

(2f)

We minimize a Mayer term

Φ \in C^{1} (R^{n_{x}}, R)

over differential states

x \in W^{1, \infty} (T, R^{n_{x}})

for binary control functions

ω \in L^{\infty} (T, {0, 1}^{n_{ω}})

. The system of ordinary differential equations (ODE), (2b), is written using partial outer convexification to model the switched system, using a one-hot encoding (1hot) constraint (2d), a drift term

f_{0}

, and

n_{ω} \geq 2

functions

f_{i}

, ref. [27], both out of

C^{0} (R^{n_{x} + 1}, R^{n_{x}})

. We assume fixed initial values

x_{0} \in R^{n_{x}}

for the differential states. The functions

c \in C^{1} (R^{n_{x}}, R^{n_{c}})

model nonlinear state inequalities.

In the following, we assume that (MIOCP) has an optimal solution, so that we can write “min” instead of “inf”. We refer to [19,29,49] for a discussion of the generality of (MIOCP) and extensions to cope with, e.g., Lagrange functionals, boundary and multi-point constraints, vanishing constraints, free final time, control values, and the like. Particularly interesting are continuous control functions

u \in L^{\infty} (T, U)

that often enter (2b) and (2f) in practical applications. From a theoretical point of view, in the interest of comparability, and for computational speed, it is convenient to consider the continuous controls

u (\cdot)

as fixed to the solution that was obtained by solving the continuous relaxation of (MIOCP) in our approach. It is also possible, though, and often makes sense in practice, to improve the objective function value by reoptimizing

u

when (MIOCP) is evaluated for fixed

ω

. For notational convenience and without loss of generality, we omit the continuous controls

u

in the following. An evaluation of (MIOCP) for fixed

ω

is then the solution of an initial value problem. We stress that there are only

n_{ω}

possible solutions for the integer part of the problem due to the constraint (2d).

Our algorithm solves continuous relaxations of (MIOCP), defined as follows.

Definition 2.

(OCP) We define (OCP) as the canonical relaxation of (MIOCP) with respect to (2e), where we substitute

ω \in L^{\infty} (T, {0, 1}^{n_{ω}})

for

α \in L^{\infty} (T, {[0, 1]}^{n_{ω}})

.

The problem (OCP) in function space can be solved by different approaches, as mentioned above. For using MILPs to approximate control functions, we map between function space and

{[0, 1]}^{n_{ω} \times M}

, using a time grid as follows.

Definition 3.

(

G_{ω}

, Δ, φ,

φ^{- 1}

, Ω,

Ω_{M}

) Let the ordered set

G_{ω} : = {t_{0} < \dots < t_{M} = t_{f}}

denote a time grid with

Δ_{j} : = t_{j + 1} - t_{j}

and

Δ_{\max} : = {max}_{j} Δ_{j}

for

j \in {[M - 1]}_{0}

. We define the mapping:

\begin{matrix} φ : {[0, 1]}^{n_{ω} \times M} \to L^{\infty} (T, {[0, 1]}^{n_{ω}}), α = φ (a) \end{matrix}

using

n_{ω}

piecewise constant functions:

\begin{matrix} α_{i} (t) & : = a_{i, j}, i \in [n_{ω}], t \in [t_{j}, t_{j + 1}), j \in {[M - 1]}_{0}, t_{j} \in G_{ω} . \end{matrix}

A mapping in reverse direction:

\begin{matrix} φ^{- 1} : L^{\infty} (T, {[0, 1]}^{n_{ω}}) \to {[0, 1]}^{n_{ω} \times M}, a = φ^{- 1} (α) \end{matrix}

is defined by extracting integrals on the grid

G_{ω}

:

\begin{matrix} a_{i, j} : = \frac{1}{Δ_{j}} \int_{t_{j}}^{t_{j + 1}} α_{i} (τ) d τ, i \in [n_{ω}], j \in {[M - 1]}_{0}, t_{j} \in G_{ω} . \end{matrix}

We denote integrality and (2d) using the sets:

\begin{matrix} Ω & : = \{ω \in L^{\infty} (T, {0, 1}^{n_{ω}}) : 1 = \sum_{i = 1}^{n_{ω}} ω_{i} (t) f o r a . a . t \in T\}, \\ Ω_{M} & : = \{w \in {0, 1}^{n_{ω} \times M} : 1 = \sum_{i = 1}^{n_{ω}} w_{i, j} f o r j \in {[M - 1]}_{0}\} . \end{matrix}

To scale control variables, we are going to need function evaluations and adjoint (dual) variables on the grid

G_{ω}

.

Definition 4 ( $(\tilde{λ}, \tilde{f})$ ).

Let

x^{*}

be the optimal solution of (OCP). The evaluated right-hand side function terms

{{\tilde{f}}_{i, j, k}}_{k \in [n_{x}]}

from (2b) are defined as the entries of:

\begin{matrix} R^{n_{x}} ∋ {\tilde{f}}_{i, j} : = \frac{1}{Δ_{j}} \int_{t_{j}}^{t_{j + 1}} f_{i} (τ, x^{*} (τ)) d τ, i \in [n_{ω}], t_{j} \in G_{ω} . \end{matrix}

We denote, by

{\tilde{λ}}_{j, k} \in R, t_{j} \in G_{ω}, k \in [n_{x}],

the discretized and evaluated dual variables of the ODE constraints (2b) in (OCP).

Our decomposition algorithm is based on the algorithmic choices of the sets

S^{CIA}

and

S^{REC}

, which we define next.

Definition 5

(

S^{CIA}, S^{REC}

). We introduce the set of CIA problems

S^{CIA}

as:

\begin{matrix} S^{CIA} & : = \{(CIAmax), (CIA 1), (CIAmaxB), (CIA 1 B), (λ CIA 1), (λ CIA 1 B), \\ (SCIAmax), (SCIA 1), (SCIAmaxB), (SCIA 1 B)\}, \end{matrix}

where we define the specific CIA problems in the next section. For a subset

{\tilde{S}}^{CIA} \subseteq S^{CIA}

, we denote, with

n_{CIA} : = | {\tilde{S}}^{CIA} |

, the number of different CIA problem formulations. Let the elements of

{\tilde{S}}^{CIA}

be numbered by

1, \dots, n_{CIA}

. Let the set

S^{REC}

of recombination mappings

F^{rec} \in S^{REC}

be defined via:

F^{rec} : \underset{k \in [n_{CIA}]}{\times} Ω_{N} \to Ω_{N}, F^{rec} (w^{1}, \dots, w^{n_{CIA}}) \mapsto w^{rec},

(3)

where

w^{k}

denotes the optimal solution of the problem

{(m i l p)}^{k} \in {\tilde{S}}^{CIA}

.

We propose to use Algorithm 1 to approximate the solution of (MIOCP) efficiently with a priori bounds. Relaxing (MIOCP) to (OCP) results in state and control trajectories (line 1). We solve different MILPs to approximate the relaxed controls with binary ones in lines 2–3. Their performance is evaluated in line 4 by calculating their corresponding (feasible) state trajectories and objective values. In lines 6–7, we create new candidate binary controls in several recombination heuristics based on the existing binary controls, which we evaluate as well (line 8). As a final step, we select the best-performing binary control as the solution in line 10.

Algorithm 1: Decomposition of (MIOCP).

The main idea of Algorithm 1 is to decouple controls and states. We approximate the relaxed control function

α^{*}

that is optimal for (OCP) with binary controls

ω

, such that a good objective function value is obtained when (MIOCP) is evaluated for

ω

. Which MILPs and recombination heuristics are used in the algorithm depends on the definition of the sets

{\tilde{S}}^{CIA}

and

{\tilde{S}}^{REC}

, which we discuss in Section 3 and Section 5. A theoretical motivation and error bounds on the approximation quality are given in Section 4. Algorithm 1 is a generalization of the decomposition approach in [27,32], for which

{\tilde{S}}^{REC}

is empty and

{\tilde{S}}^{CIA}

contains only one CIA problem formulation.

3. Combinatorial Integral Approximation MILPs

In the following, we define the MILP formulations of CIA type for

S^{CIA}

in Algorithm 1. CIA,

λ

CIA, and SCIA refer to different scalings (Section 3.1), “1” and “max” to different norms

∥ \cdot ∥

(Section 3.2), and the presence of “B” to a reversal of time (Section 3.3).

3.1. Combinatorial Integral Approximation and Scaled Variants

As part of the approximation step, we aim at finding binary control values that are close to the relaxed values with respect to the accumulated difference over all grid points. The following definition specifies the so-far-applied (CIA) problem together with two novel variants. We let the vector norm

∥ \cdot ∥

be unspecified here, but discuss applicable norms later.

Definition 6 ( $θ_{CIA}^{*}, θ_{SCIA}^{*}, θ_{λ CIA}^{*}$ , cf. Definition 4.17 in [50] ).

Let

a^{*}

be the given optimal control solution of (OCP), and let the evaluated model function values

\tilde{f}

and dual variables

\tilde{λ}

be given as introduced in Definition 4. Consider a vector norm

∥ \cdot ∥

. We introduce the following optimization problems:

\begin{matrix} θ_{CIA}^{*} & : = min_{w \in Ω_{N}} max_{j \in {[M - 1]}_{0}} ||\sum_{l \in [j]} (a_{\cdot, l}^{*} - w_{\cdot, l}) Δ_{l}||, \end{matrix}

(4)

\begin{matrix} θ_{SCIA}^{*} & : = min_{w \in Ω_{N}} max_{j \in {[M - 1]}_{0}} ||\sum_{l \in [j]} \sum_{i \in [n_{ω}]} (a_{i, l}^{*} - w_{i, l}) Δ_{l} {\tilde{f}}_{i, l}||, \end{matrix}

(5)

\begin{matrix} θ_{λ CIA}^{*} & : = min_{w \in Ω_{N}} max_{j \in {[M - 1]}_{0}} |\sum_{k \in [n_{x}]} {\tilde{λ}}_{j, k} \sum_{l \in [j]} \sum_{i \in [n_{ω}]} (a_{i, l}^{*} - w_{i, l}) Δ_{l} {\tilde{f}}_{i, l, k}| . \end{matrix}

(6)

3.2. Norm Dependent MILP Formulation

By introducing the auxiliary variable

θ

, we can reformulate (4) and (5) with the maximum norm.

Definition 7.

(CIAmax, SCIAmax) Let

a^{*}

be the given optimal control solution of (OCP) and let the evaluated model function values

\tilde{f}

be given as introduced in Definition 4. We define (CIAmax) as:

\begin{matrix} min_{θ \geq 0, w \in Ω_{M}} & θ \end{matrix}

(7a)

\begin{matrix} s . t . & θ \geq \pm \sum_{j = 0}^{l} (a_{i, j}^{*} - w_{i, j}) Δ_{j}, & f o r i \in [n_{ω}], l \in {[M - 1]}_{0}, \end{matrix}

(7b)

and (SCIAmax) as:

\begin{matrix} min_{θ \geq 0, w \in Ω_{M}} & θ \end{matrix}

(8a)

\begin{matrix} s . t . & θ \geq \pm \sum_{j = 0}^{l} \sum_{i = 1}^{n_{ω}} (a_{i, j}^{*} - w_{i, j}) Δ_{j} {\tilde{f}}_{i, j, k}, & f o r k \in [n_{x}], l \in {[M - 1]}_{0} . \end{matrix}

(8b)

We introduce the MILP analogue formulations for the Manhattan norm with auxiliary variables

s_{i, j} \geq 0, i \in [n_{ω}], j \in {[M - 1]}_{0}

. In this way, we specify the norm choices for Definition 6.

Definition 8.

(CIA1, SCIA1, λCIA1) Consider

a^{*}

as the given optimal control solution of (OCP). Let the evaluated model function values

\tilde{f}

and dual variables

\tilde{λ}

be given as introduced in Definition 4. Based on the auxiliary variables

s_{i, l}

we define (CIA1) as:

\begin{matrix} min_{θ, s_{i, l} \geq 0, w \in Ω_{M}} & θ \end{matrix}

(9)

\begin{matrix} s . t . θ & \geq \sum_{i = 1}^{n_{ω}} s_{i, l}, & f o r l \in {[M - 1]}_{0}, \end{matrix}

(10)

\begin{matrix} s_{i, l} & \geq \pm \sum_{j = 0}^{l} (a_{i, j}^{*} - w_{i, j}) Δ_{j}, & f o r i \in [n_{ω}], l \in {[M - 1]}_{0} . \end{matrix}

(11)

With a different dimension for

s_{k, l}

, we define (SCIA1) as:

\begin{matrix} min_{θ, s_{k, l} \geq 0, w \in Ω_{M}} & θ \end{matrix}

(12a)

\begin{matrix} s . t . θ & \geq \sum_{k = 1}^{n_{x}} s_{k, l}, & f o r l \in {[M - 1]}_{0}, \end{matrix}

(12b)

\begin{matrix} s_{k, l} & \geq \pm \sum_{j = 0}^{l} \sum_{i = 1}^{n_{ω}} (a_{i, j}^{*} - w_{i, j}) Δ_{j} {\tilde{f}}_{i, j, k}, & f o r k \in [n_{x}], l \in {[M - 1]}_{0} . \end{matrix}

(12c)

Finally, we define ( $λ$ CIA1)by modifying (12c) with the dual variables:

\begin{matrix} min_{θ, s_{k, l} \geq 0, w \in Ω_{M}} & θ \end{matrix}

(13a)

\begin{matrix} s . t . θ & \geq \sum_{k = 1}^{n_{x}} s_{k, l}, & f o r l \in {[M - 1]}_{0}, \end{matrix}

(13b)

\begin{matrix} s_{k, l} & \geq \pm λ_{l, k} \cdot \sum_{j = 0}^{l} \sum_{i = 1}^{n_{ω}} (a_{i, j}^{*} - w_{i, j}) Δ_{j} {\tilde{f}}_{i, j, k}, & f o r k \in [n_{x}], l \in {[M - 1]}_{0} . \end{matrix}

(13c)

Of course, also other norms, such as the Euclidean norm, can be used. We do not consider them here because of the resulting nonlinearity of the constraints.

3.3. Chronologically Ordered Constraints

We consider the possibility to modify the (CIA) problems by altering the chronological order in the constraints for the accumulated difference

∥ \sum_{l \in [j]} (a_{\cdot, l}^{*} - w_{\cdot, l}) Δ_{l} ∥

for

j \in {[M - 1]}_{0}

. We may use an arbitrary ordering of time intervals, instead of starting from the first interval

j = 0

. Here, we consider backward accumulation, starting from the interval with index

j = M - 1

, i.e.,

[t_{M - 1}, t_{M}]

:

\begin{matrix} θ \geq \pm \sum_{j = l}^{M - 1} (a_{i, j}^{*} - w_{i, j}) Δ_{j}, & f o r i \in [n_{ω}], l \in {[M - 1]}_{0} . \end{matrix}

(14)

Let the problem where (14) replaces (7b) in (CIAmax) be denoted by (CIAmaxB). The other introduced MILPs can be modified analogously with backward time accumulation and are named accordingly; e.g., (SCIA1B) refers to (SCIA1) with backward accumulation.

3.4. Combinatorial Constraints

One advantage of using an MILP for obtaining binary controls after the relaxation step, rather than using SUR, is the possibility to impose combinatorial constraints that couple over time. An example of such constraints is to limit the number of allowed switches

σ_{\max} \in N

between activated binary controls, which can be formulated as:

σ_{\max} \geq \frac{1}{2} \sum_{i = 1}^{n_{ω}} \sum_{j = 1}^{M - 1} | w_{i, j} - w_{i, j - 1} | .

(15)

We will omit this constraint class in the next section, but refer to [51,52] for a priori bounds and reformulations. Nevertheless, we are going to come back to these constraints in the numerical experiments section for also testing the different MILP approaches in this situation.

4. A Priori Bounds for CIA Decompositions

We revise results for the a priori bounds resulting from a CIA decomposition [41] and extend them to alternative decompositions that may be used in

S^{CIA}

in Algorithm 1. We stress here that Manns et al. [53] have recently presented a proof of improved regularity conditions. Nevertheless, we revise the theorem from [41] here, because it results in natural algorithmic extensions.

4.1. Combinatorial Integral Approximation

We recapitulate a variant of Grönwall’s Lemma from [41], which is needed for the main theorem.

Lemma 1 (A variant of Grönwall’s Lemma, see [41], Lemma 1).

Let

z_{1}, z_{2} : T \to R

be real-valued integrable functions and let

z_{2}

also belong to

L^{\infty} (T, R)

. If, for a constant

L \geq 0

, the following holds:

z_{1} (t) \leq z_{2} (t) + L \int_{t_{0}}^{t} z_{1} (τ) d τ f o r a . a . t \in T,

then we have:

z_{1} (t) \leq {∥ z_{2} ∥}_{\infty} e^{L (t - t_{0})} f o r a . a . t \in T .

(16)

Proof.

See [41], proof of Lemma 1. □

In the following results, we analyze the evolution of two trajectories,

x

and

y

, based on the same ODE system (2b) but driven by two different controls,

α

and

ω

. The following theorem gives a statement on the distance between the two trajectories depending on the distance of the controls.

Theorem 1.

Consider

α

and

ω \in Ω

. We reuse the model functions

f_{0}, f_{i} : T \times R^{n_{x}} \to R^{n_{x}}

from Definition 2 for

i \in [n_{ω}]

. Let

x (\cdot)

and

y (\cdot)

be the unique solutions of the IVP:

\begin{matrix} \dot{x} (t) & = f_{0} (t, x (t)) + \sum_{i = 1}^{n_{ω}} α_{i} (t) f_{i} (t, x (t)), x (t_{0}) = x_{0}, \end{matrix}

(17a)

\begin{matrix} \dot{y} (t) & = f_{0} (t, y (t)) + \sum_{i = 1}^{n_{ω}} ω_{i} (t) f_{i} (t, y (t)), y (t_{0}) = y_{0}, \end{matrix}

(17b)

where

x_{0}, y_{0} \in R^{n_{x}}

. Assume that there are positive constants

L, C \in R^{+},

together with a vector norm

||\cdot||

, such that, for a.a.,

t \in T

holds:

\begin{matrix} ||f_{i} (t, x (t)) - f_{i} (t, y (t))|| & \leq L ||x (t) - y (t)||, f o r i \in {[n_{ω}]}_{0}, \end{matrix}

(17c)

\begin{matrix} ||\frac{d}{d t} f_{i} (t, x (t))|| & \leq C, f o r i \in [n_{ω}] . \end{matrix}

(17d)

Furthermore, let

f_{i} (\cdot, x (\cdot)), i \in [n_{ω}]

be essentially bounded by

B \in R^{+}

on

T

, and assume that for all

t \in T

. It holds that:

\begin{matrix} ||\int_{t_{0}}^{t} α (τ) - ω (τ) d τ|| \leq θ, \end{matrix}

(17e)

with the constant

θ \geq 0

. Then, for a.a.

t \in T

we also have:

||x (t) - y (t)|| \leq (||x_{0} - y_{0}|| + θ n_{ω} (B + C (t - t_{0}))) e^{L (t - t_{0})} .

(17f)

Proof.

This Theorem is an (equivalent) reformulation of Theorem 2 from [41]. The only differences are modified notations and the usage of

f_{i}, i \in {[n_{ω}]}_{0}

as differentiable mapping instead of

A

in [41]. □

We first recognize that Theorem 1 is applicable for the CIA problem with any vector norm, which can be guessed from the equivalence of norms.

Corollary 1 (Approximation bounds via (CIA), cf. Corollary 5.2 in [50]).

Consider the setting of Theorem 1; in particular, let the regularity assumptions on

f_{i}, i \in {[n_{ω}]}_{0},

hold. Assume that

x

and

y_{CIA}

are the solutions of the IVP (2b) and (2c), where

x

is based on a given relaxed control

a

and

y_{CIA}

is based on

w^{*}

, which is the optimal solution of (CIA

n o

),

n o \in {max, 1}

, with objective value

θ_{CIA}^{*}

from Definition 6. Then, the state approximation error is bounded for a.a.

t \in T

by:

∥ x (t) - y_{CIA} (t) ∥ \leq θ_{CIA}^{*} (B + C (t - t_{0})) e^{L (t - t_{0})} .

(18)

Proof.

This corollary is a direct result from Theorem 1 with

x_{0} = y_{0}

. We note that

θ_{CIA}^{*}

represents the norm of the accumulated control deviation, which appears in the proof of the Theorem and is bounded by

θ n_{ω}

so that this replaced term is settled. □

4.2. Scaled Combinatorial Integral Approximation

The proof of Theorem 1 contains a motivation for the (SCIA) problems as derived in the following result.

Corollary 2 (Approximation bounds via (SCIA), cf. Corollary 5.3 in [50]).

Consider the setting of Theorem 1, and let

{∥ \cdot ∥}_{n o}

refer to the maximum or 1-norm, i.e.,

n o \in {max, 1}

. Assume that

x

and

y_{CIA}

are the solutions of the IVP (2b) and (2c), where

x

is based on a given

a

, and

y_{CIA}

is driven by

w^{*}

, which is the optimal solution of (SCIA

n o

). Then, for a.a.

t \in T

, the state approximation error is bounded by:

∥ x (t) - y_{SCIA} (t) ∥ \leq θ_{SCIA}^{*} e^{L (t - t_{0})} \leq θ_{CIA}^{*} (B + C (t - t_{0})) e^{L (t - t_{0})},

(19)

where

θ_{SCIA}^{*}

is the optimal objective value of (SCIA

n o

),

n o \in {max, 1}

.

Proof.

From the second and last inequalities in the proof of Theorem 2 from [41] and Corollary 1, it follows that, for a.a.

t \in T

and any

ω \in Ω

:

∥\int_{t_{0}}^{t} \sum_{i = 1}^{n_{ω}} (α_{i} (τ) - ω_{i} (τ)) f_{i} (x (τ)) d τ∥ \leq ∥\int_{t_{0}}^{t} α (τ) - ω (τ) d τ∥ (B + C (t - t_{0})) .

Let

ω^{CIA}

denote the control based on the optimal solution

w^{*}

of (CIA

n o

),

n o \in {max, 1}

. We take the minimum in the above inequality and obtain:

θ_{SCIA}^{*} \leq ∥\int_{t_{0}}^{t} \sum_{i = 1}^{n_{ω}} (α_{i} (τ) - ω_{i}^{CIA} (τ)) f_{i} (x (τ)) d τ∥ \leq θ_{CIA}^{*} (B + C (t - t_{0})) .

□

We conclude from Corollary 1 that the approximation and convergence results of the decomposition still hold if (SCIA

n o

),

n o \in {max, 1}

, is used to construct the binary control. The approximation bound based on (SCIA

n o

) is tighter than the existing (CIA

n o

)-related bound. Hence, it is an obvious choice to consult these alternative binary controls for an approximation study. Ideally, the binary control constructed in this way will result in an improved state approximation and objective value for (MIOCP). However, we oppose this hope next.

Remark 1 (Construction by (SCIA) does not guarantee superior quality).

The binary control based on the optimal solution of (SCIA

n o

),

n o \in {max, 1}

and used in the decomposition does not necessarily result in a state approximation or objective value that is superior to that obtained using (CIA

n o

). It may hold that:

∥ x (t) - y_{CIA} (t) ∥ < ∥ x (t) - y_{SCIA} (t) ∥ < θ_{SCIA}^{*}, f o r s o m e t \in T,

where we use the notation from Corollaries 1 and 2. Because of a possible non-convex objective, the computed trajectories may lead to a superior objective value for the solution based on (CIA

n o

), compared with that based on (SCIA

n o

), even if the above inequality does not hold.

4.3. $λ$ -Combinatorial Integral Approximation

The

λ

-approximation stems from the aforementioned theorem of approximating differential states, but with assessing the difference of the cost-to-go function. We define it in a more general MIOCP setting with a Bolza objective:

Φ (x (t_{f})) + \int_{t \in T} L (x (τ), ω (τ)) d τ,

(20)

where

L \in C^{1} (R^{n_{x} + 1} \times R^{n_{ω}}, R)

.

Definition 9 (Cost-to-go function by Hamilton–Jacobi–Bellman).

Let the cost-to-go function

J \in L^{\infty} (R^{n_{x}} \times T, R)

with Bolza objective (20) be implicitly defined as:

\begin{matrix} J (x (t_{f}), t_{f}) & = Φ (x (t_{f})), \\ - \frac{\partial J}{\partial t} (x (t), t) & = min_{ω \in Ω} L (x (t), ω (t)) + \frac{\partial J}{\partial x} (x (t), t) (f_{0} (t, x (t)) + \sum_{i = 1}^{n_{ω}} ω_{i} (t) f_{i} (t, x (t))) . \end{matrix}

We recognize that

\frac{\partial J}{\partial x}

can be interpreted as Lagrange multiplier or dual variables of the ODE constraints in (MIOCP). Therefore, we write, in short,

λ (t)

instead of

\frac{\partial J}{\partial x}

. With the groundwork above, we are ready to deduce the corresponding bound.

Corollary 3 (Approximation bounds via ( $λ$ CIA), cf. Corollary 5.4 in [50]).

Consider the setting of Theorem 1. In particular, let the regularity assumptions (17d), (17c), and the essential boundedness of

f

be true. Assume that

x

and

y_{λ CIA}

are the solutions of the IVP (2b) and (2c), where

x

is based on a given

a

, and

y_{λ CIA}

is based on

w^{*}

, which is the optimal solution of (λCIA1). Let J be the cost-to-go function as defined in Definition 9 for (MIOCP) and

λ (t)

be the adjoint vector at

t \in T

for the ODE system (2b). For a.a.

t \in T

, it follows that:

| J (x (t), t) - J (y_{λ CIA} (t), t) | \leq θ_{λ CIA}^{*} e^{L (t - t_{0})} + o (∥ x (t) - y_{λ CIA} {(t) ∥}^{2}),

where o refers to Landau’s little-o notation.

Proof.

Let us consider the difference of the cost-to-go functions by approximation with a partial first-order Taylor expansion around

J (x (t), t)

. We perform the approximation with respect to the trajectories

x, y_{λ CIA}

. Thus, we apply Taylor’s theorem for

t \in T

:

J (y_{λ CIA} (t), t) - J (x (t), t) = \frac{\partial J}{\partial x} (x (t), t) (y_{λ CIA} (t) - x (t)) + o ({||y_{λ CIA} (t) - x (t)||}^{2}) .

(21)

As pointed out above, the dual variables of the ODE constraint (2b) are equal to

\frac{d J}{d x} (x (t), t)

. We use the notation

{∥ x (t) ∥}_{λ (t)} : = |\sum_{k \in [n_{x}]} λ_{k} (t) x_{k} (t)|

for

t \in T

, which defines a semi-norm. Next, for a.a.

t \in T

, we transfer the proof of Theorem 1 to this notation and to (21):

\begin{matrix} | J (y_{λ CIA} (t), t) - J (x (t), t) | & \leq ∥ y_{λ CIA} {- x ∥}_{λ (t)} + o ({||y_{λ CIA} - x||}^{2}) \\ \leq \dots (as in proof of Theorem 1) \\ \leq {||y_{0} - x_{0}||}_{λ (t)} + L \int_{t_{0}}^{t} {||y_{λ CIA} (τ) - x (τ) d τ||}_{λ (t)} \\ + {||\sum_{i = 1}^{n_{ω}} \int_{t_{0}}^{t} (ω_{i}^{λ CIA} (τ) - α_{i} (τ)) \cdot f_{i} (τ, x (τ)) d τ||}_{λ (t)} \\ + o ({||y_{λ CIA} - x||}^{2}) . \end{matrix}

The third summand of the last inequality is equal to the objective

θ_{λ CIA}^{*}

that is to be minimized in (

λ

CIA1). Finally, we apply

x_{0} = y_{0}

and use the Grönwall Lemma 1 with the integrable functions:

z_{1} (t) = {∥x - y_{λ CIA}∥}_{λ (t)}, z_{2} (t) = {||\sum_{i = 1}^{n_{ω}} \int_{t_{0}}^{t} (ω_{i}^{λ CIA} (τ) - α_{i} (τ)) \cdot f_{i} (τ, x (τ)) d τ||}_{λ (t)},

so that the claim is proven. □

Due to the first-order Taylor approximation, (

λ

CIA1) requires that the relaxed trajectory

x

can be well approximated by a trajectory

y

that is based on binary controls. On the other hand, if there is no such trajectory in a close neighborhood of

x

, then (

λ

CIA1) may have an unintuitive binary control as its optimal solution.

4.4. Backwards Accumulating Constraints

If we adapt the MIOCP instance with fixed final states

x_{f} \in R^{n_{x}}

and with a Lagrangian objective type, we can also apply this modified setting to Theorem 1. We express this issue in the following corollary.

Corollary 4

(Approximation bounds via backward constraints). Consider the setting of Theorem 1. Let

x

and

y

be the state trajectory solutions of the terminal value problems (2b), with

x (t_{f}) = x_{f}

and

y (t_{f}) = y_{f}

for

x_{f}, y_{f} \in R^{n_{x}}

. Assume that for all

t \in T, θ_{b} \in R^{+}

, it holds that:

∥\int_{t}^{t_{f}} α (τ) - ω (τ) d τ∥ \leq θ_{b} .

(22)

Then, for a.a.,

t \in T

it also holds that:

||x (t) - y (t)|| \leq (||x_{f} - y_{f}|| + θ_{b} n_{ω} (B + C (t_{f} - t))) e^{L (t_{f} - t)} .

(23)

Proof.

We apply the proof of Theorem 1 to the altered setting, in which we integrate over

[t, t_{f}]

instead of integrating over

[t_{0}, t]

.

□

With the assumption that

∥ x (t_{f}) - y (t_{f}) ∥

is small, the backward (CIA) rounding problem approach from Section 3.3 is not only applicable for terminal constraint problems but is also appropriate for an (MIOCP) instance with a given initial value

x_{0}

and variable final-state values.

4.5. Connection to Decomposition Algorithm and Optimization Problem

As a last step in this chapter, the previous results are related to (MIOCP) and the decomposition Algorithm 1.

Remark 2 (Arbitrary close approximation of (OCP) solution).

We could extend Algorithm 1 with an outer loop that checks if

Φ^{opt}

is sufficiently close to

Φ^{rel}

, and if not, we would refine the grid

G_{ω}

. In [29], an arbitrary close approximation of the (OCP) solution has been deduced with this procedure and based on (CIA). With the assumption of

Φ (\cdot), c (\cdot)

being continuous and that there exists a feasible trajectory

x

for (OCP), it follows that for any

ϵ > 0

there exists a grid

G_{ω}

, with grid size

Δ_{\max}

, such that there is a feasible trajectory

{y (t_{j})}_{t_{j} \in G_{ω}}

with:

\begin{matrix} | Φ (x (t_{f})) - Φ (y (t_{f})) | & \leq ϵ, \\ ∥ c (t_{j}, x (t_{j})) - c (t_{j}, y (t_{j})) ∥ & \leq ϵ . \end{matrix}

The proof uses the sum-up rounding scheme that derives the binary control approximation with

θ_{CIA}^{*} \leq Const (n_{ω}) Δ_{\max}

[35,41], where

Const (n_{ω})

is a constant depending on

n_{ω}

. In case we extend Algorithm 1 with the refinement procedure, and if (CIA) or (SCIA) are chosen to be elements of

S^{CIA}

, the same approximation result holds.

Corollary 5 (Solution accuracy of differential states of Algorithm 1).

Let

θ_{SCIA, \max}^{*}, θ_{SCIA, 1}^{*}

be the optimal objective values of (SCIAmax) and (SCIA1), respectively. Consider the setting of Theorem 1. In particular, let the regularity assumptions (17d), (17c), and essential boundedness of

f

be true. Assume that

x

and

y

are the solutions of the IVP (2b) and (2c), where

x

is based on a given

a

, and

y

is based on Algorithm 1. Let

G_{ω}

be the applied grid. It follows for

t_{i} \in G_{ω}

that:

\begin{matrix} {||y (t_{i}) - x (t_{i})||}_{j} & \leq ({||y_{0} - x_{0}||}_{j} + θ_{SCIA, j}^{*}) e^{L (t_{i} - t_{0})}, j \in {\max, 1} . \end{matrix}

(24)

Proof.

The claim is a direct result of Corollary 2. □

We have deliberately chosen the tightest bound from the previous corollaries, but could also use others. The received grid-specific rounding error bounds aim primarily at approximating the differential states. With (

λ

CIA) or Lipschitz continuity of the objective, there are also tools to discuss the rounding error of the objectives. The recombination heuristics work in the area of objective approximation and, hence, are to be discussed next.

5. Recombination Heuristics

We present several recombination heuristics that recombine different binary controls

w

to new candidate solutions with potentially smaller objective values. The general framework is open to apply different heuristics, such as genetic algorithms [54], that are not introduced in this article.

5.1. GreedyTime

Algorithm 2 is a routine for using the MILP solutions in a greedy pattern with the aim of constructing binary controls

w

that exhibit an improved objective value

Φ (φ (w))

.

Algorithm 2: GreedyTime heuristic for finding improved

w

variables.

In GreedyTime, we iterate over all intervals

j \in {[M - 1]}_{0}

in chronological order (line 1). On every interval, we check if there are MILP pairs

(m_{1}, m_{2})

that differ in their binary control vectors in line 2. We recombine for each of these pairs the

m_{1}

solution with the binary control vector from

m_{2}

at interval j to construct a temporary control solution

{\tilde{w}}^{m_{1}}

(line 3). Based on this construction, we evaluate the objective of this new control solution in line 4 and overwrite the binary control

w^{m_{1}}

with the recombined solution

{\tilde{w}}^{m_{1}}

if that latter results in an improved objective value (lines 5–6). We proceed in the same way with the second solution

m_{2}

when the (same) pair

(m_{2}, m_{1})

appears in the inner loop.

A large number of calculated MILPs may result in a large number of pairs with unequal control solutions. In this case, it is advisable to only swap the

w^{m_{2}}

solution with the currently smallest objective value, instead of swapping and testing each variation for every

w^{m_{1}}

.

If the control problem at hand involves no continuous controls

u

, it is straightforward to evaluate the (MIOCP) in line 4 with the previously found and fixed

x

until grid interval j. This speeds up the process in problems with fine grids and numerous MILP solutions, because (MIOCP) needs to be solved iteratively. Moreover, if an MILP solution

m_{1}

differs from two MILPs

m_{2}, m_{3}

with identical binary control vectors

w_{\cdot, j}^{m_{2}} = w_{\cdot, j}^{m_{3}}

, it is sufficient to test the recombination with only one of the two. We illustrate an example recombination step for the pairs (CIA, SCIA) and (SCIA, CIA) in Figure 1.

Remark 3 (Modifications of GreedyTime).

1.: We can also apply the outer loop in Algorithm 2 backward in time and name the backward version GreedyTimeBackward;
2.: We may consider only singular arcs, instead of looping over all intervals, since the constructed binary controls are likely to be equal on bang–bang arcs. With singular arcs, we mean the intervals where $ϵ < a_{i, j}^{*} < 1 - ϵ$ holds for the optimal control solution $a^{*}$ of (OCP), with a certain threshold $ϵ > 0$ ;
3.: Greedy-cost-to-go modification: Assume we have obtained the dual variables ${\tilde{λ}}_{j, k}, j \in {[M - 1]}_{0}, k \in [n_{x}],$ of the state equations of (OCP). Then, re-sort the intervals ${[M - 1]}_{0}$ in descending order according to $\sum_{k \in [n_{x}]} | {\tilde{λ}}_{j, k} |, j \in {[M - 1]}_{0} .$ In this way, we construct a new ordered grid $G_{ω}^{λ}$ to be iterated over in Algorithm 2.

5.2. Singular Arc Recombination

If

a^{*}

is (almost) binary on certain intervals,

w

should typically attain these binary values as well as an optimal solution of a CIA rounding problem—regardless of the MILP choice. To this end, we formalize singular arcs of

a^{*}

as sets of consecutive intervals on which the relaxed control takes values smaller than

ϵ

or larger than

1 - ϵ

. Here,

ϵ > 0

is a chosen small tolerance.

Definition 10 (Number of singular arcs $n_{sing}$ , singular arc interval sets $J_{l}^{sing}$ ).

Consider

a^{*}

as the optimal control solution of (OCP) and a small chosen tolerance

ϵ > 0

. Let

k_{0}^{end} : = - 1

. We introduce the following singular arc interval index sets iteratively for

l \geq 1

:

\begin{matrix} k_{l}^{start} & : = min \{j \in {[M - 1]}_{0} ∣ j > k_{l - 1}^{end} \land \exists i \in [n_{ω}] : a_{i, j}^{*} \in [ϵ, 1 - ϵ]\}, \\ k_{l}^{end} & : = max \{j \in {[M - 1]}_{0} ∣ \forall r = k_{l}^{start}, \dots, j \exists i \in [n_{ω}] : a_{i, r}^{*} \in [ϵ, 1 - ϵ]\}, \\ J_{l}^{sing} & = \{k_{l}^{start}, \dots, k_{l}^{end}\} . \end{matrix}

Let the number of singular arcs

n_{sing}

be defined as:

n_{sing} : = arg max_{l \in N} \{k_{l}^{end}\} .

We aim to recombine singular arc realizations of the different MILP solutions from

S^{CIA}

, which is performed in Algorithm 3.

We initialize the set of visited binary controls as empty in the algorithm and set the so-far best objective value

Φ^{rec}

to infinity (line 1). Next, the temporary binary control

w^{tmp}

on the bang–bang arcs is set as equal to the rounded relaxed control (line 2). In line 3, we test every possible variation of the different MILP solutions on the singular arcs. In this way, we fill up the singular arcs of the temporary binary control

w^{tmp}

(line 4). The constructed control

w^{tmp}

is checked if it has already been visited (line 5), and if so, the algorithm jumps to the next iteration (line 6). Otherwise, we include

w^{tmp}

in the set of visited controls (line 8), and we evaluate its objective value (line 9). When a recombination has an improved objective value than the so-far best control, it will be saved as the so-far best control (lines 10–12). We illustrate the algorithm in Figure 2.

Algorithm 3:Singular arc block heuristic for recombining binary controls

w^{m}

,

m \in S^{CIA}

.

We have to take care of the number of possible variations of singular blocks and MILP solutions

| S^{CIA} |^{n_{arc}}

to avoid a combinatorial explosion. Therefore, it is advisable to choose

{\tilde{S}}^{CIA}

in Algorithm 1 with a small number of MILPs. Only a few singular arcs result usually after solving (OCP). We may modify Algorithm 3 to be greedy, i.e., to apply the idea of GreedyTime on arcs instead of on single intervals, if there are more than four singular arcs.

The singular arc recombination yields an objective value that is at least as good as those previously constructed via the MILPs. However, there currently exists no framework for quantifying these possible improvements in terms of new rounding errors of the objective.

6. Computational Results

6.1. Software Implementation and Instances

We implemented Algorithm 1 in AMPL [55] using the code ampl_mintoc, which is a modeling framework to solve optimal control problems. It features different discretization schemes of ODEs, though we used only a Radau collocation from [39]. The tool is advantageous for our purpose, since it includes automatic differentiation, interfaces to MILP solvers, and its problem formulation stays close to mathematics. In addition, AMPL provides the dual variables

λ

. Throughout the numerical study, we applied Gurobi 8.1 as the MILP solver and IPOPT 3.12.4 as the NLP solver, with default settings in each case, to solve the discretized (OCP). We assumed that the choice of the MILP solver has little influence on solution quality and verified this by testing also with CPLEX 12.9. We tested our algorithms also with CasADi and received similar results as with ampl_mintoc. All results were obtained on a workstation with four Intel i5-4210U CPUs (1.7 GHz) and 7.7 GB RAM.

We included MIOCPs from the benchmark collection site mintoc.de [18] in our numerical study, which we specify further in the following subsections. For these problems, we chose a differential states discretization with N intervals such that it was fine enough respect to the objective value. In this way, the objective value differs only to the fifth decimal place, with respect to a finer discretization for constant M. Afterwards, we varied M with fixed N in order to construct different instances. We refer to Appendix A for further details. Solving the binary approximation problem and then solving the MIOCP with fixed binary controls might result in infeasible solutions for problems involving path or terminal constraints. To this end, we relaxed these constraints and applied a merit function that penalizes constraint violation to be added to the objective with a sufficiently high penalty factor.

6.2. Scaled Combinatorial Integral Approximation

Our hypothesis is that the MILPs based on a scaled combinatorial integral approximation perform the best on instances where the binary control enters the control dependent right-hand side terms

f_{i}

of the ODE in an affine way, i.e.,:

\dot{x} (t) = f_{0} (t, x (t)) + \sum_{i = 1}^{n_{ω}} ω_{i} (t) c_{i}, f o r a . a . t \in T,

(25)

where

c_{i} \in R

. On the other hand, if

f_{i}

depends on

x (t)

, it may change rapidly over time. To this end, this results in possibly inaccurate

ω

solutions, since we use only the discretized state trajectory

x (t)

value. The MIOCPs “Double tank (Multimode)” and “Lotka–Volterra (absolute fishing variant)” are identified as candidate problems with the above right-hand side structure and constructed their solutions for different discretizations and both with and without the combinatorial constraint (15); see Appendix A for details. The results are presented in Figure 3.

We chose to evaluate the MIOCP solutions according to the distance in the

{∥ \cdot ∥}_{\infty}

-norm of the differential state trajectories corresponding to either binary or relaxed controls. We argued in Section 4 that the CIA decomposition is built on this distance, and, particularly, the proximity of objective values and constraint satisfaction follows as derived in Section 4.5. The differential state trajectories based on the (SCIA1) and (SCIAmax) solutions are significantly closer to the relaxed solution compared with their CIA counterparts, as shown in the performance plot. There are hardly differences between

{∥ \cdot ∥}_{1}

- and

{∥ \cdot ∥}_{\infty}

-norm results, although a tendency can be detected of better-performing

{∥ \cdot ∥}_{\infty}

variants.

We examined whether the (SCIA1) and (SCIAmax) are outperformed by (CIA1) or (CIAmax), if the state trajectory distance is measured in

{∥ \cdot ∥}_{1}

-norm, or if the objective value deviation to the relaxed solution is taken into account, but the SCIA variants remained the clear winners. Analogously, the result remains similar when comparing the algorithms solely on instances (not) including combinatorial constraints.

6.3. $λ$ -Combinatorial Integral Approximation

We derived (

λ

CIA) as an approximation of the cost-to-go function difference to the relaxed solution. Since this approximation is linear, the standard (CIA) approach is more suitable on most of the (nonlinear) MIOCPs. Our hypothesis is that the situation is different when a regularization term enters the objective function, accounting for the cost of activating binary controls in the form of, e.g.,:

Φ (x (t_{f})) + \int_{t \in T} \sum_{i = 1}^{n_{ω}} ω_{i} (τ) c_{i} d τ,

where

c_{i} \in R

. The problem “Quadrotor (binary variant)” includes a cost function, where the controls enter in the above form, so that we used it with different discretizations and both with and without the combinatorial constraint (15) for comparing the (

λ

CIA) solutions with the ones obtained via (CIA). We present the computation results in Figure 4.

In contrast to the previous section, we compared here the objective deviations from the relaxed solution in percentages, since the

λ

-combinatorial integral approximation aims directly at improving the objective values. However, we remark that the latter algorithm performs worse than (CIA1) and (CIAmax) if the distance to the relaxed solution is measured in differential state space. The performance plot shows that (

λ

CIA) provides solutions with improved objective values on some instances, but on many others it does not. Since (

λ

CIA) turned out to provide even weaker approximations of the relaxed solutions for other MIOCPs, as will be shown in Section 6.5, we do not recommend to use it in general as a single MILP approximation step. It serves, however, as beneficial candidate solution for recombination and might be useful for not-yet-explored problem classes.

6.4. Backwards Accumulating Constraints

Our hypothesis for the MILP variants based on backwards accumulated constraints is that they are beneficial if the MIOCP involves terminal equality constraints on the differential states. Because the deviation to the relaxed solution can become large, the standard (CIAmax) approach may construct a solution that does not satisfy this constraint. Nevertheless, the direct incorporation of terminal constraints into the MIOCP may lead to numerical difficulties already for the relaxed problem. To this end, we considered soft constraints, meaning that we introduce slack variables in order to penalize a deviation of the differential states from a desired terminal value. We identified the MIOCP “Lotka–Volterra (terminal constraint violation)” as a candidate problem and calculated its solutions for different discretizations and both with and without the combinatorial constraint (15). We illustrate the objective deviation from the relaxed solution in percentage of (CIA) and its backward variant solutions in Figure 5.

We chose the objective deviation as the performance measure for our comparison study because the objective accounts for a violation of the terminal constraints via a slack variable penalty term. The graphs of (CIAmaxB) and (CIA1B) indicate that their respective MIOCP solutions involve smaller objective values than their forward CIA counterparts. We observed that this result seems to be independent from the chosen norm, since the performance differences of (CIAmaxB) to (CIA1B) are neglectable.

6.5. Recombination Heuristics

We used the MILP solutions by (CIAmax), (CIA1), (SCIAmax), (

λ

CIA1), and (CIAmaxB) as a base for running the recombination heuristics on a set of 13 MIOCPs from the benchmark collection site mintoc.de with different discretizations (see Appendix A for details). The box plot in Figure 6 illustrates the numerical results with respect to objective deviation of each algorithm to the relaxed solution in percentages.

The boxes including median values of the SCIA and backwards approaches appear to be slightly larger and their mean values and their outliers a bit smaller, respectively, than the ones of the (CIA) MILPs. The numerical study revealed several instances where (SCIAmax) or (SCIA1) ran into a binary solution with active controls on some intervals with relaxed values close to zero. Under the assumption that the combinatorial approximation is conducted mainly on singular arcs, these cases might be called degenerated. We experienced underperforming objective values for SCIA in case of degenerated controls, which explains some of the underperforming instances. We conclude that (SCIA1), (SCIAmax), and (CIAmaxB) should be used with caution. For specific problem classes, as shown in the previous chapters, they can be very helpful. Here, we have not specifically selected the problems, and on this general problem class there is no guarantee that these algorithms provide any real improvement.

The solutions of (

λ

CIA1) clearly underperform, but we stress, as mentioned in Section 6.3, their importance for recombination. As a comparative calculation, we have also computed the solutions based on SUR and see that they provide similarly good, albeit somewhat worse, solutions compared with (CIA1) and (CIAmax). Note that depending on the selected algorithm, some instances resulted in a deviation of more than 100%, which can be explained by highly penalized infeasible solutions of path- or terminal-constrained problems.

The depicted recombination heuristics provide significantly better solutions in terms of the objective than the MILP-constructed binary solutions. The median values are reduced by a factor of about 2 (ArcRecombination) to a factor of 3 (GreedyTime) in comparison with (CIA). The other characteristics such as mean values, box borders, and outliers also reflect the improvements. Particularly noteworthy is the GreedyTime heuristic, which is robust against outliers as well as constructs on average solutions with small objective values. The ArcRecombination selects the solution of the MILP algorithms with the smallest objective value in the case of only one singular arc. Since many of the selected problems have only one singular arc, the box plot illustrates that this minimum over all MILPs can already provide a significant improvement.

6.6. Runtime Evaluation

Figure 7 shows exemplarily the relationship between runtime and objective function values for the Lotka–Volterra multimode problem with N = 12,000 and varying M. We compare (CIAmax) values both with GreedyTime and the solutions obtained by the MINLP solver Bonmin 1.8.6. For a fair comparison, we run Bonmin with its four different main algorithms, B-BB, B-OA, B-QG, and B-Hyb, and depicted the shortest runtime of these algorithms. Elapsed real time from AMPL represents runtime in our computations, since CPU time appeared to be very similar for our Bonmin calculations, and Gurobi, on the other hand, is known to be a multi-threaded solver. First of all, the illustration shows that the objective spread to the relaxed solution vanished with increasing M—regardless of the selected approach. Second, CIA was, for some instances, already quite close to Bonmin (

M = 25, 50

) in terms of objective quality, so GreedyTime cannot improve much. For other discretizations with a considerable gap between CIA and Bonmin solution, GreedyTime could close most of this gap while being two orders of magnitude faster than Bonmin.

The average runtime over all instances for (CIAmax) was about a few seconds and increased slightly for (SCIAmax); see Appendix A.2. Gurobi needed on average more than one minute for the MILPs with 1-norm and thus considerably more time. For instances involving a fine discretization, the runtime increased enormously, so we set a time limit of 30 min.

We remark that the greedy heuristics and the ArcRecombination are to be used cautiously, because an input of many MILPs leads to a high number of recombinations that have to be evaluated. ArcRecombination is relatively inexpensive and offers a solution that is at least as good as the best MILP. The algorithm is most beneficial in cases of several singular arcs, in contrast to most applied problem instances where there is only one arc. The greedy algorithm variants are quite expensive (run times of up to 15 min), yet provide solutions with objective function values very close to those of the relaxed problem.

A way to significantly reduce the computation time of the MILPs is to apply branch-and-bound [32] or to use SUR for constructing approximate solutions. These algorithms are implemented in the open-source software package pycombina (see https://github.com/adbuerger/pycombina) (accessed on 14 February 2022) [56] and might be adapted to the scaled MILP case as part of a future study. If the binary controls enter linearly into the dynamics, as in Equation (25), then the modification is straightforward, since only all differences

(a_{i} - w_{i})

have to be scaled with the factors

c_{i}

. Finally, run times of days, or even weeks when it comes to the MINLP solver, cast a positive light on the proposed decomposition algorithm including recombination.

7. Summary and Conclusions

We have extended the decomposition approach based on combinatorial integral approximation [32], using multiple MILP formulations and recombination heuristics in an outer loop. At the price of additional MILP solutions and MIOCP evaluations, we obtain an improvement of the objective function value for every fixed control discretization grid.

A numerical study with benchmark problems shows that the novel MILP solutions indeed improve the existing CIA solutions in terms of objective value on specific problem classes. We conclude that the CIA decomposition can be reasonably modified for certain MIOC subproblem classes. Furthermore, the computational results for a set of non-specific MIOCPs resulted in a substantial improvement of the CIA solution through recombination strategies.

The main added value of this study is a decomposition algorithm that works much faster than Bonmin, but still offers qualitatively similar solutions and performs on average better than the existing (CIA) approach. The framework is open to extensions, both on the MILP and on the post-processing level.

Additional work is necessary to incorporate other constraints, such as vanishing constraints, to derive further tailored MILP formulations for specific problem classes and to develop numerical algorithms that generalize SUR and/or branch-and-bound algorithms to the various MILP formulations.

Furthermore, it would be interesting to include and to compare the results of this study computationally with recent approaches for the CIA decomposition [43,44].

Author Contributions

C.Z. conducted the implementations and the numerical study, and was responsible for writing most of the article. T.W. contributed major algorithmic ideas and proofread the manuscript. S.S. contributed to the research design and discussions of the algorithmic ideas and wrote parts of the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by European Research Council grant number 647573.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We acknowledge the financial support by the Federal Ministry of Education and Research of Germany with in the project P2Chem (support code 05M18OCB). This project has received funding from the European Research Council (ERC, grant agreement No 647573) under the European Union’s Horizon 2020 research and innovation program and and from Deutsche Forschungsgemeinschaft (DFG, German Research Foundation)—314838170, GRK 2297 MathCoRe and SPP 1962 and SPP 2331.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

The following mathematical symbols are used in this manuscript:

$T$	Time horizon
t	Time variable
$x, y$	Differential state vectors
$ω, α$	Binary control and relaxed binary control
$w, a$	Discretized binary control and relaxed binary control variables
$Φ$	Objective function for (MIOCP)
$f_{0}, f_{i}$	Model functions of the ODE system
$c$	Path constraint function for (MIOCP)
$M, n_{x}, n_{ω}$	Number of discretization intervals, differential states, and controls
$G_{ω}$	Control discretization grid
$Δ$	Discretization grid length
$φ$	Canonical mapping from dicretized controls to control functions
$Ω, Ω_{M}$	Space of binary control functions and discretized binary control variables
$λ$	Evaluated dual variables of the ODE constraints (2b)
$S^{CIA}, S^{REC}$	Set of CIA problems and set of recombination mappings
$F^{rec}$	Recombination mapping
$θ^{*}, θ$	Optimal MILP objective value, auxiliary MILP objective variable
s	Auxiliary MILP constraint variable
$σ_{\max}$	Number of allowed switches
L	Lipschitz constant
$B, C$	Upper bounds on $f_{i}$ and their derivatives
J	Cost-to-go function
$J_{l}^{sing}$	The lth singular arc interval index set

Appendix A. Detailed Numerical Results

Appendix A.1. Problem Discretization Details

For generating the performance and box plots in Section 6, we applied Algorithm 1 on the following discretized problems:

“Lotka–Volterra (absolute fishing variant)”:
$N = 12, 000, M \in {25, 50, 75, 80, 100, 120, 150, 160, 200}$ , $σ_{\max} \in {10, 20, \infty}$ ,
“Quadrotor (binary variant)”:
$N = 12, 000, M \in {25, 50, 60, 80, 100, 150, 200, 300}$ , $σ_{\max} \in {4, 10, 20, \infty}$ ;
“Lotka–Volterra (terminal constraint violation)”:
$N = 12, 000, M \in {20, 30, 40, 50, 60, 100, 120, 200, 240, 300, 400, 600}$ ,
$σ_{\max} \in {4, 10, 20, \infty}$ ;
“F-8 aircraft (AMPL variant)”:
$N = 6000, M \in {30, 40, 50, 60, 100, 120, 150, 200, 240, 300, 400, 500}$ ;
“Egerstedt standard problem”:
$N = 6000, M \in {20, 30, 40, 60, 100, 120, 150, 200, 240, 300}$ ;
“Double Tank”:
$N = 18, 000, M \in {25, 50, 100, 180, 250, 300, 360, 720}$ ;
“Double Tank multimode”:
$N = 12, 000, M \in {20, 25, 50, 100, 200, 250, 300, 400, 600}$ ; $σ_{\max} \in {10, 20, \infty}$ ,
“Lotka–Volterra fishing problem”:
$N = 12, 000, M \in {20, 30, 40, 60, 100, 120, 200, 300, 400, 600}$ ;
“Lotka–Volterra multi-arcs problem”:
$N = 18, 000, M \in {25, 50, 100, 150, 200, 250, 300, 400, 600}$ ;
“Lotka–Volterra multimode problem”:
$N = 12, 000, M \in {25, 50, 100, 150, 200, 250, 300, 400, 800}$ ;
“Van der Pol Oscillator (binary variant)”:
$N = 6000, M \in {20, 30, 40, 50, 60, 100, 120, 150, 200, 300}$ ;
“D’Onofrio chemotherapy model”:
Scenario 1,2, and 3 with $N = 6000, M \in {20, 30, 40, 50, 60, 100, 120, 150, 200, 300}$ ;
only $M \in {20, 30, 60, 120}$ for scenario 1, $M = 100$ for scenario 2, and $M \in {40, 100}$ for scenario 3 resulted in feasible relaxed solutions and were included;
“Catalyst Mixing problem”:
$N = 3000, M \in {10, 15, 20, 30, 50, 60, 75, 100, 120, 150}$ .

Appendix A.2. Average Performance Indicators and Individual Problem Results

Table A1. Comparison of mean values and standard deviation (

σ

) of objective deviation, switching values, and runtime for different approaches. Objective deviation is given in percentages compared to relaxed objective, and runtime describes elapsed real time.

Table A1. Comparison of mean values and standard deviation (

σ

) of objective deviation, switching values, and runtime for different approaches. Objective deviation is given in percentages compared to relaxed objective, and runtime describes elapsed real time.

Approach	Obj. dev (%)	Switches (#)	Runtime (s)	$σ$ (Obj. dev)	$σ$ (Switches)	$σ$ (Runtime)
CIAmax	27.32	40.08	8.84	95.91	40.16	29.60
CIA1	27.08	39.54	106.11	93.60	38.99	385.77
SCIAmax	17.13	31.12	12.38	38.06	31.54	42.35
SCIA1	23.71	30.94	78.17	96.18	29.91	317.55
$λ$ CIA1	47.51	28.08	54.91	139.97	45.10	290.47
CIAmaxB	32.37	40.41	19.15	110.28	40.10	166.22
GreedyTime	2.06	33.36	106.26	4.27	34.47	133.61
GreedyTimeB	2.68	33.61	103.05	5.11	33.64	131.84
Greedy-Cost-to-go	2.01	34.05	117.24	4.24	34.09	172.88
ArcRecombination	6.53	35.34	11.26	13.55	37.01	37.12

Table A2. Results for the Lotka–Volterra multimode problem with N = 12,000 and varying M. The tables list objective values, differences to relaxed objective, number of switches, and runtime in seconds.

	(CIAmax)				(CIA1)
M	Obj.	Diff. to rel.	S (#)	R (s)	Obj.	Diff. to rel.	S (#)	R (s)
25	1.84519	0.00920032	6	0.419997	1.84519	0.00920032	6	0.323492
50	1.83353	0.00189968	9	0.498163	1.83353	0.00189968	9	0.526022
100	1.83458	0.00470921	15	0.564993	1.83458	0.00470921	15	0.849123
150	1.83049	0.00129738	20	0.979946	1.83058	0.00138375	20	3.37327
200	1.8294	0.000412465	23	0.983907	1.8294	0.000412465	23	9.61383
250	1.82887	8.52473 $\times 10^{- 4}$	30	2.01582	1.82887	8.52473 $\times 10^{- 4}$	30	6.84566
300	1.82884	2.1597 $\times 10^{- 4}$	33	1.87382	1.82884	2.1597 $\times 10^{- 4}$	33	27.4496
400	1.82879	3.40892 $\times 10^{- 4}$	47	3.42292	1.82879	3.40892 $\times 10^{- 4}$	47	45.0224
800	1.82875	2.58672 $\times 10^{- 4}$	87	66.8739	1.82875	2.58672 $\times 10^{- 4}$	87	484.285
	(SCIAmax)				(SCIA1)
25	1.84519	0.00920032	6	0.313487	1.84519	0.00920032	6	0.925208
50	1.83399	0.00235793	8	0.493533	1.83399	0.00235793	8	0.723298
100	1.91199	0.0821278	16	1.05474	1.91199	0.0821278	16	3.62938
150	1.8834	0.0542079	20	2.84568	1.8834	0.0542079	20	9.7413
200	1.86972	0.0407389	25	7.90383	1.86972	0.0407389	25	36.7948
250	1.82887	8.52473 $\times 10^{- 4}$	30	11.6632	1.82887	8.5189 $\times 10^{- 4}$	30	65.8286
300	1.82887	4.80446 $\times 10^{- 4}$	32	8.75161	1.82887	4.80449 $\times 10^{- 4}$	32	55.7173
400	1.82877	1.94567 $\times 10^{- 4}$	47	30.4913	1.82877	1.94577 $\times 10^{- 4}$	47	188.408
800	1.83859	0.00987316	88	233.701	1.8381	0.00937638	89	1479.19
	( $λ$ CIA1)				(CIAmaxB)
25	1.84543	0.0094458	5	0.443169	1.87559	0.0395975	7	0.511785
50	1.84372	0.0120927	5	0.581385	1.84076	0.00912746	9	0.574335
100	1.8533	0.0234329	16	0.839147	1.8347	0.00483784	15	0.735999
150	1.85038	0.0211798	23	2.50497	1.83041	0.00121583	19	0.840867
200	1.83509	0.00610253	30	2.52277	1.82932	0.000336555	25	1.53587
250	1.8289	0.000119317	25	13.3181	1.82894	0.000159443	31	2.02886
300	1.8553	0.0264818	29	6.28425	1.82887	5.56473 × 10⁻⁵	35	2.94341
400	2.07161	0.242853	128	12.5695	1.82878	2.53493 $\times 10^{- 4}$	47	5.77022
800	3.44174	1.61302	420	18.8157	1.82875	3.20174 $\times 10^{- 4}$	89	34.2567
	GreedyTime				GreedyTimeBackward
25	1.84519	0.00920032	6	3.81442	1.84519	0.00920032	6	4.55248
50	1.83353	0.00189968	9	14.398	1.83353	0.00189968	9	14.5728
100	1.83059	0.000723242	15	16.5069	1.83117	0.00130419	13	16.7239
150	1.82956	0.000364781	19	63.9035	1.83	0.000802598	20	60.1375
200	1.82931	0.000326273	24	52.5325	1.82932	0.000336555	25	53.6014
250	1.82887	8.52473 $\times 10^{- 4}$	30	25.9954	1.82887	8.52473 $\times 10^{- 4}$	30	25.6038
300	1.82884	2.1597 $\times 10^{- 4}$	33	81.1383	1.82884	2.1597 $\times 10^{- 4}$	33	82.31
400	1.82877	1.94567 $\times 10^{- 4}$	47	217.31	1.82877	1.94567 $\times 10^{- 4}$	47	179.64
800	1.82874	2.35655 $\times 10^{- 4}$	87	553.42	1.82874	2.3582 $\times 10^{- 4}$	87	605.012
	ArcRecombination				Greedy-Cost-to-Go
25	1.84519	0.00920032	6	0.6978	1.84519	0.00920032	6	3.89079
50	1.83353	0.00189968	9	0.5322	1.83353	0.00189968	9	14.4531
100	1.83458	0.00470907	15	0.3819	1.83318	0.00331505	17	27.2192
150	1.83041	0.00121583	19	0.8785	1.82965	0.00045483	17	67.8054
200	1.82932	0.000336555	25	0.6278	1.82931	0.000326273	24	60.569
250	1.82887	8.52473 $\times 10^{- 4}$	30	0.6946	1.82887	8.52473 $\times 10^{- 4}$	30	25.6708
300	1.82884	2.1597 $\times 10^{- 4}$	33	0.9826	1.82884	2.1597 $\times 10^{- 4}$	33	103.187
400	1.82877	1.94567 $\times 10^{- 4}$	47	0.5933	1.82877	1.94567 $\times 10^{- 4}$	47	302.851
800	1.82874	2.3582 $\times 10^{- 4}$	87	0.7660	1.82874	2.35652 $\times 10^{- 4}$	87	1166.96

References

Egerstedt, M.; Wardi, Y.; Delmotte, F. Optimal Control of Switching Times in Switched Dynamical Systems. In Proceedings of the 42nd IEEE Concference of Decision and Control, Maui, HI, USA, 9–12 December 2003. [Google Scholar]
Seatzu, C.; Corona, D.; Giua, A.; Bemporad, A. Optimal control of continuous-time switched affine systems. IEEE Trans. Autom. Control. 2006, 51, 726–741. [Google Scholar] [CrossRef]
Buss, M.; Glocker, M.; Hardt, M.; Stryk, O.V.; Bulirsch, R.; Schmidt, G. Nonlinear Hybrid Dynamical Systems: Modelling, Optimal Control, and Applications; Springer: Berlin/Heidelberg, Germany, 2002; Volume 279, pp. 311–335. [Google Scholar]
Goebel, R.; Sanfelice, R.G.; Teel, A.R. Hybrid dynamical systems. IEEE Control. Syst. 2009, 29, 28–93. [Google Scholar] [CrossRef]
Burgschweiger, J.; Gnädig, B.; Steinbach, M. Nonlinear Programming Techniques for Operative Planning in Large Drinking Water Networks. Open Appl. Math. J. 2009, 3, 1–16. [Google Scholar] [CrossRef]
Doban, A.I.; Lazar, M. A switched systems approach to cancer therapy. In Proceedings of the 2015 European Control Conference (ECC), Linz, Austria, 15–17 July 2015; pp. 2718–2724. [Google Scholar]
Koch, T.; Hiller, B.; Pfetsch, M.E.; Schewe, L. (Eds.) Evaluating Gas Network Capacities; SIAM-MOS Series on Optimization; SIAM: Philadelphia, PA, USA, 2015; Available online: https://archive.siam.org/books/mo21/mo21_toc.pdf (accessed on 15 February 2022).
Gugat, M.; Herty, M.; Klar, A.; Leugering, G. Optimal Control for Traffic Flow Networks. J. Optim. Theory Appl. 2005, 126, 589–616. [Google Scholar] [CrossRef]
Fügenschuh, A.; Herty, M.; Klar, A.; Martin, A. Combinatorial and Continuous Models for the Optimization of Traffic Flows on Networks. SIAM J. Optim. 2006, 16, 1155–1176. [Google Scholar] [CrossRef]
Göttlich, S.; Potschka, A.; Ziegler, U. Partial Outer Convexification for Traffic Light Optimization in Road Networks. SIAM J. Sci. Comput. 2017, 39, B53–B75. [Google Scholar] [CrossRef] [Green Version]
Göttlich, S.; Herty, M.; Kirchner, C.; Klar, A. Optimal control for continuous supply network models. Netw. Heterog. Media 2007, 1, 675–688. [Google Scholar]
Abichandani, P.; Benson, H.; Kam, M. Multi-vehicle path coordination under communication constraints. In Proceedings of the American Control Conference, Seattle, WA, USA, 11–13 June 2008; pp. 650–656. [Google Scholar] [CrossRef] [Green Version]
Kawajiri, Y.; Biegler, L. A Nonlinear Programming Superstructure for Optimal Dynamic Operations of Simulated Moving Bed Processes. Ind. Eng. Chem. Res. 2006, 45, 8503–8513. [Google Scholar] [CrossRef]
Sonntag, C.; Stursberg, O.; Engell, S. Dynamic Optimization of an Industrial Evaporator using Graph Search with Embedded Nonlinear Programming. IFAC Proc. Vol. 2006, 39, 211–216. [Google Scholar] [CrossRef]
Zeile, C.; Rauwolf, T.; Schmeisser, A.; Mizerski, J.K.; Braun-Dullaeus, R.C.; Sager, S. An Intra-Cycle Optimal Control Framework for Ventricular Assist Devices Based on Atrioventricular Plane Displacement Modeling. Ann. Biomed. Eng. 2021, 49, 3508–3523. [Google Scholar] [CrossRef]
Gerdts, M. A variable time transformation method for mixed-integer optimal control problems. Optim. Control. Appl. Methods 2006, 27, 169–182. [Google Scholar] [CrossRef]
Robuschi, N.; Zeile, C.; Sager, S.; Braghin, F. Multiphase mixed-integer nonlinear optimal control of hybrid electric vehicles. Automatica 2021, 123, 109325. [Google Scholar] [CrossRef]
Sager, S. A benchmark library of mixed-integer optimal control problems. In Mixed Integer Nonlinear Programming; Lee, J., Leyffer, S., Eds.; Springer: Berlin/Heidelberg, Germany, 2012; pp. 631–670. [Google Scholar]
Gerdts, M.; Sager, S. Mixed-Integer DAE Optimal Control Problems: Necessary conditions and bounds. In Control and Optimization with Differential-Algebraic Constraints; Biegler, L., Campbell, S., Mehrmann, V., Eds.; SIAM: Philadelphia, PA, USA, 2012; pp. 189–212. [Google Scholar]
Hellström, E.; Ivarsson, M.; Aslund, J.; Nielsen, L. Look-ahead control for heavy trucks to minimize trip time and fuel consumption. Control. Eng. Pract. 2009, 17, 245–254. [Google Scholar] [CrossRef] [Green Version]
Lee, H.; Teo, K.; Jennings, L.; Rehbock, V. Control Parametrization Enhancing Technique for Optimal Discrete-Valued Control Problems. Automatica 1999, 35, 1401–1407. [Google Scholar] [CrossRef]
Stellato, B.; Ober-Blöbaum, S.; Goulart, P.J. Second-Order Switching Time Optimization for Switched Dynamical Systems. IEEE Trans. Autom. Control. 2017, 62, 5407–5414. [Google Scholar] [CrossRef] [Green Version]
Till, J.; Engell, S.; Panek, S.; Stursberg, O. Applied Hybrid System Optimization: An Empirical Investigation of Complexity. Control. Eng. Pract. 2004, 12, 1291–1303. [Google Scholar] [CrossRef]
Ringkamp, M.; Ober-Blöbaum, S.; Leyendecker, S. On the time transformation of mixed integer optimal control problems using a consistent fixed integer control function. Math. Program. 2017, 161, 551–581. [Google Scholar] [CrossRef]
Sager, S.; Tetschke, M.; Zeile, C. A Numerical Study of Transformed Mixed-Integer Optimal Control Problems. Technical Report. 2022. Available online: http://www.optimization-online.org/DB_FILE/2020/03/7698.pdf (accessed on 15 February 2022).
Gerdts, M. Solving mixed-integer optimal control problems by Branch&Bound: A case study from automobile test-driving with gear shift. Optim. Control. Appl. Methods 2005, 26, 1–18. [Google Scholar]
Sager, S. Numerical Methods for Mixed–Integer Optimal Control Problems; Der Andere Verlag: Marburg, Germany, 2005; ISBN 3-89959-416-9. [Google Scholar]
Sager, S.; Claeys, M.; Messine, F. Efficient upper and lower bounds for global mixed-integer optimal control. J. Glob. Optim. 2015, 61, 721–743. [Google Scholar] [CrossRef]
Sager, S.; Reinelt, G.; Bock, H. Direct Methods With Maximal Lower Bound for Mixed-Integer Optimal Control Problems. Math. Program. 2009, 118, 109–149. [Google Scholar] [CrossRef]
Zhu, F.; Antsaklis, P.J. Optimal control of hybrid switched systems: A brief survey. Discret. Event Dyn. Syst. 2015, 25, 345–364. [Google Scholar] [CrossRef]
Burgschweiger, J.; Gnädig, B.; Steinbach, M. Optimization Models for Operative Planning in Drinking Water Networks; Technical Report ZR-04-48. ZIB, 2004. Available online: file:///C:/Users/MDPI/Downloads/ZR-04-48.pdf (accessed on 15 February 2022).
Sager, S.; Jung, M.; Kirches, C. Combinatorial Integral Approximation. Math. Methods Oper. Res. 2011, 73, 363–380. [Google Scholar] [CrossRef] [Green Version]
Stein, O. Error bounds for mixed integer nonlinear optimization problems. Optim. Lett. 2016, 10, 1153–1168. [Google Scholar] [CrossRef]
Belotti, P.; Kirches, C.; Leyffer, S.; Linderoth, J.; Luedtke, J.; Mahajan, A. Mixed-Integer Nonlinear Optimization. In Acta Numerica; Iserles, A., Ed.; Cambridge University Press: Cambridge, UK, 2013; Volume 22, pp. 1–131. [Google Scholar] [CrossRef] [Green Version]
Kirches, C.; Lenders, F.; Manns, P. Approximation Properties and Tight Bounds for Constrained Mixed-Integer Optimal Control. SIAM J. Control. Optim. 2016, 58, 1371–1402. [Google Scholar] [CrossRef]
Hante, F.; Sager, S. Relaxation Methods for Mixed-Integer Optimal Control of Partial Differential Equations. Comput. Optim. Appl. 2013, 55, 197–225. [Google Scholar] [CrossRef] [Green Version]
Hante, F.M. Relaxation methods for hyperbolic PDE mixed-integer optimal control problems. Optim. Control. Appl. Methods 2017, 38, 1103–1110. [Google Scholar] [CrossRef] [Green Version]
Bürger, A.; Zeile, C.; Hahn, M.; Altmann-Dieses, A.; Sager, S.; Diehl, M. pycombina: An open-source tool for solving combinatorial approximation problems arising in mixed-integer optimal control. IFAC-PapersOnLine 2020, 53, 6502–6508. [Google Scholar] [CrossRef]
Biegler, L. Nonlinear Programming: Concepts, Algorithms, and Applications to Chemical Processes; Series on Optimization; SIAM: Philadelphia, PA, USA, 2010. [Google Scholar]
Gerdts, M. Optimal Control of ODEs and DAEs; De Gruyter: Berlin, Germany, 2012. [Google Scholar]
Sager, S.; Bock, H.; Diehl, M. The Integer Approximation Error in Mixed-Integer Optimal Control. Math. Program. A 2012, 133, 1–23. [Google Scholar] [CrossRef]
Bestehorn, F.; Hansknecht, C.; Kirches, C.; Manns, P. Switching Cost Aware Rounding for Relaxations of Mixed-Integer Optimal Control Problems: The 2-D Case. IEEE Control. Syst. Lett. 2021, 6, 548–553. [Google Scholar] [CrossRef]
Bestehorn, F.; Hansknecht, C.; Kirches, C.; Manns, P. Mixed-integer optimal control problems with switching costs: A shortest path approach. Math. Program. 2021, 188, 621–652. [Google Scholar] [CrossRef]
Manns, P. Relaxed multibang regularization for the combinatorial integral approximation. SIAM J. Control. Optim. 2021, 59, 2645–2668. [Google Scholar] [CrossRef]
Leyffer, S.; Manns, P. Sequential Linear Integer Programming for Integer Optimal Control with Total Variation Regularization. Technical Report. 2021. Available online: https://arxiv.org/pdf/2106.13453.pdf (accessed on 15 February 2022).
Buchheim, C.; Kuhlmann, R.; Meyer, C. Combinatorial optimal control of semilinear elliptic PDEs. Comput. Optim. Appl. 2018, 70, 641–675. [Google Scholar] [CrossRef]
Göttlich, S.; Hante, F.M.; Potschka, A.; Schewe, L. Penalty alternating direction methods for mixed-integer optimal control with combinatorial constraints. Math. Program. 2021, 188, 599–619. [Google Scholar] [CrossRef]
De Marchi, A. On the Mixed-Integer Linear-Quadratic Optimal Control With Switching Cost. IEEE Control. Syst. Lett. 2019, 3, 990–995. [Google Scholar] [CrossRef]
Sager, S. Reformulations and Algorithms for the Optimization of Switching Decisions in Nonlinear Optimal Control. J. Process. Control. 2009, 19, 1238–1247. [Google Scholar] [CrossRef] [Green Version]
Zeile, C. Combinatorial Integral Decompositions for Mixed-Integer Optimal Control. Ph.D. Thesis, Otto–von–Guericke–Universität Magdeburg, Magdeburg, Germany, 2021. [Google Scholar]
Zeile, C.; Robuschi, N.; Sager, S. Mixed-integer optimal control under minimum dwell time constraints. Math. Program. 2021, 188, 653–694. [Google Scholar] [CrossRef]
Sager, S.; Zeile, C. On mixed-integer optimal control with constrained total variation of the integer control. Comput. Optim. Appl. 2021, 78, 575–623. [Google Scholar] [CrossRef]
Manns, P.; Kirches, C. Improved Regularity Assumptions for Partial Outer Convexification of Mixed-Integer PDE-Constrained Optimization Problems. ESAIM Control. Optim. Calc. Var. 2020, 26, 32. [Google Scholar] [CrossRef] [Green Version]
Goldberg, D.E. Genetic Algorithms in Search, Optimization, and Machine Learning; Addison-Wesley Longman Publishing Co., Inc.: Boston, MA, USA, 1989. [Google Scholar]
Fourer, R.; Gay, D.; Kernighan, B. AMPL: A Modeling Language for Mathematical Programming; Duxbury Press, 2002. Available online: https://vanderbei.princeton.edu/307/textbook/AMPLbook.pdf (accessed on 15 February 2022).
Buerger, A.; Zeile, C.; Altmann-Dieses, A.; Sager, S.; Diehl, M. Design, Implementation and Simulation of an MPC algorithm for Switched Nonlinear Systems under Combinatorial Constraints. Process. Control. 2019, 81, 15–30. [Google Scholar] [CrossRef]

Figure 1. Example visualization of the GreedyTime algorithm. We use two candidate control solutions, here from CIA and SCIA, to construct new candidates. We perform an enumeration between 0 and 1 at all times

t_{j}

when the input vectors differ. Then, the two candidate solutions

w

are fixed and we evaluate (MIOCP) for both vectors. We compare the resulting objective function values with their previous values. Moreover, the binary

w_{j}

values with the lower objective values are fixed in the candidate solutions. This procedure is repeated on the next grid point with unequal candidate solutions.

Figure 1. Example visualization of the GreedyTime algorithm. We use two candidate control solutions, here from CIA and SCIA, to construct new candidates. We perform an enumeration between 0 and 1 at all times

t_{j}

when the input vectors differ. Then, the two candidate solutions

w

are fixed and we evaluate (MIOCP) for both vectors. We compare the resulting objective function values with their previous values. Moreover, the binary

w_{j}

values with the lower objective values are fixed in the candidate solutions. This procedure is repeated on the next grid point with unequal candidate solutions.

Figure 2. Visualization of the singular arc block recombination heuristic for two MILP control vectors (which we name CIA and SCIA) with three singular arcs. Every possible variation from the singular arcs and candidate controls is generated and we evaluate (MIOCP) for each of the constructed variation. The minimal objective value of all variations represents the heuristic’s result.

Figure 3. Performance profile comparing the deviation of differential states based on SCIA and CIA solutions. Relaxed solutions are shown in maximum norm and log-scale. The results are based on the instances “Double tank (Multimode)” and “Lotka–Volterra (absolute fishing variant)” from the mintoc.de benchmark library. Using (SCIA1) or (SCIAmax) can improve the performance of the CIA decomposition significantly.

Figure 4. Performance profile comparing objective deviation from the relaxed solution in percentage and log-scale of

λ

-CIA and CIA solutions. The results are based on the instance “Quadrotor (binary variant)” from the mintoc.de benchmark library. (

λ

CIA) appeared to provide no clear improvement compared with the (CIA) solutions.

Figure 4. Performance profile comparing objective deviation from the relaxed solution in percentage and log-scale of

λ

-CIA and CIA solutions. The results are based on the instance “Quadrotor (binary variant)” from the mintoc.de benchmark library. (

λ

CIA) appeared to provide no clear improvement compared with the (CIA) solutions.

Figure 5. Performance profile comparing objective deviation from the relaxed solution in percentage and log-scale of (CIA) and its backward variant solutions. The results are based on the instance “Lotka–Volterra (terminal constraint violation)” from the mintoc.de benchmark library. Using (CIA1B) or (CIAmaxB) can improve the performance of the CIA decomposition significantly.

Figure 6. Box plot comparing objective deviation from the relaxed solution in percentage and log-scale of several MILP (marked in blue) and recombination heuristic (marked in red) solutions. The results are based on instances from the mintoc.de benchmark library. The box borders are 1/4 and 3/4-quantiles, whereas the whiskers represent 1/20 and 19/20-quantiles. We visualize the median values by black lines in the box and additionally display them numerically above the box. We represent the average values of the respective algorithms by red asterisks and the outliers by black crosses. The boxes of recombination strategies are shifted towards lower objective values compared with (CIA) algorithms and, thus, can improve the CIA decomposition performance significantly.

Figure 7. Log plot of run time and objective value deviation from the relaxed solution of the constructed solutions for different approaches and for the Lotka–Volterra multimode problem, with differential state discretization N = 12,000. The numbers in the plot indicate the applied corresponding number of control grid points M. We illustrate the outcomes of the solutions constructed by (CIAmax),

Opt (S^{CIA})

, the GreedyTime recombination heuristic, and the MINLP solver Bonmin. By

Opt (S^{CIA})

, we denote the best objective value outcome of over all MILP solutions. For each control discretization, we connect the outcomes of the four approaches with lines in order to compare the behavior for different discretizations. One observes the convergence of all approaches towards the lower bound provided by the relaxed solution and the closure of the gap between (CIAmax) and Bonmin solutions for a fixed discretization. GreedyTime is roughly two orders of magnitude slower than (CIAmax), but is faster than Bonmin.

Figure 7. Log plot of run time and objective value deviation from the relaxed solution of the constructed solutions for different approaches and for the Lotka–Volterra multimode problem, with differential state discretization N = 12,000. The numbers in the plot indicate the applied corresponding number of control grid points M. We illustrate the outcomes of the solutions constructed by (CIAmax),

Opt (S^{CIA})

, the GreedyTime recombination heuristic, and the MINLP solver Bonmin. By

Opt (S^{CIA})

, we denote the best objective value outcome of over all MILP solutions. For each control discretization, we connect the outcomes of the four approaches with lines in order to compare the behavior for different discretizations. One observes the convergence of all approaches towards the lower bound provided by the relaxed solution and the closure of the gap between (CIAmax) and Bonmin solutions for a fixed discretization. GreedyTime is roughly two orders of magnitude slower than (CIAmax), but is faster than Bonmin.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zeile, C.; Weber, T.; Sager, S. Combinatorial Integral Approximation Decompositions for Mixed-Integer Optimal Control. Algorithms 2022, 15, 121. https://doi.org/10.3390/a15040121

AMA Style

Zeile C, Weber T, Sager S. Combinatorial Integral Approximation Decompositions for Mixed-Integer Optimal Control. Algorithms. 2022; 15(4):121. https://doi.org/10.3390/a15040121

Chicago/Turabian Style

Zeile, Clemens, Tobias Weber, and Sebastian Sager. 2022. "Combinatorial Integral Approximation Decompositions for Mixed-Integer Optimal Control" Algorithms 15, no. 4: 121. https://doi.org/10.3390/a15040121

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Combinatorial Integral Approximation Decompositions for Mixed-Integer Optimal Control

Abstract

1. Introduction

1.1. General Context

1.2. Motivation

1.3. Review of the State of the Art

1.4. Contributions

1.5. Outline of the Article

2. Problem Class, Definitions, and Main Algorithm

3. Combinatorial Integral Approximation MILPs

3.1. Combinatorial Integral Approximation and Scaled Variants

3.2. Norm Dependent MILP Formulation

3.3. Chronologically Ordered Constraints

3.4. Combinatorial Constraints

4. A Priori Bounds for CIA Decompositions

4.1. Combinatorial Integral Approximation

4.2. Scaled Combinatorial Integral Approximation

4.3. λ -Combinatorial Integral Approximation

4.4. Backwards Accumulating Constraints

4.5. Connection to Decomposition Algorithm and Optimization Problem

5. Recombination Heuristics

5.1. GreedyTime

5.2. Singular Arc Recombination

6. Computational Results

6.1. Software Implementation and Instances

6.2. Scaled Combinatorial Integral Approximation

6.3. λ -Combinatorial Integral Approximation

6.4. Backwards Accumulating Constraints

6.5. Recombination Heuristics

6.6. Runtime Evaluation

7. Summary and Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Nomenclature

Appendix A. Detailed Numerical Results

Appendix A.1. Problem Discretization Details

Appendix A.2. Average Performance Indicators and Individual Problem Results

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.3. $λ$ -Combinatorial Integral Approximation

6.3. $λ$ -Combinatorial Integral Approximation