Minirobots Moving at Different Partial Speeds

Constantin Udrişte; Ionel Ţevy

doi:10.3390/math8061036

Abstract

In this paper, we present the mathematical point of view of our research group regarding the multi-robot systems evolving in a multi-temporal way. We solve the minimum multi-time volume problem as optimal control problem for a group of planar micro-robots moving in the same direction at different partial speeds. We are motivated to solve this problem because a similar minimum-time optimal control problem is now in vogue for micro-scale and nano-scale robotic systems. Applying the (weak and strong) multi-time maximum principle, we obtain necessary conditions for optimality and that are used to guess a candidate control policy. The complexity of finding this policy for arbitrary initial conditions is dominated by the computation of a planar convex hull. We pointed this idea by applying the technique of multi-time Hamilton-Jacobi-Bellman PDE. Our results can be extended to consider obstacle avoidance by explicit parameterization of all possible optimal control policies.

Keywords:

multi-time motion planning; multi-time multi-robot systems; multi-time optimal control; multi-time Hamilton-Jacobi-Bellman PDE

MSC:

49K20; 90C46; 68T40; 93C85

1. Introduction

Our multi-time model extends the single-time case formulated and solved by T. Bretl [1,2] (see also, [3,4,5,6]). We refer to a microrobotic system consisting of n planar robots which evolve in multi-temporal sense. The control of this system is hard, at least from an algorithmic point of view. We solve the problem via a multi-time maximum optimal control problem and via the technique of multi-time Hamilton-Jacobi-Bellman PDE (see, [7,8,9,10,11,12,13]). The problem of multi-temporal evolution has many pitfalls due to the correlation between the dimension of state variables and that of evolution variables.

The microrobotic systems are intended for a wide range of applications that include microfabrication, minimally invasive medical diagnosis and treatment, adaptive optics, regenerative electronics, and biosensing for environmental monitoring and toxin detection [2].

The term “multi-time” was used for the first time by Dirac (1932) [14] to introduce “multi-time wave function” as candidate for relativistic many-particle quantum mechanics.

Section 2 formulates a multi-time optimal control problem for a system of many robots that move at different partial speeds, but that must all move in the same partial direction. Section 3 shows how we can solve the problem via the weak multi-time maximum principle. Here, the solution of the adjoint PDEs system is obtained by geometrical techniques. As it is too complicated to continue with this method, Section 4 solves the problem by the strong multi-time maximum principle. Section 5 gives a geometrical solution of our problem. Section 6 proves that the multi-time dynamic programming method permits the design of multi-time optimal controls for the problem in Section 3. Section 7 refers to originality of the subject and to the possibility of further research.

We consistently use mathematical language from multi-temporal dynamical systems and differential geometry. Particularly the Einstein convention of summation, and a short dictionary for notations in differential geometry (∧ = exterior product or wedge product of two differential forms,

δ_{α β}, δ_{β}^{α}, δ^{α β}

= Kronecker symbols, ⌟ = interior product or inner derivative) are used throughout. The tensor fields are written also via their components etc.

2. Many Robots That Move at Different Partial Speeds

The evolutive multivariate parameter

t = (t^{1}, \dots, t^{m}) \in R_{+}^{m}

is called multi-time. A multi-temporal evolution is conceived as follows: It is considered a generic hyper-parallelepiped

Ω_{0 T} \in R_{+}^{m}

determined by the diagonal opposite points

0, T \in R_{+}^{m}

. An evolution in

Ω_{0 T}

is determined by the partial order in

R_{+}^{m}

and by a positive sense of movement. A

C^{1}

curve

γ : [0, 1] \to Ω_{0 T}, t^{α} = t^{α} (τ), τ \in [0, 1]

, joining the points

γ (0) = 0

and

γ (1) = T

, is called marker of evolution in

Ω_{0 T}

if

\frac{d t^{α}}{d τ} \geq 0

(increasing curve). The simplest marker of evolution is the main diagonal

t^{α} = T^{α} τ, τ \in [0, 1]

that joins the points 0 and T.

Now let us consider a

C^{1}

function

φ : Ω_{0 T} \to R

. The evolution in

φ (Ω_{0 T})

means that the image of the function

φ

runs from the point

φ (0)

to the point

φ (T)

. The graph

(t, φ (t))

can be more suggestive, being a hypersurface in

Ω_{0 T} \times R

, running from the point

(0, φ (0))

to the point

(T, φ (T))

. The normal vector field to this hypersurface is

(\nabla φ, - 1)

. The marker of evolution in

Ω_{0 T}

induces a marker of evolution in the image

φ (Ω_{0 T})

, if

⟨ \nabla φ, \frac{d t}{d τ} ⟩ \geq 0

(acute angle), and more suggestive, a marker of evolution on the hypersurface

(t, φ (t))

.

To study the multi-temporal evolution of micro-scale and nano-scale robotic systems we must create a controlled m-flow evolution, an elapsed volume functional and a minimum type problem. We underline that the initial positions of the robots are given, and the goal is to bring them to the origin, minimizing the elapsed time volume. The solution

(x (t), y (t))

of a controlled completely integrable system takes the place of evolutionary function

φ (t)

.

If we leave the multi-time T free, then for n planar robots the following problem of multi-time optimal control appears: Let

(x, y) = (x^{1}, y^{1}, \dots, x^{n}, y^{n}) \in {(R^{2})}^{n}

be the state variables (one pair

(x^{i}, y^{i})

means one robot) and

u \in R

,

v = (v_{α}^{i}) \in R^{m n}, α = 1, \dots, m; i = 1, \dots, n

be the controls (inputs). The main goal is to find

min_{u, v} I (u (\cdot), v (\cdot)) = \int_{Ω_{0 T}} d t^{1} \land \dots \land d t^{m}

subject to

\frac{\partial}{\partial t^{α}} (\begin{matrix} x \\ y \end{matrix}) (t) = v_{α} (t) (\begin{matrix} cos u (t) \\ sin u (t) \end{matrix}), t \in Ω_{0 T}, | v_{α}^{i} (t) | \leq 1,

x (0) = x_{0}, y (0) = y_{0}, x (T) = 0, y (T) = 0 .

The previous controlled PDEs can be written

\frac{\partial x^{i}}{\partial t^{α}} (t) = v_{α}^{i} (t) cos u (t), \frac{\partial y^{i}}{\partial t^{α}} (t) = v_{α}^{i} (t) sin u (t) .

If

rank (v_{α}^{i} (t)) = m \leq n

, then the

2 m

vector fields

X_{α}^{i} (t) = v_{α}^{i} (t) cos u (t)

,

Y_{α}^{i} (t) = v_{α}^{i} (t) sin u (t)

,

α = 1, \dots, m

, are linearly independent.

For each robot

(x^{i}, y^{i})

, it appears the square of speed

δ^{α β} \frac{\partial x^{i}}{\partial t^{α}} \frac{\partial x^{i}}{\partial t^{β}} + δ^{α β} \frac{\partial y^{i}}{\partial t^{α}} \frac{\partial y^{i}}{\partial t^{β}} = δ^{α β} v_{α}^{i} v_{β}^{i}, i = 1, \dots, n .

Consequently, the group of n robots move in a planar workspace at different (although bounded) speeds, but that must all move in the same partial direction fixed by the unit vector

(cos u (t), sin u (t))

. In fact, the speeds

\sqrt{δ^{α β} v_{α}^{i} v_{β}^{i}}, i = 1, \dots, n

and the direction

(cos u (t), sin u (t))

are the only physically observable measures.

The complete integrability conditions of this PDE system are

\frac{\partial v_{α}^{i}}{\partial t^{β}} (t) - \frac{\partial v_{β}^{i}}{\partial t^{α}} (t) = 0, v_{α}^{i} (t) \frac{\partial u}{\partial t^{β}} (t) - v_{β}^{i} (t) \frac{\partial u}{\partial t^{α}} (t) = 0 .

It follows the piecewise general solution

v_{α}^{i} (t) = \frac{\partial φ^{i}}{\partial t^{α}} (t) = λ^{i} (u (t)) \frac{\partial u}{\partial t^{α}} (t) .

Remark 1.

(i) The quadruple

(x (t), y (t), u (t), v (t))

constitutes an admissible m-mapping if it has the following properties: (1)

(u (t), v (t))

is a measurable function from

Ω_{0 T}

to

U \times V

; (2) for

t \in Ω_{0 T}

,

x^{i} (t) = x_{0}^{i} + \int_{Γ_{0 t}} cos u (s) v_{α}^{i} (s) d s^{α}

,

y^{i} (t) = y_{0}^{i} + \int_{Γ_{0 t}} sin u (s) v_{α}^{i} (s) d s^{α}

(path independent curvilinear integrals), (3)

(x (T), y (T)) \in X_{T} \times Y_{T}

(compact set). Please note that the second property implies that

(x (t), y (t))

is differentiable almost everywhere as a function of multi-time t, satisfying the previous PDE system for almost all

t \in Ω_{0 T}

.

(ii) If the previous PDE system is not completely integrable, we can formulate and solve a similar problem using the nonholonomic evolution

d x^{i} = cos u (t) v_{α}^{i} (t) d t^{α}, d y^{i} = sin u (t) v_{α}^{i} (t) d t^{α}

.

Because of periodicity of sine and cosine, we can take

u \in [- π, π]

. Also, we can restrict

u \in [0, π)

without loss of generality.

We solve the foregoing problem using the multi-time maximum principle (see, [7,8,9,10,11,12,13]). We introduce the Lagrange multipliers

p (t) = (p_{i}^{α} (t)), q (t) = (q_{i}^{α} (t))

, the Hamiltonian

H (x, y, u, v, p, q) = - 1 + (p_{i}^{α} cos u + q_{i}^{α} sin u) v_{α}^{i}

and its anti-trace

H_{β}^{α} (x, y, u, v, p, q) = - \frac{1}{m} δ_{β}^{α} + (p_{i}^{α} cos u + q_{i}^{α} sin u) v_{β}^{i},

called the control Hamiltonian tensor field.

3. Solution via Weak Multi-Time Maximum Principle

According the weak multi-time maximum principle [7] (coming from variational calculus techniques), along any optimal sheet

(x^{*}, y^{*}, u^{*}, v^{*}, p^{*}, q^{*}),

we must have

\frac{\partial p_{i}^{* α}}{\partial t^{α}} = - \frac{\partial H}{\partial x^{i}} (x^{*}, y^{*}, u^{*}, v^{*}, p^{*}, q^{*}), \frac{\partial q_{i}^{* α}}{\partial t^{α}} = - \frac{\partial H}{\partial y^{i}} (x^{*}, y^{*}, u^{*}, v^{*}, p^{*}, q^{*})

H (x^{*}, y^{*}, u^{*}, v^{*}, p^{*}, q^{*}) = max_{u, v} H (x^{*}, y^{*}, u, v, p^{*}, q^{*})

.

Due to the fact that this Hamiltonian is a linear function with respect to v, its extremum point cannot be interior. Moreover, we have

max_{u, v} H (x^{*}, y^{*}, u, v, p^{*}, q^{*}) = max_{u} max_{v} H (x^{*}, y^{*}, u, v, p^{*}, q^{*})

= max_{v} max_{u} H (x^{*}, y^{*}, u, v, p^{*}, q^{*}) .

Solving the Adjoint PDEs System

Since this Hamiltonian has no dependence on the state vector variables

(x, y)

, the adjoint PDEs are of divergence form

\frac{\partial p_{i}^{* α}}{\partial t^{α}} = 0, \frac{\partial q_{i}^{* α}}{\partial t^{α}} = 0, i = 1 \dots, n .

To find the general solution of this adjoint divergence PDEs system, we recall some facts from differential geometry [15] about closed and exact forms.

An r-form

ω

is called closed if

d ω = 0

. We say that

ω

is exact if there exists an

(r - 1)

-form

η

such that

d η = ω

.

To characterize situations in which closed forms are also exact, we call a famous.

Theorem 1

(The Poincaré Lemma). Let U be a contractible domain in

R^{n}

. If ω is a closed r-form, then there exists an

(r - 1)

-form η such that

d η = ω

. In other words, all closed differential r-forms on contractible domains are exact.

In particular, if

ω

is a closed r-form on

R^{n}

, then it is exact.

The m-form (volume form)

ω = d t^{1} \land \dots \land d t^{m}

and the vector fields

\frac{\partial}{\partial t^{α}}

produce (see the inner derivative) the

(m - 1)

-forms

ω_{α} = \frac{\partial}{\partial t^{α}} ⌟ ω

and the

(m - 2)

-forms

ω_{β α} = \frac{\partial}{\partial t^{β}} ω_{α}

. These satisfy

d t^{γ} \land ω_{α} = δ_{α}^{γ} ω, d t^{γ} \land ω_{α β} = δ_{α}^{γ} ω_{β} - δ_{β}^{γ} ω_{α} .

Now, the Lagrange multipliers

p, q

are the m-forms

p = p_{i}^{α} ω_{α} \land d x^{i}, q = q_{i}^{α} ω_{α} \land d y^{i} .

As solutions of the adjoint PDEs, they are closed, i.e.,

d p = \frac{\partial p_{i}^{α}}{\partial t^{γ}} d t^{γ} \land ω_{α} \land d x^{i} = \frac{\partial p_{i}^{α}}{\partial t^{α}} ω \land d x^{i} = 0,

d q = \frac{\partial q_{i}^{α}}{\partial t^{γ}} d t^{γ} \land ω_{α} \land d y^{i} = \frac{\partial q_{i}^{α}}{\partial t^{α}} ω \land d y^{i} = 0 .

According the Poincaré Lemma, there exist two

(m - 1)

-forms

η = N_{i}^{α β} ω_{α β} \land d x^{i}, μ = M_{i}^{α β} ω_{α β} \land d y^{i}

such that

p = d η = \frac{\partial N_{i}^{α β}}{\partial t^{γ}} d t^{γ} \land ω_{α β} \land d x^{i} = \frac{\partial}{\partial t^{α}} (N_{i}^{α β} - N_{i}^{β α}) ω_{β} \land d x^{i},

q = d μ = \frac{\partial M_{i}^{α β}}{\partial t^{γ}} d t^{γ} \land ω_{α β} \land d y^{i} = \frac{\partial}{\partial t^{α}} (M_{i}^{α β} - M_{i}^{β α}) ω_{β} \land d y^{i} .

It follows that the solution of the adjoint system is

p_{i}^{β} (t) = \frac{\partial}{\partial t^{α}} (N_{i}^{α β} - N_{i}^{β α}) (t), q_{i}^{β} (t) = \frac{\partial}{\partial t^{α}} (M_{i}^{α β} - M_{i}^{β α}) (t) .

On the other hand, the strong multi-time maximum principle actually shows that the particular solution

p_{i}^{β} (t) = const

,

q_{i}^{β} (t) = const

is sufficient to obtain the complete solution of our problem.

4. Solution via Strong Multi-Time Maximum Principle

According the strong multi-time maximum principle [11] (coming from m-needle techniques), along any optimal sheet

(x^{*}, y^{*}, u^{*}, v^{*}, p^{*}, q^{*}),

we must have

\frac{\partial p_{i}^{* α}}{\partial t^{β}} = - \frac{\partial H_{β}^{α}}{\partial x^{i}} (x^{*}, y^{*}, u^{*}, v^{*}, p^{*}, q^{*}), \frac{\partial q_{i}^{* α}}{\partial t^{β}} = - \frac{\partial H_{β}^{α}}{\partial y^{i}} (x^{*}, y^{*}, u^{*}, v^{*}, p^{*}, q^{*})

H (x^{*}, y^{*}, u^{*}, v^{*}, p^{*}, q^{*}) = max_{u, v} H (x^{*}, y^{*}, u, v, p^{*}, q^{*})

.

Also, the function

t \to H (x^{*} (t), y^{*} (t), u^{*} (t), v^{*} (t), p^{*} (t), q^{*} (t))

is constant.

Since

v \to H (x^{*}, y^{*}, u, v, p^{*}, q^{*})

is a linear function with respect to v, the Hamiltonian

H (x^{*}, y^{*}, u, v, p^{*}, q^{*})

has no interior extremum point. Also, we have

max_{u, v} H (x^{*}, y^{*}, u, v, p^{*}, q^{*}) = max_{u} max_{v} H (x^{*}, y^{*}, u, v, p^{*}, q^{*})

= max_{v} max_{u} H (x^{*}, y^{*}, u, v, p^{*}, q^{*}) .

4.1. Solving the Adjoint PDEs System

Since the control Hamiltonian tensor field has no dependence on the state

(x, y)

, the adjoint PDEs reduce to

\frac{\partial p_{i}^{* α}}{\partial t^{β}} = 0, \frac{\partial q_{i}^{* α}}{\partial t^{β}} = 0,

with the piecewise constant solution

p_{i}^{* α} (t) = p_{i}^{α}, q_{i}^{* α} (t) = q_{i}^{α} .

4.2. Finding the Maximum with Respect to v

To prove the existence of a bang-bang control v, we use the following steps.

Lemma 1.

The maximum of the Hamiltonian

H (x, y, u, v, p, q)

with respect to the control v is

H (x, y, u, v^{*}, p, q) = - 1 + \sum_{α = 1}^{m} \sum_{i = 1}^{n} | p_{i}^{α} cos u + q_{i}^{α} sin u | .

(13)

Proof.

The inputs

v_{α}^{i}

belong to the control set

V = {[- 1, 1]}^{m n} \subset R^{m n}

. The maximum of the linear function

v \to H

exists since each control variable

v_{α}^{i}

belongs to the interval

[- 1, 1]

; for maximum, the control must be at a vertex of

\partial V

(see, linear optimization, simplex method). If

Q_{i}^{α} (t) = p_{i}^{α} cos u (t) + q_{i}^{α} sin u (t)

are the switching functions, then each optimal control

v_{α}^{* i}

must be the function

v_{α}^{* i} = sign Q_{i}^{α} (t) = \{\begin{matrix} 1 for Q_{i}^{α} (t) > 0 : bang - bang control \\ undetermined for Q_{i}^{α} (t) = 0 : \sin gular control \\ - 1 for Q_{i}^{α} (t) < 0 : bang - bang control . \end{matrix}

If

p_{i}^{α} = 0, q_{i}^{α} = 0

, then

Q_{i}^{α} (t) = 0, \forall t \in Ω_{0 T}

, and hence

v_{α}^{i}

is undetermined. Otherwise, the function

Q_{i}^{α} (t)

vanishes only for one value of

u (t)

. Then, the singular control is ruled out and the remaining possibilities are bang-bang controls. This optimal control is discontinuous since each component jumps from a minimum to a maximum and vice versa, in response to each change in the sign of each switching function. The form of the optimal Hamiltonian follows. □

4.3. Finding the Maximum with Respect to u

Although we follow the path of finding the maximum with respect to v and then those with respect to u, it is useful to keep in mind the reverse procedure. This facilitates the understanding of some formulas in the following text.

According Formula (13), along any multi-time optimal sheet

v^{*}

, the Hamiltonian is a function only of the heading angle u. As continuous function it has a maximum on the compact interval

[0, π]

. Since

H (0) = H (π)

, the same maximum value is on the interval

[0, π)

. We shall show that at least one and at most

m n

values of u maximize the Hamiltonian. We conclude that the input

u (t)

is piecewise constant and takes on at most

m n

values along any multi-time optimal sheet.

To simplify, we use the

m n

functions

ϕ_{i}^{α} (u) = p_{i}^{α} cos u + q_{i}^{α} sin u, α = 1, \dots, m; i = 1, \dots, n

.

Lemma 2.

(i) The equality

H (x, y, u, v^{*}, p, q) = - 1

is true if and only if each term of the sum

\sum_{α = 1}^{m} \sum_{i = 1}^{n} | ϕ_{i}^{α} (u) |

(14)

is zero.

(ii) A zero

u_{0} \in [0, π)

of one of the functions

ϕ_{i}^{α} (u)

, with

(p_{i}^{α}, q_{i}^{α}) \neq (0, 0)

, is not a maximum point of

H (x, y, u, v^{*}, p, q)

.

Proof.

Let

ϕ_{1}^{1} (u_{0}) = 0

, with

(p_{1}^{1}, q_{1}^{1}) \neq (0, 0)

, for example,

p_{1}^{1} > 0

. Then the function

H (x, y, u, v^{*}, p, q) = h (u)

,

h (u) = \{\begin{matrix} p_{1}^{1} cos u + q_{1}^{1} sin u + A (u) & for & 0 < u_{0} - ϵ < u \leq u_{0} \\ - p_{1}^{1} cos u - q_{1}^{1} sin u + A (u) & for & u_{0} < u < u_{0} + ϵ < π \end{matrix}

has the derivative

h^{'} (u) = \{\begin{matrix} - p_{1}^{1} sin u + q_{1}^{1} cos u + A^{'} (u) & for & 0 < u_{0} - ϵ < u < u_{0} \\ p_{1}^{1} sin u - q_{1}^{1} cos u + A^{'} (u) & for & u_{0} < u < u_{0} + ϵ < π . \end{matrix}

If

u_{0}

is a maximum point, then we should have

h^{'} (u_{0} -) > 0

and

h^{'} (u_{0} +) < 0

, i.e.,

- p_{1}^{1} sin u_{0} + q_{1}^{1} cos u_{0} + A^{'} (u_{0}) > 0,

p_{1}^{1} sin u_{0} - q_{1}^{1} cos u_{0} + A^{'} (u_{0}) < 0 .

Consequently

p_{1}^{1} - q_{1}^{1} ctan u_{0} < 0 .

On the other hand,

p_{1}^{1} cos u_{0} + q_{1}^{1} sin u_{0} = 0

or

ctan u_{0} = - \frac{q_{1}^{1}}{p_{1}^{1}}

, whence

{(p_{1}^{1})}^{2} + {(q_{1}^{1})}^{2} < 0,

which is a contradiction. □

Lemma 3.

If

φ (u) \neq 0

in an open interval I, then the first two derivatives of the function

| φ (u) | : I \to R

are

\frac{d}{d u} | φ (u) | = (sign φ (u)) φ^{'} (u), \frac{d^{2}}{d u^{2}} | φ (u) | = (sign φ (u)) φ^{″} (u) .

Each function

ϕ_{i}^{α} (u)

, which is not identically zero, has exactly one zero in the interval

[0, π)

. Totally, we have a set A consisting of at most

m n

zeros in

[0, π)

.

Lemma 4.

On an interval fixed by two consecutive zeros in A, the Hamiltonian (13) has the properties: (i) it is a

C^{\infty}

function, (ii) it is a concave function, (iii) the derivative

\frac{d H}{d u}

has at most one zero.

Proof.

(i) The function

u \to H (u) + 1

is a sum of absolute values of smooth functions and consequently it is piecewise smooth. (ii) Since

\frac{d^{2} ϕ_{i}^{α}}{d u^{2}} (u) = - ϕ_{i}^{α} (u),

we find

\frac{d^{2} H}{d u^{2}} (u) < 0 .

(iii) It is almost obvious. □

Lemma 5.

The maximum of the Hamiltonian

H (x, y, u, v, p, q)

with respect to the control u, for an optimal value

v^{*}

, is

H (x, y, u^{*}, v^{*}, p, q) = - 1 + \sqrt{{(\sum_{α = 1}^{m} \sum_{i = 1}^{n} v_{α}^{* i} p_{i}^{α})}^{2} + {(\sum_{α = 1}^{m} \sum_{i = 1}^{n} v_{α}^{* i} q_{i}^{α})}^{2}} .

(15)

Proof.

The maximum of the Hamiltonian

H (x, y, u, v, p, q)

with respect to the control v is given in Lemma 1. On the other hand, the maximum of the function

\sum_{α = 1}^{m} \sum_{i = 1}^{n} (p_{i}^{α} v_{α}^{* i} cos u + q_{i}^{α} v_{α}^{* i} sin u),

with respect to u, is

\sqrt{{(\sum_{α = 1}^{m} \sum_{i = 1}^{n} v_{α}^{* i} p_{i}^{α})}^{2} + {(\sum_{α = 1}^{m} \sum_{i = 1}^{n} v_{α}^{* i} q_{i}^{α})}^{2}} .

It follows the maximum of the Hamiltonian. □

Lemma 6.

For any t, the maximum value is

H (x, y, u^{*}, v^{*}, p, q) = 0

.

Proof.

Suppose w is a maximum value function and

w^{1} (x, y) = - \frac{1}{2} D_{t^{2}} w (x, y), w^{2} (x, y) = - \frac{1}{2} D_{t^{1}} w (x, y)

is the generating vector field. The multi-time Hamilton-Jacobi-Bellman PDE (feedback law) is [11]

\frac{\partial w^{α}}{\partial t^{α}} + max_{u \in U; v \in V} \{(\frac{\partial w^{α}}{\partial x} cos u + \frac{\partial w^{α}}{\partial y} sin u) v_{α} - 1\} = 0 .

On the other hand, the evolution PDEs and the Lagrangian

L = - 1

do not depend on the variable t. Then, the generating vector field is independent on t. The multi-time Hamilton-Jacobi-Bellman PDE becomes

max_{u \in U; v \in V} \{(\frac{\partial w^{α}}{\partial x} cos u + \frac{\partial w^{α}}{\partial y} sin u) v_{α} - 1\} = 0,

equivalent to

max_{u \in U; v \in V} H (x, y, u, v, p, q) = 0 .

Consequently, the statement is true. □

Guess Solution for Maximum with Respect to u

In our target problem, the adjoint variables (co-states

p, q

) have no conditions on the boundary and so they may not be initially specified. However, giving extrema points u, we can calculate the optimal co-states

p, q

. Indeed, for

k \leq n

, and the sequence

u_{k} - π = u_{0} < u_{1} < u_{2} < \dots < u_{k} < π,

we can define

p_{i}^{α} = \frac{1}{m} p_{i}, p_{i} = \{\begin{matrix} \frac{cos u_{i - 1} - cos u_{i}}{2} & for i = 1, \dots, k \\ 0 & for i = k + 1, \dots, n . \end{matrix}

q_{i}^{α} = \frac{1}{m} q_{i}, q_{i} = \{\begin{matrix} \frac{sin u_{i - 1} - sin u_{i}}{2} & for i = 1, \dots, k \\ 0 & for i = k + 1, \dots, n, \end{matrix}

where

α = 1, \dots, m

. The function (13) becomes

H (u) = - 1 + \sum_{i = 1}^{n} (\sum_{α = 1}^{m} | p_{i}^{α} cos u + q_{i}^{α} sin u |)

= - 1 + \sum_{i = 1}^{n} | \sum_{α = 1}^{m} (p_{i}^{α} cos u + q_{i}^{α} sin u) |

= - 1 + \sum_{i = 1}^{k} | p_{i} cos u + q_{i} sin u | .

This expression demonstrates the following properties:

(i) sign (p_{i} cos u_{j} + q_{i} sin u_{j}) = \{\begin{matrix} - 1 & for i \leq j \\ 1 & for i > j . \end{matrix}

(i i) H (u_{j}) = 0, \forall j = 1, \dots, k .

(iii) The points

u_{j}

,

j = 1, \dots, k

, are the only maximum points for the function

H (u)

. Consequently,

{max}_{u} H (u) = 0

.

4.4. Finding the Optimal Evolution

The optimal control has a piecewise form

v_{α}^{* i} = sign Q_{i}^{α} (t), u^{*} = u^{*} (t) = const .

In this way, we have transformed the foregoing problem from an infinite-dimensional one, in which we are required to specify the functions

u : [0, T] \to [0, π)

and

v_{α}^{i} : [0, T] \to [- 1, 1]

, for

α = 1, \dots, m; i = 1, \dots, n

, into a finite-dimensional problem, in which we are required only to specify a double sequence of

m n

values of v. Then the optimal evolution is a piecewise solution to the Pfaff equations

d x^{i} (t) = v_{α}^{* i} d t^{α} cos u^{*}, d y^{i} (t) = v_{α}^{* i} d t^{α} sin u^{*} .

The general solution is

x^{i} (t) = v_{α}^{* i} t^{α} cos u^{*} + a^{i}, y^{i} (t) = v_{α}^{* i} t^{α} sin u^{*} + b^{i} .

These formulas generate a piecewise general solution, splitting the domain

Ω_{0 T}

into sub-domains depending on the optimal values

u^{*}

. As example, for a single optimal

u^{*}

and the boundary conditions

x (0) = x_{0}, y (0) = y_{0}, x (T) = 0, y (T) = 0,

we obtain the optimal evolution

x^{i} (t) = v_{α}^{* i} (t^{α} - T^{α}) cos u^{*}, y^{i} (t) = v_{α}^{* i} (t^{α} - T^{α}) sin u^{*},

x_{0}^{i} = - v_{α}^{* i} T^{α} cos u^{*}, y_{0}^{i} = - v_{α}^{* i} T^{α} sin u^{*} .

5. Geometrical Solution

Suppose the set of robots

{z_{1}, \dots, z_{n}; - z_{1}, \dots, - z_{n}}

determine, in this order, a convex polygon in

R^{2}

. We select the point

z_{j}

. Applying the Bretl theory [1], the point

z_{j}

can attend the origin in n steps

P_{1}, \dots, P_{n}

defined by

P_{i} = along \frac{1}{2} {\overset{⟶}{z_{i} z}}_{i + 1} with velocity \{\begin{matrix} v_{i} = - 1, & i < j \\ v_{i} = + 1, & j \leq i \leq n, \end{matrix}

with the convention

z_{n + 1} = - z_{1} .

The spend time for each step

P_{i}

is

t_{i} = \frac{1}{2} | | z_{i + 1} - z_{i} | |

,

i = 1, \dots, n

.

For the point

z_{j}

, the connection between our point of view and the theory of Bretl [1] is

v_{i 1} t_{i}^{1} + v_{i 2} t_{i}^{2} = v_{i} t_{i}, | v_{i 1} | = 1, | v_{i 2} | = 1, i = 1, \dots, n .

If

T_{1} = \sum_{i = 1}^{n} t_{i}^{1}

and

T_{2} = \sum_{j = 1}^{n} t_{j}^{2}

, we must solve the first problem:

min_{(t^{1}, t^{2})} [{(T_{1})}^{2} + {(T_{2})}^{2}], subject to (19) .

To solve this problem, we use the Lagrange function

L = {(\sum_{i = 1}^{n} t_{i}^{1})}^{2} + {(\sum_{j = 1}^{n} t_{j}^{2})}^{2} + \sum_{i = 1}^{n} 2 λ_{i} (v_{i 1} t_{i}^{1} + v_{i 2} t_{i}^{2} - v_{i} t_{i}) .

From the equations of critical points, we find

\sum_{i} t_{i}^{1} = - λ_{k} v_{k 2}, \sum_{j} t_{j}^{2} = - λ_{k} v_{k 1}, for each k = 1, \dots, n .

Since

| v_{k α} | = 1

, it follows

\sum_{i} t_{i}^{1} = \sum_{j} t_{j}^{2} = | λ_{k} | = \frac{1}{2} \sum_{i} t_{i},

and hence

T_{1} = T_{2}

(square). On the other hand, the product

T_{1} T_{2}

depends on t. According Bretl, for

min (T_{1} T_{2})

, we have

min \sum_{i} | v_{i} t_{i} | = \sum_{i} t_{i} = \frac{1}{4} perim {z_{1}, \dots, z_{n}; - z_{1}, \dots, - z_{n}} .

Hence

min \sum_{i} | v_{i 1} t_{i}^{1} + v_{i 2} t_{i}^{2} | = \frac{1}{4} perim {z_{1}, \dots, z_{n}; - z_{1}, \dots, - z_{n}} .

But,

Q = perim {z_{1}, \dots, z_{n}; - z_{1}, \dots, - z_{n}} = \frac{1}{2} (| | z_{1} + z_{n} | | + \sum_{i = 1}^{n - 1} | | z_{i + 1} - z_{i} | |) .

Running the point

z_{j}

, the relations (19) are changed into

v_{i 1}^{j} t_{i}^{1} + v_{i 2}^{j} t_{i}^{2} = v_{i}^{j} t_{i}, j = 1, \dots, n .

Fixing the index i, we obtain a system of n linear equations with two unknowns

t_{i}^{1}, t_{i}^{2}

. If the rank of the system is two, one obtains the uni-temporal case of Bretl, i.e., either

t_{i}^{1} = 0

or

t_{i}^{2} = 0

. For significant two-time case, the rank must be one, and we can take the repartition

t_{i}^{1} = t_{i}^{2} = \frac{t_{i}}{2}

,

v_{i 1}^{j} = v_{i 2}^{j} = v_{i}^{j}

. It follows (square)

T_{1} = T_{2} = \frac{Q}{2}, T_{1} T_{2} = \frac{Q^{2}}{4} .

6. Multi-Time Hamilton-Jacobi-Bellman PDE

To solve the problem formulated in Section 2, let us use the idea that the multi-time dynamic programming method permits the design of multi-time optimal controls.

To simplify, let us accept

α = 1, 2

. Also, to use the multi-time maximum principle, we replace the initial multiple integral functional

I (u (\cdot), v (\cdot))

by

J (u (\cdot), v (\cdot)) = - \int_{Ω_{0 T}} d t^{1} \land d t^{2} = \max

(equivalent minimum area). Let us consider the set

Ω_{(t^{1}, t^{2}) (T^{1}, T^{2})}

, where

t = (t^{1}, t^{2})

. Since

J_{t, (x^{1}, y^{1}), (x^{2}, y^{2})} (u (\cdot), v (\cdot)) = - \int_{t^{1}}^{T^{1}} \int_{t^{2}}^{T^{2}} d s^{1} d s^{2},

we transform the maximum problem in Section 1 into similar problems: find

max_{u (\cdot), v (\cdot)} J_{t, (x^{1}, y^{1}), (x^{2}, y^{2})} (u (\cdot), v (\cdot)) = (t^{1} - T^{1}) (T^{2} - t^{2})

(16)

subject to

\frac{\partial X^{i}}{\partial s^{α}} (s^{1}, s^{2}) = v_{α}^{i} (s^{1}, s^{2}) cos u (s^{1}, s^{2}), \frac{\partial Y^{i}}{\partial s^{α}} (s^{1}, s^{2}) = v_{α}^{i} (s^{1}, s^{2}) sin u (s^{1}, s^{2}),

X (t^{1}, t^{2}) = x, Y (t^{1}, t^{2}) = y, (s^{1}, s^{2}) \in Ω_{(t^{1}, t^{2}) (T^{1}, T^{2})}

X (T^{1}, T^{2}) = 0, Y (T^{1}, T^{2}) = 0,

were

(T^{1} - t^{1}, T^{2} - t^{2})

is selected to have a minimum norm.

Remark 2.

For m-volume multi-time optimal problems, the maximum value function w does not depend on the multi-time t.

6.1. One Optimal Value of the Control u

6.1.1. Case $α = 1, 2$ , $i = 1$

Omitting the index “star”, the constraints (boundary value problem) rewrite in the form

x = v_{α} (t^{α} - T^{α}) cos u, y = v_{α} (t^{α} - T^{α}) sin u, α = 1, 2 .

(17)

Generally,

\frac{y}{x} = tan u

and the relation

x = v_{α} (t^{α} - T^{α}) cos u

connects linearly the differences

t^{1} - T^{1}

and

t^{2} - T^{2}

. We need to find

min [{(T^{1} - t^{1})}^{2} + {(T^{2} - t^{2})}^{2}] subject to x = v_{α} (t^{α} - T^{α}) cos u .

Denoting

L = {(T^{1} - t^{1})}^{2} + {(T^{2} - t^{2})}^{2} + ℓ (v_{α} (t^{α} - T^{α}) cos u - x),

we find the critical point condition

T^{1} - t^{1} = - \frac{1}{2} ℓ v_{2} cos u, T^{2} - t^{2} = - \frac{1}{2} ℓ v_{1} cos u,

Because

| v_{1} | = | v_{2} | = 1

, it follows

T^{1} - t^{1} = T^{2} - t^{2} = \frac{x}{2 | cos u |} = \frac{O M}{2}

and

max_{u (\cdot), v (\cdot)} J_{t, (x^{1}, y^{1}), (x^{2}, y^{2})} (u (\cdot), v (\cdot)) = - \frac{x^{2}}{4 {cos}^{2} u} = - \frac{O M^{2}}{4} = - \frac{x^{2} + y^{2}}{4} .

Let us correlate this result with the Hamilton-Jacobi-Bellman PDE. Since

w ((x, y))

does not depend on t, the generating vector

(w^{1} ((x, y)), w^{2} ((x, y)))

does not depend on t.

Suppose w is a maximum value function and

w^{1} (x, y) = - \frac{1}{2} D_{t^{2}} w (x, y), w^{2} (x, y) = - \frac{1}{2} D_{t^{1}} w (x, y)

is the generating vector field. The two-time Hamilton-Jacobi-Bellman PDE (feedback law) is [11]

\frac{\partial w^{α}}{\partial t^{α}} + max_{u \in U; v \in V} \{(\frac{\partial w^{α}}{\partial x} cos u + \frac{\partial w^{α}}{\partial y} sin u) v_{α} - 1\} = 0

The maximum with respect v is obtained for

v_{α} = s i g n (\frac{\partial w^{1}}{\partial x} cos u + \frac{\partial w^{1}}{\partial y} sin u) .

It follows the PDE

max_{u} (|\frac{\partial w^{1}}{\partial x} cos u + \frac{\partial w^{1}}{\partial y} sin u| + |\frac{\partial w^{2}}{\partial x} cos u + \frac{\partial w^{2}}{\partial y} sin u|) - 1 = 0 .

Taking

\frac{\partial w^{1}}{\partial x} = \frac{\partial w^{2}}{\partial x} = \frac{x}{2 \sqrt{x^{2} + y^{2}}}, \frac{\partial w^{1}}{\partial y} = \frac{\partial w^{2}}{\partial y} = \frac{y}{2 \sqrt{x^{2} + y^{2}}},

the value

{max}_{u}

is 1 for

tan u = \frac{y}{x}

. Consequently,

w^{1} ((x, y)) = w^{2} ((x, y)) = \frac{\sqrt{x^{2} + y^{2}}}{2}

is a generating vector field.

In this case, using the total derivative operator D, we have

- 2 w^{1} = D_{t^{2}} w = \frac{\partial w}{\partial x} \frac{\partial x}{\partial t^{2}} + \frac{\partial w}{\partial y} \frac{\partial y}{\partial t^{2}}, - 2 w^{2} = D_{t^{1}} w = \frac{\partial w}{\partial x} \frac{\partial x}{\partial t^{1}} + \frac{\partial w}{\partial y} \frac{\partial y}{\partial t^{1}} .

For

v_{1} = v_{2} = - 1

, one obtains a single PDE

x \frac{\partial w}{\partial x} + y \frac{\partial w}{\partial y} = - \frac{x^{2} + y^{2}}{2}, w (0, 0) = 0,

whose solution is

w (x, y) = - \frac{x^{2} + y^{2}}{2} .

On the other hand, according [11],

max_{u (\cdot), v (\cdot)} J (u (\cdot), v (\cdot)) = w (x (t^{1} - T^{1}, t^{2} - T^{2}), y (t^{1} - T^{1}, t^{2} - T^{2}))

- w (x (t^{1}, t^{2} - T^{2}), y (t^{1}, t^{2} - T^{2})) - w (x (t^{1} - T^{1}, t^{2}), y (t^{1} - T^{1}, t^{2})) .

From the evolution (17), it follows

x (t^{1} - T^{1}, t^{2}) = x (t^{1}, t^{2} - T^{2}) = \frac{x}{2}, y (t^{1} - T^{1}, t^{2}) = y (t^{1}, t^{2} - T^{2}) = \frac{y}{2} .

The equality

- \frac{x^{2} + y^{2}}{4} = - \frac{x^{2} + y^{2}}{2} + \frac{x^{2} + y^{2}}{8} + \frac{x^{2} + y^{2}}{8}

confirms the previous results.

6.1.2. Case $α = 1, 2$ , $i = 1, 2$

Omitting the index “star”, the constraints (boundary value problem) rewrite in the form

x^{i} = v_{α}^{i} (t^{α} - T^{α}) cos u, y^{i} = v_{α}^{i} (t^{α} - T^{α}) sin u, i = 1, 2; α = 1, 2 .

(18)

Since

y^{i} = (tan u) x^{i}

,

i = 1, 2

, to find the maximum value

{max}_{u (\cdot), v (\cdot)} J

, we need to solve the problem:

min [{(T^{1} - t^{1})}^{2} + {(T^{2} - t^{2})}^{2}] subject to x = v_{α} (t^{α} - T^{α}) cos u .

Case $det v \neq 0$ If

det v = det (v_{α}^{i}) = \pm 2

, then we find

t^{1} - T^{1} = \frac{1}{det v cos u} (v_{2}^{2} x^{1} - v_{2}^{1} x^{2}), T^{2} - t^{2} = \frac{1}{det v cos u} (v_{1}^{1} x^{2} - v_{1}^{2} x^{1}),

It follows

max_{u, v} J_{((x^{1}, y^{1}), (x^{2}, y^{2}))} = \frac{- 1}{{(det v cos u)}^{2}} (v_{2}^{1} x^{2} - v_{2}^{2} x^{1}) (v_{1}^{1} x^{2} - v_{1}^{2} x^{1})

or

max_{u, v} J_{((x^{1}, y^{1}), (x^{2}, y^{2}))} = - \frac{1}{4} |{(\frac{x^{1}}{cos u})}^{2} - {(\frac{x^{2}}{cos u})}^{2}|

= - \frac{1}{4} | O M_{1}^{2} - O M_{2}^{2} |,

where

M_{1} = (x^{1}, y^{1}), M_{2} = (x^{2}, y^{2})

.

The two-time Hamilton-Jacobi-Bellman PDE (feedback law) is [11]

max_{u \in U; v \in V} \{(\frac{\partial w^{α}}{\partial x^{i}} cos u + \frac{\partial w^{α}}{\partial y^{i}} sin u) v_{α}^{i} - 1\} = 0 .

This PDE can be rewritten in the form

- 1 + max_{u \in U} \{\sum_{α, i = 1}^{2} |\frac{\partial w^{α}}{\partial x^{i}} cos u + \frac{\partial w^{α}}{\partial y^{i}} sin u|\} = 0,

since each optimal control

v_{α}^{* i}

is

v_{α}^{* i} (t) = s i g n (\frac{\partial w^{α}}{\partial x^{i}} cos u + \frac{\partial w^{α}}{\partial y^{i}} sin u) .

Using a single optimal control

u^{*} (t) = const

, the previous two-time Hamilton-Jacobi-Bellman PDE reduces to

- 1 + \sqrt{{(\sum_{α, i = 1}^{2} v_{α}^{* i} \frac{\partial w^{α}}{\partial x^{i}})}^{2} + {(\sum_{α, i = 1}^{2} v_{α}^{* i} \frac{\partial w^{α}}{\partial y^{i}})}^{2}} = 0 .

We obtain an eikonal PDE

{(\sum_{α, i = 1}^{2} v_{α}^{* i} \frac{\partial w^{α}}{\partial x^{i}})}^{2} + {(\sum_{α, i = 1}^{2} v_{α}^{* i} \frac{\partial w^{α}}{\partial y^{i}})}^{2} = 1,

with the unknown functions

w^{1} (x, y), w^{2} (x, y)

. This PDE is equivalent to the system

\sum_{α, i = 1}^{2} v_{α}^{* i} \frac{\partial w^{α}}{\partial x^{i}} = cos χ, \sum_{α, i = 1}^{2} v_{α}^{* i} \frac{\partial w^{α}}{\partial y^{i}} = sin χ .

(19)

Consequently, for

\sum_{α, i = 1}^{2} v_{α}^{* i} = 2

, a solution

(w^{1}, w^{2})

of the Hamilton-Jacobi-Bellman PDE is obtained from

w^{1} ((x^{1}, y^{1}), (x^{2}, y^{2})) = \frac{1}{2} (x^{1} + x^{2}) cos χ + \frac{1}{2} (y^{1} + y^{2}) sin χ

+ ϕ^{1} (v_{1}^{* 2} x^{1} - v_{1}^{* 1} x^{2}, v_{1}^{* 2} y^{1} - v_{1}^{* 1} y^{2}),

w^{2} ((x^{1}, y^{1}), (x^{2}, y^{2})) = \frac{1}{2} (x^{1} + x^{2}) cos χ + \frac{1}{2} (y^{1} + y^{2}) sin χ

+ ϕ^{2} (v_{2}^{* 2} x^{1} - v_{1}^{* 1} x^{2}, v_{2}^{* 2} y^{1} - v_{1}^{* 1} y^{2}),

for

(x^{1}, y^{1}) \in R^{2}, (x^{2}, y^{2}) \in R^{2}

. The solution obtained via the strong multi-time maximum principle is recovered by the conditions

ϕ^{1} (v_{1}^{* 2} x^{1} - v_{1}^{* 1} x^{2}, v_{1}^{* 2} y^{1} - v_{1}^{* 1} y^{2}) = a^{1} (v_{1}^{* 2} x^{1} - v_{1}^{* 1} x^{2}) + b^{1} (v_{1}^{* 2} y^{1} - v_{1}^{* 1} y^{2})

ϕ^{2} (v_{2}^{* 2} x^{1} - v_{1}^{* 1} x^{2}, v_{2}^{* 2} y^{1} - v_{1}^{* 1} y^{2}) = a^{2} (v_{2}^{* 2} x^{1} - v_{1}^{* 1} x^{2}) + b^{2} (v_{2}^{* 2} y^{1} - v_{1}^{* 1} y^{2}) .

In this case, the complexity of finding an optimal policy (for arbitrary initial conditions) is dominated by the computation of a planar convex hull.

Case $det v = 0$ . In this case, we need to solve the problem

min [{(T^{1} - t^{1})}^{2} + {(T^{2} - t^{2})}^{2}] subject to x^{1} = v_{α}^{1} (t^{α} - T^{α}) cos u .

The result is similar to those when

α = 1, 2

,

i = 1

.

Remark 3.

Consider the eikonal PDE

| | D u (x) | | = 1, x \in Ω \subset R^{n} {; u (x) |}_{\partial Ω} = 0 .

Show that:

(i)

| | D u (x) | | = 1 \Leftrightarrow {sup}_{| | q | | \leq 1} (D u (x) \cdot q - 1) = 0, \forall x \in Ω;

(ii) the function

u (x) = d i s t (x; \partial Ω)

solves the eikonal PDE in the viscosity sense.

6.2. Two Optimal Values of the Control u

Let us consider the partitions

t^{1} = t_{0}^{1} < t_{1}^{1} < \dots < t_{k}^{1} = T^{1}

,

t^{2} = t_{0}^{2} < t_{1}^{2} < \dots < t_{k}^{2} = T^{2}

, and the rectangles

Ω_{j} = Ω_{(t^{1}, t^{2}) (t_{j}^{1}, t_{j}^{2})}, j = 1, \dots, k

. We order the optimal values

u_{α i}^{*}

in an increasing sequence

u_{1}, \dots, u_{k}

and we set

u_{j}

for the multi-time set

Ω_{j} ∖ Ω_{j - 1}

. For finding the optimal evolution it is enough to consider the diagonal rectangles

Ω_{(t_{j - 1}^{1}, t_{j - 1}^{2}) (t_{j}^{1}, t_{j}^{2})}, j = 1, \dots, k

. The points

t_{j} = (t_{j}^{1}, t_{j}^{2})

are determined by

u_{j}

and are connected to

(T^{1}, T^{2})

.

To simplify, in

Ω_{(t^{1}, t^{2}) (T^{1}, T^{2})}

, let us consider two diagonal rectangles

Ω_{1} = Ω_{(t^{1}, t^{2}) (\frac{1}{2} (t^{1} + T^{1}), \frac{1}{2} (t^{2} + T^{2}))}, Ω_{2} = Ω_{(\frac{1}{2} (t^{1} + T^{1}), \frac{1}{2} (t^{2} + T^{2})) (T^{1}, T^{2})}

the first corresponding to the optimal value

u_{1}

, and the second to

u_{2}

. Denoting

t = (t^{1}, t^{2}), t^{*} = (\frac{1}{2} (t^{1} + T^{1}), \frac{1}{2} (t^{2} + T^{2}), T = (T^{1}, T^{2}),

and imposing

x (t) = x, y (t) = y; x (t^{*}) = x_{*}, y (t^{*}) = y_{*}; x (T) = 0, y (T) = 0,

the optimal evolution splits as:

x^{i} = x_{*}^{i} + \frac{1}{2} v_{α}^{i} (t^{α} - T^{α}) cos u_{1}, y^{i} = y_{*}^{i} + \frac{1}{2} v_{α}^{i} (t^{α} - T^{α}) sin u_{1}, on Ω_{1};

x^{i} = v_{α}^{i} (t^{α} - T^{α}) cos u_{2}, y^{i} = v_{α}^{i} (t^{α} - T^{α}) sin u_{2}, on Ω_{2} .

We need to solve the problem of finding the maximum cost on

Ω_{1}

, then on

Ω_{2}

and to add them.

Maximum on

Ω_{1}

. Using the Lagrangian function

L_{1} = (t^{1} - t^{* 1}) (t^{* 2} - t^{2}) + λ_{i} (x_{*}^{i} + \frac{1}{2} v_{α}^{i} (t^{α} - T^{α}) cos u_{1} - x^{i}), det v = det (v_{α}^{i}) = \pm 2,

we find

t^{1} - t^{* 1} = \frac{1}{2} λ_{i} v_{2}^{i} cos u_{1}, t^{* 2} - t^{2} = - \frac{1}{2} λ_{i} v_{1}^{i} cos u_{1},

where

λ_{1} = - \frac{2}{det v {cos}^{2} u_{1}} (x^{2} - x_{*}^{2}), λ_{2} = \frac{2}{det v {cos}^{2} u_{1}} (x^{1} - x_{*}^{1}) .

Denoting

A = (- v_{2}^{1} (x^{2} - x_{*}^{2}) + v_{2}^{2} (x^{1} - x_{*}^{1})) (v_{1}^{1} (x^{2} - x_{*}^{2}) - v_{1}^{2} (x^{1} - x_{*}^{1})),

it follows

w_{1} (t, (x^{1}, y^{1}), (x^{2}, y^{2})) = \frac{A}{{(det v cos u_{1})}^{2}}

or

w_{1} (t, (x^{1}, y^{1}), (x^{2}, y^{2})) = - \frac{1}{4} |{(\frac{x^{1} - x_{*}^{1}}{cos u_{1}})}^{2} - {(\frac{x^{2} - x_{*}^{2}}{cos u_{1}})}^{2}|

= - \frac{1}{4} | {M_{1} M_{1}^{*}}^{2} - {M_{2} M_{2}^{*}}^{2} |,

on

Ω_{1}

, where

M_{1} = (x^{1}, y^{1}), M_{2} = (x^{2}, y^{2}), M_{1}^{*} = (x_{*}^{1}, y_{*}^{1}), M_{2}^{*} = (x_{*}^{2}, y_{*}^{2}) .

Maximum on

Ω_{2}

. Since the constraints have the form

x^{i} = v_{α}^{i} (t^{* α} - T^{α}) cos u_{2},

the result is similar to those in “Case: One optimal value of the control”. Hence

w_{2} (t, (x_{*}^{1}, y_{*}^{1}), (x_{*}^{2}, y_{*}^{2})) = - \frac{1}{4} | {O M_{1}^{*}}^{2} - {O M_{2}^{*}}^{2} |,

on

Ω_{2}

. It follows

w (t, (x^{1}, y^{1}), (x^{2}, y^{2})) = w_{1} (t, (x^{1}, y^{1}), (x^{2}, y^{2})) + w_{2} (t, (x_{*}^{1}, y_{*}^{1}), (x_{*}^{2}, y_{*}^{2})) .

6.3. Viscosity Solution

The Hamilton-Jacobi-Bellman PDE has smooth solution on

Ω_{1}

, respectively on

Ω_{2}

. Since at the point

t^{*}

we have a discontinuity of the partial derivatives, we must refer to the PDE system (18) and to its viscosity solutions. The basic idea is to replace the differentials

D_{(x^{1}, y^{1}; x^{2}, y^{2})} φ^{α} (t, (x^{1}, y^{1}), (x^{2}, y^{2}))

at a point

(t, (x^{1}, y^{1}), (x^{2}, y^{2}))

where it does not exist (for example because of a kink in

φ

) with the differentials

D_{(x^{1}, y^{1}; x^{2}, y^{2})} ψ^{α} (t, (x^{1}, y^{1}), (x^{2}, y^{2}))

of a smooth function

ψ

touching the graph of

φ

, from above for the subsolution condition and from below for the supersolution one, at the point

(t, (x^{1}, y^{1}), (x^{2}, y^{2}))

.

Definition 1.

(i) A continuous function

φ = (φ^{1}, φ^{2})

is said to be a viscosity subsolution of the PDE system (18) if, for any point

(t, (x^{1}, y^{1}), (x^{2}, y^{2}))

and for any smooth function

ψ = (ψ^{1}, ψ^{2})

such that each function

φ^{α} - ψ^{α}

,

α = 1, 2,

has a maximum point at

(t, (x^{1}, y^{1}), (x^{2}, y^{2}))

, we have

\sum_{α, i = 1}^{2} \frac{\partial ψ^{α}}{\partial x^{i}} (t, (x^{1}, y^{1}), (x^{2}, y^{2})) \leq cos χ, \sum_{α, i = 1}^{2} \frac{\partial ψ^{α}}{\partial y^{i}} (t, (x^{1}, y^{1}), (x^{2}, y^{2})) \leq sin χ .

(ii) A continuous function

φ = (φ^{1}, φ^{2})

is said to be a viscosity supersolution of (16) if, for any point

(t, (x^{1}, y^{1}), (x^{2}, y^{2}))

and for any smooth function ψ such that each function

φ^{α} - ψ^{α}

,

α = 1, 2,

has a minimum point at

(t, (x^{1}, y^{1}), (x^{2}, y^{2}))

, we have

\sum_{α, i = 1}^{2} \frac{\partial ψ^{α}}{\partial x^{i}} (t, (x^{1}, y^{1}), (x^{2}, y^{2})) \geq cos χ, \sum_{α, i = 1}^{2} \frac{\partial ψ^{α}}{\partial y^{i}} (t, (x^{1}, y^{1}), (x^{2}, y^{2})) \geq sin χ .

(iii) A continuous function

φ = (φ^{1}, φ^{2})

is said to be a viscosity solution of the PDE system (16) if it is a viscosity subsolution and supersolution.

The viscosity solution of the PDE system is

φ = (φ^{1}, φ^{2}) = (Q, Q),

where Q is a quarter of the perimeter of the parallelogram

(x^{1}, y^{1}), (x^{2}, y^{2}),

(- x^{1}, - y^{1}), (- x^{2}, - y^{2}),

i.e.,

2 Q ((x^{1}, y^{1}), (x^{2}, y^{2})) = \sqrt{{(x^{1} - x^{2})}^{2} + {(y^{1} - y^{2})}^{2}} + \sqrt{{(x^{1} + x^{2})}^{2} + {(y^{1} + y^{2})}^{2}} .

7. Conclusions

Our work is the first which introduces and studies the theory of minirobots moving at different partial speeds (in a multi-temporal sense), but that must all move in the same partial direction. We are motivated to solve this problem because constraints of previous sort must be common in micro-scale and nano-scale robotic systems appearing in applied fields mentioned above. To understand a multi-temporal evolution we must think the dependence on multi-time either as an immersion, or as a diffeomorphism, or as a submersion, and that the partial order in

R_{+}^{m}

induces a partial order on the image of such a function.

The phenomenon described by us takes place in spaces with at least four dimensions. That is why graphic representations lose their meaning.

By application of the (weak and strong) multi-time maximum principle, we obtain necessary conditions for optimality and use them to guess a candidate control policy. By the multi-time Hamilton-Jacobi-Bellman PDE, we verify that our guess is optimal. The complexity of finding this policy for arbitrary initial conditions is only quasilinear in the number of robots, and in fact is dominated by the computation of a planar convex hull.

In our minds the previous theory can be extended to the situation of three-dimensional robots, using the versor of unit sphere, which we will do in a future paper. We tested the theory of multi-time optimal control in relevant applications: multi-time control strategies for skilled movements [13], optimal control of electromagnetic energy [16], multi-time optimal control for quantum systems [10] etc.

Author Contributions

The contributions of both authors are equal. The main results and illustrative examples were developed together. All authors have read and agreed to the published version of the manuscript.

Funding

This research received funding from Balkan Society of Geometers, Bucharest, Romania.

Acknowledgments

Thanks to referees for pertinent remarks.

Conflicts of Interest

The authors declare no conflict of interest.

References

Bretl, T. Minimum-time optimal control of many robots that move in the same direction at different speeds. IEEE Trans. Robbot. 2012, 28, 351–363. [Google Scholar] [CrossRef]
DeVon, D.A.; Bretl, T. Control of many robots moving in the same direction with different speeds: A decoupling approach. In Proceedings of the 2009 American Control Conference, St. Louis, MO, USA, 10–12 June 2009. [Google Scholar]
Becker, A.; Onyuksel, C.; Bretl, T.; McLurkin, J. Controlling many differential-drive robots with uniform control inputs. Int. J. Robot. Res. 2014, 33, 1626–1644. [Google Scholar] [CrossRef]
Bien, Z.; Lee, J. A minimum-time trajectory planning method for two robots. IEEE Trans. Robot. Autom. 1992, 8, 414–418. [Google Scholar] [CrossRef]
Bloch, A.M.; Baillieul, J.; Crouch, P.E.; Marsden, J.E. Nonholonomic Mechanics and Control; Springer: Berlin/Heidelberg, Germany; New York, NY, USA, 2003. [Google Scholar]
Mauder, M. Time-Optimal Control of the Bi-Steerable Robot. Ph.D. Thesis, Fakultät für Mathematik und Informatik der Julius-Maximilians-Universität, Würzburg, Germany, 2012. [Google Scholar]
Udrişte, C. Multitime controllability, observability and bang-bang principle. J. Optim. Theory Appl. 2008, 39, 141–157. [Google Scholar] [CrossRef]
Udrişte, C.; Ţevy, I. Multitime dynamic programming for curvilinear integral actions. J. Optim. Theory Appl. 2010, 146, 189–207. [Google Scholar] [CrossRef]
Udrişte, C. Equivalence of multitime optimal control problems. Balk. J. Geom. Appl. 2010, 15, 155–162. [Google Scholar]
Udrişte, C. Multitime optimal control for quantum systems. In Proceedings of the Third International Conference on Lie-Admissible Treatments of Irreversible Processes (ICLATIP-3), Kathmandu University, Dhulikhel, Nepal, 3–7 January 2011. [Google Scholar]
Udrişte, C.; Ţevy, I. Multitime dynamic programming for multiple integral actions. J. Glob. Optim. 2011, 51, 345–360. [Google Scholar] [CrossRef]
Udrişte, C.; Bejenaru, A. Multitime optimal control with area integral costs on boundary. Balk. J. Geom. Appl. 2011, 16, 138–154. [Google Scholar]
Iliuţă, M.; Udrişte, C.; Ţevy, I. Multitime control strategies for skilled movements. Balk. J. Geom. Appl. 2013, 18, 31–46. [Google Scholar]
Dirac, P.A.M. Relativistic quantum mechanics. Proc. R. Soc. A 1932, 136, 453–464. [Google Scholar] [CrossRef]
Taubes, C.H. Differential Geometry: Bundles, Connections, Metrics and Curvature; Oxford University Press: Oxford, UK, 2011. [Google Scholar]
Pîrvan, M.; Udrişte, C. Optimal control of electromagnetic energy. Balk. J. Geom. Appl. 2010, 15, 131–141. [Google Scholar]

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.

Minirobots Moving at Different Partial Speeds

Abstract

1. Introduction

2. Many Robots That Move at Different Partial Speeds

3. Solution via Weak Multi-Time Maximum Principle

Solving the Adjoint PDEs System

4. Solution via Strong Multi-Time Maximum Principle

4.1. Solving the Adjoint PDEs System

4.2. Finding the Maximum with Respect to v

4.3. Finding the Maximum with Respect to u

Guess Solution for Maximum with Respect to u

4.4. Finding the Optimal Evolution

5. Geometrical Solution

6. Multi-Time Hamilton-Jacobi-Bellman PDE

6.1. One Optimal Value of the Control u

6.1.1. Case α = 1 , 2 , i = 1

6.1.2. Case α = 1 , 2 , i = 1 , 2

6.2. Two Optimal Values of the Control u

6.3. Viscosity Solution

7. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Article Metrics

Article Access Statistics

6.1.1. Case $α = 1, 2$ , $i = 1$

6.1.2. Case $α = 1, 2$ , $i = 1, 2$