Integrable Systems: In the Footprints of the Greats

Velimir Jurdjevic

doi:10.3390/math11041063

Abstract

In his 1842 lectures on dynamics C.G. Jacobi summarized difficulties with differential equations by saying that the main problem in the integration of differential equations appears in the choice of right variables. Since there is no general rule for finding the right choice, it is better to introduce special variables first, and then investigate the problems that naturally lend themselves to these variables. This paper follows Jacobi’s prophetic observations by introducing certain “meta” variational problems on semi-simple reductive groups G having a compact subgroup K. We then use the Maximum Principle of optimal control to generate the Hamiltonians whose solutions project onto the extremal curves of these problems. We show that there is a particular sub-class of these Hamiltonians that admit a spectral representation on the Lie algebra of G. As a consequence, the spectral invariants associated with the spectral curve produce a large number of integrals of motion, all in involution with each other, that often meet the Liouville complete integrability criteria. We then show that the classical integrals of motion associated, with the Kowalewski top, the two-body problem of Kepler, and Jacobi’s geodesic problem on the ellipsoid can be all derived from the aforementioned Hamiltonian systems. We also introduce a rolling geodesic problem that admits a spectral representation on symmetric Riemannian spaces and we then show the relevance of the corresponding integrals on the nature of the curves whose elastic energy is minimal.

Keywords:

symplectic; manifolds; Lie-Poisson bracket; Lie algebras; co-adjoint orbits; extremal curves; integrable systems

MSC:

53C17; 53C22; 53B21; 53C25; 30C80; 26D05; 49J15; 58E40

1. Introduction

The theory of integrable systems begins with W.R. Hamilton who in 1835 pronounced that the equations of motion of an n body system conform to the principle of least action, and consequently can be represented as

\frac{d q_{i}}{d t} = \frac{\partial H}{\partial p_{i}}, \frac{d p_{i}}{d t} = - \frac{\partial H}{\partial q_{i}}, i = 1, \dots, n,

(1)

under the transformation

p_{i} = \frac{\partial T}{\partial {\dot{q}}_{i}} (q_{1}, \dots, q_{n}, {\dot{q}}_{1}, \dots, {\dot{q}}_{n}), {\dot{q}}_{i} = \frac{d q_{i}}{d t}, i = 1, \dots, n,

(2)

where

H = T + V

is the total energy, with T the kinetic and V the potential energy of the system. He then observed that H is conserved along the solutions of the system. Hamilton’s discovery gave rise to a new class of differential equations of the form

\frac{d x}{d t} = \frac{\partial H}{\partial y} (x (t), y (t)) \frac{d y}{d t} = - \frac{\partial H}{\partial x} (x (t), y (t)) .

(3)

associated with any function H of

2 n

variables

x = x_{1}, \dots, x_{n}

and

y = y_{1}, \dots, y_{n}

. Such equations became known as the canonical equations. Then the transformations

(x, y) \to (x^{'}, y^{'})

that preserved the canonical form of these equations were also called canonical, and the functions whose values were conserved by canonical systems became known as integrals.

Hamilton’s discovery had an immediate impact on the scientific community of the nineteenth century. Canonical equations became the central object of study in the mathematics of that period with the contributions of J. Liouville, S.D. Poisson, C.G. Jacobi and H. Poincaré leading the way towards a new branch in mathematics known today as the theory of integrable systems. This theory was principally driven by a lasting interest in the existence of extra integrals of motion and the symmetries that are accountable for the existence of these integrals. Its defining moment may be attributed to S.D. Poisson who in 1809 [1] introduced his bracket (known since as the Poisson bracket)

{f, g} = \sum_{i = 1}^{n} \frac{\partial f}{\partial x_{i}} \frac{\partial g}{\partial y_{i}} - \frac{\partial f}{\partial y_{i}} \frac{\partial g}{\partial x_{i}}

(4)

for functions f and g in the canonical variables

x_{1}, \dots, x_{n}, y_{1}, \dots, y_{n}

.

The introduction of the Poisson bracket greatly facilitated the emerging theory of that period. It provided an alternative definition of canonical systems as differential systems that satisfy

\frac{d x_{i}}{d t} = {x_{i}, H}, and \frac{d y_{i}}{d t} = {y_{i}, H}, i = 1, \dots, n

(5)

and it also redefined integrals of motion associated with H as functions F that satisfy

{F, H} = 0

. It was Jacobi, however, who noticed the fundamental property of the Poisson bracket

{f, {g, h}} + {h, {f, g}} + {g, {h, f}} = 0,

(6)

that has been known ever since as the Jacobi’s identity. It is then an easy consequence of Jacobi’s identity that

F_{3} = {F_{1}, F_{2}}

is a third integral of motion for H for any two integrals

F_{1}

and

F_{2}

(known as Poisson’s theorem [2]). Alternatively integrals of motion were detected through a suitable change of canonical coordinates. Jacobi characterized such changes of coordinates through a generating function

S (x, y^{'})

. According to Jacobi

(x, y) \to (x^{'}, y^{'})

is canonical if and only if

y_{i} = \frac{\partial S}{\partial x_{i}}, x_{i}^{'} = \frac{\partial S}{\partial y^{'}} .

Poincaré characterized canonical change of coordinates in terms of differential forms:

(x, y) \to (x^{'}, y^{'})

is canonical if and only if

\sum_{i = 1}^{n} x_{i} d y_{i} - x_{i}^{'} d y_{i}^{'} = d S

for some function S.

From contemporary perspectives the theory of integrable systems begins with C.G. Jacobi and his seminal book Lectures in Dynamics [3]. Jacobi demostrated that the canonical Equation (3) can be integrated with the aid of a partial differential equation

H (x_{1}, \dots, x_{n}, \frac{d S}{d x_{1}}, \dots, \frac{d S}{d x_{n}}) = c,

(7)

in terms of an unknown function S. He showed that if a particular solution of (7) can be found in terms of n arbitrary constants of motion

h_{1}, \dots, h_{n}

then

c = ϕ (h_{1}, \dots, h_{n})

for some function

ϕ

, and the transformation

y_{i} = \frac{\partial S}{\partial x_{i}}, h^{'} = \frac{\partial S}{\partial h_{i}}

(8)

transforms the canonical coordinates

(x, y)

into new canonical coordinates

(h^{'}, h)

relative to which the canonical Equation (3) are transformed into the equations

\frac{d h^{'}}{d t} = \frac{d ϕ}{d h}, \frac{d h}{d t} = - \frac{d ϕ}{d h^{'}} = 0,

whose solutions are given by

h^{'} (t) = c_{2} t + c_{3}, h (t) = c_{1}, c_{2} = \frac{d ϕ}{d h} .

(9)

Canonical coordinates whose solutions are given by (9) are called action-angle coordinates [4].

Equation

H (x_{1}, \dots, x_{n}, \frac{\partial S}{\partial x_{1}}, \dots, \frac{\partial S}{\partial x_{n}}) = c

is known as Jacobi’s equation. Poincaré referred to the above result as the first theorem of Jacobi in his treatise of celestial mechanics [2]. Jacobi’s solution of the above partial differential equation in terms of the elliptic coordinates stands out as the most original and, perhaps, the most enigmatic contribution to the theory of canonical systems. Jacobi’s use of elliptic coordinates suggested the existence of a special class of variational problems whose solutions can be described by Abelian integrals in some privileged system of coordinates, exemplified by the geodesic problem on the ellipsoid. In the absence of any apparent symmetries on the ellipsoid that account for the integrability of the geodesic problem, this result of Jacobi seemed particularly mysterious.

In Jacoby summary, the main problem in the integration of differential equations appears in the choice of right variables. Given no general rule for finding the right choice, it is better to introduce special variables first, and then investigate the problems that naturally lend themselves to these variables [3]. Jacobi, however, does not comment on another exceptional aspect of his discovery, namely the mysterious presence of partial differential equations for the problems of variational calculus, an issue that remained open for a long time.

Almost a hundred years later, C. Carathéodory in the introduction to his famous book on the calculus of variations [5] remarks that “ neither Jacobi, nor his students, nor the many other prominent men who so brilliantly represented and advanced this discipline during the nineteenth century, thought in any way of the relationship between the calculus of variations and partial differential equation”. H. Poincaré also sidestepped this issue by treating canonical systems as the solutions of a dynamical system

\frac{d}{d t} \sum_{k = 1}^{2 n} x_{i} \frac{d y_{i}}{d α_{k}} - \frac{d}{d α_{k}} \sum_{k = 1}^{2 n} x_{i} \frac{d y_{i}}{d t} = \frac{d F}{d α_{k}}, k = 1, \dots, 2 n,

(10)

where

α_{1}, \dots, α_{2 n}

denote the constants

x_{i} (t_{0}) = α_{i}, y_{i} (t_{0}) = α_{i + n}, i = 1, \dots, n

. Since

\frac{d F}{d α_{k}} = \sum_{i = 1}^{n} \frac{\partial F}{\partial x_{i}} \frac{\partial x_{i}}{\partial α_{k}} + \frac{\partial F}{\partial y_{i}} \frac{\partial y_{i}}{\partial α_{k}}

the above differential equation can be reformulated as

\sum_{i = 1}^{n} (\frac{d x_{i}}{d t} - \frac{\partial F}{\partial y_{i}}) \frac{d y_{i}}{d α_{k}} - (\frac{d y_{i}}{d t} + \frac{\partial F}{\partial x_{i}}) \frac{d x_{i}}{d α_{k}} = 0,

which shows that Equations (3) and (10) have the same solutions. Poincaré equation used Equation (10) to show that a transformation

(x, y) \to (x^{'}, y^{'})

is canonical if and only if the differential form

\sum_{i = 1}^{n} y_{i} d x_{i}

satisfies

\sum_{i = 1}^{n} y_{i} d x_{i} - y_{i}^{'} d x_{i}^{'} = d S

for some function

S (x, x^{'})

.

Among many other stellar advancements of that epoch, the following result of J. Liouville, reported in 1855 [6], seemed particularly influential for the present mathematics [4]. Liouville considered a differential system

\frac{d x}{d t} = \frac{\partial}{\partial y} F (t, x (t), y (t)), \frac{d y}{d t} = - \frac{\partial}{\partial x} F (t, x (t), y (t))

(11)

associated with a function

F (t, x_{1}, \dots, x_{n}, y_{1}, \dots, y_{n})

. He then assumed the existence of n integrals of motion

h_{1} (t, x, y), \dots, h_{n} (t, x, y)

such that the system of equations

h_{1} = h_{1} (t, x, y), h_{2} = h_{2} (t, x, y), \dots, h_{n} = h_{n} (t, x, y)

can be solved for

y_{1}, \dots, y_{n}

in the variables

t, x_{1}, \dots, x_{n}, h_{1}, \dots h_{n}

. He also imposed the condition that

h_{1}, \dots, h_{n}

are in involution, that is,

{h_{i}, h_{j}} = \sum_{k = 1}^{n} \frac{\partial h_{i}}{\partial x_{k}} \frac{\partial h_{j}}{\partial y_{h}} - \frac{\partial h_{i}}{\partial y_{k}} \frac{\partial h_{j}}{\partial x_{k}} = 0, 1 \geq i, j \leq n .

(12)

Liouville interpreted

\frac{d x_{i}}{d t} = \frac{\partial}{\partial y_{i}} F (t, x (t), y (t))

as the exactness condition for the differential form

\sum_{i = 1}^{n} y_{i} (t, x, h) d x_{i} - F (t, x, p (t, x, h)) d t

and concluded that there is a function

S (t, x, h)

such that

\sum_{i = 1}^{n} y_{i} d x_{i} - F (t, x, p (t, x, h)) d t = \frac{\partial S}{\partial x_{i}} d x_{i} + \frac{d S}{d t} d t,

that is,

y_{i} = \frac{\partial S}{\partial x}, \frac{d S}{d t} + F (t, x, \frac{\partial S}{\partial x}) = 0

. But then S can be used as the generating function for the canonical transformation

(x, y) \to (h, h^{'})

where

h^{'} = - \frac{\partial S}{\partial h}

. Liouville refers to

y_{i} = \frac{\partial S}{\partial x_{i}}, h_{i}^{'} = - \frac{\partial S}{\partial h_{i}}, i = 1, \dots, n

(13)

as a complete system. Indeed, in the new coordinates

h_{1}, \dots, h_{n}

remain constants of motion and therefore

\frac{d h_{i}}{d t} = 0, i = 1, \dots, n

. Since

\frac{d h_{i}}{d t} = \frac{\partial F}{\partial h^{'}}

, F is independent of

h^{'}

, that is, F is a function of t and h. But then

- \frac{\partial \tilde{F}}{\partial h}

is a given function of time, and

h^{'} (t)

is given by its integral. When F is a function of x and y, and not explicitly dependent on time, then

\tilde{F}

is only a function of h. Therefore, the general solution is given by

h (t) = h (0), h^{'} (t) = ω t + h^{'} (0), ω = - \frac{\partial \tilde{F}}{\partial h} .

(14)

This heritage from 19-th century mathematics forms a core of knowledge indispensable for problems of mathematical physics, symplectic geometry, calculus of variations and optimal control theory, and its unanswered questions still motivate much of the current research in integrable systems.

This paper will address the “hidden” symmetries that account for the existence of extra integrals of motion. We will show that the canonical integrable systems, such as Jacobi’s geodesic problem on the ellipsoid, Neumann’s mechanical problem on the sphere, Euler’s top, and the associated heavy tops, all derive their constants of motion from certain “meta” systems on Lie groups that admit isospectral representations of the form

\frac{d L_{λ}}{d t} (t) = [M_{λ} (t), L_{λ} (t)]

(15)

on the Lie algebra

g

of G.

We will confine our attention to semi-simple Lie groups G having a compact subgroup K, for then the Lie algebra

g

admits a decomposition

g = k + p

, where

k

the Lie algebra of K and

p

is the orthogonal complement of

k

relative to the Killing form

K l (A, B) = T r (a d A \circ a d (B)

. But then

[p, k] \subseteq p]

and therefore

g

as a vector space also carries the semi-direct Lie algebra

g_{s}

associated with the semi-direct product

G_{s} = p ⋊ K

. We will then single out a class of left-invariant variational problems on G that admit an isospectral representation with

L_{λ, s} = L_{p} - λ L_{k} + (λ^{2} - s) A,

(16)

where

s = 0

in the semi-direct case and

s = 1

in the semi-simple case,

L = L_{k} + Ł_{p}

,

L_{k} \in k, L_{p} \in p

, and where A is a fixed element in

p

. It is then known that the spectral invariants

ϕ_{λ, s}^{k} (L) = T r (L_{λ, s}^{k})

are in involution relative to the canonical Poisson bracket on

g

, respectively on

g_{s}

. We will show that these invariants shed light on the hidden symmetries that surround many of the aforementioned integrable systems. In the process we will be able to demonstrate that the quest for the geometric origins behind the “mysterious” integrals of motions also leads to new and unexpected encounters with problems of Riemannian and sub-Riemannian geometry in which geometric control theory plays a major role.

2. Symplectic Background, Hamiltonian Systems

The theoretic framework upon which above claims are made is rooted in symplectic geometry. Below is a brief summary of the theoretical ingredients required for our main results.

Recall that a manifold M together with a non-degenerate and closed 2-form

ω

is called symplectic. The symplectic form yields a correspondence between functions and vector fields: to every function f there is a vector field

\vec{f}

defined by

ω (\vec{f}, X) = d f (X)

for all vector fields X on M. Then

\vec{f}

is called the Hamiltonian vector field generated by f. Every symplectic manifold is even dimensional, and at each point of M there is a neighbourhood with coordinates

(x_{1}, \dots, x_{n}, p_{1}, \dots, p_{n})

on which Hamiltonian vector fields are given by

\vec{f} = \sum_{i = 1}^{n} \frac{\partial f}{\partial p_{i}} \frac{\partial}{\partial x_{i}} - \frac{\partial f}{\partial x_{i}} \frac{\partial}{\partial p_{i}} .

(17)

This choice of coordinates in which

\vec{f}

is given by (17) is called symplectic, or canonical in the terminology of the 19-th century.

Every cotangent bundle

T^{*} M

is a symplectic manifold with its canonical symplectic form,

ω = d p \land d x

in terms of the symplectic coordinates

(x_{1}, \dots, x_{n}, p_{1}, \dots, p_{n})

. As a symplectic manifold the cotangent bundle is special, in the sense that it is also a vector bundle. Hence every vector field X on M can be lifted to a unique Hamiltonian vector field

{\vec{f}}_{X}

in

T^{*} M

via the function

f_{X} (ξ) = ξ (X (x))

,

ξ \in T_{x}^{*} M

. Vector field

{\vec{f}}_{X}

is called the Hamiltonian lift of X. The same procedure is applicable to any time varying vector field, and by extension to any differential system on M. Thus any differential system in M can be lifted to a Hamiltonian system in

T^{*} M

. This fact is also important for problems of optimal control where the Maximum Principle singles out the appropriate Hamiltonian lifts that govern the optimal solutions [7].

When the base manifold is a Lie group G, and when the underlying differential system is either left or right invariant, then there is a special system of coordinates based on the representation of

T^{*} G

as

G \times g^{*}

, with

g^{*}

the dual of

g

. This coordinate system preserves the left invariant symmetries and elucidates the conserved quantities of the associated Hamiltonian systems. The passage to these coordinates and the associated formalism was amply documented in my earlier publications [7,8,9]. Below we will highlight the main points in this theory required for our results.

2.1. Left-Invariant Trivializations and the Symplectic Form

Having in mind applications that involve left-invariant variational systems the cotangent bundle

T^{*} G

and the tangent bundle

T G

will be viewed as the products

G \times g^{*}

and

G \times g

via the left-translations. More explicitly, tangent vectors

v \in T_{g} G

will be identified with pairs

(g, X) \in G \times g

via the relation

v = {L_{g}}_{*} X

, where

{L_{g}}_{*}

denotes the tangent map associated with the left translation

L_{g} (h) = g h

. Similarly, points

ξ \in T_{g}^{*} G

will be identified with pairs

(g, ℓ) \in G \times g^{*}

via

ξ = ℓ \cdot {L_{g}}_{*}^{- 1}

. Then

T (T^{*} G)

, the tangent bundle of the cotangent bundle

T^{*} G

, will be identified with

(G \times g^{*}) \times (g \times g^{*})

, with the understanding that an element

((g, ℓ), (A, a)) \in (G \times g^{*}) \times (g \times g^{*})

denotes the tangent vector

(A, a)

at the base point

(g, ℓ)

.

We will make use of the fact that

G \times g^{*}

is a Lie group in its own right since

g^{*}

, as a vector space, is an abelian Lie group. Then left-invariant vector fields V in

G \times g^{*}

will be denoted by

V (g, ℓ) = (g A, a)

,

(g, ℓ)

in

G \times g^{*}

. In this setting the canonical symplectic form on

T^{*} G

is given by

ω_{(g, ℓ)} (V_{1}, V_{2}) = a_{2} (A_{1}) - a_{1} (A_{2}) - ℓ ([A_{1}, A_{2}])

(18)

for any left-invariant vector fields

V_{1} = (g A_{1}, a_{1})

and

V_{2} = (g A_{2}, a_{2})

[7]. The above form is invariant under the left-translations in

G \times g^{*}

, and is especially revealing for the Hamiltonian vector fields generated by left-invariant functions on

G \times g^{*}

.

A function H on

G \times g^{*}

is left-invariant if

H (h g, ℓ) = H (g, ℓ)

for all

g, h \in G

and all

ℓ \in g^{*}

. That is, left-invariant functions coincide with functions of

g^{*}

. Each left-invariant vector field

X (g) = g A

on G lifts to a linear function

ℓ \to ℓ (A)

on

g^{*}

because

h_{X} (ξ) = ξ (X (g)) = ℓ \circ {L_{g}}_{*}^{- 1} \circ {(L_{g})}_{*} (A) = ℓ (A), ξ \in T_{g}^{*} G .

Functions H on

g^{*}

generate Hamiltonian vector fields

\vec{H}

on

G \times g^{*}

whose integral curves are the solutions of

\frac{d g}{d t} (t) = g (t) d H_{ℓ (t)}, \frac{d ℓ}{d t} (t) = - {ad}^{*} d H_{ℓ (t)} (ℓ (t)) .

(19)

In a more general case, where H depends on both

g \in G

and

ℓ \in g^{*}

, the integral curves of

\vec{H}

are the solutions of

\frac{d g}{d t} (t) = g (t) d H_{ℓ (t)}, \frac{d ℓ}{d t} (t) = - {ad}^{*} d H_{ℓ (t)} (ℓ (t)) - d H_{g} \circ {L_{g}}_{*},

(20)

that can be easily shown through the relations

b (d H_{ℓ}) + d H_{g} \circ {L_{g}}_{*} B = b (A) - a (B) - ℓ [A, B] .

This situation occurs in problems of mechanics in the presence of potential functions. For example, the movements of a three-dimensional rigid body with a potential function

V : S O (3) \to R

are described by the Hamiltonian

H (R, ℓ) = H_{0} (ℓ) + V (α_{1}, α_{2}, α_{3})

on the cotangent bundle of

S O (3)

, where

α_{1}, α_{2}, α_{3}

denote the columns of the matrix transpose of the rotation R in

S O (3)

. For then the directional derivative of V in the direction

R X

is given by

d V (R X) = \sum_{i = 1}^{3} ⟨ \frac{\partial V}{\partial α_{i}} \land α_{i}, X ⟩

where

⟨, ⟩

denotes the standard inner product

- \frac{1}{2} T r (X Y)

in

so (3)

. Thus

d H_{g} \circ d L_{g} = \sum_{i = 1}^{3} \frac{\partial V}{\partial α_{i}} \land α_{i}

and the equations of motion for H are given by

\frac{d g}{d t} (t) = g (t) d H_{0} (ℓ (t)), \frac{d ℓ}{d t} (t) = - {ad}^{*} d H_{0} (ℓ (t)) (ℓ (t)) + \sum_{i = 1}^{3} α_{i} \land \frac{\partial V}{\partial α_{i}} .

(21)

These equations extend to an “n-dimensional rigid body” with the Hamiltonian

H (R, ℓ) = H_{0} (ℓ) + V (α_{1}, \dots, α_{n})

where

\begin{matrix} \frac{d R}{d t} = R (t) Ω (t), \frac{d M}{d t} = [Ω (t), M (t)] + \sum_{i = 1}^{n} α_{i} \land \frac{\partial V}{\partial α_{i}} \\ P (Ω (t)) = M (t), α_{i} (t) = R^{T} (t) e_{i}, i = 1, \dots, n . \end{matrix}

(22)

In this context,

M (t)

is the generalization of the angular momentum,

Ω (t)

is the generalization of the angular velocity,

P

is the generalized inertia tensor, and

\sum_{i = 1}^{n} α_{i} \land \frac{\partial V}{\partial α_{i}}

is the external torque.

2.2. Poisson Manifolds, Coadjoint Orbits

Equation (19) lend themselves to an insightful description in terms of the Poisson structure on

g^{*}

inherited from the symplectic form

ω

. Recall that a manifold M together with a bilinear, skew-symmetric form

{,} : C^{\infty} (M) \times C^{\infty} (M) \to C^{\infty} (M)

that satisfies

\begin{matrix} {f g, h} = f {g, h} + g {f, h}, (L e i b n i z^{'} s r u l e), a n d \\ {f, {g, h}} + {h, {f, g}} + {g, {h, f}} = 0, (J a c o b i^{'} s i d e n t i t y), \end{matrix}

for all functions

f, g, h

on M, is called a Poisson manifold.

Every symplectic manifold is also a Poisson manifold with the Poisson bracket given by

{f, g} (p) = ω_{p} (\vec{f} (p), \vec{g} (p)), p \in M

. However, the converse may not be true due to the fact that the Poisson bracket may be degenerate at some points of M. Nevertheless, each function f on M induces a Poisson vector field

\vec{f}

through the formula

\vec{f} (g) = {f, g}

as in the symplectic case. Poisson vector fields clarify the relation with symplectic manifolds through the following fundamental fact: every Poisson manifold is foliated by the orbits of its family of Poisson vector fields and each orbit is a symplectic submanifold of M with its symplectic form

ω_{p} (\vec{f}, \vec{h}) = {f, h} (p)

[7].

The dual

g^{*}

of a Lie algebra

g

is a Poisson manifold with the Poisson bracket

{f, h} (ℓ) = ℓ ([d h, d f])

(23)

for any functions f and h on

g^{*}

. In the literature on integrable systems the bracket

{f, h} (ℓ) = ℓ ([d f, d h])

is known as the Lie-Poisson bracket [10]. We have taken its negative to be compatible with the projections of left-invariant Hamiltonian vector fields on

g^{*}

(and also to agree with the sign conventions in [7]).

It follows that each function H on

g^{*}

defines a Poisson vector field

\vec{H}

on

g^{*}

via the formula

\vec{H} (f) (ℓ) = {H, f} (ℓ) = - ℓ ([d H, d f])

in which case the integral curves of

\vec{H}

are the solutions of

\frac{d ℓ}{d t} (t) = - {ad}^{*} d H_{ℓ (t)} (ℓ (t)) .

(24)

Thus, as we already mentioned above, each function H on

g^{*}

may be simultaneously viewed as a Hamiltonian on

T^{*} G

, and a function on the Poisson space

g^{*}

. Of course, Poisson equations coincide with the projections of the Hamiltonian equations on

g^{*}

.

Solutions of Equation (24) are intimately linked with the coadjoint orbits of G through the following proposition. due to of A.A. Kirillov [11] (the proof is also given in [7]).

Proposition 1.

Let

F

denote the family of Poisson vector fields on

g^{*}

and let

M = O_{F} (ℓ_{0})

denote the orbit of

F

through a point

ℓ_{0} \in g^{*}

. Then M is equal to the connected component of the coadjoint orbit of G that contains

ℓ_{0}

. Consequently each coadjoint orbit is a symplectic submanifold of

g^{*}

.

Recall that the coadjoint orbit of G through a point

ℓ \in g^{*}

is given by

{Ad}_{g}^{*} (ℓ) = {ℓ \circ {Ad}_{g^{- 1}}, g \in G} .

The fact that the Poisson equations can be naturally restricted to coadjoint orbits implies useful reductions in the theory of Hamiltonian systems.

2.3. Representation of Coadjoint Orbits on Lie Algebras

On semi-simple Lie groups Poisson Equation (24) can be expressed on

g

as

\frac{d L}{d t} = [d H, L],

(25)

because the Killing form, or any scalar multiple of it

⟨, ⟩

is non-degenerate, and invariant, in the sense that,

⟨ X, [Y, Z] ⟩ = ⟨ [X, Y], Z ⟩, X, Y, Z \in g

, and can be used to identify

g

with

g^{*}

via the formula

⟨ L, X ⟩ = ℓ (X), ℓ \in g^{*}, X \in g .

Then coadjoint orbits are identified with the adjoint orbits and the Poisson vector fields

{\vec{f}}_{X} (ℓ) = - {ad}^{*} X (ℓ)

are identified with vector fields

\vec{X} (L) = [X, L]

. Each vector field

[X, L]

is tangent to an orbit at L, and

ω_{L} ([X, L], [Y, L]) = ⟨ L, [Y, X] ⟩

,

X, Y

in

g

is the symplectic form on each orbit

O (L_{0})

.

In a reductive semi-simple Lie group G there is also the semi-direct product

G_{0} = p ⋊ K

described earlier which generates its own coadjoint orbits on the dual of the Lie algebra

g_{0}

of

G_{0}

. Recall that the Lie algebra

g_{0}

of

G_{0}

consists of pairs

(A, B), A \in p, B \in k

together with the Lie bracket

[(A_{1}, B_{1}), (A_{2}, B_{2})] = ([A_{1}, B_{2}] - [A_{2}, B_{1}], [B_{1}, B_{2}]) .

When the elements

(A, B) \in g_{0}

are identified with the sums the sums

A + B

in

g

,

g

as a vector space, carries a double Lie algebra; the semi-direct product Lie algebras

g_{0}

, and the semi-simple Lie algebra

g_{1} = g

. We then have

[A + B, C + D] = [A, B] s + [A, D + [B, C] + [A, D], s = 0, 1,

for any

A, C

in

p

and any

B, D

in

k

, with

s = 0

in the semi-direct case, and

s = 1

in the semi-simple case.

Since both

g

and

g_{0}

Lie algebras over the same vector space, the Poisson equations on

g_{0}^{*}

can be also represented on

g_{0}

via the quadratic form

⟨, ⟩

, but the resulting expression takes a slightly different form. To see the difference, let

d H = d H_{p} + d H_{k}

and

L = L_{p} + L_{k}

denote the decompositions of

d H

and L onto the factors

p

and

k

. On the semi-direct product Poisson equations reduce to

\frac{d L_{k}}{d t} = [d H_{k}, L_{k}] + [d H_{p}, L_{p}], \frac{d L_{p}}{d t} = [d H_{k}, L_{p}] .

(26)

This equation can be combined with the equations for the semi-simple case in terms of the parameter s as

\frac{d L_{k}}{d t} = [d H_{k}, L_{k}] + [d H_{p}, L_{p}], \frac{d L_{p}}{d t} = [d H_{k}, L_{p}] + s [d H_{p}, L_{k}], s = 0, 1 .

(27)

One can show that

P = A d_{h} (P_{0}), Q = [A d_{h} (P_{0}), X] + A d_{h} (Q_{0}), (X, h) \in G_{0}

(28)

is the coadjoint orbit through

P_{0} \in p, Q_{0} \in k

under the action of

G_{0} = p ⋊ K

when

ℓ_{0} \in g_{s}^{*}

is identified with

L_{0} = P_{0} + Q_{0}

in

g_{0}

, and when

ℓ = {Ad}_{(X, h)}^{*} (ℓ_{0})

is identified with

L = P + Q

[7].

The adjoint orbits of a non-compact semi-simple Lie groups G can be realized as the cotangent bundles of flag manifolds [12], and the same has been shown recently for the coadjoint orbits under the action of the semi-direct products [13,14]. We will make use of that fact later on in the paper.

3. Affine-Quadratic Problems

As stated earlier, we will restrict our attention to semi-simple Lie groups G and compact subgroups K with zero centre. We refer to

(G, K)

as a reductive pair. Then

g

and

k

will denote their Lie algebras, and

p

will denote the orthogonal complement of

k

in

g

relative to the Killing

K l (A, B) = T r (a d A \circ a d (B))

in

g

. Recall that

K l

is non-degenerate and satisfies

K l (A, [B, C]) = K l ([A, B], C), A, B, C in g .

Hence

p

is well defined and satisfies

[p, k] \subseteq p

( in fact,

[p, k] = p

because

g

is semi-simple). We will also assume that

[p, p] = k

. Note that the Killing form is negative-definite on

k

because K has zero centre [15], hence any negative scalar multiple

⟨, ⟩

of it is positive definite on

k

. We shall assume that such a scalar product is fixed.

An affine quadratic problem is defined through a positive definite quadratic form Q on

k

, and a regular element A in the Cartan space

p

. An element A in

p

is called regular if

{X \in p : [A, X] = 0}

is an abelian subalgebra in

p

. The corresponding affine-quadratic problem consists of finding the solutions

g (t)

in G of the affine control system

\frac{d g}{d t} = g (t) (A + U (t)),

(29)

generated by a square-integrable control

U (t)

in

k

that transfers a given state

g_{0}

in G to a given terminal state

g_{1}

in time T with a minimal energy

\frac{1}{2} \int_{0}^{T} Q (U (s)) d s

. Any positive definite quadratic form Q is of the form

Q (U) = \frac{1}{2} ⟨ P (U), U ⟩

for some self-adjoint and positive linear operator

P

on

k

. Then there exists an orthonormal basis

U_{1}, \dots, U_{k}

in

k

such that

P

is diagonal relative to it. That is, if

U (t) = \sum_{i = 1}^{k} u_{i} (t) U_{i}

then

P (U (t)) = \sum_{i = 1}^{k} c_{i} u_{i} (t) U_{i}

for some constants

c_{1}, \dots, c_{k}

. Then (29) can be rewritten as as

\frac{d g}{d t} = X_{0} (g) + \sum_{i = 1}^{k} u_{i} (t) X_{i} (g),

(30)

where

X_{0}, \dots, X_{k}

are the left-invariant vector fields with

X_{0} (g) = g A

and

X_{i} (g) = g U_{i}, i = 1, \dots, k

, with

\frac{1}{2} \int_{0}^{T} \sum_{i = 1}^{k} c_{i} u_{i}^{2} (t) d t .

the energy associated with each solution. The most natural case occurs when

P = I

, that is, when

c_{i} = 1, i = 1, \dots, k

. We will refer to this case as the canonical affine-quadratic problem.

When A is regular, then (29) is controllable, a consequence of our assumption

[p, p] = k

, that is, any terminal state

g_{1}

can be reached in some finite time

T > 0

from any initial state

g_{0}

. But then there is an optimal solution

(\bar{g} (t), \bar{U} (t))

on the interval

[0, T]

for which the energy of transfer

\int_{0}^{T} Q (\bar{U} (s)) d s

is minimal (see [7] for the proof). Therefore the above optimal control problem is well-posed.

To each affine-quadratic problem there is an analogous “shadow problem” defined on the semi-direct product

G_{o} = p ⋉ K

defined by the same data as in the original problem. It follows that every affine space

Γ = {A + U : U \in k}

that defines an affine left-invariant system on G also defines a corresponding left-invariant affine system on the semi-direct product

G_{0}

. Thus behind every affine quadratic optimal problem on G there is a corresponding affine-quadratic “shadow” problem on the semi-direct product

G_{s}

. The shadow problem is also well defined in the sense that optimal solutions exist on some interval

[0, T]

for each pair of boundary points

g (0) = g_{0}

and

g (T) = g_{1}

.

According to Pontriyagin’s Maximum Principle every optimal trajectory generated by a bounded and measurable control is the projection of an extremal curve, and each extremal curve is an integral curve of a suitable Hamiltonian system on the cotangent bundle of the ambient space. The Maximum Principle is also valid for optimal problems with

L^{2}

controls over affine systems with quadratic costs ([16]).

Let now

g (t)

be an optimal trajectory generated by a control

u (t)

. According to the Maximum Principle,

g (t)

is the projection of an extremal curve

ξ (t)

in

T^{*} G

along which the cost extended Hamiltonian

- \frac{λ}{2} \sum_{i = 1}^{k} c_{i} u_{i}^{2} (t) + H_{0} (ξ) + \sum_{i = 1}^{k} u_{i} (t) H_{i} (ξ (t)), λ = 0, 1

is maximal at

u (t)

relative to all competing controls

u (t)

. In this notation, each

H_{i}

is the Hamiltonian lift of

X_{i}

, i.e.,

H_{i} (ξ (t)) = ξ (t) (X_{i} (g (t)))

. In the abnormal case, which we will not treat here,

λ = 0

, and the Maximum principle results in the constraints

H_{i} (ξ (t)) = 0, i = 1, \dots, k .

In the normal case,

λ = 1

, the maximality condition implies that the optimal controls are of the form

u_{i} (t) = \frac{1}{c_{i}} H_{i} (ξ (t)), i = 1, \dots, k

. Consequently, optimal solutions are the projections of solution curves of a single Hamiltonian vector field

\vec{H}

generated by the Hamiltonian

H (ξ) = \frac{1}{2} \sum_{i = 1}^{k} \frac{1}{c_{i}} H_{i}^{2} (ξ) + H_{0} (ξ) = \frac{1}{2} \sum_{i = 1}^{k} \frac{1}{c_{i}} {(ℓ (U_{i}))}^{2} + ℓ (A) .

(31)

Recall that each lift

H_{i} (ξ)

is a linear function on

g^{*}

given by

H_{i} (ξ) = ℓ (U_{i})

with

H_{0} (ξ) = ℓ (A)

. Thus H is left-invariant, hence its Hamiltonian equations are given by

\frac{d g}{d t} = X_{0} (g) + \sum_{i = 1}^{n} \frac{1}{c_{i}} H_{i} (ℓ (t)) X_{i} (g (t)), \frac{d ℓ}{d t} = - a d^{*} d H (ℓ (t)) (ℓ (t)) .

The associated Poisson equations can be now written in

g

as

\frac{d L_{k}}{d t} = [P^{- 1} (L_{k}), L_{k}] + [A, L_{p}], \frac{d L_{p}}{d t} = [P^{- 1} (L_{k}), L_{p}] + s [A, L_{k}], s = 0, 1,

(32)

after the identification of

ℓ \in g^{*}

with

L \in g

via the scalar product

⟨, ⟩

, and the decomposition

L = L_{k} + L_{p}, L_{k} \in k, L_{p} \in p

(Equation (27)). In the canonical case (

P = I

) the preceding equations reduce to

\frac{d L_{k}}{d t} = [A, L_{p}], \frac{d L_{p}}{d t} = [L_{k}, L_{p}] + s [A, L_{k}], s = 0, 1 .

(33)

Note that

s ⟨ L_{k}, L_{k} ⟩ + ⟨ L_{p}, L_{p} ⟩

is an integral for (32). This integral is a universal integral of motion in the sense that it remains constant for any left-invariant Hamiltonian on

g_{s}

.

3.1. Isospectral Representations

We now single out a remarkable class of affine-quadratic Hamiltonians that plays a prominent role in the theory of integrable systems. It consists of Hamiltonians

H = \frac{1}{2} ⟨ P^{- 1} L_{k}, L_{k} ⟩ + ⟨ L_{p}, A ⟩

that admit a spectral representation of the form

\begin{matrix} \frac{d L_{λ}}{d t} = [M_{λ}, L_{λ}], \\ with M_{λ} = P^{- 1} (L_{k}) - λ A, and L_{λ, s} (L) = L_{p} - λ L_{k} + (λ^{2} - s) B, \end{matrix}

(34)

for some element

B \in p

that comutes with A, where

L_{p}

and

L_{k}

are the solutions of the Poisson Equation (32). Such a class is called isospectral and

L_{λ} (s)

is called the associated spectral curve. This terminology has origins in J. Zimmerman’s PhD thesis in 2002, in which he showed that the rolling sphere problem is isospectral [17]. We will return to Zimmerman’s problem and relate its results to the canonical affine-quadratic problem [18].

For Hamiltonian systems that admit an isospectral representation, the discrete spectral invariants of L are replaced by the functional invariants

ϕ_{λ, s}^{(k)} (L) = T r a c e (L_{λ, s}^{k} (L))

. Remarkably, the functional invariants

ϕ_{λ, s}^{k}

are in involution with each other, both with respect to the semi-simple and the semi-direct product Lie bracket, and in some instances generate a sufficient number of integrals of motion to ensure complete integrability ([7], 9.2). For instance, the family of functions

F_{0} = {ϕ_{λ, 0}^{k}, k \geq 1, λ \in R} \cup {h_{X} : [X, B] = 0, X \in k}

is completely integrable on each coadjoint orbit in

p ⋊ K

[19]. This means that H is completely integrable on each coadjoint orbit in

p ⋊ k

whenever H is in involution with the Hamiltonian lifts

h_{X} (L) = ⟨ L, X ⟩, X \in k, [X, B] = 0

. This implies that the canonical affine Hamiltonian is completely integrable on coadjoint orbits since each left-invariant vector field with values in the isotropy group of A is a symmetry for the canonical system. It is reasonable to expect that the analogous family of functions is also completely integrable on coadjoint orbits of G, but, to the best of my knowledge, the proofs have not yet appeared in the literature.

The focus on the affine-quadratic problem and the associated Hamiltonians allows for the following characterization of isospectral Hamiltonians (proved in [7]).

Theorem 1.

An affine Hamiltonian

H = \frac{1}{2} ⟨ P^{- 1} L_{k}, L_{k} ⟩ + ⟨ L_{p}, A ⟩

is isospectral if and only

[P^{- 1} (L_{k}), B] = [L_{k}, A]

for some element

B \in p

that commutes with A. In the isospectral case,

L_{p} = s B

is an invariant set for equations (32). On this set (32) are given by

\frac{d L_{k}}{d t} = [P^{- 1} (L_{k}), L_{k}],

(35)

and admit the reduced spectral representation

\frac{d}{d t} (L_{k} - λ B) = [P^{- 1} (L_{k}) - λ A, L_{k} - λ B] .

(36)

This theorem shows that the fundamental results A.T. Fomenko, A. S. Mischenko, and V.V. Trofimov on integrable left-invariant Riemannian metrics on compact Lie groups [20,21] based on Manakov’s seminal work on the n-dimensional Euler’s top [22] are subordinate to the isospectral properties of the affine Hamiltonian system, in the sense that the spectral invariants of

L_{k} - λ B

on

k

are always in involution with a larger family of functions generated by the spectral invariants of

L_{λ} = - L_{p} + λ L_{k} + (λ^{2} - s) B

on

g_{s}

associated with an affine Hamiltonian H.

3.2. Affine Hamiltonians and Mechanical Tops

Let us now draw comparisons between the semi-direct Poisson equations

\frac{d L_{k}}{d t} = [P^{- 1} (L_{k} (t)), L_{k} (t)] + [A, L_{p} (t)], \frac{d L_{p}}{d t} = [P^{- 1} (L_{k} (t)), L_{p} (t)]

(37)

and the “top-like” equations:

\frac{d R}{d t} = R (t) (P^{- 1} (M (t))), \frac{d M}{d t} = [P^{- 1} (M (t)), M (t)] + \sum_{i = 1}^{n} α_{i} (t) \land \frac{\partial V}{\partial α_{i}},

(38)

associated with the Hamiltonian

H = \frac{1}{2} ⟨ P^{- 1} (M), M ⟩ + V (α_{1}, \dots, α_{n})

. We will consider two cases- tops with linear potentials and tops with quadratic potentials.

Linear potentials. Equation (38) will be referred to heavy top-like equations when the potential energy V is generated by a linear Newtonian field, that is, when

V = - \sum_{i = 1}^{n} c_{i} (α_{i}, a)

, where a is a vector in

R^{n}

, and

c_{1}, \dots, c_{n}

are constants. When

a = 0

, the external torque

\sum_{i = 1}^{n} α_{i} (t) \land \frac{\partial V}{\partial α_{i}}

is equal to zero, and Equation (38) reduces to the Hamiltonian equation associated with a left-invariant Riemannian metric induced by the operator

P

(called the n-dimensional Euler’s top in some Russian literature [20]).

Heavy top-like equations can be written more compactly as

\frac{d R}{d t} = R (t) Ω (t), \frac{d M}{d t} = [Ω (t), M (t)] + a \land p (t),

(39)

where

Ω (t) = P^{- 1} M (t)

, and

p (t) = \sum_{i = 1}^{n} c_{i} α_{i} (t)

. Since

α_{i} (t) = R {(t)}^{T} e_{i}

,

p (t)

is a solution of

\frac{d p}{d t} = - Ω (t) p (t)

. Hence each solution resides on the sphere

{p \in R^{n} : | | p (t) | | = | | p (0) | |}

.

Our theorems below relate Equation (39) to the Poisson Equation (37) on the reductive Lie algebras

so (n + 1)

and

so (1, n)

associated with reductive pairs

(S O_{ϵ}, K)

where

S O_{ϵ}

is

S O (n + 1)

when

ϵ = 1

and

S O (1, n)

when

ϵ = - 1

, and

K = {1} \times S O (n)

.

We will tackle both cases simultaneously but first we will need to introduce additional notation and terminology. We will use

{so}_{ϵ}

to denote the Lie algebra of

S O_{ϵ}

endowed with the trace form

⟨ A, B ⟩ = - \frac{1}{2} T r (A B)

. Relative to

S O_{ϵ}

we define its invariant bilinear form

{(x, y)}_{ϵ} = x_{0} y_{0} + ϵ \sum_{i = 1}^{n} x_{i} y_{i}

in the ambient space

R^{n + 1}

.

Then

a \otimes_{ϵ} b, a \in R^{n + 1}, b \in R^{n + 1}

, will denote the matrix defined by

(a \otimes_{ϵ} b) x = {(a, x)}_{ϵ} b, x \in R^{n + 1} .

and

a \land_{ϵ} b

denotes the matrix

a \otimes_{ϵ} b - b \otimes_{ϵ} a

. Since

{((a \land_{ϵ} b) x, y)}_{ϵ} + {(x, (a \land_{ϵ} b) y)}_{ϵ} = 0,

a \land_{ϵ} b

belongs to

{so}_{ϵ} (n + 1)

for any

a, b

in

R^{n + 1}

. We then have

Theorem 2.

Heavy top-like Equation (39) are isomorphic to the Poisson Equation (37) on the coadjoint orbit through

P_{0} = p (0) \land_{ϵ} e_{0}, Q_{0} = (\begin{matrix} 0 & 0 \\ 0 & M (0) \end{matrix})

under the coadjoint action of

p_{ϵ} ⋉ S O (n)

. The passage to the affine Hamiltonian is via the following correspondences

A = ϵ a \land_{ϵ} e_{0}, L_{p} = p \land_{ϵ} e_{0}, L_{k} = (\begin{matrix} 0 & 0 \\ 0 & M \end{matrix}), P^{- 1} (L_{k}) = (\begin{matrix} 0 & 0 \\ 0 & P^{- 1} (M) \end{matrix}) .

(40)

For a proof see [14]. The preceding theorem clarifies the presence of heavy tops in the Hamiltonian equations on Lie algebras [10]. It also clarifies the relation between the tops and elastic rods initiated by G. Kirchhoff known as the “kinetic analogues” [23,24]. It also proves that the classification of completely integrable elastic rods in [7,8] carries over to the heavy tops.

Quadratic potentials. We will now show that the tops with quadratic potential V are also present in the equations of affine Hamiltonians, but this time on the tangent bundle of

S L (n)

, or more precisely on the tangent bundle of the semi-direct product

s y m^{0} (n) ⋊ S O (n)

where

s y m^{0} (n)

denotes the space of symmetric

n \times n

matrices with zero trace. For that purpose let

H (R, M) = \frac{1}{2} (P^{- 1} (M), M) + \frac{1}{2} \sum_{i = 1}^{n} a_{i} ⟨ S α_{i}, α_{i} ⟩,

with

R \in S O (n), M \in s o (n)

,

R^{T} e_{i} = α_{i}

, and S a symmetric

n \times n

. In accordance with (38) the Hamiltonian equations of

\vec{H}

are given by

\frac{d R}{d t} = R (t) Ω (t), \frac{d M}{d t} = [Ω (t), M (t)] + \sum_{i = 1}^{n} a_{i} α_{i} (t) \land S α_{i} (t),

(41)

where

Ω (t) = P^{- 1} (M (t))

.

Theorem 3.

Top-like Equation (41) are isomorphic with the Poisson equations generated by the affine Hamiltonian

H = \frac{1}{2} ⟨ P^{- 1} (L_{k}), L_{k} ⟩ + ⟨ L_{p}, S ⟩

on the coadjoint orbit through

P_{0} = \sum_{i = 1}^{n} a_{i} (e_{i} \otimes e_{i}) - (\frac{1}{n} \sum_{i = 1}^{n} a_{i}) I

and

Q_{0} = M (0)

under the action of the semi-direct product

s y m^{0} (n) ⋊ S O (n)

.

Proof.

Every solution

(M (t), R (t))

of (41) generates symmetric matrices

L_{p} (t)

and

X (t)

given by

\begin{matrix} L_{p} (t) = A d_{h (t)} P_{0} = \sum_{i = 1}^{n} a_{i} (α_{i} (t) \otimes α_{i} (t)) - \frac{1}{n} \sum_{i = 1}^{n} a_{i} I, \\ X (t) = {Ad}_{h (t)} Y (t), Y (t) = - \int_{0}^{t} {Ad}_{h^{- 1} (s)} S d s, \end{matrix}

with

h (t) = R^{T} (t)

. Then,

\begin{matrix} \frac{d L_{p}}{d t} = - \sum_{i = 1}^{n} a_{i} (Ω α_{i} \otimes α_{i} + α_{i} \otimes Ω α_{i}) = [Ω, L_{p}], \\ \frac{d X}{d t} = [Ω (t), X (t)] + {Ad}_{h (t)} \dot{Y} = [Ω (t), X (t)] - S . \end{matrix}

Additionally,

\begin{matrix} [S, L_{p} (t)] = \sum_{i = 1}^{n} (a_{i} (α_{i} \otimes α_{i}) S - S a_{i} (α_{i} \otimes α_{i})) = \\ \sum_{i = 1}^{n} a_{i} α_{i} \otimes S α_{i} - a_{i} S α_{i} \otimes α_{i} = \sum_{i = 1}^{n} a_{i} (α_{i} \land S α_{i}), \end{matrix}

which in turn implies that (41) can be written as

\frac{d R}{d t} = R (t) Ω (t), \frac{d M}{d t} = [Ω (t), M (t)] + [S, L_{p} (t)] .

Let now

Q (t) = [{Ad}_{h (t)} (P_{0}), X (t)] + {Ad}_{h (t)} Q_{0} = [L_{p} (t), X (t)] + {Ad}_{h (t)} Q_{0}

. Note first that

\begin{matrix} [[Ω, L_{p}], X] = - [[X, Ω], L_{p}] - [[L_{p}, X], Ω] \\ = - [[X, Ω], L_{p}] + [Ω, Q] - [Ω, {Ad}_{h} Q_{0}] . \end{matrix}

Then,

\begin{matrix} \frac{d Q}{d t} = [[Ω (t), L_{p} (t)], X (t)] + [L_{p} (t), \frac{d X}{d t} (t)] + [Ω (t), {Ad}_{h (t)} (Q_{0}] = \\ [Ω (t), Q (t)] + [L_{p}, [X, Ω (t)]] + [L_{p}, \frac{d X}{d t}] = \\ [Ω (t), Q (t)] + [S, L_{p}] . \end{matrix}

Therefore

Q (t)

and

M (t)

satisfy the same differential equation. Hence

Q (t) = M (t)

whenever

Q_{0} = M (0)

. If we now rename

Q (t)

as

L_{k} (t)

we get the Poisson equations for the shadow Hamiltonian

H = \frac{1}{2} ⟨ P^{- 1} (L_{k}), L_{k} ⟩ + ⟨ S, L_{p} ⟩ .

□

The preceding theorem links isospectral Hamiltonians to the equations of the top under quadratic potentials and paves a way to the n-dimensional generalization of O. Bogoyavlensky’s famous result on integrability of three-dimensional mechanical tops in the presence of a quadratic potential [25]. The path to isospectral Hamiltonians is provided by Manakov’s observation that the inertia tensor

⟨ P (U), U ⟩

for a rigid body is confined to the transformations

P (U) = S U + U S

, for some positive definite matrix S. For then,

[P^{- 1} (M), S^{2}] = [M, S]

. Indeed, in this situation

P (U) = S U + U S = M

, and

[P^{- 1} M, S^{2}] = [U, S^{2}] = [S U + U S, S] = [M, S] .

Hence the corresponding affine Hamiltonian

\hat{H} = \frac{1}{2} ⟨ P^{- 1} L_{k}, L_{k} ⟩ + ⟨ S, L_{p} ⟩

is isospectral on

sl (n)

(Theorem 1). Since the equations of the Hamiltonian

H = \frac{1}{2} ⟨ P^{- 1} M, M ⟩ + \sum_{i = 1}^{n} a_{i} (α_{i}, S α_{i})

corresponding to the top with quadratic potential

V = \frac{1}{2} \sum_{i = 1}^{n} a_{i} (S α_{i}, α_{i})

can be identified with the Poisson equations of

\hat{H}

on the coadjoint orbit through

L_{p} = P_{0}, L_{k} = M (0)

, the isospectral invariants of

L_{λ} = \sum_{i = 1}^{n} a_{i} α_{i} - λ M + λ^{2} S

(42)

are integrals of motion for the top. (Theorem 3). Since

{X \in so (n) : [X, S] = 0} = 0

for each non-singular symmetric matrix S, the spectral invariants of

L_{λ} (s) = \sum_{i = 1}^{n} a_{i} (α_{i} \otimes α_{i} - λ M + (λ^{2} - s) S

(43)

form a completely integrable family of functions on each coadjoint orbit in

sl (n)

(semi-simple and semi-direct). the top with a quadratic potential is completely integrable in all dimensions.

3.3. Three-Dimensional Tops- Kirchhoff-Kowalewski Type

We will now turn our attention to the class of affine-quadratic systems of Kirchhoff-Kowalewski type on complex Lie algebras with a particular interest on the symmetries that account for the existence of Kowalewski’s integral reported in her seminal paper on the motions of a rigid body around a fixed point under the influence of gravity [26]. We will follow our recent paper [27] and show that there is a natural Hamiltonian on

sp (4, C)

that answers the fundamental questions raised by Kowalewki’s paper, namely, what is the geometric rational behind her approach in which all the variables were treated as complex quantities, and secondly. what are the symmetries that account for the existence of not only her integral of motion, but also of similar integrals, known as Kowalewski type integrals, that subsequently appeared in the literature on integrable systems [8,28,29,30,31].

Theorem 2 suggests that the search for the answers to the above questions should begin with the Poisson equations associated with an affine-quadratic Hamiltonian on

so (4, C)

since both

so (1, 3)

and

so (4)

are real forms for

so (4, C)

(see also [24]). We will show that Kowalewski’s “mysterious” change of variables appear naturally in the passage from

so (4, C)

to

sl (2, C) \times sl (2, C)

an important intermediate step towards the right Hamiltonian on

sp (4, C)

. The journey from

so (4, C)

to

sp (4, C)

to this remarkable Hamiltonian begins with

H = \frac{1}{2} (\frac{m_{1}}{λ_{1}} + \frac{m_{2}}{λ_{2}} + \frac{m_{3}}{λ_{3}}) + b_{1} p_{1} + b_{2} p_{2} + p_{3} b_{3},

(44)

where

L = m_{1} A_{1} + m_{2} A_{2} + m_{3} A_{3} + p_{1} B_{1} + p_{2} B_{2} + p_{3} B_{3}

is the coordinate representation of a point L in

so (4, C)

relative to an orthonormal basis

A_{1}, A_{2}, A_{3}, B_{1}, B_{2}, B_{3}

that conforms to the following Lie bracket Table 1:

Table 1. Lie brackets for

s = 0, 1

.

Then

\frac{d L_{k}}{d t} = [d H_{k}, L_{k}] + [B, L_{p}], \frac{d L_{p}}{d t} = [d H_{k}, L_{p}] + s [B, L_{k}], s = 0, 1,

(45)

are the Poisson equations generated by H, where

B = b_{1} B_{1} + b_{2} B_{2} + b_{3} B_{3}

denote the drift element in

p

,

d H_{k} = \sum_{i = 1}^{3} \frac{m_{i}}{λ_{i}} A_{i}

,

L_{k} = \sum_{i = 1}^{3} m_{i} A_{i}

, and

L_{p} = \sum_{i = 1}^{3} p_{i} B_{i}

. The same equations can be also expressed as

\begin{matrix} \frac{d m_{1}}{d t} = m_{2} m_{3} (\frac{1}{λ_{3}} - \frac{1}{λ_{2}}) + p_{2} b_{3} - p_{3} b_{2}, \\ \frac{d m_{2}}{d t} = m_{1} m_{3} (\frac{1}{λ_{1}} - \frac{1}{λ_{3}}) + p_{3} b_{1} - p_{1} b_{3}, \\ \frac{d m_{3}}{d t} = m_{1} m_{2} (\frac{1}{λ_{2}} - \frac{1}{λ_{1}}) + p_{1} b_{2} - p_{2} b_{1}, \\ \frac{d p_{1}}{d t} = \frac{1}{λ_{3}} p_{2} m_{3} - \frac{1}{λ_{2}} p_{3} m_{2} + s (m_{2} b_{3} - m_{3} b_{2}), \\ \frac{d p_{2}}{d t} = \frac{1}{λ_{1}} p_{3} m_{1} - \frac{1}{λ_{3}} p_{1} m_{3} + s (m_{3} b_{1} - m_{1} b_{3}), \\ \frac{d p_{3}}{d t} = \frac{1}{λ_{2}} p_{1} m_{2} - \frac{1}{λ_{1}} p_{2} m_{1} + s (m_{1} b_{2} - m_{2} b_{1}) . \end{matrix}

(46)

When

s = 0

the above equations formally coincide with the equations of the top:

\frac{d M}{d t} = [Ω (t), M (t)] + b \land p, \frac{d p}{d t} = - Ω (t) p (t)

(Equation (21)).

On

g_{s}

there are two Casimirs:

I_{1} = ⟨ L_{p}, L_{p} ⟩ + s ⟨ L_{k}, L_{k} ⟩ = (p, p) + s (m, m), I_{2} = ⟨ L_{k}, L_{p} ⟩ = (m, p),

Hence generic coadjoint orbits in

g_{s}

are four-dimensional. Since each coadjoint orbit is symplectic, integrable cases occur whenever there is an extra integral of motion functionally independent of H,

I_{1}

, and

I_{2}

. Since the motion of the top is subordinate to the Poisson system of H on

se (3, C)

, the search for integrable tops reduces to the search for an additional integral of motion functionally independent from

I_{1}

,

I_{2}

and H.

Let us now come to the conditions of Kowalewski

λ = λ_{1} = λ_{2} = 2 λ_{3}, b_{3} = 0 .

(47)

and her “mysterious” variables

\begin{matrix} z_{1} = m_{1} + i m_{2}, z_{2} = m_{1} - i m_{2}, w_{1} = p_{1} + i p_{2}, \\ w_{2} = p_{1} - i p_{2}, z_{3} = i m_{3}, w_{3} = i p_{3}, \\ b = b_{1} + i b_{2}, \bar{b} = b_{1} - i b_{2} . \end{matrix}

(48)

After the substitutions, Equation (46) become

\begin{matrix} \frac{d z_{1}}{d t} = - \frac{1}{λ} z_{1} z_{3} + b w_{3}, \frac{d z_{2}}{d t} = \frac{1}{λ} z_{2} z_{3} - \bar{b} w_{3} \\ \frac{d z_{3}}{d t} = \frac{1}{2} (b w_{2} - \bar{b} w_{1}) \\ \frac{d w_{1}}{d t} = \frac{1}{λ} z_{1} w_{3} - \frac{2}{λ} z_{3} w_{1} + s b z_{3}, \frac{d w_{2}}{d t} = - \frac{1}{λ} z_{2} w_{3} + \frac{2}{λ} z_{3} w_{2} - s \bar{b} z_{3} \\ \frac{d w_{3}}{d t} = \frac{1}{2 λ} (z_{1} w_{2} - z_{2} w_{1}) + \frac{s}{2} (b z_{2} - \bar{b} z_{1}), \end{matrix}

(49)

from which it can be easily extracted that

I = (\frac{z_{1}^{2}}{2 λ} - b w_{1} + \frac{1}{2} s λ b^{2}) (\frac{z_{2}^{2}}{2 λ} - \bar{b} w_{2} + \frac{1}{2} s λ {\bar{b}}^{2}),

(50)

is an integral of motion. Following the terminology in [7] we will refer refer to this integral as the Kirchhoff-Kowalewski integral. It is only in the special case

s = 0

and

λ = 2

that this integral coincides with the integral of motion found by Kowalewski. The real versions of the Kirchhoff-Kowalewski integral were originally discovered by V. Kuznetsov and I.V. Komarov in their studies of the hydrogen atom [28,29].

Let us reveal the geometric rational behind Kowalewski’s change of variables. The explanations are most naturally articulated through the root system in

so (4, C)

. Recall that any maximal commutative sub-algebra

h

of a Lie algebra

g

is called a Cartan subalgebra. All Cartan subalgebras in a semi-simple Lie algebra are conjugate, and hence all have the same dimension. The dimension of any Cartan algebra is the rank of

g

. The rank of

{so}_{4} (C)

is two. Evidently each pair

(A_{i}, B_{i}), i = 1, 2, 3

, in Table 1 generates a Cartan algebra in

so (4, C)

. Since these algebras are conjugate, there is no preferential choice. However, in regard to the equations of the top, there is a preferential choice when two moments of inertia are equal. In the case that

λ_{1} = λ_{2}

the natural choice is the Cartan algebra generated by the pair

{A_{3}, B_{3}}

.

An element

α

in the dual

h^{*}

of a Cartan algebra

h

is called a root if for some

v \in g

,

[h, v] = α (h) v

for all

h \in h

. An easy calculation shows that there are four roots

\pm α_{1}

,

\pm α_{2}

given by

α_{1} (x A_{3} + y B_{3}) = - i (x + y), α_{2} (x A_{3} + y B_{3}) = - i (x - y), x, y \in C .

(51)

The corresponding root spaces are one dimensional, and are generated by

\begin{matrix} C_{1} = \frac{1}{2} (A_{1} - i A_{2}) + \frac{1}{2} (B_{1} - i B_{2}), α = α_{1}, \\ C_{2} = \frac{1}{2} (A_{1} + i A_{2}) + \frac{1}{2} (B_{1} + i B_{2}), α = - α_{1}, \\ D_{1} = \frac{1}{2} (A_{1} - i A_{2}) - \frac{1}{2} (B_{1} - i B_{2}), α = α_{2}, \\ D_{2} = \frac{1}{2} (A_{1} + i A_{2}) - \frac{1}{2} (B_{1} + i B_{2}), α = - α_{2} . \end{matrix}

(52)

Together with

C_{3} = \frac{i}{2} (A_{3} + B_{3})

and

D_{3} = \frac{i}{2} (A_{3} - B_{3})

these matrices form a basis for

so (4, C)

. A simple calculation shows that

\begin{matrix} α_{1} (C_{3}) = 1, α_{2} (D_{3}) = 1, α_{1} (D_{3}) = α_{2} (C_{3}) = 0, hence, \\ [C_{3}, C_{1}] = C_{1}, [C_{3}, C_{2}] = - C_{2}, [D_{3}, D_{1}] = D_{1}, [D_{3}, D_{2}] = - D_{2} . \end{matrix}

Furthermore,

[C_{1}, C_{2}] = - 2 C_{3}

,

[D_{1}, D_{2}] = - 2 D_{3}

, and

[C_{i}, D_{j}] = 0

, for all i and j.

The Lie algebras

g_{1}

and

g_{2}

spanned by

C_{1}, C_{2}, C_{3}

, and

D_{1}, D_{2}, D_{3}

satisfy

[g_{1}, g_{2}] = 0

. and each is isomorphic to

sl (2, C)

under the identification

C_{1}, D_{1} \to (\begin{matrix} 0 & 1 \\ 0 & 0 \end{matrix}), C_{2}, D_{2} \to (\begin{matrix} 0 & 0 \\ - 1 & 0 \end{matrix}), C_{3}, D_{3} \to \frac{1}{2} (\begin{matrix} - 1 & 0 \\ 0 & 1 \end{matrix}) .

(53)

An easy calculation shows that the coordinates

a_{1}, a_{2}, a_{3}, b_{1}, b_{2}, b_{3}

of an arbitrary point

X \in so (4, C)

relative to the basis

A_{1}, A_{2}, A_{3}, B_{1}, B_{2}, B_{3}

are transformed to the coordinates

c_{1}, c_{2}, c_{3}, d_{1}, d_{2}, d_{3}

relative to the basis

C_{1}, C_{2}, C_{3}, D_{1}, D_{2}, D_{3}

according to the following formulas:

\begin{matrix} c_{1} = \frac{1}{2} (a_{1} + i a_{2}) + \frac{1}{2} (b_{1} + i b_{2}), d_{1} = \frac{1}{2} (a_{1} + i a_{2}) - \frac{1}{2} (b_{1} + i b_{2}), \\ c_{2} = \frac{1}{2} (a_{1} - i a_{2}) + \frac{1}{2} (b_{1} - i b_{2}), d_{2} = \frac{1}{2} (a_{1} - i a_{2}) - \frac{1}{2} (b_{1} - i b_{2}) . \\ c_{3} = - i (a_{3} + b_{3}), d_{3} = - i (a_{3} - b_{3}) . \end{matrix}

Let us now

Φ : so (4, C) \to sp (4, C)

be given by

\begin{matrix} Φ (\sum_{i = 1}^{3} (c_{i} C_{i} + d_{i} D_{i})) = (\begin{matrix} 1 & 0 \\ 0 & 0 \end{matrix}) \otimes (\begin{matrix} - \frac{c_{3}}{2} & c_{1} \\ - c_{2} & \frac{c_{3}}{2} \end{matrix}) + (\begin{matrix} 0 & 0 \\ 0 & 1 \end{matrix}) \otimes (\begin{matrix} - \frac{d_{3}}{2} & d_{1} \\ - d_{2} & \frac{d_{3}}{2} \end{matrix}) = \\ I \otimes \frac{1}{2} (\begin{matrix} - \frac{1}{2} (c_{3} + d_{3}) & c_{1} + d_{1} \\ - (c_{2} + d_{2}) & \frac{1}{2} (c_{3} + d_{3}) \end{matrix}) + (\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}) \otimes \frac{1}{2} (\begin{matrix} - \frac{1}{2} (c_{3} - d_{3}) & c_{1} - d_{1} \\ - (c_{2} - d_{2}) & \frac{1}{2} (c_{3} - d_{3}) \end{matrix}) = \\ I \otimes \frac{1}{2} (\begin{matrix} i a_{3} & a_{1} + i a_{2} \\ - a_{1} + i a_{2} & - i a_{3} \end{matrix}) + (\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}) \otimes \frac{1}{2} (\begin{matrix} i b_{3} & b_{1} + i b_{2} \\ - b_{1} + i b_{2} & - i b_{3} \end{matrix}) . \end{matrix}

where

A \otimes B

denotes the Kronecker product of matrices A and B.

To see that

Φ (so (4, C)) \subset sp (4, C)

recall first that

sp (4, C)

consists of matrices M that satisfy

J M J^{- 1} = - M^{T}

, where J is the matrix that defines the symplectic form

(z, J w)

on

C^{4}

, i.e.,

J^{2} = - I

. It is easy to check that both

I \otimes \frac{1}{2} (\begin{matrix} i a_{3} & a_{1} + i a_{2} \\ - a_{1} + i a_{2} & - i a_{3} \end{matrix})

and

(\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}) \otimes \frac{1}{2} (\begin{matrix} i b_{3} & b_{1} + i b_{2} \\ - b_{1} + i b_{2} & - i b_{3} \end{matrix})

satisfy

J M J^{- 1} = - M^{T}

with

J = (\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}) \otimes (\begin{matrix} 0 & 1 \\ - 1 & 0 \end{matrix})

. Since

J^{2} = - I_{2} \otimes I_{2} = - I

, our claim follows.

We will identify

sl (2, C)

with pure complex quaternions

Q

via the correspondence

q = q_{1} \vec{i} + q_{2} \vec{j} + q_{3} \vec{k} \Leftrightarrow Q = q_{1} E_{1} + q_{2} E_{2} + q_{3} E_{3} = (\begin{matrix} i q_{3} & q_{1} + i q_{2} \\ - q_{1} + i q_{2} & - i q_{3} . \end{matrix})

Then the standard basis

\vec{i}, \vec{j}, \vec{k}

in

Q

is identified with the matrices

E_{1} = (\begin{matrix} 0 & 1 \\ - 1 & 0 \end{matrix}), E_{2} = (\begin{matrix} 0 & i \\ i & 0 \end{matrix}), E_{3} = (\begin{matrix} i & 0 \\ 0 & - i \end{matrix}),

and any element

X = (\begin{matrix} a & b \\ c & - a \end{matrix})

in

sl (2, C)

is represented by the quaternion

X = \frac{1}{2} (\begin{matrix} i a_{3} & a_{1} + i a_{2} \\ - a_{1} + i a_{2} & - i a_{3} \end{matrix})

,

a_{1} = b - c

,

a_{2} = - (b + c)

,

a_{3} = - 2 i a

.

Let now

A_{i} = \frac{1}{2} E_{i}

,

A_{i} = I \otimes A_{i}, B_{i} = E_{3} \otimes A_{i}

, so that

Φ (A_{i}) = A_{i}

and

Φ (B_{i}) = B_{i}

for each i,

i = 1, 2, 3

, and let

{so}_{4} = Φ (so (4, C))

. Matrices

A_{i}, B_{i}, i = 1, 2, 3 .

form an orthonormal basis in

{so}_{4}

relative to the inner product

⟨ X, Y ⟩ = - T r (X Y)

on

sp (4, C)

.

It is easy to verify that

Φ : so (4, C) \to {so}_{4}

is a Poisson map. Therefore

\tilde{H} (\tilde{ℓ}) = H (Φ^{*} (\tilde{ℓ})) = H (ℓ), Φ^{*} (\tilde{ℓ}) = ℓ, \tilde{ℓ} \in {so}_{4},

(54)

for any function H on

{so}^{*} (4, C)

, where

Φ^{*}

denotes the dual map of

Φ

. After the identification of

{so}_{4}^{*}

with

{so}_{4}

via the trace form, the Poisson equations of

\tilde{H}

associated with H in (44) become

\begin{matrix} \frac{d}{d t} (I \otimes Z) & = & [I \otimes Ω, I \otimes Z] + [E_{3} \otimes B, E_{3} \otimes W] = I \otimes ([Ω, Z] + [B, W]), \\ \frac{d}{d t} (E_{3} \otimes W) & = & [I \otimes Z, E_{3} \otimes W] + s [E_{3} \otimes B, I \otimes Z] = E_{3} \otimes ([Ω, W] + s [B, Z]), \end{matrix}

or, in simpler form,

\frac{d Z}{d t} = [Ω, Z] + [B, W], \frac{d W}{d t} = [Ω, W] + s [B, Z],

(55)

where

Z = \frac{1}{2} (\begin{matrix} i m_{3} & m_{1} + i m_{2} \\ - m_{1} + i m_{2} & - i m_{3} \end{matrix}), W = \frac{1}{2} (\begin{matrix} i p_{3} & p_{1} + i p_{2} \\ - p_{1} + i p_{2} & - i p_{3} \end{matrix})

,

Ω = \frac{1}{2} (\begin{matrix} \frac{1}{λ_{3}} z_{3} & \frac{1}{λ_{1}} m_{1} + \frac{i}{λ_{2}} m_{2} \\ - \frac{1}{λ_{1}} m_{1} + \frac{i}{λ_{2}} m_{2} & - \frac{1}{λ_{3}} z_{3} \end{matrix})

, and

B = \frac{1}{2} (\begin{matrix} i b_{3} & b_{1} + i b_{2} \\ - b_{1} + i b_{2} & - i b_{3} \end{matrix}) .

Now we see Kowalewski variables

z_{1} = m_{1} + i m_{2}, z_{2} = m_{1} - i m_{2}, w_{1} = p_{1} + i p_{2}, w_{2} = p_{1} - i p_{2}

as the natural coordinates in this Poisson representation. Under Kowalewski’s conditions

λ = λ_{1} = λ_{2} = 2 λ_{3}

,

b_{3} = 0

, Equation (55) reduce to Equation (49). The passage from H to

\tilde{H}

reveals the geometric rational behind the ad-hoc change of variables in (48) and serves as a natural segue to our ultimate Hamiltonian on

sp (4, C)

.

3.4. Kowalewski’s Conditions and Isospectral Representations

We now address the origins of the “enigmatic” conditions (47) through an extended affine-quadratic Hamiltonian

H = \sum_{i = 1}^{3} \frac{m_{i}^{2}}{2 λ_{i}} + b_{i} p_{i} + c_{i} q_{i}

(56)

on

{sp}^{*} (4, C)

defined by complex numbers

λ_{1}, λ_{2}, λ_{3}

,

b_{1}, b_{2}, b_{3}

, and

c_{1}, c_{2}, c_{3}

and an extended basis

A_{i}, B_{i}, i = 1, 2, 3

,

A_{4} = \frac{1}{2} E_{2} \otimes I,

and

C_{1} = E_{1} \otimes A_{1}, C_{2} = E_{1} \otimes A_{2}, C_{3} = E_{1} \otimes A_{3}

, where

E_{1} = (\begin{matrix} 0 & 1 \\ - 1 & 0 \end{matrix}), E_{2} = (\begin{matrix} 0 & 1 \\ 1 & 0 \end{matrix}),

and

E_{3} = (\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}) .

The reader can easily verify that

g = sp (4, C)

has the following decomposition

g = k \oplus p, p = p_{1} \oplus p_{2}, k = k_{0} \oplus C A_{4}

(57)

where

k_{0}

is the Lie algebra spanned by

A_{1}, A_{2}, A_{3}

and where

p_{1}

and

p_{2}

are respectively the linear spans of

B_{i}, i = 1, 2, 3

and

C_{i}, i = 1, 2, 3

. These spaces conform to the following Lie algebraic relations:

\begin{matrix} [A_{4}, k_{0}] = 0, [A_{4}, p_{1}] = p_{2}, [A_{4}, p_{2}] = p_{1}, [k_{o}, p_{1}] = p_{1}, [k_{0}, p_{2}] = p_{2}, \\ [p_{1}, p_{1}] = k_{0}, [p_{2}, p_{2}] = k_{0}, [p_{1}, p_{2}] = C A_{4} . \end{matrix}

After

g^{*}

is identified with

g

via the scalar product

⟨ X, Y ⟩ = - \frac{1}{2} T r (X Y)

the above Hamiltonian can be written as

H = \frac{1}{2} ⟨ P (L_{k}^{0}), L_{k}^{0} ⟩ + ⟨ L_{p}, A ⟩,

(58)

where

L_{k}^{0} = m_{1} A_{1} + m_{2} A_{2} + m_{3} A_{3}, L_{p} = \sum_{i = 1}^{3} p_{i} B_{i} - q_{i} C_{i}

and

A = B + C

,

B = \sum_{i = 1}^{3} b_{i} B_{i}

,

C = \sum_{i = 1}^{3} c_{i} C_{i}

(note that

⟨, ⟩

is negative on

p_{2}

which accounts for the negative signs in the expression for

L_{p}

).

Since

g = sp (4, C)

is semi-simple the Poisson equations for

H

are given by

\frac{d L_{k}}{d t} = [d H_{k}, L_{k}] + [A, L_{p}], \frac{d L_{p}}{d t} = [d H_{k}, L_{p}] + s [A, L_{k}], s = 0, 1 .

These equations can be written in a more succinct form as

\begin{matrix} \frac{d Z}{d t} = [Ω, Z] + [B, W] + [C, S], \\ \frac{d m_{4}}{d t} = - (C \cdot W + B \cdot S), \\ \frac{d W}{d t} = [Ω, W] + s ([B, Z] - m_{4} C), \\ \frac{d S}{d t} = [Ω, S] - s ([C, Z] - m_{4} B) . \end{matrix}

(59)

in terms of the following notations:

\begin{matrix} L_{k} = I \otimes Z + m_{4} A_{4}, L_{p} = E_{3} \otimes W - E_{1} \otimes S, \\ d H = I \otimes Ω + E_{3} \otimes B + E_{1} \otimes C, Ω = \sum_{i = 1}^{3} \frac{m_{i}}{˘_{i}} A_{i}, \\ B = \sum_{i = 1}^{3} b_{i} A_{i}, C = \sum_{i = 1}^{3} c_{i} A_{i} . \end{matrix}

Z = \frac{1}{2} (\begin{matrix} z_{3} & z_{1} \\ - z_{2} & - z_{3} \end{matrix}), W = \frac{1}{2} (\begin{matrix} w_{3} & w_{1} \\ - w_{2} & - w_{3} \end{matrix}),

z_{1, 2} = m_{1} \pm i m_{2}, z_{3} = i m_{3}, w_{1, 2} = p_{1} \pm i p_{2}, w_{3} = i p_{3}

, as in the previous section, and

S = \frac{1}{2} (\begin{matrix} s_{3} & s_{1} \\ - s_{2} & - s_{3} \end{matrix}),

with

s_{1, 2} = q_{1} \pm i q_{2}, s_{3} = i q_{3}

.

We now come to the crux of the matter, the existence of integrals of motion for the above system. The intermediate question is the existence of an integral I of the form

I = α_{1} m_{1} + α_{2} m_{2} + α_{3} m_{3} + β m_{4}

for some constants

α_{1}, α_{2}, α_{3},

and

β

.

Proposition 2.

I = α_{1} m_{1} + α_{2} m_{2} + α_{3} m_{3} + β m_{4}

is an integral of motion for

H

in exactly two cases: when

λ_{1} = λ_{2}

and

b_{1} = b_{2} = c_{1} = c_{2} = 0

, then

I = m_{3}

, and in the second case, when

λ_{1} = λ_{2}, b_{3} = c_{3} = 0, b_{1} = \pm i c_{2}, b_{2} = \mp i c_{1}

, then

I = i m_{3} + m_{4}

. (for the proof see [27]).

This first condition singles out the top of Lagrange, while the second condition is a precursor to Kowalewski’s top as will be demonstrated below. Note that the second condition

b_{1} = i c_{2}

and

b_{2} = - i c_{1}

can be also written as

b = c

, and

\bar{b} = - \bar{c}

where

b = b_{1} + i b_{2}

,

\bar{b} = b_{1} - i b_{2}

,

c = c_{1} + i c_{2}

and

\bar{c} = c_{1} - i c_{2}

. Then

C = \frac{1}{2} (\begin{matrix} 0 & b \\ \bar{b} & 0 \end{matrix})

, and since it is orthogonal to

B = \frac{1}{2} (\begin{matrix} 0 & b \\ - \bar{b} & 0 \end{matrix})

, it will be denoted by

B^{⊥}

.

We will say that (59) satisfies the preliminary condition of Kowalewski whenever

λ_{1} = λ_{2}

and

C = B^{⊥}

. It follows that the preliminary condition of Kowalewski is synonymous with the integral of motion

I = i m_{3} + m_{4} = z_{3} + m_{4}

. (This integral of motion was also discovered earlier by A.M Savu in [32]).

We will now assume that the preliminary condition holds and we will pursue conditions on the ratio

δ = \frac{λ}{λ_{3}}

, where

λ = λ_{1} = λ_{2}

, that guarantee extra integrals of motion for system (59). Note that in this situation

A = B + C = E_{3} \otimes B + E_{1} \otimes B^{⊥}

. Systems that satisfy the preliminary condition of Kowalewski and also satisfy

δ = 2

will be said to satisfy the Kowalewsky conditions. The following proposition provides an important characterization of Kowalewski’s conditions for both

s = 0

and

s = 1

.

Proposition 3.

Assume that (59) satisfies the preliminary condition of Kowalewski and is restricted to the manifold

z_{3} + m_{4} = 0

. Then

H

satisfies the isospectrality condition

[d H_{k}, B] = [L_{k}, A]

(as in Theorem 1) for some matrix

B \in p

, with

[A, B] = 0

if only if (59) satisfies the Kowalewski conditions. In fact,

B = λ (E_{1} \otimes B^{⊥} + E_{3} \otimes B) = λ A

[27].

Indeed, under Kowalewski’s conditions

Ω = \frac{1}{λ} Z + \frac{1}{2 λ} z_{3} E_{3} and L_{k} = I \otimes Z - z_{3} A_{4} .

Therefore,

[d H_{k}, B] = [I \times Ω, B] = [I \times Z, A] + \frac{z_{3}}{2} [I \times E_{3}, A], and [L_{k}, A] = [I \otimes Z, A] - z_{3} [A_{4}, A] .

Since

[\frac{z_{3}}{2} I \otimes E_{3}, A] = - z_{3} [A_{4}, A]

,

[I \otimes Ω, B] = [L_{k}, A]

.

It follows from Theorem 1 that Kowalewski’s condition is necessary and sufficient for the existence of isospectral representation

\frac{d L_{μ}}{d t} = [M_{μ}, L_{μ}], L_{μ} = L_{p} - μ (L_{k}^{0} - z_{3} A_{4}) + (μ^{2} - s) λ A

on the invariant manifold

z_{3} + m_{4} = 0

. Consequently,

ϕ_{k} = T r (L_{μ}^{2 k}) = ⟨ L_{μ}^{k}, L_{μ}^{k} ⟩

are integrals of motion for (59), in involution with each other for each

s = 0

, or

s = 1

. Remarkably, the prototype of Kowalewski’s integrals of motion is found among the above spectral invariants. (see also [33,34] for other spectral representations).

We will show the existence of Kowalewki’s integral of motion directly from the equations

\begin{matrix} \frac{d Z}{d t} = [Ω, Z] + [B, U] + [B^{⊥}, V], \\ \frac{d U}{d t} = [Ω, U], \frac{d V}{d t} = [Ω, V], \end{matrix}

(60)

obtained from (59) under the change of variables

U = W - s λ B

,

V = S + s λ B^{⊥} .

Equation (60) may be seen as a semisimple extension of the Kowalewski-type gyrostat in two constant fields introduced in [35].

Equations (60) may be also expressed in terms of the coordinates as

\begin{matrix} \frac{d z_{1}}{d t} = - \frac{1}{λ} z_{1} z_{3} + b (u_{3} + v_{3}), \frac{d z_{2}}{d t} = \frac{1}{λ} z_{2} z_{3} - \bar{b} (u_{3} - v_{3}), \\ \frac{d z_{3}}{d t} = \frac{1}{2} (b u_{2} - \bar{b} u_{1} + b v_{2} + \bar{b} v_{1}), \\ \frac{d u_{1}}{d t} = \frac{u_{3} z_{1}}{λ} - \frac{2 u_{1} z_{3}}{λ}, \frac{d u_{2}}{d t} = \frac{2 u_{2} z_{3}}{λ} - \frac{z_{2} u_{3}}{λ}, \\ \frac{d u_{3}}{d t} = \frac{1}{2 λ} (z_{1} u_{2} - u_{1} z_{2}) \\ \frac{d v_{1}}{d t} = \frac{v_{3} z_{1}}{λ} - \frac{2 v_{1} z_{3}}{λ}, \frac{d v_{2}}{d t} = \frac{2 v_{2} z_{3}}{λ} - \frac{z_{2} v_{3}}{λ}, \\ \frac{d v_{3}}{d t} = \frac{1}{2 λ} (z_{1} v_{2} - v_{1} z_{2}) . \end{matrix}

(61)

One readily obtains the following fundamental equalities

\begin{matrix} \frac{d}{d t} (u_{1} + v_{1}) = \frac{z_{1}}{λ} (u_{3} + v_{3}) - \frac{2 z_{3}}{λ} (u_{1} + v_{1}), \\ \frac{d}{d t} (u_{2} - v_{2}) = \frac{2 z_{3}}{λ} (u_{2} - v_{2}) - \frac{z_{2}}{λ} (u_{3} - v_{3}) \end{matrix}

(62)

Let now

e_{1} = \frac{z_{1}^{2}}{2 λ} - b (u_{1} + v_{1}), and e_{2} = \frac{z_{2}^{2}}{2 λ} - \bar{b} (u_{2} - v_{2}) .

Then

\begin{matrix} \frac{d e_{1}}{d t} = \frac{d}{d t} (\frac{z_{1}^{2}}{2 λ} - b (u_{1} + v_{1})) = \\ \frac{z_{1}}{λ} (- \frac{1}{λ} z_{1} z_{3} + b (u_{3} + v_{3})) - b (\frac{z_{1}}{λ} (u_{3} + v_{3}) - \frac{2 z_{3}}{λ} (u_{1} + v_{1})) = \\ - \frac{2 z_{3}}{λ} (\frac{z_{1}^{2}}{2 λ} - b (u_{1} + v_{1})) = - \frac{2 z_{3}}{λ} e_{1}, \end{matrix}

and

\begin{matrix} \frac{d e_{2}}{d t} = \frac{d}{d t} (\frac{z_{2}^{2}}{2 λ} - \bar{b} (u_{2} - v_{2})) = \\ \frac{z_{2}}{λ} (\frac{1}{λ} z_{2} z_{3} - \bar{b} (u_{3} - v_{3})) - \bar{b} (\frac{2 z_{3}}{λ} (u_{2} - v_{2}) - \frac{z_{2}}{λ} (u_{3} - v_{3}) = \\ \frac{2 z_{3}}{λ} (\frac{z_{2}^{2}}{2 λ} - \bar{b} (u_{2} - v_{2})) = \frac{2 z_{3}}{λ} e_{2} . \end{matrix}

Hence,

c = e_{1} e_{2}

is an integral of motion for (61) since

\frac{d}{d t} c = \frac{d}{d t} e_{1} e_{2} = \frac{d e_{1}}{d t} e_{2} + e_{1} \frac{d e_{2}}{d t} = 0 .

An interested reader may want to show that the following are also integrals of motion

\begin{matrix} c_{0} = & | | V^{2} | |, c_{1} = {| | U | |}^{2}, c_{2} = ⟨ U, V ⟩, \\ c_{3} = & [U, V] \cdot (λ ([B, V] + [B^{⊥}, U]) - z_{3} Z) + \frac{1}{2} ({(V \cdot Z)}^{2} - {(U \cdot Z)}^{2}) . \end{matrix}

The preceding calculation also draws attention to the following general fact:

Proposition 4.

c = (\frac{z_{1}^{2}}{2 λ} - b (u_{1} + v_{1})) (\frac{z_{2}^{2}}{2 λ} - \bar{b} (u_{2} - v_{2}))

is a constant of motion for any differential system in the variables

z_{i}, u_{i}, v_{i}, i = 1, 2, 3

that satisfy

\begin{matrix} \frac{d z_{1}}{d t} = - \frac{1}{λ} z_{1} z_{3} + b (u_{3} + v_{3}), \\ \frac{d z_{2}}{d t} = \frac{1}{λ} z_{2} z_{3} - \bar{b} u_{3} - v_{3}), \\ \frac{d}{d t} (u_{1} + v_{1}) = \frac{z_{1}}{λ} (u_{3} + v_{3}) - \frac{2 z_{3}}{λ} (u_{1} + v_{1}), \\ \frac{d}{d t} (u_{2} - v_{2}) = \frac{2 z_{3}}{λ} (u_{2} - v_{2}) - \frac{z_{2}}{λ} (u_{3} - v_{3}), \end{matrix}

(63)

independently of the equations that govern the evolution of

u_{3}

and

v_{3}

.

To come back to the top of Kowalewski, note that

V = 0

is an invariant subsystem for (60). On this set,

S = - λ s B^{⊥}

, and c reduces to

c = (\frac{z_{1}^{2}}{2 λ} - b u_{1}) (\frac{z_{2}^{2}}{2 λ} - \bar{b} u_{2}) = (\frac{z_{1}^{2}}{2 λ} - b w_{1} + s λ b^{2}) (\frac{z_{2}^{2}}{2 λ} - \bar{a} b w_{2} + s λ {\bar{b}}^{2}),

(64)

and remains an integral of motion for the reduced system

\frac{d Z}{d t} = [Ω, Z] + [B, U], \frac{d U}{d t} = [Ω, U]),

(65)

with its fundamental relations (63)

\begin{matrix} \frac{d z_{1}}{d t} = - \frac{1}{λ} z_{1} z_{3} + b u_{3}, \frac{d z_{2}}{d t} = \frac{1}{λ} z_{2} z_{3} - \bar{b} u_{3}, \\ \frac{d}{d t} u_{1} = \frac{z_{1}}{λ} u_{3} - \frac{2 z_{3}}{λ} u_{1}, \frac{d}{d t} u_{2} = \frac{2 z_{3}}{λ} u_{2} - \frac{z_{2}}{λ} u_{3} . \end{matrix}

(66)

This reduced system coincides the Kirchhoff-Kowalewski system on

se (3, C)

(Equation (49),

s = 0

, after u is replaced by w). Then

c = (\frac{z_{1}^{2}}{2 λ} - b u_{1}) (\frac{z_{2}^{2}}{2 λ} - \bar{b} u_{2})

coincides with the integral of motion discovered by Kowalewski. The remaining isospectral integrals of motion

c_{1} = {| | U | |}^{2}

and

c_{3} = (U \cdot Z)

coincide with the Casimirs on

se (3, C)

.

To recover the semi-simple form of the Kirchhoff-Kowalewski integral, let

Y = U + \frac{s}{2} λ B

. In terms of

Z

and

Y

the preceding system becomes

\frac{d Z}{d t} = [Ω, Z] + [B, Y], \frac{d Y}{d t} = [Ω, Y] - \frac{s λ}{2} [Ω, B] .

(67)

This system satisfies the same equations as the Kirchhoff-Kowalewski system except for

\frac{d y_{3}}{d t}

. Indeed,

\begin{matrix} - \frac{s λ}{2} [Ω, B] = - \frac{s}{2} ([Z, B] + z_{3} [A_{3}, B]) = \\ - s [Z, B] + \frac{s}{2} ([Z, B] + z_{3} B^{⊥}) = s [B, Z] + \frac{s}{2} (\bar{b} z_{1} - b z_{2}) A_{3} . \end{matrix}

The remaining equations given by

\begin{matrix} \frac{d z_{1}}{d t} = - \frac{1}{λ} z_{1} z_{3} + b y_{3}, \frac{d z_{2}}{d t} = \frac{1}{λ} z_{2} z_{3} - \bar{b} y_{3}, \\ \frac{d y_{1}}{d t} = \frac{z_{1}}{λ} y_{3} - \frac{2 z_{3}}{λ} y_{1} + s b z_{3}, \\ \frac{d y_{2}}{d t} = \frac{2 z_{3}}{λ} y_{2} - \frac{z_{2}}{λ} y_{3} - s \bar{b} z_{3}, \end{matrix}

(68)

are the same as (66), and consequently yield

c = (\frac{z_{1}^{2}}{2 λ} - b y_{1} + \frac{s}{2} λ b^{2}) (\frac{z_{2}^{2}}{2 λ} - \bar{b} y_{2} + \frac{s}{2} λ {\bar{b}}^{2})

as an integral of motion for the preceding system, as well as for the Kirchhoff-Kowalewski system (Equation (49)) when

Y

is replaced by

W

.

The papers of V. Dragovi

\overset{´}{c}

and K. Kuki

\overset{´}{c}

[30] and V. V. Sokolov [31] produce differential systems which admit Kowalewski type integrals different from the ones in this paper and yet follow the same integration procedure used by S. Kowalewski in her original paper. Remarkably, all these systems satisfy the fundamental relations (63) from which the existence of their extra integrals of motion could be easily ascertained.

4. Kepler, Jacobi, Neumann and Moser

Let us now return to

G = S L (n + 1)

and its Lie algebra

sl (n + 1)

endowed with the trace form

⟨ X, Y ⟩ = \frac{1}{2} T r (X Y)

. As a vector space V, the set of

(n + 1) \times (n + 1)

matrices with zero trace admits several kinds of Lie algebras and each of these Lie algebras induces its own Poisson structure on V. The most common Lie algebra is

sl (n + 1)

itself. Then

K = S O (n + 1)

induces the orthogonal decomposition

sl (n + 1) = s y m_{0} \oplus so (n + 1)

where

s y m_{0}

denotes the vector space of symmetric matrices in V. But then V also carries the semi-direct product structure

s y m_{0} ⋊ so (n + 1)

.

However,

K = S O (p, q), p + q = n + 1,

is also a closed subgroup of G and hence the pair

(S L (n + 1), S O (p, q))

induces its own Cartan decomposition

V = p \otimes k

, where

p

is the orthogonal complement to

k = so (p, q)

. In fact K is the set of points in G fixed by the automorphism

σ (g) = D {g^{T}}^{- 1} D^{- 1}

where D denotes diagonal matrix with its first p diagonal entries equal to 1 and the remaining q diagonal entries equal to

- 1 .

The set of points

g \in G

such that

σ (g) = g

satisfies

D = g D g^{T}

, that is,

g \in S O (p, q)

. It follows that its tangent map

σ_{*}

induces the above decomposition with

k = {X \in V : D X D^{T} = - X}, p = {X \in V : D X^{T} D = X} .

(69)

Consequently, matrices in

p

are symmetric relative to the scalar product

{(x, y)}_{p, q} = (x, D y)

,

x, y

in

R^{n + 1}

.

We will now return to the canonical Hamiltonians

H (L) = \frac{1}{2} ⟨ L_{k}, L_{k} ⟩ + ⟨ A, L_{p} ⟩

and their Poisson Equation (33) restricted to the coadjoint orbits through rank one matrices

X_{0}

in

sl (n + 1)

. We will consider two cases: the coadjoint orbit through a symmetric rank-one matrix

X_{0}

of unit length under the action of

G_{1} = s y m_{0} ⋊ S O (n + 1)

, and the second case, the coadjoint orbit through rank-one matrix

X_{0}

of unit length, symmetric relative to the Lorentzian inner product in

R^{n + 1}

under the action of

p ⋊ S O (1, n)

. The above matrices can be naturally expressed in terms of the notations introduced earlier, the scalar product

{(x, y)}_{ϵ} = x_{0} y_{0} + ϵ \sum_{i = 1}^{n} x_{i} y_{i}, ϵ = \pm 1,

in the ambient space

R^{n + 1}

, and matrices

a \otimes_{ϵ} b

and

a \land_{ϵ} b = a \otimes_{ϵ} b - b \otimes_{ϵ} a

. For then

X_{0} = x_{0} \otimes_{ϵ} x_{0} - \frac{{(x_{0}, x_{0})}_{ϵ}}{n + 1} I .

If

{(x_{0}, x_{0})}_{ϵ} > 0

then let

S_{ϵ}^{n} = {x \in R^{n + 1} : {(x, x)}_{ϵ} = {(x_{0}, x_{0})}_{ϵ}, x_{0} . 0}

. It follows that

S_{ϵ}^{n}

is the Euclidean sphere of radius

| | x_{0} | |

when

ϵ = 1

and a hyperboloid of two sheets when

ϵ = - 1

. We have chosen

S_{- 1}^{n}

to be the sheet defined by

x_{0} > 0

.

Proposition 5.

The coadjoint orbit through

X_{0} = x_{0} \otimes_{ϵ} x_{0} - \frac{{(x_{0}, x_{0})}_{ϵ}}{n + 1} I

is symplectomorphic to the cotangent bundle of the real projective space

P^{n + 1}

in the semi-simple case, and it is symplectomorphic to the cotangent bundle of

S_{ϵ}^{n}

in the semi-direct case.

For the proof see [36]. Here it is implicitly understood that the cotangent bundles are identified with the tangent bundles via the ambient inner product

{(,)}_{ϵ}

. Then each tangent vector

(x, y), x \in S_{ϵ}^{n}, {(x, y)}_{ϵ} = 0

is identified with

L_{p} = x \otimes_{ϵ} x - \frac{{(x_{0}, x_{0})}_{ϵ}}{n + 1} I

in

p_{ϵ}

and

L_{k} = x \land_{ϵ} y

in

k_{ϵ}

.

On the orbit through

X_{0}

,

H = \frac{1}{2} {(x, x)}_{ϵ} {(y, y)}_{ϵ} - \frac{1}{2} {(A x, x)}_{ϵ}

, and the associated Poisson equations are of the form

\frac{d}{d t} (x \land_{ϵ} y) = [A, x \otimes_{ϵ} x], \frac{d}{d t} (x \otimes_{ϵ} x) = [x \land_{ϵ} y, x \otimes_{ϵ} x]

(70)

A simple calculation show that

\dot{x} = {(x, x)}_{ϵ} y, \dot{y} = A x - (\frac{{A x, x)}_{ϵ}}{{(x, x)}_{ϵ}} + {(y, y)}_{ϵ}) x .

(71)

On the unit sphere, Equation (71) after A is replaced by

- A

coincide with the equations for the mechanical problem of C. Neumann for a particle on the sphere moving under a quadratic potential [37]. The preceding equations for

ϵ = - 1

could be analogously interpreted as the equations on the hyperboloid for a particle moving under quadratic potential [7].

The canonical affine-quadratic problem illuminates deep and beautiful connections between Kepler’s gravitational problem, Jacobi’s geodesic problem on the ellipsoid, and Neumann’s mechanical problems.

Let us first examine the isospectral integrals associated with the spectral curve

L_{λ} = L_{p} - λ L_{k} + λ^{2} A

on the coadjoint orbit through rank-one matrices. The zero trace requirement is inessential for the calculations below and will be disregarded. Additionally, A will be replaced by

- A

and

L_{λ}

will be rescaled by dividing by

- λ^{2}

to read

L_{λ} = - \frac{1}{λ^{2}} L_{p} + \frac{1}{λ} L_{k} + A . = - \frac{1}{λ^{2}} x \otimes_{ϵ} x + \frac{1}{λ} x \land_{ϵ} y + A .

The spectrum of

L_{λ}

is then given by

0 = D e t (z I - L_{λ}) = D e t (z I - A) D e t (I - {(z I - A)}^{- 1} (- \frac{1}{λ^{2}} L_{p} + \frac{1}{λ} L_{k})),

Matrix

M = I - {(z I - A)}^{- 1} (- \frac{1}{λ^{2}} L_{p} + \frac{1}{λ} L_{k})

is of the form.

M = I + \frac{1}{λ^{2}} R_{z} x \otimes_{ϵ} x - \frac{1}{λ} (R_{z} x \otimes_{ϵ} y - R_{z} y \otimes_{ϵ} x),

where

R_{z} = {(z I - A)}^{- 1} .

We then have the following proposition

Lemma 1.

D e t (M) = \frac{1}{λ^{2}} ({(R_{z} x, x)}_{ϵ} + {(R_{z} x, x)}_{ϵ} {(R_{z} y, y)}_{ϵ} - {(R_{z} x, y)}_{ϵ}^{2}) + 1 .

For the proof see [7] (p. 200).

Corollary 1.

Function

F (z) = {(R_{z} x, x)}_{ϵ} + {(R_{z} x, x)}_{ϵ} {(R_{z} y, y)}_{ϵ} - {(R_{z} x, y)}_{ϵ}^{2}, z \in R

is an integral of motion for H.

Function F is a rational function with poles at the eigenvalues of the matrix

A .

Hence,

F (z)

is an integral of motion for H if and only if the residues of F are constants of motion for H.

In the Euclidean case the eigenvalues of

A

are real and distinct since

A

is symmetric and regular. Hence there is no loss in generality in assuming that A is diagonal. Let

a_{0}, \dots, a_{n}

denote its diagonal entries Then

F (z) = \sum_{k = 0}^{n} \frac{F_{k}}{z - a_{k}},

where

F_{0}, \dots, F_{n}

denote the residues of

F .

It follows that

\begin{matrix} F (z) = \sum_{k = 0}^{n} \frac{x_{k}^{2}}{z - a_{k}} + \sum_{k = 0}^{n} \sum_{j = 0}^{n} \frac{x_{k}^{2} y_{j}^{2}}{(z - a_{k}) (z - a_{j})} - {(\sum_{k = 0}^{n} \frac{x_{k} y_{k}}{z - a_{k}})}^{2} = \\ \sum_{k = 0}^{n} \frac{x_{k}^{2}}{z - a_{k}} + \sum_{k = 0}^{n} \sum_{j = 0, j \neq k}^{n} \frac{x_{k}^{2} y_{j}^{2}}{(z - a_{k}) (z - a_{j})} - 2 \sum_{k = 0}^{n} \sum_{j = 0, j \neq k}^{n} \frac{x_{k} y_{k} x_{j} y_{j}}{(z - a_{k}) (z - a_{j})} . \end{matrix}

Hence,

\begin{matrix} F_{k} = lim_{z \to a_{k}} (z - a_{k}) F (z) = \\ x_{k}^{2} + \sum_{j = 0, j \neq k}^{n} \frac{x_{j}^{2} y_{k} + x_{k}^{2} y_{j}^{2}}{(a_{k} - a_{j})} - 2 \sum_{j = 0, j \neq k}^{n} \frac{x_{k} y_{k} x_{j} y_{j}}{(a_{k} - a_{j})} = \\ x_{k}^{2} + \sum_{j = 0, j \neq k}^{n} \frac{{(x_{j} y_{k} - x_{k} y_{j})}^{2}}{(a_{k} - a_{j})}, k = 0, \dots, n . \end{matrix}

The preceding calculation yields the following proposition.

Proposition 6.

Each residue

F_{k} = x_{k}^{2} + \sum_{j = 0, j \neq k}^{n} \frac{{(x_{j} y_{k} - x_{k} y_{j})}^{2}}{(a_{k} - a_{j})}, k = 0, \dots, n

is an integral of motion for Newmann’s spherical system, and functions

F_{0}, \dots, F_{n}

are in involution.

These results coincide with the ones reported in [38,39,40], but the connection with the affine-quadratic problem shows that similar integrals of motion exist for the hyperbolic Neumann problem [7] (p. 191).

In the literature on integrable systems the integrals of motion for Neumann’s problem are related to the integrals of motion for Jacobi’s problem on the ellipsoid through the transformation of H. Knörrer that transforms the Neumann’s equations on energy level

H = 0

onto the equations of Jacobi on the ellipsoid [39,41]. Our exposition takes another route: we will instead show that an “elliptic” problem on the sphere is completely integrable with its integrals of motion as in Neumann’s problem, and then we will show that the Hamiltonian equations for the elliptic problem on the sphere and Jacobi’s problem on the ellipsoid are symplectomorphic. We will then use this symplectomorphism to show the existence of Jacobi’s integrals of motion on the ellipsoid.

Let now

H = \frac{1}{2} ⟨ D^{- 1} (L_{k}) D^{- 1}, L_{k} ⟩ + ⟨ D^{- 1}, L_{p} ⟩

denote an affine-quadratic Hamiltonian on

G = S L (n + 1)

defined by a diagonal matrix D with positive diagonal entries. As before, we will dispense with zero-trace requirements since they are inessential. The above Hamiltonian is generated by a positive definite operator

P (X) = D X D, X \in so (n + 1)

and the drift

A = D^{- 1}

. We will call this Hamiltonian elliptic for reasons that will be made clear later on. Let

B = - D

. Then

[P^{- 1} (L_{k}), B] = [P^{- 1} (L_{k}), - D] = - L_{k} D^{- 1} + D^{- 1} L_{k} = [L_{k}, D^{- 1}] = [L_{k}, A] .

Therefore

H

is isospectral (Proposition 1) and its Hamiltonian equations admit a representation

\frac{d L_{λ}}{d t} = [M_{λ}, L_{λ}], L_{λ} = L_{p} - λ L_{k} - (λ^{2} - s) A .

Since

L_{λ} = L_{p} - λ L_{k} - (λ^{2} - s) A

is a spectral curve for the canonical affine Hamiltonian

H = \frac{1}{2} ⟨ L_{k}, L_{k} ⟩ - ⟨ A, L_{p} ⟩

we have the following corollary.

Corollary 2.

The spectral invariants of

L_{p} - λ L_{k} - (λ^{2} - s) A

are common integrals of motion for both the canonical Hamiltonian

H = \frac{1}{2} ⟨ L_{k}, L_{k} ⟩ - ⟨ A, L_{p} ⟩

and the elliptic Hamiltonian

H = \frac{1}{2} ⟨ A^{- 1} L_{k} A^{- 1}, L_{k} ⟩ + ⟨ A^{- 1}, L_{p} ⟩

.

As before, on coadjoint orbit through

X_{0} = x_{0} \otimes_{ϵ} x_{0}, {(x_{0}, x_{0})}_{ϵ} = 1

, the Poisson equations of

H

( the semi-direct version) are given by

\begin{matrix} \frac{d}{d t} (x \land_{ϵ} y) = [A^{- 1} (x \land_{ϵ} y) A^{- 1}, x \land_{ϵ} y] + [A^{- 1}, x \otimes_{ϵ} x], \\ \frac{d}{d t} (x \otimes_{ϵ} x) = [A^{- 1} (x \land_{ϵ} y) A^{- 1}), x \otimes_{ϵ} x] . \end{matrix}

(72)

We then have

\begin{matrix} ⟨ A^{- 1} L_{k} A^{- 1}, L_{k} ⟩ = ⟨ A^{- 1} x \land y A^{- 1}, x \land y ⟩ = (A^{- 1} x \cdot x) (A^{- 1} y \cdot y) - {(A^{- 1} x \cdot y)}^{2}, \\ ⟨ A^{- 1}, L_{p} ⟩ = - \frac{1}{2} (x \cdot A^{- 1} x) . \end{matrix}

which shows that the Hamiltonian

H = \frac{1}{2} ⟨ A^{- 1} L_{k} A^{- 1}, L_{k} ⟩ + ⟨ A^{- 1}, L_{p} ⟩

is given by

H = \frac{1}{2} ((A^{- 1} y \cdot y) - \frac{{(A^{- 1} x \cdot y)}^{2}}{(A^{- 1} x \cdot x)} - 1) (A^{- 1} x \cdot x) .

The correspondence

(x, y) \to x \land_{ϵ} y + x \otimes_{ϵ} x

defines a symplectomorphism between the cotangent bundle of

S_{ϵ}^{n}

with its canonical Poisson bracket and the coadjoint orbit through

X_{0}

(Proposition 5).

Proposition 7.

On energy level

H = 0

Equation (72) correspond to

\begin{matrix} \frac{d x}{d t} = (A^{- 1} x \cdot x) (A^{- 1} y - \frac{(A^{- 1} x \cdot y)}{(A^{- 1} x \cdot x)} A^{- 1} x) \\ \frac{d y}{d t} = (A^{- 1} x \cdot x) (\frac{(A^{- 1} x \cdot y)}{(A^{- 1} x \cdot x)} A^{- 1} y - \frac{{(A^{- 1} x \cdot y)}^{2}}{{(A^{- 1} x \cdot x)}^{2}} A^{- 1} x - x .) \end{matrix}

under the correspondence

(x \otimes x, x \land y) \to (x, y)

.

These equations can be reparametrized by a parameter

s = \int (A^{- 1} x (t) \cdot x (t)) d t

to read

\begin{matrix} \frac{d x}{d s} = \frac{d x}{d t} \frac{d t}{d s} = A^{- 1} y - \frac{(A^{- 1} x \cdot y)}{(A^{- 1} x \cdot x)} A^{- 1} x \\ \frac{d y}{d s} = \frac{d y}{d t} \frac{d t}{d s} = \frac{(A^{- 1} x \cdot y)}{(A^{- 1} x \cdot x)} (A^{- 1} y - \frac{(A^{- 1} x \cdot y)}{(A^{- 1} x \cdot x)} A^{- 1} x) - x . \end{matrix}

(73)

We will presently show that Equation (73) are Hamiltonian equations that correspond to the geodesic problem on the sphere relative to the elliptic metric

\frac{1}{2} (A \dot{x}, \dot{x})

.

As an intermediate step we will now derive the Hamiltonian equations associated with the geodesic problem on the quadric surface

(A^{- 1} x, x) = 1

induced by the scalar product

\frac{1}{2} (D \dot{x}, \dot{x})

. We will follow the procedure based on the version of the Maximum Principle for variational problems with constraints outlined in [7] (p. 218) and identify the quadric surface with the submanifold

N = {x \in R^{n + 1} : (x, A^{- 1} x) = 1}

. Then its cotangent bundle will be defined in terms of the constraints

G_{1} = {(x, A^{- 1} x) - 1 = 0}, and G_{2} = {(x, A^{- 1} p) = 0} .

The Hamiltonian lift of a curve

\dot{x} = u (t)

that belongs to

T_{x (t)} N

is given by

h_{u (t)} (x, p) = - \frac{1}{2} (D u, u) + (p, u) + λ_{1} G_{1} + λ_{2} G_{2}

for the multipliers

λ_{1}

and

λ_{2}

that satisfy

{h_{u (t)}, G_{1}} = {h_{u (t)}, G_{2}} = 0

. If

h_{u}^{0} = - \frac{1}{2} (D u, u) + (p, u)

, then

{h_{u}, G_{1}} = {h_{u}^{0}, G_{1}} + λ_{2} {G_{2}, G_{1}}, {h_{u}, G_{2}} = {h_{u}^{0}, G_{2}} + λ_{1} {G_{1}, G_{2}} .

It follows that

{h_{u}^{0}, G_{1}} = - 2 u \cdot A^{- 1} x, {h_{u}^{0}, G_{2}} = - u \cdot A^{- 1} p, {G_{1}, G_{2}} = 2 A^{- 1} x \cdot A^{- 1} x .

Hence,

λ_{1} = - \frac{1}{{G_{1}, G_{2}}} {h_{u}^{0}, G_{2}} = \frac{1}{2} \frac{(u, A^{- 1} p)}{(A^{- 1} x, A^{- 1} x)}, λ_{2} = - \frac{1}{{G_{2}, G_{1}}} {h_{u}^{0}, G_{1}} = - \frac{(u, A^{- 1} x)}{(A^{- 1} x, A^{- 1} x)} .

According to the Maximum Principle an extremal control

u (t)

must optimize

h_{u}^{0} = - \frac{1}{2} (u \cdot D u) + p \cdot u

on

G_{1} = G_{2} = 0

over all controls that satisfy

u \cdot A^{- 1} x = 0

. Hence extremal controls are the critical points of

- \frac{1}{2} (u, D u) + (p, u) - α_{0} (u, A^{- 1} x)

for some multiplier

α_{0}

, that is, they are solutions of

- D u + p - α_{0} A^{- 1} x = 0

. It follows that the extremal controls are of the form

u = D^{- 1} (p - α_{0} A^{- 1} x)

, But then

(u, A^{- 1} x) = 0

implies that

α_{0} = \frac{(A^{- 1} x, D^{- 1} p)}{(D^{- 1} A^{- 1} x, A^{- 1} x)} .

For this choice of controls

h_{u} = H + λ_{1} G_{1} + λ_{2} G_{2},

where

H = \frac{1}{2} (D^{- 1} (p - α_{0} A^{- 1} x), p))

. An easy calculation shows that

H = \frac{1}{2} ((D^{- 1} p, p) - \frac{{(D^{- 1} p, A^{- 1} x)}^{2}}{(A^{- 1} x, A^{- 1} x)}) .

Then the extremal curves are the solutions of the following differential equation:

\begin{matrix} \frac{d x}{d t} = \frac{\partial H}{\partial p} = D^{- 1} (p - α_{0} A^{- 1} x), \\ \frac{d p}{d t} = - \frac{\partial H}{\partial x} - 2 λ_{1} \frac{\partial G_{1}}{\partial x} = α_{0} (A^{- 1} (D^{- 1} (p - α_{0} A^{- 1} x)) - 2 λ_{1} A^{- 1} x, \end{matrix}

(74)

which emanate from

H = \frac{1}{2}

, that is, satisfy

(D^{- 1} p, p) (D^{- 1} A^{- 1} x, A^{- 1} x) - {(D^{- 1} p, A^{- 1} x)}^{2} = (D^{- 1} A^{- 1} x, A^{- 1} x) .

(75)

We will now single out the cases relevant for our earlier claims.

The geodesic problem on the ellipsoid. In this classic case initiated by C. Jacobi

D = I

and

(A^{- 1} x, p) = 0

. Hence

α_{0} = \frac{(A^{- 1} x, p)}{(A^{- 1} x, A^{- 1} x)} = 0, λ_{1} = \frac{1}{2} \frac{(p, A^{- 1} p)}{(A^{- 1} x, A^{- 1} x)} .

Then Equation (74) reduce to

\frac{d x}{d t} = p, \frac{d p}{d t} = - \frac{(p, A^{- 1} p)}{(A^{- 1} x, A^{- 1} x)} A^{- 1} x .

(76)

The preceding equation agree with the equations in J. Moser [39].

The elliptic problem on the sphere. Here the ambient metric is defined by a positive-definite matrix D and

A = I

. In such a case Equation (74) are given by

(D^{- 1} p, p) - \frac{{(D^{- 1} p, x)}^{2}}{(D^{- 1} x, x)} = 1 .

Furthermore,

λ_{1} = \frac{1}{2} D^{- 1} (p - α_{0} x), p) = \frac{1}{2} (D^{- 1} p, p) - \frac{{(x, D^{- 1} p)}^{2}}{(D^{- 1} x, x)}) = \frac{1}{2}, α_{0} = \frac{(D^{- 1} x, p)}{(D^{- 1} x, x)} .

The Hamiltonian equations are then given by

\begin{matrix} \frac{d x}{d t} = D^{- 1} p - \frac{(D^{- 1} x, p)}{(D^{- 1} x, x)} D^{- 1} x, \\ \frac{d p}{d t} = \frac{(D^{- 1} x, p)}{(D^{- 1} x, x)} (D^{- 1} p - \frac{(D^{- 1} x, p)}{(D^{- 1} x, x)} D^{- 1} x)) - x, \end{matrix}

(77)

which agrees with Equation (73) when

D = A

.

Proposition 8.

The Hamiltonian systems that correspond to the elliptic problem on the sphere and the geodesic problem on the ellipsoid are symplectomorphic.

Proof.

Let

(x, y)

denote the coordinates on the tangent bundle of the sphere and let

(q, p)

denote the coordinates on the tangent bundle of the ellipsoid

E = {q \in R^{n + 1} : (q, A^{- 1} q) - 1 = 0}

. In these coordinates the systems in question are given by

\frac{d x}{d t} = u, \frac{d y}{d t} = α u - x, and \frac{d q}{d t} = p, \frac{d p}{d t} = - \frac{(A^{- 1} p, p)}{(A^{- 1} q, A^{- 1} q)} A^{- 1} q,

(78)

where

u = A^{- 1} (y - α x)

and

α = \frac{(A^{- 1} x, y)}{(A^{- 1} x, A^{- 1} x)}

.

Let

Φ

denote the mapping from the cotangent bundle of the sphere to the cotangent bundle of E defined by

q = A^{\frac{1}{2}} x, p = A^{- \frac{1}{2}} (y - α x) = A^{\frac{1}{2}} u .

Let

θ = \sum_{i = 0}^{n} p_{i} d q_{i} = (p, d q), (d q, A^{- 1} q) = 0,

denote the Liouville-Poincaré canonical form on

T^{*} E

. Then

Φ^{*} θ = (A^{- \frac{1}{2}} (y - α x), A^{\frac{1}{2}} d x) = (y, d x) - α (x, d x) = (y, d x),

because

0 = d q \cdot A^{- 1} q = A^{\frac{1}{2}} d x \cdot A^{- 1} A^{\frac{1}{2}} x = d x \cdot x

. Since

Φ^{*}

takes the Liouville form on

T^{*} E

to the Liouville form on

T^{*} S^{n}

, it also takes the canonical symplectic form on

T^{*} E

to the canonical symplectic for on

T^{*} S_{n}

and hence is a symplectomorphism.

It now follows from (78) that

\frac{d u}{d t} = - (1 + \frac{d α}{d t}) A^{- 1} x

and that

1 + \frac{d α}{d t} = \frac{(u, u)}{(A^{- 1} x, x)}

. Then,

\begin{matrix} \frac{d q}{d t} = A^{\frac{1}{2}} \frac{d x}{d t} = A^{\frac{1}{2}} u = p, \\ \frac{d p}{d t} = A^{\frac{1}{2}} \frac{d u}{d t} = - (\frac{(u, u)}{(A^{- 1} x, x)}) A^{- 1} q = - \frac{(A^{- 1} p, p)}{(A^{- 1} q, A^{- 1} q)} A^{- 1} q, \end{matrix}

and thus

Φ_{*}

takes the Hamiltonian flow on the sphere onto the Hamiltonian flow on the ellipsoid. □

Proposition 9.

Jacobi’s problem on the ellipsoid is completely integrable. Functions

G_{k} = p_{k}^{2} + \sum_{j = 1, j \neq k}^{n + 1} \frac{{(q_{j} p_{k} - q_{k} p_{j})}^{2}}{(a_{k} - a_{j})}, k = 1, \dots, (n + 1)

are constants of motion, all in involution with each other, for the Hamiltonian system

\frac{d q}{d t} = p, \frac{d p}{d t} = - \frac{(A^{- 1} p, p)}{(A^{- 1} q, A^{- 1} q)} A^{- 1} q

on the cotangent bundle of the ellipsoid.

Proof.

We have shown that

F_{k} = x_{k}^{2} + \sum_{j = 0, j \neq k}^{n} \frac{{(x_{j} y_{k} - x_{k} y_{j})}^{2}}{(α_{k} - α_{j})}, k = 0, \dots, n

(79)

are an involutive family of integrals of motion for the elliptic-geodesic problem on the sphere. We have also shown that the above integrals of motion are the residues of the function

F (z) = (R_{z} x, x) + (R_{z} x, x) (R_{z} y, y) - {(R_{z} x, y)}^{2}, R_{z} = {(z I - A)}^{- 1} .

We will now show that functions (79) are the residues of the pull-back of F under the symplectomorphism

Φ

. First note that F remains unchanged if the variable y is replaced by

y + α x

with

α

an arbitrary number. Since

A^{\frac{1}{2}} p = y - \frac{(A^{- 1} x, y)}{(A^{- 1} x, x)} x

, we may replace y by

A^{\frac{1}{2}} p

and x by

A^{- \frac{1}{2}} q

. Also note that

(p, p) = (A^{- 1} y, y) - \frac{{(A^{- 1} x, y)}^{2}}{(A^{- 1} x, x)} = 1

(use Equation (75)). Then,

\begin{matrix} 1 + (R_{z} y, y) = 1 + (R_{z} A p, p) = 1 + \sum_{k = 0}^{n} \frac{a_{k} p_{k}^{2}}{z - a_{k}} = \\ \sum_{k = 0}^{n} p_{k}^{2} + \frac{a_{k} p_{k}^{2}}{z - a_{k}} = z \sum_{k = 0}^{n} \frac{p_{k}^{2}}{z - a_{k}} = z (R_{z} p, p), \\ (R_{z} x, x) = (R_{z} A^{- 1} q, q) = \sum_{k = 0}^{n} \frac{q_{k}^{2}}{a_{k} (z - a_{k})} = \frac{1}{z} \sum_{k = 0}^{n} \frac{q_{k}}{a_{k}} + \frac{q_{k}^{2}}{z - a_{k}} = \frac{1}{z} (1 + (R_{z} q, q), \\ and (R_{z} x, y) = (R_{z} q, p) . \end{matrix}

It follows that

F (z) = (R_{z} p, p) (1 + R_{z} q, q) - {(R_{z} q, p)}^{2}

is constant along the solutions of Jacobi’s equations.. A calculation identical to the one used for Neumann’s system shows that

G_{k} = p_{k}^{2} + \sum_{j = 0, j \neq k}^{n} \frac{{(q_{j} p_{k} - q_{k} p_{j})}^{2}}{(a_{k} - a_{j})}, k = 1, \dots, (n + 1)

are the residues of F, and hence are integrals of motion for Jacobi’s equations. □

Degenerate Case $A = 0$ and Kepler’s Problem

Let us now return to the Hamiltonian equations generated by the canonical affine Hamiltonian

H = \frac{1}{2} {(x, x)}_{ϵ} {(y, y)}_{ϵ} - \frac{1}{2} {(A x, x)}_{ϵ}

on the coadjoint orbit through

X_{0} = a \otimes_{ϵ} a

for some

a \in R^{n + 1}

with

\frac{d}{d t} (x \land_{ϵ} y) = [A, x \otimes_{ϵ} x], \frac{d}{d t} (x \otimes_{ϵ} x) = [x \land_{ϵ} y, x \otimes_{ϵ} x],

(80)

and their equivalent formulation on the tangent bundle of

S_{ϵ}^{n}

:

\dot{x} = {(x, x)}_{ϵ} y, \dot{y} = A x - (\frac{{A x, x)}_{ϵ}}{{(x, x)}_{ϵ}} + {(y, y)}_{ϵ}) x .

(81)

When

A = 0

the Hamiltonian H reduces to

H = \frac{1}{2} {(x, x)}_{ϵ} {(y, y)}_{ϵ}

and the corresponding equations reduce to

\frac{d}{d t} (x \land_{ϵ} y) = 0, \frac{d}{d t} (x \otimes_{ϵ} x) = [x \land_{ϵ} y, x \otimes_{ϵ} x] .

(82)

Then Equation (82) yield an integral of motion

x \land_{ϵ} y = c o n s t

, and Equation (81) reduce to

\dot{x} = {| | x | |}_{ϵ}^{2} y, \dot{y} = - {| | y | |}_{ϵ}^{2} {x, where | | x | |}_{ϵ}^{2} = {(x, x)}_{ϵ} {, | | y | |}_{ϵ}^{2} = {(y, y)}_{ϵ} .

Upon differentiating we get

\ddot{x} + {| | x | |}_{ϵ}^{2} {| | y | |}_{ϵ}^{2} x = 0 .

(83)

We will now assume that

{(a, a)}_{ϵ} = h^{2}

so that

S_{- 1}^{n}

is the hyperboloid

x_{0}^{2} = h^{2} + \sum_{i = 1}^{n} x_{i}^{2}, x_{0} \geq 0

. On energy level

H = \frac{ϵ}{2 h^{2}}

,

{| | x | |}_{ϵ}^{2} {| | y | |}_{ϵ}^{2} = \frac{ϵ}{h^{2}}

the solutions of (83) are given by

x (t) = c_{1} cos \frac{t}{h} \sqrt{ϵ} + c_{2} sin \frac{t}{h} \sqrt{ϵ}

where

c_{1}

and

c_{2}

are constant vectors (complex when

ϵ = - 1

) that satisfy

{| | c | |}_{ϵ}^{2} = {| | c | |}_{ϵ}^{2} = h^{2}, {(c_{1}, c_{2})}_{ϵ} = 0

.

For

ϵ = 1

the above curves trace great circles on the sphere

{| | x | |}^{2} = | | x_{0} {| |}^{2}

and for

ϵ = - 1

the solutions trace great hyperbolas on the hyperboloid

{| | x | |}_{- 1}^{2} = | | x_{0} {| |}_{- 1}^{2}

(an immediate consequence of the fact that

x (t) \land_{ϵ} \dot{x} (t) = a \land_{ϵ} b

). That is, solutions are the geodesics on spaces of constant non-zero curvature. The zero curvature case may be obtained by considering

ϵ

as a continuous parameter and then letting it tend to zero (as will be explained below).

We will now show that there exists a canonical change of coordinates

{(x_{0}, \dots, x_{n},

y_{0}, \dots, y_{n}), {(x, x)}_{ϵ} = h^{2}, {(x, y)}_{ϵ} = 0} \to (p_{1}, \dots, p_{n}, q_{1}, \dots, q_{n})

in which p is the stereographic projection through the point

x_{0} = h e_{0}

given by

λ (x - h e_{0}) + h e_{0} = (0, p) with λ = \frac{h}{h - x_{0}}, x \in S_{ϵ}^{n}

(84)

such that in the new coordinates the preceding geodesic differential system is transformed into the n-dimensional Kepler’s system, an n-dimensional generalization of the Hamiltonian equations that describe the motion of a planet around an immovable planet in the presence of the gravitational force.

Equation (84) yields

p = \frac{h}{h - x_{0}} \bar{x}

, where

\bar{x} = (x_{1}, \dots, x_{n})

. Therefore the inverse map

x = Φ_{ϵ} (p)

is given by

x_{0} = \frac{h (| | p | |^{2} - ε h^{2})}{{| | p | |}^{2} + ε h^{2}}, \bar{x} = \frac{2 ϵ h^{2}}{{| | p | |}^{2} + ε h^{2}} p .

(85)

Assume that the cotangent bundle of

R^{n}

is identified with its tangent bundle

R^{n} \times R^{n}

via the Euclidean inner product

(\cdot)

, and let

(p, q)

denote the points of

R^{n} \times R^{n}

. We will next find

q = Ψ (x, y)

such that

{(d x, y)}_{ϵ} = (d p \cdot Ψ (x, y)),

for all

(x, y)

with

x \in S_{ϵ}^{n} (h)

and

{(x, y)}_{ϵ} = 0

. For then the transformation

(p, q) \in R^{n} \times R^{n} \to (x, y) \in T S_{ϵ}^{n} (h)

is a symplectomorphism since it pulls back the Liouville form

{(d x, y)}_{ϵ}

on

T S_{ϵ}^{n} (h)

onto the Liouville form

(d p \cdot q)

in

R^{n} \times R^{n}

(The symplectic form is the exterior derivative of the Liouville-Poincaré form). It follows that

\begin{matrix} {(y, d x)}_{ϵ} = \sum_{j = 1}^{n} y_{0} \frac{\partial x_{0}}{\partial p_{j}} d p_{j} + ϵ \sum_{i = 1}^{n} \sum_{j = 1}^{n} y_{i} \frac{\partial x_{i}}{\partial p_{j}} d p_{j} = \\ \sum_{j = 1}^{n} (y_{0} \frac{\partial x_{0}}{\partial p_{j}} + ϵ \sum_{i = 1}^{n} y_{i} \frac{\partial x_{i}}{\partial p_{j}}) d p_{j} = \sum_{j = 1}^{n} q_{j} d p_{j} = (d p \cdot q) . \end{matrix}

Therefore,

q_{j} = \frac{\partial x_{0}}{\partial p_{j}} y_{0} + ϵ \sum_{i = 1}^{n} y_{i} \frac{\partial x_{i}}{\partial p_{j}}, j = 1, \dots, n .

After the appropriate differentiations in (85) we get

q = \frac{2 h^{2}}{{| | p | |}^{2} + ϵ h^{2}} (\frac{2 ϵ h y_{0}}{{| | p | |}^{2} + ϵ h^{2}} p + \bar{y} - \frac{2 (\bar{y} \cdot p)}{{| | p | |}^{2} + ϵ h^{2}} p), \bar{y} = (y_{1}, \dots, y_{n}) .

Hence,

\frac{{| | p | |}^{2} + ϵ h^{2}}{2 h^{2}} (q \cdot p) = \frac{2 ϵ h y_{0}}{{| | p | |}^{2} | | + ϵ h^{2}} {| | p | |}^{2} - \frac{{| | p | |}^{2} - ϵ h^{2}}{{| | p | |}^{2} + ϵ h^{2}} (\bar{y} \cdot p) .

Since y is orthogonal to x,

(\bar{y} \cdot p) = - \frac{y_{0}}{2 h} {(| | p | |}^{2} - ϵ h^{2})

. Therefore,

y_{0} = \frac{1}{h} q \cdot p, \bar{y} = \frac{{| | p | |}^{2} + ε h^{2}}{2 h^{2}} q - \frac{q \cdot p}{h^{2}} p .

(86)

After the substitutions

{| | p | |}^{2} = \frac{h^{2}}{h - x_{0}} | | \bar{x} {| |}^{2}

into the preceding equation we get

q = \frac{2 (h - x_{0})}{| | \bar{x} {| |}^{2} + (h - x_{0})} ({(h - x_{0})}^{2} \bar{y} + y_{0} \bar{x}) .

To pass to the problem of Kepler, write the Hamiltonian

H = \frac{1}{2} {| | x | |}_{ε}^{2} {| | y | |}_{ε}^{2}

in the variables

(p, q)

. An easy calculation in (86) yields

{(y, y)}_{ϵ} = ε \frac{{(| | p | |}^{2} + ε h^{2})^{2}}{4 h^{4}} {| | q | |}^{2}

. Therefore,

H = \frac{1}{2} h^{2} ε \frac{{(| | p | |}^{2} + ε h^{2})^{2}}{4 h^{4}} {| | q | |}^{2} = \frac{1}{2} ε \frac{{(| | p | |}^{2} + ε h^{2})^{2}}{4 h^{2}} {| | q | |}^{2} .

The corresponding flow is given by

\frac{d p}{d s} = \frac{\partial H}{\partial q} = ε \frac{{(| | p | |}^{2} + ε h^{2})^{2}}{4 h^{2}} q, \frac{d q}{d s} = - \frac{\partial H}{\partial p} = - ε \frac{{| | p | |}^{2} + ε h^{2}}{2 h^{2}} {| | q | |}^{2} p

On energy level

H = \frac{ε}{2 h^{2}}, \frac{{(| | p | |}^{2} + ε h^{2})^{2}}{4} {| | q | |}^{2} = 1,

and the preceding equations reduce to

\frac{d p}{d s} = ε \frac{q}{h^{2} {| | q | |}^{2}}, \frac{d q}{d s} = - ε \frac{| | q | |}{h^{2}} p .

After the reparametrization

t = - \frac{- ε}{h^{2}} \int_{0}^{s} | | q (τ) | | d τ

Equation (23) become

\frac{d p}{d t} = \frac{d p}{d s} \frac{d s}{d t} = - \frac{q}{{| | q | |}^{3}}, \frac{d q}{d t} = \frac{d q}{d s} \frac{d s}{d t} = p .

On

H = \frac{ϵ}{h^{2}}

,

\frac{{(| | p | |}^{2} + ε h^{2})^{2}}{4} {| | q | |}^{2} = 1

and

E = \frac{1}{2} {| | p | |}^{2} - \frac{1}{| | q | |} = \frac{1}{2 | | q | |} {(| | p | |}^{2} | | q | | - 2) = \frac{1}{2 | | q | |} (2 - ε h^{2} | | q | | - 2) = - \frac{1}{2} ε h^{2} .

So

E < 0

in the spherical case and

E > 0

in the hyperbolic case.

The Euclidean case

E = 0

can be obtained by a limiting argument in which

ε

is regarded as a continuous parameter which tends to zero.

To explain in more detail, let

w_{0} = lim_{ϵ \to 0} = h, w = lim_{ϵ \to 0} \frac{1}{2 ϵ h^{2}} \bar{x} = \frac{1}{{| | p | |}^{2}} .

The transformation

p \to w

with

w = \frac{1}{{| | p | |}^{2}} p

is the inversion about the circle

{| | p | |}^{2} = 1

in the affine hyperplane

w_{0} = h

, and

{| | d w | |}^{2} = \frac{1}{{| | p | |}^{4}} {| | d p | |}^{2}

is the corresponding transformation of the Euclidean metric

{| | d p | |}^{2}

. The Hamiltonian

H_{0}

associated with this metric is equal to

\frac{1}{2} \frac{{| | p | |}^{4}}{4} {| | q | |}^{2} .

This Hamiltonian can be also obtained as the limit of

(\frac{h^{2}}{ϵ}) \frac{1}{2} \frac{{(| | p | |}^{2} + ϵ h^{2})^{2}}{4 h^{2}} {| | q | |}^{2}

when

ϵ \to 0

. On energy level

H = \frac{1}{2} {, | | p | |}^{2} | | q | | = 2

and therefore,

E = 0

. Of course, the solutions of (12) tend to the Euclidean geodesics as

ϵ

tends to zero. Consequently,

w (t) = {lim}_{ϵ \to 0} \frac{1}{2 h^{2} ε} (\bar{x} (t))

is a solution of

\frac{d^{2} w}{d t^{2}} = 0,

and hence, is a geodesic corresponding to the standard Euclidean metric.

Let us also note that the angular momentum

L = q \land p

and the Laplace-Runge-Lenz vector

F = L p - \frac{q}{| | q | |}

for Kepler’s problem have simple geometric interpretation on the coadjoint orbits according to the following proposition.

Proposition 10.

Let

x = x_{0} e_{0} + \bar{x}

and

y = y_{0} e_{0} + \bar{y} .

On energy level

H = \frac{ϵ}{2 h^{2}}

,

L = (\bar{y} \land_{ϵ} \bar{x}) and F = h {(y_{0} {(e_{0} \land \bar{x})}_{ε} - x_{0 (} e_{0} \land \bar{y})}_{ε}) e_{0} .

For a proof see [7].

This remarkable discovery that the solutions of Kepler’s problem are intimately related to the geometry of spaces of constant curvature goes back to A.V. Fock’s paper of 1935 [42] in which he reported that the symmetry group for the motions of the hydrogen atom is

O_{4} (R)

for negative energy,

E^{3} ⋊ O_{3} (R)

for zero energy and

O (1, 3)

for positive energy. It is then not altogether surprising that similar results apply to the problem of Kepler since the energy function for Kepler’s problem is formally the same as the energy function for the hydrogen atom.

This connection between the problem of Kepler and the geodesics on the sphere was reported by J. Moser in 1970 [43], while Y. Osipov [44] reported similar results later for geodesics on spaces of negative constant curvature. In spite of their brilliance, these papers did not attempt any explanations in regard to this enigmatic connection between planetary motions and geodesics on space forms. This issue later inspired V. Guillemin and S. Sternberg to take up the problem of Kepler in a larger geometric context, with Moser’s observation as the background, in a paper titled Variations on a theme by Kepler [45]. The introduction of Kepler’s problem through the canonical affine-quadratic problem exemplifies, once again, this fascinating and recurrent interplay between mathematical physics, geometry and integrable systems.

5. Homogeneous Riemannian Manifolds and Rolling Geodesics

Our overview of integrable systems raises a natural question: what is the geometric origin behind the affine-quadratic problem that accounts for its ubiquitous presence in the theory of integrable systems? A partial answer to this question comes, somewhat unexpectedly, from a new class of variational problems, called rolling problems. We will take up this issue next. Since the underlying variational problems require new concepts and terminology, we will be obliged to make a slight detour into an earlier paper [9] in order to introduce the necessary ingredients.

The general setting is defined by a reductive pair

(G, K)

with G semi-simple and K compact. We assume that the Lie algebra decomposition

g = p \oplus k

, with

p

the orthogonal complement of

k

relative to the Killing form on

g

satisfies the strong Cartan conditions

[p, k] = p, [p, p] = k, [k, k] \subseteq k .

(87)

We will also assume the Killing form is of definite sign on

p

in which case

⟨, ⟩

will denote a scalar multiple of the Killing form that is positive on

p

. We recall that the Killing form is invariant under any linear automorphism of

g

and hence the quadratic form

⟨, ⟩

is

A d_{G}

invariant [15].

We consider G a semi-Riemannian manifold relative to the left-invariant metric

{⟨ ⟨ g X, g Y ⟩ ⟩}_{g} = ⟨ X, Y ⟩, X, Y \in g

induced by

⟨, ⟩

(the Killing form is not necessarily positive on

g

, hence the metric is in general of indefinite sign, i.e., it is semi-Riemannian [46]). The left-invariant distributions

D (g) = {g X : X \in p}

and

V (g) = {g X) : X \in k}

are called horizontal and vertical respectively. Then curves that are tangent to

D

, i.e., satisfy

\frac{d g}{d t} \in D (g (t))

are called horizontal. Likewise curves that are tangent to

V

are called vertical. It follows that

D (g) \oplus V (g) = T_{g} G, g \in G .

(88)

We will assume that

M = G / K

consisting of the left coset

g K

is endowed with a manifold structure so that the natural projection

π (g) = g K

is a smooth surjection [46]. A curve

g (t)

in G is called a lift of a curve

p (t) \in M

if

π (g (t)) = p (t)

. A lift is called horizontal when

g (t)

is a horizontal curve. Every curve

p (t)

in M is the projection of a horizontal curve

g (t)

. If a curve

g (t)

is a solution of

\frac{d g}{d t} = g (t) U (t)

for some curve

U (t) \in p

then

d_{g (t)} π (g (t) U (t)) = \frac{d p}{d t} .

The correspondence

D (g) \to T_{π (g)} M

given by

d_{g} π (g U) = \frac{d p}{d t}

is an isomorphism and induces a metric on M

{(d_{g} π (g V), d_{g} π {(g W)}_{π (g)} = ⟨ g V, g W ⟩ ⟩}_{g} = ⟨ V, W ⟩, V, W \in p .

(89)

Let now

{τ_{g} : g \in G}

denote the group of diffeomorphisms defined by the left action

π (L_{g} (h)) = τ_{g} (π (h)), h \in G, L_{g} (h) = g h .

We then have

Proposition 11.

The metric (89) is invariant under

{τ_{g} : g \in G}

, that is,

(d_{o} τ_{g} (V (p), d_{o} τ_{g} {(W (p))}_{τ_{g} (p)} = {(V (p), W (p))}_{p},

(90)

for any

g \in G

and any tangent vectors

V (p)

and

W (p)

in

T_{p} M

.

For a proof see [47].

It follows that each

τ_{g}

is an isometry. Since G acts transitively on M, M can be represented by the orbit

{τ_{g} (o) : g \in G}

where

o = π (e)

and e is the group identity in G. It follows that

π ((exp t U) g) = τ_{exp t U} (π (g))

for any

U \in g

. Note that

g \to (exp t U) g

is the flow generated by a right-invariant vector field

U_{r} (g) = U g

. Therefore the flow of

U_{r}

is

π

-related to the flow

{τ_{exp t U} : t \in R}

in M. We will let

\vec{U}

denote the infinitesimal generator of the flow

{τ_{exp t U} : t \in R}

.

It follows that each

\vec{U}

is a Killing vector field on M. A vector field whose flow acts on M by isometries is called a Killing vector field (see [46] for additional details). The correspondence

U_{r} (g) \to \vec{U} (π (g))

is one to one and onto

T_{π (g)} M

. Since the Lie brackets of vector fields related by a mapping F are also F-related, the Lie brackets

[U_{r}, V_{r}]

are

d π

-related to

[\vec{U}, \vec{V}]

. Therefore the correspondence

U_{r} (g) \to \vec{U} (π (g))

is a Lie algebra homomorphism, and hence

F = {\vec{U} : U \in g}

is a finite dimensional Lie algebra of Killing vector fields that satisfies

F (p) = T_{p} M

for each

p \in M

.

Note that

π (exp t U) = τ_{e^{t U}} (o) = exp t \vec{U} (o)

. So if

U \in k

then

π (exp t U) = o

and therefore

\vec{U} (o) = 0

. It then follows that

d_{e} π (U) = \vec{U} (o)

is an isometric isomorphism from

p

onto

T_{o} M

. More generally if

g (t)

is any horizontal curve then

p (t) = π (g (t)) = τ_{g} (t) π (e)

implies that

\frac{d p}{d t} = d_{g (t)} (π (g (t) U (t))) = d_{o} τ_{g (t)} d_{e} π (U (t)) = d_{o} τ_{g (t)} \vec{U} (t) (o),

(91)

and

{(d_{o} τ_{g (t)} \vec{U} (t) (o), d_{o} τ_{g (t)} \vec{V} (t) (o))}_{p (t)} = {(\vec{U} (o), \vec{V} (o))}_{o} .

Therefore

d_{o} τ_{g (t)}

is an isometry that maps

T_{o} M

onto

T_{p (t)} M

.

A homogeneous manifold

M = G / K

with a G-invariant metric defined by a reductive pair (G,K) with G semi-simple and K compact, will be referred to as semi-simple (it is defined by a semi-simple Lie group G, a compact subgroup K, and the metric induced by the Killing form). It can be shown that any symmetric Riemannian space with no Euclidean factors can be reduced to a semi-simple manifold (so that

[p, p] = k

holds). Conversely, every semi-simple manifold is locally symmetric. It is symmetric when G is simply connected (see [48], Proposition 6.27). We will not pursue further proximities with symmetric spaces since the present exposition makes no use of geodesic symmetries.

We now come to the main topic of this section, rolling of semi-simple manifolds on their tangent spaces. We begin by recalling the basic definition.

Definition 1.

A curve

α (t)

on a Riemannian manifold M rolls on a curve

\hat{α} (t)

on another Riemannian manifold

\hat{M}

if there exists an isometry

A (t) : T_{α (t)} M \to T_{\hat{α} (t)} \hat{M}

that satisfies:

\frac{d \hat{α}}{d t} = A (t) \frac{d α}{d t},

(92)

and also satisfies the condition that

A (t) v (t)

is a parallel vector field in

\hat{M}

along

\hat{α} (t)

for each parallel vector field

v (t)

along

α (t)

in M.

This intrinsic definition of rolling was introduced in [49], and later used in [50,51]. In this context the triple

(α (t), \hat{α} (t), A (t))

is called a rolling curve. It is clear that rolling is reflexive in the sense that if

α (t)

is rolled on

\hat{α} (t)

by an isometry

A (t)

then

\hat{α} (t)

is rolled on

α (t)

by the isometry

A^{- 1} (t)

, and therefore

(\hat{α} (t), α (t), A^{- 1} (t))

is also a rolling curve.

We will now examine rollings of semi-simple manifolds on their tangent planes. It comes as a pleasant surprise that such rollings are essentially described by Equation (91) reinterpreted in terms of rolling. So the passage to rolling becomes largely a question of semantics, as demonstrated in the text below.

We will consider rollings of M on

\hat{M} = T_{o} M

with its metric

{(u, v)}_{o}

defined by (89). The rollings on other tangent spaces are conjugate to the rollings on

T_{o} M

[47]. Let

α (t)

be an arbitrary curve in M and let

\hat{α} (t)

be a curve in

\hat{M}

that

α (t)

is rolled on. It follows that

α (t) = π (g (t)) = τ_{g (t)} (o)

for some horizontal curve

g (t)

. If

g (t)

is a solution of

\frac{d g}{d t} = g (t) U (t), U (t) \in p

then according to (91)

\frac{d α (t)}{d t} = d_{g (t)} π (g (t) U (t)) = d_{o} τ_{g (t)} \vec{U} (t) (o),

If we now let

\hat{α} (t)

be any solution in

\hat{M}

of

\frac{d \hat{α} (t)}{d t} = \vec{U} (t) (o)

then

A (t) = d_{o} τ_{g (t)}

is an isometry that rolls

\hat{α} (t)

on

α (t)

since the parallel transport condition is satisfied (for proofs see [47]). Of course, then

A^{- 1} (t)

rolls

α (t)

on

\hat{α} (t)

.

It follows that each horizontal curve

g (t)

in G defines a family of curves

\hat{α} (t)

in

\hat{M}

, each a solution of

\frac{d \hat{α}}{d t} = \vec{U} (t) (o)

, with

\vec{U} (t)

induced by

U (t) = g^{- 1} (t) \frac{d g}{d t}

, that roll on

α (t) = π (g (t))

. The converse is also true: every solution

(g (t), \hat{α} (t))

of the differential system

\frac{d g}{d t} = g (t) U (t), \frac{d \hat{α} (t)}{d t} = \vec{U} (t) (o), U (t) \in p

(93)

defines a curve

α (t) = π (g (t))

in M on which

\hat{α} (t)

in

\hat{M}

is rolled by the isometry

d_{o} τ_{g (t)}

.

We will regard (93) as the fundamental object in rolling defined on

G = G \times \hat{M}, \hat{M} = T_{o} M

, a Lie group with its group operation

g h = (g, p) (h, q) = (g h, p + q), g = (g, p), h = (h, q) .

Then

G = g \times T_{o} M

will denote the Lie algebra of

G

with the Lie bracket

[(X, \vec{U} (o)), (Y, \vec{V} (o)]

= ([X, Y], 0)

.

Let now

H (g, p) = {(g U, \vec{U} (o)) : U \in p}, (g, p) \in G

. We will view

H

as a left-invariant distribution on

G

defined by the left-translates of vector space

Γ = {(U, \vec{U} (o)) : U \in p}

in

G

. The distribution

H

is called the rolling distribution and its integral curves are called rolling motions. Any rolling motion

g (t) = (g (t), p (t))

is a solution of

\frac{d g}{d t} = g (t) U (t), \frac{d p}{d t} = \vec{U} (t) (o),

(94)

and can be associated with the rolling curve

(\hat{α} (t), α (t)), d_{o} τ_{g (t)})

, where

α (t) = τ_{g (t)} (o)

and

\hat{α} (t) = p (t)

.

Since

p

and

k

satisfy strong Cartan conditions

[p, k] = p

and

[p, p] = k

,

Γ

satisfies

[Γ, Γ] = (k, 0),

and

[Γ, [Γ], Γ]] = (p, 0)

. Therefore,

Γ + [Γ, Γ] + [Γ, [Γ, Γ]] = G,

(95)

Hence the Lie algebra generated by the left-invariant vector fields tangent to

H

is equal to

G

, therefore any two points in

G

can be connected by a rolling motion, and each rolling motion inherits a natural length

\int_{0}^{T} \sqrt{⟨ U (t), U (t) ⟩} d t

from G. It is then known that any pair of points in

G

can be connected by an integral curve of

H

of minimal length because vector fields in

H

are complete [50]. The above shows that

G

with the above metric is a sub-Riemannian manifold. We will refer to the associated sub-Riemannian geodesics as the rolling geodesics.

We will now turn to the Maximum principle to find the necessary conditions that the rolling geodesics must satisfy. To put the matter in the control theoretic context, let

A_{1}, \dots, A_{m}

be an orthonormal basis in

p

so that

(A_{i}, {\vec{A}}_{i} (o))

becomes an orthonormal basis in

Γ

. Then an absolutely continuous curve

g (t) = (g (t), p (t))

is a rolling motion if and only if

\frac{d g}{d t} = \sum_{i = 1}^{m} u_{i} (t) g (t) A_{i}, \frac{d p}{d t} = \sum_{i = 1}^{m} u_{i} (t) {\vec{A}}_{i} (o),

(96)

for some bounded and measurable control functions

u_{1} (t), \dots, u_{m} (t)

, in which case the length of

g (t)

is given by

\int_{0}^{T} \sqrt{u_{1}^{2} (t) + \dots + u_{m}^{2} (t)} d t

. The rolling problem is an optimal control problem and consists of finding the solutions

g (t) = (g (t), p (t))

on a fixed time interval

[0, T]

that satisfy the given boundary conditions

g (0) = g_{0}

and

g (T) = g_{1}

along which the energy of transfer

\frac{1}{2} \int_{0}^{T} \sum_{i = 1}^{m} u_{i}^{2} (t) d t

is minimal. It is known that each rolling geodesic is locally optimal and hence is a solution to the above control problem [7,50].

5.1. Rolling Hamiltonians

To emphasize the invariant properties of the problem we will rewrite (96) as

\frac{d g}{d t} = \sum_{i = 1}^{m} u_{i} (t) X_{i} (g),

(97)

where each

X_{i}

a left-invariant vector field

X_{i} (g) = (g A_{i}, {\vec{A}}_{i} (o))

,

g = (g, p)

. If

g (t)

is an optimal trajectory then, according to the Maximum Principle,

g (t)

is the projection of an extremal curve

ξ (t)

in

T^{*} G

along which the cost extended Hamiltonian

- \frac{λ}{2} \sum_{i = 1}^{m} u_{i}^{2} (t) + \sum_{i = 1}^{m} u_{i} (t) H_{i} (ξ (t)), λ = 0, 1

is maximal relative to all other control functions. Here

H_{i}

is the Hamiltonian lift of

X_{i}

, i.e.,

H_{i} (ξ (t)) = ξ (t) (X_{i} (g (t))

.

There are two kinds of extremal curves depending whether

λ = 0

(abnormal case) or

λ = 1

(normal case). In the abnormal case the Maximum principle results in the constraints

H_{i} (ξ (t)) = 0, i = 1, \dots, m,

(98)

and beyond that gives no further information about the optimal control in question. In the normal case, however, the above maximum yields

u_{i} (t) = H_{i} (ξ (t)), i = 1, \dots, m

, where

ξ (t)

is a solution curve of a single Hamiltonian vector field corresponding to the Hamiltonian

H (ξ) = \frac{1}{2} \sum_{i = 1}^{m} H_{i}^{2} (ξ) .

(99)

Each optimal solution

g (t)

is either the projection of an abnormal or a normal extremal curve. If

g (t)

is the projection of a normal extremal curve

ξ (t)

then

ξ (t)

is an integral curve of

\vec{H}

and the control

u (t)

that generates

g (t)

is of the form

u_{i} (t) = H_{i} (ξ (t)), i = 1, \dots, m

.

We will not concern ourselves with the abnormal extremals. It is very likely that every optimal trajectory is the projection of a normal extremal curve, as in [52], in which case the abnormal extremals could be ignored. Instead we will turn to the normal Hamiltonian H and its Hamiltonian equations

\frac{d g}{d t} = \sum_{i = 1}^{n} H_{i} (ℓ (t)) X_{i} (g (t)), \frac{d ℓ}{d t} = - a d^{*} d H (ℓ (t)) (ℓ (t)) .

Let us first consider the solutions of the associated Poisson equation

\frac{d ℓ}{d t} = - a d^{*} d H (ℓ (t)) (ℓ (t))

(100)

and the structure of the coadjoint orbits.

Since

\hat{M}

is a Euclidean vector space, its tangent space at the origin can be identified with

\hat{M}

. Then the Lie algebra

G

will be identified with

g \times \hat{M}

, and its dual with

G^{*} = g^{*} \oplus {\hat{M}}^{*}

, where

g^{*} = {ℓ \in G^{*} : ℓ (\dot{p}) = 0, \dot{p} \in \hat{M}}, {\hat{M}}^{*} = {ℓ \in G^{*} : ℓ (g) = 0} .

It then follows that every

ℓ \in G^{*}

can be written as

ℓ = ℓ_{1} + ℓ_{2}

with

ℓ_{1} \in g^{*}

and

ℓ_{2} \in {\hat{M}}^{*}

. Since

\hat{M}

is an abelian algebra the projection

ℓ_{2}

on

{\hat{M}}^{*}

is constant on each coadjoint orbit of

G

. The argument is straightforward:

A d_{g}^{*} (ℓ) (X + \dot{p}) = ℓ (A d_{g^{- 1}} (X + \dot{p})) = ℓ (A d_{g^{- 1}} (X) + \dot{p}) = ℓ_{1} (A d_{g^{- 1}} (X)) + ℓ_{2} (\dot{p}),

for any

g = (g, p) \in G

. It follows that the coadjoint orbits in

G

are of the form

{A d_{g}^{*} (ℓ_{1}) : g \in G} + ℓ_{2}, for any ℓ = ℓ_{1} + ℓ_{2} .

This fact can be also verified directly from Equation (100): we have

\frac{d ℓ}{d t} V = - ℓ [d H, V], for any V = X + \dot{p} in G,

where

d H = \sum_{i = 1}^{m} H_{i} (ℓ) (A_{i} + {\vec{A}}_{i} (o))

and

H_{i} (ℓ) = ℓ_{1} (A_{i}) + ℓ_{2} ({\vec{A}}_{i} (o))

. Therefore,

\frac{d ℓ_{1}}{d t} (X) + \frac{d ℓ_{2}}{d t} (\dot{p}) = - (ℓ_{1} + ℓ_{2}) ([d H, X + \dot{x}]) = - \sum_{i = 1}^{m} H_{i} (ℓ_{i}) [A_{i}, X] .

from which follows that

\frac{d ℓ_{1}}{d t} (X) = - \sum_{i = 1}^{n} H_{i} (ℓ_{i}) [A_{i}, X], X \in g, \frac{d ℓ_{2}}{d t} (\dot{p}) = 0 .

Since

\dot{p}

is arbitrary

\frac{d ℓ_{2}}{d t} = 0

.

To uncover other constants of motion identify

G^{*}

with

G

via the natural quadratic forms on each of the factors, and then recast the preceding equations on

G

. More precisely, identify each

ℓ_{2}

in

{\hat{M}}^{*}

with a tangent vector

l = \sum_{i = 1}^{m} l_{i} {\vec{A}}_{i} (o)

via the formula

ℓ_{2} (\dot{p}) = (l, \dot{p}), \dot{p} \in \hat{M}

. Similarly, identify

ℓ_{1} \in g^{*}

with

L \in g

via the formula

ℓ_{1} (X) = ⟨ L, X ⟩, X \in g

. Then decompose

L \in g

into the sum

L = L_{p} + L_{k}

,

L_{p} \in p

and

L_{k} \in k

. Relative to the basis

A_{1}, \dots, A_{m}

in

p

,

L_{p} = \sum_{i = 1}^{m} P_{i} A_{i}

where

P_{i} = ℓ_{1} (A_{i}) = ⟨ L, A_{i} ⟩

. It follows that

H_{i} (ξ) = ℓ (A_{i} + {\vec{A}}_{i} (o)) = ℓ_{1} (A_{i}) + ℓ_{2} ({\vec{A}}_{i} (o)) = P_{i} + l_{i},

and

\begin{matrix} \frac{d ℓ_{1}}{d t} (X) = ⟨ \frac{d L}{d t}, X ⟩ = - ⟨ L, [\sum_{i = 1}^{m} (l_{i} + P_{i}) A_{i}, X] ⟩ = - ⟨ [L, \sum_{i = 1}^{m} (l_{i} + P_{i}) A_{i}], X ⟩, \\ (\frac{d l}{d t}, \dot{p}) = \frac{d ℓ_{2}}{d t} (t) (\dot{p}) = 0 \end{matrix}

Since X and

\dot{p}

are arbitrary,

\frac{d L}{d t} = [\sum_{i = 1}^{m} (l_{i} + P_{i}) A_{i}, L] = [A + L_{p}, L], A = \sum_{i = 1}^{m} l_{i} A_{i}, \frac{d l}{d t} = 0 .

(101)

Coupled with

\frac{d g}{d t} = g (t) (A + L_{p}), \frac{d p}{d t} = \sum_{i = 1}^{n} (l_{i} + P_{i}) {\vec{A}}_{i} (o),

(102)

Equation (101) constitute the Hamiltonian equations on

G \times G

generated by the Hamiltonian

H = \frac{1}{2} \sum_{i = 1}^{m} H_{i}^{2} = \frac{1}{2} \sum_{i = 1}^{m} {(l_{i} + P_{i})}^{2}

.

Each extremal curve projects onto a geodesic

g (t) = (g (t), p (t))

, and each geodesic further projects onto the pair of curves

α (t) = τ_{g (t)} (o)

in M and

β (t) = p (t)

in

\hat{M}

that are rolled upon each other by

g (t)

. Note that in this identification of the Lie algebras with their duals, coadjoint orbits

{A d_{g}^{*} (ℓ_{1}) + ℓ_{2} : g \in G}

are identified with the affine sets

{A d_{g} (L) + l : g \in G}

.

Recall now the Hamiltonian equations associated with the canonical affine-quadratic problem (Equation (33)):

\frac{d g}{d t} = g (t) (A + L_{k} (t)), \frac{d L_{k}}{d t} = [A, L_{p}], \frac{d L_{p}}{d t} = [L_{k}, L_{p}] + s [A, L_{k}], s = 0, 1 .

The propositions below reveal a remarkable fact that the Poisson equations of a canonical affine-quadratic Hamiltonian are subordinate to the Poisson equations associated with a rolling Hamiltonian. This connection identifies the drift term in the affine-quadratic system with a coadjoint invariant of the rolling Poisson system. To keep the systems apart we will use bold letters when referring to the variables in the rolling Hamiltonian in contrast to the variables in the affine-quadratic Hamiltonian which will remain the same.

Proposition 12.

Let

(g (t), p (t)), L_{p} (t), L_{k} (t)

be an integral curve of the rolling Hamiltonian

H = \frac{1}{2} | | A + L_{p} {| |}^{2}

, that is,

\begin{matrix} \frac{d g}{d t} = g (t) (A + L_{p} (t)), \frac{d p}{d t} = \sum_{i = 1}^{m} (l_{i} + P_{i}) {\vec{A}}_{i} (o), \\ \frac{d L_{k}}{d t} = [A, L_{p}], \frac{d L_{p}}{d t} = [A + L_{p}, L_{k}], A = \sum_{i = 1}^{m} l_{i} A_{i} \end{matrix}

Then

\tilde{g} (t) = g (t) h (t), L_{p} (t) = A d_{h^{- 1} (t)} (L_{p} (t)), L_{k} = A d_{h^{- 1} (t)} (L_{k} (t))

(103)

is an integral curve of the affine Hamiltonian

H = \frac{1}{2} ⟨ L_{k}, L_{k} ⟩ + ⟨ A, L_{p} ⟩

, where

A = A d_{h^{- 1} (t)} (A + L_{p} (t))

, and

h (t)

is the solution of

\frac{d h}{d t} = L_{k} (t) h (t)

with

h (0) = I

.

Moreover, if

x (t)

a solution of

\frac{d x}{d t} = A + L_{p} (t)

then

\tilde{g} (t) = (x (t), h (t))

in

p ⋊ K

is the projection of an extremal curve

L_{k} (t) = A d_{h^{- 1} (t)} L_{k} (t), L_{p} (t) = A d_{h^{- 1} (t)} (L_{p} (t)) - A, A = A d_{h^{- 1} (t)} (A + L_{p} (t))

associated with the shadow Hamiltonian

H = \frac{1}{2} ⟨ L_{k}, L_{k} ⟩ + ⟨ A, L_{p} ⟩

.

The converse also holds according to the following proposition.

Proposition 13.

Suppose that

(\tilde{g} (t), L_{p} (t), L_{k} (t))

is an extremal curve of the affine Hamiltonian

H = \frac{1}{2} ⟨ L_{k}, L_{k} ⟩ + ⟨ A, L_{p} ⟩

. Let

g (t) = \tilde{g} (t) h^{- 1} (t), L_{p} (t) = A d_{h (t)} (L_{p} (t)), L_{k} (t) = A d_{h (t)} (L_{k} (t)), A = A d_{h (t)} (A - L_{p} (t))

where

h (t)

is a solution of

\frac{d h}{d t} = h (t) (L_{k} (t))

and let

p (t)

be a solution of

\frac{d p}{d t} = \vec{A} (o) + {\vec{L}}_{p} (t) (o) .

Then

(g (t), p (t))

together with

L_{p} (t) = A d_{h (t)} (L_{p} (t)), L_{k} (t) = A d_{h (t)} (L_{k} (t)), A = A d_{h (t)} (A - L_{p} (t))

is an extremal curve of the rolling Hamiltonian

H = \frac{1}{2} ⟨ A + L_{p}, A + L_{p} ⟩

.

However, if

\tilde{g} (t) = (x (t), R (t))

,

L_{p} (t) + L_{k} (t))

is an extremal curve of the shadow Hamiltonian H, then

(g (t), p (t))

, solutions of

\frac{d g}{d t} = g (t) A d_{R (t)} (A)), \frac{d p}{d t} = \vec{\frac{d x}{d t}} (o),

together with

L_{p} (t) = A d_{R (t)} (A + L_{p} (t)), L_{k} (t) = A d_{R (t)} (L_{k} (t)), A = - A d_{R} L_{p}

define an extremal curve of the rolling Hamiltonian

H = \frac{1}{2} ⟨ A + L_{p}, A + L_{p} ⟩

.

The proofs follow by straightforward calculations (also done in [9]).

Let us now come back to isospectral representations and Zimmerman’s method [17,52]. For that purpose let

X_{0} (t) = A + L_{p} (t), X_{1} (t) = L_{k} (t), X_{2} (t) = - A, X_{3} = 0

. Then Poisson’s equations for the rolling problem can be written as

\frac{d X_{i}}{d t} = [X_{0} (t), X_{i + 1} (t)], i = 0, 1, 2 .

(104)

These equations are invariant under a dilational change of variables

X_{i} \to λ^{i - 1} X_{i}

. It then follows that

L_{λ} = \sum_{i = 0}^{3} λ^{i} X_{i} = L_{p} (t) + λ L_{k} (t) + (1 - λ^{2}) A

(105)

satisfies the equation

\frac{d L_{λ}}{d t} = [M_{λ} (t), L_{λ} (t)], M_{λ} (t) = \frac{1}{λ} (A + L_{p} (t)) .

(106)

Therefore

L_{λ}

is the spectral curve for

H

. But then the Poisson system associated with the affine-quadratic Hamiltonian also admits an isospectral representation after the substitutions

A = A d_{h (t)} (A - L_{p}), L_{k} = A d_{h (t)} (L_{k}), L_{p} = A d_{h (t)} (L_{p}), \frac{d h}{d t} = h (t) L_{k} (t) .

For then

(L_{p} (t), L_{k} (t))

are the extremal curves for the Poisson system associated with the affine-quadratic system (Proposition 13) and further satisfy

\begin{matrix} L_{λ} = A d_{h (t)} (L_{p}) + λ A d_{h (t)} (L_{k}) + (1 - λ^{2} (A d_{h (t)} (A - L_{p}) = \\ A d_{h (t)} (λ^{2} L_{p} + λ L_{k} + (1 - λ^{2}) A) = A d_{h (t)} L_{λ} . \end{matrix}

But then

\begin{matrix} A d_{h (t)} [\frac{1}{λ} A, L_{λ}] = [\frac{1}{λ} (A + L_{p}), L_{λ}] = \\ \frac{d L_{λ}}{d t} = \frac{d}{d t} (A d_{h (t)} (L_{λ}) = A d_{h (t)} [L_{λ}, L_{k}] + A d_{h (t)} \frac{d L_{λ}}{d t} \end{matrix}

implies

\frac{d L_{λ}}{d t} = [L_{k}, L_{λ}] + [\frac{1}{λ} A, L_{λ}] = [\frac{1}{λ} A + L_{k}, L_{λ}] .

To be consistent with my earlier publications, replace

λ

by

- \frac{1}{λ}

to get

\frac{d L_{λ}}{d t} = [M_{λ}, L_{λ}],

(107)

where

M_{λ} = L_{k} - λ A

, and

L_{λ} = L_{p} - λ L_{k} + (λ^{2} - 1) A

. Equation (107) agrees with the isospectral representation (34).

To get the spectral curve

L_{λ}

for the shadow Hamiltonian, use relations

L_{k} = A d_{h} (L_{k})

,

L_{p} = A d_{h} (L_{p} + A)

and

A = - A d_{h} L_{p}

from Proposition 13. Then

L_{λ} = L_{p} + λ L_{k} + (1 - λ^{2}) A = A d_{h} L_{λ}, L_{λ} = λ^{2} L_{p} + λ L_{k} + A .

Then a calculation analogous to the one above gives

\frac{d L_{λ}}{d t} = [\frac{1}{λ} A + L_{k}, L_{λ}]

. After the rescaling

λ \to - \frac{1}{λ}

we get a modified Lax pair

\frac{d L_{λ}}{d t} = [M_{λ}, L_{λ}], M_{λ} = L_{k} - λ A, L_{λ} = L_{p} - λ L_{k} + λ^{2} A .

(108)

5.2. Rolling Problem on Spaces of Constant Curvature

We will now introduce another optimal problem intertwined with the rolling problem. It consists of finding a continuously differentiable curve

p (t)

in M in an interval

[0, T]

, with its tangent vector

\dot{p} (t)

of unit length and its covariant derivative bounded and measurable in

[0, T]

that satisfies fixed tangential directions

\dot{p} (0) = v_{0}, v_{0} \in T_{p (0)} M

and

\dot{p} (T) = v_{1}, v_{1} \in T_{p (T)} M

along which the integral

\frac{1}{2} \int_{0}^{T} κ^{2} (s) d s

minimal among all other curves that satisfy the same boundary conditions. Here

κ (t) = | | \frac{d D_{p (t)}}{d t} (\dot{p} (t)) | |

, where

\frac{d D_{p (t)}}{d t}

denotes the covariant derivative along

p (t)

. The integral

\frac{1}{2} \int_{0}^{T} κ^{2} (s) d s

is known as the elastic energy of the curve

p (t)

[24]. Curves

p (t)

defined on some interval

[0, T]

are called elastic if for each

t \in (0, T)

there exits an interval

[t_{0}, t_{1}] \subset [0, T]

over which the elastic energy of

p (t)

is minimal relative to the boundary conditions

\dot{p} (t_{0})

and

\dot{p} (t_{1})

[7].

On semi-simple manifolds the curvature problem can be lifted to the unit tangent bundle of G, and it is this lifted version of the problem that will be of interest for this paper. In this formulation of the problem the tangent bundle of G is realized as the product

G \times g

with

(g, X) \in G \times g

identified with

g X \in T_{g} G

. Then each tangent vector

v \in T_{p} M

is the projection of a manifold

V = {(g h, A d_{h} (U)), h \in K}

in

G \times g

where

p = π (g)

and

v = d_{g} π (g) U, U \in p

. The lifted curvature problem consists of finding a curve

(g (t), Λ (t))

in

G \times S_{p}

,

S_{p} = {Λ \in p : ⟨ Λ, Λ ⟩ = 1}

, a solution of

\frac{d g}{d t} = g (t) Λ (t), \frac{d Λ}{d t} = U (t), ⟨ U (t), Λ (t) ⟩ = 0,

(109)

that originates in the manifold

V_{0} = {(g_{0} h, A d_{h^{- 1}} Λ_{0}), h \in K, Λ_{0} \in p}

at

t = 0

and terminates at the manifold

V_{1} = {(g_{1} h, A d_{h^{- 1}} Λ_{1}) : h \in K, Λ_{1} \in p}

at

t = T

for which the energy of transfer

\frac{1}{2} \int_{0}^{T} {| | U (s) | |}^{2} d s

is minimal. If

p (t) = π (g (t)) = τ_{g (t)} (o)

is the projected curve, then

p (t)

is the solution of

\dot{p} (t) = d_{g (t)} π (g (t)) Λ (t) = d_{o} τ_{g (t)} \vec{Λ} (t) (o) .

that satisfies

| | \dot{p} (t) | | = 1

and the boundary conditions

\begin{matrix} p (0) = π (g (0)), \dot{p} (0) = d_{g (0)} π (V_{0}) = d_{g (0)} π (g (0) Λ_{0}), \\ p (T) = π (g (T)), \dot{p} (T) = d_{g (T)} π (V_{1}) = d_{g (T)} π (g (T) Λ_{1}) . \end{matrix}

It is a simple exercise to show that a curve

p (t)

is elastic if and only if it is the projection of a solution of the lifted curvature problem on a fixed interval

[0, T]

.

The Hamiltonian system for the curvature problem (Equation (109)) can also be obtained through the Maximum principle properly modified to account for the constraints, as outlined in ([7], Chapter 11). To go into these details would take us away from the central theme of the paper, so instead, we will just quote the relevant equations from [7] (pp. 354–355).

The curvature Hamiltonian H is given by

H = \frac{1}{2} {| | X | |}^{2} + ⟨ Λ, P ⟩

together with the associated Hamiltonian equations

\begin{matrix} \frac{d g}{d t} = g Λ (t), \frac{d P}{d t} = [Λ, Q], \frac{d Q}{d t} = [Λ, P], \\ \frac{d Λ}{d t} = X (t), \frac{d X}{d t} = - P - {(| | X | |}^{2} - ⟨ P, Λ ⟩) Λ, \end{matrix}

subject to the transversality condition

Q (t) + [Λ (t), X (t)] = 0

. The transversality condition can be incorporated into the above equations to yield an equivalent system

\begin{matrix} \frac{d g}{d t} = g Λ (t), \frac{d Λ}{d t} = X (t), \frac{d X}{d t} = - P - {(| | X | |}^{2} - ⟨ P, Λ ⟩) Λ, \\ \frac{d P}{d t} = - [Λ, [Λ, X]], \frac{d Q}{d t} = [Λ, P] . \end{matrix}

(110)

We will now confine our attention to spaces of constant curvature, with a particular interest on the connections between the rolling problems and the elastic curves reported in [52]. For those reasons let us return to the “spheres”

S_{ϵ}^{n} (ρ) = {x \in R^{n + 1} : {(x, x)}_{ϵ} = ρ^{2}, x_{0} > 0 when ϵ = - 1}

and their rollings on the isometry groups groups

S O_{ϵ}

,

ϵ = \pm 1

endowed with the quadratic form

{⟨ A, B ⟩}_{ϵ} = - \frac{1}{2} ϵ ρ^{2} T r (A B) .

The rolling equations associated with the rollings of

S_{ϵ}^{n} (ρ)

on the tangent plane

\hat{M} = T_{ρ e_{0}} S_{ϵ}^{n} (ρ)

are given by

\frac{d g}{d t} = g (t) (u (t) \land_{ϵ} e_{0})), \frac{d p}{d t} (t) = ρ u (t) .

(111)

In what follows we will make use of the following isospectral integrals of motion associated with the preceding rolling problem extracted from the functions

f_{2, λ} = T r (L_{λ}^{2})

and

f_{4, λ} = T r (L_{λ}^{4})

\begin{matrix} I_{0} = 2 H = | | A + L_{p} {| |}^{2}, I_{1} = | | L_{p} {| |}^{2} + ϵ | | L_{k} {| |}^{2} \\ I_{2} = | k | | | L_{k} {| |}^{2} | | L_{p} {| |}^{2} - | | [L_{p}, L_{k}] {| |}^{2} + \frac{ϵ}{2} (k | | L_{k} {| |}^{4} - | | L_{k}^{2} {| |}^{2}), \\ I_{4} = | k | | | L_{k} {| |}^{2} | | A + L_{p} {| |}^{2} - | | [A + L_{p}, L_{k}] {| |}^{2} . \end{matrix}

(112)

These integrals of motion are rescaled variants of the integrals of motion in [52] after the metric is replaced by

{⟨ A, B ⟩}_{ρ} = ρ {⟨ A, B ⟩}_{ϵ}

(the metric in this paper is a scalar multiple of the metric used in [52]).

Recall that on spaces of constant Riemannian curvature the curvature k is defined by

[V, [V, X]] = - k X,

(113)

for any V and X in

p

that satisfy

| | V | | = 1

and

⟨ V, X ⟩ = 0

. In particular k is equal to

\frac{ϵ}{ρ^{2}}

on

S_{ϵ}^{n} (ρ)

. Note that

- \frac{1}{2} T r (A B) = k ⟨ A, B ⟩

.

Proposition 14.

Rolling geodesics that are the projections of the extremal curves on

H = \frac{1}{2}

and

I_{4} = 0

project on the elastic curves in

S_{ϵ}^{n} (ρ)

. Conversely each elastic curve in

S_{ϵ}^{n} (ρ)

is the projection of such an extremal curve.

Proof.

Each elastic curve on

S_{ϵ}^{n} (ρ)

is the projection of an extremal curve corresponding to the curvature problem (Equation (110)). On spaces of constant Riemannian curvature

[Λ, [Λ, X]] = - k X .

(114)

Therefore, Equation (110) can be written as

\begin{matrix} \frac{d g}{d t} = g Λ (t), \frac{d Λ}{d t} = X (t), \frac{d X}{d t} = - P - {(| | X | |}^{2} - ⟨ P, Λ ⟩) Λ, \\ \frac{d P}{d t} = k X, \frac{d Q}{d t} = [Λ, P] . \end{matrix}

(115)

It follows that

k \frac{d Λ}{d t} - \frac{d P}{d t} = 0

, and therefore,

k Λ - P = k A

for some constant element A in

p

. The transversality condition

Q + [Λ, X] = 0

can be recast as

0 = [Λ, Q] + [Λ, [Λ, X]] = [Λ, Q] - k X

. These observations can be incorporated in the preceding equations to get

\begin{matrix} \frac{d g}{d t} = g (t) Λ (t) = g (t) \frac{1}{k} (k A + P), \\ \frac{d P}{d t} = k X = [Λ, Q] = \frac{1}{k} [k A + P, Q], \\ \frac{d Q}{d t} = [Λ, P] = \frac{1}{k} [k A + P, P] = \frac{1}{k} [k A, P] . \end{matrix}

(116)

If we now identify

\frac{1}{k} P

with

L_{p}

, and

\frac{1}{k} Q

with

L_{k}

, then the preceding equations reduce to the rolling Hamiltonian system. Moreover,

Λ = A + \frac{1}{k} P = A + L_{p}, and L_{k} = \frac{1}{k} Q = \frac{1}{k} [A + L_{p}, X] .

Hence

| | A + L_{p} | | = 1

so the first constraint is satisfied. To verify the second constraint note that

L_{k} = \frac{1}{k} [A + L_{p}, X]

, and therefore

| | L_{k} {| |}^{2} = \frac{1}{k^{2}} | | [A + L_{p}, X] {| |}^{2} = \frac{1}{k^{2}} ⟨ [[A + L_{p}, X], A + L_{p}], X ⟩ = \frac{1}{| k |} {| | X | |}^{2},

and

| | [A + L_{p}, L_{k}] {| |}^{2} = \frac{1}{k^{2}} | | [A + L_{p}, [A + L_{p}, X]] |^{2} = {| | X | |}^{2}

. Therefore,

I_{4} = | | L_{k} {| |}^{2} | k | | | A + L_{p} {| |}^{2} - | | [A + L_{p}, L_{k}] {| |}^{2} = {| | X | |}^{2} - {| | X | |}^{2} = 0 .

To prove the converse assume that

g (t), p (t), A, L_{k} (t), L_{p} (t)

is a rolling extremal curve on

I_{4} = 0

. As a geodesic it satisfies

H = \frac{1}{2}

, or

| | A + L_{p} | | = 1

. We need to show that

L_{k} (t) = [A + L_{p} (t), X (t)]

for some

X (t) \in p

such that

⟨ X (t), A + L_{p} (t) ⟩ = 0

.

Let

\begin{matrix} Λ (t) = A + L_{p} (t), p_{Λ (t)}^{⊥} = {X (t) \in p : ⟨ X (t), Λ (t) ⟩ = 0}, \\ k_{Λ (t)} = {Q (t) \in k : [Q (t), Λ (t)] = 0}, k_{Λ (t)}^{⊥} = {Q \in k; : ⟨ Q, k_{Λ} ⟩ = 0} . \end{matrix}

Then

Λ (t) = λ (t) \land_{ϵ} e_{0}, {(λ (t), e_{0})}_{ϵ} = 0

for some vector

λ (t) \in R^{n + 1}

. It then follows that

p_{Λ (t)}^{⊥} = {u (t) \land_{ϵ} e_{0} : {(u (t), e_{0})}_{ϵ} = {(λ (t), u (t))}_{ϵ} = 0}

, and

k_{Λ (t)}^{⊥} = {λ (t) \land_{ϵ} u (t) : {(u (t), λ (t))}_{ϵ} = {(e_{0}, u (t))}_{ϵ} = 0}

.

Hence, dim

(p_{Λ (t)}^{⊥}) =

dim(

k_{Λ}^{⊥})

. The mapping

F (t) X = a d Λ (t) (X), X \in p_{Λ (t)}^{⊥}

satisfies

F (p_{Λ (t)}^{⊥}) \subseteq k_{Λ (t)}^{⊥}

because

⟨ [Λ, X], k_{Λ} ⟩ = 0

. On spaces of non-zero constant curvature, the kernel of this mapping is zero because

a d Λ (t) X = 0

implies that

0 = a d^{2} Λ (t) (X (t)) = - ϵ ρ^{2} X (t)

. Since

p_{Λ (t)}^{⊥}

and

k_{Λ (t)}^{⊥}

have the same dimension, F maps

p_{Λ (t)}^{⊥}

onto

k_{Λ (t)}^{⊥}

. So every curve

L (t) \in k_{Λ (t)}^{⊥}

is of the form

L (t) = [Λ (t), X (t)]

for some

X (t) \in p

perpendicular to

Λ (t)

.

It remains to show that

L_{k} (t)

belongs to

k_{Λ (t)}^{⊥}

when the rolling geodesic is on

I_{4} = 0

, that is, when

| | L_{k} {| |}^{2} = \frac{1}{k} | | [Λ (t), L_{k} (t) {| |}^{2}

. Now assume that

L_{k} (t) = U_{1} (t) + U_{2} (t)

,

U_{1} (t) \in k_{Λ (t)}

and

U_{2} (t) \in k_{Λ (t)}^{⊥}

. It follows from above that

U_{2} (t) = [Λ (t), X (t)]

, and therefore

| | U_{2} {(t) | |}^{2} = {| | [Λ (t), X (t)], [Λ (t), X (t)] | |}^{2} = | ⟨ a d^{2} Λ (t) (X), X (t) ⟩ {| = | k | | | X | |}^{2}

Hence,

\frac{1}{| k |} | | [Λ (t), U_{1} (t) + U_{2} (t)] {| |}^{2} = \frac{1}{| k |} | | [Λ (t), U_{2} (t)] {| |}^{2} = {| k | | | X | |}^{2} = | | L_{k} {| |}^{2} .

But

| | L_{k} {(t) | |}^{2} = | | U_{1} {| |}^{2} + | | U_{2} {(t) | |}^{2} = | | U_{1} {(t) | |}^{2} + {| k | | | X | |}^{2}

, and therefore

U_{1} (t) = 0

. □

The following proposition characterizes elastic curves [7].

Proposition 15.

Let

κ (t)

and

τ (t)

denote the geodesic curvature and the torsion of the projection curve

p (t)

associated with an extremal curve of the curvature problem. Then

ξ (t) = κ^{2} (t)

is the solution of the following equation

{(\frac{d ξ}{d t})}^{2} = - ξ^{3} + 4 (H - ϵ) ξ^{2} + 4 (I_{1} - H^{2}) ξ - 4 I_{2},

(117)

and

{(κ^{2} (t) τ (t))}^{2} = k I_{2}

. All other curvatures in the Serret-Frenet frame along

p (t)

are zero.

I believe that the proof given below is more to the point than similar proofs given elsewhere [7,52].

Proof.

We leave it to the reader to verify that

k | | L_{k} {| |}^{4} - | | L_{k}^{2} {| |}^{2} = 0

when

L_{k} = \frac{1}{k} [A + L_{p}, X]

. Let

P (t) = k L_{p} (t)

and let

Q (t) = k L_{k} (t)

. Then

\begin{matrix} I_{2} k^{2} = k^{2} (| k | | | L_{k} {| |}^{2} | | L_{p} {| |}^{2} - | | [L_{p}, L_{k}] {| |}^{2} {) = | | P | |}^{2} {| | X | |}^{2} | | - | | [L_{p}, Q] {| |}^{2} = \\ {| | P | |}^{2} {| | X | |}^{2} - | | [L_{p}, [A + L_{p}, X]] {| |}^{2} = {| | P | |}^{2} {| | X | |}^{2} - {⟨ A + L_{p}, L_{p} ⟩}^{2} {| | X | |}^{2} - {⟨ P, X ⟩}^{2} . \end{matrix}

Also

I_{1} k^{2} = k^{2} (| | L_{p} {| |}^{2} + ϵ | | L_{k} {| |}^{2} {) = | | P | |}^{2} + ϵ k^{2} {| | Q | |}^{2} = {| | P | |}^{2} + {k | | X | |}^{2} .

Since

κ^{2} (t) = {| | X | |}^{2}

,

\frac{d ξ}{d t} = 2 ⟨ X, \dot{X} ⟩ = 2 ⟨ X, P ⟩

. Therefore,

\begin{matrix} {(\frac{d ξ}{d t})}^{2} = 4 {⟨ P, X ⟩}^{2} = {4 (| | P | |}^{2} {| | X | |}^{2} - {⟨ A + L_{p}, P ⟩}^{2} {| | X | |}^{2}) - 4 k^{2} I_{2} \\ = 4 ((I_{1} k^{2} - {k | | X | |}^{2} {) | | X | |}^{2} - (H - \frac{1}{2} {| | X | |}^{2})^{2} {| | X | |}^{2}) - 4 k^{2} I_{2} \\ = 4 (I_{1} k^{2} - k ξ) ξ - 4 {(H - \frac{1}{2} ξ)}^{2} ξ - 4 k^{2} I_{2} = - ξ^{3} + 4 (H - k) ξ^{2} + (I_{1} k^{2} - H^{2}) ξ - 4 k^{2} I_{2} . \end{matrix}

As to the second part, let

T = A + L_{p} (t)

. Since

| | A + L_{p} (t) | | = 1

,

T (t)

is a unit vector that projects onto the tangent vector

\dot{p} (t)

. Then

\frac{d T}{d t} = [A + L_{p} (t), L_{k} (t)] = [A + L_{p} (t), - [A + L_{p} (t), \frac{1}{k} X (t)]] = X (t) .

Therefore

\frac{d T}{d t} = κ (t) N (t)

where

N (t) = \frac{1}{| | X (t) | |} X (t)

is a unit vector in

p

that projects onto the unit normal

n (t)

along

p (t)

. Continuing,

\begin{matrix} \frac{d N}{d t} = \frac{1}{| | X (t)} | | (- P - {(| | X | |}^{2} - ⟨ Λ, P ⟩ (A + L_{p})) - \frac{1}{{| | X | |}^{2}} ⟨ X, \dot{X} ⟩ X) = \\ - | | X | | (A + L_{p}) + \frac{1}{| | X | |} (- P + ⟨ A + L_{p}, P ⟩ (A + L_{p})) + \frac{1}{{| | X | |}^{2}} ⟨ P, X ⟩ X) = \\ - κ (t) T (t) + Y (t), \end{matrix}

where

Y (t) = \frac{1}{| | X | |} (- P (t) + ⟨ T (t), P (t) ⟩ T (t)) + \frac{1}{{| | X | |}^{2}} ⟨ P (t), X (t) ⟩ X .

Since

Y (t)

is is orthogonal to

A + L_{p}

and X, it is in the direction of the binormal vector

B (t)

. So if we define

τ (t) = | | Y (t) | |

and

B (t) = \frac{1}{| | Y | |} Y

then

\frac{d N}{d t} = κ (t) T (t) + τ B (t)

and

B (t)

projects onto the binormal vector

b (t)

along

p (t)

. Hence,

{| | X | |}^{2} τ^{2} = {| | P | |}^{2} - {⟨ A + L_{p}, P ⟩}^{2} - \frac{1}{{| | X | |}^{2}} {⟨ P, X ⟩}^{2},

or

| {(κ^{2} τ)}^{2} = {| | X | |}^{4} τ^{2} = {| | P | |}^{2} {| | X | |}^{2} - {⟨ A + L_{p}, P ⟩}^{2} - {⟨ P, X ⟩}^{2} = k^{2} I_{2},

Evidently

\frac{d B}{d t}

is in the linear span of

T (t), N (t), B (t)

, hence the Serret-Frenet frame along

p (t)

terminates. □

Corollary 3.

Elastic curves in

M_{ϵ} = S_{ϵ}^{n} (ρ)

are rolled on the elastic curves in the tangent space

\hat{M} = T_{e} M

.

Proof.

Since the geodesic curvature is preserved under rolling, the elastic curves in

S_{ϵ}^{n} (ρ)

are rolled on the elastic curves in

\hat{M}

relative to the Euclidean metric inherited from the metric on

p

. So the statement follows from the rolling definition. □

This remarkable relation between the elastic curves and the rolling geodesics breaks down on spaces of non-constant Riemannian curvature, as it becomes evident when one compares Equation (110) for the curvature problem to the Equation (101) for the rolling problem. It is interesting to note that the solutions of either of these two Equations (101) and (110) are not known beyond the spaces of constant curvature. While the curvature equation seems particularly challenging beyond the spaces of constant curvature, the rolling geodesic equations remain integrable on all semi-simple spaces and should be “solvable” according to the general theory of integrable systems.

Apart from the above remarks, there is another spectacular property of elastic curves that makes them special: elastic curves appear as soliton solutions in the non-linear Schroedinger equation [53]. More generally it was shown in [53] that the space of periodic horizontal curves of fixed length L in the isometry group G over a three dimensional space of constant curvature can be given a structure of an infinite dimensional Poisson manifold relative to which some famous equations of mathematical physics appear as Poisson equations associated with geometric invariants of curves on the base space. In particular, Heisenberg’s magnetic equation and Schroedinger’s non-linear equation appear as Poisson equation associated with

f_{0} (g (s)) = \frac{1}{2} \int_{0}^{L} | | \frac{d Λ}{d s} (s) {| |}^{2} d s

where

Λ (s) = g^{- 1} (s) \frac{d g}{d s} (s)

,

| | Λ (s) | | = 1

. Since this function can be also expressed as

\frac{1}{2} \int_{0}^{L} κ^{2} (s) d s

where

κ (s)

is the geodesic curvature of the projected curve in the underlying symmetric space, elastic curves appear naturally in this setting (see also [54,55,56,57] for related results). This leap to infinite dimensional Hamiltonians and related hierarchies of commuting Hamiltonians further illustrates the relevance of Lie algebraic methods in the theory of integrable systems.

Funding

The research received no external funding.

Data Availability Statement

No additional data.

Conflicts of Interest

There is no conflict of interest.

References

Poisson, S.D. Sur les inégalités séculaires des moyens mouvemens des planétes. J. L’École Polytech. 1809, 8, 15. [Google Scholar]
Poincaré, H. Les Méthodes Nouvelles de la Mécanique Célecte; Tome I; Gauther-Villars: Paris, France, 1892. [Google Scholar]
Jacobi, C.G.J. Vorlersungen Über Dynamic; Druck und Verlag: Berlin, Germany, 1884. [Google Scholar]
Arnold, V.I. Mathematical Methods of Classical Mechanics; Springer: Berlin/Heidelberg, Germany, 1989. [Google Scholar]
Carathéodory, C. Calculus of Variations and Partial Differential Equations of the First Order; Second (revised) English Translation; Chelsea Publishing Co.: White River Junction, VT, USA, 1982. [Google Scholar]
Liouville, J. Note sur l’intégration des équations différentielles de la Dynamique, présenté au Bureau des Longitudes le 29 juin 1853. JMPA 1855, Tome XX, 137–138. [Google Scholar]
Jurdjevic, V. Optimal Control and Geometry: Integrable Systems; Cambridge Studies in Advanced Mathematics 154; Cambridge University Press: Cambridge, UK, 2016. [Google Scholar]
Jurdjevic, V. Integrable Hamiltonian Systems on Lie groups: Kowalewski type. Ann. Math. 1999, 150, 605–644. [Google Scholar] [CrossRef]
Jurdjevic, V. Rolling geodesics, mechanical systems and elastic curves. Mathematics 2022, 10, 4827. [Google Scholar] [CrossRef]
Reyman, A.G.; Semenov Tian-Shansky, M.A. Group Theoretic Methods in the Theory of Finite Dimensional Inrtegrable Systems; Dynamical Systems VII, Chapter 2, Encyclopaedia of Mathematical Sciences; Springer: Berlin/Heidelberg, Germany, 1994; Volume 16. [Google Scholar]
Kirillov, A.A. Elements of the Theory of Representations; Springer: Berlin/Heidelberg, Germany, 1976. [Google Scholar]
Gasparim, E.; Grama, L.; San Martin, L. Adjoint orbits of semi-simple Lie groups and Lagrangian submanifolds. Proc. Edinb. Math. Soc. 2017, 60, 361–385. [Google Scholar] [CrossRef]
Báez, J.; San Martin, L.A.B. Deformations of adjoint orbits for semisimple Lie algebras and Lagrangian submanifolds. Differ. Geom. Its Appl. 2021, 75, 101719. [Google Scholar] [CrossRef]
Jurdjevic, V. Affine-Quadratic problems on Lie Groups: Tops and Integrable Systems. J. Lie Theory 2020, 30, 425–444. [Google Scholar]
Helgason, S. Differential Geometry, Lie Groups and Symmetric Spaces; Academic Press: New York, NY, USA, 1978. [Google Scholar]
Trélat, E. Some properties of the value function and its level sets for affine control systems with quadratic cost. J. Dyn. Control Syst. 2000, 6, 511–541. [Google Scholar] [CrossRef]
Zimmerman, J. The Rolling Sphere Problem. Ph.D. Thesis, University of Toronto, Toronto, ON, Canada, 2002; 94p. [Google Scholar]
Zimmerman, J.A. Optimal control of the sphere Sⁿ rolling on Eⁿ. Math. Control Signals Syst. 2005, 17, 14–37. [Google Scholar] [CrossRef]
Bolsinov, A. A completeness criterion for a family of functions in involution obtained by the shift method. Soviet Math. Dokl. 1989, 38, 161–165. [Google Scholar]
Fomenko, A.T.; Mischenko, A.S. Euler equation on finite-dimensional Lie groups. Math. USSR Izv. 1978, 12, 371–389. [Google Scholar]
Fomenko, A.T.; Trofimov, V.V. Integrability in the sense of Liouville of Hamiltonian systems on Lie algebras. Uspekhi Mat. Nauk 1984, 2, 3–56. (In Russian) [Google Scholar]
Manakov, S.V. Note on the integration of Euler’s equations of the dynamics of an n dimensional rigid body. Funct. Anal. Appl. 1976, 10, 328–329. [Google Scholar] [CrossRef]
Love, A.E. A Treatise on the Mathematical Theory of Elasticity, 4th ed.; Dover: New York, NY, USA, 1927. [Google Scholar]
Jurdjevic, V. Integrable Hamiltonian Systems on Complex Lie Groups. Mem. Am. Math. Soc. 2005, 178, 838. [Google Scholar] [CrossRef]
Bogoyavlenski, O. New Integrable Problem of Classical Mechanics. Commun. Math. Phys. 1984, 94, 255–269. [Google Scholar] [CrossRef]
Kowalewski, S. Sur le problème de la rotation d’un corps solide autor d’un point fixé. Acta Math. 1889, 12, 177–232. [Google Scholar] [CrossRef]
Jurdjevic, V. Kowalewski top and complex Lie algebras. Anal. Math. Phys. 2021, 11, 38. [Google Scholar] [CrossRef]
Komarov, I.V.; Kuznetsov, V.B. Kowalewski top on the Lie algebras o(4), e(3) and o(3, 1). J. Phys. A 1990, 23, 841–846. [Google Scholar] [CrossRef]
Komarov, I.V. Kowalewski top for the hydrogen atom. Theor. Math. Phys. 1981, 47, 67–72. [Google Scholar] [CrossRef]
Dragović, V.; Kukić, K. Systems of Kowalevski Type and Discriminantly Separable Polynomials. Regul. Chaotic Dyn. 2014, 19, 162–184. [Google Scholar]
Sokolov, V.V. A New Integrable Case for the Kirchhoff Equation. Theor. Math. Phys. 2001, 129, 1335–1340. [Google Scholar] [CrossRef]
Ivanescu, C.; Savu, A. The Kowalewski top as a reduction of a Hamiltonian system on Sp(4,ℝ). Proc. Am. Math. Soc. 2003, 131, 607–618. [Google Scholar] [CrossRef]
Haine, L.; Horozov, E. A Lax pair for Kowalewski top. Phys. D 1987, 29, 173–180. [Google Scholar] [CrossRef]
Horozov, E.; van Moerbeke, P. The full geometry of Kowalewski’s top and (1,2)-abelian surfaces. Commun. Pure Appl. Math. 1989, 42, 357–407. [Google Scholar] [CrossRef]
Bobenko, A.I.; Reyman, A.G.; Semenov-Tian Shansky, M.A. The Kowalewski top 99 years later; a Lax pair, generalizations and explicit solutions. Commun. Math. Phys. 1989, 122, 321–354. [Google Scholar] [CrossRef]
Jurdjevic, V. Affine-quadratic problems on Lie groups. Math. Control Rel. Fields 2013, 3, 347–374. [Google Scholar] [CrossRef]
Neumann, C. De problemate quodam mechanico, quod ad primam integralium ultraellipticorum classem revocatum. J. Reine. Angew. Math. 1859, 56, 46–63. [Google Scholar]
Ratiu, T. The C. Newmann problem as a completely integrable system on a coadjoint orbit. Trans. Am. Mat. Soc. 1981, 264, 321–329. [Google Scholar] [CrossRef]
Moser, J. Integrable Hamiltonian Systems and Spectral Theory; Lezioni Fermiane, Academia Nazionale dei Lincei, Scuola Normale Superiore: Pisa, Italy, 1981. [Google Scholar]
Perelomov, A.M. Integrable Systems of Classical Mechanics and Lie Algebras; Birkhauser Verlag: Basel, Switzerland, 1990; Volume 1. [Google Scholar]
Knörrer, H. Geodesics on quadrics and a mechanical problem of C. Newmann. J. Riene Angew. Math. 1982, 334, 69–78. [Google Scholar]
Fock, V.A. The hydrogen atom and non-Euclidean geometry. Izv. Akad. Nauk SSSR Ser. Fizika 1935, 8. [Google Scholar]
Moser, J. Regularization of Kepler’s problem and the averaging method on a manifold. Commun. Pure Appl. Math. 1970, 23, 609–623. [Google Scholar] [CrossRef]
Osipov, Y. The Kepler problem and geodesic flows in spaces of constant curvature. Celest. Mech. 1977, 16, 191–208. [Google Scholar] [CrossRef]
Guillemin, V.; Sternberg, S. Variations on a Theme by Kepler; American Mathematical Society: Providence, RI, USA, 1990; Volume 42. [Google Scholar]
O’Neill, B. Semi-Riemannian Geometry; Academic Press: Cambridge, MA, USA; Elsevier: Amsterdam, The Netherlands, 1983. [Google Scholar]
Jurdjevic, V.; Markina, I.; Silva Leite, F. Symmetric spaces rolling on flat spaces. J. Geom. Anal. 2023, 33, 94. [Google Scholar] [CrossRef]
Ziller, W. Lie Groups. Representation Theory and Symmetric Spaces; University of Pennsylvania: Philadelphia, PA, USA, 2010. [Google Scholar]
Bryant, R.; Hsu, L. Rigidity of integral curves of rank 2 distributions. Invent. Math. 1993, 114, 435–461. [Google Scholar] [CrossRef]
Agrachev, A.; Sachkov, Y. Control Theory from the Geometric Viewpoint; Springer: Berlin/Heidelberg, Germany, 2004. [Google Scholar]
Chitour, Y.; Godoy-Molina, M.; Kokkonen, P. The Rolling Problem: Overview and Challenges. Geometric Control Theory and Sub-Riemannian Geometry; Springer INdAM Ser. 5; Springer: Berlin/Heidelberg, Germany, 2014; pp. 103–122. [Google Scholar]
Jurdjevic, V.; Zimmerman, J. Rolling sphere problems on spaces of constant curvature. Math. Proc. Camb. Phil. Soc. 2008, 144, 729–747. [Google Scholar] [CrossRef]
Jurdjevic, V. The symplectic structure of curves in three dimensional spaces of constant curvature and the equations of mathematical physics. Ann. I. H. Poincaré 2009, 26, 1483–1515. [Google Scholar] [CrossRef]
Langer, J.; Perline, R. Poisson Geometry of the Filament Equation. J. Nonlinear Sci. 1978, 1, 71–93. [Google Scholar] [CrossRef]
Chabat, C.; Zakharov, V. Exact theory of two dimensional self-focusing and one dimensional self-modulation of waves in non-linear media. Sov. Phys. JETP 1972, 34, 62–69. [Google Scholar]
Hasimoto, H. A soliton on a vortex element. J. Fluid Mech. 1972, 51, 477–485. [Google Scholar] [CrossRef]
Millson, J.; Zombro, B.A. A Kähler structure on the moduli spaaces of isometric maps of a circle into Euclidean spaces. Invent. Math. 1995, 123, 35–59. [Google Scholar] [CrossRef]

Table 1. Lie brackets for

s = 0, 1

.

Table 1. Lie brackets for

s = 0, 1

.

[ , ]	$A_{1}$	$A_{2}$	$A_{3}$	$B_{1}$	$B_{2}$	$B_{3}$
$A_{1}$	0	$- A_{3}$	$A_{2}$	0	$- B_{3}$	$B_{2}$
$A_{2}$	$A_{3}$	0	$- A_{1}$	$B_{3}$	0	$- B_{1}$
$A_{3}$	$- A_{2}$	$A_{1}$	0	$- B_{2}$	$B_{1}$	0

$B_{1}$	0	$- B_{3}$	$B_{2}$	0	$- s A_{3}$	$s A_{2}$
$B_{2}$	$B_{3}$	0	$- B_{1}$	$s A_{3}$	0	$- s A_{1}$
$B_{3}$	$- B_{2}$	$B_{1}$	0	$- s A_{2}$	$s A_{1}$	0

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.

Integrable Systems: In the Footprints of the Greats

Abstract

1. Introduction

2. Symplectic Background, Hamiltonian Systems

2.1. Left-Invariant Trivializations and the Symplectic Form

2.2. Poisson Manifolds, Coadjoint Orbits

2.3. Representation of Coadjoint Orbits on Lie Algebras

3. Affine-Quadratic Problems

3.1. Isospectral Representations

3.2. Affine Hamiltonians and Mechanical Tops

3.3. Three-Dimensional Tops- Kirchhoff-Kowalewski Type

3.4. Kowalewski’s Conditions and Isospectral Representations

4. Kepler, Jacobi, Neumann and Moser

Degenerate Case A = 0 and Kepler’s Problem

5. Homogeneous Riemannian Manifolds and Rolling Geodesics

5.1. Rolling Hamiltonians

5.2. Rolling Problem on Spaces of Constant Curvature

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Article Access Statistics

Degenerate Case $A = 0$ and Kepler’s Problem