Abstract
In his 1842 lectures on dynamics C.G. Jacobi summarized difficulties with differential equations by saying that the main problem in the integration of differential equations appears in the choice of right variables. Since there is no general rule for finding the right choice, it is better to introduce special variables first, and then investigate the problems that naturally lend themselves to these variables. This paper follows Jacobi’s prophetic observations by introducing certain “meta” variational problems on semi-simple reductive groups G having a compact subgroup K. We then use the Maximum Principle of optimal control to generate the Hamiltonians whose solutions project onto the extremal curves of these problems. We show that there is a particular sub-class of these Hamiltonians that admit a spectral representation on the Lie algebra of G. As a consequence, the spectral invariants associated with the spectral curve produce a large number of integrals of motion, all in involution with each other, that often meet the Liouville complete integrability criteria. We then show that the classical integrals of motion associated, with the Kowalewski top, the two-body problem of Kepler, and Jacobi’s geodesic problem on the ellipsoid can be all derived from the aforementioned Hamiltonian systems. We also introduce a rolling geodesic problem that admits a spectral representation on symmetric Riemannian spaces and we then show the relevance of the corresponding integrals on the nature of the curves whose elastic energy is minimal.
Keywords:
symplectic; manifolds; Lie-Poisson bracket; Lie algebras; co-adjoint orbits; extremal curves; integrable systems MSC:
53C17; 53C22; 53B21; 53C25; 30C80; 26D05; 49J15; 58E40
1. Introduction
The theory of integrable systems begins with W.R. Hamilton who in 1835 pronounced that the equations of motion of an n body system conform to the principle of least action, and consequently can be represented as
under the transformation
where is the total energy, with T the kinetic and V the potential energy of the system. He then observed that H is conserved along the solutions of the system. Hamilton’s discovery gave rise to a new class of differential equations of the form
associated with any function H of variables and . Such equations became known as the canonical equations. Then the transformations that preserved the canonical form of these equations were also called canonical, and the functions whose values were conserved by canonical systems became known as integrals.
Hamilton’s discovery had an immediate impact on the scientific community of the nineteenth century. Canonical equations became the central object of study in the mathematics of that period with the contributions of J. Liouville, S.D. Poisson, C.G. Jacobi and H. Poincaré leading the way towards a new branch in mathematics known today as the theory of integrable systems. This theory was principally driven by a lasting interest in the existence of extra integrals of motion and the symmetries that are accountable for the existence of these integrals. Its defining moment may be attributed to S.D. Poisson who in 1809 [1] introduced his bracket (known since as the Poisson bracket)
for functions f and g in the canonical variables .
The introduction of the Poisson bracket greatly facilitated the emerging theory of that period. It provided an alternative definition of canonical systems as differential systems that satisfy
and it also redefined integrals of motion associated with H as functions F that satisfy . It was Jacobi, however, who noticed the fundamental property of the Poisson bracket
that has been known ever since as the Jacobi’s identity. It is then an easy consequence of Jacobi’s identity that is a third integral of motion for H for any two integrals and (known as Poisson’s theorem [2]). Alternatively integrals of motion were detected through a suitable change of canonical coordinates. Jacobi characterized such changes of coordinates through a generating function . According to Jacobi is canonical if and only if Poincaré characterized canonical change of coordinates in terms of differential forms: is canonical if and only if for some function S.
From contemporary perspectives the theory of integrable systems begins with C.G. Jacobi and his seminal book Lectures in Dynamics [3]. Jacobi demostrated that the canonical Equation (3) can be integrated with the aid of a partial differential equation
in terms of an unknown function S. He showed that if a particular solution of (7) can be found in terms of n arbitrary constants of motion then for some function , and the transformation
transforms the canonical coordinates into new canonical coordinates relative to which the canonical Equation (3) are transformed into the equations
whose solutions are given by
Equation is known as Jacobi’s equation. Poincaré referred to the above result as the first theorem of Jacobi in his treatise of celestial mechanics [2]. Jacobi’s solution of the above partial differential equation in terms of the elliptic coordinates stands out as the most original and, perhaps, the most enigmatic contribution to the theory of canonical systems. Jacobi’s use of elliptic coordinates suggested the existence of a special class of variational problems whose solutions can be described by Abelian integrals in some privileged system of coordinates, exemplified by the geodesic problem on the ellipsoid. In the absence of any apparent symmetries on the ellipsoid that account for the integrability of the geodesic problem, this result of Jacobi seemed particularly mysterious.
In Jacoby summary, the main problem in the integration of differential equations appears in the choice of right variables. Given no general rule for finding the right choice, it is better to introduce special variables first, and then investigate the problems that naturally lend themselves to these variables [3]. Jacobi, however, does not comment on another exceptional aspect of his discovery, namely the mysterious presence of partial differential equations for the problems of variational calculus, an issue that remained open for a long time.
Almost a hundred years later, C. Carathéodory in the introduction to his famous book on the calculus of variations [5] remarks that “ neither Jacobi, nor his students, nor the many other prominent men who so brilliantly represented and advanced this discipline during the nineteenth century, thought in any way of the relationship between the calculus of variations and partial differential equation”. H. Poincaré also sidestepped this issue by treating canonical systems as the solutions of a dynamical system
where denote the constants . Since the above differential equation can be reformulated as
which shows that Equations (3) and (10) have the same solutions. Poincaré equation used Equation (10) to show that a transformation is canonical if and only if the differential form satisfies for some function .
Among many other stellar advancements of that epoch, the following result of J. Liouville, reported in 1855 [6], seemed particularly influential for the present mathematics [4]. Liouville considered a differential system
associated with a function . He then assumed the existence of n integrals of motion such that the system of equations
can be solved for in the variables . He also imposed the condition that are in involution, that is,
Liouville interpreted as the exactness condition for the differential form and concluded that there is a function such that
that is, . But then S can be used as the generating function for the canonical transformation where . Liouville refers to
as a complete system. Indeed, in the new coordinates remain constants of motion and therefore . Since , F is independent of , that is, F is a function of t and h. But then is a given function of time, and is given by its integral. When F is a function of x and y, and not explicitly dependent on time, then is only a function of h. Therefore, the general solution is given by
This heritage from 19-th century mathematics forms a core of knowledge indispensable for problems of mathematical physics, symplectic geometry, calculus of variations and optimal control theory, and its unanswered questions still motivate much of the current research in integrable systems.
This paper will address the “hidden” symmetries that account for the existence of extra integrals of motion. We will show that the canonical integrable systems, such as Jacobi’s geodesic problem on the ellipsoid, Neumann’s mechanical problem on the sphere, Euler’s top, and the associated heavy tops, all derive their constants of motion from certain “meta” systems on Lie groups that admit isospectral representations of the form
on the Lie algebra of G.
We will confine our attention to semi-simple Lie groups G having a compact subgroup K, for then the Lie algebra admits a decomposition , where the Lie algebra of K and is the orthogonal complement of relative to the Killing form . But then and therefore as a vector space also carries the semi-direct Lie algebra associated with the semi-direct product . We will then single out a class of left-invariant variational problems on G that admit an isospectral representation with
where in the semi-direct case and in the semi-simple case, , , and where A is a fixed element in . It is then known that the spectral invariants are in involution relative to the canonical Poisson bracket on , respectively on . We will show that these invariants shed light on the hidden symmetries that surround many of the aforementioned integrable systems. In the process we will be able to demonstrate that the quest for the geometric origins behind the “mysterious” integrals of motions also leads to new and unexpected encounters with problems of Riemannian and sub-Riemannian geometry in which geometric control theory plays a major role.
2. Symplectic Background, Hamiltonian Systems
The theoretic framework upon which above claims are made is rooted in symplectic geometry. Below is a brief summary of the theoretical ingredients required for our main results.
Recall that a manifold M together with a non-degenerate and closed 2-form is called symplectic. The symplectic form yields a correspondence between functions and vector fields: to every function f there is a vector field defined by for all vector fields X on M. Then is called the Hamiltonian vector field generated by f. Every symplectic manifold is even dimensional, and at each point of M there is a neighbourhood with coordinates on which Hamiltonian vector fields are given by
This choice of coordinates in which is given by (17) is called symplectic, or canonical in the terminology of the 19-th century.
Every cotangent bundle is a symplectic manifold with its canonical symplectic form, in terms of the symplectic coordinates . As a symplectic manifold the cotangent bundle is special, in the sense that it is also a vector bundle. Hence every vector field X on M can be lifted to a unique Hamiltonian vector field in via the function , . Vector field is called the Hamiltonian lift of X. The same procedure is applicable to any time varying vector field, and by extension to any differential system on M. Thus any differential system in M can be lifted to a Hamiltonian system in . This fact is also important for problems of optimal control where the Maximum Principle singles out the appropriate Hamiltonian lifts that govern the optimal solutions [7].
When the base manifold is a Lie group G, and when the underlying differential system is either left or right invariant, then there is a special system of coordinates based on the representation of as , with the dual of . This coordinate system preserves the left invariant symmetries and elucidates the conserved quantities of the associated Hamiltonian systems. The passage to these coordinates and the associated formalism was amply documented in my earlier publications [7,8,9]. Below we will highlight the main points in this theory required for our results.
2.1. Left-Invariant Trivializations and the Symplectic Form
Having in mind applications that involve left-invariant variational systems the cotangent bundle and the tangent bundle will be viewed as the products and via the left-translations. More explicitly, tangent vectors will be identified with pairs via the relation , where denotes the tangent map associated with the left translation . Similarly, points will be identified with pairs via . Then , the tangent bundle of the cotangent bundle , will be identified with , with the understanding that an element denotes the tangent vector at the base point .
We will make use of the fact that is a Lie group in its own right since , as a vector space, is an abelian Lie group. Then left-invariant vector fields V in will be denoted by , in . In this setting the canonical symplectic form on is given by
for any left-invariant vector fields and [7]. The above form is invariant under the left-translations in , and is especially revealing for the Hamiltonian vector fields generated by left-invariant functions on .
A function H on is left-invariant if for all and all . That is, left-invariant functions coincide with functions of . Each left-invariant vector field on G lifts to a linear function on because
Functions H on generate Hamiltonian vector fields on whose integral curves are the solutions of
In a more general case, where H depends on both and , the integral curves of are the solutions of
that can be easily shown through the relations
This situation occurs in problems of mechanics in the presence of potential functions. For example, the movements of a three-dimensional rigid body with a potential function are described by the Hamiltonian
on the cotangent bundle of , where denote the columns of the matrix transpose of the rotation R in . For then the directional derivative of V in the direction is given by
where denotes the standard inner product in . Thus and the equations of motion for H are given by
These equations extend to an “n-dimensional rigid body” with the Hamiltonian where
In this context, is the generalization of the angular momentum, is the generalization of the angular velocity, is the generalized inertia tensor, and is the external torque.
2.2. Poisson Manifolds, Coadjoint Orbits
Equation (19) lend themselves to an insightful description in terms of the Poisson structure on inherited from the symplectic form . Recall that a manifold M together with a bilinear, skew-symmetric form
that satisfies
for all functions on M, is called a Poisson manifold.
Every symplectic manifold is also a Poisson manifold with the Poisson bracket given by . However, the converse may not be true due to the fact that the Poisson bracket may be degenerate at some points of M. Nevertheless, each function f on M induces a Poisson vector field through the formula as in the symplectic case. Poisson vector fields clarify the relation with symplectic manifolds through the following fundamental fact: every Poisson manifold is foliated by the orbits of its family of Poisson vector fields and each orbit is a symplectic submanifold of M with its symplectic form [7].
The dual of a Lie algebra is a Poisson manifold with the Poisson bracket
for any functions f and h on . In the literature on integrable systems the bracket is known as the Lie-Poisson bracket [10]. We have taken its negative to be compatible with the projections of left-invariant Hamiltonian vector fields on (and also to agree with the sign conventions in [7]).
It follows that each function H on defines a Poisson vector field on via the formula in which case the integral curves of are the solutions of
Thus, as we already mentioned above, each function H on may be simultaneously viewed as a Hamiltonian on , and a function on the Poisson space . Of course, Poisson equations coincide with the projections of the Hamiltonian equations on .
Solutions of Equation (24) are intimately linked with the coadjoint orbits of G through the following proposition. due to of A.A. Kirillov [11] (the proof is also given in [7]).
Proposition 1.
Let denote the family of Poisson vector fields on and let denote the orbit of through a point . Then M is equal to the connected component of the coadjoint orbit of G that contains . Consequently each coadjoint orbit is a symplectic submanifold of .
Recall that the coadjoint orbit of G through a point is given by
The fact that the Poisson equations can be naturally restricted to coadjoint orbits implies useful reductions in the theory of Hamiltonian systems.
2.3. Representation of Coadjoint Orbits on Lie Algebras
On semi-simple Lie groups Poisson Equation (24) can be expressed on as
because the Killing form, or any scalar multiple of it is non-degenerate, and invariant, in the sense that, , and can be used to identify with via the formula
Then coadjoint orbits are identified with the adjoint orbits and the Poisson vector fields are identified with vector fields . Each vector field is tangent to an orbit at L, and , in is the symplectic form on each orbit .
In a reductive semi-simple Lie group G there is also the semi-direct product described earlier which generates its own coadjoint orbits on the dual of the Lie algebra of . Recall that the Lie algebra of consists of pairs together with the Lie bracket
When the elements are identified with the sums the sums in , as a vector space, carries a double Lie algebra; the semi-direct product Lie algebras , and the semi-simple Lie algebra . We then have
for any in and any in , with in the semi-direct case, and in the semi-simple case.
Since both and Lie algebras over the same vector space, the Poisson equations on can be also represented on via the quadratic form , but the resulting expression takes a slightly different form. To see the difference, let and denote the decompositions of and L onto the factors and . On the semi-direct product Poisson equations reduce to
This equation can be combined with the equations for the semi-simple case in terms of the parameter s as
One can show that
is the coadjoint orbit through under the action of when is identified with in , and when is identified with [7].
The adjoint orbits of a non-compact semi-simple Lie groups G can be realized as the cotangent bundles of flag manifolds [12], and the same has been shown recently for the coadjoint orbits under the action of the semi-direct products [13,14]. We will make use of that fact later on in the paper.
3. Affine-Quadratic Problems
As stated earlier, we will restrict our attention to semi-simple Lie groups G and compact subgroups K with zero centre. We refer to as a reductive pair. Then and will denote their Lie algebras, and will denote the orthogonal complement of in relative to the Killing in . Recall that is non-degenerate and satisfies
Hence is well defined and satisfies ( in fact, because is semi-simple). We will also assume that . Note that the Killing form is negative-definite on because K has zero centre [15], hence any negative scalar multiple of it is positive definite on . We shall assume that such a scalar product is fixed.
An affine quadratic problem is defined through a positive definite quadratic form Q on , and a regular element A in the Cartan space . An element A in is called regular if is an abelian subalgebra in . The corresponding affine-quadratic problem consists of finding the solutions in G of the affine control system
generated by a square-integrable control in that transfers a given state in G to a given terminal state in time T with a minimal energy . Any positive definite quadratic form Q is of the form for some self-adjoint and positive linear operator on . Then there exists an orthonormal basis in such that is diagonal relative to it. That is, if then for some constants . Then (29) can be rewritten as as
where are the left-invariant vector fields with and , with the energy associated with each solution. The most natural case occurs when , that is, when . We will refer to this case as the canonical affine-quadratic problem.
When A is regular, then (29) is controllable, a consequence of our assumption , that is, any terminal state can be reached in some finite time from any initial state . But then there is an optimal solution on the interval for which the energy of transfer is minimal (see [7] for the proof). Therefore the above optimal control problem is well-posed.
To each affine-quadratic problem there is an analogous “shadow problem” defined on the semi-direct product defined by the same data as in the original problem. It follows that every affine space that defines an affine left-invariant system on G also defines a corresponding left-invariant affine system on the semi-direct product . Thus behind every affine quadratic optimal problem on G there is a corresponding affine-quadratic “shadow” problem on the semi-direct product . The shadow problem is also well defined in the sense that optimal solutions exist on some interval for each pair of boundary points and .
According to Pontriyagin’s Maximum Principle every optimal trajectory generated by a bounded and measurable control is the projection of an extremal curve, and each extremal curve is an integral curve of a suitable Hamiltonian system on the cotangent bundle of the ambient space. The Maximum Principle is also valid for optimal problems with controls over affine systems with quadratic costs ([16]).
Let now be an optimal trajectory generated by a control . According to the Maximum Principle, is the projection of an extremal curve in along which the cost extended Hamiltonian
is maximal at relative to all competing controls . In this notation, each is the Hamiltonian lift of , i.e., . In the abnormal case, which we will not treat here, , and the Maximum principle results in the constraints In the normal case, , the maximality condition implies that the optimal controls are of the form . Consequently, optimal solutions are the projections of solution curves of a single Hamiltonian vector field generated by the Hamiltonian
Recall that each lift is a linear function on given by with . Thus H is left-invariant, hence its Hamiltonian equations are given by
The associated Poisson equations can be now written in as
after the identification of with via the scalar product , and the decomposition (Equation (27)). In the canonical case () the preceding equations reduce to
Note that is an integral for (32). This integral is a universal integral of motion in the sense that it remains constant for any left-invariant Hamiltonian on .
3.1. Isospectral Representations
We now single out a remarkable class of affine-quadratic Hamiltonians that plays a prominent role in the theory of integrable systems. It consists of Hamiltonians that admit a spectral representation of the form
for some element that comutes with A, where and are the solutions of the Poisson Equation (32). Such a class is called isospectral and is called the associated spectral curve. This terminology has origins in J. Zimmerman’s PhD thesis in 2002, in which he showed that the rolling sphere problem is isospectral [17]. We will return to Zimmerman’s problem and relate its results to the canonical affine-quadratic problem [18].
For Hamiltonian systems that admit an isospectral representation, the discrete spectral invariants of L are replaced by the functional invariants . Remarkably, the functional invariants are in involution with each other, both with respect to the semi-simple and the semi-direct product Lie bracket, and in some instances generate a sufficient number of integrals of motion to ensure complete integrability ([7], 9.2). For instance, the family of functions
is completely integrable on each coadjoint orbit in [19]. This means that H is completely integrable on each coadjoint orbit in whenever H is in involution with the Hamiltonian lifts . This implies that the canonical affine Hamiltonian is completely integrable on coadjoint orbits since each left-invariant vector field with values in the isotropy group of A is a symmetry for the canonical system. It is reasonable to expect that the analogous family of functions is also completely integrable on coadjoint orbits of G, but, to the best of my knowledge, the proofs have not yet appeared in the literature.
The focus on the affine-quadratic problem and the associated Hamiltonians allows for the following characterization of isospectral Hamiltonians (proved in [7]).
Theorem 1.
This theorem shows that the fundamental results A.T. Fomenko, A. S. Mischenko, and V.V. Trofimov on integrable left-invariant Riemannian metrics on compact Lie groups [20,21] based on Manakov’s seminal work on the n-dimensional Euler’s top [22] are subordinate to the isospectral properties of the affine Hamiltonian system, in the sense that the spectral invariants of on are always in involution with a larger family of functions generated by the spectral invariants of on associated with an affine Hamiltonian H.
3.2. Affine Hamiltonians and Mechanical Tops
Let us now draw comparisons between the semi-direct Poisson equations
and the “top-like” equations:
associated with the Hamiltonian . We will consider two cases- tops with linear potentials and tops with quadratic potentials.
Linear potentials. Equation (38) will be referred to heavy top-like equations when the potential energy V is generated by a linear Newtonian field, that is, when , where a is a vector in , and are constants. When , the external torque is equal to zero, and Equation (38) reduces to the Hamiltonian equation associated with a left-invariant Riemannian metric induced by the operator (called the n-dimensional Euler’s top in some Russian literature [20]).
Heavy top-like equations can be written more compactly as
where , and . Since , is a solution of . Hence each solution resides on the sphere .
Our theorems below relate Equation (39) to the Poisson Equation (37) on the reductive Lie algebras and associated with reductive pairs where is when and when , and .
We will tackle both cases simultaneously but first we will need to introduce additional notation and terminology. We will use to denote the Lie algebra of endowed with the trace form . Relative to we define its invariant bilinear form in the ambient space .
Then , will denote the matrix defined by
and denotes the matrix . Since
belongs to for any in . We then have
Theorem 2.
For a proof see [14]. The preceding theorem clarifies the presence of heavy tops in the Hamiltonian equations on Lie algebras [10]. It also clarifies the relation between the tops and elastic rods initiated by G. Kirchhoff known as the “kinetic analogues” [23,24]. It also proves that the classification of completely integrable elastic rods in [7,8] carries over to the heavy tops.
Quadratic potentials. We will now show that the tops with quadratic potential V are also present in the equations of affine Hamiltonians, but this time on the tangent bundle of , or more precisely on the tangent bundle of the semi-direct product where denotes the space of symmetric matrices with zero trace. For that purpose let
with , , and S a symmetric . In accordance with (38) the Hamiltonian equations of are given by
where .
Theorem 3.
Top-like Equation (41) are isomorphic with the Poisson equations generated by the affine Hamiltonian on the coadjoint orbit through and under the action of the semi-direct product .
Proof.
Let now . Note first that
Then,
Therefore and satisfy the same differential equation. Hence whenever . If we now rename as we get the Poisson equations for the shadow Hamiltonian □
The preceding theorem links isospectral Hamiltonians to the equations of the top under quadratic potentials and paves a way to the n-dimensional generalization of O. Bogoyavlensky’s famous result on integrability of three-dimensional mechanical tops in the presence of a quadratic potential [25]. The path to isospectral Hamiltonians is provided by Manakov’s observation that the inertia tensor for a rigid body is confined to the transformations , for some positive definite matrix S. For then, . Indeed, in this situation , and
Hence the corresponding affine Hamiltonian is isospectral on (Theorem 1). Since the equations of the Hamiltonian
corresponding to the top with quadratic potential can be identified with the Poisson equations of on the coadjoint orbit through , the isospectral invariants of
are integrals of motion for the top. (Theorem 3). Since for each non-singular symmetric matrix S, the spectral invariants of
form a completely integrable family of functions on each coadjoint orbit in (semi-simple and semi-direct). the top with a quadratic potential is completely integrable in all dimensions.
3.3. Three-Dimensional Tops- Kirchhoff-Kowalewski Type
We will now turn our attention to the class of affine-quadratic systems of Kirchhoff-Kowalewski type on complex Lie algebras with a particular interest on the symmetries that account for the existence of Kowalewski’s integral reported in her seminal paper on the motions of a rigid body around a fixed point under the influence of gravity [26]. We will follow our recent paper [27] and show that there is a natural Hamiltonian on that answers the fundamental questions raised by Kowalewki’s paper, namely, what is the geometric rational behind her approach in which all the variables were treated as complex quantities, and secondly. what are the symmetries that account for the existence of not only her integral of motion, but also of similar integrals, known as Kowalewski type integrals, that subsequently appeared in the literature on integrable systems [8,28,29,30,31].
Theorem 2 suggests that the search for the answers to the above questions should begin with the Poisson equations associated with an affine-quadratic Hamiltonian on since both and are real forms for (see also [24]). We will show that Kowalewski’s “mysterious” change of variables appear naturally in the passage from to an important intermediate step towards the right Hamiltonian on . The journey from to to this remarkable Hamiltonian begins with
where is the coordinate representation of a point L in relative to an orthonormal basis that conforms to the following Lie bracket Table 1:
Table 1.
Lie brackets for .
Then
are the Poisson equations generated by H, where denote the drift element in , , , and . The same equations can be also expressed as
When the above equations formally coincide with the equations of the top: (Equation (21)).
On there are two Casimirs:
Hence generic coadjoint orbits in are four-dimensional. Since each coadjoint orbit is symplectic, integrable cases occur whenever there is an extra integral of motion functionally independent of H, , and . Since the motion of the top is subordinate to the Poisson system of H on , the search for integrable tops reduces to the search for an additional integral of motion functionally independent from , and H.
Let us now come to the conditions of Kowalewski
and her “mysterious” variables
After the substitutions, Equation (46) become
from which it can be easily extracted that
is an integral of motion. Following the terminology in [7] we will refer refer to this integral as the Kirchhoff-Kowalewski integral. It is only in the special case and that this integral coincides with the integral of motion found by Kowalewski. The real versions of the Kirchhoff-Kowalewski integral were originally discovered by V. Kuznetsov and I.V. Komarov in their studies of the hydrogen atom [28,29].
Let us reveal the geometric rational behind Kowalewski’s change of variables. The explanations are most naturally articulated through the root system in . Recall that any maximal commutative sub-algebra of a Lie algebra is called a Cartan subalgebra. All Cartan subalgebras in a semi-simple Lie algebra are conjugate, and hence all have the same dimension. The dimension of any Cartan algebra is the rank of . The rank of is two. Evidently each pair , in Table 1 generates a Cartan algebra in . Since these algebras are conjugate, there is no preferential choice. However, in regard to the equations of the top, there is a preferential choice when two moments of inertia are equal. In the case that the natural choice is the Cartan algebra generated by the pair .
An element in the dual of a Cartan algebra is called a root if for some , for all . An easy calculation shows that there are four roots , given by
The corresponding root spaces are one dimensional, and are generated by
Together with and these matrices form a basis for . A simple calculation shows that
Furthermore, , , and , for all i and j.
The Lie algebras and spanned by , and satisfy . and each is isomorphic to under the identification
An easy calculation shows that the coordinates of an arbitrary point relative to the basis are transformed to the coordinates relative to the basis according to the following formulas:
Let us now be given by
where denotes the Kronecker product of matrices A and B.
To see that recall first that consists of matrices M that satisfy , where J is the matrix that defines the symplectic form on , i.e., . It is easy to check that both and satisfy with . Since , our claim follows.
We will identify with pure complex quaternions via the correspondence
Then the standard basis in is identified with the matrices
and any element in is represented by the quaternion , , , .
Let now , , so that and for each i, , and let . Matrices form an orthonormal basis in relative to the inner product on .
It is easy to verify that is a Poisson map. Therefore
for any function H on , where denotes the dual map of . After the identification of with via the trace form, the Poisson equations of associated with H in (44) become
or, in simpler form,
where ,
, and
Now we see Kowalewski variables as the natural coordinates in this Poisson representation. Under Kowalewski’s conditions , , Equation (55) reduce to Equation (49). The passage from H to reveals the geometric rational behind the ad-hoc change of variables in (48) and serves as a natural segue to our ultimate Hamiltonian on .
3.4. Kowalewski’s Conditions and Isospectral Representations
We now address the origins of the “enigmatic” conditions (47) through an extended affine-quadratic Hamiltonian
on defined by complex numbers , , and and an extended basis , and , where and
The reader can easily verify that has the following decomposition
where is the Lie algebra spanned by and where and are respectively the linear spans of and . These spaces conform to the following Lie algebraic relations:
After is identified with via the scalar product the above Hamiltonian can be written as
where and , , (note that is negative on which accounts for the negative signs in the expression for ).
Since is semi-simple the Poisson equations for are given by
These equations can be written in a more succinct form as
in terms of the following notations:
, as in the previous section, and with .
We now come to the crux of the matter, the existence of integrals of motion for the above system. The intermediate question is the existence of an integral I of the form for some constants and .
Proposition 2.
is an integral of motion for in exactly two cases: when and , then , and in the second case, when , then . (for the proof see [27]).
This first condition singles out the top of Lagrange, while the second condition is a precursor to Kowalewski’s top as will be demonstrated below. Note that the second condition and can be also written as , and where , , and . Then , and since it is orthogonal to , it will be denoted by .
We will say that (59) satisfies the preliminary condition of Kowalewski whenever and . It follows that the preliminary condition of Kowalewski is synonymous with the integral of motion . (This integral of motion was also discovered earlier by A.M Savu in [32]).
We will now assume that the preliminary condition holds and we will pursue conditions on the ratio , where , that guarantee extra integrals of motion for system (59). Note that in this situation . Systems that satisfy the preliminary condition of Kowalewski and also satisfy will be said to satisfy the Kowalewsky conditions. The following proposition provides an important characterization of Kowalewski’s conditions for both and .
Proposition 3.
Indeed, under Kowalewski’s conditions
Therefore,
Since , .
It follows from Theorem 1 that Kowalewski’s condition is necessary and sufficient for the existence of isospectral representation
on the invariant manifold . Consequently, are integrals of motion for (59), in involution with each other for each , or . Remarkably, the prototype of Kowalewski’s integrals of motion is found among the above spectral invariants. (see also [33,34] for other spectral representations).
We will show the existence of Kowalewki’s integral of motion directly from the equations
obtained from (59) under the change of variables , Equation (60) may be seen as a semisimple extension of the Kowalewski-type gyrostat in two constant fields introduced in [35].
Equations (60) may be also expressed in terms of the coordinates as
One readily obtains the following fundamental equalities
Let now
Then
and
Hence, is an integral of motion for (61) since
An interested reader may want to show that the following are also integrals of motion
The preceding calculation also draws attention to the following general fact:
Proposition 4.
is a constant of motion for any differential system in the variables that satisfy
independently of the equations that govern the evolution of and .
To come back to the top of Kowalewski, note that is an invariant subsystem for (60). On this set, , and c reduces to
and remains an integral of motion for the reduced system
with its fundamental relations (63)
This reduced system coincides the Kirchhoff-Kowalewski system on (Equation (49), , after u is replaced by w). Then
coincides with the integral of motion discovered by Kowalewski. The remaining isospectral integrals of motion and coincide with the Casimirs on .
To recover the semi-simple form of the Kirchhoff-Kowalewski integral, let . In terms of and the preceding system becomes
This system satisfies the same equations as the Kirchhoff-Kowalewski system except for . Indeed,
The remaining equations given by
are the same as (66), and consequently yield
as an integral of motion for the preceding system, as well as for the Kirchhoff-Kowalewski system (Equation (49)) when is replaced by .
The papers of V. Dragovi and K. Kuki [30] and V. V. Sokolov [31] produce differential systems which admit Kowalewski type integrals different from the ones in this paper and yet follow the same integration procedure used by S. Kowalewski in her original paper. Remarkably, all these systems satisfy the fundamental relations (63) from which the existence of their extra integrals of motion could be easily ascertained.
4. Kepler, Jacobi, Neumann and Moser
Let us now return to and its Lie algebra endowed with the trace form . As a vector space V, the set of matrices with zero trace admits several kinds of Lie algebras and each of these Lie algebras induces its own Poisson structure on V. The most common Lie algebra is itself. Then induces the orthogonal decomposition where denotes the vector space of symmetric matrices in V. But then V also carries the semi-direct product structure .
However, is also a closed subgroup of G and hence the pair induces its own Cartan decomposition , where is the orthogonal complement to . In fact K is the set of points in G fixed by the automorphism where D denotes diagonal matrix with its first p diagonal entries equal to 1 and the remaining q diagonal entries equal to The set of points such that satisfies , that is, . It follows that its tangent map induces the above decomposition with
Consequently, matrices in are symmetric relative to the scalar product , in .
We will now return to the canonical Hamiltonians
and their Poisson Equation (33) restricted to the coadjoint orbits through rank one matrices in . We will consider two cases: the coadjoint orbit through a symmetric rank-one matrix of unit length under the action of , and the second case, the coadjoint orbit through rank-one matrix of unit length, symmetric relative to the Lorentzian inner product in under the action of . The above matrices can be naturally expressed in terms of the notations introduced earlier, the scalar product in the ambient space , and matrices and . For then
If then let . It follows that is the Euclidean sphere of radius when and a hyperboloid of two sheets when . We have chosen to be the sheet defined by .
Proposition 5.
The coadjoint orbit through is symplectomorphic to the cotangent bundle of the real projective space in the semi-simple case, and it is symplectomorphic to the cotangent bundle of in the semi-direct case.
For the proof see [36]. Here it is implicitly understood that the cotangent bundles are identified with the tangent bundles via the ambient inner product . Then each tangent vector is identified with in and in .
On the orbit through , , and the associated Poisson equations are of the form
A simple calculation show that
On the unit sphere, Equation (71) after A is replaced by coincide with the equations for the mechanical problem of C. Neumann for a particle on the sphere moving under a quadratic potential [37]. The preceding equations for could be analogously interpreted as the equations on the hyperboloid for a particle moving under quadratic potential [7].
The canonical affine-quadratic problem illuminates deep and beautiful connections between Kepler’s gravitational problem, Jacobi’s geodesic problem on the ellipsoid, and Neumann’s mechanical problems.
Let us first examine the isospectral integrals associated with the spectral curve on the coadjoint orbit through rank-one matrices. The zero trace requirement is inessential for the calculations below and will be disregarded. Additionally, A will be replaced by and will be rescaled by dividing by to read
The spectrum of is then given by
Matrix is of the form.
where We then have the following proposition
Lemma 1.
For the proof see [7] (p. 200).
Corollary 1.
Function is an integral of motion for H.
Function F is a rational function with poles at the eigenvalues of the matrix Hence, is an integral of motion for H if and only if the residues of F are constants of motion for H.
In the Euclidean case the eigenvalues of are real and distinct since is symmetric and regular. Hence there is no loss in generality in assuming that A is diagonal. Let denote its diagonal entries Then
where denote the residues of It follows that
Hence,
The preceding calculation yields the following proposition.
Proposition 6.
Each residue is an integral of motion for Newmann’s spherical system, and functions are in involution.
These results coincide with the ones reported in [38,39,40], but the connection with the affine-quadratic problem shows that similar integrals of motion exist for the hyperbolic Neumann problem [7] (p. 191).
In the literature on integrable systems the integrals of motion for Neumann’s problem are related to the integrals of motion for Jacobi’s problem on the ellipsoid through the transformation of H. Knörrer that transforms the Neumann’s equations on energy level onto the equations of Jacobi on the ellipsoid [39,41]. Our exposition takes another route: we will instead show that an “elliptic” problem on the sphere is completely integrable with its integrals of motion as in Neumann’s problem, and then we will show that the Hamiltonian equations for the elliptic problem on the sphere and Jacobi’s problem on the ellipsoid are symplectomorphic. We will then use this symplectomorphism to show the existence of Jacobi’s integrals of motion on the ellipsoid.
Let now denote an affine-quadratic Hamiltonian on defined by a diagonal matrix D with positive diagonal entries. As before, we will dispense with zero-trace requirements since they are inessential. The above Hamiltonian is generated by a positive definite operator and the drift . We will call this Hamiltonian elliptic for reasons that will be made clear later on. Let . Then
Therefore is isospectral (Proposition 1) and its Hamiltonian equations admit a representation
Since is a spectral curve for the canonical affine Hamiltonian we have the following corollary.
Corollary 2.
The spectral invariants of are common integrals of motion for both the canonical Hamiltonian and the elliptic Hamiltonian .
As before, on coadjoint orbit through , the Poisson equations of ( the semi-direct version) are given by
We then have
which shows that the Hamiltonian is given by
The correspondence defines a symplectomorphism between the cotangent bundle of with its canonical Poisson bracket and the coadjoint orbit through (Proposition 5).
Proposition 7.
These equations can be reparametrized by a parameter to read
We will presently show that Equation (73) are Hamiltonian equations that correspond to the geodesic problem on the sphere relative to the elliptic metric .
As an intermediate step we will now derive the Hamiltonian equations associated with the geodesic problem on the quadric surface induced by the scalar product . We will follow the procedure based on the version of the Maximum Principle for variational problems with constraints outlined in [7] (p. 218) and identify the quadric surface with the submanifold . Then its cotangent bundle will be defined in terms of the constraints
The Hamiltonian lift of a curve that belongs to is given by
for the multipliers and that satisfy . If , then
It follows that
Hence,
According to the Maximum Principle an extremal control must optimize
on over all controls that satisfy . Hence extremal controls are the critical points of for some multiplier , that is, they are solutions of . It follows that the extremal controls are of the form , But then implies that For this choice of controls
where . An easy calculation shows that
Then the extremal curves are the solutions of the following differential equation:
which emanate from , that is, satisfy
We will now single out the cases relevant for our earlier claims.
The geodesic problem on the ellipsoid. In this classic case initiated by C. Jacobi and . Hence
Then Equation (74) reduce to
The preceding equation agree with the equations in J. Moser [39].
The elliptic problem on the sphere. Here the ambient metric is defined by a positive-definite matrix D and . In such a case Equation (74) are given by
Furthermore,
The Hamiltonian equations are then given by
which agrees with Equation (73) when .
Proposition 8.
The Hamiltonian systems that correspond to the elliptic problem on the sphere and the geodesic problem on the ellipsoid are symplectomorphic.
Proof.
Let denote the coordinates on the tangent bundle of the sphere and let denote the coordinates on the tangent bundle of the ellipsoid . In these coordinates the systems in question are given by
where and .
Let denote the mapping from the cotangent bundle of the sphere to the cotangent bundle of E defined by
Let denote the Liouville-Poincaré canonical form on . Then
because . Since takes the Liouville form on to the Liouville form on , it also takes the canonical symplectic form on to the canonical symplectic for on and hence is a symplectomorphism.
It now follows from (78) that and that . Then,
and thus takes the Hamiltonian flow on the sphere onto the Hamiltonian flow on the ellipsoid. □
Proposition 9.
Jacobi’s problem on the ellipsoid is completely integrable. Functions
are constants of motion, all in involution with each other, for the Hamiltonian system on the cotangent bundle of the ellipsoid.
Proof.
We have shown that
are an involutive family of integrals of motion for the elliptic-geodesic problem on the sphere. We have also shown that the above integrals of motion are the residues of the function
We will now show that functions (79) are the residues of the pull-back of F under the symplectomorphism . First note that F remains unchanged if the variable y is replaced by with an arbitrary number. Since , we may replace y by and x by . Also note that (use Equation (75)). Then,
It follows that
is constant along the solutions of Jacobi’s equations.. A calculation identical to the one used for Neumann’s system shows that
are the residues of F, and hence are integrals of motion for Jacobi’s equations. □
Degenerate Case and Kepler’s Problem
Let us now return to the Hamiltonian equations generated by the canonical affine Hamiltonian on the coadjoint orbit through for some with
and their equivalent formulation on the tangent bundle of :
When the Hamiltonian H reduces to and the corresponding equations reduce to
Upon differentiating we get
We will now assume that so that is the hyperboloid . On energy level , the solutions of (83) are given by where and are constant vectors (complex when ) that satisfy .
For the above curves trace great circles on the sphere and for the solutions trace great hyperbolas on the hyperboloid (an immediate consequence of the fact that ). That is, solutions are the geodesics on spaces of constant non-zero curvature. The zero curvature case may be obtained by considering as a continuous parameter and then letting it tend to zero (as will be explained below).
We will now show that there exists a canonical change of coordinates in which p is the stereographic projection through the point given by
such that in the new coordinates the preceding geodesic differential system is transformed into the n-dimensional Kepler’s system, an n-dimensional generalization of the Hamiltonian equations that describe the motion of a planet around an immovable planet in the presence of the gravitational force.
Equation (84) yields , where . Therefore the inverse map is given by
Assume that the cotangent bundle of is identified with its tangent bundle via the Euclidean inner product , and let denote the points of . We will next find such that for all with and . For then the transformation is a symplectomorphism since it pulls back the Liouville form on onto the Liouville form in (The symplectic form is the exterior derivative of the Liouville-Poincaré form). It follows that
Therefore,
After the appropriate differentiations in (85) we get
Hence,
Since y is orthogonal to x, . Therefore,
After the substitutions into the preceding equation we get
To pass to the problem of Kepler, write the Hamiltonian in the variables . An easy calculation in (86) yields . Therefore,
The corresponding flow is given by
On energy level and the preceding equations reduce to
After the reparametrization Equation (23) become
On , and
So in the spherical case and in the hyperbolic case.
The Euclidean case can be obtained by a limiting argument in which is regarded as a continuous parameter which tends to zero.
To explain in more detail, let
The transformation with is the inversion about the circle in the affine hyperplane , and is the corresponding transformation of the Euclidean metric . The Hamiltonian associated with this metric is equal to This Hamiltonian can be also obtained as the limit of when . On energy level and therefore, . Of course, the solutions of (12) tend to the Euclidean geodesics as tends to zero. Consequently, is a solution of and hence, is a geodesic corresponding to the standard Euclidean metric.
Let us also note that the angular momentum and the Laplace-Runge-Lenz vector for Kepler’s problem have simple geometric interpretation on the coadjoint orbits according to the following proposition.
Proposition 10.
Let and On energy level ,
For a proof see [7].
This remarkable discovery that the solutions of Kepler’s problem are intimately related to the geometry of spaces of constant curvature goes back to A.V. Fock’s paper of 1935 [42] in which he reported that the symmetry group for the motions of the hydrogen atom is for negative energy, for zero energy and for positive energy. It is then not altogether surprising that similar results apply to the problem of Kepler since the energy function for Kepler’s problem is formally the same as the energy function for the hydrogen atom.
This connection between the problem of Kepler and the geodesics on the sphere was reported by J. Moser in 1970 [43], while Y. Osipov [44] reported similar results later for geodesics on spaces of negative constant curvature. In spite of their brilliance, these papers did not attempt any explanations in regard to this enigmatic connection between planetary motions and geodesics on space forms. This issue later inspired V. Guillemin and S. Sternberg to take up the problem of Kepler in a larger geometric context, with Moser’s observation as the background, in a paper titled Variations on a theme by Kepler [45]. The introduction of Kepler’s problem through the canonical affine-quadratic problem exemplifies, once again, this fascinating and recurrent interplay between mathematical physics, geometry and integrable systems.
5. Homogeneous Riemannian Manifolds and Rolling Geodesics
Our overview of integrable systems raises a natural question: what is the geometric origin behind the affine-quadratic problem that accounts for its ubiquitous presence in the theory of integrable systems? A partial answer to this question comes, somewhat unexpectedly, from a new class of variational problems, called rolling problems. We will take up this issue next. Since the underlying variational problems require new concepts and terminology, we will be obliged to make a slight detour into an earlier paper [9] in order to introduce the necessary ingredients.
The general setting is defined by a reductive pair with G semi-simple and K compact. We assume that the Lie algebra decomposition , with the orthogonal complement of relative to the Killing form on satisfies the strong Cartan conditions
We will also assume the Killing form is of definite sign on in which case will denote a scalar multiple of the Killing form that is positive on . We recall that the Killing form is invariant under any linear automorphism of and hence the quadratic form is invariant [15].
We consider G a semi-Riemannian manifold relative to the left-invariant metric induced by (the Killing form is not necessarily positive on , hence the metric is in general of indefinite sign, i.e., it is semi-Riemannian [46]). The left-invariant distributions and are called horizontal and vertical respectively. Then curves that are tangent to , i.e., satisfy are called horizontal. Likewise curves that are tangent to are called vertical. It follows that
We will assume that consisting of the left coset is endowed with a manifold structure so that the natural projection is a smooth surjection [46]. A curve in G is called a lift of a curve if . A lift is called horizontal when is a horizontal curve. Every curve in M is the projection of a horizontal curve . If a curve is a solution of for some curve then The correspondence given by is an isomorphism and induces a metric on M
Let now denote the group of diffeomorphisms defined by the left action
We then have
Proposition 11.
For a proof see [47].
It follows that each is an isometry. Since G acts transitively on M, M can be represented by the orbit where and e is the group identity in G. It follows that for any . Note that is the flow generated by a right-invariant vector field . Therefore the flow of is -related to the flow in M. We will let denote the infinitesimal generator of the flow .
It follows that each is a Killing vector field on M. A vector field whose flow acts on M by isometries is called a Killing vector field (see [46] for additional details). The correspondence is one to one and onto . Since the Lie brackets of vector fields related by a mapping F are also F-related, the Lie brackets are -related to . Therefore the correspondence is a Lie algebra homomorphism, and hence is a finite dimensional Lie algebra of Killing vector fields that satisfies for each .
Note that . So if then and therefore . It then follows that is an isometric isomorphism from onto . More generally if is any horizontal curve then implies that
and
Therefore is an isometry that maps onto .
A homogeneous manifold with a G-invariant metric defined by a reductive pair (G,K) with G semi-simple and K compact, will be referred to as semi-simple (it is defined by a semi-simple Lie group G, a compact subgroup K, and the metric induced by the Killing form). It can be shown that any symmetric Riemannian space with no Euclidean factors can be reduced to a semi-simple manifold (so that holds). Conversely, every semi-simple manifold is locally symmetric. It is symmetric when G is simply connected (see [48], Proposition 6.27). We will not pursue further proximities with symmetric spaces since the present exposition makes no use of geodesic symmetries.
We now come to the main topic of this section, rolling of semi-simple manifolds on their tangent spaces. We begin by recalling the basic definition.
Definition 1.
A curve on a Riemannian manifold M rolls on a curve on another Riemannian manifold if there exists an isometry that satisfies:
and also satisfies the condition that is a parallel vector field in along for each parallel vector field along in M.
This intrinsic definition of rolling was introduced in [49], and later used in [50,51]. In this context the triple is called a rolling curve. It is clear that rolling is reflexive in the sense that if is rolled on by an isometry then is rolled on by the isometry , and therefore is also a rolling curve.
We will now examine rollings of semi-simple manifolds on their tangent planes. It comes as a pleasant surprise that such rollings are essentially described by Equation (91) reinterpreted in terms of rolling. So the passage to rolling becomes largely a question of semantics, as demonstrated in the text below.
We will consider rollings of M on with its metric defined by (89). The rollings on other tangent spaces are conjugate to the rollings on [47]. Let be an arbitrary curve in M and let be a curve in that is rolled on. It follows that for some horizontal curve . If is a solution of then according to (91)
If we now let be any solution in of then is an isometry that rolls on since the parallel transport condition is satisfied (for proofs see [47]). Of course, then rolls on .
It follows that each horizontal curve in G defines a family of curves in , each a solution of , with induced by , that roll on . The converse is also true: every solution of the differential system
defines a curve in M on which in is rolled by the isometry .
We will regard (93) as the fundamental object in rolling defined on , a Lie group with its group operation
Then will denote the Lie algebra of with the Lie bracket .
Let now . We will view as a left-invariant distribution on defined by the left-translates of vector space in . The distribution is called the rolling distribution and its integral curves are called rolling motions. Any rolling motion is a solution of
and can be associated with the rolling curve , where and .
Since and satisfy strong Cartan conditions and , satisfies and . Therefore,
Hence the Lie algebra generated by the left-invariant vector fields tangent to is equal to , therefore any two points in can be connected by a rolling motion, and each rolling motion inherits a natural length from G. It is then known that any pair of points in can be connected by an integral curve of of minimal length because vector fields in are complete [50]. The above shows that with the above metric is a sub-Riemannian manifold. We will refer to the associated sub-Riemannian geodesics as the rolling geodesics.
We will now turn to the Maximum principle to find the necessary conditions that the rolling geodesics must satisfy. To put the matter in the control theoretic context, let be an orthonormal basis in so that becomes an orthonormal basis in . Then an absolutely continuous curve is a rolling motion if and only if
for some bounded and measurable control functions , in which case the length of is given by . The rolling problem is an optimal control problem and consists of finding the solutions on a fixed time interval that satisfy the given boundary conditions and along which the energy of transfer is minimal. It is known that each rolling geodesic is locally optimal and hence is a solution to the above control problem [7,50].
5.1. Rolling Hamiltonians
To emphasize the invariant properties of the problem we will rewrite (96) as
where each a left-invariant vector field , . If is an optimal trajectory then, according to the Maximum Principle, is the projection of an extremal curve in along which the cost extended Hamiltonian
is maximal relative to all other control functions. Here is the Hamiltonian lift of , i.e., .
There are two kinds of extremal curves depending whether (abnormal case) or (normal case). In the abnormal case the Maximum principle results in the constraints
and beyond that gives no further information about the optimal control in question. In the normal case, however, the above maximum yields , where is a solution curve of a single Hamiltonian vector field corresponding to the Hamiltonian
Each optimal solution is either the projection of an abnormal or a normal extremal curve. If is the projection of a normal extremal curve then is an integral curve of and the control that generates is of the form .
We will not concern ourselves with the abnormal extremals. It is very likely that every optimal trajectory is the projection of a normal extremal curve, as in [52], in which case the abnormal extremals could be ignored. Instead we will turn to the normal Hamiltonian H and its Hamiltonian equations
Let us first consider the solutions of the associated Poisson equation
and the structure of the coadjoint orbits.
Since is a Euclidean vector space, its tangent space at the origin can be identified with . Then the Lie algebra will be identified with , and its dual with , where
It then follows that every can be written as with and . Since is an abelian algebra the projection on is constant on each coadjoint orbit of . The argument is straightforward:
for any . It follows that the coadjoint orbits in are of the form
This fact can be also verified directly from Equation (100): we have
where and . Therefore,
from which follows that
Since is arbitrary .
To uncover other constants of motion identify with via the natural quadratic forms on each of the factors, and then recast the preceding equations on . More precisely, identify each in with a tangent vector via the formula . Similarly, identify with via the formula . Then decompose into the sum , and . Relative to the basis in , where . It follows that
and
Since X and are arbitrary,
Coupled with
Equation (101) constitute the Hamiltonian equations on generated by the Hamiltonian .
Each extremal curve projects onto a geodesic , and each geodesic further projects onto the pair of curves in M and in that are rolled upon each other by . Note that in this identification of the Lie algebras with their duals, coadjoint orbits are identified with the affine sets .
Recall now the Hamiltonian equations associated with the canonical affine-quadratic problem (Equation (33)):
The propositions below reveal a remarkable fact that the Poisson equations of a canonical affine-quadratic Hamiltonian are subordinate to the Poisson equations associated with a rolling Hamiltonian. This connection identifies the drift term in the affine-quadratic system with a coadjoint invariant of the rolling Poisson system. To keep the systems apart we will use bold letters when referring to the variables in the rolling Hamiltonian in contrast to the variables in the affine-quadratic Hamiltonian which will remain the same.
Proposition 12.
Let be an integral curve of the rolling Hamiltonian , that is,
Then
is an integral curve of the affine Hamiltonian , where , and is the solution of with .
Moreover, if a solution of then in is the projection of an extremal curve
associated with the shadow Hamiltonian .
The converse also holds according to the following proposition.
Proposition 13.
Suppose that is an extremal curve of the affine Hamiltonian . Let
where is a solution of and let be a solution of Then together with
is an extremal curve of the rolling Hamiltonian .
However, if , is an extremal curve of the shadow Hamiltonian H, then , solutions of
together with
define an extremal curve of the rolling Hamiltonian .
The proofs follow by straightforward calculations (also done in [9]).
Let us now come back to isospectral representations and Zimmerman’s method [17,52]. For that purpose let . Then Poisson’s equations for the rolling problem can be written as
These equations are invariant under a dilational change of variables . It then follows that
satisfies the equation
Therefore is the spectral curve for . But then the Poisson system associated with the affine-quadratic Hamiltonian also admits an isospectral representation after the substitutions
For then are the extremal curves for the Poisson system associated with the affine-quadratic system (Proposition 13) and further satisfy
But then
implies
To be consistent with my earlier publications, replace by to get
where , and . Equation (107) agrees with the isospectral representation (34).
To get the spectral curve for the shadow Hamiltonian, use relations , and from Proposition 13. Then
Then a calculation analogous to the one above gives . After the rescaling we get a modified Lax pair
5.2. Rolling Problem on Spaces of Constant Curvature
We will now introduce another optimal problem intertwined with the rolling problem. It consists of finding a continuously differentiable curve in M in an interval , with its tangent vector of unit length and its covariant derivative bounded and measurable in that satisfies fixed tangential directions and along which the integral minimal among all other curves that satisfy the same boundary conditions. Here , where denotes the covariant derivative along . The integral is known as the elastic energy of the curve [24]. Curves defined on some interval are called elastic if for each there exits an interval over which the elastic energy of is minimal relative to the boundary conditions and [7].
On semi-simple manifolds the curvature problem can be lifted to the unit tangent bundle of G, and it is this lifted version of the problem that will be of interest for this paper. In this formulation of the problem the tangent bundle of G is realized as the product with identified with . Then each tangent vector is the projection of a manifold in where and . The lifted curvature problem consists of finding a curve in , , a solution of
that originates in the manifold at and terminates at the manifold at for which the energy of transfer is minimal. If is the projected curve, then is the solution of
that satisfies and the boundary conditions
It is a simple exercise to show that a curve is elastic if and only if it is the projection of a solution of the lifted curvature problem on a fixed interval .
The Hamiltonian system for the curvature problem (Equation (109)) can also be obtained through the Maximum principle properly modified to account for the constraints, as outlined in ([7], Chapter 11). To go into these details would take us away from the central theme of the paper, so instead, we will just quote the relevant equations from [7] (pp. 354–355).
The curvature Hamiltonian H is given by together with the associated Hamiltonian equations
subject to the transversality condition . The transversality condition can be incorporated into the above equations to yield an equivalent system
We will now confine our attention to spaces of constant curvature, with a particular interest on the connections between the rolling problems and the elastic curves reported in [52]. For those reasons let us return to the “spheres” and their rollings on the isometry groups groups , endowed with the quadratic form The rolling equations associated with the rollings of on the tangent plane are given by
In what follows we will make use of the following isospectral integrals of motion associated with the preceding rolling problem extracted from the functions and
These integrals of motion are rescaled variants of the integrals of motion in [52] after the metric is replaced by (the metric in this paper is a scalar multiple of the metric used in [52]).
Recall that on spaces of constant Riemannian curvature the curvature k is defined by
for any V and X in that satisfy and . In particular k is equal to on . Note that .
Proposition 14.
Rolling geodesics that are the projections of the extremal curves on and project on the elastic curves in . Conversely each elastic curve in is the projection of such an extremal curve.
Proof.
Each elastic curve on is the projection of an extremal curve corresponding to the curvature problem (Equation (110)). On spaces of constant Riemannian curvature
Therefore, Equation (110) can be written as
It follows that , and therefore, for some constant element A in . The transversality condition can be recast as . These observations can be incorporated in the preceding equations to get
If we now identify with , and with , then the preceding equations reduce to the rolling Hamiltonian system. Moreover,
Hence so the first constraint is satisfied. To verify the second constraint note that , and therefore
and . Therefore,
To prove the converse assume that is a rolling extremal curve on . As a geodesic it satisfies , or . We need to show that for some such that .
Let
Then for some vector . It then follows that , and .
Hence, dim dim(. The mapping satisfies because . On spaces of non-zero constant curvature, the kernel of this mapping is zero because implies that . Since and have the same dimension, F maps onto . So every curve is of the form for some perpendicular to .
It remains to show that belongs to when the rolling geodesic is on , that is, when . Now assume that , and . It follows from above that , and therefore
Hence,
But , and therefore . □
The following proposition characterizes elastic curves [7].
Proposition 15.
Let and denote the geodesic curvature and the torsion of the projection curve associated with an extremal curve of the curvature problem. Then is the solution of the following equation
and . All other curvatures in the Serret-Frenet frame along are zero.
I believe that the proof given below is more to the point than similar proofs given elsewhere [7,52].
Proof.
We leave it to the reader to verify that when . Let and let . Then
Also
Since , . Therefore,
As to the second part, let . Since , is a unit vector that projects onto the tangent vector . Then
Therefore where is a unit vector in that projects onto the unit normal along . Continuing,
where
Since is is orthogonal to and X, it is in the direction of the binormal vector . So if we define and then and projects onto the binormal vector along . Hence,
or
Evidently is in the linear span of , hence the Serret-Frenet frame along terminates. □
Corollary 3.
Elastic curves in are rolled on the elastic curves in the tangent space .
Proof.
Since the geodesic curvature is preserved under rolling, the elastic curves in are rolled on the elastic curves in relative to the Euclidean metric inherited from the metric on . So the statement follows from the rolling definition. □
This remarkable relation between the elastic curves and the rolling geodesics breaks down on spaces of non-constant Riemannian curvature, as it becomes evident when one compares Equation (110) for the curvature problem to the Equation (101) for the rolling problem. It is interesting to note that the solutions of either of these two Equations (101) and (110) are not known beyond the spaces of constant curvature. While the curvature equation seems particularly challenging beyond the spaces of constant curvature, the rolling geodesic equations remain integrable on all semi-simple spaces and should be “solvable” according to the general theory of integrable systems.
Apart from the above remarks, there is another spectacular property of elastic curves that makes them special: elastic curves appear as soliton solutions in the non-linear Schroedinger equation [53]. More generally it was shown in [53] that the space of periodic horizontal curves of fixed length L in the isometry group G over a three dimensional space of constant curvature can be given a structure of an infinite dimensional Poisson manifold relative to which some famous equations of mathematical physics appear as Poisson equations associated with geometric invariants of curves on the base space. In particular, Heisenberg’s magnetic equation and Schroedinger’s non-linear equation appear as Poisson equation associated with where , . Since this function can be also expressed as where is the geodesic curvature of the projected curve in the underlying symmetric space, elastic curves appear naturally in this setting (see also [54,55,56,57] for related results). This leap to infinite dimensional Hamiltonians and related hierarchies of commuting Hamiltonians further illustrates the relevance of Lie algebraic methods in the theory of integrable systems.
Funding
The research received no external funding.
Data Availability Statement
No additional data.
Conflicts of Interest
There is no conflict of interest.
References
- Poisson, S.D. Sur les inégalités séculaires des moyens mouvemens des planétes. J. L’École Polytech. 1809, 8, 15. [Google Scholar]
- Poincaré, H. Les Méthodes Nouvelles de la Mécanique Célecte; Tome I; Gauther-Villars: Paris, France, 1892. [Google Scholar]
- Jacobi, C.G.J. Vorlersungen Über Dynamic; Druck und Verlag: Berlin, Germany, 1884. [Google Scholar]
- Arnold, V.I. Mathematical Methods of Classical Mechanics; Springer: Berlin/Heidelberg, Germany, 1989. [Google Scholar]
- Carathéodory, C. Calculus of Variations and Partial Differential Equations of the First Order; Second (revised) English Translation; Chelsea Publishing Co.: White River Junction, VT, USA, 1982. [Google Scholar]
- Liouville, J. Note sur l’intégration des équations différentielles de la Dynamique, présenté au Bureau des Longitudes le 29 juin 1853. JMPA 1855, Tome XX, 137–138. [Google Scholar]
- Jurdjevic, V. Optimal Control and Geometry: Integrable Systems; Cambridge Studies in Advanced Mathematics 154; Cambridge University Press: Cambridge, UK, 2016. [Google Scholar]
- Jurdjevic, V. Integrable Hamiltonian Systems on Lie groups: Kowalewski type. Ann. Math. 1999, 150, 605–644. [Google Scholar] [CrossRef]
- Jurdjevic, V. Rolling geodesics, mechanical systems and elastic curves. Mathematics 2022, 10, 4827. [Google Scholar] [CrossRef]
- Reyman, A.G.; Semenov Tian-Shansky, M.A. Group Theoretic Methods in the Theory of Finite Dimensional Inrtegrable Systems; Dynamical Systems VII, Chapter 2, Encyclopaedia of Mathematical Sciences; Springer: Berlin/Heidelberg, Germany, 1994; Volume 16. [Google Scholar]
- Kirillov, A.A. Elements of the Theory of Representations; Springer: Berlin/Heidelberg, Germany, 1976. [Google Scholar]
- Gasparim, E.; Grama, L.; San Martin, L. Adjoint orbits of semi-simple Lie groups and Lagrangian submanifolds. Proc. Edinb. Math. Soc. 2017, 60, 361–385. [Google Scholar] [CrossRef]
- Báez, J.; San Martin, L.A.B. Deformations of adjoint orbits for semisimple Lie algebras and Lagrangian submanifolds. Differ. Geom. Its Appl. 2021, 75, 101719. [Google Scholar] [CrossRef]
- Jurdjevic, V. Affine-Quadratic problems on Lie Groups: Tops and Integrable Systems. J. Lie Theory 2020, 30, 425–444. [Google Scholar]
- Helgason, S. Differential Geometry, Lie Groups and Symmetric Spaces; Academic Press: New York, NY, USA, 1978. [Google Scholar]
- Trélat, E. Some properties of the value function and its level sets for affine control systems with quadratic cost. J. Dyn. Control Syst. 2000, 6, 511–541. [Google Scholar] [CrossRef]
- Zimmerman, J. The Rolling Sphere Problem. Ph.D. Thesis, University of Toronto, Toronto, ON, Canada, 2002; 94p. [Google Scholar]
- Zimmerman, J.A. Optimal control of the sphere Sn rolling on En. Math. Control Signals Syst. 2005, 17, 14–37. [Google Scholar] [CrossRef]
- Bolsinov, A. A completeness criterion for a family of functions in involution obtained by the shift method. Soviet Math. Dokl. 1989, 38, 161–165. [Google Scholar]
- Fomenko, A.T.; Mischenko, A.S. Euler equation on finite-dimensional Lie groups. Math. USSR Izv. 1978, 12, 371–389. [Google Scholar]
- Fomenko, A.T.; Trofimov, V.V. Integrability in the sense of Liouville of Hamiltonian systems on Lie algebras. Uspekhi Mat. Nauk 1984, 2, 3–56. (In Russian) [Google Scholar]
- Manakov, S.V. Note on the integration of Euler’s equations of the dynamics of an n dimensional rigid body. Funct. Anal. Appl. 1976, 10, 328–329. [Google Scholar] [CrossRef]
- Love, A.E. A Treatise on the Mathematical Theory of Elasticity, 4th ed.; Dover: New York, NY, USA, 1927. [Google Scholar]
- Jurdjevic, V. Integrable Hamiltonian Systems on Complex Lie Groups. Mem. Am. Math. Soc. 2005, 178, 838. [Google Scholar] [CrossRef]
- Bogoyavlenski, O. New Integrable Problem of Classical Mechanics. Commun. Math. Phys. 1984, 94, 255–269. [Google Scholar] [CrossRef]
- Kowalewski, S. Sur le problème de la rotation d’un corps solide autor d’un point fixé. Acta Math. 1889, 12, 177–232. [Google Scholar] [CrossRef]
- Jurdjevic, V. Kowalewski top and complex Lie algebras. Anal. Math. Phys. 2021, 11, 38. [Google Scholar] [CrossRef]
- Komarov, I.V.; Kuznetsov, V.B. Kowalewski top on the Lie algebras o(4), e(3) and o(3, 1). J. Phys. A 1990, 23, 841–846. [Google Scholar] [CrossRef]
- Komarov, I.V. Kowalewski top for the hydrogen atom. Theor. Math. Phys. 1981, 47, 67–72. [Google Scholar] [CrossRef]
- Dragović, V.; Kukić, K. Systems of Kowalevski Type and Discriminantly Separable Polynomials. Regul. Chaotic Dyn. 2014, 19, 162–184. [Google Scholar]
- Sokolov, V.V. A New Integrable Case for the Kirchhoff Equation. Theor. Math. Phys. 2001, 129, 1335–1340. [Google Scholar] [CrossRef]
- Ivanescu, C.; Savu, A. The Kowalewski top as a reduction of a Hamiltonian system on Sp(4,ℝ). Proc. Am. Math. Soc. 2003, 131, 607–618. [Google Scholar] [CrossRef]
- Haine, L.; Horozov, E. A Lax pair for Kowalewski top. Phys. D 1987, 29, 173–180. [Google Scholar] [CrossRef]
- Horozov, E.; van Moerbeke, P. The full geometry of Kowalewski’s top and (1,2)-abelian surfaces. Commun. Pure Appl. Math. 1989, 42, 357–407. [Google Scholar] [CrossRef]
- Bobenko, A.I.; Reyman, A.G.; Semenov-Tian Shansky, M.A. The Kowalewski top 99 years later; a Lax pair, generalizations and explicit solutions. Commun. Math. Phys. 1989, 122, 321–354. [Google Scholar] [CrossRef]
- Jurdjevic, V. Affine-quadratic problems on Lie groups. Math. Control Rel. Fields 2013, 3, 347–374. [Google Scholar] [CrossRef]
- Neumann, C. De problemate quodam mechanico, quod ad primam integralium ultraellipticorum classem revocatum. J. Reine. Angew. Math. 1859, 56, 46–63. [Google Scholar]
- Ratiu, T. The C. Newmann problem as a completely integrable system on a coadjoint orbit. Trans. Am. Mat. Soc. 1981, 264, 321–329. [Google Scholar] [CrossRef]
- Moser, J. Integrable Hamiltonian Systems and Spectral Theory; Lezioni Fermiane, Academia Nazionale dei Lincei, Scuola Normale Superiore: Pisa, Italy, 1981. [Google Scholar]
- Perelomov, A.M. Integrable Systems of Classical Mechanics and Lie Algebras; Birkhauser Verlag: Basel, Switzerland, 1990; Volume 1. [Google Scholar]
- Knörrer, H. Geodesics on quadrics and a mechanical problem of C. Newmann. J. Riene Angew. Math. 1982, 334, 69–78. [Google Scholar]
- Fock, V.A. The hydrogen atom and non-Euclidean geometry. Izv. Akad. Nauk SSSR Ser. Fizika 1935, 8. [Google Scholar]
- Moser, J. Regularization of Kepler’s problem and the averaging method on a manifold. Commun. Pure Appl. Math. 1970, 23, 609–623. [Google Scholar] [CrossRef]
- Osipov, Y. The Kepler problem and geodesic flows in spaces of constant curvature. Celest. Mech. 1977, 16, 191–208. [Google Scholar] [CrossRef]
- Guillemin, V.; Sternberg, S. Variations on a Theme by Kepler; American Mathematical Society: Providence, RI, USA, 1990; Volume 42. [Google Scholar]
- O’Neill, B. Semi-Riemannian Geometry; Academic Press: Cambridge, MA, USA; Elsevier: Amsterdam, The Netherlands, 1983. [Google Scholar]
- Jurdjevic, V.; Markina, I.; Silva Leite, F. Symmetric spaces rolling on flat spaces. J. Geom. Anal. 2023, 33, 94. [Google Scholar] [CrossRef]
- Ziller, W. Lie Groups. Representation Theory and Symmetric Spaces; University of Pennsylvania: Philadelphia, PA, USA, 2010. [Google Scholar]
- Bryant, R.; Hsu, L. Rigidity of integral curves of rank 2 distributions. Invent. Math. 1993, 114, 435–461. [Google Scholar] [CrossRef]
- Agrachev, A.; Sachkov, Y. Control Theory from the Geometric Viewpoint; Springer: Berlin/Heidelberg, Germany, 2004. [Google Scholar]
- Chitour, Y.; Godoy-Molina, M.; Kokkonen, P. The Rolling Problem: Overview and Challenges. Geometric Control Theory and Sub-Riemannian Geometry; Springer INdAM Ser. 5; Springer: Berlin/Heidelberg, Germany, 2014; pp. 103–122. [Google Scholar]
- Jurdjevic, V.; Zimmerman, J. Rolling sphere problems on spaces of constant curvature. Math. Proc. Camb. Phil. Soc. 2008, 144, 729–747. [Google Scholar] [CrossRef]
- Jurdjevic, V. The symplectic structure of curves in three dimensional spaces of constant curvature and the equations of mathematical physics. Ann. I. H. Poincaré 2009, 26, 1483–1515. [Google Scholar] [CrossRef]
- Langer, J.; Perline, R. Poisson Geometry of the Filament Equation. J. Nonlinear Sci. 1978, 1, 71–93. [Google Scholar] [CrossRef]
- Chabat, C.; Zakharov, V. Exact theory of two dimensional self-focusing and one dimensional self-modulation of waves in non-linear media. Sov. Phys. JETP 1972, 34, 62–69. [Google Scholar]
- Hasimoto, H. A soliton on a vortex element. J. Fluid Mech. 1972, 51, 477–485. [Google Scholar] [CrossRef]
- Millson, J.; Zombro, B.A. A Kähler structure on the moduli spaaces of isometric maps of a circle into Euclidean spaces. Invent. Math. 1995, 123, 35–59. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).