Reactive Balance Control for Legged Robots under Visco-Elastic Contacts

Flayols, Thomas; Del Prete, Andrea; Khadiv, Majid; Mansard, Nicolas; Righetti, Ludovic

doi:10.3390/app11010353

Open AccessArticle

Reactive Balance Control for Legged Robots under Visco-Elastic Contacts

by

Thomas Flayols

^1,*

,

Andrea Del Prete

²

,

Majid Khadiv

³,

Nicolas Mansard

¹ and

Ludovic Righetti

^3,4

¹

Centre National de la Recherche Scientifique (CNRS), Laboratoire d’Analyse et d’Architecture des Systemes (LAAS), 7 Avenue du Colonel Roche, Univ de Toulouse, LAAS, F-31400 Toulouse, France

²

Industrial Engineering Department, University of Trento, 30123 Trento, Italy

³

Max Planck Institute for Intelligent Systems, 72076 Tuebingen, Germany

⁴

Tandon School of Engineering, New York University, New York, NY 10012, USA

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2021, 11(1), 353; https://doi.org/10.3390/app11010353

Submission received: 3 December 2020 / Revised: 17 December 2020 / Accepted: 24 December 2020 / Published: 31 December 2020

(This article belongs to the Special Issue Advances in Robot Motion and Control—In Memory of Professor Krzysztof Kozlowski)

Download

Browse Figures

Versions Notes

Abstract

:

Contacts between robots and environment are often assumed to be rigid for control purposes. This assumption can lead to poor performance when contacts are soft and/or underdamped. However, the problem of balancing on soft contacts has not received much attention in the literature. This paper presents two novel approaches to control a legged robot balancing on visco-elastic contacts, and compares them to other two state-of-the-art methods. Our simulation results show that performance heavily depends on the contact stiffness and the noises/uncertainties introduced in the simulation. Briefly, the two novel controllers performed best for soft/medium contacts, whereas “inverse-dynamics control under rigid-contact assumptions” was the best one for stiff contacts. Admittance control was instead the most robust, but suffered in terms of performance. These results shed light on this challenging problem, while pointing out interesting directions for future investigation.

Keywords:

balance control; force control; elastic contacts; optimization-based control; humanoid robots

1. Introduction

1.1. Problem Overview

Balance control of legged robots is a fundamental problem in robotics and is at the heart of numerous recent publications in the literature [1]. In this task, depending on whether the contact between the robot and the environment is rigid or soft, the robot will behave differently. However, for the sake of simplicity, a large part of studies has focused on the problem of controlling the balance of a rigid robot in contact with a rigid environment [2,3,4,5,6,7]. In this scenario, theoretically, the contact forces between the robot and the environment can be modified instantaneously. This is no longer possible if the environment is visco-elastic, or, equivalently, if a visco-elastic element is located between the robot actuators and the contact points, as in many legged robots which include soft elements at the ankles or the feet [7,8,9,10]. These compliant elements are extremely useful: they absorb impact during walking/running, protect the mechanical structure, tend to improve the quality of force measurements, and can make walking more efficient [10]. Moreover, the assumption of rigid contact is always an approximation because in the real world all contacts are visco-elastic up to a certain extent. In the presence of elasticity, the contact forces cannot be instantaneously modified, but their derivative—either of first or second order—can be modified depending on the contact damping (as detailed in Section 2). Despite the popularity of visco-elastic elements, balancing on visco-elastic contacts is still an open problem. This question is at the heart of the paper. It is addressed through the introduction of two new compliant controllers and a comparative study with existing strategies.

1.2. State of the Art

The most common approach is to neglect elasticity in the balance controller and simply assume that contacts are rigid. This method works fine for contacts that are not too soft and that are sufficiently well damped, as was demonstrated with a passivity-based controller on the Toro humanoid robot [11]. However, in general, making this hypothesis results in destabilizing oscillations [1]. Force feedback can then be used for damping these oscillations. This is usually done by using force feedback to modify the position references given to a high-gain joint position controller, leading to admittance-control schemes [5,7,9]. Although this approach proved to be successful in practice, its lack of theoretical foundations makes it challenging to tune and analyze in a robust and repeatable way.

A crucial issue of admittance-control schemes is the resulting delay in the force tracking. To deal with it, two approaches have been proposed so far. The first one [3] is to prove that the controller is robust to these delays. The second one is to account for the delay by introducing a 1-st order low-pass filter in the dynamic model used by balance controller. Our analysis reveals that the force tracking delay is actually not an issue, so long as the admittance-control gains are properly set.

Instead of relying on admittance control, another approach is to try to change the contact forces at the next time step [12] since they cannot be changed instantaneously. This approach exploits a model of the visco-elastic contacts for the controller design. However, as it mainly relies on contact damping, it cannot be applied if the damping is too low (see Section 2.1).

Recently, the state-of-the-art Whole-Body Controller [13] has been extended to ensure consistency to terrain compliance [14]. In this approach the controller incorporated a soft-contact model and was connected with an online learning algorithm that estimates the terrain compliance. Though that work focused on the case of well-damped contacts, we focus here on the underdamped case.

1.3. Contributions

The contribution of this paper is two-fold. First two novel strategies are introduced for controlling the balance of legged robots under visco-elastic contacts with their environment. The proposed approach unifies position and force feedback, leading to a simple gain-tuning procedure for which standard tools from linear system control can be used (e.g., LQR, pole placement). Second, a thorough comparison of different controllers is proposed for different kinds of compliant contact and noisy measurements to discuss the efficiency of the different approaches. More precisely, the two novel controllers that were designed to deal with visco-elastic contacts are compared in simulation with two state-of-the-art approaches for the same stabilization task of a model of the lateral dynamics of HRP-2 in double support and under different contact stiffnesses and noisy conditions. The study highlights pros and cons of each method according to the different parameters and demonstrates the performance of the proposed controllers in case of compliant contact.

Even though the paper mainly focuses on legged robots, the results are also of great interest for manipulation problems for which the control of compliant contacts is required.

1.4. Paper Structure

Section 2 introduces the visco-elastic contact model and the robot dynamics. Section 3 summarizes two state-of-the-art methods. The first one (Section 3.1) is classically used for rigid contacts (TSID-Rigid [4]). The second one (Section 3.2) is an admittance-control scheme (Adm-Ctrl) relying on joint position control [8]. Section 4 presents an extension of Adm-Ctrl (TSID-Adm), while Section 5 proposes a novel approach to balance on elastic contacts (TSID-Flex-K). Section 6 presents simulation results comparing the four methods. Finally, Section 7 concludes the paper.

2. Background: Dynamics and Contact Model

2.1. Visco-Elastic Contact Model

Consider a robot in contact with the environment at k contact points. We assume that the contact surfaces are visco-elastic, or, equivalently, the robot is equipped with visco-elastic elements between each contact point and the last actuator of the corresponding kinematic chain. In the latter case, we can model the part of the robot after the visco-elastic element as part of the environment, and the contact point as the last point of the robot before the visco-elastic element. We also assume that each contact force

f_{i} \in R^{3}

is proportional to the induced contact position

p_{i} \in R^{3}

and velocity

{\dot{p}}_{i} \in R^{3}

:

f_{i} = K_{i} (p_{i}^{0} - p_{i}) - B_{i} {\dot{p}}_{i}, \forall i \in [1 . . k]

(1)

where

K_{i}, B_{i} \in R^{3 \times 3}

are the positive-definite diagonal stiffness and damping matrices, respectively, and

p_{i}^{0} \in R^{3}

is the contact position corresponding to a null force. Stacking all contact forces together we can rewrite (1) as:

f = K (p_{0} - p) - B \dot{p}

(2)

Given that contacts are unilateral, this model is valid as long as normal forces are positive (i.e., pushing). Since the contact forces are function of the robot configuration, we cannot change them instantaneously as in the rigid contact case, because of the finite maximum acceleration of the actuation. However, if

B \neq 0

, we can affect their time derivative through the contact point accelerations

\ddot{p}

:

\dot{f} = - K \dot{p} - B \ddot{p}

(3)

If instead

B \approx 0

, we can only affect the second time derivative of f:

\ddot{f} = - K \ddot{p}

(4)

2.2. Importance of Stiffness vs. Damping

Our main interest lies in the underdamped case—which is more challenging in our experience—which means that

B ≪ 2 \sqrt{K}

(

B = 2 \sqrt{K}

correspond to the critically damped case). In these cases, relying on (3) to control

\dot{f}

may not be convenient because very large values of

\ddot{p}

(hence motor commands) may be required.

Appendix C includes an order-of-magnitude analysis that supports this claim for the usual dynamics of humanoid robots and illustrates why we prefer to rely on (4) rather than on (3) for our controller design.

2.3. Centroidal Dynamics

To balance a legged robot, we must control its CoM c and its angular momentum l. The dynamics of these two quantities is described by the Newton–Euler equations, where all quantities are expressed in an arbitrary inertial frame with z aligned with gravity:

m \ddot{c} = \sum_{i = 1}^{k} f_{i} + m g

(5)

\dot{l} = \sum_{i = 1}^{k} (p_{i} - c) \times f_{i}

(6)

where

m \in R^{}

is the robot mass, and

g = (0, 0, - 9.81)

is the gravity acceleration. We can write (5) and (6) in matrix form as:

\begin{matrix} \underset{\ddot{x}}{\underset{︸}{[\begin{matrix} m \ddot{c} \\ \dot{l} \end{matrix}]}} = \underset{A}{\underset{︸}{[\begin{matrix} I_{3} & \dots & I_{3} \\ {(p_{1} - c)}_{\times} & \dots & {(p_{k} - c)}_{\times} \end{matrix}]}} \underset{f}{\underset{︸}{[\begin{matrix} f_{1} \\ ⋮ \\ f_{k} \end{matrix}]}} + \underset{b}{\underset{︸}{[\begin{matrix} m g \\ 0_{3} \end{matrix}]}}, \end{matrix}

(7)

where

y_{\times} \in R^{3 \times 3}

is the cross-product matrix associated with y.

2.4. Whole-Body Dynamics

The dynamics of a floating-base robot with n joints is described by the following equations:

M \dot{v} + h - J^{⊤} f = S^{⊤} τ,

(8)

where

M \in R^{(n + 6) \times (n + 6)}

is the mass matrix,

v \in R^{n + 6}

is the robot velocity vector,

h \in R^{n + 6}

contains the gravitational, centrifugal and Coriolis forces,

J \in R^{3 k \times (n + 6)}

is the contact Jacobian,

S = [\begin{matrix} 0_{n \times 6} & I_{n \times n} \end{matrix}] \in R^{n \times (n + 6)}

is the selection matrix of the actuated joints, and

τ \in R^{n}

are the joint torques. The same dynamics can be expressed by splitting the first 6 rows, which correspond to the unactuated floating base, from the last n rows, which correspond to the actuated joints:

M_{u} \dot{v} + h_{u} - J_{u}^{⊤} f = 0

(9)

M_{a} \dot{v} + h_{a} - J_{a}^{⊤} f = τ

(10)

Equation (9) is equivalent to the centroidal dynamics [15] and is sufficient to ensure dynamic consistency in the controllers based on Task-Space Inverse Dynamics (TSID) [4]. Finally, the relationship between the accelerations of the contact points and the robot configuration is given by:

\ddot{p} = J \dot{v} + \dot{J} v

(11)

This expression is obtained by derivative of the kinematic related to the contact point p:

\dot{p} = J v

.

3. State of the Art

3.1. Inverse Dynamics with Rigid Contacts (TSID-Rigid)

This section summarizes the classic approach for balancing a legged robot in rigid contact with the environment [4].

The desired momentum rate of change is typically computed with a simple PD control law:

{\ddot{x}}_{d} = [\begin{matrix} m {\ddot{c}}_{d} \\ {\dot{l}}_{d} \end{matrix}] = [\begin{matrix} m {\ddot{c}}^{*} + K_{d}^{c o m} {\dot{e}}_{c} + K_{p}^{c o m} e_{c} \\ {\dot{l}}^{*} + K_{d}^{a m} {\dot{e}}_{l} + K_{p}^{a m} e_{l} \end{matrix}],

(12)

where

e_{c}

and

e_{l}

are the tracking error of the linear and angular part of the centroidal state, respectively. Often, the proportional part of the angular momentum feedback is neglected, but we use it here to ensure stability [6]. The contact forces and the robot accelerations are computed by solving the following least-squares problem:

\begin{matrix} \underset{\dot{v}, f}{minimize} & ∥ A f - {\ddot{x}}_{d} {+ b ∥}^{2} + w_{f} | | f - f^{*} {| |}^{2} + w_{p o s t} | | {\dot{v}}^{p o s t} - \dot{v} {| |}^{2} \\ subject to & M_{u} \dot{v} + h_{u} = J_{u}^{⊤} f \\ J \dot{v} + \dot{J} v = 0 \\ | M_{a} \dot{v} + h_{a} - J_{a}^{⊤} f | \leq τ^{m a x} \\ B f \leq 0, \end{matrix}

(13)

where

w_{f} \in R^{}

is the weight and

f^{*} \in R^{3 k}

is the reference of the force regularization task,

w_{p o s t} \in R^{}

is the weight of the postural task, the first constraint represents the centroidal dynamics, the second one is the rigid-contact constraint, the third constraint represents the joint torque limits, and the last one represents the friction cone limits. The reference postural task accelerations are:

{\dot{v}}^{p o s t} = K_{p}^{p o s t} e_{q} (q^{*}, q) - K_{d}^{p o s t} v,

(14)

where

K_{p}^{p o s t}, K_{d}^{p o s t} \in R^{(n + 6) \times (n + 6)}

are positive-definite diagonal gain matrices,

q^{*} \in S E (3) \times R^{n}

is a given reference posture, and

e_{q} (q_{1}, q_{2})

is an error function mapping two configurations

q_{1}, q_{2} \in S E (3) \times R^{n}

to the log of their relative displacement. Once the optimal contact forces

f_{d}

and robot accelerations

{\dot{v}}_{d}

are found, we compute the desired joint torques using (10). Please note that this approach does not require (nor exploit) any force measurement.

3.2. Admittance Control (Adm-Ctrl)

A classic way to control the contact wrenches in case of flexible contacts is to rely on admittance control. Several versions of admittance control exist and have been shown to perform well on real humanoid robots [7,8,16,17]. We decided to use the version with the minimum number of gains to simplify the gain-tuning procedure.

First, we compute the desired contact forces

f_{d}

as:

\begin{matrix} \underset{f}{minimize} & | | A f - {\ddot{x}}_{d} + b {| |}^{2} \\ subject to & B f \leq 0, \end{matrix}

(15)

where

{\ddot{x}}_{d}

is defined as in (12). We then compute the reference velocity of the contact points according to the force tracking error:

{\dot{p}}_{d} = - K_{f} (f_{d} - f)

(16)

This reference velocity is directly used in an inverse kinematics (IK) algorithm to compute reference joint velocities. The IK is computed on each limb independently:

{\dot{q}}_{j d} = {(J S^{⊤})}^{†} {\dot{p}}_{d}

(17)

These joint velocities are then integrated and given to the high-gain position controller, which computes the joint torque commands:

τ_{d} = K_{p}^{j} (q_{j d} - q_{j}) - K_{d}^{j} {\dot{q}}_{j}

(18)

4. Inverse Dynamics Admittance Control (TSID-Adm)

To improve the performance of admittance control we suggest integrating it with an inverse dynamics control law. Once we have computed

{\dot{p}}_{d}

with (16), we compute the desired contact point accelerations as:

{\ddot{p}}^{*} = K_{d}^{a d m} ({\dot{p}}_{d} - \dot{p})

(19)

Finally, we rely on an inverse-dynamics control law to track these contact point accelerations:

\begin{matrix} \underset{\dot{v}}{minimize} & ∥ J \dot{v} + \dot{J} v - {\ddot{p}}^{*} ∥^{2} + w_{p o s t} | | {\dot{v}}^{p o s t} - \dot{v} {| |}^{2} \\ subject to & M_{u} \dot{v} + h_{u} = J_{u}^{⊤} \hat{f} \\ | M_{a} \dot{v} + h_{a} - J_{a}^{⊤} \hat{f} | \leq τ^{m a x} \end{matrix}

(20)

5. Flexible TSID (TSID-Flex-K)

This section presents an original control formulation, which consists of a standard feedback linearization. In the case of visco-elastic contacts we cannot directly control f, but only its first or second derivative. As already mentioned, since we are mainly interested in underdamped contacts, we assume we can only control

\ddot{f}

. Thus, we differentiate (7) twice and use (4) to express

x^{(4)}

as a function of

\ddot{p}

:

\begin{matrix} x^{(4)} & = A \ddot{f} + 2 \dot{A} \dot{f} + \ddot{A} f \\ = \underset{A_{K}}{\underset{︸}{(A_{f} - A K)}} \ddot{p} + \underset{a_{K}}{\underset{︸}{2 \dot{A} \dot{f} - {\ddot{A}}_{c} f}}, \end{matrix}

(21)

where:

{\ddot{A}}_{c} ≜ [\begin{matrix} 0_{3} & \dots & 0_{3} \\ {\ddot{c}}_{\times} & \dots & {\ddot{c}}_{\times} \end{matrix}] A_{f} ≜ [\begin{matrix} 0_{3} & \dots & 0_{3} \\ {f_{1}}_{\times} & \dots & {f_{k}}_{\times} \end{matrix}]

(22)

5.1. Feedback Linearization

We can find the accelerations

\dot{v}

that track at best the desired

x_{d}^{(4)}

by solving the following least-squares problem [4]:

\begin{matrix} \underset{\dot{v}}{minimize} & ∥ A_{K} J \dot{v} + A_{K} \dot{J} v + a_{K} - x_{d}^{(4)} ∥^{2} + \\ w_{p o s t} | | {\dot{v}}^{p o s t} - \dot{v} {| |}^{2} \\ subject to & M_{u} \dot{v} + h_{u} = J_{u}^{⊤} \hat{f} \\ | M_{a} \dot{v} + h_{a} - J_{a}^{⊤} \hat{f} | \leq τ^{m a x} \end{matrix}

(23)

where

\hat{f} \in R^{3 k}

are the measured/estimated contact forces. Once we have the optimal accelerations

{\dot{v}}_{d}

, we can compute the desired joint torques using (10) with

\dot{v} = {\dot{v}}_{d}

and

f = \hat{f}

.

5.2. Accounting for Force Variations during the Time Step

Since we can only update the motor commands at discrete time steps, there is always an error due to state variations in between time steps. Normally, these errors are negligible because small state variations result in small variations of the quantities in (9) and (10). However, when the robot is in contact with a stiff environment, small displacements of the contact points lead to large variations of the contact forces. Therefore, we can improve performance by accounting for the variation of f during the time step, assuming the following approximated time evolution:

f (t) = f + t \dot{f} t \in [0, δ t],

(24)

where

δ t

is the controller time step. Under this assumption the accelerations

\dot{v}

vary during the time step. To get the desired average value of

\dot{v}

during the time step the controller must compensate for the average value of f, which is:

f^{a v g} = \hat{f} + \frac{δ t}{2} \hat{\dot{f}}

(25)

In (23) we can thus replace

\hat{f}

with

\hat{f} + \frac{δ t}{2} \hat{\dot{f}}

.

5.3. Linear Feedback Regulator

The least-squares problem (23) allows us to directly impose

x^{(4)}

—if it is compatible with the problem constraints. Thus, the resulting dynamics is a 4-th order integrator. We define x as the centroidal state

x = (m c, l^{Σ})

, where

l^{Σ} \in S O (3)

should be the integral of the angular momentum. However, since this is not a measurable quantity [18], we approximate it with the orientation of the base link (which is typically the heaviest link), scaled by the 3D robot inertia. We then regulate x through a linear feedback control law:

\begin{matrix} x_{d}^{(4)} = & K_{p}^{m} e_{x} (x^{*}, x) + K_{d}^{m} ({\dot{x}}^{*} - \dot{x}) + \\ K_{a}^{m} ({\ddot{x}}^{*} - \ddot{x}) + K_{j}^{m} ({\overset{⃛}{x}}^{*} - \overset{⃛}{x}) + x^{(4) *}, \end{matrix}

(26)

where

x^{*} (t)

is a reference centroidal trajectory, and

e_{x} (x_{1}, x_{2})

is an error function mapping two centroidal states

x_{1}, x_{2} \in S E (3)

to the log of their relative displacement. The diagonal positive-definite feedback gain matrices

K_{p}^{m}, K_{d}^{m}, K_{a}^{m}, K_{j}^{m} \in R^{6 \times 6}

need to be chosen so that the closed-loop system be stable.

5.4. Friction Force Constraints

The inverse-dynamics least-squares problem typically contains a linear approximation of the force friction cone constraints. This is no longer possible in the case of visco-elastic contacts, because the contact forces are not a problem variable. However, we can still try to satisfy the friction cone constraints by bounding the contact force accelerations, which are affine functions of

\dot{v}

. This problem is similar to trying not to hit the joint position bounds by constraining the joint accelerations [19]. The friction cone constraints can be approximated by a set of linear constraints [4] of the form:

b^{⊤} f_{i} \leq 0

(27)

Using the approach of [19], given a bound on the force accelerations

{\ddot{f}}_{m a x} \in R^{3}

(i.e.,

| {\ddot{f}}_{i} | \leq {\ddot{f}}_{m a x}

), we can compute the maximum

{\dot{f}}_{i}

in direction b such that it is possible to satisfy (27) in the future:

b^{⊤} {\dot{f}}_{i} \leq \sqrt{- 2 | b^{⊤} {\ddot{f}}_{m a x} | (b^{⊤} f_{i})}

(28)

Even though

{\ddot{f}}_{m a x}

depends on q, the method that we use [19] assumes constant acceleration bounds. Therefore,

{\ddot{f}}_{m a x}

should be seen here as a user parameter that defines how conservative the algorithm should be. Putting together all the friction cone constraints as

B f \leq 0

we can bound the force accelerations to ensure that (28) be satisfied at the next time step:

B \ddot{f} \leq b_{f}^{m a x},

(29)

where

b_{f}^{m a x}

is a function of f and

\dot{f}

[19]. Finally, this constraint can be expressed as a function of

\dot{v}

:

- B K (J \dot{v} + \dot{J} v) \leq b_{f}^{m a x}

(30)

5.5. Summary

The controller is finally obtained by adding (30) as a constraint to (23), with

x_{d}^{(4)}

computed by (26).

6. Results

This section presents simulation results to compare the different approaches discussed in the paper:

TSID-Rigid: a state-of-the-art approach, see Section 3.1.
Adm-Ctrl: a state-of-the-art approach, see Section 3.2.
TSID-Adm: a novel approach, see Section 4.
TSID-Flex-K: a novel approach, see Section 5.

6.1. Simulation Environment

We have carried out all our simulations using a simple 2D biped robot (see Figure 1), which moves in the YZ plane. The robot is composed by two legs and a torso, and it has 4 actuated joints: two rotatory hip roll joints and two prismatic knee joints. The geometric and inertial parameters of the links have been taken from HRP-2’s model. The simulation is based on a simple explicit Euler integration scheme, and all simulation parameters are listed in Table 1. The robot has two point feet in contact with a visco-elastic ground. We have investigated different values for the contact stiffness

K = 2 α 10^{5}

N/m, ranging from

α = 0.01

(i.e., soft) to

α = 1

(i.e., stiff), while keeping the same contact damping ratio.

We chose to limit our tests to this simple 2D model with point contacts for several reasons. First, this simple model already captures the main features of a biped robot, and it was able to reproduce behaviors that we had previously observed on the real HRP-2 robot [20]. Second, results obtained with a simple model are easier to analyze and can give us more insights into the behavior of the controllers. Finally, simulations are faster with a simple model, which has allowed us to run more of them, making our statistics more significant.

6.2. Gain Tuning

For tuning the momentum gains we have used the gain-tuning procedure described in Appendix A for all controllers (except Adm-Ctrl) and for 7 values of a user-specified parameter (

w_{u}

) that defines the trade-off between tracking and control effort. The other task gains and weights have been tuned by hand and are reported in Table 2. We have set the weight of the postural task so that it would not significantly affect the primary momentum task.

6.3. Test Description

Our tests aim to evaluate the ability of each controller to balance the robot on visco-elastic contacts. To do so, we start the simulation with a non-zero CoM velocity, and we observe whether and how each controller can decelerate the CoM and bring it back to its initial position.

We start by performing three tests, each of which focuses on one specific controller. In our first test (Test A) we show how TSID-Rigid struggles with soft contacts. Then, in Test B, we examine TSID-Flex-K and TSID-Adm, which work well with soft contacts, but get unstable for stiff contacts. Test C shows that Adm-Ctrl remains stable in both cases. Finally, in Test D, we show a comprehensive comparison of all controllers in different situations.

We have tested the controllers both in an ideal simulation (no noise, modeling errors, and perfect state estimation) and in more realistic conditions by introducing:

realistic encoder quantization errors and white Gaussian noise on force sensing and gyroscope (see Table 3);
an Extended Kalman Filter (explained in Appendix B) to estimate the robot state, with the covariances specified in Table 4;
limited torque bandwidth by filtering the desired joint torques with a first-order low-pass filter with a cut frequency of 30 Hz. The best torque-tracking bandwidths that have been reported for high-performance actuators are between 40 Hz and 60 Hz (e.g., 40 Hz for hydraulic actuators [2], 46 Hz for electric motors with harmonic drives [21], 60 Hz for series elastic actuators [22]).
joint Coulomb friction of about 1% of the maximum joint force/torque (0.4 Nm for hip joints, and 4 N for knee joints, which are prismatic).

In the following we will refer to the first kind of simulations as noiseless, and to the second kind as noisy.

6.4. Test A: TSID-Rigid

We have tested TSID-Rigid on both rigid and soft contacts. Figure 2 shows the CoM trajectories obtained with noiseless simulation. With rigid contacts the controller regulates well the CoM position. Even the CoM velocity is smooth, but looking at the CoM jerk/snap we can see high-frequency oscillations at the beginning of the motion. For soft contacts, even though the system is still stable, performance has deteriorated. We can still see high CoM jerk/snap, which is undesirable, but since this time the oscillations last longer, they are also visible in the CoM velocity. Moreover, with rigid contacts the system remained stable for all gains, but this was not the case with soft contacts, where most gains led to instability (more details in Test D). This clearly highlights the need for a different control strategy to deal with soft contacts.

6.5. Test B: TSID-Flex-K and TSID-Adm

TSID-Flex-K and TSID-Adm have performed extremely well with soft contacts. Figure 3 shows that TSID-Flex-K gives smoother CoM trajectories than TSID-Rigid, e.g., for the noiseless case, peak CoM jerk is ≈7 ms

^{- 3}

rather than ≈26 ms

^{- 3}

, while settling time is smaller. TSID-Adm has shown comparable performance on soft contacts, and Figure 4 shows that it has performed well also on medium contacts. TSID-Flex-K instead performed well on medium contacts only in the noiseless case, whereas it became unstable when noise was introduced. For hard contacts, TSID-Flex-K and TSID-Adm still performed well in noiseless simulations, but got unstable in the noisy case, as we will show in detail in Test D.

6.6. Test C: Adm-Ctrl

Adm-Ctrl has performed well across a large range of contact stiffnesses, as shown in Figure 5. Moreover, this controller has proven a remarkable robustness to noise, giving similar performance in the noiseless and noisy cases. The price to pay for this robustness seems to be a loss in performance, as measured by the cost function used in our gain-tuning procedure (more details in the following).

6.7. Test D: All Controllers

We now compare all four controllers, with different gains and contact stiffnesses. The results are summarized by Figure 6. Each plot shows the results of the four controllers for all the gain sets (1 for Adm-Ctrl and 7 for the other controllers), but for a specific contact stiffness (either soft, medium, or stiff) and a specific simulation type (either noiseless or noisy). The points missing in the plots are the ones that resulted in unstable simulations (i.e., NaN values). The plots highlight the trade-off between average CoM tracking error (x axis) and average CoM snap (y axis). The average CoM tracking error (“State cost”) measures how fast balance is recovered after an external perturbation, and it is defined as a weighted sum of CoM position and its first three derivatives along the simulation time:

State \cos t = \sum_{i = 0}^{N} y_{i}^{⊤} Q y_{i}, y ≜ (c, \dot{c}, \ddot{c}, \overset{⃛}{c}), Q ≜ diag ([\begin{matrix} 1, 0.1, 10^{- 3}, 10^{- 6} \end{matrix}])

(31)

The average CoM snap (“Snap cost”) quantifies both the CoM oscillations and the control effort, and it is defined as:

Snap \cos t = \sum_{i = 0}^{N} {(c_{i}^{(4)})}^{⊤} c_{i}^{(4)}

(32)

6.7.1. Soft Contacts

In the noiseless simulations both TSID-Flex-K and TSID-Adm perform well, meaning that they achieve a good trade-off between state tracking and control effort. For lower feedback gains (i.e., higher position tracking errors) TSID-Flex-K and TSID-Adm achieve very similar performance, whereas for higher feedback gains they differ because of the ways in which they handle friction cones.

The unavoidable force tracking delay of admittance-control strategies is typically seen as an issue in the literature [3,8], because it clashes with the assumption of second-order dynamics. However, if we accept a fourth-order dynamics for our system, we can see that this delay is not at all incompatible with good CoM tracking (as long as gains are properly tuned), as shown for instance in Figure 7. The only downside of this delay is that it may lead TSID-Adm to violate friction cone constraints. Indeed, even though TSID-Adm constrains the desired forces inside friction cones, real forces may violate these constraints because of the tracking delay.

Adm-Ctrl performs reasonably well too, but less than TSID-Flex-K and TSID-Adm. On the contrary, TSID-Rigid is unstable for all the tested gain sets, except one, and even in this one case its performance is inferior to the other controllers (see Figure 2).

In the noisy simulations, things change remarkably. First, the performance of TSID-Flex-K and TSID-Adm gets worse, especially in terms of CoM snap, and in particular for lower feedback gains. Nonetheless, they remain the best controllers overall. Adm-Ctrl is only slightly affected by the introduction of noise, showing a remarkable robustness. Surprisingly, TSID-Rigid performs much better than in the noiseless case, showing a stable behavior with 3 gain sets, and getting a performance that is much closer to the Pareto front. To understand why, we have performed another test removing Coulomb friction (see Figure 8). This test shows that Coulomb friction was the cause of the improved stability of TSID-Rigid, and of much of the performance deterioration of the other controllers.

6.7.2. Medium Contacts

In the noiseless case TSID-Flex-K and TSID-Adm perform well, similarly to the soft-contact case. Adm-Ctrl also performs well, resulting in just a slightly higher CoM snap than for soft contacts. The main difference regarding the soft-contact case is that TSID-Rigid is stable for most gain sets (5 out of 7), even though it performs worse than the other controllers.

In the noisy simulations TSID-Flex-K becomes unstable for all feedback gains. TSID-Adm becomes unstable only for low feedback gains, but results in an increased CoM snap for higher gains. The performance of Adm-Ctrl and TSID-Rigid instead gets only slightly worse, showing a significant robustness. Out of the 7 Pareto-optimal points, 4 are associated with TSID-Rigid, and 3 to TSID-Adm.

Despite the performance of TSID-Rigid and TSID-Adm may seem comparable according to this plot, they are qualitatively different. Figure 9 shows one example of contact forces generated by the two approaches. TSID-Rigid generates fast-changing forces because it immediately applies the joint torques needed to generate the desired contact forces, and given the relatively high contact stiffness, these joint torques quickly result in the desired force change. TSID-Adm instead generates forces with larger oscillations, but with lower frequencies, which may be preferable on real hardware.

6.7.3. Stiff Contacts

In the noiseless case we can already observe a significant performance deterioration for TSID-Flex-K and TSID-Adm, which are unstable for many more gain sets (2 and 4 out of 7, respectively). Nonetheless, they remain the best controllers in this scenario. This deterioration is due to the control frequency, which is insufficient for this contact stiffness, so the sampling starts affecting stability and performance. For high contact stiffnesses, performance could be improved by designing and tuning controllers in the discrete-time domain.

TSID-Rigid and Adm-Ctrl perform similarly to the medium-contact case: stable, but with high CoM snap. In particular TSID-Rigid results in high-frequency oscillations of the contact forces (see Figure 10), which may excite unmodelled dynamics (e.g., link flexibility) in real systems.

In the noisy case TSID-Flex-K and TSID-Adm became unstable for all gains. We believe that this is due to the delay in the compensation of the estimated contact forces, which triggers instability. This delay is caused by the limited joint torque bandwidth and the joint Coulomb friction.

TSID-Rigid and Adm-Ctrl remained stable, but their performance is far from satisfactory. Figure 5 and Figure 11 show the obtained CoM and force trajectories, which are extremely oscillatory (especially for Adm-Ctrl) and are thus unlikely to work on real hardware.

The interested reader is invited to watch the accompanying video for more details about the simulation results.

7. Conclusions

This paper presented two novel control methods for balancing a legged robot on visco-elastic contacts: TSID-Flex-K and TSID-Adm. To ensure a fair comparison between novel and state-of-the-art methods we have used a unified gain-tuning procedure (except for Adm-Ctrl, which we have tuned by hand). We have performed several tests in simulation using a simple 4-DoF robot.

The presented results depict a broad picture of performance for 4 controllers, across 3 contact stiffnesses and 7 gain sets each (except for Adm-Ctrl, for which we have tested only one gain set). We highlight here the main pros and cons of each controller, which are also summarized in Table 5.

The novel controllers presented in this paper (TSID-Flex-K and TSID-Adm) perform well for soft contacts, reasonably well for medium contacts, but they have showed instability for stiff contacts and noisy simulations.

TSID-Rigid has unsurprisingly shown the opposite trend, getting unstable for soft contacts (even though it was able to regain stability when introducing joint Coulomb friction), but then becoming more and more competitive as the contact stiffness increased. This is actually reasonable because the higher the contact stiffness, the more the system behaves as if contacts were rigid, which is a key assumption in TSID-Rigid.

One of the main advantages of the novel controllers is their ease of gain tuning, which allows for a unified tuning of force and position feedback gains (as acceleration/jerk gains can be seen as force gains). However, the novel controllers have demonstrated a remarkable sensitivity to uncertainties. We believe that this is due to the attempt to compensate for the contact forces, which are rapidly changing (especially for stiff contacts), thus easily leading to destabilizing compensation errors. This makes these methods sensitive to low control frequencies and actuation delays, such as the ones introduced by limited joint torques bandwidth and joint Coulomb friction.

Adm-Ctrl has been the only controller that was always stable, making it the most robust of them all. Despite its superior robustness, in no condition Adm-Ctrl has outperformed all the other controllers. Moreover, its gain tuning remains a heuristic procedure, and its convergence properties are not yet clearly understood. All of this makes this controller hard to use in practice, highlighting the need for more work on this subject.

Overall, the case of stiff contacts was the hardest for the tested controllers: only TSID-Rigid and Adm-Ctrl remained stable, but they resulted in oscillatory trajectories, which are (at best) unpleasant on real hardware. These oscillations are due to the small contact damping, which combined with the high contact stiffness makes it hard for the controller to damp the high-frequency force oscillations.

Finally, these results show that controllers that are well supported by theoretical results and perform exquisitely in ideal conditions, can then fail miserably in realistic simulations. Thus, working on robustness issues seems paramount for future work.

We believe that the subject of balance control on visco-elastic contacts still requires investigation. We are especially interested in improving the robustness of the proposed controllers, in the hope to find a better trade-off between performance and robustness. Another interesting direction could be to understand better the theoretical properties of Adm-Ctrl, which has clearly showed great robustness in our tests, but it remains unclear whether these capabilities can somehow be guaranteed in general.

Author Contributions

T.F. and A.D.P. developed the methodology; T.F., A.D.P. and M.K. conceived and designed the experiments; T.F., A.D.P. and N.M. wrote the software; T.F. and A.D.P. wrote the original draft of the paper; M.K. and L.R. reviewed and edited the paper; A.D.P., L.R. and N.M. supervised the work; L.R. and N.M. acquired the funding that supported the work. All authors have read and agreed to the published version of the manuscript.

Funding

This work was partially supported by the European Unions Horizon 2020 research and innovation program under Grant Agreement 780684, the European Research Councils under Grant 637935 and the National Science Foundation under Grant CMMI-1825993.

Informed Consent Statement

Not applicable.

Data Availability Statement

Publicly available code was used in this study. This code can be found here: https://github.com/thomasfla/simple-biped.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

TSID	Task-Space Inverse Dynamics
CoM	Center of Mass
Adm-Ctrl	Admittance Control

Appendix A. Gain Tuning

The performance of each controller strongly depends on how well its gains are tuned. However, this is seldom considered in comparisons. Therefore, the ability of the user in tuning a specific controller (e.g., because of experience) may bias the results. To avoid this, this section presents a unified approach for tuning the controllers presented above. The key idea is to write down the closed-loop dynamics of the centroidal state as:

\begin{matrix} u (t) = - \bar{K} (θ) y (t) \\ \dot{y} (t) = \bar{A} y (t) + \bar{B} u (t) + r (y (t)) \approx H (θ) y (t) \end{matrix}

(A1)

where y is a function of the robot state

(q, v)

that contains (at least) the centroidal state x and its first derivative

\dot{x}

,

H ≜ \bar{A} - \bar{B} \bar{K}

is the closed-loop transfer matrix, which is a function of the gain parameters

θ

, and

r (y)

is the residual nonlinear part of the dynamics. Our goal will be to derive (A1) for each controller such that

r (y)

is as small as possible (ideally null), so that we can neglect it and tune the gains for the resulting linear system. Once we have (A1) we can look for a value of

θ

that solves the following optimal control problem:

\begin{matrix} \underset{θ, y (t), u (t)}{minimize} & \int_{0}^{T} [y {(t)}^{⊤} Q y (t) + u {(t)}^{⊤} R u (t)] dt \\ subject to & \dot{y} (t) = \bar{A} y (t) + \bar{B} u (t) \forall t \in [0, T] \\ u (t) = - \bar{K} (θ) y (t) \forall t \in [0, T] \\ y (0) = y_{0}, \end{matrix}

(A2)

where

Q, R, T

and

y_{0}

are provided by the user. In the following we will assume that:

the robot dynamical model is perfect,
$w_{f}$ and $w_{p o s t}$ are sufficiently small not to significantly affect the momentum task,
the inequality constraints (e.g., friction cones) are not active.

Thanks to the last two assumptions, we can approximate the desired contact forces computed by (13) and (15) as:

f_{d} = A_{0}^{†} ({\ddot{x}}_{d} - b),

(A3)

where

A_{0}

is the value of A computed at a reference configuration

q_{0}

.

Appendix A.1. TSID-Flex-K

This controller performs an exact feedback linearization, therefore the closed-loop dynamics of x is a 4-th order integrator, as shown by (26). Consequently, we can find the gain parameters

θ = (K_{p}^{m}, K_{d}^{m}, K_{a}^{m}, K_{j}^{m})

using LQR.

Appendix A.2. Admittance Control

We could not find a proper way of linearizing the closed-loop dynamics for this case, so we have simply tuned the controller by hand.

Appendix A.3. TSID-Admittance

Let us define the state of the system as

y = (x, \dot{x}, f, \dot{f})

. The matrices

\bar{A}, \bar{B}, \bar{K}

are defined as:

\begin{matrix} \bar{A} = [\begin{matrix} 0 & I & 0 & 0 \\ 0 & 0 & A_{0} & 0 \\ 0 & 0 & 0 & I \\ 0 & 0 & 0 & 0 \end{matrix}] \bar{B} = [\begin{matrix} 0 \\ 0 \\ 0 \\ I \end{matrix}] \\ \bar{K} = [\begin{matrix} K_{1} A_{0}^{†} K_{p}^{m} & K_{1} A_{0}^{†} K_{d}^{m} & K_{1} & K_{d}^{a} \end{matrix}] \end{matrix}

(A4)

where:

K_{1} ≜ K K_{d}^{a} K_{f}

. The gain parameters are

θ = (K_{d}^{a}, K_{p}^{m}, K_{d}^{m}, K_{f})

. Since we cannot freely choose

\bar{K}

we should rely on global optimization to find

θ

. However, if we focus on the CoM only (i.e., neglecting the angular momentum), we can derive a simpler expression of the closed-loop dynamics, which allows us to use LQR. First, we assume that:

contact damping is negligible: $B \approx 0$ ;
$K_{d}^{a d m}$ and $K K_{f}$ are diagonal matrices;
all entries of $K_{d}^{a d m}$ , and $K K_{f}$ corresponding to the same direction (X, Y, Z) have the same value; for instance, the admittance gain $K_{d}^{a d m}$ in direction Z must be the same for all contact points.

We can then define each of these matrices in terms of the 3d diagonal matrices

K_{d}^{a 3}

, and

K_{f}^{3}

:

\begin{matrix} K_{d}^{a d m} & = diag ([\begin{matrix} K_{d}^{a 3}, & \dots, & K_{d}^{a 3} \end{matrix}]) \\ K K_{f} & = diag ([\begin{matrix} K_{f}^{3}, & \dots, & K_{f}^{3} \end{matrix}]) \end{matrix}

(A5)

We define the state as

y = (c, \dot{c}, \ddot{c}, \overset{⃛}{c})

, and we get the following closed-loop dynamics:

\begin{matrix} \bar{A} = [\begin{matrix} 0 & I & 0 & 0 \\ 0 & 0 & I & 0 \\ 0 & 0 & 0 & I \\ 0 & 0 & 0 & 0 \end{matrix}] \bar{B} = [\begin{matrix} 0 \\ 0 \\ 0 \\ I \end{matrix}] \\ \bar{K} = [\begin{matrix} K_{f}^{3} K_{d}^{a 3} K_{p}^{m} & K_{f}^{3} K_{d}^{a 3} K_{d}^{m} & K_{f}^{3} K_{d}^{a 3} & K_{d}^{a 3} \end{matrix}] \end{matrix}

(A6)

This closed-loop dynamics is equivalent to the one obtained by TSID-Flex-K (26), thus, we can find the desired gains using LQR.

Appendix A.4. TSID-Rigid

Since TSID-Rigid is a control scheme for a second-order system we define the state as

y = (x, \dot{x})

, and we get the following closed-loop dynamics:

\begin{matrix} \bar{A} = [\begin{matrix} 0 & I \\ 0 & 0 \end{matrix}] \bar{B} = [\begin{matrix} 0 \\ I \end{matrix}] \\ \bar{K} = [\begin{matrix} K_{p}^{m} & K_{d}^{m} \end{matrix}] \end{matrix}

(A7)

Appendix A.5. Cost Function

Ideally we would like to tune the gains of all controllers based on the same cost function. However, we do not have the same state for all controllers, in particular:

for TSID-Flex-K, $y = (x, \dot{x}, \ddot{x}, \overset{⃛}{x})$ and $u = x^{(4)}$ ,
for TSID-Adm, $y = (c, \dot{c}, \ddot{c}, \overset{⃛}{c})$ and $u = c^{(4)}$ ,
for TSID-Rigid, $y = (x, \dot{x})$ and $u = \ddot{x}$ .

Therefore, we start from a cost function for TSID-Adm defined by

(Q, R)

:

Q = [\begin{matrix} I_{3} & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}] R = w_{u} I_{3},

(A8)

where

w_{u} \in R^{}

is a user-defined hyper-parameter. Now we must find the equivalent cost functions for the other controllers. To do so, we introduce a matrix P that projects the state-control pair

(y, u)

of a given controller to the state-control pair of TSID-Adm. Once P is defined we can use it in the cost function of (A2):

[\begin{matrix} y^{⊤} & u^{⊤} \end{matrix}] P^{⊤} [\begin{matrix} Q & 0 \\ 0 & R \end{matrix}] P [\begin{matrix} y \\ u \end{matrix}]

(A9)

For TSID-Flex-K we have:

P = m^{- 1} diag ([\begin{matrix} I_{3 \times 6} & I_{3 \times 6} & I_{3 \times 6} & I_{3 \times 6} & I_{3 \times 6} \end{matrix}])

(A10)

where

I_{3 \times 6}

is a matrix that selects the first 3 elements of a 6d vector. Unfortunately, for TSID-Rigid it is not possible to have the same cost function because its state does not contain high-order derivatives as the other controllers, so we will use a different cost function:

Q = [\begin{matrix} I_{3} & 0 \\ 0 & 0 \end{matrix}] R = w_{u} I_{3}

(A11)

For TSID-Flex-K and TSID-Adm we have used values of

w_{u}

evenly spaced in logarithmic scale between

10^{- 12}

and

10^{- 6}

. For TSID-Rigid instead we have used values between

10^{- 3.5}

and

10^{- 1.5}

.

Appendix B. Estimation

The control method TSID-Flex-K requires an estimation of the CoM position, its first three derivatives, the angular momentum, and its first two derivatives. To estimate these quantities, we suggest relying on an Extended Kalman Filter (EKF), which is an extension of the approach presented in [23]. We define the state of the system as:

s = (c, \dot{c}, l, f, \dot{f})

(A12)

The continuous time system dynamics is:

\dot{s} = [\begin{matrix} 0 & I & 0 & 0 & 0 \\ 0 & 0 & 0 & m^{- 1} I^{k} & 0 \\ 0 & 0 & 0 & {(p - c)}_{\times} & 0 \\ 0 & 0 & 0 & 0 & I^{k} \\ 0 & 0 & 0 & 0 & 0 \end{matrix}] [\begin{matrix} c \\ \dot{c} \\ l \\ f \\ \dot{f} \end{matrix}] + [\begin{matrix} 0 \\ w_{\ddot{c}} - g \\ 0 \\ 0 \\ u + w_{u} \end{matrix}],

(A13)

where

w_{\ddot{c}}

and

w_{u}

are the process noise on the CoM acceleration and the force accelerations, respectively, and:

\begin{matrix} I^{k} & = [\begin{matrix} I_{3} \dots I_{3} \end{matrix}] \\ {(p - c)}_{\times} & = [\begin{matrix} {(p_{0} - c)}_{\times} \dots {(p_{k - 1} - c)}_{\times} \end{matrix}] \end{matrix}

(A14)

The system dynamics is linear, except for the angular momentum. The choice of modeling a noise on the CoM acceleration is motivated by the fact that the robot might get pushed, so we need to account for disturbances acting directly at the CoM acceleration level. The measurement model is:

s_{m e a s} = [\begin{matrix} I & 0 & 0 & 0 & 0 \\ 0 & I & 0 & 0 & 0 \\ 0 & 0 & I & 0 & 0 \\ 0 & 0 & 0 & I & 0 \end{matrix}] [\begin{matrix} c \\ \dot{c} \\ l \\ f \\ \dot{f} \end{matrix}] + [\begin{matrix} w_{c} \\ w_{\dot{c}} \\ w_{l} \\ w_{f} \end{matrix}]

(A15)

Of course c,

\dot{c}

and l are not directly measured, but they are computed from the encoder measurements and the floating-base state estimation. The estimation of the floating-base state typically relies on the IMU measurements and the kinematics of the limbs in contact [24]. Once we have an estimate of the state s, we can easily compute the quantities

\ddot{c}

,

\overset{⃛}{c}

,

\dot{l}

, and

\ddot{l}

, which are needed by our controller. Please note that assuming the contact damping B is sufficiently small, the contact point positions needed in (A13) can be directly computed from the contact force measurements as:

p = p_{0} - K^{- 1} f

(A16)

Appendix C. Order-of-Magnitude Analysis on Importance of Stiffness vs. Damping

Let us assume that the desired CoM trajectory

c^{*} (t)

is a sinusoid with frequency

\frac{ω_{c}}{2 π}

Hz and amplitude

ψ

.

c^{*} (t) = ψ sin (ω_{c} t)

(A17)

Since the CoM acceleration

\ddot{c}

is an affine function of the contact forces f, their time derivatives are also linearly related.

\begin{matrix} {\dot{f}}^{*} (t) \propto {\overset{⃛}{c}}^{*} (t) & = - ψ ω_{c}^{3} cos (ω_{c} t) \\ {\ddot{f}}^{*} (t) \propto {\overset{⃜}{c}}^{*} (t) & = ψ ω_{c}^{4} sin (ω_{c} t) \end{matrix}

(A18)

Thus, relying on (3) to control

\dot{f}

we would get:

{\ddot{p}}^{*} (t) \propto B^{- 1} {\dot{f}}^{*} (t) \propto B^{- 1} ψ ω_{c}^{3}

(A19)

If we relied instead on (4) (i.e., neglecting the contact damping) we would get:

{\ddot{p}}^{*} (t) \propto K^{- 1} {\ddot{f}}^{*} (t) \propto K^{- 1} ψ ω_{c}^{4}

(A20)

The latter approach results in smaller values of

{\ddot{p}}^{*}

if this condition is satisfied:

\begin{matrix} K^{- 1} ψ ω_{c}^{4} & < B^{- 1} ψ ω_{c}^{3} \\ ζ & < \frac{\sqrt{K}}{2 ω_{c}}, \end{matrix}

(A21)

where the damping ratio

ζ

is defined as the ratio between B and the critical damping (i.e.,

B ≜ ζ 2 \sqrt{K}

). Since typically

K > 10^{4}

, while

ζ < 1

and

ω_{c} < 30

, this condition is usually satisfied. For this reason, we prefer to rely on (4) rather than on (3) for our controller design.

References

Wieber, P.B.; Tedrake, R.; Kuindersma, S. Modeling and Control of Legged Robots. In Handbook of Robotics, 2nd ed.; Siciliano, B., Oussama, K., Eds.; Springer: Berlin/Heidelberg, Germany, 2015; Chapter 48. [Google Scholar]
Boaventura, T.; Semini, C.; Buchli, J.; Frigerio, M.; Focchi, M.; Caldwell, D.G. Dynamic torque control of a hydraulic quadruped robot. In Proceedings of the 2012 IEEE International Conference on Robotics and Automation, Saint Paul, MI, USA, 14–18 May 2012; pp. 1889–1894. [Google Scholar]
Englsberger, J.; Ott, C.; Albu-Schäffer, A. Three-Dimensional Bipedal Walking Control Based on Divergent Component of Motion. IEEE Trans. Robot. 2015, 31, 355–368. [Google Scholar] [CrossRef]
Herzog, A.; Rotella, N.; Mason, S.; Grimminger, F.; Schaal, S.; Righetti, L. Momentum control with hierarchical inverse dynamics on a torque-controlled humanoid. Auton. Robot. 2016, 40, 473–491. [Google Scholar] [CrossRef] [Green Version]
Lim, H.O.; Setiawan, S.A.; Takanishi, A. Balance and impedance control for biped humanoid robot locomotion. In Proceedings of the IEEE International Conference on Intelligent Robots and Systems, Maui, HI, USA, 29 October–3 November 2001; Volume 1, pp. 494–499. [Google Scholar]
Nava, G.; Romano, F.; Nori, F.; Pucci, D. Stability Analysis and Design of Momentum-based Controllers for Humanoid Robots. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Deajeon, Korea, 9–14 October 2016. [Google Scholar]
Takenaka, T.; Matsumoto, T.; Yoshiike, T.; Hasegawa, T.; Shirokura, S.; Kaneko, H.; Orita, A. Real time motion generation and control for biped robot-4 th report: Integrated balance control. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, St. Louis, MO, USA, 11–15 October 2009; pp. 1601–1608. [Google Scholar]
Kajita, S.; Morisawa, M.; Miura, K.; Nakaoka, S.; Harada, K.; Kaneko, K.; Kanehiro, F.; Yokoi, K. Biped walking stabilization based on linear inverted pendulum tracking. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Taipei, Taiwan, 18–22 October 2010; pp. 4489–4496. [Google Scholar]
Li, Z.; Zhou, C.; Zhu, Q.; Xiong, R. Humanoid Balancing Behavior Featured by Underactuated Foot Motion. IEEE Trans. Robot. 2017, 33, 298–312. [Google Scholar] [CrossRef]
Reher, J.; Cousineau, E.A.; Hereid, A.; Hubicki, C.M.; Ames, A.D. Realizing dynamic and efficient bipedal locomotion on the humanoid robot DURUS. In Proceedings of the IEEE International Conference on Robotics and Automation, Stockholm, Sweden, 16–21 May 2016; pp. 1794–1801. [Google Scholar]
Henze, B.; Roa, M.A.; Ott, C. Passivity-based whole-body balancing for torque-conrolled humanoid robots in multi-contact scenarios. Int. J. Robot. Res. 2016, 35, 1522–1543. [Google Scholar] [CrossRef]
Azad, M.; Mistry, M.N. Balance control strategy for legged robots with compliant contacts. In Proceedings of the IEEE International Conference on Robotics and Automation, Seattle, WA, USA, 26–30 May 2015; pp. 4391–4396. [Google Scholar]
Fahmi, S.; Mastalli, C.; Focchi, M.; Semini, C. Passive Whole-Body Control for Quadruped Robots: Experimental Validation over Challenging Terrain. IEEE Robot. Autom. Lett. 2019, 4, 2553–2560. [Google Scholar] [CrossRef] [Green Version]
Fahmi, S.; Focchi, M.; Radulescu, A.; Fink, G.; Barasuol, V.; Semini, C. STANCE: Locomotion Adaptation over Soft Terrain. IEEE Trans. Robot. 2020, 36, 443–457. [Google Scholar] [CrossRef]
Orin, D.E.; Goswami, A.; Lee, S.H. Centroidal dynamics of a humanoid robot. Auton. Robot. 2013, 35, 161–176. [Google Scholar] [CrossRef]
Hirai, K.; Hirose, M.; Haikawa, Y.; Takenaka, T. The development of Honda humanoid robot. In Proceedings of the IEEE International Conference on Robotics and Automation, Leuven, Belgium, 16–20 May 1998. [Google Scholar]
Caron, S.; Kheddar, A.; Tempier, O. Stair Climbing Stabilization of the HRP-4 Humanoid Robot using Whole-body Admittance Control. arXiv 2018, arXiv:1809.07073. [Google Scholar]
Saccon, A.; Traversaro, S.; Nori, F.; Nijmeijer, H. On Centroidal Dynamics and Integrability of Average Angular Velocity. IEEE Robot. Autom. Lett. 2017, 2, 943–950. [Google Scholar] [CrossRef] [Green Version]
Del Prete, A. Joint Position and Velocity Bounds in Discrete-Time Acceleration/ Torque Control of Robot Manipulators. IEEE Robot. Autom. Lett. 2018, 3, 281–288. [Google Scholar] [CrossRef] [Green Version]
Kaneko, K.; Kanehiro, F. Design of prototype humanoid robotics platform for HRP. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Lausanne, Switzerland, 30 September–4 October 2002. [Google Scholar]
Dallali, H.; Medrano-Cerda, G.A.; Focchi, M.; Boaventura, T.; Frigerio, M.; Semini, C.; Buchli, J.; Caldwell, D.G. On the use of positive feedback for improved torque control. Control. Theory Technol. 2015, 13, 266–285. [Google Scholar] [CrossRef]
Paine, N.; Oh, S.; Sentis, L. Design and Control Considerations for High-Performance Series Elastic Actuators. IEEE/ASME Trans. Mechatron. 2014, 19, 1080–1091. [Google Scholar] [CrossRef] [Green Version]
Rotella, N.; Herzog, A.; Schaal, S.; Righetti, L. Humanoid momentum estimation using sensed contact wrenches. In Proceedings of the IEEE-RAS International Conference on Humanoid Robots, Seoul, Korea, 3–5 November 2015; pp. 556–563. [Google Scholar]
Flayols, T.; Del Prete, A.; Wensing, P.; Mifsud, A.; Benallegue, M.; Stasse, O. Experimental Evaluation of Simple Estimators for Humanoid Robots. In Proceedings of the IEEE International Conference on Humanoid Robots (Humanoids), Birmingham, UK, 15–17 November 2017; pp. 889–895. [Google Scholar]

Figure 1. 2d biped robot used for simulations.

Figure 2. Test A: performance of TSID-Rigid in noiseless simulation.

Figure 3. Test B: CoM trajectory obtained with TSID-Flex-K, and soft contacts.

Figure 4. Test B: CoM trajectory obtained with TSID-Adm, and medium contacts.

Figure 5. Test C: CoM trajectory obtained with Adm-Ctrl in noisy simulations.

Figure 6. Summary of all results. Pareto-optimal tests are depicted in red, while the others in blue.

Figure 7. TSID-Adm, soft contacts (

α = 0.01

), noiseless simulation and low feedback gains (

w_{u} = 10^{- 6}

). CoM tracking is good despite the significant delay in force tracking.

Figure 7. TSID-Adm, soft contacts (

α = 0.01

), noiseless simulation and low feedback gains (

w_{u} = 10^{- 6}

). CoM tracking is good despite the significant delay in force tracking.

Figure 8. Results for soft contacts (

α = 0.01

) and noisy simulation, but without Coulomb friction. Coulomb friction has a detrimental effect to the performance of TSID-Flex-K and TSID-Adm, while it helps stabilizing TSID-Rigid.

Figure 8. Results for soft contacts (

α = 0.01

) and noisy simulation, but without Coulomb friction. Coulomb friction has a detrimental effect to the performance of TSID-Flex-K and TSID-Adm, while it helps stabilizing TSID-Rigid.

Figure 9. Comparison between contact forces given by TSID-Rigid and TSID-Adm for medium contacts, noisy simulations.

Figure 10. Contact force generated by TSID-Rigid with stiff contacts, noiseless simulation.

Figure 11. Stiff contacts, noisy simulation.

Table 1. Simulation parameters.

Symbol	Meaning	Value
$δ t_{s i m}$	Simulation time step	0.0001 s
$δ t_{c t r l}$	Control time step	0.001 s
$μ$	Force friction coefficient	0.3
K	Contact stiffness	$α$ 200,000
$ζ$	Contact damping ratio	0.3
T	Simulation time	6 s

Table 2. Controller Parameters.

diag (M_{0})

is the diagonal part of the mass matrix evaluated at

q (0)

.

Table 2. Controller Parameters.

diag (M_{0})

is the diagonal part of the mass matrix evaluated at

q (0)

.

Symbol	Controller	Meaning	Value
$w_{p o s t}$	TSID-Flex-K	Postural task weight	0.3
$w_{p o s t}$	TSID-Adm	Postural task weight	1 × 10⁻³
$w_{p o s t}$	Adm-Ctrl	Postural task weight	1 × 10⁻³
$w_{p o s t}$	TSID-Rigid	Postural task weight	1 × 10⁻²
$w_{f}$	TSID-Rigid	Force regularization weight	1 × 10⁻⁴
$K_{p}^{m}$	Adm-Ctrl	Proportional momentum gain	30.7
$K_{d}^{m}$	Adm-Ctrl	Derivative momentum gain	10.3
$K_{p}^{j}$	Adm-Ctrl	Proportional joint position gain	$10^{4}$ $diag (M_{0})$
$K_{d}^{j}$	Adm-Ctrl	Derivative joint position gain	200 $diag (M_{0})$
$K_{f}$	Adm-Ctrl	Proportional force gain	8 × 10⁻³
$K_{p}^{p o s t}$	All	Proportional posture gain	10
$K_{d}^{p o s t}$	All	Derivative posture gain	6

Table 3. Sensor noise standard deviations (

σ

) and quantization errors (

δ

).

Table 3. Sensor noise standard deviations (

σ

) and quantization errors (

δ

).

Symbol	Meaning	Value
$σ_{f y}$	Force sensor (Y axis)	1.3 × 10⁻²
$σ_{f z}$	Force sensor (Z axis)	1.0
$σ_{ω}$	Gyroscope	6.4 × 10⁻³
$δ_{f y}$	Force sensor (Y axis)	1.8 × 10⁻²
$δ_{f z}$	Force sensor (Z axis)	7.3 × 10⁻²
$δ_{ω}$	Gyroscope	1.0 × 10⁻³
$δ_{q}$	Encoders	8.2 × 10⁻⁵

Table 4. Extended Kalman Filter noise standard deviations.

Symbol	Meaning	Value
$σ_{c}$	CoM position measurement	1 × 10⁻³
$σ_{\dot{c}}$	CoM velocity measurement	1 × 10⁻²
$σ_{l}$	Angular momentum measurement	0.1
$σ_{f}$	Force measurement	1
$σ_{u}$	Control	1 × 10⁴
$σ_{\ddot{c}}$	CoM acceleration disturbance	10

Table 5. Summary of main features of each controller.

PROS	CONS
TSID-Flex-K and TSID-Adm
• Easy to tune (unified pos-force feedback).	• Need high frequency for hard contacts.
• Best for soft/medium contact.	• Unstable for hard contact with noise.
TSID-Rigid
• Ok for hard/medium contacts.	• Unstable for soft contacts.
• Easy to tune (assuming perfect joint torque tracking).	• Undesired oscillations (no force feedback).
Adm-Ctrl
• Always stable.	• Hard to tune/analyze.
• Good for soft contacts.	• Never the best.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Flayols, T.; Del Prete, A.; Khadiv, M.; Mansard, N.; Righetti, L. Reactive Balance Control for Legged Robots under Visco-Elastic Contacts. Appl. Sci. 2021, 11, 353. https://doi.org/10.3390/app11010353

AMA Style

Flayols T, Del Prete A, Khadiv M, Mansard N, Righetti L. Reactive Balance Control for Legged Robots under Visco-Elastic Contacts. Applied Sciences. 2021; 11(1):353. https://doi.org/10.3390/app11010353

Chicago/Turabian Style

Flayols, Thomas, Andrea Del Prete, Majid Khadiv, Nicolas Mansard, and Ludovic Righetti. 2021. "Reactive Balance Control for Legged Robots under Visco-Elastic Contacts" Applied Sciences 11, no. 1: 353. https://doi.org/10.3390/app11010353

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Reactive Balance Control for Legged Robots under Visco-Elastic Contacts

Abstract

1. Introduction

1.1. Problem Overview

1.2. State of the Art

1.3. Contributions

1.4. Paper Structure

2. Background: Dynamics and Contact Model

2.1. Visco-Elastic Contact Model

2.2. Importance of Stiffness vs. Damping

2.3. Centroidal Dynamics

2.4. Whole-Body Dynamics

3. State of the Art

3.1. Inverse Dynamics with Rigid Contacts (TSID-Rigid)

3.2. Admittance Control (Adm-Ctrl)

4. Inverse Dynamics Admittance Control (TSID-Adm)

5. Flexible TSID (TSID-Flex-K)

5.1. Feedback Linearization

5.2. Accounting for Force Variations during the Time Step

5.3. Linear Feedback Regulator

5.4. Friction Force Constraints

5.5. Summary

6. Results

6.1. Simulation Environment

6.2. Gain Tuning

6.3. Test Description

6.4. Test A: TSID-Rigid

6.5. Test B: TSID-Flex-K and TSID-Adm

6.6. Test C: Adm-Ctrl

6.7. Test D: All Controllers

6.7.1. Soft Contacts

6.7.2. Medium Contacts

6.7.3. Stiff Contacts

7. Conclusions

Author Contributions

Funding

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

Appendix A. Gain Tuning

Appendix A.1. TSID-Flex-K

Appendix A.2. Admittance Control

Appendix A.3. TSID-Admittance

Appendix A.4. TSID-Rigid

Appendix A.5. Cost Function

Appendix B. Estimation

Appendix C. Order-of-Magnitude Analysis on Importance of Stiffness vs. Damping

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI