**1. Introduction**

The main objective of bio-mimetics is to find a practical solution of human needs imitating models or movements of animals or even plants. One of its main applications can be found in the field of robotics [1]. The development of bio-inspired robotic systems involves the adaptation of different modes of locomotion like the running inspired in leopards [2], swimming inspired in fishes [3], climbing like gecko robots [4], or crawling by worms [5,6], among others. In the case of worm bio-inspired robots, the movement of the so-called *inchworm* has interesting applications exploring narrow places in contrast to mobile robots [7]. The inchworm moves with a looping movement in which the anterior and posterior legs are alternately made fast and released. The alternation of fastening enables a propelling motion [8]. These bio-inspired robots can be applied in medical applications like colonoscopies [9], in the inspection of narrow pipes [7] and robotic manipulators [10]. There exist diverse configurations of inchworm robots from two DOF (Degrees of freedom) to five. Two DOF

in-pipe robots such as [7] reproduces contraction and expansion of inchworm's gait cycle using two sets of magnetic clamps switching an electro-valve: rear clamp grasps the pipe firmly while the front clamp slides forward gaining traction in the process. Similarly, in [11], a system of two mass with a spring that contracts/expands by its anisotropic skin is described. The inching mechanism was proposed also in [12] for planetary surface exploration vehicles (rovers) to overcome the limitations of traditional rolling mobility. The vehicle wheel bases were expanded and contracted to achieve an increase of net traction potential. In addition, in [13], a three-module bore robot was constructed to carry out investigations on planetary subsurfaces such as geothermal gradient, chemical composition and analysis of regolith. *Climbot* is a tele-operated five DOF (Degrees) robot able to climb a variety of media and grasp objects [14].

One interesting problem to solve is the tracking trajectory problem in these kinds of robots [15]. The complex structures that emulate the displacement of an inchworm bio-inspired robot require robust techniques to cope with parametric uncertainties, no modeled dynamics, and noisy measurements. Classical PID controllers, sliding modes, and fuzzy logic controllers have been applied without considering the hybrid behavior of the gait cycle of an inchworm represented by a multi-link robot manipulator [16]. Modeling an inchworm robot that alternates the grasping between its anterior and posterior legs implies a switching structure that should be studied under the concept of hybrid systems.

The hybrid framework allows for studying more complex dynamics and allows more flexibility in modeling dynamic phenomena [17]. A hybrid system is composed of two or more sets of differential equations describing a particular stage of behavior in dynamic systems. In the case of the WBRD, two robot manipulators with five DOF represent its gait cycle. In order to deal with non-modeled dynamics and parameter uncertainties, an active disturbance rejection (H-ADRC) approach can be considered. ADRC is a technique centered on providing an effective estimation of unknown nonlinearities by means of algebraic techniques [18]. The main concepts considered in this control designs are: (1) simplify the plant description so as to group all disturbances and uncertainties, as well as all unknown or ignored quantities and expressions into a single disturbance term, (2) proceed to estimate the effects of this disturbance, in some accurate manner, and (3) devise the means to cancel its effects, using a feasible gathered estimate as part of the feedback control action. One way to fulfill this task is to perform a polynomial expansion and translate it into the state space as the output of an extended state observer [19].

The control algorithm has to be able not only to force the WBRD to reach a desired trajectory; it needs to take into account the problem of finite-time convergence and states constraints to avoid any damage of the mechanical structure. A classical tool to deal with state constraints is the concept of Barrier Lyapunov functions (BLF) that is a function that tends to infinite as the argumen<sup>t</sup> approach to a boundary. BLF has been applied to control nonlinear systems and linear perturbed systems [20].

This manuscript proposes a novel adaptive algorithm to deal with the trajectory tracking problem of nonlinear hybrid systems with state constraints. The proposed algorithm is applied in the WBRD with five DOF represented by a hybrid structure. The main contributions of this study are:


This manuscript is organized in the following manner. Section 2 provides a general overview of the WBRD design as well as the links–joints configuration. Section 3 introduces the control design problem statement considering the hybrid nature of the gait cycle realization by the WBRD. Section 4 provides the formulation of the WBRD realization in terms of the hybrid systems framework. The next Section 5 details all the elements of the output feedback controller to solve the gait cycle of the WBRD. Section 6 describes some aspects regarding the implementation (numerical and experimental) of the output feedback controller. Section 7 provides the evidence of the controller numerical implementation over a virtualized representation of the WBRD. Section 8 demonstrates the application of the suggested controller on a developed WBRD using the tri-dimensional (3D) printing technique. Finally, Section 9 closes the paper with some final remarks.

#### **2. Worm Bio-inspired Robotic Device**

The proposed WBRD structure satisfies a class of multi-articulated manipulator with 5 DOF. The WBRD displacement is realized by the switched fixation of the non-inertial frames (1st and 5th) to the supporting surface (Figure 1). Considering that WBRD moves following a path tracking based on sequenced steps, the odd steps occur with the 1st frame as the reference and the even steps happened considering the 5th frame as reference. The sequence formed by odd-even steps defines a gait cycle of the WBRD. As one may notice, the change of the reference frame justifies the use of switched systems theory to develop the output feedback controller to regulate the WBRD mobilization. This can be noticed with the alternated reference frame marked with black squares at the bottom of Figure 1.

**Figure 1.** Distribution of the WBRD's degrees of freedom dependent on the activated vacuum pump.

The multi-articulated manipulator is formed by five solid links *ls*,*i* connected with rotational joints characterized with angular displacement defined by *<sup>θ</sup>s*(*t*),*i*. The variable *s* can be 1 if the step is odd and 2 if the step is even. This variable is playing the role of the switching sequence usually considered in switching systems analysis. The switching action is performed by a set of vacuum pumps that emulates the front and rear legs subjection to the floor (see Figure 2). Each link is conformed by a direct current (DC) motor for actuation, and a set of mechanical elements to transmit the movements. To obtain a feedback of the actual position of each link, a set of five markers were placed in the robot. A vision analysis system obtained the corresponding absolute angles of each link. These measurements were the data input into the control algorithm.

**Figure 2.** Elements of each link in the WBRD.

Table 1 describes the dimensions of the angles.

**Table 1.** Angular range of the WBRD.


Each link was designed with a scale of 50 : 1 yielding a total length of 71.6 cm in the zero position and a total height of 7.2 cm including the pumps (Figure 3).

**Figure 3.** Dimensions of the WBRD.

#### **3. Problem Definition**

The WBRD displacement is represented as an alternated extension–contraction sequences (Figure 4). This simplified representation of the WBRD displacement can be described as a hybrid device alternating the movement of two multi-articulated (5 DOF) manipulators.

**Figure 4.** Alternative representation of a gait cycle for the WBRD; (**a**) The rear pump is *on* and a first robot manipulator structure is adopted; (**b**) The reference trajectories force the robot to expand almost to reach a 0 degrees configuration; (**c**) The pumps switch and the second robot manipulator configuration is adopted, the reference trajectories for the movement of the robot until a desired position; (**d**) A second switching in the gait cycle is performed to complete the walking path emulating the real inchworm.

The fixation of the reference frame in the suggested alternate way modifies the description of the WBRD. Indeed, this variation of the reference frame forces two distinct dynamic representations for the WBRD. Such condition provides a challenging scenario for developing automatic controllers which can ensure the tracking of reference trajectories that correspond to a bio-inspired gait cycle. This section aims to formulate the controller design problem within the hybrid systems' framework.

Let us consider the vector of angular displacements within a fixed part (*s*(*t*) equal either *a* or *b* of the gait cycle *<sup>θ</sup>s*(*t*) = |*<sup>θ</sup>s*(*t*),*<sup>i</sup>*|*<sup>i</sup>*=1,...,5. Now, assume that, during the given part of the gait cycle, the angular displacements must track the corresponding reference angles *θ*∗*s*(*t*) = |*θ*<sup>∗</sup>*s*(*t*),*<sup>i</sup>*|*<sup>i</sup>*=1,...,5. Then, enforcing the gait cycle for the WBRD can be represented as an stabilization problem for the tracking error Δ ∈ R5, defined as <sup>Δ</sup>*s*(*t*) = *<sup>θ</sup>s*(*t*) − *θ*∗*s*(*t*)within each continuous domain of *<sup>s</sup>*(*t*).

This problem statement obligates to consider the WBRD dynamics changes only if the vector of tracking errors for all the articulations has attained a sufficiently small value (defined by the user), namely *SW*∗ > 0. Therefore, the triggering signal which enforces the dynamics changing can be obtained by measuring the norm of the tracking error within each domain of *<sup>s</sup>*(*t*). Once <sup>Δ</sup>*s*(*t*) ≤ *SW*<sup>∗</sup>, then *s*(*t*) changes from *a* to *b* or vice versa.

The problem statement concept given above enforces the fact that the vector of the angular displacements *θ<sup>s</sup>*,*<sup>i</sup>* at the WBRD must track the desired reference angles *θ*∗*s*,*i* ensuring the tracking errors of all articulations enter the region characterized by <sup>Δ</sup>*s*(*t*) ≤ *SW*∗ at some given finite moment *Tc*

which must be bounded ( *Ts* > 0). Such tracking problem can be described as designing the hybrid controller *us*(*t*) such that

$$\left\|\Delta\_{\mathbf{s}\left(t\right)}\left(t\right)\right\| \leq \mathcal{S} \mathcal{W}^\* \qquad \forall t \in \left[T\_{\mathbf{c}\prime}, T\_{\mathbf{s}}\right] \tag{1}$$

The maximum allowed switching time is introduced here in order to have a tracking trajectory independent safety condition that can turn off the WBRD if the switching condition is not attained in a reasonable period. Notice that the switching condition introduces a class of a non-constant sampling discrete state which depends on the accomplishment of the condition provided in (1).

This study assumes that only *<sup>θ</sup>s*(*t*) is continuously locally measurable all the time. However, ˙ *<sup>θ</sup>s*(*t*) is not available. Therefore, the control design considers an output feedback realization.

#### **4. Hybrid Formulation of the Worm Walking Cycle**

The changing dynamics of WBRD can be characterized using a combination of continuous and discrete states. Such representation agrees with the fundamentals of hybrid systems [21]. This formulation to describe the gait evolution of the proposed WBRD is enforced because there is not a strict periodicity which may define the transition between the gait domains (a or b) that is from continuous to continuous (a to b or vice versa) dynamics passing through the discrete state domain.

If the WBRD exerts a regular mono-directional walking gait, the transitions between the continuous stages follows an ordered sequence (a -> b -> a ->...). This sequenced dynamical behavior justifies the application of a class of multi-domain hybrid systems framework considering a predefined order of phases (or domains). Such representation leads to defining a so-called coherent cycle.

Formally, a multi-domain hybrid control system can be described considering a tuple [22,23] HD = (<sup>Γ</sup>, *D*, *U*, *S*, Δ, *FG*), where Γ = ( *V*; *E*) describes the sequenced cycle of transitions. Consequently, *v* ∈ *V* defines a transition vertex connection the continuous domains, *v*<sup>+</sup> represents the subsequent vertex of *v* during the gait cycle, while *e* = {*v* → *v*<sup>+</sup>} corresponds to the transition from the analyzed vertex *v* to *<sup>v</sup>*+.

**Continuous Dynamical Representations**. Considering the links masses, their inertia as well as their lengths properties of the WBRD, the equation of motion (EOM) that can be used within a given continuous domain *Dv* can be determined by the Euler–Lagrange equations (considering that, within a given domain, the WBRD obeys a manipulator representation) [24]. Therefore, assuming that *xa* = *<sup>θ</sup>s*(*t*) in each fixed domain, the dynamics of the WBRD corresponds to:

$$
\dot{\mathbf{x}}\_a(t) = \mathbf{x}\_b(t)
$$

$$
\dot{\mathbf{x}}\_b(t) = f\left(\mathbf{x}\_a(t), \mathbf{x}\_b(t)\right) + g\left(\mathbf{x}\_a(t)\right)u(t) + D^{-1}\left(\mathbf{x}\_a(t)\right)f\_v^\top\left(\mathbf{x}\_a(t)\right)F\_v\left(\mathbf{x}\_a(t), \mathbf{x}\_b(t)\right) + \tag{2}
$$

$$
\mathbf{f}\left(\mathbf{x}\_b(t), \mathbf{x}\_b(t), t\right)
$$

Here, *xa* ∈ **X***a* ⊂ R5, *xb* ∈ **X***b* ⊂ R<sup>5</sup> and *u* ∈ R*n* are the vectors of angular displacements, angular velocities, and the applied torques (operating as the control actuators) respectively for the WBRD.

The drift vector field *f* : **X***a* × **X***b* → R<sup>5</sup> corresponds to *f* (*xa*, *xb*) = − *D*−<sup>1</sup> (*xa*) [ *C* (*xa*, *xb*) *xb* + *<sup>G</sup>*(*xa*)], the state dependent matrix *D* : **X***a* → R5×<sup>5</sup> defines the inertia of the WBRD, the matrix *C* : R<sup>5</sup> × R<sup>5</sup> → R5×<sup>5</sup> defines the Coriolis effects while *G* : R<sup>5</sup> → R<sup>5</sup> defines the effect of gravitational force over the WBRD dynamics. The vector function *g* : R<sup>5</sup> → R5×<sup>5</sup> characterizes the control action effect over the WBRD dynamics with *g* = *D*−1.

The uncertain section of the model is gathered in *ξ* : **X***a* × R*nv* × R → R<sup>5</sup> which characterizes the presence of external perturbations and internal modeling uncertainties in the WBRD. Usually, this term aggregates nonlinear behavior such as joint frictions, backslash, and some other elements that are usually complex for modeling.

The function *Fv* : *TQ* × *Uv* → R*nv* (*nv* is the number of total holonomic restrictions) represents the contact wrenches containing the constraint forces and/or moments. Here, *TQ* represents the characteristic states occurring during the contact wrenches, and *Uv* is the corresponding set of control actions which leads to the contact wrenches *uv*, which are relevant during the WBRD transition from continuous to continuous domains (floor contact). To enforce the velocity independent (holonomic) constraints, the second order differentiation of the constraints should be set to zero; that is,

$$J\_v(\mathbf{x}\_a)\dot{\mathbf{x}}\_b + J\_v\left(\mathbf{x}\_a, \mathbf{x}\_b\right)\mathbf{x}\_b = \mathbf{0} \tag{3}$$

The constrained dynamics of the system must be determined using the trajectories of (2) together with (3).

**Holonomic Constraints**. Given that the WBRD model with coordinates *xa* ∈ *Q*, *Q* ∈ R<sup>5</sup> is the configuration space, the complete dynamics within a domain depends simultaneously on the Lagrangian as well as the contact constraints. All potential contacts of the WBRD with the floor (if not physical obstacles are considered) forces a holonomic constraint, *ηc*(*xa*). Considering that *Cv* is an indexing set of the possible holonomic constraints defined on *Dv*, then the holonomic constraints of the domain corresponds to *ηc*(*xa*) = {*ηc*(*xa*)}*c*<sup>∈</sup>*Cv* constant while the corresponding kinematic constraints *∂ηc*(*xa*)

corresponds to *Jv*(*xa*)*q*˙ = 0, *Jv*(*q*) is the Jacobian of *ηc*(*xa*), i.e., *Jv*(*xa*) = *∂xa* .

The nature of the WBRD justifies that all the states (angular displacements and velocities) are uniformly bounded in time. Therefore, the state *x* = - *x a* , *x b* is included in the set *X*<sup>+</sup> = *X*<sup>+</sup> *a* ∪ *X*<sup>+</sup> *b* defined as: 

$$\begin{aligned} X\_a^+ &= \left\{ \mathbf{x}\_a \mid -\infty < \mathbf{x}\_{a,i}^- \le \mathbf{x}\_{a,i} \le \mathbf{x}\_{a,i}^+ < +\infty, \\ X\_b^+ &= \left\{ \mathbf{x}\_b \mid -\infty < \mathbf{x}\_{b,i}^- \le \mathbf{x}\_{b,i} \le \mathbf{x}\_{b,i}^+ < +\infty, \end{aligned} \right\} \tag{4}$$

with *xi* the *i*-th component of *x*, and the corresponding limits *sup t*≥0 (*xi*) + = *x*<sup>+</sup> *i* and *inf t*≥0 (*xi*) − = *x*<sup>−</sup> *i* with a small constant real scalar and *xi* is either *xi*,*<sup>a</sup>* or *xi*,*b*. Indeed, the set *X*<sup>+</sup> defines the holonomic restrictions for the WDRD structure.

**Domains and Guards**. A limited number of forces/moments appears if the holonomic constraints are active. These conditions can be represented in the form of component-wise inequalities *<sup>ν</sup>vFv*(*xa*, *xb*, *uv*) ≥ 0, *νv* is the function containing the parameters of the WBRD. The unilateral constraints in the set *Uv* complete the admissible configurations for the WBRD via the domain of admissibility *Dv*:

$$D\_{\mathcal{V}} = \left\{ (\mathbf{x}\_{a}, \mathbf{x}\_{b}, \boldsymbol{\mu}\_{\mathcal{V}}) \in T\mathbb{Q} \times \mathcal{U}\_{\mathcal{V}} \mid A\_{\mathcal{V}}(\mathbf{x}\_{a}, \mathbf{x}\_{b}, \boldsymbol{\mu}\_{\mathcal{V}}) \ge \mathbf{0} \right\} \tag{5}$$

for *v* ∈ *V*. The boundary of each sub-domain are characterized with

$$A\_{\upsilon}(\mathbf{x}\_{a}, \mathbf{x}\_{b}, \boldsymbol{\mu}\_{\upsilon}) = \begin{bmatrix} \nu\_{\upsilon} F\_{\upsilon}(\mathbf{x}\_{a}, \mathbf{x}\_{b}, \boldsymbol{\mu}\_{\upsilon})\\\\ h\_{\upsilon}(\mathbf{x}\_{a}) \end{bmatrix} \geq 0 \tag{6}$$

A state guard *Se* corresponds to a proper subset of the domain *Dv* boundary, which is determined by an edge condition connected to the transition from *Dv* to the subsequent domain, *D*<sup>+</sup> *v* . Let us define *He*(*xa*; *xb*; *uv*) as an appropriate set of elements taken from (6) which characterize the edge condition. Using such elements, the guard can be characterized as

$$S\_{\varepsilon} = \left\{ (\mathbf{x}\_{a}, \mathbf{x}\_{b}, \boldsymbol{\mu}\_{v}) \in T\mathbb{Q} \times \mathcal{U}\_{v} \mid H\_{\varepsilon}(\mathbf{x}\_{a}; \mathbf{x}\_{b}; \boldsymbol{\mu}\_{v}) = 0; \quad H\_{\varepsilon}(\mathbf{x}\_{a}; \mathbf{x}\_{b}; \boldsymbol{\mu}\_{v}) < 0 \right\} \tag{7}$$

**Discrete Dynamics**. Consider the guard *Se* as a reset map *Re* that connects the system states over the guard to the subsequent domain. Considering the pre-impact states (*x*<sup>−</sup> *a* ; *x*<sup>−</sup> *b* ) on *Se*, the post-impact states (*x*<sup>+</sup> *a* ; *x*<sup>+</sup> *b* ) of *D*<sup>+</sup> *v* are computed using a reset map *Re* by assuming the contact characterized by a perfectly plastic impact (if an impact occurs) [25]. Following the ideas in [26], the states configurations of the WBRD remain invariant during the impact, i.e., (*x*<sup>−</sup>*a* ; *<sup>x</sup>*+*a* ); however, post-impact velocities must satisfy the plastic impact equation:

$$
\begin{bmatrix} D(\mathbf{x}\_{a}^{-}) & -J\_{v^{+}}^{\top}(\mathbf{x}\_{a}^{-}) \\ J\_{v^{+}}^{\top}(\mathbf{x}\_{a}^{-}) & 0 \end{bmatrix} \cdot \begin{bmatrix} \mathbf{x}\_{b}^{+} \\ \delta F\_{v} \end{bmatrix} = \begin{bmatrix} D(\mathbf{x}\_{a}^{-})\mathbf{x}\_{b}^{-} \\ 0 \end{bmatrix} \tag{8}
$$

where *δ* defines the impulse function for the forces in the WBRD during the contact with the floor.

**Virtual Constraints**. Analogously to the described holonomic constraints, virtual constraints (recognized as the tracking errors in the control literature) correspond to the functions that modulates the dynamics the WBRD to track certain reference trajectories. The term *virtual* arises from the fact that such operative constraints must be enforced via a set of feedback (state or output) control instead of using forced physical restrictions. In equivalence to tracking errors, virtual constraints correspond to <sup>Δ</sup>*s*(*t*). In here, the desired trajectories are proposed accordingly to the technique proposed in [24], where a novel technique to design monotonic and differentiable trajectories over a gait cycle is precisely detailed [27].

Here, one may notice that the goal of the proposed controller is steering <sup>Δ</sup>*s*(*t*) to the origin if possible or at least to the zone (indeed, an invariant set) defined in (1) within each continuous domain. In this study, we avoid driving <sup>Δ</sup>*s*(*t*) to the invariant set tracking through discrete dynamics. Considering that WBRD must realize movements with the aim of attaining the next switching configurations, the sequence of desired movements *x*∗*a* ∈ R<sup>5</sup> and *x*∗*b* ∈ R<sup>5</sup> should be calculated considering the distance between objects, the stable configurations for the WBRD, and so on. Notice that the position of the *j*-th articulation *<sup>x</sup>*<sup>∗</sup>*a*,*<sup>j</sup>* is known in advance assuming that the desired velocity *<sup>x</sup>*<sup>∗</sup>*b*,*<sup>j</sup>* can be estimated by direct differentiation (notice that the design of reference trajectories provides differentiable with continuous derivative flows).

Once the conditions to describe the WBRD have been detailed, it is feasible to propose the controller that can steer the virtual constraints to the origin or at least to the invariant set in (1).

#### **5. Controller Design**

#### *5.1. Abstracted Representation of the WBRD*

The aim of this research work is developing an output feedback loop controller for a WBRD, which should take into account the hybrid nature of the gait cycle and the state restrictions which define the angular restrictions at each joint. The proposed controller considers then the joints restrictions formed during the standing stage of the WBRD. The dynamics of the WBRD (considering the hybrid nature) is described as follows:

$$\begin{aligned} \frac{d}{dt}\mathbf{x}\_{a}(t) &= \mathbf{x}\_{b,s(t)}(t) \\ \frac{d}{dt}\mathbf{x}\_{b}(t) = f\_{s(t)}\left(\mathbf{x}(t)\right) + \mathbf{g}\_{s(t)}\left(\mathbf{x}\_{d}(t)\right)u(t) + \boldsymbol{\xi}\_{s(t)}\left(\mathbf{x}(t), t\right) \end{aligned} \tag{9}$$

Here, *xa* ∈ R<sup>5</sup> is the vector of angular displacements of the joints considered in the WBRD. The vector *xb* ∈ R<sup>5</sup> is the vector of the angular velocities of all joints. The nature of WBRD structure enforces the existence of restrictions for all components in the state vector that is (4).

The function *fs*(*t*) : R<sup>10</sup> × R<sup>+</sup> → R<sup>5</sup> in (9) represents the drift term that corresponds to internal dynamics of the BIMR:

$$\begin{array}{c} \|f\_{s(t)}\left(\mathbf{x}^{1}\right) - f\_{s(t)}\left(\mathbf{x}^{2}\right)\| \leq L\_{f} \|\mathbf{x}^{1} - \mathbf{x}^{2}\|, \\\mathbf{x}^{1} \in \mathbb{R}^{10}, \mathbf{x}^{2} \in \mathbb{R}^{10}, L\_{f} \in \mathbb{R}^{+} \end{array} \tag{10}$$

The function *gs*(*t*) : R<sup>5</sup> → R<sup>5</sup> characterizes how the input function affects the robot dynamics. This function is invertible by the nature of the biped robot (formed as class of alternated robotic muti-articulated arm) and satisfies

$$0 < g^- \le |g\_{s(t)}\left(\mathbf{x}\_d\right)| \le g^+ < +\infty \qquad g^- \in \mathbb{R}^+, g^+ \in \mathbb{R}^+ \tag{11}$$

The bounded function *u* ∈ R<sup>5</sup> is referred to as the control function, which must take into account the hybrid nature of the WBRD dynamics. By assumption, all the admissible controls belong to the following so-called admissible set:

$$\mathcal{U}\_{adm} = \left\{ \mu : \left\| \left| \mu \right| \right\|^2 \le \mu\_0 + \mu\_1 \left\| \left| \mathbf{x} \right| \right\|^2, \mu\_0 \in \mathbb{R}^+, \mu\_1 \in \mathbb{R}^+ \right\} \tag{12}$$

The term *ξs*(*t*) : R<sup>10</sup> × R<sup>+</sup> → R<sup>5</sup> corresponds to admissible class of uncertainties and perturbations affecting the dynamics of WBRD. By assumption, the term *ξs*(*t*) satisfies the following restriction:

$$\|\zeta\_{s(t)}(\mathbf{x},t)\|^2 \le \mathfrak{Z}\mathbf{o} + \mathfrak{J}\_1 \|\mathbf{x}\|^2, \ \mathfrak{Z}\mathbf{o} \in \mathbb{R}^+, \mathfrak{J}\_1 \in \mathbb{R}^+ \tag{13}$$

#### *5.2. H-ADRC Design*

Considering the hybrid nature of the WBRD and the state restrictions, there are a few possible controllers that can be used. This study considers the application of a class of output feedback hybrid ADRC which can take into account the state constraints.

The design of the proposed H-ADRC considers the design of an approximation for the uncertain section of the WBRD which is valid within each continuous domain. In this study, let assume that the control free right-hand section of the WBRD dynamics (*Fs*(*t*) = *fs*(*t*) + *ξs*(*t*)) can be represented as the composition of a nominal model *f*0,*s*(*t*)(*x*) added with a modeling function ˜ *fs*(*t*)(*<sup>x</sup>*, *t*), which represents those dynamical behaviors that are not modeled, which is *Fs*(*t*)(*<sup>x</sup>*, *t*) = *f*0,*s*(*t*)(*x*) + ˜ *fs*(*t*)(*<sup>x</sup>*, *t*).

In this case, this uncertain section added to the external disturbances element can be represented as ˜ *fs*(*t*)(*<sup>x</sup>*, *t*) + *ξs*(*t*)(*x*(*t*), *t*), where ˜ *fs*(*t*)(*<sup>x</sup>*, *t*) represents the modeling error ˜ *fs*(*t*)(*<sup>x</sup>*, *t*) = *fs*(*t*)(*<sup>x</sup>*, *t*) − *f*0,*s*(*t*)(*x*) with *f*0,*s*(*t*) : R<sup>10</sup> → R<sup>5</sup> describing the nominal model of the WBRD that could be estimated by diverse methods in such a way that the Euler–Lagrange modeling technique is still applicable. In this study, the first option is considered. Consequently, consider the following necessary assumption which must be used in the design of the H-ADRC.

**Assumption 1:** There exists a matrix of constants for each continuous subsystem *as*(*t*) ∈ R(*p*+<sup>1</sup>)×<sup>5</sup> such that the function *Fs*(*t*) evaluated over the trajectories *x* = *x*(*t*) could be represented as *Fs*(*t*)(*<sup>x</sup>*, *t*) = *a s*(*t*)*<sup>κ</sup>*(*x*) + ¯ *fs*(*t*)(*<sup>x</sup>*, *t*).

In this study, the time-dependent vector *κ* ∈ R*p*+<sup>1</sup> (see [28,29] for further details) is

$$\kappa = [1, t, \dots, t^p] \tag{14}$$

The term ¯ *fs*(*t*)(*<sup>x</sup>*, *t*) is called the modeling error produced by the approximation of *Fs*(*t*)(*<sup>x</sup>*, *t*) by a finite number *p* of elements in the basis and admits the following bounds by assumption

$$\|\bar{f}\_{s(t)}(\mathbf{x},t)\| \le f\_0^+, \quad \forall t \ge 0 \tag{15}$$

The so-called nominal model for each continuous domain *a s*(*t*)*<sup>κ</sup>*(*t*) can be expressed as *as*(*t*)*<sup>κ</sup>*(*t*) = *<sup>a</sup>*0,*s*(*t*) + *<sup>a</sup>*1,*s*(*t*)*<sup>t</sup>* + *<sup>a</sup>*2,*s*(*t*)*t*<sup>2</sup> + ··· + *ap*,*<sup>s</sup>*(*t*)*tp* [30]. In this study, the function *as*(*t*)*<sup>κ</sup>*(*x*(*t*)) can be represented as a chain of integrators of some predefined constant matrices. Thus, the approximation presented above states that *Fs*(*t*)(*<sup>x</sup>*, *t*) must be the solution of an integration operation of an uncertain function plus the approximation error: that is,

$$\begin{aligned} a\_{s(t)}^\top \kappa(t) &= a\_{0,s(t)} + \int\_{\tau\_1=0}^t a\_{1,s(t)} d\tau\_1 + \int\_{\tau\_1=0}^t \int\_{\tau\_2=0}^{\tau\_1} 2a\_{2,s(t)} d\tau\_2 d\tau\_1 \\ &+ \dots + \int\_{\tau\_1=0}^t \dots \cdot \int\_{\tau\_p=0}^{\tau\_{p-1}} p! a\_{p,s(t)} d\tau\_p \dotsm d\tau\_1 \end{aligned} \tag{16}$$

Equation (16) can be reorganized in an equivalent differential form:

$$\begin{aligned} \boldsymbol{a}\_{s(t)}^\top \mathbf{x}(t) &= \rho\_{0,s(t)}(t) \; \rho\_{0,s(t)} = \boldsymbol{D}^\top \rho\_{s(t)}, \; \boldsymbol{D} = \begin{bmatrix} I\_5, \boldsymbol{0}, \cdots \; \boldsymbol{0} \end{bmatrix}^\top, \boldsymbol{D} \in \mathbb{R}^{5\*(p+1)} \\\\ \frac{d\rho\_{s(t)}(t)}{dt} &= \boldsymbol{\Phi} \rho\_{s(t)}(t), \; \boldsymbol{\Phi} = \begin{cases} \boldsymbol{1} & \text{if} \quad i = j - 1 \\\boldsymbol{0} & \text{if} \quad otherwise \end{cases} \end{aligned} \tag{17}$$

The vector of initial conditions for *ρs*(*t*) is *<sup>ρ</sup>s*(*t*)(0)=[*<sup>a</sup>*0,*s*(*t*), *<sup>a</sup>*1,*s*(*t*), *<sup>a</sup>*2,*s*(*t*), ..., *ap*,*<sup>s</sup>*(*t*)]. Now, the problem formulation given can be rephrased as follows: Given an output reference trajectory *x*<sup>∗</sup> for the system (9), let us design an output feedback controller that, regardless of the unknown non-modeled dynamics or external disturbances that forces the states *x* to track asymptotically the desired reference trajectories, with the tracking error restricted to a small neighborhood near the origin and proportional to a power of the uncertainties and perturbations. The first stage in solving this problem is designing an extended state observer to reconstruct the non-measurable part of the state.

#### *5.3. Closed-Loop Dynamics Based on the H-ADRC Structure and Extended State Observer*

Let us consider the reference trajectories *x*∗*a* and *x*∗*b*that are governed by

$$\begin{cases} \frac{d}{dt} \mathbf{x}\_a^\*(t) = \mathbf{x}\_{b,s(t)}^\*(t) \\ \frac{d}{dt} \mathbf{x}\_b^\*(t) = h\_{s(t)}^\*(t) \end{cases} \tag{18}$$

where *h*∗ *s*(*t*) : R<sup>+</sup> → R<sup>5</sup> is a continuous function with respect to time which can vary according to the active semi-cycle of the WBRD. The proposed reference trajectories satisfy the following bound for *x*<sup>∗</sup> = [(*x*<sup>∗</sup>*a* ),(*x*<sup>∗</sup>*<sup>b</sup>* )], *x*∗<sup>2</sup> ≤ *<sup>x</sup>*<sup>∗</sup>,+, *<sup>x</sup>*<sup>∗</sup>,<sup>+</sup> > 0.

Based on the approximation proposed for *Fs*(*t*)(*<sup>x</sup>*, *t*) and the reference trajectories given in (18), the dynamics of the tracking error Δ are given by

$$\frac{d}{dt}\Delta\_{\mathbf{z}}(t) = \Delta\_{b\boldsymbol{s}(t)}(t)$$

$$\frac{d}{dt}\Delta\_{\mathbf{b}}(t) = D^{\top}\rho\_{\mathbf{s}(t)}(t) + \bar{f}\_{\mathbf{s}(t)}(\mathbf{x}(t), t) + \xi\_{\mathbf{s}(t)}\left(\mathbf{x}(t), t\right) + g\_{\mathbf{s}(t)}\left(\mathbf{x}\_{\mathbf{a}}(t)\right)u(t) - h\_{\mathbf{s}(t)}^{\*}(t) \tag{19}$$

$$\frac{d}{dt}\rho\_{\mathbf{s}(t)}(t) = \Phi \rho\_{\mathbf{s}(t)}(t)$$

*Math. Comput. Appl.* **2020**, *25*, 13

Notice that the bounds for the state *x* presented as holonomic constraints and the bounds for the reference trajectories provide the following estimation for the bounds of the tracking error Δ in each continuous domain:

$$\begin{aligned} \|\Delta(t)\|\_{H\_{s(t)}}^2 &< V^+, V^+ = 2\lambda\_{\max} \left\{ H\_{s(t)} \right\} (\mathbf{x}^+ + \mathbf{x}^{\*,+}) + \varepsilon \\ \mathbf{x}^+ &= \sum\_{i=1}^5 \left( \max \left\{ \mathbf{x}\_{a,i}^-, \mathbf{x}\_{a,i}^+ \right\} \right)^2 + \left( \max \left\{ \mathbf{x}\_{b,i}^-, \mathbf{x}\_{b,i}^+ \right\} \right)^2 \qquad \varepsilon > 0 \end{aligned} \tag{20}$$

The design of the output feedback controller needs to provide an extended state robust state estimator of (9) which in this case satisfies the following hybrid dynamics:

$$\begin{aligned} \frac{d}{dt}\mathfrak{X}\_{a}(t) &= \mathfrak{X}\_{b,s(t)}(t) + L\_{a,s(t)}e\_{a,s(t)}(t) \\\\ \frac{d}{dt}\mathfrak{X}\_{b}(t) &= D^{\top}\hat{\rho}\_{s(t)}(t) + \mathfrak{g}\_{s(t)}\left(\mathfrak{x}\_{a,s(t)}(t)\right)u(t) + L\_{b,s(t)}e\_{a,s(t)}(t) \\\\ \frac{d}{dt}\mathfrak{f}\_{s(t)}(t) &= \Phi\hat{\rho}\_{s(t)}(t) + L\_{c,s(t)}e\_{a,s(t)}(t) \end{aligned} \tag{21}$$

$$e\_{a,s(t)}(t) = \mathfrak{x}\_{a,s(t)}(t) - \mathfrak{x}\_{a,s(t)}(t)$$

Notice here that *ea*,*<sup>s</sup>*(*t*)(*t*) = *<sup>C</sup>*(*x*<sup>ˆ</sup>*s*(*t*)(*t*) − *xs*(*t*)(*t*)), *<sup>x</sup>*<sup>ˆ</sup>*s*(*t*) = [*x*<sup>ˆ</sup>*a*,*<sup>s</sup>*(*t*), *<sup>x</sup>*<sup>ˆ</sup>*b*,*<sup>s</sup>*(*t*)] and *C* = [*<sup>I</sup>*5, 05]. The observer gains are defined by *La*,*<sup>s</sup>*(*t*) ∈ R5×<sup>5</sup> and *La*,*<sup>s</sup>*(*t*) ∈ R5×5. These gains must be calculated depending on what the active WBRD semi-cycle is.

The dynamics of *ea*,*<sup>s</sup>*(*t*) are associated with an extended state observer connected to:

$$\begin{aligned} \frac{d}{dt}\varepsilon\_{a}(t) &= \varepsilon\_{b,s(t)}(t) + L\_{a,s(t)}\varepsilon\_{a,s(t)}(t) \\\\ \frac{d}{dt}\varepsilon\_{b}(t) &= D^{\top}\vec{\rho}\_{s(t)}(t) + L\_{b,s(t)}\varepsilon\_{a,s(t)}(t) \\\\ \frac{d}{dt}\vec{\rho}\_{s(t)}(t) &= \Phi \vec{\rho}\_{s(t)}(t) + L\_{c,s(t)}\varepsilon\_{a,s(t)}(t) \end{aligned} \tag{22}$$

where *<sup>ρ</sup>*˜*s*(*t*) = *<sup>ρ</sup>*<sup>ˆ</sup>*s*(*t*) − *ρs*(*t*).

Let us consider the proposed output-based controller satisfying:

$$u(t) = \mathbf{g}\_{s(t)}^{-1} \left( \mathbf{x}\_{a,s(t)}(t) \right) \left[ -\mathbf{K}\_{a,s(t)}^{\top} \boldsymbol{\Delta}\_{a,s(t)} - \mathbf{K}\_{b,s(t)}^{\top} \left( \mathbf{\hat{x}}\_{b,s(t)}(t) - \mathbf{x}\_{b,s(t)}^{\*}(t) \right) + \mathbf{h}\_{s(t)}^{\*}(t) - \mathbf{D}^{\top} \boldsymbol{\beta}\_{s(t)}(t) \right] \tag{23}$$

where *K <sup>a</sup>*,*<sup>s</sup>*(*t*) ∈ R5×<sup>5</sup> and *<sup>K</sup>b*,*<sup>s</sup>*(*t*) ∈ R5×<sup>5</sup> are the piece-wise constant gains of the controller which are adjusted in each continuous dynamics.

Let us introduce the extended state vector *z* ∈ R10+10+<sup>5</sup>(*p*+<sup>1</sup>) defined as *z* = [Δ,*e*, *ρ*˜] with *e* = [*ea* ,*e<sup>b</sup>*]. The dynamics of *z* are described by

$$\frac{d}{dt}z(t) = \Pi(K\_{\mathbf{s}(t)}, L\_{\text{es},\mathbf{s}(t)}, L\_{\text{c},\mathbf{s}(t)})z(t) + \Xi\_{\mathbf{s}(t)}(\mathbf{x}(t), \mathbf{t}) \tag{24}$$

with

$$\begin{aligned} \Pi(\mathbf{K}\_{s(t)}, \mathbf{L}\_{cs,s(t)}, \mathbf{L}\_{cs(t)}) &= \begin{bmatrix} A\_{\mathbf{K},s(t)} & -B\mathbf{K}\_{b,s(t)}\mathbf{E} & -B\mathbf{D}^{\top} \\ \mathbf{0}\_5 & A\_{\mathbf{L},s(t)} & B\mathbf{D} \\ \mathbf{0}\_5 & L\_{cs(t)}\mathbf{C}^{\top} & \Phi \end{bmatrix} \\\\ \Xi\_{s(t)}(\mathbf{x}, t) &= \begin{bmatrix} B\widetilde{f}\_{s(t)}(\mathbf{x}, t) + B\widetilde{\mathbf{c}}\_{s(t)}\left(\mathbf{x}(t), t\right) \\ B\widetilde{f}\_{s(t)}(\mathbf{x}, t) + B\widetilde{\mathbf{c}}\_{s(t)}\left(\mathbf{x}(t), t\right) \\ \mathbf{0}\_{5(p+1)} \end{bmatrix} \end{aligned} \tag{25}$$

where *AK*,*<sup>s</sup>*(*t*) = *A* − *BKs*(*t*), *AL*,*<sup>s</sup>*(*t*) = *A* + *Les*,*<sup>s</sup>*(*t*)*C*, Λ = Λ ∈ R *Np*×*Np* , *B* = [05, *<sup>I</sup>*5]; and *E* = [05, *<sup>I</sup>*5]. TheconsidersthetheofThistheresult

 stability analysis study over dynamics *z*. analysis provides of the tracking controller, the state estimator, and the reconstruction of the uncertain section in the WBRD dynamics. This methodology yields the satisfaction of the close-loop analysis of the output feedback controller which offers a class of separation-principle for the proposed design. This is an additional theoretical contribution of this study. The following theorem details the main result of this study.

**Theorem 1.** *Consider the state observer given in* (21) *and the output feedback controller proposed in* (23) *with gains adjusted such that all matrices A* − *BKs*(*t*) *and A* + *Les*,*<sup>s</sup>*(*t*)*C are Hurwitz for the WBRD dynamics with incomplete information approximated with* (17)*.*

*If there is a sequence of positive definite matrices QR*,*<sup>s</sup>*(*t*),*T<sup>κ</sup> and QL*,*<sup>s</sup>*(*t*),*T<sup>κ</sup> such that positive definite and symmetric solutions Hs*(*t*),*T<sup>κ</sup>* > 0 *and Ms*(*t*),*T<sup>κ</sup> exist for the following matrix inequalities Rics*(*t*)(*Hs*(*t*),*T<sup>κ</sup>* ) ≤ 0*, Lyaps*(*t*)(*Ms*(*t*),*T<sup>κ</sup>* ) ≤ 0 *and Lyap<sup>D</sup> <sup>s</sup>*(*t*)*Ns*(*t*),*T<sup>κ</sup>* ) ≤ 0 *with*

$$\operatorname{Ric}\_{\mathbf{s}(t)} \left( H\_{\mathbf{s}(t), T\_{\mathbf{k}}} \right) = H\_{\mathbf{s}(t), T\_{\mathbf{k}}} A\_{\mathbf{K}, \mathbf{s}(t)} + A\_{\mathbf{K}, \mathbf{s}(t)}^{\top} H\_{\mathbf{s}(t), T\_{\mathbf{k}}} + H\_{\mathbf{s}(t), T\_{\mathbf{k}}} \left( \Lambda\_{1} + \Lambda\_{2} \right) H\_{\mathbf{s}(t), T\_{\mathbf{k}}} + Q\_{\mathbf{R}, \mathbf{s}(t), T\_{\mathbf{k}}} \tag{26}$$

$$Lgap\_{s(t)}(M\_{\mathbf{s}(t),T\_{\mathbf{t}}}) = M\_{\mathbf{s}(t),T\_{\mathbf{t}}}\Pi\_{2,s(t)} + \Pi\_{2,s(t)}^{\top}M\_{\mathbf{s}(t),T\_{\mathbf{t}}} + \Pi\_{2,s(t)}^{\top}\Lambda\_{\mathbf{3}}\Pi\_{2,s(t)} + \epsilon^{-1}\Pi\_{1,s(t)}^{\top}\Lambda\_{\mathbf{1}}^{-1}\Pi\_{1,s(t)} + Q\_{L,s(t),T\_{\mathbf{t}}} \tag{27}$$

$$Lugar\_{s(t)}^{D}N\_{s(t),T\_{k}}(\\\ ) = \begin{bmatrix} \Pi^{\top}(T\_{\mathbf{x}})N\_{s(t),T\_{\mathbf{x}+1}}\Pi(T\_{\mathbf{x}}) - N\_{s(t),T\_{\mathbf{x}}} & \Pi^{\top}(T\_{\mathbf{x}})\Xi\_{s(t)}(\mathbf{x}(T\_{\mathbf{x}}),T\_{\mathbf{x}})\\ \ \Xi\_{s(t)}^{\top}(\mathbf{x}(T\_{\mathbf{x}}),T\_{\mathbf{x}})\Pi(T\_{\mathbf{x}}) & N\_{k,T\_{\mathbf{x}+1}} \end{bmatrix} \tag{28}$$

<sup>Π</sup>(*<sup>T</sup>κ*) = <sup>Π</sup>(*Ks*(*<sup>T</sup>κ* ), *Les*,*<sup>s</sup>*(*<sup>T</sup><sup>κ</sup>* ), *Lc*,*<sup>s</sup>*(*<sup>T</sup><sup>κ</sup>* ))

*then the extended state z converges exponentially to the invariant set* I*D*,*T<sup>κ</sup>*× I*Z*,*T<sup>κ</sup>defined by*

$$\mathcal{L}\_{\mathcal{D},T\_{\mathbb{K}}} \times \mathcal{Z}\_{\mathbb{Z},T\_{\mathbb{K}}} = \left\{ (\Delta, z\_0) \quad \mid \quad \|\Delta\|\_{Q\_{\mathbb{K},T\_{\mathbb{K}}} - Q\_{\mathbb{K},T\_{\mathbb{K}},0}}^2 \ge \beta\_{2\prime} \quad \|z\_0\|\_{Q\_{\mathbb{L},k,T\_{\mathbb{K}}} - Q\_{\mathbb{L},k,T\_{\mathbb{K}},0}}^2 \ge \beta\_2 \right\}$$

*with βj* = 2*λmax B*Λ−<sup>1</sup> *j B ξ*0 + *ξ*1(*x*+)<sup>2</sup> + *f* + 0 , *j* = 2, 3*,* Λ2 ∈ <sup>R</sup>5×5*,* Λ3 ∈ R5×<sup>5</sup> *are positive and symmetric definite matrices and Qk*,*T<sup>κ</sup>*,<sup>0</sup> ∈ <sup>R</sup>5×5*, QL*,*k*,*T<sup>κ</sup>* ,0 ∈ R(<sup>5</sup>+<sup>5</sup>(*p*+<sup>1</sup>))×(<sup>5</sup>+<sup>5</sup>(*p*+<sup>1</sup>)) *are positive definite matrices fulfilling Qk*,*T<sup>κ</sup>* > *QL*,*k*,*T<sup>κ</sup>* ,0*, Qk*,*T<sup>κ</sup>* > *QL*,*k*,*T<sup>κ</sup>*,0*.*

*The rate of exponential convergence is given by:*

$$\alpha\_k = \min \left\{ \lambda\_{\min} \left\{ H\_k^{-1/2} Q\_{k,0} H\_k^{-1/2} \right\} , \lambda\_{\min} \left\{ M\_k^{-1/2} Q\_{L,k,0} M\_k^{-1/2} \right\} \right\}$$

**Proof.** The estimation of the adjustment laws for the controller gains uses the concept of Lyapunov stability based on BLF. Formally, the proposed BLF which is used to prove the stability of the origin considers as a class of practical equilibrium point for the movement of the WBRD. In this study, the logarithmic function is used, one of the most common BLFs [31,32]. The suggested BLF function to ge<sup>t</sup> the stability analysis in this study is given by

$$\begin{aligned} V(z) &:= \sum\_{k=1}^{2} V\_{k, T\_k}(\Delta, z\_0) \\ V\_k(\Delta, z\_0) &= \left[ \ln \left( \frac{V^+}{V^+ - \|\Delta\|\_{\tilde{H}\_{k, T\_k}}^2} \right) + z\_0^\top M\_{k, T\_k} z\_0 \right], \quad z\_0 = [\epsilon^\top, \rho^\top]^\top \end{aligned} \tag{29}$$

where *k* = 1, 2. Notice then that *s*(*t*) = 1 represents the case *a* and *s*(*t*) = 2 represents the case *b*.

The full-time derivative of *<sup>V</sup>*(*z*) is

$$\frac{d}{dt}V\_{k,T\_{\mathbf{x}}}(\Delta(t),z\_{0}(t)) = \left[\frac{2\Delta^{\top}(t)H\_{k,T\_{\mathbf{x}}}}{V^{+} - \|\Delta(t)\|\_{H\_{k,T\_{\mathbf{x}}}}^{2}}\frac{d}{dt}\Delta(t) + 2z\_{0}^{\top}(t)M\_{k,T\_{\mathbf{x}}}\frac{d}{dt}z\_{0}(t)\right] \tag{30}$$

Reorganizing the differential equation (30) yields

$$\frac{d}{dt}V\_k(\Delta(t), z\_0(t)) = 2\left[\frac{\Delta^\top(t)H\_{k,T\_k}}{V^+ - \|\Delta(t)\|\_{H\_{k,T\_k}}^2}, \quad z\_0^\top(t)M\_{k,T\_k}\right]\frac{d}{dt}z(t) \tag{31}$$

The substitution of *ddt z*(*t*) on the full-time derivative of *<sup>V</sup>*(*z*(*t*)) leads to the following form:

$$\frac{d}{dt}V\_k(\Delta(t), z\_0(t)) = 2\left[\frac{\Delta^\top(t)H\_{k,T\_k}}{V^+ - \|\Delta(t)\|\_{H\_{k,T\_k}}^2}, \quad z\_0^\top(t)M\_{k,T\_k}\right]\left[\Pi(K\_{k'}L\_{\text{es},k'}L\_{\text{c},k})z(t) + \Xi\_k(\mathbf{x}(t),t)\right] \tag{32}$$

Notice that the term <sup>Δ</sup>(*t*)*Hk*,*Tκ*<sup>Π</sup>(*Kk*, *Les*,*k*, *Lc*,*<sup>k</sup>*)*z*(*t*) can be handled as follows:

$$
\Delta^\top(t) H\_{k,T\_k} \Pi(K\_k, L\_{\varepsilon \varepsilon, k}, L\_{\varepsilon, k}) z(t) = \Delta^\top(t) H\_{k,T\_k} \left( A - B K\_k \right) \Delta(t) + \Delta^\top(t) H\_{k,T\_k} \Pi\_{1,k} z\_0(t) \tag{33}
$$

where <sup>Π</sup>1,*k* = <sup>−</sup>*BKb*,*k<sup>E</sup>* −*BD* .

Let us consider the application of the Young inequality, which satisfies:

$$X^\top Y + Y^\top X \le X^\top N X + Y^\top N^{-1} Y$$

valid for any *X*,*Y* ∈ R*r*×*s* and any 0 < *N* = *N* ∈ R*s*×*s* [33]. Therefore, the following upper bound for <sup>Δ</sup>(*t*)*Hk*Π1,*kz*0(*t*) is valid:

$$\left\| \left( \mathbf{2} \boldsymbol{\Delta}^{\top} (t) H\_{\mathbf{k}, T\_{\mathbf{x}}} \Pi\_{1, \mathbf{k}} z\_{0} (t) \right) \leq \left\| H\_{\mathbf{k}, T\_{\mathbf{x}}} \Delta (t) \right\|\_{\Lambda\_{1}}^{2} + \left\| \Pi\_{1, \mathbf{k}} z\_{0} (t) \right\|\_{\Lambda\_{1}^{-1}}^{2} \tag{34}$$

In equivalent form, <sup>Δ</sup>(*t*)*Hk*,*Tκ*Ξ*k*(*x*(*t*), *t*) accepts the following upper bound:

$$\Delta^{\top}(t)H\_{k,T\_{\mathbf{x}}}Bf\_{k}(\mathbf{x},t) \le \|H\_{k,T\_{\mathbf{x}}}\Delta(t)\|\_{\Lambda\_{2}}^{2} + \|Bf\_{k}(\mathbf{x},t) + B\xi\_{k}(\mathbf{x}(t),t)\|\_{\Lambda\_{2}^{-1,\prime}}^{2} \tag{35}$$

Introducing the following matrix Π2

$$
\Pi\_2 = \begin{bmatrix} 0\_5 & I\_5 & 0\_{5(p+1)} \\ 0\_5 & 0\_5 & I\_{5(p+1)} \end{bmatrix} \times \begin{bmatrix} A\_{K,k} & -BK\_{b,k}E & -BD^\top \\ 0\_5 & A\_{L,k} & B \cdot D \\ 0\_5 & L\_{c,k}\mathbb{C}^\top & \Phi \end{bmatrix} = \begin{bmatrix} A\_{L,k} & B \, D \\ L\_{c,k}\mathbb{C}^\top & \Phi \end{bmatrix}.$$

The terms including *z*0 (*t*)*Mk*,*T<sup>κ</sup>* in the time derivative of *<sup>V</sup>*(*z*(*t*)) can be presented as

$$\begin{aligned} 2z\_0^\top(t)M\_{k,T\_k}\Pi\_{2,k}z\_0(t) + 2z\_0^\top(t)M\_{k,T\_k}\Pi\_{2,k}\Xi\_{s(t)}(\mathbf{x}(t),t) \le \\ z\_0^\top(t)\left(M\_{k,T\_k}\Pi\_{2,k} + \Pi\_{2,k}^\top M\_{k,T\_k}\right)z\_0(t) + \left\|\Pi\_{2,k}z\_0(t)\right\|\_{\Lambda\_3}^2 + \left\|B\tilde{f}\_k(\mathbf{x},t) + B\tilde{s}\_k\left(\mathbf{x}(t),t\right)\right\|\_{\Lambda\_3^{-1}}^2 \end{aligned} \tag{36}$$

Taking together the results in (33) to (36) yields

*d dt Vk* (Δ(*t*), *<sup>z</sup>*0(*t*)) ≤ ⎡⎣ <sup>2</sup>Δ(*t*)*Hk*,*T<sup>κ</sup> AK*,*k*<sup>Δ</sup>(*t*) + *Hk*,*Tκ*<sup>Δ</sup>(*t*)<sup>2</sup>Λ1+Λ<sup>2</sup> + *B*( ˜*fk* (*<sup>x</sup>*, *t*) + *ξk* (*x*(*t*), *<sup>t</sup>*))<sup>2</sup>Λ−<sup>1</sup> 2 + <sup>Π</sup>1*z*0(*t*)<sup>2</sup>Λ−<sup>1</sup> 1 *V*+ − Δ(*t*)<sup>2</sup>*Hk*,*T<sup>κ</sup>* ⎤⎦ + *z*0 (*t*) *Mk*,*T<sup>κ</sup>* <sup>Π</sup>2,*k* + <sup>Π</sup>2,*<sup>k</sup>Mk*,*T<sup>κ</sup> <sup>z</sup>*0(*t*) + <sup>Π</sup>2,*k <sup>z</sup>*0(*t*)<sup>2</sup>Λ3 + *B*( ˜*fk* (*<sup>x</sup>*, *t*) + *ξk* (*x*(*t*), *<sup>t</sup>*))<sup>2</sup>Λ−<sup>1</sup> 3 (37)

Noticing that *min*Δ *V*<sup>+</sup> − Δ(*t*)<sup>2</sup>*Hk*,*T<sup>κ</sup>* = , then

*d dt Vk* (Δ(*t*), *<sup>z</sup>*0(*t*)) ≤ ⎡⎢⎣ Δ(*t*) *Hk*,*T<sup>κ</sup> AK*,*<sup>k</sup>* + *<sup>A</sup>K*,*<sup>k</sup>Hk*,*T<sup>κ</sup>* + *Hk*,*T<sup>κ</sup>* (<sup>Λ</sup>1 + <sup>Λ</sup>2)*Hk*,*T<sup>κ</sup>* Δ(*t*) + *B*( ˜*fk* (*<sup>x</sup>*, *t*) + *ξk* (*x*(*t*), *<sup>t</sup>*))<sup>2</sup>Λ−<sup>1</sup> 2 *V*+ − Δ(*t*)<sup>2</sup>*Hk*,*T<sup>κ</sup>* ⎤⎥⎦ + *z*0 (*t*) *Mk*,*T<sup>κ</sup>* <sup>Π</sup>2,*k* + <sup>Π</sup>2,*<sup>k</sup>Mk*,*T<sup>κ</sup>* + <sup>Π</sup>2,*k*Λ3Π2,*<sup>k</sup>* + <sup>−</sup>1Π1,*k*Λ−<sup>1</sup> 1 <sup>Π</sup>1,*k <sup>z</sup>*0(*t*) + *B*( ˜*fk* (*<sup>x</sup>*, *t*) + *ξk* (*x*(*t*), *<sup>t</sup>*))<sup>2</sup>Λ−<sup>1</sup> 3 (38)

Based on the upper bounds of (13) and the bounds for the modeling error yields to estimating upper norms of *B*( ˜ *fk*(*<sup>x</sup>*, *t*) + *ξk* (*x*(*t*), *<sup>t</sup>*))<sup>2</sup>Λ−<sup>1</sup> 2and *B*( ˜*fk*(*<sup>x</sup>*, *t*) + *ξk* (*x*(*t*), *<sup>t</sup>*))<sup>2</sup>Λ−<sup>1</sup> 3as:

$$\left\| \left| B(\tilde{f}\_k(\mathbf{x}, t) + \tilde{\xi}\_k(\mathbf{x}(t), t)) \right| \right\|\_{\Lambda\_j^{-1}}^2 \le \beta\_{j'} \tag{39}$$

Similarly, the time derivative of (38) can be bounded as

$$\frac{d}{dt}V\_k(\boldsymbol{\Delta}(t),\boldsymbol{z}\_0(t)) \le \left[\frac{\boldsymbol{\Delta}^\top(t)\left(\boldsymbol{H}\_{k,\mathrm{T}\_\mathbf{x}}\boldsymbol{A}\_{K,\boldsymbol{k}} + \boldsymbol{A}\_{K,\mathrm{k}}^\top \boldsymbol{H}\_{k,\mathrm{T}\_\mathbf{x}} + \boldsymbol{H}\_k(\boldsymbol{\Delta}\_1 + \boldsymbol{\Delta}\_2)\boldsymbol{H}\_k\right)\boldsymbol{\Delta}(t) + \boldsymbol{\beta}\_2}{V^+ - \|\boldsymbol{\Delta}(t)\|\_{\boldsymbol{H}\_{k,\mathrm{T}\_\mathbf{k}}}^2}\right] + \\ \boldsymbol{z}\_0^\top(t)\left(\boldsymbol{M}\_{k,\mathrm{T}\_\mathbf{x}}\boldsymbol{\Pi}\_{2,\mathrm{k}} + \boldsymbol{\Pi}\_{2,\mathrm{k}}^\top \boldsymbol{M}\_{k,\mathrm{T}\_\mathbf{x}} + \boldsymbol{\Pi}\_{2,\mathrm{k}}^\top \boldsymbol{\Delta}\_3\boldsymbol{\Pi}\_{2,\mathrm{k}} + \boldsymbol{\varepsilon}^{-1}\boldsymbol{\Pi}\_{1,\mathrm{k}}^\top \boldsymbol{\Delta}\_1^{-1}\boldsymbol{\Pi}\_{1,\mathrm{k}}\right)\boldsymbol{z}\_0(t) + \boldsymbol{\beta}\_3\end{cases} \tag{40}$$

Notice that (40) can be represented as follows:

$$\begin{split} \frac{d}{dt} V\_k(\Delta(t), z\_0(t)) \le & \left[ \frac{\Delta^\top(t) \text{Ric}\_k(H\_{k, T\_k}) \Delta(t) - \Delta^\top(t) Q\_{k, T\_k} \Delta(t) + \beta\_2}{V^+ - \|\Delta(t)\|\_{H\_{k, T\_k}}^2} \right] + \\ & \left[ z\_0^\top(t) \text{Lyp}\_k(M\_{k, T\_k}) z\_0(t) - z\_0^\top(t) Q\_{k, k, T\_k} z\_0(t) + \beta\_3 \right] \end{split} \tag{41}$$

Taking into account the assumptions that *Rick*(*Hk*,*T<sup>κ</sup>* ) < 0 and *Lyapk*(*Mk*,*T<sup>κ</sup>* ) < 0 yielding

$$\frac{d}{dt}V\_k(\Delta(t), z\_0(t)) \le -\left[\frac{\Delta^\top(t)Q\_{k,T\_k}\Delta(t) - \beta\_2}{V^+ - \|\Delta(t)\|\_{H\_{k,T\_k}}^2} + z\_0^\top(t)Q\_{L,k,T\_k}z\_0(t) - \beta\_3\right] \tag{42}$$

If we consider that Δ ∈ I*D*,*T<sup>κ</sup>* and *z*0 ∈ I*Z*,*T<sup>κ</sup>* , then

$$\frac{d}{dt}V\_k(\Delta(t), z\_0(t)) \le -\left[\frac{\Delta^\top(t)Q\_{k,T\_k,0}\Delta(t)}{V^+ - \|\Delta(t)\|\_{H\_{k,T\_k}}^2} + z\_0^\top(t)Q\_{L,k,T\_k,0}z\_0(t)\right] \tag{43}$$

Following the ideas given in [20], it is possible to prove that

$$\frac{d}{dt}V\_k(\Delta(t), z\_0(t)) \le -a\_k V\_k(\Delta(t), z\_0(t)) \quad \forall (\Delta, z\_0) \notin \mathcal{Z}\_{D, \mathcal{T}\_k} \times \mathcal{Z}\_{\mathcal{Z}, \mathcal{T}\_k} \tag{44}$$

Consequently, *Vk*(Δ(*t*), *<sup>z</sup>*0(*t*)) converges asymptotically to the invariant set I*D*,*T<sup>κ</sup>* × I*Z*,*T<sup>κ</sup>* within a given continuous sub-domain. This is enough to prove the stability within each continuous sub-domain. Now, to prove the stability of the hybrid form, let us consider that the tracking error is already bounded; then, let us propose the discrete analysis for the dynamics of *z* evaluated on the specific times where the sequential transition from a -> b or vice versa. With the aim of evaluating this stability analysis, one may propose the discrete Lyapunov-like function such as

$$\begin{aligned} V\_{T\_\mathbf{x}}^d(z(T\_\mathbf{x})) &:= \sum\_{k=1}^2 V\_{k,T\_\mathbf{x}}^d(z(T\_\mathbf{x})) & V\_{k,T\_\mathbf{x}}^d(z(T\_\mathbf{x})) &= z^\top(T\_\mathbf{x}) N\_{k,T\_\mathbf{x}} z(T\_\mathbf{x})\\ N\_{k,T\_\mathbf{x}} &= \begin{bmatrix} H\_{k,T\_\mathbf{x}} & 0\_5\\ 0\_{5+5(p+1)} & M\_{k,T\_\mathbf{x}} \end{bmatrix} \end{aligned} \tag{45}$$

The discrete analysis of the discrete Lyapunov like function yields

$$
\Delta V\_{k,T\_\kappa}^d(z(T\_\kappa)) = z^\top(T\_\kappa) \mathcal{N}\_{k,T\_{\kappa+1}} z(T\_\kappa) - z^\top(T\_\kappa) \mathcal{N}\_{k,T\_\kappa} z(T\_\kappa) \tag{46}
$$

Notice that

$$\begin{aligned} \Delta V\_{\mathbb{k},T\_{\mathbb{k}}}^{d}\left(z(T\_{\mathbb{k}})\right) &= \boldsymbol{z}^{\top}(T\_{\mathbb{k}+1})\boldsymbol{N}\_{\mathbb{k},T\_{\mathbb{k}+1}}z(T\_{\mathbb{k}+1}) - \boldsymbol{z}^{\top}(T\_{\mathbb{k}})\boldsymbol{N}\_{\mathbb{k},T\_{\mathbb{k}}}z(T\_{\mathbb{k}})\\ = \left[\boldsymbol{z}^{\top}(T\_{\mathbb{k}}) \quad \boldsymbol{\Sigma}\_{\mathbb{k}}^{\top}(\mathbf{x}(T\_{\mathbb{k}}),T\_{\mathbb{k}})\right] \begin{bmatrix} \boldsymbol{\Pi}^{\top}(\boldsymbol{T}\_{\mathbb{k}})\boldsymbol{N}\_{\mathbb{k},T\_{\mathbb{k}+1}}\boldsymbol{\Pi}(\boldsymbol{T}\_{\mathbb{k}}) - \boldsymbol{N}\_{\mathbb{k},T\_{\mathbb{k}}} & \boldsymbol{\Pi}^{\top}(\boldsymbol{T}\_{\mathbb{k}})\boldsymbol{\Xi}\_{\mathbb{k}}(\mathbf{x}(T\_{\mathbb{k}}),T\_{\mathbb{k}})\\ \boldsymbol{\Sigma}\_{\mathbb{k}}^{\top}(\mathbf{x}(T\_{\mathbb{k}}),T\_{\mathbb{k}})\boldsymbol{\Pi}(T\_{\mathbb{k}}) & \boldsymbol{N}\_{\mathbb{k},T\_{\mathbb{k}+1}} \end{bmatrix} \begin{bmatrix} \boldsymbol{z}(T\_{\mathbb{k}})\\ \boldsymbol{\Sigma}\_{\mathbb{k}}(\mathbf{x}(T\_{\mathbb{k}}),T\_{\mathbb{k}}) \end{bmatrix} \end{aligned} \tag{47}$$

With the assumption that the matrix inequality (28) is negative definite, then, Δ *V<sup>d</sup> k*,*T<sup>κ</sup>* (*z*(*<sup>T</sup>κ*)) is negative and, therefore, the discrete jumps remain negative confirming the local asymptotically stability of the origin for the extended system based on the state *z*.

**Remark 1.** *Notice that the H-ADRC controller can be useful if the proposed control gains can be sufficiently adequate such that SW*∗ ≤ *β*<sup>2</sup>*. This fact can be guaranteed a priori if a formal optimization of the size for the invariant set proposed in the statement of Theorem 1. The solution of this aspect is outside the scope of this study. However, we assume that the condition described in this remark is fulfilled.*

**Remark 2.** *Notice that adjusting the gains in adaptive form could reduce the large amplitude oscillations along the transient period of the tracking trajectory process. The adaptive adjustment of the gains satisfies:*

$$\frac{d}{dt}\mathcal{R}\_{\mathbf{s}(t)} = -\left(V^{+} - \left\|\Delta\right\|\_{H\_{\mathbf{k}, \mathbf{T}\_{\mathbf{k}}}}^{2}\right)^{-1} \Omega\_{\mathbf{k}}^{-1} H\_{\mathbf{K}, \mathbf{k}} \Delta\_{\mathbf{s}(t)}\tag{48}$$

*with* Ω ∈ R5×5*. This result can be obtained directly with a similar stability analysis to the one introduced in Theorem 1. The main change is introducing a modified Lyapunov like function satisfying*

$$\begin{aligned} V^{ad}(z) &:= \sum\_{k=1}^{2} V\_{k,T\_k}^{ad}(\Delta, z\_0) \\ V\_k^{ad}(\Delta, z\_0) &= \ln \left( \frac{V^+}{V^+ - \|\Delta\|\_{H\_{k,T\_k}}^2} \right) + z\_0^\top M\_{k,T\_k} z\_0 + trace \left\{ \bar{K}\_{s(t)}^\top \Omega\_k \bar{K}\_{s(t)} \right\} \end{aligned} \tag{49}$$

*where K*˜ *s*(*t*) = *K*ˆ *s*(*t*) − *Ks*(*t*) *and trace refers to the trace operator. A similar study analysis yields the design of the adaptive gains which can presumably reduce the oscillating transitions.*

**Remark 3.** *The result attained above requires the design of the extended state observer* (21)*, which must provide efficient approximation of the angular velocities of all the articulations. Such condition implies complex instrument requirements for the BIMR. Such condition enforces that the estimation error must converge faster to the corresponding invariant set than the tracking controller does. A possible alternative is using some variant* *of robust time differentiator which can produce the estimation of the required angular velocities. The so-called super-twisting algorithm can provide such solution, but the close loop stability analysis requires some further work. The reader is referred to the studies given in [34] for more details.*

**Remark 4.** *The application presented in this manuscript needs to solve six different matrix inequalities offline. All of them are Riccati equations and their solutions are quite regular in control theory. Indeed, there exist numerical solvers that can help to find the solution of these inequalities. The requirements to find the solution of a Riccati equation (in general form) given by*

$$A^\top P + PA + PRP + Q = 0, \quad P = P^\top > 0\tag{50}$$

*are*


*Notice that the stability of matrix A in our case is related with the gain matrices Ks*(*t*) *and Les*,*<sup>s</sup>*(*t*) *that can be selected in such a way A* − *BKs*(*t*) *and A* − *Les*,*<sup>s</sup>*(*t*)*C are Hurwitz.*

#### **6. Implementation Issues**

The proposed H-ADRC controller requires several technical aspects that must be considered before it can be implemented. This section details the arrangemen<sup>t</sup> of all the aspects needed to realize both the numerical and the experimental evaluations.

#### *6.1. Numerical Evaluation*

In the simulation system, it is necessary to introduce a force sensing element. The information of the force sensor is used to detect the moment when the corresponding ending section of the WBRD has touched the surface. Notice that such contact must be part of the condition to switch between the subsystems that define the gait scenarios *a* and *b*. Including this additional sensor in the condition ensures that the WBRD is completing the semi-cycle in the adequate configuration.

In this study, the numerical simulations used a virtualized representation of the WBRD based on the SimMechanics Toolbox® of Matlab®. The virtual model includes all the articulations and the mechanical representation of vacuum pumps that are going to be used as the electro-mechanical elements to change the reference frame in the hybrid representation of the gait cycle of the WBRD and allowing for evaluating the suggested controller. The mechanical representation of the WBRD was prepared in the Solid-Works® software including all the mobile actions that must be exerted by the WBRD (Figure 5).

**Figure 5.** WBRD exported to simMechanics for simulation.

#### *6.2. Experimental Evaluation*

The experimental evaluation of the proposed controller was implemented in a polymer-based WBRD. The design of the robot followed the structure proposed in the numerical evaluation. The experimental prototype was constructed using the 3D printing technique using poly-lactic acid (PLA) as building material.

The constructed WBRD used DC motors to realize the mobilization of all joints. A set of gears transmits perpendicular movement to the mechanical structure to reach the desired angular trajectories. Each of the DC actuators was regulated with a DC source to alternate power converter using a pulse width modulation (PWM) methodology.

The numerical realization of the controller used a distributed strategy considering the combination of a processing board (TIVA1294 from Texas Instruments) and a personal computer (Alienware 17S from Dell Computers). The processing board realizes the PWM formulation based on the calculated control action in the personal computer (PC).

The PC realizes an image-based-processing algorithm which calculates the articulation angles using physical markers placed over the WBRD structure (Figure 6). The algorithm is described in Algorithm 1 which is based on the application of simplified morphological image processing methods. The first algorithm is complemented with the calculus of the state estimator (21) and then the output feedback controller proposed in (23) is evaluated according to Algorithm 2. The estimated control action is sent to the processing board via a serial protocol (RS-232).

The H-ADRC requires including vacuum pumps at the first and the last links of the bio-inspired robot. Each of the pumps is activated once all the angles have attained their reference values. The pump is activated to define the change of the reference framework. The activation action is also evaluated in the algorithm and then sent to the processing board.

**Figure 6.** Implementation control in the WBRD.

**Algorithm 1** Position image recovery

1: **Start** 2: *P* ← a frame form the camera - The threshold is selected to find the marker place on the robot joints 3: **if** *Pi*,*j*,<sup>3</sup> < 10 and *Pi*,*j*,<sup>1</sup> > 180 **then** - Detect orange markers 4: *Ai*,*<sup>j</sup>* = 1 5: **else** 6: *Ai*,*<sup>j</sup>* = 0 7: **end if** 8: **if** *Pi*,*j*,<sup>3</sup> < 60 and *Pi*,*j*,<sup>1</sup> < 128 **then** - Detect green markers 9: *Bi*,*<sup>j</sup>* = 1 10: **else** 11: *Bi*,*<sup>j</sup>* = 0 12: **end if** 13: *A* =*imclose*(*A*) - Morphological closing of the image 14: *B* =*imopen*(*B*) - Morphological opening of the image 15: *G* = *A* + *B* - Create logical image 16: *C* =*DetectCentroids*(*G*) - Function to detect centroids 17: **for** *i* = 1, 2, ..., 6 **do** - Calculating the absolute angle with the slope of neighbor centroids 18: *Ti* = arctan( *Ci*+<sup>1</sup> 2 <sup>−</sup>*Ci*2 *Ci*+<sup>1</sup> 1 <sup>−</sup>*Ci*1 ) 19: **end for** 20: *θ*1 = *T*1 21: *S* = 0 22: **for** *i* = 1, 2, ..., 5 **do** - Calculating the relative angle, with respect with the first one 23: *S* = *S* + *θi* 24: *θi*+<sup>1</sup> = *Ti* − *S* 25: **end for** 26: **Return** *θ* 27: **Start control calculation** - Go to Algorithm 2

**Algorithm 2** Control implementation

1: **Start** 2: Obtain the corresponding angles *θj*,*i* - From Algorithm 3: Check the switching condition - To see what pump is active and the corresponding system *j* = 1, 2 4: Implement the extended observer to recover ˙ *θj*,*i* from Equation (17) - To recover ˙ ˆ *θj*,*i* and *<sup>ρ</sup>*<sup>ˆ</sup>*j*,*<sup>i</sup>* 5: Implement the control law in (23) for the corresponding system 6: Evaluate the obtained decoupled controllers to convert into a pulse modulation signal (PWM) 7: **if** |*uj*,*<sup>i</sup>*| > 255 **then** - These values correspond to the high time in the PWM signal8: *uj*,*<sup>i</sup>* = 255 9: **else** 10: *uj*,*<sup>i</sup>* = *uj*,*<sup>i</sup>* 11: **end if** 12: Evaluete the control to determine the movement direction of the actuators 13: **if** sign(*uj*,*<sup>i</sup>*) > 1 **then** 14: *dj*,*<sup>i</sup>* = 1 15: **else** 16: *dj*,*<sup>i</sup>* = 0 17: **end if** 18: Send the values through serial comunication (RS-232 protocol) to the TIVA1294 19: Activate a PWM with the values of *uj*,*<sup>i</sup>* and *dj*,<sup>1</sup> to be sent to the H bidges 20: **Return** to Algorithm 1 to calculate again the current position

1

#### **7. Simulation Results**

The proposed output feedback controller was evaluated using a set of numerical evaluations considering the exerting of three completes gait cycles . The corresponding sequences of reference angular movements were calculated using a biomechanical study of a *Leptidoptera gonodonta* or measuring worm. Once the angles were calculated, the method to produce the reference trajectories was implemented. These reference trajectories were injected into the SimMechanics software.

The numerically simulated model in SimMechanics-Matlab was evaluated considering the real masses (assuming the construction based on PLA material) and dimensions of each mechanical section of the WBRD. This strategy allowed for evaluating the controller as well as tuning the gains of both the estimators and controllers. These gains were used as the initial values in the experimental device.

The angular trajectories measured from the simulated WBRD were compared with the reference trajectories (Figure 7). The shown trajectories correspond to the reference signals, the measured position with the proposed H-ADRC (using the estimated velocities from the state estimator), and the state feedback form. The comparison of all trajectories confirms that the H-ADRC controller provides an equally faster convergence than other controllers, but it has less oscillations during the transient period. Such characteristic is a consequence of the additional compensation provided by the extended state observer that can actively compensate the effect of external perturbations and internal modeling imprecision. The comparison with a classical PID controller confirms such additional benefit of introducing the augmented compensation aggregated in the H-ADRC form. In addition, the proposed controller tracks the reference with smaller deviations than all other controllers considered for comparison. Notice also that these trajectories confirm the presence of high-frequency oscillations at the beginning of the tracking period (first three seconds). Although these oscillations may be undesired for the WBRD movements, the tracking exerted after the oscillations period justifies the introduction of H-ADRC based compensation due to its robustness against matched perturbations.

Figure 8 shows (in logarithmic scale) the control associated energy enforced by the state feedback (marked with PD) and the H-ADRC controllers. This comparison considers that both controllers solve the tracking with the same convergence quality. The application of the compensated control form consumes smaller amounts of energy and augments the working life of the DC motors' actuators. These controllers were chosen for this comparison because they provided the best trajectory tracking among the evaluated controllers.

**Figure 7.** Comparison of reference trajectories with controlled states implementing either the PD and the proposed H-ADRC controllers.

**Figure 8.** Euclidean norm of the control energy used with each control algorithm.

Figure 9 details the comparison of the Euclidean norms of the tracking errors obtained with the application of the same evaluated controllers that were presented in Figure 8. This figure highlights the rate of convergence and the ultimately bounded zone for the tracking errors. This figure is also presented in logarithm scale for the purpose of better detecting the differences among the proposed controllers. In both cases, the H-ADRC forces a faster convergence and smaller oscillations amplitude in steady-state for the controlled trajectories.

**Figure 9.** Euclidean norm of the tracking error for each control technique used in simulation.

Figure 10 demonstrates a sequence of image captures obtained from the simulated evaluation of the H-ADRC application over the WBRD. This sequence highlights the sequence of movements exerted by the entire simulated WBRD associated with the sequence of articular movements enforced by the distributed form of the proposed controllers. The sequence also demonstrates the benefits of introducing the simulated SimMechanics model because it allows for getting an efficient gains adjustment which yields to satisfying the complete gait sequence. Moreover, the hybrid analysis of the controller is confirmed with the efficient tracking of the reference angular positions in both scenarios a and b.

**Figure 10.** Sequence of movements of the gait cycle of the WBRD in simulation.

#### **8. Experimental Results**

Once the simulated evaluation showed acceptable results measured in terms of the tracking errors and the consumed energy, the control was implemented according to Algorithms 1 and 2. Different controllers were implemented with the aim of evaluating the advantages of the proposed methodology. The H-ADRC controller was compared with the classical state feedback controller using the derivative obtained by means of the extended state observer and an experimental PID form.

Figure 11 shows the comparison of the angular displacements obtained in the experimental results with three different controllers. For the PD controller supplied with the estimated derivative, the vector of the five different proportional gains were selected as *kP* = [23, 45, 50, 45, 23] and the derivative gains were *kD* = [2.41, 3.51, 4, 3.51, 2.41]. The case of the integral part included the same proportional and derivative gains while the integral gains were: *kI* = [0.5, 0.9, 1.2, 0.75, 0.3]. The hardware configuration used to evaluate the proposed controller provided an updating time of the control action of 0.05 s, which was enough to successfully realize the gait cycle by the experimental WBRD.

The comparison of the proposed controllers confirmed that the observed additional compensation of the H-ADRC improves the tracking efficiency for all the articulations. Moreover, the oscillations of the measured angular are reduced during the transient period. In addition, one may notice that state feedback provides the worst tracking performance among the evaluated controllers. This result was also true for all the trajectories in the constructed WBRD.

The comparison of the controllers' performances was realized through the calculus of the norm of the tracking errors (Figure 12a). This comparison proves that the tracking error is smaller if the evaluated controller was the H-ADRC in comparison with the other two controllers (state feedback and PID). Notice that the PID form provides a comparable tracking quality to the H-ADRC. Notice that a fair comparison between the evaluated controllers cannot include the norm of the tracking error only, but it must include the energy associated with the controller. Here, one may notice that H-ADRC uses larger energy (measured in terms of the norm of the control action) than the other two controllers (Figure 12b). This increment is actually not significant (12%), and it occurs only during the 20% of the evaluated period corresponding to the gait cycle.

**Figure 11.** Comparison of the trajectory tracking task in the WBRD by means of different control techniques.

**Figure 12.** Performance index for each control technique: (**a**) comparison of the Euclidean norm of the tracking error; (**b**) energy used by each controller represented by the evolution of the integral of the square control action *u*.

Figure 13 provides a sequence of photographs corresponding to the experimental evaluation of the H-ADRC controller over the constructed WBRD. These photos reveal a sequence of the BIMR positions if the controller BIMR regulates the DC motor actuators yielding a coordinated articular movements. The sequence confirms the effects of the additional compensation integrated in the H-ADRC. Additionally, the sequence confirms the performance of the alternated activation of the vacuum pumps according to the realization of the gait semi-cycle in either scenario a or b.

Experimentally, the hybrid controller provides the efficient tracking of the reference angular positions in both scenarios a and b, despite the model of the WBRD not having been used at all in the experimental sequences.

Figure 14 is intended to highlight the sequence realized by pumps that remain attached to the floor after the proposed controller drives the trajectories toward the references (red arrows for the attached and blue arrows for the released pumps). This strategy succeeded in keeping the angular velocities bounded at each joint in the bioinspired robot. In addition, the reference trajectories for the controller were proposed to keep the distal section of the WBRD closer to the floor. In addition, the absolute values of their time derivatives are small enough to limit the possibility of having fast variations of the controlled angular position at each join, which also contributes to reducing high frequency oscillation of the tracking error. All of these strategies together restricted the possibility of the discontinuous movements' effect on the proposed WBRD.

**Figure 13.** Experimental gait sequence.

**Figure 14.** Reference framework switching: (**a**) second vacuum pump activated (red arrow) in zero position; (**b**) second vacuum pump activated during gait cycle; (**c**) first vacuum pump activated in zero position; (**d**) first vacuum pump activated during gait cycle.

**Remark 5.** *The magnitude of the control signal depends on the initial conditions of the position and velocities of the suggested mobile robot links. Moreover, the magnitude of the control is a trade-off between the convergence time, as well as the accuracy on the tracking error. This problem can be solved by an adaptive version of the controller in order to reduce the control magnitude as the tracking error approaches the origin. In addition, the energy is necessary to fulfill the restrictions imposed by the Barrier function. In the case of the experimental results, the control signal is restricted according to the signal that is sent to the robotic device, which means that a pulse Width modulated (PWM) signal is implemented in the device. The maximum value for the control output is* 2*<sup>N</sup> with N the number of bits used for implementing PWM signal. Under this condition, the CD motor is moving as its maximum speed, which is always bounded.*
