Distributed Model-Free Bipartite Consensus Tracking for Unknown Heterogeneous Multi-Agent Systems with Switching Topology

Zhao, Huarong; Peng, Li; Yu, Hongnian

doi:10.3390/s20154164


by Huarong Zhao ¹, Li Peng ¹,²,* and Hongnian Yu ³

¹ Research Center of Engineering Applications for IOT, Jiangnan University, Wuxi 214122, China

² Jiangsu Province Internet of Things Application Technology Key Construction Laboratory, Wuxi Taihu College, Wuxi 214145, China

³ School of Engineering and the Built Environment, Edinburgh Napier University, Edinburgh EH10 5DT, UK

* Author to whom correspondence should be addressed.

Sensors 2020, 20(15), 4164; https://doi.org/10.3390/s20154164

Submission received: 14 June 2020 / Revised: 17 July 2020 / Accepted: 22 July 2020 / Published: 27 July 2020

(This article belongs to the Section Physical Sensors)


Abstract:

This paper proposes a distributed model-free adaptive bipartite consensus tracking (DMFABCT) scheme. The proposed scheme does not rely on a precise mathematical model, yet it achieves bipartite tracking of both time-invariant and time-varying trajectories for unknown discrete-time heterogeneous multi-agent systems (MASs) with switching topology and coopetition networks. The main innovation of the algorithm is to estimate an equivalent dynamic linearization data model by the pseudo partial derivative (PPD) approach, where only the input–output (I/O) data of each agent are required and both cooperative and competitive interactions among agents are considered. A rigorous convergence proof is given for DMFABCT, which shows that the tracking error can be reduced. Finally, the results of three simulations show that the DMFABCT scheme is effective and robust for unknown heterogeneous discrete-time MASs with switching topologies in completing bipartite consensus tracking tasks.

Keywords:

data driven; multi-agent system; bipartite consensus; switching topologies

1. Introduction

Multi-agent systems (MASs) and machine learning, two exciting trends in the robotics field, have recently attracted increasing attention from researchers due to the new epoch of artificial intelligence (AI) [1,2]. How to introduce intelligent algorithms into traditional control theories is one of the hottest and most significant research topics. Specifically, utilizing intelligent algorithms to improve the robustness of MASs and reducing the computational burden of designing controllers [3,4,5] to achieve consensus tracking are two of the challenges we need to address.

In the past half-century, most of the excellent control schemes have been developed based on explicit or implicit mathematical models. Examples are sliding mode control, intermittent control, impulse control, and fuzzy control, to name but a few. In addition, most of these control theories have been successfully applied to consensus tracking tasks of MASs. In [5], Barbot et al. first introduced the concept of a second-order sliding mode. Many novel approaches have been developed since then. For instance, a sliding-mode-based discrete differentiator was proposed that can accurately estimate the input derivatives of the controlled plant [6], and output constraint problems are considered in the second-order sliding mode controller design in [7]. In [8], Xu et al. studied the second-order consensus problems of MASs, where local intermittent information among the agents is utilized to design a distributed adaptive completely intermittent controller to achieve second-order consensus. Impulse control approaches can be seen in [9,10], where fixed-time quantized consensus, delayed and stochastic perturbation, and second-order consensus are considered in designing appropriate protocols for MASs. In terms of fuzzy control, the author in [11] designed a mixed controller, which consists of a fuzzy controller and a fuzzy observer, to handle partly unmeasurable states of controlled systems. It is noteworthy that most traditional control algorithms [5,6,7,8,9,10,11] must consider the dynamics of the controlled system; this is called model-based control (MBC).

However, an accurate model of the plant is hard to obtain, so most MBC approaches are built on approximate system dynamics and are usually not robust in practical applications. Fortunately, in the past few years, with the development of machine learning, another branch of control theory has emerged that is inspired by machine learning and tries to introduce the learning approach into traditional theories to avoid the difficulties in acquiring or estimating the dynamics of physical systems. To complete control tasks similar to those solved by MBC schemes, the new control theory works by merely using the interactive information between the system and its external environment, improving the control performance by self-learning; this is called model-free control (MFC) or data-driven control [2,4].

Recently, several papers [12,13,14,15,16,17,18,19,20,21,22,23,24] have reported on model-free adaptive control (MFAC), iterative learning control (ILC), repetitive learning control (RLC), reinforcement learning (RL), and so on. The consensus tracking problems of MASs were studied in [12] by the MFAC approach, where tracking of both time-invariant and time-varying desired trajectories is achieved. Moreover, a further theoretical analysis of MFAC was rigorously presented in [4], which explains that the MFAC method only needs input/output (I/O) measurement data of a controlled plant, without the need for any explicit mathematical model, Lyapunov stability theory, or key technical lemma, to design controllers for various control tasks. ILC is an effective approach for repetitively operating systems and has been developed by many researchers, such as in [2,13,14]. In [2], Hui et al. extended the dimension of ILC, which has a time dimension, an iteration dimension, and a space dimension, to achieve a faster and more precise tracking performance for the MASs' formation task. In [13], Li et al. studied how to combine ILC with model predictive control to achieve better performance. The RLC model is utilized to track periodic exogenous signals in continuous processes, which can be seen in [14], where a novel distributed adaptive protocol is investigated for uncertain nonlinear leader–follower MASs to achieve global asymptotic consensus. In [15], Odekunle et al. presented a novel approach to solve the non-zero-sum game output regulation problem for MASs by using RL. In our investigations, we found that another category of MFC methods is based on neural networks (NNs), which have strong approximation abilities for nonlinear dynamics. In [16,17,18], the authors designed actor–critic-based neural networks to approximate the value function and control policy for each agent, respectively, to optimize consensus control performance.
It should be pointed out that NN-based methods need training processes and external testing signals for controller design, which is not convenient. Meanwhile, there are some interesting adaptive schemes in [19,20,21,22,23].

In the aforementioned related studies [5,6,7,8,9,10,11], consensus problems of MASs are based on MBC approaches, while the authors of [12,13,14,15,16,17,18,19,20,21,22,23] employed and developed MFC methods to address consensus or consensus tracking problems for MASs; however, achieving consensus tracking for MASs with unknown dynamics is still an open and challenging problem. Furthermore, the above literature on MAS consensus control and tracking only considers cooperative interactions among agents. In fact, cooperation and competition are usually inseparable from one another in natural or engineering scenarios, for instance, activators and inhibitors in biological systems, opposing teams in a sports match, or duopolistic regimes arising when agents compete for limited resources in economic systems [24]. Hence, to improve the adaptive and autonomous abilities of MASs, the competitive relationship needs to be considered, which is becoming a hot research topic. Altafini [25] first explored consensus for MASs with antagonistic interactions, and this specific consensus is called bipartite consensus (BC), which means that agents are assigned to two alliances, where each alliance has a unique sign, but all agents ultimately reach the same magnitude of position, velocity, and/or angle. After that, BC sparked the interest of many researchers and has been discussed for MASs with linear, nonlinear, and even heterogeneous dynamics. Moreover, BC for MASs with Lipschitz-type, second-order, or high-order dynamics is investigated in [24,26,27]. Inspired by the above contributions, several theories have been extended. In [28], a distributed extended state observer is employed to guarantee leader–follower BC for MASs with mismatched unknown disturbance. Formulating a BC controller is more challenging for high-order MASs than for low-order ones.
The BC problem for high-order MASs with input saturation is studied by combining distributed event-triggered control and a low-gain feedback technique in [29]. Finite-time and fixed-time BC for MASs are explored in [30,31], respectively. A novel RL-based protocol is presented in [32], which is the first use of RL for unknown discrete-time leader–follower MASs; the author utilizes data-driven actor–critic-based NNs to address the BC problem for unknown MASs, but this increases the computational load. Moreover, a training process is necessary.

Although much effort has been made toward solving the BC problem [33,34,35,36], to the best of our knowledge, pseudo partial derivative (PPD) approaches have not been taken into account in the existing results. Based on the above observations and analysis, this paper employs a PPD method to estimate an equivalent dynamic linearization data model of each agent, where only the measured I/O data of neighboring agents are necessary. Then, a distributed model-free adaptive bipartite consensus tracking (DMFABCT) scheme is designed for unknown discrete-time heterogeneous nonaffine nonlinear MASs with switching topologies to realize bipartite consensus tracking of time-invariant and time-varying reference trajectories by using the neighbor-based tracking error. It is worth pointing out that although only a few agents can receive the desired trajectory, the rigorous theoretical proof confirms that our proposed algorithm can guarantee convergence of all agents. With respect to the existing consensus approaches for MASs, the main contributions of this work can be summarized as follows:

(1): A DMFABCT framework is established for unknown heterogeneous nonaffine nonlinear discrete-time MASs with switching topologies and a coopetition network. It is a data-driven distributed intelligent algorithm, which addresses the BC problem under both time-invariant and time-varying reference trajectories. Although Bu et al. [37] proposed a novel data-driven framework for MASs, it only discussed cooperative interactions.
(2): The proposed DMFABCT scheme is designed from neighbor-based online measured I/O data, which bypasses the need of existing consensus algorithms, as seen in [5,6,7,8,9,10,11,24,25,26,27,28,29,30,31,32,33,34,35], to obtain an accurate mathematical model, so that the designed scheme is more robust and reduces the energy cost of massive computation.
(3): Both collaborative and antagonistic interactions among agents are considered in the proposed protocol. Compared with the protocols in [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23], the proposed protocol is therefore closer to practical scenarios. Moreover, DMFABCT differs from the algorithm proposed in [32] in that it copes with the BC problem using the PPD, so training processes and external testing signals are not necessary.

The remainder of this paper is structured as follows. Several essential preliminaries are presented in Section 2. The introduction of the DMFABCT algorithm and the tracking performance of fixed and time-varying reference trajectory analysis are presented in Section 3. Three numerical simulation experiments are provided in Section 4. Finally, conclusions and future work are provided in Section 5.

2. Preliminaries and Problem Formulation

2.1. Graph Theory and Some Notations

Let $\mathbb{R}$ denote the set of real numbers. The Euclidean norm of $X \in \mathbb{R}^{n \times n}$ is expressed by $\| X \|$. The identity matrix and a diagonal matrix are expressed by $I$ and $\mathrm{diag}(\cdot)$, respectively, where the dimension depends on the context. In this paper, algebraic graph theory is employed to analyze the interaction topologies of MASs. It should be pointed out that the graphs are directed, and the weighted directed graph is expressed by $G = (V, E, A)$, where $V = \{1, 2, \cdots, N\}$, $E \subseteq \{(V_{i}, V_{j}) \mid V_{i}, V_{j} \in V\} \subseteq V \times V$, and $A$ are the set of vertices, the set of edges, and the adjacency matrix, respectively. Vertex $i$ is the parent and vertex $j$ is the child if $i$ can transmit information to $j$ directly, which is expressed as $(i, j) \in E$. If $j$ is not the parent of $i$, then $a_{i,j} = 0$; otherwise, $a_{i,j} \neq 0$. In the graph of MASs, an agent $i$ may be connected to several agents, so the set $N_{i} = \{j \mid j \neq i, (V_{j}, V_{i}) \in E\}$ is used to describe these relationships; it is called the neighborhood of agent $i$ in the literature. In this paper, both cooperative and competitive relationships between agents are considered, so that the elements of $A = (a_{i,j}) \in \mathbb{R}^{N \times N}$ take three different values: $-1$, $0$, and $1$. If nodes $i$ and $j$ belong to the same group and agent $i$ can get information from agent $j$, then $a_{i,j} = 1$; otherwise, $a_{i,j} \neq 1$. When $a_{i,j} = -1$, agents $i$ and $j$ must be in opposite groups, which is called a competitive relationship between agents $i$ and $j$; the relationship with $a_{i,j} = 1$ is called cooperation. We use the term coopetition to cover both situations in the MAS network. The Laplacian matrix of $G$ is calculated by $L = D - A$, where $D = \mathrm{diag}(d_{1}^{in}, d_{2}^{in}, \cdots, d_{N}^{in})$ and $d_{i}^{in} = \sum_{j = 1}^{N} | a_{i,j} |$ is called the in-degree of vertex $i$. The coopetition network $G$ is called structurally balanced if all nodes in $V$ can be divided into two disjoint subsets $V_{1}$ and $V_{2}$ that satisfy the following three conditions:

(1): $V = V_{1} \cup V_{2}$ and $V_{1} \cap V_{2} = \emptyset$.
(2): $a_{ij} \geq 0$ for all $i, j \in V_{z}$ ($z \in \{1, 2\}$).
(3): $a_{ij} \leq 0$ for all $i \in V_{z}$, $j \in V_{q}$ with $z \neq q$ ($z, q \in \{1, 2\}$).

Furthermore, if the MAS graph $G$ contains a spanning tree, information can be transmitted from a root node to any other node, and such a graph is referred to as a strongly connected graph in this paper.

In order to investigate time-varying switching topologies, let $\bar{G}(k)$ denote a time-varying switching graph with a virtual leader, which depends on the time step $k$. Let $A_{F}(k) = [a_{ij}(k)] \in \mathbb{R}^{N \times N}$, $D(k) = \mathrm{diag}(d_{1}(k), \cdots, d_{N}(k))$ with $d_{i}(k) = \sum_{j \in N_{i}} | a_{ij}(k) |$, and $L(k) = - A_{F}(k) + D(k) \in \mathbb{R}^{N \times N}$ be the corresponding adjacency matrix, degree matrix, and Laplacian matrix, respectively. $N_{i}$ denotes the neighborhood of the $i$th agent, and $B(k) = \mathrm{diag}\{b_{1}(k), \cdots, b_{N}(k)\} \in \mathbb{R}^{N \times N}$ is employed to describe the relationship between the virtual leader 0 and each follower. If agent $i$ can directly get the desired trajectory from virtual leader 0, i.e., $(0, i) \in \bar{E}$, then $b_{i}(k) = 1$; otherwise, $b_{i}(k) = 0$. To describe the time-varying topology, let $\bar{G}_{l} = \{\bar{G}_{1}, \bar{G}_{2}, \cdots, \bar{G}_{\kappa}\}$ denote the set of all directed graphs for the agents, where $\kappa \in \mathbb{Z}^{+}$ denotes the total number of possible interaction graphs.
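The graph notions above can be made concrete with a short sketch. The code below is only an illustration under stated assumptions: the helper names `signed_laplacian` and `structural_balance` and the four-agent adjacency matrix are hypothetical and do not come from the paper. It builds $L = D - A$ with $d_i = \sum_j |a_{ij}|$ and tries to split the agents into two groups $V_1$, $V_2$ so that positive edges stay inside a group and negative edges cross groups.

```python
def signed_laplacian(A):
    """Signed Laplacian L = D - A, with D = diag(sum_j |a_ij|)."""
    n = len(A)
    return [[(sum(abs(a) for a in A[i]) if i == j else 0) - A[i][j]
             for j in range(n)] for i in range(n)]


def structural_balance(A):
    """Return group signs s_i in {+1, -1} if the signed graph is
    structurally balanced, otherwise None (plain graph traversal)."""
    n = len(A)
    s = [0] * n
    for root in range(n):
        if s[root] != 0:
            continue
        s[root] = 1
        stack = [root]
        while stack:
            i = stack.pop()
            for j in range(n):
                # Look at the edge in either direction between i and j.
                for w in (A[i][j], A[j][i]):
                    if w == 0 or i == j:
                        continue
                    want = s[i] * (1 if w > 0 else -1)
                    if s[j] == 0:
                        s[j] = want
                        stack.append(j)
                    elif s[j] != want:
                        return None  # conflicting signs: not balanced
    return s


# Hypothetical 4-agent coopetition network: a_ij != 0 means i hears j.
A = [[0, 0, 0, -1],
     [1, 0, 0, 0],
     [0, -1, 0, 0],
     [0, 0, 1, 0]]
groups = structural_balance(A)   # agents 0,1 in one group, 2,3 in the other
L = signed_laplacian(A)
```

For a structurally balanced graph, multiplying each row of `L` by the sign vector `groups` gives zero, which is the property used later when the measurement output is written in compact form.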

2.2. Problem Formulation

In existing studies, the consensus problem, especially the bipartite consensus problem, is often considered for a group of agents with identical dynamics. However, heterogeneity is the intrinsic property for multi-agent systems. Therefore, the problem of bipartite consensus for heterogeneous agents presents many challenges. It is noteworthy that the following assumptions are fundamental conditions of nonlinear dynamics for our analysis.

Definition 1.

Consider a discrete-time heterogeneous SISO (single-input single-output) MAS with $N$ agents, where the nonlinear dynamics of agent $i$ satisfy the following equation:

$$y_{i}(k + 1) = f_{i}(y_{i}(k), u_{i}(k)) \tag{1}$$

where $y_{i}(k) \in \mathbb{R}$ is the output, $i = 1, 2, \dots, N$, $f_{i}(\cdot)$ is an unknown nonlinear function, and $u_{i}(k) \in \mathbb{R}$ is the control input. $y_{0}(k)$ denotes the trajectory of a virtual leader, which is represented by vertex 0 in the graph. Furthermore, only a subset of agents can receive information from the virtual leader directly. Hence, the directed graph $\bar{G}$ of the MAS consists of $N + 1$ agents, and the corresponding edge set and weighted adjacency matrix are expressed by $\bar{E}$ and $\bar{A}$, respectively.

Assumption 1.

The partial derivative of the nonlinear function $f_{i}(\cdot)$ with respect to the control input $u_{i}(k)$ is continuous.

Assumption 2.

Model (1) is generalized Lipschitz; that is, $| \Delta y_{i}(k + 1) | \leq r | \Delta u_{i}(k) |$ holds for all $k$ with $\Delta u_{i}(k) \neq 0$, where $r$ is a positive constant, $\Delta u_{i}(k) = u_{i}(k) - u_{i}(k - 1)$, and $\Delta y_{i}(k + 1) = y_{i}(k + 1) - y_{i}(k)$.

Remark 1.

The authors of [12,37] and the references therein have discussed the reasonableness of Assumptions 1 and 2 for practical nonlinear systems and MASs.

Lemma 1.

If the dynamics (1) of agent $i$ satisfy Assumptions 1 and 2 and $\Delta u_{i}(k) \neq 0$, then system (1) can be represented by the following compact-form dynamic linearization model [37,38]:

$$\Delta y_{i}(k + 1) = \Gamma_{i}(k) \Delta u_{i}(k) \tag{2}$$

where $| \Gamma_{i}(k) | \leq \bar{r}$, $\bar{r}$ is a positive constant, and $\Gamma_{i}(k)$ is a variable named the pseudo partial derivative (PPD).
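As a worked illustration of the linearization model (2), consider a hypothetical memoryless agent $y_i(k+1) = g_i u_i(k) + 0.2\tanh(u_i(k))$ with an unknown gain $g_i > 0$. This plant is an assumption chosen for illustration and is not one of the paper's examples. Whenever $\Delta u_i(k) \neq 0$, the PPD is simply the secant slope of the input–output map, and the mean value theorem bounds it because the derivative of $\tanh$ lies in $(0, 1]$:

```latex
\Gamma_i(k) = \frac{\Delta y_i(k+1)}{\Delta u_i(k)}
            = g_i + 0.2\,\frac{\tanh(u_i(k)) - \tanh(u_i(k-1))}{u_i(k) - u_i(k-1)},
\qquad
g_i < \Gamma_i(k) \le g_i + 0.2 .
```

For this example the bound in the lemma holds with $\bar{r} = g_i + 0.2$, and $\Gamma_i(k)$ is positive and time-varying even though the plant itself is fixed.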

Remark 2.

Using the PPD to establish a dynamic linearization data model is called the PPD approach, where the PPD depends only on $\Delta y_{i}(k + 1)$ and $\Delta u_{i}(k)$. Moreover, the dynamic linearization data model is updated through the PPD, so it can better approximate the practical dynamics of the controlled plant. $\Gamma_{i}(k)$ is not easy to obtain, so we design the parameter estimation law (4) to obtain the estimate $\hat{\Gamma}_{i}(k)$ of $\Gamma_{i}(k)$. Meanwhile, the estimation error of $\Gamma_{i}(k)$ is analyzed in Theorem 1. Since the PPD approach is not complex and the resulting dynamic linearization data model is simple, the PPD approach is a hot topic in data-driven control for researchers studying discrete-time nonlinear systems. However, utilizing the PPD approach to solve consensus problems of multi-agent systems is still an open topic, especially for bipartite consensus problems of multi-agent systems with switching topologies.

Definition 2.

The distributed measurement output of agent $i$ is defined as

$$\xi_{i}(k) = \sum_{j \in N_{i}} | a_{ij}(k) | \left( \mathrm{sign}(a_{ij}(k)) y_{j}(k) - y_{i}(k) \right) + b_{i}(k) \left( s_{i}(k) y_{0}(k) - y_{i}(k) \right) \tag{3}$$

If agent $i$ can directly get the desired trajectory from virtual leader 0, i.e., $(0, i) \in \bar{E}$, then $b_{i}(k) = 1$; otherwise, $b_{i}(k) = 0$. Let $\varepsilon_{i}(k) = s_{i} y_{0}(k) - y_{i}(k)$ denote the tracking error, where $s_{i} = 1$ for $i \in V_{1}$ and $s_{i} = - 1$ for $i \in V_{2}$.

Assumption 3.

All of the time-varying switching communication graphs are strongly connected graphs and the trajectory information of the virtual leader can be transmitted to one or more follower agents directly.

Assumption 4.

In the related literature, $\Gamma_{i}(k) > 0$, $i = 1, 2, 3, \dots, N$ (or $\Gamma_{i}(k) < 0$) is satisfied for all $k$, so we assume $\Gamma_{i}(k) > 0$ in this paper.

Remark 3.

Assumption 3 is a fundamental condition for studying bipartite consensus tracking problems. Moreover, Assumption 4 is implied in traditional model-based control algorithms as a type of linear-like characteristic. Furthermore, this assumption is widely used in practical multi-agent systems, for instance, unmanned aerial vehicles and mobile robots.

3. Main Results

In order to solve the bipartite consensus tracking problem stated in Section 2.2, we propose the DMFABCT approach below:

$$\hat{\Gamma}_{i}(k) = \hat{\Gamma}_{i}(k - 1) + \frac{p \Delta u_{i}(k - 1)}{w + | \Delta u_{i}(k - 1) |^{2}} \left( \Delta y_{i}(k) - \hat{\Gamma}_{i}(k - 1) \Delta u_{i}(k - 1) \right) \tag{4}$$

$$\hat{\Gamma}_{i}(k) = \hat{\Gamma}_{i}(1), \quad \text{if } | \hat{\Gamma}_{i}(k) | \leq c \ \text{ or } \ \mathrm{sign}(\hat{\Gamma}_{i}(k)) \neq \mathrm{sign}(\hat{\Gamma}_{i}(1)) \tag{5}$$

$$u_{i}(k) = u_{i}(k - 1) + \frac{\rho \hat{\Gamma}_{i}(k)}{\lambda + | \hat{\Gamma}_{i}(k) |^{2}} \xi_{i}(k) \tag{6}$$

where $p > 0$ and $\rho > 0$ are the step sizes, whose ranges are given in the following theorems, and $w > 0$ and $\lambda > 0$ are weight factors. According to Assumption 4, let $\hat{\Gamma}_{i}(1) > 0$, which is the initial value of $\hat{\Gamma}_{i}(k)$; $\hat{\Gamma}_{i}(k)$ is the estimated value of $\Gamma_{i}(k)$. In practice, $c$ is a very small positive constant: $| \hat{\Gamma}_{i}(k) | \leq c$ means that $\hat{\Gamma}_{i}(k)$ is effectively no longer being updated, so $c$ is selected as $10^{-4}$.
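The update laws (4)–(6) can be sketched in a few lines of Python. Everything concrete below is an assumption made for illustration rather than the authors' setup: the parameter values, the two four-agent topologies, the 50-step switching rule, and the memoryless toy plants $y_i(k+1) = g_i u_i(k) + 0.2\tanh(u_i(k))$. These plants are chosen because their PPD stays positive and bounded, so the behaviour can be reasoned about by hand. The controller only sees I/O data and neighbor outputs.

```python
import math


def sign(x):
    return (x > 0) - (x < 0)


def ppd_update(gam_prev, du_prev, dy, p=0.5, w=1.0, gam_init=0.5, c=1e-4):
    """PPD estimation law (4) followed by the reset rule (5)."""
    gam = gam_prev + p * du_prev / (w + du_prev ** 2) * (dy - gam_prev * du_prev)
    if abs(gam) <= c or sign(gam) != sign(gam_init):
        gam = gam_init
    return gam


def control_update(u_prev, gam, xi, rho=0.4, lam=1.0):
    """Control law (6)."""
    return u_prev + rho * gam / (lam + gam ** 2) * xi


def measurement(i, y, y_ref, A, b, s):
    """Distributed measurement output (3) of agent i."""
    xi = sum(abs(A[i][j]) * (sign(A[i][j]) * y[j] - y[i])
             for j in range(len(y)) if j != i)
    return xi + b[i] * (s[i] * y_ref - y[i])


def simulate(steps=3000, y_ref=1.0):
    gains = [0.6, 0.8, 1.0, 1.2]   # hypothetical, unknown to the controller

    def plant(i, u):               # assumed toy plant, not from the paper
        return gains[i] * u + 0.2 * math.tanh(u)

    s = [1, 1, -1, -1]             # group signs: V1 = {0, 1}, V2 = {2, 3}
    b = [1, 0, 0, 0]               # only agent 0 hears the virtual leader
    A1 = [[0, 0, 0, -1], [1, 0, 0, 0], [0, -1, 0, 0], [0, 0, 1, 0]]
    A2 = [[0, 0, 0, 0], [1, 0, 0, 0], [-1, 0, 0, 0], [0, 0, 1, 0]]
    n = len(gains)
    y, y_prev = [0.0] * n, [0.0] * n
    u, u_prev = [0.0] * n, [0.0] * n   # u holds u(k-1), u_prev holds u(k-2)
    gam = [0.5] * n
    for k in range(steps):
        A = A1 if (k // 50) % 2 == 0 else A2   # switch topology every 50 steps
        xi = [measurement(i, y, y_ref, A, b, s) for i in range(n)]
        u_new = []
        for i in range(n):
            gam[i] = ppd_update(gam[i], u[i] - u_prev[i], y[i] - y_prev[i])
            u_new.append(control_update(u[i], gam[i], xi[i]))
        y_prev, y = y, [plant(i, u_new[i]) for i in range(n)]
        u_prev, u = u, u_new
    return [s[i] * y_ref - y[i] for i in range(n)]   # tracking errors


errors = simulate()
```

In this sketch `rho = 0.4` is chosen below the reciprocal of the largest value of $\sum_j |a_{ij}| + b_i$ over both topologies, which is 2 here. The returned list holds the bipartite tracking errors $s_i y_0 - y_i$ after the final step.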

Remark 4.

It is noted that $\hat{\Gamma}_{i}(k)$ is obtained by merely using the measured I/O data $\Delta y_{i}(k)$ and $\Delta u_{i}(k - 1)$ in the parameter estimation scheme (4). It is also worth pointing out that the convergence of the parameter estimation scheme (4) can be guaranteed, as shown in [12] and [37]. The control law (6) shows that the control input $u_{i}(k)$ is updated by using the distributed measurement output $\xi_{i}(k)$ of agent $i$, so the algorithm is a distributed scheme.

Remark 5.

The key feature of the DMFABCT scheme is that the agents' model dynamics are not required. For instance, the PPD parameter estimation algorithm uses only the measured I/O data of the multi-agent system. Therefore, it is a typical data-driven control approach for solving the BC problem of MASs.

Remark 6.

Both $\lambda$ and $\rho$ are important parameters of the DMFABCT algorithm. A suitable $\lambda$, which is a weight parameter, ensures the stability of the MAS, and $\rho$ is a controller parameter that guarantees that the tracking error is reduced. Furthermore, the admissible range of $\rho$ is analyzed in the following theorems.

To analyze the stability of MASs, Lemma 2 is one of the important conditions.

Lemma 2.

Let $T(K)$ denote a time-varying irreducible substochastic matrix with positive diagonal entries, and let $\mathcal{T}$ denote the set of all possible $T(K)$ [39]. Then, we can obtain

$$\| T(Q) T(Q - 1) \cdots T(1) \| \leq \Upsilon$$

where $0 < \Upsilon < 1$ and $T(K)$, $K = 1, 2, \dots, Q$, are $Q$ matrices arbitrarily selected from $\mathcal{T}$.
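A small numerical sketch may help to see what the lemma states. The two $2 \times 2$ matrices below are hypothetical examples, and the infinity norm (maximum absolute row sum) is an assumption made here because it is the natural norm for substochastic matrices. Each matrix alone has norm one, yet every product of $Q = 2$ matrices drawn from the family has norm strictly below one.

```python
import itertools


def inf_norm(M):
    """Infinity norm: maximum absolute row sum."""
    return max(sum(abs(x) for x in row) for row in M)


def matmul(X, Y):
    return [[sum(X[i][t] * Y[t][j] for t in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


# Hypothetical irreducible substochastic matrices with positive diagonals:
# all row sums are at most 1 and at least one row sum is below 1.
T_a = [[0.5, 0.3], [0.5, 0.5]]
T_b = [[0.6, 0.4], [0.2, 0.7]]
family = [T_a, T_b]

Q = 2
worst = 0.0
for combo in itertools.product(family, repeat=Q):
    P = [[1.0, 0.0], [0.0, 1.0]]
    for T in combo:          # builds T(Q) ... T(1) by left-multiplication
        P = matmul(T, P)
    worst = max(worst, inf_norm(P))
# 'worst' plays the role of the constant Upsilon for this family and this Q.
```

Here `worst` is the largest norm over all four length-two products, so it acts as the contraction constant for this particular family.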

The stability analysis of the DMFABCT approach is presented by Theorem 1.

Theorem 1.

Suppose that the MAS (1) satisfies Assumptions 1, 2, and 4, that its communication topology satisfies Assumption 3, and that the proposed DMFABCT algorithm (4)–(6) is applied to track a time-invariant desired reference trajectory $y_{0}(k)$, i.e., $y_{0}(k) = \mathrm{const}$. If $\rho$ satisfies the condition

$$\rho < \frac{1}{\max_{i = 1, \dots, N,\ l = 1, \dots, \kappa} \left( \sum_{j = 1}^{N} | a_{ij}^{l}(k) | + b_{i}^{l}(k) \right)}$$

and $\lambda > \lambda_{\min} > 0$, then $\lim_{k \to \infty} \| \varepsilon_{i}(k) \| = 0$, $i = 1, 2, \dots, N$.
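The step-size condition of the theorem is easy to evaluate once the candidate topologies are listed. The following sketch assumes each topology is given as a pair of a signed adjacency matrix and a leader-access vector; the function name `rho_upper_bound` and the two example graphs are hypothetical.

```python
def rho_upper_bound(topologies):
    """Reciprocal of max over topologies l and agents i of
    sum_j |a_ij^l| + b_i^l; rho must be chosen strictly below this."""
    worst = max(sum(abs(a) for a in A[i]) + b[i]
                for A, b in topologies
                for i in range(len(b)))
    return 1.0 / worst


# Two hypothetical switching topologies for four agents.
A1 = [[0, 0, 0, -1], [1, 0, 0, 0], [0, -1, 0, 0], [0, 0, 1, 0]]
A2 = [[0, 0, 0, 0], [1, 0, 0, 0], [-1, 0, 0, 0], [0, 0, 1, 0]]
b = [1, 0, 0, 0]

bound = rho_upper_bound([(A1, b), (A2, b)])
```

In the first topology agent 0 has one neighbor and direct leader access, so the largest sum is 2 and any $\rho$ strictly below `bound` satisfies the inequality for both graphs.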

Proof:

We prove this theorem using the three steps below.

Step 1 (Proving the Boundedness of $\hat{\Gamma}_{i}(k)$): Define $\tilde{\Gamma}_{i}(k) = \hat{\Gamma}_{i}(k) - \Gamma_{i}(k)$. According to Lemma 1, $\Delta y_{i}(k) = \Gamma_{i}(k - 1) \Delta u_{i}(k - 1)$, so the parameter estimation law (4) gives

$$\begin{aligned} \tilde{\Gamma}_{i}(k) &= \hat{\Gamma}_{i}(k) - \Gamma_{i}(k) \\ &= \hat{\Gamma}_{i}(k - 1) + \frac{p \Delta u_{i}(k - 1)}{w + | \Delta u_{i}(k - 1) |^{2}} \left( \Delta y_{i}(k) - \hat{\Gamma}_{i}(k - 1) \Delta u_{i}(k - 1) \right) - \Gamma_{i}(k) \\ &= \tilde{\Gamma}_{i}(k - 1) + \frac{p \Delta u_{i}(k - 1)^{2}}{w + | \Delta u_{i}(k - 1) |^{2}} \left( \Gamma_{i}(k - 1) - \hat{\Gamma}_{i}(k - 1) \right) + \Gamma_{i}(k - 1) - \Gamma_{i}(k) \\ &= \left( 1 - \frac{p \Delta u_{i}(k - 1)^{2}}{w + | \Delta u_{i}(k - 1) |^{2}} \right) \tilde{\Gamma}_{i}(k - 1) + \Gamma_{i}(k - 1) - \Gamma_{i}(k) \end{aligned} \tag{7}$$

According to Equation (7), the following inequality can be obtained:

$$| \tilde{\Gamma}_{i}(k) | \leq \left| 1 - \frac{p \Delta u_{i}(k - 1)^{2}}{w + | \Delta u_{i}(k - 1) |^{2}} \right| | \tilde{\Gamma}_{i}(k - 1) | + | \Gamma_{i}(k - 1) - \Gamma_{i}(k) | \tag{8}$$

The inequalities $p \Delta u_{i}(k - 1)^{2} \leq | \Delta u_{i}(k - 1) |^{2} < w + | \Delta u_{i}(k - 1) |^{2}$ are obtained by selecting $p$ and $w$ such that $0 < p \leq 1$ and $w > 0$. Here $\Delta u_{i}(k - 1)^{2} = | \Delta u_{i}(k - 1) |^{2}$ because the system studied in this paper is single-input single-output. Thus, a constant $\varpi$ can be selected to satisfy the following inequality:

$$0 < \left| 1 - \frac{p \Delta u_{i}(k - 1)^{2}}{w + | \Delta u_{i}(k - 1) |^{2}} \right| \leq \varpi < 1 \tag{9}$$

Since $| \Gamma_{i}(k) | \leq \bar{r}$ and, according to Assumption 4, $\Gamma_{i}(k) > 0$, the following inequalities can be obtained:

$$\begin{cases} \Gamma_{i}(k - 1) - \Gamma_{i}(k) \leq \Gamma_{i}(k - 1) \leq \bar{r}, & \text{if } \Gamma_{i}(k) \leq \Gamma_{i}(k - 1) \\ \Gamma_{i}(k) - \Gamma_{i}(k - 1) \leq \Gamma_{i}(k) \leq \bar{r}, & \text{if } \Gamma_{i}(k - 1) \leq \Gamma_{i}(k) \end{cases}$$

Hence $| \Gamma_{i}(k - 1) - \Gamma_{i}(k) | \leq \bar{r}$, and

$$\begin{aligned} | \tilde{\Gamma}_{i}(k) | &\leq \varpi | \tilde{\Gamma}_{i}(k - 1) | + \bar{r} \\ &\leq \varpi^{2} | \tilde{\Gamma}_{i}(k - 2) | + \varpi \bar{r} + \bar{r} \\ &\leq \varpi^{3} | \tilde{\Gamma}_{i}(k - 3) | + \varpi^{2} \bar{r} + \varpi \bar{r} + \bar{r} \\ &\leq \cdots \leq \varpi^{k - 1} | \tilde{\Gamma}_{i}(1) | + \varpi^{k - 2} \bar{r} + \cdots + \varpi \bar{r} + \bar{r} \\ &= \varpi^{k - 1} | \tilde{\Gamma}_{i}(1) | + \frac{\bar{r} (1 - \varpi^{k - 1})}{1 - \varpi} \end{aligned} \tag{10}$$

so that $\limsup_{k \to \infty} | \tilde{\Gamma}_{i}(k) | \leq \frac{\bar{r}}{1 - \varpi}$, i.e., $\tilde{\Gamma}_{i}(k)$ is bounded. Moreover, since $\Gamma_{i}(k)$ is bounded, $\hat{\Gamma}_{i}(k)$ is also bounded.

Step 2 (Proving the Convergence of $\varepsilon(k)$): Since $\varepsilon_{i}(k) = s_{i}(k) y_{0}(k) - y_{i}(k)$, Equation (3) can be rewritten as follows:

$$\xi_{i}(k) = \sum_{j \in N_{i}} | a_{ij}(k) | \left( \mathrm{sign}(a_{ij}(k)) y_{j}(k) - y_{i}(k) \right) + b_{i}(k) \varepsilon_{i}(k) \tag{11}$$

For clarity, Equation (11) can be written in compact form as

$$\begin{aligned} \xi(k) &= [\xi_{1}(k), \xi_{2}(k), \cdots, \xi_{N}(k)]^{T} \\ &= - L(k) y(k) + B(k) \left( s(k) \bar{y}_{0}(k) - y(k) \right) \\ &= - L(k) y(k) + L(k) s(k) \bar{y}_{0}(k) + B(k) \left( s(k) \bar{y}_{0}(k) - y(k) \right) \\ &= (B(k) + L(k)) \left( s(k) \bar{y}_{0}(k) - y(k) \right) \\ &= (B(k) + L(k)) \varepsilon(k) \end{aligned} \tag{12}$$

where $B(k) = \mathrm{diag}(b_{1}(k), b_{2}(k), \cdots, b_{N}(k))$, $\varepsilon(k) = [\varepsilon_{1}(k), \varepsilon_{2}(k), \cdots, \varepsilon_{N}(k)]^{T}$, $s(k) = \mathrm{diag}(s_{1}(k), s_{2}(k), \cdots, s_{N}(k))$ with $s_{i} = 1$ for $i \in V_{1}$ and $s_{i} = - 1$ for $i \in V_{2}$, $\bar{y}_{0} = \mathbf{1} \otimes y_{0}$, and $\mathbf{1} = \mathrm{col}(1, \cdots, 1) \in \mathbb{R}^{N}$ is the $N$-vector of ones. The third line uses the fact that $L(k) s(k) \bar{y}_{0}(k) = 0$, which follows from the structural balance of the graph.
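The compact form of the measurement output can be checked numerically on a small example. The sketch below uses a hypothetical structurally balanced four-agent graph with arbitrary outputs (none of the numbers come from the paper). It computes $\xi_i(k)$ agent by agent from the neighbor-based definition and again as $(L + B)\varepsilon$, and also evaluates $L s \mathbf{1}$.

```python
def sign(x):
    return (x > 0) - (x < 0)


# Hypothetical balanced coopetition graph: groups {0, 1} and {2, 3}.
A = [[0, 0, 0, -1], [1, 0, 0, 0], [0, -1, 0, 0], [0, 0, 1, 0]]
b = [1, 0, 0, 0]          # only agent 0 hears the virtual leader
s = [1, 1, -1, -1]        # group signs
y = [0.3, -1.2, 0.7, 2.0] # arbitrary current outputs
y_ref = 1.5               # leader output y_0(k)
n = len(y)

# Signed Laplacian L = D - A with d_i = sum_j |a_ij|.
L = [[(sum(abs(a) for a in A[i]) if i == j else 0) - A[i][j]
      for j in range(n)] for i in range(n)]

eps = [s[i] * y_ref - y[i] for i in range(n)]   # tracking errors

# Agent-wise form (11): neighbor terms plus leader term.
xi_local = [sum(abs(A[i][j]) * (sign(A[i][j]) * y[j] - y[i]) for j in range(n))
            + b[i] * eps[i] for i in range(n)]

# Compact form (12): (L + B) * eps.
xi_compact = [sum((L[i][j] + (b[i] if i == j else 0)) * eps[j] for j in range(n))
              for i in range(n)]

# Structural balance implies L * s * 1 = 0.
L_s = [sum(L[i][j] * s[j] for j in range(n)) for i in range(n)]
```

The two vectors `xi_local` and `xi_compact` agree up to rounding, and `L_s` is the zero vector, which is exactly the identity that lets the term $L(k) s(k) \bar{y}_0(k)$ be inserted in the derivation.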

According to Equation (12), the compact form of the DMFABCT control law (6) can be written as follows:

$$\begin{aligned} u(k) &= [u_{1}(k), u_{2}(k), \cdots, u_{N}(k)]^{T} \\ &= u(k - 1) + \rho \Omega_{1}(k) \xi(k) \\ &= u(k - 1) + \rho \Omega_{1}(k) (L(k) + B(k)) \varepsilon(k) \end{aligned} \tag{13}$$

where

$$\Omega_{1}(k) = \mathrm{diag}\left( \frac{\hat{\Gamma}_{1}(k)}{\lambda + | \hat{\Gamma}_{1}(k) |^{2}}, \cdots, \frac{\hat{\Gamma}_{N}(k)}{\lambda + | \hat{\Gamma}_{N}(k) |^{2}} \right)$$

Using $\Delta y_{i}(k + 1) = \Gamma_{i}(k) \Delta u_{i}(k)$, $\Delta y_{i}(k + 1) = y_{i}(k + 1) - y_{i}(k)$, and $\Delta u_{i}(k) = u_{i}(k) - u_{i}(k - 1)$, Equation (2) can be written in compact form as follows:

$$\begin{aligned} y(k + 1) &= y(k) + \Omega_{T}(k) \Delta u(k) \\ &= y(k) + \Omega_{T}(k) (u(k) - u(k - 1)) \\ &= y(k) + \Omega_{T}(k) \left( u(k - 1) + \rho \Omega_{1}(k) (L(k) + B(k)) \varepsilon(k) - u(k - 1) \right) \\ &= y(k) + \rho \Omega_{1}(k) \Omega_{T}(k) (L(k) + B(k)) \varepsilon(k) \end{aligned} \tag{14}$$

where $\Omega_{T}(k) = \mathrm{diag}(\Gamma_{1}(k), \Gamma_{2}(k), \cdots, \Gamma_{N}(k))$. Since $\varepsilon(k) = s(k) \bar{y}_{0}(k) - y(k)$ and the reference trajectory is constant, $\varepsilon(k + 1) - \varepsilon(k) = y(k) - y(k + 1)$. Substituting (14) then gives

$$\begin{aligned} \varepsilon(k + 1) &= \varepsilon(k) - \rho \Omega_{1}(k) \Omega_{T}(k) (L(k) + B(k)) \varepsilon(k) \\ &= \left( I - \rho \Omega_{1}(k) \Omega_{T}(k) (L(k) + B(k)) \right) \varepsilon(k) \\ &= \left( I - \rho \Psi(k) (L(k) + B(k)) \right) \varepsilon(k) \\ &= (I - \rho \Xi(k)) \varepsilon(k) \end{aligned} \tag{15}$$

where $\Psi(k) = \Omega_{1}(k) \Omega_{T}(k) = \mathrm{diag}(\Phi_{1}(k), \Phi_{2}(k), \cdots, \Phi_{N}(k))$, $\Phi_{i}(k) = \frac{\Gamma_{i}(k) \hat{\Gamma}_{i}(k)}{\lambda + | \hat{\Gamma}_{i}(k) |^{2}}$, $i = 1, 2, \dots, N$, and $\Xi(k) = \Psi(k) (L(k) + B(k))$. From (15), if $\| I - \rho \Xi(k) \| < 1$ for all $k$, then $\lim_{k \to \infty} \| \varepsilon(k + 1) \| = 0$.

Step 3 (Obtaining the Convergence Condition of MASs): In this step, the convergence condition of the MAS is derived.

According to the conditions $\Gamma_{i}(k) \leq \bar{r}$, $\mathrm{sign}(\hat{\Gamma}_{i}(k)) = \mathrm{sign}(\hat{\Gamma}_{i}(1)) > 0$, $\lambda + | \hat{\Gamma}_{i}(k) |^{2} \geq 2 \sqrt{\lambda} | \hat{\Gamma}_{i}(k) |$, $\lambda_{\min} > 0$, and $\lambda > \lambda_{\min}$ for all $i = 1, 2, \dots, N$, the following inequalities can be obtained:

$$0 < \frac{\Gamma_{i}(k) \hat{\Gamma}_{i}(k)}{\lambda + | \hat{\Gamma}_{i}(k) |^{2}} \leq \frac{\bar{r} \hat{\Gamma}_{i}(k)}{2 \sqrt{\lambda} | \hat{\Gamma}_{i}(k) |} = \frac{\bar{r}}{2 \sqrt{\lambda}} < \frac{\bar{r}}{2 \sqrt{\lambda_{\min}}} < 1$$

First of all, in order to guarantee the connectivity property of the MAS under all of the communication topologies, $I - \rho \Xi(k)$ must be an irreducible matrix. Secondly, $0 < \Phi_{i}(k) < 1$ for all $i = 1, 2, \dots, N$, and $\rho$ satisfies the inequality

$$\rho < \frac{1}{\max_{i = 1, \dots, N,\ l = 1, \dots, \kappa} \left( \sum_{j = 1}^{N} | a_{ij}^{l}(k) | + b_{i}^{l}(k) \right)},$$

which means that every diagonal entry of $L(k) + B(k)$ is smaller than the reciprocal of $\rho$. In this case, the diagonal entries of $I - \rho \Xi(k)$ are positive and at least one of its row sums is strictly less than one, so $I - \rho \Xi(k)$ is an irreducible substochastic matrix with positive diagonal entries. According to (15), the following inequality can be obtained:

$$\begin{aligned} \| \varepsilon(k + 1) \| &= \| (I - \rho \Xi(k)) \varepsilon(k) \| \\ &\leq \| I - \rho \Xi(k) \| \| \varepsilon(k) \| \\ &\leq \| I - \rho \Xi(k) \| \| I - \rho \Xi(k - 1) \| \| \varepsilon(k - 1) \| \\ &\leq \| I - \rho \Xi(k) \| \| I - \rho \Xi(k - 1) \| \cdots \| I - \rho \Xi(1) \| \| \varepsilon(1) \| \end{aligned} \tag{16}$$

According to Lemma 2, the following inequality can be obtained:

$$\| \varepsilon(k + 1) \| \leq \Upsilon^{\lfloor \frac{k}{Q} \rfloor} \| \varepsilon(1) \|$$

where $\lfloor \cdot \rfloor$ stands for the floor function. Hence, the bipartite consensus tracking errors of the MAS for a fixed reference trajectory converge to the origin. □

Theorem 2

Under these circumstances where the MASs (1) satisfies Assumptions 1, 2, and 4 and its communication topology satisfies Assumption 3, apply the designed DMFBAC schemes (4)–(6) to track the time-varying reference trajectory

y_{0} (k)

, where

{\bar{y}}_{0} (k) = {[y_{0} (k), y_{0} (k), \cdot \cdot \cdot, y_{0} (k)]}^{T}

and

Δ {\bar{y}}_{0} (k) = {\bar{y}}_{0} (k + 1) - {\bar{y}}_{0} (k)

. Moreover, if

ρ

satisfies the following condition

ρ < \frac{1}{\max = 1, \dots, N, l = 1, \dots, κ \sum_{j = 1}^{N} | a_{i j}^{l} (k) | + b_{i}^{l} (k)}

‖ Δ {\bar{y}}_{0} (k) ‖ < r_{y}

and

λ > λ_{m i n} > 0

, then there exists a small constant

α

such that

\lim_{k \to \infty} ‖ ε_{i} (k) ‖ \leq α, \quad i = 1, 2, \dots, N

. The value of

α

depends on the output gain of the time-varying trajectory.

Proof:

Since

ε (k) = s (k) {\bar{y}}_{0} (k) - y (k)

, then

ε (k + 1) - ε (k) = y (k) - y (k - 1)

, so that the bipartite consensus tracking error equation (15) can be rewritten as

ε (k + 1) = (I - ρ Ξ (k)) ε (k) + Δ {\bar{y}}_{0} (k)

(17)

so that the following inequality can be obtained.

\begin{array}{ll} ‖ ε (k + 1) ‖ & \leq ‖ I - ρ Ξ (k) ‖ ‖ ε (k) ‖ + ‖ Δ {\bar{y}}_{0} (k) ‖ \\ & \leq ‖ I - ρ Ξ (k) ‖ ‖ I - ρ Ξ (k - 1) ‖ ‖ ε (k - 1) ‖ + ‖ I - ρ Ξ (k) ‖ ‖ Δ {\bar{y}}_{0} (k - 1) ‖ + ‖ Δ {\bar{y}}_{0} (k) ‖ \\ & \leq ‖ I - ρ Ξ (k) ‖ ‖ I - ρ Ξ (k - 1) ‖ \cdots ‖ I - ρ Ξ (1) ‖ ‖ ε (1) ‖ + ‖ Δ {\bar{y}}_{0} (k) ‖ + ‖ I - ρ Ξ (k) ‖ ‖ Δ {\bar{y}}_{0} (k - 1) ‖ + \cdots \\ & \quad + ‖ I - ρ Ξ (k) ‖ \cdots ‖ I - ρ Ξ (2) ‖ ‖ Δ {\bar{y}}_{0} (1) ‖ \\ & \leq ‖ I - ρ Ξ (k) ‖ ‖ I - ρ Ξ (k - 1) ‖ \cdots ‖ I - ρ Ξ (1) ‖ ‖ ε (1) ‖ + r_{y} + ‖ I - ρ Ξ (k) ‖ r_{y} \\ & \quad + ‖ I - ρ Ξ (k) ‖ ‖ I - ρ Ξ (k - 1) ‖ r_{y} + \cdots + ‖ I - ρ Ξ (k) ‖ \cdots ‖ I - ρ Ξ (2) ‖ r_{y} \end{array}

(18)

Let

O (K) = ϒ^{⌊ K Q / Q ⌋} + ϒ^{⌊ (K Q + 1) / Q ⌋} + \cdots + ϒ^{⌊ ((K + 1) Q - 1) / Q ⌋}

. Each of the Q exponents in this sum equals K, so that

O (K) = Q ϒ^{K}

, and utilizing Lemma 1, (18) can be written as follows:

\begin{array}{ll} \lim_{k \to \infty} ‖ ε (k + 1) ‖ & \leq \lim_{K \to \infty} \left( ϒ^{⌊ \frac{K}{Q} ⌋} ‖ ε (1) ‖ + \left( ϒ^{⌊ \frac{K - 1}{Q} ⌋} + ϒ^{⌊ \frac{K - 2}{Q} ⌋} + \cdots + ϒ^{⌊ \frac{0}{Q} ⌋} \right) r_{y} \right) \\ & = \lim_{K \to \infty} \left( ϒ^{⌊ \frac{K - 1}{Q} ⌋} + ϒ^{⌊ \frac{K - 2}{Q} ⌋} + \cdots + ϒ^{⌊ \frac{0}{Q} ⌋} \right) r_{y} \\ & = \lim_{K \to \infty} \left( ϒ^{⌊ \frac{(K + 1) Q - 1}{Q} ⌋} + ϒ^{⌊ \frac{(K + 1) Q - 2}{Q} ⌋} + \cdots + ϒ^{⌊ \frac{K - 1}{Q} ⌋} + \cdots + ϒ^{⌊ \frac{0}{Q} ⌋} \right) r_{y} \\ & = \lim_{K \to \infty} \left( O (K) + O (K - 1) + \cdots + O (0) \right) r_{y} \\ & = Q \lim_{K \to \infty} \left( ϒ^{K} + ϒ^{K - 1} + \cdots + ϒ^{0} \right) r_{y} \\ & = \frac{Q}{1 - ϒ} r_{y} \end{array}

(19)

where

⌊ \cdot ⌋

denotes the floor function. Finally, a bound on

‖ ε (k + 1) ‖

is obtained.

Thus, the bipartite time-varying trajectory tracking error is bounded, and the bound depends on the output gain

‖ Δ {\bar{y}}_{0} (k) ‖

of the reference trajectory. □
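The limit used in the proof, namely that the sum of ϒ^{⌊j/Q⌋} over j = 0, 1, 2, ... equals Q/(1 − ϒ), can be checked numerically. The snippet below is only an illustrative sketch: the function names are hypothetical and the numeric values of ϒ, Q and r_y are placeholders rather than values from the paper.

```python
def disturbance_gain(upsilon: float, q: int, n_terms: int) -> float:
    """Partial sum of upsilon**floor(j/q) for j = 0 .. n_terms - 1."""
    return sum(upsilon ** (j // q) for j in range(n_terms))


def ultimate_bound(upsilon: float, q: int, r_y: float) -> float:
    """Closed-form limit Q / (1 - upsilon) * r_y of the tracking-error bound."""
    if not 0.0 < upsilon < 1.0:
        raise ValueError("upsilon must lie strictly between 0 and 1")
    return q / (1.0 - upsilon) * r_y


if __name__ == "__main__":
    # Illustrative values only (assumptions, not taken from the paper).
    upsilon, q, r_y = 0.5, 3, 0.1
    print(disturbance_gain(upsilon, q, 300) * r_y)  # partial sum times r_y
    print(ultimate_bound(upsilon, q, r_y))          # closed-form limit
```

The closed form makes the dependence explicit: the bound scales linearly with r_y, so a slower reference trajectory gives a proportionally smaller ultimate error bound.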

4. Simulation

In order to illustrate the efficiency of the proposed bipartite consensus tracking algorithm, three numerical simulations with seven follower agents are performed, where agents are governed by

\begin{matrix} \text{Agent 1}: y_{1} (k + 1) = \frac{y_{1} (k) u_{1} (k)}{1 + y_{1}^{3} (k)} + 0.5 u_{1} (k), \\ \text{Agent 2}: y_{2} (k + 1) = \frac{y_{2} (k) u_{2} (k)}{1 + y_{2}^{3} (k)} + 0.45 u_{2} (k), \\ \text{Agent 3}: y_{3} (k + 1) = \frac{y_{3} (k) u_{3} (k)}{1 + y_{3}^{5} (k)} + 0.7 u_{3} (k), \\ \text{Agent 4}: y_{4} (k + 1) = \frac{y_{4} (k) u_{4} (k)}{1 + y_{4}^{5} (k)} + 0.6 u_{4} (k), \\ \text{Agent 5}: y_{5} (k + 1) = \frac{y_{5} (k) u_{5} (k)}{1 + y_{5}^{7} (k)} + 0.9 u_{5} (k), \\ \text{Agent 6}: y_{6} (k + 1) = \frac{y_{6} (k) u_{6} (k)}{1 + y_{6}^{7} (k)} + 0.75 u_{6} (k), \\ \text{Agent 7}: y_{7} (k + 1) = \frac{y_{7} (k) u_{7} (k)}{1 + y_{7}^{7} (k)} + 0.65 u_{7} (k) . \end{matrix}

Each agent has a distinct dynamic model, so the considered MASs are heterogeneous. Note that the above models are used only to generate the I/O data of the MASs; the DMFABCT algorithm does not use any model information, and the dynamics of the MASs are treated as unknown throughout the design of the algorithm.
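For readers who want to reproduce the I/O data, the seven plant models can be written compactly as below. This is a minimal sketch under the assumption that each agent depends only on its own output and input; the names AGENT_PARAMS and agent_step are hypothetical, and the code plays the role of the data generator only, not of the controller.

```python
# (exponent in the denominator, input gain) for agents 1..7, read off the models above.
AGENT_PARAMS = [
    (3, 0.5),
    (3, 0.45),
    (5, 0.7),
    (5, 0.6),
    (7, 0.9),
    (7, 0.75),
    (7, 0.65),
]


def agent_step(i: int, y: float, u: float) -> float:
    """Return y_i(k+1) from y_i(k) and u_i(k) for agent i (1-based index)."""
    power, gain = AGENT_PARAMS[i - 1]
    # Note: the denominator 1 + y**power vanishes at y = -1 for these odd powers.
    return y * u / (1.0 + y ** power) + gain * u


if __name__ == "__main__":
    for i in range(1, 8):
        print(i, agent_step(i, 1.0, 2.0))
```

Keeping the plants behind a single function mirrors the model-free setting: a controller under test only ever sees the returned outputs, never the parameters.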

The communication topology of the considered MASs is shown in Figure 1. The virtual leader is denoted by vertex 0, and the followers are divided into two alliances in each topology. In Figure 1, black solid lines express cooperative relationships among agents, and dotted lines denote competitive relationships. Only a subset of agents can directly receive information from the leader, and information is transmitted among agents only along the arrows, whose directions are fixed. Although the other agents cannot directly get commands from the virtual leader, all of the communication graphs satisfy Assumption 3, so the virtual leader can intervene in the two competitive alliances. As the matrices above show, the reciprocal of the greatest diagonal entry of

L (l) + B (l)

is 0.5 for

l = 1, 2, 3

. In order to satisfy the convergence condition for all

i = 1, 2, 3, 4, 5, 6, 7

in Theorem 2, we choose the controller parameters as

ρ = 0.3

for each simulation and the other parameters are selected as

p = 0.5

,

w = 1

,

λ = 0.5

, and

c = 10^{- 4}

.
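The step-size condition on ρ can be checked mechanically once the signed adjacency weights and pinning gains of every topology are known. The sketch below assumes a small hypothetical three-agent topology, because the actual weights of Figure 1 are not reproduced in this text; the function name rho_upper_bound is likewise an illustration.

```python
def rho_upper_bound(adjacencies, pinnings):
    """Reciprocal of the largest value of sum_j |a_ij| + b_i over all topologies and agents."""
    worst = max(
        sum(abs(a) for a in row) + b
        for adjacency, pinning in zip(adjacencies, pinnings)
        for row, b in zip(adjacency, pinning)
    )
    return 1.0 / worst


if __name__ == "__main__":
    # Hypothetical signed topology: +1 is a cooperative edge, -1 a competitive edge.
    adjacency = [
        [0, 1, -1],
        [1, 0, 0],
        [-1, 0, 0],
    ]
    pinning = [0, 1, 0]  # only agent 2 receives the leader's signal directly
    bound = rho_upper_bound([adjacency], [pinning])
    print(bound, 0.3 < bound)
```

With this placeholder topology the bound evaluates to 0.5, the same value the text reports for the topologies of Figure 1, so the chosen ρ = 0.3 satisfies the condition.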

4.1. Fixed Trajectory Tracking Example

To make the result of this simulation easy to read, the piecewise switching topology and the desired reference trajectory are chosen as follows:

\begin{cases} {\bar{G}}_{1}, & 0 \leq k \leq 400 \\ {\bar{G}}_{2}, & 400 < k \leq 800 \\ {\bar{G}}_{3}, & 800 < k \leq 1400 \end{cases}

y_{0} (k) = \begin{cases} 10, & 0 < k < 400 \\ 20, & 400 \leq k < 800 \\ 15, & 800 \leq k < 1400 \end{cases}
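The two schedules can be encoded directly, which also makes their boundary conventions explicit. This is a small sketch with hypothetical function names; the intervals are copied from the definitions above, including the fact that the reference switches at k = 400 while the topology switches one step later.

```python
def reference(k: int) -> float:
    """Piecewise-constant desired trajectory y_0(k) of example 1."""
    if 0 < k < 400:
        return 10.0
    if 400 <= k < 800:
        return 20.0
    if 800 <= k < 1400:
        return 15.0
    raise ValueError("k outside the simulated horizon")


def topology_index(k: int) -> int:
    """Index l of the active communication graph G_l at step k."""
    if 0 <= k <= 400:
        return 1
    if 400 < k <= 800:
        return 2
    if 800 < k <= 1400:
        return 3
    raise ValueError("k outside the simulated horizon")


if __name__ == "__main__":
    for k in (399, 400, 401, 800, 801):
        print(k, reference(k), topology_index(k))
```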

Initial conditions are chosen as

u_{i} (1) = 0

,

{\hat{Γ}}_{i} (1) = 2

for all agents and

y_{1} (1) = 0.5

,

y_{2} (1) = 3.5

,

y_{3} (1) = 6.5

,

y_{4} (1) = 4.5

,

y_{5} (1) = 1.5

,

y_{6} (1) = 5.5

,

y_{7} (1) = 5.5

in this simulation.

The simulation results of the bipartite tracking performance, tracking errors, and PPD estimation of each agent are shown in Figure 2, Figure 3 and Figure 4, respectively.

From Figure 2, Figure 3 and Figure 4 it can be seen that the outputs of the followers initially deviate strongly from the leader, but the bipartite tracking errors decrease rapidly and bipartite tracking is realized after a few steps. For example, in Figure 2, the value of the trajectory changes from 10 to 20 at

k = 400

and several agents switch groups at the same time, yet a new bipartite consensus is achieved after only about 100 steps, as Figure 3 also reveals. Furthermore, Figure 4 shows that changes of the topology and of the desired trajectory affect the PPD estimates of each agent, but the estimates reach stable values immediately, which shows that the proposed DMFABCT has good robustness.

4.2. Time-Varying Trajectory Tracking Example

In this example, the bipartite consensus time-varying trajectory tracking is discussed, and the desired trajectory is

y_{0} (k + 1) = 90 \cos (k π / Ψ) + 100

where

Ψ = 2200

is the output gain rate and the time-varying topologies are governed by

\begin{cases} {\bar{G}}_{1}, & 0 \leq k \leq 2500 \\ {\bar{G}}_{2}, & 2500 < k \leq 5000 \\ {\bar{G}}_{3}, & 5000 < k \leq 8000 \end{cases}

where the initial data of

y_{i} (k)

,

u_{i} (k)

, the dynamics of each agent, and the other parameters are as defined earlier in this section.

The bipartite consensus tracking performance of this example and the tracking errors of each agent are presented in Figure 5, which shows that the DMFABCT scheme reduces the tracking errors substantially. Although the bipartite tracking errors cannot be eliminated, they converge to a small bound, as demonstrated in Figure 6 and Figure 7. Relative to the desired output of the agents, the maximum distortion rate obtained from Figure 7 is 0.084%. This result demonstrates that MASs with switching topologies can also perform bipartite time-varying tracking tasks. Figure 8 leads to the same conclusion: the MASs adjust the values of the PPDs to adapt to environmental changes and thereby obtain a high fault-tolerance property.

Comparing the tracking performance for the two kinds of trajectories in Figure 3 and Figure 6, we can conclude that fixed trajectory tracking performs better than time-varying trajectory tracking, which is consistent with the theoretical analysis in Section 3. In addition, to further analyze what drives the errors in time-varying trajectory tracking, we change the output gain rate

Ψ

of the desired trajectory

y_{0} (k + 1) = 90 \cos (k π / Ψ) + 100

from 500 to 4000 and compare the tracking performance. From Figure 9, we can see that the error rates of all agents decrease as the value of

Ψ

increases. The error rates of MASs at

Ψ = 500

,

Ψ = 2200

, and

Ψ = 4000

are shown in Figure 7, Figure 10 and Figure 11, respectively. Although the biggest error rate of MASs at

Ψ = 500

is about 0.418%, it bounds the error rates of all agents, which means that the errors of the MASs are also bounded. Furthermore, the error rates of each agent shown in Figure 11 are close to the origin, which further supports Theorem 2. Meanwhile, we can conclude that the MASs are stable under the proposed DMFABCT scheme and that the tracking errors depend on the output gain

‖ Δ {\bar{y}}_{0} (k) ‖

of the reference trajectory.
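The dependence on Ψ can be made concrete by computing the largest one-step change of the reference, which plays the role of r_y in Theorem 2. The sketch below uses hypothetical function names; the analytic bound 90π/Ψ follows from the slope of the cosine and is stated here as a supporting fact rather than a result of the paper.

```python
import math


def reference(k: int, psi: float) -> float:
    """Desired trajectory 90*cos(k*pi/psi) + 100 (the index shift does not affect increments)."""
    return 90.0 * math.cos(k * math.pi / psi) + 100.0


def max_increment(psi: float, steps: int) -> float:
    """Largest one-step change of the reference over the first `steps` steps."""
    return max(abs(reference(k + 1, psi) - reference(k, psi)) for k in range(steps))


if __name__ == "__main__":
    for psi in (500, 2200, 4000):
        # Numerical maximum next to the analytic bound 90*pi/psi.
        print(psi, max_increment(psi, 8000), 90.0 * math.pi / psi)
```

The increment shrinks roughly in proportion to 1/Ψ, which matches the observation that the error rates decrease as Ψ grows.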

4.3. Realistic DC Linear Motors Example

In this case, we use seven permanent magnet DC linear motors to perform fixed and time-varying trajectory bipartite consensus tracking tasks. The realistic dynamics of the DC linear motor are investigated in [37,40] and modeled as follows:

\begin{cases} \dot{x} (t) = v (t) \\ \dot{v} (t) = \frac{u (t) - f_{friction} (t) - f_{ripple} (t)}{m} \\ y (t) = v (t) \end{cases}

where

t

is continuous time (s),

x (t)

is the position (m),

v (t)

is the speed (m/s),

m

is the combined mass of translator and load,

u (t)

is the developed force (N),

f_{f r i c t i o n} (t)

is the friction force (N), and

f_{r i p p l e} (t)

is the ripple force (N). The friction and ripple forces have been identified as:

\begin{array}{l} f_{friction} (t) = \left( f_{c} + (f_{s} - f_{c}) e^{- {(\frac{\dot{x}}{{\dot{x}}_{δ}})}^{δ}} + f_{v} \dot{x} \right) \operatorname{sign} (\dot{x}) \\ f_{ripple} (t) = b_{1} \sin (w_{0} x (t)) \end{array}

where

f_{c}

is the minimum level of Coulomb friction and

f_{s}

is the level of static friction,

{\dot{x}}_{δ}

and

f_{v}

are lubricant and load parameters, respectively.

δ

is an additional empirical parameter. In this example, these parameters are selected as:

m = 0.59 \, \text{kg}

,

{\dot{x}}_{δ} = 0.1

,

δ = 1

,

f_{c} = 10 \, \text{N}

,

f_{s} = 20 \, \text{N}

,

f_{v} = 10 \, \text{N} \cdot \text{s} \cdot \text{m}^{- 1}

,

b_{1} = 8.5 \, \text{N}

,

w_{0} = 314 \, \text{s}^{- 1}

. The desired velocity is given as

y_{0} (t) = 90 \cos (t π / 4000) + 100, \quad t \in [0, 8]

Using the Euler formula to discretize the above model and selecting the sampling time as

h = 0.001

, we have

T = 1000

.
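A forward-Euler discretization of the motor model can be sketched as follows. The constant and function names are hypothetical, the friction law is implemented exactly as written above (which is well defined for the chosen δ = 1), and treating sign(0) as 0 at standstill is a simplifying assumption of this sketch.

```python
import math

# Parameters as listed in the text.
M, X_DELTA, DELTA = 0.59, 0.1, 1.0
F_C, F_S, F_V = 10.0, 20.0, 10.0
B1, W0 = 8.5, 314.0
H = 0.001  # sampling time


def friction(v: float) -> float:
    """Friction force for speed v; returns 0 at v == 0 (sign(0) taken as 0)."""
    if v == 0.0:
        return 0.0
    magnitude = F_C + (F_S - F_C) * math.exp(-((v / X_DELTA) ** DELTA)) + F_V * v
    return magnitude * math.copysign(1.0, v)


def ripple(x: float) -> float:
    """Position-dependent ripple force."""
    return B1 * math.sin(W0 * x)


def euler_step(x: float, v: float, u: float, h: float = H):
    """One forward-Euler step of the motor; the measured output is the new speed."""
    accel = (u - friction(v) - ripple(x)) / M
    return x + h * v, v + h * accel


if __name__ == "__main__":
    x, v = 0.0, 0.0
    for _ in range(5):
        x, v = euler_step(x, v, 30.0)  # constant test force, illustrative only
        print(x, v)
```

Each call advances position and speed by one sample, so a controller under test would call euler_step once per iteration and read the second return value as y(k+1).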

In this case, random noise is introduced into the output measurement data of each DC motor. The noise is bounded within

[- 0.02, 0.02]

. Here, we use the same parameters and the communication topology as those of example 2 to perform the simulation.

The fixed trajectory bipartite consensus tracking performance of the seven DC motors is shown in Figure 12, and the other tracking task is presented in Figure 13. In both simulations several agents change their alliance, yet the tracking errors of the MASs are reduced in both bipartite consensus tracking tasks, which further supports the effectiveness and applicability of the designed DMFABCT.

As shown above, the proposed DMFABCT scheme is correct and effective.

5. Conclusions

In this work, a data-driven bipartite consensus tracking scheme has been proposed for unknown nonlinear discrete-time multi-agent systems with switching topologies, and a compact form linearization model has been established. The algorithm ensures that all agents can track both fixed and time-varying desired trajectories and realize bipartite tracking. Compared with model-based control algorithms, one of the main advantages of our method is that it does not need the agents' dynamics and requires only input–output data. Moreover, both the cooperative and the competitive relationships among agents are considered, and the convergence and stability of the algorithm are proven by rigorous mathematical analysis. Simulations of the bipartite consensus tracking algorithm have been presented to validate its effectiveness. In future work, we will consider the bipartite consensus problem for multi-input-multi-output multi-agent systems with delay and disturbances.

Author Contributions

Conceptualization, H.Z.; Funding acquisition, L.P.; Software, H.Z.; Validation, L.P. and H.Y.; Writing-original draft, H.Z.; Writing-review and editing, L.P. and H.Y. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Key R&D Program of China (2018YFD0400902); The National Natural Science Foundation of China (61873112); Jiangsu Planned Projects for Postdoctoral Research Funds (1601085C); Jiangsu Key Construction Laboratory of IoT Application Technology (190449, 190450); Postgraduate Research & Practice Innovation Program of Jiangnan University (JNKY19_043).

Acknowledgments

The authors would like to thank the reviewers who have helped improve the presentation of the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Hock, A.; Schoellig, A.P. Distributed iterative learning control for multi-agent systems. Auton. Robot. 2019, 43, 1989–2010.
2. Hui, Y.; Chi, R.; Huang, B.; Hou, Z. 3-D Learning-Enhanced Adaptive ILC for Iteration-Varying Formation Tasks. IEEE Trans. Neural Netw. Learn. Syst. 2020, 31, 89–99.
3. Oh, K.K.; Park, M.C.; Ahn, H.S. A survey of multi-agent formation control. Automatica 2015, 53, 424–440.
4. Hou, Z.; Chi, R.; Gao, H. An overview of dynamic-linearization-based data-driven control and applications. IEEE Trans. Ind. Electron. 2016, 64, 4076–4090.
5. Barbot, J.-P.; Levant, A.; Livne, M.; Lunz, D. Discrete differentiators based on sliding modes. Automatica 2020, 112, 108633.
6. Emelyanov, S.V.; Korovin, S.K.; Levantovsky, L.V. Second order sliding modes in controlling uncertain systems. Sov. J. Comput. Syst. Sci. 1986, 24, 63–68.
7. Li, H.; Zhu, Y. Consensus of second-order delayed nonlinear multi-agent systems via node-based distributed adaptive completely intermittent protocols. Appl. Math. Comput. 2018, 326, 1–15.
8. Xu, Z.; Li, C.; Han, Y. Leader-following fixed-time quantized consensus of multi-agent systems via impulsive control. J. Frankl. Inst. 2019, 356, 441–456.
9. Yang, S.; Liao, X.; Liu, Y.; Chen, X. Consensus of delayed multi-agent dynamical systems with stochastic perturbation via impulsive approach. Neural Comput. Appl. 2016, 28, 647–657.
10. Yang, S.; Liao, X.; Liu, Y. Second-order consensus in directed networks of identical nonlinear dynamics via impulsive control. Neurocomputing 2016, 179, 290–297.
11. Wang, Y.; Zheng, L.; Zhang, H.; Zheng, W.X. Fuzzy Observer-based Repetitive Tracking Control for Nonlinear Systems. IEEE Trans. Fuzzy Syst. 2019, 1.
12. Bu, X.; Hou, Z.; Zhang, H. Data-Driven Multiagent Systems Consensus Tracking Using Model Free Adaptive Control. IEEE Trans. Neural Netw. Learn. Syst. 2017, 29, 1514–1524.
13. Li, D.; He, S.; Xi, Y.; Liu, T.; Gao, F.; Wang, Y.; Lu, J. Synthesis of ILC–MPC Controller With Data-Driven Approach for Constrained Batch Processes. IEEE Trans. Ind. Electron. 2020, 67, 3116–3125.
14. Yang, N.; Li, J. New distributed adaptive protocols for uncertain nonlinear leader-follower multi-agent systems via a repetitive learning control approach. J. Frankl. Inst. 2019, 356, 6571–6590.
15. Odekunle, A.; Gao, W.; Davari, M.; Jiang, Z.-P. Reinforcement learning and non-zero-sum game output regulation for multi-player linear uncertain systems. Automatica 2020, 112, 108672.
16. Zhang, H.; Jiang, H.; Luo, Y.; Xiao, G. Data-Driven Optimal Consensus Control for Discrete-Time Multi-Agent Systems with Unknown Dynamics Using Reinforcement Learning Method. IEEE Trans. Ind. Electron. 2017, 64, 4091–4100.
17. Zhang, H.; Yue, D.; Dou, C.; Zhao, W.; Xie, X. Data-Driven Distributed Optimal Consensus Control for Unknown Multiagent Systems with Input-Delay. IEEE Trans. Cybern. 2019, 49, 2095–2105.
18. Wu, H.; Song, S.; You, K.; Wu, C. Depth Control of Model-Free AUVs via Reinforcement Learning. IEEE Trans. Syst. Man Cybern. Syst. 2019, 49, 2499–2510.
19. Liu, L.; Liu, Y.-J.; Tong, S. Neural Networks-Based Adaptive Finite-Time Fault-Tolerant Control for a Class of Strict-Feedback Switched Nonlinear Systems. IEEE Trans. Cybern. 2018, 49, 2536–2545.
20. Ren, H.; Zhang, H.; Su, H.; Mu, Y. Data-based stable value iteration optimal control for unknown discrete-time systems with time delays. Neurocomputing 2020, 382, 96–105.
21. Liu, D.; Yang, G.-H. Performance-based data-driven model-free adaptive sliding mode control for a class of discrete-time nonlinear processes. J. Process. Control. 2018, 68, 186–194.
22. Kadri, M.B. Model-Free Fuzzy Adaptive Control for MIMO Systems. Arab. J. Sci. Eng. 2017, 42, 2799–2808.
23. Radac, M.-B.; Precup, R.-E.; Roman, R.-C. Data-driven model reference control of MIMO vertical tank systems with model-free VRFT and Q-Learning. ISA Trans. 2018, 73, 227–238.
24. Wu, Y.; Zhao, Y.; Hu, J. Bipartite Consensus Control of High-Order Multiagent Systems with Unknown Disturbances. IEEE Trans. Syst. Man Cybern. Syst. 2019, 49, 2189–2199.
25. Altafini, C. Consensus Problems on Networks with Antagonistic Interactions. IEEE Trans. Autom. Control. 2012, 58, 935–946.
26. Wu, J.; Deng, Q.; Han, T.; Yang, Q.-S.; Zhan, H. Bipartite tracking consensus for multi-agent systems with Lipschitz-Type nonlinear dynamics. Phys. A Stat. Mech. Appl. 2019, 525, 1360–1369.
27. Ning, B.; Han, Q.L.; Zuo, Z. Bipartite Consensus Tracking for Second-Order Multi-Agent Systems: A Time-Varying Function Based Preset-Time Approach. IEEE Trans. Autom. Control 2020. (accepted).
28. Bhowmick, S.; Panja, S. Leader-Follower Bipartite Consensus of Linear Multiagent Systems Over a Signed Directed Graph. IEEE Trans. Circuits Syst. II Express Briefs 2018, 66, 1436–1440.
29. Xu, Y.; Wang, J.; Zhang, Y.; Xu, Y. Event-triggered bipartite consensus for high-order multi-agent systems with input saturation. Neurocomputing 2020, 379, 284–295.
30. Wang, H.; Yu, W.; Wen, G.; Chen, G. Finite-Time Bipartite Consensus for Multi-Agent Systems on Directed Signed Networks. IEEE Trans. Circuits Syst. I Regul. Pap. 2018, 65, 4336–4348.
31. Deng, Q.; Wu, J.; Han, T.; Yang, Q.-S.; Cai, X.-S. Fixed-time bipartite consensus of multi-agent systems with disturbances. Phys. A Stat. Mech. Appl. 2019, 516, 37–49.
32. Peng, Z.; Hu, J.; Shi, K.; Luo, R.; Huang, R.; Ghosh, B.K.; Huang, J. A novel optimal bipartite consensus control scheme for unknown multi-agent systems via model-free reinforcement learning. Appl. Math. Comput. 2020, 369, 124821.
33. Li, H. Reverse Group Consensus of Second-Order Multi-Agent Systems with Delayed Nonlinear Dynamics in the Cooperation-Competition Networks. IEEE Access 2019, 7, 71095–71108.
34. Li, E.; Ma, Q.; Zhou, G. Bipartite output consensus for heterogeneous linear multi-agent systems with fully distributed protocol. J. Frankl. Inst. 2019, 356, 2870–2884.
35. Zhao, L.; Jia, Y.; Yu, J. Adaptive finite-time bipartite consensus for second-order multi-agent systems with antagonistic interactions. Syst. Control. Lett. 2017, 102, 22–31.
36. Li, H. H∞ group consensus for partial-state coupled linear systems with fixed and switching topologies in the cooperation-competition networks. J. Frankl. Inst. 2020, 357, 314–342.
37. Bu, X.; Yu, Q.; Hou, Z.; Qian, W. Model Free Adaptive Iterative Learning Consensus Tracking Control for a Class of Nonlinear Multiagent Systems. IEEE Trans. Syst. Man Cybern. Syst. 2019, 49, 677–686.
38. Hou, Z.; Jin, S. A Novel Data-Driven Control Approach for a Class of Discrete-Time Nonlinear Systems. IEEE Trans. Control. Syst. Technol. 2010, 19, 1549–1558.
39. Yang, S.; Xu, J.-X.; Li, X. Iterative learning control with input sharing for multi-agent consensus tracking. Syst. Control. Lett. 2016, 94, 97–106.
40. Armstrong-Hélouvry, B.; Dupont, P.E.; De Wit, C.C. A survey of models, analysis tools and compensation methods for the control of machines with friction. Automatica 1994, 30, 1083–1138.

Figure 1. Communication topology among agents.

Figure 2. Tracking performance of each agent (example 1).

Figure 3. Tracking errors of each agent (example 1).

Figure 4. Pseudo partial derivative (PPD) estimation of each agent (example 1).

Figure 5. Tracking performance of each agent (example 2).

Figure 6. Tracking errors of each agent (example 2).

Figure 7. Tracking error rate of each agent at

Ψ = 2200

(example 2).

Figure 8. PPD estimation of each agent (example 2).

Figure 9. Tracking error rate of each agent at

Ψ = 500

(example 2).

Figure 10. Tracking error rate of each agent at

Ψ = 4000

(example 2).

Figure 11. Tracking error rate of each agent at

Ψ \in [500, 4000]

(example 2).

Figure 12. Tracking errors of each agent (example 3).

Figure 13. Tracking errors of each agent (example 4).

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhao, H.; Peng, L.; Yu, H. Distributed Model-Free Bipartite Consensus Tracking for Unknown Heterogeneous Multi-Agent Systems with Switching Topology. Sensors 2020, 20, 4164. https://doi.org/10.3390/s20154164

