Distinguishing the Leading Agents in Classification Problems Using the Entropy-Based Metric

Kagan, Evgeny; Ben-Gal, Irad

doi:10.3390/e26040318

by Evgeny Kagan ¹,* and Irad Ben-Gal ²

¹ Department of Industrial Engineering, Ariel University, Ariel 4076414, Israel

² Department of Industrial Engineering, Tel Aviv University, Tel Aviv 6997801, Israel

* Author to whom correspondence should be addressed.

Entropy 2024, 26(4), 318; https://doi.org/10.3390/e26040318

Submission received: 4 March 2024 / Revised: 2 April 2024 / Accepted: 2 April 2024 / Published: 5 April 2024

(This article belongs to the Section Multidisciplinary Applications)


Abstract

The paper addresses the problem of distinguishing the leading agents in the group. The problem is considered in the framework of classification problems, where the agents in the group select the items with respect to certain properties. The suggested method of distinguishing the leading agents utilizes the connectivity between the agents and the Rokhlin distance between the subgroups of the agents. The method is illustrated by numerical examples. The method can be useful in considering the division of labor in swarm dynamics and in the analysis of the data fusion in the tasks based on the wisdom of the crowd techniques.

Keywords:

leading agents; classification; entropy; Rokhlin metric

1. Introduction

The behavior of the group of autonomous agents includes a variety of physical and cognitive actions, like collective motion and cooperative decision-making. Each of these actions depends on the individual abilities of the agents, on the division of the agents among the teams, and on the division of labor between the agents and the teams.

To simplify control of the teams’ and the group’s activities, some of the agents are often designated as leaders, whose task is to influence the other agents and encourage them to fulfill the mission. Such procedures are widely known as elections in distributed systems, which are considered both in social and political processes and in different contexts of computer science and robots’ control [1,2,3].

Given a group of communicating agents, elections are conducted as follows: The agents consider the candidate agents, and after deliberations, they identify and label a certain agent as the leader. Then, the elected leader influences the other agents and coordinates their activities [1]. Certainly, each team in the group can elect its leader, and then these leaders elect the leader of the group, which results in hierarchical control in the system.

A process close to the election of the leader is the selection of the leader [4,5,6,7]. In this process, the leader is selected according to the known characteristics of the agents and their correspondence with the characteristics required for fulfilling the mission of the group. Usually, after the selection of the leader, the other agents are considered followers. Note that, in contrast to the election of the leader, the leader selection is not necessarily conducted by the agents but can be performed by the central coordinator or controller of the group.

Now, assume that the group leader or the team leader already exists. Then an inverse problem arises: to distinguish the leaders in the distributed system. Namely, given a group of communicating agents, it is required to identify the leaders, which are the agents who most strongly influence the other agents in the group.

In the paper, the problem of distinguishing the leading agents is considered in the context of the classification problem [8,9]. In such a classification, it is assumed that the agents have different levels of expertise, and the cooperative classification is obtained by a certain version of plurality voting.

The most popular method of avoiding the influence of non-competent agents uses the weighted opinions of the agents. This method was implemented in the well-known Dawid–Skene algorithm [10,11], which iterates the expectation of the correct choice with respect to the agents’ expertise and maximizes the likelihood of the agents’ expertise with respect to the expected correct choice.

Another approach to selecting the competent agents was implemented in the algorithms [12,13], which are based on the similarities of the agents’ classifications, and in the algorithm [14], where the competent agents are selected using the expectation bias [15].

After selecting the subgroup of competent agents, the resulting classification is obtained using the opinions of these agents and ignoring the opinions of non-competent agents.

However, studies in social psychology [16], which can be traced back to the well-known experiments by Asch [17,18], demonstrate that the opinion of the group member is highly influenced by the opinions of the other members of the group. Consequently, competent agents are not necessarily the most influential or leading agents. Together with that, it is reasonable to assume that the leader must be competent in certain fields and be elected on the basis of this competence (see the 11th rule by Peterson [19]).

In the paper, we suggest a method of distinguishing the leaders in the group. The method considers the connectivity between the agents and creates the subgroups by maximizing the distances between the partitions formed by these subgroups.

Note that the elections in the group are internal processes in which the leaders are identified following certain criteria known to the agents, while distinguishing the leaders is an external operation in which the agents are characterized by their relations with the other agents. Thus, the distinguished leaders can differ from the elected leaders: elections specify who will govern, and distinguishing specifies who must govern.

The rest of the paper is organized as follows: Section 2 includes a formulation of the problem, and Section 3 considers an example that clarifies the problems of classification, distinguishing the competent agents, and distinguishing the leading agents. Section 4 presents a suggested solution based on the connectivity between the agents and the Rokhlin distance between the agents’ subgroups. Section 5 presents the methods of distinguishing the experts, and Section 6 considers two examples that illustrate the relationship between the group of experts and the group of leaders. Section 7 concludes the discourse.

2. Problem Formulation

The main problem considered in the paper is the problem of distinguishing the leaders in the group of agents. As indicated above, this problem differs from the problem of electing the leader and requires consideration of communication between the agents.

We consider the problem in the context of the classification of given items by a group of agents, where some of the agents are experts in the field of classification and others are dilettantes.

Consideration of these two problems gives rise to the third problem, which is the problem of relations between the group of leaders and the group of experts.

2.1. Distinguishing the Leaders

Let

A = \{a_{1}, a_{2}, \dots, a_{l}\}

be a group of communicating agents conducting a certain common mission. Communication between the agents is defined by the directed graph

G = (V, E)

in which the vertices

v \in V

are associated with the agents and the edges

e \in E

are defined by the adjacency matrix

R = {(r_{i j})}_{l \times l}

such that

r_{i j} = 1

if agent

a_{i}

communicates with agent

a_{j}

and

r_{i j} = 0

otherwise.

The problem of distinguishing the leaders in group

A

is formulated as follows: Given the adjacency matrix

R,

it is required to recognize a subgroup

A^{*} \subset A

of the agents such that the agents

a^{*} \in A^{*}

have maximal influence in the group

A

.

Consideration of the agent’s influence in the group is based on the following intuition inspired by the flows on graphs (see, e.g., [20]). Assume that each vertex

v \in V

in the graph

G

is a source of unique colored flow and that the color of the vertex is a mix of its own color and the colors of the incoming flows. Then, the influence of the agent is associated with the impact of its vertex on the colors of the other vertices in the graph

G

.

The vertices

v^{*} \in V^{*} \subset V

associated with the leading agents

a^{*} \in A^{*}

are called the leading vertices. The leading agent such that its vertex in the graph

G

has no predecessors is called a dictator, and the leading agent whose vertex is a separating vertex (see, e.g., [21]) is called a monarch.

2.2. Distinguishing the Experts

Let

X = \{x_{1}, \dots, x_{n}\}

be a set of

n

items, which represent certain objects, concepts, or symbols. The classification problem requires labeling the items

x_{i} \in X

,

i = 1, 2, \dots, n

, by

m

labels,

1 < m < n

, such that each item is labeled by a single label.

Formally, the problem is to distribute the items

x_{i} \in X

,

i = 1, 2, \dots, n

, over

m

sets

C_{1}, C_{2}, \dots, C_{m}

,

1 < m < n

, called classes, such that each item

x_{i}

is included only in one class

C_{j}

and every item is included in some class. Then, the resulting classification is the partition

γ = \{C_{1}, C_{2}, \dots, C_{m}\}

of the set

X

, where

C_{j} \subset X

,

j = 1, \dots, m

,

C_{j^{'}} \cap C_{j^{″}} = \emptyset

for

j^{'} \neq j^{″}

, and

⋃_{j = 1}^{m} C_{j} = X

.
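The partition conditions above (pairwise disjoint classes whose union is the whole set of items) can be verified mechanically. The following minimal Python sketch shows such a check; the function name and the small item set are hypothetical and serve only as an illustration.

```python
from typing import Iterable, List, Set


def is_partition(classes: List[Set[int]], items: Iterable[int]) -> bool:
    """True if the classes are pairwise disjoint and their union equals the item set."""
    seen: Set[int] = set()
    for c in classes:
        if seen & c:        # an item already placed in an earlier class
            return False
        seen |= c
    return seen == set(items)


# Hypothetical example with n = 6 items and m = 3 classes.
X = range(1, 7)
gamma = [{1, 4}, {2, 3, 6}, {5}]
print(is_partition(gamma, X))
```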

If classification is conducted by a single agent, then the resulting classification

γ

depends on the competence of this agent. The quality of classification is defined by the difference between the classification

γ

and the correct classification

\overset{ˇ}{γ}

. Certainly, the correct classification

\overset{ˇ}{γ}

is not available to the agent and is used for testing the classification methods.

Now assume that the classification is conducted by the above-mentioned group

A = \{a_{1}, a_{2}, \dots, a_{l}\}

of agents where each agent

a_{k} \in A

,

k = 1, 2, \dots, l

, provides classification represented by the partition

γ_{k} = \{C_{k, 1}, C_{k, 2}, \dots, C_{k, m}\}

. By the general assumption of “the wisdom of the crowd” techniques [8,9], some combination of the agents’ classification will provide classification

γ

, which is as close as possible to the correct classification

\overset{ˇ}{γ}

.

Then, the problem is to aggregate the agents’ partitions

γ_{k}

into a single partition

γ

such that it, as best as possible, represents the correct partition

\overset{ˇ}{γ}

.

The simplest method of creating the partition

γ

is plurality voting. By this method, the item

x

is included in the class

C,

which was chosen by most agents

a \in A

(the ties are broken randomly). Despite its popularity, this method strongly depends on the competence of the agents, such that non-competent agents can influence the resulting classification.
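Plurality voting as described above can be sketched in a few lines of Python. The sketch assumes that each agent's classification is given as a list of class indices, one per item; this encoding, the function names, and the small data set are assumptions made for the illustration only.

```python
import random
from collections import Counter
from typing import List, Optional, Set


def plurality_vote(labels: List[List[int]],
                   rng: Optional[random.Random] = None) -> List[int]:
    """For each item, return the class chosen by most agents; ties are broken randomly.

    labels[k][i] is the class index that agent k assigned to item i.
    """
    rng = rng or random.Random()
    result = []
    for i in range(len(labels[0])):
        counts = Counter(agent[i] for agent in labels)
        top = max(counts.values())
        winners = sorted(c for c, votes in counts.items() if votes == top)
        result.append(rng.choice(winners))
    return result


def to_partition(assignment: List[int], m: int) -> List[Set[int]]:
    """Convert per-item class indices into a list of m classes (sets of item indices)."""
    classes: List[Set[int]] = [set() for _ in range(m)]
    for item, c in enumerate(assignment):
        classes[c].add(item)
    return classes


# Hypothetical example: 3 agents, 3 items, 3 classes, no ties.
labels = [
    [0, 1, 1],
    [0, 1, 2],
    [1, 1, 2],
]
votes = plurality_vote(labels, random.Random(0))
print(votes, to_partition(votes, 3))
```

Every agent has the same weight in this sketch, which is exactly why a majority of non-competent agents can determine the result.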

To avoid such influence, in more sophisticated methods [10,11,12,13,14], the problem is divided into two stages. First, using the agents’ classifications

γ_{k}

,

k = 1, 2, \dots, l

, the agents competent in certain classes are distinguished, and second, the resulting classification

γ

is obtained by aggregation of the classes provided by the competent agents.

2.3. Relationship between the Leaders and the Experts

Finally, assume that the group

A = \{a_{1}, a_{2}, \dots, a_{l}\}

of

l

communicating agents classifies the set

X = \{x_{1}, \dots, x_{n}\}

of

n

items to

m

classes

C_{1}, C_{2}, \dots, C_{m}

,

1 < m < n

.

In addition, assume that the group

A

of the agents includes a non-empty subgroup

A^{*} \subset A

of leaders and non-empty subgroup

A^{'} \subset A

of experts.

Then, the problem is to check whether there exists a relationship between the set of leaders

A^{*}

and the set

A^{'}

of experts, and if it exists, what this relationship is.

We assume that in real-world situations, such a relationship is possible, and the leaders are assumed to be experts in at least one class. The problem is to confirm or reject this hypothesis.

3. Illustrative Example

Assume that the group

A

includes

l = 9

communicating agents and that communication between the agents is defined by the directed graph

G = (V, E)

, where the vertices

v \in V

are associated with the agents and the edges

e \in E

specify the communication between the corresponding agents. The graph

G

is shown in Figure 1.

The sets of input and output vertices in this graph are presented in Table 1.

As indicated above, in consideration of the agents’ impacts, each vertex of the graph

G

is considered a source of the unique colored flow. The color absorbed by the vertex is a mix of its own color and the colors of the incoming flows. The impact of the agent is considered to be the impact of its vertex on the color of the other vertices. Then, intuitively, the vertex with a maximal number of predecessors and successors is associated with the agents with maximal influence.

Following the cardinality of the sets of input and output vertices, it can be supposed that the set of vertices associated with the candidates for the leading agents is

V^{c} = \{v_{1}, v_{4}, v_{5}, v_{8}\}

such that

a_{1}

is a dictator and

a_{4}

is a monarch. In addition, because of the maximal number of predecessors, intuitively, vertex

v_{6}

can also be considered a candidate for the leading vertex.

Assume that the agents

a

from the group

A

distribute

n = 12

items

x

from the set

X

over

m = 4

classes

C

. The results of the classification are shown in Table 2.

In the table, the first agent

a_{1}

included the item

x_{1}

into the class

C_{2}

, the item

x_{2}

into the class

C_{1}

, the item

x_{3}

into the class

C_{3}

and so on, such that the partition

γ_{1}

created by the first agent is

γ_{1} = \{\{x_{2}, x_{6}, x_{8}\}, \{x_{1}\}, \{x_{3}, x_{9}, x_{10}, x_{12}\}, \{x_{4}, x_{5}, x_{7}, x_{11}\}\} .

The second agent

a_{2}

included the item

x_{1}

into the class

C_{2}

, the item

x_{2}

into the class

C_{3}

, the item

x_{3}

into the class

C_{2}

and so on, such that the partition created by the second agent is

γ_{2} = \{\emptyset, \{x_{1}, x_{3}, x_{10}\}, \{x_{2}, x_{6}, x_{7}, x_{8}, x_{9}, x_{11}, x_{12}\}, \{x_{4}, x_{5}\}\},

and so on. Correct classification is represented by the following partition:

\overset{ˇ}{γ} = \{\{x_{2}, x_{6}, x_{8}\}, \{x_{1}, x_{3}, x_{10}\}, \{x_{4}, x_{5}, x_{11}\}, \{x_{7}, x_{9}, x_{12}\}\},

and the partition obtained by the plurality voting is

γ_{P l} = \{\{x_{2}, x_{3}, x_{6}, x_{10}\}, \{x_{1}, x_{8}\}, \{x_{11}, x_{12}\}, \{x_{4}, x_{7}, x_{9}\}\} .

It is seen that the classifications

γ_{k}

provided by the agents

a_{k}

,

k = 1,2, \dots, l

, are rather far from the correct classification

\overset{ˇ}{γ}

as well as the aggregated classification

γ_{P l}

obtained by plurality voting.

However, some agents created certain classes that are equivalent to the classes in the correct classification, namely,

agent $a_{1}$ —class $C_{1} = \{x_{2}, x_{6}, x_{8}\}$ ,
agent $a_{2}$ —class $C_{2} = \{x_{1}, x_{3}, x_{10}\}$ ,
agent $a_{3}$ —classes $C_{1} = \{x_{2}, x_{6}, x_{8}\}$ and $C_{2} = \{x_{1}, x_{3}, x_{10}\}$ ,
agent $a_{4}$ —class $C_{3} = \{x_{4}, x_{5}, x_{11}\}$ ,
agent $a_{5}$ —class $C_{4} = \{x_{7}, x_{9}, x_{12}\}$ ,
agent $a_{6}$ —classes $C_{3} = \{x_{4}, x_{5}, x_{11}\}$ and $C_{4} = \{x_{7}, x_{9}, x_{12}\}$ .

The other agents

a_{7}

,

a_{8}

and

a_{9}

provided completely erroneous classifications, where despite the correct classification of some items, all obtained classes differ from the classes appearing in the correct classification. Then, aggregating the appropriate classes from the classifications created by the agents

a_{1}, \dots, a_{6}

and avoiding classifications created by the agents

a_{7}, \dots, a_{9}

, provides correct classification

γ = \overset{ˇ}{γ}

.

In the considered case of random classifications and relations between the agents, a possible set of leaders is

A^{*} = \{a_{1}, a_{4}, a_{5}, a_{8}\}

and a possible set of experts is

A^{'} = \{a_{1}, a_{2}, a_{3}, a_{4}, a_{5}, a_{6}\}

, which means that there is no clear relationship between these sets. However, since in real-world situations, the leader must be competent in at least one field of knowledge, the absence of such a relationship is not obvious, and its consideration is reasonable.

4. Distinguishing the Group of Leaders Using the Entropy-Based Metric

Let

A = \{a_{1}, a_{2}, \dots, a_{l}\}

be a group of agents and

G = (V, E)

be the directed graph representing communication between the agents such that the vertices

v \in V

are associated with the agents and the edges

e = (v_{i}, v_{j}) \in E

define communication between the agents

a_{i}, a_{j} \in A

,

i, j = 1,2, \dots, l

.

Denote by

\overset{⃐}{u} (v, ξ) \in V

a predecessor of the vertex

v \in V

such that the shortest path from

\overset{⃐}{u} (v, ξ)

to

v

in the graph

G

is of the length

ξ

, and by

\overset{⃑}{u} (v, ξ) \in V

a successor of the vertex

v \in V

such that the shortest path from

v

to

\overset{⃑}{u} (v, ξ)

in the graph

G

is of the length

ξ

. In particular,

\overset{⃐}{u} (v, 1) = \overset{⃐}{u} (v)

is a direct predecessor of

v

and

\overset{⃑}{u} (v, 1) = \overset{⃑}{u} (v)

is a direct successor of

v

. For completeness, we also say that

\overset{⃐}{u} (v, 0) = \overset{⃑}{u} (v, 0) = v

.

It is clear that each

v \in V

and all its predecessors are the predecessors of each successor of

v

, and each

v \in V

and all its successors are the successors of each predecessor of

v

.

For a vertex

v \in V

, denote by

\overset{⃐}{U} (v, ξ)

, the set of its predecessors

\overset{⃐}{u} (v, ξ)

and by

\overset{⃑}{U} (v, ξ)

the set of all its successors

\overset{⃑}{u} (v, ξ)

. The set of direct predecessors is denoted by

\overset{⃐}{U} (v, 1) = \overset{⃐}{U} (v)

and the set of direct successors is denoted by

\overset{⃑}{U} (v, 1) = \overset{⃑}{U} (v)

.
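The sets of predecessors and successors at a given shortest-path length can be obtained by a breadth-first search. The following minimal Python sketch assumes that the graph is given by the adjacency matrix R with r_ij = 1 for an edge from v_i to v_j; the function names and the small graph are hypothetical.

```python
from collections import deque
from typing import Dict, List, Set


def shortest_path_lengths(R: List[List[int]], source: int,
                          reverse: bool = False) -> Dict[int, int]:
    """Breadth-first search from source; with reverse=True, edges are followed backwards."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        u = queue.popleft()
        for w in range(len(R)):
            edge = R[w][u] if reverse else R[u][w]
            if edge and w not in dist:
                dist[w] = dist[u] + 1
                queue.append(w)
    return dist


def successors_at(R: List[List[int]], v: int, xi: int) -> Set[int]:
    """Vertices u such that the shortest path from v to u has length xi."""
    return {w for w, d in shortest_path_lengths(R, v).items() if d == xi}


def predecessors_at(R: List[List[int]], v: int, xi: int) -> Set[int]:
    """Vertices u such that the shortest path from u to v has length xi."""
    return {w for w, d in shortest_path_lengths(R, v, reverse=True).items() if d == xi}


# Hypothetical graph with edges 0 -> 1, 0 -> 2, 1 -> 2 (0-based indices).
R = [
    [0, 1, 1],
    [0, 0, 1],
    [0, 0, 0],
]
print(successors_at(R, 0, 1), predecessors_at(R, 2, 1))
```

With xi = 0, both functions return the vertex itself, which matches the convention stated above for completeness.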

Given a graph

G

, let

\overset{⃐}{U} (v)

be the set of direct predecessors of the vertex

v

,

\overset{⃐}{U} (\overset{⃐}{u} (v))

be the sets of direct predecessors of the vertices

\overset{⃐}{u} (v) \in \overset{⃐}{U} (v)

,

\overset{⃐}{U} (\overset{⃐}{u} (\overset{⃐}{u} (v)))

be the sets of direct predecessors of the vertices

\overset{⃐}{u} (\overset{⃐}{u} (v)) \in \overset{⃐}{U} (\overset{⃐}{u} (v))

and so on, up to, but not including, the set that already appears among the sets of direct predecessors obtained at the previous steps.

Similarly, let

\overset{⃑}{U} (v)

be the set of direct successors of the vertex

v

,

\overset{⃑}{U} (\overset{⃑}{u} (v))

be the sets of direct successors of the vertices

\overset{⃑}{u} (v) \in \overset{⃑}{U} (v)

,

\overset{⃑}{U} (\overset{⃑}{u} (\overset{⃑}{u} (v)))

be the sets of direct successors of the vertices

\overset{⃑}{u} (\overset{⃑}{u} (v)) \in \overset{⃑}{U} (\overset{⃑}{u} (v))

and so on, up to, but not including, the set that already appears among the sets of direct successors obtained at the previous steps.

Finally, for the vertex

v,

let us form the predecessors’ tree

\overset{⃐}{T} (v)

and the successors’ tree

\overset{⃑}{T} (v)

. In the tree

\overset{⃐}{T} (v)

, the root is associated with the set

\overset{⃐}{U} (v)

and the leaves at their levels are associated with the sets

\overset{⃐}{U} (\overset{⃐}{u} (v))

,

\overset{⃐}{U} (\overset{⃐}{u} (\overset{⃐}{u} (v)))

and so on, respectively. Similarly, in the tree

\overset{⃑}{T} (v)

, the root is associated with the set

\overset{⃑}{U} (v)

and the leaves at their levels are associated with the sets

\overset{⃑}{U} (\overset{⃑}{u} (v))

,

\overset{⃑}{U} (\overset{⃑}{u} (\overset{⃑}{u} (v)))

and so on.

For illustration, the predecessors’ tree

\overset{⃐}{T} (v_{8})

and the successors’ tree

\overset{⃑}{T} (v_{8})

of the vertex

v_{8}

in the graph

G

are shown in Figure 2.

The sets associated with the leaves of the trees

\overset{⃐}{T} (v)

and

\overset{⃑}{T} (v)

form, respectively, the predecessor cover

\overset{⃐}{τ} (v)

and the successor cover

\overset{⃑}{τ} (v)

of certain subsets of the set

V

of vertices.

For example, the predecessor and successor covers of the vertex

v_{8}

are

\overset{⃐}{τ} (v_{8}) = \{\{v_{1}, v_{2}\}, \{v_{4}\}, \{v_{7}, v_{9}\}\}

and

\overset{⃑}{τ} (v_{8}) = \{\{v_{5}, v_{6}\}, \{v_{6}\}\}

.

The predecessor cover

\overset{⃐}{τ} (V^{'})

of the subset

V^{'} \subset V

of vertices is a set

\overset{⃐}{τ} (V^{'}) = ⋃_{v \in V^{'}} \overset{⃐}{τ} (v)

of the predecessor covers

\overset{⃐}{τ} (v)

of the vertices

v \in V^{'}

, and the successor cover

\overset{⃑}{τ} (V^{″})

of the subset

V^{″} \subset V

of vertices is a set

\overset{⃑}{τ} (V^{″}) = ⋃_{v \in V^{″}} \overset{⃑}{τ} (v)

of the successor covers

\overset{⃑}{τ} (v)

of the vertices

v \in V^{″}

.

For example, for the above-mentioned set

V^{c} = \{v_{1}, v_{4}, v_{5}, v_{8}\}

of vertices, the predecessor and successor covers are

\overset{⃐}{τ} (V^{c}) = \{\{v_{1}, v_{2}\}, \{v_{3}, v_{4}, v_{8}\}, \{v_{4}\}, \{v_{7}, v_{9}\}\}

and

\overset{⃑}{τ} (V^{c}) = \{\{v_{2}, v_{4}\}, \{v_{4}\}, \{v_{5}, v_{6}\}, \{v_{5}, v_{7}\}, \{v_{8}\}\}

.

Then, we say that the subset

V^{*} \subset V

is a set of leading vertices if the distance

d (\overset{⃐}{τ} (V^{*}), \overset{⃑}{τ} (V^{*}))

between its predecessor cover

\overset{⃐}{τ} (V^{*})

and successor cover

\overset{⃑}{τ} (V^{*})

is maximal over all possible subsets of the set

V

.

The agents

a^{*}

associated with the leading vertices

v^{*} \in V^{*}

are called the leaders and the group

A^{*} \subset A

of leading agents is called the leading group.

Distance

d (\overset{⃐}{τ} (V^{'}), \overset{⃑}{τ} (V^{″}))

between the covers

\overset{⃐}{τ} (V^{'})

and

\overset{⃑}{τ} (V^{″})

can be calculated using different methods. Here, we suggest the distance measure, which is based on the Rokhlin metric [22]. Since in the considered task, the main stress is on the communication between the agents and on the classification of the data items, the use of such an entropy-based metric is reasonable. Together with that, since the suggested method deals with formal sets of vertices in the graph, the other measures, e.g., the Ornstein distance [23,24], can be applied. For a comparison between the Rokhlin distance and the Ornstein distance, see [25]. Note that both Rokhlin and Ornstein metrics require the defined probability measure on the sets; if such a probability does not exist, then the normalized Hamming distance [12] can be used.

The Rokhlin metric is defined as follows. Let

(Ω, Q, p)

be a probability space with a probability measure

p

on

Ω

, and let

α = \{Q | Q \in Q\}

,

Q_{i} \cap Q_{j} = \emptyset

,

i \neq j

,

⋃_{Q \in α} Q = Ω

, be a partition of

Ω

. The entropy of the partition

α

is the value

H (α) = - \sum_{Q \in α} p (Q) \log p (Q),

where

\log

is base

2,

and it is assumed that

p (\emptyset) \log p (\emptyset) = 0 \log 0 = 0

. In addition, let

β = \{R | R \in Q\}

,

R_{i} \cap R_{j} = \emptyset

,

i \neq j

,

⋃_{R \in β} R = Ω

, be another partition of

Ω

. Then the conditional entropy of partition

α

given partition

β

is the value

H (α | β) = - \sum_{R \in β} \sum_{Q \in α} p (Q, R) \log p (Q | R),

where

p (Q, R) = p (Q \cap R)

and

p (Q | R) = \frac{p (Q \cap R)}{p (R)}

.

The Rokhlin metric [22], which defines the distance between partitions

α

and

β

is a sum

d_{R} (α, β) = H (α | β) + H (β | α) .

For basic properties of this metric and its role in dynamical systems theory, see [26,27]; for additional properties and comparison with the Ornstein metric, see [25].
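For finite sets, the entropy, the conditional entropy, and the Rokhlin distance defined above are straightforward to compute. The following minimal Python sketch assumes a uniform probability measure on a finite set of n elements, so that p(Q) is the number of elements of Q divided by n; the function names and the four-element example are hypothetical.

```python
from math import log2
from typing import List, Set


def entropy(alpha: List[Set[int]], n: int) -> float:
    """H(alpha) = -sum p(Q) log2 p(Q), with p(Q) = |Q| / n and 0 log 0 = 0."""
    return -sum(len(Q) / n * log2(len(Q) / n) for Q in alpha if Q)


def conditional_entropy(alpha: List[Set[int]], beta: List[Set[int]], n: int) -> float:
    """H(alpha | beta) = -sum_R sum_Q p(Q & R) log2 (p(Q & R) / p(R))."""
    h = 0.0
    for R in beta:
        for Q in alpha:
            joint = len(Q & R)
            if joint:                      # zero terms are skipped
                h -= joint / n * log2(joint / len(R))
    return h


def rokhlin_distance(alpha: List[Set[int]], beta: List[Set[int]], n: int) -> float:
    """d_R(alpha, beta) = H(alpha | beta) + H(beta | alpha)."""
    return conditional_entropy(alpha, beta, n) + conditional_entropy(beta, alpha, n)


# Two partitions of a four-element set that split it "crosswise".
alpha = [{0, 1}, {2, 3}]
beta = [{0, 2}, {1, 3}]
print(entropy(alpha, 4), rokhlin_distance(alpha, beta, 4))
```

For these two partitions, each conditional entropy equals 1 bit, so the distance is 2, while the distance from a partition to itself is 0.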

To apply this metric for measuring the distance between the covers

\overset{⃐}{τ} (V^{'})

and

\overset{⃑}{τ} (V^{″})

, note again that each of these sets does not necessarily cover the set of vertices

V

, but the subsets

⋃_{Q \in \overset{⃐}{τ} (V^{'})} Q \subset V

and

⋃_{R \in \overset{⃑}{τ} (V^{″})} R \subset V

of this set. Then, let us add to each of these sets the set which completes it to the cover of

V

.

Namely, the set

\overset{⃐}{τ} (V^{'})

is completed with the set

Q^{'} = V \setminus ⋃_{Q \in \overset{⃐}{τ} (V^{'})} Q

and the set

\overset{⃑}{τ} (V^{″})

is completed with the set

R^{'} = V \setminus ⋃_{R \in \overset{⃑}{τ} (V^{″})} R

. As a result, the sets

{\overset{⃐}{τ}}^{'} (V^{'}) = \overset{⃐}{τ} (V^{'}) \cup \{Q^{'}\} \quad \text{and} \quad {\overset{⃑}{τ}}^{'} (V^{″}) = \overset{⃑}{τ} (V^{″}) \cup \{R^{'}\}

cover the set

V

of vertices.

Then, it is required to define the probability measure

p : V \to [0,1]

on the set of vertices. Since there is no additional information about the agents, we assume that

p (v) = \frac{1}{\# V}

for each vertex

v \in V

, and

p (Q) = \sum_{v \in Q} p (v) = \frac{\# Q}{\# V}

for each subset

Q \subset V

of vertices.

For the conditional entropy, we have

\begin{aligned}
H ({\overset{⃐}{τ}}^{'} (V^{'}) | {\overset{⃑}{τ}}^{'} (V^{″})) & = - \sum_{R \in {\overset{⃑}{τ}}^{'} (V^{″})} \sum_{Q \in {\overset{⃐}{τ}}^{'} (V^{'})} p (Q, R) \log p (Q | R) \\
& = - \sum_{R \in \overset{⃑}{τ} (V^{″})} \sum_{Q \in {\overset{⃐}{τ}}^{'} (V^{'})} p (Q, R) \log p (Q | R) - \sum_{Q \in {\overset{⃐}{τ}}^{'} (V^{'})} p (Q, R^{'}) \log p (Q | R^{'}) \\
& = - \sum_{R \in \overset{⃑}{τ} (V^{″})} \left( \sum_{Q \in \overset{⃐}{τ} (V^{'})} p (Q, R) \log p (Q | R) + p (Q^{'}, R) \log p (Q^{'} | R) \right) \\
& \quad - \sum_{Q \in \overset{⃐}{τ} (V^{'})} p (Q, R^{'}) \log p (Q | R^{'}) - p (Q^{'}, R^{'}) \log p (Q^{'} | R^{'}) \\
& = - \sum_{R \in \overset{⃑}{τ} (V^{″})} \sum_{Q \in \overset{⃐}{τ} (V^{'})} p (Q, R) \log p (Q | R) \\
& \quad - \sum_{R \in \overset{⃑}{τ} (V^{″})} p (Q^{'}, R) \log p (Q^{'} | R) - \sum_{Q \in \overset{⃐}{τ} (V^{'})} p (Q, R^{'}) \log p (Q | R^{'}) \\
& \quad - p (Q^{'}, R^{'}) \log p (Q^{'} | R^{'}) .
\end{aligned}

In this formula, the first term is equivalent to the conditional entropy of the sets

\overset{⃐}{τ} (V^{'})

and

\overset{⃑}{τ} (V^{″})

, the second and third terms represent the influence of the sets

Q^{'}

and

R^{'}

on the elements of the sets

\overset{⃐}{τ} (V^{'})

and

\overset{⃑}{τ} (V^{″})

, and the last term defines the conditional entropy of

Q^{'}

with respect to

R^{'}

.

This definition is a direct extension of the definition of conditional entropy of the partitions. In fact, if the sets

\overset{⃐}{τ} (V^{'})

and

\overset{⃑}{τ} (V^{″})

are covers of

V

, then

Q^{'} = \emptyset

and

R^{'} = \emptyset

. Then,

H ({\overset{⃐}{τ}}^{'} (V^{'}) | {\overset{⃑}{τ}}^{'} (V^{″})) = H (\overset{⃐}{τ} (V^{'}) | \overset{⃑}{τ} (V^{″})) = - \sum_{R \in \overset{⃑}{τ} (V^{″})} \sum_{Q \in \overset{⃐}{τ} (V^{'})} p (Q, R) \log p (Q | R),

and if

\overset{⃐}{τ} (V^{'})

and

\overset{⃑}{τ} (V^{″})

are partitions of

V

, then it is equivalent to the definition of the conditional entropy.

Note that since

{\overset{⃐}{τ}}^{'} (V^{'})

and

{\overset{⃑}{τ}}^{'} (V^{″})

are covers, and not necessarily partitions, of the set

V

, the conditional entropy

H ({\overset{⃐}{τ}}^{'} (V^{'}) | {\overset{⃑}{τ}}^{'} (V^{″}))

does not necessarily meet all the properties of the conditional entropy defined for the partitions. However, here, we will not consider specific properties of the conditional entropy of the covers but will use it directly to define the distance between the predecessor cover

\overset{⃐}{τ} (V^{'})

and the successor cover

\overset{⃑}{τ} (V^{″})

.

The distance

d (\overset{⃐}{τ} (V^{'}), \overset{⃑}{τ} (V^{″}))

between the predecessor cover

\overset{⃐}{τ} (V^{'})

and the successor cover

\overset{⃑}{τ} (V^{″})

is defined by the Rokhlin distance between the covers

{\overset{⃐}{τ}}^{'} (V^{'})

and

{\overset{⃑}{τ}}^{'} (V^{″})

of the set

V

of vertices as

d (\overset{⃐}{τ} (V^{'}), \overset{⃑}{τ} (V^{″})) = H ({\overset{⃐}{τ}}^{'} (V^{'}) | {\overset{⃑}{τ}}^{'} (V^{″})) + H ({\overset{⃑}{τ}}^{'} (V^{″}) | {\overset{⃐}{τ}}^{'} (V^{'})) .

For example, the distance between the predecessor cover

\overset{⃐}{τ} (V^{c}) = \{\{v_{1}, v_{2}\}, \{v_{3}, v_{4}, v_{8}\}, \{v_{4}\}, \{v_{7}, v_{9}\}\}

and the successor cover

\overset{⃑}{τ} (V^{c}) = \{\{v_{2}, v_{4}\}, \{v_{4}\}, \{v_{5}, v_{6}\}, \{v_{5}, v_{7}\}, \{v_{8}\}\}

is calculated as follows:

The completed sets for the covers

\overset{⃐}{τ} (V^{c})

and

\overset{⃑}{τ} (V^{c})

are

Q^{'} = \{v_{1}, v_{2}, \dots, v_{9}\} \setminus \{v_{1}, v_{2}, v_{3}, v_{4}, v_{7}, v_{8}, v_{9}\} = \{v_{5}, v_{6}\}

and

R^{'} = \{v_{1}, v_{2}, \dots, v_{9}\} \setminus \{v_{2}, v_{4}, v_{5}, v_{6}, v_{7}, v_{8}\} = \{v_{1}, v_{3}, v_{9}\}

. Then, the completed covers of the set

V

of vertices are

{\overset{⃐}{τ}}^{'} (V^{c}) = \{\{v_{1}, v_{2}\}, \{v_{3}, v_{4}, v_{8}\}, \{v_{4}\}, \{v_{7}, v_{9}\}, \{v_{5}, v_{6}\}\}

and

{\overset{⃑}{τ}}^{'} (V^{c}) = \{\{v_{2}, v_{4}\}, \{v_{4}\}, \{v_{5}, v_{6}\}, \{v_{5}, v_{7}\}, \{v_{8}\}, \{v_{1}, v_{3}, v_{9}\}\} .

The probability of each vertex

v \in V

is

p (v) = \frac{1}{9}

. Then, conditional entropies

H ({\overset{⃐}{τ}}^{'} (V^{c}) | {\overset{⃑}{τ}}^{'} (V^{c}))

and

H ({\overset{⃑}{τ}}^{'} (V^{c}) | {\overset{⃐}{τ}}^{'} (V^{c}))

are as follows (the zero terms are omitted):

\begin{aligned}
H ({\overset{⃐}{τ}}^{'} (V^{c}) | {\overset{⃑}{τ}}^{'} (V^{c})) & = - p (\{v_{2}\}) \log \frac{p (\{v_{2}\})}{p (\{v_{2}, v_{4}\})} - p (\{v_{4}\}) \log \frac{p (\{v_{4}\})}{p (\{v_{2}, v_{4}\})} \\
& \quad - p (\{v_{4}\}) \log \frac{p (\{v_{4}\})}{p (\{v_{2}, v_{4}\})} - p (\{v_{7}\}) \log \frac{p (\{v_{7}\})}{p (\{v_{5}, v_{7}\})} - p (\{v_{5}\}) \log \frac{p (\{v_{5}\})}{p (\{v_{5}, v_{7}\})} \\
& \quad - p (\{v_{1}\}) \log \frac{p (\{v_{1}\})}{p (\{v_{1}, v_{3}, v_{9}\})} - p (\{v_{3}\}) \log \frac{p (\{v_{3}\})}{p (\{v_{1}, v_{3}, v_{9}\})} - p (\{v_{9}\}) \log \frac{p (\{v_{9}\})}{p (\{v_{1}, v_{3}, v_{9}\})} \\
& = - \frac{1}{9} \log \frac{1}{2} - \frac{1}{9} \log \frac{1}{2} - \frac{1}{9} \log \frac{1}{2} - \frac{1}{9} \log \frac{1}{2} - \frac{1}{9} \log \frac{1}{2} - \frac{1}{9} \log \frac{1}{3} - \frac{1}{9} \log \frac{1}{3} - \frac{1}{9} \log \frac{1}{3} \\
& = 1.08,
\end{aligned}

\begin{aligned}
H ({\overset{⃑}{τ}}^{'} (V^{c}) | {\overset{⃐}{τ}}^{'} (V^{c})) & = - p (\{v_{2}\}) \log \frac{p (\{v_{2}\})}{p (\{v_{1}, v_{2}\})} - p (\{v_{1}\}) \log \frac{p (\{v_{1}\})}{p (\{v_{1}, v_{2}\})} \\
& \quad - p (\{v_{4}\}) \log \frac{p (\{v_{4}\})}{p (\{v_{3}, v_{4}, v_{8}\})} - p (\{v_{4}\}) \log \frac{p (\{v_{4}\})}{p (\{v_{3}, v_{4}, v_{8}\})} - p (\{v_{8}\}) \log \frac{p (\{v_{8}\})}{p (\{v_{3}, v_{4}, v_{8}\})} \\
& \quad - p (\{v_{3}\}) \log \frac{p (\{v_{3}\})}{p (\{v_{3}, v_{4}, v_{8}\})} - p (\{v_{7}\}) \log \frac{p (\{v_{7}\})}{p (\{v_{7}, v_{9}\})} - p (\{v_{9}\}) \log \frac{p (\{v_{9}\})}{p (\{v_{7}, v_{9}\})} \\
& \quad - p (\{v_{5}\}) \log \frac{p (\{v_{5}\})}{p (\{v_{5}, v_{6}\})} \\
& = - \frac{1}{9} \log \frac{1}{2} - \frac{1}{9} \log \frac{1}{2} - \frac{1}{9} \log \frac{1}{3} - \frac{1}{9} \log \frac{1}{3} - \frac{1}{9} \log \frac{1}{3} - \frac{1}{9} \log \frac{1}{3} - \frac{1}{9} \log \frac{1}{2} - \frac{1}{9} \log \frac{1}{2} - \frac{1}{9} \log \frac{1}{2} \\
& = 1.26 .
\end{aligned}

The distance between the predecessor and successor covers of the set

V^{c} = \{v_{1}, v_{4}, v_{5}, v_{8}\}

of vertices is

d (\overset{⃐}{τ} (V^{c}), \overset{⃑}{τ} (V^{c})) = 1.08 + 1.26 = 2.34 .
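The calculation for the candidate set can be reproduced numerically. The following minimal Python sketch completes each cover with the block of uncovered vertices, assumes the uniform measure p(v) = 1/9, and evaluates the two conditional entropies and their sum. The function names are hypothetical, and vertices are represented by their indices 1, ..., 9.

```python
from math import log2
from typing import Iterable, Set, FrozenSet


def complete_cover(cover: Iterable[Set[int]], vertices: Set[int]) -> Set[FrozenSet[int]]:
    """Add the block of vertices not covered so far (Q' or R' in the text), if any."""
    blocks = {frozenset(block) for block in cover}
    rest = frozenset(vertices) - frozenset().union(*blocks)
    if rest:
        blocks.add(rest)
    return blocks


def cover_conditional_entropy(alpha, beta, n: int) -> float:
    """H(alpha | beta) for covers under the uniform measure on n vertices."""
    h = 0.0
    for R in beta:
        for Q in alpha:
            joint = len(Q & R)
            if joint:                      # zero terms are omitted
                h -= joint / n * log2(joint / len(R))
    return h


def cover_distance(pred_cover, succ_cover, vertices: Set[int]) -> float:
    """Sum of the two conditional entropies of the completed covers."""
    n = len(vertices)
    a = complete_cover(pred_cover, vertices)
    b = complete_cover(succ_cover, vertices)
    return cover_conditional_entropy(a, b, n) + cover_conditional_entropy(b, a, n)


V = set(range(1, 10))
pred = [{1, 2}, {3, 4, 8}, {4}, {7, 9}]          # predecessor cover of V^c
succ = [{2, 4}, {4}, {5, 6}, {5, 7}, {8}]        # successor cover of V^c

a, b = complete_cover(pred, V), complete_cover(succ, V)
print(round(cover_conditional_entropy(a, b, 9), 2))   # 1.08
print(round(cover_conditional_entropy(b, a, 9), 2))   # 1.26
print(round(cover_distance(pred, succ, V), 2))        # 2.34
```

The exact values are 5/9 + (3/9) log 3 and 5/9 + (4/9) log 3 with logarithms to base 2, which round to the values given above.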

For comparison, the distance between the above-mentioned predecessor cover

\overset{⃐}{τ} (v_{8}) = \{\{v_{1}, v_{2}\}, \{v_{4}\}, \{v_{7}, v_{9}\}\}

and the successor cover

\overset{⃑}{τ} (v_{8}) = \{\{v_{5}, v_{6}\}, \{v_{6}\}\}

of the vertex

v_{8}

is

d (\overset{⃐}{τ} (v_{8}), \overset{⃑}{τ} (v_{8})) = 1.96 .

Then, the group

A^{c} = \{a_{1}, a_{4}, a_{5}, a_{8}\}

of the agents associated with the vertices of the group

V^{c} = \{v_{1}, v_{4}, v_{5}, v_{8}\}

is preferable as a group of leaders to the group

A^{c} = \{a_{8}\}

, which includes only one agent

a_{8}

associated with the vertex

v_{8}

.

Thus, we have defined the group

A^{*} \subset A

of leading agents and suggested the criterion for its recognition among the other agents in the group

A

. The same procedure can be continued over the group

A^{*}

and then recurrently over the obtained groups up to distinguishing a unique leading agent.

An algorithmic solution to the problem of distinguishing the leading agents is computationally demanding, since it requires an exhaustive search over all possible subsets of agents from the group

A

or, equivalently, over all possible subsets of vertices from the set

V

. At the same time, heuristics that omit vertices with a relatively small number of predecessors and successors can strongly reduce the number of candidate solutions.
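The size of the search space and the effect of such a heuristic can be illustrated with a short sketch. It is not the authors' implementation: the output sets are read off Table 1 (with the output set of v1 taken as {v2, v4}, which is what the input sets of v2 and v4 imply), the number of direct input and output links is used as a simple proxy for the number of predecessors and successors, and the names `candidate_subsets`, `prune`, and the threshold `min_links` are hypothetical.

```python
from itertools import combinations

# Output sets of the vertices v1..v9, read off Table 1.
# Assumption: v1 -> {v2, v4}, consistent with the input sets of v2 and v4.
OUT = {1: {2, 4}, 2: {4}, 3: {5}, 4: {5, 7}, 5: {6},
       6: set(), 7: {8}, 8: {5, 6}, 9: {8}}

def input_sets(out):
    """Derive the input set of every vertex from the output sets."""
    inp = {v: set() for v in out}
    for u, successors in out.items():
        for w in successors:
            inp[w].add(u)
    return inp

def candidate_subsets(vertices):
    """Yield every non-empty subset of the given vertices."""
    ordered = sorted(vertices)
    for size in range(1, len(ordered) + 1):
        yield from combinations(ordered, size)

def prune(out, min_links):
    """Keep vertices with at least min_links direct input plus output links."""
    inp = input_sets(out)
    return {v for v in out if len(inp[v]) + len(out[v]) >= min_links}

all_candidates = sum(1 for _ in candidate_subsets(OUT))
kept = prune(OUT, min_links=2)
pruned_candidates = sum(1 for _ in candidate_subsets(kept))
print(all_candidates, sorted(kept), pruned_candidates)
# Each remaining candidate would then be scored by the distance between
# its predecessor and successor covers (not implemented in this sketch).
```

With this threshold the vertices v3 and v9 are dropped, and the number of candidate subsets falls from 2^9 - 1 = 511 to 2^7 - 1 = 127, while the set {v1, v4, v5, v8} considered above remains among the candidates.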

5. Distinguishing the Group of Experts

Assume that the group of agents

A = \{a_{1}, a_{2}, \dots, a_{l}\}

considers the set

X = \{x_{1}, \dots, x_{n}\}

of

n

items and each agent

a_{k} \in A

,

k = 1,2, \dots, l

, provides partition

γ_{k} = \{C_{k, 1}, C_{k, 2}, \dots, C_{k, m}\}

of the set

X

to

m

classes. The resulting classification is an aggregated partition

γ = \{C_{1}, C_{2}, \dots, C_{m}\}

created from the agents’ partitions

γ_{k}

,

k = 1,2, \dots, l

, and to obtain the correct partition

γ,

it is required to recognize partitions provided by the competent agents and avoid partitions provided by non-competent agents.

Distinguishing the experts is based on the assumption that the agents with the same competence in the same fields provide similar classifications of the items related to their field of expertise and can provide different classifications of the items that are outside of the scope of their competence [12]. In other words, we follow the well-known phrase by Father Dominique Bouhours ([28] (p. 125), punctuation and grammar preserved):

“Great Minds often think alike on the same Occasions, and we are not always to suppose, that such Thoughts are borrow’d from one another when exprest by Persons of the same heroick Sentiments.”

Following this assumption, agent

a_{k} \in A

is considered a weak expert in a certain class

C_{j}

if the agent’s partition

γ_{k}

includes

C_{j}

and there exist other agents

a_{k^{'}}, a_{k^{″}}, \dots \in A

, such that their partitions

γ_{k^{'}}

,

γ_{k^{″}}

,… include

C_{j}

. If, in the partition

γ_{k}

, the class

C_{j}

is at the same position as in the partitions

γ_{k^{'}}

,

γ_{k^{″}}

,…, then the agent

a_{k}

is called a strong expert or, for brevity, simply an expert.
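The two definitions can be made concrete with a small sketch. The data below are a hypothetical toy example (four agents, four items, two classes) and are not taken from the paper. Partitions are stored as ordered lists of classes so that the position of a class is meaningful; the function name `expert_status` and the parameter `min_others` are illustrative assumptions, and "the same position" is read here as "the same index in the ordered partition of at least `min_others` other agents".

```python
def expert_status(k, cls, partitions, min_others=1):
    """Return 'strong', 'weak', or None for agent k and class cls.

    partitions maps an agent index to an ordered list of classes (sets).
    """
    cls = frozenset(cls)
    own = [frozenset(c) for c in partitions[k]]
    if cls not in own:
        return None  # the agent's partition does not include this class
    position = own.index(cls)
    others_any = 0   # other agents whose partition includes cls
    others_same = 0  # ... and includes it at the same position
    for agent, partition in partitions.items():
        if agent == k:
            continue
        classes = [frozenset(c) for c in partition]
        if cls in classes:
            others_any += 1
            if classes.index(cls) == position:
                others_same += 1
    if others_same >= min_others:
        return "strong"
    if others_any >= min_others:
        return "weak"
    return None

# Hypothetical partitions of the items {1, 2, 3, 4} into two classes.
partitions = {
    1: [{1, 2}, {3, 4}],
    2: [{1, 2}, {3, 4}],
    3: [{3, 4}, {1, 2}],
    4: [{1, 3}, {2, 4}],
}
print(expert_status(1, {1, 2}, partitions))  # strong
print(expert_status(3, {1, 2}, partitions))  # weak
print(expert_status(4, {1, 3}, partitions))  # None
```

In this toy example, agent 3 forms the same class as agents 1 and 2 but labels it differently, so it is only a weak expert, while agent 4 shares no class with any other agent.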

The number of agents with equivalent classes

C_{j}

required for specifying the agent as an expert varies and depends on the number

l

of agents in the group. Following general statistical assumptions, we say that the number of such agents is at least

10 %

of

l

and, for small groups, is not less than

2

.
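As a minimal sketch, this rule can be written as a one-line function. Rounding the 10% share up to the next integer is an assumption, as is the name `min_agreeing_agents`.

```python
def min_agreeing_agents(l):
    """Smallest number of agreeing agents: at least 10% of l, never below 2."""
    ten_percent_rounded_up = (l + 9) // 10  # integer ceiling of l / 10
    return max(2, ten_percent_rounded_up)

for group_size in (8, 9, 25, 50):
    print(group_size, min_agreeing_agents(group_size))
# prints: 8 2, 9 2, 25 3, 50 5 (one pair per line)
```

For the groups of 8 and 9 agents used in the examples of this paper, the lower bound of 2 is the binding condition.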

As indicated above, there exist several classification algorithms that exploit the difference between the opinions of competent and non-competent agents [10,11,12,13]. In particular, the Distance-Based Collaborative Classification (DBCC) algorithm [12] directly considers the normalized Hamming distance

d_{n H} (γ_{k}, γ_{k^{'}} | j)

between the partitions

γ_{k} = \{C_{k, 1}, C_{k, 2}, \dots, C_{k, m}\}

and

γ_{k^{'}} = \{C_{k^{'}, 1}, C_{k^{'}, 2}, \dots, C_{k^{'}, m}\}

with respect to each class

C_{j}

d_{n H} (γ_{k}, γ_{k^{'}} | j) = # (C_{k, j} ∆ C_{k^{'}, j}) / (# C_{k, j} + # C_{k^{'}, j}),

where

C_{k, j} ∆ C_{k^{'}, j} = (C_{k, j} \cup C_{k^{'}, j}) \setminus (C_{k, j} \cap C_{k^{'}, j})

is a symmetric difference between the classes

C_{k, j}

and

C_{k^{'}, j}

.
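The per-class normalized Hamming distance can be sketched with Python sets. The two classes used below are the first classes of agents a_1 and a_4 in Table 2, written as item indices; returning 0 for two empty classes is an assumption, since the formula is undefined in that case, and the function name is illustrative.

```python
def normalized_hamming(class_a, class_b):
    """#(A symmetric-difference B) / (#A + #B) for two classes of items."""
    a, b = set(class_a), set(class_b)
    total = len(a) + len(b)
    if total == 0:
        return 0.0  # assumption: two empty classes are at distance 0
    return len(a ^ b) / total

c_1_1 = {2, 6, 8}            # class 1 of agent a_1 (items x2, x6, x8)
c_4_1 = {3, 6, 7, 10, 12}    # class 1 of agent a_4
print(normalized_hamming(c_1_1, c_4_1))  # 0.75
```

Identical classes give the distance 0 and disjoint non-empty classes give the distance 1, so the value 0.75 indicates that the two agents largely disagree about the first class.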

If on the set

X

of items, a probability measure

p : X \to [0,1]

is defined, then using this measure, the distance between the partitions

γ_{k}

and

γ_{k^{'}}

of

X

with respect to the class

C_{j}

can be defined as

d_{p H} (γ_{k}, γ_{k^{'}} | j) = p (C_{k, j} ∆ C_{k^{'}, j}),

or in the form of the Rokhlin metric as

d_{p R} (γ_{k}, γ_{k^{'}} | j) = - p (C_{k, j} \setminus C_{k^{'}, j}) \log p (C_{k, j} \setminus C_{k^{'}, j}) - p (C_{k^{'}, j} \setminus C_{k, j}) \log p (C_{k^{'}, j} \setminus C_{k, j}) .
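A minimal sketch of the Rokhlin-type distance between two classes is given below. It assumes a uniform probability measure over the n = 12 items of Table 2, base-2 logarithms, and the usual convention 0 log 0 = 0; the function names are illustrative.

```python
import math

def measure(subset, p):
    """Probability of a set of items under the point probabilities p."""
    return sum(p[x] for x in subset)

def neg_q_log_q(q):
    """-q * log2(q), with the convention that the value is 0 for q = 0."""
    return 0.0 if q == 0 else -q * math.log2(q)

def rokhlin_class_distance(class_a, class_b, p):
    """-p(A\\B) log p(A\\B) - p(B\\A) log p(B\\A) for two classes A and B."""
    a, b = set(class_a), set(class_b)
    return neg_q_log_q(measure(a - b, p)) + neg_q_log_q(measure(b - a, p))

p_uniform = {x: 1 / 12 for x in range(1, 13)}  # assumption: uniform measure
c_1_1 = {2, 6, 8}            # class 1 of agent a_1 in Table 2
c_4_1 = {3, 6, 7, 10, 12}    # class 1 of agent a_4 in Table 2
d = rokhlin_class_distance(c_1_1, c_4_1, p_uniform)
print(round(d, 2))  # 0.96
```

Here the two set differences have the probabilities 2/12 and 4/12, and the distance vanishes only when both differences have zero probability.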

Below, we assume that the experts are already distinguished by these or other methods and consider the relationship between the group of leaders and the group of experts.

6. Relationship between the Group $A^{*}$ of Leaders and the Group $A'$ of Experts

Let us return to the above example and assume that the distinguished group of experts is

A^{'} = \{a_{1}, a_{2}, a_{3}, a_{4}, a_{5}, a_{6}\}

. Our aim is to check whether this group of experts is also a group of leaders.

Denote by

V^{'} \subset V

the set of vertices in the graph

G

associated with the agents from the group

A^{'}

of experts. Then, following the procedure presented above for distinguishing the group of leading agents, the distance between the predecessor partition

\overset{⃐}{τ} (V^{'})

and the successor partition

\overset{⃑}{τ} (V^{'})

is

d (\overset{⃐}{τ} (V^{'}), \overset{⃑}{τ} (V^{'})) = H ({\overset{⃐}{τ}}^{'} (V^{'}) | {\overset{⃑}{τ}}^{'} (V^{'})) + H ({\overset{⃑}{τ}}^{'} (V^{'}) | {\overset{⃐}{τ}}^{'} (V^{'})) = 0.89 + 0.70 = 1.59 .

It is seen that the predecessor partition

\overset{⃐}{τ} (V^{'})

and successor partition

\overset{⃑}{τ} (V^{'})

of the set

V^{'}

are closer than the predecessor partition

\overset{⃐}{τ} (V^{c})

and successor partition

\overset{⃑}{τ} (V^{c})

of the previously distinguished set

V^{c} = \{v_{1}, v_{4}, v_{5}, v_{8}\}

of vertices associated with the agents from the set

A^{c}

of candidate leaders (distance

d (\overset{⃐}{τ} (V^{c}), \overset{⃑}{τ} (V^{c})) = 2.34

). Moreover, partitions

\overset{⃐}{τ} (V^{'})

and

\overset{⃑}{τ} (V^{'})

are closer than the partitions

\overset{⃐}{τ} (v_{8})

and

\overset{⃑}{τ} (v_{8})

of the vertex

v_{8}

(distance

d (\overset{⃐}{τ} (v_{8}), \overset{⃑}{τ} (v_{8})) = 1.96

).

Now, let us consider the classifications provided by the candidate group

A^{c} = \{a_{1}, a_{4}, a_{5}, a_{8}\}

of leaders. We have

\begin{array}{l} γ_{1} = \{\{x_{2}, x_{6}, x_{8}\}, \{x_{1}\}, \{x_{3}, x_{9}, x_{10}, x_{12}\}, \{x_{4}, x_{5}, x_{7}, x_{11}\}\}, \\ γ_{4} = \{\{x_{3}, x_{6}, x_{7}, x_{10}, x_{12}\}, \{x_{8}, x_{9}\}, \{x_{4}, x_{5}, x_{11}\}, \{x_{1}, x_{2}\}\}, \\ γ_{5} = \{\{x_{3}, x_{4}, x_{10}\}, \{x_{5}, x_{6}, x_{8}, x_{11}\}, \{x_{1}, x_{2}\}, \{x_{7}, x_{9}, x_{12}\}\}, \\ γ_{8} = \{\{x_{3}, x_{10}\}, \{x_{2}, x_{8}\}, \{x_{1}, x_{4}, x_{5}, x_{6}, x_{11}, x_{12}\}, \{x_{7}, x_{9}\}\} . \end{array}

Despite the competence of agent

a_{1}

in class

C_{1}

, of agent

a_{4}

in class

C_{3}

and of agent

a_{5}

in class

C_{4}

, the resulting plurality voting partition is (item

x_{2}

is labeled randomly)

γ_{P l} = \{\{x_{3}, x_{6}, x_{10}\}, \{x_{2}, x_{8}\}, \{x_{1}, x_{5}, x_{11}, x_{12}\}, \{x_{4}, x_{7}, x_{9}\}\},

which is far from the correct partition

\overset{ˇ}{γ}

.

Thus, in the considered example with random relations between the agents

a_{k}

and

a_{k^{'}}

and randomly chosen classifications

γ_{k}

,

k, k^{'} = 1,2, \dots, l

, the group of experts strongly differs from the group of leading agents.

However, as indicated above, it is reasonable to assume that the agent elected to be a leader or a member of the group of leaders is competent in certain fields [19].

Following this assumption, let us consider another example and form a group of leaders starting with the agents’ expertise [13]. Assume that the group

A

of

l = 8

agents classifies

n = 9

items

x

from the set

X

over

m = 5

classes

C

. The agents’ classifications are shown in Table 3.

In the Wisdom in the Crowd (WICRO) algorithm [13], the agents are divided into clusters according to the number of agreements about the class of each item

x

. The resulting clusters of agents are summarized in Table 4.

According to Table 4,

a_{1}

and

a_{2}

are the agents that agree that the item

x_{1}

should be in the class

C_{1}

and the item

x_{3}

should be in the class

C_{3}

;

a_{1}

,

a_{2}

, and

a_{7}

are the agents that agree that the item

x_{2}

should be in the class

C_{2}

and so on.

In addition, it is seen that the agents

a_{1}

and

a_{2}

appear both in the cluster

\{a_{1}, a_{2}\}

and in the cluster

\{a_{1}, a_{2}, a_{7}\}

together with the agent

a_{7}

. Thus, we assume that the agents

a_{1}

and

a_{2}

are predecessors and successors of each other and both are predecessors of the agent

a_{7}

. The same holds for the agents

a_{5}

and

a_{6}

and the agent

a_{4}

.

Also, the agents

a_{3}

and

a_{4}

appear in two clusters

\{a_{1}, a_{3}, a_{4}\}

and

\{a_{2}, a_{3}, a_{4}\}

, while each of the agents

a_{1}

and

a_{2}

appears in only one of these two clusters. So, we assume that the agents

a_{3}

and

a_{4}

are predecessors and successors of each other and both are predecessors of the agents

a_{1}

and

a_{2}

.

Finally, the agent

a_{8}

does not appear in any cluster, so we assume that this agent is a successor of all other agents.

Associating the agents with the vertices and the relations between the agents with the edges, one obtains the directed graph

G = (V, E)

shown in Figure 3.

Similarly to Table 1, the sets of input and output vertices in this graph are presented in Table 5.

Following this graph, the clear leader is the agent

a_{4}

associated with the vertex

v_{4}

and additional leaders are the agents

a_{1}

and

a_{2}

associated with the vertices

v_{1}

and

v_{2}

. At the same time, according to the structure of the graph, none of the agents is a dictator or monarch.

Further application of majority voting in the group

A^{*} = \{a_{1}, a_{2}, a_{4}\}

of leading agents results in the classification that is correct for items

x_{1}

,

x_{2}

,

x_{3}

and

x_{5}

, is incorrect for items

x_{6}

,

x_{7}

and

x_{8}

, and with probability

\frac{1}{3}

can be correct for each of the items

x_{4}

and

x_{9}

.
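This outcome can be reproduced with a short sketch of plurality voting over the labels that the agents a_1, a_2, and a_4 assign in Table 3. Breaking ties uniformly at random among the most frequent labels is an assumption, chosen because it is consistent with the probability 1/3 stated above; the function names are illustrative.

```python
from collections import Counter

# item index -> labels given by a_1, a_2, a_4 (columns of Table 3)
LABELS = {1: (1, 1, 4), 2: (2, 2, 4), 3: (3, 3, 1),
          4: (5, 1, 4), 5: (5, 2, 5), 6: (5, 2, 2),
          7: (4, 3, 5), 8: (2, 1, 2), 9: (5, 1, 4)}
# item index -> correct class (last column of Table 3)
CORRECT = {1: 1, 2: 2, 3: 3, 4: 4, 5: 5, 6: 1, 7: 2, 8: 3, 9: 4}

def plurality_winners(votes):
    """Labels that receive the largest number of votes (ties included)."""
    counts = Counter(votes)
    top = max(counts.values())
    return sorted(label for label, count in counts.items() if count == top)

def probability_correct(votes, correct):
    """Chance of the correct label under a uniformly random tie-break."""
    winners = plurality_winners(votes)
    return 1 / len(winners) if correct in winners else 0.0

result = {x: probability_correct(LABELS[x], CORRECT[x]) for x in LABELS}
always = [x for x, q in result.items() if q == 1.0]
never = [x for x, q in result.items() if q == 0.0]
by_chance = [x for x, q in result.items() if 0.0 < q < 1.0]
print(always, never, by_chance)
# prints: [1, 2, 3, 5] [6, 7, 8] [4, 9]
```

For the items x_4 and x_9 the three leaders give three different labels, one of which is correct, which yields the probability 1/3; for x_7 the three labels also differ, but none of them is correct.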

7. Conclusions

The paper considered the problem of distinguishing the leaders in the group of autonomous agents.

In the paper, we suggested a definition of the leading agents, which are the agents that maximally divide the group. For calculating the distances between the subgroups of the agents, we use the entropy-based Rokhlin metric, which was extended for measuring the distances between the covers of the sets.

In the framework of classification problems, the paper considers the relationship between the competent agents and the leading agents and presents an example of distinguishing the leaders based on their expertise in certain fields of knowledge.

The suggested method can be used in programming the division of labor in the swarm activity dynamics and in the analysis of the data fusion in the records obtained by the wisdom of the crowd techniques.

Further research will include verification of the method on a wider range of data and consideration of the relations between the properties of the graphs, the groups of the distinguished leading agents, and the levels of their expertise.

Author Contributions

Conceptualization, E.K. and I.B.-G.; methodology, E.K. and I.B.-G.; software, E.K.; formal analysis, E.K.; investigation, E.K.; writing—original draft preparation, E.K.; writing—review and editing, I.B.-G.; supervision, I.B.-G.; project administration, I.B.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was partially supported by the Koret Foundation’s “Digital Living 2030” fund.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable to this article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Garcia-Molina, H. Elections in a distributed computing system. IEEE Trans. Comput. 1982, 31, 48–59.
2. Kim, T.W.; Kim, E.H.; Kim, J.K.; Kim, T.Y. A leader election algorithm in a distributed computing system. In Proceedings of the Fifth IEEE Computer Society Workshop on Future Trends of Distributed Computing Systems, Cheju, Republic of Korea, 28–30 August 1995; pp. 481–485.
3. Guo, M.; Tumova, J.; Dimarogonas, D.V. Cooperative decentralized multi-agent control under local tasks and connectivity constraints. In Proceedings of the 53rd IEEE Conference on Decision and Control, Los Angeles, CA, USA, 15–17 December 2014; pp. 75–80.
4. Clark, A.; Bushnell, L.; Poovendran, R. On leader selection for performance and controllability in multi-agent systems. In Proceedings of the 2012 IEEE 51st IEEE Conference on Decision and Control (CDC), Maui, HI, USA, 10–13 December 2012; pp. 86–93.
5. Walker, P.; Amraii, S.A.; Lewis, M.; Chakraborty, N.; Sycara, K. Control of swarms with multiple leader agents. In Proceedings of the 2014 IEEE International Conference on Systems, Man, and Cybernetics (SMC), San Diego, CA, USA, 5–8 October 2014.
6. Fitch, K. Optimal Leader Selection in Multi-Agent Networks: Joint Centrality, Robustness and Controllability. Ph.D. Thesis, Princeton University, Princeton, NJ, USA, 2016.
7. Lewkowicz, M.A.; Agarwal, R.; Chakraborty, N. Distributed algorithm for selecting leaders for supervisory robotic swarm control. In Proceedings of the 2019 International Symposium on Multi-Robot and Multi-Agent Systems (MRS), New Brunswick, NJ, USA, 22–23 August 2019; pp. 112–118.
8. Whitehill, J.; Ruvolo, P.; Wu, T.; Bergsma, J.; Movellan, J. Whose vote should count more: Optimal integration of labels from labelers of unknown expertise. Adv. Neural Inf. Process. Syst. 2009, 22, 2035–2043.
9. Hamada, D.; Nakayama, M.; Saiki, J. Wisdom of crowds and collective decision making in a survival situation with complex information integration. Cogn. Res. 2020, 5, 48.
10. Dawid, A.P.; Skene, A.M. Maximum likelihood estimation of observer error-rates using the EM algorithm. J. Roy. Stat. Soc. Ser. C 1979, 28, 20–28.
11. Sinha, V.B.; Rao, S.; Balasubramanian, V.N. Fast Dawid-Skene: A fast vote aggregation scheme for sentiment classification. In Proceedings of the 7th KDD Workshop on Issues of Sentiment Discovery and Opinion Mining, London, UK, 20 August 2018.
12. Ghanaiem, A.; Kagan, E.; Kumar, P.; Raviv, T.; Glynn, P.; Ben-Gal, I. Unsupervised classification under uncertainty: The distance-based algorithm. Mathematics 2023, 11, 4784.
13. Ratner, N.; Kagan, E.; Kumar, P.; Ben-Gal, I. Unsupervised classification for uncertain varying responses: The wisdom-in-the-crowd (WICRO) algorithm. Knowl.-Based Syst. 2023, 272, 110551.
14. Kagan, E.; Novoselsky, A. Unsupervised classification by iterative voting. Crowd Sci. 2023, 7, 63–67.
15. Galton, F. Vox populi. Nature 1907, 75, 450–451.
16. Aronson, E. The Social Animal, 10th ed.; Aronson, J., Ed.; Worth Publishers: New York, NY, USA, 2008.
17. Asch, S. Effects of group pressure upon the modification and distortion of judgments. In Groups, Leadership, and Men; Guetzkow, H., Ed.; Carnegie Press: Lancaster, PA, USA, 1951; pp. 117–190.
18. Asch, S. Opinions and social pressure. Sci. Am. 1955, 193, 31–35.
19. Peterson, J. 12 Rules for Life: An Antidote to Chaos; Random House: Toronto, ON, Canada, 2018.
20. Cormen, T.; Leiserson, C.; Rivest, R. Introduction to Algorithms; The MIT Press: Cambridge, MA, USA, 1990.
21. Ore, O. Theory of Graphs; American Mathematical Society: Providence, RI, USA, 1962.
22. Rokhlin, V.A. Lectures on the entropy theory of measure-preserving transformations. Russ. Math. Surv. 1967, 22, 1–52.
23. Ornstein, D.S. Measure preserving transformations and random processes. Am. Math. Mon. 1971, 78, 833–840.
24. Ornstein, D.S. Ergodic Theory, Randomness, and Dynamical Systems; Yale University Press: New Haven, CT, USA; London, UK, 1974.
25. Kagan, E.; Ben-Gal, I. Probabilistic Search for Tracking Targets; Wiley & Sons: Chichester, UK, 2013.
26. Sinai, Y.G. Introduction to Ergodic Theory; Princeton University Press: Princeton, NJ, USA, 1977.
27. Sinai, Y.G. Topics in Ergodic Theory; Princeton University Press: Princeton, NJ, USA, 1994.
28. Bouhours, D. The Arts of Logick and Rhetorick, Illustrated by Examples Taken out of the Best Authors, Antient and Modern, in All the Polite Languages, Interpreted and Explain’d by That Learned and Judicious Critick; Clark, J., Hett, R., Pemberton, J., Ford, R., Gray, J., Eds.; ECCO Print: London, UK, 1728.

Figure 1. Example of the graph defining communication between the agents.

Figure 2. Example of the predecessors’ and successors’ trees: (a) the predecessors’ tree

\overset{⃐}{T} (v_{8})

and (b) the successors’ tree

\overset{⃑}{T} (v_{8})

of the vertex

v_{8}

in the graph shown in Figure 1.

Figure 3. Graph of communication between the agents with respect to the agents’ clustering.

Table 1. The sets of input and output vertices in the graph

G

.

Vertex	$v_{1}$	$v_{2}$	$v_{3}$	$v_{4}$	$v_{5}$	$v_{6}$	$v_{7}$	$v_{8}$	$v_{9}$
Input set	$\emptyset$	$\{v_{1}\}$	$\emptyset$	$\{v_{1}, v_{2}\}$	$\{v_{3}, v_{4}, v_{8}\}$	$\{v_{5}, v_{8}\}$	$\{v_{4}\}$	$\{v_{7}, v_{9}\}$	$\emptyset$
Output set	$\{v_{2}, v_{4}\}$	$\{v_{4}\}$	$\{v_{5}\}$	$\{v_{5}, v_{7}\}$	$\{v_{6}\}$	$\emptyset$	$\{v_{8}\}$	$\{v_{5}, v_{6}\}$	$\{v_{8}\}$

Table 2. Example of

n = 12

items distributed by

l = 9

agents over

m = 4

classes. Partitions

γ_{k}

,

k = 1,2, \dots, 9

represent the agents’ classifications; partition

\overset{ˇ}{γ}

represents correct classification; and partition

γ_{P l}

represents the result of plurality voting.

	$γ_{1}$	$γ_{2}$	$γ_{3}$	$γ_{4}$	$γ_{5}$	$γ_{6}$	$γ_{7}$	$γ_{8}$	$γ_{9}$	$\overset{ˇ}{γ}$	$γ_{P l}$
$x_{1}$	2	2	2	4	3	1	1	3	4	2	2
$x_{2}$	1	3	1	4	3	1	1	2	3	1	1
$x_{3}$	3	2	2	1	1	1	2	1	3	2	1
$x_{4}$	4	4	4	3	1	3	4	3	1	3	4
$x_{5}$	4	4	4	3	2	3	4	3	2	3	4
$x_{6}$	1	3	1	1	2	2	2	3	1	1	1
$x_{7}$	4	3	3	1	4	4	4	4	3	4	4
$x_{8}$	1	3	1	2	2	2	1	2	3	1	2
$x_{9}$	3	3	4	2	4	4	2	4	4	4	4
$x_{10}$	3	2	2	1	1	1	2	1	1	2	1
$x_{11}$	4	3	4	3	2	3	4	3	3	3	3
$x_{12}$	3	3	3	1	4	4	1	3	4	4	3

Table 3. Example of

n = 9

items distributed by

l = 8

agents over

m = 5

classes. Partitions

γ_{k}

,

k = 1,2, \dots, 8

represent the agents’ classifications and partition

\overset{ˇ}{γ}

represents correct classification.

	$γ_{1}$	$γ_{2}$	$γ_{3}$	$γ_{4}$	$γ_{5}$	$γ_{6}$	$γ_{7}$	$γ_{8}$	$\overset{ˇ}{γ}$
$x_{1}$	$1$	$1$	$2$	$4$	$3$	$5$	$4$	$3$	$1$
$x_{2}$	$2$	$2$	$3$	$4$	$3$	$4$	$2$	$5$	$2$
$x_{3}$	$3$	$3$	$2$	$1$	$4$	$2$	$5$	$2$	$3$
$x_{4}$	$5$	$1$	$4$	$4$	$3$	$3$	$2$	$1$	$4$
$x_{5}$	$5$	$2$	$5$	$5$	$1$	$4$	$4$	$3$	$5$
$x_{6}$	$5$	$2$	$2$	$2$	$3$	$1$	$1$	$4$	$1$
$x_{7}$	$4$	$3$	$1$	$5$	$2$	$2$	$4$	$4$	$2$
$x_{8}$	$2$	$1$	$5$	$2$	$3$	$2$	$1$	$2$	$3$
$x_{9}$	$5$	$1$	$2$	$4$	$4$	$4$	$3$	$2$	$4$

Table 4. Clusters of the agents with respect to the number of agreements about the classes of the items.

	Agent’s Cluster	Chosen Class Number
$x_{1}$	$\{a_{1}, a_{2}\}$	1
$x_{2}$	$\{a_{1}, a_{2}, a_{7}\}$	2
$x_{3}$	$\{a_{1}, a_{2}\}$	3
$x_{4}$	$\{a_{5}, a_{6}\}$	3
$x_{5}$	$\{a_{1}, a_{3}, a_{4}\}$	5
$x_{6}$	$\{a_{2}, a_{3}, a_{4}\}$	2
$x_{7}$	$\{a_{5}, a_{6}\}$	2
$x_{8}$	$\{a_{5}, a_{6}\}$	3
$x_{9}$	$\{a_{4}, a_{5}, a_{6}\}$	4

Table 5. The sets of input and output vertices in the graph

G

shown in Figure 3.

Vertex	$v_{1}$	$v_{2}$	$v_{3}$	$v_{4}$	$v_{5}$	$v_{6}$	$v_{7}$	$v_{8}$
Input set	$\{v_{2}, v_{3}\}$	$\{v_{1}, v_{4}\}$	$\{v_{4}\}$	$\{v_{3}, v_{5}, v_{6}\}$	$\{v_{6}\}$	$\{v_{5}\}$	$\{v_{1}, v_{2}\}$	$V \setminus \{v_{8}\}$
Output set	$\{v_{2}, v_{7}, v_{8}\}$	$\{v_{1}, v_{7}, v_{8}\}$	$\{v_{1}, v_{4}, v_{8}\}$	$\{v_{2}, v_{3}, v_{8}\}$	$\{v_{4}, v_{6}, v_{8}\}$	$\{v_{4}, v_{5}, v_{8}\}$	$\{v_{8}\}$	$\emptyset$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
