1. Introduction
The formation of groups with shared beliefs, opinions, and views has been and continues to be a topic of great interest, discussed in sociology, political science, communication, and organizational science, as well as studies focusing on structure of science (see, e.g., [
1,
2,
3,
4,
5] for the diversity of topics and areas of studies). In all these cases, one key issue is to understand the dynamics which drive the segregation and consolidation of groups, even in conditions where communication and sharing of beliefs is common and frequent (for reviews, see [
3,
4,
5]).
The computational modeling of opinion group formation [
5,
6,
7], the formation of collaborative groups and collective decision making [
2,
8,
9], as well as disciplinary fragmentation and progress in science [
10], has shed light on the social dynamics behind group formation, segregation, and consolidation, often revealing the unexpectedly simple but self-reinforcing interactions behind such complex phenomena. Consequently, consensus group formation and opinion adoption, and their change and evolution, have been modeled using a variety of idealized models, many of them founded on one or another theoretical view about social influence and interaction or social learning, some of them being computational renderings of empirical findings. The models of opinion dynamics are as diverse as are the their theoretical underpinnings and intended scopes of applications. However, many of the models of opinion dynamics, where formation of consensus groups is of interest, can be classified in three groups [
3]: (1) models of assimilative social influence; (2) models with similarity biased influence, and; (3) models with repulsive influence. In addition, it is possible to recognize models that are hybrids of these three classes [
3]. Different well-known models in each class, their theoretical background and motivation, and most well-known computational implementations and empirical applications and justification (when it exists) are reviewed in detail elsewhere [
3,
4,
5], and therefore we provide here only a brief summary of the most important aspects of models with similarity bias to the extent needed to put in perspective the model introduced here.
Models with similarity bias do not assume a structurally fixed connection between agents, but instead, agents can interact if they have sufficiently similar opinions (or beliefs) or if they are sufficiently similar regarding some other pertinent feature. If the similarity (opinions or other preferred feature) is too distant, interactions are no longer possible. Such a threshold of interaction can be interpreted as a confidence to interact and modify one’s opinion and can be assumed to arise from a variety of psychological or sociological reasons [
3,
4,
5]. Therefore, many similarity bias models are referred to as bounded confidence models [
11,
12] owing to the existence of a kind of a confidence threshold to interact. These models typically give rise to persistent opinion clusters, where agents are similar and dissimilar to agents in other clusters. In these models, the cluster formation is an outcome of similarity bias homophily; the more stringent the threshold for required similarity, the more numerous are the segregated non-interacting clusters. In most bounded confidence models, the threshold is sharp and deterministic, and stochastic variation is included by adding a stochastic noise [
11,
12]. Interestingly, the addition of noise may fundamentally affect the consensus cluster formation [
3,
4,
5]. The role of noise and its non-trivial effects in deterministically thresholded bounded confidence models suggests that to model the bounded regions of interactions due to similarity bias, genuinely stochastic and probabilistic rules to decide whether or not the interactions happen are sometimes preferable [
13,
14]. Finally, important and interesting recent generalizations of bounded confidence models are models where opinions or beliefs are multidimensional [
15,
16,
17], or in one way or another, more complex structures of beliefs and related opinion elements [
2,
7,
9,
18]. Such models allow more realistic modeling of complex opinions, give rise to richer dynamics than one-dimensional bounded confidence models, and, moreover, the emerging clusters of opinions are in these cases more diverse than those found in one-dimensional models.
Paralleling the above briefly summarized recent generalizations of similarity-biased models of consensus formation, the agent-based model (ABM) proposed here is meant to be another step towards generalizations applicable to model complex sets of beliefs, where interactions between agents and dynamics of the change in beliefs are modeled as a genuine stochastic process. The model, however, adopts the simple view where agents modify their views by acquiring and accommodating their sets of views present in to a collection of all agents; no new elements emerge.
The ABM proposed here, given its restriction to describe the creation of new elements of opinion beliefs, and instead describing the evolution of consensus groups with complex sets of beliefs, has two possible areas of applications. One area of interest is the formation of disciplinary scholarly groups and schools, along with their characteristic ways of using scientific terms and forming various research programs [
19,
20,
21,
22]. Research fields always contain disciplinary groups where key scientific terms differ, and the same terms may be used differently in discussing and framing the key problems within the field [
21,
22]. In this case, within the established paradigms of research, new disciplinary groups are often formed within the existing fields. Such strong disciplinary fragmentation seems to be particularly apparent and typical in the human and behavioral sciences [
23,
24,
25]. Another situation of interest, where the creation and generation of new knowledge is not necessarily of primary interest, but where differing disciplinary views about thematic topics can be recognized, is related to the disciplinary views of science education scholars [
26,
27] as well as science students, where student groups may have consensus views that differ from those of other student groups views, even when they have used the same study materials [
28,
29]. To address situations in which the knowledge or meaning structures of interest are complex systems of terms, concepts, or conceptions characteristic of the disciplinary group, it seems appropriate to use the expression “webs of beliefs” (compare e.g., ref. [
30]), to be referred to briefly as WoBs in what follows. In this study, we approach such a disciplinary group formation through convergence and consolidation of WoBs, where the dynamics are driven by similarity and consensus seeking, without posing explicit bounds of confidence to constrain interactions (as in so-called bounded confidence models, [
1,
2,
3]). In the ABM presented here, agents possess generic WoBs in the form of a complex network. Agents compare their WoBs and exchange bits and pieces of them, guided by similarity-seeking dynamics (i.e., homophily). The comparison-triggered adjustment of WoBs leads to their convergence but also divergence, and thus to the formation of segregated disciplinary consensus groups. The model is highly idealized, generic, and simple, but as discussed in the final section, it has features that resemble the real situations of disciplinary group formation. Thus, the model presented here is a first attempt towards agent-based modeling of consensus group formation where views and opinions are complex webs of beliefs.
3. Results
The simulations are carried out for 25 agents, which possess different but partially overlapping WoBs about (here = 120 and = 20) edges, which are randomly selected parts of the same template network consisting of 200 vertexes (i.e., nodes) and 400 edges (i.e., links). However, only 2-core (i.e., each vertex has at least two edges attached) and fully connected WoBs are included as initial states, and thus the initial number of vertexes in the WoBs, when pruned to connected networks, varies roughly from 40 to 100 (with a few exceptional cases of 20 to fewer than 40 vertexes). The parameter affecting mostly the evolution of WoBs during the simulations is the sensitivity to similarity of partners (i.e., sensitivity to homophily). Note that simulations are carried out for this parameter in the range from 10 to 300, but results are reported for scaled values to allow easy presentation in logarithmic scale. The parameters and are the next most important in affecting the outcome of simulations. The effect of these parameters is tested by keeping ratio fixed and changing the absolute values by by scaling them with a factor of = 0.8 and 1.2.
In simulations with unfolding interaction events , the agent selects another agent to compare and adjust its WoB to better match the other agent’s WoB. The selection probability of the agent to interact depends on the similarity between the agents and the agent’s sensitivity to similarity in that selection. In comparison events of WoBs, interacting agents either adopt or delete an edge to increase their mutual similarity.
In
Figure 1, examples of initial and final WoBs are shown. In all cases, the number of vertexes
V (and edges) increase because the utility function favors the addition of edges and vertexes, thus favoring the growth of WoBs. However, initial and resulting final WoBs may have quite different sizes. Note that the initial WoBs are always 2-core networks, but during their evolution, singly connected vertexes may appear. In the final state ensemble, averaged values characterizing WoBs are not changing anymore, although slight changes in details of the structure may take place.
The dynamic changes in WoBs, driven by the similarity bias of interactions, eventually leads to a formation of consensus clusters. In
Figure 2, an example of a similarity cluster in the initial stage is shown, and then in the stabilized final stage for agents making confident choices of interaction partners with
(recall that this is a scaled value
). The stabilization of the cluster distributions with
, as shown in
Figure 2b, is obtained with about 30,000 interaction events in a group of 25 agents, each of them possessing about 100 edges on average (originating from a sampled set 120 edges on average). Roughly, this means that agents need about 10 interaction events for each edge if they are to reach consensus states. Note that in what follows, we report interaction events
scaled by a factor of 1000.
In
Figure 2a,b, clusters are shown as a density plot of effective similarity
S versus the (geometric) average size
of the WoB, normalized to a maximum
, with both values discretized into bins of 0.01 ranging from 0 to 1. It is seen how the initial diffuse cluster (
Figure 2a) segregates and consolidates to several smaller ones; in practice, six separate clusters (
Figure 3a), as resolved by
DBSCAN. In the case shown in
Figure 2b, the
DBSCAN routine finds seven or eight clusters (depending on parameters), but due to thresholding, which ignores very low-populated clusters, six remain to be counted as significant clusters. In
Figure 2a, DBSCAN detects two clusters. In
Figure 2c,d, the same case is shown, but now as a density plot of edge similarity
and vertex similarity
of the WoB. The similarities and sizes shown in
Figure 2 are aggregated over the last few stable ensembles corresponding to the range
.
In reading the relevant information from
Figure 2, it should be noted that while the high-similarity groups are clearly seen, so are the groups where agents are dissimilar (one agent can well belong to both groups, since similarity is a pairwise property). Therefore, we see not only groups where similarity has increased but also groups where it has decreased. Such behavior is a natural hallmark of the segregation of consensus groups.
Consensus cluster formation proceeds gradually from the diffuse initial state to a stabilized final state, depending on the number of interaction events.
Figure 3 shows the dynamics of cluster formation as it is monitored through number of clusters
N (
Figure 3a–d), relative occupancy (average fraction of agents in a cluster)
R (
Figure 3e–h), and average similarity
S (
Figure 3i–l) of clusters, as well as through entropy
H of cluster distribution (
Figure 3m–p). Results are shown for different strengths of sensitivity to similarity
(scaled by a factor of 10) from high (
) and to low sensitivity (
) as a function of update events
(scaled by a factor of 1000). For
and
, results remain essentially the same as for the corresponding limiting values shown in
Figure 3. In all cases, dynamic behavior can be fitted to an exponentially decaying function (exponential fits are shown in
Figure 3 but are barely visible).
The average number
N of consensus clusters in the final, stabilized state (
Figure 3a–d) depends on the sensitivity
of agents to similarity in making choices to interact with other agents. For high-sensitivity
(scaled), the number of clusters in stabilized state is on average about six (see
Figure 3a), but fluctuations as measured by the standard deviation of the ensemble averages are large. The relative occupation
R of clusters shown in
Figure 3e is on average about 0.30, corresponding to six or seven agents in a cluster. Increasing the level of sensitivity by increasing values of
does not change the situation, and thus results for
appear to represent the most extreme segregation found in the group of 25 agents. As seen later, changing the number of edges and vertexes in initial WoBs about by
also leaves the number of clusters in this case nearly intact. When the sensitivity to similarity in making choices becomes smaller, and values of
are reduced (
Figure 3b–d), the number of clusters decreases steadily, reaching the lowest attainable values of about two clusters on average for
. Additional reduction in
appears not to lead to a definite monocluster situation, although single clusters become more abundant. As is seen from
Figure 3f–h, the relative occupancy
R of clusters follows roughly inversely the behavior of the average number of clusters, so that the product
remains roughly a constant, indicating relatively uniform distribution of agents in different clusters. In all cases, the similarity
S (
Figure 3i–l) has nearly the same average value, owing to the choice of parameter
controlling the ratio of utility of addition to deletion. The larger the value of
, the larger the bias towards addition, and thus growth of clusters. The choice to keep the bias from growth moderate, and average similarity close to a constant value, makes the interpretation of consensus cluster formation easier and rules out the possibility that the formation of high-similarity clusters is mainly due to bias towards systematically higher average similarities.
The entropy
H of the cluster distribution (
Figure 3m–p) relaxes considerably more slowly to a stable value in comparison with cluster number
N and relative occupancy
R. This indicates that consolidation of clusters and segregation of clusters occurs only partly simultaneously, and within the clusters, WoBs continue to evolve more similarly, as also indicated by the slow convergence of similarity
S; clusters consolidate without further segregation.
The stabilized, final values of cluster number
N, entropy
H, and the average occupancy
R as a function of the sensitivity
is shown in
Figure 4 for a range of values from
to
, on a (natural) log-scale. It is now seen (
Figure 4a) that if the sensitivity to similarity is high enough (
), cluster formation takes place, and in the group of 25 agents about 6–7 are formed, with roughly equal numbers of 5–6 agents (
) in a cluster (
Figure 3c), with entropy
, which roughly corresponds to cases such as the one shown in
Figure 2b. However, in all cases, variations around mean values are large, as indicated by the error bars showing the standard deviation of values. In all cases, a transition to single cluster takes place around the value
. After the transition, only two clusters are obtained on average, with high entropy (
), roughly corresponding to two large and diffuse clusters of similar type as an initial cluster, shown in
Figure 2a.
The results in
Figure 4 show that the high sensitivity of agents to similarity in selecting similar partners (i.e., strong homophily) for interaction and for updating their WoBs invariably leads to the formation of segregated consensus groups. In addition to parameter
regulating the sensitivity to similarity, the parameter
for regulating agents’ confidence in decisions to add or delete edges affects the dynamics. Here, the value
is chosen to maintain the average similarity in stabilized stage nearly a constant. Parameter
regulates mainly the number of events needed for the relaxation and progress in harvesting new vertexes and edges. At low values of
, there is no average growth in similarity, at high values of beta
in the region of a confident decision to add or delete, the growth is maximal, constrained only by the new options allowed for the addition of edges within the consensus clusters. In particular,
has no effect on the transition from single to multiple clusters.
Finally, the effect of the initial number of edges
in WoBs and their variation
is checked by changing their absolute values by
, scaling both values by a factor of
= 0.8 and 1.2 but keeping the ratio
fixed.
Figure 5 shows the evolution of cluster number
N (
Figure 5a–c), occupancy
R (
Figure 5d–f), and entropy
H (
Figure 5g–i) for the altered configurations with high sensitivity
(the most interesting case with a high number of clusters) for values
(the upper row) and
to be compared with the results shown in
Figure 3, corresponding to
(reproduced in the middle row). The evolution of similarity is essentially similar to the results shown in
Figure 4 and is thus omitted in
Figure 5. As can be seen from the results, the increase in size of the WoB corresponding to factor
slightly increases the number of clusters, but relaxation to steady state then requires more interaction events. This is as expected, because agents now have more choices available (i.e., more edges and vertexes) to change their WoBs, and thus more comparison events are needed to reach consensus. When the initial WoBs are smaller, corresponding to the choice
, the number of consensus clusters is slightly smaller than for
, and the events needed for relaxation are fewer. Qualitatively, for other values of
, the results remain essentially similar to the cases shown in
Figure 3.
4. Discussion
In many respects, the ABM presented here describing the formation of consensus clusters resembles the so-called bounded confidence models (see e.g., [
1,
2,
3,
5] but differs from those models in two important ways. First, the ABM model describes the space of states of the agents (i.e., opinion-like states) as complex networks (WoBs) of elements related to each other, mimicking conceptual or semantic networks, not as discrete sets of choices or fixed to a few choices as in most bounded confidence models. To extend the applicability of consensus formation models provides understanding of not only choices of existing opinions (e.g., as in political or consumer choices) but allow the evolution of opinions. It is important to find ways to use flexible, dynamic, and changing states of agents, and furthermore, allow these states to be affected by choices that the agents make during the course of the unfolding interaction events. Such features provide a more realistic basis to model the formation of consensus groups in comparison with traditional models of predetermined fixed choices between different opinion states. Second, the ABM model introduced here does not assume a fixed bounded confidence criterion for the realization of the interactions. Instead, decisions to interact are made stochastically on the basis of utility-type evaluations of the prospect of increasing similarity. Therefore, there is no sharp exclusion of interactions, but instead, bias for similarity (homophily). In this respect, the WoBs introduced here parallel some other recent attempts to extend the consensus and opinion formation models [
7,
13].
Another set of models that resemble the ABM presented here are the so-called epistemic landscape models, which have found several potential applications in describing the formation and segregation of collaborative or consensus groups. The epistemic landscape models assume a fixed landscape of “knowledge” that agents explore [
42,
43] or closely related structures of fixed ground truths [
10] or epistemic landscapes with agents sensing the distance from the ground truths or the gradients toward them. Such models assume a fixed, pre-existing landscapes of “knowledge” to be explored by agents, and thus the outcomes of the exploration are more or less predetermined by the structure of the landscape and its gradients [
10,
42,
43]. The epistemic landscape models have turned out to be relevant to discussions of division of labor and how the ability of agents affects their collaboration, but such models do not easily yield to situations where dynamic changes in the landscape or the problem space are of interest.
The present ABM has two major limitations. First, the WoBs utilized in it are always substructures of more extensive templates. Although individual agents’ WoBs evolve, no new elements or connections are created, and WoBs will always remain as partial structures of the initial template. This, however, may not be a severe restriction in cases of intended applications where the targeted area of knowledge is opinions, views or belief of existing knowledge (i.e., discovery and creation of new knowledge is not in focus). Second, the WoBs and their update rules do not take into account the coherence of elements in the WoBs, nor requirements of coherence when elements are added or deleted. In some recent ABM approaches, the structure and coherence of agents’ opinions (or beliefs) are taken into account, allowing more realistic description of complex opinion and belief systems [
7,
44,
45]. The inclusion of coherence of belief elements as part of ABM is, however, not unproblematic; diverse opinions exist regarding how to implement that notion as part of the idea of network-like knowledge. In some views, cognitive dissonance is important [
46], and such aspects have been successfully implemented in ABM, where a model of cognitive (or conceptual) structures co-evolves with structural agents’ interactions [
44]. In the present model, the coherence of belief systems arises from its dovetailed network structure, which is also a form of coherence of knowledge (see e.g., [
47]). In that picture, adoption or deletion of nodes depends on their neighborhoods, where changes take place, always requiring the network to remain connected, but dissonant aspects of coherence are not taken into account. A lack of attention on dissonant connection is a clear restriction of the ABM introduced here.
Recent extensions of ABM have also included many other complex relevant features, for example: opinion dynamics making a separation between private and public opinions and the role of social hierarchies in interactions [
48]; prior beliefs and knowledge-making decisions to adopt new beliefs [
14,
49]; and collective cognitive alignment, when group members perceive and recall the information they receive in aligned ways [
50]. In addition to such features, it is possible to imagine many other socially important aspects the future ABM should take into account, for example, trust and its effects on social interactions [
51]. While it is unreasonable to assume that at present any ABM can meet such diverse requirements, it is useful to keep in mind that omitting such features unavoidably limits the applicability of any suggested model, but in different ways.
In the current simple and idealized model presented here, many advanced and complex features are omitted. The goal of the model is to take a moderate further step to use more complex belief systems in similarity-biased models to show that even in the case of WoB-like structures, robust consensus clusters are formed; segregation and formation of consensus groups may well arise from the distribution of continuously evolving systems describing complex WoBs. Moreover, the dynamics of the formation of disciplinary consensus groups based on WoBs need not be overly complex; simple reinforcing similarity-seeking dynamics may clarify many empirically found features of how disciplinary groups and their boundaries are formed.
The most obvious areas of applications of the present model are found at least in three special cases. First, in disputes regarding how to frame and understand the meaning of abstract words or terms central to given scientific paradigms, where paradigms have their own specialized lexicons. Different ways of using scientific terms and framing problems are an interesting area of applications related to the social nature of science, the role of scientific discourse, and the argumentation and agreement of truth of scientific claims, as discussed extensively in the philosophy of science [
19,
20,
21,
22], as well as finding a tenable grounding on empirical research about scientific activity and formation of disciplinary groups [
23,
24,
52,
53]. Second, the ABM presented here may find applications in making sense of the formation of disciplinary groups in scholarly disputes about a given pre-existing corpus of study, for example, in disputes in the humanistic sciences about how to interpret the work of some renowned but difficult to follow scholar (e.g., Hegel, Kant or Wittgenstein), where the amount of interpretative literature with differing and even opposing views may be extensive and exceed the amount of original work. Third, and perhaps with the most foreseeable practical utility, to describe group formation in learning situations, where a group of students tries to make sense of a limited amount of sources about a given topic; in the simplest case, using a single textbook (see e.g., [
28,
29,
35]. In such cases, due to sharing of views and opinions about same sources, one can nevertheless assume formation of different groups of consolidated views.
The possibility to connect the results of the current ABM more securely to existing empirical findings is not, however, straightforward. The ABM provides some insight structure and formation of discourse groups in learning and teaching, where groups of the size from three to about five to six students appear to perform the best [
54,
55,
56,
57], but in larger groups the phenomenon of isolation emerges [
58,
59]. While many unrelated factors (e.g., teaching arrangements and designs) affect group sizes, it is a plausible assumption that at least in the case of discourse groups, shared meanings of key terms and concepts is also a factor affecting group stability and how the number of students in a group evolves. For example, the isolation and break-up of small isolated groups from larger ones may well be a phenomenon related to similarity bias. In the case of scholarly disciplinary groups, empirical evidence of group sizes is more elusive and unclear. Scientometric analyses of the number of co-authors in publications provide some information, showing that small co-author groups are the most common, signaled as a fat-tailed distribution of number of authors [
60,
61]. This notion is supported also by results based on the size of scholarly groups that introduce (and thus use) new concepts, in which case small groups are also the most active [
62,
63]. Interestingly, reminiscent formation of disciplinary groups are found within focused research areas [
64], as well as in case of non-scientific but organized beliefs groups (i.e., religion), [
65]. In all these cases, however, the assumption that similarity bias guiding the formation of groups is a factor affecting the formation, segregation, and consolidation of groups is only tentative, although plausible. Without better knowledge of how, for example, that the changes in shared vocabulary or lexicons used by groups are correlated with group formation it is difficult to make more conclusive inferences on, or to propose more specific empirically testable hypotheses.
Nevertheless, due to the obvious tentativeness and generic nature of the results presented in this study, the present ABM model, where similarity is monitored through WoBs and where changes in WoBs are the basis of group formation dynamics, suggests that in future, more detailed studies, paying attention to correlations between consensus group formation and changes in WoBs or related structures may provide essentially new important insights on group dynamics. In that, ABM may also significantly guide designs of empirical research.