Defining, Detecting, and Characterizing Power Users in Threads
Abstract
:1. Introduction
- We propose a new definition of power users that requires them to have simultaneously very high values of degree, closeness, betweenness, and eigenvector centralities, which are the most classical centrality measures and whose properties are well known in SNA.
- We illustrate an approach to detect and characterize Threads power users that is tailored to the characteristics of this social network. To the best of our knowledge, this is the first attempt to study Threads power users in the literature.
- We provide an open dataset on Threads that can be used in the future by all researchers who want to investigate this social platform.
2. Related Literature
3. Methods
3.1. Description of Our Threads Dataset
3.2. Model Definition
3.3. Measures Adopted in Our Analysis
- The degree centrality of a node is defined as the number of arcs it has. In the case of directed networks, one can distinguish between the indegree centrality of a node, which is the number of arcs incoming into it, and the outdegree centrality of a node, which is the number of arcs outgoing from it. The higher the degree centrality, the more important the node.
- The closeness centrality of a node is defined as the inverse of its distance from other nodes. The higher the closeness centrality of a node, the more important it is.
- The betweenness centrality of a node is defined as the sum of the fractions of all-pairs shortest paths passing through it. The higher the betweenness centrality of a node, the more important it is.
- The eigenvector centrality of a node codifies the idea that the importance of a node depends on the number of arcs it has with other nodes and the importance of these nodes. Thus, the definition is recursive. The higher the eigenvector centrality of a node, the more important it is.
- The density of a network is the ratio of the number of real arcs to the number of potential arcs. The higher the density, the more connected the network.
- The average clustering coefficient of a network is equal to the average of the clustering coefficients of its nodes. The clustering coefficient of a node is given by the fraction of nodes connected to it by an arc that are also connected to each other. The higher the average clustering coefficient, the more connected the network.
- The average path length of a network is the average number of arcs that form the shortest paths between every pair of nodes in the network. The lower the average path length, the easier it is for information to flow through the network.
- The diameter of a network is the number of arcs that make up the shortest path between the two most distant nodes in the network; in other words, it is the number of arcs of the longest shortest path between a pair of nodes in the network. The smaller the diameter, the easier it is for information to flow through the network.
- A connected component of a network is a maximally connected subset. The maximum connected component of a network is the connected component with the largest number of nodes in the network. In directed networks, the maximum strongly connected component takes into account the direction of the arc when determining whether two nodes are connected. In contrast, the maximum weakly connected component does not take into account the direction of the arc when determining whether two nodes are connected, but only the existence of an arc between them.
- The normalized average degree of a node is defined as the ratio of the average degree to the number of nodes in the network. Its value is between 0 and 1. The higher its value, the more important a node is. This measure was introduced by us in this paper to take into account the size of the network when comparing the value of the average degree of nodes in different networks. In fact, the same average degree can have different implications for a very large network and a very small one.
3.4. Definition of Power Users
- Having a high indegree centrality, they have many users connected to them and are thus recognized as important reference points by other users;
- Having a high closeness centrality, they are connected to other Threads users by medium-to-short paths, so the information they transmit can reach these users very quickly;
- Having a high betweenness centrality, they are among the few strategic nodes that can carry information between different Threads subnetworks;
- Having a high eigenvector centrality, they are connected to several other equally central users in Threads; this allows us to hypothesize the presence of a backbone connecting power users. In the next section, we will see that this hypothesis is actually confirmed.
4. Results
4.1. Detection of Power Users in Threads
4.2. Characterization of Threads Power Users
- The density in is much higher than in (specifically, it is 56.31 times higher). This suggests that power users are much more interconnected than users.
- The average clustering coefficient in is much higher than in (specifically, it is 37.35 times higher). This indicates that power users are much more likely to form closed triads with each other than users are, which is another indicator that power users tend to interact with each other much more than users do.
- The normalized average indegree in is much higher than in (specifically, it is 55.99 times higher). This indicates that the tendency of power users to interact with other power users is much greater than the tendency of users to interact with other users.
- The average shortest path and diameter in are slightly smaller than those in . This indicates that information can flow a little better in than in .
“Extract the main topic of discussion from the text. One word, not too specific, general topic of discussion.”
- We took the 50 most frequent topics; the frequency of a topic is measured in terms of the number of users who published at least one post on it. These are the topics reported in Figure 8. We limited ourselves to this number of topics for computational reasons, and because the other topics had negligible numbers of associated users compared to them.
- We considered all 2450 topic pairs that could be obtained from them.
- For each pair, we determined the number of users that the two topics had in common.
- For all pairs with a number of users in common greater than 0, we computed the ratio of common power users to all common users (including power users).
- We averaged the values obtained in this way.
4.3. Discussion
4.3.1. Comparing Our Power Users with Those of the Other Approaches
4.3.2. Two Different Hypotheses About the Backbone of Power Users
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Mislove, A.; Marcon, M.; Gummadi, K.; Druschel, P.; Bhattacharjee, B. Measurement and analysis of online social networks. In Proceedings of the ACM SIGCOMM International Conference on Internet Measurement (IMC’07), San Diego, CA, USA, 24–26 October 2007; ACM: New York, NY, USA, 2007; pp. 29–42. [Google Scholar]
- Buntain, C.; Golbeck, J. Identifying Social Roles in Reddit Using Network Structure. In Proceedings of the International Conference on World Wide Web (WWW’14), Seoul, Republic of Korea, 7–11 April 2014; ACM: New York, NY, USA, 2014; pp. 615–620. [Google Scholar]
- Yadav, A.; Johari, R.; Dahiya, R. Identification of centrality measures in social network using network science. In Proceedings of the International Conference on Computing, Communication, and Intelligent Systems (ICCCIS’19), Greater Noida, India, 18–19 October 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 229–234. [Google Scholar]
- Bonifazi, G.; Cauteruccio, F.; Corradini, E.; Marchetti, M.; Terracina, G.; Ursino, D.; Virgili, L. A framework for investigating the dynamics of user and community sentiments in a social platform. Data Knowl. Eng. 2023, 146, 102183. [Google Scholar] [CrossRef]
- Howlader, P.; Sudeep, K. Degree centrality, eigenvector centrality and the relation between them in Twitter. In Proceedings of the International Conference on Recent Trends in Electronics, Information & Communication Technology (RTEICT’16), Bangalore, India, 20–21 May 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 678–682. [Google Scholar]
- Bonifazi, G.; Corradini, E.; Marchetti, M.; Sciarretta, L.; Ursino, D.; Virgili, L. A Space-Time Framework for Sentiment Scope Analysis in Social Media. Big Data Cogn. Comput. 2022, 6, 130. [Google Scholar] [CrossRef]
- Bonifazi, G.; Cauteruccio, F.; Corradini, E.; Marchetti, M.; Pierini, A.; Terracina, G.; Ursino, D.; Virgili, L. An approach to detect backbones of information diffusers among different communities of a social platform. Data Knowl. Eng. 2022, 140, 102048. [Google Scholar] [CrossRef]
- Bonifazi, G.; Breve, B.; Cirillo, S.; Corradini, E.; Virgili, L. Investigating the COVID-19 vaccine discussions on Twitter through a multilayer network-based approach. Inf. Process. Manag. 2022, 59, 103095. [Google Scholar] [CrossRef] [PubMed]
- Pierri, F.; Piccardi, C.; Ceri, S. A multi-layer approach to disinformation detection in US and Italian news spreading on Twitter. EPJ Data Sci. 2020, 9, 35. [Google Scholar] [CrossRef]
- Jeong, U.; Jiang, B.; Tan, Z.; Bernard, R.; Liu, H. BlueTempNet: A Temporal Multi-network Dataset of Social Interactions in Bluesky Social. IEEE Data Descr. 2024, 1, 71–79. [Google Scholar] [CrossRef]
- La Cava, L.; Mandaglio, D.; Tagarelli, A. Polarization in Decentralized Online Social Networks. In Proceedings of the International ACM Web Science Conference (WEBSCI’24), Stuttgart, Germany, 21–24 May 2024; ACM: New York, NY, USA, 2024; pp. 48–52. [Google Scholar]
- Bono, C.; Cava, L.L.; Luceri, L.; Pierri, F. An Exploration of Decentralized Moderation on Mastodon. In Proceedings of the International ACM Web Science Conference (WEBSCI’24), Stuttgart, Germany, 21–24 May 2024; ACM: New York, NY, USA, 2024; pp. 53–58. [Google Scholar]
- Bin Zia, H.; He, J.; Castro, I.; Tyson, G. Fediverse Migrations: A Study of User Account Portability on the Mastodon Social Network. In Proceedings of the International ACM Web Science Conference (WEBSCI’24), Stuttgart, Germany, 21–24 May 2024; ACM: New York, NY, USA, 2024; pp. 68–75. [Google Scholar]
- Wilson, C.; Boe, B.; Sala, A.; Puttaswamy, K.; Zhao, B. User interactions in social networks and their implications. In Proceedings of the ACM European Conference on Computer systems (EuroSys’09), Nuremberg, Germany, 1–3 April 2009; ACM: New York, NY, USA, 2009; pp. 205–218. [Google Scholar]
- Buccafurri, F.; Foti, V.; Lax, G.; Nocera, A.; Ursino, D. Bridge Analysis in a Social Internetworking Scenario. Inf. Sci. 2013, 224, 1–18. [Google Scholar] [CrossRef]
- Buccafurri, F.; Lax, G.; Nicolazzo, S.; Nocera, A. Comparing Twitter and Facebook user behavior: Privacy and other aspects. Comput. Hum. Behav. 2015, 52, 87–95. [Google Scholar] [CrossRef]
- Kratzer, J.; Lettl, C.; Franke, N.; Gloor, P. The social network position of lead users. J. Prod. Innov. Manag. 2016, 33, 201–216. [Google Scholar] [CrossRef]
- Yustiawan, Y.; Maharani, W.; Gozali, A. Degree centrality for social network with Opsahl method. Procedia Comput. Sci. 2015, 59, 419–426. [Google Scholar] [CrossRef]
- Alp, Z.; Öğüdücü, S. Identifying topical influencers on Twitter based on user behavior and network topology. Knowl.-Based Syst. 2018, 141, 211–221. [Google Scholar]
- Zareie, A.; Sheikhahmadi, A.; Jalili, M. Influential node ranking in social networks based on neighborhood diversity. Future Gener. Comput. Syst. 2019, 94, 120–129. [Google Scholar] [CrossRef]
- Huang, X.; Chen, D.; Wang, D.; Ren, T. Identifying influencers in social networks. Entropy 2020, 22, 450. [Google Scholar] [CrossRef] [PubMed]
- Subramani, N.; Easwaramoorthy, S.V.; Mohan, P.; Subramanian, M.; Sambath, V. A gradient boosted decision tree-based influencer prediction in social network analysis. Big Data Cogn. Comput. 2023, 7, 6. [Google Scholar] [CrossRef]
- Anastasiei, B.; Dospinescu, N.; Dospinescu, O. Word-of-mouth engagement in online social networks: Influence of network centrality and density. Electronics 2023, 12, 2857. [Google Scholar] [CrossRef]
- Bertoni, V.; Saurin, T.; Fogliatto, F. How to identify key players that contribute to resilient performance: A social network analysis perspective. Saf. Sci. 2022, 148, 105648. [Google Scholar] [CrossRef]
- Eskandanian, F.; Sonboli, N.; Mobasher, B. Power of the Few: Analyzing the Impact of Influential Users in Collaborative Recommender Systems. In Proceedings of the International ACM Conference on User Modeling, Adaptation and Personalization (UMAP ’19), Larnaca, Cyprus, 9–12 June 2019; ACM: New York, NY, USA, 2019; pp. 225–233. [Google Scholar]
- Wang, N.; Xie, W.; Tiberius, V.; Qiu, Y. Accelerating new product diffusion: How lead users serve as opinion leaders in social networks. J. Retail. Consum. Serv. 2023, 72, 103297. [Google Scholar] [CrossRef]
- Rehman, A.; Jiang, A.; Rehman, A.; Paul, A.; Din, S.; Sadiq, M. Identification and role of opinion leaders in information diffusion for online discussion network. J. Ambient. Intell. Humaniz. Comput. 2023, 14, 15301–15313. [Google Scholar] [CrossRef]
- Tsugawa, S.; Watabe, K. Identifying influential brokers on social media from social network structure. In Proceedings of the International AAAI Conference on Web and Social Media (ICWSM’23), Lymassol, Cyprus, 5–8 June 2023; Volume 17, pp. 842–853. [Google Scholar]
- Tsvetovat, M.; Kouznetsov, A. Social Network Analysis for Startups: Finding Connections on the Social Web; O’Reilly Media, Inc.: Sebastopol, CA, USA, 2011. [Google Scholar]
- Zhang, P.; He, Y.; Haq, E.; He, J.; Tyson, G. The Emergence of Threads: The Birth of a New Social Network. arXiv 2024, arXiv:2406.19277. [Google Scholar]
- Bonifazi, G.; Corradini, E.; Ursino, D. Definition of status and value assortativity in complex networks and their evaluation in Threads. Soc. Netw. Anal. Min. 2024, 14, 212. [Google Scholar] [CrossRef]
- Trifiro, B.; Clarke, M.; Huang, S.; Mills, B.; Ye, Y.; Zhang, S.; Zhou, M.; Su, C. Media moments: How media events and business incentives drive twitter engagement within the small business community. Soc. Netw. Anal. Min. 2022, 12, 174. [Google Scholar] [CrossRef] [PubMed]
- Borah, A.; Singh, S. Investigating political polarization in India through the lens of Twitter. Soc. Netw. Anal. Min. 2022, 12, 97. [Google Scholar] [CrossRef]
- Poulopoulos, V.; Wallace, M. Social Media Analytics as a Tool for Cultural Spaces—The Case of Twitter Trending Topics. Big Data Cogn. Comput. 2022, 6, 63. [Google Scholar] [CrossRef]
- Sahneh, E.; Nogara, G.; DeVerna, M.; Liu, N.; Luceri, L.; Menczer, F.; Pierri, F.; Giordano, S. The Dawn of Decentralized Social Media: An Exploration of the Bluesky Social Ecosystem. arXiv 2024, arXiv:2408.03146. [Google Scholar]
- Quelle, D.; Bovet, A. Bluesky: Network Topology, Polarisation, and Algorithmic Curation. arXiv 2024, arXiv:2405.17571. [Google Scholar] [CrossRef]
- Naseem, U.; Razzak, I.; Khushi, M.; Eklund, P.; Kim, J. Covidsenti: A large-scale benchmark Twitter data set for COVID-19 sentiment analysis. IEEE Trans. Comput. Soc. Syst. 2021, 8, 1003–1015. [Google Scholar] [CrossRef]
- Hamraoui, I.; Boubaker, A. Impact of Twitter sentiment on stock price returns. Soc. Netw. Anal. Min. 2022, 12, 28. [Google Scholar] [CrossRef]
- Alieva, I.; Ng, L.; Carley, K. Investigating the spread of Russian disinformation about biolabs in Ukraine on Twitter using social network analysis. In Proceedings of the IEEE International Conference on Big Data (BigData’22), Osaka, Japan, 17–20 December 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 1770–1775. [Google Scholar]
- Ferreira, C.; Murai, F.; Silva, A.; Almeida, J.; Trevisan, M.; Vassio, L.; Mellia, M.; Drago, I. On the dynamics of political discussions on instagram: A network perspective. Online Soc. Netw. Media 2021, 25, 100155. [Google Scholar] [CrossRef]
- Stoddart, M.; Koop-Monteiro, Y.; Tindall, D. Instagram as an Arena of Climate Change Communication and Mobilization: A Discourse Network Analysis of COP26. Environ. Commun. 2025, 19, 218–237. [Google Scholar] [CrossRef]
- Chang, M.; Yi, T.; Hong, S.; Lai, P.Y.; Jun, J.; Lee, J. Identifying museum visitors via social network analysis of Instagram. J. Comput. Cult. Herit. 2022, 15, 1–19. [Google Scholar] [CrossRef]
- Purba, K.; Asirvatham, D.; Murugesan, R. Influence maximization diffusion models based on engagement and activeness on instagram. J. King Saud-Univ.-Comput. Inf. Sci. 2022, 34, 2831–2839. [Google Scholar] [CrossRef]
- Kumar, S.; Mallik, A.; Khetarpal, A.; Panda, B. Influence maximization in social networks using graph embedding and graph neural network. Inf. Sci. 2022, 607, 1617–1636. [Google Scholar] [CrossRef]
- Bartolucci, S.; Caccioli, F.; Caravelli, F.; Vivo, P. Ranking influential nodes in networks from aggregate local information. Phys. Rev. Res. 2023, 5, 033123. [Google Scholar] [CrossRef]
- Wu, S.; Li, W.; Shen, H.; Bai, Q. Identifying influential users in unknown social networks for adaptive incentive allocation under budget restriction. Inf. Sci. 2023, 624, 128–146. [Google Scholar] [CrossRef]
- Bhadra, J.; Khanna, A.; Beuno, A. A Graph Neural Network Approach for Identification of Influencers and Micro-Influencers in a Social Network: Classifying influencers from non-influencers using GNN and GCN. In Proceedings of the International Conference on Advances in Electronics, Communication, Computing and Intelligent Information Systems (ICAECIS’23), Bangalore, India, 19–21 April 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 66–71. [Google Scholar]
- Iqbal, S.; Khan, R.; Khan, R.; Alarfaj, F.; Alomair, A.; Ahmed, M. Association Rule Analysis-Based Identification of Influential Users in the Social Media. Comput. Mater. Contin. 2022, 73, 6479–6493. [Google Scholar]
- Hasan, M.; Bakar, A.; Yaakub, M. Measuring user influence in real-time on twitter using behavioural features. Phys. A Stat. Mech. Its Appl. 2024, 639, 129662. [Google Scholar] [CrossRef]
- Karoui, W.; Hafiene, N.; Ben Romdhane, L. Machine learning-based method to predict influential nodes in dynamic social networks. Soc. Netw. Anal. Min. 2022, 12, 108. [Google Scholar] [CrossRef]
- Zar, J. Spearman rank correlation: Overview. In Wiley StatsRef: Statistics Reference Online; Wiley Online Library: Hoboken, NJ, USA, 2014. [Google Scholar]
Property | Value |
---|---|
Number of nodes | 45,349 |
Number of arcs | 72,333 |
Density | 0.000035 |
Average clustering coefficient | 0.000743 |
Diameter | 13 |
Average shortest path | 4.540 |
Maximum connected component’s size | 45,349 |
Average indegree | 1.595 |
Average outdegree | 1.595 |
Indegree assortativity | −0.042 |
Outdegree assortativity | −0.003 |
Centrality Measures | Percentage of Common Nodes |
---|---|
20-Top-D ∩ 20-Top-C | 10.31% |
20-Top-D ∩ 20-Top-B | 7.79% |
20-Top-D ∩ 20-Top-E | 7.05% |
20-Top-C ∩ 20-Top-B | 4.44% |
20-Top-C ∩ 20-Top-E | 3.78% |
20-Top-B ∩ 20-Top-E | 18.76% |
Mean Indegree | Median Indegree | |
---|---|---|
All users | 1.595 | 1 |
Power users | 19.076 | 5 |
Parameter | Value in | Value in |
---|---|---|
Number of nodes | 45,349 | 1176 |
Number of arcs | 72,333 | 2724 |
Density | 0.000035 | 0.001971 |
Average clustering coefficient | 0.000743 | 0.027748 |
Diameter | 13 | 12 |
Average shortest path | 4.540146 | 3.914330 |
Average indegree | 1.595 | 2.316 |
Normalized average indegree | 0.00003517 | 0.001969 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Bonifazi, G.; Buratti, C.; Corradini, E.; Marchetti, M.; Parlapiano, F.; Ursino, D.; Virgili, L. Defining, Detecting, and Characterizing Power Users in Threads. Big Data Cogn. Comput. 2025, 9, 69. https://doi.org/10.3390/bdcc9030069
Bonifazi G, Buratti C, Corradini E, Marchetti M, Parlapiano F, Ursino D, Virgili L. Defining, Detecting, and Characterizing Power Users in Threads. Big Data and Cognitive Computing. 2025; 9(3):69. https://doi.org/10.3390/bdcc9030069
Chicago/Turabian StyleBonifazi, Gianluca, Christopher Buratti, Enrico Corradini, Michele Marchetti, Federica Parlapiano, Domenico Ursino, and Luca Virgili. 2025. "Defining, Detecting, and Characterizing Power Users in Threads" Big Data and Cognitive Computing 9, no. 3: 69. https://doi.org/10.3390/bdcc9030069
APA StyleBonifazi, G., Buratti, C., Corradini, E., Marchetti, M., Parlapiano, F., Ursino, D., & Virgili, L. (2025). Defining, Detecting, and Characterizing Power Users in Threads. Big Data and Cognitive Computing, 9(3), 69. https://doi.org/10.3390/bdcc9030069