Twitter Big Data as a Resource for Exoskeleton Research: A Large-Scale Dataset of about 140,000 Tweets from 2017–2022 and 100 Research Questions
Abstract
:1. Introduction
- It presents a large-scale open-access Twitter dataset of 138,584 Tweets (including original Tweets, retweets, and replies) about exoskeletons posted on Twitter for a period of 5-years from 21 May 2017 to 21 May 2022. The dataset is available at https://dx.doi.org/10.21227/r5mv-ax79.
- Based on a comprehensive review of 108 emerging works in these fields, this paper discusses multiple interdisciplinary applications of this dataset and presents a list of 100 research questions for researchers to study, analyze, evaluate, ideate, and investigate based on this dataset.
2. Methodology
3. Results and Discussions
3.1. Hydrating the Dataset—Steps and Associated Results
- Download and install the desktop version of the Hydrator app from this website [67]. The version that was used for this work is v0.30.0.
- Click on the “Link Twitter Account” button on the Hydrator app to connect the app to an active Twitter account.
- Click on the “Add” button to add a new dataset comprising only Tweet IDs (Figure 1). Browse and select the file—“Exoskeleton_TweetIDs_Set2.txt” available on local storage.
- If the file upload is successful, the Hydrator app will show the total number of Tweet IDs present in the file. For this file—“Exoskeleton_TweetIDs_Set2.txt”, the app would show the number of Tweet IDs as 19,415.
- Provide details for the respective fields: Title, Creator, Publisher, and URL in the app, and click on “Add Dataset” to add this dataset to the app.
- The app would automatically redirect to the “Datasets” tab. Click on the “Start” button to start hydrating the Tweet IDs (Figure 2). During the hydration process, the progress indicator would increase, indicating the number of Tweet IDs that have been successfully hydrated and the number of Tweet IDs that are pending hydration.
- After the hydration process ends, a .jsonl file would be generated by the app that the user may choose to save. The app would also display a “CSV” button in place of the “Start” button. Clicking on this “CSV” button would generate a .csv file with detailed information about the Tweets, which would include the text of the Tweet, user ID, user name, retweet count, language, Tweet URL, source, and other public information related to the Tweet.
3.2. Statistical Analysis of the Dataset
4. Potential Applications and Research Directions
4.1. List of 100 Research Questions Related to Exoskeletons
- RQ1.
- Sentiment analysis [69] of these Tweets would help to identify the positive, negative, and neutral sentiments associated with these conversations about exoskeletons on Twitter.
- RQ2.
- Deep learning may be used to identify the emotional state of the users in terms of the basic emotional responses - fear, anger, joy, sadness, disgust, and surprise [70], at the time of posting of these respective Tweets.
- RQ3.
- Aspect-based sentiment analysis, along with tokenization and lemmatization of these Tweets [71], may be performed to identify the specific aspects or subject matters related to exoskeletons to which certain specific sentiments or emotional responses are associated.
- RQ4.
- Studying the trends in Tweet counts [72] and the associated sentiments and emotional responses to detect any correlations between the two.
- RQ5.
- Studying the word count of each of these Tweets [73] to determine if any correlation exists between the word count and the associated sentiments and emotional responses towards different exoskeleton products.
- RQ6.
- Investigating whether the number of replies or retweets of a Tweet [74] posted by a company to share the news about a new kind of exoskeleton could have a correlation with the influence and follower metrics of the company’s Twitter profile.
- RQ7.
- Detecting popular Tweets [75] related to exoskeletons and studying the subject matters and aspects mentioned in those Tweets by performing tokenization and lemmatization.
- RQ8.
- Detecting sarcasm [76] related to exoskeletons and studying for any correlation of sarcasm with the sentiment or emotional response associated with the respective Tweets.
- RQ9.
- Identifying the commonly used hashtags associated with Tweets related to exoskeletons and detecting the sentiment related to these respective hashtags [77].
- RQ10.
- Analyzing the Tweets posted by users of exoskeletons to detect the diverse user personas [78] and their associated perspectives, experiences, opinions, and feedback about exoskeletons.
- RQ11.
- Studying trending discussions [79] on Twitter related to exoskeletons and using machine learning algorithms to detect these trends in real-time.
- RQ12.
- Studying the trends in sentiments and emotional responses [80] associated with exoskeletons to track if there is any correlation of the same with the trends in exoskeleton sales or its market potential.
- RQ13.
- Investigating for any interdependence between Tweets about specific exoskeleton products and sales [81] of those specific exoskeleton products.
- RQ14.
- Performing topic modeling [82] of these respective Tweets to interpret the associated communication as news, recommendation, discussion, feedback, perspective, opinion, etc., related to different kinds of exoskeletons.
- RQ15.
- Investigating retweeting patterns of Tweets [83] to determine the interest in certain topics related to exoskeletons expressed in the respective Tweets.
- RQ16.
- Studying the Tweeting patterns and content of the Tweets by various companies or manufacturers of exoskeletons to understand their audience management methodologies which include targeting different audiences, concealing subjects, and maintaining authenticity [84].
- RQ17.
- Detecting the Point of Interest (P.O.I.) of a Tweet [85], which presents high-level location information about a place, to understand the location-specific opinions, perspectives, or attitudes of the public towards exoskeleton technology.
- RQ18.
- Developing a personalized Tweet recommendation system [86] that would present the latest developments in exoskeleton technology, including exoskeletons available for purchase to potential user groups, which may include the elderly, disabled, handicapped, etc.
- RQ19.
- RQ20.
- Performing semantic analysis of the content of each Tweet as per the methodology discussed in [89] to determine if any political leaders have influenced the sale or public opinion or perspectives towards a specific kind of exoskeleton.
- RQ21.
- Analysis of Tweets to determine the emergence of exoskeletons [90] in different fields such as healthcare and medicine.
- RQ22.
- Studying the mentions of exoskeleton companies in Tweets to determine the patterns of customer engagement [91] with each of these companies.
- RQ23.
- RQ24.
- Development of a Tweet ranking model to present important Tweets [94] to potential end-users or customers of exoskeletons based on their specific needs or interests.
- RQ25.
- Determining how official accounts on Twitter play a role in the propagation and correction of online rumors related to exoskeletons in different geographic locations [95].
- RQ26.
- RQ27.
- RQ28.
- Developing an approach to determine the audience size [100] of any potential Tweet related to a specific exoskeleton that might be helpful for the consumer outreach of exoskeleton companies and manufacturers.
- RQ29.
- Application of the gratification theory [101] on these Tweets to deduce the factors that gratify users related to different use-cases of exoskeletons.
- RQ30.
- Performing a study on the Tweets to investigate the role of news organizations, including regional media, local media, national media, and broadcast news agencies, in the dissemination of the latest developments [102] in the field of exoskeleton technology.
- RQ31.
- Determining the occupation of potential end-users of exoskeletons from their Tweets [103], which may be helpful for companies and/or manufacturers to develop or improvise exoskeletons to better assist these end-users in their respective professions.
- RQ32.
- Implementation of the TCV-Rank summarization technique for generating online summaries and historical summaries related to Tweets [104] about exoskeletons posted from different geographic regions.
- RQ33.
- Implementation of the TURank (Twitter User Rank) algorithm [105] to find authoritative Twitter users who post Tweets related to exoskeletons.
- RQ34.
- Studying the trends of entity linking [106] in Tweets about upcoming exoskeletons for different use-cases.
- RQ35.
- Investigating the impact of following clusters of exoskeleton users or exoskeleton companies [107] on the ideologies of the Twitter user(s) over exoskeletons.
- RQ36.
- Analysis of the Tweets centered around specific hashtags [108] related to exoskeletons to analyze the tweeting trends and replies related to these hashtags.
- RQ37.
- Implementation of the HybridSeg approach [109] to find the optimal segmentation of Tweets related to exoskeletons for improving segmentation quality as well as for exploring applications of this approach for named entity recognition.
- RQ38.
- Interpretation of the use of Twitter by companies or organizations in the exoskeleton industry to examine brand attributes (both product-related and non-product-related) and their relation to Twitter’s key engagement features (Reply, Retweet, Favorite) [110].
- RQ39.
- Development of an approach by application of the Latent Dirichlet Allocation (LDA) model as proposed in [111] to deduce the information credibility related to exoskeleton-based Tweets originating from different sources.
- RQ40.
- RQ41.
- Implementation of the Tweet2Vec method [114] for learning Tweet embeddings using character-level CNN-LSTM encoder-decoder for efficient categorization of Tweets centered around exoskeleton technologies in general or related to any specific exoskeleton technology.
- RQ42.
- Implementation of the Self-Exciting Point Process Model for Predicting Tweet Popularity (SEISMIC) model [115] to predict the popularity of Tweets related to exoskeletons.
- RQ43.
- Studying the geographic diffusion patterns in terms of random, local, and information brokerage of the information contained in a specific Tweet [116] related to exoskeletons and their diverse use cases.
- RQ44.
- Performing Tweet wikification [117] to identify different concepts mentioned in a Tweet to link these concepts to existing concepts about exoskeletons present in a knowledge base, such as Wikipedia.
- RQ45.
- RQ46.
- Development of an approach similar to the work in [120] for detection of complaints related to specific exoskeleton technologies.
- RQ47.
- RQ48.
- Application of the approach proposed in [123] to detect the patterns of emojis present in information-based Tweets about exoskeletons for the analysis of the relationships between plain texts and emojis usage in such Tweets.
- RQ49.
- RQ50.
- RQ51.
- Investigating the effect of tweeting about research papers [132] on exoskeletons on the downloads and citations of these respective papers.
- RQ52.
- Determining the social identities [133] of diverse users of exoskeletons based on the content and context of their Tweets.
- RQ53.
- Studying the relevance of a Tweet [134] about a specific exoskeleton based on the hyperlinked documents in the same.
- RQ54.
- Investigating how exoskeleton companies and/or manufacturers use tagging [135] on Twitter for audience engagement and retention.
- RQ55.
- Performing stance detection [136] towards exoskeletons by analyzing the Tweets posted by its users.
- RQ56.
- Interpretation of satire [137] in the context of Tweets about new and upcoming exoskeleton technologies.
- RQ57.
- Predicting the age of existing users or potential users of exoskeletons from their Tweets [138] to personalize the exoskeletons as per the age-specific needs.
- RQ58.
- Investigating the selective attention over different entities expressed in any Tweet pertaining to exoskeletons, as per the methodology proposed in [139].
- RQ59.
- Studying the paradigms of readability in Tweets posted by users of exoskeletons to interpret the degrees of engagement [140].
- RQ60.
- Deducing the best time to Tweet [141] any information related to exoskeletons that might be helpful for the sales and marketing team of exoskeleton companies and/or manufacturers.
- RQ61.
- Tracking repliers and retweeters of Tweets [142] about improvisations in existing exoskeletons posted by exoskeleton companies to detect degrees of intimacy with the target audience.
- RQ62.
- Detecting the number of Tweets [143] related to exoskeletons from a geographic area that could be helpful in understanding the associated needs or public perceptions of a specific exoskeleton-based technology available or marketed in that area.
- RQ63.
- Analyzing the multimodal factors that are associated with the retweet of any Tweet [144] communicating news about exoskeletons.
- RQ64.
- Using the concept of knowledge graphs for Tweet summarization for effectiveness in obtaining useful information [145] related to exoskeleton technologies on Twitter.
- RQ65.
- Recommendation of specific hashtags [146] related to exoskeletons to Twitter users who could be potential users of exoskeletons.
- RQ66.
- Performing contextualization of Tweets [147] related to exoskeletons based on hashtag performance prediction and multi-document summarization.
- RQ67.
- Assigning value to Tweets related to specific use cases of exoskeletons based on the approach proposed in [148] to compute the worth of the underlining Tweets.
- RQ68.
- Deducing the number of followers of exoskeleton companies from their Tweets [149] to determine their customer base.
- RQ69.
- Studying Tweets for the detection of suggestions and classifications of suggestions [150] related to existing and/or emerging technologies associated with exoskeletons.
- RQ70.
- Studying Tweets to interpret any forms of discrimination [151] faced by existing or potential users of exoskeletons.
- RQ71.
- Implementation of the iFACT framework [152] on Tweets associated with exoskeletons to identify, assess, and evaluate the underlying factual information mentioned in the Tweets.
- RQ72.
- Implementation of the SEDTWik framework [153] for segment-based detection of any kinds of events from Tweets that focus on the use of exoskeletons by diverse user groups.
- RQ73.
- Developing an approach as per [154] for followee recommendation to existing and/or potential users of exoskeletons based on topic extraction and sentiment analysis from the Tweets.
- RQ74.
- Studying Tweets for detecting stress levels and reasons for stress [155] in current or potential users of exoskeletons.
- RQ75.
- Detecting if any Tweet about exoskeletons posted by exoskeleton companies can be classified as a “regrettable” Tweet [156] so that these companies may delete the Tweet to reduce the chances of any potential damage to their reputation.
- RQ76.
- Interpreting diverse activities related to use cases of exoskeletons by studying the associated Tweets [157] and mapping these activities on pleasure and arousal dimensions using cognitive computing principles.
- RQ77.
- Studying Tweets posted by users of exoskeletons to monitor their mental health [158].
- RQ78.
- Detecting deception (both positive and negative deception) from Tweets [159] about the use of exoskeletons by specific user groups.
- RQ79.
- Tracking happiness associated with exoskeleton usage in different cities [160] based on studying Tweets related to exoskeletons originating from these cities.
- RQ80.
- Identification of hate speech and abusive language in Tweets [161] made by unsatisfied customers of exoskeletons.
- RQ81.
- Developing an approach as per [162] to filter out relevant Tweets comprising of latest breaking news in the context of exoskeletons.
- RQ82.
- Extracting information from Tweets related to exoskeletons to interpret the multimodal forms of purchase intentions [163] in potential users.
- RQ83.
- Inferring shared interests [164] related to exoskeletons based on studying the Tweets of both current and potential users of exoskeletons.
- RQ84.
- Modeling public mood in different geographic regions [165] towards new advances in exoskeletons based on semantic analysis of the Tweets originating from these respective regions.
- RQ85.
- Implementation of the Categorical Topic Model [166] for extracting categorical topics and emerging issues about exoskeletons from Tweets.
- RQ86.
- Using classification approaches to deduce inundation levels [167] in the context of use case scenarios of different exoskeletons by different user groups.
- RQ87.
- Studying the Tweets to interpret bias and degrees of the same [168] towards using exoskeletons by potential user groups.
- RQ88.
- Detection, classification, and ranking of trending topics [169] related to conversations about exoskeletons on Twitter.
- RQ89.
- RQ90.
- Performing user characterization [172] from the tweeting patterns of any potential user to develop user personas for personalization of exoskeletons.
- RQ91.
- Studying Tweets posted by users of exoskeletons to detect and analyze their feedback and suggestions [173] for possible improvements in different types of exoskeletons.
- RQ92.
- Application of the Similarity Learning Algorithm (SiLA) as proposed in [174] to identify popular Tweets related to current and emerging exoskeletons and their use cases.
- RQ93.
- Implementation of the approach proposed in [175] to study patterns of Tweets related to exoskeletons to detect Tweets that represent “extreme behavior” on social media.
- RQ94.
- Classifying Tweets about specific use cases of exoskeletons as “alarming” and “reassuring” [176] to investigate the views of different user groups.
- RQ95.
- Performing semantic analysis of Tweets posted by new users of exoskeletons to detect instances of euphoria or delusion [177] in the context of the use cases mentioned in the underlining Tweets.
- RQ96.
- Detecting obesity from Tweets [178] posted by users of exoskeletons and investigating any potential correlations between obesity and exoskeleton usage.
- RQ97.
- Classifying potential user groups of exoskeletons into communities [179] based on studying their needs expressed in their Tweets for the development of specific exoskeletons to meet these community-based needs.
- RQ98.
- Estimating demographic information of exoskeleton users from their Tweets [180] to interpret any variation of use cases based on user diversity.
- RQ99.
- Studying the Tweets to deduce the perceptions [181] of exoskeleton users about different exoskeleton companies to interpret their buying behavior.
- RQ100.
- Analyzing the Tweets to track misinformation and trends in the same [182] about upcoming or existing exoskeletons.
4.2. Methodology as a Starting Point for Investigating Some of the RQs
- Hydrate all the Tweet IDs and merge the results into a single .csv file. Import this .csv file as a “New Dataset” into RapidMiner Studio.
- Develop a “Bag of Words” model in RapidMiner Studio, which would act as a collection of keywords and/or phrases of interest related to exoskeletons. These could include names of exoskeleton companies, specific exoskeleton products, influential people who are users of exoskeletons, politicians, celebrities, or other popular personalities sharing positive or negative opinions about exoskeletons.
- Use the “Select Attribute” operator to select the text of the Tweets from the dataset. Identify and remove stop words from the Tweet texts.
- Use the “Data Filters” in RapidMiner to filter out unwanted attributes from the dataset to have only the Tweets and other essential information needed for this study. Here unwanted attributes refer to non-essential details associated with each of the Tweets, such as the default profile image of the users, description mentioned on each user’s profile, follower count of the users, location information of all the users, screenname of all the users, and verified status of all the user accounts, which would be obtained upon hydration using the Hydrator app.
- Using the “Read Document” operator, set up a path to a file on the local system that contains a set of keywords or phrases that would be used for checking similarity and/or occurrence (Step 2). It is recommended that this file is a .txt file.
- Implement the Levenshtein distance algorithm [186] using the “Fuzzy matching” operator.
- Provide the output from the operator in Step 4 as the source and the “Read Document” operator (Step 5) as the grounds for comparison to generate similarity scores based on the string-comparison of each Tweet.
- Enable the advanced parameters of the “Fuzzy matching” operator to define the threshold value. This can be any user-defined value, and only those Tweets that have a similarity (indicating occurrence) greater than the threshold would be retained in the results.
- Integrate all the above operators and develop a RapidMiner process and set up a Twitter connection in RapidMiner Studio.
- Run the process to compute the results after defining specific parameters in Step 2 and Step 8.
5. Conclusions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Olar, M.-L.; Leba, M.; Risteiu, M. Exoskeleton—Wearable Devices. Literature Review. MATEC Web Conf. 2021, 342, 5005. [Google Scholar] [CrossRef]
- Yang, C.-J.; Zhang, J.-F.; Chen, Y.; Dong, Y.-M.; Zhang, Y. A Review of Exoskeleton-Type Systems and Their Key Technologies. Proc. Inst. Mech. Eng. Part C 2008, 222, 1599–1612. [Google Scholar] [CrossRef]
- Palazzi, E.; Luzi, L.; Dimo, E.; Calanca, A. An Affordable Upper-Limb Exoskeleton Concept for Rehabilitation Applications. Technologies 2022, 10, 22. [Google Scholar] [CrossRef]
- Laubscher, C.A.; Goo, A.; Farris, R.J.; Sawicki, J.T. Hybrid Impedance-Sliding Mode Switching Control of the Indego Explorer Lower-Limb Exoskeleton in Able-Bodied Walking. J. Intell. Robot. Syst. 2022, 104, 76. [Google Scholar] [CrossRef]
- Sarkisian, S.V.; Ishmael, M.K.; Lenzi, T. Self-Aligning Mechanism Improves Comfort and Performance with a Powered Knee Exoskeleton. IEEE Trans. Neural Syst. Rehabil. Eng. 2021, 29, 629–640. [Google Scholar] [CrossRef] [PubMed]
- van der Have, A.; Rossini, M.; Rodriguez-Guerrero, C.; Van Rossom, S.; Jonkers, I. The Exo4Work Shoulder Exoskeleton Effectively Reduces Muscle and Joint Loading during Simulated Occupational Tasks above Shoulder Height. Appl. Ergon. 2022, 103, 103800. [Google Scholar] [CrossRef] [PubMed]
- Zahedi, A.; Wang, Y.; Martinez-Hernandez, U.; Zhang, D. A Wearable Elbow Exoskeleton for Tremor Suppression Equipped with Rotational Semi-Active Actuator. Mech. Syst. Signal Process. 2021, 157, 107674. [Google Scholar] [CrossRef]
- Peng, X.; Acosta-Sojo, Y.; Wu, M.I.; Stirling, L. Actuation Timing Perception of a Powered Ankle Exoskeleton and Its Associated Ankle Angle Changes during Walking. IEEE Trans. Neural Syst. Rehabil. Eng. 2022, 30, 869–877. [Google Scholar] [CrossRef] [PubMed]
- Liu, H.; Zeng, B.; Liu, X.; Zhu, X.; Song, H. Detection of Human Lifting State Based on Long Short-Term Memory for Wearable Waist Exoskeleton. In Lecture Notes in Electrical Engineering; Springer Singapore: Singapore, 2022; pp. 301–310. [Google Scholar]
- Ishmael, M.K.; Archangeli, D.; Lenzi, T. A Powered Hip Exoskeleton with High Torque Density for Walking, Running, and Stair Ascent. IEEE ASME Trans. Mechatron. 2022, 1–12. [Google Scholar] [CrossRef]
- Garosi, E.; Mazloumi, A.; Jafari, A.H.; Keihani, A.; Shamsipour, M.; Kordi, R.; Kazemi, Z. Design and Ergonomic Assessment of a Passive Head/Neck Supporting Exoskeleton for Overhead Work Use. Appl. Ergon. 2022, 101, 103699. [Google Scholar] [CrossRef] [PubMed]
- Song, J.; Zhu, A.; Tu, Y.; Zou, J. Multijoint Passive Elastic Spine Exoskeleton for Stoop Lifting Assistance. Int. J. Adv. Robot. Syst. 2021, 18, 172988142110620. [Google Scholar] [CrossRef]
- Dragusanu, M.; Iqbal, M.Z.; Baldi, T.L.; Prattichizzo, D.; Malvezzi, M. Design, Development, and Control of a Hand/Wrist Exoskeleton for Rehabilitation and Training. IEEE Trans. Robot. 2022, 38, 1472–1488. [Google Scholar] [CrossRef]
- Athira; Oommen, R.M. Advancements in Robotic Exoskeleton. In Proceedings of the 2018 International Conference on Circuits and Systems in Digital Enterprise Technology (ICCSDET), Kottayam, India, 21–22 December 2018; pp. 1–3. [Google Scholar]
- Li, G.; Cheng, L.; Sun, N. Design, Manipulability Analysis and Optimization of an Index Finger Exoskeleton for Stroke Rehabilitation. Mech. Mach. Theory 2022, 167, 104526. [Google Scholar] [CrossRef]
- Guntara, A.; Rahyussalim, A.J. The Uses of Lower Limb Exoskeleton, Functional Electrical Stimulation, and Future Improvements for Leg Paralysis Management—A Systematic Review. In Proceedings of the 5th International Symposium of Biomedical Engineering (ISBE) 2020, Depok, Indonesia, 28–29 July 2021. [Google Scholar]
- Thamsuwan, O.; Milosavljevic, S.; Srinivasan, D.; Trask, C. Potential Exoskeleton Uses for Reducing Low Back Muscular Activity during Farm Tasks. Am. J. Ind. Med. 2020, 63, 1017–1028. [Google Scholar] [CrossRef] [PubMed]
- Kumar, V.; Hote, Y.V.; Jain, S. Review of Exoskeleton: History, Design and Control. In Proceedings of the 2019 3rd International Conference on Recent Developments in Control, Automation & Power Engineering (RDCAPE), Noida, India, 10–11 October 2019; pp. 677–682. [Google Scholar]
- Coren, M.J. Robot Exoskeletons are Finally Here, and They’re Nothing Like the Suits from Iron Man. Available online: https://qz.com/971741/robot-exoskeletons-are-finally-here-and-theyre-nothing-like-the-suits-from-iron-man/ (accessed on 26 May 2022).
- Global Market Insights; Inc Exoskeleton Market Worth $3.4bn by 2024: Global Market Insights, Inc. Available online: https://www.globenewswire.com/en/news-release/2017/08/30/1104254/0/en/Exoskeleton-Market-worth-3-4bn-by-2024-Global-Market-Insights-Inc.html (accessed on 26 May 2022).
- da Costa, V.C.F.; Oliveira, L.; de Souza, J. Internet of Everything (IoE) Taxonomies: A Survey and a Novel Knowledge-Based Taxonomy. Sensors 2021, 21, 568. [Google Scholar] [CrossRef] [PubMed]
- Radetić-Paić, M.; Boljunčić, V. The Causes of I.C.T. Use Which Increase Time Spent on the Internet by Secondary School Students and Affect Exposure to Bullying from Other Students. Econ. Res. 2021, 35, 2859–2867. [Google Scholar] [CrossRef]
- Pan, Y.-C.; Chiu, Y.-C.; Lin, Y.-H. Systematic Review and Meta-Analysis of Epidemiology of Internet Addiction. Neurosci. Biobehav. Rev. 2020, 118, 612–622. [Google Scholar] [CrossRef] [PubMed]
- Boulianne, S. Social Media Use and Participation: A Meta-Analysis of Current Research. Inf. Commun. Soc. 2015, 18, 524–538. [Google Scholar] [CrossRef]
- Gruzd, A.; Haythornthwaite, C. Enabling Community through Social Media. J. Med. Internet Res. 2013, 15, e248. [Google Scholar] [CrossRef] [PubMed]
- Shepherd, A.; Sanders, C.; Doyle, M.; Shaw, J. Using Social Media for Support and Feedback by Mental Health Service Users: Thematic Analysis of a Twitter Conversation. BMC Psychiatry 2015, 15, 29. [Google Scholar] [CrossRef] [Green Version]
- Kavada, A. Social Media as Conversation: A Manifesto. Soc. Media Soc. 2015, 1, 205630511558079. [Google Scholar] [CrossRef]
- Goldberg, S.C. The Promise and Pitfalls of Online’ Conversations’. Roy Inst. Philos. Suppl. 2021, 89, 177–193. [Google Scholar] [CrossRef]
- Ramnarain, Y.; Govender, K. Social Media Browsing and Consumer Behaviour: Exploring the Youth Market. Afr. J. Bus. Manag. 2013, 7, 1885–1893. [Google Scholar] [CrossRef]
- Awan, M.J.; Rahim, M.S.M.; Nobanee, H.; Munawar, A.; Yasin, A.; Azlanmz, A.M.Z. Social Media and Stock Market Prediction: A Big Data Approach. Comput. Mater. Contin. 2021, 67, 2569–2583. [Google Scholar] [CrossRef]
- Pezzuti, T.; Leonhardt, J.M.; Warren, C. Certainty in Language Increases Consumer Engagement on Social Media. J. Interact. Mark. 2021, 53, 32–46. [Google Scholar] [CrossRef]
- Wang, L.; Lee, J.H. The Impact of K-Beauty Social Media Influencers, Sponsorship, and Product Exposure on Consumer Acceptance of New Products. Fash. Text. 2021, 8, 15. [Google Scholar] [CrossRef]
- Varghese, M.S.; Agrawal, M.M. Impact of Social Media on Consumer Buying Behavior. Saudi J. Bus. Manag. Stud. 2021, 6, 51–55. [Google Scholar] [CrossRef]
- Majeed, M.; Asare, C.; Fatawu, A.; Abubakari, A. An Analysis of the Effects of Customer Satisfaction and Engagement on Social Media on Repurchase Intention in the Hospitality Industry. Cogent Bus. Manag. 2022, 9, 2028331. [Google Scholar] [CrossRef]
- Liu, Y.; Singh, L.; Mneimneh, Z. A Comparative Analysis of Classic and Deep Learning Models for Inferring Gender and Age of Twitter Users. In Proceedings of the 2nd International Conference on Deep Learning Theory and Applications, Virtual Event, 7–9 July 2021; SCITEPRESS—Science and Technology Publications: Setubal, Portugal, 2021. [Google Scholar]
- Gruzd, A.; Wellman, B.; Takhteyev, Y. Imagining Twitter as an Imagined Community. Am. Behav. Sci. 2011, 55, 1294–1318. [Google Scholar] [CrossRef]
- Aslam, S. Twitter by the Numbers (2022): Stats, Demographics & Fun Facts. 2022. Available online: https://www.omnicoreagency.com/ (accessed on 27 May 2022).
- Dooms, S.; De Pessemier, T.; Martens, L. MovieTweetings: A Movie Rating Dataset Collected from Twitter. In Proceedings of the Workshop on Crowdsourcing and Human Computation for Recommender Systems (CrowdRec 2013), Held in Conjunction with the 7th A.C.M. Conference on Recommender Systems (RecSys 2013), Hong Kong, 12–16 October 2013. [Google Scholar]
- Banda, J.M.; Tekumalla, R.; Wang, G.; Yu, J.; Liu, T.; Ding, Y.; Chowell, G. A Large-Scale COVID-19 Twitter Chatter Dataset for Open Scientific Research—An International Collaboration. Epidemiologia 2020, 2, 315–324. [Google Scholar] [CrossRef]
- Chen, E.; Deb, A.; Ferrara, E. #Election2020: The First Public Twitter Dataset on the 2020 U.S. Presidential Election. J. Comput. Soc. Sci. 2022, 5, 1–18. [Google Scholar] [CrossRef]
- Wijesiriwardene, T.; Inan, H.; Kursuncu, U.; Gaur, M.; Shalin, V.L.; Thirunarayan, K.; Sheth, A.; Arpinar, I.B. ALONE: A Dataset for Toxic Behavior among Adolescents on Twitter. In Lecture Notes in Computer Science; Springer International Publishing: Cham, Switzerland, 2020; pp. 427–439. ISBN 9783030609740. [Google Scholar]
- Zangerle, E.; Pichl, M.; Gassler, W.; Specht, G. #nowplaying Music Dataset: Extracting Listening Behavior from Twitter. In Proceedings of the First International Workshop on Internet-Scale Multimedia Management—WISMM ’14, Orlando, FL, USA, 7 November 2014. [Google Scholar]
- Meng, L.; Dong, Z.S. Natural Hazards Twitter Dataset. arXiv 2020, arXiv:2004.14456. [Google Scholar] [CrossRef]
- Salem, M.S.; Ismail, S.S.; Aref, M. Personality Traits for Egyptian Twitter Users Dataset. In Proceedings of the 2019 8th International Conference on Software and Information Engineering, Cairo, Egypt, 9–12 April 2019. [Google Scholar]
- Sech, J.; DeLucia, A.; Buczak, A.L.; Dredze, M. Civil Unrest on Twitter (CUT): A Dataset of Tweets to Support Research on Civil Unrest. In Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020), Online, 19 November 2020; Association for Computational Linguistics: Stroudsburg, PA, USA, 2020; pp. 215–221. [Google Scholar]
- Tekumalla, R.; Banda, J.M. A Large-Scale Twitter Dataset for Drug Safety Applications Mined from Publicly Existing Resources. arXiv 2020, arXiv:2003.13900. [Google Scholar] [CrossRef]
- Effrosynidis, D.; Karasakalidis, A.I.; Sylaios, G.; Arampatzis, A. The Climate Change Twitter Dataset. Expert Syst. Appl. 2022, 204, 117541. [Google Scholar] [CrossRef]
- Febriana, T.; Budiarto, A. Twitter Dataset for Hate Speech and Cyberbullying Detection in Indonesian Language. In Proceedings of the 2019 International Conference on Information Management and Technology (ICIMTech), Denpasar, Indonesia, 19–20 August 2019; Volume 1, pp. 379–382. [Google Scholar]
- Urchs, S.; Wendlinger, L.; Mitrovic, J.; Granitzer, M. MMoveT15: A Twitter Dataset for Extracting and Analysing Migration-Movement Data of the European Migration Crisis 2015. In Proceedings of the 2019 IEEE 28th International Conference on Enabling Technologies: Infrastructure for Collaborative Enterprises (WETICE), Napoli, Italy, 12–14 June 2019; pp. 146–149. [Google Scholar]
- Schroeder, D.; Schaal, F.; Filkukova, P.; Pogorelov, K.; Langguth, J. WICO Graph: A Labeled Dataset of Twitter Subgraphs Based on Conspiracy Theory and 5G-Corona Misinformation Tweets. In Proceedings of the 13th International Conference on Agents and Artificial Intelligence, Virtual Event, 7–9 July 2021; SCITEPRESS—Science and Technology Publications: Setubal, Portugal, 2021. [Google Scholar]
- Stemmer, M.; Parmet, Y.; Ravid, G. What Are IBD Patients Talking about on Twitter? In I.C.T. for Health, Accessibility and Wellbeing; Springer International Publishing: Cham, Switzerland, 2021; pp. 206–220. ISBN 9783030942083. [Google Scholar]
- Warren, E. Strengthening Research through Data Sharing. N. Engl. J. Med. 2016, 375, 401–403. [Google Scholar] [CrossRef]
- Fecher, B.; Friesike, S.; Hebing, M. What Drives Academic Data Sharing? PLoS ONE 2015, 10, e0118053. [Google Scholar] [CrossRef]
- Logan, J.A.R.; Hart, S.A.; Schatschneider, C. Data Sharing in Education Science. AERA Open 2021, 7, 233285842110064. [Google Scholar] [CrossRef]
- Privacy Policy. Available online: https://twitter.com/en/privacy/previous/version_15 (accessed on 27 May 2022).
- Developer Agreement and Policy. Available online: https://developer.twitter.com/en/developer-terms/agreement-and-policy (accessed on 27 May 2022).
- RapidMiner GmbH Search Twitter—RapidMiner Documentation. Available online: https://docs.rapidminer.com/latest/studio/operators/data_access/applications/twitter/search_twitter.html (accessed on 27 May 2022).
- Mierswa, I.; Wurst, M.; Klinkenberg, R.; Scholz, M.; Euler, T. YALE: Rapid Prototyping for Complex Data Mining Tasks. In Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining—KDD ’06, Philadelphia, PA, USA, 20–23 August 2006. [Google Scholar]
- Rate Limits: Standard v1.1. Available online: https://developer.twitter.com/en/docs/twitter-api/v1/rate-limits (accessed on 27 May 2022).
- Using Standard Search. Available online: https://developer.twitter.com/en/docs/twitter-api/v1/tweets/search/guides/standard-operators (accessed on 27 May 2022).
- Wilkinson, M.D.; Dumontier, M.; Aalbersberg, I.J.; Appleton, G.; Axton, M.; Baak, A.; Blomberg, N.; Boiten, J.-W.; da Silva Santos, L.B.; Bourne, P.E.; et al. The FAIR Guiding Principles for Scientific Data Management and Stewardship. Sci. Data 2016, 3, 160018. [Google Scholar] [CrossRef]
- Lamsal, R. Hydrating Tweet I.D.s. Available online: https://theneuralblog.com/hydrating-tweet-ids/ (accessed on 27 May 2022).
- Bramus. Accessing a Tweet Using Only Its ID (and Without the Twitter API). Available online: https://www.bram.us/2017/11/22/accessing-a-tweet-using-only-its-id-and-without-the-twitter-api/ (accessed on 27 May 2022).
- Hydrator. Available online: https://github.com/DocNow/hydrator (accessed on 27 May 2022).
- Tekumalla, R.; Banda, J.M. Social Media Mining Toolkit (SMMT). Genom. Inform. 2020, 18, e16. [Google Scholar] [CrossRef]
- Twarc. Available online: https://github.com/docnow/twarc (accessed on 27 May 2022).
- Hydrator Versions. Available online: https://github.com/docnow/hydrator/releases (accessed on 27 May 2022).
- ISO 639. Available online: https://www.iso.org/iso-639-language-codes.html (accessed on 10 July 2022).
- Carvalho, J.; Plastino, A. On the Evaluation and Combination of State-of-the-Art Features in Twitter Sentiment Analysis. Artif. Intell. Rev. 2021, 54, 1887–1936. [Google Scholar] [CrossRef]
- Gu, S.; Wang, F.; Patel, N.P.; Bourgeois, J.A.; Huang, J.H. A Model for Basic Emotions Using Observations of Behavior in Drosophila. Front. Psychol. 2019, 10, 781. [Google Scholar] [CrossRef]
- Do, H.H.; Prasad, P.W.C.; Maag, A.; Alsadoon, A. Deep Learning for Aspect-Based Sentiment Analysis: A Comparative Review. Expert Syst. Appl. 2019, 118, 272–299. [Google Scholar] [CrossRef]
- Asur, S.; Huberman, B.A.; Szabo, G.; Wang, C. Trends in Social Media: Persistence and Decay. SSRN Electron. J. 2011, 5, 434–437. [Google Scholar] [CrossRef]
- Fouad, M.M.; Mahany, A.; Aljohani, N.; Abbasi, R.A.; Hassan, S.-U. ArWordVec: Efficient Word Embedding Models for Arabic Tweets. Soft Comput. 2020, 24, 8061–8068. [Google Scholar] [CrossRef]
- Chen, G.M. Tweet This: A Uses and Gratifications Perspective on How Active Twitter Use Gratifies a Need to Connect with Others. Comput. Hum. Behav. 2011, 27, 755–762. [Google Scholar] [CrossRef]
- Hong, L.; Dan, O.; Davison, B.D. Predicting Popular Messages in Twitter. In Proceedings of the 20th International Conference Companion on World Wide Web—W.W.W., Hyderabad, India, 28 March–1 April 2011. [Google Scholar]
- Rajadesingan, A.; Zafarani, R.; Liu, H. Sarcasm Detection on Twitter: A Behavioral Modeling Approach. In Proceedings of the Eighth A.C.M. International Conference on Web Search and Data Mining—WSDM ’15, Shanghai, China, 2–6 February 2015. [Google Scholar]
- Wang, X.; Wei, F.; Liu, X.; Zhou, M.; Zhang, M. Topic Sentiment Analysis in Twitter: A Graph-Based Hashtag Sentiment Classification Approach. In Proceedings of the 20th A.C.M. International Conference on Information and Knowledge Management—CIKM ’11, Glasgow, UK, 24–28 October 2011. [Google Scholar]
- Li, J.; Galley, M.; Brockett, C.; Spithourakis, G.P.; Gao, J.; Dolan, B. A Persona-Based Neural Conversation Model. arXiv 2016, arXiv:1603.06155. [Google Scholar] [CrossRef]
- Aiello, L.M.; Petkos, G.; Martin, C.; Corney, D.; Papadopoulos, S.; Skraba, R.; Goker, A.; Kompatsiaris, I.; Jaimes, A. Sensing Trending Topics in Twitter. IEEE Trans. Multimed. 2013, 15, 1268–1282. [Google Scholar] [CrossRef]
- Lee, K.; Palsetia, D.; Narayanan, R.; Patwary, M.M.A.; Agrawal, A.; Choudhary, A. Twitter Trending Topic Classification. In Proceedings of the 2011 IEEE 11th International Conference on Data Mining Workshops, Vancouver, BC, Canada, 11 December 2011; pp. 251–258. [Google Scholar]
- Dijkman, R.; Ipeirotis, P.; Aertsen, F.; van Helden, R. Using Twitter to Predict Sales: A Case Study. arXiv 2015, arXiv:1503.04599. [Google Scholar] [CrossRef]
- Alvarez-Melis, D.; Saveski, M. Topic Modeling in Twitter: Aggregating Tweets by Conversations. In Proceedings of the Tenth International AAAI Conference on Web and Social Media, Phoenix, AZ, USA, 12–13 February 2016. [Google Scholar]
- Boyd, D.; Golder, S.; Lotan, G. Tweet, Tweet, Retweet: Conversational Aspects of Retweeting on Twitter. In Proceedings of the 2010 43rd Hawaii International Conference on System Sciences, Honolulu, HI, USA, 5–8 January 2010; pp. 1–10. [Google Scholar]
- Marwick, A.E.; Boyd, D. I Tweet Honestly, I Tweet Passionately: Twitter Users, Context Collapse, and the Imagined Audience. New Media Soc. 2011, 13, 114–133. [Google Scholar] [CrossRef]
- Li, W.; Serdyukov, P.; de Vries, A.P.; Eickhoff, C.; Larson, M. The Where in the Tweet. In Proceedings of the 20th A.C.M. International Conference on Information and Knowledge Management–CIKM ’11, Glasgow, UK, 24–28 October 2011. [Google Scholar]
- Chen, K.; Chen, T.; Zheng, G.; Jin, O.; Yao, E.; Yu, Y. Collaborative Personalized Tweet Recommendation. In Proceedings of the 35th international ACM SIGIR Conference on Research and Development in Information Retrieval—SIGIR ’12, Portland, OR, USA, 12–16 August 2012. [Google Scholar]
- Ray, S. A Quick Review of Machine Learning Algorithms. In Proceedings of the 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), Faridabad, India, 14–16 February 2019; pp. 35–39. [Google Scholar]
- da Silva, N.F.F.; Hruschka, E.R.; Hruschka, E.R., Jr. Tweet Sentiment Analysis with Classifier Ensembles. Decis. Support Syst. 2014, 66, 170–179. [Google Scholar] [CrossRef]
- Kreis, R. The “Tweet Politics” of President Trump. J. Lang. Politics 2017, 16, 607–618. [Google Scholar] [CrossRef]
- Myslín, M.; Zhu, S.-H.; Chapman, W.; Conway, M. Using Twitter to Examine Smoking Behavior and Perceptions of Emerging Tobacco Products. J. Med. Internet Res. 2013, 15, e174. [Google Scholar] [CrossRef]
- Wigley, S.; Lewis, B.K. Rules of Engagement: Practice What You Tweet. Public Relat. Rev. 2012, 38, 165–167. [Google Scholar] [CrossRef]
- Liu, I.L.B.; Cheung, C.M.K.; Lee, M.K.O. Understanding Twitter Usage: What Drive People Continue to Tweet. In Proceedings of the Pacific Asia Conference on Information Systems, PACIS 2010, Taipei, Taiwan, 9–12 July 2010. [Google Scholar]
- Cheng, Z.; Caverlee, J.; Lee, K. You Are Where You Tweet: A Content-Based Approach to Geo-Locating Twitter Users. In Proceedings of the 19th A.C.M. international conference on Information and knowledge management—CIKM ’10, Toronto, ON, Canada, 26–30 October 2010. [Google Scholar]
- Uysal, I.; Croft, W.B. User Oriented Tweet Ranking: A Filtering Approach to Microblogs. In Proceedings of the 20th A.C.M. International Conference on Information and Knowledge Management—CIKM ’11, Glasgow, UK, 24–28 October 2011. [Google Scholar]
- Andrews, C.A.; Fichet, E.S.; Ding, Y.; Spiro, E.S.; Starbird, K. Keeping up with the Tweet-Dashians: The Impact of `official- Accounts on Online Rumoring. In Proceedings of the 19th A.C.M. Conference on Computer-Supported Cooperative Work & Social Computing—CSCW ’16, San Francisco, CA, USA, 27 February–2 March 2016. [Google Scholar]
- Pujazon-Zazik, M.; Park, M.J. To Tweet, or Not to Tweet: Gender Differences and Potential Positive and Negative Health Outcomes of Adolescents’ Social Internet Use. Am. J. Mens. Health 2010, 4, 77–85. [Google Scholar] [CrossRef] [PubMed]
- Merler, M.; Cao, L.; Smith, J.R. You Are What You Tweet…pic! Gender Prediction Based on Semantic Analysis of Social Media Images. In Proceedings of the 2015 IEEE International Conference on Multimedia and Expo (ICME), Turin, Italy, 29 June–3 July 2015; pp. 1–6. [Google Scholar]
- André, P.; Bernstein, M.; Luther, K. Who Gives a Tweet? Evaluating Microblog Content Value. In Proceedings of the A.C.M. 2012 Conference on Computer Supported Cooperative Work–CSCW ’12, Seattle, WA, USA, 11–15 February 2012. [Google Scholar]
- Tao, K.; Abel, F.; Hauff, C.; Houben, G.-J. What Makes a Tweet Relevant for a Topic? Available online: https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.309.8507&rep=rep1&type=pdf (accessed on 27 May 2022).
- Kupavskii, A.; Umnov, A.; Gusev, G.; Serdyukov, P. Predicting the Audience Size of a Tweet. ICWSM 2013, 7, 693–696. [Google Scholar]
- Han, S.; Min, J.; Lee, H. Antecedents of Social Presence and Gratification of Social Connection Needs in S.N.S.: A Study of Twitter Users and Their Mobile and Non-Mobile Usage. Int. J. Inf. Manag. 2015, 35, 459–471. [Google Scholar] [CrossRef]
- Armstrong, C.L.; Gao, F. Now Tweet This: How News Organizations Use Twitter. Electron. News 2010, 4, 218–235. [Google Scholar] [CrossRef]
- Hu, T.; Xiao, H.; Nguyen, T.-V.T.; Luo, J. What the Language You Tweet Says about Your Occupation. arXiv 2017, arXiv:1701.06233. [Google Scholar] [CrossRef]
- Shou, L.; Wang, Z.; Chen, K.; Chen, G. Sumblr: Continuous Summarization of Evolving Tweet Streams. In Proceedings of the 36th International ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, 28 July–1 August 2013. [Google Scholar]
- Yamaguchi, Y.; Takahashi, T.; Amagasa, T.; Kitagawa, H. TURank: Twitter User Ranking Based on User-Tweet Graph Analysis. In Web Information Systems Engineering–WISE 2010; Springer: Berlin/Heidelberg, Germany, 2010; pp. 240–253. ISBN 9783642176159. [Google Scholar]
- Guo, S.; Chang, M.-W.; Kıcıman, E. To Link or Not to Link? A Study on End-to-End Tweet Entity Linking. Available online: https://aclanthology.org/N13-1122.pdf (accessed on 27 May 2022).
- Himelboim, I.; McCreery, S.; Smith, M. Birds of a Feather Tweet Together: Integrating Network and Content Analyses to Examine Cross-Ideology Exposure on Twitter. J. Comput. Mediat. Commun. 2013, 18, 40–60. [Google Scholar] [CrossRef]
- Bruns, A. How Long is a Tweet? Mapping Dynamic Conversation Networks Ontwitterusing Gawk and Gephi. Inf. Commun. Soc. 2012, 15, 1323–1351. [Google Scholar] [CrossRef]
- Li, C.; Sun, A.; Weng, J.; He, Q. Tweet Segmentation and Its Application to Named Entity Recognition. IEEE Trans. Knowl. Data Eng. 2015, 27, 558–570. [Google Scholar] [CrossRef]
- Parganas, P.; Anagnostopoulos, C.; Chadwick, S. ’ You’Ll Never Tweet Alone’: Managing Sports Brands through Social Media. J. Brand Manag. 2015, 22, 551–568. [Google Scholar] [CrossRef]
- Ito, J.; Song, J.; Toda, H.; Koike, Y.; Oyama, S. Assessment of Tweet Credibility with LDA Features. In Proceedings of the 24th International Conference on World Wide Web–W.W.W. ‘15 Companion, Florence, Italy, 18–22 May 2015. [Google Scholar]
- Stephens, M. A Geospatial Infodemic: Mapping Twitter Conspiracy Theories of COVID-19. Dialogues Hum. Geogr. 2020, 10, 276–281. [Google Scholar] [CrossRef]
- Fong, A.; Roozenbeek, J.; Goldwert, D.; Rathje, S.; van der Linden, S. The Language of Conspiracy: A Psychological Analysis of Speech Used by Conspiracy Theorists and Their Followers on Twitter. Group Process. Intergroup Relat. 2021, 24, 606–623. [Google Scholar] [CrossRef]
- Vosoughi, S.; Vijayaraghavan, P.; Roy, D. Tweet2Vec: Learning Tweet Embeddings Using Character-Level CNN-LSTM Encoder-Decoder. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, Pisa, Italy, 17–21 July 2016. [Google Scholar]
- Zhao, Q.; Erdogdu, M.A.; He, H.Y.; Rajaraman, A.; Leskovec, J. SEISMIC: A Self-Exciting Point Process Model for Predicting Tweet Popularity. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining—KDD ’15, Sydney, Australia, 10–13 August 2015. [Google Scholar]
- van Liere, D. How Far Does a Tweet Travel? Information Brokers in the Twitterverse. In Proceedings of the International Workshop on Modeling Social Media—M.S.M. ’10, Toronto, ON, Canada, 13 June 2010. [Google Scholar]
- Huang, H.; Cao, Y.; Huang, X.; Ji, H.; Lin, C.-Y. Collective Tweet Wikification Based on Semi-Supervised Graph Regularization. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Baltimore, MD, USA, 22–27 June 2014; Association for Computational Linguistics: Stroudsburg, PA, USA, 2014. [Google Scholar]
- Alom, Z.; Carminati, B.; Ferrari, E. Detecting Spam Accounts on Twitter. In Proceedings of the 2018 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), Barcelona, Spain, 28–31 August 2018; pp. 1191–1198. [Google Scholar]
- Wang, B.; Zubiaga, A.; Liakata, M.; Procter, R. Making the Most of Tweet-Inherent Features for Social Spam Detection on Twitter. arXiv 2015, arXiv:1503.07405. [Google Scholar] [CrossRef]
- Purwarianti, A.; Andhika, A.; Wicaksono, A.F.; Afif, I.; Ferdian, F. InaNLP: Indonesia Natural Language Processing Toolkit, Case Study: Complaint Tweet Classification. In Proceedings of the 2016 International Conference On Advanced Informatics: Concepts, Theory And Application (ICAICTA), Penang, Malaysia, 16–19 August 2016; pp. 1–5. [Google Scholar]
- Pant, D.R.; Neupane, P.; Poudel, A.; Pokhrel, A.K.; Lama, B.K. Recurrent Neural Network Based Bitcoin Price Prediction by Twitter Sentiment Analysis. In Proceedings of the 2018 IEEE 3rd International Conference on Computing, Communication and Security (ICCCS), Kathmandu, Nepal, 25–27 October 2018; pp. 128–132. [Google Scholar]
- Jain, A.; Tripathi, S.; Dwivedi, H.D.; Saxena, P. Forecasting Price of Cryptocurrencies Using Tweets Sentiment Analysis. In Proceedings of the 2018 Eleventh International Conference on Contemporary Computing (IC3), Noida, India, 2–4 August 2018; pp. 1–7. [Google Scholar]
- Wu, C.; Wu, F.; Wu, S.; Huang, Y.; Xie, X. Tweet Emoji Prediction Using Hierarchical Model with Attention. In Proceedings of the 2018 A.C.M. International Joint Conference and 2018 International Symposium on Pervasive and Ubiquitous Computing and Wearable Computers, Singapore, 8–12 October 2018. [Google Scholar]
- Tomihira, T.; Otsuka, A.; Yamashita, A.; Satoh, T. What Does Your Tweet Emotion Mean? Neural Emoji Prediction for Sentiment Analysis. In Proceedings of the 20th International Conference on Information Integration and Web-based Applications & Services—iiWAS2018, Yogyakarta, Indonesia, 19–21 November 2018. [Google Scholar]
- Bansal, B.; Srivastava, S. Lexicon-Based Twitter Sentiment Analysis for Vote Share Prediction Using Emoji and N-Gram Features. Int. J. Web Based Communities 2019, 15, 85. [Google Scholar] [CrossRef]
- Singh, A.; Blanco, E.; Jin, W. Incorporating Emoji Descriptions Improves Tweet Classification. In Proceedings of the 2019 Conference of the North, Minneapolis, MN, USA, 2–7 June 2019; pp. 2096–2101. [Google Scholar]
- Suman, C.; Saha, S.; Bhattacharyya, P.; Chaudhari, R.S. Emoji Helps! A Multi-Modal Siamese Architecture for Tweet User Verification. Cognit. Comput. 2021, 13, 261–276. [Google Scholar] [CrossRef]
- Reyes, A.; Rosso, P.; Veale, T. A Multidimensional Approach for Detecting Irony in Twitter. Lang. Resour. Eval. 2013, 47, 239–268. [Google Scholar] [CrossRef]
- Mendoza, M.; Poblete, B.; Castillo, C. Twitter under Crisis: Can We Trust What We RT? In Proceedings of the First Workshop on Social Media Analytics–SOMA ’10, Washington, DC, USA, 25 July 2010. [Google Scholar]
- Ersahin, B.; Aktas, O.; Kilinc, D.; Akyol, C. Twitter Fake Account Detection. In Proceedings of the 2017 International Conference on Computer Science and Engineering (UBMK), London, UK, 5–8 October 2017; pp. 388–392. [Google Scholar]
- Saez-Trumper, D. Fake Tweet Buster: A Webtool to Identify Users Promoting Fake News Ontwitter. In Proceedings of the 25th A.C.M. Conference on Hypertext and Social Media, Santiago de Chile, Chile, 1–4 September 2014. [Google Scholar]
- Tonia, T.; Van Oyen, H.; Berger, A.; Schindler, C.; Künzli, N. If I Tweet Will You Cite? The Effect of Social Media Exposure of Articles on Downloads and Citations. Int. J. Public Health 2016, 61, 513–520. [Google Scholar] [CrossRef] [PubMed]
- Huang, B.; Carley, K.M. Discover Your Social Identity from What You Tweet: A Content Based Approach. In Lecture Notes in Social Networks; Springer International Publishing: Cham, Switzerland, 2020; pp. 23–37. ISBN 9783030426989. [Google Scholar]
- McCreadie, R.; Macdonald, C. Relevance in Microblogs: Enhancing Tweet Retrieval Using Hyperlinked Documents. In Proceedings of the 10th Conference on Open Research Areas in Information Retrieval, Lisbon, Portugal, 15–17 May 2013; Le Centre De Hautes Etudes Internationales D’informatique Documentaire: Paris, France, 2013; pp. 189–196. [Google Scholar]
- Haugh, B.R.; Watkins, B. Tag Me, Tweet Me If You Want to Reach Me: An Investigation into How Sports Fans Use Social Media. Int. J. Sport Commun. 2016, 9, 278–293. [Google Scholar] [CrossRef]
- Darwish, K.; Stefanov, P.; Aupetit, M.; Nakov, P. Unsupervised User Stance Detection on Twitter. arXiv 2019, arXiv:1904.02000. [Google Scholar] [CrossRef]
- Salas-Zárate, M.D.P.; Paredes-Valverde, M.A.; Rodriguez-García, M.Á.; Valencia-García, R.; Alor-Hernández, G. Automatic Detection of Satire in Twitter: A Psycholinguistic-Based Approach. Knowl. Based Syst. 2017, 128, 20–33. [Google Scholar] [CrossRef]
- Pandya, A.; Oussalah, M.; Monachesi, P.; Kostakos, P. On the Use of Distributed Semantics of Tweet Metadata for User Age Prediction. Future Gener. Comput. Syst. 2020, 102, 437–452. [Google Scholar] [CrossRef]
- Ran, C.; Shen, W.; Wang, J. An Attention Factor Graph Model for Tweet Entity Linking. In Proceedings of the 2018 World Wide Web Conference on World Wide Web—W.W.W. ’18, Lyon, France, 23–27 April 2018. [Google Scholar]
- Davis, S.W.; Horváth, C.; Gretry, A.; Belei, N. Say What? How the Interplay of Tweet Readability and Brand Hedonism Affects Consumer Engagement. J. Bus. Res. 2019, 100, 150–164. [Google Scholar] [CrossRef]
- Al Abdullatif, A.M.; Alsoghayer, R.A.; AlMajhad, E.M. An Algorithm to Find the Best Time to Tweet. In Proceedings of the International Conference on Computer Vision and Image Analysis Applications, Sousse, Tunisia, 18–20 January 2015; pp. 1–13. [Google Scholar]
- Yuan, N.J.; Zhong, Y.; Zhang, F.; Xie, X.; Lin, C.-Y.; Rui, Y. Who Will Reply to/Retweet This Tweet? The Dynamics of Intimacy from Online Social Interactions. In Proceedings of the Ninth A.C.M. International Conference on Web Search and Data Mining, San Francisco, CA, USA, 22–25 February 2016. [Google Scholar]
- Wei, H.; Zhou, H.; Sankaranarayanan, J.; Sengupta, S.; Samet, H. Residual Convolutional LSTM for Tweet Count Prediction. In Proceedings of the Companion of the The Web Conference 2018 on The Web Conference W.W.W. ’18, Lyon, France, 23–27 April 2018. [Google Scholar]
- Lee, M.; Kim, H.; Kim, O. Why Do People Retweet a Tweet? Altruistic, Egoistic, and Reciprocity Motivations for Retweeting. Psychologia 2015, 58, 189–201. [Google Scholar] [CrossRef]
- Kim, T.-Y.; Kim, J.; Lee, J.; Lee, J.-H. A Tweet Summarization Method Based on a Keyword Graph. In Proceedings of the 8th International Conference on Ubiquitous Information Management and Communication—ICUIMC ’14, Siem Reap, Cambodia, 9–11 January 2014. [Google Scholar]
- Jeon, M.; Jun, S.; Hwang, E. Hashtag Recommendation Based on User Tweet and Hashtag Classification on Twitter. In Web-Age Information Management; Springer International Publishing: Cham, Switzerland, 2014; pp. 325–336. ISBN 9783319115375. [Google Scholar]
- Deveaud, R.; Boudin, F. Effective Tweet Contextualization with Hashtags Performance Prediction and Multi-Document Summarization. In Proceedings of the Initiative for the Evaluation of XML Retrieval (INEX), Valencia, Spain, 23–26 September 2013. [Google Scholar]
- Yan, J.L.S.; Kaziunas, E. What is a Tweet Worth? Measuring the Value of Social Media for an Academic Institution. In Proceedings of the 2012 iConference on—iConference ’12, Agadir, Morocco, 5–6 November 2012. [Google Scholar]
- Klotz, C.; Ross, A.; Clark, E.; Martell, C. Tweet!—And I Can Tell How Many Followers You Have. In Advances in Intelligent Systems and Computing; Springer International Publishing: Cham, Switzerland, 2014; pp. 245–253. ISBN 9783319065373. [Google Scholar]
- Dong, L.; Wei, F.; Duan, Y.; Liu, X.; Zhou, M.; Xu, K. The Automated Acquisition of Suggestions from Tweets. In Proceedings of the Twenty-Seventh AAAI Conference on Artificial Intelligence, Washington, DC, USA, 14–18 July 2013. [Google Scholar]
- Yuan, S.; Wu, X.; Xiang, Y. A Two Phase Deep Learning Model for Identifying Discrimination from Tweets. Available online: http://www.csce.uark.edu/~xintaowu/publ/edbt16p.pdf (accessed on 27 May 2022).
- Lim, W.Y.; Lee, M.L.; Hsu, W. IFACT: An Interactive Framework to Assess Claims from Tweets. In Proceedings of the 2017 A.C.M. on Conference on Information and Knowledge Management, Singapore, 6–10 November 2017. [Google Scholar]
- Morabia, K.; Murthy, N.L.B.; Malapati, A.; Samant, S. SEDTWik: Segmentation-Based Event Detection from Tweets Using Wikipedia. In Proceedings of the 2019 Conference of the North Association for Computational Linguistics, Stroudsburg, PA, USA, 2–7 June 2019; pp. 77–85. [Google Scholar]
- Yamamoto, Y.; Kumamoto, T.; Nadamoto, A. Followee Recommendation Based on Topic Extraction and Sentiment Analysis from Tweets. In Proceedings of the 17th International Conference on Information Integration and Web-based Applications & Services, Brussels, Belgium, 11–13 December 2015. [Google Scholar]
- Kvtkn, P.; Ramakrishnudu, T. A Novel Method for Detecting Psychological Stress at Tweet Level Using Neighborhood Tweets. J. King Saud Univ. Comput. Inf. Sci. 2021. Epub ahead of print. [Google Scholar] [CrossRef]
- Zhou, L.; Wang, W.; Chen, K. Identifying Regrettable Messages from Tweets. In Proceedings of the 24th International Conference on World Wide Web—W.W.W. ‘15 Companion, Florence, Italy, 18–22 May 2015. [Google Scholar]
- Jussila, J.; Madhala, P. Cognitive Computing Approaches for Human Activity Recognition from Tweets—A Case Study of Twitter Marketing Campaign. In Research & Innovation Forum 2019; Springer International Publishing: Cham, Switzerland, 2019; pp. 153–170. ISBN 9783030308087. [Google Scholar]
- McClellan, C.; Ali, M.M.; Mutter, R.; Kroutil, L.; Landwehr, J. Using Social Media to Monitor Mental Health Discussions—Evidence from Twitter. J. Am. Med. Inform. Assoc. 2017, 24, 496–502. [Google Scholar] [CrossRef] [PubMed]
- Alowibdi, J.S.; Buy, U.A.; Yu, P.S.; Ghani, S.; Mokbel, M. Deception Detection in Twitter. Soc. Netw. Anal. Min. 2015, 5, 1–16. [Google Scholar] [CrossRef]
- Pauken, B.; Pradyumn, M.; Tabrizi, N. Tracking Happiness of Different U.S. Cities from Tweets. In Big Data—BigData 2018; Springer International Publishing: Cham, Switzerland, 2018; pp. 140–148. ISBN 9783319943008. [Google Scholar]
- Ibrohim, M.O.; Budi, I. Multi-Label Hate Speech and Abusive Language Detection in Indonesian Twitter. In Proceedings of the Third Workshop on Abusive Language Online, Florence, Italy, 1 August 2019; pp. 46–57. [Google Scholar]
- Sankaranarayanan, J.; Samet, H.; Teitler, B.E.; Lieberman, M.D.; Sperling, J. TwitterStand: News in Tweets. In Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems—G.I.S. ’09, Seattle WA, USA, 4–6 November 2009. [Google Scholar]
- Haque, R.; Hasanuzzaman, M.; Ramadurai, A.; Way, A. Mining Purchase Intent in Twitter. Comput. Sist. 2019, 23, 871–881. [Google Scholar] [CrossRef]
- Chaloulos, K. Inferring Shared Interests from Tweets. Available online: https://pub.tik.ee.ethz.ch/students/2011-FS/SA-2011-20.pdf (accessed on 27 May 2022).
- Bollen, J.; Mao, H.; Pepe, A. Modeling Public Mood and Emotion: Twitter Sentiment and Socio-Economic Phenomena. ICWSM 2011, 5, 450–453. [Google Scholar]
- Zheng, L.; Han, K. Extracting Categorical Topics from Tweets Using Topic Model. In Information Retrieval Technology; Springer: Berlin/Heidelberg, Germany, 2013; pp. 86–96. ISBN 9783642450679. [Google Scholar]
- Ilona, K.F.; Budi, I. Classification of Inundation Level Using Tweets in Indonesian Language. In Proceedings of the 2021 10th International Conference on Software and Computer Applications, Kuala Lumpur, Malaysia, 23–26 February 2021. [Google Scholar]
- Tankard, E.; Flowers, C.; Li, J.; Rawat, D.B. Toward Bias Analysis Using Tweets and Natural Language Processing. In Proceedings of the 2021 IEEE 18th Annual Consumer Communications & Networking Conference (CCNC), Las Vegas, NV, USA, 9–12 January 2021; pp. 1–3. [Google Scholar]
- Umakanth, N.; Santhi, S. Classification and Ranking of Trending Topics in Twitter Using Tweets Text. J. Crit. Rev. 2020, 7, 895–899. [Google Scholar] [CrossRef]
- Garcia-Lopez, F.J.; Batyrshin, I.; Gelbukh, A. Analysis of Relationships between Tweets and Stock Market Trends. J. Intell. Fuzzy Syst. 2018, 34, 3337–3347. [Google Scholar] [CrossRef]
- Liew, J.K.-S. Do Tweet Sentiments Still Predict the Stock Market? SSRN Electron. J. 2016, 1–16. [Google Scholar] [CrossRef]
- Zahra, K.; Azam, F.; Butt, W.H.; Ilyas, F. A Framework for User Characterization Based on Tweets Using Machine Learning Algorithms. In Proceedings of the 2018 VII International Conference on Network, Communication and Computing—ICNCC 2018, Taipei, Taiwan, 14–16 December 2018. [Google Scholar]
- Balusamy, B.; Murali, T.; Thangavelu, A.; Krishna, P.V. A Multi-Level Text Classifier for Feedback Analysis Using Tweets to Enhance Product Performance. Int. J. Electron. Mark. Retail. 2015, 6, 315. [Google Scholar] [CrossRef]
- Ahmed, H.; Razzaq, M.A.; Qamar, A.M. Prediction of Popular Tweets Using Similarity Learning. In Proceedings of the 2013 IEEE 9th International Conference on Emerging Technologies (ICET), Islamabad, Pakistan, 9–10 December 2013; pp. 1–6. [Google Scholar]
- Sharif, W.; Mumtaz, S.; Shafiq, Z.; Riaz, O.; Ali, T.; Husnain, M.; Choi, G.S. An Empirical Approach for Extreme Behavior Identification through Tweets Using Machine Learning. Appl. Sci. 2019, 9, 3723. [Google Scholar] [CrossRef]
- Vemprala, N.; Akello, P.; Valecha, R.; Rao, H.R. An Exploratory Analysis of Alarming and Reassuring Messages in Twitterverse during the Coronavirus Epidemic. In Proceedings of the AMCIS 2020, Virtual Conference, 15–17 August 2020. [Google Scholar]
- Akpojivi, U. Euphoria and Delusion of Digital Activism: Case Study of #ZumaMustFall. In Advances in Social Networking and Online Communities; I.G.I. Global: Hershey, PA, USA, 2018; pp. 179–202. ISBN 9781522528548. [Google Scholar]
- Anwar, M.; Yuan, Z. Linking Obesity and Tweets. In Smart Health; Springer International Publishing: Cham, Switzerland, 2016; pp. 254–266. ISBN 9783319291741. [Google Scholar]
- Silva, W.; Santana, Á.; Lobato, F.; Pinheiro, M. A Methodology for Community Detection in Twitter. In Proceedings of the International Conference on Web Intelligence—W.I. ’17, Leipzig, Germany, 23—26 August 2017. [Google Scholar]
- Sloan, L.; Morgan, J.; Housley, W.; Williams, M.; Edwards, A.; Burnap, P.; Rana, O. Knowing the Tweeters: Deriving Sociologically Relevant Demographics from Twitter. Sociol. Res. Online 2013, 18, 74–84. [Google Scholar] [CrossRef]
- Culotta, A.; Cutler, J. Mining Brand Perceptions from Twitter Social Networks. Mark. Sci. 2016, 35, 343–362. [Google Scholar] [CrossRef]
- Jain, S.; Sharma, V.; Kaushal, R. Towards Automated Real-Time Detection of Misinformation on Twitter. In Proceedings of the 2016 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Jaipur, India, 21–24 September 2016; pp. 2015–2020. [Google Scholar]
- Text Processing Extenstion of RapidMiner. Available online: https://marketplace.rapidminer.com/UpdateServer/faces/product_details.xhtml?productId=rmx_text (accessed on 11 July 2022).
- Text Analysis by AYLIEN. Available online: https://marketplace.rapidminer.com/UpdateServer/faces/product_details.xhtml?productId=rmx_com.aylien.textapi.rapidminer (accessed on 11 July 2022).
- String Matching Extenstion of RapidMiner. Available online: https://marketplace.rapidminer.com/UpdateServer/faces/product_details.xhtml?productId=rmx_string_matching (accessed on 11 July 2022).
- Levenshtein, V.I. Binary Codes Capable of Correcting Deletions, Insertions and Reversals. Sov. Phys. Dokl. 1966, 10, 707. [Google Scholar]
- Natural Language Toolkit. Available online: https://sourceforge.net/projects/nltk/ (accessed on 10 July 2022).
- SpaCy. Industrial-Strength Natural Language Processing in Python. Available online: https://spacy.io/ (accessed on 10 July 2022).
- TextBlob: Simplified Text Processing—TextBlob 0.16.0 Documentation. Available online: https://textblob.readthedocs.io/en/dev/ (accessed on 10 July 2022).
- Overview. Available online: https://stanfordnlp.github.io/CoreNLP/index.html (accessed on 10 July 2022).
- PyNLPl. Available online: https://pypi.org/project/PyNLPl/ (accessed on 10 July 2022).
- Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef]
- Varoquaux, G.; Buitinck, L.; Louppe, G.; Grisel, O.; Pedregosa, F.; Mueller, A. Scikit-Learn: Machine Learning without Learning the Machinery. GetMob. Mob. Comput. Commun. 2015, 19, 29–33. [Google Scholar] [CrossRef]
- Keras: The Python Deep Learning API. Available online: https://keras.io/ (accessed on 10 July 2022).
- PyTorch. Available online: https://pytorch.org/ (accessed on 10 July 2022).
- Pandas. Available online: https://pandas.pydata.org/ (accessed on 10 July 2022).
- Michalke, M. Text Analysis with Emphasis on POS Tagging, Readability, and Lexical Diversity [R Package KoRpus Version 0.13-8]. Available online: https://cran.r-project.org/web/packages/koRpus/index.html (accessed on 10 July 2022).
- Hornik, K. Apache OpenNLP Tools Interface [R Package OpenNLP Version 0.2-7]. Available online: https://cran.r-project.org/web/packages/openNLP/index.html (accessed on 10 July 2022).
- Quantitative Analysis of Textual Data. Available online: https://quanteda.io/ (accessed on 10 July 2022).
- Hornik, K. R/Weka Interface [R Package RWeka Version 0.4-44]. Available online: https://cran.r-project.org/web/packages/RWeka/index.html (accessed on 10 July 2022).
- Wrapper to the SpaCy NLP Library. Available online: https://spacyr.quanteda.io/ (accessed on 10 July 2022).
- Wickham, H. Simple, Consistent Wrappers for Common String Operations [R Package Stringr Version 1.4.0]. Available online: https://cran.r-project.org/web/packages/stringr/index.html (accessed on 10 July 2022).
- Selivanov, D. Text2vec: Fast Vectorization, Topic Modeling, Distances and GloVe Word Embeddings in R. Available online: https://github.com/dselivanov/text2vec (accessed on 10 July 2022).
- Text Mining Package [R Package Tm Version 0.7-8]. Available online: https://cran.r-project.org/web/packages/tm/index.html (accessed on 10 July 2022).
- Apache OpenNLP. Available online: https://opennlp.apache.org/ (accessed on 10 July 2022).
- Apache UIMA—Apache UIMA. Available online: https://uima.apache.org/ (accessed on 11 July 2022).
- Wikipedia Contributors General Architecture for Text Engineering. Available online: https://en.wikipedia.org/w/index.php?title=General_Architecture_for_Text_Engineering&oldid=1065938586 (accessed on 11 July 2022).
- LingPipe Home. Available online: http://www.alias-i.com/lingpipe/ (accessed on 10 July 2022).
- Mallet: MAchine Learning for LanguagE Toolkit. Available online: https://mimno.github.io/Mallet/ (accessed on 10 July 2022).
- NLP4J by Emorynlp. Available online: https://emorynlp.github.io/nlp4j/ (accessed on 10 July 2022).
- Welcome to Apache Lucene. Available online: https://lucene.apache.org/ (accessed on 10 July 2022).
- Emms, S. MITIE: MIT Information Extraction. Available online: https://www.linuxlinks.com/mitie-mit-information-extraction/ (accessed on 11 July 2022).
- Emms, S. MeTA—Modern C++ Data Sciences Toolkit. Available online: https://www.linuxlinks.com/meta-modern-c-plus-plus-data-sciences-toolkit/ (accessed on 11 July 2022).
- Emms, S. CRF++: Yet Another CRF Toolkit. Available online: https://www.linuxlinks.com/crf-yet-another-crf-toolkit/ (accessed on 11 July 2022).
- van Gompel, M. Colibri-Core. Available online: https://github.com/proycon/colibri-core (accessed on 11 July 2022).
- Wang, C. InsNet. Available online: https://github.com/chncwang/InsNet (accessed on 11 July 2022).
- Libfolia: FoLiA Library for C++. Available online: https://github.com/LanguageMachines/libfolia (accessed on 11 July 2022).
- Twitter-Text. Available online: https://github.com/twitter/twitter-text (accessed on 11 July 2022).
- Moore, B. Knwl.Js. Available online: https://github.com/benhmoore/Knwl.js (accessed on 11 July 2022).
- Poplar. Available online: https://github.com/synyi/poplar (accessed on 11 July 2022).
- Nlp.Js. Available online: https://github.com/axa-group/nlp.js (accessed on 11 July 2022).
- Node-Question-Answering. Available online: https://github.com/huggingface/node-question-answering (accessed on 11 July 2022).
- Saul. Available online: https://github.com/CogComp/saul (accessed on 11 July 2022).
- Astrakhantsev, N. ATR4S: Toolkit with State-of-the-Art Automatic Terms Recognition Methods in Scala. arXiv 2016, arXiv:1611.07804. [Google Scholar] [CrossRef] [Green Version]
- Stanton, A. Word2vec-Scala: Scala Port of the Word2vec Toolkit. Available online: https://github.com/Refefer/word2vec-scala (accessed on 11 July 2022).
- Hall, D. Epic. Available online: https://github.com/dlwh/epic (accessed on 11 July 2022).
- Tm: Regularized Multilingual Probabilistic Semantic Analysis Scala Implementation. Available online: https://github.com/ispras/tm (accessed on 11 July 2022).
- Potapov, S. Whatlang-Rs. Available online: https://github.com/greyblake/whatlang-rs (accessed on 11 July 2022).
- Snips-Nlu-Rs: Snips NLU Rust Implementation. Available online: https://github.com/snipsco/snips-nlu-rs (accessed on 11 July 2022).
- Rust-Bert. Available online: https://github.com/guillaume-be/rust-bert (accessed on 11 July 2022).
- Hinman, L. Clojure-Opennlp. Available online: https://github.com/dakrone/clojure-opennlp (accessed on 11 July 2022).
- Inflections-Clj. Available online: https://github.com/r0man/inflections-clj (accessed on 11 July 2022).
- Postagga: A Library to Parse Natural Language in Pure Clojure and ClojureScript. Available online: https://github.com/turbopape/postagga (accessed on 11 July 2022).
- Monkeylearn-Ruby. Available online: https://github.com/monkeylearn/monkeylearn-ruby (accessed on 11 July 2022).
- Dialogflow-Ruby-Client: Ruby SDK for Dialogflow. Available online: https://github.com/dialogflow/dialogflow-ruby-client (accessed on 11 July 2022).
- Kane, A. FastText-Ruby: Efficient Text Classification and Representation Learning for Ruby. Available online: https://github.com/ankane/fastText-ruby (accessed on 11 July 2022).
- Granger, M. Ruby-Wordnet. Available online: https://github.com/ged/ruby-wordnet (accessed on 11 July 2022).
- Ruby-Fann. Available online: https://github.com/tangledpath/ruby-fann (accessed on 11 July 2022).
- Tensorflow.Rb: Tensorflow for Ruby. Available online: https://github.com/somaticio/tensorflow.rb (accessed on 11 July 2022).
- Wailes, C. RLTK: The Ruby Language Toolkit. Available online: https://github.com/chriswailes/RLTK (accessed on 11 July 2022).
Attribute Name | Description |
---|---|
Row no. | Row number of the results |
Id | ID of the Tweet |
Created-At | Date and time when the Tweet was posted |
From-User | Twitter username of the user who posted the Tweet |
From-User-Id | Twitter User ID of the user who posted the Tweet |
To-User | Twitter username of the user whose Tweet was replied to (if the Tweet was a reply) in the current Tweet |
To-User-Id | Twitter user ID of the user whose Tweet was replied to (if the tweet was a reply) in the current Tweet |
Language | Language of the tweet |
Source | Source of the Tweet to determine if the Tweet was posted from an Android source, Twitter website, etc. |
Text | Complete text of the Tweet, including embedded URLs |
Geo-Location-Latitude | Geo-Location (Latitude) of the user posting the Tweet |
Geo-Location-Longitude | Geo-Location (Longitude) of the user posting the Tweet |
Retweet Count | Retweet count of the Tweet |
Filename | Number of Tweet IDs | Date Range of the Tweets |
---|---|---|
Exoskeleton_TweetIDs_Set1.txt | 22,945 | 20 July 2021–21 May 2022 |
Exoskeleton_TweetIDs_Set2.txt | 19,415 | 1 December 2020–19 July 2021 |
Exoskeleton_TweetIDs_Set3.txt | 16,673 | 29 April 2020–30 November 2020 |
Exoskeleton_TweetIDs_Set4.txt | 16,208 | 5 October 2019–28 April 2020 |
Exoskeleton_TweetIDs_Set5.txt | 17,983 | 13 February 2019–4 October 2019 |
Exoskeleton_TweetIDs_Set6.txt | 34,009 | 9 November 2017–12 February 2019 |
Exoskeleton_TweetIDs_Set7.txt | 11,351 | 21 May 2017–8 November 2017 |
Language | Language Code | Absolute Count | Fraction |
---|---|---|---|
English | en | 71,585 | 0.904296308 |
Danish | da | 1085 | 0.013706244 |
Tagalog | tl | 926 | 0.011697679 |
Spanish | es | 774 | 0.009777542 |
Indonesian | in | 419 | 0.00529301 |
Japanese | ja | 288 | 0.003638155 |
French | fr | 267 | 0.003372873 |
German | de | 206 | 0.002602292 |
Hungarian | hu | 197 | 0.002488599 |
Czech | cs | 192 | 0.002425437 |
Catalan | ca | 183 | 0.002311744 |
Romanian | ro | 181 | 0.002286479 |
Turkish | tr | 119 | 0.001503265 |
Estonian | et | 117 | 0.001478001 |
Portuguese | pt | 110 | 0.001389573 |
Finnish | fi | 105 | 0.001326411 |
Basque | eu | 96 | 0.001212718 |
Dutch | nl | 89 | 0.001124291 |
Slovenian | sl | 84 | 0.001061129 |
Russian | ru | 65 | 8.21E-04 |
Thai | th | 50 | 6.32E-04 |
Haitian | ht | 43 | 5.43E-04 |
Italian | it | 36 | 4.55E-04 |
Arabic | ar | 31 | 3.92E-04 |
Lithuanian | lt | 19 | 2.40E-04 |
Swedish | sv | 18 | 2.27E-04 |
Polish | pl | 16 | 2.02E-04 |
Papiamentu | qst | 13 | 1.64E-04 |
Korean | ko | 11 | 1.39E-04 |
Kunstsprachen | art | 8 | 1.01E-04 |
Chinese | zh | 8 | 1.01E-04 |
Greek | el | 6 | 7.58E-05 |
Vietnamese | vi | 6 | 7.58E-05 |
Hindi | hi | 5 | 6.32E-05 |
Icelandic | is | 5 | 6.32E-05 |
Norwegian | no | 4 | 5.05E-05 |
Persian | fa | 3 | 3.79E-05 |
Hebrew | iw | 3 | 3.79E-05 |
Welsh | cy | 2 | 2.53E-05 |
Latvian | lv | 2 | 2.53E-05 |
Malayalam | ml | 2 | 2.53E-05 |
Urdu | ur | 2 | 2.53E-05 |
Bulgarian | bg | 1 | 1.26E-05 |
Gujrati | gu | 1 | 1.26E-05 |
Armenian | hy | 1 | 1.26E-05 |
Georgian | ka | 1 | 1.26E-05 |
Burmese | my | 1 | 1.26E-05 |
Tamil | ta | 1 | 1.26E-05 |
Ukrainian | uk | 1 | 1.26E-05 |
Date | Number of Tweets |
---|---|
4 October 2019 | 1014 |
30 March 2018 | 214 |
10 November 2017 | 203 |
24 February 2019 | 194 |
8 July 2018 | 188 |
7 October 2019 | 184 |
6 July 2017 | 162 |
6 October 2019 | 160 |
22 November 2019 | 151 |
29 March 2018 | 137 |
24 November 2017 | 113 |
29 November 2018 | 111 |
23 June 2017 | 109 |
27 September 2017 | 109 |
14 December 2020 | 108 |
11 November 2017 | 107 |
25 February 2019 | 104 |
9 January 2019 | 104 |
23 November 2019 | 103 |
24 November 2019 | 101 |
28 November 2017 | 100 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Thakur, N. Twitter Big Data as a Resource for Exoskeleton Research: A Large-Scale Dataset of about 140,000 Tweets from 2017–2022 and 100 Research Questions. Analytics 2022, 1, 72-97. https://doi.org/10.3390/analytics1020007
Thakur N. Twitter Big Data as a Resource for Exoskeleton Research: A Large-Scale Dataset of about 140,000 Tweets from 2017–2022 and 100 Research Questions. Analytics. 2022; 1(2):72-97. https://doi.org/10.3390/analytics1020007
Chicago/Turabian StyleThakur, Nirmalya. 2022. "Twitter Big Data as a Resource for Exoskeleton Research: A Large-Scale Dataset of about 140,000 Tweets from 2017–2022 and 100 Research Questions" Analytics 1, no. 2: 72-97. https://doi.org/10.3390/analytics1020007