Article

Attitudes Expressed in Online Comments about Environmental Factors in the Tourism Sector: An Exploratory Study

by Jose Ramon Saura 1,*, Pedro Palos-Sanchez 2 and Miguel Angel Rios Martin 3
1 Department of Business and Economics, Faculty of Social Sciences and Law, Rey Juan Carlos University, Paseo Artilleros s/n, Madrid 28032, Spain
2 Department of Business Organization, Marketing and Market Research, International University of La Rioja, Av. de la Paz 137, 26006 Logroño, Spain
3 Department of Financial Economy and Operations Management, Faculty of Economics and Business, University of Sevilla, Av. de Ramon y Cajal, 1, Sevilla 41004, Spain
* Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2018, 15(3), 553; https://doi.org/10.3390/ijerph15030553
Submission received: 12 January 2018 / Revised: 20 February 2018 / Accepted: 15 March 2018 / Published: 19 March 2018
(This article belongs to the Special Issue Green Environment, Green Operations and Sustainability)

Abstract

The objective of this exploratory study is to identify the positive, neutral and negative environment factors that affect users who visit Spanish hotels, in order to help hotel managers decide how to improve the quality of the services provided. To carry out the research, a Sentiment Analysis was first performed, grouping the sample of tweets (n = 14,459) according to the feelings shown, and then a textual analysis was used to identify the key environment factors in these feelings using the qualitative analysis software Nvivo (QSR International, Melbourne, Australia). The results of the exploratory study present the key environment factors that affect the users' experience when visiting hotels in Spain, such as actions that support local traditions and products, the maintenance of rural areas respecting the local environment and nature, or respecting air quality in the areas where hotels have facilities and offer services. The conclusions of the research can help hotels improve their services and their impact on the environment, as well as improve the visitors' experience based on the positive, neutral and negative environment factors which the visitors themselves identified.

1. Introduction

1.1. Global Overview of Tourism Trends

Over the past few years, the use of new technologies has changed the environmental, cultural, economic and social sustainability sectors compared with how they were at the beginning of the 21st century [1,2]. Tourism and the activities linked to its growth have become a key part of the world economy, and local populations have suffered the consequences of the impact of these activities on the environment [3].
Tourism activities provide significant income and jobs, promote knowledge of other cultures, conserve cultural and natural heritage, and bring investment in local infrastructure [4]. These activities provide both economic and social benefits. However, not everything is positive, as some styles of tourism and certain recreational activities can lead to the destruction of habitats, the deterioration of the landscape and competition for scarce resources and services (fresh water, territory, energy, wastewater treatment, etc.) [5].
In addition, local populations may lose their traditional way of life as a consequence of these activities and could also become excessively dependent on the income generated by tourism [6]. The increase in prices associated with tourism can also negatively affect the local population, which runs the risk of losing land, houses, shops and services for the construction of hotels, hostels, rural houses or tourist services [7]. These problems are worsened by the concentration of tourist activity in relatively short holiday periods and in certain limited areas. These areas may also be subject to environmental pressures from other economic activities such as agriculture, fishing, and industrial development or from the increasing resident population. More than other sectors, tourism and recreational activities depend on the quality of the natural and cultural environment for their long-term success.
It can therefore be seen that tourism can affect the natural environment to the point of endangering its own existence. To stop this, it is important to identify the environment-related factors which hotel users detect during their stays at hotels. These users are aware of the problems and the impact of tourism on the environment [8,9]. It is important to draw attention to the fact that the development of new technologies, and especially social networks, has caused important changes to the tourism sector [6]. One great change is that people who stay at hotels can easily make comments on the Internet and provide information about their experience and the quality of their visit [9,10].
Like Hung, Kuo and Lin [6], Bifet and Frank [7] analyzed the posts made on social networks by companies in one sector. The results show that the tone and narrative used in a post are directly associated with the interest shown in it by both mobile and desktop social network users, which allows for the identification of the key factors that should be used to establish company strategies [11,12,13,14]. For some time, social networks and their use have been of interest to researchers in different sectors, such as environmental, cultural, economic and social sustainability. Examples include the work of Kwon [15,16,17], in which a qualitative approach is taken to determine the attitudes of older people, and the work of Honeycutt et al. [18], in which a textual analysis is performed to determine and understand short- and long-term impacts in the field of environmental science and public health.

1.2. Literature Review

The interest that researchers have in identifying the environmental factors which affect users' experiences when staying at hotels must also be considered. This interest is shown in various research papers, such as that of Hussain and Singh [19], which aims to discover the attitudes and behaviors of hotel users concerning sustainability. The work of Suanmali [20] is also concerned with detecting the factors that affect tourists' satisfaction with the Northern Part of Thailand, using an empirical study relating to the environment.
In the work of Kaltenborn et al. [21], tourists’ attitudes are investigated concerning the environmental, social and managerial attributes of the Serengeti National Park, concluding with the identification of factors that affect hotel users concerned about the environmental impact of hotel activities. The work of Bruyere [22] aims to identify rural hotel users’ feelings about the benefits of exploiting rural areas for tourism and then suggest how the management of rural hotels can use these findings. Eliam and Trop [23,24] research users’ opinions on activities that affect the environment, identifying the main points which concern travelers.
In order to identify the factors that affect Spanish hotel users during their stay, we compiled a set of factors that affect the environment (see Table 1). It is important to identify the environment factors detected by the users of Spanish hotels as these users are already aware of the problems and the impact of tourism on the environment. Therefore, the experiences and feelings of these users when identifying environment factors of the tourism industry are very valuable.
It is also useful to remember that the world tourism sector keeps growing [25]. In 2016, 1235 million tourists traveled to foreign countries, which means some 46 million more than the previous year, according to the World Tourism Organization (WTO) [26,27]. This is the seventh consecutive year of increases. However, the growth rate has slowed down as in 2016 the increase was 3.9%, whereas in previous years the rise has been around 4.6%. This means that last year there were 300 million foreign tourists more than in 2008 [28,29,30].
If we look at the 2017 WTO world tourism figures, the number of visitors at destinations all around the world shows that there was a large demand for international tourism in the first half of 2017. Worldwide, international tourist arrivals increased by 6% compared to the same semester of the previous year, far surpassing the sustained and constant growth trend of at least 4% observed since 2010 [31]. These figures give the first six months of 2017 the best semi-annual results obtained in the last seven years. The results are related to the strong growth registered in many destinations and the continuing recovery in those that had registered falls in previous years. Of all WTO regions, growth was greatest in the Middle East (+9%), Europe (+8%) and Africa (+8%), followed by Asia and the Pacific (+6%) and then the Americas (+3%) [27].
In addition, the WTO research concludes that the growth of arrivals was driven by the demand for outbound tourism from the main source markets. In particular, Canada, China, the Republic of Korea, Spain (the object of our investigation), the United States, France and the United Kingdom have continued to report strong growth in outbound tourism expenditure.

1.3. The Context of Hotel Tourism in Spain

When studying the state of tourism in Spain, which is the geographical area on which this study is centered, we found that Spain received 75.6 million tourists in 2016. These figures are 10.3% higher than for 2015. In the last month of 2016, Spain was visited by 4 million foreign tourists, a figure that means an advance of 13.3% on the previous year. In Spain, this sector is a very important part of the national economy, contributing to nearly 11% of GDP (Gross Domestic Product) [27].
Tourists have also increased their spending to 77,000 million Euros. Tourists in Spain usually stay for between 4 and 7 nights. The main tourist destination in Spain was Catalonia. With 18 million tourists, it received 4% more visitors than in 2015. Next were the Canary Islands (with 13.3 million and an increase of 13.2%) and Illes Balears (with 13 million which means an increase of 11.9%) [27,28].
To understand the relevance of the analysis of the best hotels in Spain, we can see the results of FRONTUR's research in Table 2. This shows that the number of tourists using paid accommodation as their main type of accommodation increased by 15.1% year on year in December. In this market, hotel accommodation increased by 11.2% and rented housing increased by 49.2%. Unpaid accommodation showed an increase of 9.1%. Tourists staying in the homes of family or friends increased by 0.3% and those staying in self-owned housing by 42.5% [28].
In order to calculate the consequences tourism has on the environment it is also useful to see how these travelers arrived. The largest number of tourists to Spain arrived by air transport (December 2016) with almost 3.2 million arrivals, which represents an annual growth of 16.6%. By road, 0.1% more tourists arrived than in December 2015, while 41.3% more entered by rail and 8.8% more through sea ports, as can be seen in Table 3 [22].
This data emphasizes the importance that tourism has in the Spanish economy. In order to guarantee its future, companies and especially hotels, must identify the parts of their businesses that affect the environment [21,22].
One of the most important factors to be considered is the protection of the environment. This concept includes many points, not only landscapes and natural resources, but is also closely linked to the quality of the services which are offered and is a variable that constantly appears in the tourism sector [22].
This research can therefore be seen to be of interest to the management of hotels, who have to understand that the activities of their hotels and related services affect the environment and world sustainability. Consequently, it is important for them to improve their services by detecting if the hotel users considered that the environment is well treated or not during their stay at the hotel.
Hotels should recognize the importance of the environment as a resource for this sector, so that it is not harmed in any way and the current and future economic and social stability of tourism, which is a very important sector in the global economy, is not affected. The future proposal for the tourism sector is based on quality and sustainable tourism which respects the environment [21]. Tourism and recreational activities depend far more than other sectors on the quality of the natural and cultural environment for their long-term success. In this way, when a country which has attractive areas for tourism becomes an interesting destination for tourism and recreational activities, uncontrolled environmental impacts can jeopardize future benefits [22]. Therefore, tourism can affect the natural environment to the point of endangering its existence, hence the importance of identifying factors that affect the environment as a result of tourism activities in hotels.
In this sense, with the developments produced in sectors such as environmental, cultural, economic and social sustainability, this research is concerned with the identification of environment factors from the analysis of the Twitter users’ opinions for the 25 Hotels that won the Traveler’s Choice award from TripAdvisor in 2017. In this exploratory study, Twitter as a means of commenting on a topic, the opinions of hotel users and the identification of key environment factors that affect hotel visitors will be studied.
To complete this exploratory study, a Sentiment Analysis was carried out using an algorithm developed in Python by MonkeyLearn API (MonkeyLearn, San Francisco, CA, USA) [32], with which 14,459 tweets were grouped according to feeling (positive, negative or neutral). A textual analysis was then performed on these tweets to identify the main environmental factors from the feelings shown, using the qualitative analysis software Nvivo (QSR International, Melbourne, Australia). Afterwards, a linear correlation analysis and a difference between means test were done using IBM SPSS version 24 (IBM, Chicago, IL, USA) and the analysis and results were presented. Finally, the results obtained are discussed and conclusions are drawn.

2. Related Work

2.1. Machine Learning and Sentiment Analysis Approaches for Social Network Analysis

Several researchers have developed models based on machine learning for the analysis of social networks, the opinions that users have or to identify key factors related to a specific theme [33,34]. Supervised methods based on classification and categorization of key factors, such as Maximum Entropy (MaxEnt) and Support Vector Machines (SVMs), are used with a combination of features to perform social network analysis with machine learning and technology-based research methods to identify the important factors of a research category [7,35,36,37]. These investigations can be based on keywords, ratings of feelings regarding a topic, semantic meaning, concepts and semantic theories, sentiment-topic features such as hashtags, retweets or points on social networks, and valuation identifiers for products and services on the Internet [38,39].
The research of Pak and Paroubek [40] presents an in-depth development of methodologies based on a study of Twitter. In general, semantic approaches based on sentiment analysis determine the occurrence of the key words, to which a statistical factor has been added. This factor determines the feeling [41] and is often used to perform analysis of positive and negative feelings and shows that Twitter can be an appropriate platform for researching factors of interest [42,43].
In the research by Zhang, Yun, Liang and Zhang [44] a semi-supervised dual recurrent neural network is proposed to prepare a Sentiment Analysis. This is similar to traditional neural networks and can be used to evaluate a set of data over a long period of time. This technique allows a more effective and efficient sentiment analysis to be carried out.
In Turney and Pantel [45], a recursive neural network is used to understand the meaning of particular comments. To achieve this, sentiment analysis identifies words and offers the semantic meaning for the particular topic of interest [46,47,48]. Neural networks are used to establish labels for each word and classify them according to predetermined criteria.
The research of Robinson [49] proposed a probabilistic model that is called Textual-based Information Diffusion and Evolution (TIDE), with which the evolution of different topics in social communities and their diffusion over time was measured. This method was based on Sentiment Analysis. The model extracts characteristics from the text of the comments made and implicitly captures them using standard ranges of the Gaussian field, as other authors have done [50,51,52,53].
Table 4 shows a summary of the main research using Sentiment Analysis and textual analysis combined with other methodologies based on machine learning and semantic analysis.

2.2. Textual Analysis

Textual analysis is a qualitative and exploratory procedure that determines the key factors of an event or object of study by grouping them into topic nodes. The Nvivo qualitative analysis software is one of the most relevant in this research category and has been used on numerous occasions in the last decade as a research method [55,56,57,58].
In Vázquez and Escamilla [52] a qualitative approach using the Nvivo software is taken to determine the attitudes of older people in order to perform a textual analysis on the contents of the results of the research. Ramirez-Andreotta et al. [53] also carry out a textual analysis to determine and understand the short- and long-term impacts of bio-monitoring and exposure of the participants in the study, in order to identify future factors for environmental justice using the Nvivo software.
In the work of Saito, Nakano and Kimura [59] a probabilistic matrix for the prediction of re-tweets based on textual analysis was developed. The social context of the relationships between the messages and their time latency were studied. Likewise, Jiang et al. [60] analyzed the fundamental factors that affect a concept called “re-tweetability” of each tweet when using a predictive filter based on collaboration between users. Textual analysis is used to determine and identify the most repeated factors of the study, and, based on these, determine the corresponding actions for the investigation.
In Table 5 the research using textual analysis can be seen, along with the main characteristics of qualitative analysis software such as Nvivo, which includes classification into nodes, categorization by topic, number of times a keyword is repeated and the type of keywords that are repeated.

3. Conceptual Framework and Hypothesis Development

As we have already outlined, the interest of researchers over the last decades has been focused on the identification and analysis of key determinant factors in social networks with regard to a specific topic. Suanmali [24] detects the environment factors that affect the satisfaction of tourists by using an empirical study in the Northern Part of Thailand and links these factors to the actions carried out by hotels in the geographical area in which they are located. Kaltenborn [25] also investigates the attitudes of tourists to the environmental, social and managerial attributes of Serengeti National Park, concluding with the identification of factors that affect the interest of users and then defines and suggests actions that can be taken by the hotel sector to respect and conserve the environment.
The environmental actions carried out by hotels are those actions related to the improvement of services based on social policies, the promotion of traditional products or local consumer goods, services related to the quality of the air or the mountain areas where the facilities are located, etc. Pak and Paroubek [40] carried out research in which a recursive neural network was used to understand what a particular content says and categorizes it with respect to the identification of specific terms. Pak and Paroubek [40] undertake a sentiment analysis to identify key words and offer the semantic meaning for each of the positive, negative or neutral links [42]. Based on the statements made above, it is proposed that:
Hypothesis 1 (H1).
The type of consumer experience in Spanish hotels (positive, negative or neutral) influences the environmental actions carried out by hotels.
As indicated previously, Bruyere [26] identified the feelings of users who visited hotels in rural areas and how these influenced the users' experiences in hotels in those areas. Eliam [27] also investigates user attitudes to the environment and identifies the key factors that were expressed.
Likewise, Moreno et al. [12] develop an enriched consumer recruitment system to increase the retention of users in campaigns for community schemes on social networks, classifying the key factors that were shared in the content and relating them to the research objective [61].
The factors that users consider to be relevant to the environment in hotels can determine their satisfaction with the services contracted in the hotel and can modify their enjoyment in a positive or negative way. The work of Roshan et al. [11] focuses on companies in different periods in order to categorize the most important factors when assessing the users' experience with the companies. Roshan et al. [11] evaluate users' experiences and opinions, and determine key factors from these [62]. Therefore, the following hypothesis is presented:
Hypothesis 2 (H2).
The environmental factors that users of Spanish hotels observe during their stays influence their experience at the hotel.

4. Methodology

The methodology used was firstly to perform a Sentiment Analysis on Twitter posts [7,33]. Using the results of this Sentiment analysis, a textual analysis was done to identify the key factors related to the environment [44,46]. Finally, bivariate linear correlation was used to demonstrate the statistical significance of the results [63] and then a difference of means test was done on the type of hotels, which were grouped according to their star ratings. This test was based on the Levene Test using the statistical analysis program IBM SPSS version 24.
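The Levene test mentioned above checks whether groups have equal variances before their means are compared. As a minimal sketch only, assuming the mean-centred form of the statistic and using invented probability scores for two hypothetical star-rating groups (the study itself used IBM SPSS, not this code), the W statistic could be computed as follows:

```python
def levene_w(*groups):
    """Levene's W statistic, using absolute deviations from each group mean."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    if k < 2 or any(len(g) < 2 for g in groups):
        raise ValueError("need at least two groups with two or more values each")

    # Absolute deviation of every observation from its own group mean.
    deviations = []
    for g in groups:
        mean_g = sum(g) / len(g)
        deviations.append([abs(v - mean_g) for v in g])

    group_means = [sum(z) / len(z) for z in deviations]
    grand_mean = sum(sum(z) for z in deviations) / n_total

    # Between-group and within-group sums of squares of the deviations.
    between = sum(len(z) * (m - grand_mean) ** 2
                  for z, m in zip(deviations, group_means))
    within = sum((v - m) ** 2
                 for z, m in zip(deviations, group_means) for v in z)
    if within == 0:
        raise ValueError("deviations are constant within every group")
    return (n_total - k) / (k - 1) * between / within


# Invented values for illustration; not data from the study.
four_star = [0.60, 0.62, 0.58, 0.61]
five_star = [0.55, 0.68, 0.59, 0.66]
w = levene_w(four_star, five_star)
```

A larger W suggests that the group variances differ. The statistic is compared with an F distribution with k − 1 and N − k degrees of freedom to obtain a significance level, a step that is left out of this sketch.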

4.1. Sample

As we have already indicated, the objective of this exploratory study is to identify the positive, neutral and negative factors related to the environment that affect the experiences of hotel users. These factors can help hotel management to improve their services and propose new social responsibility strategies for sustainability and respect for the environment, such as tourist activities which respect the environment or the promotion of activities that do not pollute the environment.
The research sample is composed of Spanish hotels ranked using the TripAdvisor Traveler's Choice Awards, which draw on more than 500 million opinions from travelers in Spain. It is also important to note that Spain has become the second most visited country in the world (82 million tourists visited Spain in 2018 according to data from the government of Spain). To identify the factors related to the environment, we analyzed the reviews that hotel users made on Twitter between 1 October 2016 and 1 October 2017 for the 25 hotels that won the Traveller's Choice Awards. A total of n = 14,459 tweets for the 25 hotels that make up the sample were analyzed [64]. The sample is made up of the Twitter profiles of these 25 hotels, which have been classified according to:
  • Active and official profile on Twitter
  • Public profile
  • Number of opinions in tweets
  • Number of interactions with users
Appendix A shows the identification of the hotels under study in TripAdvisor and Twitter.

4.2. Data Collection and Extraction

Data was collected using the Twitter API between 1 October 2016 and 1 October 2017. To carry out the Sentiment Analysis, we used the access algorithm in the MonkeyLearn API [32] that is written in Python and uses machine learning techniques to improve the levels of prediction and significance.
Firstly, the data was collected and classified using the Twitter API. Secondly, after having thoroughly trained the algorithm that does the Sentiment Analysis with part of the sample data, the entire database was analyzed and the tweets for each hotel were divided into negative, positive and neutral groupings. Thirdly, a textual analysis was performed using the Nvivo 11 software (QSR International, Melbourne, Australia). The tweets were separated into nodes according to the feelings expressed (N1, N2 and N3) and then thematic nodes were identified to test N4, which is made up of the key environment factors.
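The separation of classified tweets into the nodes N1, N2 and N3 can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the authors' pipeline: the label strings, the `NODE_BY_SENTIMENT` mapping, the `group_into_nodes` helper and the example tweets are all hypothetical, and the classifier output is taken as given rather than obtained from any real API call.

```python
from collections import defaultdict

# Hypothetical mapping from sentiment label to the node names used in the text.
NODE_BY_SENTIMENT = {"negative": "N1", "neutral": "N2", "positive": "N3"}


def group_into_nodes(classified_tweets):
    """Group (text, label) pairs into the nodes N1, N2 and N3."""
    nodes = defaultdict(list)
    for text, label in classified_tweets:
        node = NODE_BY_SENTIMENT.get(label.lower())
        if node is None:
            raise ValueError(f"unknown sentiment label: {label!r}")
        nodes[node].append(text)
    return dict(nodes)


# Invented example tweets, for illustration only.
sample = [
    ("Loved the local pastries at breakfast", "positive"),
    ("Rubbish was left outside the hotel all day", "negative"),
    ("The hotel is close to the craft market", "neutral"),
]
nodes = group_into_nodes(sample)
```

Each node then holds the raw tweet texts on which the subsequent textual analysis is run.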

4.3. Textual Analysis

The next step was to perform a textual analysis to identify the environment factors in the analyzed tweets. We used the Nvivo software to do this as it allows us to completely configure the analysis for our test purpose [22].
The Nvivo software allows for classification by nodes. The nodes are configured as containers for the information which includes the evidence and has already been grouped beforehand [65,66,67]. It must be emphasized that the creation, design and exploration of nodes is a way to research raw data, in order to achieve higher quality descriptive and explanatory levels than could be reached without it [68]. Free nodes are data containers that can group ideas separately and that are not conceptually related to other nodes in an analysis. Branched nodes are used to represent concepts grouped by categories which are logically linked and can be grouped hierarchically. The analysis results are then presented and are characterized using different indicators [69].
To do this we have established different nodes to identify the main environmental factors from the feelings examined with the nodes [70]. Firstly, a textual analysis was carried out on environmental factors in the tweets, classified as negative feelings (N1), secondly, with the tweets classified as neutral feelings (N2) and thirdly with the tweets classified as positive feelings (N3).
An exploratory analysis of each of the nodes allowed us to identify environment factors which were grouped into the node (N4) and then divided into three categories. Table 6 shows the classification of the nodes according to negative, neutral and positive environment factors [71].
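To give a rough idea of how keyword importance within a node can be quantified, the sketch below computes each keyword's share of all words in a set of tweets. It is an assumption-laden approximation: the `keyword_shares` helper, the tokenization rule and the example tweets are hypothetical, and the simple relative frequency shown here is not claimed to reproduce the exact weighted percentage calculated by Nvivo.

```python
import re
from collections import Counter


def keyword_shares(tweets, keywords):
    """Return each keyword's share (in percent) of all words in the tweets."""
    words = []
    for tweet in tweets:
        # Lower-case the text and keep alphabetic tokens only.
        words.extend(re.findall(r"[a-z']+", tweet.lower()))
    counts = Counter(words)
    total = len(words)
    if total == 0:
        return {kw: 0.0 for kw in keywords}
    return {kw: 100.0 * counts[kw] / total for kw in keywords}


# Invented example tweets, for illustration only.
tweets = ["Clean air and clean rivers", "The air was fresh"]
shares = keyword_shares(tweets, ["air", "rivers", "garbage"])
```

Keywords with the highest shares in a node would be candidates for the environment factors grouped under N4.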

5. Findings

5.1. Sentiment Analysis

In order to identify the environment factors observed by hotel users, the results of the textual analysis were analyzed for each of the nodes [72].
Firstly, the classifications which were used for the Sentiment Analysis carried out with machine learning are presented. These classify all of the users’ opinion tweets into positive, negative and neutral [73].
Next, in Table 7, the probability coefficients obtained from the classification of the Sentiment Analysis are presented with a summary of the content averages for each feeling. Then, the results of the textual analysis are presented for the nodes that subdivide each of the topics and factors that have been identified in the research.
The total number of analyzed tweets was n = 14,459. The average number of published tweets was 657.22, and in the machine learning Sentiment Analysis the greatest probability percentage was 0.679 and the least was 0.555 [74].
The probability percentages resulting from the Sentiment Analysis for each tweet classification can be seen in Figure 1. The probability percentage is a measure of accuracy, precision and recall of the samples in each category.
Of all the analyzed tweets, 447 were negative, 7737 were neutral and 6275 were positive, which shows that Twitter can be used as a platform to establish a relationship with the user [75].

5.2. Textual Analysis

To identify the key environment factors in these user interactions, we carried out a textual analysis with Nvivo, for which each of the nodes of the textual analysis was divided into positive, negative and neutral nodes for the environment factors [76].
In Figure 2 below, we can see the results of the global textual analysis after the Sentiment Analysis, and the environment factors which were tested [77].
The textual analysis of the environmental factors in N1 found the following results when analyzing the global opinions of users about their experience in the hotel [78]. Table 8 shows the semantic analysis for negative environmental factors and Table 9 shows three examples of negative tweets.
The negative factors in N1 show the users' concerns about garbage collection, air pollution and the pollution of the hotels' environment [76].
These results demonstrate the users' concerns about negative environment factors. The weighted percentages do not exceed 0.06 for the negative content, so we can affirm that the hotel users are not highly concerned about these environmental factors [76].
The environment factors for N2, neutral factors, are shown in Table 10 below and Table 11 shows three examples of neutral tweets.
The textual analysis of quality of service factors (N2) gave results showing how the users consider the products and services that the hotel makes available for planned trips and specialized offers.
Neutral environment factors were found, such as the importance of local products in local markets and shops, crafts from local potteries and artisans, traditional experiences such as visits to shelters, seminars and local monasteries that have not suffered changes over the years and finally a respect for nature, rivers and geographical features when constructing villages, along with the preservation of mountains and local roads. The weighted percentages for N2 show that users give greater importance to these factors than to those in N1. The highest weighted percentage for environment factors in the textual analysis was 0.40. Finally, the results of the textual analysis for environment factors of the 6275 tweets that make up the N3 node can be seen in Table 12. Also, Table 13 shows three examples of positive tweets.
A great variety of similar environment factors were found for N3. In particular, different contents about local tradition, customs, air quality, traditional ecology and the importance of nature were analyzed and evaluated.
The users positively valued the environment factors regarding hostels, routes, local products and shows. In addition, users showed positive relationships with the environments of dances and traditional customs, such as pastry making and viticulture. Another very important factor is the quality of the air, which can be linked to breathing problems, asthma and other diseases [73].
Traditional ecology is represented by factors such as salt pans, orchards and organic products. In general, nature is linked to concepts such as oasis, islands, mountains, rivers and paths. The weighted percentages of the textual analysis of N3 are between 0.39 and 0.09 in terms of positive factors related to the environment.
Table 14 shows all the environmental factors identified as a result of the Sentiment Analysis and subsequent textual analysis of the opinions and tweets of the hotel users that make up the sample.
Table 11 summarizes the main environmental factors in the users’ opinions for the hotels of the 2017 Traveler’s Award and can be used to highlight the most important factors for the consumer. That is to say, the analysis of the results shows how Spanish hotel users express their attitudes towards the environment factors which influence their experiences.
The results of this exploratory study can be used by hotel management to pay attention to the factors which are shown to be important to the guests. The positive factors that affect users visiting Spanish hotels are directly related to the activities carried out or managed by the hotels and which are associated with local traditions, air quality, customs, traditional ecology or nature. Hotel management can therefore organize activities that promote these factors to improve the users’ experience.
The results also show that the activities provided by the hotels with local and artisan products, traditional experiences and care for nature are factors that are neutral for visitors. However, the results of the exploratory study show that hotel management can use these results to improve their activities, initiatives and services, taking into account that the users regard garbage collection, atmospheric pollution and pollution as negative environment factors.

5.3. Linear Correlation Analysis and Difference between Means Tests

As an additional analysis of the investigation results, and to assess statistical significance, a bivariate linear correlation analysis was carried out to check for possible relationships between the variables. The dependent variables were the number of followers on Twitter, and the negative, neutral or positive feeling of the tweets that were in the results [63]. The independent variables were the number of stars that the hotels under study have on TripAdvisor, the number of comments in the official Twitter profiles and the number of followers [64,65].
This test is used to check the possible relationship between two metric variables. The measurement is carried out using the linear correlation coefficient, which ranges between −1 and +1. A value of +1 indicates that both variables vary in the same direction. A value of −1 indicates variation in opposite directions, which means that when one of the variables increases, the other decreases. A value close to zero means that there is no linear relationship between the variables. Therefore, correlation shows the extent to which two variables share a variation. To calculate the percentage of joint variation, the coefficient of determination is used, which is the square of the linear correlation coefficient. Correlations measure how variables or rank orders are related. The results of the correlations are shown in Table 15.
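The linear correlation coefficient and the coefficient of determination described above can be sketched as follows. This is a minimal illustration using only the Python standard library; the function name and the two data series are hypothetical and do not come from the study sample.

```python
import math


def pearson_r(x, y):
    """Pearson linear correlation coefficient between two equal-length series."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-products and sums of squares around the means.
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cross / math.sqrt(ss_x * ss_y)


# Hypothetical values for five hotel profiles, invented for illustration.
tweets_per_profile = [100, 200, 300, 400, 500]
negative_tweets = [12, 18, 35, 38, 55]

r = pearson_r(tweets_per_profile, negative_tweets)
r_squared = r ** 2  # coefficient of determination: share of joint variation
```

Because the coefficient of determination is the square of the coefficient, it always lies between 0 and 1 and loses the sign of the relationship, so both values are usually reported together.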
High correlation between five variables can be seen: Number of Environment Tweets, Number of Followers, Negative Environment Tweets, Neutral Environment Tweets and Positive Environment Tweets, together with high significance (**) of the pairwise correlation coefficients. Each coefficient can range between −1 (a perfect negative relationship) and +1 (a perfect positive relationship). A value of 0 indicates that there is no linear relationship.
The significance test was one-tailed (unilateral probability), since the direction of association was taken to run from Number of Environment Tweets towards the type of Tweets and from Number of Followers towards Number of Tweets.
Correlations with significant correlation coefficients (p ≤ 0.05) are indicated with a single asterisk and those with (p ≤ 0.01) are identified by two asterisks.
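The asterisk convention can be written as a small helper. This is only a sketch of the labeling rule stated above; the function name is hypothetical.

```python
def significance_marker(p_value):
    """Return '**' for p <= 0.01, '*' for p <= 0.05, and '' otherwise."""
    if not 0.0 <= p_value <= 1.0:
        raise ValueError("a p-value must lie between 0 and 1")
    if p_value <= 0.01:
        return "**"
    if p_value <= 0.05:
        return "*"
    return ""


# Example: format a coefficient together with its marker.
label = f"{0.780:.3f}{significance_marker(0.001)}"
```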
The results show that there are significant correlations (p ≤ 0.01) between Number of Tweets and Negative Environment Tweets, Neutral Environment Tweets and Positive Environment Tweets. In all three cases the correlation is positive and close to +1.
Also, there is significant correlation (p ≤ 0.01) between Number of Followers and Number of Environment Tweets. In addition, we can conclude that the higher the number of tweets in a hotel profile, the higher the number of negative tweets (Pearson correlation = 0.648, p ≤ 0.01). However, the relationship is even more significant if we take into account that the higher the number of tweets made for the hotel, the higher the number of neutral comments (Pearson correlation = 0.780, p ≤ 0.001) and negative comments (Pearson correlation = 0.688; p ≤ 0.001).
The coefficient of determination (R²) shows that 47% of the variation in the number of tweets is shared with the number of negative tweets. The highest value of R², 60.8%, is obtained for the number of neutral environmental tweets.
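As a check on the arithmetic, squaring the Pearson coefficients reported in the text reproduces these percentages: 0.780 gives about 60.8% and 0.688 gives about 47%. The short sketch below only restates that calculation; the dictionary keys are labels chosen here for illustration.

```python
# Pearson coefficients as reported in the text (labels are illustrative).
reported_r = {"neutral": 0.780, "negative": 0.688}

# Coefficient of determination, expressed as a percentage of joint variation.
shared_variation_pct = {
    name: round(100 * r ** 2, 1) for name, r in reported_r.items()
}
```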
In addition, a descriptive statistical analysis was carried out using a difference of means test. It is quite frequent in market research to ask whether the differences in behavior detected in two population subsamples give sufficient evidence for differences in the populations or whether, on the contrary, they are only a product of sampling error. The mean difference test, or Student’s t test, is used to determine whether the averages of the two independent subsamples differ to a statistically significant degree.
The Levene Test must be performed to check whether the variances are different or not. A predictive model is said to present homoscedasticity when the variance of the error of the endogenous variable is maintained throughout the observations. Therefore, this condition is checked before carrying out the hypothesis test that leads to the t test. To do this, we divided the sample into two groups: 5-star hotels and 4.5-star hotels. As can be seen in Table 16, the difference in means was not significant, except in negative comments (p ≤ 0.05), where there are differences and where there are more negative comments in hotels with fewer stars.
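The two steps described above, a check on equality of variances followed by a comparison of means, can be sketched with the standard library. This is an illustrative sketch, not the procedure run in the statistical package: it assumes the mean-centered form of Levene's statistic and the pooled-variance form of Student's t, it returns only the test statistics (p-values would require the F and t distributions from a statistics package), and the function names and both groups of counts are hypothetical.

```python
import math
from statistics import mean, variance


def levene_w(*groups):
    """Levene's W statistic (mean-centered) for equality of variances."""
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    # Absolute deviations of each observation from its own group mean.
    deviations = [[abs(v - mean(g)) for v in g] for g in groups]
    group_means = [mean(d) for d in deviations]
    grand_mean = sum(sum(d) for d in deviations) / n_total
    between = sum(
        len(d) * (m - grand_mean) ** 2 for d, m in zip(deviations, group_means)
    )
    within = sum(
        (z - m) ** 2 for d, m in zip(deviations, group_means) for z in d
    )
    if within == 0:
        raise ValueError("W is undefined when all deviations are identical")
    return (n_total - k) / (k - 1) * between / within


def pooled_t(a, b):
    """Student's t statistic for two independent samples, equal variances."""
    n_a, n_b = len(a), len(b)
    pooled_var = ((n_a - 1) * variance(a) + (n_b - 1) * variance(b)) / (
        n_a + n_b - 2
    )
    return (mean(a) - mean(b)) / math.sqrt(pooled_var * (1 / n_a + 1 / n_b))


# Hypothetical negative-comment counts per hotel, invented for illustration.
five_star = [3, 5, 4, 6, 2]
four_and_half_star = [8, 6, 9, 7, 10]

w_stat = levene_w(five_star, four_and_half_star)
t_stat = pooled_t(five_star, four_and_half_star)
```

In this invented example both groups have the same spread, so W is zero and the pooled-variance t statistic is appropriate. When Levene's test rejects equal variances, a version of the t test that does not pool the variances is normally used instead.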

6. Discussion

As the work of Honeycutt and Herring [50] indicates, the opinions and reviews of users can be analyzed from a consumer satisfaction perspective since concerns and opinions are identified regarding the purchase of a product, or use of a service [31,41]. Furthermore, Agarwal et al. [35] indicate that an exploratory study can be carried out to link these concepts with any particular research theme [37]. In this way, this exploratory study links the main environment factors for the users who have visited hotels that appear in the Traveler’s Ranking 2017.
The work of Kwon [17] shows Sentiment Analysis as a research process that allows for correct classification of users’ opinions and reviews on social networks and 2.0 platforms. Social networks have given companies new ways to receive the impressions and expectations of their customers. The works of Rosa, Batista and Carvalho [42] and Halog [52] show the need to understand these new phenomena and above all to know how to identify problems and key factors.
To do this in our exploratory study, key factors were identified that could help companies in the tourism sector to improve the quality of their services and offer better products [20] based on the improvement of environment factors identified in the investigation [7,13]. As stated, after conducting the Sentiment Analysis, a total of n = 14,459 tweets were identified for the sample of hotel users. These tweets were classified as negative, neutral and positive with an algorithm developed in Python and based on machine learning by MonkeyLearn API [7,26]. As we have already verified from the analysis of the research results by using a qualitative approach on the analyzed environment indicators, Twitter has been shown to be a valid platform for research on the environment.
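The grouping step that follows classification can be sketched as below. The sketch assumes each tweet already carries a sentiment label from the classifier; it does not reproduce the MonkeyLearn API or the authors' algorithm, and the function name and sample tweets are hypothetical. The node names follow the N1 (negative), N2 (neutral) and N3 (positive) scheme used in this study.

```python
from collections import defaultdict

# Mapping from sentiment label to the node names used in this study.
NODE_BY_SENTIMENT = {"negative": "N1", "neutral": "N2", "positive": "N3"}


def group_by_node(labeled_tweets):
    """Group (text, sentiment) pairs into lists of texts keyed by node."""
    nodes = defaultdict(list)
    for text, sentiment in labeled_tweets:
        nodes[NODE_BY_SENTIMENT[sentiment]].append(text)
    return dict(nodes)


# Hypothetical labeled tweets, invented for illustration.
labeled = [
    ("Rubbish left uncollected near the hotel", "negative"),
    ("Visited the local market this morning", "neutral"),
    ("Wonderful clean air in the mountains", "positive"),
    ("Loved the traditional pastry workshop", "positive"),
]
nodes = group_by_node(labeled)
node_sizes = {node: len(texts) for node, texts in nodes.items()}
```

Each resulting list can then be passed to the textual analysis separately, which is what allows factors to be reported per sentiment.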
This exploratory study shows that although some Twitter users make complaints or take advantage of offers and discounts, many users comment on the environmental factors which determined their expectations and experiences at hotels.
Therefore, this exploratory study demonstrates that it is possible to determine the relationships that hotel users in Spain have with the key environment factors that can be improved or attended to by hotels such as those related to activities carried out by the hotel and the impact of these on the environment, on air quality or on the pollution generated by these activities. This can be seen because the hotel users show interest in these factors in their tweets and so they consequently partially determine the satisfaction and experience that the visitors have when they stay at the hotels.
In addition, the quantitative analysis finds that a greater number of followers on Twitter does not mean that these followers are positive in their comments. In fact, the exploratory study shows that negative comments increase when hotels have more followers. This is because users who experience negative environmental factors usually follow the hotel’s Twitter profile to make complaints or indicate dissatisfaction with the services provided. Also, as a result of the quantitative analysis we saw that the number of followers on Twitter does not necessarily increase the number of positive comments of the hotel users in the Spanish hotel sector.
We also found that if the number of followers is greater, the neutral comments also increase, which means that hotel users use Twitter as a source of information for activities, updates and monitoring of hotel communications, but not necessarily to express satisfaction with the hotel. Twitter has therefore consolidated itself as a social network that can be the object of exploratory study related to the environment. It can also help hotel managers to improve their services by identifying the environment factors which travelers have shown interest in during their stay.

7. Conclusions

Over the last few decades, the use of new technologies has led to changes in sectors related to environment, culture, sustainability and global warming [1,5]. The development of new technologies and especially social networks has allowed the tourism sector to incorporate important changes when assessing actions related to the environment and social responsibility policies [6,7]. For this investigation, the hotels which won the Traveler’s Choice Award, which is based on over 500 million travelers’ reviews on TripAdvisor, were used. These hotels have achieved ratings of between 4.5 and 5.0, with 5 being the highest valuation from people who used their services.
The findings of this exploratory study were analyzed to find the environment factors which could concern hotel users, with the negative, neutral and positive environment factors being classified as N1, N2 and N3. The effect that these environment factors have on the users who visit the hotels was also studied (N4) [7]. H1 is confirmed because after the textual analysis, it was seen that factors related to the environment influence the experience that users have in hotels, and these users refer to different factors based on their feelings with opinions and reviews of the hotels. H2 is accepted, as shown by the results of the exploratory study, because the N1, N2 and N3 structures have been developed and the main factors that affect hotel users have been structured and classified into three types of feelings: negative, neutral and positive. Likewise, H2 has been accepted since the results of the exploratory study show that there are connections between the reviews that users leave about their experiences in hotels on Twitter with the identification of factors related to the environment.
The contribution of this exploratory study is the identification of the key environmental concepts that can help the hotel industry to make improvements in the activities and actions related to the environment, significantly improving the perception that users have of topics related to ecology, traditional commerce, ecological products or air quality, among other factors. Hotels can use the results of this exploratory study to highlight the importance that the factors identified by users have for the hotel sector and act upon them. Therefore, the results of this exploratory study are relevant for hotel management because they can be used to add value to the strategies for improvement of tourist activities so that they respect the environment. Examples include hotel activities that could affect garbage collection or atmospheric pollution in the hotel’s surroundings. Also, improvements can be made in the hotel activities to support and respect local traditions, improve air quality, support traditional customs and ecology and respect and maintain activities that favor the natural environment of the geographical area in which the hotel is located.
This exploratory study shows positive and negative concerns about the quality of services when users travel and use the services and activities organized by the hotels. This shows that travelers really do care about the environment and the consequences of their trips. Spanish hotels can also use the results of this exploratory study to develop new strategies for corporate social responsibility and to correctly identify which environmental factors concern users so that hotel activities can be adapted to incorporate actions that support or disseminate local traditions and products, maintain the local environment and wildlife of rural areas, or respect air quality in the areas where hotels have facilities and offer services. With the results of this exploratory study Spanish hotels can increase user satisfaction by taking into account the environmental factors that were identified in this study.
The limitations of this study are related to the number of subjects that make up the sample, the time frame in which the reviews were made and the qualitative analysis carried out to determine the topics of interest, as well as the Occurrence and Reliability percentages of the results.
Further studies could be made into the environment factors detected by hotel users and moderating variables such as age or sex, as well as the geographical areas in which the hotels are located. Further work could also examine how the users’ feelings about these factors influence their satisfaction with the hotel and its services, along with the real impact they have on the environment.

Author Contributions

Jose Ramon Saura, Pedro Palos-Sanchez and Miguel Angel Rios Martin conceived and designed the review; Jose Ramon Saura performed the methodology; Pedro Palos-Sanchez and Miguel Angel Rios Martin analyzed the results; Jose Ramon Saura, Pedro Palos-Sanchez and Miguel Angel Rios Martin wrote the paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Winning traveler’s choice hotels from TripAdvisor sample in 2017.
| Traveller’s Choice from TripAdvisor 2017 | Location | TripAdvisor Stars | Customer Reviews | Twitter Profile | Tweets / Followers |
|---|---|---|---|---|---|
| Hotel The Serras | Barcelona, Spain | 5.0 | 843 | @TheSerrasHotel | 682727 |
| Vincci Selección Aleysa Hotel | Benalmádena, Spain | 5.0 | 842 | @Vincci_Hoteles | 11.9 k / 19.6 K |
| Casa Camper Hotel Barcelona | Barcelona, Spain | 5.0 | 1924 | @casacamper | 1042175 |
| Hotel Orfila | Madrid, Spain | 5.0 | 111 | @HotelOrfila | 726356 |
| Hotel Abadia Retuerta Le Domaine | Sardón de Duero, Spain | 5.0 | 449 | @arledomaine | 26421.365 |
| Gran Hotel Son Net | Puigpunyent, Spain | 4.5 | 598 | @GranHotelSonNet | 12603286 |
| Hotel Maria Cristina | San Sebastián-Donostia, Spain | 4.5 | 1794 | @hotelmariacrist | 15922218 |
| Only YOU Boutique Hotel Madrid | Madrid, Spain | 5.0 | 1975 | @OnlyYOUHotels | 45123877 |
| Alma Pamplona Muga de Beloso | Pamplona, Spain | 5.0 | 1187 | @almahotels | 6886 |
| La Bobadilla, a Royal Hideaway Hotel | Loja, Spain | 5.0 | 774 | @barcelohoteles | 11.2 k / 179 K |
| Seaside Grand Hotel Residencia | Maspalomas, Spain | 5.0 | 653 | @GrandHotelPunta | 753858 |
| Hotel Hacienda de Abajo | Tazacorte, Spain | 5.0 | 259 | @HaciendadeAbajo | 585712 |
| Hotel Olivia Balmés | Barcelona, Spain | 4.5 | 2304 | @Olivia_Balmes | 5487241 |
| Riviera Beachotel | Benidorm, Spain | 4.5 | 2314 | @benidormhoteles | 29121859 |
| Sant Francesc Hotel Singular | Palma de Mallorca, Spain | 4.5 | 476 | @hotelstfrancesc | 14901409 |
| El Palace Hotel | Barcelona, Spain | 4.5 | 1776 | @ElPalaceHotel | 21522512 |
| Gran Hotel La Perla | Pamplona, Spain | 5.0 | 336 | @Ghotellaperla | 64403020 |
| Barceló Emperatriz | Madrid, Spain | 4.5 | 568 | @barcelohoteles | 11.2 k / 179 K |
| Hotel Astoria Playa Only Adults | Port d’Alcudia, Spain | 4.5 | 2427 | @astoriaplayasup | 254139 |
| Catalonia Square | Barcelona, Spain | 4.5 | 932 | @CataloniaHotels | 22.9 k / 13.6 K |
| H10 Cubik | Barcelona, Spain | 4.5 | 822 | @h10hotels | 10.9 k / 18.4 K |
| Gold By Marina | Playa del Inglés, Spain | 4.5 | 1596 | @GoldByMarina | 1.594381 |
| Hotel Spa Relais & Chateaux Auga | Santiago de Compostela, Spain | 4.5 | 819 | @aquintadaauga | 42921996 |
| Alma Barcelona | Barcelona, Spain | 4.5 | 1876 | @almahotels | 6886 |
| Hotel Las Madrigueras Golf Resort and Spa | Playa de las Américas, Spain | 5.0 | 305 | - | - / - |

References

  1. Ramirez-Andreotta, M.; Brody, J.; Lothrop, N.; Loh, M.; Beamer, P.; Brown, P. Improving Environmental Health Literacy and Justice through Environmental Exposure Results Communication. Int. J. Environ. Res. Public Health 2016, 13, 690.
  2. Chisholm, E.; O’Sullivan, K. Using Twitter to Explore (un)Healthy Housing: Learning from the #Characterbuildings Campaign in New Zealand. Int. J. Environ. Res. Public Health 2017, 14, 1424.
  3. Breville, M. US Environmental Protection Agency Tribal Environmental Health Research Program. Epidemiology 2011, 22.
  4. Brown, P. Popular Epidemiology, Toxic Wastes, and Social Movements. In Medicine, Health and Risk: Sociological Perspectives; Jonathan, G., Ed.; Blackwell: Oxford, UK, 1995; pp. 91–112.
  5. Brody, J.G.; Morello-Frosch, R.; Brown, P.; Rudel, R.A.; Altman, R.G.; Frye, M.; Osimo, C.A.; Perez, C.; Seryak, L.M. Improving disclosure and consent: “Is It safe?”: New ethics for reporting personal exposures to environmental chemicals. Am. J. Public Health 2007, 97, 1547–1554.
  6. Hung, S.-C.; Kuo, T.-T.; Lin, S.-D. Novel topic diffusion prediction using latent semantic and user behavior. In Proceedings of the ASE BigData & SocialInformatics, Kaohsiung, Taiwan, 7–9 October 2015; ACM: New York, NY, USA, 2015; p. 39.
  7. Bifet, A.; Frank, E. Sentiment knowledge discovery in Twitter streaming data. In Proceedings of the International Conference on Discovery Science, Canberra, Australia, 6–8 October 2010.
  8. Järvinen, J.; Karjaluoto, H. The use of Web analytics for digital marketing performance measurement. Ind. Mark. Manag. 2015, 50, 117–127.
  9. Kuo, T.-T.; Hung, S.-C.; Lin, W.-S.; Peng, N.; Lin, S.-D.; Lin, W.-F. Exploiting latent information to predict diffusions of novel topics on social networks. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers-Volume 2, Association for Computational Linguistics, Jeju Island, Korea, 8–14 July 2012; pp. 344–348.
  10. Jayaram, D.; Manrai, A.K.; Manrai, L.A. Effective use of marketing technology in Eastern Europe: Web analytics, social media, customer analytics, digital campaigns and mobile applications. J. Econ. Financ. Adm. Sci. 2015, 20, 118–132.
  11. Roshan, M.; Warren, M.; Carr, R. Understanding the use of social media by organisations for crisis communication. J. Comput. Hum. Behav. 2016, 63, 350–361.
  12. Moreno, J.; Tejeda, A.; Porcel, C.; Fujita, H.; Viedma, E. A system to enrich marketing customers acquisition and retention campaigns using social media information. J. Serv. Res. 2015, 80, 163–179.
  13. Honeycutt, C.; Herring, S.C. Beyond microblogging: Conversation and collaboration via Twitter. In Proceedings of the 42nd Hawaii International Conference on System Sciences, Hawaii, HI, USA, 5–8 January 2009; pp. 1–10.
  14. Järvinen, J.; Töllinen, A.; Karjaluoto, H.; Jayawardhena, C. Digital and social media marketing usage in B2B industrial section. Mark. Manag. J. 2012, 22.
  15. Vásquez, G.A.; Escamilla, E.M. Best Practice in the Use of Social Networks Marketing Strategy as in SMEs. Procedia Soc. Behav. Sci. 2014, 148, 533–542.
  16. Wang, R.; Kim, J.; Xiao, A.; Jung, Y.J. Networked narratives on Humans of New York: A content analysis of social media engagement on Facebook. Comput. Hum. Behav. 2017, 66, 149–153.
  17. Kwon, S. Gerontechnology: Research, Practice, and Principles in the Field of Technology and Aging; Springer Publishing Company, LLC: New York, NY, USA, 2017.
  18. Palos-Sanchez, P.R.; Saura, J.R.; Debasa, F. The Influence of Social Networks on the Development of Recruitment Actions that Favor User Interface Design and Conversions in Mobile Applications Powered by Linked Data. Mob. Inf. Syst. 2018, 1–11.
  19. Hussain, Z.; Singh, J. A Study of Consumer Attitudes and Behaviour towards Sustainability in Bradford, UK: An Economical and Environmentally Sustainable Opportunity. Corp. Sustain. CSR Sustain. Eth. Gov. 2013, 115–156.
  20. Suanmali, S. Factors Affecting Tourist Satisfaction: An Empirical Study in the Northern Part of Thailand. SHS Web Conf. 2014, 12, 01027.
  21. Kaltenborn, B.P.; Nyahongo, J.W.; Kideghesho, J.R. The Attitudes of Tourists towards the Environmental, Social and Managerial Attributes of Serengeti National Park, Tanzania. Trop. Conserv. Sci. 2011, 4, 132–148.
  22. Bruyere, B.L.; Beh, A.W.; Lelengula, G. Differences in Perceptions of Communication, Tourism Benefits, and Management Issues in a Protected Area of Rural Kenya. Environ. Manag. 2008, 43, 49–59.
  23. Eilam, E.; Trop, T. Environmental Attitudes and Environmental Behavior—Which Is the Horse and Which Is the Cart? Sustainability 2012, 4, 2210–2246.
  24. Buffa, F. Young Tourists and Sustainability. Profiles, Attitudes, and Implications for Destination Strategies. Sustainability 2015, 7, 14042–14062.
  25. Palos-Sanchez, P.R.; Correia, M.B. Perspectives of the Adoption of Cloud Computing in the Tourism Sector. In Handbook of Research on Technological Developments for Cultural Heritage and eTourism Applications; IGI Global: Hershey, Pennsylvania, 2018; pp. 377–400.
  26. Anjaria, M.; Guddet, R. Influence Factor Based Opinion Mining of Twitter Data Using Supervised Learning. In Proceedings of the 6th International Conference on Communication Systems and Networks (COMSNETS), Bangalore, India, 7–10 January 2014; pp. 1–8.
  27. Statistics and Tourism Satellite Account. (OMT). Available online: http://statistics.unwto.org/ (accessed on 9 December 2017).
  28. Estadística de Movimientos Turísticos en Fronteras (FRONTUR); INE (Instituto Nacional de Estadística de España): Madrid, Spain, 2016.
  29. Xu, M.; Allenby, B.; Kim, J.; Kahhat, R. A dynamic agent-based analysis for the environmental impacts of conventional and novel book retailing. Environ. Sci. Technol. 2009, 43, 2851–2857.
  30. Sexton, K.; Needham, L.; Pirkle, J. Human biomonitoring of environmental chemicals: Measuring chemicals in human tissues is the “gold standard” for assessing exposure to pollution. Am. Sci. 2004, 92, 38–41.
  31. Radhi, H.; Sharples, S. Global warming implications of facade parameters: A life cycle assessment of residential buildings in Bahrain. Environ. Impact Assess. Rev. 2013, 38, 99–108.
  32. MonkeyLearn. API Reference. Available online: https://monkeylearn.com/docs/article/api-reference/ (accessed on 8 November 2017).
  33. Neethu, M.; Rajasree, R. Sentiment Analysis in Twitter Using Machine Learning Techniques. In Proceedings of the 4th International Conference on Computing Communications and Networking Technologies (ICCCNT), Tiruchengode, India, 4–6 July 2013; pp. 1–5.
  34. Palos-Sanchez, P.; Saura, J. The Effect of Internet Searches on Afforestation: The Case of a Green Search Engine. Forests 2018, 9, 51.
  35. Peters, K.; Chen, Y.; Kaplan, A.M.; Ognibeni, B.; Pauwels, K. Social Media Metrics—A Framework and Guidelines for Managing Social Media. J. Interact. Mark. 2013, 27, 281–298.
  36. Pitt, L.; Watson, R. The World Wide Web as an advertising medium. J. Advert. Res. 1999.
  37. Macintyre, M.; Mee, W.; Solomon, F. Evaluating social performance in the context of an ‘audit culture’: A pilot social review of a gold mine in Papua New Guinea. Corp. Soc. Responsib. Environ. Manag. 2008, 15, 100–110.
  38. Saura, J.R.; Palos-Sánchez, P.; Suárez, L.M. Understanding the Digital Marketing Environment with KPIs and Web Analytics. Future Internet 2017, 9, 76.
  39. Go, A.; Bhayani, R.; Huang, L. Twitter Sentiment Classification Using Distant Supervision; CS224N Project Report Stanford; Stanford University: Stanford, CA, USA, 2009.
  40. Pak, A.; Paroubek, P. Twitter as a corpus for sentiment analysis and opinion mining. In Proceedings of the LREC, Valletta, Malta, 17–23 May 2010.
  41. Rodríguez-Herráez, B.; Pérez Bustamante, D.; Saura, L. Information classification on social networks. Content analysis of e-commerce companies on twitter. Espacios 2017, 38, 17–35.
  42. Sexton, K. Cumulative Risk Assessment: An Overview of Methodological Approaches for Evaluating Combined Health Effects from Exposure to Multiple Environmental Stressors. Int. J. Environ. Res. Public Health 2012, 9, 370–390.
  43. Rong, W.; Peng, B.; Ouyang, Y.; Li, C.; Xiong, Z. Semi-supervised Dual Recurrent Neural Network for Sentiment Analysis. In Proceedings of the IEEE 11th International Conference on Autonomic and Secure Computing (DASC), Chengdu, China, 21–22 December 2013; pp. 438–445.
  44. Turney, P.D.; Pantel, P. From frequency to meaning: Vector space models of semantics. J. Artif. Intell. Res. 2010, 37, 141–188.
  45. Wittgenstein, L. Philosophical Investigations; Blackwell: London, UK, 1953.
  46. Takamura, H.; Inui, T.; Okumura, M. Extracting semantic orientations of words using spin model. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics—ACL 05, Michigan, MI, USA, 25–30 June 2005.
  47. Geurs, K.; Wee, V.B. Backcasting as a tool to develop a sustainable transport scenario assuming emission reductions of 80–90%. Innovation 2000, 13, 47–62.
  48. Rosa, H.; Batista, F.; Carvalho, J.P. Twitter topic fuzzy fingerprints. In Proceedings of the IEEE World Congress on Computational Intelligence WCCI 2014, International Conference on Fuzzy System, FUZZ-IEEE, Beijing, China, 6–11 July 2014; pp. 776–783.
  49. Welling, R.; White, L. Web site performance measurement: Promise and reality. Manag. Serv. Qual. 2006, 16, 654–670.
  50. Bourne, M.; Neely, A.; Platts, K.; Mills, J. The success and failure of performance measurement initiatives: Perceptions of participating managers. Int. J. Oper. Prod. Manag. 2002, 22, 1288–1310.
  51. Boyd, D.; Golder, S.; Lotan, G. Tweet, tweet, retweet: Conversational aspects of retweeting on twitter. In Proceedings of the IEEE 43rd Hawaii International Conference on Social Systems (HICSS), Kauai, HI, USA, 5–8 January 2010.
  52. Saito, K.; Nakano, R.; Kimura, M. Prediction of information diffusion probabilities for independent cascade model. In Knowledge-Based Intelligent Information and Engineering Systems; Springer: Berlin/Heidelberg, Germany, 2008; pp. 67–75.
  53. Jiang, B.; Liang, J.; Sha, Y.; Li, R.; Liu, W.; Ma, H.; Wang, L. Retweeting behavior prediction based on one-class collaborative filtering in social networks. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval, ACM, Tuscany, Italy, 17–21 July 2016; pp. 977–980.
  54. Kouloumpis, E.; Wilson, T.; Moore, J. Twitter sentiment analysis: The good the bad and the omg! In Proceedings of the ICWSM, Barcelona, Spain, 17–21 July 2011.
  55. Fei, H.; Jiang, R.; Yang, Y.; Luo, B.; Huan, J. Content based social behavior prediction: A multi-task learning approach. In Proceedings of the 20th ACM International Conference on Information and Knowledge Management, ACM, Scotland, UK, 24–28 October 2011; pp. 995–1000.
  56. Holmberg, J.; Lundqvist, U.; Robert, K.; Wackernagel, M. The ecological footprint from a systems perspective of sustainability. Int. J. Sustain. Dev. World Ecol. 1999, 6, 17–33.
  57. Kuisma, J. Backcasting for Sustainable Strategies in the Energy Sector; IIIEE Report; Lund University: Lund, Sweden, 2000; Volume 18.
  58. Halog, A. Models for evaluating energy, environmental and sustainability performance of biofuels value chain. Int. J. Glob. Energy Issues 2009, 32, 87–101.
  59. Jenkins, H.; Yakovleva, N. Corporate social responsibility in the mining industry: Exploring trends in social and environmental disclosure. J. Clean. Prod. 2006, 14, 271–284.
  60. Heijungs, R.; Huppes, G.; Guinee, J. Life cycle assessment and sustainability analysis of products, materials and technologies: Toward a scientific framework for sustainability life cycle analysis. Polym. Degrad. Stab. 2010, 95, 422–428.
  61. Das, S.; Chen, M. Yahoo! for Amazon: Extracting market sentiment from stock message boards. In Proceedings of the 8th Asia Pacific Finance Association Annual Conference (APFA), Bangkok, Thailand, 22–25 July 2001; Notes on CG and LM-BFGS optimization of logistic regression. Available online: http://hal3.name/megam/ (accessed on 7 January 2018).
  62. Debes, V.; Sandeep, K.; Vinnett, G. Predicting information diffusion probabilities in social networks: A Bayesian networks based approach. J. Knowl. Based Syst. 2017, 133, 66–76.
  63. Sarwar, M.T.; Fountas, G.; Anastasopoulos, P.C. Simultaneous estimation of discrete outcome and continuous dependent variable equations: A bivariate random effects modeling approach with unrestricted instruments. Anal. Methods Accid. Res. 2017, 16, 23–34.
  64. Wilson, T.; Wiebe, J.; Hoffmann, P. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of the Conference on Human Language Technology and Empirical Methods in Natural Language Processing, Vancouver, BC, Canada, 6–8 October 2005.
  65. Rosa, H.; Carvalho, J.P.; Astudillo, R.; Batista, F. Detecting user influence in twitter: Pagerank vs. katz, a case study. In Proceedings of the Seventh European Symposium on Computational Intelligence and Mathematics, Cádiz, Spain, 7–10 October 2015.
  66. Pactwa, K.; Woźniak, J. Environmental reporting policy of the mining industry leaders in Poland. Res. Policy 2017, 53, 201–207.
  67. Sakaki, T.; Okazaki, M.; Matsuo, Y. Earthquake shakes twitter users: Realtime event detection by social sensors. In Proceedings of the 19th International Conference on World Wide Web, WWW’10, Raleigh, NC, USA, 26–30 April 2010; ACM: New York, NY, USA, 2010; pp. 851–860.
  68. Azoulay, A.; Garzon, P.; Eisenberg, M.J. Comparison of the Mineral Content of Tap Water and Bottled Waters. J. Gen. Intern. Med. 2001, 16, 168–175.
  69. Loh, M.L.; Sugeng, A.; Lothrop, N.; Klimecki, W.; Cox, M.; Wilkinson, S.T.; Lu, Z.; Beamer, P. Multimedia exposures to arsenic and lead for children near an inactive mine tailings and smelter site. Environ. Res. 2016, 146, 331–339.
  70. Falk, J.H.; Storksdieck, M.; Dierking, L.D. Investigating public science interest and understanding: Evidence for the importance of free-choice learning. Public Underst. Sci. 2007, 16, 455–469.
  71. Halog, A.; Chan, A. Developing a dynamic systems model for sustainable development of the Canadian oil sands industry. Int. J. Environ. Technol. Manag. 2008, 8, 3–22.
  72. Chen, Y.; Conroy, N.J.; Rubin, V.L. News in an online world: The need for an automatic crap detector. In Proceedings of the 78th ASIS & T Annual Meeting: Information Science with Impact: Research in and for the Community (ASIST’15), St. Louis, MO, USA, 6–10 November 2015; American Society for Information Science: Silver Springs, MD, USA, 2015; p. 4.
  73. Brody, J.G.; Dunagan, S.C.; Morello-Frosch, R.; Brown, P.; Patton, S.; Rudel, R.A. Reporting individual results for biomonitoring and environmental exposures: Lessons learned from environmental communication case studies. Environ. Health. 2014, 13, 40.
  74. Culotta, A. Towards detecting influenza epidemics by analyzing twitter messages. In Proceedings of the First Workshop on Social Media Analytics, Washington DC, USA, 25–28 July 2010; ACM: New York, NY, USA, 2010; pp. 115–122.
  75. Scheffran, J.; BenDor, T. Bioenergy and land use: A spatial-agent dynamic model of energy crop production in Illinois. Int. J. Environ. Pollut. 2009, 39, 4–27.
  76. Kim, J.; Xu, M.; Kahhat, R.; Allenby, B.; Williams, E. Designing and assessing a sustainable networked delivery (SND) system: Hybrid business-to-consumer book delivery case study. Environ. Sci. Technol. 2009, 43, 181–187.
  77. Chaffey, D.; Patron, M. From web analytics to digital marketing optimization: Increasing the commercial value of digital analytics. J. Direct Data Dig. Mark. Pract. 2012, 14, 30–45.
  78. QSR International Pty Ltd. NVIVO: Reference Guide; QSR International Pty Ltd.: Doncaster, Australia, 2000. [Google Scholar]
Figure 1. Classification of tweets according to feelings about the environment.
Figure 2. Relationship of Nodes and number of tweets for environment factors identification.
Table 1. Tourism factors and their impact on the environment.
| Factors | Environmental Impact |
| --- | --- |
| Accumulation of public | Stress for the environment and animals |
| Extreme sports | Disturb the fauna |
| Animal feeding | Changes in wildlife behavior |
| Diving and snorkeling | Damage to sea beds |
| Camping/Picnics | Erosion of the soil. Damage to vegetation. Noise. Rubbish |
| Hunting and fishing | Reduction of species |
| Off-road driving | Destruction of soil and vegetation |
| Noise emission | Disturb the animals |
| Climbing, hiking | Damage to vegetation |
| Cars | Running over animals, pollution, noise |
| “Souvenir” collection | Interruption of natural processes |
| Collecting wood | Deforestation, destruction of habitats |
| Throwing garbage | Deterioration of space and danger to local animal and human health |
| Discharge of waste not suitable for water | Water pollution, acidity |
| Construction of facilities | Loss and division of habitats |
| Construction of electricity pylons | Impact on birds in flight |
| Tourism infrastructure | Visual impact on fauna, vegetation and aquatic habitat |
Table 2. International tourists depending on type of accommodation.
| Type of Accommodation | Monthly Data (Absolute Value) | Yearly Variation | Accumulated Data (Absolute Value) | Yearly Variation |
| --- | --- | --- | --- | --- |
| TOTAL | 3,979,713 | 13.3 | 75,563,198 | 10.3 |
| Total paid accommodation (3) | 2,841,464 | 15.1 | 59,419,138 | 11.7 |
| - Hotel accommodation | 2,299,988 | 11.2 | 47,726,623 | 11.2 |
| - Rented housing | 397,010 | 49.2 | 8,278,525 | 8.0 |
| - Other paid accommodation | 144,467 | 6.4 | 3,413,990 | 32.1 |
| Unpaid accommodation | 1,138,248 | 9.1 | 16,144,060 | 5.3 |
| - Self-owned housing | 320,539 | 42.5 | 5,039,040 | 15.7 |
| - Relatives or friends house | 704,622 | 0.3 | 9,550,942 | 9.4 |
| - Other unpaid accommodation | 113,087 | −2.4 | 1,554,079 | −31.1 |
Table 3. Arrival of international tourists according to access routes.
| Access Routes | Monthly Data (Absolute Value) | Yearly Variation | Accumulated Data (Absolute Value) | Yearly Variation |
| --- | --- | --- | --- | --- |
| TOTAL | 3,979,713 | 13.3 | 75,563,198 | 10.3 |
| Airport | 3,197,756 | 16.6 | 60,582,406 | 11.7 |
| Highway | 696,586 | 0.1 | 13,038,391 | 4.4 |
| Train | 23,038 | 41.3 | 364,115 | 6.2 |
| Port | 62,332 | 8.8 | 1,578,287 | 9.6 |
Table 4. Characteristics of Sentiment Analysis in investigations.
Characteristics | References
Pak et al. [34] | Kuo et al. [9] | Honeycutt et al. [50] | Kouloumpis et al. [54] | Rodríguez-Herráez [41] | Boyd et al. [51] | This Research
Neuronal Connection--
Textual analysis----
Time------
Hashtags, URLs or mentions----
Topic----
Classification of information----
Table 5. Characteristics of textual analysis in investigations.
Characteristics | References
Kwon et al. [17] | Ramirez-Andreotta [18] | Rosa et al. [48] | Honeycutt et al. [50] | Boyd et al. [51] | Saito et al. [52] | Jiang et al. [53] | This Research
Classification into nodes
Categorization---
Word Count--
Key word-----
Table 6. Environmental Node classification with Nvivo software.
| Factor Identification | Nodes |
| --- | --- |
| Negative | Node 1 (N1) |
| Neutral | Node 2 (N2) |
| Positive | Node 3 (N3) |
Table 7. Environment data which was analyzed and average classification probability percentages for each Hotel.
Traveller’s Choice from TripAdvisor 2017TweetsNegativeNeutralPositiveAverage Probability
Hotel The Serras159190680.639
Vincci Selección Aleysa Hotel Boutique & Spa1665817348500.610
Casa Camper Hotel Barcelona52139120.555
Hotel Orfila4361419160.662
Hotel Abadia Retuerta Le Domaine399151802040.620
Gran Hotel Son Net213-161520.616
Hotel Maria Cristina211101001010.614
Only YOU Boutique Hotel Madrid756123833610.651
Alma Pamplona Muga de Beloso40116230.674
Seaside Grand Hotel Residencia128589387260.624
Hotel Hacienda de Abajo80-44370.666
Hotel Olivia Balmés35-2880.609
Riviera Beachotel36992251360.682
Sant Francesc Hotel Singular355-296600.598
El Palace Hotel373191731820.600
Gran Hotel La Perla1539866707840.607
Barceló Emperatriz1722599387260.624
Hotel Astoria Playa Only Adults93162300.596
Catalonia Square27623218599350.652
H10 Cubik1407812141850.711
Gold By Marina720193983040.622
Hotel Spa Relais945344374750.679
n = 14,459
Table 8. Results for N1 for environment factors identification.
| N1 | Count | Similar Factors | Weighted Percentage |
| --- | --- | --- | --- |
| Rubbish collection | 16 | rubbish, trash, waste | 0.06 |
| Atmospheric contamination | 15 | contamination, breathing, asthma | 0.06 |
| Pollution | 14 | dirty, dirt | 0.06 |
Table 9. Negative tweets examples.
Negative Tweets
1. Really disgusted by the unhygienic food that @barcelohoteles are serving at the #AllegroIsora really regret booking here for 8 days when staff won’t do anything about it.
2. @H10_Hotels we are at your resort in Punta Cana sick with food poisoning management does not care. Many guests are sick We will never return
3. @El_Felips paid for a upgraded room in Barcelo Lanzarote, so disappointed the room is tired, air con very poor, mould in bathroom and smelling very bad!! The installations and trips to locals places are also bad!
Table 10. Results for N2 for environment factors identification.
| N2 | Count | Similar Factors | Weighted Percentage |
| --- | --- | --- | --- |
| Local products | 250 | street market, markets, local produce | 0.40 |
| Handcraft | 71 | artisan, craftsmen, pottery | 0.11 |
| Traditional experience | 67 | shelters, seminars, monasteries | 0.11 |
| Looking after nature | 57 | rivers, villages, mountains, roads | 0.09 |
Table 11. Neutral tweets examples.
Neutral Tweets
1. RT @AnneSemonin We love to visit Hotel Sant Francesc for a spot of sunshine and a luxury Anne Semonin treatment.
2. RT @Stanatic_Ness: A beautiful street with so much of history and tradition #PalmaDeMallorca #Spain
3. Eating a local fish in #LaPalma @haciendadeabajo-medregal-tastes a bit like tuna, v tasty
Table 12. Results for N3 for environment factors identification.
| N3 | Count | Similar Factors | Weighted Percentage |
| --- | --- | --- | --- |
| Local Traditions | 67 | hostels, routes, local products, shows | 0.29 |
| Air Quality | 32 | clarity, pure, sight, breathing, asthma | 0.12 |
| Customs | 44 | dances, traditional production, pastry making, viticulture | 0.21 |
| Traditional Ecology | 12 | saltpans, orchards, eco-products | 0.10 |
| Nature | 11 | oasis, islands, mountain, rivers, trails | 0.09 |
Table 13. Positive tweets examples.
Positive Tweets
1. #AQuintadaAuga is a #hotel surrounded by #nature in #SantiagodeCompostela. It is a mandatory stop
2. RT @AmyWorsley85: Always looking for a #sustainable place to stay, like the @Olivia_Balmes hotel in #Barcelona which was great!
3. @barcelohoteles @Bobadilla5GL Congrats! More hotels can be sustainable using local biomass for energy. Please visit us at @conectabioener!
Table 14. Results of N4 for environment factors identification.
| Nodes | Environment Factors | Related Factors | Total Count |
| --- | --- | --- | --- |
| Positive Factors | Local Traditions | hostels, routes, local products, shows | 166 |
| | Air Quality | clarity, pure, sight, breathing, asthma | |
| | Customs | dances, traditional production, pastry making, viticulture | |
| | Traditional Ecology | saltpans, orchards, eco-products | |
| | Nature | oasis, islands, mountain, rivers, trails | |
| Neutral Factors | Local products | street market, markets, local produce | 445 |
| | Handcraft | artisan, craftsmen, pottery | |
| | Traditional experience | shelters, seminars, monasteries | |
| | Looking after nature | rivers, villages, mountains, roads | |
| Negative Factors | Garbage collection | rubbish, trash, waste | 45 |
| | Atmospheric pollution | contamination, breathing, asthma | |
| | Pollution | dirty, dirt | |
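The Total Count column of Table 14 can be cross-checked against the per-factor counts reported in Tables 8, 10 and 12. The following is a minimal sketch of that arithmetic; the dictionary layout and variable names are illustrative assumptions, and only the counts themselves come from the tables.

```python
# Per-factor counts copied from Tables 8 (N1), 10 (N2) and 12 (N3).
# The dictionary structure is an illustrative assumption, not part of the study.
factor_counts = {
    "Negative Factors": {
        "Rubbish collection": 16,
        "Atmospheric contamination": 15,
        "Pollution": 14,
    },
    "Neutral Factors": {
        "Local products": 250,
        "Handcraft": 71,
        "Traditional experience": 67,
        "Looking after nature": 57,
    },
    "Positive Factors": {
        "Local Traditions": 67,
        "Air Quality": 32,
        "Customs": 44,
        "Traditional Ecology": 12,
        "Nature": 11,
    },
}

# Sum the counts within each node to obtain the node totals.
node_totals = {node: sum(factors.values()) for node, factors in factor_counts.items()}

for node, total in node_totals.items():
    print(f"{node}: {total}")
```

Under these inputs the sums agree with the totals listed in Table 14 for the negative, neutral and positive nodes.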
Table 15. Bivariate linear correlation results for environmental factors.
| Correlation Results | Statistic | Stars | Number of Environment Comments | Number of Environment Tweets | Number of Followers |
| --- | --- | --- | --- | --- | --- |
| Number of Followers | Pearson Correlation | −0.220 | −0.183 | 0.588 ** | 1 |
| | Sig. (bilateral) | 0.326 | 0.415 | 0.006 | |
| | N | 22 | 22 | 20 | 22 |
| Negative Tweets | Pearson Correlation | 0.183 | −0.325 | 0.648 ** | 0.371 |
| | Sig. (bilateral) | 0.414 | 0.140 | 0.002 | 0.089 |
| | N | 22 | 22 | 20 | 22 |
| Neutral Tweets | Pearson Correlation | −0.152 | −0.339 | 0.780 ** | 0.339 |
| | Sig. (bilateral) | 0.499 | 0.123 | 0.000 | 0.122 |
| | N | 22 | 22 | 20 | 22 |
| Positive Tweets | Pearson Correlation | 0.068 | −0.303 | 0.688 ** | 0.381 |
| | Sig. (bilateral) | 0.764 | 0.171 | 0.001 | 0.081 |
| | N | 22 | 22 | 20 | 22 |
** Significance intensity.
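As a rough plausibility check on the starred correlations in Table 15, a Pearson coefficient r computed from N pairs can be converted to a Student's t statistic with N − 2 degrees of freedom, t = r·sqrt((N − 2)/(1 − r²)). The sketch below assumes this standard transform was the basis of the reported significance values, which the table does not state; the function name and the hard-coded critical value (the two-tailed 1% point for 18 degrees of freedom from standard t tables) are likewise assumptions for illustration.

```python
import math


def t_from_r(r: float, n: int) -> float:
    """Convert a Pearson correlation r from n pairs into a t statistic (n - 2 df)."""
    return r * math.sqrt((n - 2) / (1.0 - r * r))


# (row label, r, N) for the "Number of Environment Tweets" column of Table 15.
starred_rows = [
    ("Number of Followers", 0.588, 20),
    ("Negative Tweets", 0.648, 20),
    ("Neutral Tweets", 0.780, 20),
    ("Positive Tweets", 0.688, 20),
]

# Assumed reference value: two-tailed 1% critical t for 18 degrees of freedom.
T_CRIT_1PCT_DF18 = 2.878

for label, r, n in starred_rows:
    t = t_from_r(r, n)
    exceeds = abs(t) > T_CRIT_1PCT_DF18
    print(f"{label}: t = {t:.3f}, exceeds 1% critical value: {exceeds}")
```

All four starred coefficients exceed the assumed critical value, which is consistent with the bilateral significance values below 0.01 reported in the table.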
Table 16. Differences of means test results.
| Variable | F | Sig. | t | Difference of Means | Difference of Standard Error |
| --- | --- | --- | --- | --- | --- |
| No. of comments | 1.363 | 0.257 | −1.687 | −510.933 | 302.795 |
| Tweets | 0.066 | 0.800 | −0.414 | −725.000 | 1750.570 |
| Followers | 2.943 | 0.102 | −1.006 | −16,259.150 | 16,157.287 |
| Negative | 7.454 | 0.013 | 0.833 | 9.683 | 11.620 |
| Neutral | 1.483 | 0.237 | −0.689 | −139.617 | 202.547 |
| Positive | 0.932 | 0.346 | 0.305 | 41.933 | 137.628 |
F: F statistic; Sig.: significance level; t: Student's t statistic.
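The t column of Table 16 can be read as the difference of means divided by its standard error. The sketch below recomputes that ratio from the tabulated values as a consistency check; the variable names and the rounding tolerance are illustrative assumptions, and the check says nothing about how the original software computed the statistics.

```python
# (reported t, difference of means, standard error of the difference) from Table 16.
table_16 = {
    "No. of comments": (-1.687, -510.933, 302.795),
    "Tweets": (-0.414, -725.000, 1750.570),
    "Followers": (-1.006, -16259.150, 16157.287),
    "Negative": (0.833, 9.683, 11.620),
    "Neutral": (-0.689, -139.617, 202.547),
    "Positive": (0.305, 41.933, 137.628),
}

# Assumed tolerance to allow for rounding to three decimals in the table.
TOLERANCE = 0.001

recomputed = {}
for variable, (t_reported, mean_diff, std_err) in table_16.items():
    t_check = mean_diff / std_err  # t statistic as a simple ratio
    recomputed[variable] = t_check
    agrees = abs(t_check - t_reported) < TOLERANCE
    print(f"{variable}: reported t = {t_reported}, recomputed t = {t_check:.3f}, agrees: {agrees}")
```

With the values as tabulated, each recomputed ratio matches the reported t to within rounding.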
