1. Background of Research
Small and medium-sized enterprises have encountered greater difficulties than ever before under the outbreak of COVID-19 at the end of 2019 [
1]. Nowadays, people’s daily life has gradually recovered under the normal situation of epidemic prevention and control. However, the negative influences on economy have been seen continuing, especially for some small and medium-sized enterprises and individual businesses with weak pressure resistance [
2]. At present, small and medium-sized enterprises as well as individual businesses in different forms can found everywhere in China. They are like the “capillaries” in China’s economic development. While meeting the daily needs of residents, they contribute more than 80% of employment, more than 70% of scientific and technological innovation achievements, more than 60% of GDP and more than 50% of tax revenue to the country [
3]. Among these small and medium-sized enterprises, catering, accommodation, filmed entertainment, wholesale and retail, as well as transportation and tourism suffered the most concentrated and maximum effects [
4]. They rely on crowd density and mobility to obtain income. However, sharply decrease of people flow under the epidemic makes their earnings almost zero. It is this situation that causes many enterprises drop into not only the problem of short-term capital liquidity but long-term financing depression.
According to statistics, the catering industry ranked second among the top ten industries that most affected by the epidemic [
5]. The dinner on New Year’s Eve was almost entirely unsubscribed because of the outbreak of COVID-19. Affected by the epidemic, people also need to cut the time they go out when dining together is prohibited, which makes many restaurants suffer heavy losses [
6]. In the first half of 2020, the national catering revenue was 1460.9 billion Yuan, a year-on-year decrease of 32.8%, and the growth rate was 42.2 percentage points lower than that of the same period last year. The catering revenue of units above designated size was 311.9 billion Yuan, a year-on-year decrease of 30.2% [
7]. In 2020, the catering revenue was 16.6% lower than that of the previous year [
8]. Numerous food produced go to hoarded, the pressure of rent, water, and electricity charges as well as labor cost increased rapidly, making the catering industry profits plummeted, facing a huge survival crisis [
6].
2. Literature Review
The work resumptions of all walks of life have been widely concerned by the society. Since the outbreak of COVID-19, tertiary industry has been carried out investigation and research on the survival and development under the epidemic. Typical literatures are as follows: Yang and Han [
9] conduct a “real-time” investigation with user-generated content on Twitter to identify the main concerns in the hospitality industry during the COVID-19 pandemic and gain a better understanding of the business challenges and responses from the industry. In terms of how the global pandemic applies to our eating patterns and eating habits, Michael [
10] found that COVID-19 fundamentally changed many everyday food-related practices, whether due to supply chain failures or national directives (such as in-place shelter orders). At the same time, in the face of massive text data, many scholars use text mining technology, combined with machine language deep learning to form a more complete algorithm to study natural language processing [
11]. For example, Tian et al. [
12] used sensory analysis method to study the review data of the catering industry and found the internal relationship between customer emotion and customer rating. Kumar et al. [
13] comprehensively surveyed the evolution of the online social networks, their associated risks, and solutions. The various security models and the state of the art algorithms had been discussed along with a comparative meta-analysis using machine learning, deep learning, and statistical testing to recommend a better solution. Lee et al. [
14] proposed a novel unified approach for learning to rank products based on online product reviews. Unlike existing approaches, it used deep-learning techniques to extract the high-level latent review representation that contains the most semantic information in the learning process. The experiments showed that this approach outperformed the existing methods in sales rank prediction based on online product reviews. In addition, Wang et al. [
15] proposed an unsupervised method for judging comment credibility based on the Biterm Sentiment Latent Dirichlet Allocation (BS-LDA) model. Their experimental results using comments from Amazon.com demonstrated that the overall performance of their approach could play an important role in determining the credibility of comments in some situation. However, there is still a lack of research on the change of consumers’ preference for the catering industry before and after COVID-19. As the main service object of the catering industry, consumers are an important force to promote the resumption of work and production, transformation, and upgrading of the catering industry.
According to this situation, this paper crawls review data in China from August 2019 to April 2020, and divides the data into “before the outbreak” and “after the outbreak” by taking January 2020 as the dividing point. Subsequently, through analyzing these comments by LDA topic analysis method, this paper compares the theme results and comes to the differences of consumer preferences before and after the outbreak. According to the score through the analysis of consumers’ emotion after the epidemic, this paper gets the consumers’ preference for catering after the epidemic, analyzing as well as predicting the consumers’ preference under public health emergencies, so as to provide emergency strategies for the catering industry and further improve the catering industry’s ability to deal with public health emergencies.
4. Empirical Research
4.1. Data Acquisition
At first, all the cities are split into three groups: high risk area (The cumulative number of patients is more than 10,000), medium risk area (The cumulative number of patients is between1000 and 10,000) and low risk area (The cumulative number of patients is less than 1000) according to the severity of the epidemic. By using stratified random sampling, Hangzhou and Guangzhou were selected as low-risk areas, Beijing and Shanghai were selected as medium-risk areas, and Wuhan was selected as high-risk area. Python programming was used to crawl the catering review data of Dianping platform (
https://www.dianping.com, accessed on 21 January 2021) from August 2019 to April 2020. In addition, 20 January 2020 was taken as the cut-off point. Then the data are divided into “pre-epidemic” and “post-epidemic” parts. A total of 11,705 pre-epidemic comments and 2749 post-epidemic comments were obtained (the number of post-epidemic comments affected by COVID-19 was lower than that before COVID-19). The results are shown in
Table 1 and
Table 2. It should be noted that this paper uses the original language (Chinese) in the process of obtaining, processing, and analyzing the data. We just translated the results into English to show them.
4.2. Data Preprocessing
4.2.1. Data Import and Cleaning
It takes restaurant name, various ratings, review content, and review time as result to save them into a file in CSV format. Then it imports the review data into Python and checks whether the data format is correct after import. It preprocesses the data after confirming the success of import. There are many repeated values, modal words, and missing values in the original data, which will lead to deviation between the analysis results and the actual situation. Therefore, it is necessary to delete the repeated values and missing values (referring to the rows without the content), and then take into account the specific situation of the comment data to remove characters such as emoji expressions and low quality review evaluation, etc. Then the text is filtered, and the short sentences below five characters are deleted. Finally, the text is filtered, and the stop-words are deleted by using the stop-words to optimize the text quality and make the analysis results more reliable. Repeat the second data cleaning after the first data cleaning is completed.
4.2.2. Word Segmentation and Stop-Word Filtering
After sentence segmentation processing, the text needs to be filtered. There are a large number of colloquial words which cannot reflect the theme but appear frequently in the comment data. It will affect the experimental results to a certain extent. Therefore, stop-word filtering is needed. In this paper, Jieba [
20,
21], a third-party natural language processing library of Python, is used for word segmentation of Chinese text, removing stop-words. The result is shown in
Table 3.
4.2.3. Words Comparison of Three Risk Groups
At first, the data of low, medium, and high risk areas was analyzed, and this paper got the occurrence of high frequency words, respectively. The specific words are as shown in
Table 4 and
Table 5:
We can see from
Table 4 and
Table 5 that the distribution of high-frequency words in three risk areas before and after COVID-19. It is beyond dispute that these words do not have significant differences. What is more, due to the synchronous progress of the prevention and control of the epidemic in different regions after the epidemic, the differences among different regions are small. Thus, this paper combined and summarized the three regions into pre-epidemic and post-epidemic for research and analysis.
4.3. Word Frequency Statistics and Word Cloud Map
After text preprocessing, we get the cleaned text data. Then, the word frequency statistics of these processed comments data is carried out. Due to the complexity of Chinese parts of speech, a word may have different parts of speech in different contexts. Therefore, it is necessary to distinguish between word parts of speech in order to ensure their correctness in context. In this paper, POS-Tag is adopted for part of speech tagging of words after word segmentation, and the top ten words with the largest word frequency are selected for easy observation, as shown in
Table 6 and
Table 7.
Table 6 shows the word frequency statistics and part of speech tagging of comments before the COVID-19, from which it can be seen that the three words most mentioned by consumers are “delicious”, “taste”, and “queue”. The adjective “delicious” ranks first and the noun “taste” ranks second. This indicates that before the outbreak, a major factor for consumers to choose restaurants is the taste of the dishes, and they tend to prefer the restaurants with better dishes. The third is the verb “queue”, indicating that before the outbreak of the epidemic, many popular restaurants need to queue up and wait, which reflects the sound development of the catering industry.
Table 7 shows the word frequency statistics and part of part of speech tagging of comments after the outbreak of the epidemic. It can be seen from it that the three words most frequently mentioned by consumers are “Epidemic”, “less people”, and “food”. Since this article collected data from comments published after January 2020, the outbreak has had a huge impact on residents’ lives, so it is not surprising that the word “epidemic” appears frequently. In second place, it was “fewer people,” indicating the impact of the outbreak on the restaurant industry, with fewer customers in stores. In third place, it was the noun “dish,” indicating that even during the worst of the epidemic, consumers still valued food quality in the restaurant industry. In the review, the word “protection” is a special word for consumers at this particular time. The high frequency of the word “protection” indicates that consumers pay more attention to protection measures when eating out and have a certain panic about the epidemic.
Based on Python, this paper uses comment word segmentation [
22,
23] to construct word cloud maps before and after the epidemic (
Figure 2 and
Figure 3), respectively, so as to make text word segmentation more visible.
The word cloud map makes it more clear that the most common words in these comments, such as “taste”, “service”, “price”, and “feature”, are consumers’ preferences for the restaurant industry before the epidemic; “protection”, “take-out”, and “disinfection” are measures taken by consumers and businesses to reduce the possibility of further spread of the disease during the epidemic. The paper can use word cloud to understand the consumer psychology and consumer behavior in food and beverage during the epidemic period, which is helpful for us to study the psychological behavior of consumers in public health emergencies.
4.4. Subject Word Determination
According to the degree of confusion corresponding to the subject words run by the Python program [
24], the following results were obtained: the degree of confusion was the minimum when the number of subjects before the epidemic was three; after the epidemic, the degree of confusion was the smallest when the number of subjects was four. After the epidemic, the reduction of the minimum degree of confusion was greater, so four themes were selected respectively before and after the epidemic shown in
Figure 4 and
Figure 5.
In this paper, LDA thematic model is used to mine the potential topics of pre-epidemic and post-epidemic commentary texts, respectively. Four themes were selected before and after the epidemic, and six keywords were selected for each theme to get the following
Table 6.
As shown in
Table 8, subject words were extracted from the comments before the epidemic. In this paper, the first theme is summarized as the quality of dishes in the restaurant. “taste”, “palate”, and “flavor” indicates that consumers pay attention to the taste of dishes, while “fresh” and “meat quality” indicates that consumers pay attention to the quality of dishes. Generally speaking, consumers prefer dishes with good taste and fresh meat quality and pay attention to the quality of dishes. The second theme is summarized as recommended, “clock out”, “for the first time”, “recommend”, and “signs” suggests that many consumers to shop for the first time is usually based on the network or friend’s recommendation.“Web celebrity effect” has a great influence on the choice of consumers, but the probability of words shows that consumers have a high percentage of disappoint web celebrity to web celebrity restaurant and popular dishes. The third theme is summarized as consumer preferences. The words “taste”, “price”, “food”, and “recommendation” respectively represent the demands from different perspectives that consumers value more before the epidemic. The fourth theme is summarized as the restaurant service level, and “speed” and “attitude” grade shows that consumers pay attention to the restaurant’s service, wait speed and the waiter’s service attitude.
As shown in
Table 9, subject terms were extracted from the comments after COVID-19. In this paper, the first theme is summarized as the dining style chosen by consumers. Compared with the in-store food, consumers are more likely to choose takeaway, package, or purchase semi-finished products for processing and eating at home. The second theme is summarized dining experience for consumers. Because of the epidemic prevention and control as well as experts for advice citizens out less, most of the food and beverage outlets traffic is small. Moreover, providing eat-in during COVID-19 of food and beverage outlets need reasonable layout according to the requirement, and ensure that diners keep a safe distance, strictly separate portions, etc., so there will be occasional queues. Theme three changes are summed up as food. Because of COVID-19, the transport is limited by certain ingredients. In addition, it is hard to ensure freshness, especially salmon, sashimi, etc.; imported frozen meat, and aquatic products are even more affected. As a result, some food spoilage occurred. Moreover, food and beverage outlets suffered serious losses, and after reopening, many stores chose to reduce the amount of stock, to reduce spending, or to increase their income in order to maintain its normal operation. The fourth theme is summarized as the degree of epidemic prevention. After the outbreak of the epidemic, consumers were required to wear masks, measure their body temperature, use disinfectant to disinfect their hands, and sit at a safe distance when eating. Therefore, there are many comments of this type in the comments.
By comparing the four themes before and after the outbreak, the authors found difference and relationships in consumers’ psychological behavior before and after the outbreak: first, compared with before, after the outbreak of consumers were more focused on the dining environment, paying attention to the epidemic prevention and health conditions of dining place, at the same time also noticing whether the shop, when arranging customers to line up and eat, required them to keep a certain distance. Secondly, the way that the consumers’ diet before, and after, the epidemic has greatly changed. Before the epidemic, they tended to go to online celebrity stores to punch in, but after the epidemic, they tended to pack home or buy takeaway food. Thirdly, before the outbreak, consumers considered whether the store had tasty food, and if the store was a recommend web celebrity restaurant. Moreover, after the outbreak, consumers paid more attention to food safety degree. For example, when some frozen fish tested positive during the COVID-19 outbreak, consumers reduced their demand for such ingredients, so changes to food types also needed to be made accordingly. The relationship between consumer behavior before and after the epidemic was mainly focused on the waiting time for meals (including waiting time and serving speed). All consumers paid attention to the service standard and attitude of the staff in the restaurant. In the highly competitive catering industry, a good service attitude can make a restaurant have more repeat customers and praise rate. Especially during the epidemic, when many stores are closed, good service not only warms customers, but also extends the life of stores.
4.5. Sentiment Analysis
Comments represent the feelings of consumers and can reflect the psychology of their behavior. During the epidemic period, people reduced their consumption outside. Online comments have become an important reference for consumers to choose products, and the emotional tendency of comments is the main factor affecting consumers’ choice. Therefore, after getting the keywords from LDA topic model, this paper uses SNOWNLP [
25,
26] to judge the situation before and after the epidemic according to the proportion of positive and negative comments before and after the epidemic. The following
Figure 6 can be obtained.
Before COVID-19, the positive comments accounted for about 43%, while the negative comments accounted for about 57%. After COVID-19, positive comments accounted for about 54%, while negative comments accounted for about 46%. Compared with before the outbreak of COVID-19, the proportion of positive comments on food and beverage after the outbreak of COVID-19 did not decrease but increased, which is more optimistic, indicating that during the epidemic period, people’s consumption tolerance increased. Their requirements for some basic conditions of food and beverage that they care about before the outbreak decreased. Moreover, they generally had the psychological behavior of expecting the epidemic to end soon and businesses to resume normal business.
This paper converts the three kinds of scores of taste, environment, and service obtained by Python into one point system. It compares consumers’ emotional tendency with three kinds of scores to explore which aspect of influence is more similar. It takes the mean value of every 50 data to eliminate the influence of extreme values, and then draws a double line chart to compare with the emotional tendency shown in
Figure 7,
Figure 8 and
Figure 9.
Compared with emotional scores, taste, environment, and service scores are generally higher. The broken line of taste score and emotion score has the lowest fitting degree. Furthermore, it is the most stable among all the scoring lines, indicating that the food taste of restaurants did not change significantly before the epidemic, and consumers’ score of taste fluctuated less. The broken lines of environment and service scores fit well with the broken lines of emotion scores. The peaks and valleys are roughly consistent, indicating that environment and service scores have a greater impact on the overall emotional scores of consumer reviews. Before the outbreak of epidemic, many restaurants did good business. Especially in the dinner rush hour, it is easy to have environmental health problems such as noisy dining environment, overdue cleaning of debris, and foreign bodies in dishes. In addition, there would be long queues, slow delivery of food, and insufficient service personnel leading to lower service standards. Consumer emotion and the emotional score are obviously affected by the environment and service, and therefore arise an obvious peak period.
As can be seen from
Figure 10,
Figure 11 and
Figure 12, the scores of taste, environment, and service have a bit gentle trend compared with the scores of emotion, and their mean values are significantly higher than the scores of emotion. Among them, the trend of service score and emotional score are the most similar. Under the condition of stable taste and environment, the service quality of restaurants has a direct impact on the dining experience of consumers. From entering the restaurant, sitting down to leaving the restaurant, the service quality is greatly affected by human factors, such as the quality of the training of the waiter, working hours, and other factors. So it is difficult to maintain a unified standard of service in the restaurant. Whether the waiter can deal with some unexpected situations such as wrong dishes or foreign bodies in dishes, promptly, and solve them smoothly directly affects the consumer’s consumption feelings. Due to the serious loss of catering stores, many stores choose to reduce their stock in order to reduce the cost of storing food or to increase their food prices to increase income to maintain their normal operation after reopening. If the service is not improved in time, consumers are likely to be dissatisfied. Affected by the epidemic prevention and control, the catering stores providing hall food during the epidemic period need to be reasonably arranged according to the requirements to ensure that the diners keep a safe distance. As a result, the number of available seats in the store is reduced, and consumers have to wait at the door of store. Therefore, the waiters not only need to ensure the demands of the consumers who dine in the restaurants are met, but also should pay more attention to the demands of the consumers who line up outside the restaurants.
For example, some measurement including providing hot drinks and seat and so on can be used. Due to the outbreak of COVID-19, the transportation of food materials has been limited to some extent, especially the imported frozen meat products such as salmon and aquatic products, which have been greatly affected. The waiter should inform the consumers of the adjustment of the dishes in the store in time, so as to avoid the change of food materials that may lead to the dissatisfying taste of the dishes, which may affect the consumption experience. In addition, epidemic prevention measures are extremely crucial as well [
27,
28]. Waiters need to wear masks throughout the whole process, measure the temperature of consumers entering the store and use disinfectant to disinfect their hands to ensure the hygiene of the restaurant. In general, the restaurant needs to ensure the quality of dishes and the environment of the restaurant, increase the management and training of the waiter, and further improve the service level and service awareness of the waiter, so as to bring better dining experience to consumers.
5. Conclusions
This paper uses data mining technology to study consumer preferences before and after the epidemic. First of all, through cleaning and sorting of the collected data, the authors get relatively clean comment data. Based on this, the paper carries out word frequency statistics and word cloud mapping and analyzes both psychological behavior and psychological changes of consumers under public health emergencies. After that, this paper conducts a thematic study on comment data [
29] and compares the theme composition before and after the epidemic, respectively. It is found that before the outbreak of COVID-19, consumers pay more attention to the taste of dishes when dining out. After this, they focus on epidemic prevention measures, health services, and food safety of stores. BeforeCOVID-19 happened, consumers tended to punch in online stores. However, they prefer pack or order takeout after the outbreak. It seems beyond dispute that COVID-19 not only changed consumers’ lifestyle in many ways, but also changed consumer attitudes and consumption habits. Finally, this paper makes an emotional analysis [
30] of the comments by comparing the scores of taste, environment, and service which obtained by Python crawling with those obtained by machine learning. The authors find the relationship between consumer ratings and consumer emotions. During the outbreak response, psychological attitude of catering consumers was still positive. The environment and service had a significant impact on the emotional score of consumers before and after the outbreak, indicating that consumers now think highly of to the state and comfortable degree of dining. Based on the above conclusions, suggestions are as follows:
- (1)
Improve the level of epidemic prevention services and the transparency of food information to gain consumers’ trust.
Under public health emergencies, consumers pay increasingly attention to the epidemic prevention of restaurants. The catering industry should strengthen the epidemic prevention measures, disinfect on time every day, prepare disinfectant and masks, as well concentrate on cleaning and disinfecting of tableware. At the same time, reasonable arrangements for consumers queuing and dining distance need attention, either. Catering service personnel need regular training to ensure that they can also have professional service standards and warm service attitude in the face of public health emergencies. For example, when consumers are dissatisfied, they can take appropriate measures to appease and compensate, so as to minimize the loss.
From the semi-finished products made from the purchased raw materials to the final products, information such as the name of the ingredients, the place of origin, the place of passage, and the results of various food safety tests are presented on the QR (Quick Response) code. By setting the QR code on the dining table and the network platform, consumers can directly scan the code to understand the changes of the ingredients and the safety of the ingredients. In addition, it can also reduce the time for the waiter to explain the dishes to consumers. Thus, they are available to focus on the service to improve service efficiency while give consumers a sense of security in the event of public security emergencies.
- (2)
The combination of online and offline double line discount and consumption coupon to win the “heart” of consumer.
The competition in the catering industry is fierce, and all kinds of preferential activities emerge in endlessly. How to stand out in the competition is worth thinking about. Each store can adopt different online and offline discount methods. It can adopt the system of issuing food consumption coupons and exchanging a certain amount of accumulated consumption for dishes online, so that consumers can not only enjoy the discount but continue to consume in the store. At the same time, online sales are more suitable to launch “small dishes” package discount and encourage consumers to buy dishes in the form of package, which can not only save food and beverage stores. The types and costs of food materials procurement can also save consumers’ choice time. Offline preferential activities are more diversified than online ones. Consumers can not only “buy back” by presenting e-coupons to shop customers, but also present their own special dishes or new dishes or store related small gifts to consumers in the form of lucky draw, so that consumers will have a good time; meanwhile, their heart will dance with happiness.
- (3)
Actively obtain feedback to awaken consumer demand and seize the “stomach” of consumers.
During the epidemic, consumers’ choice and demand for dishes changed. The service personnel should take the initiative to ask consumers about their taste and feelings of the dishes and record the problems so as to rectify the dishes and meet the needs of consumers. Let consumers in the restaurant to obtain “stomach” satisfaction, at the same time to obtain emotional and social satisfaction. Moreover the restaurant can flexibly adjust dishes according to market supply and demand, to give consumers satisfaction.
- (4)
Reasonable use of community group purchase to provide consumers with “convenience”.
In the context of the epidemic, people generally do not want to wait in line, but want to be able to eat quickly. The community group purchase with the community as the unit arises at the historic moment. The restaurant can accordingly change its business form and flexibly adjust the products it sells according to the market demand. Catering stores can launch meals in the form of group purchase, such as semi-finished products and group purchase packages, in order to meet the living needs of surrounding residents and give consumers convenience.
- (5)
Develop environmentally friendly takeout mode, leaving consumers with “beauty”.
In the event of public health and security, the increase in the number of people ordering takeaway food has led to the problem of excessive use of disposable cutlery. Catering stores can encourage consumers to reduce the use of disposable tableware by offering certain preferential measures to those who bring their own tableware. Catering stores should replace plastic packaging boxes with degradable paper boxes as much as possible, so as to reduce the harm to the environment and achieve long-term sustainable development.
- (6)
Data help the recovery of the catering industry.
Local governments can set up data centers and local online epidemic prevention workstations based on “counties’ street” to provide corresponding policy information support for local catering enterprises. At the same time, commercial banks can cooperate with catering enterprises to provide consumption discounts to consumers who handle business in the bank, so as to increase the cash flow of the catering industry and promote the recovery of catering enterprises during the epidemic period.