1. Introduction
Live streaming E-commerce is a new manner of commodity sales, integrating merchants, online celebrities, and consumers. It combines live streaming with E-commerce and connects people, goods, places, and other elements, forming a new model of “E-commerce + live streaming”. Based on mature, stable, and efficient mobile Internet, mobile payment, advanced intelligent technology, and modern logistics system, it attracts younger users, expands the size of consumers, and enhances interactivity.
In particular, the current COVID-19 pandemic has affected most parts of the world, bringing unprecedented challenges to the economy. Research by Wu et al. [
1] showed that the COVID-19 epidemic had caused significant economic losses in the manufacturing industry, transportation industry, service industry, import and export trade, etc. It can be seen that the epidemic has had a significant impact on residents’ travel and social output. Therefore, live streaming E-commerce, as a new consumption mode, is gaining more and more attention with the advantages of online social commerce. Qing and Jin [
2] proposed that the COVID-19 pandemic had affected all aspects of retail, but the rapid development of live streaming E-commerce, as a new consumer model in the residential and non-contact economies, has not only quickly met the market demand in the context of the current epidemic but also further promoted the development and innovation of online businesses. It can be seen that the outbreak of the COVID-19 epidemic in late 2019 has objectively accelerated the upsurge of online consumption. Coupled with the government’s advocacy for stimulating consumer demand, live streaming E-commerce has become an important means for solving material supply and people’s daily necessities. Its purpose and inherited nature have also extended from the sharing and entertainment to new fields such as product sales, cultural tourism promotion, poverty alleviation, and agricultural assistance. The current live streaming mode has been upgraded from “E-commerce + live streaming” to “Internet celebrity + star + official + E-commerce + live streaming”, forming “government-led, E-commerce-centered, social assistance, fan consumption, and economic development”. It is undeniable that live streaming advances economic development, expands domestic demand, and contributes to poverty alleviation and employment. However, there are still a lot of problems and risks. Huang and Wu [
3] show that many problems, such as false publicity, data falsification, counterfeit and inferior products, and lack of after-sales service, occur frequently in the live streaming E-commerce, and the industry norms and regulatory system need to be improved. Xia and Song [
4] also showed that although, under the special background of the post-epidemic situation, the popularity and acceptance of the new retail mode of “live streaming E-commerce” had been greatly improved, and the platform economy has been given new vitality, the illegal chaos and improper supervision hidden in the industry cannot be ignored. Therefore, for the healthy development of the industry, various national departments have issued normative policies in succession, such as
the Code of Conduct for Online Live Streaming Marketing,
Guiding Opinions of the State Administration for Market Regulation on Strengthening the Supervision of Online Live Streaming Marketing Activities, and other policies issued by the State Administration for Market Regulation. All of them are used to protect consumers’ rights and interests while stimulating economic recovery and regulating the live streaming E-commerce industry.
Compared with traditional industries, live streaming E-commerce features a low entry threshold, wide participation scope, and huge influence. Today, as Internet technology is becoming more and more mature, live streaming E-commerce significantly impacts social and economic structures. With the improvement of the professionalization of hosts and the surge in the number of participants, the industry’s problems have become increasingly prominent, seriously affecting consumers’ interests, and even threatening the future development of live streaming E-commerce. Therefore, the public’s call for its governance and regulation has become stronger and stronger. Based on this, introducing these normative policies will inevitably attract great attention from the public, who will express their opinions on the effects of these policies from various perspectives. Promptly discussing the public’s attitudes before and after the policy implementation and studying the voice of the public can effectively analyze the policy implementation effect, reduce potential risks in the development of the live streaming E-commerce industry, and lay down a solid foundation for regulating the live streaming E-commerce. Moreover, online social networking platforms have huge advantages in terms of scope and speed of information dissemination, so they can better obtain public opinion and use adopted data mining and natural language processing technology to explore public opinion from multiple dimensions and provides a theoretical basis for the implementation of the policy and the development of the industry. Chakraborty and Sharma [
5] proposed that Weibo, a service called Twitter, has become a popular platform for people to express their opinions on different issues and that analyzing tweets discussing government policies can help to understand the public’s views on different government decisions. Zhou et al. [
6] used the public’s decision on the country to build large infrastructure to analyze the degree of public support and the implementation effect of government policies.
Meanwhile, Guo et al. [
7] proposed that live streaming E-commerce has become an important driver of global trade, but limited attention has been paid to this area. Sun et al. [
8] also pointed out that although people were increasingly interested in live streaming E-commerce, relevant research is still limited, and scholars mainly analyze the development of the live streaming E-commerce industry based on the behavioral intentions of consumers. It can be seen that at present, there are relatively few studies on the implementation effect of the live streaming E-commerce policies both in domestic and international research, let alone analyzing the implementation effect of policies in this industry by using online comments. Based on this, this paper collects Sina Weibo data, integrates it with topic modeling and sentiment analysis methods, explores the evolution of online public opinion before and after the policy implementation, analyzes the public’s attitude and attention to these normative policies, and studies the social effects of various policy implementations.
The structure of the paper is organized as follows:
Section 2 is a literature review.
Section 3 is the collection and processing of data related to the live streaming E-commerce industry.
Section 4 uses online comment data to analyze the changes in public attention before and after the policy implementation based on the LDA models and online HDP models.
Section 5 uses three emotional dictionaries to construct different sentiment analysis models respectively, conducts sentiment analysis on the collected comment data, explores the changes in public sentiment before and after the policy implementation, and studies their satisfaction with the policy implementation.
Section 6 is the summary of the full text and the prospect of future work.
2. Literature Review
At present, the live streaming E-commerce industry is at a stage of rapid development, playing a practical and effective role in economic revitalization, poverty alleviation, and expanding employment. However, during the booming development, the problems and drawbacks, such as soaring complaints, poor quality, fraudulent advertising, and nonstandard supervision, are prominent. Based on this, the Chinese government successively issued a series of policy documents aimed at regulating the live streaming E-commerce industry, providing a legal basis for managing its chaos. This section analyzes the literature from the perspectives of live streaming E-commerce policies and online public opinion.
- (1)
Live streaming E-commerce policy
At present, the analysis of the policy implementation effects is mostly concentrated in the fields of medical and health care, environmental protection, and industrial manufacturing. Courtney and Lorie [
9] explored the policies of Canadian nursing regulatory bodies and associations on nursing practice specific to environmental health. Through publicly available position statements and competency documents regarding health and the environment coded inductively and thematically analyzed, their study found a gap between nursing policies and competencies directing nursing action related to the health of the environment across Canada. Yuan et al. [
10] performed a thematic framework and content analysis to analyze the related policies about disease control and prevention systems in China from 2000 to 2020 based on the theory of policy instruments. The results showed that in the policy formulation process, the government should strengthen the comprehensive application of multiple policy instruments, particularly about the inducement instrument and symbolic and hortatory instrument, for stimulating internal motivation. Albulescu et al. [
11] used a quantile fixed-effect panel data approach and OECD (Organization for Economic Co-operation and Development) data to investigate how environmental policy stringency affects CO2 emissions in a set of 32 countries from 1990 to 2015. Results showed that an increase in policy stringency had a negative impact on emissions and that environmental stringency had a more powerful impact in countries with lower levels of carbon emissions. Based on city-level panel data from China, Liu et al. [
12] applied a difference-in-difference (DID) model and a bootstrap panel Granger causality test to investigate the relationship between two types of environmental regulations and industrial growth. Their results showed the following: (1) Command-and-control environmental regulation (CAC) had a significant inhibitory effect on industrial growth. (2) In most regions, there was no significant Granger causality between market-based incentive environmental regulation (MBI) and industrial growth. (3) Compared with CAC, MBI allowed more flexibility for enterprises, which was more beneficial for technological innovation. Wang [
13] used social network analysis, content analysis, and other methods to explore the sustainable evolution law of China’s cloud manufacturing policies from the perspectives of policy issuing departments, policy focus topics, and policy tools. The results illustrated that the evolution of cloud manufacturing policies showed obvious stage characteristics, i.e., it mainly went through three stages of “encouraging development-top level design-implementation guidance”. Meanwhile, China’s cloud manufacturing industry was still in the development stage, and it was urgent to introduce policies that directly affected the industry. Zheng et al. [
14] estimated the treatment effects of the differential electricity pricing (DEP) policy through the method of propensity score matching and difference in differences (PSM-DID). The results indicated a significant negative effect of the DEP on the TFP of energy-intensive industries shortly after the DEPs implementation. Moreover, the role of the DEP policy played a transition from increasing costs to stimulating technology improvement for energy-intensive industries. Based on the PASS approach (P: prepare-protect-provide; A: avoid-adjust; S: shift-share; S: substitute-stop), Zhang et al. [
15] classified the 418 COVID-19 policy measures issued by Australia, Canada, Japan, and New Zealand in 2020 and developed a dynamic Bayesian multilevel generalized structural equation model to represent dynamic cause-effect relationships between policymaking, its influencing factors, and its consequences. The results showed that “Prepare-protect-provide” policy measures had dominated in practice; about 40% of all 418 measures could be judged as effective, and the UK showed the best performance, followed by Japan and Australia.
However, there are very limited studies on the effects of its policy implementation in the live streaming E-commerce industry. Scholars mostly analyze various factors affecting the development of the industry from the perspectives of sales strategies and consumer behavioral intentions and propose relevant policy suggestions accordingly. From the seller’s perspective, Assarut et al. [
16] used a mixed quantitative and qualitative approach to analyze Facebook data of live streaming sellers to assess the nature and extent of engagement metrics, and delineated the dynamic, interactive live streaming sales process. Finally, they identified four sales approaches and twelve strategies adopted in acquiring and retaining customers and provided a framework for the government and various live streaming platforms to understand the relationship mechanism in live streaming commerce. Niuet al. [
17] found that Machiavellianism was positively associated with gift-giving in live video streaming through the mediating role of desire for control; the mediating effect of desire for control was moderated by materialism, with this relation being stronger for individuals with a higher level of materialism. According to the characteristics of network live broadcast and agricultural products, Chen et al. [
18] used the means of case analysis and questionnaire survey to analyze the policy of live broadcast with goods to help agriculture and found that the main problems under the current policy were the imperfect service guarantee system and the lack of specialization of anchors. Zhang et al. [
19] applied the grounded theory method to analyze semi-structured interviews of 96 consumers, identified the eight characteristics of E-commerce live streaming anchors (expertise, attractiveness, credibility, interactivity, popularity, price support, affinity, and responsiveness), and further classified them into four roles (opinion leader, spokesperson, interactive friend, and salesperson), providing a clear framework for the management and training of anchors. At the same time, many scholars have noticed that the hidden risks in the industry are constantly eroding the trust of consumers and overdrawing the future potential of the industry. Therefore, they use this as a breakthrough to provide a theoretical basis for the improvement of industry policies. Lu and Chen [
20] examined how live streaming affected consumers’ purchase intentions (PI) by considering product uncertainty reduction and trust cultivation between consumers and broadcasters. Based on a structured survey data set and an unstructured interview data set from live streaming commerce users in China, they used structural equation modeling as well as qualitative analyses to verify the research model. The results showed that there were significant impacts of product fit uncertainty, product quality uncertainty, and trust on PI, and trust was more important compared with product fit and quality uncertainty in affecting PI. Guo et al. [
21] examined how the affordance of live streaming affected consumer behavior in the cross-border E-commerce context based on the information transparency perspective. Results showed that although live streaming did not directly affect consumers’ cross-border purchase intentions, it could increase consumers’ purchase intentions through increasing perceived information transparency. Wang and Lin [
22] adopted a multiple-case grounded methodology to build a theoretical framework for helping live streaming E-commerce to “resist evil and follow good” through analyzing the comments of the 12 live streaming E-commerce “roll over” events, in order to provide a theoretical basis for standardizing the behavior of live streaming E-commerce.
In addition, according to the above literature analysis, it is not difficult to find that most scholars’ research on policy is mostly based on policy tools, text content, questionnaire analysis, and data analysis of relevant influencing factors, but rarely analyzes the policy implementation effect through data mining of online comments. The chaos in this industry has seriously affected the vital interests of consumers. Therefore, the normative policies issued by relevant departments have quickly attracted the attention of the public, and a large number of live streaming E-commerce users have shared personal experiences with the policies on online social media platforms. Based on this, analyzing the attitudes and emotional tendencies of the public in the process of implementing these normative policies through network public opinion can effectively analyze the policy implementation effects.
- (2)
Online public opinion on social hot events
At present, the analysis of public opinion on social hot events generally starts with public opinion surveys based on questionnaires and interviews, and then uses traditional qualitative and quantitative methods to analyze the data. For example, Jony et al. [
23] analyzed predictors of control measures and psychosocial problems associated with COVID-19 pandemic through questionnaires. Their findings suggest that health authorities must promote health education and implement related policies to disseminate COVID-19awareness that can prevent and control the spread of COVID-19 infection. James [
24] argued that a survey poll was more likely a way of representing a small group of individuals’ viewpoints rather than public opinion or social opinion. Bian et al. [
25] thought that since the survey polls were conducted in a private environment and due to the time required for and high capital costs involved in data collection for survey polls, data quantity was usually limited, restricting the openness of public opinion and the representativeness of the findings to a large extent. Fortunately, with the popularity of the Internet, social media platforms have provided a new way of expressing and measuring public opinion, namely, online opinion. This new form of the opinion-gathering method increases the size of the datasets obtained, enables sample diversity, reduces the associated costs, and speeds up data collection. Ceron and Negri [
26] believed that online public opinion could provide decision makers with meaningful information, and it provided a cheap and efficient way to monitor and evaluate public opinion. Over the past few decades, social media-based network opinion analysis has been applied in various fields, including social sciences, politics, education, and medicine. Zheng and Tuo [
27] found that Weibo had become the most popular channel for political discussions in China as almost all government agencies at the local, regional, and central levels had opened official Weibo accounts. This unique characteristic motivates scholars to use it to conduct public opinion analyses on Chinese issues. Liesbeth et al. [
28] used data extracted from online (social) media to provide monitoring of prominent opinions on health policy among the public at certain times and helped public health institutes immediately respond appropriately to these public concerns.
In recent years, scholars have proposed many methods for online public opinion analysis. Dong and Lian [
29] thought that the temporal and spatial distribution analysis, sentiment analysis, and viewpoint analysis were the three most commonly applied approaches in previous studies. Lozano et al. [
30] developed a distributed geo-aware streaming latent Dirichlet allocation model and used it for automatic discovery and geographical tracking of election topics during parts of the 2016 American presidential primary elections. Their results showed that the online discussion topics’ locations correlated with the actual election locations and that the model provided a better geo location classification approach. Taking the Three Gorges project (TGP) as a case study, Jiang et al. [
31] proposed a project sentiment analysis (PSA) system using a lexicon-based method. This system collected user opinions from social networking sites, established emotion dictionaries, and built basic rules that calculated those sentiment values embodied in a collection of messages. Their results showed that about half of the collected messages expressed negative emotions towards the TGP, while the other messages were positive or neutral. Jiang et al. [
32] combined sentiment analysis and topic modeling, and spatiotemporal analysis to establish a systematic framework for the assessment of large infrastructure projects and transformed unstructured online public opinions on large infrastructure projects into sentimental and topical indicators. Their results showed that sentiment polarity and major topics of public opinion were strongly associated with the spatiotemporal distribution. Barachi et al. [
33] constructed a two-way LSTM (long short-term memory) network model containing online latent semantic indexes with regularization constraints to extract multiple emotions from a large number of online posts about climate change. In addition, their findings of the study indicated that chosen topics, cultural sensitivities, and posting frequency all played critical roles in public reactions to the posts and the subsequent perspectives they adopted.
To sum up, with the popularity of live streaming E-commerce blossoming in the country, most scholars have explored the factors affecting the development of the industry as well as the chaos in related industries and then proposed many policy suggestions based on analysis. However, there is very limited analysis of online public opinion on policies. Using large text corpora obtained from social media platforms to analyze public opinion can more effectively capture the attention and emotional tendencies of the public. Based on this, this paper takes Weibo as the online comment data collection platform, uses the LDA and online HDP models to extract topics, identifies sentiment polarity and sentiment intensity values based on the analysis models of different emotion dictionaries, explores the changes in public attention before and after the policy implementation, and analyzes the policy implementation effect, providing a pivotal basis for optimizing the normative policy of live streaming E-commerce.
3. Research Framework and Methods
The public may have different concerns and emotional tendencies during the implementation of the normative policies in the live streaming E-commerce industry. Therefore, this paper analyzes the changes in public opinion during this period from these two perspectives and studies the implementation effects of the normative policies. The research consists of the following three parts: data collection, data preprocessing, and data analysis.
Firstly, this paper analyzes the current situation of the live streaming E-commerce industry and the most important policy texts in the industry and finds out the biggest existing problems in the industry and the planning of government policy schemes. Secondly, the collection scope of online comments is determined by analyzing the content of policies, and the comment data from 1 January 2020 to 31 August 2021 is crawled from Sina Weibo by Python crawler technology based on the implementation time of the policy, and the data preprocessing is carried out after the statistical analysis of the attention intensity difference of comments. Third, taking 1 July 2020 as the time node, when the first live streaming E-commerce industry normative policy, the Code of Conduct for Online Live Streaming Marketing, was implemented, the timeline was divided into the time window before and the time window after the implementation of this policy, and the released dates of subsequent normative policies are added on this basis (5 November 2020, 9 February, 1 May, and 25 May 2021) to analyze comments. Fourthly, the topic extraction model and sentiment analysis model are adopted to study the changes in online public opinion before and after the policy implementation and to explore the implementation effect of the normative policies for the live streaming E-commerce industry as follows: In order to analyze the development trend of public opinion on the live streaming E-commerce standard policy, it is necessary to identify the focus of public attention during this period. In this paper, the LDA model is used to extract topics from online comment data, and the results of the online HDP model are compared and supplemented to identify potential topics, analyze possible problems in policy implementation, and ensure the effective implementation of policies. Additionally, by mining users’ views and analyzing users’ emotional tendencies, we can deeply understand users’ attitudes and emotions towards events. This paper analyzes the emotional changes before and after the implementation of the industry standard policies, studies the public’s attitude and tendency toward these policies, and provides suggestions for how the government can guide the development of public opinion, so as to effectively guarantee the healthy development of the live streaming industry.
The specific research framework is shown in
Figure 1.
5. LDA Model Result: The Evolution of the Online Topics of the Normative Policies for the Live Streaming E-Commerce
To analyze the evolution of public opinion on the policy for regulating live streaming E-commerce, it is necessary to identify the attention of the public during this period. In order to accurately find out the potential topics of the public’s online comments, this section comprehensively considers the analysis results of the LDA topic extraction model and online HDP model to identify the potential topics of comments, analyze the possible problems in policy implementation, and ensure the effective implementation of policies.
5.1. Data Preprocessing
Through the analysis of the raw data, it can be found that many users have made Weibo comments on the released live streaming E-commerce normative policy, reflecting the public’s great interest. They express views or emotions by frequently commenting, liking, and forwarding the official Weibo. To accurately extract public opinion from these comments, data must be preprocessed before analysis. The specific process is as follows:
- (1)
Leave out invalid, duplicate, or incomplete data by filtering. For example, only “ha ha ha” and the other two or three words of meaningless comments, other website links, advertisements, illogical text disorderly code, and other comments are all invalid and incomplete texts.
- (2)
The forwarded Weibo usually shares the same viewpoint as the original Weibo. If the forwarded Weibo does not add any meaning, the original Weibo will be kept and the forwarded Weibo will be deleted to avoid duplication of data.
- (3)
Remove noise information. In Weibo, nicknames usually follow the @ symbol. If nicknames appear frequently, it will affect the results of topic extraction, and the emotional information contained in them will also affect the results of sentiment analysis.
- (4)
Exclude non-text data. The evolution of public opinion is analyzed from the perspective of textual information, while pictures, charts, animations, emojis, and other non-text elements are not considered.
Through the above processing, a total of 308,751 pre-processed data were finally verified and stored in the data set, among which 68,161 were before policy implementation and 240,590 were after implementation. This paper uses the Jieba Chinese word segmentation module in Python to segment each preprocessed dataset and remove the stop words in it to prepare it for the topic and sentiment analysis.
5.2. LDA Models
As a natural language processing (NLP) technique, topic modeling can be used for information retrieval and classification of large-scale documents or corpora. Asmussen and Moller [
35] proposed that LDA models were not only the preferred method for topic modeling but also the most advanced tool for exploratory analysis of a particular subject with textual material. Blei’s [
36] research also proposed that the LDA topic modeling algorithms were the most convenient of the several possible generative models that can be used for topic modeling. Therefore, this paper also uses this model to conduct the topic extraction of the policy text.
5.2.1. Construction of LDA Models
Document modeling in the LDA model is Unigram Model believes that the generation process is constantly taking a document from a bag of words, that is, choosing a topic with a certain probability, and then with a certain probability to choose a word from the topic, and finally, in the case of known text words, through the probability method to derive the topic distribution of the text set, for the clustering of documents. Blei et al. [
37] proposed that this method ignores the word order in the modeling process, which further simplifies the complexity of the model and is suitable for the modeling of massive text data with heterogeneous characteristics. The process of the LDA model is shown in
Figure 4.
Where K is the number of topics, D is the number of documents, and N is the number of words. For the nth word of the
dth document, the topic distribution
θd corresponding to the document is first generated by the prior distribution (Dirichlet distribution) of the topic, and a topic
Zdn is extracted the refrom. According to the current topic
Zdn, the distribution
βk of the word is then generated from the prior distribution
β (Dirichlet distribution) of the word, and a word
ωdn is extracted from the distribution. After N repetitions of previous steps, it will obtain a document with N words. The joint probability of LDA modeling is defined in Equation (1) as follows:
In this paper, the topic modeling is constructed by the related modules of the LDA models in Python and Java to analyze the changes in public attention before and after the policy implementation and to discuss the implementation effect of the policy.
5.2.2. Selecting the Best Number of Topics
Before building the LDA models, perplexity and coherence tests are necessary to determine the number of topics. The perplexity is a measurement of how well a probability distribution or probability model predicts a sample. A low perplexity indicates the probability distribution is good at predicting the sample. Coherence judges the semantic consistency of word statistics between different texts, which can solve the problem of model overfitting caused by perplexity. A higher coherence represents the topic model with a higher quality. The results of the best number of topics in the two-stage public opinion evolution process are shown in
Figure 5 below.
Figure 5 shows the changes in the perplexity and coherence of LDA models under the different number of topics before and after policy implementation.
Figure 5a,b shows that before the implementation, when the number of topics is 8, the perplexity score is the lowest and the coherence score is the highest, so the best number of topics for the LDA models at this stage is 8. After the implementation, when the number of topics is 11, the perplexity obtains the minimum value. When the number of topics is 10, the coherence score is the highest, but there is not much difference between the results when the number of topics is 11. Therefore, after comprehensive consideration, the minimum value of the LDA models after the implementation is set at 11.
5.3. Online HDP Models
Compared to standard LDA models, which require a manual search for the number of motifs, the HDP model can automatically infer the number of topics according to the document collection, which greatly increases the robustness of the algorithm. Tang et al. [
38] proposed that for large data sets, users generally did not determine how many topics they contain. Therefore, the hierarchical Dirichlet process (HDP) was proposed after LDA by Blei [
39] to enable the investigation of and representation of more complex corpus/data. The Wang and Al-Rubaie [
40] study gave the following specific process of the HDP model: The HDP was a stochastic process that could be used to define a nonparametric distribution on an assortment of mixtures model; that is, each grouping of data was drawn from a mixture model, and the mixture components were shared among the different groups. Using a hierarchy of Dirichlet processes enables the number of mixture components to be inferred from the data.
However, Wang et al. [
41] pointed out that one limitation of HDP analysis was that existing posterior inference algorithms require multiple passes through all the data as follows: these algorithms are intractable for very large-scale applications. So, they proposed an online variational inference algorithm for the HDP—online HDP. Moreover, the HDP model module in the genshim package in Python software is built with this algorithm as its core, so in this paper, this module is used to build the model and enhance the identification of topics by comparing it with the results of the LDA model. Then, since the online HDP model is an unsupervised model, no number of topics was set before the construction, and the keywords within the different topics in the results output by Python are partially repeated or similar in meaning. Therefore, this paper selects the top 20 topics in the output results based on the keywords in the topic, ata rate of more than 60% of the topic as the same topic, and the keywords selected the top 15 keywords as the keyword content of the topic.
5.4. Result Analysis
After constructing the topic model before and after the policy implementation, the results are shown in
Table 3,
Table 4 and
Table 5 as follows:
- (1)
Topic models before policy implementation
Table 3 and
Table 4 above are the LDA topic model results, and
Table 5 is the online HDP model results, where the topic overview is obtained by the synthesis of the keywords within the topic and the comments under that topic. It can be seen that the content of the two topic models is basically the same, which both focus on the economic drive, poverty alleviation and public welfare, industry development, chaos, and standardization. Additionally, the LDA results are more detailed on the economic side, while the online HDP model complements the topic of industry chaos; for example, topics 5 and 7 in
Table 5 identify consumer rights protection and a government crackdown on illegal activities.
From the results of the topic models before the implementation of the normative policies, and with the rise of the live streaming E-commerce industry, the public fully realizes the potential value of this industry. Especially during the epidemic and economic recovery, live streaming E-commerce experiences an explosive development, which broadens sales channels for all kinds of businesses, explores new living spaces, and accelerates the resumption of work and production. Mainstream media such as CCTV News also give full play to the role of social responsibilities, cooperate with major E-commerce and live streaming platforms, and create various online public welfare promotion activities on live streaming platforms to enrich economic vitality for areas seriously affected by the epidemic or underdeveloped economy and transportation. In addition, the “Internet + rural retail” mode has become the main means of revitalizing the rural economy. Many village officials even personally participate in live streaming events for their agricultural products to contribute to poverty alleviation. In addition, local governments also issue a number of promotion measures for the live streaming E-commerce industry, such as the construction of Internet celebrity bases, the introduction of live streaming hosts, and host training, as discussed by the public in LDA models’ Topic 2. However, grassroots celebrities, celebrity entrepreneurs, and local governments face problems, such as “roll over” events, exaggerated publicity, and false advertisements discussed in LDA models’ topic 7. Moreover, the difficult problems of consumer rights protection are mined by topic 5 in the online HDP models.
Finally, according to the proportion of topics obtained by the LDA module in Python in
Table 3, before the implementation of the normative policy for the live streaming E-commerce industry, the main topics discussed by netizens were related to the economic driving capacity of the industry. In topics 1, 2, 3, 5, and 6, topic 1 accounted for the highest proportion of discussions, up to 35.20%, which was related to the government’s support for the live streaming E-commerce industry in the first half of 2020. However, topics 4 and 7 also accounted for 15.89% of the discussions on industry chaos and industry norms, indicating that the current industry norms have seriously affected people’s consumption experiences and require government control. To analyze the public’s discussion on the industry normative issues of live streaming E-commerce before the policy implementation, this paper further extracts the comment data under topics 4 and 7in
Table 3. The results are shown in
Table 6 below.
From
Table 6, it can be seen that the public calls for the release of normative policies. Before the policy implementation, the public was mainly concerned about supervision, Internet celebrity host behavior, counterfeit and shoddy products, rights protection, false publicity, and other issues. During this period, the live hosts, such as Simba, Luo Yonghao, Fan Bingbing, and other “roll over” events, generated animated discussions among the public. The chaos in the industry is gradually eroding the public’s sense of trust, and the implementation of normative policies is imminent.
- (2)
Topic models after policy implementation
From the results of the LDA models after the policy implementation in
Table 7 and
Table 8, it can be seen that, compared with before the policy implementation, although there are also a lot of discussions on economic development, the proportion of topics related to normative policy has increased significantly, accounting for 42% of the total number of comments. It reflects that the introduction of government policies guides online public opinion.
Combined with the supplementary analysis of
Table 9 about online HDP models, it can be seen that the live streaming E-commerce industry has contributed a lot to the digital transformation and upgrading of many industries in the post-epidemic period, and has also achieved great results in poverty alleviation, local economy and cross-border exchanges. However, with the gradual emergence of chaos, such as lack of after-sales service, brushing, counterfeit goods, and difficulty in safeguarding rights, people’s trust in live streaming E-commerce continues to shrink, and the future of the industry is full of uncertainty. Internet celebrities and grassroots merchants are stuck in “roll-over” events, false publicity, and brushing, not to mention the difficulties in safeguarding rights and nowhere to complain, all of which make the public doubt this industry. In particular, the following topics are contained in both models: Simba, one of the top hosts, sold fake bird nests, and Pan Changjiang, a well-known artist, sold fake wine, which made the public deeply aware of the many problems in the industry. Many netizens express that live streaming E-commerce needs to keep the bottom line of the law based on continuously enriching the live streaming content.
The Code of Conduct for Online Streaming Marketing, implemented in July 2020, regained confidence in the live streaming E-commerce industry. Later,
the Rectification Action of the State Administration for Market Regulation rectified many violations in the industry. In October 2020, the State Administration for Market Regulation even announced
the Measures for the Supervision and Administration of Online Transactions (Draft for Comment) to solicit public opinions. In 2021,
the Measures for the Supervision and Administration of Online Transaction was officially released, clarifying the laws of relevant subjects. It indicates that the legal provisions of the live streaming E-commerce industry are becoming gradually optimized. Especially, topic 8 in the online HDP models shows that people who sell fake goods with live streaming E-commerce can be sentenced to up to 10 years imprisonment. This mandatory law has deterred many unscrupulous businesses.
In addition, to better analyze the concerns of the public on normative policies, the following further analyzes the comments belonging to normative problem topics in the LDA models.
- (3)
The evolution of major topics related to normative issues after policy implementation
The evolution of topics of concern can better reflect the changes in public attention and contribute to the formulation of more specific and effective industry management measures in the future.
Table 10 shows the keywords with the highest proportion of comments on normative issues for each month from 1 July 2020 to 31 August 2021. It can be seen from
Table 10 that after the implementation of the normative policy, with the improvement of laws and the strengthening of supervision, more and more chaos in the industry has been exposed. The most discussed topics are host fraud, sales of fake goods, supervision, and “roll-over” events. Among them, the incident caused by Sinba was the most hotly debated and lasted the longest period. This incident, to a large extent, promoted the implementation of normative policies, thereby raising the public’s awareness of the need to crack down on counterfeiting and safeguard their rights. In the month when the policy was released and implemented, the public’s attention was basically focused on the content of the policy itself, which showed that the public held great support for the implementation of the policy. However, there is a hot discussion on fake host sales and industry supervision every month after policy implementation. It can be seen that the implementation of the policy may not be thorough enough due to the excessive coverage of the entire live streaming E-commerce industry. All kinds of businesses can use live streaming to expand their sales channels, which makes it difficult for the government to implement management requirements. Therefore, it comes up with higher requirements for improving various laws and regulations. Government departments need to combine the laws and regulations of other industries to clarify the responsibilities of various stakeholders in the live streaming E-commerce industry, and it is necessary to continuously revise existing laws or formulate new regulations. Finally, compared with positive events, negative events tend to generate more heated discussions in the short term, and they can be regarded as a leading indicator for predicting the development trend of the industry. Therefore, the analysis of public opinion in the early stage can help predict the evolution of topics in the later period, provide a reference for the active guidance of public opinion and the improvement of policies, and timely and properly solve the shortcomings of the current policy.