Skip to Content
SustainabilitySustainability
  • Article
  • Open Access

11 March 2022

New Energy Vehicle Consumer Demand Mining Research Based on Fusion Topic Model: A Case in China

,
and
1
School of Economics and Management, China University of Mining and Technology, Xuzhou 221116, China
2
School of Mathematics and Information Engineering, Lianyungang Normal College, Lianyungang 222000, China
*
Author to whom correspondence should be addressed.

Abstract

This study extracted the demand preference topic words of new energy vehicle consumers with the help of the topic model, calculated the similarity between the word vectors and the topic keywords and expanded the topic keywords, analyzed and compared the demand topics and feature expansion words of different car models, and summarized the demand differences of other consumer groups. The analysis results show that consumers’ demands of different groups have the exact demand dimensions such as new energy features and brand features, and different demand dimensions such as application, services, and professional performance. The research findings help consumers filter valuable information from online review data and help car companies objectively and accurately obtain consumer demands, develop more reasonable marketing strategies, and achieve healthy and sustainable corporate development.

1. Introduction

Research shows that most consumers browse online reviews before purchasing a product, and online reviews are a vital influence on consumers’ purchase decisions [1]. For expensive durable goods like cars, the high “mismatch cost” makes consumers more inclined to obtain valuable word-of-mouth information through online reviews to assist their purchase decisions. Efficiently mining information from complex online reviews is an essential tool for auto consumers to understand and grasp the performance of cars and form a comprehensive knowledge of auto products.
On the other hand, companies increasingly focus on building and managing online reviews, paying attention to online consumers’ voices, and identifying their needs. Studies have confirmed that including consumers in the product design process is more effective than simply treating consumers as buyers of products. In particular, leading users of products’ involvement can help companies grasp market demand trends in time and obtain more sources of product innovation [2]. Brand images from consumers’ mouths are more readily accepted and spread than those set by companies. A good company image can enhance consumers’ recognition of the company, which directly affects the market acceptance of the product and largely determines the market share of the company and its future development prospects. The above factors make online reviews a “bridge” between consumers and car companies and become an essential tool for car companies to export their brand image and product marketing.
The high price of cars, the high “mismatch cost”, and the information asymmetry have created a “rigid demand” for car consumers to find valuable information from online reviews. This “rigid demand” makes it possible for car companies to use online reviews for online marketing. Based on the fusion topic model, this study makes full use of online reviews as a “bridge” to help consumers filter out valuable information from online review data. On the other hand, it allows car companies to obtain objective and accurate information about consumers’ needs, improve production decisions, and develop more reasonable marketing strategies to achieve healthy and sustainable corporate development.
The remainder of this paper is structured in the following sections. Section 2 presents a literature review of studies about consumer demands. Section 3 introduces topic models that can indicate demand topics and demand attributes, and describes the methodology used in this study. The following Section 4 elaborates on the principles of data set selection, presents the experimental design and results analysis. We discuss our obtained results in Section 5. Concluding remarks appear in Section 6.

3. Methodology and Model

3.1. Demand Topic Extraction Based on the Topic Model

Latent Dirichlet Allocation (LDA) based topic model occupies a crucial position in data mining such as text sentiment classification and information extraction and is commonly used to mine the latent topic information in the corpus in the big data environment. This model was proposed by Blei D M et al. in 2003 [23]. Its core idea is a three-layer Bayesian probabilistic model, which contains a three-layer structure of documents, topics, and words, forming document-topic and topic-word probability distributions. The LDA topic model can extract the deep semantic relationships between terms and documents and effectively extract the hidden topics in large-scale document sets and corpora. It is the most widely used and successful text topic extraction model. The generation formula is shown as follows.
P w i | d j = s = 1 K P w i | k = s × P k = s | d j
where P w i | k = s denotes the probability that word w i belongs to the topic s , and P k = s | d j denotes the probability that the topic s is in text d j [24].
Chen et al. [25] used the LDA topic model approach to analyze and understand the main viewpoints of online public opinion at the next level; Li et al. [26] used the LDA algorithm to construct an online opinion topic identification model and used Sina Weibo as an example to identify the viewpoint topics in online public opinion; Liu et al. [27] used the LDA topic model to mine and parse the text comment’s feature structure and semantic content. They also explored and tracked the evolutionary trends of topics; Guo et al. [28] used the LDA topic model to mine customer online reviews and found 19 dimensions related to the satisfaction of potential hotel customers. According to the results of literature research [29,30,31,32,33], LDA uses an efficient probabilistic inference algorithm to process large-scale data and excels at identifying the implied semantics of large sets of documents.

3.2. Demand Attribute Expansion Based on the Vector Model

Word2vec is a semantic computation tool that trains models through neural network algorithms. It transforms words into vectors and maps them into a high-dimensional space for vector operations, ultimately predicting the terms related to their semantics. Word2vec consists of two models, CBOW and Skip-gram; the former indicates the probability of the current word by contextual words, while the latter predicts the likelihood of contextual words based on the present term.
The steps to expand the demand attributes using Word2vec word vector model are as follows.
(1)
Word vector training. The word vector model and the word vector representation of the corresponding dimension can be obtained using Word2vec tool to train the comment corpus afterword separation. Context window distance and vector space dimension are essential parameters for model training; the more significant the window, the more contextual information involved, the better the vector representation effect.
(2)
Topic word expansion. The trained model is used to calculate the cosine of the angle between the comments in the comment corpus and the given topic words. Several words with higher semantic similarity are selected as candidate feature words to expand the topic words. The semantic similarity is calculated as shown in Equation (2).
S i m w i , w j = cos θ i = 1 n   x i y i i = 1   n x i 2 × i = 1   n y i 2
where S i m w i , w j denotes the word vector cosine similarity between word w i and word w j [34].

3.3. Demand Preference Mining Based on Fusion Topic Model

The LDA model focuses on the co-occurrence of documents and words, extracts deep semantic relationships between terms and documents, and effectively removes the implicit topics in the corpus. However, the shortcoming is that it cannot consider the contextual relationship between words and words. The Word2vec model, on the other hand, focuses on the co-occurrence of context and words and describes the relationships between phrases according to the contextual background. Overall, the advantages and disadvantages of the above two models for semantic analysis complement each other to form the basis for constructing the fusion topic model in this study.
This study collects comment data from online automotive forums and generates an online comment corpus after pre-processing, including word separation and deactivation. Firstly, we extracted the topic words of new energy vehicle consumer demand preference with the help of the LDA model, and then used the trained Word2vec tool to calculate the similarity between word vectors and topic words, and carried out demand attribute expansion, and finally improved the accuracy and stability of topic mining in the form of fusion topic model. The algorithm steps are as follows.
Input: pre-processed comment corpus, set the number of demand preference topics
Output: demand topic attribute expansion words
Algorithm:
(1)
Defining the set of documents D = d 1 , d 2 , d 3 , , d n , α and β are the prior parameters of the Dirichlet function, θ is the polynomial distribution of topics in the documents, which obeys the Dirichlet initial distribution with hyperparameters α , and t h e   is the polynomial distribution of words in the topics, which follows the Dirichlet initial distribution with hyperparameters β . Training the LDA model and obtaining the model parameters θ and ;
(2)
Training the Word2vec model and setting the word vector length.
(3)
For the training set of documents d i   ( d i d 1 , d 2 , d 3 , , d n .
  • Obtaining the topic distribution of document d i .
  • Obtaining the word vectors of the top n topic words, respectively.
  • Selecting the topic k m with the highest probability in d i , select the first k words under the topic word a 1 , a 2 , a 3 , , a k and their probabilities q 1 , q 2 , q 3 , , q k , normalize the probability value, the formula is shown in (3), where: q i indicates the value of p i after normalization.
    q i = p i a = 1 k p a
  • Using Word2vec model to obtain the word vector of each word A 1 , A 2 , A 3 , , A k , the word vectors of k words are weighted and summed to obtain the topic extension feature words, the formula is shown in (4), where B i is the topic extension feature word of document d i .
    B i = b = 1 k q b × A b

4. Results

4.1. Data Sources

According to the report “Economic Operation of the Automotive Industry in December 2020” published by the Ministry of Industry and Information Technology [35], China’s auto sales reached 25.311 million units in 2020, the world’s first for 12 consecutive years. The sales of new energy vehicles reached 1.367 million units, making new energy vehicles a new hot spot in the auto market. New energy vehicle refers to the vehicles that use unconventional vehicle fuel or use conventional fuel but adopt a new vehicle power unit, integrate the advanced technologies of power control and drive of the vehicle with new technology, new structure, and new theory. New energy vehicles can be broadly divided into four categories: pure electric vehicles, hybrid vehicles, fuel cell vehicles, and solar electric vehicles [36].
This study takes the online forum of Auto Home (see www.autohome.com (accessed on 9 February 2022)) as the research object. Auto home is the world’s most visited online car forum. Consumers can post reviews of cars in terms of space, power, handling, and fuel consumption on the “Word of Mouth” channel. Combining the hot spot of new energy vehicles, we selected the “New Energy” section of the “Word of Mouth” channel. According to the price level, the channel divides new energy vehicles into four groups: “below CNY100,000 (USD15,830)”, “CNY100,000–200,000 (USD15,830–31,660)”, “CNY200,000–300,000 (USD31,660–47,490)”, and “over CNY300, 000 (USD47,490)”. We collected online reviews in four groups and explored the demand preference characteristics of new energy vehicle consumers in different price levels and groups. We defined the above four groups as Group A, Group B, Group C, and Group D. Auto Home is a Chinese version of the website, so we collected 18,484 Chinese online reviews of the four groups through the self-coded crawler, including 3910 reviews of Group A, 3157 reviews of Group B, 7141 reviews of Group C and 4276 reviews of Group D. Further, after deactivation and word separation processing, the four groups of corpora were generated and prepared for subsequent analysis.

4.2. Demand Topic Extraction

Determining the number of topics in the LDA model focuses on solving the applicability problem of the mining algorithm. The corresponding LDA topic model is generally optimal when the average similarity of topic structure is most minor. Too large several topics are prone to over fitting. It generates many cases without obvious categorical semantic information. At the same time, too small of several issues do not fully reflect the topic dimension and generate coarse-grained topics, which dramatically impacts classification. Perplexity, as a measure for judging probabilistic models, is a mainstream method to determine the optimal number of topics, which generally shows a decreasing law with the increase of the number of the potential topics, and the smaller the value of perplexity means the more vital the generative ability of the topic model [37]. Therefore, in this study, a relatively small perplexity value and a relatively small number of topics are selected as the optimal model parameters for LDA topic model training [38], and the calculation process is shown in Equation (5).
perplexity D = e x p m = 1 M log P W m m = 1 M N m
where D denotes the set of all words in a document, M denotes the number of copies, W m denotes the words in document m, P   W m denotes the probability of occurrence of a word in a document, and N m represents the number of words in each record.
The group’s A, B, C, and D perplexity values were calculated in Python. The curves showed that the perplexity values of the four data groups were the smallest when the number of topics was 12, 10, 12, and 7, respectively. In general, there is no “perfect result” for classifying topics. It is necessary to try to extract the topics by combining the values of the perplexity method and then to determine the number of topics according to their readability and interpretability. In this study, after extracting topics according to the calculation results of the perplexity method, we found that some topics had too much overlap or similarity. We filtered, merged, and organized the topics by manually reading the comment statements where the keywords were located. We finally formed groups of 4–5 topics and the keywords corresponding to each case. These topics have explicit content, little similarity, and low overlap. Considering the article’s length, only the topics extracted from each group were summarized, as shown in Table 1.
Table 1. Summary of review topics for each group of models.

4.3. Demand Attribute Expansion

Although the topics of each group of car reviews have been summarized according to the model, the keywords corresponding to each group of topics do not fit perfectly with the issues. In other words, the LDA algorithm cannot obtain a “perfect” topic classification result without considering the contextual relationship. Given this, we continued to vectorize the review corpus and find the words that are most semantically similar to the specific terms in the word vector space of four groups of models, i.e., constituting the expanded words of the demand topic features. The steps are decomposed as follows.
(1)
Word frequency statistics were conducted on the comment corpus of the four groups, respectively, and high-frequency words were retained.
(2)
For each group of particular topics in Table 1, two high-frequency words most relevant to the issues are manually selected from the word frequency statistics.
(3)
Find the words with the most similar semantics to the high-frequency words in the word vector space and form the expanded words of consumer demand topic features. The aggregated results are shown in Table 2, Table 3, Table 4 and Table 5.
Table 2. Demand topic feature expansion words (Group A).
Table 3. Demand topic feature expansion words (Group B).
Table 4. Demand topic feature expansion words (Group C).
Table 5. Demand topic feature expansion words (Group D).
The aggregated results are shown in Table 2, Table 3, Table 4 and Table 5. Among them, group A has five demand topics and 82 topic feature expansion words, group B has four demand topics and 57 topic feature expansion words, group C has four demand topics and 68 topic feature expansion words, and group D has five demand topics and 75 topic feature expansion words.

5. Discussions

As an essential dimension to consider in enterprises’ differentiated segmentation business strategy and personalized product design, vehicle model characteristic is an element that cannot be ignored in identifying users’ demand preferences. This study found that by comparing the demand topics and topic feature words of group A, B, C, and D online review.
(1)
One of the topic dimensions discussed by the four groups is “New Energy Features”. The reviews collected in this study are all about new energy vehicles on the market, so the topic of discussion naturally focuses on new energy features, such as the battery, electric consumption, hybrid, blade (referring to the blade battery, a battery product released by BYD on 29 March 2020), and so on. It is worth noting that more new names or proper nouns closely related to new energy vehicles appear in the demand feature words of Group C, such as EV (electric vehicle), HEV (hybrid electric vehicle), PHEV (plug-in hybrid electric vehicle), ECO (an energy-saving mode), X-pedal (energy-saving driving mode), NEDC (a range standard), etc. These characteristic words also reflect that, consumers who buy models in the CNY200,000–300,000 (USD31,660–47,490) range are more familiar with and understand new energy vehicles and have more specialized background knowledge in new energy vehicles.
(2)
Another topic dimension shared by all four groups is “Basic Features”. Keywords such as “acceleration”, “braking”, and “endurance” reflect the basic features of the vehicle that are common to all consumers. As a kind of mobility tool, the essential characteristics of the car, such as material, speed, and configuration, are common topics that all people are concerned about and can easily stimulate discussion. As a new market hotspot, new energy vehicles are becoming more accepted in the consumer community. It is very natural for consumers to pay attention to the basic features of new energy vehicles as they do gasoline vehicles. As a car company, in addition to highlighting new energy features, it is more important to focus on its products’ overall quality and function.
(3)
The topic dimension that is common to both groups A and B is “Brand Features”, indicating that consumers who buy models below CNY100,000 (USD15,830) and CNY100,000–200,000 (USD15,830–31,660) pay more attention to the brand of the car, which is also confirmed by the presence of multiple car companies or model names in the feature expansion words. The “Subjective Experience” is a common topic discussed in groups C and D, indicating that consumers who buy models between CNY200,000–300,000 (USD31,660–47,490) and those above CNY300,000 (USD47,490) focus on the consumer experience of new energy vehicles and tend to express their subjective feelings and sensations in the process of using them. In comparison, consumers who choose models above CNY300,000 (USD47,490) have a more prosperous and more diverse range of emotional expressions.
(4)
One of the topic dimensions of the discussion specific to Group A models is “Application”. This indicates that consumers who buy models under CNY100,000 (USD15,830) are more concerned about how to use new energy vehicles, which is more specific in the corresponding feature expansion words, such as “commuting”, “grocery shopping”, “touring around”, and “shopping”. Another topic dimension is “Personalized Configuration”. In this discussion of entry-level new energy models, personalized configuration information such as “mobile phone”, “blue tooth”, and “induction” are the shared demand preferences of consumers.
(5)
The topic dimension specific to Group C models is “Horizontal Comparison”. By searching the online comments from consumers who bought models between CNY200,000 (USD31,660) and CNY 300,000 (USD47,490), we found that the keywords “foreign”, “Tenna”, and “NIO auto” appear in the topic feature expansion words mainly refer to the comparison between the purchased model and other car brands or models. The discussion topics specific to Group D models are “Service” and “Professional Performance”. The case of “Service” corresponds to keywords such as “get a car”, “change of battery”, “test driving”, “insurance”, etc. After searching the corresponding comment corpus, we found that mid-to-high-end consumers who buy models over CNY300,000 (USD47,490) pay more attention to the service content and quality during the purchase and use process. At the same time, the words “all aluminum”, “high frequency”, and “caliper” corresponding to “Professional Performance” indicate that mid-to-high-end consumers have more professional knowledge about cars and care more about the professional performance of new energy vehicles, which is also confirmed and reflected in other demand feature expansion words.

6. Conclusions

Mining analysis and knowledge discovery through massive data is a common concern of academia and industry in big data. With the successful development of online forums, consumers share their information on product demands, purchase preferences, and consumption experiences in major automotive platforms, which objectively provides a new online marketing basis for car companies. This study crawled 18,484 online consumer reviews of 37 new energy vehicles in four groups: “below CNY100,000 (USD15,830)”, “CNY100,000–200,000 (USD15,830–31,660)”, “CNY200,000–300,000 (USD31,660–47,490)”, and “over CNY300, 000 (USD47,490)” in the “New Energy” section of the online forum of “Auto Home”. We used the LDA algorithm to construct a three-layer text probability model of “document–topic–word” and extracted and summarized the demand topics of new energy vehicle consumers who bought different models. Using the Word2vec algorithm and combing word vector similarity, we expanded the topic keywords and formed topic feature expansion words. We identified the focus features of online reviews about car consumption and the consumer demand dimensions hidden in the reviews. Finally, we analyzed and compared different groups of consumer demand topics and topic feature expansion words and summarized the demand differences of different consumer groups.
With the help of research results, consumers can more easily understand the topics and priorities of each other’s discussions. And car companies get information about consumers’ demands and feedback on product and service quality, so they can quickly adjust their business strategies, improve product and service quality, and get twice as much performance with half the effort.
Further research regarding possible improvements should be carried out based on the customer demand from the traditional fuel vehicle and new energy vehicle. On the one hand, traditional fuel vehicles are still sought after by consumers because of their endurance and service life; on the other hand, new energy vehicles are gradually becoming a new hot spot in the auto market because of their energy-saving and environmental protection, low maintenance costs, and significant purchase discounts. Further study will focus on how consumers choose to buy fuel or new energy vehicles, the similarities and differences in consumer demand between the two types of cars, and what the differences in consumer demand tell us about the marketing of automotive companies.

Author Contributions

Conceptualization, X.W. and T.L.; methodology, X.W.; software, X.W. and L.F.; validation, X.W., T.L. and L.F.; formal analysis, X.W.; investigation, X.W.; resources, X.W.; data curation, X.W. and L.F.; writing—original draft preparation, X.W.; writing—review and editing, X.W., T.L. and L.F.; visualization, X.W.; supervision, T.L.; project administration, X.W.; funding acquisition, X.W. and T.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Social Science Foundation of China(Grant No. 20FGLB011), Social Science Foundation Project of Lianyungang City (Grant No. 20LKT1016), High-level Research Project of Lianyungang Normal College (Grant No. LSZGJB202004), and Haiyan Project of Lianyungang City (Grant No. 2020).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Data will be made available from the corresponding author on reasonable request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Chevalier, A.; Mayzlin, D. The effect of word of mouth on sales: Online book reviews. J. Mark. Res. 2006, 43, 345–354. [Google Scholar] [CrossRef] [Green Version]
  2. Tuarob, S.; Tucker, S. Automated discovery of lead users and latent product features by mining large-scale social media networks. J. Mech. Des. 2015, 137, 071402. [Google Scholar] [CrossRef]
  3. Al-Mahish, M.; AlDossari, N.; Almarri, A. Consumer’s demand for Disinfectants and Protective Gear from COVID-19 infection in Al-Hofuf, Saudi. J. Infect. Dev. Ctries. 2021, 11, 1618–1624. [Google Scholar] [CrossRef]
  4. Baarsma, B.; Groenewegen, J. COVID-19 and the Demand for Online Grocery Shopping: Empirical Evidence from the Netherlands. Economist 2021, 4, 407–421. [Google Scholar] [CrossRef]
  5. Wu, J.; Wang, Y. Discovery of associated consumer demands: Construction of a co-demanded product network with community detection. Expert Syst. Appl. 2021, 178, 115038. [Google Scholar] [CrossRef]
  6. Park, M.; Kim, M.; Ryu, S. The relationship between network governance and unilateral governance in dynamic consumer demand. Ind. Mark. Manag. 2020, 84, 194–201. [Google Scholar] [CrossRef]
  7. Pallant, J.; Sands, S.; Karpen, I. Product customization: A profile of consumer demand. J. Retail. Consum. Serv. 2020, 54, 102030. [Google Scholar] [CrossRef]
  8. Zhang, H.; Rao, H.; Feng, J. Product innovation based on online review data mining: A case study of Huawei phones. Electron. Commer. Res. 2018, 3, 3–22. [Google Scholar] [CrossRef]
  9. Nikumanesh, E.; Bohlouli, M.; Fathi, M. Knowledge discovery from online customer reviews towards product improvement. In Proceedings of the 13th International Conference Applied Computing 2016 Proceedings, Mannheim, Germany, 28–30 October 2016; pp. 211–214. [Google Scholar]
  10. Sun, B.; Shen, R. Product Demand Preference Discrimination and Customer Segmentation Based on Online Reviews: The Case of Smartphones. China Manag. Sci. Available online: https://doi.org/10.16381/j.cnki.issn1003-207x.2020.0164 (accessed on 6 February 2022). [CrossRef]
  11. Zhang, H. Factors influencing consumption satisfaction of fresh agricultural products in e-commerce—An exploratory analysis based on online reviews. Jiangsu Agric. Sci. 2019, 47, 4–8. [Google Scholar]
  12. Zhou, Q.; Zhang, C. Fine-grained attribute extraction for online user reviews. J. Intell. 2017, 36, 484–493. [Google Scholar]
  13. Dertwinkel-Kalt, M.; Koster, M.; Sutter, M. To buy or not to buy? Price salience in an online shopping field experiment. Eur. Econ. Rev. 2020, 130, 103593. [Google Scholar] [CrossRef]
  14. Singh, A.; Jenamani, M.; Thakkar, J.; Rana, N. Propagation of online consumer perceived negativity: Quantifying the effect of supply chain underperformance on passenger car sales. J. Bus. Res. 2021, 132, 102–114. [Google Scholar] [CrossRef]
  15. Hee, H. A Study on the Consumer Behaviors and Satisfaction Toward Hybrid Cars: Focused on Consumers’ Perceived Cost and Consumption Values. Korean J. Hum. Ecol. 2021, 5, 783–801. [Google Scholar]
  16. Galarraga, I.; Kallbekken, S.; Silvestri, A. Consumer purchases of energy-efficient cars: How different labeling schemes could affect consumer response to price changes. Energy Policy 2020, 137, 111181. [Google Scholar] [CrossRef]
  17. Alganad, A.; Isa, N.; Fauzi, W. Boosting green cars retail in Malaysia: The influence of conditional value on consumer’s behavior. J. Distrib. Sci. 2021, 7, 87–100. [Google Scholar]
  18. Jiang, Z.; Dennis, Z.; Tat, C. How Do Bonus Payments Affect the Demand for Auto Loans and Their Delinquency? J. Mark. Res. 2021, 3, 476–496. [Google Scholar] [CrossRef]
  19. Ni, M.; Xu, X.; Deng, S. Extended QFD and data-mining-based methods for supplier selection in mass customization. Int. J. Comput. Integr. Manuf. 2007, 20, 280–291. [Google Scholar] [CrossRef]
  20. Yang, Y.; Zeng, Z.; Xu, J. Customer demand classification for automobile service based on analytical Kano model. Boletín Técnico 2017, 55, 59–67. [Google Scholar]
  21. Wang, S.; Li, D.; Li, Y. Sentiment mining of product word-of-mouth data based on joint model. J. Tsinghua Univ. 2017, 9, 926–931. [Google Scholar]
  22. Huang, H. Research and Application of Sentiment Classification for Online Reviews of Automobiles. Master’s Thesis, Harbin Institute of Technology, Harbin, China, 2013; pp. 167–168. [Google Scholar]
  23. Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent Dirichlet Allocation. J. Mach. Learn. Res. 2003, 3, 993–1022. [Google Scholar]
  24. Wu, Y.; Huang, K.; Wang, X. Method of Emotional Classification in Short Texts Combined with LDA Models. J. Chin. Comput. Syst. 2019, 10, 2082–2086. [Google Scholar]
  25. Chen, X.; Gao, C. An LDA Topic Modeling Approach for Online Opinion Extraction. Lib. Intell. Work. 2015, 59, 21–26. [Google Scholar]
  26. Li, Z.; Ding, S. A study on the identification of opinion topics on the Internet. Data Anal. Knowl. Discov. 2017, 1, 18–30. [Google Scholar]
  27. Liu, S.; Peng, X. A study of learner topic mining for MOOC course reviews. Res. Electrochem. Teach. Learn. 2017, 38, 30–36. [Google Scholar]
  28. Guo, Y.; Barnes, S.J.; Jia, Q. Mining meaning from online ratings and reviews: Tourist satisfaction analysis using latent Dirichlet allocation. Tour. Manag. 2017, 59, 467–483. [Google Scholar] [CrossRef] [Green Version]
  29. Ha, T.; Lee, J.; Lee, C.-H.; Coh, B. The Prediction of Long-Term Survival of Artificial Intelligence Patents Based on Deep-Learning and Latent Dirichlet Allocation Modeling. Int. Telecommun. Policy Rev. 2021, 1, 27–50. [Google Scholar] [CrossRef]
  30. Edison, H.; Carcel, H. Text data analysis using Latent Dirichlet Allocation: An application to FOMC transcripts. Appl. Econ. Lett. 2020, 1, 38–42. [Google Scholar] [CrossRef] [Green Version]
  31. Ozyurt, B.; Akcayol, M. A new topic modeling based approach for aspect extraction in aspect based sentiment analysis: SS-LDA. Expert Syst. Appl. 2021, 168, 114231. [Google Scholar] [CrossRef]
  32. Ekinci, E.; Omurca, S.I. Concept-LDA: Incorporating Babelfy into LDA for aspect extraction. J. Inf. Sci. 2020, 3, 406–418. [Google Scholar] [CrossRef]
  33. Joung, J.; Kim, H. Automated Keyword Filtering in Latent Dirichlet Allocation for Identifying Product Attributes From Online Reviews. J. Mech. Des. 2021, 8, 084501. [Google Scholar] [CrossRef]
  34. Zeng, Q.; Hu, X.; Li, C. Extracting Keywords with Topic Embedding and Network Structure Analysis. Data Anal. Knowl. Discov. 2019, 7, 52–60. [Google Scholar]
  35. Ministry of Industry and Information Technology Announced the “December 2020 Economic Operation of the Automotive Industry”. Available online: https://www.miit.gov.cn/jgsj/zbys/gzdt/art/2021/art_2f2576adb8004b07ae2711df028a14b0.html (accessed on 6 February 2022).
  36. Zhang, T.; Ma, C.; Yong, C. Development Status and Trends of New Energy Vehicles in China. In Proceedings of the 4th International Conference on Energy Science and Applied Technology (ESAT), Chongqing, China, 29–30 December 2018; p. 020012. [Google Scholar]
  37. Guan, P.; Wang, Y. Research on the optimal topic number determination method of LDA topic model in scientific and technological intelligence analysis. Mod. Libr. Inf. Technol. 2016, 9, 42–50. [Google Scholar]
  38. Zeng, Z.; Wang, J. Research on microblog rumor identification based on LDA and random forest-an an example of 2016 haze rumor. J. Intell. 2019, 38, 89–96. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.