Next Article in Journal
Wavelets in Combination with Stochastic and Machine Learning Models to Predict Agricultural Prices
Next Article in Special Issue
Differentiating Chat Generative Pretrained Transformer from Humans: Detecting ChatGPT-Generated Text and Human Text Using Machine Learning
Previous Article in Journal
Inference Based on the Stochastic Expectation Maximization Algorithm in a Kumaraswamy Model with an Application to COVID-19 Cases in Chile
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Reinforcement Learning Recommendation Algorithm Based on Label Value Distribution

1
School of Economics and Management, Dalian Jiaotong University, Dalian 116021, China
2
Faculty of Education, The University of Hong Kong, Hong Kong 999077, China
3
Institute of Computing Technology, China Academy of Railway Sciences, Beijing 100081, China
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(13), 2895; https://doi.org/10.3390/math11132895
Submission received: 12 May 2023 / Revised: 5 June 2023 / Accepted: 12 June 2023 / Published: 28 June 2023

Abstract

Reinforcement learning is an important machine learning method and has become a hot popular research direction topic at present in recent years. The combination of reinforcement learning and a recommendation system, is a very important application scenario and application, and has always received close attention from researchers in all sectors of society. In this paper, we first propose a feature engineering method based on label distribution learning, which analyzes historical behavior is analyzed and constructs, whereby feature vectors are constructed for users and products via label distribution learning. Then, a recommendation algorithm based on value distribution reinforcement learning is proposed. We first designed the stochastic process of the recommendation process, described the user’s state in the interaction process (by including the information on their explicit state and implicit state), and dynamically generated product recommendations through user feedback. Next, by studying hybrid recommendation strategies, we combined the user’s dynamic and static information to fully utilize their information and achieve high-quality recommendation algorithms. Finally, the algorithm was designed and validated, and various relevant baseline models were compared to demonstrate the effectiveness of the algorithm in this study. With this study, we actually tested the remarkable advantages of relevant design models based on nonlinear expectations compared to other homogeneous individual models. The use of recommendation systems with nonlinear expectations has considerably increased the accuracy, data utilization, robustness, model convergence speed, and stability of the systems. In this study, we incorporated the idea of nonlinear expectations into the design and implementation process of recommendation systems. The main practical value of the improved recommendation model is that its performance is more accurate than that of other recommendation models at the same level of computing power level. Moreover, due to the higher amount of information that the enhanced model contains, it provides theoretical support and the basis for an algorithm that can be used to achieve high-quality recommendation services, and it has many application prospects.
Keywords: recommendation system; neural networks; value distribution reinforcement learning; label distribution learning; nonlinear expectation recommendation system; neural networks; value distribution reinforcement learning; label distribution learning; nonlinear expectation

Share and Cite

MDPI and ACS Style

Guo, Z.; Fu, J.; Sun, P. Reinforcement Learning Recommendation Algorithm Based on Label Value Distribution. Mathematics 2023, 11, 2895. https://doi.org/10.3390/math11132895

AMA Style

Guo Z, Fu J, Sun P. Reinforcement Learning Recommendation Algorithm Based on Label Value Distribution. Mathematics. 2023; 11(13):2895. https://doi.org/10.3390/math11132895

Chicago/Turabian Style

Guo, Zhida, Jingyuan Fu, and Peng Sun. 2023. "Reinforcement Learning Recommendation Algorithm Based on Label Value Distribution" Mathematics 11, no. 13: 2895. https://doi.org/10.3390/math11132895

APA Style

Guo, Z., Fu, J., & Sun, P. (2023). Reinforcement Learning Recommendation Algorithm Based on Label Value Distribution. Mathematics, 11(13), 2895. https://doi.org/10.3390/math11132895

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop