Explainable Recommender Systems Through Reinforcement Learning and Knowledge Distillation on Knowledge Graphs
Abstract
:1. Introduction
2. Related Works
3. Framework
3.1. Problem Formulation
3.2. KG Construction
- E is a finite set of nodes (entities).
- R is a finite set of edge types (relationships).
- S is a finite set of triples, where:
- User data (CVs) contain user profiles, including their skills and occupations.
- Skill data represent skills with unique identifiers and descriptive labels (e.g., EBSCO URLs).
- Job data include job descriptions, specifying the required skills and occupations.
- Entities (Nodes): users, skills, jobs, and occupations.
- Relations (Edges): connections between entities based on the data, such as “has_skill” which links users or jobs to skills, “has_occupation” which connects users or jobs to occupations, “requires_skill” which relates jobs to required skills, and “requires_occupation” which associates jobs to associated occupations.
3.3. Graph Embedding Workflow
- (1)
- Entities and relationsEach entity () and the relationship () in the KG is represented as a vector in a continuous d-dimensional space ():Suppose that we have a user user_1, a skill Python, and a job Data_Scientist in the KG, we will have relationships such as (user_1, has_skill, Python) and (Data_Scientist, requires_skill, Python).
- (2)
- Scoring functionIn our framework, we use TransE model and the loss function is typically a margin-based ranking loss, also known as a pairwise ranking loss:TransE is a straightforward, efficient, and interpretable technique that struggles with one-to-many relationships. TransE is a fundamental KG embedding model that has resulted in numerous modifications, including TransH, TransR, and TransD, that address its shortcomings while maintaining the primary translation process [25].We apply this function to the triples extracted from KG (user_1, has_skill, Python). If the calculated score is high, it indicates that this triple is valid and that the model has learned a strong relationship between the user and the skill, which means user_1 probably has the skill Python.
- (3)
- Learning entity and relation representations involve defining how they are expressed as continuous vectors.Example of a positive triple: (user_1, has_skill, Python) has a high score, and an example of a negative triple: (user_1, has_skill, Snorkeling) has a low score. Over time, the embeddings are adjusted so that the valid relationships in the KG are properly represented in the vector space.
- Concatenation:
- Averaging:
- is the distilled embedding;
- is the mapping function (a neural network) trained to approximate.
3.4. Recommendation Process
- Environment
- Agent
- (i)
- in case of the overlap between the user’s skills and the job required skills;
- (ii)
- in case of non-overlap.
- (i)
- Exploration: With probability , the agent selects jobs randomly to explore new possibilities.
- (ii)
- Exploitation: With probability , the agent selects the job with the highest Q-value based on its current policy.
- Policy
3.5. Explanation Generation
3.6. Evaluation
- Precision measures the proportion of recommended items that are relevant to the user.
- Recall measures the proportion of all relevant items that are recommended to the user.
- The F1-score is the harmonic mean of Precision and Recall, providing a balance between the two.
- Normalized discounted cumulative gain (NDCG) evaluates the quality of the ranking of the recommended items, considering the positions of relevant items in the recommendation list.
4. Experimental Setup
4.1. Dataset
4.2. Libraries and Tools
4.3. Hyperparameter Tuning
4.4. Implementation Details
5. Discussion
5.1. Performance Results
5.2. Explanations Results
- For recommendation 1 with LIME explanation, job_System_Engineer_-_Athens has the highest Q-value at 8.78. Positive factors include “lead a team” and “Basque”, indicating leadership, and Basque language skills enhance fit for this role. Negative factors such as “setting prices of menu items”, “design user interface”, and “data mining” suggest that these skills are less relevant for this role. SHAP results show positive contributions from skills in alter management (+0.51), literature (+0.35), and knowledge of financial products (+0.31), indicating that adaptability, communication, and financial knowledge may be critical to the role of system engineering with financial systems. Negative contributions include soldering techniques (−0.38) and managing supplies (−0.34), suggesting that these hands-on skills are less aligned with the job.
- For recommendation 2 with LIME explanation, job_DevOps_Architect_[50,000_-_70,000_GBP], Slough has a Q-value of 1.63. Positive contributions include “program work according to incoming orders” and “customs law”, while “circular economy” and “direct inward dialing” may be less relevant to the user’s skills. SHAP results indicate that financial product knowledge (+0.010) and customer communication (+0.08) contribute positively, indicating a need for client engagement skills. Negative contributions, such as questioning techniques (−0.09), literature (−0.08), and Haskell (−0.08), suggest that skills such as Haskell programming may improve suitability.
- For recommendation 3 with LIME explanation, job_Senior_.NET_Software_Engineer has a Q-value of 1.42. Positive contributions include technical skills such as “energy efficiency”, “customer service”, “iOS”, and “production processes”. SHAP explanation highlights data gathering (+0.12), typography (+0.10), and customer communication (+0.09) as beneficial. Negative contributions include questioning techniques (−0.13) and lack of knowledge of Apache Tomcat (−0.09), suggesting that improving server technology skills may enhance fit.
- For recommendation 4 with LIME explanation, job_Care_Assistant, Tonbridge has a Q-value of 1.27. Positive contributions include “represent the organization”, “surveying”, and “strategic planning”. Negative contributions like “perform ground-handling maintenance procedures” imply that these skills may not be needed for the role. SHAP results indicate positive contributions from data collection (+0.13), familiarity with Absorb (learning management systems) (+0.010), digital printing (+0.08), and customer insight (+0.06), which align well with the needs of a care role.
- For recommendation 5 with LIME explanation, Job_Localities_Social_Worker_-_Low_Caseload, Berkshire has a Q-value of 1.25. Positive contributions include “robotics” and “file documents”, highlighting useful technical and administrative skills. Negative factors such as “physics”, “logistics”, and “Slovak” suggest some misalignment with job expectations. SHAP results point to positive contributions from digital printing (+0.08), familiarity with Absorb (+0.07), and customer satisfaction (+0.06), while questioning techniques (−0.09) and OCR (−0.06) are less emphasized.
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Scherer, J.; Kiparski, G. Buchbesprechungen. Feiler, Lukas/Forgó, Nikolaus/Weigl, Michaela: The Eu General Data Protection Regulation (Gdpr): A Commentary. Comput. Und Recht 2018, 34, 69–70. [Google Scholar] [CrossRef]
- McCrie, R.; Lee, S.Z. Decisions on Hiring to Meet Protective Goals. In Security Operations Management, 4th ed.; McCrie, R., Lee, S.Z., Eds.; Butterworth-Heinemann: Oxford, UK, 2022; pp. 71–118. [Google Scholar] [CrossRef]
- Suadicani, P.; Bonde, J.P.; Olesen, K.; Gyntelberg, F. Job satisfaction and intention to quit the job. Occup. Med. 2013, 63, 96–102. [Google Scholar] [CrossRef]
- Vellido, A. The importance of interpretability and visualization in machine learning for applications in medicine and health care. Neural Comput. Appl. 2020, 32, 18069–18083. [Google Scholar] [CrossRef]
- Brigo, D.; Huang, X.; Pallavicini, A.; de Ocáriz Borde, H.S. Interpretability in deep learning for finance: A case study for the Heston model. arXiv 2021, arXiv:2104.09476. [Google Scholar] [CrossRef]
- Pessach, D.; Singer, G.; Avrahami, D.; Ben-Gal, H.C.; Shmueli, E.; Ben-Gal, I. Employees recruitment: A prescriptive analytics approach via machine learning and mathematical programming. Decis. Support Syst. 2020, 134, 113290. [Google Scholar] [CrossRef] [PubMed]
- Linardatos, P.; Papastefanopoulos, V.; Kotsiantis, S. Explainable AI: A Review of Machine Learning Interpretability Methods. Entropy 2021, 23, 18. [Google Scholar] [CrossRef]
- Raghavan, M.; Barocas, S.; Kleinberg, J.M.; Levy, K. Mitigating bias in algorithmic hiring: Evaluating claims and practices. In Proceedings of the FAT*’20: Conference on Fairness, Accountability, and Transparency, Barcelona, Spain, 27–30 January 2020; Hildebrandt, M., Castillo, C., Celis, L.E., Ruggieri, S., Taylor, L., Zanfir-Fortuna, G., Eds.; ACM: New York, NY, USA, 2020; pp. 469–481. [Google Scholar] [CrossRef]
- Schemmer, M.; Kühl, N.; Satzger, G. Intelligent Decision Assistance Versus Automated Decision-Making: Enhancing Knowledge Workers Through Explainable Artificial Intelligence. In Proceedings of the 55th Hawaii International Conference on System Sciences, HICSS 2022, Virtual Event. Maui, HI, USA, 4–7 January 2022; ScholarSpace: Ann Arbor, MI, USA, 2022; pp. 1–10. [Google Scholar]
- Ye, H.; Vedula, S.; Chen, Y.; Yang, Y.; Bronstein, A.M.; Dreslinski, R.G.; Mudge, T.N.; Talati, N. GRACE: A Scalable Graph-Based Approach to Accelerating Recommendation Model Inference. In Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, ASPLOS 2023, Vancouver, BC, Canada, 25–29 March 2023; Aamodt, T.M., Jerger, N.D.E., Swift, M.M., Eds.; ACM: New York, NY, USA, 2023; pp. 282–301. [Google Scholar] [CrossRef]
- Kostis, I.; Sarafis, D.; Karamitsios, K.; Kotrotsios, K.; Kravari, K.; Badica, C.; Chatzimisios, P. Towards an Integrated Retrieval System to Semantically Match CVs, Job Descriptions and Curricula. In Proceedings of the 26th Pan-Hellenic Conference on Informatics, PCI 2022, Athens, Greece, 25–27 November 2022; ACM: New York, NY, USA, 2022; pp. 151–157. [Google Scholar] [CrossRef]
- Li, Q.; Xia, W.; Yin, L.; Jin, J.; Yu, Y. Privileged Knowledge State Distillation for Reinforcement Learning-based Educational Path Recommendation. In Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2024, Barcelona, Spain, 25–29 August 2024; Baeza-Yates, R., Bonchi, F., Eds.; ACM: New York, NY, USA, 2024; pp. 1621–1630. [Google Scholar] [CrossRef]
- Liu, H.; Sun, Z.; Qu, X.; Yuan, F. Top-aware recommender distillation with deep reinforcement learning. Inf. Sci. 2021, 576, 642–657. [Google Scholar] [CrossRef]
- Adomavicius, G.; Tuzhilin, A. Toward the Next Generation of Recommender Systems: A Survey of the State-of-the-Art and Possible Extensions. IEEE Trans. Knowl. Data Eng. 2005, 17, 734–749. [Google Scholar] [CrossRef]
- Ribeiro, M.T.; Singh, S.; Guestrin, C. “Why Should I Trust You?”: Explaining the Predictions of Any Classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, 13–17 August 2016; KDD’16. pp. 1135–1144. [Google Scholar] [CrossRef]
- Lundberg, S.M.; Lee, S. A Unified Approach to Interpreting Model Predictions. In Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, Long Beach, CA, USA, 4–9 December 2017; Guyon, I., von Luxburg, U., Bengio, S., Wallach, H.M., Fergus, R., Vishwanathan, S.V.N., Garnett, R., Eds.; ACM: New York, NY, USA, 2017; pp. 4765–4774. [Google Scholar]
- Vultureanu-Albisi, A.; Badica, C. Improving Students’ Performance by Interpretable Explanations using Ensemble Tree-Based Approaches. In Proceedings of the 15th IEEE International Symposium on Applied Computational Intelligence and Informatics, SACI 2021, Timisoara, Romania, 19–21 May 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 215–220. [Google Scholar] [CrossRef]
- Bouleanu, D.; Badica, C.; Kravari, K. Using SHAP-Based Interpretability to Understand Risk of Job Changing. In Proceedings of the Intelligent Distributed Computing XV, 15th International Symposium on Intelligent Distributed Computing, IDC 2022, Virtual Event. Bremen, Germany, 14–15 September 2022; Braubach, L., Jander, K., Badica, C., Eds.; Studies in Computational Intelligence. Springer: Berlin/Heidelberg, Germany, 2022; Volume 1089, pp. 41–50. [Google Scholar] [CrossRef]
- Wang, X.; Wang, D.; Xu, C.; He, X.; Cao, Y.; Chua, T. Explainable Reasoning over Knowledge Graphs for Recommendation. In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019, The Thirty-First Innovative Applications of Artificial Intelligence Conference, IAAI 2019, The Ninth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2019, Honolulu, HI, USA, 27 January–1 February 2019; AAAI Press: Washington, DC, USA, 2019; pp. 5329–5336. [Google Scholar] [CrossRef]
- Xian, Y.; Fu, Z.; Muthukrishnan, S.; de Melo, G.; Zhang, Y. Reinforcement Knowledge Graph Reasoning for Explainable Recommendation. In Proceedings of the 42nd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2019, Paris, France, 21–25 July 2019; Piwowarski, B., Chevalier, M., Gaussier, É., Maarek, Y., Nie, J., Scholer, F., Eds.; ACM: New York, NY, USA, 2019; pp. 285–294. [Google Scholar] [CrossRef]
- Wang, P.; Fan, Y.; Xia, L.; Zhao, W.X.; Niu, S.; Huang, J.X. KERL: A Knowledge-Guided Reinforcement Learning Model for Sequential Recommendation. In Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2020, Virtual Event. Xi’an, China, 25–30 July 2020; ACM: New York, NY, USA, 2020; pp. 209–218. [Google Scholar] [CrossRef]
- Kang, S.; Hwang, J.; Kweon, W.; Yu, H. DE-RRD: A Knowledge Distillation Framework for Recommender System. In Proceedings of the CIKM’20: The 29th ACM International Conference on Information and Knowledge Management, Virtual Event, Ireland, 19–23 October 2020; d’Aquin, M., Dietze, S., Hauff, C., Curry, E., Cudré-Mauroux, P., Eds.; ACM: New York, NY, USA, 2020; pp. 605–614. [Google Scholar] [CrossRef]
- Hogan, A.; Blomqvist, E.; Cochez, M.; d’Amato, C.; de Melo, G.; Gutiérrez, C.; Kirrane, S.; Labra Gayo, J.E.; Navigli, R.; Neumaier, S.; et al. Knowledge Graphs; Number 22 in Synthesis Lectures on Data, Semantics, and Knowledge; Springer: Berlin/Heidelberg, Germany, 2021. [Google Scholar] [CrossRef]
- Wang, Q.; Mao, Z.; Wang, B.; Guo, L. Knowledge Graph Embedding: A Survey of Approaches and Applications. IEEE Trans. Knowl. Data Eng. 2017, 29, 2724–2743. [Google Scholar] [CrossRef]
- Asmara, S.M.; Sahabudin, N.A.; Ismail, N.S.N.B.; Sabri, I.A.A. A Review of Knowledge Graph Embedding Methods of TransE, TransH and TransR for Missing Links. In Proceedings of the 8th IEEE International Conference On Software Engineering and Computer Systems, ICSECS 2023, Penang, Malaysia, 25–27 August 2023; IEEE: Piscataway, NJ, USA, 2023; pp. 470–475. [Google Scholar] [CrossRef]
- Velickovic, P.; Cucurull, G.; Casanova, A.; Romero, A.; Liò, P.; Bengio, Y. Graph Attention Networks. arXiv 2017, arXiv:1710.10903. [Google Scholar]
- Knyazev, B.; Taylor, G.W.; Amer, M.R. Understanding attention in graph neural networks. arXiv 2019, arXiv:1905.02850. [Google Scholar]
- Shi, M.; Qin, F.; Ye, Q.; Han, Z.; Jiao, J. A scalable convolutional neural network for task-specified scenarios via knowledge distillation. arXiv 2016, arXiv:1609.05695. [Google Scholar]
- Levine, S.; Kumar, A.; Tucker, G.; Fu, J. Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems. arXiv 2020, arXiv:2005.01643. [Google Scholar]
- Fan, J.; Wang, Z.; Xie, Y.; Yang, Z. A Theoretical Analysis of Deep Q-Learning. In Proceedings of the 2nd Annual Conference on Learning for Dynamics and Control, L4DC 2020, Online Event, Berkeley, CA, USA, 11–12 June 2020; Bayen, A.M., Jadbabaie, A., Pappas, G.J., Parrilo, P.A., Recht, B., Tomlin, C.J., Zeilinger, M.N., Eds.; Proceedings of Machine Learning Research: Brooklyn, NY, USA; 2020; Volume 120, pp. 486–489. [Google Scholar]
- Muraretu, I.; Bouleanu, D.; Vultureanu-Albisi, A.; Badica, C.; Sarafis, D.; Kravari, K.; Chatzimisios, P. A Microservice-Based Multi-agent System for the Job Market. In Proceedings of the Intelligent Distributed Computing XVI-16th International Symposium on Intelligent Distributed Computing, IDC 2023, Hamburg, Germany, 13–15 September 2023; Köhler-Bußmeier, M., Renz, W., Sudeikat, J., Eds.; Springer: Berlin/Heidelberg, Germany, 2023; Volume 1138, pp. 301–316. [Google Scholar] [CrossRef]
Recommendations | Q-Value | Positive Contributions | Negative Contributions |
---|---|---|---|
job_System_Engineer_ _Athens | 8.78 |
- lead a team (+0.03) - Basque (+0.02) |
- set prices of menu items (−0.03) - design user interface (−0.03) - data mining (−0.03) - astrology (−0.02) - represent the company (−0.02) - mechatronics (−0.01) |
job_DevOps_Architect_ [50,000_-_70,000_GBP],_Slough_ | 1.63 |
- programme work based on incoming orders (+0.02) - customs law (+0.02) - teach history (+0.02) - security threats (+0.02) - style sheet languages (+0.02) |
- circular economy (−0.02) - direct inward dialing (−0.02) - address an audience (−0.01) - nanoelectronics (−0.01) - political science (−0.01) |
job_Senior._NET_Software_ _Engineer | 1.42 |
- joint ventures (+0.02) - perform procurement processes (+0.02) - energy efficiency (+0.02) - customer service (+0.02) - iOS (+0.02) - production processes (+0.02) - electrical machines (+0.01) - labour market (+0.01) |
- seek innovation in current practices (−0.02) - history (−0.02) |
job_Care_Assistant,_Tonbridge_ | 1.27 |
- represent the organisation (+0.03) - surveying (+0.03) - portfolio management in textile manufacturing (+0.02) - ecosystems (+0.02) - strategic planning (+0.02) - competition law (+0.02) |
- perform maintenance procedures (−0.02) - LESS (−0.02) - metrology (−0.01) - make reservations (−0.01) |
job_Localities_Social_Worker_–_ Low_Caseload,_Berkshire_ | 1.25 |
- robotics (+0.02) - file documents (+0.02) |
- physics (−0.03) - logistics (−0.03) - Slovak (−0.03) - security threats (−0.02) - Macedonian (−0.02) - Shiva (−0.02) - think creatively (−0.01) - adult education (−0.01) |
Recommendations | Q-Value | Positive Contributions | Negative Contributions |
---|---|---|---|
job_System_Engineer_ _Athens | 8.78 |
- alter management (+0.51) - literature (+0.35) - financial products (+0.31) |
- soldering techniques (−0.38) - manage supplies (−0.34) |
job_DevOps_Architect_ [50,000_-_70,000_GBP],_Slough_ | 1.63 |
- financial products (+0.10) - communicate with customers (+0.08) |
- use questioning techniques (−0.09) - literature (−0.08) - Haskell (−0.08) |
job_Senior._NET_Software_ _Engineer | 1.42 |
- gather data (+0.12) - typography (+0.10) - communicate with customers (+0.09) |
- use questioning techniques (−0.13) - Apache Tomcat (−0.09) |
job_Care_Assistant,_Tonbridge_ | 1.27 |
- gather data (+0.13) - learning management systems (+0.10) - digital printing (+0.08) - customer insight (+0.06) |
- use questioning techniques (−0.14) |
job_Localities_Social_Worker_–_ Low_Caseload,_Berkshire_ | 1.25 |
- digital printing (+0.08) - learning management systems (+0.07) - guarantee customer satisfaction (+0.06) |
- use questioning techniques (−0.09) - optical character recognition software (−0.06) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Vultureanu-Albişi, A.; Murareţu, I.; Bădică, C. Explainable Recommender Systems Through Reinforcement Learning and Knowledge Distillation on Knowledge Graphs. Information 2025, 16, 282. https://doi.org/10.3390/info16040282
Vultureanu-Albişi A, Murareţu I, Bădică C. Explainable Recommender Systems Through Reinforcement Learning and Knowledge Distillation on Knowledge Graphs. Information. 2025; 16(4):282. https://doi.org/10.3390/info16040282
Chicago/Turabian StyleVultureanu-Albişi, Alexandra, Ionuţ Murareţu, and Costin Bădică. 2025. "Explainable Recommender Systems Through Reinforcement Learning and Knowledge Distillation on Knowledge Graphs" Information 16, no. 4: 282. https://doi.org/10.3390/info16040282
APA StyleVultureanu-Albişi, A., Murareţu, I., & Bădică, C. (2025). Explainable Recommender Systems Through Reinforcement Learning and Knowledge Distillation on Knowledge Graphs. Information, 16(4), 282. https://doi.org/10.3390/info16040282