Designing a Smart Health Insurance Pricing System: Integrating XGBoost and Repeated Nash Equilibrium in a Sustainable, Data-Driven Framework
Abstract
1. Introduction
2. Literature Review
2.1. Foundational Approaches: Actuarial Science and Risk Pooling
2.2. Machine Learning in Health Risk Analysis
2.3. Game Theory and Strategic Behavior Analysis
2.4. Convergence of Approaches: Integrating ML and Game Theory
2.5. Research Gaps and Innovation of the Present Study
3. Theoretical Foundations
3.1. Nash Equilibrium and Tripartite Dynamics in the Health Insurance Market
3.2. The XGBoost Algorithm and Redefining Health Risk Prediction
3.3. The Synergy of Nash Equilibrium and XGBoost: A Novel Framework for Health Insurance Pricing
- XGBoost is employed to extract precise individual risk profiles;
- Multilateral Nash equilibrium is used to model the strategic interactions between the insurer, employer, and insured party;
- And ultimately, a structure is formed in which no actor has an incentive to deviate unilaterally from the agreed strategy.
4. Methodology
4.1. Strategic Modeling Framework
- XGBoost machine learning model, employed for accurate prediction of medical claims and classification of insured individuals based on their risk levels;
- Repeated Game Theory, using long-term cooperation strategies (Folk Theorem) to analyze behavioral stability among actors and derive a sustainable Nash equilibrium within insurance contracts.
- Employee/Insured: The insured individual can adopt either healthy or risky strategies in lifestyle and healthcare service utilization.
- Insurer: The insurance company makes decisions regarding premium setting and risk management.
- Employer: As a facilitator and partial contributor to the insurance premium, the employer plays a critical role in contract stability and in discouraging risky behaviors.
4.2. Data and Risk Classification
4.3. Premium Rate Modeling Steps
- Groups 1 & 2: Premium reduction to encourage healthy behavior
- Group 3: No change (base rate applied)
- Group 4: Premium increase to compensate for higher risk
- The base premium rate is computed using Wald’s Principle, derived from the expected value of future claims [39].
4.4. Nash Equilibrium in Repeated Games with Folk Strategy
- The game repeats annually;
- In the case of defection by any player (e.g., unjustified reduction of insurance coverage or filing fraudulent claims), the other party applies a permanent punishment strategy based on the Folk Theorem;
- The players are rational and prefer long-term benefits over short-term gains.
4.5. Model Conclusion
- Optimal and data-driven adjusted premium rates can be derived;
- Long-term cooperation remains stable from both economic and behavioral perspectives across all risk groups;
- The combination of XGBoost + Repeated Games + Folk Strategy results in a model that benefits from both predictive accuracy and behavioral stability at the policy-making level.
4.6. Fundamental Controlling Mechanism of the Model
5. Results and Discussion
5.1. Claim Prediction Model Performance
- Coefficient of Determination (R2): The XGBoost model achieves the highest R2 value (≈0.787), indicating superior ability to explain the variance in medical claim costs. While models like Linear Regression and Random Forest also show reasonable performance, they fall short compared to XGBoost. The SVM model demonstrates the weakest performance in this metric.
- Mean Absolute Error (MAE) and Root Mean Square Error (RMSE): XGBoost achieves the lowest prediction error across both MAE and RMSE, reflecting its strong capability in accurately estimating actual claim amounts. In contrast, the high error rates of the SVM model on both metrics highlight its inefficiency when dealing with nonlinear and imbalanced datasets.
5.2. Analysis of Repeated Game with Long-Term Cooperation Strategy (Folk Theorem)
- Profit/loss for each player is expressed in monetary units (IRR), comparing continued cooperation with early defection.
- A reasonable discount rate is applied over time to calculate the present value in the repeated game.
- The decision-making algorithm combines data from the XGBoost model with dynamic game scenarios.
5.3. Feature Importance Analysis Using SHAP Plots
6. Conclusions and Future Work
Author Contributions
Funding
Acknowledgments
Conflicts of Interest
References
- Health, T.L.P. Universal Health Coverage: Realistic and achievable? Lancet Public Health 2019, 4, e1. [Google Scholar] [CrossRef]
- Lagomarsino, G.; Garabrant, A.; Adyas, A.; Muga, R.; Otoo, N. Moving towards universal health coverage: Health insurance reforms in nine developing countries in Africa and Asia. Lancet 2012, 380, 933–943. [Google Scholar] [CrossRef]
- Chan, M. Best days for public health are ahead of us, says WHO Director-General. Presented at the 65th World Health Assembly, Geneva, Switzerland, 21 May 2012. [Google Scholar]
- Kutzin, J. Towards Universal Health Care Coverage: Goal-Oriented Framework for Policy Analysis; World Bank: Washington, DC, USA, 2000. [Google Scholar]
- Savitha, B.; Kumar, K.N.; Kiran, K.B. Mitigating India’s health woes: Can health insurance be a remedy to achieve universal health coverage? Arch. Med. Health Sci. 2016, 4, 271–279. [Google Scholar] [CrossRef]
- Thomas, T.K. Role of health insurance in enabling universal health coverage in India: A critical review. Health Serv. Manag. Res. 2016, 29, 99–106. [Google Scholar] [CrossRef]
- Green, S. Negotiating with the future: The culture of modern risk in global financial markets. Environ. Plan. D Soc. Space 2000, 18, 77–89. [Google Scholar] [CrossRef]
- Denuit, M.; Hainaut, D.; Trufin, J. Effective Statistical Learning Methods for Actuaries II; Springer Actuarial: Cham, Switzerland, 2019. [Google Scholar]
- Chen, Y.T.; Peng, S.C. Information asymmetry in medical insurance: An empirical study of one life insurance company in Taiwan. Appl. Econ. 2022, 54, 5351–5356. [Google Scholar] [CrossRef]
- Bagariang, E.I.U.; Raharjanti, A.U. Calculation of Motor Vehicle Insurance Premiums Through Evaluation of Claim Frequency and Amount Data. Oper. Res. 2023, 4, 157–162. [Google Scholar] [CrossRef]
- Zahi, J. Non-life insurance ratemaking techniques. Int. J. Account. Financ. Audit. Manag. Econ. 2021, 2, 344–361. [Google Scholar]
- Shouri, S. Validation of Health Insurance Customers Using XGBoost Modeling. J. Int. Crisis Risk Commun. Res. 2024, 7, 1016–1026. [Google Scholar]
- Obermeyer, Z.; Emanuel, E.J. Predicting the Future—Big Data, Machine Learning, and Clinical Medicine. Health Aff. 2016, 35, 1130–1137. [Google Scholar] [CrossRef]
- Chen, Y.; Chirikov, V.V.; Marston, X.L.; Yang, J.; Qiu, H.; Xie, J.; Sun, N.; Gu, C.; Dong, P.; Gao, X. Machine learning for precision health economics and outcomes research (P-HEOR): Conceptual review of applications and next steps. J. Health Econ. Outcomes Res. 2020, 7, 35–42. [Google Scholar] [CrossRef] [PubMed]
- Einav, L.; Finkelstein, A. Moral hazard in health insurance: What we know and how we know it. J. Eur. Econ. Assoc. 2018, 16, 957–982. [Google Scholar] [CrossRef]
- Cutler, D.M.; Zeckhauser, R.J. Adverse selection in health insurance. In Frontiers in Health Policy Research; National Bureau of Economic Research: Cambridge, MA, USA, 1998; Volume 1, pp. 1–32. [Google Scholar]
- Lundberg, S.M.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions. In Proceedings of the 31st Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; Volume 30. [Google Scholar]
- Bühlmann, H. Credibility theory for premium calculation. ASTIN Bull. 1967, 4, 271–286. [Google Scholar]
- McGuire, T.G.; Van Kleef, R.C. Risk sharing, risk adjustment, and premium regulation in health insurance markets. J. Health Econ. 2018, 61, 1–7. [Google Scholar]
- Prabhakaran, V.; Martin, D., Jr. Participatory machine learning using community-based system dynamics. Health Hum. Rights 2020, 22, 71. [Google Scholar] [PubMed]
- Xie, Y.; Gopukumar, D.; Pan, T.; Kolli, A. Predicting readmission charges billed by hospitals: Machine learning approach. JMIR Med. Inform. 2022, 10, e37578. [Google Scholar] [CrossRef]
- Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
- Neghab, D.P.; Cevik, M.; Wahab, M.I.M. Explaining exchange rate forecasts with macroeconomic fundamentals using interpretive machine learning. arXiv 2023, arXiv:2303.16149. [Google Scholar] [CrossRef]
- Sun, W.; Wang, Y.; Zhang, L.; Chen, X.H.; Hoang, Y.H. Enhancing economic cycle forecasting based on interpretable machine learning and news narrative sentiment. Technol. Forecast. Soc. Change 2025, 215, 124094. [Google Scholar] [CrossRef]
- Aslam, F.; Hunjra, A.I.; Ftiti, Z.; Louhichi, W.; Shams, T. Insurance fraud detection: Evidence from artificial intelligence and machine learning. Res. Int. Bus. Financ. 2022, 62, 101744. [Google Scholar] [CrossRef]
- Backovic, M.; Popovic, Z.; Stamenkovic, M. Reflexive game theory approach to mutual insurance problem. Montenegrin J. Econ. 2016, 12, 87. [Google Scholar] [CrossRef]
- Zhang, R.; Zhu, Q.; Hayel, Y. A bi-level game approach to attack-aware cyber insurance of computer networks. IEEE J. Sel. Areas Commun. 2017, 35, 779–794. [Google Scholar] [CrossRef]
- Benaddi, H.; Ibrahimi, K.; Benslimane, A.; Jouhari, M.; Qadir, J. Robust enhancement of intrusion detection systems using deep reinforcement learning and stochastic game. IEEE Trans. Veh. Technol. 2022, 71, 11089–11102. [Google Scholar] [CrossRef]
- Shouri, S.; Ramezani, R. A blockchain-based health insurance model enhanced with quadratic voting. Health Inform. J. 2025, 31, 14604582251339422. [Google Scholar] [CrossRef] [PubMed]
- Nash, J.F. Equilibrium points in n-person games. Proc. Natl. Acad. Sci. USA 1950, 36, 48–49. [Google Scholar] [CrossRef]
- Wolfrum, F.; Boehler, Y.B.; Wichardt, P.C. PNS325 game theory in the healthcare literature: A systematic review focusing on negotiations. Value Health 2019, 22, S819. [Google Scholar] [CrossRef]
- Li, H.; Huang, W.; Duan, Z.; Mguni, D.H.; Shao, K.; Wang, J.; Deng, X. A survey on algorithms for Nash equilibria in finite normal-form games. Comput. Sci. Rev. 2024, 51, 100613. [Google Scholar] [CrossRef]
- Fudenberg, D.; Maskin, E. The Folk Theorem in repeated games with discounting or with incomplete information. Econometrica 1986, 54, 533–554. [Google Scholar] [CrossRef]
- Mailath, G.J.; Samuelson, L. Repeated Games and Reputations: Long-Run Relationships; Oxford University Press: Oxford, UK, 2006. [Google Scholar]
- Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Velarde, G.; Weichert, M.; Deshmunkh, A.; Deshmane, S.; Sudhir, A.; Sharma, K.; Joshi, V. Tree boosting methods for balanced and imbalanced classification and their robustness over time in risk assessment. Intell. Syst. Appl. 2024, 22, 200354. [Google Scholar] [CrossRef]
- Ben Hamida, A.; Kacem, M.; de Peretti, C.; Belkacem, L. Machine learning based methods for ratemaking health care insurance. Int. J. Mark. Res. 2024, 66, 810–831. [Google Scholar] [CrossRef]
- North-East Oil Industry Health and Treatment Organization. Statistical Report of Health Insurance Data; North-East Oil Industry Health and Treatment Organization: Mashhad, Iran, 2023. [Google Scholar]
- Lima Ramos, P. Premium calculation in insurance activity. J. Stat. Manag. Syst. 2017, 20, 39–65. [Google Scholar] [CrossRef]
Conceptual Change in Model | Description | Complexity Level | Computational Impact | Strategic/Behavioral Impact |
---|---|---|---|---|
Integration of ML (XGBoost) with Game Theory | Combining predictive analytics with strategic modeling | High | Increased computation time due to ML training | Improved accuracy in premium setting and stability of equilibrium |
Risk Stratification Using SHAP Analysis | Interpretable feature contributions for classification | Medium | Minor overhead for SHAP value computation | Enhanced transparency and fairness |
Multilateral Nash Equilibrium | Modeling three stakeholders instead of bilateral | High | More simulation iterations for equilibrium convergence | Captures realistic tripartite dynamics |
Application of Folk Theorem | Sustaining cooperation in repeated games | Medium | Requires simulation over multiple time periods | Promotes long-term stability and cooperation |
Dynamic Premium Adjustment Mechanism | Premiums updated based on real-time risk profiles | Medium-High | Periodic recalculation needed | Aligns incentives and discourages opportunism |
Category | Description | Count | % of Population | % of Total Claims (IRR) |
---|---|---|---|---|
Cat. 1 | No claims submitted | 156 | 0.52% | 0.00% |
Cat. 2 | Claims less than the paid premium | 20,459 | 68.69% | 17.84% |
Cat. 3 | Claims equal to the paid premium | 333 | 1.12% | 1.13% |
Cat. 4 | Claims greater than the paid premium | 8837 | 29.67% | 81.00% |
Total | – | 29,785 | 100% | 100% |
Category | Adjusted Premium (IRR) | Long-Term Cooperation Value | Initial Defection Value | Result |
---|---|---|---|---|
Category 1 (No Claims) | 15,552,759 | −186,633,113 | −200,630,596 | Sustainable Cooperation |
Category 2 (Low Risk) | 17,382,496 | −208,589,950 | −224,234,196 | Sustainable Cooperation |
Category 3 (Medium Risk) | 18,297,364 | −219,568,368 | −236,035,996 | Sustainable Cooperation |
Category 4 (High Risk) | 23,786,573 | −285,438,878 | −306,846,794 | Sustainable Cooperation |
Evaluation Metric | XGBoost Algorithm |
---|---|
R2 | 0.787 |
RMSE | 16,352,672 |
MAE | 1,080,986 |
Feature | Category 1: No Claims | Category 2: Low Risk | Category 3: Medium Risk | Category 4: High Risk |
---|---|---|---|---|
Number of Visits | Almost no impact—low frequency is common | Most influential variable—more visits increase risk | Strong impact—risk increases with more visits | Very strong effect—risk rises with a high number of visits |
Specific disease | Almost irrelevant—patients are generally healthy | Mild effect—most individuals do not have a Specific disease | Negligible—the model does not give it much weight | Clear impact—presence of a Specific disease raises the risk significantly |
Age | Minor effect—model not highly sensitive | Moderate effect—older age slightly increases risk | Significant effect—notable variation in both directions | Relatively strong—higher age is associated with increased risk |
Gender | No noticeable effect | Very slight impact | Minor role in model decision-making | Low but visible influence |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Shouri, S.; De la Sen, M.; Gordji, M.E. Designing a Smart Health Insurance Pricing System: Integrating XGBoost and Repeated Nash Equilibrium in a Sustainable, Data-Driven Framework. Information 2025, 16, 733. https://doi.org/10.3390/info16090733
Shouri S, De la Sen M, Gordji ME. Designing a Smart Health Insurance Pricing System: Integrating XGBoost and Repeated Nash Equilibrium in a Sustainable, Data-Driven Framework. Information. 2025; 16(9):733. https://doi.org/10.3390/info16090733
Chicago/Turabian StyleShouri, Saeed, Manuel De la Sen, and Madjid Eshaghi Gordji. 2025. "Designing a Smart Health Insurance Pricing System: Integrating XGBoost and Repeated Nash Equilibrium in a Sustainable, Data-Driven Framework" Information 16, no. 9: 733. https://doi.org/10.3390/info16090733
APA StyleShouri, S., De la Sen, M., & Gordji, M. E. (2025). Designing a Smart Health Insurance Pricing System: Integrating XGBoost and Repeated Nash Equilibrium in a Sustainable, Data-Driven Framework. Information, 16(9), 733. https://doi.org/10.3390/info16090733