Analysis of Student Dropout Risk in Higher Education Using Proportional Hazards Model and Based on Entry Characteristics
Abstract
1. Introduction
2. Related Works
3. Data Collection and Processing
3.1. Data Collection and Pre-Processing
3.2. The Conceptual Model
3.3. Model Testing and Evaluation
- hi(t)—the hazard rate for the ith case at time t;
- h0(t)—the baseline hazard at time t;
- bj—the value of the jth regression coefficient;
- xi1—gender (female, male);
- xi2—faculty (IITF, LPTF, ESAF, MVZF, VMF);
- xi3—priority to study in the program ((1st, 2nd, 3rd and lower, NM);
- xi4—sum of secondary school marks (SM). The SM variable was standardised using z-score transformation.
4. Results and Discussion
4.1. Analysis of Student Dropout
4.2. Student Survival and Dropout Risk
4.3. Relationship Between Secondary School and Academic Performance
4.4. Risk Factor Analysis of Student Dropout
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A. Assumptions and Comparison of Weibull and Cox Proportional Hazards Models
Model | Time Variable Distribution | AIC | Log-Likelihood | Significant Covariates | Assumptions |
---|---|---|---|---|---|
Cox PH | Does not assume a specific distribution for survival time | 6013.8 | −2996.9 (df = 10) | Faculty Finance SM | PH assumptions are met for Cox PH Model 2 (see Figure A3) |
Weibull | Assumes a specific distribution for survival time | 4156.2 | −2066.1 (df = 12) | Faculty Finance SM | Residuals not normally distributed (see Figure A1) |
References
- Vaarma, M.; Li, H. Predicting student dropouts with machine learning: An empirical study in Finnish higher education. Technol. Soc. 2024, 76, 102474. [Google Scholar] [CrossRef]
- OECD. Education at a Glance 2024 (Education at a Glance). 2024. Available online: https://www.oecd.org/en/publications/education-at-a-glance-2024_c00cad36-en.html (accessed on 7 July 2025).
- Latvia Ministry of Education. OECD INES|Ministry of Education. 2024. Available online: https://www.izm.gov.lv/lv/oecd-ines (accessed on 24 February 2025).
- OECD. Education at a Glance 2022. OECD Indicators/Education at a Glance. 2022. Available online: https://www.oecd.org/en/publications/education-at-a-glance-2022_3197152b-en.html (accessed on 7 July 2025).
- Latvia Ministry of Education. Higher Education|Study in Latvia. 2024. Available online: https://studyinlatvia.lv/higher-education# (accessed on 24 February 2025).
- González-Morales, M.O.; López-Aguilar, D.; Álvarez-Pérez, P.R.; Toledo-Delgado, P.A. Dropping out of higher education: Analysis of variables that characterise students who interrupt their studies. Acta Psychol. 2025, 252, 104669. [Google Scholar] [CrossRef] [PubMed]
- Delogu, M.; Lagravinese, R.; Paolini, D.; Resce, G. Predicting dropout from higher education: Evidence from Italy. Econ. Model. 2024, 130, 106583. [Google Scholar] [CrossRef]
- Tayebi, A.; Gomez, J.; Delgado, C. Analysis on the lack of motivation and dropout in engineering students in Spain. IEEE Access 2021, 9, 66253–66265. [Google Scholar] [CrossRef]
- Pecuchova, J.; Drlik, M. Predicting Students at Risk of Early Dropping Out from Course Using Ensemble Classification Methods. Procedia Comput. Sci. 2023, 225, 3223–3232. [Google Scholar] [CrossRef]
- Seo, E.Y.; Yang, J.; Lee, J.E.; So, G. Predictive modelling of student dropout risk: Practical insights from a South Korean distance university. Heliyon 2024, 10, e30960. [Google Scholar] [CrossRef] [PubMed]
- Loder, A.K.F. Master’s programs’ dropout and graduation clusters in a university system with a multiple enrollment policy. Int. J. Educ. Res. Open 2025, 8, 100423. [Google Scholar] [CrossRef]
- Rabelo, A.M.; Zárate, L.E. A model for predicting dropout of higher education students. Data Sci. Manag. 2025, 8, 72–85. [Google Scholar] [CrossRef]
- Da Silva, D.E.M.; Pires, E.J.S.; Reis, A.; De Moura Oliveira, P.B.; Barroso, J. Forecasting students dropout: A UTAD university study. Future Internet 2022, 14, 76. [Google Scholar] [CrossRef]
- Helland, H.; Strømme, T.B.; Thomsen, J.-P. Social inequality in dropout rates in higher education: Denmark and Norway. Stud. High. Educ. 2024, 1–16. [Google Scholar] [CrossRef]
- Tinto, V. Leaving College: Rethinking the Causes and Cures of Student Attrition, 2nd ed.; University of Chicago Press: Chicago, IL, USA, 1993. [Google Scholar] [CrossRef]
- R Core Team. R, version 4.4.3; A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2025. Available online: https://www.R-project.org/ (accessed on 15 February 2025).
- Murthy, D.P.; Xie, M.; Jiang, R. Weibull Models; John Wiley & Sons: Hoboken, NJ, USA, 2004. [Google Scholar]
- Cox, D.R. Regression models and life-tables. J. R. Stat. Soc. Ser. B 1972, 34, 187–220. [Google Scholar] [CrossRef]
- Therneau, T.M.; Grambsch, P.M. Modeling Survival Data: Extending the Cox Model; Springer Science & Business Media: Berlin, Germany, 2000; ISBN 0-387-98784-3. [Google Scholar]
- Therneau, T. R Package, version 3.8-3; A Package for Survival Analysis in R; R Foundation for Statistical Computing: Vienna, Austria, 2024. Available online: https://CRAN.R-project.org/package=survival (accessed on 15 February 2025).
- Kassambara, A.; Kosinski, M.; Biecek, P. R Package, version 0.5.0; Survminer: Drawing Survival Curves Using ‘ggplot2’; R Foundation for Statistical Computing: Vienna, Austria, 2024. Available online: https://rpkgs.datanovia.com/survminer/index.html (accessed on 3 February 2025).
- Schloerke, B.; Cook, D.; Larmarange, J.; Briatte, F.; Marbach, M.; Thoen, E.; Elberg, A.; Crowley, J. R Package, version 2.2.1; GGally: Extension to ‘ggplot2’; R Foundation for Statistical Computing: Vienna, Austria, 2024. Available online: https://github.com/ggobi/ggally (accessed on 15 February 2025).
- Paura, L.; Arhipova, I.; Vitols, G. Evaluation of students dropout rate and reasons during the study. In INTED2017 Proceedings; IATED: Valencia, Spain, 2017; pp. 2233–2238. [Google Scholar] [CrossRef]
- Pusztai, G.; Fényes, H.; Kovács, K. Factors Influencing the Chance of Dropout or Being at Risk of Dropout in Higher Education. Educ. Sci. 2022, 12, 804. [Google Scholar] [CrossRef]
- Paura, L.; Arhipova, I. Cause Analysis of students’ dropout rate in higher education study program. Procedia—Soc. Behav. Sci. 2014, 109, 1282–1286. [Google Scholar] [CrossRef]
- Wild, S.; Heuling, L.S. Student dropout and retention: An event history analysis among students in cooperative higher education. Int. J. Educ. Res. 2020, 104, 101687. [Google Scholar] [CrossRef]
- Kabashi, Q.; Shabani, I.; Caka, N. Analysis of the student dropout rate at the Faculty of Electrical and Computer Engineering of the University of Prishtina, Kosovo, from 2001 to 2015. IEEE Access 2022, 10, 68126–68137. [Google Scholar] [CrossRef]
- Behr, A.; Giese, M.; Teguim, K.H.D.; Theune, K. Dropping out from Higher Education in Germany an Empirical Evaluation of Determinants for Bachelor Students. Open Educ. Stud. 2020, 2, 126–148. [Google Scholar] [CrossRef]
- Espinoza, O.; Sandoval, L.; González, L.; Maldonado, K.; Larrondo, Y.; Corradi, B. Reasons for university dropout in Chile: Does student gender play a role? Educ. Rev. 2024, 77, 562–577. [Google Scholar] [CrossRef]
- Fior, C.A.; Polydoro, S.A.J.; Pelissoni, A.M.S.; Dantas, M.A.; Martins, M.J.; Da Silva Almeida, L. Impact of self-efficacy and academic performance in the dropout of higher education students. Psicol. Esc. E Educ. 2022, 26, e235218. [Google Scholar] [CrossRef]
- Cocoradă, E.; Curtu, A.L.; Năstasă, L.E.; Vorovencii, I. Dropout Intention, Motivation, and Socio-Demographics of Forestry students in Romania. Forests 2021, 12, 618. [Google Scholar] [CrossRef]
- Pedersen, J.V.; Nielsen, M.W. Gender, self-efficacy and attrition from STEM programmes: Evidence from Danish survey and registry data. Stud. High. Educ. 2023, 49, 47–61. [Google Scholar] [CrossRef]
- Astorne-Figari, C.; Speer, J.D. Drop out, switch majors, or persist? The contrasting gender gaps. Econ. Lett. 2018, 164, 82–85. [Google Scholar] [CrossRef]
- Meyer, J.; Strauß, S. The influence of gender composition in a field of study on students’ drop-out of higher education. Eur. J. Educ. 2019, 54, 443–456. [Google Scholar] [CrossRef]
- Masci, C.; Giovio, M.; Mussida, P. Survival models for predicting student dropout at university across time. Int. Conf. Educ. New Dev. 2022, 1, 203–207. [Google Scholar] [CrossRef]
- Gutierrez-Pachas, D.A.; Garcia-Zanabria, G.; Cuadros-Vargas, E.; Camara-Chavez, G.; Gomez-Nieto, E. Supporting Decision-Making Process on Higher Education Dropout by Analyzing Academic, Socioeconomic, and Equity Factors through Machine Learning and Survival Analysis Methods in the Latin American Context. Educ. Sci. 2023, 13, 154. [Google Scholar] [CrossRef]
- Kalamaras, D.; Maska, L.; Nasika, F. A Cox Proportional Hazards Model with Latent Covariates Reflecting Students’ Preparation, Motives, and Expectations for the Analysis of Time to Degree. Stats 2025, 8, 37. [Google Scholar] [CrossRef]
- Llauró, A.; Fonseca, D.; Romero, S.; Aláez, M.; Lucas, J.T.; Felipe, M.M. Identification and comparison of the main variables affecting early university dropout rates according to knowledge area and institution. Heliyon 2023, 9, e17435. [Google Scholar] [CrossRef]
Variable Name | Type of Data | No. of Levels or Range | Description |
---|---|---|---|
Gender | Nominal | 2 | Female, Male |
Faculty | Nominal | 5 | IITF, LPTF, ESAF, MVZF, VMF |
Dropout date | Date | - | Student dropout date |
Dropout semester | Nominal | 7 | Student dropout semester |
Dropout reason | Nominal | 4 | Did not start study, voluntary dropout, did not fulfil financial obligations, did not complete study courses |
Dropout after days | Numeric | 0–1258 days | Survival time = difference between student dropout date and study start date |
Dropout after month | Numeric | 0–41 months | Student survival time in months |
Status | Nominal | 2 | 1—dropout; 0—study |
Priority | Nominal | 4 | Not mentioned (NM), 1st, 2nd, 3rd and lower |
Finance source | Nominal | 2 | Government financed, self-financed |
WAM | Numeric | 1–9.49 | Weighted Average Mark |
SM | Numeric | 46–512.9 | Sum of secondary school marks and the results from the central exam |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Paura, L.; Arhipova, I.; Vitols, G.; Sproge, S. Analysis of Student Dropout Risk in Higher Education Using Proportional Hazards Model and Based on Entry Characteristics. Data 2025, 10, 110. https://doi.org/10.3390/data10070110
Paura L, Arhipova I, Vitols G, Sproge S. Analysis of Student Dropout Risk in Higher Education Using Proportional Hazards Model and Based on Entry Characteristics. Data. 2025; 10(7):110. https://doi.org/10.3390/data10070110
Chicago/Turabian StylePaura, Liga, Irina Arhipova, Gatis Vitols, and Sandra Sproge. 2025. "Analysis of Student Dropout Risk in Higher Education Using Proportional Hazards Model and Based on Entry Characteristics" Data 10, no. 7: 110. https://doi.org/10.3390/data10070110
APA StylePaura, L., Arhipova, I., Vitols, G., & Sproge, S. (2025). Analysis of Student Dropout Risk in Higher Education Using Proportional Hazards Model and Based on Entry Characteristics. Data, 10(7), 110. https://doi.org/10.3390/data10070110