A Multi-Stage Financial Distress Early Warning System: Analyzing Corporate Insolvency with Random Forest
Abstract
:1. Introduction
- By focusing on insolvency status, we construct an EWS using an insolvency-based modeling approach and investigate whether insolvency status can serve as an alternative financial distress status between active and bankruptcy statuses. We examine whether precautionary financial signs of insolvency can be accurately identified using a random forest machine learning approach (Breiman, 2001).
- We analyze how financial criteria and mechanisms differ among active, insolvent, and bankrupt companies by identifying and comparing key variables derived from random forest-based insolvency prediction models.
- Overall, we introduce a systematic random forest machine learning framework to conduct multi-stage EWS construction and analysis based upon active, insolvency, and bankruptcy company statuses.
2. Literature Review
2.1. Bankruptcy Prediction Approach
2.2. Multi-Stage Financial Distress Modeling Approach
2.3. Identifying Research Gaps and Contributions
3. Random Forest Insolvency Modelling Methodology
3.1. Data
3.2. Random Forest Modelling
- Draw a subset of training data using random sampling with replacement (bootstrap).
- Train a decision tree using this subset of training data. At each node of the tree, select the best split from a randomly chosen subset of variables (rather than using all available variables).
- Repeat steps 1 and 2 to generate d decision trees.
- Make predictions for new data by voting for (or taking the average of) the most frequent class among all the outputs of the d decision trees.
4. Experimental Results
4.1. Model Performances
4.1.1. Active vs. Insolvent Model Results
4.1.2. Bankrupt vs. Insolvent Model Results
4.1.3. Active vs. Insolvent vs. Bankrupt Model Results
4.2. Criteria Differences in Financial Condition
4.2.1. Financial Criteria Differences Between Active and Insolvent Companies
4.2.2. Financial Criteria Differences Between Bankruptcy and Insolvency Companies
4.2.3. Financial Criteria Differences Between Active and Bankruptcy Status
4.2.4. Financial Criteria Differences Among Active and Bankrupt and Insolvent Status
4.3. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
1 | See https://www.bvdinfo.com/en-gb/our-products/data/international/orbis (accessed on 26 March 2025). |
2 | Insolvency is categorized as one status of active companies and defined in Orbis as follows: ‘Active (insolvency proceedings): Here the company is declared insolvent. The company remains active, though it is in administration or receivership or under a scheme of arrangement (US—Chapter 11). During this period, the company is usually placed under the protection of a law and continues operating and repaying creditors and tries to reorganize and return to normal operation. At the end, the company will either return to normal operation (the default of payment was thus temporary); or will be reorganized (parts of its activity can be restructured or sold); or will be liquidated’. |
3 | We do not include deep learning because it is not very practical. It requires intensive hyper-parameter tuning to obtain good performance and does not provide off-the-shelf importance variable measurement; hence, it is not very interpolative. |
4 | A variable is also called an indictor or, more widely known as a feature in machine learning. |
5 | Random forest can be replaced by boosting approaches, such as gradient boosting (Friedman, 2001) and XGBBoost (T. Chen & Guestrin, 2016), since it also provides easy hyper-parameter setting and interpolation of the model. In fact, as we conducted an experiment of XGBBoost with the set experimental setup, the result is very similar to random forest (76.83%, 70.32%, and 75.83%). We used random forest because it is computationally lighter and more scalable owing to its better parallelization capability than the boosting approach, which is generally sequential learning. |
References
- Almaskati, N., Bird, R., Yeung, D., & Lu, Y. (2021). A horse race of models and estimation methods for predicting bankruptcy. Advances in Accounting, 52, 100513. [Google Scholar]
- Alpaydin, E. (2014). Introduction to machine learning. The MIT Press. [Google Scholar]
- Altman, E. I. (1968). Financial ratios, discriminant analysis and the prediction of corporate bankruptcy. The Journal of Finance, 23(4), 589–609. [Google Scholar] [CrossRef]
- Altman, E. I. (1984). The success of business failure prediction models: An international survey. Journal of Banking & Finance, 8(2), 171–198. [Google Scholar]
- Altman, E. I. (1993). Corporate financial distress and bankruptcy: A complete guide to predicting and avoiding distress and profiting from bankruptcy (Vol. 18. 3). Wiley Finance Edition. Wiley. [Google Scholar]
- Altman, E. I., Marco, G., & Varetto, F. (1994). Corporate distress diagnosis: Comparisons using linear discriminant analysis and neural networks (the Italian experience). Journal of Banking & Finance, 18(3), 505–529. [Google Scholar]
- Barboza, F., Kimura, H., & Altman, E. (2017). Machine learning models and bankruptcy prediction. Expert Systems with Applications, 83, 405–417. [Google Scholar]
- Beaver, W. H. (1966). Financial ratios as predictors of failure. Journal of Accounting Research, 4, 71–111. [Google Scholar] [CrossRef]
- Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. [Google Scholar] [CrossRef]
- Breiman, L., Friedman, J., Stone, C. J., & Olshen, R. A. (1984). Classification and regression trees. In Monterey: Wadsworth. Chapman & Hall. [Google Scholar]
- Brédart, X. (2014). Bankruptcy prediction model: The case of the United States. International Journal of Economics and Finance, 6(3), 1–7. [Google Scholar] [CrossRef]
- Chakraborty, S., & Sharma, S. K. (2007). Prediction of corporate financial health by artificial neural network. International Journal of Electronic Finance, 1(4), 442–459. [Google Scholar] [CrossRef]
- Chen, C. C., Chen, C. D., & Lien, D. (2020). Financial distress prediction model: The effects of corporate governance indicators. Journal of Forecasting, 39(8), 1238–1252. [Google Scholar] [CrossRef]
- Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In KDD ’16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785–794). Association for Computing Machinery. [Google Scholar]
- Climent, F., Momparler, A., & Carmona, P. (2019). Anticipating bank distress in the Eurozone: An extreme gradient boosting approach. Journal of Business Research, 101, 885–896. [Google Scholar]
- Drehmann, M., & Juselius, M. (2014). Evaluating early warning indicators of banking crises: Satisfying policy requirements. International Journal of Forecasting, 30(3), 759–780. [Google Scholar]
- Einav, L., & Levin, J. (2014). The data revolution and economic analysis. Innovation Policy and the Economy, 14, 1–24. [Google Scholar] [CrossRef]
- Farooq, U., Jibran Qamar, M. A., & Haque, A. (2018). A three-stage dynamic model of financial distress. Managerial Finance, 44(9), 1101–1116. [Google Scholar] [CrossRef]
- Farooq, U., & Qamar, M. A. J. (2019). Predicting multistage financial distress: Reflections on sampling, feature and model selection criteria. Journal of Forecasting, 38(7), 632–648. [Google Scholar] [CrossRef]
- Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. The Annals of Statistics, 29(5), 1189–1232. [Google Scholar]
- Geron, A. (2022). Hands-on machine learning with scikit-learn, keras, and tensorflow: Concepts, tools, and techniques to build intelligent systems (3rd ed.). O’Reilly Media, Inc. [Google Scholar]
- Hillegeist, S. A., Keating, E. K., Cram, D. P., & Lundstedt, K. G. (2004). Assessing the probability of bankruptcy. Review of Accounting Studies, 9(1), 5–34. [Google Scholar] [CrossRef]
- Holopainen, M., & Sarlin, P. (2017). Toward robust early-warning models: A horse race, ensembles and model uncertainty. Quantitative Finance, 17(12), 1933–1963. [Google Scholar]
- Hosaka, T. (2019). Bankruptcy prediction using imaged financial ratios and convolutional neural networks. Expert Systems with Applications, 117, 287–299. [Google Scholar] [CrossRef]
- Jabeur, S. B., & Serret, V. (2023). Bankruptcypredictionusing fuzzy convolutional neural networks. Research in International Business and Finance, 64, 101844. [Google Scholar]
- Jayasekera, R. (2018). Prediction of company failure: Past, present and promising directions for the future. International Review of Financial Analysis, 55, 196–208. [Google Scholar] [CrossRef]
- Jones, S. (2017). Corporate bankruptcy prediction: A high dimensional analysis. Review of Accounting Studies, 22(3), 1366–1422. [Google Scholar] [CrossRef]
- Jones, S., Johnstone, D., & Wilson, R. (2017). Predicting corporate bankruptcy: An evaluation of alternative statistical frameworks. Journal of Business Finance & Accounting, 44(1–2), 3–34. [Google Scholar]
- Kim, C. N., Yang, K. H., & Kim, J. (2008). Human decision-making behavior and modeling effects. Decision Support Systems, 45(3), 517–527. [Google Scholar] [CrossRef]
- Kristóf, T., & Virág, M. (2022). EU-27 bank failure prediction with C5.0 decision trees and deep learning neural networks. Research in International Business and Finance, 61, 101644. [Google Scholar] [CrossRef]
- Lennox, C. (1999). Identifying failing companies: A re-evaluation of the logit, probit and DA approaches. Journal of Economics and Business, 51(4), 347–364. [Google Scholar] [CrossRef]
- Lin, S. M., Ansell, J., & Andreeva, G. (2012). Predicting default of a small business using different definitions of financial distress. Journal of the Operational Research Society, 63(4), 539–548. [Google Scholar] [CrossRef]
- Liu, J., Li, C., Ouyang, P., Liu, J., & Wu, C. (2023). Interpreting the prediction results of the tree-based gradient boosting models for financial distress prediction with an explainable machine learning approach. Journal of Forecasting, 42(5), 1112–1137. [Google Scholar] [CrossRef]
- Manzaneque, M., & Priego, A. M. (2016). Corporate governance effect on financial distress likelihood: Evidence from Spain. Revista de Contabilidad, 19(1), 111–121. [Google Scholar] [CrossRef]
- Martin, D. (1977). Early warning of bank failure: A logit regression approach. Journal of Banking & Finance, 1(3), 249–276. [Google Scholar]
- Messier, W. F., Jr., & Hansen, J. V. (1988). Inducing rules for expert system development: An example using default and bankruptcy data. Management Science, 34(12), 1403–1415. [Google Scholar]
- Miglani, S., Ahmed, K., & Henry, D. (2015). Voluntary corporate governance structure and financial distress: Evidence from Australia. Journal of Contemporary Accounting & Economics, 11(1), 18–30. [Google Scholar]
- Ohlson, J. A. (1980). Financial ratios and the probabilistic prediction of bankruptcy. Journal of Accounting Research, 18(1), 109. [Google Scholar]
- Pindado, J., Rodrigues, L., & De la Torre, C. (2008). Estimating financial distress likelihood. Journal of Business Research, 61(9), 995–1003. [Google Scholar]
- Purnanandam, A. (2008). Financial distress and corporate risk management: Theory and evidence. Journal of Financial Economics, 87(3), 706–739. [Google Scholar] [CrossRef]
- Shirata, C. (2003, January). Predictors of bankruptcy after bubble economy in Japan: What can you learn from Japan case? The Proceedings of the 15th Asian-Pacific Conference on International Accounting Issues, Bangkok, Thailand. [Google Scholar]
- Sun, J., Fujita, H., Zheng, Y., & Ai, W. (2021). Multiclass financial distress prediction based on support vector machines integrated with the decomposition and fusion methods. Information Sciences, 559, 153–170. [Google Scholar] [CrossRef]
- Tanaka, K., Higashide, T., Kinkyo, T., & Hamori, S. (2019). Analyzing industry-level vulnerability by predicting financial bankruptcy. Economic Inquiry, 57(4), 2017–2034. [Google Scholar]
- Tanaka, K., Kinkyo, T., & Hamori, S. (2016). Random forests-based early warning system for bank failures. Economics Letters, 148, 118–121. [Google Scholar]
- Tanaka, K., Kinkyo, T., & Hamori, S. (2018). Financial hazard map: Financial vulnerability predicted by a random forests classification model. Sustainability, 10(5), 1530. [Google Scholar] [CrossRef]
- Thor, M., & Postek, Ł. (2024). Gated recurrent unit network: A promising approach to corporate default prediction. Journal of Forecasting, 43(5), 1131–1152. [Google Scholar] [CrossRef]
- Tian, S., & Yu, Y. (2017). Financial ratios and bankruptcy predictions: An international evidence. International Review of Economics & Finance, 51, 510–526. [Google Scholar]
- Tian, S., Yu, Y., & Guo, H. (2015). Variable selection and corporate bankruptcy forecasts. Journal of Banking & Finance, 52, 89–100. [Google Scholar]
- Tsai, B.-H. (2013). An early warning system of financial distress using multinomial logit models and a bootstrapping approach. Emerging Markets Finance and Trade, 49(Suppl. S2), 43–69. [Google Scholar]
- Turetsky, H. F., & McEwen, R. A. (2001). An empirical investigation of firm longevity: A model of the ex ante predictors of financial distress. Review of Quantitative Finance and Accounting, 16(4), 323–343. [Google Scholar]
- Varian, H. R. (2014). Big data: New tricks for econometrics. Journal of Economic Perspectives, 28(2), 3–28. [Google Scholar]
- Vochozka, M., Vrbka, J., & Suler, P. (2020). Bankruptcy or success? The effective prediction of a company’s financial development using LSTM. Sustainability, 12(18), 7529. [Google Scholar] [CrossRef]
Status | Active | Insolvent | Bankrupt |
---|---|---|---|
Data size | 5,398,234 | 43,274 | 273,275 |
1st Qu. | Median | Mean | 3rd Qu. | ||
---|---|---|---|---|---|
Profitability | ROE using P/L before tax | −0.97 | 8.35 | 2.63 | 26.87 |
Ratios | ROCE using P/L before tax | 2.88 | 6.61 | 4.80 | 11.60 |
ROA using P/L before tax | −5.67 | 0.87 | −0.45 | 6.57 | |
ROE using Net income | −0.12 | 5.78 | −0.61 | 19.53 | |
ROCE using Net income | 2.94 | 6.02 | 3.74 | 10.04 | |
ROA using Net income | −4.68 | 0.53 | −0.92 | 5.32 | |
Profit margin | −4.23 | 0.83 | −0.84 | 4.58 | |
EBITDA margin | 0.49 | 4.06 | 4.72 | 8.75 | |
EBIT margin | −2.46 | 1.99 | 1.29 | 6.22 | |
Cash flow/Operating revenue | −0.05 | 2.51 | 2.34 | 6.37 | |
Operational | Net assets turnover | 1.51 | 3.36 | 9.97 | 7.25 |
Ratios | Interest cover | 0.17 | 1.18 | 9.75 | 2.17 |
Stock turnover | 5.84 | 9.72 | 30.95 | 17.06 | |
Collection period days | 7.00 | 43.00 | 77.77 | 96.00 | |
Credit period days | 9.00 | 32.00 | 63.53 | 73.00 | |
Structure | Current ratio | 0.79 | 1.13 | 2.35 | 1.76 |
Ratios | Liquidity ratio | 0.44 | 0.84 | 1.81 | 1.35 |
Shareholders liquidity ratio | 0.19 | 0.59 | 7.09 | 1.42 | |
Solvency ratio (Asset based) (%) | 3.35 | 16.63 | 20.37 | 38.66 | |
Solvency ratio (Liability based) (%) | 14.21 | 19.86 | 24.81 | 27.03 | |
Gearing (%) | 14.72 | 52.32 | 116.92 | 117.99 |
Random Forest | Neural Network | Tree | Logistic | |
---|---|---|---|---|
Active vs. Insolvent | 76.84% | 70.75% | 71.15% | 69.41% |
Bankrupt vs. Insolvent | 70.31% | 60.97% | 62.99% | 56.70% |
Active vs. Bankrupt | 75.64% | 69.53% | 71.27% | 68.42% |
Active vs. Insolvent vs. Bankrupt | 60.81% | 50.55% | 52.45% | 48.98% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Tanaka, K.; Higashide, T.; Kinkyo, T.; Hamori, S. A Multi-Stage Financial Distress Early Warning System: Analyzing Corporate Insolvency with Random Forest. J. Risk Financial Manag. 2025, 18, 195. https://doi.org/10.3390/jrfm18040195
Tanaka K, Higashide T, Kinkyo T, Hamori S. A Multi-Stage Financial Distress Early Warning System: Analyzing Corporate Insolvency with Random Forest. Journal of Risk and Financial Management. 2025; 18(4):195. https://doi.org/10.3390/jrfm18040195
Chicago/Turabian StyleTanaka, Katsuyuki, Takuo Higashide, Takuji Kinkyo, and Shigeyuki Hamori. 2025. "A Multi-Stage Financial Distress Early Warning System: Analyzing Corporate Insolvency with Random Forest" Journal of Risk and Financial Management 18, no. 4: 195. https://doi.org/10.3390/jrfm18040195
APA StyleTanaka, K., Higashide, T., Kinkyo, T., & Hamori, S. (2025). A Multi-Stage Financial Distress Early Warning System: Analyzing Corporate Insolvency with Random Forest. Journal of Risk and Financial Management, 18(4), 195. https://doi.org/10.3390/jrfm18040195