Does ESG Predict Business Failure in Brazil? An Application of Machine Learning Techniques
Abstract
:1. Introduction
2. A Review of Machine-Learning Techniques for Business Failure Prediction
2.1. Logistic Regression
2.2. K-Nearest Neighbors (KNN)
2.3. Decision Trees
2.4. Support Vector Machine (Linear Kernel)
2.5. Support Vector Machine (Non-Linear Kernel)
2.6. Artificial Neural Networks (ANNs)
2.7. Random Forest
2.8. Gradient Boosting
2.9. XGBoost Classifier
2.10. AdaBoost Classifier
2.11. Catboost Classifier
3. Data and Methodology
3.1. Data and Variables
3.2. Data Screening Process
3.3. Descriptive Statistics
3.4. Pooled Within-Groups Correlation
4. Results and Analysis
4.1. Application of Machine-Learning Models for Prediction Business Failure
4.2. Receiver Operating Characteristic (ROC) Curves
4.3. Confusion Matrices
4.4. Feature Importance
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Abdi, Abdulhakim Mohamed. 2020. Land cover and land use classification performance of machine learning algorithms in a boreal landscape using Sentinel-2 data. GIScience and Remote Sensing 57: 1–20. [Google Scholar] [CrossRef]
- Ahmad, Nisar, Asma Mobarek, and Naheed Nawazesh Roni. 2021. Revisiting the impact of ESG on financial performance of FTSE350 UK firms: Static and dynamic panel data analysis. Cogent Business and Management 8: 1900500. [Google Scholar] [CrossRef]
- Ahsan, Muhammad, Arif Khoirul Anam, Erdi Julian, and Andi Indra Jaya. 2022. Interpretable Predictive Model of Network Intrusion Using Several Machine Learning Algorithms. BAREKENG: Jurnal Ilmu Matematika Dan Terapan 16: 057–064. [Google Scholar] [CrossRef]
- Akinrinola, Olatunji, Wilhelmina Afua Addy, Adeola Olusola Ajayi-Nifise, Olubusola Odeyemi, and Titilola Falaiye. 2024. Application of machine learning in tax prediction: A review with practical approaches. Global Journal of Engineering and Technology Advances 18: 102–17. [Google Scholar] [CrossRef]
- Alsayegh, Maha Faisal, Rashidah Abdul Rahman, and Saeid Homayoun. 2020. Corporate economic, environmental, and social sustainability performance transformation through ESG disclosure. Sustainability 12: 3910. [Google Scholar] [CrossRef]
- Ansari, Abdollah, Ibrahim Said Ahmad, Azuraliza Abu Bakar, and Mohd Ridzwan Yaakub. 2020. A hybrid metaheuristic method in training artificial neural network for bankruptcy prediction. IEEE Access 8: 176640–50. [Google Scholar] [CrossRef]
- Antunes, Francisco, Bernardete Ribeiro, and Francisco Pereira. 2017. Probabilistic modeling and visualization for bankruptcy prediction. Applied Soft Computing Journal 60: 831–43. [Google Scholar] [CrossRef]
- Aoki, Shigeo, and Yukio Hosonuma. 2004. Bankruptcy Prediction Using Decision Tree. In The Application of Econophysics. Tokyo: Springer, pp. 299–302. [Google Scholar] [CrossRef]
- Arshad, Roshayani, Sharinah Mohamed Iqbal, and Normah Omar. 2015. Prediction of business failure and fraudulent financial reporting: Evidence from Malaysia. Indian Journal of Corporate Governance 8: 34–53. [Google Scholar] [CrossRef]
- Ashraf, Sumaira, Elisabete G. S. Félix, and Zélia Serrasqueiro. 2019. Do Traditional Financial Distress Prediction Models Predict the Early Warning Signs of Financial Distress? Journal of Risk and Financial Management 12: 55. [Google Scholar] [CrossRef]
- Asselman, Amal, Mohamed Khaldi, and Souhaib Aammou. 2023. Enhancing the prediction of student performance based on the machine learning XGBoost algorithm. Interactive Learning Environments 31: 3360–79. [Google Scholar] [CrossRef]
- Aydoğmuş, Mahmut, Güzhan Gülay, and Korkmaz Ergun. 2022. Impact of ESG performance on firm value and profitability. Borsa Istanbul Review 22: S119–S127. [Google Scholar] [CrossRef]
- Ayuni, Ni Wayan Dewinta, Ni Nengah Lasmini, and Agus Adi Putrawan. 2022. Support Vector Machine (SVM) as Financial Distress Model Prediction in Property and Real Estate Companies. In Proceedings of the International Conference on Applied Science and Technology on Social Science 2022 (ICAST-SS 2022). Amsterdam: Atlantis Press, pp. 397–402. [Google Scholar] [CrossRef]
- Bapat, Varadraj, and Abhay Nagale. 2014. Comparison of Bankruptcy Prediction Models: Evidence from India. Accounting and Finance Research 3: 91–98. [Google Scholar] [CrossRef]
- Baranyi, Aranka, Csaba Faragó, Csilla Fekete, and Zsuzsanna Szeles. 2018. The Bankruptcy Forecasting Model of Hungarian Enterprises. Advances in Economics and Business 6: 179–89. [Google Scholar] [CrossRef]
- Barboza, Flávio Luiz De Moraes, Denize Lemos Duarte, and Michele Aparecida Cunha. 2022. Anticipating corporate’s distresses. Exacta 20: 470–96. [Google Scholar] [CrossRef]
- Bateni, Leila, and Farshid Asghari. 2016. Bankruptcy Prediction Using Logit and Genetic Algorithm Models: A Comparative Analysis. Computational Economics 55: 335–48. [Google Scholar] [CrossRef]
- Buckley, Tadhg, Bidisha Ghosh, and Vikram Pakrashi. 2023. A Feature Extraction & Selection Benchmark for Structural Health Monitoring. Structural Health Monitoring 22: 2082–127. [Google Scholar] [CrossRef]
- Byvatov, Evgeny, Uli Fechner, Jens Sadowski, and Gisbert Schneider. 2003. Comparison of Support Vector Machine and Artificial Neural Network Systems for Drug/Nondrug Classification. Journal of Chemical Information and Computer Sciences 43: 1882–89. [Google Scholar] [CrossRef]
- Capelli, Paolo, Federica Ielasi, and Angeloantonio Russo. 2021. Forecasting volatility by integrating financial risk with environmental, social, and governance risk. Corporate Social Responsibility and Environmental Management 28: 1483–95. [Google Scholar] [CrossRef]
- Chaudhuri, Arindam, and Kajal De. 2011. Fuzzy Support Vector Machine for bankruptcy prediction. Applied Soft Computing Journal 11: 2472–86. [Google Scholar] [CrossRef]
- Chen, Tianqi, and Carlos Guestrin. 2016. XGBoost: A scalable tree boosting system. Paper presented at the Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, August 13–17; pp. 785–94. [Google Scholar] [CrossRef]
- Cheng, Christine, Stewart Jones, and William J. Moser. 2018. Abnormal trading behavior of specific types of shareholders before US firm bankruptcy and its implications for firm bankruptcy prediction. Journal of Business Finance and Accounting 45: 1100–38. [Google Scholar] [CrossRef]
- Chiaramonte, Laura, and Barbara Casu. 2017. Capital and liquidity ratios and financial distress. Evidence from the European banking industry. British Accounting Review 49: 138–61. [Google Scholar] [CrossRef]
- Choi, Hyunchul, Hyeonwoo Sung, Hyukman Cho, Sungwook Lee, Hyojoo Son, and Changwan Kim. 2017. Comparison of single classifier models for predicting long-term business failure of construction companies using finance-based definition of the failure. Paper presented at the ISARC 2017—Proceedings of the 34th International Symposium on Automation and Robotics in Construction, Taipei, Taiwan, June 28–July 1; pp. 282–87. [Google Scholar] [CrossRef]
- Chouaibi, Salim, Matteo Rossi, Dario Siggia, and Jamel Chouaibi. 2022. Exploring the moderating role of social and ethical practices in the relationship between environmental disclosure and financial performance: Evidence from Esg companies. Sustainability 14: 209. [Google Scholar] [CrossRef]
- Chui, Kwok Tai, and Miltiadis D. Lytras. 2019. A novel MOGA-SVM multinomial classification for organ inflammation detection. Applied Sciences 9: 2284. [Google Scholar] [CrossRef]
- Deif, Mohanad A., Ahmed A. A. Solyman, Mohammed H. Alsharif, and Peerapong Uthansakul. 2021. Automated triage system for intensive care admissions during the covid-19 pandemic using hybrid xgboost-ahp approach. Sensors 21: 6379. [Google Scholar] [CrossRef] [PubMed]
- Delis, Manthos D., and Panagiotis K. Staikouras. 2011. Supervisory effectiveness and bank risk. Review of Finance 15: 511–43. [Google Scholar] [CrossRef]
- Dellepiane, Umberto, Michele Di Marcantonio, Enrico Laghi, and Stefania Renzi. 2015. Bankruptcy Prediction Using Support Vector Machines and Feature Selection During the Recent Financial Crisis. International Journal of Economics and Finance 7: 182–95. [Google Scholar] [CrossRef]
- DeSalvo, Giulia, and Mehryar Mohri. 2016. Random composite forests. Paper presented at the 30th AAAI Conference on Artificial Intelligence, AAAI 2016, Phoenix, AZ, USA, February 12–17; pp. 1540–46. [Google Scholar] [CrossRef]
- Ding, Yongsheng, Xinping Song, and Yueming Zen. 2008. Forecasting financial condition of chinese listed companies based on support vector machine. Expert Systems with Applications 34: 3081–89. [Google Scholar] [CrossRef]
- Domicián, Máté, Raza Hassan, and Ahmad Ishtiaq. 2023. Comparative Analysis of Machine Learning Models for Bankruptcy Prediction in the Context of Pakistani Companies. Paper presented at the 2023 3rd International Conference on Intelligent Technologies, CONIT 2023, Hubli, India, June 23–25. [Google Scholar]
- Elklawy, Mohamed. 2024. Is ESG a Determinant of Banks’ Resilience and Growth Everywhere? A Response from an AI-Aided Approach. Master’s thesis, The American University in Cairo, New Cairo, Egypt. [Google Scholar]
- Faris, Hossam, Ruba Abukhurma, Waref Almanaseer, Mohammed Saadeh, Antonio M. Mora, Pedro A. Castillo, and Ibrahim Aljarah. 2020. Improving financial bankruptcy prediction in a highly imbalanced class distribution using oversampling and ensemble learning: A case from the Spanish market. Progress in Artificial Intelligence 9: 31–53. [Google Scholar] [CrossRef]
- Feng, Ji, Yi-Xuan Xu, Yuan Jiang, and Zhi-Hua Zhou. 2020. Soft Gradient Boosting Machine. arXiv arXiv:2006.04059. [Google Scholar]
- Ferreira, Artur J., and Mário A. T. Figueiredo. 2012. Boosting Algorithms: A Review of Methods, Theory, and Applications. In Ensemble Machine Learning. New York: Springer. [Google Scholar] [CrossRef]
- Friede, Gunnar, Timo Busch, and Alexander Bassen. 2015. ESG and financial performance: Aggregated evidence from more than 2000 empirical studies. Journal of Sustainable Finance and Investment 5: 210–33. [Google Scholar] [CrossRef]
- Furman, Jason, and Joseph E. Stiglitz. 1998. Economic crises: Evidence and Insights from East Asia. Brookings Papers on Economic Activity 1998: 1–135. [Google Scholar] [CrossRef]
- Gaber, Tarek, Alaa Tharwat, Aboul Ella Hassanien, and Vaclav Snasel. 2016. Biometric cattle identification approach based on Weber’s Local Descriptor and AdaBoost classifier. Computers and Electronics in Agriculture 122: 55–66. [Google Scholar] [CrossRef]
- Ganatra, Amit P., and Yogesh P. Kosta. 2010. Comprehensive Evolution and Evaluation of Boosting. International Journal of Computer Theory and Engineering 2: 931–36. [Google Scholar] [CrossRef]
- Gaytan, Jesus Cuauhtemoc Tellez, Karamath Ateeq, Aqila Rafiuddin, Haitham M. Alzoubi, Taher M. Ghazal, Tariq Ahamed Ahanger, Sunita Chaudhary, and G. K. Viju. 2022. AI-Based Prediction of Capital Structure: Performance Comparison of ANN SVM and LR Models. Computational Intelligence and Neuroscience 2022: 8334927. [Google Scholar] [CrossRef]
- Georganos, Stefanos, Tais Grippa, Sabine Vanhuysse, Moritz Lennert, Michal Shimoni, and Eleonore Wolff. 2018. Very High Resolution Object-Based Land Use-Land Cover Urban Classification Using Extreme Gradient Boosting. IEEE Geoscience and Remote Sensing Letters 15: 607–11. [Google Scholar] [CrossRef]
- Habib, Ahmed Mohamed. 2023. Do business strategies and environmental, social, and governance (ESG) performance mitigate the likelihood of financial distress? A multiple mediation model. Heliyon 9: e17847. [Google Scholar] [CrossRef]
- He, Shaofu, and Fei Li. 2021. Artificial Neural Network Model in Spatial Analysis of Geographic Information System. Mobile Information Systems 2021: 1166877. [Google Scholar] [CrossRef]
- Hegde, Girish, Vishwanath R. Hulipalled, and Jay B. Simha. 2023. Data driven algorithm selection to predict agriculture commodities price. International Journal of Electrical and Computer Engineering 13: 4671–82. [Google Scholar] [CrossRef]
- Helminen, Niilo. 2023. ESG Momentum and Stock Performance in U.S. During 2018–2023 Does Market Reward Companies for Improving ESG Scores? LUT University. Available online: https://urn.fi/URN:NBN:fi-fe20231204150949 (accessed on 2 November 2024).
- Hjaltason, Gísli R., and Hanan Samet. 1999. Distance browsing in spatial databases. ACM Transactions on Database Systems 24: 265–318. [Google Scholar] [CrossRef]
- Hu, Yi-Chung. 2020. A multivariate grey prediction model with grey relational analysis for bankruptcy prediction problems. Soft Computing 24: 4259–68. [Google Scholar] [CrossRef]
- Hwase, Tesyon Korjo, and Abdul Joseph Fofanah. 2021. Machine Learning Model Approaches for Price Prediction in Coffee Market Using Linear Regression, XGB, and LSTM Techniques. International Journal of Scientific Research in Science and Technology 8: 10–48. [Google Scholar] [CrossRef]
- Jacobs, Michael, Jr. 2024. Benchmarking alternative interpretable machine learning models for corporate probability of default. Data Science in Finance and Economics 4: 1–52. [Google Scholar] [CrossRef]
- Jadhav, Swati, Hongmei He, and Karl Jenkins. 2018. Information gain directed genetic algorithm wrapper feature selection for credit rating. Applied Soft Computing Journal 69: 541–53. [Google Scholar] [CrossRef]
- Jafarzadeh, Hamid, Masoud Mahdianpari, Eric Gill, Fariba Mohammadimanesh, and Saeid Homayouni. 2021. Bagging and Boosting Ensemble Classifiers for Classification of Comparative Evaluation. Remote Sensing 13: 4405. [Google Scholar] [CrossRef]
- Kadkhodaei, Hamid Reza, Amir Masoud Eftekhari Moghadam, and Mehdi Dehghan. 2020. HBoost: A heterogeneous ensemble classifier based on the Boosting method and entropy measurement. Expert Systems with Applications 157: 113482. [Google Scholar] [CrossRef]
- Kalaiarasi, Ganesan, and Sureshbabu Maheswari. 2021. Deep proximal support vector machine classifiers for hyperspectral images classification. Neural Computing and Applications 33: 13391–415. [Google Scholar] [CrossRef]
- Kalantar, Bahareh, Biswajeet Pradhan, Seyed Amir Naghibi, Alireza Motevalli, and Shattri Mansor. 2018. Assessment of the effects of training data selection on the landslide susceptibility mapping: A comparison between support vector machine (SVM), logistic regression (LR) and artificial neural networks (ANN). Geomatics, Natural Hazards and Risk 9: 49–69. [Google Scholar] [CrossRef]
- Kaleem, Mehwish, Hashim bin Jusoh, Hassan Raza, Misbah Sadiq, and Ahmad Husni bin Hamzah. 2024. A Machine Learning Approach to Predict Bankruptcy in Chinese Companies with ESG Integration. Pakistan Journal of Commerce and Social Sciences 18: 335–57. [Google Scholar]
- Karatzoglou, Alexandros, Alex Smola, Kurt Hornik, and Achim Zeileis. 2004. A survey on patents of typical robust image watermarking. Advanced Materials Research 271–273: 389–93. [Google Scholar] [CrossRef]
- Kim, Sang, and Zhichuan Li. 2021. Understanding the impact of esg practices in corporate finance. Sustainability 13: 3746. [Google Scholar] [CrossRef]
- Kovacova, Maria, and Jana Kliestikova. 2017. Modelling bankruptcy prediction models in Slovak companies. SHS Web of Conferences 39: 01013. [Google Scholar] [CrossRef]
- Kristanti, Farida Titik, Deannes Isynuwardhana, and Sri Rahayu. 2019. Market concentration, diversification, and financial distress in the Indonesian banking system. Jurnal Keuangan Dan Perbankan 23: 514–24. [Google Scholar] [CrossRef]
- Kuang, Li, Han Yan, Yujia Zhu, Shenmei Tu, and Xiaoliang Fan. 2019. Predicting duration of traffic accidents based on cost-sensitive Bayesian network and weighted K-nearest neighbor. Journal of Intelligent Transportation Systems: Technology, Planning, and Operations 23: 161–74. [Google Scholar] [CrossRef]
- Kumar, Mukesh, Karan Bajaj, Bhisham Sharma, and Sushil Narang. 2021. A Comparative Performance Assessment of Optimized Multilevel Ensemble Learning Model with Existing Classifier Models. Big Data 10: 371–87. [Google Scholar] [CrossRef] [PubMed]
- Laeven, Luc, and Ross Levine. 2009. Bank governance, regulation and risk taking. Journal of Financial Economics 93: 259–75. [Google Scholar] [CrossRef]
- Le, Tuong, Minh Thanh Vo, Bay Vo, Mi Young Lee, and Sung Wook Baik. 2019. A Hybrid Approach Using Oversampling Technique and Cost-Sensitive Learning for Bankruptcy Prediction. Complexity 2019: 8460934. [Google Scholar] [CrossRef]
- Lim, Se Hun, and Kyungdoo Nam. 2006. Artificial Neural Network Modeling in Forecasting a Successful Implementation of ERP Systems. International Journal of Computational Intelligence Research 2: 115–19. [Google Scholar] [CrossRef]
- Lim, Tristan. 2024. Environmental, social, and governance (ESG) and artificial intelligence in finance: State-of-the-art and research takeaways. Artificial Intelligence Review 57: 76. [Google Scholar] [CrossRef]
- Lin, Wei-Chao, Yu-Hsin Lu, and Chih-Fong Tsai. 2018. Feature selection in single and ensemble learning-based bankruptcy prediction models. Expert Systems 36: e12335. [Google Scholar] [CrossRef]
- Lisin, Anton, Andrei Kushnir, Alexey G. Koryakov, Natalia Fomenko, and Tatyana Shchukina. 2022. Financial Stability in Companies with High ESG Scores: Evidence from North America Using the Ohlson O-Score. Sustainability 14: 479. [Google Scholar] [CrossRef]
- Liu, Yuting, and Minghua Lin. 2022. Research on The Influence of ESG Information Disclosure on Enterprise Financial Risk. Frontiers in Business, Economics and Management 5: 264–71. [Google Scholar] [CrossRef]
- Marom, Shaike, and Robert N. Lussier. 2014. A Business Success Versus Failure Prediction Model for Small Businesses in Israel. Business and Economic Research 4: 63. [Google Scholar] [CrossRef]
- Masanobu, Matsumaru, Kaneko Shoichi, Katagiri Hideki, and Kawanaka Takaaki. 2019. Bankruptcy prediction for Japanese corporations using support vector machine, artificial neural network, and multivariate discriminant analysis. International Journal of Industrial Engineering and Operations Management 1: 78–96. [Google Scholar] [CrossRef]
- Michalkova, Lucia, Katarina Valaskova, Katarina Frajtova Michalikova, and Andrea Drugau Constantin. 2018. The Holistic View of the Symptoms of Financial Health of Businesses. Paper presented at the Third International Conference on Economic and Business Management (FEBM 2018), Hohhot, China, October 20–22; Amsterdam: Atlantis Press, vol. 56, pp. 90–18. [Google Scholar] [CrossRef]
- Michela, Porta. 2023. The relationship between ESG Performance and Financial Stability: Are Sustainable Companies Less Likely to Fail? University of Vaasa-Vaasan yliopisto Research Repository. Available online: https://osuva.uwasa.fi/bitstream/handle/10024/16837/Uwasa_2024_Porta_Michela.pdf?sequence=2&isAllowed=y (accessed on 2 November 2024).
- Min, Jae H., and Young Chan Lee. 2005. Bankruptcy prediction using support vector machine with optimal choice of kernel function parameters. Expert Systems with Applications 28: 603–14. [Google Scholar] [CrossRef]
- Mishina, Yohei, Ryuei Murata, Yuji Yamauchi, Takayoshi Yamashita, and Hironobu Fujiyoshi. 2015. Boosted random forest. IEICE Transactions on Information and Systems E98-D: 1630–36. [Google Scholar] [CrossRef]
- Moreno, Sinvaldo Rodrigues, Ramon Gomes Silva, Matheus Henrique dal Molin Ribeiro, Naylene Fraccanabbia, Viviana Cocco Mariani, and Leandro dos Santos Coelho. 2020. Very Short-term Wind Energy Forecasting Based on Stacking Ensemble. Paper presented at the 14th Brazilian Computational Intelligence Meeting (CBIC), Belem, Brazil, November 16–20; pp. 1–22. [Google Scholar] [CrossRef]
- Mukarramah, Rifqatul, Dedy Atmajaya, and Lutfi Budi Ilmawan. 2021. Performance comparison of support vector machine (SVM) with linear kernel and polynomial kernel for multiclass sentiment analysis on twitter. ILKOM Jurnal Ilmiah 13: 168–74. [Google Scholar] [CrossRef]
- Muñoz-Izquierdo, Nora, María Del Mar Camacho-Miñano, María Jesús Segovia-Vargas, and David Pascual-Ezama. 2019. Is the external audit report useful for bankruptcy prediction? Evidence using artificial intelligence. International Journal of Financial Studies 7: 20. [Google Scholar] [CrossRef]
- Neumeyer, Pablo A., and Fabrizio Perri. 2005. Business cycles in emerging economies: The role of interest rates. Journal of Monetary Economics 52: 345–80. [Google Scholar] [CrossRef]
- Nguyen, Phong Thanh. 2021. Application Machine Learning in Construction Management. TEM Journal 10: 1385–89. [Google Scholar] [CrossRef]
- Ogachi, Daniel, Richard Ndege, Peter Gaturu, and Zeman Zoltan. 2020. Corporate Bankruptcy Prediction Model, a Special Focus on Listed Companies in Kenya. Journal of Risk and Financial Management 13: 47. [Google Scholar] [CrossRef]
- Özparlak, Gerçek, and Menevşe Özdemir Dilidüzgün. 2022. Corporate Bankruptcy Prediction Using Machine Learning Methods: The Case of the Usa. International Journal of Management Economics and Business 18: 1007–31. [Google Scholar] [CrossRef]
- Paramita, Adi Suryaputra, Indra Maryati, and Laura Mahendratta Tjahjono. 2022. Implementation of the K-Nearest Neighbor Algorithm for the Classification of Student Thesis Subjects. Journal of Applied Data Sciences 3: 128–36. [Google Scholar] [CrossRef]
- Pérez, Fernando A. Acosta, Gabriel E. Rodríguez Ortiz, Everson Rodríguez Muñiz, Fernando J. Ortiz Sacarello, Jee Eun Kang, and Daniel Rodriguez-Roman. 2020. Predicting Trip Cancellations and No-Shows in Paratransit Operations. Transportation Research Record 2674: 774–84. [Google Scholar] [CrossRef]
- Prokhorenkova, Liudmila, Gleb Gusev, Aleksandr Vorobev, Anna Veronika Dorogush, and Andrey Gulin. 2018. Catboost: Unbiased boosting with categorical features. In Advances in Neural Information Processing Systems. pp. 6638–48. Available online: https://api.semanticscholar.org/CorpusID:5044218 (accessed on 2 November 2024).
- Prusak, Błażej. 2018. Review of Research into Enterprise Bankruptcy Prediction in Selected Central and Eastern European Countries. International Journal of Financial Studies 6: 60. [Google Scholar] [CrossRef]
- Purnomo, Ariana Tulus, Ding Bing Lin, Tjahjo Adiprabowo, and Willy Fitra Hendria. 2021. Non-contact monitoring and classification of breathing pattern for the supervision of people infected by covid-19. Sensors 21: 3172. [Google Scholar] [CrossRef] [PubMed]
- Rainarli, E. 2019. The Comparison of Machine Learning Model to Predict Bankruptcy: Indonesian Stock Exchange Data. IOP Conference Series: Materials Science and Engineering 662: 052019. [Google Scholar] [CrossRef]
- Ramdani, Fatwa, and Muhammad Tanzil Furqon. 2022. The simplicity of XGBoost algorithm versus the complexity of Random Forest, Support Vector Machine, and Neural Networks algorithms in urban forest classification. F1000Research 11: 1069. [Google Scholar] [CrossRef]
- Ranjan, Adyant, and Guillermo Goldsztein. 2022. An Optimization of Machine Learning Approaches in the Forecasting of Global Financial Stability. Journal of Student Research 11: 1–15. [Google Scholar] [CrossRef]
- Rehman, Sana ur, Hani Baloch, and Abdul Qayyum. 2021. Anticipating Bankruptcy for Non-Financial Sector Firms: A Case of Pakistan. Journal of Accounting and Finance in Emerging Economies 7: 401–9. [Google Scholar] [CrossRef]
- Richardson, Bill, Sonny Nwankwo, and Susan Richardson. 1994. Understanding the Causes of Business Failure Crises: Generic Failure Types: Boiled Frogs, Drowned Frogs, Bullfrogs and Tadpoles. Management Decision 32: 9–22. [Google Scholar] [CrossRef]
- Rokach, Lior. 2016. Decision forest: Twenty years of research. Information Fusion 27: 111–25. [Google Scholar] [CrossRef]
- Saadaari, F. Saadaari, Daniel Mireku-Gyimah, and Boluwaji Muriana Olaleye. 2020. Development of a Stope Stability Prediction Model Using Ensemble Learning Techniques—A Case Study. Ghana Mining Journal 20: 18–26. [Google Scholar] [CrossRef]
- Sachs, Jeffrey, Aaron Tornell, and Andres Velasco. 1996. Financial Crises in Emerging Markets in 1995. Brookings Papers on Economic Activity 1: 147–215. [Google Scholar] [CrossRef]
- Sahin, Emrehan Kutlug. 2020. Assessing the predictive capability of ensemble tree methods for landslide susceptibility mapping using XGBoost, gradient boosting machine, and random forest. SN Applied Sciences 2: 1308. [Google Scholar] [CrossRef]
- Santoso, Nurilen Wuri, Ratih Kusumawardhani, and Alfiatul Maulida. 2024. Comparative Analysis Of The Altman, Ohlson, And Zmijewski Models To Predict Financial Distress During The Covid-19 Pandemic. Maksimum 14: 13–21. [Google Scholar] [CrossRef]
- Sarker, Iqbal H., Faisal Faruque, Hamed Alqahtani, and Asra Kalim. 2020. K-Nearest Neighbor Learning based Diabetes Mellitus Prediction and Analysis for eHealth Services. EAI Endorsed Transactions on Scalable Information Systems 7: e4. [Google Scholar] [CrossRef]
- Shekar, Shetty, Musa Mohamed, and Brédart Xavier. 2022. Bankruptcy Prediction Using Machine Learning Methods. Information Society. 1: 56–67. [Google Scholar] [CrossRef]
- Shin, Kyung Shik, and Kyoung Jun Lee. 2004. Neuro-genetic approach for bankruptcy prediction modeling. In Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). Berlin and Heidelberg: Springer, vol. 3214, pp. 646–52. [Google Scholar] [CrossRef]
- Smiti, Salima, and Makram Soui. 2020. Bankruptcy Prediction Using Deep Learning Approach Based on Borderline SMOTE. Information Systems Frontiers 22: 1067–83. [Google Scholar] [CrossRef]
- Smiti, Salima, Makram Soui, and Khaled Ghedira. 2022. Tri-XGBoost Model: An Interpretable Semi-supervised Approach for Addressing Bankruptcy Prediction. Authorea, 1–44. [Google Scholar] [CrossRef]
- Srebro, Bosiljka, Bojan Mavrenski, Vesna Bogojević Arsić, Snežana Knežević, Marko Milašinović, and Jovan Travica. 2021. Bankruptcy risk prediction in ensuring the sustainable operation of agriculture companies. Sustainability 13: 7712. [Google Scholar] [CrossRef]
- Srinivas, T. Aditya Sai, and Munnuru Bharathi. 2024. Financial Oracle: Unlocking Credit Scores. Journal of IoT and Machine Learning 1: 9–11. [Google Scholar] [CrossRef]
- Sun, Bin, Wei Cheng, Prashant Goswami, and Guohua Bai. 2018. Short-term traffic forecasting using self-adjusting k-nearest neighbours. IET Intelligent Transport Systems 12: 41–48. [Google Scholar] [CrossRef]
- Sun, Jie, Hui Li, Qing Hua Huang, and Kai Yu He. 2014. Predicting financial distress and corporate failure: A review from the state-of-the-art definitions, modeling, sampling, and featuring approaches. Knowledge-Based Systems 57: 41–56. [Google Scholar] [CrossRef]
- Tang, Yajiao, Junkai Ji, Yulin Zhu, Shangce Gao, Zheng Tang, and Yuki Todo. 2019. A Differential Evolution-Oriented Pruning Neural Network Model for Bankruptcy Prediction. Complexity 2019: 8682124. [Google Scholar] [CrossRef]
- Torlay, Laurent, Marcela Perrone-Bertolotti, Elizabeth Thomas, and Monica Baciu. 2017. Machine learning–XGBoost analysis of language networks to classify patients with epilepsy. Brain Informatics 4: 159–69. [Google Scholar] [CrossRef] [PubMed]
- Uğurlu, Mine, and Hakan Aksoy. 2006. Prediction of corporate financial distress in an emerging market: The case of Turkey. Cross Cultural Management: An International Journal 13: 277–95. [Google Scholar] [CrossRef]
- Vanhaeren, Thomas, Federico Divina, Miguel García-Torres, Francisco Gómez-Vela, Wim Vanhoof, and Pedro Manuel Martínez García. 2020. A comparative study of supervised machine learning algorithms for the prediction of long-range chromatin interactions. Genes 11: 985. [Google Scholar] [CrossRef]
- Wang, Falin, Gang Yuan, Chaoyang Guo, and Zhinong Li. 2022. Research on fault diagnosis method of aviation cable based on improved Adaboost. Advances in Mechanical Engineering 14: 1–14. [Google Scholar] [CrossRef]
- Wang, Mingjing, Huiling Chen, Huaizhong Li, Zhennao Cai, Xuehua Zhao, Changfei Tong, Jun Li, and Xin Xu. 2017. Grey wolf optimization evolving kernel extreme learning machine: Application to bankruptcy prediction. Engineering Applications of Artificial Intelligence 63: 54–68. [Google Scholar] [CrossRef]
- Wang, Nannan, Dayao Li, Dengfeng Cui, and Xiaolong Ma. 2022. Governance disclosure and corporate sustainable growth: Evidence from China. Frontiers in Environmental Science 10: 1015764. [Google Scholar] [CrossRef]
- Wyner, Abraham J., Matthew Olson, Justin Bleich, and David Mease. 2017. Explaining the success of adaboost and random forests as interpolating classifiers. Journal of Machine Learning Research 18: 1–33. [Google Scholar]
- Xu, Yuhong, Zhiwen Yu, Wenming Cao, and C. L. Philip Chen. 2023. A Novel Classifier Ensemble Method Based on Subspace Enhancement for High-Dimensional Data Classification. IEEE Transactions on Knowledge and Data Engineering 35: 16–30. [Google Scholar] [CrossRef]
- Yotsawat, Wirot, Kanyalag Phodong, Thawatchai Promrat, and Pakaket Wattuya. 2023. Bankruptcy prediction model using cost-sensitive extreme gradient boosting in the context of imbalanced datasets. International Journal of Electrical and Computer Engineering 13: 4683–91. [Google Scholar] [CrossRef]
- Yotsawat, Wirot, Pakaket Wattuya, and Anongnart Srivihok. 2021. Improved credit scoring model using XGBoost with Bayesian hyper-parameter optimization. International Journal of Electrical and Computer Engineering 11: 5477–87. [Google Scholar] [CrossRef]
- Youn, Hyewon, and Zheng Gu. 2010. Predict US Restaurant Firm Failures: The Artificial Neural Network Model versus Logistic Regression Model. Tourism and Hospitality Research 10: 171–87. [Google Scholar] [CrossRef]
- Zhang, Guoqiang, Michael Y. Hu, B. Eddy Patuwo, and Daniel C. Indro. 1999. Artificial neural networks in bankruptcy prediction: General framework and cross-validation analysis. European Journal of Operational Research 116: 16–32. [Google Scholar] [CrossRef]
- Zhang, Wei, Xiaohui Chen, Yueqi Liu, and Qian Xi. 2020. A Distributed Storage and Computation K-Nearest Neighbor Algorithm Based Cloud-Edge Computing for Cyber-Physical-Social Systems. IEEE Access 8: 50118–30. [Google Scholar] [CrossRef]
- Zhao, Changhong, Yu Guo, Jiahai Yuan, Mengya Wu, Daiyu Li, Yiou Zhou, and Jiangang Kang. 2018. ESG and corporate financial performance: Empirical evidence from China’s listed power generation companies. Sustainability 10: 2607. [Google Scholar] [CrossRef]
- Zizi, Youssef, Amine Jamali-Alaoui, Badreddine El Goumi, Mohamed Oudgou, and Abdeslam El Moudden. 2021. An optimal model of financial distress prediction: A comparative study between neural networks and logistic regression. Risks 9: 200. [Google Scholar] [CrossRef]
- Zou, Yao, Changchun Gao, and Han Gao. 2022. Business Failure Prediction Based on a Cost-Sensitive Extreme Gradient Boosting Machine. IEEE Access 10: 42623–39. [Google Scholar] [CrossRef]
Algorithm | Pros | Cons | Application in Study |
---|---|---|---|
Logistic Regression | Simple, interpretable, and effective for linear relationships | May fail with non-linear relationships, prone to overfitting if not regularized | Used for basic binary classification of bankruptcy |
K-Nearest Neighbors (KNNs) | Simple, effective for non-linear relationships | Sensitive to the local data structure, high computational cost for large datasets | Employed for its flexibility in capturing complex, varied relationships |
Decision Trees | Easy to interpret, can handle both numerical and categorical data | Prone to overfitting, can create overly complex trees that do not generalize well | Utilized for their interpretability and effectiveness in categorical data handling |
Support Vector Machine (Linear) | Effective in high-dimensional spaces, memory-efficient | Requires feature-scaling, not suitable for larger datasets | Applied for linearly separable data, used primarily for classification tasks |
Support Vector Machine (Non-Linear) | Can model non-linear relationships, robust against overfitting in high-dimensional spaces | Computationally intensive, requires careful tuning of parameters | Used to handle datasets with complex decision boundaries |
Artificial Neural Networks (ANNs) | Highly flexible, can model complex non-linear relationships | “Black box” nature, computationally expensive, requires large datasets | Deployed for their ability to learn and model non-linear and complex relationships effectively |
Random Forest | Handles overfitting well, effective for large datasets, provides feature importance | Can be slow to generate predictions if consisting of many trees | Used for its robustness and efficiency in handling different types of data |
Gradient Boosting | Often provides predictive accuracy that cannot be beaten, very flexible | Can overfit on noisy datasets, sensitive to outliers, computationally expensive | Applied for its strength in sequential correction of predecessors’ errors |
XGBoost | Handles large datasets well, provides feature importance, efficient and flexible | Can overfit if not correctly tuned, complex parameter tuning required | Utilized for its efficiency in large datasets and high performance in bankruptcy prediction |
AdaBoost | Improves classification accuracy, combines multiple weak learners to form a strong learner | Sensitive to noisy data and outliers, can overfit if the weak learners are too complex | Employed for its ability to adaptively focus on misclassified instances |
CatBoost | Excels with categorical data without extensive data preprocessing, reduces overfitting | Less interpretable compared to simpler models, parameter tuning can be complex | Used for its advanced handling of categorical features and robustness against overfitting |
Group | Failed Firm | Non-Failed Firm | ||
---|---|---|---|---|
Variable | Mean | Std. Deviation | Mean | Std. Deviation |
ESG | 34.532 | 22.247 | 53.848 | 21.425 |
P1 | −1692 | 11511 | −2.061 | 173 |
P7 | −0.447 | 1.604 | 0.102 | 0.535 |
P8 | −0.190 | 0.990 | 0.109 | 0.419 |
L2 | −1.700 | 1.946 | −1.588 | 2.783 |
L3 | 0.500 | 0.883 | 1.209 | 1.220 |
AC1 | 41.16 | 124.37 | 88.83 | 1466.57 |
ESG | P1 | P7 | P8 | L2 | L3 | AC1 | |
---|---|---|---|---|---|---|---|
ESG | 1 | ||||||
P1 | 0.046 | 1 | |||||
P7 | −0.029 | 0.079 | 1 | ||||
P8 | 0.021 | 0.024 | −0.628 | 1 | |||
L2 | −0.083 | −0.031 | 0.065 | 0.026 | 1 | ||
L3 | −0.108 | −0.039 | −0.003 | −0.009 | 0.08 | 1 | |
AC1 | −0.058 | −0.001 | −0.004 | −0.006 | −0.069 | 0.014 | 1 |
Model(s) | Precision | Recall | F1-Score | Overall Accuracy | |||
---|---|---|---|---|---|---|---|
Failed | Non-Failed | Failed | Non-Failed | Failed | Non-Failed | ||
Logistic Regression | 0.92 | 0.94 | 0.94 | 0.92 | 0.93 | 0.93 | 93.39% |
K-Nearest Neighbors | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 99.15% |
Decision Trees | 0.99 | 0.98 | 0.98 | 0.99 | 0.99 | 0.99 | 98.51% |
SVMs (Linear Kernel) | 0.93 | 0.95 | 0.95 | 0.92 | 0.94 | 0.94 | 93.60% |
SVMs (RBF Kernel) | 0.97 | 0.93 | 0.93 | 0.97 | 0.95 | 0.95 | 95.10% |
Neural Networks | 0.97 | 0.98 | 0.98 | 0.97 | 0.98 | 0.98 | 97.65% |
Random Forest | 1 | 1 | 1 | 1 | 1 | 1 | 99.57% |
Gradient Boosting | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 98.93% |
XgBoost Classifier | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 0.99 | 99.15% |
AdaBoost Classifier | 0.98 | 0.95 | 0.95 | 0.98 | 0.96 | 0.96 | 96.38% |
CatBoost Classifier | 1 | 1 | 1 | 1 | 1 | 1 | 99.57% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Kaleem, M.; Raza, H.; Ashraf, S.; Almeida, A.M.; Machado, L.P. Does ESG Predict Business Failure in Brazil? An Application of Machine Learning Techniques. Risks 2024, 12, 185. https://doi.org/10.3390/risks12120185
Kaleem M, Raza H, Ashraf S, Almeida AM, Machado LP. Does ESG Predict Business Failure in Brazil? An Application of Machine Learning Techniques. Risks. 2024; 12(12):185. https://doi.org/10.3390/risks12120185
Chicago/Turabian StyleKaleem, Mehwish, Hassan Raza, Sumaira Ashraf, António Martins Almeida, and Luiz Pinto Machado. 2024. "Does ESG Predict Business Failure in Brazil? An Application of Machine Learning Techniques" Risks 12, no. 12: 185. https://doi.org/10.3390/risks12120185
APA StyleKaleem, M., Raza, H., Ashraf, S., Almeida, A. M., & Machado, L. P. (2024). Does ESG Predict Business Failure in Brazil? An Application of Machine Learning Techniques. Risks, 12(12), 185. https://doi.org/10.3390/risks12120185