Identification of Investment-Ready SMEs: A Machine Learning Framework to Enhance Equity Access and Economic Growth
Abstract
1. Introduction and Literature Review
1.1. Investment Readiness and SME Financing
1.2. Research Question and Hypothesis
- RQ1. Can machine learning models enhance the ability to correctly identify investment-ready SMEs given the high imbalance in questionnaire data, the significant noise created by the questionnaire respondents, and the complexity and volume of such data in the EU’s SAFE questionnaire?
- H1. Machine learning models demonstrate a greater balanced accuracy compared to traditional classifiers in identifying investment-ready SMEs within highly imbalanced datasets.
2. Data and Research Methodology
2.1. Data Collection and Pre-Processing
- Innovative: These are the firms that have reported new developments, such as introducing a new product or service to the market, implementing a new production process or method, adopting new management practices, or exploring new ways of selling goods or services.
- Fast-growing: These are the cases where the annual turnover increases by more than 20%.
- Open to equity financing: Whether it reported equity as either a relevant funding source or one used in the past six months.
2.2. Machine Learning Algorithms
2.2.1. Logistic Regression
2.2.2. K-Nearest Neighbors
2.2.3. Random Forest
2.2.4. Support Vector Machines
2.2.5. Naïve Bayes
2.2.6. AdaBoost
2.2.7. Easy Ensemble
2.2.8. Balanced Bagging Classifier
2.2.9. Gradient Boosting Trees
2.2.10. Cross-Validation
2.2.11. Model Selection
- Logistic Regression;
- K-Nearest Neighbors;
- Random Forest;
- Support Vector Machines with RBF and Linear Kernel;
- Naïve Bayes Classifier;
- AdaBoost;
- Easy Ensemble;
- Balanced Bagging Classifier;
- Gradient Boosting Trees.
2.2.12. Data Imbalance
2.3. Forecasting Performance Metrics
- True Positives (TP): The number of instances in which the model correctly predicts the positive class (in our case, the SME to be predicted as investment-ready, class 1).
- True Negatives (TN): The number of instances in which the model correctly predicts the negative class (in our case, the SME to be predicted as non-investment-ready, class 0).
- False Positives (FP): The number of instances in which the model incorrectly predicts the positive class when the actual class is negative.
- False Negatives (FN): The number of instances in which the model incorrectly predicts the negative class when the actual is positive.
2.3.1. Precision
2.3.2. Recall (Sensitivity/Specificity)
2.3.3. F1-Score
2.3.4. Balanced Accuracy
2.3.5. Receiver Operating Characteristic Area Under Curve (ROC AUC)
2.3.6. Feature Engineering and Exploratory Analysis
2.4. Variable Importance Measure (VIM) and Shapley Additive Explanation (SHAP)
3. Empirical Results
- 267 TP predictions (investment-ready) out of 374, or 71.3%.
- 1440 TN predictions (not investment-ready) out of 1814, or 79.3%.
- 374 FP predictions (predicted as investment-ready but are not), or 20.6%.
- 107 FN predictions (predicted as not investment-ready and are not), or 28.6%.
- The most important predictor appears to be the firm’s “confidence in negotiations with equity investors or venture capital firms”. High levels of negotiation confidence likely reflect firm preparation, understanding of investor expectations, and stronger business fundamentals, all of which are critical traits for attracting external investment.
- The second most important predictor is “financing growth”, which indicates that the firms that are actively seeking and managing financial growth tend to be investment-ready. This highlights the importance of proactive financial planning and scaling strategies as indicators of a firm’s investment potential.
- “Factors in the future of financing of the firm” is ranked third in top predictors. This variable points to the importance of future financial planning. Investors may favor firms that not only demonstrate current performance but also show foresight in securing future financing.
- “External financing factors” including market conditions or access to funding channels is also an essential feature that influences the model’s decision-making process. Firms capable of navigating external factors’ influences may be more successful in attracting external investors.
- “Autonomous organization type”, relating to the structure of the organization, plays the fourth-most-important role in predicting an investment-ready SME. Autonomous firms might be more agile and able to innovate and thus they draw investors’ interest.
- “Willingness of investors to invest in the enterprise”, relating to the investors’ sentiment towards the firm.
3.1. SHAP Values
- Confidence in negotiations: High values of confidence—as expected—indicate an investment-ready SME.
- Importance of factors in the future financing of the firm: Declining future financing prospects reduce the investment readiness of an SME.
- Not taking autonomous financial decisions due to being a subsidiary: The results show that high values of this variable significantly decrease investment readiness for an SME.
- Willingness of external investors to invest in a specific SME: Perceived persistence of the interest of external investors in specific SMEs seems to increase investment readiness.
3.2. Cost Function
- A DeLong test on the ROC AUCs produced by the two best-performing algorithms, the XGBoost and the logit. The hypotheses tested are the following:H0: The two algorithms produce AUCs that are not statistically different.H1: The two algorithms produce AUCs that are statistically different.The test is performed on the same validation set for both models. And the z-score is −0.4462 with a p-value of 0.6554, providing statistical evidence that the two algorithms do not significantly differ in terms of their ROC AUCs. The relevant confidence intervals are the following:XGBoost AUC at 95% probability produces a confidence interval of [0.783, 0.835].Logit AUC at 95% probability produces a confidence interval of [0.781, 0.833].Thus, with the DeLong test, we did not find evidence that one model produces a statistically better ROC AUC than the other.
- 2.
- We performed a McNemar test, which is a non-parametric test applied on categorical observations to analyze the differences between two related groups, usually a 2 × 2 matrix with dichotomous variables. On the main diagonal, we have the count of instances where the two models agree, and on the secondary diagonal, we have the count of observations where the two models disagree.The differences between the two methods’ performances are the following (the secondary diagonal values):
- XGBoost correct, logit wrong: 110.
- XGBoost wrong, logit correct: 56.
4. Discussion
- We created a universal EU model using the large dataset collected by the Survey on Access to Finance of Enterprises (SAFE), instead of using a country-specific one.
- We addressed the problem of class imbalance using weights and appropriate methods (e.g., Easy Ensemble, Balance Bagging).
- We created and implemented a cost-sensitive metric (false negatives weighted five times the false positives in our model) to find the optimal model, reflecting economic consequences, instead of using just simple accuracy metrics. Consequently, Gradient Boosting is not just the optimal model statistically, but more importantly also economically.
- Venture capital and private equity early screening often relies on subjective judgments and/or personal networks. Perceived weaknesses may lead to early SME’s rejection. The proposed ML model, in contrast, evaluates firms on data-driven criteria, reducing the risk of bias or limited information.
- The proposed ML-based model (and, in general, ML approaches) can scan a large dataset like SAFE, identifying IR firms outside the investor circle thus expanding the pool of potential opportunities.
- The proposed ML-driven classification sorts SMEs by underlining those with higher predicted IR. Investors can then focus their in-depth analysis on a smaller set of IR-identified SMEs—cutting costs and improving efficiency.
- Finally, IR in this paper is defined through concrete features like innovation, growth, and openness to equity. This means that the model can function not only as a filter to identify IR but also as a diagnostic guide, showing SMEs whose capabilities they can strengthen to improve their funding prospects.
5. Conclusions
Author Contributions
Funding
Conflicts of Interest
Appendix A
# | Variable Name | Question |
---|---|---|
1 | Autonomous | How would you characterise your enterprise? |
2 | MainActivity | What is the main activity of your enterprise? |
3 | Age | In which year was your enterprise first registered? |
4 | Ownership | Who owns the largest stake in your enterprise? |
5 | profitable | Firms that report, simultaneously, higher turnover and profits, lower or no interest expenses and lower or no debt-to-assets ratio. |
6 | MostImportantProblemFacing | Which was the most important problem faced by for your enterprise during the {previous quarter and current quarter} OR {current quarter}? |
7 | ExternalFinancing_Factors#GeneralEconomicOutlookToObtainExternalFinancing | For each of the following factors, would you say that they have improved, remained unchanged or deteriorated during the {previous quarter and current quarter} OR {current quarter}? General economic outlook, insofar as it affects the availability of external financing |
8 | ExternalFinancing_Factors#AccessToPublicFinancialSupportIncludingGuarantees | For each of the following factors, would you say that they have improved, remained unchanged or deteriorated during the {previous quarter and current quarter} OR {current quarter}?. Access to public financial support, including guarantees |
9 | ExternalFinancing_Factors#YourFirmSpecificOutlook | For each of the following factors, would you say that they have improved, remained unchanged or deteriorated during the {previous quarter and current quarter} OR {current quarter}?. Your enterprise-specific outlook with respect to your sales and profitability or business plan |
10 | ExternalFinancing_Factors#YourFirmOwnCapital | For each of the following factors, would you say that they have improved, remained unchanged or deteriorated during the {previous quarter and current quarter} OR {current quarter}? Your enterprise’s own capital |
11 | ExternalFinancing_Factors#YourFirmCreditHistory | For each of the following factors, would you say that they have improved, remained unchanged or deteriorated during the {previous quarter and current quarter} OR {current quarter}? Your enterprise’s credit history |
12 | ExternalFinancing_Factors#WillingnessOfBanksToProvideCredit | For each of the following factors, would you say that they have improved, remained unchanged or deteriorated during the {previous quarter and current quarter} OR {current quarter}? Willingness of banks to provide credit to your enterprise |
13 | ExternalFinancing_Factors#WillingnessOfBusinessPartnersToProvideTradeCredit | For each of the following factors, would you say that they have improved, remained unchanged or deteriorated during the {previous quarter and current quarter} OR {current quarter}? Willingness of business partners to provide trade credit |
14 | ExternalFinancing_Factors#WillingnessOfInvestorsToInvestInYourEnterprise | For each of the following factors, would you say that they have improved, remained unchanged or deteriorated during the {previous quarter and current quarter} OR {current quarter}? Willingness of investors to invest in your enterprise |
15 | FirmGrowth#InTermsOfEmploymentRegardingTheNumberOfFullTimeOrFullTimeEquivalentEmployees | Over the last three years (<year>-<year>), how much did your firm grow on average per year? (in terms of employment regarding the number of full-time or full-time equivalent employees) |
16 | ConfidenceInNegotiations#WithBanks | Do you feel confident talking about financing with banks and that you will obtain the desired results? And how about with equity investors/venture capital enterprises? With banks |
17 | ConfidenceInNegotiations#WithEquityInvestorsOrVentureCapitalFirms | Do you feel confident talking about financing with banks and that you will obtain the desired results? And how about with equity investors/venture capital enterprises? With equity investors/venture capital enterprises |
18 | FinancingGrowth_Instruments#FinancingGrowth_Instruments | If you need external financing to realise your growth ambitions, what type of external financing would you prefer most? |
19 | FinancingGrowth_Amount#FinancingGrowth_Amount | If you need external financing to realise your growth ambitions over the next two to three years what amount of financing would you aim to obtain? |
20 | FinancingGrowth_TheMostImportantLimitation | What do you see as the most important limiting factor to get this financing? |
21 | ExternalFinancing_Expectations#InternalFounds | Looking ahead, for each of the following types of financing available to your enterprise, please indicate whether you think their availability will improve, deteriorate or remain unchanged over the next {two quarters} OR {quarter}. Retained earnings or sale of assets/internal funds |
22 | ExternalFinancing_Expectations#BankLoans | Looking ahead, for each of the following types of financing available to your enterprise, please indicate whether you think their availability will improve, deteriorate or remain unchanged over the next {two quarters} OR {quarter}. Bank loans (excluding overdraft and credit lines) |
23 | ExternalFinancing_Expectations#EquityInvestments | Looking ahead, for each of the following types of financing available to your enterprise, please indicate whether you think their availability will improve, deteriorate or remain unchanged over the next {two quarters} OR {quarter}. Equity capital |
24 | ExternalFinancing_Expectations#TradeCredit | Looking ahead, for each of the following types of financing available to your enterprise, please indicate whether you think their availability will improve, deteriorate or remain unchanged over the next {two quarters} OR {quarter}. Trade credit |
25 | ExternalFinancing_Expectations#DebtSecuritiesIssued | Looking ahead, for each of the following types of financing available to your enterprise, please indicate whether you think their availability will improve, deteriorate or remain unchanged over the next {two quarters} OR {quarter}. Debt securities issued |
26 | ExternalFinancing_Expectations#Other | Looking ahead, for each of the following types of financing available to your enterprise, please indicate whether you think their availability will improve, deteriorate or remain unchanged over the next {two quarters} OR {quarter}. Other |
27 | ExternalFinancing_Expectations#CreditLineBankOverdraftOrCreditCardsOverdraft | Looking ahead, for each of the following types of financing available to your enterprise, please indicate whether you think their availability will improve, deteriorate or remain unchanged over the next {two quarters} OR {quarter}. Credit line, bank overdraft or credit cards overdraft |
28 | ImportanceOfFactorsInTheFutureFinancingOfTheFirm#GuaranteesForLoans | On a scale of 1–10, where 10 means it is extremely important and 1 means it is not at all important, how important are each of the following factors for your enterprise’s financing in the future? Guarantees for loans |
29 | ImportanceOfFactorsInTheFutureFinancingOfTheFirm#MeasuresToFacilitateEquityInvestments | On a scale of 1–10, where 10 means it is extremely important and 1 means it is not at all important, how important are each of the following factors for your enterprise’s financing in the future? Measures to facilitate investments |
30 | ImportanceOfFactorsInTheFutureFinancingOfTheFirm#ExportCreditsOrGuarantees | On a scale of 1–10, where 10 means it is extremely important and 1 means it is not at all important, how important are each of the following factors for your enterprise’s financing in the future? Export credits or guarantees |
31 | ImportanceOfFactorsInTheFutureFinancingOfTheFirm#TaxIncentives | On a scale of 1–10, where 10 means it is extremely important and 1 means it is not at all important, how important are each of the following factors for your enterprise’s financing in the future? Tax incentives |
32 | ImportanceOfFactorsInTheFutureFinancingOfTheFirm#BusinessSupportServices | On a scale of 1–10, where 10 means it is extremely important and 1 means it is not at all important, how important are each of the following factors for your enterprise’s financing in the future? Business support services |
33 | ImportanceOfFactorsInTheFutureFinancingOfTheFirm#MakingExistingPublicMeasuresEasierToObtain | On a scale of 1–10, where 10 means it is extremely important and 1 means it is not at all important, how important are each of the following factors for your enterprise’s financing in the future? Making existing public measures easier to obtain |
34 | FirmIncomeGenerationIndicators#LabourCost | Have the following company indicators decreased, remained unchanged or increased during the {previous quarter and current quarter} OR {current quarter}? Labour costs (including social contributions) |
35 | FirmIncomeGenerationIndicators#OtherCost | Have the following company indicators decreased, remained unchanged or increased during the {previous quarter and current quarter} OR {current quarter}? Other costs (materials, energy, other) |
36 | FirmIncomeGenerationIndicators#InterestExpenses | Have the following company indicators decreased, remained unchanged or increased during the {previous quarter and current quarter} OR {current quarter}? Interest expenses |
37 | FirmIncomeGenerationIndicators#Profit | Have the following company indicators decreased, remained unchanged or increased during the {previous quarter and current quarter} OR {current quarter}? Profit |
38 | FirmIncomeGenerationIndicators#ProfitMargin | Have the following company indicators decreased, remained unchanged or increased during the {previous quarter and current quarter} OR {current quarter}? Profit Margin |
39 | FirmIncomeGenerationIndicators#DebtComparedToAssets | Have the following company indicators decreased, remained unchanged or increased during the {previous quarter and current quarter} OR {current quarter}? Debt compared to assets |
40 | FinancingApplied#ApplicationOfExternalFinancing_BankLoans | Have you applied for the following types of financing during the {previous quarter and current quarter} OR {current quarter}? Bank loans |
41 | FinancingApplied#ApplicationOfExternalFinancing_TradeCredit | Have you applied for the following types of financing during the {previous quarter and current quarter} OR {current quarter}? Trade credit |
42 | FinancingApplied#ApplicationOfExternalFinancing_OtherExternalFinancing | Have you applied for the following types of financing during the {previous quarter and current quarter} OR {current quarter}? Other external financing |
43 | FinancingApplied#ApplicationOfExternalFinancing_CreditLine | Have you applied for the following types of financing during the {previous quarter and current quarter} OR {current quarter}? Credit line |
44 | vulnerable | Firms that report, simultaneously, lower turnover, decreasing profits, higher interest expenses and higher or unchanged debt-to-assets ratio. |
45 | permid | Id of SME |
46 | wave | Survey wave |
47 | id | Survey unique id |
48 | CountryOfResidence | Country of SME |
49 | FirmSize | Size of Company |
50 | Date | Date of survey |
51 | InvestmentReady | Target variable |
# | Variable Name | Variable Type | Description Values | Freq/Count | Percentage |
---|---|---|---|---|---|
1 | Autonomous | Categorical | an autonomous profit-oriented enterprise | 7117 | 81.35 |
part of a profit-oriented enterprise (e.g., subsidiary* or branch) not taking autonomous financial decisions | 1444 | 16.6 | |||
a subsidiary of another enterprise | 178 | 2.03 | |||
a branch of another enterprise | 8 | 0.09 | |||
[DK/NA] | 2 | 0.02 | |||
2 | MainActivity | Categorical | Industry | 3109 | 35.54 |
Services | 2630 | 30.06 | |||
Trade | 2164 | 24.73 | |||
Construction | 846 | 9.67 | |||
3 | Age | Categorical | 10 years or more | 6799 | 77.71 |
5 years or more but less than 10 years | 1162 | 13.28 | |||
2 years or more but less than 5 years | 481 | 5.5 | |||
[DK/NA] | 203 | 2.32 | |||
Less than 2 years | 104 | 1.19 | |||
4 | Ownership | Categorical | Family or entrepreneurs | 4387 | 50.14 |
A natural person, one owner only | 1809 | 20.68 | |||
Other firms or business associates | 1438 | 16.44 | |||
Public shareholders, as your company is listed on the stock market | 601 | 6.87 | |||
Other | 312 | 3.57 | |||
Venture capital firms or business angels | 160 | 1.82 | |||
DK/NA | 42 | 0.48 | |||
5 | profitable | Binary | 0 | 8148 | 93.13 |
1 | 601 | 6.87 | |||
6 | MostImportantProblemFacing | Categorical | Finding customers | 1822 | 20.83 |
Competition | 1360 | 15.54 | |||
Availability of skilled staff or experienced managers | 1352 | 15.45 | |||
Access to finance | 1237 | 14.14 | |||
Costs of production of labour | 1039 | 11.88 | |||
Regulation and industrial regulations | 793 | 9.06 | |||
Other | 726 | 8.3 | |||
All problems are equally pressing | 300 | 3.43 | |||
DK/NA | 120 | 1.37 | |||
7 | ExternalFinancing_Factors#GeneralEconomicOutlookToObtainExternalFinancing | Categorical | Remained unchanged | 3801 | 43.45 |
Deteriorated | 2560 | 29.26 | |||
Improved | 1959 | 22.39 | |||
DK | 429 | 4.9 | |||
8 | ExternalFinancing_Factors#AccessToPublicFinancialSupportIncludingGuarantees | Categorical | Remained unchanged | 3243 | 37.07 |
Not applicable | 2993 | 34.21 | |||
Deteriorated | 1407 | 16.08 | |||
Improved | 610 | 6.97 | |||
DK | 496 | 5.67 | |||
9 | ExternalFinancing_Factors#YourFirmSpecificOutlook | Categorical | Remained unchanged | 3923 | 44.84 |
Improved | 3169 | 36.22 | |||
Deteriorated | 1263 | 14.44 | |||
DK | 394 | 4.5 | |||
10 | ExternalFinancing_Factors#YourFirmOwnCapital | Categorical | Remained unchanged | 4278 | 48.9 |
Improved | 3436 | 39.27 | |||
Deteriorated | 932 | 10.65 | |||
DK | 103 | 1.18 | |||
11 | ExternalFinancing_Factors#YourFirmCreditHistory | Categorical | Remained unchanged | 4860 | 55.55 |
Improved | 2796 | 31.96 | |||
Deteriorated | 713 | 8.15 | |||
DK | 372 | 4.25 | |||
Not applicable | 8 | 0.09 | |||
12 | ExternalFinancing_Factors#WillingnessOfBanksToProvideCredit | Categorical | Remained unchanged | 3415 | 39.03 |
Improved | 2150 | 24.58 | |||
Deteriorated | 1579 | 18.05 | |||
Not applicable | 1259 | 14.39 | |||
DK | 346 | 3.95 | |||
13 | ExternalFinancing_Factors#WillingnessOfBusinessPartnersToProvideTradeCredit | Categorical | Remained unchanged | 3611 | 41.27 |
Not applicable | 2615 | 29.89 | |||
Improved | 1178 | 13.46 | |||
Deteriorated | 942 | 10.77 | |||
DK | 403 | 4.61 | |||
14 | ExternalFinancing_Factors#WillingnessOfInvestorsToInvestInYourEnterprise | Categorical | Not applicable | 5782 | 66.09 |
Remained unchanged | 1802 | 20.6 | |||
Improved | 478 | 5.46 | |||
DK | 377 | 4.31 | |||
Deteriorated | 310 | 3.54 | |||
15 | FirmGrowth#InTermsOfEmploymentRegardingTheNumberOfFullTimeOrFullTimeEquivalentEmployees | Categorical | Less than 20% per year | 3620 | 41.37 |
No growth | 2201 | 25.16 | |||
Over 20% per year | 1412 | 16.14 | |||
Got smaller | 1400 | 16 | |||
DK/NA | 61 | 0.7 | |||
Not applicable, the firm is too recent | 55 | 0.63 | |||
16 | ConfidenceInNegotiations#WithBanks | Categorical | Yes | 6727 | 76.89 |
No | 1293 | 14.78 | |||
Not applicable | 618 | 7.06 | |||
DK | 111 | 1.27 | |||
17 | ConfidenceInNegotiations#WithEquityInvestorsOrVentureCapitalFirms | Categorical | Not applicable | 4254 | 48.62 |
Yes | 2442 | 27.91 | |||
No | 1758 | 20.09 | |||
DK | 295 | 3.38 | |||
18 | FinancingGrowth_Instruments#FinancingGrowth_Instruments | Categorical | Bank loan | 5627 | 64.32 |
Loan from other sources | 1156 | 13.21 | |||
Equity investment | 780 | 8.92 | |||
Other | 450 | 5.14 | |||
DK/NA | 439 | 5.02 | |||
Subordinated loans, participation loans or similar financing instruments | 297 | 3.39 | |||
19 | FinancingGrowth_Amount#FinancingGrowth_Amount | Categorical | Over €1 million | 1632 | 18.65 |
DK/NA | 1620 | 18.52 | |||
More than €25,000 and up to €100,000 | 1456 | 16.64 | |||
€100,000–€1 million | 1443 | 16.49 | |||
More than €250,000 and up to €1 million | 1163 | 13.29 | |||
More than €100,000 and up to €250,000 | 867 | 9.91 | |||
Up to €25,000 | 568 | 6.50 | |||
20 | FinancingGrowth_TheMostImportantLimitation | Categorical | There are no obstacles | 2646 | 30.23 |
Interest rates or price too high | 1474 | 16.85 | |||
Insufficient collateral or guarantee | 1321 | 15.1 | |||
Other | 824 | 9.42 | |||
Financing not available at all | 549 | 6.28 | |||
DK/NA | 1558 | 17.81 | |||
Reduced control over the firm | 305 | 3.49 | |||
too much paper work | 72 | 0.82 | |||
21 | ExternalFinancing_Expectations#InternalFounds | Categorical | Will remain unchanged | 4459 | 50.97 |
Will improve | 2405 | 27.49 | |||
Not applicable | 1170 | 13.37 | |||
Will deteriorate | 451 | 5.15 | |||
DK | 264 | 3.02 | |||
22 | ExternalFinancing_Expectations#BankLoans | Categorical | Will remain unchanged | 4498 | 51.41 |
Will improve | 1793 | 20.49 | |||
Not applicable | 1480 | 16.92 | |||
Will deteriorate | 796 | 9.1 | |||
DK | 182 | 2.08 | |||
23 | ExternalFinancing_Expectations#EquityInvestments | Categorical | Not applicable | 4292 | 49.05 |
Will remain unchanged | 3027 | 34.6 | |||
Will improve | 979 | 11.19 | |||
Will deteriorate | 180 | 2.06 | |||
DK | 271 | 3.1 | |||
24 | ExternalFinancing_Expectations#TradeCredit | Categorical | Will remain unchanged | 4157 | 47.51 |
Not applicable | 2670 | 30.52 | |||
Will improve | 1205 | 13.77 | |||
Will deteriorate | 523 | 5.98 | |||
DK | 194 | 2.22 | |||
25 | ExternalFinancing_Expectations#DebtSecuritiesIssued | Categorical | Not applicable | 5605 | 64.06 |
Will remain unchanged | 1959 | 22.39 | |||
DK | 839 | 9.59 | |||
Will improve | 205 | 2.34 | |||
Will deteriorate | 141 | 1.62 | |||
26 | ExternalFinancing_Expectations#Other | Categorical | Not applicable | 3396 | 38.82 |
Will remain unchanged | 3168 | 36.3 | |||
DK | 1167 | 13.33 | |||
Will improve | 787 | 9 | |||
Will deteriorate | 231 | 2.64 | |||
27 | ExternalFinancing_Expectations#CreditLineBankOverdraftOrCreditCardsOverdraft | Categorical | Will remain unchanged | 4739 | 54.17 |
Not applicable | 1700 | 19.43 | |||
Will improve | 1374 | 15.7 | |||
Will deteriorate | 570 | 6.52 | |||
DK | 366 | 4.18 | |||
28 | ImportanceOfFactorsInTheFutureFinancingOfTheFirm#GuaranteesForLoans | Ordinal | 8 | 1556 | 17.78 |
5 | 1467 | 16.77 | |||
1 | 1283 | 14.66 | |||
10 | 1232 | 14.08 | |||
7 | 1011 | 11.56 | |||
6 | 574 | 6.56 | |||
9 | 475 | 5.43 | |||
3 | 470 | 5.37 | |||
2 | 371 | 4.24 | |||
4 | 310 | 3.55 | |||
29 | ImportanceOfFactorsInTheFutureFinancingOfTheFirm#MeasuresToFacilitateEquityInvestments | Ordinal | 1 | 2662 | 30.42 |
5 | 1475 | 16.86 | |||
8 | 867 | 9.91 | |||
7 | 715 | 8.17 | |||
2 | 705 | 8.06 | |||
3 | 602 | 6.88 | |||
10 | 551 | 6.3 | |||
6 | 512 | 5.85 | |||
4 | 418 | 4.78 | |||
9 | 242 | 2.77 | |||
30 | ImportanceOfFactorsInTheFutureFinancingOfTheFirm#ExportCreditsOrGuarantees | Ordinal | 1 | 3216 | 36.76 |
5 | 1175 | 13.43 | |||
8 | 782 | 8.94 | |||
2 | 660 | 7.54 | |||
7 | 644 | 7.36 | |||
10 | 610 | 6.97 | |||
3 | 531 | 6.07 | |||
6 | 476 | 5.44 | |||
4 | 363 | 4.15 | |||
9 | 292 | 3.34 | |||
31 | ImportanceOfFactorsInTheFutureFinancingOfTheFirm#TaxIncentives | Ordinal | 10 | 1886 | 21.56 |
8 | 1491 | 17.04 | |||
5 | 1310 | 14.97 | |||
7 | 1023 | 11.69 | |||
1 | 816 | 9.33 | |||
9 | 632 | 7.22 | |||
6 | 615 | 7.03 | |||
3 | 380 | 4.34 | |||
4 | 340 | 3.89 | |||
2 | 256 | 2.93 | |||
32 | ImportanceOfFactorsInTheFutureFinancingOfTheFirm#BusinessSupportServices | Ordinal | 5 | 1640 | 18.74 |
8 | 1193 | 13.64 | |||
7 | 1088 | 12.44 | |||
1 | 983 | 11.24 | |||
6 | 856 | 9.78 | |||
10 | 842 | 9.62 | |||
3 | 649 | 7.42 | |||
4 | 570 | 6.52 | |||
2 | 493 | 5.63 | |||
9 | 435 | 4.97 | |||
33 | ImportanceOfFactorsInTheFutureFinancingOfTheFirm#MakingExistingPublicMeasuresEasierToObtain | Ordinal | 10 | 1670 | 19.09 |
8 | 1575 | 18 | |||
5 | 1319 | 15.08 | |||
7 | 1148 | 13.12 | |||
6 | 772 | 8.82 | |||
1 | 675 | 7.72 | |||
9 | 636 | 7.27 | |||
3 | 349 | 3.99 | |||
4 | 338 | 3.86 | |||
2 | 267 | 3.05 | |||
34 | FirmIncomeGenerationIndicators#LabourCost | Categorical | Increased | 5006 | 57.21 |
Remained unchanged | 3041 | 34.76 | |||
Decreased | 677 | 7.74 | |||
DK/NA | 25 | 0.29 | |||
35 | FirmIncomeGenerationIndicators#OtherCost | Categorical | Increased | 5535 | 63.26 |
Remained unchanged | 2594 | 29.65 | |||
Decreased | 585 | 6.69 | |||
DK/NA | 35 | 0.4 | |||
36 | FirmIncomeGenerationIndicators#InterestExpenses | Categorical | Remained unchanged | 4177 | 47.74 |
Increased | 2559 | 29.25 | |||
Decreased | 1358 | 15.52 | |||
DK/NA | 608 | 6.95 | |||
Not applicable, the firm has no debt | 47 | 0.54 | |||
37 | FirmIncomeGenerationIndicators#Profit | Categorical | Increased | 3688 | 42.15 |
Decreased | 2513 | 28.72 | |||
Remained unchanged | 2411 | 27.56 | |||
DK/NA | 137 | 1.57 | |||
38 | FirmIncomeGenerationIndicators#ProfitMargin | Categorical | Remained unchanged | 2945 | 33.66 |
Decreased | 2526 | 28.87 | |||
Increased | 1957 | 22.37 | |||
DK/NA | 1321 | 15.1 | |||
39 | FirmIncomeGenerationIndicators#DebtComparedToAssets | Categorical | Remained unchanged | 3471 | 39.67 |
Decreased | 2754 | 31.48 | |||
Increased | 1704 | 19.48 | |||
Not applicable, the firm has no debt | 743 | 8.49 | |||
DK | 77 | 0.88 | |||
40 | FinancingApplied#ApplicationOfExternalFinancing_BankLoans | Categorical | Did not apply because of sufficient internal funds | 4234 | 48.39 |
Applied | 2350 | 26.86 | |||
Did not apply for other reasons | 1630 | 18.63 | |||
Did not apply because of possible rejection | 334 | 3.82 | |||
DK/NA | 201 | 2.3 | |||
41 | FinancingApplied#ApplicationOfExternalFinancing_TradeCredit | Categorical | Did not apply because of sufficient internal funds | 3910 | 44.69 |
Did not apply for other reasons | 2365 | 27.04 | |||
Applied | 1939 | 22.16 | |||
DK/NA | 343 | 3.92 | |||
Did not apply because of possible rejection | 192 | 2.19 | |||
42 | FinancingApplied#ApplicationOfExternalFinancing_OtherExternalFinancing | Categorical | Did not apply because of sufficient internal funds | 4272 | 48.83 |
Did not apply for other reasons | 2527 | 28.88 | |||
Applied | 1373 | 15.69 | |||
DK/NA | 370 | 4.23 | |||
Did not apply because of possible rejection | 207 | 2.37 | |||
43 | FinancingApplied#ApplicationOfExternalFinancing_CreditLine | Categorical | Did not apply because of sufficient internal funds | 4259 | 48.68 |
Applied | 2061 | 23.56 | |||
Did not apply for other reasons | 1689 | 19.31 | |||
Did not apply because of possible rejection | 318 | 3.63 | |||
DK/NA | 422 | 4.82 | |||
44 | vulnerable | Binary | 0 | 8430 | 96.35 |
1 | 319 | 3.65 |
Model | Balanced Accuracy | ROC AUC |
---|---|---|
Gradient Boosting | 0.754 | 0.815 |
Logistic Regression | 0.750 | 0.811 |
Easy Ensemble Classifier | 0.746 | 0.804 |
Balanced Random Forest Classifier | 0.738 | 0.810 |
Random Forest | 0.733 | 0.807 |
SVC | 0.732 | 0.793 |
Balanced SVC | 0.729 | 0.800 |
AdaBoost | 0.718 | 0.794 |
MultinomialNB | 0.711 | 0.790 |
Balanced MultinomialNB | 0.709 | 0.789 |
Balanced KNeighbors | 0.678 | 0.743 |
KNeighbors | 0.552 | 0.640 |
Model | Precision Class 0 | Recall Class 0 | F1 Class 0 | Precision Class 1 | Recall Class 1 | F1 Class 1 |
---|---|---|---|---|---|---|
Gradient Boosting | 0.931 | 0.794 | 0.857 | 0.417 | 0.714 | 0.526 |
Logistic Regression | 0.931 | 0.779 | 0.848 | 0.402 | 0.722 | 0.517 |
Easy Ensemble Classifier | 0.930 | 0.775 | 0.845 | 0.396 | 0.717 | 0.510 |
Balanced Random Forest Classifier | 0.928 | 0.764 | 0.838 | 0.383 | 0.711 | 0.498 |
Random Forest | 0.917 | 0.830 | 0.872 | 0.436 | 0.636 | 0.517 |
SVC | 0.920 | 0.803 | 0.858 | 0.409 | 0.660 | 0.505 |
Balanced SVC | 0.925 | 0.754 | 0.831 | 0.371 | 0.703 | 0.486 |
AdaBoost | 0.918 | 0.768 | 0.836 | 0.373 | 0.668 | 0.478 |
MultinomialNB | 0.918 | 0.745 | 0.823 | 0.354 | 0.676 | 0.465 |
Balanced MultinomialNB | 0.917 | 0.746 | 0.823 | 0.353 | 0.671 | 0.463 |
Balanced KNeighbors | 0.900 | 0.770 | 0.830 | 0.344 | 0.586 | 0.433 |
KNeighbors | 0.845 | 0.962 | 0.899 | 0.434 | 0.142 | 0.214 |
References
- World Bank. Small and Medium Enterprises (SMEs) Finance—Improving SMEs’ Access to Finance and Finding Innovative Solutions to Unlock Sources of Capital. 2023. Available online: https://www.worldbank.org (accessed on 3 September 2025).
- Mason, C.M.; Harrison, R.T. “Investment readiness”: A critique of government proposals to increase the demand for venture capital. Reg. Stud. 2001, 35, 663–668. [Google Scholar] [CrossRef]
- Douglas, E.J.; Shepherd, D.A. Exploring investor readiness: Assessments by entrepreneurs and investors in Australia. Ventur. Cap. 2002, 4, 219–236. [Google Scholar] [CrossRef]
- Fellnhofer, K. Literature review: Investment readiness level of small and medium-sized companies. Int. J. Manag. Financ. Account. 2015, 7, 268–284. [Google Scholar] [CrossRef]
- Mullainathan, S.; Spiess, J. Machine learning: An applied econometric approach. J. Econ. Perspect. 2017, 31, 87–106. [Google Scholar] [CrossRef]
- Dumitrescu, E.; Hué, S.; Hurlin, C.; Tokpavi, S. Machine learning or econometrics for credit scoring: Let’s get the best of both worlds. HAL Open Sci. 2021. Available online: https://hal.archives-ouvertes.fr/hal-02507499 (accessed on 3 September 2025).
- Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Levy, J.; Mussack, D.; Brunner, M.; Keller, U.; Cardoso-Leite, P.; Fischbach, A. Contrasting classical and machine learning approaches in the estimation of value-added scores in large-scale educational data. Front. Psychol. 2020, 11, 2190. [Google Scholar]
- Agresti, A. Foundations of Linear and Generalized Linear Models, 1st ed.; Wiley: Hoboken, NJ, USA, 2015; pp. 1–480. [Google Scholar]
- Hosmer, D.W.; Lemeshow, S.; Sturdivant, R.X. Applied Logistic Regression, 3rd ed.; Wiley: Hoboken, NJ, USA, 2013; pp. 1–528. [Google Scholar]
- Churpek, M.M.; Yuen, T.C.; Winslow, C.; Meltzer, D.O.; Kattan, M.W.; Edelson, D.P. Multicenter Comparison of Machine Learning Methods and Conventional Regression for Predicting Clinical Deterioration on the Wards. Crit. Care Med. 2016, 44, 368–374. [Google Scholar] [CrossRef]
- Hornung, O.; Smolnik, S. AI invading the workplace: Negative emotions towards the organizational use of personal virtual assistants. Electron. Markets 2022, 32, 123–138. [Google Scholar] [CrossRef]
- Groves, R.M.; Fowler, F.J., Jr.; Couper, M.P.; Lepkowski, J.M.; Singer, E.; Tourangeau, R. Survey Methodology, 2nd ed.; Wiley: Hoboken, NJ, USA, 2009. [Google Scholar]
- European Central Bank Survey on the Access to Finance of Enterprises (SAFE). 2023. Available online: https://www.ecb.europa.eu/stats/ecb_surveys/safe/html/data.en.html (accessed on 3 September 2025).
- Cusolito, A.P.; Dautovic, E.; McKenzie, D. Can government intervention make firms more investment-ready? A randomized experiment in the Western Balkans. Rev. Econ. Stat. 2021, 103, 428–442. [Google Scholar] [CrossRef]
- Owen, R.; Botelho, T.; Hussain, J.; Anwar, O. Solving the SME finance puzzle: An examination of demand and supply failure in the UK. Ventur. Cap. 2023, 25, 31–63. [Google Scholar] [CrossRef]
- Du, J.; Rada, R. Machine learning and financial investing. In Handbook of Research on Machine Learning Applications and Trends: Algorithms, Methods, and Techniques; IGI Global: Hershey, PA, USA, 2010; pp. 375–398. [Google Scholar] [CrossRef][Green Version]
- Khan, A.H.; Shah, A.; Ali, A.; Shahid, R.; Zahid, Z.U.; Sharif, M.U.; Jan, T.; Zafar, M.H. A performance comparison of machine learning models for stock market prediction with novel investment strategy. PLoS ONE 2023, 18, e0286362. [Google Scholar] [CrossRef] [PubMed]
- Alexakis, C.; Gogas, P.; Petrella, G.; Polemis, M.; Salvadè, F. Investigating the Investment Readiness of European SMEs: A Machine Learning Approach. 2025. Available online: https://ssrn.com/abstract=5007133 (accessed on 3 September 2025).
- Zana, D.; Barnard, B. Venture capital and entrepreneurship: The cost and resolution of investment readiness. SSRN Electron. J. 2019. [Google Scholar] [CrossRef]
- Wooldridge, J.M. Introductory Econometrics: A Modern Approach, 5th ed.; South-Western Cengage Learning: Mason, OH, USA, 2013. [Google Scholar]
- Norvig, R. Artificial Intelligence. A Modern Approach, 4th ed.; Pearson Series; Pearson Education, Inc.: Upper Saddle River, NJ, USA, 2020. [Google Scholar]
- Cox, D.R. The regression analysis of binary sequences. J. R. Stat. Soc. Ser. B (Methodol.) 1958, 20, 215–232. [Google Scholar] [CrossRef]
- Cover, T.M.; Hart, P.E. Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 1967, 13, 21–27. [Google Scholar] [CrossRef]
- Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar]
- Schütze, H.; Manning, C.D.; Raghavan, P. Introduction to Information Retrieval; Cambridge University Press: Cambridge, UK, 2008; pp. 234–265. [Google Scholar]
- Freund, Y.; Schapire, R.E. A decision-theoretic generalization of on-line learning and an application to boosting. Eur. Conf. Comput. Learn. Theory 1997, 55, 119–139. [Google Scholar]
- Liu, X.; Wu, J.; Zhou, Z. Exploratory Undersampling for Class-Imbalance Learning. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 2009, 39, 539–550. [Google Scholar]
- Breiman, L. Bagging predictors. Mach. Learn. 1996, 24, 123–140. [Google Scholar]
- Gogas, P.; Papadimitriou, T.; Sofianos, E. Money Neutrality, Monetary Aggregates and Machine Learning. Algorithms 2019, 12, 137. [Google Scholar] [CrossRef]
- Brodersen, K.H.; Ong, C.S.; Stephan, K.E.; Buhmann, J.M. The balanced accuracy and its posterior distribution. In Proceedings of the 20th International Conference on Pattern Recognition 2010, Istanbul, Turkey, 23–26 August 2010; pp. 3121–3124. [Google Scholar]
- Hair, J.; Anderson, R.; Tatham, R.; Black, W. Multivariate Data Analysis, 5th ed.; Prentice Hall: Upper Saddle River, NJ, USA, 1998. [Google Scholar]
- López, V.; Fernández, A.; García, S.; Palade, V.; Herrera, F. An insight into classification with imbalanced data: Empirical results and current trends on using data intrinsic characteristics. Inf. Sci. 2013, 250, 113–141. [Google Scholar] [CrossRef]
- Pes, B.; Lai, G. Cost-sensitive learning strategies for high-dimensional and imbalanced data: A comparative study. PeerJ Comput. Sci. 2021, 7, e832. [Google Scholar] [CrossRef] [PubMed]
- Araf, I.; Idri, A.; Chairi, I. Cost-sensitive learning for imbalanced medical data: A review. Artif. Intell. Rev. 2024, 57, 80. [Google Scholar] [CrossRef]
- Peykani, P.; Peymany Foroushany, M.; Tanasescu, C.; Sargolzaei, M.; Kamyabfar, H. Evaluation of Cost-Sensitive Learning Models in Forecasting Business Failure of Capital Market Firms. Mathematics 2025, 13, 368. [Google Scholar] [CrossRef]
- Mason, C.M.; Kwok, J. Investment Readiness Programmes and Access to Finance: A Critical Review of Design Issues; Working Paper 10-03; Hunter Centre for Entrepreneurship, University of Strathclyde: Scotland, UK, 2010. [Google Scholar]
- Brush, C.G.; Edelman, L.F.; Manolova, T.S. Ready for funding? Entrepreneurial ventures and the pursuit of angel financing. Ventur. Cap. 2012, 14, 111–129. [Google Scholar] [CrossRef]
- Landström, H. The ivory tower of business angel research. Ventur. Cap. 2019, 21, 97–119. [Google Scholar] [CrossRef]
Model | FP | FN | TMC |
---|---|---|---|
Gradient Boosting | 438 | 94 | 908 |
Logistic Regression | 379 | 107 | 914 |
Easy Ensemble Classifier | 409 | 103 | 924 |
Random Forest | 441 | 99 | 936 |
AdaBoost | 386 | 111 | 941 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Gogas, P.; Papadimitriou, T.; Goumenidis, P.; Kontos, A.; Giannakis, N. Identification of Investment-Ready SMEs: A Machine Learning Framework to Enhance Equity Access and Economic Growth. Forecasting 2025, 7, 51. https://doi.org/10.3390/forecast7030051
Gogas P, Papadimitriou T, Goumenidis P, Kontos A, Giannakis N. Identification of Investment-Ready SMEs: A Machine Learning Framework to Enhance Equity Access and Economic Growth. Forecasting. 2025; 7(3):51. https://doi.org/10.3390/forecast7030051
Chicago/Turabian StyleGogas, Periklis, Theophilos Papadimitriou, Panagiotis Goumenidis, Andreas Kontos, and Nikolaos Giannakis. 2025. "Identification of Investment-Ready SMEs: A Machine Learning Framework to Enhance Equity Access and Economic Growth" Forecasting 7, no. 3: 51. https://doi.org/10.3390/forecast7030051
APA StyleGogas, P., Papadimitriou, T., Goumenidis, P., Kontos, A., & Giannakis, N. (2025). Identification of Investment-Ready SMEs: A Machine Learning Framework to Enhance Equity Access and Economic Growth. Forecasting, 7(3), 51. https://doi.org/10.3390/forecast7030051