Article

Hybrid Bio-Optimized Algorithms for Hyperparameter Tuning in Machine Learning Models: A Software Defect Prediction Case Study

Department of Information Technology, National Institute of Technology Karnataka, Surathkal, Mangalore 575025, India
* Author to whom correspondence should be addressed.
Mathematics 2024, 12(16), 2521; https://doi.org/10.3390/math12162521
Submission received: 10 June 2024 / Revised: 27 July 2024 / Accepted: 2 August 2024 / Published: 15 August 2024
(This article belongs to the Section Mathematics and Computer Science)

Abstract

Addressing real-time optimization problems becomes increasingly challenging as their complexity escalates over time. Bio-optimization algorithms (BoAs) are well suited to such problems because of their global search capability, adaptability, versatility, parallelism, and robustness. This article performs hyperparameter tuning of machine learning (ML) models by integrating them with BoAs. Aiming to maximize the accuracy of the hybrid bio-optimized defect prediction (HBoDP) model, this paper develops four novel hybrid BoAs: the gravitational force Lévy flight grasshopper optimization algorithm (GFLFGOA), the gravitational force Lévy flight grasshopper optimization algorithm–sparrow search algorithm (GFLFGOA-SSA), the gravitational force grasshopper optimization algorithm–sparrow search algorithm (GFGOA-SSA), and the Lévy flight grasshopper optimization algorithm–sparrow search algorithm (LFGOA-SSA). These algorithms integrate the good exploration capacity of the SSA with the faster convergence of the LFGOA and the GFGOA. Their performance is verified through two experiments. The first assesses the mean, standard deviation (SD), and convergence rate on nine benchmark functions (BFs). The second boosts the accuracy of the HBoDP model by fine-tuning the hyperparameters of the artificial neural network (ANN) and XGBOOST (XGB) models. To justify the effectiveness of these novel hybrids, they are compared with four base algorithms, namely the grasshopper optimization algorithm (GOA), the sparrow search algorithm (SSA), the gravitational force grasshopper optimization algorithm (GFGOA), and the Lévy flight grasshopper optimization algorithm (LFGOA).
Our findings demonstrate the effectiveness of this hybrid approach in enhancing the convergence rate and accuracy: the experimental results show faster convergence on the BFs and improved software defect prediction accuracy on the NASA defect datasets compared with several baseline methods.

1. Introduction

In the real world, solving constrained and unconstrained optimization problems is practically very challenging due to their increased complexity. Optimization problems arise in various fields, including mathematics, engineering, economics, computer science, etc.
In recent years, metaheuristic approaches have gained popularity due to their local and global convergence properties and have been applied to many optimization problems. Based on [1], metaheuristic algorithms are categorized into single-based and population-based methods. Population-based algorithms consist of evolution-inspired, swarm-inspired, physics-based, human-based, bio-based, and math-based algorithms. The popularity of the genetic algorithm (GA) [2], the evolutionary algorithm (EA) [3], and the particle swarm optimization (PSO) [4] algorithm has encouraged researchers to propose more bio-optimization algorithms (BoAs), such as the grasshopper optimization algorithm (GOA) [5], the Wolf Pack Algorithm (WPA) [6], Artificial Bee Colony (ABC) [7], the Harris Hawks Optimizer (HHO) [8], etc. BoAs follow a two-step process consisting of the exploration and exploitation stages. Due to their high convergence speed, few parameters, and simple implementation, BoAs are widely used in parameter optimization, NP-hard problems [9], fault prediction [10], bioinformatics [11], image processing [12], and many other real-time engineering problems.
In this paper, we consider the widely used NASA defect dataset [13] to predict software defects as a case study, with accuracy as one of the performance metrics. Improving this accuracy is itself an optimization problem: by combining BoAs with machine learning (ML) and deep learning (DL) approaches to tune the parameters of the respective models, the accuracy can be enhanced.
Despite the development of numerous BoAs, each algorithm has its own limitations. According to the no free lunch (NFL) theorem [14], no single BoA performs best on all optimization problems. As a result, researchers are interested in exploring and proposing effective optimization methods for a broader range of problems. As noted in [15], designing hybrid algorithms that integrate two or more bio-optimization techniques is a promising direction, since such hybrids aim to amalgamate the strengths of all constituent algorithms.

1.1. Motivation

The need for more advanced optimization approaches increases with the complexity of the optimization problems. Our research in this field is motivated by the following factors:
  • Based on the aforementioned explanation, hybrid approaches are becoming more and more popular to enhance the convergence performance of individual algorithms.
  • The effectiveness and scalability of hybrid BoAs across diverse scientific domains inspired us to propose novel and competitive hybrid methodologies.
  • Hyperparameter tuning can be regarded as an optimization problem, as this process involves searching through a range of objective functions to maximize accuracy.
  • The use of hybrid BoAs for solving the hyperparameter tuning problem is motivated by their ability to handle efficient exploration–exploitation trade-offs and scalability.
Motivated by the preceding arguments, our study introduces an improved version of the GOA. This enhanced version incorporates the principles of gravitational force (GF) and Lévy flight (LF). Additionally, we developed a hybrid novel algorithm by merging the potent exploration feature of the SSA with the enhanced version of the GOA.

1.2. Contribution

The objective of this research work is to propose four novel hybrid BoAs for improving the accuracy of the software defect prediction (SDP) model. These hybrids are built on a probabilistic selection mechanism that carefully balances strong exploration and exploitation tendencies. To evaluate their effectiveness and performance, two types of experiments were carried out. Initially, experiments were conducted on benchmark functions (BFs) to assess the mean, standard deviation (SD), and convergence rates. Subsequently, a second experiment focused on boosting the accuracy of SDP through fine-tuning of the hyperparameters in both the artificial neural network (ANN) and XGBOOST (XGB) models. The key contributions of the proposed work are as follows:
  • Design of an enhanced GOA, named the gravitational force Lévy flight grasshopper optimization algorithm (GFLFGOA), by introducing the LF and GF concepts to balance exploration and exploitation.
  • Design of a novel hybrid gravitational force Lévy flight grasshopper optimization algorithm–sparrow search algorithm (GFLFGOA-SSA) that includes GFLFGOA and the sparrow search algorithm (SSA) concept to accelerate convergence rate.
  • Design of a hybrid gravitational force grasshopper optimization algorithm–sparrow search algorithm (GFGOA-SSA) by embedding the concepts of the gravitational force grasshopper optimization algorithm (GFGOA) and the SSA.
  • Design of a hybrid Lévy flight grasshopper optimization algorithm–sparrow search algorithm (LFGOA-SSA).
  • Extensive experiments conducted on BFs and hyperparameter tuning of XGB and ANN models to prove the above-proposed algorithm’s superiority to the state-of-the-art techniques.
The subsequent sections of this paper are organized as follows. Section 2 delves into a literature survey, while Section 3 provides a concise overview of the background concepts utilized in the proposed method. Section 4 outlines the methodology employed for carrying out two extensive experiments. Following this, Section 5 presents the results obtained and subsequent analysis. Lastly, Section 6 concludes the paper by summarizing observations and suggesting potential directions for future research.

2. Literature Survey

This section presents some pertinent works that adopt various bio-optimized approaches with ML and DL models for solving various types of optimization problems. We categorized this section into two subsections, namely
  • bio-optimized approach to optimization problems;
  • bio-optimized approach to hyperparameter tuning problems.

2.1. Bio-Optimized Approach to Optimization Problems

Optimization approaches based on bio-optimized algorithms have recently proven effective in a variety of fields. Variants of the PSO [4], GA [16], and ABC [17] algorithms have been successfully used in a variety of sectors for constrained optimization.
Zhen et al. in [18] proposed a hybrid WPA-PSO to estimate and predict software reliability parameters. The proposed algorithm was verified on five industrial datasets; through simulation, the authors demonstrated higher accuracy and better optimization. Zulfiqar et al. in [19] proposed a hybrid model by integrating multivariate empirical model decomposition (MEMD) with the adaptive differential evolution (ADE) algorithm to optimize the hyperparameters of a support vector machine (SVM) model for electricity load forecasting. The proposed MEMD-ADE-SVM model achieved good accuracy by tuning the parameters of the SVM. Blume et al. in [20] implemented a GA for hyperparameter optimization of an ANN for designing software sensors. Akter et al. in [21] developed a new crossover technique to improve the GA and applied it to the traveling salesman problem.
Sajjad et al. [11] implemented a hyperparameter tuning approach using grey wolf optimization (GWO) on machine learning (ML) algorithms and deep neural networks. The proposed method was employed on eleven datasets of different kinds, including biological, biomedical, clinical diagnosis, etc. Binghui et al. [22] proposed a metamodel-based methodology for hyperparameter optimization of optimization algorithms in building energy optimization (BEO). The authors validated their methodology by considering 15 benchmark BEO problems with various properties. Meetesh et al. [10] investigated the impact of hyperparameter optimization on defect count prediction. In their paper, the validation of the approach was performed with 15 software defect datasets. Through their research work, the authors emphasized the importance of the exploration of parameter space.
These days, several fields have found bio-optimized strategies to be effective. Many studies have been carried out on swarm intelligence based on the behavior of sparrows, wolves, bees, ants, etc., and new algorithms and hybrid algorithms have been proposed based on these behaviors. Xue et al. in [23] introduced a new optimization technique based on sparrow behavior, which improves convergence speed and search precision. In recent advancements of the SSA, researchers have introduced variants with an adaptive learning factor [24], chaotic mapping [25], robot path planning [26], and an adaptive version with adaptive weights [27].
Zhao et al. [28] introduced an optimization technique based on the behavior of wolves. It relies on a greedy strategy, which can trap it in local optima because of excessive greed; it converges quickly in the initial stages, but the convergence speed slows down later. Many improvements have been applied to this algorithm. Regarding the greedy problem, Li et al. [29] showed that incorporating a chaotic approach into WPA's search process can successfully prevent it from entering a local optimum. Additionally, the work by Xiu et al. [30] describes how to improve the slow convergence speed at later stages using Lévy behavior. Chen et al. [31] introduced opposition-based genetic learning to tackle the lack of influence of lead wolves, which reduces the stability of the algorithm.
Jadon et al. [32] proposed a hybrid ABC algorithm with differential evolution (DE) that was tested on a welded beam design problem. The authors modified the position update equation for the employee bee phase and then applied DE to update the position in the onlooker bee phase. A hybrid PSOGA was proposed by Mirjalili et al. [33] for binary optimization by combining the social component of PSO with GA to accelerate convergence. To solve high-dimensional complex problems, quantum particle swarm optimization (QPSO) was introduced by Li et al. [34]. A new hybrid SSA-PSO was developed by Yang et al. [35], which showed great improvements in convergence speed and stability.
As our research work focuses on the hybridization of the GOA with other BoAs, the survey emphasizes various variants and applications of GOA. Saremi et al. [36] proposed a novel BoA named GOA using swarm intelligence to solve various optimization problems. As this algorithm is proven to be efficient enough in solving optimization problems, it has gained a lot of interest among researchers. The authors in [37,38,39] performed a comprehensive analysis of GOA on various real-time problems.
Meraihi et al. in [5] conducted a comprehensive review of the hybridized version of GOA with other BoAs to address real-world problems in order to fully extend GOA’s performance. Arora et al. in [40] incorporated chaos theory into GOA to boost global convergence and justified it by testing it using thirteen benchmark functions (BFs). In [41], the authors proposed improved GOA (IGOA) by embedding trigonometric substitution into the original GOA to boost Cauchy mutation. The performance of the proposed IGOA was validated on the IEEE CEC2017 BFs compared with other BoAs.
The authors in [42] enhanced the exploration phase of the GOA by embedding the crossover operator and salp swarm algorithm into it. The correctness of the proposed method was validated through feature selection datasets and six real-time engineering problems. Yildiz et al. in [43] proposed an improved version of GOA by adding an elite opposite-based learning method into it, called the elite opposite-based learning grasshopper optimization method (EOBL-GOA). The EOBL-GOA was validated through various engineering design problems. Feng et al. [44] developed dynamic opposite learning-assisted GOA (DOLGOA) and validated the correctness of DOLGOA through CEC2014 BFs and the flexible job scheduling problem (FJSP). Peng et al. in [45] proposed an improved grasshopper optimization algorithm (IGOA) by taking the gravity force concept into GOA to optimize the parameters of the backpropagation neural network (BPNN).
The aforementioned algorithms offer the advantage of striking a balance between exploration and exploitation, but their performance is somewhat constrained. Given our focus on hyperparameter tuning of ML models using software defect datasets, we next survey bio-optimized approaches to the hyperparameter tuning problem.

2.2. Bio-Optimized Approach for Hyperparameter Tuning Problems

ML and DL models are widely used for solving regression and classification problems in various real-time applications. The authors in [46] used several ML models, including the naive Bayes (NB), support vector regression (SVR), decision tree (DT), and random forest (RF) algorithms, to evaluate performance metrics for the software reliability assessment problem, cross-validating the models on the DBS-1 and DBS-2 datasets. Besides ML, the authors also implemented a learning approach that incorporates a recurrent neural network (RNN) for comparison on the same datasets. The limited datasets considered and the absence of any explanation of further optimization are the limitations of this work.
In [47], the authors implemented various DL techniques to predict software fault considering Chidamber and Kemerer (CK) metrics-based datasets. The above methods were verified and validated by comparing them with other existing methods. The authors in [48] implemented the feature reduction concept for software defect prediction (SDP). To carry out the proposed method, they used four NASA defect datasets.
An attention-based recurrent neural network (DPARNN) framework is proposed in [49] for SDP. Seven open-source java projects in apache were considered for validating the above-proposed method. The authors in [50] conducted a systematic literature review on fault prediction using various ML, DL, and data mining techniques. Besides the techniques, they also reviewed defect datasets and performance metrics.
The authors in [51] proposed an ANN model with one input layer (eight dimensions), two hidden layers, and one output layer with a sigmoid activation function to predict the software defect of dataset JM1. A comparative analysis was performed by Jindal et al. [52] with different DL models such as gated recurrent network (GRU), long short-term memory (LSTM), and RNN with ANN as a base model, and they found that LSTM performs better than others. Alghanim et al. [53] proposed an enhanced deep neural network (NN) model based on GRNN and tested it with repeated 10-fold cross-validation.
Clemente et al. [54] proved that within ANN, SVM, DT, and RF, RF performed better in the case of PC1, and ANN performed better in the case of KC2. Wongpheng et al. [55] proved that when a model was trained for 100 and 1000 epochs, it delivered high accuracy for a greater number of epochs, but extra experiments were still required for deciding the optimal learning rate and other model parameters.
In [56], the authors examined five public NASA defect datasets from the PROMISE data repository, named CM1, KC1, KC2, JM1, and PC1, for defect prediction using ten ML classifiers but found no consistently accurate results. DL techniques were also explored, highlighting the complexity of the problem. By selecting the most informative features, the dimensionality of the data is reduced, improving the efficiency of the analysis and enhancing the quality of predictions. Feature selection also helps in understanding the underlying factors that contribute to software failures, allowing for targeted improvements and resource allocation.
Alsaeedi et al.’s work in [57] employed three classifiers and ensemble methods for defect anticipation on NASA datasets. They found that RF and Ada-Boost with RF outperformed other approaches in addressing software flaws. In [58], an ensemble approach was used with six algorithms on NASA datasets, demonstrating the effectiveness of RF as the best ensemble algorithm for defect prediction.
Iqbal et al.’s study [59] focuses on utilizing twelve NASA defect datasets and employs a range of classification algorithms, such as NB, radial basis function, multi-layer perceptron, and K-nearest neighbor (K-NN). The objective was to forecast software errors and enhance the reliability of software systems through ML-based classifiers and statistical methods.
R. Malhotra’s work in [60] conducted a comprehensive assessment of software bug prediction methods, evaluating ML techniques, comparing them with statistical approaches, and summarizing their strengths and weaknesses. Parashar et al. in [61] proposed a multicore parallel ML approach to classification problems for SDP. The proposed model was trained and tested on eleven software systems of NASA and other relevant repositories.
To the best of our knowledge, the literature surveyed above lacks an explanation of how to further improve the performance metrics, even though various ML and DL techniques have been implemented for SDP. As this research work focuses on optimizing ML models to obtain accurate predictions, we concentrate on tuning the models' parameters for SDP using hybrid BoAs.

3. Concepts

This section briefly describes the base BoAs that are used to propose the four novel hybrid BoAs. The performance of the proposed algorithms is verified through BFs and through hyperparameter tuning of ML models for the SDP classification problem. The ML models considered for this work are XGB and ANN. For the SDP classification problem, the NASA defect dataset (https://github.com/klainfo/NASADefectDataset, accessed on 5 January 2024) is considered.

3.1. Grasshopper Optimization Algorithm (GOA)

Saremi et al. [36] proposed GOA for solving optimization problems [62] by mimicking the behavior of grasshopper swarms in nature. The mathematical model of the original GOA is as follows:
$Z_i = SI_i + GF_i + AW_i$  (1)
where:
  • $Z_i$: position of the i-th grasshopper;
  • $SI_i$: social interaction;
  • $GF_i$: gravitational force;
  • $AW_i$: wind advection.
Social interaction in Equation (1) is an important factor for the evaluation of the position of the grasshopper and is evaluated as follows:
$SI_i = \sum_{j=1, j \neq i}^{n} s(d_{ij})\, \hat{d}_{ij}$  (2)
where:
  • n: number of grasshoppers;
  • $d_{ij}$: distance from the i-th grasshopper to the j-th grasshopper;
  • $\hat{d}_{ij}$: unit vector from the i-th grasshopper to the j-th grasshopper.
The evaluations of $d_{ij}$ and $\hat{d}_{ij}$ are presented in Equations (3) and (4), respectively.
$d_{ij} = |Z_j - Z_i|$  (3)
$\hat{d}_{ij} = \frac{Z_j - Z_i}{d_{ij}}$  (4)
where:
  • $Z_j$: position of the j-th grasshopper;
  • $Z_i$: position of the i-th grasshopper.
The social force strength (s) is defined in Equation (5).
$s(r) = f e^{-r/l} - e^{-r}$  (5)
where:
  • f: intensity of attraction;
  • l: attractive length scale.
As mentioned in [36], the parameter values are l = 1.5 and f = 0.5. The social force s cannot be applied when the distance between grasshoppers is large, as s tends to 0. To avoid such situations, d is mapped into the interval [1, 4]. The gravitational force $GF_i$ is presented as follows:
$GF_i = -g\, \hat{e}_g$  (6)
where:
  • g: gravitational constant;
  • $\hat{e}_g$: unit vector towards the center of the earth.
The wind advection A W i is presented as follows:
$AW_i = u\, \hat{w}_i$  (7)
where:
  • u: constant drift;
  • $\hat{w}_i$: unit vector in the wind direction.
Substituting the values of S I i , G F i , and A W i in Equation (1), Equation (1) becomes
$Z_i = \sum_{j=1, j \neq i}^{n} s(|Z_j - Z_i|)\, \frac{Z_j - Z_i}{d_{ij}} - g\, \hat{e}_g + u\, \hat{w}_i$  (8)
The optimization model cannot be solved efficiently using Equation (8), as a grasshopper may settle in its comfort zone, preventing the swarm from converging efficiently. So, for efficient convergence, some special parameters are added, and Equation (8) is modified as follows:
$Z_i^d = c \left( \sum_{j=1, j \neq i}^{n} c\, \frac{ub_d - lb_d}{2}\, s(|Z_j^d - Z_i^d|)\, \frac{Z_j - Z_i}{d_{ij}} \right) + \hat{T}_d$  (9)
where:
  • $ub_d$ and $lb_d$: upper and lower bounds of the d-th dimension, respectively;
  • $\hat{T}_d$: target or optimal position in the d-th dimension (best solution found so far);
  • c: a decreasing coefficient that shrinks the comfort, repulsion, and attraction areas.
The parameter c is updated to reduce exploration and increase exploitation in accordance with the number of iterations, as indicated in Equation (10).
$c = c_{max} - r\, \frac{c_{max} - c_{min}}{G}$  (10)
where:
  • $c_{max}$: maximum value;
  • $c_{min}$: minimum value;
  • r: current iteration;
  • G: maximum number of iterations.
The values adopted in this work are $c_{min} = 0.00001$ and $c_{max} = 1$.
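The linearly decreasing coefficient in Equation (10) is simple to implement; the sketch below uses the paper's values $c_{max} = 1$ and $c_{min} = 0.00001$ (the function name is ours):

```python
def comfort_coefficient(r, G, c_max=1.0, c_min=0.00001):
    """Equation (10): linearly shrink c from c_max to c_min over G iterations,
    shifting the swarm from exploration towards exploitation."""
    return c_max - r * (c_max - c_min) / G
```

For example, `comfort_coefficient(0, 100)` returns 1.0 at the first iteration, and the value decays towards 0.00001 by iteration 100.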

3.2. Lévy Flight GOA (LFGOA)

Paul Lévy introduced the Lévy Flight (LF) concept, which is currently presented as Lévy statistics [63]. Generally, Lévy’s flight step can be represented by Equation (11).
$Levy\_s \sim s^{-1-\alpha}$  (11)
where:
  • s: random step length of the Lévy flight;
  • $\alpha$: power-law index in [0, 2].
Based on [64], the authors considered Mantegna's algorithm to generate $Levy\_s$ for a stable distribution of the Lévy flight. For random walks, the step length $Levy\_s$ is therefore determined using Mantegna's algorithm, as defined by Equations (12)–(14).
$Levy\_s = \frac{M}{|P|^{1/\alpha}}$  (12)
$M \sim N(0, \sigma_M^2)$  (13)
$P \sim N(0, \sigma_P^2)$  (14)
where:
  • $Levy\_s$: step length for the random walk;
  • $\alpha$: 1.5;
  • M: normally distributed random variable with standard deviation $\sigma_M$;
  • P: normally distributed random variable with standard deviation $\sigma_P$.
$\sigma_M$ and $\sigma_P$ are given by Equations (15) and (16), respectively.
$\sigma_M = \left[ \frac{\Gamma(1+\alpha) \times \sin(0.5\pi\alpha)}{\Gamma(0.5(1+\alpha)) \times \alpha \times 2^{0.5(\alpha-1)}} \right]^{1/\alpha}$  (15)
$\sigma_P = 1$  (16)
So the new position of the grasshopper using Lévy’s flight is defined as follows:
$Z_i^d = Levy\_s \cdot c \left( \sum_{j=1, j \neq i}^{n} c\, \frac{ub_d - lb_d}{2}\, s(|Z_j^d - Z_i^d|)\, \frac{Z_j - Z_i}{d_{ij}} \right) + \hat{T}_d$  (17)
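Mantegna's step generation in Equations (12)–(16) can be sketched in a few lines of Python; this is a minimal illustration with $\alpha = 1.5$, and the function names are ours:

```python
import math
import random

def sigma_m(alpha=1.5):
    """Equation (15): scale of the numerator Gaussian in Mantegna's algorithm."""
    num = math.gamma(1 + alpha) * math.sin(0.5 * math.pi * alpha)
    den = math.gamma(0.5 * (1 + alpha)) * alpha * 2 ** (0.5 * (alpha - 1))
    return (num / den) ** (1 / alpha)

def levy_step(alpha=1.5):
    """Equations (12)-(14): draw one heavy-tailed Levy step."""
    m = random.gauss(0.0, sigma_m(alpha))  # M ~ N(0, sigma_M^2)
    p = random.gauss(0.0, 1.0)             # P ~ N(0, 1), since sigma_P = 1
    return m / abs(p) ** (1 / alpha)
```

Most draws are small steps, but occasional large jumps occur; it is these rare long jumps that help the grasshoppers escape local optima.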

3.3. Gravitational Force GOA (GFGOA)

Peng et al. [45] proposed an improved GOA using gravitational force (GF) and a selection probability (p) obtained by normalizing the distance between grasshoppers. In our research work, only the GF concept is considered, as presented in Equation (6). Incorporating only the GF concept into Equation (9) yields
$Z_i^d = c \left( \sum_{j=1, j \neq i}^{n} c\, \frac{ub_d - lb_d}{2}\, s(|Z_j^d - Z_i^d|)\, \frac{Z_j - Z_i}{d_{ij}} \right) - g\, \hat{e}_g + \hat{T}_d$  (18)
where:
  • g: gravitational constant (0.9);
  • $\hat{e}_g$: $\frac{Z_j - Z_i}{d_{ij}}$.

3.4. Sparrow Search Algorithm (SSA)

The SSA, developed by Xue et al. [23], is inspired by the foraging behavior of sparrows. In the SSA, the sparrows are divided into producers (PD) and scroungers (SD). The producers direct the whole population towards the food source. During each iteration, the PD position is updated by Equation (19).
$Z_{i,j}^{r+1} = \begin{cases} Z_{i,j}^{r} \cdot \exp\left(\frac{-i}{\alpha \cdot G}\right), & R_2 < ST \\ Z_{i,j}^{r} + Q \cdot L, & R_2 \geq ST \end{cases}$  (19)
where:
  • r: current iteration;
  • L: a (1 × d) matrix of ones;
  • d: dimension of the variable;
  • $Z_{i,j}^{r}$: position of the j-th dimension of the i-th sparrow at iteration r;
  • G: maximum number of iterations;
  • Q: random number that obeys a normal distribution;
  • $\alpha$: random number in [0, 1];
  • $R_2$: alarm value in [0, 1];
  • ST: safety threshold in [0.5, 1.0];
  • $R_2$ < ST: no predators nearby, so the producers search in wide mode;
  • $R_2$ ≥ ST: a predator has been detected, so the sparrows fly to another place.
The scroungers immediately leave their place to compete for food once they learn that a producer has discovered a food source. The SD position is updated using Equation (20).
$Z_{i,j}^{r+1} = \begin{cases} Q \cdot \exp\left(\frac{Z_{worst}^{r} - Z_{i,j}^{r}}{i^2}\right), & \text{if } i > n/2 \\ Z_P^{r+1} + |Z_{i,j}^{r} - Z_P^{r+1}| \cdot A^{+} \cdot L, & \text{otherwise} \end{cases}$  (20)
where:
  • $Z_{worst}^{r}$: current global worst location at iteration r;
  • $Z_P^{r+1}$: best location currently occupied by the producer;
  • Q: random number that obeys a normal distribution;
  • $A^{+} = A^T (A A^T)^{-1}$;
  • A: one-dimensional vector with each element randomly assigned 1 or −1;
  • i ∈ {1, 2, …, n};
  • n: number of sparrows.
Based on the assumption that 20% of the sparrows are aware of the danger, the positions of those sparrows are updated using Equation (21).
$Z_{i,j}^{r+1} = \begin{cases} Z_{best}^{r} + \beta \cdot |Z_{i,j}^{r} - Z_{best}^{r}|, & f_i > f_g \\ Z_{i,j}^{r} + K \cdot \frac{|Z_{i,j}^{r} - Z_{worst}^{r}|}{(f_i - f_w) + \varepsilon}, & f_i = f_g \end{cases}$  (21)
where:
  • $Z_{best}^{r}$: current global optimal location at iteration r;
  • $\beta$: random number with mean 0 and variance 1;
  • K: random number in [−1, 1];
  • $f_i$: current sparrow's fitness value;
  • $f_g$: current global best fitness value;
  • $f_w$: current global worst fitness value;
  • $\varepsilon$: a small constant that avoids division by zero;
  • $f_i > f_g$: the sparrow is at the edge of the group;
  • $f_i = f_g$: the sparrow is in the middle of the population.
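As an illustration of the producer update in Equation (19), the scalar sketch below passes the random draws $R_2$, $\alpha$, and Q in explicitly (a choice made here for testability; in the actual algorithm they are drawn afresh each iteration):

```python
import math

def producer_update(z, i, alpha, G, R2, ST, Q, L=1.0):
    """Equation (19) for one dimension of producer i.
    z: current position; alpha, R2: uniform draws in [0, 1];
    ST: safety threshold; Q: standard-normal draw; L: entry of the ones matrix."""
    if R2 < ST:
        # No predators nearby: shrink the position for a wide, fine-grained search.
        return z * math.exp(-i / (alpha * G))
    # Alarm raised: fly to another place with a Gaussian jump.
    return z + Q * L
```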

4. Methodology

This section emphasizes four proposed algorithms and their implementations on the unimodal and multimodal BFs. Besides focusing on the verification of the proposed algorithms on the BFs, this paper also focuses on the validation of the proposed algorithm for the improvement of accuracy on the SDP problem.

4.1. Proposed Algorithm

This subsection explains in detail the proposed hybrid BoAs, as mentioned in Section 4.1.1, Section 4.1.2, Section 4.1.3 and Section 4.1.4. Table 1 lists the notations and their explanation used for the proposed algorithms’ flowcharts, as drawn in Figure 1, Figure 2, Figure 3 and Figure 4.

4.1.1. GFLFGOA Hybrid Algorithm

In the original GOA, the parameter "c" was introduced to balance local exploitation and global exploration. However, the optimization process of the original GOA is nonlinear and its exploration ability is limited, so the algorithm may become stuck in local optima, resulting in slow convergence. To overcome these disadvantages, an enhanced version of GOA, called GFLFGOA, is proposed. To strike a good balance between exploration and exploitation, the LF and GF concepts are introduced into the original GOA. After embedding GF and LF into the original GOA, as explained in Section 3.1, Section 3.2 and Section 3.3, the grasshopper position is updated as follows:
$Z_i^d = Levy\_s \cdot c \left( \sum_{j=1, j \neq i}^{n} c\, \frac{ub_d - lb_d}{2}\, s(|Z_j^d - Z_i^d|)\, \frac{Z_j - Z_i}{d_{ij}} \right) - g\, \hat{e}_g + \hat{T}_d$  (22)
The pseudocode of the GFLFGOA algorithm is presented in Algorithm 1.
Algorithm 1: GFLFGOA
[pseudocode image]
To visualize the pseudocode and the process of the proposed GFLFGOA algorithm, a flow diagram is drawn in Figure 1.

4.1.2. LFGOA-SSA Hybrid Algorithm

A new hybrid LFGOA-SSA algorithm is proposed in this paper. Its features improve the convergence speed and accuracy by avoiding local optima. As the flowchart in Figure 2 shows, the developed LFGOA-SSA combines the fast convergence of LFGOA with the high accuracy of SSA; its pseudocode is written in Algorithm 2. This hybrid is driven by a probabilistic selection mechanism after initialization: if rand < 0.5, LFGOA generates the new fitness solution, while SSA is selected for rand ≥ 0.5. For SSA, the search agents are divided into PD and SD. If the search agent is a PD and $R_2$ < ST, the new position is updated as follows:
$Z_{i,j}^{r+1} = Z_{i,j}^{r} \cdot \exp\left(\frac{-i}{\alpha \cdot G}\right)$  (23)
Otherwise, the PD position is updated as follows:
$Z_{i,j}^{r+1} = Z_{i,j}^{r} + Q \cdot L$  (24)
For an SD search agent, if i > n/2, the position is updated by Equation (25); otherwise, by Equation (26).
$Z_{i,j}^{r+1} = Q \cdot \exp\left(\frac{Z_{worst}^{r} - Z_{i,j}^{r}}{i^2}\right)$  (25)
$Z_{i,j}^{r+1} = Z_P^{r+1} + |Z_{i,j}^{r} - Z_P^{r+1}| \cdot A^{+} \cdot L$  (26)
The pseudocode of the proposed LFGOA-SSA algorithm is presented in Algorithm 2.    
Algorithm 2: LFGOA-SSA
[pseudocode image]
    The flow diagram of the LFGOA-SSA algorithm is presented in Figure 2.

4.1.3. GFGOA-SSA Hybrid Algorithm

An enhanced version of GOA named GFGOA, as explained in Section 3.3, is combined with the SSA to improve the balance between exploration and exploitation. The faster convergence of GFGOA and the good exploration capacity of SSA are combined based on the probabilistic selection mechanism. The GFGOA algorithm is selected if the randomly generated value is less than 0.5; otherwise, SSA will be considered. For better visualization of the steps of the GFGOA-SSA hybrid algorithm, the pseudocode and the flow diagram are presented in Algorithm 3 and Figure 3, respectively.
Algorithm 3: GFGOA-SSA

4.1.4. GFLFGOA-SSA Hybrid Algorithm

This novel hybrid algorithm is likewise based on a random selection mechanism. If the random value is less than 0.5, then our proposed GFLFGOA algorithm, as explained in Section 4.1.1, is selected to generate the best fitness value; otherwise, the SSA is used. The pseudocode of the GFLFGOA-SSA algorithm is presented in Algorithm 4.
Algorithm 4: GFLFGOA-SSA
The flow diagram of the GFLFGOA-SSA algorithm is drawn in Figure 4.

4.2. Optimization on Benchmark Functions (BFs)

BFs are artificial problems that can be used to assess the behavior and performance of optimization algorithms in diverse and complex situations [65]. These functions are categorized as unimodal, multimodal, multi-dimensional, and so on. Any new optimization algorithm should be validated by comparing its performance with that of existing optimization algorithms. A total of nine BFs are considered to test the ability of our proposed algorithms: three unimodal BFs ($fn_1(x)$–$fn_3(x)$) and six multimodal BFs ($fn_4(x)$–$fn_9(x)$). Of the six multimodal BFs, three ($fn_4(x)$–$fn_6(x)$) are of fixed dimension, and the remaining three are multi-dimensional functions. The mathematical expressions of the BFs are listed in Table 2, where n represents the population size, G the maximum number of iterations, dim the number of dimensions, Range the interval of the search space, and $fn_{min}$ the optimal solution of the corresponding function. The parameters considered for this experiment are n, G, dim, and Range.
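To make the two BF categories concrete, the sketch below shows one classic unimodal function (sphere) and one classic multimodal function (Rastrigin), together with how the mean and SD statistics of Tables 7 and 8 are gathered over repeated runs. These two functions are illustrative examples of the categories; whether they are among the nine BFs of Table 2 is not assumed, and `optimizer` is a placeholder that returns the best fitness found in one run.

```python
import numpy as np

def sphere(x):
    """Unimodal BF: f(x) = sum(x_i^2); single global minimum 0 at the origin."""
    x = np.asarray(x, dtype=float)
    return float(np.sum(x ** 2))

def rastrigin(x):
    """Multimodal BF with many local optima; global minimum 0 at the origin."""
    x = np.asarray(x, dtype=float)
    return float(10.0 * x.size + np.sum(x ** 2 - 10.0 * np.cos(2.0 * np.pi * x)))

def evaluate(optimizer, fn, dim, runs=30):
    """Run an optimizer `runs` times on BF `fn` and report the mean and SD
    of the best fitness values found, as reported in Tables 7 and 8."""
    best = [optimizer(fn, dim) for _ in range(runs)]
    return float(np.mean(best)), float(np.std(best))
```

An optimizer with good exploitation drives the sphere value toward zero quickly, while escaping Rastrigin's many local optima requires good exploration, which is why both categories are needed to characterize an algorithm.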

4.3. Software Defect Prediction (SDP) Framework

To validate the performance of the proposed algorithms detailed in Section 4.1, we performed a second experiment on the proposed SDP framework to improve the accuracy of the HBoDP model. The accuracy improvement is achieved through hyperparameter tuning. Figure 5 represents the SDP framework followed in our experiment.

4.3.1. Data Source

As for the data source, this paper uses 13 NASA defect datasets (NASA Defect Dataset https://github.com/klainfo/NASADefectDataset, accessed on 5 January 2024) to validate the effectiveness of the hybrid BoAs. A detailed description of the NASA defect dataset is tabulated in Table 3.

4.3.2. Data Pre-Processing

Before inputting the datasets into the ML model, data pre-processing plays an important role in avoiding biases towards any particular features. In this paper, data pre-processing is carried out using four steps, as follows:
  • Label Encoding: Label encoding is a data transformation step that maps non-numeric values to numeric ones. In this research work, the datasets contain yes (Y) and no (N) labels, which are mapped to 0 and 1, respectively.
  • Data Cleaning: In this process, outliers are detected and replaced using the inter-quartile range (IQR) method, and inconsistent values are imputed with the mean of the corresponding attribute.
  • Feature Selection: Because the datasets contain many features, feature selection is performed using the Pearson correlation coefficient to avoid bias towards any one kind of feature; the selected features are tabulated in Table 3.
  • Data Scaling: Data scaling, commonly known as normalization, is performed with a min–max scaler.
After performing the data pre-processing step, the pre-processed data were split at a 75–25 ratio into training and testing sets, respectively.
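The four pre-processing steps and the 75–25 split can be sketched with pandas. This is a minimal illustration under stated assumptions, not the authors' exact procedure: the label column name `defective`, the clipping of outliers to the IQR fences, and the correlation threshold are hypothetical choices (the paper's selected features are summarized in Table 3); only the Y→0/N→1 mapping is taken from the text.

```python
import numpy as np
import pandas as pd

def preprocess(df, label_col="defective", corr_threshold=0.1):
    """Pre-processing sketch: label encoding, IQR-based outlier handling,
    Pearson-correlation feature selection, min-max scaling, 75-25 split."""
    df = df.copy()
    # 1. Label encoding: map Y/N labels to 0/1, as stated in the paper
    df[label_col] = df[label_col].map({"Y": 0, "N": 1})
    features = [c for c in df.columns if c != label_col]
    # 2. Data cleaning: clip outliers to the IQR fences, impute NaN with mean
    for c in features:
        q1, q3 = df[c].quantile([0.25, 0.75])
        iqr = q3 - q1
        df[c] = df[c].clip(q1 - 1.5 * iqr, q3 + 1.5 * iqr)
        df[c] = df[c].fillna(df[c].mean())
    # 3. Feature selection: keep features whose |Pearson r| with the label
    #    exceeds a (hypothetical) threshold
    keep = [c for c in features
            if abs(df[c].corr(df[label_col])) >= corr_threshold]
    # 4. Data scaling: min-max normalization to [0, 1]
    X = df[keep]
    X = (X - X.min()) / (X.max() - X.min()).replace(0, 1)
    # 75-25 train/test split
    n_train = int(0.75 * len(df))
    idx = np.random.default_rng(0).permutation(len(df))
    tr, te = idx[:n_train], idx[n_train:]
    return X.iloc[tr], X.iloc[te], df[label_col].iloc[tr], df[label_col].iloc[te]
```

For simplicity the sketch scales before splitting; in practice, fitting the scaler on the training set only avoids information leakage into the test set.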

4.3.3. Hybrid Bio-Optimized Defect Prediction (HBoDP) Model

The HBoDP model is established by integrating the novel hybrid BoAs (GFLFGOA, LFGOA-SSA, GFGOA-SSA, GFLFGOA-SSA) with the ML models (ANN, XGB). The model is trained and validated using the proposed hybrid BoAs to predict software defects, and its performance is compared against the baseline ANN and XGB models. The model aims to enhance accuracy while keeping the computation efficient. To improve the accuracy of the HBoDP model, hyperparameter tuning is applied; since hyperparameter tuning is itself an optimization problem, BoAs are employed to tune the hyperparameters and thus enhance the accuracy of the model.
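The coupling between a BoA and an ML model can be sketched as a fitness evaluation: each search agent's position encodes a hyperparameter vector, and its fitness is the validation accuracy of the model trained with those hyperparameters. The search space below is hypothetical (the actual ranges used are in Tables 4 and 5), and `optimizer` and `train_and_score` are placeholder callables, not the paper's implementation.

```python
import numpy as np

# Hypothetical hyperparameter search space for an XGB-style model:
# (name, lower bound, upper bound, is_integer)
SPACE = [("learning_rate", 0.01, 0.3, False),
         ("max_depth", 2, 10, True),
         ("n_estimators", 50, 500, True)]

def decode(position):
    """Map a search agent's position in [0, 1]^d to a hyperparameter dict."""
    params = {}
    for value, (name, lo, hi, is_int) in zip(position, SPACE):
        v = lo + (hi - lo) * float(np.clip(value, 0.0, 1.0))
        params[name] = int(round(v)) if is_int else float(v)
    return params

def fitness(position, train_and_score):
    """Fitness of an agent = validation accuracy of the model trained with
    the decoded hyperparameters (higher is better)."""
    return train_and_score(**decode(position))

def tune(optimizer, train_and_score, dim=len(SPACE)):
    """Run a BoA: it searches [0, 1]^dim for the position maximizing the
    fitness and returns the decoded best hyperparameters."""
    best_position = optimizer(lambda p: fitness(p, train_and_score), dim)
    return decode(best_position)
```

Any of the hybrid BoAs can be plugged in as `optimizer`; the ML model is only touched through `train_and_score`, which is why the same tuning loop serves both the ANN and the XGB models.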

5. Results and Discussion

This section presents and analyzes the results of the two experiments, along with the parameter settings of the SDP framework used to conduct them.

5.1. Parameter Settings

This section details the parameter settings of the HBoDP model in two parts: first, a description of the NASA defect dataset, and second, the hyperparameters of the ML models.

5.1.1. NASA Defect Dataset Description

This article considers 13 NASA defect datasets (NASA Defect Dataset https://github.com/klainfo/NASADefectDataset, accessed on 5 January 2024) to validate the effectiveness of the algorithms proposed for the HBoDP model. The NASA defect dataset is a collection of datasets curated from software projects developed within the National Aeronautics and Space Administration (NASA). Given the nature of NASA’s operations, the software it develops is often safety-critical: any defects or failures in such software can lead to catastrophic consequences, including mission failures, loss of expensive equipment, or even loss of life. Therefore, understanding and predicting defects in such systems is of great importance. This dataset provides a rich source of information on defects from real-world projects, making it highly relevant for studies aiming to predict failures in safety-critical systems (SCSs). A detailed description of the NASA defect dataset is tabulated in Table 3.
Based on [66], each instance represents one module from the original source code. A module is a self-contained unit of code that encapsulates one or more functions. Each of these datasets, as specified in Table 3, provides valuable insights into the nature of the defects in different types of software systems.

5.1.2. Hyperparameters of the ML Models

Hyperparameter tuning plays an important part in training the models. The ranges of hyperparameters considered to obtain high accuracy and faster convergence for the ANN and XGB models are tabulated in Table 4 and Table 5, respectively.
The proposed hybrid BoAs, as explained in Section 4.1, are implemented in Python and run in the Jupyter Notebook environment. The experiments are conducted with the system configuration tabulated in Table 6.
All the experiments are conducted by considering the 13 NASA defect datasets, as explained and tabulated in Section 5.1.1 and Table 3, respectively.

5.2. Results

The results of the two experiments are reported in Section 5.2.1 and Section 5.2.2, respectively.

5.2.1. BF Results

The experiments are conducted for 200 iterations; the values of the mean and SD, along with their ranking, are listed in Table 7 and Table 8, respectively. The convergence rates of the algorithms based on the fitness values are plotted in Figure 6.

5.2.2. SDP Framework Results

The experiments are conducted with the system configuration specified in Table 6 for 100 generations; the results are noted in Table 9, Table 10, Table 11 and Table 12. Accuracy is used as the performance metric for evaluating the proposed hybrid algorithms. To verify their computational effectiveness, this paper also compares the runtime of the proposed algorithms with that of the base algorithms.

5.3. Analysis

This paper conducts two different experiments to assess the effectiveness of the proposed algorithms. The first compares the statistical performance of GFLFGOA, GFLFGOA-SSA, GFGOA-SSA, and LFGOA-SSA with that of the LFGOA, GFGOA, GOA, and SSA algorithms. The second aims to improve the prediction accuracy by optimizing the hyperparameters of the ANN and XGB models. For a comprehensive analysis, the discussion is divided into Section 5.3.1, Section 5.3.2 and Section 5.3.3.

5.3.1. BF Analysis

In order to evaluate the statistical performance of the GFLFGOA, GFLFGOA-SSA, GFGOA-SSA, and LFGOA-SSA algorithms on BFs, the mean and SD values, along with the rank of each method, are taken into account. A total of nine BFs are considered, consisting of unimodal, fixed-dimension multimodal, and multi-dimensional multimodal BFs, as detailed in Section 4.2. The statistical results of BFs are tabulated in Table 7 and Table 8. For fair analysis, each BF is experimented on for 200 iterations, and the convergence rates of the algorithms are plotted in Figure 6. The top rank, indicated in bold, is attained by having the lowest mean and SD values for each BF. To avoid ambiguity, we have categorized our discussion based on the types of statistical analysis.
  • Mean Analysis: For a better understanding of the mean analysis of BFs, we presented the discussion based on the types of BFs.
    • Unimodal BFs ($fn_1$–$fn_3$): From Table 7, it can be clearly stated that GFGOA-SSA ranks first for $fn_2$, while for $fn_1$ and $fn_3$, even though SSA ranks first, the proposed algorithms converge relatively faster, as plotted in Figure 6a,c, respectively.
    • Fixed-dimension Multimodal BFs ($fn_4$–$fn_6$): GFLFGOA-SSA ranks first, achieving the lowest mean value for $fn_6$, as tabulated in Table 7. For $fn_4$ and $fn_5$, the proposed algorithms share the same ranking as the base algorithms, and their convergence rates are comparable, as visualized in Figure 6d,e.
    • Multi-dimension Multimodal BFs ($fn_7$–$fn_9$): GFLFGOA-SSA and LFGOA-SSA rank first for $fn_7$ and $fn_9$, respectively. Although SSA ranks first for $fn_8$, its convergence rate is relatively close to that of GFLFGOA-SSA, as plotted in Figure 6h.
  • SD Analysis: To avoid ambiguity, we elaborated the discussion based on the types of BFs.
    • Unimodal BFs ($fn_1$–$fn_3$): For $fn_1$ and $fn_2$, the proposed algorithms (GFLFGOA and GFGOA-SSA, respectively) rank first in SD value, as tabulated in Table 8. In the case of $fn_3$, GFLFGOA-SSA has a faster convergence rate despite the fact that SSA ranks first, as demonstrated in Figure 6c.
    • Fixed-dimension Multimodal BFs ($fn_4$–$fn_6$): For $fn_6$, GFLFGOA-SSA achieves the lowest SD value, as tabulated in Table 8. In the case of $fn_4$, all of the other proposed algorithms have equal convergence rates, despite the fact that SSA ranks first, as plotted in Figure 6d, while in the case of $fn_5$, the convergence rate of the proposed algorithms is nearly identical to that of GFGOA, even though the latter attains an SD of zero.
    • Multi-dimension Multimodal BFs ($fn_7$–$fn_9$): From Table 8, it can be clearly stated that one of our proposed algorithms ranks first for each of $fn_7$–$fn_9$: GFLFGOA-SSA, GFLFGOA, and LFGOA-SSA achieve the lowest SD values for $fn_7$, $fn_8$, and $fn_9$, respectively.
From Table 7 and Table 8, the proposed algorithms rank first in seven of the nine statistical results. However, they give unsatisfactory results on $fn_3$ and $fn_5$. Although the mean and SD results are not satisfactory for these BFs, the proposed algorithms’ convergence rates are relatively equal to those of the base algorithms. Because a unimodal BF possesses a single global optimum, it demonstrates the good exploitation capacity of the proposed algorithms. Multimodal functions are more complicated than unimodal ones because they contain multiple local optima, so they evaluate the algorithms’ exploration capacity. Based on the aforementioned discussion, we can justify the performance of the proposed algorithms.

5.3.2. SDP Framework Analysis

To verify the performance and scalability of the proposed hybrid BoAs, we considered 13 NASA defect datasets, as they cover a wide range of sizes, from 125 to 10,878 instances. A BoA works with a fitness function, which evaluates the quality of a candidate solution within the optimization problem. For our classification problem, the fitness function of the BoA is analogous to the evaluation metric used to assess the performance of the ML models, namely accuracy. To justify their effectiveness, the four proposed algorithms are compared with the four base algorithms. In addition to accuracy, noted in Table 9 and Table 11, the experimental runtime is tabulated in Table 10 and Table 12. Algorithms that provide good accuracy and the lowest runtime compared with the base algorithms can be considered good optimization algorithms. For a precise analysis, we split our discussion into accuracy and runtime analysis with respect to the SDP problem.
  • Algorithm Accuracy Analysis: From Table 9 and Table 11, it can be clearly stated that for all datasets, accuracy is enhanced when the XGB and ANN models are tuned with BoAs. As tabulated in Table 13, for JM1, MW1, PC1, PC3, PC4, and PC5, our proposed algorithms have better optimization effects than the four base algorithms, while for CM1, KC3, KC4, MC1, and PC2, the proposed algorithms perform close to the base approaches. In hyperparameter tuning of the ANN model, GFGOA-SSA, GFLFGOA-SSA, and GFGOA-SSA show the best accuracy for JM1, PC3, and PC5, respectively. For the remaining datasets, our algorithms perform close or equal to the base approaches, as shown in Table 11 and Table 14.
  • Algorithm Runtime Analysis: From Table 10, the computational runtime is lowest for all datasets except CM1 and MW1 when either the LF or GF concept, or both, are embedded into the SSA; for CM1 and MW1, the difference in runtime relative to LFGOA is small. For CM1, JM1, KC1, KC4, MC1, MW1, PC1, PC3, and PC5, the base algorithms’ computational runtime is lower than that of the proposed algorithms, as listed in Table 12, whereas for the other datasets, our proposed algorithms’ runtime shows superiority.
For better visualization, the lowest runtime and highest accuracy for the ML models are noted in Table 13 and Table 14. From the above, we may state that for the XGB model, our proposed algorithms’ runtime is low, whereas for the ANN model, the computational runtime is low only for a few datasets. We may therefore deduce that the runtime is affected by the complexity of the model: as the model complexity increases, the runtime may increase.
For most of the datasets, at least one of our proposed algorithms performs better, so based on the experimental results and the NFL theorem [14], we may conclude that our proposed algorithms perform better in terms of global search and have higher stability. The improvement in global search ability is due to the GF and LF concepts embedded alongside the SSA concept. The experimental results thus verify the robustness of our hybrid approach in terms of global search ability and accuracy.
In addition to the comparison with the base algorithms, this paper also compares our proposed algorithms with other state-of-the-art methods. The observations are noted in Table 15.
From Table 15, it can be clearly stated that our proposed algorithms provide better accuracy than the other state-of-the-art methods for eight datasets. From these observations, we may conclude that our approaches show superiority in both runtime and accuracy; the experimental results indicate that the hybrid algorithms achieve better optimization and stable performance, supporting their computational validity.

5.3.3. Computational Complexity Analysis

The computational complexity analysis of the hybrid BoAs, as detailed in Section 4.1, is analyzed in two ways:
  • Time Complexity: In general, the time complexity can be defined as follows:
    Time Complexity = O(Initialization) + [O(Fitness evaluation of each search agent) + O(Position updation of agents) + O(Sorting)] ∗ Maximum iteration.
    Mathematically, this can be expressed as follows:
    $$\mathrm{Time\ Complexity} = O(nd) + \left[ O(n) + O(nd) + O(n \log n) \right] \cdot G = O\left( nd + Gn(1 + d + \log n) \right) \quad (27)$$
  • Space Complexity: The maximum number of spaces occupied by the proposed algorithm at any time is decided by the random initialization of the population. So, the space complexity can be calculated as follows:
    $$\mathrm{Space\ Complexity} = O(nd) \quad (28)$$
where d represents the dimension, G represents the maximum number of iterations, and n represents the population size. The time and space complexities presented in Equations (27) and (28), respectively, are the same not only for GFLFGOA, GFGOA-SSA, LFGOA-SSA, and GFLFGOA-SSA but also for the base BoAs, as all BoAs go through the same exploration and exploitation phases.

6. Concluding Remarks and Future Scope

This article presents four hybrid BoAs that employ the GOA, SSA, LF, and GF concepts to enhance the exploration and exploitation behavior of the hybrid algorithms. To validate their performance and stability, experiments are conducted, first on BFs and second on the HBoDP model using the NASA defect dataset. The HBoDP model is established by integrating the proposed hybrid algorithms with the ML models (ANN, XGB). The first set of experiments demonstrated the good exploration and exploitation capacity of the proposed algorithms, which perform better on seven BFs than the baseline algorithms. For the remaining two BFs, even though the proposed algorithms do not attain the best mean and SD values, their convergence rates are relatively equal to those of the baselines. The second set of experiments tunes the ML hyperparameters in the HBoDP model to improve its accuracy.
Achieving higher accuracy and lower runtime than the baseline algorithms validates the effectiveness of the algorithms proposed in this study. Based on the results and analysis, the proposed algorithms trade off some accuracy on a few datasets but are effective in terms of runtime. Some caution is warranted: for the ANN model, the neuron weights change when the experiments are rerun, which may affect the global fitness values. Different tuning parameter values in the optimization methods might also lead to significant differences in performance, and hence to different conclusions, as might changes in the hyperparameter ranges, such as the learning rate, epochs, population size, and number of iterations.
To address these limitations and further validate the algorithms, future research should test these hybrid algorithms on unimodal and multimodal BFs of varying dimensions to prove scalability. Future work also includes tuning the hyperparameters of other ML and deep learning (DL) models. The implementation of these novel algorithms can further be extended to other engineering problems.

Author Contributions

Conceptualization, M.D. and B.R.M.; Methodology, M.D. and N.P.; Software, M.D. and N.P.; Formal analysis, M.D.; Investigation, M.D.; Data curation, M.D.; Writing—original draft, M.D.; Writing—review & editing, M.D., B.R.M. and R.M.R.G.; Visualization, B.R.M.; Supervision, B.R.M. and R.M.R.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Van Thieu, N.; Mirjalili, S. MEALPY: An open-source library for latest meta-heuristic algorithms in Python. J. Syst. Archit. 2023, 139, 102871. [Google Scholar] [CrossRef]
  2. Holland, J.H. Genetic algorithms. Sci. Am. 1992, 267, 66–73. [Google Scholar] [CrossRef]
  3. Bäck, T.; Schwefel, H.P. An overview of evolutionary algorithms for parameter optimization. Evol. Comput. 1993, 1, 1–23. [Google Scholar] [CrossRef]
  4. Eberhart, R.; Kennedy, J. Particle swarm optimization. In Proceedings of the IEEE International Conference on Neural Networks, Perth, WA, Australia, 27 November–1 December 1995; Volume 4, pp. 1942–1948. [Google Scholar]
  5. Meraihi, Y.; Gabis, A.B.; Mirjalili, S.; Ramdane-Cherif, A. Grasshopper optimization algorithm: Theory, variants, and applications. IEEE Access 2021, 9, 50001–50024. [Google Scholar] [CrossRef]
  6. Wu, H.S.; Zhang, F.M. Wolf pack algorithm for unconstrained global optimization. Math. Probl. Eng. 2014, 2014, 465082. [Google Scholar] [CrossRef]
  7. Karaboga, D.; Basturk, B. An artificial bee colony (ABC) algorithm for numeric function optimization. In Proceedings of the IEEE Swarm Intelligence Symposium, Indianapolis, IN, USA, 28–29 September 2006; IEEE: Piscataway, NJ, USA, 2006; Volume 2006. [Google Scholar]
  8. Heidari, A.A.; Mirjalili, S.; Faris, H.; Aljarah, I.; Mafarja, M.; Chen, H. Harris hawks optimization: Algorithm and applications. Future Gener. Comput. Syst. 2019, 97, 849–872. [Google Scholar] [CrossRef]
  9. Hussain, K.; Mohd Salleh, M.N.; Cheng, S.; Shi, Y. Metaheuristic research: A comprehensive survey. Artif. Intell. Rev. 2019, 52, 2191–2233. [Google Scholar] [CrossRef]
  10. Nevendra, M.; Singh, P. Empirical investigation of hyperparameter optimization for software defect count prediction. Expert Syst. Appl. 2022, 191, 116217. [Google Scholar] [CrossRef]
  11. Nematzadeh, S.; Kiani, F.; Torkamanian-Afshar, M.; Aydin, N. Tuning hyperparameters of machine learning algorithms and deep neural networks using metaheuristics: A bioinformatics study on biomedical and biological cases. Comput. Biol. Chem. 2022, 97, 107619. [Google Scholar] [CrossRef]
  12. Lentzas, A.; Nalmpantis, C.; Vrakas, D. Hyperparameter tuning using quantum genetic algorithms. In Proceedings of the 2019 IEEE 31st International Conference on Tools with Artificial Intelligence (ICTAI), Portland, OR, USA, 4–6 November 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1412–1416. [Google Scholar]
  13. Shepperd, M.; Song, Q.; Sun, Z.; Mair, C. Data quality: Some comments on the nasa software defect datasets. IEEE Trans. Softw. Eng. 2013, 39, 1208–1215. [Google Scholar] [CrossRef]
  14. Wolpert, D.H.; Macready, W.G. No free lunch theorems for optimization. IEEE Trans. Evol. Comput. 1997, 1, 67–82. [Google Scholar] [CrossRef]
  15. Zang, H.; Zhang, S.; Hapeshi, K. A review of nature-inspired algorithms. J. Bionic Eng. 2010, 7, S232–S237. [Google Scholar] [CrossRef]
  16. McCall, J. Genetic algorithms for modelling and optimisation. J. Comput. Appl. Math. 2005, 184, 205–222. [Google Scholar] [CrossRef]
  17. Karaboga, D.; Basturk, B. A powerful and efficient algorithm for numerical function optimization: Artificial bee colony (ABC) algorithm. J. Glob. Optim. 2007, 39, 459–471. [Google Scholar] [CrossRef]
  18. Zhen, L.; Liu, Y.; Dongsheng, W.; Wei, Z. Parameter estimation of software reliability model and prediction based on hybrid wolf pack algorithm and particle swarm optimization. IEEE Access 2020, 8, 29354–29369. [Google Scholar] [CrossRef]
  19. Zulfiqar, M.; Kamran, M.; Rasheed, M.; Alquthami, T.; Milyani, A. Hyperparameter optimization of support vector machine using adaptive differential evolution for electricity load forecasting. Energy Rep. 2022, 8, 13333–13352. [Google Scholar] [CrossRef]
  20. Blume, S.; Benedens, T.; Schramm, D. Hyperparameter optimization techniques for designing software sensors based on artificial neural networks. Sensors 2021, 21, 8435. [Google Scholar] [CrossRef]
  21. Akter, S.; Nahar, N.; ShahadatHossain, M.; Andersson, K. A new crossover technique to improve genetic algorithm and its application to TSP. In Proceedings of the 2019 International Conference on Electrical, Computer and Communication Engineering (ECCE), Cox’s Bazar, Bangladesh, 7–9 February 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–6. [Google Scholar]
  22. Si, B.; Liu, F.; Li, Y. Metamodel-based hyperparameter optimization of optimization algorithms in building energy optimization. Buildings 2023, 13, 167. [Google Scholar] [CrossRef]
  23. Xue, J.; Shen, B. A novel swarm intelligence optimization approach: Sparrow search algorithm. Syst. Sci. Control Eng. 2020, 8, 22–34. [Google Scholar] [CrossRef]
  24. Zhang, Z.; Han, Y. Discrete sparrow search algorithm for symmetric traveling salesman problem. Appl. Soft Comput. 2022, 118, 108469. [Google Scholar] [CrossRef]
  25. Yang, X.; Liu, J.; Liu, Y.; Xu, P.; Yu, L.; Zhu, L.; Chen, H.; Deng, W. A novel adaptive sparrow search algorithm based on chaotic mapping and t-distribution mutation. Appl. Sci. 2021, 11, 11192. [Google Scholar] [CrossRef]
  26. Ouyang, C.; Qiu, Y.; Zhu, D. Adaptive spiral flying sparrow search algorithm. Sci. Program. 2021, 2021, 1–16. [Google Scholar] [CrossRef]
  27. Liang, Q.; Chen, B.; Wu, H.; Han, M. A novel modified sparrow search algorithm based on adaptive weight and improved boundary constraints. In Proceedings of the 2021 IEEE 6th International Conference on Computer and Communication Systems (ICCCS), Chengdu, China, 23–26 April 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 104–109. [Google Scholar]
  28. Zhao, Q.; Tao, R.; Li, J.; Mu, Y. An improved wolf pack algorithm. In Proceedings of the 2020 Chinese Control And Decision Conference (CCDC), Hefei, China, 22–24 August 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 626–633. [Google Scholar]
  29. Li, H.; Wu, H. An oppositional wolf pack algorithm for Parameter identification of the chaotic systems. Optik 2016, 127, 9853–9864. [Google Scholar] [CrossRef]
  30. Xiu, Z.; Zhen-Hua, W. Improved Wolf Pack Algorithm Based on Tent Chaotic Mapping and Levy Flight. In Proceedings of the 2017 International Conference on Robots & Intelligent System (ICRIS), Huai’an, China, 15–16 October 2017; pp. 165–169. [Google Scholar] [CrossRef]
  31. Chen, X.; Cheng, F.; Liu, C.; Cheng, L.; Mao, Y. An improved Wolf pack algorithm for optimization problems: Design and evaluation. PLoS ONE 2021, 16, e0254239. [Google Scholar] [CrossRef] [PubMed]
  32. Jadon, S.S.; Tiwari, R.; Sharma, H.; Bansal, J.C. Hybrid artificial bee colony algorithm with differential evolution. Appl. Soft Comput. 2017, 58, 11–24. [Google Scholar] [CrossRef]
  33. Mirjalili, S.; Wang, G.G.; Coelho, L.d.S. Binary optimization using hybrid particle swarm optimization and gravitational search algorithm. Neural Comput. Appl. 2014, 25, 1423–1435. [Google Scholar] [CrossRef]
  34. Li, Z.; Yu, M.; Wang, D.; Wei, H. Using hybrid algorithm to estimate and predicate based on software reliability model. IEEE Access 2019, 7, 84268–84283. [Google Scholar] [CrossRef]
  35. Yang, L.; Li, Z.; Wang, D.; Miao, H.; Wang, Z. Software defects prediction based on hybrid particle swarm optimization and sparrow search algorithm. IEEE Access 2021, 9, 60865–60879. [Google Scholar] [CrossRef]
  36. Saremi, S.; Mirjalili, S.; Lewis, A. Grasshopper optimisation algorithm: Theory and application. Adv. Eng. Softw. 2017, 105, 30–47. [Google Scholar] [CrossRef]
  37. Abualigah, L.; Diabat, A. A comprehensive survey of the Grasshopper optimization algorithm: Results, variants, and applications. Neural Comput. Appl. 2020, 32, 15533–15556. [Google Scholar] [CrossRef]
  38. Razmjooy, N.; Estrela, V.V.; Loschi, H.J.; Fanfan, W. A comprehensive survey of new meta-heuristic algorithms. In Recent Advances in Hybrid Metaheuristics for Data Clustering; Wiley Publishing: Hoboken, NJ, USA, 2019. [Google Scholar]
  39. El-Henawy, I.; Abdelmegeed, N.A. Meta-heuristics algorithms: A survey. Int. J. Comput. Appl. 2018, 179, 45–54. [Google Scholar] [CrossRef]
  40. Arora, S.; Anand, P. Chaotic grasshopper optimization algorithm for global optimization. Neural Comput. Appl. 2019, 31, 4385–4405. [Google Scholar] [CrossRef]
  41. Zhao, S.; Wang, P.; Heidari, A.A.; Zhao, X.; Ma, C.; Chen, H. An enhanced Cauchy mutation grasshopper optimization with trigonometric substitution: Engineering design and feature selection. Eng. Comput. 2022, 38, 4583–4616. [Google Scholar] [CrossRef]
  42. Ewees, A.A.; Gaheen, M.A.; Yaseen, Z.M.; Ghoniem, R.M. Grasshopper optimization algorithm with crossover operators for feature selection and solving engineering problems. IEEE Access 2022, 10, 23304–23320. [Google Scholar] [CrossRef]
  43. Yildiz, B.S.; Pholdee, N.; Bureerat, S.; Yildiz, A.R.; Sait, S.M. Enhanced grasshopper optimization algorithm using elite opposition-based learning for solving real-world engineering problems. Eng. Comput. 2022, 38, 4207–4219. [Google Scholar] [CrossRef]
  44. Feng, Y.; Liu, M.; Zhang, Y.; Wang, J. A dynamic opposite learning assisted grasshopper optimization algorithm for the flexible jobscheduling problem. Complexity 2020, 2020, 8870783. [Google Scholar] [CrossRef]
  45. Qin, P.; Hu, H.; Yang, Z. The improved grasshopper optimization algorithm and its applications. Sci. Rep. 2021, 11, 23733. [Google Scholar] [CrossRef]
  46. Behera, R.K.; Shukla, S.; Rath, S.K.; Misra, S. Software reliability assessment using machine learning technique. In Computational Science and Its Applications–ICCSA 2018: Proceedings of the 18th International Conference, Melbourne, VIC, Australia, 2–5 July 2018, Proceedings, Part V 18; Springer: Berlin/Heidelberg, Germany, 2018; pp. 403–411. [Google Scholar]
  47. Batool, I.; Khan, T.A. Software fault prediction using deep learning techniques. Softw. Qual. J. 2023, 31, 1241–1280. [Google Scholar] [CrossRef]
  48. Jayanthi, R.; Florence, L. Software defect prediction techniques using metrics based on neural network classifier. Clust. Comput. 2019, 22, 77–88. [Google Scholar] [CrossRef]
  49. Fan, G.; Diao, X.; Yu, H.; Yang, K.; Chen, L. Software defect prediction via attention-based recurrent neural network. Sci. Program. 2019, 2019, 6230953. [Google Scholar] [CrossRef]
  50. Batool, I.; Khan, T.A. Software fault prediction using data mining, machine learning and deep learning techniques: A systematic literature review. Comput. Electr. Eng. 2022, 100, 107886. [Google Scholar] [CrossRef]
  51. Sobhana, M.; Preethi, G.S.S.; Sri, G.H.; Sujitha, K.B. Improved Reliability Prediction in Engineering Systems Based on Artificial Neural Network. In Proceedings of the 2022 International Mobile and Embedded Technology Conference (MECON), Noida, India, 10–11 March 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 455–460. [Google Scholar]
  52. Jindal, A.; Gupta, A. Comparative Analysis of Software Reliability Prediction Using Machine Learning and Deep Learning. In Proceedings of the 2022 Second International Conference on Artificial Intelligence and Smart Energy (ICAIS), Coimbatore, India, 23–25 February 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 389–394. [Google Scholar]
  53. Alghanim, F.; Azzeh, M.; El-Hassan, A.; Qattous, H. Software defect density prediction using deep learning. IEEE Access 2022, 10, 114629–114641. [Google Scholar] [CrossRef]
  54. Clemente, C.J.; Jaafar, F.; Malik, Y. Is predicting software security bugs using deep learning better than the traditional machine learning algorithms? In Proceedings of the 2018 IEEE International Conference on Software Quality, Reliability and Security (QRS), Lisbon, Portugal, 16–20 July 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 95–102. [Google Scholar]
  55. Wongpheng, K.; Visutsak, P. Software defect prediction using convolutional neural network. In Proceedings of the 2020 35th International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC), Nagoya, Japan, 3–6 July 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 240–243. [Google Scholar]
  56. Cetiner, M.; Sahingoz, O.K. A comparative analysis for machine learning based software defect prediction systems. In Proceedings of the 2020 11th International Conference on Computing, Communication and Networking Technologies (ICCCNT), Kharagpur, India, 1–3 July 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 1–7. [Google Scholar]
  57. Alsaeedi, A.; Khan, M.Z. Software defect prediction using supervised machine learning and ensemble techniques: A comparative study. J. Softw. Eng. Appl. 2019, 12, 85–100. [Google Scholar] [CrossRef]
  58. Li, R.; Zhou, L.; Zhang, S.; Liu, H.; Huang, X.; Sun, Z. Software defect prediction based on ensemble learning. In Proceedings of the 2019 2nd International Conference on Data Science and Information Technology, Seoul, Republic of Korea, 19–21 July 2019; pp. 1–6. [Google Scholar]
  59. Iqbal, A.; Aftab, S.; Ali, U.; Nawaz, Z.; Sana, L.; Ahmad, M.; Husen, A. Performance analysis of machine learning techniques on software defect prediction using NASA datasets. Int. J. Adv. Comput. Sci. Appl. 2019, 10, 300–308. [Google Scholar] [CrossRef]
  60. Malhotra, R. Comparative analysis of statistical and machine learning methods for predicting faulty modules. Appl. Soft Comput. 2014, 21, 286–297. [Google Scholar] [CrossRef]
  61. Parashar, A.; Kumar Goyal, R.; Kaushal, S.; Kumar Sahana, S. Machine learning approach for software defect prediction using multi-core parallel computing. Autom. Softw. Eng. 2022, 29, 44. [Google Scholar] [CrossRef]
  62. Topaz, C.M.; Bernoff, A.J.; Logan, S.; Toolson, W. A model for rolling swarms of locusts. Eur. Phys. J. Spec. Top. 2008, 157, 93–109. [Google Scholar] [CrossRef]
  63. Kamaruzaman, A.F.; Zain, A.M.; Yusuf, S.M.; Udin, A. Levy flight algorithm for optimization problems-a literature review. Appl. Mech. Mater. 2013, 421, 496–501. [Google Scholar] [CrossRef]
  64. Luo, J.; Chen, H.; Xu, Y.; Huang, H.; Zhao, X. An improved grasshopper optimization algorithm with application to financial stress prediction. Appl. Math. Model. 2018, 64, 654–668. [Google Scholar] [CrossRef]
  65. Jamil, M.; Yang, X.S. A literature survey of benchmark functions for global optimisation problems. Int. J. Math. Model. Numer. Optim. 2013, 4, 150–194. [Google Scholar] [CrossRef]
  66. Lemon, B. The Effect of Locality Based Learning on Software Defect Prediction; West Virginia University: Morgantown, WV, USA, 2010. [Google Scholar]
  67. Ali, M.; Mazhar, T.; Arif, Y.; Al-Otaibi, S.; Ghadi, Y.Y.; Shahzad, T.; Khan, M.A.; Hamam, H. Software Defect Prediction Using an Intelligent Ensemble-Based Model. IEEE Access 2024, 12, 20376–20395. [Google Scholar] [CrossRef]
  68. Yadav, S. Software Reliability Prediction by using Deep Learning Technique. Int. J. Adv. Comput. Sci. Appl. 2022, 13, 683–693. [Google Scholar] [CrossRef]
  69. Mumtaz, B.; Kanwal, S.; Alamri, S.; Khan, F. Feature Selection Using Artificial Immune Network: An Approach for Software Defect Prediction. Intell. Autom. Soft Comput. 2021, 29, 669–684. [Google Scholar] [CrossRef]
  70. Odejide, B.J.; Bajeh, A.O.; Balogun, A.O.; Alanamu, Z.O.; Adewole, K.S.; Akintola, A.G.; Salihu, S.A.; Usman-Hamza, F.E.; Mojeed, H.A. An empirical study on data sampling methods in addressing class imbalance problem in software defect prediction. In Computer Science On-Line Conference; Springer: Cham, Switzerland, 2022; pp. 594–610. [Google Scholar]
  71. Balogun, A.O.; Basri, S.; Capretz, L.F.; Mahamad, S.; Imam, A.A.; Almomani, M.A.; Adeyemo, V.E.; Alazzawi, A.K.; Bajeh, A.O.; Kumar, G. Software defect prediction using wrapper feature selection based on dynamic re-ranking strategy. Symmetry 2021, 13, 2166. [Google Scholar] [CrossRef]
  72. Balogun, A.O.; Lafenwa-Balogun, F.B.; Mojeed, H.A.; Adeyemo, V.E.; Akande, O.N.; Akintola, A.G.; Bajeh, A.O.; Usman-Hamza, F.E. SMOTE-based homogeneous ensemble methods for software defect prediction. In Computational Science and Its Applications–ICCSA 2020: Proceedings of the 20th International Conference, Cagliari, Italy, 1–4 July 2020, Proceedings, Part VI 20; Springer: Berlin/Heidelberg, Germany, 2020; pp. 615–631. [Google Scholar]
Figure 1. GFLFGOA flowchart.
Figure 2. LFGOA-SSA flowchart.
Figure 3. GFGOA-SSA flowchart.
Figure 4. GFLFGOA-SSA flowchart.
Figure 5. Software defect prediction (SDP) framework.
Figure 6. Convergence rate of benchmark functions (BFs): (a) fn1; (b) fn2; (c) fn3; (d) fn4; (e) fn5; (f) fn6; (g) fn7; (h) fn8; (i) fn9.
Table 1. Notations used in flowcharts.
Symbol | Explanation
r | Current iteration
G | Maximum number of iterations
ST | Safety threshold, in [0.5, 1.0]
SD | Scrounger sparrow
PD | Producer sparrow
Z_best | The best position and its corresponding fitness value. In the flowcharts of Figure 1, Figure 2, Figure 3 and Figure 4, Z_best returns the current best individual after the initialization step, whereas, once the r < G condition becomes false, Z_best returns the global best position and the corresponding fitness value.
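As a plain-code illustration of the loop structure shared by the flowcharts, the sketch below tracks Z_best exactly as described in Table 1: best-of-population after initialization, global best once the r < G condition fails. The `step` callback is a placeholder for the algorithm-specific GOA/SSA position updates; the function name and structure are our assumptions, not the paper's code.

```python
import numpy as np

def optimize(fitness, init_pop, step, G):
    """Generic bookkeeping loop: Z_best holds the best position and
    fitness seen so far (illustrative sketch, not the exact
    GOA/SSA update rules)."""
    pop = init_pop.copy()
    fit = np.apply_along_axis(fitness, 1, pop)
    best_i = fit.argmin()
    z_best = (pop[best_i].copy(), fit[best_i])  # best after initialization
    for r in range(1, G + 1):                   # iterate while r < G holds
        pop = step(pop, z_best[0], r, G)        # algorithm-specific update
        fit = np.apply_along_axis(fitness, 1, pop)
        i = fit.argmin()
        if fit[i] < z_best[1]:
            z_best = (pop[i].copy(), fit[i])
    return z_best                               # global best when loop exits
```

Here `step` would encapsulate the grasshopper position update or the SSA producer/scrounger moves; only the Z_best bookkeeping is fixed by the flowcharts.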
Table 2. Benchmark functions [65].
Name | Mathematical Equation | Type | (n, G, dim) | Range | fn min
Quartic | $f_{n1}(x) = \sum_{i=1}^{d} i x_i^4 + \mathrm{random}[0, 1)$ | Unimodal | 100, 200, 30 | [−1.28, 1.28] | 0
Rosenbrock | $f_{n2}(x) = \sum_{i=1}^{d-1} \left[ 100 (x_{i+1} - x_i^2)^2 + (x_i - 1)^2 \right]$ | Unimodal | 100, 200, 30 | [−30, 30] | 0
Schwefel 2.21 | $f_{n3}(x) = \max_{1 \le i \le d} |x_i|$ | Unimodal | 100, 200, 30 | [−100, 100] | 0
Six-Hump Camel | $f_{n4}(x) = \left( 4 - 2.1 x_1^2 + \frac{x_1^4}{3} \right) x_1^2 + x_1 x_2 + \left( -4 + 4 x_2^2 \right) x_2^2$ | Fixed-dimension multimodal | 100, 200, 2 | [−5, 5] | −1.0316
Branin | $f_{n5}(x) = \left( x_2 - \frac{5.1 x_1^2}{4 \pi^2} + \frac{5 x_1}{\pi} - 6 \right)^2 + 10 \left( 1 - \frac{1}{8 \pi} \right) \cos(x_1) + 10$ | Fixed-dimension multimodal | 100, 200, 2 | x1 ∈ [−5, 10], x2 ∈ [0, 15] | 0.397887
Booth | $f_{n6}(x) = (x_1 + 2 x_2 - 7)^2 + (2 x_1 + x_2 - 5)^2$ | Fixed-dimension multimodal | 100, 200, 2 | [−10, 10] | 0
Zakharov | $f_{n7}(x) = \sum_{i=1}^{n} x_i^2 + \left( \frac{1}{2} \sum_{i=1}^{n} i x_i \right)^2 + \left( \frac{1}{2} \sum_{i=1}^{n} i x_i \right)^4$ | Multi-dimension multimodal | 100, 200, 30 | [−5, 10] | 0
Rastrigin | $f_{n8}(x) = \sum_{i=1}^{d} \left[ x_i^2 - 10 \cos(2 \pi x_i) + 10 \right]$ | Multi-dimension multimodal | 100, 200, 30 | [−5.12, 5.12] | 0
Schaffer 6 | $f_{n9}(x) = \sum_{i=1}^{d-1} \left[ 0.5 + \frac{\sin^2\left(\sqrt{x_i^2 + x_{i+1}^2}\right) - 0.5}{\left( 1 + 0.001 (x_i^2 + x_{i+1}^2) \right)^2} \right]$ | Multi-dimension multimodal | 100, 200, 30 | [−100, 100] | 0
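Several of the benchmark functions above translate directly into code. A minimal NumPy sketch of fn2 (Rosenbrock), fn6 (Booth), and fn8 (Rastrigin), assuming the standard definitions surveyed in [65]:

```python
import numpy as np

def rosenbrock(x):
    """fn2: global minimum 0 at x = (1, ..., 1)."""
    x = np.asarray(x, dtype=float)
    return float(np.sum(100.0 * (x[1:] - x[:-1] ** 2) ** 2 + (x[:-1] - 1.0) ** 2))

def booth(x):
    """fn6: global minimum 0 at (1, 3)."""
    x1, x2 = x
    return float((x1 + 2 * x2 - 7) ** 2 + (2 * x1 + x2 - 5) ** 2)

def rastrigin(x):
    """fn8: global minimum 0 at the origin."""
    x = np.asarray(x, dtype=float)
    return float(np.sum(x ** 2 - 10.0 * np.cos(2.0 * np.pi * x) + 10.0))
```

With (n, G, dim) = (100, 200, 30), each optimizer evaluates such a function once per agent per iteration, so the convergence curves of Figure 6 plot the best fitness found against the iteration count.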
Table 3. Dataset description.
Dataset | Description | Target Feature | Original Features | Selected Features | Instances
CM1 | Spacecraft instrument's software | Defective | 38 | 26 | 327
JM1 | Real-time predictive ground system | label | 22 | 16 | 10,878
KC1 | Spacecraft's ground data system (storage management) | Defective | 22 | 15 | 2107
KC3 | Flight software system | Defective | 40 | 25 | 194
KC4 | Software system related to spacecraft operations | Defective | 41 | 35 | 125
MC1 | Spacecraft's data processing system | Defective | 40 | 30 | 9466
MC2 | Spacecraft's power distribution software system | Defective | 40 | 27 | 124
MW1 | Ground data system for a weather satellite | Defective | 38 | 28 | 250
PC1 | Air traffic control software system | Defective | 38 | 26 | 679
PC2 | Spacecraft's altitude control software system | Defective | 37 | 25 | 722
PC3 | University's administration software system | Defective | 38 | 25 | 1053
PC4 | Spacecraft's orbit determination software system | Defective | 38 | 27 | 1270
PC5 | Satellite's ground software system | Defective | 39 | 26 | 1694
Table 4. Hyperparameters of ANN.
Hyperparameter | Lower Bound | Upper Bound
Learning rate | 0.001 | 0.1
Neurons in layer 1 | 8 | 10
Neurons in layer 2 | 6 | 10
Batch size | 4 | 32
Epochs | 10 | 50
Table 5. Hyperparameters of XGB.
Hyperparameter | Lower Bound | Upper Bound
Learning rate | 0.001 | 0.1
Max depth | 3 | 18
Subsample | 0 | 1
N estimators | 50 | 200
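The bounds in Tables 4 and 5 define the search space each optimizer explores; a common way to connect an agent's continuous position to a trainable model is to decode each coordinate into its hyperparameter range. The sketch below is illustrative only (the `decode` helper and the dictionary layout are our assumptions, not the paper's implementation):

```python
# XGB search space from Table 5: (lower, upper, type) per hyperparameter.
XGB_BOUNDS = {
    "learning_rate": (0.001, 0.1, float),
    "max_depth":     (3, 18, int),
    "subsample":     (0.0, 1.0, float),
    "n_estimators":  (50, 200, int),
}

def decode(position, bounds=XGB_BOUNDS):
    """Map each coordinate in [0, 1] to its [lower, upper] range,
    rounding integer-valued hyperparameters."""
    params = {}
    for u, (name, (lo, hi, typ)) in zip(position, bounds.items()):
        u = min(max(u, 0.0), 1.0)          # clip agents that left the box
        val = lo + u * (hi - lo)
        params[name] = typ(round(val)) if typ is int else val
    return params
```

The decoded dictionary can then be passed straight to the model constructor (e.g., `XGBClassifier(**decode(position))`), so the optimizer only ever manipulates real-valued vectors.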
Table 6. System configuration.
Specification | Value
RAM | 32 GB
Hard Disk | 512 GB
Processor | Intel Core i7
OS Name | Ubuntu
OS Type | 64-bit
Table 7. Mean value and ranking of eight BoAs (bold indicates lowest mean value).
BFIndexGFLFGOAGFLFGOA-SSALFGOA-SSAGFGOA-SSALFGOAGFGOAGOASSA
f n 1 Mean 5.73 × 10 1 4.77 × 10 3 6.09 × 10 3 2.76 × 10 3 2.94 × 10 1 7.66 × 10 1 3.69 × 10 1 1 . 81 × 10 3
Rank73425861
f n 2 Mean 1.95 × 10 7 5.85 × 10 2 2.16 × 10 1 3 . 28 × 10 2 5.59 × 10 7 2.58 × 10 6 1.25 × 10 8 1.47 × 10 1
Rank62417583
f n 3 Mean 6.75 × 10 1 4.25 × 10 4 9.61 × 10 4 3.66 × 10 4 8.72 × 10 1 5.98 × 10 1 8.57 × 10 1 2 . 43 × 10 4
Rank63428571
f n 4 Mean 9.67 × 10 1 1.02 × 10 0 1 . 03 × 10 0 9.90 × 10 1 1.02 × 10 0 8.72 × 10 1 9.94 × 10 1 1 . 03 × 10 0
Rank52142631
f n 5 Mean 4.53 × 10 1 3.99 × 10 1 3.99 × 10 1 3.99 × 10 1 3 . 98 × 10 1 5.12 × 10 1 4.02 × 10 1 3 . 98 × 10 1
Rank42221531
f n 6 Mean 1.14 × 10 1 7 . 67 × 10 3 9.82 × 10 3 1.26 × 10 2 4.01 × 10 2 7.91 × 10 2 6.68 × 10 2 8.55 × 10 2
Rank81234657
f n 7 Mean 6.61 × 10 2 9 . 19 × 10 2 1.80 × 10 1 7.30 × 10 0 4.70 × 10 2 8.94 × 10 2 4.41 × 10 2 4.53 × 10 1
Rank71436852
f n 8 Mean 4.22 × 10 2 1.82 × 10 1 2.34 × 10 2 1.33 × 10 1 2.82 × 10 2 4.17 × 10 2 3.34 × 10 2 2 . 49 × 10 7
Rank84235761
f n 9 Mean 1.55 × 10 2 1.40 × 10 4 3 . 04 × 10 7 8.45 × 10 6 2.17 × 10 2 1.43 × 10 2 2.75 × 10 2 7.33 × 10 7
Rank64137582
Table 8. Standard deviation (SD) value and ranking of eight BoAs (bold indicates lowest SD value).
BFIndexGFLFGOAGFLFGOA-SSALFGOA-SSAGFGOA-SSALFGOAGFGOAGOASSA
f n 1 SD 7 . 11 × 10 15 3.01 × 10 2 5.05 × 10 2 1.76 × 10 2 2.79 × 10 1 2.84 × 10 14 1.70 × 10 1 9.78 × 10 3
Rank15648273
f n 2 SD 3.02 × 10 7 3.60 × 10 1 2.62 × 10 0 2 . 93 × 10 1 5.14 × 10 7 1.21 × 10 7 8.21 × 10 7 2.03 × 10 0
Rank62417583
f n 3 SD 1.31 × 10 1 3.12 × 10 3 1.01 × 10 2 3.15 × 10 3 6.09 × 10 1 1.87 × 10 1 3.19 × 10 0 5 . 43 × 10 4
Rank72435861
f n 4 SD 2.46 × 10 1 5.52 × 10 2 7.39 × 10 3 1.56 × 10 1 8.72 × 10 2 5.23 × 10 2 3.26 × 10 2 9 . 22 × 10 4
Rank85276431
f n 5 SD 4.29 × 10 2 3.87 × 10 3 9.16 × 10 3 3.63 × 10 4 1.48 × 10 3 0 . 00 × 10 0 1.91 × 10 2 3.31 × 10 4
Rank85634172
f n 6 SD 4.42 × 10 1 1 . 77 × 10 2 2.84 × 10 2 3.86 × 10 2 2.22 × 10 1 5.00 × 10 1 1.30 × 10 1 3.19 × 10 1
Rank71235846
f n 7 SD 8.36 × 10 1 7 . 22 × 10 1 2.20 × 10 2 6.27 × 10 1 1.51 × 10 2 1.21 × 10 2 1.49 × 10 2 6.40 × 10 0
Rank41837562
f n 8 SD 0 . 00 × 10 0 2.21 × 10 0 2.13 × 10 1 1.25 × 10 0 5.60 × 10 1 5.68 × 10 14 2.30 × 10 1 4.47 × 10 7
Rank16458273
f n 9 SD 3.25 × 10 2 1.14 × 10 3 2 . 08 × 10 6 6.28 × 10 5 3.15 × 10 2 8.71 × 10 3 3.76 × 10 2 1.03 × 10 5
Rank74136582
Table 9. Accuracy of XGB.
Dataset | Without Optimization | GFLFGOA | GFLFGOA-SSA | LFGOA-SSA | GFGOA-SSA | LFGOA | GFGOA | GOA | SSA
CM1 | 0.89024 | 0.93902 | 0.93902 | 0.92683 | 0.93902 | 0.92683 | 0.92683 | 0.92683 | 0.93902
JM1 | 0.81066 | 0.81471 | 0.81507 | 0.81434 | 0.81691 | 0.81434 | 0.81213 | 0.81397 | 0.81654
KC1 | 0.86338 | 0.87666 | 0.87666 | 0.87097 | 0.87666 | 0.86717 | 0.87856 | 0.86907 | 0.87856
KC3 | 0.81633 | 0.85714 | 0.85714 | 0.85714 | 0.85714 | 0.83674 | 0.85714 | 0.83674 | 0.85714
KC4 | 0.6875 | 0.75 | 0.78125 | 0.78125 | 0.78125 | 0.71875 | 0.78125 | 0.75 | 0.75
MC1 | 0.99451 | 0.99620 | 0.99662 | 0.99620 | 0.99620 | 0.99535 | 0.99662 | 0.99620 | 0.99620
MC2 | 0.70968 | 0.83871 | 0.83871 | 0.83871 | 0.83871 | 0.80645 | 0.83871 | 0.83871 | 0.90323
MW1 | 0.88 | 0.90476 | 0.90476 | 0.90476 | 0.92064 | 0.90476 | 0.90476 | 0.90476 | 0.90476
PC1 | 0.91176 | 0.92941 | 0.92941 | 0.92941 | 0.93529 | 0.91765 | 0.92353 | 0.92941 | 0.92941
PC2 | 0.98895 | 0.99448 | 0.99448 | 0.99448 | 0.99448 | 0.99448 | 0.99448 | 0.99448 | 0.99448
PC3 | 0.86364 | 0.88258 | 0.88258 | 0.875 | 0.88258 | 0.87879 | 0.87879 | 0.875 | 0.875
PC4 | 0.90252 | 0.94025 | 0.93306 | 0.93082 | 0.93082 | 0.93711 | 0.93711 | 0.92453 | 0.93082
PC5 | 0.77830 | 0.82075 | 0.80896 | 0.81132 | 0.81604 | 0.81368 | 0.81604 | 0.81132 | 0.81604
Table 10. Runtime (seconds) of XGB.
Dataset | GFLFGOA | GFLFGOA-SSA | LFGOA-SSA | GFGOA-SSA | LFGOA | GFGOA | GOA | SSA
CM1 | 8.63894 | 2.62227 | 5.97816 | 8.16711 | 6.80014 | 4.67062 | 17.40778 | 4.14228
JM1 | 49.11337 | 11.16519 | 23.01613 | 6.44055 | 20.11788 | 18.97812 | 100.45693 | 32.35944
KC1 | 10.98801 | 13.28012 | 4.71760 | 2.79241 | 9.85696 | 7.21173 | 30.87882 | 6.74143
KC3 | 5.16992 | 4.89649 | 4.92847 | 3.61582 | 7.55046 | 2.61733 | 11.18174 | 7.2783
KC4 | 3.37044 | 2.72615 | 1.03054 | 8.00315 | 3.01872 | 7.93459 | 7.89999 | 10.81161
MC1 | 20.06377 | 16.28936 | 17.45409 | 3.18721 | 20.94327 | 10.61918 | 36.69793 | 7.54776
MC2 | 6.51045 | 7.53665 | 6.08344 | 3.0346 | 4.42068 | 3.79698 | 19.6863 | 5.38492
MW1 | 11.61992 | 8.84075 | 8.75449 | 8.91545 | 6.90571 | 7.68827 | 17.27596 | 14.46098
PC1 | 16.15804 | 5.14336 | 2.1437 | 7.26307 | 5.33309 | 11.99368 | 7.72782 | 13.36885
PC2 | 8.65440 | 7.13130 | 4.98462 | 8.60696 | 9.58645 | 5.79961 | 6.66192 | 12.31117
PC3 | 15.00149 | 16.57482 | 8.22984 | 3.48169 | 11.09310 | 4.54789 | 25.59780 | 11.45645
PC4 | 11.35519 | 12.56271 | 9.46519 | 11.60129 | 18.41754 | 14.41984 | 31.31732 | 43.08894
PC5 | 14.16040 | 3.07928 | 8.78662 | 7.02953 | 22.22073 | 8.27753 | 20.31468 | 21.10666
Table 11. Accuracy of ANN.
Dataset | Without Optimization | GFLFGOA | GFLFGOA-SSA | LFGOA-SSA | GFGOA-SSA | LFGOA | GFGOA | GOA | SSA
CM1 | 0.87805 | 0.93902 | 0.95122 | 0.93902 | 0.93902 | 0.91463 | 0.93902 | 0.91463 | 0.95122
JM1 | 0.79669 | 0.81471 | 0.81544 | 0.81397 | 0.81581 | 0.81544 | 0.81544 | 0.81360 | 0.81544
KC1 | 0.84820 | 0.87476 | 0.86907 | 0.87287 | 0.87287 | 0.87287 | 0.87097 | 0.87856 | 0.88046
KC3 | 0.77551 | 0.85714 | 0.85714 | 0.87755 | 0.85714 | 0.83674 | 0.85714 | 0.87755 | 0.85714
KC4 | 0.5625 | 0.71875 | 0.71875 | 0.8125 | 0.78125 | 0.71875 | 0.84375 | 0.71875 | 0.75
MC1 | 0.99408 | 0.99578 | 0.99535 | 0.99578 | 0.99535 | 0.99471 | 0.99578 | 0.99535 | 0.99578
MC2 | 0.74194 | 0.83871 | 0.83871 | 0.87097 | 0.87097 | 0.83871 | 0.90323 | 0.87097 | 0.87097
MW1 | 0.88889 | 0.93651 | 0.93651 | 0.93651 | 0.93651 | 0.93651 | 0.93651 | 0.93651 | 0.93651
PC1 | 0.89412 | 0.94118 | 0.94706 | 0.94706 | 0.93529 | 0.92941 | 0.94118 | 0.94706 | 0.93529
PC2 | 0.98895 | 1 | 1 | 1 | 1 | 0.99448 | 1 | 0.99448 | 0.99448
PC3 | 0.82576 | 0.86742 | 0.88258 | 0.87879 | 0.87121 | 0.875 | 0.87879 | 0.85606 | 0.875
PC4 | 0.90252 | 0.93082 | 0.92767 | 0.92453 | 0.93082 | 0.93396 | 0.93082 | 0.93711 | 0.94339
PC5 | 0.74057 | 0.81132 | 0.80425 | 0.80660 | 0.81368 | 0.80660 | 0.80660 | 0.80660 | 0.80896
Table 12. Runtime (seconds) of ANN.
Dataset | GFLFGOA | GFLFGOA-SSA | LFGOA-SSA | GFGOA-SSA | LFGOA | GFGOA | GOA | SSA
CM1 | 14.15148 | 19.03989 | 10.93558 | 14.25837 | 19.88457 | 20.53972 | 49.70118 | 59.26953
JM1 | 329.74139 | 461.88978 | 280.21241 | 258.71992 | 372.32476 | 428.38037 | 100.74072 | 677.38826
KC1 | 67.28775 | 40.93038 | 34.27968 | 43.71838 | 33.20376 | 27.49754 | 127.15726 | 370.54866
KC3 | 16.96921 | 10.03377 | 16.24724 | 22.79089 | 10.27701 | 12.48286 | 22.32237 | 25.73986
KC4 | 13.65393 | 17.94655 | 11.18101 | 10.96659 | 28.47858 | 8.85138 | 16.14852 | 30.20686
MC1 | 403.20674 | 185.79484 | 196.88806 | 375.00981 | 38.50669 | 213.44733 | 413.83031 | 505.23935
MC2 | 20.73147 | 9.0823 | 9.80959 | 16.01918 | 12.00442 | 12.04628 | 13.47418 | 26.23941
MW1 | 25.6989 | 10.81326 | 11.2502 | 16.40268 | 13.56527 | 9.44403 | 17.95148 | 27.54926
PC1 | 26.2941 | 24.40644 | 19.09213 | 19.34053 | 24.48024 | 33.99985 | 17.26851 | 58.35074
PC2 | 10.24549 | 54.52347 | 33.17081 | 26.83333 | 36.66479 | 101.2086 | 22.20873 | 41.70491
PC3 | 19.32979 | 33.10067 | 32.45259 | 40.49498 | 14.1666 | 31.19656 | 23.19152 | 71.15856
PC4 | 28.11799 | 30.82265 | 35.64099 | 32.15147 | 68.57497 | 45.70245 | 31.37875 | 100.55374
PC5 | 287.27928 | 72.0665 | 62.88953 | 70.00658 | 27.62535 | 30.31593 | 46.86406 | 91.27525
Table 13. Bio-optimized algorithm analysis (XGB).
Dataset | Accuracy (Highest) | Runtime (Lowest)
CM1 | GFLFGOA-SSA, GFLFGOA, GFGOA-SSA, SSA | GFLFGOA-SSA
JM1 | GFGOA-SSA | GFGOA-SSA
KC1 | GFGOA, SSA | GFGOA-SSA
KC3 | GFGOA, GFLFGOA-SSA, SSA, GFGOA-SSA, GFLFGOA | GFGOA
KC4 | GFGOA-SSA, GFGOA, LFGOA-SSA, GFLFGOA-SSA | LFGOA-SSA
MC1 | GFLFGOA-SSA, GFGOA | GFGOA-SSA
MC2 | SSA | GFGOA-SSA
MW1 | GFGOA-SSA | LFGOA
PC1 | GFGOA-SSA | LFGOA-SSA
PC2 | GFLFGOA, GFLFGOA-SSA, LFGOA-SSA, GFGOA-SSA, LFGOA, GFGOA, GOA, SSA | LFGOA-SSA
PC3 | GFGOA-SSA, GFLFGOA, GFLFGOA-SSA | GFGOA-SSA
PC4 | GFLFGOA | LFGOA-SSA
PC5 | GFLFGOA | GFLFGOA-SSA
Table 14. Bio-optimized algorithm analysis (ANN).
Dataset | Accuracy (Highest) | Runtime (Lowest)
CM1 | GFLFGOA-SSA, SSA | GOA
JM1 | GFGOA-SSA | GOA
KC1 | SSA | GFGOA
KC3 | LFGOA-SSA, GOA | GFLFGOA-SSA
KC4 | GFGOA | GFGOA
MC1 | GFLFGOA, LFGOA-SSA, GFGOA, SSA | LFGOA
MC2 | GFGOA | GFLFGOA-SSA
MW1 | GFLFGOA, GFLFGOA-SSA, LFGOA-SSA, GFGOA-SSA, LFGOA, GFGOA, GOA, SSA | GFGOA
PC1 | GFLFGOA-SSA, LFGOA-SSA, GOA | GOA
PC2 | GFGOA-SSA, GFGOA, GFLFGOA, GFLFGOA-SSA, LFGOA-SSA | GFLFGOA
PC3 | GFLFGOA-SSA | LFGOA
PC4 | SSA | GFLFGOA
PC5 | GFGOA-SSA | LFGOA
Table 15. Comparison of state-of-the-art methods (accuracy).
Method | CM1 | JM1 | KC1 | KC3 | KC4 | MC1 | MC2 | MW1 | PC1 | PC2 | PC3 | PC4 | PC5
[67] | 0.8687 | 0.7912 | - | - | - | - | 0.6842 | 0.8933 | 0.9216 | - | 0.8797 | 0.8714 | -
[57] | 0.83 | 0.78 | - | - | - | - | 0.68 | - | 0.91 | - | 0.84 | 0.84 | -
[56] | 0.878 | 0.803 | 0.850 | - | - | - | - | - | 0.922 | - | - | - | -
[68] | - | 0.89 | 0.84 | - | - | 0.95 | - | - | 0.85 | 0.86 | 0.83 | 0.89 | 0.91
[48] | - | 0.81 | 0.79 | - | - | - | - | - | - | - | 0.89 | - | -
[59] | 0.7755 | 0.7396 | - | - | - | - | 0.6486 | 0.8266 | - | - | 0.8259 | 0.8608 | -
[69] | 0.8179 | - | - | - | - | - | - | - | 0.8979 | - | - | - | -
[70] | - | - | - | - | - | - | - | - | - | - | 0.8192 | - | -
[71] | - | - | - | - | - | - | 0.6832 | 0.6005 | - | - | 0.737 | 0.869 | -
[72] | - | - | - | - | - | - | - | - | - | - | - | 0.8661 | -
[53] (GA-SVM) | 0.9035 | - | 0.8514 | 0.7989 | 0.9035 | 0.9950 | 0.6710 | 0.9183 | 0.9367 | 0.9959 | 0.9014 | 0.8821 | -
[53] (PSO-SVM) | 0.9037 | - | 0.8514 | 0.9040 | 0.6885 | 0.9950 | 0.6706 | 0.9179 | 0.9367 | 0.9959 | 0.9021 | 0.8793 | -
[53] (GAPSO-SVM) | 0.9049 | - | 0.8438 | 0.8995 | 0.6782 | 0.9951 | 0.6783 | 0.9157 | 0.9386 | 0.9959 | 0.9040 | 0.8779 | -
Proposed (GFLFGOA-XGB) | 0.9390 | 0.8147 | 0.8767 | 0.8571 | 0.75 | 0.9962 | 0.8387 | 0.9048 | 0.9294 | 0.9945 | 0.8826 | 0.9403 | 0.8208
Proposed (GFLFGOA-SSA-XGB) | 0.9390 | 0.8151 | 0.8767 | 0.8571 | 0.7813 | 0.9966 | 0.8387 | 0.9048 | 0.9294 | 0.9945 | 0.8826 | 0.9331 | 0.8090
Proposed (LFGOA-SSA-XGB) | 0.9268 | 0.8143 | 0.8710 | 0.8571 | 0.7813 | 0.9962 | 0.8387 | 0.9048 | 0.9294 | 0.9945 | 0.875 | 0.9308 | 0.8113
Proposed (GFGOA-SSA-XGB) | 0.9390 | 0.8169 | 0.8767 | 0.8571 | 0.7813 | 0.9962 | 0.8387 | 0.9206 | 0.9353 | 0.9945 | 0.8826 | 0.9308 | 0.8160
Proposed (GFLFGOA-ANN) | 0.9390 | 0.8147 | 0.8748 | 0.8571 | 0.7188 | 0.9958 | 0.8387 | 0.9365 | 0.9412 | 1.0 | 0.8674 | 0.9308 | 0.8113
Proposed (GFLFGOA-SSA-ANN) | 0.9512 | 0.8154 | 0.8691 | 0.8571 | 0.7188 | 0.9954 | 0.8387 | 0.9365 | 0.9471 | 1.0 | 0.8826 | 0.9277 | 0.8043
Proposed (LFGOA-SSA-ANN) | 0.9390 | 0.8140 | 0.8729 | 0.8776 | 0.8125 | 0.9958 | 0.8710 | 0.9365 | 0.9471 | 1.0 | 0.8788 | 0.9245 | 0.8066
Proposed (GFGOA-SSA-ANN) | 0.9390 | 0.8158 | 0.8729 | 0.8571 | 0.7813 | 0.9954 | 0.8710 | 0.9365 | 0.9353 | 1.0 | 0.8712 | 0.9308 | 0.8137
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Citation: Das, M.; Mohan, B.R.; Guddeti, R.M.R.; Prasad, N. Hybrid Bio-Optimized Algorithms for Hyperparameter Tuning in Machine Learning Models: A Software Defect Prediction Case Study. Mathematics 2024, 12, 2521. https://doi.org/10.3390/math12162521