1. Introduction
As modern problems become more intricate and their outcomes more nuanced, the need for advanced optimization algorithms has surged. At its core, optimization is about finding the best solution to a problem, whether that is the minimum or the maximum value of a particular function [1]. The solution, represented by a variable x, lies within a predefined search space and is refined over several iterations to approach the best possible outcome, known as the optimal solution. Each round of optimization tweaks the solution based on a set of heuristics or rules, aiming to improve on the previous best. Challenges arise, however, in ensuring these algorithms do not become trapped in local optima (solutions that are best in a nearby region but not in the entire search space) or stray outside the solution range. The design of these algorithms therefore requires a delicate balance, updating solutions carefully to navigate toward the global optimum, the absolute best solution across the entire landscape. This iterative, heuristic-based approach is at the core of metaheuristic optimization, a method that is increasingly vital for navigating complex, multi-dimensional problem spaces across various disciplines [2].
Metaheuristic algorithms stand out for their simplicity and adaptability, especially when contrasted with traditional optimization methods. Traditional approaches such as linear, integer, and mixed-integer programming are well suited to structured problems with clear definitions and constraints [
3]. They offer precise solutions and are analytically approachable, revealing insights into their computational complexity and convergence behaviors. However, these methods often falter in complex scenarios characterized by multiple potential solutions or extremes. Here, metaheuristic algorithms shine by offering a versatile balance between exploration and exploitation [
4]. They adeptly navigate through multiple solutions, efficiently avoiding local optima—suboptimal points that are better than adjacent possibilities but inferior to the best overall solution. This flexibility enhances their ability to locate the global optimum or the absolute best solution across the entire problem space. The strategic design of metaheuristics allows them to adapt and perform where traditional methods might struggle, making them particularly effective for a wide array of complex optimization problems [
5].
The effectiveness and versatility of metaheuristic algorithms are evidenced by their widespread adoption across various sectors. Their ability to reliably find global optima makes them invaluable, particularly as they are not overly reliant on initial conditions and demonstrate robust performance across different solution domains. Such features make them highly adaptable and robust, leading to significant applications in complex, real-world problems. For instance, they have been successfully employed in optimizing travel routes, enhancing image segmentation, streamlining ship routing and scheduling, selecting features in datasets, and more [
6].
Notably, the development of metaheuristic algorithms is inspired by a range of natural phenomena, leading to a rich diversity of approaches categorized broadly into swarm intelligence, physics-based, evolutionary, and human-inspired algorithms. Each category draws from different aspects of natural and human-made systems, leveraging their unique principles to navigate the search space efficiently. The continuous evolution and specialization of these algorithms have made them a go-to solution for problems that traditional optimization methods struggle to solve, highlighting their growing importance in both academic research and practical applications [
7].
The No Free Lunch (NFL) theorem for optimization, proposed by Wolpert and Macready, is a fundamental principle that underscores the limitations of metaheuristic algorithms. It states that no single optimization algorithm is universally superior for all problems. Essentially, the theorem posits that when averaged over all possible problems, all algorithms perform equally well. This means that an algorithm showing excellent performance on certain types of problems may not necessarily perform as well on others [
8].
The NFL theorem emphasizes the importance of understanding the specific nature and requirements of the problem at hand when selecting or designing an optimization algorithm. It suggests that the effectiveness of an algorithm is highly dependent on how well suited its mechanisms are to the particularities of the problem’s landscape. As a result, the field of optimization involves a constant search for new algorithms or the adaptation of existing ones to better fit the unique challenges of different problems [
9].
This understanding has significant implications for the design and application of optimization methods. It encourages diversity in algorithm development and promotes a more nuanced approach to algorithm selection and problem solving. Instead of seeking a “one-size-fits-all” solution, researchers and practitioners are more inclined to consider a variety of strategies, tailor approaches to specific contexts, and remain open to innovation and adaptation in algorithm design [
10].
1.1. Contributions
The key contributions of this paper are as follows:
Proposes a novel Hybrid APO-JADE optimization algorithm that integrates the strengths of the Artificial Protozoa Optimizer (APO) and JADE (Adaptive Differential Evolution) to effectively balance exploration and exploitation in optimization tasks.
Demonstrates the effectiveness of the Hybrid APO-JADE algorithm through comprehensive testing across multiple benchmark suites from the IEEE Congress on Evolutionary Computation (CEC) for the years 2017, 2021, and 2022, showcasing superior performance in terms of convergence rates and solution accuracy.
Applies the Hybrid APO-JADE algorithm to a real-world problem in cybersecurity, specifically for attack detection using the DS2OS, UNSW-NB15, and ToN_IoT datasets, highlighting the algorithm’s practical applicability and robustness in diverse scenarios.
Introduces an innovative approach to hyperparameter tuning using the APO-JADE algorithm, enhancing the performance of deep learning models in detecting cybersecurity threats across different datasets.
Provides detailed experimental results and analyses, including accuracy, precision, recall, and F1 score metrics, to validate the proposed algorithm’s effectiveness in both binary and multiclass classification tasks.
1.2. Paper Structure
The rest of this paper is organized as follows.
Section 2 contains the Literature Review, providing an overview of the foundational components, including the Artificial Protozoa Optimizer (APO), Differential Evolution (DE), and JADE: Adaptive Differential Evolution with an Optional External Archive.
Section 3 contains the Hybrid APO-JADE Optimization Algorithm, covering aspects such as initialization, behavioral strategies (foraging, dormancy, reproduction, autotroph, and heterotroph), adaptive mechanisms, and the integration of JADE with APO. Next,
Section 4 evaluates the algorithm’s performance across multiple IEEE Congress on Evolutionary Computation (CEC) benchmark suites from 2017, 2021, and 2022, using various metrics and diagrams to illustrate the results. Section 5, the Case Study: Application of APO-JADE for Attack Detection, demonstrates the algorithm’s application in cybersecurity, focusing on the DS2OS, UNSW-NB15, and ToN_IoT datasets, with detailed discussions of data preparation, pre-processing, normalization, and hyperparameter tuning.
Section 5.8 discusses the metrics used to evaluate the model, including accuracy, precision, recall, and F1 score. This is followed by
Section 5.9, which outlines the experimental setup, methodology, and results, with particular focus on the DS2OS dataset. The paper ends with the Conclusion, which summarizes the findings and highlights the contributions of the Hybrid APO-JADE algorithm.
4. Testing and Comparison
To conduct a rigorous statistical evaluation of the performance of the various optimization algorithms, we utilized two fundamental statistical measures: the mean and the standard deviation. The mean, which represents the measure of central tendency, provides an average outcome achieved by the algorithms across multiple trials, giving a general view of their typical performance levels. The standard deviation, on the other hand, measures the variability or dispersion from the mean, offering insights into the consistency and reliability of the results obtained across different runs. This metric is essential for assessing the stability and predictability of the algorithms’ performance.
The effectiveness of different optimization algorithms was assessed and compared. The algorithms evaluated include the Fox Optimization Algorithm (FOX) [13], the Arithmetic Optimization Algorithm (AOA) [14], the Artificial Hummingbird Algorithm (AHA) [15], and the Beluga Whale Optimization (BWO) [16]. Additionally, the Grey Wolf Optimizer (GWO) [17], the Optical Microscope Algorithm (OMA) [18], and the Sand Cat Swarm Optimization (SCSO) [19] were included. The Moth-Flame Optimization (MFO) [20], the Multi-Trial vector-based Differential Evolution algorithm (MTDE) [21], and the Multi-Verse Optimizer (MVO) [22] were also evaluated. Furthermore, the Chimp Optimization Algorithm (ChOA) [23], the Sine Cosine Algorithm (SCA) [24], the Whale Optimization Algorithm (WOA) [25], and the White Shark Optimizer (WSO) [26] were part of the comparison.
4.1. IEEE Congress on Evolutionary Computation (CEC) Benchmark Suites
The benchmark functions from the IEEE Congress on Evolutionary Computation (CEC) for the years 2017, 2021, and 2022 are standardized test problems designed to rigorously evaluate the performance of optimization algorithms. These suites, as discussed in [27,28,29], categorize functions into Unimodal, Multimodal, Hybrid, and Composition types, each presenting distinct challenges. Unimodal functions feature a single global optimum, which is ideal for assessing the convergence behavior of algorithms. Multimodal functions, with multiple local optima, test an algorithm’s ability to escape local minima and find the global optimum. Hybrid functions combine several basic functions, creating complex landscapes that simulate real-world scenarios, while Composition functions further increase complexity by integrating multiple hybrid functions, each with unique characteristics. These benchmark suites serve as critical tools in competitions and research, enabling comparative analysis and validation of various optimization techniques.
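To make these categories concrete, the short sketch below (our own illustration, not part of the CEC specifications) defines a classic unimodal function, Sphere, and a classic multimodal function, Rastrigin; the actual CEC functions build on such basic forms through shifts, rotations, hybridization, and composition.

```python
import numpy as np

def sphere(x: np.ndarray) -> float:
    """Unimodal: a single global optimum at the origin, useful for
    assessing pure convergence speed and precision."""
    return float(np.sum(x ** 2))

def rastrigin(x: np.ndarray) -> float:
    """Multimodal: a regular grid of local optima around the global
    optimum at the origin, useful for testing escape from local minima."""
    return float(10 * x.size + np.sum(x ** 2 - 10 * np.cos(2 * np.pi * x)))

if __name__ == "__main__":
    point = np.array([0.5, -0.3, 1.2])
    print(sphere(point), rastrigin(point))
```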
4.2. Comprehensive Testing across Different IEEE Benchmarks over Different Years
We have tested APO-JADE on multiple CEC benchmark suites, specifically CEC 2022 as shown in
Figure 1, CEC 2021, and CEC 2017, rather than relying on a single suite, to ensure a comprehensive and robust evaluation of the optimization algorithms. Each of these benchmark suites presents unique sets of functions, with different characteristics and challenges, designed to test various aspects of algorithm performance.
The CEC2022 benchmark functions [
27] are categorized into Unimodal, Multimodal, Hybrid, and Composition functions. These functions have been specifically designed to reflect more recent advancements and emerging challenges in optimization, providing a modern set of problems that can highlight the strengths and weaknesses of contemporary algorithms.
In contrast, the CEC2021 benchmark suite [
28] offers a diverse set of functions that test the convergence, robustness, and versatility of optimization algorithms. Testing on this suite allows for a comparative analysis against the previous year’s functions, providing insights into the consistency and adaptability of the algorithms when confronted with new but similar challenges.
The CEC2017 benchmark functions [
29], though older, have been extensively used in the optimization community and offer a well-established standard for algorithm comparison. These functions are essential for ensuring that the tested algorithms perform well against challenging problems, thus providing a baseline for evaluating improvements over time.
4.3. Diverse Problem Landscapes
Each benchmark suite includes a variety of problem landscapes:
Unimodal Functions: Ideal for testing the convergence speed and precision of algorithms.
Multimodal Functions: Challenge the algorithm’s ability to avoid local optima and locate the global optimum.
Hybrid Functions: Combine multiple basic functions to create complex landscapes, simulating real-world scenarios.
Composition Functions: Integrate multiple hybrid functions to create highly complex optimization challenges, testing an algorithm’s robustness and versatility.
By testing across the CEC 2022, CEC 2021, and CEC 2017 suites, the evaluation covers a broader spectrum of optimization problems. This approach ensures that the algorithms are not only effective on a specific set of problems but also versatile and robust across different types of challenges, both historical and contemporary. It provides a more complete and rigorous assessment of the algorithms’ performance, offering valuable insights into their generalizability and potential for real-world application.
Furthermore, using multiple benchmark suites enables the identification of trends in algorithm performance over time, helping to understand whether improvements in optimization techniques are consistent with advances in benchmark design.
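As a rough illustration of the experimental protocol behind the tables that follow, the sketch below runs an optimizer independently several times on every function of a suite and collects the best fitness score of each run. The optimizer(objective, dim, bounds, iterations) interface and the random_search placeholder are assumptions made only for this sketch; APO-JADE and the compared algorithms each have their own implementations.

```python
import numpy as np

def evaluate_optimizer(optimizer, functions, runs=10, dim=10,
                       bounds=(-100.0, 100.0), iterations=500):
    """Run `optimizer` independently `runs` times on every benchmark
    function and collect the best fitness score of each run.

    `optimizer` is assumed to be a callable
        optimizer(objective, dim, bounds, iterations) -> best_fitness,
    a hypothetical interface used only for this sketch."""
    results = {}
    for name, objective in functions.items():
        best_scores = [optimizer(objective, dim, bounds, iterations)
                       for _ in range(runs)]
        results[name] = np.asarray(best_scores, dtype=float)
    return results

def random_search(objective, dim, bounds, iterations):
    """Placeholder optimizer so the sketch runs end to end."""
    low, high = bounds
    best = np.inf
    for _ in range(iterations):
        best = min(best, objective(np.random.uniform(low, high, dim)))
    return best

if __name__ == "__main__":
    suite = {"sphere": lambda x: float(np.sum(x ** 2))}
    scores = evaluate_optimizer(random_search, suite, runs=5)
    print({name: values.mean() for name, values in scores.items()})
```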
4.4. Evaluated Metrics
For evaluation in Table 1, Table 2, Table 3, and Table 4, we used the mean, the standard deviation (std), and the standard error of the mean (SEM), which are statistical metrics calculated from the results obtained by running the various optimization algorithms on the benchmark functions.
Mean: The mean (or average) represents the average fitness score achieved by an algorithm over multiple runs (denoted by the variable RUN). This metric provides a central tendency of the results, indicating the overall performance of the algorithm across different runs. The mean is calculated as shown in Equation (18):

$\bar{x} = \frac{1}{n} \sum_{i=1}^{n} x_i$  (18)

where n is the total number of runs, and $x_i$ represents the fitness score from the ith run.
Std (Standard Deviation): The standard deviation is a measure of the variability or dispersion of the fitness scores from the mean. A low standard deviation indicates that the fitness scores are close to the mean, implying consistent performance across runs. Conversely, a high standard deviation suggests more variability in the results. The standard deviation is calculated as shown in Equation (19):

$\sigma = \sqrt{\frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2}$  (19)

where $\bar{x}$ is the mean fitness score, and n is the number of runs.
SEM (Standard Error of the Mean): The standard error of the mean is calculated by dividing the standard deviation by the square root of the number of runs (RUN). SEM provides an estimate of how much the mean fitness score is expected to vary if the experiment were repeated multiple times. A lower SEM indicates that the mean is a more reliable estimate of the true average performance. The SEM is calculated as shown in Equation (20):

$\mathrm{SEM} = \frac{\sigma}{\sqrt{n}}$  (20)

where $\sigma$ is the standard deviation and n is the number of runs.
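As a quick sanity check of Equations (18)–(20), the short sketch below computes the three metrics for the per-run best fitness scores of one algorithm on one benchmark function; the sample standard deviation (n − 1 denominator) is assumed here, and the numbers in the usage example are illustrative only.

```python
import numpy as np

def run_statistics(best_scores):
    """Mean, standard deviation, and SEM of the per-run best fitness
    scores, following Equations (18)-(20)."""
    scores = np.asarray(best_scores, dtype=float)
    n = scores.size
    mean = scores.sum() / n                                # Equation (18)
    std = np.sqrt(((scores - mean) ** 2).sum() / (n - 1))  # Equation (19), sample std assumed
    sem = std / np.sqrt(n)                                 # Equation (20)
    return mean, std, sem

if __name__ == "__main__":
    # illustrative per-run best scores, not reported results
    print(run_statistics([302.1, 300.4, 300.0, 301.7, 300.9]))
```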
4.5. Results on IEEE Congress on Evolutionary Computation (CEC) 2022 Benchmark Suites
Table 1 shows that the APO-JADE optimizer significantly outperforms other algorithms across multiple functions in the CEC2022 benchmark set. APO-JADE consistently achieves top ranks, notably in functions F1, F3, F4, F5, F7, F8, F9, F11, and F12, demonstrating superior performance. Not only does APO-JADE often report the lowest mean values, indicating its efficiency in finding solutions closer to the global optimum, but it also shows lower or comparable standard deviations and standard errors of the mean, which highlight its consistency and reliability in results across different runs. Such attributes underscore its robustness and versatility, making it capable of effectively handling the complex optimization landscapes characteristic of the diverse test functions in the CEC2022 set. This evidence strongly supports the conclusion that APO-JADE is a superior choice for tackling the intricate and varied optimization challenges presented.
Table 1. Results on IEEE Congress on Evolutionary Computation (CEC) 2022 benchmark suites.
Function | Statistics | APO-JADE | PSO | OMA | BWO | COOT | SHIO | ChOA | DBO | MTDE | MFO | SCA |
---|---|---|---|---|---|---|---|---|---|---|---|---|
F1 | Mean | 3.000E+02 | 1.060E+03 | 4.835E+03 | 2.155E+03 | 2.102E+03 | 2.164E+03 | 8.307E+03 | 3.703E+02 | 1.451E+04 | 6.409E+03 | 1.076E+03 |
| Std | 3.769E-03 | 1.281E+03 | 6.720E+02 | 1.613E+03 | 2.027E+03 | 2.472E+03 | 1.860E+03 | 3.287E+01 | 4.418E+03 | 8.441E+03 | 4.990E+02 |
| SEM | 1.192E-03 | 4.052E+02 | 2.125E+02 | 5.102E+02 | 6.409E+02 | 7.819E+02 | 5.882E+02 | 1.039E+01 | 1.397E+03 | 2.669E+03 | 1.578E+02 |
| Rank | 1 | 3 | 8 | 6 | 5 | 7 | 10 | 2 | 11 | 9 | 4 |
F2 | Mean | 4.132E+02 | 4.132E+02 | 4.943E+02 | 4.285E+02 | 4.397E+02 | 4.266E+02 | 8.643E+02 | 4.096E+02 | 5.522E+02 | 4.169E+02 | 4.657E+02 |
| Std | 2.211E+01 | 1.357E+01 | 1.665E+01 | 2.738E+01 | 4.983E+01 | 2.380E+01 | 2.385E+02 | 2.031E+01 | 3.711E+01 | 1.558E+01 | 2.238E+01 |
| SEM | 6.991E+00 | 4.290E+00 | 5.265E+00 | 8.657E+00 | 1.576E+01 | 7.528E+00 | 7.543E+01 | 6.421E+00 | 1.173E+01 | 4.926E+00 | 7.077E+00 |
| Rank | 2 | 3 | 9 | 6 | 7 | 5 | 11 | 1 | 10 | 4 | 8 |
F3 | Mean | 6.003E+02 | 6.006E+02 | 6.267E+02 | 6.077E+02 | 6.113E+02 | 6.063E+02 | 6.443E+02 | 6.150E+02 | 6.291E+02 | 6.016E+02 | 6.188E+02 |
| Std | 8.379E-01 | 1.034E+00 | 4.732E+00 | 6.862E+00 | 7.233E+00 | 4.910E+00 | 5.282E+00 | 9.467E+00 | 9.666E+00 | 2.119E+00 | 3.489E+00 |
| SEM | 2.650E-01 | 3.268E-01 | 1.496E+00 | 2.170E+00 | 2.287E+00 | 1.553E+00 | 1.670E+00 | 2.994E+00 | 3.057E+00 | 6.701E-01 | 1.103E+00 |
| Rank | 1 | 2 | 9 | 5 | 6 | 4 | 11 | 7 | 10 | 3 | 8 |
F4 | Mean | 8.171E+02 | 8.173E+02 | 8.327E+02 | 8.288E+02 | 8.208E+02 | 8.172E+02 | 8.489E+02 | 8.207E+02 | 8.671E+02 | 8.329E+02 | 8.375E+02 |
| Std | 7.897E+00 | 7.602E+00 | 3.034E+00 | 7.700E+00 | 4.822E+00 | 1.255E+01 | 7.216E+00 | 6.144E+00 | 1.146E+01 | 1.135E+01 | 8.011E+00 |
| SEM | 2.497E+00 | 2.404E+00 | 9.594E-01 | 2.435E+00 | 1.525E+00 | 3.969E+00 | 2.282E+00 | 1.943E+00 | 3.624E+00 | 3.590E+00 | 2.533E+00 |
| Rank | 1 | 3 | 7 | 6 | 5 | 2 | 10 | 4 | 11 | 8 | 9 |
F5 | Mean | 9.001E+02 | 9.159E+02 | 9.976E+02 | 9.475E+02 | 1.038E+03 | 9.186E+02 | 1.443E+03 | 1.045E+03 | 1.604E+03 | 1.083E+03 | 1.009E+03 |
| Std | 2.889E-01 | 2.390E+01 | 3.686E+01 | 3.656E+01 | 1.040E+02 | 2.613E+01 | 1.242E+02 | 1.471E+02 | 1.696E+02 | 1.886E+02 | 2.782E+01 |
| SEM | 9.137E-02 | 7.558E+00 | 1.166E+01 | 1.156E+01 | 3.290E+01 | 8.262E+00 | 3.928E+01 | 4.652E+01 | 5.363E+01 | 5.965E+01 | 8.798E+00 |
| Rank | 1 | 2 | 5 | 4 | 7 | 3 | 10 | 8 | 11 | 9 | 6 |
F6 | Mean | 4.121E+03 | 5.349E+03 | 2.448E+06 | 6.764E+03 | 4.267E+03 | 4.368E+03 | 5.471E+07 | 3.476E+03 | 1.326E+07 | 4.685E+03 | 2.380E+06 |
| Std | 2.261E+03 | 2.944E+03 | 2.337E+06 | 2.565E+03 | 1.462E+03 | 1.700E+03 | 2.421E+07 | 2.442E+03 | 7.369E+06 | 2.234E+03 | 2.027E+06 |
| SEM | 7.149E+02 | 9.308E+02 | 7.392E+05 | 8.113E+02 | 4.623E+02 | 5.377E+02 | 7.657E+06 | 7.722E+02 | 2.330E+06 | 7.064E+02 | 6.410E+05 |
| Rank | 2 | 6 | 9 | 7 | 3 | 4 | 11 | 1 | 10 | 5 | 8 |
F7 | Mean | 2.021E+03 | 2.024E+03 | 2.065E+03 | 2.045E+03 | 2.032E+03 | 2.051E+03 | 2.129E+03 | 2.031E+03 | 2.087E+03 | 2.024E+03 | 2.053E+03 |
| Std | 6.156E+00 | 9.864E+00 | 1.091E+01 | 1.665E+01 | 1.293E+01 | 1.801E+01 | 1.777E+01 | 7.335E+00 | 1.450E+01 | 4.318E+00 | 5.858E+00 |
| SEM | 1.947E+00 | 3.119E+00 | 3.449E+00 | 5.266E+00 | 4.088E+00 | 5.695E+00 | 5.619E+00 | 2.320E+00 | 4.584E+00 | 1.366E+00 | 1.852E+00 |
| Rank | 1 | 3 | 9 | 6 | 5 | 7 | 11 | 4 | 10 | 2 | 8 |
F8 | Mean | 2.216E+03 | 2.223E+03 | 2.233E+03 | 2.225E+03 | 2.223E+03 | 2.238E+03 | 2.240E+03 | 2.216E+03 | 2.242E+03 | 2.223E+03 | 2.231E+03 |
| Std | 3.963E+01 | 7.155E+00 | 2.937E+00 | 5.979E+00 | 1.840E+00 | 3.620E+01 | 5.594E+00 | 9.302E+00 | 4.381E+00 | 1.742E+00 | 4.247E+00 |
| SEM | 1.253E+01 | 2.263E+00 | 9.288E-01 | 1.891E+00 | 5.818E-01 | 1.145E+01 | 1.769E+00 | 2.942E+00 | 1.385E+00 | 5.509E-01 | 1.343E+00 |
| Rank | 1 | 4 | 8 | 6 | 3 | 9 | 10 | 2 | 11 | 5 | 7 |
F9 | Mean | 2.529E+03 | 2.572E+03 | 2.597E+03 | 2.606E+03 | 2.585E+03 | 2.595E+03 | 2.723E+03 | 2.534E+03 | 2.650E+03 | 2.538E+03 | 2.572E+03 |
| Std | 1.886E-02 | 4.164E+01 | 2.330E+01 | 4.539E+01 | 1.665E+01 | 4.142E+01 | 4.592E+01 | 1.289E+01 | 3.752E+01 | 2.578E+01 | 3.481E+01 |
| SEM | 5.963E-03 | 1.317E+01 | 7.367E+00 | 1.435E+01 | 5.265E+00 | 1.310E+01 | 1.452E+01 | 4.075E+00 | 1.187E+01 | 8.153E+00 | 1.101E+01 |
| Rank | 1 | 5 | 8 | 9 | 6 | 7 | 11 | 2 | 10 | 3 | 4 |
F10 | Mean | 2.524E+03 | 2.547E+03 | 2.521E+03 | 2.549E+03 | 2.541E+03 | 2.711E+03 | 2.632E+03 | 2.549E+03 | 2.536E+03 | 2.528E+03 | 2.502E+03 |
| Std | 6.236E+01 | 6.025E+01 | 4.434E+01 | 6.267E+01 | 6.247E+01 | 3.416E+02 | 2.185E+02 | 6.284E+01 | 3.223E+01 | 5.684E+01 | 5.606E-01 |
| SEM | 1.972E+01 | 1.905E+01 | 1.402E+01 | 1.982E+01 | 1.975E+01 | 1.080E+02 | 6.909E+01 | 1.987E+01 | 1.019E+01 | 1.798E+01 | 1.773E-01 |
| Rank | 3 | 7 | 2 | 8 | 6 | 11 | 10 | 9 | 5 | 4 | 1 |
F11 | Mean | 2.631E+03 | 2.851E+03 | 2.791E+03 | 2.858E+03 | 2.742E+03 | 2.791E+03 | 3.192E+03 | 2.689E+03 | 2.899E+03 | 2.738E+03 | 2.768E+03 |
| Std | 9.536E+01 | 1.954E+02 | 1.868E+01 | 2.143E+02 | 1.710E+02 | 1.785E+02 | 3.755E+02 | 6.309E+01 | 7.767E+01 | 7.368E+01 | 1.362E+01 |
| SEM | 3.015E+01 | 6.178E+01 | 5.906E+00 | 6.776E+01 | 5.409E+01 | 5.643E+01 | 1.187E+02 | 1.995E+01 | 2.456E+01 | 2.330E+01 | 4.308E+00 |
| Rank | 1 | 8 | 7 | 9 | 4 | 6 | 11 | 2 | 10 | 3 | 5 |
F12 | Mean | 2.863E+03 | 2.864E+03 | 2.897E+03 | 2.870E+03 | 2.890E+03 | 2.896E+03 | 2.948E+03 | 2.865E+03 | 2.896E+03 | 2.864E+03 | 2.869E+03 |
| Std | 2.509E+00 | 1.108E+00 | 4.688E+00 | 9.132E+00 | 1.650E+01 | 2.632E+01 | 6.632E+01 | 4.185E+00 | 1.380E+01 | 1.577E+00 | 1.972E+00 |
| SEM | 7.934E-01 | 3.503E-01 | 1.483E+00 | 2.888E+00 | 5.218E+00 | 8.322E+00 | 2.097E+01 | 1.323E+00 | 4.363E+00 | 4.987E-01 | 6.235E-01 |
| Rank | 1 | 3 | 10 | 6 | 7 | 8 | 11 | 4 | 9 | 2 | 5 |
4.6. Results on IEEE Congress on Evolutionary Computation (CEC) 2021 Benchmark Suites
On the CEC2021 benchmark suite, as shown in
Table 2, APO-JADE distinctly outperformed other algorithms, demonstrating its exceptional efficacy in navigating complex optimization landscapes. Notably, APO-JADE often achieved the lowest mean values and consistently ranked at or near the top across various functions, clearly outpacing algorithms such as PSO, WOA, and SCA, and even edging out newer algorithms such as MFO and BWO in certain scenarios. This superior performance is highlighted by its minimal standard deviations and errors, suggesting that APO-JADE not only finds optimal solutions but does so with remarkable consistency and reliability. The algorithm’s information-sharing mechanism, which allows individuals in the population to share knowledge and adapt their search strategies dynamically, likely contributes significantly to its success, enabling it to efficiently explore and exploit the solution space. Overall, APO-JADE’s robustness and adaptability make it a standout performer in the CEC2021 evaluations, showcasing its potential as a powerful tool for solving a wide array of complex optimization problems.
Table 2. Results on IEEE Congress on Evolutionary Computation (CEC) 2021 benchmark suites.
Fun | Statistics | APO-JADE | PSO | WOA | MPA | GWO | FOX | BWO | SHIO | MFO | SCA |
---|---|---|---|---|---|---|---|---|---|---|---|
C1 | Mean | 9.7E-191 | 5176.319 | 3.7E-156 | 5.83E-53 | 2.6E-122 | 420.6092 | 1.1E-226 | 3.4E-129 | 6000 | 2.85E-22 |
| Std | 0 | 2390.799 | 8.2E-156 | 5.65E-53 | 3.9E-122 | 383.8391 | 0 | 7.6E-129 | 5477.226 | 3.95E-22 |
| SEM | 0 | 1069.198 | 3.7E-156 | 2.53E-53 | 1.7E-122 | 171.6581 | 0 | 3.4E-129 | 2449.49 | 1.77E-22 |
| Rank | 2 | 9 | 3 | 6 | 5 | 8 | 1 | 4 | 10 | 7 |
C2 | Mean | 0 | 703.8813 | 154.4034 | 0 | 0.359157 | 386.2328 | 0 | 107.8942 | 275.9674 | 1.14E-10 |
| Std | 0 | 428.0812 | 345.2566 | 0 | 0.803098 | 291.1929 | 0 | 154.8513 | 95.9286 | 2.55E-10 |
| SEM | 0 | 191.4437 | 154.4034 | 0 | 0.359157 | 130.2254 | 0 | 69.2516 | 42.90057 | 1.14E-10 |
| Rank | 1 | 10 | 7 | 1 | 5 | 9 | 1 | 6 | 8 | 4 |
C3 | Mean | 0 | 32.83623 | 0 | 0 | 20.07215 | 40.6936 | 0 | 34.44801 | 24.64498 | 18.50074 |
| Std | 0 | 7.971007 | 0 | 0 | 4.041335 | 10.38824 | 0 | 20.06978 | 11.13077 | 41.3689 |
| SEM | 0 | 3.564743 | 0 | 0 | 1.80734 | 4.645762 | 0 | 8.975479 | 4.977834 | 18.50074 |
| Rank | 1 | 8 | 1 | 1 | 6 | 10 | 1 | 9 | 7 | 5 |
C4 | Mean | 0 | 1.269271 | 0 | 0 | 0.573358 | 0.883461 | 0 | 1.106634 | 1.200005 | 4.44E-17 |
| Std | 0 | 0.459856 | 0 | 0 | 0.372901 | 0.160022 | 0 | 0.568078 | 0.768028 | 9.93E-17 |
| SEM | 0 | 0.205654 | 0 | 0 | 0.166766 | 0.071564 | 0 | 0.254052 | 0.343473 | 4.44E-17 |
| Rank | 1 | 10 | 1 | 1 | 6 | 7 | 1 | 8 | 9 | 5 |
C5 | Mean | 1.49E-27 | 4281.429 | 5.3E-19 | 2.58E-23 | 0.11691 | 4282.471 | 3.1E-227 | 2.591568 | 309.3922 | 4.63E-21 |
| Std | 1.44E-27 | 3875.525 | 1.19E-18 | 3.81E-23 | 0.261419 | 3597.061 | 0 | 1.68483 | 188.7557 | 1.02E-20 |
| SEM | 6.42E-28 | 1733.188 | 5.3E-19 | 1.7E-23 | 0.11691 | 1608.654 | 0 | 0.753479 | 84.41412 | 4.58E-21 |
| Rank | 2 | 9 | 5 | 3 | 6 | 10 | 1 | 7 | 8 | 4 |
C6 | Mean | 2.19E-19 | 111.3648 | 0.07566 | 1.84E-05 | 0.973054 | 74.87711 | 0.010678 | 1.439195 | 50.33296 | 0.228576 |
| Std | 4.89E-19 | 102.1313 | 0.043766 | 1.3E-05 | 0.976493 | 75.78408 | 0.023089 | 2.010444 | 46.87492 | 0.354379 |
| SEM | 2.19E-19 | 45.67452 | 0.019573 | 5.82E-06 | 0.436701 | 33.89167 | 0.010326 | 0.899098 | 20.9631 | 0.158483 |
| Rank | 1 | 10 | 4 | 2 | 6 | 9 | 3 | 7 | 8 | 5 |
C7 | Mean | 2.28E-14 | 1103.701 | 0.018197 | 0.000207 | 0.012254 | 2136.873 | 0.000713 | 0.675402 | 106.2665 | 0.002972 |
| Std | 5.11E-14 | 992.1609 | 0.024013 | 0.000261 | 0.010472 | 827.7399 | 0.000909 | 0.86993 | 198.8262 | 0.003538 |
| SEM | 2.28E-14 | 443.7078 | 0.010739 | 0.000117 | 0.004683 | 370.1765 | 0.000407 | 0.389045 | 88.91778 | 0.001582 |
| Rank | 1 | 9 | 6 | 2 | 5 | 10 | 3 | 7 | 8 | 4 |
C8 | Mean | 0 | 230.83 | 0 | 0 | 0 | 351.3044 | 0 | 8.365755 | 388.4221 | 0 |
| Std | 0 | 218.0023 | 0 | 0 | 0 | 344.3144 | 0 | 18.7064 | 371.8302 | 0 |
| SEM | 0 | 97.49361 | 0 | 0 | 0 | 153.9821 | 0 | 8.365755 | 166.2875 | 0 |
| Rank | 1 | 8 | 1 | 1 | 1 | 9 | 1 | 7 | 10 | 1 |
C9 | Mean | 5.33E-15 | 1.100118 | 7.11E-15 | 9.88E-51 | 8.88E-15 | 0.932142 | 3.55E-15 | 1.42E-14 | 0.932023 | 1.28E-13 |
| Std | 4.86E-15 | 2.028026 | 7.43E-15 | 2.13E-50 | 0 | 2.084 | 4.86E-15 | 4.86E-15 | 2.084067 | 2.51E-13 |
| SEM | 2.18E-15 | 0.906961 | 3.32E-15 | 9.54E-51 | 0 | 0.931993 | 2.18E-15 | 2.18E-15 | 0.932023 | 1.12E-13 |
| Rank | 3 | 10 | 4 | 1 | 5 | 9 | 2 | 6 | 8 | 7 |
C10 | Mean | 3.1E-169 | 48.89167 | 0.052986 | 28.89375 | 49.6341 | 49.52913 | 10.37682 | 55.04261 | 48.63929 | 61.18547 |
| Std | 0 | 0.215888 | 0.027491 | 26.3761 | 0.550022 | 0.716765 | 23.20037 | 11.50547 | 0.359892 | 3.670574 |
| SEM | 0 | 0.096548 | 0.012294 | 11.79575 | 0.245978 | 0.320547 | 10.37552 | 5.145404 | 0.160949 | 1.64153 |
| Rank | 1 | 6 | 2 | 4 | 8 | 7 | 3 | 9 | 5 | 10 |
4.7. Results on IEEE Congress on Evolutionary Computation (CEC) 2017 Benchmark Suites
The results of APO-JADE on the CEC2017 benchmark functions F1–F15, as shown in
Table 3, demonstrate its superior performance and consistency across various optimization problems. APO-JADE frequently achieved the best results, securing the top rank in functions F1, F2, F3, F4, F5, F11, F12, F13, F14, and F15. Notably, APO-JADE had the lowest mean values in these functions, indicating its effectiveness. The standard deviation (Std) and standard error of the mean (SEM) values for APO-JADE were generally low, suggesting stable and reliable performance. In functions F6, F7, F9, and F10, APO-JADE still performed competitively, ranking second or third, while F8 was its weakest case with a rank of six. These results underscore the robustness and versatility of APO-JADE in handling a wide range of optimization tasks, making it a strong candidate for various applications. The comprehensive comparison with other algorithms, namely WOA, COA, GWO, SCSO, OMA, BWO, COOT, SHIO, ChOA, DBO, MTDE, and SCA, highlights the consistent superiority of APO-JADE across the benchmark functions.
Table 3. Results on IEEE Congress on Evolutionary Computation (CEC) 2017 benchmark suites (F1–F15).
Fun | Statistics | APO-JADE | WOA | COA | GWO | SCSO | OMA | BWO | COOT | SHIO | ChOA | DBO | MTDE | SCA |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
F1 | Mean | 2.02E+03 | 2.48E+06 | 4.36E+03 | 5.46E+06 | 3.43E+03 | 5.91E+08 | 1.79E+08 | 1.60E+08 | 3.95E+07 | 1.03E+10 | 5.45E+05 | 2.91E+11 | 9.13E+08 |
| Std | 2.06E+03 | 3.07E+06 | 4.09E+03 | 1.46E+07 | 3.40E+03 | 2.14E+08 | 2.16E+08 | 1.93E+08 | 1.13E+08 | 4.28E+09 | 1.07E+06 | 1.62E+10 | 3.55E+08 |
| SEM | 6.51E+02 | 9.72E+05 | 1.29E+03 | 4.61E+06 | 1.08E+03 | 6.76E+07 | 6.82E+07 | 6.11E+07 | 3.58E+07 | 1.35E+09 | 3.38E+05 | 5.13E+09 | 1.12E+08 |
| Rank | 1 | 5 | 3 | 6 | 2 | 10 | 9 | 8 | 7 | 12 | 4 | 13 | 11 |
F2 | Mean | 2.00E+02 | 4.92E+04 | 2.00E+02 | 1.22E+05 | 2.00E+02 | 5.24E+07 | 3.19E+06 | 1.40E+05 | 5.56E+07 | 2.80E+12 | 1.14E+04 | 7.54E+91 | 1.46E+07 |
| Std | 5.72E-04 | 3.20E+04 | 9.97E-03 | 2.36E+05 | 2.32E-01 | 6.50E+07 | 9.40E+06 | 2.60E+05 | 1.48E+08 | 5.27E+12 | 1.15E+04 | 1.55E+92 | 3.61E+07 |
| SEM | 1.81E-04 | 1.01E+04 | 3.15E-03 | 7.47E+04 | 7.33E-02 | 2.06E+07 | 2.97E+06 | 8.23E+04 | 4.69E+07 | 1.67E+12 | 3.64E+03 | 4.90E+91 | 1.14E+07 |
| Rank | 1 | 5 | 2 | 6 | 3 | 10 | 8 | 7 | 11 | 12 | 4 | 13 | 9 |
F3 | Mean | 3.00E+02 | 2.50E+03 | 3.02E+02 | 1.25E+03 | 3.00E+02 | 1.43E+03 | 2.20E+03 | 3.06E+03 | 4.52E+03 | 9.48E+03 | 3.77E+02 | 4.80E+05 | 1.51E+03 |
| Std | 1.12E-12 | 3.50E+03 | 3.63E+00 | 1.32E+03 | 1.57E-02 | 7.10E+02 | 2.44E+03 | 2.14E+03 | 3.90E+03 | 3.34E+03 | 4.91E+01 | 5.07E+04 | 8.40E+02 |
| SEM | 3.53E-13 | 1.11E+03 | 1.15E+00 | 4.17E+02 | 4.96E-03 | 2.24E+02 | 7.71E+02 | 6.77E+02 | 1.23E+03 | 1.06E+03 | 1.55E+01 | 1.60E+04 | 2.66E+02 |
| Rank | 1 | 9 | 3 | 5 | 2 | 6 | 8 | 10 | 11 | 12 | 4 | 13 | 7 |
F4 | Mean | 4.00E+02 | 4.22E+02 | 4.04E+02 | 4.13E+02 | 4.10E+02 | 4.64E+02 | 4.33E+02 | 4.28E+02 | 4.28E+02 | 1.00E+03 | 4.20E+02 | 1.59E+05 | 4.44E+02 |
| Std | 1.27E-01 | 3.25E+01 | 1.66E+00 | 1.79E+01 | 2.17E+01 | 1.34E+01 | 2.42E+01 | 2.59E+01 | 2.27E+01 | 3.46E+02 | 2.02E+01 | 1.00E+04 | 1.76E+01 |
| SEM | 4.03E-02 | 1.03E+01 | 5.26E-01 | 5.66E+00 | 6.86E+00 | 4.24E+00 | 7.66E+00 | 8.20E+00 | 7.17E+00 | 1.09E+02 | 6.39E+00 | 3.16E+03 | 5.58E+00 |
| Rank | 1 | 6 | 2 | 4 | 3 | 11 | 9 | 8 | 7 | 12 | 5 | 13 | 10 |
F5 | Mean | 5.12E+02 | 5.55E+02 | 5.16E+02 | 5.16E+02 | 5.71E+02 | 5.52E+02 | 5.30E+02 | 5.26E+02 | 5.24E+02 | 5.87E+02 | 5.35E+02 | 1.80E+03 | 5.48E+02 |
| Std | 9.66E+00 | 2.23E+01 | 7.76E+00 | 4.64E+00 | 1.94E+01 | 5.46E+00 | 1.36E+01 | 6.85E+00 | 1.39E+01 | 1.68E+01 | 1.57E+01 | 3.66E+01 | 5.56E+00 |
| SEM | 3.05E+00 | 7.06E+00 | 2.46E+00 | 1.47E+00 | 6.13E+00 | 1.73E+00 | 4.30E+00 | 2.17E+00 | 4.40E+00 | 5.32E+00 | 4.97E+00 | 1.16E+01 | 1.76E+00 |
| Rank | 1 | 10 | 3 | 2 | 11 | 9 | 6 | 5 | 4 | 12 | 7 | 13 | 8 |
F6 | Mean | 6.03E+02 | 6.30E+02 | 6.07E+02 | 6.00E+02 | 6.48E+02 | 6.25E+02 | 6.08E+02 | 6.10E+02 | 6.09E+02 | 6.45E+02 | 6.10E+02 | 7.43E+02 | 6.19E+02 |
| Std | 5.53E+00 | 1.18E+01 | 1.17E+01 | 4.63E-01 | 5.65E+00 | 4.76E+00 | 4.84E+00 | 5.88E+00 | 9.00E+00 | 1.01E+01 | 7.41E+00 | 4.28E+00 | 2.56E+00 |
| SEM | 1.75E+00 | 3.72E+00 | 3.70E+00 | 1.46E-01 | 1.79E+00 | 1.50E+00 | 1.53E+00 | 1.86E+00 | 2.85E+00 | 3.19E+00 | 2.34E+00 | 1.35E+00 | 8.09E-01 |
| Rank | 2 | 10 | 3 | 1 | 12 | 9 | 4 | 7 | 5 | 11 | 6 | 13 | 8 |
F7 | Mean | 7.36E+02 | 7.83E+02 | 7.64E+02 | 7.28E+02 | 7.97E+02 | 7.70E+02 | 7.45E+02 | 7.49E+02 | 7.44E+02 | 8.10E+02 | 7.67E+02 | 6.47E+03 | 7.73E+02 |
| Std | 1.49E+01 | 2.80E+01 | 2.35E+01 | 1.35E+01 | 1.86E+01 | 4.91E+00 | 1.36E+01 | 8.91E+00 | 9.65E+00 | 7.57E+00 | 2.44E+01 | 3.61E+02 | 7.97E+00 |
| SEM | 4.71E+00 | 8.86E+00 | 7.43E+00 | 4.25E+00 | 5.90E+00 | 1.55E+00 | 4.31E+00 | 2.82E+00 | 3.05E+00 | 2.39E+00 | 7.72E+00 | 1.14E+02 | 2.52E+00 |
| Rank | 2 | 10 | 6 | 1 | 11 | 8 | 4 | 5 | 3 | 12 | 7 | 13 | 9 |
F8 | Mean | 8.27E+02 | 8.35E+02 | 8.29E+02 | 8.15E+02 | 8.40E+02 | 8.37E+02 | 8.27E+02 | 8.23E+02 | 8.19E+02 | 8.50E+02 | 8.17E+02 | 2.40E+03 | 8.39E+02 |
| Std | 9.30E+00 | 9.18E+00 | 9.92E+00 | 4.83E+00 | 1.14E+01 | 4.34E+00 | 8.03E+00 | 5.11E+00 | 7.59E+00 | 1.08E+01 | 6.02E+00 | 3.46E+01 | 8.42E+00 |
| SEM | 2.94E+00 | 2.90E+00 | 3.14E+00 | 1.53E+00 | 3.61E+00 | 1.37E+00 | 2.54E+00 | 1.62E+00 | 2.40E+00 | 3.41E+00 | 1.90E+00 | 1.09E+01 | 2.66E+00 |
| Rank | 6 | 8 | 7 | 1 | 11 | 9 | 5 | 4 | 3 | 12 | 2 | 13 | 10 |
F9 | Mean | 9.86E+02 | 1.37E+03 | 9.97E+02 | 9.20E+02 | 1.63E+03 | 1.01E+03 | 1.00E+03 | 1.06E+03 | 9.80E+02 | 1.56E+03 | 1.07E+03 | 8.78E+04 | 1.03E+03 |
| Std | 1.96E+01 | 4.02E+02 | 2.74E+02 | 5.73E+01 | 2.68E+02 | 3.46E+01 | 1.19E+02 | 1.47E+02 | 8.30E+01 | 2.06E+02 | 1.43E+02 | 6.44E+03 | 1.28E+02 |
| SEM | 6.21E+00 | 1.27E+02 | 8.66E+01 | 1.81E+01 | 8.47E+01 | 1.09E+01 | 3.75E+01 | 4.64E+01 | 2.63E+01 | 6.50E+01 | 4.52E+01 | 2.04E+03 | 4.05E+01 |
| Rank | 3 | 10 | 4 | 1 | 12 | 6 | 5 | 8 | 2 | 11 | 9 | 13 | 7 |
F10 | Mean | 1.51E+03 | 2.09E+03 | 1.85E+03 | 1.42E+03 | 2.32E+03 | 2.49E+03 | 1.86E+03 | 1.69E+03 | 1.85E+03 | 2.61E+03 | 1.58E+03 | 1.69E+04 | 2.23E+03 |
| Std | 2.58E+02 | 3.23E+02 | 4.48E+02 | 2.41E+02 | 3.15E+02 | 1.36E+02 | 4.91E+02 | 2.28E+02 | 3.44E+02 | 2.14E+02 | 2.54E+02 | 5.39E+02 | 2.13E+02 |
| SEM | 8.15E+01 | 1.02E+02 | 1.42E+02 | 7.61E+01 | 9.96E+01 | 4.31E+01 | 1.55E+02 | 7.21E+01 | 1.09E+02 | 6.77E+01 | 8.02E+01 | 1.71E+02 | 6.74E+01 |
| Rank | 2 | 8 | 5 | 1 | 10 | 11 | 7 | 4 | 6 | 12 | 3 | 13 | 9 |
F11 | Mean | 1.12E+03 | 1.19E+03 | 1.14E+03 | 1.12E+03 | 1.19E+03 | 1.27E+03 | 1.58E+03 | 1.14E+03 | 1.60E+03 | 4.69E+03 | 1.15E+03 | 7.17E+04 | 1.20E+03 |
| Std | 1.01E+01 | 4.52E+01 | 5.28E+01 | 1.23E+01 | 5.07E+01 | 3.52E+01 | 1.39E+03 | 3.81E+01 | 1.38E+03 | 1.70E+03 | 3.87E+01 | 7.71E+03 | 2.64E+01 |
| SEM | 3.18E+00 | 1.43E+01 | 1.67E+01 | 3.87E+00 | 1.60E+01 | 1.11E+01 | 4.38E+02 | 1.21E+01 | 4.35E+02 | 5.37E+02 | 1.22E+01 | 2.44E+03 | 8.36E+00 |
| Rank | 1 | 7 | 4 | 2 | 6 | 9 | 10 | 3 | 11 | 12 | 5 | 13 | 8 |
F12 | Mean | 1.37E+04 | 2.34E+06 | 1.50E+04 | 6.70E+05 | 1.44E+04 | 1.09E+07 | 6.09E+05 | 6.15E+05 | 4.02E+05 | 2.78E+08 | 3.70E+04 | 1.75E+11 | 1.15E+07 |
| Std | 1.13E+04 | 3.41E+06 | 1.06E+04 | 8.10E+05 | 1.29E+04 | 5.66E+06 | 8.27E+05 | 8.31E+05 | 8.18E+05 | 2.73E+08 | 3.66E+04 | 1.42E+10 | 8.40E+06 |
| SEM | 3.58E+03 | 1.08E+06 | 3.36E+03 | 2.56E+05 | 4.08E+03 | 1.79E+06 | 2.62E+05 | 2.63E+05 | 2.59E+05 | 8.64E+07 | 1.16E+04 | 4.48E+09 | 2.66E+06 |
| Rank | 1 | 9 | 3 | 8 | 2 | 10 | 6 | 7 | 5 | 12 | 4 | 13 | 11 |
F13 | Mean | 1.47E+03 | 2.30E+04 | 5.26E+03 | 1.15E+04 | 1.97E+03 | 2.30E+04 | 8.53E+03 | 6.63E+03 | 1.74E+04 | 2.25E+07 | 1.98E+03 | 1.14E+11 | 2.60E+04 |
| Std | 2.38E+02 | 1.43E+04 | 7.19E+03 | 9.47E+03 | 3.00E+02 | 1.90E+04 | 5.34E+03 | 1.52E+03 | 1.16E+04 | 3.00E+07 | 3.96E+02 | 5.85E+09 | 1.13E+04 |
| SEM | 7.53E+01 | 4.53E+03 | 2.27E+03 | 3.00E+03 | 9.49E+01 | 5.99E+03 | 1.69E+03 | 4.80E+02 | 3.67E+03 | 9.49E+06 | 1.25E+02 | 1.85E+09 | 3.58E+03 |
| Rank | 1 | 10 | 4 | 7 | 2 | 9 | 6 | 5 | 8 | 12 | 3 | 13 | 11 |
F14 | Mean | 1.45E+03 | 1.90E+03 | 1.52E+03 | 2.69E+03 | 1.57E+03 | 2.22E+03 | 2.15E+03 | 3.20E+03 | 4.12E+03 | 5.78E+03 | 1.46E+03 | 6.04E+08 | 1.62E+03 |
| Std | 2.62E+01 | 1.11E+03 | 5.16E+01 | 1.96E+03 | 1.29E+02 | 7.34E+02 | 1.40E+03 | 1.48E+03 | 1.81E+03 | 4.64E+03 | 2.60E+01 | 1.11E+08 | 9.87E+01 |
| SEM | 8.28E+00 | 3.52E+02 | 1.63E+01 | 6.19E+02 | 4.08E+01 | 2.32E+02 | 4.43E+02 | 4.69E+02 | 5.72E+02 | 1.47E+03 | 8.21E+00 | 3.51E+07 | 3.12E+01 |
| Rank | 1 | 6 | 3 | 9 | 4 | 8 | 7 | 10 | 11 | 12 | 2 | 13 | 5 |
F15 | Mean | 1.55E+03 | 4.76E+03 | 1.86E+03 | 4.49E+03 | 2.44E+03 | 5.76E+03 | 3.80E+03 | 2.77E+03 | 3.27E+03 | 9.24E+03 | 1.68E+03 | 4.92E+10 | 2.72E+03 |
| Std | 3.64E+01 | 3.14E+03 | 2.19E+02 | 2.14E+03 | 1.64E+03 | 1.24E+03 | 1.72E+03 | 1.14E+03 | 1.46E+03 | 5.38E+03 | 9.84E+01 | 6.67E+09 | 1.20E+03 |
| SEM | 1.15E+01 | 9.92E+02 | 6.92E+01 | 6.77E+02 | 5.19E+02 | 3.92E+02 | 5.42E+02 | 3.60E+02 | 4.61E+02 | 1.70E+03 | 3.11E+01 | 2.11E+09 | 3.79E+02 |
| Rank | 1 | 10 | 3 | 9 | 4 | 11 | 8 | 6 | 7 | 12 | 2 | 13 | 5 |
Table 4 shows the results of APO-JADE over the CEC2017 benchmark functions F16–F30, indicating its competitive performance across a range of optimization problems. For F16, APO-JADE secured the second rank with a mean value of 1.70E+03, showcasing its effectiveness and consistency. In F17, APO-JADE ranked first with the lowest mean value of 1.74E+03, further demonstrating its superior performance. Similarly, for F18 and F19, APO-JADE achieved top ranks with mean values of 1.95E+03 and 1.93E+03, respectively, outperforming most other algorithms. The trend continued with F20, where APO-JADE ranked second with a mean of 2.02E+03. In F21, APO-JADE again led the rankings with a mean of 2.20E+03. For F22, it maintained a high rank, securing second place with a mean of 2.30E+03. In F23 and F24, APO-JADE continued its strong performance, ranking third in both cases with mean values of 2.62E+03 and 2.68E+03. APO-JADE also performed well in F25 and F26, ranking first and second with mean values of 2.92E+03 and 2.98E+03, respectively. In F27, APO-JADE achieved the top rank with a mean of 3.09E+03, and it ranked second in F28 with a mean of 3.26E+03. For F29, APO-JADE secured first place with a mean value of 3.16E+03. Lastly, in F30, APO-JADE ranked third with a mean of 3.24E+05.
Table 4. Results on IEEE Congress on Evolutionary Computation (CEC) 2017 benchmark suites (F16–F30).
Fun | Statistics | APO-JADE | WOA | COA | GWO | SCSO | OMA | BWO | COOT | SHIO | ChOA | DBO | MTDE | SCA |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
F16 | Mean | 1.70E+03 | 1.86E+03 | 1.62E+03 | 1.70E+03 | 1.97E+03 | 1.88E+03 | 1.79E+03 | 1.78E+03 | 1.82E+03 | 2.07E+03 | 1.72E+03 | 2.76E+04 | 1.74E+03 |
| Std | 7.24E+01 | 1.69E+02 | 4.17E+01 | 6.46E+01 | 1.47E+02 | 5.79E+01 | 1.61E+02 | 1.46E+02 | 1.20E+02 | 1.25E+02 | 1.04E+02 | 1.61E+03 | 8.65E+01 |
| SEM | 2.29E+01 | 5.33E+01 | 1.32E+01 | 2.04E+01 | 4.64E+01 | 1.83E+01 | 5.10E+01 | 4.62E+01 | 3.78E+01 | 3.95E+01 | 3.28E+01 | 5.08E+02 | 2.74E+01 |
| Rank | 2 | 9 | 1 | 3 | 11 | 10 | 7 | 6 | 8 | 12 | 4 | 13 | 5 |
F17 | Mean | 1.74E+03 | 1.82E+03 | 1.74E+03 | 1.75E+03 | 1.89E+03 | 1.78E+03 | 1.76E+03 | 1.75E+03 | 1.76E+03 | 1.84E+03 | 1.76E+03 | 2.40E+06 | 1.78E+03 |
| Std | 2.05E+01 | 4.35E+01 | 8.10E+00 | 1.99E+01 | 1.20E+02 | 7.01E+00 | 2.17E+01 | 1.53E+01 | 2.83E+01 | 3.57E+01 | 1.55E+01 | 1.09E+06 | 1.32E+01 |
| SEM | 6.50E+00 | 1.38E+01 | 2.56E+00 | 6.29E+00 | 3.80E+01 | 2.22E+00 | 6.85E+00 | 4.84E+00 | 8.95E+00 | 1.13E+01 | 4.91E+00 | 3.43E+05 | 4.17E+00 |
| Rank | 1 | 10 | 2 | 4 | 12 | 8 | 6 | 3 | 7 | 11 | 5 | 13 | 9 |
F18 | Mean | 1.95E+03 | 1.49E+04 | 1.74E+04 | 2.12E+04 | 4.14E+03 | 5.18E+05 | 3.52E+04 | 1.78E+04 | 2.40E+04 | 5.59E+07 | 2.00E+03 | 1.12E+09 | 9.75E+04 |
| Std | 1.19E+02 | 1.17E+04 | 1.19E+04 | 1.49E+04 | 2.45E+03 | 4.15E+05 | 1.19E+04 | 9.52E+03 | 1.04E+04 | 1.04E+08 | 1.40E+02 | 2.80E+08 | 6.29E+04 |
| SEM | 3.77E+01 | 3.71E+03 | 3.75E+03 | 4.71E+03 | 7.76E+02 | 1.31E+05 | 3.75E+03 | 3.01E+03 | 3.28E+03 | 3.28E+07 | 4.44E+01 | 8.84E+07 | 1.99E+04 |
| Rank | 1 | 4 | 5 | 7 | 3 | 11 | 9 | 6 | 8 | 12 | 2 | 13 | 10 |
F19 | Mean | 1.93E+03 | 2.87E+04 | 2.15E+03 | 8.47E+03 | 2.02E+03 | 9.31E+03 | 9.39E+03 | 2.74E+04 | 4.50E+03 | 4.57E+05 | 1.92E+03 | 8.83E+09 | 5.16E+03 |
| Std | 2.14E+01 | 5.46E+04 | 3.01E+02 | 6.26E+03 | 9.53E+01 | 5.07E+03 | 6.45E+03 | 7.24E+04 | 4.90E+03 | 6.59E+05 | 2.04E+01 | 8.36E+08 | 4.95E+03 |
| SEM | 6.78E+00 | 1.73E+04 | 9.52E+01 | 1.98E+03 | 3.01E+01 | 1.60E+03 | 2.04E+03 | 2.29E+04 | 1.55E+03 | 2.09E+05 | 6.46E+00 | 2.65E+08 | 1.57E+03 |
| Rank | 2 | 11 | 4 | 7 | 3 | 8 | 9 | 10 | 5 | 12 | 1 | 13 | 6 |
F20 | Mean | 2.02E+03 | 2.21E+03 | 2.02E+03 | 2.06E+03 | 2.23E+03 | 2.16E+03 | 2.07E+03 | 2.07E+03 | 2.16E+03 | 2.26E+03 | 2.06E+03 | 5.62E+03 | 2.10E+03 |
| Std | 3.30E+01 | 5.93E+01 | 1.02E+01 | 4.03E+01 | 9.23E+01 | 3.01E+01 | 4.28E+01 | 4.99E+01 | 9.58E+01 | 7.43E+01 | 2.59E+01 | 1.91E+02 | 2.38E+01 |
| SEM | 1.04E+01 | 1.88E+01 | 3.24E+00 | 1.28E+01 | 2.92E+01 | 9.52E+00 | 1.35E+01 | 1.58E+01 | 3.03E+01 | 2.35E+01 | 8.18E+00 | 6.04E+01 | 7.52E+00 |
| Rank | 2 | 10 | 1 | 3 | 11 | 8 | 6 | 5 | 9 | 12 | 4 | 13 | 7 |
F21 | Mean | 2.20E+03 | 2.29E+03 | 2.28E+03 | 2.31E+03 | 2.36E+03 | 2.27E+03 | 2.30E+03 | 2.33E+03 | 2.33E+03 | 2.30E+03 | 2.26E+03 | 3.59E+03 | 2.22E+03 |
| Std | 1.40E+00 | 7.86E+01 | 5.67E+01 | 6.31E+00 | 1.40E+01 | 3.31E+01 | 5.14E+01 | 8.61E+00 | 1.05E+01 | 5.01E+01 | 6.95E+01 | 3.95E+01 | 3.99E+01 |
| SEM | 4.44E-01 | 2.49E+01 | 1.79E+01 | 1.99E+00 | 4.42E+00 | 1.05E+01 | 1.63E+01 | 2.72E+00 | 3.34E+00 | 1.58E+01 | 2.20E+01 | 1.25E+01 | 1.26E+01 |
| Rank | 1 | 6 | 5 | 9 | 12 | 4 | 8 | 11 | 10 | 7 | 3 | 13 | 2 |
F22 | Mean | 2.30E+03 | 2.32E+03 | 2.30E+03 | 2.34E+03 | 2.42E+03 | 2.43E+03 | 2.37E+03 | 2.36E+03 | 2.31E+03 | 2.94E+03 | 2.30E+03 | 1.78E+04 | 2.37E+03 |
| Std | 3.54E+00 | 1.22E+01 | 5.80E-01 | 1.16E+02 | 3.65E+02 | 3.00E+01 | 5.66E+01 | 5.13E+01 | 2.70E+01 | 2.14E+02 | 3.26E+01 | 8.63E+02 | 3.20E+01 |
| SEM | 1.12E+00 | 3.84E+00 | 1.83E-01 | 3.67E+01 | 1.15E+02 | 9.49E+00 | 1.79E+01 | 1.62E+01 | 8.53E+00 | 6.77E+01 | 1.03E+01 | 2.73E+02 | 1.01E+01 |
| Rank | 2 | 5 | 1 | 6 | 10 | 11 | 9 | 7 | 4 | 12 | 3 | 13 | 8 |
F23 | Mean | 2.62E+03 | 2.65E+03 | 2.62E+03 | 2.61E+03 | 2.67E+03 | 2.68E+03 | 2.63E+03 | 2.65E+03 | 2.63E+03 | 2.69E+03 | 2.65E+03 | 7.24E+03 | 2.65E+03 |
| Std | 1.42E+01 | 2.01E+01 | 5.36E+00 | 7.56E+00 | 4.03E+01 | 1.02E+01 | 1.10E+01 | 1.64E+01 | 1.39E+01 | 8.98E+00 | 2.24E+01 | 2.01E+02 | 8.54E+00 |
| SEM | 4.48E+00 | 6.35E+00 | 1.70E+00 | 2.39E+00 | 1.27E+01 | 3.23E+00 | 3.46E+00 | 5.20E+00 | 4.41E+00 | 2.84E+00 | 7.08E+00 | 6.35E+01 | 2.70E+00 |
| Rank | 3 | 6 | 2 | 1 | 10 | 11 | 4 | 8 | 5 | 12 | 7 | 13 | 9 |
F24 | Mean | 2.68E+03 | 2.75E+03 | 2.73E+03 | 2.74E+03 | 2.82E+03 | 2.56E+03 | 2.76E+03 | 2.78E+03 | 2.76E+03 | 2.84E+03 | 2.59E+03 | 6.99E+03 | 2.76E+03 |
| Std | 1.23E+02 | 7.90E+01 | 7.93E+01 | 5.10E+00 | 3.84E+01 | 1.75E+01 | 1.09E+01 | 1.54E+01 | 1.79E+01 | 3.59E+01 | 1.10E+02 | 1.89E+02 | 6.80E+01 |
| SEM | 3.90E+01 | 2.50E+01 | 2.51E+01 | 1.61E+00 | 1.22E+01 | 5.55E+00 | 3.45E+00 | 4.86E+00 | 5.65E+00 | 1.13E+01 | 3.49E+01 | 5.97E+01 | 2.15E+01 |
| Rank | 3 | 6 | 4 | 5 | 11 | 1 | 8 | 10 | 7 | 12 | 2 | 13 | 9 |
F25 | Mean | 2.92E+03 | 2.94E+03 | 2.94E+03 | 2.93E+03 | 2.93E+03 | 2.97E+03 | 2.95E+03 | 2.93E+03 | 2.95E+03 | 3.29E+03 | 2.93E+03 | 4.89E+04 | 2.97E+03 |
| Std | 2.43E+01 | 4.15E+01 | 2.21E+01 | 2.21E+01 | 2.35E+01 | 2.33E+01 | 2.06E+01 | 6.63E+01 | 2.79E+01 | 1.47E+02 | 2.34E+01 | 3.37E+03 | 1.07E+01 |
| SEM | 7.68E+00 | 1.31E+01 | 7.00E+00 | 6.98E+00 | 7.43E+00 | 7.37E+00 | 6.51E+00 | 2.10E+01 | 8.84E+00 | 4.64E+01 | 7.40E+00 | 1.06E+03 | 3.39E+00 |
| Rank | 1 | 6 | 7 | 2 | 5 | 11 | 8 | 4 | 9 | 12 | 3 | 13 | 10 |
F26 | Mean | 2.98E+03 | 3.21E+03 | 2.98E+03 | 3.02E+03 | 3.61E+03 | 3.21E+03 | 3.18E+03 | 3.04E+03 | 3.28E+03 | 3.94E+03 | 2.95E+03 | 6.88E+04 | 3.09E+03 |
| Std | 1.23E+02 | 3.77E+02 | 1.63E+02 | 3.28E+02 | 4.34E+02 | 7.41E+01 | 3.15E+02 | 2.01E+02 | 3.15E+02 | 2.93E+02 | 1.85E+02 | 3.90E+03 | 2.98E+01 |
| SEM | 3.90E+01 | 1.19E+02 | 5.17E+01 | 1.04E+02 | 1.37E+02 | 2.34E+01 | 9.97E+01 | 6.37E+01 | 9.96E+01 | 9.26E+01 | 5.85E+01 | 1.23E+03 | 9.44E+00 |
| Rank | 2 | 9 | 3 | 4 | 11 | 8 | 7 | 5 | 10 | 12 | 1 | 13 | 6 |
F27 | Mean | 3.09E+03 | 3.11E+03 | 3.11E+03 | 3.09E+03 | 3.18E+03 | 3.13E+03 | 3.10E+03 | 3.13E+03 | 3.11E+03 | 3.18E+03 | 3.10E+03 | 2.18E+04 | 3.10E+03 |
| Std | 2.96E+00 | 2.46E+01 | 3.22E+01 | 1.78E+00 | 6.57E+01 | 1.09E+01 | 1.57E+01 | 2.38E+01 | 2.90E+01 | 8.81E+01 | 7.50E+00 | 2.41E+03 | 2.62E+00 |
| SEM | 9.35E-01 | 7.79E+00 | 1.02E+01 | 5.64E-01 | 2.08E+01 | 3.43E+00 | 4.98E+00 | 7.53E+00 | 9.18E+00 | 2.79E+01 | 2.37E+00 | 7.63E+02 | 8.29E-01 |
| Rank | 1 | 7 | 6 | 2 | 12 | 10 | 5 | 9 | 8 | 11 | 3 | 13 | 4 |
F28 | Mean | 3.26E+03 | 3.31E+03 | 3.33E+03 | 3.31E+03 | 3.39E+03 | 3.33E+03 | 3.43E+03 | 3.36E+03 | 3.37E+03 | 3.76E+03 | 3.23E+03 | 2.88E+04 | 3.31E+03 |
| Std | 1.41E+02 | 1.16E+02 | 1.30E+02 | 1.08E+02 | 1.42E+02 | 9.39E+01 | 7.02E+01 | 2.25E+02 | 1.42E+02 | 1.20E+02 | 1.23E+02 | 1.26E+03 | 8.52E+01 |
| SEM | 4.46E+01 | 3.65E+01 | 4.10E+01 | 3.43E+01 | 4.49E+01 | 2.97E+01 | 2.22E+01 | 7.12E+01 | 4.50E+01 | 3.78E+01 | 3.88E+01 | 3.99E+02 | 2.69E+01 |
| Rank | 2 | 3 | 7 | 4 | 10 | 6 | 11 | 8 | 9 | 12 | 1 | 13 | 5 |
F29 | Mean | 3.16E+03 | 3.33E+03 | 3.22E+03 | 3.23E+03 | 3.43E+03 | 3.26E+03 | 3.22E+03 | 3.23E+03 | 3.24E+03 | 3.37E+03 | 3.20E+03 | 2.13E+06 | 3.21E+03 |
| Std | 2.21E+01 | 9.40E+01 | 6.20E+01 | 6.06E+01 | 1.76E+02 | 3.11E+01 | 6.51E+01 | 2.67E+01 | 7.22E+01 | 1.19E+02 | 2.82E+01 | 1.04E+06 | 1.29E+01 |
| SEM | 6.99E+00 | 2.97E+01 | 1.96E+01 | 1.92E+01 | 5.58E+01 | 9.84E+00 | 2.06E+01 | 8.44E+00 | 2.28E+01 | 3.76E+01 | 8.93E+00 | 3.30E+05 | 4.08E+00 |
| Rank | 1 | 10 | 4 | 6 | 12 | 9 | 5 | 7 | 8 | 11 | 2 | 13 | 3 |
F30 | Mean | 3.24E+05 | 7.45E+05 | 6.17E+05 | 4.00E+05 | 2.28E+06 | 2.33E+05 | 6.52E+05 | 8.82E+05 | 5.75E+05 | 3.86E+06 | 9.04E+03 | 1.56E+10 | 9.62E+05 |
| Std | 3.63E+05 | 5.90E+05 | 6.81E+05 | 5.81E+05 | 3.09E+06 | 2.36E+05 | 7.69E+05 | 8.13E+05 | 1.01E+06 | 3.98E+06 | 6.75E+03 | 1.79E+09 | 5.03E+05 |
| SEM | 1.15E+05 | 1.87E+05 | 2.15E+05 | 1.84E+05 | 9.78E+05 | 7.46E+04 | 2.43E+05 | 2.57E+05 | 3.18E+05 | 1.26E+06 | 2.14E+03 | 5.66E+08 | 1.59E+05 |
| Rank | 3 | 8 | 6 | 4 | 11 | 2 | 7 | 9 | 5 | 12 | 1 | 13 | 10 |
4.8. APO-JADE Convergence Diagram
The convergence curves shown in
Figure 2 and
Figure 3 for the APO-JADE optimizer on the CEC2022 benchmark functions (F1 to F10) exhibit the optimizer’s performance over 500 iterations. For F1, the curve shows a rapid initial decrease in the best value obtained, dropping from over
to below
within the first 50 iterations, which is followed by a flattening curve. In F2, the optimizer demonstrates a steady decrease from around 2000 to 600, indicating consistent improvement. The F3 curve drops sharply from around 740 to 670 within the first 50 iterations and then decreases more slowly. For F4, the best value drops from 920 to 830 within 100 iterations with minor improvements thereafter. In F5, the optimizer reduces the best value from 6500 to 1500 within 100 iterations with the curve flattening out afterwards. The F6 curve starts at
and drops dramatically to
within 300 iterations, showing strong initial performance. For F7, the best value decreases from 2350 to 2100 over 500 iterations with a noticeable flattening after 200 iterations. In F8, the curve drops from 8000 to 3000 within the first 50 iterations; then, it flattens out. The F9 curve shows a decrease from 3800 to 2800 within 100 iterations with subsequent flattening. Finally, the F10 curve starts at 4500 and drops sharply to 3000 within the first 50 iterations, which is followed by diminishing returns. Overall, the APO-JADE optimizer shows a consistent pattern of rapid initial improvements followed by slower gains across all functions, which is typical of many optimization algorithms where easy-to-find improvements are made quickly, but further refinements require more time and effort.
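The sketch below illustrates how such convergence curves can be recorded: the best-so-far fitness, and the population-average fitness discussed in Section 4.10, are logged at every iteration and plotted on a logarithmic scale. The population-update step is a simple contraction-with-noise placeholder, not the APO-JADE operators described in Section 3.

```python
import numpy as np
import matplotlib.pyplot as plt

def track_convergence(objective, dim=10, bounds=(-100.0, 100.0),
                      pop_size=30, iterations=500):
    """Record best-so-far and population-average fitness per iteration.
    The update rule (contraction toward the best position with shrinking
    noise) is a placeholder, not the APO-JADE operators."""
    low, high = bounds
    pop = np.random.uniform(low, high, (pop_size, dim))
    best_curve, avg_curve = [], []
    best_fit, best_pos = np.inf, pop[0].copy()
    for t in range(iterations):
        fitness = np.apply_along_axis(objective, 1, pop)
        i = int(fitness.argmin())
        if fitness[i] < best_fit:
            best_fit, best_pos = float(fitness[i]), pop[i].copy()
        best_curve.append(best_fit)
        avg_curve.append(float(fitness.mean()))
        scale = 0.1 * (high - low) * (1.0 - t / iterations)  # shrinking step
        pop = np.clip(best_pos + scale * np.random.randn(pop_size, dim),
                      low, high)
    return best_curve, avg_curve

if __name__ == "__main__":
    sphere = lambda x: float(np.sum(x ** 2))
    best_curve, avg_curve = track_convergence(sphere)
    plt.semilogy(best_curve, label="best so far")
    plt.semilogy(avg_curve, label="population average")
    plt.xlabel("Iteration")
    plt.ylabel("Fitness")
    plt.legend()
    plt.savefig("convergence_sphere.png")
```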
4.9. APO-JADE Search History Diagram
As shown in
Figure 4 and
Figure 5, the search history analysis of the APO-JADE optimizer on the CEC2022 benchmark functions reveals various patterns in the search space exploration, where the red dots represent the best solutions found. The plots show a tendency for the optimizer to focus its search efforts on specific regions of the search space, which likely correspond to promising areas identified during the optimization process. For instance, the search history for several functions shows dense clusters of search points, indicating areas where the optimizer has concentrated its efforts. This behavior is observed in the dense concentration of search points around specific coordinates. In some cases, the optimizer exhibits a broader spread in its search, suggesting an exploration of a wider range of values. However, even in these cases, there are often still noticeable areas of higher density, indicating focused exploration. This pattern of combining broad exploration with targeted search in promising regions is indicative of an effective optimization strategy, where the optimizer initially explores a wide range of potential solutions and then homes in on areas that appear most promising based on initial findings. Overall, the APO-JADE optimizer shows a consistent pattern of efficiently balancing exploration and exploitation, which is crucial for effectively navigating complex search spaces.
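A plot of this kind can be produced by storing every candidate position visited during a two-dimensional run and scattering the points, with the best solution found marked as a red dot. In the sketch below, the actual APO-JADE moves are replaced by a simple contraction-toward-the-best placeholder, so it only illustrates the bookkeeping and plotting.

```python
import numpy as np
import matplotlib.pyplot as plt

def plot_search_history(objective, bounds=(-100.0, 100.0),
                        pop_size=30, iterations=200, step=2.0):
    """Scatter every visited 2D position and mark the best solution in red.
    The position update is a placeholder, not the APO-JADE operators."""
    low, high = bounds
    pop = np.random.uniform(low, high, (pop_size, 2))
    history, best_pos, best_fit = [], pop[0].copy(), np.inf
    for _ in range(iterations):
        fitness = np.apply_along_axis(objective, 1, pop)
        i = int(fitness.argmin())
        if fitness[i] < best_fit:
            best_fit, best_pos = float(fitness[i]), pop[i].copy()
        history.append(pop.copy())
        # placeholder move: sample around the best position found so far
        pop = np.clip(best_pos + step * np.random.randn(pop_size, 2), low, high)
    points = np.vstack(history)
    plt.scatter(points[:, 0], points[:, 1], s=2, alpha=0.3)
    plt.scatter(best_pos[0], best_pos[1], color="red", zorder=3,
                label="best solution")
    plt.xlabel("x1")
    plt.ylabel("x2")
    plt.legend()
    plt.savefig("search_history.png")

if __name__ == "__main__":
    plot_search_history(lambda x: float(np.sum(x ** 2)))
```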
4.10. APO-JADE Average Fitness Diagram
The average fitness score serves as a key indicator of how effectively a particular solution performs in comparison to others within the scope of the optimization problem at hand. When evaluating the performance of the APO-JADE algorithm, the fitness score is determined based on the value of the objective function that the algorithm seeks to minimize or maximize. In the context of minimization, a lower fitness score typically signifies a superior solution, whereas in maximization, a higher fitness score denotes a better outcome. This fitness value reflects the quality or suitability of a solution, steering the optimizer toward progressively better-performing solutions over successive iterations.
The performance of the optimizer as shown in
Figure 6 and
Figure 7 across the different fitness functions in the CEC2022 benchmark suite can be observed through the variation in the average fitness of all individuals in the population over 500 iterations. In most cases, a rapid decline in the average fitness occurs within the initial iterations, demonstrating a quick convergence toward a local or global optimum. However, the scale and behavior of this convergence differ significantly across functions, which suggests varying levels of problem complexity and landscape ruggedness. Some functions show a relatively smooth and continuous improvement, while others exhibit fluctuations indicating possible re-exploration or escaping from local optima, which are particularly noticeable in functions where average fitness temporarily increases before continuing a downward trend. This behavior emphasizes the optimizer’s capability to adapt and search efficiently across diverse problem spaces, although the extent of optimization and stability varies, highlighting the need for adaptive strategies or parameter tuning tailored to specific types of optimization landscapes.
4.11. APO-JADE Box-Plot Diagram
The box plots (see
Figure 8 and
Figure 9) for the APO-JADE optimizer provide a visual representation of the distribution of the best fitness scores obtained for each benchmark function (F1 to F10). For F1, the median fitness score is around
with a noticeable spread between the first and third quartiles, indicating variability, and a few outliers above
. For F2, the median score is slightly above 400 with a wider interquartile range extending up to approximately 480, suggesting performance variability and no significant outliers. The F3 box plot shows a median score around 630, with an interquartile range from approximately 620 to 650 and a single outlier around 660, indicating consistent performance with occasional deviations. For F4, the median score is approximately 840, with a narrow interquartile range from 830 to 850, suggesting stable performance and no significant outliers. For F5, the median score is around 1500, with a wide interquartile range extending up to 2000 and a few outliers around 2500, indicating higher score variability. The F6 plot shows a median score around 4000, with a broad interquartile range extending up to approximately 7000, suggesting high variability. For F7, the median score is around 2080, with an interquartile range from 2060 to 2100, indicating stable performance and no significant outliers. For F8, the median score is around 2330, with the interquartile range extending to approximately 2250, and several outliers above 2270. For F9, the median score is around 2600, with the interquartile range extending from approximately 2580 to 2640, indicating moderate variability with no significant outliers. Lastly, for F10, the median score is around 2700, with a narrow interquartile range and a few outliers above 3500, suggesting stable performance with occasional deviations. Overall, these box plots demonstrate the APO-JADE optimizer’s performance across different benchmark functions, highlighting its ability to achieve consistent fitness scores with occasional variability depending on the function’s complexity and nature.
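Such box plots can be generated directly from the per-run best fitness scores, for example those collected by the evaluation loop sketched in Section 4.3. The sketch below assumes the scores are already available; the values in the usage example are dummies, not the reported results.

```python
import matplotlib.pyplot as plt

def boxplot_best_scores(results, path="apo_jade_boxplots.png"):
    """`results` maps a function name (e.g., "F1") to the list of best
    fitness scores obtained over the independent runs."""
    names = list(results)
    plt.boxplot([results[name] for name in names])
    plt.xticks(range(1, len(names) + 1), names)
    plt.xlabel("Benchmark function")
    plt.ylabel("Best fitness score")
    plt.savefig(path)

if __name__ == "__main__":
    # dummy per-run scores used only to make the sketch runnable
    boxplot_best_scores({"F1": [300.1, 300.0, 300.3, 300.2],
                         "F2": [410.0, 435.2, 401.7, 420.9]})
```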
4.12. APO-JADE Heat Map Diagram
As shown in
Figure 10 and
Figure 11, the heat maps of the sensitivity analysis for different population sizes and iterations provide information on how the performance of the APO-JADE optimizer varies with these parameters in different benchmark functions (F1 to F10). For F1, the heat map shows that with 10 search agents and 100 iterations, the fitness score is highest at
, indicating poor performance. As the number of iterations increases, the fitness scores generally decrease, with the best performance around 500 iterations and 40 agents, where the fitness score drops to
. This trend suggests that higher iterations and a moderate number of search agents improve performance. For F2, the highest fitness score (
) occurs with 10 search agents and 100 iterations. The performance improves significantly as the number of iterations and search agents increase, with the best fitness score (
) observed at 50 search agents and 400 iterations, indicating that both parameters contribute to better optimization results. In F3, the fitness score is highest (
) with 10 search agents and 100 iterations, showing initial poor performance. The scores improve with more iterations and agents, reaching a better score (
) at 40 agents and 300 iterations, suggesting that increasing both parameters enhances optimization. The heat map for F4 shows that the highest fitness score (871) occurs with 10 agents and 100 iterations, and performance improves with increased iterations and agents. The best score (
) is at 50 agents and 500 iterations, indicating that a larger population and more iterations lead to better results. For F5, the highest score (2159) is with 10 agents and 100 iterations, showing poor initial performance. Performance improves as both parameters increase, with the best score (1437) at 50 agents and 500 iterations, highlighting the benefits of larger populations and more iterations. In F6, the highest score (
) is with 10 agents and 100 iterations, and performance improves with increased iterations and agents, with the best score (4313) at 50 agents and 500 iterations, suggesting that larger populations and more iterations significantly enhance performance. For F7, the highest score (2142) is with 20 agents and 100 iterations, and the performance generally improves with more iterations and agents, reaching the best score (2066) at 50 agents and 300 iterations, indicating the benefit of larger populations and more iterations. The F8 heat map shows that the highest score (2301) is with 10 agents and 100 iterations, and performance improves with increased iterations and agents with the best score (2233) at 50 agents.
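The sensitivity analysis itself can be reproduced with a simple grid sweep: the optimizer is run for every pair of population size and iteration budget, and the resulting best fitness values are rendered as a heat map. The optimize(objective, dim, bounds, iterations, pop_size) interface and the random-search placeholder below are assumptions made only to keep the sketch self-contained.

```python
import numpy as np
import matplotlib.pyplot as plt

def sensitivity_heatmap(optimize, objective, pop_sizes=(10, 20, 30, 40, 50),
                        budgets=(100, 200, 300, 400, 500),
                        dim=10, bounds=(-100.0, 100.0),
                        path="sensitivity_heatmap.png"):
    """Best fitness for every (population size, iteration budget) pair,
    drawn as a heat map. `optimize` is a hypothetical interface."""
    grid = np.empty((len(pop_sizes), len(budgets)))
    for i, pop in enumerate(pop_sizes):
        for j, iters in enumerate(budgets):
            grid[i, j] = optimize(objective, dim, bounds, iters, pop)
    plt.imshow(grid, aspect="auto", cmap="viridis")
    plt.colorbar(label="Best fitness")
    plt.xticks(range(len(budgets)), budgets)
    plt.yticks(range(len(pop_sizes)), pop_sizes)
    plt.xlabel("Iterations")
    plt.ylabel("Search agents")
    plt.savefig(path)
    return grid

if __name__ == "__main__":
    def random_search(objective, dim, bounds, iterations, pop_size):
        """Placeholder optimizer standing in for APO-JADE."""
        low, high = bounds
        samples = np.random.uniform(low, high, (iterations * pop_size, dim))
        return float(np.apply_along_axis(objective, 1, samples).min())

    sensitivity_heatmap(random_search, lambda x: float(np.sum(x ** 2)))
```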
5. Case Study: Application of APO-JADE for Attack Detection
In this section, we delve into the comprehensive structure of the attack detection APO-JADE model. The model is inspired by DRaNN_PSO [
30].
The inception phase of our proposed methodology, which utilizes the Hybrid APO-JADE optimizer, begins with the process of data assimilation and meticulous observation. To ensure the robustness and adaptability of the model [
31], three state-of-the-art security datasets are employed: DS2OS, UNSW-NB15, and ToN_IoT. Each dataset possesses distinct characteristics and features, rendering them suitable for diverse types of analysis. We provide a concise exposition of each of these datasets below [30].
This diverse dataset collection facilitates a comprehensive and multifaceted analysis, contributing to the development of a more effective and resilient attack detection system.
5.1. DS2OS Dataset
In 2018, Marc-Oliver Pahl and François-Xavier Aubet introduced the DS2OS dataset, which is a contemporary Industrial Internet of Things (IIoT) security dataset [
32]. As an open-source resource, it is an invaluable asset for assessing the competency of artificial intelligence-centric cybersecurity paradigms, particularly in the realms of smart industries, urban intelligent systems (smart cities), and various IIoT applications [
30].
The DS2OS dataset consists of a cumulative 357,952 samples. Out of these, 347,935 are categorized as standard samples, while the remaining 10,017 are designated as anomalous entries. Structurally, DS2OS is composed of 13 features and is classified into 8 distinct categories. This diverse and comprehensive dataset offers a rich resource for developing and testing cybersecurity models in IIoT contexts, providing both typical and atypical scenarios for robust analysis [
30].
5.1.1. UNSW-NB15 Dataset
Originating from the Cyber Range Lab of the Australian Centre for Cyber Security, the UNSW-NB15 dataset was introduced to the public in 2015 by Moustafa et al. [
33]. Renowned in cybersecurity research, this dataset comprises an exhaustive collection of 257,673 samples. Delving into its composition, 93,000 of these samples are classified as regular, while a substantial 164,673 are identified as malicious entries [
30].
In terms of features, the UNSW-NB15 dataset is embedded with 49 distinct characteristics and is divided into 10 categorical classes. This rich and diverse dataset is pivotal for cybersecurity research, offering a comprehensive set of data points for the development and validation of advanced cybersecurity models [
30].
5.1.2. ToN_IoT Dataset
A novel addition to the security datasets tailored for IoT/IIoT applications is the ToN_IoT dataset, which was introduced by the Cyber Range and IoT Labs of the University of New South Wales, Australia, in 2019 [
34]. The ToN_IoT dataset is paramount for evaluating the effectiveness and accuracy of various cybersecurity solutions, especially those underpinned by Machine Learning (ML) and Deep Learning (DL) architectures.
The dataset is extensive, housing a total of 1,379,274 samples. Of these, 270,279 are classified as normal instances, while the remaining 1,108,995 are categorized as anomalous readings. In terms of its structure, the ToN_IoT dataset is organized into 10 distinct classes. This comprehensive and diverse dataset is crucial for developing and testing advanced cybersecurity models in the context of IoT and IIoT environments.
5.1.3. Data Preparation and Curation
The act of preparing datasets is a pivotal stage in the research process, particularly when dealing with artificial intelligence (AI) systems. Properly curated data are essential for accelerating model training and enhancing the accuracy and efficiency of the resultant model. The data preparation process encompasses various intricate operations, such as eliminating non-essential attributes, converting categorical values into numerical formats, and applying imputation strategies for missing or incomplete data points.
To ensure the completeness and integrity of the dataset, we employed the Mean Imputation method to address missing or incomplete data. In this approach, missing values in numerical features were replaced with the mean value of the respective feature. This method was chosen for its simplicity and effectiveness in maintaining the central tendency of the data without introducing significant bias.
For each feature with missing values, the mean was calculated using the available data points, and these mean values were then used to fill in the gaps where data was missing. By doing so, we ensured that the dataset remained robust and suitable for further analysis, allowing our models to train on a complete dataset without the potential distortions that might arise from more complex imputation methods.
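For readers who want a concrete reference, the snippet below is a minimal sketch of mean imputation in Python with pandas; the file name and the decision to impute every numeric column are illustrative assumptions, not details of our Matlab implementation.

```python
import pandas as pd

# Illustrative sketch only: file name and column selection are assumptions.
df = pd.read_csv("dataset.csv")

numeric_cols = df.select_dtypes(include="number").columns
# Replace missing entries in each numeric feature with that feature's mean,
# computed from the available (non-missing) values.
df[numeric_cols] = df[numeric_cols].fillna(df[numeric_cols].mean())
```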
In our study, we meticulously employed a bifurcated procedure for data preparation, which is broadly classified into two methodologies: pre-processing and normalization. Pre-processing involves steps to make the raw data more suitable for model building, while normalization focuses on scaling the data to a specific range to ensure consistent data representation. This approach ensures the data are optimally prepared for effective and efficient analysis within AI systems.
5.1.4. Pre-Processing
The nuances of pre-processing in our research involve strategic transformations of data types, ensuring seamless integration with the architecture at hand, especially the neural network’s input layer. One of the key challenges we faced was the presence of categorical attributes, which required conversion into a numerical format for effective processing. The technique employed for this transformation was label encoding. Label encoding is particularly advantageous as it translates categorical data into a format more amenable to Machine Learning algorithms.
Furthermore, it is worth noting that certain attributes commonly found in many datasets, such as time, date, and timestamps, were determined to be inconsequential for the specific objective of attack detection in our study. Therefore, these attributes were judiciously excised from the dataset to streamline the data and focus on the most relevant features. This selective approach in pre-processing ensures that the data fed into the neural network are optimally structured for the purpose of attack detection.
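A minimal sketch of these two pre-processing steps, dropping time-related attributes and label-encoding categorical columns, is given below in Python; the specific column names are hypothetical and vary per dataset, and the original pipeline was implemented in Matlab.

```python
import pandas as pd
from sklearn.preprocessing import LabelEncoder

df = pd.read_csv("dataset.csv")

# Drop attributes deemed inconsequential for attack detection
# (hypothetical column names; the actual names differ per dataset).
df = df.drop(columns=["date", "time", "timestamp"], errors="ignore")

# Label-encode every remaining categorical (object-typed) column so that the
# neural network's input layer receives purely numerical features.
for col in df.select_dtypes(include="object").columns:
    df[col] = LabelEncoder().fit_transform(df[col].astype(str))
```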
5.1.5. Normalization
In data analytics, datasets often harbor attributes with vastly differing scales and magnitudes. When these discrepancies are left unchecked, they can inadvertently skew the model, leading to biased outcomes and potentially compromising the integrity of the results. The normalization process emerges as a crucial remedy to this predicament. It ensures a uniform scaling of the dataset attributes, mapping them onto a consistent range between 0.0 and 1.0. This transformation is accomplished without distorting the inherent relationships and patterns within the data.
The normalization technique of min–max scaling was judiciously adopted. Min–max scaling is a method that rescales the range of features to align with the smallest and largest values for each feature. This technique effectively maintains data consistency and ensures that all features contribute equitably during the modeling process. By implementing this approach, we mitigated the risk of skewed models and biased outcomes, enhancing the reliability and integrity of our research findings.
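As an illustration of this step, the sketch below applies min–max scaling with scikit-learn; the assumed target column name "label" and the prior label encoding of categorical attributes are illustrative assumptions.

```python
import pandas as pd
from sklearn.preprocessing import MinMaxScaler

df = pd.read_csv("dataset.csv")  # assumed to be already label-encoded (numeric)

# Min-max scaling maps every feature onto the [0.0, 1.0] range:
#   x_scaled = (x - x_min) / (x_max - x_min)
scaler = MinMaxScaler(feature_range=(0.0, 1.0))
X_scaled = scaler.fit_transform(df.drop(columns=["label"], errors="ignore"))
```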
5.2. Hyper Parameter Tuning and Configuration
Hyperparameters play an instrumental role in delineating the architecture of a neural network and presiding over its training dynamics. In our study, while the foundational architecture of the Deep Recursive Artificial Neural Network (DRaNN) remains invariant, the task of ascertaining the optimal values for the hyperparameters is entrusted to the APO-JADE algorithm. This approach is intended to fine-tune the network for enhanced attack detection precision.
The hyperparameters under consideration include the learning rate, number of epochs, momentum, batch size, and dropout rate. We provide a comprehensive elucidation of these parameters, followed by their optimal values as determined by the hybrid APO-JADE algorithm across the different datasets. The optimization of these hyperparameters is pivotal for the effectiveness of the neural network in accurately detecting attacks, ensuring that the model is not only robust but also sensitive to the nuances of the data it processes.
The optimization of these hyperparameters is achieved using the APO-JADE algorithm. This method combines the adaptive behavior of Artificial Protozoa with the efficiency of the JADE algorithm to iteratively search for the optimal combination of hyperparameters. Each candidate solution within the population adjusts its position in the search space based on its own experience and the collective experience of neighboring candidates, emulating the dynamic adaptability of protozoa.
Given the distinct characteristics and variability of the datasets employed—DS2OS, UNSW-NB15, and ToN_IoT—the optimal set of hyperparameters often varies across different problems. The APO-JADE algorithm is designed to dynamically explore the hyperparameter space, adapting to the unique features and demands of each dataset. For instance, the DS2OS dataset, with its focus on Industrial Internet of Things (IIoT) security, requires different hyperparameter settings compared to the UNSW-NB15 dataset, which is centered on traditional cybersecurity threats, or the ToN_IoT dataset, which is tailored for IoT/IIoT environments.
The APO-JADE algorithm is executed separately for each dataset, with its control parameters, such as the population size and the adaptive mutation and crossover settings, fine-tuned to the specific characteristics of the dataset. This process results in hyperparameters that are not only tailored to the particularities of each dataset but are also optimized to enhance the neural network’s performance in detecting cybersecurity threats across diverse contexts.
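To make the tuning loop concrete, the following sketch shows a generic population-based hyperparameter search of the kind described above; the bounds, the candidate encoding, and the placeholder fitness function are illustrative assumptions and do not reproduce the actual APO-JADE update rules.

```python
import random

# Assumed, illustrative search ranges for the tuned hyperparameters.
BOUNDS = {
    "learning_rate": (1e-4, 1e-1),
    "momentum":      (0.5, 0.99),
    "dropout":       (0.0, 0.5),
    "batch_size":    (16, 256),   # integer-valued in practice (rounded before use)
    "epochs":        (10, 100),   # integer-valued in practice (rounded before use)
}

def random_candidate():
    """Sample one hyperparameter vector uniformly from the assumed bounds."""
    return {k: random.uniform(lo, hi) for k, (lo, hi) in BOUNDS.items()}

def fitness(params):
    """Placeholder objective: in the actual study this would train the detection
    network with `params` and return 1 - validation accuracy (lower is better)."""
    return random.random()

population = [random_candidate() for _ in range(30)]
best = min(population, key=fitness)   # evaluate every candidate, keep the best
print(best)
```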
5.3. Learning Rate
Serving as a linchpin in Deep Learning (DL) algorithms, the learning rate is a critical hyperparameter that dictates the pace at which a model adapts during the training phase. The choice of the learning rate involves a trade-off: a diminutive learning rate might lead to more refined learning, yet it may concurrently prolong the training duration. Conversely, an elevated learning rate might expedite the learning process but could result in potentially large prediction errors. Consequently, discerning the optimal learning rate is one of the cardinal challenges in the design of DL models.
This challenge is particularly pronounced in the context of DL, where models are often complex and sensitive to the rate of learning. The learning rate impacts the convergence of the training process, with implications for both the accuracy and efficiency of the model. Thus, carefully calibrating the learning rate is essential for achieving a balance between rapid convergence and the accuracy of the learned model.
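For reference, the role of the learning rate can be seen in the standard gradient-descent weight update (a generic formulation, not tied to a particular framework):

$$ w_{t+1} = w_t - \eta \, \nabla_w \mathcal{L}(w_t) $$

where $\eta$ is the learning rate and $\mathcal{L}$ is the loss function. A small $\eta$ yields cautious steps and slow training, while a large $\eta$ takes aggressive steps that can overshoot minima.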
5.4. Number of Epochs
The ‘number of epochs’ hyperparameter is representative of the iterations in neural network training, demarcating the frequency with which the entire dataset is parsed by the learning algorithm. This hyperparameter is critical as it determines the number of times the weights and biases within the neural network architecture undergo updates. An appropriate selection of the number of epochs ensures that the model converges to an optimal solution without succumbing to overfitting or underfitting the data.
The optimal number of epochs is vital for the efficacy of the training process. Too few epochs might result in an undertrained model that fails to capture the complexity of the data, whereas too many epochs can lead to overfitting, where the model becomes overly tailored to the training data and performs poorly on unseen data. Thus, finding the right balance in the number of epochs is essential for developing a robust and generalizable neural network model.
5.5. Momentum
The momentum hyperparameter acts as a guiding force in neural network training, amalgamating information from preceding iterations to shape the trajectory of subsequent steps in the learning process. This strategic incorporation of ‘historical knowledge’ serves to accelerate convergence and introduce stability into the model. Specifically, momentum helps to ameliorate the erratic oscillations that can plague weight updates, thereby enabling a smoother transition toward the global minimum of the loss function.
The efficacy of the momentum hyperparameter lies in its ability to navigate the parameter space more effectively. By considering the gradients of past iterations, momentum prevents the model from getting stuck in local minima and mitigates the risk of erratic updates. Consequently, it plays a pivotal role in enhancing the model’s convergence rate and improving its overall performance.
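A common formulation of the momentum update, included here for illustration (the exact variant used by a given framework may differ), is

$$ v_{t+1} = \beta \, v_t - \eta \, \nabla_w \mathcal{L}(w_t), \qquad w_{t+1} = w_t + v_{t+1}, $$

where $\beta$ is the momentum coefficient, $\eta$ the learning rate, and $v$ the accumulated velocity built from past gradients.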
5.6. Batch Size
The batch size is a pivotal hyperparameter in the realm of deep learning. It determines the number of training samples to be processed before the model’s intrinsic parameters, such as weights, are updated. The choice of batch size significantly influences both the computational efficiency of the training process and the granularity of the model’s weight updates.
A smaller batch size tends to lead to more frequent updates, offering a more refined and granular learning curve. This can be beneficial for capturing subtle patterns in the data but may increase computational demands. On the other hand, a larger batch size typically provides a more generalized update at each step and can potentially lead to faster convergence in terms of epochs. However, it may overlook finer nuances in the data and requires more memory.
Thus, selecting an appropriate batch size is crucial, as it strikes a balance between the accuracy of the learning process and computational efficiency. This decision is often guided by the specific characteristics of the dataset and the computational resources available.
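The computational trade-off can be illustrated with a small calculation of weight updates per epoch; the training-set size below is an assumption, not a figure from our datasets.

```python
import math

# Illustrative only: updates per epoch for an assumed training-set size.
n_train = 250_000
for batch_size in (32, 128, 512):
    updates = math.ceil(n_train / batch_size)
    print(f"batch_size={batch_size:>3} -> {updates} weight updates per epoch")
```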
5.7. Dropout
Dropout is a prominent regularization technique employed during the training phase of neural networks. It functions by stochastically ‘turning off’ a fraction of neurons within the network. This sporadic disabling of neurons is designed to compel the model to develop robust and diversified internal representations, reducing its reliance on any single neuron or specific feature set.
The primary function of dropout is to serve as a preventive measure against overfitting. By temporarily removing neurons during training, dropout ensures that the neural network does not become overly specialized to the training data. This approach enhances the model’s ability to generalize, thus improving its performance on previously unseen data. Consequently, dropout is an essential technique for maintaining the versatility and general applicability of neural network models.
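The sketch below shows a generic inverted-dropout layer in NumPy, purely as an illustration of the mechanism; it is not the implementation used in our model.

```python
import numpy as np

rng = np.random.default_rng(seed=0)

def dropout(activations, rate=0.5, training=True):
    """Inverted dropout: zero a random fraction of units during training and
    rescale the survivors so the expected activation stays unchanged."""
    if not training or rate == 0.0:
        return activations
    mask = rng.random(activations.shape) >= rate
    return activations * mask / (1.0 - rate)

h = np.ones(10)
print(dropout(h, rate=0.5))  # roughly half the entries become 0, the rest 2.0
```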
5.8. Performance Assessment Metrics
To discern the robustness and adaptability of our proposed design, we employed a repertoire of performance metrics. These metrics are essential in gauging the congruence between the model’s predicted outcomes and the actual ground truths. Central to our evaluation methodology are the constructs of True Positives (TPs), False Positives (FPs), True Negatives (TNs), and False Negatives (FNs).
True Positives (TPs): Quantify instances where the model accurately identifies actual intrusions.
False Positives (FPs): Correspond to instances where the model erroneously labels normal activities as intrusions.
True Negatives (TNs): Represent instances where the model correctly identifies benign behaviors.
False Negatives (FNs): Occur when the model fails to detect actual intrusive activities.
Building upon these foundational metrics, subsequent measures such as accuracy, precision, recall, and the F1 score are derived to provide a comprehensive snapshot of the model’s performance. These metrics collectively offer a holistic view of the model’s effectiveness in correctly classifying and identifying various activities, illuminating its strengths and areas for improvement in the context of intrusion detection.
5.8.1. Accuracy
As a seminal metric in model evaluation, accuracy encapsulates the proportion of instances where the model’s predictions align with actual events. This includes correctly identifying both malicious intrusions and legitimate actions. Mathematically, accuracy is evaluated by the formula in Equation (21):

$$ \text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \tag{21} $$

In this formula, TP represents True Positives, TN represents True Negatives, FP represents False Positives, and FN represents False Negatives. The sum of TPs and TNs is divided by the total number of predictions (the sum of TPs, TNs, FPs, and FNs), yielding the accuracy rate. This rate effectively measures the overall correctness of the model in classifying and predicting both intrusion and non-intrusion events.
5.8.2. Precision
Precision is a crucial performance metric that quantifies a model’s ability to correctly identify anomalous observations. It is calculated as the fraction of True Positives (TPs), which are correctly classified anomalies, relative to the sum of True Positives and False Positives (FPs), where FPs denote incorrectly classified normal observations. Mathematically, precision can be expressed as shown in Equation (22):

$$ \text{Precision} = \frac{TP}{TP + FP} \tag{22} $$
This metric is instrumental in gauging the trustworthiness of positive identifications made by the model. A higher precision value indicates that the model is more reliable in correctly identifying anomalous events, minimizing the likelihood of false alarms or misclassifications.
5.8.3. Recall (Sensitivity)
Recall, often referred to as sensitivity, is a crucial metric that offers insights into a model’s ability to flag anomalies. It calculates the proportion of True Positives (TPs), which are correctly identified anomalies, to the sum of True Positives and False Negatives (FNs), where FNs represent actual anomalies that the model failed to detect. The formula for recall is presented in Equation (23):

$$ \text{Recall} = \frac{TP}{TP + FN} \tag{23} $$
Recall is an invaluable metric in scenarios where failing to detect an anomaly could result in severe consequences. It emphasizes the model’s capability to cover and correctly identify actual anomalies, serving as a critical measure of the model’s effectiveness in sensitive applications.
5.8.4. F1 Score
The F1 score is a comprehensive metric that harmoniously merges both precision and recall to produce a singular measure, balancing the trade-off between these two metrics. It is particularly relevant in scenarios characterized by an uneven class distribution. The F1 score is computed as the harmonic mean of precision and recall, offering a more holistic evaluation of the model’s performance, especially when both False Positives and False Negatives carry significant implications. The computation of the F1 score is given in Equation (24):

$$ \text{F1} = 2 \times \frac{\text{Precision} \times \text{Recall}}{\text{Precision} + \text{Recall}} \tag{24} $$
The F1 score achieves its best value at 1, indicating perfect precision and recall, and its worst at 0. This metric is particularly useful for assessing the balance between precision and recall, providing a more nuanced understanding of the model’s overall predictive accuracy in the context of anomaly detection.
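The four metrics of Equations (21) to (24) can be computed directly from confusion-matrix counts, as in the short sketch below; the example counts are illustrative and are not results from this study.

```python
def classification_metrics(tp, fp, tn, fn):
    """Derive accuracy, precision, recall, and F1 from confusion-matrix counts,
    following Equations (21)-(24)."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return accuracy, precision, recall, f1

# Illustrative counts only (not figures reported in this paper):
print(classification_metrics(tp=950, fp=40, tn=900, fn=50))
```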
5.9. Experimental Procedure and Resultant Insights
This segment is dedicated to elucidating the experimental design underpinning our study alongside offering a cursory exploration of the emergent findings associated with the proposed method. The experimental setup is meticulously crafted to evaluate the efficacy and robustness of the proposed model, encompassing various scenarios and datasets to ensure a comprehensive assessment.
The subsequent sections will delve into the specifics of the experimental procedures, including the configuration of the model, the datasets employed, and the metrics used for evaluation. Additionally, we will present an initial overview of the findings that have emerged from the application of our proposed method, highlighting its potential implications and contributions to the field.
5.9.1. Implementation Methodology
The devised model was instantiated on a hardware setup comprising a Lenovo workstation with a Core i7 processor and 32 GB of DDR4 RAM. This configuration provided the computational power and memory capacity needed to run the model and process the datasets efficiently.
For the software scaffolding of the proposed algorithm, Matlab was chosen as the primary development environment. The choice of Matlab was due to its robust numerical computing capabilities and extensive libraries, making it well suited for implementing complex algorithms and data processing tasks. The entire development and testing of the algorithm were conducted on a Windows 11 Professional operating environment, ensuring a stable and powerful platform for the execution of the model.
5.9.2. Delving into the Experimental Results
The proficiency of the proposed method has been exhaustively scrutinized over three distinct datasets in both binary and multiclass contexts. Our experimental approach was anchored in the widely used 5-fold cross-validation technique, which offers a robust and impartial basis for evaluating a wide array of ML/DL algorithms.
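As a point of reference, the sketch below shows a standard stratified 5-fold split with scikit-learn; the feature matrix and labels are dummies, and our own fold construction (which partitions the training data by sample volume) differs from this generic layout.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold

# Dummy data standing in for the pre-processed, normalized feature matrix.
X = np.random.rand(1000, 10)
y = np.random.randint(0, 2, size=1000)

skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=42)
for fold, (train_idx, test_idx) in enumerate(skf.split(X, y), start=1):
    X_train, X_test = X[train_idx], X[test_idx]
    y_train, y_test = y[train_idx], y[test_idx]
    # Here the detection model would be trained on the four training folds and
    # evaluated on the held-out fold, yielding one accuracy figure per fold.
    print(f"fold {fold}: {len(train_idx)} train / {len(test_idx)} test samples")
```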
Subsequent sections will dissect the insights culled from each dataset, providing a detailed exploration of the results. This comprehensive analysis will illuminate the strengths and areas for improvement of our proposed method in various classification scenarios.
5.9.3. Evaluative Insights for the DS2OS Dataset
For our analysis focusing on the DS2OS dataset, we implemented a 70:30 train–test split. The pivotal hyperparameters were meticulously calibrated using the hybrid APO-JADE algorithm. Our intrusion detection model was trained on 10 salient features and iteratively refined over 100 epochs.
In the binary classification analysis, we employed a 5-fold cross-validation mechanism. Here, the training dataset was divided into five folds based on sample volume, and the results were broken down for each fold. Notably, the APO-JADE paradigm reached its peak attack detection accuracy during the fifth fold, which had the highest sample volume, achieving accuracies of 97.42% in training and 97.51% in testing. In contrast, the first fold, with fewer samples, recorded the lowest accuracies of 95.21% (training) and 94.22% (testing). The intermediate folds marked accuracies of 96.47%, 96.42%, and 97.33%. Additionally, the fifth fold demonstrated the superiority of APO-JADE through auxiliary metrics such as precision, recall, and the F1 score.
In multiclass classification as shown in
Table 5, the APO-JADE architecture excelled in distinguishing seven distinct attack classes alongside normal traffic. The model showed remarkable precision in identifying categories such as ‘Denial of Service (DoS)’, ‘Scan’, and ‘Wrong setup’, with accuracies of 96.544%, 95.24%, and 92.68%, respectively. Other attack vectors such as ‘Malicious operation’, ‘Spying’, ‘Malicious control’, and ‘Data type probing’ were identified with accuracies of 97.52%, 94.74%, 96.87%, and 93.81%, respectively. The ‘Normal’ class was accurately categorized with a 90.65% success rate, while a minor 1.46% of its samples were erroneously flagged as malicious.
6. Conclusions
In this paper, we have presented the Hybrid APO-JADE optimizer, a novel optimization metaheuristic that integrates the strengths of JADE (adaptive differential evolution) and the Artificial Protozoa Optimizer (APO) to effectively tackle complex optimization problems. The proposed algorithm is designed to balance the crucial aspects of exploration and exploitation, enhancing its ability to find high-quality solutions efficiently.

The initial phase of the algorithm utilizes JADE’s adaptive mechanisms to explore the search space comprehensively. By dynamically adjusting the control parameters and employing differential mutation and crossover operations, JADE prevents premature convergence and ensures a diverse set of candidate solutions. This global exploration phase is crucial for identifying promising regions in the search space. As the optimization progresses, the algorithm transitions to the APO mechanism, which focuses on intensifying the search around the best solutions identified by JADE. The use of Levy flights and adaptive change factors in the APO phase enhances local exploitation, allowing for thorough refinement of the solutions. This dynamic transition between JADE and APO, governed by a predefined iteration threshold, ensures that the algorithm shifts from exploration to exploitation at the appropriate time.

The APO-JADE algorithm was evaluated using benchmark functions from CEC2017, CEC2021, and CEC2022, demonstrating notable improvements in convergence rates and accuracy over the standard PSO. Furthermore, the application of APO-JADE to real-world attack detection scenarios using the DS2OS, UNSW-NB15, and ToN_IoT datasets showcased its robust performance. The experimental results highlighted APO-JADE’s capability to effectively navigate complex optimization landscapes and achieve high-quality solutions.