Optimization of Zinc Smelting Slag Melting Point Based on Catboost and Improved Snake Optimization Algorithm

Kong, Yueping; Liu, Ziyu

doi:10.3390/app14114603

Open AccessArticle

Optimization of Zinc Smelting Slag Melting Point Based on Catboost and Improved Snake Optimization Algorithm

by

Yueping Kong

^*

and

Ziyu Liu

School of Information and Control Engineering, Xi’an University of Architecture and Technology, Xi’an 710055, China

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2024, 14(11), 4603; https://doi.org/10.3390/app14114603

Submission received: 7 May 2024 / Revised: 24 May 2024 / Accepted: 25 May 2024 / Published: 27 May 2024

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

The regulation of the melting point of zinc smelting slag has an important impact on the subsequent smelting processes of the metal. In actual production, uncontrollable melting points may result in inconsistent product quality, which has a great negative impact on the smelter’s efficiency and environmental protection. However, the regulation mechanism of the melting point of the smelting slag is complex, with many influencing factors, and there is no recognized high-precision calculation method. In response to these challenges, this study introduces an innovative approach for optimizing the melting point of zinc smelting slag based on the improved Snake Optimization (ISO) algorithm. The melting point of zinc smelting slag is modeled using the Catboost algorithm, and the model parameters are optimized using the Tree-structured Parzen Estimator (TPE) to improve the accuracy of the model. Next, the ISO algorithm is employed to conduct optimization calculations, determining the optimal values of various production process parameters that minimize the melting point. The effectiveness of this approach was evaluated using diverse modeling algorithms and test functions, subsequently applied to optimize and validate actual production data from a smelter in Shaanxi, China. Statistical analyses reveal that the TPE-optimized Catboost model exhibits an R² of 93.89%, an RMSE of 7.02 °C, an MAE of 6.19 °C, and an MRE of 7.88%, surpassing performance metrics of alternative algorithms. Regarding optimization efficacy, the proposed ISO algorithm achieves an average reduction of 65 °C in the melting point and demonstrates superior robustness compared to both actual production data and alternative algorithms.

Keywords:

zinc smelting slag; melting point optimization; Catboost; improved snake optimization algorithm

1. Introduction

In the field of zinc smelting and manufacturing, the primary challenge faced by each smelting plant is to mitigate the environmental issues associated with solid waste such as smelting slag, respond to the call for sustainable development, and enhance the economic efficiency of the smelting plants [1,2]. Given the relatively high content of zinc, lead, gold, and silver in zinc smelting slag, an effective solution is to reasonably recover and utilize secondary metal resources generated during the zinc smelting process [3,4,5].

There have been numerous studies on methods for the recovery of secondary metal resources from zinc smelting slag [6,7,8], including leaching methods, electrochemical methods, and smelting roasting methods. Wei et al. successfully recovered cobalt and zinc from leachate of zinc smelting slag using a leaching method, achieving good recovery results [9]. Hu et al. proposed a two-stage sintering process using the smelting roasting method, which not only successfully recovered zinc and lead but also produced lightweight bricks suitable for use as construction materials [10]. Compared to leaching methods, the smelting roasting method achieves efficient metal extraction and higher purity metal products [11]. In this study, the smelting roasting method was employed. The principle of the smelting roasting method involves melting various smelting slags in a smelting furnace by heating them with coal to ensure smooth subsequent processes. However, excessively high melting temperatures not only affect energy consumption but also increase the difficulty and equipment costs of the smelting process, which greatly influences the application and promotion of the smelting roasting method [12]. Among the factors affecting melting temperature, slag melting point is a key factor, and, therefore, reducing the slag melting point is of significant importance for controlling the melting temperature.

Zinc smelting slag contains various elemental components, and it is generally believed that the slag melting point is determined by the mass fractions of oxides such as Fe₂O₃, CaO, SiO₂, Al₂O₃, and MgO [13]. The process of zinc smelting is a complex and variable physicochemical process, and the melting point is an important indicator affecting the smelting effect. However, there is no universally recognized and highly accurate calculation method for the relationship between melting point and slag composition, which poses great difficulties in modeling and controlling the melting point [14]. In the metallurgical industry, the advancement of related research often relies on the combination of production experience and phase diagrams of various factors. Bao et al. based their research on the CaO–MoO₃–La₂O₃ ternary phase diagram, using the solid-state reaction method to design and synthesize two new types of microwave dielectric ceramics, laying the foundation for the scientific research and practical application of related ceramic systems [15]. Liu et al. established the CaO–Al₂O₃–Vox ternary phase diagram, conducting in-depth research on the changes in the valence state of vanadium during vanadium steel smelting processes, also achieving good results [16]. However, experimental studies on phase diagrams require a significant amount of time and economic costs, and they are faced with challenges such as experimental difficulty and time consumption. Additionally, the establishment of ternary phase diagrams is based on the assumption of equilibrium under steady-state conditions, where the internal components reach a stable state within the entire system. Yet, in practical scenarios, the composition and temperature may change, rendering phase diagrams unable to fully depict dynamic processes and thus having limitations in practical use.

With the demonstrated effectiveness of machine learning and intelligent algorithms in handling nonlinear modeling and optimization problems, researchers have made significant strides in various industrial applications. Shi et al. [14] utilized a Generalized Regression Neural Network to model and predict melting points, coupled with genetic algorithms to determine optimal elemental compositions. This approach significantly improved work efficiency and reduced melting points compared to traditional phase diagram methods. Li et al. [17] introduced the Grey Wolf Optimization algorithm into industrial settings to optimize parameters and instructions in production processes. notably enhanced production efficiency and solution accuracy. Shim et al. [18] established welding effect prediction models in magnetic pulse welding processes and employed the Imperialist Competitive Algorithm to optimize process parameters, leading to a substantial enhancement in welding effects. These algorithms each possess strengths and weaknesses, primarily attributed to their different evolutionary mechanisms. Genetic algorithms offer simple optimization methods with good performance but suffer from serious premature convergence issues. Consequently, researchers have introduced many improved variants of genetic algorithms [19,20]. The Grey Wolf Algorithm is a widely used efficient heuristic algorithm in engineering applications, but its performance depends on its control parameters. Different problems require different parameter adjustments, which can be time-consuming and make optimization results unstable [21,22]. In recent years, The Snake Optimization (SO) algorithm has shown considerable potential in engineering applications [23], emulating behaviors observed in snakes, including predation, reproduction, and evasion. This biomimetic trait renders the algorithm more nature-centric, facilitating effective exploration of solution spaces. However, SO predominantly depends on positional data from superior and inferior individuals in the snake population to update locations, resulting in decreased utilization rates of other individuals and generating algorithmic instability, suggesting potential areas for enhancement.

Based on the above analysis, to address the problems of instability, slow convergence speed, complexity of melting point influencing factors, and insufficient modeling accuracy of traditional methods of the existing optimization methods in the field, this study firstly analyzes the relationship between relevant slag compositions and melting point. Subsequently, the prediction model of the optimization algorithm was established. Then, the improved serpentine optimization algorithm was used for optimization. The optimal process parameters obtained from the optimization were confirmed through validation experiments.

The main contributions and highlights of this study are as follows. First, by utilizing historical production data, we employed a Tree-structured Parzen Estimator (TPE) hyperparameter-optimized CatBoost model for predictive modeling. This approach not only enhances prediction accuracy but also significantly reduces the operational cost and time consumption compared to traditional methods. Second, the performance of the Improved Snake Optimization (ISO) algorithm was bolstered through the integration of Halton sequence initialization, elite reverse learning, and dynamic updating mechanisms of Cauchy–Gaussian mutation. These enhancements improved the algorithm’s capabilities in both global exploration and local exploitation stages. Third, the ISO algorithm was evaluated using ten standard benchmark functions, which are commonly used in intelligent optimization algorithms, and its performance was compared with five other state-of-the-art optimization algorithms. Finally, we assessed the ISO algorithm in practical scenarios using actual production problems and compared the results with those obtained from five other advanced optimization algorithms. The findings demonstrate the superior performance and practical applicability of the proposed ISO algorithm.

The remainder of this paper is organized as follows: Section 2 introduces the processing of smelting roasting methods and the construction of a model for optimizing melting point parameters. Section 3 elaborates on the SO algorithm and strategies for improving its performance. Section 4 describes the results of the Catboost model and the rest of the common models on several evaluation metrics, presents the performance of the ISO and five SOTA algorithms on 10 test functions and real engineering problems, and analyzes the results to demonstrate the feasibility and effectiveness of the proposed algorithms. Finally, Section 5 summarizes the research findings and proposes further development directions.

2. Optimization Model for Process Parameters of Zinc Smelting Slag Melting Point

2.1. Selection of Key Process Parameters

The process flow of zinc smelting slag treatment consists mainly of the batching stage, smelting stage, and product recovery stage, as shown in Figure 1. The specific steps are as follows:

Batching Stage: Initially, in the batching stage, slag materials are mixed with smokeless coal and river sand is added to complete slag making, resulting in mixed slag.

Smelting Stage: The mixed slag is then sent to the smelting furnace for melting treatment. During smelting, various additives are added as needed to adjust the composition, resulting in molten slag, high-silver flue ash, and dust-laden flue gas.

Product Recovery Stage: The high-silver flue ash and dust-laden flue gas are directly recovered, extracting valuable metals such as gold and silver. Subsequently, the molten slag is transferred to the reduction furnace, where various additives are incorporated as necessary to optimize production. This process generates slag suitable for use as building materials and flue gas ash. Through further recovery, metals such as lead and zinc are obtained until the slag contains over 30% Fe, at which point it is discharged as flue slag, achieving resource utilization and enhancing the economic efficiency of the smelting plant.

During the batching and smelting stages, different batching schemes can result in mixed slag containing different elements, thereby affecting the melting point of the mixed slag. Excessively high melting points can pose safety hazards during the production process and incur additional production costs for the smelting plant. Therefore, the practical problem that smelting plants need to address is how to adjust the batching scheme to meet production requirements while minimizing the melting point of the mixed slag as much as possible.

According to relevant studies and literature, it is known that the iron–silicon ratio and the calcium–silicon ratio have a significant impact on the melting point of mixed slag [24,25,26]. The iron–silicon ratio refers to the weight ratio of iron to silicon in the furnace charge. In the metallurgical industry, as the iron–silicon ratio increases, the melting point of the slag tends to increase, while the viscosity decreases, and the melting performance improves. Therefore, during the smelting process, the iron–silicon ratio is usually controlled to reduce the melting point and improve the melting performance of the materials inside the furnace. The calcium–silicon ratio refers to the weight ratio of calcium to silicon. Within a certain range, as the calcium–silicon ratio increases, the melting point rises, and the fluidity also improves. Therefore, during the smelting process, an appropriate amount of silicon is usually added to the slag to reduce the calcium–silicon ratio, lower the melting point of the slag, and improve its fluidity. The specific experimental results are shown in Figure 2. The lower curve shows the variation in slag melting point with changes in the CaO/SiO₂ ratio within the range of 0.3 to 1.0 under constant conditions. The upper curve shows the variation in slag melting point with changes in the Fe₂O₃/SiO₂ ratio within the range of 1.8 to 2.4 under constant conditions.

Additionally, the inclusion of river sand, oxygen flow, and coal also affects the melting point.

River sand, primarily composed of silica dioxide, contains a significant amount of silicon elements, which significantly influences the iron–silicon ratio and calcium–silicon ratio, thereby affecting the melting point. The addition of oxygen alters the valence state of iron in the slag, thereby affecting the iron–silicon ratio. Elements such as ash, sulfur, and sodium in coal may form compounds with high melting points inside the furnace during combustion, thereby affecting the melting point of the mixed slag.

In summary, based on relevant studies and the actual production parameters requirements of the smelting plant, this study selects the key process parameters that have a significant impact on slag melting point, namely, the iron–silicon ratio, calcium–silicon ratio, coal quantity, oxygen flow, and river sand usage.

2.2. Model Building

Regression of slag melting point is a complex nonlinear problem. Many machine learning methods can handle nonlinear problems [27], but traditional learning methods require a large amount of data for training to achieve high accuracy [28]. However, for practical factory production, collecting a large amount of qualified data is laborious and time-consuming. During the operation of the smelting furnace, the production parameters of different batches may change, requiring the collection of new data for training to obtain a new regression model. Traditional learning methods entail repetitive and extensive work, making it difficult to adapt to the rapid diagnostic requirements. Therefore, this study selects an improved gradient-boosting algorithm—the CatBoost algorithm.

CatBoost is a gradient-boosting algorithm based on symmetric decision trees, offering the advantages of minimal parameter setting, support for categorical feature data, and high classification accuracy [29]. In comparison to similar earlier algorithms such as XGBoost [30] and LightGBM [31], CatBoost incorporates enhanced functionalities. It automatically handles categorical features and adopts fully symmetric trees. It incorporates combination features specifically designed for categorical features and utilizes sorting boosting to counteract noise points in the training set, thus avoiding gradient estimation errors and addressing prediction shift issues to alleviate overfitting. Notably, its method for automatic handling of categorical features involves initially using one-hot encoding. If a maximum number of categorical values for one-hot encoding is set, when this limit is exceeded, CatBoost first conducts a statistical analysis of the categorical features to calculate the frequency of each categorical value, thereby generating new numerical features [32].

Hyperparameter optimization is pivotal in machine learning for enhancing the performance of models. While models come with default hyperparameters, fine-tuning them to a specific dataset can substantially boost their efficacy [33]. In recent years, various methods for hyperparameter optimization have emerged, such as the Tree-structured Parzen Estimator (TPE) for independent parameter sampling [34], grid search for finding the optimal combination through exhaustive search [35], and random search, which reduces computational costs based on grid search [36]. These methods have demonstrated promising performance across different domains.

TPE is a Bayesian optimization technique. Unlike grid or random search, which treats hyperparameters as isolated, TPE considers the relationship between hyperparameters and the objective function. The advantage of TPE over other methods lies in its efficiency. Constructing a probabilistic model of the objective function can suggest hyperparameters that are more likely to yield better results, hence reducing the number of trials [37].

In this study, we apply TPE to the popular machine-learning models CatBoost, XGBoost, and LightGBM. TPE adjusts these settings more effectively for specific data, reducing overfitting and enhancing the model’s ability to generalize to new data. This ultimately leads to improved overall model performance, as hyperparameters play an important role in how efficiently these algorithms handle the dataset. Table 1, Table 2 and Table 3 contain the hyperparameters used in the XGBoost, LightGBM, and CatBoost models after applying TPE hyperparameter tuning.

Using the relationship model regressed by machine learning algorithms, the iron–silicon ratio, calcium–silicon ratio, coal quantity, oxygen flow, and river sand usage are modeled as independent variables, and the slag melting point is the dependent variable. The mathematical description is as follows:

t_{s} = f (x_{1}, x_{2}, x_{3}, x_{4}, x_{5})

(1)

Based on this relationship model, we construct the objective function with the minimum slag melting point as the target. The mathematical expression is as follows:

\min t_{s} = f (x_{1}, x_{2}, x_{3}, x_{4}, x_{5})

(2)

where

x_{1}

~

x_{5}

represent the iron–silicon ratio, calcium–silicon ratio, coal quantity, oxygen flow, and river sand usage, respectively.

3. Improved Snake Optimization Algorithm for Melting Point Optimization Solution

3.1. Snake Optimization Algorithm Process

The Snake Optimizer (SO) is an algorithm proposed by Fatma A. Hashim and Abdelazim G. Hussien in 2022, based on simulating snake behavior. This algorithm is inspired by the hunting behavior of snakes. When hunting, snakes adjust their movements based on factors such as their body condition, the position of prey, and environmental conditions to achieve more efficient hunting. The Snake Optimization Algorithm utilizes this biomimetic principle to simulate the hunting process of snakes for optimizing the objective function in solution space.

Snake behavior is closely related to environmental conditions. When the temperature is low and food is abundant, male snakes may engage in combat to seek the attention of mates. However, female snakes have the power to decide whether to lay eggs, so this process can be considered a probabilistic event. When the temperature is high and food is scarce, both male and female snakes will search for food. When food becomes sufficient, snakes will engage in information sharing to ensure that companions who have not obtained enough food can quickly acquire it to meet their needs. When the temperature is low, snakes will engage in combat and mating behavior. Combat only occurs between snakes of the same gender, while mating occurs between snakes of opposite genders. These two states occur with certain probabilities. The formulas for calculating environmental temperature and food quantity are shown in Equations (3) and (4):

T e m p = e^{(\frac{- t}{T})}

(3)

Q = c_{1} * e^{(\frac{t - T}{T})}

(4)

In the equations,

c_{1}

is a constant with a value of 0.5.

The update formulas for the position coordinates of snakes during food searching, i.e., reconnaissance mode, are represented by Equations (5) and (6), corresponding to female and male snakes, respectively.

X_{i, f} = X_{r a n d, f} \pm c_{2} \times A_{f} \times ((x_{\max} - x_{\min}) \times r a n d + X_{\min})

(5)

X_{i, m} = X_{r a n d, m} \pm c_{2} \times A_{m} \times ((x_{\max} - x_{\min}) \times r a n d + X_{\min})

(6)

In the equations, A_m and A_f represent the ability of male and female snakes to search for food, respectively, and c₂ is a constant with a value of 0.05.

In the case of abundant food, the update formulas for the position coordinates of snakes during combat mode among the same gender are represented by Equations (7) and (8):

X_{i, m} = X_{i, m} + c_{3} \times F M \times r a n d \times (Q \times X_{b e s t, f} - X_{i, m})

(7)

X_{i, f} = X_{i, f} + c_{3} \times F F \times r a n d \times (Q \times X_{b e s t, m} - X_{i, f})

(8)

In the equations, FM and FF represent the combat abilities of males and females, respectively, while c₃ is a constant with a value of 2.

In the case of abundant food, when snakes enter mating mode between opposite sexes, the position coordinates updating formulas are shown in Equations (9) and (10):

X_{i, m} = X_{i, m} + c_{3} \times M_{m} \times r a n d \times (Q \times X_{i, f} - X_{i, m})

(9)

X_{i, f} = X_{i, f} + c_{3} \times M_{f} \times r a n d \times (Q \times X_{i, m} - X_{i, f})

(10)

The variables

M_{m}

and

M_{f}

represent the mating abilities of males and females, respectively, while Q denotes the quantity of food.

After mating, there is an option to choose whether to proceed with incubation. If incubation is chosen, the worst-performing male and female individuals are selected and replaced, as shown in Equations (11) and (12):

X_{w o r s t, m} = X_{\min} + r a n d \times (X_{\max} - X_{\min})

(11)

X_{w o r s t, f} = X_{\min} + r a n d \times (X_{\max} - X_{\min})

(12)

3.2. Improved Snake Optimization Algorithm

The SO algorithm, as a heuristic algorithm, has shown promising results in solving optimization problems due to its novel imitation approach. However, the limitations of biological behaviors in nature still lead to its drawbacks, such as rapidly converging to local optima and insufficient accuracy. This section proposes an Improved Snake Optimization (ISO) algorithm to address these shortcomings. The algorithm integrates three enhancement strategies based on the limitations of the SO algorithm.

3.2.1. Halton Sequence Population Initialization

The original SO algorithm employs a common random initialization formula to initialize the population, as shown in Equation (13):

X_{i} = X_{\min} + r \times (X_{\max} - X_{\min})

(13)

This method, although providing certain randomness in the initial population, does not guarantee a uniform distribution of individuals across the search space. Often, there is a problem of either excessive concentration or dispersion of individuals in the population, leading to a significant portion of the search space being overlooked and reducing the efficiency of population search. Therefore, this paper introduces the Halton sequence to uniformly initialize the population, ensuring a more even distribution of individuals throughout the entire solution space. This enhances the algorithm’s search efficiency, improves the speed of discovering high-quality solutions, and accelerates the convergence of the algorithm.

The Halton sequence is a low-discrepancy sequence constructed using a deterministic method based on prime numbers. It ensures a more uniform distribution of the population across the entire solution space, thereby increasing the diversity of the population. This accelerates the convergence speed and improves the convergence accuracy of the algorithm. The implementation process of the two-dimensional Halton sequence involves selecting two prime numbers as bases corresponding to two dimensions. Each dimension iteratively divides and zigzags within the range (0,1) based on the respective base, forming a set of non-repeating and evenly spaced points. The mathematical model for the division process is represented by Equations (14)–(16):

n = \sum_{i = 0}^{m_{0}} a_{i} p_{1}^{i} = a_{0} + a_{1} p_{1}^{1} + \dots + a_{m_{0}} p_{1}^{m_{0}}, a_{i} \in \{0, 1, 2, \dots, p_{1} - 1\}

(14)

ϕ_{p_{1}} (n) = a_{0} p_{1}^{- 1} + a_{1} p_{1}^{- 2} + \dots + a_{m_{0}} p_{1}^{- m_{0} - 1}

(15)

H (n) = [ϕ_{p_{1, 1}} (n), ϕ_{1, 2} (n)]

(16)

In the equations, n represents the ordinal number of the Halton sequence; p₁ denotes the base of the Halton sequence, which is a prime number greater than or equal to 2; ϕ_p_1(n) represents the defined sequence function; and H(n) denotes the two-dimensional uniform sequence.

The distributions of the initial population generated by the random initialization function and the Halton sequence are shown in Figure 3.

Comparing the initial population generated by the random initialization function with the distribution map of the Halton sequence, it was found that under the condition of an initial population size of 1000, there was a significant dispersion phenomenon in the left part of the initial population generated by the random initialization function, resulting in large gaps, and a clear clustering phenomenon in the right part. Many individuals had obvious overlapping occlusion. The population generated by the Halton sequence is more evenly distributed in the entire space, and compared to the random initialization function, there is no obvious clustering or dispersion phenomenon, presenting an overall uniform trend. Therefore, by comparing the two graphs, it can be concluded that the initialization population generated by the Halton sequence produces better results.

3.2.2. Elite Reverse Learning Mechanism

In many cases, the process of solving a problem typically starts from scratch or with a random value and gradually approaches the optimal solution. For instance, this applies to parameters such as weights in neural networks, population parameters in swarm intelligence algorithms, kernel parameters in support vector machines, and so on. If the initial random value happens to be close to the optimal solution, the problem can be quickly resolved. However, there is also the worst-case scenario where the random value is far from the optimal solution, resulting in a significant amount of time spent during the solving process. Without prior knowledge, obtaining a good random value at the beginning is unlikely. Moreover, from a logical perspective, the problem-solving process can explore various directions. By incorporating reverse solutions of the optimal solution as feasible solutions, thereby expanding the search space, the efficiency of optimization can be improved. This constitutes the core theory behind reverse learning.

To enhance the global optimization capability of the SO algorithm and prevent it from getting stuck in local optima, an elite reverse learning strategy is introduced. During the process of generating offspring for snake individuals, a reverse learning strategy is applied to elite individuals to generate corresponding reverse solutions. These reverse solutions are then compared with the original solutions, and individuals with higher fitness are selected to proceed to the subsequent steps. This accelerates the convergence speed of the algorithm and expands its search range. Let the original solution of the current snake individual in the D-dimensional search space be represented by:

X_{i} = (x_{1}, x_{2}, \dots, x_{D})

(17)

The reverse solution of the original solution is represented as follows:

\bar{X_{i}} = (\bar{x_{1}}, \bar{x_{2}}, \dots \bar{x_{D}})

(18)

\bar{x_{i}} = k \times (x_{i \max} + x_{i \min}) - x_{i}

(19)

The snake individual with the highest fitness value within the current population is referred to as the elite individual, as shown in Equation (20):

f (X_{i}^{e}) = Max (f (x_{i}))

(20)

The construction of reverse solutions for elite individuals can increase the diversity of the population. Selecting the optimal value between the current candidate solution and the reverse solution for further steps enhances the algorithm’s convergence ability and broadens its search range. The calculation method for the elite reverse solution is shown in Equations (21) and (22):

\bar{X_{i}^{e}} = (\bar{x_{1}^{e}}, \bar{x_{2}^{e}}, \dots, \bar{x_{3}^{e}})

(21)

\bar{x_{i}^{e}} = k \times (x_{i \max} + x_{i \min}) - x_{i}^{b e s t}

(22)

In the equations,

X_{i}^{e}

represents the current elite individual,

\bar{X_{i}^{e}}

denotes the corresponding reverse solution of the current elite individual, and k is a uniformly distributed random function over the interval [0, 1].

3.2.3. Dynamic Weights Cauchy–Gauss Mutation

In the snake optimization algorithm, after the mating behavior of male and female snakes is completed, the female snake will lay eggs and hatch, resulting in new offspring. The new snakes will replace inferior individuals to supplement the population and initiate a new cycle of iterations. In the new iteration, to prevent the algorithm from falling into local optima and accelerate convergence as much as possible, this paper introduces a dynamic weighted Gaussian mutation strategy to enhance the algorithm’s ability to escape local optima.

The Gaussian distribution is symmetric about the mean. This implies that data on both sides of the mean exhibit similar distribution characteristics. Therefore, the use of Cauchy mutation to enhance global exploration capability and utilizing Gaussian mutation to enhance local exploitation capability near the current best solution is proposed. The modified Cauchy–Gaussian mutation formula is shown in Equations (23)–(25):

m u t a t i o n (m) = m \times [1 + α_{1} C a u c h y (0, 1) + α_{2} G a u s s (0, 1)]

(23)

α_{1} = 1 - {(\frac{t}{T_{\max}})}^{c}

(24)

α_{2} = {(\frac{t}{T_{\max}})}^{c}

(25)

In the equations, m represents the position of the best individual in the current iteration round, mutation(m) represents the position of the individual after mutation, Cauchy(0,1) is a random variable following the standard Cauchy distribution, and Gauss(0,1) is a random variable following the standard Gaussian distribution. α₁ and α₂ are adaptive adjustment factors, and c is a parameter controlling the rate of change.

As the number of iterations increases,

α_{1}

gradually decreases from 1 to close to 0, while

α_{2}

ncreases from 0 to 1. In the early stages of iteration, the adaptive adjustment factor allows Cauchy mutation to play a greater role, primarily expanding the population’s search range. In the later stages of algorithm iteration, the effect of Cauchy mutation weakens, and Gaussian mutation takes over, allowing individuals to search in a small range, enhancing the population’s local search capability, and enabling the algorithm to converge efficiently and rapidly.

3.3. Improved Snake Optimization Algorithm Process

The implementation steps of the proposed Improved Snake Optimization (ISO) algorithm are as follows, and the algorithm flowchart is shown in Figure 4.

Step 1: Set the initial parameters of the algorithm, including population size N maximum number of iterations T, and dimension of the problem D.

Step 2: Initialize the snake population using the Halton sequence, as shown in Equations (14)–(16), and divide the initialized population into male and female groups.

Step 3: Calculate the temperature Temp and food quantity Q according to Equations (3) and (4).

Step 4: If the food quantity Q is less than 0.25, enter the foraging mode to search for food, and update the coordinates according to Equations (5) and (6).

Step 5: If the food quantity Q is greater than 0.25 and the temperature Temp is greater than 0.6, indicating a hot state, the snakes will continue foraging in the foraging mode.

Step 6: If the food quantity Q is greater than 0.25 but the temperature Temp is less than 0.6, and if a random number rand is greater than 0.6, enter the fighting mode to engage with snakes of the same gender, and update the coordinates according to Equations (7) and (8).

Step 7: If the random number rand is less than 0.6, enter the mating mode, and update the coordinates according to Equations (9) and (10).

Step 8: Sort the fitness of the male and female populations separately. Select elite individuals based on Equation (20), and generate elite reverse solutions using Equations (21) and (22).

Step 9: Compare the elite reverse solution with the original solution, select the one with higher fitness, perform Cauchy–Gaussian mutation on this solution according to Equation (23), update the coordinates, and proceed to the next iteration.

Step 10: Check if the termination condition is met (t < T). If not, go back to Step 3; if yes, terminate the algorithm and output the optimal solution.

Figure 4. Flowchart of Improved Snake Optimization Algorithm.

4. Results and Discussions

The experimental environment for this study is a 64-bit Windows 10 operating system with an AMD Ryzen 5 3600 6-core processor at 3.60 GHz. The development software is Pycharm and the programming language is Python 3.8. The experiments in this study are divided into four parts.

(1): To compare the accuracy of the CatBoost model with the XGBoost and LightGBM models, experiments are conducted under the default parameters and the parameters optimized by TPE, respectively. The accuracy and superiority of the chosen modeling method are verified.
(2): The ISO algorithm is compared with GWO (Grey Wolf Optimization), HHO (Harris Hawks Optimization), SOA (Sine Cosine Algorithm), SO (Snake Optimization), and SCA (Sine Cosine Algorithm) on 10 test functions respectively to validate the effectiveness of the proposed algorithm.
(3): Ablation experiments are conducted and the results are compared with those of the complete ISO algorithm on test functions to validate the superiority of the proposed improvement strategy.
(4): Simulation experiments with ISO and the aforementioned algorithms are performed on actual production data, and the results are compared with those of actual production data to validate the effectiveness and practicality of the improved algorithm in the engineering context.

4.1. Modeling Effect and Analysis

To compare the modeling effect of each algorithm, four evaluation indexes, namely, coefficient of determination (R²), root mean square error (RMSE), mean absolute error (MAE), and mean relative error (MRE), are selected for comparison in this study.

The coefficient of determination (R²) measures the proportion of the total variation in the dependent variable that is explained by the model. Its value ranges from 0 to 1, with values closer to 1 indicating stronger explanatory power of the model. The formula is shown in Equation (26).

R^{2} = 1 - \frac{\sum (y_{i} - {\hat{y}}_{i})^{2}}{\sum {(y_{i} - \bar{y})}^{2}}

(26)

RMSE measures the deviation between the model’s predicted value and the true value, with smaller values indicating higher predictive accuracy of the model. The formula is shown in Equation (27).

R M S E = \sqrt{\frac{1}{n} \sum_{i = 1}^{n} {(y_{i} - {\hat{y}}_{i})}^{2}}

(27)

MAE measures the mean absolute error between the predicted and actual values, with smaller values indicating better predictions from the model. The formula is shown in Equation (28).

M A E = \frac{1}{n} \sum_{i = 1}^{n} |y_{i} - {\hat{y}}_{i}|

(28)

MRE measures the average proportion of prediction error relative to the actual value, with smaller values indicating smaller relative errors and better performance of the model. The formula is shown in Equation (29).

M R E = \frac{1}{n} \sum_{i = 1}^{n} |\frac{y_{i} - {\hat{y}}_{i}}{y_{i}}|

(29)

Catboost, XGBoost, and LightGBM with default parameters and TPE-Catboost, TPE-XGBoost, and TPE-LightGBM optimized with TPE hyperparameters are compared, respectively, and the results of the comparisons of each index are shown in Table 4.

As shown in Table 4, the coefficient of determination (R²) of the XGBoost, LightGBM, and CatBoost models with the default hyperparameters consistently hovers around 89% on the actual production dataset of the smelter. Subsequent optimization of the model hyperparameters using the Tree-structured Parzen Estimator (TPE) resulted in an increase in the average R² to 93%, which is a 4% improvement over the default parameters. Additionally, this optimization resulted in a reduction of 1.03 °C in the Root Mean Square Error (RMSE), 0.87 °C in the Mean Absolute Error (MAE), and 1.59% in the Mean Relative Error (MRE). These results highlight the efficacy of TPE-based hyperparameter tuning in improving model accuracy.

Among the three hyperparameter-optimized models, the average R² of the TPE-CatBoost model is better than the comparable models by 1.5%. In addition, the RMSE, MAE, and MRE were also significantly reduced. In summary, the TPE-CatBoost model shows better adaptability in predicting the melting point of slag in zinc smelting, which plays a positive role in future research on melting point optimization in the industrial field.

4.2. Analysis of Improved Snake Optimization Algorithm

To validate the optimization performance of the proposed ISO algorithm, this study compares it with Grey Wolf Optimization (GWO), Harris Hawks Optimization (HHO), Seagull Optimization Algorithm (SOA), Snake Optimization Algorithm (SO), and Sine Cosine Algorithm (SCA). Ten typical test functions are selected for experimentation, comprising four unimodal functions F₁–F₄ three multimodal functions F₅–F₇, and three fixed-dimensional multimodal functions F₈–F₁₀. The population size is set to n = 30, and each algorithm runs independently for 50 iterations. The main parameters of the algorithms are presented in Table 5, while the test functions are listed in Table 6.

The experimental results comparing the six algorithms across the 10 test functions are presented in Table 7, with the evaluation criteria being the mean (Mean) and standard deviation (std):

From Table 5 and Figure 5, it is evident that when solving functions F1–F9, the ISO algorithm exhibits superior optimization results and stability compared to other algorithms. It consistently converges to the theoretical optimum value of 0 across several functions. For single-peak functions F1–F4 and multi-peak functions F5 and F7, the ISO algorithm consistently converges to the theoretical optimum value of 0, showing a significantly larger magnitude difference compared to other algorithms. Regarding multi-peak function F6, both the ISO algorithm and the HHO algorithm achieve results closest to the theoretical optimum value, However, according to Figure 5f, it is observed that the ISO algorithm exhibits significantly improved convergence speed compared to HHO, enabling it to rapidly attain the optimal solution. For multi-peak function F8, the ISO algorithm is closer to the theoretical optimum value than the other algorithms. On the fixed-dimensional multi-peak function F9, the ISO algorithm, along with the GWO and SO algorithms, achieves the theoretical optimum value. However, the ISO algorithm has a smaller standard deviation than the other two algorithms, and its convergence speed is significantly improved compared to the GWO and SO algorithms, demonstrating the stability of the ISO algorithm. These test functions adequately demonstrate the clear advantages of the ISO algorithm over other algorithms when solving benchmark test functions. Nevertheless, there is still room for improvement in the ISO algorithm. For example, in function F10, the optimization accuracy is not as good as that of the HHO algorithm. Therefore, for optimizing complex functions, the proposed ISO algorithm requires further refinement to enhance its robustness.

4.3. Analysis of Ablation Experiments

To verify the effectiveness of the proposed improvement strategies, this study sets the ISO1 algorithm as the control algorithm without Halton sequence population initialization, the ISO2 algorithm as the control algorithm without the local chaotic mapping search strategy, and the ISO3 algorithm as the control algorithm without the dynamic weight Cauchy–Gaussian mutation in the position update method. The average values of 50 runs for each of the 10 test functions are calculated, and the experimental results are shown in Table 8:

From Table 8, it can be observed that the Halton sequence, local chaotic mapping search strategy, and dynamic weight Cauchy–Gaussian mutation in the position update method used in this study all contribute to some extent to the improvement in performance compared to the standard SO algorithm. Among them, the algorithmic effect of ISO1 is more significant than that of ISO2 and ISO3 algorithms, indicating that the simultaneous application of the local chaotic mapping search strategy and the Cauchy–Gaussian mutation improves the algorithm’s optimization accuracy. Comparing the ISO1, ISO2, ISO3, and SO algorithms, it can be concluded that the comprehensive ISO algorithm proposed in this study, integrating the three improvement strategies, outperforms algorithms without a specific improvement strategy in terms of precision and stability. It also suggests that a single strategy has a limited impact on the algorithm’s performance, and the integration of three strategies optimizes the algorithm in multiple aspects, leading to a significant improvement in performance across various test functions, thus demonstrating the effectiveness of the proposed strategies.

4.4. Actual Production Data Experiment

Using real production data from a non-ferrous metal smelting plant in China, simulation tests were conducted to compare the performance of the ISO, SO, HHO, GWO, SOA, and SCA algorithms. The algorithm parameters were set according to Table 3. The comparison of slag melting point optimization results is shown in Table 9, and the convergence curve of the zinc smelting slag melting point reduction is depicted in Figure 6.

Analysis of Table 9 and Figure 6 reveals that in terms of convergence accuracy, the improved ISO algorithm can reduce the slag melting point to 1259.8 °C, which is the lowest among recent algorithms such as HHO and GWO that have shown promising results in this field. In terms of convergence speed, the ISO algorithm demonstrates a significant improvement compared to GWO, SOA, and HHO. Although its convergence speed is similar to that of the SO and SCA algorithms, the ISO algorithm achieves a much higher precision. It meets the real-time and precision requirements of practical production and has achieved good results, effectively reducing the melting point of the mixed slag, thereby reducing the cost of the smelting plant and enhancing its economic benefits.

To validate the authenticity and feasibility of the optimization results, experimental slag blending tests were conducted using raw materials from the factory, and the actual slag melting point was determined using a slag melting point and melting speed tester RDS-05. The actual experimental measurement of the slag melting point was found to be 1253 °C, which differs from the optimized result by 6.8 °C, with an error of approximately 0.54%, within the normal range of error. Therefore, the improved snake optimization algorithm demonstrates feasibility and authenticity in the slag melting point optimization problem. This method can assist metallurgical production personnel in selecting process parameters quickly and effectively. Given the similarity between process parameter optimization problems in other metallurgical fields and zinc smelting slag optimization problems, this algorithm also holds great potential for development in related fields.

5. Conclusions

This study focuses on the zinc smelting workshop of a metallurgical plant in Shaanxi Province, China, using actual production data from the workshop. The objective is to reduce the melting point of mixed slag during the smelting process, thereby lowering production costs. This research holds significant practical importance for the metallurgical industry.

This study demonstrates the effectiveness of using the CatBoost method with hyperparameter optimization to model slag melting points. Compared to the traditional phase diagram method used in the metallurgical industry, CatBoost provides higher accuracy, does not require specialized prior knowledge, and excels in handling complex problems. Unlike other ensemble learning models such as XGBoost and LightGBM, CatBoost autonomously handles categorical features, reducing the complexity of feature preprocessing, and is less affected by outliers and noise, resulting in a more stable and accurate model. Furthermore, we addressed the limitations of random initialization in the Snake Optimization (SO) algorithm by proposing the Halton sequence population initialization strategy. To counteract the tendency of SO to converge to local optima, we introduced the Elite Backward Learning strategy. Additionally, to improve the replacement of subsequent generations of individuals, a dynamic Cauchy–Gaussian mutation strategy that changes dynamically with the iteration number was incorporated. The performance of the Improved Snake Optimization (ISO) algorithm was evaluated using 10 classical test functions, encompassing unimodal, multimodal, and fixed-dimensional multimodal functions. Experimental and statistical results indicate that the ISO algorithm outperforms five existing state-of-the-art optimization algorithms in terms of accuracy, convergence speed, and stability. Finally, the ISO algorithm was applied to the optimization of zinc smelting slag melting point, where it achieved superior results to the five state-of-the-art algorithms, validating its effectiveness in real-world applications.

In conclusion, the proposed method of TPE-Catboost modeling followed by optimization using the ISO algorithm demonstrates superior effectiveness. By comparing various evaluation metrics, it is demonstrated that TPE-Catboost achieves higher modeling accuracy. Through testing on various types of test functions and real-world engineering problems, the ISO algorithm has shown good performance across different optimization problems, validating its robustness and effectiveness. In future research, efforts will be focused on advancing this method further to address the larger context of energy conservation and emission reduction, aiming to accomplish more complex and meaningful tasks.

Author Contributions

Conceptualization, Z.L. and Y.K.; methodology, Z.L. and Y.K.; software, Z.L.; validation, Z.L.; formal analysis, Z.L. and Y.K.; investigation, Z.L.; resources, Z.L. and Y.K.; data curation, Z.L.; writing—original draft preparation, Z.L. and Y.K.; writing—review and editing, Z.L. and Y.K.; visualization, Z.L.; supervision, Y.K. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to the data being part of ongoing research.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Xu, D.-M.; Fu, R.-B.; Tong, Y.-H.; Shen, D.-L.; Guo, X.-P. The Potential Environmental Risk Implications of Heavy Metals Based on Their Geochemical and Mineralogical Characteristics in the Size-Segregated Zinc Smelting Slags. J. Clean. Prod. 2021, 315, 128199. [Google Scholar] [CrossRef]
Tyszka, R.; Pietranik, A.; Potysz, A.; Kierczak, J.; Schulz, B. Experimental Simulations of ZnPb Slag Weathering and Its Impact on the Environment: Effects of Acid Rain, Soil Solution, and Microbial Activity. J. Geochem. Explor. 2021, 228, 106808. [Google Scholar] [CrossRef]
Kaya, M.; Hussaini, S.; Kursunoglu, S. Critical Review on Secondary Zinc Resources and Their Recycling Technologies. Hydrometallurgy 2020, 195, 105362. [Google Scholar] [CrossRef]
Wang, Z.; Chen, L.; Qin, Z.; Yang, K.; Liang, B.; Zhang, G.; Liu, C.; Luo, D. A Green and Efficient Route for Simultaneous Recovery of Low Valence of Vanadium and Chromium, Titanium and Iron from Vanadium Slag. Resour. Conserv. Recycl. 2022, 178, 106046. [Google Scholar] [CrossRef]
Zheng, Y.-X.; Liu, W.; Qin, W.-Q.; Kong, Y.; Luo, H.-L.; Han, J.-W. Mineralogical Reconstruction of Lead Smelter Slag for Zinc Recovery. Sep. Sci. Technol. 2014, 49, 783–791. [Google Scholar] [CrossRef]
Ma, Y.; Li, C.; Yan, J.; Yu, H.; Kan, H.; Yu, W.; Zhou, X.; Meng, Q.; Dong, P. The Release Analysis of As and Cr Metals in Lead-Zinc Smelting Slag: Mineralogical Analysis, Bioavailability and Leachability Analysis. Environ. Res. 2023, 229, 115751. [Google Scholar] [CrossRef] [PubMed]
Zhang, P.; Muhammad, F.; Yu, L.; Xia, M.; Lin, H.; Huang, X.; Jiao, B.; Shiau, Y.; Li, D. Self-Cementation Solidification of Heavy Metals in Lead-Zinc Smelting Slag through Alkali-Activated Materials. Constr. Build. Mater. 2020, 249, 118756. [Google Scholar] [CrossRef]
Xu, D.-M.; Fu, R.-B. The Mechanistic Understanding of Potential Bioaccessibility of Toxic Heavy Metals in the Indigenous Zinc Smelting Slags with Multidisciplinary Characterization. J. Hazard. Mater. 2022, 425, 127864. [Google Scholar] [CrossRef]
Song, S.; Sun, W.; Wang, L.; Liu, R.; Han, H.; Hu, Y.; Yang, Y. Recovery of Cobalt and Zinc from the Leaching Solution of Zinc Smelting Slag. J. Environ. Chem. Eng. 2019, 7, 102777. [Google Scholar] [CrossRef]
Hu, H.; Deng, Q.; Li, C.; Xie, Y.; Dong, Z.; Zhang, W. The Recovery of Zn and Pb and the Manufacture of Lightweight Bricks from Zinc Smelting Slag and Clay. J. Hazard. Mater. 2014, 271, 220–227. [Google Scholar] [CrossRef]
Huang, J.; Wang, Y.; Zhou, G.; Gu, Y. Investigation on the Effect of Roasting and Leaching Parameters on Recovery of Gallium from Solid Waste Coal Fly Ash. Metals 2019, 9, 1251. [Google Scholar] [CrossRef]
Liang, J.; Zhang, Y.; Meng, A.; Li, Q. Application of neural network method to optimize the prediction of ash melting point in waste incineration simulation. J. Environ. Eng. 2009, 3, 2087–2090. [Google Scholar]
Prostakova, V.; Shishin, D.; Shevchenko, M.; Jak, E. Thermodynamic Optimization of the Al₂O₃–FeO–Fe₂O₃–SiO₂ Oxide System. Calphad 2019, 67, 101680. [Google Scholar] [CrossRef]
Shi, X.; Cen, K. Optimization of coal ash melting point based on generalized regression neural network and genetic algorithm. J. Zhejiang Univ. 2005, 1189–1192+1242. [Google Scholar]
Bao, J.; Du, W.; Wu, H.; Yue, Z. Novel Microwave Dielectric Ceramics Based on CaO–MoO₃–La₂O₃ Pseudo-ternary Phase Diagram and the Design of Patch Antenna. J. Am. Ceram. Soc. 2024, 107, 704–718. [Google Scholar] [CrossRef]
Liu, C.; Xie, X.; Qiu, J.; Li, W.; Huo, G. The Phase Diagram of a CaO-Al₂O₃-VOx Slag System under Argon Atmosphere at 1500 °C. Metals 2024, 14, 108. [Google Scholar] [CrossRef]
Li, Y.; Lin, X.; Liu, J. An Improved Gray Wolf Optimization Algorithm to Solve Engineering Problems. Sustainability 2021, 13, 3208. [Google Scholar] [CrossRef]
Shim, J.; Kim, I. Evolutionary Algorithm to Optimize Process Parameters of Al/Steel Magnetic Pulse Welding. Appl. Sci. 2023, 13, 12881. [Google Scholar] [CrossRef]
Albadr, M.A.; Tiun, S.; Ayob, M.; AL-Dhief, F. Genetic Algorithm Based on Natural Selection Theory for Optimization Problems. Symmetry 2020, 12, 1758. [Google Scholar] [CrossRef]
Deng, W.; Zhang, X.; Zhou, Y.; Liu, Y.; Zhou, X.; Chen, H.; Zhao, H. An Enhanced Fast Non-Dominated Solution Sorting Genetic Algorithm for Multi-Objective Problems. Inf. Sci. 2022, 585, 441–453. [Google Scholar] [CrossRef]
Hou, Y.; Gao, H.; Wang, Z.; Du, C. Improved Grey Wolf Optimization Algorithm and Application. Sensors 2022, 22, 3810. [Google Scholar] [CrossRef]
Nadimi-Shahraki, M.H.; Taghian, S.; Mirjalili, S. An Improved Grey Wolf Optimizer for Solving Engineering Problems. Expert Syst. Appl. 2021, 166, 113917. [Google Scholar] [CrossRef]
Hashim, F.A.; Hussien, A.G. Snake Optimizer: A Novel Meta-Heuristic Optimization Algorithm. Knowl.-Based Syst. 2022, 242, 108320. [Google Scholar] [CrossRef]
Ruan, B.; Jiao, F.; Liu, W.; Pan, Z.; Qin, W. Utilization and Detoxification of Gypsum Sludge by Replacing Limestone in Reduction Smelting of High Lead Slag. J. Cent. South Univ. 2023, 30, 1145–1157. [Google Scholar] [CrossRef]
Zhou, S.; Wang, M.; Tan, H.; Wang, X.; Yang, W.; Xiong, X.; Yang, F. Evaluation of Aluminum Ash in Alleviating the Ash Deposition of High-Sodium and High-Iron Coal. Fuel 2020, 273, 117701. [Google Scholar] [CrossRef]
Song, J.; Xi, W.; Niu, L. Study on the Activity Model of PbO-ZnO-FeO-Fe₂O₃-SiO₂-CaO Six-Component High-Lead Slag System. Metals 2023, 13, 734. [Google Scholar] [CrossRef]
Worden, K.; Green, P.L. A Machine Learning Approach to Nonlinear Modal Analysis. Mech. Syst. Signal Process. 2017, 84, 34–53. [Google Scholar] [CrossRef]
Aldoseri, A.; Al-Khalifa, K.N.; Hamouda, A.M. Re-Thinking Data Strategy and Integration for Artificial Intelligence: Concepts, Opportunities, and Challenges. Appl. Sci. 2023, 13, 7082. [Google Scholar] [CrossRef]
Hancock, J.T.; Khoshgoftaar, T.M. CatBoost for Big Data: An Interdisciplinary Review. J. Big Data 2020, 7, 94. [Google Scholar] [CrossRef]
Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, USA, 13 August 2016; pp. 785–794. [Google Scholar]
Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.-Y. Lightgbm: A Highly Efficient Gradient Boosting Decision Tree. Adv. Neural Inf. Process. Syst. 2017, 30, 3149–3157. [Google Scholar]
Dong, L.; Zeng, W.; Wu, L.; Lei, G.; Chen, H.; Srivastava, A.K.; Gaiser, T. Estimating the Pan Evaporation in Northwest China by Coupling CatBoost with Bat Algorithm. Water 2021, 13, 256. [Google Scholar] [CrossRef]
Imani, M.; Arabnia, H.R. Hyperparameter Optimization and Combined Data Sampling Techniques in Machine Learning for Customer Churn Prediction: A Comparative Analysis. Technologies 2023, 11, 167. [Google Scholar] [CrossRef]
Bergstra, J.; Yamins, D.; Cox, D. Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures. In Proceedings of the International Conference on Machine Learning, Atlanta, GA, USA, 16–21 June 2013; pp. 115–123. [Google Scholar]
Syarif, I.; Prugel-Bennett, A.; Wills, G. SVM Parameter Optimization Using Grid Search and Genetic Algorithm to Improve Classification Performance. TELKOMNIKA (Telecommun. Comput. Electron. Control) 2016, 14, 1502. [Google Scholar] [CrossRef]
Liashchynskyi, P.; Liashchynskyi, P. Grid Search, Random Search, Genetic Algorithm: A Big Comparison for NAS. arXiv 2019, arXiv:191206059. [Google Scholar]
Bergstra, J.; Bardenet, R.; Bengio, Y.; Kégl, B. Algorithms for Hyper-Parameter Optimization. Adv. Neural Inf. Process. Syst. 2011, 24. [Google Scholar] [CrossRef]

Figure 1. Zinc smelting slag treatment process flow chart.

Figure 2. The effect of Fe–Si ratio and Ca–Si ratio on melting point.

Figure 3. The Random Initialization and Halton Sequence Initialization.

Figure 5. Test functions adaptation curve.

Figure 6. Comparison of slag melting point optimization results.

Table 1. XGBoost model parameters.

Parameter	Description	Value
booster	Type of booster	dart
eta	learning rate	0.2177531
max_depth	Maximum depth of the tree	7
alpha	L1 regularization weight	0.0063781
lambda	L2 regularization weight	0.0286115

Table 2. LightGBM model parameters.

Parameter	Description	Value
boosting_type	Type of booster	dart
num_leaves	Maximum number of leaves in one tree	61
max_depth	Maximum depth of the tree	10
lambda_l1	L1 regularization weight	0.0006519
lambda_l2	L2 regularization weight	0.0041792
feture_fraction	Part of the features on each iteration	0.5311784
bagging_freq	Frequency for bagging	7

Table 3. Catboost model parameters.

Parameter	Description	Value
booster	Type of booster	Plain
num_leaves	Maximum number of leaves in one tree	61
max_depth	Maximum depth of the tree	9
colsample_bylevel	Subsampling rate per level for each tree	0.3724771
learning_rate	learning rate	0.2411486

Table 4. Model Accuracy Comparison.

Model	R²/(%)	RMSE/°C	MAE/°C	MRE/(%)
XGBoost	88.99	8.12	7.19	10.09
LightGBM	89.57	8.02	7.02	9.81
Catboost	89.91	8.02	6.97	9.41
TPE-Catboost	93.89	7.02	6.19	7.88
TPE-XGboost	92.55	7.41	6.33	8.18
TPE-LightGBM	92.81	7.29	6.38	8.05

Table 5. Algorithm Main Parameters.

Algorithm	Parameters
ISO	$c_{1} = 0.5, c_{2} = 0.05, c_{3} = 2$
SO	$c_{1} = 0.5, c_{2} = 0.05, c_{3} = 2$
GWO	$α = 2 ~ 0$
HHO	$J \in [0, 2]$
SOA	$F_{c} = 2 ~ 0$
SCA	-

Table 6. Test Functions.

Function	Value Range	Minimum
$F_{1} (x) = \sum_{i = 1}^{n} x_{i}^{2}$	[−100,100]	0
$F_{2} (x) = \sum_{i = 1}^{n} \|x_{i}\| + \prod_{i = 1}^{n} \|x_{i}\|$	[−1.28,1.28]	0
$F_{3} (x) = {\sum_{i = 1}^{d} (\sum_{j = 1}^{i} x_{j})}^{2}$	[−100,100]	0
$F_{4} (x) = \max_{i} \{\|x_{i}\|, 1 \leq i \leq n\}$	[−100,100]	0
$F_{5} (x) = \sum_{i = 1}^{n} [x_{i}^{2} - 10 \cos (2 π x_{i}) + 10]$	[−5.12,5.12]	0
$F_{6} (x) = - 20 \exp (- 0.2 \sqrt{\frac{1}{n} \sum_{i = 1}^{n} x_{i}^{2}}) - \exp (\frac{1}{n} \sum_{i = 1}^{n} \cos (2 π x_{i}))$	[−32,32]	0
$F_{7} (x) = \frac{1}{4000} \sum_{i = 1}^{n} x_{i}^{2} - \prod_{i = 1}^{n} \cos (\frac{x_{i}}{\sqrt{i}}) + 1$	[−600,600]	0
$F_{8} (x) = - \sum_{i = 1}^{4} c_{i} \exp (- \sum_{j = 1}^{6} a_{i j} {(x_{j} - p_{i j})}^{2})$	[0,1]	−3.32
$F_{9} (x) = - \sum_{i = 1}^{5} {[(X - a_{i}) {(X - a_{i})}^{T} + c_{i}]}^{- 1}$	[0,10]	−10.1532
$F_{10} (x) = \sum_{i = 1}^{11} {(a_{i} - \frac{x_{1} (b_{i}^{2} + b_{i} x_{2})}{b_{i}^{2} + b_{i} x_{3} + x_{4}})}^{2}$	[−5,5]	0.0003

Table 7. Comparison of Fitness of Test Functions.

Function	Measurements	Algorithm
Function	Measurements	ISO	HHO	GWO	SOA	SCA	SO
F₁	Mean	0.0000 × 10⁰⁰	5.4142 × 10⁻¹¹²	2.7749 × 10⁻³⁷	8.1617 × 10⁻¹²	2.1468 × 10⁻⁰⁹	8.1391 × 10⁻⁸²
F₁	Std	0.0000 × 10⁰⁰	1.6702 × 10⁻¹¹¹	3.5411 × 10⁻³⁷	1.8816 × 10⁻¹¹	1.8529 × 10⁻⁰⁸	3.4792 × 10⁻⁸¹
F₂	Mean	0.0000 × 10⁰⁰	5.4409 × 10⁻⁶⁵	1.0012 × 10⁻²⁶	6.1151 × 10⁻¹⁰	5.5126 × 10⁻⁰⁷	4.7886 × 10⁻⁵¹
F₂	Std	0.0000 × 10⁰⁰	3.3781 × 10⁻⁶⁴	7.5561 × 10⁻²⁷	5.9891 × 10⁻¹⁰	5.6172 × 10⁻⁰⁷	8.6612 × 10⁻⁵¹
F₃	Mean	0.0000 × 10⁰⁰	2.9952 × 10⁻⁷³	1.1671 × 10⁻¹⁴	4.1381 × 10⁻⁰⁵	7.1172 × 10⁻⁰⁷	3.0034 × 10⁻⁶⁵
F₃	Std	0.0000 × 10⁰⁰	7.5101 × 10⁻⁷³	9.3171 × 10⁻¹⁵	2.5571 × 10⁻⁰⁵	4.7199 × 10⁻⁰⁷	2.7118 × 10⁻⁶⁴
F₄	Mean	0.0000 × 10⁰⁰	2.7185 × 10⁻⁵⁸	4.4738 × 10⁻¹⁴	5.6719 × 10⁻⁰²	3.4183 × 10⁻⁰⁴	2.6118 × 10⁻⁴⁷
F₄	Std	0.0000 × 10⁰⁰	1.0049 × 10⁻⁵⁷	3.6048 × 10⁻¹³	8.7712 × 10⁻⁰²	7.7712 × 10⁻⁰⁴	7.4344 × 10⁻⁴⁷
F₅	Mean	0.0000 × 10⁰⁰	0.0000 × 10⁰⁰	1.5068 × 10⁰⁰	4.0340 × 10⁰⁰	7.5512 × 10⁰¹	2.5412 × 10⁰⁰
F₅	Std	0.0000 × 10⁰⁰	0.0000 × 10⁰⁰	2.7213 × 10⁰⁰	1.4524 × 10⁰⁰	1.7123 × 10⁰¹	6.4249 × 10⁰⁰
F₆	Mean	8.8818 × 10⁻¹⁶	8.8818 × 10⁻¹⁶	1.0001 × 10⁻¹³	1.7562 × 10⁰¹	2.2750 × 10⁰⁰	8.0846 × 10⁻⁰²
F₆	Std	0.0000 × 10⁰⁰	0.0000 × 10⁰⁰	1.7224 × 10⁻¹⁴	9.9811 × 10⁻⁰¹	1.4751 × 10⁰⁰	4.4281 × 10⁻⁰¹
F₇	Mean	0.0000 × 10⁰⁰	0.0000 × 10⁰⁰	4.9804 × 10⁻⁰⁴	1.0479 × 10⁻⁰³	3.7887 × 10⁻⁰²	2.8945 × 10⁻⁰²
F₇	Std	0.0000 × 10⁰⁰	0.0000 × 10⁰⁰	9.5512 × 10⁻⁰³	1.5908 × 10⁻⁰³	2.0617 × 10⁻⁰²	7.6613 × 10⁻⁰²
F₈	Mean	−3.3220 × 10⁰⁰	−3.0972 × 10⁰⁰	−3.2223 × 10⁰⁰	−2.9367 × 10⁰⁰	−3.2622 × 10⁰⁰	−3.3101 × 10⁰⁰
F₈	Std	1.5111 × 10⁰⁰	1.6007 × 10⁰²	8.0092 × 10⁰²	6.4643 × 10⁰²	7.2109 × 10⁰⁰	1.3751 × 10⁰²
F₉	Mean	−1.0153 × 10⁰¹	−5.7972 × 10⁰⁰	−1.0153 × 10⁰¹	−3.0993 × 10⁰⁰	−6.7167 × 10⁰⁰	−1.0153 × 10⁰¹
F₉	Std	9.2511 × 10⁻¹²	6.1189 × 10⁻⁰¹	1.2251 × 10⁰⁰	3.8082 × 10⁰⁰	3.0177 × 10⁰⁰	1.7125 × 10⁻⁰¹
F₁₀	Mean	3.8012 × 10⁻⁰⁴	3.4637 × 10⁻⁰⁴	8.8845 × 10⁻⁰³	5.0147 × 10⁻⁰¹	4.7786 × 10⁻⁰³	7.0561 × 10⁻⁰³
F₁₀	Std	9.5912 × 10⁻⁰⁵	2.7712 × 10⁻⁰⁵	4.4561 × 10⁻⁰³	1.6615 × 10⁻⁰³	6.1175 × 10⁻⁰³	8.2759 × 10⁻⁰⁴

Table 8. Comparative Analysis of Ablative Experimental Outcomes.

Function	Algorithm
Function	ISO1	ISO2	ISO3	ISO	SO
F₁	2.4415 × 10⁻¹²⁰	4.1762 × 10⁻¹⁰⁵	5.7391 × 10⁻¹⁰⁹	0.0000 × 10⁰⁰	8.1391 × 10⁻⁸²
F₂	7.8911 × 10⁻⁹¹	6.3281 × 10⁻⁶⁹	4.5732 × 10⁻⁷¹	0.0000 × 10⁰⁰	4.7886 × 10⁻⁵¹
F₃	6.5382 × 10⁻⁸⁸	9.9150 × 10⁻⁷⁵	6.3315 × 10⁻⁷⁷	0.0000 × 10⁰⁰	3.0034 × 10⁻⁶⁵
F₄	7.1913 × 10⁻⁷¹	3.9122 × 10⁻⁵⁵	7.1392 × 10⁻⁴⁹	0.0000 × 10⁰⁰	2.6118 × 10⁻⁴⁷
F₅	0.0000 × 10⁰⁰	1.0125 × 10⁰⁰	1.1051 × 10⁰⁰	0.0000 × 10⁰⁰	2.5412 × 10⁰⁰
F₆	8.8818 × 10⁻¹⁶	6.4258 × 10⁻⁰⁵	9.1172 × 10⁻⁰⁵	8.8818 × 10⁻¹⁶	8.0846 × 10⁻⁰²
F₇	0.0000 × 10⁰⁰	3.7182 × 10⁻⁰³	7.3282 × 10⁻⁰³	0.0000 × 10⁰⁰	2.8945 × 10⁻⁰²
F₈	−3.3201 × 10⁰⁰	−3.3157 × 10⁰⁰	−3.3172 × 10⁰⁰	−3.3220 × 10⁰⁰	−3.3101 × 10⁰⁰
F₉	−1.0153 × 10⁰¹	−1.0153 × 10⁰¹	−1.0153 × 10⁰¹	−1.0153 × 10⁰¹	−1.0153 × 10⁰¹
F₁₀	9.9913 × 10⁻⁰⁴	3.6012 × 10⁻⁰³	3.9912 × 10⁻⁰³	3.8012 × 10⁻⁰⁴	7.0561 × 10⁻⁰³

Table 9. Comparison of Slag Melting Point Optimization Results.

	Iron–Silicon	Calcium–Silicon	Coal Quantity	Oxygen Flow	SiO₂	Melting Point
real_data	2.0049	0.4462	109.80	76.0588	11.1251	1390.0041
GWO	1.8611	0.4591	109.13	75.4719	10.2991	1333.5102
SOA	1.8596	0.4757	110.08	71.0769	10.3537	1324.2401
SCA	1.9248	0.6490	106.93	70.8544	10.3901	1316.3657
SO	1.9981	0.6199	105.02	72.3678	10.4288	1299.3442
HHO	1.9994	0.6618	104.91	73.5412	10.4004	1281.7811
ISO	2.0472	0.6804	104.89	71.4551	10.3930	1259.8102

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kong, Y.; Liu, Z. Optimization of Zinc Smelting Slag Melting Point Based on Catboost and Improved Snake Optimization Algorithm. Appl. Sci. 2024, 14, 4603. https://doi.org/10.3390/app14114603

AMA Style

Kong Y, Liu Z. Optimization of Zinc Smelting Slag Melting Point Based on Catboost and Improved Snake Optimization Algorithm. Applied Sciences. 2024; 14(11):4603. https://doi.org/10.3390/app14114603

Chicago/Turabian Style

Kong, Yueping, and Ziyu Liu. 2024. "Optimization of Zinc Smelting Slag Melting Point Based on Catboost and Improved Snake Optimization Algorithm" Applied Sciences 14, no. 11: 4603. https://doi.org/10.3390/app14114603

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Optimization of Zinc Smelting Slag Melting Point Based on Catboost and Improved Snake Optimization Algorithm

Abstract

1. Introduction

2. Optimization Model for Process Parameters of Zinc Smelting Slag Melting Point

2.1. Selection of Key Process Parameters

2.2. Model Building

3. Improved Snake Optimization Algorithm for Melting Point Optimization Solution

3.1. Snake Optimization Algorithm Process

3.2. Improved Snake Optimization Algorithm

3.2.1. Halton Sequence Population Initialization

3.2.2. Elite Reverse Learning Mechanism

3.2.3. Dynamic Weights Cauchy–Gauss Mutation

3.3. Improved Snake Optimization Algorithm Process

4. Results and Discussions

4.1. Modeling Effect and Analysis

4.2. Analysis of Improved Snake Optimization Algorithm

4.3. Analysis of Ablation Experiments

4.4. Actual Production Data Experiment

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI