1. Introduction
Given the exacerbation of the energy crisis, there has been an increasing focus on utilizing renewable resources to mitigate resource scarcity and alleviate environmental pollution [
1,
2]. Biomass resources, particularly crop residues, find extensive applications in various sectors, including composites [
3,
4], biodegradable stalks [
5,
6], hydrogen production [
7,
8], and battery materials [
9,
10], due to their abundance, reproducibility, cost-effectiveness, and carbon neutrality. The high-value utilization of crop straw relies on its crushing treatment. Micro-crushed straw powder offers numerous advantages, including having a large specific surface area, low melting point, good activity, and rapid dissolution speed [
11]. The uniformity of straw powder particle size is a crucial indicator for assessing powder quality; however, the particle size distribution of straw powder processed by mechanical micro-crushing methods tends to be wide [
12]. Comminution combined with particle classification is a crucial method for transforming raw materials into the desired particle size [
13,
14]. Thus, during crop straw crushing, it is essential to promptly separate straw powder that meets standards, while unqualified products are returned to the crushing device for secondary processing, preventing over-crushing and particle agglomeration [
15].
In order to achieve efficient particle classification, the design and optimization of the classifying device are particularly important. The optimization of the structural parameters of the rotor cage, as the only rotating part in the classifying chamber, is a crucial method to enhance the classification performance of the classifying device. Compared to physical tests, the CFD method is suitable for the structural optimization study of the rotor cage in a classifying device due to its cost-effectiveness and time efficiency. Many scholars have conducted related studies. For instance, Liu et al. [
16] investigated the effect of the number of rotor cage blades on classification performance using the CFD method. The results showed that if the number of blades is too small, the coarse particles in the product increase. Conversely, if the number of blades is too large, the device cannot separate qualified products in time, and both internal resistance and energy consumption increase. Yu et al. [
17] compared and analyzed the effect of the inner and outer radius of the rotor cage on the flow field in the classifying chamber using the CFD method. The results showed that the inner and outer radius of the rotor cage directly affected the radial velocity distribution within the rotor cage channel and that they should not be excessively large or small. Jia et al. [
18] analyzed the effect of an inverted conical rotor cage on the flow field and classification performance in the classifying chamber using the CFD method. The results indicated that the optimal structure is achieved when the ratio of the top surface diameter of the rotor cage to the bottom surface diameter is 1:0.8 and 1:0.7, meeting the requirements of industrial classification efficiency and reducing energy consumption. Ismail et al. [
19] analyzed the effect of the length of rotor cage blades on classification efficiency using the CFD method. The results indicated that increasing blade length can enhance the production of fine particles. However, excessively long blades may hinder timely particle discharge from the classifying chamber. Ren et al. [
20] designed a rotor cage with non-radial arc blades by analyzing the effect of the blade mounting angle on the flow field. CFD simulations of the structure revealed a significant reduction in the incidence angle at the entrance of the rotor cage and the absence of air vortex in the blade flow path.
The above literature indicates that the classification performance of a classifying device is influenced by various factors, and geometric optimization of the rotor cage is an effective approach to enhance its performance. However, most current studies adopt the control variable method, optimizing only a single structural parameter and ignoring the effect of multi-feature parameter coupling. Additionally, these studies primarily focus on simple empirical models and traditional CFD simulations, which have limited optimization effects. Therefore, existing optimization methods still have significant room for improvement in enhancing classification performance.
In recent years, Machine Learning Algorithms (MLAs) have flourished, with many studies utilizing these algorithms to predict and optimize experimental parameters. Machine learning offers new methods for parameter optimization by training predictive models with CFD data [
21,
22]. Wu et al. [
23] utilized CFD computational data to train the GWO-SVR model, predicting nitrate decomposition rates with response times that were 259,200 times shorter than industrial-scale CFD simulations. Bakhtiari et al. [
24] trained a neural network using CFD data to predict the hydrodynamic performance of a marine cycloidal propeller. Mohammadpour et al. [
25] combined CFD and ML to predict nanofluid heat transfer in microchannel heat sinks, optimizing the microchannel geometry using four regression models: K-nearest neighbor, random forest, Gaussian process regression, and multilayer perceptron. The results showed that the K-nearest neighbor model provided more accurate predictions than the other MLAs. Liu et al. [
26] combined CFD and ML to predict the multi-objective in-furnace combustion characteristics associated with pulverized coal injection operation. By comparing the accuracy of six ML models, the Random Forest model was selected as the predictive model, reducing the response time by nearly 16,000 times compared to CFD simulation. In summary, MLAs have powerful regression and classification prediction capabilities and are widely used in various engineering fields, but few studies apply MLAs for the structural optimization of rotor cages. Additionally, most studies use a single algorithm to predict and optimize structural parameters, which lacks quantitative optimal solutions.
To address the challenges in quantitatively analyzing rotor cage structural parameters, coupling effects on classification performance, and the high computational costs of industrial-scale simulations, this paper proposes a novel CFD-ML-GA (Computational Fluid Dynamics-Machine Learning-Genetic Algorithm) methodology. The methodology aims to optimize rotor cage structural parameters and enhance the classification performance of the classifying device. Initially, the rotor cage classifying devices with varying structures is modeled using CFD and orthogonal experiments, leading to qualitative conclusions via analysis of variance. Subsequently, four machine learning algorithms—Decision Tree Regression (DTR), Support Vector Regression (SVR), Random Forest Regression (RFR), and Artificial Neural Network (ANN)—are used to develop a cutting particle size prediction model. Among these, RFR is identified as the most effective model, and its accuracy is further validated through correlation analysis, SHAP, and PDP methods. Finally, a Genetic Algorithm (GA) combined with the RFR model is used for the quantitative optimization of rotor cage geometric parameters. The integration of CFD and machine learning not only reduces computational resources and experimental costs but also provides a scientific foundation for designing key engineering equipment. The multidisciplinary approach used in this paper can inform and inspire research in related fields.
2. Materials and Methods
2.1. Numerical Modeling of the Straw Micro-Crusher Classifying Device
2.1.1. Physical Model
The straw micro-crusher designed and manufactured by our research team can perform both crushing and classifying operations. Its composition, primarily consisting of a crushing device and a classifying device, is shown in
Figure 1. After coarse crushing, straw material approximately 10 mm in size is fed into the crushing chamber through the feeding pipe. In the crushing chamber, straw materials are subjected to the impact of the crush cutter and shear forces. Simultaneously, the rotating airflow propels the materials against the crushing chamber wall, resulting in crushing through the combined effects of impact, shear, friction, and collision. With the air inlet positioned below the crushing chamber, the crushed material enters the classifying chamber along the conveying channel due to the negative pressure airflow. The rotor cage is located in the classifying chamber, and its primary function is to separate coarse and fine particles. The rotor cage rotates at high speed to generate centrifugal force opposing the negative pressure. The straw powder entering the flow area between the blades is subjected simultaneously to the centripetal force of the negative pressure airflow, gravity, and the centrifugal force generated by the rotor cage. The particles smaller than the cut size enter the cyclone collection device through the discharge pipe and valve, while the particles larger than the cut size are flung back to the crushing area to be further crushed along the inner side of the flow guiding loop due to their larger mass.
2.1.2. Computational Model
In this paper, the structural parameters of the classifying device are optimized with the objective of improving its classification performance. Based on the mechanism and working principle of the straw micro-crusher, the fluid space inside the classifying chamber is selected as the calculation area. SolidWorks software 2020 is used to model the classifying device, with its main structural dimensions shown in
Figure 2. The bottom surface of the rotor cage is the XOY plane (Z = 0 mm), with the center of this bottom surface as the coordinate origin. The original structure has 48 rotor blades, each with a length of 272 mm and a blade installation angle of 0°, and they are evenly distributed along the outer edge of the rotor cage.
2.1.3. Mathematical Models
CFD can be used to analyze the performance of the air classifier and predict the internal flow field [
18,
27]. The internal flow field of the classifying device is cyclonic, with the state of motion characterized by turbulent flow [
28]. The RNG k-ε model is more capable of simulating high-strain flow and accurately simulates vortex flow; the use of this model is well documented in the literature [
29,
30]. Therefore, the RNG k-ε model is selected for numerical simulation in this paper.
The classifying device is simulated in three dimensions using ANSYS Fluent software 2022 R1. The gas-solid phases are treated separately for numerical simulations. Initially, the Eulerian method is used for the gas-phase solution, with the mathematical model primarily comprising the mass conservation equation and the energy conservation equation [
31]. Secondly, the Lagrangian method is applied for the discrete phase model. This involves calculating the gas-phase flow field, solving for each particle’s velocity by combining the flow field variables, and then tracking the trajectory of each particle [
32]. Considering the particle phase volume fraction at the inlet to be below 10% and thus neglecting the role of particles relative to the gas phase, a single-coupled discrete phase model is adopted to simulate the particle motion characteristics in the classifying chamber [
33].
The equation of motion for discrete phase particles is:
where
is the drag coefficient, given by
where
is the gravitational acceleration;
is the gas kinematic viscosity;
is the particle velocity;
is the gas velocity;
is the particle density;
is the gas density;
is the relative Reynolds number;
is the particle diameter;
is the additional acceleration;
is the drag coefficient.
2.1.4. Boundary Conditions
To gain deeper insight into the effect of blade shape on classification performance, the other structural parameters of the classifying device are assumed to remain unchanged in the subsequent simulation analysis. Based on actual operating conditions, the process parameters of the classifying device are set, including inlet air velocity and rotor cage speed . The material particle density is 2750 kg/m3, and single particles () are injected from the inlet surface. The Tromp curve is obtained by tracking the particle trajectory.
To couple the moving rotor cage with the stationary surrounding components, the Multiple Reference Frame (MRF) method is introduced, and the inner and outer surfaces of the cage are used as interfaces. The “velocity-inlet” boundary condition is applied to the air inlet, and the “pressure-outlet” boundary condition is used for the air outlet. The wall boundaries are defined using no-slip boundary conditions, and the near-wall surfaces are treated with standard wall functions. The detailed parameters used in the CFD modeling are listed in
Table 1.
2.1.5. Performance Evaluation Parameters
Cut size (
) and classifying sharpness index (
) are the standard evaluation indices of air classifiers [
34,
35]. After each CFD experiment, the number of particles captured for each particle size is counted, and the classification efficiency is then calculated according to the following equation:
where
is the classification efficiency;
is the number of particles of a given size in the trap;
is the number of particles of a given size in the total number of particles.
and
represent the corresponding particle sizes when the classification efficiency is 25% and 75%, respectively. According to the definition of the classifying sharpness index, the classifying sharpness index can be calculated using the following formula:
The classifying sharpness index (
) reflects the accuracy of particle size distribution after the classifying experiment of the classifying device on the material. The larger the value of
, the higher the classifying sharpness index; for ideal particle grading, the value of
is 1.
means that the probability of particles of this size entering both the fine and coarse powder outlets is 50%, meaning the classification efficiency is 50% at this point. As the most important indicator, obtaining a smaller
is the current research hotspot [
36,
37].
2.1.6. Grid Independence Analysis
The simplified model of the classifying device is meshed using ANSYS Workbench Meshing software 2022 R1. Appropriate cell sizes are selected to generate the mesh. An overview of the meshes after delineation is shown in
Figure 3a, where the entire area is divided into 3,044,700 grid cells. Due to the relatively narrow spacing between the rotor cage blades, the rotor cage region is treated with local mesh refinement.
The accuracy of the simulation calculations is ensured by the mesh sensitivity study, as shown in
Figure 3b,c. When the number of meshes exceeds 3,044,700, the tangential velocity and pressure drop do not change significantly, continuing to increase the number of meshes has little effect on the computational results. Meanwhile, four measurement points at various positions were taken at the outlet of the rotor cage, and the axial velocities of the corresponding models under different grid numbers were measured, as shown in
Figure 3d. It is observed that the axial velocities in the outlet region of the rotor cage remained essentially unchanged when the number of grids exceeded 3,044,700, indicating that the simulation results are independent of the mesh size. Therefore, the maximum total number of meshes is determined to be 3,044,700 to ensure the accuracy of the calculation results while saving calculation time and computational resources.
2.2. Orthogonal Experiment
2.2.1. Determination of Optimization Parameter Range
The blade is the core component of the rotor cage, and its structure determines the flow field distribution within the classifying device. In this paper, the number of rotor blades, the length of rotor blades, and the rotor blade installed angle are selected as three parameters for the orthogonal test.
The flowing zone between the blades is crucial for classifying and conveying fine powder. An excessive or insufficient number of blades can negatively impact classification; a reasonable number of rotor blades can improve the classification performance of the classifying device and reduce energy consumption. Based on previous scholarly research and engineering practice [
16,
38,
39], the number of blades is selected between 24 and 64 for the study. Considering the height of the classifying chamber and the crushing chamber below, the length of rotor blades in this study is selected to be between 224 and 304 mm. When the airflow enters the rotor cage, it impacts the blades, forming a vortex between them and causing blade wear. The stability of the flow field can be effectively improved by adjusting the blade installation angle. Based on previous scholarly research on the airflow impact angle and blade installation angle [
20,
38], the blade installed angle is varied between 0° and 40° for this study.
2.2.2. Orthogonal Experimental Protocol Design
The orthogonal design of experiments is a fast and effective mathematical and statistical method for addressing complex multifactorial problems in mechanical engineering [
40]. Using predetermined orthogonal tables significantly reduces the number of required experiments while maintaining the integrity of the study [
41].
This section discusses the effect of blade geometry on the classification performance. The selected characteristic data include the number of rotor blades, the length of rotor blade, and the installation angle of rotor blade, denoted as A, B, and C, respectively. The performance of the classifying device is evaluated based on the cut size
and classifying sharpness index
. The labeled data include cut size and classifying sharpness index. The objective is to ensure that the cut size (
) is as small as possible while maintaining a high classifying sharpness index (
), thereby identifying the blade geometry with the best classification performance. An orthogonal experimental design is employed to investigate the effect of three factors on the performance of the classifier. Six levels are selected for each factor, and the experiment is arranged according to the commonly used orthogonal design form, the L36 table. The factor levels are displayed in
Table 2.
2.3. Machine Learning Algorithms
Machine learning method is a data-driven data mining method based on data. The research idea of this paper using a machine learning algorithm is shown in
Figure 4.
Implementation of machine learning algorithms such as DTR, SVR, RFR, ANN, etc., using Python language, details of these machine learning algorithms are given in the literature [
42]. Given the extensive prior research on cut size prediction in classifying devices [
36,
37,
43,
44], it is established that cut size serves as an indicator of classification performance. In this paper, we apply four machine learning algorithms to construct a cut size prediction model and compare them to find the most accurate and stable prediction model. Subsequently, we utilize the selected model to predict the classifying sharpness index.
DTR, a classic algorithm in machine learning, differs from others by not relying on a fitting function in favor of constructing a mapping between inputs and outputs using logical language. Essentially, it segments the feature space into cells, each with its own output, with boundaries aligned parallel to coordinate axes. Test data is assigned to these cells based on features, yielding corresponding output values. Unlike classification tasks, DTR minimizes differences between sets using a squared error metric. The final prediction is based on the continuous variables derived from the decision tree.
SVR is a machine learning method based on statistical learning theory. One of its distinctive features is that the computation of model complexity does not depend on the dimensionality of the input data. SVR uses a nonlinear mapping to abstract the data into an n-dimensional feature space, and linear regression fits in the n-dimensional space by finding a hyperplane that can achieve the best fit to the data points.
RFR, a supervised machine learning algorithm, harnesses ensemble learning. It excels in combining various algorithms or iterations of the same algorithm to create more potent predictive models. Specifically, each tree within a random forest is trained on a subset of data, amalgamating multiple decision trees to determine the final output rather than relying on a single one. This process enhances stability and robustness. In this study, a random forest model comprising 80 decision trees predicts the cut size. Feature information guides the selection of optimal features and their respective thresholds for node splitting during decision tree construction.
ANNs play a significant role in machine learning algorithms, utilizing back-propagation of errors to minimize prediction errors through iterative weight updates using the steepest descent method. In this study, a four-layer neural network with two hidden layers (3-19-14-1) is used to establish a nonlinear mapping between input and output terms. The training process involves two nested loops: in the inner loop, forward propagation computes input and activation values for each hidden and output layer, followed by backpropagation to compute errors for both output and hidden layers, facilitating weight updates based on these errors.
The hyperparameter settings for the four prediction models are shown in
Table 3, and the optimal parameters are found by examining the learning curves.
In model evaluation, the performance of the prediction model is evaluated in this paper using two key metrics: the coefficient of determination (
) and the mean squared error (
).
is used to assess the degree of fit between the predicted values and the true values, measuring the proportion of the variance in the dependent variable that is predictable from the independent variables. Meanwhile, the magnitude of deviation of the predicted values from the true values is expressed as
, computed by averaging the squares of the differences between predicted and true values. Thus, a higher
value indicates a better fit of the model to the data, while a lower
indicates higher accuracy.
and
are calculated as [
45]:
where
is the true value;
is the predicted value;
is the average value.