Next Article in Journal
How Fluid Particle Interaction Affects the Flow of Dusty Williamson Fluid
Previous Article in Journal
Effects of Tip Clearance and Impeller Eccentricity on the Aerodynamic Performance of Mixed Flow Fan
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Coal Gangue Identification Method Based on HOG Combined with LBP Features and Improved Support Vector Machine

1
State Key Laboratory of Mining Response and Disaster Prevention and Control in Deep Coal Mines, Anhui University of Science and Technology, Huainan 232001, China
2
School of Mechanical Engineering, Anhui University of Science and Technology, Huainan 232001, China
*
Author to whom correspondence should be addressed.
Symmetry 2023, 15(1), 202; https://doi.org/10.3390/sym15010202
Submission received: 12 December 2022 / Revised: 4 January 2023 / Accepted: 5 January 2023 / Published: 10 January 2023

Abstract

:
Identification of coal and gangue is one of the important problems in the coal industry. To improve the accuracy of coal gangue identification in the coal mining process, a coal gangue identification method based on histogram of oriented gradient (HOG) combined with local binary pattern (LBP) features and improved support vector machine (SVM) was proposed. First, according to the actual underground working environment of the mine, a machine vision platform for coal gangue identification was built and the coal gangue image acquisition experiment was carried out. Then, the images of coal and gangue were denoised by median filtering, and the coal and gangue features were extracted by using the HOG combined with LBP feature extraction algorithm, and these features were normalized and principal component analysis (PCA) reduced dimension to remove the correlation and redundancy between the features. Finally, SVM, SVM optimized by genetic algorithm (GA-SVM), SVM optimized by particle swarm optimization (PSO-SVM) algorithm, and SVM optimized by grey wolf optimization (GWO-SVM) algorithm were used as classifiers for identification and classification, respectively. The experimental results show that the GWO-SVM classification model has the highest accuracy, and the average classification accuracies were 96.49% and 94.82% of the training set and test set, respectively, which shows that grey wolf algorithm to optimize support vector machine has a good effect on classification of coal gangue images, which proves the feasibility and accuracy of the proposed method.

1. Introduction

Coal is one of the important energy sources for human survival and has played an essential role in global economic development. China is also the world’s largest coal producer and consumer, and China’s energy consumption structure will continue to be dominated by coal for a long time to come [1,2]. In 2021, China’s energy consumption structure is shown in Figure 1, of which coal energy consumption accounted for 56%. Additionally, in the process of energy structure transformation, coal should play a leading role. However, gangue is a kind of solid waste with low carbon content emitted in the process of coal mining. Its main chemical components are SiO2 and Al2O3 [3,4]. Raw coal mixed with gangue will not only reduce the quality of coal but also seriously pollute the atmospheric environment when burned. Therefore, coal gangue identification and separation is a vital part of the coal production process. In China, the coal industry will enter a stage of high-quality development, and the Chinese government is actively promoting efficient, clean, and green coal mining [5,6,7,8].
There are many ways to identify coal and gangue. Traditional manual identification work is intense, and coal gangue sorting efficiency is low [9]. Mechanical vibration identification is mainly through coal and coal gangue particles and metal plate collision vibration signal recognition, but this method will reduce the quality of coal [10,11]. Xing et al. [12] proposed a coal and gangue identification method based on the intensity image of lidar and DenseNet. 3D laser scanning combined with dynamic weighing is an identification method based on the density difference between coal and gangue [13], but these methods have large measurement error. Hu et al. [14,15] used multispectral, thermal imaging, and infrared spectroscopy methods to identify coal and gangue [16,17,18,19], but these methods are easily affected by ambient temperature and light and are only suitable for a laboratory environment and are difficult to promote and apply in actual coal gangue sorting. High-energy rays (such as X-rays, γ rays) [20,21] have high costs for coal and gangue identification and are radiation hazards to the human body. Therefore, compared with the above methods, machine vision technology has the advantages of environmental protection, no harm, low cost, and real-time acquisition of images.
In recent years, machine vision and image processing technologies have been widely used in various fields, such as defect detection [22], medical diagnosis [23], etc. In the field of coal, Huang et al. [24] proposed a binocular machine vision and particle queuing method for online detection of coal content in gangue. Chaves et al. [25] used machine vision techniques for automatic characterization of coke during pulverized coal combustion. Image processing of coal and gangue mainly focuses on feature extraction [26], machine learning [27,28], and deep learning [29,30], etc. Li et al. [31] proposed a coal gangue detection and recognition algorithm based on deformable convolution YOLOv3 (DCN-YOLOv3). This algorithm effectively detects and recognizes coal gangue, improves the accuracy and efficiency of detecting and recognizing small-size coal and gangue, and improves environmental robustness. Although the classification accuracy of deep learning is high, unfortunately, deep learning requires a large amount of image data when building the model, the network model is complicated, and the processing time is long. In image recognition based on machine learning, the most commonly used are back propagation (BP) neural network and support vector machine (SVM). BP neural network has the disadvantages of slow convergence speed and falling into local extremum [32]. However, support vector machine has good generalization performance and takes into account the advantages of training error minimization and test error minimization. Support vector machine is a machine learning model for binary classification, which is especially suitable for classification of small samples and high-dimensional features [33]. In view of the above problems, inspired by the good optimization ability of the wolf pack intelligent optimization algorithm and the classification performance of SVM, this paper applies the grey wolf optimization (GWO) algorithm to optimize SVM parameters [34] and proposes a method for coal gangue identification based on GWO-SVM. This method is different from any previous method of coal gangue identification. First, we acquired a coal gangue image by machine vision technology and denoised it by median filtering. Then, we extracted coal gangue features using a histogram of oriented gradient (HOG) combined with local binary pattern (LBP) feature extraction algorithm. As far as we know, this is the first study of coal gangue image feature extraction. In addition, we also used the principal component analysis (PCA) method for feature dimension reduction to remove the correlation and redundancy between features. Finally, we used support vector machine as the classification model and, respectively, used genetic algorithm (GA), particle swarm optimization (PSO), and grey wolf optimization (GWO) algorithms for optimization and comparison. The experimental results show that the classification accuracy of the GWO-SVM model is the highest, which proves that the GWO algorithm can improve the accuracy and generalization ability of the model.
The rest of this paper is organized as follows. Section 2 introduces the design of the coal gangue image acquisition test platform. Section 3 introduces image processing of coal gangue. Section 4 introduces feature extraction of the coal gangue image and proposes a new feature extraction method for HOG combined with LBP features. Section 5 introduces SVM classification, focusing on grey wolf optimization algorithm to optimize SVM. Section 6 introduces the comparative simulation experiments of SVM, GA-SVM, PSO-SVM, and GWO-SVM models to prove the feasibility and accuracy of the proposed method. Conclusions and future work are summarized in Section 7.

2. Design of Image Acquisition Experimental Platform

The experimental samples of coal and gangue were selected from the coal mine in Panji District, Huainan, Anhui Province, China. The particle size of coal and gangue was distributed in 40–100 mm. Among them, the surface of the sample coal was black and shiny, with a few cracks and soft texture, while the surface of gangue was black–gray, dull and rough, and hard in texture. To realize acquisition of coal and gangue images, we imitate the actual working environment of the underground to build a machine vision experimental platform in the laboratory, as shown in Figure 2, and the platform consists of a conveyor belt, charge coupled device (CCD) industrial camera, USB3.0 data cable, double LED light as a light source, and computer. CCD industrial camera is the core equipment of the image acquisition device; selection of the camera determines the resolution and image quality of the acquired images. The camera model used in this experiment is Hikvision MV-CA050-11UC 5-million-pixel color camera, which supports manual adjustment of gain and exposure time, the resolution is 2448 × 2048, and frame rate is 35 fps. Using USB 3.0 data interface communication and connection to the computer, the camera can collect images of coal and gangue in real time. The light intensity during the experiment is 2000 Lux, and the speed of the conveyor belt is 0.2 m/s.

3. Image Processing

In this experiment, a total of 300 coal and gangue samples were selected for image acquisition. To save storage space and improve the training speed of subsequent classification models, each image is cropped and the original image is uniformly captured into an image with a size of 200 × 200 pixels. The environment of the actual coal gangue sorting site is harsh, which leads to environmental interference, such as dust and light in the process of image collection, which makes the collected coal gangue images contain noise and become fuzzy and unclear. To ensure clarity and identification accuracy of the image, it is necessary to process the original image. In this paper, the median filter is used for denoising processing, which can ensure the quality of the original image without distortion, but the filtering effect is very dependent on the size of the filtering window. To find the best filter window, 3 × 3, 5 × 5, and 7 × 7 three window sizes are selected, and image comparison before and after filtering is shown in Figure 3.
As can be seen from Figure 3, the larger the filter window size, the blurrier the edge of the image; if it is too small, the denoising effect will be poor. When the window size is 5 × 5, the image will be clearest. Therefore, this paper chooses a 5 × 5 filtering window size.

4. Future Extraction

4.1. Histogram of Oriented Gradient

In the process of coal and gangue identification based on machine vision and image processing technology, feature extraction of images is crucial, which directly affects the accuracy of coal and gangue identification. The feature of histogram of oriented gradient is a description operator used to detect objects in computer vision and image processing [35]. It composes features by calculating and counting the gradient direction histogram of local areas of the image. HOG feature extraction combined with SVM classifier has been widely used in image identification [36]. The basic steps of HOG feature extraction are as follows:
Step 1: Image graying. The color information in the image does not play much of a role, so the original image is converted to gray-scale image.
Step 2: Image normalization. The Gamma correction method is used to normalize and compress the input image, which aims to adjust the contrast of the image, reduce the influence of local shadow and lighting changes of the image, and suppress the interference of noise. The gamma compression expression is:
I ( x , y ) = I ( x , y ) G a m m a
where I(x, y) is the input image, and Gamma takes 1/2.
Step 3: Calculate the gradient size and direction for each pixel. The calculation formula is:
I ( x , y ) = ( G x ( x , y ) ) 2 + ( G y ( x , y ) ) 2
θ ( x , y ) = a r c t a n G y ( x , y ) G x ( x , y )
where Gx(x, y) and Gy(x, y) are gradients in the x and y directions, respectively, and θ(x, y) represents the gradient direction of the image at coordinates (x, y).
Step 4: Divide the image into square cells. Calculate the histogram of gradient direction for each square cell. The gradient direction is divided into K bars in the [0,π] interval, and binK is used to represent the K-th gradient direction; if the gradient direction of a pixel in the square cell is binK, the corresponding column value of the gradient direction is added to the gradient value of the pixel. In this paper, K takes 9; that is, the gradient direction is between 0° and 180°, and every 20° is divided into one direction to obtain 9 gradient directions.
Step 5: Calculate the HOG feature vector in one block by forming adjacent elements into blocks. The gradient direction histogram of each square cell in the block is converted into a single-dimensional vector, that is, a vector composed of the number of gradients corresponding to the direction, and all the square cell vectors are connected in series to form the HOG eigenvector of the block.
Step 6: HOG feature vector normalization. The purpose of normalization is to reduce the effect of feature vectors from lighting, shadows, and edge changes.
Step 7: The final HOG feature vector of the image is obtained. The feature vectors of all blocks of the sample are combined to obtain the final HOG feature vector of the original image.
A schematic diagram of HOG features and histograms in a coal image is shown in Figure 4. The pixels in the image can be used to train machine learning algorithms by using this HOG feature representation and can improve the learning efficiency of the algorithm, which is the beauty of gradient direction histograms, and it is also an advantage of the method proposed in this paper.

4.2. Local Binary Pattern

4.2.1. Basic LBP Operator

LBP is a kind of operator, which is used to describe the local texture feature of an image [37]. It has strong classification ability, high computational efficiency, and invariability to monotonous grayscale change. This operator was originally proposed by Ajala et al. in 1994 [38], and the original LBP operator is calculated as shown in Figure 5, and the calculation formula is:
L B P ( x C , y C ) = P = 0 P 1 2 P s ( g p g C )
s ( x ) = 1             i f 0 0             e l s e
where (xc, yc) is the coordinates of the central pixel, p is the p-th pixel of the neighborhood, gp is the gray value of the neighborhood pixel, gc is the gray value of the central pixel, and s(x) is the symbol function.
The implementation process of the original LBP feature extraction algorithm is as follows:
Step 1: Divide the detection window into 3 × 3 small cells.
Step 2: For the gray value of the pixel in each cell, the gray value of the adjacent 8 pixels is compared with the gray value of the central pixel, and, if the gray value of the pixel is greater than the central threshold, it is recorded as 1; otherwise, it is denoted as 0.
Step 3: Read the thresholded binary number clockwise from the upper left corner of the region, and then convert the binary number to a decimal number as the LBP feature value of the image.

4.2.2. Improved LBP Operator

The original LBP operator can only cover a fixed range of regions, which obviously cannot meet the requirements of rotational invariance and grayscale invariance. To adapt to the texture features of different scales, the original LBP operator is improved. The original 3 × 3 neighborhood is extended to any neighborhood, and the original square neighborhood is replaced by a circular neighborhood. The improved LBP operator has obvious symmetry, as shown in Figure 6. The improved LBP operator is represented by LBPP,R; that is, in the circular neighborhood with radius R, there are P pixels compared with the central pixel threshold point, and, finally, its minimum value is the LBP value of the neighborhood.
For a given center point (xc, yc), its neighborhood pixel position is (xp, yp), pP, and the sampled pixel point (xp, yp) can be calculated as follows:
x p = x c + R c o s ( 2 π p P ) y p = y c R s i n ( 2 π p P )
In the formula, R is the neighborhood radius, p represents the p-th pixel point, and P is the total number of neighborhood pixels, pP.

4.3. HOG Combined with LBP Features

The image of coal gangue not only has contour information but also contains texture information. HOG features can represent the contour information of the coal gangue image, LBP features include a local texture feature that can only reflect the local texture information in the image. To comprehensively and better reflect the feature information of coal gangue images, this paper proposes a feature extraction method combining HOG and LBP, which is also an innovation point of this paper. There are many different ways of feature fusion, such as weighted fusion, parallel fusion, and serial fusion. According to the feature types of HOG and LBP, this paper chooses the serial fusion mode. The schematic diagram of the serial fusion of HOG and LBP features of coal gangue is shown in Figure 7. This method can maximize retention of the contour and texture information of the coal gangue image, ensuring that the feature information is not distorted and the identification accuracy is high.
To eliminate the dimensional influence between different features, make each feature data in the same order of magnitude, we also need to normalize these features before building a classification model. In this paper, normalized to 0~1 interval, and then the principal component analysis (PCA) method is used to reduce the dimension of the feature vector to remove the correlation and redundancy between the components. The calculation formula of the cumulative contribution rate in PCA is shown in Equation (7):
R = i = 1 k λ i / i = 1 p λ i
where we set R to 85%, because p principal components can be obtained in the PCA method. If the cumulative contribution rate of the first k principal components reaches 85%, it indicates that the k principal components basically contain all the information contained in all feature vectors, so as to achieve the purpose of feature dimension reduction.

5. Support Vector Machine Classification

5.1. Basic Principles of Support Vector Machine

Support vector machine [39] is a binary machine learning model, which is essentially a classification method to quantify the difference between two types of data. In recent years, it has been widely used in practical engineering, which proves that SVM is a classification model with strong generalization ability [40] since some features of the actual coal and gangue samples are linearly inseparable. Therefore, for the problem of linear indivisibility, the support vector machine can map the sample data from the original two-dimensional space to a high-dimensional feature space through the kernel function, making the sample linearly divisible in this feature space, as shown in Figure 8.
Suppose a given sample dataset D = {(x1, y1), (x2, y2),..., (xn, yn)}, where xiRn, yi ∈ {−1, + 1}, (i = 1,2,..., n). Considering linear inseparable data, SVM takes sample classification as the starting point to find an optimal classification hyperplane by introducing penalty factors and relaxation variables to obtain the following inequality constraint minimization problem:
min 1 2 ω 2 + C i = 1 n ξ i s . t . y i ( ω T x i + b ) 1 ξ i ξ i 0 , i = 1 , 2 , n
where ω is the weight vector, C is the penalty factor, which is the positive parameter set manually to balance the importance of the penalty term, ξ i is the relaxation variable, and b is the classification threshold. Obviously, this is a convex quadratic programming problem, so there is a unique minimal solution; the Lagrangian function of Equation (9) can be obtained by the Lagrangian multiplier method:
L ( ω , b , α , ξ , β ) = 1 2 ω 2 + C i = 1 n ξ i i = 1 n α i [ y i ( ω T x i + b ) ( 1 ξ i ) ] i = 1 n β i ξ i
where α i ≥ 0, β i ≥ 0, α i , and β i are Lagrange multipliers. Let the partial derivatives of L ( ω , b , α , ξ i , β ) to ω , b, and ξ i be 0, respectively:
L ω = 0 ω = i = 1 n α i y i x i L b = 0 i = 1 n α i y i = 0 L ξ i = 0 α i + β i = C
by introducing the kernel function K(xi, xj), according to the KKT condition and solving the dual problem, a new objective function is obtained:
max i = 1 n α i 1 2 i = 1 n j = 1 n α i α j y i y j K ( x i , x j ) s . t . i = 1 n α i y i = 0 0 α i C , i = 1 , 2 , , n
selection of kernel functions is critical, and, if the kernel function selection is not appropriate, it means that the sample is mapped to an inappropriate high-dimensional feature space, potentially leading to poor generalization of the classification. To better realize the feature transformation of sample data, this paper chooses Gaussian radial basis function (RBF) as the kernel function, which is a kind of radially symmetric scalar function. The expression of the kernel function is:
K ( x i , x j ) = exp ( x i x j 2 2 g 2 )
where g is the parameter of the kernel function. Therefore, the decision function of the support vector machine can be expressed as:
f ( x ) = s g n ( i = 1 n α i y i K ( x i , x j ) + b )
Determination of parameter g and penalty factor C in the kernel function has a great influence on the performance of the support vector machine model. To find optimal parameters C and g, intelligent optimization algorithms are often required to improve the optimization.

5.2. Genetic Algorithm Optimizes Support Vector Machine

GA is an adaptive global optimization search algorithm [41] that simulates the process of heredity and evolution in the natural environment. Genetic algorithms have the characteristics of operating the coding of parameters, needing no derivation and additional information, non-certainty, and adaptability of optimization rules. The basic processes of genetic algorithm optimization support vector machines are as follows:
Step 1: Parametric initialization of genetic algorithms. Setting the population size Np, crossover probability Pc, mutation probability Pm, and genetic maximum evolution algebra G.
Step 2: Chromosome encoding. Generating initial population by binary encoding.
Step 3: Individual evaluations. Calculating the fitness of each individual in the population.
Step 4: Genetic operation. Using the Selection, Crossover, and Variation operators to process the current population to determine whether to terminate the condition and, if satisfied, output the optimal individual to obtain optimal parameters C and g input into the SVM model. Otherwise, return to Step 3.

5.3. Particle Swarm Optimization Algorithm Optimizes Support Vector Machine

PSO is a swarm intelligence optimization algorithm based on parallel global search of birds swarm [42], which finds the optimal solution through cooperation and information sharing among individuals in the group. PSO has the advantages of fewer adjustment parameters and fast convergence speed and can be optimized in high-dimensional space. The basic process of particle swarm optimization support vector machine is as follows:
Step 1: Randomly initializing subgroups, including group size N, position of each particle, and velocity.
Step 2: Determining the fitness function according to the SVM decision function.
Step 3: Calculating and evaluating the fitness value of each particle.
Step 4: Searching for the global optimal parameters and iteratively updating the position and velocity of the particles. Determine whether the termination condition is met. If yes, optimal parameters C and g are output and input into the SVM model for training classification. Otherwise, Step 3 is returned.

5.4. Grey Wolf Optimization Algorithm Optimizes Support Vector Machine

GWO is a swarm intelligence optimization algorithm proposed by Australian scholars Mirjalili et al. in 2014 [43]. The algorithm is inspired by grey wolf predation prey activities and developed an optimized search method, which has the advantages of strong convergence performance and few parameters. In recent years, it has been widely studied by scholars and has been successfully applied to parameter optimization, fault diagnosis, image classification, and other fields [44,45]. GWO algorithm imitates the leadership level and hunting mechanism of the grey wolf in nature [46]. It consists of four kinds of grey wolves, Alpha ( α ), Beta ( β ), Delta ( δ ), and Omega ( ω ), which are used to simulate the various classes of grey wolves. The hierarchy of grey wolves is shown in Figure 9.
The grey wolf hunting process is divided into three stages: tracking prey, encircling prey, and attacking prey. During hunting, grey wolves round up their prey, defined as follows:
D = C X p ( t ) X ( t )
X ( t + 1 ) = X p ( t ) A D
where D is the current distance between the grey wolf and the prey, A and C is the coefficient vector, t is the current iteration number, X is the position vector of the grey wolf, and X p is the position vector of the prey. The formula for calculating A and C is as follows:
A = 2 a r 1 a
C = 2 r 2
where a is the convergence factor, which decreases linearly from 2 to 0 during the iteration, the value of a can be obtained by the Equation (18), and the modulus of r 1 and r 2 takes a random number between [0, 1].
a = 2 2 ( t t m a x )
where t m a x is the maximum number of iterations. The hunting process is usually led by α wolves, β wolves, and δ wolves will guide the wolves to surround the prey. However, in the search space, we do not know the best position of the prey; therefore, we assume that α wolves, β wolves, and δ wolves have a better understanding of the location of prey. The change in the location of the wolf pack round-up prey is formulated as follows:
D α = C 1 × X α ( t ) X ( t ) D β = C 2 × X β ( t ) X ( t ) D δ = C 3 × X δ ( t ) X ( t )
X 1 = X α A 1 × D α X 2 = X β A 2 × D β X 3 = X δ A 3 × D δ
X ( t + 1 ) = X 1 + X 2 + X 3 3
where X α , X β , and X δ represent the current position vectors of α , β , and δ wolves, respectively. D α , D β , and D δ are the distances between α , β , and δ wolves and prey, respectively. The position of ω wolves is constantly adjusted to be closer to the prey under the guidance of α , β , and δ wolves. X ( t + 1 ) represents the final position of ω wolves. The position update of gray wolves around prey is shown in Figure 10.
Based on the advantages of GWO algorithm, such as simplicity and less parameter setting, the method proposed in this paper is to optimize the parameters of SVM by GWO algorithm, obtain the best penalty factor C and kernel function parameter g of SVM through iterative optimization, and improve the training speed, generalization ability, and classification accuracy of the model. Finally, the flow chart of the GWO-SVM model proposed in this paper is shown in Figure 11.

6. Experimental Validation

6.1. Preparation of Dataset

To verify the feasibility of the proposed method of coal gangue identification based on GWO-SVM, we acquired 300 images of coal and gangue as sample datasets through the coal gangue identification machine vision experimental platform constructed in Section 2. The method of random sampling was used to divide the dataset, and the training set and the test set were divided in a ratio of 7:3. Among them, 210 are used as training sets, the remaining 90 are used as test sets, and the two datasets are independent of each other without crossover.

6.2. Model Parameter Setting

To prove that the proposed method has good performance, we use SVM, GA-SVM, PSO-SVM, and GWO-SVM classification models to test, respectively. The parameter settings of these algorithm models are shown in Table 1. The experiment was run on a computer with Chinese Lenovo CPU(AMD Ryzen7 5800H with Radeon Graphics 3.20 GHz) Microsoft’s Windows11 (64bit) and simulated on MATLAB R2022a software from MathWorks USA.
After setting the parameters of the model, according to the method proposed in Section 4.3, the histograms of the extracted HOG and LBP features are fused in series and then the feature data are normalized to the normalized interval of [0, 1], and, after normalization, feature dimensionality reduction is processed by the PCA. Finally, the datasets after normalized and PCA dimensionality reduction are input into the established SVM, GA-SVM, PSO-SVM, and GWO-SVM classification models, respectively. To better obtain the classification accuracy of the proposed model and compare the classification accuracy and performance stability of these four classification models, optimization simulation experiments were run 30 times.

6.3. Experimental Results and Analysis

Table 2 shows the classification accuracy and average accuracy of different classification methods on the training set and test set in the 30 experiments. Figure 12a,b shows the classification accuracy of the four classification models SVM, GA-SVM, PSO-SVM, and GWO-SVM on the training and test sets based on the data in Table 2, respectively. From the figure, we can clearly find that, whether in the training set or the test set, the accuracy of the GWO-SVM classification model is the highest, and its classification accuracy on the training set and the test set can reach 100% and 96.67%, followed by the PSO-SVM model; the original SVM has the lowest classification accuracy. In addition, as can also be seen from Table 2 and Figure 12, the method proposed (HOG combined LBP features and GWO-SVM) of all the methods in this paper has the highest average accuracy on the training set and test set, which indicates that the support vector machine optimized by grey wolf algorithm has the best classification effect in coal gangue image recognition applications. According to Section 5, the principle of optimizing support vector machine by these three intelligent optimization algorithms is to optimize SVM parameters C and g through iterative updating. However, their classification effect is very different. Grey wolf algorithm has the best optimization effect, which is because the GWO algorithm has fewer parameter settings, better robustness, and higher accuracy.
Figure 13 is a columnar diagram of the dataset classification accuracy comparison. As can be seen from the figure, the proposed method has higher classification accuracy on the training set and test set than other classification models, and the GWO-SVM method has an average accuracy of 96.49% and 94.82% on the training set and test set, respectively. That is, the GWO-SVM model is best classified under the condition of same training sample. In fact, compared with other intelligent optimization algorithms, the grey wolf algorithm can accurately find the best penalty factor C and kernel function parameter g of SVM, which can improve the classification accuracy of the model, which is also the feasibility and superiority of the method proposed in this paper.

7. Conclusions and Future Work

To improve the accuracy of coal gangue image identification, a coal gangue identification method based on HOG combining LBP features and improved support vector machine was proposed in this paper. To verify its feasibility, first, we built a machine vision experimental platform for coal gangue recognition by imitating the actual underground environment and acquired 300 coal gangue images as sample image datasets. Median filtering denoising was performed on the images of coal and gangue, HOG and LBP features of the images were extracted, and we combined the two feature histograms in series. To eliminate the dimensional influence between different features, we normalized the feature data to the [0, 1] interval and carried out PCA down-dimensional processing to remove correlation and redundancy between the features. Finally, 30 identification and classification experiments were conducted with SVM, GA-SVM, PSO-SVM, and GWO-SVM, respectively. The experimental results show that the average classification accuracy of the GWO-SVM model on the training set and test set is 96.49% and 94.82%, respectively, and its classification effect is better than that of SVM. Both the GA-SVM and PSO-SVM models are good, which proves the efficiency and accuracy of the proposed method in this paper.
However, our current work still has some shortcomings, such as the small number of samples, and the algorithm is prone to fall into the local optimal solution. In addition, grey wolf optimization algorithm is still a heuristic optimization technique, and the optimal solution generated is only close to the original optimal solution, not the optimal solution of the problem. Therefore, in future work, we will further expand the image dataset of coal gangue samples, continue to optimize classification performance, and carry out in-depth research on coal gangue identification by different feature selection methods and optimization algorithms so as to further improve the accuracy of coal gangue identification.

Author Contributions

All the authors contributed to the work. G.C. and J.C. proposed the new method; Y.W., S.C. and Z.P. designed the experiments and simulations; J.C. wrote the paper; G.C. obtained the financial support for the project leading to this publication. All authors modified the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Anhui Provincial University System Innovation Project of China (grant no. GXXT-2021-076), Open Research Fund of Joint National-Local Engineering Research Centre for Safe and Precise Coal Mining (grant no. EC2021010), and Open Project Program of Anhui Province Key Laboratory of Metallurgical Engineering & Resources Recycling (grant no. JKF22-06).

Data Availability Statement

Not applicable.

Acknowledgments

The authors are grateful to their peer experts for the full support of this paper and to the editors and the reviewers for their insightful suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Wang, J.; Liu, F.; Wang, L. Sustainable coal mining and mining sciences. J. China Coal Soc. 2016, 41, 2651–2660. [Google Scholar]
  2. Zhou, N.; Yao, Y.; Song, W.; He, Z.; Meng, G.; Liu, Y. Present situation and Prospect of coal gangue treatment technology. J. Min. Saf. Eng. 2020, 37, 11. [Google Scholar]
  3. Han, L.; Ren, W.; Wang, B.; He, X.; Ma, L.; Huo, Q.; Wang, J.; Bao, W.; Chang, L. Extraction of SiO2 and Al2O3 from coal gangue actuvated by supercritical water. Fuel 2019, 253, 1184–1192. [Google Scholar] [CrossRef]
  4. Xiao, J.; Li, F.; Zhong, Q.; Bao, H.; Wang, B.; Huang, J.; Zhang, Y. Separation of aluminum and silica from coal gangue by elevated temperature acid leaching for the preparation of alumina and SiC. Hydrometallurgy 2015, 155, 118–124. [Google Scholar] [CrossRef]
  5. Wang, G.; Wang, H.; Ren, H.; Zhao, G.; Pang, Y.; Du, Y.; Zhang, J.; Hou, G. 2025 scenarios and development path of intelligent coal mine. J. China Coal Soc. 2018, 43, 295–305. [Google Scholar]
  6. Bai, G.; Xu, T. Coal Mine Safety Evaluation Based on Machine Learning: A BP Neural Network Model. Comput. Intell. Neurosc. 2022, 2022, 5233845. [Google Scholar] [CrossRef] [PubMed]
  7. Dai, S.; Zheng, X.; Wang, X.; Finkelman, R.B.; Jiang, Y.; Ren, D.; Yan, X.; Zhou, Y. Stone coal in China: A review. Int. Geol. Rev. 2018, 60, 736–753. [Google Scholar] [CrossRef]
  8. Liu, H.; Liu, Z. Recycling utilization patterns of coal mining waste in China. Resour. Conserv. Recycl. 2010, 54, 1331–1440. [Google Scholar]
  9. Zou, H.; Jia, R. Visual Positioning and Recognition of Gangues Based on Scratch Feature Detection. Trait. du Signal 2019, 36, 147–153. [Google Scholar] [CrossRef]
  10. Yang, Y.; Zeng, Q.; Yin, G.; Wan, L. Vibration Test of Single Coal Gangue Particle Directly Impacting the Metal Plate and the Study of Coal Gangue Recognition Based on Vibration Signal and Stacking Integration. IEEE Access 2019, 7, 106784–106805. [Google Scholar] [CrossRef]
  11. Yang, D.; Li, J.; Zheng, K.; Du, C.; Liu, S. Impact-Crush Separation Characteristics of Coal and Gangue. Int. J. Coal Prep. Util. 2018, 38, 127–134. [Google Scholar] [CrossRef]
  12. Xing, J.; Zhao, Z.; Wang, Y.; Nie, L.; Du, X. Coal and gangue identification method based on the intensity image of lidar and DenseNet. Appl. Opt. 2021, 60, 6566–6572. [Google Scholar] [CrossRef]
  13. Wang, W.; Zhang, C. Separating coal and gangue using three-dimensional laser scanning. Int. J. Miner. Process. 2017, 169, 79–84. [Google Scholar] [CrossRef]
  14. Hu, F.; Zhou, M.; Yan, P.; Bian, K.; Dai, R. Multispectral Imaging: A New Solution for Identification of Coal and Gangue. IEEE Access 2019, 7, 169697–169704. [Google Scholar] [CrossRef]
  15. Hu, F.; Hu, Y.; Cui, E.; Guan, Y.; Gao, B.; Wang, X.; Wang, K.; Liu, Y.; Yao, X. Recognition method of coal and gangue combined with structural similarity index measure and principal component analysis network under multispectral imaging. Microchem. J. 2023, 186, 108330. [Google Scholar] [CrossRef]
  16. Alfarzaeai, M.S.; Niu, Q.; Zhao, J.Q.; Eshaq, R.; Hu, E.Y. Coal/Gangue Recognition Using Convolutional Neural Networks and Thermal Images. IEEE Access 2019, 7, 169697–169704. [Google Scholar] [CrossRef]
  17. Zou, L.; Yu, X.H.; Li, M.; Lei, M.; Yu, H. Nondestructive Identification of Coal and Gangue via Near-Infrared Spectroscopy Based on Improved Broad Learning. IEEE Trans. Instrum. Meas. 2020, 69, 8043–8052. [Google Scholar] [CrossRef]
  18. Song, L.; Liu, S.; Yu, M.; Mao, Y.; Wu, L. Research on coal and gangue classification method based on combined analysis of visible-near infrared and thermal infrared spectroscopy. Spectrosc. Spectr. Anal. 2017, 37, 416–422. [Google Scholar]
  19. Song, L.; Yu, Y.; Yan, Z.; Xiao, D.; Sun, Y.; Zhang, X.; Li, X.; Cheng, B.; Gao, H.; Bai, D. Rapid Analysis of Composition of Coal Gangue Based on Deep Learning and Thermal Infrared Spectroscopy. Sustainability 2022, 14, 16210. [Google Scholar] [CrossRef]
  20. Yang, H.G.; Qiao, Z.M. Design of separation system of coal and gangue based on X-ray and machine vision. Ind. Mine Autom. 2017, 43, 85–89. [Google Scholar]
  21. Zhang, N.; Liu, C. Radiation characteristics of natural gamma-ray from coal and gangue for recognition in top coal caving. Sci. Rep. 2018, 8, 190. [Google Scholar] [CrossRef] [Green Version]
  22. Fan, C.-L.; Chung, Y.-J. Supervised Machine Learning-Based Detection of Concrete Efflorescence. Symmetry 2022, 14, 2384. [Google Scholar] [CrossRef]
  23. Doan, M.; Carpenter, A.E. Leveraging machine vision in cell-based diagnostics to do more with less. Nat. Mater. 2019, 18, 414–418. [Google Scholar] [CrossRef] [PubMed]
  24. Huang, H.; Dou, D.; Zhang, C. Detecting coal-carrying rate in gangue based on binocular machine vision and particle queuing method. Int. J. Coal Prep. Util. 2022, 1–13. [Google Scholar] [CrossRef]
  25. Chaves, D.; Fernandez-Robles, L.; Bernal, J.; Alegre, E.; Trujillo, M. Automatic characterisation of chars from the combusion of pulverised coals using machine vision. Powder Technol. 2018, 338, 110–118. [Google Scholar] [CrossRef]
  26. Li, N.; Xue, J.; Gao, S. Feature extraction method CNDFA for target contour of coal and gangue based on multifractal. J. Electron. Imaging 2022, 31, 041217. [Google Scholar] [CrossRef]
  27. Xue, G.; Li, X.; Qian, X.; Zhang, Y. Coal-gangue image recognition in fully-mechanized caving face based on random forest. Ind. Mine Autom. 2020, 46, 57–62. [Google Scholar]
  28. Lin, X.; Zhang, P.; Meng, F.; Liu, C. A Coal Seam Thickness Prediction Model Based on CPSAC and WOA-LS-SVM: A Case Study on the ZJ Mine in the Huainan Coalfield. Energies 2022, 15, 7324. [Google Scholar] [CrossRef]
  29. Liu, Q.; Li, J.; Li, Y.; Gao, M. Recognition Methods for Coal and Coal Gangue Based on Deep Learning. IEEE Access 2021, 9, 77599–77610. [Google Scholar] [CrossRef]
  30. Gao, R.; Sun, Z.; Li, W.; Pei, L.; Hu, Y.; Xiao, L. Automatic Coal and Gangue Segmentation Using U-Net Based Fully Convolutional Networks. Energies 2020, 13, 829. [Google Scholar] [CrossRef] [Green Version]
  31. Li, D.Y.; Wang, G.F.; Zhang, Y.; Wang, S. Coal gangue detection and recognition algorithm based on deformable convolution YOLOv3. Int. Image Process 2022, 16, 134–144. [Google Scholar] [CrossRef]
  32. Wang, D.; Ni, J.; Du, T. An Image Recognition Method for Coal Gangue Based on ASGS-CWOA and BP Neural Network. Symmetry 2022, 14, 880. [Google Scholar] [CrossRef]
  33. Shen, X.; Niu, L.; Qi, Z.; Tian, Y. Support vector machine classifier with truncated pinball loss. Pattern Recognit. 2017, 68, 199–210. [Google Scholar] [CrossRef]
  34. Li, C.; Peng, T.; Zhu, Y. A Cutting Pattern Recognition Method for Shearers Based on ICEEMDAN and Improved Grey Wolf Optimizer Algorithm-Optimized SVM. Appl. Sci. 2021, 11, 9081. [Google Scholar] [CrossRef]
  35. Wang, H.; Wei, S.B.; Fang, B.F. Facial expression recognition using iterative fusion of MO-HOG and deep features. J. Supercomput. 2020, 76, 3211–3221. [Google Scholar] [CrossRef]
  36. Xu, P.; Huang, L.; Song, Y. An optimal method based on HOG-SVM for fault detection. Multimed. Tools Appl. 2022, 81, 6995–7010. [Google Scholar] [CrossRef]
  37. Yang, B.; Chen, S. A comparative study on local binary pattern (LBP) based face recognition: LBP histogram versus LBP image. Neurocompuying 2013, 120, 365–379. [Google Scholar] [CrossRef]
  38. Ojala, T.; Pietikainen, M.; Harwood, D. Performance evalution of texture measure with classification based on Kullback discrimination of distribution. In Proceedings of the 12th International Conference on Pattern Recognition, Jerusalem, Israel, 9–13 October 1994; pp. 582–585. [Google Scholar]
  39. Burges, C. A Tutorial on Support Vector Machine for Pattern Recognition. Data Min. Knowl. Discov. 1998, 2, 121–167. [Google Scholar] [CrossRef]
  40. Wang, L.; Liu, Y.; Li, T.; Xie, X.; Chang, C. The Short-Term Forecasting of Asymmetry Photovoltaic Power Based on the Feature Extraction of PV Power and SVM Algorithm. Symmetry 2020, 12, 1777. [Google Scholar] [CrossRef]
  41. Katoch, S.; Chauhan, S.S.; Kumar, V. A review on genetic algorithm: Past, present, and future. Multimed. Tools Appl. 2021, 80, 8091–8126. [Google Scholar] [CrossRef]
  42. Wang, B.; Huang, H.; Dou, D.; Qiu, Z. Detection of coal content in gangue via image analysis and particle swarm optimization-support vector machine. Int. J. Coal Prep. Util. 2021, 42, 1915–1924. [Google Scholar] [CrossRef]
  43. Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey Wolf Optimizer. Adv. Eng. Softw. 2014, 69, 46–61. [Google Scholar] [CrossRef] [Green Version]
  44. Dong, Z.; Zheng, J.; Huang, S.; Pan, H.; Liu, Q. Time-Shift Multi-Scale Weighted Permutation Entropy and GWO-SVM Based Fault Diagnosis Approach for Rolling Bearing. Entropy 2019, 21, 621. [Google Scholar] [CrossRef] [Green Version]
  45. Hou, Y.; Gao, H.; Wang, Z.; Du, C. Improved Grey Wolf Optimization Algorithm and Application. Sensors 2022, 22, 3810. [Google Scholar] [CrossRef] [PubMed]
  46. Gupta, S.; Deep, K. Enhanced leadership-inspired grey wolf optimizer for global optimization problems. Eng. Comput. 2020, 36, 1777–1800. [Google Scholar] [CrossRef]
Figure 1. Diagram of China’s energy consumption structure in 2021.
Figure 1. Diagram of China’s energy consumption structure in 2021.
Symmetry 15 00202 g001
Figure 2. Coal gangue identification machine vision experimental platform.
Figure 2. Coal gangue identification machine vision experimental platform.
Symmetry 15 00202 g002
Figure 3. The effect of the three filtering methods. (a) Original image; (b) 3 × 3 filtering; (c) 5 × 5 filtering; (d) 7 × 7 filtering.
Figure 3. The effect of the three filtering methods. (a) Original image; (b) 3 × 3 filtering; (c) 5 × 5 filtering; (d) 7 × 7 filtering.
Symmetry 15 00202 g003
Figure 4. HOG feature visualization of coal. (a) Original image; (b) HOG feature diagram; (c) HOG feature histogram.
Figure 4. HOG feature visualization of coal. (a) Original image; (b) HOG feature diagram; (c) HOG feature histogram.
Symmetry 15 00202 g004
Figure 5. Schematic diagram of the basic LBP operator.
Figure 5. Schematic diagram of the basic LBP operator.
Symmetry 15 00202 g005
Figure 6. LBP operator for circular domain. (a) LBP8,1; (b) LBP8,2; (c) LBP16,2.
Figure 6. LBP operator for circular domain. (a) LBP8,1; (b) LBP8,2; (c) LBP16,2.
Symmetry 15 00202 g006
Figure 7. Serial fusion diagram of HOG and LBP features.
Figure 7. Serial fusion diagram of HOG and LBP features.
Symmetry 15 00202 g007
Figure 8. Schematic diagram of SVM dataset dimensioning.
Figure 8. Schematic diagram of SVM dataset dimensioning.
Symmetry 15 00202 g008
Figure 9. The grey wolf hierarchy.
Figure 9. The grey wolf hierarchy.
Symmetry 15 00202 g009
Figure 10. Grey wolf position update in GWO algorithm. Blue circle: α wolves; purple circle: β wolves; green circle: δ wolves; red circle: ω wolves or any other hunters; yellow circle: position of the prey.
Figure 10. Grey wolf position update in GWO algorithm. Blue circle: α wolves; purple circle: β wolves; green circle: δ wolves; red circle: ω wolves or any other hunters; yellow circle: position of the prey.
Symmetry 15 00202 g010
Figure 11. Flow chart of GWO-SVM model.
Figure 11. Flow chart of GWO-SVM model.
Symmetry 15 00202 g011
Figure 12. (a) Classification accuracy on the training set; (b) classification accuracy on the test set. Red *: GWO-SVM; blue circles: PSO-SVM; green circles: GA-SVM; black *: SVM.
Figure 12. (a) Classification accuracy on the training set; (b) classification accuracy on the test set. Red *: GWO-SVM; blue circles: PSO-SVM; green circles: GA-SVM; black *: SVM.
Symmetry 15 00202 g012
Figure 13. Comparison of classification accuracy of data set.
Figure 13. Comparison of classification accuracy of data set.
Symmetry 15 00202 g013
Table 1. Parameter settings of different models.
Table 1. Parameter settings of different models.
NumberModelParameter Setting
1SVMThe training function libsvmtrain is called here to train the SVM model, and the test set is tested and verified by the libsvmpredict function to obtain the identification classification accuracy. The experiment was run on MATLAB R2022a software.
2GA-SVMGenetic algorithm is used to optimize the support vector machine. The genetic algorithm is called MATLAB R2022a toolbox. The population size Np is 100, the cross-probability Pc is 0.7, the probability of variation Pm is 0.005, and the genetic maximum evolutionary algebra G is 200.
3PSO-SVMParticle swarm optimization is used to optimize the support vector machine. The experiment has the following related settings by calling the toolbox of “PSOt” in MATLAB: the number of particles N = 20, learning factor c1 and c2 are 1.5, the inertia weight w is 0.8, and the maximum number of iterations G is 200.
4GWO-SVMGWO algorithm was used to optimize the penalty factor C and kernel parameter g of SVM, where the population size of grey wolf M is 20 and the maximum number of iterations tmax is 200.
Table 2. Classification accuracy of different methods.
Table 2. Classification accuracy of different methods.
NumberClassification Accuracy on Training SetClassification Accuracy on Test Set
SVMGA-SVMPSOSVMGWO-SVMSVMGA-SVMPSO-SVMGWO-SVM
190.00%91.12%94.45%100.00%87.78%90.00%93.34%96.67%
288.89%92.23%91.12%96.67%85.56%91.12%92.23%95.56%
390.00%90.00%92.23%95.56%86.67%88.89%90.00%94.45%
491.12%90.00%92.23%96.67%86.67%90.00%91.12%95.56%
588.89%88.89%91.12%94.45%84.45%88.89%88.89%93.34%
687.78%92.23%91.12%94.45%86.67%87.78%90.00%95.56%
788.89%92.23%94.45%95.56%87.78%91.12%90.00%94.45%
886.67%90.00%93.34%96.67%85.56%90.00%92.23%95.56%
988.89%90.00%94.45%95.56%85.56%88.89%90.00%94.45%
1090.00%91.12%92.23%97.78%85.56%87.78%91.12%94.45%
1190.00%91.12%93.34%100.00%88.89%90.00%88.89%96.67%
1288.89%92.23%92.23%96.67%88.89%88.89%90.00%95.56%
1388.89%93.34%94.45%95.56%86.67%90.00%90.00%94.45%
1488.89%90.00%93.34%96.67%87.78%90.00%91.12%93.34%
1591.12%91.12%92.23%95.56%85.56%88.89%91.12%94.45%
1690.00%90.00%92.23%94.45%86.67%88.89%88.89%93.34%
1790.00%92.23%91.12%96.67%85.56%91.12%90.00%94.45%
1891.12%92.23%94.45%96.67%86.67%88.89%92.23%94.45%
1991.12%93.34%93.34%95.56%87.78%90.00%92.23%94.45%
2088.89%92.23%94.45%97.78%86.67%90.00%91.12%95.56%
2188.89%92.23%93.34%97.78%86.67%88.89%91.12%94.45%
2288.89%91.12%93.34%96.67%87.78%90.00%91.12%96.67%
2390.00%92.23%93.34%96.67%87.78%90.00%90.00%95.56%
2490.00%92.23%92.23%96.67%87.78%88.89%90.00%95.56%
2591.12%93.34%92.23%95.56%85.56%91.12%91.12%93.34%
2688.89%92.23%93.34%97.78%86.67%91.12%92.23%93.34%
2790.00%92.23%91.12%95.56%86.67%90.00%91.12%94.45%
2890.00%91.12%92.23%95.56%87.78%90.00%91.12%94.45%
2991.12%91.12%92.23%96.67%87.78%88.89%90.00%95.56%
3090.00%92.23%93.34%96.67%86.67%90.00%91.12%94.45%
Average accuracy89.63%91.52%92.82%96.49%86.82%89.67%90.78%94.82%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Cheng, G.; Chen, J.; Wei, Y.; Chen, S.; Pan, Z. A Coal Gangue Identification Method Based on HOG Combined with LBP Features and Improved Support Vector Machine. Symmetry 2023, 15, 202. https://doi.org/10.3390/sym15010202

AMA Style

Cheng G, Chen J, Wei Y, Chen S, Pan Z. A Coal Gangue Identification Method Based on HOG Combined with LBP Features and Improved Support Vector Machine. Symmetry. 2023; 15(1):202. https://doi.org/10.3390/sym15010202

Chicago/Turabian Style

Cheng, Gang, Jie Chen, Yifan Wei, Sensen Chen, and Zeye Pan. 2023. "A Coal Gangue Identification Method Based on HOG Combined with LBP Features and Improved Support Vector Machine" Symmetry 15, no. 1: 202. https://doi.org/10.3390/sym15010202

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop