Article

An Image Recognition Method for Coal Gangue Based on ASGS-CWOA and BP Neural Network

Dongxing Wang, Jingxiu Ni and Tingyu Du
1. R & D Department, Zhuhai Xinhe Technology Co., Ltd., Zhuhai 519600, China
2. School of Electrical Engineering, Zhejiang University, Hangzhou 310000, China
3. School of Mechanical Electronic & Information Engineering, China University of Mining & Technology-Beijing, Beijing 100083, China
4. Comprehensive Experimental Teaching Demonstration Center of Engineering, Beijing Union University, Beijing 100101, China
* Authors to whom correspondence should be addressed.
Symmetry 2022, 14(5), 880; https://doi.org/10.3390/sym14050880
Submission received: 24 March 2022 / Revised: 14 April 2022 / Accepted: 22 April 2022 / Published: 25 April 2022
(This article belongs to the Topic Applied Metaheuristic Computing)

Abstract

To improve the recognition accuracy of coal gangue images with the back propagation (BP) neural network, a coal gangue image recognition method based on the BP neural network and ASGS-CWOA (ASGS-CWOA-BP) was proposed, which makes two key contributions. Firstly, a new feature extraction method for the unique features of coal and gangue images, the "Encircle–City Feature", is proposed. Additionally, a method that applies ASGS-CWOA to optimize the parameters of the BP neural network was introduced to address the issue of its low accuracy in coal gangue image recognition, and a BP neural network with a simple structure and reduced computational consumption was designed. The experimental results showed that the proposed method outperformed the five comparison methods, with recognition accuracies of 95.47% and 94.37% on the training set and the test set, respectively, showing good symmetry.

1. Introduction

Removing gangue from raw coal is conducive to improving the efficiency of coal utilization and reducing environmental pollution. Therefore, the identification of gangue and coal blocks is a necessary step for the efficient removal of coal gangue in coal production, which is of great significance for environmental protection [1,2]. Because of its obvious advantages of resource saving, low cost, a simple processing system and convenient maintenance, image recognition technology for the separation of coal gangue has been widely studied and applied in the literature [3,4,5,6,7]. Researchers have developed a variety of image recognition methods for coal gangue image recognition, which contribute to research into coal gangue separation technology based on image recognition and have laid a foundation for further research. However, these methods have their shortcomings. For example, in [3], the proposed method resulted in a waste of computing resources and low time efficiency because it calculated 15 eigenvalues at the same time, although only the best five eigenvalues were actually used. In the study reported in [4], the new method adopted a deep convolutional neural network to address the task of online accurate and rapid identification of coal gangue, which required extensive computation. In other studies [5,6], although the recognition rate of the methods used was high, the artificially set threshold parameters played a decisive role in the intermediate calculation steps; these parameters depend on extensive experimental statistical analysis or experience, making the calculation complex. The stability of performance was questionable, and the generalized ability to recognize coal gangue images with different characteristics in different regions was not necessarily high. The authors of [7] proposed a method that required a large number of samples generated by the introduction of the transfer learning method, suggesting that the method's generalization and universality are questionable and may be limited when the sample size is small.
Since it was proposed by the team of Rumelhart and McClelland in 1986, the BP (back propagation) neural network has been widely used in the field of image recognition [8] because of its simple structure and high calculation efficiency, as in [9], where, to solve the problem of remote sensing image classification, the authors established an improved BP neural network and set a dynamic training intensity to improve the learning speed and classification accuracy of the BP neural network classifier. However, determining the optimal training intensity consumes a large amount of computing resources; moreover, sample set classification problems in different fields have different optimal training intensities due to the large differences in sample characteristics, so the generalizability is questionable, probably making this method unsuitable for coal gangue image classification covering a wide area and many sources. In another study [10], a small image recognition and classification method based on GA-BP was proposed, in which the combination of a genetic algorithm and the BP neural network gave full play to the nonlinear mapping ability of the neural network, resulting in strong learning ability and fast convergence speed. The experimental results showed that the new method had higher recognition accuracy and better performance than the traditional BP algorithm and the GA-BP algorithm. The authors of [11] adopted a hybrid algorithm combining a genetic algorithm and a back propagation algorithm, and the experimental results showed that this GA-BP algorithm had higher efficiency, robustness and practicability. The researchers in [12] proposed an epilepsy diagnosis method based on an improved genetic algorithm-optimized back propagation (IGA-BP) neural network and used this method to detect clinical epilepsy quickly and effectively. Liu et al. [13] used a genetic algorithm (GA) to construct a classification model based on the BP neural network to automatically identify the correlations in a multi-mode blog, and the experimental results showed that the classification model based on GA-BP was better than the traditional BP neural network. Yu et al. [14] considered the problem that a BP algorithm based on the gradient descent principle falls into local minima, so the classification of multispectral remote sensing images using spectral information alone cannot obtain ideal results; they therefore developed a new method combining feature texture knowledge with a BP neural network trained by particle swarm optimization (PSO). The experimental results showed that this method had improved classification accuracy. The authors of [15] proposed a method combining a random dropout algorithm and particle swarm optimization (PSO-BP) for the recognition and classification of small images, which corrected the weights of the PSO algorithm based on the error back propagation adjustment of the traditional BP algorithm to establish the PSO-BP network model. The hidden layer units of the PSO-BP network were improved by using the random dropout algorithm, thus achieving faster operation speeds. The authors of [16,17] proposed variants of PSO-BP combining a BP neural network and PSO to address the task of supervised classification of synthetic aperture radar (SAR) images and the problems of low accuracy and efficiency in traditional part classification methods. This kind of BP neural network method based on GA or PSO optimization has a certain reference value for coal gangue image classification.
Unfortunately, the accuracy of coal gangue image classification with these methods was still not high, as shown by subsequent comparative experiments. In addition, the BP neural network has also been applied in other fields. For example, in [18], the researchers proposed a metal surface defect classification method based on an improved bat algorithm to optimize the BP neural network, which was used to classify images of defects with different characteristics. The researchers in [19] proposed an automatic recognition method for cloud and precipitation particle shapes based on a BP neural network to solve the problem that the shapes of cloud particle images measured by airborne cloud imaging probes (CIPs) cannot be automatically recognized.
Although these methods have some shortcomings, they have still contributed to the research into coal gangue separation technology based on image recognition and laid a foundation for further research. In 2007, Yang and his coauthors proposed a new swarm intelligence algorithm [20], which simulates the predation process of wolves to solve complex nonlinear optimization problems. Due to its superior performance, it has been widely used in various fields and has been continuously developed and improved. For example, in [21], the authors proposed a novel and effective oppositional wolf pack algorithm to estimate the parameters of a Lorenz chaotic system. In another study [22], an improved wolf pack search algorithm was used to calculate the quasi-optimal trajectory of a rotor UAV in complex three-dimensional space. Moreover, in [23], the wolf pack algorithm was used to find the roots of a polynomial equation accurately and quickly. Similarly, in [24], Zhou et al. proposed a wolf pack search algorithm based on the leader strategy (LWCA). The researchers in [25] designed an adaptive shrinking grid search chaos wolf optimization algorithm (ASGS-CWOA) that uses adaptive standard deviation updating to achieve better performance and includes new regeneration rules, a new raid strategy, a new siege strategy and a new adaptive siege step size. Inspired by the good optimization ability of the wolf pack intelligent optimization algorithm and the excellent classification performance of the BP neural network, this article proposes a coal gangue image recognition method based on ASGS-CWOA and the BP neural network.
The remainder of this article is organized as follows. Section 2 describes the "Encircle–City Feature", the combination of ASGS-CWOA and the BP neural network, and an overview of the new method. In Section 3, we present comparative experiments to show the effectiveness of the proposed approach. The conclusion is given in Section 4.

2. Proposed Method

2.1. Encircle–City Feature

By comparing a large number of sample images, it was found that the bright area of coal block images is significantly larger than that of gangue images. However, distinguishing coal blocks from gangue by the contrast index alone is not very reliable: from a local point of view, the contrast of coal is significantly greater than that of gangue, but from an overall point of view this difference in contrast is partially offset across the image.
In this article, the basic idea of the Encircle–City Feature is to divide the sample image into several contiguous small areas of 50 × 50 pixels without overlaps or blanks, in each of which the following operations are performed. Assuming that the matrix of a small 50 × 50 area is I, the implementation steps are as follows:
Step 1: Divide each sample image evenly into M × N small areas with M rows and N columns such that each small area should be 50 pixels × 50 pixels without overlaps and blanks and perform Steps 2 to 4 for each one.
Step 2: Obtain the average gray value of the "City". The total gray value (denoted by City_sum_gray) of the central 30 × 30 area is calculated to obtain the average gray value (denoted by City_average_gray) of the central district. This is like a castle located in the central area, so we call it the "City" (shown in red in Figure 1). This is given by Equation (1).
$City_{sum\_gray} = \sum_{i=11}^{40}\sum_{j=11}^{40} I(i,j), \qquad City_{average\_gray} = City_{sum\_gray}/900$  (1)
Step 3: Obtain the total gray value (denoted by Encircle_sum_gray) and the average gray value (denoted by Encircle_average_gray) of the "Encircle". The peripheral part of the small 50 × 50 area, excluding the central 40 × 40 pixels, is like the wall around a castle, so it is called the "Encircle", as shown in blue in Figure 1. Equation (2) is as follows:
$Encircle_{sum\_gray} = \sum_{i=1}^{50}\sum_{j=1}^{50} I(i,j) - \sum_{i=6}^{45}\sum_{j=6}^{45} I(i,j), \qquad Encircle_{average\_gray} = Encircle_{sum\_gray}/900$  (2)
Step 4: For the small area of Row m and Column n, obtain the “Encircle–City” value using Equation (3):
$EncircleCity(m,n) = City_{average\_gray} - Encircle_{average\_gray} \quad (m \le M,\ n \le N)$  (3)
Step 5: Obtain the average value of the "Encircle–City" matrix (the M × N matrix obtained in Step 4). Finally, calculate the "Encircle–City" value of the whole sample image, as shown in Equation (4).
$Average_{gray} = \frac{\sum_{m=1}^{M}\sum_{n=1}^{N} EncircleCity(m,n)}{M \times N}; \qquad Length = \mathrm{find\_length}(EncircleCity > Average_{gray}); \qquad EncircleCity_{eigenvalue} = Length/(M \times N)$  (4)
where Average_gray means the average value of the overall matrix "Encircle–City"; Length means the number of elements larger than Average_gray in the matrix "Encircle–City", which is obtained by the function "find_length"; and Encircle–City_eigenvalue stands for the overall "Encircle–City Feature" value of one sample image, as shown in Figure 1.
In this article, small areas of 50 × 50 were used to segment one overall sample image, in which “Encircle” contains 900 pixels and “City” contains 900 pixels, meaning that the Encircle–City Feature can better reflect the texture features of the image under ideal circumstances. It should be noted that we only used the Encircle–City Feature value to identify coal and gangue, and the recognition accuracy reached 83.24%, which is not discussed in detail due to limited space.
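To make the steps above concrete, the following minimal sketch implements Equations (1)–(4). Python with NumPy is used purely for illustration (the original experiments were carried out in MATLAB), and the function name encircle_city_feature is ours; image is assumed to be a preprocessed grayscale array whose sides are multiples of 50, such as the 800 × 600 samples described in Section 2.3.1.

```python
import numpy as np

def encircle_city_feature(image, block=50):
    """Illustrative sketch of the Encircle-City Feature, Equations (1)-(4)."""
    rows, cols = image.shape
    M, N = rows // block, cols // block              # M x N small areas of 50 x 50 pixels
    ec = np.zeros((M, N))
    for m in range(M):
        for n in range(N):
            I = image[m*block:(m+1)*block, n*block:(n+1)*block].astype(float)
            # "City": central 30 x 30 pixels (rows/columns 11..40), 900 pixels in total
            city_avg = I[10:40, 10:40].sum() / 900.0
            # "Encircle": the 50 x 50 area minus its central 40 x 40 pixels, also 900 pixels
            encircle_avg = (I.sum() - I[5:45, 5:45].sum()) / 900.0
            ec[m, n] = city_avg - encircle_avg       # Equation (3)
    average_gray = ec.mean()                         # Equation (4)
    length = np.count_nonzero(ec > average_gray)
    return length / (M * N)                          # Encircle-City eigenvalue
```

For an 800 × 600 sample this yields M = 12 and N = 16, i.e., 192 local areas.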
Unfortunately, the images are often irregular, such that the "Encircle" and the "City" in the small 50 × 50 local areas do not necessarily strictly follow the ideal situation shown in Figure 1, which leads to the lower accuracy in identifying coal and gangue images using the Encircle–City Feature alone (83.24%). In fact, we are more concerned about the light–dark contrast of small local areas than that of the whole image, so we introduced an auxiliary value of the "Encircle–City Feature", the "Encircle–City Assist", the details of which are given below.
Step 1: Divide each sample image evenly into small M × N areas with M rows and N columns such that each small area must include 50 pixels × 50 pixels without overlaps or blanks and perform Steps 2 to 4 for each one.
Step 2: Sort the image pixels of the small 50 × 50 area in ascending order according to the gray value, as shown in Equation (5):
$block_{sort} = \mathrm{Sort}(block)$  (5)
where block_sort means the matrix after arrangement in ascending order, which is calculated and returned by the function "Sort".
Step 3: Calculate “Encircle–CityAssist”, the auxiliary value of the “Encircle–City Feature” of the current small 50 × 50 area block, which is the difference between the second half of “blocksort” and its first half, as shown in Equation (6).
$EncircleCity_{assist} = block_{sort}(2501:5000) - block_{sort}(1:2500)$  (6)
It should be noted that we only used the “Encircle–CityAssist” value to identify coal and gangue, and the recognition accuracy reached 78.21%, which is not discussed in detail due to limited space.
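The text does not state how the vector difference of Equation (6) is reduced to the single auxiliary value reported in Table 1, so the sketch below is only an assumption-laden illustration: each 50 × 50 block's 2500 sorted gray values are split into a lower and an upper half, the mean of (upper half minus lower half) is normalized by the maximum gray level 255, and the per-block values are averaged over the whole image. All of these reduction choices are ours, not the authors'.

```python
import numpy as np

def encircle_city_assist(image, block=50):
    """Hedged sketch of the Encircle-City Assist, Equations (5)-(6).
    The reduction of the half-difference vector to one scalar is an assumption."""
    rows, cols = image.shape
    M, N = rows // block, cols // block
    values = []
    for m in range(M):
        for n in range(N):
            blk = image[m*block:(m+1)*block, n*block:(n+1)*block].astype(float)
            block_sort = np.sort(blk.ravel())             # Equation (5): ascending order
            half = block_sort.size // 2
            diff = block_sort[half:] - block_sort[:half]  # Equation (6): upper half minus lower half
            values.append(diff.mean() / 255.0)            # assumed normalization to [0, 1]
    return float(np.mean(values))                         # assumed averaging over all blocks
```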

2.2. ASGS-CWOA-BP

2.2.1. Overview of ASGS-CWOA

We proposed ASGS-CWOA in [25]; it makes three contributions: the adaptive shrinking grid search (ASGS) strategy, the opposite–middle raid (OMR) strategy and the adaptive standard deviation updating amount (ASDUA), and it has been shown to have superior performance compared with some state-of-the-art algorithms. Accordingly, in this study, we use ASGS-CWOA to optimize the weights of the BP neural network for coal gangue image recognition. In order to adapt to the particularity of the recognition network weights, which are always small, some necessary adjustments of the step size are made according to the following rules.
In this article, the variation range of the weights was set between −5 and 5 based on experience, i.e., range_max = 5 and range_min = −5. Correspondingly, the value range is [−5, 5] in any dimension for the position of one wolf.
Thus, the step size of the siege stage can be obtained by using Equation (7) as follows:
$step\_c\_max = (range\_max - range\_min)/2, \qquad step\_c\_min = 0.01, \qquad step_c = step\_c\_min \times (range\_max - range\_min) \times \exp\big(\log(step\_c\_min/step\_c\_max) \times t/T\big)$  (7)
where step_c_max is the upper limit of the siege step’s size, step_c_min is the lower limit, t indicates the current number of iterations and T represents the upper limit.
The step sizes of the migration stage and the summons–raid stage can be obtained by using Equation (8) as follows:
$step_a = step_c \times 100 \ \text{when}\ step_c \ge 0.001; \qquad step_a = step_c \times 1000 \ \text{when}\ step_c < 0.001; \qquad step_b = step_a \times 2$  (8)
where step_a is the step size of the migration stage and step_b is that of the summons–raid stage. In order to prevent step_c from becoming smaller and smaller with each iteration, such that step_a and step_b become too small and harm the optimization effect, step_a and step_b are amplified when step_c is less than 0.001.
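As a concrete illustration of Equations (7) and (8), the step sizes can be computed as in the short Python sketch below (the function name is ours; the variable names follow the text).

```python
import math

def asgs_cwoa_step_sizes(t, T, range_max=5.0, range_min=-5.0):
    """Sketch of the step-size rules in Equations (7) and (8)."""
    step_c_max = (range_max - range_min) / 2.0       # upper limit of the siege step
    step_c_min = 0.01                                # lower limit of the siege step
    # Equation (7): the siege step shrinks exponentially as iteration t approaches the limit T
    step_c = step_c_min * (range_max - range_min) * \
        math.exp(math.log(step_c_min / step_c_max) * t / T)
    # Equation (8): migration and summons-raid steps, amplified when step_c becomes very small
    step_a = step_c * 100 if step_c >= 0.001 else step_c * 1000
    step_b = step_a * 2
    return step_a, step_b, step_c
```

For the [−5, 5] range this gives step_c = 0.1, step_a = 10 and step_b = 20 at t = 0, and all three steps shrink as t approaches T.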

2.2.2. The Recognition Network

Based on the BP neural network and the ASGS-CWOA algorithm, and considering the requirements of low network complexity and less computation, this research designed a recognition network with a simple structure for coal and gangue images (RN-CGI), which includes six input nodes, a hidden layer with four nodes and one output node, as shown in Figure 2.
Here, the hidden layer adopts the “tansig” kernel function, and the output layer adopts the “purelin” kernel function. The position coordinates of each wolf in the wolf pack represent the weights of the BP neural network, and the fitness value is jointly calculated by the recognition network and the sample eigenvector according to Equations (9) and (10).
$X_i = (x_{i1}, \ldots, x_{id}, \ldots, x_{iD}) \quad (i = 1, \ldots, n;\ d = 1, \ldots, D)$  (9)
$Net.W = \begin{bmatrix} x_{i1} & x_{i2} & x_{i3} & x_{i4} & x_{i5} & x_{i6} \\ x_{i7} & x_{i8} & x_{i9} & x_{i10} & x_{i11} & x_{i12} \\ x_{i13} & x_{i14} & x_{i15} & x_{i16} & x_{i17} & x_{i18} \\ x_{i19} & x_{i20} & x_{i21} & x_{i22} & x_{i23} & x_{i24} \end{bmatrix}, \qquad Net.L = \begin{bmatrix} x_{i25} & x_{i26} & x_{i27} & x_{i28} \end{bmatrix}$  (10)
where x_id is the coordinate of the i-th wolf in the d-th dimension, Net.W is the weight matrix from the input layer to the hidden layer, and Net.L is the weight vector from the hidden layer to the output layer. From the total number of elements in Net.W and Net.L (24 + 4), it is obvious that D is correspondingly 28. In this way, the location information of each wolf can be mapped into the weight parameters of the recognition network. By continuously optimizing the location information of the wolves, the potential optimal solution with the best fitness value is obtained.
In this article, the network output values of all training samples are calculated according to the network weight parameters mapped from the position information of wolf_i. Since this article considers the binary classification of coal and gangue images (coal = 0; gangue = 1) and the BP neural network adopts the "tansig" and "purelin" kernel functions, the following judgment can be made for the network output value out_i: for the i-th sample, when out_i is less than 0.5, this indicates coal, i.e., set 0, but if it is greater than or equal to 0.5, it is judged to be gangue, i.e., set 1. Accordingly, the fitness function is given by Equation (11).
$out = (out_1, out_2, \ldots, out_i, \ldots, out_{num}), \ i = 1, 2, \ldots, num; \qquad right\_num = \mathrm{length}(out = BJ); \qquad fitness_k = right\_num/num$  (11)
where num is the number of training or test samples, right_num is the number of samples correctly identified, BJ is the label (0 or 1) of the training or test sample, length(out = BJ) is a function that calculates and returns the number of correctly identified samples and fitness_k is the fitness value of wolf_k, the k-th wolf, that is, the recognition accuracy.
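To make the mapping of Equations (9)–(11) concrete, the sketch below decodes one wolf's 28-dimensional position into the RN-CGI weights and evaluates its fitness on a sample set. The 4 × 6 orientation of Net.W, the use of tanh for the "tansig" kernel and the omission of bias terms are our assumptions (the text specifies only the 28 weights); features is the num × 6 matrix of feature vectors and labels holds the corresponding 0/1 tags.

```python
import numpy as np

def rn_cgi_fitness(wolf_pos, features, labels):
    """Sketch of Equations (9)-(11): decode a 28-D wolf position into RN-CGI
    weights and return the classification accuracy as the fitness value."""
    assert wolf_pos.shape == (28,)
    net_w = wolf_pos[:24].reshape(4, 6)    # Net.W: hidden (4) x input (6), assumed layout
    net_l = wolf_pos[24:]                  # Net.L: hidden (4) -> output (1)
    hidden = np.tanh(features @ net_w.T)   # "tansig" kernel (equivalent to tanh)
    out = hidden @ net_l                   # "purelin" kernel: linear output
    pred = (out >= 0.5).astype(int)        # out < 0.5 -> coal (0), otherwise gangue (1)
    right_num = np.count_nonzero(pred == labels)
    return right_num / len(labels)         # Equation (11): fitness = accuracy
```

Evaluating this function for every wolf position in the pack gives the fitness values that ASGS-CWOA uses to drive its migration, summons–raid and siege stages.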

2.3. Overview of the Proposed Method

2.3.1. Image Preprocessing

With a camera (Huawei Honor 20, 48 million pixels), 358 images were taken, comprising 185 gangue images and 173 coal block images. The image size was 4000 × 3000 pixels, forming the original sample set. We preprocessed the original images in order to extract the feature vectors, as shown in Figure 3.
  • Image Graying: Grayscale images contain only brightness information and no color information. Grayscale processing converts a color image, which contains both brightness and color, into a grayscale image.
  • Median Filtering: Median filtering is a nonlinear signal processing technique that can effectively suppress noise based on sorting statistics theory. Its basic principle is to replace the gray value of a point in the digital image with the median of the gray values in the local neighborhood of that point. This paper used a 3 × 3 local neighborhood.
  • Otsu Segmentation: The Otsu algorithm is an efficient algorithm for image binarization proposed by the Japanese scholar Otsu in 1979. The principle is to divide the original image into foreground and background by a threshold. For the foreground, N1, csum and M1 represent the number of points, the quality moment and the average gray level of the foreground under the current threshold, respectively. For the background, N2, sum − csum and M2 represent the number of points, the quality moment and the average gray level of the background under the current threshold, respectively (where sum is the quality moment of the whole image). The optimal threshold is the one for which the difference between the background and the foreground is the greatest.
  • Erosion and Dilation: Erosion uses an algorithm to corrode the edges of the image; its function is to strip the "burrs" off the edge of the target. Dilation uses an algorithm to expand the edges of the image; its function is to fill pits on the edges or inside the target. Applying the same amount of erosion and dilation makes the target surface smoother, which is a symmetrical process.
  • Target Area Focusing: The size of the original sample images was 4000 × 3000 pixels, so the storage and computation burden is large, and the blank area accounts for a large proportion, resulting in unnecessary waste of resources. In order to lock onto the effective area, we used the eroded and dilated images to find the minimal boundary of the target area, remove the unnecessary background and focus on the foreground target area of the image.
  • Nearest Interpolation Image Size Scaling: After the target area focusing operation, due to differences in sample image noise and the different sizes of the target areas, the sizes of the gray images of the "target area" differed. In order to unify the sample size, a size scaling operation was carried out on the images; that is, "nearest interpolation" was used to reduce sample images larger than 800 × 600 and to enlarge sample images smaller than 800 × 600. The unified sample image size was 800 × 600 pixels.
The results of the preprocessing process are shown in Figure 3.
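For reference, the preprocessing chain described above can be sketched with standard OpenCV routines as follows. The 5 × 5 structuring element and the use of the bounding box of the nonzero Otsu mask for focusing are our assumptions; only the 3 × 3 median filter and the 800 × 600 target size are taken from the text.

```python
import cv2
import numpy as np

def preprocess(path):
    """Sketch of the preprocessing chain: graying, median filtering, Otsu
    segmentation, erosion/dilation, target-area focusing and nearest-neighbour
    scaling to the unified 800 x 600 size."""
    gray = cv2.cvtColor(cv2.imread(path), cv2.COLOR_BGR2GRAY)      # image graying
    gray = cv2.medianBlur(gray, 3)                                 # 3 x 3 median filter
    _, mask = cv2.threshold(gray, 0, 255,
                            cv2.THRESH_BINARY + cv2.THRESH_OTSU)   # Otsu segmentation
    kernel = np.ones((5, 5), np.uint8)                             # assumed structuring element
    mask = cv2.dilate(cv2.erode(mask, kernel), kernel)             # equal erosion and dilation
    ys, xs = np.nonzero(mask)                                      # target area focusing
    target = gray[ys.min():ys.max() + 1, xs.min():xs.max() + 1]
    return cv2.resize(target, (800, 600),
                      interpolation=cv2.INTER_NEAREST)             # nearest interpolation scaling
```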

2.3.2. Gray Level Co-Occurrence Matrix (GLCM)

The method commonly used to describe grayscale texture is the gray level co-occurrence matrix. The index eigenvalues derived from the gray level co-occurrence matrix are as follows. "Contrast" returns the contrast between a pixel in the whole image and its neighbors. The contrast of an image composed of constants is 0. The calculation equation is
$\mathrm{Contrast} = \sum_{i,j} |i-j|^2\, p(i,j)$
"Correlation" returns the cross-correlation between a pixel in the whole image and its neighbors. The value range is [−1, 1]. The cross-correlation of an image composed of constants is undefined. Correlation values of 1 and −1 correspond to complete positive correlation and complete negative correlation, respectively. The calculation equation is
$\mathrm{Correlation} = \sum_{i,j} \frac{(i-\mu_i)(j-\mu_j)\, p(i,j)}{\sigma_i \sigma_j}$
"Homogeneity" reflects the tightness of the distribution of elements in the GLCM relative to the diagonal of the GLCM. The value range is [0, 1]. The homogeneity of a diagonal GLCM is 1. The equation is
$\mathrm{Homogeneity} = \sum_{i,j} \frac{p(i,j)}{1+|i-j|}$
"Energy" returns the sum of squares of all elements in the GLCM. The value range is [0, 1]. The energy of an image composed of constants is 1. The calculation equation is
$\mathrm{Energy} = \sum_{i,j} p(i,j)^2$
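These four statistics are available in standard libraries; the sketch below uses scikit-image (our choice; the original implementation is MATLAB-based) on a preprocessed grayscale image. The single one-pixel horizontal offset and the use of all 256 gray levels are assumptions; note that skimage's "ASM" property corresponds to the Energy defined above, whereas its "energy" property is the square root of that sum.

```python
from skimage.feature import graycomatrix, graycoprops

def glcm_features(gray_image):
    """Sketch: contrast, correlation, homogeneity and energy (sum of squared
    GLCM entries) for one grayscale image."""
    glcm = graycomatrix(gray_image, distances=[1], angles=[0],
                        levels=256, symmetric=True, normed=True)
    props = ("contrast", "correlation", "homogeneity", "ASM")
    return [float(graycoprops(glcm, p)[0, 0]) for p in props]
```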

2.3.3. Feature Extraction of Coal and Gangue Images

According to the theories detailed above, the feature vector of each sample image is composed of six image features (contrast, correlation, homogeneity, energy, Encircle–City Feature and Encircle–City Feature auxiliary). As shown in Table 1, the sample included 358 sample images composed of 185 gangue images and 173 coal images.
As space is limited, only some data have been listed; the complete list of data is given in the link in Appendix A.
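Putting the pieces together, the six-dimensional feature vector of one sample can be assembled from the illustrative sketch functions defined in the previous subsections (again a sketch, not the authors' original MATLAB code).

```python
def sample_feature_vector(path):
    """Sketch: 6-D feature vector (contrast, correlation, homogeneity, energy,
    Encircle-City Feature, Encircle-City Assist) for one sample image,
    reusing preprocess, glcm_features, encircle_city_feature and
    encircle_city_assist from the earlier sketches."""
    gray = preprocess(path)
    return glcm_features(gray) + [encircle_city_feature(gray),
                                  encircle_city_assist(gray)]
```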

2.3.4. Flowchart of the Proposed Method

As shown in Figure 4, the flowchart of the proposed method includes the steps of inputting samples, initialization of ASGS-CWOA, the optimization process and recording the data. The details of inputting the samples are given in Section 2.3.2 and Section 2.3.3, and the details of the initialization of ASGS-CWOA, the optimization process and recording the data are described in Section 2.2.1 and Section 2.2.2 above.

3. Simulation Experiment

3.1. Experimental Environment

To verify the feasibility and efficiency of the algorithm proposed in this article, several groups of comparative experiments were carried out using the new ASGS-CWOA-BP method; GA-BP, in which a genetic algorithm optimizes the BP neural network; PSO-BP, in which particle swarm optimization optimizes the BP neural network; LWCA-BP, in which the wolf pack optimization algorithm with the leadership strategy optimizes the BP neural network; the original BP neural network based on gradient descent; and random forest (RF).
Table 1 shows the numerical experimental data based on six-dimensional feature vectors from 358 samples, and Table 2 lists the parameters of the six classification methods for coal and gangue images. The comparative experiments were run on a computer equipped with a CPU (AMD A6-3400M APU with Radeon HD Graphics, 1.40 GHz), 12.0 GB of memory (11.5 GB available) and Windows 7 (64 bit). To prove the good performance of the proposed algorithm, the optimization calculation was run 30 times on the sample feature vectors, and the comparison classification algorithms mentioned above were also tested.

3.2. Experimental Results

Firstly, Figure 5a,b show the comparison curves for the classification accuracy of each algorithm on the training set and the test set, respectively, based on the data in Table 3. It can be seen intuitively that the curve for ASGS-CWOA-BP is better than those of the others, except for the pink curve corresponding to RF on the training set, which means that, on the whole, ASGS-CWOA-BP has the best performance in the classification of coal gangue images using RN-CGI.
It can be clearly seen from Figure 5c that the ASGS-CWOA-BP curve is higher than the GA-BP curve on both the training set and the test set, which means that ASGS-CWOA-BP is better than GA-BP in terms of the accuracy of classifying coal gangue images using RN-CGI. In the same way, ASGS-CWOA-BP is better than PSO-BP and LWCA-BP, as shown in Figure 5d,e, respectively. These four methods follow the same principle of using an intelligent algorithm to optimize the weights of the BP neural network; however, their classification results are not consistent, which shows that the optimization abilities of the four intelligent algorithms differ and that the ability of ASGS-CWOA-BP is the best.
Additionally, Figure 5f indicates that ASGS-CWOA-BP had better performance than the original BP based on gradient descent for the classification of coal gangue images using RN-CGI, whether on the training set or the test set. In fact, BP based on gradient descent had the worst performance compared with GA-BP, PSO-BP, LWCA-BP and ASGS-CWOA-BP, which all use an intelligent algorithm to optimize the weights of the BP neural network. This reflects the inherent deficiency of gradient-descent-based BP, whose search is limited by the gradient landscape of the particular problem to be solved, while intelligent-algorithm-based BP variants are not subject to such restrictions.
In particular, RF was used as a comparison algorithm to analyze the performance of BPs. It can be seen that RF has a very good curve for the training set but a poor one for the test set (Figure 5g), which means that the model trained by RF will be overfitted due to the small dimensions of the feature vectors, which also shows the superiority of the algorithm proposed in this study.
Finally, the best record, average and variance of the classification accuracy are shown in Figure 6a–c, respectively, from which we can see that the proposed method was better in terms of the best classification accuracy than any other algorithm except RF on the training set, as well as in terms of the average value. On the training set, the variance of ASGS-CWOA-BP was not as good as that of LWCA-BP and RF, while it was better than that of GA-BP, PSO-BP and BP, as shown in Figure 6c. On the test set, the variance of ASGS-CWOA-BP was less than that of GA-BP, PSO-BP and LWCA-BP and was basically the same as that of BP, although it was a little worse than that of RF. Thus, ASGS-CWOA-BP had the best performance in terms of the best value, along with high robustness.

4. Conclusions

To improve the recognition accuracy of coal gangue images, a coal gangue image recognition method based on the BP neural network and ASGS-CWOA (ASGS-CWOA-BP) was proposed, which makes two key contributions. Firstly, a new feature extraction method regarding the unique features of coal and gangue images is proposed. Additionally, a method using ASGS-CWOA to optimize the parameters of the BP neural network was introduced to address the issue of low accuracy in coal gangue image recognition, and a BP neural network with a simple structure and reduced computational consumption was designed. The theoretical research and experimental results revealed that compared with GA-BP, PSO-BP, LWCA-BP, BP and RF, ASGS-CWOA-BP had the best classification accuracy and high robustness under the same conditions.
Compared with the five other algorithms, ASGS-CWOA-BP performed well in most cases on the training set and the test set; its best classification accuracy on the training set was 95.47%, while the corresponding accuracy on the test set was 94.37%, as shown in Table 4 and Figure 6a. It should be emphasized that this was achieved under extremely limited conditions: (1) the structure of the BP-based recognition network was extremely simple (only six-dimensional feature vectors were required in the input layer and only four nodes in the hidden layer), and (2) the number of samples was very small (only 358 coal gangue image samples). These extremely limited conditions greatly reduced the amount of calculation, and no GPU was used at any stage; therefore, all simulation experiments could be run on a laptop with ordinary performance, as detailed above, which shows that the new method proposed in this article has superior performance. In fact, the recognition model trained by this method is quite suitable for use in mobile, portable coal gangue image recognition equipment with weak computing power and low energy consumption.
However, it should be remembered that the most popular image recognition models based on deep learning achieve higher recognition or classification accuracy and have been studied by a considerable number of scholars. Unfortunately, the networks used in this technology are complex (with many layers and a large amount of calculation) and often require a large number of image samples. By contrast, this is exactly where the advantage of the method proposed in this article lies.
Our future work is to continue to improve the performance of the wolf pack optimization algorithm and to apply it to optimize a more complex BP-based recognition network to increase the feature dimensions of the extracted coal gangue images and increase the number of samples to improve the classification accuracy of coal gangue images.

Author Contributions

D.W. conceived the algorithm framework and wrote the article; J.N. and D.W. performed the program experiments; J.N. and T.D. contributed the data. All authors have read and agreed to the published version of the manuscript.

Funding

This study was funded by National Key Research and Development Program of China (grant no. SQ2018YFC060172).

Acknowledgments

The authors are grateful to their peer experts for the full support of this paper and thank Zhuhai Xinhe Technology Co., Ltd. and China University of Mining and Technology-Beijing for providing the necessary scientific research environment, as well as special thanks to Beijing Union University for its support of scientific research funds.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Link: https://pan.baidu.com/s/1dlcJGE6_UYNn3vYoQdH_Bg (accessed on 1 February 2022). Extraction code: 4033.

References

  1. Qian, M.; Xu, J.; Wang, J. Further on the sustainable mining of coal. J. China Coal Soc. 2018, 43, 1–13. [Google Scholar]
  2. Zhou, N.; Yao, Y.; Song, W.; He, Z.; Meng, G.; Liu, Y. Present situation and Prospect of coal gangue treatment technology. J. Min. Saf. Eng. 2020, 37, 11. [Google Scholar]
  3. Xue, G.; Li, X.; Qian, X.; Zhang, Y. Coal-gangue image recognition in fully-mechanized caving face based on random forest. Ind. Mine Autom. 2020, 46, 57–62. [Google Scholar]
  4. Xu, Z.Q.; Lv, Z.Q.; Wang, W.D.; Zhang, K.; Lv, H. Machine vision recognition method and optimization for intelligent separation of coal and gangue. J. China Coal Soc. 2020, 45, 2207–2216. [Google Scholar]
  5. Yu, G. Expanded order co-occurrence matrix to differentiate between coal and gangue based on interval grayscale compression. J. Image Graph. 2012, 8, 966–970. [Google Scholar]
  6. Yu, L. A New Method for Image Recognition of Coal and Coal Gangue. Mod. Comput. 2017, 17, 68–72. [Google Scholar]
  7. Rao, Z.; Wu, J.; Li, M. Coal-gangue image classification method. Ind. Mine Autom. 2020, 46, 69–73. [Google Scholar]
  8. Wen, X. Intelligent Fault Diagnosis Technology: Matlab Application; Beijing University of Aeronautics and Astronautics Press: Beijing, China, 2015. [Google Scholar]
  9. Zheng, Y.-G.; Wang, P.; Ma, J.; Zhang, H.B. Remote sensing image classification based on BP neural network model. Trans. Nonferrous Met. Soc. China 2005, 15, 232–235. [Google Scholar]
  10. Chen, Y.X.; Liao, X.D.; Wang, J.H.; Tao, Z.; Sui, L.Y. Small Image Recognition Classification Based on PCA and GA-BP Neural Network. In Proceedings of the 2018 2nd IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC 2018), Xi’an, China, 25–27 May 2018; pp. 1360–1363. [Google Scholar]
  11. Zhou, W.C.; Xie, G.S.; Liu, B. The application of mixed GA-BP algorithm on remote sensing image classification. In Proceedings of the Conference: Geoinformatics 2008 and Joint Conference on GIS and Built Environment: Classification of Remote Sensing Images, Guangzhou, China, 28–29 June 2008. [Google Scholar]
  12. Liu, G.; Wei, X.; Zhang, S.; Cai, J.; Liu, S. Analysis of epileptic seizure detection method based on improved genetic algorithm optimization back propagation neural network. Shengwu Yixue Gongchengxue Zazhi/J. Biomed. Eng. 2019, 36, 24–32. [Google Scholar]
  13. Liu, M.; Guan, W.; Yan, J.; Hu, H. Correlation identification in multimodal weibo via back propagation neural network with genetic algorithm. J. Vis. Commun. Image Represent. 2019, 60, 312–318. [Google Scholar] [CrossRef]
  14. Yu, J.; Zhang, Z.; Guo, P.; Qin, H.; Zhang, J. Multispectral remote sensing image classification based on PSO-BP considering texture. In Proceedings of the 7th World Congress on Intelligent Control and Automation (WCICA), Chongqing, China, 25–27 June 2008; pp. 6803–6806. [Google Scholar]
  15. Chen, Y.X.; Liao, X.D.; Wang, J.H.; Tao, Z.; Sui, L.Y. Small Image Recognition Classification Based on Random Dropout and PSO-BP. In Proceedings of the 2018 2nd IEEE Advanced Information Management, Communicates, Electronic and Automation Control Conference, Xi’an, China, 25–27 May 2018; pp. 1243–1246. [Google Scholar]
  16. Yu, J.; Li, Y.; Zhang, Z.S.; Jiang, J.C. Research on supervised classification of fully polarimetric SAR image using BP neural network trained by PSO. In Proceedings of the World Congress on Intelligent Control and Automation (WCICA), Jinan, China, 7–9 July 2010; pp. 6152–6157. [Google Scholar]
  17. Wei, B.; Hu, L.; Zhang, Y.; Zhang, Y. Parts Classification based on PSO-BP. In Proceedings of the 2020 IEEE 4th Information Technology, Networking, Electronic and Automation Control Conference (ITNEC 2020), Chongqing, China, 12–14 June 2020; pp. 1113–1117. [Google Scholar]
  18. Wang, W.; Lu, K.; Wu, Z.; Long, H.; Zhang, J.; Chen, P.; Wang, B. Surface defects classification of hot rolled strip based on improved convolutional neural network. ISIJ Int. 2021, 61, 1579–1583. [Google Scholar] [CrossRef]
  19. Dong, H.; Jiao, R.; Huang, M. Research on recognition method of cloud precipitation particle shape based on bp neural network. MATEC Web Conf. 2021, 336, 06011. [Google Scholar] [CrossRef]
  20. Yang, C.; Tu, X.; Chen, J. Algorithm of Marriage in Honey Bees Optimization Based on the Wolf Pack Search. In Proceedings of the International Conference on Intelligent Pervasive Computing, Jeju, Korea, 11–13 October 2007; Volume 871, pp. 462–467. [Google Scholar]
  21. Li, H.; Wu, H. An oppositional wolf pack algorithm for Parameter identification of the chaotic systems. Opt. Int. J. Light Electron Opt. 2016, 127, 9853–9864. [Google Scholar] [CrossRef]
  22. Chen, Y.B.; Mei, Y.S.; Yu, J.Q.; Su, X.L.; Xu, N. Three-dimensional Unmanned Aerial Vehicle Path Planning Using Modified Wolf Pack Search Algorithm. Neurocomputing 2017, 266, 445–457. [Google Scholar]
  23. Yang, N.; Guo, D.L. Solving Polynomial Equation Roots Based on Wolves Algorithm. Sci. Technol. Vis. 2016, 15, 35–36. [Google Scholar]
  24. Zhou, Q.; Zhou, Y. Wolf colony search algorithm based on leader strategy. Appl. Res. Comput. 2013, 30, 2629–2632. [Google Scholar]
  25. Wang, D.; Ban, X.; Qian, X. An Adaptive Shrinking Grid Search Chaos Wolf Optimization Algorithm with Adaptive Standard-Deviation Updating Amount. Comput. Intell. Neurosci. 2020, 2020, 7986982. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Schematic diagram of the Encircle–City Feature.
Figure 2. Recognition network for coal and gangue images.
Figure 3. Flowchart of image preprocessing.
Figure 4. Flowchart of the proposed method.
Figure 5. Classification accuracy comparison curves. (a) Comparison on the training set; (b) comparison on the test set; (c) ASGS-CWOA-BP vs. GA-BP; (d) ASGS-CWOA-BP vs. PSO-BP; (e) ASGS-CWOA-BP vs. LWCA-BP; (f) ASGS-CWOA-BP vs. BP; (g) ASGS-CWOA-BP vs. RF. Red: ASGS-CWOA-BP; green: GA-BP; black: PSO-BP; blue: LWCA-BP; cyan: original BP based on gradient descent; pink: RF. Circles indicate data from the training set while * indicates data from the test set.
Figure 6. Statistical analysis of the classification accuracy. (a) Best recorded classification accuracy; (b) average classification accuracy; (c) variance of the classification accuracy.
Table 1. Feature vector of the sample set.

Order | Contrast | Correlation | Homogeneity | Energy | Encircle–City Feature | Encircle–City Feature Auxiliary
1 | 6.08 | 0.82 | 0.66 | 0.14 | 0.38 | 0.49
2 | 5.4 | 0.8 | 0.63 | 0.09 | 0.36 | 0.43
3 | 2.94 | 0.88 | 0.75 | 0.22 | 0.37 | 0.42
4 | 7.92 | 0.73 | 0.59 | 0.08 | 0.34 | 0.41
5 | 8.09 | 0.77 | 0.64 | 0.13 | 0.34 | 0.51
6 | 7.81 | 0.79 | 0.65 | 0.14 | 0.33 | 0.46
…
353 | 3.52 | 0.89 | 0.73 | 0.13 | 0.3 | 0.34
354 | 2.54 | 0.92 | 0.78 | 0.13 | 0.28 | 0.28
355 | 3.13 | 0.9 | 0.73 | 0.09 | 0.34 | 0.32
356 | 3.21 | 0.91 | 0.73 | 0.09 | 0.31 | 0.36
357 | 2.81 | 0.91 | 0.73 | 0.11 | 0.29 | 0.28
358 | 3.97 | 0.87 | 0.73 | 0.17 | 0.31 | 0.38
Table 2. Configuration of the other methods in the comparison.

1. GA-BP: GA is applied to optimize the BP neural network. The genetic algorithm experiment uses the toolbox of MATLAB 2017A, and its configuration parameters are as follows: the crossover probability is set to 0.7, the mutation probability is set to 0.01 and the generation gap is set to 0.95.
2. PSO-BP: PSO is applied to optimize the parameters of the BP neural network to produce the PSO-BP classification method. The toolbox called "PSOt" in MATLAB is used in the particle swarm optimization experiments, with the following configuration: individual acceleration = 2; inertia weight at the initial time = 0.9; inertia weight at convergence time = 0.4. The individual speed is limited to 20% of the variation range.
3. LWCA-BP: LWCA is applied to optimize the parameters of the BP neural network to produce the LWCA-BP method based on the ideas presented in [24]. The configuration is: migration step (Step A) = 1.5, summons–raid step (Step B) = 0.9, siege threshold (R0) = 0.2, upper limit of the siege step (Step c_max) = 1 × 10^6, lower limit of the siege step (Step c_min) = 1 × 10^−2, updated amount of the population (M) = 5, maximum number of iterations (T) = 600, number of wolves in the population = 50.
4. BP with Gradient Descent (BP): BP neural network with gradient descent. This calls the BP neural network training function of MATLAB 2017b to generate the BP network, net = newff(P, T, s). The sim(net, in) function is then used to predict the input data. P represents the training sample set, T represents the labels of the training sample set, s represents the network parameters (such as the number of hidden layers), net represents the trained network classification prediction model and in is the input data to be determined.
5. Random Forest (RF): This calls the random forest function classrf_train of MATLAB 2017b to train the model and calls classrf_predict to predict the training samples and test samples.
6. ASGS-CWOA-BP: Upper limit on the number of iterations (T) = 600; number of wolves in the population (N) = 50; range_max = 5 and range_min = −5, i.e., the value range is [−5, 5] in any dimension for the position of one wolf.
Table 3. Experimental records. The first six accuracy columns give the classification accuracy on the training set and the last six the classification accuracy on the test set.

Order | ASGS-CWOA-BP | GA-BP | PSO-BP | LWCA-BP | BP | RF | ASGS-CWOA-BP | GA-BP | PSO-BP | LWCA-BP | BP | RF
1 | 0.9233 | 0.885 | 0.9094 | 0.9094 | 0.8921 | 1 | 0.9577 | 0.8028 | 0.8732 | 0.8592 | 0.7887 | 0.7746
2 | 0.9373 | 0.878 | 0.892 | 0.9059 | 0.8921 | 1 | 0.9296 | 0.8873 | 0.831 | 0.8732 | 0.7887 | 0.7887
3 | 0.9408 | 0.885 | 0.885 | 0.9094 | 0.9164 | 1 | 0.9014 | 0.831 | 0.8732 | 0.8732 | 0.831 | 0.7887
4 | 0.9373 | 0.8746 | 0.9059 | 0.9164 | 0.9094 | 1 | 0.9014 | 0.831 | 0.9014 | 0.8732 | 0.831 | 0.7746
5 | 0.9338 | 0.9164 | 0.9024 | 0.9199 | 0.9094 | 1 | 0.8873 | 0.8873 | 0.8451 | 0.8592 | 0.831 | 0.7746
6 | 0.9164 | 0.8711 | 0.8885 | 0.9338 | 0.9094 | 1 | 0.8732 | 0.831 | 0.8732 | 0.9437 | 0.831 | 0.7887
7 | 0.9477 | 0.8955 | 0.8885 | 0.9268 | 0.9094 | 1 | 0.9014 | 0.8451 | 0.8873 | 0.9014 | 0.831 | 0.7746
8 | 0.9547 | 0.892 | 0.8955 | 0.9129 | 0.9094 | 1 | 0.9437 | 0.8028 | 0.8732 | 0.8592 | 0.831 | 0.7887
9 | 0.9373 | 0.9094 | 0.878 | 0.9164 | 0.9094 | 1 | 0.9014 | 0.8592 | 0.8732 | 0.8451 | 0.831 | 0.7887
10 | 0.9199 | 0.892 | 0.9129 | 0.9129 | 0.9338 | 1 | 0.9014 | 0.8592 | 0.8451 | 0.9014 | 0.8592 | 0.7887
11 | 0.9477 | 0.8815 | 0.899 | 0.9199 | 0.9338 | 1 | 0.9155 | 0.8028 | 0.831 | 0.9155 | 0.831 | 0.7887
12 | 0.9408 | 0.8955 | 0.9164 | 0.9164 | 0.9338 | 1 | 0.9296 | 0.8732 | 0.8873 | 0.8873 | 0.831 | 0.7887
13 | 0.9338 | 0.8885 | 0.8746 | 0.9059 | 0.9408 | 1 | 0.9155 | 0.831 | 0.8028 | 0.8873 | 0.8451 | 0.7746
14 | 0.9164 | 0.9024 | 0.892 | 0.9024 | 0.9408 | 1 | 0.9014 | 0.8873 | 0.8732 | 0.8732 | 0.8451 | 0.7746
15 | 0.9233 | 0.8815 | 0.9164 | 0.9129 | 0.9408 | 1 | 0.9155 | 0.8592 | 0.8873 | 0.831 | 0.8451 | 0.7887
16 | 0.9164 | 0.892 | 0.885 | 0.9129 | 0.9408 | 1 | 0.9014 | 0.831 | 0.8451 | 0.8592 | 0.8451 | 0.7887
17 | 0.9268 | 0.9024 | 0.8955 | 0.9094 | 0.9408 | 1 | 0.9296 | 0.8451 | 0.8873 | 0.8873 | 0.8451 | 0.7887
18 | 0.9338 | 0.885 | 0.9164 | 0.9094 | 0.9408 | 1 | 0.9577 | 0.831 | 0.8732 | 0.8592 | 0.8451 | 0.7887
19 | 0.9233 | 0.892 | 0.9024 | 0.9129 | 0.9408 | 1 | 0.9155 | 0.8169 | 0.831 | 0.8732 | 0.8451 | 0.7746
20 | 0.9408 | 0.8955 | 0.885 | 0.9059 | 0.9408 | 1 | 0.9014 | 0.8732 | 0.8732 | 0.8169 | 0.8451 | 0.7887
21 | 0.9373 | 0.8955 | 0.885 | 0.9199 | 0.9408 | 1 | 0.9296 | 0.8451 | 0.8732 | 0.9155 | 0.8451 | 0.7887
22 | 0.9338 | 0.8955 | 0.8955 | 0.9199 | 0.9408 | 1 | 0.9296 | 0.8169 | 0.8592 | 0.9014 | 0.8451 | 0.7887
23 | 0.9477 | 0.892 | 0.8885 | 0.9094 | 0.9408 | 1 | 0.9155 | 0.8028 | 0.8028 | 0.9155 | 0.8451 | 0.7887
24 | 0.9268 | 0.8815 | 0.899 | 0.9129 | 0.9408 | 1 | 0.9014 | 0.8169 | 0.8169 | 0.8732 | 0.8451 | 0.7887
25 | 0.9408 | 0.8711 | 0.899 | 0.9164 | 0.9408 | 1 | 0.9296 | 0.7746 | 0.8028 | 0.831 | 0.8451 | 0.7887
26 | 0.9303 | 0.8711 | 0.9094 | 0.9059 | 0.9408 | 1 | 0.9296 | 0.8028 | 0.8732 | 0.8028 | 0.8451 | 0.7746
27 | 0.9233 | 0.8641 | 0.9024 | 0.9164 | 0.9408 | 1 | 0.8873 | 0.8169 | 0.8592 | 0.8732 | 0.8451 | 0.7746
28 | 0.9303 | 0.8711 | 0.9094 | 0.9059 | 0.9408 | 1 | 0.9155 | 0.8873 | 0.8592 | 0.9155 | 0.8451 | 0.7887
29 | 0.9408 | 0.899 | 0.878 | 0.9164 | 0.9408 | 1 | 0.9155 | 0.8169 | 0.8592 | 0.8873 | 0.8451 | 0.7887
30 | 0.9373 | 0.885 | 0.8885 | 0.9129 | 0.9408 | 1 | 0.8873 | 0.8451 | 0.8732 | 0.9014 | 0.8451 | 0.7746
Table 4. Statistical analysis of classification accuracy.

Method | Best Accuracy (Training Set) | Average (Training Set) | Variance (Training Set) | Corresponding Accuracy (Test Set) | Average (Test Set) | Variance (Test Set)
ASGS-CWOA-BP | 95.47% | 0.9333 | 0.0101 | 94.37% | 0.9141 | 0.02
GA-BP | 91.64% | 0.888 | 0.0122 | 88.73% | 0.8371 | 0.0302
PSO-BP | 90.59% | 0.8965 | 0.012 | 90.14% | 0.8582 | 0.0272
LWCA-BP | 93.38% | 0.9136 | 0.0067 | 94.37% | 0.8765 | 0.0317
BP | 94.08% | 0.9297 | 0.0163 | 84.51% | 0.8376 | 0.0151
RF | 100% | 1 | 0 | 78.87% | 0.784 | 0.0068
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

