1. Introduction
Complex plasma is a weakly ionized gas containing small solid charged particles. It has been widely used in the fields of plasma etching, material processing [
1,
2,
3], aerospace plasma propulsion [
4,
5], etc. However, as the integration density and complexity of semiconductor devices continue to increase in integrated circuits [
6], the shrinking size imposes increasingly stringent requirements on the precision of processes such as etching and materials processing [
7,
8]. And it is an indispensable means of developing some diagnostics for process technologies at a higher level in the plasma industry [
9,
10,
11].
Spectral analysis is a widely used tool for plasma diagnostics and process monitoring during plasma material processing and is significantly important for ensuring processing precision, improving product yield, and controlling processing costs [
12,
13]. In atmospheric pressure plasma [
14], the spectral lines of air and argon plasma emission were recorded and analyzed, and plasma parameters such as discharge current, discharge gap, and electron temperature were measured, which are used to control the process of materials’ surface modification promptly. The intensity of plasma emissive spectra could be enhanced by an argon additive; thus, the electron concentration as well as the energy was increased, which finally prompted the ionization rate to produce active N, O, and O
3. In the work of Chen et al. [
15], the relationships between processing parameters and plasma temperature during the laser additive manufacturing process were studied using the spectral diagnosis method. By constructing the corresponding relevance of time and plasma temperatures, the defect diagnosis of the whole cladding process was realized with relative accuracy. Jeong and Kim [
16] investigated the behavior of nitrogen-active species during the pulsed dc plasma nitriding process, and emission spectra measurements were performed for various treatment temperatures and different gas pressures in the reactor. In the study [
17], the spectral intensities were influenced by process parameters, which play a very important role in textile material surface modifications. Some researchers use collision radiation models to conduct quantitative analysis of plasma emission spectra [
18], but the model relies on spectral diagnosis is affected by the deviation contained in basic physical data such as collision cross section, resulting in errors in the diagnostic results [
19,
20,
21,
22].
Overall, in spectral analysis, in order to efficiently obtain the specific spectral information related to the parameters, the relationship between spectral features and plasma parameters needs to be recognized. However, due to the large amount of spectral data and the complex quantitative relationship between the spectrum and discharge parameters, traditional manual diagnostic methods have low accuracy and are time-consuming. Therefore, the exploration of a rapid and accurate recognition method for plasma spectra has become an important problem to be solved.
In recent years, deep learning has attracted increasing interest in plasma applications. Interpreting convolutional neural networks for real-time volatile organic compounds’ detection and classification using the OES of plasma [
23]. Wang et al. [
24] developed and tested an efficient data acquisition platform for a plasma emission spectrum deep ANN neural network in aqueous solution. Kruger et al. [
25] developed deep ANN plasma–surface interface for coupling sputtering and gas-phase transport simulations. Grelier et al. [
26] proposed the deep learning-based process for the automatic detection, tracking, and classification of thermal events on the in-vessel components of fusion reactors. Shin [
27] proposed early-stage lung cancer diagnosis via the spectroscopic analysis of circulating exosomes with the help of deep learning.
Ethylene discharge has important applications in many fields such as polymer synthesis, surface modification, and pollutant degradation. This paper proposes an ethylene discharge spectrum recognition model based on a deep convolutional neural network. Our main contributions are as follows.
- (1)
A total of 8236 ethylene plasma spectra were collected with the discharge radio frequency (rf) powers within 60–69 W in a rf plasma discharge system. The dataset consisted of 10 classes with the labels ranging from 0 to 9.
- (2)
In our model, the deep convolutional neural network was used to achieve better effects in regard to data recognition because of its strong feature learning ability, which can automatically extract features from input data. A residual shrinkage block was added to the deep convolutional neural network, which finds the less important features through the attention mechanism, and then the soft threshold function filters them out and discards this redundant information, and finally important information during feature extraction is retained. Moreover, it introduces shortcut connections to allow gradients to propagate directly from the input layer to the output layer, effectively preserving important data features even as the network depth increases, thereby improving the recognition accuracy of numerical data.
- (3)
In this study, a deep convolutional neural network was constructed for the accurate recognition of the collected ethylene plasma spectral data under each label to obtain the corresponding rf power. This model can not only recognize the macro experimental parameters including rf power, gas pressure, gas ratio, and so on, but also plasma microscopic parameters including the temperatures and number densities of electrons and ions corresponding to the discharge spectra during plasma discharge. Due to the model having the ability to learn and recognize the data features of different parameters, when the plasma parameters change, the model can still perform effective recognition, but requires updating of the dataset, and retraining and retesting of the model. This model provides a new technique for plasma spectrum diagnosis.
2. Data Collection
The plasma was generated using ethylene gas discharge in a capacitively coupled rf plasma discharge system with the gas pressure of 190 Pa. In this experiment, during plasma discharge, the spectral data were collected by the spectrometer (PG2000-Pro) through the spectral acquisition software while deducting the background noise data.
Figure 1a shows the top view of the setup, in which the discharge spectrum is collected and stored by a spectrometer (370–1050 nm) through a fixed fiber from window A. During the experiment, the glow generated by the ethylene plasma contains the important information of plasma spectrum and plasma discharge parameters. Visually, a discharge glow image offers a more intuitive representation; for example, the glow image at 69 W and 190 Pa taken from the side view is shown in
Figure 1b.
Figure 2a represents the corresponding spectrum (with 2048 wavelengths) of 69 W after deducting background noise, showing that strong peaks appear at some specific wavelengths of 385.68, 450.28, etc. The spectral curves after deducting background noise collected under different rf powers from 60 to 69 W are shown in
Figure 2b. It can be seen that the discharge intensity varies with rf power, i.e., the higher the rf power, the stronger the discharge intensity. In addition, it is seen that for a certain specific spectral curve, it is difficult to quickly identify the corresponding discharge power without a suitable model. In this paper, 60 to 69 W of rf power is only taken as an example to show the discharge macroscopic parameter. When the discharge power changes continuously, it is only necessary to update the dataset, and retrain and test the model. The model still has the ability to learn and recognize spectral features under different parameters; therefore, the discrete values of rf discharge power will not limit the application of the model.
When considering the model building and dataset construction, the file sizes of different formats should be compared. Compared with the collected spectral data files (in csv format), glow images (in bmp or jpg format) and the plotted spectral curve graphs (in png format) tend to have larger file sizes and more complexity. If using glow images and the plotted spectral curve graphs to build a dataset, it is necessary to construct a complicated 2D convolutional neural network. Meanwhile, for the spectral data file in csv format, because of its obviously smaller file size and the 1D data, only a 1D convolutional neural network is required. For example, the data for 10 spectral curves in csv format are 18 kb, while the data of the same 10 spectral curves are plotted as a graph in png format with a file size of 172 kb for the resolution of 300 dpi and 297 kb for the resolution of 600 dpi. In addition, the spectral data offer flexibility in terms of storage formats, ranging from simple integers to complex float point numbers. Its flexibility allows the numerical data to be adapted to the needs of a variety of applications, and the precise numerical representation of the data helps to achieve greater accuracy in data analysis and processing tasks. Thus, the 1D spectral data in csv format were used to construct the dataset and be recognized by our proposed 1D model as presented in this paper. For the construction of the ethylene spectral dataset, the data file of 8343 spectral curves was collected under different rf power, as shown in
Table 1. The total data files for the collected spectral curves can be classified into 10 classes with different labels corresponding to different rf powers. In the constructed dataset, the data of 5840 (or 70%) spectral curves were used to train the proposed model and the data of the remaining 2503 (or 30%) spectral curves were used to test the model.
3. Network Structure
3.1. Overall Design Ideas
In this study, a 1D deep convolutional neural network was constructed for recognition of the ethylene spectral data. First, the spectral data files (in csv format) with different labels were input into the model for training to learn the data features. During the training process, the model was saved with updated parameters. When the accuracy and loss reached the expected values after multiple iteration training, the trained model was ultimately saved for later use. Then, each spectral datum in the test dataset was input into the saved model to output the recognized label value (i.e., the rf power).
Figure 3 shows the constructed model structure, which is mainly composed of the convolution layers and residual shrinkage (RS) blocks, marked by dashed lines. The RS block is composed of global average pooling, two fully connected layers, ReLU, and Sigmoid activation function. The model input size is 2048 × 1, and all the convolutional kernel sizes for Conv1-Conv4 in the blocks are 3 × 3 with the stride of 1 (there are two convolution layers with the same name of Conv1, meaning the same parameter in these two layers). The number of filters for Conv1-Conv4 and RS1-RS4 are 32, 64, 128, and 256, respectively. The last layer is the fully connected layer with the activation function of Sigmoid for the category of the output label.
This network not only contributes to the application of deep convolutional neural networks in 1D complex plasma data recognition tasks, but also innovates and improves the structure and learning mechanism of deep neural networks at the theoretical level by introducing residual blocks to solve the problem of gradient disappearance in deep networks to successfully train deeper structures and by introducing soft thresholding to dynamically adjust the weight of features according to the importance of features, thus suppressing unimportant features and strengthening key features. These theoretical contributions not only help to improve the performance of deep convolutional neural network models, but also provide new ideas and directions for the development of deep networks. Since the plasma spectral data is 1D, this model uses a 1D convolution kernel to process such data, which enables the network to capture local 1D features more efficiently. This model expands the application range of deep convolutional neural networks in signal processing, fault diagnosis, time series analysis, and other fields.
Since the plasma spectral data are 1D, this model uses 1D convolution to process such data, which enables the network to capture local 1D data features more efficiently. It should be pointed out that the input file in this paper is in csv format, not in an image format, which can be shown from the input files (0.csv–9.csv) in
Figure 3. Due to the fact that the input spectral data file in csv format is 1D and significantly smaller than that in 2D images, the recognition efficiency of the model can be greatly improved.
The structural parameters that need to be adjusted include the kernel size, channel and step size of the convolution, etc. The selection and adjustment of these parameters have an important impact on the performance of the model. The structural parameters of the model are determined by continuous trial and adjustment according to the input data characteristics to improve the performance of the convolutional neural network. The detailed parameters in the proposed network for the input, operation, and output of the model are shown in
Table 2.
3.2. Soft Thresholding
In the acquisition of plasma spectral data, there is often a lot of noise, which will interfere with the feature extraction and pattern recognition of the signal. When traditional networks process the signal with strong noise, it will be disturbed by the noise, which leads to the deterioration of the recognition performance. The model processes the input data by introducing a soft threshold function, which enables the network to adaptively learn and adjust the threshold of each sample. This soft thresholding mechanism can filter out noise-related features effectively and retain useful information related to tasks, so as to improve the recognition ability of noisy data. Compared with the traditional fixed threshold method, the soft threshold function in this network can adaptively set the threshold according to the characteristics of each sample. The ability of adaptive threshold setting enables the model to better adapt to data with different noise levels and distributions, and improves the generalization performance of the model.
Soft thresholding segmentation is a crucial step in many signal denoising methods. However, in classical wavelet thresholding, the design of filters requires a significant amount of expertise in signal processing, which is a challenging problem. Deep learning provides a novel approach to address this problem by using gradient descent algorithms to automatically learn filters instead of them being manually designed by experts. Therefore, the combination of soft thresholding and deep learning is an effective method for removing noise-related information and constructing highly discriminative features. The formula for soft thresholding can be represented as Equation (1).
where
,
, and
represent the input feature, output feature and threshold (constant value during the training and test), respectively, and all of these three parameters are positive values. Soft thresholding is used to convert the near-zero features to zeros. From Equation (1), it can be observed that the derivative of the output
with respect to the input
is either 1 or 0, which can be represented as Equation (2) and mitigate the issues of gradient vanishing and exploding.
3.3. The Residual Shrinkage Block
Figure 4 is a residual shrinkage block with channel thresholds, in which
is the number of convolutional kernels in the convolutional layer,
is the number of neurons in the FC layer, and
,
, and 1 in
are the indicators of the number of channels, width, and height of the feature map, respectively.
,
, and
are the indicators of the feature maps to be used when determining thresholds. The first two convolution layers, two batch normalization, and two activation functions transform the characteristics of redundant information into values close to zero, but transform useful features into values far from zero, which is then propagated into the two layers of the FC layer with multiple neurons (the number of neurons equals the number of channels in the input feature map). The output value of the
th channel of the FC layer is scaled to a range of (0, 1) by Equation (3), which automatically learns a set of thresholds. Redundant features are eliminated and useful features are retained using soft thresholding.
where
and
are the features of the
th neuron and the
th scaling parameter, respectively. The threshold value,
, is calculated in Equation (4).
is the threshold of the
th channel of the feature map.
,
and
represent the dimensions of width, height, and channel of the feature
, respectively.
means the average of the feature responses for each channel, which is used for the subsequent threshold calculation and channel weight adjustment.
3.4. Algorithmic Innovations
Firstly, the network solves the problem of gradient disappearance in deep neural networks by introducing residual connections, which allows the network to learn the residuals between inputs and outputs, helps the network to better propagate gradients during training, and enables deeper network structures to be trained. This structure is essential for capturing complex and abstract features.
Then, there was the introduction of a soft thresholding mechanism, a non-linear transformation for adaptive scaling of a feature map during feature learning. Through soft thresholding, the network can dynamically adjust the weight of features, suppress unimportant features, and enhance key features. This mechanism is particularly effective for dealing with the signals with noise or complex data because it can reduce noise interference of feature representations and improve the performance of the model.
Finally, the adaptive feature learning process is realized by combining residual learning and soft thresholding. This adaptability allows the network to dynamically adjust the feature representation according to different tasks and data characteristics, so as to extract and retain the features that are important to the recognition task more effectively. In addition, the design of the network also takes into account the characteristics of tasks and data, and uses a 1D convolution kernel and adaptive feature learning to optimize the performance of the model, so as to improve the classification accuracy.
On the whole, this network structure combines two mechanisms of residual learning and soft thresholding, which are designed to improve the feature learning ability of deep neural networks on complex and noisy data and improve the recognition accuracy.
5. Conclusions
In this paper, the deep convolutional neural network was proposed for recognizing the macro parameters of discharge from the corresponding ethylene plasma spectrum. The proposed network has strong feature learning and extraction ability with a residual shrinkage block which finds the less important features through the attention mechanism and removes the unimportant information by embedding soft thresholds. In addition, a shortcut connection is added, allowing gradients to propagate directly from the input layer to the output layer, thereby effectively preserving important data features and improving the recognition accuracy. This model can effectively recognize ethylene plasma spectral data with all the four evaluation indicators of higher than 98%. Compared with four other classical recognition models, our model shows the best recognition performance. This model can not only recognize the macro plasma discharge parameters including rf power, gas pressure, gas ratio, and so on, but also plasma microscopic parameters including the temperatures and densities of electrons and ions corresponding to the spectra, which provides technical support for plasma spectrum diagnosis and plasma application in industry.