1. Introduction
Modern day industry witnesses both the advantages and challenges arising from the extensive amount of continuously streaming data enabled by the contemporary Information and Communications Technology (ICT) and initiatives, such as “Industrie 4.0” and “Made in China 2025.” The key objectives in such initiatives include reaching maximal uptime throughout the production chain and increasing productivity while reducing the production cost [
1]. One of the important approaches supporting such objectives is the Condition-Based Maintenance (CBM), where the degradation process of the system is monitored and perhaps even predicted before the breakdown [
2]. CBM benefits from the information contained in the abundant data provided by numerous sensors and other data sources, but on the other hand, the selection of the useful data remains a challenge. The large data sets are subjected to substantial selection bias and measurement errors, which have an enormous influence on the decisions made based on the data [
3]. In the fields of CBM and monitoring, a significant part of development should be directed to data pre-processing including data selection and preparation, feature extraction and selection, and model selection in order to avoid the “garbage in—garbage out” scenarios.
This study focuses on the condition monitoring of roller levelers which are used in steel factories to straighten steel strips after final rolling, heat treatment or cooling operations. These machines are exposed to high mechanical stresses [
4] due to the advanced steel grades processed [
5] and the increasing demands for productivity. Therefore, the monitoring of such machines becomes increasingly important, but relatively few studies deal with the condition monitoring of such machines [
6]. Instead, most of the studies are concerned with process simulations, parameter analysis, and the behavior of material in the leveling process [
7,
8].
Vibration analysis was selected as the technique to be utilized in the monitoring in this study, because acceleration signals are sensitive to the varying forces inflicted on the machine and have proven useful in various applications [
9]. Other commonly used techniques in industrial monitoring include acoustic emission, oil debris, ultrasound, and temperature monitoring [
10]. The acoustic emission is often used for similar tasks as the vibration, but it has practical challenges resulting from the lack of established calibration methods [
11,
12], the requirement of direct signal path through solid material [
12], and high sampling rates [
13]. In addition, the reliability of acoustic measurements [
14,
15] may be compromised due to the extensive background noises in factories. For these reasons, the well-established vibration techniques appear more appropriate for many industrial applications. Instead of analyzing the individual samples in the signals, vibration signals are commonly monitored based on features and signal transformations [
16,
17]. However, the direct use of an autoregressive model [
18] and deep learning algorithms [
19] on the signals have been proposed as well. Due to the noisy industrial environment, the conventional feature engineering approach was considered more appropriate for the approach studied here. In the case of roller levelers, there is, however, limited information on which features are useful for the monitoring of structural vibration when the monitoring is not targeted at a specific component, such as the bearing, shaft, or gear. Correlations between a few vibration features and machine parameters were studied in [
6], but it was not shown how this information could be used in monitoring.
To get insight into the operation of the machine, an appropriate solution could be the generation of a large set of features and transformations from the vibration signals, i.e., the production of a high-dimensional data set for data mining. In order to identify the characteristics or the typical operation of the machine, the use of statistical features provides a practical solution. In this study, features such as the generalized norms [
20], other features derived from them, and percentiles in short time windows are used. Such features have proven useful in previous applications in the monitoring of machine vibration [
4,
9,
21]. Before extracting these time domain features, the complete signals are also filtered by using various frequency bands with different sizes. Furthermore, the amplitudes of individual frequency components are computed, and statistical features are extracted from them in the successive time windows. The use of narrow frequency bands and individual frequency components could reveal effects masked behind other full or wide bandwidth effects and disturbances, which can be intensive in industrial surroundings. The flowchart for system identification in
Figure 1 illustrates how the signal filtering and feature generation phases relate to the overall data mining procedure applied.
To make the application of a high-dimensional feature set practical, efficient feature selection algorithms are required. With such algorithms, the irrelevant and redundant features are removed, and the predictive qualities and comprehensibility of the selected feature set are improved. A multitude of approaches has been proposed for the computational feature selection [
22]. These solutions are traditionally categorized as filters, wrappers, and embedded approaches [
23]. The significance of selecting the right solution is pronounced with high-dimensional data, where the building of a global model with a complete set of features is not practical. In such circumstances, many classical approaches such as the least squares linear regression [
24] or genetic algorithms [
25] perform unfavorably due to the bias-variance trade-off and overfitting. Many of the methods try to find a good set of variables rather than the optimal set [
25], which is also the expected end-result in the case of high-dimensional data sets. To tackle the problems of high-dimensionality, different solutions have been introduced, which include narrowing down the search space and the use of simple methods with low computation costs [
26]. These principles are also followed in this study: Two relatively simple wrapper approaches and the combination of filters and exhaustive search are tested. The wrappers include the sequential forward selection [
27] and the floating search algorithm proposed in [
28]. To narrow down the search space, several information-theoretic filters [
29] were first applied to generate small feature subsets. The exhaustive search was then applied in the modeling stage to find the optimal feature combination from the small subsets.
Numerous data-driven algorithms have been introduced to the condition monitoring of machines [
30]. Quite common targets of application include the automated diagnosis of bearing faults [
31] or gear and shaft damage [
32]. Many of these approaches are based on the principle of supervised classification, which uses the presumption that all the monitored machine states can be trained to the classifier, which then gives a categorical (qualitative) representation of the machine condition. Alternatively, regression models can be used to estimate quantitative responses, such as the relative stress level [
4] or other health indicators [
33] from the monitored system. However, the solutions where the signal features are combined with the processed materials such as the steel strips in roller levelers, as shown in
Figure 1, are rare [
4]. The machine vibrations are associated with the processed materials, operational parameters, and external disturbances, which should be considered, because they affect the success of diagnosis [
21] and prognosis [
34]. The changes of operational parameters and operating states in industrial applications make the direct monitoring of single features challenging. Regression models are practical for combining the feature values with each operating state and for identifying the typical values in each state. In addition, the features together provide information that individual features cannot provide, which eventually may increase the modeling accuracy.
The previous approaches to the regression-based estimation of steel properties in steel forming include yield strength and tensile strength prediction [
35] and hardness prediction [
36] based on steel composition and other processing parameters. Additionally, process data have been used to predict processing parameters, such as force, torque and slab temperature in a rolling process by using genetic programming [
37]. Then again, the modeling applications in roller leveling mostly focus on the analytical modeling of material behavior [
8,
38] or the leveling process [
39,
40,
41]. Such approaches do not focus on the vibrations in the process. However, analytical models and vibration measurements were used in [
42,
43] to study the polygonal wear on work rolls in a hot leveler. In a recent study [
4], the relative stress inflicted on a roller leveler was estimated based on vibration measurements.
The reported research on how the structural vibrations during leveling could be used to monitor the process is incomplete and rare. Therefore, this study proposes a data-driven approach to the identification of typical vibrations during the leveling process, which is illustrated in
Figure 1. This approach can be used (1) in the online estimation of the strip properties based on regression models, and (2) in the detection of deviant operation based on Statistical Process Control (SPC) charts for the regression residuals. Although the residuals are monitored based on the strip properties as depicted in
Figure 1, the deviations reveal more about the vibration response (or operation) of the machine than flaws in the steel strips. The steel strip properties were selected as the monitored variables, because the operation of the entire leveling process is dependent on them and no data of leveling parameters synchronized with the instantaneous vibration were available for this study.
The feature set extracted from the acceleration signals was high-dimensional, and therefore only relatively simple regression models were tested in order to have manageable computation time in the model training phase. Multiple Linear Regression (MLR) was used to identify the linear relationship between the steel strip properties and vibration. The Generalized Regression Neural Network (GRNN) [
44] was used to train models which are free from the linearity assumption. The moving mean and range charts [
45] were selected as the SPC method for roller leveler monitoring due to their immediate response to new data points and the smoothing effect of the moving mean which is useful in the case of noisy industrial data.
The remainder of the paper is organized as follows. The roller leveling process, measurements, and the methods behind the steps illustrated in
Figure 1 are presented in
Section 2.
Section 3 provides the results and
Section 4 discusses the findings and highlights the future research directions. Finally,
Section 5 concludes the paper.