1. Introduction
With the development of economy and society, the trend of seasonal and periodical shortages of power supply has become normalized, the problem of energy supply security is becoming more and more prominent, and it has become increasingly important to alleviate the contradiction between the supply and demand sides [
1]. The scheduling and regulation of flexible loads can effectively alleviate the contradiction between supply and demand, which is an important means of securing power supply [
2]. The forecasting of flexible loads can predict the adjustable space of flexible loads in advance, which is of great significance to provide relief from supply and demand-side conflicts [
3].
Air conditioners [
4], electric vehicles [
5], etc., are typical flexible loads whose data are generally included in the overall customer load data and cannot be predicted directly. Influenced by weather, time, electricity price, incentive policies, and realistic demand [
6,
7,
8], flexible loads are highly volatile and random, which also affects the prediction accuracy of the overall load. Therefore, it is practical to decompose the flexible load from the total load and then make predictions; however, there are few prediction research results for flexible load.
There are two existing load disaggregation methods: intrusive load monitoring (ILM) and non-intrusive load monitoring (NILM). The ILM method installs a separate sensor for each load, eliminating the load disaggregation step and allowing access to accurate energy consumption information for each load. However, the method requires the installation of sensors for each load of each occupant or building, which is time-consuming and labor-intensive when conducting the rollout. Additionally, the ILM method interferes with the normal life of users and involves user privacy issues when collecting data. The NILM method only needs to add a power meter to the main electrical panel of each user or building, which does not disturb the occupants during data collection. By simply installing a single power meter for the whole floor or building to obtain the overall load data of the user or building, through the load decomposition algorithm, the energy consumption information of each load can be obtained. With the continuous development of machine learning, the load decomposition accuracy of decomposition is rising, and therefore the NILM method is widely used [
9].
This paper proposes the use of the load decomposition technique to extract the flexible load from the total load, retain the flexible load characteristics, obtain the data sample set of the flexible load, and then perform flexible load prediction based on the flexible load data sample set.
The current non-invasive load decomposition (NILD) technique possesses high accuracy [
10,
11,
12,
13,
14,
15,
16,
17,
18,
19], but the flexible load state is variable and the power threshold changes, and the traditional decomposition method requires event detection, which is not applicable to flexible load decomposition [
10,
11]. Deep-learning-based methods, which do not require event detection, improve the accuracy of flexible load decomposition by performing feature extraction and feature mapping of the target load to the overall load [
12,
13,
14]. For feature extraction, graph signal processing theory [
15], fast Fourier transform [
16], and spectral map theory are mostly used, which can extract load features to a certain extent but cannot distinguish multi-state loads. In this paper, a one-dimensional convolutional neural network (CNN) is adopted for feature extraction, and CNN can effectively extract the deep-level features embedded in the data with reduced algorithm complexity. In terms of feature mapping, the bi-directional long short-term neural network (BiLSTM) overcomes the drawback of the long short-term memory neural network (LSTM) network only being able to perform one-way learning, and improves the feature mapping effect through bi-directional learning [
19].
Combinatorial prediction models are the current hotspot in load prediction, which generally use wavelet decomposition [
20], empirical modal decomposition [
21], variational modal decomposition [
22], and complete ensemble empirical modal decomposition to decompose the load sequence into multiple components to reduce the load nonlinearity, and then use LSTM, BiLSTM, and other methods for prediction. This type of method is based on single-step prediction, and the accuracy of multi-step prediction is poor, which is not suitable for the actual demand of scheduling. Meanwhile, this combined prediction method is prone to data leakage when performing sequence decomposition [
23].
Compared with traditional deep learning, CNN, LSTM, etc., transformer models can capture long-range dependencies between time series more effectively, support parallel training of the model, and have better model performance [
24], which is mostly used in language recognition, text recognition, etc. Informer model is an improvement based on a transformer model, in which a self-attention mechanism with the addition of probabilistic sparsification, which not only can avoid the recursive transmission between information and effectively capture the effective information between sequences [
25], but also reduces the complexity in the temporal sequence, improves the prediction accuracy and efficiency, and is more suitable for prediction of temporal data. The method does not have a data leakage problem and performs better in multi-step prediction.
In summary, this paper proposes a two-stage flexible load prediction method based on non-intrusive decomposition technique. In the first stage, the corresponding data are collected according to the smart meters on the customer side, a deep learning model based on CNN-BiLSTM is constructed using a non-intrusive load decomposition technique, the CNN method is used to obtain the flexible load features, and the BiLSTM method is used to perform flexible load feature mapping to accurately decompose the flexible load from the total load. In the second stage, the Informer model is used to perform multi-step flexible load prediction based on the sample set of flexible load data decomposed in the first step. Finally, the method of this paper is applied to a regional electricity load data set, and the results show that the model of this paper can decompose the flexible load from the total load, retain the change characteristics of the flexible load, and realize the future prediction of the flexible load and the residual load, respectively, and the prediction accuracy of Informer is significantly improved compared with the traditional prediction model when multi-step prediction is performed, and the more the number of prediction steps, the more obvious the improvement is. In this paper, the model is validated using two datasets, where dataset 1, the prediction coefficients of determination for flexible load air conditioning and electric vehicles, are 0.9329 and 0.9892. The predicted value of the total load is obtained by adding the flexible load to the residual load. At a prediction step of 1, the total load prediction coefficient of determination is 0.9813, which improves the prediction coefficient of determination by 0.0069 compared to the direct prediction of the total load, and prediction decision coefficient improves by 0.067 at 20 predicted steps. When applied to data set 2, the prediction coefficient of determination for flexible load air conditioning is 0.9646.
5. Experimental Verification
In this paper, two datasets were selected to validate the model. In dataset one, the experimental data were obtained from the electricity consumption data of a region, and the load data include ten kinds of loads, such as air conditioners, refrigerators, computers, dishwashers, and electric cars. For dataset two, data were obtained from the REDD dataset. The REDD dataset, published by MIT, contains energy use data from six different households over a period of weeks. In this paper, we selected house6 in the REDD dataset, which includes nine types of equipment such as air conditioners, air handling units, dishwashers, and electric furnaces (the link to the REDD dataset is
http://redd.csail.mit.edu/ (accessed on 21 May 2022)).
We used Python and Pytorch to build the Informer and NILD-Informer models. Two models, CNN-BiLSTM and VMD-CNN-BiLSTM, were built with Python and Tensorflow 2.0, and the NILD-Informer model proposed in this paper was compared with the other three models for validation.
5.1. Model Parameters
To ensure the rationality of the validation method, the Informer model kept the same parameters, the parameters of VMD were optimized according to the literature [
27], and the CNN and BiLSTM parameters were optimized according to the minimum prediction error, as shown in
Table 1. Detailed model parameters are shown in
Figure A1 and
Table A1 in
Appendix A.
5.2. Experiment 1
The experimental data were collected for 30 days from 1 July to 30 July 2019, with a sampling interval of 6 s once, for a total of 432,000 data.
5.2.1. Load Decomposition
The load data include ten kinds of loads such as air conditioners, refrigerators, computers, dishwashers, electric cars, etc. In this paper, Python and Tensorflow 2.0 were utilized. Both constructed a non-invasive load decomposition model based on the CNN-BiLSTM network. The data from 1 to 3 July were selected for training. The data of 4 July were selected as the test set, and the electricity consumption of each load on 4 July is shown in
Figure 7. The model was used to decompose the air conditioning load and the electric vehicle load from the total load. The decomposition results are shown in
Figure 8, and the decomposition accuracy is shown in
Table 2:
The results from
Figure 7 and
Figure 8 and
Table 2 show that the load power of air conditioning and electric vehicles is larger, with obvious load characteristics and high decomposition accuracy. The air conditioning and electric vehicle loads were subtracted from the total load to form three types of loads: air conditioning, electric vehicle, and residual load. The following is a multi-step prediction for each of the three types of loads.
5.2.2. Prediction and Result Analysis
We decomposed and reconstructed the data from 5 to 30 July, and the data sampling interval was adjusted to once every 5 min, with a total of 7488 data. Using the data from the first 25 days for training, the last day’s data were decomposed in multiple steps, and 1, 5, 10, and 20 steps were predicted, respectively, i.e., the prediction time scales were 5 min, 10 min, 50 min, and 100 min.
5.2.3. Decomposition of Prediction Results
Figure 9 shows the prediction results for the 10-step prediction of air conditioning load and electric vehicle load, as shown in
Figure 9a,b, respectively, and
Table 3 shows the evaluation indexes for the 10-step prediction of air conditioning and electric vehicles. Results for the other prediction steps are shown in
Figure A2 and
Figure A3 in
Appendix A.
According to the results in
Figure 9, it can be seen that the method possesses high accuracy for multi-step prediction of flexible loads, with prediction decision coefficients of 0.9329 and 0.9892 for the two flexible loads, respectively.
5.2.4. Total Load Prediction Evaluation Index Results
Four models were used for 1-step, 5-step, 10-step, and 20-step prediction, and the results of total load prediction are shown in
Table 4.
5.2.5. Comparative Analysis of Experimental Results
Comparing the experimental results of CNN-BiLSTM and VMD-CNN-BiLSTM, the single-step prediction accuracy of VMD-CNN-BiLSTM was stronger than that of CNN-BiLSTM model, which is because VMD reduces the complexity of loading in line with the current mainstream view. However, when multi-step prediction was performed, the addition of new data made the decomposition error of VMD larger, leading to a greater decrease in prediction accuracy and a less effective prediction than CNN-BiLSTM.
When multi-step prediction was performed, the prediction errors of all four models increased as the number of prediction steps increased. However, the Informer model had a more stable trend of increasing prediction errors. It had better performance in different prediction steps. When the number of prediction steps was 20, the Informer model reduced the prediction error by 12.41% and 9.86% compared with the CNN-BiLSTM and VMD-CNN-BiLSTM prediction models, respectively.
Comparing the results of the NILD-Informer model and the Informer model, the NILD-Informer model was able to predict the load for air conditioning and electric vehicles separately. After reconstructing the prediction results, the multi-step prediction error of the total load of the NILD-Informer model decreased the least. Among them, the NILD-Informer model prediction reduces the RMSE of prediction error by 13.92% compared with the Informer model when 20-step prediction was performed. This indicates that NILD-Informer has higher multi-step prediction accuracy, and after decomposing air conditioners and electric vehicles from the total load, the uncertainty of the load was reduced and the prediction accuracy was improved.
5.2.6. Total Load Prediction Results
The 20-step prediction results for the total load of the above four models are shown in
Figure 10. From the enlarged local details, it can be seen that the prediction fit of the NILD-Informer model is the best compared to the other three models, which can be closer to the true value with less prediction error. Additionally, the NILD-Informer model was able to predict the flexible load accurately, while the other models were not.
Based on the zoomed-in local details, it can be seen that the NILD-Informer model has the best prediction fit compared to the other three models, which can be closer to the true value and has less prediction error. The NILD-Informer model could accurately predict the flexible load, but the other models could not.
5.3. Experiment 2
The data of the REDD data set house6 was selected, which includes 9 types of equipment such as air conditioners, air handling units, dishwashers, and electric stoves, and the collection time was 30 days from 19 April to 19 May 2011, with a sampling interval of 3 s, and a total of 864,000 data.
Decomposition Prediction Results
The first four days of data were used to train the decomposition model. After using the above decomposition model, the data from 23 April to 19 May were decomposed into flexible load data and residual load data, and then the data were reconstructed and the data sampling interval was adjusted to once every 5 min, with a total of 7488 data. The parameters of the decomposition and prediction model were kept consistent with Example 1.
The decomposition forecast of air conditioning load is shown in
Figure 11, the forecast evaluation index is shown in
Table 5, and the total load forecast result is shown in
Figure 12. Results for the other prediction steps are shown in
Figure A4 and
Figure A5 in
Appendix ABased on the above results, it can be seen that applying this paper’s model to the REDD dataset possesses high accuracy in the decomposition prediction of flexible load air conditioning, which is beneficial for flexible loads to join the grid regulation for peak and valley reduction. The zoomed-in results in
Figure 12 show that NILD-Informer has a better fit compared to the other three models. The predicted values are closer to the true values with the smallest prediction errors.