Next Article in Journal
Operational Insights and Future Potential of the Database for Positive Energy Districts
Previous Article in Journal
Simulation Approaches and Validation Issues for Open-Cathode Fuel Cell Systems in Manned and Unmanned Aerial Vehicles
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multihousehold Load Forecasting Based on a Convolutional Neural Network Using Moment Information and Data Augmentation

1
School of Business, University College Dublin, Dublin A94XF34, Ireland
2
School of Electrical Engineering, Korea University, Seoul 02841, Republic of Korea
3
Department of Electrical Engineering, Sangmyung University, Seoul 03016, Republic of Korea
4
Department of Electrical and Control Engineering, Mokpo National University, Muan 58554, Republic of Korea
*
Author to whom correspondence should be addressed.
Energies 2024, 17(4), 902; https://doi.org/10.3390/en17040902
Submission received: 18 January 2024 / Revised: 11 February 2024 / Accepted: 13 February 2024 / Published: 15 February 2024
(This article belongs to the Section A1: Smart Grids and Microgrids)

Abstract

:
Deep learning (DL) networks are a popular choice for short-term load forecasting (STLF) in the residential sector. Hybrid DL methodologies based on convolutional neural networks (CNNs) and long short-term memory networks (LSTMs) have a higher forecasting accuracy than conventional statistical STLF techniques for different types of single-household load series. However, existing load forecasting methodologies are often inefficient when a high load demand persists for a few hours in a day. Peak load consumption is explicitly depicted as a tail in the probability distribution function (PDF) of the load series. Due to the diverse and uncertain nature of peak load demands, DL methodologies have difficulty maintaining consistent forecasting results, particularly when the PDF of the load series has a longer tail. This paper proposes a multihousehold load forecasting strategy based on the collective moment measure (CMM) (which is obtained from the PDF of the load series), data augmentation, and a CNN. Each load series was compared and ordered through CMM indexing, which helped maintain a minimum or constant shifting variance in the dataset inputted to the CNN. Data augmentation was used to enlarge the input dataset and solve the existing data requirement issues of the CNN. With the ordered load series and data augmentation strategy, the simulation results demonstrated a significant improvement in the performance of both single-household and multihousehold load forecasting. The proposed method predicts day-ahead multihousehold loads simultaneously and compares the results based on a single household. The forecasting performance of the proposed method for six different household groups with 10, 20, 30, 50, 80, and 100 household load series was evaluated and compared with those of existing methodologies. The mean absolute percentage error of the prediction results for each multihousehold load series could be improved by more than 3%. This study can help advance the application of DL methods for household load prediction under high-load-demand conditions.

1. Introduction

Short-term load forecasting (STLF) facilitates the operation of power systems as an economical, secure, and reliable program. It is equally important in the residential sector. Recently, STLF methodologies based on statistical, machine learning, deep learning (DL), causal, econometric, and judgment-based algorithms have been developed [1]. For single households, these techniques must deal with granular load series collected from advanced metering infrastructure (AMI). Each series is composed of uncertain load consumption values, which have a high degree of stochasticity, making them difficult to define using a physical measure [2,3,4].
Although people today live and spend time in grouped areas with distinct lifestyles, such as in apartment blocks, department stores, and companies, each household produces a unique load consumption series. The major exogenous factors that bring differences in household load series include the use of automatic thermal devices, installation of electrical vehicles (EVs), penetration of renewable energy resources, and evolution of social demographics. The impact of these factors is expected to increase in the future, and the corresponding load series will become more complex for analysis and prediction. Hence, a new approach that addresses two main issues should be developed: (1) The vast number of customer load series at any given time, and (2) the increasing amount of uncertainty in single-household load series.
The load series from a single household has properties similar to nonstationary signals and has been mainly analyzed based on zero-mean, trend, and seasonal modeling [1]. Many statistical approaches can be used to extract hidden information from unusual load consumptions with the help of consecutive load observations during peak load demand hours [5,6,7]. Relevant customer load series has been explored using a Gaussian process for forecasting the load profiles of target households [8]. With the help of the correlation and L2 norm-based input selection methods, an appropriate sub-load series was extracted for each type of special day [9]. Clustering- and classification-based forecasting methodologies reduce the uncertainty level by batching homogenous customers, days, or weather parameters [9,10,11,12,13]. DL methodologies, such as support vector machine (SVM) [14], multilinear perceptron (MLP) [15], convolutional neural networks (CNNs) [16,17,18], and recurrent neural networks (RNNs) [19], have been used to forecast single-household loads. Pooling [16] and data augmentation [17] are popular algorithms for preparing a training dataset for the CNN from historical load series. A multiple household load forecasting model using Bayesian networks has been presented with various input features such as past consumption, temperature, and socioeconomic and electricity usage aspects [20]. A forecasting model for multiple individual households’ short-term electricity consumption has also been developed using two-level information extraction stages [21], where input datasets are structured in the low-level stage and a combined LSTM and CNN-based forecasting model is developed in the high-level stage. However, existing methodologies are associated with challenges in precisely batching and exploiting the diverse load series, and too weakly structured a dataset to achieve a high forecasting accuracy.
Advanced CNN-based architectures have been developed for image recognition, medical imaging, and natural language processing applications [22]. ImageNet [23] and VGGNet [24] are well-known CNN architectures in which a number of images can be taken as input for classifying multiple image-like objects. These architectures have been mainly developed for data classification problems that are closer to multi-series forecasting than to single-series forecasting. Many load series are recorded in ordered 1D architectures; however, CNN-based architectures operate by accepting three- or higher-dimensional datasets. Few have studied the local correlations between multiple time-series data. Convoluting multi-nonstationary signals can produce a better performance by exploiting the local minima, correlations, and dependencies [25]. Multisite PV profiles are predicted by developing a space–time matrix using a greedy adjoining algorithm [26]. For better accuracy, a large dataset is required for training with a minimum or constant shifting variance [27,28]. Existing pooling and augmentation techniques are popular for enlarging datasets and producing better forecasting results than their peers. However, the pooling strategy [16] has difficulty in delivering better forecasting results because of the large number of complex load series. Complex load series contain an abundance of tail information in their probability distribution functions (PDFs) because there may be higher load demand for a few hours in any given day. The data augmentation technique [17] has difficulty enlarging the appropriate dataset because of its shortcomings in generating a large number of similar residual load series. This makes it difficult to achieve a high forecasting accuracy for a single household; improving the performance continues to be a prerequisite. Hence, exploring multiple household load series for simultaneous load prediction is another approach to improve the forecasting accuracy of existing models.
This paper proposes a novel day-ahead multihousehold forecasting method based on a CNN architecture in which forecasting results are obtained simultaneously. The proposed method was tested on six different household groups containing 10, 20, 30, 50, 80, and 100 households with different load series from the database. Each load series was identified as unique by evaluating its collective moment measure (CMM). The CMM quantifies the hidden integrated statistics of the load series using moment parameter information, such as the mean and variance. The quantitative measurement of the moment parameters assists in learning the daily few-hour peak load from the load series of the household groups. The load series discovered based on the CMM index are more likely and placed in order for data augmentation. The data augmentation methodology for the ordered multihousehold load series maintains a minor or minimum shifting variance in the input dataset for the CNN. Although high load-demand hours are present each day in each load series, CMM-based ordering balances the entire uncertain peak load consumption, and the CNN can learn to predict accurate day-ahead load profiles for all input households. The simulation results showed a significant improvement of more than 8% and 3% in the forecasting accuracy in terms of the mean absolute percentage error (MAPE) over existing CNN methods with pooling [16] and augmentation [17], respectively.
The remainder of this paper is organized as follows. Section 2 discusses multihousehold load forecasting, the significance of the minimum shifting variance in the CNN, and reordering load series based on the tail PDF. Section 3 describes the estimation of the tail length, the data augmentation methodology for single-household load series, and the procedure of the proposed multihousehold forecasting algorithm. Section 4 presents the obtained multihousehold load forecasting simulation results and comparisons. Section 5 discuss the potential results and limitations of the paper. Finally, Section 6 concludes the paper.

2. Problem Description

2.1. Multihousehold Load Forecasting Based on CNN

The multihousehold load forecasting methodology predicts the number of day-ahead load profiles of different households simultaneously by sharing load-consuming characteristics from the nearest neighborhood. Many machine- and DL-based forecasting strategies, such as pooling [16] and k-nearest neighbor [29], can perform forecasting operations for single-household load series using the load-series data of other households. In comparison, a data augmentation strategy can predict all the household load series with a higher forecasting accuracy. Figure 1 shows the overall procedure of the proposed CNN-based multihousehold load forecasting methodology. The data augmentation method meets the data requirement of the CNN for multihousehold load forecasting, even though each household has a peak load consumption for a few hours in its daily load profiles.

2.2. Effectiveness of CNN with Low-Shifting Data

Shifting invariance is key to realizing a high prediction accuracy when using a CNN [30,31] and is useful for forecasting multihousehold load series. Shifting invariance means maintaining unbiased learning for small displacements in the input dataset [32], where the convolutional architecture is robust to learning the peak load difference in the input dataset. However, the CNN forecasting results vary significantly under high and low peak loads because of parameter sharing between the convolution layers and the partial effects from the pooling layers [28]. Hence, variance shifting should be minimized or set as a constant in the input dataset to achieve a high forecasting accuracy when using a CNN. To acknowledge the low shifting and evaluate the potential significant difference in the input series L = [ l 1,1 ,   l 1,2 , l 1,3 ,   l d , 1 ,   l d , t , l d , T , l D , 1 . l D , T ] , the average shifting variance δ ¯ s f is computed from n shifting variance δ s f , given by:
δ s f ( n ) = 1 F i = n F + 1 n l i l ¯ F 2  
δ ¯ s f = 1 D × T F n = F D × T δ ( n )  
where l i   is the hourly electricity consumption with domain of i = T d 1 + t , T represents the total hours in a day, t is the hour of the day d, D is the number of days, F is the specified length of the window series in l d to calculate the variance, and l ¯ F is the mean load consumption of the window series.

2.3. Reordering Load Series Based on Tail Length for Reducing Shifting Variance

The tail length of a load series of a single household can be explicitly observed when there is unusual load consumption. All the load series exhibited either long or short tails in their PDFs [33,34]. Figure 2 shows two different load series with their corresponding tail characteristics. Figure 2a,b shows the short-tail load series and the corresponding PDF; Figure 2c,d shows the long-tail load series and corresponding PDF, respectively. The main reason for the occurrence of either short- or long-tail PDFs in the load series is the high demand for a few hours on a daily basis.
For example, air conditioners, heaters, ovens, washing machines, and microwaves are higher load-consuming utilities and are mostly used for specific durations only. Other major causes for different peak loads are the type of day, number of family members in a household, working hours, and whether EVs are used or not. The same peak demand may not exist at the same hour on a daily basis. This leads to a high uncertainty in day-ahead load prediction and affects the forecasting accuracy.

3. Proposed Multihousehold Load Forecasting Method

3.1. Tail Length Estimation of the PDF of Electricity Load

Electricity demand data from households with uncertain peaks can result in a long tail in the PDF. In this study, the occurrence of these uncertain peaks was estimated using the CMM index. Statistical moment metrics, such as the mean and variance, can be used to extract the appropriate tail information from each load series and help identify the uniqueness and learn how to predict the uncertain daily few-hour peak loads for each load series.
The overall uncertainty in the PDF of the load-series data is an ensemble of moments from different types of daily load profiles. Let the daily load profile l d = l d , 1 ,   l d , 2 , l d , 3 ,   l d , t , . l d , T , where t represents the hourly load consumption on day d . The following statistical results can be obtained:
s d ,   1 = 1 T t = 1 T l d , t
s d ,   2 = 1 T t = 1 T l d , t s d ,   1 2
s d ,   3 = t = 1 T l d , t s d , 1 2 T  
s d ,   4 = 1 T t = 1 T l d , t s d ,   1 s d ,   3 3  
s d ,   5 = 1 T t = 1 T l d , t s d ,   1 s d ,   3 4
s d ,   6 = t = 2 T l d , t l d , t 1
where l d , t represents the single-household load consumption value at time t for day d, and s d , 1 ,   s d ,   2 ,   s d , 3 ,   s d , 4 ,   s d , 5 ,   and s d , 6   are the statistical moment measuring tools for day d, such as the mean, variance, standard deviation, skewness, kurtosis, and cumulative difference, respectively. The cumulative difference is a newly introduced metric that measures the degree to which the daily load profile contains the sum of the hourly differences in the daily load consumption. Each statistical moment measures vary and represent the shift in the uncertainty on a daily basis. Hence, the overall metric difference vector I d can be computed on a daily basis as follows:
I d = s d s d 1   d  
where s d = [ s d , 1 , s d , 2 , s d , 3 , s d , 4 , s d , 5 , s d , 6 ] represents the overall change in the uncertainty values between the load profiles for days d and d 1 .
The CMM can serve as a valuable tool for defining characteristics such as the long-tail PDF of a single-household load series. Consequently, the CMM vector I m = I 1 , I 2 , I 3 , ,   I d , ,   I D for a single household m can be obtained, where d represents D historical load profile days. Each load profile has a unique tail property, and the compositions of the different types of load profiles make each household load series unique. In this study, the L2 norm was used to calculate the overall magnitude of the CMM vector, which comprised the entire uncertainty in the load series. The comprehensive uncertainty presented in a CMM vector of the load series m is calculated as follows:
I m = d = 1 D I d 2  
where I m is the L2-norm magnitude value of the CMM vector. Each load series L m   can be rearranged based on the increasing values obtained from Equation (5).

3.2. Reordering Input Load Series

For each household group, each quantitative uncertainty measure I m in a single-household load series can be calculated as follows:
I = I 1 , I 2 , I 3 , . I m , . I N
where N is the total number of load series in the input household group. The difference between I m is measured by the integrated uncertainty between the two load series, and can help place each load series in the appropriate order. The proper placement of the load series may preserve the ordering of the shifting variance in the input load-series dataset. The lowest I m indicates that the m-th load series has the lowest number of uncertain load profiles, and its position is calculated as follows:
I m = argmin m I ( m )  
Equation (7) is applied for all households where N is replaced by N 1 each time. All the N × ( N 1 ) / 2 iterations are eventually performed, and the ordered CMM values are finalized. The load series are placed according to the lowest I m position given by Equation (7), and all the load series are placed in this order. The ordered dataset L is expressed as follows:
L = l 1,1 l 1,2 l 1 , d l 1 , D l 2,1 l 2,2 l 2 , d l 2 , D l m , 1 l m , 2 l m , d l m , D             l N , 1 l N , 2 l N , d l N , D T  
where L = L 1 , L 2 , L 3 , L m , L N T , and each L m = ( l m , 1 l m , 2 l m , d l m , D ) is the m-th-ordered single-household load series containing daily load profiles of D days.

3.3. Data Augmentation for Single-Household Load Series

The main purpose of using data augmentation in the residential sector is to learn new features from a single load series that is less volatile but still shows all the possible characteristics of the selected series [17]. Each L m is passed to the data-augmentation procedure, which creates L number of homogenous load series. Using multiple k-means clustering, each L m generates L different centroid load profiles C l m   that are less volatile and uncertain relative to each other. Each load profile l d m of L m is decomposed into its centroid load profile C l m and daily residual load profile r d m l as follows:
l m , d = C l m + r d m l
The identified C l m extract completes the seasonality of the load series, which is repetitive and easy to estimate using forecasting methods. By contrast, r l , d m comprises a large amount of noisy information that implicitly contains the tail PDF property of the load series and is difficult to estimate. With C l m , a new residual load series is generated that contains all the uncertain information of the load series, as given by:
R m l = L m C l m  
where R m l = r 1 m l , r 2 m l , r 3 m l , t h   r d m l , . r d m l comprises new residual load profiles r d m l generated by C l m for all input days d. The efficacy of R m l can be enhanced by maintaining k-input to the multiple k-means clustering. However, unwanted residual series are generated when the input k is greater than six. To explore only a homogenous residual load series and collect them in a single matrix R i n , the following equation was derived [17]:
R i n m = r l | l 1 L l = 1 L l  
where l is the Frobenius norm of R m l , which selects only less volatile residual series. Each residual series shows similar properties to the inputted m-th load series; when their corresponding C l m is added, an augmented load series L i n m is generated as follows:
L i n m = L m 1 , L m 2 , L m 3 , . L m l L m L  
where L m l = l l 1 m , l l , 2 m , l l , 3 m l l , d m . l l , D m represents the extracted l -th homogenous load series for household m. L i n m includes L × D load profiles with L number of augmented load series for household m. Figure 3 shows the extracted L homogenous load series L i n m   from the augmentation CNN method for the two tested single-household load series.

3.4. Proposed Multihousehold Load Forecasting

Many CNN architectures are available in pretrained versions for image classification, segmentation, localization, and object detection [22,23,24]. The CNN architecture passes the input data to different layers: (a) input, (b) convolution layer, (c) pooling, and (d) fully connected layers. However, there are no precise rules for designing a CNN architecture. All the proposed CNN-based architectures require a large amount of input data to ensure better accuracy.
To perform multihousehold load forecasting operations, the proposed CNN forecasting model was designed in a manner similar to the existing CNN architecture [19,20,21]. Let all the augmented multihousehold load series collected in order using Equation (12) be given by:
X   = L 1 1 , L 1 2 , L 1 3 , . L 1 l L 1 L L 2 1 , L 2 2 , L 2 3 , . L 2 l L 2 L L 3 1 , L 3 2 , L 3 3 , . L 3 l L 3 L L m 1 , L m 2 , L m 3 , . L m l L m L L N 1 , L N 2 , L N 3 , . L N l L N L
where X   contains N × L × D load profiles with N input load series, L is the count of the augmented series, and D units of daily load profiles are presented in the load series.
The newly generated homogenous load series was treated as an image-like input and was convolutionally operated using CNN filters. Although CNN filters are resistant to learning over noisy information, X   provides the necessary and sufficient information to predict the high demand for a few hours. The convolved data are passed to the pooling layer, which is vital for selecting the maximum pointing information to update the weight and bias in the CNN layers. The ending of the CNN layers is attached to a fully connected layer that receives all the higher-dimensional learning weights in a one-dimensional space. Eventually, this layer performs the forecasting operation for a multihousehold load in one time-step. To perform the training operation, the proposed model provides multihousehold load profiles Y ^ t , which receive the actual multihousehold load profiles Y t . The two profiles can be expressed as:
Y ^ t = l ^ 1 , D ,   1 l ^ 1 , D ,   2 l ^ 1 , D ,   t l ^ 1 , D ,   T l ^ 2 , D ,   1 l ^ 2 , D ,   2 l ^ 2 , D ,   t l ^ 2 , D ,   T       l ^ m , D ,   1 l ^ m , D ,   2 l ^ m , D ,   t l ^ m , D ,   T       l ^ N , D ,   1 l ^ N , D ,   2 l ^ N , D ,   t l ^ N , D ,   T
Y t = l 1 , D ,   1 l 1 , D ,   2 l 1 , D ,   t l 1 , D ,   T l 2 , D ,   1 l 2 , D ,   2 l 2 , D ,   t l 2 , D ,   T       l m , D ,   1 l m , D ,   2 l m , D ,   t l m , D ,   T       l N , D ,   1 l N , D ,   2 l N , D ,   t l N , D ,   T
where l D + 1 ,   t m and l ^ D + 1 ,   t m are the actual and predicted hourly load consumptions at time t. The optimizing equation developed for the proposed multihousehold load forecasting algorithm can be written as:
M i n i m i z e j   m = 1 N t = 1 T l m , D ,   t   l ^ m , D ,   t   2
where j denotes the number of predefined training epochs in the proposed CNN model.
Figure 4 shows the overall block diagram of the proposed multihousehold load forecasting method. The peak consumed loads in each load series generated a significant difference between the load profiles. The reordering of each load series based on the CMM was effective in maintaining a low shifting variance in the input multihousehold dataset. Maintaining this variance is useful for achieving a high forecasting accuracy for the multiple load series, where every household forecasting result within a different household group can be obtained simultaneously. The N-load series is maintained by ordering based on the tail PDFs for multihousehold load forecasting. With each ordered load series as the input, an augmented load series was generated and attached by maintaining the minimum shifting variance in the dataset. Consequently, the consecutive household load series from the ordering process presented the minimum difference in its tail characteristics.
Figure 5 shows the flowchart of the proposed methodology for multihousehold load forecasting, comprising three stages: (a) CMM load-series ordering, (b) data augmentation, and (c) the CNN framework. In the first stage, multiple load series are ordered using the lowest CMM values. Data augmentation is applied in the second stage to maintain a constant or minimum shifting variance in the input dataset. Finally, in the third stage, N load profiles are predicted for N multiple household load series using the CNN hyperparameters.

4. Simulation Results

4.1. Data Description and CNN Architecture Modeling

A large amount of multiload series data was collected from 1181 residential households with AMI installed, located in Seoul, South Korea, for the experiment. The AMI recording began in September 2016 and ended in August 2017.
Figure 6 shows the model of the proposed CNN architecture for the prepared dataset containing an input N household load series. The training dataset contained each load series data up to day D , whereas the testing dataset contained only multiload profiles of the next day, D + 1 .
Table 1 lists the hyperparameters of the tested DL framework for performing multihousehold forecasting operations. The program was written in Python using the Keras and TensorFlow libraries [35,36,37]. The program was tested on an i7 CPU-based computer, and numerous forecasting results were obtained using Google Colab, which is a GPU-based cloud application. Three core DL architectures, namely CNN, LSTM, and CNN–LSTM, were employed at the start of the proposed method to determine which DL benchmark produced the best forecasting results.

4.2. Performance Evaluation Metrics

To generalize the proposed forecasting methodology, each input dataset containing 10, 20, 50, 80, and 100 household load series was used for performance evaluation. To compute the load forecasting results for the m-th household in each multiple-household group, the following performance metrics were used for each DL methodology for comparison:
M A P E m = 1 T t = 1 T l D + 1 ,   t m l ^ D + 1 ,   t m l D + 1 ,   t m × 100 % m
R M S E m = 1 T t = 1 T l D + 1 ,   t m l ^ D + 1 ,   t m 2 m  
M A P E a v g = 1 N m = 1 N M A P E m  
R M S E a v g = 1 N m = 1 N R M S E m  
where l D + 1 ,   t m and   l ^ D + 1 ,   t m are the actual and predicted load consumptions of the next day D + 1 for hour t; M A P E m and R M S E m are the MAPE and root-mean-square error (RMSE) for testing household m. M A P E a v g and R M S E a v g represent the average MAPE and RMSE results, respectively, to generalize the forecasting results of the proposed multihousehold methodology.

4.3. Single-Household Load Forecasting

To evaluate the performance results for an individual household, the first ten household load series from the first group were used for forecasting using the CNN, LSTM, and CNN–LSTM methodologies. The input dataset was ordered, augmented, and prepared using the proposed methodology. Figure 7 shows the actual and predicted daily load profiles of the nine tested load series for the next day D + 1. In Figure 7, the forecasting results of the tested load profiles are arranged from lower-tail PDF load series to the longer-tail PDF load series. The CNN produced the most accurate prediction results, as shown in Figure 7g–i, which had a longer-tail PDF. Hence, the proposed CNN architecture predicted accurate results for the peak load consumption, despite the unusually high load demand observed for a few hours in a day.
Table 2 presents the forecasting MAPE and RMSE results for the ten households. Among the ten load series, the CNN outperformed for six of them that had longer tail characteristics in their PDFs. In terms of the average forecasting results, the CNN enhanced the performance results, with the forecasting M A P E a v g being higher than 2.3% and forecasting R M S E a v g being higher than 0.032 kWh compared with the LSTM and CNN–LSTM algorithms, respectively.

4.4. Average Single-Household Load Forecasting

The average single-household load forecasting results are effective for generalizing the performance of the proposed multihousehold load forecasting model. Figure 8 shows the box-plot results of the average forecasting performance result of four different household groups for a week. Each box plot represents the average MAPE results from the household groups containing 30, 50, 80, and 100 household load series. The box plot reveals that the performance of the CNN varies much less than those of LSTM and CNN–LSTM, despite the increase in the number of load series.
Table 3 shows the average MAPE performance results of all the multihousehold groups in July 2017. The month of July was considered for the observation because it contained mostly long-tail PDF load profiles in the load series compared with the other months in South Korea. Each result can be generalized to each household in the household group. As a result, the CNN-based model demonstrated a better performance, with the forecasting MAPE being higher than 3.5% and the forecasting RMSE being higher than 0.035 kWh for the general case.

4.5. Comparison with Pooled and Data Augmentation Methodologies

Pooled and data augmentation methodologies are efficient forecasting strategies for the electricity load of a single household. Therefore, they were compared with the proposed methodology for the same amount of input load series. For a robust comparison, four weeks from each season were selected to evaluate the forecasting performance of the 20, 50, 80, and 100 household load series. Table 4 presents a comparison of the forecasting performances of the proposed methodology, pooled CNN, and augmented CNN in terms of M A P E a v g and R M S E a v g . The proposed methodology shows noteworthy improvements of more than 8% and 0.108 kWh in M A P E a v g and R M S E a v g , respectively. These results are significant for single-household day-ahead forecasting.

4.6. Augmentation vs. Ordered Strategies

Because the proposed method uses both augmentation techniques and ordered strategies, its forecasting evaluations were studied separately. Both the methods were found to be suitable for multihousehold load forecasting. Table 5 presents the average performance results in terms of M A P E a v g and R M S E a v g for the augmented, ordered, and proposed methods. The comparison shows that creating a hybrid configuration based on augmentation and reordering using the CMM for the CNN brings about an improvement of more than 2% in the MAPE. Hence, both data enlargement from augmentation and ordering based on the tail PDF of multiple load series are useful for producing robust forecasting results of the multihousehold load.

5. Discussion

In this study, we developed a methodology for identifying synergies in energy consumption patterns between households and presented simulation results demonstrating the effectiveness of the proposed approach.
As each household’s electricity consumption differed from others’, especially in peak load values for a few hours in a day, the difference was explicitly illustrated when analyzing the tail PDF of the load series. The statistics-based CMM index was able to order the load series by computing the overall moment measure of the PDF generated from the load series. The electricity demand data used consisted of data from groups of households with ensured similarity. The ordered dataset was enlarged by using a data augmentation methodology that is effective in fulfilling the data requirements of the CNN. As a result, the proposed approach showed a significant improvement of more than 8% and 3% in forecasting accuracy in terms of the MAPE over existing CNN methods for the typical individual household.
Despite efforts to deliver higher forecasting results, the proposed model exhibits some unaddressed aspects and limitations. While the proposed method demonstrated utility in a case study involving 100 households over a one-year period, it is limited by the availability of only one year’s worth of data. Implementing the proposed methodology requires access to comprehensive data, which may not always be possible. In cases where access is restricted or data availability is limited, applying the proposed approach to groups of households with slightly different characteristics remains unexplored territory. Moreover, alternative methods for identifying synergies could yield better results. Exploring these methods could enhance the robustness of the proposed approach.
In conclusion, the performance of small-scale electricity demand prediction could be improved by aligning input data and adjusting the learning sequence with CMM, along with the combination of data augmentation. However, the underlying principles behind the synergy of this combination have not been analyzed. This study did not explore alternative methods for data augmentation techniques for better synergy with rearranging input data; therefore, further research in this area will be necessary in the future.

6. Conclusions

Residential load series contain granular load consumption values. Therefore, their performance results are comparatively higher, particularly in terms of the MAPE. Existing DL-based forecasting methodologies for load series are often inefficient due to the high load demand observed for a few hours in a given day, which is explicitly depicted as a long tail in the PDF of the load series. A multihousehold forecasting method based on load-series ordering, data augmentation, and CNN seems to be most suitable.
The main results of this study can be summarized as follows:
  • Multihousehold forecasting research is significant for residential living areas such as apartment blocks, community buildings, or even small towns where each household’s electricity consumption is different from others’;
  • The electricity consumption characteristics of each household are explicitly demonstrated in the load series PDF. Based on tail information in each load series’ PDF, CMM ordering was developed;
  • The proposed multihousehold load forecasting technique uses the CMM to index the order of the load series. The CMM produces an ordered load series by computing the overall moment measure of the PDF generated from the load series;
  • The ordered multiple series maintain a minimum or constant shifting variance in the load-series dataset inputted to the CNN. A data augmentation method is used for each load series of the input multihousehold group to solve the existing data requirement issues of the CNN;
  • A comparison of the average forecasting results for single households helped validate the efficacy of the proposed method;
  • A higher load forecasting accuracy for both single and multiple households could be realized by utilizing the CMM, data augmentation, and CNN, which solves the peak load demand issue encountered in the residential sector.
In this manuscript, we have discussed multihousehold forecasting using moment information and data augmentation. In the future, we would like to explore deep statistical methodologies and incremental learning for the multihousehold electricity consumptions.

Author Contributions

S.K.A. developed the augmentation strategies for the CNN-based forecasting framework, performed the simulation operations, and wrote the paper. H.Y. and Y.-M.W. analyzed the simulation results and commented on the manuscript. J.L. provided guidance for research and revised the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2021R1C1C1012408).

Data Availability Statement

The data presented in this study are available upon request from the corresponding author. The data are not publicly available due to the conduct of further research based on some of the data in this article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Box, E.P.G.; Jenkins, G.M.; Gwilym, R.C.G.; Ljung, G.M. Time Series Analysis Forecasting and Control, 5th ed.; John Wiley & Sons: Hoboken, NJ, USA, 2015. [Google Scholar]
  2. Gram-Hanssen, K. Standby consumption in households analyzed with a practice theory approach. J. Ind. Ecol. 2010, 14, 150–165. [Google Scholar] [CrossRef]
  3. Richardson, I.; Thomson, M.; Infield, D.; Clifford, C. Domestic electricity use: A high resolution energy demand model. Energy Build. 2010, 42, 1878–1887. [Google Scholar] [CrossRef]
  4. Stephen, B.; Tang, X.; Harvey, P.R.; Galloway, S.; Jennett, K.I. Incorporating Practice Theory in Sub-Profile Models for Short term aggregated Residential Load Forecasting. IEEE Trans. Smart Grid 2017, 8, 1591–1598. [Google Scholar] [CrossRef]
  5. Abbas, F.; Feng, D.; Habib, S.; Rahman, U.; Rasool, A.; Yan, Z. Short term residential load forecasting: An improved optimal nonlinear auto regressive (NARX) method with exponential weight decay function. Electronics 2018, 7, 432. [Google Scholar] [CrossRef]
  6. Welikala, S.; Dinesh, C.; Ekanayake, M.P.B.; Godaliyadda, R.I.; Ekanayake, J. Incorporating appliance usage patterns for non-Intursive load monitoring and load forecasting. IEEE Trans. Smart Grid 2017, 10, 448–461. [Google Scholar] [CrossRef]
  7. Cheng, C.; Sa-Ngasoongsong, A.; Beyca, O.; Le, T.; Yang, H.; Kong, Z.J.; Bukkapatnam, S.T.S. Time series forecasting for nonlinear and non-stationary processes: A review and comparative study. IIE Trans. 2015, 47, 1053–1071. [Google Scholar] [CrossRef]
  8. Xie, G.; Chen, X.; Weng, Y. An integrated Gaussian process modelling framework for residential load prediction. IEEE Trans. Power Syst. 2018, 33, 7238–7248. [Google Scholar] [CrossRef]
  9. Ghofrani, M.; Ghayekhloo, M.; Arabali, A.; Ghayekhloo, A. A hybrid short-term load forecasting with a new input selection framework. Energy 2015, 81, 777–786. [Google Scholar] [CrossRef]
  10. Wang, Y.; Chen, Q.; Gan, D.; Yang, J.; Kirschen, D.S.; Kang, C. Deep learning-based socio-demographic information identification from smart meter data. IEEE Trans. Smart Grid 2018, 10, 2593–2602. [Google Scholar] [CrossRef]
  11. Zhang, Y.; Chen, W.; Xu, R.; Black, J. A cluster-based method for calculating baselines for residential loads. IEEE Trans. Smart Grid 2015, 7, 2368–2377. [Google Scholar] [CrossRef]
  12. Zhong, S.; Tam, K.S. Hierarchical classification of load Profies based on their characteristics attributes in Frequecy domain. IEEE Trans. Power Syst. 2015, 30, 1–8. [Google Scholar] [CrossRef]
  13. Quilumba, F.L.; Lee, W.J.; Huang, H.; Wang, D.Y.; Szabados, R.L. Using smart meter data to improve the accuracy of intraday load forecasting considering customer behavior similarities. IEEE Trans. Smart Grid 2014, 6, 911–918. [Google Scholar] [CrossRef]
  14. Cecati, C.; Kolbusz, J.; Rozycki, P.; Siano, P.; Wilamowski, B.M. A novel RBP training algorithm for Shor-term electric load forecasting and comparative studies. IEEE Trans. Ind. Electron. 2015, 62, 6519–6529. [Google Scholar] [CrossRef]
  15. Ding, N.; Benoit, C.; Foggia, G.; Besanger, Y.; Wurtz, F. Neural network-based model design for short-term load forecast in distributed systems. IEEE Trans. Power Syst. 2016, 31, 72–81. [Google Scholar] [CrossRef]
  16. Shi, H.; Xu, M.; Li, R. Deep learning for household load forecasting-A novel pooling deep RNN. IEEE Trans. Smart Grid 2018, 9, 5271–5280. [Google Scholar] [CrossRef]
  17. Acharya, S.K.; Wi, Y.-M.; Lee, J. Short-term load forecasting for a single household based on convolution neural networks using data augmentation. Energies 2019, 12, 3560. [Google Scholar] [CrossRef]
  18. Claessencs, B.J.; Varancx, P.; Ruelens, F. Convolution Neural Network for Automatic State-Time Feature Extraction in Reinforcement Learning Applied to Residential Load Control. Trans. Smart Grid 2018, 9, 3259–3269. [Google Scholar] [CrossRef]
  19. Kong, W.; Dong, Z.Y.; Jia, Y.; Hill, D.J.; Xu, Y.; Zhang, Y. Short-Term Residential Load Forecasting based on LSTM Recurrent Neural Network. IEEE Trans. Smart Grid 2017, 10, 841–851. [Google Scholar] [CrossRef]
  20. Bessani, M.; Massignan, J.A.; Santos, T.M.; London, J.B., Jr.; Maciel, C.D. Multiple households very short-term load forecasting using bayesian networks. Electr. Power Syst. Res. 2020, 189, 106733. [Google Scholar] [CrossRef]
  21. Jiang, L.; Wang, X.; Li, W.; Wang, L.; Yin, X.; Jia, L. Hybrid Multitask Multi-Information Fusion Deep Learning for Household Short-Term Load Forecasting. IEEE Trans. Smart Grid 2021, 12, 5362–5372. [Google Scholar] [CrossRef]
  22. LeCun, Y.; Bengio, Y. Convolution Neural Networks for Images, Speech and Time Series; MIT Press: Cambridge, MA, USA, 1988. [Google Scholar]
  23. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. In Proceedings of the International Conference on Neural Information Processing Systems, Stateline, NV, USA, 3–6 December 2012; pp. 1097–1105. [Google Scholar]
  24. Pedregosa, F.; Varoquax, G.; Gramfort, A. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  25. Badeau, R.; Plumbley, M.D. Multichannel HR-NMF for modelling convolute mixtures of nonstationary signals in the time-frequency domain. In Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, USA, 20–23 October 2013; Volume 2013, pp. 1–4. [Google Scholar] [CrossRef]
  26. Jeong, J.; Kim, H. Multi-site photovoltaic forecasting exploiting space-time convolutional neural network. Energies 2019, 12, 4490. [Google Scholar] [CrossRef]
  27. Chen, W.; Xie, D.; Zhang, Y.; Pu, S. All you need is a few shifts: Designing efficient convolutional neural networks for image classification. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 15–20 June 2019; pp. 7234–7243. [Google Scholar] [CrossRef]
  28. Chaman, A.; Dokmanic, I. Truly shift-invariant convolutional neural networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Nashville, TN, USA, 20–25 June 2021; pp. 3772–3782. [Google Scholar] [CrossRef]
  29. Fan, G.-F.; Guo, Y.-H.; Zheng, J.-M.; Hong, W.-C. Application of the weighted K-nearest neighbor algorithm for short-term load forecasting. Energies 2019, 12, 916. [Google Scholar] [CrossRef]
  30. Ioffe, X.; Szegedy, C. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. arXiv 2015, arXiv:1502.03167. [Google Scholar]
  31. Huang, Q.; Li, J.; Zhu, M. An Improved convolution neural network with load range discretization for probabilistic load forecasting. Energy 2020, 203, 117902. [Google Scholar] [CrossRef]
  32. Zhang, R. Making convolutional networks shift-invariant again. In Proceedings of the International Conference on Machine Learning, Long Beach, CA, USA, 9–15 June 2019; pp. 7324–7334. [Google Scholar]
  33. Kwac, J.; Flora, J.; Rajagopal, R. Household energy consumption segmentation using hourly data. IEEE Trans. Smart Grid 2015, 5, 420–430. [Google Scholar] [CrossRef]
  34. Marini, L.; Gutierrez-Polo, I.; Kooper, R.; Satheesan, S.P.; Burnette, M.; Lee, J.; Nicholson, T.; Zhao, Y.; McHenry, K.; Zhao, Y.; et al. Clowder: Open source data management for long tail data. In Proceedings of the Practice and Experience on Advanced Research Computing, Pittsburgh, PA, USA, 22–26 July 2018; pp. 1–8. [Google Scholar] [CrossRef]
  35. Tensor Flow: Large Scale Machine Learning on Heterogeneous Distrusted Systems. arXiv 2016, arXiv:1603.04467.
  36. Acharya, S.K.; Wi, Y.-M.; Lee, J. Weather data mixing models for day-ahead PV forecasting in small-scale PV plants. Energies 2021, 14, 2998. [Google Scholar] [CrossRef]
  37. Pattanayek, S. Pro Deep Learning with TensorFlow; Apress: Bangalore, India, 2017. [Google Scholar]
Figure 1. CNN-based multihousehold load forecasting methodology using data augmentation.
Figure 1. CNN-based multihousehold load forecasting methodology using data augmentation.
Energies 17 00902 g001
Figure 2. Single-household load series and their corresponding probability distribution functions (PDFs): (a) Short-tail load series, (b) PDF of the short-tail load series, (c) long-tail load series, and (d) PDF of the long-tail load series.
Figure 2. Single-household load series and their corresponding probability distribution functions (PDFs): (a) Short-tail load series, (b) PDF of the short-tail load series, (c) long-tail load series, and (d) PDF of the long-tail load series.
Energies 17 00902 g002
Figure 3. Homogenous load series obtained from the augmentation technique: (a) Household load series 1 with a short-tail PDF, (b) household load series 2 having a long-tail PDF.
Figure 3. Homogenous load series obtained from the augmentation technique: (a) Household load series 1 with a short-tail PDF, (b) household load series 2 having a long-tail PDF.
Energies 17 00902 g003
Figure 4. Block diagram of the proposed multihousehold load forecasting method.
Figure 4. Block diagram of the proposed multihousehold load forecasting method.
Energies 17 00902 g004
Figure 5. Flowchart of the proposed multihousehold day-ahead load forecasting method.
Figure 5. Flowchart of the proposed multihousehold day-ahead load forecasting method.
Energies 17 00902 g005
Figure 6. Modeling of the proposed CNN-based architecture for multihousehold load forecasting.
Figure 6. Modeling of the proposed CNN-based architecture for multihousehold load forecasting.
Energies 17 00902 g006
Figure 7. Predicted next-day load profiles of nine different households from the first household group: (a) ordered short-tail household 1, (b) ordered short-tail household 2, (c) ordered short-tail household 3, (d) ordered medium-tail household 4, (e) ordered medium-tail household 5, (f) ordered medium tail household 6, (g) ordered long-tail household 7, (h) ordered long-tail household 8, and (i) ordered long-tail household 9.
Figure 7. Predicted next-day load profiles of nine different households from the first household group: (a) ordered short-tail household 1, (b) ordered short-tail household 2, (c) ordered short-tail household 3, (d) ordered medium-tail household 4, (e) ordered medium-tail household 5, (f) ordered medium tail household 6, (g) ordered long-tail household 7, (h) ordered long-tail household 8, and (i) ordered long-tail household 9.
Energies 17 00902 g007
Figure 8. Box-plot results of the average MAPE for the different household groups: (a) 30 households, (b) 50 households, (c) 80 households, and (d) 100 households.
Figure 8. Box-plot results of the average MAPE for the different household groups: (a) 30 households, (b) 50 households, (c) 80 households, and (d) 100 households.
Energies 17 00902 g008
Table 1. Hyper-parameter modeling for the tested core DL framework.
Table 1. Hyper-parameter modeling for the tested core DL framework.
MethodHidden LayersHidden Nodes/FiltersNo. of Epochs(Iteration)Activation
Function
Optimizer
CNN296150ReLURMSprop
LSTM272300Sigmoid, tanhRMSprop
CNN–LSTM296150ReLURMSprop
Table 2. Comparison of the forecasting performance in terms of the MAPE (%) and RMSE (kWh) for the first household group.
Table 2. Comparison of the forecasting performance in terms of the MAPE (%) and RMSE (kWh) for the first household group.
HouseholdMAPE (%)RMSE (kWh)
LSTMCNNCNN–LSTMLSTMCNNCNN–LSTM
118.4522.5523.180.0710.0930.090
217.4316.4918.320.1060.1050.122
321.1127.7227.140.0780.1040.103
432.1823.6726.100.1090.1080.142
532.8120.6628.100.1640.1360.162
618.5626.1025.740.1250.1550.152
745.1032.8237.850.1300.1110.116
829.3727.5328.230.1370.1350.179
935.2731.1532.060.2560.2080.213
1022.7922.6322.830.2080.2070.208
Table 3. Average forecasting performance results in terms of the MAPE (%) and RMSE (kWh) for the different multi-household groups in July 2017.
Table 3. Average forecasting performance results in terms of the MAPE (%) and RMSE (kWh) for the different multi-household groups in July 2017.
Multihousehold GroupsMAPE (%)RMSE (kWh)
LSTMCNNCNN–LSTMLSTMCNNCNN–LSTM
N = 1025.6422.3326.950.1450.1370.147
N = 2031.8327.3230.270.1510.1570.177
N = 3034.4927.6730.370.1760.1480.161
N = 5037.7431.1635.690.2370.1720.209
N = 8036.7528.3934.820.2020.1550.201
N = 10038.3632.4336.160.1710.1310.162
Table 4. Comparison of the average forecasting MAPE (%) and RMSE (kWh) performance results between existing and proposed forecasting methodologies for all household groups across all seasons in 2017.
Table 4. Comparison of the average forecasting MAPE (%) and RMSE (kWh) performance results between existing and proposed forecasting methodologies for all household groups across all seasons in 2017.
Multihousehold GroupsSeasonMAPE (%)RMSE (kWh)
Pooled CNNAugmented CNNProposedPooled CNNAugmented CNNProposed
N = 20Spring36.2329.2825.460.2360.1980.122
Summer39.2531.2628.130.2450.2040.111
Autumn41.2535.3627.320.2750.2070.157
Winter42.6333.2326.630.2770.2130.118
Average39.8432.2826.820.2580.2050.127
N = 50Spring34.2329.1223.380.2440.1910.092
Summer36.1530.9327.920.2510.2010.107
Autumn39.3733.7231.160.2970.2180.172
Winter38.4229.8223.690.2810.2090.095
Average37.0430.8926.530.2680.2040.116
N = 80Spring33.1528.1222.260.2560.1810.075
Summer36.5233.8528.390.2740.1930.155
Autumn35.2630.0925.150.2690.2120.095
Winter34.2927.0622.150.2640.1950.088
Average34.8029.7824.480.2650.1950.103
N = 100Spring32.2529.6527.750.2330.1710.082
Summer34.6831.1727.990.2410.1820.081
Autumn36.1633.4631.110.2780.1960.121
Winter35.1731.1628.300.2710.1930.098
Average34.5631.3628.780.2550.1850.095
Table 5. Comparison of the forecasting performance results in terms of MAPE (%) and RMSE (kWh) between augmentation, ordered, and proposed methodologies.
Table 5. Comparison of the forecasting performance results in terms of MAPE (%) and RMSE (kWh) between augmentation, ordered, and proposed methodologies.
MethodMAPE (%)RMSE (kWh)
N = 20N = 50N = 80N = 100N = 20N = 50N = 80N = 100
Augmentation30.0928.9226.1529.310.2170.1170.0960.087
Only ordered33.2532.4831.3634.210.1840.1640.1440.126
Proposed28.1327.1624.2926.990.1110.1100.0950.082
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Acharya, S.K.; Yu, H.; Wi, Y.-M.; Lee, J. Multihousehold Load Forecasting Based on a Convolutional Neural Network Using Moment Information and Data Augmentation. Energies 2024, 17, 902. https://doi.org/10.3390/en17040902

AMA Style

Acharya SK, Yu H, Wi Y-M, Lee J. Multihousehold Load Forecasting Based on a Convolutional Neural Network Using Moment Information and Data Augmentation. Energies. 2024; 17(4):902. https://doi.org/10.3390/en17040902

Chicago/Turabian Style

Acharya, Shree Krishna, Hwanuk Yu, Young-Min Wi, and Jaehee Lee. 2024. "Multihousehold Load Forecasting Based on a Convolutional Neural Network Using Moment Information and Data Augmentation" Energies 17, no. 4: 902. https://doi.org/10.3390/en17040902

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop