Next Article in Journal
Recent Progress in Metal Borohydrides for Hydrogen Storage
Previous Article in Journal
New Approaches for the Production of Hydrocarbons from Hydrate Bearing Sediments
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Short-Term Load Forecasting for Electric Power Systems Using the PSO-SVR and FCM Clustering Techniques

1
State Key Laboratory of Power Transmission Equipment & System Security and New Technology, Chongqing University, Chongqing, 400030, China
2
School of Automation Engineering, Chongqing University, 400030, China
3
Chongqing Tongnan Electric Power Company, Chongqing, 402660, China
*
Author to whom correspondence should be addressed.
Energies 2011, 4(1), 173-184; https://doi.org/10.3390/en4010173
Submission received: 14 November 2010 / Revised: 13 December 2010 / Accepted: 5 January 2011 / Published: 20 January 2011

Abstract

:
This paper presents a new combined method for the short-term load forecasting of electric power systems based on the Fuzzy c-means (FCM) clustering, particle swarm optimization (PSO) and support vector regression (SVR) techniques. The training samples used in this method are of the same data type as the learning samples in the forecasting process and selected by a fuzzy clustering technique according to the degree of similarity of the input samples considering the periodic characteristics of the load. PSO is applied to optimize the model parameters. The complicated nonlinear relationships between the factors influencing the load and the load forecasting can be regressed using the SVR. The practical load data from a city in Chongqing was used to illustrate the proposed method, and the results indicate that the proposed method can obtain higher accuracy compared with the traditional method, and is effective for forecasting the short-term load of power systems.

1. Introduction

Short-term load forecasting (STLF) aims at predicting the system load over a short time period like a day or a week, and plays an important role in power system operation. Many load forecasting approaches have been reported in the last decade, such as conventional smoothing technique, regression method, statistical analysis, time series analysis, autoregressive moving average model, and expert system [1,2]. Although these techniques and models are reliable and useful, they are not suitable for analyzing unusual weather conditions and varied holiday activities, which usually result in a daily load with highly non-linear characteristics.
Load forecasting for any given day is a difficult task, because it depends not only on the load of the previous days, but also on that on the same day in the previous weeks and even the previous years [3]. Furthermore, it is difficult to model the relationship between the load and the mentioned external factors influencing the load demand, such as weather variations, holiday activities, etc. These are the major factors making the modeling process more complicated. Because the model parameters are decided according to the historical data, some errors may be introduced [4].
The time series methods treat the load pattern as a time series signal with known period, and offer a rough prediction of the load for the given season. The techniques used in time series methods are as follows [5,6]: (a) Kalman Filter method; (b) Box Jenkins method; (c) Regression processes; (d) Spectral expansion technique. The spectral expansion technique utilizes the Fourier series. Generally, these techniques use a large number of complex relationships and require long computation times.
An intelligent system exhibits intelligence in capturing and processing information, and employs artificial intelligence techniques to fulfill some or all of its computational requirements. Artificial Neural Networks (ANNs) are capable of performing non-linear modeling and adaptation. In addition, they do not require a functional relationship between load and weather variables in advance. An expert system has the ability to act as a knowledge expert, therefore the load-forecast model can be built using the knowledge about the load-forecast domain provided by an expert in the field [7,8,9].
Recently methods load forecasting based on the load similarity were reported, which forecast the load curve by using the load information of the days with weather conditions similar to the objective day. In these methods several similar days are selected to improve the accuracy of load forecasting, which can not only deal with the non-linear relationship of the loads, but also the influence caused by weekends or special days. However the weights of different influencing factor cannot reflect the actual degree of influence due to the lack of adequate cognition of these factors.
In order to solve the problems mentioned above, this paper presents a novel preprocessing procedure based on the PSO-SVR and the FCM clustering techniques, which can improve the precision of daily load forecasting. Practical load data from a city in Chongqing, China, was used to demonstrate the proposed techniques, and the results indicate that the proposed method is effective for forecasting the short-term power system load.
The remaining parts of the paper are arranged as follows: Section 2 introduces the support vector regression based on PSO. Section 3 describes the short-term load forecasting process using the PSO-SVR and FCM clustering techniques, and an analysis of a practical case study follows in Section 4. Finally the conclusions will come at Section 5.

2. Support Vector Regression Base on PSO

Particle swarm optimization (PSO) was introduced by Eberhart and Kennedy in 1995 [10]. A PSO has two primary operators: a velocity update operator and a position update operator. During the convergence process, each particle is accelerated toward the particles with previous best position and the global best position. A new velocity for each particle is calculated in an iteration based on its current velocity and the distances from its previous best position and the global best position. The new velocity is then used to calculate the next position of the particle in the search space. This process is repeated until a pre-specified minimum error is met. The updated velocity and position are given by:
{ v i d k + 1 = w × v i d k + c 1 × r 1 × ( p i d x i d k ) + c 2 × r 2 × ( p g d x i d k ) x i d k + 1 = v i d k + 1 + x i d k
Support vector regression (SVR) [11] is a regression method based on the SVM method. Different from SVM, SVR tries to find a hyper-plane which can accurately predict the distribution of information, but not the plane on which to classify the data. However, the prerequisite for SVR to achieve the better results is to find three appropriate parameters: the kernel function K, the balance factor C and the insensitive loss function ε, which are critical for achieving high accuracy and generalization of the regression model.
For a training sample set:
T = { ( x i , y i ) } i = 1 l
xiRm is the input vector, yiR is the corresponding objective output and l is the number of training samples.
Using a nonlinear mapping ϕ ( x ) RmRM, the SVR maps a low dimensional input space to a high dimensional feature space, then it regresses linearly in a high dimensional feature space, which can demonstrate the influence of nonlinear regression in original input space. The general function of a decision linear regression is:
f ( x ) = ( w ϕ ( x ) ) + b
where w is a weight vector, and b is a threshold value. The parameters of the optimum regression in Equation (2) are solved using (3) to minimize the structural risk R:
min R = 1 2 w 2 + C R e m p
where:
R e m p = 1 l i = 1 l ( L e ( x i , y i f ( x i ) ) )
L e ( x , y f ( x ) ) = max { 0 , | y f ( x ) | ε }
The item ||w||2/2 in (3) is the expressing ability of the complexity of the decision function; the item Remp is the training error, and C is the balance factor which indicates the degree of matching of the two parts above. Experience risk Remp is measured by the loss function, which usually adopts the insensitive loss function L e ( x , y f ( x ) ) .
By introducing the slack variables ξ i and ξ i ( i = 1 , 2 , ... , l ), Equation (3) can be rewritten as:
min 1 2 w 2 + C l i = 1 l ( ξ i + ξ i )
St : { ( w ϕ ( x i ) ) + b y i ε + ξ y i ( w ϕ ( x i ) ) b ε + ξ ε i , ε i 0 ( i = 1 , 2 , ... , l )
Equation (6) is called the primitive problem of SVR. The primitive problem is actually a problem with linear constraints.
Introducing the Lagrange multipliers to the Lagrange function, the partial derivative of the Lagrange function with respect to w , b , ξ i , ξ i can be obtained, and its dual problem is given by:
min 1 2 i = 1 l ( α i α i ) ( α j α j ) K ( x i , x j ) + ε i = 1 l ( α i + α i ) i = 1 l y i ( α i α i )
St : { i = 1 l ( α i α i ) = 0 0 α i α i C l ( i = 1 , 2 , ... , l )
St : { b ¯ = y j i = 1 l ( α ¯ i α ¯ i ) ( x i , x j ) + ε i f   α ¯ j ( 0 , C l ) b ¯ = y j i = 1 l ( α ¯ i α ¯ i ) ( x i , x j ) ε f   α ¯ j ( 0 , C l )
where K ( x i , x j ) = ( ϕ ( x i ) ϕ ( x j ) ) is a kernel function. The kernel function is used to obtain the decision-making regressive equation without knowing the definite form of nonlinear mapping ϕ ( x ) from the low dimensional input space to high dimensional feature space.
Assuming that α ¯ = ( α ¯ 1 , α ¯ 1 , ... , α ¯ l , α ¯ l ) T is the solution of dual problem, there is at least a zero in a pair of ( α ¯ i , α ¯ i ) according to the sparse characteristic of the SVR.
Decision function is only determined by support vectors and has nothing to do with nonsupport vectors. Decision regression function (2) can be written as:
f ( x ) = i = 1 l ( α ¯ i α ¯ i ) K ( x i , x ) + b ¯

3. Short-Term Load Forecasting Using the PSO-SVR and FCM Clustering Technique

This section presents the optimum FCM clustering analysis and a new short-term load forecasting method using the PSO-SVR and FCM clustering technique.
Fuzzy c-means (FCM) is a clustering method which allows a large number of data to be classified into two or more clusters. This method (developed by Dunn in 1973 [12] and improved by Bezdek in 1981 [13]) is widely used in pattern recognition. The FCM is formulated as a model of minimizing the following objective function:
J m = i N j C u i j m x i c j 2 , 1 m <
where m is any real number greater than 1, uij is the degree of membership of xi belonging to cluster j, xi is the ith of d-dimensional measured data, cj is a d-dimension center of the cluster, and ||*|| is any norm expressing the similarity between any measured data and the center.
Fuzzy partitioning is carried out through an iterative optimization of the objective function shown above, with the updates of membership uij and the cluster centers cj by:
u i j = 1 k = 1 C [ x i c j x i c k ] 2 m 1
c j = i = 1 N u i j m x i i = 1 N u i j m
The iteration is stopped when max i j { | u i j ( k + 1 ) u i j ( k ) | } < ε , where ε is a pre-specified error criterion and k is the iteration step. This procedure converges to a local minimum or a saddle point of Jm.
The main idea for the short-term load forecasting is as follows: (1) Use the FCM clustering algorithm to obtain the optimum load pattern classification samples considering the change rules of the historical daily load in a power system. (2) Use the pattern recognition method to extract the optimal training samples. (3) Construct the load forecasting model using the PSO-SVR regression algorithm. This method can reduce the number of training samples and improve the prediction accuracy of support vector machines.
The following are the details for this method:
(1)
Pre-process the historical load data and normalize the basic data. A basic sample for each predicted time can be obtained based on the input-output load data, and the forecasting input sample x t ( t = 1 , 2 , , N ) is formed.
(2)
Set the kernel function and model parameters, and select the weight of the FCM cluster analysis, here m = 1.3 , the largest possible number of clusters c max = n , where n is the load sample number of the dataset used in clustering analysis, then set the pre-specified iterative convergence error of FCM clustering algorithm, here ε = 10 6 . The basic sample set for each predict time in the objective day is analyzed using optimal FCM cluster method, then the classified information of the basic sample set with the best clusters number can be collected. The distance between the forecast time period of the input samples x t is calculated, and the minimum distance corresponding to the sub-class as the PSO-SVR forecasting methods sample set will be set. By using the sample set at each forecasting time and the selected clustering parameters, the PSO-SVR load forecasting model is constructed, and the parameters for support vector machine regression algorithm are obtained. The objective function as Equation (7) can be formed and solved. Finally, by substituting the optimal solution of SVR model α i and b to Equation (8), the optimal regression function at each time period can be 0.
(3)
Use the forecast input samples and the optimal regression function at each time period to predict the load in the future.
The detail of the load forecasting process using the PSO-SVR and FCM clustering techniques is shown in Figure 1.
Figure 1. Flow chart of the load forecasting process.
Figure 1. Flow chart of the load forecasting process.
Energies 04 00173 g001

4. Simulation Results and Discussion

In order to verify the feasibility of the proposed method, a case study on the load forecasting from a city in Chongqing, China, was used. The traditional methods are first used to forecast the load, and the method in this paper is also used to forecast the short-term load, and comparisons were made to analyze the characteristics of the different methods.
The historical load data is shown in Table 1. Firstly the time series method is used to forecast the short-term load, and Table 2 shows the load forecasting results using the time series method. It can be concluded that the maximum forecasting error is 9.362%, which is a little large for practical engineering. The average error of the time series method is 1.136%.
Table 1. Historical load data.
Table 1. Historical load data.
Month12345678910
Time(hour)
0:002269.52072.42082.52064.12165.32292.42308.62331.82212.92224.0
1:002013.61863.21781.91887.61967.82057.72112.12055.11985.61955.5
2:001782.31626.11724.31740.21814.31873.31889.51867.71893.81841.6
3:001684.51624.91659.01696.41721.31831.41802.31817.61822.21789.5
4:001692.41553.11611.81643.21697.81798.51793.71752.61782.91744.8
5:001622.91556.21537.41622.31713.71783.41736.51732.61747.11742.0
6:001655.71560.81577.71663.01734.91789.01786.41748.51800.31766.5
7:001704.21657.01677.71890.71974.42025.41872.11801.62009.51972.0
8:001809.81786.61792.71965.02097.72108.51955.61901.72103.52112.1
9:002043.42064.62205.22505.82658.12677.72410.32317.52593.72642.6
10:002034.72204.42369.02697.62807.42819.22604.62432.22752.42693.4
11:002168.42343.42483.12842.02955.42899.92776.42608.82880.52621.4
12:002320.72477.42553.92846.02920.52838.32781.32650.22818.82601.8
13:002130.42194.02322.92662.72832.62625.72567.62382.12551.32400.1
14:002093.32245.72408.22798.52869.82696.42619.72400.32679.12519.3
15:002119.02294.22428.92832.22764.52696.82649.72430.82649.42540.6
16:002131.22245.32454.22794.82733.32665.32666.22407.82621.62517.8
17:002213.72342.32509.82845.32799.32747.62729.02510.92698.52599.4
18:002372.52555.32707.72896.02905.72824.42820.42671.12774.42668.6
19:002584.22752.32853.23004.53122.83118.52963.12950.93016.52891.2
20:002540.32695.22859.42934.73096.83109.52014.33031.72983.22878.0
21:002569.62643.42834.32944.73083.03130.63001.02979.52982.22900.1
22:002419.42543.02602.22708.92873.82893.52817.62754.72822.22731.6
23:002199.22222.82262.12394.02558.92609.02509.32440.02458.62411.7
Table 2. Load forecasting using the time series method.
Table 2. Load forecasting using the time series method.
Time (hour)Actual Load (MW)Forecasting Load (MW)Forecasting Error (%)Time (hour)Actual Load (MW)Forecasting Load (MW)Forecasting Error (%)
0:002096.02202.3434.82912:002651.62680.8891.094
1:001983.91968.010−0.80713:002498.62466.940−1.283
2:001821.61805.310−0.90214:002608.42533.030−2.975
3:001773.41744.910−1.63315:002570.52540.610−1.176
4:001710.71707.080−0.21216:002573.22523.750−1.959
5:001698.61679.410−1.14317:002642.12599.580−1.636
6:001773.41708.286−3.81018:002751.22719.609−1.161
7:001936.51858.460−4.19919:002817.22925.7203.709
8:002055.61963.320−4.70020:002864.82814.310−1.794
9:002637.72411.890−9.36221:002872.32906.8401.188
10:002591.22541.490−1.95622:002665.32716.6901.892
11:002650.52657.9300.28023:002396.12406.5600.435
Then the method based on PSO-SVR and method in this paper were used, respectively, to forecast the load on December 11th, and the results are shown in Table 3. It can be concluded that the average error for the PSO-SVR method is 1.443%, and the average error for the method presented in this paper is 1.066%. The maximum errors for the different methods are 3.25% and 2.798%, therefore the method in this paper improves the accuracy of the short-term load forecasting.
Table 3. Comparison of load forecasting on December 11th between the method based on PSO-SVM and method in this paper.
Table 3. Comparison of load forecasting on December 11th between the method based on PSO-SVM and method in this paper.
Time (hour)Actual Load (MW)PSO-SVR ModelFCM Based PSO-SVR Model
Forecasted Load (MW)Forecasting Error (%)Forecasted Load (MW)Forecasting Error (%)
0:002096.02084.384−0.5522098.7190.128
1:001983.91936.638−2.3721975.301−0.333
2:001821.61798.668−1.2491811.568−0.521
3:001773.41673.171−2.2101755.734−0.993
4:001710.71699.914−2.0941699.189−0.673
5:001698.61728.8720.0761705.6290.414
6:001773.41914.571−2.7111729.517−2.495
7:001936.52048.105−1.1121882.311−2.798
8:002055.62589.259−0.3642001.069−2.653
9:002637.72604.762−1.7372656.1040.698
10:002591.22633.8700.5132617.0520.998
11:002650.52634.436−0.6172695.4791.697
12:002651.62457.940−0.6272680.6771.097
13:002498.62521.008−1.5272454.077−1.682
14:002608.42540.626−3.2502586.288−0.848
15:002570.52542.728−1.0622585.8580.597
16:002573.22597.982−1.1812564.809−0.326
17:002642.12675.977−1.6602654.6470.475
18:002751.22743.891−2.7042717.247−1.254
19:002817.22910.345−2.5022749.604−2.399
20:002864.82926.3851.6002866.9770.0760
21:002872.32675.5751.8832885.2490.451
22:002665.32411.6800.3862670.7050.203
23:002396.12084.3840.6502377.644−0.770
Average Error (Absolute Value )1.443 1.066
Maximum Error (Absolute Value )3.250 2.798
To make a clearer comparison between the time series method, the method based on PSO-SVM, and the method in this paper, the comparison between these methods and the real load is made as shown in Figure 2. It can be concluded that the method in this paper is mostly close to the real load.
The hourly forecasting error curves of the three load forecasting models are shown in Figure 3, which shows the load at different hours on December 11, it also indicates that the error between the real load and the method in this paper is the lowest.
Figure 2. Comparison of forecasting loads using different methods.
Figure 2. Comparison of forecasting loads using different methods.
Energies 04 00173 g002
Figure 3. Comparison of load forecasting errors using different methods.
Figure 3. Comparison of load forecasting errors using different methods.
Energies 04 00173 g003
For further study, the load on the weekend is forecasted by the time series method, the PSO-SVR model and method in this paper, respectively, and the result is shown in Figure 4. The same conclusion can be reached.
The above forecasting is regarding the load at different points in a day. In addition, the load forecasting on the different day in one week is carried out with the different methods, and the results are shown in Table 4. It can be concluded from the Table that the technique proposed in this paper has a higher accuracy compared with the conventional model. Table 4 and Table 5 also show that the proposed model improves accuracy of the hourly load forecasting significantly.
Figure 4. Comparison of the load forecasting value on the weekend.
Figure 4. Comparison of the load forecasting value on the weekend.
Energies 04 00173 g004
Table 4. Comparison of load forecasting using different methods.
Table 4. Comparison of load forecasting using different methods.
DateTime SeriesPSO-SVR ModelFCM Based PSO-SVR Model
Average Absolute Error (%)Maximum Error (%)Average Absolute Error (%)Maximum Error (%)Average Absolute Error (%)Maximum Error (%)
12.59.43817.6201.4433.2501.0662.798
12.68.71314.0041.3494.3971.1372.989
12.73.5189.3441.6283.5911.3313.270
12.81.1179.6491.6824.2841.5003.954
12.95.57610.6411.4244.6991.1142.887
12.101.76010.8891.3603.7410.9542.599
12.111.1379.3621.9113.9281.4263.466
Average in a week4.46611.6441.5423.9841.2183.137
Table 5. Load forecasting errors in a week.
Table 5. Load forecasting errors in a week.
DateAverage Error (%)Maximum Error (%)Error > 3%Error > 4%
12.51.0662.79800
12.61.1372.98900
12.71.3313.27010
12.81.5003.95430
12.91.1142.88700
12.100.9542.59900
12.111.4263.46610

5. Conclusions

An improved method for the short-term load forecasting is presented to improve the forecasting accuracy of daily loads, especially in the scenario where the daily load is affected by weather and date factors seriously. Considering the periodical feature of load, the optimal FCM clustering was used to obtain the optimal data pattern division of history load samples and the best training sample set. The data regularity of input-output function relation in the training samples can be enhanced, the consistency of data characteristics can be ensured. The effective integration of PSO-SVR and the optimal FCM clustering algorithm is achieved.
The load forecasting results of a city in Chongqing illustrate the proposed hybrid method is effective, and not only reduces the number of training samples, but also improves the load forecasting accuracy.

Acknowledgement

The authors would like to acknowledge the grant from the Fundamental Research Funds for the Central Universities (Project No. CDJXS11150022).

References

  1. Li, Y.B.; Zhang, N.; Li, C.B. Support vector machine forecasting method improved by chaotic particle swarm optimization and its application. J. Cent. South Univ.Technol. 2009, 16, 478–481. [Google Scholar] [CrossRef]
  2. Wang, B.; Tai, N.L.; Zhai, H.Q.; Ye, J.; Zhu, J.D.; Qi, L.B. Hybrid optimization method based on evolutionary algorithm and particle swarm optimization for short-term load forecasting. Proc. CSU-EPSA 2008, 20, 50–55. [Google Scholar]
  3. Wang, L.J.; Liu, C. Short-term price forecasting based on PSO train BP neural network. Electr. Power Sci. Eng. 2008, 24, 21–23. [Google Scholar]
  4. Li, X.M.; Gong, D.; Li, L.; Sun, C.Y. Next day load forecasting using SVM. Lect. Notes Comput. Sci. 2005, 3498, 634–639. [Google Scholar]
  5. Mahalanabis, A.K.; Kothari, D.P.; Ahson, S.I. Computer Aided Power System Analysis and Control; Tata McGraw-Hill Publishing Company limited: New Delhi, India, 1988. [Google Scholar]
  6. Box, G.E.; Jenkins, G.M. Time Series Analysis Forecasting and Control; Holden-Day: San Fransisco, CA, USA, 1976. [Google Scholar]
  7. Moghram, I.; Rahman, S. Analysis and evaluation of five short-term load forecasting techniques. IEEE Trans. Power Syst. 1989, 4, 1484–1491. [Google Scholar] [CrossRef]
  8. Chen, C.H. Fuzzy Logic and Neural Network Handbook (Computer Engineering Series); McGraw-Hill Companies: Hightstown, NJ, USA, 1996. [Google Scholar]
  9. George, J.; Klis, T.; Folger, A. Fuzzy Sets Uncertainty and Information; Prentice Hall of India Private Limited: New Delhi, India, 1993. [Google Scholar]
  10. Caijin, D.; Yun, F.Z. Application of particle group and neural network hybrid algorithm in load forecast. High Voltage Eng. 2007, 33, 90–93. [Google Scholar]
  11. Zhuang, Y.Y. Short-term load forecast in power system based on PSO optimizing algorithm. Control Manage. 2007, 3, 9–11. [Google Scholar]
  12. Dunn, J.C. A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters. J. Cybern. 1973, 3, 32–57. [Google Scholar] [CrossRef]
  13. Bezdek, J.C. Pattern Recognition with Fuzzy Objective Function Algorithms; Plenum Press: New York, NY, USA, 1981. [Google Scholar]

Share and Cite

MDPI and ACS Style

Duan, P.; Xie, K.; Guo, T.; Huang, X. Short-Term Load Forecasting for Electric Power Systems Using the PSO-SVR and FCM Clustering Techniques. Energies 2011, 4, 173-184. https://doi.org/10.3390/en4010173

AMA Style

Duan P, Xie K, Guo T, Huang X. Short-Term Load Forecasting for Electric Power Systems Using the PSO-SVR and FCM Clustering Techniques. Energies. 2011; 4(1):173-184. https://doi.org/10.3390/en4010173

Chicago/Turabian Style

Duan, Pan, Kaigui Xie, Tingting Guo, and Xiaogang Huang. 2011. "Short-Term Load Forecasting for Electric Power Systems Using the PSO-SVR and FCM Clustering Techniques" Energies 4, no. 1: 173-184. https://doi.org/10.3390/en4010173

Article Metrics

Back to TopTop