Next Article in Journal
Assessing the Role of Policies on Land-Use/Cover Change from 1965 to 2015 in the Mu Us Sandy Land, Northern China
Previous Article in Journal
Climate Change Mitigation Pathways for Southeast Asia: CO2 Emissions Reduction Policies for the Energy and Transport Sectors
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Nonadditive Grey Prediction Using Functional-Link Net for Energy Demand Forecasting

1
College of Management & College of Tourism, Fujian Agriculture and Forestry University, Fuzhou 350002, China
2
Department of Business Administration, Chung Yuan Christian University, Chung Li Dist., Taoyuan 32023, Taiwan
Sustainability 2017, 9(7), 1166; https://doi.org/10.3390/su9071166
Submission received: 2 June 2017 / Revised: 26 June 2017 / Accepted: 27 June 2017 / Published: 3 July 2017
(This article belongs to the Section Energy Sustainability)

Abstract

:
Energy demand prediction plays an important role in sustainable development. The GM(1,1) model has drawn our attention to energy demand forecasting because it only needs a few data points to construct a time series model without statistical assumptions. Residual modification is often considered as well to improve the accuracy of predictions. Several residual modification models have been proposed, but they focused on residual sign estimation, whereas the FLNGM(1,1) model using functional-link net (FLN) can estimate the sign as well as the modification range for each predicted residual. However, in the original FLN, an activation function with an inner product assumes that criteria are independent of each other, so additivity might influence the forecasting performance of FLNGM(1,1). Therefore, in this study, we employ the FLN with a fuzzy integral instead of an inner product to propose a nonadditive FLNGM(1,1). Experimental results based on real energy demand cases demonstrate that the proposed grey prediction model performs well compared with other grey residual modification models that use sign estimation and the additive FLNGM(1,1).

1. Introduction

Several advanced countries have worked on the green new deal to promote the development of green energy technology, and to achieve the ultimate goal of sustainable development. Sustainable development is based on three main traits, including energy security, economic development, and environmental protection [1]. These traits involve the amount of energy demand and supplies, effective use of the energy for industries, and development of new technologies for renewable energy. Environmental impacts due to energy consumption are inevitable because economic development is ongoing, and will have a significant role when devising the policy of sustainable development for cities or countries [2]. In the last few decades, continuous economic development and population increases worldwide have led to rapid growth in the demand for energy. If the current global energy consumption pattern continues, the global energy consumption will increase by over 50% before 2030 [3]. In terms of the global electricity demand, the average annual growth rate was 2.6% from 1990 to 2000 and 3.3% from 2000 to 2010. It is expected to reach 2.8% from 2010 to 2020 [4]. Since the amount of energy demand must be taken into account first for sustainable development, this leads to an important issue related to how to accurately predict energy demand.
Many forecasting methods, including artificial intelligence techniques, multivariate regression, and time series analysis, have frequently been applied to energy demand forecasting [5,6,7,8,9,10,11,12,13]. A large number of samples are required for multivariate regression and time series analysis like the autoregressive integrated moving average (ARIMA) [14]. The performance of the above-mentioned methods can be significantly affected by the number and the representativeness of observations [15,16]. However, using long-term data to build energy consumption prediction models may be impractical because the average annual growth rate of energy consumption is high and unstable. Beyond this, statistical methods usually require that the data conform to statistical assumptions, such as having a normal distribution [17], yet energy consumption data often do not conform to these usual statistical assumptions [17], limiting the forecasting capabilities of statistical methods. Therefore, to construct an energy demand prediction model, a forecasting method is needed that works well with small samples and without making any statistical assumptions [18]. One of the grey prediction models, GM(1,1), has drawn our attention to energy demand forecasting [2]. Indeed, how factors such as income and population influence the demand for energy is not clear, so energy demand forecasting can be regarded as a grey system problem [2,16].
The GM(1,1) model only needs four recent sample data points to achieve a reliable and acceptable accuracy of prediction [14]. Several versions have been proposed to improve the accuracy of its predictions [19,20,21,22,23]. In addition, a residual modification model built on residuals obtained from the original GM(1,1) may be an effective solution [24,25]. In terms of residual modification, in addition to residual sign estimation [17,26,27,28], Hu [29] proposed a new model called FLNGM(1,1) which uses the functional-link net (FLN) with an effective function approximation capability [30,31,32,33] to estimate both the sign and the modification range of the predicted residuals obtained from the residual GM(1,1) model.
In the original FLNGM(1,1), the hyperbolic tangent function is considered as the activation function. Clearly, the output of this function uses a weighted sum of the connection weights with an enhanced pattern. It is assumed that the additivity property of the interaction among individual features results in an enhanced pattern. However, the attributes are not always independent of each other [34,35,36,37], so an assumption of additivity for the enhanced pattern may not be reasonable [38], thereby affecting the forecasting performance of FLNGM(1,1). The Choquet fuzzy integral [39,40,41] does not assume the independence of each criterion, so in this study we propose a nonadditive residual modification model, called nonadditive FLNGM(1,1) (N-FLNGM(1,1)) for energy demand forecasting, where we replace the weighted sum with the Choquet integral inside the hyperbolic tangent function. A genetic algorithm(GA) [42] is used to construct the proposed nonadditive FLNGM(1,1) model with high prediction accuracy.
The remainder of this paper is organized as follows. Section 2 introduces the traditional GM(1,1) model using residual modification with sign estimation. Section 3 and Section 4 present residual modification using FLN and the proposed N-FLNGM(1,1) model, respectively. Section 5 examines the energy demand forecasting performance of the N-FLNGM(1,1) model based on real energy demand cases in China. In Section 6, we discuss the outcomes and give our conclusions.

2. GM(1,1) Model Using Residual Modification with Sign Estimation

2.1. Original GM(1,1) Model

Given an original data sequence X(°) = ( x 1 ( 0 ) , …, x n ( 0 ) ) made up of n samples, then a new sequence, X(1) = ( x 1 ( 1 ) , x 2 ( 1 ) , …, x n ( 1 ) ), can be generated from x 1 ( 0 ) by the accumulated generating operation [9,25] as follows:
x k ( 1 ) = j = 1 k x j ( 0 ) , k =   1 ,   2 , , n
and x 1 ( 1 ) , x 2 ( 1 ) , …, x n ( 1 ) can then be approximated by a first-order differential equation,
d x ( 1 ) d t + a   x ( 1 ) =   b
where a and b are the developing coefficient and control variable, respectively.
The predicted value x ^ k ( 1 ) can be obtained by solving the differential equation with an initial condition that x 1 ( 1 ) = x 1 ( 0 ) :
x ^ k ( 1 ) =   (   x 1 ( 0 ) b a )   e a ( k 1 ) + b a
and thus x ^ k ( 1 ) = x 1 ( 0 ) holds. Then, a and b can be estimated by a grey difference equation:
x k ( 0 ) +   a   z k ( 1 ) =   b
where z k ( 1 ) is the background value,
z k ( 1 ) =   α   x k ( 1 ) +   ( 1     α ) x k 1 ( 1 )
and α is usually specified as 0.5 for convenience. In turn, a and b can be obtained using the ordinary least-squares method:
[ a ,   b ] T =   ( B T B ) 1 B T y
where
B   = [ z 2 ( 1 ) 1 z 3 ( 1 ) 1 z n ( 1 ) 1 ]
and
y =   [ x 2 ( 0 ) , x 3 ( 0 ) , , x n ( 0 ) ] T
Using the inverse accumulated generating operation, the predicted value of x k ( 0 ) is
x ^ k ( 0 )   =   x ^ k ( 1 ) x ^ k 1 ( 1 ) , k =   2 ,   3 , , n
Therefore,
x ^ k ( 0 ) =   ( 1     e a )   ( x 1 ( 0 ) b a )   e a ( k 1 ) , k =   2 ,   3 ,   , n
Note that x ^ 1 ( 1 ) = x ^ 1 ( 0 ) holds.

2.2. Residual Modification with Sign Estimation

To build a residual modification model, the original GM(1,1) is constructed first, followed by the residual GM(1,1). Let ε ( 0 ) = ( ε 2 ( 0 ) , ε 3 ( 0 ) , …, ε n ( 0 ) ) denote the sequence of absolute residual values, where
ε k ( 0 ) = | x k ( 0 ) x ^ k ( 0 ) | , k =   2 ,   3 , , n
Using the same construction for X(°) as the original GM(1,1) model, a residual model can be established for ε ( 0 ) , and the predicted residual of ε k ( 0 ) is
ε ^ k ( 0 ) =   ( 1 e a ε )   (   ε 2 ( 0 ) b ε a ε ) e a ε ( k 1 ) , k =   3 ,   4 , , n
In the remnant GM(1,1) model with sign estimation, x ^ k ( 0 ) is modified to x ^ k r ( 0 ) by adding or subtracting ε ^ k ( 0 ) from x ^ k ( 0 ) [27].
x ^ k r ( 0 ) =   x ^ k ( 0 ) +   s k ε ^ k ( 0 ) , k =   2 ,   3 , , n
where sk denotes the positive or negative sign of ε ^ k ( 0 ) with respect to the k-th year. Those residual sign estimation methods mentioned above can be used to determine sk.

3. Residual Modification Using FLN

A single-layer feed-forward FLN is an appropriate tool for obtaining sign and range estimations due to its effective function approximation ability. An enhanced pattern with respect to a single input denoted by t t can be generated as (t, sin(πt), cos(πt), sin(2πt), cos(2πt)) through a functional link [31,32], where t denotes a specific time point, such as the t-th year.
In the original FLNGM(1,1), the hyperbolic tangent function is used as the activation function in the output node,
tanh ( z )   = e z e z e z + e z
where tanh(z) lies within the range (−1, 1). Let θ be the bias to the output node. When (t, sin(πt), cos(πt), sin(2πt), cos(2πt)) is presented to the net, z can be computed as follows:
z = w1t + w2sin(πt) + w3cos(πt) + w4sin(2πt) + w5cos(2πt) + θ
The actual output value, y, is just to equal tanh(z), and y can be interpreted as the range of modification for x ^ k ( 0 ) . Let yk denote the actual output value with respect to x ^ k ( 0 ) . Then, yk = 1 indicates that x ^ k ( 0 ) can be modified as its tolerable upper limit, whereas yk = −1 denotes that x ^ k ( 0 ) can be modified as its tolerable lower limit. An advantage compared with residual estimation methods is that the modification range of the original x ^ k ( 0 ) is not restricted to ε ^ k ( 0 ) . Thus, it can be seen that the additivity assumption of the interaction among t, sin(πt), cos(πt), sin(2πt), and cos(2πt) holds; i.e., these terms are assumed to be independent of each other.
In particular, based on the idea of three-sigma limits, which are used to set the upper and lower control limits in statistical quality control charts [43], the value x ^ k FLN ( 0 ) predicted by the proposed model is formulated heuristically as follows.
x ^ k FLN ( 0 ) =   ( 1 e a ) (   x 1 ( 0 ) b a ) e a ( k 1 ) +   3 y k ε ^ k ( 0 ) ,   k =   2 ,   3 , , n
where 3 ε ^ k ( 0 ) refers to data within three residuals with respect to x ^ k ( 0 ) and represents the tolerable maximum range for modifying x ^ k ( 0 ) .

4. Nonadditive Residual Modification Model

Let (t, sin(πt), cos(πt), sin(2πt), cos(2πt)) be represented by (v1, v2, v3, v4, v5), and let X = {v1, v2, v3, v4, v5}, where X is called the feature space. A fuzzy measure is a nonadditive set function that can be used along with a fuzzy integral for aggregating information sources [36]. Among various fuzzy measures, the λ-fuzzy measure has been suggested for computation of the fuzzy integral because of its convenience users [44,45,46]. The advantage is that, after determining fuzzy densities μ1, μ2, …, μ5, where μk denotes μ({vk}), λ can be uniquely determined from the condition μ(X) = 1. μ(Ek) can be computed as follows:
μ ( E k )   = 1 λ [ i = k n ( 1 + λ μ i )     1 ]
where Ek = {vk, vk+1, …, v5} (1 ≤ k ≤ 5). As illustrated in Figure 1, fuzzy densities can be used as connection weights, and they can be determined automatically by the GA.
Let f be a non-negative, real-valued measurable function defined on X. The element in X with min{f(vk)|k = 1, 2, …, 5} is renumbered as one, where f(vk) denotes the performance value of vk. In other words, all the elements vk are rearranged in order of descending f(vk), such that f(v1) ≤ f(v2) ≤…≤ f(v5). The Choquet integral (c) f d u over X of f with respect to μ is defined as follows.
( c ) f d μ = k = 1 5 f ( v k ) [ μ ( E k ) μ ( E k + 1 ) ]
where μ(E6) is specified as zero. The Choquet integral illustrated in Figure 2 comprises five different rectangular areas. Since the assumption of additivity among features is not warranted in real applications, it is reasonable to use the Choquet integral with respect to aggregate t, sin(πt), cos(πt), sin(2πt), and cos(2πt) to form the nonadditive hyperbolic tangent function denoted by (c)tanh(z). This definitely differentiates N-FLNGM(1,1) from the original FLNGM(1,1) models.
In the construction of the original FLNGM(1,1), a real-valued GA is designed to automatically determine the connection weights (i.e., μ1, μ2, μ3, μ4, μ5) and the bias (i.e., θ) for the N-FLNGM(1,1). Indeed, Rooij et al. [47] noted that GAs are appropriate tools for training perceptron-like networks. To construct the model with high prediction accuracy, the mean absolute percentage error (MAPE) for the training patterns is considered. MAPE can be treated as the benchmark, and it is more stable than the more commonly used mean absolute error and root mean square error [48,49]. MAPE is formulated as follows:
MAPE = k T S | x k ( 0 ) x ^ k FLN ( 0 ) | | T S | × x k ( 0 ) ×   100 %
where TS denotes the training or testing data. Ref. [50] presented MAPE criteria for evaluating a forecasting model, where MAPE ≤ 10, 10 < MAPE ≤ 20, 20 < MAPE ≤ 50, and MAPE > 50 correspond to high, good, reasonable, and weak forecasting models, respectively. It is reasonable to define the fitness function as MAPE. Thus, the fitness of each individual is specified by a single objective in the combinatorial optimization problem.
Let nsize and nmax denote the population size and maximum number of generations, respectively, and Pm denote the population generated in generation m (1 ≤ mnmax). After evaluating the fitness value for each chromosome in Pm, selection, crossover, and mutation are applied until nsize new chromosomes have been generated for Pm+1. The GA can be performed until nmax generations have been generated. When the stopping condition has been satisfied, the algorithm is terminated, and the best chromosome with maximum fitness value among all successive generations serves as the desired solution to examine the generalization ability of the proposed N-FLNGM(1,1) model. The selection, crossover, and mutation operations required for N-FLNGM(1,1) are similar to those for FLNGM(1,1), so the details of the GA are omitted for simplicity and they can be found in [29].

5. Experimental Results

In Asia, China has become increasingly influential in terms of energy production and consumption [4]. It is interesting to conduct experiments on real energy demand cases from China to compare the energy demand forecasting ability of the proposed N-FLNGM(1,1) model with that of the additive FLNGM(1,1) and other grey residual modification models that use sign estimation, including the residual modification model using multi-layer perceptron (MLPGM(1,1)) [26], genetic programming (GPGM(1,1)) [17], and Markov chain [27]. The prediction accuracy of different prediction models were demonstrated by two real cases related to the energy demand in China. To ensure a fair comparison with the original FLNGM(1,1), the parameter specifications used for GA were the same as those described by Hu (2016), including nsize = 200, nmax = 1000, probabilities for crossover and mutation are 0.7 and 0.01 respectively.

5.1. Case I

An experiment was conducted based on the historical annual energy demand data from China collected between 1990 and 2007. As reported by Lee and Tong [17], the data from 1990 to 2003 were used for model fitting, and those from 2004 to 2007 were used for ex-post testing. The forecasting results obtained by Lee and Tong [17] using the original GM(1,1), MLPGM(1,1), and GPGM(1,1) models are summarized in Table 1 and illustrated in Figure 2.
Table 1 and Figure 3 show the prediction performance of the different forecasting models. The MAPE for model fitting obtained by N-FLNGM(1,1) was slightly inferior to that by the original FLNGM(1,1). However, we can see that N-FLNGM(1,1) outperformed the other forecasting models compared with the testing data. Using the testing data, in terms of MAPE, both the original FLNGM(1,1) and the proposed N-FLNGM(1,1) had good forecasting abilities, whereas the original GM(1,1), GPGM(1,1), MLPGM(1,1), and Markov-chain sign estimation only had reasonably fair forecasting abilities. A massive change occurred up to 2004, which may explain why the ex-post testing results were not as good as the model-fitting results.

5.2. Case II

The second experiment was conducted on the historical annual electricity demand of China, collected from China Statistical Yearbook 2014 [51]. As described by Zhou et al. [52], data from 1981 to 1998 were used for model fitting, and the other data for testing. The forecasting results obtained by the different forecasting models are shown in Table 2 and Figure 4. We can see that the MAPE for the original GM(1,1), MLPGM(1,1), GPGM(1,1), Markov-chain sign estimation, original FLNGM(1,1), and the proposed N-FLNGM(1,1) using the training data were 2.28%, 2.03%, 1.44%, 2.31%, 0.10%, and 0.13%, respectively. Using the testing data, the MAPE were 7.24%, 3.90%, 3.90%, 3.90%, 1.52%, and 1.41%, respectively. Figure 5 illustrates the APE of the different forecasting models. Based on these results, it is obvious that the proposed N-FLNGM(1,1) model yielded comparable performance to the other forecasting models considered. Additionally, Zhou et al. [52] showed that the MAPE for an autoregressive integrated moving average model and trigonometric grey prediction model were 3.25% and 2.12%, respectively, for training data, and 2.64% and 2.37% for testing data, which are inferior to the results obtained by the proposed N-FLNGM(1,1) model.

6. Discussion and Conclusions

The GM(1,1) model is the most frequently used grey prediction model, and it has played an important role in energy demand forecasting because it only requires a limited number of samples to construct a prediction model without statistical assumptions. In this paper, the proposed N-FLNGM(1,1) is built on the original FLNGM(1,1) model, which uses FLN to estimate the sign and modification range for each predicted residual simultaneously by using the predicted value obtained from the GM(1,1) model. We performed experiments to validate the effectiveness of the proposed N-FLNGM(1,1) model for energy demand forecasting. Experimental results demonstrate that the proposed nonadditive grey prediction model performs well compared with the additive FLNGM(1,1). In addition to grey prediction models, we further examine the prediction performance of a frequently-used prediction model—multi-layer perceptron (MLP) with backpropagation learning. MLP here has one input node, one hidden layer with two neurons, and one output layer with one neuron, training with 10,000 iterations, and learning rate of 0.8. The forecasting results obtained by MLP for case I and II are summarized in Table 3 and Table 4, respectively. It can be seen that the proposed N-FLNGM(1,1) outperforms MLP.
The proposed N-FLNGM(1,1) model is totally different from the original FLNGM(1,1), MLPGM(1,1), GPGM(1,1), and Markov-chain sign estimation. First, when the Choquet fuzzy integral with respect to a λ-fuzzy measure is incorporated into the FLN to consider the interaction among features in the enhanced pattern, the testing results obtained by the additive FLNGM(1,1) can be further improved by the proposed N-FLNGM(1,1). Second, MLPGM(1,1), GPGM(1,1), and Markov-chain sign estimation contributed to estimate residual sign sk in Equation (13) to improve the prediction accuracy of the original GM(1,1) model. The common characteristic of these three models is that x ^ k ( 0 ) can be adjusted by either ε ^ k ( 0 ) or − ε ^ k ( 0 ) . This is too tight a restriction for the modification range. In contrast, originating from the idea of three-sigma limits, N-FLNGM(1,1) presents a novel updated rule for x ^ k ( 0 ) by FLN, where the adjustment of x ^ k ( 0 ) ranges between −3 ε ^ k ( 0 ) and 3 ε ^ k ( 0 ) . It can be seen that N-FLNGM(1,1) outperformed MLPGM(1,1), GPGM(1,1), and Markov-chain sign estimation compared with data used for model fitting and ex-post testing. It should be noted that in a perceptron-like net, it is not easy to explain the meaning of the connection weights [53]. However, an advantage of the proposed N-FLNGM(1,1) compared with the original FLNGM(1,1) is that μk can be interpreted as the degree of importance of vk.
Over the next two decades, coal, natural gas, and crude oil can be the main energy supplies driving the world economy. It is expected that the growth of the crude oil demand will mainly come from emerging markets from 2015 to 2035, and one half of the growth will be in China [54]. As a matter of fact, energy consumption in China has been mainly provided by coal and crude oil. For instance, the China Statistical Yearbook 2014 [51] showed that approximately two-thirds of the total energy consumed was provided by coal and 18% by oil in 2013. This leads to inevitable environmental impacts on China. Undoubtedly, energy demand prediction has become increasingly important when devising sustainable development plans for China [16]. The proposed N-FLNGM(1,1) has demonstrated its potential for energy demand forecasting.
When it comes to fuzzy integrals, the Sugeno integral is the other most commonly used fuzzy integral. However, only the maximum and minimum operators are involved in the Sugeno integral, so the Choquet integral is preferable to the Sugeno integral for many decision problems [36], which is why we use the Choquet integral rather than the Sugeno integral. In order to perform a soft aggregation, two special ordered weighted averaging (OWA) operators—S-OWA-OR and S-OWA-AND—can be employed to replace the maximum and minimum operators, respectively [55]. In future research, it would be interesting to examine the forecasting ability of a nonadditive prediction model using the Sugeno integral combined with OWA operators.
In this study, both the original and the residual GM(1,1) models use the least squares method to obtain the developing coefficient and control variable, which depend on the background value. However, it is not easy to determine the background value. Hu et al. [56] presented a novel neural-network-based GM(1,1) model (NNGM(1,1)) to resolve this troublesome problem by automatically determining the developing coefficient and control variable. Thus, it would be interesting to examine whether incorporating NNGM(1,1) into N-FLNGM(1,1) instead of the traditional GM(1,1) model might affect the prediction performance of N-FLNGM(1,1) in energy demand forecasting.

Acknowledgments

The author would like to thank the anonymous referees for their valuable comments. This research is partially supported by the Ministry of Science and Technology of Taiwan under grant MOST 104-2410-H-033-023-MY2.

Conflicts of Interest

The author declare no conflict of interest.

References

  1. Taiwan Bureau of Energy. White Paper on Energy and Industrial Technology, Technical Report. Available online: https://www.moeaboe.gov.tw/ecw/populace/images/file_icon/pdf.png (accessed on 30 May 2017).
  2. Suganthi, L.; Samuel, A.A. Energy models for demand forecasting—A review. Renew. Sustain. Energy Rev. 2012, 16, 1223–1240. [Google Scholar] [CrossRef]
  3. Smith, M.; Hargroves, K.; Stasinopoulos, P.; Stephens, R.; Desha, C.; Hargroves, S. Energy Transformed: Sustainable Energy Solutions for Climate Change Mitigation; The Natural Edge Project, CSIRO, and Griffith University: Brisbane, Australia, 2007. [Google Scholar]
  4. Liu, Z.Y. Global Energy Internet; China Electric Power Press: Beijing, China, 2015. [Google Scholar]
  5. Boroojeni, K.G.; Amini, M.H.; Bahrami, S.; Iyengar, S.S.; Sarwat, A.I.; Karabasoglu, O. A novel multi-time-scale modeling for electric power demand forecasting: From short-term to medium-term horizon. Electr. Power Syst. Res. 2017, 142, 58–73. [Google Scholar] [CrossRef]
  6. Ediger, V.S.; Akar, S. ARIMA forecasting of primary energy demand by fuel in Turkey. Energy Policy 2007, 35, 1701–1708. [Google Scholar] [CrossRef]
  7. Lauret, P.; Fock, E.; Randrianarivony, R.N.; Manicom-Ramasamy, J.F. Bayesian neural network approach to short time load forecasting. Energy Convers. Manag. 2000, 49, 1156–1166. [Google Scholar] [CrossRef]
  8. Mohanty, S.; Patra, P.K.; Sahoo, S.S.; Mohanty, A. Forecasting of solar energy with application for a growing economy like India: Survey and implication. Renew. Sustain. Energy Rev. 2017, 78, 539–553. [Google Scholar] [CrossRef]
  9. Toksari, M.D. Estimating the net electricity energy generation and demand using ant colony optimization approach: Case of Turkey. Energy Policy 2009, 37, 1181–1187. [Google Scholar] [CrossRef]
  10. Tutun, S.; Chou, C.A.; Canıyılmaz, E. A new forecasting framework for volatile behavior in net electricity consumption: A case study in Turkey. Energy 2015, 93, 2406–2422. [Google Scholar] [CrossRef]
  11. Verdejo, H.; Awerkin, A.; Becker, C.; Olguin, G. Statistic linear parametric techniques for residential electric energy demand forecasting. A review and an implementation to Chile. Renew. Sustain. Energy Rev. 2017, 74, 512–521. [Google Scholar] [CrossRef]
  12. Xia, C.; Wang, J.; McMenemy, K.S. Medium and long term load forecasting model and virtual load forecaster based on radial basis function neural networks. Electr. Power Energy Syst. 2010, 32, 743–750. [Google Scholar] [CrossRef]
  13. Yang, Y.; Chen, Y.; Wang, Y.; Li, C.; Li, L. Modelling a combined method based on ANFIS and neural network improved by DE algorithm: A case study for short-term electricity demand forecasting. Appl. Soft Comput. 2016, 49, 663–675. [Google Scholar] [CrossRef]
  14. Wang, C.H.; Hsu, L.C. Using genetic algorithms grey theory to forecast high technology industrial output. Appl. Math. Comput. 2008, 195, 256–263. [Google Scholar] [CrossRef]
  15. Li, D.C.; Chang, C.J.; Chen, C.C.; Chen, W.C. Forecasting short-term electricity consumption using the adaptive grey-based approach-An Asian case. Omega 2012, 40, 767–773. [Google Scholar] [CrossRef]
  16. Pi, D.; Liu, J.; Qin, X. A grey prediction approach to forecasting energy demand in China, Energy Sources, Part A: Recovery. Util. Environ. Eff. 2010, 32, 1517–1528. [Google Scholar]
  17. Lee, Y.S.; Tong, L.I. Forecasting energy consumption using a grey model improved by incorporating genetic programming. Energy Convers. Manag. 2011, 52, 147–152. [Google Scholar] [CrossRef]
  18. Feng, S.J.; Ma, Y.D.; Song, Z.L.; Ying, J. Forecasting the energy consumption of China by the grey prediction model, Energy Sources, Part B: Economics. Plan. Policy 2012, 7, 376–389. [Google Scholar]
  19. Chen, Y.; He, K.; Zhang, C. A novel grey wave forecasting method for predicting metal prices. Resour. Policy 2016, 49, 323–331. [Google Scholar] [CrossRef]
  20. Li, K.; Liu, L.; Zhai, J.; Khoshgoftaar, T.M.; Li, T. The improved grey model based on particle swarm optimization algorithm for time series prediction. Eng. Appl. Artif. Intell. 2016, 55, 285–291. [Google Scholar] [CrossRef]
  21. Wang, Z.X.; Hao, P. An improved grey multivariable model for predicting industrial energy consumption in China. Appl. Math. Model. 2016, 40, 5745–5758. [Google Scholar] [CrossRef]
  22. Zeng, B.; Meng, W.; Tong, M.Y. A self-adaptive intelligence grey predictive model with alterable structure and its application. Eng. Appl. Artif. Intell. 2016, 50, 236–244. [Google Scholar] [CrossRef]
  23. Zhao, H.; Guo, S. An optimized grey model for annual power load forecasting. Energy 2016, 107, 272–286. [Google Scholar] [CrossRef]
  24. Deng, J.L. Control problems of grey systems. Syst. Control Lett. 1982, 1, 288–294. [Google Scholar]
  25. Liu, S.; Lin, Y. Grey Information: Theory and Practical Applications; Springer-Verlag: London, UK, 2006. [Google Scholar]
  26. Hsu, C.C.; Chen, C.Y. Applications of improved grey prediction model for power demand forecasting. Energy Convers. Manag. 2003, 44, 2241–2249. [Google Scholar] [CrossRef]
  27. Hsu, C.I.; Wen, Y.U. Improved Grey prediction models for trans-Pacific air passenger market. Transp. Plan. Technol. 1998, 22, 87–107. [Google Scholar] [CrossRef]
  28. Hsu, L.C. Applying the grey prediction model to the global integrated circuit industry. Technol. Forecast. Soc. Chang. 2003, 70, 563–574. [Google Scholar] [CrossRef]
  29. Hu, Y.C. Grey prediction with residual modification using functional-link net and its application to energy demand forecasting. Kybernetes 2017, 46, 349–363. [Google Scholar] [CrossRef]
  30. Hu, Y.C. Functional-link nets with genetic-algorithm-based learning for robust nonlinear interval regression analysis. Neurocomputing 2009, 72, 1808–1816. [Google Scholar] [CrossRef]
  31. Pao, Y.H. Adaptive Pattern Recognition and Neural Networks; Addison-Wesley: Reading, Boston, MA, USA, 1989. [Google Scholar]
  32. Pao, Y.H. Functional-link net computing: Theory, system architecture, and functionalities. Computer 1992, 25, 76–79. [Google Scholar] [CrossRef]
  33. Park, G.H.; Pao, Y.H. Unconstrained word-based approach for off-line script recognition using density-based random-vector functional-link net. Neurocomputing 2000, 31, 45–65. [Google Scholar] [CrossRef]
  34. Hu, Y.C. Nonadditive similarity-based single-layer perceptron for multi-criteria collaborative filtering. Neurocomputing 2014, 129, 306–314. [Google Scholar] [CrossRef]
  35. Hu, Y.C.; Chiu, Y.J.; Liao, Y.L.; Li, Q. A fuzzy similarity measure for collaborative filtering using nonadditive grey relational analysis. J. Grey Syst. 2015, 27, 93–103. [Google Scholar]
  36. Wang, Z.; Leung, K.S.; Klir, G.J. Applying fuzzy measures and nonlinear integrals in data mining. Fuzzy Sets Syst. 2005, 156, 371–380. [Google Scholar] [CrossRef]
  37. Wang, W.; Wang, Z.; Klir, G.J. Genetic algorithms for determining fuzzy measures from data. J. Intell. Fuzzy Syst. 1998, 6, 171–183. [Google Scholar]
  38. Hu, Y.C.; Tseng, F.M. Functional-link net with fuzzy integral for bankruptcy prediction. Neurocomputing 2007, 70, 2959–2968. [Google Scholar] [CrossRef]
  39. Murofushi, T.; Sugeno, M. An interpretation of fuzzy measure and the Choquet integral as an integral with respect to a fuzzy measure. Fuzzy Sets Syst. 1989, 29, 201–227. [Google Scholar] [CrossRef]
  40. Murofushi, T.; Sugeno, M. A theory of fuzzy measures: Representations, the Choquet integral, and null sets. J. Math. Anal. Appl. 1991, 159, 532–549. [Google Scholar] [CrossRef]
  41. Murofushi, T.; Sugeno, M. Some quantities represented by the Choquet integral. Fuzzy Sets Syst. 1993, 56, 229–235. [Google Scholar] [CrossRef]
  42. Goldberg, D.E. Genetic Algorithms in Search, Optimization, and Machine Learning; Addison-Wesley: Massachusetts, MA, USA, 1989. [Google Scholar]
  43. Montgomery, D.C. Statistical Quality Control; John Wiley & Sons: New Jersey, NJ, USA, 2005. [Google Scholar]
  44. Kuncheva, L.I. Fuzzy Classifier. Design; Physica-Verlag: Heidelberg, Germany, 2000. [Google Scholar]
  45. Sugeno, M. Fuzzy measures and fuzzy integrals: A survey. In Fuzzy Automata and Decision Processes; Gupta, M.M., Saridis, G.N., Gaines, B.R., Eds.; North Holland: New York, NY, USA, 1977; pp. 89–102. [Google Scholar]
  46. Sugeno, M.; Narukawa, Y.; Murofushi, Y. Choquet integral and fuzzy measures on locally compact space. Fuzzy Sets Syst. 1998, 7, 205–211. [Google Scholar] [CrossRef]
  47. Rooij, A.J.F.; Jain, L.C.; Johnson, R.P. Neural Network Training Using Genetic Algorithms; World Scientific: Hackensack, NJ, USA, 1996. [Google Scholar]
  48. Lee, S.C.; Shih, L.H. Forecasting of electricity costs based on an enhanced gray-based learning model: A case study of renewable energy in Taiwan. Technol. Forecast. Soc. Chang. 2011, 78, 1242–1253. [Google Scholar] [CrossRef]
  49. Makridakis, S. Accuracy measures: Theoretical and practical concerns. Int. J. Forecast. 1993, 9, 527–529. [Google Scholar] [CrossRef]
  50. Lewis, C. Industrial and Business Forecasting Methods; Butterworth Scientific: London, UK, 1982. [Google Scholar]
  51. National Bureau of Statistics of China. China Statistical Yearbook 2014; China Statistics Press: Beijing, China, 2014.
  52. Zhou, P.; Ang, B.W.; Poh, K.L. A trigonometric grey prediction approach to forecasting electricity demand. Energy 2006, 31, 2839–2847. [Google Scholar] [CrossRef]
  53. Eric, R.; Seema, S. Bankruptcy prediction by neural network. In Neural Networks in Finance and Investing: Using Artificial Intelligence to Improve Real-World Performance; Trippi, R.R., Turban, E., Eds.; McGraw-Hill: Chicago, IL, USA, 1996; pp. 243–259. [Google Scholar]
  54. British Petroleum. Energy Outlook, Technical Report. Available online: https://www.bp.com/content/dam/bp/pdf/energy-economics/energy-outlook-2017/bp-energy-outlook-2017.pdf (accessed on 30 May 2017).
  55. Yager, R.R. Elements selection from a fuzzy subset using the fuzzy integral, IEEE Transactions on Systems. Man Cybern. 1993, 23, 467–477. [Google Scholar] [CrossRef]
  56. Hu, Y.C.; Tzeng, G.H.; Hsu, Y.T.; Chen, R.S. Using learning algorithm to find the developing coefficient and control variable of GM(1,1) model. J. Chin. Grey Syst. Assoc. 2001, 4, 17–26. [Google Scholar]
Figure 1. A functional-link net with Choquet fuzzy integral.
Figure 1. A functional-link net with Choquet fuzzy integral.
Sustainability 09 01166 g001
Figure 2. Graphical representation of Choquet fuzzy integral.
Figure 2. Graphical representation of Choquet fuzzy integral.
Sustainability 09 01166 g002
Figure 3. Predicted and actual values of different forecasting models for Case I.
Figure 3. Predicted and actual values of different forecasting models for Case I.
Sustainability 09 01166 g003
Figure 4. Predicted and actual values of different forecasting models for Case II.
Figure 4. Predicted and actual values of different forecasting models for Case II.
Sustainability 09 01166 g004
Figure 5. APE of different forecasting models for Case II.
Figure 5. APE of different forecasting models for Case II.
Sustainability 09 01166 g005
Table 1. Prediction accuracy obtained by different forecasting models for energy demand (unit: 104 tons of standard coal equivalent (SCE)). MAPE: mean absolute percentage error.
Table 1. Prediction accuracy obtained by different forecasting models for energy demand (unit: 104 tons of standard coal equivalent (SCE)). MAPE: mean absolute percentage error.
YearActualGM(1,1)MLPGM(1,1)GPGM(1,1)Markov-ChainFLNGM(1,1)N-FLNGM(1,1)
PredictedAPEPredictedAPEPredictedAPEPredictedAPEPredictedAPEPredictedAPE
199098,70398,703098,703098,703098,703098,703098,7030
1991103,783108,706.14.74103,7830103,7830103,7830104,116.20.32108,332.54.38
1992109,170112,335.52.9116,225.86.46108,445.20.66108,445.20.66108,913.60.23112,708.33.24
1993115,993116,086.10.08111,804.13.61111,804.13.61111,804.13.61115,973.20.02116,938.20.82
1994122,737119,9622.26115,248.86.10124,675.11.58115,248.86.10125,629.82.36124,309.91.28
1995131,176123,967.25.50129,154.81.54129,154.81.54129,154.81.54131,380.70.16130,8420.25
1996138,948128,106.27.80133,816.13.69133,816.13.69133,816.13.69135,703.72.33135,615.42.40
1997137,798132,383.33.93138,668.20.63138,668.20.63138,668.20.63136,710.10.79137,554.20.18
1998132,214136,803.33.47143,7218.7129,885.51.76129,885.51.76133,735.81.15136,059.72.92
1999133,831141,370.85.63133,756.50.06133,756.50.06133,756.50.06135,005.60.88136,387.91.92
2000138,553146,090.85.44137,709.80.61137,709.80.61137,709.80.61138,603.20.04140,826.21.65
2001143,199150,968.45.43141,743.61.02141,743.61.02141,743.61.02142,946.80.18145,273.31.45
2002151,797156,008.92.77145,855.23.91145,855.23.91145,855.23.91150,954.80.55153,016.60.81
2003174,990161,217.67.87150,041.614.26172,393.51.48150,041.614.26175,848.70.49168,529.33.69
MAPE 4.13 3.61 2.59 2.70 0.68 1.78
2004203,227166,600.218.02178,901.511.97178,901.511.97178,901.511.97186,450.18.26183,696.69.61
2005224,682172,162.623.37185,702.417.35185,702.417.35185,702.417.35194,058.013.63195,771.812.87
2006264,270177,910.732.68192,813.827.04192,813.827.04192,813.827.04202,011.123.56206,254.621.95
2007265,583183,850.730.77200,254.324.60200,254.324.60167,447.136.95210,377.620.79214,426.919.26
MAPE 26.21 20.23 20.23 23.22 16.56 15.92
Table 2. Prediction accuracy obtained by different forecasting models for electricity demand (unit: 100 million kWh).
Table 2. Prediction accuracy obtained by different forecasting models for electricity demand (unit: 100 million kWh).
YearActualGM(1,1)MLPGM(1,1)GPGM(1,1)Markov-chainFLNGM(1,1)N-FLNGM(1,1)
PredictedAPEPredictedAPEPredictedAPEPredictedAPEPredictedAPEPredictedAPE
19813096309603096030960309603096030960
198232803327.71.453280032800328003214.32.003374.42.88
198335193611.52.633638.03.383585.01.883585.01.883548.50.843625.03.01
198437783919.53.753888.32.923888.32.923888.32.923845.31.783893.63.06
198541184253.93.304217.12.414217.12.414217.12.414166.41.184189.41.73
198645074616.72.434573.31.474573.31.474573.31.474513.60.154523.10.36
198749855010.50.514959.40.514959.40.514959.40.514889.01.934897.41.76
198854675437.90.535377.61.635498.20.575377.61.635294.73.155320.32.68
198958655901.70.635830.70.595830.70.595830.70.595733.02.255775.01.53
199062306405.12.816321.41.476321.41.476321.41.476209.50.336276.40.74
199167756951.42.606852.81.156852.81.156852.81.156777.90.046859.81.25
199275427544.30.037428.11.517428.11.517428.11.517596.70.727545.10.04
19938426.58187.82.838050.84.468324.81.218050.84.468425.20.028319.01.28
19949260.48886.24.048724.85.789047.62.308724.85.789207.40.579144.21.25
199510,023.49644.13.789834.31.899834.31.899453.95.6810,000.40.2310,000.10.23
199610,764.310,466.72.7610,690.90.6810,690.90.6810,242.54.8510,742.70.2010,685.50.73
199711,284.411,359.50.6711,623.73.0111,095.31.6811,095.31.6811,273.80.0911,264.80.17
199811,598.412,328.46.2912,017.13.6112,017.13.6112,017.13.6111,796.71.7111,866.72.31
MAPE 2.28 2.03 1.44 2.31 0.10 0.13
199912,305.213,379.98.7313,0135.7513,013.05.7513,013.05.7512,581.92.2512,627.32.62
200013,471.414,521.27.7914,088.84.5814,088.84.5814,088.84.5813,535.10.4713,565.90.70
200114,633.515,759.87.7015,250.34.2115,250.34.2115,250.34.2114,598.40.2414,710.70.53
200216,331.517,1044.7316,503.61.0516,503.51.0516,503.51.0515,821.43.1216,040.21.78
MAPE 7.24 3.90 3.90 3.90 1.52 1.41
Table 3. Prediction accuracy obtained by multi-layer perceptron (MLP) and autoregressive integrated moving average (ARIMA) for case I.
Table 3. Prediction accuracy obtained by multi-layer perceptron (MLP) and autoregressive integrated moving average (ARIMA) for case I.
YearActualMLPN-FLNGM(1,1)
PredictedAPEPredictedAPE
199098,70393,012.65.77987,030
1991103,783107,674.63.75108,332.54.38
1992109,170116,921.07.10112,708.33.24
1993115,993122,130.45.29116,938.20.82
1994122,737125,034.61.87124,309.91.28
1995131,176126,861.33.29130,8420.25
1996138,948128,373.07.61135,615.42.40
1997137,798130,080.15.60137,554.20.18
1998132,214132,407.00.15136,059.72.92
1999133,831135,788.91.46136,387.91.92
2000138,553140,696.71.55140,826.21.65
2001143,199147,565.73.05145,273.31.45
2002151,797156,595.83.16153,016.60.81
2003174,990167,469.64.30168,529.33.69
MAPE 3.85 1.78
2004203,227179,212.111.82183,696.69.61
2005224,682190,465.415.23195,771.812.87
2006264,270200,083.024.29206,254.621.95
2007265,583207,546.221.85214,426.919.26
MAPE 18.30 15.92
Table 4. Prediction accuracy obtained by MLP and ARIMA for case II.
Table 4. Prediction accuracy obtained by MLP and ARIMA for case II.
YearActualMLPN-FLNGM(1,1)
PredictedAPEPredictedAPE
198130962947.54.8030960
198232803255.40.753374.42.88
198335193573.71.553625.03.01
198437783900.43.243893.63.06
198541184234.92.844189.41.73
198645074579.01.604523.10.36
198749854938.10.944897.41.76
198854675323.12.635320.32.68
198958655752.71.915775.01.53
199062306253.50.386276.40.74
199167756855.11.186859.81.25
199275427575.50.447545.10.04
19938426.58396.80.358319.01.28
19949260.49252.60.089144.21.25
199510,023.410,051.40.2810,000.10.23
199610,764.310,724.40.3710,685.50.73
199711,284.411,250.60.3011,264.80.17
199811,598.411,646.40.4111,866.72.31
MAPE 1.34 0.13
199912,305.211,942.12.9512,627.32.62
200013,471.412,166.39.6913,565.90.70
200114,633.512,341.115.6714,710.70.53
200216,331.512,481.523.5716,040.21.78
MAPE 12.97 1.41

Share and Cite

MDPI and ACS Style

Hu, Y.-C. Nonadditive Grey Prediction Using Functional-Link Net for Energy Demand Forecasting. Sustainability 2017, 9, 1166. https://doi.org/10.3390/su9071166

AMA Style

Hu Y-C. Nonadditive Grey Prediction Using Functional-Link Net for Energy Demand Forecasting. Sustainability. 2017; 9(7):1166. https://doi.org/10.3390/su9071166

Chicago/Turabian Style

Hu, Yi-Chung. 2017. "Nonadditive Grey Prediction Using Functional-Link Net for Energy Demand Forecasting" Sustainability 9, no. 7: 1166. https://doi.org/10.3390/su9071166

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop