1. Introduction
The conventional ordinary regression method requires very strict statistical assumptions such as linearity of variables, no multicollinearity among independent variables, homoskedasticity, reliability of measurement, error should be normally distributed and independently [
1]. All assumptions above should be provided completely to attain the best regression model. Additionally, the input information related to data quality is a highly indispensable component that should be considered for this method. However, this regression method will not be effective and is not recommended for limited data size and linguistic variables. Based on a systematic review paper, multiple linear regression, general linear regression, polynomial regression, exponential regression, and multivariate adaptive regression spline are frequently implemented for electricity load consumption forecasting models [
2].
In previous studies, some non-statistical methods, such as fuzzy regression, fuzzy autoregressive, general regression neural network, kernel regression with
-nearest neighbors, and fuzzy time series, have been integrated with ordinary regression to handle the previously-mentioned limitations [
3,
4,
5,
6]. Its applications are commonly employed for electricity forecasting [
2]. For example, one of them is the integration between fuzzy and regression methods in handling some issues like linguistic data, small-size data, and normality data. Fuzzy regression estimates parameters using the fuzzy optimization approach more effectively than the ordinary least square [
7,
8]. Some fuzzy regression methods consider the triangular fuzzy number (TFN) for data pre-processing [
9].
In each country, electricity forecasting and its models are the main components to be managed and projected by state and private companies for efficient operations of power distribution systems in supporting daily life activities [
10,
11]. The conventional models have been discussed and implemented by previous researchers to investigate electricity power distribution and its factors using conventional regression or time series. However, the highest forecasting accuracy is an arduous task since various unpredictable factors may influence electricity power distributions.
Hybrid models have been introduced to improve elements, such as forecasting accuracy and data size. Fuzzy regression is one of the hybrid model types in electricity forecasting [
12,
13,
14]. This model deals with the triangular fuzzy number (TFN) of fuzzy form data and is not strictly vital in terms of statistical assumptions [
15,
16,
17]. In this paper, time series analysis is proposed to support fuzzy regression in predicting the value of each variable (dependent and independent) by following a series of times (yearly data). Because the fuzzy regression model is suitable for estimating the significant relationship between dependent and independent variables using fuzzy parameters, it is not a recommended model to forecast future values of variables, especially time series data. Thus, an exponential smoothing model is more practical for such forecasting purposes. Essentially, there are two forecasting phases in this paper.
4. Empirical Study
In this section, the implementation of the suggested phases is attempted in two case studies as follows:
Case study A: Electricity power distribution
Step 1: Build ORM for electricity power distribution using secondary data [
26] as presented in
Table 3.
Step 2: Transform single data into symmetrical TFN forms for electricity power distribution and its factors using Sturges rule as follows:
Determine range (R) data for each dependent and independent variable.
Determine .
Determine .
Determine lower and upper limits of intervals.
Provide a distribution table.
For example, the transformation value of customer numbers is illustrated in
Figure 3.
Step 3: The estimates of fuzzy parameters presented in
Table 4 illustrate the building of fuzzy optimization.
Table 4 shows the minimization of the spread function (
) from the mid value (
) using fuzzy intervals to left-right constraints.
Step 4: Based on parameters obtained in Step 3, build FRMs as presented in
Table 5.
Table 5 shows that the left and right sides have three different FRMs, respectively. Furthermore, these models will be used for forecasting purposes using training and testing data in Step 5.
Step 5: Forecast electricity power distribution using all possible FRM as expressed in
Table 6, respectively.
Step 6: Evaluate and validate all possible FRMs using MAPE of training and testing data, respectively, as presented in
Table 7.
Based on
Table 6, scatter plots between actual and forecast values are illustrated in
Figure 4.
Step 7: Forecast electricity power distribution for 2016–2021 using the best FRM model (smallest MAPE) without intercept and exponential smoothing (ES) as presented in
Table 8 and
Table 9, respectively.
Based on
Table 8, electricity power distribution (
) is predicted using FRM right without intercept, as expressed in
Table 7. On a regular basis, a regression model is not directly practical for forecasting purposes. In this case, each variable was gathered and measured by considering time intervals (yearly time series data). Thus, they should be predicted separately using time series models such as exponential smoothing (ES). Additionally, each forecasted value was obtained from the ES model, respectively.
In the final stage, the prediction of power distribution can be substituted into FRM right model as written in Equation (7):
From this table, the predicted
values were obtained using Equation (7) and ES model. Actual
power distribution was 93,634.63 GWh in 2016. On a note, the predicted and actual values revealed immense differences because the State Electricity Company of Indonesia offered a power subsidy for the residential sector for that year. Additionally, the national championship sports of Indonesia were also conducted in 2016. Therefore, the electricity distribution exceeded the actual amount. In this case, two forecasting parts, namely parameter estimation using fuzzy regression and future amount estimation, were already taken into account using the exponential smoothing technique. Unlike some previous studies [
11,
12,
13,
14], the researchers were only concerned with the fuzzy regression part.
The State Electricity Company of Indonesia offers subsidies for their customers every year. Thus, the proposed model lacks the ability to capture the actual amount. Occasionally, the difference is also significant between forecasted and actual amounts.
Case study B: Palm oil production
By following the same steps given in Case study A, the comparison between actual and forecast values can be shown for palm oil data from January–December 2012 in
Table 10,
Table 11 and
Table 12, respectively.
5. Conclusions
In this paper, the parameters (intercept and slopes) of ordinary regression in building fuzzy linear regression were implemented. Both parameters were employed for fuzzy optimization purposes, namely objective function and left-right constraints. Furthermore, the Sturges rule was used to determine the symmetrical TFN and the number of fuzzy intervals when the total number of observations was specified.
In application, FRM without intercept was considered to capture the actual electricity data precisely. Each variable from FRM was predicted using a basic time series technique known as exponential smoothing. Therefore, two types of forecasting strategies have been employed to estimate yearly electricity power distribution in Indonesia from 2000 to 2021 and palm oil production. In this paper, we also considered the effectiveness between with and without intercepts in the forecasting models.