Fuzzy Time Series Methods Applied to (In)Direct Short-Term Photovoltaic Power Forecasting

Serrano Ardila, Vanessa María; Maciel, Joylan Nunes; Ledesma, Jorge Javier Gimenez; Ando Junior, Oswaldo Hideo

doi:10.3390/en15030845


by Vanessa María Serrano Ardila ¹, Joylan Nunes Maciel ¹,², Jorge Javier Gimenez Ledesma ¹,² and Oswaldo Hideo Ando Junior ²,³,*

¹ Latin American Institute of Technology, Infrastructure and Territory (ILATIT), Federal University of Latin American Integration (UNILA), Foz do Iguaçu 85867-000, PR, Brazil

² Research Group on Energy & Energy Sustainability (GPEnSE), Cabo de Santo Agostinho 54518-430, PE, Brazil

³ Academic Unit of Cabo de Santo Agostinho (UACSA), Federal Rural University of Pernambuco (UFRPE), Cabo de Santo Agostinho 54518-430, PE, Brazil

* Author to whom correspondence should be addressed.

Energies 2022, 15(3), 845; https://doi.org/10.3390/en15030845

Submission received: 23 December 2021 / Revised: 17 January 2022 / Accepted: 19 January 2022 / Published: 24 January 2022

(This article belongs to the Topic Solar Thermal Energy and Photovoltaic Systems)


Abstract:

Solar photovoltaic energy has grown significantly in the last decade, and so have the challenges related to the intermittency of power generation inherent to it. In this paper we propose short-term forecasting of solar PV generation using fuzzy time series (FTS). Two FTS methods are proposed and evaluated to obtain a global horizontal irradiance (GHI) forecast: the weighted method and the fuzzy information granular method. Using the direct proportionality of the power to the GHI, a spatial smoothing process was applied to obtain the spatial irradiance, on which a first-order low-pass filter was applied to simulate the power generation of a photovoltaic system. Thus, this study proposed indirect and direct forecasting of solar photovoltaic generation, which was statistically evaluated; the results showed that the indirect prediction performed better with GHI than the power simulation. Error statistics, such as RMSE and MBE, show that the fuzzy information granular method performs better than the weighted method in GHI forecasting.

Keywords:

fuzzy time series; photovoltaic energy prediction; short-term forecasting

1. Introduction

Solar photovoltaic energy (SPE) is positioned as an energy source that contributes significantly to the diversification of the world’s energy matrix. According to statistics from the International Renewable Energy Agency (IRENA), the installed capacity of photovoltaic systems (PVS) worldwide was 40 GW in 2010 and reached 714 GW in 2020, which means that approximately 94% of the capacity installed in 2020 was added in the last decade [1]. In South America, according to IRENA, the installed capacity of PVS was 43 MW in 2010 and increased to 12 GW in 2020, so that approximately 99% of the region’s 2020 capacity was added over the same period [1].

In the face of this growth, the high penetration of PVS brings several challenges. PV generation has a maximum generation limit that changes over time, on scales from seconds to years [2]; this is known as variability. In addition, this limit is not known with perfect precision, which is called uncertainty or error. The variability caused by the Earth’s movement around the sun can be predicted, whereas the variability associated with clouds is difficult to predict, as is the uncertainty that stems from the difficulty of forecasting weather conditions.

SPE generation forecasts are fundamental to facing the challenges brought by variability and uncertainty. They are also a valuable tool for the management and security of electricity grids and for the commercialization of solar energy [3]. Some works apply forecasts to improve the performance of the electricity system, treating demand and prices as the variables to be predicted [4,5,6]. Both generators and suppliers that work with PV systems require forecasts for making operational and planning decisions [5]. Moreover, because SPE depends strongly on weather conditions, its output is unstable and can affect the reliability and quality of the power grid, causing frequency and voltage fluctuations [7].

Considering this context, the main objective of this work is to apply, investigate and evaluate two fuzzy time series (FTS) methods for short-term SPE generation forecasting, using historical data from a single database obtained in Florianópolis, Santa Catarina, Brazil. The database was provided by the Photovoltaic Laboratory [8] through personal e-mail communication and can be accessed at the GitHub repository [9]. As reported in [10], there are still few studies in the literature conducted in Latin American countries.

From this main objective, the specific objectives are: (1) to compare the global horizontal irradiance (GHI) forecast accuracy of two FTS methods with different learning processes, namely the first-order weighted method with chronological weights and a higher-order method working with fuzzy information granules (FIG); (2) subsequently, to evaluate the use of FTS in a power simulation using the spatial smoothing method [11]; and (3) across the first two objectives, to compare the performance of the FTS methods over three short-term prediction horizons: 5, 15 and 30 min.

FTS stand out for allowing system flexibility by considering natural circumstances while dealing with vague and imprecise knowledge in time series data [12]. Among the possible tools, the FTS methods are applied here through the Python library pyFTS [13], which implements the steps of the method proposed by [12] for forecasting with FTS. This library is the result of work by the MINDS laboratory (Machine Intelligence and Data Science laboratory), which researches computational intelligence and machine learning, optimization, data visualization and decision making [14].

According to one of the most comprehensive literature reviews [15], significant progress has been seen in PV power generation forecasting, especially in recent years with the use of machine learning and deep learning methods. However, relatively few studies applying FTS are observed in this research field. The study described in [16] uses a database of irradiance recorded at 30 min intervals to generate forecasts with two FTS methods, showing the best performance among the forecasting methods compared. Another work developed a fuzzy logic model for short-term forecasting of solar energy production one hour ahead [17]. In [18,19], fuzzy logic is applied to short-term load forecasting. More recently, the studies [20,21] improved the FTS method with probabilistic forecasting and with the information granule method, which is proposed in order to simplify the process with multivariate models.

Several publications show the progress of FTS research. In [22], the non-stationary fuzzy time series (NSFTS) is introduced, which dynamically adapts its fuzzy sets to reflect changes in the underlying stochastic processes based on residual errors. In [23], a short-term forecasting method based on the Takagi-Sugeno (T-S) fuzzy model is applied to wind power and wind speed, and the results show that the proposed T-S fuzzy model can effectively improve the accuracy of short-term wind power forecasting. In [21], interval forecasts are generated, from which a cumulative density function can be constructed and used to build the quantile function and probabilistic forecasts through stochastic simulations or ensembles. Interval forecasts address the drawbacks of point forecasts; this is also discussed in [24], where the probability distribution problem is addressed using kernel density estimation. Stacked modules of the deep fuzzy model (DIRM-DFM) have been proposed for accurate prediction, illustrating the current progress of fuzzy models [25]. Finally, [26] presents a hyperparameter optimization method for high-order weighted FTS that uses genetic algorithms to automate the generation of accurate and parsimonious models.

The FTS method has also been used in conjunction with other forecasting methods, with interesting results. In [27], FTS and convolutional neural networks (CNN) are combined for short-term load forecasting, and good efficiency of the method is reported. Similar results are reported in [28], which proposes improving a solar forecasting model based on an artificial neural network (ANN) with fuzzy logic preprocessing. The study reported in [18] highlights the ability of FTS methods to deal with sudden variations in temperature, considering their influence on PVS, in a simple and robust way.

In this context, this study evaluates the performance of the FTS methods combined with the spatial smoothing method to solve the SPE generation forecasting problem considering the spatial dimension and characteristics of a specific photovoltaic system. For this evaluation we use the compilation made by [29] where three criteria are recommended to measure the accuracy of the model: the overall bias, the dispersion and the ability to reproduce statistical distributions. The most recommended metrics to quantify these criteria are, respectively, the mean bias error (MBE), the root mean square error (RMSE) and the Kolmogorov-Smirnov (KS) test. In addition, the coefficient of determination (R²), which expresses the fit of the predicted model data to the original data, will be used [15].

Section 2 presents the theoretical foundation: the forecasting approaches are detailed, the FTS method is presented, and it is explained how the fuzzy relationships are created and how the forecasts are generated from them. Section 3 describes the model implementation process, the database used, the choice of hyperparameters and the learning process of each method. Section 4 shows the results obtained from the irradiance predictions and the subsequent power simulation, as well as a discussion of the results. Finally, the conclusions are presented in Section 5.

2. Theoretical Background

This section details forecasting approaches and time scales and briefly reviews the different forecasting methods in the literature. The FTS method, the process of creating fuzzy relationships and the steps to generate a forecast are outlined.

2.1. Photovoltaic Solar Energy Forecast

Depending on the available data, we first define the time frame on which the forecasts will be based, i.e., the prediction horizon. In [30], the two main types of PVS generation forecasting are discussed: direct and indirect. Indirect forecasting first predicts the solar irradiance and then, using a PV performance model of the plant, obtains the energy produced; direct forecasting directly calculates the plant energy production [30]. It is also useful to distinguish forecasting techniques according to time scales, given the importance of a specific definition of the prediction horizon [31]. Two types of forecasting are specified according to the needs of this work: short-term forecasting deals with time horizons from minutes to hours, and long-term forecasting deals with time horizons from hours to a few days ahead [16].

Forecasting approaches and time scales were highlighted first because they cut across all forecasting methods. Several of the most commonly used methods, from statistical methods to those using artificial intelligence, are outlined below. Statistical methods extract relationships from previous historical data to forecast the future behavior of the plant [30]. They include the simplest methods, such as the persistence method, and the autoregressive methods compared in [32], such as ARMA and ARIMA. For statistical methods, the quality of the historical data is essential for a good forecast [15]. There are also physical methods which, unlike statistical methods, use the specific configuration of the system and meteorological information for forecasting; methods based on sky images, where cloud cover and optical depth are considered; and methods based on numerical weather prediction (NWP) [33]. Artificial intelligence (AI) based methods, such as artificial neural networks (ANN), work similarly to the learning process of a human brain: they learn the relationship between input parameters and output variables by studying previously recorded data. They do not need characteristic information about the system, which makes ANNs well suited to modeling nonlinear, dynamic, noisy and complex systems [34].

Finally, a particular emphasis is made on the fuzzy time series (FTS) method, the method used in this study. The FTS are conceptualized as being part of the branch of computational intelligence [35] and stand out for allowing system flexibility when considering natural circumstances and dealing with the vague and imprecise knowledge in time series data. In this way, it becomes possible to reduce the problem of uncertainty and variability of solar resource data [21].

2.2. Fuzzy Time Series

The theoretical construction that regards the FTS method and its development from the concepts of fuzzy logic (FL) are presented in this section.

The FTS method was introduced by [12] as a nonparametric method for time series forecasting based on fuzzy logic theory that provides a different representation of the time series. They are noted for allowing system flexibility in considering natural circumstances dealing with vague and imprecise knowledge in time series data [21].

The main difference between a conventional time series and a fuzzy time series is that the observations of a conventional time series are real numbers, while those of a fuzzy time series are fuzzy sets [12] defined on the universe of discourse. Given a time series Y ∈ ℝ¹ and its individual values y(t) ∈ Y for t = 0, 1, …, T, the universe of discourse U is delimited by the minimum and maximum values of Y, such that U = [min(Y), max(Y)], in which the fuzzy sets f_i(t) (t = 1, 2, 3, …) are defined; F(t) is the collection of the f_i and is called the fuzzy time series on Y(t) [12].

Let the universe of discourse be U = {u_1, u_2, …, u_i}, where the u_i are the possible linguistic values of U. A fuzzy set of linguistic variables A_i is then defined on U, where μ_{A_i} is the membership function of the fuzzy set A_i, such that μ_{A_i}: U → [0, 1]. If u_i is a member of A_i, then μ_{A_i}(u_i) is the degree of membership of u_i in A_i [24].
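The definitions above can be illustrated with a minimal Python sketch. It assumes an evenly spaced grid of overlapping triangular fuzzy sets over U = [min(Y), max(Y)]; the function names, the number of sets and the sample GHI values are hypothetical and are not taken from the paper or from pyFTS.

```python
def triangular(x, a, b, c):
    """Membership of x in a triangular fuzzy set with feet a, c and peak b."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def grid_partition(series, n_sets):
    """Cover U = [min(Y), max(Y)] with n_sets overlapping triangular sets."""
    lo, hi = min(series), max(series)
    step = (hi - lo) / (n_sets - 1)  # distance between adjacent peaks
    # Each fuzzy set A_i is stored as (a, b, c); neighbours overlap by one step.
    return [(lo + (i - 1) * step, lo + i * step, lo + (i + 1) * step)
            for i in range(n_sets)]


ghi = [0.0, 120.0, 450.0, 800.0, 1000.0]  # illustrative GHI values in W/m^2
sets = grid_partition(ghi, n_sets=5)
# Degree of membership of the value 450 in each fuzzy set A_0 ... A_4.
memberships = [triangular(450.0, *s) for s in sets]
```

With this partition, a value activates at most two adjacent sets and its membership degrees add up to one, which is a property of this particular grid and not a general requirement of FTS.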

2.2.1. Fuzzy Logic Relationships

With the definition of the data set representing the universe of discourse, a fuzzy logic relationship (FLR) is established that relates the input data to the estimated output data. The FLRs are defined in [12] for both first-order and higher-order models as follows. Assuming F(t) is caused only by F(t − 1), there is a fuzzy relationship between F(t) and F(t − 1), which can be expressed as:

F (t) = F (t - 1) \circ R (t, t - 1)

(1)

Here, “∘” is the max-min composition operator. The relation R is called the first-order model of F(t). If F(t) is caused by more fuzzy sets, F(t − 1), F(t − 2), …, F(t − p) (p > 0), simultaneously, the fuzzy relation is of a higher order (order p) and is represented by

F (t) = (F (t - 1) \times F (t - 2) \times \dots \times F (t - p)) \circ R (t, t - p)

(2)
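As an illustration of the first-order relation in Equation (1), the following Python sketch applies a max-min composition to a membership vector. The vector f_prev and the relation matrix R are made-up numbers chosen only to show the arithmetic; they are not values from the paper.

```python
def max_min_composition(f_prev, relation):
    """Compute F(t) = F(t-1) o R, i.e. out[j] = max_i min(f_prev[i], R[i][j])."""
    n_out = len(relation[0])
    return [
        max(min(f_prev[i], relation[i][j]) for i in range(len(f_prev)))
        for j in range(n_out)
    ]


# Hypothetical membership degrees of F(t-1) in three fuzzy sets.
f_prev = [0.2, 0.8, 0.0]
# Hypothetical fuzzy relation R(t, t-1); rows index F(t-1), columns index F(t).
R = [
    [1.0, 0.5, 0.0],
    [0.3, 1.0, 0.6],
    [0.0, 0.4, 1.0],
]
f_next = max_min_composition(f_prev, R)
```

Every output entry requires a pass over all rows of R, which gives a sense of the computational cost that grows with the number of fuzzy sets.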

The main steps proposed by [12] with the FLR generate a high computational demand for the calculation of each of the fuzzy relations. For this reason, [36] proposes an improvement of the algorithm that replaces the max-min composition with simpler arithmetic operations, creating groups of fuzzy logic relations (GFLR). Considering F(t − 1) = A_i and F(t) = A_j, the fuzzy logic relation (FLR) can be defined as A_i → A_j, where A_i and A_j are called the left-hand side and right-hand side of the FLR, respectively. The FLRs with the same left-hand side are put together in a GFLR. The left-hand sides indicate the input value of the model and the right-hand sides correspond to the estimated outputs [36].
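The grouping of FLRs by their left-hand side can be sketched in a few lines of Python. The fuzzified sequence below is invented for illustration, and the function name build_gflr is hypothetical; repeated relations are stored once, which is an assumption consistent with all FLRs having the same weight.

```python
from collections import defaultdict


def build_gflr(fuzzified):
    """Group first-order FLRs A_i -> A_j by their left-hand side A_i."""
    groups = defaultdict(list)
    # Consecutive pairs (F(t-1), F(t)) are the individual FLRs.
    for lhs, rhs in zip(fuzzified, fuzzified[1:]):
        if rhs not in groups[lhs]:  # keep each right-hand side once
            groups[lhs].append(rhs)
    return dict(groups)


# Hypothetical fuzzified series F(t).
fuzzified = ["A0", "A0", "A2", "A3", "A2", "A3", "A4"]
gflr = build_gflr(fuzzified)
```

Looking up the group of the current fuzzy set then replaces the matrix composition: the right-hand sides of that group are the candidate outputs.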

2.2.2. Stages of the FTS Algorithm

This section describes the steps for constructing the FTS models, detailing the algorithm used to obtain the solar irradiance forecasts and the subsequent simulation of the power values. Figure 1 represents the method proposed by [12] and implemented through the Python pyFTS library, with the hyperparameters and parameters relevant to the model.

Data Preprocessing

In this stage the data, which may contain missing fields or incorrect information, as well as repeated measurements, are handled and adjusted. In many time series it becomes necessary to perform such adjustments, e.g., to scale data within an interval, remove trends and/or seasonality and understand the behavior of cycles or randomness, which may interfere in the model validation processes.

The membership function, the number of partitions of the universe of discourse and the partitioning method are considered within the hyperparameters of the model. The hyperparameters of the FTS methods provide more versatility and flexibility, allowing control of the sensitivity of the fuzzification process, the number of rules generated and, mainly, the accuracy of the model [20]. The common hyperparameters of the FTS processes are shown in Table 1, where it is explained how they influence the model. Table 1 was extracted from [24].

Fuzzification of the Data

Once the partitioning scheme and the number of partitions have been defined, during the fuzzification process each element of the numerical time series Y(t) is replaced by the fuzzy set with the maximum membership value, thereby creating the fuzzy time series F(t) [13]. This step is influenced by the hyperparameter α-cut, which is the minimum membership degree to be considered in the fuzzification process.
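A minimal Python sketch of this step is given below, assuming triangular fuzzy sets on a hypothetical five-set partition of [0, 1000] W/m². The set names, the partition and the α-cut values are illustrative assumptions, not the configuration used in the paper.

```python
def triangular(x, a, b, c):
    """Membership of x in a triangular fuzzy set with feet a, c and peak b."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def memberships(value, fuzzy_sets):
    """Membership degree of value in every fuzzy set of the partition."""
    return {name: triangular(value, *abc) for name, abc in fuzzy_sets.items()}


def fuzzify(value, fuzzy_sets, alpha_cut=0.0):
    """Names of the sets activated by value with membership >= alpha_cut."""
    mu = memberships(value, fuzzy_sets)
    return [name for name, d in mu.items() if d > 0 and d >= alpha_cut]


def fuzzify_series(series, fuzzy_sets):
    """Replace each y(t) by the fuzzy set with maximum membership, giving F(t)."""
    result = []
    for y in series:
        mu = memberships(y, fuzzy_sets)
        result.append(max(mu, key=mu.get))
    return result


# Hypothetical partition of U = [0, 1000] W/m^2 into five triangular sets.
fuzzy_sets = {
    "A0": (-250.0, 0.0, 250.0),
    "A1": (0.0, 250.0, 500.0),
    "A2": (250.0, 500.0, 750.0),
    "A3": (500.0, 750.0, 1000.0),
    "A4": (750.0, 1000.0, 1250.0),
}
fuzzified = fuzzify_series([0.0, 120.0, 450.0, 800.0, 1000.0], fuzzy_sets)
```

In this sketch a higher α-cut discards weakly activated sets: the value 450 activates A1 and A2 with an α-cut of 0.1, but only A2 with an α-cut of 0.3.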

Fuzzy Rule Generation and Inference Process

The generation of fuzzy rules has a syntactic component that generates linguistic values and a semantic component that associates each linguistic term to its meaning. These rules depend on the method and its characteristics, as well as on the hyperparameters model order (Ω) and lag indices (L) [24]. The inference process depends on the chosen learning method. The WEIGHTED-FTS and FIG-FTS methods are used for the model to learn and represent the temporal patterns found in the fuzzified data. The goal of this process is to produce, from the fuzzy sets, a fuzzy forecast f(t + 1) that represents the future value being predicted, y(t + 1) [24].

The methods used are classified as multivariable, and within the group of variables that are defined the endogenous variable is the one to be predicted, also known as the target variable or dependent variable, and the other variables are the exogenous, explanatory or independent variables [24].

WEIGHTED FTS Method

This method is a first-order multivariate method proposed by [37] which applies linear chronological weights and produces more accurate forecasts than the fuzzy time series method proposed by Chen [36] where all FLRs have the same weight during the forecasting process. The method of [37] assigns appropriate weights to the fuzzy relationships. This method, as proposed, is described below.

Assume that the forecast of F(t) is A_{j1}, A_{j2}, …, A_{jk}, and that the corresponding weights w_1, w_2, …, w_k are specified for A_{j1}, A_{j2}, …, A_{jk}. The assigned weights form the weight matrix W(T) = [w'_1, w'_2, …, w'_k], which must satisfy the following condition:

\sum_{h = 1}^{k} {w^{'}}_{h} = 1

(3)

Therefore, the weights w_1, w_2, …, w_k must be standardized, and the following weight matrix is obtained:

W (T) = [{w^{'}}_{1}, {w^{'}}_{2}, \dots, {w^{'}}_{k}] = [\frac{w_{1}}{\sum_{h = 1}^{k} w_{h}}, \frac{w_{2}}{\sum_{h = 1}^{k} w_{h}}, \dots, \frac{w_{k}}{\sum_{h = 1}^{k} w_{h}}]

(4)

where w_h is the corresponding weight for A_{jh}.
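The standardization in Equations (3) and (4) can be checked with a short Python sketch. It assumes that the linear chronological weights are simply 1, 2, …, k, with the most recent relation receiving the largest weight; this choice and the function names are illustrative assumptions.

```python
def standardize(raw_weights):
    """Divide each w_h by the sum of all weights so that they add up to 1."""
    total = sum(raw_weights)
    return [w / total for w in raw_weights]


def chronological_weights(k):
    """Assumed linear chronological weights 1, 2, ..., k, then standardized."""
    return standardize(list(range(1, k + 1)))


# Three consequents A_j1, A_j2, A_j3 observed in chronological order.
weights = chronological_weights(3)
```

For k = 3 the raw weights 1, 2, 3 become 1/6, 2/6 and 3/6, so the condition in Equation (3) holds by construction.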

Fuzzy Information Granular Method

The FIG (fuzzy information granular) method is a higher-order multivariate method. It works as a wrapper that transforms the real values of a multivariate time series into a fuzzy univariate time series [20]. The resulting time series F is composed of data points f(t) ∈ F representing the sequence of fuzzy information granules G_i. Each granule contains a fuzzy set of the linguistic variable relative to each variable V_i [20]. There is a global linguistic variable, FIG, which is the union of all of the fuzzy information granules G_i; each granule, in turn, is the combination of one fuzzy set for each variable, such that G_i = {A_j^{V_i}}, ∀ V_i ∈ V, and its membership function is given by μ_{G_i} = ∩ μ_{A_j^{V_i}}, where ∩ is the minimum T-norm. The FIG set is indexed by the midpoints of its internal fuzzy sets. With the linguistic variable FIG, the fuzzification process transforms each multivariate data point y(t) ∈ Y into a granule G_i ∈ FIG such that f(t) = G_i [20].
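The membership of a multivariate data point in a granule, computed with the minimum T-norm, can be sketched in Python as follows. The two variables, the fuzzy set names and their triangular parameters are hypothetical and only illustrate the operation.

```python
def triangular(x, a, b, c):
    """Membership of x in a triangular fuzzy set with feet a, c and peak b."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def granule_membership(point, granule, fuzzy_sets):
    """Minimum T-norm over the variables: min of the per-variable memberships."""
    return min(
        triangular(point[var], *fuzzy_sets[var][set_name])
        for var, set_name in granule.items()
    )


# Hypothetical fuzzy sets, one linguistic variable per input variable.
fuzzy_sets = {
    "ghi": {"M0": (250.0, 500.0, 750.0)},
    "temp_amb": {"T2": (15.0, 20.0, 25.0)},
}
# A granule combines one fuzzy set of each variable.
granule = {"ghi": "M0", "temp_amb": "T2"}
point = {"ghi": 450.0, "temp_amb": 22.0}
mu_g = granule_membership(point, granule, fuzzy_sets)
```

Here the GHI membership is 0.8 and the temperature membership is 0.6, so the granule membership is limited by the weaker of the two.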

For both methods, in the forecasting process the input is applied to the model and the output is calculated, which is the predicted value [12]. The forecast horizon is the number of subsequent values to be predicted, i.e., the number of steps ahead of the last input values y(t − L(0)), …, y(t − L(Ω)) [20].

Defuzzification Process

In this step, the predicted fuzzy values, which are linguistic values defined on the universe of discourse, are transformed into real numbers [13]. The objective of the process is to transform f(t + 1) into an estimated numerical value y(t + 1).
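One common way to carry out this step, shown here only as a hedged sketch, is to take a weighted average of the midpoints of the fuzzy sets on the right-hand side of the matched rule. The midpoint values and set names below are invented, and the text does not state that this exact formula is the one used by each method.

```python
def defuzzify(consequents, midpoints, weights=None):
    """Weighted average of the midpoints of the consequent fuzzy sets."""
    if weights is None:
        # Equal weights when no chronological weighting is applied.
        weights = [1.0 / len(consequents)] * len(consequents)
    return sum(w * midpoints[name] for name, w in zip(consequents, weights))


# Hypothetical midpoints (W/m^2) of three consequent fuzzy sets.
midpoints = {"P6": 300.0, "M0": 400.0, "M1": 500.0}
consequents = ["P6", "M0", "M1"]

y_equal = defuzzify(consequents, midpoints)
y_weighted = defuzzify(consequents, midpoints, weights=[1 / 6, 2 / 6, 3 / 6])
```

With equal weights the estimate is the plain mean of the midpoints; with standardized chronological weights the most recent consequent pulls the estimate towards its own midpoint.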

Data Post-Processing

In this last step of the forecasting process, we run the spatial smoothing method proposed in [11] where the key point is the first-order low-pass filter and its respective pole, the value of which is a function of the area of the PVS [11].

This model is based on the direct proportionality between the power of a PVS and the incident irradiance [38], and on the fact that the variations in irradiance measured at a single point tend to be smoothed when the spatial extent of the plant is considered, a phenomenon called spatial smoothing [11]. When the filter is applied to the irradiance predicted by FTS, taking into account the area of the plant, a smoothed value called the spatial irradiance G_s(t) is obtained, which represents the irradiance over a surface of area A [11].

The application of the model, as described in [38], begins with obtaining the spatial irradiance time series G_s(t) from the values of the forecasted irradiance time series G(t):

\frac{G_{s} (t)}{G (t)} = \frac{1}{τ s + 1}

(5)

where s is the Laplace transform variable, t is the time and τ is the filter time constant, approximated from the cut-off frequency, f_c, which is obtained from the curve fit of the cut-off frequencies of the power spectra of several PV plants as a function of the area A [ha] of the system [11].

τ = \frac{\sqrt{A}}{2 π \cdot f_{c}}

(6)
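Equations (5) and (6) can be turned into a discrete-time filter for sampled data. The Python sketch below uses a backward-Euler discretization of the first-order low-pass filter, which is one possible choice and not necessarily the one used in [11]; the area, the cut-off value f_c and the sampling interval are placeholders, and the units of τ follow Equation (6) as written.

```python
import math


def time_constant(area_ha, f_c):
    """Filter time constant from Equation (6): tau = sqrt(A) / (2 * pi * f_c)."""
    return math.sqrt(area_ha) / (2.0 * math.pi * f_c)


def spatial_irradiance(ghi, dt, tau):
    """Discrete first-order low-pass filter (backward Euler) applied to G(t)."""
    alpha = dt / (tau + dt)  # smoothing factor between 0 and 1
    smoothed = [ghi[0]]      # start the filter at the first sample
    for g in ghi[1:]:
        smoothed.append(smoothed[-1] + alpha * (g - smoothed[-1]))
    return smoothed


# Placeholder values: a small plant area (ha) and an assumed cut-off value.
tau = time_constant(area_ha=0.01, f_c=0.02)
# Step in irradiance sampled every second, filtered with tau = 1 s for clarity.
g_s = spatial_irradiance([0.0, 100.0, 100.0], dt=1.0, tau=1.0)
```

The larger τ is relative to the sampling interval, the more slowly G_s(t) follows a step in G(t); for a very small plant and 5 to 30 min samples the filter has almost no effect in this sketch.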

Once the spatial irradiance is obtained, a simulated power for the system can be computed with Equation (7), as the product of the spatial irradiance and the installed power, P*, of the PVS [39]. In addition, this simulation considers that a rise in module temperature reduces its efficiency [39]. Therefore, Equation (7) includes a factor that depends on the module temperature throughout the day. This factor decreases or increases the power by 0.4% for each degree that the module temperature rises above or falls below the standard test conditions, as indicated in most PV module data sheets [39].

P_{s i m} (t) = \frac{G_{s} (t) \cdot P^{*}}{G_{S T C}} [1 - 0.004 (T (t) - T_{S T C})]

(7)

where G_STC = 1000 W/m² is the irradiance at standard test conditions, T_STC is the module (contact) temperature at standard test conditions [39], T(t) is the module temperature, G_s(t) is the spatial irradiance and G(t) is the forecasted irradiance from which G_s(t) is obtained.
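Equation (7) translates directly into a small Python function. In the sketch below, the default T_STC of 25 °C is the usual standard test condition value and is an assumption here, since the text does not state it; the example irradiance and module temperature are illustrative, while 2200 W corresponds to the 2.2 kWp system described in Section 3.

```python
def simulated_power(g_s, p_installed, t_module,
                    g_stc=1000.0, t_stc=25.0, temp_coeff=0.004):
    """Equation (7): power from spatial irradiance with a temperature factor."""
    thermal_factor = 1.0 - temp_coeff * (t_module - t_stc)
    return g_s * p_installed / g_stc * thermal_factor


# Illustrative operating point: 800 W/m^2 and a module at 45 degrees Celsius.
p_sim = simulated_power(g_s=800.0, p_installed=2200.0, t_module=45.0)
```

At this operating point the module is 20 °C above T_STC, so the thermal factor is 0.92 and the simulated power is 8% below the value given by irradiance alone.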

2.3. Statistical Analysis

Once the forecasting method is defined, this section shows how its performance is evaluated, with the intention of standardizing the results and making them comparable with other works. For this performance evaluation, we used the compilation made by [29], where three criteria are recommended to measure the accuracy of the model: the overall bias, the dispersion and the ability to reproduce statistical distributions. The metrics recommended to quantify these criteria are, respectively, the mean bias error (MBE); the normalized root mean square error (RMSE), which is used to compare models whose samples have different sizes, as indicated in [15]; and the Kolmogorov-Smirnov (KS) test, which is defined as the maximum value of the absolute difference between two cumulative probability functions [29].

The MBE (Equation (8)) and RMSE (Equation (9)) provide information about the expected range of errors in a given geography or station. Likewise, the RMSE represents well the hourly or sub-hourly dispersion of the values [29]. The Kolmogorov-Smirnov (KS) test (Equation (10)) is obtained by finding the maximum absolute difference between the cumulative frequency distributions of the modeled GHI, φ(c_i), and the observed GHI, φ(m_i) [40]. In addition, the coefficient of determination (R²) in Equation (11) is used to indicate how close the predicted data are to the test data [41].

MBE = \frac{1}{M} \sum_{i = 1}^{N} (c_{i} - m_{i})

(8)

RMSE = {[\frac{1}{M} \sum_{i = 1}^{N} {(c_{i} - m_{i})}^{2}]}^{\frac{1}{2}}

(9)

KS = M A X | φ (c_{i}) - φ (m_{i}) |

(10)

R^{2} = 1 - \frac{σ^{2} (c_{i} - m_{i})}{σ^{2} (m_{i})}

(11)

N represents the total number of data points, M refers to the mean, c_i is the i-th predicted value and m_i represents the i-th observed value [41]. In the equations, σ² represents the variance.
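The four metrics can be sketched in plain Python as follows. This sketch assumes the usual unnormalized forms of MBE and RMSE (division by the number of samples N), so the normalization used in Equations (8) and (9) may differ; the KS statistic is computed from empirical cumulative distributions, and the sample values are invented.

```python
import math
from bisect import bisect_right


def mbe(c, m):
    """Mean bias error: average of predicted minus observed values."""
    return sum(ci - mi for ci, mi in zip(c, m)) / len(m)


def rmse(c, m):
    """Root mean square error of the predictions."""
    return math.sqrt(sum((ci - mi) ** 2 for ci, mi in zip(c, m)) / len(m))


def variance(x):
    """Population variance of a list of numbers."""
    mean = sum(x) / len(x)
    return sum((v - mean) ** 2 for v in x) / len(x)


def r2(c, m):
    """Equation (11): one minus error variance over observed variance."""
    errors = [ci - mi for ci, mi in zip(c, m)]
    return 1.0 - variance(errors) / variance(m)


def ks(c, m):
    """Maximum absolute difference between the two empirical CDFs."""
    sc, sm = sorted(c), sorted(m)
    return max(
        abs(bisect_right(sc, x) / len(sc) - bisect_right(sm, x) / len(sm))
        for x in sc + sm
    )


observed = [0.0, 100.0, 200.0, 300.0]    # m_i, illustrative values
predicted = [10.0, 90.0, 210.0, 320.0]   # c_i, illustrative values
scores = {
    "MBE": mbe(predicted, observed),
    "RMSE": rmse(predicted, observed),
    "KS": ks(predicted, observed),
    "R2": r2(predicted, observed),
}
```

A positive MBE indicates that the model overestimates on average, while the KS value compares the shapes of the two distributions regardless of the pairing between samples.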

3. Materials and Methods

In this section we describe the implementation of the analytical model and tools used to perform the experimental evaluation based on the algorithm that is described in Section 2. In this sense, the experiments were conducted in order to compare the two FTS methods, WEIGHTED-FTS and FIG-FTS, through three short-term time horizons of 5, 15 and 30 min. Subsequently, as post-processing, the spatial smoothing method of [11] was used to obtain a generated power simulation.

The procedure performed is shown in Figure 2 to give a general understanding of what is addressed in this section. It should also be noted that the power simulation is a consequence of the irradiance forecast performed through two FTS methods.

3.1. Tools, Technologies and Database

The experiments were performed on a virtual machine created using Google Colab, an open source collaborative programming environment [42] and a free tool for writing and running Python code, with a graphics processing unit (GPU) provided by the Nvidia K80 platform with 12 GB of memory.

The irradiance database used to train and test the FTS model and the power data used to validate the low-pass filter model in the power simulation were extracted from a 2.2 kWp PVS operating at the “Centro de Pesquisa e Capacitação em Energia Solar Fotovoltaica da UFSC” [8], located in the city of Florianópolis/SC, Brazil, where the irradiance sensor is located, along with a Kipp & Zonen SMP22 pyranometer in horizontal configuration and a temperature and humidity sensor (PTB110 VAISALA). The database consists of 12 months, from 1 January 2018 to 31 December 2018, for model training, and 12 months, from 1 January 2019 to 31 December 2019, for testing, as shown in Table 2. The data were captured by the pyranometer every minute, and the database provided in [9] contains data filtered to 5, 15 and 30 min intervals. No missing data were found in the database.

The two years of observations are split 50/50 between training and testing. One year is used for training so that the FTS model can learn the seasonality profiles, whose behavior was the same in the test period [16].

Table 3 shows the summary of the data, describing the units of the variables treated, as well as the statistics of the database, which can be accessed in the GitHub repository [9]. In this database, the full 24 h interval is considered. This choice changes the total number of data points and the normalization of the error metrics [15], and it was made taking into account the threshold value below which solar irradiance data are excluded [15]. This means that zero values are included in the model training process and that missing fields are filled with zero during pre-processing.

Table 4 shows the variables that compose the database and the output of the model. The database is composed of four columns: data, containing the seasonality information (the date, hour and minute of each record), from which the variables min (minute), hour and month are extracted; temp_amb (ambient temperature); ghi (global horizontal irradiance), selected as the endogenous variable; and pow (power generated by the PVS) [43]. The variable temp_contact (module operating temperature) is not included in the database available for the model and is therefore calculated by considering the nominal operating temperature of the cell. The variable modeled_power (modeled power resulting from applying the low-pass filter) is the output of the method.
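The text does not give the formula used for temp_contact. A commonly used approximation based on the nominal operating cell temperature (NOCT) is sketched below as an assumption; the NOCT value of 45 °C is a placeholder that would normally come from the module data sheet.

```python
def module_temperature(temp_amb, ghi, noct=45.0):
    """NOCT approximation: T_module = T_amb + (NOCT - 20) / 800 * G."""
    # NOCT is defined at 800 W/m^2 irradiance and 20 degrees C ambient.
    return temp_amb + (noct - 20.0) / 800.0 * ghi


# Illustrative record: 25 degrees C ambient and 800 W/m^2 of irradiance.
temp_contact = module_temperature(temp_amb=25.0, ghi=800.0)
```

At night, with zero irradiance, this approximation returns the ambient temperature, so the thermal factor of the power simulation depends only on temp_amb in those records.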

Solar irradiance has two seasonality components: annual and daily. These two components can be extracted from the data column. The study conducted in [44] demonstrated that the temperature variable exhibits the highest Spearman correlation in relation to GHI among the several meteorological variables that were analyzed. For this reason, only the ambient temperature is considered, since it interferes with the performance of the solar modules.
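Extracting the seasonal variables from the timestamp column can be sketched with the Python standard library. The timestamp format string below is an assumption, since the layout of the data column is not specified in the text; the dictionary keys follow the variable names min, hour and month used for the database.

```python
from datetime import datetime


def seasonal_features(timestamp, fmt="%Y-%m-%d %H:%M:%S"):
    """Split a timestamp string into the min, hour and month variables."""
    ts = datetime.strptime(timestamp, fmt)
    # hour and min carry the daily cycle, month carries the annual cycle.
    return {"min": ts.minute, "hour": ts.hour, "month": ts.month}


features = seasonal_features("2018-01-01 12:35:00")
```

Each of the three variables can then be partitioned into fuzzy sets in the same way as the irradiance and temperature variables.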

3.2. Method and Experimental Setup

This section presents the selected hyperparameters, the process for creating the fuzzy rules and the model learning process, as well as the GHI forecast and the subsequent power simulation.

3.2.1. Hyperparameters

After the choice of the database on which the irradiance forecast is made, the hyperparameters are defined. Two things stand out, the first of which is the fact that the choice of their value is empirical and data dependent [21]. The second refers to the fact that the model is multivariable, so for each variable it is necessary to define the hyperparameters to train the model [21]. The membership function is chosen as reported in the literature, as in [45]. However, it is noted that the real impact of the membership function on accuracy is low, as demonstrated in [24].

Table 5 shows the values selected for each variable. The exogenous variables were assigned the same α-cut and the same membership function (triangular). For the number of partitions, the time-related variables follow their natural time divisions, and for temperature we selected a number of partitions close to its range of variation. For the endogenous variable, we experimented with a different membership function (Gaussian) in order to better capture the transition between the fuzzy sets; likewise, the minimum degree of membership, α-cut, is slightly lower than for the other variables, in an attempt to include more values in the fuzzification process.

As mentioned above, the selection of these hyperparameters comes from heuristic knowledge of the process. The basis for the choice of the different values was also taken with the support of works on solar irradiance forecasting by FTS, as in [21,24,45].

The graphical representation of the membership function chosen for the variables is shown in Figure 3.

The graph of a triangular MF is shown for the exogenous variables and, in the case of the GHI variable—the endogenous variable, which is the variable to be predicted—the Gaussian MF is used and the specific linguistic values that are used are shown.
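To make the roles of the membership function and the α-cut concrete, the sketch below implements a triangular and a Gaussian membership function and a fuzzification step that discards memberships below the α-cut. It is a standalone illustration, not the pyFTS implementation; the partition centers, widths and set names are hypothetical.

```python
import math


def triangular(x, a, b, c):
    """Triangular membership: 0 at a and c, 1 at the peak b."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def gaussian(x, mean, sigma):
    """Gaussian membership: 1 at the mean, decaying smoothly on both sides."""
    return math.exp(-0.5 * ((x - mean) / sigma) ** 2)


def fuzzify(x, fuzzy_sets, alpha_cut):
    """Keep only the sets whose membership reaches the alpha-cut."""
    memberships = {name: mf(x) for name, mf in fuzzy_sets.items()}
    return {name: mu for name, mu in memberships.items() if mu >= alpha_cut}


# Hypothetical temperature partitions centered at 10, 20 and 30 deg C.
temp_sets = {f"T{c}": (lambda x, c=c: triangular(x, c - 10, c, c + 10))
             for c in (10, 20, 30)}

# Hypothetical irradiance partitions centered at 300 and 360 W/m2.
ghi_sets = {f"GHI{m}": (lambda x, m=m: gaussian(x, m, 30.0))
            for m in (300, 360)}

temp_hits = fuzzify(22.0, temp_sets, alpha_cut=0.30)   # T30 (0.2) is cut off
ghi_hits = fuzzify(330.0, ghi_sets, alpha_cut=0.25)    # both sets survive
```

In this toy case the lower α-cut combined with the wider Gaussian tails keeps two irradiance sets active for one value, which mirrors the stated intent of including more values during fuzzification.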

3.2.2. Fuzzy Rules, Learning and Forecasting

The partitions of the endogenous variable (irradiance) are separated by levels of linguistic values, PP–‘very small’, P–‘small’, M–‘medium’, G–‘large’, GG–‘very large’; each of the linguistic values with seven sub-levels. These will generate rules, as per the example in [45]:

PP5, P6 → P6, M0, M1

This rule can be read as: IF y(t − 1) IS Very Small (Sub-Level 5) AND y(t) IS Small (Sub-Level 6) THEN y(t + 1) will be Small (Sub-Level 6) OR Medium (Sub-Level 0) OR Medium (Sub-Level 1).
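The 35 irradiance partitions listed in Table 5 are consistent with five linguistic levels of seven sub-levels each. A minimal sketch of how such labels could be enumerated, assuming sub-levels are numbered 0 to 6 (as the labels M0 and P6 in the example rule suggest); the variable names are illustrative:

```python
# Five linguistic levels, ordered from 'very small' to 'very large'.
LEVELS = ["PP", "P", "M", "G", "GG"]
SUBLEVELS = 7  # assumed to be indexed 0..6

# One label per fuzzy set, ordered from the lowest to the highest irradiance.
labels = [f"{level}{sub}" for level in LEVELS for sub in range(SUBLEVELS)]

# The neighbors of the level boundary between 'very small' and 'small'.
boundary = labels[5:8]
```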

The rules of the two models described in this paper, the first-order WEIGHTED-FTS (Ω = 1) and the second-order FIG-FTS (Ω = 2) with two nearest neighbors (κ = 2), result from the learning process of each model on the training data.

As represented in Figure 4, the rules generated by the WEIGHTED-FTS (Order 1) and FIG-FTS (Order 2) models have the format Precedent → Consequent, which comes from a temporal pattern indicating two fuzzy sets that appeared in sequence in the created fuzzy time series [45]. The precedent indicates a set at time t and the consequent indicates the set that appeared at time t + 1 [45].

In Figure 4 the difference of the order is shown, along with how the information granules of the FIG method work and how the order of each of the methods is manifested. Recall that the order is the number of lags (past values) that are used by the models, i.e., how much past information is necessary to describe future events [45].
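The effect of the order on the rule base can be sketched with a toy learner that maps every window of Ω consecutive labels to the set of labels observed next. This is a simplification for illustration: each observation is reduced to a single label, the information granules of FIG-FTS are not modeled, and the function name and toy series are hypothetical.

```python
from collections import defaultdict


def learn_rules(fuzzy_series, order):
    """Map each tuple of `order` consecutive labels to the labels that followed it."""
    rules = defaultdict(set)
    for i in range(order, len(fuzzy_series)):
        precedent = tuple(fuzzy_series[i - order:i])  # the last `order` lags
        rules[precedent].add(fuzzy_series[i])         # the label seen at t + 1
    return dict(rules)


# Toy fuzzified irradiance series (one label per time step).
series = ["PP5", "P6", "P6", "PP5", "P6", "M0", "PP5", "P6", "M1"]

first_order = learn_rules(series, order=1)
second_order = learn_rules(series, order=2)

# second_order[("PP5", "P6")] holds the consequents P6, M0 and M1,
# i.e. the rule PP5, P6 -> P6, M0, M1.
```

With order 1 the precedent P6 alone leads to four different consequents, while the order 2 precedent (PP5, P6) narrows them to three, at the cost of a larger number of distinct precedents to learn.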

The differences between the WEIGHTED-FTS method (which is of order 1 and performs chronological weightings for the result) and the information granules generated in FIG-FTS (which is a higher order method, both multivariate) are highlighted. Once the learning process of the models is finished, the forecasting process is performed, and here the data from the database corresponding to the test data are used.
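A chronological weighting can be sketched as follows, in the spirit of the weighted FTS model of [37]: the consequents of a rule are kept in the order in which they occurred, and later occurrences receive linearly larger weights before defuzzification. The set midpoints, the function name and the linear weight scheme are assumptions for illustration and may differ from the pyFTS implementation used in this work.

```python
def weighted_forecast(consequents, midpoints):
    """Defuzzify a chronologically ordered list of consequent labels.

    The h-th consequent (1-based, oldest first) gets weight h / (1 + 2 + ... + k),
    so recent transitions count more. Repeated labels are kept on purpose.
    """
    k = len(consequents)
    total = k * (k + 1) / 2
    return sum((h / total) * midpoints[label]
               for h, label in enumerate(consequents, start=1))


# Hypothetical midpoints (W/m2) of three irradiance fuzzy sets.
midpoints = {"P6": 300.0, "M0": 360.0, "M1": 420.0}

# Consequents observed after one precedent, oldest to newest.
history = ["P6", "M0", "M0", "M1"]

forecast = weighted_forecast(history, midpoints)
plain_mean = sum(midpoints[label] for label in history) / len(history)
```

In this toy case the weighted forecast lies above the plain mean because the most recent consequent (M1) has the highest midpoint and the largest weight.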

Finally, Table 6 shows the experimental setup applied with each of the methods, as well as the respective inputs and outputs.

3.3. Low Pass Filter and Power Simulation

According to [46], photovoltaic power is approximately proportional to a spatial irradiance profile. Thus, once the irradiance forecast is obtained, we compute a spatial irradiance profile considering the specifications of the PVS, which has 20 modules of 110 Wp (2.2 kWp in total) organized in four strings of five modules and a 2.5 kW ABB UNO power inverter. The filter is then applied by means of the butter function of the Python SciPy library. Finally, by applying Equation (7), a simulated power value is obtained for each forecast value according to the prediction horizon.
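The post-processing chain can be sketched in two steps: smooth the irradiance with a first-order low-pass filter, then scale it to power. The sketch below deliberately swaps the SciPy butter design for a hand-written first-order recursion so that it has no dependencies, and it uses a common linear temperature-corrected power model as a stand-in for Equation (7), which is not reproduced in this section. The time constant, the temperature coefficient gamma and the function names are hypothetical; the 2200 W peak power corresponds to the 2.2 kWp system.

```python
def low_pass_first_order(samples, dt_s, tau_s):
    """Discrete first-order low-pass (backward-Euler form of tau*dy/dt + y = x)."""
    alpha = dt_s / (tau_s + dt_s)
    y = samples[0]  # start at the first sample to avoid a start-up transient
    out = []
    for x in samples:
        y = y + alpha * (x - y)
        out.append(y)
    return out


def simulated_power(ghi, temp_module_c, p_peak_w=2200.0, ghi_stc=1000.0,
                    gamma=-0.004):
    """Assumed model: P = P_peak * (G / G_STC) * (1 + gamma * (T_module - 25))."""
    return p_peak_w * (ghi / ghi_stc) * (1.0 + gamma * (temp_module_c - 25.0))


# A step in forecast irradiance is smoothed before conversion to power.
ghi_forecast = [0.0, 100.0, 100.0]
ghi_smooth = low_pass_first_order(ghi_forecast, dt_s=1.0, tau_s=1.0)
power = [simulated_power(g, temp_module_c=25.0) for g in ghi_smooth]
```

Because gamma is negative, a hotter module yields less power for the same irradiance, which is the efficiency loss attributed to the contact temperature.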

4. Results and Discussion

This section shows the results obtained from the irradiance predictions and subsequent power simulations. The experimental analysis was carried out with two forecasting methods, WEIGHTED-FTS and FIG-FTS, with three short-term prediction horizons of 5, 15 and 30 min, as shown in Table 6. In each method, the month, day, hour, minute and ambient temperature are introduced as exogenous variables and the Global Horizontal Irradiance (GHI) as the endogenous variable. Subsequently, spatial smoothing is applied to the irradiance forecast, using the technical and physical specifications of the PVS, to obtain the generated power simulation. In the Google Colab environment, the average training time varied from 10 to 20 min, depending on the prediction horizon.

4.1. Statistical Errors

The experiments were performed based on the concepts and steps described in Section 2 and the method presented in Section 3. The prediction errors of the two FTS models are gathered in Table 7.

The following statistical metrics were used in this study: MBE to quantify the overall bias and detect whether the model is producing an overestimation (MBE > 0) or underestimation (MBE < 0) of the predicted data and capture the overall trends, rather than variability [15]; RMSE (normalized–nRMSE) as the uncertainty measure [29] that is also very sensitive to large individual errors and captures variability rather than general trends [15]; and the coefficient of determination (R²) which expresses the fit of the predicted model data relative to the original data [41]. In addition, the Kolmogorov-Smirnov (KS) test shows the ability to reproduce statistical distributions [29]. Finally, the KS test assumes that the higher the number of experimental data, the closer the modeled distribution is to the measured distribution [29].
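The metrics above can be computed as in the following sketch. The sign convention (predicted minus observed) follows the statement that MBE > 0 indicates overestimation. The normalization used for nRMSE is not stated in this section, so division by the mean of the observed values is an assumption, as are the function and key names.

```python
import math


def error_metrics(observed, predicted):
    """Return MBE, RMSE, nRMSE and R2 for paired observed/predicted values."""
    n = len(observed)
    errors = [p - o for o, p in zip(observed, predicted)]
    mean_obs = sum(observed) / n

    mbe = sum(errors) / n                      # > 0 means overestimation
    ss_res = sum(e * e for e in errors)        # residual sum of squares
    rmse = math.sqrt(ss_res / n)
    nrmse = rmse / mean_obs                    # assumption: normalized by the mean
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    r2 = 1.0 - ss_res / ss_tot

    return {"MBE": mbe, "RMSE": rmse, "nRMSE": nrmse, "R2": r2}


metrics = error_metrics([100.0, 200.0, 300.0], [110.0, 190.0, 310.0])
```

In this toy case the errors are +10, −10 and +10, so the bias is small (MBE ≈ 3.33) while the RMSE is 10: the MBE captures the overall trend, whereas the RMSE reflects the size of the individual errors.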

Table 7 shows the results of the error statistics. Part A of the table shows the irradiance forecast (indirect prediction), in order to compare the accuracy of the two FTS methods, and in part B the metrics shown are used to evaluate the use of FTS in a power simulation (direct prediction) using the spatial smoothing method. The analysis of the different time horizons in each of the FTS methods used is also shown in the two parts of Table 7.

For the KS statistical test, a significance level of α = 0.05 is adopted, from which a critical value (Vc) is established for each of the forecast horizons, considering the sample size that each one offers [40]. The test checks whether both samples have the same distribution. The null hypothesis (H0) states that both samples have the same distribution; the alternative hypothesis (H1) states that they are different, in which case the null hypothesis is rejected [40]. For KS < Vc, the modeled frequency distribution is similar to the measured frequency distribution and the null hypothesis is accepted; for KS > Vc, the alternative hypothesis holds, i.e., the measured frequency distribution is not consistent with the modeled distribution, which indicates a poor fit of the predicted data [29].
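The decision rule can be illustrated with a two-sample KS statistic computed as the largest gap between the two empirical cumulative distribution functions. The formula used for Vc is not given in this section; the sketch assumes the textbook large-sample approximation for two samples, with the coefficient 1.358 corresponding to α = 0.05, so its critical values need not match those in Table 8. Function names are illustrative.

```python
import math
from bisect import bisect_right


def ks_statistic(sample_a, sample_b):
    """Largest vertical gap between the two empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    d = 0.0
    for x in a + b:
        cdf_a = bisect_right(a, x) / len(a)  # share of a that is <= x
        cdf_b = bisect_right(b, x) / len(b)  # share of b that is <= x
        d = max(d, abs(cdf_a - cdf_b))
    return d


def ks_critical_value(n, m, c_alpha=1.358):
    """Assumed large-sample critical value; c_alpha = 1.358 for alpha = 0.05."""
    return c_alpha * math.sqrt((n + m) / (n * m))


def same_distribution(sample_a, sample_b):
    """True when H0 is accepted, i.e. KS < Vc."""
    vc = ks_critical_value(len(sample_a), len(sample_b))
    return ks_statistic(sample_a, sample_b) < vc


observed = [float(v) for v in range(100)]
shifted = [v + 60.0 for v in observed]       # clearly different distribution

accept_identical = same_distribution(observed, observed)
accept_shifted = same_distribution(observed, shifted)
```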

The normality test applied to all samples obtained in both the GHI forecast and the subsequent power simulation is observed in Table 8. The KS test determines whether two data sets differ significantly [40]. This test compares the distribution of a set of predicted data with a distribution of observed data and evaluates the differences [40].

4.2. Analysis and Discussion

In this section, the statistical analyses and comparisons of the prediction errors for the two FTS models applied in this work are presented, considering the indirect prediction for the solar irradiance forecast and direct prediction for the generated power simulation.

Initially, as stated in the specific objectives of this work, the accuracy of the global horizontal irradiance forecast of the two FTS methods is evaluated and compared. The FIG-FTS method shows better results at the 5 and 15 min time horizons, whereas at the 30 min time horizon the WEIGHTED-FTS method has better results, as shown in Table 7.

The higher order FIG-FTS method performs better when working with smaller time scales, since FIG-FTS performs multivariate forecasting, acting as a multiple-input multiple-output (MIMO) method, in which all variables are both targets and explanatory variables [24]. For this reason, it is considered that in order to perform forecasting with multivariate methods it is necessary to have a database with the minimum redundancy and as much detail as possible, and that the use of order 1 methods, such as WEIGHTED-FTS, is preferable when the database has a larger time scale, which implies a smaller number of data.

The KS test determines whether two sets of data differ significantly [40]. As detailed in Table 8, the KS values are lower for the FIG-FTS method at the 5 and 15 min time horizons, i.e., the FIG-FTS method is better able to reproduce the observed frequency distributions at smaller time scales. Table 8 also shows that the power simulation is not able to reproduce the statistical distributions of the data, and as such the table shows the rejection of the null hypothesis.

Figure 5 exemplifies one day of forecasts, with the test data for 3 May 2019, for the two FTS methods and the three prediction horizons employed.

Figure 6 compares the accuracy of the two FTS methods for the global horizontal irradiance forecast by prediction horizon, using the coefficient of determination (R²). The comparison shows the better performance of the FIG-FTS method at the 5 and 15 min time horizons, whereas the WEIGHTED-FTS method achieves a better R² at the 30 min time horizon.

The evaluation of the use of FTS in a power simulation using the spatial smoothing method, presented in Section 2 (stage f) of the post-processing of the GHI forecast with FTS, is now presented.

The power was calculated from the irradiance forecasts that were performed with the WEIGHTED-FTS and FIG-FTS methods. The error metrics corresponding to this part of the process are also shown in Table 7, part B. It is observed that the power results obtained from the forecasts made with the order 2 FIG-FTS method show better performance than the results obtained with the order 1 method. However, it should be noted that a higher model order does not necessarily imply better performance. The higher the order, the more fuzzy sets are generated, and too many fuzzy sets lead to overfitting, causing the model to start learning noise from the data; similarly, a lower order leads to underfitting due to oversimplification of the signal [45]. For the proper choice of the order it is possible to perform hyperparameter optimization, as shown in [26].

It is also observed in part B of Table 7 that the RMSE and MBE values are higher than those of the GHI forecasts. This is explained by the fact that the error of the GHI prediction is compounded by the error introduced by the filter in the power simulation and by the contact temperature correction factor, which decreases the efficiency of the photovoltaic module, as shown in Equation (7). It is relevant to highlight that no studies jointly evaluating direct and indirect predictions with FTS-based methods at the short-term horizons of this study were found in the literature.

Figure 7 shows an example of the power simulation, in which the FIG-FTS method was used to simulate one day (3 May 2019) of generated power for the 5 min prediction horizon. The figure shows the GHI value obtained with the FIG-FTS method, from which the power simulation is performed, together with the real observed power value against which the results are compared. As already mentioned, the generated power values are simulated from the GHI forecast using the spatial smoothing method and then passing the result through the first-order low-pass filter [11].

The error metrics corresponding to this part of the process that is presented in this work are shown in Table 7. The trend of better results for the FIG method and the 5 min time horizon is shown when analyzing the MBE and RMSE. The same results are obtained in the coefficient of determination analysis applied to the simulation of the generated power, as shown in Figure 8.

Finally, as shown in Table 7, for irradiance forecasts, as the time scale decreases, the accuracy of the method increases. We see that the RMSE and MBE metrics yield better results in the 5 min horizon compared to the 15 and 30 min horizons. These results are expected, since the forecast error measured by nRMSE increases with an increasing forecast time scale [15].

This section presented the comparative performance results of two FTS-based short-term PV power generation prediction methods considering three short-term prediction horizons, including the comparative performance analysis between indirect and direct predictions with FTS.

5. Conclusions

This study developed two multivariable fuzzy time series (FTS) methods and evaluated their use in the indirect prediction of short-term photovoltaic power generation. In addition, the application of the FTS methodology added to the spatial smoothing for power simulation (direct prediction) allows for a controlled experimental setup, enabling the monitoring of the whole process, mainly how the learning of each of the models occurs and the creation of the fuzzy rules, as shown in Figure 4.

Each of the methods used, WEIGHTED-FTS and FIG-FTS, was assigned the GHI as the endogenous variable and the minute, hour, day, date and ambient temperature as exogenous variables, and was evaluated over the forecast horizons of 5, 15 and 30 min. Although there are works where the FTS methods are applied, a contribution of this study is to have used the spatial smoothing method with the application of a low-pass filter, added to the method as post-processing of the data, to simulate power values where the particular characteristics of the PVS are considered. For analysis purposes, data obtained from the 2.2 kWp PVS of the solarimetric station installed in the Photovoltaic Laboratory of the Federal University of Santa Catarina (UFSC), located in the city of Florianópolis, Brazil, were used.

In the comparative analysis of both methods, it was found that the higher-order FIG-FTS method provides better results in the GHI forecast at the 5 and 15 min horizons, which can be perceived in the statistical results of Table 7 and in the analysis of the KS statistic, which represents the model’s capacity to reproduce the cumulative distribution function of the observed data. However, the GHI values predicted with the FIG method at the 30 min horizon (the widest of the horizons tested) present underestimation, i.e., MBE < 0. At the 30 min time horizon, the best statistics are observed when applying the WEIGHTED-FTS method, as can be seen in Figure 6. This indicates that first-order methods are more effective for longer prediction horizons.

Once the power simulation was performed with the low pass filter, considering the PVS specifications, it was evidenced that the coupling of the FIG method results in better statistical indexes. It can also be perceived from Tables 7 and 8 that shorter time horizons, such as 5 min, improve the values obtained from statistics such as the RMSE and the KS test. The statistical analysis shows higher error values, as well as the inability to reproduce the statistical distribution of the samples, in the data obtained in the power simulation. This is due to the fact that the application of the low pass filter, by smoothing the fast irradiance peaks, also eliminates irradiance values with large variations typical of this type of measurement.

Although the higher order method has better results, it is necessary to emphasize that increasing the order value of the method indiscriminately does not imply an increase in its performance. This is because the higher the order, the more fuzzy sets will be generated, and too many fuzzy sets generate an overfitting, causing the model to start learning the noise of the data; similarly, lower order sets generate an underfitting, due to the oversimplification of the signal [45].

It is suggested, based on the results, that both FTS methods applied can be used in PV energy generation forecasting in the evaluated short-term horizons, with the best accuracy depending on the prediction horizon. In addition, the direct prediction produced higher errors than the indirect prediction for both FTS methods analyzed. It is highlighted that the implementation of the FTS through the Python library, pyFTS, is a reliable process since it allows access to its source code. In this work, point forecasting is used but its performance can be improved by including, in addition to hyperparameter optimization, interval prediction, with which the fuzzy method can be heuristically simple and fast without generating a large computational demand for the probability density function [24]. Furthermore, to ensure good performance of the FTS forecasting process, the use of a database with good integrity is required. The database provided by the Photovoltaic Laboratory of the Federal University of Santa Catarina was of vital importance.

Finally, another original contribution of this study is that the prediction results follow two simultaneous approaches, direct and indirect prediction, a combination that has not been observed in the solar photovoltaic energy forecasting literature [15,30,47]. In addition, this study allowed the evaluation of two FTS methods with different experimental setups and their performance when used together with spatial smoothing applied with the low-pass filter. Future work will be directed at comparing the accuracy of FTS methods with benchmark models and other machine learning methods, as well as forecasting with FTS hyperparameter optimization [26] and probabilistic forecasts considering the seasonal components of the input variables [21].

Author Contributions

Conceptualization: V.M.S.A. and O.H.A.J.; investigation and simulation: V.M.S.A. and J.N.M.; writing and final editing: V.M.S.A., J.N.M., J.J.G.L. and O.H.A.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by PRPPG of the Federal University of Latin American Integration (UNILA). O.H.A.J. was funded by the Brazilian National Council for Scientific and Technological Development (CNPq), grant numbers 407531/2018-1 and 303293/2020-9.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

The authors would like to thank the Federal University of Latin American Integration (UNILA) for financial support and facilities, and the Coordination for the Improvement of Higher Education Personnel (CAPES) and the Brazilian Council for Scientific and Technological Development (CNPq) for financial support.

Conflicts of Interest

The authors declare no conflict of interest.

Nomenclature

ANN	Artificial Neural Network
CNN	Convolutional Neural Networks
FIG	Fuzzy Information Granules
FL	Fuzzy Logic
FLR	Fuzzy Logic Relationship
FTS	Fuzzy Time Series
GFLR	Groups of Fuzzy Logic Relations
GHI	Global Horizontal Irradiance
GHI_STC	Global Horizontal Irradiance at Standard Test Conditions
GW	Gigawatt
IRENA	International Renewable Energy Agency
kNN	k-Nearest Neighbor
KS	Kolmogorov-Smirnov
MBE	Mean Bias Error
MF	Membership Function
MINDS	Machine Intelligence and Data Science Lab
MW	Megawatt
nRMSE	Normalized Root Mean Square Error
NSFTS	Non-Stationary Fuzzy Time Series
NWP	Numerical Weather Prediction
PV	Photovoltaic
PVS	Photovoltaic Systems
R²	Coefficient of Determination
RMSE	Root Mean Square Error
SPE	Solar Photovoltaic Energy
U	Universe of Discourse
Vc	Critical Value

References

1. IRENA, International Renewable Energy Agency. Renewable Power Generation Costs in 2014. 2014. Available online: http://www.irena.org/ (accessed on 26 November 2020).
2. Pelland, S.; Remund, J.; Kleissl, J.; Oozeki, T.; De Brabandere, K. Photovoltaic and solar forecasting: State of the art. IEA PVPS Task 2013, 14, 1–36.
3. Nwaigwe, K.; Mutabilwa, P.; Dintwa, E. An overview of solar power (PV systems) integration into electricity grids. Mater. Sci. Energy Technol. 2019, 2, 629–633.
4. Shah, I.; Iftikhar, H.; Ali, S.; Wang, D. Short-Term Electricity Demand Forecasting Using Components Estimation Technique. Energies 2019, 12, 2532.
5. Lisi, F.; Shah, I. Forecasting next-day electricity demand and prices based on functional models. Energy Syst. 2019, 11, 947–979.
6. Bibi, N.; Shah, I.; Alsubie, A.; Ali, S.; Lone, S.A. Electricity Spot Prices Forecasting Based on Ensemble Learning. IEEE Access 2021, 9, 150984–150992.
7. Ziadi, Z.; Taira, S.; Oshiro, M.; Funabashi, T. Optimal Power Scheduling for Smart Grids Considering Controllable Loads and High Penetration of Photovoltaic Generation. IEEE Trans. Smart Grid 2014, 5, 2350–2359.
8. F. UFSC. Fotovoltaica—Grupo de Pesquisa Estratégica em Energia Solar Fotovoltaica. 2021. Available online: https://fotovoltaica.ufsc.br/sistemas/fotov (accessed on 30 August 2020).
9. Vanessa Maria Carolina Serrano Ardila. GitHub Repository Dataset. 2021. Available online: https://github.com/vannserr/Fuzzy-Time-Series-Methods-Applied (accessed on 22 December 2021).
10. Maciel, J.N.; Wentz, V.H.; Ledesma, J.J.G.; Junior, O.H.A. Analysis of Artificial Neural Networks for Forecasting Photovoltaic Energy Generation with Solar Irradiance. Braz. Arch. Biol. Technol. 2021, 64.
11. Marcos, J.; Marroyo, L.; Lorenzo, E.; Alvira, D.; Izco, E. From irradiance to output power fluctuations: The PV plant as a low pass filter. Prog. Photovoltaics Res. Appl. 2011, 19, 505–510.
12. Song, Q.; Chissom, B.S. Fuzzy time series and its models. Fuzzy Sets Syst. 1993, 54, 269–277.
13. MINDS. pyFTS Quick Start—pyFTS 1.6 Documentation. Available online: https://pyfts.github.io/pyFTS/build/html/quickstart.html#what-are-fuzzy-time-series-fts (accessed on 6 September 2020).
14. MINDS. Machine Intelligence and Data Science Lab. Engineering School, UFMG. 2021. Available online: https://minds.eng.ufmg.br/ (accessed on 6 September 2021).
15. Blaga, R.; Sabadus, A.; Stefu, N.; Dughir, C.; Paulescu, M.; Badescu, V. A current perspective on the accuracy of incoming solar energy forecasting. Prog. Energy Combust. Sci. 2018, 70, 119–144.
16. Severiano, C.A.; Silva, P.C.L.; Sadaei, H.J.; Guimaraes, F.G. Very short-term solar forecasting using fuzzy time series. In Proceedings of the 2017 IEEE international conference on fuzzy systems, Naples, Italy, 9–12 July 2017; pp. 1–6.
17. Chugh, A.; Chaudhary, P.; Rizwan, M. Fuzzy logic approach for short term solar energy forecasting. In Proceedings of the 2015 Annual IEEE India Conference (INDICON), New Delhi, India, 17–19 December 2015; pp. 1–6.
18. Hong, T.; Wang, P. Fuzzy interaction regression for short term load forecasting. Fuzzy Optim. Decis. Mak. 2013, 13, 91–103.
19. Etemadi, A.H.; Davison, E.J.; Iravani, R. A Decentralized Robust Control Strategy for Multi-DER Microgrids—Part II: Performance Evaluation. IEEE Trans. Power Deliv. 2012, 27, 1854–1861.
20. e Silva, P.C.; Severiano, C.A.; Alves, M.A.; Cohen, M.W.; Guimarães, F.G. A New Granular Approach for Multivariate Forecasting. In Latin American Workshop on Computational Neuroscience; Springer Nature: Cham, Switzerland, 2019; pp. 41–58.
21. Silva, P.; Alves, M.A.; Junior, C.A.S.; Vieira, G.L.; Guimarães, F.G.; Sadaei, H.J. Probabilistic Forecasting with Seasonal Ensemble Fuzzy Time-Series. In Proceedings of the XIII Brazilian Congress on Computational Intelligence, Rio de Janeiro, Brazil, 30 October–1 November 2017.
22. Silva, P.; Severiano, C.A.; Alves, M.A.; Silva, R.; Cohen, M.W.; Guimarães, F.G. Forecasting in non-stationary environments with fuzzy time series. Appl. Soft Comput. 2020, 97, 106825.
23. Liu, F.; Li, R.; Dreglea, A. Wind Speed and Power Ultra Short-Term Robust Forecasting Based on Takagi–Sugeno Fuzzy Model. Energies 2019, 12, 3551.
24. Silva, P. Scalable Models For Probabilistic Forecasting With Fuzzy Time Series. Ph.D. Dissertation, Federal University of Minas Gerais, Belo Horizonte, Brazil, 2019.
25. Li, C.; Zhou, C.; Peng, W.; Lv, Y.; Luo, X. Accurate prediction of short-term photovoltaic power generation via a novel double-input-rule-modules stacked deep fuzzy method. Energy 2020, 212, 118700.
26. Lucas, P.D.; Silva, P.D.; Guimarães, F.G. Otimização Evolutiva de Hiperparâmetros para Modelos de Séries Temporais Nebulosas. In Proceedings of the 14 Simpósio Brasileiro de Automação Inteligente, Ouro Preto, Brazil, 27–30 October 2019.
27. Sadaei, H.J.; Silva, P.C.d.e.; Guimarães, F.G.; Lee, M.H. Short-term load forecasting by using a combined method of convolutional neural networks and fuzzy time series. Energy 2019, 175, 365–377.
28. Sivaneasan, B.; Yu, C.; Goh, K. Solar Forecasting using ANN with Fuzzy Logic Pre-processing. Energy Procedia 2017, 143, 727–732.
29. Paik, J.K.; Thayamballi, A.K. Ship-Shaped Offshore Installations; Cambridge University Press: Cambridge, UK, 2007.
30. Antonanzas, J.; Osorio, N.; Escobar, R.; Urraca, R.; Martinez-De-Pison, F.J.; Antonanzas-Torres, F. Review of photovoltaic power forecasting. Sol. Energy 2016, 136, 78–111.
31. Maciel, J.N.; Ledesma, J.J.G.; Junior, O.H.A. Forecasting Solar Power Output Generation: A Systematic Review with the Proknow-C. IEEE Lat. Am. Trans. 2021, 19, 612–624.
32. Bartres, N. Trabajo Fin de Grado. Zaguan. Unizar. Es 2019, 2021, 43.
33. Wan, C.; Zhao, J.; Song, Y.; Xu, Z.; Lin, J.; Hu, Z. Photovoltaic and solar power forecasting for smart grid energy management. CSEE J. Power Energy Syst. 2015, 1, 38–46.
34. Mubiru, J. Predicting total solar irradiation values using artificial neural networks. Renew. Energy 2008, 33, 2329–2332.
35. Klement, E.P. Fuzzy Logic in Artificial Intelligence CD. In Proceedings of the 8th Austrian Artificial Intelligence Conference, Linz, Austria, 28–30 June 1993.
36. Song, Q.; Chissom, B.S. Forecasting enrollments with fuzzy time series—Part II. Fuzzy Sets Syst. 1994, 62, 1–8.
37. Yu, H.-K. Weighted fuzzy time series models for TAIEX forecasting. Phys. A Stat. Mech. Appl. 2005, 349, 609–624.
38. Schnabel, J.; Valkealahti, S. Energy Storage Requirements for PV Power Ramp Rate Control in Northern Europe. Int. J. Photoenergy 2016, 2016, 1–11.
39. Diaz, V.N. Avaliação de Desempenho das Estratégias de Controle para Suavização de Potência Ativa de Sistemas Fotovoltaicos com Armazenamento de Energia. Master’s Thesis, Universidade Estadual do Oeste do Paraná (UNIOESTE), Paraná, Brazil, 2019.
40. Espinar, B.; Ramírez, L.; Drews, A.; Beyer, H.G.; Zarzalejo, L.F.; Polo, J.; Martín-Pomares, L. Analysis of different comparison parameters applied to solar radiation data from satellite and German radiometric stations. Sol. Energy 2009, 83, 118–125.
41. Gueymard, C.A. A review of validation methodologies and statistical performance indicators for modeled solar radiation data: Towards a better bankability of solar projects. Renew. Sustain. Energy Rev. 2014, 39, 1024–1034.
42. Bisong, E. Building Machine Learning and Deep Learning Models on Google Cloud Platform. In Building Machine Learning and Deep Learning Models on Google Cloud Platform: A Comprehensive Guide for Beginners; Apress: New York, NY, USA, 2019.
43. de Oliveira, L.; Filho, S.C.; Filho, P.C.C. Modelos para a temperatura de operação de módulos fotovoltaicos: Uma revisão das correlações e variáveis pertinentes. In Proceedings of the VIII Congresso Brasileiro de Energia Solar, Fortaleza, Brazil, 1–5 June 2020; pp. 2–11.
44. Viscondi, G.D.F.; Alves-Souza, S.N. Solar Irradiance Prediction with Machine Learning Algorithms: A Brazilian Case Study on Photovoltaic Electricity Generation. Energies 2021, 14, 5657.
45. Silva, P.C. A short tutorial on Fuzzy Time Series—Part II, Towards Data Science. 2018. Available online: https://towardsdatascience.com/a-short-tutorial-on-fuzzy-time-series-part-ii-with-an-case-study-on-solar-energy-bda362ecca6d (accessed on 7 September 2021).
46. Marcos, J.; Marroyo, L.; Lorenzo, E.; Alvira, D.; Izco, E. Power output fluctuations in large scale pv plants: One year observations with one second resolution and a derived analytic model. Prog. Photovoltaics Res. Appl. 2010, 19, 218–227.
47. Li, P.; Zhou, K.; Yang, S. Photovoltaic Power Forecasting: Models and Methods. In Proceedings of the 2nd IEEE Conference on Energy Internet and Energy System Integration, Beijing, China, 20–22 October 2018.

Figure 1. Algorithm of the forecasting process with the FTS (adapted from [22]).

Figure 2. Algorithm of the irradiance forecasting method and subsequent power simulation of this work.

Figure 3. Example of the partitions with triangular and Gaussian membership function of the variables.

Figure 4. Result of the learning process, generation of rules for each of the FTS methods used.

Figure 5. Example of GHI prediction using FTS methods with three time horizons.

Figure 6. Comparison of the coefficient of determination for the methods implemented and comparison according to the time horizon for the GHI forecast.

Figure 7. Power simulation for the 5 min horizon with the FIG-FTS method.

Figure 8. Comparison of the coefficient of determination for the power simulation.

Table 1. Model Hyperparameters [24].

Symbol	Parameter	Description
$k \in ℕ$	Number of partitions	Number of fuzzy sets to be created in the linguistic variable
$μ ∶ U \to [0, 1]$	Membership Function (MF)	Measures the membership value of a value $y \in U$ to the fuzzy set
$Π$	Partitioning method	Defines how the universe of discourse will be divided
$α \in [0, 1]$	The α–cut	Minimum degree of membership to be considered in the fuzzification process
$Ω \in ℕ$	Model order, number of lags	Number of lags used in the fuzzy rule precedent
$L \in Ω \times ℕ$	Temporal lag index	Index vector with length Ω and $1 \leq L[i] < L[i + 1]$ for $i = 0, \dots, Ω$
$κ \in ℕ$	k-Nearest Neighbor (kNN)	The number of nearest neighbors that the spatial index searches for in FIG during fuzzification

Table 2. Database division and number of records.

Time Horizon	Training (Year 2018)	Test (Year 2019)
5 min	105,121	105,121
15 min	35,041	35,041
30 min	17,521	17,521

Table 3. Dataset descriptive statistic.

Measurement	Unit	Median	Mean	Maximum Record	Standard Deviation
Global Horizontal Irradiance	W/m²	2.59	179.52	1588	282.38
Air Temperature	°C	21.64	21.38	37.12	4.13
Contact Temperature	°C	25.49	28.39	59.56	8.27
Power	W	0	368.42	2732.9	591.75

Table 4. Variables used in the model.

Parameter	Variable Description	Type
data	Containing the variables min, hour and month	Input
temp_amb	Ambient Temperature (°C)	Input
GHI	Global Horizontal Irradiance (W/m²)	Input/Output
pow	Real power generated (W)	Input
temp_contact	Module temperature (°C)	Input
modeled_power	Power obtained from spatial smoothing (W)	Output

Table 5. Configuration of the variables used in the model.

Variable	Membership Function (μ)	Partitions (k)	Minimum Grade of Membership (α-cut)
Minute	Triangular	60	0.30
Hour	Triangular	24	0.30
Month	Triangular	12	0.30
Temperature	Triangular	24	0.30
Irradiance	Gaussian	35	0.25
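
The configuration in Table 5 can be illustrated with a minimal Python sketch of fuzzification using triangular and Gaussian membership functions and an α-cut. Several choices here are assumptions made only for illustration: evenly spaced (grid) partitions, the universe bounds passed to `fuzzify`, and a Gaussian width of half the partition spacing. All function names are hypothetical.

```python
import math


def triangular(x, a, b, c):
    """Triangular membership with feet at a and c and peak at b."""
    if x <= a or x >= c:
        return 0.0
    if x <= b:
        return (x - a) / (b - a)
    return (c - x) / (c - b)


def gaussian(x, mean, sigma):
    """Gaussian membership centered at `mean` with width `sigma`."""
    return math.exp(-0.5 * ((x - mean) / sigma) ** 2)


def fuzzify(x, lower, upper, k, kind, alpha_cut):
    """Return {set index: membership} for the sets with membership >= alpha_cut."""
    step = (upper - lower) / (k - 1)  # assumption: evenly spaced set centers
    memberships = {}
    for i in range(k):
        center = lower + i * step
        if kind == "triangular":
            mu = triangular(x, center - step, center, center + step)
        else:
            mu = gaussian(x, center, step / 2)  # assumption: sigma = step / 2
        if mu >= alpha_cut:
            memberships[i] = mu
    return memberships


# Hour variable: triangular, k = 24, alpha-cut = 0.30 (universe 0..23 assumed).
hour_sets = fuzzify(14.25, 0, 23, 24, "triangular", 0.30)
# Irradiance: Gaussian, k = 35, alpha-cut = 0.25 (universe 0..1588 W/m^2 assumed).
ghi_sets = fuzzify(500.0, 0.0, 1588.0, 35, "gaussian", 0.25)
```

In this example the value 14.25 has membership 0.75 in the set centered at hour 14 and 0.25 in the set centered at hour 15; the latter falls below the α-cut of 0.30 and is discarded.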

Table 6. Experimental setup of models, variables and prediction horizons.

Model Development	WEIGHTED-FTS	FIG-FTS
Prediction horizons (minutes)	5, 15 and 30	5, 15 and 30
Input	Data, ambient temperature, GHI, generated power, contact temperature	Data, ambient temperature, GHI, generated power, contact temperature
Hyperparameters for learning models	First order (Ω = 1)	Second order (Ω = 2) and k-Nearest Neighbor (κ = 2)
Partial output	GHI	GHI
Low pass filter parameters	Module temperature, installed peak system power, GHI_STC	Module temperature, installed peak system power, GHI_STC
Output	Modeled power	Modeled power

Table 7. Statistical metrics of prediction errors.

FTS Method	Prediction Horizon	MBE	RMSE	nRMSE	R²
Part A (Indirect Prediction)–Irradiance Forecasting with FTS Methods
WEIGHTED-FTS	5 min	4.68	36.32	0.08	0.97
	15 min	3.97	36.59	0.25	0.968
	30 min	7.05	43.98	0.61	0.96
FIG-FTS	5 min	3.67	28.14	0.06	0.98
	15 min	1.82	36.27	0.25	0.97
	30 min	−1.98	53.19	0.74	0.94
Part B (Direct Prediction)–Power Simulation from Irradiance Forecasts with Low Pass Filter
WEIGHTED-FTS	5 min	7.57	143.70	0.33	0.93
	15 min	11.31	160.24	1.11	0.91
	30 min	26.63	197.78	2.76	0.87
FIG-FTS	5 min	5.54	139.12	0.32	0.93
	15 min	6.64	134.40	0.93	0.94
	30 min	6.76	177.75	2.48	0.90
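
For reference, the four metrics reported in Table 7 can be computed as in the following Python sketch. The exact definitions are assumptions: MBE is taken as forecast minus observed, nRMSE as RMSE divided by the mean observed value, and R² as one minus the ratio of residual to total sum of squares. The normalization used by the authors may differ. The sample values are illustrative and are not taken from the dataset.

```python
import math


def error_metrics(observed, forecast):
    """Return MBE, RMSE, nRMSE and R2 for paired observed/forecast values."""
    n = len(observed)
    errors = [f - o for f, o in zip(forecast, observed)]  # assumed sign convention
    mbe = sum(errors) / n
    ss_res = sum(e * e for e in errors)
    rmse = math.sqrt(ss_res / n)
    mean_obs = sum(observed) / n
    nrmse = rmse / mean_obs  # assumption: normalized by the observed mean
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    r2 = 1.0 - ss_res / ss_tot
    return {"MBE": mbe, "RMSE": rmse, "nRMSE": nrmse, "R2": r2}


# Illustrative values only.
observed = [100.0, 200.0, 300.0, 400.0]
forecast = [110.0, 190.0, 310.0, 420.0]
metrics = error_metrics(observed, forecast)
```

Under this sign convention, a positive MBE indicates that the model overestimates on average, and a negative MBE indicates underestimation.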

Table 8. Kolmogorov-Smirnov Test (KS).

Normality Test–KS	Critical Value (Vc)	KS Statistic	Result
Part A–Irradiance Forecasting with FTS Methods
WEIGHTED-FTS/5 min	0.1979	0.1195	H0 Accepted
WEIGHTED-FTS/15 min	0.2381	0.2325	H0 Accepted
WEIGHTED-FTS/30 min	0.3368	0.2646	H0 Accepted
FIG-FTS/5 min	0.1979	0.1057	H0 Accepted
FIG-FTS/15 min	0.2381	0.1878	H0 Accepted
FIG-FTS/30 min	0.3368	0.2464	H0 Accepted
Part B–Power Simulation from Irradiance Prediction Applying Low Pass Filter
WEIGHTED-FTS/5 min	0.1979	0.3093	H0 Rejected
WEIGHTED-FTS/15 min	0.2381	0.3498	H0 Rejected
WEIGHTED-FTS/30 min	0.3368	0.4786	H0 Rejected
FIG-FTS/5 min	0.1979	0.2768	H0 Rejected
FIG-FTS/15 min	0.2381	0.3146	H0 Rejected
FIG-FTS/30 min	0.3368	0.4082	H0 Rejected
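
The results in Table 8 follow the usual decision rule for the KS test: the null hypothesis H0 is retained when the KS statistic does not exceed the critical value Vc, and rejected otherwise. The Python sketch below applies this rule to the tabulated values, listed in table order; the function name `ks_decision` is hypothetical.

```python
def ks_decision(ks_statistic, critical_value):
    """Retain H0 when the KS statistic does not exceed the critical value."""
    return "H0 Accepted" if ks_statistic <= critical_value else "H0 Rejected"


# (part, method, Vc, KS) in the order given in Table 8.
table_8 = [
    ("A", "WEIGHTED-FTS", 0.1979, 0.1195),
    ("A", "WEIGHTED-FTS", 0.2381, 0.2325),
    ("A", "WEIGHTED-FTS", 0.3368, 0.2646),
    ("A", "FIG-FTS", 0.1979, 0.1057),
    ("A", "FIG-FTS", 0.2381, 0.1878),
    ("A", "FIG-FTS", 0.3368, 0.2464),
    ("B", "WEIGHTED-FTS", 0.1979, 0.3093),
    ("B", "WEIGHTED-FTS", 0.2381, 0.3498),
    ("B", "WEIGHTED-FTS", 0.3368, 0.4786),
    ("B", "FIG-FTS", 0.1979, 0.2768),
    ("B", "FIG-FTS", 0.2381, 0.3146),
    ("B", "FIG-FTS", 0.3368, 0.4082),
]
results = [(part, method, ks_decision(ks, vc)) for part, method, vc, ks in table_8]
```

Applying the rule reproduces the Result column: every Part A case is accepted and every Part B case is rejected.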

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
