An Approach for Multiparameter Meteorological Forecasts

Pérez-Vega, Abrahán; Travieso-González, Carlos M.; Hernández-Travieso, José Gustavo

doi:10.3390/app8112292

Open AccessArticle

An Approach for Multiparameter Meteorological Forecasts

by

Abrahán Pérez-Vega

,

Carlos M. Travieso-González

and

José Gustavo Hernández-Travieso

^*

Signal and Communications Department, Institute for Technological Development and Innovation in Communications (IDeTIC), University of Las Palmas de Gran Canaria, Campus Universitario de Tafira, sn, Ed. de Telecomunicación, Pabellón B, Despacho 111, E35017 Las Palmas de Gran Canaria, Spain

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2018, 8(11), 2292; https://doi.org/10.3390/app8112292

Submission received: 17 October 2018 / Revised: 9 November 2018 / Accepted: 13 November 2018 / Published: 19 November 2018

(This article belongs to the Section Environmental Sciences)

Download

Browse Figures

Versions Notes

Abstract

:

Featured Application

This research provides a tool for green energy generation.

Abstract

Accurate meteorological forecasting has great importance in different fields. This works introduces a system to obtain precise predictions, which uses regression functions, and collected data using the meteorological stations from the Gran Canaria and South Tenerife airports. The dataset offers information about different phenomena as temperature, wind speed, solar radiation, pressure, moisture, cloudiness, rainfall and meteors. A preprocessing stage has been applied before prediction stage to adapt the collected data. A support vector machine, regression tree, and fit linear model are applied as regression functions. Results has been measured by the mean square error. These results reached an accuracy of 0.07 °C for temperature, 0.56 km/h for wind speed, 7.45 tenths of kJ/m² for solar radiation and 0.11 mm for precipitation. It shows the robustness of the multiparameter meteorological forecast approach.

Keywords:

prediction; meteorology; multiparameter forecasting; regression tree; fit linear model

1. Introduction

Meteorological parameters have reached importance in different fields like energy, tourism, and farming. In terms of energy use today, fossil fuels act as the main energy generation source [1], generating problems like pollution, greenhouse effects, and increased CO₂ levels. Accordingly, it is necessary to use renewable energy source as wind power, hydro, solar energy, etc. to improve the environment.

Nevertheless, optimization is a weak point in the use of renewable energy. For that reason, to supply clean energy to the population, the combination with fossil fuels is necessary. If efficiency in the generation process increases, the use of nonrenewable sources will be reduced.

One of the projects that pretends to reach self-sufficiency in energetic terms is located on the island of El Hierro (Canary Islands, Spain). It stands out with the capability to supply to the entire island the demand of power through a wind-hydro-pumped station [2].

Many works have been developed on this field. Then, different works are shown in relation to meteorology.

Meteorological predictions have been object of study for diverse applications like the study of the load in electrical networks, as in [3,4,5], or to help elderly people as in [6]. The importance of having good weather forecasting will help in various areas. Applications may vary, from the prediction of red tides or rainstorms, like [7,8,9], to comfort applications in buildings or workplaces like [10,11,12,13].

In relation to temperature prediction, in [10,11,12,14,15,16], they have obtained good results using artificial neural networks (ANN) and support vector machines (SVM). In [15], the best result reaches a mean square error (MSE) of 0.136 °C.

Concerning solar radiation predictions [17,18,19,20,21,22,23,24,25], classification systems were mainly used, such as ANN and SVM. In particular, in [25], the best result reached a 99.9% of accuracy.

Respecting to wind speed predictions, works like [26,27,28,29,30] makes use of classifications systems like ANN and SVM. Best result was obtained in [26], where the mean absolute error (MAE) was 0.85 m/s.

As seen on previous works, based on meteorological forecasting, it observes the great importance and application on different fields, as energy generation, or helping elderly people. For this reason, the purpose of this work merges with the motivation to collaborate with these applications. It consists of the design of a prediction system that can be used under different conditions to get a forecasting of different meteorological parameters as wind speed, temperature, solar radiation or precipitation. The innovative point of this work versus previous researches is the adaptability of the method to different meteorological phenomena. The correct meteorological characterization of the study area will allow to determine the moment in which appears a peak of energy demand. The information used in this study comes from the State Meteorological Agency (Agencia Estatal de Meteorología, AEMET, in Spanish) that depends on the Spanish Government. The regression functions used in this study are presented below:

Support vector machines (SVM). Using different kernel functions to get an optimized result.
Regression tree.
Fit linear model (FITLM).

As shown on this section, there are a lot of studies that carries out predictions of meteorological parameters. The statistical used to probe the proposed models varies between MAE and MSE and the classification systems vary between ANN and SVM. Many of them make short time predictions and even long time predictions too. In addition, errors reached in previous researches are higher than those obtained in this research.

But the real proof of the goodness of the proposed model is that, as it is shown on the following sections, we only applied one model to the whole experiment regardless of the meteorological phenomena involved in the prediction. This is a scientific value added into literature.

The novelty of the system consists of the feasibility to use it in different geographical location only using data from meteorological stations. The proposal does not need to introduce data from geopotential highs or sea level pressures (SLP) in order to obtain an accurate prediction in the case of wind and precipitation. Besides, this proposal can be used for different climate parameters. It shows the robustness and any similar works has not been found in the state-of-the-art.

2. Materials and Methods

This study has been made with the database provided by AEMET, an agency that depends on Ministry of Agriculture, Food and Environment of the Spanish government. From all the stations that AEMET has deployed around Spain, this work used two meteorological stations, which are located at Gran Canaria (GC) airport (Gran Canaria, Canary Islands) and South Tenerife (TF) airport (Tenerife, Canary Islands), respectively. This database offers data relative to temperature, humidity, solar radiation, meteors, precipitation, wind speed and clouds. The data from each phenomenon is given in an Excel sheet format (.xls) including the information collected hourly by both stations under study alongside the whole period of study from 2003 to 2007. In the same way a .txt file is given with explanatory information about each .xls sheet as a help to understand data contained on it. Figure 1 shows the map of the Canary Islands (Spain) and the details of Gran Canaria and Tenerife; the airports’ locations are marked with red dots.

Canary Islands are located in a subtropical climate area beaten by Trade Winds with warm temperatures. However, they suffer from the remains of tropical storms that hit the Caribbean when they turn around the Atlantic Ocean. The study areas was located on the southeastern of both islands, were winds allows the generation of green energy.

To achieve an accurate prediction, a preprocessing stage is applied to adapt the information given to the input of the prediction system. Consequently, the procedure must be quick, avoiding large delays when the prediction is performed; and thus reaching almost a real-time prediction approach.

Firstly, it is necessary to check each sheet to not introduce false or nonexisting data to the system. The reason is due to the periods of time where there is no data, caused by any damage or inactivity of the station or anyone of its sensors.

Secondly, the concatenation and adaptation (in terms of measurement unit) are done before to insert the parameter information to the system. Temperature and wind speed are two examples of meteorological phenomena that have more information during the study period. For this reason, they are two of the phenomena chosen to obtain the prediction.

Finally, once the preprocessing stage has been done, resulting in the obtaining of the input data, it is possible to apply the classification systems. Our forecasting system is based on the use of the regression functions, which is described below.

The number of samples that will be used in this study are shown in Table 1.

2.1. Support Vector Machines (SVM)

Given a training set of instance-label pairs

(x_{i}, y_{i})

,

i = 1, \dots, l

where

x_{i} ϵ R^{n}

and

y_{i} ϵ {1, - 1}^{l}

, the SVM require the solution of the following optimization problem:

\min_{w, b, ξ} \frac{1}{2} w^{t} w + C \sum_{i = 1}^{l} ξ_{i}

(1)

y_{i} (w^{t} ϕ (x_{i}) + b) \geq 1 - ξ_{i}

(2)

subject to

ξ_{i} \geq 0 .

Where w are the weight and the support vector is there are different zero, C is the cost function, b is the bias,

ϕ (x_{i})

is the kernel of the input, y is the output and

ξ_{i}

is the value of the plane.

Here training vectors

x_{i}

are mapped into a higher (maybe infinite) dimensional space by the function

ϕ

. SVM finds a linear separating hyperplane with the maximal margin in this higher dimensional space.

C > 0

is the penalty parameter of the error term. Furthermore,

K (x_{i}, x_{j}) \equiv ϕ {(x_{i})}^{T} ϕ (x_{j})

is called the kernel function [31].

To obtain a system able to reach the smallest error, it has been tested using diverse kernel functions with the aim to obtain a better forecasting approach. Year 2006 was used in the training mode, the reason is explained at the end of this section. During training and test modes, the kernel functions used in this supervised classifier were as follows:

Linear kernel:

K (x_{i}, x_{j}) = x_{i}^{T} x_{j}

(3)

Radial basis function (RBF) kernel:

K (x_{i}, x_{j}) = \exp (- γ ‖ x_{i} - x_{j} ‖^{2}), γ > 0

(4)

Polynomial kernel:

K (x_{i}, x_{j}) = ϕ {(γ x_{i}^{T} x_{j} + r)}^{d}, γ > 0

(5)

being

γ

,

r

and

d

the parameters of the kernel.

As a result of the training function, MATLAB returns a structure called SVMStruct, which contains information about the trained SVM classifier.

2.2. Regression Tree

The classification and regression trees are methods of machine learning for building predictive models from temporal series. The models are obtained by recursive partitioning of the data space and fitting a simple prediction model within each partition. As a result, the partition can be graphically represented as a decision tree. Classification trees are designed for dependent variables that have a finite number of unordered values with prediction error measured by the squared difference between observed and predicted values [32,33]. An alternative approach to nonlinear regression is to subdivide or partition the space into smaller regions, where interactions are more manageable. Then subdivisions are divided again. Finally, more manageable pieces of space are obtained, which can be fitted by simple models. This process is called recursive partitioning. Thus, the global model has two parts: one is the recursive partition, and the other is a simple model for each cell of the partition.

Prediction trees use the tree to represent the recursive partition. Each of the terminal nodes or leaves of the tree represents a cell partition and has attached to it a simple model that is applied only in that cell. A point belongs to a leaf if falls in the corresponding cell in the partition. In order to determine the cell, it begins at the root node of the tree, and a series of questions about the features is made. Interior nodes are labeled with questions, and the edges or branches between them are marked with responses. Depending on the answer to the previous question, the next question is made. In the classic version of a prediction tree, each question refers to a single attribute, and only has an affirmative or negative response (binary).

For classical regression trees, the model in each cell is only an estimate of Y constant. That is, it can be assumed that the points

(x_{i}, y_{i}), (x_{2}, y_{2}), \dots, (x_{c}, y_{c})

are all the samples belonging to the leaf of the node

l

.

Then, the model for

l

is

\hat{y} = \frac{1}{c} \sum_{i = 1}^{c} y_{i}

, the sample mean of the response variable in that cell (Shalizi, 2006). This is a piecewise-constant model.

An example of regression tree is shown on Figure 2.

2.3. Fit Linear Model (FITLM)

The measured values in the real world never fit perfectly for a model due to measurement errors and to some mathematical models is a simplification of the real world. If it is taken into account all factors influencing a set of variables, it would be unmanageable, committing the model some error.

The chosen model indicates that

X

variable (dependent variable) is a function of the variables

Y_{1}, \dots, Y_{k}

(independent variables). That is, all variables corresponding to a subset of all possible cases are measured. By applying the model, the difference between the measured variable

X

and the predicted value minimizes the error values, fitting the model.

The input to the system are the different preprocessed values of meteorological parameters previous to its target value. After the corresponding prediction process is performed, it will be obtained a single output value, being the value of the weather parameter to predict.

The different operating modes of the prediction systems are:

Training mode, in which the model used later on the test mode is created
Testing mode, in which the desired output is obtained.

Figure 3 shows the concept of the sliding window.

The sliding window is an important parameter that must be considered when the input data is introduced into the forecast system. To it, it is assigned a certain value or size. This window contains past data or attributes (highlighted in orange) of the phenomenon that it is wanted to predict, called target value (highlighted in blue). It moves in order to change its attributes in every iteration to obtain the system outputs or target values. When an iteration happens, the window moves one position forward. The last target value becomes an attribute of the new sliding window to predict the next target value. In this way, a prediction of a specific meteorological parameter is performed. The target value corresponding to each row is set into the target values vector, which is another important parameter for the training mode.

The training matrix is generated with this sliding window principle. Each row of the matrix contains the sliding window corresponding to every iteration. Every row is a vector with the information previous to a specific target value. If the window size is “n”, there will be “n” values of each type of meteorological parameter used to predict in that row of the matrix.

For example, if it is wanted to make a prediction about temperature using temperature and wind as input parameters using a sliding window size of five samples. The first row of the training matrix will consist of the first five temperature values concatenated with the first five values of the wind speed. Thus, it is intended to predict the sixth temperature value, which is in the first position of the target values vector. An example of the training matrix and target values vector, are shown in Table 2.

Size of the sliding window is a critical part in the prediction system in terms of accuracy in the obtained prediction. For that reason, it is necessary to make several test to obtain the appropriate size of the window.

As a result of the training mode, with all the information obtained on it, the model is created and used in the test mode as input.

The block diagram of the forecasting system is shown on Figure 4.

A training mode and, subsequently, a testing mode has been realized, forming the supervised classifier, to obtain kernel parameters.

The MSE is the parameter that gives the measure of the goodness of the system, defined by:

M S E = \frac{1}{n} \sum_{i = 1}^{n} {(Y_{i} - {\hat{Y}}_{i})}^{2}

(6)

where

\hat{Y}

is a vector of

n

predictions, and

Y

is the vector observed values of the variable being predicted.

The optimal size of the sliding window is calculated using a loop function where there is a supervised classifier. Errors are obtained comparing the predicted values with the originals. In each iteration of the loop, the size of the sliding window is modified and the error will also be modified. This size depends on the MSE reached once the testing stage have been done. The lower MSE, the better sliding window size.

Data from the year 2006 were used in the training mode, the reason to choose this year is due to the number of samples, different in every year caused by acquisition data problems of the sensors. In 2006, the number of samples were quite similar of the average number of samples of the rest of the years of study, giving us the possibility of modelling an entire year. Once the model has been obtained, it is used to the test stage with the data from years 2003, 2004, 2005, and 2007. The system is trained with the 20% of the samples. With the resulting model, the test mode is performed with the remaining 80% of the samples. This is one of the strengths of the system.

In the specific case of solar radiation parameter, the data for 2005 are used in the training mode because the number of samples in this year is similar to the average of other years. In the case of Gran Canaria station, there is not information about solar radiation in the year 2003.

According to this, the following experiments have been carried out:

Temperature (°C) forecasting using as input data:
○
Temperature (°C).
○
Temperature (°C) and precipitation (mm).
○
Temperature (°C) and wind speed (km/h).
○
Temperature (°C) and temperature (tenths of °C).
○
Temperature (°C), temperature (tenths of °C) and wind speed (km/h).
Wind speed (km/h) forecasting using wind speed (km/h) as input data.
Solar radiation (tenths of kJ/m²) forecasting using solar radiation (tenths of kJ/m²) as input data.
Precipitation (mm) forecasting using precipitation (mm) as input data.

3. Results and Discussion

In order to analyze results and to observe the system accuracy, they are graphically represented, making a comparison between the predicted values, in red, and the original values (target values), in blue, with labels “predicted value” and “original value”, respectively. The time period represented in these graphs is eight days, the first eight days of the year. Although errors are calculated with the information belonging to all days of the every year. Because of the great number of experiments and cases, it will only be represented graphically the forecasting corresponding to the use of the regression function which gets best results.

Despite getting several types of errors, only the values of the MSE are represented in tables for the classification systems explained in Section 2 (i.e., SVM linear, SVM RBF, SVM polynomial, regression tree and fit linear model). In all cases, results are presented with seven decimal places due to the similarity between the results.

When the SVM predictor is used, the optimal kernel parameters are obtained using a loop that contains the training and testing modes. It modifies the values of the kernel parameters leading to values that achieves smallest error, which are the best. Being

γ

,

r

, and

d

the parameters of the kernel.

This section may be divided by subheadings. It should provide a concise and precise description of the experimental results, their interpretation as well as the experimental conclusions that can be drawn.

3.1. Forecast of Temperature

In this case, temperature in °C is the weather information to be predicted.

3.1.1. With Temperature as Input Data

The optimal size of the sliding window is 1. Best results are shown in Table 3 and Figure 5.

The support vector machine system with linear kernel reaches greater accuracy in predicting temperature, using as input the temperature information in Gran Canaria.

3.1.2. With Temperature and Precipitation as Input Data

The optimal size of the sliding window is 1. Table 4 and Figure 6 shown best results when temperature (°C) and precipitation in mm are involved.

The system gets best results in Gran Canaria using the SVM system with linear kernel, using as input the information of temperature and precipitation.

3.1.3. With Input Data from Temperature and Wind Speed

The optimal size of the sliding window is 1, and best results are shown in Table 5 and Figure 7. The data of wind speed in kilometers per hour (km/h) are used in this test.

3.1.4. With Input Data from Temperature in °C and Temperature in Tenths of °C

Table 6 and Figure 8 shows best results obtained. The optimal size of the sliding window is 1.

3.1.5. With Input Data from Temperature in °C and Temperature in Tenths of °C and Wind Speed

The optimal size of the sliding window is 1, reaching best results showed in Table 7 and Figure 9.

3.2. Forecast of Wind Speed

In this case, the weather information to predict is the wind speed in km/h, using as input parameters wind speed (km/h).

Due to the number of samples relating to wind parameter is similar to the number of samples available of temperature, the optimal size of the sliding window is 1. The results are shown in Table 8 and Figure 10.

When the prediction of the wind speed is done only using wind speed as input data, the best results are achieved by means of SVM with polynomial kernel. The system reaches good results in both places, getting an average of MSE smaller than 1 km/h.

3.3. Forecast of Solar Radiation

In order to predict solar radiation in tenths of kilojoules per square meter (kJ/m²), the input data used were solar radiation in tenths of kJ/m². The optimal size of the sliding window is 1. The system was trained using data of 2005 and testing with the remaining years. Best results are shown in Table 9 and Figure 11.

The best classification system to use to predict solar radiation are those based on SVM with linear or polynomial kernels.

3.4. Forecast of Precipitation

To predict precipitation in mm, solar radiation in tenths of kJ/m² and precipitation in mm were used as input data. The system, it is trained with the information of 2006 and testing with the remaining years.

The complexity of this parameter is different from the others. As it does not happen too often, the value of precipitation is low and constant for a long period of time. However, when it happens, the value of this parameter increases considerably, whereupon the amount of previously recorded information required to make an accurate forecasting has to be big enough. As an effect, the optimal size of the sliding window is 33. The best results are shown in Table 10 and Figure 12.

The best system to predict precipitation varies depending on the location. In South Tenerife best results are achieved using a setting based on fit linear model system, while for Gran Canaria, SVM with polynomial kernel was used.

This is another innovation of the system, allowing to obtain forecasting of precipitation in a specific location despite of the difficulty of the amount of information needed to make it, by adapting the optimal size of the sliding window.

The MSE averages of the best systems (higher accuracy) in all cases are represented in Table 11, high variability data offers more difficulty to obtain high accuracy forecasting (e.g., wind and solar radiation).

Once the test has been conducted, it proceeds to discuss which system is best for predicting each weather parameter and differences between error rates depending on location. In addition, a comparison is also made with results obtained in other works.

Generally speaking, best results are obtained using systems based on SVM with a polynomial kernel most of the time. However, with the other systems are also achieved good results, as observed in the figures and tables below.

Only for comparison purposes, using the same database (AEMET), authors have made a test with a system based on ANNs, better results are also achieved with the system used in this research, as shown in Table 12. The characteristics of the ANN used in the tests are back-propagation algorithm, 24 neurons on the hidden layer and one neuron on the output layer; more details available in [16,26].

Table 13 introduces best results obtained. If temperature is the weather parameter to predict (see Table 13), the best solution is to use temperature in °C and temperature in tenths of °C as input parameters. As regressive function to obtain a predictive model, the best option is to use SVM with a polynomial function kernel. The reason why worst results are obtained in TF is because the variation between consecutive samples is higher in TF than GC. However, good results are achieved in both cases. In case of predicting wind speed (see Table 13), using as input parameters the previous values of wind speed in km/h, the best system is SVM with polynomial kernel function. The reason for the different results (better in GC than in TF) is the same as before. When predicting solar radiation parameters (see Table 13) when using registered data about radiation in tenth of kJ/m² as input, the best system can vary by location. In both cases, good results are obtained. For the forecasting of precipitation (see Table 13), and looking to data presented above, the best results are achieved in TF than in GC. The reason is that it occurs more often in TF, which allows for a better forecasting model.

Table 14 shows a comparison between the results obtained in this research versus previous studies, related to temperature, described in the introduction section [10,11,12,15,16]. In all cases, our approach presents an important improvement versus all works.

Regarding the sampling period, samples are collected hourly in [10], there are no data about sampling frequency in [11], samples are collected every 10 min from October 2012 to February 2013 in [12], samples are collected every 30 min in [15] and the same database of this study and under the same conditions has been used in [16]. It comes from different geographical locations, from Tunisia in [10], from China in [11], from Australia in [12], from Costa Rica in [15] and from Canary Islands (Spain) in [16], thus the climate conditions has a great variation from one location to another. Our work and [19] presents the same result even with different sampling period. Under the same conditions and with the same database our work improves the results of [16], proving the goodness of the system. If sampling period could be reduced in stations used for testing this system, the approach can be improved in the future.

In addition, this work can compare two geographical locations where meteorological data varies differently. In Tenerife (28°02′40″ N 16°34′21″ O), data from temperature and wind speed have more variability that in Gran Canaria due to the location of the station in the south zone of the island. In Gran Canaria (27°55′55″ N 15°23′12″ O). The location is in the south east of the island in the border line of the zone under influence of the Trade Winds that causes softest conditions, as a result of this less variation in data leads to better results in temperature. Conversely, results of wind speed offer bigger error in Tenerife, although it is still a good result versus the state-of-the-art. Meteorological parameters like solar radiation does not have high variability in both locations, they offer similar results. This capability to validate the system in two different geographical locations is also an innovation versus previous researches.

4. Conclusions

There are different methods to obtain meteorological predictions with different methods to acquire information about weather.

Some actual problems are associated with meteorological parameters. To solve this situation, it is necessary to have an accurate forecasting about those parameters. The innovation of this research is to obtain different forecasting—one per phenomenon—in different geographical locations just applying the same method.

Final results, shown in Table 13, reveals that it is obtained better forecasting accuracy than other systems based on other type of classifiers [10,11,12,15,16]. For example, in temperature prediction, wind speed and solar radiation, best results are 0.136 °C, 0.56 km/h and 7.45 tenths of kJ/m² respectively.

In addition, methodology and results show the adding value of this work. The same proposal has been checked for different climate parameters and to show an approach th at can be used for any meteorological parameter. This is an important goal of this work.

Taking into account all the results obtained, the feasibility to execute the system, and the low cost of materials needed to make the system possible, this is an interesting tool to be used in the energy generation processes by means of renewable sources. Future works will include the study of the fusion of classification systems, testing which system offers best result in the fusion.

Author Contributions

J.G.H.-T. and C.M.T.-G. conceived and designed the experiments; A.P.-V. performed the experiments; A.P.-V., C.M.T.-G. and J.G.H.-T. analyzed the data; A.P.-V. and J.G.H.-T. wrote the paper.

Funding

This research was funded by the government of Spain, grant number TEC2016-77791-C4-1-R.

Conflicts of Interest

The authors declare no conflicts of interest. The founding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, and in the decision to publish the results.

References

World Bank. Available online: http://datos.bancomundial.org/indicador/EG.USE.COMM.FO.ZS (accessed on 21 March 2018).
Gorona del Viento. Available online: http://www.goronadelviento.es/index.php (accessed on 21 March 2018).
Sorjamaa, A.; Hao, J.; Nima Reyhani, N.; Ji, Y.; Amaury Lendasse, A. Methodology for long-term prediction of time series. Neurocomputing 2007, 70, 2861–2869. Available online: https://www.sciencedirect.com/science/article/pii/S0925231207001610 (accessed on 30 May 2018). [CrossRef]
Abdoos, A.; Hemmati, M.; Abdoos, A.A. Short term load forecasting using a hybrid intelligent method. Knowl. Based Syst. 2015, 76, 139–147. Available online: http://www.sciencedirect.com/science/article/pii/S0950705114004468 (accessed on 30 May 2018). [CrossRef]
El Pais. Available online: http://internacional.elpais.com/internacional/2015/04/30/actualidad/1430361520_853891.html (accessed on 21 March 2018).
Szeles, J.; Kubota, N.; Woo, J. Weather forecast support system implemented into robot partner for supporting elderly people using fuzzy logic. In Proceedings of the 2017 Joint 17th World Congress of International Fuzzy Systems Association and 9th International Conference on Soft Computing and Intelligent Systems (IFSA-SCIS), Otsu, Japan, 27–30 June 2017. [Google Scholar]
Mengjiao, Q.; Zhihang, L.; Zhenhong, D. Red tide time series forecasting by combining ARIMA and deep belief network. Knowl. Based Syst. 2017, 125, 39–52. Available online: http://www.sciencedirect.com/science/article/pii/S0950705117301569 (accessed on 30 May 2018). [CrossRef]
Sagar, S.K.; Rajeevan, M.; Bhaskara Rao, S.V.; Mitra, A.K. Prediction skill of rainstorm events over India in the TIGGE weather prediction models. Atmos. Res. 2017, 198, 194–204. Available online: http://www.sciencedirect.com/science/article/pii/S0169809516303246 (accessed on 30 May 2018). [CrossRef]
Cramer, S.; Kampouridis, M.; Freitas, A.A.; Alexandridis, A.A. An extensive evaluation of seven machine learning methods for rainfall prediction in weather derivatives. Expert Syst. Appl. 2017, 85, 169–181. Available online: http://www.sciencedirect.com/science/article/pii/S0957417417303457 (accessed on 30 May 2018). [CrossRef] [Green Version]
Ellouz, I.K.; Ben-Jmaa-Derbel, H.; Kanoun, O. Temperature Prediction of Soil-Pipe-Air Heat Exchanger Using Neural Networks. In Proceedings of the 6th International Multi-Conference on Systems, Signals and Devices, Djerba, Tunisia, 23–26 March 2009. [Google Scholar]
Chen, X.; Xu, A. Temperature and humidity of air in mine roadways prediction based on BP neural network. In Proceedings of the 2011 International Conference on Multimedia Technology (ICMT), Hangzhou, China, 26–28 July 2011. [Google Scholar]
Huang, H.; Chen, L.; Mohammadzaheri, M.; Hu, E.; Chen, M. Multi-Zone Temperature Prediction in a Commercial Building Using Artificial Neural Network Model. In Proceedings of the 10th IEEE International Conference on Control and Automation (ICCA), Hangzhou, China, 12–14 June 2013. [Google Scholar]
Park, S.Y.; Yoo, S.H. The public value of improving a weather forecasting system in Korea: A choice experiment study. Appl. Econ. 2018, 50, 1644–1658. Available online: https://www.tandfonline.com/doi/full/10.1080/00036846.2017.1368995 (accessed on 30 May 2018). [CrossRef]
Houthuys, L.; Karevan, Z.; Suykens, J.A.K. Multi-view LS-SVM regression for black-box temperature prediction in weather forecasting. In Proceedings of the 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, USA, 14–17 May 2017. [Google Scholar]
Hernández-Travieso, J.G.; Herrera-Jiménez, A.L.; Travieso-González, C.M.; Morgado-Dias, F.; Alonso-Hernández, J.B.; Ravelo-García, A.G. Temperature Control by Its Forecasting Applying Score Fusion for Sustainable Development. Sustainability 2017, 9, 193. Available online: https://www.mdpi.com/2071-1050/9/2/193 (accessed on 30 May 2018). [CrossRef]
Hernández-Travieso, J.G.; Ravelo-García, A.G.; Alonso-Hernández, J.B.; Travieso-González, C.M. Neural networks fusion for temperature forecasting. Neural Comput. Appl. 2018. Available online: https://link.springer.com/article/10.1007%2Fs00521-018-3450-0 (accessed on 30 May 2018). [CrossRef]
Wang, F.; Zhen, Z.; Mi, Z.; Sun, H.; Su, S.; Yang, G. Solar irradiance feature extraction and support vector machines based weather status pattern recognition model for short-term photovoltaic power forecasting. Energy Build. 2015, 86, 427–438. Available online: http://www.sciencedirect.com/science/article/pii/S0378778814008226 (accessed on 30 May 2018). [CrossRef]
Chia, Y.Y.; Lee, L.H.; Shafiabady, N.; Isa, D. A load predictive energy management system for supercapacitor-battery hybrid energy storage system in solar application using the Support Vector Machine. Appl. Energy 2015, 137, 588–602. Available online: http://www.sciencedirect.com/science/article/pii/S0306261914009738 (accessed on 30 May 2018). [CrossRef]
Hernández-Travieso, J.G.; Travieso-González, C.M.; Alonso-Hernández, J.B.; Dutta, M.K. Solar radiation modelling for the estimation of the solar energy generation. In Proceedings of the IEEE-2014 Seventh International Conference on Contemporary Computing (IC3), Noida, India, 7–9 August 2014. [Google Scholar]
Shamshirband, S.; Mohammadi, K.; Khorasanizadeh, H.; Yee, P.L.; Lee, M.; Petković, D.; Zalnezhad, E. Estimating the diffuse solar radiation using a coupled support vector machine–wavelet transform model. Renew. Sustain. Energy Rev. 2016, 56, 428–435. Available online: http://www.sciencedirect.com/science/article/pii/S1364032115013222 (accessed on 30 May 2018). [CrossRef]
Deo, R.C.; Wen, X.; Qi, F. A wavelet-coupled support vector machine model for forecasting global incident solar radiation using limited meteorological dataset. Appl. Energy 2016, 168, 568–593. Available online: http://www.sciencedirect.com/science/article/pii/S0306261916301180 (accessed on 30 May 2018). [CrossRef]
Liu, S.; Dong, M. Quantitative research on impact of ambient temperature and module temperature on short-term photovoltaic power forecasting. In Proceedings of the 2016 International Conference on Smart Grid and Clean Energy Technologies (ICSGCE), Chengdu, China, 19–22 October 2016. [Google Scholar]
Andrade, J.R.; Bessa, R.J. Improving Renewable Energy Forecasting With a Grid of Numerical Weather Predictions. IEEE Trans. Sustain. Energy 2017, 8, 1571–1580. Available online: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7903735&isnumber=8045832 (accessed on 30 May 2018). [CrossRef]
Murata, A.; Ohtake, H.; Oozeki, T. Modeling of uncertainty of solar irradiance forecasts on numerical weather predictions with the estimation of multiple confidence intervals. Renew. Energy 2018, 117, 193–201. Available online: http://www.sciencedirect.com/science/article/pii/S0960148117309813 (accessed on 30 May 2018). [CrossRef]
Refaat, S.S.; Abu-Rub, O.H.; Nounou, H. ANN based prognostication of the PV panel output power under various environmental conditions. In Proceedings of the 2018 IEEE Texas Power and Energy Conference (TPEC), College Station, TX, USA, 8–9 February 2018. [Google Scholar]
Hernández-Travieso, J.G.; Travieso, C.M.; Alonso, J.B. Wind Speed Modelling for the Estimation of the Wind Energy Generation. In Proceedings of the IEEE-2014 International Work Conference on Bio-Inspired Intelligence (IWOBI), Liberia, Costa Rica, 16–18 July 2014. [Google Scholar]
Buhan, S.; Çadirci, I. Multistage Wind-Electric Power Forecast by Using a Combination of Advanced Statistical Methods. IEEE Trans. Ind. Inform. 2015, 11, 1231–1242. Available online: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7105399&isnumber=7287829 (accessed on 30 May 2018). [CrossRef]
Buhan, S.; Özkazanç, Y.; Çadirci, I. Wind Pattern Recognition and Reference Wind Mast Data Correlations With NWP for Improved Wind-Electric Power Forecasts. IEEE Trans. Ind. Inform. 2016, 12, 991–1004. Available online: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=7434584&isnumber=7484223 (accessed on 30 May 2018). [CrossRef]
Allen, D.J.; Tomlin, A.S.; Bale, C.S.E.; Skea, A.; Vosper, S.; Gallani, M.L. A boundary layer scaling technique for estimating near-surface wind energy using numerical weather prediction and wind map data. Appl. Energy 2017, 208, 1246–1257. Available online: http://www.sciencedirect.com/science/article/pii/S0306261917313132 (accessed on 30 May 2018). [CrossRef]
Donida Labati, R.; Genovese, A.; Piuri, V.; Scotti, F.; Sforza, G. A Decision Support System for Wind Power Production. IEEE Trans. Syst. Man Cybern. 2018. Available online: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8252927&isnumber=6376248 (accessed on 30 May 2018). [CrossRef]
Hsu, C.W.; Chang, C.C.; Lin, C.J. A Practical Guide to Support Vector Classification; Department of Computer Science, National Taiwan University: Taipei, Taiwan, 2016; Available online: https://www.csie.ntu.edu.tw/~cjlin/papers/guide/guide.pdf (accessed on 30 May 2018). [CrossRef]
Loh, W.Y.; He, X.; Man, M. A regression tree approach to identifying subgroups with differential treatment effects. Stat. Med. 2015, 34, 11. Available online: https://onlinelibrary.wiley.com/doi/pdf/10.1002/sim.6454 (accessed on 30 May 2018). [CrossRef] [PubMed]
Izenman, A.J. Modern Multivariate Statistical Techniques: Regression, Classification, and Manifold Learning; Springer: New York, NY, USA, 2008; ISBN 978-0-387-78188-4. [Google Scholar]

Figure 1. Map of the Canary Islands, detail of Gran Canaria to the right and Tenerife to the left.

Figure 2. Example of regression tree.

Figure 3. Example of sliding window using temperature data.

Figure 4. Block diagram of the forecast system.

Figure 5. Temperature forecasting in GC for the year 2004 using Support Vector Machine (SVM) linear (input: temperature).

Figure 6. Temperature forecasting in GC for the year 2004, using SVM linear (input: temperature and precipitation).

Figure 7. Temperature forecasting in GC for the year 2004, using SVM polynomial (input: temperature and wind).

Figure 8. Temperature forecasting in GC for the year 2004, using SVM linear (input: temperature and temperature in tenths of °C).

Figure 9. Temperature forecasting in GC for the year 2004, using SVM polynomial (input: temperature, temperature in tenths of °C and wind).

Figure 10. Wind speed forecasting in GC for the year 2004, using SVM polynomial (input: wind speed in km/h).

Figure 11. Solar radiation forecasting in GC for the year 2004, using SVM polynomial (input: radiation in kJ/m²).

Figure 12. Precipitation forecasting in GC for the year 2003, using SVM polynomial (input: precipitation in mm).

Table 1. Number of samples used in this research.

Year	Gran Canaria		South Tenerife
Year	With Radiation	Without Radiation	With Radiation	Without Radiation
2003	-----	8755	1344	8323
2004	4064	8731	2960	7819
2005	1296	8755	3104	8059
2006	1952	8611	5424	8131
2007	3232	8587	5568	8419

Table 2. Example of training matrix and target value vector.

Training Input Vector								Target
Temperature Information				Wind Speed Information				Target
dataT_1_1	dataT_1_2	⋮	dataT_1_n	dataW_1_1	dataW_1_2	⋮	dataW_1_n	target_value_1
dataT_2_1	dataT_2_2	⋮	dataT_2_n	dataW_2_1	dataW_2_2	⋮	dataW_2_n	target_value_2
⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮	⋮
dataT_N_1	dataT_N_2	⋮	dataT_N_n	dataW_N_1	dataW_N_2	⋮	dataW_N_n	target_value_N

Table 3. Forecasting of temperature (°C), using temperature as input data.

	MSE in Temperature Forecasting
Method	MSE Average (2003–2007) in South Tenerife	MSE Average (2003–2007) in Gran Canaria
SVM Linear	0.139	0.100
SVM RBF $γ = 1.41 \times 10^{- 4}$	0.239	0.165
SVM Polynomial $γ = 1.41 \times 10^{- 4};$ $d = 2;$ $r = 1;$	0.196	0.148
Regression tree	0.318	0.157
Fit Linear Model	0.253	0.158

Table 4. Forecasting of temperature (°C), using temperature and precipitation (mm) as input data.

	MSE in Temperature Forecasting
Method	MSE Average (2003–2007) in South Tenerife	MSE Average (2003–2007) in Gran Canaria
SVM Linear	0.140	0.100
SVM RBF $γ = 1.42 \times 10^{- 4}$	0.239	0.166
SVM Polynomial $γ = 1.42 \times 10^{- 4};$ $d = 2;$ $r = 4$	0.150	0.110
Regression tree	0.324	0.198
Fit Linear Model	0.253	0.158

Table 5. Forecasting of temperature (°C), using temperature and wind speed (km/h) as input data.

	MSE in Temperature Forecasting
Method	MSE Average (2003–2007) in South Tenerife	MSE Average (2003–2007) in Gran Canaria
SVM Linear	0.139	0.101
SVM RBF $γ = 2.12 \times 10^{- 4}$	0.260	0.179
SVM Polynomial $γ = 2.05 \times 10^{- 4};$ $d = 4;$ $r = 5$	0.140	0.093
Regression tree	0.619	0.506
Fit Linear Model	0.261	0.163

Table 6. Forecasting of temperature (°C), using temperature (°C) and temperature (tenths of °C) as input data.

	MSE in Temperature Forecasting
Method	MSE Average (2003–2007) in South Tenerife	MSE Average (2003–2007) in Gran Canaria
SVM Linear	0.136	0.073
SVM RBF $γ = 4.49 \times 10^{- 6}$	0.211	0.133
SVM Polynomial $γ = 0.89 \times 10^{- 5};$ $d = 2;$ $r = 6$	0.153	0.086
Regression tree	0.318	0.157
Fit Linear Model	0.253	0.158

Table 7. Forecasting of temperature (°C), using temperature (°C), temperature (tenths °C) and wind speed (km/h) as input data.

	MSE in Temperature Forecasting
Method	MSE Average (2003–2007) in South Tenerife	MSE Average (2003–2007) in Gran Canaria
SVM Linear	0.158	0.159
SVM RBF $γ = 3.8 \times 10^{- 6}$	0.208	0.134
SVM Polynomial $γ = 0.75 \times 10^{- 5};$ $d = 2;$ $r = 6$	0.150	0.087
Regression tree	0.619	0.506
Fit Linear Model	0.261	0.163

Table 8. Forecasting of wind speed (km/h), using wind speed (km/h) as input data.

	MSE in Wind Speed Forecasting
Method	MSE Average (2003–2007) in South Tenerife	MSE Average (2003–2007) in Gran Canaria
SVM Linear	1.038	0.615
SVM RBF $γ = 1.54 \times 10^{- 4}$	1.269	0.782
SVM Polynomial $γ = 6 \times 10^{- 5};$ $d = 5;$ $r = 5$	0.974	0.566
Regression tree	1.559	1.247
Fit Linear Model	1.517	1.244

Table 9. Forecasting of solar radiation (kJ/m²), using solar radiation (kJ/m²) as input data.

	MSE in Solar Radiation Forecasting
Method	MSE Average (2003–2007) in South Tenerife	MSE Average (2003–2007) in Gran Canaria
SVM Linear	7.451	7.861
SVM RBF $γ = 1.2 \times 10^{- 4}$	48.587	99.831
SVM Polynomial $γ = 1.41 \times 10^{1};$ $d = 1;$ $r = 1$	7.451	7.820
Regression tree	33.167	33.543
Fit Linear Model	23.784	16.499

Table 10. Forecasting of precipitation (mm), using precipitation (mm) as input data.

	MSE in Precipitation Forecasting
Method	MSE Average (2003–2007) in South Tenerife	MSE Average (2003–2007) in Gran Canaria
SVM Linear	0.182	0.216
SVM RBF $γ = 1.2 \times 10^{- 4}$	0.234	0.183
SVM Polynomial $γ = 1.41 \times 10^{1};$ $d = 1;$ $r = 1$	0.278	0.151
Regression tree	0.188	0.159
Fit Linear Model	0.119	0.153

Table 11. Best MSE average per weather parameter.

Place	Best MSE Average in Forecast of:
Place	Temperature (°C)	Wind (km/h)	Solar Radiation (kJ/m²)	Precipitation (mm)
Gran Canaria	0.073	0.566	7.820	0.151
Tenerife	0.136	0.974	7.451	0.119

Table 12. SMV vs. Artificial Neural Network (ANN).

		Average Errors
System	Place	Temperature (°C)	Wind (km/h)
SVM Polynomial	Gran Canaria	0.073	0.566
SVM Polynomial	Tenerife	0.136	0.974
ANN	Gran Canaria	0.410	0.850
ANN	Tenerife	0.567	1.020

Table 13. Summary results table.

	MSE in Temperature Forecasting (°C)		MSE in Wind Speed Forecasting (km/h)		MSE in Solar Radiation Forecasting (kJ/m²)		MSE in Precipitation Forecasting (mm)
Method	South Tenerife	Gran Canaria	South Tenerife	Gran Canaria	South Tenerife	Gran Canaria	South Tenerife	Gran Canaria
SVM Polynomial	0.136	0.073	---	0.566	---	7.820	---	---
SVM Linear	---	---	0.974	---	7.451	---	---	0.151
Fit Linear Model	---	---	---	---	---	---	0.119	---

Table 14. Comparison vs. previous works in temperature forecasting.

Comparison with Previous Researches in Temperature
Research	Error rates in Temperature Forecasting
[10]	1.100 °C
[11]	4.900%
[12]	1 °C
[15]	0.136 °C
[16]	0.410 °C
Our work in GC	0.073 °C
Our work in TF	0.136 °C

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Pérez-Vega, A.; Travieso-González, C.M.; Hernández-Travieso, J.G. An Approach for Multiparameter Meteorological Forecasts. Appl. Sci. 2018, 8, 2292. https://doi.org/10.3390/app8112292

AMA Style

Pérez-Vega A, Travieso-González CM, Hernández-Travieso JG. An Approach for Multiparameter Meteorological Forecasts. Applied Sciences. 2018; 8(11):2292. https://doi.org/10.3390/app8112292

Chicago/Turabian Style

Pérez-Vega, Abrahán, Carlos M. Travieso-González, and José Gustavo Hernández-Travieso. 2018. "An Approach for Multiparameter Meteorological Forecasts" Applied Sciences 8, no. 11: 2292. https://doi.org/10.3390/app8112292

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Approach for Multiparameter Meteorological Forecasts

Abstract

Featured Application

Abstract

1. Introduction

2. Materials and Methods

2.1. Support Vector Machines (SVM)

2.2. Regression Tree

2.3. Fit Linear Model (FITLM)

3. Results and Discussion

3.1. Forecast of Temperature

3.1.1. With Temperature as Input Data

3.1.2. With Temperature and Precipitation as Input Data

3.1.3. With Input Data from Temperature and Wind Speed

3.1.4. With Input Data from Temperature in °C and Temperature in Tenths of °C

3.1.5. With Input Data from Temperature in °C and Temperature in Tenths of °C and Wind Speed

3.2. Forecast of Wind Speed

3.3. Forecast of Solar Radiation

3.4. Forecast of Precipitation

4. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI