A Hybrid Data-Driven Machine Learning Technique for Evapotranspiration Modeling in Various Climates

Valipour, Mohammad; Gholami Sefidkouhi, Mohammad Ali; Raeini-Sarjaz, Mahmoud; Guzman, Sandra M.

doi:10.3390/atmos10060311

Open AccessArticle

A Hybrid Data-Driven Machine Learning Technique for Evapotranspiration Modeling in Various Climates

by

Mohammad Valipour

^1,*,

Mohammad Ali Gholami Sefidkouhi

²,

Mahmoud Raeini-Sarjaz

² and

Sandra M. Guzman

¹

Department of Agricultural and Biological Engineering, Indian River Research and Education Center, University of Florida, Fort Pierce, FL 34945, USA

²

Department of Water Engineering, Sari Agricultural Sciences and Natural Resources University, Sari, Iran

^*

Author to whom correspondence should be addressed.

Atmosphere 2019, 10(6), 311; https://doi.org/10.3390/atmos10060311

Submission received: 2 May 2019 / Revised: 24 May 2019 / Accepted: 1 June 2019 / Published: 5 June 2019

(This article belongs to the Special Issue Evapotranspiration Observation and Prediction: Uncertainty Analysis)

Download

Browse Figures

Versions Notes

Abstract

In the current research, gene expression programming (GEP) was applied to model reference evapotranspiration (ETo) in 18 regions of Iran with limited meteorological data. Initially, a genetic algorithm (GA) was employed to detect the most important variables for estimating ETo among mean temperature (Tmean), maximum temperature (Tmax), minimum temperature (Tmin), relative humidity (RH), sunshine (n), and wind speed (WS). The results indicated that a coupled model containing the Tmean and WS can predict ETo accurately (RMSE = 0.3263 mm day⁻¹) for arid, semiarid, and Mediterranean climates. Therefore, this model was adjusted using the GEP for all 18 synoptic stations. Under very humid climates, it is recommended to use a temperature-based GEP model versus wind speed-based GEP model. The optimal and lowest performance of the GEP belonged to Shahrekord (SK), RMSE = 0.0650 mm day⁻¹, and Kerman (KE), RMSE = 0.4177 mm day⁻¹, respectively. This research shows that the GEP is a robust tool to model ETo in semiarid and Mediterranean climates (R² > 0.80). However, GEP is recommended to be used cautiously under very humid climates and some of arid regions (R² < 0.50) due to its poor performance under such extreme conditions.

Keywords:

machine learning; crop water requirement; Iran; hydrological extremes; uncertainty; weather parameters

1. Introduction

Evaluation of reference evapotranspiration (ETo) plays an important and undeniable role in irrigation scheduling, drought analysis, climate change studies, water level balance, agricultural and forest meteorology, long-term decision-making in food and water security policies, and optimum allocation of water resources [1,2,3]. Although several methods have been developed to predict ETo throughout the world, there is a limited number of models to estimate ETo where meteorological data is restricted or insufficient [1,2].

A solution to deal with this limitation is to use data-driven machine learning techniques, particularly genetic approaches, including genetic algorithm (GA) and gene expression programming (GEP). One advantage of the GEP to estimate ETo was that, unlike the artificial neural networks (ANN) method, the GEP generated an explicit model structure that can be easily comprehended and adopted [3].

The GA and GEP methods have been developed in various aspects of water resources such as streamflow forecasting [4], rainfall–runoff modeling [5,6,7,8], modeling transport streams with suspended sediment [9], predicting velocity in compound channels [10], characterizing risks in water supply systems [11], and modeling evaporation [12].

Although the GEP has been developed in water resources studies, the application of this technique for ETo modeling is limited. Some of the successful applications of the GEP to estimate ETo can be listed as follows.

Parasuraman et al. [3] modeled ETo using only ground temperature and net radiation. Irmak and Kamble [13] investigated evapotranspiration data assimilation with the GA and soil, water, atmosphere, and plant (SWAP) model for on-demand irrigation. The data assimilation methodology obtained from the present research can be considered a practical tool at the field scale for scheduling the irrigation estimated by remote sensing-based evapotranspiration. The results showed that the GA was effectively able to determine the terms included in the fitness function, and parameters were predicted reasonably, especially if only four variables were included. Traore and Guven [14] developed regional-specific numerical models of ETo using the GEP in Sahel. Statistically, the GEP was an effectual modeling tool for the successful computation of ETo under the study area. The results indicated that, using the GEP model, it would be possible to formulate an accurate and applicable numerical equation for each region by irrigation systems for the manual computation of ETo under the study area, where sufficient meteorological variable is often missing. Shiri [15] claimed that GEP outperforms empirical model to calculate ETo in hyper arid regions over Iran. Traore et al. [16] found that GEP is a robust tool to model ETo in in Jiangsu province, China. Mattar [17] demonstrated that GEP is a more accurate method than empirical equations to estimate ETo in Egypt.

Machine learning methods are needed in data sparse regions because Penman–Monteith equation requires a lot of data input, and many places do not have such data. Therefore, achieving similar results with fewer input data is desirable.

In addition, some studies reported greater accuracy of the GA/GEP rather than the ANN [18,19,20,21], Support Vector Regression (SVR) [20,22], Adaptive Neuro-Fuzzy Inference System (ANFIS) [1,2,23], and empirical models [2,24].

According to the literature review there are some investigations for which the superiority of the GA and GEP over other techniques such as the ANN, LP, ANFIS, and regression models have been reported. However, genetic approaches have not been simultaneously assessed under various climates. In addition, in most of previous studies, these methods were employed with many input data to estimate ETo. In different parts of the world, there is insufficient meteorological data due to the lack of synoptic stations or other limitations. Therefore, the current research seeks to evaluate ETo in four different climates in Iran to recognize the parameters with the most significant roles in ETo.

2. Materials and Methods

2.1. Designing Structure of Genetic Algorithm (GA) for the Current Research

In most of the previous investigations [1,2], a power function is presented as the optimal model to estimate ETo after modifying or calibrating the empirical models. Therefore, in the present research, various functions of the mean temperature (Tmean), maximum temperature (Tmax), minimum temperature (Tmin), relative humidity (RH), wind speed (WS), and sunshine (n) were defined and then combined by using summation and multiplying functions. The goal is to minimize the difference between the current functions with the FAO-Penman–Monteith (FPM) (Table 1).

In the next step, a GA program was coded in MATLAB environment using the values shown in Table 2.

2.2. Parameters of Gene Expression Programming (GEP) Employed in the Current Research by Using GeneXpro Tools Version 5.0

Table 3 shows the values of each option employed to design the GEP structures by using GeneXpro Tools version 5.0.

As shown in Table 3, the population size of 500 is the same as most previous research.

However, different functions are employed to enhance the results and to reduce the uncertainty in the research, and the number of replication is increased (almost 217). As a result, the elapsed time to run the GEP can be prolonged. In addition, the ranking method showed better performance compared to Roulet’s function (a fitness evaluation function).

2.3. Materials

The monthly averages of meteorological data were collocated from the Islamic Republic of Iran Meteorological Organization (IRIMO). These data contain mean, minimum, and maximum daily air temperature (°C), saturated vapor pressure deficit (kPa), mean and minimum relative humidity (%), wind speed (m s⁻¹) and direction, rainfall (mm month⁻¹), cloudy days, and sunshine (hr month⁻¹). Table 4 shows the position of all 18 synoptic stations and their climates. It should be noted that Iran is in an arid and semiarid zone in Persian Gulf region, and the major part of Iran has the same climate mentioned in Table 4. Figure 1 shows location of the weather station in Iran.

Among all stations, there is 50-year period information for 16 regions. The accuracy of the Food and Agricultural Organization of the United Nations (FAO)−Penman−Monteith (FPM) is confirmed for these regions by using lysimeter measurements and with respect to the previous investigations [25,26,27,28,29,30,31,32,33,34]. In addition, there are 27-year and 21-year period datasets for Moghan and Jiroft, respectively, which due to the availability of lysimeter data, confirms the accuracy of the FPM for these two regions; they were also added to other 16 regions (with 50–year data).

The authors have used the last five years as the testing period and the rest of the data as training period.

Table 5 shows the selected models with their references and parameters.

Among the numerous empirical methods to estimate ETo, nine models were selected that had the best performance under different climates on the basis of previous investigations [1,46,47,48,49,50], and the results were compared with the FPM.

ETo is the reference crop evapotranspiration (mm/day), R_n is the net radiation (MJ/m²/day), G is the soil heat flux (MJ/m²/day), γ is the psychrometric constant (kPa/°C), e_s is the saturation vapor pressure (kPa), e_a is the actual vapor pressure (kPa), Δ is the slope of the saturation vapor pressure–temperature curve (kPa/°C), T is the average daily air temperature (°C), u is the mean daily wind speed at 2 m (m/s), H is the elevation (m), φ is the latitude (rad), T_min is the minimum air temperature (°C), T_max is the maximum air temperature (°C), RH is the average relative humidity (%), n is the actual duration of sunshine (hr), R_s is the solar radiation (MJ/m²/day), R_a is the extraterrestrial radiation (MJ/m²/day), λ is the latent heat of vaporization (MJ/kg), and C_T, I, K_T, a, b, P, a_t, and T_x are empirical coefficients

Genetic algorithm (GA), genetic programming (GP), and gene expression programming (GEP) are different kinds of data-driven machine learning techniques to find an optimum solution for complex problems by artificial intelligence (AI). In a cell, the expression of the genetic information is an intricate procedure involving more than one hundred molecules. Two of the major players are DNA and proteins. DNA is the carrier of the genetic information and the proteins read and express the genetic information. GA employed biological evolution theory for computer applications (i.e., AI). In fact, GA is an oversimplification of biological evolution [51]. GP, introduced by [52,53], solves the problem of fixed length solutions (as stated for GA) by creating nonlinear entities. Each entity (parse tree) has a distinguished shape and size. GEP is the reliable modification of GP and GA, combining both the methodology of simple, linear chromosomes with fixed length (GA) and branched structures of various sizes and shapes (GP). GEP applies the same type of diagram representation of GP, however the entities (expression trees) evolved by GEP are the expression of a linear genome. The GEP is a constructive approach because the search operators of the GEP may constantly generate valid structures and are highly suited to genetic diversity. The GEP surpasses the old GP system for more than 100 times. In this study, the GA was employed to understand which parameters are more influential on the estimation accuracy of ETo. In addition, the GEP was used to model ETo considering the obtained results by the GA to achieve the maximum accuracy compared with the empirical methods. Therefore, a hybrid data-driven machine learning technique (considering both GA and GEP) was applied to model ETo in various climates.

To evaluate the accuracy of the models two indices were used as follows

R M S E = \sqrt{\frac{\sum_{i = 1}^{n} {(X_{i} - Y_{i})}^{2}}{N}}

(1)

R^{2} = \frac{{[\sum_{i = 1}^{n} (X_{i} - \bar{X}) (Y_{i} - \bar{Y})]}^{2}}{\sum_{i = 1}^{n} {(X_{i} - \bar{X})}^{2} \sum_{i = 1}^{n} {(Y_{i} - \bar{Y})}^{2}}

(2)

where, X_i and Y_i are the ith observed and estimated values, respectively;

\bar{X}

and

\bar{Y}

are the mean of X_i and Y_i; and N is the total numbers of data.

3. Results

3.1. Performance of Genetic Algorithm (GA) for Different Parameters

Table 6 shows that although GA8 (combination of all parameters) was obtained with the highest accuracy (RMSE = 0.3030 mm day⁻¹) in Iran (average), it requires four parameters: Tmean, RH, WS, and n. On the other hand, GA5 is also obtained, which is approximately equal to GA8 (RMSE = 0.3263 mm day⁻¹) with less input data (only Tmean and WS). Meanwhile, GA5 estimated ETo throughout Iran with more accuracy than GA2, GA3, and GA4. This means that the WS was introduced as the most important factor (after Tmean) to control the dynamics of ETo process throughout Iran.

Table 6 reveals that a function of the Tmean and WS may predict ETo with acceptable accuracy. Therefore, this result is the basis for the development of the genetic models using the GEP. However, the Tmean is the first parameter to be measured in each station or region. Hence, the Tmean is considered as the input parameter in the entire GEP models in this study.

3.2. Performance of Gene Expression Programming (GEP) for Mean Temperature (Tmean) as Input Data

In the first step of the GEP—ETo—was estimated in all 18 stations using only the Tmean and then compared with the FPM (Table 7).

According to Table 7, the best performance of the GEP belongs to Rasht (RMSE = 0.1100 mm day⁻¹), while, the worst accuracy was reported for Zabol (RMSE = 0.8229 mm day⁻¹). In 56% of regions the GEP was obtained with a RMSE < 0.3000 mm day⁻¹. Furthermore, the natural logarithm (ln) function was employed more than sinus, cosines, and particularly exponential functions in the GEP structures. It means that we can expect to have logarithmic relationship between ETo and Tmean more than other kinds of functions. Moreover, plus and minus functions were used more than multiplication and specially division.

3.3. Performance of Gene Expression Programming (GEP) for Mean Temperature (Tmean), Minimum Temperature (Tmin), and Maximum Temperature (Tmax) as Input Data

In the second step of the GEP, ETo was estimated in all 18 stations using the Tmean, Tmin, and Tmax was then compared with the FPM.

The results reveal that the best performance of the GEP belongs to Rasht (RMSE = 0.0884 mm day⁻¹), while the worst accuracy was reported for Zabol (RMSE = 0.8020 mm day⁻¹). In 61% of regions the GEP was obtained with a RMSE < 0.3000 mm day⁻¹. Furthermore, sine and cosine functions were employed more than the natural logarithm (ln) and particularly exponential functions in the GEP structures. It means that we can expect to have periodic relationship between ETo and temperature more than other kinds of functions. Moreover, plus and minus functions were used more frequent than multiplication sign and particularly division sign. In all regions, the accuracy was improved compared to the GEP models based on the Tmean only.

3.4. Performance of Gene Expression Programming (GEP) for Mean Temperature (Tmean) and Wind Speed (WS) as Input Data

With respect to Table 6, WS was introduced as the most important factor to control the variations of ETo. Thus, in the third step of GEP, ETo was estimated in all 18 stations using the Tmean, and WS was then compared with the FPM (Table 8).

Table 8 represents that the best performance of the GEP belongs to Shahrekord (RMSE = 0.0650 mm day⁻¹), while the worst accuracy was reported for Kerman (RMSE = 0.4177 mm day⁻¹). In 83% of regions the GEP resulted with a RMSE < 0.2000 mm day⁻¹. Furthermore, the natural logarithm (ln) function was employed more than sinus, cosines, and particularly exponential functions in the GEP structures. Moreover, plus and minus functions were used more than multiplication sign and specially division. In all regions (with the exception of Rasht), the accuracy was improved compared to the GEP models based on the Tmean, Tmin, and Tmax. Therefore, for Rasht, wind speed is not required. It may correspond to minimum diurnal temperature rate (DTR) in very humid regions compared to other climates (role of relative humidity and saturated vapor pressure). However, wind speed-based GEP models predicted ETo in arid, semiarid, and Mediterranean climates more accurate than the other models.

It should be noted that the arctan has not acceptable accuracy compared to other functions. In addition, the results represent that ln and exp functions have a better performance than sine and cosine functions. Therefore, in future investigations, we can more focus on other functions than arctan. In addition, we can expect to have logarithmic relationship between ETo and temperature/wind speed more than other kinds of functions.

The obtained results are comparable and sometimes better than those reported by Wang et al. [54]. They resulted RMSE between 0.222 to 0.555 mm day⁻¹. In addition, Mehdizadeh [55] reported RMSE for GEP models from 0.46 to 2.08 mm day⁻¹.

3.5. Performance of Gene Expression Programming (GEP) for Mean Temperature (Tmean), Wind Speed (WS), and Relative Humidity (RH) as Input Data

The results of the GA revealed that use of the RH and n do not increase accuracy of the GEP significantly (Table 6). This is also confirmed by the GEP.

According to the results, the best performance of the GEP was for Esfahan (RMSE = 0.0730 mm day⁻¹), while the worst accuracy was reported for Kerman (RMSE = 0.4252 mm day⁻¹). In 89% of regions the GEP resulted with a RMSE < 0.2000 mm day⁻¹. Furthermore, sinus and cosines functions were employed more than the natural logarithm (ln) and particularly exponential functions in the GEP structures. Moreover, plus and minus functions were used more than multiplication sign and specially division.

A comparison of Table 8 indicates that adding the RH as input parameter not only did not increase the GEP’s accuracy, but also led to the reduction in the accuracy relevant to 56% of the regions. In addition, the improvements are inconsiderable. It should be noted that the best structures of the GEP are not a function of the RH in 44% of the regions (Arak, Bushehr, Mashhad, Moghan, Qazvin, Sanandaj, Shahrekord, Shiraz, Tabriz, and Yazd). It is an important result and confirms that the Tmean and WS have more value to be used as input variable for the GEP compared to the RH.

3.6. Accuracy of Empirical Models Against Gene Expression Programming (GEP)

Figure 2 and Figure 3 show that the best empirical models with respect to the smallest RMSE values for estimating ETo on a monthly scale lack a good performance at annual scale and this is alarming considering the importance of the agricultural water management to deal with water crisis in the world. However, the GEP may predict ETo in the most regions of Iran with considerable accuracy.

In arid regions (Figure 2), the best accuracy belongs to Esfahan (R² = 0.9217) and the worst accuracy belongs to Kerman (R² = 0.3636), due to poor performance of the GEP for peak events in this region. In the semiarid regions (Figure 3), the best accuracy belongs to Shahrekord (R² = 0.9403) and the worst accuracy belongs to Hamedan (R² = 0.8049).

The obtained results are comparable and sometimes better than those reported by Wang et al. [54]. They resulted R² between 0.639 to 0.944. In addition, Mehdizadeh [55] reported R² for GEP models from 0.084 to 0.969.

4. Discussion

The current research shows that the GEP is a robust tool to model ETo in semiarid and Mediterranean climates (R² > 0.80). However, the use of the GEP should be recommended cautiously in Rasht, Bushehr, and Kerman (R² < 0.50) due to its poor performance in extreme events of these areas. To enhance the accuracy of the GEP in extreme values, use of hybrid methods (coupling GEP with ANN, fuzzy logic, honey bee algorithm, etc.) can be recommended.

The results of this study has a potential to be compared with real data. However, reference evapotranspiration (ETo) refers to maximum evapotranspiration in the best growing conditions of a crop without any limitation (drought and salinity stresses). However, plant requirements evapotranspiration (actual evapotranspiration) refers to actual conditions of a crop in the field considering all limitations. Therefore, it is not acceptable to compare these to variables. To this end, a crop coefficient (Kc) must be considered to characterize actual evapotranspiration (ETa); Eta = Kc × ETo.

Considering complex processes of evapotranspiration, there are many difficulties to measure this parameter. Some of the methods for this end are lysimeter, remote sensing, eddy covariance, and Bowen ratio. Allen et al. [35] determined an accurate model (FPM) which has the highest adaptability with lysimeter measurements in all over the world. Therefore, in this study, such as most of previous investigations [13,18,19,20], the authors compared the outputs of the GEP with the FPM and the results (Figure 1 and Figure 2) appear that the accuracy of the GEP has been improved compared to empirical models.

It should be noted that we cannot see similar results for the stations. In fact, all the results are completely independent of each other. This is in line with previous investigations [1,2].

It is notable that, in future research, the findings obtained from the GEP models in comparison to what is noted in the biological literature as affecting this process, should be checked to achieve a reliable result to recommend the GEP for other regions and research.

The GEP has worse performance over extreme conditions. The reason is related to the structure of GEP models. In this study, the authors focused on the models in which we need one or two weather parameters (mean temperature and mean wind speed) to estimate ETo. However, in extreme conditions, sometimes other variables, like maximum and minimum temperatures, maximum wind speed, rainfall, minimum humidity, solar radiation, and vapor pressure deficit, are also important. Therefore, if we are looking to increase the accuracy of GEP model we should consider more variables. However, in many regions we have no access to all of weather parameters to apply them to support GEP models to estimate ETo.

The next step of this study is to train the GA/GEP using ETo calculated at one site with all the meteorological variables available. Then, use the resulting functions to estimate ETo at a site where there are insufficient meteorological variables. Indeed, if we do have enough meteorological information to calculate ETo, we can then train the GA/GEP on fewer input variables.

5. Conclusions

In this study, the capability of both GA and GEP was assessed using 50-year time series data under 18 regions in Iran with arid, semiarid, very humid, and Mediterranean climates. The GA suggested that use of a double-parameter basis including the Tmean and WS may predict ETo with good accuracy in arid, semiarid, and Mediterranean regions. However, in very humid regions, temperature-based models (Tmean, Tmax, and Tmin) are better alternatives to reduce uncertainty. Finally, the GEP is recommended to evaluate the annual dynamics of ETo process in arid, semiarid, and Mediterranean climates with reliable accuracy under conditions where meteorological information is limited. In this study, we represented the best structures of GEP models. In future research, when a researcher or expert is going to use an accurate model based on this study, s/he may employ the best structures of GEP whose have been considered for his/her climate region.

Author Contributions

M.V. conceived and designed the study, performed the models, and wrote the manuscript. M.A.G.S. and M.R.-S. supervised the study. S.M.G. reviewed the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

Shiri, J.; Sadraddin, A.A.; Nazemi, A.H.; Kisi, O.; Landeras, G.; Fard, A.F.; Marti, P. Generalizability of gene expression programming-based approaches for estimating daily reference evapotranspiration in coastal stations of Iran. J. Hydrol. 2014, 508, 1–11. [Google Scholar] [CrossRef]
Shiri, J.; Kisi, O.; Landeras, G.; Lopez, J.J.; Nazemi, A.H.; Stuyt, L.C. Daily reference evapotranspiration modeling by using genetic programming approach in the Basque Country (Northern Spain). J. Hydrol. 2012, 414, 302–316. [Google Scholar] [CrossRef]
Parasuraman, K.; Elshorbagy, A.; Carey, S.K. Modelling the dynamics of the evapotranspiration process using genetic programming. Hydrol. Sci. J. 2007, 52, 563–578. [Google Scholar] [CrossRef]
Mehr, A.D. An improved gene expression programming model for streamflow forecasting in intermittent streams. J. Hydrol. 2018, 563, 669–678. [Google Scholar] [CrossRef]
Mehr, A.D.; Nourani, V. A Pareto-optimal moving average-multigene genetic programming model for rainfall-runoff modelling. Environ. Model. Softw. 2017, 92, 239–251. [Google Scholar] [CrossRef]
Babovic, V.; Keijzer, M. Rainfall runoff modeling based on genetic programming. Nord. Hydrol. 2002, 33, 331–343. [Google Scholar] [CrossRef]
Liong, S.Y.; Gautam, T.R.; Khu, S.T.; Babovic, V.; Keijzer, M.; Muttil, N. Genetic programming: A new paradigm in rainfall runoff modeling. J. Am. Water Res. Assoc. 2002, 38, 705–718. [Google Scholar] [CrossRef]
Aytek, A.; Alp, M. An application of artificial intelligence for rainfall runoff modeling. J. Earth Syst. Sci. 2008, 117, 145–155. [Google Scholar] [CrossRef]
Aytek, A.; Kisi, O. A genetic programming approach to suspended sediment modeling. J. Hydrol. 2008, 351, 288–298. [Google Scholar] [CrossRef]
Harris, E.L.; Babovic, V.; Falconer, R.A. Velocity predictions in compound channels with vegetated flood plains using genetic programming. Int. J. River Basin Manag. 2003, 1, 117–123. [Google Scholar] [CrossRef]
Babovic, V.; Drecourt, J.P.; Keijzer, M.; Hansen, P.F. A data mining approach to modeling of water supply assets. Urban Water 2002, 4, 401–414. [Google Scholar] [CrossRef]
Terzi, O.; Keskin, M.E. Evaporation estimation using gene expression programming. J. Appl. Sci. 2005, 5, 508–512. [Google Scholar]
Irmak, A.; Kamble, B. Evapotranspiration data assimilation with genetic algorithms and SWAP model for on−demand irrigation. Irrig. Sci. 2009, 28, 101–112. [Google Scholar] [CrossRef]
Traore, S.; Guven, A. Regional-specific numerical models of evapotranspiration using gene-expression programming interface in Sahel. Water Resour. Manag. 2012, 26, 4367–4380. [Google Scholar] [CrossRef]
Shiri, J. Evaluation of FAO56-PM, empirical, semi-empirical and gene expression programming approaches for estimating daily reference evapotranspiration in hyper-arid regions of Iran. Agric. Water Manag. 2017, 188, 101–114. [Google Scholar] [CrossRef]
Traore, S.; Luo, Y.; Fipps, G. Gene-expression programming for short-term forecasting of daily reference evapotranspiration using public weather forecast information. Water Resour. Manag. 2017, 31, 4891–4908. [Google Scholar] [CrossRef]
Mattar, M.A. Using gene expression programming in monthly reference evapotranspiration modeling: A case study in Egypt. Agric. Water Manag. 2018, 198, 28–38. [Google Scholar] [CrossRef]
Kim, S.; Kim, H.S. Neural networks and genetic algorithm approach for nonlinear evaporation and evapotranspiration modeling. J. Hydrol. 2008, 351, 299–317. [Google Scholar] [CrossRef]
Izadifar, Z.; Elshorbagy, A. Prediction of hourly actual evapotranspiration using neural networks, genetic programming, and statistical models. Hydrol. Process. 2010, 24, 3413–3425. [Google Scholar] [CrossRef]
Kisi, O.; Guven, A. Evapotranspiration modeling using linear genetic programming technique. J. Irrig. Drain. Eng. 2010, 136, 715–723. [Google Scholar] [CrossRef]
Eslamian, S.S.; Gohari, S.A.; Zareian, M.J.; Firoozfar, A. Estimating Penman–Monteith reference evapotranspiration using artificial neural networks and genetic algorithm: A case study. Arab. J. Sci. Eng. 2012, 37, 935–944. [Google Scholar] [CrossRef]
Wang, Y.; Guo, S.; Chen, H.; Zhou, Y. Comparative study of monthly inflow prediction methods for the Three Gorges Reservoir. Stoch. Environ. Res. Risk Assess. 2014, 28, 555–570. [Google Scholar] [CrossRef]
Shiri, J.; Sadraddini, A.A.; Nazemi, A.H.; Kisi, O.; Marti, P.; Fard, A.F.; Landeras, G. Evaluation of different data management scenarios for estimating daily reference evapotranspiration. Hydrol. Res. 2013, 44, 1058–1070. [Google Scholar] [CrossRef]
Marti, P.; Gonzalez-Altozano, P.; Lopez-Urrea, R.; Mancha, L.A.; Shiri, J. Modeling reference evapotranspiration with calculated targets. Assessment and implications. Agric. Water Manag. 2015, 149, 81–90. [Google Scholar] [CrossRef]
Piri, H. Evaluation of computational methods for estimation of reference evapotranspiration with lysimeter data (case study: Sistan Plan). Water Irrig. 2012, 3, 50–62, (In Persian with English Abstract). [Google Scholar]
Pouryazdankhah, H.; Razavipour, T.; Khaledian, M.R.; Rezaei, M. Determination of proper methods to estimate reference evapotranspiration in Rasht. In Proceedings of the 3rd National Conferene on Comperehensive Water Resources Management, Sari, Iran, 10 September 2012; Available online: http://www.civilica.com/Paper–NCUIMWR03–NCUIMWR03_115.html (accessed on 17 July 2012). (In Persian).
Razzaghi, F.; Sepaskhah, A. Evaluation of different reference crop evapotranspiration methods using weithed lysimeter data. In Proceedings of the 9th Conference on Irrigation and Reduction of Evaporation, Kerman, Iran, 4–6 February 2007; Available online: http://www.civilica.com/Paper–ABYARI09–ABYARI09_029.html (accessed on 11 October 2007). (In Persian).
Rezaei, A.; Bakhtiari, B.; Houshyaripour, F.; Dehghani Anari, M. Evaluation of different estimation methods for reference evapotranspiration using lysimeter measurements. In Proceedings of the 9th Conference on Irrigation and Reduction of Evaporation, Kerman, Iran, 4–6 February 2007; Available online: http://www.civilica.com/Paper–ABYARI09–ABYARI09_017.html (accessed on 11 October 2007). (In Persian).
Rezvani, S.V.A.; Fathi, P.; Khodamoradpour, M.; Azizpour, S. Evaluation and verification of computational equations of reference evapotranspiration in Sanandaj. In Proceedings of the 1st Conference on Applied Researches of Iran Water Resources, Kermanshah, Iran, 11 May 2010; Available online: http://www.civilica.com/Paper–INCWR01–INCWR01_073.html (accessed on 9 June 2010). (In Persian).
Shayannezhad, M. Comparison of accuracy of artificial neural networks and Penman–Monteith methods to estimate potential evapotranspiration. In Proceedings of the 1st National Conference on Management of Irrigation and Drainage Networks, Ahvaz, Iran, 25 August 2006; Available online: http://www.civilica.com/Paper–IDNC01–IDNC01_001.html (accessed on 3 September 2006). (In Persian).
Tafazoli, F.; Sabziparvar, A.A.; Zare Abyaneh, H.; Banzhad, H. Evaluation of conventional reference evapotranspiration models in cold and arid climate to optimal use of radiation models. In Proceedings of the 9th Conference on Irrigation and Reduction of Evaporation, Kerman, Iran, 4–6 February 2007; Available online: http://www.civilica.com/Paper–ABYARI09–ABYARI09_012.html (accessed on 11 October 2007). (In Persian).
Tanian, S.; Mirmasoudi, S.S.; Ghiami, F.; Zare Abyaneh, H. Evaluation of reference evapotranspiration using lysimeter data in Urmia. In Proceedings of the 2nd National Conference on Management of Irrigation and Drainage Networks, Ahvaz, Iran, 28–30 January 2008; Available online: http://www.civilica.com/Paper–IDNC02–IDNC02_277.html (accessed on 3 September 2008). (In Persian).
Vahidi, A. Evaluation of different estimation methods for reference evapotranspiration using weighed lysimeter. In Proceedings of the 10th Conference on Irrigation and Reduction of Evaporation, Kerman, Iran, 2–3 December 2009; Available online: http://www.civilica.com/Paper–ABYARI10–ABYARI10_197.html (accessed on 11 October 2009). (In Persian).
Zare Abyaneh, H.; Ghasemi, A.; Ahmadi, M. Determination of the most proper method to estimate reference crop evapotranspiration in comparison with empirical methods for Hamedan. In Proceedings of the 9th Conference on Irrigation and Reduction of Evaporation, Kerman, Iran, 4–6 February 2007; Available online: http://www.civilica.com/Paper–ABYARI09–ABYARI09_022.html (accessed on 11 October 2007). (In Persian).
Allen, R.G.; Pereira, L.S.; Raes, D.; Smith, M. Crop Evapotranspiration—Guidelines for Computing Crop Water Requirements—FAO Irrigation and Drainage Paper 56; FAO: Rome, Italy, 1998. [Google Scholar]
Hargreaves, G.L.; Samani, Z.A. Reference crop evapotranspiration from temperature. Appl. Eng. Agric. 1985, 1, 96–99. [Google Scholar] [CrossRef]
Jensen, M.E.; Haise, H.R. Estimation of evapotranspiration from solar radiation. J. Irrig. Drain. Div. 1963, 89, 15–41. [Google Scholar]
WMO. Measurement and Estimation of Evaporation and Evapotranspiration; Tech. Pap. (CIMO–Rep) 83; Royal Meteorological Society: Geneva, Switzerland, 1966. [Google Scholar]
Abtew, W. Evapotranspiration measurements and methoding for three wetland systems in South Florida. J. Am. Water Resour. Assoc. 1996, 32, 465–473. [Google Scholar] [CrossRef]
Makkink, G.F. Testing the Penman formula by means of lysimeters. J. Instit. Water Eng. 1957, 11, 277–288. [Google Scholar]
Turc, L. Estimation of irrigation water requirements, potential evapotranspiration: A simple climatic formula evolved up to date. Ann. Agron. 1961, 12, 13–49. [Google Scholar]
Xu, C.Y.; Singh, V.P.; Chen, Y.D.; Chen, D. Evaporation and evapotranspiration. In Hydrology and Hydraulics, 1st ed.; Singh, V.P., Ed.; Water Resources Publications, LLC: Highlands Ranch, CO, USA, 2008; pp. 229–276. [Google Scholar]
Blaney, H.F.; Criddle, W.D. Determining Water Requirements in Irrigated Areas from Climatological and Irrigation Data; Soil Conservation Service Technical Paper 96; Soil Conservation Service, US Department of Agriculture: Washington, DC, USA, 1950.
Droogers, P.; Allen, R.G. Estimating reference evapotranspiration under inprecise data conditions. Irrig. Drain. Syst. 2002, 16, 33–45. [Google Scholar] [CrossRef]
Trajkovic, S. Hargreaves versus Penman–Monteith under Humid Condition. J. Irrig. Drain. Eng. 2007, 133, 38–42. [Google Scholar] [CrossRef]
Ahmadi, S.H.; Fooladmand, H.R. Spatially distributed monthly reference evapotranspiration derived from the calibration of Thornthwaite equation: A case study, South of Iran. Irrig. Sci. 2008, 26, 303–312. [Google Scholar] [CrossRef]
Almorox, J.; Quej, V.H.; Mari, P. Global performance ranking of temperature–based approaches for evapotranspiration estimation considering Köppen climate classes. J. Hydrol. 2015, 528, 514–522. [Google Scholar] [CrossRef]
Caporusso, N.B.; Rolim, G.D.S. Reference evapotranspiration models using different time scales in the Jaboticabal region of São Paulo, Brazil. Acta Sci. Agron. 2015, 37, 1–9. [Google Scholar] [CrossRef]
Heydari, M.M.; Tajamoli, A.; Ghoreishi, S.H.; Darbe–Esfahani, M.K.; Gilasi, H. Evaluation and calibration of Blaney–Criddle equation for estimating reference evapotranspiration in semiarid and arid regions. Environ. Earth Sci. 2015, 74, 4053–4063. [Google Scholar] [CrossRef]
Mallikarjuna, P.; Jyothy, S.A.; Murthy, D.S.; Reddy, K.C. Performance of recalibrated equations for the estimation of daily reference evapotranspiration. Water Resour. Manag. 2014, 28, 4513–4535. [Google Scholar] [CrossRef]
Ferreira, C. Gene Expression Programming Mathematical Modeling by an Artificial Intelligence; Studies in Computational Intelligence, 21; Springer: Berlin/Heidelberg, Germany, 2006; ISBN 978-3-540-32849-0. [Google Scholar]
Koza, J.R. Genetic Programming: On the Programming of Computers by Means of Natural Selection; The MIT Press: Cambridge, MA, USA, 1992. [Google Scholar]
Wang, L.; Kisi, O.; Hu, B.; Bilal, M.; Zounemat-Kermani, M.; Li, H. Evaporation modelling using different machine learning techniques. Int. J. Climatol. 2017, 37, 1076–1092. [Google Scholar] [CrossRef]
Wang, S.; Lian, J.; Peng, Y.; Hu, B.; Chen, H. Generalized reference evapotranspiration models with limited climatic data based on random forest and gene expression programming in Guangxi, China. Agric. Water Manag. 2019, 221, 220–230. [Google Scholar] [CrossRef]
Mehdizadeh, S. Estimation of daily reference evapotranspiration (ETo) using artificial intelligence methods: Offering a new approach for lagged ETo data-based modeling. J. Hydrol. 2018, 559, 794–812. [Google Scholar] [CrossRef]

Figure 1. Location of the weather stations.

Figure 2. Accuracy of empirical models (annual mean ETo) against gene expression programming (GEP) in very humid (Rasht), Mediterranean (Sanandaj), and arid (other stations) climates.

Figure 3. Accuracy of empirical models (annual mean ETo) against gene expression programming (GEP) in semiarid climates.

Table 1. Structure of genetic algorithm (GA) designed in this study.

Base	Formula
Mean Temperature	$E T o = a_{1} {(T_{m e a n})}^{a_{2}} + a_{3}$
Differential Temperature	$E T o = b_{1} {(T_{\max} - T_{\min})}^{b_{2}} + b_{3}$
Relative Humidity	$E T o = c_{1} {(R H)}^{c_{2}} + c_{3}$
Wind Speed	$E T o = e_{1} {(W S)}^{e_{2}} + e_{3}$
Solar Radiation	$E T o = f_{1} {(n)}^{f_{2}} + f_{3}$
Total (Summation)	${(E T o)}_{G A} = \sum_{i = 1}^{5} E T o_{i}$
Total (Multiplying)	${(E T o)}_{G A} = \prod_{i = 1}^{5} E T o_{i}$
Goal Function	$M i n i m i z e [{(E T o)}_{G A} - {(E T o)}_{F P M}]$

Table 2. Parameters of genetic algorithm (GA) coded in this study by using MATLAB environment.

Value	Option	Value	Option
1.00E-06	TolFun	doubleVector	PopulationType
1.00E-06	TolCon	[2 × 1 double]	PopInitRange
10	InitialPenalty	20	PopulationSize
100	PenaltyFactor	2	EliteCount
1	PlotInterval	0.8	CrossoverFraction
@gacreationuniform	CreationFcn	Forward	MigrationDirection
@fitscalingrank	FitnessScalingFcn	20	MigrationInterval
@selectionstochunif	SelectionFcn	0.2	MigrationFraction
@crossoverscattered	CrossoverFcn	100	Generations
{[ @mutationgaussian ] [1] [1]}	MutationFcn	Inf	TimeLimit
Final	Display	-Inf	FitnessLimit
Off	Vectorized	50	StallGenLimit
Never	UseParallel	Inf	StallTimeLimit
1.00E-06	TolFun	doubleVector	PopulationType
1.00E-06	TolCon	[2 × 1 double]	PopInitRange

Table 3. Parameters of gene expression programming (GEP) employed in this study by using GeneXpro Tools version 5.0.

Value	Option
+, −, ×, ÷, ^, √, ∛, ∜, sin, cos, ln, exp	Function set
500	Population size
95	Mutation frequency
50	Crossover frequency
217	Number of replication
30	Block mutation rate
30	Instruction mutation rate
40	Instruction data mutation rate
95	Homologous crossover
Ranking	Selection method
1	Recombination rate
0.2	Mutation rate
Generalize	Alternative method
Yes	Elitism
6	Maximum level of tree

Table 4. Position and climate of the stations with length of collected data.

Station Name	ICAO Code	North Latitude	East Longitude	Altitude (masl)	Start Year	End Year	Climate
Ahvaz	40811	31°20′	48°40′	22.5	1961	2010	Arid
Arak	40769	34°6′	49°46′	1708.0	1961	2010	Semiarid
Bushehr	40858	28°58′	50°49′	9.0	1961	2010	Arid
Esfahan	40800	32°37′	51°40′	1550.4	1961	2010	Arid
Hamedan	40768	34°52′	48°32′	1741.5	1961	2010	Semiarid
Jiroft	40866	28°35′	57°48′	601.0	1989	2009	Arid
Kerman	40841	30°15′	56°58′	1753.8	1961	2010	Arid
Mashhad	40745	36°16′	59°38′	999.2	1961	2010	Semiarid
Moghan	40700	39°39′	47°55′	31.9	1984	2010	Semiarid
Qazvin	40731	36°15′	50°3′	1279.2	1961	2010	Semiarid
Rasht	40719	37°19′	49°37′	-8.6	1961	2010	Very humid
Sanandaj	40747	35°20′	47°0′	1373.4	1961	2010	Mediteranean
Shahrekord	40798	32°17′	50°51′	2048.9	1961	2010	Semiarid
Shiraz	40848	29°32′	52°36′	1484.0	1961	2010	Semiarid
Tabriz	40706	38°5′	46°17′	1361.0	1961	2010	Semiarid
Urmia	40712	37°40′	45°3′	1328.0	1961	2010	Semiarid
Yazd	40821	31°54′	54°17′	1237.2	1961	2010	Arid
Zabol	40829	31°2′	61°29′	489.2	1961	2010	Arid

Table 5. Selected models to estimate reference evapotranspiration including their references, formulae, and parameters.

Model	Reference(s)	Formula	Parameters
FAO Penman-Monteith (FPM)	[35]	$E T_{o} = \frac{0.408 (R_{n} - G) + γ \frac{900}{T + 273} u (e_{s} - e_{a})}{Δ + γ (1 + 0.34 u)}$	H,φ,T,Tmin,Tmax,RH,u,n
Hargreaves-Samani (HS)	[36]	$E T_{o} = 0.005508 K_{T} R_{a} {(T_{\max} - T_{\min})}^{0.5} (T + 17.8)$	T,u,Tmin,Tmax,RH,n,φ
Jensen-Haise (JH)	[37]	$E T_{o} = 0.408 C_{T} (T - T_{x}) R_{s}$	T,Rs,H,Tmax,Tmin
WMO	[38]	$E T_{o} = (1.298 + 0.934 u) (e_{s} - e_{a})$	T,Tmin,Tmax,RH,u
Abtew (Ab)	[39]	$E T_{o} = 0.01786 \frac{R_{s} T_{\max}}{λ}$	T,Tmax,Rs
Makkink (Mk)	[40]	$E T_{o} = 0.61 \frac{Δ}{Δ + γ} \frac{R_{s}}{λ} - 0.12$	T,Rs
Turc (Tu)	[41,42]	$E T_{o} = (0.3107 R_{s} + 0.65) \frac{T a_{t}}{T + 15}$	T,RH,Rs
Blaney-Criddle (BC)	[43]	$E T_{o} = a + b P (0.46 T + 8.13) (1 + 0.0001 H)$	H,T,n,RHmin,φ,u
Modified Hargreaves-Samani 1 (MHS1)	[44]	$E T_{o} = 0.0005304 R_{a} {(T_{\max} - T_{\min} - 0.0123 R)}^{0.76} (T + 17)$	T,Tmin,Tmax,φ,R
Modified Hargreaves-Samani 2 (MHS2)	[45]	$E T_{o} = 0.0009384 R_{a} {(T_{\max} - T_{\min})}^{0.424} (T + 17.8)$	T,Tmin,Tmax,φ

Table 6. Performance of genetic algorithm (GA) for different parameters.

Code	Parameters	Root Mean Square Error (RMSE) (mm/day)
GA1	Tmean	0.6823
GA2	Tmean,Tmin,Tmax	0.5049
GA3	Tmean,RH	0.5733
GA4	Tmean,n	0.6250
GA5	Tmean,WS	0.3263
GA6	Tmean,WS,RH	0.3087
GA7	Tmean,WS,n	0.3211
GA8	Tmean,WS,RH,n	0.3030

Table 7. Performance of gene expression programming (GEP) for mean temperature (Tmean) as input data.

Region	The Best Structure	Training RMSE (mm/day)	Testing RMSE (mm/day)
Ahvaz	$E T o = \ln (58.598 + T m e a n) + \ln (\ln (T m e a n + 4) + T m e a n + \sin (T + 5 + \sin (T m e a n)))$	0.5789	0.6839
Arak	$E T o = \sqrt{- \sin (T m e a n^{2} + 11 T m e a n + 28) - 0.821 - \sin (3 T m e a n) + {(1.386 T m e a n + 8.318)}^{0.754}}$	0.5321	0.2542
Bushehr	$E T o = - 0.011 \cos (8 T m e a n) - (4.289 + 1.464 T m e a n)$	0.3230	0.2190
Esfahan	$E T o = \ln ({(\ln (T m e a n))}^{2.197} + 4^{\ln (T m e a n)} + \sin (T m e a n^{6} + 1.609) + \ln (2.398 + {0.693}^{\ln (T m e a n)}))$	0.1200	0.3361
Hamedan	$E T o = \ln (\ln (\frac{\exp (4 T m e a n)}{\ln (\frac{3 T m e a n + 3}{T m e a n})}))$	0.2614	0.2844
Jiroft	$E T o = \exp (1.016 (\ln (5.307 - \cos (T m e a n) + \cos (T m e a n - 1))))$	0.1980	0.2963
Kerman	$E T o = 5 - \cos (4 - \cos (2 - \cos (T m e a n)))$	0.2090	0.5285
Mashhad	$E T o = {(T m e a n + \sin {(T m e a n + 3)}^{1.609} + {(\sin (2.221))}^{T m e a n})}^{\ln \sqrt{\ln (T m e a n)}}$	0.1417	0.2707
Moghan	$E T o = \exp (3^{0.209 \cos^{16} (T m e a n^{3})})$	0.2871	0.3207
Qazvin	$E T o = \sqrt[4]{T m e a n - 3} - 0.945 + \cos (\sin (\exp (\frac{T m e a n}{3})))$	0.2508	0.3355
Rasht	$E T o = \ln (\frac{T m e a n}{12 - 48 T m e a n} - 10 - 1.115 \sin (T m e a n) - \cos (T m e a n))$	0.2391	0.1100
Sanandaj	$E T o = \sqrt{T m e a n + 0.041 + \sin (0.414 - T m e a n) + \cos (\sqrt{T m e a n + 2} - 6)}$	0.1478	0.2858
Shahrekord	$E T o = \sqrt{12.683 - {(\ln (4 T m e a n - 2.079))}^{\cos (\sin (T m e a n^{5}))}}$	0.2172	0.2667
Shiraz	$E T o = \cos (\sin (2 - \sin (T m e a n - 3))) + \frac{0.336}{T m e a n - 4.375} + 4$	0.2251	0.2918
Tabriz	$E T o = \ln (\ln (\cos (T m e a n + 4) + 4.255)) + \sqrt{1.302 + T m e a n}$	0.2873	0.3002
Urmia	$E T o = \sqrt{T m e a n - \exp (\sin \sqrt{T m e a n - 8})}$	0.2036	0.2396
Yazd	$E T o = \frac{5.071 + \sqrt{6 + T m e a n}}{\sqrt{\frac{T m e a n}{4}} \cos (\ln (\sqrt{T m e a n - 17}))}$	0.4517	0.5519
Zabol	$E T o = \frac{4.256 T m e a n^{2}}{\ln (T m e a n^{3} + 2.545 T m e a n + 2980.958)}$	0.7208	0.8229

Table 8. Performance of gene expression programming (GEP) for mean temperature (Tmean) and wind speed (WS) as input data.

Region	The Best Structure	Training RMSE (mm/day)	Testing RMSE (mm/day)
Ahvaz	$E T o = 3.189 - \sqrt{W S - 2 + T m e a n} + \sqrt{7 + T m e a n}$	0.2008	0.2128
Arak	$E T o = \ln (\sqrt{\sqrt{2.197 \ln (2 T m e a n)} ((T m e a n - 4) W S) + 1.386})$	0.0963	0.1079
Bushehr	$E T o = \ln (17.167 W S + 137.339) - \frac{\exp (\cos \frac{1}{T m e a n})}{\exp (W S) + \cos (W S \times T m e a n)} - \frac{\exp (\cos (4 - T m e a n - \frac{W S}{8}))}{W S + \exp (\ln (W S) - 0.8)}$	0.2018	0.1640
Esfahan	$E T o = \sqrt{T m e a n} - \cos (W S) - 0.006 - \ln (\sqrt[4]{2 - \cos (W S)})$	0.0835	0.0925
Hamedan	$E T o = 5 - \exp (\exp (\frac{T m e a n \sqrt{T m e a n}}{12})) - {0.518}^{\frac{W S^{3.5}}{11 + \frac{8}{W S}}}$	0.1108	0.1457
Jiroft	$E T o = W S + 3.001 - \frac{\exp (\sqrt{5^{\sin (T m e a n)}})}{W S - 13}$	0.1489	0.1401
Kerman	$E T o = \sqrt[4]{W S} \times \sqrt{\ln (W S + \frac{T m e a n}{5})} \times \ln (T m e a n + \frac{\sqrt{T m e a n} \times \ln (W S)}{5})$	0.2489	0.4177
Mashhad	$E T o = \exp (\frac{T m e a n}{28}) + \ln (\frac{0.571 (T m e a n + W S)}{- \ln (\frac{W S}{7})})$	0.1107	0.1725
Moghan	$E T o = \ln (\sqrt{0.303 \ln (T m e a n) \times T m e a n \times W S} \times \ln (\ln (9 W S) \times (T m e a n - \cos (T m e a n))))$	0.1082	0.0906
Qazvin	$E T o = \ln (W S) + \sqrt{\ln (W S) + T m e a n - \ln (T m e a n) + 3}$	0.0827	0.1079
Rasht	$E T o = \sqrt{\sin (0.745 T m e a n) + \sqrt{5.064 (\sin (W S) + 4 + W S)}}$	0.0486	0.0961
Sanandaj	$E T o = \sqrt{T m e a n \times (\sqrt{W S} + 0.745)}$	0.1740	0.1287
Shahrekord	$E T o = 0.705 \times (\sqrt{(T m e a n + 7)} + \ln (W S) - 1 - \cos (\ln (2 + 2 T m e a n)))$	0.0583	0.0650
Shiraz	$E T o = \sqrt{\cos (2 W S + T m e a n + 26) + \cos (T m e a n + 5) + 2 W S + \exp (\sqrt{W S + 6})}$	0.1474	0.1403
Tabriz	$E T o = \ln (\exp (\sin (T m e a n)) + 6 + T m e a n + \sin (\sin (T m e a n))) - \frac{\sin (T m e a n)}{\ln (5 + W S)} + {22025.465}^{(W S - 7) (\sin (W S) + 8)}$	0.1487	0.1620
Urmia	$E T o = \cos (\frac{W S - 6.135}{T m e a n}) + \cos (\frac{12.96}{0.75 + W S + T m e a n}) + \exp (\sqrt{\frac{W S}{5}}) - \frac{3}{T m e a n}$	0.2049	0.1158
Yazd	$E T o = \sqrt[4]{\frac{T m e a n^{2} - 8 T m e a n}{2}} \sqrt{(W S + 0.008)}$	0.1790	0.1738
Zabol	$E T o = 5.016 - \frac{W S}{W S - 1} - \frac{0.246 - \frac{W S - 2}{3}}{2.273 - \cos (T m e a n)}$	0.3289	0.3612

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Valipour, M.; Gholami Sefidkouhi, M.A.; Raeini-Sarjaz, M.; Guzman, S.M. A Hybrid Data-Driven Machine Learning Technique for Evapotranspiration Modeling in Various Climates. Atmosphere 2019, 10, 311. https://doi.org/10.3390/atmos10060311

AMA Style

Valipour M, Gholami Sefidkouhi MA, Raeini-Sarjaz M, Guzman SM. A Hybrid Data-Driven Machine Learning Technique for Evapotranspiration Modeling in Various Climates. Atmosphere. 2019; 10(6):311. https://doi.org/10.3390/atmos10060311

Chicago/Turabian Style

Valipour, Mohammad, Mohammad Ali Gholami Sefidkouhi, Mahmoud Raeini-Sarjaz, and Sandra M. Guzman. 2019. "A Hybrid Data-Driven Machine Learning Technique for Evapotranspiration Modeling in Various Climates" Atmosphere 10, no. 6: 311. https://doi.org/10.3390/atmos10060311

APA Style

Valipour, M., Gholami Sefidkouhi, M. A., Raeini-Sarjaz, M., & Guzman, S. M. (2019). A Hybrid Data-Driven Machine Learning Technique for Evapotranspiration Modeling in Various Climates. Atmosphere, 10(6), 311. https://doi.org/10.3390/atmos10060311

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Hybrid Data-Driven Machine Learning Technique for Evapotranspiration Modeling in Various Climates

Abstract

1. Introduction

2. Materials and Methods

2.1. Designing Structure of Genetic Algorithm (GA) for the Current Research

2.2. Parameters of Gene Expression Programming (GEP) Employed in the Current Research by Using GeneXpro Tools Version 5.0

2.3. Materials

3. Results

3.1. Performance of Genetic Algorithm (GA) for Different Parameters

3.2. Performance of Gene Expression Programming (GEP) for Mean Temperature (Tmean) as Input Data

3.3. Performance of Gene Expression Programming (GEP) for Mean Temperature (Tmean), Minimum Temperature (Tmin), and Maximum Temperature (Tmax) as Input Data

3.4. Performance of Gene Expression Programming (GEP) for Mean Temperature (Tmean) and Wind Speed (WS) as Input Data

3.5. Performance of Gene Expression Programming (GEP) for Mean Temperature (Tmean), Wind Speed (WS), and Relative Humidity (RH) as Input Data

3.6. Accuracy of Empirical Models Against Gene Expression Programming (GEP)

4. Discussion

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI