Next Article in Journal
How Much Meteorological Information Is Necessary to Achieve Reliable Accuracy for Rainfall Estimations?
Next Article in Special Issue
Detection and Differentiation between Laurel Wilt Disease, Phytophthora Disease, and Salinity Damage Using a Hyperspectral Sensing Technique
Previous Article in Journal
Gradual Accumulation of Heavy Metals in an Industrial Wheat Crop from Uranium Mine Soil and the Potential Use of the Herbage
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Feature Selection as a Time and Cost-Saving Approach for Land Suitability Classification (Case Study of Shavur Plain, Iran)

1
Department of Remote Sensing and GIS, Faculty of Geography, University of Tehran, P.O. Box 14155-6465, Tehran, Iran
2
Department of Range and Watershed, Agriculture College and Natural Resources of Darab, Shiraz University, Shiraz, Iran
3
Department of cognitive science modeling, Institute for Cognitive Science Studies, Tehran, Iran
4
Laboratory of Geo-Information Science and Remote Sensing, Wageningen University, P.O. Box 47, 6700 AA Wageningen, The Netherlands
*
Author to whom correspondence should be addressed.
Agriculture 2016, 6(4), 52; https://doi.org/10.3390/agriculture6040052
Submission received: 29 July 2016 / Revised: 12 September 2016 / Accepted: 30 September 2016 / Published: 10 October 2016
(This article belongs to the Special Issue Big Data Application in Agriculture)

Abstract

:
Land suitability classification is important in planning and managing sustainable land use. Most approaches to land suitability analysis combine a large number of land and soil parameters, and are time-consuming and costly. In this study, a potentially useful technique (combined feature selection and fuzzy-AHP method) to increase the efficiency of land suitability analysis was presented. To this end, three different feature selection algorithms—random search, best search and genetic methods—were used to determine the most effective parameters for land suitability classification for the cultivation of barely in the Shavur Plain, southwest Iran. Next, land suitability classes were calculated for all methods by using the fuzzy-AHP approach. Salinity (electrical conductivity (EC)), alkalinity (exchangeable sodium percentage (ESP)), wetness and soil texture were selected using the random search method. Gypsum, EC, ESP, and soil texture were selected using both the best search and genetic methods. The result shows a strong agreement between the standard fuzzy-AHP methods and methods presented in this study. The values of Kappa coefficients were 0.82, 0.79 and 0.79 for the random search, best search and genetic methods, respectively, compared with the standard fuzzy-AHP method. Our results indicate that EC, ESP, soil texture and wetness are the most effective features for evaluating land suitability classification for the cultivation of barely in the study area, and uses of these parameters, together with their appropriate weights as obtained from fuzzy-AHP, can perform good results for land suitability classification. So, the combined feature selection presented and the fuzzy-AHP approach has the potential to save time and money for land suitability classification.

1. Introduction

Land suitability analyses assess the suitability of land for specific uses such as arable farming, irrigated agriculture, forest, urbanization, industry, waste disposal, etc. [1,2,3].
Agriculture land suitability classification is defined as the process of assessment of land performance for alternative kinds of agriculture activity and crop type [1,4,5]. This activity is vital to predict the potential and limitation of land for crop production; the systematic assessment of land and water potential; and to select the best land for the cultivation of a given agricultural product [1,4,5,6,7]. In 1976, the Food and Agriculture Organization (FAO) proposed a framework for land suitability classification which allows five classes of suitability (three suitable and two not suitable) for certain crops [1]. Later, the FAO classification model was adapted for use with a wider range of soil and environmental characteristics [8,9,10]. The geographical information system (GIS) has added a much-needed spatial dimension to land suitability mapping and management, and has become a powerful tool in this regard [11,12,13]. Different combined GIS and expert systems such as multi-criteria decision analysis [5,14,15,16,17,18], artificial intelligence in geo-computation methods [19,20,21], visualization methods [22], analytical hierarchy process (AHP) [23,24], fuzzy modeling [25,26,27,28,29] and Fuzzy-AHP methods [30,31,32,33] have been widely used for agricultural land suitability classification. In all of these methods, the topography, wetness, salinity (electrical conductivity (EC)), alkalinity (exchangeable sodium percentage (ESP)), soil texture, soil depth, CaCO3, pH (H2O) and gypsum are important parameters for the evaluation of land suitability for the cultivation of different crops [34].
Because of the large number of inputs required, determining land suitability through existing approaches is time consuming and costly. Hence, the development of methods that minimize and optimize the input parameters for land suitability analysis is important. One method that shows potential is data mining-based feature selection. The data-mining approach is based on identifying valuable information in an existing database [35]. The first requirement for successful data mining is to identify the most important features through a database [36]. The feature selection methods can be seen as the combination of search approaches for proposing new feature subsets, along with an assessment measure that scores the several feature subsets [37].
Recently, several studies have evaluated feature selection for use in digital soil mapping and classification [38,39,40]. As such, feature selection could be useful in studies with a large number of input parameters, such as land suitability analyses [41]. However, so far, no study has evaluated this method for land suitability evaluation, with the exception of a conference proceeding paper by Mokarram et al. [41]. Here, we present an extended version of this conference paper, in which we aimed to assess the capabilities of different feature selection algorithms (random search, best search and genetic methods), in combination with fuzzy-AHP, for land suitability classification for the cultivation of barley. In addition, we aimed to determine the most effective parameters for this purpose. The accuracy of the combined feature selection and fuzzy-AHP methods are compared with the standard method of FAO, and the best method and framework for this task was selected and discussed.

2. Materials and Methods

2.1. Study Area

The study area, Shavur Plain, is located in the Khuzestan province, in southwest Iran, between latitudes 31°00′30″ N and 32°30′00″ N and longitudes 48°15′00″ E and 48°40′40″ E. It has an area of 774 km2 (Figure 1). The area has an arid and semi-arid climate with an average annual precipitation of 266 mm and annual evaporation from open pans of 2788.3 mm·year−1.

2.2. Dataset

Data used for this study include topography (primary slope, secondary slope and micro-relief), wetness (water table depth), salinity (EC), alkalinity (ESP), soil texture, soil depth, CaCO3, pH, and gypsum, which were extracted from the land classification report published by the Khuzestan Soil and Water Research Institute in 2009 [42]. These data were collected by sampling 256 points randomly distributed in the study area. A summary of the dataset is shown in Table 1.

2.3. Methodology

The methodology of the study is summarized in six steps, as follows:
Step 1: Geospatial maps of each parameter presented in Table 1 were constructed. For this we used the geostatistical analysis and the Ordinary Kriging model was applied to the 256 sampled points.
Step 2: Based on a previous study [28,29,30,31], the following functions (Equations (1) and (2)) were determined as the best fuzzy membership functions for each parameter, and these were used to prepare fuzzy maps for all parameters. Function 1 was used for soil depth and wetness, and function 2 was used for soil texture, EC, ESP, gypsum (%), CaCO3 (%), topography, and pH values.
μ A ( X ) = { 0 x a x a / b a a < x < b 1 x b }
μ A ( X ) = { 1 x a b   x / b a a < x < b 0 x b }
Supposing that X = {x is a finite set of points, a fuzzy subset, A subset A of X, is defined by a function, μA(X), in the ordered pairs: A = {x, μA(X)} for each x, x belongs to X. where μA(X) is membership function that defines the grade of membership of x in A fuzzy set. The μA(X) takes values between 0 and 1, inclusive. The value of 1 means that x belongs completely to A, and 0 means that x absolutely does not belong to the subset A. A value between 0 and 1 indicates that, to some degree, x belongs to the subset A. The values a, b are the lower and upper limit values according to Sys’s table [34].
Step 3: The AHP method was employed in order to determine the weights of assessment parameters for land suitability classification. This method is based on a pair wise comparison matrix [43,44]. To derive the pair wise comparison matrix, the relative importance of input parameters (Table 1) was defined by using the published reports by the Khuzestan Soil and Water Research Institute [42]. This report shows that soil salinity and alkalinity, soil wetness, CaCO3, gypsum, pH, soil texture, soil depth and topography are the most important parameters (restriction) for the cultivation of barley in the study area, respectively. The relative importance of these parameters were assigned based on Saaty’s scale (Table 2) and the pair-wise comparison matrix and the final weights for each parameter were derive (Table 3).
Step 4: Finally, to compute the land suitability map of the study area, the convex combination of the raster values containing the different fuzzy parameters were calculated using the linear additive combination models [12] as in Equation (3):
S I = j = 1 k W j × μ j ( x )                       x ε X
where the value of SI is the land suitability index, k is the number of parameters (Table 1), Wj is the weight of parameter j, which is computed by using AHP (Table 3), and μj (X) is the membership grade for factor j. The value of SI is between 0 and 1, where a value of 0 represents totally unsuitability and 1 indicates 100% suitability. The suitability classification was assigned as presented in Table 4, based on the FAO framework [1], and modified slightly in this study. Thus S1 represents that the land unit is highly suitable to barely crop production with no limitations; S2 represents that the land unit is moderately suitable with some limitations; S3 represents that the land unit is marginally suitable with severe limitations; and N represents that the land unit is unsuitable for barely growth.
Step 5: The previous approach needs a large number of land and soil parameters for land evaluation, therefore in this study three different feature selection algorithms which are most common and used methods—random search, best search and genetic methods—were used to determine the most effective parameters for land suitability classification. For each feature selection method, the most effective parameters were selected, and then weights for each parameter were re-calculated according to the AHP. Land suitability classes were calculated for all methods (datasets) using a fuzzy-AHP approach. The details of feature selection methodology are explained in the next section.
Step 6: The Kappa coefficient developed by Cohen [46], was used to assess the agreement between the standard fuzzy-AHP map and the maps obtained from the different feature selection methods. The Kappa coefficient is a measurement of the degree of agreement between two observations (maps). A Kappa value of 0 indicates that there is a poor agreement between two maps and a value of 1 indicates an almost perfect agreement. The methodology of this study is summarized in Figure 2.

Feature Selection Method

The main idea of feature selection is to choose a subset of input variables by eliminating features with little or no predictive information. Feature selection can significantly improve the comprehensibility of the resulting classifier models, and it often builds a model that better generalizes to unseen points. Feature selection has been developed for decades in statistical pattern recognition [47], machine learning [48], data mining [49] and statistics [50].
Ideally, feature selection methods search through the subsets of features, and try to find the best one among the competing 2N (size) candidate subsets according to some evaluation function. This procedure is exhaustive as it tries to find only the best one. There are four basic steps in a typical feature selection method, as follows:
(a)
The generation procedure: this is basically a search procedure, which generates subsets of features for evaluation.
(b)
The evaluation function: An evaluation function measures the goodness of a subset produced by some generation procedure, and this value is compared with the previous best. If it is found to be better, then it replaces the previous best subset.
(c)
The stopping condition: Without a suitable stopping criterion, the feature selection process may run exhaustively or forever through the space of subsets. Generation procedures and evaluation functions can influence the choice of a stopping criterion. A Stopping criteria based on a generation procedure include: (i) whether a predefined number of features are selected; and (ii) whether a predefined number of iterations is reached. Stopping criteria based on an evaluation function can be: (i) whether addition (or deletion) of any feature does not produce a better subset; and (ii) whether an optimal subset according to some evaluation function is obtained. The loop continues until the stopping criterion is satisfied.
(d)
The validation procedure: the validation procedure is not a part of the feature selection process itself, but a feature selection method (in practice) must be validated. It tries to test the validity of the selected subset by carrying out different tests, and comparing the results with previously established results, or with the results of competing feature selection methods, using artificial datasets, real-world datasets, or both [51].
In this research, the different feature selection methods applied were best search, genetic search and random search. A short description of each method is given in the following boxes:
Best search [52]
1. Begin with the OPEN list containing the start state, the CLOSED list empty, and BEST start state.
2. Let s = arg max e(x) (get the state from OPEN with the highest evaluation).
3. Remove s from OPEN and add to CLOSED.
4. If e(s) ≥ e(BEST), then BEST←s.
5. For each child t of s that is not in the OPEN or CLOSED list, evaluate and add to OPEN.
6. If BEST changed in the last set of expansions, go to 2.
7. Return BEST.
Genetic search [52]
1. Begin by randomly generating an initial population P.
2. Calculate e(x) for each member x ϵ P.
3. Define a probability distribution p over the members of P where p(x) α e(x).
4. Select two population members x and y with respect to p.
5. Apply crossover to x and y to produce new population members x′ and y′.
6. Apply mutation to x′ and y′.
7. Insert x′ and y′ into P′ (the next generation).
8. If |P′| < |P|, go to 4.
9. Let P←P′.
10. If there are more generations to process, go to 2.
11. Return x ϵ P for which e(x) is highest.
Random search [53]
Input: F = {f1, f2, …, fn}
Output: A subset of F
Step 1: Initial r0 = r = Q, s = ∅, Gbest = F, k = 0, C, K are specified
Step 2: Partition r0,
Step 3: Find the most promising index:
σ = {η|η∈children(r)|}, m =|σ|, σm+1 = s, Index = 0;
For i = 1: m + 1
Randomly sample N subsets in σi, find the best G i *
End;
Index = arg m a x 1 i m + 1 ( I ( σ i ) ) ,
If index G i n d e x * G b e s t then G b e s t = G i n d e x *
If | σ i n d e x | = 1 then go to Step 5
Step 4: If 1 ≤ index ≤ m then r = σ i n d e x , s = r0\r else r = sup(r); k = k + 1;
If k > K then go to Step 5 else go to Step 3
Step 5: Output the Gbest

3. Results and Discussion

The primary fuzzy maps achieved by the appropriate fuzzy function and Kriging interpolation method for each parameter are shown in Figure 3. The land suitability map based on the fuzzy-AHP is shown in Figure 4.
Using the feature selection methods and applying a random search on the dataset, the variables EC, ESP, wetness and soil texture were selected as inputs for the land suitability classification. With the best search algorithm and the genetic search method, EC, ESP, gypsum and soil texture were selected (Table 5).
After determining the most important parameters by different feature selection methods, the pairwise comparison matrices were calculated for the selected parameters. These weights are given in Table 6 and Table 7.
Land suitability maps for different feature selection methods are shown in Figure 5.
The value of the Kappa coefficient for the random search, best search and genetic search methods compared with the fuzzy-AHP method were calculated as 0.82, 0.79 and 0.79, respectively. The result shows that there is a strong agreement between the standard fuzzy-AHP method and the random search method for the determination of land suitability in the study area. Figure 6 shows the results of this comparison as a map.
The results of the random search showed 40% of the study area as highly suitable for barley (S1 class), 17% as moderately suitable (S2 class), 21% as marginally suitable (S3 class) and 22% as unsuitable (N class). The results of the fuzzy-AHP method showed 44% of the study area as highly suitable for barley (S1 class), 22% as moderately suitable (S2 class), 4% as marginally suitable (S3 class) and 28% as unsuitable (N class).
In order to better evaluate the obtained results, 20 irrigated barley fields in the study area were randomly chosen and their yields were measured at the end of the growing season. Then, based on the Table of Sys [17], the corresponding class of each field was determined and compared with the classes obtained from the fuzzy-AHP method and different feature selection algorithms. These results are given in Table 8.
As shown in Table 8, there is a high agreement between the traditional results of Sys’s table and the fuzzy-AHP method. With the fuzzy-AHP method, 16 of the 20 reference fields were assigned to the correct class. Also, a comparison of the classes obtained by fuzzy-AHP and different feature selection methods shows that 16, 15 and 15 selected points by random search, best search and genetic methods, respectively, were assigned to the same class selected by the original fuzzy-AHP method.

4. Conclusions

Recently, there has been increasing interest in methods for the evaluation of agricultural land suitability. Most methods for land suitability classification use a large number of parameters, which is time consuming and costly. In this study, the capability of different feature selection methods (best search, random search and genetic search methods) combined with the fuzzy-AHP approach as novel and potentially time and cost saving methods were evaluated for land suitability classification for the cultivation of barely. Results showed that all three methods performed well for this purpose, but the random search method performed slightly better than the other methods. Also, soil texture, wetness, EC and ESP are the most effective parameters for the determination of land suitability classification. Overall, it can be concluded that the proposed feature selection and fuzzy-AHP combined model in this study improves the prediction of important parameters for land suitability classification and provides a faster and more cost-effective approach for land suitability classification. Feature selection is itself useful, but the increasing over-fitting risk when the number of observations is insufficient and the significant computation time when the number of variables is large, are the two main weakness of this method.

Acknowledgments

The authors would like to acknowledge the Khuzestan Soil and Water Research Institute for their assistance during the study and for providing the dataset. Furthermore, we appreciate the invaluable support provided by the AuthorAid community, especially Ann Seward, for the English editing of the paper.

Author Contributions

Saeid Hamzeh and Marzieh Mokarram carried out and analyzed the data and wrote the paper. Azadeh Haratian assisted with the programing of the feature selection algorithms. Harm Bartholomeus, Arend Ligtenberg and Arnold Bregt carried out the critical revision of the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. FAO. A Framework for Land Evaluation; FAO Soils Bulletin No. 32; FAO: Rome, Italy, 1976; p. 72. [Google Scholar]
  2. Sonneveld, M.P.W.; Hack-ten Broeke, M.J.D.; van Diepen, C.A.; Boogaard, H.L. Thirty years of systematic land evaluation in the Netherlands. Geoderma 2010, 156, 84–92. [Google Scholar] [CrossRef]
  3. Baiocchi, V.; Lelo, K.; Polettini, A.; Pomi, R. Land suitability for waste disposal in metropolitan areas. Waste Manag. Res. 2014, 32, 707. [Google Scholar] [CrossRef] [PubMed]
  4. He, Y.; Yao, Y.; Chen, Y.; Ongaro, L. Regional Land Suitability Assessment for Tree Crops Using Remote Sensing and GIS. In Proceedings of the IEEE Computer Distributed Control and Intelligent Environmental Monitoring (CDCIEM), Changsha, Hunan, China, 19–20 February 2011; pp. 354–363.
  5. Prakash, T.N. Land Suitability Analysis for Agricultural Crops: A Fuzzy Multicriteria Decision Making Approach, Science in Geoinformatics; ITC: Enschede, The Netherlands, 2003; pp. 6–13. [Google Scholar]
  6. Lukasheh, A.F.; Droste, R.L.; Warith, M.A. Review of expert system (ES), geographical information system (GIS), decision support system (DSS) and their application in landfill design and management. Waste Manag. Res. 2001, 19, 177–185. [Google Scholar] [CrossRef] [PubMed]
  7. Abd el-Kawy, O.R.; Ismasil, H.A.; Rqd, J.K.; Suliman, A.S. A Developed GIS-based Land Evaluation Model for Agricultural Land Suitability Assessments in Arid and Semi-Arid Regions. Res. J. Agric. Biol. Sci. 2010, 6, 589–599. [Google Scholar]
  8. FAO. Guidelines for Land Evaluation for Rainfed Agriculture; FAO Soils Bulletin No. 52; FAO: Rome, Italy, 1984. [Google Scholar]
  9. FAO. Guidelines Land Evaluation for Irrigated Agriculture; FAO Soils Bulletin No. 55; FAO: Rome, Italy, 1985. [Google Scholar]
  10. FAO. Land Evaluation: Towards A Revised Framework; Land and Water Discussion Paper No. 6; Food and Agriculture Organization of the United Nations: Rome, Italy, 2007. [Google Scholar]
  11. Kalogirou, S. Expert systems and GIS: An application of land suitability evaluation. Comput. Environ. Urban Syst. 2002, 26, 89–120. [Google Scholar] [CrossRef]
  12. Malczewski, J. GIS-based land-use suitability analysis: A critical overview. Prog. Plan. 2004, 62, 53–65. [Google Scholar] [CrossRef]
  13. Pan, G.; Pan, J. Research in crop land suitability analysis based on GIS. Comput. Comput. Technol. Agric. 2012, 365, 314–325. [Google Scholar]
  14. Malczewski, J. GIS and Multicriteria Decision Analysis; Wiley: New York, NY, USA, 1999. [Google Scholar]
  15. Ceballos Silva, A.; Lopez Blanco, J. Evaluating biophysical variables to identify suitable areas for oat in Central Mexico: a multi-criteria and GIS approach. Agric. Ecosyst. Environ. 2003, 95, 371–377. [Google Scholar] [CrossRef]
  16. Abushnaf, F.F.; Spence, K.J.; Rotherham, I.D. Developing a land evaluation model for the Benghazi region in northeast Libya using a geographic information system and multi-criteria analysis. APCBEE Proc. 2013, 5, 69–75. [Google Scholar] [CrossRef]
  17. Feizizadeh, B.; Blaschke, T. Land suitability analysis for Tabriz county, Iran: A multi-criteria evaluation approach using GIS. J. Environ. Plan. Manag. 2012, 56, 1–23. [Google Scholar] [CrossRef]
  18. Mendas, A.; Delali, A. Integration of multi-criteria decision analysis in GIS todevelop land suitability for agriculture: Application to durum wheat cultivation in the region of Mleta in Algeria. Comput. Electron. Agric. 2012, 83, 117–126. [Google Scholar] [CrossRef]
  19. Ligtenberg, A.; Bregt, A.K.; Van Lammeren, R. Multi-actor-based land use modeling: Spatial planning using agents. Landsc. Urban Plan. 2003, 56, 21–33. [Google Scholar] [CrossRef]
  20. Xiao, N.; Bennett, D.A.; Armstrong, M.P. Using evolutionary algorithms to generate alternatives for multiobjective site-search problems. Environ. Plan. A 2002, 34, 639–656. [Google Scholar] [CrossRef]
  21. Mokarram, M.; Hamzeh, S.; Aminzadeh, F.; Zarei, A. Using machine learning for land suitability classification. West Afr. J. Appl. Ecol. 2016, 23, 63–73. [Google Scholar]
  22. Jankowski, P.; Andrienko, N.; Andrienko, G. Map-centered exploratory approach to multiple criteria spatial decision making. Int. J. Geogr. Inf. Sci. 2001, 15, 101–127. [Google Scholar] [CrossRef]
  23. Akinci, H.; Ozalp, A.Y.; Turgut, B. Agricultural land use suitability analysis using GIS and AHP technique. Comput. Electron. Agric. 2013, 97, 71–82. [Google Scholar] [CrossRef]
  24. Chavez, M.D.; Berentsen, P.B.M.; Lansink, A. Assessment of criteria and farming activities for tobacco diversification using the analytical hierarchical process (AHP) technique. Agric. Syst. 2012, 111, 53–62. [Google Scholar] [CrossRef]
  25. Liu, M.; Samal, A. A fuzzy clustering approach to delineate agroecozones. Ecol. Model. 2002, 149, 215–228. [Google Scholar] [CrossRef]
  26. Malczewski, J. Fuzzy screening for land suitability analysis. Geogr. Environ. Model. 2002, 6, 27–39. [Google Scholar] [CrossRef]
  27. Triantafilis, J.; Ward, W.T.; McBratney, A.B. Land suitability assessment in the Namoi Valley of Australia, using a continuous model. Aust. J. Soil Res. 2001, 39, 273–289. [Google Scholar] [CrossRef]
  28. Nisar Ahamed, T.R.; GopalRao, K.; Murthy, J.S.R. GIS-based fuzzy membership model for crop-land suitability analysis. Agric. Syst. 2000, 63, 79–97. [Google Scholar] [CrossRef]
  29. Van Ranst, E.; Tang, H.; Groenemans, R.; Sinthurahat, S. Application of fuzzy logic to land suitability for rubber production in peninsular Thailand. Geoderma 1996, 70, 1–19. [Google Scholar] [CrossRef]
  30. Hamzeh, S.; Mokarram, M.; Alavipanah, S.K. Combination of Fuzzy and AHP methods to assess land suitability for barley: Case Study of semi-arid lands in the southwest of Iran. Desert 2014, 19, 173–181. [Google Scholar]
  31. Hamzeh, S.; Mokarram, M.; Naseri, A.A.; Bartholomeus, H.M.; Ligtenberg, A. Combination of Fuzzy and AHP Methods to Map Land Suitability for Barley in Comparison with the FAO Method; The 8th International Symposium, Agro Environ 2012: Wgeningen, The Netherlands, 2012. [Google Scholar]
  32. Ertugrul, I.; Karakasoglu, N. Comparison of fuzzy AHP and fuzzy TOPSIS methods for facility location selection. Int. J. Adv. Manuf. Technol. 2008, 39, 783–795. [Google Scholar] [CrossRef]
  33. Zhang, J.; Su, Y.; Wu, J.; Liang, H. GIS based land suitability assessment for tobacco production using AHP and fuzzy set in Shandong province of China. Comput. Electron. Agric. 2015, 114, 202–211. [Google Scholar] [CrossRef]
  34. Sys, C.; Van Ranst, E.; Debaveye, J. Land Suitability, Part Ш: Crop requirements; International Training Center for Post Graduate Soil Scientists, Chent University: Ghent, Belgium, 1993; p. 199. [Google Scholar]
  35. Alpaydin, E. Introduction to Machine Learning; MIT Press: Cambridge, MA, USA, 2010. [Google Scholar]
  36. Mendonca, M. Mining Software Engineering Data: A Survey, A DACS; State-of-the-Art Report; Data and Analysis Center for Software: Rome, NY, USA, 1999. [Google Scholar]
  37. Guyon, I.; Elisseeff, A. An Introduction to Variable and Feature Selection. J. Mach. Learn. Res. 2003, 3, 1157–1182. [Google Scholar]
  38. Besalatpour, A.A.; Ayoubi, S.; Hajabbasi, M.A.; Gharipour, A.; Yousefian Jazi, A. Feature selection using parallel genetic algorithm for the prediction of geometric mean diameter of soil aggregates by machine learning methods. Arid Land Res. Manag. 2014, 28, 383–394. [Google Scholar] [CrossRef]
  39. Behrens, T.; Zhu, A.X.; Schmidt, K.; Scholten, T. Multi-scale digital terrain analysis and feature selection for digital soil mapping. Geoderma 2010, 155, 175–185. [Google Scholar] [CrossRef]
  40. Shi, T.; Chen, Y.; Liu, H.; Wang, J.; Wu, G. Soil organic carbon content estimation with laboratory-based visible-near-infrared reflectance spectroscopy: Feature selection. Appl. Spectrosc. 2014, 68, 831–837. [Google Scholar] [CrossRef] [PubMed]
  41. Mokarram, M.; Hamzeh, S.; Haratian Nejady, A.; Bartholomeus, H.M.; Bregt, A. Feature Selection for Land Suitability Evaluation in Combination with the Fuzzy-AHP Method; The 8th International Symposium, Agro Environ 2012: Wgeningen, The Netherlands, 2012. [Google Scholar]
  42. Khuzestan Soil and Water Research Institute. Shavur Soil and Water Project Report; Khuzestan Soil and Water Research Institute: Khuzestan, Iran, 2009. [Google Scholar]
  43. Saaty, T.L. A scaling method for priorities in hierarchical structure. J. Math. Psychol. 1977, 15, 34–39. [Google Scholar] [CrossRef]
  44. Saaty, T.L. The Analytic Hierarchy Process: Planning, Priority Setting, Resource Allocation; McGraw-Hill International: New York, NY, USA, 1980. [Google Scholar]
  45. Saaty, T.L.; Vargas, L.G. Prediction, Projection and Forecasting; Kluwer Academic Publisher: Dordrecht, The Netherlands, 1991; p. 251. [Google Scholar]
  46. Cohen, J. Coefficient of agreement for nominal scales. Educ. Psychol. Meas. 1960, 20, 37–46. [Google Scholar] [CrossRef]
  47. Mitra, P.; Murthy, C.A.; Pal, S.K. Unsupervised feature selection using feature similarity. IEEE Trans. Pattern Anal. Mach. Intell. 2002, 24, 301–312. [Google Scholar] [CrossRef]
  48. Robnik-Sikonja, M.; Kononenko, I. Theoretical and empirical analysis of ReliefF and RReliefF. Mach. Learn. 2003, 53, 23–69. [Google Scholar] [CrossRef]
  49. Dash, M.; Choi, K.; Scheuermann, P.; Liu, H. Feature selection for clustering—A filter solution. In Proceedings of the Second International Conference on Data Mining, Maebashi, Japan, 9–12 December 2002; pp. 115–122.
  50. Miller, A. Subset Selection in Regression, 2nd ed.; Chapman & Hall/CRC: London, UK, 2002. [Google Scholar]
  51. Dash, M.; Liub, H. Consistency-based search in feature selection Elsevier. Artif. Intell. 2003, 151, 155–176. [Google Scholar] [CrossRef]
  52. Hall, M.A. Correlation-based Feature Selection for Machine Learning. Ph.D. Thesis, Hamilton, New Zealand, 1999; p. 198. [Google Scholar]
  53. Li, X.; Wang, S.; Dong, Z.Y. (Eds.) The Application of Adaptive Partitioned Random Search in Feature Selection Problem. In Advanced Data Mining and Applications; Springer Berlin: Heidelberg, Germany, 2005; pp. 307–314.
Figure 1. Location of the study area, Shavur Plain in Iran (WGS 84, UTM-Zone 38 N).
Figure 1. Location of the study area, Shavur Plain in Iran (WGS 84, UTM-Zone 38 N).
Agriculture 06 00052 g001
Figure 2. Overview of the methodology.
Figure 2. Overview of the methodology.
Agriculture 06 00052 g002
Figure 3. Fuzzy maps for each parameter.
Figure 3. Fuzzy maps for each parameter.
Agriculture 06 00052 g003
Figure 4. Land suitability map for barley using the fuzzy-AHP method (S1: high suitability, S2: moderate suitability, S3: marginal suitability, N: non-suitability).
Figure 4. Land suitability map for barley using the fuzzy-AHP method (S1: high suitability, S2: moderate suitability, S3: marginal suitability, N: non-suitability).
Agriculture 06 00052 g004
Figure 5. Land suitability maps for barley using the combined method of feature selection and fuzzy-AHP.
Figure 5. Land suitability maps for barley using the combined method of feature selection and fuzzy-AHP.
Agriculture 06 00052 g005
Figure 6. Comparison map showing correspondence between fuzzy-AHP and random search results.
Figure 6. Comparison map showing correspondence between fuzzy-AHP and random search results.
Agriculture 06 00052 g006
Table 1. Summary of parameters for land suitability classification in the study area.
Table 1. Summary of parameters for land suitability classification in the study area.
Parameters Minimum Maximum
pH 7.908.32
gypsum 0.002.94
CaCO3 (%) 17.7439.16
soil depth (cm) 150.00200.00
soil texture 7.209.75
salinity and alkalinityEC (ds/m)1.0062.98
ESP (%)1.0049.99
wetnesswater table depth (cm)0.00200.00
depth of segment (cm)0.00100.00
topographyprimary slope (%)0.003.50
secondary slope (%)0.001.50
micro-relief (cm)0.0045.00
Table 2. Scales for pair-wise AHP comparisons [45].
Table 2. Scales for pair-wise AHP comparisons [45].
Intensity of ImportanceDefinition
1Equal Importance
2Weak or slight
3Moderate importance
4Moderate plus
5Strong importance
6Strong plus
7Very strong or demonstrated importance
8Very, very strong
9Extreme importance
Reciprocals of aboveIf activity i has one of the above non-zero numbers assigned to it when compared with activity j, then j has the reciprocal value when compared with i
Table 3. Pairwise comparison matrix for land suitability for barley.
Table 3. Pairwise comparison matrix for land suitability for barley.
ParametersEC and ESPSoil WetnessCaCO3GypsumpHSoil TextureSoil DepthTopographyWeight
EC and ESP123456780.3290
soil wetness1/212345670.2243
CaCO31/31/21234560.1526
gypsum1/41/31/2123450.1053
pH1/51/41/31/212340.0750
soil texture1/61/51/41/31/21230.0525
soil depth1/71/61/51/41/31/2120.0359
topography1/81/71/61/51/41/31/210.0254
Table 4. Land suitability classes for the barely in the study area.
Table 4. Land suitability classes for the barely in the study area.
Suitability ClassSuitability IndexDefinition
S10.79–0.99highly suitable
S20.74–0.79moderately suitable
S30.65–0.74marginally suitable
N0.10–0.65unsuitable
Table 5. The feature selection results.
Table 5. The feature selection results.
MethodSelected Features
random searchEC and ESP
wetness
soil texture
best searchgypsum
EC and ESP
soil texture
genetic searchgypsum
EC and ESP
soil texture
Table 6. Pairwise comparison matrix using the random search.
Table 6. Pairwise comparison matrix using the random search.
ParametersEC and ESPWetnessSoil TextureWeight
EC and ESP1230.53
wetness1/2120.3
soil texture1/31/210.17
Table 7. Pairwise comparison matrix using the best search and genetic search methods.
Table 7. Pairwise comparison matrix using the best search and genetic search methods.
ParametersEC and ESPGypsumSoil TextureWeight
EC and ESP1230.53
gypsum1/2120.3
soil texture1/31/210.17
Table 8. Comparison of the results of different random search methods and standard FAO and fuzzy-AHP methods based on crop yield.
Table 8. Comparison of the results of different random search methods and standard FAO and fuzzy-AHP methods based on crop yield.
PointsObserved Yield (ton/ha)Rate of Decrease Yield Relative to Potential Yield* (%)Suitability Classes Based on the Sys’s table Suitability Class by Fuzzy-AHPSuitability Class by Random SearchSuitability Class by Best SearchSuitability Class by Genetic Method
11.962NNNNN
23.529.2S2S2S2S2S2
32.353.2S3S3NS3S3
42.647S3S3S3S3S3
52.548.2S2S3S3S3S3
64.67S1S1S1S1S1
73.920.4S2S2S2S2S2
81.764.4NNNS3S3
94.58.8S1S1S1S1S1
104.66.6S1S1S1S2S2
111.568.8NNS3NN
124.58.8S1S1S1S1S1
13420S2S1S1S2S2
144.76S1S1S1S2S2
154.214.4S1S1S1S1S1
164.313.6S1S1S1S1S1
174.74.4S1S1S1S1S1
184.215.8S2S1S1S2S2
193.431S2S2S3S2S2
203.920.4S2S1S2S1S1
Potential yield = 5 ton/ha.

Share and Cite

MDPI and ACS Style

Hamzeh, S.; Mokarram, M.; Haratian, A.; Bartholomeus, H.; Ligtenberg, A.; Bregt, A.K. Feature Selection as a Time and Cost-Saving Approach for Land Suitability Classification (Case Study of Shavur Plain, Iran). Agriculture 2016, 6, 52. https://doi.org/10.3390/agriculture6040052

AMA Style

Hamzeh S, Mokarram M, Haratian A, Bartholomeus H, Ligtenberg A, Bregt AK. Feature Selection as a Time and Cost-Saving Approach for Land Suitability Classification (Case Study of Shavur Plain, Iran). Agriculture. 2016; 6(4):52. https://doi.org/10.3390/agriculture6040052

Chicago/Turabian Style

Hamzeh, Saeid, Marzieh Mokarram, Azadeh Haratian, Harm Bartholomeus, Arend Ligtenberg, and Arnold K. Bregt. 2016. "Feature Selection as a Time and Cost-Saving Approach for Land Suitability Classification (Case Study of Shavur Plain, Iran)" Agriculture 6, no. 4: 52. https://doi.org/10.3390/agriculture6040052

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop