1. Introduction
Rice (
Oryza sativa L.) is one of the most important staple food crops, feeding over half of the world’s population. In 2010, the global rice production was approximately 672 million tons from a cultivation area of around 154 million ha, with China contributing to 29% and 19% of the rice production and cultivation area, respectively [
1]. Paddy rice management and its irrigation strategy have significant effects on greenhouse gas emissions [
2,
3]. Globally, rice paddies contribute about 10% of the total methane flux to the atmosphere [
4]. In 2000, the soil nitrogen (N), phosphorus (P), and Potassium (K) nutrient deficit induced by rice production accounted for 42% of the global deficit amount [
5]. Paddy rice agriculture in China is therefore of national and global significance in terms of both food security and sustainable development.
In the past years, remote sensing (RS) as an advanced technology has been used extensively in agriculture to obtain spatial and temporal information about crops [
6,
7,
8,
9]. Paddy rice areas were well projected using RS techniques [
10,
11,
12]. Geographic Information System (GIS) data have been proved to be important to enhance the accuracy of land use and land cover classification [
13,
14]. RS data with coarse and medium resolution are widely used in rice cultivation research [
12,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24,
25,
26]. However, the number of conducted studies on rice using high resolution RS images was limited in the past two decades [
23,
27,
28]. Identification of rice cultivation areas and estimation of agronomic parameters from high resolution images are valuable for improving rice production.
Mapping rice cultivation areas accurately is fundamental for the assessment of agricultural and environmental productivities, the analysis of food security and therefore national and international food trade decisions [
29,
30,
31]. Many previous studies have mapped rice areas [
12]. Various classifiers, including maximum likelihood, artificial neural network, decision tree, and spatial reclassification kernel, have been used in vegetation mapping using RS. However, classification accuracies are mainly determined by the quality and quantity of RS data. No ideal image classifier is uniformly applicable to all tasks [
32]. Extensive field knowledge and auxiliary data help to improve the classification accuracy [
32]. In the Multi-Data-Approach (MDA) for land use and land cover classifications as well as crop rotation mapping, multi-temporal RS and GIS vector data are combined to derive as much information as possible. Studies [
16,
17] demonstrated that considering extensive field knowledge and ancillary GIS data in the post-classification process can improve the classification accuracy by up to 10%.
Agricultural RS refers to the method of non-contact measurements of electromagnetic radiation reflected or emitted from plant materials or soils in agricultural fields [
33]. Different vegetative covers can be distinguished according to their unique spectral behavior in relation to overall ground elements [
34]: Visible radiation in the red (630–690 nm) is absorbed by chlorophyll while radiation in the near infrared (760–900 nm) is strongly reflected by leaf cellular structures. Vegetation Indices (VIs) are developed to qualitatively and quantitatively evaluate vegetative characteristics by combining spectral measurements from different wavelength channels [
35]. Theoretically, the VIs should be particularly sensitive to vegetative covers, insensitive to non-vegetation factors such as soil properties, atmospheric effects and sensor viewing conditions [
36]. In practice, factors of soil characteristics, atmosphere and sensor radiometry degradation, as well as differences in the spectral responses and bidirectional effects all have considerable effects on vegetation indices. Therefore, many VIs have been developed to enhance the vegetative cover signal while minimizing the background response [
35]. Hansen and Schjoerting [
37] discussed the optimized NDVI from different band centers and band widths to represent wheat parameters such as biomass, leaf area index (LAI), chlorophyll, and N status. In their study, the partial least square regression algorithm was applied. Yao
et al. [
38] successfully explored empirical models based on ground-based RS techniques to estimate N status and improve N use efficiency in rice. Yu
et al. [
39] investigated the potential of hyperspectral band combinations in interpreting canopy N status in rice. Despite the lack of interpretations of the physical mechanisms and interactions between the target properties and the measured signals, these empirical models are fundamental materials for further research in order to represent the physical reality.
Remotely sensed photosynthetically active radiation has been used to evaluate the primary production of crops [
40,
41]. The most sophisticated method commonly combines RS data with dynamic crop growth models [
42,
43]. However, these methods require numeric parameters and are sensitive to soil background conditions.
Precision agriculture (PA) emerged in the middle of the 1980s [
33]. Later, in 1991, satellite data were firstly used in PA [
44]. Spatial and temporal variability of soil and crop factors within a field is the essential base of PA [
45]. Measurement of various crop canopy variables during the growing season provides an opportunity for improving grain yield and quality by site-specific fertilizer applications. Due to the ability of providing high temporal, spatial and spectral resolution images, satellite RS has a significant potential in PA [
33,
46]. The multi-temporal within-field information on crop status captured by satellite RS is invaluable in PA.
Because of its operational and economical uses over large areas, satellite RS technology has been widely used to conduct in-season crop yield forecasting for decision making on marketing intervention and policy support on regional or global scales [
29,
31]. Satellite RS is also an essential technique for agro-ecosystem studies on regional scales [
47,
48,
49].
The overall aim of this study is to investigate the possibilities and accuracies of within-field variability of rice status monitoring during the entire growing season on a county scale (Qixing Farm) using satellite RS data. The term “within-field variability” in this study refers to the spatial variability of agronomic variables (biomass, LAI, N concentration, N uptake, etc.) within a rice crop field defined by enclosed boundaries. The size of a rice field in this study ranges from 0.5–100 ha. First, a map of rice cultivation areas was produced using multi-temporal RS FORMOSAT-2 (FS-2) images, coupled with auxiliary GIS data (MDA) to improve the classification accuracy. Then, empirical regression models were developed to relate RS spectra with field measurements of the four agronomic variables. One advantage of this research is the construction of specific regression models for different growth stages. Based on a stepwise regression analysis, the optimized predictors from the satellite images were identified to investigate the dynamics of crop canopy characteristics at different stages. The specific objectives of this study are to (1) identify rice cultivation areas with high classification accuracies based on multi-temporal RS and GIS data; (2) develop regression models for deriving rice crop variables from the FS-2 imagery; (3) validate the ability of the regression models to estimate rice parameters; (4) apply the regression models to the entire Qixing Farm County study area.
The presented method of extracting high resolution within-field variability information from the FS-2 imagery will assist farmers in their site-specific rice crop management and strategy planning.
2. Study Area
The study site Qixing Farm (47.2°N, 132.8°E), is located in the Sanjiang Plain (SJP) in North-eastern China (see
Figure 1). The SJP is an alluvial plain formed by the Songhua River, the Heilong River and the Wusuli River. The administrative area of the Qixing Farm County is about 120,000 ha. The climate in the SJP is temperate sub-humid, with a mean annual precipitation of 500–650 mm. Rainfall mainly occurs from May to September during the growing season of crops. The accumulated ≥ 10 °C temperature all across the year is about 2300–2500 °C and only single-season crops are planted. The topography is rather flat with an average elevation of 60 m and is characterized by broad alluvial plains and low terraces formed by the rivers. The SJP, which covers an area of more than 100,000 km
2, exceeding the size of the Netherlands almost by three times, is one of the major agricultural areas of China. In 2009, arable land in the SJP accounted for nearly 60% of the total land, being dominated by paddy rice (57%) [
50]. Since the irrigation constructions (water channels, raised ridges) for paddy rice are reusable year by year, the field boundaries are mostly stable. In recent years, for better economic profit, there has been moderate land use change from dryland to paddy rice. Compared with Europe’s highly fragmented agricultural landscapes which prevent the utility of coarse resolution RS data for quantitative crop monitoring and yield forecasting [
32], SJP’s large homogenous landscapes provide an ideal site for monitoring crops using satellite RS.
Figure 1.
Location of the study area in Northeast China (the upper left corner shows a subset of the FS-2 image acquired on 9 August 2009).
Figure 1.
Location of the study area in Northeast China (the upper left corner shows a subset of the FS-2 image acquired on 9 August 2009).
3. Data
3.1. Satellite RS Images and GIS Data
FORMOSAT-2 (FS-2) collects multispectral images with a ground pixel resolution of 8 × 8 m2 over a swath of 24 km. The FS-2 images used in this study are optical images with 4 bands of blue (450–520 nm), green (520–600 nm), red (630–690 nm), and near-infrared (760–900 nm). Three tiles of high quality images covering the main arable land area (~56,000 ha) of the Qixing Farm were captured on 24 June, 6 July, and 9 August, in 2009. Thus, both the vegetative phase (24 June and 6 July) and the reproductive phase (9 August) of rice are well represented in these images.
GIS vector data of Qixing Farm field boundaries were provided by the Qixing Modern Agriculture Research Center. Information on crop field boundaries, irrigation wells, water drainages, and shelter forests edges are given at a fine field unit scale. Additional information such as crop type of each field is given in the GIS attribute table.
3.2. Ground Truth Data Collection
Field campaigns for the agronomic data collection were carried out during the entire rice growing season in 2009. In total, 42 sample sites covered by the FS-2 images were selected for this study. All these 42 sites were located in seven farmers’ fields being spatially separated. Each site was represented by one plot covering approximately 0.1–0.3 ha. The final plant samples collected from each site were a mixture of three or four spatially separated samples, taken from the same plot. As ground truth data, the areas of sample sites were mapped using a Trimble™ Global Positioning System (GPS) receiver. Field management calendars of transplanting, N topdressing, irrigation, application of insecticides, and harvest dates, were recorded. Several field campaigns were carried out to collect samples from the tillering stage, booting stage, heading stage, 20 days after heading, and the harvest stage. For each site, biomass, LAI and plant N concentration were measured and plant N uptake was calculated as well.
After the field sampling, the plants were first cleaned, and then separated into different organs (leaves, stems, panicles) to measure the biomass values. LAI was measured using a sub-sample of the leaf biomass. One sub-sample consisted of 10–20 leaves, randomly selected among the youngest fully developed leaves. All fresh samples were processed in the oven at 105 °C for half an hour to stop enzyme activity. After that, they were dried at 75 °C for at least 72 h until a constant weight was reached before they were finally weighted. N concentration was measured using the Kjeldahl-N method. The plant N uptake was calculated as the aboveground dry mass multiplied by the N concentration. Detailed information is listed in
Table 1.
In the study area, rice cultivation technologies in high latitude are relatively sophisticated and rice cultivation regulations developed by the government are applied by most of the farmers. The rice seedlings are first grown in greenhouses and are then transplanted into the paddy fields. In the regulations, a date window of 15–25 May is suggested for transplanting in order to capture the maximum accumulated temperature throughout the whole year. In this research, we divided our ground truth data sets into two groups according to the transplanting dates. Sample sites with seedlings transplanted from 15–25 May were used to construct empirical regression models between the agronomic parameters and the RS data. Sample sites with seedlings transplanted beyond those dates were used as validation data sets to evaluate the performance of the regression models.
Table 1.
Agronomic parameters of the sample sites in 2009.
Table 1.
Agronomic parameters of the sample sites in 2009.
Field | Number of Sample Sites | Variety | Number of Foliage | Transplanting Date | Quantity of N Topdressing (kg/ha) | Plant Density (plants/m2) | Soil Properties |
---|
pH | K (mg/kg) | P (mg/kg) | N (mg/kg) | SOM (g/kg) |
---|
Field 1 | 4 | Longjing 21 | 12 | May 10 | 104, 104, 104, 104 | 109 | 6.05 | 75.7 | 26.7 | 181.1 | 37.98 |
3 | Kongyu 131 | 11 | May 17 | 0, 94, 141 | | | | | | |
1 | Xixuan 1 | 13 | May 15 | 132 | 101 | 6.22 | 134.5 | 36.6 | 177.4 | 35.47 |
1 | Longdun 104 | 12 | May 16 | 118 | | 6.07 | 142.5 | 30.1 | 173.6 | 46.85 |
1 | Longjing 21 | 12 | May 17 | 118 | 133 | 6.08 | 82.4 | 28.3 | 175.8 | 37.78 |
Field 2 | 4 | Longjing 21 | 12 | May 13 | 84, 84, 84, 84 | 115 | 5.59 | 103.8 | 33.2 | 259.9 | 41.83 |
3 | Longjing 21 | 12 | May 13 | 0, 67, 101 | 115 | | | | | |
Field 3 | 4 | Longjing 21 | 12 | May 12 | 67, 67, 67, 67 | 117 | 6.13 | 126.5 | 24.0 | 209.0 | 49.52 |
4 | Longjing 21 | 12 | May 12 | 0, 54, 81, 67 | 117 | | | | | |
Field 4 | 4 | Kendao 6 | 11 | May 20 | 83, 83, 83, 83 | 158 | 6.29 | 372.5 | 21.5 | 249.8 | 66.03 |
Field 5 | 4 | Longjing 24 | 11 | May 16 | 95, 95, 95, 95 | 166 | 5.84 | 73.4 | 37.8 | 176.2 | 35.76 |
1 | Longjing 24 | 11 | May 16 | 0 | 166 | | | | | |
Field 6 | 1 | Longdun 249 | 12 | May 20 | 102 | | 5.78 | 126.5 | 30.6 | 169.7 | 37.08 |
1 | Chaoyou 949 | 11 | May 18 | 106 | 144 | 6.08 | 98.4 | 32.0 | 187.5 | 39.91 |
3 | Jinxuan 1 | 11 | May 20 | 0, 86, 129 | | | | | | |
Field 7 | 3 | Kongyu 131 | 11 | May 17 | 0, 82, 123 | 122 | | | | | |
4. Methods
4.1. Satellite Image Pre-Processing
Pre-processing of satellite images prior to vegetation information extraction is essential to remove noise and increase the interpretability of image data [
51]. Solar radiation reflected by the earth surface to the satellite sensors is significantly affected by its interaction with the atmosphere. Atmospheric correction is an important pre-processing step for satellite RS [
52,
53]. The uncertainties resulting from atmospheric effects in agriculture applications of satellite RS have been well discussed during the past decades [
18,
21,
54,
55]. In this study, atmospheric correction was performed using ENVI FLAASH, version 5.1.
Table 2 lists the main parameters used in the atmospheric correction process. The sub-arctic summer model for rural region was selected.
Table 2.
Main atmospheric correction parameters for the FS-2 images.
Table 2.
Main atmospheric correction parameters for the FS-2 images.
Date | Visibility (km) | Zenith Angle | Azimuth Angle |
---|
June 24 | 50 | 146°10′0.84″ | −115°13′19.56″ |
July 6 | 50 | 152°15′58.34″ | −45°34′58.34″ |
August 9 | 50 | 155°12′2.90″ | −83°58′22.08″ |
Geometric distortion is another important factor affecting the results of image processing, especially when combing geospatial data from different dates or multiple sources. Dai and Siamak [
56] noted that the geometric error even on a sub-pixel level can significantly affect the accuracy of land use classification from satellite images. The precision of geometric correction depends on the number, distribution, and accuracy of the Ground Control Points (GCPs) [
57]. To avoid labor intensive work of
in-situ GCP collection, Zhao
et al. [
58] developed a geometric correction method for georeferencing multi-source geodata by using TerraSAR-X data as a reference. The same method was applied in this study. Specifically, a stacked TerraSAR-X image produced from five dates served as the reference to georectify the FS-2 images. The main georeferencing parameters are shown in
Table 3. The positional error (PE) was less than 6 m for all three images. More details about the method can be found in Zhao
et al. [
58].
Table 3.
Main georeferencing parameters for the FS-2 images.
Table 3.
Main georeferencing parameters for the FS-2 images.
Image Capture Date | Number of Control Points | Number of Check Points | PE of Control Points (m) | PE of Check Points (m) |
---|
June 24 | 100 | 20 | 2.991 | 5.901 |
July 6 | 104 | 20 | 4.143 | 5.032 |
August 9 | 101 | 20 | 3.353 | 5.666 |
The mean reflectance and VIs for each sample plot were calculated using the “Zonal Analysis” tool in ERDAS IMAGINE 2013. For each of the three FS-2 images, correspondingly 42 RS samples were extracted and used to develop the empirical regression models for crop status monitoring.
4.2. Mapping Rice Cultivation Areas
A rice area map of the Qixing Farm in 2009 was produced using aforementioned MDA. In particular, multi-temporal RS and GIS data were integrated to improve the accuracy of rice mapping. Based on the properties/attributes of the different datasets, a knowledge base was constructed and implemented, using the Knowledge Engineering tool mounted in ERDAS IMAGINE 2013. A series of logical rules were created in the Knowledge Engineering tool to integrate the RS and GIS data in order to achieve higher classification accuracy.
The general steps for delineating rice areas are summarized as follows: (1) a supervised classification based on the maximum likelihood algorithm was carried out. In order to extract more unique spectral signatures, more than five subclasses were classified firstly (for the image of June 24: nine subclasses in total with one of them being rice; for the image of July 6: 16 subclasses in total with three of them representing rice; for the image of August 9: 11 subclasses in total with three representing rice); (2) the resulting subclasses were further combined to five main classes, including rice, dryland, forests, residential areas, and “other areas” for each image; (3) isolated pixels were eliminated using the “clump and eliminate” functions embedded in ERDAS IMAGINE 2013; (4) an expert classification system based on knowledge rules was implemented to integrate and improve the rice classification results from multiple dates; and (5) the vector GIS data were used as auxiliary dataset in the post-classification process to further improve the accuracies of rice classification.
In a MDA study, Waldhoff
et al. [
13] reported that the support vector machine (SVM) approach and the maximum likelihood classifier (MLC) yielded similar classification accuracies. They found although the SVM method performed slightly better (up to 3%) in three of the four cases, the MLC had a shorter processing time. Therefore, the MLC was selected in this study to derive multi-temporal high resolution land cover classifications over a large study area. After the classification, the rice classes derived from the three RS images were “combined” by logical rules to take advantage of the specific spectral information corresponding to growth stages. Specifically, in the first step of the supervised classification, to improve the producer’s accuracy of rice classes, relatively “broad” spectral information was selected as rice spectral signatures. Thus, some other land covers such as “dryland” and “bare soils” may have been classified into rice classes as well. However, these misclassified areas from one single image could be different from those areas from the other dates, because spectral differences between rice and other land covers vary with growth stage [
59]. In the next step, using the knowledge base, the pixels classified into rice classes in all three RS images were categorized into a new “rice class 1”. The areas classified into rice classes in two RS images were categorized into a new “rice class 2”. After that, these refined RS rice areas were further “combined” with the GIS data (rice area masks). To assign a pixel into the final rice class, the following conditions had to be satisfied: (1) this pixel was in the “rice class 1”; or (2) this pixel was in the “rice class 2” and in the GIS rice mask.
4.3. Ground Truth Data Interpolation
Ground truth data for each site were collected at the tillering stage (28–30 June), the jointing stage (10–12 July), the heading stage (2–8 August), 20 days after heading (22–28 August), and the harvest time. The FS-2 images were acquired on 24 June, 6 July and 9 August, at slightly different dates from the field campaigns. Therefore, the values of the agronomic variables of biomass, LAI, plant N concentration and N uptake were interpolated for these three FS-2 dates. First, a specific polynomial growing curve for each site was constructed based on the time series of ground truth data. The values of the agronomic variables for the three RS-2 dates were then interpolated. These interpolated values were used as the new ground truth data to explore their relationships with the satellite image reflectance and VIs.
4.4. Development of Regression Models for Deriving Agronomic Variables
A multiple linear regression method was constructed for each agronomic variable based on reflectance values and the VI derived from the FS-2 images and the corresponding field measurement. The VI used in this method was treated as an equivalent factor of reflectance. Correlation and regression analyses were performed in SPSS 21 (SPSS, Inc., Chicago, IL, USA). The format of the multiple linear regression models was as follows:
The NDVI is calculated by the following function:
R stands for the reflectance value at the subscripted satellite band. In Equation (1), YE stands for the estimated agronomic variable;
is the coefficient of the reflectance at the corresponding band/vegetation index. The performance of the regression model was evaluated by the coefficient of determination (R2).
Biomass and LAI are essential crop physiological variables which determine the crop yield. LAI refers to the ratio of leaf surface area to ground area. It is a fundamental canopy parameter in agronomy and RS since it drives absorption of solar radiation and evapotranspiration for carbon assimilation, and thus primary production. In this research, both biomass and LAI were related to the FS-2 band reflectance based on the aforementioned Equation (1).
N is one of the most remobilizable elements during the reproductive stage in rice plants [
60]. Since its remobilization causes leaf senescence, it is directly related to crop productivity [
61]. Accurate plant N status detection to develop site specific N management strategies for rice in the SJP is of importance regarding both agricultural and environmental aspects [
39]. In this study, plant N concentration and N uptake were also derived from the FS-2 images using the regression model represented by Equation (1).
4.5. Validation of the Regression Models
The regression models were evaluated in a validation analysis. The feasibility of the model was quantified by the statistical measures of relative error (RE) and index of agreement (IA). In a further step, scatterplots were generated to assess the performance of the regression models. The RE is the ratio of the Root Mean Square Error (RMSE) to the mean of observed values, describing the differences between the predicted and the observed values relative to the mean of the ground truth values. The IA represents the degree of agreement between the model estimations and observed values [
62]. It is calculated as:
where
is the observed value,
is the model-simulated value, and
is the mean of observed values. The denominator in Equation (3) was defined as a “potential error” by Willmott [
62]; therefore the IA represents the ratio between the mean square error and the “potential error”. Although the IA is sensitive to extreme values [
63], it can be interpreted straightforwardly since it ranges from 0–1.
7. Conclusions
Due to the integration of multi-temporal FS-2 imagery and GIS data, the rice cultivation areas in the Qixing Farm County were more clearly identified. The overall accuracy of the entire classified map was improved remarkably from 81.8% to 91.6%. This highly accurate rice cultivation map provides an ideal basis for further analyses of rice crops in the study area.
This study showed that the performance of the regression models was significantly affected by rice growth stages. Thus, an optimized band selection for every growth stage is important due to the varying spectral reflectance properties. Based on the R2 values, relatively higher goodness-of-fit values were found in the biomass and LAI estimation models than in the plant N uptake and plant N concentration models. In particular, for the estimation of plant N concentration, better model goodness-of-fit occurred at the earliest growth stage (tillering) when the N concentration was relatively high. The RS derived values and the interpolated ground truth values for all three dates were highly correlated. The most accurate models with the lowest REs and the highest IA values were found at the heading stage for three of the four agronomic variables, except for the N concentration. In conclusion, this study provides a framework and example of how high resolution satellite RS can support agricultural field management such as fertilizer, irrigation and pesticides management strategies by providing within-field agronomic information on regional scales. The information derived from satellite RS could be further used to study the relations between crop growth and other phenomena such as carbon fixation, climate change, and sustainable management of natural resources.