Next Article in Journal
Joint Sparse Sub-Pixel Mapping Model with Endmember Variability for Remotely Sensed Imagery
Previous Article in Journal
Advancing NASA’s AirMOSS P-Band Radar Root Zone Soil Moisture Retrieval Algorithm via Incorporation of Richards’ Equation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Using MODIS Data to Predict Regional Corn Yields

1
Department of Plant Science, College of Agriculture and Life Sciences, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul 08826, Korea
2
Department of Geoinformatic Engineering, Inha University, 100 Inha-ro, Nam-gu, Incheon 22212, Korea
3
Research Institute of Agriculture and Life Sciences, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul 08826, Korea
*
Author to whom correspondence should be addressed.
Remote Sens. 2017, 9(1), 16; https://doi.org/10.3390/rs9010016
Submission received: 11 October 2016 / Revised: 14 December 2016 / Accepted: 21 December 2016 / Published: 28 December 2016

Abstract

:
A simple approach was developed to predict corn yields using the MoDerate Resolution Imaging Spectroradiometer (MODIS) data product from two geographically separate major corn crop production regions: Illinois, USA and Heilongjiang, China. The MOD09A1 data, which are eight-day interval surface reflectance data, were obtained from day of the year (DOY) 89 to 337 to calculate the leaf area index (LAI). The sum of the LAI from early in the season to a given date in the season (end of DOY (EOD)) was well fitted to a logistic function and represented seasonal changes in leaf area duration (LAD). A simple phenology model was derived to estimate the dates of emergence and maturity using the LAD-logistic function parameters b1 and b2, which represented the rate of increase in LAI and the date of maximum LAI, respectively. The phenology model predicted emergence and maturity dates fairly well, with root mean square error (RMSE) values of 6.3 and 4.9 days for the validation dataset, respectively. Two simple linear regression models (YP and YF) were established using LAD as the variable to predict corn yield. The yield model YP used LAD from predicted emergence to maturity, and the yield model YF used LAD for a predetermined period from DOY 89 to a particular EOD. When state/province corn yields for the validation dataset were predicted at DOY 321, near completion of the corn harvest, the YP model, including the predicted phenology, performed much better than the YF model, with RMSE values of 0.68 t/ha and 0.66 t/ha for Illinois and Heilongjiang, respectively. The YP model showed similar or better performance, even for the much earlier pre-harvest yield prediction at DOY 257. In addition, the model performance showed no difference between the two study regions with very different climates and cultivation methods, including cultivar and irrigation management. These results suggested that the approach described in this paper has potential for application to relatively wide agroclimatic regions with different cultivation methods and for extension to the other crops. However, it needs to be examined further in tropical and sub-tropical regions, which are very different from the two study regions with respect to agroclimatic constraints and agrotechnologies.

Graphical Abstract

1. Introduction

Global warming is projected to accompany more frequent extreme weather events, such as heavy rainfall, high temperature, and drought [1,2]. Although these changes will positively affect crop production in some regions, crop production for food, feed, and fodder will be negatively affected in other regions [3,4], aggravating all dimensions of food security. Timely and reliable information on crop growth and yield at the regional, national and global scales is essential for food security and trade policies [5,6,7].
Notable advances in remote sensing have enabled reliable and timely prediction of crop yields [8,9]. Vegetation indices (VIs), which are calculated by combining the reflectance values of several spectral bands from multispectral satellite systems, have been used directly as crop yield estimators [8,10,11] and/or to estimate intermediate crop growth variables, such as LAI and biomass, for yield prediction [12,13,14,15]. In addition to VIs and crop growth variables, crop phenology information is required for reliable crop yield prediction because the effects of environmental conditions on crop yield differ by growth stage [16]. Crop phenology can be estimated using remote sensing data products [17,18,19,20]. For example, Islam and Bala [17] used NDVI and LAI derived from remote sensing data to identify the planting and ending dates of potato, Jönsson and Eklundh [18] developed TIMESAT for extracting phenological information from remote sensing time series estimates, Kandasamy et al. [19] compared the different methods for the performance to fill gaps and how this affects phenology estimations, and Kandasamy et al. [20] provides an approach to evaluate phenology algorithms and the uncertainty associated with its estimates.
Predicting crop yield using remote sensing data products often depends on an empirical approach that relates VIs alone or in combination with remote-sensing–derived meteorological variables to the reported crop yields [11,21,22,23]. Bolton and Friedl [21] used a simple linear regression model to predict corn and soybean yield using VIs (i.e., NDVI, EVI2, and NDWI) from MODIS in the Central USA and concluded that the EVI2 index exhibited better ability to predict maize yield than the NDVI and that the use of crop phenology information from MODIS improved the model predictability. Johnson [22] developed a regression tree model using the linear and/or exponential relationship of MODIS–derived NDVI and daytime land surface temperature with county-level yield statistics to predict corn and soybean yield for 12 states in central and northern USA. Shao et al. [23] developed simple linear regression model using multi-temporal NDVI from MODIS to predict county–level corn yields for the entire Midwestern USA and confirmed that using a digital elevation model climate data as additional model inputs slightly improved the performance of the regional corn yield model. Although simple models to predict crop yield can be developed [9,24], it is likely that the spatial portability of the model would be limited because the parameters of the empirical equation that were estimated in the study region would not be applicable to the other regions with different agroclimate and agrotechnologies. Additionally, VIs such as NDVI can detect inter-annual fluctuation of crop yield due to weather conditions while they cannot detect human-induced factors that result in increased crop yield [9], making the NDVI–crop yield regression model difficult to extend to other regions [10,25]. Sophisticated approaches requiring data in addition to remote sensing data have also been developed to improve predictions of crop growth and yield [9,26,27]. For instance, Prasad et al. [27] used a piecewise linear regression model with a break point to predict corn and soybean yield using monthly NDVI, soil moisture, surface temperature, and total rainfall. Nevertheless, it is preferable to develop a crop yield prediction model with wide spatial portability using a small dataset [28].
In addition to VIs, remote-sensing–derived LAI has also been used to predict crop biomass and yield [29,30,31]. LAI, the one side total leaf area per unit of ground area [32], is a key biophysical variable used to determine crop growth and yield [33,34]. Son et al. [31] developed a regression model using MODIS–LAI and EVI as input variables to predict rice yield in the Mekong delta, Vietnam. LAD is the integrated value of LAI over time. Although LAD is an important crop parameter that has a strong correlation with dry matter production and grain yield [35,36], it has not been used in a yield prediction model using remote sensing data. LAD has been reported to have positive correlation with corn yield under water and nitrogen stress conditions imposed at different growth stages [37] and under varying planting densities of three maize hybrids [38], and genetic differences in photosynthetic duration (longer LAD) were reported to be associated with a longer grain filling duration and higher yield [39]. These findings suggest that LAD would have greater potential to represent corn yield variability in regions with diverse agroclimate and agrotechnologies compared to VIs and LAI at a particular crop growth stage.
The objectives of this study were to develop a simple approach to predict crop phenology and corn yield using LAI and LAD derived from a minimum set of remote sensing data products and to examine the spatial portability of the simple method. This study focused on predicting corn yield in major production areas of the USA and China with different agroclimates and agrotechnologies.

2. Materials and Methods

2.1. Study Area

A simple approach using remote sensing data products was developed to predict crop yields in major crop production areas in the USA and China, where varieties and cultivation methods differ considerably. Illinois, USA (40°0′0″N, 89°0′0″W) (Figure 1a) and Heilongjiang Province, China (47°50′0″N, 127°40′0″E) (Figure 2a) were selected as the regions of interest. Illinois and Heilongjiang are located on opposite sides of the Earth. The latitude of Heilongjiang is higher than that of Illinois. Accordingly, the environmental conditions (e.g., temperature, precipitation, and solar radiation) differ between sites. The annual mean temperature in Illinois is approximately 10 °C, while in Heilongjiang it is −4 °C to 4 °C. These different environmental conditions explain why different crop varieties and cultivation methods are used at the two sites [40].
The corn production area and the quantity of corn produced as a proportion of the national total in Illinois were approximately 32% and 15% in 2013, respectively. The corresponding values in Heilongjiang were approximately 9% and 12%.

2.2. Data and Processing

2.2.1. Crop Yield and Phenology Data

Corn yields in Illinois and Heilongjiang (Figure 3) were obtained from official reports provided by statistical services in each country to evaluate the reliability of a model for predicting corn yield. In Illinois, crop yields from 2000 to 2013 were gathered from the National Agricultural Statistics Service (NASS) by agricultural district (AD) (available at https://quickstats.nass.usda.gov/). Crop area and total production were also obtained from NASS by county and state. The unit system for crop yield in Illinois was converted from “bushels per acre” to “kilograms per hectare” to fit the unit system used in Heilongjiang.
The yields and planted area for 2002 to 2012 in Heilongjiang Province were collected from the Heilongjiang Statistical Yearbook by prefecture and province. The corn yields in Heihe and Daxinganling prefectures in Heilongjiang Province were excluded from the calculation to minimize the computing resources because mean corn production was considerably lower than in other areas of the province.
The data of corn emergence and maturity date for Illinois were obtained from weekly crop progress reports by the NASS Illinois Field Office (IFO). These data, which were provided by AD, were only available for five ADs (northwest, northeast, central, western, and eastern districts) between 2003 and 2012 due to missing data in some ADs. The dates of emergence and maturity were obtained. Because of the limited availability of phenology data in Heilongjiang Province, a model was established and validated to predict the phenological stages using data only for the ADs in Illinois, USA and was applied to predict the phenological stages in Heilongjiang, China.
The phenology data were used to evaluate the reliability of a model to predict corn phenological dates using remote sensing data. The date on which a given phenological stage reached 50% in the study area, e.g., an AD, were used to represent when a phenological stage occurred. It was assumed that the proportion of area in a given stage would increase linearly over a two-week period. The weekly data for the proportion of area in which the crop was at the phenological stage of interest were collected from the NASS IFO. Those weekly data were used to compare the estimated dates on which a phenological stage occurred in the AD. For example, the day on which the corn crop reached a phenological stage was determined by linear interpolation between the weekly proportion of area for the given phenological stage.

2.2.2. Crop Cover Data

Crop cover data for Illinois (Figure 1b) were obtained from cropland data layers (CDL) provided by NASS to identify the area where a given crop was grown (available at https://www.nass.usda.gov/Research_and_Science/Cropland/SARS1a.php). Crop cover data were obtained from 2000 to 2013. The crop cover data from 2002 to 2012 in China (Figure 2b) were generated using remote sensing [30]. Crop areas were identified from the MODIS land cover type product (MCD12Q1). Major crops, including corn, within the crop areas were classified using a maximum likelihood classifier and time-series MODIS 16-day NDVI dataset (MOD13Q1). Crop cover data, which have a spatial resolution of 250 m, were subjected to reprojection using ENVI (ExelisVIS: Exelis Visual Information Solutions, Boulder, CO, USA). The projections of the crop cover maps in both regions were converted to a Universal Transverse Mercator (UTM) projection and WGS-84 coordinates at 1 km spatial resolution.

2.2.3. Remote Sensing Data

The MODIS surface reflectance data for 2000–2013 were obtained from the MOD09A1 data, which have a spatial resolution of 500 m. These data were downloaded from Reverb, which is a web-based remote sensing data portal operated by the National Aeronautics and Space Administration (available at http://modis.gsfc.nasa.gov/). The h10v04, h10v05, h11v04, and h11v05 tiled grid data for Illinois were collected from DOY 89 to 329. The h26v04 and h27v04 tiled grid data were obtained for the same period in Heilongjiang.
The near-infrared (NIR; band 2) and red (band 1) band of reflectance data were prepared to estimate LAI using a series of data tools (Figure 4). First, all tiled reflectance data were mosaicked into a single dataset using interface description language (IDL; ExelisVIS). The projection of mosaic data was converted to UTM projection at 1 km spatial resolution and WGS-84 geographic latitude and longitude coordinates using IDL, which applies the triangulation wrap and nearest neighbor resampling methods. Reflectance data were resized to fit the size and georeference of crop cover data using FWTools, which is a collection of open-source GIS applications. Gridded data for the area where a given crop was grown by year were prepared by extracting the reflectance data for the given extent to compare grids belonging to the crop of interest in the crop cover data using MATLAB (MathWorks Inc., Natick, MA, USA).
Cropland was identified using the proportion of crop cover in each pixel of the MODIS product using a mixed problem and low resolution [21,23]. Crop cover data were overlapped with the MODIS data to calculate the percentage of pixels where a given crop was grown in the research region. It was assumed that the crop of interest would be grown within pixels where the percentage of corn was >60% and >90%. Although identifying a crop using the percentage of crop cover could be used for pure growing regions of the crop, a percentage value was not set for the identification method, so the mixed problem differed by region and crop. This method can cause problems when integrating from a smaller to a larger scale due to the pixels below the threshold percentage being excluded. In this study, the pixels for corn growing in Illinois and Heilongjiang were identified according to the data standardization procedure described in Figure 4 and the number of identified pixels in Illinois and Heilongjiang were approximately 610,000 and 860,000, respectively.

2.3. Estimation of LAI

2.3.1. Estimation of LAI Using Remote Sensing Data

LAI was calculated using the NIR and red bands of the reflectance data. Nguy-Robertson et al. [41] suggested that the combined vegetation index (CVI) is more accurate than a single vegetation index set to estimate LAI. Instead of using the LAI products from the MODIS data, the LAI value was calculated as follows [31]:
LAI = { ( NDVI 0.28 ) / 0.28 ,   NDVI   0.7 ( SR 1.0 ) / 3.5 ,   NDVI   > 0.7   ,
where NDVI and SR are the normalized difference vegetation index [42] and simple ratio [43], respectively. NDVI and SR were determined as follows:
NDVI = ( NIR red ) / ( NIR + red )
SR = NIR / red ,
where NIR and red indicate the near-infrared and red band spectra, respectively.

2.3.2. Estimation of Daily LAI Using a Logistic Function

Various smoothing algorithms have been used to reduce the noise in remote sensing time-series data [44]. The Savitzky–Golay, asymmetric Gaussian, double-logistic, Whittaker smoother, and discrete Fourier transformation smoothing algorithms have been applied to the NDVI data in the MODIS product. In this study, MODIS–derived LAI was smoothed using a simple method. It was assumed that the total sum of the LAI over time would be shaped like a logistic function. For example, the LAI would be negligible early in the season until the emergence date. However, the daily LAI sum would increase rapidly during the vegetative growth stage, but the sum of the LAI would become negative after flowering. Finally, the sum of the LAI over a season would remain relatively constant until harvest after physiological maturity (Figure 5).
A logistic function was used to represent the temporal change in leaf area duration (LAD), which is the integral of LAI over time as follows (Equation (4)):
LAD   ( t ) = L A I ( t )   d t =   b 3 1.0 + exp [ b 1 ( t b 2 ) ] ,
where b1 represents the rate of LAI growth, b2 represents the date of the maximum LAI, b3 represents the cumulative LAI at physiological maturity, and t indicates days after planting.
Although it would be preferable to determine t as the date after the exact planting date, it was challenging to obtain this date in each grid cell. Because a logistic function was used in Equation (4), it was assumed that t could be determined using a date earlier than the actual planting date. In this study, DOY 89 was used as the beginning of the cropping season, which was a date prior to the actual planting for the regions of interest. The order of the remote sensing data products since DOY 89 was used to determine t and conveniently estimate the logistic function parameters. For example, t was 2 when data production on DOY 97 was used for Equation (4). The simplex method [45] was used to determine the parameter values of Equation (4) for each grid cell. The sum of the LAI derived from the MODIS product until a given date was compared with that obtained from Equation (4). The LAI values from 89 to 337 DOY were accumulated at eight-day intervals for each grid cell at a 1-km resolution. The sum of the square error between the observed and simulated values of cumulative LAI was minimized to obtain parameter values for b1, b2, and b3 in the simplex algorithm.
The derivative of Equation (4) on a given date represented the LAI on that date:
LAI   =   b 3 · b 1 · exp [ b 1 ( d     b 2 ) ] { 1.0   +   exp [ b 1 ( d     b 2 ) ] } 2 .
Daily changes in LAI (Figure 6) were determined for each grid cell.

2.4. Prediction of Phenological Dates

The dates of emergence and maturity were estimated using a simple empirical equation (Equation (6)). It was assumed that date DP for a given phenology P could be determined using b1 and b2 from the LAD-logistic function (Equation (5)), which represent the rate of LAI increase and the date of maximum LAI at a given site, respectively, as follows:
D P = b 2 + τ P + ρ P / b 1 ,
where τ P and ρ P are empirical coefficients, τ P represents the overall time difference between the date of the maximum LAI and a given phenological stage P, and ρ P represents the impact of the increase in LAI on the phenological change over time. τ P and ρ P were estimated using the simplex method. The NASS–derived dates for a given phenology were compared with those obtained from Equation (5).

2.5. Prediction of Crop Yield

Daily LAD was also determined from the date of emergence to that of maturity for each grid cell, as follows [46]:
LAD =   ( L A I d + 1   +   L A I d ) 2 ,
where LAId is the LAI value on d. The sum of the LAD values was obtained for the growing periods, e.g., from emergence to maturity, for each grid cell. Then, the sum of those values was averaged by region, e.g., AD and prefecture in a season as follows:
ALAD =   1 n c = 0 n d = 0 m L A D d c ,
where c and d represent the grid cell index in the region in which the corn was grown and the date index from emergence to maturity at c, respectively.
Huang et al. [24] reported that crop yield can be determined with the LAI using simple linear regression. In this study, ALAD was used as an independent variable in the simple linear regression to predict grain yield. The coefficients of the regression line were obtained for the reported yields and yield predictions for a given district D in a season as follows:
Yield D = α ALAD D + β ,
where α and β are coefficients estimated by the least squares difference method.

2.5.1. YP Model Using LAD Accumulated from the Estimated Emergence Date

The YP model used the predicted phenological stages and smoothed LAI values to calculate LAD. LAI values were accumulated from the emergence to maturity dates predicted using Equations (4) and (6). The last date on which the remote sensing data products were used was denoted EOD. The EOD value was the date elapsed from the initial date of analysis when the approach was used in other regions, e.g., in the southern hemisphere. However, DOY was used to indicate the EOD for simplicity. The coefficients of the simple linear regression for the YP model were obtained for the relation between the reported yields and yield estimates. No remote sensing data was available from EOD to maturity if the EOD was earlier than the maturity date. In such cases, Equation (5) was used to estimate daily LAI until maturity.

2.5.2. YF Model Using LAD Accumulated from an Arbitrarily Fixed Date

To estimate the dates of emergence and maturity, phenology data in the region of interest must be available to determine τ P and ρ P for Equation (6). Therefore, the Yp model would not be applicable to a region where few phenology data are available. Instead of using the smoothed LAD values from the predicted emergence to maturity date, the YF model used the LAD accumulated with LAI values calculated from Equation (1) for the period representing the entire growing season, e.g., from DOY 89 to 337. The ALAD value was calculated to determine yields using Equation (9) between DOY 89 and a particular EOD or between DOY 89 and 337. The coefficients of the simple linear regression for the YF model were obtained for the relations between the reported yields and yield estimates.

2.5.3. Comparison between the YP and YF Models

The YP and YF models were different with respect to the period and the LAI resource data used to calculate LAD. The YP model used LAD from the emergence to maturity dates predicted using Equations (4) and (6), and LAD was calculated from the LAI values smoothed using Equations (4) and (5). The YF model used LAD from DOY 89 to a particular EOD, and LAD was calculated with raw LAI values derived from Equation (1).

2.6. Classification of the Calibration and Validation Datasets

Yield data were classified into two groups for the calibration and validation of phenology and yield. Three years in which yield data were available for both Illinois and Heilongjiang were chosen randomly for validation of corn yield prediction at the state scale. Calibration datasets for predicting phenology and yield at the district scale were selected randomly from approximately 70% of the remaining datasets, including data for the 2000, 2001, and 2013 seasons, during which yield data were available in either China or the USA. In total, 69 and 62 sets of yield data were used as the calibration datasets for Illinois and Heilongjiang, respectively (Figure 7). Yield data at the district scale, e.g., AD and prefecture for the USA and China, respectively, were pooled to determine the empirical parameter values, including τ P , ρ P , α, and β. The other datasets (i.e., 30 and 26 sets in Illinois and Heilongjiang, respectively) were used to validate the yield models.
τ P and ρ P were determined for districts where phenology data were available in the calibration datasets. For example, all phenological dates were available in only five ADs in the USA for 2003 to 2012. The simplex method was used to determine τ P and ρ P for emergence and maturity, respectively. The crop phenology prediction model was validated with districts where phenology data were available in the validation datasets. Because no phenological data were available in China, τ P and ρ P obtained from the USA were used to determine whether Equation (5) could be used to predict crop yield.

2.7. Degree of Agreement Analysis

The degree of agreement statistics was determined by spatial scale, season, and variable. The RMSE value was determined to compare the observed and estimated phenology dates, as follows:
RMSE = 1 n i = 1 n ( P i O i ) 2   ,
where n represents the number of comparisons, and Pi and Oi are the estimated and observed data.
Four statistical measures correlation (R2), normalized root mean square error (NRMSE), concordance correlation coefficient (CCC), and RMSE, were determined for crop yield. The yield of each grid cell was summarized by individual season and district, e.g., AD and prefecture, to compare with the reported yields at the regional scale. Yields by region were aggregated to compare the predicted and reported yields at the state scale, e.g., Illinois and Heilongjiang, by season. The NRMSE was determined as follows [47]:
NRMSE = RMSE × 100 M ,
where M is the mean yield reported by the statistical agencies in China and the USA. The simulated results were considered to be either excellent (NRMSE < 10%), good (10% < NRMSE < 20%), fair (20% < NRMSE < 30%), or poor (NRMSE > 30%). The CCC value, which was used to represent precision and accuracy, was determined as follows [48]:
CCC = 2 ρ σ s σ y σ x 2 + σ y 2 + ( μ x μ y ) ,
where ρ is the correlation coefficient between the estimated and reported data, and σ and μ are the standard deviation and mean of the estimated and observed data, respectively. CCC ranges from −1 to 1, where −1 and 1 represent perfect disagreement and agreement, respectively, and 0 represents independence between the estimated and reported data [49].

3. Results

3.1. Crop Phenology

The estimated τ P and ρ P in Equation (6) for the calibration datasets differed by the last date on which the remote sensing product was used (EOD) (Figure 8). τ P and ρ P tended to group together by EOD. For example, τ P and ρ P for the emergence and maturity dates were similar after EOD 257.
The RMSEs for the occurrence date of the phenological stage for the calibration datasets differed by phenological stage (Figure 9). For example, the RMSEs for maturity (<7 days) were relatively higher than those for emergence (<5 days). Temporal changes in RMSE differed by phenological stage. Although RMSE decreased with increasing EOD until EOD 257, the error values after EOD 257 were relatively similar.
The crop phenology prediction model was validated for EODs 209, 257, and 321. Corn flowering occurred on approximately DOY 209 in Illinois, and the corn harvest was completed near DOY 321. DOY 257 was selected to examine the reliability of the yield predictions in advance because the date was one of the earliest EODs on which maturity dates for corn were reliably predicted in the calibration dataset.
The occurrence dates of a given phenology stage were estimated within a reasonable range of error (Figure 10). For example, emergence and maturity dates were estimated within seven days for most districts, e.g., 67% and 75%, respectively, when these phenological stages were estimated on EOD 257.
The degree of agreement tended to be higher for the estimates of the timing of maturity than those of emergence (Table 1). For example, the RMSE values for maturity were lower than those for emergence. Temporal changes in the degree of agreement also differed.

3.2. Crop Yield at the District Level

Estimated α and β values of Equation (9) for the calibration datasets changed with EOD in the YP model (Figure 11) and grouped together after EOD 257 because they depended on an estimated date for a phenological event, e.g., emergence or maturity. The α and β values changed over the EODs in the YF model using an arbitrarily fixed starting date of DOY 89.
During calibration, the YP model had a greater degree of agreement for predicting crop yield than the YF model (Figure 12). The R2 values for all EODs of the YP model were higher than those for the YF model when predicting crop yield by district. Although both models predicted differences in crop yield between Illinois and Heilongjiang, the YP model was more precise than the YF model (Figure 13).
The crop yield prediction model was validated for EODs 209, 257, and 321. The corn flowering date occurred on approximately DOY 209 in Illinois, and the corn harvest was completed near DOY 321. DOY 257 was selected to examine the reliability of the yield predictions in advance because the date was one of the earliest EODs on which maturity dates for corn were reliably predicted in the calibration dataset.
The YP model based on the phenology dates identified from the logistic function had a lower error than that of the YF model based on fixed dates (Table 2). The YP and YF models predicted differences in yield between the two regions, and corn yield was considerably lower in Heilongjiang than in Illinois. Although the YP and YF models for EOD 257 had similar R2, the YP model had greater R2 than the YF model for EODs 209 and 321. The YF model always had a higher RMSE than the YP model.
The errors in yield prediction were similar for EODs 257 and 321 in the YP model. The NRMSE of the crop yield prediction on EOD 257 was slightly greater than that on EOD 321. However, the YF model had the lowest error on EOD 257.

3.3. Crop Yield at the State/Province Level

The statistical indices for the corn yield prediction model over three randomly selected years (2003, 2009, and 2012) are shown in Table 3. The YP model showed high statistical measures for the degree of agreement in corn yield prediction in Illinois and Heilongjiang. In particular, the YP model performed best in Illinois, while the YF model had a similar performance in both regions. The R2 and CCC values of both prediction models and regions, except for EOD 209, were >0.87 and 0.68, respectively. The NRMSE values were 7.39–13.59. Both prediction models exhibited good corn yield prediction performance in the two regions.
The reported and predicted corn yields by state/province for the prediction models at EOD 257 in Illinois and Heilongjiang are compared in Figure 14.

4. Discussion

Simple models were developed to predict corn phenological stages and yield using only the red and NIR band surface reflectance data of the MODIS products. Sakamoto et al. [50] suggested that the temporal discontinuity of remote sensing data makes it difficult to estimate crop properties, e.g., phenological dates; therefore, different approaches have been developed. In this study, the combination of Equations (4) and (5) allowed us to overcome data discontinuity to estimate the phenological date. The remote sensing data revealed fluctuations in LAI during a season, as shown in Figure 6. However, the integrative approach using Equation (4) made it possible to identify the date of maximum LAI accurately when EOD was later than DOY 249.
The phenology prediction model performance for predicting emergence and maturity dates differed depending on EOD. The errors for predicting emergence and maturity were greater when an EOD before the maximum LAI was used. For example, a reliable and consistent value of b2 in Equation (4) could be obtained using remote sensing data only after the flowering period until near harvest. Because τ P and ρ P depend on b2, long-term data were used to represent the growth conditions and to obtain reliable estimates of τ P and ρ P . The phenology model accuracy for maturity date prediction was relatively higher than for the emergence date prediction. Although the phenology, including the maturity date, depends on the characteristics of the cultivar, such as the physiological responses to environmental conditions, like photoperiod and temperature [51], changes in phenology are closely related to changes in LAI [52]. For example, flowering occurs near the time of maximum LAI, which is b2 in Equation (4). Once the time of maximum LAI is predicted accurately, the maturity date can be reliably identified. Estimating the maximum LAI using remote sensing products is more accurate than measuring the LAI early in the season when LAI is <1 [53]. Emergence dates are related to planting dates, and the planting date is influenced by the soil temperature and soil moisture and varies widely by district and season (see Appendix A). For example, 33.6 °C is the optimal soil temperature for corn germination [54]. Thus, the planting date is delayed until the temperature condition is met in a specific field [55,56], which results in large variability in the length of the effective growing season in a particular region. As a result, the errors in the emergence dates were relatively higher than the errors in the maturity dates. The present crop phenology prediction model based on MODIS–derived LAD–logistic function provided more reliable and accurate maturity date predictions than those of previous studies. Sakamoto et al. [57] reported that the RMSE of the maturity date for corn in Illinois was 5.9 days when using a two-step filtering method and WDRVI derived from MODIS data. The RMSE value of the maturity date on EOD 257 was approximately 17% less than in a previous study. In contrast, the model resulted in greater prediction error for the emergence date. For example, the RMSE of the emergence date predicted on EOD 257 was 6.3 days, whereas the value in a previous study [57] was 4.9 days. The specific shape of seasonal LAI pattern by crop were used to predict phenological dates in the previous study. The present phenology prediction model, which is based on MODIS–derived LAD-logistic function and does not need specific shape of seasonal LAI pattern by crop, can be applied to other crops.
The α and β values in the YP model using the predicted phenological dates grouped together after EOD 257 (Figure 11). The α and β values of the model depended on the reliability of b3, which represents the maximum cumulative LAI, LAD. When an EOD before the maximum LAI was used, b3 was less reliable because the maximum LAI had not been reached in the field; thus, data up to a specific period after the maximum LAI are essential to reliably predict b3. Due to errors in remote sensing data, e.g., caused by clouds and rainfall, maximum LAI estimates can have considerable errors, even after the maximum LAI. Nevertheless, α and β were similar after a set of EOD, suggesting that a small range of α and β values could be obtained to predict corn yield. The α and β values for the YF model using an arbitrarily fixed starting date depended on the last DOY (i.e., EOD) used for the LAD calculation, and the error for the predicted corn yield also depended on EOD because LAI decreased after the maximum LAI. This model also required data up to a specific period (i.e., EOD 257) after maximum LAI to predict corn yield reliably.
The errors for predicting corn yield in the YP and YF models at the AD level were relatively small. The RMSE for the YP model was 1.08 t/ha and 1.09 t/ha at EOD 257 for 20 districts (Table 2). In previous studies, the RMSE of the predicted corn yield ranged from 1.2 t/ha to 1.7 t/ha, depending on the region of interest and season [22,58,59]. Doraiswamy et al. [59] reported an RMSE of 1.21 t/ha for corn yield in several Illinois counties. Because of differences in spatial extent and scale, e.g., district or county and extent, caution is needed when interpreting the differences in RMSE values between the current and previous studies. Nevertheless, the RMSE of both the YP and YF models tended to be smaller than those of the previous studies, which merits further validation in a variety of additional regions.
For the validation datasets for each region, the errors for predicting corn yield at the state/province scale were relatively small in both the YP and YF models. When yield prediction was performed after EOD 257, the YP model exhibited RMSE values of 0.68–0.69 t/ha in Illinois and 0.66–0.96 t/ha in Heilongjiang, while the YF model had RMSE values of 1.0–1.1 t/ha in Illinois and 0.82–0.84 t/ha in Heilongjiang (Table 3). Even for the validation dataset for both regions, the two models revealed little decay in corn yield prediction performance compared with the validation dataset for each region, with RMSEs of 0.84 and 0.93 t/ha across the two regions for YP and YF, respectively. In addition, the current model performances were comparable to previous studies: the RMSE values for predicting corn yield ranged from 0.62 to 1.45 t/ha in major USA production regions [23,27] and Sakamoto et al. [58] reported that the RMSE of the corn yield prediction was 0.83 t/ha in Illinois. The corn yield prediction errors using only LAD result from changes in specific leaf area (SLA) in a given season and region. SLA is the ratio of leaf area to leaf biomass [60] and is affected by environmental conditions [61], such as weather and disease [52]. As a result, SLA varies by season, even when the same crops are cultivated at a given site [30]. Because remote sensing products represent LAI instead of biomass, the change in SLA affects the accuracy of the biomass and yield estimations when using LAI. Therefore, the inclusion of other factors affecting SLA would improve the performance of yield prediction model using LAI and LAD as yield estimators.
Corn management practices were different between Illinois and Heilongjiang. Irrigation practices have been widely adopted in Illinois, while rainfed cultivation is a common practice in Heilongjiang. Corn cultivars were also different by latitude and temperature, and corn yield in Illinois was higher than corn yields in Heilongjiang. Nevertheless, the current yield prediction models using only LAD showed similar high performance for the two study regions, Illinois and Heilongjiang, which have different crop cultivar characteristics, crop management, and environments. This result is due to the nature of LAD used as predictor; the fluctuation in LAD accurately represents the yield variation caused by environmental stress, such as drought, cultivation management, including nitrogen fertilization and planting density, and the genetic improvement of corn hybrids [37,38,39].

5. Conclusions

Simple approaches to predicting corn phenological stages and yields were developed using a minimum MODIS product dataset. Only the red and NIR band surface reflectance data were used to estimate the LAI. Rather than using the reported techniques for filtering/smoothing the LAI data, the LAI data summed over a cropping season (LAD) were fitted to a logistic function for smoothing. A phenology prediction model was established using the MODIS–derived LAD-logistic function parameters, and it was used to predict emergence and maturity dates within a reasonable range of error. Simple linear regression models were developed to predict yield using LAD over the predicted period from emergence to maturity as a predictor variable and LAD for a predetermined period from DOY 89 to a particular EOD. Our results indicate that these simple models using LAD as a predictor variable could predict yields for the two regions of interest with considerable precision and accuracy. The model including information related to phenology exhibited slightly better performance and could be applied from a fairly early pre-harvest stage of EOD 257. In addition, the model performance showed no difference between the two regions with very different climates and cultivation methods, including cultivar and irrigation management. Irrigation practices have been widely adopted in Illinois, USA, while rainfed cultivation is a common practice in Heilongjiang, China. The approach described in this paper has potential to be applied to relatively wide agroclimatic regions with different cultivation methods and to be extended to additional crops. However, it needs to be examined further in tropical and sub-tropical regions, which are very different from the two study regions with respect to agroclimatic constraints and agrotechnologies.

Acknowledgments

This study was conducted with the support of the Cooperative Research Program for Agriculture Science & Technology Development (Project No. PJ0101072016), Rural Development Administration, Republic of Korea.

Author Contributions

Ho-Young Ban and Byun-Woo Lee contributed to the concept design and development of the research; Ho-Young Ban wrote the main manuscript text, prepared the figures and tables, and performed data processing; Ho-Young Ban, Byun-Woo Lee, and Kwang Soo Kim performed data analysis; No-Wook Park collected the remote sensing data; Byun-Woo Lee and Kwang Soo Kim contributed to manuscript improvements.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Coefficient of variation (CV) values by phenological stage were calculated using NASS–derived data for the validation dataset of the prediction model (Figure A1). The variations in planting and emergence dates were larger than those for flowering and maturity. The variation in the flowering and maturity dates was similar.
Figure A1. CV values for the NASS–derived phenological stages in the validation dataset.
Figure A1. CV values for the NASS–derived phenological stages in the validation dataset.
Remotesensing 09 00016 g015

References

  1. Rosenzweig, C.; Iglesias, A.; Yang, X.B.; Epstein, P.R.; Chivian, E. Climate change and extreme weather events; implications for food production, plant diseases, and pests. Glob. Chang. Hum. Health 2001, 2, 90–104. [Google Scholar] [CrossRef]
  2. Trenberth, K.E. Climate change and extreme weather events. In Proceedings of the Catastrophe Modeling Forum: Changing Climatic Dynamic and Catast; Epstein, P., Ed.; Insurance Information Institute: New York, NY, USA, 2008; p. 7. [Google Scholar]
  3. Kumar, B.; Tuti, A. Effect and adaptation of climate change on fodder and livestock management. Int. J. Sci. Environ. Technol. 2016, 5, 1638–1645. [Google Scholar]
  4. Sinha, S.K.; Rao, N.H.; Swaminathan, M.S. Food security in the changing global climate. In Proceedings of the Changing Atmosphere: Implications for Global Security, Toronto, ON, Canada, 27–30 June 1988.
  5. Hutchinson, C.F. Uses of satellite data for famine early warning in sub-Saharan Africa. Int. J. Remote Sens. 1991, 12, 1405–1421. [Google Scholar] [CrossRef]
  6. Kouadio, L.; Duveiller, G.; Djaby, B.; Jarroudi, M.E.; Defourny, P. Estimating regional wheat yield from the shape of decreasing curves of green area index temporal profiles retrieved from MODIS data. Int. J. Appl. Earth Obs. Geoinf. 2012, 18, 111–118. [Google Scholar] [CrossRef] [Green Version]
  7. Macdonald, R.B.; Hall, F.G. Global crop forecasting. Science 1980, 208, 670–679. [Google Scholar] [CrossRef] [PubMed]
  8. Huang, J.; Wang, X.; Li, X.; Tian, H.; Pan, Z. Remotely sensed rice yield prediction using multi-temporal NDVI data derived from NOAA’s-AVHRR. PLoS ONE 2013, 8, e70816. [Google Scholar] [CrossRef] [PubMed]
  9. Kussul, N.; Shelestov, A.; Skakun, S. Grid and sensor web technologies for environmental monitoring. Earth Sci. Inform. 2009, 2, 37–51. [Google Scholar] [CrossRef]
  10. Li, A.; Liang, S.; Wang, A.; Qin, J. Estimating crop yield from multi-temporal satellite data using multivariate regression and neural network techniques. Photogramm. Eng. Rem. Sens. 2007, 73, 1149–1157. [Google Scholar] [CrossRef]
  11. Figueiredo, G.K.; Brunsell, N.A.; Rocha, J.V.; Lamparelli, R.A.; Picoli, M.C. Using temporal stability to estimate soya bean yield: A case study in Paraná state, Brazil. Int. J. Remote Sens. 2016, 37, 1223–1242. [Google Scholar] [CrossRef]
  12. Jaafar, H.H.; Ahmad, F.A. Crop yield prediction from remotely sensed vegetation indices and primary productivity in arid and semi-arid lands. Int. J. Remote Sens. 2015, 36, 4570–4589. [Google Scholar] [CrossRef]
  13. Kolotii, A.; Kussul, N.; Shelestov, A.; Skakun, S.; Yailymov, B.; Basarab, R.; Lavreniuk, M.; Oliinyk, T.; Ostapenko, V. Comparison of biophysical and satellite predictors for wheat yield forecasting in Ukraine. Int. Arch. Photogramm. Remote Sens. 2015, 40, 39–44. [Google Scholar] [CrossRef]
  14. Rembold, F.; Atzberger, C.; Savin, I.; Rojas, O. Using low resolution satellite imagery for yield prediction and yield anomaly detection. Remote Sens. 2013, 5, 1704–1733. [Google Scholar] [CrossRef] [Green Version]
  15. Sibley, A.M.; Grassini, P.; Thomas, N.E.; Cassman, K.G.; Lobell, D.B. Testing remote sensing approaches for assessing yield variability among maize fields. Agron. J. 2014, 106, 24–32. [Google Scholar] [CrossRef]
  16. Prasad, P.V.V.; Staggenborg, S.A.; Ristic, Z. Impacts of drought and/or heat stress on physiological, developmental, growth, and yield processes of crop plants. In Response of Crops to Limited Water: Understanding and Modeling Water Stress Effects on Plant Growth Processes; Ahuja, L.H., Saseendran, S.A., Eds.; Advances in Agricultural Systems Modeling; Series 1; ASA-CSSA: Madison, WI, USA, 2008; pp. 301–355. [Google Scholar]
  17. Islam, A.S.; Bala, S.K. Assessment of Potato Phenological Characteristics Using MODIS-Derived NDVI and LAI Information. Gisci. Remote Sens. 2008, 45, 454–470. [Google Scholar] [CrossRef]
  18. Jönsson, P.; Eklundh, L. TIMESAT—A program for analyzing time-series of satellite sensor data. Comput. Geosci. 2004, 30, 833–845. [Google Scholar]
  19. Kandasamy, S.; Baret, F.; Verger, A.; Neveux, P.; Weiss, M. A comparison of methods for smoothing and gap filling time series of remote sensing observations–application to MODIS LAI products. Biogeosciences 2013, 10, 4055–4071. [Google Scholar] [CrossRef]
  20. Kandasamy, S.; Fernandes, R. An approach for evaluating the impact of gaps and measurement errors on satellite land surface phenology algorithms: Application to 20 year NOAA AVHRR data over Canada. Remote Sens. Environ. 2015, 164, 114–129. [Google Scholar] [CrossRef]
  21. Bolton, D.K.; Friedl, M.A. Forecasting crop yield using remotely sensed vegetation indices and crop phenology metrics. Agric. For. Meteorol. 2013, 173, 74–84. [Google Scholar] [CrossRef]
  22. Johnson, D.M. An assessment of pre-and within-season remotely sensed variables for forecasting corn and soybean yields in the United States. Remote Sens. Environ. 2014, 141, 116–128. [Google Scholar] [CrossRef]
  23. Shao, Y.; Campbell, J.B.; Taff, G.N.; Zheng, B. An analysis of cropland mask choice and ancillary data for annual corn yield forecasting using MODIS data. Int. J. Appl. Earth Obs. Geoinf. 2015, 38, 78–87. [Google Scholar] [CrossRef]
  24. Huang, L.; Yang, Q.; Liang, D.; Dong, Y.; Xu, X.; Huang, W. The estimation of winter wheat yield based on MODIS remote sensing data. In Computer and Computing Technologies in Agriculture V. IFIP Advances in Information and Communication Technology; Springer: Berlin/Heidelberg, Germany, 2011; pp. 496–503. [Google Scholar]
  25. Mkhabela, M.S.; Mashinini, N.N. Early maize yield forecasting in the four agro-ecological regions of Swaziland using NDVI data derived from NOAAs-AVHRR. Agric. For. Meteorol. 2005, 129, 1–9. [Google Scholar] [CrossRef]
  26. Lobell, D.B. The use of satellite data for crop yield gap analysis. Field Crops Res. 2013, 143, 56–64. [Google Scholar] [CrossRef]
  27. Prasad, A.K.; Chai, L.; Singh, R.P.; Kafatos, M. Crop yield estimation model for Iowa using remote sensing and surface parameters. Int. J. Appl. Earth Obs. Geoinf. 2006, 8, 26–33. [Google Scholar] [CrossRef]
  28. Zhang, H.; Chen, H.; Zhou, G. The Model of Wheat Yield Forecast based on MODIS-NDVI: A Case Study of XinXiang. In Proceedings of the ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Science, I-7, XXII ISPRS Congress, Melbourne, Australia, 25 August–1 September 2012.
  29. Casanova, D.; Epema, G.F.; Goudriaan, J. Monitoring rice reflectance at field level for estimating biomass and LAI. Field Crop. Res. 1997, 55, 83–92. [Google Scholar] [CrossRef]
  30. Maki, M.; Homma, K. Empirical regression models for estimating multiyear leaf area index of rice from several vegetation indices at the field scale. Remote Sens. 2014, 6, 4764–4779. [Google Scholar] [CrossRef] [Green Version]
  31. Son, N.T.; Chen, C.F.; Chen, C.R.; Chang, L.Y.; Duc, H.N.; Nguyen, L.D. Prediction of rice crop yield using MODIS EVI−LAI data in the Mekong Delta, Vietnam. Int. J. Remote Sens. 2013, 34, 7275–7292. [Google Scholar] [CrossRef]
  32. De Wasseige, C.; Bastin, D.; Defourny, P. Seasonal variation of tropical forest LAI based on field measurements in Central African Republic. Agric. For. Meteorol. 2003, 119, 181–194. [Google Scholar] [CrossRef]
  33. Noureldin, N.A.; Aboelghar, M.A.; Saudy, H.S.; Ali, A.M. Rice yield forecasting models using satellite imagery in Egypt. Egypt. J. Remote Sens. Space Sci. 2013, 16, 125–131. [Google Scholar] [CrossRef]
  34. Bach, H. Yield estimation of corn based on multitemporal LANDSAT-TM data as input for agrometeorological model. Pure Appl. Opt. 1998, 7, 809–825. [Google Scholar] [CrossRef]
  35. Liu, X.; Jin, J.; Herbert, S.J.; Zhang, Q.; Wang, G. Yield components, dry matter, LAI and LAD of soybeans in Northeast China. Field Crops Res. 2005, 93, 85–93. [Google Scholar] [CrossRef]
  36. López-Bellido, F.J.; López-Bellido, R.J.; Khalil, S.K.; López-Bellido, L. Effect of planting date on winter kabuli chickpea growth and yield under rainfed mediterranean conditions. Agron. J. 2008, 4, 957–964. [Google Scholar] [CrossRef]
  37. Wolfe, D.W.; Henderson, D.W.; Hsiao, T.C.; Alvino, A. Interactive Water and Nitrogen Effects on Senescence of Maize. I. Leaf Area Duration, Nitrogen Distribution, and Yield. Agron. J. 1988, 80, 859–864. [Google Scholar] [CrossRef]
  38. Alias, M.A.; Bukhsh, H.A.; Ahmad, R.; Malik, A.U.; Hussain, S.; Ishaque, M. Profitability of three maize hybrids as influenced by varying plant density and potassium application. J. Anim. Plant Sci. 2011, 21, 42–47. [Google Scholar]
  39. Russell, W.A. Genetic improvement of maize yields. Adv. Agron. 1991, 46, 245–298. [Google Scholar]
  40. Eberhart, S.T.; Russell, W.A. Stability parameters for comparing varieties. Crop Sci. 1966, 6, 36–40. [Google Scholar] [CrossRef]
  41. Nguy-Robertson, A.L.; Gitelson, A.A.; Peng, Y.; Viña, A.; Arkebauer, T.J.; Rundquist, D.C. Green leaf area index estimation in Maize and Soybean: Combining vegetation indices to achieve maximal sensitivity. Agron. J. 2012, 104, 1336–1347. [Google Scholar] [CrossRef]
  42. Rouse, J.; Hass, R.; Schell, J.; Deering, D. Monitoring vegetation systems in the Great Plains with ERTS. In Third ERTS Symposium; NASA: Washington, DC, USA, 1973; pp. 309–317. [Google Scholar]
  43. Jordan, C.F. Derivation of leaf-area index from quality of light on the forest floor. Ecology 1969, 50, 663–666. [Google Scholar] [CrossRef]
  44. Shao, Y.; Lunetta, R.S.; Wheeler, B.; Iiames, J.S.; Campbell, J.B. An evaluation of time-series smoothing algorithms for land-cover classifications using MODIS-NDVI multi-temporal data. Remote Sens. Environ. 2016, 174, 258–265. [Google Scholar] [CrossRef]
  45. Nelder, J.A.; Mead, R. A simplex method for function minimization. Comput. J. 1965, 7, 308–313. [Google Scholar] [CrossRef]
  46. Power, J.F.; Willis, W.O.; Reichman, G.A. Effect of soil temperature, phosphorus and plant age on growth analysis of barley. Agron. J. 1967, 59, 231–234. [Google Scholar] [CrossRef]
  47. Soler, C.; Sentelhas, P.; Hoogenboom, G. Application of the CSM-CERES-Maize model for planting date evaluation and yield forecasting for maize grown off-season in a subtropical environment. Eur. J. Agron. 2007, 27, 165–177. [Google Scholar] [CrossRef]
  48. Lin, L. A concordance correlation coefficient to evaluate reproducibility. Biometrics 1989, 45, 255–268. [Google Scholar] [CrossRef] [PubMed]
  49. Carrasco, J.L.; King, T.S.; Chinchilli, V.M. The concordance correlation coefficient for repeated measures estimated by variance components. J. Biopharm. Stat. 2009, 19, 90–105. [Google Scholar] [CrossRef] [PubMed]
  50. Sakamoto, T.; Yokozawa, M.; Toritani, H.; Shibayama, M.; Ishitsuka, N.; Ohno, H. A crop phenology detection method using time-series MODIS data. Remote Sens. Environ. 2005, 96, 366–374. [Google Scholar] [CrossRef]
  51. Cutforth, H.W.; Shaykewich, C.F. A temperature response function for corn development. Agric. For. Meteorol. 1990, 50, 159–171. [Google Scholar] [CrossRef]
  52. Biswal, A.; Sahay, B.; Ramana, K.V.; Rao, S.V.C.K.; Sai, M.S. Relationship between AWiFS derived Spectral Vegetation Indices with Simulated Wheat Yield Attributes in Sirsa district of Haryana. Int. Arch. Photogramm. Remote Sens. 2014, 40, 689. [Google Scholar] [CrossRef]
  53. Heiskanen, J.; Rautiainen, M.; Stenberg, P.; Mõttus, M.; Vesanto, V.H.; Korhonen, L.; Majasalmi, T. Seasonal variation in MODIS LAI for a boreal forest area in Finland. Remote Sens. Environ. 2012, 126, 104–115. [Google Scholar] [CrossRef]
  54. Itabari, J.K.; Gregory, P.J.; Jones, R.K. Effects of temperature soil water status and depth of planting on germination and emergence of maize (Zea mays) adapted to semi-arid eastern Kenya. Exp. Agric. 1993, 29, 351–364. [Google Scholar] [CrossRef]
  55. Chen, G.H.; Wiatrak, P. Soybean development and yield are influenced by planting date and environmental conditions in the southeastern coastal plain, United States. Agron. J. 2010, 102, 1731–1737. [Google Scholar] [CrossRef]
  56. Thomison, P.; Nielson, R. Impact of delayed planting on heat unit requirements for seed maturation in maize. In Pontificia Universidad Católica deChile; Departamento de Ciencias Vegetales, Seminario Internacional Semillas: Santiago, Chile, 2002; pp. 140–164. [Google Scholar]
  57. Sakamoto, T.; Wardlow, B.D.; Gitelson, A.A. Detecting spatiotemporal changes of corn developmental stages in the U.S. corn belt using MODIS WDRVI data. IEEE Trans. Geosci. Remote Sens. 2011, 49, 1926–1936. [Google Scholar] [CrossRef]
  58. Sakamoto, T.; Gitelson, A.A.; Arkebauer, T.J. MODIS-based corn grain yield estimation model incorporating crop phenology information. Remote Sens. Environ. 2013, 131, 215–231. [Google Scholar] [CrossRef]
  59. Doraiswamy, P.C.; Akhmedov, B.; Beard, L.; Stern, A.; Mueller, R. Operational prediction of crop yields using MODIS data and products. In International Archives of Photogrammetry, Remote Sensing and Spatial Information, Sciences Special Publications, Commission Working Group VIII WG VIII/10; European Commission DG JRC-Institute for the Protection and Security of the Citizen: Ispra, Italy, 2007. [Google Scholar]
  60. Gunn, S.; Farrar, J.F.; Collis, B.E.; Nason, M. Specific leaf weight in barley: Individual leaves versus whole plants. New Phytol. 1999, 413, 45–51. [Google Scholar] [CrossRef]
  61. Kim, S.H.; Hong, S.Y.; Sudduth, K.A.; Kim, Y.; Lee, K. Comparing LAI estimates of corn and soybean from vegetation indices of multi-resolution satellite images. Korean J. Remote Sens. 2012, 28, 597–609. [Google Scholar] [CrossRef]
Figure 1. Map of the USA showing the location of Illinois (a) and crop cover data for Illinois in 2013 (b) (corn is indicated by yellow).
Figure 1. Map of the USA showing the location of Illinois (a) and crop cover data for Illinois in 2013 (b) (corn is indicated by yellow).
Remotesensing 09 00016 g001
Figure 2. Map of the China showing the location of Heilongjiang (a) and crop cover data for Heilongjiang in 2012 (b) (Ccorn is indicated by yellow).
Figure 2. Map of the China showing the location of Heilongjiang (a) and crop cover data for Heilongjiang in 2012 (b) (Ccorn is indicated by yellow).
Remotesensing 09 00016 g002
Figure 3. Reported corn yields from 2000 to 2013 in the central AD, Illinois and from 2002 to 2012 in Harbin Prefecture, Heilongjiang.
Figure 3. Reported corn yields from 2000 to 2013 in the central AD, Illinois and from 2002 to 2012 in Harbin Prefecture, Heilongjiang.
Remotesensing 09 00016 g003
Figure 4. Data-flow diagram of the surface reflectance and crop cover data used to standardize the data.
Figure 4. Data-flow diagram of the surface reflectance and crop cover data used to standardize the data.
Remotesensing 09 00016 g004
Figure 5. MODIS–derived and logistic-estimated cumulative LAI for corn in Illinois. The x-axis denotes the order of the dates for the remote sensing data product. For example, products for DOY 89 and 97 are indicated by 1 and 2, respectively.
Figure 5. MODIS–derived and logistic-estimated cumulative LAI for corn in Illinois. The x-axis denotes the order of the dates for the remote sensing data product. For example, products for DOY 89 and 97 are indicated by 1 and 2, respectively.
Remotesensing 09 00016 g005
Figure 6. MODIS–derived and logistic-estimated LAI for corn in Illinois. The x-axis denotes the order of the dates for the remote sensing data product. For example, the products for DOY 89 and 97 are indicated by 1 and 2, respectively.
Figure 6. MODIS–derived and logistic-estimated LAI for corn in Illinois. The x-axis denotes the order of the dates for the remote sensing data product. For example, the products for DOY 89 and 97 are indicated by 1 and 2, respectively.
Remotesensing 09 00016 g006
Figure 7. Data-flow diagram showing the process used to classify the calibration and validation datasets.
Figure 7. Data-flow diagram showing the process used to classify the calibration and validation datasets.
Remotesensing 09 00016 g007
Figure 8. Estimated τ P and ρ P values by EOD of emergence (a) and maturity (b). The x-axis denotes the τ P values, and the y-axis denotes the ρ P values.
Figure 8. Estimated τ P and ρ P values by EOD of emergence (a) and maturity (b). The x-axis denotes the τ P values, and the y-axis denotes the ρ P values.
Remotesensing 09 00016 g008
Figure 9. RMSE values by EOD for the phenology model estimates using a logistic function and the calibration datasets.
Figure 9. RMSE values by EOD for the phenology model estimates using a logistic function and the calibration datasets.
Remotesensing 09 00016 g009
Figure 10. Comparison between NASS–derived and estimated phenological stages at EOD 257.
Figure 10. Comparison between NASS–derived and estimated phenological stages at EOD 257.
Remotesensing 09 00016 g010
Figure 11. Estimated α and β values for the YP (a) and YF (b) corn yield models. The x-axis denotes the α values, and the y-axis denotes the β values.
Figure 11. Estimated α and β values for the YP (a) and YF (b) corn yield models. The x-axis denotes the α values, and the y-axis denotes the β values.
Remotesensing 09 00016 g011
Figure 12. R2 values by EOD for the corn yield prediction models at agricultural the district/prefecture level.
Figure 12. R2 values by EOD for the corn yield prediction models at agricultural the district/prefecture level.
Remotesensing 09 00016 g012
Figure 13. Comparison of the reported and predicted agricultural district/prefecture-level corn yield at EOD 257 from the YP (a) and YF (b) models in Illinois and Heilongjiang.
Figure 13. Comparison of the reported and predicted agricultural district/prefecture-level corn yield at EOD 257 from the YP (a) and YF (b) models in Illinois and Heilongjiang.
Remotesensing 09 00016 g013
Figure 14. Comparison of the reported and predicted state/province-level corn yields at EOD 257 for the YP (a) and YF (b) models in Illinois and Heilongjiang (validation dataset: 2003, 2009, and 2012).
Figure 14. Comparison of the reported and predicted state/province-level corn yields at EOD 257 for the YP (a) and YF (b) models in Illinois and Heilongjiang (validation dataset: 2003, 2009, and 2012).
Remotesensing 09 00016 g014
Table 1. Statistical indices for the phenological stage estimate model in the validation dataset.
Table 1. Statistical indices for the phenological stage estimate model in the validation dataset.
EODEmergenceMaturity
R2RMSE (Days)NRMSE (%)R2RMSE (Days)NRMSE (%)
2090.515.313.980.277.612.95
2570.416.304.720.704.911.90
3210.356.294.720.784.431.71
Table 2. Statistical indices for the corn yield prediction models at the agricultural district/prefecture level in Illinois and Heilongjiang.
Table 2. Statistical indices for the corn yield prediction models at the agricultural district/prefecture level in Illinois and Heilongjiang.
EODYP ModelYF Model
R2RMSE (kg/ha)NRMSE (%)R2RMSE (kg/ha)NRMSE (%)
2090.651158.8213.200.571283.0314.62
2570.681083.7412.350.681086.6612.38
3210.701042.4311.880.661127.6712.85
Table 3. Statistical indices for the yield prediction models at the state level in Illinois and at the province level in Heilongjiang.
Table 3. Statistical indices for the yield prediction models at the state level in Illinois and at the province level in Heilongjiang.
RegionEODYP ModelYF Model
R2RMSE (kg/ha)NRMSE (%)CCCR2RMSE (kg/ha)NRMSE (%)CCC
IL2090.431785.6219.270.210.182137.8523.07−0.12
2570.87687.687.420.930.991006.6710.860.78
3210.95684.727.390.910.941068.3611.530.74
HE2090.991115.6815.720.590.961008.1314.200.68
2570.99964.8813.590.680.99839.7511.830.79
3210.99664.079.360.870.99816.8211.510.79
IL: Illinois, HE: Heilongjiang.

Share and Cite

MDPI and ACS Style

Ban, H.-Y.; Kim, K.S.; Park, N.-W.; Lee, B.-W. Using MODIS Data to Predict Regional Corn Yields. Remote Sens. 2017, 9, 16. https://doi.org/10.3390/rs9010016

AMA Style

Ban H-Y, Kim KS, Park N-W, Lee B-W. Using MODIS Data to Predict Regional Corn Yields. Remote Sensing. 2017; 9(1):16. https://doi.org/10.3390/rs9010016

Chicago/Turabian Style

Ban, Ho-Young, Kwang Soo Kim, No-Wook Park, and Byun-Woo Lee. 2017. "Using MODIS Data to Predict Regional Corn Yields" Remote Sensing 9, no. 1: 16. https://doi.org/10.3390/rs9010016

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop