Next Article in Journal
Semi-Automated Classification of Lake Ice Cover Using Dual Polarization RADARSAT-2 Imagery
Next Article in Special Issue
Comparison of SIFT Encoded and Deep Learning Features for the Classification and Detection of Esca Disease in Bordeaux Vineyards
Previous Article in Journal
GPVC: Graphics Pipeline-Based Visibility Classification for Texture Reconstruction
Previous Article in Special Issue
Deep Learning with Unsupervised Data Labeling for Weed Detection in Line Crops in UAV Images
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Real-Time Prediction of Crop Yields From MODIS Relative Vegetation Health: A Continent-Wide Analysis of Africa

by
Lillian Kay Petersen
Los Alamos High School, Los Alamos, NM 87544, USA
Remote Sens. 2018, 10(11), 1726; https://doi.org/10.3390/rs10111726
Submission received: 14 August 2018 / Revised: 29 October 2018 / Accepted: 30 October 2018 / Published: 1 November 2018

Abstract

:
Developing countries often have poor monitoring and reporting of weather and crop health, leading to slow responses to droughts and food shortages. Here, I develop satellite analysis methods and software tools to predict crop yields two to four months before the harvest. This method measures relative vegetation health based on pixel-level monthly anomalies of NDVI, EVI and NDWI indices. Because no crop mask, tuning, or subnational ground truth data are required, this method can be applied to any location, crop, or climate, making it ideal for African countries with small fields and poor ground observations. Testing began in Illinois where there is reliable county-level crop data. Correlations were computed between corn, soybean, and sorghum yields and monthly vegetation health anomalies for every county and year. A multivariate regression using every index and month (up to 1600 values) produced a correlation of 0.86 with corn, 0.74 for soybeans, and 0.65 for sorghum, all with p-values less than 10 6 . The high correlations in Illinois show that this model has good forecasting skill for crop yields. Next, the method was applied to every country in Africa for each country’s main crops. Crop production was then predicted for the 2018 harvest and compared to actual production values. Twenty percent of the predictions had less than 2% error, and 40% had less than 5% error. This method is unique because of its simplicity and versatility: it shows that a single user on a laptop computer can produce reasonable real-time estimates of crop yields across an entire continent.

Graphical Abstract

1. Introduction

In the United States and Europe, there is exceptional monitoring and reporting of weather and crop health. With thousands of weather stations and regional crop yield data [1,2], crop yields may be predicted based on historical records [3]. However, not all parts of the world have open, reliable data [4]. The availability of weather and crop data depends on the government’s ability to collect it, financial resources, and willingness of authorities to share it. Lack of data is a particularly important problem in developing countries where crop yields are less stable and droughts can lead to famines, death, government instability, and war.
Recent years have shown an advancement in strategies to obtain better data coverage in developing countries. For example, the World Bank implements national household panel surveys throughout Africa that include agricultural and household information [5]. These detailed surveys offer researchers insights into African agriculture. However, these methods require expensive ground-based operations and remain difficult to scale across a large area. Other difficulties with these surveys include substantial self-reported yield error [6], an extremely low temporal resolution, presence only in select African countries, and a time lag of 1–2 years between collection and public dissemination of the data. Agriculture is one of the backbones of African economies and provides food, income, power, stability, and resilience to rural livelihoods [7]. Agricultural development is widely known to be crucial for poverty reduction and improved health; thus, there remains a major need to monitor crop health in the developing world [8,9].
Crop yields in developing countries do not benefit from the same level of agricultural technology as richer countries. Therefore, these countries have much lower yields. For example, corn yields in the US have doubled since 1970 from 5 to 10 t/ha (metric tonnes/hectare) due to improvements in agricultural technology such as irrigation, pesticides, herbicides, fertilizers, and plant breeding. Worldwide, yields of staple grains have doubled in the same time period due to these same factors [10]. In developing countries, crop yields are both much lower and much more variable than in the US and Europe, both geographically and in time [11]. For example, Ethiopia’s corn yield has increased from 0.9 to 3.5 t/ha since 1960, which is still one-third the corn yield of the US. Farmers in poor countries lack the financial resources and education to use the advanced technology available to the American and European farm industries. Therefore, crop yields in African countries are much more susceptible to the dangers of heat waves and droughts.
Remote sensing has become an asset for detecting environmental changes that impact crop health since initial studies in the 1980s and 1990s [12,13,14,15]. Today, satellite imagery costs less and is more easily accessible, making remote monitoring more broadly available to scientists and the general public. The majority of previous research on crop monitoring is in developed countries where there are many yield and production data at high resolution. Such data significantly improve agricultural research, but are only affordable by wealthier nations. Developed countries typically have large fields and few individual crops, mainly corn, soybeans, wheat, and barley (Figure 1). Because planting is so uniform, research can be specific to certain crops: many research studies use detailed ground-truth crop masks or extensive plant growth data to tune remote sensing methods to best fit those crops and climates [16,17]. Because of these factors, it is also easier to identify crop types remotely and create crop masks in developed countries. For example, Vuolo et al. [18] identified crops from 10 m Sentinel-2 imagery in a case study in Austria.
Crop prediction is significantly more challenging in many African countries due to minimal reporting of crop health and yields; farms consist of very small plots of varied crops interspersed with buildings (Figure 1); and the continent contains a vast number of different climates, growing seasons, and crops. Many small-holder farmers integrate inter-cropping methods, further complicating remote crop identification [19]. Recent GPS studies over four African countries suggest that 25% of the farms in Africa are less than 0.2 hectares and over half are less than 0.5 hectares [20]. This compares to an average farm size of 180 hectares in the US [21]. Researchers developing crop masks find that at the Landsat resolution of 30 m, many African fields are covered by just a few pixels [22].
Accurate crop masks are difficult to construct over Africa, as large numbers of different land cover/land use datasets have been published and are not easily consolidated. Some of the major land use products include: The Globcover map [23]; The MODIS-JRC crop mask (EU Joint Research Centre, Monitoring Agricultural ResourceS (MARS) [24]); The USGS Cropland Use Intensity datasets; USGS land use/land cover (LULC) maps; and the Africover maps from the UN Food and Agriculture Organization (FAO). Vancutsem et al. [25] illustrated the difficulty of synthesizing a reliable and harmonized crop mask that entirely covers the African continent, as each map differs in data source, resolution, methodology, geographical extent, and time interval. Intercropping prevalence, small field sizes with irregular boundaries, heterogeneous landscapes, and frequent changes in land use further heighten the difficulty of creating realistic land cover maps across Africa [26].
To more accurately predict yields at a higher spatial resolution, such as the household or community level, researchers shift to very high resolution imagery [22]. However, high resolution imagery has many downsides, mainly the sheer amount of data and computation required. A continent-wide analysis with high resolution imagery would require massive amounts of computational power, which is expensive and time-consuming. Additionally, very high resolution imagery, such as Planet or Quickbird, is too costly to be obtained over large areas. Many high resolution satellites, such as Sentinel-2 (10–20 m resolution), have only been launched in the last few years. Thus, insufficient temporal duration of satellite observations reduces the accuracy of empirical data-based models. Due to large amount of data, it would be difficult to scale a crop health monitoring system using high-resolution imagery across a continent.
There are multiple alternatives to high resolution imagery, such as analyzing state or national yield composites, as done in this study, or “unmixing” coarse resolution imagery to detect field-level changes. Data from lower resolution satellites are free and provide a substantial historical record, and can achieve high correlations with aggregate state or national crop yields [27,28]. Aggregate crop yields will also be more accurate than municipality-level statistics, since they have less variation geographically and in time. In other studies, coarse resolution imagery has been unmixed to detect sub-pixel changes [29,30]. Researchers have used this process to detect specific crops over a region or to analyze small-holder farms smaller than a pixel. Despite many challenges, there has been relative success with this method in many studies [31], as described in a review by Atzberger in 2013 [26]. The imagery may be unmixed with machine learning algorithms such as random forests, or with more complicated methods such as neural nets.
Many studies have developed methods to monitor droughts in Africa [32,33,34,35] or forecast crop yields for early warning [36]. For example, Mann and Warner [11] used kebele (district) level economic and crop statistics collected by the Ethiopian government to estimate wheat output per hectare. These data would be useful for high-resolution crop predictions, but are not generally available from the Ethiopian government. The lack of collection and free distribution of crop yields and other ground measurements severely hinders accurate predictions of crop health in developing countries.
A couple groups currently publish real-time forecasts of crop health. For example, the Group on Earth Observations Global Agricultural Monitoring Initiative (GEOGLAM) [37] and USDA Famine Early Warning System Network (FEWS NET) [38,39,40] generate advance notice of impending food crises. These systems are comprised of large teams that incorporate data from remote sensing, on-the-ground monitoring, field reports, and agroclimate indicators such as rain, snow, and surface temperatures. These large models require an extensive budget. In contrast to this study, their predictions are simplified into qualitative categories instead of numerical values. In addition to GEOGLAM and FEWS NET, other groups that publish early warning include: The FAO Global Information and Early Warning System (GIEWS) [41]; the JRC’s MARS, which is divided into agricultural production estimates of EU countries (Agri4Cast [42]) and food security assessments in food insecure countries (FoodSec [43]); and the Crop Watch Program at the Institute of Remote Sensing Applications (IRSA) of the Chinese Academy of Sciences (CAS) [44].
This study differs from previous work in the US and Africa because of its simplicity: it shows that a single user with a personal computer can produce reasonable forecasts of crop yields for the whole continent. I compute an overall measure of relative vegetation health compared to the mean on a per-pixel basis over select subregions in every African country, thus evaluating whether dense farming areas can be used as representative samples of larger regions to increase computational efficiency. Unlike many previous studies, it may be applied anywhere in the world—it does not depend on special tuning for the particular crop, region, or climate of interest. Relatively low-resolution pixels of the Moderate Resolution Imaging Spectroradiometer (MODIS) decrease the amount of data that must be processed, making this system cheaper and more efficient. Crop masks are not used in this model to increase simplicity, versatility, and eliminate the complication of small field sizes, intercropping, and imperfect crop masks. The method was created for developing countries where detailed monitoring on the ground simply does not exist, and was successfully validated against extensive crop data in Illinois.
The goal of this study was to see how well crop yields may be predicted using extremely straightforward methods based on simple averages and differences of common indices over dense farming regions and the resulting correlations. The index developed here is similar to Vegetation Condition Index (VCI), which has been widely used in previous studies for drought monitoring [45,46]. More complex models with crop masks and detailed tuning require a substantial staff and several years to develop and validate. This method, developed and tested by the author over the course of a couple months on a laptop computer, can produce reasonable forecasts of crop yields for the whole continent.
This paper is organized as follows. Methods are presented in Section 2 and results in Section 3. First, the results from the analysis in Illinois are explained, followed by the analysis in Africa, and ending with the 2018 harvest predictions and their associated errors. The conclusions (Section 4) discuss the quality and limitations of this method compared to previous crop prediction systems.

2. Methods

The primary goal of this research is to create a predictive measure of crop yields computed from satellite data. Python code was written to obtain satellite images, mask out clouds, calculate vegetation and water indices (VI), compute monthly VI anomalies since 2000, and correlate the anomalies with crop yield anomalies for every county in Illinois, which served as a proof of concept due to large amounts of ground truth data in the US. The same method was then applied to every country in Africa to create an early indicator of crop yields. The overall workflow is diagrammed in Figure 2 and is described in detail in this section.
Moderate Resolution Imaging Spectroradiometer (MODIS) imagery was obtained from the Descartes Labs Satellite Platform (Figure 3a,b). MODIS, hosted on the satellites Aqua and Terra, has a revisit time of one day, giving almost continuous imagery across the entire earth since 2000. I interacted with the Descartes Labs Satellite Platform through a python console on a laptop computer. Data from MODIS have a nominal resolution of 250 m at the nadir of each swath. In the Descartes Platform, MOD09 Aqua and Terra surface reflectance data points with associated coordinates are interpolated onto a grid in the form of an image [47]. The python code then sends a request to the platform to retrieve a band over a certain area and satellite pass (e.g., green band over an Illinois county for 1 January 2017). The MODIS data were obtained in early 2018. The rest of the analysis was done in the code, which is posted on Github [48].
Clouds, snow and poor atmospheric conditions in images can disrupt data and distort values. To account for cloud contamination, the standard MODIS cloud mask was retrieved from the Descartes Platform. Pixels with clouds or snow were not included in monthly averages, and images with over 80% clouds were removed altogether (Figure 3c).
To measure the health of crops throughout the growing season, three VIs were computed: Normalized Difference Vegetation Index (NDVI), Enhanced Vegetation Index (EVI), and Normalized Difference Water Index (NDWI) [49] (Table 1). All three indices have served as crop monitoring tools in previous studies, and have been shown to resemble actual crop conditions [37,50,51]. The indices range from −1 to 1. Areas containing dense vegetation show high NDVI and EVI values (between 0.4 and 0.8), desert sands will register at about zero, and snow and clouds are negative. All of these VIs are sensitive to the same biophysical variables (LAI, leaf angle, soil brightness, chlorophyll content, etc.). Although the indices are similar, it is valuable to examine all three, as they may perform differently under various environmental conditions [52,53,54].
For every pixel in Illinois, the VI monthly averages and climatologies were computed. The process begins with daily cloud-masked MODIS swaths (Figure 3). The monthly average of a vegetative index (NDVI, EVI, and NDWI) is written as VI i , m , y , where the subscripts are pixel, month, and year indices. Monthly averaging was chosen for simplicity. VI metrics are defined as follows.
VI climatology per pixel : VI ¯ i , m y = 1 N y y = 1 N y VI i , m , y
VI anomaly per pixel : VIA i , m , y = VI i , m , y VI ¯ i , m y
VI anomaly : VIA ¯ m , y i = 1 N i i = 1 N i VIA i , m , y
VI average : VI ¯ m , y i = 1 N i i = 1 N i VI i , m , y
where ( · ) ¯ y indicates an average over years, ( · ) ¯ i is a spatial average over pixels, and VIA is the vegetative index anomaly. For Illinois, the climatology is defined as the average VI over 2000–2016 for each month and pixel (Equation (1)). Next, the monthly climatology is subtracted from the monthly average for every pixel, resulting in the monthly anomaly (Equation (2)). The pixels in each county are averaged together to find the county-wide monthly anomaly (Equation (3)) and county-wide monthly average (Equation (4)). Figure 4 shows examples of the NDVI monthly average and climatology for Pike County, Illinois. The difference between a dry year and a wet year can be clearly seen in comparison to the climatology.
Illinois was chosen as a test site because the land is mostly agricultural and can provide a clear signal of crop health. Illinois also has very little irrigation: most counties irrigate less than 1% of their fields [55]. Similarly, 90% of staple food production in sub-Saharan Africa comes from rain-fed farming systems [56].
Annual crop yield data was downloaded for every county in Illinois for 2000–2016 for three crops, corn, soybeans, and sorghum, from USDA county estimate reports available online through Quickstats [1]. These crops were chosen because they are three of the largest food crops in Illinois with 4.5 million, 4.3 million, and 7.3 thousands hectares planted, respectively [57,58,59]. Because each county has different growing conditions (soil quality, hills, proximity to large water bodies, etc.), the mean was subtracted out of each county’s crop yield to find the yield anomaly,
Yield   Anom i , y = Yield i , y Yield ¯ i y
so that comparisons could be made across all counties. Correlations were found between each county’s yield anomaly and the three VIs for five months, May–September. To find the highest possible correlation amongst these variables and months, a multivariate regression was fit to each month and index for a total of 15 variables (5 months × 3 VIs).
To test the predictive ability of the model, the data were split into a training group of 90% and a testing group of the remaining 10%. The multivariate regression was then fit to the training data and asked to predict the testing set. To ensure randomness, this process was repeated ten times for each crop, and the analysis is the composite of these ten prediction sets.
After testing in Illinois was complete, the method was applied to three countries in Africa: Ethiopia, Tunisia, and Morocco. These countries were used as initial case studies because they have a recent history of relative agricultural and political stability and offer a range of climates and crops. In each country, the two to four highest-producing crops were analyzed. African crop yields were downloaded from Index Mundi, a comprehensive data portal with country-level statistics compiled from multiple sources, but the production data were originally collected by the USDA Foreign Agricultural Service (FAS) [60].
In each country, a box was analyzed over a dense farming region, which served as a representative sample of the entire country. The VI anomalies and averages from these regions were then correlated to national crop production data [60]. Subsections in each country were positioned over areas with the highest local production, which was obtained from the Spatial Production Allocation Model (MAPSPAM) [61]. Sample areas were selected rather than the entire country to limit the amount of data required. A continent-wide analysis would require significant data transfer and computational power, which is expensive and time consuming. Even with subnational sample areas, the MODIS imagery over Africa totaled to 10 terabytes of data. Imagery is only analyzed over dense farming regions to increase simplicity and decrease the amount of data. The full coordinates of every box can be found at [62].
The daily MODIS imagery over the selected boxes in each country was processed in a similar way to Illinois. First, the bands were retrieved from the Descartes Platform. NDVI, EVI, and NDWI were computed, and cloudy pixels were masked out. The climatology for each pixel was subtracted to obtain monthly anomalies as well as averages of all three indices, resulting six variables for correlation analysis: NDVI average, NDVI anomaly, EVI average, EVI anomaly, NDWI average, and NDWI anomaly (Equations (1)–(4)). Next, correlations were computed between the six indices of the month at the height of the growing season and the crop production. The height of the growing season is defined as the month in the growing season that the NDVI average peaks.
After initial successes in Ethiopia, Tunisia, and Morocco, the method was expanded to every African country with the exceptions of Western Sahara, Equatorial Guinea, and Gabon due to lack of crops or constant cloud cover. Satellite data were restricted in this study to 2013–2018 based on the limited download and compute time that is available to a typical home user on a modern-day laptop. The satellite imagery processed in Africa totaled 10 terabytes even with only five years of data. Future production was then predicted for every African country with a harvest between December 2017 (e.g., Ethiopia) and June 2018 (e.g., Namibia). Harvest dates were obtained from the FAO’s GIEWS [63]. Later, once actual production values were published, the error of the predictions in every country and crop was computed. The error is defined as
Error j = 100 P j A j A j
where P is the predicted yield, A is the reported yield, and j is an index over each county or region.

3. Results

The method was first validated in Illinois and then applied in Africa.

3.1. Illinois

Correlations were computed in Illinois between the anomalies of NDVI, EVI, and NDWI, and three crops: corn, soybeans, and sorghum; all were found to have high correlations. The method was first tested with state-wide averages to show that results are significant when analyzing a large area. The correlations between state-wide corn yield and NDVI, EVI, and NDWI anomalies are extremely statistically significant at 0.90 , 0.85 , and 0.92 respectively (Figure 5). It was found that NDVI and EVI both have positive relationships to crop yields, while NDWI is inversely related. This is because the NDWI formulation includes a negative NIR, while NDVI and EVI have positive NIR values.
The central United States was hit by a drought in 2012. During that year, Illinois had lower than average crop yields and a negative NDVI anomaly. Crop yields and NDVI anomalies were significantly higher in 2014, a wet year. These two years are used as examples to show corn yield and satellite anomalies at the county level (Figure 6).
Next, the relationships were examined at a higher resolution. The corn, soybean, and sorghum county yield data were plotted against VI anomalies for every month (May–September), county, and year (2000–2017), for a total of about 1600 data points. August was found to have the highest correlation to all three crops, while July was just slightly lower (Figure 7). Since crops are harvested in October [64], there is a two to three month lead time on yield estimates. Corn had the strongest relation to the VIs with correlations of 0.7 , 0.71 , and 0.73 for EVI, NDVI, and NDWI, respectively. Soybeans and sorghum had similar correlations to indices, both ranging from 0.53 to 0.58 . To see all of the correlations in more detail, refer to Figure 8. All of July’s and August’s correlations had a p-value less than 10 6 [65], meaning there is less than one in a million chance of them occurring through a random process.
Correlations for each crop have been computed with three indices (NDVI, EVI, and NDWI) and five months, for a total of fifteen independent variables. To create a single predictive measure of crop yields, a multivariate regression was fit to every index and every month using a Python machine learning library. Figure 9 shows an example of the multivariate regression for two of the variables and corn yield. The multivariate regression improved the individual correlations for all three crops to 0.86, 0.74, and 0.65, respectively (solid horizontal lines in Figure 7).
To test the predictive power of the model, the multivariate regression was trained on a random 90% of the data and then predicted the remaining 10%. This process was repeated ten times. The median errors of the predicted yields are 0.56 t/ha (5.7%), 0.18 t/ha (5.8%), and 0.38 t/ha (22%) for corn, soybeans, and sorghum, respectively (Figure 10). The model could predict the yield with reasonable error based on only the VI anomalies of the entire county, demonstrating how this simple method is a good indicator of crop yields.
The error for sorghum is likely higher because it covers a small portion of the state, and no crop mask was used to distinguish the pixels. Corn, soybeans, and sorghum are planted on 4.5 million, 4.3 million, and 7.3 thousands hectares in Illinois, respectively [57,58,59]. While corn and soybeans each cover about a third of the total land in Illinois, sorghum only covers 0.05%. Sorghum fields are therefore a minority of the satellite imagery processed over Illinois. Sorghum in Illinois serves as a proof of concept that a crop can be moderately well predicted even if it only covers a small portion of land.

3.2. Africa

The high correlations in Illinois show that this model has good forecasting skill for crop yields. Next, this method was applied to three countries in Africa: Ethiopia, Morocco, and Tunisia. For each country, a box within a major crop-growing region was analyzed (Figure 11 and Figure 12a). Since the empirical model proposed here uses imagery over the subregion to predict production of the entire country, it implicitly assumes that the vegetation conditions inside the box correspond with those outside the box. This assumption appears to hold true, as correlations over Africa are reasonably high.
Crop estimation in developing countries is vastly different than Illinois and the developed world. The greatest distinctions include the heterogeneity of the landscape, lack of agricultural technology, the spatial size of crop reports, and the accuracy of reported values. In Illinois, the ground is covered with large fields that grow a small number of crops: mostly corn and soybeans. In Africa, the landscape is highly diverse, with small family-owned farms intermixed with villages, lakes, mountains, and forests (Figure 1). These farms, usually much smaller than a hectare, lack much of agricultural technology found in the US, such as pesticides, herbicides, and fertilizers. This makes crops yields much more variable in Africa, both seasonally and spatially. Crop production statistics in Africa are typically only published as a national total. Very rarely are yield or production values reported at the municipality or even state levels. For this reason, crop production was predicted for each country rather than at finer spatial scales, as in the US.
In most regions in Africa, there are wet and dry seasons. For example, the wet season in Ethiopia spans from June to September, and crops are harvested in December. This is known as the Meher growing season. Ethiopia’s core agriculture and food economy is comprised of five major cereals: corn, teff, wheat, sorghum, and barley. These cereals accounted for about three-quarters of total area cultivated and 29 percent of the agricultural GDP during 2005–2006 [66].
The wet and dry seasons are evident in the monthly NDVI values for all three countries (Figure 13). During the wet season, the crops green and the NDVI values spike. During the harvest, the VIs drop. The crops with the highest production in each country were evaluated for this study. Table A1 in the Appendix A shows the crops examined in each country and the correlation with each satellite index. It was found that Ethiopia and Morocco have the best correlation to the maximum NDVI value of the growing season, while Tunisia has the highest correlations to NDWI.
There was a major drought in Ethiopia in 2015, while 2013 was a very wet year. The vegetation differences can be seen on the pixel level (Figure 12). The anomalies are especially evident in the Rift Valley where farming is most dense.
Ethiopia’s maximum NDVI values, which usually occur in August, are extremely well correlated with grain production at 0.98 and 0.99 for corn and sorghum, respectively (Figure 13a and Figure 14a). That is a near-perfect correlation between the crop production harvested in December and satellite imagery four months earlier. Tunisia and Morocco have correlations of 0.97 and 0.67 to wheat production (Figure 14b,c), showing high predictive skill of satellite indices in all three countries.

3.3. Africa: Prediction of Future Crop Production

After the initial success in Ethiopia, Tunisia, and Morocco, the method was expanded to every African country. First, a box in an agricultural region was selected in each country and a total of 10 terabytes of daily satellite imagery was processed according to the method above. Correlations and linear regressions were computed in every country for their 2–4 highest producing crops. Difficulties in finding accurate correlations include:
  • false reporting of production in some countries due to lack of resources, poor oversight, or corruption (e.g., DR Congo, Eritrea, Libya). In severe cases, one could simply use the NDVI anomaly as a proxy for production rather than computing a correlation with reported crop yields, which is commonly done by organizations such as JRC MARS [41];
  • multiple growing seasons in specific central countries (Rwanda, Somalia);
  • poor quality of earth observation data (e.g., clouds) every day for months at a time in central African countries (Gabon, Cameroon) [31]; and
  • time delays and misclassification of harvests during October–December, where production is incorrectly reported in the following calendar year (Nigeria, Sudan).
In each African country, correlations were computed between every crop and six indices (NDVI, EVI, NDWI, monthly averages and anomalies). A full listing of all correlations can be found in Table A1 in the Appendix A. Next, the historical regressions were used to predict crop production for 2018 harvests. Every country that reported productions for their 2018 harvest before the publication of this article was examined. This includes harvests ranging from December 2017 (e.g., Ethiopia) through June 2018 (e.g., Namibia), and included a total of 21 countries, about half of Africa. In April 2018, VI anomalies and crop predictions were posted on a publicly viewable interactive map [67], and the actual production values were added as they became available in mid to late 2018 (Figure 15).
In Ethiopia, the model predicted the 2018 harvests to yield 7055 giga-tonnes (GT) of corn and 4174 GT of sorghum. The actual production was 7100 GT and 4100 GT, respectively, for an error of 0.6% and 1.8%. These minimal errors show how this simple model can predict yields very accurately, even with only a few years of historical relationships.
Small errors in predictions were common across Africa. The histogram in Figure 16 displays the percent error for every country, crop, and index. The median error was 8.6%. Twenty-one percent of the predictions had a relative error below 2%, and 40% had errors below 5%.
To further examine why some predictions are better than others, the errors were plotted by five groupings: vegetative index, crop, country, latitude, and production anomaly (Figure 17). These categories highlight the factors that contribute to higher errors. For example, cotton, wheat, and sorghum are much harder to predict than millet, sugar, and rice, and extreme years had more error than normal years.
One of the countries with a very high error was Botswana. Botswana’s production of corn and sorghum is very low at only an average of 14 GT, as opposed to Ethiopia’s 4000 GT. In addition, they had a very bad drought year in 2018. With the combination of low production values and a severe drought, the linear regression predicted a negative production. This example displays a drawback of a linear model: In real life, the relationship flattens as yields approach zero, as production cannot actually be negative. However, negative predictions, although not accurate, would still signal alarm in an operational forecast system. In retrospect, flagging Botswana as high risk would have been justified this past year, as they did end up with very low crop production.

4. Conclusions

In this research, I developed a method to predict crop yields 2–4 months before the harvest, based on daily MODIS satellite imagery. The model was first validated in Illinois where there is county-level crop yield data by computing the linear fit between yields and VIs. When a split-sample validation was applied to a multivariate regression with all months of the growing season and all three VIs, the model could predict the crop yields within 5.7%, 5.8%, and 22% for corn, soybeans, and sorghum, respectively. Next, the method was applied to three countries in Africa (Ethiopia, Tunisia, and Morocco), all with different climates and crops. High correlations were found between maximum satellite indices and crop production in all three countries, where sorghum in Ethiopia was the highest at 0.99. After this success, satellite imagery was analyzed in every African country, and productions for the 2018 harvests were predicted 2–4 months before the harvest. Once 2018 harvests were published, the prediction accuracy was tested against the reported values. Forty percent of the predictions were found to have less than a 5% error.
The main objective of this study was to show how a very simple method can serve as an early warning system to predict crop yields in every African country. This method relies solely on NDVI, EVI, and NDWI anomalies calculated over specific subsections of the countries, without the use of crop masks, subnational yield statistics, or special tuning for location or climate. Even with these many simplifications, the model was still able to produce predictions with reasonable error over Illinois and throughout Africa.
The range of prediction errors may be analyzed to better understand the strengths and weaknesses of this model. EVI was able to predict yields most accurately, with NDVI and NDWI following closely (Figure 17a). The averages and anomalies performed similarly. The differences between the indices are within a couple percent, indicating that NDVI, EVI, and NDWI all serve as indirect measures of crop health.
The prediction accuracy for different crops varies substantially. Some crops are harder to predict, as each crop correlates to the VIs with different strengths. Some crops may also be affected by extreme weather late in the season, which this model does not include since it predicts yields from the height during the growing season. Millet, sugar, and rice had the lowest errors, while cotton, wheat, and sorghum were much harder to predict.
Latitude had little influence on prediction errors, indicating that this model can perform in a wide variety of climates (Figure 17d). The southern-most countries performed slightly worse, but these values are likely skewed by Botswana and South Africa, whose large errors are discussed in the previous section and below.
An important part of early warning systems is being able to accurately predict both average and extremely high and low yields. In Figure 17e, it can be seen that extremely good or bad harvests often have greater errors than average harvests. The linear regressions were only trained on five years of data, and were unlikely to experience extreme yields in this short amount of time, meaning the model lacks training data on the tails. To improve the accuracy of the extremes, the model could be trained on more years. However, it is encouraging that the errors are only 15% beyond ±1 std dev, considering the short training period.
A limitation of this model is that it relies on published yield data, so it will not predict as reliably in countries that lack reporting accuracy. In these places, the NDVI anomaly could be used as a proxy for relative crop yields compared to a mean. The model also only predicts yields at the national level and has no subnational component. However, it has the ability to predict yields sub-nationally in the future when sub-national crop data are supplied.
In this study, country-wide crop production was correlated to VI anomalies over dense farming regions to test if small areas could serve as representative samples of the entire country. In most countries, the subregions only covered between 1% and 15% of the total land, depending on the size of the country and box. Despite these small areas, the model produced surprisingly high correlations between the VIs and crop production. South Africa is an exception, with low correlations and high errors in the predictions. South Africa has farms across the country, so the selected box was not able to represent the entire area. In many other African countries, one region is a primary producer and can be used to predict country-wide production.
The model developed here may be compared to the existing early-warning systems of GEOGLAM and FEWS NET. Both are run under large budgets by an extensive team of people with partnerships around the globe. Their systems include local surveyors, remotely sensed data, agroclimate indicators, field reports, and communications with national and regional experts. In contrast, this method can be run by a single user on a modern laptop computer. It was developed over the course of a couple months, and is practically free. This model is also able to predict a numerical value of crop production, while GEOGLAM and FEWS NET present their results as a qualitative measure: conditions are compacted into five categories of crop conditions or food insecurity phases.
The power of the method developed here is that it can be applied to any crop, location, or climate to produce reasonable real-time forecasts of crop yields. It is unique because of its versatility and easy to apply due to its simplicity.

Funding

This research received no external funding.

Acknowledgments

LP is grateful for mentorship from Daniela Moody and Rick Chartrand of Descartes Labs, who provided valuable guidance in satellite data retrieval and analysis through the Descartes Labs mentorship program. She also thanks two anonymous reviewers and the Section Editor for their thoughtful feedback, which led to substantial improvements in this manuscript.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A. Summary of Results

Table A1. The correlations, predictions, and errors for select African countries for every crop and satellite index. All the countries that had harvests between December 2017 and June 2018 are displayed. Forty percent of the predictions had an error of less than 5% from the actual production. The fourth column is correlation between the index and reported crop production from 2013 to 2017.
Table A1. The correlations, predictions, and errors for select African countries for every crop and satellite index. All the countries that had harvests between December 2017 and June 2018 are displayed. Forty percent of the predictions had an error of less than 5% from the actual production. The fourth column is correlation between the index and reported crop production from 2013 to 2017.
CountryCropIndexCorrelation2018 Predicted (GT)2018 Actual (GT)% Error
BotswanaCornNDVI Avg0.98441061.7
BotswanaCornEVI Avg0.98931069.1
BotswanaCornNDWI Avg−0.879010104.9
BotswanaCornNDVI Anom0.893010100.0
BotswanaCornEVI Anom0.783010100.0
BotswanaCornNDWI Anom−0.661010100.0
BotswanaSorghumNDVI Avg0.8135833.7
BotswanaSorghumEVI Avg0.8964848.2
BotswanaSorghumNDWI Avg−0.6643859.0
BotswanaSorghumNDVI Anom0.71208100.0
BotswanaSorghumEVI Anom0.69108100.0
BotswanaSorghumNDWI Anom−0.44608100.0
BurundiCoffeeNDVI Avg0.39817220014.1
BurundiCoffeeEVI Avg−0.51222620012.8
BurundiCoffeeNDWI Avg0.7752022001.2
BurundiCoffeeNDVI Anom−0.81926720033.5
BurundiCoffeeEVI Anom0.0802022001.0
BurundiCoffeeNDWI Anom0.9192102004.8
BurundiCornNDVI Avg−0.78017015013.2
BurundiCornEVI Avg0.1161441503.9
BurundiCornNDWI Avg0.0831461502.7
BurundiCornNDVI Anom−0.4601591506.2
BurundiCornEVI Anom0.6731391507.0
BurundiCornNDWI Anom0.4041471502.1
BurundiSorghumNDVI Avg−0.59436353.8
BurundiSorghumEVI Avg−0.069303514.3
BurundiSorghumNDWI Avg−0.033303515.4
BurundiSorghumNDVI Anom−0.50835350.2
BurundiSorghumEVI Anom0.703273522.6
BurundiSorghumNDWI Anom0.296303514.7
ChadCornNDVI Avg0.5334574501.5
ChadCornEVI Avg0.7794374502.8
ChadCornNDWI Avg−0.03739845011.6
ChadCornNDVI Anom0.73828445037.0
ChadCornEVI Anom0.78027545038.8
ChadCornNDWI Anom−0.71325445043.6
ChadMilletNDVI Avg0.4167057000.8
ChadMilletEVI Avg0.3156807002.9
ChadMilletNDWI Avg−0.2976907001.4
ChadMilletNDVI Anom−0.3077037000.5
ChadMilletEVI Anom−0.2416967000.6
ChadMilletNDWI Anom0.2617087001.1
ChadRiceNDVI Avg0.0871611544.8
ChadRiceEVI Avg−0.2511551540.7
ChadRiceNDWI Avg−0.41417015410.2
ChadRiceNDVI Anom−0.91919515426.5
ChadRiceEVI Anom−0.88219415426.0
ChadRiceNDWI Anom0.80020015429.9
ChadSorghumNDVI Avg0.562143395050.8
ChadSorghumEVI Avg0.735134095041.0
ChadSorghumNDWI Avg−0.314130795037.5
ChadSorghumNDVI Anom0.84271495024.9
ChadSorghumEVI Anom0.85469995026.5
ChadSorghumNDWI Anom−0.95948095049.5
DjiboutiCerealsNDVI Avg−0.20619,18519,0790.6
DjiboutiCerealsEVI Avg−0.62218,92919,0790.8
DjiboutiCerealsNDWI Avg−0.56819,03619,0790.2
DjiboutiCerealsNDVI Anom−0.64718,97219,0790.6
DjiboutiCerealsEVI Anom−0.66818,92619,0790.8
DjiboutiCerealsNDWI Anom0.62419,05619,0790.1
DR CongoCoffeeNDVI Avg−0.4542352206.7
DR CongoCoffeeEVI Avg−0.4532282203.8
DR CongoCoffeeNDWI Avg0.4472322205.5
DR CongoCoffeeNDVI Anom−0.1882302204.7
DR CongoCoffeeEVI Anom−0.4782012208.4
DR CongoCoffeeNDWI Anom0.3302312205.2
DR CongoCornNDVI Avg0.654117512002.1
DR CongoCornEVI Avg0.652119012000.9
DR CongoCornNDWI Avg−0.522118312001.4
DR CongoCornNDVI Anom0.325118412001.3
DR CongoCornEVI Anom0.348122112001.8
DR CongoCornNDWI Anom−0.434118312001.4
EritreaBarleyNDVI Avg−0.19163653.6
EritreaBarleyEVI Avg−0.25264652.0
EritreaBarleyNDWI Avg0.14262654.1
EritreaBarleyNDVI Anom−0.19763653.6
EritreaBarleyEVI Anom−0.25564652.1
EritreaBarleyNDWI Anom0.14962654.1
EritreaMilletNDVI Avg0.19125251.3
EritreaMilletEVI Avg0.25225250.7
EritreaMilletNDWI Avg−0.14225251.4
EritreaMilletNDVI Anom0.19725251.3
EritreaMilletEVI Anom0.25525250.7
EritreaMilletNDWI Anom−0.14925251.4
EthiopiaCornNDVI Avg0.979705571000.6
EthiopiaCornEVI Avg0.972715771000.8
EthiopiaCornNDWI Avg−0.983704571000.8
EthiopiaCornNDVI Anom0.8126294710011.4
EthiopiaCornEVI Anom0.263680571004.2
EthiopiaCornNDWI Anom−0.8456160710013.2
EthiopiaSorghumNDVI Avg0.987417441001.8
EthiopiaSorghumEVI Avg0.980425741003.8
EthiopiaSorghumNDWI Avg−0.974416141001.5
EthiopiaSorghumNDVI Anom0.8833524410014.1
EthiopiaSorghumEVI Anom0.402400541002.3
EthiopiaSorghumNDWI Anom−0.9093411410016.8
Guinea-BissauRiceNDVI Avg0.897100991.2
Guinea-BissauRiceEVI Avg0.5781479948.9
Guinea-BissauRiceNDWI Avg−0.91991997.9
Guinea-BissauRiceNDVI Anom0.950889911.0
Guinea-BissauRiceEVI Anom−0.446103993.9
Guinea-BissauRiceNDWI Anom−0.943859913.8
Guinea-BissauSorghumNDVI Avg0.987162018.9
Guinea-BissauSorghumEVI Avg0.521222012.3
Guinea-BissauSorghumNDWI Avg−0.992152025.9
Guinea-BissauSorghumNDVI Anom0.973142027.6
Guinea-BissauSorghumEVI Anom−0.710172017.0
Guinea-BissauSorghumNDWI Anom−0.989142030.0
LesothoCornNDVI Avg0.61112510024.7
LesothoCornEVI Avg0.63612110020.7
LesothoCornNDWI Avg−0.6521061005.9
LesothoCornNDVI Anom0.5351011001.1
LesothoCornEVI Anom0.534991001.2
LesothoCornNDWI Anom−0.6607210028.0
LesothoWheatNDVI Avg0.71312120.3
LesothoWheatEVI Avg0.82912121.0
LesothoWheatNDWI Avg−0.619101213.2
LesothoWheatNDVI Anom0.667101214.8
LesothoWheatEVI Anom0.733101216.1
LesothoWheatNDWI Anom−0.64681230.8
LibyaBarleyNDVI Avg0.4541001000.0
LibyaBarleyEVI Avg0.7361001000.1
LibyaBarleyNDWI Avg−0.5381001000.3
LibyaBarleyNDVI Anom0.6241001000.1
LibyaBarleyEVI Anom0.7411001000.1
LibyaBarleyNDWI Anom−0.6181001000.4
LibyaOlive OilNDVI Avg−0.86017185.7
LibyaOlive OilEVI Avg−0.85017184.7
LibyaOlive OilNDWI Avg0.79917183.7
LibyaOlive OilNDVI Anom−0.79117184.9
LibyaOlive OilEVI Anom−0.85917184.8
LibyaOlive OilNDWI Anom0.77617183.4
MadagascarCoffeeNDVI Avg0.24439230030.9
MadagascarCoffeeEVI Avg0.24140330034.5
MadagascarCoffeeNDWI Avg−0.46741230037.6
MadagascarCoffeeNDVI Anom−0.16046530055.1
MadagascarCoffeeEVI Anom0.04641630038.9
MadagascarCoffeeNDWI Anom−0.21640630035.5
MadagascarCornNDVI Avg−0.18034730015.8
MadagascarCornEVI Avg0.17533930013.0
MadagascarCornNDWI Avg0.03634230014.2
MadagascarCornNDVI Anom−0.67837830026.2
MadagascarCornEVI Anom0.00434230014.1
MadagascarCornNDWI Anom0.39334930016.5
MadagascarRiceNDVI Avg0.622223423043.0
MadagascarRiceEVI Avg0.730225723042.0
MadagascarRiceNDWI Avg−0.652232223040.8
MadagascarRiceNDVI Anom0.030233523041.4
MadagascarRiceEVI Anom0.579220623044.2
MadagascarRiceNDWI Anom−0.176232523040.9
MadagascarSugarNDVI Avg0.64790900.5
MadagascarSugarEVI Avg0.44793903.8
MadagascarSugarNDWI Avg−0.83694905.1
MadagascarSugarNDVI Anom0.26291901.9
MadagascarSugarEVI Anom0.26593903.3
MadagascarSugarNDWI Anom−0.58792903.1
MalawiCornNDVI Avg−0.2063309300010.3
MalawiCornEVI Avg0.8482362300021.3
MalawiCornNDWI Avg0.496327530009.2
MalawiCornNDVI Anom−0.2063309300010.3
MalawiCornEVI Anom0.8482363300021.2
MalawiCornNDWI Anom0.496327530009.2
MalawiCottonNDVI Avg0.0601219034.6
MalawiCottonEVI Avg0.47481909.5
MalawiCottonNDWI Avg0.1071229035.6
MalawiCottonNDVI Anom0.0611219034.6
MalawiCottonEVI Anom0.47481909.5
MalawiCottonNDWI Anom0.1071229035.6
MalawiPeanut OilseedNDVI Avg0.39529232510.1
MalawiPeanut OilseedEVI Avg−0.0282993257.8
MalawiPeanut OilseedNDWI Avg−0.3752973258.5
MalawiPeanut OilseedNDVI Anom0.39529232510.1
MalawiPeanut OilseedEVI Anom−0.0282993257.8
MalawiPeanut OilseedNDWI Anom−0.3752973258.5
MoroccoBarleyNDVI Avg0.5242014250019.5
MoroccoBarleyEVI Avg0.473249625000.1
MoroccoBarleyNDWI Avg−0.4942051250018.0
MoroccoBarleyNDVI Anom0.5041851250026.0
MoroccoBarleyEVI Anom0.534243025002.8
MoroccoBarleyNDWI Anom−0.4711894250024.2
MoroccoWheatNDVI Avg0.6695666820030.9
MoroccoWheatEVI Avg0.6236879820016.1
MoroccoWheatNDWI Avg−0.6425757820029.8
MoroccoWheatNDVI Anom0.6415270820035.7
MoroccoWheatEVI Anom0.6676666820018.7
MoroccoWheatNDWI Anom−0.6065371820034.5
NamibiaCornNDVI Avg0.643505814.4
NamibiaCornEVI Avg0.642485818.0
NamibiaCornNDWI Avg−0.581485818.0
NamibiaCornNDVI Anom0.80854586.6
NamibiaCornEVI Anom0.81156583.9
NamibiaCornNDWI Anom−0.727485816.8
NigeriaCornNDVI Avg−0.890962811,00012.5
NigeriaCornEVI Avg0.27611,44011,0004.0
NigeriaCornNDWI Avg0.983949811,00013.7
NigeriaCornNDVI Anom0.16010,17811,0007.5
NigeriaCornEVI Anom0.14710,35511,0005.9
NigeriaCornNDWI Anom0.21710,36311,0005.8
NigeriaRiceNDVI Avg0.939393237804.0
NigeriaRiceEVI Avg−0.226370437802.0
NigeriaRiceNDWI Avg−0.961394137804.3
NigeriaRiceNDVI Anom−0.343388537802.8
NigeriaRiceEVI Anom−0.365383537801.5
NigeriaRiceNDWI Anom−0.175382437801.2
NigeriaSorghumNDVI Avg−0.9305708680016.1
NigeriaSorghumEVI Avg0.3977866680015.7
NigeriaSorghumNDWI Avg0.9505647680017.0
NigeriaSorghumNDVI Anom0.095633968006.8
NigeriaSorghumEVI Anom0.132642568005.5
NigeriaSorghumNDWI Anom0.341641168005.7
RwandaCoffeeNDVI Avg−0.5392542501.5
RwandaCoffeeEVI Avg0.7432532501.3
RwandaCoffeeNDWI Avg−0.76231125024.3
RwandaCoffeeNDVI Anom0.99122325010.9
RwandaCoffeeEVI Anom0.7442532501.3
RwandaCoffeeNDWI Anom0.0632542501.7
RwandaCornNDVI Avg−0.44057640043.9
RwandaCornEVI Avg0.42555640039.1
RwandaCornNDWI Avg−0.41157440043.5
RwandaCornNDVI Anom−0.47355640039.0
RwandaCornEVI Anom0.42455640039.1
RwandaCornNDWI Anom0.95155140037.8
RwandaSorghumNDVI Avg−0.2261461451.0
RwandaSorghumEVI Avg−0.5411441450.6
RwandaSorghumNDWI Avg0.5211421452.4
RwandaSorghumNDVI Anom0.3181431451.2
RwandaSorghumEVI Anom−0.5411441450.6
RwandaSorghumNDWI Anom−0.9811451450.2
SomaliaCornNDVI Avg0.9101041003.6
SomaliaCornEVI Avg0.2431051004.6
SomaliaCornNDWI Avg−0.36512110021.0
SomaliaCornNDVI Anom0.3851031003.5
SomaliaCornEVI Anom0.2431051004.6
SomaliaCornNDWI Anom−0.4201031003.3
SomaliaSorghumNDVI Avg0.43211613010.7
SomaliaSorghumEVI Avg0.1961221306.0
SomaliaSorghumNDWI Avg−0.12814813014.2
SomaliaSorghumNDVI Anom−0.47419013045.9
SomaliaSorghumEVI Anom0.1951221306.0
SomaliaSorghumNDWI Anom−0.1151231305.2
South AfricaCornNDVI Avg−0.592739213,50045.2
South AfricaCornEVI Avg−0.673568413,50057.9
South AfricaCornNDWI Avg0.651747113,50044.7
South AfricaCornNDVI Anom−0.848234313,50082.6
South AfricaCornEVI Anom−0.859196713,50085.4
South AfricaCornNDWI Anom0.902300613,50077.7
South AfricaSugarNDVI Avg0.366228322003.8
South AfricaSugarEVI Avg0.4672439220010.9
South AfricaSugarNDWI Avg−0.430229722004.4
South AfricaSugarNDVI Anom0.343238422008.4
South AfricaSugarEVI Anom0.292233322006.1
South AfricaSugarNDWI Anom−0.396239122008.7
South AfricaWheatNDVI Avg−0.7461383180023.1
South AfricaWheatEVI Avg−0.7781310180027.2
South AfricaWheatNDWI Avg0.7751405180021.9
South AfricaWheatNDVI Anom−0.9841114180038.1
South AfricaWheatEVI Anom−0.9981091180039.4
South AfricaWheatNDWI Anom0.9951179180034.5
SudanCottonNDVI Avg−0.84917850064.5
SudanCottonEVI Avg−0.77616250067.5
SudanCottonNDWI Avg0.97119350061.3
SudanCottonNDVI Anom−0.74817250065.7
SudanCottonEVI Anom−0.75416150067.8
SudanCottonNDWI Anom0.80417550065.0
SudanMilletNDVI Avg−0.565101810001.8
SudanMilletEVI Avg−0.515835100016.5
SudanMilletNDWI Avg0.4781150100015.0
SudanMilletNDVI Anom−0.54894110005.9
SudanMilletEVI Anom−0.510820100018.0
SudanMilletNDWI Anom0.45598910001.1
SudanSorghumNDVI Avg−0.8754945400023.6
SudanSorghumEVI Avg−0.837380240004.9
SudanSorghumNDWI Avg0.7985811400045.3
SudanSorghumNDVI Anom−0.8514482400012.1
SudanSorghumEVI Anom−0.830370440007.4
SudanSorghumNDWI Anom0.8044773400019.3
SudanSugarNDVI Avg0.3406827002.6
SudanSugarEVI Avg0.4576907001.5
SudanSugarNDWI Avg−0.0256827002.6
SudanSugarNDVI Anom0.4846857002.2
SudanSugarEVI Anom0.4866917001.3
SudanSugarNDWI Anom−0.4256827002.5
SudanWheatNDVI Avg0.23945540013.7
SudanWheatEVI Avg0.34146440016.1
SudanWheatNDWI Avg−0.08345440013.4
SudanWheatNDVI Anom0.33045840014.6
SudanWheatEVI Anom0.35946640016.5
SudanWheatNDWI Anom−0.36445640014.0
SwazilandCornNDVI Avg0.917897027.0
SwazilandCornEVI Avg0.811857022.1
SwazilandCornNDWI Avg−0.950957036.1
SwazilandCornNDVI Anom0.826837018.9
SwazilandCornEVI Anom0.720857021.1
SwazilandCornNDWI Anom−0.853887025.1
SwazilandSugarNDVI Avg−0.3836596904.5
SwazilandSugarEVI Avg−0.2146636904.0
SwazilandSugarNDWI Avg0.5696506905.8
SwazilandSugarNDVI Anom−0.2786636903.9
SwazilandSugarEVI Anom−0.1756636903.9
SwazilandSugarNDWI Anom0.4496586904.6
ZimbabweCornNDVI Avg0.725931170045.3
ZimbabweCornEVI Avg−0.6181347170020.8
ZimbabweCornNDWI Avg−0.8161022170039.9
ZimbabweCornNDVI Anom0.729934170045.1
ZimbabweCornEVI Anom0.2121268170025.4
ZimbabweCornNDWI Anom−0.759956170043.8
ZimbabweCottonNDVI Avg0.89714223038.4
ZimbabweCottonEVI Avg−0.02917623023.7
ZimbabweCottonNDWI Avg−0.87315823031.2
ZimbabweCottonNDVI Anom0.89814223038.1
ZimbabweCottonEVI Anom0.34619923013.4
ZimbabweCottonNDWI Anom−0.87214823035.8
ZimbabweSugarNDVI Avg0.4964484602.6
ZimbabweSugarEVI Avg0.2364514601.9
ZimbabweSugarNDWI Avg−0.4664524601.8
ZimbabweSugarNDVI Anom0.4934484602.5
ZimbabweSugarEVI Anom−0.0554534601.5
ZimbabweSugarNDWI Anom−0.5174494602.4

References

  1. Hamer, H.; Picanso, R.; Prusacki, J.J.; Rater, B.; Johnson, J.; Barnes, K.; Parsons, J.; Young, D.L. USDA/NASS QuickStats US Crop Data. 2017. Available online: https://quickstats.nass.usda.gov (accessed on 30 October 2018).
  2. Menne, M.J.; Durre, I.; Vose, R.S.; Gleason, B.E.; Houston, T.G. An Overview of the Global Historical Climatology Network-Daily Database. J. Atmos. Ocean. Technol. 2012, 29, 897–910. [Google Scholar] [CrossRef]
  3. Petersen, L.K. America’s Farming Future: The Impact of Climate Change on Crop Yields. AMS 2018. [Google Scholar] [CrossRef]
  4. McKinnon, K. GHCN-D: Global Historical Climatology Network Daily Temperatures NCAR—Climate Data Guide. 2016. Available online: https://climatedataguide.ucar.edu/climate-data/ghcn-d-global-historical-climatology-network-daily-temperatures (accessed on 30 October 2018).
  5. Carletto, G.; Beegle, K.; Himelein, K.; Kilic, T.; Murray, S.; Oseni, M.; Scott, K.; Steele, D. Improving the Availability, Quality and Policy-Relevance of Agricultural Data: The Living Standards Measurement Study Integrated Surveys on Agriculture. In Proceedings of the Third Wye City Group Global Conference on Agricultural and Rural Household Statistic, Washington, DC, USA, 8–9 April 2008; p. 26. [Google Scholar]
  6. Carletto, C.; Jolliffe, D.; Banerjee, R. From Tragedy to Renaissance: Improving Agricultural Data for Better Policies. J. Dev. Stud. 2015, 51, 133–148. [Google Scholar] [CrossRef]
  7. Challinor, A.; Wheeler, T.; Garforth, C.; Craufurd, P.; Kassam, A. Assessing the vulnerability of food crop systems in Africa to climate change. Clim. Chang. 2007, 83, 381–399. [Google Scholar] [CrossRef] [Green Version]
  8. Conceicao, P.; Levine, S.; Lipton, M.; Warren-Rodriguez, A. Toward a food secure future: Ensuring food security for sustainable human development in Sub-Saharan Africa. Food Policy 2016, 60, 1–9. [Google Scholar] [CrossRef]
  9. The World Bank. World Development Report 2008: Agriculture for Development; The World Bank: Washington, DC, USA, 2008. [Google Scholar]
  10. Hawkesford, M.J.; Araus, J.L.; Park, R.; Calderini, D.; Miralles, D.; Shen, T.; Zhang, J.; Parry, M.A.J. Prospects of doubling global wheat yields. Food Energy Secur. 2013, 2, 34–48. [Google Scholar] [CrossRef] [Green Version]
  11. Mann, M.L.; Warner, J.M. Ethiopian wheat yield and yield gap estimation: A spatially explicit small area integrated data approach. Field Crops Res. 2017, 201, 60–74. [Google Scholar] [CrossRef] [PubMed]
  12. Maas, S.J. Using Satellite Data to Improve Model Estimates of Crop Yield. Agron. J. 1988, 80, 655–662. [Google Scholar] [CrossRef]
  13. Hellden, U.; Eklundh, L. National Drought Impact Monitoring—A NOAA NDVI and Precipitation Data Study of Ethiopia; Technical Report; Lund University Press: Lund, Sweden, 1988. [Google Scholar]
  14. Gao, B.C. NDWI—A normalized difference water index for remote sensing of vegetation liquid water from space. Remote Sens. Environ. 1996, 58, 257–266. [Google Scholar] [CrossRef]
  15. Lobell, D.B. The use of satellite data for crop yield gap analysis. Field Crops Res. 2013, 143, 56–64. [Google Scholar] [CrossRef]
  16. Johnson, D.M. A comprehensive assessment of the correlations between field crop yields and commonly used MODIS products. Int. J. Appl. Earth Obs. Geoinf. 2016, 52, 65–81. [Google Scholar] [CrossRef] [Green Version]
  17. Gao, F.; Anderson, M.C.; Zhang, X.; Yang, Z.; Alfieri, J.G.; Kustas, W.P.; Mueller, R.; Johnson, D.M.; Prueger, J.H. Toward mapping crop progress at field scales through fusion of Landsat and MODIS imagery. Remote Sens. Environ. 2017, 188, 9–25. [Google Scholar] [CrossRef]
  18. Vuolo, F.; Neuwirth, M.; Immitzer, M.; Atzberger, C.; Ng, W.T. How much does multi-temporal Sentinel-2 data improve crop type classification? Int. J. Appl. Earth Obs. Geoinf. 2018, 72, 122–130. [Google Scholar] [CrossRef]
  19. Jin, Z.; Azzari, G.; Burke, M.; Aston, S.; Lobell, D.B. Mapping Smallholder Yield Heterogeneity at Multiple Scales in Eastern Africa. Remote Sens. 2017, 9, 931. [Google Scholar] [CrossRef]
  20. Carletto, C.; Gourlay, S.; Winters, P. From Guesstimates to GPStimates: Land Area Measurement and Implications for Agricultural Analysis. J. Afr. Econ. 2015, 24, 593–628. [Google Scholar] [CrossRef] [Green Version]
  21. United States Department of Agriculture, National Agricultural Statistics Service. Farms and Land in Farms: 2017 Summary. 2018. Available online: http://usda.mannlib.cornell.edu/usda/current/FarmLandIn/FarmLandIn-02-16-2018.pdf (accessed on 30 October 2018).
  22. Burke, M.; Lobell, D.B. Satellite-based assessment of yield variation and its determinants in smallholder African systems. Proc. Nat. Acad. Sci. USA 2017, 114, 2189–2194. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Fritz, S.; You, L.; Bun, A.; See, L.; McCallum, I.; Schill, C.; Perger, C.; Liu, J.; Hansen, M.; Obersteiner, M. Cropland for sub-Saharan Africa: A synergistic approach using five land cover data sets. Geophys. Res. Lett. 2011, 38. [Google Scholar] [CrossRef] [Green Version]
  24. Vancutsem, C.; Pekel, J.; Kayitakire, F. Dynamic mapping of cropland areas in Sub-Saharan Africa using MODIS time series. In Proceedings of the 2011 6th International Workshop on the Analysis of Multi-Temporal Remote Sensing Images (Multi-Temp), Trento, Italy, 12–14 July 2011; pp. 25–28. [Google Scholar]
  25. Vancutsem, C.; Marinho, E.; Kayitakire, F.; See, L.; Fritz, S.; Vancutsem, C.; Marinho, E.; Kayitakire, F.; See, L.; Fritz, S. Harmonizing and Combining Existing Land Cover/Land Use Datasets for Cropland Area Monitoring at the African Continental Scale. Remote Sens. 2012, 5, 19–41. [Google Scholar] [CrossRef] [Green Version]
  26. Atzberger, C. Advances in Remote Sensing of Agriculture: Context Description, Existing Operational Monitoring Systems and Major Information Needs. Remote Sens. 2013, 5, 949–981. [Google Scholar] [CrossRef] [Green Version]
  27. Zhang, X.; Zhang, Q. Monitoring interannual variation in global crop yield using long-term AVHRR and MODIS observations. ISPRS J. Photogram. Remote Sens. 2016, 114, 191–205. [Google Scholar] [CrossRef]
  28. Tadesse, T.; Senay, G.B.; Berhan, G.; Regassa, T.; Beyene, S. Evaluating a satellite-based seasonal evapotranspiration product and identifying its relationship with other satellite-derived products and crop yield: A case study for Ethiopia. Int. J. Appl. Earth Obs. Geoinf. 2015, 40, 39–54. [Google Scholar] [CrossRef]
  29. Atzberger, C.; Formaggio, A.R.; Shimabukuro, Y.E.; Udelhoven, T.; Mattiuzzi, M.; Sanchez, G.A.; Arai, E. Obtaining crop-specific time profiles of NDVI: The use of unmixing approaches for serving the continuity between SPOT-VGT and PROBA-V time series. Int. J. Remote Sens. 2014, 35, 2615–2638. [Google Scholar] [CrossRef]
  30. Immitzer, M.; Böck, S.; Einzmann, K.; Vuolo, F.; Pinnel, N.; Wallner, A.; Atzberger, C. Fractional cover mapping of spruce and pine at 1ha resolution combining very high and medium spatial resolution satellite imagery. Remote Sens. Environ. 2018, 204, 690–703. [Google Scholar] [CrossRef]
  31. Atzberger, C.; Rembold, F. Mapping the Spatial Distribution of Winter Crops at Sub-Pixel Level Using AVHRR NDVI Time Series and Neural Nets. Remote Sens. 2013, 5, 1335–1354. [Google Scholar] [CrossRef] [Green Version]
  32. Gissila, T.; Black, E.; Grimes, D.I.F.; Slingo, J.M. Seasonal forecasting of the Ethiopian summer rains. Int. J. Climatol. 2004, 24, 1345–1358. [Google Scholar] [CrossRef] [Green Version]
  33. Tadesse, T.; Demisse, G.B.; Zaitchik, B.; Dinku, T. Satellite-based hybrid drought monitoring tool for prediction of vegetation condition in Eastern Africa: A case study for Ethiopia. Water Resour. Res. 2014, 50, 2176–2190. [Google Scholar] [CrossRef] [Green Version]
  34. Klisch, A.; Atzberger, C. Operational Drought Monitoring in Kenya Using MODIS NDVI Time Series. Remote Sens. 2016, 8, 267. [Google Scholar] [CrossRef]
  35. Enenkel, M.; Steiner, C.; Mistelbauer, T.; Dorigo, W.; Wagner, W.; See, L.; Atzberger, C.; Schneider, S.; Rogenhofer, E. A Combined Satellite-Derived Drought Indicator to Support Humanitarian Aid Organizations. Remote Sens. 2016, 8, 340. [Google Scholar] [CrossRef] [Green Version]
  36. Rembold, F.; Atzberger, C.; Savin, I.; Rojas, O. Using Low Resolution Satellite Imagery for Yield Prediction and Yield Anomaly Detection. Remote Sens. 2013, 5, 1704–1733. [Google Scholar] [CrossRef] [Green Version]
  37. Becker-Reshef, I.; Justice, C.; Sullivan, M.; Vermote, E.; Tucker, C.; Anyamba, A.; Small, J.; Pak, E.; Masuoka, E.; Schmaltz, J.; et al. Monitoring Global Croplands with Coarse Resolution Earth Observations: The Global Agriculture Monitoring (GLAM) Project. Remote Sens. 2010, 2, 1589–1609. [Google Scholar] [CrossRef] [Green Version]
  38. Senay, G.; Velpuri, N.; Bohms, S.; Budde, M.; Young, C.; Rowland, J.; Verdin, J. Chapter 9—Drought Monitoring and Assessment: Remote Sensing and Modeling Approaches for the Famine Early Warning Systems Network. In Hydro-Meteorological Hazards, Risks and Disasters; Shroder, J.F., Paron, P., Baldassarre, G.D., Eds.; Elsevier: Boston, MA, USA, 2015; pp. 233–262. [Google Scholar]
  39. Funk, C.; Verdin, J.P. Real-Time Decision Support Systems: The Famine Early Warning System Network. In Satellite Rainfall Applications for Surface Hydrology; Springer: Dordrecht, The Netherlands, 2010; pp. 295–320. [Google Scholar]
  40. Molly, E.; Brown, E.B.B. Evaluating the use of remote sensing data in the U.S. Agency for International Development Famine Early Warning Systems Network. J. Appl. Remote Sens. 2012, 6. [Google Scholar] [CrossRef]
  41. GIEWS—Global Information and Early Warning System Food and Agriculture Organization of the United Nations. Available online: http://www.fao.org/giews (accessed on 30 October 2018).
  42. Baruth, B.; Royer, A.; Klisch, A.; Genovese, G. The Use of Remote Sensing Within the Mars Crop Yield Monitoring System of the European Commission. In Proceedings of the 21st Congress of the International Society for Photogrammetry and Remote Sensing—ISPRS, Beijing, China, 3–11 July 2008; Volume 27, pp. 935–940. [Google Scholar]
  43. Monitoring Agricultural ResourceS (MARS). Available online: https://www.eea.europa.eu/data-and-maps/data/external/monitoring-agricultural-resources-mars (accessed on 30 October 2018).
  44. Wu, B.; Meng, J.; Li, Q.; Yan, N.; Du, X.; Zhang, M. Remote sensing-based global crop monitoring: Experiences with China’s CropWatch system. Int. J. Digit. Earth 2014, 7, 113–137. [Google Scholar] [CrossRef]
  45. Domenikiotis, C.; Spiliotopoulos, M.; Tsiros, E.; Dalezios, N.R. Early cotton yield assessment by the use of the NOAA/AVHRR derived Vegetation Condition Index (VCI) in Greece. Int. J. Remote Sens. 2004, 25, 2807–2819. [Google Scholar] [CrossRef]
  46. Patel, N.R.; Anapashsha, R.; Kumar, S.; Saha, S.K.; Dadhwal, V.K. Assessing potential of MODIS derived temperature/vegetation condition index (TVDI) to infer soil moisture status. Int. J. Remote Sens. 2009, 30, 23–39. [Google Scholar] [CrossRef]
  47. Labs, D. Descartes Labs: Platform. 2018. Available online: https://www.descarteslabs.com/platform.html (accessed on 30 October 2018).
  48. Petersen, L.K. MODIS Crop Prediction Code Repository. 2018. Original-Date: 2017-08-01T21:19:50Z. Available online: https://github.com/lillianpetersen/CropPredictionFromSatellite2018 (accessed on 30 October 2018).
  49. McFeeters, S.K. The use of the Normalized Difference Water Index (NDWI) in the delineation of open water features. Int. J. Remote Sens. 1996, 17, 1425–1432. [Google Scholar] [CrossRef]
  50. Huete, A.; Didan, K.; Miura, T.; Rodriguez, E.P.; Gao, X.; Ferreira, L.G. Overview of the radiometric and biophysical performance of the MODIS vegetation indices. Remote Sens. Environ. 2002, 83, 195–213. [Google Scholar] [CrossRef]
  51. Chen, P.Y.; Fedosejevs, G.; Tiscareno-LoPez, M.; Arnold, J.G. Assessment of MODIS-EVI, MODIS-NDVI and VEGETATION-NDVI Composite Data Using Agricultural Measurements: An Example at Corn Fields in Western Mexico. Environ. Monit. Assess. 2006, 119, 69–82. [Google Scholar] [CrossRef] [PubMed]
  52. Bolton, D.K.; Friedl, M.A. Forecasting crop yield using remotely sensed vegetation indices and crop phenology metrics. Agric. For. Meteorol. 2013, 173, 74–84. [Google Scholar] [CrossRef]
  53. Matsushita, B.; Yang, W.; Chen, J.; Onda, Y.; Qiu, G.; Matsushita, B.; Yang, W.; Chen, J.; Onda, Y.; Qiu, G. Sensitivity of the Enhanced Vegetation Index (EVI) and Normalized Difference Vegetation Index (NDVI) to Topographic Effects: A Case Study in High-density Cypress Forest. Sensors 2007, 7, 2636–2651. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  54. Xiao, X.; Zhang, Q.; Hollinger, D.; Aber, J.; Moore, B. Modeling Gross Primary Production of an Evergreen Needleleaf Forest Using Modis and Climate Data. Ecol. Appl. 2005, 15, 954–969. [Google Scholar] [CrossRef]
  55. United States Department of Agriculture, National Agricultural Statistics Service. Farms and Farmland: Numbers, Acreage, Ownership, and Use; United States Department of Agriculture, National Agricultural Statistics Service: Washington, DC, USA, 2014.
  56. Cooper, P.J.M.; Dimes, J.; Rao, K.P.C.; Shapiro, B.; Shiferaw, B.; Twomlow, S. Coping better with current climatic variability in the rain-fed farming systems of sub-Saharan Africa: An essential first step in adapting to future climate change? Agric. Ecosyst. Environ. 2008, 126, 24–35. [Google Scholar] [CrossRef] [Green Version]
  57. United States Department of Agriculture, National Agricultural Statistics Service. Illinois Corn County Estimates: Corn Area Planted And Harvested, Yield, and Production by County—Illinois; Department of Agriculture, National Agricultural Statistics Service: Washington, DC, USA, 2017.
  58. United States Department of Agriculture, National Agricultural Statistics Service. Illinois Corn County Estimates: Soybean Area Planted And Harvested, Yield, and Production by County—Illinois; Department of Agriculture, National Agricultural Statistics Service: Washington, DC, USA, 2017.
  59. United States Department of Agriculture, National Agricultural Statistics Service. Illinois Corn County Estimates: Sorghum Area Planted and Harvested, Yield, and Production by County—Illinois; Department of Agriculture, National Agricultural Statistics Service: Washington, DC, USA, 2016.
  60. Mundi, I. Agricultural Production Statistics by Country. 2018. Available online: https://www.indexmundi.com/agriculture (accessed on 30 October 2018).
  61. You, L.; Wood, S.; Wood-Sichra, U.; Wu, W. Generating global crop distribution maps: From census to grid. Agric. Syst. 2014, 127, 53–60. [Google Scholar] [CrossRef] [Green Version]
  62. Petersen, L.K. Dense Farming Regions in Each African Country. 2018. Available online: https://gist.github.com/lillianpetersen/6b2227bad0c44d0a9565c717e6f178d3 (accessed on 30 October 2018).
  63. FAO GIEWS Country Briefs-Home. Available online: http://www.fao.org/giews/countrybrief/ (accessed on 30 October 2018).
  64. United States Department of Agriculture. Field Crops Usual Planting and Harvesting Dates; Technical Report; United States Department of Agriculture: Washington, DC, USA, 2010.
  65. GraphPadSoftware. p-Value Calculator. 2018. Available online: https://www.graphpad.com/quickcalcs/pvalue1.cfm (accessed on 30 October 2018).
  66. Taffesse, A.S. Crop production in Ethiopia. In Food and Agriculture in Ethiopia Progress and Policy Challenges; University of Pennsylvania Press: Philadelphia, PA, USA, 2012; pp. 53–83. [Google Scholar]
  67. Petersen, L.K. Predicting Food Shortages in Africa from Satellite Imagery. 2018. Available online: https://lillianpetersen.github.io/africa_satellite (accessed on 30 October 2018).
Figure 1. Farm fields by satellite in Ethiopia and Illinois at the same resolution. In Africa, small farm fields (smaller than a MODIS pixel) and poor ground truth data increase the difficulty of analyzing and predicting crop yields. Images are from Google Maps.
Figure 1. Farm fields by satellite in Ethiopia and Illinois at the same resolution. In Africa, small farm fields (smaller than a MODIS pixel) and poor ground truth data increase the difficulty of analyzing and predicting crop yields. Images are from Google Maps.
Remotesensing 10 01726 g001
Figure 2. Workflow diagram.
Figure 2. Workflow diagram.
Remotesensing 10 01726 g002
Figure 3. Snapshots of two MODIS satellite passes over Pike county, Illinois (a,b); and the cloud mask for the second image (c). Axes labels show MODIS pixels. Daily MODIS imagery was retrieved for every county in Illinois for 2000–2017 from the Descartes Labs Satellite Platform [47].
Figure 3. Snapshots of two MODIS satellite passes over Pike county, Illinois (a,b); and the cloud mask for the second image (c). Axes labels show MODIS pixels. Daily MODIS imagery was retrieved for every county in Illinois for 2000–2017 from the Descartes Labs Satellite Platform [47].
Remotesensing 10 01726 g003
Figure 4. August average NDVI for a drought year (a); August average NDVI for a wet year (b); and the NDVI August climatology (c).
Figure 4. August average NDVI for a drought year (a); August average NDVI for a wet year (b); and the NDVI August climatology (c).
Remotesensing 10 01726 g004
Figure 5. Illinois mean corn yield since 2000 (green) correlated with the anomalies of July and August: NDVI ((a), blue); EVI ((b), blue) and NDWI ((c), blue). Soybeans and sorghum performed similarly. Reported yields are from the USDA [1], and anomalies are computed from daily MODIS imagery using Equations (1)–(4).
Figure 5. Illinois mean corn yield since 2000 (green) correlated with the anomalies of July and August: NDVI ((a), blue); EVI ((b), blue) and NDWI ((c), blue). Soybeans and sorghum performed similarly. Reported yields are from the USDA [1], and anomalies are computed from daily MODIS imagery using Equations (1)–(4).
Remotesensing 10 01726 g005
Figure 6. Corn yield (Column 1), NDVI anomaly (Column 2), EVI anomaly (Column 3), and NDWI anomaly (Column 4) by county in Illinois: for the drought year 2012 (a); and for the wet year 2014 (b). During the drought year, there are low yields, low NDVI and EVI anomalies, and high NDWI anomalies, while the wet year is opposite. Soybeans and sorghum performed similarly.
Figure 6. Corn yield (Column 1), NDVI anomaly (Column 2), EVI anomaly (Column 3), and NDWI anomaly (Column 4) by county in Illinois: for the drought year 2012 (a); and for the wet year 2014 (b). During the drought year, there are low yields, low NDVI and EVI anomalies, and high NDWI anomalies, while the wet year is opposite. Soybeans and sorghum performed similarly.
Remotesensing 10 01726 g006
Figure 7. Correlations for each month between Illinois corn (green), soybean (yellow), and sorghum (blue) yields and the anomalies of NDVI (dashed), EVI (dot-dashed), and NDWI * ( 1 ) (dotted). Each point represents the correlation for that month between the respective data, and the multivariate regression correlations are represented by the solid horizontal lines. Every regression with corn included 1559 data points, soybeans had 1567 points, and sorghum had 188 points. July and August have the highest predictive skill for all three crops. These crops are harvested in October, meaning there is a two to three month lead time on yield estimates.
Figure 7. Correlations for each month between Illinois corn (green), soybean (yellow), and sorghum (blue) yields and the anomalies of NDVI (dashed), EVI (dot-dashed), and NDWI * ( 1 ) (dotted). Each point represents the correlation for that month between the respective data, and the multivariate regression correlations are represented by the solid horizontal lines. Every regression with corn included 1559 data points, soybeans had 1567 points, and sorghum had 188 points. July and August have the highest predictive skill for all three crops. These crops are harvested in October, meaning there is a two to three month lead time on yield estimates.
Remotesensing 10 01726 g007
Figure 8. The correlations between the August anomalies of NDVI (left), EVI (middle), and NDWI (right) with corn (top), soybeans (middle), and sorghum (bottom). Each point represents a single county and year. Corn and soybeans have the best correlations, and sorghum is slightly worse, likely because it is grown much less in Illinois than the other crops. All correlations are extremely significant with p-values less than 10 6 . August was the month with the highest correlations to yields.
Figure 8. The correlations between the August anomalies of NDVI (left), EVI (middle), and NDWI (right) with corn (top), soybeans (middle), and sorghum (bottom). Each point represents a single county and year. Corn and soybeans have the best correlations, and sorghum is slightly worse, likely because it is grown much less in Illinois than the other crops. All correlations are extremely significant with p-values less than 10 6 . August was the month with the highest correlations to yields.
Remotesensing 10 01726 g008
Figure 9. An example of the multivariate regression comprised of all three satellite indices and five months (15 variables total), but here the corn yield is only plotted against August NDVI and NDWI for visualization purposes. The multivariate regression improved the individual correlations for corn to 0.86.
Figure 9. An example of the multivariate regression comprised of all three satellite indices and five months (15 variables total), but here the corn yield is only plotted against August NDVI and NDWI for visualization purposes. The multivariate regression improved the individual correlations for corn to 0.86.
Remotesensing 10 01726 g009
Figure 10. Accuracy of the multivariate regression predictions of yields in Illinois for corn (a), soybeans (b), and sorghum (b). Each point shows the predicted yield versus the reported yield for a single county and year. The model was trained with a randomly selected 90% of the data and then predicted the other 10%. This process was then repeated ten times to ensure randomness. The median error (Equation (6)) was lowest for corn with 5.7%, soybeans was similar at 5.8%, and sorghum had the worst error with 21.9%.
Figure 10. Accuracy of the multivariate regression predictions of yields in Illinois for corn (a), soybeans (b), and sorghum (b). Each point shows the predicted yield versus the reported yield for a single county and year. The model was trained with a randomly selected 90% of the data and then predicted the other 10%. This process was then repeated ten times to ensure randomness. The median error (Equation (6)) was lowest for corn with 5.7%, soybeans was similar at 5.8%, and sorghum had the worst error with 21.9%.
Remotesensing 10 01726 g010
Figure 11. A representative area was chosen in the densest agricultural region of each country to reduce download and computation time, making it feasible to predict crop production for the entire continent of Africa on a home laptop computer. The coordinates of each box can be viewed at [62].
Figure 11. A representative area was chosen in the densest agricultural region of each country to reduce download and computation time, making it feasible to predict crop production for the entire continent of Africa on a home laptop computer. The coordinates of each box can be viewed at [62].
Remotesensing 10 01726 g011
Figure 12. The box examined in Ethiopia (a) and its September NDVI anomalies during a wet year (b) and a dry year (c). The NDVI anomalies are particularly evident in the Rift Valley, where farming is the most dense.
Figure 12. The box examined in Ethiopia (a) and its September NDVI anomalies during a wet year (b) and a dry year (c). The NDVI anomalies are particularly evident in the Rift Valley, where farming is the most dense.
Remotesensing 10 01726 g012
Figure 13. NDVI monthly average for: Ethiopia (a); Tunisia (b); and Morocco (c). The annual rainy season produces high NDVI values and corresponds to the crop-growing months. To show an example of correlations, Ethiopia includes the corn production as green bars, which has a very high correlation to maximum NDVI at 0.98. Production data were retrieved for every country from Index Mundi [60].
Figure 13. NDVI monthly average for: Ethiopia (a); Tunisia (b); and Morocco (c). The annual rainy season produces high NDVI values and corresponds to the crop-growing months. To show an example of correlations, Ethiopia includes the corn production as green bars, which has a very high correlation to maximum NDVI at 0.98. Production data were retrieved for every country from Index Mundi [60].
Remotesensing 10 01726 g013
Figure 14. Vegetation index at the height of the growing season (green) with crop production (blue) for Ethiopia (a), Tunisia (b), and Morocco (c). Ethiopia and Tunisia have significant correlations at 0.99 and 0.97. Morocco has a correlation of 0.67 and a p-value of 0.1. Ethiopia shows an example of the 2018 crop predictions (pink), which was predicted based on the historical regression and was later compared to reported crop production (light green). The error in Ethiopia is very low at 1.8%.
Figure 14. Vegetation index at the height of the growing season (green) with crop production (blue) for Ethiopia (a), Tunisia (b), and Morocco (c). Ethiopia and Tunisia have significant correlations at 0.99 and 0.97. Morocco has a correlation of 0.67 and a p-value of 0.1. Ethiopia shows an example of the 2018 crop predictions (pink), which was predicted based on the historical regression and was later compared to reported crop production (light green). The error in Ethiopia is very low at 1.8%.
Remotesensing 10 01726 g014
Figure 15. The map in the center displays the predicted crop production in standard deviations from the average. Only countries with harvests between December 2017 and June 2018 are displayed. Surrounding the map are bar charts of satellite indices (blue), historical crop production (dark green), predicted 2018 crop production (pink), and actual 2018 production (light green). To view the accuracy of predictions for all crops and countries, see Table A1.
Figure 15. The map in the center displays the predicted crop production in standard deviations from the average. Only countries with harvests between December 2017 and June 2018 are displayed. Surrounding the map are bar charts of satellite indices (blue), historical crop production (dark green), predicted 2018 crop production (pink), and actual 2018 production (light green). To view the accuracy of predictions for all crops and countries, see Table A1.
Remotesensing 10 01726 g015
Figure 16. Histogram of the 2018 prediction errors for each crop and country, as shown in Table A1. Forty percent of the predictions are under 5% error, and over half are below 9%.
Figure 16. Histogram of the 2018 prediction errors for each crop and country, as shown in Table A1. Forty percent of the predictions are under 5% error, and over half are below 9%.
Remotesensing 10 01726 g016
Figure 17. The errors of the 2018 predictions examined in five categories: vegetation index (a); crop (b); country (c); latitude (d); and production anomaly (e). Each orange line represents the median error, the boxes represent the ends of the second and third quartiles, and the whiskers show the middle of the first and fourth quartiles. On the vertical axes, the number of samples of each category are in parentheses. A full listing of predictions and errors are in Table A1.
Figure 17. The errors of the 2018 predictions examined in five categories: vegetation index (a); crop (b); country (c); latitude (d); and production anomaly (e). Each orange line represents the median error, the boxes represent the ends of the second and third quartiles, and the whiskers show the middle of the first and fourth quartiles. On the vertical axes, the number of samples of each category are in parentheses. A full listing of predictions and errors are in Table A1.
Remotesensing 10 01726 g017aRemotesensing 10 01726 g017b
Table 1. Definitions of vegetation indices to measure crop health. NIR is near infrared; G is the gain factor; L is the canopy background adjustment that addresses non-linear, differential NIR and red radiant transfer through a canopy; and C 1 and C 2 are the coefficients of the aerosol resistance term, which uses the blue band to correct for aerosol influences in the red band.
Table 1. Definitions of vegetation indices to measure crop health. NIR is near infrared; G is the gain factor; L is the canopy background adjustment that addresses non-linear, differential NIR and red radiant transfer through a canopy; and C 1 and C 2 are the coefficients of the aerosol resistance term, which uses the blue band to correct for aerosol influences in the red band.
IndexDescriptionFormula
NDVINormalized Difference Vegetation Index NDVI = NIR Red NIR + Red
EVIEnhanced Vegetation Index EVI = G NIR Red NIR + C 1 Red C 2 Blue + L
NDWINormalized Difference Water Index NDWI = Green NIR Green + NIR

Share and Cite

MDPI and ACS Style

Petersen, L.K. Real-Time Prediction of Crop Yields From MODIS Relative Vegetation Health: A Continent-Wide Analysis of Africa. Remote Sens. 2018, 10, 1726. https://doi.org/10.3390/rs10111726

AMA Style

Petersen LK. Real-Time Prediction of Crop Yields From MODIS Relative Vegetation Health: A Continent-Wide Analysis of Africa. Remote Sensing. 2018; 10(11):1726. https://doi.org/10.3390/rs10111726

Chicago/Turabian Style

Petersen, Lillian Kay. 2018. "Real-Time Prediction of Crop Yields From MODIS Relative Vegetation Health: A Continent-Wide Analysis of Africa" Remote Sensing 10, no. 11: 1726. https://doi.org/10.3390/rs10111726

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop