1. Introduction
Monitoring deforestation at sub-annual scales (e.g., weekly or monthly) using satellite data is becoming an important part of initiatives that aim to reduce deforestation across the globe. Unlike annual monitoring, sub-annual monitoring allows timely detection of deforestation events, thus providing an opportunity for early interventions to stop illegal deforestation activities [1]. For example, the Brazilian National Institute for Space Research (INPE) monitors deforestation events at sub-annual scales in the Amazon using a system based on Moderate Resolution Imaging Spectroradiometer data and bi-temporal change detection, which has played an important role in reducing deforestation in Brazil. However, forest monitoring systems that detect deforestation events at sub-annual scales with a bi-temporal change detection approach may face challenges in areas where the forest has strong seasonality. To address this challenge, methods that detect deforestation at sub-annual scales from satellite image time series while accounting for seasonal variations have been developed in recent years [2,3,4,5,6,7,8]. These methods detect deforestation events by testing whether a newly acquired observation at a particular pixel is abnormally low compared with the historical temporal dynamics of the forest at that pixel [2,3,6,9]. For the test to be robust, however, a pixel time series must contain many historical observations. In some areas, pixel time series often do not have enough historical observations, mainly because of persistent cloud cover coupled with a relatively long revisit time in the past [10], especially for satellite sensors with high and medium spatial resolutions. To remedy the problem of cloud cover, new methods that detect deforestation events at sub-annual scales by combining optical and synthetic aperture radar (SAR) data have been proposed [9,11]. However, these methods also require a pixel to have many historical observations. In the past, SAR sensors, which can penetrate clouds, had limited temporal coverage, which also resulted in sparse time series. Such sparse historical observations make it difficult to properly model the normal temporal dynamics of the forest, and may lead to many false detections during deforestation detection. In the near future, however, temporally dense SAR time series from Sentinel sensors will be available, and such dense time series, when combined with data from optical sensors, will address some of the current challenges associated with monitoring deforestation at sub-annual scales in the tropics [12]. In particular, it will be possible to detect deforestation events within a few days of occurrence. However, such dense time series will bring another challenge: pre-processing huge amounts of historical data for large areas is likely to take a relatively long time, thus affecting rapid detection of deforestation events.
The challenge associated with monitoring deforestation at sub-annual scales in areas with scarce historical observations can also be addressed by exploiting both temporal and spatial information in satellite image time series. An individual pixel may not have enough observations, but the spatial context derived from neighbouring pixels can provide sufficient information to determine whether a pixel with scarce historical observations is deforested or not. Such an approach may reduce the amount of historical satellite data that needs to be processed when monitoring deforestation at sub-annual scales.
Recently, Huang and colleagues [13] exploited spatiotemporal Landsat data to identify deforested areas, but that work focused on detecting deforestation at the annual scale. Similarly, Hamunyela and co-workers [5] used the spatial context of pixels to reduce seasonal variations in Landsat time series before detecting deforestation at sub-annual scales using a method [3] which relies on individual pixel time series. These studies demonstrate that spatiotemporal information of image time series is useful for deforestation monitoring. However, methods for detecting deforestation at sub-annual scales based on integrated analysis of spatiotemporal data have not been published, to the best of our knowledge.
The approach proposed in this paper is to identify deforested pixels by exploiting spatiotemporal information available in the space-time data cube of satellite image time series. In this way, a pixel with as few as three temporal observations in its time series is still expected to allow assessment of deforestation at sub-annual scales. This is because observations in the space-time data cube are likely to be sufficient for deciding whether a newly acquired observation is abnormally low. However, shifting to space-time change analysis requires two major challenges to be overcome. The first challenge concerns dealing with seasonality in the data cube. Seasonal variations may disguise deforestation in the data if not removed, especially in dry forests where seasonality is strong. The second challenge is how to consistently identify anomalies in a space-time data cube. Existing methods for identifying anomalous observations in space-time data cubes of climatic data [14] and global gross primary production [15] focus on the temporal perspective, since thresholds are defined without considering spatial context.
This paper describes a data-driven space-time change detection method for monitoring deforestation at sub-annual scales. The method detects deforestation at pixel level as an extreme event in vegetation index values within local space-time cubes of satellite image time series. With this method, spatial-temporal information in satellite image time series is exploited to detect deforestation at sub-annual scales. Our method builds upon the spatial context approach developed in a previous study [5] for reducing seasonal variations in satellite image time series. The method we propose here demonstrates how data cubes of satellite image time series can be used for sub-annual deforestation monitoring. We demonstrated our method at two sites, a dry tropical forest and a humid tropical forest (Section 3), using normalised difference vegetation index (NDVI, [16,17]) time series derived from Landsat-5/TM and Landsat-7/ETM+ data.
3. Monitoring Deforestation in Landsat NDVI Data Cubes—A Case Study
We assessed our space-time change detection method at two study sites, shown in Figure 4. One study site is a dry tropical forest located southeast of Santa Cruz de la Sierra in Bolivia (centred at 18.388°S, 62.361°W), and the other study site is a humid tropical forest located west of Ariquemes, Rondonia State, Brazil (centred at 10.2952°S, 64.0478°W). The forest at the Bolivian site is characterised by strong seasonality in its photosynthetic activity, whereas the seasonality is less pronounced at the Brazilian site. Each of the study sites covers an area of about 10,000 km². Deforestation at the Bolivian site is dominated mainly by industrial agricultural expansion that resulted in large blocks of deforestation events, whereas deforestation events at the Brazilian site are heterogeneous in size, corresponding mostly to a process of colonisation. With varying degrees of seasonality and different deforestation processes, these study sites are particularly suitable for testing a new method for sub-annual deforestation monitoring.
We used NDVI image time series derived from atmospherically [23] and geometrically corrected Landsat-5/TM and Landsat-7/ETM+ images. Landsat images were obtained from the United States Geological Survey (USGS) Landsat Surface Reflectance (SR) Climate Data Record (CDR). We used all available (2000–2014) terrain-corrected images (L1T). We assumed that co-registration of Landsat-5/TM and Landsat-7/ETM+ images was satisfactory. Clouds and cloud shadows were masked using the Fmask procedure [24]. Note that Fmask outputs are distributed with Landsat SR CDR products. Landsat tree cover continuous fields for 2005 [19] were used to mask non-forest areas and areas with less than 10% tree cover prior to the year 2005.
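As an illustration of the pre-processing described above, the following minimal sketch computes NDVI from red and near-infrared surface reflectance and sets masked pixels to missing values. The function name `masked_ndvi` and the toy arrays are hypothetical; the boolean `invalid` mask stands in for whatever cloud, cloud-shadow, and non-forest mask is available, and the sketch is not the processing chain used in this study.

```python
import numpy as np

def masked_ndvi(red, nir, invalid):
    """NDVI = (NIR - red) / (NIR + red), with masked pixels set to NaN.

    red, nir : surface reflectance arrays of identical shape
    invalid  : boolean array, True where a pixel must be discarded
               (e.g., cloud, cloud shadow, or non-forest)
    """
    red = np.asarray(red, dtype=float)
    nir = np.asarray(nir, dtype=float)
    denom = nir + red
    ndvi = np.full(red.shape, np.nan)
    # Keep only unmasked pixels with a non-zero denominator.
    ok = (~np.asarray(invalid, dtype=bool)) & (denom != 0)
    ndvi[ok] = (nir[ok] - red[ok]) / denom[ok]
    return ndvi

# Toy 2 x 2 scene (hypothetical reflectance values).
red = np.array([[0.05, 0.10], [0.20, 0.00]])
nir = np.array([[0.45, 0.30], [0.25, 0.00]])
cloudy = np.array([[False, True], [False, False]])
ndvi = masked_ndvi(red, nir, cloudy)
```

Representing masked observations as NaN lets later steps use NaN-aware statistics (for example `np.nanpercentile`) without tracking a separate mask.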
To understand how the spatial extent of the local data cube influences spatial and temporal accuracies, we tested six spatial extents of the local data cube (Table 1). We also varied the temporal extent of the RC, herein referred to as the temporal extent of the data cube, to understand how it affects spatial and temporal accuracy for deforestation detection. The temporal extent was varied from one to five years of data, at an interval of one year. The RC contained images from 2004 for the one-year data scenario, images from 2003–2004 for the two-year scenario, and images from 2002–2004 for the three-year data scenario. For the four- and five-year data scenarios, the RC contained images from 2001–2004 and 2000–2004, respectively. Figure 5 shows the number of images available in the RC for each temporal extent at each study site.
For each spatial and temporal extent, we trained our method to determine the optimal percentile for detecting deforestation at a sub-annual scale (Section 3.2). Next, we used the optimal percentiles to validate our method (Section 3.3).
3.1. Reference Data
We used 966 sample pixels to validate the final change map for the Bolivian site, and 400 sample pixels to validate the Brazilian change map (Table 2). Training was based on 170 and 70 sample pixels for the sites in Bolivia and Brazil, respectively. Similar to [4,18,25], reference data were acquired by visual interpretation of Landsat data along with high spatial resolution imagery available in Google Earth and Bing Maps. The high spatial resolution imagery was used to determine whether an area is indeed deforested or not. At each study site, we sampled deforested and forested areas by stratified probability sampling [26,27] after manually digitising corresponding areas on Landsat images. The number of sample pixels was proportional to the area of the stratum. The deforested stratum contained areas that had been deforested during the period of 2005–2014, whereas the forested stratum covered areas which were still forested at the end of 2014. For each sample pixel in the deforested area, we estimated the date of deforestation by visually determining the Landsat image in which the deforestation event is first visible. The date of deforestation was used to assess the temporal accuracy.
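The proportional allocation mentioned above can be sketched as follows. The stratum areas in the example are hypothetical and are not the areas digitised in this study; the function name `proportional_allocation` and the largest-remainder rounding rule are likewise assumptions made only so that the allocated counts sum exactly to the requested total.

```python
def proportional_allocation(stratum_areas, total_samples):
    """Allocate sample pixels to strata in proportion to stratum area.

    stratum_areas : dict mapping stratum name -> area (any consistent unit)
    total_samples : total number of sample pixels to distribute
    """
    total_area = sum(stratum_areas.values())
    exact = {k: total_samples * a / total_area for k, a in stratum_areas.items()}
    alloc = {k: int(v) for k, v in exact.items()}  # round down first
    remainder = total_samples - sum(alloc.values())
    # Hand the leftover samples to the strata with the largest fractional parts.
    by_fraction = sorted(exact, key=lambda k: exact[k] - alloc[k], reverse=True)
    for k in by_fraction[:remainder]:
        alloc[k] += 1
    return alloc

# Hypothetical stratum areas (km^2), for illustration only.
areas = {"deforested": 1500.0, "forest": 8500.0}
alloc = proportional_allocation(areas, 400)
```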
3.2. Training the Space-Time Change Detection Method
To train our method, we generated a series of percentiles (n = 50), ranging from the 0.1th to the 5th percentile, at an interval of 0.1. Next, for each spatial and temporal extent, we used each of these percentiles as a threshold for identifying deforestation at a sub-annual scale. A training data set (Table 2) was then used to calculate the overall accuracy, the bias (calculated by subtracting the omission error from the commission error [11]), and the median temporal detection delay. Since our monitoring goal was to detect deforestation events as early as possible but with high overall accuracy, for each spatial and temporal extent we selected as the optimal percentile the one that achieved the shortest median temporal detection delay with the highest overall accuracy.
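The training procedure can be sketched as follows. The candidate grid and the bias definition follow the text; the selection rule is only one possible reading of "shortest median temporal detection delay with the highest overall accuracy" (here: minimise the delay first, then break ties by accuracy) and should be treated as an assumption. The `results` records and their values are hypothetical.

```python
import numpy as np

# 50 candidate percentiles: 0.1, 0.2, ..., 5.0
percentiles = np.round(np.arange(1, 51) * 0.1, 1)

def bias(commission_error, omission_error):
    """Bias as defined in the text: commission error minus omission error."""
    return commission_error - omission_error

def select_optimal(results):
    """Pick the record with the shortest median delay; ties go to the
    highest overall accuracy (an assumed tie-breaking rule)."""
    return min(results, key=lambda r: (r["median_delay"], -r["overall_accuracy"]))

# Hypothetical training results for three candidate percentiles.
results = [
    {"percentile": 0.5, "overall_accuracy": 0.90, "median_delay": 3},
    {"percentile": 1.0, "overall_accuracy": 0.88, "median_delay": 2},
    {"percentile": 2.0, "overall_accuracy": 0.86, "median_delay": 2},
]
best = select_optimal(results)
```

For a given candidate `p`, the corresponding threshold could be obtained with `np.nanpercentile(reference_values, p)`, where `reference_values` is a hypothetical array holding the observations of the reference cube.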
3.3. Validating the Space-Time Change Detection Method
At each study site, we applied our space-time change detection method using the optimal percentiles, determined from the training data, as thresholds for deforestation. We emulated a “near real-time” monitoring scenario, meaning that observations in the monitoring period were assessed for deforestation sequentially rather than simultaneously. Although some areas may have experienced multiple deforestation and regrowth regimes between 2005 and 2014, we only considered the first deforestation event per pixel; once a pixel was labelled as deforested, we stopped monitoring it at subsequent time steps. The spatial and temporal accuracies for change detected between 2005 and 2014 were calculated using the test data set (Table 2). More specifically, we calculated the overall accuracy, producer’s accuracy, user’s accuracy, and the median temporal detection delay. As in [5], the temporal detection delay was calculated at each sample point by counting the number of valid observations available between the image in which deforestation was visually identified and the image in which deforestation was detected by our method. Confidence intervals for the overall accuracy, as well as producer’s and user’s accuracies, were calculated using the binomial probability of success based on Wilson's method [28].
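The two accuracy-related quantities described above can be illustrated with a short sketch. The Wilson score interval follows the standard formula for a binomial proportion. The delay function encodes one assumed counting convention (valid observations after the reference image, up to and including the detection image), which the text does not spell out. Function names, dates, and counts are hypothetical.

```python
import math
from datetime import date

def wilson_interval(successes, n, z=1.959963984540054):
    """Wilson score interval for a binomial proportion (default: 95%)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1 + z**2 / n
    centre = (p + z**2 / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2)) / denom
    return centre - half, centre + half

def detection_delay(valid_dates, reference_date, detection_date):
    """Number of valid observations after the reference image, up to and
    including the detection image (assumed convention: same image -> 0)."""
    return sum(reference_date < d <= detection_date for d in valid_dates)

# Hypothetical example: 80 of 100 sample pixels correctly classified.
low, high = wilson_interval(80, 100)

# Hypothetical cloud-free acquisition dates for one sample pixel.
dates = [date(2006, 5, 1), date(2006, 6, 2), date(2006, 7, 4),
         date(2006, 8, 5), date(2006, 9, 6)]
delay = detection_delay(dates, date(2006, 6, 2), date(2006, 8, 5))
```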
5. Discussion
In this paper, we proposed a data-driven space-time change detection method that exploits spatiotemporal information in satellite image time series to detect deforestation at sub-annual scales as extreme events in local data cubes. We demonstrated the method on Landsat NDVI image time series at a dry (Bolivian) and a humid (Brazilian) tropical forest site. Our results show that the method is suitable for accurate detection of deforestation events at sub-annual scales in both dry and humid forests, even when the image time series contains only one year of historical observations. We achieved a median temporal detection delay of less than three observations, a producer’s accuracy above 70%, a user’s accuracy above 65%, and an overall accuracy above 80% at both sites when using a data cube with one year of historical observations and a window size of 56.25 ha (Figure 8). A previous study at the same study sites [5], which used a method that only analysed individual pixel time series [3] and used all available Landsat images (1984–2014), also achieved an overall accuracy above 80% and a median temporal detection delay of less than four observations. Other studies [2,4,6], which used different methods to detect deforestation at sub-annual scales at different study sites, also achieved overall accuracies above 80%. These studies expressed the temporal delay in units of time, whereas here we express the temporal detection delay as a number of observations, which makes it difficult to compare their temporal delay directly with ours.
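To relate the 56.25 ha window size quoted above to pixel counts, the following sketch converts a window side length in pixels to an area in hectares. It assumes square windows and a nominal Landsat pixel size of 30 m; under these assumptions a window of 25 × 25 pixels corresponds to 56.25 ha. The function name is hypothetical.

```python
def window_area_ha(side_pixels, pixel_size_m=30.0):
    """Area in hectares of a square window of side_pixels x side_pixels.

    Assumes square pixels of pixel_size_m metres (30 m nominal for
    Landsat TM/ETM+ reflective bands); 1 ha = 10,000 m^2.
    """
    side_m = side_pixels * pixel_size_m
    return side_m ** 2 / 10_000.0

area = window_area_ha(25)  # 750 m x 750 m window
```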
Deforestation events were mapped more accurately at the Bolivian site when using data cubes with a large spatial extent. This is mainly because deforestation events at the Bolivian site were generally large. In areas with large deforestation events, cubes with a larger spatial extent lead to accurate deforestation mapping because the cube is less likely to be entirely within the footprint of a deforestation event. If the spatial extent of a data cube is smaller than the footprint of the deforestation event, the impact of deforestation is likely to be smoothed out when spatially normalising data to reduce seasonal variations. The spatial normalisation approach assumes that there are at least 5% forest pixels within the spatial window, since pixels’ values are spatially normalised against the upper 5% tail [5]. Using data cubes with small spatial extent can lead to accurate mapping of deforestation events in areas with relatively small deforestation events. With small deforestation events, spatial normalisation is less likely to smooth out the impact of deforestation in the data. This is why deforestation events were mapped accurately at the Brazilian site even when using data cubes with small spatial extent.
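The smoothing effect described above can be illustrated with a toy example. The sketch assumes that spatial normalisation divides each value in a window by the 95th percentile of the valid values in that window; this is a simplified stand-in for the normalisation of [5], and the NDVI values, window sizes, and function name are hypothetical.

```python
import numpy as np

def normalise_window(window):
    """Divide a spatial window by its 95th percentile (NaNs ignored).

    Simplified, assumed form of spatial normalisation against the
    upper 5% tail of the values in the window.
    """
    p95 = np.nanpercentile(window, 95)
    return window / p95

forest_ndvi, cleared_ndvi = 0.8, 0.3  # hypothetical NDVI values

# Large window: a 9 x 9 pixel clearing surrounded by intact forest.
large = np.full((25, 25), forest_ndvi)
large[8:17, 8:17] = cleared_ndvi
norm_large = normalise_window(large)

# Small window lying entirely inside the clearing.
small = np.full((9, 9), cleared_ndvi)
norm_small = normalise_window(small)
```

In the large window the cleared pixels normalise to 0.3 / 0.8 = 0.375 and remain clearly separable from forest, whereas in the small window every pixel normalises to 1.0, so the deforestation signal disappears.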
False detections (low user’s accuracy) were particularly frequent for data cubes with large spatial extents because the thresholds calculated from data cubes with large spatial extents or longer temporal extents were relatively large. Such large thresholds can lead to many false detections because they might be too sensitive. Such sensitivity could also explain why deforestation events were typically detected with a shorter delay when using data cubes with either a large spatial extent or a longer temporal extent (Figure 8). Sensitive thresholds can also explain why increasing the temporal extent of the data cube at the Bolivian site led to accurate mapping of deforestation events. Increasing the temporal extent of the data cube did not affect the temporal detection delay at the Brazilian site (Figure 8h). This is mainly because, for each temporal extent, the optimal percentile at the Brazilian site was large (Figure 7).
Increasing the spatial extent had a greater influence on the optimal percentile at the Brazilian site than at the Bolivian site (Figure 7). This inter-site difference can be explained by the number of observations in the reference cube. Temporally, there were fewer images at the Brazilian site than at the Bolivian site (Figure 5). Optimal percentiles at the Brazilian site were therefore less likely to reach stability at smaller spatial extents because observations in the reference cube were few. In contrast, at the Bolivian site, the optimal percentile was more likely to reach stability at smaller spatial extents because many images were available, and additional information from spatial context was less likely to have a major influence on the optimal percentile.
Our method offers new opportunities to tackle challenges associated with existing methods for monitoring deforestation at sub-annual scales [3,4,5,6,29]. In particular, our method exploits both spatial and temporal information in satellite image time series to detect deforestation at a sub-annual scale, thus allowing us to analyse pixels which do not have many historical observations. Results from the two case studies indicate that our method is robust in detecting deforestation events at a sub-annual scale, even when the image time series only contains one year of historical observations (Figure 8). One year of historical observations is often too short to properly differentiate deforestation from normal forest dynamics, especially in forests that exhibit strong seasonality. By combining spatial and temporal information, we can use image time series of high spatial resolution satellite sensors (e.g., RapidEye), whose time series are short, to track small-scale forest disturbances (e.g., selective logging). Similarly, by exploiting spatiotemporal information in image time series, there is no need to wait for image time series from newly launched sensors (e.g., Sentinel-2) to lengthen before exploiting such data to detect deforestation events at sub-annual scales. Since our method remains robust in detecting deforestation at sub-annual scales even when the reference period only contains one year of data, users may not need to pre-process huge amounts of historical data when monitoring deforestation at sub-annual scales.
The method presented in this paper can be applied in different forest areas because it is a data-driven approach using thresholds computed from the data. However, it may face challenges in mixed forests, where deciduous and evergreen forests coexist at short distances. This is mainly because of the way we reduce seasonal variations in the data cube. Normalisation against P95t (Section 2.2) is not likely to reduce seasonal variations because P95t would represent evergreen trees in mixed stands. This limitation may be addressed by calculating P95t for each forest type separately and by deseasonalising pixels from each forest type using the corresponding P95t. Another limitation is related to how we treat the pixels whose historical observations also qualify as extremes although no deforestation has occurred.
We determined optimal percentiles for identifying deforestation at a sub-annual scale, but these percentiles might not be optimal for other areas with different forest types and processes causing deforestation. In areas with gradual changes [18], for example, smaller percentiles (5th percentile) might be preferable. Therefore, users should calibrate the method to identify optimal percentiles for their respective study areas before monitoring for deforestation at sub-annual scales. To do this, users should select sample pixels from both deforested and forested areas in their study areas, and apply the space-time change detection method while varying the percentile for deforestation detection (as in Section 3.2). Depending on the monitoring goal, for example a shorter temporal detection delay, the user can then select the percentile that achieves the shortest median temporal delay as the optimal percentile.
We tested several window sizes (spatial extents) of the data cube, but identifying a window size appropriate for different parts of the globe is still challenging. This is because a window size which is optimal in one area might not be optimal in another area. Prior knowledge on the size of the deforestation events that typically occur in a particular area can be used to decide on the spatial extent of the data cube. If such prior knowledge is lacking, the user should choose a spatial extent which is larger than the size of deforestation events the user aims to detect.
With the advent of open and free access to data from Sentinel sensors, especially Sentinel-1 and -2, detecting deforestation at small spatial scales within a few days of occurrence will be possible. Combining Sentinel and Landsat data will boost monitoring of deforestation at sub-annual scales, allowing agencies responsible for forest protection to intervene in a timely manner in areas where illegal deforestation events are occurring. However, such multi-source data would need harmonisation to produce multi-sensor time series that are temporally consistent.
6. Conclusions
In this paper, we demonstrated how spatial and temporal information can be combined and exploited to detect deforestation from satellite image time series at a sub-annual scale. We proposed a data-driven space-time change detection method that detects deforestation as an extreme event within a space-time data cube of satellite image time series. We detected sub-annual deforestation from Landsat NDVI time series at a dry tropical forest site, where the forest exhibits strong seasonality, and at a humid tropical forest site. The method remained robust in detecting deforestation events at a sub-annual scale even when the image time series only contained one year of historical observations. The space-time method we presented in this paper is a novel and robust approach for timely detection of deforestation events in areas where forests exhibit strong seasonality. It provides an opportunity to detect deforestation events using image time series with scarce historical observations. The method can be used in different types of forest, both evergreen and deciduous, but it may face a challenge in mixed forests, where deciduous and evergreen forests coexist at short distances. Although we used NDVI, the method is expected to be applicable to image time series of any satellite-derived metric that is used for deforestation monitoring. To further improve deforestation monitoring at a sub-annual scale, future research should investigate how data from different satellite sensors (e.g., Landsat 7 and 8, Sentinel-2, RapidEye, and SPOT) can be combined in a space-time change detection framework to facilitate near real-time deforestation detection.