Next Article in Journal
Modelling Impacts of Environmental Water on Vegetation of a Semi-Arid Floodplain–Lakes System Using 30-Year Landsat Data
Next Article in Special Issue
Quantifying Post-Fire Changes in the Aboveground Biomass of an Amazonian Forest Based on Field and Remote Sensing Data
Previous Article in Journal
Mismatching Removal for Feature-Point Matching Based on Triangular Topology Probability Sampling Consensus
Previous Article in Special Issue
A Comparative Assessment of Machine-Learning Techniques for Forest Degradation Caused by Selective Logging in an Amazon Region Using Multitemporal X-Band SAR Images
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Near Real-Time Change Detection System Using Sentinel-2 and Machine Learning: A Test for Mexican and Colombian Forests

by
Ana María Pacheco-Pascagaza
1,2,3,*,†,
Yaqing Gou
1,2,4,†,
Valentin Louis
1,2,
John F. Roberts
1,2,
Pedro Rodríguez-Veiga
1,2,
Polyanna da Conceição Bispo
1,2,3,
Fernando D. B. Espírito-Santo
1,
Ciaran Robb
5,
Caroline Upton
1,
Gustavo Galindo
6,
Edersson Cabrera
6,
Indira Paola Pachón Cendales
6,
Miguel Angel Castillo Santiago
7,
Oswaldo Carrillo Negrete
8,
Carmen Meneses
8,
Marco Iñiguez
8 and
Heiko Balzter
1,2
1
Centre for Landscape and Climate Research (CLCR), Space Park Leicester (SPL), School of Geography, Geology and Environment, University of Leicester, 92 Corporation Rd, Leicester LE4 5SP, UK
2
National Centre for Earth Observation, University of Leicester, Space Park Leicester, Corporation Road, Leicester LE4 5SP, UK
3
Department of Geography, School of Environment, Education and Development, University of Manchester, Manchester M13 9PL, UK
4
Laboratory of Geo-Information Science and Remote Sensing, Wageningen University & Research, Rijksweg 5, 6705 Wageningen, The Netherlands
5
UK Centre for Ecology and Hydrology, Environment Centre Wales, Deiniol Road Bangor, Gwynedd LL57 2UW, UK
6
Instituto de Hidrología, Meteorología y Estudios Ambientales (IDEAM), Calle 25 D, Bogotá 110911, Colombia
7
El Colegio de la Frontera Sur (ECOSUR), Carretera Panamericana y Periférico sur s/n, San Cristóbal de las Casas 29290, Mexico
8
Comisión Nacional Forestal (CONAFOR), Periférico Poniente 5360 San Juan de Ocotán, Jalisco 45019, Mexico
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Remote Sens. 2022, 14(3), 707; https://doi.org/10.3390/rs14030707
Submission received: 24 December 2021 / Revised: 24 January 2022 / Accepted: 28 January 2022 / Published: 2 February 2022

Abstract

:
The commitment by over 100 governments covering over 90% of the world’s forests at the COP26 in Glasgow to end deforestation by 2030 requires more effective forest monitoring systems. The near real-time (NRT) change detection of forest cover loss enables forest landowners, government agencies and local communities to monitor natural and anthropogenic disturbances in a much timelier fashion than the thematic maps that are released every year. NRT deforestation alerts enable the establishment of more up-to-date forest inventories and rapid responses to unlicensed logging. The Copernicus Sentinel-2 satellites provide operational Earth observation (EO) data from multi-spectral optical/near-infrared wavelengths every five days at a global scale and at 10 m resolution. The amount of acquired data requires cloud computing or high-performance computing for ongoing monitoring systems and an automated system for processing, analyzing and delivering the information promptly. Here, we present a Sentinel-2-based NRT change detection system, assess its performance over two study sites, Manantlán in Mexico and Cartagena del Chairá in Colombia, and evaluate the forest changes that occurred in 2018. An independent validation with very high-resolution PlanetScope (~3 m) and RapidEye (~5 m) data suggests that the proposed NRT change detection system can accurately detect forest cover loss (> 87%), other vegetation loss (> 76%) and other vegetation gain (> 71%). Furthermore, the proposed NRT change detection system is designed to be attuned using in situ data. Therefore, it is scalable to larger regions, entire countries and even continents.

1. Introduction

Tropical forests stabilize the world’s climate and protect biodiversity [1,2]. In the context of the UN Framework Convention on Climate Change and the Paris Agreement, the initiative on “Reducing Emissions from Deforestation and forest Degradation” (REDD+) aims to protect carbon stores, biodiversity and other ecosystem services [3,4]. One of the main requirements for the implementation of REDD+ is a robust and objective measurement, reporting and verification (MRV) system that is based on satellite technology [5]. Humans have changed the natural environment on a global scale to an unprecedented degree. The global changes in atmospheric chemistry and climate have led to a new geological era called the Anthropocene [6]. Land use change is the driver with the largest global impact on forest cover since three quarters of these lands have changed, thereby diminishing the productivity of 23% of global land surface [4,7,8].
Remote sensing has provided new tools and opportunities for monitoring land cover changes in recent decades. Research has focused on the generation of tree cover maps [9,10], forest aboveground biomass maps [11,12] and land cover change maps [13], as well as improving algorithms to increase their accuracy [10,13,14,15]. Many of these products are produced annually or with a fixed delivery schedule, which limits their usability for enabling timely interventions when unlicensed forest cover loss is detected. Additionally, current monitoring systems do not operate at spatial scales that are sufficient to detect smaller areas of deforestation [5,16]. Examples of such global products are the tree cover change maps from Global Forest Watch using Landsat imagery and the forest/non-forest maps produced from ALOS imagery by JAXA [9,10,16,17]. However, some regional studies have used near real-time (NRT) applications that help authorities to respond quickly to events (i.e., deforestation, degradation or extreme environmental events), enabling government agencies to manage the land more effectively and supporting local decision-making with timely information [16,18,19,20,21]. NRT change alerts have been used for other purposes than in forestry, such as flood mapping [22], active wildfire monitoring, natural disasters and food security [18,23,24]. Generally, they are using imagery of medium (30 m, Landsat) to coarse spatial resolution (1 km, MODIS). Most of them rely on spectral signatures to derive indices (i.e., vegetation, fires, etc.) that allow the observation of the changes being monitored [16].
The provision of freely available and timely satellite remote sensing data has grown exponentially in recent years. With the launch of the Copernicus Sentinel satellites by the European Space Agency (ESA), there is much greater potential for monitoring land cover change in NRT at high spatial resolution. Sentinel-2 provides multi-spectral (MS) imagery at a finer spatial and temporal resolution compared to Landsat 8 and 9. The 10 m spatial resolution enables Sentinel-2 to map even small plantations, clearances and wind throw in complex forest landscapes [15]. With a 5-day return period, Sentinel-2 can be used for NRT monitoring and has a higher chance of acquiring a cloud-free image than Landsat. Various papers have demonstrated the capability of Sentinel-2 data in detecting NRT forest change by combining different sensors [25,26,27], and particularly by using freely available cloud processing platforms such as Google Earth Engine [28].
Alongside the satellite data revolution, change detection methods have advanced dramatically in the fields of remote sensing, image processing and computer vision [29]. General review papers include critical syntheses of techniques that include image algebra, trend analysis, moving window calculations, dimensionality reduction, post-classification comparison and direct pixel classification, which may be used in isolation or in combination [30]. Many factors will affect the choice of the most appropriate method, such as the aims of the change detection, the temporal frequency of the observations and the level of existing knowledge about the change. Image differencing is the simplest change detection technique since it only involves the subtraction of one image from another [31,32]. Moving window methods provide a localized alternative to image algebra, although they require post-operation analysis to provide a change product. Additionally, moving windows require the identification of a scale that is coarser than the spatial resolution of the image, which can reduce the detectability of specific small-scale features in the imagery [33]. Dimensionality reduction techniques, such as principal components analysis (PCA), produce uncorrelated Eigen images, within which a prominent representation of the change may occur [34,35]. Further processing via the use of thresholds or statistical learning is required to extract thematic change from the Eigen values. Techniques such as multi-variate alteration detection [36] use canonical correlation analysis to produce a similar output to PCA in the sense that a multi-band image is produced where any patterns of change are prominent, but they aim to eliminate some of the short comings of PCA, such as noise and linear scaling sensitivities. As with the previously mentioned techniques, further analysis, such as image thresholding or statistical learning, is required to map the thematic change classes [34,35,36]. One advantage of the methods outlined so far is that they provide a quantitative representation of change. However, they require either manual or automated post-processing to map the change.
Trend analysis techniques aim to analyze the variation of pixel values over time, such that a change in values between images beyond the bounds of the expected variation constitutes a change. Notable examples of such studies include those using the “Breaks for additive seasonal and trend” (Bfast) algorithm [37,38,39,40,41] and the continuous change detection and classification of land cover algorithm. Post-classification comparison (PCC) is another widely used change detection method that produces thematic maps of change. It is generated from statistical learning techniques and applied manually or by a set of rules [42]. The direct classification of change through a chronologically stacked pair of images reduces error propagation and has the advantage of directly classifying the signatures of change in the data. One of the most notable recent publications that utilizes this technique is Global Forest Watch [9], where a time series of global forest cover change is being produced from the year 2000 onwards. It employs a direct pixel classification of change, followed by trend analysis to identify the time when the changes occurred. Nonetheless, an NRT system requires an immediate detection of change rather than an annual retrospective approach. This requires the direct classification of a chronologically stacked image pair.
In this study, we introduce a Sentinel-2-based near real-time (NRT) forest monitoring system that is based on an open-source Python library called Python for Earth Observation (PYEO v3.6) [43], which automates data downloading, processing, analysis, validation and change alert reporting and won the Copernicus Masters’ Award (Sustainable Living) in 2017. The change detection was conducted using a refined machine learning algorithm on a chronologically stacked pair of images. Reflectance bands of two image acquisitions were stacked into a bi-temporal image stack and the classifier was trained using the temporal and spectral information. To test the accuracy and scalability of the system, forest change detection was conducted during the 12 months from December 2017 to December 2018 in the Manantlán area of Mexico and Cartagena del Chairá in Colombia. The results were validated independently using PlanetScope data.

2. Materials and Methods

2.1. Near Real-Time (NRT) Change Detection System

The new NRT change detection system was designed to automate Sentinel-2 data downloading and processing, model training, image classification, the creation of change alerts and validation procedures. The Sentinel-2 satellite constellation consists of two satellites with a combined image acquisition frequency of 5 days with 10 m, 20 m and 60 m spatial resolutions and 13 spectral bands, from the visible spectrum to short-wave infrared [44].
This NRT change detection system was able to ingest all available Sentinel-2 bands. However, we only used the highest-resolution (10 m) bands to provide the maximum capacity to detect small-scale changes. The system was designed to detect changes in forest, as well as other vegetation, covering the nine classes as described in Table 1. Water bodies were masked out in the analysis.
Prior to the NRT change detection analysis, a baseline forest cover map was generated. The baseline map consisted of a forest/non-forest mask, resulting from applying the trained model to the identification of these two classes over a year before the monitoring period. If a pixel was classified as forest for most image acquisitions in the given year and was not deforested within the last 3 months of the period, that pixel was identified as “forest”. For the baseline, we did not consider the class “other vegetation”, so as to avoid false positives due to seasonality. Therefore, areas identified as other vegetation were classified as non-vegetation on the baseline map. Once the baseline map was defined, the NRT change detection system followed the steps shown in Figure 1.
(i)
A configuration file was generated to write all parameters required for the processing chain, such as login information for the Copernicus hub account, path to the Sen2cor installation for cloud masking, input and output paths for the processing, etc. Further details can be found at [43].
(ii)
The system monitored the ESA Copernicus Open Data Hub to find any new Sentinel-2 data acquisitions over the area of interest on a regular basis.
(iii)
Sentinel-2 imagery was acquired by an optical sensor, so the data were affected by atmospheric effects (i.e., clouds, cloud shadows, aerosols) and topographic effects (i.e., illumination angle, shadows from terrain). Masking out the pixels affected by noise and clouds was the key to reducing the omission errors in the change detection, especially for pan-tropical regions with frequent cloud cover and steep terrain. Therefore, we used a robust cloud and cloud shadow masking algorithm by combining the cloud mask from Sen2Cor (version 2.5.5) [47], to process the top of atmosphere (TOA) L1C image to the bottom of atmosphere (BOA) reflectance at level L2A [48], with the cloud mask generated by F-mask [49] and buffered by a user-defined pixel number. Temporal gaps in the time series corresponded to areas where the cloud cover was ≥ 80%. Therefore, those images were not processed due to the likelihood of generating false positives in the change detection.
(iv)
A cloud-free reference composite was generated using all available cloud-masked images from the composite-building time window for the selected area. (v) As soon as a new Sentinel-2 image was acquired and added to the Copernicus Open Access Hub, it was downloaded and processed using a pre-trained machine learning model to detect vegetation change compared to the reference image composite. The NRT element operated on the same principles as the time series data, whereby a chronologically ordered image stack, consisting of a cloud-free reference image and a newly acquired image, was processed for change detection. (vi) The last available cloud-free pixel before the current acquisition data was then used to detect change against the baseline composite. A binary mask was updated after every new image was ingested into the system, so if a pixel was identified as change, it would be eliminated from the next iteration of change detection. (vii) If visual interpretation was required for validating the resultant change maps, a set of sampling points was automatically generated in shapefile format. When a forest cover change was detected, it was archived and then disseminated in a user-friendly interactive format to an authorized server and/or registered mobile devices for further action.
In addition, this NRT system was designed to be applied to monitor local landscapes. Therefore, as vegetation types changes, it was necessary to select new AOIs representing these new vegetation types (i.e., rain forests vs. dry forests in the class “Forest” or bracken vs. crops in “Other Vegetation”). It was noticed in trials that collecting more training areas could benefit the performance and accuracy of the model. New models are generally trained over different regions to ensure that the model is adapted to local vegetation types and change patterns. If a new AOI has a distinct forest type or change pattern compared to existing AOIs, then the model will need to be updated. Moreover, we did not update the model for every new Sentinel-2 image, but rather regularly on an annual basis to achieve a better performance as we had more samples of different vegetation types represented in the same class (i.e., bracken vs. crops in “Other Vegetation”). Generally, keeping the model up to date was of importance since, for example, changes to data formats and the re-calibration of the satellite signal needed to be monitored and the models updated/changed accordingly.
The outputs consisted of: (i) a thematic map of stable and change classes; (ii) probability maps; (iii) a KMZ file with the forest cover change areas, with mean probabilities across all pixels in the contiguous polygon and their geometric properties; and (iv) quick look maps of the imagery used to detect the change for visual verification purposes (Figure 1C). The KMZ file could be displayed on any computer or mobile device and was based on Google Earth/Google Maps or integrated with axillary datasets to refine the detection results. The geospatial processing element used the PYEO v3.6 library [43]. The geospatial data outputs formed part of an integrated system that leverages mobile technology named Forest Sentinel, which won the Copernicus Masters’ Sustainable Living Challenge Award in 2017 (Figure 1B). The resulting forest cover change map and alerts were sent to a server and disseminated to computers or mobile devices by registered users who could then verify the alerts in the field, allowing users to add photos, voice recordings and survey information to the data.

2.2. NRT Change Detection System Performance Evaluation

We evaluated the performance and accuracy of our new NRT change detection system (Section 2.1) in two study areas in the tropics: Manantlán in Mexico and Cartagena del Chairá in Colombia (see Figure 2). The change detection was carried out with data acquired over one year, from December 2017 to December 2018.

2.2.1. Study Regions

Manantlán is located in the western region of north-central Mexico, spanning the states of Jalisco and Colima. Within the region is the biosphere reserve Sierra de Manantlán, which is one of the most important natural mountain reserves in Mexico due to its biological richness, extent, environmental services and water resources. The forest vegetation is a mixture of temperate and cloud forests. Shrubs, grassland and agricultural areas are also present [50]. The forests of Manantlán have been subject to deforestation and forest fragmentation due to extensive cattle ranching and the timber extraction of oak and pine [46,51]. In addition, illegal logging and gold mining have affected the area significantly in the last decade [52,53]. Forest fires are a frequent phenomenon in Manantlán, and about 18,000 ha were affected in 2017. Most were surface fires that mainly affected leaf litter and shrub vegetation. Consequently, they did not contribute significantly to deforestation. However, in cases where the fires have affected the canopy, they could be identified as deforestation [54,55].
Cartagena del Chairá is located in the Colombian Amazon, in the Caquetá department. This region is covered by tropical moist broadleaf forest [56] and is a transition between the Orinoquia savannahs and Amazonia. The region is known for having the highest biological diversity in the country, holding a great variety of flora and fauna [57,58]. Land cover is characterized by undisturbed and natural forests that are subject to some human intervention, shrubs and grasslands, which are mainly for cattle ranching [59]. The Caquetá Department historically has the largest deforestation rate in Colombia. Cartagena del Chairá presented the largest deforestation rate in 2015 (23,812 ha deforested), which was equivalent to 19% of the national deforestation [60,61], and the second largest in 2017 (22,591 ha deforested), which was equivalent to 10.3% of the deforested area in the country [62]. The main deforestation driver is the conversion of forest to pastures for cattle ranching and agriculture. At a lower intensity, unlicensed logging, crop production and mining also contribute [56,61,63,64,65]. Forest fires may occur during the conversion of forest to facilitate logging [61,66]. For 2018, this region reported 17,719 ha of burned areas, 8266 ha of which were identified as forests. In Manantlán, we used 56 cloud-free images covering 93,153 ha and in Cartagena del Chairá, we used 87 images covering 31,922 ha.

2.2.2. Change Detection Algorithm

A wide range of machine learning algorithms is currently available. These algorithms have different strengths and weaknesses, but the success of their application relies on the relevant specific circumstances and data characteristics [67,68,69]. Nonlinear models have emerged as effective solutions for many real-world classification problems, such as nonlinear support vector machines (nl-SVM) and ensemble methods, such as random forest (RF) and gradient boosting (GB).
Amongst these, RF [70] was deemed the most suitable method due to it providing the best compromise between model accuracy and processing speed. Both SVM and gradient boosting (GB) algorithms are close to RF in their overall accuracy, but they take longer to optimize the parameters via the grid search cross-validation due to limited parallelization. We selected RF due to its easy parallelization and computational speed in comparison to SVM. Deep neural networks (DNN), which have gained increasing attention in the recent machine learning literature, may offer a slight increase in accuracy. However, the computational demand for the training and model calibration using DNN is significantly higher than the RF and, thus, is impractical for a model that has to be regularly updated. Still, our system was built to be flexible and could be adapted to other any classifier (i.e., SVM, DNN) based on specific research needs.
RF is an ensemble classifier, where a group of typically deep CART classifier probabilities are averaged to predict the most likely class label of a pixel or object [70]. Each tree within the ensemble employs random feature selection and bootstrap sampling with a replacement for every split of the CART tree, followed by the averaging of the probabilities across all trees, which aims to mitigate against high variance [70]. For a more detailed explanation of the underlying mathematics of RF, see [70] and [71]. Random forests also offer some interpretability, such as the “out of bag error” and “feature importance” measures, allowing some insight into the inner workings of a particular model [70].
A general rule of thumb with ensemble methods is the more trees, the more accurate the model; although, the error rate may eventually increase beyond a certain number of trees. Hence, for an increasing number of decision trees, there is normally a point of diminishing return that can be approximated to minimize computation time, which is an important factor in a near real-time system. In order to reduce the number of model fits in the subsequent grid search cross-validation, the number of trees was first tested using the out of bag error rate (OOB). This allowed for the fixing of the number of decision trees prior to further hyper-parameter tuning [72].
We calibrated the RF models using training samples from the different land cover change classes (see Table 2) using an exhaustive grid search method where the data were subset into five “folds”, with a different fold retained during each iteration to test the model’s accuracy and the remainder used to train the model. The contents of each fold were stratified to ensure that they each contained the same percentage of classes as the entire training set. Every possible input parameter combination within the following range was run for each test fold, with the number of decision trees fixed at 500 and 300 for Colombia and Mexico, respectively. The parameter combination that was the most accurate on average across the folds was chosen for scene classification. The models were optimized using the average accuracy score across the folds. As with scene classification, the change detection used a random forest classier that was trained using the exhaustive methods described above. These were generated using Geospatial Learn [73], which itself in turn uses Scikit-Learn methods [74] applied to geospatial data.

2.2.3. Targeted Change Classes and Training Data Collecting Protocol

As a supervised classification method, the RF machine-learning model requires a large training dataset for each defined class. For our test areas, we selected six generic categories representing forest-related land cover change based on Table 1. However, from the nine classes in this application of the NRT change detection system, we discarded the classes “Forest > Other Vegetation”, “Other Vegetation > Forest” and “Non-vegetation > Forest” because we did not find any examples of these changes. That was expected, considering that this is study used a short time series and these changes occur over a longer period.
Training samples consisted of polygons created by visually digitizing the different classes of study (see Table 1). The samples were collected by examining bi-temporal image stacks for the entire archive of Sentinel-2 data. The training polygons were collected from five to six image pairs over a year to train a more generic model for the different seasons for each window area. Once a sufficient coverage across the AOI and TOI (time of interest) was produced, the polygons/samples were split into training and calibration data using the stratified random split approach. Each pixel within the extent of the training polygons was used as one training data point. A forest baseline map was used to constrain the “Forest > Non-vegetation” class to previously forested areas. The forest baseline map was generated by applying the trained model to the 2017 time series data. A pixel that was classified as forest for most image acquisitions in 2017 and was not deforested within the last 3 months of 2017, was identified as “Forest”.

2.2.4. Validation Strategy and Accuracy Assessment

A stratified random sampling strategy was employed for validation [75,76]. We validated the total change over the year of analysis so as not to include seasonal variations. Therefore, training samples for each class were generated based on two cloud-free composites that were one year apart. Validation points for each stratum were generated randomly in proportion to the area of that class, with an emphasis on the change classes. We ensured that the standard error of the estimated overall accuracy was below 2% and that, for each stratum, the expected user accuracy was over 70% for the change classes and over 90% for the stable classes (Appendix A, Table A2). With this desired standard deviation for overall accuracy and user accuracy, the proposed validation sampling design is given in Table 2. A Python library was used to automatically generate the validation points based on this approach [43].
Cloud-free composites were generated for a historical baseline mapping period and the monitoring periods. Change detection was applied to the cloud-free composites and then validated. We also compared the NRT change detection accuracies to the monthly products generated from cumulative change detection with a fixed baseline (see training points in Table 2) [20]. PlanetScope image pairs covering the same time period were used for validation [77]. The individual CubeSats of the PlanetScope constellation images had four spectral bands (red, green, blue and near-infrared) with a spectral bandwidth of 60–90 nm and a spatial resolution of 3–4 m. Validation was conducted by overlaying the validation points with the PlanetScope images, visually interpreting the very high-resolution PlanetScope images and assigning the identified change class.
The user, producer and overall accuracies were derived from the confusion matrix to assess the accuracy of the resulting change maps. The user accuracy (Equation (1)) represented the probability that a pixel classified as a given category was a true representation of that category when compared to the validation data. The producer accuracy (Equation (2)) represented how well the reference pixels for the given class were assigned to pixels on the map, i.e., the number of real changes that were correctly detected by the algorithm (see Table 2). The overall accuracy was the percentage of correctly classified pixels across all classes.
U i = p i i P i ·  
P i = p j j P · j  
O = j = 1 q p j j  
where U i and P i represent the user and producer accuracy of class i and O is the overall accuracy among all targeted classes. p i i is the number of validation points that are in class p i · on the classified map to calculate the omission error. p j j is the validation points of the reference class p j in the validation dataset used to calculate the commission error [75,76].

3. Results

3.1. Forest Change Alerts

From the historical observations for 2018 in the study area, the change detection identified 1,200,000 deforestation pixels in Manantlán and 50,000 in Cartagena del Chairá. In the Manantlán window area, deforestation events mainly happened in the dry season between December and March (Figure 3, Mexico).
In total, 0.4% of the forested area in 2017 was disturbed in 2018, with 91% of the detected changes being below 0.04 ha in size and 0.9% of deforested areas being covered by vegetation. A similar trend was observed in the Cartagena del Chairá window area, where deforestation events mainly happened in the dry season between January and March (Figure 3, Colombia). The forest disturbance rate was higher in Cartagena del Chairá than in Manantlán, with 2.2% of forested area being disturbed in 2018. The Colombian study area was smaller than the Mexican area. The size distribution of deforestation events in Cartagena del Chairá showed more smaller disturbances than in Manantlán, where deforested areas were larger in size (Figure 4). In total, 90% of the detected changes were below 0.18 ha and 0.9% of the deforested areas were covered by vegetation.
Even though this NRT system recorded change alerts as and when a new image was acquired, in Figure 5 we present the total change for all classes for 2018 in both window areas. In Mexico, most changes occurred near settlements and areas of active land use. We observed changes in regrowth and the loss of vegetation other than forest, as well as deforestation in areas of pristine forest (zoom window in Figure 5A). In Colombia, the study area was not close to big settlements. Therefore, the forest loss occurred in the natural Amazonian forest and the changes in other vegetation classes followed land management practices, such as crop growth and harvesting (Figure 5B).

3.2. Change Detection Accuracy

The results from the random forest algorithm with the trained forest cover change model achieved an overall accuracy of 92.5% for the automated deforestation alerts from Sentinel-2. Table 3 shows the estimated user, producer and overall accuracies for each change class. The accuracies were sufficiently high to make the PYEO forest cover change detection system useful for local, regional or national forest monitoring for measurement, reporting and verification (MRV) purposes under the REDD+ initiative (Reducing Emissions from Deforestation and Forest Degradation in Developing Countries). The system showed higher accuracies for detecting stable land cover classes and sharp changes, such as forest loss, while there were lower accuracies (65–80%) in the detection of the changes in other vegetation. The lowest accuracy (42%) was observed in the class of non-vegetated areas changing to other vegetation in the Manantlán window area, mainly in the steep mountains. We also found that the omission/commission errors corresponded to broader classes relating to other vegetation and non-vegetated areas. These errors were related to brighter parts of the images due to atmospheric haze.

4. Discussion

The main advantages of the PYEO forest alerts system using Sentinel-2 data over other forest monitoring systems are the timeliness and speed of detection being within hours after a new image becomes available. The system was designed with the speed of detection in mind so that forest managers are able to intervene in forest cover loss events much more quickly than before. Sentinel-2 was chosen as the satellite for the change detection because it provides a new image every 5 days over all global land areas, which gives it an advantage over Landsat 8 and 9. The Sentinel-2 data are also open-access. In addition, Sentinel-2 has a 10 m resolution in the red, green, blue and NIR bands, which means that the images show small-scale forest loss events, down to the identification of individual large tropical trees being logged when their crown cover is larger than a Sentinel-2 pixel, i.e., 100 m2. The PYEO software package can be adapted to be applied regionally or even globally, although it was not created to generate global forest cover change maps. Instead, it was designed with the user in mind. Hence, the software allows a bottom up, regional approach to change detection based on locally trained machine learning models with user-specified classes of change between land cover types. This gives the user greater control over the definitions of change classes of interest and makes the system very flexible and adaptive. For example, it can be adapted to national or regional forest definitions. This change detection system was based on forest cover loss, as well as the change detection of other change classes that are useful for identifying the transition between other land cover classes.
In this paper, we tested the PYEO NRT forest alert system in two different ecoregions of the tropics: Manantlán in Mexico and Cartagena del Chairá in Colombia. Both areas have been identified as having high deforestation rates over the past years. Nonetheless, they have different ecosystem conditions, as well as different deforestation drivers and patterns. For instance, Manantlán is in a cloud forest in a mountainous region, where the biggest contributions to deforestation are forest fires, agriculture practices, livestock grazing, pine timber extraction and gold mining. In addition, the area presents forest degradation due to illegal logging and gold mining [52,53]. On the other hand, Cartagena del Chairá is on an alluvial plain, where the conversion of forest land to pastures for cattle ranching, as well as illicit agriculture and mining, is rapidly and indiscriminately changing the ecosystems [1,2]. Nonetheless, both areas presented high accuracies for the detected land cover change classes, and particularly high accuracies (>87%) for detected deforestation, for which the model was trained.
Producing a time series change detection analysis using optical data will always make it difficult to maintain the frequency of the detections of the stable classes, particularly in tropical forest. In our study regions, we observed that Manantlán was more affected by clouds on a yearly basis since 11% of the area was permanently covered by clouds, while only 0.65% of the area at Cartagena del Chairá could not be observed during the period of study.
We compared the PYEO forest loss alerts in 2018 to the Global Forest Watch (GFW) tree cover loss data [9], which is the most established operational global forest monitoring system. There are marked differences between both systems. Firstly, the GFW was designed to detect abrupt changes in forested pixels (deforestation), whilst the PYEO system classifies change between land cover classes directly, not only forest gain and loss but other types of change as well. Secondly, the GFW uses 30 m resolution Landsat images whilst PYEO is based on 10 m Sentinel-2 data. Therefore, it is able to identify a much greater number of small areas of probable forest cover loss, as observed in Figure 6.
While it is possible that some of the very small forest cover loss detections could be false alarms, this comparison demonstrates the added information content in the higher spatial resolution images of 10 m instead of 30 m. In addition, the finer spatial resolution from Sentinel-2 enables a better detection of forest degradation, which can be defined as a “state of anthropogenically induced arrested succession, where ecological processes that underlie forest dynamics are diminished or severely constrained” [74]. While forest cover loss data do not allow for any direct inferences about anthropogenic or natural causes, they do enable the detection of areas in which ecological processes in a forest are beginning to be disturbed. GFW was designed to automatically identify changes based on spectral differences from a global training sample database [16]. PYEO was designed to be trained on local conditions and, although it can be used globally in principle, it was not designed to produce global maps.
Comparing the forest loss reported by the GFW system and the PYEO alert system for 2017, the GFW reported 67,373 deforested pixels (6635 ha) for the Colombia window area whilst PYEO resulted in 156,629 smaller pixels (1566 ha). Hence, GFW reported nearly four times more deforested area than our system. For Mexico, the PYEO alert system identified 29,748 pixels (292.48 ha), about 13 ha more than the 3097 pixels from GFW (278.73 ha). This difference could be associated with the spatial resolution and the fact that we included the class “Other Vegetation”, which could be identified by GFW as forest. In Colombia, we identified 437,147 pixels (4371 ha) classified as “Other Vegetation > Non-vegetation”. For Mexico, the class “Other Vegetation > Non-vegetation” was almost non-existent whilst for Colombia, that class covered a large area (4371 ha), as seen in Figure 5.
In developing, testing and applying the PYEO forest alert system, we identified some current limitations of its application. Firstly, seasonally dry tropical forest poses a challenge to training a machine learning model from Sentinel-2 image pairs due to the strong influence of the vegetation seasonality on the spectral reflectance. Secondly, the spectral reflectance in the visible and near-infrared domain is also not very good at distinguishing between forest plantations and natural forest. Thirdly, during the processing of the Sentinel-2 images, due to the fully automated nature of the processing chain, sometimes residual atmospheric effects, such as haze, cloud fringes and cloud shadows, can remain in the images and lead to spurious change detections. The timeliness of the forest alerts depends on the availability of cloud-free pixel observations; hence, the 5-day updates are the best-case scenario that is achievable from free and open data. Finally, to train a good machine learning model requires the representative sampling of the landscape areas, which requires substantial skill and effort on the part of the image interpreter. As a result, the uncertainty of the estimates or false negative detections could be reduced.

5. Conclusions

This paper presented a NRT change detection system based on a Sentinel-2 monitoring system. It was developed to detect deforestation events, as well as changes in other land cover classes, at 10 m spatial resolution in a fully automated manner and using a bottom up approach. The system was based on the Python package “PYEO” (see the Supplementary Material for GitHub location). It is flexible and can be adapted to any classifier (i.e., RF, SVM, etc.) based on the user requirements. In addition, it is thought to be useful for all landscapes since it is built on a local training approach, which can be adapted to local landscapes. Nonetheless, local training data need to be collected to replicate the model. We presented the application of the NRT alert system to two window areas in Mexico and Colombia. The results showed that deforestation detections based on this system achieved user accuracies of 97% and 99% for the two window areas, respectively.
This system is highly useful as an automated forest monitoring system, where frequent information is required to improve forest governance. The forest alerts enable the frequent monitoring of deforestation events from space. Hence, it can provide vital information for forest managers and governments regarding the disturbances in their forest land and enable rapid interventions. In this way, the system can contribute to the implementation of the COP26 goal to end deforestation worldwide by 2030.

Author Contributions

Conceptualization, C.R., Y.G., V.L., J.F.R., P.R.-V., A.M.P.-P. and H.B.; methodology, Y.G., A.M.P.-P., V.L, P.R.-V., J.F.R. and H.B; formal analysis, Y.G., A.M.P.-P. and V.L.; validation, A.M.P.-P.; original draft preparation, A.M.P.-P., Y.G., H.B. and V.L.; supervision, H.B.; writing—reviewing and editing, A.M.P.-P., Y.G., V.L., P.R.-V., J.F.R., H.B., C.R., P.d.C.B., F.D.B.E.-S., C.U., G.G., E.C., I.P.P.C., M.A.C.S., O.C.N., C.M. and M.I. All authors have read and agreed to the published version of the manuscript.

Funding

This research was part of the Forest 2020 project funded by the Global Challenges Research Fund for the UK Space Agency’s International Partnership Program, Forest 2020 project, within the frameworks of the Earth and Sea Observation System (EASOS) Malaysia project and the Forests 2020 project. This work was also supported by the Natural Environment Research Council’s National Centre for Earth Observation (NCEO).

Data Availability Statement

Github location: Pyeo https://github.com/clcr/pyeo (accessed on 1 October 2021).

Acknowledgments

We acknowledge to the Global Challenges Research Fund and the UK Space Agency’s International Partnership Program to fund the Forests 2020 project. Also supported by EASOS and the National Centre for Earth Observation (NCEO). Forest Sentinel was supported by NERC “REDD+ Monitoring Services with Satellite Earth Observation” (NE/N017021/1). All data were processed on the ALICE high performance-computing cluster managed by the University of Leicester. We thank all partners within the projects for their collaboration, data sharing and infrastructure. We also thank the European Commission Copernicus and Planet Team for the free and open data.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Model Calibration Results

A fixed number of trees for scene models and change models was established at 500, where OOB errors level out.
Whilst log2 initially appeared to be more accurate, the “number of features” parameter changed in effectiveness during the more rigorous cross-validated grid search. The scene map and change map models consisted of the following hyper-parameters and internal average accuracies in Figure A1 (Appendix A).
Figure A1. The fine tuning of the RF model. OOB is the error rate for change detection measured for the no. of trees (n_estimators), log2 and square root of the number of features choice for each tree node split.
Figure A1. The fine tuning of the RF model. OOB is the error rate for change detection measured for the no. of trees (n_estimators), log2 and square root of the number of features choice for each tree node split.
Remotesensing 14 00707 g0a1
Table A1. The cross-validated grid search results for the hyper-parameters of the scene and change models.
Table A1. The cross-validated grid search results for the hyper-parameters of the scene and change models.
ValueScene ModelChange Model
Number of trees500500
Split criterionginigini
Maximum tree depthNone–increased until pure10
Maximum featuressquare rootsquare root
Minimum leaf samples 55
Minimum split samples 22
It is notable that the optimal parameters are only similar to the tree depth, differing between scene and change models. The feature importance is plotted for each of the image bands used to build the scene and RF change models, which are based on the mean information gain (gini impurity or entropy) per split across the trees. The most important features in the segregating vegetation and non-vegetation classes that were analyzed in this study are blue, green, red edge, third near infra-red (NIR3) and short-wave infra-red bands (Figure A2 Appendix A).
Figure A2. The feature importance calculated from the RF scene model. The numerical designations per band indicate where a wavelength area (e.g., red or NIR) is subdivided into smaller bands.
Figure A2. The feature importance calculated from the RF scene model. The numerical designations per band indicate where a wavelength area (e.g., red or NIR) is subdivided into smaller bands.
Remotesensing 14 00707 g0a2
For the change model, the “before” green, “after” green and red band features have the highest importance values, suggesting that they have the most discriminative power for the classes in the change model (Figure A3).
Figure A3. The feature importance calculated from the RF change model.
Figure A3. The feature importance calculated from the RF change model.
Remotesensing 14 00707 g0a3
Such changes are in line with the expectations of the input change imagery, where there will be data groupings associated with the changes in the green values from both time slices as vegetation is removed. The feature importance of red band 2 may be due to the exposure of bare soil cover in the “after image”, represented as a sharp increase in red values. The remaining features are of similar importance, though the NIR band 2 feature is of greater importance than the remaining bands.

Appendix A.2. Validation

Table A2. The expected user accuracy.
Table A2. The expected user accuracy.
ClassesExpected User’s Accuracy
1Stable Forest0.9
2Forest > Other Vegetation0.7
3Forest > Non-vegetation0.7
4Stable Other Vegetation0.8
5Other Vegetation > Non-vegetation0.7
6Non-vegetation > Forest0.7
7Stable Non-vegetation0.9
8Non-vegetation > Other Vegetation0.7
9Forest > Other Vegetation0.7
10Water Bodies0.9

References

  1. Putz, F.E.; Blate, G.M.; Redford, K.H.; Fimbel, R.; Robinson, J. Tropical Forest Management and Conservation of Biodiversity: An Overview. Conserv. Biol. 2001, 15, 7–20. [Google Scholar] [CrossRef] [Green Version]
  2. Mitchard, E.T. The Tropical Forest Carbon Cycle and Climate Change. Nature 2018, 559, 527–534. [Google Scholar] [CrossRef] [PubMed]
  3. FAO. The State of the World’s Forests 2018: Forest Pathways to Sustainable Development; FAO: Rome, Italy, 2018. [Google Scholar]
  4. FAO; UNEP. The State of the World’s Forests 2020: Forests, Biodiversity and People; FAO: Rome, Italy, 2020. [Google Scholar]
  5. Mitchell, A.L.; Rosenqvist, A.; Mora, B. Current Remote Sensing Approaches to Monitoring Forest Degradation in Support of Countries Measurement, Reporting and Verification (MRV) Systems for REDD+. Carbon Balance Manag. 2017, 12, 9. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  6. Crutzen, P.J.; Ehlers, E.; Krafft, T. Earth System Science in the Anthropocene; Springer: Berlin/Heidelberg, Germany, 2006; pp. 13–18. [Google Scholar]
  7. Díaz, S.; Settele, J.; Brondízio, E.; Ngo, H.; Guèze, M.; Agard, J.; Arneth, A.; Balvanera, P.; Brauman, K.; Butchart, S. Summary for Policymakers of the Global Assessment Report on Biodiversity and Ecosystem Services of the Intergovernmental Science-Policy Platform on Biodiversity and Ecosystem Services; IPBES: Bonn, Germany, 2020. [Google Scholar]
  8. UN. The Sustainable Development Goals Report 2019; UN: New York, NY, USA, 2019. [Google Scholar]
  9. Hansen, M.C.; Potapov, P.V.; Moore, R.; Hancher, M.; Turubanova, S.A.A.; Tyukavina, A.; Thau, D.; Stehman, S.V.; Goetz, S.J.; Loveland, T.R. High-Resolution Global Maps of 21st-Century Forest Cover Change. Science 2013, 342, 850–853. [Google Scholar] [CrossRef] [Green Version]
  10. Shimada, M.; Itoh, T.; Motooka, T.; Watanabe, M.; Shiraishi, T.; Thapa, R.; Lucas, R. New Global Forest/Non-Forest Maps from ALOS PALSAR Data (2007–2010). Remote Sens. Environ. 2014, 155, 13–31. [Google Scholar] [CrossRef]
  11. Rodríguez-Veiga, P.; Barbosa-Herrera, A.P.; Barreto-Silva, J.S.; Bispo, P.C.; Cabrera, E.; Capachero, C.; Galindo, G.; Gou, Y.; Moreno, L.M.; Louis, V. Mapping the Spatial Distribution of Colombia’s Forest Aboveground Biomass Using SAR and Optical Data. In Proceedings of the 10 Joint Workshop “Multidisciplinary Remote Sensing for Environmental Monitoring”, Kyoto, Japan, 12–14 March 2019. [Google Scholar]
  12. Santoro, M.; Cartus, O.; Carvalhais, N.; Rozendaal, D.; Avitabilie, V.; Araza, A.; de Bruin, S.; Herold, M.; Quegan, S.; Rodríguez Veiga, P. The Global Forest Above-Ground Biomass Pool for 2010 Estimated from High-Resolution Satellite Observations. Earth Syst. Sci. Data Discuss. 2021, 13, 3927–3950. [Google Scholar] [CrossRef]
  13. Brink, A.B.; Eva, H.D. Monitoring 25 Years of Land Cover Change Dynamics in Africa: A Sample Based Remote Sensing Approach. Appl. Geogr. 2009, 29, 501–512. [Google Scholar] [CrossRef]
  14. Petit, C.; Lambin, E.F. Integration of Multi-Source Remote Sensing Data for Land Cover Change Detection. Int. J. Geogr. Inf. Sci. 2001, 15, 785–803. [Google Scholar] [CrossRef]
  15. Nomura, K.; Mitchard, E.T. More than Meets the Eye: Using Sentinel-2 to Map Small Plantations in Complex Forest Landscapes. Remote Sens. 2018, 10, 1693. [Google Scholar] [CrossRef] [Green Version]
  16. Hansen, M.C.; Krylov, A.; Tyukavina, A.; Potapov, P.V.; Turubanova, S.; Zutta, B.; Ifo, S.; Margono, B.; Stolle, F.; Moore, R. Humid Tropical Forest Disturbance Alerts Using Landsat Data. Environ. Res. Lett. 2016, 11, 034008. [Google Scholar] [CrossRef]
  17. Martone, M.; Rizzoli, P.; Wecklich, C.; González, C.; Bueso-Bello, J.-L.; Valdo, P.; Schulze, D.; Zink, M.; Krieger, G.; Moreira, A. The Global Forest/Non-Forest Map from TanDEM-X Interferometric SAR Data. Remote Sens. Environ. 2018, 205, 352–373. [Google Scholar] [CrossRef]
  18. Vargas, C.; Montalban, J.; Leon, A.A. Early Warning Tropical Forest Loss Alerts in Peru Using Landsat. Environ. Res. Commun. 2019, 1, 121002. [Google Scholar] [CrossRef]
  19. Wheeler, D.; Hammer, D.; Kraft, R.; Steele, A. Satellite-Based Forest Clearing Detection in the Brazilian Amazon: FORMA, DETER, and PRODES; World Resources Institute: Washington, DC, USA, 2014. [Google Scholar]
  20. Perbet, P.; Fortin, M.; Ville, A.; Béland, M. Near Real-Time Deforestation Detection in Malaysia and Indonesia Using Change Vector Analysis with Three Sensors. Int. J. Remote Sens. 2019, 40, 7439–7458. [Google Scholar] [CrossRef]
  21. Reiche, J.; Mullissa, A.; Slagter, B.; Gou, Y.; Tsendbazar, N.-E.; Odongo-Braun, C.; Vollrath, A.; Weisse, M.J.; Stolle, F.; Pickens, A. Forest Disturbance Alerts for the Congo Basin Using Sentinel-1. Environ. Res. Lett. 2021, 16, 024005. [Google Scholar] [CrossRef]
  22. De Groeve, T.; Kugler, Z.; Brakenridge, G.R. Near Real Time Flood Alerting for the Global Disaster Alert and Coordination System. In Proceedings of the ISCRAM 2007, Delft, The Netherlands, 13–16 May 2007; pp. 33–39. Available online: https://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.109.5899&rep=rep1&type=pdf (accessed on 20 March 2020).
  23. Bessis, J.-L.; Bequignon, J.; Mahmood, A. The International Charter “Space and Major Disasters” Initiative. Acta Astronaut. 2004, 54, 183–190. [Google Scholar] [CrossRef]
  24. Ross, K.; Brown, M.E.; Verdin, J.P.; Underwood, L. Review of FEWS NET Biophysical Monitoring Requirements. Environ. Res. Lett. 2009, 4, 024009. [Google Scholar] [CrossRef] [Green Version]
  25. Sothe, C.; de Almeida, C.M.; Liesenberg, V.; Schimalski, M.B. Evaluating Sentinel-2 and Landsat-8 Data to Map Sucessional Forest Stages in a Subtropical Forest in Southern Brazil. Remote Sens. 2017, 9, 838. [Google Scholar] [CrossRef] [Green Version]
  26. Pulvirenti, L.; Squicciarino, G.; Fiori, E.; Fiorucci, P.; Ferraris, L.; Negro, D.; Gollini, A.; Severino, M.; Puca, S. An Automatic Processing Chain for Near Real-Time Mapping of Burned Forest Areas Using Sentinel-2 Data. Remote Sens. 2020, 12, 674. [Google Scholar] [CrossRef] [Green Version]
  27. Reiche, J.; Verhoeven, R.; Verbesselt, J.; Hamunyela, E.; Wielaard, N.; Herold, M. Characterizing Tropical Forest Cover Loss Using Dense Sentinel-1 Data and Active Fire Alerts. Remote Sens. 2018, 10, 777. [Google Scholar] [CrossRef] [Green Version]
  28. Gorelick, N.; Hancher, M.; Dixon, M.; Ilyushchenko, S.; Thau, D.; Moore, R. Google Earth Engine: Planetary-Scale Geospatial Analysis for Everyone. Remote Sens. Environ. 2017, 202, 18–27. [Google Scholar] [CrossRef]
  29. Remedios, J.; Balzter, H.; Burrows, J.; Eves, S.; Johnson, M.; Lavender, S.; Monks, P.; O’Neill, A.; Shepherd, A. Earth Observation: A Revolutionary Leap into the Future. Astron. Geophys. 2012, 53, 3–16. [Google Scholar] [CrossRef] [Green Version]
  30. Tewkesbury, A.P.; Comber, A.J.; Tate, N.J.; Lamb, A.; Fisher, P.F. A Critical Synthesis of Remotely Sensed Optical Image Change Detection Techniques. Remote Sens. Environ. 2015, 160, 1–14. [Google Scholar] [CrossRef] [Green Version]
  31. Cios, K.; Kurgan, L. Advanced Techniques in Knowledge Discovery and Data Mining; Springer: London, UK, 2005. [Google Scholar]
  32. Moser, G.; Zerubia, J. Mathematical Models for Remote Sensing Image Processing; Springer: Berlin/Heidelberg, Germany, 2018; ISBN 3-319-66330-5. [Google Scholar]
  33. Wang, X.; Kruger, U.; Irwin, G.W. Process Monitoring Approach Using Fast Moving Window PCA. Ind. Eng. Chem. Res. 2005, 44, 5691–5702. [Google Scholar] [CrossRef]
  34. Brown, J. Choosing the Right Number of Components or Factors in PCA and EFA. JALT Test. Eval. SIG Newsl. 2009, 13, 19–23. [Google Scholar]
  35. Achour, S.; Chikr Elmezouar, M.; Taleb, N.; Kpalma, K.; Ronsin, J. A PCA-PD Fusion Method for Change Detection in Remote Sensing Multi Temporal Images. Geocarto Int. 2020, 37, 196–213. [Google Scholar] [CrossRef]
  36. Nielsen, A.A.; Conradsen, K. Multivariate Alteration Detection (MAD) in Multispectral, Bi-Temporal Image Data: A New Approach to Change Detection Studies; Technical University of Denmark: Lyngby, Denmark, 1997. [Google Scholar]
  37. DeVries, B.; Decuyper, M.; Verbesselt, J.; Zeileis, A.; Herold, M.; Joseph, S. Tracking Disturbance-Regrowth Dynamics in Tropical Forests Using Structural Change Detection and Landsat Time Series. Remote Sens. Environ. 2015, 169, 320–334. [Google Scholar] [CrossRef]
  38. Herold, M.; Román-Cuesta, R.M.; Mollicone, D.; Hirata, Y.; Van Laake, P.; Asner, G.P.; Souza, C.; Skutsch, M.; Avitabile, V.; MacDicken, K. Options for Monitoring and Estimating Historical Carbon Emissions from Forest Degradation in the Context of REDD+. Carbon Balance Manag. 2011, 6, 13. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  39. Pratihast, A.K.; DeVries, B.; Avitabile, V.; De Bruin, S.; Herold, M.; Bergsma, A. Design and Implementation of an Interactive Web-Based near Real-Time Forest Monitoring System. PLoS ONE 2016, 11, e0150935. [Google Scholar] [CrossRef] [Green Version]
  40. Verbesselt, J.; Hyndman, R.; Newnham, G.; Culvenor, D. Detecting Trend and Seasonal Changes in Satellite Image Time Series. Remote Sens. Environ. 2010, 114, 106–115. [Google Scholar] [CrossRef]
  41. Masiliūnas, D.; Tsendbazar, N.-E.; Herold, M.; Verbesselt, J. BFAST Lite: A Lightweight Break Detection Method for Time Series Analysis. Remote Sens. 2021, 13, 3308. [Google Scholar] [CrossRef]
  42. Congalton, R.G.; Gu, J.; Yadav, K.; Thenkabail, P.; Ozdogan, M. Global Land Cover Mapping: A Review and Uncertainty Analysis. Remote Sens. 2014, 6, 12070–12093. [Google Scholar] [CrossRef] [Green Version]
  43. Roberts, J.; Baltzer, H.; Gou, Y.; Louis, V.; Robb, C. PYEO: Automated Satellite Imagery Processing, version v0.7; Zenodo: Meyrin, Switzerland, 2020; Available online: https://zenodo.org/record/3689674#.YfWMwqbP3ik and https://github.com/clcr/pyeo/tree/cd1daa9f5f37a2d2b3b314a63e4284dcccbfc075/pyeo/apps/change_detection; (accessed on 1 September 2021). [Google Scholar]
  44. Drusch, M.; Del Bello, U.; Carlier, S.; Colin, O.; Fernandez, V.; Gascon, F.; Hoersch, B.; Isola, C.; Laberinti, P.; Martimort, P. Sentinel-2: ESA’s Optical High-Resolution Mission for GMES Operational Services. Remote Sens. Environ. 2012, 120, 25–36. [Google Scholar] [CrossRef]
  45. Chazdon, R.L. Beyond Deforestation: Restoring Forests and Ecosystem Services on Degraded Lands. Science 2008, 320, 1458–1460. [Google Scholar] [CrossRef] [Green Version]
  46. Chazdon, R.L.; Guariguata, M.R. Natural Regeneration as a Tool for Large-scale Forest Restoration in the Tropics: Prospects and Challenges. Biotropica 2016, 48, 716–730. [Google Scholar] [CrossRef]
  47. Main-Knorn, M.; Pflug, B.; Louis, J.; Debaecker, V.; Müller-Wilm, U.; Gascon, F. Sen2Cor for Sentinel-2 GitHub repository. Available online: https://github.com/c-core-labs/sen2cor (accessed on 1 February 2020).
  48. Main-Knorn, M.; Pflug, B.; Louis, J.; Debaecker, V.; Müller-Wilm, U.; Gascon, F. Sen2Cor for Sentinel-2. In Proceedings of the SPIE Remote Sensing, Warsaw, Poland, 4 October 2017; Volume 10427, p. 1042704. [Google Scholar]
  49. Zhu, Z.; Wang, S.; Woodcock, C.E. Improvement and Expansion of the Fmask Algorithm: Cloud, Cloud Shadow, and Snow Detection for Landsats 4–7, 8, and Sentinel 2 Images. Remote Sens. Environ. 2015, 159, 269–277. [Google Scholar] [CrossRef]
  50. SIMEC; CONANP. Sierra de Manantlán; Gobierno de Mexico; 2020. Available online: https://simec.conanp.gob.mx/ficha.php?anp=59&reg=6 (accessed on 11 November 2021).
  51. Chapela, F. Estado de Los Bosques de Mexico; Consejo Civil Mexicano Para la Silvicultura Sostenible (CCMSS): Mexico City, Mexico, 2012. [Google Scholar]
  52. Santiago-Pérez, A.L.; Jardel-Peláez, E.J.; Cuevas-Guzmán, R.; Huerta-Martínez, F.M. Vegetación de Bordes En Un Bosque Mesófilo de Montaña Del Occidente de México. Boletín de la Sociedad Botánica de México 2009, 85, 31–49. Available online: http://www.scielo.org.mx/scielo.php?script=sci_arttext&pid=S0366-21282009000200004 (accessed on 10 October 2021). [CrossRef]
  53. Marshall, E.; Newton, A.C. Non-Timber Forest Products in the Community of El Terrero, Sierra de Manantlán Biosphere Reserve, Mexico: Is Their Use Sustainable? Econ. Bot. 2003, 57, 262–278. [Google Scholar] [CrossRef]
  54. Secretaría de Medio Ambiente y Desarrollo Territorial. Datos Históricos de Incendios Forestales; Secretaría de Medio Ambiente y Desarrollo Territorial: Guadalajara, Mexico, 2022. [Google Scholar]
  55. Cassell, B.A. Fire History of the Sierra de Manantlán Biosphere Reserve in Western Mexico; University of Washington: Seattle, WA, USA, 2012; ISBN 1-267-69621-4. [Google Scholar]
  56. Davalos, L.M.; Bejarano, A.C.; Hall, M.A.; Correa, H.L.; Corthals, A.; Espejo, O.J. Forests and Drugs: Coca-Driven Deforestation in Tropical Biodiversity Hotspots. Environ. Sci. Technol. 2011, 45, 1219–1227. [Google Scholar] [CrossRef]
  57. Martínez, G. Construyendo Agenda 21 Para El Departamento de Putumayo: Una Construcción Colectiva Para El Desarrollo Sostenible de La Amazonia Colombiana; Instituto Amazónico de Investigaciones Científicas SINCHI: Bogota, Colombia, 2007; ISBN 958-8317-27-4. [Google Scholar]
  58. Duivenvoorden, J.F.; Lips, J.M. A Land-Ecological Study of Soils, Vegetation, and Plant Diversity in Colombian Amazonia; Tropenbos International: Ede, The Netherlands, 1995; Available online: https://www.tropenbos.org/resources/publications/a+land-ecological+study+of+soils,+vegetation,+and+plant+diversity+in+colombian+amazonia (accessed on 10 October 2021)ISBN 90-5113-024-4.
  59. García, U.G.M.; García, C.M.H.; Rondón, J.M.R.; Castellanos, H.O.A. Monitoreo de Los Bosques y Otras Coberturas de La Amazonia Colombiana, a Escala 1: 100.000 Cambios Multitemporales En El Período 2002 Al 2007; Instituto Amazónico de Investigaciones Científicas SINCHI: Bogota, Colombia, 2011; ISBN 958-8317-70-3. [Google Scholar]
  60. IDEAM. Quinto Boletín de Alertas Tempranas de Deforestacion: Segundo Semestre; IDEAM: Bogotá, Colombia, 2015. [Google Scholar]
  61. Cabrera, E.; Galindo, G.; González, J.; Vergara, L.; Forero, C.; Cubillos, A.; Espejo, J.; Rubiano, J.; Corredor, X.; Hurtado, L. Colombian Forest Monitoring System: Assessing Deforestation in an Environmental Complex Country. In Forest Degradation Around the World; Books on Demand: Norderstedt, Germany, 2019. [Google Scholar]
  62. IDEAM. Resultados Del Monitoreo de La Deforestación 2017; IDEAM: Bogotá, Colombia, 2018. [Google Scholar]
  63. Armenteras, D.; Rodríguez, N. Dinámicas y Causas de Deforestación En Bosques de Latino América: Una Revisión Desde 1990. Colomb. For. 2014, 17, 233–246. [Google Scholar] [CrossRef] [Green Version]
  64. Coca-Castro, A.; Reymondin, L.; Bellfield, H.; Hyman, G. Land Use Status and Trends in Amazonia. A Report for the Amazonia Security Agenda Project (January, 2013); Centro Interacional de Agricultura Tropical: Cali, Colombia, 2013. [Google Scholar]
  65. Hoffmann, C.; Márquez, J.R.G.; Krueger, T. A Local Perspective on Drivers and Measures to Slow Deforestation in the Andean-Amazonian Foothills of Colombia. Land Use Policy 2018, 77, 379–391. [Google Scholar] [CrossRef]
  66. SIATAC. Puntos de Calor Por Región (Histórico). Cartagena Del Chairá 2018; Catalogo Geonetwork-SIATAC: Bogotá, Colombia, 2021. [Google Scholar]
  67. Sutton, O. Introduction to k Nearest Neighbour Classification and Condensed Nearest Neighbour Data Reduction. Univ. Lect. Univ. Leic. 2012, 1–10. Available online: https://staff.fmi.uvt.ro/~daniela.zaharie/dm2018/ro/TemeProiecte/Biblio/kNN/CondensedNearestNeighbor.pdf (accessed on 2 February 2020).
  68. García-Laencina, P.J.; Sancho-Gómez, J.-L.; Figueiras-Vidal, A.R.; Verleysen, M. K Nearest Neighbours with Mutual Information for Simultaneous Classification and Missing Data Imputation. Neurocomputing 2009, 72, 1483–1493. [Google Scholar] [CrossRef]
  69. Feng, S.; Zhao, J.; Liu, T.; Zhang, H.; Zhang, Z.; Guo, X. Crop Type Identification and Mapping Using Machine Learning Algorithms and Sentinel-2 Time Series Data. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2019, 12, 3295–3306. [Google Scholar] [CrossRef]
  70. Breiman, L. Statistical Modeling: The Two Cultures (with Comments and a Rejoinder by the Author). Stat. Sci. 2001, 16, 199–231. [Google Scholar] [CrossRef]
  71. Breiman, L.; Friedman, J.H.; Olshen, R.A.; Stone, C.J. Classification and Regression Trees; Wadsworth Inc.: Monterey, CA, USA, 1984. [Google Scholar]
  72. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning; Springer Series in Statistics; Springer: New York, NY, USA, 2001. [Google Scholar]
  73. Robb, C. Geospatial Learn 0.3. Zenodo. 2017. Available online: https://github.com/Ciaran1981/geospatial-learn (accessed on 20 November 2019).
  74. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Louppe, G.; Prettenhofer, P.; Weiss, R. Scikit-Learn: Machine Learning in Python. arXiv 2012, arXiv:1201.0490. [Google Scholar]
  75. Olofsson, P.; Foody, G.M.; Herold, M.; Stehman, S.V.; Woodcock, C.E.; Wulder, M.A. Good Practices for Estimating Area and Assessing Accuracy of Land Change. Remote Sens. Environ. 2014, 148, 42–57. [Google Scholar] [CrossRef]
  76. Olofsson, P.; Foody, G.M.; Stehman, S.V.; Woodcock, C.E. Making Better Use of Accuracy Data in Land Change Studies: Estimating Accuracy and Area and Quantifying Uncertainty Using Stratified Estimation. Remote Sens. Environ. 2013, 129, 122–131. [Google Scholar] [CrossRef]
  77. Planet Team. Planet Team. Planet Application Program Interface. In Space for Life on Earth; Planet Team: San Francisco, CA, USA, 2017. [Google Scholar]
Figure 1. (A) A conceptual diagram of the proposed near real-time processing chain and outputs. S2 refers to the Sentinel-2 images. AOI refers to the area of interest or area of studio. The aoi_config.ini is the file containing all parameters for processing the Sentinel-2 image. The trained_model.pkl refers to any pre-trained machine-learning model. (B) An example of a deforestation alert in the Forest Sentinel App. (C) A quick look at the classification of change.
Figure 1. (A) A conceptual diagram of the proposed near real-time processing chain and outputs. S2 refers to the Sentinel-2 images. AOI refers to the area of interest or area of studio. The aoi_config.ini is the file containing all parameters for processing the Sentinel-2 image. The trained_model.pkl refers to any pre-trained machine-learning model. (B) An example of a deforestation alert in the Forest Sentinel App. (C) A quick look at the classification of change.
Remotesensing 14 00707 g001
Figure 2. The window areas of Manantlán (Mexico) and Cartagena del Chairá (Colombia).
Figure 2. The window areas of Manantlán (Mexico) and Cartagena del Chairá (Colombia).
Remotesensing 14 00707 g002
Figure 3. The temporal variation in the deforestation alerts for 2018 using Sentinel-2 images in the window areas for Manantlán, Mexico, and Cartagena del Chairá, Colombia.
Figure 3. The temporal variation in the deforestation alerts for 2018 using Sentinel-2 images in the window areas for Manantlán, Mexico, and Cartagena del Chairá, Colombia.
Remotesensing 14 00707 g003
Figure 4. The proportional size distribution of the detected deforestation events for the Colombia and Mexico window areas.
Figure 4. The proportional size distribution of the detected deforestation events for the Colombia and Mexico window areas.
Remotesensing 14 00707 g004
Figure 5. The detected forest change events for the window areas in 2018. (A) Mexico. (B) Colombia. In the RGB Sentinel-2 change composite, red corresponds to the blue channel of the earliest date, green corresponds to the green channel of the latest data and blue corresponds to the red channel of the latest acquisition date. In the RGB composite shown in the zoom windows, areas of dark blue correspond to forest loss, while the purple areas correspond to changes from other vegetation to non-vegetation.
Figure 5. The detected forest change events for the window areas in 2018. (A) Mexico. (B) Colombia. In the RGB Sentinel-2 change composite, red corresponds to the blue channel of the earliest date, green corresponds to the green channel of the latest data and blue corresponds to the red channel of the latest acquisition date. In the RGB composite shown in the zoom windows, areas of dark blue correspond to forest loss, while the purple areas correspond to changes from other vegetation to non-vegetation.
Remotesensing 14 00707 g005
Figure 6. A comparison of PYEO deforestation alerts to Global Forest Watch over two small areas in Colombia and Mexico, showing a color composite of Sentinel-2 data used for the automated change detection in this example from January to December 2018. The color composite has the combination of the earliest acquisition date and the date of change detection. In the Sentinel-2 RGB composite image, red corresponds to the blue channel of the earliest date, green corresponds to the green channel of the latest date and blue corresponds to the red channel of the latest date. On the maps to the left, dark blue corresponds to forest cover loss, while in the center and to the right, forest cover loss detections are overlaid in orange. GFW: Global Forest Watch.
Figure 6. A comparison of PYEO deforestation alerts to Global Forest Watch over two small areas in Colombia and Mexico, showing a color composite of Sentinel-2 data used for the automated change detection in this example from January to December 2018. The color composite has the combination of the earliest acquisition date and the date of change detection. In the Sentinel-2 RGB composite image, red corresponds to the blue channel of the earliest date, green corresponds to the green channel of the latest date and blue corresponds to the red channel of the latest date. On the maps to the left, dark blue corresponds to forest cover loss, while in the center and to the right, forest cover loss detections are overlaid in orange. GFW: Global Forest Watch.
Remotesensing 14 00707 g006
Table 1. The definitions of change classes considered in the NRT change detection system.
Table 1. The definitions of change classes considered in the NRT change detection system.
ClassesDefinition
Stable ForestForest that remains forest between two observations.
Forest > Other VegetationForest that changes to other vegetation. This class is difficult to detect during a short period of analysis. It may happen when deforested areas are in a recovery process but are still not within the definition of forest, or areas that replaced by other type of vegetation not identified as forest (i.e., bracken, agriculture, palm plantations).
Forest > Non-vegetationForest that changes to non-vegetated areas, i.e., deforestation.
Stable Other VegetationAreas that are covered by vegetation other than forest on both observation dates.
Other Vegetation > ForestOther vegetation that changes into forest, i.e., areas of tree planting, afforestation or reforestation.
Other Vegetation > Non-vegetationOther vegetation that changes into non-vegetated areas, i.e., change in crops, seasonal shrubs.
Stable Non-vegetationAreas that are not covered by vegetation in either observation, i.e., bare soils, urban areas.
Non-vegetation > ForestNon-vegetated areas that change into forest, i.e., areas of tree planting, afforestation or reforestation. This class might occur in a longer time series (i.e., > 10 years) [45,46].
Non-vegetation > Other VegetationNon-vegetated areas that change into other vegetation, i.e., crop growth, bracken growth, succession.
Table 2. The training and validation samples per class in each window area (Colombia and Mexico). The training points are pixel-based, while the validation areas correspond to polygons drawn by visual interpretation using PlanetScope data.
Table 2. The training and validation samples per class in each window area (Colombia and Mexico). The training points are pixel-based, while the validation areas correspond to polygons drawn by visual interpretation using PlanetScope data.
ClassesTraining Sampling PointsValidation Polygons
ColombiaMexicoColombiaMexico
1Stable Forest74,48687,527181123
2Forest > Non-vegetation3839803310050
3Stable Other Vegetation693325,2143020
4Other Vegetation > Non-vegetation143914,4554025
5Stable Non-vegetation829422,0394513
6Non-vegetation > Other Vegetation73011,047408
TOTAL99,047168,315446239
Table 3. The accuracy assessment of the change detections from Sentinel-2 by the visual interpretation of PlanetScope data for the two window areas. NRT: near real-time; PA: producer accuracy; UA: user accuracy; OA: overall accuracy.
Table 3. The accuracy assessment of the change detections from Sentinel-2 by the visual interpretation of PlanetScope data for the two window areas. NRT: near real-time; PA: producer accuracy; UA: user accuracy; OA: overall accuracy.
ClassesPA (%)UA (%)OA (%)
ColombiaMexicoColombiaMexicoColombiaMexico
1Stable Forest9495961009597
2Forest > Non-vegetation758899978793
3Stable Other Vegetation909568677981
4Other Vegetation > Non-vegetation957459777776
5Stable Non-vegetation938688409063
6Non-vegetation > Other Vegetation75421001008871
TOTAL87.08780858086
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Pacheco-Pascagaza, A.M.; Gou, Y.; Louis, V.; Roberts, J.F.; Rodríguez-Veiga, P.; da Conceição Bispo, P.; Espírito-Santo, F.D.B.; Robb, C.; Upton, C.; Galindo, G.; et al. Near Real-Time Change Detection System Using Sentinel-2 and Machine Learning: A Test for Mexican and Colombian Forests. Remote Sens. 2022, 14, 707. https://doi.org/10.3390/rs14030707

AMA Style

Pacheco-Pascagaza AM, Gou Y, Louis V, Roberts JF, Rodríguez-Veiga P, da Conceição Bispo P, Espírito-Santo FDB, Robb C, Upton C, Galindo G, et al. Near Real-Time Change Detection System Using Sentinel-2 and Machine Learning: A Test for Mexican and Colombian Forests. Remote Sensing. 2022; 14(3):707. https://doi.org/10.3390/rs14030707

Chicago/Turabian Style

Pacheco-Pascagaza, Ana María, Yaqing Gou, Valentin Louis, John F. Roberts, Pedro Rodríguez-Veiga, Polyanna da Conceição Bispo, Fernando D. B. Espírito-Santo, Ciaran Robb, Caroline Upton, Gustavo Galindo, and et al. 2022. "Near Real-Time Change Detection System Using Sentinel-2 and Machine Learning: A Test for Mexican and Colombian Forests" Remote Sensing 14, no. 3: 707. https://doi.org/10.3390/rs14030707

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop