2.3.1. Imagery Acquisition and Pre-Processing
Free Landsat images of four periods with ten years intervals each, 1990, 2000, 2010, and 2020, were downloaded from the United States Geological Survey (USGS, accessed 20 January 2022, “
earthexplorer.usgs.gov/”). We used images for the dry season months between July and September with < 10% cloud cover, whereby the time gap between the images needed to be > 10 days. This led to a total of 37 images (Dataset 1,
Table 1). To obtain seasonal NDVI within a year, an additional 127 Landsat images (
Dataset 2, Table S4) in four periods of each year (1990, 2000, 2010, and 2020) were collected to calculate comparative NDVI values. We omitted Landsat 7 data due to their sensor errors; the correction process showed a large gap in our study area. Instead, we used Landsat Level 1 for time series analyses. This avoided errors of processed Level 2 products that contain two different surface reflectance algorithms.
The radiance measured by any given system from any object is influenced by factors such as changes in scene illumination, atmospheric conditions, viewing geometry, and instrument response characteristics [
17]. Radiometric calibration of satellite-acquired data is therefore essential for quantitative scientific studies, as well as for a variety of image-processing applications [
18]. The objective of atmospheric correction is to retrieve the surface reflectance (which characterizes the surface properties) from remotely sensed imagery by removing atmospheric effects.
Using a Dark Object Subtraction (DOS) approach during pre-processing, all images were corrected for atmospheric and radiometric errors. This allowed the removal of haze values caused by scattering of the remote sensing data [
19], and the effects of water vapor in the atmosphere, that can absorb the radiation in a specific area [
20]. After correction, the images of each year were mosaicked using the Seamless Mosaic tool of ENVI 5.3 (Exelis Visual Information Solutions, Boulder, CO, USA) and clipped to the study area of the High Atlas Mountains.
2.3.2. Classification and Accuracy Assessment
Approach 1. Support Vector Machine
Field dataset: The first of the four vegetation classes, referred to as “Open Canopy Vegetation”, represented agricultural lands, grasslands, and tundra. The second class, referred to as “Water”, represented artificial lakes, rivers, dams, streams, and reservoirs. The third class, “Forest”, included woody and other wild vegetation, while “Bareland” referred to sand dunes, exposed rocks, deserts, and uncultivated areas (
Figure 3). Those classes derived from separation [
21] after inspecting them using Google Earth 7.1.
The training dataset was collected from the Google Earth Imagery and examined for spectral signatures of each land-use class across images of 2000, 2010, and 2020. We first used a visualization of land cover classes in the most recent year, 2020, employing Google Earth Imagery, and then re-inspected the land cover type for a single point in 2010 and 2010 to build the training set of 2010 and 2000. Moreover, we used false color images of 1990 to inspect land use classes. A small polygon was generated for each class in combination with the false color in the historical imagery of 1990. In total, 482 samples averaging 4000 m² in size were collected for all periods (
Figure 3). Overall, we collected 482 samples, of which 177 referred to Open Canopy Vegetation, 12 to Water, 95 to Forest, and 198 to Bareland (
Table 2,
Figure S3). During fieldwork conducted in February 2022, we collected 215 GPS points for validation based on Google Earth online images. These points were converted from the Keyhole Markup Language (KML) to shapefile format. On the other hand, we randomly identified a number of points for each class in ArcGIS and exported them in KML format into Google Earth. These points allowed us to verify each class and to conduct an accuracy assessment.
Classification and accuracy assessment: The SVM classification was performed through ENVI classic version 5.3 (Exelis Visual Information Solutions, Boulder, CO, USA). This method is considered as a supervised learning system based on statistical learning to identify and distinguish an optimal separation between the classes [
21] based on the training [
22]. The SVM approach was introduced first in the late 1970s [
21]. It is an efficient classifier in high-dimensional spaces, which makes it particularly applicable to multi-dimensional remote sensing data [
23]. Classification accuracy was determined by calculating the overall accuracy and the Kappa index [
19,
24]—a commonly employed index—whereby we used our ground truthing points from the field visit as an independent dataset.
Approach 2. Rules-based using Normalized Difference Vegetation Index
The NDVI was calculated using the following equation for the Normalized Difference Vegetation Index (NDVI, [
25]:
where NIR refers to spectra reflectance of the Near Infrared.
NDVI values range from −1 to 1. The highest value (NDVI =1) represents a fully healthy vegetation, while the lowest NDVI value (NDVI = −1) indicates non-vegetative land cover [
25].
To investigate the correlation between the NDVI and the land-use types in the High Atlas Mountains, we analyzed Landsat images for the four seasons of each year using the criteria presented in
Table 1 [
26]. Hereby, Dataset 2 consisted of 127 Landsat images of 1990, 2000, 2010, and 2020 in Period 1 (December–January), Period 2 (March–April), Period 3 (July–August), and Period 4 (Oct–November) within periods < 60 days depending on the availability of historical images (
Supplement Tables S1 and S4). All datasets were corrected and mosaicked before calculating NDVI. We randomized 150 points inside the boundaries of the study site area, which we virtually visited using Google Earth Pro 7.1. At each ‘virtual ground truthing point’, we labeled the class as in the land cover classification of 2020. Vegetation types such as cropping system (mono-cropping, crop and tree), and tree density were determined for the same periods. The NDVI value of each point in the four periods was recorded using the identification function in QGIS. Due to the dominance of Bareland in the High Atlas regions, we collected 88 additional stratified ‘virtual ground truthing points’ for vegetative areas. To this end, we identified land cover and vegetation type in each of the four periods (four NDVI values of 238 points) to compare the greenness of each land-use/or vegetation type using pairwise comparisons. The distribution and vegetation status allowed us to build rules to distinguish land cover types and to compare them with the results from the first approach using the SVM method. While NDVI represents a consistent index based on the reflectance of greenness in the ground retrieved from remote sensing data, we applied the same rules-based approach built for 2020 to classify land, using the historical NDVI of 1990, 2000 when no field data were available. The Landsat data of 2010 displayed a line error between single images during the mosaicking process; we therefore excluded this period in this approach. We assume this error may be the result of the limited mosaicking capacity in the software used.
To assess the accuracy of our method, we simplified the accuracy assessment by using a qualitative approach with semi-random windows visualization, comparing the classified map versus available high-resolution based maps in QGIS. For this purpose, we randomized seven windows with an average size of 5 × 5 km and subsequently visually compared classified maps of the two approaches versus a true color base map. Finally, we employed a harmonization process to group the equivalent class of both approaches. The equivalent land-use type in the SVM approach and the vegetation type (NDVI approach) were grouped into a generic class to compare both approaches using visualization. This allowed the verification of the vegetation differences between both approaches and the vegetation dynamics in the High Atlas Mountains.