1. Introduction
Aerosols and water vapour are two of the most important atmospheric constituents of the Earth-atmosphere system because they attenuate solar radiation through scattering and absorption processes, and also modulate terrestrial radiation mainly by absorption and emission. The effect of aerosols and water vapour on short wave (SW) irradiance was analysed by Obregón et al. [
1], who showed that the combined effect of those quantities was not simply the sum of their individual effects. The effect of each constituent has been also studied by Alexandri et al. [
2]. Atmospheric aerosols also indirectly affect the radiation balance by influencing cloud formation. Changes in the atmospheric composition due to these constituents can modify the radiative budget and, as a result, the global temperature of the Earth [
3]. Moreover the aerosol load in the atmospheric column (AOT) and the total water vapour integrated over an atmospheric column (IWV) are key quantities for atmospheric correction of satellite radiation measurements, and their reliability greatly influence the accuracy of the surface reflectance satellite derived products [
4].
AOT and IWV data can be obtained from sunphotometer measurements. However, these are representative of point measurements at the surface, providing local information corresponding to the specific locations where the measurement stations are located and, therefore, cannot provide the spatial coverage required for global monitoring. Consequently, if the objective is to perform studies on a regional or global scale, it is better to use data with global coverage, such as those provided by instruments on board satellite platforms. Some of the instruments used to retrieve aerosol related quantities and/or water vapour on board satellite platforms as Terra, Aqua, Aura, Calipso, Meteosat or Sentinel-2, among others, are moderate resolution imaging spectroradiometer (MODIS), multi-angle imaging spectro-radiometer (MISR), ozone monitoring instrument (OMI), spinning enhanced visible and infrared imager (SEVIRI), cloud-aerosol LIdar with orthogonal polarization (CALIOP), multi-spectral imager (MSI), etc. Most AOT and IWV data derived from observations with these instruments have already been validated with data obtained from the Aerosol Robotic Network (AERONET) sunphotometer measurements [
4,
5,
6,
7,
8,
9,
10,
11,
12]. These last three studies have validated AOT and IWV data obtained from the implementation of the offline version of an atmospheric correction algorithm to data from MSI measurements on board the Sentinel-2 satellite, termed Sen2Cor [
13]. However, there are more recent AOT and IWV data, such as those obtained from MSI measurements on board the Sentinel-2 satellite and processed by the European Space Agency (ESA) for Europe, concretely level 2A (ESA-L2A) data, and its validation has not yet been performed in a general way. Djamai et al. [
14] validated ESA-L2A AOT and IWV but only used five data points belonging to three AERONET stations, one in Spain and two in France, during the period April - October 2017. Due to the reduced number of data points and the very short time period, this validation is very limited, both spatially and temporally. Therefore, the objective of this work is to validate ESA-L2A AOT and IWV over Europe and adjacent regions (western Asia and northern Africa), under cloud-free conditions, using as many reference data as possible. Satellite AOT products that present high spatial resolution are of great interest since they can be used to study local sources of aerosols in urban areas [
15] and monitor local air pollution [
16]. In these areas, aerosol distributions can present important spatial variations due to the means of transportation, construction infrastructure, industrial activity or variations in the density of the human population. The use of high spatial resolution satellite AOT data support the identification of local peak concentrations and allow defining concentration gradients between surface monitoring stations. In addition, Sentinel-2 satellite is of great interest to study surface features and processes taking advantage of its high spatial resolution. This requires an accurate atmospheric correction of the images, therefore the need for assessing and improving Sentinel-2 atmospheric products. The reference data used, as in other similar validation studies, are AOT and IWV data derived from AERONET sunphotometer measurements. Specifically, data from 94 AERONET stations
Supplementary Materials during the period between March 2017 and December 2018 have been used for this study. This time period covers four seasons, which may be of some interest at particular regions.
This paper is organized as follows. The data used in this study are described in
Section 2. In
Section 3, the methodology is presented. The main results are shown and discussed in
Section 4. An analysis of potential uncertainty sources is provided in
Section 5 and conclusions are given in
Section 6.
3. Methodology
The ESA-L2A retrieved AOT and IWV products were compared with independent space–time-collocated AERONET AOT and IWV quantities retrieved from the sunphotometer measurements. For this purpose, subsets of 9 km × 9 km, around the coordinates of each AERONET site were extracted from the MSI tiles, following the procedures of other validation studies (e.g., [
12,
14]). The procedure was limited at each site by the existence of AERONET level 2.0 data with a temporal difference of less than ±1 h from the satellite time overpass. The next step was to identify the valid pixels within the subsets for the validation process. Each of the subset pixels, with a spatial resolution of 20 m, is assigned a value of AOT, IWV and scene classification, among other products. In this study, only those pixels with scene classifications of a dark area (codified as 2), vegetation (codified as 4) or bare soils (codified as 5) were considered. Once these pixels were identified, their percentage was calculated, finally selecting those subsets with percentages greater than 50%. It is noted that pixels classified as water (scene classification 6), have not been taken into account in the calculation of this percentage. Subsequently, the AOT and IWV averages were computed for each subset with the pixels that met the defined criteria. The closest AOT and IWV AERONET data to the satellite overpass were selected for each station, considering a maximum difference of ±1 h from the satellite overpass, as mentioned before. AERONET data fulfilling this condition were then selected and linearly interpolated to estimate the value of these products at the exact time of the satellite overpass. The existence of AERONET AOT and IWV products, both in the previous and subsequent hour, is a necessary condition to perform such an interpolation.
After these collocation steps, the two initial databases (ESA-L2A and AERONET Level 2.0) are reduced to two composed of spatial and temporal coincident cases in a total of 2398 pairs of data. AOT and IWV of both databases have been compared in order to evaluate the accuracy of AOT and IWV ESA-L2A products. Common statistical indicators have been used in this study to obtain an appropriate comparative assessment of both datasets. The mean bias error (MBE), which is a measure of the overall error or systematic error, was calculated as follows:
where N is the size of the sample corresponding to the total number of data, X are ESA-L2A products data and Xref are the ground-based reference values, i.e., AERONET Level 2.0 data.
The root mean square error (RMSE) gives a measure of the error existing between ESA-L2A products data and AERONET Level 2.0 products data. Small values of the RMSE indicate that the regression fit is acceptable. This parameter is expressed as:
Also, normalized values of the previous statistic indicators (NMBE and NRMSE) have been calculated to compare AOT and IWV.
Another statistical indicator is the determination coefficient (R2), which indicates the accuracy of the obtained regression fit from the comparison. This parameter, which shows how much the model is able to describe the data, should be close to 1.
4. Results
Figure 2 presents the scatter plots of the 2398 ESA-L2A retrievals of AOT (a) and IWV (b) versus the coincident reference AERONET Level 2.0 quantities retrieved from the ground-based measurements. This figure also shows the regression line, MBE, NMBE, RMSE, NRMSE and R2 values obtained for each quantity. The intercomparison of ESA-L2A and AERONET derived quantities showed a better agreement for IWV than for the AOT, with NRMSE of 5.33 and 9.04%, respectively. This conclusion was also reflected in the values of R
2, which are 0.99 and 0.65 for IWV and AOT, respectively. From this figure, it was extracted that the NMBE corresponding to AOT was 3.75%, while for IWV, it presented a negative value of −3.24%. These results indicated that ESA-L2A AOT data overestimated the reference values, while the opposite occurred with IWV. The value of MBE for AOT (0.05) was lower than the value of MBE reported in Djamai et al. [
14] (0.07). However, the value of MBE for IWV (−0.14 cm), although with the same sign, presented a higher absolute value than that reported by Djamai et al. [
14] (−0.02 cm). It should be taken into account that these authors only used five data points belonging to three AERONET stations from the south of Europe (see
Figure 1) during the period April 2017–October 2017. This was a reduced dataset compared with the 94 AERONET sites covering a wide geographical region with a variety of climate and environmental conditions considered in the present study.
Figure 2 also shows the accuracy requirement limits for AOT and IWV defined by ESA [
21], represented with solid blue lines. As it can be observed, IWV meets the accuracy requirements better than AOT. Specifically, from the 2398 data points used in this study, 2167 IWV data points (90.4%) were within the limits defined by ESA, while, in the case of AOT, only 875 (36.5%) met the requirement. Another aspect to be highlighted in this figure was that the minimum value of ESA-L2A AOT was 0.06. This value is greater than the minimum value of AERONET AOT, which is close to zero. In fact, 26.9% of AERONET AOT data were lower than 0.06. With respect to IWV, it was also observed that the minimum value of ESA-L2A IWV, 0.15, was higher than the minimum value of AERONET IWV, which was 0.04. However, in this case only sixteen AERONET IWV data were lower than 0.15.
According to information from ESA (
https://earth.esa.int/web/sentinel/missions/sentinel-2/news/; last accessed 21/05/2019), a new version of the Sentinel-2A spectral response functions is used from 15 January 2018 onwards, mainly correcting the responses for bands 1 and 2 due to inaccurate spectral responses provided previously and used in the above retrievals before 15 January 2018. Band 2, centered at 492 nm, is used to estimate AOT, thus one of the reasons for the poor agreement between ESA-L2A AOT estimates and AERONET AOT data could be the use of incorrect spectral response functions to estimate ESA-L2A AOT. This is investigated by dividing the data into two sub-periods, before and after the spectral response function update, and inter-comparing ESA-L2A AOT and AERONET AOT in the two sub-periods. The first sub-period includes the data obtained with the old spectral response functions, i.e., from the beginning of the study period (March 2017) until 14 January, 2018 and the second sub-period includes the data obtained with the new spectral response functions, from 15 January to December 2018.
Figure 3 presents the scatter plots of AOT from ESA-L2A retrievals versus coincident AERONET Level 2.0 data for the two sub-periods. As it can be seen in this figure, the inter-comparison of ESA-L2A AOT and AERONET showed a better agreement for the second sub-period than for the first one, with RMSE values of 0.14 and 0.08, respectively for the first and second sub-periods. Therefore, ESA-L2A AOT values estimated with the new spectral response functions are closer to the reference values than the ones obtained with the previous spectral response functions. This conclusion was also verified through the calculation of the MBE values, obtaining 0.01 for the period after 15 January, 2018 compared with the 0.09 value obtained for the period before 15 January, 2018. The values of RMSE and MBE for the two sub-periods have also been compared with the values obtained for the whole period, displayed in
Table 3. This indicated that the two statistical indicators have been drastically reduced for the sub-period with the new spectral response functions, compared with the entire period, being even smaller than the ones reported by Djamai et al. [
14] of 0.07 (MBE) and 0.09 (RMSE). It can be concluded that with the use of these new spectral response functions, the quality of the ESA-L2A AOT estimates has significantly improved. The results of this study are compared to those reported by Bilal et al. [
25], regarding the evaluation of AOT retrievals from MODIS using AERONET, in the same region. Despite the different spatial resolutions considered in both studies, the results obtained were very similar in terms of RMSE and MBE values.
The differences between the two AOT datasets (ESA-L2A and AERONET) calculated for the entire study period (March 2017–December 2018), for the first (March 2017–January 2018) and the second sub-periods (January 2018 onwards) are also discernible when a seasonal analysis is carried out as presented in
Figure 4. This figure presents the box and whisker plots for each period of analysis (complete, first and second sub-periods) of ESA-L2A AOT and AERONET AOT databases for the entire time period and for each season (spring- M,A,M-, summer-J,J,A-, autumn- S,O,N- and winter- D, J, F). When the entire study period was considered (
Figure 4a), no seasonality was observed, and the statistical values for each season were very similar. It was also observed that ESA-L2A AOT values were higher than AERONET AOT values in all seasons. However, when the sub-periods were analysed separately, the behaviors were moderately different. For example, in
Figure 4b, corresponding to the sub-period before 15 January, 2018, seasonality was observed in the AOT values of the two databases, obtaining the highest values in summer, and the lowest in winter. The small overlap between the two databases in winter was due to the minimum value of ESA-L2A AOT, which was 0.06, while the minimum value of AERONET was lower. However, the differences between the values of ESA-L2A and AERONET estimates were greater in this sub-period (for each season and the entire time) than when the entire study period was considered (
Figure 4a). It should be noticed that in
Figure 4b the boxes characterizing the inter-quartile ranges only overlap in autumn, whereas in
Figure 4a, they overlap in all seasons and for the whole time period. On the other hand, if the second sub-period is used (
Figure 4c) a greater overlap of both databases is obtained, which means that ESA-L2A AOT and AERONET AOT values show a better agreement when using the new spectral response functions in the ESA AOT retrievals. Moreover, the AOT values also showed seasonality, with the highest values in summer and the lowest in winter. The high AOT values in summer are due to a combination of factors that occur in this period of the year. One of them is the lack of precipitation, favoring the presence of aerosols in the atmosphere. Another factor is the arrival of desert-dust aerosols from the Saharan region and the summer forest fires, which are important sources of aerosols in the stations of the Mediterranean region [
26,
27] (South Europe and North Africa/Middle East). In this study, stations of the Mediterranean region predominate with respect to those of northern Europe, hence, these aerosol sources have a great influence on summer AOT values. This seasonal variation of AOT was also observed by other authors [
28,
29,
30].
5. Analysis of Potential Uncertainty Sources
Since the correlation between ESA-L2A AOT estimates and AERONET AOT data is not very good, it improves when the correct spectral response functions are applied. Hence, a sensitivity analysis to key factors, as the scene classification class, the region and the aerosol type, has also been performed.
As mentioned, in this study only pixels with scene classifications classes 2 (Dark_Area_PIXELS), 4 (Vegetation) and 5 (Bare_soils) were considered. In order to estimate the sensitivity of the intercomparison to scene classification, the study was performed individually for each scene classifications class. The results obtained are shown in
Table 4, both for the whole period and for the sub-period after 15 January, 2018. From this table, the best agreement between the two AOT datasets was obtained for the scene class 4 (Vegetation), with RMSE of 0.08 and 0.10, respectively. This conclusion was also reflected in the values of R
2, which were 0.74 and 0.81 for the whole period and for the sub-period after 15 January, respectively.
Regional variation is also an important factor to take into account in the assessment of Sentinel-2 AOT values, since each region presents different environmental conditions and is typically influenced by different aerosol types. For this reason, the stations have been grouped into three regions (North Europe, South Europe, and North Africa/Middle East) in order to investigate the effect of regional variations on the ESA-L2A AOT product accuracy. The results obtained are presented in
Table 5 and show that there are regional differences, with the best agreement obtained for the North Europe region and the worst agreement in North Africa/Middle East region, both for the whole period and for the sub-period after 15 January, 2018. The reason for the poor correlation between the two AOT datasets in the North Africa and Middle East regions is due to the assignment of AOT values of 0.2 (corresponding to the initial visibility assumed by the algorithm), as illustrated in
Figure 5, which occurs when the scene does not contain reference pixels for the calculation. The sensitivity of the comparison to different aerosol types was analysed using the classification proposed by Elias et al. [
31], who presented a criterion to classify the aerosol turbidity situations based on both the values of AOT and the Ångström exponent. The aerosol types proposed are clean (CL), continental (CO), maritime (MA), forest fires emissions (FF) and desert dust (DD).
Table 6 shows the values of MBE, RMSE and R2 for each aerosol type, both for the whole period and for the sub-period after 15 January, 2018. From this table, CO type presented the best agreement between ESA-L2A AOT estimates and AERONET AOT data, with the lowest MBE and RMSE values. However, DD type presented the worst correlation due to the existence of a large number of cases with ESA-L2A AOT value equal to 0.2, showing the relationship between this aerosol type and the North Africa region. The most negative value of MBE corresponded to FF aerosol type. This means that there is an underestimation of ESA-L2A AOT in relation to AERONET AOT during these episodes, characterized by very high AOT values. Followed by these values of MBE were those corresponding to CL and MA types, although with positive values which indicated an overestimation of ESA-L2A AOT. This was because the minimum value of ESA-L2A AOT was 0.06, which was much higher than most of the lowest values of AERONET AOT, characterizing CL and MA aerosol types. Also the highest values of AOT were not well represented by ESA-L2A AOT product, as illustrated in
Figure 5.
The previous sensitivity analyses allows the conclusion that ESA-L2A AOT estimates equal to 0.2 and AERONET AOT values lower than 0.06 and higher than 0.4 worsen the correlations between ESA-L2A AOT estimates and AERONET AOT data. Therefore, in the following analysis, AERONET AOT values lower than 0.06, higher than 0.4 and data corresponding to stations that present more than 20% of the ESA-L2A AOT values equal to 0.2 have not been considered. This analysis has been performed only for the sub-period after 15 January, 2018. The results obtained are presented in
Figure 6 and constitute the recommended dataset to use for quantitative purposes. This figure shows that the correlation between ESA-L2A AOT estimates and AERONET AOT data is better (R
2 equal to 0.84) than that obtained without the exclusions described and shown in
Figure 3 (R
2 equal to 0.72). The value of RMSE also decreased, from 0.08 to 0.07. This new AOT dataset meets the ESA accuracy requirements better than the complete AOT dataset presented in
Figure 3 (without the exclusions described). Specifically, from the 838 data points used, 526 AOT data points (62.8%) were within the error limits defined by ESA, whereas previously only 36.5% of the AOT data met this requirement.
Therefore, it can be concluded that ESA-L2A AOT product should be used with caution in two cases: (a) Regions where the appropriate AOT value is often not estimated (due to the lack of DDV regions) and the value of 0.2 is assigned, and (b) regions with extreme visibility values (high or low), that is, sites that normally have very low or high AOT values. It would be advisable to use more than one default value (start visibility), according to the geographic area and season. These reference values could be obtained from a climatological analysis of ground-based or satellite reference data (AERONET for example). This would be important because in the cases with no DDV pixels available to obtain ESA-L2A AOT estimates, the AOT corresponding to the start visibility is used, which causes important errors as demonstrated here.
6. Conclusions
This study contributes to the validation of AOT and IWV products provided by ESA from MSI measurements on board the Sentinel-2 satellite, respectively AOT ESA-L2A and IWV ESA-L2A. These products have been validated by comparison with data from 94 AERONET stations over Europe and adjacent regions and during the period between March 2017 and December 2018. The results indicated that ESA-L2A AOT data overestimated the reference values with NMBE of 3.75%, while the opposite occurred with IWV presenting a NMBE of −3.24%. The results showed a good agreement for IWV with NRMSE of 5.33% and R
2 above 0.99. However, a larger disagreement was obtained for AOT, with NRMSE of 9.04% and R
2 equal to 0.65. One possible cause for this lack of agreement in the AOT values could be due to the use of inaccurate spectral response functions, which impacted the retrieval of AOT ESA-L2A product from MSI measurements (the most affected were bands 1 and 2 and AOT retrieval uses band 2, among others). A new version of the Sentinel-2A spectral response functions is used from 15 January, 2018 onwards. Therefore, the study period was divided into two sub-periods, before and after 15 January, 2018. The inter-comparison of ESA-L2A AOT with AERONET AOT showed a better agreement for the second sub-period than for the first one, with RMSE (and MBE) values of 0.08 (0.01) for the second sub-period and 0.14 (0.09), for the first one. These values of RMSE and MBE were also smaller than the ones reported by Djamai et al. [
14], of 0.09 and 0.07, respectively. The seasonal differences between AOT ESA-L2A and AOT AERONET data for the entire study period, the first sub-period and the second sub-period have also been analysed. The results showed no seasonality for the entire period. However, when the sub-periods were analysed separately, seasonality was observed, obtaining the highest AOT values in summer and the lowest in winter. For the first sub-period, the differences between the ESA-L2A and AERONET AOT estimates were greater in this sub-period than for the entire study period for all seasons. On the other hand, if the second sub-period is considered, a greater overlap of both databases is obtained for all seasons. Therefore, it can be concluded that with the use of these new spectral response functions the quality of the ESA-L2A AOT estimates has significantly improved. An analysis of potential uncertainty sources showed that the algorithm accuracy highly depended on the AOT first guess and that this was a critical issue over bright surfaces. It is recommended to update the algorithm in order to consider different initial visibility values, based for example, on reference climatologies.