Potential of Optical Spaceborne Sensors for the Differentiation of Plastics in the Environment

Schmidt, Toni; Kuester, Theres; Smith, Taylor; Bochow, Mathias

doi:10.3390/rs15082020

Open AccessArticle

Potential of Optical Spaceborne Sensors for the Differentiation of Plastics in the Environment

¹

Department Remote Sensing, Helmholtz-Centre for Environmental Research—UFZ, 04318 Leipzig, Germany

²

Remote Sensing Centre for Earth System Research—RSC4Earth, Leipzig University, 04103 Leipzig, Germany

³

Remote Sensing and Geoinformatics Section, Helmholtz Centre Potsdam—GFZ German Research Centre for Geosciences, 14473 Potsdam, Germany

⁴

Institute of Geosciences, University of Potsdam, 14476 Potsdam, Germany

^*

Author to whom correspondence should be addressed.

Remote Sens. 2023, 15(8), 2020; https://doi.org/10.3390/rs15082020

Submission received: 27 February 2023 / Revised: 29 March 2023 / Accepted: 3 April 2023 / Published: 11 April 2023

(This article belongs to the Special Issue Remote Sensing of Plastic Pollution)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Plastics are part of our everyday life, as they are versatile materials and can be produced inexpensively. Approximately 10 Gt of plastics have been produced to date, of which the majority have been accumulated in landfills or have been spread into the terrestrial and aquatic environment in an uncontrolled way. Once in the environment, plastic litter—in its large form or degraded into microplastics—causes several harms to a variety of species. Thus, the detection of plastic waste is a pressing research question in remote sensing. The majority of studies have used Sentinel-2 or WorldView-3 data and empirically explore the usefulness of the given spectral channels for the detection of plastic litter in the environment. On the other hand, laboratory infrared spectroscopy is an established technique for the differentiation of plastic types based on their type-specific absorption bands; the potential of hyperspectral remote sensing for mapping plastics in the environment has not yet been fully explored. In this study, reflectance spectra of the five most commonly used plastic types were used for spectral sensor simulations of ten selected multispectral and hyperspectral sensors. Their signals were classified in order to differentiate between the plastic types as would be measured in nature and to investigate sensor-specific spectral configurations neglecting spatial resolution limitations. Here, we show that most multispectral sensors are not able to differentiate between plastic types, while hyperspectral sensors are. To resolve absorption bands of plastics with multispectral sensors, the number, position, and width of the SWIR channels are decisive for a good classification of plastics. As ASTER and WorldView-3 had/have narrow SWIR channels that match with diagnostic absorption bands of plastics, they yielded outstanding results. Central wavelengths at 1141, 1217, 1697, and 1716 nm, in combination with narrow bandwidths of 10–20 nm, have the highest capability for plastic differentiation.

Keywords:

plastic mapping; plastic debris; spectroscopy; sensor simulation; optical remote sensing; multispectral sensors; hyperspectral sensors; machine learning; classification; feature selection

Graphical Abstract

1. Introduction

Plastics designate a group of (semi-)synthetic polymers and combine excellent material properties. They are used for manifold purposes as they are cheap, durable, water resistant, strong, and light. Their mass production and use began in ~1950 with an annual global production of 2 Mt, which has increased to 381 Mt in 2015 [1]. This has resulted in a cumulative global production of ~10 Gt over the last 70 years. Many plastic types have been synthesized to date but only seven are used regularly and account for 95.1% of total plastic production: high-density polyethylene (PE-HD), low-density polyethylene (PE-LD), polyethylene terephthalate (PET), polypropylene (PP), polyurethane (PU), polystyrene (PS), and polyvinyl chloride (PVC) [1]. Packaging is their main application, accounting for almost 45% of all plastic consumption [1]. This implies that most of the plastics produced are usually only used once [2].

The outstanding material properties become a problem once plastics are released into the environment as litter. As all regularly used plastics are non-biodegradable, their total degradation takes several hundred years [3]. An accumulation of plastic debris at disposal sites or in the environment is therefore inevitable. Approximately 79% of all plastics ever produced have been accumulated at disposal sites or in the environment [1]. Consequently, nearly 8 Gt of plastics can nowadays be expected to be in landfills or the environment, where soils and oceans form a final sink [4]. Many studies highlight the accumulation of plastic debris in the marine and coastal environment, with estimates of 60–80% of the total waste being plastics [5]. For the terrestrial environment, it has been determined that more than 50% of all waste are plastics [6]. Plastic waste occurs globally since it is easily distributed by wind and water [7,8]. The persistence of plastics causes an enduring degradation of larger objects into microplastics and the release of problematic substances for long times, which threatens the environment and its organisms [4,9,10]. Moreover, human health is affected by plastics and their additives, even in the natural environment [11,12]. This emphasizes the need of recovering plastic waste before it enters water bodies or decays on land.

The long-chain polymer structure of plastics, consisting of many repetitions of identical monomers with a limited variety of atoms, forms many absorption bands in their spectral reflectance signatures [13]. These absorption bands are relatively narrow and deep compared to natural materials that have smoother signals, such as vegetation or soils. The characteristic and distinct absorption bands in the infrared spectra of plastics are found for wavelengths greater than ~900 nm, i.e., mainly in the short-wave infrared (SWIR) part of the electromagnetic spectrum. This makes them ideal for spectroscopy, as each plastic type has diagnostic absorption bands at specific wavelengths [14,15,16]. Infrared spectroscopy has been used for many decades, e.g., [17,18,19], while applications are found in waste management and the recycling industry. The findings from laboratory infrared spectroscopy applied to remote sensing could be demonstrated for various hydrocarbons at an early stage [20], and have since been intensively researched. The Hydrocarbon Index is capable of detecting hydrocarbons without differentiation using hyperspectral sensors [21]. Whether the benefits of laboratory spectroscopy for plastic differentiation are applicable for spaceborne sensors has recently been studied, e.g., [22,23,24,25,26,27,28,29,30,31,32,33].

Spaceborne remote sensing has the potential to scan large areas continuously. Its application for plastics is young, yet promising, and has the potential to create a global map of plastic debris in the environment [29]. So far, this has not yet been made, as there are only a few satellites with which plastics can be detected according to recent knowledge. The majority of studies use Sentinel-2 or WorldView-3 data and empirically explore the usefulness of the given spectral channels for the identification and differentiation of plastic litter in the environment, e.g., [27,32]. However, there remains a gap regarding a systematic evaluation of spectral configurations of the most common optical spaceborne sensors for their suitability of differentiating plastic types.

To address this gap, our study examines the capability of former and present optical spaceborne multispectral (Dove/Flock PlanetScope, Landsat 5 TM/7 ETM+, Pléiades-HR 1A/1B HiRI, Sentinel-2 MSI, Terra ASTER, Terra/Aqua MODIS, and WorldView-3 WV110) and hyperspectral sensors (EnMAP HSI, EO-1 Hyperion, and PRISMA HYC) to detect and differentiate plastics in the environment. We used infrared reflectance signals of the most commonly used plastic types to simulate their reflectance signals as would be measured on various surface materials by selected optical spaceborne sensors. These simulated sensor-specific reflectance signals were classified according to plastic type using k-nearest neighbors (k-NN) and random forests (RF), yielding information on the various sensor-specific spectral configurations for the differentiation of plastics in the environment. Finally, we propose an optimal hypothetical sensor for plastic type differentiation.

2. Data and Methods

2.1. Spaceborne Sensor Selection

Spectral channel constellations of 22 former and present multi- and hyperspectral optical spaceborne sensors are shown in Figure 1. As some sensors have similar spectral channel configurations or do not have any channels in the SWIR, we made a selection of ten exemplary and representative sensors to use for the spectral sensor simulations. To determine the potential of archive data for estimating spatial and temporal trends of plastics in the environment, we also choose two former sensors (Landsat 5 TM and EO-1 Hyperion). From the present multispectral VNIR sensors, we selected Pléiades-HR 1A/1B HiRI (2011–present, 26 days repeat, 2 m) and Dove/Flock PlanetScope (2014–present, 1 day repeat, 3.5–4 m). From the former and present multispectral sensors that have SWIR channels, too, we choose Terra ASTER (2014–2018, 16 days repeat, 15/30 m), Terra MODIS (1999–present, 1–2 days repeat, 250/500/1000 m), Sentinel-2 MSI (2015–present, 5 days repeat, 10/20 m), Landsat 5 TM/7 ETM+ (1987–2017/2003–present, 17 days repeat, 30 m), and WorldView-3 WV110 (2014–present, <1 days repeat, 1.24/3.7 m). Although the ASTER SWIR channels failed in 2008, we used them for the sensor simulations to determine the potential of the ASTER archive data. Since the channel constellation of Sentinel-2 is very similar to that of Landsat 8/9, but also has more NIR channels, we decided to simulate Sentinel-2. However, the results in the SWIR are also applicable to Landsat 8 and 9. From the former and present hyperspectral sensors, we selected EO-1 Hyperion (2000–2017, ~200 days repeat, 30 m), EnMAP HSI (2022–present, 27 days repeat, 30 m), and PRISMA HYC (2019–present, 29 days repeat, 30 m), as these sensors have bands in the SWIR as well.

2.2. Spectral Database

2.2.1. Plastic Samples

Fifty-three plastic samples of six different types (Table 1) served as the basis for the spectral database. They were of different color, transmissivity, structure, thickness, cleanness, and degree of weathering (Figure 2). However, a diameter of at least 10 mm had to be guaranteed in order to measure them with an ASD Contact Probe. We took the samples from different sources. Some pristine samples were directly from the factory, while we sampled others from everyday objects and packaging. The six plastic types correspond to those that have been assigned a code of 01–06 as part of the International Resin Identification Coding System (RIC). Thus, they are one of the most commonly used types. For the classifications, we handled PE-HD and PE-LD as one plastic type, i.e., PE. They mostly cannot be distinguished even in laboratory spectroscopy as they comprise the same monomers but differ in their molecular structures, which causes different densities [16,35].

2.2.2. ASD Data Collection

We used an ASD FieldSpec 3 imaging spectrometer for reflectance measurements in the VIS and NIR spectral range (Figure 3). It covers 2151 spectral channels at a wavelength range from 350–2500 nm with full widths at half maximum (FWHM) of 3 nm (350–1000 nm) and 10 nm (1000–2500 nm) and spectral sampling intervals (SSI) of 1.4 nm (350–1000 nm) and 2 nm (1000–2500 nm), which are resampled to 1 nm by the instrument’s operating software [36].

We conducted all measurements in a dark room laboratory. For this, we used an ASD Contact Probe, which is a handheld device. This enabled contact measurements of the plastic samples with the benefit of minimizing noise. A single mean spectrum was calculated from 100 individual spectra for each sample by the internal measuring software, respectively.

For the conversion of the measured radiance to apparent reflectance (

R_{a}

), we used a SphereOptics Zenith Polymer© PTFE 90% gray reference with a known reflectance spectrum. The experimental setup consisted of measuring each sample on a white and black sponge rubber mat, respectively, while we excluded the absorption bands of the sponge rubber mats. This setting was necessary to extract real reflectance (

R_{r}

) and transmittance (T) of the measured

R_{a}

, i.e., the reflectance of the sample and its background (Appendix A.2).

2.2.3. HySpex Data Collection and Processing

Since with the ASD Contact Probe only a single spectrum per sample were possible, we also applied imaging spectroscopy. We used a NEO HySpex SWIR 320m-e imaging spectrometer for reflectance measurements in the SWIR spectral range (Figure 3). It covers 256 spectral channels at a wavelength range from ~970–2500 nm with a FWHM of 7 nm and an SSI of ~6 nm [37].

We carried out the measurements in a dark room laboratory after internal protocol [38]. The spectrometer measured across the movement direction of the stage and averaged eight records per row in order to obtain a higher signal-to-noise ratio (SNR). In this way, images were created from which we gained 50–100 individual spectra from each sample by manually selecting the regions of interest (ROIs). This resulted in a total of 10,182 spectra, which, on average, corresponds to ~192 spectra per sample. We selected the ROIs in areas showing no specular reflections, which are caused by anisotropic reflectance of a material. We again used the SphereOptics Zenith Polymer^© PTFE 90% gray reference to convert the measured radiance to

R_{a}

. The experimental setup was similar as for the ASD spectrometer in order to allow the calculation of

R_{r}

and T (Appendix A.2). However, instead of white sponge rubber mats, we placed white glass fiber filters underneath the samples, which have no absorption bands.

All of the spectra that we picked on the HySpex images were subjected to automated flagging and subsequent cleaning using five conditions, which are described in Appendix A.1. This resulted in a total of 8261 spectra (~156 spectra per sample), which were available for further analysis. Since the signals that we measured with the ASD already correspond to an average spectrum of over 100 spectra through the device, they were not subjected to flagging and cleaning.

Finally, we merged reflectance spectra measured with the ASD spectrometer with those measured with the HySpex imaging spectrometer in order to maximize the number of spectra, while exploiting the total spectral range of 350–2500 nm (Appendix A.3).

2.3. Spectral Database Processing

2.3.1. Layered Spectral Mixture with Background Surfaces

Due to the transparency of many plastic samples, it is important to consider background surfaces for the evaluation of sensors to differentiate plastics in the environment. We used reflectance spectra in wavelengths from 350–2500 nm of ten typical land cover types: sea water [39], heathland [40], green meadow [41], dry meadow [41], peat soil [42], wet sandy soil [43], sandy soil [43], sand [40], asphalt [44], and concrete [44], whose spectra are shown in Figure 4. The spectrum of the sea water as the only non-terrestrial surface stands out, as water absorbs nearly all energy in the considered wavelength range. Thus, its spectrum is almost completely zero (Figure 4A). This implies that the plastic sample is directly on the water surface, since a slight sinking would obscure its reflectance.

To obtain the apparent reflectance spectra,

R_{a, s}

, of plastic samples as they would appear on the different environmental surfaces, we applied a layered spectral mixture, weighted by the square of the plastic sample transmissivity,

T_{s}

, as follows:

R_{a, s} = R_{r, s} + T_{s}^{2} \cdot R_{r, b},

(1)

where

R_{r, s}

is the real reflectance of the plastic sample and

R_{r, b}

is the real reflectance of the background.

This allowed for the increase of the number of

R_{a, s}

spectra by a factor of ten, which is particularly useful for plastic types with few spectra. For each sensor and background, a total of 8261 spectra were then available. Beforehand, we resampled the discrete environmental surface spectra to match the wavelengths of the laboratory plastic measurements. Both sandy soil surfaces (Figure 4F,G) were measured up to 2450 nm only. However, since simulated multispectral sensors have no spectral channels in wavelengths above 2450 nm, we retained them.

2.3.2. Spectral Sensor Simulation

We used the apparent reflectance spectra of the plastic samples on the different background surfaces to carry out the spectral sensor simulation (Figure 5). There are many steps involved in an end-to-end sensor simulation that comprises a forward and a backward simulation [45,46]. The forward simulation involves the spectral resampling, to which we limited the simulation in the scope of this study. This facilitates the comparison of different spectral specifications despite spatial limitations (see also Section 4.3).

Spectral resampling requires the sensor’s spectral response functions (SRF), which comprise the spectral response per channel (

R_{λ}

) at discrete wavelengths. Typically, SRFs are measured under laboratory conditions [47] but can also be modeled. In this case,

R_{λ}

is defined as follows [46]:

R_{λ} (λ) = \{\begin{matrix} e^{- 2 {(\frac{\sqrt[2]{\ln 4} (λ - λ_{c})}{Δ λ})}^{2}} & if Δ λ < 10 nm, \\ e^{- 32 {(\frac{\sqrt[6]{\ln 4} (λ - λ_{c})}{Δ λ})}^{6}} & otherwise, \end{matrix}

(2)

where

λ

is the wavelength at which

R_{λ}

is calculated,

λ_{c}

is the central wavelength of the channel, and

Δ λ

is the bandwidth, i.e., full width at half maximum (FWHM). As shown in Equation (2), the response of narrow channels, i.e.,

Δ λ < 10 nm

, can be approximated with a normal distribution, whereas that of broader channels, i.e.,

Δ λ \geq 10 nm

, with a modified normal distribution [45]. We computed the SRFs of Pléiades (Figure A3A), PlanetScope (Figure A3B), EnMAP, Hyperion, and PRISMA, for which no SRFs have been published. For the remaining sensors, we used the published SRFs.

We did not use cirrus and water vapor channels for simulations, because of their ability to measure atmospheric phenomena only. This reduced the number of simulated channels from MODIS, Sentinel-2, and WorldView-3 compared to their total number of channels. The channels selected for the simulations are found in Appendix B.1.

We also added noise to the simulated reflectance signals to investigate the influences of hypothetical signal-to-noise ratios (SNRs) or atmospheric noise on the classification performances. Since published SNRs are valid for radiance data, we did not use them to compute the noise, as it would suggest a wrong accuracy. Instead, we added random Gaussian noise with a mean of 0 and a standard deviation of 0.05 to the simulated signals. We fixed a random seed and the noise arrays for the classifications using both k-NN and RF to ensure the same data for all simulations. If the noise addition resulted in negative values or in values above 1, we set the reflectance to 0 or 1, respectively. Afterwards, we applied a one-dimensional Gaussian filter with a kernel standard deviation of 0.06. This created more realistic data, as reducing noise before classifications is a common pre-processing step.

2.3.3. Continuum Removal

We applied a continuum removal, adapted after Mielke et al. [48], to the simulated reflectance spectra of hyperspectral sensors in order to normalize the spectra by their continuum so that the absorption bands are emphasized and comparable regardless of the albedo of the sample [49]. Finally, we masked the simulated reflectance spectra of the hyperspectral sensors at wavelength ranges of water absorption bands, i.e., 1350–1410 nm and 1815–1935 nm [34]. This ensured comparable classification performances between multispectral and hyperspectral sensors, as the former do not have channels in these wavelength ranges, except for cirrus or water vapor channels, which we did not simulate.

2.4. Classification

2.4.1. Classifiers

The higher accuracies of machine learning classifiers compared to traditional parametric classifiers for applications in remote sensing are found in many studies [50]. We classified the plastic types of simulated reflectance signals using two probabilistic supervised machine learning classifiers: the non-ensemble method k-nearest neighbor (k-NN) and the ensemble method random forest (RF). Unlike other classifiers, k-NN does not train a model but uses the entire training set as a reference for classification [50,51]. RF uses an ensemble of decision trees to make predictions [52]. We applied two different classifiers to compare their performances against each other using the identical spectral database. To optimize the performance of the classifiers, we used exhaustive grid search to tune their parameters, as outlined in Appendix C.1. The degrees of freedom for the classifications were plastic type, background surface, sensor, and noise.

2.4.2. Performance Metrics

To validate the classifications, we employed an 8-fold cross-validation using equalized stratified random sampling, resulting in various confusion matrices (see Appendix C.2). Class-based evaluation is essential for multi-class classifications, as performance differences can arise between classes. We calculated precision, recall, F1 score, and macro F1 score from the respective confusion matrices.

2.4.3. Case Study Almería

To assess the applicability of the classifiers trained with the simulated signals on actual remote sensing data, we selected a case study on plastic roofs of greenhouses in Almería, southeast Spain. These greenhouses were built in large numbers since 1960 and offer a suitable environment for a case study. To date, the area of greenhouses has expanded to almost 500 km² in this region, while new greenhouses continue to be built. On satellite imagery, they can be well detected, as their white-appearing roofs are very clearly visible in the dry and sparsely populated region (Figure 6). With a few exceptions, their roofs and walls are made of transparent PE films [53]. This enables the classifiers to be tested since the true positive is known in advance.

We used satellite imagery of a part of Almería captured with PRISMA on 24 June 2020 and with Sentinel-2 on 29 June 2020 to determine the plastic types of the greenhouse roofs. As the classifiers can only differentiate between the five selected plastics, we masked the greenhouses on both images in advance using a deterministic albedo threshold approach. As the greenhouses appear white in the VIS, we set an albedo of 0.20 (PRISMA) and 0.25 (Sentinel-2) in the VIS as the threshold for identification. For these masked areas, we applied k-NN and RF as trained using the respective simulated signals to differentiate between the plastic types.

2.5. Hypothetical Sensor Definition

We proposed a hypothetical sensor with an optimal spectral channel constellation for the differentiation of plastics in the environment. For this, we performed a forward greedy selection using the k-nearest neighbor (k-NN) classifier. To select individual spectra per plastic type, we used equalized stratified random sampling. Input spectra for the forward greedy selection were continuum-removed apparent reflectance spectra on all background surfaces, masked at wavelength ranges of water absorption channels. As the distinctive absorption channels of plastics are in SWIR, we only selected the wavelength range from 1100–2500 nm. From the remaining 959-dimensional feature space, we selected ten subsets of features, while each subsequent set contains the features of the previous set. We later used these features as the central wavelengths of the hypothetical sensor for classifications. Using different FWHMs (10, 15, 20, 50, and 100 nm), we classified all feature sets with k-NN and RF, respectively. We finally suggested the central wavelengths–FWHM combination with the highest mean macro F1 score as the ideal hypothetical sensor.

3. Results

3.1. Classification

3.1.1. Macro F1 Scores: Influence of Sensor, Background, and Noise

In order to initially compare the performance of the two classifiers on a large scale, the variability of the macro F1 scores that were achieved using all sensors on all background surfaces are shown in Figure 7, while simulated signals without and with 5% noise are considered separately.

The results using RF were generally higher than using k-NN. For simulated signals to which no noise was added, this was true for 76.0% of all cases. In particular, for sensors whose macro F1 scores were relatively high, i.e., ASTER, WorldView-3, and the hyperspectral sensors, the difference between the two classifiers was greatest, while RF consistently achieved higher scores. For signals of multispectral VNIR sensors and multispectral VNIR/SWIR sensors (except ASTER and WorldView-3), the classification results with both classifiers were very similar. For simulated signals to which 5% noise was added, the classification performances using multispectral sensors had very small deviations between both classifiers and were in the same range, regardless of whether they belonged to the VNIR or the VNIR/SWIR sensors. For the latter, the macro F1 scores were significantly lower than using signals without noise. The variability of the macro F1 scores using signals of multispectral sensors with noise was significantly lower than without noise. ASTER and WorldView-3 also no longer stood out. Only the signals of the hyperspectral sensors could be reliably classified even with noise, despite lower macro F1 scores, while RF surpassed k-NN. The variability of the score between both classifiers was similar, and thus their accuracies. The opposite partly applies to their precisions, which were similar for multispectral VNIR sensors, but slightly different for multispectral VNIR/SWIR, and even greater for hyperspectral sensors, so that RF had a higher precision, i.e., higher macro F1 scores were generally achieved. Due to this, the results are evaluated in the following with this classifier as they reflect more robust trends, unless outstanding results obtained with k-NN are noteworthy.

Regardless of the similarities and differences in the performance of the two classifiers, clear differences can be identified between the sensor types in terms of their potential to differentiate plastics. This can also be seen in the context of the differentiated consideration of the macro F1 score per sensor and background surface using noise-free signals classified with RF (Figure 8). Multispectral VNIR sensors (Pléiades and PlanetScope) output the lowest mean macro F1 scores compared to other sensor types, as could already be deduced from Figure 7. The scores achieved with the signals of this sensor type had very similar means and standard deviations, i.e.,

0.21 \pm 0.05

(Pléiades) and

0.22 \pm 0.03

(PlanetScope), but different ranges, i.e., 0.14–0.30 (Pléiades) and 0.18–0.27 (PlanetScope) (Figure 8A). These low scores were not better than random classification, i.e., 0.2 (using five classes), and are the logical consequence of plastics having no diagnostic absorption bands up to ~900 nm (see also Figure 9).

The results using signals of multispectral VNIR/SWIR sensors showed greater variability and, as was mentioned above, can be split into two clusters, i.e., MODIS, Sentinel-2, and Landsat 5/7 and ASTER and WorldView-3 (Figure 8B). Macro F1 scores achieved with sensors of the first cluster were in the range from 0.21 to 0.37, with means and standard deviations of

0.31 \pm 0.05

(MODIS),

0.26 \pm 0.05

(Sentinel-2), and

0.26 \pm 0.04

(Landsat 5/7). Thus, they were either in the same range of the scores achieved with multispectral VNIR sensors or insignificantly higher. In contrast, the scores achieved with ASTER and WorldView-3 were the highest values among this sensor category, while both ranged from 0.37 to 0.56 but had different means and standard deviations, i.e.,

0.43 \pm 0.05

(ASTER) and

0.48 \pm 0.06

(WorldView-3).

Classifications using continuum-removed signals of all hyperspectral sensors were most successful, which is reflected in their relatively high mean macro F1 scores, which were

0.83 \pm 0.03

(Hyperion),

0.85 \pm 0.03

(EnMAP), and

0.86 \pm 0.03

(PRISMA) (Figure 8C). In contrast, when classifying simulated reflectance signals of the hyperspectral sensors, without removing their continua in advance, mean macro F1 scores of

0.49 \pm 0.03

(Hyperion),

0.51 \pm 0.04

(EnMAP), and

0.50 \pm 0.03

(PRISMA) were obtained. These scores were thus in the range of those of WorldView-3 and therefore did not stand out compared to this multispectral sensor. However, after continuum removal, as is often done with hyperspectral data, the results stood out. This suggests that a classification of plastics with hyperspectral sensors can achieve the best results among all selected types of sensors, as they scan the spectral range in which are the diagnostic absorption bands of plastics with a sufficiently high spectral resolution.

Mean macro F1 scores among the background surfaces were very similar and ranged from 0.43 (sand) to 0.50 (heathland). This indicates that the sensors performed with a relatively consistent performance across the various surfaces. While variations in macro F1 scores were observed between sensors on specific surfaces, no single background surface was found to consistently yield exceptional classification results across all sensors. Therefore, the modest differences in macro F1 scores between backgrounds were not statistically significant and unlikely to be of practical consequence.

3.1.2. F1 Scores: Influence of Plastic Type

Differences among sensor types are also evident when considering performances per plastic type, quantified by the mean F1 scores over all background surfaces per sensor (Figure 10). In contrast to the background surfaces (see paragraph above), there was a greater variability between plastic types, such that the mean F1 scores range from

0.40 \pm 0.30

(PP) to

0.53 \pm 0.17

(PVC).

The greatest differences between the individual plastic types can be found in the analysis of the individual sensor types. The classification of PS generally had the broadest range, and the signals of multispectral VNIR sensors resulted in very low scores, i.e., 0.04 (Pléiades) and 0.06 (PlanetScope), which were markedly below the mean scores of both sensors, i.e.,

0.21 \pm 0.05

(Pléiades) and

0.22 \pm 0.03

(PlanetScope) (Figure 10A). These very low values also stand in contrast to all other plastic types, with achieved value ranges from 0.21 (PET) to 0.29 (PE) using simulated Pléiades signals and 0.16 (PP) to 0.32 (PVC) using simulated PlanetScope signals.

The relatively low mean F1 score of PP achieved with PlanetScope was part of the trend within the multispectral VNIR/SWIR sensors, as this plastic type had the lowest mean F1 score on average within this group (Figure 10B). This trend was evident for all classification results of signals of multispectral VNIR/SWIR sensors, regardless of the better classification results that were generally achieved with ASTER and WorldView-3. For the remaining plastic types, the better classifiability with both sensors was applicable, whereby the highest mean F1 scores were achieved with PVC, i.e., 0.61 (ASTER) and 0.58 (WorldView-3). These values stand out especially for signals of ASTER since its mean F1 score was in the range of 0.36–0.61, which was achieved for plastic types different from PP. For signals of WorldView-3, this range was not as broad (0.51–0.58). Among the other sensors in this category, PVC was also consistently best classified, with mean F1 scores of 0.52 (MODIS), 0.42 (Sentinel-2), and 0.37 (Landsat 5/7). As the lowest mean score among these three sensors, PS—as classified using MODIS signals—stands out again (0.13), which could already be classified very poorly with multispectral VNIR sensors.

The observations described for classifications with signals of multispectral sensors were not evident for those from hyperspectral sensors (Figure 10C). On the one hand, the variabilities between the plastic types with hyperspectral sensors were not as high since the mean F1 score ranges 0.70–0.90 (Hyperion), 0.75–0.91 (PRISMA), and 0.77–0.91 (EnMAP) were achieved. On the other hand, the lowest and most different mean F1 scores were for PVC, despite the highest mean F1 score of PVC over all sensors.

3.1.3. Case Study Almería

Assuming that all greenhouse roofs are made of PE [53], a recall of 0.74 using PRISMA (Figure 11A) and 0.22 using Sentinel-2 (Figure 11B) was obtained as classified with RF, respectively. As no other classes are to be expected, the precision is 1. Thus, the recall equals the macro F1 score, which, on average, were

0.86 \pm 0.03

using simulated PRISMA signals and

0.26 \pm 0.05

using simulated Sentinel-2 signals. Although the performance with the simulated data was slightly better for PRISMA, the result using the hyperspectral image of Almería was very good and reflected the high potential of hyperspectral sensors for the differentiation of plastics. The low performance using the Sentinel-2 image confirms that there is no potential for plastic differentiation using multispectral VNIR/SWIR sensors that have only a few broad SWIR channels.

3.2. Hypothetical Sensor Definition

The dimensional reduction of the feature space has many benefits. The sequential greedy forward selection in the SWIR was limited to ten features, which are shown in Figure 9. The first two features were at 1217 and 1716 nm and were very close to the plastic features at 1215 and 1732 nm published by Martínez-Vicente et al. [29] and to the hydrocarbon feature at 1730 nm published by Kühn et al. [21]. Additionally, both selected features are in the wavelength range of two channels of WorldView-3, while the first is in the wavelength range of the channels with the lowest central wavelength in the SWIR among all selected multispectral VNIR/SWIR sensors. The third feature at 1697 nm also matches most absorption bands of the plastic types indicative for the features published by Kühn et al. [21] and Martínez-Vicente et al. [29]. The fourth feature at 1141 nm is located at the shoulder of the absorption bands with the lowest wavelength positions in the SWIR. Feature five at 2407 nm is very close to the central wavelength of ASTER’s channel 9 at 2395 nm, which is the channel with the highest central wavelength among the selected multispectral VNIR/SWIR sensors. The following three features at 1554, 1328, and 2371 nm did not match any absorption bands of the mean plastic spectra but might be crucial for differentiating between certain samples. The ninth feature at 2214 nm is located again at the shoulder of absorption bands and matches with channels of ASTER, Landsat 5/7, Sentinel-2, and WorldView-3. The last feature at 2469 nm was very close to the maximum wavelength of the measured signals and did not match any channel of the selected multispectral VNIR/SWIR sensors, as the atmospheric transmittance at this wavelength is very low.

The first four features (at 1141, 1217, 1697, and 1716 nm) performed best among all feature sets, while a narrower FWHM increases the macro F1 scores (Figure 12). This is evident for all sets of features, while the narrower the channels are, the better the performances are. The macro F1 score was higher than using WorldView-3 signals (

0.48 \pm 0.06

) while the number of SWIR channels was lower (four SWIR channels) than that of WorldView-3 (eight SWIR channels). As absorption bands of plastics are narrow too, they can best be measured using narrow channels. An increase in the number of channels, i.e., more than four, does not improve the potential of the hypothetical sensor to differentiate plastics. The performance using k-NN (Figure 12A) is generally better than using RF (Figure 12B). This can be attributed to the choice of k-NN as the classifier for the sequential greedy forward selection. However, as the general trends obtained with both classifiers are similar, with the four features having the highest scores and the correlation of FWHM and score, another run with RF was not completed.

4. Discussion

4.1. Channel Characteristics

The characteristic absorption bands in the SWIR of reflectance spectra of plastics are essential for their differentiation. This was already conducted successfully using laboratory infrared spectroscopy [54], airborne [55], and spaceborne remote sensing [31]. The spectra of some plastics have absorption bands in the NIR (Figure 9). However, there is usually only a single absorption band, which is of shallower depth than absorption bands in the SWIR. In the VIS, in turn, only the color of the plastic can be measured. A classification using VNIR sensors can thus be affected by a color bias, i.e., a distortion of the classification results by the colors of the samples.

Figure 9 shows the spectral channels of the selected sensors and the continuum-removed mean spectra of the selected plastic types. Multispectral VNIR sensors typically only have one NIR channel, as do the selected Pléiades and PlanetScope. Since the reflectance signals in VIS do not contain any information on the plastic type, it cannot be identified with a single NIR channel. Pléiades and PlanetScope mainly differ in the FWHM of their channels, while their central wavelengths show only minor deviations. Pléiades has the broadest NIR channel with 200 nm among a large selection of multispectral VNIR sensors (Figure 1), while PlanetScope has an 80 nm narrow NIR channel. The VIS channels also differ between the two in that Pléiades has broad and relatively strongly overlapping channels and PlanetScope has relatively narrow ones. These differences do not affect the performance of the classification of their signals, though small FWHM potentially increase the classification performance as obtained from the feature selection. By the use of noise-free signals classified with RF mean macro F1 scores of

0.21 \pm 0.05

(Pléiades) and

0.22 \pm 0.03

(PlanetScope) have been obtained. Given five classes (PE, PET, PP, PS, and PVC), the accuracy of correctly classifying a class using random choice would be 0.2. Thus, the classification results using signals of multispectral VNIR sensors are not better than a random classification, which was expected since plastics have no diagnostic absorption bands up to ~900 nm.

Multispectral VNIR/SWIR sensors have a better potential to differentiate between plastics due to their SWIR channels. However, this only applies to a limited selection of sensors of this type that stand out due to their relatively high number of narrow SWIR channels, namely ASTER and WorldView-3. The absorption bands of plastic spectra are also narrow, which is why they are reflected more clearly in narrow than in broad channels, provided that the central wavelengths of the channels match the positions of the absorption band wavelengths of the plastics. ASTER and WorldView-3 do have multiple narrow SWIR channels, which are located at positions of absorption bands and their shoulders of plastics. ASTER had five and WorldView-3 has four narrow SWIR2 channels and WorldView-3 also has three narrow SWIR1 channels that no other selected multispectral VNIR/SWIR sensor has. This makes ASTER and WorldView-3 signals generally better for classifying plastics, while the better performance of WorldView-3 signals is attributed to its three SWIR1 channels. In addition to the position, depth and width are further parameters of absorption bands. In laboratory spectroscopy, e.g., the band depth ratio of the absorption bands is used to determine the material. To obtain this with multispectral sensors, channels at the shoulder and valley of the absorption bands must be given, which partly applies to ASTER and WorldView-3. In contrast to these two sensors, Landsat 5/7 and Sentinel-2 only have two broad SWIR channels, respectively. The narrow absorption bands of plastics are not well sampled by these broad SWIR channels. Though the FWHM of the SWIR channels of Sentinel-2 are lower than those of Landsat 5/7, it is not beneficial for plastic differentiation. This is reflected in the same low mean macro F1 score of Landsat 5/7 (

0.26 \pm 0.04

) and Sentinel-2 (

0.26 \pm 0.05

), which are only slightly higher than that of the multispectral VNIR sensors. Furthermore, the two NIR channels of Sentinel-2, while Landsat 5/7 only has one, do not increase the classification performance of its signals. Other than the selected multispectral VNIR/SWIR sensors with single SWIR1 and SWIR2 channels, e.g., OLI (Landsat 8/9), are also very unlikely to be able to differentiate between plastics. MODIS has three SWIR channels, two of which are at similar wavelengths as those of Landsat 5/7 and Sentinel-2 and one is at a lower wavelength, i.e., channel 5 at 1240 nm. In addition, the FWHM of the MODIS channels are relatively small, ranging from 10–50 nm, which is nearly an order of magnitude smaller compared to that of Landsat 5/7. The higher number of SWIR channels and the lower FWHM do not noticeably increase the potential of MODIS for plastic differentiation but is slightly higher than that of Landsat 5/7 and Sentinel-2, while a mean macro F1 score of

0.31 \pm 0.05

was obtained. In general, the number of SWIR channels, whose FWHM must be as small as possible and whose central wavelengths must match the positions of the absorption bands of plastics and their shoulders, are decisive for a good classification of plastics, which was also demonstrated by the results of the feature selection experiment.

The limitation to a relatively small number of discrete channels does not apply to hyperspectral sensors. With their channels, an entire part of the spectrum is covered, which means that the absorption bands of the plastics can be measured in full. Since the channels of EnMAP, Hyperion, and PRISMA differ only slightly in their central wavelengths and FWHM, similarly high mean macro F1 scores were achieved, i.e.,

0.83 \pm 0.03

(Hyperion),

0.85 \pm 0.03

(EnMAP), and

0.86 \pm 0.03

(PRISMA). This suggests that data from future hyperspectral missions (e.g., ESA CHIME, NASA SBG, or ASI/ISA SHALOM) can also be successfully used for plastic differentiation.

The water absorption bands of the atmosphere were masked in the hyperspectral signals before the classifications, respectively. However, if these are not masked, very similar or equal macro F1 scores are obtained with the sensors, i.e.,

0.85 \pm 0.04

(Hyperion),

0.85 \pm 0.03

(EnMAP), and

0.85 \pm 0.03

(PRISMA). This means that the potential of hyperspectral sensors for the differentiation of plastics differs insignificantly from the potential of laboratory spectroscopy, while the practical potential is significantly reduced by the usually lower SNR of the spaceborne sensors compared to those of laboratory sensors. The consideration of absorption bands of plastics that are outside the atmospheric windows do not improve their differentiation. This was also evident from the mainly correct classification of the PE greenhouses in Almería.

The exclusive use of SWIR channels in comparison to the use of all channels leads to better classifications of signals of multispectral VNIR/SWIR and hyperspectral sensors, respectively. This difference is greatest, when using k-NN (Figure 13A) instead of RF (Figure 13B). With k-NN, the classification of signals of ASTER, MODIS, and WorldView-3 improve most if only the SWIR channels are used. The differences in their mean macro F1 scores using noise-free signals by using SWIR channels only compared to using all channels were 0.13 (ASTER), 0.17 (MODIS), and 0.28 (WorldView-3). For MODIS signals, this improvement means that the results were almost the same as those of ASTER signals, though MODIS has three SWIR channels compared to ASTER, which had six SWIR channels. With RF, these differences were with 0.07 (ASTER), 0.08 (MODIS), and 0.10 (WorldView-3), not as large. However, these improvements prove the fact that the absorption bands of plastics in the SWIR are decisive for their differentiation and that the consideration of the VIS and the NIR has the potential to cause a bias.

The variability of the mean F1 scores, i.e., among the plastic types, can hardly be explained by the spectral characteristics of the selected sensors. Although some channels match the absorption bands of some plastic types, this is difficult to trace back due to the variability in the signals between the samples. For example, PS had very low F1 scores using signals of Pléiades (0.04) and PlanetScope (0.06). However, as shown in the distribution of true positives and false negatives of PlanetScope (Figure 14A), there was no plastic type classified predominantly instead of PS, i.e., false negatives. Instead, their relative proportions were very similar. It was different with PP, which could be classified worst with the signals of all multispectral VNIR/SWIR sensors. For this plastic type, there was a trend towards false negatives with signals from WorldView-3 (Figure 14B) and PlanetScope (Figure 14A), so that predominantly PE was classified instead of PP. A similar relationship applies to the classification of PVC for these two sensors, such that the most common false negative for both was PP. These trends were not reflected in the distributions of true positives and false negatives using PRISMA signals (Figure 14C) since the proportions of true positives were consistently very high.

4.1.1. Continuum Removal

The classification results of the hyperspectral spectra were significantly improved by removing their continua. This can be seen when comparing the mean macro F1 scores between classifying continuum-removed spectra, i.e.,

0.83 \pm 0.03

(Hyperion),

0.85 \pm 0.03

(EnMAP), and

0.86 \pm 0.03

(PRISMA), and the spectra whose continua were not removed, i.e.,

0.49 \pm 0.03

(Hyperion),

0.51 \pm 0.04

(EnMAP), and

0.50 \pm 0.03

(PRISMA). This shows that k-NN and RF, as representatives for machine learning estimators, do not automatically find the most decisive features in the spectra, but rather benefit from training them with preprocessed and derived features. In addition to the continuum removal, a feature selection would limit high-dimensional feature spaces in such a way that a better performance of the classifiers would be possible.

In order to improve the ability of the hypothetical sensor to differentiate plastics, the continuum removal could also be used. However, the continuum removal can only be calculated if channels are also at the shoulders of the diagnostic absorption bands, i.e., in the vicinity of their central wavelengths. For this, the hypothetical sensor would need supporting spectral channels on both sides of the four selected channels. These would only be needed to calculate the continuum removals and not for the actual classification.

4.1.2. Background Surfaces

The background surface is one of many factors that influence the reflectance signal of a transparent or translucent material. In addition, the lighting conditions or the thickness of the material also have an influence on the signal [56]. This partly explains the low variability in the classification results of the spectra between the 10 different background surfaces (Figure 8). This means that the plastics can be classified with similar accuracy on all backgrounds using the signals from the selected sensors. However, some outliers were achieved. On sand, multispectral VNIR sensors as well as Landsat 5/7, MODIS, and Sentinel-2 achieved the lowest macro F1 scores. Sand is the background spectrum with the highest albedo, while it has almost no absorption bands (Figure 4H). The reflectance of the plastics is thus increased the most by the layered spectral mixture with the sand reflection, which means that the variance of the signals of the plastic samples increased. This can lead to overlapping of the albedo of the signals of different plastic types, which makes it difficult to differentiate between the types, especially when using the signals from the sensors mentioned. This is also confirmed by the fact that the layered spectral mixture with asphalt achieved significantly better macro F1 scores since asphalt is the background surface with the second lowest albedo (after sea water). Thus, the variance of the signals of the plastic samples was not increased. In addition, the asphalt spectrum had no absorption bands. Wet sandy soil, on the other hand, did, while it has a similarly low albedo to asphalt. However, its absorption bands lead to an overlap of the absorption bands of the plastics, which causes lower performances in the classifications.

As the only non-terrestrial background surface, sea water has the distinctiveness that its reflection is almost 0 in the entire observed wavelength range since water in the infrared range absorbs almost all of the energy. The classification on this surface can thus be seen as a benchmark, as almost no further reflection was added to the plastic spectra, and so the classification corresponds to that of the pure reflectance laboratory spectra, resampled to the respective sensor. This is particularly evident in the results with the hyperspectral sensors. Macro F1 scores of 0.74 (Hyperion), 0.78 (EnMAP), and 0.79 (PRISMA) were achieved and are in contrast to a range of 0.82–0.86 (Hyperion), 0.83–0.88 (EnMAP), and 0.83–0.87 (PRISMA). This leads to the conclusion that an increase in the depth of the absorption bands of the plastics, as is the case with the layered mixture with background spectra above 0, leads to a better classification. With regard to the peculiarity of water as a material in which plastics accumulate in the environment, it should be noted that many plastics do not float on top but within the water column, which leads to a reduction of the apparent reflectance over the complete wavelength range up to total absorption, especially for wavelengths greater than ~700 nm. Since the diagnostic absorption bands of plastics are located at wavelengths greater than ~900 nm, plastics floating in the water column cannot be identified as such, nor differentiated.

4.2. Classification

The use of the two different machine-learning classifiers, i.e., k-NN and RF, led to differences in the results of the classifications, especially for the sensors ASTER, WorldView-3, and the selected hyperspectral sensors EnMAP, Hyperion, and PRISMA, whose signals obtained the highest macro F1 scores among all sensors. RF generally achieved higher scores than k-NN. However, the higher precision of RF was in contrast to the higher classification speed of k-NN. Using noise-free simulated signals, k-NN took, on average, 85.2 ms for each training of the classifier and 33.4 ms for predicting unclassified spectra, while RF took 86.9 ms for training and 37.8 ms for predicting. Especially in high-dimensional feature spaces, such as signals of hyperspectral sensors, the classification speeds deviated significantly. k-NN took up to 246.9 ms and RF 318.7 ms for testing the classifier using PRISMA data, respectively. This difference becomes more important when the pixels of a hyperspectral image have to be classified instead of a single spectrum. Since the mean macro F1 scores of both classifiers for noise-free hyperspectral spectra are

0.73 \pm 0.05

(k-NN) and

0.84 \pm 0.03

(RF), k-NN can be used for a robust and faster classification. Another difference between the two classifiers was that even if 5% noise was added to the signals, RF still achieved higher macro F1 scores than k-NN. This means that RF also has a higher precision in this case. However, since the determination of optimal model parameter values was carried out with noise-free signals, a (small) bias can be assumed by the use of non-optimal model parameter values.

By default, both supervised classifiers cannot output an unclassified class, i.e., a class that does not appear in the training set. This was tested with two vegetation spectra, i.e., green meadow and dry meadow [41], which were also used for the layered mixture with the laboratory plastic spectra with background surfaces. These spectra were assigned one of the five plastic types used. However, since both classifiers are probabilistic, it can be assumed that an exemplary spectrum does not belong to any of the plastic classes if the probabilities of all classes are low or similar. For the two vegetation spectra, probabilities of 0.13 (PE), 0.03 (PET), 0.45 (PP), 0.08 (PS), and 0.30 (PVC) were output using RF. Thus, the spectra would be classified as PP, since this class has the highest probability. However, as the probabilities were equal for both vegetation spectra, and if one compares them with the probability of an actual PP spectrum (e.g., 0.92), it becomes apparent that it is very likely that the vegetation spectra are not plastics. This heuristic approach is more difficult to apply when spectra are used that are similar to those of plastics, e.g., those of petroleum or hydrocarbons, obtained from it. Another approach would be to mask the pixels on which plastics are mapped in advance. Only then could the differentiation of plastics be carried out. The use of the Hydrocarbon Index (HI) by Kühn et al. [21], which requires hyperspectral data, or the Plastic Greenhouse Index (PGI) by Yang et al. [25], are appropriate channel ratios to mask plastics on an image.

4.3. Applicability

The limiting factor in transferring the classifications of the simulated signals to the measured signals is the spatial resolution of the sensors, as a plastic abundance of 100% must be given for an image pixel in order to classify the plastic type using the trained classifiers of this study. GSDs of the selected sensors range from 3.7 m (WorldView-3) up to 1000 m (MODIS), while the hyperspectral sensors have a GSD of 30 m, respectively. Individual plastic pieces can thus not be recognized on this scale [27]. Since we did not train the classifiers with non-plastic classes, it must be determined before the classification whether a pixel is plastic or not. However, plastics together with other materials can make up mixed pixels [32,33]. This is particularly evident for images with a low spatial resolution. In contrast, high-resolution imagery might be biased by false positives using the trained classifiers of this study.

Nonetheless, the algorithms presented here can be applied to large areas or aggregations of plastic, respectively. For instance, plastics are predicted to accumulate in subtropical gyres, such as the Great Pacific garbage patch, whereas this was not observed using spaceborne remote sensing to date [26]. Greenhouses made of agricultural mulch films can also make up large plastic surfaces that can be captured from space [57], which led to the development of the PGI [25]. For these applications, in particular, parameters of sensors that show suitable channel configurations for the differentiation of plastics in the environment are summarized in Table 2.

The detection of microplastics in satellite images may not be possible at all or only if there is a very high concentration of microplastics in a pixel. However, the aging and weathering of plastics, especially through photodegradation, improves the detectability of plastics even before they break down into microplastics. Photodegradation makes plastics more translucent, leading to more reflected and less transmitted light, and their surface rougher, leading to a more Lambertian reflection in favor of the more specular reflection of a shiny new surface. These effects are beneficial for remote sensing, as they increase the light reflected from the sample [30].

The increasing awareness of plastic pollution and its environmental impact makes it important to determine the area-covering pollution status of larger regions. This need can be addressed by uncrewed aerial vehicle (UAV) remote sensing, equipped with hypothetical sensors that could provide the necessary spatial resolution for this task [58]. Furthermore, customized (spaceborne) sensors tailored to specific applications will soon be available on demand. Furthermore, the CubeSat sector is expanding rapidly. However, even with ideal sensors, remote sensing methods can currently only address the question of plastic coverage on the surface of land or water. To achieve more accurate and comprehensive monitoring of plastic pollution, a combination of fieldwork methods and remote sensing data validation may be required to provide more detailed information [59].

5. Conclusions

This study investigated the ability of various multispectral and hyperspectral sensors to differentiate between five common plastic types in the environment. The results showed that multispectral VNIR sensors are generally unable to differentiate between plastics. Multispectral VNIR/SWIR sensors having multiple narrow SWIR channels, such as ASTER and WorldView-3, showed better performance in classifying plastics. Hyperspectral sensors were found to be most capable of differentiating plastics since their absorption bands are sufficiently sampled by the narrow and contiguous spectral channels of these sensors. Classification results could be improved by continuum removal for hyperspectral signals and using only SWIR channels for multi- and hyperspectral sensors. Ten common background surfaces were considered, but no surface consistently produced outstanding classification results. A hypothetical sensor was derived that has four channels in the SWIR range (at 1141, 1217, 1697, and 1716 nm) for optimal plastic differentiation, while narrow FWHMs (10–20 nm) increased its performance. These findings can aid in differentiating plastics on a large scale and investigating global plastic pollution trends.

Author Contributions

Conceptualization, M.B. and T.K.; methodology, M.B., T.K. and T.S. (Toni Schmidt); software, M.B. and T.S. (Toni Schmidt); validation, M.B., T.K. and T.S. (Toni Schmidt); formal analysis, T.S. (Toni Schmidt); investigation, M.B., T.K. and T.S. (Toni Schmidt); resources, M.B. and T.S. (Toni Schmidt); data curation, T.S. (Toni Schmidt); writing—original draft preparation, T.S. (Toni Schmidt); writing—review and editing, M.B., T.S. (Taylor Smith) and T.K.; visualization, T.S. (Toni Schmidt); supervision, M.B., T.K. and T.S. (Taylor Smith); project administration, M.B.; funding acquisition, M.B. and T.K. All authors have read and agreed to the published version of the manuscript.

Funding

This work was part of the project “Entwicklung Neuer Kunststoffe für eine Saubere Umwelt unter Bestimmung Relevanter Eintragspfade (ENSURE)”, sub-project 4, funded by the German Federal Ministry of Education and Research (BMBF) under grant No. 02WPL1449D.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Acknowledgments

We acknowledge the provision of laboratories and instruments by the Helmholtz Centre Potsdam—GFZ German Research Centre for Geosciences and the provision of further plastic spectra from laboratory measurements by Shanyu Zhou. We also express our gratitude to the three anonymous reviewers for their valuable feedback and suggestions that helped improve the quality of this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

ASI	Agenzia Spaziale Italiana
ASTER	Advanced Spaceborne Thermal Emission and Reflection Radiometer
CHIME	Copernicus Hyperspectral Imaging Mission for the Environment
EO-1	Earth Observing-1
ESA	European Space Agency
ETM+	Enhanced Thematic Mapper Plus
FN	False negatives
FWHM	Full width at half maximum
GSD	Ground sampling distance
HiRI	High-Resolution Imager
IR	Infrared
ISA	Israeli Space Agency
k-NN	k-nearest neighbors
MODIS	Moderate Resolution Imaging Spectroradiometer
MODTRAN	Moderate resolution atmospheric transmission
MSI	MultiSpectral Instrument
NASA	National Aeronautics and Space Administration
NIR	Near infrared
NEO	Norsk Elektro Optikk
OLI	Operational Land Imager
PE	Polyethylene
PE-HD	High-density polyethylene
PE-LD	Low-density polyethylene
PET	Polyethylene terephthalate
PP	Polypropylene
PRISMA	Precursore Iperspettrale della Missione Applicativa
PS	Polystyrene
PTFE	Polytetrafluoroethylene
PU	Polyurethane
PVC	Polyvinyl chloride
RF	Random forest
RIC	International Resin Identification Code
ROI	Region of interest
SBG	Surface Biology and Geology
SHALOM	Spaceborne Hyperspectral Applicative Land and Ocean Mission
SNR	Signal-to-noise ratio
SRF	Spectral response function
SSI	Spectral sampling interval
SWIR	Short wave infrared
TM	Thematic Mapper
TN	True negatives
TP	True positives
UAV	Uncrewed aerial vehicle
VIS	Visible part of the electromagnetic spectrum
VNIR	Visible and near infrared
WV110	WorldView-110

Appendix A. Details on the Spectral Database Processings

Appendix A.1. Data Cleaning of HySpex Data

We defined five conditions to automatically flag and subsequently clean each of the spectra we selected from the HySpex images. From the 10,182 spectra initially picked from the hyperspectral images, 8261 spectra remained after cleansing (Table A1), which on average, corresponds to ~156 spectra per sample that were available for further processings.

Table A1. Flags that we assigned to spectra picked from the HySpex images for subsequent automated cleaning. Since some spectra were flagged multiple times, the sum of the spectra per flag exceeds the sum of all flagged spectra, which is shown below in bold.

Flag	Meaning	Number of Spectra
1	Interference	297 (2.9%)
2	$R > 1$	3 (<0.1%)
3	Slumps	1606 (15.8%)
4	Outliers ( $σ > 3 \bar{σ}$ )	1204 (11.8%)
5	Solitaries	297 (2.9%)
		1921 (18.8%)

Flag 1 is used to indicate relatively strong interference in spectra, which can occur due to reflections causing a phase shift and wavy shapes in the curve. These shapes are deeper with longer wavelengths and occur at a similar frequency, making them easily detectable by visual examination. The filtering and scaling of reflectance spectra on bright backgrounds removes noise and enables the detection of wavy shapes, which are flagged 1 if they appear consistently across all spectra of a sample. The corresponding samples measured on the dark background are detected by the flag 5 algorithm, which is why interference is flagged in advance.

Flag 2 indicates reflectance above 1, which occurs due to specular reflections caused by anisotropic reflectance of the material. Absolute reflectance spectra are obtained by calibrating relative reflectance spectra with the white reference spectrum, assuming ideal diffuse reflection of the sample surface [45]. However, most samples cause complex reflections that are a mixture of specular and diffuse reflections, leading to reflectance above 1 in some regions. This effect is strongest for opaque materials and can be reduced by the Lambertian surfaces of transparent materials and the fixed zenith angle and intensity of the light source.

Flag 3 indicates slumps in the curve shapes of the spectra, i.e., the decrease of the reflectance below 0 at higher wavelengths. These are caused by spectra picked on stripes that appear on the hyperspectral images measured with the NEO Hyspex. These only appear if the SWIR sensor and not also the VNIR sensor is used for measuring and when a new software version of HySpex GROUND is used with settings that were previously selected for all measurements. With 1606 (15.8%) flagged spectra, it is the most comprehensive flag.

Flag 4 indicates outliers. The 3-sigma rule was chosen for outlier detection [60]. A large range caused by spectra with a reflectance above 1 (flag 1) would enlarge the 3-sigma range and would slip through spectra that would be outside 3-sigma without the spectra flagged 1. Thus, those were not taken into account for flag 4.

Flag 5 indicates solitaries, i.e., spectra of samples which are only available for either the bright or the dark background. As reflectance spectra on both backgrounds are needed to calculate the real reflectance and transmittance, these were removed. Counterparts to completely flagged spectra were identified, e.g., flag 1.

Appendix A.2. Extracting Real Reflectance and Transmittance from HySpex Data

For extracting real reflectance (

R_{r}

) and transmittance (T) from the apparent reflectance (

R_{a}

) measurements made with the HySpex, the real reflectance spectra of both backgrounds (

R_{r}, b

) needed to be picked from the images. In total, we respectively picked 1050 spectra of the bright and the dark background from 28 selected images (Figure A1). Then, we removed all spectra that are outside the 3-sigma range. Thus, the number of spectra per background are different: 1047 for the bright and 1041 for the dark background. Both backgrounds had no absorption bands in the considered wavelengths and their spectra were very flat. Their mean reflectance over all channels of the bright background was

0.84 \pm 0.05

and of the dark background was

0.06 \pm 0.01

. We used the mean reflectance per channel (

R_{r, b, bright}

and

R_{r, b, dark}

) for the extraction of

R_{r}

and T of the samples.

Figure A1. Reflectance spectra of the bright (1047 spectra) and the dark background (1041 spectra) and their mean and standard deviations. The axis scaling of R differed among the backgrounds.

For extracting

R_{r}

and T from the ASD measurements, we measured the

R_{r}

spectra of both backgrounds with the ASD spectrometer as well. Lillesaeter [61] published a method to derive a sample’s real reflectance (

R_{r, s}

) and transmittance (

T_{s}

) from its apparent reflectance (

R_{a, s}

), which includes the real reflectance of the background (

R_{r, b}

) as shown in Figure A2. This method requires two different backgrounds per

R_{a, s}

measurement, while both must have different albedos in the entire examined wavelength range [61]. Here, these are given by the reflectance spectra of the bright and dark backgrounds.

R_{a, s}

is defined as follows [61], where

T_{s}

is squared, as the ray runs through the material twice (Figure A2):

R_{a, s} = R_{r, s} + T_{s}^{2} \cdot R_{r, b} .

(A1)

To derive

R_{r, s}

, we measured

R_{a, s}

on the two backgrounds (

R_{r, b, bright}

and

R_{r, b, dark}

).

R_{r, s}

and

T_{s}

are not influenced by the respective background and are the same for both measurements. Then, the equations of

R_{a, s}

adjusted to the respective background are as follows, where the mean reflectance spectra of the backgrounds were used:

R_{a, s, bright} = R_{r, s} + T_{s}^{2} \cdot {\bar{R}}_{r, b, bright}

(A2)

R_{a, s, dark} = R_{r, s} + T_{s}^{2} \cdot {\bar{R}}_{r, b, dark} .

(A3)

By equalizing Equations (A2) and (A3) to

R_{r, s}

,

T_{s}

can be calculated as follows:

T_{s} = \sqrt{\frac{R_{a, s, bright} - R_{a, s, dark}}{{\bar{R}}_{r, b, bright} - {\bar{R}}_{r, b, dark}}} .

(A4)

We picked a different number of

R_{a, s}

spectra for the plastic samples on both backgrounds. Additionally, these spectra have relatively large variance. This can result from anisotropic reflectance, the non-uniform thickness of the samples, or dirt on their surfaces.

Figure A2. Ray path showing the occurrence of the sample’s apparent reflectance (

R_{a, s}

) and its components, i.e., its real reflectance (

R_{r, s}

), its transmittance (

T_{s}

), and the real reflectance of the background (

R_{r, b}

) after Miller et al. [62]. The distance between the sample and the background is enlarged for the purpose of illustration.

Figure A2. Ray path showing the occurrence of the sample’s apparent reflectance (

R_{a, s}

) and its components, i.e., its real reflectance (

R_{r, s}

), its transmittance (

T_{s}

), and the real reflectance of the background (

R_{r, b}

) after Miller et al. [62]. The distance between the sample and the background is enlarged for the purpose of illustration.

Since there are no related reflection pairs between the

R_{a, s}

measurements on either background, we calculated

R_{r, s}

and

T_{s}

using the mean reflectance spectrum of

R_{a, s}

on the complementary background. With this method, the total number of

R_{a, s}

spectra does not have to be reduced as a respective

R_{r, s}

and

T_{s}

spectrum can be derived from each

R_{a, s}

spectrum. Equation (A4) then adapts as follows:

T_{s, bright} = \sqrt{\frac{R_{a, s, bright} - {\bar{R}}_{a, s, dark}}{{\bar{R}}_{r, b, bright} - {\bar{R}}_{r, b, dark}}}

(A5)

T_{s, dark} = \sqrt{\frac{{\bar{R}}_{a, s, bright} - R_{a, s, dark}}{{\bar{R}}_{r, b, bright} - {\bar{R}}_{r, b, dark}}} .

(A6)

However, with this method, it is possible that the results of

T_{s}^{2}

are outside the physically possible range of 0–1. For negative values, the square root results in imaginary numbers for

T_{s}

. The variance within the respective

R_{a, s}

spectra on the complementary background and the use of the mean

R_{a, s}

spectrum causes under- or over-estimations of a suitably related reflectance spectrum. Negative values of

T_{s}^{2}

can occur if the difference between

R_{a, s}

spectra on either background is negative, i.e., if

R_{a, s, bright} < R_{a, s, dark}

or

R_{a, s, bright} < R_{a, s, dark}

. The background spectra have a reflectance close to 1 (bright) or 0 (dark). However, as some samples have very low

T_{s}

, the resulting reflectance spectra on either background are very similar. This potentially implies a subtraction of a higher from a lower reflectance, which causes a negative difference. However, channels with very similar

R_{a, s}

on either background are an indicator of the sample to have zero

T_{s}

in these wavelengths. These occur mostly in the absorption bands where all energy is absorbed for vibrations. Results of

T_{s}^{2}

above 1 can occur if the difference between

R_{a, s}

spectra on either backgrounds is greater than the difference between

R_{r, b}

spectra of either backgrounds, i.e., if

R_{a, s, bright} - R_{a, s, dark} > Δ R_{r, b}

or

R_{a, s, bright} - R_{a, s, dark} > Δ R_{r, b}

. For these cases, we scaled the mean

R_{a, s}

spectrum on the complementary background by the maximum gain of the deviations of the respective spectrum to

R_{a, s}

on the complementary background. These considerations show that the method after Lillesaeter [61] greatly relies on the measurement setting, as already examined by Jacquemoud and Ustin [63].

With

T_{s}

calculated,

R_{r, s}

can be calculated by equalizing Equations (A2) and (A3) to

T_{s}

as follows, where the background can be consistently chosen to obtain the same result:

R_{r, s} = R_{a, s} - T_{s}^{2} \cdot {\bar{R}}_{r, b} .

(A7)

After calculating

T_{s}

and

R_{r, s}

, we tested both for two conditions. First, both have to be in the range from 0–1. If not, we removed those exceeding this range. This led to a loss of ~1.4% due to

T_{s}

and ~0.2% due to

R_{r, s}

. Second, the sum of

T_{s}

and

R_{r, s}

must not exceed 1. If it did, we scaled

T_{s}

and

R_{r, s}

using the inverse of the maximum gain of the sum exceeding 1. We used multiplicative scaling instead of additive shifting to retain the band depth ratio, i.e., shoulder over valley. We scaled approximately 76.4% of all

T_{s}

and

R_{r, s}

. However, scaling factors were generally large with a mean of 0.93 and a minimum of 0.64 and scaling only led to a minor reduction of the total range of

T_{s}

and

R_{r, s}

.

Appendix A.3. Merging of ASD and HySpex Data

Reflectance spectra measured with the ASD spectrometer consisted of a single mean spectrum per plastic sample, while from the images recorded with the NEO spectrometer, we selected 50–100 spectra per sample. In order to obtain the same number of spectra from the images, while covering the entire wavelength range from 350–2500 nm, we cropped and scaled the spectra measured with the ASD device at 1100 nm to merge them with the individual spectra selected from the images.

After merging the spectra, we tested the two conditions again for the extraction of real reflectance and transmittance, i.e.,

T_{s}

and

R_{r, s}

have to be in the range from 0–1 and their sum must not exceed 1. We did not obtain any loss of

T_{s}

and

R_{r, s}

, as both were always in the range from 0–1, respectively. Yet, scaling had to be conducted to avoid their sum exceeding 1. We scaled approximately 34.9% of all

T_{s}

and

R_{r, s}

by factors with a mean of 0.93 and a minimum of 0.71.

Appendix B. Details on the Sensors Selected for Spectral Sensor Simulations

Appendix B.1. SRFs of the Multispectral Sensors

SRFs of multispectral sensors selected for the spectral sensor simulation are shown in Figure A3. In the absence of published SRFs, we estimated the SRFs for Pléiades, PlanetScope, EnMAP, Hyperion, and PRISMA. In contrast, we used the published SRFs of the remaining sensors.

Figure A3. SRFs of multispectral sensors selected for simulations. As Pléiades (A) and PlanetScope (B) do not have channels in the SWIR, only their VNIR channels are shown. The SRFs of (A,B) were approximated using Equation (2), given their central wavelength and FWHM, which were all above 10 nm. For (C–G), published SRFs were used. (D) MODIS channels 1–4 and 6–18 are very close together but do not have their central wavelengths in ascending order. The VNIR/SWIR threshold is at 1100 nm. SRFs of hyperspectral sensors used for simulations, i.e., EnMAP, Hyperion, and PRISMA are not shown due to their narrow SSIs. Earth’s transmission spectrum using MODTRAN is shown in gray [34].

Appendix C. Details on the Machine Learning Classifiers

Appendix C.1. Parameter Tuning for the Machine Learning Classifiers

Grid search is a traditional technique for parameter tuning. It involves a grid of all possible combinations of given values of selected model parameters to maximize a chosen evaluation metric by performing a cross validation for each of these combinations. We determined optimal sets of model parameters and their values for each sensor using the GridSearchCV function from the model selection module and are listed in Table 1. For this, we used noise-free simulated reflectance signals per sensor. We determined the set of optimal model parameter values for every background surface per sensor by maximizing the macro F1 score to derive the mode per parameter for Boolean values and strings or the integer median per parameter for numeric values.

Table A2. Model parameter values as derived from parameter tuning. For k-NN, k is the number of neighbors. For RF, MSS is the minimum number of samples required to split an internal node, MSL is the minimum number of samples required to be at a leaf node, and NE is the number of estimators.

Sensor	k-NN				RF
	Algorithm	Metric	k	Weights	Bootstrap	Criterion	MSS	MSL	NE
Pléiades	ball_tree	chebyshev	8	distance	True	entropy	3	3	20
PlanetScope	ball_tree	chebyshev	9	uniform	True	entropy	4	4	10
ASTER	ball_tree	manhattan	4	distance	True	entropy	3	3	15
MODIS	ball_tree	chebyshev	4	uniform	True	gini	4	2	15
Sentinel-2	ball_tree	euclidean	4	distance	True	entropy	4	4	10
Landsat 5/7	ball_tree	manhattan	5	distance	False	gini	3	3	10
WorldView-3	ball_tree	chebyshev	4	distance	False	entropy	3	2	20
EnMAP	ball_tree	manhattan	6	distance	True	entropy	4	4	50
Hyperion	ball_tree	manhattan	6	uniform	True	entropy	4	4	25
PRISMA	ball_tree	manhattan	6	distance	True	entropy	4	4	50

Appendix C.2. Cross Validation Setup

In cross-validation, the available dataset is divided into a training set and a test set to evaluate the classifier’s performance. Among other classifiers, k-NN and RF are sensitive to imbalanced training sets, where the support per class (number of samples per class) is different [64,65,66]. Purely random sampling can be biased towards classes with higher support, but stratified random sampling can ensure proportional support per class [66]. Here, we applied equalized stratified random sampling to give each class in the training set the same support. The class with the lowest support thus defines the support for all classes in the dataset.

An eight-fold cross validation was used to measure the performance of the classifiers. For each sensor and background surface, a total of 8261 simulated apparent reflectance spectra were available for five plastic types with varying support. To ensure fairness in the training set and avoid bias towards classes with higher support, we used an equalized stratified random sampling to select individual spectra per plastic type. We performed a cross validation to measure classifier performance, so it was important to ensure that spectra of the same sample were not in several folds. To achieve this, we set the number of folds to eight, as this is the minimum number of samples per plastic type (Table 1). We distributed all samples per plastic type to the individual folds. For plastic types with more than eight samples, we placed several samples per type in one fold. Then, we determined the number of spectra per sample. From this, we derived the minimum, which was reduced to an integer multiple of the number of folds, giving 96. Finally, we randomly selected 96 spectra per sample, laboratory background, and fold if there was only one sample per fold per plastic type. Otherwise, we divided the 96 spectra equally between the number of samples per plastic type and fold. We carried out the selection of the sample spectra first and maintained this for the following classifications using the degrees of freedom to ensure the same samples per plastic type for all background surfaces, sensors, and classifiers, respectively.

References

Geyer, R.; Jambeck, J.R.; Law, K.L. Production, use, and fate of all plastics ever made. Sci. Adv. 2017, 3, e1700782. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Hopewell, J.; Dvorak, R.; Kosior, E. Plastics recycling: Challenges and opportunities. Philos. Trans. R. Soc. B Biol. Sci. 2009, 364, 2115–2126. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Avio, C.G.; Gorbi, S.; Regoli, F. Plastics and microplastics in the oceans: From emerging pollutants to emerged threat. Mar. Environ. Res. 2017, 128, 2–11. [Google Scholar] [CrossRef]
Rillig, M.C. Microplastic in Terrestrial Ecosystems and the Soil? Environ. Sci. Technol. 2012, 46, 6453–6454. [Google Scholar] [CrossRef]
Derraik, J.G. The pollution of the marine environment by plastic debris: A review. Mar. Pollut. Bull. 2002, 44, 842–852. [Google Scholar] [CrossRef] [PubMed]
Barnes, D.K.A.; Galgani, F.; Thompson, R.C.; Barlaz, M. Accumulation and fragmentation of plastic debris in global environments. Philos. Trans. R. Soc. B Biol. Sci. 2009, 364, 1985–1998. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Kukulka, T.; Proskurowski, G.; Morét-Ferguson, S.; Meyer, D.W.; Law, K.L. The effect of wind mixing on the vertical distribution of buoyant plastic debris. Geophys. Res. Lett. 2012, 39. [Google Scholar] [CrossRef] [Green Version]
Forsberg, P.L.; Sous, D.; Stocchino, A.; Chemin, R. Behaviour of plastic litter in nearshore waters: First insights from wind and wave laboratory experiments. Mar. Pollut. Bull. 2020, 153, 111023. [Google Scholar] [CrossRef]
Andrady, A.L. Persistence of plastic litter in the oceans. In Marine Anthropogenic Litter; Springer: Berlin/Heidelberg, Germany, 2015; pp. 57–72. [Google Scholar]
Laskar, N.; Kumar, U. Plastics and microplastics: A threat to environment. Environ. Technol. Innov. 2019, 14, 100352. [Google Scholar] [CrossRef]
Wright, S.L.; Kelly, F.J. Plastic and Human Health: A Micro Issue? Environ. Sci. Technol. 2017, 51, 6634–6647. [Google Scholar] [CrossRef]
Revel, M.; Châtel, A.; Mouneyrac, C. Micro(nano)plastics: A threat to human health? Curr. Opin. Environ. Sci. Health 2018, 1, 17–23. [Google Scholar] [CrossRef]
Swain, S.K.; Mohammad, J. Nanostructured Polymer Composites for Biomedical Applications; Elsevier: Amsterdam, The Netherlands, 2019. [Google Scholar] [CrossRef]
Workman, J.; Workman, J. Handbook of Organic Compounds: Methods and Interpretations; Academic Press: Cambridge, MA, USA, 2001; Volume 1. [Google Scholar]
Eisenreich, N.; Rohe, T. Infrared spectroscopy in analysis of plastics recycling. In Encyclopedia of Analytical Chemistry: Applications, Theory and Instrumentation; Wiley: Hoboken, NJ, USA, 2006. [Google Scholar]
Osswald, T.A. (Ed.) International Plastics Handbook: The Resource for Plastics Engineers, 1st ed.; Hanser: Munich, Germany; Cincinnati, OH, USA, 2006. [Google Scholar]
Hausdorff, H.H. Short Cuts to the Analysis of Plastics by Infrared Spectroscopy. Appl. Spectrosc. 1950, 5, 8–13. [Google Scholar] [CrossRef]
Kraft, E. Analysis of Plastics by ATR Spectroscopy; Modern Plastics; McGraw-Hill: New York, NY, USA, 1968. [Google Scholar]
Davies, A.M.C.; Grant, A.; Gavrel, G.M.; Steeper, R.V. Rapid analysis of packaging laminates by near-infrared spectroscopy. Analyst 1985, 110, 643. [Google Scholar] [CrossRef]
Cloutis, E.A. Spectral Reflectance Properties of Hydrocarbons: Remote-Sensing Implications. Science 1989, 245, 165–168. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Kühn, F.; Oppermann, K.; Hörig, B. Hydrocarbon Index—An algorithm for hyperspectral detection of hydrocarbons. Int. J. Remote Sens. 2004, 25, 2467–2473. [Google Scholar] [CrossRef]
Lu, L.; Di, L.; Ye, Y. A Decision-Tree Classifier for Extracting Transparent Plastic-Mulched Landcover from Landsat-5 TM Images. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 4548–4558. [Google Scholar] [CrossRef]
Maximenko, N.; Chao, Y.; Moller, D. Developing a Remote Sensing System to Track Marine Debris. Eos 2016, 97. [Google Scholar] [CrossRef]
Lanorte, A.; De Santis, F.; Nolè, G.; Blanco, I.; Loisi, R.V.; Schettini, E.; Vox, G. Agricultural plastic waste spatial estimation by Landsat 8 satellite images. Comput. Electron. Agric. 2017, 141, 35–45. [Google Scholar] [CrossRef]
Yang, D.; Chen, J.; Zhou, Y.; Chen, X.; Chen, X.; Cao, X. Mapping plastic greenhouse with medium spatial resolution satellite data: Development of a new spectral index. ISPRS J. Photogramm. Remote Sens. 2017, 128, 47–60. [Google Scholar] [CrossRef]
Goddijn-Murphy, L.; Peters, S.; van Sebille, E.; James, N.A.; Gibb, S. Concept for a hyperspectral remote sensing algorithm for floating marine macro plastics. Mar. Pollut. Bull. 2018, 126, 255–262. [Google Scholar] [CrossRef] [Green Version]
Biermann, L.; Vincente, V.M.; Sailley, S.; Mata, A.; Steele, C. Towards a method for detecting macroplastics by satellite: Examining Sentinel-2 earth observation data for floating debris in the coastal zone. In Proceedings of the 21st EGU General Assembly, EGU2019, Vienna, Austria, 7–12 April 2019; p. 1. [Google Scholar]
Hafeez, S.; Wong, M.S.; Abbas, S.; Kwok, C.Y.T.; Nichol, J.; Lee, K.H.; Tang, D.; Pun, L. Detection and Monitoring of Marine Pollution Using Remote Sensing Technologies. In Monitoring of Marine Pollution; Fouzia, H.B., Ed.; IntechOpen: Rijeka, Croatia, 2018; Chapter 2. [Google Scholar] [CrossRef] [Green Version]
Martínez-Vicente, V.; Clark, J.R.; Corradi, P.; Aliani, S.; Arias, M.; Bochow, M.; Bonnery, G.; Cole, M.; Cózar, A.; Donnelly, R.; et al. Measuring Marine Plastic Debris from Space: Initial Assessment of Observation Requirements. Remote Sens. 2019, 11, 2443. [Google Scholar] [CrossRef] [Green Version]
Kuester, T.; Bochow, M. Spectral Modeling of Plastic Litter in Terrestrial Environments-Use of 3D Hyperspectral Ray Tracing Models to Analyze the Spectral Influence of Different Natural Ground Surfaces on Remote Sensing Based Plastic Mapping. In Proceedings of the 2019 10th Workshop on Hyperspectral Imaging and Signal Processing: Evolution in Remote Sensing (WHISPERS), Amsterdam, The Netherlands, 24–26 September 2019; IEEE: Amsterdam, The Netherlands, 2019; pp. 1–7. [Google Scholar]
Biermann, L.; Clewley, D.; Martinez-Vicente, V.; Topouzelis, K. Finding Plastic Patches in Coastal Waters using Optical Satellite Data. Sci. Rep. 2020, 10, 5364. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Zhou, S.; Kuester, T.; Bochow, M.; Bohn, N.; Brell, M.; Kaufmann, H. A knowledge-based, validated classifier for the identification of aliphatic and aromatic plastics by WorldView-3 satellite data. Remote Sens. Environ. 2021, 264, 112598. [Google Scholar] [CrossRef]
Zhou, S.; Kaufmann, H.; Bohn, N.; Bochow, M.; Kuester, T.; Segl, K. Identifying distinct plastics in hyperspectral experimental lab-, aircraft-, and satellite data using machine/deep learning methods trained with synthetically mixed spectral data. Remote Sens. Environ. 2022, 281, 113263. [Google Scholar] [CrossRef]
Berk, A.; Conforti, P.; Kennett, R.; Perkins, T.; Hawes, F.; van den Bosch, J. MODTRAN6: A Major Upgrade of the MODTRAN Radiative Transfer Code; SPIE: Baltimore, MD, USA, 2014; p. 90880H. [Google Scholar] [CrossRef]
Vázquez-Guardado, A.; Money, M.; McKinney, N.; Chanda, D. Multi-spectral infrared spectroscopy for robust plastic identification. Appl. Opt. 2015, 54, 7396. [Google Scholar] [CrossRef] [Green Version]
ASD Inc. FieldSpec 3 User Manual; Technical Report ASD Document 600540 Rev. I; ASD Inc.: Falls Church, VA, USA, 2010. [Google Scholar]
Lenhard, K.; Baumgartner, A.; Schwarzmaier, T. Independent Laboratory Characterization of NEO HySpex Imaging Spectrometers VNIR-1600 and SWIR-320m-e. IEEE Trans. Geosci. Remote Sens. 2015, 53, 1828–1841. [Google Scholar] [CrossRef]
Rogass, C.; Koerting, F.M.; Mielke, C.; Brell, M.; Boesche, N.K.; Bade, M.; Hohmann, C. Translational imaging spectroscopy for proximal sensing. Sensors 2017, 17, 1857. [Google Scholar] [CrossRef] [Green Version]
Siegert, F.; Atwood, E.C.; Piehl, S.; Bochow, M.; Laforsch, C.; Franke, J. Belastung Aquatischer Ökosysteme Mit Kunststoffmüll: Globales und Lokales Monitoring Mittels Satellitengestützter Methoden: Schlussbericht; Berichtszeitraum: 1 July 2013–31 July 2017; Technical Report; Universität Bayreuth: Bayreuth, Germany, 2018. [Google Scholar] [CrossRef]
Neumann, C.; Itzerott, S.; Weiss, G.; Kleinschmit, B.; Schmidtlein, S. Mapping multiple plant species abundance patterns—A multiobjective optimization procedure for combining reflectance spectroscopy and species ordination. Ecol. Inform. 2016, 36, 61–76. [Google Scholar] [CrossRef]
Roessner, S.; Segl, K.; Heiden, U.; Kaufmann, H. Automated differentiation of urban surfaces based on airborne hyperspectral imagery. IEEE Trans. Geosci. Remote Sens. 2001, 39, 1525–1532. [Google Scholar] [CrossRef]
Blasch, G.; Spengler, D.; Itzerott, S.; Wessolek, G. Organic Matter Modeling at the Landscape Scale Based on Multitemporal Soil Pattern Analysis Using RapidEye Data. Remote Sens. 2015, 7, 11125–11150. [Google Scholar] [CrossRef] [Green Version]
Spengler, D. Anwendung Vierdimensionaler Bestandsmodelle für die Charakterisierung von Getreidearten aus Hyperspektralen Fernerkundungsdaten. Ph.D. Thesis, Technische Universität Berlin, Berlin, Germany, 2013. [Google Scholar]
Bochow, M. Automatisierungspotenzial von Stadtbiotopkartierungen durch Methoden der Fernerkundung; Logos-Verlag: Berlin, Germany, 2010. [Google Scholar]
Küster, T. Modellierung von Getreidebestandsspektren zur Korrektur BRDF-Bedingter Einflüsse auf Vegetationsindizes im Rahmen der EnMAP-Mission. Ph.D. Thesis, Humboldt-Universität Zu Berlin, Berlin, Germay, 2011. [Google Scholar] [CrossRef]
Segl, K.; Richter, R.; Küster, T.; Kaufmann, H. End-to-end sensor simulation for spectral band selection and optimization with application to the Sentinel-2 mission. Appl. Opt. 2012, 51, 439. [Google Scholar] [CrossRef] [PubMed] [Green Version]
She, X.; Zhang, L.; Cen, Y.; Wu, T.; Huang, C.; Baig, M. Comparison of the Continuity of Vegetation Indices Derived from Landsat 8 OLI and Landsat 7 ETM+ Data among Different Vegetation Types. Remote Sens. 2015, 7, 13485–13506. [Google Scholar] [CrossRef] [Green Version]
Mielke, C.; Boesche, N.K.; Rogass, C.; Kaufmann, H.; Gauert, C. New geometric hull continuum removal algorithm for automatic absorption band detection from spectroscopic data. Remote Sens. Lett. 2015, 6, 97–105. [Google Scholar] [CrossRef]
Clark, R.N. Spectral properties of mixtures of montmorillonite and dark carbon grains: Implications for remote sensing minerals containing chemically and physically adsorbed water. J. Geophys. Res. Solid Earth 1983, 88, 10635–10644. [Google Scholar] [CrossRef]
Maxwell, A.E.; Warner, T.A.; Fang, F. Implementation of machine-learning classification in remote sensing: An applied review. Int. J. Remote Sens. 2018, 39, 2784–2817. [Google Scholar] [CrossRef] [Green Version]
Shalev-Shwartz, S.; Ben-David, S. Understanding Machine Learning; Cambridge University Press: Cambridge, UK, 2014; p. 416. [Google Scholar]
Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
Valera, D.; Belmonte, L.; Molina-Aiz, F.; López, A. Greenhouse Agriculture in Almeria. A Comprehensive Techno-Economic Analysis; Cajamar Caja Rural: Almeria, Spain, 2016. [Google Scholar]
Karaca, A.C.; Ertürk, A.; Güllü, M.K.; Elmas, M.; Ertürk, S. Plastic Waste Sorting Using Infrared Hyperspectral Imaging System. In Proceedings of the 2013 21st Signal Processing and Communications Applications Conference (SIU), Haspolat, Turkey, 24–26 April 2013; p. 4. [Google Scholar]
Garaba, S.P.; Dierssen, H.M. An airborne remote sensing case study of synthetic hydrocarbon detection using short wave infrared absorption features identified from marine-harvested macro- and microplastics. Remote Sens. Environ. 2018, 205, 224–235. [Google Scholar] [CrossRef]
Moroni, M.; Mei, A.; Leonardi, A.; Lupo, E.; Marca, F. PET and PVC Separation with Hyperspectral Imagery. Sensors 2015, 15, 2205–2227. [Google Scholar] [CrossRef] [PubMed]
Garnaud, J.C. Plasticulture magazine: Amilestone for a history of progress in plasticulture. Plasticulture 2000, 1, 30–43. [Google Scholar]
Yang, Z.; Yu, X.; Dedman, S.; Rosso, M.; Zhu, J.; Yang, J.; Xia, Y.; Tian, Y.; Zhang, G.; Wang, J. UAV remote sensing applications in marine monitoring: Knowledge visualization and review. Sci. Total Environ. 2022, 838, 155939. [Google Scholar] [CrossRef]
Tian, Y.; Yang, Z.; Yu, X.; Jia, Z.; Rosso, M.; Dedman, S.; Zhu, J.; Xia, Y.; Zhang, G.; Yang, J.; et al. Can we quantify the aquatic environmental plastic load from aquaculture? Water Res. 2022, 219, 118551. [Google Scholar] [CrossRef]
Pukelsheim, F. The Three Sigma Rule. Am. Stat. 1994, 48, 88–91. [Google Scholar] [CrossRef] [Green Version]
Lillesaeter, O. Spectral reflectance of partly transmitting leaves: Laboratory measurements and mathematical modeling. Remote Sens. Environ. 1982, 12, 247–254. [Google Scholar] [CrossRef]
Miller, J.R.; Steven, M.D.; Demetriades-Shah, T.H. Reflection of layered bean leaves over different soil backgrounds: Measured and simulated spectra. Int. J. Remote Sens. 1992, 13, 3273–3286. [Google Scholar] [CrossRef]
Jacquemoud, S.; Ustin, S.L. Leaf Optical Properties: A State of the Art; Cambridge University Press: Cambridge, UK, 2001; p. 10. [Google Scholar]
Chen, C.; Breiman, L. Using Random Forest to Learn Imbalanced Data; University of California: Berkeley, CA, USA, 2004. [Google Scholar]
Wah, Y.B.; Rahman, H.A.A.; He, H.; Bulgiba, A. Handling imbalanced dataset using SVM and k-NN approach. AIP Conf. Proc. 2016, 1750, 020023. [Google Scholar] [CrossRef] [Green Version]
Ramezan, C.A.; Warner, T.A.; Maxwell, A.E. Evaluation of Sampling and Cross-Validation Tuning Strategies for Regional-Scale Machine Learning Classification. Remote Sens. 2019, 11, 185. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Channel constellations of optical spaceborne sensors (red and blue bars) and of the laboratory sensors used (gray bars). Bold-written sensors were selected for simulations. Earth’s transmission spectrum using MODTRAN is shown in gray [34].

Figure 2. Photos of exemplary plastic samples, where (A) is PS, (B) is PE-LD, (C) is PVC, (D) is PP, (E) is PET, and (F) is PE-HD.

Figure 3. Data processing workflow as in the course of this study. Blue are work steps, gray are data, and red are major results.

Figure 4. In situ measured reflectance spectra of environmental surfaces used for layered spectral mixture after (A) Siegert et al. [39], (B,H) Neumann et al. [40], (C,D) Roessner et al. [41], (E) Blasch et al. [42], (F,G) Spengler [43], and (I,J) Bochow [44]. The y-axis scaling of (A) differs to that of the other spectra, as water absorbs nearly all energy in this wavelength range. Both sandy soil surfaces (F,G) were measured up to 2450 nm only.

Figure 5. Simulated reflectance signals (

R_{a, s, simulated}

) of PE-HD on selected background surfaces as for Landsat 5 TM/7 ETM+. (A–J) refer to the background surface spectra references given in Figure 4.

Figure 5. Simulated reflectance signals (

R_{a, s, simulated}

) of PE-HD on selected background surfaces as for Landsat 5 TM/7 ETM+. (A–J) refer to the background surface spectra references given in Figure 4.

Figure 6. True-color image of Almería, southeast Spain. White objects are primarily greenhouses. Imagery © 2023 Google, TerraMetrics, Data SIO, NOAA, U.S. Navy, NGA, GEBCO, Landsat/Copernicus, and map data © 2023 Instituto Geográfico Nacional.

Figure 7. Macro F1 scores achieved with k-NN and RF for different types of sensors using all background surfaces, while (A) is the result using simulated signals without noise and (B) is with added 5% noise. The percentages in the upper right corners indicate the proportion of cases in which the respective classifier achieved better results than the other.

Figure 8. Macro F1 scores in dependence of sensor and background surface using RF. (A) Multispectral VNIR sensors, (B) multispectral VNIR/SWIR sensors, and (C) hyperspectral sensors. No noise was added to the simulated reflectance signals.

Figure 9. Output of sequential forward selection in SWIR using continuum removed spectra classified with k-NN. Black vertical lines show the positions of the selected features, with their ranking given in parentheses. The first four features alone resulted in best classification performances. Dashed vertical lines show positions of hydrocarbon features at 1730 nm [21] and plastic features at 1215 and 1732 nm [29]. Additionally, mean continuum-removed reflectance spectrum per plastic type and the channel wavelength ranges of selected sensors are shown, where (A) are multispectral VNIR sensors, (B) are multispectral VNIR/SWIR sensors, and (C) are hyperspectral sensors.

Figure 10. Mean F1 scores in dependence of sensor and plastic type using RF. (A) Multispectral VNIR sensors, (B) multispectral VNIR/SWIR sensors, and (C) hyperspectral sensors. No noise was added to the simulated reflectance signals.

Figure 11. Classified pixels that depict greenhouse roofs of (A) PRISMA HYC and (B) Sentinel-2 MSI using a scene in Almería, southeast Spain. TP are true positives, i.e., PE, and FN are false negatives, i.e., PET, PP, PS, or PVC. Background imagery © 2023 Google, TerraMetrics, Data SIO, NOAA, U.S. Navy, NGA, GEBCO, Landsat/Copernicus, and map data © 2023 Instituto Geográfico Nacional.

Figure 12. Macro F1 scores using different numbers of channels as derived from the feature selection and in dependence of different FWHM, while (A) was classified with k-NN and (B) with RF.

Figure 13. Mean macro F1 scores obtained with (A) k-NN and (B) RF for multispectral VNIR/SWIR and hyperspectral sensors while signals of VNIR and SWIR channels or of only SWIR channels were considered, respectively.

Figure 14. True positives and false negatives per plastic type over all backgrounds using noise-free signals classified with RF, where (A) are of PlanetScope, (B) are of WorldView-3, and (C) are of PRISMA.

Table 1. Plastic types used for the spectral database. In total, we used 53 samples of 6 different types, while we handled PE-HD and PE-LD as one plastic type, i.e., PE.

Acronym	Chemical Notation	Number of Samples
PE-HD	High-density polyethylene	8
PE-LD	Low-density polyethylene	9
PET	Polyethylene terephthalate	8
PP	Polypropylene	12
PS	Polystyrene	8
PVC	Polyvinyl chloride	8

Table 2. Summary table for sensors that show suitable channel configurations for the differentiation of plastics in the environment.

Sensor	Open Data	High Spatial Resolution	Archive Data Only
ASTER	yes	no	yes
WorldView-3	no	yes	no
Hyperion	yes	no	yes
EnMAP	yes	no	no
PRISMA	yes	no	no

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Schmidt, T.; Kuester, T.; Smith, T.; Bochow, M. Potential of Optical Spaceborne Sensors for the Differentiation of Plastics in the Environment. Remote Sens. 2023, 15, 2020. https://doi.org/10.3390/rs15082020

AMA Style

Schmidt T, Kuester T, Smith T, Bochow M. Potential of Optical Spaceborne Sensors for the Differentiation of Plastics in the Environment. Remote Sensing. 2023; 15(8):2020. https://doi.org/10.3390/rs15082020

Chicago/Turabian Style

Schmidt, Toni, Theres Kuester, Taylor Smith, and Mathias Bochow. 2023. "Potential of Optical Spaceborne Sensors for the Differentiation of Plastics in the Environment" Remote Sensing 15, no. 8: 2020. https://doi.org/10.3390/rs15082020

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Potential of Optical Spaceborne Sensors for the Differentiation of Plastics in the Environment

Abstract

1. Introduction

2. Data and Methods

2.1. Spaceborne Sensor Selection

2.2. Spectral Database

2.2.1. Plastic Samples

2.2.2. ASD Data Collection

2.2.3. HySpex Data Collection and Processing

2.3. Spectral Database Processing

2.3.1. Layered Spectral Mixture with Background Surfaces

2.3.2. Spectral Sensor Simulation

2.3.3. Continuum Removal

2.4. Classification

2.4.1. Classifiers

2.4.2. Performance Metrics

2.4.3. Case Study Almería

2.5. Hypothetical Sensor Definition

3. Results

3.1. Classification

3.1.1. Macro F1 Scores: Influence of Sensor, Background, and Noise

3.1.2. F1 Scores: Influence of Plastic Type

3.1.3. Case Study Almería

3.2. Hypothetical Sensor Definition

4. Discussion

4.1. Channel Characteristics

4.1.1. Continuum Removal

4.1.2. Background Surfaces

4.2. Classification

4.3. Applicability

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

Appendix A. Details on the Spectral Database Processings

Appendix A.1. Data Cleaning of HySpex Data

Appendix A.2. Extracting Real Reflectance and Transmittance from HySpex Data

Appendix A.3. Merging of ASD and HySpex Data

Appendix B. Details on the Sensors Selected for Spectral Sensor Simulations

Appendix B.1. SRFs of the Multispectral Sensors

Appendix C. Details on the Machine Learning Classifiers

Appendix C.1. Parameter Tuning for the Machine Learning Classifiers

Appendix C.2. Cross Validation Setup

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI