Next Article in Journal
Changes in Anthropogenic Aerosols during the First Wave of COVID-19 Lockdowns in the Context of Long-Term Historical Trends at 51 AERONET Stations
Previous Article in Journal
A Fast Computing Model for the Oxygen A-Band High-Spectral-Resolution Absorption Spectra Based on Artificial Neural Networks
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Application of Direct and Indirect Methodologies for Beach Litter Detection in Coastal Environments

by
Angelo Sozio
1,
Vincenzo Mariano Scarrica
2,
Angela Rizzo
1,3,*,
Pietro Patrizio Ciro Aucelli
2,
Giovanni Barracane
4,
Luca Antonio Dimuccio
5,
Rui Ferreira
5,
Marco La Salandra
1,
Antonino Staiano
2,
Maria Pia Tarantino
1 and
Giovanni Scicchitano
1,3
1
Department of Earth and Geoenvironmental Sciences, University of Bari Aldo Moro, 70125 Bari, Italy
2
Department of Sciences and Technology, University of Naples Parthenope, 80143 Napoli, Italy
3
Research Centre for Coastal Dynamics, University of Bari Aldo Moro, 70125 Bari, Italy
4
Environmental Surveys s.r.l., Via Dario Lupo n. 65, 74121 Taranto, Italy
5
University of Coimbra, Centre of Studies in Geography and Spatial Planning (CEGOT), Department of Geography and Tourism, 3004-503 Coimbra, Portugal
*
Author to whom correspondence should be addressed.
Remote Sens. 2024, 16(19), 3617; https://doi.org/10.3390/rs16193617
Submission received: 8 August 2024 / Revised: 21 September 2024 / Accepted: 25 September 2024 / Published: 28 September 2024
(This article belongs to the Section Environmental Remote Sensing)

Abstract

:
In this study, different approaches for detecting of beach litter (BL) items in coastal environments are applied: the direct in situ survey, an indirect image analysis based on the manual visual screening approach, and two different automatic segmentation and classification tools. One is a Mask-RCNN based-algorithm, already used in a previous work, but specifically improved in this study for multi-class analysis. Test cases were carried out at the Torre Guaceto Marine Protected Area (Apulia Region, southern Italy), using a novel dataset from images acquired in different coastal environments by tailored photogrammetric Unmanned Aerial Vehicle (UAV) surveys. The analysis of the overall methodologies used in this study highlights the potential exhibited by the two machine learning (ML) techniques (Mask-RCCN-based and SVM algorithms), but they still show some limitations concerning direct methodologies. The results of the analysis show that the Mask-RCNN-based algorithm requires further improvements and a consistent increase in the number of training elements, while the SVM algorithm shows limitations related to pixel-based classification. Furthermore, the outcomes of this research highlight the high suitability of ML tools for assessing BL pollution and contributing to coastal conservation efforts.

1. Introduction

Beach litter (BL) identifies all persistent, manufactured, and processed solid material discarded, disposed of, or abandoned in the marine and coastal environment [1,2]. Preserving coastal ecosystems is an imperative goal in the face of escalating environmental challenges. One of the most relevant issues affecting coastal areas worldwide is the increase in beach litter pollution, especially in the semi-closed basins with heavy anthropization along its coast—such as the Mediterranean basin [3]—and with high tourist fruition [4]. Several transport processes, both natural (i.e., streams, rainfall, and marine currents) and anthropogenic (i.e., drainage ditches and sewage discharges), move BL items toward coastal environments. The GRID-Arendal Environmental Communication Center, a UNEP partner, has identified numerous litter sources on land and in marine environments [5]. It is estimated that most of the litter coming from these sources settles on the bottom of the ocean (70%), and the remaining part (30%) is retained equally between the water column and beaches [6]. In Europe, statistics made available for 2016 by the Joint Research Centre recorded a high percentage (84%) of plastic material among the total marine litter items found along European coasts [7]. These critical issues confirm the need to counter the increase in BL abundance, and the scientific community has been particularly sensitive to this issue. Indeed, a great interest in the scientific community occurred in the past decade, evidenced by the increase in the use of the terms “marine” and “beach” litter in several European and international peer-reviewed papers [8].
BL elements are classified based on their size (micro-litter < 5 mm; 5 mm < meso-litter < 2.5 cm; 2.5 cm > macro-litter > 1.0 m; mega-litter > 1 m) [9] in order to define a uniform scale for field monitoring and data acquisition. This classification also affects methodologies for the activities to be carried out, as elements of different sizes require different tools and approaches to be sampled and analyzed. In this study, the analysis is focused on the macro- and mega-litter categories, as also suggested for the beach monitoring activities performed at the national and international levels. BL assessment has traditionally relied on manual surveys conducted by human observers. These approaches have yielded valuable insights into the composition and distribution of debris along coastlines [10,11,12,13]. However, the manual nature of these methods presents limitations in terms of time and effort effectiveness, necessitating the exploitation of novel automated alternatives.
Recent advances in machine learning (ML) techniques, particularly deep learning (DL) [14], represent a challenge in the field of BL assessment. Indeed, an emerging research branch has focused on developing methodologies to address this issue, spanning conventional human-driven approaches and innovative DL approaches, and exploiting drone-acquired data for comprehensive assessments [15,16]. The field of BL assessment has witnessed significant advances, with a growing emphasis on integrating ML, DL, and computer vision algorithms to assess items’ densities and distribution. Fallati et al. [17] used a domain-specific deep learning architecture called PlasticFinder on beaches in the Republic of Maldives. The authors stated a 78% F-score as the best average result on Adangau Island (peaking at 81%). Gonçalves et al. [18] used a multidisciplinary approach, comprising photogrammetry, geomorphology, machine learning, and hydrodynamic modeling to improve and make the mapping of marine litter items more efficient with UAS on coastal environments. They reached a F-test score of 75% in marine litter object detection compared to the manual procedure. Among the several computer vision algorithms commonly used for the segmentation and classification tasks on images acquired with Unmanned Aerial Vehicles (UAVs), such as Deep Neural Networks [19], Random Forests [16], and other kinds of neural networks [20], in this study, a CNN-based architecture was chosen to develop a new DL computer vision tool for the automatic identification and classification of BL items on UAV images. Indeed, the introduction of Convolutional Neural Networks (CNNs) in litter detection and classification tasks has revolutionized the field of data analysis. Winans et al. [21] tried to exploit the ML potential in a very large geographic area, the Hawaiian Islands. In their study, the authors provided the best percentage of accurate predictions, achieving an average precision of 72%, using three ML models trained on aerial images collected over 1900 km of Hawaiian coastline. Duarte et al. [22] dealt with the imbalance problem for training ML algorithms, using DenseNet121 combined with oversampling, and obtaining an F-score of 68%. Similarly, Scarrica et al. [23] employed deep learning for litter detection in aerial images acquired in southern Italy, showcasing the efficacy of Mask Region-based CNN models and achieving an F-score of 0.96% on real data. In general, Convolutional Neural Networks (CNNs) have shown remarkable efficacy in automating litter detection and classification tasks on specific images. These methods not only promise enhanced accuracy but also have the potential to significantly reduce the time and resources required for large-scale monitoring efforts. In addition, integrating UAV technology into BL assessment campaigns has opened up exciting possibilities. Drones equipped with high-resolution cameras enable rapid and comprehensive data acquisition, covering extensive coastal areas with unprecedented detail and within a short time. This approach facilitates more frequent and detailed data collection surveys and minimizes disturbances to fragile coastal habitats. On the other hand, UAVs with multispectral cameras have been used to classify litter types via the Spectral Angle Mapping (SAM) technique, improving the speed and robustness of traditional image screening [24].
Additionally, UAVs equipped with hyperspectral imaging have shown promise in identifying plastics in coastal environments. By using a push-broom sensor and Linear Discriminant Analysis (LDA), materials such as polyethylene (PE) and PET were successfully detected, demonstrating the feasibility of real-time plastic identification in both beach and marine environments [25]. However, relying solely on UAV photogrammetry, which captures imagery in the visible spectrum, presents more challenges, as distinguishing between various materials becomes less precise compared to using multispectral or hyperspectral sensors. The complexity increases in environments where overlapping textures or buried items exist, further limiting the efficacy of traditional photogrammetric techniques.
In this study, we contribute to the existing literature by comprehensively evaluating four distinct beach litter detection and classification methods. Our objectives are (i) to perform direct in situ BL assessment; (ii) to manually identify BL items on georeferenced orthomosaics; (iii) to improve the performance of the segmentation and classification tasks of a deep learning model previously developed [23] for the automated BL recognition and single-class classification on UAV images; and (iv) to evaluate the performance of a machine learning-based SVM algorithm provided by Orfeo Toolbox plugin, hosted by QGIS software (v.3.16). Finally, the DL method requires orthorectified images as the source of the training dataset for the classification task. To this aim, a survey protocol reporting the main technical and logistic parameters was drafted.
This research provides valuable insights into the strengths and limitations of the applied methods, shedding light on their potential applications in addressing beach litter pollution and furthering the cause of coastal sustainable management through efficient and cost-effective survey techniques.

2. Area of Study

The beach sector chosen as a test site for this study is located along the Adriatic coast of the Apulia Region, southern Italy, 15 km north of Brindisi (see Figure 1a). It falls into the Marine Protected Area (MPA) of Torre Guaceto (Figure 1b), established in 1991 by a Decree of the National Environmental Ministry. The MPA has an aerial extension of almost 3000 ha, including terrestrial and marine zones, and covers 8 km of the Apulian coastline [26]. This area is crossed by the Canale Reale river, a 48 km long creek that separates the Murgia plateau from the Taranto–Brindisi plain [27,28]. As the test site is included in a protected area, planned periodic cleanup operations are minimal, as is the recreational use by beachgoers. As a result, the distribution and abundance of litter appear to be related mostly to the natural transport and mobilization processes affecting the coastline. Moreover, the studied area is located near the mouth of the Canale Reale river (Figure 1c), a natural channel with a very limited flow rate artificially regulated in its final section. The Canale Reale river is considered one of the main waterways that could bring waste material from inland areas to the MPA boundaries, especially after the occurrence of extreme weather events. The sand present in the analyzed beach sector has a fine and medium-fine grain size, with a whitish color due to the high amount of carbonate components. In this specific test site, activities concerning the four proposed assessment methodologies were conducted.

3. Materials and Methods

In this section, the methodological approaches proposed for both direct (in situ) and indirect (visual screening and automatic detection) BL distribution analysis are described (Figure 2). Field activities were carried out at Torre Guaceto Beach in May 2023. During these activities, in situ BL surveys were carried out following the international guidelines [29], which identify a beach sector of 100 m long as a standard site for litter monitoring. Furthermore, a set of images was collected using a UAV system for photogrammetric surveys. The UAV images and related orthomosaics were exploited for both the visual screening and the automatic detection techniques. To assemble a training dataset including images reporting different environmental characteristics and useful for the training phase of the Mask-RCNN-based and SVM algorithms, additional UAV flights were performed in different coastal sectors in the Apulia Region (Capitolo Beach and Torre Guaceto Beach) and central Portugal (Leirosa Beach). Leirosa Beach, located 43 km west of the city of Coimbra along the Atlantic coast of Portugal, is a wide sandy beach sector characterized by a meso-tidal environment and backed by an eroded dune system from which litter items are gathered. Moreover, for a homogeneous quantification of the four BL assessment methodologies applied in this study (i.e., in situ assessment, visual screening on the orthomosaic, and the two machine learning techniques), six boxes with a mean surface area of 45 m2 were defined within the study area, from the shoreline up to the foredune limit (Figure 3). Litter density was calculated for the test boxes using results obtained by the in situ counting, as described in sub-paragraph 3.1. In sub-paragraph 3.2, the photogrammetric workflow describes the process of acquiring and post-processing of UAV images in detail. Then, indirect beach litter detection methodologies are illustrated in sub-paragraphs 3.3, 3.4, and 3.5. Activities envisaged by all four methodologies were conducted on test tiles enclosed in the six boxes, covering a total surface of 274.76 m2 of the overall test site.

3.1. In Situ Beach Litter Survey

The in situ visual assessment was conducted to measure litter abundance in Torre Guaceto Beach. Following the international monitoring guidelines [30,31], the study area covers two 100 m long beach sectors, north of the Canale Reale mouth (cf., Figure 1c). This activity was conducted taking into consideration six boxes identified from the shoreline up to the inland foredune limit (Figure 3). The identification process was focused on all litter items bigger than 2.5 cm, so considered in the macro-litter category, which was listed and classified according to the Joint list of litter categories for marine macro-litter monitoring proposed by the European Commission [31]. BL density values were computed by dividing the number of items by the box area. To make the in situ counting easier, each box was split into two parts (e.g., Box A was divided into A1 and A2, Box B into B1 and B2, and so on, as shown in Figure 3). Targets for GCPs were also used to define box extension, aside from georeferencing purposes.

3.2. UAV Surveys: Image Acquisition and Post-Processing

UAV surveys were planned to build the dataset for the training phases of the ML algorithms. Furthermore, in the Torre Guaceto test site, UAV surveys performed in May 2023 were also exploited to monitor the environmental status of the coastal sector soon after the period of occurrence of winter marine events. As for the in situ activities, surveyed areas covered 100 m long beach sectors.
In detail, for the UAV flights in the test site, twenty wooden targets (Figure 3) were used as Ground Control Point (GCP) to obtain georeferenced products in the photogrammetric post-processing. To this end, their coordinates and elevation were measured with a “Stonex S9III-N” GNSS receiver in Real Time Kinematic (GNSS-RTK) mode. The UAV images were acquired using a multirotor quadcopter “DJI Inspire 2” equipped with a “DJI Zenmuse X5S” optical camera (20.8 MP, DJI MFT 15 mm/1.7 ASPH supported lens, 4/3” CMOS sensor, FOV 72° and image resolution 5280 × 3956 pixel), property of the Department of Earth and Geoenvironmental Sciences of the University of Bari (Italy). The UAV flight missions were set up with a single grid path perpendicular to the shoreline, a nadir camera angle (−90°), and an image forward and side overlap of 80% and 65%, respectively. Four flights were performed at 10 and 15 m Above Ground Level (AGL) of the take-off location, respectively, detecting the site’s status regarding the beach litter distribution. Then, additional amounts of litter were added by operators along the beach, and two additional flights, performed at the same altitude as the previous ones, were conducted for the specific purpose of the algorithms’ training phase (new litter distribution). Indeed, this last step was necessary to construct an ideal litter pattern with known items to provide a valid dataset for the Mask-RCNN training phase. Moreover, also to this end and thanks to the collaboration with the Coimbra University, an additional flight was carried out along the Leirosa Beach sector (Portugal). In this way, the machine learning algorithms were trained on beach sectors with different background features. For technical and environmental acquisition specs of this flight, see Appendix B.
The post-processing of all the acquired UAV images (listed in Table 1) was executed using Agisoft Metashape Professional (v. 1.6.5) following the principles of the Structure from Motion (SfM) technique [32].
A sequential step-by-step workflow (Figure 4) was conducted to obtain a Digital Surface Model (DSM) and an orthomosaic for each flight. First, raw images were aligned to produce a sparse point cloud, representing the image tie points identified by the software. Then, a high-precision georeferencing procedure was carried out using the GCP coordinates collected on the field with vertical and horizontal accuracy of about 0.02 m and 0.01 m, respectively. Finally, a dense point cloud was generated and used as input to obtain the DSM and, consequently, the RGB orthomosaic. The DSM and orthomosaic spatial resolution, strongly influenced by different processing parameters (e.g., the dense cloud downscale factor), is expressed by the Ground Sampling Distance (GSD), a parameter that associates the spatial distance on the field with the final orthorectified image resolution, as previously indicated by Ventura et al. [33] with the following equation:
G S D ( cm / pixel ) = S w F h F L I w
where GSD is the image spatial resolution at ground level, Sw is the sensor width expressed in millimeters, Fh is the flight altitude in meters, FL is the camera’s focal length expressed in millimeters, and Iw is the image width expressed in pixels. In Table 1, details of all UAV flights performed for this study are provided.

3.3. Visual Screening for Litter Identification

Manual visual screening was carried out to recognize and digitize all litter items in the boxes identified at Torre Guaceto Beach. This operation was performed on the very high-spatial-resolution orthomosaics (1 mm/pixel) obtained from UAV images at a 10 m altitude, using QGis software (v. 3.16). To this aim, a specific geo-database was assembled, compiling the attribute table of the related shapefile and reporting for each item the following fields: X and Y coordinates, European classification code, and the composition material.

3.4. Mask-RCNN-Based Algorithm Dataset Building and Application

To build the dataset, each orthomosaic was split into 1000 × 1000 pixel tiles in Global Mapper GIS software (v. 20.1) to improve performance and processing times during the elaboration steps. The training dataset is composed of tiles from UAV images collected for training purposes (cf. Table 1) while the test dataset comprises 104 tiles, covering the area inside the six boxes. Three different settings of the training datasets (Dataset 1, Dataset 2, and Dataset 3) were proposed to conduct three experiments on the box areas at the Torre Guaceto test site. In detail, Dataset 1 and Dataset 2 consider the classes “Bottles”, “Worked wooden”, and “Other”, accounting both for the availability of BL items on every specific site and the most common categories of BL on the Italian coastline [34]. The label “Bottles” does not consider vials and flasks for cleaning products but only common drink bottles, whether they are made of glass or plastic. The label “Worked wooden” includes wooden artifacts such as beams, decks, or platforms, and the label “Other” includes all the remaining items of other kinds of waste, regardless of the material. In Dataset 3, the label “Other” was replaced by “Nets”, encompassing fishing nets primarily used for mussels. The network was trained three times because the three settings of the training dataset differ in the number of polygons. This is a crucial aspect of the balance among classes and for the final performance. In the first setting (Dataset 1), those objects other than “Bottles” or “Worked wood” were grouped into a single class, “Other”. In Dataset 2, the number of elements in the “Other” class was reduced, while in Dataset 3, the class “Other” was replaced by the “Nets” class, which represented one of the most abundant and characteristic objects in the coastal test site. In all experiments, the background was not included in any classes, since it was represented by sand, sea, and vegetation whose chromatic features are very variable depending on several factors, i.e., the site-specific morphodynamic characteristics, the hour, and the weather conditions during the acquisition flight.
On each tile (training and test), BL items were manually digitalized and labeled in QGIS software (v. 3.16), producing polygon shape files. Polygons labeled on test tiles represented the ground truth elements for the accuracy evaluation while polygons labeled on training tiles were used for the training phase. The DL algorithm for automatic BL analysis exploits Mask-RCNN architecture [35]. This is based on Faster-RCNN and performs an instance segmentation task with a pre-training phase on the COCO dataset [36]. The subsequent training phase was performed in one epoch consisting of 100 intermediate steps. This algorithm works on a two-stage network (Figure 5): in the first one, ROIs (regions of interest) are generated, while in the second stage, ROIs are ranked, and a bounding mask regression is returned. So, the algorithm adds parallel elaboration branches for object mask segmentation, aside from the box recognition. ResNet101 was used as the backbone. Since the manual labeling of polygons that constitute the ground truth was carried out in QGIS software (v.3.16), a conversion from shapefile format to JSON was needed. This was executed to make shapefile metadata available to the Mask-RCNN algorithm. The overall dataset was uploaded on Google Drive and analyses were conducted on the Google Colab platform, which provides Nvidia Tesla K80 GPU and 12 GB of RAM. As a result of the analysis, images in .png format were obtained on which beach litter items identified by the Mask-RCNN algorithm were segmented with masks and framed in a rectangular bounding box (Figure 5). Furthermore, the accuracy of the ML tools was evaluated through Mean Average Precision using Intersection over Union (mAP@IoU), as already used in Scarrica et al. [23]. It is derived from the precision–recall curve, incorporating the concept of Intersection over Union (IoU). IoU represents the ratio of the overlapping area, expressed in pixels, to the total area of two polygons, typically a predicted bounding box/polygon and a ground truth box/polygon. In the context of mAP, IoU is used to determine whether a detection is a true positive based on a set threshold. More specifically, mAP@IoU is defined as the mean of the areas under the precision–recall curve across all classes and IoU thresholds, where true positives are detections with an IoU higher than the given threshold. This allows mAP to account for the accuracy of localization in object detection tasks [34]. The values of IoU range between 0 and 1. In this work, the value of 0.1 was chosen as the threshold. Since the Mask-RCNN-based algorithm does not manage geo-spatial data, the script was implemented with a code block for the georeferencing task. In detail, a .txt file is associated with each exported test image. It reports projected coordinates for each segmented and classified polygon. The algorithm reads the georeferenced input tiles (.tiff format) and, at the same time, computes bounding box centroid coordinates (expressed in pixels) on the output images. After that, knowing the spatial resolution (GSD) and up-left image vertex projected coordinates (origin point) measured on the georeferenced input image, it is possible to compute the projected coordinates (X and Y) of each centroid. Indeed, the georeferenced code of the script is based on the input images in .tiff format, which keeps metric coordinates on its origin point (top-left image vertex) since it is derived from the georeferenced orthomosaic. This procedure can be applied to every projected reference system. As well as coordinates, other parameters were reported in each .txt file: object ID, category name, and score value.
The pseudo-code strings used to extrapolate the coordinates of each detected item are reported in Appendix C.

3.5. SVM Dataset Building and Application

The Support Vector Machine (SVM) algorithm [37] was selected for the classification task conducted entirely on GIS environment. This tool is included in the Orfeo Toolbox (OTB) suite, an open-source library available as a plugin on the QGIS environment [38], which provides many algorithms for high-resolution optical, multispectral, and SAR image post-processing. Moreover, OTB’s algorithms perform object detection on images following the typical object-based image analysis (OBIA) workflow [39]. Also in this case, different settings of the training datasets were organized. The training database was assembled with manually digitalized polygons considering those items belonging to Dataset S1 and Dataset S3, also used for the Mask-RCNN-based algorithm classification, but because SVM executes a panoptical pixel-based classification on the entire input image, a fourth class was added, labeled “Sediment”, to include the background. To distinguish them from the previous ones, they are called Dataset S1, Dataset S2, and Dataset S3 pertaining to the SVM elaboration. Moreover, a third experiment was conducted with a third dataset (Dataset S3) composed of five classes, that are “Bottle”, “Worked Wood”, ”Other”, ”Nets”, and “Sediment”. In this case, polygons were digitalized directly on the orthomosaic chosen as training site (Capitolo Beach) on which additional BL items were added by operators to obtain as many training objects as possible. The metadata of each polygon were characterized by two attributes, “Class” and “Label”, the latter of which was used for the training and validation phases. To this end, a file containing the pixels zonal statistics of the RGB orthomosaic was also needed in order to assign the label of each polygon to a specific cluster of pixels (groundtruth). A split value of 0.5 was set between the training and validation number of pixels. Then, the training phase task was performed using the “TrainImageClassifier” tool provided by OTB, where a polynomial kernel type wasset up for the SVM algorithm, since a “linear” type would have been too limiting to our class geometry. This step led to the generation of the model file required by the next test phase conducted on the 104 test tiles. In this case, the SVM performance accuracy was evaluated through the K index [40] and confusion matrices referring to the validation phase for each dataset setting. The kappa coefficient (K) is a statistical measure used to evaluate the degree of agreement or consistency between two or more raters or observers when they classify items into categories. Unlike simple percentage agreements, “K” considers the agreement occurring by chance. Varying K-values across different datasets when using an SVM for pixel classification reflects the differences in model performance, dataset characteristics, and quality. Higher K-values signify better model performance and alignment with true classifications, while lower values indicate potential areas for improvement in data quality or model tuning.

4. Results

This paragraph reports the results of each approach applied for the analysis of BL at Torre Guaceto Beach. Sub-paragraph 4.1 illustrates the results obtained by the in situ activities. These were conducted on 23 and 25 May 2023, soon after extreme events of the winter period and before the summer season. Products obtained by the photogrammetric elaboration using SfM techniques are illustrated in sub-paragraph 4.2 together with the results of the visual screening analysis. The dataset description and results of the analysis performed using the machine learning algorithms are provided in sub-paragraphs 4.3 and 4.4.

4.1. In Situ Analysis

The total number of BL items identified through the in situ counting, in all the box areas, was 158. Each element was assigned to a specific category, following the most recent classification codes [31]. The percentage of litter items and materials is reported in Figure 6 while all identified litter categories with their statistics are reported in Appendix A. Relying on this analysis, plastic fragments > 2.5 cm (J79-J80-J82-J124) turned out to be the most abundant item (50), especially in boxes located towards the foredune zone (A2-B2-C2). BL density was computed by dividing the total amount of cataloged items by the surface area of each sub-box. After that, the minimum, maximum, and average density values obtained were calculated, resulting in 0.06, 1.19, and 0.51 items/m2, respectively.

4.2. Photogrammetric Outputs and Visual Screening Analysis

A total of 1125 UAV images were acquired during the six flights performed at different heights ranging from 10 m to 20 m AGL (Table 2). Photogrammetric post-processing led to the creation of six RGB orthomosaics, all framed in the UTM projected reference system, with X and Y coordinates expressed in meters. The spatial resolution of orthomosaics varied according to the flight heights and it ranged from 0.2 to 0.4 cm/pixel, as summarized in Table 2. In Figure 7, the orthomosaics for the test site of Torre Guaceto Beach (obtained from the flight at 10 m) are used as background.
The manual visual screening results show how objects are mostly located on the southernmost boxes, near the Canale Reale mouth, and lying on the backshore section (Figure 7). The BL density of items identified through visual screening was also computed, obtaining an average value of 0.30 items/m2. The manual visual screening results and the ones obtained by the in situ survey conducted in the test site revealed that the first methodology returned fewer objects, up to 42% less than in situ data, as shown in Figure 8.

4.3. BL Automatic Detection Using Mask-RCNN-Based Algorithm

A total of 413 tiles were selected based on the total number of tiles obtained by splitting the six orthomosaics, taking into consideration the object distribution along the beaches and discarding those reporting no data areas. Among these, 104 tiles were used as a test, covering the area of the six boxes defined at Torre Guaceto Beach. The remaining 309 tiles were used to assemble the training dataset in three different settings, as reported in Table 3. BL items were digitalized on each training tile, obtaining several training polygons. Dataset 1 proved to be unbalanced since class “Other” was composed of 740 polygons, class “Bottle” of 40 polygons, and class “Worked wood” of only 18 polygons. Dataset 2 proved to be more balanced since classes “Bottle” and “Worked wood” were composed of almost the same number of polygons (33 and 36, respectively), and the number of polygons in class “Other” was reduced to 123. Finally, Dataset 3 resulted in the most balanced setting since the number of polygons in each class was comparable (28, 38, and 38 polygons).
The analysis of the output of the Mask-RCNN-based algorithm shows that, considering the experiment performed by using Dataset 1, the number of identified BL objects was 317, ranging from 7 (sub-box F1) to 56 (sub-box E2), so in every box, at least one item was identified. Using Dataset 2, the number of identified BL objects dropped to 24, ranging from 0 (sub-box A1, B1, C1) to 5 (sub-box D2). Using Dataset 3, the number of identified BL items was 16, ranging from 0 (most of the sub-boxes) to 7 (sub-box F2). The total amount of true positive items detected using Dataset 2 and 3 was very low, so the confusion matrix relative to Dataset 1, expressing Mask-RCNN-based algorithm classification accuracy, is reported in Figure 9. Considering the overall accuracy index on the classification, a mean value of 0.65 was reached but an overestimation of the “Other” class occurred, as can be deduced from the relative recall value. The georeferenced outputs were also obtained, and they consisted of a text file associated with each test reporting the polygon’s class ID, label, and projected coordinates. The text files were then imported into QGIS software (v.3.16) to visualize the BL spatial distribution (Figure 10), and thanks to the GCP measurements during the survey, the geometrical centroid of each segmented object had a horizontal error of about 2–3 cm. Based on the georeferenced outputs from the Mask-RCNN-based algorithm, BL density was calculated, as was done for the in situ and visual screening counting. Three different average litter density values were computed considering the three different experiments, and they are reported in Table 3. Accuracy values associated with Mask-RCNN-based algorithm performances in the three experiments were computed through the mAP@IoU parameter, and they differ depending on the dataset setting used for the training phase. The accuracy evaluated for the first experiment (Dataset 1) was 0.017 (1.7%), which increased to 0.020 (2%) considering the setting Dataset 2 and, finally, a value of 0.025 (2.5%) was reached considering the setting Dataset 3. Moreover, a decrease in the number of identified objects was observed with an increase in the mAP accuracy value (Figure 11). On the other hand, the classification task accuracy was expressed by confusion matrices for each experiment.

4.4. BL Automatic Detection Using SVM Algorithm

Considering the dataset assembled for the SVM algorithm, all three dataset settings used for the three experiments were almost balanced (Table 4). The number of pixels enclosed in the whole number of validation polygons is also reported in Table 4 and it varies according to their size. The overall accuracy of performance (Figure 12) reached by SVM during the training and validation phases was expressed by a K index ranging from 0.81 (Dataset S1) to 0.85 (Dataset S2). Outputs obtained during the image classification phase of SVM were only of visual type and without a groundtruth, since the black-box nature of the tool did not directly allow a quantitatively performance evaluation of the classification task. In detail, the SVM segmentation and classification output analysis shows a clear difference between Dataset S1 and the remaining two. Indeed, in the first experiment, pixels pertaining to the background (“Sediment” class) were wrongly classified mainly as “Other” (Figure 13). The performance of the class “Nets” is notably better on S3 compared to S2, although this improvement might not be evident from Figure 12. This is because Figure 12 displays only a sample rather than the full test set, which might not fully represent the overall trend.

5. Discussion

In this study, a multiple analysis of both direct and indirect methodologies for the identification and classification of BL items was conducted. From the analysis of the international literature, it emerged that the number of studies focusing on beach litter assessment is constantly growing [8,41,42]. Most of the studies, both at the national and international scale, are based on BL data collected through in situ surveys and are aimed at defining the overall environmental quality of the investigated beach sectors by applying specific indices [11,12,43,44,45,46]. Due to the human efforts required to carry out in situ surveys for long coastal sectors, the most recent investigations exploit the use of drones to collect tailored images [47,48,49]. Using GIS tools, the UAV images can be used to identify and map BL items by performing ex situ visual screening. The accuracy of the identification process depends on the image resolution that, in turn, depends on the technical parameters for drone flight deployment, such as flight altitude [50].
UAV images used in this study were acquired in two different beach contexts (Italian and Portuguese coastlines), with different sediment characteristics, to carry out the training task under different backgrounds. Indeed, regarding the physical characteristics of the coastal areas, variations in sediment texture and composition were the leading cause of background features. The grain size and composition of the sediment altered the sediment’s light reflection and absorption, giving the sand a specific color. This aspect deeply affected automatic segmentation and classification tasks.
All methodologies were applied at Torre Guaceto Beach while images from other sites were used as supplementary material to train machine learning algorithms. The results of the in situ surveys and the manual visual screening showed how BL items were mostly located on the southernmost part of the Torre Guaceto Beach, near the Canale Reale mouth. They were mostly located on the inland part of the backshore (Figure 7). As also highlighted in previous studies [10], this peculiar distribution could be related to the material transported by the creek and reallocated on the beach face by swash and backwash currents.
The in situ survey and visual screening litter density average values (0.51 and 0.3, respectively) and their total amount of detected BL items (158 and 92, respectively) revealed that through the latter methodology, it was possible to detect fewer objects than with the in situ survey (Figure 8). Although this was very high (0.03–0.04 cm/pixel), the detection of small objects (centimeters in scale, e.g., caps, cigarette butts, and undefined small pieces) was sometimes very difficult, and therefore it was impossible to catalog them according to litter categories. So, the visual screening methodology could be very useful for the detection and distribution purposes of macro- and mega-litter, but it showed limitations for classification purposes, especially for the meso-litter and the smaller objects belonging to the macro-litter category. This is in line with the outcomes of the research carried out by Moy et al. [51], in which precise measurements of the quantity, location, type, and size of macro-debris were provided using manual screening conducted on orthomosaics.
The Mask-RCNN-based algorithm quantitative results exhibited both an underestimation and an overestimation issue for the detection task. This phenomenon can be traced back to the quantitative equilibrium among the classes, as inferred from the comparison of the conducted experiments (Table 4). In the case of unbalanced classes (Dataset 1), the number of elements was higher, but the accuracy was low. On the other hand, in the case of more balanced classes (Dataset 2 and Dataset 4), the number of detected elements was lower, but the classification accuracy was higher. The Mask-RCNN-based algorithm was trained using three dataset settings that strongly differed in the number of training polygons. This is a crucial aspect for the balance among classes. In detail, in Dataset 1, classes proved to be unbalanced and the accuracy value returned was very low (Figure 11, mAP@IoU = 0.017). In Dataset 2, the number of elements composing the “Other” class was lowered to 123, to reach a more balanced status with the other two classes. In this case, the final performance was lower than the previous experiment and an underestimation issue occurred. To fix this issue, the class “Other” was replaced with the class “Nets”, which is considered one of the most common BL items identifiable on coastal environments. For this reason, in Dataset 3, a balance status in the polygon number (28, 38, and 38; Table 3) was reached. Nevertheless, the number of training polygons was still not high enough to return an acceptable classification output. It can be deduced that the performance of the Mask-RCNN-based algorithm can lead to good results with a higher number of training elements. Moreover, a main factor influencing segmentation and the subsequent classification outcome was the shape of training polygons. So, considering both segmentation accuracy assessment (Table 3) and classification accuracy (Figure 9), it can be deduced that the Mask-RCNN-based algorithm could be better in segmentation tasks rather than as a classifier tool. Therefore, to reach an acceptable performance for a multi-class classification, it is necessary to provide training element shapes as well defined as possible that faithfully reproduce the objects to be recognized.
The approach proposed in this study allowed the researchers to geolocate BL items in the site-specific UTM Reference System (Figure 10). Pfeiffer et al. [52] proposed an analog way to obtain a georeferenced output based on bounding boxes and framed in a geographic coordinate Reference System (WGS84). Nevertheless, the approach proposed here proved to be very accurate (with a horizontal error of about 2–3 cm) and linked to the geometrical centroid of each object, ensuring a very detailed analysis of BL item distribution along the beach profile.
As far as the SVM algorithm provided by the Orfeo Toolbox plugin is concerned, the obtained validation results reached an acceptable accuracy especially with Dataset S3 (Figure 12c), where the main objects were distinct from the background. The same cannot be said for the visual results reported in Figure 13, where, using completely new images which were different from those of the training and validation phases, some critical issues emerged. SVM panoptical classification is only pixel-based and does not consider polygon shape during the training phase, and this resulted in an indirect and passive segmentation. “Bottle” and “Worked wood” proved to be classes with highly critical issues. In the first case, elements below the real bottles lying on the beach training sector (in most cases, consisting of sand) were quite visible since they are almost always made of transparent glassy or plastic material. This generated a high degree of heterogeneity in pixel zonal statistics pertaining to this class, leading to misclassification by SVM (Figure 13a). A similar issue was encountered with the “Worked wood” class, as wooden objects labeled for the training phase had a chromatic range like the sand one. Indeed, as can be seen from confusion matrices concerning the validation phase (Figure 12), a high number of pixels belonging to the “Sediment” class were classified as “Bottle” (5655 in Dataset S1) or “Worked wood” (3182 in Dataset S1). On the other hand, the “Other” class showed the highest P.A. value (0.915 in Dataset S2 and 0.918 in Dataset S3; Figure 12) along with the “Sediment” class, as labeled training objects had a chromatic range contrasting the background one. Moreover, objects’ shadows or sandy ripples were considered independent objects, thus increasing the number of total masks produced. Trials were also conducted to verify processing time, and test tiles larger than 1000 pixels led to a significant increase in it. In the end, because of the pixel-based nature of the SVM classification process, it was impossible to obtain a density value from a raster output, unless further processing steps were added that would greatly increase the time for obtaining a preliminary and immediate estimate of the BL items’ distribution. Despite the high accuracy values achieved during the validation phase (>80%), the aspects discussed above make the SVM classifier unsuitable for a multi-class detection task on very high-resolution images with three spectral bands, but at least, it proved to be a good tool for a segmentation task. Indeed, better performances could be achieved using multiple-band images, since this kind of algorithm, as well as most of the other algorithms available in the OTB suite, are very suitable for remote sensing processing [53,54].
As a final remark, we can observe that the Mask-RCNN-based algorithm proposed in this study resulted in a promising alternative tool for the automatic detection of BL items, although still not accurate enough (mAP@IoU = 0.02), since the object shape is a crucial element for the segmentation and classification tasks. In Scarrica et al. [23] a preliminary version of the Mask-RCNN-based algorithm was exploited to detect and classify BL items on UAV images through a single-class classification approach. In this study, the algorithm was improved to reach a multi-class classification task, evaluating its performance in classifying BL items according to three classes. The robustness of the Mask-RCNN-based algorithm was evaluated by choosing an IoU threshold of 0.1, but further experiments are planned to reach a threshold value of 0.5. The number of training polygons should be as high as possible to obtain acceptable performance, and this aim could be easily reached through more aero-photogrammetric surveys.
Based on the field photogrammetric surveys conducted both in this study and in previous activities [13,23,50,51], a protocol for a potential standardization of a drone-based BL survey was proposed. In detail, specific logistic, technical, and environmental parameters are defined in Table 5.

6. Conclusions

In the past twenty years, the interest of the international scientific community regarding the environmental issue of litter in coastal areas has been increasing. In this study, an integrated analysis of different methodologies for macro-litter (i.e., objects greater than 2.5 cm) identification and classification is provided. In detail, data directly collected in situ at Torre Guaceto Beach (southern Italy) were used to evaluate beach litter density, and the obtained values were used as a reference for those calculated from indirect analyses based on the exploitation of UAV photogrammetric surveys (i.e., manual visual screening and automatic detection through ML algorithms). The results of such analysis allowed us to state that, although there is a need for further improvements in the training phase to overcome the overestimation/underestimation issues, the automatic classification tool performance is promising.
As far as the novel Mask-RCNN-based algorithm is concerned, it turned out to be poorly suitable for multi-class classification, because of low accuracy values (less than 10%). Nevertheless, the overall performance of this tool could be improved by providing additional training polygons for a set of selected BL classes (representing the most common BL items), which can be obtained by increasing the number of available training images. To this end, a flight protocol reporting the mechanical and environmental settings is proposed to carry out tailored UAV surveys. The simplicity of the training and testing steps makes the Mask-RCNN-based algorithm a versatile tool that can be further exploited to support the integrated management of the coastal zone. In addition, thanks to the introduction of a block code that allows to obtain a very accurate georeferenced position of the identified BL items, the current version of the proposed Mask-RCNN method allows the assessment of BL item spatial distribution along the investigated beach sector and, therefore, it can be exploited to identify accumulation zones and define priority areas where beach management action should be directed.
On the other hand, although the SVM classifier available in the QGIS environment exhibited a relatively high accuracy during the validation phase, it did not reach a reasonably satisfactory performance for a multi-class pixel-based detection task on very high-resolution images with three spectral bands. In this regard, the integration with multi-spectral or hyper-spectral images could be very useful to improve the suitability of this tool to be applied for BL detection purposes.
Improving the automatic detection approaches means facilitating the autonomous use of ML-based tools, reducing the time and effort needed for coastal monitoring through in situ field activities. In compliance with the objectives of sustainable development [55,56], further investigations will be carried out to provide an adequate and definitive coastal monitoring tool to be exploited for the integrated geo-environmental characterization of both natural and highly anthropized coastal areas [57].

Author Contributions

Conceptualization, A.S. (Angelo Sozio), V.M.S., A.R., G.S., A.S. (Antonino Staiano), and P.P.C.A.; methodology, A.S. (Angelo Sozio), A.R., V.M.S., A.S. (Antonino Staiano), and G.S.; software, A.S. (Angelo Sozio), V.M.S., and A.S. (Antonino Staiano); formal analysis, A.S. (Angelo Sozio) and V.M.S.; resources, M.L.S., L.A.D., R.F., G.B., and M.P.T.; data curation, A.S. (Angelo Sozio), V.M.S., M.L.S., M.P.T., R.F., and G.B.; writing—original draft preparation, A.S. (Angelo Sozio), A.R., and V.M.S.; writing—review and editing, A.S. (Angelo Sozio), A.R., V.M.S., M.L.S., G.B., L.A.D., R.F., A.S. (Antonino Staiano), and G.S.; supervision, A.R., A.S. (Antonino Staiano), and G.S.; project administration, A.R., G.B., G.S., A.S. (Antonino Staiano), and P.P.C.A.; funding acquisition, G.S. All authors have read and agreed to the published version of the manuscript.

Funding

Activities were funded by the Research Agreement signed between the Department of Earth and Geoenvironmental Sciences of the University of Bari Aldo Moro and the Marine Protected Area of Torre Guaceto (scientific supervisor: G. Scicchitano, RDO 3481808). The investigation is also part of the activities of the Centre of Studies in Geography and Spatial Planning (CEGOT), funded by national funds through the FCT under the reference UIDP/GEO/04084/2020_UC.

Data Availability Statement

Data will be available on request.

Acknowledgments

The research was carried out in the frame of the RIPARTI Project (assegni di Ricerca per riPARTire con le imprese) POC PUGLIA FESRT-FSE 2014/2020—“Marine & Beach litter monitoring in Apulia”, scientific supervisor: Angela Rizzo; industrial supervisor: Giovanni Barracane). This study was conducted under the Research Agreement between the Department of Earth and Geoenvironmental Sciences of the University of Bari Aldo Moro and the Department of Science and Technology of the University of Naples Parthenope (scientific supervisor: Pietro P. C. Aucelli, Angela Rizzo, Giovanni Scicchitano, and Antonino Staiano). Furthermore, the research belongs to the activities of the RETURN Extended Partnership, supported under the National Recovery and Resilience Plan (NRRP), Mission 4, Component 2, Investment 1.3-D.D. 1243 2/8/2022—European Union NextGenerationEU (no. PE00000005, CUP: B53C22004020002—author involved: Angela Rizzo). The authors also thank the staff of the “Consorzio di Gestione di Torre Guaceto” and the Torre Guaceto Sailing Center for the logistical support provided during field activities. Finally, this work was supported by the insightful and productive discussions within the GeomorphoAI research group, sponsored by the Italian Association of Physical Geography and Geomorphology (AIGEO) (coordinators: Domenico Capolongo, Giovanni Scicchitano).

Conflicts of Interest

Author Giovanni Barracane was employed by the Environmental Surveys s.r.l. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Correction Statement

This article has been republished with a minor correction to the reference 14. This change does not affect the scientific content of the article.

Appendix A

Report of the BL items noted during the in situ counting according to the European guidelines.
DescriptionCodeScan Box
A1A2B1B2C1C2D1D2E1E2F1F2
Small plastic bags, e.g., freezer bags, including piecesG4
1
Drink bottles <= 0.5 lG7 2 11
Drink bottles > 0.5 lG8 1 1 3
Cleaner/cleanser bottles and containersG9 4 2
Food containers, incl. fast food containersG10 11 1
Other bottles and containers (drums)G13 1 1
Engine oil bottles and containers < 50 cmG14 1
Jerry cans (square plastic containers with handle)G16 1
Plastic caps/lids from drinksG213 2213 2 1 2
Plastic caps/lids unidentifiedG2311 1
Plastic rings from bottle caps/lidsG24 1
Cigarette butts and filtersG27 5
Pens and pen lidsG28 1
Lolly sticksG31 1
Cups and cup lidsG33 1
Cutlery and traysG34 1
Nets and pieces of net < 50 cmG53 14 2 1 1
Nets and pieces of net > 50 cmG54 5 2 1
Fish boxes—expanded polystyreneG58 1
Shoes/sandalsG71 75
Plastic pieces 2.5 cm > < 50 cmG7916 110
Plastic pieces > 50 cmG80 1
Polystyrene pieces 2.5 cm > < 50 cmG82 4 42 1
Cotton bud sticksG951 1 1
Syringes/needlesG99 1
Other plastic/polystyrene items (identifiable)G124 14 26 3
Other rubber piecesG134 1
Shoes and sandals (e.g., leather, cloth)G138 1
Rope, string, and netsG142 1 1 1
Paper fragmentsG156 7
Other wood <50 cmG1711 2
Aerosol/Spray cansG174 2
Box areas (item)734103451842501308
Area (m2)15.7230.9321.3427.7627.421.9513.1530.2411.6638.7613.8722.82
Density (items/m2)0.451.130.471.220.180.820.300.8300.3400.35
Total158

Appendix B

Technical and environmental parameters used for aero-photogrammetric surveys carried out in the frame of this study for training and test sites.
Table A1. Torre Guaceto Beach (Brindisi, Italy).
Table A1. Torre Guaceto Beach (Brindisi, Italy).
Flight AttributesDescription
Type of UAVDJI Inspire 2
Time of the day12:00 p.m.–13:00 p.m.
Camera resolution20.8 MP
Camera filtersDJI MFT 15 mm/1.7 ASPH supported lens
Camera inclination90° to the ground
Flight altitude10 m/15 m
Speed1.3 m/s
Picture side overlap65%
Picture front overlap80%
Target areaForeshore, backshore, and dune
Length of target area100 m
Tidal rangeMicro-tidal
GSD0.19 cm/px–0.32 cm/px
Table A2. Capitolo Beach (Bari, Italy).
Table A2. Capitolo Beach (Bari, Italy).
Flight AttributesDescription
Type of UAVDJI Inspire 2
Time of the day10:15 a.m.–12:00 p.m.
Camera resolution20.8 MP
Camera filtersNA 1
Camera inclination90° to the ground
Flight altitude20 m
Speed2.3 m/s
Picture side overlap80%
Picture front overlap75%
Target areaForeshore, backshore, and dune
Length of target area50 m
Tidal rangeMicro-tidal
GSD0.4 cm/px
1 Not Assigned.
Table A3. Leirosa Beach (Coimbra, Portugal).
Table A3. Leirosa Beach (Coimbra, Portugal).
Flight AttributesDescription
Type of UAVDJI Mavic Air 2
Time of the day11:24 a.m.–13:14 p.m.
Camera resolution12.0 MP
Camera filtersNA 1
Camera inclination90° to the ground
Flight altitude20 m
Speed1.94 m/s
Picture side overlap85%
Picture front overlap75%
Target areaForeshore, backshore, and dune
Length of target area240 m
Tidal rangeMeso-tidal
GSD0.4 cm/px
1 Not Assigned.

Appendix C

In this appendix, a simplified transcription of the Mask-RCNN-based algorithm pseudocode customized for the automatic detection purposes is provided.
In Step 1, we define variables such as M for the Mask-RCNN model and {(I_i, A_i)} for the beach litter dataset, where I_i represents an image and A_i represents its annotations. We also use mathematical notation for iterations and conditions in the training loop.
In Step 2, we preprocess images (I_i_preprocessed) and use a loop to iterate through each instance detected by the model, extracting pixel coordinates and masks.
Step 3 involves calculating the ground coordinates (x_ground_i, y_ground_i) for each pixel within the mask using mathematical formulas. We then calculate the average latitude and longitude (avg_latitude_i, avg_longitude_i) for the litter instance. This algorithm provides a detailed mathematical representation of the steps involved in fine-tuning a Mask-RCNN model, making inferences on TIFF files, and converting pixel coordinates to exact geographic locations for beach litter assessment.
Python TensorFlow was used for the machine learning tasks, and the experiments were run on a NVIDIA Tesla T4 GPU within the Google Colab platform. This information provides context about the software and hardware environment used for the algorithm’s implementation.
  • Step 1: Fine-tune COCO pre-trained Mask-RCNN model with beach litter data
    • Initialize Mask-RCNN model (M) with pre-trained weights from COCO dataset
    • Load beach litter dataset: {(I_1, A_1), (I_2, A_2), …, (I_n, A_n)} # n images with annotations
    • Define new classification labels for beach litter categories: {C_1, C_2, …, C_k} # k categories
    • Modify model’s final classification layer to accommodate new labels
    • Train the modified model (M’) on the beach litter dataset with fine-tuning:
    for epoch in range(num_epochs):
      for batch in DataLoader(beach_litter_dataset, batch_size=batch_size, shuffle=True):
        input_images, annotations = batch
        predicted_masks, predicted_labels = M’.forward(input_images)
        compute_loss(predicted_masks, predicted_labels, annotations)
        update_model_weights()
  • Step 2: Make inference on unseen drone flight TIFF files
    • Load the fine-tuned Mask-RCNN model (M’)
    • For each unseen drone flight TIFF file (D_i):
    Load TIFF image (I_i)
    Preprocess the image: I_i_preprocessed = preprocess(I_i)
    Perform instance segmentation using M’:
      predicted_masks, predicted_labels = M’.forward(I_i_preprocessed)
    Extract pixel coordinates and masks for detected beach litter instances:
      for instance in range(num_instances):
        mask_i = predicted_masks[instance]
        label_i = predicted_labels[instance]
        pixel_coordinates_i = extract_pixel_coordinates(mask_i)
        save_segmented_image(I_i, pixel_coordinates_i, label_i)
  • Step 3: Convert pixel coordinates to exact geographic locations
    • For each segmented beach litter instance (B_i):
    Extract pixel coordinates (x_pixel, y_pixel) and mask (mask_i)
    Use geographic reference system information:
      Upper left point coordinates (UL_x, UL_y)
      Pixel-to-ground distance conversion factor (D_pixel_to_ground)
    Calculate the ground coordinates (latitude, longitude) for each pixel within the mask:
      for pixel in mask_i:
        x_pixel_i, y_pixel_i = pixel
        x_ground_i = UL_x + x_pixel_i * D_pixel_to_ground
        y_ground_i = UL_y + y_pixel_i * D_pixel_to_ground
        save_coordinates(x_ground_i, y_ground_i)
    Calculate the exact geographic location for the beach litter instance:
      avg_latitude_i = mean of all x_ground_i values
      avg_longitude_i = mean of all y_ground_i values
      save_exact_location(avg_latitude_i, avg_longitude_i)
    Output: Segmented images with pixel coordinates and objects UTM locations

References

  1. Wenneker, B.; Oosterbaan, L. Guideline for Monitoring Marine Litter on the Beaches in the OSPAR Maritime Area, 1st ed.; OSPAR Commission: London, UK, 2010; ISBN 90-3631-973-9. [Google Scholar]
  2. McGlade, J.; Samy Fahim, I.; Green, D.; Landrigan, P.; Andrady, A.; Costa, M.; Geyer, R.; Gomes, R.; Tan Shau Hwai, A.; Jambeck, J. From Pollution to Solution: A Global Assessment of Marine Litter and Plastic Pollution; UN Eveironment Programme: Nairobi, Kenya, 2021. [Google Scholar]
  3. Asensio-Montesinos, F.; Anfuso, G.; Randerson, P.; Williams, A.T. Seasonal Comparison of Beach Litter on Mediterranean Coastal Sites (Alicante, SE Spain). Ocean. Coast. Manag. 2019, 181, 104914. [Google Scholar] [CrossRef]
  4. Williams, A. Definitions and Typologies of Coastal Tourism Beach Destinations. In Disappearing Destinations: Climate Change and Future Challenges for Coastal Tourism; CABI: Wallingford, UK, 2011; pp. 47–65. [Google Scholar]
  5. Pravettoni, R. Plastic Debris in the Ocean: A Multiplicity of Sources and Pathways. In Marine Litter Vital Graphics; GRID-Arendal: Arendal, Norway, 2016. [Google Scholar]
  6. United Nations Environment Programme. Marine Litter: An Analytical Overview; UNEP: Nairobi, Kenya, 2005. [Google Scholar]
  7. Addamo, A.M.; Laroche, P.; Hanke, G. Top Marine Beach Litter Items in Europe; Joint Research Centre: Luxembourg, 2017; p. 123. [Google Scholar]
  8. Cesarano, C.; Aulicino, G.; Cerrano, C.; Ponti, M.; Puce, S. Scientific Knowledge on Marine Beach Litter: A Bibliometric Analysis. Mar. Pollut. Bull. 2021, 173, 113102. [Google Scholar] [CrossRef]
  9. Kershaw, P.; Turra, A.; Galgani, F. Guidelines for the Monitoring and Assessment of Plastic Litter and Microplastics in the Ocean. Ocean Best Pract. Syst. 2019, 99, 130. [Google Scholar]
  10. Rizzo, A.; Rangel-Buitrago, N.; Impedovo, A.; Mastronuzzi, G.; Scardino, G.; Scicchitano, G. A Rapid Assessment of Litter Magnitudes and Impacts along the Torre Guaceto Marine Protected Area (Brindisi, Italy). Mar. Pollut. Bull. 2021, 173, 112987. [Google Scholar] [CrossRef]
  11. Asensio-Montesinos, F.; Anfuso, G.; Ramírez, M.O.; Smolka, R.; Sanabria, J.G.; Enríquez, A.F.; Arenas, P.; Bedoya, A.M. Beach Litter Composition and Distribution on the Atlantic Coast of Cádiz (SW Spain). Reg. Stud. Mar. Sci. 2020, 34, 101050. [Google Scholar] [CrossRef]
  12. Rangel-Buitrago, N.; Mendoza, A.V.; Mantilla-Barbosa, E.; Arroyo-Olarte, H.; Arana, V.A.; Trilleras, J.; Gracia, A.; Neal, W.J.; Williams, A.T. Plastic Pollution on the Colombian Central Caribbean Beaches. Mar. Pollut. Bull. 2021, 162, 111837. [Google Scholar] [CrossRef] [PubMed]
  13. Andriolo, U.; Gonçalves, G.; Bessa, F.; Sobral, P. Mapping Marine Litter on Coastal Dunes with Unmanned Aerial Systems: A Showcase on the Atlantic Coast. Sci. Total Environ. 2020, 736, 139632. [Google Scholar] [CrossRef] [PubMed]
  14. Maratea, A.; Ferone, A. Deep Neural Networks and Explainable Machine Learning. In Fuzzy Logic and Applications; Fullér, R., Giove, S., Masulli, F., Eds.; Lecture Notes in Computer Science; Springer: Cham, Switzerland, 2019; Volume 11291. [Google Scholar]
  15. Bao, Z.; Sha, J.; Li, X.; Hanchiso, T.; Shifaw, E. Monitoring of Beach Litter by Automatic Interpretation of Unmanned Aerial Vehicle Images Using the Segmentation Threshold Method. Mar. Pollut. Bull. 2018, 137, 388–398. [Google Scholar] [CrossRef] [PubMed]
  16. Martin, C.; Parkes, S.; Zhang, Q.; Zhang, X.; McCabe, M.F.; Duarte, C.M. Use of Unmanned Aerial Vehicles for Efficient Beach Litter Monitoring. Mar. Pollut. Bull. 2018, 131, 662–673. [Google Scholar] [CrossRef]
  17. Fallati, L.; Polidori, A.; Salvatore, C.; Saponari, L.; Savini, A.; Galli, P. Anthropogenic Marine Debris Assessment with Unmanned Aerial Vehicle Imagery and Deep Learning: A Case Study along the Beaches of the Republic of Maldives. Sci. Total Environ. 2019, 693, 133581. [Google Scholar] [CrossRef]
  18. Gonçalves, G.; Andriolo, U.; Gonçalves, L.; Sobral, P.; Bessa, F. Quantifying Marine Macro Litter Abundance on a Sandy Beach Using Unmanned Aerial Systems and Object-Oriented Machine Learning Methods. Remote Sens. 2020, 12, 2599. [Google Scholar] [CrossRef]
  19. Bak, S.; Hwang, D.; Kim, H.; Yoon, H. Detection and Monitoring of Beach Litter Using Uav Image and Deep Neural Network. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2019, 42, 55–58. [Google Scholar] [CrossRef]
  20. Hidaka, M.; Matsuoka, D.; Sugiyama, D.; Murakami, K.; Kako, S. Pixel-Level Image Classification for Detecting Beach Litter Using a Deep Learning Approach. Mar. Pollut. Bull. 2022, 175, 113371. [Google Scholar] [CrossRef] [PubMed]
  21. Winans, W.R.; Chen, Q.; Qiang, Y.; Franklin, E.C. Large-Area Automatic Detection of Shoreline Stranded Marine Debris Using Deep Learning. Int. J. Appl. Earth Obs. Geoinf. 2023, 124, 103515. [Google Scholar] [CrossRef]
  22. Duarte, D.; Andriolo, U.; Gonçalves, G. Addressing the Class Imbalance Problem in the Automatic Image Classification of Coastal Litter from Orthophotos Derived from UAS Imagery. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2020, 3, 439–445. [Google Scholar] [CrossRef]
  23. Scarrica, V.M.; Aucelli, P.P.; Cagnazzo, C.; Casolaro, A.; Fiore, P.; La Salandra, M.; Rizzo, A.; Scardino, G.; Scicchitano, G.; Staiano, A. A Novel Beach Litter Analysis System Based on UAV Images and Convolutional Neural Networks. Ecol. Inform. 2022, 72, 101875. [Google Scholar] [CrossRef]
  24. Gonçalves, G.; Andriolo, U. Operational Use of Multispectral Images for Macro-Litter Mapping and Categorization by Unmanned Aerial Vehicle. Mar. Pollut. Bull. 2022, 176, 113431. [Google Scholar] [CrossRef]
  25. Balsi, M.; Moroni, M.; Chiarabini, V.; Tanda, G. High-Resolution Aerial Detection of Marine Plastic Litter by Hyperspectral Sensing. Remote Sens. 2021, 13, 1557. [Google Scholar] [CrossRef]
  26. Consorzio di Gestione di Torre Guaceto. S.I.C. “Torre Guaceto e Macchia S. Giovanni” (IT9140005). Available online: https://www.riservaditorreguaceto.it/index.php/it/l-ente-gestore/il-consorzio-di-gestione/le-finalita-del-consorzio-di-gestione/22-ente-gestore/123-il-consorzio-di-gestione (accessed on 5 July 2024).
  27. Mastronuzzi, G.; Caputo, R.; Di Bucci, D.; Fracassi, U.; Iurilli, V.; Milella, M.; Pignatelli, C.; Sanso, P.; Selleri, G. Middle-Late Pleistocene Evolution of the Adriatic Coastline of Southern Apulia (Italy) in Response to Relative Sea-Level Changes. Geogr. Fis. E Din. Quat. 2011, 34, 207–221. [Google Scholar] [CrossRef]
  28. Mastronuzzi, G.; Milella, M.; Piscitelli, A.; Simone, O.; Quarta, G.; Scarano, T.; Calcagnile, L.; Spada, I. Landscape Analysis in Torre Guaceto Area (Brindisi) Aimed at the Reconstruction of the Late Holocene Sea Level Curve. Geogr. Fis. Din. Quat 2018, 41, 65–79. [Google Scholar]
  29. Vlachogianni, T. Methodology for Monitoring Marine Litter on Beaches: Macro-Debris (>2.5 cm). 2017. Available online: https://mio-ecsde.org/wp-content/uploads/2017/11/Beach-litter_monitoring-methodology.pdf (accessed on 6 July 2024).
  30. Galgani, F.; Ruiz-Orejón, L.F.; Ronchi, F.; Tallec, K.; Fischer, E.K.; Matiddi, M.; Anastasopoulou, A.; Andresmaa, E.; Angiolillo, M.; Bakker Paiva, M.; et al. Guidance on the Monitoring of Marine Litter in European Seas; EUR 31539 EN; Publications Office of the European Union: Luxembourg, 2023; Volume JRC133594, ISBN 978-92-68-04093-5. [Google Scholar]
  31. Fleet, D.; Vlachogianni, T.; Hanke, G. A Joint List of Litter Categories for Marine Macrolitter Monitoring; Joint Research Centre: Luxembourg, 2021; p. 52. [Google Scholar]
  32. Westoby, M.J.; Brasington, J.; Glasser, N.F.; Hambrey, M.J.; Reynolds, J.M. ‘Structure-from-Motion’ Photogrammetry: A Low-Cost, Effective Tool for Geoscience Applications. Geomorphology 2012, 179, 300–314. [Google Scholar] [CrossRef]
  33. Ventura, D.; Bonifazi, A.; Gravina, M.F.; Belluscio, A.; Ardizzone, G. Mapping and Classification of Ecologically Sensitive Marine Habitats Using Unmanned Aerial Vehicle (UAV) Imagery and Object-Based Image Analysis (OBIA). Remote Sens. 2018, 10, 1331. [Google Scholar] [CrossRef]
  34. Legambiente I dati Dell’indagine Beach Litter 2023 e gli Appuntamenti di Spiagge e Fondali Puliti. 2023. Available online: https://www.legambiente.it/comunicati-stampa/i-dati-dellindagine-beach-litter-2023-e-gli-appuntamenti-di-spiagge-e-fondali-puliti/ (accessed on 16 September 2023).
  35. He, K.; Gkioxari, G.; Dollár, P.; Girshick, R. Mask R-CNN. In Proceedings of the 2017 IEEE International Conference on Computer Vision (ICCV), Venice, Italy, 22–29 October 2017; pp. 2961–2969. [Google Scholar]
  36. Lin, T.-Y.; Maire, M.; Belongie, S.; Hays, J.; Perona, P.; Ramanan, D.; Dollár, P.; Zitnick, C.L. Microsoft Coco: Common Objects in Context; Springer: Cham, Switzerland, 2014; pp. 740–755. [Google Scholar]
  37. Cortes, C.; Vapnik, V. Support-Vector Networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
  38. Grizonnet, M.; Michel, J.; Poughon, V.; Inglada, J.; Savinaud, M.; Cresson, R. Orfeo ToolBox: Open Source Processing of Remote Sensing Images. Open Geospat. Data Softw. Stand. 2017, 2, 1–8. [Google Scholar] [CrossRef]
  39. Blaschke, T.; Lang, S.; Hay, G.J. Object-Based Image Analysis—Spatial Concepts for Knowledge-Driven Remote Sensing Applications; Springer: Cham, Switzerland, 2008; ISBN 978-3-540-79131-7. [Google Scholar]
  40. Cohen, J. A Coefficient of Agreement for Nominal Scales. Educ. Psychol. Meas. 1960, 20, 37–46. [Google Scholar] [CrossRef]
  41. Cesarano, C.; Aulicino, G.; Cerrano, C.; Ponti, M.; Puce, S. Marine Beach Litter Monitoring Strategies along Mediterranean Coasts. A Methodological Review. Mar. Pollut. Bull. 2023, 186, 114401. [Google Scholar] [CrossRef]
  42. Zielinski, S.; Anfuso, G.; Botero, C.M.; Milanes, C.B. Beach Litter Assessment: Critical Issues and the Path Forward. Sustainability 2022, 14, 11994. [Google Scholar] [CrossRef]
  43. Rangel-Buitrago, N.; Barría-Herrera, J.; Vergara-Cortés, H.; Contreras-López, M.; Agredano, R. A Snapshot of the Litter Problem along the Viña Del Mar-Concón Coastal Strip, Valparaíso Region, Chile. Mar. Pollut. Bull. 2020, 160, 111524. [Google Scholar] [CrossRef]
  44. Rangel-Buitrago, N.; Williams, A.; Anfuso, G.; Arias, M.; Gracia, A. Magnitudes, Sources, and Management of Beach Litter along the Atlantico Department Coastline, Caribbean Coast of Colombia. Ocean. Coast. Manag. 2017, 138, 142–157. [Google Scholar] [CrossRef]
  45. Botero, C.M.; Tamayo, D.; Zielinski, S.; Anfuso, G. Qualitative and Quantitative Beach Cleanliness Assessment to Support Marine Litter Management in Tropical Destinations. Water 2021, 13, 3455. [Google Scholar] [CrossRef]
  46. Asensio-Montesinos, F.; Anfuso, G.; Williams, A. Beach Litter Distribution along the Western Mediterranean Coast of Spain. Mar. Pollut. Bull. 2019, 141, 119–126. [Google Scholar] [CrossRef] [PubMed]
  47. Andriolo, U.; Gonçalves, G.; Sobral, P.; Fontán-Bouzas, Á.; Bessa, F. Beach-Dune Morphodynamics and Marine Macro-Litter Abundance: An Integrated Approach with Unmanned Aerial System. Sci. Total Environ. 2020, 749, 141474. [Google Scholar] [CrossRef] [PubMed]
  48. Andriolo, U.; Gonçalves, G.; Rangel-Buitrago, N.; Paterni, M.; Bessa, F.; Gonçalves, L.M.; Sobral, P.; Bini, M.; Duarte, D.; Fontán-Bouzas, Á. Drones for Litter Mapping: An Inter-Operator Concordance Test in Marking Beached Items on Aerial Images. Mar. Pollut. Bull. 2021, 169, 112542. [Google Scholar] [CrossRef]
  49. Andriolo, U.; Gonçalves, G.; Sobral, P.; Bessa, F. Spatial and Size Distribution of Macro-Litter on Coastal Dunes from Drone Images: A Case Study on the Atlantic Coast. Mar. Pollut. Bull. 2021, 169, 112490. [Google Scholar] [CrossRef]
  50. Gonçalves, G.; Andriolo, U.; Gonçalves, L.M.; Sobral, P.; Bessa, F. Beach Litter Survey by Drones: Mini-Review and Discussion of a Potential Standardization. Environ. Pollut. 2022, 315, 120370. [Google Scholar] [CrossRef]
  51. Moy, K.; Neilson, B.; Chung, A.; Meadows, A.; Castrence, M.; Ambagis, S.; Davidson, K. Mapping Coastal Marine Debris Using Aerial Imagery and Spatial Analysis. Mar. Pollut. Bull. 2018, 132, 52–59. [Google Scholar] [CrossRef] [PubMed]
  52. Pfeiffer, R.; Valentino, G.; D’Amico, S.; Piroddi, L.; Galone, L.; Calleja, S.; Farrugia, R.A.; Colica, E. Use of UAVs and Deep Learning for Beach Litter Monitoring. Electronics 2022, 12, 198. [Google Scholar] [CrossRef]
  53. Prasad, S.V.S.; Savitri, T.; Krishna, I. Classification of Multispectral Satellite Images Using Clustering with SVM Classifier. Int. J. Comput. Appl. 2011, 35, 32–44. [Google Scholar] [CrossRef]
  54. Ferone, A.; Maratea, A. Decoy clustering through graded possibilistic c-medoids. In Proceedings of the 2017 IEEE International Conference on Fuzzy Systems (FUZZ-IEEE), Naples, Italy, 9–12 July 2017. [Google Scholar]
  55. Walker, T. (Micro)Plastics and the UN Sustainable Development Goals. Curr. Opin. Green Sustain. Chem. 2021, 30, 100497. [Google Scholar] [CrossRef]
  56. Löhr, A.; Savelli, H.; Beunen, R.; Kalz, M.; Ragas, A.; Van Belleghem, F. Solutions for Global Marine Litter Pollution. Curr. Opin. Environ. Sustain. 2017, 28, 90–99. [Google Scholar] [CrossRef]
  57. Rizzo, A.; Scicchitano, G.; Mastronuzzi, G. A Set of Guidelines as Support for the Integrated Geo-Environmental Characterization of Highly Contaminated Coastal Sites. Sci. Rep. 2024, 14, 8198. [Google Scholar] [CrossRef] [PubMed]
Figure 1. (a) Geographic location of the study area situated along the Adriatic coast of the Apulia Region (southern Italy); (b) seaworth boundary of the MPA of Torre Guaceto; (c) location of the investigated beach sector in the MPA.
Figure 1. (a) Geographic location of the study area situated along the Adriatic coast of the Apulia Region (southern Italy); (b) seaworth boundary of the MPA of Torre Guaceto; (c) location of the investigated beach sector in the MPA.
Remotesensing 16 03617 g001
Figure 2. Schematic synthesis of the approaches applied in this study for BL detection and classification. Each approach is described in the following sub-paragraphs: (3.1) in situ surveys; (3.2) aero-photogrammetric surveys; (3.3) manual visual screening analysis; and (3.4–3.5) dataset creation for automatic beach litter analysis tools.
Figure 2. Schematic synthesis of the approaches applied in this study for BL detection and classification. Each approach is described in the following sub-paragraphs: (3.1) in situ surveys; (3.2) aero-photogrammetric surveys; (3.3) manual visual screening analysis; and (3.4–3.5) dataset creation for automatic beach litter analysis tools.
Remotesensing 16 03617 g002
Figure 3. Box areas (A–F) were identified at Torre Guaceto Beach for the comparison of the methodologies applied for BL analysis. Targets for GCPs were also used to define box extension, in addition to georeferencing purposes.
Figure 3. Box areas (A–F) were identified at Torre Guaceto Beach for the comparison of the methodologies applied for BL analysis. Targets for GCPs were also used to define box extension, in addition to georeferencing purposes.
Remotesensing 16 03617 g003
Figure 4. Workflow of the post-processing of the UAV images applied to obtain the orthomosaic test site.
Figure 4. Workflow of the post-processing of the UAV images applied to obtain the orthomosaic test site.
Remotesensing 16 03617 g004
Figure 5. Diagram reporting the Mask-RCNN working network. In the output image, automatic segmented objects are identified by colored masks in rectangular bounding boxes.
Figure 5. Diagram reporting the Mask-RCNN working network. In the output image, automatic segmented objects are identified by colored masks in rectangular bounding boxes.
Remotesensing 16 03617 g005
Figure 6. Graph reporting the most abundant litter categories identified at Torre Guaceto Beach by the in situ counting activities (Plastic caps: J21-J23-J24; drink bottles: J7-J8; fishing nets: J53-J54-J58; plastic fragments > 2.5 cm: J79-J80-J82-J124; paper fragments: J156; shoes and sandals: J71-J138).
Figure 6. Graph reporting the most abundant litter categories identified at Torre Guaceto Beach by the in situ counting activities (Plastic caps: J21-J23-J24; drink bottles: J7-J8; fishing nets: J53-J54-J58; plastic fragments > 2.5 cm: J79-J80-J82-J124; paper fragments: J156; shoes and sandals: J71-J138).
Remotesensing 16 03617 g006
Figure 7. Map reporting the BL items identified at Torre Guaceto Beach by visual screening.
Figure 7. Map reporting the BL items identified at Torre Guaceto Beach by visual screening.
Remotesensing 16 03617 g007
Figure 8. Comparison between in situ survey and visual screening outputs obtained at Torre Guaceto Beach, in terms of the total number of identified objects. The acronyms on the abscissae axis stand for subdivisions of the six test box areas (A, B, C, D, E, F).
Figure 8. Comparison between in situ survey and visual screening outputs obtained at Torre Guaceto Beach, in terms of the total number of identified objects. The acronyms on the abscissae axis stand for subdivisions of the six test box areas (A, B, C, D, E, F).
Remotesensing 16 03617 g008
Figure 9. Confusion matrices representing Mask-RCNN-based algorithm accuracy on classification using Dataset 1. Overall accuracy, user accuracy (U.A.), precision accuracy (P.A.), and f1-score for each class are also reported. Null values are due to a lack of output.
Figure 9. Confusion matrices representing Mask-RCNN-based algorithm accuracy on classification using Dataset 1. Overall accuracy, user accuracy (U.A.), precision accuracy (P.A.), and f1-score for each class are also reported. Null values are due to a lack of output.
Remotesensing 16 03617 g009
Figure 10. Map reporting the spatial distribution of BL items recognized at Torre Guaceto Beach by Mask-RCNN using Dataset 2 for training. True positives are objects identified as BL items and correctly recognized.
Figure 10. Map reporting the spatial distribution of BL items recognized at Torre Guaceto Beach by Mask-RCNN using Dataset 2 for training. True positives are objects identified as BL items and correctly recognized.
Remotesensing 16 03617 g010
Figure 11. Results of the three experiments performed by applying the Mask-RCNN-based algorithm and taking into account the three different dataset settings for the same tile: (a) Dataset 1 with an accuracy value of 1.7% (mAP@IoU = 0.017); (b) Dataset 2 with an accuracy value of 2.0% (mAP@IoU = 0.020); and (c) Dataset 3 with an accuracy value of 2.5% (mAP@IoU = 0.025). Tiles shown in “b” and “c” are characterized by similar outputs because of the quite similar number of training polygons of Dataset 2 and Dataset 3. The square tile has a side length of 2 m. Colors of polygons are randomly generated.
Figure 11. Results of the three experiments performed by applying the Mask-RCNN-based algorithm and taking into account the three different dataset settings for the same tile: (a) Dataset 1 with an accuracy value of 1.7% (mAP@IoU = 0.017); (b) Dataset 2 with an accuracy value of 2.0% (mAP@IoU = 0.020); and (c) Dataset 3 with an accuracy value of 2.5% (mAP@IoU = 0.025). Tiles shown in “b” and “c” are characterized by similar outputs because of the quite similar number of training polygons of Dataset 2 and Dataset 3. The square tile has a side length of 2 m. Colors of polygons are randomly generated.
Remotesensing 16 03617 g011
Figure 12. Confusion matrices representing SVM performance accuracy on pixel classification obtained after the validation phase using (a) Dataset S1 with an overall accuracy value of 82% (K index = 0.811); (b) Dataset S2 with an overall accuracy value of 88% (K index = 0.850); and (c) Dataset S3 with an overall accuracy value of 86% (K index = 0.831). Precision accuracy (P.A.) and f1-score for each class are also reported. The pixel number in the “Sum” column refers to the validation pixels considering the groundtruth.
Figure 12. Confusion matrices representing SVM performance accuracy on pixel classification obtained after the validation phase using (a) Dataset S1 with an overall accuracy value of 82% (K index = 0.811); (b) Dataset S2 with an overall accuracy value of 88% (K index = 0.850); and (c) Dataset S3 with an overall accuracy value of 86% (K index = 0.831). Precision accuracy (P.A.) and f1-score for each class are also reported. The pixel number in the “Sum” column refers to the validation pixels considering the groundtruth.
Remotesensing 16 03617 g012
Figure 13. Visual results (without groundtruth) of the three experiments performed by applying the SVM algorithm and taking into account the three different dataset settings: (a) Dataset S1; (b) Dataset S2; (c) Dataset S3 for the same tile (cf. Figure 11). The square tile has a side length of 2 m.
Figure 13. Visual results (without groundtruth) of the three experiments performed by applying the SVM algorithm and taking into account the three different dataset settings: (a) Dataset S1; (b) Dataset S2; (c) Dataset S3 for the same tile (cf. Figure 11). The square tile has a side length of 2 m.
Remotesensing 16 03617 g013
Table 1. UAV survey performed to collect images for the creation of the dataset (used as training and test).
Table 1. UAV survey performed to collect images for the creation of the dataset (used as training and test).
SiteN° of FlightsAltitude (a.g.l.)Task
Torre Guaceto (Italy)210 mtraining
210 mtest
Capitolo (Italy)120 mtraining
Leirosa (Portugal)120 mtraining
Table 2. Parameters of orthomosaic obtained by the UAV image post-processing.
Table 2. Parameters of orthomosaic obtained by the UAV image post-processing.
Zone (Length)Flight Height AGL (m)N° of ImagesGSD (cm/Pixel)
Torre Guaceto (100 m)104360.2
152360.3
Capitolo (100 m)20700.3
Leirosa (100 m)202350.4
Table 3. Total number of training polygons obtained by the training tiles for each class in the three experiments and relative output.
Table 3. Total number of training polygons obtained by the training tiles for each class in the three experiments and relative output.
ExperimentsTraining SettingOutput
N° of Training TilesLabelsN° of Training PolygonsN° of BL Items as OutputLitter Density (Items/m2)
Dataset 1164Bottles403171.13
Worked wood18
Other740
Dataset 282Bottles33240.09
Worked wood36
Other123
Dataset 363Bottles28160.06
Worked wood38
Nets38
Table 4. Total number of training polygons, and the relative sum of included pixels, labeled for each class in the three experiments conducted with SVM.
Table 4. Total number of training polygons, and the relative sum of included pixels, labeled for each class in the three experiments conducted with SVM.
LabelsN° of Training PolygonsN° of Pixels
Dataset S1Bottles2059,827
Worked wood20169,822
Other20106,651
Sediment20600,193
Dataset S2Bottles2059,827
Worked wood20169,822
Nets2063,700
Sediment20600,193
Dataset S3Bottles2059,827
Worked wood20169,822
Nets2275,034
Other1846,596
Sediment20600,193
Table 5. Technical and environmental parameters for an aero-photogrammetric survey aimed to generate a dataset for automatic detection through ML tools.
Table 5. Technical and environmental parameters for an aero-photogrammetric survey aimed to generate a dataset for automatic detection through ML tools.
Flight AttributesGuidelines
Type of UAVA commercial drone for a cost-effective technique; a quadcopter is preferred to sustain high wind speed
Time of the dayPreferably around midday to limit object shadows and favor a better automatic processing of images
Camera resolution>12 MP
Camera filtersOptical for the recognition of all debris types
Camera inclination90° to the ground
Flight altitudeBetween 10 and 20 m
SpeedMax 2 m/s
Picture side overlap>65%
Picture front overlap>80%
Target areaForeshore, backshore, and dune
Length of target areaAt least 100 m
Tidal rangeMicro-, meso-, and macro-tidal
GSD<1 cm
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sozio, A.; Scarrica, V.M.; Rizzo, A.; Aucelli, P.P.C.; Barracane, G.; Dimuccio, L.A.; Ferreira, R.; La Salandra, M.; Staiano, A.; Tarantino, M.P.; et al. Application of Direct and Indirect Methodologies for Beach Litter Detection in Coastal Environments. Remote Sens. 2024, 16, 3617. https://doi.org/10.3390/rs16193617

AMA Style

Sozio A, Scarrica VM, Rizzo A, Aucelli PPC, Barracane G, Dimuccio LA, Ferreira R, La Salandra M, Staiano A, Tarantino MP, et al. Application of Direct and Indirect Methodologies for Beach Litter Detection in Coastal Environments. Remote Sensing. 2024; 16(19):3617. https://doi.org/10.3390/rs16193617

Chicago/Turabian Style

Sozio, Angelo, Vincenzo Mariano Scarrica, Angela Rizzo, Pietro Patrizio Ciro Aucelli, Giovanni Barracane, Luca Antonio Dimuccio, Rui Ferreira, Marco La Salandra, Antonino Staiano, Maria Pia Tarantino, and et al. 2024. "Application of Direct and Indirect Methodologies for Beach Litter Detection in Coastal Environments" Remote Sensing 16, no. 19: 3617. https://doi.org/10.3390/rs16193617

APA Style

Sozio, A., Scarrica, V. M., Rizzo, A., Aucelli, P. P. C., Barracane, G., Dimuccio, L. A., Ferreira, R., La Salandra, M., Staiano, A., Tarantino, M. P., & Scicchitano, G. (2024). Application of Direct and Indirect Methodologies for Beach Litter Detection in Coastal Environments. Remote Sensing, 16(19), 3617. https://doi.org/10.3390/rs16193617

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop