Article

Machine Learning Vegetation Filtering of Coastal Cliff and Bluff Point Clouds

by
Phillipe Alan Wernette
Great Lakes Research Center, Michigan Technological University, Houghton, MI 49931, USA
Remote Sens. 2024, 16(12), 2169; https://doi.org/10.3390/rs16122169
Submission received: 17 April 2024 / Revised: 4 June 2024 / Accepted: 12 June 2024 / Published: 15 June 2024

Abstract

Coastal cliffs erode in response to short- and long-term environmental changes, but predicting these changes continues to be a challenge. In addition to a chronic lack of data on the cliff face, vegetation presence and growth can bias our erosion measurements and limit our ability to detect geomorphic erosion by obscuring the cliff face. This paper builds on past research segmenting vegetation in three-band red, green, blue (RGB) imagery and presents two approaches to segmenting and filtering vegetation from the bare cliff face in dense point clouds constructed from RGB images and structure-from-motion (SfM) software. Vegetation indices from previously published research were computed, and their utility in segmenting vegetation from the bare cliff face was compared against machine learning (ML) models for point cloud segmentation. Results demonstrate that, while existing vegetation indices and ML models are both capable of segmenting vegetation and bare cliff face sediments, ML models can be more efficient and robust across different growing seasons. ML model accuracy quickly reached an asymptote with only two layers and RGB inputs alone (i.e., no vegetation indices), suggesting that these more parsimonious models may be more robust to a range of environmental conditions than existing vegetation indices, which vary substantially from one growing season to another with changes in vegetation phenology.

1. Introduction

Coastal cliffs and bluffs erode in response to a complex interaction of abiotic, biotic, and anthropogenic factors across a range of space and time scales. Our ability to accurately predict future change in coastal cliffs is, at least partially, predicated on our ability to understand how the cliff has changed in the past in response to these natural and anthropogenic factors [1]. Coastal change along low-relief barrier islands, beaches, and some coastal dunes can be well represented through historical and modern imagery [2,3,4,5], LIDAR surveys [6,7,8,9], and near-nadir (i.e., down-looking toward the Earth surface) structure from motion with multiview stereo (SfM) [5,8,9,10,11].
In contrast, high-relief coasts remain a data-poor environment and are only partially represented by near-nadir data sources and surveys. Because imagery and LIDAR are frequently collected at or near nadir, they either partially or completely fail to accurately represent vertical or near-vertical surfaces, such as coastal cliffs and bluffs [12,13]. Terrestrial LIDAR scanning (TLS) and oblique photogrammetry (aircraft, UAV, or land/water-based) do provide the ability to remotely capture data on vertical or near-vertical cliff faces and may be useful monitoring tools for cliff face erosion and talus deposition [13,14,15,16,17,18,19]. However, measuring geomorphic change along the cliff face is challenging where vegetation is present.
Vegetation growing on the cliff can obscure the cliff surface (Figure 1) and generally limit our ability to monitor localized erosion patterns that can contribute to large-scale slope destabilization [15]. Localized patterns can precede larger cliff failures [20], highlighting the importance of closely monitoring the cliff face for geomorphic changes without interference from any vegetation that may be present. Although tree trunks have been used to track landslide erosion [21], cliffs present a unique challenge as they may either lack distinct objects that can be tracked and used to measure change, or such objects may rest in a relatively stable position at the cliff base (Figure 1). In addition to obscuring our view of the cliff face, vegetation may also affect the coastal cliff stability [22], and it may be important to identify where vegetation is present and where it is absent. Given the importance of monitoring coastal cliff erosion in the presence of vegetation, it is important that we can efficiently and accurately segment bare-Earth points on the cliff face from vegetation so we can monitor the cliff face for potential leading indicators of slope instability, such as groundwater seepage faces, basal notching, and rock fractures.
An added challenge with coastal cliff monitoring is our digital representation of high-relief environments. Digital elevation models (DEMs) or digital surface models (DSMs) are often used to characterize landscape morphology and change over time. However, these 2D representations of the landscape oversimplify vertical and near-vertical slopes and may completely fail to represent any oversteepened or overhanging slopes where the cliff face or edge extends farther seaward than the rest of the cliff face. Furthermore, previous research demonstrates that vertical uncertainty with 2D landscape rasters increases as the landscape slope increases [12]. Triangular irregular networks (TINs) represent a more nuanced approach to representing cliffs, although their analysis becomes significantly more challenging because each triangular face has an irregular shape and size, and they tend to be much larger files. Because coastal cliffs are very steep and may be overhanging, a more realistic approach to representing and analyzing LIDAR, TLS, or SfM-derived cliff data is to deal directly with the colorized point cloud. Since point clouds often consist only of red, green, and blue (RGB) reflectance bands and do not contain a near-infrared (NIR) band, calculating vegetation indices like the normalized difference vegetation index (NDVI) is not possible. As a result, segmenting vegetation using NDVI or other NIR-based indices is also often not possible with point cloud data. In addition, utilizing point clouds with only RGB bands enables us to leverage older point clouds from a variety of sources lacking additional bands. Instead of relying on additional spectral bands, an approach that utilizes only RGB bands would therefore be more robust and more broadly valuable for a range of studies and environments.
Vegetation characterization and the filtering of point clouds can be accomplished using software, such as Agisoft Metashape 1.8.5 Professional [23] and CANUPO [24], or a combination thereof [25]. Agisoft Metashape Professional can display and classify/re-classify dense point clouds as ground, vegetation, and a range of other natural and built classes; however, it struggles with vertical and overhanging surfaces, such as cliffs (Figure 2). CANUPO is available as standalone software or an extension to CloudCompare [26] that can also reclassify point clouds into multiple classes and does function with vertical and overhanging surfaces. It uses a probabilistic classifier across multiple scales in a “plane of maximal separability”, although the resulting point cloud is prone to false positives for vegetation points, resulting in a speckling of noise in the reclassified point cloud (Figure 2). Other approaches, such as [25], operate using geometric and spectral signatures with a random forest (RF) machine learning (ML) model and may be more accurate, but they are not easily accessible and require more powerful computing resources. While RF models may outperform other commercially available software and are more easily interpretable, they can quickly grow overly complex and large, requiring more computing resources. As such, this paper explores multi-layer perceptron (MLP) ML models as robust alternatives to existing classifiers because MLP models are simpler and faster to develop and can be more efficient than existing point cloud classifiers.
This paper builds on previous research segmenting imagery [27,28] and point clouds [23,24,25,29,30,31] to test whether MLP models can be leveraged to segment dense coastal cliff point clouds into vegetation and bare-Earth points. Multiple model architectures and model inputs were tested to determine the most efficient, parsimonious, and robust ML models for vegetation segmentation. Multiple MLP models were compared using only RGB values against MLP models using RGB values plus one or more vegetation indices based on the previous literature. In addition, this paper tested whether the addition of 3D standard deviation (3D StDev) can improve MLP models for vegetation segmentation in point clouds. The feasibility of the MLP models was demonstrated using the SfM-derived point clouds of Elwha Bluffs, located along the Strait of Juan de Fuca in Washington state, USA. The most accurate, efficient, and robust models are compared to other point cloud classifiers, and the efficacy of this approach is highlighted using the above case study.

2. Materials and Methods

2.1. Vegetation Classification and Indices

Vegetation indices computed from RGB have been extensively used to segment vegetation from non-vegetation in imagery [31,32,33,34,35,36,37,38,39,40,41,42,43,44], and their development was often associated with commercial agriculture applications for evaluating crop or soil health when NIR or other bands were not available. However, these indices can be adapted by a broader remote-sensing community for more diverse applications, such as segmenting vegetation from bare-Earth points in dense point clouds. Some vegetation indices, like excess red (ExR; [36]), excess green (ExG; [35,40]), excess blue (ExB; [39]), and excess green minus red (ExGR; [37]), are relatively simple and efficient to compute, while others are more complex and take longer to calculate (NGRDI: [45]; MGRVI: [46]; GLI: [47]; RGBVI: [46]; IKAW: [33]; GLA: [48]). Regardless of the complexity of the algorithm, it is possible to adapt such indices to segment vegetation from bare-Earth points in any number of ways. Figure 3 illustrates how a single image can be transformed by different vegetation indices and binarized to segment vegetation from non-vegetation pixels.
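To make the simpler indices concrete, the sketch below computes ExR, ExG, ExB, and ExGR in Python from 8-bit RGB values, using the chromatic-coordinate forms that are common in the cited literature; the function name is illustrative, and the original references should be consulted for the authoritative formulation of each index in Table 1.

```python
import numpy as np

def excess_indices(rgb):
    """Compute ExR, ExG, ExB, and ExGR from an (N, 3) array of 8-bit RGB values.

    A sketch using the chromatic-coordinate forms common in the cited
    literature, not necessarily the exact formulation of every Table 1 index.
    """
    rgb = rgb.astype(float)
    total = rgb.sum(axis=1, keepdims=True)
    total[total == 0] = 1.0            # avoid division by zero on black points
    r, g, b = (rgb / total).T          # chromatic coordinates, each in [0, 1]
    exg = 2.0 * g - r - b              # excess green, range [-1, 2]
    exr = 1.4 * r - g                  # excess red, range [-1, 1.4]
    exb = 1.4 * b - g                  # excess blue, range [-1, 1.4]
    exgr = exg - exr                   # excess green minus red, range [-2.4, 3]
    return exr, exg, exb, exgr
```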
All indices used in this paper (Table 1) were well constrained with defined upper and lower limits ranging from −1 to 1.4 (ExR and ExB), −1 to 2 (ExG), −2.4 to 3 (ExGR), or −1 to 1 (NGRDI, MGRVI, GLI, RGBVI, IKAW, and GLA). To overcome the vanishing or exploding gradient issue with ML models and to ensure maximum transferability to additional locations and datasets, only vegetation indices with well constrained ranges were utilized here. The mathematics and development of each vegetation index are beyond the scope of this paper but may be found in the references herein. Furthermore, to address the issue of vanishing gradients due to different input ranges, all ML model inputs, including RGB and all vegetation indices, were normalized individually to range from 0 to 1 using Equation (1):
(xi − xmin)/(xmax − xmin)        (1)
where xi is a value for one point, xmin is the minimum value for all points, and xmax is the maximum value for all points. For RGB values, the minimum and maximum values were 0 and 255, respectively, corresponding to the 8-bit color depth range. For each vegetation index, the minimum and maximum values correspond to the value range in Table 1. For 3D StDev, the minimum was set to 0 and the maximum was set to the largest computed 3D StDev value.
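A minimal sketch of Equation (1) in Python, assuming the per-input bounds described above (0–255 for RGB, the Table 1 ranges for each vegetation index, and 0 to the maximum computed value for 3D StDev):

```python
import numpy as np

def minmax_normalize(x, x_min, x_max):
    """Equation (1): scale an input feature to the range [0, 1]."""
    x = np.asarray(x, dtype=float)
    return (x - x_min) / (x_max - x_min)

# Illustrative usage with the bounds described in the text:
# red_n = minmax_normalize(red, 0, 255)        # 8-bit color channel
# exg_n = minmax_normalize(exg, -1, 2)         # ExG bounds from Table 1
# sd_n  = minmax_normalize(sd, 0, sd.max())    # 3D StDev
```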
To differentiate vegetation from bare-Earth pixels or points, a decision must be made about the threshold value used to separate the two classes. Thresholding can be done in several ways: the simplest is a user-defined manual threshold, while a more statistically robust approach is Otsu’s thresholding method [48]. Manual thresholding is a brute-force approach that is simpler to implement but may vary greatly by application, user, and the index or indices being thresholded. In this approach, the user specifies a single value that is used as a break point to differentiate classes, where points above this value belong to one class and points below it belong to the other. Although manual thresholding may yield good results for some applications and users, the selection of a specific threshold value is highly subjective and may not be transferrable across other geographies or datasets.
Otsu’s thresholding method [48] is a more statistically robust and less subjective approach than manual thresholding. Ref. [49] demonstrated that Otsu’s thresholding was an effective approach to binarize a given index into two classes. This method assumes the index being binarized is bimodally distributed, where each mode represents a single class. If binarizing an index into vegetation and bare-Earth classes, one mode would represent vegetation pixels/points and the other mode would represent bare-Earth pixels/points. Given this assumption, a threshold between the two modes can be determined as the value that maximizes the between-class variance of the two resulting classes. Otsu’s thresholding is more likely to be transferrable to other datasets and/or geographies, depending on how representative the vegetation index values are of other datasets.
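For reference, below is a self-contained sketch of Otsu’s method on a 1D array of index values, implemented as a histogram-based search for the threshold that maximizes the between-class variance; production code could equally use an existing implementation such as scikit-image’s threshold_otsu.

```python
import numpy as np

def otsu_threshold(values, bins=256):
    """Return the threshold maximizing between-class variance (Otsu, 1979)."""
    hist, edges = np.histogram(values, bins=bins)
    prob = hist.astype(float) / hist.sum()
    centers = 0.5 * (edges[:-1] + edges[1:])
    w0 = np.cumsum(prob)                    # cumulative weight of the lower class
    w1 = 1.0 - w0                           # weight of the upper class
    mu = np.cumsum(prob * centers)          # cumulative mean
    valid = (w0 > 0) & (w1 > 0)
    sigma_b = np.zeros_like(w0)             # between-class variance per candidate
    sigma_b[valid] = (mu[-1] * w0[valid] - mu[valid]) ** 2 / (w0[valid] * w1[valid])
    return centers[np.argmax(sigma_b)]

# veg_mask = exg > otsu_threshold(exg)      # binarize an index into two classes
```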

2.2. ML Models

2.2.1. Deriving 3D Standard Deviation

In addition to the RGB values and vegetation indices, this paper tested whether the addition of 3D StDev improved vegetation segmentation performance and/or accuracy. The 3D StDev was computed for every point in the dense point cloud iteratively, similar to the approach of [50]. Points were first spatially queried using a cKDTree available within the scipy.spatial package [51], and the standard deviation was then computed for the center point using all points within the spatial query distance. Although there are other morphometrics [50,52] that may be valuable for vegetation segmentation, this work limited the morphometric input to 3D StDev, because it is relatively simple, as a first test of how influential additional point geometry may be. For testing and demonstration purposes, the 3D search radius was limited to 1.0 m, meaning that all points within a 1.0 m search radius were used to calculate the 3D StDev. If no other points were within this search radius, then the point was not used in model training, which also meant that any point being reclassified with the model had to have at least one neighboring point within the same search radius.
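A sketch of this per-point computation is shown below, assuming (as described above) a 1.0 m radius and skipping points with no neighbors; combining the per-axis standard deviations into a single scalar via their Euclidean norm is one plausible reading of “3D StDev”, not necessarily the published formulation.

```python
import numpy as np
from scipy.spatial import cKDTree

def point_3d_stdev(xyz, radius=1.0):
    """Per-point 3D standard deviation over a fixed spatial search radius."""
    tree = cKDTree(xyz)                                  # spatial index over all points
    stdev = np.full(len(xyz), np.nan)
    for i, idx in enumerate(tree.query_ball_point(xyz, r=radius)):
        if len(idx) > 1:                                 # skip points with no neighbors
            # one scalar from the per-axis (X, Y, Z) standard deviations
            stdev[i] = np.linalg.norm(xyz[idx].std(axis=0))
    return stdev
```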
The 1.0 m search radius served simply as a test of whether including point cloud geometry metrics could improve the segmentation process. Because 3D StDev or other point cloud geometry metrics were computed for every point, applying a larger search radius significantly increased the time and computing resources required to derive the 3D StDev. Another option to decrease the computation time for geometry metrics is to first decimate the input point clouds and then derive the metrics, although eliminating points may also remove the very points that would otherwise be leading indicators of larger erosion events in the future. As such, it is important that ML methods be able to segment whole point clouds without sub-sampling.

2.2.2. Multi-Layer Perceptron (MLP) Architecture and Inputs

This paper tested the utility of several MLP ML model architectures with different inputs, numbers of layers, and nodes per layer for efficient and accurate point cloud reclassification (Table A1). These MLP models were compared against the CANUPO classifier in CloudCompare because it is a well-established approach in a widely used software package. MLP models have been applied in previous work segmenting debris-covered glaciers [53], extracting coastlines [54], detecting and segmenting landslides [55], and segmenting vegetation in imagery [56], although the current work tested MLP as a simpler standalone segmentation approach for dense point clouds.
MLP models used here were built, trained, validated, and applied using Tensorflow [57] with Keras [58] in Python. They all followed a similar workflow and architecture (Figure 4), where data are first normalized to the range 0 to 1 and then processed through one or more dense, fully connected layers. The simplest models had only one densely connected layer, while the most complex models had six densely connected layers with an increasing number of nodes per layer. Varying the model architecture facilitated determining the most parsimonious model suitable for accurate vegetation segmentation. Every model included a dropout layer with a 20% chance of node dropout before the output layer to avoid model overfitting. Models used an Adam optimizer [59,60] to adapt the learning rate and increase the likelihood of reaching a global minimum, and a rectified linear unit (ReLU) activation function was used to overcome vanishing gradients and improve training and model performance.
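A minimal Keras sketch of this architecture, assuming inputs already normalized to [0, 1] via Equation (1); the two-node softmax output and the loss choice are illustrative, following the Figure 4 description rather than the published code.

```python
import tensorflow as tf

def build_mlp(n_inputs, nodes_per_layer=(16, 16)):
    """Dense MLP per Figure 4: dense layer(s), 20% dropout, 2-class output."""
    model = tf.keras.Sequential()
    model.add(tf.keras.Input(shape=(n_inputs,)))
    for n in nodes_per_layer:                                   # one or more dense layers
        model.add(tf.keras.layers.Dense(n, activation="relu"))
    model.add(tf.keras.layers.Dropout(0.2))                     # guard against overfitting
    model.add(tf.keras.layers.Dense(2, activation="softmax"))   # vegetation vs. bare-Earth
    model.compile(optimizer=tf.keras.optimizers.Adam(),
                  loss="sparse_categorical_crossentropy",
                  metrics=["accuracy"])
    return model
```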
The models were trained using two input point clouds, each containing points belonging to a single class. One training point cloud contained only points visually identified as vegetation, and the other contained only points identified as bare-Earth. Both training point clouds were an aggregate of either vegetation or bare-Earth points from multiple dates of the same area spanning different growing seasons and years. For example, vegetation points were first manually clipped from no fewer than 10 different point clouds, and these clipped point cloud segments were then merged into one combined vegetation point cloud as an input for the machine learning models. This process was repeated for the bare-Earth points. Special care was taken to ensure that the vegetation and bare-Earth training point clouds included a range of lighting conditions, and, in the case of vegetation points, the training point cloud included different growing seasons and years, accounting for natural variability in vegetation growth caused by wet and dry seasons/years.
Class imbalance issues can significantly bias ML models toward over-represented classes and minimize or completely fail to learn under-represented class characteristics [61,62,63,64,65,66], a problem not unique to vegetation segmentation. Although the vegetation and bare-Earth training point clouds were different sizes (22,451,839 vegetation points and 102,539,815 bare-Earth points), class imbalance was addressed here by randomly down-sampling the bare-Earth points to equal the number of vegetation points prior to model training. A similar approach to balancing training data for point cloud segmentation was employed by [67]. The balanced training classes were then used to train and evaluate the MLP models.
Training data for the ML models were created by first manually clipping vegetation-only points from several point clouds in CloudCompare. Because this paper is focused on developing a model(s) robust to season, lighting conditions, and year, both sets of training point clouds (vegetation and bare-Earth) were composed of samples from different growing seasons and years across the 30+ photosets from [68]. All the clipped points representing vegetation were merged into a single vegetation training point cloud, and all clipped point clouds representing bare-Earth points were merged into a combined bare-Earth training point cloud.
Vegetation and bare-Earth point clouds were split 70–30 for model development and training, with 70% of the data used to train the model (i.e., “training data”) and the remaining 30% used to independently evaluate the model after training (i.e., “evaluation data”). Of the training data split, 70% was used directly for training and 30% was used to validate the model after each epoch (i.e., “validation data”). All sampling of training, validation, and evaluation sets was random. Since class imbalance was addressed prior to splitting the data, model overfitting to over-represented classes was mitigated and evaluation metrics were more reliable.
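A NumPy sketch of the balancing and splitting steps described above, where veg and bare are assumed to be (N, n_features) arrays of normalized inputs for the two training classes:

```python
import numpy as np

rng = np.random.default_rng(42)

# Down-sample bare-Earth points to match the number of vegetation points.
keep = rng.choice(len(bare), size=len(veg), replace=False)
X = np.vstack([veg, bare[keep]])
y = np.concatenate([np.ones(len(veg)),        # 1 = vegetation
                    np.zeros(len(veg))]       # 0 = bare-Earth
                   ).astype("int32")

# Random 70-30 split into training and evaluation sets.
order = rng.permutation(len(X))
cut = int(0.7 * len(X))
X_train, y_train = X[order[:cut]], y[order[:cut]]
X_eval, y_eval = X[order[cut:]], y[order[cut:]]
# The further 70-30 training/validation split can be delegated to Keras,
# e.g., model.fit(..., validation_split=0.3).
```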
The paper explored whether MLP models were more accurate when model inputs included one or more vegetation indices or geometric derivatives compared to when the only model inputs were RGB values. Models were broadly categorized into one of five categories based on their inputs:
  • RGB: These models only included RGB values as model inputs;
  • RGB_SIMPLE: These models included the RGB values as well as the ExR, ExG, ExB, and ExGR vegetation indices; these four indices were included because each one is relatively simple, abundant in the previously published literature, and efficient to calculate;
  • ALL: These models included RGB and all stable vegetation indices listed in Table 1;
  • SDRGB: These models included RGB and the 3D StDev computed using the X, Y, and Z coordinates of every point within a given radius;
  • XYZRGB: These models included RGB and the XYZ coordinate values for every point.

2.2.3. ML Model Evaluation

Model performance was recorded during the training process, and the final model was re-evaluated using the evaluation set of points withheld from training. The number of tunable parameters was compared across different ML model architectures and inputs to determine which models and inputs were most efficient to train and apply. Accuracy was displayed and recorded after every epoch during model training, and the final model was independently evaluated using the evaluation points, which were not used during model training or validation. The last reported accuracy during training was recorded as the model training accuracy (TrAcc), while the model evaluation accuracy (EvAcc) was computed using the Tensorflow model.evaluate() function with the evaluation data split. Unfortunately, independent ground truth observations were not made in the field, and it is not possible to evaluate the performance more rigorously against a field-collected set of points. However, given the very large number of evaluation points (~13,470,000 points), split evenly between bare-Earth and vegetation points, the EvAcc serves as a reasonable measure of model accuracy.
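Continuing the sketches above, TrAcc and EvAcc could be recorded as follows (the epoch count and batch size are illustrative, not the published settings):

```python
model = build_mlp(n_inputs=X_train.shape[1], nodes_per_layer=(16, 16))
history = model.fit(X_train, y_train, epochs=10, batch_size=1024,
                    validation_split=0.3)            # per-epoch validation data
tr_acc = history.history["accuracy"][-1]             # TrAcc: last training accuracy
eval_loss, ev_acc = model.evaluate(X_eval, y_eval)   # EvAcc on withheld points
```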

2.3. Case Study: Elwha Bluffs, Washington, USA

The ability of the MLP models to classify bare-Earth and vegetation points, and their performance, were tested on a section of bluffs near Port Angeles, WA, located along the Strait of Juan de Fuca just east of the Elwha River mouth (Figure 5). Although erosion has slowed with the growth of a fronting beach, large sections of the bluff still erode periodically in response to changing environmental conditions and storms. Previous research demonstrated the efficacy and accuracy of deriving high-accuracy point clouds from SfM with oblique photos [19]. Although the complete photogrammetry pipeline includes SfM for photo alignment and multiview stereo (MVS) for constructing the dense cloud, “SfM” will be used here to refer to the complete SfM-MVS data pipeline. ML model development and testing were done using an SfM-derived point cloud for 8 May 2020 that was aligned and filtered using the 4D approach with individual camera calibrations and differential GPS (Figure 5), because this approach [68] produced the most accurate point cloud compared to other approaches tested by [19].
SfM-derived point clouds are advantageous over LIDAR or TLS point clouds because the SfM photogrammetry process natively includes RGB values for every point. LIDAR and TLS natively record only the geometry and intensity of each return; the RGB colors often contained in LIDAR or TLS point clouds are added during post-processing.
The Elwha Bluffs dataset highlights the importance of being able to efficiently segment vegetation from bare-Earth points, since photos of these bluffs periodically show fresh slump faces and talus deposition at the base of the bluff (visible in Figure 5). Detecting micro-deformation or erosion of the bluff face is important for predicting bluff stability, and monitoring these fine-scale changes requires being able to distinguish true erosion from false positives, which may be caused by vegetation growth or senescence through the seasons. Vegetation growth over the summer or across years, especially near the bluff base, can appear as “positive” change from the bluff face, which may suggest the bluff has accreted, irrespective of the true change (erosion or deposition) in that area. Conversely, if vegetation points are included in a time-change analysis, vegetation senescence can introduce false positives for erosion. As the leaves and grasses die back in the fall and winter, subtracting a fall or winter point cloud from a summer one may falsely indicate that erosion is occurring.
While identifying bare-Earth-only points can be done manually on one or few datasets, manually segmenting these bare-Earth points is not scalable to point clouds covering large areas or those areas with a large time-series of point clouds. The manual segmentation approach is also more subjective, potentially varying by user and season, and increasingly time-consuming as the number of datasets continues to increase with ongoing photo surveys.

3. Results

Point cloud classification with the CANUPO classifier in CloudCompare had a reported accuracy of 88.7%. However, visual inspection of the reclassified point cloud (Figure 2) shows salt-and-pepper “noise” of points incorrectly classified as vegetation throughout the bluff face. In contrast, point clouds reclassified with MLP models exhibited less salt-and-pepper noise than was present in the CANUPO reclassified points. MLP-model reclassified points appeared visually consistent with areas of vegetation and bare-Earth points in the original point cloud (Figure 6), with little variability between the RGB, RGB_SIMPLE, and ALL models.
MLP models had variable EvAcc (Table A2; Figure 7), with an average of 91.2%. XYZRGB models were substantially less accurate than all other models, with an accuracy of only 50.0%, while the minimum EvAcc of any other model was 89.5% (RGB_8_8). The most accurate model was SDRGB_16_16_16 (EvAcc: 95.3%), although computing the standard deviation for every point took significantly longer than the model training process for either model using standard deviation. When MLP model architecture was held constant for both the number of layers and nodes per layer, the SDRGB model was only 1.3% more accurate than the RGB model.
For models where inputs were the same (e.g., all RGB, RGB_SIMPLE, or ALL models), simpler models generally performed well with significantly fewer tunable parameters than their more complex counterparts with more layers and more nodes per layer (Table A2). EvAcc improved slightly as the number of layers and nodes per layer increased but began to asymptote when more than 2–3 layers were included. For example, the RGB_16 model had an EvAcc of 92.1%, and the RGB_16_32 model had an EvAcc of 93.8%, only a 1.7% improvement. However, the most complex RGB model tested included six layers with nodes doubling every successive layer from 16 to 512 nodes (RGB_16_32_64_128_256_512) and had 176,161 tunable parameters with an EvAcc of 93.9%, only 0.1% more accurate than the much simpler RGB_16_32. In addition, the number of tunable parameters substantially increased from 81 (RGB_16) and 641 (RGB_16_32) to 176,161 (RGB_16_32_64_128_256_512). This relationship between EvAcc and tunable parameters also held among the RGB_SIMPLE and ALL models.
All segmented point clouds had some minor salt-and-pepper noise caused by the misclassification of some points. This is expected, because no model is 100% accurate, and it held regardless of the number of layers in the model (Figure 8). It is important to note that the point clouds did not show any systematic misclassification in either direction. Although Figure 8 only shows the results for RGB models, this relationship held for all model types.
When model architecture was held constant and only the model inputs varied, RGB models performed well compared to RGB_SIMPLE and ALL models (Table A2). The RGB_16 model only utilized the RGB values as inputs and had an EvAcc of 92.14% with 81 tunable parameters. This compared favorably to the ALL_16 model that included all vegetation indices as inputs and had an EvAcc of 93.53% with 241 tunable parameters. The number of tunable parameters increased with the number of inputs for all models. ALL_16 model EvAcc was only marginally better (~1.39%) than RGB_16 EvAcc.
Other models explored here held the number of nodes constant across all layers while the number of layers changed. For example, RGB_16_16 had two layers with 16 nodes per layer, and RGB_16_16_16 had three layers with 16 nodes per layer. Models with a constant number of nodes per layer had comparable EvAcc regardless of whether the model had two or three layers or 8 or 16 nodes per layer. Aside from the RGB_8_8 model, where EvAcc was only 89.5%, the EvAcc for these relatively simple models was similar for RGB, RGB_SIMPLE, and ALL models. Three-layer models with the same number of nodes per layer generally had a higher EvAcc than the equivalent two-layer models, although the improvement was less than 1% in most cases. No salt-and-pepper noise was visible in the bluff face for any of the 2- or 3-layer models with 8 or 16 nodes per layer (Figure 9).
Models with only RGB inputs had the fewest tunable parameters across comparable model architectures (Table A2; Figure 10), while models using RGB and all vegetation indices had the most tunable parameters. When the model inputs were held constant, increasing the number of layers and nodes per layer also increased the number of tunable parameters (Table A2; Figure 10). The model with the most parameters included RGB and all vegetation indices as inputs and had six layers starting with 16 nodes, doubling the number of nodes with each successive layer. Model EvAcc generally increased logarithmically with the number of tunable parameters until reaching a quasi-plateau at ~94% accuracy (Figure 7 and Figure 10).

4. Discussion

Results demonstrate that MLP models can efficiently and accurately segment vegetation and bare-Earth points in high-relief data, such as coastal cliffs and bluffs, with or without computing any vegetation indices. EvAcc sharply increased from very simple models and plateaued around ~94%, even as the number of tunable parameters continued to increase. Ref. [29] suggested that vegetation indices may be valuable in segmenting vegetation, although the comparison of MLP models with RGB against those with RGB_SIMPLE or ALL here suggests that vegetation indices may not substantially improve our ability to separate vegetation from bare-Earth points in coastal cliff point clouds. While vegetation indices may provide a marginal increase in accuracy for other ML approaches [29], the increase in the number of tunable parameters for MLP models with increasing complexity was disproportionate to the relatively minor improvement in EvAcc. For example, ALL_16 had 2.9 times more tunable parameters than RGB_16 but was only ~1.4% more accurate. In all models tested, the number of tunable parameters substantially outpaced the gain in EvAcc when vegetation indices were included as model inputs. This suggests that, although the RGB models may be slightly less accurate than models using vegetation indices, this reduction in accuracy was not significant and was outweighed by the more efficient training and deployment of the RGB models.
An advantage of using only RGB color values as model inputs is the reduction in data redundancy. Because vegetation indices were derived from RGB values, the information encoded in them is somewhat redundant when the derived indices are included with RGB. Furthermore, not having to compute one or more vegetation indices can reduce the amount of time and computing resources required for data pre-processing. Computing vegetation indices for every point in the training point clouds and for every point in the point cloud being reclassified can be computationally demanding, and these demands increase very rapidly as the size of the input point clouds increases since computing an additional vegetation index requires storing another set of index values equal to the size of the original input data. Employing a simpler model with only RGB inputs eliminates the need for extensive pre-processing, and reduces RAM and other hardware requirements, while also segmenting vegetation points from bare-Earth points with comparable EvAcc to a more complex model that includes one or more pre-computed vegetation indices.
The most accurate models tested used RGB values and a pointwise StDev, in line with previous work suggesting that local geometric derivatives may be useful in segmenting point clouds [50,52,69]. However, the SDRGB models were challenging to implement due to the high computational demands of computing StDev for every point in the point cloud(s). Similar to [50], cKDTrees were used to optimize the spatial query; however, performing this spatial query for a point and then computing the StDev for the nearby points was not efficient when repeated across very large and dense point clouds such as those used in this work. In this way, the current optimized solution to deriving spatial metrics was not scalable to a very dense and large point cloud dataset. Further research may optimize this spatial query and geometry computation process, although this is beyond the scope of the current paper. SDRGB models were not considered reasonable solutions to vegetation classification and segmentation because of the demanding pre-processing required to yield a model only ~2% more accurate than the RGB models.
Figure 7 and Figure 10, in conjunction with no discernible improvement in complex models over simpler ones, indicate diminishing returns with model complexity. When seeking the simplest model (i.e., fewest tunable parameters) that provides an acceptable accuracy, two general classes of models stand out: (1) models with 3 layers and 8 nodes per layer (RGB_8_8_8, RGB_SIMPLE_8_8_8, and ALL_8_8_8), and (2) models with 2 layers and 16 nodes per layer (RGB_16_16, RGB_SIMPLE_16_16, and ALL_16_16). Both classes of models had accuracies around 94% with fewer tunable parameters than their more complex counterparts, regardless of the different inputs. For example, RGB_16_32 had 641 tunable parameters and an EvAcc of 93.8%, whereas the simpler RGB_16_16 had 45% fewer tunable parameters (353 parameters) and a slightly greater EvAcc of 93.86%. These results suggest that the simpler 2- to 3-layer models with 8–16 nodes per layer can perform as well as or better than larger models with more tunable parameters and are, therefore, preferred.
Results demonstrate that ML models can efficiently and accurately segment vegetation and bare-Earth points while also offering the potential for one-shot transfer of information to new areas and datasets. With manual point cloud segmentation, one-shot knowledge transfer is not possible, limiting rapid assessment of coastal erosion. Leveraging patterns and information from one location and application and transferring this knowledge to new datasets and geographies enables agencies, communities, and researchers to better map and monitor ever-changing coastal cliffs and bluffs.
MLP models are less subjective than manual segmentation or other simpler approaches. Manual segmentation or the thresholding of specific vegetation indices can be biased towards different vegetation types and environments. In contrast, MLP models using only RGB were not affected by the subjective decision to add one or more vegetation indices. Using an RGB-only ML model eliminated the need to make this arbitrary decision between one index or another. Instead, the model uses the RGB values directly and does not require extra computations.
The models trained and presented here used colorized point clouds with red, green, and blue channels only. Not having access to additional reflectance bands, such as near-infrared, limits both the accuracy with which vegetation can be segmented from bare-Earth points and the ability to segment different types of vegetation. The utility of NDVI or similar multispectral vegetation indices in vegetation segmentation and identification has been demonstrated [46,70], although [71] suggested that RGB models may be tuned to perform closely to models with additional spectral channels. In addition, near-infrared and other additional channels are often not available for point cloud data; thus, developing ML methods that use simple RGB data increases the amount of data available for coastal monitoring.

5. Conclusions

Monitoring high-relief coastal cliffs and bluffs for deformation or small-scale erosion is important for understanding and predicting future cliff erosion events, although our ability to detect such changes hinges on distinguishing false positive changes from real change. Segmenting coastal cliff and bluff point clouds into vegetation and bare-Earth points can help provide insight into these changes by reducing “noise” in the change results. Automated point cloud segmentation using MLP ML models is feasible and has several advantages over existing approaches and vegetation indices. Leveraging MLP models for point cloud segmentation is more efficient and scalable than a manual approach while also being less subjective. Results demonstrate that models using only RGB color values as inputs performed on par with more complex models with one or more vegetation indices, and simpler models with 2–3 layers and a constant number of nodes per layer (8 or 16 nodes) outperformed more complex models with 1–6 layers but an increasing number of nodes per layer. Simpler models with only RGB inputs can effectively segment vegetation and bare-Earth points in large coastal cliff and bluff point clouds. Segmenting and filtering vegetation from coastal cliff point clouds can help identify areas of true geomorphic change and improve coastal monitoring efficiency by eliminating the need for manual data editing.

Funding

This research received no external funding.

Data Availability Statement

All Python code is published and available at https://doi.org/10.5281/zenodo.10966854, accessed on 1 May 2021. Data is published and available at Dryad (https://doi.org/10.5061/dryad.8pk0p2nww), accessed on 1 May 2021.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A

Table A1. MLP model architectures tested for vegetation segmentation of point cloud data.
Model Name | Inputs | Number of Nodes per Dense Layer
rgb_8 | RGB | 8
rgb_8_8 | RGB | 8, 8
rgb_8_8_8 | RGB | 8, 8, 8
rgb_16 | RGB | 16
rgb_16_16 | RGB | 16, 16
rgb_16_16_16 | RGB | 16, 16, 16
rgb_16_32 | RGB | 16, 32
rgb_16_32_64 | RGB | 16, 32, 64
rgb_16_32_64_128 | RGB | 16, 32, 64, 128
rgb_16_32_64_128_256 | RGB | 16, 32, 64, 128, 256
rgb_16_32_64_128_256_512 | RGB | 16, 32, 64, 128, 256, 512
rgb_simple_8 | RGB, ExR, ExG, ExB, ExGR | 8
rgb_simple_8_8 | RGB, ExR, ExG, ExB, ExGR | 8, 8
rgb_simple_8_8_8 | RGB, ExR, ExG, ExB, ExGR | 8, 8, 8
rgb_simple_16 | RGB, ExR, ExG, ExB, ExGR | 16
rgb_simple_16_16 | RGB, ExR, ExG, ExB, ExGR | 16, 16
rgb_simple_16_16_16 | RGB, ExR, ExG, ExB, ExGR | 16, 16, 16
rgb_simple_16_32 | RGB, ExR, ExG, ExB, ExGR | 16, 32
rgb_simple_16_32_64 | RGB, ExR, ExG, ExB, ExGR | 16, 32, 64
rgb_simple_16_32_64_128 | RGB, ExR, ExG, ExB, ExGR | 16, 32, 64, 128
rgb_simple_16_32_64_128_256 | RGB, ExR, ExG, ExB, ExGR | 16, 32, 64, 128, 256
rgb_simple_16_32_64_128_256_512 | RGB, ExR, ExG, ExB, ExGR | 16, 32, 64, 128, 256, 512
all_8 | RGB, all vegetation indices | 8
all_8_8 | RGB, all vegetation indices | 8, 8
all_8_8_8 | RGB, all vegetation indices | 8, 8, 8
all_16 | RGB, all vegetation indices | 16
all_16_16 | RGB, all vegetation indices | 16, 16
all_16_16_16 | RGB, all vegetation indices | 16, 16, 16
all_16_32 | RGB, all vegetation indices | 16, 32
all_16_32_64 | RGB, all vegetation indices | 16, 32, 64
all_16_32_64_128 | RGB, all vegetation indices | 16, 32, 64, 128
all_16_32_64_128_256 | RGB, all vegetation indices | 16, 32, 64, 128, 256
all_16_32_64_128_256_512 | RGB, all vegetation indices | 16, 32, 64, 128, 256, 512
sdrgb_8_8_8 | RGB, SD | 8, 8, 8
sdrgb_16_16_16 | RGB, SD | 16, 16, 16
xyzrgb_8_8_8 | RGB, XYZ | 8, 8, 8
xyzrgb_16_16_16 | RGB, XYZ | 16, 16, 16
Table A2. MLP model architecture, training information (training epochs and time), and accuracies (TrAcc and EvAcc) for all vegetation classification and segmentation models tested.
Model | Layers | Tunable Parameters | Training Epochs | Training Time (s) | TrAcc | EvAcc
rgb_16 | 1 | 81 | 7 | 545 | 91.5% | 92.1%
rgb_16_32 | 2 | 641 | 11 | 884 | 93.9% | 93.8%
rgb_16_32_64 | 3 | 2785 | 7 | 576 | 94.0% | 93.9%
rgb_16_32_64_128 | 4 | 11,169 | 7 | 592 | 94.0% | 93.7%
rgb_16_32_64_128_256 | 5 | 44,321 | 7 | 592 | 94.0% | 93.9%
rgb_16_32_64_128_256_512 | 6 | 176,161 | 7 | 621 | 94.0% | 93.9%
rgb_simple_16 | 1 | 145 | 7 | 741 | 91.6% | 92.3%
rgb_simple_16_32 | 2 | 705 | 7 | 700 | 94.0% | 94.0%
rgb_simple_16_32_64 | 3 | 2849 | 7 | 749 | 94.0% | 94.0%
rgb_simple_16_32_64_128 | 4 | 11,233 | 10 | 1075 | 94.1% | 94.1%
rgb_simple_16_32_64_128_256 | 5 | 44,385 | 9 | 1009 | 94.1% | 94.1%
rgb_simple_16_32_64_128_256_512 | 6 | 176,225 | 11 | 1314 | 94.1% | 94.1%
all_16 | 1 | 241 | 11 | 1648 | 93.1% | 93.5%
all_16_32 | 2 | 801 | 7 | 1013 | 94.0% | 94.0%
all_16_32_64 | 3 | 2945 | 7 | 1061 | 94.0% | 94.0%
all_16_32_64_128 | 4 | 11,329 | 10 | 1505 | 94.1% | 94.1%
all_16_32_64_128_256 | 5 | 44,481 | 13 | 2047 | 94.2% | 94.2%
all_16_32_64_128_256_512 | 6 | 176,321 | 9 | 1455 | 94.2% | 94.2%
rgb_16_16 | 2 | 353 | 8 | 624 | 93.5% | 93.9%
rgb_simple_16_16 | 2 | 417 | 7 | 724 | 93.9% | 94.0%
all_16_16 | 2 | 513 | 7 | 1033 | 94.0% | 94.0%
rgb_16_16_16 | 3 | 625 | 7 | 513 | 93.9% | 93.9%
rgb_simple_16_16_16 | 3 | 689 | 7 | 742 | 94.0% | 94.0%
all_16_16_16 | 3 | 785 | 7 | 1018 | 94.0% | 94.0%
rgb_8_8 | 2 | 113 | 11 | 850 | 85.7% | 89.5%
rgb_simple_8_8 | 2 | 145 | 11 | 1154 | 92.6% | 93.9%
all_8_8 | 2 | 193 | 7 | 1037 | 92.6% | 93.9%
rgb_8_8_8 | 3 | 185 | 7 | 536 | 93.6% | 93.8%
rgb_simple_8_8_8 | 3 | 217 | 7 | 733 | 93.9% | 94.0%
all_8_8_8 | 3 | 265 | 7 | 1011 | 93.6% | 94.0%
xyzrgb_8_8_8 | 3 | 209 | 6 | 589 | 50.0% | 50.0%
xyzrgb_16_16_16 | 3 | 673 | 6 | 585 | 50.0% | 50.0%
sdrgb_8_8_8 | 3 | 193 | 10 | 889 | 95.1% | 95.3%
sdrgb_16_16_16 | 3 | 641 | 11 | 967 | 95.3% | 95.3%

References

  1. Limber, P.W.; Barnard, P.L.; Vitousek, S.; Erikson, L.H. A Model Ensemble for Projecting Multidecadal Coastal Cliff Retreat During the 21st Century. J. Geophys. Res. Earth Surf. 2018, 123, 1566–1589. [Google Scholar] [CrossRef]
  2. Wernette, P.; Houser, C. Short Communication: Evidence for Geologic Control of Rip Channels along Prince Edward Island, Canada. Phys. Geogr. 2022, 43, 145–162. [Google Scholar] [CrossRef]
  3. George, E.; Lunardi, B.; Smith, A.; Lehner, J.; Wernette, P.; Houser, C. Short Communication: Storm Impact and Recovery of a Beach-Dune System in Prince Edward Island. Geomorphology 2021, 384, 107721. [Google Scholar] [CrossRef]
  4. Wernette, P.; Shortridge, A.; Lusch, D.P.; Arbogast, A.F. Accounting for Positional Uncertainty in Historical Shoreline Change Analysis without Ground Reference Information. Int. J. Remote Sens. 2017, 38, 3906–3922. [Google Scholar] [CrossRef]
  5. Grottoli, E.; Biausque, M.; Rogers, D.; Jackson, D.W.T.; Cooper, J.A.G. Structure-from-Motion-Derived Digital Surface Models from Historical Aerial Photographs: A New 3D Application for Coastal Dune Monitoring. Remote Sens. 2020, 13, 95. [Google Scholar] [CrossRef]
  6. Wernette, P.; Houser, C.; Lehner, J.; Evans, A.; Weymer, B. Investigating the Impact of Hurricane Harvey and Driving on Beach-Dune Morphology. Geomorphology 2020, 358, 107119. [Google Scholar] [CrossRef]
  7. Houser, C.; Bishop, M.; Wernette, P. Short Communication: Multi-Scale Topographic Anisotropy Patterns on a Barrier Island. Geomorphology 2017, 297, 153–158. [Google Scholar] [CrossRef]
  8. Sherwood, C.R.; Ritchie, A.C.; Over, J.R.; Kranenburg, C.J.; Warrick, J.A.; Brown, J.A.; Wright, C.W.; Aretxabaleta, A.L.; Zeigler, S.L.; Wernette, P.A.; et al. Sound-Side Inundation and Seaward Erosion of a Barrier Island During Hurricane Landfall. JGR Earth Surf. 2023, 128, e2022JF006934. [Google Scholar] [CrossRef]
  9. Guisado-Pintado, E.; Jackson, D.W.T.; Rogers, D. 3D Mapping Efficacy of a Drone and Terrestrial Laser Scanner over a Temperate Beach-Dune Zone. Geomorphology 2019, 328, 157–172. [Google Scholar] [CrossRef]
  10. Sturdivant, E.; Lentz, E.; Thieler, E.R.; Farris, A.; Weber, K.; Remsen, D.; Miner, S.; Henderson, R. UAS-SfM for Coastal Research: Geomorphic Feature Extraction and Land Cover Classification from High-Resolution Elevation and Optical Imagery. Remote Sens. 2017, 9, 1020. [Google Scholar] [CrossRef]
  11. Di Paola, G.; Minervino Amodio, A.; Dilauro, G.; Rodriguez, G.; Rosskopf, C.M. Shoreline Evolution and Erosion Vulnerability Assessment along the Central Adriatic Coast with the Contribution of UAV Beach Monitoring. Geosciences 2022, 12, 353. [Google Scholar] [CrossRef]
  12. Anderson, S.W. Uncertainty in Quantitative Analyses of Topographic Change: Error Propagation and the Role of Thresholding. Earth Surf. Process. Landf. 2019, 44, 1015–1033. [Google Scholar] [CrossRef]
  13. Young, A.P. Decadal-Scale Coastal Cliff Retreat in Southern and Central California. Geomorphology 2018, 300, 164–175. [Google Scholar] [CrossRef]
  14. Alessio, P.; Keller, E.A. Short-Term Patterns and Processes of Coastal Cliff Erosion in Santa Barbara, California. Geomorphology 2020, 353, 106994. [Google Scholar] [CrossRef]
  15. Hayakawa, Y.S.; Obanawa, H. Volumetric Change Detection in Bedrock Coastal Cliffs Using Terrestrial Laser Scanning and UAS-Based SfM. Sensors 2020, 20, 3403. [Google Scholar] [CrossRef]
  16. Kuhn, D.; Prufer, S. Coastal Cliff Monitoring and Analysis of Mass Wasting Processes with the Application of Terrestrial Laser Scanning: A Case Study of Rugen, Germany. Geomorphology 2014, 213, 153–165. [Google Scholar] [CrossRef]
  17. Young, A.P.; Guza, R.T.; Matsumoto, H.; Merrifield, M.A.; O’Reilly, W.C.; Swirad, Z.M. Three Years of Weekly Observations of Coastal Cliff Erosion by Waves and Rainfall. Geomorphology 2021, 375, 107545. [Google Scholar] [CrossRef]
  18. Young, A.P.; Guza, R.T.; O’Reilly, W.C.; Flick, R.E.; Gutierrez, R. Short-Term Retreat Statistics of a Slowly Eroding Coastal Cliff. Nat. Hazards Earth Syst. Sci. 2011, 11, 205–217. [Google Scholar] [CrossRef]
  19. Wernette, P.; Miller, I.M.; Ritchie, A.W.; Warrick, J.A. Crowd-Sourced SfM: Best Practices for High Resolution Monitoring of Coastal Cliffs and Bluffs. Cont. Shelf Res. 2022, 245, 104799. [Google Scholar] [CrossRef]
  20. Warrick, J.A.; Ritchie, A.C.; Schmidt, K.M.; Reid, M.E.; Logan, J. Characterizing the Catastrophic 2017 Mud Creek Landslide, California, Using Repeat Structure-from-Motion (SfM) Photogrammetry. Landslides 2019, 16, 1201–1219. [Google Scholar] [CrossRef]
  21. Weidner, L.; van Veen, M.; Lato, M.; Walton, G. An Algorithm for Measuring Landslide Deformation in Terrestrial Lidar Point Clouds Using Trees. Landslides 2021, 18, 3547–3558. [Google Scholar] [CrossRef]
  22. Kogure, T. Rocky Coastal Cliffs Reinforced by Vegetation Roots and Potential Collapse Risk Caused by Sea-Level Rise. Catena 2022, 217, 106457. [Google Scholar] [CrossRef]
  23. Agisoft LLC Agisoft Metashape 1.8.5—Professional Edition 2020. Available online: https://www.agisoft.com/ (accessed on 1 May 2020).
  24. Brodu, N.; Lague, D. 3D Terrestrial Lidar Data Classification of Complex Natural Scenes Using a Multi-Scale Dimensionality Criterion: Applications in Geomorphology. ISPRS J. Photogramm. Remote Sens. 2012, 68, 121–134. [Google Scholar] [CrossRef]
  25. Weidner, L.; Walton, G.; Kromer, R. Classification Methods for Point Clouds in Rock Slope Monitoring: A Novel Machine Learning Approach and Comparative Analysis. Eng. Geol. 2019, 263, 105326. [Google Scholar] [CrossRef]
  26. CloudCompare 2019. Available online: https://www.danielgm.net/cc/ (accessed on 1 May 2020).
  27. Buscombe, D. Doodler—A Web Application Built with Plotly/Dash for Image Segmentation with Minimal Supervision; U.S. Geological Survey Software Release: Reston, VA, USA, 2022. [Google Scholar]
  28. Buscombe, D.; Goldstein, E.B.; Sherwood, C.R.; Bodine, C.; Brown, J.A.; Favela, J.; Fitzpatrick, S.; Kranenburg, C.J.; Over, J.R.; Ritchie, A.C.; et al. Human-in-the-Loop Segmentation of Earth Surface Imagery. Earth Space Sci. 2022. [Google Scholar] [CrossRef]
  29. Anders, N.; Valente, J.; Masselink, R.; Keesstra, S. Comparing Filtering Techniques for Removing Vegetation from UAV-Based Photogrammetric Point Clouds. Drones 2019, 3, 61. [Google Scholar] [CrossRef]
  30. Chen, Y.; Wu, R.; Yang, C.; Lin, Y. Urban Vegetation Segmentation Using Terrestrial LiDAR Point Clouds Based on Point Non-Local Means Network. Int. J. Appl. Earth Obs. Geoinf. 2021, 105, 102580. [Google Scholar] [CrossRef]
  31. Mesas-Carrascosa, F.-J.; de Castro, A.I.; Torres-Sánchez, J.; Triviño-Tarradas, P.; Jiménez-Brenes, F.M.; García-Ferrer, A.; López-Granados, F. Classification of 3D Point Clouds Using Color Vegetation Indices for Precision Viticulture and Digitizing Applications. Remote Sens. 2020, 12, 317. [Google Scholar] [CrossRef]
  32. Handcock, R.N.; Gobbett, D.L.; González, L.A.; Bishop-Hurley, G.J.; McGavin, S.L. A Pilot Project Combining Multispectral Proximal Sensors and Digital Camerasfor Monitoring Tropical Pastures. Biogeosciences 2016, 13, 4673–4695. [Google Scholar] [CrossRef]
  33. Kawashima, S.; Nakatani, M. An Algorithm for Estimating Chlorophyll Content in Leaves Using a Video Camera. Ann. Bot. 1998, 81, 49–54. [Google Scholar] [CrossRef]
  34. Lu, J.; Eitel, J.U.H.; Engels, M.; Zhu, J.; Ma, Y.; Liao, F.; Zheng, H.; Wang, X.; Yao, X.; Cheng, T.; et al. Improving Unmanned Aerial Vehicle (UAV) Remote Sensing of Rice Plant Potassium Accumulation by Fusing Spectral and Textural Information. Int. J. Appl. Earth Obs. Geoinf. 2021, 104, 102592. [Google Scholar] [CrossRef]
  35. Meyer, G.E.; Hindman, T.W.; Laksmi, K. Machine Vision Detection Parameters for Plant Species Identification. In Proceedings of the SPIE; Meyer, G.E., DeShazer, J.A., Eds.; SPIE: Boston, MA, USA, 1998; Volume 3543, pp. 327–335. [Google Scholar]
  36. Meyer, G.E.; Neto, J.C. Verification of Color Vegetation Indices for Automated Crop Imaging Applications. Comput. Electron. Agric. 2008, 63, 282–293. [Google Scholar] [CrossRef]
  37. Neto, J.C. A Combined Statistical-Soft Computing Approach for Classification and Mapping Weed Species in Minimum—Tillage Systems; University of Nebraska: Lincoln, NE, USA, 2004. [Google Scholar]
  38. Wan, L.; Li, Y.; Cen, H.; Zhu, J.; Yin, W.; Wu, W.; Zhu, H.; Sun, D.; Zhou, W.; He, Y. Combining UAV-Based Vegetation Indices and Image Classification to Estimate Flower Number in Oilseed Rape. Remote Sens. 2018, 10, 1484. [Google Scholar] [CrossRef]
  39. Mao, W.; Wang, Y.; Wang, Y. Real-Time Detection of Between-Row Weeds Using Machine Vision. In Proceedings of the 2003 American Society of Agricultural and Biological Engineers, Las Vegas, NV, USA, 27–30 July 2003; American Society of Agricultural and Biological Engineers: St. Josepth, MI, USA, 2003. [Google Scholar]
  40. Woebbecke, D.M.; Meyer, G.E.; Von Bargen, K.; Mortensen, D.A. Color Indices for Weed Identification Under Various Soil, Residue, and Lighting Conditions. Trans. ASAE 1995, 38, 259–269. [Google Scholar] [CrossRef]
  41. Woebbecke, D.M.; Meyer, G.E.; Von Bargen, K.; Mortensen, D.A. Plant Species Identification, Size, and Enumeration Using Machine Vision Techniques on near-Binary Images. In Proceedings of the SPIE: The International Society for Optical Engineering; DeShazer, J.A., Meyer, G.E., Eds.; SPIE: Boston, MA, USA, 1993; pp. 208–219. [Google Scholar]
  42. Yang, M.-D.; Tseng, H.-H.; Hsu, Y.-C.; Tsai, H.P. Semantic Segmentation Using Deep Learning with Vegetation Indices for Rice Lodging Identification in Multi-Date UAV Visible Images. Remote Sens. 2020, 12, 633. [Google Scholar] [CrossRef]
  43. Yang, W.; Wang, S.; Zhao, X.; Zhang, J.; Feng, J. Greenness Identification Based on HSV Decision Tree. Inf. Process. Agric. 2015, 2, 149–160. [Google Scholar] [CrossRef]
  44. Ventura, D.; Napoleone, F.; Cannucci, S.; Alleaume, S.; Valentini, E.; Casoli, E.; Burrascano, S. Integrating Low-Altitude Drone Based-Imagery and OBIA for Mapping and Manage Semi Natural Grassland Habitats. J. Environ. Manag. 2022, 321, 115723. [Google Scholar] [CrossRef]
  45. Hunt, E.R.; Cavigelli, M.; Daughtry, C.S.T.; Mcmurtrey, J.E.; Walthall, C.L. Evaluation of Digital Photography from Model Aircraft for Remote Sensing of Crop Biomass and Nitrogen Status. Precis. Agric. 2005, 6, 359–378. [Google Scholar] [CrossRef]
  46. Bendig, J.; Yu, K.; Aasen, H.; Bolten, A.; Bennertz, S.; Broscheit, J.; Gnyp, M.L.; Bareth, G. Combining UAV-Based Plant Height from Crop Surface Models, Visible, and near Infrared Vegetation Indices for Biomass Monitoring in Barley. Int. J. Appl. Earth Obs. Geoinf. 2015, 39, 79–87. [Google Scholar] [CrossRef]
  47. Louhaichi, M.; Borman, M.M.; Johnson, D.E. Spatially Located Platform and Aerial Photography for Documentation of Grazing Impacts on Wheat. Geocarto Int. 2001, 16, 65–70. [Google Scholar] [CrossRef]
  48. Otsu, N. A Threshold Selection Method from Gray-Level Histograms. IEEE Trans. Syst. Man Cybern. 1979, 9, 62–66. [Google Scholar] [CrossRef]
  49. Torres-Sánchez, J.; Peña, J.M.; de Castro, A.I.; López-Granados, F. Multi-Temporal Mapping of the Vegetation Fraction in Early-Season Wheat Fields Using Images from UAV. Comput. Electron. Agric. 2014, 103, 104–113. [Google Scholar] [CrossRef]
  50. Weinmann, M.; Jutzi, B.; Mallet, C.; Weinmann, M. Geometric Features and Their Relevance for 3D Point Cloud Classification. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2017, IV-1/W1, 157–164. [Google Scholar] [CrossRef]
  51. Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef]
  52. Weinmann, M.; Jutzi, B.; Hinz, S.; Mallet, C. Semantic Point Cloud Interpretation Based on Optimal Neighborhoods, Relevant Features and Efficient Classifiers. ISPRS J. Photogramm. Remote Sens. 2015, 105, 286–304. [Google Scholar] [CrossRef]
  53. Alifu, H.; Vuillaume, J.-F.; Johnson, B.A.; Hirabayashi, Y. Machine-Learning Classification of Debris-Covered Glaciers Using a Combination of Sentinel-1/-2 (SAR/Optical), Landsat 8 (Thermal) and Digital Elevation Data. Geomorphology 2020, 369, 107365. [Google Scholar] [CrossRef]
  54. Çelik, O.İ.; Gazioğlu, C. Coast Type Based Accuracy Assessment for Coastline Extraction from Satellite Image with Machine Learning Classifiers. Egypt. J. Remote Sens. Space Sci. 2022, 25, 289–299. [Google Scholar] [CrossRef]
  55. Yang, Z.; Xu, C.; Li, L. Landslide Detection Based on ResU-Net with Transformer and CBAM Embedded: Two Examples with Geologically Different Environments. Remote Sens. 2022, 14, 2885. [Google Scholar] [CrossRef]
  56. Kestur, R.; Meenavathi, M.B. Vegetation Mapping of a Tomato Crop Using Multilayer Perceptron (MLP) Neural Network in Images Acquired by Remote Sensing from a UAV. IJCA 2018, 182, 13–17. [Google Scholar] [CrossRef]
  57. Abadi, M.; Agarwal, A.; Barham, P.; Brevdo, E.; Chen, Z.; Citro, C.; Corrado, G.S.; Davis, A.; Dean, J.; Devin, M.; et al. TensorFlow: Large-Scale Machine Learning on Heterogeneous Systems 2015. Available online: https://www.tensorflow.org/ (accessed on 1 May 2020).
  58. Chollet, F. Others Keras 2015. Available online: https://keras.io/ (accessed on 1 May 2020).
59. Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2014, arXiv:1412.6980.
60. Ruder, S. An Overview of Gradient Descent Optimization Algorithms. arXiv 2016, arXiv:1609.04747.
61. Geisz, J.K.; Wernette, P.A.; Esselman, P.C. Classification of Lakebed Geologic Substrate in Autonomously Collected Benthic Imagery Using Machine Learning. Remote Sens. 2024, 16, 1264.
62. Geisz, J.K.; Wernette, P.A.; Esselman, P.C.; Morris, J.M. Autonomously Collected Benthic Imagery for Substrate Prediction, Lake Michigan 2020–2021. U.S. Geol. Surv. Data Release 2024.
63. Gómez-Ríos, A.; Tabik, S.; Luengo, J.; Shihavuddin, A.; Krawczyk, B.; Herrera, F. Towards Highly Accurate Coral Texture Images Classification Using Deep Convolutional Neural Networks and Data Augmentation. Expert Syst. Appl. 2019, 118, 315–328.
64. Raphael, A.; Dubinsky, Z.; Iluz, D.; Netanyahu, N.S. Neural Network Recognition of Marine Benthos and Corals. Diversity 2020, 12, 29.
65. Shihavuddin, A.S.M.; Gracias, N.; Garcia, R.; Gleason, A.; Gintert, B. Image-Based Coral Reef Classification and Thematic Mapping. Remote Sens. 2013, 5, 1809–1841.
66. Stokes, M.D.; Deane, G.B. Automated Processing of Coral Reef Benthic Images: Coral Reef Benthic Imaging. Limnol. Oceanogr. Methods 2009, 7, 157–168.
67. Chen, R.; Wu, J.; Luo, Y.; Xu, G. PointMM: Point Cloud Semantic Segmentation CNN under Multi-Spatial Feature Encoding and Multi-Head Attention Pooling. Remote Sens. 2024, 16, 1246.
68. Wernette, P. Coastal Bluff Point Clouds Derived from SfM near Elwha River Mouth, Washington from 18 April 2016 to 8 May 2020. Dryad 2020.
69. Weinmann, M.; Jutzi, B.; Mallet, C. Semantic 3D Scene Interpretation: A Framework Combining Optimal Neighborhood Size Selection with Relevant Features. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2014, II-3, 181–188.
70. Gitelson, A.A.; Kaufman, Y.J.; Stark, R.; Rundquist, D. Novel Algorithms for Remote Estimation of Vegetation Fraction. Remote Sens. Environ. 2002, 80, 76–87.
71. Ashapure, A.; Jung, J.; Chang, A.; Oh, S.; Maeda, M.; Landivar, J. A Comparative Study of RGB and Multispectral Sensor-Based Cotton Canopy Cover Modelling Using Multi-Temporal UAS Data. Remote Sens. 2019, 11, 2757.
Figure 1. Oblique view of an SfM point cloud shows vegetation overhanging the bluff face and obscuring the bare-Earth bluff face, as well as several fallen tree trunks.
Figure 2. (a) An example SfM-derived point cloud reclassified using (b) Agisoft Metashape Professional 1.8.5 and (c) the CANUPO plug-in for CloudCompare.
Figure 3. Vegetation indices can be computed from RGB bands to segment vegetation from non-vegetation pixels in an individual image. Examples computed from a raw RGB image (a) include (b) ExB, (c) ExG, (d) NGRDI, (e) RGBVI, and (f) GLI.
Figure 4. MLP model architectures tested in this paper followed a similar structure. All input data were linearly normalized to range from 0 to 1. Some models included one or more additional vegetation indices derived from the normalized RGB values (dashed line on the right), and others included a computed 3D standard deviation value for every point as a model input (dashed line on the left). All inputs were concatenated in an input layer and then fed into one or more dense, fully connected layers, where the number of layers and nodes per layer varied (represented by the *) to test model architectures. A dropout layer with probability 0.2 followed these dense layer(s) and preceded the final dense layer, which had 2 nodes corresponding to the number of classes in the model. The final layer was an activation layer that condensed the probabilities from the 2-node dense layer into a single class label.
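A minimal Keras [58] sketch of this family of architectures is shown below. Only the layer structure, the dropout probability, and the use of the Adam optimizer [59] come from the paper; the ReLU activations, the softmax head, and the loss function are illustrative assumptions.

```python
import tensorflow as tf
from tensorflow.keras import Model, layers

def build_mlp(n_extra: int = 0, n_layers: int = 2, n_nodes: int = 16) -> Model:
    """Sketch of the Figure 4 MLP: normalized RGB plus optional extra inputs
    (vegetation indices and/or a per-point 3D standard deviation)."""
    rgb = layers.Input(shape=(3,), name="rgb")  # RGB normalized to [0, 1]
    inputs = [rgb]
    if n_extra > 0:
        inputs.append(layers.Input(shape=(n_extra,), name="extra"))
    x = layers.Concatenate()(inputs) if len(inputs) > 1 else rgb
    for _ in range(n_layers):  # the '*' in Figure 4: varied layers and nodes
        x = layers.Dense(n_nodes, activation="relu")(x)  # activation assumed
    x = layers.Dropout(0.2)(x)      # dropout probability from Figure 4
    x = layers.Dense(2)(x)          # 2 classes: vegetation, bare cliff face
    out = layers.Softmax()(x)       # softmax assumed; argmax gives the label
    model = Model(inputs=inputs, outputs=out)
    model.compile(optimizer="adam",  # Adam optimizer [59]
                  loss="sparse_categorical_crossentropy",  # assumed loss
                  metrics=["accuracy"])
    return model
```

For example, build_mlp(n_extra=0, n_layers=1, n_nodes=16) approximates the simplest RGB-only models tested, and the predicted class label is the argmax over the softmax output at inference.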
Figure 5. The Elwha Bluffs are located east of the Elwha River Delta along the Strait of Juan de Fuca coast of Washington (modified from [15]). An SfM-derived point cloud from 8 May 2020 shows vegetation obscuring the bare-Earth bluff face.
Figure 6. Coastal bluff point cloud for 8 May 2020 reclassified using MLP models with one layer of 16 nodes but different inputs.
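One of the inputs compared in Figure 6 is the per-point 3D standard deviation described in Figure 4. The SciPy [51] sketch below shows one way such a neighborhood feature could be computed; the search radius and the reduction of the three per-axis standard deviations to a single scalar are assumptions, not the paper's exact procedure.

```python
import numpy as np
from scipy.spatial import cKDTree  # SciPy [51]

def point_std3d(xyz: np.ndarray, radius: float = 0.25) -> np.ndarray:
    """Per-point 3D standard deviation: for each point, the standard
    deviation of neighbor coordinates within `radius`, averaged over
    the x, y, and z axes to give one scalar per point."""
    tree = cKDTree(xyz)
    neighborhoods = tree.query_ball_point(xyz, r=radius)
    std3d = np.zeros(len(xyz))
    for i, idx in enumerate(neighborhoods):
        if len(idx) > 1:  # single-point neighborhoods keep std = 0
            std3d[i] = xyz[idx].std(axis=0).mean()
    return std3d
```

Vegetation points would generally be expected to show higher local 3D variability than the smoother bare bluff face, which is why such a geometric feature can complement color inputs.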
Figure 7. Accuracy varied by model complexity and size, as measured by the number of tunable parameters.
Figure 8. Point cloud from 8 May 2020 reclassified with different MLP models where the number of nodes per layer and number of layers increased from 1 layer with 16 nodes to 6 layers with 16 to 512 nodes.
Figure 9. Point cloud from 8 May 2020 reclassified with different MLP models with relatively simple architectures of 2 layers with 8 nodes each or 3 layers of 16 nodes each.
Figure 10. Accuracy varied by the number of tunable parameters.
Table 1. Vegetation indices with well-constrained upper and lower value bounds explored in this research, where R, G, and B are the spectral values for the red, green, and blue channels, respectively.

Vegetation Index | Formula | Value Range (Lower, Upper) | Source
Excess Red (ExR) | 1.4R − G | (−1, 1.4) | [36]
Excess Green (ExG) | 2G − R − B | (−1, 2) | [35,40]
Excess Blue (ExB) | 1.4B − G | (−1, 1.4) | [39]
Excess Green Minus Excess Red (ExGR) | ExG − ExR | (−2.4, 3) | [37]
Normal Green-Red Difference Index (NGRDI) | (G − R)/(G + R) | (−1, 1) | [45]
Modified Green Red Vegetation Index (MGRVI) | (G² − R²)/(G² + R²) | (−1, 1) | [46]
Green Leaf Index (GLI) | (2G − R − B)/(2G + R + B) | (−1, 1) | [47]
Red Green Blue Vegetation Index (RGBVI) | (G² − B·R)/(G² + B·R) | (−1, 1) | [46]
Kawashima Index (IKAW) | (R − B)/(R + B) | (−1, 1) | [33]
Green Leaf Algorithm (GLA) | ((G − R) + (G − B))/((G + R) + (G + B)) | (−1, 1) | [47]
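As a minimal illustration of how the indices in Table 1 can be computed, the sketch below evaluates each formula with NumPy on red, green, and blue arrays already normalized to [0, 1]; the function name and the small epsilon guarding against division by zero are illustrative additions, not part of the paper.

```python
import numpy as np

def vegetation_indices(r, g, b, eps=1e-9):
    """Compute the Table 1 RGB vegetation indices for bands in [0, 1]."""
    exg = 2 * g - r - b    # Excess Green
    exr = 1.4 * r - g      # Excess Red
    return {
        "ExR": exr,
        "ExG": exg,
        "ExB": 1.4 * b - g,    # Excess Blue
        "ExGR": exg - exr,     # Excess Green minus Excess Red
        "NGRDI": (g - r) / (g + r + eps),
        "MGRVI": (g**2 - r**2) / (g**2 + r**2 + eps),
        "GLI": (2 * g - r - b) / (2 * g + r + b + eps),
        "RGBVI": (g**2 - b * r) / (g**2 + b * r + eps),
        "IKAW": (r - b) / (r + b + eps),
        "GLA": ((g - r) + (g - b)) / ((g + r) + (g + b) + eps),
    }
```

Each index can then be thresholded, for example with Otsu's method [48], to split vegetation from bare cliff face points or pixels.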