1. Introduction
Coastal and surface waters are vital environmental resources in ecological systems and for human activities such as agriculture, industrial production, public health and safety, tourism, and human life settlement. Although there is no strict definition to the concept of a small water body, it has been used in the literature to consider small lakes, ponds, low-order streams, ditches, and springs [
1]. Small water bodies play a fundamental role in providing habitats for animals and plants, and their environmental interaction generates beneficial weather conditions. Understanding surface water bodies’ behavior is crucial in water resources assessment, weather conditions modeling, crop improvement, flood mitigation, aquifer monitoring, wetland recording, and ecosystems studies, among many others. Coastal waters, in turn, are the interface between complex geographic and geomorphologic environments, giving rise to unique ecological systems. A large proportion of human settlements and activities are very close to coastal regions, exerting potentially or actually negative anthropic effects. Thus, adequate monitoring of these regions is essential to understand and mitigate these adverse effects.
Traditional monitoring and surveying techniques to acquire relevant information are known to be costly and cumbersome, and for this reason remote sensing has emerged as a feasible option since it produces adequate spatial and temporal information about the Earth’s surface. Optical satellite imagery provides satisfactory spectral information, which facilitates land cover determination, among other purposes. However, freely available satellite imagery usually lacks fine enough spatial and temporal resolution for small water body monitoring, or for the determination of small targets, e.g., oil spills, and also strongly depends on adequate daylight and weather conditions during acquisition. Synthetic Aperture Radar (SAR) sensors, in turn, emit active microwave pulses and then sense the energy that bounces back, with which it is possible to obtain backscatter information of the surface, regardless of daylight or weather conditions, and with arbitrary spatial resolution. SAR sensors can also “see” through canopy and vegetation layers, which is also necessary during an accurate detection of the actual surface of shallow water bodies (fresh or coastal), whose surface may be obscured by vegetation, either natural (mangroves, swamps, marshes, to name a few) or in waterlogged fields (rice crops, sugar cane, willows, and birches).
Water is characterized by a high dielectric constant that affects the backscatter intensity. Since SAR sensors can detect differences in geometric and dielectric properties, this technology appears adequate for studying surface water bodies, which can be distinguished in SAR images as dark regions. Data provided by SAR sensors have been widely used in recent years to detect and extract water patterns and to quantify their changes [
2,
3,
4,
5,
6,
7,
8,
9,
10].
Another vital area of research concerns the detection of oil spills in water. According to the European Space Agency (ESA), the coastal environment is being damaged because of tanker accidents and illegal ship discharges that spill large amounts of oil into the sea. One of the major problems is the difficulty in identifying the whole affected area, the degree of smoothness, and the direction of its movement. Oil spills are a threat to naval activity, human beings, and animal life, and they are of interest in public, political, and scientific fields. Since the speed of oil slicks ranges from 0.4 cm
−1 to 0.75 cm
−1 [
11], a timely detection methodology is crucial to prevent pollution and preserve natural resources.
SAR sensors also have the advantage of producing images of difficult to access zones, being a tool widely used in oil spill detection [
12]. However, this kind of image shows patches such as eddies, upwelling, internal waves, rain cells, and wind shadows, which are oil lookalikes but not actual oil features [
13]. Both phenomena appear as black spots in SAR images. Moreover, supervised algorithms face the problem of the scarcity of training samples [
14]. One of the most popular and simple techniques used in this task is visual inspection and manual delineation [
15], whose reliability strongly depends on the expertise and experience of personnel trained in photointerpretation. So far, the lack of robust identification techniques still raises the requirement of professionally trained supervision [
16]. In this sense, semi-automatic methods can be a good complement, alleviating the burden of humanly supervised tasks.
Unsupervised processing is advantageous over human-assisted processing, being less expensive and less affected by the typical human inaccuracies that arise due to fatigue, distractions, and other factors [
17,
18]. A notable disadvantage of human-assisted detection in this application domain is the difficulties and intra- and intersubject variance of the human vision in distinguishing fuzzy boundaries. These issues increase the execution costs and times, and demand more than one operator to achieve trustworthy results [
19]. In this context, edges play a fundamental role in image processing and computer vision. Although usually understood as
“changes in intensity or color along a border”, the notion of an edge can be more general. Edges are important because they can be used as simple descriptors of complex objects. Blake and Isard [
20] discussed their importance in applications that range from robotics to computer-based animation. In particular, in remote sensing applications, edge detection allows for fast delineation of features of interest as, for instance, shores. Detected edges can be later refined and used as feature descriptors in high-level interpretation procedures. Even though edges behave locally as lines, in some cases, there are only transition zones (fuzzy edges), which are relevant in the cases considered in this work. Most edge detection techniques employ local operators, i.e., operations that enclose a small region around each image position. In other words, the evidence of edge occurrence is assessed on a spatially limited region around each point. Remarkable examples of this approach are the Laplacian, Marr–Hildreth, and Canny operators, which find features as approximations of the unobserved continuous image gradient [
21].
Gambini et al. [
22] proposed a novel approach, (here termed “Gambini Algorithm” or GA). They analyzed a thin strip of pixels, finding evidence of a change of textural properties along the strip in an iterative fashion. If the strip crosses an edge between regions with different characteristics, the border is where such differences are maximal. The original proposal used the likelihood of univariate amplitude SAR data under the
model and obtained excellent results even in the presence of strong noise levels [
23]. This statistical line-search approach was then extended to fully polarimetric SAR data [
24]. Although very successful in some cases, in this approach, the border detection depends on computing a likelihood function several times, thus imposing a heavy computational burden. Nascimento et al. [
25] used stochastic distances between PolSAR samples in the GA approach, obtaining excellent results while reducing the computational cost. Naranjo-Torres et al. [
26] used a different class of distances, namely geodesic measures, on intensity data. This was the first attempt to use distances between intensity samples in the GA. It was computationally affordable and successful for SAR images with one or two looks (very noisy images).
Figure 1 shows a chronological overview of the use of the three main components that characterize an edge detection approach using GA, namely: (i) the model (the
model, its Harmonic
version, the Wishart distribution
, or distribution-free as in Ref. [
27]); (ii) the type of data (Amplitude, Intensity, Polarimetric); and (iii) the function to maximize (likelihood, non-parametric tests
, geodesic distances
, or features derived from
H-
divergences
). This figure also shows this paper’s contribution, namely the use of intensity data under the
model and a statistical test based on
divergences.
Nascimento et al. [
28] used stochastic distances between intensity samples. Stochastic distances were not used in the GA until Revollo et al. [
29] proposed a technique to detect oil spills based on change detection over data strips. The expressions in [
28] are more general than the geodesic distance and produce a wealth of distances that can be turned into statistical tests. In some cases, they rely on numerical integration.
The novelty of our approach is to combine the GA with a hypothesis test whose statistic is defined, for intensity data, in terms of stochastic distances derived from the
-family of divergences. Gambini et al. [
22] used probabilities to detect edges at distances no larger than
k pixels from the correct position. This quantitative measure is present in all subsequent studies. Instead, we propose a different quality assessment evaluating two measures, the Hausdorff distance between the estimated and actual edges and the Intersection over Union (IoU) of the corresponding areas. These measures are more relevant to the problem we are dealing with. Moreover, we also analyze the impact of the number of strips on the results, and propose a technique which adapts this parameter. Finally, we use eight stochastic distances and evaluate their performance with simulated and actual imagery. We tackle two problems, namely the recognition of oil spills and of water bodies boundaries with a unified approach. Both aim at delineating boundaries between targets with disparate (often unseen) properties, and suffer from different sources of confusion. The former, by waves and low wind; the latter, by shadows. Our solution relies on statistical features that are little disturbed by these issues.
This manuscript unfolds as follows.
Section 2 describes the required theory, including the multiplicative model (
Section 2.1) and hypothesis tests based on stochastic divergences (
Section 2.2 and
Section 2.3).
Section 3 is devoted to the methodology; the proposed algorithm is presented in
Section 3.1, and the test cases are described in
Section 3.2. In
Section 4 we present results obtained with simulated data and images from an operational sensor.
Section 5 concludes the article with an analysis of the results.
5. Conclusions
Water recognition is an essential task in diverse fields of knowledge, such as biological environment, climate, earthly disasters, agriculture, and tourism, among others. This information can be used either to improve the production of natural resources or to prevent damages caused by contamination, floods, and droughts.
The present work proposed an automated segmentation algorithm suited for small water bodies. First, candidate border points are detected along radial transects, taking into account the change in statistical properties of the images, implemented through a hypothesis test based on stochastic distances between distributions. Then, the estimated border is obtained when these points, or the set of these points that are not rejected as outliers, are used to define the B-splines curves. To select the most suitable stochastic distance, the suggested technique was performed using Arithmetic-Geometric, Bhattacharyya, Hellinger, Harmonic-Mean, Jensen–Shannon, Kullback-Leibler, Rényi, and Triangular distances. The method was tested both with simulated SAR images comparing different roughness levels and with four actual COSMO-SkyMed SAR images obtained from the SAOCOM/SIASGE constellation. Considering the ground truth border marked by an expert (in those cases, an optical version of the image was available) and the approximation attained, the results obtained were evaluated by computing the Hausdorff distance and the Intersection over Union.
The results suggest Harmonic-Mean and Triangular as the most suitable stochastic distances for lagoons’ border detection, and Bhattacharyya, Hellinger, and Jensen–Shannon for oil spill, using 63 ray strips and only points that are not rejected as outliers in all cases. Our approach relies on the assumption that different targets exhibit distinct statistical properties. If the models cannot retrieve such information, other features must be used. In future work, we are interested in studying this methodology’s extensions and modifications to other regions in SAR imagery with the purpose of a general segmentation algorithm. In addition, we will extend our proposal to handle non-convex targets.