1. Introduction
Maritime transport and oil rigs are the main causers of marine pollution as a result of tank cleaning and accidents. Ships transport more than 500 million tons of petrol and 300 million of oil products each year and, according to information obtained from the Spanish Maritime Safety Agency (SASEMAR), this maritime traffic spills more than 20 million m3/year of both oil and other hydrocarbons into the waters of the European Union.
Oil spill detection is currently carried out by the European Maritime Safety Agency (EMSA), whose CleanSeaNet observation services for oil spill monitoring and vessel detection do so using image from ENVISAT, RASARSAT and SENTINEL satellites, among others. CleanSeaNet is used by governments as a first step toward monitoring oil spills. In a second step, the maritime agencies of each country use aircraft with airborne sensors to monitor the sea surface in more detail. The second step is required because the EMSA satellites can monitor only the same point on the surface of the sea every three days, while and the oil spill continues to expand on the surface, causing significant environmental damage.
Many sensors are used for oil-spill detection, as is presented in [
1]. The most common, but not the best, is a radar mounted on satellites and aircraft in order to detect oil on water during searches carried out over a large area. Several efforts have been made to discuss the problem of spill detection, with attention being paid to descriptions of the different satellite radars and oil spill detectability under varying conditions [
2,
3,
4].
There are two ways in which to approach the oil-spill detection problem: studying the characterization of the slick by means of the multi-polarization features of SAR techniques, as occurs in [
5,
6,
7], or using the brightness image obtained from the backscatter signal without considering the parameters of the processes of image acquisition and formation, as in [
8,
9,
10,
11,
12,
13,
14,
15,
16,
17,
18,
19]. Our work falls within the second category. Most of the approaches based on image processing techniques focus on segmenting SAR images in order to identify the dark-spot region that represents the oil slick [
8,
9,
10,
11,
12], on extracting features so as to recognize the oil slick from previously segmented images [
13,
14,
15], or on classifying dark spots using machine learning techniques [
16], statistical classifiers [
17,
18,
19] or/and artificial neural networks [
20,
21,
22,
23]. State-of-the-art methods sometimes use a combination of several approaches, as in [
24], and on other occasions, the image processing methods are combined with Geographic Information Systems (GIS), as in [
25], to provide both the detection and precise location of marine spills.
Some disadvantages of the methods presented in the state of the art are related to the difficulty involved in automating the oil slick detection task, as mentioned in [
26], in which a hybrid system that integrates several automatic and semi-automatic procedures (with human-supervision) is presented. Moreover, the methods are usually tested in an offline process and this may make the performance of the detection task in an onboard detection system difficult, because they are not always designed for input images with sufficient variability (i.e., the acquisition conditions are always chosen in the same way, signifying that the background produced is usually homogeneous or has a specific texture pattern). Another drawback of some of the state-of-the-art methods is the use of cropped images as inputs rather than using images representing a whole mission or full scan (i.e., an image represents a close-up of the oil spill with some background) [
9,
12,
27,
28,
29]. The input images also frequently show only two types of targets (i.e., the sea and oil slick zones) [
30] and when the image shows others, such as land, then pre-processing filters are manually configured to hide or eliminate the pixels in those zones [
10,
13]. All of these problems still require a solution. Moreover, detection from SLAR images requires finding a solution to these problems, and particularly the last two obstacles, such as working with a full image and using images with other false targets, such as artifacts and noise caused by the flight of the aircraft. Some authors have previously presented a segmentation method based on a boundary [
31] or the Markov Random Field (MRF) energy function [
32] to reduce or suppress the effect of random noise, low contrast and dayglow contamination.
In particular, the proposed method is based on the segmentation technique category, without considering the multi-polarization features of the radar. But unlike the works in [
8,
9,
10,
11,
12,
13,
14,
15,
16,
17,
18], the input is a SLAR and not a SAR image. It is very difficult to find works in the state of the art that address oil-spill detection using SLAR images. A recent example in this respect is [
22], in which a solution based on a Recurrent Neural Network (RNN) is proposed. The method detects the oil spill using the row of SLAR image as inputs, in which each row represents a scanning of the sea surface in a time t. Other recent works were presented in [
23,
33]. The authors of [
23] present an Auto-encoder Neural Network that uses an unsupervised learning algorithm to segment the oil slick in SLAR images used as input without any previous pre-processing. In [
33], two traditional segmentation methods based on graphs and on the concept of J-image are tested for the oil-spill detection task.
The aim of this work is to present a method based on SLAR image segmentation in order to overcome some of the disadvantages described above. Our method works well with a full image without markers and without cropping the image. In addition, we do not use pre-processing filters to manually eliminate unwanted zones, such as land zones or noise caused by aircraft.
This paper is organized as follows:
Section 2 comments on the features of an SLAR image and the difference between the SLAR and the SAR, in addition to describing the dataset obtained from a SLAR mounted on an aircraft.
Section 3 describes the proposed method. The experiments showing the behavior of the method are presented in
Section 4. Finally, our conclusions and future works are described in
Section 5.
2. SLAR Image
Oil spill detection is usually carried out by governments, principally using two types of sensors: the Synthetic Aperture Radar (SAR), which is installed on satellites, and the Side-Looking Airborne Radar (SLAR), which is installed on aircraft (
Figure 1). Unlike other sensors, such as visible spectrum and infrared sensors, both the SAR and the SLAR can monitor a wide area of the sea, even if there is low visibility owing to adverse weather conditions such as clouds or rain. In addition, they can work well at night. The SAR is able to acquire image segments from between 200 and 1400 km, whereas the SLAR image represents image segments with a width and height of 45 km, depending on the number of minutes that the mission lasts. Consequently, SLAR provides images occupying fewer bytes than SAR images. Due to this fact, the use of SLAR enables the aircraft to supply real-time image transmission using a satellite communication link with an operator on earth. Moreover, the acquisition process works on-demand controlled by the flight operator, who commands the aircraft sensors according to flight planning and the type of surveillance mission.
SLAR images are a visual representation of the interaction of the radar pulse emitted and the surface (sea, land, ship, oil slick, etc.) onto which it is projected. Each pixel in a SLAR image, therefore, represents the backscattering coefficient of that scanned area, and the stronger the backscatter (return signal), the higher the brightness value.
When both the SAR and the SLAR acquire an intensity image, the oil slicks in it are represented as dark spots (i.e., pixels with low grayscale). This is because dark pixels indicate areas with low backscatter whereas light pixels indicate areas with high backscatter. Nevertheless, it is usual for SLAR images to display other targets, such as ocean phenomena (i.e., the sea surface with an absence of waves owing to little wind) or natural activities (i.e., algae banks and phytoplankton blooms), as dark spots. More specifically, the wind must have a velocity of more than 5 kn. for sea to be sufficiently agitated to produce waves so that the sensor works well. The absence of waves does not allow the SLAR to distinguish between oil slicks and sea zones because the backscatter is low and very similar in both cases.
Moreover, SLAR images always show other phenomena which do not appear in SAR images and which are represented by dark spots that cause confusion in the oil-slick detection task. The problem is that the location of SLAR antennas, under the aircraft wings (
Figure 1b), causes noisy areas in image that may lead to confusion in the detection task. Aircraft maneuvers (turns, slips, etc.) can, similarly, also cause other noisy areas (
Figure 2).
Another important difference of SLAR versus SAR is the type of noise. SAR image usually presents granular appearance because of speckle noise, while SLAR presents artifacts. Unlike SAR, SLAR image is actually a mosaic image composed of simultaneous scans performed by each antenna. Therefore, SLAR image can include noise caused by small differences in alignment between both antennas. The position and inclination of each antenna determine the difference between the look angles. This issue influences both the shape and the size of noisy area, named as hidden area in
Figure 2. Additionally, this hidden area is modified depending on both the flight altitude and ascent/descent speed. SAR image does not suffer this problem because it does not have any hidden area since it is not a mosaic image.
The width and intensity of noisy areas depend mainly on motion in the longitudinal axis parallel to the fuselage reference line. Due to this rotational movement, the antennas change the look angles. One antenna receives bad measures while the other receives backscatter loss because it looks towards the sky.
Figure 2 shows a SLAR image that represents 7′ and 20″ of a sea surface scan taken by the aircraft shown in
Figure 1. This digital image was acquired on a marine surveillance mission on an ascending fight path (between 1552 and 1896 ft.), on which the initial latitude and longitude were N 40°51′ and E 002°22.8′ for the starting point and N 39°54.8′-E 1°11′ for the end point.
The dataset used as input in our experimentation contains 7 SLAR images of oil-slick scenes, which were acquired by a TERMA SLAR 9000 sensor (TERMA A/S, Lystrup, Denmark) mounted on an EADS-CASA CN 235–300 aircraft (EADS CASA/Airbus Group, Madrid, Spain). TERMA SLAR is a Media Wave Radar (MWR) sensor with a frequency of 9.375 MHz that, when mounted on a CN 235–300, can view about 46 km on each side of the aircraft. The beam width of each SLAR antenna is 0.6° horizontal and 19° vertical.
All the images have a resolution of 1157 × 482 pixels and each pixel is 8 bits in depth, according to the digitalization of the Terma sensor control software. They were supplied by the Spanish Maritime Safety and Rescue Agency (SASEMAR). The details of the SLAR images used in the experimentation are described in
Table 1. In addition, the proposal method is presented in
Figure 3 and some of the used parameters are summarized in
Table 2.
3. Method
The main contribution of this work is a method with which to detect oil slicks in SLAR images.
Figure 3 represents a complete scheme of our method, based on image processing techniques and computer vision. It particularly shows all the processing stages in the detection of oil slicks in SLAR images acquired from an onboard sensor. First, a ‘denoising caused by aircraft’ phase detects the noise caused by aircraft maneuvers while the mission is being carried out, after which an ‘oil slick region detection’ phase detects and locates the oil slicks in the image. The method is discussed in the following sub-sections.
3.1. Denoising Caused by the Airplane
SLAR images show noise zones caused by the absence of a radar signal. These noise zones are a consequence of three circumstances: a hidden area under the aircraft fuselage without a radar signal, and changes in location and aircraft turns that take place during an emergency mission and during which the radar signal is lost (
Figure 2). The noise zones lead to significant measurement errors which are subsequently registered as erroneous values in the pixels obtained by the digitalized process. It is, therefore, important to be able to automatically detect the pixels caused by noise zones for their subsequent labeling so that they will not be considered in the oil spill detection process. Those pixels that represent noise could cause false positives in the detection process. It is assumed that the hidden area is always unique for each input image, that its location in the image is unknown and that both the number of noise zones and their locations in the image owing to changes and turns are also unknown.
The methodology for noise zone detection is implemented as a three-step algorithm. First, the input images I(
x,
y) are filtered using Gabor filters [
34]; H. Gabor filters use directional masks to detect abrupt changes of intensity in different directions. Second, the Hough transform [
35] is applied to the filtered image in order to obtain the straight lines which delimit the abrupt changes as the border of the noise zone. Third, the input image is cropped to eliminate the pixel regions in which the noise zones were detected. A new SLAR image, which is smaller than original image, is consequently built from the input image to be used later in the detection process. Examples of the first and second steps of our algorithm are shown in
Figure 4. The detection process carried out using the output of this first phase is described in the following section.
The Gabor filter, H, is a low-pass filter that passes signals with a frequency that is lower than a reference frequency, f. Gabor makes it possible to eliminate texture pattern by detecting the pixels (
x,y) with intensity values which have a specific level of repetitiveness, i.e., that are homogeneous areas in the image. The Gabor filter is defined as follows:
where
defines the wavelength of the sinusoidal factor,
is the orientation of the normal to the parallel stripes of a Gabor filter,
is the phase offset,
is the standard deviation of the Gaussian envelope and
is the spatial aspect ratio. Nevertheless, the output of Terma SLAR sensor is an image digitalized from a time sequence of scanning and H can, therefore, be represented as a discrete filter as follows:
where
represents the frequency that determines the type of texture pattern and
is its orientation direction. The neighbors considered are controlled with the parameter of the Gaussian standard deviation
. In our experimentation, we have used
and
to obtain good results with our dataset of SLAR images. Note that each turn area is a noisy region between two horizontal imaginary lines whose orientation is close to 0°. Besides, hidden area is another noisy area between two vertical parallel lines whose orientation is close to 90° (
Figure 4a). For this reason, we tested values close to 0° and 90° for the texture patterns in a range of ±10°. We got the best results with 0° and 92°. Later, we apply a Gaussian filter
and size 3 × 3 to remove noise due to the response of Gabor filters. We have chosen these values because the effect of filtering with higher values causes too smoothing and consequently, the texture pattern of aircraft maneuvers can be partially erased and this issue difficult the detection of the borders of the noisy areas. A DoG-Canny operator with non-maximum suppression is later used to detect pixels with very different intensity values to those of their neighbors as regards magnitude and gradient direction. In this step, we have used 3 × 3 Gaussian masks and thresholds of 1 and 5 for the Canny method:
The result of first step provides an image in which the boundary pixels belonging to the noise zones are detected (
Figure 4b). We later compute the Hough transform combined with a k-means clustering algorithm to locate straight lines as an approach fitting in order to determine the border of the noise zones. We particularly search for the straight lines with an orientation of 90° and 270° for a hidden area and 0° and 180° for the asymptotes of the parables caused by the aircraft turns (
Figure 4a–c). Note that the number of turns and their positions in the image are unknown, and a statistical analysis has, therefore, accordingly been introduced in order to identify those data in the image. What is more, the Hough Transform provides too many straight lines with an identical orientation and close positions, and a k-means clustering algorithm [
36] is, therefore, used to cluster straight lines and to show only those lines that define the border in order to divide two turn zones. This analysis is performed by computing the k-means, using different cluster values to search for the k value that minimizes the variance between clusters (Equation (4)). The number of turns in the image is consequently determined by the number of clusters obtained when applying the algorithm. Each turn will, therefore, be delimited by two straight lines, which are determined by the distance between them and the centroid of the turn:
where each cluster
is the set of points that defines the straight line separating two turn zones,
is each straight line in the polar coordinates detected by the Hough transform and
is the mean of the points for
.
Once the borders of both the turn and the hidden zones have been calculated in the image, the image is divided into several sub-images using the straight lines of the detected borders as the splitting criterion (
Figure 4d). The sub-images representing turns and hidden zones can consequently be eliminated and the method can automatically recompose an entire new image from the merging process using only the remaining sub-images. The new recomposed image will have less noise that could cause false positives in the detection process.
The result of the process carried out to remove the noise caused by the airplane is presented in
Figure 4 and the summary of parameters for this step of our method is shown in
Table 2.
3.2. Candidate Regions Detection to Be Oil Spills
Once the noise has been eliminated, we employ a method based on a Fourier transform and the concept of residual spectral [
37]. This method obtains a saliency map that simplifies the representation of the image without noise (
Figure 5). The aim is to identify the pixels that provide a greater level of appearance information. The saliency technique is, therefore, used to locate objects from a non-homogeneous and non-uniform background as a previous step to the segmentation of regions method. The location of regions using segmentation methods based on color or grayscale without carrying out preprocessing beforehand does not provide good results when the brightness values of the pixels in the background change. The saliency map, therefore, provides a pre-processing step with which to perform a meaningful segmentation process that detects the pixels representing the oil spills in the image. In order to compute the saliency map, it is first necessary to calculate the spectral residue of the input image (
Figure 5c).
The spectral residue is denoted as
where
is the Discrete Fourier Transform of the image
. It can also be expressed in polar coordinates in terms of two real functions, amplitude
and phase
, where
is a matrix related to the intensity values of the pixels and
is another matrix with information related to the pixel positions. The spectral residue can, therefore, be denoted using
and is calculated as:
where
is a mean filter that we have chosen with a 3 × 3 sized mask. The map saliency of an image
is later calculated using the inverse of the spectral residue generated from the Discrete Fourier Transform (DFT) of image
as follows:
The result of applying the saliency map technique is the generation of several candidate oil-spill regions in the image. Each of the regions detected represents a slick in the SLAR image which could be considered as an oil spill. After obtaining the saliency map, we apply an adaptive thresholding method. We can use any of the methods discussed in [
38] to reduce the non-uniformity of intensity in the saliency regions. In particular,
Figure 5d shows the result of a binarization using Niblack’s method. We consider a neighborhood window for our seed pixel and then we use as threshold the intensity mean of windows. After, we compare the threshold with the rest of pixels which are not black taking into consideration of standard deviation as tolerance value.
3.3. Reconstruction of Oil Spills
The detection of candidate oil-spill regions is not sufficient when the purpose is to analyze the changes in the shape and size of the oil slick over time. The SLAR sensors are employed on emergency missions to map, monitor and analyze sea surfaces from aircraft in order to seek oil spills, detect the ships that have caused them and evaluate the disaster according to track the boundary of the oil spill. A reconstruction method must, therefore, be applied to the SLAR image after detecting the candidate regions. The basis of the reconstruction method is a segmentation process that is based on region growing and is used to reconstruct the contour of the oil slick from pixels of candidate regions obtained from the saliency map. The reconstruction, therefore, makes it possible to define the contour pixels of the candidate region and to determine both its size (i.e., using perimeter and area) and shape (i.e., shape descriptor). A comparison of its shape and size at different times is then used to estimate how the oil dissolves in the water, which depends on the oil density, the tides and winds, etc.
The region growing segmentation requires the selection of initial seed pixels to implement the process of pixels clustering. The grayscale value of a seed pixel is used as a reference value to determine the mean homogeneity that the neighboring pixels should have for them to be added to a same cluster. In our work, we use the centroid coordinates of candidate regions obtained from saliency map as seed pixels that will be applied to the original image (SLAR image without any processing). There will consequently be as many seed pixels as the candidate regions that were detected. The centroid is computed by using moments of order up to 1 [
39].
In order to perform the process of adding pixels to a cluster, we use an 8-connected neighborhood. Hence, if any adjacent pixel that considers 8-connectivity has a grayscale value similar to the seed pixel, then it is added to the cluster of the seed pixel, and they are classified as new seed pixels for the next iteration of the algorithm. The region growing algorithm follows an iterative process until there are no changes in two successive iterative stages, i.e., the new pixels are not added to grow the region.
When the region growing is applied to the candidate region that is an oil slick (this is a True Positive, TP), the growing algorithm stops because the region is bounded by an edge pixel in which the gradient is high in several directions while other pixels within the region have similar grayscale values because they represent the homogeneous dark spots. On the contrary, the growing algorithm grows throughout the image when the candidate region is not an oil slick (this is a False Positive, FP). The FP is a consequence of imprecisions in the method with which to eliminate the noise zones in a precise manner, shown in
Section 3.1.
Finally, our method is able to extract features from the pixels in the candidate regions (both the TP and the FP) in order to determine the physical features of the oil slick for the TP and to ensure that the FP is discarded. It is possible to compute the area (pixels within a dark spot representing the candidate region), perimeter (contour pixels), elongation, bounding box, etc. This matter is not, however, addressed in this work.
5. Conclusions
In this work, we propose an oil slick detection method for SLAR images using a strategy based on image processing and computer vision techniques without using the characterization of oil slicks obtained from the multi-polarization features of a radar. We use only the intensity image obtained from the backscatter signal without considering radar parameters. The strategy consists of two stages. First, SLAR images are processed to identify noise regions caused by aircraft maneuvers. This step is required to avoid the detection of false positives (i.e., pixels which are not the pixels of an oil slick but have similar brightness features). Second, SLAR images, which have been processed beforehand, are used to detect oil slicks. This process can be performed automatically and can also be carried out by on-boards sensors once the radar scan is digitalized as an SLAR image.
Although the state of the art almost never contains approaches with which to detect the dark spots of oil slicks in SLAR images, we have made an effort to make a brief comparison between our method and other approaches for the same task using SLAR images as inputs.
As future work, we would like to add deep learning techniques as data augmentation in order to artificially increase the size of the training set and thus improve the detection. Data augmentation consists of transforming the original SLAR image to obtain synthetic samples when the example of missions is not sufficient to design and implement an automatic detection method. In this line, our idea is to also prove the behavior of Convolutional Neural Networks (CNN) in order to simultaneously detect more targets. For example, CNN will provide more flexibility for use in new situations and will make it possible to detect other targets, such as coastlines or ships, at the same time as oil slicks.