Figure 1.
Segmentation framework of mangrove extraction using deep learning.
Figure 1.
Segmentation framework of mangrove extraction using deep learning.
Figure 2.
Location and false color combination of 10 m Sentinel-2 data (Shortwave-infrared reflectance (SWIR), G, and B) of the study area.
Figure 2.
Location and false color combination of 10 m Sentinel-2 data (Shortwave-infrared reflectance (SWIR), G, and B) of the study area.
Figure 3.
Architecture of the ME-Net. Arrows represent different operations, among which blue arrows represent convolution and pooling and red represents up-sampling; MCE: multiscale context embedding (MCE); GAM: global attention module; BFU: boundary fitting unit.
Figure 3.
Architecture of the ME-Net. Arrows represent different operations, among which blue arrows represent convolution and pooling and red represents up-sampling; MCE: multiscale context embedding (MCE); GAM: global attention module; BFU: boundary fitting unit.
Figure 4.
GAM architecture. Light green squares represent the characteristics of the low-stage maps, and yellow squares represent the high-stage maps. We connected the features of the adjacent stages to calculate the weight vector and then reweighted the low-stage feature map.
Figure 4.
GAM architecture. Light green squares represent the characteristics of the low-stage maps, and yellow squares represent the high-stage maps. We connected the features of the adjacent stages to calculate the weight vector and then reweighted the low-stage feature map.
Figure 5.
MCE architecture. The green box is the input feature map, the yellow box is the output feature map, the rectangle is the convolution operation of different sizes of convolution kernel, and the colored box is the multiscale feature map in series according to the channel.
Figure 5.
MCE architecture. The green box is the input feature map, the yellow box is the output feature map, the rectangle is the convolution operation of different sizes of convolution kernel, and the colored box is the multiscale feature map in series according to the channel.
Figure 6.
BFU architecture. The green box is the feature map, the yellow rectangle is the convolution operation, the brown rectangle is the batch normalization(BN) operation, the orange ellipse is ReLU, “+” represents that a feature map with the same size is added in accordance with the pixel position, the blue arrow represents the skipped connection, and the red box represents the feature mapping.
Figure 6.
BFU architecture. The green box is the feature map, the yellow rectangle is the convolution operation, the brown rectangle is the batch normalization(BN) operation, the orange ellipse is ReLU, “+” represents that a feature map with the same size is added in accordance with the pixel position, the blue arrow represents the skipped connection, and the red box represents the feature mapping.
Figure 7.
Two loss functions in ME-Net.
Figure 7.
Two loss functions in ME-Net.
Figure 8.
Samples of the Sentinel-2 remote sensing imagery used in the experiments.
Figure 8.
Samples of the Sentinel-2 remote sensing imagery used in the experiments.
Figure 9.
Accuracy and loss of ME-Net for training the datasets. The training accuracy (a) and loss (b) change with the epochs on the training datasets. The validating accuracy (c) and loss (d) change with the epochs on the validating datasets.
Figure 9.
Accuracy and loss of ME-Net for training the datasets. The training accuracy (a) and loss (b) change with the epochs on the training datasets. The validating accuracy (c) and loss (d) change with the epochs on the validating datasets.
Figure 10.
Results of mangrove extraction in different environments by different modules in ME-Net. The original images (a), the corresponding ground truth (b) and predictions (c–e) are presented. Green, red, blue, and black represent true positive (TP), false positive (FP), false negative (FN), and true negative (TN) respectively.
Figure 10.
Results of mangrove extraction in different environments by different modules in ME-Net. The original images (a), the corresponding ground truth (b) and predictions (c–e) are presented. Green, red, blue, and black represent true positive (TP), false positive (FP), false negative (FN), and true negative (TN) respectively.
Figure 11.
Results of mangrove extraction in a new mangroves region by different modules in ME-Net. The original images (a), the corresponding ground truth (b), and predictions (c–e) are presented. Green, red, blue, and black represent true positive (TP), false positive (FP), false negative (FN), and true negative (TN) respectively.
Figure 11.
Results of mangrove extraction in a new mangroves region by different modules in ME-Net. The original images (a), the corresponding ground truth (b), and predictions (c–e) are presented. Green, red, blue, and black represent true positive (TP), false positive (FP), false negative (FN), and true negative (TN) respectively.
Figure 12.
Effect of some sample data on the result of mangrove exaction under the ME-Net model. The first column (a) shows the actual color of the remote sensing imagery; the second column (b) shows the corresponding ground truth; the third column (c) shows the prediction result of the model after adding only three bands of RGB; the fourth column (d) shows the prediction result after adding MDI; the fifth column (e) shows the prediction result after adding five original bands and six multispectral indices. Green, red, blue, and black represent the TP, FP, FN, and TN, respectively.
Figure 12.
Effect of some sample data on the result of mangrove exaction under the ME-Net model. The first column (a) shows the actual color of the remote sensing imagery; the second column (b) shows the corresponding ground truth; the third column (c) shows the prediction result of the model after adding only three bands of RGB; the fourth column (d) shows the prediction result after adding MDI; the fifth column (e) shows the prediction result after adding five original bands and six multispectral indices. Green, red, blue, and black represent the TP, FP, FN, and TN, respectively.
Figure 13.
Results of ME-Net with different values of balance weight a.
Figure 13.
Results of ME-Net with different values of balance weight a.
Figure 14.
Performance of different pixel classification models in mangrove extraction. The first column (a) shows the actual color of the remote sensing imagery; the second column (b) shows the corresponding ground reality; the third column (c) shows the prediction result of object-oriented by ENVI; the fourth column (d) shows the prediction result of ResNet-based FCN model; the fifth column (e) shows the prediction result of DeepLab v3 model; the sixth column (f) shows the prediction result of the ME-Net model. Green, red, blue, and black represent the TP, FP, FN, and TN, respectively.
Figure 14.
Performance of different pixel classification models in mangrove extraction. The first column (a) shows the actual color of the remote sensing imagery; the second column (b) shows the corresponding ground reality; the third column (c) shows the prediction result of object-oriented by ENVI; the fourth column (d) shows the prediction result of ResNet-based FCN model; the fifth column (e) shows the prediction result of DeepLab v3 model; the sixth column (f) shows the prediction result of the ME-Net model. Green, red, blue, and black represent the TP, FP, FN, and TN, respectively.
Table 1.
The characteristics of Sentinel-2 imagery.
Table 1.
The characteristics of Sentinel-2 imagery.
Band | Band Name | Central (nm) | Wave Width (nm) | Spatial Resolution (m) |
---|
B1 | Aerosols | 442.3 | 45 | 60 |
B2 | Blue | 492.1 | 98 | 10 |
B3 | Green | 559 | 46 | 10 |
B4 | Red | 665 | 39 | 10 |
B5 | Vegetation red-edge | 703.8 | 20 | 20 |
B6 | Vegetation red-edge | 739.1 | 18 | 20 |
B7 | Vegetation red-edge | 779.7 | 28 | 20 |
B8 | Near infrared | 833 | 133 | 10 |
B8a | Vegetation red-edge | 864 | 32 | 20 |
B9 | Water-vapor | 943.2 | 27 | 60 |
B10 | Cirrus | 1376.9 | 76 | 60 |
B11 | Shortwave-infrared reflectance (SWIR-1) | 1610.4 | 141 | 20 |
B12 | Shortwave-infrared reflectance (SWIR-2) | 2185.7 | 238 | 20 |
Table 2.
Calculation method of multispectral indices.
Table 2.
Calculation method of multispectral indices.
Multispectral Indices | Calculation Method | Calculation Details in Sentinel-2 |
---|
NDVI | NDVI = (NIR − R)/(NIR + R) | (B8 − B4)/(B8 + B4) |
MNDWI | MNDWI = (Green − SWIR-1)/(Green + SWIR-1) | (B3 − B11)/(B3 + B11) |
FDI | FDI = NIR − (Red + Green) | B8 − (B4 + B3) |
WFI | WFI = (NIR − Red)/SWIR-2 | (B8 − B4)/B12 |
MDI | MDI = (NIR − SWIR-2)/SWIR-2 | (B8 − B12)/B12 |
Table 3.
The metrics of our best model, including precision, recall, F1 score and IoU value for mangrove extraction from remote sensing imagery.
Table 3.
The metrics of our best model, including precision, recall, F1 score and IoU value for mangrove extraction from remote sensing imagery.
Class | Precision (%) | Recall (%) | F1 (%) | IoU (%) |
---|
Mangrove | 96.88 | 98.30 | 96.56 | 97.49 |
Clutter | 99.72 | 99.13 | 99.43 | 98.86 |
Table 4.
The performance for the trained model on a new dataset.
Table 4.
The performance for the trained model on a new dataset.
Class | Precision(%) | Recall(%) | F1(%) | IoU(%) |
---|
Mangrove | 96.00 | 95.09 | 95.55 | 91.47 |
Clutter | 99.94 | 99.96 | 99.95 | 99.90 |
Table 5.
Effect of original band and multispectral index to the classification result of mangrove under the ResNet-based FCN structure.
Table 5.
Effect of original band and multispectral index to the classification result of mangrove under the ResNet-based FCN structure.
Sample Data | IoU (%) | Gains (%) |
---|
RGB | 86.64 | - |
RGB + NIR | 86.91 | 0.27 |
RGB + NIR + SWIR-1 | 87.33 | 0.42 |
RGB + NIR + SWIR-1 + NDVI | 88.25 | 0.92 |
RGB + NIR + SWIR-1 + NDVI + MNDWI | 89.49 | 1.24 |
RGB + NIR + SWIR-1 + NDVI + MNDWI + FDI | 89.94 | 0.35 |
RGB + NIR + SWIR-1 + NDVI + MNDWI + FDI + WFI | 90.52 | 0.68 |
RGB + NIR + SWIR-1 + NDVI + MNDWI + FDI + WFI + MDI | 91.57 | 1.05 |
RGB + NIR + SWIR-1 + NDVI + MNDWI + FDI + WFI + MDI + PCA1 | 92.13 | 0.56 |
Table 6.
Effect of normalization difference vegetation index (NDVI), modified normalized difference water index (MNDWI), and mangrove discrimination index (MDI) on the result of mangrove extraction under the ResNet-based FCN.
Table 6.
Effect of normalization difference vegetation index (NDVI), modified normalized difference water index (MNDWI), and mangrove discrimination index (MDI) on the result of mangrove extraction under the ResNet-based FCN.
Sample Data | IoU (%) |
---|
all | 92.13 |
without NDVI | 91.32 |
without MNDWI | 90.93 |
without MDI | 90.76 |
Only RGB | 86.64 |
Table 7.
The IoU for data in
Figure 12 from rows 1 to 5.
Table 7.
The IoU for data in
Figure 12 from rows 1 to 5.
| Only RGB | Without MDI | All |
---|
Row1 | 0.9712 | 0.9853 | 0.9901 |
Row2 | 0.7667 | 0.8017 | 0.8717 |
Row3 | 0.8607 | 0.8723 | 0.8824 |
Row4 | 0.9353 | 0.9606 | 0.9720 |
Row5 | 0.9549 | 0.9598 | 0.9646 |
Table 8.
Effects of GAM, MCE and BFU on the extraction results of mangroves.
Table 8.
Effects of GAM, MCE and BFU on the extraction results of mangroves.
Methods | IoU (%) |
---|
ResNet-based FCN | 92.13 |
ResNet-101 + GAM (C1) + GMP | 95.55 |
ResNet-101 + GAM (C1) + GAP | 95.62 |
ResNet-101 + GAM (C3) + GAP | 95.71 |
ResNet-101 + GAM (C3) + GAP + MCE (C1355) | 96.16 |
ResNet-101 + GAM (C3) + GAP + MCE C1357) | 96.24 |
ResNet-101 + GAM (C3) + GAP + BFU | 95.89 |
ResNet-101 + GAM (C3) + GAP + MCE (C1357) + BFU | 96.97 |
ResNet-101 + GAM (C3) + GAP + MCE (C1357) + BFU + DS | 97.22 |
ResNet-101 + GAM (C3) + GAP + MCE (C1357) + BFU + DA | 97.09 |
ResNet-101 + GAM (C3) + GAP + MCE (C1357) + BFU + DS + DA | 97.48 |
Table 9.
Experimental results of ME-Net and other methods.
Table 9.
Experimental results of ME-Net and other methods.
Methods | IoU (%) |
---|
SegNet | 81.39 |
FCN | 84.62 |
DilatedNet | 86.91 |
DeepLabv1 | 87.76 |
U-Net | 89.04 |
DeepLabv2 | 90.06 |
PSPNet | 91.82 |
MaskRCNN | 93.16 |
DeepLabv3 | 94.53 |
Ours | 96.97 |
Table 10.
The IoU for data in
Figure 14 from rows 1 to 5.
Table 10.
The IoU for data in
Figure 14 from rows 1 to 5.
| Object-Oriented | ResNet-Based FCN | DeepLab v3 | ME-Net |
---|
Row1 | 0.7966 | 0.9776 | 0.9878 | 0.9882 |
Row2 | 0.8394 | 0.9883 | 0.9885 | 0.9886 |
Row3 | 0.8355 | 0.9868 | 0.9877 | 0.9877 |
Row4 | 0.7952 | 0.9736 | 0.9738 | 0.9748 |
Row5 | 0.8069 | 0.9751 | 0.9782 | 0.9805 |