Figure 1.
The distribution chart of the six main remote-sensing objects in the urban scenario. It can be seen from the figure that the percentages of different kinds of remote-sensing objects obviously obey a long-tailed distribution.
Figure 1.
The distribution chart of the six main remote-sensing objects in the urban scenario. It can be seen from the figure that the percentages of different kinds of remote-sensing objects obviously obey a long-tailed distribution.
Figure 2.
The loss reweighting method based on the number of samples from each category does not consider the difference in classification difficulty of samples from various categories and cannot adapt to the relatively dynamic training process of the model, which can easily cause the misalignment between the category weights and the real learning state of the model.
Figure 2.
The loss reweighting method based on the number of samples from each category does not consider the difference in classification difficulty of samples from various categories and cannot adapt to the relatively dynamic training process of the model, which can easily cause the misalignment between the category weights and the real learning state of the model.
Figure 3.
The overall structure of the proposed CCSMLW.
Figure 3.
The overall structure of the proposed CCSMLW.
Figure 4.
The distribution of the number of samples from each category after long-tailed processing for the HistAerial dataset, the SIRI-WHU dataset, the NWPU-RESISC45 dataset, and the PatternNet dataset.
Figure 4.
The distribution of the number of samples from each category after long-tailed processing for the HistAerial dataset, the SIRI-WHU dataset, the NWPU-RESISC45 dataset, and the PatternNet dataset.
Figure 5.
Long-tailed sampling percentage of the AID dataset and the distribution of the number of samples from each category after long-tailed processing.
Figure 5.
Long-tailed sampling percentage of the AID dataset and the distribution of the number of samples from each category after long-tailed processing.
Figure 6.
(a–c) show the classification accuracy of the proposed method and Softmax cross-entropy method for samples from each category in the long-tailed HistAerial dataset with imbalance ratios of 0.005,0.01, and 0.02, respectively. (d) shows the classification confusion matrix of the proposed method on the long-tailed HistAerial dataset with an imbalance ratio of 0.005.
Figure 6.
(a–c) show the classification accuracy of the proposed method and Softmax cross-entropy method for samples from each category in the long-tailed HistAerial dataset with imbalance ratios of 0.005,0.01, and 0.02, respectively. (d) shows the classification confusion matrix of the proposed method on the long-tailed HistAerial dataset with an imbalance ratio of 0.005.
Figure 7.
(a–c) show the classification accuracy of the proposed method and Softmax cross-entropy method for samples from each category in the long-tailed SIRI-WHU dataset with imbalance ratios of 0.01,0.02 and 0.05, respectively. (d) shows the classification confusion matrix of the proposed method on the long-tailed SIRI-WHU dataset with an imbalance ratio of 0.02.
Figure 7.
(a–c) show the classification accuracy of the proposed method and Softmax cross-entropy method for samples from each category in the long-tailed SIRI-WHU dataset with imbalance ratios of 0.01,0.02 and 0.05, respectively. (d) shows the classification confusion matrix of the proposed method on the long-tailed SIRI-WHU dataset with an imbalance ratio of 0.02.
Figure 8.
(a–c) show the classification accuracy of the proposed method and Softmax cross-entropy method for samples from each category in the long-tailed NWPU-RESISC45 dataset with imbalance ratios of 0.005,0.01, and 0.02, respectively. (d) shows the average classification accuracy of the proposed method and the Softmax cross-entropy on the long-tailed NWPU-RESISC45 data set with imbalance ratios of 0.005, 0.01, and 0.02 for samples from the head categories, middle categories, and tail categories, respectively.
Figure 8.
(a–c) show the classification accuracy of the proposed method and Softmax cross-entropy method for samples from each category in the long-tailed NWPU-RESISC45 dataset with imbalance ratios of 0.005,0.01, and 0.02, respectively. (d) shows the average classification accuracy of the proposed method and the Softmax cross-entropy on the long-tailed NWPU-RESISC45 data set with imbalance ratios of 0.005, 0.01, and 0.02 for samples from the head categories, middle categories, and tail categories, respectively.
Figure 9.
(a–c) show the classification accuracy of the proposed method and Softmax cross-entropy method for samples from each category in the long-tailed PatternNet dataset with imbalance ratios of 0.005, 0.01, and 0.02, respectively. (d) shows the average classification accuracy of the proposed method and the Softmax cross-entropy method on the long-tailed PatternNet dataset with imbalance ratios of 0.005, 0.01, and 0.02 for samples from the head categories, middle categories and tail categories, respectively.
Figure 9.
(a–c) show the classification accuracy of the proposed method and Softmax cross-entropy method for samples from each category in the long-tailed PatternNet dataset with imbalance ratios of 0.005, 0.01, and 0.02, respectively. (d) shows the average classification accuracy of the proposed method and the Softmax cross-entropy method on the long-tailed PatternNet dataset with imbalance ratios of 0.005, 0.01, and 0.02 for samples from the head categories, middle categories and tail categories, respectively.
Figure 10.
(a–c) show the classification accuracy of the proposed method and Softmax cross-entropy method for samples from each category in the long-tailed AID dataset with decay factors of 2, 3, and 4, respectively. (d) shows the average classification accuracy of the proposed method and the Softmax cross-entropy method on the long-tail AID dataset with decay factors of 2, 3, and 4 for the head categories, middle categories, and tail categories, respectively.
Figure 10.
(a–c) show the classification accuracy of the proposed method and Softmax cross-entropy method for samples from each category in the long-tailed AID dataset with decay factors of 2, 3, and 4, respectively. (d) shows the average classification accuracy of the proposed method and the Softmax cross-entropy method on the long-tail AID dataset with decay factors of 2, 3, and 4 for the head categories, middle categories, and tail categories, respectively.
Figure 11.
(a) shows the effect of hyperparameters and on the average accuracy of our proposed method. (b) shows the effect of hyperparameters and on the stability of our proposed method.
Figure 11.
(a) shows the effect of hyperparameters and on the average accuracy of our proposed method. (b) shows the effect of hyperparameters and on the stability of our proposed method.
Figure 12.
(a–d) show the classification accuracy of our proposed methods, SeeSaw, Class Balanced, Effective Number with Softmax cross-entropy for samples from each category on the long-tailed HistAerial dataset, respectively.
Figure 12.
(a–d) show the classification accuracy of our proposed methods, SeeSaw, Class Balanced, Effective Number with Softmax cross-entropy for samples from each category on the long-tailed HistAerial dataset, respectively.
Table 1.
Experimental equipment configuration.
Table 1.
Experimental equipment configuration.
Item | Contents |
---|
Processor | Intel Core I9-12900K |
Memory | 64GB |
Operating system | Ubuntu20.04 |
Gpu | 3090ti |
Pytorch | 1.10.1 |
Table 2.
Comparison results of the proposed method with other similar advanced methods on the long-tailed HistAerial dataset with imbalance factors of 0.005, 0.01, and 0.02. The values in the table are the top1 OA (%).
Table 2.
Comparison results of the proposed method with other similar advanced methods on the long-tailed HistAerial dataset with imbalance factors of 0.005, 0.01, and 0.02. The values in the table are the top1 OA (%).
Imbalance Factor | 0.005 | 0.01 | 0.02 |
---|
Softmax | 57.18 ± 1.89 | 67.42 ± 0.54 | 72.27 ± 1.61 |
SeeSaw [35] | 60.85 ± 1.43 | 68.75 ± 0.74 | 74.85 ± 1.00 |
Class Balanced | 68.67 ± 0.66 | 74.59 ± 0.73 | 77.61 ± 0.52 |
Effective Number [33] | 67.61 ± 0.87 | 74.91 ± 0.52 | 77.96 ± 0.29 |
Equalization [34] | 56.59 ± 1.31 | 63.93 ± 1.03 | 74.32 ± 0.28 |
Equalizationv2 [43] | 52.27 ± 0.64 | 62.67 ± 0.53 | 72.95 ± 0.21 |
Focal [44] | 45.37 ± 1.08 | 57.94 ± 0.41 | 67.52 ± 1.01 |
GHM [45] | 34.00 ± 0.83 | 38.87 ± 0.76 | 44.18 ± 2.82 |
ours | 70.14 ± 0.63 | 75.60 ± 0.80 | 78.70 ± 0.74 |
Table 3.
Comparison results of the proposed method with other similar advanced methods on the long-tailed SIRI-WHU dataset with imbalance factors of 0.01, 0.02, and 0.05. The values in the table are the top1 OA (%).
Table 3.
Comparison results of the proposed method with other similar advanced methods on the long-tailed SIRI-WHU dataset with imbalance factors of 0.01, 0.02, and 0.05. The values in the table are the top1 OA (%).
Imbalance Factor | 0.01 | 0.02 | 0.05 |
---|
Softmax | 47.8 ± 1.21 | 49.68 ± 1.00 | 62.01 ± 0.69 |
SeeSaw [35] | 47.24 ± 0.81 | 49.62 ± 0.37 | 65.11 ± 0.35 |
Class Balanced | 49.98 ± 1.24 | 54.09 ± 0.83 | 64.72 ± 1.29 |
Effective Number [33] | 47.14 ± 0.33 | 54.12 ± 0.56 | 66.78 ± 0.42 |
Equalization [34] | 48.27 ± 0.73 | 49.38 ± 0.54 | 61.75 ± 0.25 |
Equalizationv2 [43] | 45.87 ± 0.36 | 50.83 ± 0.76 | 62.70 ± 0.40 |
Focal [44] | 46.89 ± 2.38 | 50.55 ± 1.30 | 64.57 ± 0.59 |
GHM [45] | 43.80 ± 1.54 | 45.40 ± 1.58 | 50.30 ± 1.77 |
ours | 50.71 ± 1.07 | 54.91 ± 0.72 | 67.35 ± 0.36 |
Table 4.
Comparison results of the proposed method with other similar advanced methods on the long-tailed NWPU-RESISC45 dataset with imbalance factors of 0.005, 0.01, and 0.02. The values in the table are the top1 OA (%).
Table 4.
Comparison results of the proposed method with other similar advanced methods on the long-tailed NWPU-RESISC45 dataset with imbalance factors of 0.005, 0.01, and 0.02. The values in the table are the top1 OA (%).
Imbalance Factor | 0.005 | 0.01 | 0.02 |
---|
Softmax | 49.29 ± 0.59 | 54.53 ± 0.50 | 59.79 ± 0.22 |
SeeSaw [35] | 51.18 ± 0.97 | 56.15 ± 0.37 | 62.24 ± 0.68 |
Class Balanced | 45.03 ± 0.36 | 52.88 ± 0.74 | 60.90 ± 0.46 |
Effective Number [33] | 45.72 ± 0.69 | 53.87 ± 0.12 | 61.69 ± 0.20 |
Equalization [34] | 50.73 ± 0.41 | 54.71 ± 0.34 | 61.49 ± 0.17 |
Equalizationv2 [43] | 50.69 ± 0.08 | 56.29 ± 0.47 | 61.99 ± 0.18 |
Focal [44] | 43.39 ± 1.02 | 46.66 ± 1.23 | 52.44 ± 1.52 |
GHM [45] | 37.47 ± 0.25 | 40.58 ± 0.75 | 45.07 ± 0.23 |
ours | 52.81 ± 0.53 | 57.21 ± 0.43 | 63.25 ± 0.27 |
Table 5.
Comparison results of the proposed method with other similar advanced methods on the long-tailed PatternNet dataset with imbalance factors of 0.005, 0.01, and 0.02.
Table 5.
Comparison results of the proposed method with other similar advanced methods on the long-tailed PatternNet dataset with imbalance factors of 0.005, 0.01, and 0.02.
Methods | Imbalance Factor | Head Acc (%) | Middle Acc (%) | Tail Acc (%) | OA (%) |
---|
Softmax | | 98.14 ± 0.13 | 99.72 ± 0.04 | 79.20 ± 0.20 | 83.42 ± 0.14 |
SeeSaw [35] | | 98.04 ± 0.02 | 99.68 ± 0.11 | 79.56 ± 0.42 | 83.81 ± 0.27 |
Class Balanced | | 88.52 ± 0.98 | 97.66 ± 0.17 | 78.66 ± 0.99 | 81.94 ± 0.83 |
Effective Number [33] | | 89.58 ± 2.20 | 98.30 ± 0.21 | 80.37 ± 0.68 | 83.57 ± 0.49 |
Equalization [34] | 0.005 | 98.60 ± 0.33 | 99.68 ± 0.07 | 78.97 ± 0.37 | 83.20 ± 0.36 |
Equalizationv2 [43] | | 98.94 ± 0.47 | 99.60 ± 0.03 | 78.27 ± 0.22 | 82.51 ± 0.33 |
Focal [44] | | 97.71 ± 0.39 | 99.21 ± 0.04 | 78.02 ± 1.73 | 82.46 ± 1.26 |
GHM [45] | | 98.21 ± 0.48 | 99.52 ± 0.14 | 67.75 ± 1.19 | 74.34 ± 0.89 |
Ours | | 97.54 ± 0.26 | 99.52 ± 0.11 | 81.64 ± 0.37 | 84.80 ± 0.89 |
Softmax | | 98.60 ± 0.05 | 99.41 ± 0.07 | 85.11 ± 0.51 | 88.26 ± 0.24 |
SeeSaw [35] | | 98.30 ± 0.05 | 99.30 ± 0.11 | 85.92 ± 0.30 | 88.99 ± 0.21 |
Class Balanced | | 91.14 ± 0.30 | 97.31 ± 0.30 | 86.60 ± 0.94 | 88.45 ± 0.71 |
Effective Number [33] | | 93.23 ± 0.37 | 98.13 ± 1.03 | 86.10 ± 0.21 | 88.77 ± 0.33 |
Equalization [34] | 0.01 | 98.88 ± 0.26 | 99.16 ± 0.17 | 84.91 ± 0.38 | 88.26 ± 0.31 |
Equalizationv2 [43] | | 98.63 ± 0.23 | 99.35 ± 0.08 | 81.57 ± 0.44 | 85.70 ± 0.34 |
Focal [44] | | 97.90 ± 0.21 | 98.89 ± 0.49 | 82.72 ± 0.75 | 86.57 ± 0.53 |
GHM [45] | | 98.60 ± 0.19 | 99.26 ± 0.03 | 75.85 ± 0.42 | 81.40 ± 0.29 |
Ours | | 98.09 ± 0.30 | 99.10 ± 0.17 | 87.00 ± 0.36 | 89.76 ± 0.24 |
Softmax | | 98.46 ± 0.25 | 99.42 ± 0.12 | 88.55 ± 0.16 | 91.61 ± 0.15 |
SeeSaw [35] | | 97.70 ± 0.30 | 99.46 ± 0.04 | 88.96 ± 0.68 | 91.84 ± 0.49 |
Class Balanced | | 91.69 ± 0.27 | 98.02 ± 0.35 | 90.17 ± 0.63 | 91.94 ± 0.35 |
Effective Number [33] | | 93.43 ± 0.39 | 98.05 ± 0.27 | 90.75 ± 0.64 | 92.19 ± 0.42 |
Equalization [34] | 0.02 | 98.36 ± 0.13 | 99.44 ± 0.06 | 89.20 ± 0.56 | 91.69 ± 0.20 |
Equalizationv2 [43] | | 98.71 ± 0.21 | 99.36 ± 0.18 | 86.35 ± 0.33 | 90.05 ± 0.24 |
Focal [44] | | 97.85 ± 0.25 | 98.86 ± 0.45 | 87.27 ± 0.66 | 90.52 ± 0.50 |
GHM [45] | | 98.05 ± 0.05 | 99.20 ± 0.46 | 86.68 ± 0.49 | 90.13 ± 0.34 |
ours | | 97.39 ± 0.45 | 99.28 ± 0.09 | 91.05 ± 0.27 | 93.37 ± 0.30 |
Table 6.
Comparison results of the proposed method with other similar advanced methods on the long-tailed AID dataset with decay factors of 2, 3, and 4.
Table 6.
Comparison results of the proposed method with other similar advanced methods on the long-tailed AID dataset with decay factors of 2, 3, and 4.
Methods | Decay Factor | Head Acc (%) | Middle Acc (%) | Tail Acc (%) | OA (%) |
---|
Softmax | | 93.26 ± 0.80 | 88.81 ± 0.37 | 73.35 ± 0.72 | 83.68 ± 0.21 |
SeeSaw [35] | | 92.93 ± 0.19 | 88.45 ± 0.64 | 74.13 ± 0.26 | 84.35 ± 0.25 |
Class Balanced | | 89.70 ± 1.93 | 86.39 ± 1.26 | 77.33 ± 1.91 | 83.65 ± 0.53 |
Effective Number [33] | | 88.83 ± 0.95 | 86.61 ± 0.21 | 78.57 ± 1.18 | 83.91 ± 0.53 |
Equalization [34] | 2 | 93.59 ± 0.44 | 88.75 ± 0.26 | 72.90 ± 0.54 | 83.65 ± 0.33 |
Equalization2 [43] | | 92.94 ± 0.39 | 90.05 ± 1.92 | 69.97 ± 0.04 | 82.28 ± 0.26 |
Focal [44] | | 88.08 ± 0.48 | 81.96 ± 1.46 | 65.94 ± 2.83 | 76.57 ± 0.96 |
GHM [45] | | 87.42 ± 0.70 | 79.41 ± 0.86 | 57.85 ± 1.78 | 72.66 ± 1.08 |
Ours | | 91.90 ± 0.17 | 88.83 ± 0.21 | 77.73 ± 0.46 | 84.63 ± 0.09 |
Softmax | | 92.38 ± 0.36 | 88.66 ± 0.41 | 66.75 ± 0.52 | 76.35 ± 0.26 |
SeeSaw [35] | | 92.14 ± 0.60 | 87.66 ± 0.83 | 68.87 ± 0.52 | 77.36 ± 0.41 |
Class Balanced | | 85.71 ± 0.73 | 84.80 ± 0.75 | 69.93 ± 0.81 | 76.11 ± 0.60 |
Effective Number [33] | | 85.90 ± 1.10 | 85.20 ± 1.30 | 71.25 ± 0.58 | 77.01 ± 0.28 |
Equalization [34] | 3 | 92.75 ± 0.66 | 87.40 ± 0.65 | 64.82 ± 0.35 | 75.11 ± 0.30 |
Equalization2 [43] | | 93.33 ± 0.29 | 87.46 ± 0.66 | 61.06 ± 1.04 | 73.11 ± 0.62 |
Focal [44] | | 87.71 ± 1.31 | 83.33 ± 0.37 | 56.06 ± 2.58 | 68.09 ± 1.95 |
GHM [45] | | 85.89 ± 2.20 | 82.09 ± 2.31 | 48.73 ± 2.56 | 62.95 ± 2.05 |
Ours | | 90.54 ± 0.77 | 88.06 ± 1.09 | 71.29 ± 0.49 | 78.43 ± 0.39 |
Softmax | | 92.54 ± 0.91 | 87.50 ± 1.97 | 61.12 ± 0.77 | 71.92 ± 0.65 |
SeeSaw [35] | | 91.75 ± 0.30 | 87.38 ± 0.64 | 62.03 ± 0.20 | 72.23 ± 0.13 |
Class Balanced | | 77.44 ± 1.12 | 79.23 ± 0.60 | 62.80 ± 1.84 | 68.21 ± 1.55 |
Effective Number [33] | | 81.18 ± 0.65 | 80.82 ± 0.42 | 62.78 ± 0.51 | 69.51 ± 0.44 |
Equalization [34] | 4 | 92.68 ± 0.43 | 88.50 ± 0.93 | 60.72 ± 0.45 | 71.98 ± 0.25 |
Equalization2 [43] | | 92.33 ± 0.55 | 87.33 ± 1.03 | 58.26 ± 0.84 | 69.76 ± 0.42 |
Focal [44] | | 84.51 ± 2.09 | 81.56 ± 3.10 | 48.86 ± 4.28 | 61.52 ± 3.18 |
GHM [45] | | 85.89 ± 1.34 | 81.91 ± 1.37 | 41.57 ± 1.30 | 57.31 ± 1.00 |
ours | | 90.85 ± 0.93 | 87.57 ± 0.63 | 65.17 ± 0.86 | 74.25 ± 0.47 |
Table 7.
Ablation study using long-tailed AID datasets with decay factors of 2 and 3.
Table 7.
Ablation study using long-tailed AID datasets with decay factors of 2 and 3.
Method | CWM | E-Loss | DF | Head Acc (%) | Middle Acc (%) | Tail Acc (%) | OA (%) |
---|
CCSMLW | Ⅹ | Ⅹ | 2 | 92.99 ± 0.62 | 87.94 ± 0.89 | 72.89 ± 0.15 | 83.04 ± 0.30 |
√ | Ⅹ | 92.71 ± 0.42 | 87.57 ± 0.54 | 75.77 ± 0.85 | 83.93 ± 0.46 |
Ⅹ | √ | 93.33 ± 0.18 | 88.41 ± 0.33 | 74.75 ± 0.41 | 84.05 ± 0.20 |
√ | √ | 91.90 ± 0.17 | 88.83 ± 0.21 | 77.73 ± 0.46 | 84.63 ± 0.09 |
Ⅹ | Ⅹ | 3 | 91.47 ± 0.57 | 88.19 ± 0.28 | 65.73 ± 0.84 | 75.46 ± 0.71 |
√ | Ⅹ | 91.16 ± 0.24 | 87.36 ± 0.52 | 69.03 ± 0.28 | 77.22 ± 0.08 |
Ⅹ | √ | 92.61 ± 0.44 | 88.26 ± 0.83 | 67.62 ± 1.20 | 76.75 ± 0.31 |
√ | √ | 90.54 ± 0.77 | 88.06 ± 1.09 | 71.29 ± 0.49 | 78.43 ± 0.39 |
Table 8.
Parametric analysis experiments using the long-tailed SIRI-WHU dataset with the imbalance ratio of 0.02.
Table 8.
Parametric analysis experiments using the long-tailed SIRI-WHU dataset with the imbalance ratio of 0.02.
| | OA (%) | | | OA (%) | | | OA (%) |
0.2 | 0.2 | 52.36 ± 0.40 | 0.4 | 0.2 | 53.29 ± 0.45 | 0.6 | 0.2 | 54.21 ± 0.82 |
0.4 | 53.51 ± 1.35 | 0.4 | 54.36 ± 1.04 | 0.4 | 54.91 ± 0.72 |
0.6 | 51.79 ± 1.50 | 0.6 | 54.25 ± 0.71 | 0.6 | 54.09 ± 0.24 |
0.8 | 51.38 ± 0.59 | 0.8 | 52.08 ± 0.43 | 0.8 | 52.84 ± 0.35 |
1.0 | 50.02 ± 0.35 | 1.0 | 51.57 ± 0.45 | 1.0 | 52.93 ± 0.21 |
| | OA (%) | | | OA (%) | | | OA (%) |
0.8 | 0.2 | 52.66 ± 0.94 | 1.0 | 0.2 | 50.69 ± 1.48 | 1.2 | 0.2 | 50.38 ± 1.47 |
0.4 | 53.37 ± 0.62 | 0.4 | 51.31 ± 1.22 | 0.4 | 50.15 ± 1.22 |
0.6 | 52.58 ± 0.97 | 0.6 | 52.21 ± 0.50 | 0.6 | 49.45 ± 0.65 |
0.8 | 52.78 ± 0.66 | 0.8 | 51.62 ± 0.94 | 0.8 | 49.30 ± 0.96 |
1.0 | 52.19 ± 0.82 | 1.0 | 50.73 ± 0.50 | 1.0 | 49.14 ± 1.32 |