1. Introduction
Agriculture serves as the cornerstone of human survival, with food being a critical resource for sustenance. Population growth and climate change have intensified the demand for food security [
1]. Crop production is a key component in ensuring a sufficient and high-quality food supply. The spatial and temporal distribution of crops reflects resource utilization in agricultural production, and timely, accurate crop mapping plays a crucial role in crop growth monitoring, yield prediction, and agricultural resource management. These mapping efforts are of significant importance to the structural adjustment of the agricultural industry and the formulation of food security policies [
2]. Earth Observation (EO) data, acquired via remote sensing platforms, provide comprehensive and precise information regarding the Earth’s surface. EO data, as a reliable source for agricultural data, have been increasingly utilized in land use classification and crop monitoring across the globe in recent years [
3,
4,
5,
6].
Traditional crop mapping methods primarily rely on supervised classification of remote sensing imagery. Ground truth samples are collected through field surveys during the current season and subsequently used for large-scale crop mapping. These methods are typically data-driven and require extensive labeled datasets, which incur high costs associated with in situ measurements. The accuracy of the classification results largely depends on the quantity and reliability of these samples [
7,
8]. Since crop types are not as easily interpreted from remote sensing imagery as other land use categories, collecting accurate ground data is essential for crop mapping. The acquisition of ground truth samples and the creation of corresponding datasets are pivotal for monitoring crop cultivation through EO data [
8,
9]. For example, in several European countries, agricultural policies such as the Common Agricultural Policy (CAP) mandate farmers to report the crops they plant. This requirement has led to the establishment and validation of several datasets, including EuroCrops [
10], ZueriCrop [
11], and BreizhCrops [
12]. However, the collection of ground labels often involves substantial time and labor costs, posing a significant challenge. In regions that lack formal reporting policies or are underdeveloped, the absence of large-scale real data collection exacerbates the difficulties in crop mapping. While these regions may possess favorable agricultural conditions and development potential, inadequate data support prevents the full realization of their agricultural potential [
13,
14], thereby impeding the formulation and execution of effective agricultural policies.
Therefore, many studies focus on addressing the challenge of limited ground truth samples in crop type classification tasks, such as pseudo-label generation based on prior knowledge and transfer learning [
15,
16,
17,
18]. The former aims to expand the ground truth dataset [
19,
20,
21], while the latter focuses on transferring and reusing cross-domain knowledge. Unlike traditional supervised learning methods that rely on local samples, the core of transfer learning lies in transferring knowledge from the source domain to the target domain to tackle the challenge of data scarcity. Transfer learning methods can be broadly classified into instance-based transfer learning, feature-based transfer learning, and model-based transfer learning [
22,
23]. In recent years, the application of transfer learning to crop classification has expanded [
24,
25,
26]. Compared to traditional crop mapping methods, transfer learning-based crop mapping studies can be categorized into two main approaches: temporal transfer and geographic spatial transfer. Temporal transfer primarily addresses early-season or current-season crop mapping [
27]. For example, Lei proposed a method using transfer learning to extract information from historical products, enabling cross-year crop mapping without in situ data, thus facilitating early crop mapping and rotation analysis [
28]. Geographic spatial transfer, on the other hand, focuses on transferring knowledge from source domain datasets to support crop mapping in regions with limited sample data [
29,
30]. For instance, Hao et al. applied Cropland Data Layer (CDL) data to satellite imagery to generate training samples, which were then used in machine learning classifiers, successfully transferring the classification model to three different regions, achieving overall accuracies of 97.79%, 86.45%, and 94.86%, respectively [
17]. Much of the literature focuses on evaluating model transferability [
29,
31,
32] or utilizing model transfer to extend mapping coverage [
33]. These studies often represent simple transfer applications without localized adaptation. To better align source domain datasets with the target domain, Xun et al. combined target domain datasets with the TrAdaBoost algorithm to apply the CDL dataset for cotton mapping in Xinjiang. The comparison between the mapped cotton area obtained through the transfer algorithm and statistical data yielded an R
2 of 0.64, demonstrating the effectiveness of the transfer method [
34].
Previous studies have shown that incorporating target domain datasets into transfer learning can significantly improve the efficiency of transferring knowledge from the source domain to the target domain [
24,
34,
35]. However, publicly available crop-type datasets are mostly focused on regions in Europe and North America, or other areas with well-developed agricultural systems, such as CDL, BreizhCrops, and EuroCrops. These high-quality datasets are renowned for their high availability and annotation accuracy, making them central to many crop classification and transfer learning studies [
17,
31,
35]. Furthermore, these studies often focus on regions with intensive farming or well-organized farmland. Research on areas with complex, fragmented landforms and diverse crop systems remains limited. Against this backdrop, this study selects the Hexi Corridor as the target domain. This region is characterized by a diverse range of crop types, including maize, wheat, canola, and other crops, alongside forage plants such as alfalfa and oats due to the prominence of livestock farming. The Hexi Corridor plays a vital agricultural role in northwest China. However, due to the high costs associated with sample labeling, the availability of ground truth samples is relatively limited. Moreover, agricultural production in this region relies heavily on river irrigation, with crop growth significantly influenced by water resources and climatic conditions [
36,
37,
38], which further complicates the classification task. Therefore, conducting transfer learning research for crop classification in the Hexi Corridor holds substantial research value. The region’s diverse crop distribution and limited labeled data present promising prospects for applying transfer learning in this context.
This study aims to evaluate the feasibility and effectiveness of transfer learning in addressing the challenges of crop mapping in regions with limited labeled data. The Hexi Corridor is designated as the target domain, while North America at a similar latitude is selected as the source domain. Focusing on key crop types in the Hexi Corridor (maize, alfalfa, oats, rapeseed, and spring wheat), dense time series for the year 2022 are constructed using multi-source satellite remote sensing data from Sentinel-1, Sentinel-2, and Landsat 8. To perform the crop mapping task, Random Forest (RF), Extreme Gradient Boosting (XGBoost), and TrAdaBoost are employed. By leveraging the knowledge from the source domain, this study explores the potential of transfer learning to enhance crop mapping performance in data-scarce environments. The main objectives of this study are as follows: (1) to assess the feasibility of directly transferring models based on CDL data, (2) to evaluate the effectiveness of incorporating target domain data for crop mapping in the Hexi Corridor, and (3) to generate a crop type distribution map for the Hexi Corridor.
The key contributions of this study are summarized as follows: (1) Knowledge is transferred from a data-rich source domain to a data-scarce target domain to support crop type mapping in the Hexi Corridor, providing a methodological reference for crop classification in regions with limited labeled data. (2) Based on Random Forest, XGBoost, and TrAdaBoost, we implement a model adaptation strategy that fine-tunes source domain models using target domain samples, resulting in improved classification accuracy compared to models trained solely on target domain data. (3) Multi-source satellite remote sensing data (Sentinel-1, Sentinel-2, and Landsat 8) are integrated to construct dense time series, which enhances the temporal representation of crop growth and improves the generalization ability of the classification models across domains.
3. Results
3.1. Domain Difference
The p-values calculated using the K-S test are all less than 0.05. This statistically indicates a significant difference between the two domains. Such substantial differences may stem from variations in farming practices, climatic characteristics, and field management methods between the two regions.
As shown in
Figure 4, which compares the NDVI curves between the source and target domains, maize exhibits a high degree of phenological similarity across the two domains. Specifically, during the greening period (weeks 20–25), both the source and target domains show a sharp increase in NDVI, with the NDVI reaching its peak around week 30. While other crops demonstrate similar phenological trends, some temporal shifts are observed. To further evaluate the phenological similarity between the two domains, we employed DTW to calculate the distance between the NDVI time series of the same crop in the source and target domains.
Table 3 summarizes the DTW-based distance results. The results show that there are acceptable morphological differences in the NDVI time series curves of the same crop between the source and target domains, indicating that transfer learning is suitable for adjustment and adaptation.
The observed similarity in trends between crops in the source and target domains, coupled with differences in specific time points, can be attributed to the fact that crops at similar latitudes tend to exhibit similar phenological patterns. However, due to variations in climate, elevation, and planting environments between the two domains, phenological shifts such as earlier or later growth stages may occur even for the same crop species. For instance, the southern part of the Hexi Corridor, located near the Qilian Mountains, has a higher elevation and lower temperatures, resulting in insufficient thermal accumulation and a shorter growing season for crops planted in that region. In summary, while there are both similarities and statistically significant differences between the two domains, this provides strong support for the applicability of transfer learning in crop identification across these two regions.
3.2. Comparison of the Performance of Different Transfer Strategies
Table 4 presents the performance of the respective algorithms in Experiments 1, 2, and 3 on the target domain test set. In Experiment 1, among the original models, XGBoost
naive achieved the best performance with an accuracy of 0.7833 and a Kappa coefficient of 60.09%, followed by RF
naive, while DT
naive showed the poorest performance. Experiment 2 demonstrated that RF
transfer was the most effective transfer learning model in this study, reaching an accuracy of 92.26% and a Kappa coefficient of 0.8723. It was followed by TrAdaBoost_RF, XGBoost
transfer, and TrAdaBoost_DT, with all transfer models achieving accuracies above 90%. Experiment 3 served as a comparison for transfer learning, where models were trained solely on target domain data. The RF
local model achieved an accuracy of 89.90% and a Kappa coefficient of 0.8399. In Experiment 2, all models adjusted using target domain data outperformed the local model, validating the effectiveness of transfer learning.
The weighted F1-scores of specific crops in Experiment 2 are presented in
Table 5. In terms of the specific crop F1-scores, XGBoost
transfer achieves the highest F1-score for maize, while RF
transfer performs the best in F1-score for the other crops.
The performance of the models in Experiment 2 and Experiment 3 is shown in
Figure 5. The vertical axis represents the overall accuracy, while the horizontal axis indicates the percentage of target domain data accumulated for training. It can be observed that the transfer learning strategy employed in Experiment 2 effectively improved the classification accuracy. As the amount of incorporated target domain data increased, the model accuracies exhibited a clear upward trend.
Specifically, the overall accuracy of RFtransfer increased from 87.93% to 92.26%, XGBoosttransfer improved from 85.43% to 90.29%, Tradaboost_RF rose from 84.56% to 91.86%, and Tradaboost_DT increased from 77.30% to 90.94%. Similarly, RFlocal and DTlocal also demonstrated a gradual improvement in accuracy with the addition of target domain data, with RFlocal improving from 82.02% to 89.90% and DTlocal increasing from 62.34% to 76.25%. However, their performance remained lower than that of the models utilizing the transfer learning strategy in Experiment 2.
In the classification performance for each crop,
Figure 6 shows the performance of the original models as well as the models that incorporate target domain data on the target domain test set. Maize and alfalfa exhibit relatively high F1-scores in the original models, with values of 0.8743 and 0.7618 (RF
naive), 0.8897 and 0.8523 (XGboost
naive), and 0.749 and 0.6175 (DT
naive), respectively, while other crops show relatively lower F1-scores in the original models. As target domain data are incorporated, there is an overall increase in the F1-scores for various crops, with particularly noticeable improvements for oats, canola, and spring wheat. By the final stage, after incorporating the entire target domain training data, the F1-scores for different crops reached above 0.93 for maize and alfalfa, over 0.85 for oats and spring wheat, and the lowest for canola at 0.61.
The confusion matrix reveals the specific misclassification patterns of crop categories, as shown in
Figure 7. The rows from the first to the fourth represent the four transfer learning algorithms, and from left to right, the matrices display the confusion results when 0%, 10%, 50%, and 100% of the target domain data are used for transfer. These represent the beginning, middle, and end stages of incorporating target domain data. From
Figure 7, it can be observed that when only source domain data are used for training, maize and alfalfa show higher accuracy, while other crops are poorly classified, resulting in significant misclassification. When 10% of the target domain data are added for retraining, the misclassification between oats and canola is resolved, and the classification accuracy for oats begins to improve. However, canola and spring wheat are still mainly misclassified as major crops (such as alfalfa and maize). As more target domain data are gradually incorporated (50–100%), this misclassification is notably corrected, leading to improvements in the classification accuracy of both canola and spring wheat.
3.3. Feature Importance Ranking of the Optimal Transfer Model
To investigate the contribution of multi-source data integration to model stability, we conducted a feature importance ranking based on the input features of the optimal transfer learning algorithm, RF
transfer. Among the top 200 most important features, 191 originated from Sentinel-2, 7 from Landsat-8, and only 2 from Sentinel-1. Additionally, the top 30 features and their importance rankings are illustrated in
Figure 8. The naming convention for the features on the
y-axis in the figure follows the format “SatelliteData_Index/Band_TimeStep”. For example, the top three features in terms of importance are VGCI, kNDVI, and NDVI, all calculated from Sentinel-2 satellite data as maximum composites for Week 17.
These results indicate that Sentinel-2 data contributed most significantly to model performance in the crop classification task under the transfer learning framework. Benefiting from its high spectral and temporal resolution, the Sentinel-2 time series more effectively captured crop spectral responses and phenological dynamics, thus playing a dominant role in the model. In contrast, the Landsat-8 time series, with its lower temporal resolution, offered less timely features and contributed less to the classification results. The backscatter information provided by Sentinel-1 radar data showed limited sensitivity in distinguishing between crop types, resulting in a relatively low feature importance in the model.
3.4. Crops Cultivated Areas Identification in Hexi
Since RFtransfer exhibited the best performance among all transfer learning algorithms, we selected the RFtransfer model adjusted with the full target domain data to generate the crop type map for the Hexi Corridor in 2022. It is worth noting that the trained model was applied within the cropland mask area of the Hexi Corridor.
Due to the limited number of test set samples, the accuracy reported above may not fully reflect the classification performance across the entire study area. Therefore, we further evaluated the accuracy of crop mapping by incorporating statistical data and datasets such as the 2001–2023 China Corn Planting Distribution Dataset (CCD-Maize) [
63]. To estimate the mapped crop areas, we calculated the number of pixels classified as each crop type in each city within the Hexi region, then multiplied it by the pixel size (10 m × 10 m) to obtain the estimated crop area. The resulting mapped areas for maize, alfalfa, oats, canola, and spring wheat in the Hexi region were 55.19, 33.89, 5.84, 37.53, and 11.76 million hectares (MHs), respectively.
Additionally, we referenced statistical yearbooks for reported maize and wheat planting areas, aggregating them at the city level in the Hexi region for comparison, as shown in
Figure 9. The mapped maize planting area exhibited high consistency with the statistical data, whereas discrepancies were observed in the wheat planting areas in Jiuquan and Jinchang. This difference may be attributed to the fact that statistical data for wheat in this region include both spring wheat and winter wheat, leading to variations in the comparison.
Additionally, we compared the classification results of this study with the 2001–2023 China Crop Dataset–Maize (CCD-Maize) [
64] (
Figure 10). Compared to these datasets, our study produced higher-resolution results with more complete field boundaries at a finer spatial scale. In the Hexi region, maize, alfalfa, and spring wheat are distributed across all five cities, with maize having the largest planting area, making it the dominant crop in the region. In contrast, oats and canola are primarily concentrated in higher-altitude areas.
4. Discussion
4.1. The Advantages of Transfer Learning
In classification tasks based on machine learning and remote sensing imagery, the focus is often on traditional machine learning and deep learning methods, which typically require a sufficient quantity and quality of training samples to ensure adequate classification accuracy. However, sample collection is a labor-intensive process, and sample scarcity is a common issue. Transfer learning has emerged as a promising strategy to address these challenges. Many studies have employed transfer learning methods such as fine-tuning-based transfer learning (FT), few-shot learning (FSL), multi-task learning (MSL), and unsupervised domain adaptation (UDA) to tackle the problem of sample scarcity [
26]. For crop mapping across geographic regions, challenges remain in terms of model generalization due to differences in geographic space and crop phenology, making large-scale, cross-regional crop mapping difficult. Prior knowledge and reference data can play a crucial role in transfer learning for crop mapping [
64,
65]. Transfer learning can enable cross-temporal and cross-spatial crop mapping by transferring samples or models, applying existing sample datasets to other regions, and providing solutions for regions with limited sample availability [
66].
Despite statistical and climatic differences between regions, transferring crop classification model knowledge across geographic areas is still feasible. In the context of limited and imbalanced target domain sample data, this study adopted a transfer learning strategy. Specifically, by localizing the model with labeled data from the target domain on top of a source domain model, the use of the transfer learning strategy significantly reduced the cost of label acquisition in the target domain. The results indicate an improvement in classification accuracy for the adjusted model, demonstrating that even with discrepancies in crop sample data phenology, distribution, and quantity between the source and target domains, transfer learning can still transfer knowledge from the source domain to the target domain. In this study, using data from the Hexi Corridor improved model accuracy, suggesting that the model can learn unique crop features from the target domain, thereby enhancing model robustness and generalization. This provides a valuable reference for crop type mapping in the Hexi Corridor.
4.2. The Effectiveness of the Transfer Learning Strategy in Hexi Corridor
In this study, the classification performance of staple crops was significantly better than that of minor crops. Even without retraining the model with target domain data, the model trained only on source domain data performed well in classifying staple crops. The classification accuracy of maize and alfalfa reached 0.89 and 0.85, respectively, in the XGboost
transfer model, indicating that the feature distribution of staple crops in the target domain is highly similar to that in the source domain, which enhances the model’s generalization ability. This further suggests that even when data from the target region is scarce or unavailable, crop mapping in the local area can still benefit from models trained on source domain data. However, the classification performance of minor crops, such as oats, canola, and spring wheat, is relatively poor, with lower classification accuracy. This may be related to factors such as differences in crop phenological characteristics and distribution shifts between the source and target domains. In particular, the Hexi Corridor is located on the northeastern edge of the Qinghai–Tibet Plateau, an area with some high-altitude regions. Due to its unique geographical location, crop phenology in this area is delayed. Additionally, the relatively small sample size of minor crops in the target domain limits the model’s learning ability for these categories. The imbalanced distribution of sample classes causes the classification model to overfit the dominant classes’ feature distributions, which is common in imbalanced data. This finding is similar to the results of Arias, where crop mapping using imbalanced datasets showed better classification performance for the dominant sample types [
67].
At the same time, Experimental results indicate that as the number of labeled target domain samples gradually increases during retraining, the model’s classification performance improves, and the more target domain data available, the better the classification results. This trend is consistent with the findings of Miloš Pandžić, which demonstrate that transfer learning methods incorporating target domain data are versatile and effective in crop classification tasks [
13]. Compared to large-scale crop-type products, this study produced crop-type distribution maps with a finer scale, covering a smaller area but with higher accuracy, better spatial resolution, and multiple crop types. This provides an actionable reference for regions with high fragmentation of cultivated land, complex crop planting types, and a lack of sufficient samples, making accurate crop mapping in sample-scarce areas feasible.
4.3. Comparison of Transfer Learning Algorithms
Aiming to train an initial model with strong generalization capability, the source domain data used in this study was widely distributed and balanced across different crop types, incorporating samples from various climatic conditions and phenological stages. Based on this, we applied transfer learning using the RF, XGBoost, and TrAdaBoost algorithms. After retraining with target domain data, the RFtransfer model achieved the highest classification accuracy, followed by TrAdaBoost_RF, with all transfer models reaching accuracies above 0.90.
However, under these conditions, TrAdaBoost did not demonstrate the best performance. One possible reason lies in its sensitivity to class imbalance and the limited size of target domain samples. Its effectiveness relies heavily on several hyperparameters, such as the number of iterations, initial weights, and the performance of weak classifiers. Without careful tuning, TrAdaBoost may suffer from poor convergence. In contrast, RF and XGBoost exhibit greater robustness when dealing with imbalanced samples and domain distribution shifts. They offer more stable performance and are easier to deploy, making them well suited for practical applications in sample-scarce scenarios. In addition, the wide spatial distribution of the source domain samples (e.g., including both southern and northern U.S. regions) may have introduced considerable intra-class phenological variability. Such internal heterogeneity can make it difficult for TrAdaBoost to accurately distinguish between source samples that are similar or dissimilar to the target domain during early training stages. This may affect its weight adjustment process and weaken its theoretical advantage. In contrast, the RF and XGBoost models, trained on the combined source and target datasets, are better able to learn consistent patterns from the overall data and thus exhibit stronger generalization capabilities.
These findings suggest that the optimal application scenarios for different transfer learning algorithms may vary depending on data characteristics. In future work, selecting source domain samples with phenological profiles more closely aligned with those of the target domain may help reduce sample interference and improve the performance of TrAdaBoost.
4.4. Uncertainty and Potential Refinement
Since the performance of classification models largely depends on the training and test samples, there are inherent differences in various studies not only in the categories, data volumes, and distributions of training and test datasets, but also in the choice of crop types, geographical location of transfer, selected evaluation metrics, and other related parameters. Therefore, systematically comparing the results with other crop classification studies is challenging [
13,
35]. Facing the challenges of evaluation, we have assessed the performance using an independent test set and by comparing it with statistical data. In this study, due to the imbalanced data distribution in the target domain, traditional local training methods are not applicable. This is because the sample sizes of some crop categories in the target domain are too small to support the training of local classifiers. Therefore, this study does not include a comparison with local data training in the target domain. From the perspective of comparison with local training, while transfer learning demonstrates significant capabilities, it should not be applied unconditionally. For regions that lack or have poor quality in situ data, transfer learning can effectively solve the challenges of crop mapping in those areas. However, when a certain amount of high-quality target domain data is available, transfer learning may not necessarily outperform traditional local training-based crop identification methods [
17]. Some studies have pointed out that traditional local training methods require hyperparameter optimization before model training, while transfer learning only requires model retraining, which may lead to substantial differences in processing time [
13]. Through this study, we explored the feasibility of using transfer learning strategies for crop type mapping in the Hexi Corridor. The research shows that transfer learning can effectively transfer knowledge learned from the source domain to the target domain, thereby reducing the cost of labeling target domain samples to some extent. Overall, transfer learning provides a solution to the problem of insufficient target domain data while also reducing the time required for training the model from scratch, thus improving mapping efficiency. This provides valuable references for future research on crop classification in regions with limited data.
Considering the presence of interannual climatic variability and other unstable factors, it is also crucial to evaluate the adaptability and robustness of transfer learning models across different years. To this end, the RF
transfer model trained exclusively on 2022 data was independently validated using the 2023 dataset, in order to assess its crop type prediction performance under varying climatic conditions. The detailed validation results are provided in the
Supplementary Materials. Although this study successfully implemented crop mapping in the Hexi Corridor using a transfer learning strategy, reducing the cost of local sample annotation, certain uncertainties and limitations remain, providing deeper exploration directions for future research. First, transferability estimation remains an important area of investigation. Transferability estimation is used to evaluate the compatibility between datasets and models to ensure the success of transfer learning tasks [
68]. The distributional differences between the source and target domains pose challenges to transfer learning, especially in instance-based transfer. In a study by Wang, growing degree days (GDD) were used to quantify regional differences, and the results showed a correlation between GDD differences and model transfer accuracy [
29]. Therefore, before applying crop classification using transfer learning to other regions, it is necessary to conduct transferability estimation. Second, the expansion of methods and improvements in transfer learning strategies are also key future directions. This study mainly employed shallow machine learning methods, but future research could explore alternative transfer learning strategies, such as deep learning-based transfer learning, including techniques like model parameter freezing and shared feature extraction layers. Unlike machine learning methods that rely on manually designed features, deep learning approaches offer stronger feature representation capabilities, enabling automatic extraction of spatiotemporal features and reducing dependence on feature engineering. Deep transfer learning can further improve classification performance by leveraging large-scale pretraining and fine-tuning with limited target domain data [
7]. Many studies have achieved cross-regional model transfer by employing pretraining on source domain data followed by fine-tuning with target domain data [
24,
30,
35,
69], which provides valuable insights into cross-regional crop type recognition using transfer learning. Additionally, spatial resolution limitations represent a key constraint in this study. In regions with highly fragmented farmland, the 10 m resolution Sentinel-2 imagery used in this study may not be sufficient to capture the fine boundaries of small land parcels. High-resolution imagery can better delineate small field boundaries and capture subtle land cover variations, especially in regions with diverse and complex crop types. Future studies could incorporate higher-resolution remote sensing data, such as PlanetScope imagery or WorldView imagery, to improve the spatial resolution of classification and enhance boundary detection accuracy.
5. Conclusions
In this study, high-density time series from Sentinel-2, Sentinel-1, and Landsat-8 imagery were used, and based on RF, XGBoost, and TrAdaBoost algorithms, the effectiveness of instance-based transfer strategies for crop mapping tasks was explored and evaluated in the Hexi Corridor. The results show that under conditions with abundant source domain data and limited target domain data, the transfer strategy performed well, especially for staple crops with large planting areas in the target domain. Even without using target domain data for training, the classification accuracy achieved was quite substantial, with the overall classification accuracy reaching up to 73.88%. The optimal classification accuracy for maize and alfalfa was 88.97% and 85.23%, respectively. During the process of gradually incorporating target domain data, the total accuracy for all models ranged from 0.77 to 0.92, and the F1-score ranged from 0.76 to 0.92. As the proportion of target domain data increased, model accuracy improved. After all target domain data were added, the classification accuracies for RFtransfer, XGBoosttransfer, TrAdaBoost_RF, and TrAdaBoost_DT were 92.26%, 90.29%, 91.86%, and 90.94%, respectively. The best transfer model was RFtransfer. These results demonstrate the effectiveness of the transfer strategy and provide a reference for crop mapping in areas with limited samples.