Next Article in Journal
Improving GNSS-IR Sea Surface Height Accuracy Based on a New Ionospheric Stratified Elevation Angle Correction Model
Previous Article in Journal
A Texture-Considerate Convolutional Neural Network Approach for Color Consistency in Remote Sensing Imagery
 
 
Article
Peer-Review Record

Spatial Estimation of Soil Organic Carbon Content Utilizing PlanetScope, Sentinel-2, and Sentinel-1 Data

Remote Sens. 2024, 16(17), 3268; https://doi.org/10.3390/rs16173268
by Ziyu Wang 1, Wei Wu 2 and Hongbin Liu 1,*
Reviewer 1: Anonymous
Reviewer 2: Anonymous
Remote Sens. 2024, 16(17), 3268; https://doi.org/10.3390/rs16173268
Submission received: 19 July 2024 / Revised: 30 August 2024 / Accepted: 30 August 2024 / Published: 3 September 2024

Round 1

Reviewer 1 Report

Comments and Suggestions for Authors

The paper titled "Spatial Estimation of Soil Organic Carbon Content Utilizing PlanetScope, Sentinel-2, and Sentinel-1 data" aims to estimate soil organic carbon (SOC) content in the Wuling Mountain region of Southwest China. By leveraging spectral, radar, and topographic variables from multi-temporal optical satellite images, synthetic aperture radar images, and digital elevation models, the study employs the eXtreme Gradient Boosting algorithm to model SOC content at various spatial resolutions. The research found that medium resolution (20m) models generally provided better results, with combined datasets yielding the most accurate predictions.

 

I recommend the authors compare their accuracy with existing methods of  estimation of soil organic carbon content by drawing a table listing the indicators (e.g.,  Coefficient of Determination (R2), Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and Akaike Information Criterion (AIC)). 

 

I recommend the authors using existing methods (other than XGBoost) to redo their work and validate if their conclusions are still supported.

 

Figure 1 Add basic cartographic elements: compass, latitude and longitude.

 

Discussion section: 

 

Add detailed figures for states such as "higher accuracy" (Line 610), "accuracy has shown a decline" (Line 625),  "improve the prediction accuracy" (Line 619).

 

3) should be a paragraph. (Line 615)

 

All the websites should include "access on..."

Author Response

The paper titled "Spatial Estimation of Soil Organic Carbon Content Utilizing PlanetScope, Sentinel-2, and Sentinel-1 data" aims to estimate soil organic carbon (SOC) content in the Wuling Mountain region of Southwest China. By leveraging spectral, radar, and topographic variables from multi-temporal optical satellite images, synthetic aperture radar images, and digital elevation models, the study employs the eXtreme Gradient Boosting algorithm to model SOC content at various spatial resolutions. The research found that medium resolution (20m) models generally provided better results, with combined datasets yielding the most accurate predictions.

Response: Thank you very much for your thorough review and constructive feedback on our manuscript. We have incorporated your suggestions by conducting new model validations and revising the content accordingly. These modifications have not only enhanced the rigor of our research but also improved the reliability of our results. We believe that the revised version now meets the high standards of Remote Sensing and we hope these improvements will be satisfactory to you. Once again, we greatly appreciate your valuable time and professional guidance.

 

The point-to-point responses to the comments were given below.

Comments 1: I recommend the authors compare their accuracy with existing methods of estimation of soil organic carbon content by drawing a table listing the indicators (e.g., Coefficient of Determination (R2), Mean Absolute Error (MAE), Root Mean Square Error (RMSE), and Akaike Information Criterion (AIC)). I recommend the authors using existing methods (other than XGBoost) to redo their work and validate if their conclusions are still supported.

Response 1: Yes, we have compared the performance of XGBoost with Random Forest (RF), Gradient Boosting Decision Tree (GBDT), and Support Vector Regression (SVR). The results of RF, GBT and SVR are shown in Tables 1-3. According the R2, MAE, RMSE, and AIC, XGBoost outperformed other models. For paddy field, XGBoost’s Model F performs the best ((R2=0.699, MAE=0.114%, RMSE=0.148%, AIC=508.820) followed by RF (Model F: R2=0.533, MAE=0.146%, RMSE=0.185%, AIC=524.246), GBDT (Model F: R2=0.639, MAE=0.131%, RMSE=0.162%, AIC=515.166), and SVR (Model D: R² = 0.240, MAE = 0.163% , RMSE = 0.234% , AIC = 333.304). For dry land, XGBoost performs the best (Model E: R2=0.673, MAE=0.107%, RMSE=0.135% ,AIC=453.246) followed by RF (Model E: R2=0.358 , MAE=0.158%,RMSE=0.189%, AIC=475.479), GBDT (Model E: R2=0.493, MAE=0.136%,RMSE=0.168%, AIC=467.689), and SVR (Model C: R2=0.118, MAE=0.163%, RMSE=0.206%, AIC=272.967). For the entire area, XGBoost performs the best (Model E: R2=0.339, MAE=0.268%, RMSE=0.349%, AIC=610.782) followed by RF (Model E: R2=0.243, MAE=0.259%, RMSE=0.375%, AIC=624.469), GBDT (Model E: R2=0.213, MAE=0.280%, RMSE=0.381%, AIC=622.491), and SVR (Model D: R² = 0.069, MAE = 0.311% , RMSE = 0.415% , AIC = 473.784).

 

Table 1 Performance results of Random Forest in predicting SOC based on different combinations of environmental variables at different modeling resolutions. The most accurate results are shown in bold.

Model

Land Use Type

 

Paddy field

Dry land

Total area

 

R2

MAE (%)

RMSE (%)

AIC

R2

MAE (%)

RMSE (%)

AIC

R2

MAE (%)

RMSE (%)

AIC

 

Model A

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.401

0.155

0.209

228.950

0.069

0.178

0.217

228.515

0.084

0.295

0.411

376.667

 

 

20m

0.419

0.160

0.206

227.892

0.327

0.155

0.194

211.006

0.128

0.292

0.401

373.374

 

 

30m

0.398

0.162

0.210

229.142

0.222

0.145

0.192

222.914

0.0942

0.297

0.409

375.912

 

 

80m

0.328

0.188

0.222

232.966

0.087

0.161

0.215

227.877

-0.038

0.299

0.366

373.813

 

Model B

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.380

0.177

0.213

278.163

0.182

0.159

0.203

272.250

0.081

0.289

0.412

424.921

 

 

20m

0.437

0.161

0.203

274.811

0.304

0.151

0.197

270.132

0.065

0.297

0.416

426.011

 

 

30m

0.418

0.162

0.206

275.952

0.109

0.162

0.212

275.097

0.065

0.311

0.416

426.027

 

 

80m

0.340

0.184

0.220

280.358

0.063

0.158

0.212

274.953

-0.025

0.299

0.363

420.887

 

Model C

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

-0.144

0.210

0.289

299.608

0.128

0.168

0.210

274.364

0.132

0.299

0.400

421.076

 

 

20m

0.414

0.150

0.207

276.199

0.199

0.171

0.211

274.756

0.202

0.277

0.38

415.397

 

 

30m

0.191

0.181

0.243

287.472

0.173

0.158

0.204

272.611

0.138

0.296

0.399

420.615

 

 

80m

0.079

0.191

0.260

292.022

0.211

0.149

0.194

269.279

-0.075

0.302

0.372

424.295

 

Model D

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

-0.135

0.221

0.288

347.337

0.255

0.146

0.194

317.169

0.098

0.300

0.408

471.644

 

 

20m

0.310

0.161

0.225

329.921

0.231

0.165

0.207

321.428

0.115

0.292

0.404

470.380

 

 

30m

-0.041

0.206

0.276

344.294

0.186

0.155

0.203

321.102

0.104

0.303

0.407

471.154

 

 

80m

0.095

0.187

0.257

339.418

0.054

0.156

0.213

323.274

-0.081

0.309

0.373

472.681

 

Model E

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.336

0.159

0.220

488.577

0.139

0.176

0.209

481.958

0.094

0.292

0.409

631.936

 

 

20m

0.470

0.145

0.197

480.662

0.358

0.158

0.189

475.479

0.243

0.259

0.375

624.469

 

 

30m

0.366

0.167

0.215

486.939

0.207

0.163

0.200

479.231

0.132

0.284

0.401

629.108

 

 

80m

0.217

0.196

0.239

494.337

0.199

0.145

0.196

477.792

0.005

0.303

0.426

634.808

 

Model F

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.309

0.175

0.225

537.983

0.212

0.163

0.199

527.009

0106

0.295

0.407

679.089

 

 

20m

0.533

0.146

0.185

524.246

0.316

0.157

0.195

525.568

0.147

0.280

0.371

672.369

 

 

30m

0.244

0.179

0.235

541.129

0.174

0.160

0.204

528.582

0.112

0.284

0.399

676.240

 

 

80m

0.227

0.186

0.238

541.908

0.159

0.152

0.201

527.370

-0.026

0.295

0.363

676.6635

 

                                 

Notes: Model A, PlanetScope+DEM; Model B, PlanetScope+Sentinel-1+DEM; Model C, Sentinel-2+DEM; Model D, Sentinel-2+Sentinel-1++DEM; Model E, PlanetScope+Sentinel-2+DEM; Model F, PlanetScope+Sentinel-2+Sentinel-1+DEM.

 

Table 2 Performance results of Gradient Boosting Decision Tree predicting SOC based on different combinations of environmental variables at different modeling resolutions. The most accurate results are shown in bold.

Model

Land Use Type

 

Paddy field

Dry land

Total area

 

R2

MAE (%)

RMSE (%)

AIC

R2

MAE (%)

RMSE (%)

AIC

R2

MAE (%)

RMSE (%)

AIC

 

Model A

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.554

0.137

0.181

218.616

0.149

0.160

0.203

223.969

0.038

0.309

0.422

379.939

 

 

20m

0.510

0.145

0.189

221.939

0.332

0.163

0.193

220.789

0.152

0.285

0.396

371.479

 

 

30m

0.449

0.161

0.201

226.007

0.203

0.148

0.201

223.400

0.059

0.299

0.417

378.432

 

 

80m

0.237

0.204

0.237

237.680

0.077

0.156

0.210

226.448

-0.079

0.300

0.373

376.598

 

Model B

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.500

0.153

0.191

270.631

0.110

0.162

0.212

275.058

0.083

0.295

0.412

424.768

 

 

20m

0.445

0.158

0.201

274.270

0.374

0.141

0.186

266.639

0.073

0.296

0.414

425.463

 

 

30m

0.311

0.169

0.225

281.864

0.092

0.149

0.214

275.718

0.046

0.305

0.420

427.405

 

 

80m

-0.007

0.198

0.271

295.134

-0.008

0.170

0.220

277.364

-0.085

0.301

0.374

424.925

 

Model C

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.049

0.191

0.264

293.148

0.137

0.158

0.209

274.015

0.153

0.300

0.396

419.404

 

 

20m

0.247

0.173

0.235

284.961

0.187

0.177

0.213

275.246

0.178

0.287

0.389

417.433

 

 

30m

0.164

0.177

0.247

288.627

0.213

0.150

0.199

270.992

0.120

0.302

0.403

422.013

 

 

80m

0.089

0.182

0.258

291.609

0.137

0.150

0.203

272.228

-0.019

0.293

0.362

420.503

 

Model D

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.029

0.187

0.266

341.862

0.162

0.156

0.206

321.060

0.182

0.298

0.389

465.077

 

 

20m

0.420

0.160

0.206

323.809

0.254

0.170

0.204

320.404

0.136

0.292

0.399

468.759

 

 

30m

-0.135

0.204

0.288

347.332

0.197

0.158

0.202

319.666

0.105

0.310

0.407

471.093

 

 

80m

0.089

0.182

0.258

339.609

0.116

0.154

0.206

321.018

-0.026

0.296

0.364

469.000

 

Model E

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.316

0.171

0.224

489.618

0.190

0.152

0.202

479.943

0.109

0.308

0.406

630.785

 

 

20m

0.598

0.135

0.171

470.985

0.493

0.136

0.168

467.689

0.213

0.280

0.381

622.491

 

 

30m

0.465

0.158

0.198

480.957

0.126

0.165

0.210

482.452

0.104

0.301

0.407

631.190

 

 

80m

0.248

0.186

0.235

492.945

0.124

0.158

0.205

480.738

-0.053

0.308

0.368

630.784

 

Model F

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.286

0.175

0.229

539.126

0.247

0.143

0.195

525.544

0.139

0.289

0.398

676.442

 

 

20m

0.639

0.131

0.162

515.166

0.447

0.145

0.175

518.538

0.135

0.287

0.399

676.788

 

 

30m

0.429

0.162

0.204

531.313

0.125

0.164

0.207

529.549

0.091

0.295

0.410

680.158

 

 

80m

0.165

0.192

0.247

544.579

0.090

0.154

0.209

529.973

-0.046

0.298

0.367

678.334

 

                                 

Notes: Model A, PlanetScope+DEM; Model B, PlanetScope+Sentinel-1+DEM; Model C, Sentinel-2+DEM; Model D, Sentinel-2+Sentinel-1++DEM; Model E, PlanetScope+Sentinel-2+DEM; Model F, PlanetScope+Sentinel-2+Sentinel-1+DEM.

 

Table 3 Performance results of Support Vector Regression predicting SOC based on different combinations of environmental variables at different modeling resolutions. The most accurate results are shown in bold.

Model

Land Use Type

 

Paddy field

Dry land

Total area

 

R2

MAE (%)

RMSE (%)

AIC

R2

MAE (%)

RMSE (%)

AIC

R2

MAE (%)

RMSE (%)

AIC

 

Model A

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.036

0.185

0.266

245.617

-0.010

0.169

0.226

231.213

-0.017

0.309

0.430

382.658

 

 

20m

0.033

0.185

0.266

245.741

0.012

0.178

0.235

233.710

0.001

0.310

0.429

382.453

 

 

30m

0.029

0.190

0.266

245.854

0.049

0.160

0.219

229.224

0.046

0.309

0.420

379.381

 

 

80m

0.029

0.195

0.267

245.867

0.020

0.160

0.217

228.443

0.010

0.285

0.357

370.405

 

Model B

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.022

0.189

0.268

294.130

-0.006

0.168

0.226

279.087

-0.002

0.312

0.430

430.707

 

 

20m

0.030

0.187

0.266

293.821

-0.002

0.179

0.236

282.167

-0.009

0.312

0.431

431.124

 

 

30m

0.017

0.188

0.268

294.301

0.022

0.165

0.222

278.151

0.009

0.302

0.428

429.969

 

 

80m

0.025

0.188

0.267

294.017

0.027

0.159

0.216

276.191

0.018

0.286

0.356

417.861

 

Model C

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.014

0.187

0.287

294.425

0.048

0.170

0.219

277.253

0.186

0.276

0.388

416.755

 

 

20m

0.069

0.184

0.261

292.407

-0.012

0.185

0.238

282.498

0.141

0.281

0.398

420.392

 

 

30m

0.032

0.184

0.266

293.759

0.056

0.166

0.218

276.970

0.045

0.298

0.419

427.452

 

 

80m

0.011

0.189

0.269

294.507

0.118

0.163

0.206

272.967

0.036

0.286

0.353

416.574

 

Model D

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.017

0.185

0.268

342.304

0.046

0.171

0.219

325.347

0.069

0.311

0.415

473.784

 

 

20m

0.240

0.163

0.234

333.304

-0.009

0.189

0.237

330.410

0.064

0.299

0.416

474.087

 

 

30m

0.018

0.184

0.268

342.247

0.050

0.167

0.219

325.184

0.018

0.315

0.426

477.351

 

 

80m

0.008

0.190

0.269

342.615

0.063

0.163

0.212

322.957

0.033

0.288

0.353

464.784

 

Model E

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.005

0.189

0.270

502.724

0.025

0.169

0.222

486.067

0.027

0.306

0.424

636.725

 

 

20m

0.021

0.188

0.268

502.147

0.025

0.183

0.233

483.589

0.027

0.309

0.424

636.733

 

 

30m

0.008

0.188

0.269

502.611

0.095

0.176

0.214

483.589

0.052

0.304

0.418

634.982

 

 

80m

0.048

0.183

0.264

501.167

0.019

0.160

0.217

484.476

0.035

0.281

0.353

624.630

 

Model F

 

 

 

 

 

 

 

 

 

 

 

 

 

 

3m

0.006

0.189

0.270

550.685

0.000

0.174

0.225

534.887

0.033

0.311

0.423

684.315

 

 

20m

0.222

0.173

0.239

542.136

-0.020

0.181

0.238

538.751

0.005

0.311

0.429

686.212

 

 

30m

0.009

0.189

0.269

550.584

0.033

0.174

0.221

533.790

0.025

0.304

0.424

684.829

 

 

80m

0.014

0.188

0.269

550.400

0.019

0.160

0.217

532.476

0.014

0.284

0.356

674.107

 

                                   

Notes: Model A, PlanetScope+DEM; Model B, PlanetScope+Sentinel-1+DEM; Model C, Sentinel-2+DEM; Model D, Sentinel-2+Sentinel-1++DEM; Model E, PlanetScope+Sentinel-2+DEM; Model F, PlanetScope+Sentinel-2+Sentinel-1+DEM.

 

Comments 2: Figure 1 Add basic cartographic elements: compass, latitude and longitude.

 Response 2: Compass, latitude and longitude are added in Figure 1 as follows:

Figure 1. (a) Elevation and distribution of soil sample points in the study area; (b) RGB image acquired by the PlanetScope optical satellite sensor (image date: July 27, 2017); and (c) distribution of paddy field and dry land in the study area.

Comments 3: Conclusions section: Add detailed figures for states such as "higher accuracy" (Line 610), "accuracy has shown a decline" (Line 625), "improve the prediction accuracy" (Line 619).

 Response 3: Detailed figures are added in the Conclusions section as follows:

1) Different satellite data have their own strengths and weaknesses, and the selection and combination of satellite remote sensing data greatly influence the performance of prediction models. Compared to using a single remote sensing dataset, the synergistic utilization of multi-source remote sensing data can achieve complementarity, resulting in higher accuracy in model predictions. The combination of PlanetScope, Sentinel-2, and topography factors gave satisfactory predictions for dry land (R2 = 0.673, MAE = 0.107%, RMSE = 0.135%). The addition of Sentinel-1 indicators gave the best predictions for paddy field (R2 = 0.699, MAE = 0.114%, RMSE = 0.148%).

2) According to comparative analysis, it has been shown that considering multi-scale modeling contributes to optimizing prediction models and improving the predictive capability of target soil properties. Overall, it is generally observed that under moderate modeling resolutions, more ideal predictions of SOC can be achieved compared to fine and coarse resolutions.

3) The acquisition time of remote sensing images has a certain impact on the interpretation of models. The spectral variables obtained from winter images are the main interpretation variables for SOC.

4) In regions with complex land use types, constructing prediction models for each of them can help improve the prediction accuracy of SOC. Modeling for paddy field and dry land separately resulted in higher prediction accuracies compared to modeling across the entire region, with improvements (in terms of R2) of 36.0% and 33.4%, respectively.

5) The land use types to some extent affect the interpretation of models by satellite data. In models that combine PlanetScope and Sentinel-2 data, the inclusion of radar remote sensing variables has improved the prediction accuracy for paddy field. However, for dry land, the prediction accuracy declines, with dropping by10.2%.

 

Comments 4: 3) should be a paragraph. (Line 615)

Response 4: Done.

Author Response File: Author Response.pdf

Reviewer 2 Report

Comments and Suggestions for Authors

The manuscript proposes the use of multisource and multitemporal remote data for SOC estimation. This is a topic that fully falls within the objectives of Remote Sensing and of potential interest to its readers.

The use of data measured with different sensors and in different modalities poses major challenges that should be addressed properly and with appropriate methods that produce scientifically valid results. However, the manuscript has several flaws that the authors should fix to make it suitable for publication in Remote Sensing.

The manuscript has very ambitious objectives and uses very different types of data, which penalises it in terms of clarity. Arguably, this study could be more usefully divided into at least two manuscripts that enhance the work done by the authors and make the communication of the results to the scientific community clearer.

 

 

The abstract does not adequately summarise the manuscript and does not give the reader a clear idea of its contents.

 

Keywords cannot be sentences or duplicate words in the title. Keywords are intended to allow proper indexing of the manuscript and should therefore be chosen from those relevant to the manuscript and commonly used by other authors.

 

Although the Introduction is very long, it is not sufficiently structured and clear. The issue of the different size of the support to which the data is associated, which in the case of remote sensing imagery coincides with the spatial resolution, should be clearly explained and, above all, it should be explained how it has been solved by others and mention a possible novelty proposed by the authors that would improve existing knowledge.

The use of multi-temporal imaging to mosaic the study area to obtain a composite image of bare ground is the current standard. The authors propose to use the spectral response of the vegetation as proxy information for the estimation of the SOC without using the image mosaicking. This objective seems to me very ambitious and lacking a solid scientific basis.

 

Objectives should be clearer and more concise. Exploring or investigating are not doing research.

 

The materials and methods section should be better structured and provide all necessary details so that readers can follow the development of the manuscript. The methods are many and should also be supported by relevant and original bibliographic references. There always remains the question of how data of different types are made homogeneous in terms of spatial resolution and how point data (soil samples) can be correctly combined with other data.

 

The subsection on statistical analysis of the data should provide details of ANOVA and LSD and how these metrics were used and why. These should also be supported by relevant and original literature references.

 

The rest of the manuscript should be revised in connection with the new Materials and Methods section.

Author Response

Comments and Suggestions for Authors

The manuscript proposes the use of multisource and multitemporal remote data for SOC estimation. This is a topic that fully falls within the objectives of Remote Sensing and of potential interest to its readers.The use of data measured with different sensors and in different modalities poses major challenges that should be addressed properly and with appropriate methods that produce scientifically valid results. However, the manuscript has several flaws that the authors should fix to make it suitable for publication in Remote Sensing.

Response: We appreciate you very much for your work.

We have carefully revised the manuscript according to your suggestions and comments. These changes have greatly improved the quality of the manuscript, and we believe it now meets the requirement for publication in Remote Sensing.

 

The point-to-point responses to the comments were given below.

 

Comments 1: The manuscript has very ambitious objectives and uses very different types of data, which penalises it in terms of clarity. Arguably, this study could be more usefully divided into at least two manuscripts that enhance the work done by the authors and make the communication of the results to the scientific community clearer.

Response 1: The objective of this study is to evaluate the effectiveness of multi-source and multi-temporal remote sensing data for predicting SOC across arable land under vegetation cover condition in the Wuling Mountain region of Southwest China. To do this, we analyzed (i) the utility of synergistic application of multisource remote sensing data in predicting SOC; (ii) the feasibility of using multi-temporal and multiscale remote sensing imagery for SOC content mapping; and (iii) whether constructing SOC prediction models under individual land use types can improve prediction accuracy.

The results show that the synergy of multi-source remote sensing data significantly outperforms single remote sensing data. Modeling under four different spatial resolutions (3m, 20m, 30m, and 80m) reveals that a medium scale (20m) modeling resolution generally achieves the best performance. Within multi-temporal remote sensing data, winter optical imagery plays a dominant role in SOC prediction, suggesting that data from specific time periods have a crucial impact on model performance. Constructing SOC prediction models exclusively under specific land use types indeed helps to improve prediction accuracy. For instance, establishing separate models for paddy field or dry land can significantly enhance their precision. This study demonstrates the great potential of multi-source and multi-temporal remote sensing data in predicting SOC in areas with complex vegetation cover and emphasizes the importance of medium modeling resolution and data from specific time periods. The potential readers can gain a comprehensive understanding of how the combination of remote sensing data, modeling resolution, temporal phases, and land use influence prediction outcomes. This provides valuable reference material for researchers in related fields.

In future work, we will expand our research scope to include other agriculturally significant regions and incorporate data spanning different temporal scales to enhance the robustness of our conclusions. Furthermore, we plan to broaden our experimental framework to encompass a wider range of land use types, with the objective of evaluating the adaptability of our methods across diverse environmental settings and land management practices. Additionally, we intend to integrate multi-source data, including climate, soil conditions, crop growth patterns, and fertilization practices, to further refine the predictive accuracy of our models. This approach is expected to improve both the precision and generalizability of our methods, enabling us to provide more region-specific agricultural management recommendations. We anticipate synthesizing these insights into a comprehensive research paper that validates the reproducibility of our results and elucidates the applicability of these methodologies across varied agricultural landscapes.

Comments 2: The abstract does not adequately summarise the manuscript and does not give the reader a clear idea of its contents.

Response 2: The abstract is modified to summarise the work as follows,

Accurate prediction of soil organic carbon (SOC) is important for agriculture and land management. Methods using remote sensing data are helpful for estimating SOC in bare soils. To overcome the challenge of predicting SOC under vegetation cover, this study extracted spectral, radar, and topographic variables from multitemporal optical satellite images (high-resolution PlanetScope and medium-resolution Sentinel-2), synthetic aperture radar satellite images (Sentinel-1), and digital elevation model, respectively, to estimate SOC content in arable soils in the Wuling Mountain region of Southwest China. These variables were modeled at four different spatial resolutions (3m, 20m, 30m, and 80m) using the eXtreme Gradient Boosting algorithm. The results showed that modeling resolution, the combination of multisource remote sensing data, and temporal phases all influenced SOC prediction performance. The models generally yielded better results at a medium (20m) modeling resolution than at fine (3m) and coarse (80m) resolutions. The combination of PlanetScope, Sentinel-2, and topography factors gave satisfactory predictions for dry land (R2 = 0.673, MAE = 0.107%, RMSE = 0.135%). The addition of Sentinel-1 indicators gave the best predictions for paddy field (R2 = 0.699, MAE = 0.114%, RMSE = 0.148%). The values of R2 of the optimal models for paddy field and dry land improved by 36.0% and 33.4%, respectively, compared to that for the entire study area. The optical imageries in winter played a dominant role in prediction of SOC for both paddy field and dry land. This study offers valuable insights into effectively modeling soil properties under vegetation cover at various scales using multisource and multitemporal remote sensing data.

 

Comments 3: Keywords cannot be sentences or duplicate words in the title. Keywords are intended to allow proper indexing of the manuscript and should therefore be chosen from those relevant to the manuscript and commonly used by other authors.

Response 3: The keywords are modified as follows,

Multispectral imaging; Synthetic aperture radar data; Machine learning; Soil property prediction; Land use. (P1 L19-20)

 

Comments 4: Although the Introduction is very long, it is not sufficiently structured and clear. The issue of the different size of the support to which the data is associated, which in the case of remote sensing imagery coincides with the spatial resolution, should be clearly explained and, above all, it should be explained how it has been solved by others and mention a possible novelty proposed by the authors that would improve existing knowledge.

Response 4: The objective of this study is to evaluate the effectiveness of multi-source and multi-temporal remote sensing data for predicting SOC across arable land under vegetation cover condition in the Wuling Mountain region of Southwest China.

 

The novelty of this study lies in the synergistic utilization of optical and radar remote sensing data to comprehensively capture surface information, along with the dynamic monitoring of vegetation growth cycles using multi-temporal remote sensing data. This approach aims to optimize prediction accuracy in complex environmental conditions. Additionally, by employing multi-scale modeling to refine resolution and constructing independent models tailored for different land use types, the method effectively enhances the accuracy of existing SOC prediction methods.

 

Please refer to Page 1 Line 21 to Page 3 Line 136 for modifications of the introduction.

 

Comments 5: The use of multi-temporal imaging to mosaic the study area to obtain a composite image of bare ground is the current standard. The authors propose to use the spectral response of the vegetation as proxy information for the estimation of the SOC without using the image mosaicking. This objective seems to me very ambitious and lacking a solid scientific basis.

Response 5: Currently, the application of multi-temporal remote sensing data in SOC mapping is primarily concentrated in areas with exposed soil (e.g., Guo, 2021; Stevens et al., 2010; Íala et al., 2019; Wang et al., 2018; Tayeb et al., 2021). These studies are typically limited to fallow lands, agricultural seedbeds, and arid and semi-arid regions with sparse vegetation, because, in such areas, exposed soil is more easily observed within specific time windows, allowing multi-temporal remote sensing data to effectively capture the reflective characteristics of the soil surface (Wang et al., 2018). However, in the area under vegetation cover, the signal within the pixels of satellite imagery might be generated jointly by both vegetation and soil, making it challenging to isolate information that is purely attributable to the soil.

Numerous studies have demonstrated a significant correlation between vegetation growth conditions and soil properties (Wei et al., 2022; Wang et al., 2018; Chen et al., 2023). For instance, the health status of plants often directly reflects the nutrient content in the soil around the root zones, which is closely related to Soil Organic Carbon (SOC) (Chen et al., 2023). Satellite imagery such as Sentinel-2, MODIS, and Sentinel-1 can capture the spectral characteristics of vegetation, which can serve as indirect proxy variables to reflect soil properties (Wang et al., 2021; Paul et al., 2020; Hengl et al., 2017; Dahhani et al., 2024; Maynard and Levi, 2017; Pellerin et al., 2019). By analyzing these spectral data, we can infer vegetation characteristics associated with SOC, thereby achieving more accurate predictions of SOC.

Multi-source remote sensing data provide a more comprehensive and detailed description of surface features, while multi-temporal remote sensing data allow for dynamic tracking of information over different periods, thereby reducing biases introduced by short-term climatic conditions or incidental events affecting single observations (Luo et al., 2022). Although some studies have applied multi-source and multi-temporal remote sensing data for SOC prediction and achieved positive results (Wang et al., 2021; Grabska et al., 2020), the potential of high-resolution satellite imagery for estimating SOC in densely vegetated areas is still limited.

Therefore, we try to utilize multi-source (high- and medium-resolution, and radar) and multi-temporal remote sensing data to estimate SOC in this vegetation cover area.

Considering the differences in crop growth across different seasons in vegetated areas, utilizing multi-temporal data not only effectively reduces uncertainties that may arise from single-time-point data but also comprehensively considers factors such as annual climatic conditions and vegetation growth cycles’ impact on SOC (Luo et al., 2022). The vegetation indices used in our research have been widely applied for indirectly measuring soil fertility and organic matter content (Huete et al., 2002; Wang et al., 2021; Veloso et al., 2017; Trudel et al., 2012; Yu et al., 2020; He et al., 2022; Haj-Amor et al., 2022). These indices capture the corresponding plant responses during different growth stages and seasonal variations, providing reliable information for SOC estimation. Many researchers commonly employ image fusion techniques to integrate data when using multi-temporal remote sensing data for SOC prediction (Meng et al., 2022; Li et al., 2023; Žížala et al., 2019). While this approach can reduce input variables and lower model complexity, it may lead to information blurring during the fusion process, making it difficult to identify which temporal data contribute more significantly to the prediction results during feature variable importance analysis (Zhou et al., 2022; Zhu et al., 2012; Forkuor et al., 2017). In our study, spectral variables from each temporal phase are extracted separately as proxies for SOC prediction. This approach avoids unnecessary information mixing and ensures that each variable provides unique and valuable insights, clarifying which season’s remote sensing data is more crucial for SOC prediction and enhancing the interpretability of the results.

In summary, by integrating multi-source and multi-temporal remote sensing data, we not only mitigate issues caused by plant spectral reflectance on soil property observations but also capture unique information from different periods, thereby enhancing the accuracy of SOC predictions. This approach presents a novel and effective method for predicting SOC in densely vegetated areas and introduces new research perspectives to this field.

References

Guo, J., Zhu, Q., Zhao, X., Gou, X., Han, Y., Xu, J, 2020. Hyper-spectral inversion of soil organic carbon content under different land use types. Chinese Journal of Applied Ecology, 31(3), 863-871. (In Chinese with English abstract)

Stevens, A., Udelhoven, T., Denis, A., Tychon, B., Lioy, R., Hoffmann, L., Van Wesemael, B, 2010. Measuring soil organic carbon in croplands at regional scale using airborne imaging spectroscopy. Geoderma: An International Journal of Soil Science, 158, [specific pages not provided].

Íala, D., Minaík, R., Zádorová, T, 2019. Soil Organic Carbon Mapping Using Multispectral Remote Sensing Data: Prediction Ability of Data with Different Spatial and Spectral Resolutions. Remote Sensing, 11(24), 2947.

Wang, B., Waters, C., Orgill, S., Gray, J., Cowie, A., Clark, A., Liu, D. L, 2018. High-resolution mapping of soil organic carbon stocks using remote sensing variables in the semi-arid rangelands of eastern Australia. Science of the Total Environment, 630(July 15), 367-378.

Tayebi, M., Rosas, J. T. F., Mendes, W. D. S., Poppiel, R. R., Demattê, J. A. M, 2021. Drivers of organic carbon stocks in different LULC history and along soil depth for a 30 years image time series. Remote Sensing, 13(11), 1-32.

Chen L, Jiahui S, Taogetao B. Changes in soil organic carbon and nitrogen stocks following revegetation in a semi-arid grassland of North China[J]. Journal of Environmental Management, 2023, 346: 118995.

Yajuan W, Meiying L, Ji W, et al. The effects of vegetation communities on soil organic carbon stock in an enclosed desert-steppe region of northern China[J]. Soil Science and Plant Nutrition, 2022, 68(2): 284-294.

Wang X, Li Y, Chen Y, et al. Spatial pattern of soil organic carbon and total nitrogen, and analysis of related factors in an agro-pastoral zone in Northern China[J]. PLoS One, 2018, 13(5): e0197451.

Wang, H., Zhang, X., Wu, W., Liu, H, 2021. Prediction of soil organic carbon under different land use types using Sentinel-1/-2 data in a small watershed. Remote Sensing, 13(7), 1229.

Paul, S. S., Coops, N. C., Johnson, M. S., Krzic, M., Chandna, A., Smukler, S. M, 2020. Mapping soil organic carbon and clay using remote sensing to predict soil workability for enhanced climate change adaptation. Geoderma, 363, 114177.

Hengl, T., Leenaars, J. G., Shepherd, K. D., Walsh, M. G., Heuvelink, G. B., Mamo, T., Tilahun, H., Berkhout, E., Cooper, M., Fegraus, E, 2017. Soil nutrient maps of Sub-Saharan Africa: assessment of soil nutrient content at 250 m spatial resolution using machine learning. Nutrient Cycling in Agroecosystems, 109, 77-102.

Dahhani, S., Raji, M., Bouslihim, Y, 2024. Synergistic use of multi-temporal radar and optical remote sensing for soil organic carbon prediction. Remote Sensing, 16(11), 1871.

Maynard, J. J., Levi, M. R, 2017. Hyper-temporal remote sensing for digital soil mapping: Characterizing soil-vegetation response to climatic variability. Geoderma.

Pellerin, S., Bamière, L., Constantin, J., Launay, C., Richard, G, 2019. A model-based assessment of the soil C storage potential at the national scale: A case study from France. Science of the Total Environment, 579, 1094-110.

Peng L, Wu X, Feng C, et al. Assessing the potential of multi-source remote sensing data for cropland soil organic matter mapping in hilly and mountainous areas[J]. CATENA, 2024, 245: 108312.

Xie B, Ding J, Ge X, et al. Estimation of soil organic carbon content in the Ebinur Lake wetland, Xinjiang, China, based on multisource remote sensing data and ensemble learning algorithms[J]. Sensors, 2022, 22(7): 2685.

Luo, C., Zhang, X., Meng, X., Zhu, H., Ni, C., Chen, M., Liu, H, 2022. Regional mapping of soil organic matter content using multitemporal synthetic Landsat 8 images in Google Earth Engine. Catena, 209, 105842.

Grabska, E., Frantz, D., Ostapowicz, K, 2020. Evaluation of machine learning algorithms for forest stand species mapping using Sentinel-2 imagery and environmental data in the Polish Carpathians. Remote Sensing of Environment, 251, 112103.

Huete A, Didan K, Miura T, et al. Overview of the radiometric and biophysical performance of the MODIS vegetation indices[J]. Remote sensing of environment, 2002, 83(1-2): 195-213.

Veloso, A., Mermoz, S., Bouvet, A., Le Toan, T., Planells, M., Dejoux, J., Ceschia, E, 2017. Understanding the temporal behavior of crops using Sentinel-1 and Sentinel-2-like data for agricultural applications. Remote Sensing of Environment, 199, 415-426.

Trudel, M., Charbonneau, F., Leconte, R, 2012. Using RADARSAT-2 polarimetric and ENVISAT-ASAR dual-polarization data for estimating soil moisture over agricultural fields. Canadian Journal of Remote Sensing, 38(4), 514-527.

He, W., Wang, H., Ye, W., Tian, Y., Hu, G., Lou, Y., Pan, H., Yang, Q., Zhuge, Y, 2022. Distinct stabilization characteristics of organic carbon in coastal salt-affected soils with different salinity under straw return management. Land Degradation & Development, 33(13), 2246-2257.

Haj-Amor, Z., Araya, T., Kim, D., Bouri, S., Lee, J., Ghiloufi, W., Yang, Y., Kang, H., Jhariya, M. K., Banerjee, A, 2022. Soil salinity and its associated effects on soil microorganisms, greenhouse gas emissions, crop yield, biodiversity and desertification: A review. Science of the Total Environment, 843, 156946.

Meng X, Bao Y, Wang Y, et al. An advanced soil organic carbon content prediction model via fused temporal-spatial-spectral (TSS) information based on machine learning and deep learning algorithms[J]. Remote Sensing of Environment, 2022, 280: 113166.

Li X, Li Z, Qiu H, et al. Soil carbon content prediction using multi-source data feature fusion of deep learning based on spectral and hyperspectral images[J]. Chemosphere, 2023, 336: 139161.

Žížala D, Minařík R, Zádorová T. Soil organic carbon mapping using multispectral remote sensing data: Prediction ability of data with different spatial and spectral resolutions[J]. Remote Sensing, 2019, 11(24): 2947.

Zhou, X., Dandan, L., Huiming, Y., Honggen, C., Leping, S., Guojing, Y., Qingbiao, H., Brown, L., Malone, J. B, 2002. Use of landsat TM satellite surveillance data to measure the impact of the 1998 flood on snail intermediate host dispersal in the lower Yangtze River Basin. Acta Tropica, 82(2), 199-205.

Zhu Z, Woodcock C E, Olofsson P. Continuous monitoring of forest disturbance using all available Landsat imagery[J]. Remote sensing of environment, 2012, 122: 75-91.

Forkuor G, Hounkpatin O K L, Welp G, et al. High resolution mapping of soil properties using remote sensing variables in south-western Burkina Faso: a comparison of machine learning and multiple linear regression models[J]. PloS one, 2017, 12(1): e0170478.

 

Comments 6: Objectives should be clearer and more concise. Exploring or investigating are not doing research.

Response 6: A more clearer and concise objective is addressed in the revision as follows,

Hence, the objective of this study is to evaluate the effectiveness of multi-source and multi-temporal remote sensing data for predicting SOC across arable land under vegetation cover condition in the Wuling Mountain region of Southwest China. (P3 L124 to 126)

 

Comments 7: The materials and methods section should be better structured and provide all necessary details so that readers can follow the development of the manuscript. The methods are many and should also be supported by relevant and original bibliographic references. There always remains the question of how data of different types are made homogeneous in terms of spatial resolution and how point data (soil samples) can be correctly combined with other data.

Response 7: According to your suggestion, we have revised the Materials and Methods section. A flowchart of methodology is added to help readers follow the environmental data process (Figure 2 in Section 2.3). Meanwhile, the relevant references are also added in the revision.

To address the issue of homogenizing spatial resolution across different types of data, we provided a detailed explanation in Section 2.3 and illustrated the specific procedures for processing three types of data in Figure 2. For the two optical remote sensing datasets, identical preprocessing steps were applied, including radiometric calibration and atmospheric correction using the FLAASH model. Subsequently, to standardize pixel resolutions across optical, radar, and topographic data, we employed bilinear resampling within ArcGIS 10.8 to resample all images to resolutions of 3 meters, 20 meters, 30 meters, and 80 meters respectively, thereby achieving consistent spatial resolution among these datasets.

 

Regarding the accurate integration of point data (soil samples) with other datasets, we provided a comprehensive explanation in Section 2.1. At each designated sampling point, we established 4-6 sub-sampling points centered around the GPS-located point and extended radially by a distance of 30-50 meters for sample collection. The corresponding latitude and longitude coordinates for each sub-sampling point were recorded during this process. Subsequently, soil sampling data were matched with environmental data using the “Extract Multi Values to Points” tool in ArcGIS 10.8 under a common coordinate system to achieve precise data integration.

Figure 2. Environmental data preprocessing processes for modeling.

Comments 8: The subsection on statistical analysis of the data should provide details of ANOVA and LSD and how these metrics were used and why. These should also be supported by relevant and original literature references.

Response 8: One-way analysis of variance (ANOVA) was used to analyse the differences in SOC content between paddy field and dry land. The relevant reference is added in the revision. The sentence is modified as follows:

To analyse the differences in SOC content between paddy field and dry land, one-way analysis of variance (ANOVA) with a confidence interval of 0.05 was used because it effectively determines if there are significant differences between groups (Field A, 2024). These statistical analyses were performed using SPSS V.25 software package.

References

Field A. Discovering statistics using IBM SPSS statistics[M]. Sage publications limited, 2024.

 

Comments 9: The rest of the manuscript should be revised in connection with the new Materials and Methods section.

Response 9: The manuscript is thoroughly revised according to your comments. These modifications have enhanced the rigor of our research and improved the quality of our manuscript. We believe that the revised version meets the high standards of Remote Sensing and we hope these improvements will be satisfactory to you. Once again, we greatly appreciate your valuable time and professional guidance.

Author Response File: Author Response.pdf

Round 2

Reviewer 1 Report

Comments and Suggestions for Authors

I am satisfied with the revision of the authors. I only have one recommendation before publication:

 

I recommend the authors to append their comparisons among XGBoost, Random Forest (RF), Gradient Boosting Decision Tree (GBDT), and Support Vector Regression (SVR) in the Appendix in the manuscript.

Author Response

Comments 1: I recommend the authors to append their comparisons among XGBoost, Random Forest (RF), Gradient Boosting Decision Tree (GBDT), and Support Vector Regression (SVR) in the Appendix in the manuscript.

Response 1:We are extremely grateful for your thorough review and constructive feedback on our manuscript. Based on your suggestions, we have made the corresponding revisions. Specifically, we have added the predictive results of different model under three additional machine learning models (Random Forest (RF), Gradient Boosting Decision Trees (GBDT), and Support Vector Regression (SVR)) in Appendix B. Additionally, we have supplemented and clarified the content in Section 3.1 of the manuscript (P14 L393-396).

Reviewer 2 Report

Comments and Suggestions for Authors

The authors sufficiently addressed all comments and revised the manuscript appropriately. Some limitations of the manuscript remain from my point of view, but I believe the manuscript deserves to be published in Remote Sensing so as to encourage more comparison of the approach used in the scientific community potentially interested in it.

Author Response

We would like to extend our sincere gratitude for your thorough and professional review of our manuscript. Your valuable feedback and suggestions are greatly appreciated and will be immensely helpful in improving our work.

Back to TopTop