Next Article in Journal
Deformation Monitoring of Tailings Reservoir Based on Polarimetric Time Series InSAR: Example of Kafang Tailings Reservoir, China
Next Article in Special Issue
Spatiotemporal Variation in Compound Dry and Hot Events and Its Effects on NDVI in Inner Mongolia, China
Previous Article in Journal
A New Method for Calculating Water Quality Parameters by Integrating Space–Ground Hyperspectral Data and Spectral-In Situ Assay Data
Previous Article in Special Issue
Contributions of Climatic and Anthropogenic Drivers to Net Primary Productivity of Vegetation in the Mongolian Plateau
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

High-Precision Population Spatialization in Metropolises Based on Ensemble Learning: A Case Study of Beijing, China

1
State Key Laboratory of Remote Sensing Science, Beijing Normal University, Beijing 100875, China
2
Beijing Key Laboratory of Environmental Remote Sensing and Digital City, Beijing Normal University, Beijing 100875, China
3
Faculty of Geographical Science, Beijing Normal University, Beijing 100875, China
4
School of Statistics, Beijing Normal University, Beijing 100875, China
5
State Key Laboratory of Earth Surface Processes and Resource Ecology, Beijing Normal University, Beijing 100875, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2022, 14(15), 3654; https://doi.org/10.3390/rs14153654
Submission received: 28 June 2022 / Revised: 27 July 2022 / Accepted: 27 July 2022 / Published: 29 July 2022

Abstract

Accurate spatial population distribution information, especially for metropolises, is of significant value and is fundamental to many application areas such as public health, urban development planning and disaster assessment management. Random forest is the most widely used model in population spatialization studies. However, a reliable model for accurately mapping the spatial distribution of metropolitan populations is still lacking due to the inherent limitations of the random forest model and the complexity of the population spatialization problem. In this study, we integrate gradient boosting decision tree (GBDT), extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM) and support vector regression (SVR) through ensemble learning algorithm stacking to construct a novel population spatialization model we name GXLS-Stacking. We integrate socioeconomic data that enhance the characterization of the population’s spatial distribution (e.g., point-of-interest data, building outline data with height, artificial impervious surface data, etc.) and natural environmental data with a combination of census data to train the model to generate a high-precision gridded population density map with a 100 m spatial resolution for Beijing in 2020. Finally, the generated gridded population density map is validated at the pixel level using the highest resolution validation data (i.e., community household registration data) in the current study. The results show that the GXLS-Stacking model can predict the population’s spatial distribution with high precision (R2 = 0.8004, MAE = 34.67 persons/hectare, RMSE = 54.92 persons/hectare), and its overall performance is not only better than the four individual models but also better than the random forest model. Compared to the natural environmental features, a city’s socioeconomic features are more capable in characterizing the spatial distribution of the population and the intensity of human activities. In addition, the gridded population density map obtained by the GXLS-Stacking model can provide highly accurate information on the population’s spatial distribution and can be used to analyze the spatial patterns of metropolitan population density. Moreover, the GXLS-Stacking model has the ability to be generalized to metropolises with comprehensive and high-quality data, whether in China or in other countries. Furthermore, for small and medium-sized cities, our modeling process can still provide an effective reference for their population spatialization methods.
Keywords: population spatialization; ensemble learning; stacking; metropolis; Beijing population spatialization; ensemble learning; stacking; metropolis; Beijing
Graphical Abstract

Share and Cite

MDPI and ACS Style

Bao, W.; Gong, A.; Zhao, Y.; Chen, S.; Ba, W.; He, Y. High-Precision Population Spatialization in Metropolises Based on Ensemble Learning: A Case Study of Beijing, China. Remote Sens. 2022, 14, 3654. https://doi.org/10.3390/rs14153654

AMA Style

Bao W, Gong A, Zhao Y, Chen S, Ba W, He Y. High-Precision Population Spatialization in Metropolises Based on Ensemble Learning: A Case Study of Beijing, China. Remote Sensing. 2022; 14(15):3654. https://doi.org/10.3390/rs14153654

Chicago/Turabian Style

Bao, Wenxuan, Adu Gong, Yiran Zhao, Shuaiqiang Chen, Wanru Ba, and Yuan He. 2022. "High-Precision Population Spatialization in Metropolises Based on Ensemble Learning: A Case Study of Beijing, China" Remote Sensing 14, no. 15: 3654. https://doi.org/10.3390/rs14153654

APA Style

Bao, W., Gong, A., Zhao, Y., Chen, S., Ba, W., & He, Y. (2022). High-Precision Population Spatialization in Metropolises Based on Ensemble Learning: A Case Study of Beijing, China. Remote Sensing, 14(15), 3654. https://doi.org/10.3390/rs14153654

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop