1. Introduction
Coal is a crucial energy resource, with the United States (U.S.) holding approximately 22% of global reserves [
1]. Since the first commercial extraction of coal in Virginia in 1701, U.S. coal production expanded significantly, reaching nearly a billion tons in 2000 [
2]. However, coal extraction generates considerable waste—up to 0.4 tons per ton of coal produced—which includes low-grade coal mixed with rock, shale, and other impurities [
3]. Managing this waste is a critical aspect of coal mining operations due to its environmental and logistical challenges.
Before the enactment of the Surface Mining Control and Reclamation Act (SMCRA) in 1977, there was a lack of comprehensive federal legislation regulating surface mining operations [
4]. This regulatory gap led to numerous coal mining operations being abandoned without proper reclamation. Many of these sites still contain remnants of coal mining operations in the form of coal spoil, tailings, and refuse piles [
5].
Abandoned mine lands often exhibit various forms of environmental degradation and pose risks to surrounding ecosystems and communities, including soil, water, and air pollution, as well as geological hazards for nearby communities [
6,
7,
8]. For instance, the spontaneous combustion of waste coal releases a large amount of CO, SO
2, NOx, and other harmful gases, which greatly affect the surrounding residents’ lives [
9,
10,
11]. Additionally, prolonged exposure of waste coal to the surface or groundwater releases toxic elements, further polluting the ecosystem and posing a risk to human health [
6,
9].
After the enactment of the SMCRA, coal mining operators were mandated to reclaim areas affected by mining operations. However, no company or individual bears responsibility for reclaiming the abandoned mine lands that predate the SMCRA under state or federal laws [
12]. Therefore, monitoring these historical sites is crucial to mitigate their negative impact on the ecological environment [
13,
14].
Conventional methods for monitoring mine lands, such as topographic surveys and photogrammetric studies, are known to be both time-consuming and labor-intensive [
15]. Consequently, these techniques are not efficiently scalable for mapping large-scale mine areas. Remotely sensed satellite data has emerged as a cost-effective solution for mapping land cover over large geographic regions. Advancements in sensor and satellite technologies have made it economically feasible to acquire both historical and current spatial information about various land processes. Consequently, remote sensing data has been extensively used by the mining industry for mineral exploration and environmental monitoring purposes in recent decades.
Numerous researchers have explored image classification techniques and change detection approaches to monitor mining areas and disturbed lands [
16,
17,
18,
19]. Mao et al. developed a remote sensing model using Landsat imagery to extract coal areas in Huolinhe and Huozhou mining regions in China [
20]. Zeng et al. used object-oriented decision trees to extract surface coal mining areas in Inner Mongolia [
21]. Their method integrated spectral and spatial characteristics to distinguish mining areas from non-mining areas. More recently, Werner et al. attempted to identify the size and location of various mine features using satellite imagery across different countries and for different commodities like copper, gold, silver, and platinum [
22]. More recent studies have advanced mining land monitoring through multi-source datasets. The MineCam dataset combined Sentinel-2 and Sentinel-1 imagery to map over 400 mining sites globally [
23]. Similarly, the CUG_MISDataset provided more than 1400 image blocks and 3000 annotated instances in China, supported by a new network architecture that improved segmentation accuracy [
24]. These efforts highlight rapid progress in high-precision recognition of mining land occupation and the need to explore diverse methods, as performance remains context-dependent [
25].
In recent years, time series analysis of remotely sensed data has gained popularity for monitoring changes within mining areas due to its ability to capture temporal dynamics at a granular (i.e., pixel) level. These methods typically apply algorithms to indices derived from remotely sensed data that reflect vegetation or ecological status. The algorithms often involve fitting, decomposition, and breakpoint detection [
26]. Hu et al. assessed the effectiveness of a decomposition algorithm called the Bayesian Estimator of Abrupt change, Seasonality, and Trend (BEAST) in detecting fine-scale human disturbances [
27]. Furthermore, a combination of spatiotemporal dynamic weighting, the Best Index Slope Extraction algorithm (BISE), and the moving window methods has been explored to detect mining disturbance and impact [
28,
29]. Among these, the Landsat-based Detection of Trends in Disturbance and Recovery (LandTrendr) algorithm has proven to be highly effective in identifying disturbances and trends and hence has been widely used in various applications such as land use planning, forest monitoring, and carbon accounting [
30]. It has also demonstrated efficiency in detecting changes in surface coal mines [
31,
32,
33], making it the preferred algorithm for change detection.
Several studies have investigated classification and change detection approaches independently to monitor disturbances in active mining areas. However, there is a significant research gap concerning waste coal that existed before the implementation of the SMCRA. Focusing on these pre-SMCRA waste coal piles is crucial because they were created before modern environmental regulations were in place, often resulting in more severe environmental degradation [
34]. These historical waste coal piles may pose long-term environmental hazards, including soil and water contamination, increased erosion, and spontaneous combustion. Monitoring these historical sites is essential for improving environmental quality and mitigating historical pollution. This study aims to fill this gap by developing an end-to-end data analytical workflow in Google Earth Engine (GEE) using satellite imagery to map these historical (pre-SMCRA) waste coal piles. The specific objectives of this study are as follows:
Identify and map the locations of waste coal piles using machine learning (ML) algorithms trained on satellite imagery.
Assess the temporal dynamics of the mapped waste coal piles, focusing on reclamation and disturbance monitoring using the LandTrendr algorithm.
Develop an approach to distinguish waste coal piles predating SMCRA from those in active mining areas.
4. Future Directions
This study applied ML and change-detection algorithms to medium-resolution satellite imagery to identify historical waste coal piles. The small size of these piles and the limited availability of training and testing datasets may introduce uncertainties in the performance of the ML models when applying this approach to other regions. Integrating data from larger geographic regions and leveraging advanced ML techniques may help address these limitations. As the method has been tested in a single study area, its effectiveness under different geographic and environmental conditions remains unverified. Future work should therefore prioritize integrating data from larger and more diverse geographic regions to evaluate model transferability and strengthen its generalizability. Additionally, integrating high-resolution drone or commercial imagery with medium-resolution satellite data could further improve mapping accuracy, enable more detailed characterization of surface properties, and enhance detection of smaller or partially obscured piles [
54].
We analyzed trends in NDVI within the mining region to assess reclamation efforts and monitor ongoing coal mining operations. To minimize short-term climatic and seasonal variability, we used the intra-annual 95th percentile NDVI, which helped reduce the influence of geographic variation in vegetation growth driven by climate, soil, and topography. Moving forward, future work could be strengthened by integrating ancillary datasets. In particular, DEM-derived terrain indices such as slope or aspect may serve as stable, time-invariant covariates in relatively undisturbed landscapes. However, for actively mined or reclaimed areas where topography is continually altered, the use of terrain indices should be limited to stable zones. Incorporating such terrain indices could improve the quantification of site-specific changes and enable a more comprehensive assessment of reclamation success.
5. Conclusions
This study developed a comprehensive data analytical workflow using remotely sensed data to classify and monitor waste coal piles predating the SMCRA regulations. Waste coal piles were first delineated through supervised image classification of Sentinel-2 imagery using four ML models—kNN, RF, GB, and SVM. Among these, the RF model achieved the highest performance, with a precision of 86%, a recall of 77%, an OA of 96%, and an F1-score of 91%. The LandTrendr algorithm was then applied to a multi-decadal time series to assess disturbance and reclamation. This analysis revealed that historical waste coal piles have experienced a gradual and consistent increase in vegetation cover since 1986, reflecting natural reclamation processes. These areas also showed little to no change in disturbance magnitude, indicating the absence of major new disturbances. In contrast, active mining areas exhibited consistently high disturbance magnitudes due to ongoing surface coal mining operations. These contrasting patterns, captured in the disturbance mapping, enabled clear differentiation between historical waste coal piles and active mining sites.
The proposed workflow achieved a precision of 78.6% and a recall of 100% in identifying historical waste coal piles, demonstrating its reliability for monitoring legacy mining impacts. By integrating image classification with long-term change detection, this approach advances understanding of historical mining disturbances and provides a practical framework to support reclamation planning and policy development. Moreover, the workflow is adaptable to other coal-producing regions, offering a scalable solution for data-driven environmental management and sustainable land rehabilitation.