1. Introduction
Forest fires are frequent natural disasters in China and around the world, characterized by their sudden onset, significant destructive power, and challenging emergency response efforts. Globally, more than 200,000 forest fires occur each year, resulting in the destruction of millions of hectares of forest, accounting for over 0.1% of the total forest area [
1,
2]. In recent years, forest fires have erupted frequently worldwide, with numerous significant incidents occurring in 2019 alone [
3,
4,
5,
6,
7,
8,
9]. China has consistently faced severe forest fire hazards, with a major forest fire having occurred in Liangshan Prefecture, Sichuan Province, on 30 March 2019, resulting in 31 fatalities and the burning of 20 hectares of forest land. A year later, on 30 March another massive forest fire struck the same region, causing 19 casualties, 3 injuries, and affecting an area of approximately 3048 hectares, with direct economic losses reaching nearly a billion yuan [
10]. Forest fires not only directly destroy vegetation, resulting in substantial losses, but also indirectly impact the carbon cycling process and distribution patterns within the biosphere, disrupting regional ecosystem equilibrium [
2,
9,
11]. Therefore, conducting pre-disaster warning and risk monitoring research on forest fires holds significant importance in mitigating disaster losses, protecting forest resources, and maintaining ecological balance [
12,
13].
Since the 1970s, remote sensing technology enabled the monitoring of regional wildfires based on abundant satellite data. In 1999, NASA successfully launched the first satellite of the Earth Observing System (EOS) program, TERRA, equipped with five Earth-observing sensors, including the Moderate Resolution Imaging Spectroradiometer (MODIS) [
14,
15]. The MODIS sensor includes a dedicated channel designed for wildfire monitoring [
16]. MODIS data are readily accessible, relatively easy to process, and offer high-temporal resolution, making them the most essential remote sensing data for global wildfire research in the past decade [
17].
Fuel, such as forests, could cause fires due to various reasons [
18,
19]. Increasing human activities have put forests in a flammable situation, such as burning straw for the autumn harvest, fireworks for traditional celebrations, and other ignition behaviors. Moreover, forest phenology could directly affect the stocking level, causing seasonal differences in fire cases. It has also been proven that the lower-latitude regions are more prone to risk fire hazards [
18]. Facing such uncertainty, vision-based SAG (space–air–ground) remote sensing has become a prevalent strategy for forest fire detection and risk assessment [
20].
Based on remote sensing, risk assessments of forest fires adapted to different situations have been developed, and currently, forest fire risk assessment methods can be broadly categorized into three types as follows:
These assessments analyze and study data characteristics using statistical theories, such as cluster analysis, fuzzy comprehensive evaluation, and the gray system theory, to derive statistical patterns. For instance, Lv et al. developed a forest fire spread GM (1,1) model based on the gray system modeling theory and the ER algorithm directly using MODIS fire point data. However, continuous updates and adjustments to the model are required to avoid the distortion of results [
21]. Krishna Prasad Vadrevu, in 2010, employed a combination of the fuzzy set theory and decision algorithms within a Geographic Information System (GIS) framework to analyze forest fire risk in southern India and created fire risk maps [
22]. Sirio Modugno and colleagues, in 2016, utilized logistic regression analysis to explore specific positive and negative relationships between forest fires and the wildland–urban interface (WUI) in their vicinity [
23]. In 2017, Ryan Lagerquist and his team predicted fire spread days and extreme weather fire risk using self-organizing maps (SOMs) based on the Canadian Fire Weather Index System (CFWIS) [
24]. H. Yathish and colleagues, in 2019, applied three fire hazard discrimination methods (logistic regression, multi-criteria decision analysis, and weighted overlay) in India, considering factors like elevation, slope, distance to roads, human activity area, land surface temperature (LST), and the normalized difference vegetation index (NDVI) for fire risk assessment. They found the logistic regression model to have the highest accuracy at 88.89% [
25]. Volkan Sevinc et al., in 2019, used a Bayesian network model to predict fire risk factors in southwestern Turkey, highlighting the month as the primary influencing factor on forest fire occurrence, followed closely by the temperature; however, they did not consider the relationship between the temperature and the month [
26]. These methods demand high data accuracy and reliability and possess a degree of randomness. However, they incorporated a limited mechanistic understanding of fire occurrence and background knowledge.
This category constructs the fire risk assessment index based on the mechanisms of fire occurrence, providing a more realistic reflection of fire occurrence. In 1967, McArthur introduced the fire danger index (FDI) based on meteorological factors, which laid the foundation for this approach [
27]. The Canadian Forest Fire Danger Rating System (CFFDRS) is widely used and has been applied in several countries, including parts of the United States, New Zealand, Fiji, Indonesia, and Malaysia [
28]. This system calculates multiple flammability and fire behavior indicators based on four meteorological factors and derives a risk index for quantitatively guiding forest fire management activities [
29,
30]. In China, the adaptability of the CFFDRS was verified in the Daxing’anling region [
31]. In 1995, the Chinese Ministry of Forestry introduced the National Forest Fire Weather Rating Standard, considering five fire risk meteorological factors and employing an expert scoring method. In 2007, the FFDI (forest fire danger index) was optimized and released, undergoing further improvement based on the modified Brong–Davis scheme in 2018, and continues to be used today [
32]. However, traditional fire risk indices primarily focus on meteorological and fuel factors, have limited factors for the surrounding environment such as terrain, the water system, and other geographical factors, and offer a relatively macroscopic assessment perspective, making them less adaptable for regional forest fire assessment, as they may not consider all relevant fire impact factors.
In recent years, machine learning methods have gained popularity in fire risk assessment. Support vector machines (SVMs) are commonly used and have demonstrated good performances in fire risk assessment model comparison experiments [
33,
34,
35]. Ensemble algorithms, like random forest (RF), gradient boosting decision tree (GBDT), and XGBoost, demonstrate better accuracy compared to most individual algorithms and have exhibited strong classification and regression prediction capabilities in various forest fire risk assessment studies [
36,
37,
38,
39]. Multilayer perceptrons (MLPs) and back propagation neural networks (BPNNs) are also frequently employed in forest fire risk assessment, although their accuracy tends to be lower than the aforementioned methods [
33,
34,
35,
36,
37,
38,
39,
40,
41,
42,
43]. Deep learning methods, such as recurrent neural networks (RNNs) and convolutional neural networks (CNNs), map complex relationships between fire occurrences and input features through intricate network structures [
44,
45,
46,
47]. However, machine learning methods require a high quality and quantity of data, necessitate continuous adjustments and optimizations with changing data, involve lengthy and complex training processes, and face challenges in handling diverse geographic conditions. In large-scale predictions, the complexity of these models increases significantly, making them less suitable, and the “black-box” nature of machine learning may lead to weaker reliability [
44,
45,
46,
47].
In conclusion, considering the advantages and disadvantages of various methods, we have chosen to use the widely adopted fire risk index approach. Building upon the FFDI, we took into account the mechanisms of fire occurrence, analyzed not only the hazard-affected body but also the hazard-formative environment and various hazard factors, including lightning, and constructed a composite forest fire risk index, providing assessment results for the study area from 2010 to 2019. We also incorporated historical disaster data for result validation, thereby offering valuable insights for local forest fire monitoring and prevention efforts.
3. Results
3.1. Relation between Fire and Its Indicators
The Spearman method revealed significant correlations between the fire points and the indicators, except for lightning intensity and roading density (
Table 6). To further explore and visually represent the relationships, we plotted scatter graphs of cumulative fire points within kilometer grids against each factor, except for the vegetation category (
Figure 4a–f). Due to the magnitude of fire point counts, we applied a logarithmic transformation. For the vegetation category, we created histograms and conducted an analysis (
Figure 4g).
These results indicate that as the FFDI increases, the fire points become denser, with the natural logarithm of fire point count also increasing. Specifically, higher FFDI values were associated with a greater likelihood of forest fires and more fire points. When the lightning intensity was set at 0, there were still a number of fire points present. Between the lightning intensities of 3 kA and 45 kA, there was a substantial number of fire points, with a relatively stable relationship between their natural logarithms. In the 45–90 kA range, both the number of fire points and their natural logarithms decreased. As the lightning intensity continued to rise, both the fire points and their natural logarithms decreased even further.
Regarding population density, as it gradually increases from 0, the corresponding natural logarithm of fire points initially remains high but starts to decline after reaching 300 people/km². When the distance to rivers was too great, there were very few fire points. However, when the distance exceeded 6000 m, both the number of fire points and their natural logarithms decreased as the distance to rivers increased. When the distance was less than 6000 m, the fire point density increases with increasing distance from the rivers.
For roading density, when it exceeded 3 km per kilometer grid, there were very few fire points. When the roading density exceeded 2 km, both the number of fire points and their natural logarithms decreased. When the roading density was less than 1.5 km, there was no significant change in the relationship between the roading density and both the number of fire points and their natural logarithms. For altitudes below 1000 m, there were relatively few fire points. Between 1000 m and 1500 m, the number of fire points significantly increased. The fire points peaked between 1500 m and 2500 m, after which the natural logarithm of fire points noticeably decreased. Beyond 3500 m, the fire points decreased with increasing altitude.
Regarding the vegetation category, areas with coniferous forests had the largest fire point coverage, approximately 3614 km² (10-year cumulative), representing 65% of the total fire point area, significantly surpassing the 46.3% forest area covered by coniferous forests. When the vegetation categories were shrubland, broadleaf forests, and grassland, their fire point coverage percentages were 11.8%, 6.4%, and 6.1%, respectively, all significantly lower than their respective proportions in the forest area. Cultivated vegetation areas accounted for 10.7% of the total fire point area, exceeding the 7.3% of the forest area occupied by cultivated vegetation.
3.2. Grading of Indicators
Based on the relationship between the fire risk indicators and fire occurrences and utilizing the data from all kilometer grid cells from 2010 to 2019 within the study area, we established a classification standard for forest fire risk levels (
Table 7). This classification ranges from one to ten [
58,
68,
69], referencing the relevant literature and exploration of different indicators.
3.3. Construction of the CFFRI
3.3.1. Subjective Weight Analysis for the Indicator System
We investigated the pairwise importance of hazard factors, the hazard-formative environment, and the hazard-affected body with the AHP in terms of the quantity and characteristics of fire indicators (
Table 8). Meteorological factors are the primary factors influencing forest fire occurrences, making hazard factors the most crucial. Following that are the hazard-formative environment and vulnerable elements. The weightings for these three factors were determined through the construction of the judgment matrix, resulting in weights of 0.575, 0.343, and 0.082, respectively. Utilizing the AHP for hierarchical ranking and consistency testing of the matrix model data, when the order of the matrix was three, RI = 0.58, and CR < 0.1, meeting the consistency testing requirements.
Subsequently, based on the hazard factor criterion, we explored the importance of the FFDI and lightning intensity. Since the FFDI can better represent its relationship with forest fires, while lightning intensity contributes relatively less to the ignition of forest fires, we assigned weights of 0.90 to the FFDI and 0.10 to lightning intensity when constructing the judgment matrix (
Table 9). This reflects the significant impact of the FFDI on the CFFRI compared to lightning intensity. These weightings have been tested and found to meet the requirements of hierarchical ranking and consistency testing.
Finally, based on the hazard-formative environment criterion, we explored the relative importance of population density, distance to rivers, roading density, and altitude. From the relationships between these factors and the number of fire points, it was evident that these four factors do not exhibit purely positive or negative effects on the fire point counts (
Table 10). There existed an intermediate value with a relatively significant influence. However, the population density, distance to rivers, and altitude showed significant correlations with the fire points. Taking into account the above analysis and practical considerations, we constructed a judgment matrix to determine the weights as follows: population density 0.365, distance to rivers 0.147, roading density 0.097, and altitude 0.391. When the matrix order was four, RI = 0.9, and CR < 0.1, meeting the consistency testing requirements.
Among all indicators within the three criteria, the FFDI had the most significant influence on the results, accounting for more than half at 51.8%. The altitude and population density closely followed, with their impact on the CFFRI being relatively equal, constituting from 12% to 14% of the total impact. The vegetation category came next, contributing 8.2%. The lightning intensity, distance to rivers, and roading density each accounted for 5.8%, 5%, and 3.3%, respectively (
Table 11).
3.3.2. Objective Weight Calculations of the Fire Indicators
Due to significant differences in the data of various indicators, conducting dimensionless processing on the raw data would result in substantial variations in the calculation results. Therefore, we employed the entropy method for weight calculations based on the pre-established risk level divisions for each indicator, with all risk levels ranging from one to ten. This approach allowed us to reduce numerical differences while preserving the characteristics of the data. The specific weight distributions were as follows: the FFDI and LI each contributed approximately 20%, PD and altitude around 15%, RD had a lower weight of 6%, and DtR and VC accounted for approximately 11% (
Table 12). This weighting reflects that the FFDI and LI have relatively lower entropy levels, indicating lower disorder in the information, higher differential coefficients, and greater utility of information. Consequently, they received higher weights, while the other indicators followed in descending order.
3.3.3. Weight Combination for the CFFRI
Based on the results of the two weight calculation methods, it was evident that the weights calculated using the entropy method are more balanced than those obtained through the AHP (
Figure 5). Notably, the indicators with significant differences in the results between the two methods are the FFDI and lightning intensity The former calculates objective weights based on the inherent differences in each type of data, while the latter involves subjective judgments and quantitative-level divisions by the researcher when calculating the weights. Both approaches have their advantages and drawbacks.
Subsequently, employing the principles of minimum relative entropy and the Lagrange multiplier method, we further computed the combined weights for the seven indicators based on the results of the AHP and the entropy method. These final weights were then incorporated into the calculation of the CFFRI:
where
and
represent the hazard factors, namely the FFDI and lightning intensity, respectively;
represents the vulnerable element, which is the vegetation category;
,
,
, and
stand for elements within the hazard-formative environment, representing the population density, distance to rivers, roading density, and altitude, respectively.
3.4. Forest Fire Risk from 2010 to 2019
According to the CFFRI, we utilized the assigned risk levels for the seven indicators to calculate the forest fire risk values in the study area for the years 2010–2019, with the risk scale ranging from zero to ten. Employing the natural breaks classification method, we established a classification table for the risk index (
Table 13). Based on this risk level table, we generated a fire risk distribution map for the study area (
Figure 6). High-risk areas were predominantly located in the central and southern parts of the study area, while moderately high-risk areas were mainly situated in the southern and southwestern regions. Medium-risk zones were prevalent in the eastern part of the moderately high-risk area. Moderately low-risk areas were primarily found in the northern and eastern sections of the study area, while low-risk areas were concentrated in the northeastern part. These different fire risk zones exhibited distinct boundaries.
3.5. Comparison and Validation between the CFFRI and the FFDI
We selected historical forest fire cases with relatively large burned areas and closely located fire incidents (
Table 14). The specific validation procedure involved calculating the average CFFRI from the start to the end of each wildfire event. This average CFFRI value was then compared to areas surrounding the fire-affected zone and also compared to the FFDI results. This comparison allowed us to determine whether the average CFFRI during the wildfire period was in a significantly higher state.
To ensure the accuracy of the fire risk location information, we needed to compare the fire risk index calculation results with the MCD64 fire data for the month of the fire incident. The data processing steps were as follows:
Calculate the FFDI for selected forest fire events and perform spatial interpolation.
Divide the interpolation results into five risk levels based on FFDI standards (
Table 15), which facilitates the comparison with the CFFRI.
Utilize the FFDI level values to compute the daily CFFRI values.
Determine the average FFDI and fire risk level, as well as the average CFFRI and risk level during the occurrence of the wildfires.
Compare the village coordinates with the MCD64 fire data to confirm the actual fire locations (
Figure 7). Compare the fire risk values at these locations with the areas unaffected by fires to assess the accuracy of the two calculation methods in relation to the actual fire incidents.
The results obtained from the CFFRI indicate that the overall fire risk values were relatively high, with 89% falling into the high-risk category, and 11% in the medium-risk category (
Table 16). This outcome is more consistent with the actual fire incidents compared to the FFDI, where the IV fire risk comprises 56% and the III fire risk makes up 44% of the total. To assess the accuracy of the fire risk calculation, we compared the fire risk values of burned pixels to those of unburned pixels. We calculated the probability of burned pixels having higher fire risk values than the surrounding unburned pixels. A higher probability indicates a more accurate fire risk calculation [
69].
Figure 8 illustrates that the CFFRI calculation results are predominantly positioned above the FFDI calculation results, suggesting that the CFFRI yields more instances where the fire risk values are higher than the surrounding pixel values, thereby achieving a higher level of accuracy. The calculated accuracy of the CFFRI results was 85%, while the FFDI results had an accuracy of 76%. This demonstrates that our proposed model, within the context of the data and calculation methods used in this study, outperforms the traditional FFDI. It further underscores the applicability of the CFFRI in this region.
5. Conclusions
Building upon the FFDI, we selected seven categories of influencing factors based on the RDST, encompassing hazard factors, the hazard-formative environment, and the hazard-affected body. We analyzed their interrelationships with forest fire occurrence and constructed a forest fire risk assessment index system and a composite forest fire risk index, which is the CFFRI. We computed fire risk from 2010 to 2019 and compared the results with historical disaster data against the traditional FFDI results. The main conclusions contain two aspects:
The FFDI is directly proportional to the number of forest fire points, while the other indicators show an increase in fire point count with increasing indicator values, up to a critical threshold. Beyond the threshold, an increase in indicator values results in a decrease in fire point count.
The CFFRI shows an overall high fire risk, with 89% falling into the high-risk category and 11% in the mid-high category. This is closer to the actual fire situation compared to the FFDI, which has 56% in the high-risk category and 44% in the mid-high category. The CFFRI yields more high-risk values than the surrounding grid cells, with an accuracy rate of 85%, while the FFDI has an accuracy rate of 76%, demonstrating a superior applicability of the CFFRI compared to the traditional FFDI in this region.