Next Article in Journal
Optimising Health Emergency Resource Management from Multi-Model Databases
Next Article in Special Issue
Ecological Cooperative Adaptive Control of Connected Automate Vehicles in Mixed and Power-Heterogeneous Traffic Flow
Previous Article in Journal
The Incoherence of Deep Isotropic Neural Networks Increases Their Performance in Image Classification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Adaptive Kernel Density Estimation for Traffic Accidents Based on Improved Bandwidth Research on Black Spot Identification Model

School of Automotive and Traffic Engineering, Jiangsu University, Zhenjiang 212013, China
*
Author to whom correspondence should be addressed.
Electronics 2022, 11(21), 3604; https://doi.org/10.3390/electronics11213604
Submission received: 6 October 2022 / Revised: 28 October 2022 / Accepted: 2 November 2022 / Published: 4 November 2022
(This article belongs to the Special Issue Intelligent Traffic Control and Optimization)

Abstract

:
At present, the total length of accident blackspot accounts for 0.25% of the total length of the road network, while the total number of accidents that occurred at accident black spots accounts for 25% of the total number of accidents on the road network. This paper describes a traffic accident black spot recognition model based on the adaptive kernel density estimation method combined with the road risk index. Using the traffic accident data of national and provincial trunk lines in Shanghai and ArcGIS software, the recognition results of black spots were compared with the recognition results of the accident frequency method and the kernel density estimation method, and the clustering degree of recognition results of adaptive kernel density estimation method were analyzed. The results show that: the accident prediction accuracy index values of the accident frequency method, kernel density estimation method, and traffic accident black spot recognition model were 14.39, 16.36, and 18.25, respectively, and the lengths of the traffic accident black spot sections were 184.68, 162.45, and 145.57, respectively, which means that the accident black spot section determined by the accident black spot recognition model was the shortest and the number of traffic accidents identified was the largest. Considering the safety improvement budget of 20% of the road length, the adaptive kernel density estimation method could identify about 69% of the traffic accidents, which was 1.13 times and 1.27 times that of the kernel density estimation method and the accident frequency method, respectively.

1. Introduction

The road transportation network is a key factor in promoting the development of the world economy. It is fast, efficient, reliable, and flexible, but it is also the most dangerous means of transportation. According to statistics from the World Health Organization, the world’s annual traffic accidents cause more than 1.35 million deaths and more than 50 million injuries, and traffic accidents are also the main cause of death for residents aged 15–29 years [1]. As a major public safety issue, traffic accidents not only caused serious casualties, but also brought huge economic losses to the society. In order to improve road safety, many countries have invested a lot of resources in the management of traffic accidents, but the effect has been minimal. Accident-prone sections are called black spots, in foreign countries, which refers to sections or intersections where the number of accidents is obviously prominent on the road [2,3,4]. Generally, it occupies a small amount of road mileage, but the concentration of traffic accidents is relatively large. Identifying accident black spots and taking corresponding measures can be used as an effective way to prevent traffic accidents [5]. The study of accident black spot identification methods can provide theoretical support for black spot identification research, which is of great significance for reducing road traffic accidents and improving road traffic safety.
Accurately identifying black spots in traffic accidents is a decisive link in improving road traffic safety. At present, many methods have been used to identify black spots in traffic accidents, mainly including the accident number method, the accident rate method, the probability theory mathematical statistics method, the quality control method, the cumulative frequency curve method, genetic algorithm and information distribution technology, etc. [5,6,7,8,9]. These methods usually divide the entire highway into section units (usually 1 km) to identify black spots in traffic accidents. Thomas et al. pointed out that the use of a fixed length to identify black spots in traffic accidents may lead to deviations in the positions of the black spots, causing errors between the recognition results and the actual length of the road section [10]. At the same time, Meng Xianghai also pointed out that the use of fixed length to identify accident black spots would affect the accuracy of the results, and proposed the use of the sliding window method to identify them [11]. However, as the research continues to deepen, other researchers have found that although the sliding window method can improve the accuracy of the identification of accident black spots, it will also cause the length of the road section identified by the accident black spot to dismatch the actual road section length, thereby affecting the recognition accuracy [12,13]. Koorey et al. also discussed the impact of using variable length on the identification of traffic accident blackspots, and confirmed that choosing an appropriate segmentation method has a significant impact on reducing false positives and negatives in the results of traffic accident black spot recognition [14]. Because the clustering algorithm can not only identify accident black spots, but also analyze the causes of accident black spots, it has attracted much attention in the identification of traffic accidents [15,16,17]. However, the clustering algorithm also has certain limitations. The black spots of the same type of clustered traffic accidents obtained by clustering have the same degree of danger, which does not match the actual situation, which is that each accident black spot has a different degree of danger. Nuclear density estimation can reduce the severity of accident blackspots, and has gradually become a research hotspot in traffic accident black spots [18,19,20,21]. However, using a fixed bandwidth to identify black spots in an accident will also affect the accuracy of the results. At the same time, most of the current studies use the frequency of traffic accidents as the identification index and do not consider the degree of danger of the target road section.
The purpose of this paper is to solve the problem that the degree of accident danger caused by the current clustering algorithm does not match with the current situation and thereby improve the accuracy of fixed broadband accident identification results. Taking the traffic accident data of Shanghai National Highway as the research object, this paper combined the adaptive kernel density estimation method and GIS data to identify the traffic accident black spots on the basis of considering the road danger degree and compared the results with those for the accident frequency method and nuclear density estimation method, based on the crash predictive accuracy index (CPAI) and road repair and maintenance cost limit indicators (RRMCLI) to verify and evaluate the effectiveness of the adaptive kernel density estimation method. Based on the traffic data of Shanghai trunk highways, the traffic characteristics and traffic accident characteristics were analyzed to provide supportive data for the construction of the black spot identification model of urban trunk highways and the comparative analysis of the identification results. The results show that the self-adaptive kernel density estimation based on the improved bandwidth used in this paper was superior to the previous accident blackspot recognition model.

2. Study Area and Data Collection

In order to obtain accurate traffic accident data and determine the spatial location of traffic accidents, the Shanghai National Highway was used as the data collection location. As a provincial-level administrative region and municipality directly under the Central Government, Shanghai is located in the Yangtze River Delta, bounded between 120°52′ to 122°12′ east longitude and 30°40′ to 31°53′ north latitude. The city has 16 districts with a total area of 6340.5 square kilometers. The total length of traffic infrastructure is approximately 13,500 km, of which, approximately 972 km comprises highways, and approximately 1812 km comprises national and provincial trunk roads, as shown in Figure 1. Shanghai as one of the major cities generating Chinese financial revenue, not only because of its unique geographical advantages, but also because of its developed road traffic conditions. The transportation of goods and import and export commodities has made the traffic volume of Shanghai trunk highways gradually increase, and the urban population and traffic flow have grown, leading to the frequent occurrence of road traffic accidents.
The Shanghai Traffic Safety Integrated Service Management Platform records accident information, such as the location, time, and type of traffic accidents. Therefore, the traffic accident data studied in this article are mainly derived from the comprehensive traffic safety service management platform, a total of 3 years (2017–2019) of traffic accident data on national and provincial trunk lines in Shanghai, including the number of injured, the number of deaths, and the direct economic loss, as shown in Table 1. According to statistics in Table 1, there were a total of 2031 traffic accidents in Shanghai in the past 3 years, of which 477 were injured, 108 were killed, and the direct economic and property losses were about 2.793 million yuan.
At the same time, this study also conducted a statistical analysis of traffic accidents in different administrative regions of Shanghai, as shown in Table 2. It can be found in Table 2 that although the maximum number of traffic accidents in Pudong New Area was 420, the injury rate and death rate were only 0.269 and 0.067, respectively. However, 44 traffic accidents occurred in Huangpu District, but the injury rate and death rate of traffic accidents there were the highest at 0.409 and 0.205, respectively.

3. Recognition Methods

3.1. Adaptive Kernel Density Estimation

The kernel density estimation method places a symmetrical surface at each traffic accident point, and by using the density method to define an arbitrary segment consistent with the entire road network, thereby identifies traffic accident black spots [20]. Its expression is:
f ( x ) = i = 1 n 1 n h k ( x x i h )
In Equation (1): f ( x ) is the nuclear density value at point x ;   n   is the number of sample points whose distance from the center x is less than or equal to h ; h is the bandwidth (search radius); k ( x ) is the kernel function, using the distance between the sample data point x i and x to determine the effect when x i estimating the density of point x ; x x i is the distance between event point i and event point x .
The adaptive kernel density estimation law is a natural extension of the standard kernel density estimation method, also known as the variable kernel density estimation method, which is obtained when h is no longer a global constant [14]. Its application principle relative to accident black spots is that, in the section with high accident density, relatively small bandwidth is selected to reflect the local characteristics of the accident, and in the section with low accident density, relatively large bandwidth is selected to reflect the overall characteristics of the accident. Then, the influence of outliers on the estimation value is reduced. At present, most traffic accident black spot identification methods use a fixed length of road to identify traffic accident black spots. This cannot reflect the real-time changes of traffic accidents and identify accident black spots accurately. In order to objectively select the optimal bandwidth coefficient and improve the accuracy of blackspot recognition in traffic accidents, this paper proposes to construct an adaptive kernel density estimation and improve the fixed bandwidth so that the bandwidth value changes with the change of each sample data point.

3.2. Kernel Function Selection

Commonly used kernel functions mainly include uniform, Gaussian, triangular, quadratic (Epanechnikov), and others [22]. According to the expressions of different kernel functions in Table 3, the uniform, triangular, and quadratic kernel functions intersect the x axis, and the Gaussian kernel function does not intersect the x axis, as shown in Figure 2.
In addition, this paper analyzes the influence of the kernel function on the estimated value of nuclear density by fitting the kernel density estimation curve of the above-mentioned commonly used kernel function. The curve fitting of different kernel functions is shown in Figure 3. It can be seen from Figure 3 that the kernel function has basically no effect on the kernel density estimate. Only the kernel density estimation curve obtained by using uniform kernel and quadratic kernel at the peak and boundary of the curve is different from the curves of the other two kernel functions, but the smoothness of the kernel density estimation curve in any other area is almost the same. At the same time, according to the research conclusions of relevant scholars, the determination of kernel function has little influence on the final estimation of kernel density and is not the main link between calculation and solution. In addition, under the condition that the research object has a large number of sample data, accurate estimation results of kernel density can be obtained by selecting any kernel function [23]. Although the kernel function has little influence on the kernel density estimation, the mean square error of the quadratic kernel function is the best. Therefore, the quadratic kernel function is selected as the kernel function for calculating the nuclear density of traffic accidents.

3.3. Calculation of Adaptive Bandwidth

The key to adaptive kernel density estimation lies in the choice of bandwidth, which directly affects the accuracy of the results [24]. Among them, if the bandwidth is too large, the detailed information of the data cannot be reflected; on the contrary, the overall characteristics of the data also cannot be reflected. Therefore, in order to reflect the overall characteristics and detailed characteristics of the data as much as possible, this article adopts the interpolation method and measures the quality of the bandwidth based on the mean square error (MSE). When the mean square error is the smallest, the bandwidth is the optimal bandwidth; that is, the optimal bandwidth can be obtained from the minimum value of the mean square error (MSE) between Equation (1), the kernel density f ( x ) value, and the true density f ( x ) , and its expression is as follows:
M S E ( h ) = E M I S E ( h ) = E [ f ( x ) f ( x ) ] 2 = V a r ( f ( x ) ) + B i a s ( f ( x ) ) 2
On one hand, the variance is inversely proportional to the bandwidth h , that is, the smaller the bandwidth, the greater the variance. On the contrary, the squared deviation is proportional to the sum h 2 , that is, the smaller the bandwidth, the smaller the squared deviation. Therefore, the bandwidth h can be optimized by minimizing the integral mean square error. The expression is as follows:
M I S E ( h ) = E { [ f ( x ) f ( x ) ] 2 d x } = V a r ( f ^ ( x ) ) d x + B i a s 2 ( f ^ ( x ) ) d x
Due to E ( x ) = + x f ( x ) d x and f ^ ( x ) = 1 n h i = 1 n k ( x x i h ) ,we can get
E [ f ^ ( x ) ] = E [ 1 n h i = 1 n k ( x x i h ) ] = [ 1 h k ( x y h ) f ( y ) ] d y
Suppose t = x y h ,then y = x h t , substituting it into Equation (4), we can get:
M I S E = V a r ( f ^ ( x ) ) d x + B i a s 2 ( f ^ ( x ) ) d x     = 1 n h [ k 2 ( t ) d t ] [ f ( x ) d x ] + 1 4 h 4 [ t 2 k ( t ) d t ] 2 [ f ( x ) ] 2 d x
Due to f ( x ) d x = 1 , so
M I S E = 1 n h [ k 2 ( t ) d t ] + 1 4 h 4 [ t 2 k ( t ) d t ] 2 [ f ( x ) ] 2 d x = A M I S E ( h ) + o ( 1 n h + h 4 )
In Equation (6), A M I S E ( h ) is called the asymptotic mean square error for simplifying the equation, omitting the infinitesimal term. The final formula is as follows:
A M I S E ( h ) = k 2 ( t ) d t n h + h 4 t 4 [ f ( x ) ] 2 d x 4
By deriving Equation (7) = 0, we can get:
h o p t = t 2 / 5 { k 2 ( t ) d t } 1 5 { f ( x ) 2 d x } 1 5 n 1 5
Meanwhile, the bandwidth h is the optimal value. The function f ( x ) is unknown from Equation (8). For this reason, f ( x ) can use normal distribution to estimate the value of f ( x ) 2 d x . Assuming that the data of traffic accidents occurring on the road, as a whole obeys a normal distribution, the unknown function f ( x ) can be expressed as:
f ( x ) 2 d x = σ 5 φ ( x ) 2 d x = 3 8 π 1 2 σ 5
Meanwhile, the values of functions k 2 ( x ) d x and k 2 k ( x ) d x are directly related to the selection of specific functions. When the kernel function is selected, the values of k 2 ( x ) d x and k 2 k ( x ) d x can be determined by calculation. When the kernel function is a quadratic kernel function, the optimal bandwidth is shown in Equation (10):
h o p t = ( 8 π ) 1 5 σ n 1 5 = 1.7 σ n 1 5
In the equation: σ is the sample standard deviation; n is the total number of traffic accidents.
Let the differential of Equation (3) be zero, then the adaptive bandwidth can be calculated. The mathematical formula is as follows:
h ( x ) = { f ( x ) k 2 ( x ) d x } 1 5 { f ( x ) 2 d x } 1 5 n 1 5
In the formula: f ( x ) is the actual nuclear density value.
Since the actual kernel density value f ( x ) is unknown, this section uses the kernel density estimate to replace the actual kernel density value, and substituting the quadratic kernel function into Equation (11), the adaptive bandwidth can be calculated as follows:
h ( x ) = h o p t 9 5 c 1 5 f ( x ) 1 5 ( i = 1 n 1 n h o p t t 2 k ( t ) f ( x ) ) 2 5
In Equation (12): c = f ( x ) d x = 3 8 π 1 2 h 5
By substituting Equation (12) into Equation (1), the adaptive kernel density estimation is calculated as follows:
f ( x ) * = 1 n h ( x ) i = 1 n K ( x x i h ( x ) )
In Equation (13): f ( x ) * is the adaptive kernel density estimate;   n is the total number of traffic accidents; h ( x ) is the bandwidth; K is the kernel function; x i is the coordinates of the accident point i ; x is the center point coordinates.

3.4. Road Hazard Index

For different sections, even if the number of traffic accidents is the same, the danger degree of the location is also different because of the different danger degree of specific traffic accidents. Therefore, in order to distinguish the degree of risk of traffic accidents in different locations, the Road Hazard Index (RHI) is introduced. Through weight assignment, the number of traffic accidents, the number of minor injuries, the number of serious injuries, the number of fatalities, and the direct economic loss in traffic accidents are quantified one by one to determine the risk index of the target road section. The formula for calculating the road hazard index is as follows:
R H I = w 1 I + w 2 S + w 3 D + w 4 P D O + T N T A
In Equation (14): I is the number of minor injuries in traffic accidents; S is the number of serious injuries in traffic accidents; D is the number of deaths in traffic accidents; P D O is the economic losses in traffic accidents; T N T A is the total number of traffic accidents; w 1 ,   w 2 ,   w 3 ,   w 4 are the weight coefficients of the number of minor injuries, the number of serious injuries, the number of deaths, and the economic loss, respectively.
The selection of w 1 ,   w 2 ,   w 3 ,   w 4 is related to the impact of traffic accidents on society. In a traffic accident, the casualties need to be considered first, followed by economic losses. So, w 3 > w 2 > w 1 > w 4 . Drawing lessons from the long-term research of domestic and foreign scholars, the weight of the number of minor injuries is 0.5; the weight of severe injuries is 1; the weight of deaths is 3; and the weight of economic loss is 1/30,000 [25], namely:
R H I = 0.5 I + 1 S + 3 D + 1 / 30000   P D O + T N T A
Introducing the road hazard index using the formula of the adaptive kernel density estimation method and combining the quadratic kernel function and bandwidth, the mathematical formula of the traffic accident black spot recognition model is as follows:
f ^ ( x ) = 1 n h ( x ) i = 1 n K ( x x i h ( x ) ) R H I
k ( x ) = { 3 ( 1 x 2 ) 4 0 | x | 1 | x | > 1
In Equation (17): f ^ ( x ) is the estimated value of the kernel density of a certain road section; h o p t is the optimal bandwidth; n is the total number of traffic accidents; K is a quadratic kernel function.

4. Result Analysis

4.1. Black Spot Recognition in Traffic Accidents

Through the POI retrieval interface of the digital map API, crawler technology was used to obtain the WGS-1984 coordinate information of eac traffic accident point and convert its coordinates to the corresponding projection coordinates. At the same time, the projection coordinate information was imported into ArcGIS software for analysis, and then the traffic in Shanghai was obtained. The spatial distribution map of the traffic accident points is shown in Figure 4.
Equation (14) was used to calculate the road hazard index. The minimum value of the road risk index was 1, the maximum value was 17, the average value was 1.58, and the standard deviation was 1.53. At the same time, the optimal bandwidth was calculated to be 458 m using Equation (10). Based on Equation (13), the adaptive kernel density estimation of the national and provincial trunk lines in Shanghai was calculated and Arc-Map 10.2 was used to identify black spots in traffic accidents, as shown in Figure 5. In Figure 5, the greener the color, the smaller the estimated value of nuclear density, that is, the lower the risk of road traffic accidents; the redder the color, the higher the estimated value of nuclear density, that is, the higher the risk of road traffic accidents. The research results show that there were 46 traffic accident black spots, and the estimated value of traffic accident nuclear density was relatively high, with a standard deviation of 1.907, and maximum adaptive kernel density estimate of 189.36.

4.2. Method Comparison

In order to verify the effectiveness of the proposed black spot identification model, the accident frequency method, kernel density estimation method, and traffic accident identification model were compared. In the process of identifying blackspots in traffic accidents, the search bandwidth was set to 458 m and the blackspot identification results of the two methods are shown in Figure 6. According to Figure 6a, 59 traffic accident black spots were identified using the accident frequency method, with a standard deviation of 0.867 and a maximum traffic accident density of 14.32. According to Figure 6b, a total of 51 traffic accident black spots were identified using the kernel density estimation. At the black spot of traffic accidents, the standard deviation was 1.109, and the maximum traffic accident density was 15.04. Through analysis, it can be found that, compared with the accident rate and nuclear density estimation, the standard deviation of the adaptive nuclear density estimation method was relatively large, which is consistent with the results of Boroujerdian Amin, that is, the larger the variance, the better the recognition effect [26].
In addition, the first 6 black dots identified by different traffic accident identification black dot methods were sorted, as shown in Table 4. Among the 6 black spots, the maximum kernel density estimated value recognized by adaptive kernel density estimation was 189.36, and the average length of the road section was 226 m, with a maximum road section length of 249 m, and an estimated value of kernel density of 162.43. For the nuclear density estimation method and the accident frequency method, the maximum nuclear density estimation value and the maximum density value were 176.96 and 173.62, respectively.

4.3. Method Testing and Evaluation

4.3.1. Accuracy Test

At present, many methods have been used to test the accuracy of accident black spot identification, such as hit rate index, recovery rate index, and prediction accuracy index. Among them, hit rate is the most commonly used validity test indicator, that is, the ratio of the total number of traffic accidents that occurred in a certain fixed period of time in the identified road section with black spots in the future to the total number of traffic accidents on the target road section. Although the usage of the hit rate index is not only simple and easy to understand, it is easily affected by the total length of the road section and the number of traffic accidents in the study area. For example, when all roads in the entire area to be studied are identified as road sections with black spots in traffic accidents, the hit rate of the effectiveness test index is 100%. However, this method and the recognition results are meaningless to road safety researchers. In view of this, prediction accuracy indicators are also used to test the quality of hot spots. Predictive accuracy index (PAI) refers to the hit rate and the percentage of the area or the percentage defined as the hot research area [27]. This index was later modified by Thakali to measure the predictive ability of collision accidents, and is called the crash predictive accuracy index (CPAI) [26]. However, traffic accidents occur on the road and are road-binding. Therefore, this study adopted the length of the road covered by black spots and the total length of the road in the study area, and introduced CPAI to test the validity of the black spot recognition model of traffic accidents. The mathematical formula expression is as follows:
C P A I = n / N l / L × 100 %
In the formula:   n is the number of traffic accidents in the black spot section of traffic accidents; N is the total number of traffic accidents in the study area; l is the length of the black spot section of traffic accidents, and the unit is m ;   L is the total length of roads in the study area, and the unit is m .
In this study, CPAI was used to test the predictive performance of the three methods (accident frequency, nuclear density estimation, and the traffic accident black spot identification model). The comparison results of different methods are shown in Table 4. It can be seen from Table 5 that the CPAI values of the accident frequency method, the nuclear density estimation method, and the traffic accident identification model were 14.39, 16.36, and 18.25 respectively, and the lengths of the black spots of the traffic accidents were 184.68, 162.45, and 145.57 respectively. From a practical point of view, based on the results of CPAI research, it is shown that compared with the other two methods, the black spot recognition model proposed in this paper had the best recognition effect. The main reason is that the CPAI value of this method was the largest, that is, the identified black spots of traffic accidents were the shortest, accounting for 5.48% of the total length of the road network, and the number of identified traffic accidents was 514, accounting for 26.4% of the total number of traffic accidents. At the same time, the accident frequency method had the worst recognition effect (the CPAI value was the smallest), and the length of the road section with black spots in traffic accidents was 184.68, accounting for 6.95% of the total length of the road network.

4.3.2. Evaluation of Identification Methods

From the perspective of applicability, the road repair and maintenance cost limit indicators were used to evaluate the identification effects of different methods. That is, the number of traffic accidents or the largest traffic accident density identified in the shortest road section length was the best recognition effect of this method [13]. The relationship between the total percentage of traffic accidents and the percentage of road length for different traffic accident black spot identification methods is shown in Figure 7. In Figure 7, when considering the safety improvement budget of 20% of the road length, compared with the other two methods, the traffic accident black spot recognition model proposed in this paper could identify about 69% of traffic accidents, namely nuclear density estimation and accidents, at 1.13 times and 1.27 times that of the frequency method, respectively. In addition, when using this method, the length of the road that contained 70% of the traffic accidents accounted for 18.6% of the total length. Therefore, the constructed traffic accident black spot recognition model could identify black spots more effectively and accurately, and could maximize the utilization of road resources under the condition of limited road maintenance and maintenance budgets.

5. Conclusions

(1)
General
  • Through the collection and analysis of traffic accident data on Shanghai’s national and provincial trunk lines, a method for identifying black spots in traffic accidents based on improved bandwidth adaptive kernel density estimation was proposed and road hazard indexes were added as identification parameters to construct traffic accident black spot recognition model. On this basis, using ArcGIS software, the black spot recognition results were compared with the accident frequency method and the nuclear density estimation method, and it is concluded that the adaptive nuclear density estimation method had the highest degree of clustering.
  • Using CPAI and RRMCLI to verify and evaluate the effectiveness of the three identification methods, it is concluded that the CPAI of the adaptive nuclear density estimation method was 18.25, which was higher than the accident frequency method and the nuclear density estimation. At the same time, considering the safety improvement budget of 20% of road length, the adaptive kernel density estimation method could identify about 69% of the number of traffic accidents, which was 1.13 times and 1.27 times of the kernel density estimation method and the accident frequency method, respectively.
(2)
Innovation
Based on the kernel density estimation theory, the adaptive kernel density estimation method was applied to the field of traffic accident black spot recognition. Combined with the weight assignment method, the number of traffic accidents, the number of minor injuries, the number of serious injuries, the number of deaths, and the direct economic losses were quantified one by one to establish the traffic accident black spot recognition model, which improved the recognition accuracy of the accident black spot.
(3)
Deficiencies and Prospects
  • The analysis results have reference value for road safety management and control in Shanghai, and can be further applied to the design of road safety improvement schemes. Meanwhile, the proposal and application of the adaptive kernel density estimation method in traffic accident black spots is conducive to the further expansion of black spot identification methods.
  • This study only used bandwidth and road hazard index as the main parameters when identifying traffic accident black spots and did not consider that the impact of highway spatial density may be highly related to road network structure and road parameters on accident black spots, such as lane width, road linearity, etc. In addition, the impact of different traffic flows at different sections of the road on the identification of black spots in traffic accidents needs to be further studied.

Author Contributions

Resources, W.Z.; Supervision, H.G. and L.Z.; Writing—original draft, L.D. and M.H.; Writing—review & editing, M.H. All authors have read and agreed to the published version of the manuscript.

Funding

This article is supported by Natural Science Foundation of China under grant 51905224 and Anhui Province Communications Holding Group Co., Ltd. under grant JSKY-2019-01.

Acknowledgments

Thanks to all the authors for their efforts on this article. Natural Science Foundation of China and Anhui Province Communications Holding Group Co., Ltd. are gratefully funded.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Sadeghi, A.; Farhad, H.; Moghaddam, A.M.; Qazizadeh, M.J. Identification of accident-prone sections in roadways with incomplete and uncertain inspection-based information: A distributed hazard index based on evidential reasoning approach. Reliab. Eng. Syst. Saf. 2018, 178, 278–289. [Google Scholar] [CrossRef]
  2. Chen, W. Analysis of road sections with frequent traffic accidents. J. Chin. People’s Public Secur. Univ. 2004, 2, 83–88. [Google Scholar]
  3. Ali, Y.; Zheng, Z.; Haque, M.M. Modelling lane-changing execution behaviour in a connected environment: A grouped random parameters with heterogeneity-in-means approach. Commun. Transp. Res. 2021, 1, 100009. [Google Scholar] [CrossRef]
  4. Nadimi, N.; Sheikh Hosseini Lori, E. Applying different analytic methods to determine black spots in two-lane highways. J. Transp. Saf. Secur. 2022, 14, 1395–1418. [Google Scholar] [CrossRef]
  5. Montella, A. A comparative analysis of hotspot identification methods. Accid. Anal. Prev. 2010, 42, 571–581. [Google Scholar] [CrossRef]
  6. Plug, C.; Xia, J.C.; Caulfield, C. Spatial and temporal visualisation techniques for crash analysis. Accid. Anal. Prev. 2011, 43, 1937–1946. [Google Scholar] [CrossRef]
  7. Gregoriades, A.; Mouskos, K.C. Black spots identification through a Bayesian Networks quantification of accident risk index. Transp. Res. Part C Emerg. Technol. 2013, 28, 28–43. [Google Scholar] [CrossRef]
  8. Ambros, J.; Havránek, P.; Valentová, V.; Křivánková, Z.; Striegler, R. Identification of Hazardous Locations in Regional Road Network—Comparison of Reactive and Proactive Approaches. Transp. Res. Procedia 2016, 14, 4209–4217. [Google Scholar] [CrossRef] [Green Version]
  9. Prieto, F.; Gomez-Deniz, E.; Sarabia, J.M. Modelling road accident blackspots data with the discrete generalized Pareto distribution. Accid. Anal. Prev. 2014, 71, 38–49. [Google Scholar] [CrossRef] [Green Version]
  10. Thomas, I. Spatial data aggregation: Exploratory analysis of road accidents. Accid. Anal. Prev. 1996, 28, 251–264. [Google Scholar] [CrossRef]
  11. Meng, X.; Zhang, D.; Wu, P. The influence of sliding window length and sliding step length on the identification of accident-prone sections. Transp. Syst. Eng. Inf. 2018, 18, 148–155. [Google Scholar]
  12. Ghadi, M.; Török, Á. Comparison Different Black Spot Identification Methods. Transp. Res. Procedia 2017, 27, 1105–1112. [Google Scholar] [CrossRef]
  13. Elyasi, M.R.; Saffarzade, M.; Boroujerdian, A.M. A novel dynamic segmentation model for identification and prioritization of black spots based on the pattern of potential for safety improvement. Transp. Res. Part A Policy Pract. 2016, 91, 346–357. [Google Scholar] [CrossRef]
  14. Koorey, G. Road Data Aggregation and Sectioning Considerations for Crash Analysis. Transp. Res. Rec. J. Transp. Res. Board 2009, 2103, 61–68. [Google Scholar] [CrossRef] [Green Version]
  15. Ghadi, M.; Torok, A. A comparative analysis of black spot identification methods and road accident segmentation methods. Accid. Anal. Prev. 2019, 128, 1–7. [Google Scholar] [CrossRef]
  16. Yao, S.; Loo, B.P.; Yang, B.Z. Traffic collisions in space: Four decades of advancement in applied GIS. Ann. GIS 2016, 22, 1–14. [Google Scholar] [CrossRef]
  17. Nie, K.; Wang, Z.; Du, Q.; Ren, F.; Tian, Q. A Network-Constrained Integrated Method for Detecting Spatial Cluster and Risk Location of Traffic Crash: A Case Study from Wuhan, China. Sustainability 2015, 7, 2662–2677. [Google Scholar] [CrossRef] [Green Version]
  18. Harirforoush, H.; Bellalite, L. A new integrated GIS-based analysis to detect hotspots: A case study of the city of Sherbrooke. Accid. Anal. Prev. 2019, 130, 62–74. [Google Scholar] [CrossRef]
  19. Bíl, M.; Andrášik, R.; Sedoník, J. A detailed spatiotemporal analysis of traffic crash hotspots. Appl. Geogr. 2019, 107, 82–90. [Google Scholar] [CrossRef]
  20. Flahaut, B.; Mouchart, M.; San Martin, E.; Thomas, I. The local spatial autocorrelation and the kernel method for identifying black zones. A comparative approach. Accid. Anal. Prev. 2003, 35, 991–1004. [Google Scholar] [CrossRef]
  21. Shafabakhsh, G.A.; Famili, A.; Bahadori, M.S. GIS-based spatial analysis of urban traffic accidents: Case study in Mashhad, Iran. J. Traffic Transp. Eng. 2017, 4, 290–299. [Google Scholar] [CrossRef]
  22. Kemp, C.D.; Silverman, B.W. Density Estimation for Statistics and Data Analysis. J. R. Stat. Soc. Ser. D 1987, 36, 495. [Google Scholar] [CrossRef]
  23. Tang, X.; Luo, M. The optimal choice of non-parametric kernel density estimation kernel function. J. Qiannan Teach. Coll. Natl. 2019, 39, 16–19. [Google Scholar]
  24. Bil, M.; Andrasik, R.; Janoska, Z. Identification of hazardous road locations of traffic accidents by means of kernel density estimation and cluster significance evaluation. Accid. Anal. Prev. 2013, 55, 265–273. [Google Scholar] [CrossRef] [PubMed]
  25. Peng, Y. Identification and Cause Analysis of Black Spots in Mountain Expressway Accidents; Chang’an University: Xi’an, China, 2017. [Google Scholar]
  26. Thakali, L.; Kwon, T.J.; Fu, L. Identification of crash hotspots using kernel density estimation and kriging methods: A comparison. J. Mod. Transp. 2015, 23, 93–106. [Google Scholar] [CrossRef]
  27. Ulak, M.B.; Ozguven, E.E.; Vanli, O.A.; Horner, M.W. Exploring alternative spatial weights to detect crash hotspots—ScienceDirect. Comput. Environ. Urban Syst. 2019, 78, 101398. [Google Scholar]
Figure 1. Location and traffic network of Shanghai. (a) Shanghai location. (b) Shanghai traffic arterial network map.
Figure 1. Location and traffic network of Shanghai. (a) Shanghai location. (b) Shanghai traffic arterial network map.
Electronics 11 03604 g001
Figure 2. Comparison of different kernel functions.
Figure 2. Comparison of different kernel functions.
Electronics 11 03604 g002
Figure 3. Fitting curves of kernel density estimation for different kernel functions.
Figure 3. Fitting curves of kernel density estimation for different kernel functions.
Electronics 11 03604 g003
Figure 4. Distribution of Traffic Accidents in Shanghai (2017–2019).
Figure 4. Distribution of Traffic Accidents in Shanghai (2017–2019).
Electronics 11 03604 g004
Figure 5. Identification results of traffic accident black spots by adaptive kernel density estimation method. (a) Recognition result graph. (b) Black spot road network distribution.
Figure 5. Identification results of traffic accident black spots by adaptive kernel density estimation method. (a) Recognition result graph. (b) Black spot road network distribution.
Electronics 11 03604 g005
Figure 6. Accident black spot identification results of the selected comparison method. (a) Recognition result of accident frequency method. (b) Recognition result of kernel density estimation.
Figure 6. Accident black spot identification results of the selected comparison method. (a) Recognition result of accident frequency method. (b) Recognition result of kernel density estimation.
Electronics 11 03604 g006
Figure 7. The relationship between the percentage of traffic accidents and the percentage of road length.
Figure 7. The relationship between the percentage of traffic accidents and the percentage of road length.
Electronics 11 03604 g007
Table 1. Statistical description of traffic accidents of different severity in Shanghai 2017–2019.
Table 1. Statistical description of traffic accidents of different severity in Shanghai 2017–2019.
YearTotal Number of AccidentsNumber of InjuredDeath TollDirect Economic Loss [Yuan]Injury RateMortality Rate
2017864290591,867,8830.3360.068
201861810129492,4730.1630.047
20195508620432,5600.1560.036
Total20314771082,792,9160.2350.053
Table 2. Descriptive statistics of traffic accidents in different administrative regions.
Table 2. Descriptive statistics of traffic accidents in different administrative regions.
Administrative DistrictTotal Number of AccidentsNumber of InjuredDeath TollDirect Economic Loss [Yuan]Injury RateMortality Rate
Baoshan District177471254,6000.2660.068
Chongming District414023,5000.0960
Fengxian District183414315,8000.2240.022
Hongkou Distric255046000.2000
Huangpu District44189110,0000.4090.205
Jiading District1703810192,7000.2240.059
Jingshan District12417133,0000.1370.008
Jingan District537470000.1320.075
Minhang District189546347,9000.2860.032
Pudong New Area420113281,218,5350.2690.067
Putuo District3613483810.3610.111
Qingpu District248358164,7000.1410.0322
Songjiang District1545319238,7000.3440.123
Xuhui District10016013,5000.160
Yangpu District439044000.2090
Changning District247316,0000.2920.125
Table 3. Commonly used kernel functions.
Table 3. Commonly used kernel functions.
Kernel FunctionsExpressions
Uniform k ( x ) = { 1 / 2 0    [ 1 , 1 ] o t h e r
Gaussian k ( x ) = 1 2 π σ e x 2 / 2 σ 2
Triangular k ( x ) = { 3 ( 1 x 2 / a 2 ) 4 a 0   | x | a | x | a
Epanechnikov k ( x ) = { 3 ( 1 x 2 ) 4 0   | x | 1 | x | > 1
Table 4. Comparison of Results of Different Traffic Accident Black Spot Identification Methods.
Table 4. Comparison of Results of Different Traffic Accident Black Spot Identification Methods.
Serial NumberAdaptive Kernel Density EstimationKernel Density EstimationAccident Frequency Method
Section Length [m]Kernel Density EstimateKernel Density EstimateDensity Value
1226189.36176.96173.62
2214176.91172.34161.45
3249162.43160.59143.78
4152134.61113.21101.48
5106109.2890.3686.43
683102.8383.4764.81
Table 5. Comparison of different identification methods.
Table 5. Comparison of different identification methods.
MethodTraffic Accident on the Black SpotTotal Traffic AccidentLength of Black SpotsTotal Road LengthCPAI
Accident frequency method4081947184.68265714.39
Kernel density estimation4711947162.45265716.36
Black spot recognition model5141947145.57265718.25
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Ge, H.; Dong, L.; Huang, M.; Zang, W.; Zhou, L. Adaptive Kernel Density Estimation for Traffic Accidents Based on Improved Bandwidth Research on Black Spot Identification Model. Electronics 2022, 11, 3604. https://doi.org/10.3390/electronics11213604

AMA Style

Ge H, Dong L, Huang M, Zang W, Zhou L. Adaptive Kernel Density Estimation for Traffic Accidents Based on Improved Bandwidth Research on Black Spot Identification Model. Electronics. 2022; 11(21):3604. https://doi.org/10.3390/electronics11213604

Chicago/Turabian Style

Ge, Huimin, Lei Dong, Mingyue Huang, Wenkai Zang, and Lijun Zhou. 2022. "Adaptive Kernel Density Estimation for Traffic Accidents Based on Improved Bandwidth Research on Black Spot Identification Model" Electronics 11, no. 21: 3604. https://doi.org/10.3390/electronics11213604

APA Style

Ge, H., Dong, L., Huang, M., Zang, W., & Zhou, L. (2022). Adaptive Kernel Density Estimation for Traffic Accidents Based on Improved Bandwidth Research on Black Spot Identification Model. Electronics, 11(21), 3604. https://doi.org/10.3390/electronics11213604

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop