Next Article in Journal
Dissolved Iron from Steel Slag with Its Chelating Agent Promotes Seaweed Growth
Next Article in Special Issue
Slow and Steady Wins the Race: A Comparative Analysis of Standing Electric Scooters’ European Regulations Integrated with the Aspect of Forensic Traumatology
Previous Article in Journal
The Effect of High, Partial, and Low Multisensory Congruity between Light and Scent on Consumer Evaluations and Approach Behavior
Previous Article in Special Issue
An Analysis of Driving Behavior of Educated Youth in Bangladesh Considering Physiological, Cultural and Socioeconomic Variables
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Characteristics of Cyclist Crashes Using Polytomous Latent Class Analysis and Bias-Reduced Logistic Regression

1
Graduate School of Science and Engineering, Chuo University, Tokyo 112-8551, Japan
2
Department of Civil and Environmental Engineering, Chuo University, Tokyo 112-8551, Japan
*
Author to whom correspondence should be addressed.
Sustainability 2022, 14(9), 5497; https://doi.org/10.3390/su14095497
Submission received: 26 March 2022 / Revised: 27 April 2022 / Accepted: 29 April 2022 / Published: 3 May 2022

Abstract

:
Although the number of cyclist crashes is decreasing in Japan, the fatality rate is not. Thus, reducing their severity is a major challenge. We used a polytomous latent class analysis to understand their characteristics and bias-reduced logistic regression to analyze their severity. Specifically, 90,696 combinations and 139,955 cyclist accidents were divided into 17 classes. The variable contributing the most to the classification was the crash location. Common fatality risks included older age groups and rural areas, whereas other factors differed among crash locations. Median strips, stop signs, and boundaries between the sidewalk and roadway affected the severity of crashes at intersections. Moreover, the existence of a median strip, collision partner, and time period affected the severity of crashes between intersections. On the sidewalks, the fatality risk was higher when the front part of the bicycle was subjected to the collision.

1. Introduction

Cycling has various advantages, including the potential for diminishing obesity; promoting health; and reducing the noise, air pollution, and CO2 emissions associated with travel. However, cyclists have a higher risk of death than motor vehicle occupants. Therefore, understanding how to make cycling safer amid increasing rates of cycling is important to improving people’s well-being. There is a growing body of research examining the safety of cyclists and severity of cyclist crashes [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17]. Recently, Macioszek and Granà [18] analyzed the severity of cyclist injuries in cyclist–vehicle crashes. Using a binary logit model, they showed that the factors affecting the crash severity were the vehicle drivers’ genders and ages, driving under the influence of alcohol, vehicles exceeding the speed limit, cyclist ages, cycling under the influence of alcohol, the speed of the cyclist before the incident, vehicle type (truck), incident place (road), time of day, and incident type.
The characteristics with the potential to influence the severity studied by various authors (including the characteristics noted above) can be grouped into those concerning the bicycle, vehicle and vehicle driver, environment, and road infrastructure (including regulations and signs). Among these, the road infrastructure can be changed by policymakers to reduce the severity of accidents.
Reynolds et al. [19] reviewed studies on the impacts of road infrastructure on cyclist safety. Intersection studies have found that multilane roundabouts can significantly increase the risk to cyclists unless a separated cycle track is included in the design. Other studies have suggested that sidewalks and multi-use trails pose the highest risk, that major roads are more hazardous than minor roads, and that the presence of cyclist facilities is associated with the lowest risk. Helak et al. [20] suggested that simply providing a dedicated space for cyclists, such as a bike lane or a paved shoulder, did not reduce the severity of the sustained injuries in a crash with a motor vehicle. Goerke et al. [21] showed that increasing the miles of bike lanes was associated with a significant reduction in severe head injuries; moreover, the miles of bike lanes were not associated with any significant changes in mortality or mechanical ventilation days when adjusted for other factors.
The factors associated with the severity of cyclist crashes may differ across different cyclist crash patterns. By considering the systematic heterogeneity of cyclist crashes, researchers and practitioners can identify the most appropriate safety countermeasures for different cyclist crash types under different conditions. One way to account for the heterogeneous nature of this data is to use a latent class clustering approach. A latent class analysis can segment the cyclist crash data into mutually exclusive and exhaustive latent classes by assuming latent categorical variables. The class membership of each cyclist crash can then be inferred from the observed variables.
Kaplan and Prato [22] analyzed the patterns of cyclist–motorist crashes in Denmark and investigated their prevalence and severity. They obtained 13 distinguishable cyclist–motorist classes using a latent class clustering approach. The prevalent features allowing for differentiation of the latent classes were the speed limit, infrastructure type, road surface conditions, number of lanes, motorized vehicle precrash maneuvers, availability of cycle lanes, cyclist intoxication, and helmet-wearing behavior. Based on an analysis of these 13 cyclist crash patterns, three types of safety considerations were proposed. These safety considerations were related to network design and connectivity, road maintenance, and cyclist road behavior.
Prati et al. [23] identified subgroups of cyclist crashes in Italy and analyzed the different cyclist crash-type subgroups separately. Using a latent class analysis, a cyclist crash dataset was segmented into 19 classes. A logistic regression analysis was used to identify the associations between each class membership and the severity of the cyclist crashes. Finally, association rules were established for each latent class to uncover the factors associated with an increased likelihood of severity.
Myhrmann et al. [24] explored the factors related to the injury severity outcomes in single-cyclist crashes. Three latent classes were identified using a latent class-ordered probit model. They found that the likelihood of cyclist membership in such classes depended on the cyclist’s age and gender. Furthermore, the road geometry, maintenance level, and interactions between the road geometry and maintenance level affected the severity.
Samerei et al. [25] identified factors affecting motor vehicle–bicycle crashes in Victoria, Australia using a binary logit model and latent class clustering. They found the factors that increased the risk of fatalities and serious injuries of cyclists in all the clusters were being an elderly cyclist, not using a helmet, and dark conditions.
Liu et al. [26] analyzed the impacts of various factors on cyclist injury severity at intersection and non-intersection locations using a latent class clustering analysis and mixed logit models. The factors with significant impacts on cyclist injury severity at intersections included cyclists drinking alcohol; driving a van, bus, or single-unit truck; motorist fault; inclement weather; and dusk or dawn conditions. In contrast, those with significant impacts on cyclist injury severity at non-intersection locations included the cyclist gender, drivers drinking alcohol, vehicle speed, speeding, type of area (rural or urban areas), traffic control, and curved roads.
Despite the above, relatively few studies have used such crash classifications, and it is not clear which factors contribute the most to these classifications. In Japan, where the number of cyclist fatalities was the highest among all the Organization for Economic Co-operation and Development countries in 2009 (see. https://www.cyclehelmets.org/1258.html (accessed on 21 March 2022).), latent class analyses have not been conducted owing to data limitations. There have been analyses of each factor and its severity [27,28], but it remains unclear which factors are the most associated with a fatality risk. The impacts of various road infrastructure elements also remain unclear.
In addition, relatively few studies have examined and compared the different factors contributing to cyclist injury severity at different locations [26]. Furthermore, a fatality accident is a rare event; thus, ordinary logistic regression is inappropriate. Even with a large number of observations, the benefits of the loss functions that normal machine learning algorithms work with may be minimized, e.g., by the machine learning predicting that all of the inputs are of the majority class. The stronger the imbalance of the outcome, the more severe the bias in the predicted probabilities. In a previous study, a penalized maximum likelihood estimation was applied to cancel out this bias [29].
This study focused on cyclist crashes occurring in Japan between 2019 and 2020. To identify reliable and relevant subgroups of cyclist crashes, we used a polytomous latent class analysis. The polytomous manifest variables included categorical indicators corresponding to characteristics of the road infrastructure, cyclist(s), vehicle(s), and environmental conditions. Specifically, characteristics concerning the infrastructure (road type, stop sign, and crash location); cyclist (age of the cyclist and collision part); collision partner (trucks and obstacles); environment (surface conditions and weather); and time period were employed in the analysis, following Prati et al. [23]. Notably, data concerning cyclist injury severity were not included in the latent class analysis, as the cyclist injury severity is considered as an outcome of a crash. We showed that the crash location (intersection, single road, and other factors) contributes the most to the classification. Then, based on crash location, we investigated the cyclist injury severity (e.g., fatality or not) using a bias-reduced logistic regression analysis. This analysis allowed us to identify the factors significantly affecting cyclist crashes.

2. Materials and Methods

The crash data for the models were provided by the National Police Agency of Japan (NPA). The original database comprised 690,415 road accidents occurring from 2019 to 2020 on Japanese roads. The NPA database includes all the publicly available data for accidents in Japan but does not include data prior to 2019.
To narrow down the events to those pertinent to the current study, we extracted 139,955 accidents in which at least one cyclist was injured or killed. The NPA database distinguishes between road accidents resulting in injuries or fatalities (within 30 days); however, it does not distinguish between different levels of injury. Table 1 shows the descriptive statistics of the 15 categorical variables selected for this dataset: (1) accident location type, (2) stop sign, (3) restricted speed, (4) median strip, (5) boundary between sidewalk and roadway, (6) road type, (7) road alignment, (8) land uses, (9) surface conditions, (10) weather, (11) time period, (12) collision partner, (13) cyclist’s age, (14) collision part of the bicycle, and (15) severity of the cyclist crash. According to the Japanese Road Code, there are five types of roads: national roads, trunk roads managed by prefecture and municipality, and prefectural and municipal roads. Unfortunately, the speed of the motor vehicle involved in the incident, speed of the cyclist, and violation of the rules were not included in the dataset.
There are more than 9.1 billion possible combinations of such accidents, but in actuality, 90,698 combinations are observed. There are two levels of severity, crashes with and without fatalities. There are 5917 (0.9%) crashes with fatalities in all the accidents, and 807 (0.6%) in cyclist-related accidents. The fatality rate in cyclist-related accidents is lower than that in all the accidents; this is because pedestrians, which represent the most vulnerable group, are not the focus of this research.
These variables and their distributions are shown in Table 1. In relation to land use, the term DIDs refers to densely inhabited districts, which are a series of census districts with a population density of 4000 or more per square kilometer and a population of more than 5000.

Methods

To perform the polytomous latent class analysis, we used the poLCA package in R software [30,31]. The latent class model estimates the observed joint distribution of the categorical variables in a set of observations and approximates it as a weighted sum of a finite number R, which is based on a large multiway contingency table.
In this study, we selected the number of classes using the Bayesian information criterion (BIC). In general, the rule of “the smaller, the better” should be applied when evaluating fit indices. To ensure that the global maximum likelihood of the latent class model was achieved (rather than the local maximum log-likelihood), we re-estimated each model 5000 times and set 10 initial values of the class-conditional response probabilities; then, we saved the model with the greatest likelihood.
From the analysis, we could determine the amount of each category of each variable included in each class. Based on the chi-square goodness-of-fit tests and tests of the differences between the proportions, we ascertained the characteristics of each class based on whether the proportion was significantly higher (or lower) than the overall proportion (p < 0.001).
In this study, the most important variable contributing to the classification was the crash location (e.g., intersections, areas between intersections, and sidewalks). Therefore, we divided the data by crash location and analyzed the risk of death. However, as mentioned above, the risk of death was generally very small. Therefore, we employed Firth’s bias-reduced logistic regression to perform the fitting using pseudo-data representations [29]. To take advantage of bias-reduced logistic regression, we used an R package called brglm [32]. The bias-reduced estimator is second-order unbiased and has a smaller variance than the maximum likelihood estimator. Using the backwards stepwise method, we estimated the model that minimized the Akaike information criterion and analyzed the risk factors in fatality accidents.

3. Results

3.1. Polytomous Latent Clustering Analysis

3.1.1. Selection of Number of Classes

By increasing the number of class settings from two, the optimal number of classes with the smallest BIC value was obtained. As shown in Figure 1, poLCA software yielded 17 latent classes.

3.1.2. Estimation Results

We tested the differences between the proportions in the full sample and those in the class. The categories showing statistical significance for each variable are depicted in Figure 2.
As an initial matter, the crash location was detected in all classes: C1–8 were intersections (66.7%), C9–14 were between intersections (31.0%), and C15–17 were between intersections and sidewalks (2.3%). For each of these three locations, we renumbered the classes in descending order of the fatality accident rates in each class. Vertical items were classified according to the road structure, environment, driver, and cyclist status. The categories for which significant differences were found for each variable are shown. If the categories were spread across multiple classes, they were bundled together. C17 had the highest fatality rate, followed by C14, 13, and 12. The fatality rate for C17 was 3.1%, i.e., more than five times the fatality rate for the entire sample. The characteristics of each variable are as follows.
Among C1–8, where the stop sign variable appeared as a feature within the intersections, “NO” appeared as a feature in C1 and C5–8. The percentages of the observations by class for “NO” were 49.6% (total sample), 70.5% (C1), and 82.8%, 84.6%, 98.4%, and 86.8% (C5–8), respectively.
The restricted speed varied considerably, with “30 km/h” for C2–3, “40 km/h” for C9 and 14, “50 km/h” for C6 and 8, “40 km/h” and “50 km/h” for C7, “60 km/h” for C4–5 and 15, and “50 km/h” and “60 km/h” for C11. The percentages of the observations by class for each restricted speed were 15.0% (overall), 49.0% (C2), and 29.6% (C3) for “30 km/h”, 23.5% (overall), 73.3% (C7), 47.6% (C9), and 52.1% (C14) for “40 km/h”; 11.5% (overall), 26.1%, 25.8%, 37.9% (C6–8), and 35.5% (C11) for “50 km/h”; and 36.9% (overall), 50.4% (C4), 61.3% (C5), 56.5% (C11), and 58.6% (C15) for “60 km/h”.
Regarding a median strip, C3, 5, 10, and 15–17 were characterized by “None”; C4, 7, 9, 12, and 14 were characterized by “Paint”; and C8 and 11 were characterized by “Yes”. C10 was represented as a feature on the side, i.e., with less than the proportion of the total sample. The percentages of the observations by class for each median strip were 48.1% (overall), 77.1% (C3), 96.6% (C5), 20.8% (C10), 93.9%, 84.2%, and 90.5% (C15–17) for “None”; 39.4% (overall), 59.8% (C4), 62.6% (C7), 79.0% (C9), and 91.1% (C14) for “Paint”; and 11.8% (overall), 49.9% (C8), and 48.6% (C11) for “Yes”.
On the boundary between the sidewalk and roadway, C8 and 10 were characterized by “Guard rail”; C4, 6, 7, 11, and 12 were characterized by “Curb”; C3, 5, and 15–17 were characterized by “None, white line”; and C9 was characterized by “None”. C9 was represented as a feature on the side, i.e., with less than the proportion of the total sample. The percentages of the observations by class for each boundary between the sidewalk and roadway were 8.6% (overall), 21.5% (C8), and 23.1% (C10) for “Guard rail”; 57.5% (overall), 71.1% (C4), 82.0% (C6), 81.6% (C7), 82.9% (C11), and 81.6% (C12) for “Curb”; 17.1% (overall), 32.3% (C3), 33.3% (C5), 34.1%, 31.6%, and 32.0% (C15–17) for “White line”’ and 16.8% (overall) and 1.7% (C9) for “None”.
Regarding the characteristics of the road type, “National road” was detected in C11; “National and trunk-municipal road” was detected in C8; and “Municipal road” was detected in C2, 3, 5, 6, 12, and 14–17. C6, 12, and 14 were represented as a feature on the side with less than the proportion of the total sample. The percentages of the observations by class for each road type were 13.7% (overall), 47.8% (C8), and 41.6% (C11) for national road; 1.1% (overall) and 5.8% (C8) for “Trunk municipal road”; and 61.4% (overall), 86.0% (C2), 90.2% (C3), 96.1% (C5), 33.8% (C6), 40.1% (C12), 36.2%, 95.3%, 93.0%, and 95.6% (C14–17) for “Municipal road”.
Regarding alignment, only one feature was expressed, i.e., “Right curve flat” at C15. The percentages of the observations by class were 1.0% (overall) and 5.2% (C15) for the right curve flat.
The characteristic of the “Urban DID (Densely Inhabitant District) area” was classified into C1 and C2, whereas that in the “Rural area” was classified in C14. The percentages of the observations by class for each land use were 61.8% (overall), 83.2% (C1), and 87.8% (C2) for “Urban-DID” and 10.6% (overall) and 32.2% (C14) for “Rural”. The land uses were clearly characterized, with low-fatality values at C1 and 2 and high values at C14, suggesting that the fatality risk may be higher in rural areas.
Regarding the surface conditions, C3, 6, 12, 13, and 16 showed “Wet” features. The percentages of observations by class for “Wet” were 11.1% (overall), 97.5% (C3), 97.3% (C6), 97.8% (C12), 99.4% (C13), and 99.3% (C16). Regarding weather, C3, 6, 12, 13, and 16 had “Rainy” features, i.e., the same classes in which wet was expressed on the road surface. The percentages of observations by class for “Rainy” were 9.4% (overall), 83.2% (C3), 84.3% (C6), 82.8% (C12), 79.3% (C13), and 79.1% (C16).
The time period was detected only in C6 and 13, as “Nighttime”. The percentages of observations by class for “Nighttime” were 14.1% (overall), 30.0% (C6), and 29.0% (C13).
The characteristics of the collision partner were estimated as only “Bicycle” for C1; only “Motorcycle -125cc” for C2; “None” for C10, 13, and 17; and only “Kei-cars” for C14. The percentages of the observations by class for each collision partner were 3.9% (overall) and 100.0% (C1) for “Bicycle”; 4.1% (overall) and 11.1% (C2) for “Motorcycle -125cc”; and 3.1% (overall), 26.1% (C10), 43.4% (C13), and 42.8% (C17) for “None”. Hence, C1 consisted entirely of cyclist accidents. C2–8, 9, 11–12, and 14–16 consisted of “(Kei-)cars, Truck, and Motorcycles” as the collision partners, whereas C10, 13, and 17 consisted of “Bicycle, Pedestrian, Obstacles, and None”.
Similar to previous research, the cyclist’s ages were detected in some classes. C1 and 5 showed “0–24”, and C17 showed “75–“. C1 was represented as a feature on the side with less than the proportion of the total sample. The percentages of the observations by class for each cyclist year were 34.7% (overall), 13.5% (C1), and 51.9% (C5) for “0–24” and 10.3% (overall) and 21.1% (C17) for “75–“. Age “75–“ was shown as a feature only in C17 and had the highest fatality rate, suggesting that it may be a factor increasing the fatality risk.
Finally, in the collision part, C1 was indicated by “Front”, and C12 and 14 were indicated by “Rear”. The percentages of the collision part were 40.5% (overall) and 64.6% (C1) for “Front” and 3.6% (overall), 11.1% (C12), and 12.0% (C14) for “Rear”.

3.2. Bias-Reduced Logistic Regression

As the polytomous latent class analysis revealed that the variable contributing the most to the classification was the crash location, a bias-reduced logistic regression analysis was conducted for each crash location. Table 2 presents the estimated results. In the following section, only those judged to be statistically significant at the 5% significance level are mentioned. The asterisks *, **, and *** indicate that the coefficients are statistically different from zero at the 5%, 1%, and 0.1% levels, respectively.
In all crash locations, older age and “Rural areas” are identified as common factors increasing the fatality risk. In addition, “No stop sign” at an intersection is found to increase the fatality risk. According to this result, it is believed that the installation of stop signs has some effect on reducing fatality accidents, although such signs not being applicable may also increase the fatality risk. However, this is possibly because the majority of collision partners are pedestrians and because the cyclist is alone during the accident.
Regarding the road type, “Trunk prefectural road” and “Municipal road” have low risks of fatality accidents within and between intersections. In contrast, “Prefectural Road” is found to have a low risk of fatality at intersections but a high risk of fatality on sidewalks. In addition, the fatality risk on sidewalks is also higher for “Prefectural road”.
Regarding the median strip, “Paint and None” increase the risk of fatality accidents both within and between intersections. “Chatter bar” increases the fatality risk only at intersections. “High-brightness paint” decreases the fatality risk at intersections but, conversely, increases the fatality risk between intersections. However, there were only two fatalities at intersections with the “Chatter bar”, and only one fatality was observed between intersections with “High-brightness paint”. Therefore, it is possible that these results were obtained only by chance, and a more detailed analysis is required in the future.
At the intersection, the boundary between the sidewalk and roadway affects the severity of the injury. The fatality risk is higher when a “Guard rail” is installed. This result may be owing to the fact that cyclists cannot “escape” to the sidewalk when they are riding on the side of the roadway on a road where a guard rail is installed, even if they are in danger of colliding with a car.
The collision partner is an important factor for severity but only between intersections. The fatality risk is found to be high in all cases, except for when the collision partner was a “Bicycle, Pedestrian, or Motorcycle −125cc”, and the fatality risk was particularly high for “Large track, Obstacles, and Motorcycle 126+cc”. These results suggest that the intensity of the impact at the time of the accident has a significant effect on the risk of death.
Other findings include that the fatality risk is higher at “Nighttime” between intersections and when the collision part is the “Front” on sidewalks. Since vehicle recognition is delayed at “Nighttime” owing to darkness, both of these factors can be attributed to the inattention of the cyclist.

4. Discussion

4.1. Accident Classification

The results of the polytomous latent class analysis show that cyclist accidents in Japan can be classified into 17 classes. In previous studies, there were 13 cases found in Denmark and 19 in Italy [22,23]. Although the number of variables and the categories addressed are different among these studies, approximately 10,000–70,000 cyclist accidents were classified into less than 20 classes based on the BIC.
The most important contributing factor to the classification in this study is the crash location. Interestingly, this result is the same as that obtained in Italy. In contrast, in Denmark, the area (urban or rural) was the first dividing factor, and crash location was the third factor. In Japan, the area is also one of the factors for classification but is generally a more important factor for explaining the severity. In Italy, the area was not included as a variable. In Denmark and Italy, the rule is for drivers to drive on the right side of the road, whereas, in Japan, they drive on the left. However, this rule had no impact on the classification.

4.2. Intersection Accidents

The bias-reduced logistic regression analysis revealed that the factors increasing the fatality risk for cyclist accidents at intersections were “older cyclist”, “Rural area”, “National road”, “Chatter bar”, “Paint”, and “None” in the median strip and “No stop sign” and “Guard rail” in the boundary between the sidewalk and roadway. “High-brightness paint” in the median strip is also shown to be a factor reducing the fatality risk. However, the effectiveness of “High-brightness paint” in the median strip remains somewhat uncertain, as there are only a few sections where this system has been introduced, and only 109 corresponding traffic accidents were confirmed.
Shen et al. [14] found that urban junctions had a higher fatality risk for cyclists, although this study found that the fatality risk was lower in urban areas than in rural areas.
The installation of stop signs and median separations may reduce the risk of fatalities and should be promoted in the same manner as in other countries. However, the existence of a guard rail is another factor increasing the fatality risk. In Japan, cyclists often ride on sidewalks or the wrong way [33]. It is possible that the cyclists riding on sidewalks may be induced to do so by the guard rail. Reynolds et al. [19] mentioned that the existing research suggests that most studies on sidewalk-riding found that it is a very dangerous behavior for cyclists. The excessive segregation of vulnerable road users and motor vehicles can create risks. For example, roadside barriers can encourage motor vehicle drivers to reduce their attentiveness toward other road users.
It is important to reduce the complexity of conflict points. For example, grade-separated junctions segregate primary traffic streams by placing them on separate levels; this measure is used for roads with high traffic volumes. Other treatments to redesign junctions include changes in the angles between roads and/or measures to improve the lines of sight and eliminate objects obscuring approaches. The concept of “daylighting” refers to the removal of parking spaces in front of curbs around an intersection, thereby increasing the visibility for pedestrians and drivers. In this study, the median strip was a factor that both increased and decreased the fatality risk. However, the sample size used herein may be insufficient. In addition, our data do not reflect physical intersection structures such as normal crossroads, T-junctions, and roundabouts. Accordingly, further investigation is required.

4.3. Non-Intersection Accidents

The bias-reduced logistic regression analysis revealed that the factors increasing the fatality risks of cyclist accidents between intersections were “older cyclist”, “Rural area”, “National road”, “Paint”, and “None” and, in the median strip, “Trucks as the collision partner” and “Nighttime”. The factors identified for sidewalks include “older cyclists”, “Rural area”, and “Prefectural road”.
The increased risk of fatalities at night has been reported in several recent studies [21,25,26,34]. This higher risk of fatalities is owing to a variety of factors, such as delays in detecting hazards at nighttime owing to poor visibility relative to daytime and higher speeds of vehicles owing to lower traffic volumes.
Asgarzadeh et al. [20] showed that collisions with large vehicles generally increase the fatality risk in cyclist accidents; this result was also identified in this study as a factor that increased the fatality risk in cyclist accidents between intersections. This study also found that collisions with “Obstacles” and “Motorcycle 126+cc” also significantly increased the fatality risk. In collisions with “Obstacles”, the cyclist’s inattention to decelerate immediately before an accident may be a contributing factor to the strong impact of collisions, whereas, in collisions with “Motorcycle 126+cc”, the motorcyclist’s high acceleration may make accidents more likely to occur at high speeds.
The higher fatality risk found for cyclist accidents in non-intersection rural areas identified by Liu et al. [26] was also shared by this study. Liu et al. [27] found that the fatality risk of cyclist accidents was also high in urban areas. This study showed that the fatality risk was higher in a rural area, “Urban-non-DID area”, and “Urban-DID area” in that order and identified areas where measures were particularly needed to effectively reduce the fatality risk of cyclist accidents.
The fatality risk on sidewalks is higher than that on prefectural roads. Of the 319 accidents, 14 were fatal, indicating a high fatality risk. The reason for this may be that the corresponding surfaces are less well-maintained than those of national and municipal roads. Since one of the parts of a bicycle involved in collisions is the front, it is conceivable that the holes in such sidewalks could lead to serious accidents. Uneven roads may exist as a result of insufficient road maintenance. Therefore, promoting these improvements may reduce the risk of death. Unfortunately, we did not collect data regarding the surface pavement conditions of the sidewalks and roadways. Thus, further research is necessary.
Finally, at both intersections and non-intersections, the restricted speed, weather, and surface conditions did not affect the severity of cyclist accidents. This result is different from that of previous studies [4,11]. However, the reasons for this are unclear, owing to the impossibility or inability to obtain detailed or specific data for such analyses.

5. Conclusions

In this study, we analyzed and classified cyclist crashes occurring in Japan between 2019 and 2020. We employed a latent class analysis using categorical indicators related to the infrastructure, road users, vehicles, and environmental/time characteristics to classify cyclist crashes. We identified 17 classes of cyclist crashes with different causes, degrees of severity, and profiles of road users. The variable contributing the most to the classification was the crash location. Common fatality risks included older age groups and rural areas, whereas other factors differed among the crash locations. The existence of median strips, stop signs, and boundaries between sidewalk and roadways affected crashes at intersections. The median strip, collision partner, and time period affected crashes between intersection. On the sidewalks, the fatality risk was higher when the collision part of the bicycle was the front.
The results of this study suggest that it is important to implement countermeasures based on the crash location. At intersections, the installation of stop signs and median separations may reduce the risk of fatalities. It is common for cyclists to decelerate when approaching a stop sign and check to ensure that there is no vehicular or pedestrian cross-traffic before proceeding through the intersection rather than putting a foot down. Many different parties can be liable for a bike accident at a stop sign, including the cyclist.
This study showed that a median separation was one of the factors reducing the severity, and the installation of guard rails potentially increases the risk of death. Wang et al. showed that tree separation for non-motorists increased the injury severity of e-bicyclists [35]. It would be worthwhile to investigate other separation treatments that could better protect e-bicyclists from crashes.
The presence of bumps and obstacles, particularly on rural roads with low pedestrian traffic, has rarely been reported to road managers, which may lead to an increase in single-vehicle accidents. Regular road maintenance of the shoulders and other non-roadway sections is important.
However, this study has the following issues regarding the data and methods. The first challenge regarding the data limitations is the definition of severity. In this study, only two classifications were used: fatal or non-fatal. Fatalities are extremely rare, and future analyses by subdividing the injuries such as hospitalization or not may yield different results. In addition, the data collected in this study do not include detailed attributes of cyclists. Previous studies have analyzed the factors such as alcohol consumption, helmet use, and improper behaviors such as violations of the law [4,5,11]. It is important to take these factors into account in the analysis of the fatality risk. Furthermore, the lack of traffic flow data for motor vehicles, cyclists, and pedestrians is a major deficiency. For example, the installation or removal of guard rails and utility poles should be considered not only for cyclists, but also for pedestrians and motor vehicles. In Japan, there are many cases of cyclists running on the wrong side of sidewalks and roadways and automobiles parking in bicycle lanes. On urban roads without a bike lane, the vehicle speeds with and without cyclists were found to be negligible [36]. On-road bicycle lanes and parked cars also reduced the passing distance [37]. It is desirable to analyze these infrastructures on the severity combined with the mean and variances traffic volume by various road users and land use along roadsides data.
Regarding the methodology, latent class analysis provides the best fit when modeling the data but does not provide an indication of how well the clusters are separated. The popularity of such methods might be explained by the apparent implication that the results show clear (i.e., separate) clusters, which are then assumed to represent a qualitatively distinct group of people with an existing disease entity. However, it should be noted that the classes formed are relative to each other and do not necessarily have an inherent meaning.
Finally, Billot-Grasset et al. [15] pointed out that road user behavior influences each step in the chain of events leading to an accident. Ma et al. [16] showed that cyclists’ crash risks were directly predicted by risky cycling behaviors and cycling anger and that the effects of cycling anger, impulsiveness, and normlessness on crash risks were mediated by cycling behaviors. These psychological aspects of cyclists and drivers are important for understanding accidents. The development of bicycle spaces will ultimately lead to the creation of safe travel spaces for all road users [38]. Although underreporting biases has been noted [39], not simply the severity but also the probability of accidents need to be examined further.

Author Contributions

Conceptualization, Y.S. and M.T.; methodology, Y.S. and M.T.; software, Y.S and M.T.; validation, M.T. and D.S.; formal analysis, Y.S. and M.T.; investigation, Y.S.; resources, M.T.; data curation, Y.S.; writing—original draft preparation, Y.S. and M.T.; writing—review and editing, Y.S. and M.T.; visualization, Y.S. and M.T.; supervision, M.T.; and project administration, M.T. All authors have read and agreed to the published version of the manuscript.

Funding

This study received no external funding.

Data Availability Statement

Publicly available datasets were analyzed in this study. These data are available at https://www.npa.go.jp/publications/statistics/koutsuu/opendata/index_opendata.html (accessed on 21 March 2022).

Acknowledgments

We thank Yuki Ishisaka for helping with the data analysis.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Klop, J.R.; Khattak, A.J. Factors Influencing Bicycle Crash Severity on Two-Lane, Undivided Roadways in North Carolina. Transp. Res. Rec. 1999, 1674, 78–85. [Google Scholar] [CrossRef]
  2. Wang, Y.; Nihan, N.L. Estimating the risk of collisions between bicycles and motor vehicles at signalized intersections. Accid. Anal. Prev. 2004, 36, 313–321. [Google Scholar] [CrossRef]
  3. Kim, J.-K.; Kim, S.; Ulfarsson, G.F.; Porrello, L.A. Bicyclist injury severities in bicycle-motor vehicle accidents. Accid. Anal. Prev. 2007, 39, 238–251. [Google Scholar] [CrossRef] [PubMed]
  4. Eluru, N.; Bhat, C.R.; Hensher, D.A. A mixed generalized ordered response model for examining pedestrian and bicyclist injury severity level in traffic crashes. Accid. Anal. Prev. 2008, 40, 1033–1054. [Google Scholar] [CrossRef] [Green Version]
  5. Moore, D.N.; Schneider IV, W.H.; Savolainen, P.T.; Farzaneh, M. Mixed logit analysis of bicyclist injury severity resulting from motor vehicle crashes at intersection and non-intersection locations. Accid. Anal. Prev. 2011, 43, 621–630. [Google Scholar] [CrossRef]
  6. Yan, X.; Ma, M.; Huang, H.; Abdel-Aty, M.; Wu, C. Motor vehicle-bicycle crashes in Beijing: Irregular maneuvers, crash patterns, and injury severity. Accid. Anal. Prev. 2011, 43, 1751–1758. [Google Scholar] [CrossRef]
  7. Klassen, J.; El-Basyouny, K.; Islam, M.T. Analyzing the severity of bicycle-motor vehicle collision using spatial mixed logit models: A City of Edmonton case study. Saf. Sci. 2014, 62, 295–304. [Google Scholar] [CrossRef]
  8. Hu, F.; Lv, D.; Zhu, J.; Fang, J. Related Risk Factors for Injury Severity of E-bike and Bicycle Crashed in Hefei. Traffic Inj. Prev. 2014, 15, 319–323. [Google Scholar] [CrossRef]
  9. Rivara, F.P.; Thompson, D.C.; Thompson, R.S. Epidemiology of bicycle injuries and risk factors for serious injury. Inj. Prev. 2015, 21, 47–51. [Google Scholar] [CrossRef]
  10. Wang, C.; Lu, L.; Lu, J. Statistical analysis of bicyclists’ injury severity at unsignalized intersections. Traffic Inj. Prev. 2015, 16, 507–512. [Google Scholar] [CrossRef]
  11. Chen, P.; Shen, Q. Built environment effects on cyclist injury severity in automobile-involved bicycle crashes. Accid. Anal. Prev. 2016, 86, 239–246. [Google Scholar] [CrossRef] [PubMed]
  12. Silvano, A.P.; Koutsopoulos, H.N.; Ma, X. Analysis of vehicle-bicycle interactions at unsignalized crossings: A probabilistic approach and application. Accid. Anal. Prev. 2016, 97, 38–48. [Google Scholar] [CrossRef] [PubMed]
  13. Asgarzadeh, M.; Verma, S.; Mekary, R.A.; Courtney, T.K.; Christiani, D.C. The role of intersection and street design on severity of bicycle-motor vehicle crashes. Inj. Prev. 2017, 23, 179–185. [Google Scholar] [CrossRef] [Green Version]
  14. Shen, J.; Wang, T.; Zheng, C.; Yu, M. Determinants of Bicyclist Injury Severity Resulting from Crashes at Roundabouts, Crossroads, and T-Junctions. J. Adv. Transp. 2020, 2020, 6513128. [Google Scholar] [CrossRef]
  15. Billot-Grasset, A.; Amoros, E.; Hours, M. How cyclist behavior affects bicycle accident configurations? Res. Part F Traffic Psychol. Behav. 2016, 41, 261–276. [Google Scholar] [CrossRef]
  16. Ma, Y.; Zheng, Y.; Li, N.; Cheng, J. Personality and behavioral predictors of cyclist involvement in crash-related conditions. Int. J. Environ. Res. Public Health 2019, 16, 4881. [Google Scholar] [CrossRef] [Green Version]
  17. Fountas, G.; Fonzone, A.; Olowosegun, A.; McTigue, C. Addressing unobserved heterogeneity in the analysis of bicycle crash injuries in Scotland: A correlated random parameters ordered probit approach with heterogeneity in means. Anal. Methods Accid. Res. 2021, 32, 100181. [Google Scholar] [CrossRef]
  18. Macioszek, E.; Granà, A. The Analysis of the Factors Influencing the Severity of Bicyclist Injury in Bicyclist-Vehicle Crashes. Sustainability 2022, 14, 215. [Google Scholar] [CrossRef]
  19. Reynolds, C.C.; Harris, M.A.; Teschke, K.; Cripton, P.A.; Winters, M. The impact of transportation infrastructure on bicycling injuries and crashes: A review of the literature. Environ. Health 2009, 21, 8–47. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  20. Goerke, D.; Zolfaghari, E.; Marek, A.P.; Endorf, F.W.; Nygaard, R.M. Incidence and Profile of Severe Cycling Injuries After Bikeway Infrastructure Changes. J. Community Health 2020, 45, 542–549. [Google Scholar] [CrossRef]
  21. Helak, K.; Jehle, D.; McNabb, D.; Battisti, A.; Sanford, S.; Lark, M.C. Factors Influencing Injury Severity of Bicyclists Involved in Crashes with Motor Vehicles: Bike Lanes, Alcohol, Lighting, Speed, and Helmet Use. South. Med. J. 2017, 110, 441–444. [Google Scholar] [CrossRef] [PubMed]
  22. Kaplan, S.; Prato, C.G. Cyclist-motorist crash patterns in Denmark: A latent class clustering approach. Traffic Inj. Prev. 2013, 14, 725–733. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Prati, G.; De Angelis, M.; Marín Puchades, V.; Fraboni, F.; Pietrantoni, L. Characteristics of cyclist crashes in Italy using latent class analysis and association rule mining. PLoS ONE 2017, 12, e0171484. [Google Scholar] [CrossRef] [PubMed]
  24. Myhrmann, M.S.; Janstrup, K.H.; Møller, M.; Mabit, S.E. Factors influencing the injury severity of single-bicycle crashes. Accid. Anal. Prev. 2021, 149, 105875. [Google Scholar] [CrossRef] [PubMed]
  25. Samerei, S.A.; Aghabayk, K.; Shiwakoti, N.; Mohammadi, A. Using latent class clustering and binary logistic regression to model Australian cyclist injury severity in motor vehicle–bicycle crashes. J. Saf. Res. 2021, 79, 246–256. [Google Scholar] [CrossRef] [PubMed]
  26. Liu, S.; Lin, Z.; Fan, W. Investigating contributing factors to injury severity levels in crashes involving pedestrians and cyclists using latent class clustering analysis and mixed logit models. J. Transp. Saf. Secur. 2021, 13, 223–245. [Google Scholar] [CrossRef]
  27. Furukura, S. Manifestations of the Characteristics of Bicycle Accidents, Column, Bicycle Parking Improvement Centre. 2021. (In Japanese). Available online: https://onl.la/TPCfzNr (accessed on 22 March 2022).
  28. Nakai, H. What increases a risk of mortality from crossing collision (cyclists against vehicles)? In Proceedings of the 57th Annual Conference of the Japan Ergonomics Society, Mie, Japan, 25–26 June 2016; Volume 57, pp. 392–393. (In Japanese). [Google Scholar]
  29. Kosmidis, I.; Firth, D. Jeffreys-prior penalty, finiteness and shrinkage in binomial response generalized linear models. Biometrika 2021, 108, 71–82. [Google Scholar] [CrossRef]
  30. Bandeen-Roche, K.; Miglioretti, D.L.; Zeger, S.L.; Rathouz, P.J. Latent Variable Regression for Multiple Discrete Outcomes. J. Am. Stat. Assoc. 1997, 92, 1375–1386. [Google Scholar] [CrossRef]
  31. Linzer, D.A.; Lewis, J.B. poLCA: An R Package for Polytomous Variable Latent Class Analysis. J. Stat. Softw. 2011, 42, 1–29. [Google Scholar] [CrossRef] [Green Version]
  32. Kosmidis, I. brglm: Bias Reduction in Binary-Response Generalized Linear Models. R Package Version 0.7.2. 2021. Available online: https://cran.r-project.org/package=brglm (accessed on 22 March 2022).
  33. Hagita, K.; Yokozeki, T. Comparison of Bicycle Volumes and Accidents Concerning Direction of Bicycle on Space for Cycling. J. Jpn. Soc. Civ. Eng. Ser. D3 2019, 75, I_493–I_506. (In Japanese) [Google Scholar] [CrossRef]
  34. Liu, S.; Li, Y.; Fan, W. Mixed logit model based diagnostic analysis of bicycle-vehicle crashes at daytime and nighttime. Int. J. Transp. Sci. Technol. 2021, in press. [Google Scholar] [CrossRef]
  35. Wang, T.; Chen, J.; Wang, C.; Ye, X. Understand e-bicyclist safety in China: Crash severity modeling using a generalized ordered logit model. Adv. Mech. Eng. 2018, 10, 1687814018781625. [Google Scholar] [CrossRef] [Green Version]
  36. Schaefer, S.J.; Figliozzi, A.M.; Unnikrishnan, A. Evidence from Urban Roads without Bicycle Lanes on the Impact of Bicycle Traffic on Passenger Car Travel Speeds. Transp. Res. Rec. J. Transp. Res. Board 2020, 2674, 87–98. [Google Scholar] [CrossRef]
  37. Beck, B.; Chong, D.; Olivier, J.; Perkins, M.; Tsay, A.; Rushford, A.; Li, L.; Cameron, P.; Fry, R.; Johnson, M. How much space do drivers provide when passing cyclists? Understanding the impact of motor vehicle and infrastructure characteristics on passing distance. Accid. Anal. Prev. 2019, 128, 253–260. [Google Scholar] [CrossRef]
  38. Marshall, E.W.; Ferenchak, N.N. Why cities with high bicycling rates are safer for all road users. J. Transp. Health 2019, 13, 100539. [Google Scholar] [CrossRef]
  39. Shinar, D.; Valero-Mora, P.; Van Strijp-Houtenbos, M.; Haworth, N.; Schramm, A.; De Bruyne, G.; Cavallo, V.; Chliaoutakis, J.; Dias, J.; Ferraro, O.E. Under-reporting bicycle accidents to police in the COST TU1101 international survey: Cross-country comparisons and associated factors. Accid. Anal. Prev. 2018, 110, 177–186. [Google Scholar] [CrossRef]
Figure 1. Change in the Bayesian information criterion (BIC) as a function of the number of latent classes.
Figure 1. Change in the Bayesian information criterion (BIC) as a function of the number of latent classes.
Sustainability 14 05497 g001
Figure 2. Characteristics of the 17 classes.
Figure 2. Characteristics of the 17 classes.
Sustainability 14 05497 g002
Table 1. Variables and their distributions.
Table 1. Variables and their distributions.
VariableCategoryn%
Crash locationBetween intersections43,33731.0
Intersections93,36766.7
Sidewalks32512.3
Stop signNot applicable *47,86634.2
YES22,67216.2
NO69,41749.6
Restricted speed2025731.8
3020,96715.0
4032,88823.5
5016,10711.5
6069,42049.6
Median stripYes16,45211.8
High-brightness paint1700.1
Chatter bar4530.3
Postcorn4950.4
Paint55,09339.4
None67,29248.1
Boundary between sidewalk and roadwayGuard rail12,0158.6
Curb80,43757.5
White line23,95917.1
None23,54416.8
Road typeNational19,11313.7
Trunk- Prefectural19,22213.7
Trunk- Municipal15351.1
Prefectural14,18410.1
Municipal85,90161.4
AlignmentRight curve - up2920.2
Right curve - down3130.2
Right curve - flat14411.0
Left curve - up2270.2
Left curve - down2860.2
Left curve - flat12130.9
Straight - down26671.9
Straight - flat46943.4
Straight - up128,81992.0
Land uses(ref) Urban - DID86,53561.8
Urban - non DID38,52327.5
Rural14,89710.6
Surface conditionsDry124,27188.8
Wet15,50611.1
Frozen690.0
Snow cover390.0
Unpaved700.1
WeatherClear99,37471.0
Cloudy27,26919.5
Rainy13,0839.3
Foggy280.0
Snowy2010.1
Time periodAfter dawn43533.1
Daytime93,83367.0
Before sunset10,0867.2
After sunset10,4717.5
Nighttime19,76914.1
Before dawn14431.0
Collision partnerCars64,69746.2
Kei-cars31,52722.5
Large truck46433.3
Small/Medium truck16,49511.8
Motorcycle 126+cc11340.8
Motorcycle -125cc57354.1
Bicycle54383.9
Pedestrian51403.7
Obstacles7940.6
None43523.1
Cyclist’s age0–2448,60234.7
25–3417,23012.3
35–4416,59811.9
45–5417,09012.2
55–6412,1888.7
65–7413,8939.9
75–14,35410.3
Collision partNone27852.0
Front56,72940.5
Right43,64731.2
Rear49793.6
Left31,81522.7
Accident typeInjury139,14899.4
Fatality8070.6
Note: * Collision partner is not motor vehicles.
Table 2. Estimation results.
Table 2. Estimation results.
Crash Location (Fatality Rate)Intersections (0.4%)Between Intersections (0.8%)Sidewalks (3.0%)
VariableCoef.z Value FatalitynCoef.z Value FatalitynCoef.z Value Fatalityn
Cyclist’s age(ref) 0–24 4734,898 1212,846 1858
25–34−0.42−1.20 911,3600.150.31 654902.502.43*3380
35–44−0.45−1.22 810,6291.203.33***1855102.852.80**3418
45–540.491.92 2210,8321.695.15***3458201.901.99*5438
55–641.295.85***3678171.865.56***2940353.193.57***15336
65–741.9010.37***7989232.849.40***8045733.423.96***29397
75–2.7116.61***17889083.3911.56***15050223.434.01***43424
Land uses(ref) Urban-DID 18556,928 12027,671 241936
Urban-non DID0.342.77**11226,8130.573.91***8710,7721.163.50***25938
Rural1.047.61***8296261.3610.03***12248941.876.23***50377
Road type(ref) National 9812,724 836152 1237
Trunk-Prefectural−0.48−2.88**5712,340−0.36−2.04*596497−0.06−0.05 1385
Trunk-Municipal−0.20−0.46 510870.801.61 44250.420.24 -23
Prefectural−0.39−2.17*489083−0.30−1.61 5147822.272.30*14319
Municipal−0.81−5.81***17158,133−0.95−5.79***13225,4811.471.59 832287
Median strip(ref) Yes 3210,279 315887
High-brightness paint−0.44−6.91*** 1091.872.12*159
Chatter bar1.412.12*22281.001.67.3224
Postcorn1.121.68.2306−0.60−0.42 -183
Paint0.532.65**13532,9580.572.83**19821,113
None0.874.36***20849,4870.461.98*9615,871
Stop sign(ref) Yes 2322,507
No1.416.63***33668,926
Not applicable *2.106.90***201934
Boundary between sidewalk and roadway(ref) Guard rail 727250
Curb−1.03−7.37***21953,166
White line−1.22−6.14***4816,390
None−1.58−7.28***3916,561
Collision partner(ref) Bicycle 22274
Cars 1.402.20*7616,986
Kei-cars 1.392.15*447964
Large truck 2.974.63***611736
Small/Medium truck 1.973.06**444683
Motorcycle 126+cc 2.403.21**6375
Motorcycle -125cc 0.360.44 31736
Pedestrian 0.520.67 43743
Obstacles 2.804.16***17629
None 2.473.88***723185
Time period(ref) Except for nighttime 21136,687
Nighttime 1.4511.78***1186650
Crach part(ref) Except for front 421875
Front 1.264.91***571376
Intercept −7.22−22.16*** −9.2913.06*** −11.57−8.53***
AIC42103004442
AIC (null)49333869888
Number of samples93,36743,3373251
The asterisks *, **, and *** indicate that the coefficients are statistically different from zero at the 5%, 1%, and 0.1% levels, respectively.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Sekiguchi, Y.; Tanishita, M.; Sunaga, D. Characteristics of Cyclist Crashes Using Polytomous Latent Class Analysis and Bias-Reduced Logistic Regression. Sustainability 2022, 14, 5497. https://doi.org/10.3390/su14095497

AMA Style

Sekiguchi Y, Tanishita M, Sunaga D. Characteristics of Cyclist Crashes Using Polytomous Latent Class Analysis and Bias-Reduced Logistic Regression. Sustainability. 2022; 14(9):5497. https://doi.org/10.3390/su14095497

Chicago/Turabian Style

Sekiguchi, Yuta, Masayoshi Tanishita, and Daisuke Sunaga. 2022. "Characteristics of Cyclist Crashes Using Polytomous Latent Class Analysis and Bias-Reduced Logistic Regression" Sustainability 14, no. 9: 5497. https://doi.org/10.3390/su14095497

APA Style

Sekiguchi, Y., Tanishita, M., & Sunaga, D. (2022). Characteristics of Cyclist Crashes Using Polytomous Latent Class Analysis and Bias-Reduced Logistic Regression. Sustainability, 14(9), 5497. https://doi.org/10.3390/su14095497

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop