1. Introduction
Car-following is a crucial technology of intelligent driving systems. The car-following model with high accuracy and strong generalization ability is of great significance for driving safety, alleviating the psychological pressure of drivers in urban expressway congestion traffic flow, and lightening the operating burden of drivers on expressways.
Existing car-following models mainly include mathematical car-following models and data-driven car-following models according to the modeling method [
1]. The mathematical car-following model is a quantitative analysis of car-following behavior based on the observation of it and careful consideration of vehicle dynamics, road conditions, and other factors to deduce an equation with clear physical parameters [
2]. It has strong interpretability and two major problems.
(1) For different road types and traffic flows, they can only be applied after calibration, and their accuracy and generalization ability are limited [
3]; and (2) with the increase in factors considered, the complexity of the model increases dramatically [
4].
The data-driven car-following model is based on the trajectory data of the car-following. It takes advantage of data science and machine learning and investigates its internal rules through the study and summary of the data. Taking time as the boundary, data-driven car-following models mainly include traditional machine-learning, deep learning, and deep reinforcement learning car-following models.
The traditional machine-learning car-following model benefits from the rapid development of machine learning algorithms and the reduction of the difficulty of obtaining high-fidelity data, which establishes a foundation for developing the data-driven car-following model [
5].
The deep learning car-following model mainly adopts recurrent neural network (RNN) and its improved algorithm. Zhou et al. [
6] introduced the first RNN car-following model and compared it with the intelligent driver model (IDM), the results show that the RNN model can better reproduce the traffic oscillation and distinguish driver styles in traffic oscillation. Wang et al. [
7] used the gated recurrent unit (GRU) of the RNN to develop a car-following model to predict the speed of the following vehicle at the next time step and compared it with the backpropagation neural network (BPNN) and IDM models through simulation. The results indicate that the GRU model has higher prediction accuracy. Wang et al. [
8] established a deep neural network car-following model, the model performance with historical input information at different time scales was compared. The results imply that the car-following model should properly embed the long memory effect. Huang et al. [
9] proposed an LSTM car-following model to study asymmetric driving behavior. The NGSIM database was used to conduct trajectory simulation and comparison between the RNN, full velocity difference, and LSTM models. The results show that the LSTM model had a better ability to capture asymmetric driving behavior. Ma et al. [
10] proposed a sequence-to-sequence learning-based car-following model, which not only considered the memory effect but also the response delay. Through comparison experiments with the IDM and LSTM models, it was proved that this model could better reproduce car-following trajectories and heterogeneous driving behaviors.
The deep deterministic policy gradient (DDPG) algorithm is widely applied in the deep reinforcement learning car-following strategy because it is adept at addressing decision problems in continuous action space [
11]. Zhu et al. [
12] proposed a human-like autonomous car-following framework based on the DDPG algorithm. They used experimental data to conduct comparative experiments on the RNN, IDM, and DDPG models. The results show that the DDPG model has high prediction accuracy and good generalization ability. Then, Zhu et al. [
13] combined a collision avoidance strategy with the DDPG algorithm to develop a speed control model for the following vehicle and used the model predictive control-based adaptive cruise control (MPC-ACC) model for comparative verification. The results show that the vehicles controlled by the DDPG model are superior to human drivers and the MPC-ACC model in terms of safety, comfort, and efficiency. Li et al. [
14] proposed a deep reinforcement learning-based autonomous car-following decision-making strategy to improve the satisfaction and acceptance of automatic driving, and carried out extensive simulation experiments to validate the model’s effectiveness and accuracy.
The data-driven car-following model has promoted car-following development, and many achievements have been made. However, the following problems still exist.
(1) About 80% of the models adopt the NGSIM database, whose representativeness needs further verification [
15]. (2) Existing data-driven car-following models are all trained and tested through a single database or driving scenario, and there is little analysis of their stability and generalization ability. (3) The car-following scenario in which human-driven and ACC vehicles are mixed on the freeway lacks attention. (4) The car-following model based on the DDPG algorithm excessively depends on the design of reward function, whose performance is unstable for different driving styles and traffic flows [
11].
Therefore, we propose a novel car-following model combining a convolutional neural network (CNN) with the LSTM network that adapts between different car-following scenarios, road types, and traffic flows to provide more accurate vehicle dynamics than any prior model that we are aware of. The primary contributions of this study are: (1) Developing a new car-following model for congested traffic flow and mixed traffic flow; (2) providing a calibrated model which can achieve car-following simulation in multiple traffic flow types and share only one group of parameters; and (3) proving the accuracy and generalization ability of the proposed model. The rest of this paper is organized as follows.
Section 2 introduces the data and the car-following trajectories data extraction standards.
Section 3 develops the CNN-LSTM car-following model, which includes the combination of a CNN and the LSTM and model confifiguration.
Section 4 carries out experiments and discussions, including calibration of models, model accuracy comparison, and generalization ability analysis.
Section 5 concludes our findings.
2. Data Preparation
The natural driving database of urban expressways was extracted by the UTE team of Southeast University using UAVs to shoot videos over four urban expressways [
16]. Because the vehicles in dataset 2 are in a free-flow state, the duration of the car-following state is short; dataset 4 lacks vehicle number information, which cannot match the corresponding state parameters of vehicles. In this paper, datasets 1 and 3 are selected as the data sources of urban expressways. See
Table 1 for the parameters of the database and
Figure 1 for lane distribution.
OpenACC is an open database of car-following experiments to study the properties of commercial ACC systems [
17], and the data of the first two experiments are used in this paper. Its parameters are shown in
Table 2. For more information about the database, please refer to the research literature [
17].
According to the following standards [
7], 400 car-following periods were extracted from the two databases, and the cumulative duration is 3.682 × 10
4 s.
(1) The leading vehicle and the following vehicle are in the same lane and the longitudinal gap is less than 100 m to prevent the following vehicles from running freely.
(2) To ensure that the vehicles do not change lanes or make sharp turns, the lateral distances’ difference between the leading vehicle and the following vehicle is less than 1.5 m.
(3) Each car-following period lasts for more than 30 s to ensure that the vehicle is in a stable car-following state.
4. Experimental Results and Discussion
The drive to develop an excellent car-following model arises from the need to analyze the traffic flow effects of proposed road network changes. Most car-following models in current use can be described as formula (8) [
20]. The framework overview of the proposed model is shown in
Figure 5. It obtains a better mapping relationship among the dynamic parameters by learning and summarizing the car-following trajectory data. The vehicle state changes with its acceleration, and the vehicle location can be updated according to formula (4).
where
is the reaction time, and
, and
are parameters that need to be estimated.
To verify the performance of the proposed model, we compare it with the IDM and LSTM models based on the same experiment. To our knowledge, the LSTM model is the best deep learning car-following model nowadays. IDM is a widely used mathematical car-following model [
8]. It assumes that acceleration is a continuous function of the velocity, gap, and velocity difference. IDM can be described as formulae (9).
where
is the acceleration calculated by IDM,
denotes the expected speed of the following vehicle,
is the safe gap calculated by IDM at time
t,
denotes the safe distance of the vehicle at rest (its default value is 2 m),
is the acceleration index (its default value is 4),
denotes the time headway, and
are the maximum acceleration and comfortable deceleration.
4.1. Calibration of Models
We randomly selected 70% (280 periods) of the extracted data to train the CNN-LSTM car-following model, and the remaining data were used as the test dataset. The cross-validation method is employed to calibrate the memory time step, prediction time step, number of LSTM layers, number of full connection layers, number of neurons, batch size, and epochs. The calibration results are shown in
Table 3. The LSTM and CNN-LSTM models adopt the same configuration and parameters, as shown in
Table 4. Different road conditions, traffic flows, and drivers’ driving styles lead to different parameter calibration results of the IDM. Therefore, they are all an interval rather than a constant, as shown in
Table 5.
4.2. Accuracy
The accuracy of the model means that the model can efficiently study and summarize the driver’s driving habits and predict future driving behavior by learning the car-following trajectory data. The car-following model with high accuracy can reproduce complex traffic phenomena and promote the research of traffic simulation.
The statistical results of 100 test periods’ errors are shown in
Table 6. The speed is low and the acceleration variation is considerable in the natural driving database, which can reflect the drivers’ features during car-following. The speed is high, the acceleration variation is slight, and the data quality is high in the OpenACC database. Therefore, the simulation errors of the three models in the OpenACC database are significant. The error distribution range and the mean of the IDM are more significant than the other two, which indicates that the neural network models have strong stability. Although there are differences in road conditions and traffic flow types, compared with the IDM and LSTM models, the CNN-LSTM model reduces the mean MSE of speed simulation by 76.0% and 55.3%, respectively, which shows that it has high accuracy and strong migration ability. We randomly selected one period from the test results to further analyze the differences between the three models.
Figure 6 shows the simulation results of the three models corresponding to this period. Compared with the LSTM model, the speed simulation value generated by the CNN-LSTM model retains a minor error with the observed value and accurately captures the changing trend of the speed at all times, highlighting the importance of feature extraction.
4.3. Generalization Ability
A neural network’s ability to use what it has learned from previous experiences to operate in a completely new environment is called generalization ability [
21]. The car-following model with strong generalization ability can be applied to different road conditions, vehicle types, and traffic flows. In this paper, the generalization ability of each model is evaluated by its ability to learn heterogeneity, reproduce the hysteresis phenomenon of congested traffic flow, and adapt the heterogeneous traffic flow mixed with ACC vehicles.
Heterogeneity includes the heterogeneity of driving behavior (different driving styles) and the heterogeneity of drivers (human, ACC system). We adopted the
K-means algorithm to cluster the mean gap and the standard deviation of the following vehicles’ speeds. The driving styles are divided into aggressive and normal [
12]. We randomly selected the periods of two driving types from the test dataset. The experimental results are shown in
Figure 7 and
Figure 8. Due to error accumulation, the IDM generates significant gap errors in the later stage of the car-following process. The deep learning car-following model is good at learning and summarizing the rules of the trajectories data, resulting in a small error between the generated trajectories and the observed trajectories, indicating that the deep learning algorithm has a strong ability to learn heterogeneous driving behaviors. Compared with the LSTM model with the same configuration and parameters, the CNN-LSTM model can reduce the trajectory prediction error by 54.9% and 62.2%, respectively. Moreover, it can reduce the running time by 60.8% compared with the LSTM model, which verifies the effectiveness of the CNN in the feature extraction of car-following behavior. It is known from the generalized force model that the driver of the following vehicle expects to keep a safe distance positively related to the speed of his vehicle with the leading vehicle during car-following [
22]. As shown in formula (10), we employ it as the safety distance, and the time headway corresponding to different driving styles. Each model’s simulation gap and the safe distance are shown in the fourth subgraphs of
Figure 7 and
Figure 8. All models cannot guarantee that the simulation gap is higher than the safe distance, but the simulation gap generated by the CNN-LSTM model is the closest to the safe distance, which indicates that the proposed model can better reproduce the driver’s expected following behavior.
where
is the safe distance,
is the time headway (the time headways of aggressive and ordinary drivers are 0.8 s and 1.5 s, respectively), and
is the minimum vehicle distance (its default value is 2 m).
In modeling car-following behavior, reproducing some unexplained traffic phenomena is the key to measuring the model’s performance. This experiment evaluates the ability of the model to produce the hysteresis phenomenon by the platoon simulation of five vehicles in a congested traffic flow with three models. The trajectory of the first following vehicle is determined by the observed course of the leading vehicle and its initial condition. Additionally, the trajectory of the next following vehicle is generated by the simulation trajectory of the leading vehicle and its initial state. However, the simulated trajectory of the last following vehicle has a significant error with the observed trajectory because of the cumulative error.
Figure 9b shows the traffic oscillation scenario in which the clockwise and counterclockwise hysteresis phenomena co-occur in congested traffic flow. We use Laval’s aggregation method [
23] to analyze it, which calculates the traffic flow and density through the parallelogram in the time-space graph, and draws a polygonal line graph to show the hysteresis loop. In
Figure 9a, the orange parallelogram has two long sides, the slope represents the wave propagation speed, and the slope of the two short sides is the vehicle speed. The traffic flow and density in each parallelogram area are calculated by formula (11).
where
and
represent the density and flow in parallelogram area
, respectively,
are the travel time and distance of the
i-th vehicle passing through area
, respectively,
represents the number of vehicles in area
(
), and
is the area of
.
As shown in
Figure 9a, the simulation trajectory of the last vehicle in the platoon may be outside the parallelogram. To ensure the parallelogram area remains unchanged and to calculate the traffic flow and density close to the observed traffic, we use the observed trajectory of the last vehicle instead of its simulation trajectory to calculate the flow and density [
24].
Figure 9b shows the hysteresis loops corresponding to the simulation trajectories of the three models. We can find that the three models can accurately reproduce the clockwise hysteresis loops corresponding to areas 1 to 5. However, the counterclockwise hysteresis loops corresponding to areas 5 to 7 are only shown in the trajectory simulated by the CNN-LSTM model, which shows that the proposed model can not only reduce the cumulative error of the platoon simulation, but accurately extract the features of the platoon to reproduce the traffic hysteresis phenomenon.
To further test the performance of adapting to the heterogeneous traffic flow mixed with ACC vehicles, we carry out a car-following test in a high-speed scenario containing two human-driven vehicles and one vehicle controlled by an ACC system. The results are shown in
Figure 10. There are apparent errors between the simulation trajectories generated by the IDM and the observed trajectories. The vehicle controlled by the ACC system accelerates from 17 to 30 s, and the simulation trajectories of the three models are hysteretic. Still, the simulation error of the CNN-LSTM model is the smallest, which reflects that it has a strong adaptability.
5. Conclusions
This paper develops a car-following model based on a CNN and the LSTM network to improve its accuracy and generalization ability. The proposed model was trained and tested through the natural driving database and OpenACC databases, and it was compared with the IDM and LSTM models. Compared with the LSTM model with the same configuration parameters, the CNN-LSTM model can decrease the mean square error of speed simulation of the single car-following pair by 55.3% and reduce the mean simulation trajectories error of the platoon by 36.9% in heterogeneous traffic flow mixed with adaptive cruise control vehicles. Furthermore, the CNN-LSTM model can reduce the mean simulation trajectories error of the platoon by 60.9% in congested traffic flow. The results proved the effectiveness of the CNN for car-following behavior feature extraction. Moreover, the CNN-LSTM model can accurately reproduce heterogeneous driving behavior and the hysteresis phenomenon of congested traffic flow, highlighting its strong learning ability and accuracy. For different traffic flows and road conditions, the CNN-LSTM model can perform better than the other two, proving that it has strong generalization ability.
In addition, the simulation gap generated by the CNN-LSTM model is the closest to the safe distance, which indicates that the proposed model can better reproduce the driver’s expected following behavior. However, how to combine the safety issue with the deep learning car-following model is still an urgent problem to be addressed. Furthermore, the simulation speed curve generated by the proposed model is smooth, which can improve vehicular ride comfort. Road transportation efficiency and fuel economy can be improved by integrating ease traffic congestion into the car-following model. This requires more effort and experimental research.