1. Introduction
As an emerging transportation mode, high-speed rail (HSR) has attracted attention from passengers and operators due to its rapidity, safety, and comfort when compared to other modes. High-speed rail, together with other transportation modes, such as bus, traditional rail, and air transport, constitute a transportation system. The introduction of high-speed rail services has significantly changed the transportation market. However, some competition occurs between HSR and other transport modes for travel demand. As shown in
Figure 1, for short-distance travel demand, bus and traditional rail are the main competitors of high-speed rail (HSR), whereas air transport has become the major competitor of high-speed rail (HSR) for medium- and long-distance travel demand. Empirical studies (Caseetta and Coppola [
1], and Borjesson [
2]) have shown that transportation mode choice is determined by many factors, especially by travel time and ticket price. Thus, the more stops a HSR train trip makes, the more time is required for the passenger to travel, which reduces the market share of HSR. A stop-schedule, sometimes called a stop pattern or a halt pattern in the literature, identifies the stations at which a train trip stops. According to Cordone and Redaelli [
3], travel demand and train services interact. HSR train services are determined by the travel demand for HSR trains, and the travel demand for HSR trains depends on the HSR train services provided. Thus, for the sustainable development of both high-speed rail and travel demand, demand forecasting, including travel behavior or modal choice, should be integrated into the train stop planning of line planning. In this paper, an integrated model for the modal choice and train stop planning problem (MC-TSPP) is proposed. The modal choice problem (MCP), involves estimating the travel demand for HSR, which is an important component in developing a sustainable HSR system. The train stop planning problem (TSPP) of line planning involves determining the stop-schedule plan for HSR train trips. For a clear description, all abbreviations used in this paper are summarized in
Table 1.
The contributions of this paper are as follows: (1) an integrated model simultaneously considering mode choice and train stop planning is presented, (2) a decomposition-based heuristic iterative algorithm is proposed, and (3) a detailed analysis of travel behavior and train services is provided.
The remainder of the paper is structured as follows.
Section 2 reviews the literature related to our study. In
Section 3, we outline an integrated model for modal choice and train stop planning. A heuristic iterative algorithm is developed in
Section 4. A case study is detailed in
Section 5.
Section 6 concludes this paper and suggests directions for future work.
2. Related Literature
2.1. Train Stop Planning Under Given Demand
Given the travel demand, which is constant and independent of the service supplied, the TSPP of line planning determines the stop-schedule plan for train trips or lines (
Table 2). To optimize a stop-schedule plan, a multiobjective programming model was presented by Chang et al. [
4]. Goossens et al. [
5] tackled the problem of altering the stops along the lines by both upgrading and downgrading stations. They assumed that the types of lines were given and focused on finding a line type for each station. To model this problem, a line-event graph was presented. A multicommodity flow model was formulated, which was also a mixed integer programming formulation. Several models were used to handle the line planning problem (LPP) with different stop patterns by Goossens et al. [
6]. Based on a heuristic algorithm, a stop-schedule plan was constructed by Shi et al. [
7]. Deng et al. [
8] presented a bilevel programming model for the optimization of stop-schedules. To optimize all-stop, short-turn, and express transit services, a mixed integer nonlinear model was adopted by Ulusoy et al. [
9]. In their approach, the stop-schedules of the express transit services were determined. They also considered the passenger assignment and transfer stations. By applying a two-layer nonlinear optimization model, Wang et al. [
10] determined the stop-schedule of each individual train trip. Passenger assignment was also considered in their model. To tackle the TSPP, a two-stage approach was proposed by Fu et al. [
11] under the assumption that the number of train stop patterns was given. For the passengers whose travel distance was beyond the train-set maintenance kilometers, they split them into several separate origin-destinations (ODs) at major transfer hubs. Huang and Peng [
12] concentrated on designing a stop-schedule plan for HSR and a two-stage heuristic algorithm was presented to handle this problem. A nonlinear programming model was presented to optimize HSR stopping patterns by Jong et al. [
13], which became infeasible when dealing with large-scale problems. The authors assumed that each passenger OD only chose the pattern with the shortest travel time. To solve the model, a genetic algorithm was presented that could solve small-scale problems, such as the Taiwan HSR system, to the optimal or a near-optimal solution. A new model with a restricted passenger flow assignment procedure was presented for the TSPP by Fu et al. [
14]. Li et al. [
15] focused on the TSPP and a nonlinear planning model was presented based on the concept of node service. Park et al. [
16] handled the line planning problem (LPP) where the stop patterns were determined based on a multicommodity flow model. To solve the model, they applied the Dantzig–Wolfe decomposition and column generation. An integrated optimization method for the TSPP was proposed by Zhang et al. [
17] based on classified stations and categorized passenger flows. For each type of train stop pattern of categorized passenger flows, an integer programming model was used.
Wang and Luo [
18] presented a nonlinear programming model to optimize the TSPP. Lai et al. [
19] focused on the TSPP for a HSR system based on a novel network structure. Yang et al. [
20] integrated the TSPP and train scheduling problem. Yue et al. [
21] also incorporated train stopping patterns into train scheduling optimization. Qi et al. [
22] extended the research of Yang et al. [
20] by considering the number of passengers boarding or alighting at each station for each train trip.
2.2. Modal Choice
To estimate travel demand, we needed to model the travel behavior in the form of decisions. Most research on travel behavior has emphasized the use of discrete choice theory, and many variants of logit models have been developed. With regard to the travel behavior, Ben-Akiva and Lerman [
23] applied the theory of discrete choice to study the travel behavior within public transport. Cascetta et al. [
24] proposed the C-Logit model to tackle path overlapping problems. To consider the cross similarities between different pure and combined modes, the cross-nested logit model was developed by Vovsha [
25]. To handle the route overlapping problem, a new logit model, the link-nested logit model, was presented by Vovsha and Bekhor [
26]. Ben-Akiva and Bierlaire [
27] reviewed the discrete choice models and applied these models to short-term travel decisions. Ramming [
28] considered network knowledge in both route generation and route choice. Almost all route choice models were compared based on the Boston case study data. Prashker and Bekhor [
29] reviewed the route choice models and applied these models to the traffic assignment problem.
A large and growing body of studies has investigated the impact of service on travel behavior (
Table 3). To estimate travelers’ preferences in multimodal networks, Arentze and Molin [
30] designed a series of stated preference (SP) experiments to estimate a multimodal travel choice model. Caseetta and Coppola [
1] proposed a trip-frequency model to forecast the short-term demand induced by HSR that depends directly on the generalized travel cost (i.e., travel time, travel cost, and service frequency). In order to forecast the diverted demand for considerably reduced travel times, when new HSR lines were introduced, Borjesson [
2] validated the Swedish long-distance model, Sampers, and proposed a new model. To study the impact of the stations’ architectural quality on the users’ choices, a quantitative analysis based on a binomial logit model was conducted by Cascetta and Carteni [
31], which showed a significant impact of the stations’ quality on the travelers’ preference. Based on a multinomial logit model, Baek and Sohn [
32] conducted an investigation of passenger behavior for express trains during peak hours for the metro line nine in Seoul, South Korea. To investigate the mode choice behavior for passengers among air transport, HSR, and integrated air and high-speed rail (AH) services, Li and Sheng [
33] proposed a modal split model. Some important results were produced. The mode choice behavior of commuters with the introduction of a new rail transit mode was studied by Sohoni et al. [
34]. Carteni et al. [
35] quantitatively analyzed the impacts of hedonic quality, including faster transit services and a number of hedonic characteristics of the services, on the students’ domestic tourist travel behavior. Mattson et al. [
36] presented a mixed logit model to estimate demand for rural intercity bus services. It can be clearly seen that modelling long-distance travel is a different kind of modelling than modelling of daily commuter traffic because of different kind of trips.
2.3. Integrated Model of Modal Choice and Transportation Services
Relatively few studies have been published concerning the interaction between travel behavior and service network design (
Table 4). To consider the relationship between the travel demand and the service network design, Cordone and Redaelli [
3] integrated timetable design into demand estimation. A solution procedure was proposed based on branch-and-bound and a heuristic algorithm. Borndorfer [
37] presented an optimization approach for fare planning based on a discrete choice model in public transport where the line operation frequency were also considered in their models. Different fare structures, optimization objectives, and scenarios were analyzed in detail. An optimization model for railway timetabling was proposed by Espinosa-Aranda [
38] that considered the mode choice and the travel behavior among different services based on a constrained nested logit model.
Cantarella et al. [
39] and Li and Yang [
40] conducted some relevant analyses of day-to-day dynamic modal choice with responsive transit service frequency for only one OD pair, which was not based on the logit-based model. Canca et al. [
41] defined a mathematical programming model that integrated network design and line planning. The elasticity of demand was considered by the model and was used to determine the proportion of demand that each mode captured for each OD pair, assuming that the market share between modes was given and not determined. Li et al. [
42] focused on the interaction between fare, frequency, and transit demand in a bimodal transportation network where transit frequency and fare were handled in an aggregate way, and a more detailed transit service design was not considered. Robenek et al. [
43] combined timetable design with a multinomial logit-based passenger assignment to present the demand elasticity under the assumption that the mode choice model was solved in advance. Wen et al. [
44] studied the autonomous vehicle operation problem considering the interaction between demand and supply, which provided new ideas for our research.
According to the literature review, it can be seen that recent research has begun to pay attention to the interaction between travel behavior and service network design. However, the literature on the interaction between travel behavior and line planning is fairly limited. In addition, most studies are restricted to frequency setting and modal choice and do not consider the interaction between stop-schedules and transit demand due to the difficulty of investigation. Differently, we explore how the travel demand interacts with the stop-schedule plan, and thus we adjust the stop-schedule plan. Compared to other studies, this method can be used to analyze passenger travel choice behavior caused by the number of stops in more detail. In addition, this integration can help make HSR more competitive than other modes of transport.
3. An Integrated Model for Modal Choice and Train Stop Planning
This section presents our integrated model for the modal choice and train stop planning problem (MC-TSPP). For convenience, the list of sets, parameters, and variables used are given in
Table 5,
Table 6 and
Table 7, respectively.
3.1. Train Stop Planning Model
To formulate the TSPP, some assumptions are required, as stated below.
Assumption 1. The set of potential operated routes is known and given. If there is no train trip that runs on a route, the route will not operate. Additionally, the running time between any two stations is given, which is obtained from the official timetable. The stop time at an intermediate station, which contains the acceleration and deceleration time, is assumed to be given and constant and independent of passenger flow (alighting and boarding) at the station.
Assumption 2. Travel demand depends on the quality of the train service, which will be illustrated in the next subsection.
The TSPP considered in this paper can be stated as follows. Given a set of feasible routes and the possible number of train trips for each route, the TSSP involves determining (1) the optimal stop-schedule for each train trip, (2) the passenger volume served by the corresponding train trip, and (3) the number of passengers on board when the train trip stops at an intermediate station.
To formulate the TSPP, the following constraints need to be satisfied:
Equation (1) states that the travel demand between any two stations must be conveyed by the train trips whose routes pass these two stations simultaneously. The capacity constraints for tracks and stations are specified by Equations (2) and (3), respectively.
To construct a stop-schedule for a train trip, the constraints in Equations (4) and (5) must be imposed. That is, a train trip on route is required to start from one terminal station of route and end at the other terminal station of route . Otherwise the train trip on route will not be operated. Constraint (6) specifies that no passenger can get on or get off a train trip at station if a train trip does not halt at station .
The seating capacity constraint of a train trip is imposed by Equation (7). That is, the number of passengers served by a train trip on route cannot exceed the seating capacity of the corresponding train trip. Equation (8) defines the number of passengers on board when a train trip on route stops at an intermediate station . The number of passengers on board is determined by the passengers that alight at a station before station and disembark at a station after station . Constraints in Equations (9)–(11) provide the restrictions on the decision variables.
3.2. Modal Choice Model
Modal choice, which involves estimating the amount of demand captured by a transport mode, is a common problem in transportation research. Modal choice or modal split is the third step of the four-step model, which consists of four steps, i.e., trip generation, trip distribution, modal split, and traffic assignment. Discrete choice methods, especially logit-based models, have been widely used to model the problem. Logit-based models assume that decision-makers, i.e., travelers, are rational and choose a transport mode that maximizes their own utility, which is not completely known. The utility is usually related to the quality of the transportation service, such as travel time and ticket price. Empirical studies have shown that transportation mode choice is determined by many factors, particularly travel time and ticket price. In our paper, we assumed that the ticket price is given and constant. Thus, we mainly focused on the travel time. For simplicity, we used the average travel time of one transport mode to represent the travel time as follows:
For HSR, the average travel time can be calculated below, which is related to the stop time:
Then, the utility function of one transport mode can be defined as follows:
where
represents the utility of mode
between city
and city
.
, the deterministic utility, accounts for all of the attributes that are known and influence the mode choice of the travelers. As stated above, for simplicity, we only selected the travel time and ticket price.
is usually a linear combination of the attributes with suitable coefficients.
denotes the ignored attributes and individual differences between the travelers, which are assumed to be independent and an identically distributed Gumbel variable with zero mean. In our model, the deterministic utility is defined as follows:
where
represents the ticket price of alternative
from
to
, and
and
are unknown coefficients that need to be estimated. We assumed that the ticket price for all transport modes between any two stations or cities are known and constant. The travel time for all modes, except for HSR, between any two stations or cities are known. The travel time for HSR depends on the average stop time. Therefore, the deterministic utilities for all transport modes apart from HSR are constant, whereas the deterministic utility of HSR is a function of the stop-schedules. Then, the estimated demand of HSR can be defined as
3.3. An Integrated Model
The logit formulation and TSPP formulation can be integrated to study the interaction between travel demand and HSR train services. From the demand perspective, we maximized the total demand of HSR mode. Then, the integrated model, i.e., MC-TSPP model, is
The objective function in Equation (17) maximizes the total demand that uses the HSR mode. The constraint in Equation (18) relates the HSR demand of each origin and the destination pair to the associated travel time. Constraint (19) relates the travel time to the HSR train running time and average stop time. The other constraints are directly obtained from the TSPP formulation.
4. Solution Procedure
The MC-TSPP formulation is a mixed integer nonlinear programming (MINLP) problem. Therefore, we cannot directly solve it with a commercial solver. By analyzing the structure of the MC-TSPP formulation, we see that the formulation can be decomposed back into the modal choice problem (MCP) and the train stop planning problem (TSPP) with fixed demand. Then, the TSPP, which is a mixed integer linear programming (MILP) problem, can be solved by a commercial solver, such as ILOG Cplex. The solution procedure was based on the idea of Cordone and Redaelli [
3]: we started from the demand estimating based on the current attributes of the modes; then, with the demand, we solved the TSPP and used the results to estimate another demand, and iterate. To illustrate the solution procedure, we outline the process below.
Step 1. Calculate the initial average travel time for the HSR based on the current HSR timetable.
To start, we calculated the average travel time of the HSR between any two stations based on the current HSR timetable, which is used to obtain the initial demand.
Step 2. Estimate the initial demand of the HSR.
Based on the obtained initial average travel time for the HSR, we calculated the logit model to estimate the initial demand of the HSR.
Step 3. Solve the TSPP and estimate the demand of the HSR.
Since the objective is to maximize the total demand captured by the HSR, which mostly depends on the number of times it stops, we solved the TSPP by minimizing the total time loss for all demand:
Based on Objective (21) and Constraints (1)–(11) the TSPP can be solved with ILOG Cplex, from which we obtained the optimal solution . Then, based on the optimal solution, we estimated the demand with Constraints (18) and (19), which can be applied to the next iteration.
Step 4. If , , and the number of iterations is no more than a limited number, go to Step 3; otherwise, terminate the algorithm and return the train service plan that generates the maximum demand. denotes the maximum demand that the HSR can transport. can be obtained with and Constraints (2)–(11), i.e., . and represent the demand of the current iteration and previous iteration, respectively; and is a threshold.
5. Case Study
This section presents the results obtained based on the integrated model and solution procedure. We selected the Beijing–Shanghai corridor to conduct the case study.
5.1. Data Collection
For the case study, the Beijing–Shanghai corridor was selected for the following reasons. First, the Beijing–Shanghai HSR operates along a corridor, which has attracted a large amount of passenger traffic. Second, several transportation modes operate in the corridor, including HSR, traditional rail, bus, and air transport. We considered the 23 cities along the Beijing–Shanghai HSR. Thus, there were 253 OD pairs in one direction. We were given the HSR demand for a normal operating day; then, the total demand for all modes and all OD pairs were derived from calculations using the HSR demand and the estimated share rate of the HSR.
To estimate the modal split models, we conducted stated preference (SP) surveys to determine the travel mode choice of the individuals. SP attempts to directly measure preferences or utility by survey. In terms of attribute selection, all attributes affecting mode choice should be considered. However, for practicality and simplicity, only the travel time and ticket prices were considered. The travel time for each transportation mode was derived from the official timetable. HSR fares were derived from the second class fare. Seat fares were selected as the traditional rail (TR) fares and bus fares. Economy class ticket prices were selected as the air transport fares. The SP survey questionnaires were distributed at stations or airports between September and December 2018. The coefficients were estimated using TransCAD Transportation Planning software.
5.2. Beijing–Shanghai HSR
The Beijing–Shanghai HSR, a double track rail line, consists of 23 stations and 22 tracks in one direction. There are five terminal stations: Beijing South, Jinan West, Xuzhou East, Nanjing South, and Shanghai Hongqiao, which can be used as start or end stations. The running time of a HSR train between any two stations was derived from the official timetable. For simplicity, let numbers 1 to 23 represent the stations from Beijing South to Shanghai Hongqiao, respectively. Other parameters related to the proposed model are given in
Table 8.
As mentioned by Jeong et al. [
45], ILOG Cplex sometimes does not work for all instances with all the Cplex parameters at their default values. To get a feasible solution, we need to revise the Cplex parameters to ensure that both the quality of the solution and the solution time are acceptable. Thus, before applying the proposed heuristic algorithm, we record the mixed integer programming (MIP) relative error of ILOG Cplex with different time limits under the condition of maximum demand (see
Table 9). This is because
and for the same MIP relative error, each TSPP needs less solution time. The results show that the more computational time is allowed, the higher the quality of the solution is. However, it is noteworthy that the improvement of solution quality becomes less significant with the increase of time when computation time is more than 2 h. Besides, the quality of solution is acceptable when the time limit is 7200 s, i.e., 2 h. Thus, the time limit was set to 7200 s for each TSPP with fixed demand. In this paper, the proposed heuriustic solution procedure is implemented by using ILOG Cplex and Visual C# on a laptop with a CPU based on a 2.53 GHz Intel Core i3 processor and 8 GB RAM.
Using the proposed solution procedure and the time limit for ILOG Cplex, the iterative process terminated after about eight iterations when the total demand of all OD pairs converged. The total travel demand of the HSR increases by around 7.45% to 1,059,136 in one direction, mainly due to the fact that the travel time for some OD pairs is shorter, caused by fewer stops. In addition, the extent to which the solution reaches maximum demand is also indicated as
Table A1 in
Appendix A. It is shown that for most OD pairs, the demand does not reach the maximum demand that HSR can serve.
To explore how the service level impacts the demand at the stations, the statistics for demand at stations in one direction are given in
Figure 2. The results show that increases in demand at stations 1, 6, 11, 16, and 23 are significant, and corresponded with Beijing South, Jinan West, Xuzhou East, Nanjing South, and Shanghai Hongqiao, respectively. Increases in passenger flow at terminal stations are usually higher than that at intermediate stations. For example, the increase in passenger flow for these five terminal stations is 73,462, which accounted for 56.18% of the total increase for all stations. This reflects the fact that terminal stations have more potential to attract more passenger trips. Thus, the operators should pay more attention to the train services at terminal stations.
Table A2 in
Appendix A shows the growth rate of the maximum passenger flow compared with the current passenger flow in one direction. As a result, only 12 OD pairs have reduced passenger flow and 241 OD pairs have increased demand. This reflects the fact that the travel time of most OD pairs decreases because the stop times for most OD pairs are reduced. In addition, this seems to indicate that the ridership growth rates of the OD pairs with long distances and more intermediate stations are usually greater than that of OD pairs with shorter distances and fewer intermediate stations. This indicates that the more stations an OD pair passes, the higher the possibility of an increase in passenger flow. This also shows that the number of stops can be further improved. Then, a new HSR train service plan in one direction, i.e., a new line plan, is generated (see
Figure A1 in
Appendix B), which is more competitive than the existing one. This indicates that operators can improve the current Beijing–Shanghai HSR train service based on our method.
To analyze the competition relationship and the diverted demand, the modal share of each transportation mode for four OD pairs is illustrated in
Figure 3. The data on the left represents the current modal share, while the data on the right denotes the optimized results. The increase rate in the HSR market share increases with the increase in distance. For short-distance transportation, such as Beijing–Jinan and Beijing–Xuzhou, the diverted demand of HSR is, to a large extent, from traditional rail, whereas for medium- and long-distance transportation, the diverted demand of HSR is mainly from air transport. This shows that HSR and TR compete in the short-distance transport market, and air transport and HSR compete in the medium- and long-distance transport market.
6. Conclusions
In this paper, we proposed an integrated model that combines modal choice and train stop planning. To solve the problem, we presented a decomposition-based heuristic algorithm. Based on a case study, some further conclusions can be drawn:
(1) It would be better to integrate modal choice and train stop planning for designing a sustainable and competitive HSR train service plan.
(2) Terminal stations usually have more potential to attract more ridership. In addition, based on our method, the HSR train service can be improved further.
(3) HSR and TR compete for the short-distance transport market, and air transport and HSR compete for the medium- and long-distance transport market.
(4) The results seem to indicate that the more stations an OD pair passes, the higher the possibility of an increase in passenger flow.
(5) Long-distance passengers are more sensitive to travel time than short-distance passengers. For short-distance passengers, other factors, such as price, may be more important, which needs further analysis.
Although we concentrated on HSR, as a planning decision aid, the model can also be applied to other modes. In addition, our model can easily take other considerations into account.
Although important discoveries have been revealed by this paper, some interesting topics remain to be explored:
(1) Fare planning was not considered in our model, which is an important aspect. In our next study, we will incorporate fare planning into our model.
(2) Travel behavior between HSR trains was not considered, which could be incorporated in future studies.
(3) Train timetable design considering travel behavior is also an interesting topic.
(4) A more effective algorithm is needed to reduce computation time.
(5) In order to be more realistic, different mode choice models for different ODs or distances should be explored based on investigation data.
(6) In addition, to make the results more realistic, the specific characteristics of individual and latent factors should be taken into account.