1. Introduction
In recent years, there has been a significant focus from transportation planning agencies and railway companies on the development and expansion of intercity railway networks to meet the growing travel demand. Consequently, the workload for rolling stock has increased, leading to unscheduled or periodic demands [
1,
2] Transportation planning agencies often consider various strategies to address these escalating demands, such as establishing new facilities or modifying existing ones. However, the expansion of high-speed rail (HSR) networks in countries like China has highlighted the challenge of aligning depot locations and capacities with transportation needs [
3]. Given the substantial costs associated with facility locations and the limited flexibility to make frequent changes, depot location planning becomes a strategic decision with long-term implications spanning several decades for the financial performance of railway companies. Hence, careful attention must be given to the facility location problem to ensure optimal outcomes and avoid costly modifications within a constrained time frame.
This paper proposes a mixed integer programming model for the depot location problem with passenger demand uncertainty in an intercity railway network, which integrates the first two stages in the railway planning process. A similar problem was treated by Tönissen and Arts et al. [
4], Tönissen and Arts [
5], and Tönissen and Arts et al. [
6], where the authors proposed robust optimization formulations for the stochastic maintenance location routing problem (SMLRP). The model presented in this paper differs significantly from that of theirs with respect to the input data and decisions. In the former, the services are given in advance and the problem consists of determining the locations of depots and the maintenance routing of rolling stocks. In contrast, as it deals with long-term decisions, services, especially train operation plans, timetables, and rolling stock workload, cannot be predetermined in advance in this paper. Our primary objective is to identify depot locations that exhibit robustness across a range of likely future scenarios, which is achieved by incorporating line planning decisions, and the problem focuses on optimizing decisions about (i) the locations and capacities of depots, (ii) the frequencies of train lines and home location of the rolling stock of train lines, and (iii) passenger assignment. Without the loss of generality, the depot capacity means the maximum number of rolling stocks that the facility can hold. We do not consider other depot capabilities, such as service and repair capability, because there is no timetable or rolling stock circulation plan to determine the work demands of the rolling stocks.
The main contributions of this paper are as follows:
We consider the integrated depot location and line planning problem with uncertain passenger demand through a strategic modeling framework;
The depot location, line plan, passenger path, and fleet allocation are optimized simultaneously, which yields a certain equilibrium between passengers and operational decisions;
Multi-type train lines are considered, which makes the calculation of the required rolling stocks more accurate;
The DE-IRH solution framework is used to deal efficiently with the NP-hard robust optimization model, where the depot location problem is solved by the DE algorithm, then the frequencies of train lines, flows on the path, and home location of the rolling stocks are determined by improved rounding heuristics.
The remainder of this paper is structured as follows. A brief review of the relevant literature is presented in
Section 2. Mathematical models for the robust integrated intercity railway depot location and line planning problem (RIDLLPP) are presented in
Section 3.
Section 4 describes the solution framework. In
Section 5, numerical examples for real-world intercity railway networks are provided to demonstrate the effectiveness of the proposed models and algorithms. The
Section 6 concludes the paper and discusses future research topics.
2. Literature Review
The literature is rich in optimization models to assist various types of facility location problems, and numerous studies and reviews have been conducted on facility location, such as Aikens [
7], Daskin [
8], Owen and Daskin [
9], and Drezner and Hamacher [
10]. Recently, Laporte and Nickel et al. [
11] reviewed the related models and algorithms using discrete optimization, and the development of continuous approximation models for facility location problems were reviewed by Ansari and Başdere et al. [
12]. Furthermore, uncertainty, supply chain decisions, and vehicle routing often affect facility decisions. We refer to the reviews of Snyder [
13] and Dönmez and Kara et al. [
14] for facility location problems under uncertainty, then Melo and Nickel et al. [
15] for facility location in combination with supply chain decisions, and Cabrales-Navarro and Arias-Osorio et al. [
16] for the location and routing problem (LRP).
However, the depot location problem for rolling stock has essential features that make it substantially different from others in terms of logistics: (i) the customers of depots are various types of rolling stock work demand that have to travel to a depot over a fixed railway network [
4], and it is necessary to plan for an empty train to reach a depot when not passing a depot for a long period; (ii) the depot location decisions should support the daily adjustment of the train line plan in different passenger demand scenarios; (iii) as long-term decisions, the line plan and railway network change annually to accommodate changing travel demands. What is more, as stated by some researchers, it does not seem appropriate to consider modifications to railway infrastructure at the network planning level without at least taking into consideration the level of traffic (both passengers and trains) that the proposed infrastructure would support in the future; thus, integrating different stages in the planning process becomes a vital research trend [
17,
18,
19,
20].
To the best of our knowledge, most papers on the depot location of rolling stocks primarily focus on integrating the depot location and maintenance routing (MLR). De Jonge [
1] studied the facility location problem in the train maintenance environment and formulated a mixed-integer programming model to determine the optimal number, size, and location of train maintenance facilities. Kho and Martagan et al. [
21] constructed a mixed integer linear program to solve the maintenance location routing (MLR) problem for NedTrain. Xie and Ouyang et al. [
22] proposed a mixed-integer program model to optimize the location and capacity of locomotive maintenance shops. The integrated planning problem for multiple types of locomotive work facilities under location, routing, and inventory considerations was studied by Xie and Chen et al. [
2]. Tönissen and Arts et al. [
4] modeled the uncertainties of and changes in line planning and fleet planning using a discrete set of scenarios and provided a two-stage stochastic programming and a two-stage robust programming formulation for the maintenance location routing problem. After that, the economies of scale in recoverable robust maintenance location routing were studied by Tönissen and Arts [
5]. An efficient mixed-integer programming for maintenance location routing problems under uncertainty conditions of line and fleet planning was proposed by Tönissen and Arts et al. [
6]. In a recent study, Tönissen and Arts [
23] studied the stochastic maintenance location routing allocation problem, where they had to locate maintenance locations and allocate fleets to these locations. The common feature of these models is their ability to determine the depot’s location and capacity, and the rolling stock’s maintenance routing, with a given rail network, candidate facilities, train service, and maintenance demand.
Recently, the combination of the depot location and rolling stock circulation has been introduced by Canca and Barrena [
18]. Considering that the rolling stock departs and returns to depots every day, and the empty runs at the beginning (from depots to initial stations) and at the end of every daily route (from terminal stations to depots) seriously affect the operation efficiency of the train lines, Canca and Barrena [
18] proposed a general mixed integer programming model to design rolling stock circulation plans and simultaneously consider the problem of determining the number and location of rest facilities. Lusby and Zhong et al. [
3] analyzed the impact of depot location on rolling stock scheduling. They proposed a two-stage mixed-integer programming model to determine a maintenance feasible, operational rolling stock schedule while simultaneously identifying potential new depots to open. An essential assumption of the above work is the assumption of a priori train schedules.
In contrast, we consider the problem of locating depots within a railway network that involves the planning or constructing of new lines. In this context, the line plan, timetable, and rolling stock work demands are not predetermined, except for the passenger demand. This is a problem that railway companies naturally encounter during network construction. Comparatively, this problem has received much less attention from an optimization perspective. The above work is challenging to apply to such a railway network because, being that it entails a long-term decision, it is difficult to obtain more information, especially regarding train service, although some of them have taken account of uncertain or changing line planning, fleet planning, and other factors.
As the first stage of the tactical level, line planning is naturally coupled with the selection of the depot location. A line plan consists of a set of train lines, and each train line is characterized by its origin and destination station, its frequency per hour or day, the route between these two stations, and the intermediate stops at passing railway stations [
24]. In the railway network, not all stations can be used as train terminals except for the stations near the depot. Thus, the location of the depot will affect the selection of train lines in the process of line planning. On the other hand, the capacity of the depot should be sufficient for the given line plan. Many papers have been published on various types of line planning problems [
25,
26]. Most recently, some efforts have been devoted to integrating line planning with network design (station location). Canca and De-Los-Santos et al. [
27,
28,
29] studied the profit-oriented integration problem of station location and line planning, where they simultaneously determined and defined the infrastructure network, line planning, train capacity of each train line, fleet investment, and personnel planning. However, the influence of the depot on the train terminal is usually not considered in the line planning process, which can significantly affect the utilization of the available rolling stock and revenues, and lead to inefficient solutions.
To summarize, the main objective of this paper is to propose a new strategic planning model that is applicable to intercity railway networks with new lines. The model integrates the depot location with line planning, which is driven by an uncertain or changing passenger demand. We modeled the uncertainty of the passenger demand using a discrete set of scenarios, and the line plan, passenger assignment, fleet allocation, and location and capacity of depots are synchronously determined. We present an iterative framework to solve our robust depot location and line planning problem.
3. Mathematical Model
3.1. Problem Description and Assumption
We consider the optimization problem of where to open new depots or expand existing depots and how many storage lines should be allocated in the intercity railway network. As shown in
Figure 1, the stations are sequentially numbered as
, and there are three intercity railway lines in operation: Line 1, Line 2, and Line 3. The depot d1 provides rolling stocks for Lines 2 and 3 and undertakes daily maintenance of rolling stocks. The rolling stocks running on Line 1 are provided by depot d2. Assuming that each intercity railway line operates independently, that is, the train’s initial station and terminal station belong to the same intercity line, due to the rapid growth of passenger demand, the railway planning department plans to open several new lines (the red dotted lines) to improve the convenience of passengers and increase network profit.
The new lines mean that more rolling stocks are needed, and that line plans may also be adjusted. Therefore, the railway planning department needs to decide to allocate more storage lines to the existing depots, d1, d2, and d3, or construct new depots among d3, d4 and d5, or even combine both to reduce the construction, operation costs, and travel time cost of passengers.
The model was formulated based on five important (and realistic) assumptions. First, the intercity railway network is owned and operated by a public entity whose objective is to minimize net public generalized costs. And each railway line operates independently, that is, the initial station and terminal station of each of the trains are on the same intercity line. Second, the information on the stations, existing depots, candidate depots, existing lines, and planning lines of the intercity railway network are given, and a single type of depot is considered. Third, a line pool has been defined [
30,
31], but the frequency and home location of the rolling stock of train lines are yet to be chosen among a given set of alternatives. Fourth, passenger demand is given by a discrete set of scenarios, which can be obtained through passenger demand surveys and passenger demand forecasts. What is more, we consider that the passenger demand of each line is symmetrical in both directions. Lastly, there is only one type of rolling stock operated on the intercity rail network.
3.2. Mathematical Notation
We consider an intercity railway network , consisting of nodes and edges . There are two types of nodes: stations, , which are the starting and ending places of passenger trips; depots, , which store and maintain rolling stocks. The number of storage lines of depot is denoted by , in particular, if is a candidate depot, and the maximum number of storage lines allowed to be constructed at is denoted by . The cost of constructing a storage line at location is denoted by , which is related to the local economic level and will be measured in monetary units per day. An edge, , is the connection between nodes. Each edge has a length and travel time . We are also given the capacity on the edge .
Let
be the set of all train lines.
can be given explicitly by a so-called line pool. Given a set of terminals, a train line
is a route with stops in the intercity railway network that starts and ends at a terminal [
30,
32]. The set of edges of a train line
is denoted by
, and the stop sequence of train line
is denoted by
. We denote by
the set of train lines operating on edge
. The connection between
and
is called a riding arc and forms the train service network, and passengers can choose a directed path from origin to destination in this train service network. The set of riding arcs is denoted by
. Note that each riding arc
is associated with precisely one train line
, denoted by
. Denoted by
is a set of possible frequencies at which these train lines can be operated. The number of rolling stocks required of train line
with
is denoted by
, with
being the turn-back operation time at the terminal,
being the dwell time at the intermediate station, and the daily operation time of the intercity railway line being 18 h in general. The capacity of a train of train line
is
. The fixed operation cost of a train is denoted by
, the dwell cost at an intermediate station is denoted by
, and the variable operating cost per kilometer is denoted by
.
In general, the railway operation department may specify in advance the set of possible railway lines each depot is allowed to be responsible for. This may be arranged according to the distance between the depots and the intercity railway lines. For this reason, we introduce the binary parameter , if the train lines of an intercity line are within depot ’s service scope, and it is otherwise. The empty run distance between depots and the initial station of the train of a train line is denoted by .
Let
be the set of all OD pairs. The number of passengers that want to travel from
to
is denoted by
. The possible directed passenger paths from
to
are denoted by
, and denoted by
is the set of all such paths. The travel time of path
is
with
is the number of intermediate stations on path
. The unit time value of passengers is denoted by
. Since depot location is a strategic decision problem, many input parameters, especially the passenger demand, will change significantly when the lines are put into operation. We introduce a simple modeling trick [
33] to deal with the uncertainty of passenger demand. The passenger demand uncertainty is described through a set of alternative scenarios: each scenario is associated with a passenger demand. Each of the input data scenarios has a positive probability to be realized. Let
be the input data scenario index and
be the set of all possible scenarios. Each input data scenario is associated with a positive probability
. For scenario
, the number of passengers that want to travel from
to
is denoted by
.
3.3. Model Formulation
For specific input data scenario
, the relationship between multiple sets of decision variables is shown in
Figure 2. The variables are classified into three groups: those related to the depot location and rolling stocks work demand assignment of the problem, those in charge of modeling transit assignment in the network, and those related to line operation decisions. The first group includes integer variable
describing the number of storage lines allocated to depot
, and
describing the value of the rolling stocks work demand provided by depot
for train line
in scenario
. The second group includes integer variables
counting the number of passengers on path
in scenario
. And the third group includes the binary variables
,
if train line
is operated with frequency
, which equal 0 otherwise in scenario
.
The integrated maintenance location and line planning problem (Model 1) is formulated as follows:
subject to.
The objective Function (1) minimizes the sum of the construction cost, line operation cost including the fixed, variable, and empty train drive costs, and passenger travel time cost, and the first item is the construction cost of the depot, the second item is the fixed and variable costs, the third item is the cost of an empty train drive between the depots and initial stations and terminal stations of train lines, and the last item is the cost of passenger travel time. In particular, the cost of empty trains can be used as a measure of the rationality of the depot location. Constraints (2) stipulate a passenger flow of for each OD pair . Constraints (3) enforce a sufficient transportation capacity on each riding arc. Constraints (4) ensure that a train line is operated with at most one frequency. Constraints (5) bound the sum of the line operation frequencies for each edge. Constraints (6) ensure that all rolling stock demands of the operated train line are fully satisfied. Constraints (7) ensure that the total assigned rolling stocks do not exceed the capacity of the depot. Constraints (8) limit the capacity of the depot. Constraints (9)–(12) define the related decision variables.
The robust depot location aims to find a solution that is within
(may be prespecified input of the designer) of the optimal solution for any realizable input data scenario, and the objective is to minimize a weighted sum of the total cost of all input data scenarios. For a given set of input data scenarios
, a depot location scheme is robust if and only if:
where
is the optimal solution of input data scenario
.
Our robust optimization model of therobust integrated depot location and line planning problem (Model 2) is given.
subject to Equations (2)–(13)
4. Solution Framework
The proposed formulation is an integer linear programming model integrating depot location and line planning, and it can be directly solved by commercial optimization software (e.g., GUROBI v9.5.1, etc.) for small-scale problems. However, in most real-world cases, it is usually difficult to find a satisfactory solution in an acceptable timeframe, e.g., the Chinese Urban Agglomeration with many intercity railway lines, and sometimes a feasible solution cannot even be found using commercial software. Most scholars hold that the heuristic or meta-heuristic algorithm effectively solves such complex engineering problems [
34,
35]. As illustrated by Gutiérrez and Kouvelis et al. [
33] and Canca and De-Los-Santos et al. [
29], decomposing the problem into several sequential steps is an effective solution strategy. Here, we introduce a solution framework that is capable of jointly solving maintenance location and line planning problems.
The proposed iterative framework consisting of the DE algorithm [
36] and improved rounding heuristics [
32,
37] (DE-IRH) is described in Algorithm 1. The solution framework consists of two layers: the upper layer solves the depot location problem using the DE algorithm, and the transit assignment and line operation problems are solved by the improved rounding heuristics in the lower layer. Note that the robustness constraints (23) are relaxed, and the objective function will be punished when the constraints are violated. Therefore, at the lower layer, the optimization problem will be decomposed into
independent subproblems, which can be solved in parallel. Since there is no difference in the solving of subproblems of different scenarios at the lower layer, we only give a general algorithm framework here.
Figure 3 summarizes the overall algorithm.
Algorithm 1: Pseudocode of DE-IRH |
Data: Input data for RIDLLPP problem |
While the stopping criterion is not satisfied do |
| Upper layer |
| Generate new depot location solutions |
| Compute the associated construction cost of each solution |
| Lower layer |
| Solve the transit assignment and operation problems on each current solution |
| Compute the total cost |
| Compare and keep the best solution |
end |
Result: the minimum total cost and the best solution |
4.1. The Upper Layer Subproblem: Depot Location
At the upper layer, the DE algorithm is used to search for the optimal depot location scheme and control the full procedure. The DE algorithm is a promising optimization algorithm that converges to the real optimum without using significant amounts of resources and has been applied in a wide range of domains and fields of technology [
38]. Starting with an initial solution population, at each iteration
, three vectors,
, are randomly chosen from the current population to construct a new mutation vector
by mutation operation. There are two common mutation strategies, ‘DE/rand/1/bin’ and ‘DE/best/1/bin’. The former focuses on improving the global search ability, while the latter focuses on improving global convergence. We use the latter in this paper because it can produce better solutions (see
Section 5.2). The associated equations are:
DE/best/1bin/:
where
is the mutation rate. The mutation process is the main distinctive component of DE and controls the evolution direction of DE. Then, the uniform crossover is used to construct trial vectors from vector
and vector
which is different from
and
. The associated equation is:
where
is the crossover rate. The next operation is selection, in which the trial vector
competes with the target vector
as follows:
where
is the objective function. In order to obtain the objective function value of the vector, we need to input the vector at the lower layer algorithm, which will be described later. If the new vector
is less than or equal to the target vector
, it replaces the target vector. Otherwise, the population maintains the target vector value.
4.2. The Upper Layer Subproblem: Transit Assignment and Line Operation
At the lower layer, for a given depot location scheme, we face the problem of determining the frequency, passenger assignment, and rolling stocks work demand assignment for each line. The associated model is obtained by setting the variable
as a constant according to the vector from the upper layer. The improved rounding heuristic is developed to solve the lower-layer model. There are four main steps of the improved rounding heuristic: First, the linear relaxation solution of the original problem is obtained by a commercial solver; second, counter the number of passengers on each riding arc,
, filter out the integer variable
, and update the number of passengers on each riding arc; third, recalculate the frequency of the remaining variables
according to the passengers on the riding arc; fourth, recalculate the rolling stock assignment according to the variables
of the linear relaxation solution. The detailed improved rounding heuristic is shown in Algorithm 2.
Algorithm 2: Pseudocode of improved rounding heuristic |
Data: Input depot location solution (vector ) |
1 | Obtain LP solution () of the lower layer model |
2 | Set riding arc weights |
3 | for do |
4 | | for do |
5 | | | if then |
6 | | | | |
7 | | | | for all |
8 | | | end |
9 | | | if is fractional then |
10 | | | | |
11 | | | end |
12 | | end |
13 | end |
14 | for do |
15 | | |
16 | | |
17 | | for all |
18 | end |
19 | Calculate the number of rolling stocks of train lines |
20 | for do |
21 | | set depot capacity (according to the vector ) |
22 | | for do |
23 | | | if and then |
24 | | | | |
25 | | | | |
26 | | | | |
27 | | | end |
28 | | end |
29 | end |
Result: IP solution () of the RIDLLPP model |
At each iteration, the improved rounding heuristic provides the solution to the passenger assignment, line operation, and rolling stock assignment problems of the input data scenarios. It should be noted that the depot capacity constraints (20) should be checked after rolling stock assignment because they may not have sufficient capacity to store the rolling stocks of the train lines. If this happens, we will update the capacity of facilities according to the current rolling stock assignment plans.
6. Conclusions
This work addresses the robust integrated depot location and line planning problem characterized by uncertainty in the input data in an intercity railway network with planning lines. The key factor in our problem is considering the dynamic relationship between depot location decisions and line planning decisions. We have proposed an integer linear programming model integrating depot location, line planning, and rolling stock work demand assignment to minimize the cost related to depot construction, the cost due to line operation (including fixed, variable, and empty train drive cost), and passenger travel time cost. The proposed model reflects the service provider’s point of view but considers the user’s travel time cost as well.
We have described the DE-IRH solution framework consisting of the DE algorithm and improved rounding heuristics for integrated depot location and line planning faced by a transit planning agency capable of solving instances of realistic sizes. We evaluated the impact of two different mutation strategies and the control parameters on the performance of the DE-RH through a series of numerical experiments. We compared the results with Gurobi to verify the performance of the DE-IRH. As reported in
Section 4.2, our algorithm can yield an optimality gap of 4.87% within less computing time. In the real-size scenario, the solver cannot find a feasible solution in an acceptable time, while our algorithm can quickly provide the decision-maker with a better solution. The numerical results reveal the influence of the uncertainty of the input data on strategic decision-making, and the outputs obtained through the model can provide useful information for strategic and tactical decisions of railway departments.
The proposed method is more applicable for the single type of maintenance depot location problem, that is, there is only one type of rolling stock in operation. The maintenance capacity of the depot is not discussed in this paper. As further work, it would also be interesting to consider multi-type facilities and maintenance capacity. This also includes the development of a new efficient heuristic algorithm for solving very large instances of the model.