1. Introduction
By limiting distances between inhabitants, urban density is suggested to be a major determinant of urban transport efficiency, as manifested in energy consumption and greenhouse gas emissions for transport [
1,
2], private vehicle use and fuel consumption [
3], modal split [
4], trip frequency, and share of active transport [
5]. Containing urban expansion is, therefore, one stated goal of the UN Sustainable Development Goals [
6]. However, urban density is clearly not a panacea for all of society’s problems. The beneficial effects of density on transport in support of ‘town-cramming’ are confounded because people with different traveling preferences are likely sorted spatially [
7]; the effect of densification on congestion is not accounted for [
8]; and the benefits of densification on transport energy consumption would be very limited [
9]. Others argue that, through economies of scale, urban efficiency primarily increases with population size [
10], and in fact, only residual deviations from general scaling laws are attributable to population densities [
11]. Higher-density residential developments are often considered less desirable, and realizing high densities in urban expansion projects may be relatively costly because of institutional barriers [
12] and the common necessity of public intervention [
13]. Densification of the existing urban fabric is plausibly even more expensive. Urban density loss and urban expansion thus persist even in regions facing population decline [
14]. Clearly, sound arguments in favor of dense urban development are called for.
Many studies focus on the effect that urban densities and public transport availability have on modal shifts. Bento et al. [
15] study vehicle ownership and private vehicle usage in cities in the United States in relation to, amongst others, urban densities, public transport network length, and distance to public transport stops. They find that shorter distances to the central business district and shorter distances to public transport stops reduce private vehicle usage, while more compact city forms and lower distances to public transport also reduce vehicle ownership. Levinson and Kumar [
16] similarly find higher transit mode shares in higher-density cities. Rickwood et al. [
17] propose the existence of positive feedback between densities, public transport availability, and transit ridership shares. Proximate public transport supply would be necessary to facilitate ridership, and high densities would be necessary to facilitate proximate public transport stops.
Whether denser land use can indeed foster proximate public transport availability is the topic of this paper. We expect that lower-density cities would require a much more sizeable network (and hitherto investment) to supply proximate public transport to a sizeable part of the population. Other explanations may be that cities need sufficient population mass to allow proximate public transport supply or the effect of regional urban form, as the existence of a commuting zone or satellite suburbs may considerably affect the difficulty of providing proximate public transport.
One way to analyze public transport availability is through the analysis of available data on public transport supply [
18]. Unfortunately, for a comparison of public transport networks, real-world public transport networks are not very useful, as political preferences, budgetary considerations, and physical geography play an important role in network expansion considerations and, thus, in network outcomes [
19]. In fact, it is plausible that, through path dependency, urban forms and public transport availability coincide [
20,
21]. For a fair comparison, network expansion considerations need to be held equal among all observations. In order to obtain the wanted ceteris paribus data, counterfactual data have been produced for this study by adopting a simulation approach. The sole aim of that approach is to simulate network expansion until a fixed percentage of a city’s population is connected by the network in order to verify whether higher-density cities can achieve high public transport availability at a comparatively limited cost. We assume here that longer transport networks are more costly to construct and operate. The next section will describe the simulation method; subsequent sections will present results and reflect on the method. Some final remarks will conclude the method and suggest venues for improvement.
Besides providing a counterfactual, comparable dataset of public transport networks in cities around the world, this paper also contributes to the developing field of network expansion studies. Novel elements in this paper are the adaptation of a simulation approach to provide data for multiple world cities, the analysis of aggregate outcomes of the simulations, and a sensitivity test that verifies the robustness of the found results when key parameters are changed. Contributions to network expansion have, inter alia, been made by physicists, economists, and quantitative geographers.
Transport network development can be described as an innovation diffusion process [
22]. It is described as a sequential game [
23], with many agents and intermediate additions that shape the future market. The sequential nature of the process causes that, even with considerable coordination, the outcome of the process is a Nash equilibrium rather than an aggregate optimum [
24,
25]. Tied to this are economic considerations. Through network externalities [
23], the value of a transport network increases more than proportionally with the number of people it connects. Network externalities can thus lock-in, in which an entire market adapts to the available network, thus contributing to the positive feedback described by Rickwood et al. [
17].
In geographic terms, it can roughly be described in three stages, in which new links have different roles [
26]. Transport networks prefer routes with presumably high demand [
27]. Levinson [
28] found that the sequence of metro station openings in London more or less followed the population density ranks of boroughs. Yamins et al. [
29] proposed a two-step procedure to simulate road network diffusion within a city to explore whether road structures can realistically be simulated. Other contributions used network models to investigate the closure of road links [
30] and first-mover advantages [
31]. Raimbault [
32] modeled the co-evolution of railway networks and urban populations, although the geographic paths of the expanding network could be omitted there. Raimbault and Le Néchet [
33] have setup a model that simulates interactions between land use and transport, by which transport network upgrades aim at maximizing aggregate or local accessibility gains.
2. Simulation Method
For this paper, network expansion simulations have been executed using the so-called Public Transport Allocator (PTA) heuristics [
34]. These heuristics have been set up for this exercise to simulate an ad hoc style of network expansion, iteratively building one link with a high estimated return on investment until the expansion process is deemed complete. Those routines have been applied to model networks in 36 world cities. Subsequently, the lengths of those networks were compared, and general linkages between density and network length were identified. All simulation results are available through [
35]. A key aspect of the method is that it iteratively forms a network by finding presumably profitable paths between termini, described as nodes. Those nodes are considered representative of the location of population masses in this application, observed in a 1 km
2 regular lattice of grid cells.
The methodological description further follows the vocabulary proposed by Rodrigue et al. [
36], discerning connections, which are target attachments that reduce travel costs between two selected termini but have no geographical form; paths, which are a spatially explicit collection of lines that form the connection between two selected termini and potentially pass intermediate nodes; and links, which are the individual line attachments between all observed nodes that can jointly form a path (see
Figure 1).
The modeling procedure is shown schematically in
Figure 2 and will be described in detail in the following sections, which deal with preparation, matrix-based connection selection, path designation, and end-state evaluation. A glossary of variables and parameters is given in
Table A1 in
Appendix A. The simulation approach draws roughly from the Transport Link Scanner model [
37]. It thus treats network development as a sequential game [
23], in which market conditions are affected by every addition to the network.
At the 1 km
2 resolution adopted in this study, the number of nodes is much larger in some world cities than the roughly 1000 considered in the Transport Link Scanner application. In addition, to connect 80% of the population of a city, many more model iterations are needed. Thus, the computational burden is much larger compared to [
37]. To be practically able to simulate urban transport network growth, the adopted approach has been simplified considerably. Only the first two steps are included. Those steps are the repeated selection of one connection, based on the estimated costs and benefits of upgrading connections between pairs of termini, and the generation of the most profitable path for the selected connection. The process is broadly similar to Yamins et al. [
29] in that iteratively, only one pair is selected for an upgrade, and subsequently, an optimal route is searched between the selected termini. Salient differences with Yamins et al. are that in this paper, (1) pairs are selected assuming that the investing agent aims at maximizing passenger kilometers traveled on the new link; (2) routes are sought that balance the additional returns from a detour versus the additional costs; and (3) the simulations are run using granular, observed population distributions in multiple world cities.
The population distribution is exogenous and held fixed throughout the simulations. Only one agent is expected to be active at any investment, and only the return on investment on the new link is expected to be relevant to select investments. In addition, network effects that can come into play, for example, when the hinterland of a terminus uses a new link as part of multimodal travel, are discarded in the computation of return on investment. When estimating the return on investment, only returns from direct trips between termini are taken into account. Despite the simplifications, the routine still takes multiple days to simulate expansion in extended cities. The computational complexity is described in
Appendix B.
2.1. Preparation
The modeling sequence starts by isolating all inhabited grid cells g ∈ G, with G consisting of all grid cells that have their center within the geography expected to be part of a city. Functional Urban Areas (FUA) [
38] have been used to delineate separate urban systems. Gridded populations in 2015 and the FUA boundaries derived from those grids were obtained from the 2019 release of the GHSL global grids [
39]. It must be emphasized that the used population grids were created by disaggregating population census results over remotely sensed built-up fractions so that population totals and the boundaries of urban systems may deviate from official statistics. The geographic centers of grid cells g are used here to represent the terminus of spatial interaction with the population in those grid cells, despite the fact that population-weighted centers are a more accurate representation of population location in spatial interaction applications [
40]. Given the fairly abstract modeling approach taken here, the additional inaccuracies caused by spatial aggregation are considered hardly relevant.
Links are added from all inhabited grid cells g to all other inhabited grid cells g that are maximally two steps away from the grid cell of interest, so that all grid cells have direct links with a maximum of 16 grid cells in their neighborhood (see
Figure 3 for a schematic representation). It is important to note that links to direct neighbors (for example, 0°, 45°, and 90° from an origin node) only connect first-tier neighbors directly. Second-tier neighbors are not connected directly because those connections can be made by passing over the direct neighbor. We do not expect that omitting these links would affect the results negatively, as there are no inherent penalties imposed on passing through intermediate nodes. Grid cells that do not have any neighbors within two steps are given single links to the closest neighbor(s) that are within the closest distance.
At the start, all links are attributed a relatively low effective speed for transversal of that link on foot. In the reported application, effective walking speed V0 is set to 4 km/h, which is considered a realistic walking speed when assuming walking at 4.5 km/h and presumable detours that are not taken into account in the drawn straight lines. Other parameters that are set a-priori are (with the main settings between parentheses) the assumed effective speed V1 of the introduced public transport mode (30 km/h); the fixed monetary costs X of a link (the equivalent of 10 km of line); the fixed waiting time W for accessing upgraded links from an entry node (10 min); and the maximum circuity γ with which a path may deviate from the straight line (1.5). The chosen parameter values represent admittedly ad hoc values that are not necessarily representative of the travel characteristics or the public transport systems in the studied cities. A limited sensitivity analysis has, therefore, been executed to explore the effects of the adopted settings. In a later section, we will discuss this sensitivity analysis in more detail.
2.2. Matrix-Based Connection Selection
As in Yamins et al. [
29] and Levinson and Xie [
31], this approach comprises an iterative procedure that repeatedly searches for candidate connections to upgrade, selects one connection, and adds that connection as a plausible path to the modeled network so that the links that form that path obtain a lower transversal time. In every model iteration, the approach starts by computing a travel time matrix between all inhabited grid cells using a shortest path algorithm based on the network of links, including prior upgrades. All links are expected to be able to exchange passengers at joint nodes without incurring additional travel costs, regardless of the direction of those links or whether any links are upgraded or not.
Subsequently, for each origin–destination pair, the return on investment of a public transport network connection between that particular pair is estimated. The return on investment of upgrading the connection between
i and
j is estimated as (1):
where
B0 and
B1 are expected passenger mileage with (
B1) or without (
B0), an upgrade between termini
i and
j;
C consists of construction costs; and
X is an additional fixed cost. The cost is estimated to be the length of the shortest possible path between termini. We acknowledge that in the real world, physical geography can pose considerable challenges for network construction, which may limit network construction paths and cause substantial variation in cost density. As in other contributions [
29,
31], this will be ignored for the sake of simplicity, along with other practically relevant factors such as institutional and private organizations.
B0 and
B1 are computed using an origin-constrained gravity model, where
B0 is computed as (2):
in which
P is the population at the origin or destination. The potential accessibility of the node is used to constrain trip production, as in a singly constrained interaction model [
41]. It is computed as
. Given that social interaction grows superlinearly with city size [
42], some elasticity of consumption on potential accessibility may be supposed, as can be specified in Alonso’s General Theory of Movements model [
43]. In the absence of fitting general parameters, a fully origin-constrained specification is considered the most conservative and has, therefore, been maintained here. Both
and
depend on the function
, which is a distance decay function of travel times obtained from the concurrent network. In the application, this function is defined as
. Finally,
L is the Euclidean distance between
i and
j. That variable proxies for the length of trips between the termini, so the benefits depend on the estimated trip length. Without a doubt, the choice of revenue model is relevant for the outcomes; it may be expected that simulating transport systems with other revenue models (for example, fixed fees to enter the system) may yield very different outcomes.
In the same vein as (2), the expected passenger mileage with an upgrade is (3):
where
is a distance decay function of travel times that are obtained in case a network upgrade is constructed for the od-pair at hand. Upgraded travel times are computed as
, so that travel time savings depend on the Euclidean distance between termini and the upgrade speed and are limited due to the fixed waiting time penalty in
W. That penalty is expected to simulate unavoidable delays due to scheduling, which presumably are most felt in the travel time savings of short trips. The computationally simplest way to solve this equation would be by computing
A1 as the accessibility if all links were upgraded. However, with many destinations, assuming general travel time improvements will substantially overestimate the change between
A0 and
A1, thus leading to the underestimation of the benefits of a single investment between two termini. We therefore introduce a modest adaptation of
A1, which we compute with a value specific to each pair of termini rather than generally per origin, so that (4):
where
A1 includes the additional interactions between
i and
j obtained when upgrading only the connection between the termini at hand. The returns on investments from (1) are estimated for all potential origin–destination pairs. The od-pair with the highest return on investment is selected, after which an optimal path is searched for that pair.
2.3. Path Designation
The shortest path between two termini is not necessarily the most attractive path for an upgrade. In the modeling routines, a path can detour from the straight line if a detour makes line operation more profitable while still providing a substantial travel time improvement between the selected termini. A corridor allocation problem-solving approach [
29,
44] is used to find the most attractive path between termini. Using such a corridor allocation approach, the routine derives a number of potential paths and estimates their return on investment. Subsequently, only one path is selected that, within a specific maximum distance, meets the highest proxied return on investment. Thus, only the links comprising one path are selected for a speed upgrade in every model iteration.
The maximum distance
is computed as the minimum value from two separate criteria, namely the maximum circuity parameter
, and the maximum distance
at which the potential travel time gains between the selected termini are maximally halved to ensure that the link will remain reasonably profitable for the selected pair of termini. Thus, maximum detour distance is computed as (5):
in which
is the Euclidean distance between
i and
j,
is the maximum distance based on circuity, and
indicates the maximum detour that is possible if at least half the potential travel time gains are obtained, so that (6):
Subsequently, for each link on the network, the following impedance is computed:
in which the impedance of every link depends on the population connected to the nodes that each link connects, the construction costs of each link, and the length of the link. This impedance is used in the shortest path searching method to find the most profitable path. For the sake of simplicity, construction costs are held linear with link length; more intricate specifications may be imputed [
29,
37] but were neglected to avoid incomparability of network outcomes between cities. The return on investment of a path is computed separately for two cases. In cases where the length of the path is lower than
, the return on investment is computed as the sum of the population connected by the path, divided by the sum of construction costs; or, in cases where the length of the path exceeds
, the return on investment is computed as the inverse of the length of the path.
The k parameter affects the balance between the left (return on investment) component and the total length of the link. With k = 0, the balance leans towards return on investment; with k = 1, only length considerations are taken into account, and the method should yield the shortest path. The routines implemented here start from k = 0 and gradually increase the value of k, with the aim of finding the link with the best-proxied return on investment. We consider the outcome close to an optimal path, given the limited information in the procedure, and therefore, a reasonable proxy for a plausible path.
2.4. End State Evaluation
The selected links are flagged in the network as being upgraded and obtain the public transport travel speed instead of the walking travel speed. A grid cell is flagged as connected if an upgraded link terminates in its node, so that (8):
Subsequently, the percentage computed population is computed as (9):
Finally, the decision is made to end the simulation or end it based on the criterion
. If the connected population is at least the target connected percentage, the final result network of public transport network links is stored as a shapefile. If not, the procedure continues by repeating step 1 on the network, including the public transport link added in this modeling iteration.
3. Results
The outlined simulation method is applied to obtain results for cities across the globe. To do so, gridded populations from the Global Human Settlement Layer [
45] have been selected within the bounds of the Functional Urban Areas [
38] of cities that are deemed either well-known or representative of a specific style of urbanization; all continents have more or less equal numbers of cities. The full list of cities and their results is in
Table A2 in
Appendix C. The approach is programmed to be executed by the open-source GeoDMS software (version 8.043), using the efficient routines of that software for the generation of travel time matrices. The programmed routines are available upon request. Execution times varied substantially for the selected cities, depending on the number of model iterations needed to connect 80% of a city’s population. Cities with modestly populated areas, such as Hong Kong, took a couple of hours; extensive cities, such as Tokyo and Houston, took multiple days. Examples of the iterative network simulation process are shown in
Figure 4.
Simulated network lengths were aggregated into a single statistic for each FUA, along with a number of other variables. It is well known that aggregate urban statistics depend on the delineation of analyzed areal units [
46,
47], so any comparison between cities is easily invalidated. FUA areas are used here as units of analysis because they provide an objective delineation of wider urban areas based on geographic characteristics rather than chance, physical geography, or political considerations; we expect that they, therefore, provide a fairer basis for comparing the performance of the selected cities than administrative boundaries.
To measure urban efficiency, network lengths are measured per capita. Shorter networks per capita are deemed more efficient. We find that population-weighted densities are the key determinant for outcome per capita network lengths. Those population-weighted densities are a particularly useful measure here, as, compared to regular density measures, they are relatively insensitive to delineation issues [
48].
Figure 5 shows a plot of the resulting network length per capita over a city’s population-weighted densities. The same loglinear improvement of efficiency with density, as previously shown in, for example, Newman and Kenworthy [
1], is clearly visible here. One immediately obvious result is that denser cities are more efficient in providing transport, with lengths per capita increasing drastically when population-weighted densities fall below 5000 per square kilometer.
Clearly, there is still variance in modeled network lengths between cities with similar densities, indicating that not only urban densities are relevant for model outcomes. Multivariate OLS regression analyses were executed to measure the relevance of potentially relevant factors to the modeled network length. Variables that were tested are population-weighted density, total population, absolute population, the share of population living in an FUA’s center, the number of necessary modeling iterations, and dummy variables indicating the world region. With regard to the percentage of a population living in an FUA’s center, those are measured as the percentage of a population living in so-called high-density clusters within an FUA. The remainder of an FUA’s population is expected to live in commuting zones. Many model specifications were fitted, the results of which all pointed towards the same conclusions. The results from only three fitted specifications are reported here, and other results are available upon request. The most straightforward regression (i) explained variance in network lengths per capita (
, using population-weighted densities
D, population
P, and
%PC, the share of the population living in the area identified as an FUA’s center. Thus (10):
A second regression (ii) additionally controlled whether differences in city characteristics between world regions have an impact on model outcomes. This was accomplished by replacing the constant
with separate constants
for the world regions
R in (11):
The identified world regions are Africa, Asia, the Middle East, Europe, North America, Australia, and South and Middle America. Australia was combined with North America here because of the similarities in urbanization styles that those regions have.
Last, to control whether the effect of population-weighted density holds in a regular scaling regression [
11], a third regression fits an equation similar to (iii), however, on total network length, so that (12):
Other tested specifications did not yield very promising results. Most importantly, including the number of model iterations, total population, or absolute center population did not yield significant estimators in any specification
1. Given the sizeable explained variance in all three regressions, we expect that the major explanatory factors are captured in these regressions. From pairwise correlations, the variable has limited correlation (with a Pearson r-score below 0.41), except for
and
, which do exhibit considerable correlation (0.76). Given that these variables do not seem to cause mutually inflated significance levels, we exclude problematic multicollinearity in our results.
The results of the regressions (10, 11, 12) are given in
Table 1. In all cases, population-weighted densities have, with considerable significance, a negative effect on modeled network length. This confirms that, per capita, modeled transport networks are substantially shorter with higher population densities.
From the results of (12), it follows that total network length responds slightly sublinearly to city size, indicating the existence of economies of scale [
10] in the modeled infrastructure provision. However, while population-weighted densities have significantly negative effects in all tested models, no significant city size effects remain when explaining the variance in per capita network size in (10) and (11). The implication is that cities can obtain shorter transport infrastructure by increasing density, regardless of their scale. Population-weighted density and total population have a limited correlation (
D and
P have an r of ~0.25, ln
D and ln
P have an r of ~0.35), so size benefits are not implicitly captured in the density variable. Thus, the length of the simulated network is subject to economies of density, not scale. This is contradictory to findings from scaling law studies [
10,
11]. One explanation may be that scale-specific aspects that play out, e.g., capacity requirements and traffic coordination costs, are not included in the simulation approach.
The share of the population in city centers has a monotonously negative effect on modeled network length as well, but only significantly so in models in which absolute (not loglinear) population-weighted density was used, confirming that population densities are the dominant explanatory variable here. The fact that the share of the population living in an FUA’s center does not significantly affect network outcomes is surprising. A possible explanation for the lack of significance could be the bias of the model for central areas. If the selected cities all have a substantial part of the population in central areas, the simulated networks are almost exclusively allocated in the center, and the commuting zones remain unconnected.
Figure 6 shows a linear relationship between the share of the population living in a commuting zone and the percentage of the network allocated in that commuting zone. Clearly, the simulation method is not necessarily biased toward central areas. A possible explanation is that due to the optimization of passenger kilometers in the simulation approach, the revenue benefits from higher population densities in central areas are offset by the fact that commuting zones typically require longer travel distances, so connecting people in commuting zones yields a higher per-capita number of kilometers traveled.
Despite the lack of significance, models, including the share of the population living in the center, are nonetheless presented, as that variable does seem to quite robustly contribute to explaining variance in modeled network length, even with the relatively low number of observations used in this study. The negative effect of the share of the center population indicates that, to a limited degree, the concentration of population in a single city center is more efficient than, for instance, distributing population over several satellite settlements, even if those satellites are populated at a high density.
Regression (11) introduces region-specific fixed effects to control for regional variances in infrastructure efficiency. The effect that those region-specific estimators have on the coefficients for population density and share of the center population suggests that regions have structurally different values of the explanatory variables. This is corroborated by the fact shown in
Figure 2 that, for example, the selected cities in North America and Australia have relatively low population-weighted densities, while the selected Asian cities have relatively high population-weighted densities. The effect of regional fixed effects is such that, in general, the modeled Asian and African cities have lower per capita network lengths than their European, North American, and Australian counterparts.
5. Discussion
This paper introduces a network expansion simulation method to establish whether density fosters shorter public transport networks and subsequently discusses the results of simulating such networks in cities throughout the world. This approach is useful as it allows us to assess the impact of population densities given the counterfactual situation that all analyzed cities have public transport networks, with stops proximate to a larger share of the population, that are based on the same shaping principles. The results confirm that population-weighted density is a crucial factor in explaining simulated network outcomes; denser cities can have shorter and, thus, presumably more efficient networks while having infrastructure that is proximate to citizens. On the other hand, neither population size nor urban structure have significant effects on network length. Overall, the results presented here complement earlier contributions that favor high densities, as those foster, amongst others, public transport availability, ridership, and mode share [
4,
5,
16,
17]. A sensitivity analysis corroborates that the results are robust for a wider range of simulation parameters.
It must be stressed that the results are based on a hypothetical approach with unrealistic assumptions. Commonly, history matters for network outcomes [
23], for example, through first-mover advantages [
31]. The adopted simulation approach does model network expansion as a sequential process in which prior investments shape future investments. However, it treats all the analyzed cities as tabula rasa, not taking into account the existing infrastructure. Furthermore, the costs of overcoming distance were held equal across all cities analyzed. The analysis thus does not take into account the many factors that may affect the supply of public transport locally, for instance, restrictive physical geography, legal conditions, ownership issues, or environmental considerations, even if those matter empirically [
37].
Another assumption that could not be tested regards the relation between network length and network efficiency, which in this paper is considered linear. On the one hand, hauling more passengers over shorter networks likely enables economies of density [
50]. On the other hand, higher transport densities presumably come with an increased demand for infrastructure capacity and higher coordination costs [
23]. In addition, the assumption that additional link upgrades always decrease overall transport costs does not hold when congestion is taken into account (and reduces travel flow) [
51]. All in all, more research is needed to establish the bounds of density and efficient public transport provision.
Public transport networks are subject to network externalities [
23] and presumably through positive feedback loops of urban density, public transport availability, and public transport ridership [
17]. A city that wants to achieve a modal shift towards sustainable transportation may improve the proximate availability of public transport stops at a lower cost if its citizens live at a higher density. This also emphasizes the problem that cities with low densities are locked into passenger car mobility lock-in [
21], as it may be prohibitively costly to roll out a public transport network that provides sufficient proximate connectivity to public transport infrastructure across a low-density urban system.