1. Introduction
Reducing road vehicle emissions in urban areas represents a big challenge for Europe [
1]. Electric mobility (e-mobility) applied to both private and public transport is one of the most promising solutions to reduce air pollution in densely populated areas as well as to support energy diversification and cut greenhouse gas emissions [
2].
Electric vehicles (EVs) have expanded significantly over the last decades, largely driven by the mass expansion of this transport mode in China accounting for nearly half of all vehicle sales [
3]. Despite its rapid global growth, the European EV market has shown moderate progress, often connected to support schemes. Most EVs are concentrated in a few northern and western EU Member States, although southern and eastern ones have recently recorded the highest sales growth. However, according to forecasts from Deloitte for 2030 [
4], EV sales will achieve a domestic market share of around 42 percent in Europe. Moreover, the IEA’s Stated Policies Scenario [
3] forecasts that the share of demand due to electric vehicles in total electricity consumption in Europe by 2030 will grow to as high as 4%.
The wide-scale adoption of EVs in urban areas poses new challenges with respect to charging infrastructure, covering a wide variety of transport patterns and needs. Due to the impact on the electricity grid, infrastructure improvements need to be made to optimize EV charging operations and to ensure a smooth integration with renewables-based energy generation systems. The investments and infrastructure requirements can vary significantly from one urban context to another according to specific market characteristics, such as local infrastructure, land use, energy system, mobility culture, and patterns.
This paper proposes a modeling framework that can be useful to evaluate short-term and medium to long-term measures to avoid energy congestion in urban areas; specifically, it aims to simulate multimodality between EVs and public transport (Park and Ride), as well as to estimate energy demand requested by EVs as a function of the land use of the city these are moving to.
The correct simulation of a short-term measure, such as promoting multimodality between EVs and public transport in Park and Ride sites located in suburban areas of the city, envisages a twofold goal from the energy viewpoint. Firstly, energy demand can be shifted differently in space and time, with respect to the request derived by the usual commuter trips adopting only private e-vehicles; moreover, it would promote an efficient charging management through a vehicle-to-grid approach in which the EVs assume the role of “demand peak stabilizers” in the parking sites.
Indeed, a correct estimation of energy demand as a function of the land use can help in the medium- to long-term in moving urban planning toward a more energy-saving approach.
Thus, the entire modeling framework, once implemented into a simulator for EVs, can be adopted by local governments, in cooperation with mobility planners and energy providers, to promote a sustainable development of e-mobility.
In both cases, specific models adopting-data-driven approaches have been realized and tested. Different sets of floating car data (FCD) derived by probe vehicles moving on the road network of Rome (Italy) have been exploited for this aim; hence, the city of Rome is the reference case for the study.
The paper is structured as follows:
Section 2 presents the literature supporting the modeling framework and the research contribution. The methodology behind the developed models is described in
Section 3: here, the main dataset, i.e., the FCD dataset, is also described to obtain a useful insight into Rome’s private vehicle travel patterns.
Section 4 shows the resulting models and their calibration and validation for the Rome case study. Possible policy implications about multimodality through Park and Ride for EVs and energy-oriented land use are reported in
Section 5. Conclusions and further developments follow.
2. Literature Outcomes and Research Contribution
The literature supporting the development of this study has been divided according to the following topics: (1) simulation tools for EVs; (2) planning measures for the sustainable development of EVs in urban areas; (3) multimodal models for estimating Park and Ride demand; and (4) how to realize urban development oriented to e-mobility.
In [
5], a detailed review of simulation tools for EVs is reported. The main elements of EV simulators are the simulation of trips, the modeling and analysis of charging stations, the charging behavior, and the impact of EV user decisions on the electrical network distribution. EV simulation can be implemented by adding specific modules inside flexible traffic simulators such as SUMO [
6] or MatSim [
7] or, alternatively, by developing ad-hoc instruments. Most of the simulation performed by the aforementioned tools deals with EV consumption models, charging behavior, charging design/management, and vehicle to grid strategies [
8,
9,
10].
Indeed, charging management and vehicle-to-grid are also adopted to reshape the charging load profile on the power grid, thus reducing line congestion or voltage deviations. Alternative strategies to reduce the energy load can be more “transport oriented”, since EVs and their supporting charging infrastructure act as a glue between the transportation and the electrical system [
11].
The concentration of energy demand can be verified in terms of both time and space, and it is well known from the literature that a specific activity distribution or travel lifestyle can impact charging patterns and, as a consequence, energy congestion [
12,
13].
In this paper, the models developed aim to be implemented into an ad-hoc simulator for EVs. As with similar tools (i.e., MatSim), the simulator relies on an agent-based system, which is characterized by two main levels: (1) a synthetic trajectory generator and (2) a recharge behavior model [
14].
In contrast, regarding the models here proposed, their aim is to simulate actions for smoothing the charging loads and specifically:
Reducing traffic congestion and related concentration of charging behavior through the modal shift toward public transport by means of Park and Ride sites. It is assumed that by promoting the Park and Ride choice, EVs can also contribute to vehicle-to-grid strategies in the parking sites;
Supporting urban planners, providing instruments able to evaluate the energy demand as a function of the travel pattern, and, consequently, as a function of the land use of the city.
As for simulating actions to promote/push Park and Ride with EVs, this is usually done by calibrating modal shift models. Most Park and Ride models are usually based on random utility theory such as logit models [
15]. Through logit models, the main variables affecting the Park and Ride demand can be investigated. For example, authors in [
16] found that the lack of parking spaces and the transfer time are the main variables in defining the level of Park and Ride demand; in [
17], the importance of car accessibility to Park and Ride sites and the waiting time of public transport is underlined. Only recently have other techniques based on machine learning been investigated also for modal shift models: Tai-Yu Ma [
18] analyzed multimodal choice behaviors through a Bayesian network, comparing it with a multinomial logit model. Thanh et al. [
19] adopted decision trees for the binary choice “shifting or not to other travel options”; a similar approach has been proposed by [
20] but using a support vector machine. Regarding the adoption of machine learning in combination with open data, tracking data, or real data not derived by survey, few examples can be found in the literature, but these are mainly related to estimating the level of occupancy of a parking site rather than to understand the potential for a modal shift [
21,
22]. Thus, there is still work to exploit these combinations of methods and data for modeling a modal shift.
Interaction between urban development and EVs is a significant topic in the literature. Indeed, EV deployment would play an important role in the spatial organization and structure of cities, and the land use will influence the charging request. Most of the contributions about land use and EVs deal with the optimal location of charging infrastructures as a function of the different functional areas of the cities or evaluating which facilities or points of interest are more interesting for EV charging [
23]. For example, Zhi Wu et al. [
24] work on locating multi-functional charging stations (MFCS) in a medium-sized city with different functional areas. However, the difference between the land use is only adopted to classify the vehicle travel destinations (assuming, for example, commuting trips mainly between residential areas and industrial or commercial areas and social and recreational trips mainly between commercial and residential areas). Other contributions deal with the effects of EVs in terms of urban sprawl: indeed, it is assumed that EV owners would drive more than they used to, even if they are shifting from very efficient ICEVs [
25].
Regarding the influence of the land use, mobility patterns, and related energy consumption, the only recently integrated land use–transport–energy model, able to incorporate the energy system explicitly, has started to be developed. Thus, there is evidence that new models to support the right association between these three systems are of strategic importance. Indeed, the spatial heterogeneity of urban areas calls for extreme care when evaluating the effectiveness of sustainable energy-oriented policies [
26].
This study will try to take a step forward with respect to the analyzed literature. The models that will be developed, specifically a Park and Ride model and an energy-oriented land-use model able to describe the transport energy profiles, will be (i) data-driven and (ii) easily implementable in any agent-based simulator.
3. Methodology
Since the models have the common factor of being data-driven, the methodology section starts describing the dataset adopted for calibration and validation. Then, modeling details are given about both multimodality and energy-oriented land use.
3.1. Available Dataset
In this study, the main data sources are timestamped geospatial locations provided by probe vehicles, i.e., floating car data (FCD). Additional data on mobility services and transport infrastructure are also collected, namely:
Park and Ride facilities for public transport inside the Rome metropolitan area;
GTFS data published by the Mobility Agency of the city of Rome that contains schedules, fares, and geographic transit information;
Census data about population and activities.
FCD are currently available worldwide; most of the vehicles (both cars and trucks, as well as fleets of vehicles such as a public transport fleet) are provided with an on-board unit, and tracking data are commercialized by several solution providers, thus ensuring accurate, reliable data for transport applications [
27]. The available FCD samples refer to private cars moving in the metropolitan area of Rome during two different time horizons: about 150,000 cars (160 million records, penetration rate 5%) tracked during May 2013 and about 240,000 vehicles (317 million records, penetration rate 7%) tracked during November 2015. The locations of points during the FCD trip are recorded together with the start and stop points. The recording frequency of the adopted data is set as follows [
2]: (1) every 30 s if the probe vehicle moves along the motorway network or some main roads in the metropolitan area; (2) every two kilometers elsewhere. The information available in the FCD is the vehicle ID, the geo-referenced position of the vehicle, the time and date, the instant speed, the state of the engine (on or off), the distance traveled from the previously recorded position, and GPS signal quality. Thus, it is possible to extract useful insight from FCD collections such as distances traveled, average speeds and related travel times, the final destination or the intermediate destinations of the trip, the parking time, etc.
The FCD have been used to extract vehicle home locations that can be taken as input to explain or predict multimodality and energy-oriented land use. In the first case, it is assumed that most of the trips connected to Park and Ride are home-based trips, since Park and Ride is usually adopted for commuting trips. In the second case, defining whether the starting/ending zone of a trip is the home location can help in understanding the motivation behind the trip, thus better defining the land use.
The residence location procedure (RLP) is conducted on a zoning system that represents a compromise between the census zones (13,656 zones) and urban zones (155 zones) of Rome, consisting of 1409 zones with an average area of 86 ha and an average population per zone of 1900 inhabitants.
The procedure works as follows:
trip destination locations of each FCD vehicle are extracted and mapped;
all the destinations are clusterized, adopting a DBScan clustering technique with parameter ε (i.e., the proximity radius between points in the same cluster) equal to 120 meters and the minimum number of points for each cluster equal to one;
overnight stops of each FCD vehicle based on arrival time and departure time at/from destination are extracted.
the residence is the cluster of destinations with higher values of overnight stops.
The procedure has been validated by comparing the resulting number of residences in each zone with the census population, thus showing a high correspondence (R
2 equal to 0.7465,
Figure 1).
3.2. Multimodality
Starting from the multimodality measure, we focus on the development of a suite of models able to represent multimodal choice behaviors between EVs and public transport in the parking sites located close to the metro stations in the suburbs of the city, i.e., Park and Ride (P&R) models.
All the models are data-driven, but two different approaches have been followed (
Figure 2): (1) an aggregated approach where both dependent variable and explanatory variables are aggregated variables at zonal level; (2) a disaggregated approach where the decision of each user is simulated as a function of several explanatory variables related to both individual features and zonal ones.
3.2.1. Formulation of the Park and Ride Models
Starting from the aggregated approach, the model has to be able to represent multimodal choice behaviors, thus assessing the share of the private EVs that can be moved onto public transport by using Park and Ride sites. The model assumes the following structure:
where G
OPark/G
O is the share of generated trips from the traffic zone O that would make P&R in the time interval Dt. The explanatory variables x
1, x
2, …, x
n are mainly related to: (1) the accessibility of the Park and Ride sites, (2) impedances due to the travel times on the public transport, (3) the location and occupancy of the P&R sites, and (4) characteristics related to the activity system in the starting point of the trip. The proposed explanatory variables were firstly proposed in [
28] and then updated according to [
14]. Specifically, the following ten explanatory variables have been adopted:
Variables depending on the starting point of the trip (origin/starting traffic zone O) and the starting time interval (Dt) of the trip:
Impedance on private transport x1 (2), i.e., the average access time from the traffic zone O to the P&R sites belonging to the catchment area of O, weighted for the O-P&R flows;
Impedances on public transport: x2, x3:
x2 (3) is an average travel time from the P&R sites belonging to the catchment area of O to all the destinations, weighted for the O-P&R flows and for the attractiveness of each destination;
In x3 (4) the travel times on public transport are weighted for the origin-destination (OD) flow;
Travel time differences between P&R and car-only mode x4 (5); x4 represents the travel time advantage the user may have following the adoption of the P&R mode. The advantage is positive if the total travel time, the sum of the parking access plus the travel time by public transport, is less than the time required to go from O to D by car;
The average occupancy of the P&R sites belonging to the catchment area of the origin/starting traffic zone O, x5 (6).
Variables depending only on the characteristics of the starting point of the trip origin/starting traffic zone O) and not by the time interval, specifically:
The variable x6 that represents the geographical extent of the origin/starting traffic zone O [km2];
The variable x7 that represents the population density of O (between 18 and 70 years old as possible users of the P&R service);
The variable x8, i.e., a binary variable equal to 1 if O is within the Main Ring Road of Rome (where the metro lines are located), 0 otherwise;
The variable x9 that represents the number of urban railway stations and metro stations in the starting zone O.
Variables depending only on the time interval, specifically:
tOP = travel time by car from the origin/starting traffic zone O to the P&R site P;
fOP = vehicle flow from the origin/starting traffic zone O to the P&R site P;
GOPark = generated trips from the origin/starting traffic zone O making P&R;
= average travel time on public transport to reach all the destinations of the city from the P&R site P;
fOD = vehicle flow from the origin/starting traffic zone O to the final destination D;
tPD = travel time by public transport from the P&R site P to the final destination D;
= number of P&R sites belonging to the catchment area of the origin/starting traffic zone O;
GO = generated trips from the origin/starting traffic zone O
tOD = travel time by private transport from the origin/starting traffic zone O to the final destination D;
AD = attractiveness of the destination zone D;
= occupancy of the P&R site P.
In the case of the disaggregated approach, the output of the model is the choice for each user between Park and Ride or car, while the explanatory variables contain both aggregated and disaggregated information. Regarding the aggregated ones, we refer to the zonal variables described for the aggregated approach since these are a function of the starting zone of the trip. The only variable not adopted from the aggregated set is the dummy variable x10, since the disaggregated model has been calibrated considering only those trips related to the morning peak hours. Regarding the disaggregated ones, six new variables have been added to describe the behavior of each individual user, specifically:
the traveled distance by Park and Ride x11;
the total P&R travel time x12;
the number of transfers by public transport x13;
the average waiting time by public transport x14;
the traveled distance by car x15;
the travel time by car x16.
Traveled distances and travel times have been considered in the models as the difference between Park and Ride and car options.
Since each variable needs to be calculated regarding whether the choice of each user is car or Park and Ride, some of them are not directly available by FCD (e.g., for each FCD trip stopping in a Park and Ride site, the final destination is unknown); thus, some assumptions are required as follows:
Only the main destinations accounting for the 70% of the demand for each starting zone are considered if the destination is unknown;
The features related to destinations are weighted for the attractiveness AD;
For users using a car, the potential Park and Ride site is considered the one closest in time with respect to the starting point of the trip.
3.2.2. Calibration and Validation Methods
For the aggregated approach, two machine learning (ML) techniques have been adopted for calibrating
f in (1): random forest, RF, and artificial neural networks, ANN [
29]. The motivation behind the adoption of ML lies in the lack of an explicit functional form for
f.
RF is a supervised learning technique adopted for both classification and regression and was firstly proposed by Breiman [
30]. RF works by creating an ensemble of decision trees and randomly selecting a subset of features to grow each tree. Then, it provides the class mode (if classification is required) or the average prediction (if regression is required) of individual trees, finally generating the resulting model through a combination of several parallel decision trees. For the training and the test phase of each tree, the data are usually divided into two subsets. In transport engineering, advanced ML techniques such as RF and ANN are emerging in recent years due to the increasing availability of traffic data.
Both RF and ANN are mainly used as data-driven approaches for travel time prediction [
31,
32,
33]. Contributions in literature dealing with the adoption of RF for demand forecasting are harder to find. For example, in Ghasri et al. [
34], decision tree and RF are adopted to reproduce individual mobility patterns. Ashqar et al. [
35] used RF to predict the number of available bikes in bike-sharing stations. Liu et al. [
36] compared RF with well-known regression methods such as autoregressive integrated moving average (ARIMA) and support vector regression (SVR) for the demand estimation of four different on-demand shared transport services.
Regarding ANN, one of the first contributions dealing with transport demand modeling was by Hensher et al. [
37]: they compared ANN with logit models underlining a high potential of ANN to describe individual mode choice. The same has been reported in Cantarella et al. [
38]. Nijkamp et al. [
39] in 2004 adopted ANN for modal choice for freight. Gazder et al. [
40] proposed a logit-ANN ensemble for mode choice modeling based on survey data. In contrast, recently, authors in [
41] calibrated a convolutional neural network (CNN) for the mode choice reconstructing trajectories and related additional features through GPS, obtaining good results in terms of model accuracy.
In our study, the RF was built by varying the main model parameters: the number of trees and the maximum depth. Regarding ANN, a multi-layer feedforward neural network has been trained by different algorithms (Levenberg–Marquardt, Bayesian regularization), modifying the activation function between layers (rectified linear unit—ReLU—or logistic regression—Sigmoid), the number of hidden layers, and the number of neurons in each hidden layer. In both cases, the validation adopts out-of-sample data, unseen by the model during the training phase.
For the disaggregated approach, ANNs have been also considered. However, since the output of the approach is mainly a binary output (car or Park and Ride), a classification with respect to the most complicated regression is required in (1); both a Perceptron network as well as a multi-layer feedforward neural network have been trained. The Perceptron is basically simpler than the multi-layer because it does not adopt hidden layers. As done for the aggregated approach, the ANNs have been trained with different algorithms, activation functions, and numbers of neurons in the hidden layer (the last one only for the multi-layer). Due to the high number of instances in the disaggregated approach (one instance for each FCD trip) the solution algorithms are different with respect to the aggregated approach, i.e., stochastic gradient descent (SGD) and Adam.
Apart from ANNs, a classical behavioral approach based on random utility theory has also been applied for the disaggregated approach. A simple binomial logit has been considered due to the need to represent the binary choice between car or Park and Ride. The maximum likelihood method is adopted for calibration, while validation is conducted through statistical tests on the estimated coefficients [
42].
3.3. Energy-Oriented Land-Use Model
A methodology has been developed able to link a specific land use to its potential energy demand.
Firstly, the land use for each zone of the city is assessed through the adoption of mobility patterns from FCD. Similar to Caceres and Benitez [
43], we exploited a clustering approach (K-means clustering, [
44]). Specifically, FCD trips have been divided between weekend and weekdays; then, since the data contain information about start and stop points of the vehicle’s engine together with the corresponding timestamp [
45], these are converted into generated and attracted trips by each zone, and profiles during the daytime have been derived. The K-means algorithm works in order to clusterize profiles, finding the optimal number of clusters
k* able to minimize the distance between observed profiles in each cluster. Each
k ∈
k* corresponds to a land use and, after the clustering phase, it is possible to find different values of
k* for attracted/generated trips during weekends/weekdays. To finally assign a land use to each zone, the ones having the same land use with respect to both generated and attracted trips are assigned univocally, while the other ones are considered mixed land use with a prevailing function given by the higher impact of a land use with respect to the other ones.
The resulting land use of each zone has been validated, comparing it with aggregated available statistics such as population density or residential buildings with respect to the total buildings. Relating the land use to mobility patterns allows the improvement of traffic prediction models and redefining zoning regulations, something that simply cannot be done relating the land use to the usual existing statistics.
Finally, the energy demand can be computed. Specifically, for each zone
z with land use
k (
zk), for both home-based and not home-based trips,
i and time interval
h, the energy demand, are computed as follows:
where:
f = recharge frequency computed as a function of distance traveled and battery life before charging;
Tripsi,h = number of trips of type i arriving in k during time h;
ParkingTimei,h = parking time of trips of type i arriving in k during time h, where distribution of parking times can be exploited by FCD;
Power coli = Recharge power. It can be assumed equal to 3.7 kW for home-based trips and 7.4 kW for not-home-based trips (thus assuring, respectively, an average recharge time of about 8 and 4 h).
The methodology has been applied to several zones of the city of Rome characterized by different land use, showing the value of the energy demand and its distribution in time and giving fundamental information for energy providers and urban planners. Indeed, the methodology requires calibration with respect to the current data of mobility patterns, but it allows the simulation of how the demand for energy can change as a function of the land use (something that can be realized only in the medium to long term by urban planners).
4. Results
4.1. Multimodality: Results
Staring from the aggregated approach, the database counts about 1000 records and 10 independent variables. It has been divided according to a cross-validation method into two subsets: 70% used to train the model (training data set) and the remaining 30% (validation data set) to evaluate its predictive capability.
According to the computed performance indices, the comparison between the two different machine-learning techniques suggests that in the P&R demand model RF gives better results than ANN. The RF model succeeds in generalizing the relationships in the data with higher prediction accuracy (
Table 1): specifically, RF shows a mean squared error between the model and the data of 0.560 with a correlation index (R
2) of about 0.73; in contrast, ANN shows a mean squared error of 1.085 with a correlation index (R
2) of about 0.66.
Regarding the analyses conducted on the parameters of the machine learning techniques:
for RF, the analyses showed that the variability of the results decreases as the number of trees increases (
Figure 3a);
a clear difference exists between the RF with 10 depth levels and the others (
Figure 3b)
the optimal parameter values for the RF have been set equal to 1000 as number of trees and 20 as the maximum depth (
Figure 3);
about the different algorithms adopted to train the ANN, the Levenberg–Marquardt algorithm resulted to be the most performing one. Indeed, despite the mean squared error seems to decrease more with the Bayesian regularization (
Figure 4a,b), the error with respect to the only test set demonstrates the higher accuracy of the Levenberg–Marquardt (
Figure 4c);
the optimal parameter values for the ANN have been set as 2 hidden layers and 8 neurons for each hidden layer (
Figure 4c).
The final RF model has been tested to controlled changes of the independent variables. Indeed, the calibration of the model generated the feature’s importance of each independent variable: this allows establishing which variable has the higher predictive power in the model. The most important variable is the parking occupancy with a nonlinear trend in defining the Park and Ride demand: a given P&R share can correspond to both a low occupancy state (high capacity available, filling phase of the P&R site) and a high occupancy state (no more capacity, no more demand).
Regarding the features’ importance in the case of ANN, this can be computed by adopting the formula in Kayri [
46]. It is possible to observe (
Figure 5) a flattening of the weights, especially when adopting the best architecture with two hidden layers. It seems to demonstrate that ANN is not able in the aggregated approach to capture correctly the different hierarchy between the explanatory variables of the model.
Moving to the disaggregated approach, the database counts about 1.6 million records and 13 independent variables. As in the aggregated case, it has been divided according to a cross-validation method (70% training data set; 30% validation data set). The resulting binomial logit (
Table 2), calibrated through the adoption of BIOGEME [
42], assumes the following expression for the systematic utilities:
Thus, the choice of Park and Ride is a function of a subset of the explanatory variables, the parking occupancy (x
5), and the number of stations (x
9); instead, the choice of the car is mainly due to the difference in travel time between car and Park and Ride (x
4) and to the location of the starting zone of the trip (inside or outside the Main Ring Road). However, although this is the most reliable random utility model obtained, the resulting accuracy is very low. The model is able to reproduce only the car choices, while it is not able to reproduce any Park and Ride choices. This is probably due to the low number of trips by Park and Ride (about 16,000 with respect to the total sample of 1.6 million records). Additional behavioral models can be calibrated starting with the same data structure, for example, mixed logit able to consider for panel data (several trips related to the same vehicle as in the FCD sample) or a logit model with a penalized likelihood such as the weighted exogenous sample maximum likelihood [
47]. However, problems in the model applicability or in the reproduction of the sample can, respectively, occur.
In contrast, by adopting the ANNs, it is possible to observe a high accuracy in reproducing not only car choices but also Park and Ride. It happens specifically in the case of the adoption of the Adam algorithm on a multi-layer feedforward neural network. This is shown in
Figure 6 where the structure of the network can be formalized as
x-
i-
j-
y, with
x the number of explanatory variables,
y the output,
i and
j, respectively, the number of neurons in the first and in the second layer of ANN.
If
i–
j are equal to zero, i.e., no hidden layers, a Perceptron network is obtained (architecture 13-0-0-1 in
Figure 6), and the model is not able to reproduce Park and Ride but only car choices as obtained by the binary logit. The best architecture is the one with two hidden layers and 12 neurons for each layer (Architecture 13-12-12-1,
Figure 6).
Regarding the adopted activation function, while a Sigmoid is always adopted as an output of the architecture, we tested a Sigmoid between the input layer and the first hidden layer and a ReLu between the first hidden layer and the second hidden layer (
Figure 6).
Considering the objective of the multimodality, i.e., to build a suite of models that can be adopted to simulate multimodality for electric vehicles and that are mainly data-driven, it is possible to summarize the obtained results as follows:
4.2. Energy-Oriented Land-Use Model: Results
After training the model, we found four clusters based on trips on the weekdays (mainly residential, business/industrial, commercial, and mixed,
Figure 7) and only three clusters on the weekend (mainly residential, mixed, and nightlife/leisure,
Figure 8).
Joining the clusters for both weekdays and weekend, as well as for generated and attracted trips, the city of Rome was 70% mixed land use, while only 8.4% of zones were classified as strictly residential, 8.6% strictly nightlife and leisure, 8.4% strictly commercial, and 2.1% strictly business/industrial. Inside the mixed land use, there is a prevailing residential function in 41% of the cases.
Results were validated comparing the resulting land use of each zone with census data:
strictly residential zones correspond to intermediate/high values of population density (about 3500–9000 inhabitants per km2) and residential buildings are more than 80% of the total buildings;
strictly business/industrial zones have a low population density (mostly below 450 inhabitants per km2), and residential buildings are less than 70% of the total buildings;
strictly commercial zones showed average population density while quite high values of residential buildings: this is probably due to the typical configuration of shopping activities in Rome, where most of the shops are located on the ground floor of residential buildings facing the street.
Once the land use was defined, we moved to the computation of the energy demand for each zone of the city as described in
Section 3.3. Most of the input required in (7) can be derived directly from FCD; for example, the recharge frequency is a function of the distance traveled (on average, each FCD in Rome travels for about 26 km in a day) and the kilometers to be traveled within the battery life (which are assumed equal to 170 km in this study). Parking times have also been exploited by FCD; these are a function of the land use of the arrival zone and can be different depending on whether the trip is home-based or not, as represented in
Figure 9.
Below, we reported the application of (7) for two specific zones of the city characterized by a different land use: for the considered residential zone, a request of about 60 kWh in the evening was estimated, while in a typical working zone, a peak value of about 120 kWh in the morning time was estimated (
Figure 10).
It is quite easy to convert these energy values into the entire energy demand if the penetration rate of FCD is known. However, we remind that the penetration rate of FCD in each zone could be different with respect to the average penetration rate, thus to obtain a refined estimation of the energy demand, it is suggested to compute a penetration rate for each zone. This can be done by comparing FCD flows with respect to traffic counts for different count sections located in the city.
5. Discussion and Policy Implications
The main research question explored in this work concerns how the future spread of EVs in urban areas can be tackled both from a mobility planning and an energy management perspective. Specifically, the proposed modeling framework is aimed to assess: (1) short-term measures driven by the characterization of the multimodal interchange between public transport and electric private mobility; (2) medium- to long-term measures based on the evaluation of the EVs’ energy demand as a function of the land use.
The models developed here are currently under testing inside an agent-based simulator for electric urban mobility adopted by the National Agency for Energy and Environment in Italy (ENEA).
Regarding multimodality, the Park and Ride module generates the changes in the Park and Ride demand in the different zones of the city as a function of possible scenarios. Two scenarios are now implemented in the simulator: (1) a scenario promoting the efficacy and effectiveness of the public transport network (thus a pull policy to increase the potential shift from EVs to public transport); (2) a scenario introducing congestion charging schemes to limiting the circulation of EVs in central areas of the city (thus a push policy to increase the adoption of public transport through the Park and Ride sites). The change in Park and Ride demand is followed by a redistribution of the final destinations and a definition of the new attraction points (EVs stopping in the Park and Ride sites). The agent model receives the new distribution of the attraction points of EVs and simulates the resulting mobility patterns and energy requests. Thus, integrating our models into a standard agent-based simulator can potentially help local governments and energy providers to form a correct assessment of measures for promoting multimodality and consequently for redistributing the charging load by EVs.
At the current stage, first results demonstrated the dependency of the potential modal shift to the Park and Ride sites by the residual capacity of the public transport system, together with a higher effectiveness of a pull policy (scenario 1) compared to a push policy (scenario 2).
Additional work is required to introduce in the multimodal model a dependency between the Park and Ride demand and possible incentives for promoting vehicle-to-grid schemes in the parking sites (e.g., through a discount on public transport subscription or on the electricity bill). However, the model structure can be easily extended to include this policy [
28].
Regarding energy-oriented land use, it works by firstly relating the land use to mobility patterns, allowing improving traffic prediction and redefining zoning regulations, something that simply cannot be done by relating the land use to the usual existing statistics. Moreover, the main result of the proposed approach is in its capability to link the energy demand to the land use, thus providing information on how the energy demand is distributed in space and time for the different land use zones.
From a policy perspective, this information can be adopted by energy providers for energy demand forecasts or for the optimal location of charging stations. Moreover, it can be adopted by urban planners in order to understand the impact of the land use on energy demand, thus working on a more sustainable and energy-oriented plan for the city. Moving urban planning toward a more energy-saving approach is closely related to a smart-city viewpoint and to the objectives of the European Green Deal [
48].
6. Conclusions and Further Developments
Electric mobility is the “closest” future of transport systems. Especially in urban areas, a strong increase of EVs is expected for private mobility and public transport, as well as for the other mobility solutions (e.g., mobility as a service solution). However, local governments and urban/transport planners require tools able to assess the sustainable development of electric mobility, especially avoiding energy congestion phenomena.
Promoting multimodal electric mobility and sustainable energy-oriented land use are two of the key challenges in energy congestion reduction towards a greater sustainability of urban context.
In this study, we developed a multimodal model that considers the change between EVs and public transport in Park and Ride sites. Moreover, a methodology to assess the land use of the city and to correlate the land use to the energy demand for mobility is proposed. Both have been modeled using data-driven approaches based on extended collections of floating car data (FCD) in the city of Rome (Italy). The final aim is to create a modeling framework able to assess short- and medium- to long-term measures for a sustainable development of e-mobility in urban areas.
Specifically, we focused on the development of Park and Ride models based on machine learning techniques and random utility theory, working with two different approaches. In the aggregated approach, where the output is the P&R share for each traffic zone of the city, results suggest that random forest outperforms ANN in reproducing the P&R share. In the disaggregated approach, ANN shows high accuracy in modeling the behavior of each vehicle, and it is able to perfectly reproduce the choice between Park and Ride and a car. In contrast, the behavioral model based on random utility theory is not satisfactory due to the oversampling of cars with respect to the Park and Ride choices. There are still several research perspectives for the problem, for example, expanding data sources for calibration and validation purposes (e.g., mobile phone data) or testing additional sophisticated machine learning techniques.
Regarding the procedure to compute the energy demand as a function of the land use, firstly the land use for each zone of the city has been defined, adopting FCD trip profiles both for home-based and not home-based trips. The results reveal four clusters on the weekdays (mainly residential, business/industrial, commercial, and mixed), and three clusters on the weekend (mainly residential, mixed, and nightlife/leisure). Specifically, joining the clusters for both weekdays and the weekend, as well as for generated and attracted trips, the city of Rome had 70% mixed land use, with 8.4% of zones strictly residential, 8.6% strictly nightlife and leisure, 8.4% strictly commercial, and 2.1% strictly business/industrial. Finally, we provided first estimates of energy demand for each zone of the city. The energy demand estimates could be improved in further developments of the study using the history of each trip in the process. This can potentially provide more refined values for the energy computation.
All models and methods here proposed are currently being implemented in an agent-based simulator for e-mobility to be adopted to guide sustainable urban planning considering both the demand for mobility and the demand for energy.