1. Introduction
Bicycle-sharing systems (BSSs) offer an eco-friendly and accessible mode of transport, contributing to a decrease in greenhouse gas emissions, reduction in energy consumption, and enhancement of societal benefits in urban settings. BSSs improve the linkages between public transport facilities by providing an effective means of covering the initial and final segments of urban journeys, appealing to both residents and tourists. BSSs are divided into two main operational types: dock-based (DBSSs) and free-floating (FBSSs). DBSSs operate with designated stations to pick up and return bicycles, whereas FBSSs offer flexibility for users to access and leave bikes at undetermined locations, removing the need for fixed docking points (bike stations).
The singular directionality of BSS rentals necessitates a redistribution system, termed rebalancing (DeMaio [
1]), to ensure the effective distribution of bicycles. Rebalancing strategies for DBSSs rely on predictive models to estimate the future demand at each station, whereas FBSSs’ rebalancing may operate based on regional restrictions. For instance, Seoul might see FBSS approaches leaning towards the employment of electric bikes or scooters in specific zones, reducing the need for rebalancing efforts. In this regional restriction setting, if someone attempts to move outside the area, the vehicle sounds an alarm and slows down. This study aims to craft vehicle routing algorithms for multiple management trucks to support the collection and redistribution of bicycles in line with predictive demand models for DBSSs. Leveraging precise demand forecasts, the process of reallocating bikes involves logistical intricacies, where vehicles transport bikes from saturated stations to those with a deficiency, ensuring route cost effectiveness all day, thereby seamlessly integrating inventory management with vehicle routing optimization principles.
For a precise and actionable forecasting method regarding demand distribution within DBSSs, we employ DeepAR (Salinas et al. [
2] and Lim et al. [
3]), which is based on autoregressive recurrent networks. This method enhances the prediction accuracy of future bike-rental demand by analyzing historical usage patterns and integrating the inherent variability and uncertainties in user behavior. Following the establishment of forecasted demand distributions, we determine the ideal target bike inventory levels using a periodic inventory review model. This systematic approach regularly evaluates inventory requirements based on the anticipated demand, ensuring that stations are adequately stocked to meet user demand without oversupply.
To operationalize the set of target inventory levels, we devise a multi-objective optimization framework utilizing a mixed-integer programming (MIP) technique. This framework is designed to achieve the multi-objectives of minimizing the travel distance for bike redistribution vehicles and reducing the deviation between the targeted and actual final inventory levels. By identifying the most efficient redistribution routes, this approach effectively tackles the logistical hurdles of matching inventory with forecasted demand, thereby achieving an equilibrium between resource distribution efficiency and inventory precision.
It is worth noting that the multi-objective optimization approach based on MIP, as commonly implemented in commercial solvers such as CPLEX, may not always guarantee the identification of the optimal solution for every problem instance owing to the underlying algorithms. To address this issue, we propose a hybrid genetic algorithm (HGA) that combines the adaptive search efficiency of genetic algorithm with the precision of MIP. This approach leverages the exploratory capabilities of the GA while using MIP for solution refinement and alignment with the optimization criteria. Our hybrid strategy aims to enhance solution optimality by generating a superior Pareto frontier when tackling complex optimization tasks in the bike-rebalancing problem of BSSs. By exploring a diverse set of potential solutions and identifying nondominated solutions along the Pareto front, this methodology empowers decision makers to better understand the trade-offs between objectives and select solutions that align with their preferences and priorities in BSS rebalancing. Through the hybridization of GA and MIP, we strive to advance multi-objective optimization in BSSs and facilitate more informed decision-making processes.
This paper is organized into three main sections.
Section 2 begins with a literature review and provides the research foundation. This is followed by a detailed discussion of the methodologies employed, including DeepAR models, target inventory determination, multi-objective optimization with MIP, and the HGA in
Section 3. In
Section 4, computational experiments are described, and the results are presented to evaluate the methods proposed in
Section 3. Finally, we summarize the contributions of our research and mention the directions of future research in
Section 5.
2. Literature Review
In this section, we review the relevant literature on inventory rebalancing for BSS. We first summarize studies related to probabilistic forecasting techniques and stochastic inventory management aimed at achieving the optimal service level. Then, we discuss the literature on both the Pick-up and Delivery Vehicle Routing Problem (PDVRP) and multi-objective optimization for BSSs, including heuristic approaches.
2.1. Demand Forecasting
Probabilistic forecasting is a crucial tool for making informed decisions in the face of demand uncertainty. It encompasses two main forecasting modeling approaches: parametric and nonparametric. Parametric models specify a probability distribution and estimate its parameters, whereas nonparametric models eschew distributional assumptions and instead build empirical distributions, often focusing on quantiles.
Snyder et al. [
4] develop a comprehensive framework for predicting the distributions and performance evaluation metrics by employing Poisson and negative binomial distributions for inventory management. Similarly, Salinas et al. [
2] concentrate on parametric methods utilizing Gaussian and negative binomial distributions to handle multi-series customer demand. Toubeau et al. [
5] propose an integrated approach that combines parametric and nonparametric models for probabilistic forecasting within electric power markets.
Recent advancements in forecasting methodologies have seen a growing emphasis on nonparametric probabilistic techniques. Notably, the works of Chapados et al. [
6], Bengio et al. [
7], Seeger et al. [
8], Wen et al. [
9], and Rangapuram et al. [
10] have underscored the robustness of these approaches, particularly in their ability to handle data that do not conform to predefined distributions. This flexibility renders nonparametric methods especially valuable in real-world forecasting scenarios. However, when data exhibit clear adherence to specific distributions or when strong theoretical foundations support particular distributional assumptions, parametric models can offer more nuanced insights into future uncertainties, thereby enhancing decision-making processes in areas such as inventory management.
In the context of bike-sharing systems, while demand forecasting has been extensively studied, there remains a notable gap in research focusing on probabilistic forecasting methods. Li et al. [
11] introduce a multi-categorical probabilistic approach for short-term demand prediction using Gaussian Mixture Models and Markov chain models, discretizing demand into categorical states. In contrast, our research employs continuous probabilistic forecasting with DeepAR, aiming to capture longer-term demand uncertainty across multiple stations for inventory optimization. Gast et al. [
12] proposed a queuing theory-based probabilistic forecasting approach and introduced new scoring rules for evaluating predictions. While their work centered on station-level forecasts, our study adopts a system-wide approach to simultaneously model demand patterns across multiple stations. More recently, Gammelli et al. [
13] developed a variational Poisson recurrent neural network for demand forecasting and inventory management, highlighting discrepancies between predictive and prescriptive metrics. Lim et al. [
3] utilized the DeepAR approach for probabilistic demand forecasting in bike-sharing systems, aiming to improve inventory rebalancing by accurately predicting station-level bike demand. Building upon these contributions, our research leverages DeepAR for system-wide probabilistic forecasting, specifically tailored to optimize rebalancing strategies across multiple stations, thus addressing the need for a comprehensive approach in bike-sharing demand prediction and inventory management.
2.2. Optimal Inventory Level
Inventory management under uncertain demand has been a focal point of extensive research, with particular emphasis on determining optimal service levels for periodic review policies. Zheng and Chen [
14] provided a comprehensive analysis of
policies, demonstrating their advantages over traditional
policies and establishing the convexity of average cost with respect to the reorder level. Building on this, Rao [
15] explored the relationships between periodic
and continuous
review policies, highlighting the resilience of optimal costs in
models to variations in replenishment intervals.
In the context of bike-sharing systems, inventory management takes on unique challenges due to the dynamic nature of demand and the spatial distribution of stations. Raviv and Kolka [
16] introduced a User Dissatisfaction Function for single-station optimization, while Liu et al. [
17] extended this concept by combining
policies with network flow matching for a more holistic approach. Fu et al. developed [
18] a robust optimization model for bike allocation in new markets. Dynamic inventory management in bike-sharing systems was addressed by Swaszek et al. [
19], who applied Receding Horizon Control, and Datner et al. [
20], who used guided local search to set optimal inventory levels while considering station interactions. These approaches represent significant steps towards more adaptive and system-wide inventory management strategies in the complex environment of bike-sharing networks. Our study advances bike-sharing inventory management by using DeepAR forecasts to optimize inventory levels across multiple stations and time periods, integrating demand prediction with rebalancing decisions.
2.3. Pick-Up and Delivery Vehicle Routing Problem (PDVRP) and Multi-Objective Optimization
To meet the unbalanced target inventory level for each station, a set of capacitated vehicles travels to redistribute bikes while minimizing the total cost. This problem is classified as a one-commodity PDVRP that determines the optimal delivery or routes from one or several depots to a certain number of customers, subject to additional constraints. In the field of logistics and transportation networks, the PDVRP is a critical problem that has been widely studied in the literature; see [
21,
22] for a detailed review.
Using existing routing models in the literature, Raviv et al. [
23] presented two MIP formulations and developed several valid inequalities to strengthen these formulations. A numerical study demonstrated that their formulations are effective in large instances with two vehicles and about 100 stations. Schuijbroek et al. [
24] determined the service-level requirements at each station and obtained optimal vehicle routes for inventory rebalancing. Since finding an exact optimal solution is practically intractable, they provided an efficient heuristic algorithm for the clustered problem.
Most of the models for PDVRP in BSS are aimed at the optimization of a single objective, such as minimizing the total costs or the maximum distance. However, multi-objectives must be also considered in public sectors such as BSS. Vishkaei et al. [
25] developed a bi-objective optimization solution for public BSSs to minimize the mean number of rejected requests for renting bikes and free docks for returning bikes per satisfied demand. Cruz et al. [
26] dealt with a particular case of the static bike rebalancing problem (SBRP), in which only a single vehicle is available. Their objective was to find the lowest-cost route that meets the demand of all stations and does not violate the minimum (zero) and maximum (vehicle capacity) load limits along the tour.
There is a substantial amount of literature on the PDVRP of the bike rebalancing problem using meta-heuristic algorithms. Given the large number of bicycle stations involved, determining optimal travel paths for multiple management vehicles with an exact solution approach is not trivial. Additionally, the multi-objective nature of the problem has led to extensive use of meta-heuristics in the literature.
Florian et al. [
27] presented a method for rebalancing the loads of bike-sharing system (BSS) stations under conditions of dynamic behavior of the bike-rebalancing problem. Their study developed a two-step method to perform effective rebalancing operations in bike sharing. The core elements of their method are a fuzzy-logic-controlled genetic algorithm (GA) for bike station prioritization and an inference mechanism aimed at assigning stations and trucks. Their model aims to maximize efficiency, which is a type of productivity measured by the number of bikes transported per unit of travel distance. Hu et al. [
28] proposed a decomposition-based multi-objective evolutionary algorithm to address the bike-rebalancing problem. Their model seeks to minimize truck operation costs, which include travel-distance costs and other operational expenses, and to maximize overall satisfaction profit, a complex metric based on the necessary demand estimated from historical and predicted pick-up and return data.
In another domain of PDVRP using meta-heuristics, Dutta et al. [
29] applied a multi-objective genetic algorithm to address the open green vehicle routing problem. This problem involves delivering services or products from a single depot to customers across various geographical locations, aiming to minimize both travel distances and carbon emissions. The authors first used a k-means algorithm to cluster customer regions, which is similar to the prioritization of bike stations in the aforementioned studies. Subsequently, they employed two multi-objective genetic algorithms (Strength Pareto Evolutionary Algorithm [
30] and nondominated sorting genetic algorithm [
31]) to optimize paths for each cluster. Their objectives were derived directly from the paths. In contrast, the problem considered in this paper involves various possible combinations of bike allocations along a given path, in addition to the distance derived from the path. Hence, this paper introduces a hybrid genetic algorithm to effectively tackle these complexities.
Despite the extensive literature on PDVRP in BSS and similar fields, there are still some areas that need further exploration. One challenge is the prioritized selection of bike stations to visit. It is reasonable to reduce the number of depots to visit, and this process is usually performed by prioritizing certain depots based on specific criteria. However, in a rebalancing problem setting, it may be more natural to pick up and deliver bikes along travel paths based on demand forecasting. Another deficiency is the choice of objectives. Minimizing travel distance is very common in the literature, and another frequently adopted objective is maximizing customer satisfaction. Usually, satisfaction is a resultant metric, which is not only difficult to define but also not easily estimated during the planning phase. Instead, it might be possible to set an optimal service level and, along with the delivery plan, use the gap between the target inventory and the actual inventory along the pick-up and delivery paths as a metric to evaluate the plan. Our approach aims to address these gaps to provide efficient and comprehensible bike-rebalancing operations.
3. Model and Methodologies
To address the bike-rebalancing problem, this study proposes a comprehensive approach that integrates demand forecasting, inventory management, and vehicle routing optimization. The effectiveness of a multi-objective MIP model for the PDVRP in bike rebalancing hinges on accurate target inventory levels, which in turn depend on reliable demand distribution predictions. To achieve this, we employ a sophisticated methodology that encompasses several key components.
This study’s methodology is structured in two principal phases. The first phase involves implementing a probabilistic demand-forecasting model based on DeepAR, employing an autoregressive recurrent neural network. This model examines intricate patterns within historical bike-sharing data to project future demand distributions. We then apply inventory management techniques to determine optimal inventory levels that sustain desired service standards, thereby establishing a solid foundation for subsequent optimization.
The second phase integrates these preparatory elements into a multi-objective framework for the PDVRP, tailored specifically to bike-rebalancing scenarios. To efficiently explore Pareto-optimal solutions in this complex optimization context, we propose a hybrid genetic algorithm (HGA) as an enhanced alternative to a conventional multi-objective MIP model. This HGA synergistically combines Genetic Algorithm (GA) principles with MIP solving techniques. Our integrated approach aims to deliver a robust and practical solution to the multi-faceted challenges inherent in bike-sharing system rebalancing.
The objectives of the PDVRP in this paper are to minimize travel distances and balance bike inventory levels across all bike station clusters. Inventory balance is measured by the sum of net deviations between the target and actual bike inventory after rebalancing. These objectives cannot be directly derived from a single solution consisting of vehicle travel sequences. While travel distance can be directly obtained, net deviations require solving an additional optimization problem. Achieving an optimal allocation plan without considering travel paths is ineffective in this problem structure. This structure does not allow the use of straightforward approaches such as simple genetic algorithms. Other complex methodologies, such as scalarization-based multi-objective optimization approaches like MOEA/D [
32] and collaborative neurodynamic approaches [
33], may not be appropriate for our problem. Furthermore, to the best of our knowledge, there is no standard exact solution method for this type of multi-objective problem.
3.1. Preparation Phase
3.1.1. DeepAR for Demand Forecasting
This study adopts the autoregressive recurrent network framework proposed by Lim et al. [
3], which draws inspiration from the DeepAR model. Their innovative approach to predicting demand distribution in bike-sharing systems builds upon the work of Salinas et al. [
2], utilizing an LSTM-based encoder–decoder structure. This framework is designed to capture the conditional distribution of future time series data based on historical patterns.
The model’s architecture formulates distribution as a product of likelihood values, with outputs generated through LSTM cells. This design effectively integrates past observations with previous function outputs, allowing the simultaneous addressing of conditioning and prediction challenges within a sequence-to-sequence paradigm. To enhance versatility, the framework accommodates both normal and Poisson distributions, deriving parameters through carefully selected functions: affine and softplus for the normal distribution, and a softplus activation function for the Poisson distribution to ensure positive mean values.
The training process optimizes the log-likelihood of time series data, incorporating data standardization to improve computational efficiency. This comprehensive approach enables the accurate estimation of demand distribution quantiles for future periods, providing a robust foundation for forecasting in complex bike-sharing scenarios. Lim et al. [
3] demonstrated the efficacy of this sophisticated method through comparative analysis with traditional non-neural network techniques, establishing its potential as an advanced tool for demand forecasting in bike-sharing systems. Given its proven effectiveness, we have chosen to implement this framework in our study to address the unique challenges of bike-sharing demand prediction.
3.1.2. Determination of Optimal Inventory Level
To manage inventory under uncertain demand, two primary replenishment strategies are used: the continuous review model, which restocks a fixed amount upon reaching a specific reorder point, and the periodic review model, which evaluates and adjusts inventory to a target level at regular intervals. Both strategies aim to maintain a desired service level (
), which is defined as the probability of fulfilling the demand with on-hand inventory. In the continuous review model, the reorder point
R is determined by the inverse cumulative demand distribution over lead time
, where
is the inverse cumulative demand distribution function during lead time,
L. For periodic review, the target inventory level
M is adjusted for demand variability over the review period plus lead time
, where
is the inverse cumulative demand distribution function during the review period and lead time
[
34].
In the periodic review model, inventory is assessed daily and adjusted to meet the demand based on daily patterns. The service level, which is critical in preventing stockouts or excess inventory, is the probability
that balances the cost of unsatisfied demand (
) with the cost of surplus inventory (
), where
and
I denote the demand and current inventory level, respectively. The optimal inventory level
minimizes the expected mismatch cost
, where
w represents the inventory level, defined in Equation (
1), as follows:
The optimal inventory level
indicates a unique optimal point, owing to
’s convexity [
35]. This framework establishes that the optimal service level aligns with the cost ratio
in Equation (
2), where
ensures cost-effective inventory management without explicit ordering costs.
To implement this model effectively, accurate demand forecasting is crucial. The inverse cumulative demand distribution function
for future periods is estimated using the DeepAR model as detailed in
Section 3.1.1. This estimated distribution is then utilized to determine the target inventory level through Equation (
2), ensuring that the inventory management system adapts to forecasted demand patterns while maintaining the desired service level.
3.2. Rebalancing Phase
3.2.1. Multi-Objective MIP Model for PDVRP of Bike Rebalancing
In this section, we develop a multi-objective MIP model for the PDVRP. The PDVRP is an extended model in which pick-ups and deliveries are performed while management vehicles move. First, we define the parameters and decision variables in
Table 1 as follows.
In this study, we simultaneously consider two objectives. The first is to minimize the sum of the differences between target inventory
and final inventory
after the rebalancing work for all stations
as shown in Equation (
3):
The other objective is to minimize the maximum traveling distance among the travel routes of the management vehicles as shown in Equation (
4):
Then, the MIP model for the multi-objective PDVRP is formulated as follows:
Constraints (6)–(8) ensure that all stations
must be visited exactly by a single management vehicle. Constraints (9)–(16) set up a path that begins at dummy node 0, travels to all nodes, and returns to dummy node
. Constraints (17) and (18) eliminate sub-tours. Constraints (19) and (20) limit the number of bikes transferred by the capacity of the vehicle, denoted as
C, when a path is connected to two stations between
i and
j. In Constraint (21), we assume that the truck departs fully loaded when it departs from the depot. Constraint (22) computes the number of bikes picked up from station
by vehicle
: this is equal to the number of bikes delivered to station
minus the number of bikes picked up from station
. Constraint (23) computes the final inventory level by adding the sum of the bikes picked up by vehicle
to the initial inventory
. Constraint (24) calculates the absolute deviation between the target inventory
and final inventory level
at each station
. Note that either variable
or
must be zero since our objective is to minimize the sum of inventory deviations as shown in Equation (
3) above. Finally, Constraint (25) computes the maximum distance of the travel routes of the management vehicles.
3.2.2. Hybrid Genetic Algorithm(HGA)
An HGA is proposed to address the PDVRP in this study, which combines GA and MIP techniques. The overall algorithm is illustrated in
Figure 1. The GA generates and updates the feasible paths (chromosomes) for the management vehicle(s). Then, the MIP for bike allocation, presented below, determines the optimal allocation of bikes to the clustered stations based on the optimal target inventory level obtained from Equation (
2). In this phase, the optimal solutions for allocation are saved in memory to retrieve the resultant value in the future when the same path pops up in the course of the evolutionary process. After obtaining the optimal solutions, the deviation in the net balance for each path is calculated. The path length is directly derived from the chromosome. These values collectively constitute a fitness function used to determine an efficient frontier for each generation. A superior frontier is selected through evolutionary iterations. Moreover, the surviving chromosomes comprising the final frontier constitute the solution to the problem when the termination conditions are satisfied.
Let
be the
i-th station listed in the route,
be the quantity delivered to station
, and
be the onboard inventory carried by the management vehicle at station
. The MIP model for the bike allocation problem is formulated as follows:
The objective function in Equation (
26) quantifies the overall discrepancy between the target and deployed inventory levels across all the rental stations. This model determines the number of bikes to be picked up or delivered to each bike station (denoted by
) along a given path. Despite having an optimal target inventory for each station, the capacity of each vehicle (denoted as
C) and the predetermined travel sequence prevent the fulfillment of the target inventory. Constraint (27) ensures that the vehicle starts each trip fully loaded with
C bicycles, where
denotes the number loaded at the dummy station. Constraint (28) prevents any deliveries to the dummy station. Inventory changes at each station are governed by Constraint (29), which calculates the updated stock levels based on delivery quantities. Constraint (30) determines the number of bikes loaded onto the vehicle upon departure from station
i, whereas Constraint (31) enforces the vehicle’s capacity limit for
C bikes to prevent overloading. Constraint (32) is equivalent to Equation (24) in MIP. To uphold mathematical validity, Constraint (33) ensures that all variables used in the deviation calculations remain non-negative.
Among the various multi-objective genetic algorithms in the literature, the nondominated sorting genetic algorithm (NSGA-II; [
31]) is applied to the problem in this study. NSGA-II is fast and suitable for solving bi-objective problems [
31,
36]. We only explain NSGA-II briefly here; for more details, please refer to [
31,
37]. It first groups the population into sets of nondominated members by comparing the chromosomes, starting from the top level. The population for the next generation is then selected from the nondominated groups, progressing from top to bottom. When only a portion of a group must be selected owing to population size limitations, chromosomes that are not located in crowded regions in the solution space are prioritized for selection. Deb et al. [
31] call this process crowd-distance sorting. The rationale behind using crowd-distance sorting is to give preference to dissimilar chromosomes (those outside crowded areas) to enhance the diversity in the population pool, thereby improving the search quality. This procedure is continued until the termination condition is satisfied.
The GA generates multiple separate travel routes with one chromosome. We design and explain the chromosome with two routes, and it can be generalized to include more than two routes.
Figure 2 shows this chromosome as an example. The first gene is called ‘division point,’ and it represents the length of the first vehicle route, including its departure point. In
Figure 2, this point is indicated as 13, which implies that the first tour is composed of 13 nodes (including the departure point ‘0’). Because the second tour begins right after the end of the first route in the same chromosome, the departure point for the second tour is denoted as ‘41’ in
Figure 2, which corresponds to the same point as ‘0’ in the first route. Even though the two departure points are shown differently (0 and 41), the number ‘41’ is represented as a dummy, and this one corresponds to the same location in our model. That means that there is only one management warehouse in a district for the maintenance and storage of extra bikes.
The genetic operators used to generate the next generation are as follows: Two child chromosomes are produced from the two selected parent chromosomes.
Figure 3 illustrates the crossover process. Each parent chromosome has four randomly selected crossover points with two points for each path. ‘Child 1’ inherits the division point from ‘parent A’, while ‘Child 2’ takes the division point from ‘parent B’. Since the first node in both routes of a chromosome represents a dummy node indicating the same departure point, they are fixed as ‘0’ and ‘16’ in the figure (excluding the departure point, there are only 15 nodes to visit in the figure). Nodes from parent A, spanning from ‘crossover point 1’ to ‘crossover point 2’ and from ‘crossover point 3’ to ‘crossover point 4’, are inherited by ‘child 1’. The remaining nodes from ‘parent A’ are inherited by ‘child 1’ in the same order as they appear in ‘parent B’. The identical procedures apply to ‘child 2’, with ‘parent A’ replaced by ‘parent B.’
The mutation process is illustrated in
Figure 4. First, the division point, indicated as ‘9’, and the first nodes (departure points) for both travel routes, indicated as ‘0’ and ‘16’, are removed. Second, two nodes are randomly selected and swapped. Finally, a division point is randomly chosen, guaranteeing that at least one third of the nodes are visited in one path, and the two departure points ‘0’ and ‘16’ are appropriately located in the chromosome. In the figure, nodes ‘10’ and ‘5’ are swapped, and node ‘10’ shifts to the next position because the corresponding location is for departure node ‘16’. During the evolutionary process, we set the crossover ratio to one-half, which implies that half of the offspring are generated through crossover operations, whereas the remaining half are produced by mutation operations.
4. Computational Experiments
The performance of our proposed HGA is evaluated by comparing it with a multi-objective optimization problem (MOOP) model. The evaluation is conducted using sample problem instances with variations in the number of nodes (places to visit for redistributing the bikes). After the comparison, the optimal solution for the real-world problem using HGA is provided.
4.1. Comparison between HGA and Multi-Objective Optimization Model
We consider 15 test problem instances with variations in the number of nodes. These fall into three categories: 10, 15, and 20 nodes. Five problem sets are generated randomly for each category. Each test set includes the coordinates of the nodes along with the target and current inventories for each node. The coordinates range from 0 to 100 for both X and Y, and the initial inventories are randomly selected between 0 and 20. The target inventories are set between −10 and 20, and we do not require the sum of target inventories for all stations to equal one, which represents a case of demand overcapacity. For these sample problem sets, HGA terminates its search after 1000 generations of evolution.
We observe that the MOOP provides only one solution to a multi-objective problem, whereas the HGA provides the Pareto frontier. Although a multi-objective approach is applied to the MOOP, it yields only one solution.
Figure 5 illustrates the results of the comparison between the HGA and MOOP across three randomly selected problem sets with different numbers of nodes. In
Figure 5, the HGA is represented by dots, and the MOOP is represented by crosses. As the figure shows, the MOOP produces only a single solution for each problem.
The HGA consistently outperforms the MOOP across all sample problem sets. For instance, in
Figure 5, analyses of the problem sets with 10, 15, and 20 nodes indicate that the MOOP solution falls short of the Pareto frontier achieved by the HGA. These comparisons are based on the HGA results, which may not fully converge because of the relatively short generation count of 1000.
Our numerical experiments are performed on a Mac M1, and the experimental results, including the total computation times, are summarized in
Table 2. In the HGA, five iterations are executed to obtain a unified frontier for each test problem. The final unified frontier is the selection of nondominated solutions from all iterations. Although the HGA finishes its search after exactly 1000 generations, the processing time fluctuates with each iteration. The processing times presented in
Table 2 represent the average across five iterations. Meanwhile, in the ‘Distance’ and ‘Net Dev.’ columns corresponding to the HGA solutions in the table, one representative solution that effectively demonstrates the dominance is selected from the frontier and displayed.
From the results in
Table 2, we note that the Pareto frontier from the HGA dominates the MOOP for all 15 sample cases. Despite the superiority of the HGA, its computation time is significantly longer than the MOOP, averaging approximately 50 min for 1000 generations, irrespective of the problem size. By contrast, the MOOP exhibits faster computation times, which increase with larger problem sizes.
4.2. Optimal Solutions from HGA for PDVRP of BSS
In this subsection, we present the optimal solutions obtained from the HGA for inventory-rebalancing problems in BSSs. Inventory rebalancing involves the PDVRP of bikes through vehicle management. Consequently, these solutions comprise traveling paths along with the respective quantities of bikes to be picked up or delivered.
We apply the HGA to address the PDVRP in the BSS of Mapo-gu district in Seoul, South Korea, which encompasses 126 bike stations. Users access real-time bike inventory information for all stations via a mobile application, allowing them to locate bikes within walking distance. This indicates that it is more valid to perform demand forecasting in cluster units that group stations within walking distance rather than on an individual bike station basis. Moreover, forecasting the demand for station clusters assists in mitigating the forecasting errors on a station basis.
The bike stations are clustered using a well-known machine learning technique, the K-means algorithm.
Figure 6 depicts 40 clusters, with stations of the same color indicating membership in the same cluster group. The maximum distance between the furthest stations within all clusters is approximately 800 m, which is within an acceptable walking range.
The essential goal of bike rebalancing is to enhance bike utilization by ensuring that an adequate number of bikes are available when required. This task resembles the news vendor problem, as it involves balancing the costs associated with underestimating (resulting in bike shortages) and overestimating (leaving bikes idle) demand. It is assumed that bike shortages lead users to opt for buses with comparable fares, thereby potentially increasing carbon emissions. Factoring in the carbon emission price as of January 2024, the daily cost of inventory shortage is calculated to be KRW 922.3761. Conversely, when bikes remain unused at stations, there is an inventory surplus cost of KRW 304.0613, which accounts for excess bike investment, attached smart devices, and insurance premiums. To optimize the service levels, Equation (
2) is used to determine a target service level of 0.752. This decision is informed by probabilistic demand forecasting, which allows setting the target inventory at the 75.2 percentile of demand calculated from the probability distribution for each time period within each cluster.
In our forecasting using the DeepAR methodology, we selected a family of normal distributions because it showed the least errors in predicting bike user demand for the station clusters. This suggests that the actual demand is highly likely derived from normal distributions, although the parameters of these distributions vary over time and between clusters (for more details, please refer to [
3]). For each station, the 75.2 percentile value of the corresponding normal distribution is set as the target inventory and the initial inventory is obtained from actual data gathered from the bike-sharing app.
The HGA is applied to solve the PDVRP, utilizing the target inventory for each station cluster as bikes within the 75.2 percentile of user demand. In the Mapo-gu district, where two management vehicles operate, two distinct travel routes are generated. These routes share the same origin and destination points, and the two points refer to a single management warehouse in the Mapo-gu district. The objectives are to minimize the longer travel distance between the two routes and to minimize the sum of the absolute net demand deviations. The absolute net demand deviation is calculated as the difference between the target inventory and the ending inventory after rebalancing by management vehicles.
The two management vehicles must visit a total of 40 points. Therefore, terminating the HGA after only 1000 generations, as applied in the sample problem sets, is too early for the solution to converge.
Figure 7 shows a preliminary run with 4000 generations. As this is a multi-objective problem, we define a weighted sum of the two objective values to represent each solution in the figure. The weighted sum is a simple method for balancing the scale between the travel distance and the sum of the absolute net deviations. Based on preliminary runs, we set the terminating condition of the HGA for this problem to 2000 generations. Although the solution continues to improve after 2000 generations, this number represents a reasonable compromise between solution quality and computation time.
The processing time for running 2000 generations of the HGA takes around 8518.66 s (approximately 2 h and 22 min) on a Mac M1, which is four times longer than the sample case with 1000 generations. The computational burden of this approach arises from finding the optimal bike allocation plan for each chromosome, which consists of two routing paths of different lengths. Although we maintain a memory to store the optimal allocation solutions for chromosomes that have appeared during the evolution process of the genetic algorithm, the large number of possible chromosomes in a large-scale problem increases the computational burden.
The Pareto frontiers from the HGA are displayed in
Figure 8. The traveling distance (maximum distance between the two travel paths) ranges from 13.4 km to 20.9 km, and the sum of absolute net deviations (SAND) ranges from 3391 to 3415 bikes. The SAND is the sum of 40 clusters encompassing 126 bike stations. Thus, for each bike station, the net deviation is approximately 27 bikes per bike station during three hours after rebalancing work with the management vehicle that carries up to 40 bikes. The solutions from (a) to (b) are acceptable in terms of the travel path. The paths of the solutions to the right of (b) are ineffective because there is too much overlap between the travel paths of the two vehicles.
Figure 9 shows the travel paths of two management vehicles for each selected result. The left figure (marked as (a)) shows the two paths with the shortest travel distance, whereas (b) represents the smallest SAND among the effective solutions from the frontiers. As the solution changes from (a) to (b), the travel distance increases 3.83 km (13.40 km to 17.23 km), while the SAND decreases by 18 bikes (3415ea to 3397ea).
The Pareto frontiers offer options from the decision-makers’ perspective. If the decision prioritizes minimizing travel distances, then the solution with the most realistic shortest distance is selected. However, if the decision must consider a certain threshold for the minimum number of bikes to be served, the solution can be determined by drawing a horizontal line on the frontier at an appropriate level and selecting the solution with the shortest distance from solutions below the horizontal line.
5. Conclusions
In this study, we devised a comprehensive methodology aimed at optimizing the travel paths of multiple management vehicles to address the bike rebalancing problem. Our approach encompasses three key components: probabilistic demand forecasting, determining optimal inventory level, and solving pick-up and delivery problems for multiple management vehicles with multi-objectives.
Recognizing that bike demand is interdependent between stations, owing to users’ ability to walk to nearby stations when their preferred station is empty, appropriate demand forecasting necessitates grouping bike stations based on geographical proximity. To this end, we employed a k-means algorithm to cluster bike stations within walking distance, and utilized the DeepAR approach for demand forecasting for each cluster. The strength of DeepAR lies in its capacity to provide a probabilistic distribution of demand, which is vital for determining the optimal service level and the associated target inventory using a news vendor model.
The bike-rebalancing problem essentially entails pick-up and delivery tasks for multiple management vehicles that transfer bikes between clusters with excess and deficit capacities. Given the operation of multiple vehicles, it is imperative to determine the optimal paths for these tasks. Two primary performance metrics guide the bike-rebalancing problem: minimizing travel distance and minimizing the gap between the target and final inventories after pick-up and delivery operations.
The multi-objective nature of our problem complicates the solution process and poses challenges in achieving optimal results. Initially, we modeled the problem using an integer programming approach with a multi-objective framework (referred to as MOOP—Multi-Objective Optimization Problem). However, the MOOP yields only a single solution and does not guarantee its optimality. To address this limitation, we developed a hybrid genetic algorithm (HGA) that mixes a genetic algorithm and a mixed integer programming model. The former tries to find the shortest path, and the latter tries to find the optimal allocation of bikes in the given paths.
The application of the proposed methodology to real bike-sharing service data from the Mapo-gu district of Seoul City, South Korea, yielded balanced travel routes for management vehicles, considering both the travel distance and the gap between target and final bike inventories post rebalancing. The frontiers provide flexibility in decision-making process. For example, if there is an upper limit to the distance or to the net balance deviations, it is possible to choose an appropriate set of travel routes which meets the condition.
This paper acknowledges certain limitations when applied to real-world scenarios. First, the central point of each station cluster, which is derived from the computed central coordinates of actual bike stations, has not been validated as a feasible location for bike drop-off and pick-up. Second, the travel paths between these cluster centers are calculated using simple Euclidean distances, without accounting for actual road conditions. Additionally, traffic conditions and alternative routes are not considered in the path calculations. Consequently, these limitations may hinder potential operational cost savings when implementing the proposed methodology. A practical implementation is necessary to identify and address any challenges that may arise in real-world applications.
Computational efficiency is another issue in the implementation of the algorithm in real-world scenarios. Due to the significant computational burden, the proposed algorithm may not be appealing to decision makers. Future research should prioritize enhancing the efficiency of the HGA, aiming to reduce computational overhead while maintaining the quality of the solutions. This endeavor might entail the development of more streamlined generation strategies or the fusion of the HGA and MOOP methodologies to capitalize on their respective strengths. In addition, investigating parallel computing methods or hardware acceleration can help alleviate the computational burden.
The practical application of our integrated methodology in addressing the bike-sharing problem underscores its potential for enhancing operational efficiency as demonstrated in real-world contexts such as the Mapo-gu bike-sharing systems. Nonetheless, overcoming computational hurdles within the HGA methodology and refining the entire methodological framework are imperative for its widespread adoption across diverse sectors.