1. Introduction
Portable computers, mobile phones, wireless, and mobile networks allow us to access the Internet from almost anywhere and maintain our connection dynamically even when we are on the move, ushering in a new economic era. People can now easily shop on traditional e-commerce websites, online marketplaces, and mobile market apps that provide fast delivery without leaving their homes all day. Furthermore, people in many OECD countries have reduced their physical presences and interactions due to the COVID-19 crisis. Self-imposed social distancing, combined with strict containment measures implemented in many OECD countries, has effectively halted much traditional brick-and-mortar retailing, at least temporarily, causing a major increase in online shopping and the use of e-commerce [
1,
2].
In the United States, between February and April 2020, retail and food service sales fell 7.7% compared with the same period in 2019. However, sales for grocery and non-store retailers increased
and
, respectively. In the EU-27.2 countries, mail-order or online retail sales increased by
in April 2020 compared with April 2019, while store retail sales decreased by
. As a result, shifts from physical retail to e-commerce will likely be significant around the globe. For example, in the United States, the share of e-commerce in total retail increased slowly from
in Q1 2018 to
in Q1 2020 but rather quickly to
in Q2 2020 [
3,
4]. Especially on specific shopping days such as Black Friday, Mother’s Day, or Christmas holidays, this demand has reached its maximum level, and the need for faster and more cost-effective package deliveries has arisen significantly [
5].
In Turkey, e-commerce sales reached USD 13.1 billion in the first half of 2020,
higher compared with its previous year. This major increase, which was at least partially due to the rise of the COVID-19 pandemic and the subsequent lockdowns, has significantly affected the already challenged package delivery industry [
6]. Consequently, transportation and logistics for e-commerce have been facing a grand challenge that can be turned into an opportunity, if managed well with intelligent supply chain management techniques [
7].
The operation of collecting orders from customers and delivering them on time involves many parameters and logistical decisions. Traditionally, e-commerce customers are served from inventories stored in or delivered to central locations (e.g., warehouses, distribution centers, and cross-docks) within the parameters requested by the customer (e.g., delivery time windows or same-day delivery) or the limitations of the logistics operations. The increased demand for e-commerce and e-grocery, for the reasons explained above, resulted in a similarly increased demand for a relatively new mode of delivery:
instant delivery. By instant delivery, we refer to customer requests that are typically served from local stores or supermarkets in a shorter-than-same-day time frame, such as being picked up within minutes of placing the order and delivered within the next hour or so. During the COVID-19 pandemic, social distancing rules, lockdowns, and the risk of disease transmission caused changes in consumer behavior, leading to the popularity of instant delivery, with high revenue generation opportunities for the e-commerce businesses. Under the pandemic conditions, instant delivery companies broke sales records as shops were closed as people stayed home. The number of instant delivery application users in Turkey reached almost 1.5 million deliveries in the second half of 2019 and continues to increase rapidly [
8].
Standard VRP problems are suitable for modeling traditional cargo operations. In these operations, fleet carriers take custody of all shipments from a single depot at the start of the delivery process and deliver them to different points during the day. There are no strict time windows during distribution. The other type of last-mile operation is on-demand instant delivery operations. These operations are based on picking up deliveries from different points during the day, depending on customer demands, and distributing them to other points within a short time window. In these operations, demands can also arrive dynamically during the day. These types of operations are categorized in the VRP literature as Dynamic Pick-up and Delivery with Time Window Vehicle Routing Problems (DPDTWVRPs).
In certain practical settings, an e-commerce business may handle both types of operations simultaneously (i.e., traditional cargo delivery and instant delivery) with two parallel fleets. For example, with the relatively recent acquisition of Whole Foods Market, Amazon can use its logistics experience and resources to facilitate the instant delivery of groceries along with its traditional e-commerce. At the time of writing this article, Amazon only offers the instant delivery of groceries to Amazon Prime members in select ZIP codes. In such cases, capacity sharing between the two fleets of e-commerce and instant delivery can provide an opportunity to expand the business overall, albeit with proper planning and scheduling.
This paper addresses the problem of capacity sharing between two types of fleets in the same business. By combining these operations, the number of required couriers and overall costs can be reduced. This in turn will increase the earnings of the non-salaried, crowd-sourced drivers that charge based on the distribution and collection process. In order to combine these two operations, a framework is needed to solve VRPs and DPDTWVRPs in terms of routing. In addition, the capacities of the fleets must be planned correctly for integration to work efficiently. To the best of our knowledge, there is no single solution in the literature to cover all aspects of such a problem. The proposed framework in this paper solves two different routing problems simultaneously and also employs the capacity planning aspect proposed in a previous work with the support of machine learning methods [
9].
More specifically, we offer a route optimization solution framework that aims to address this new trend in e-commerce logistics (i.e., instant delivery) in integration with conventional package delivery logistics. We propose an integrated route optimization framework that operates a combined fleet, sharing vehicle capacities between the instant delivery orders served from local grocery stores and the traditional time-windowed orders served from a central depot. To achieve desirable levels of cost efficiency and customer satisfaction, we use an array of mathematical models, including machine learning (ML) and simulation, to address not only the route optimization side but also the courier capacity planning aspect of the entire logistics operation. Specifically, we first use ML to predict the number of packages and allocate courier capacity to delivery areas. Then, we employ a route optimization approach to efficiently serve the complete set of delivery requests for the two markets combined. The main goal is to make efficient and optimized use of the extra idle times of the vehicles to serve a new market, as opposed to running two sets of disjointed operations for two separate fleets.
To show the effectiveness of our proposed framework and the business value of the integrated routing approach, we conduct two sets of computational experiments: testing our proposed algorithm on instances from the literature and testing it on real-world problem instances from HepsiJet in Turkey. HepsiJet is a rapidly growing, technology-driven e-commerce logistics company launched in May 2017 with a business plan that involves crowd-sourced couriers. It showed a growth trend by increasing the number of cross-docks along with the spread of e-commerce throughout Turkey. The size of its operations (measured by the number of delivered packages) rose by 91% from 2018 to 2019 and by 171% from 2019 to 2020. Fast delivery and timely arrival are essential in e-grocery shopping, as customers expect their groceries to arrive promptly and at a scheduled time. This helps them plan their days accordingly and guarantees that fresh and perishable items are delivered on time. In our case study, we demonstrate how the proposed approach can improve the current operations and how it can also validate the business case of logistically integrating the instant delivery market and generating business and consumer value.
The contributions of this study are threefold:
We present an overall framework along with its operational details for potential deployment of a capacity planning and routing optimization solution that seeks efficiency by integrating last-mile delivery operations. This framework can be used by logistics planners and optimization experts to achieve operational efficiencies above and beyond what they have already achieved.
As part of the presented framework, we propose an optimization solution that attempts to solve two traditionally disjointed but equally complex variants of the vehicle routing problem to achieve further operational efficiency using a shared fleet. To the best of our knowledge, this is the first attempt at solving this kind of a unified problem and validating its effectiveness in an industrial setting.
We provide and publicly share an authentic real-world data set for future researchers to develop and apply new algorithms, aimed at solving the complex unified routing optimization problem we have tackled in our study. This data set comes with features reflecting the detailed real-life dimensions of rich vehicle routing problems typically encountered in real-world settings. We report the solutions we have found for benchmarking purposes as well.
In what follows, we present a review of the relevant literature, followed by the mathematical definition of the problem at hand, our proposed framework and optimization methodology, as well as our computational results and suggestions for future directions for research. In the context of our study, the symmetry concept is essential as it relates to the equitable allocation of resources between the two distinct types of delivery operations. Symmetry ensures that both instant and traditional delivery services are optimally integrated, allowing for balanced use of fleet capacities and operational efforts.
3. Problem Definition
HepsiJet provides a crowd-sourced delivery service with over 2000 couriers working at more than 150 cross-docks. The delivery packages that reach the cross-docks in the last-mile process of the delivery operations are appropriated by the couriers and delivered to the recipient addresses. This last-mile operation can be described as a traditional static multi-vehicle single-depot vehicle routing problem. Within the scope of this paper, we will refer to these deliveries as standard deliveries. They include products which are already stocked in the satellite or regional warehouses of business partners and are suitable for fast delivery in a short period of time after an order is placed, along with standard cargo services.
The last-mile life cycle of standard deliveries begins when daily shipments arrive at the relevant cross-dock in ring vehicles. Packages arriving at the cross-docks are appropriated in the morning by crowd-sourced couriers, who then proceed with the deliveries and complete the life cycle when all packages are delivered. On the other hand, instant deliveries with hard time windows have a different process flow. This flow starts when the order reaches the relevant company cross-dock. Then, the order is assigned to a courier in the cross-dock either manually or automatically. The assigned courier must pick up the package at the pick-up point and deliver it to the recipient within the required time window.
In this paper, we address the combined problem of satisfying both workflows with a single set of couriers operating at the cross-dock level. In this setting, couriers who deliver standard packages may also be tasked with delivering instant delivery orders, and vice versa. The goal is to make the optimum use of any idle capacity the two separate fleets of couriers would have if they were to run their delivery operations independently. Within this context, and with the help of machine learning (ML)-based methods, we also aim to predict the number of time-windowed instant deliveries that the couriers can additionally distribute using their idle capacities at their respective cross-docks. Using ML, we predict in advance the number of shipments for each cross-dock in the coming days. Given these predictions, the number of instant deliveries that a cross-dock can commit to with its idle capacity is then revealed.
In the next subsection, we present a mathematical model for the combined route optimization problem. In the Methodology section, we outline a heuristic algorithm to solve this routing problem and also detail our ML-based solution for the capacity planning problem.
Mathematical Model
We developed a mixed-integer linear programming model for the combined route optimization problem, considering time windows and a unified fleet of couriers that can dynamically deliver both order types. We account for the possibility that additional optimizations can take place during the day for building or updating daily routes from this point on. Deliveries that are not previously completed at the time of optimization are included, along with any instant delivery orders received in the meantime. As a result, couriers start at the cross-dock depot but do not return there except at the end of the day after all deliveries are completed. Packages loaded onto a vehicle at the cross-dock in the morning stay on the same vehicle until delivered. Each optimization (from the second round on) considers the last known completed delivery location of a courier as the starting location for their new route.
We provide below the mathematical model representing this problem along with the notation, as well as the model parameters and decision variables.
V: set of all couriers;
: delivery nodes for standard orders, where ;
: pick-up nodes for instant orders, where ;
: delivery nodes for instant orders, where ;
: cross-dock (starting) node (first optimization);
: departure node of courier (subsequent optimizations);
N: set of all nodes, where ;
d: first optimization indicator, which is one for the first optimization and zero otherwise;
T: remaining time until the end of the delivery period;
: revenue or profit per delivery obtained by delivering to node i;
: driving cost and driving time from node i to j, respectively;
: required service (pick-up or drop-off) time at node i;
: time window’s lower and upper limits for node i, respectively;
: one if the deadline for delivery i is today and zero otherwise;
: late delivery cost for delivery i;
: one if and node i’s delivery package was loaded onto vehicle k in the first optimization and zero otherwise;
: objective function weights for revenue or profit generated from deliveries, total cost of delivery, and late delivery penalties, respectively.
: one if vehicle k travels from node i to j and zero otherwise;
: one if delivery at node i has not been made and zero otherwise;
: arrival time of the vehicle for delivery at node i.
The objective function in Equation (
1) optimizes the weighted combination of the total revenue or profit generated from the deliveries, the total cost of transportation, and the penalties for late deliveries. According to the constraints in Equation (
2), the total time spent by each courier driving to and delivering at all locations visited cannot exceed the remaining time until the end of the day. The constraints in Equation (
3) ensure that all couriers start from the cross-dock in the first optimization. In subsequent optimization(s), the last known position of each vehicle is taken as the starting location. The constraints in Equation (
4) satisfy the couriers’ departure from their last-known locations for reoptimization(s). Return to the warehouse for every courier is ensured by the constraints in Equation (5). One-time visiting and departing delivery points is provided by the constraints in Equations (6) and (7), respectively. The constraints in Equation (
8) ensure that the number of exits should equal the number of arrivals at a location. The constraints in Equation (
9) maintain the iterative calculation of the arrival times for each point with respect to both the service times and travel times. The constraints in Equation (
10) ensure that products are picked up before they are delivered for instant delivery orders. Instant delivery orders must be delivered within a certain time window, which is ensured by Equation (11). Standard deliveries loaded in the first optimization are delivered by the same vehicle according to the constraints in Equation (
12). For cost calculation purposes, a variable to keep track of the deliveries made is defined by the constraints in Equation (
13). Finally, the constraints in Equations (14) and (15) set the starting times for the departure nodes as well as the zero service times at these nodes.
5. Computational Results
To the best of our knowledge, our model is the first to combine the problem of pick-up and delivery with time windows with a standard VRP in an application setting. As a result, there was no test data set available to benchmark this particular use case. To address this, we designed a three-stage computational study to demonstrate the effectiveness of our proposed solution. In the first stage, we demonstrate that our solution can result in cost reductions in a company’s operations in comparison with its existing routing solutions in place. In the second stage, we test the performance of our solution, which is uniquely designed for our integrated problem, using Solomon’s VRPTW instances. This is clearly not ideal, as our solution is not highly optimized or tuned for those instances but rather designed to address the more generalized integrated problem. Yet, it should give us insight into its performance in general. Finally, we generate a complete 1 day data set simulated from the company’s operational data and use our solution to solve it. Since we do not have an optimal solution as a ground truth to compare it against, we publish this data set and the corresponding routing solution we obtained for future researchers to benchmark against. The details for each experiment are given in the relevant sections below.
5.1. Experiment 1
In the early stages of HepsiJet, the last-mile operations were not following a structured routing optimization system. Couriers delivered the packages with respect to their neighborhood know-how. If the process is considered from the route optimization point of view, the problem reduces to single-vehicle route optimization. Before the vehicle departs, it receives all deliveries’ custody from the cross-dock representative and then stops at every delivery point. In the first phases of operations at HepsiJet, distributions are carried out at the discretion of the couriers without benefiting from route optimization. Before using route optimization, we could access the kilometers traveled by the transporter, as a database of information was kept in which we could follow the movements of the transporter. When the daily shipment distribution order of the couriers taken from the database is analyzed, it can be seen that the couriers sort and distribute the shipments based on their own district, neighborhood, and environment know-how. However, by optimizing the distribution order of the transporters using route optimization, it is possible to reduce the time spent on distribution, as well as travel and gasoline costs. The setting was modeled as a TSP problem since the orders to be distributed by each courier during the day were, certain and these points were visited at least once a day. This experiment focused on the cost reduction that the company would have achieved if these early distributions had benefited from the route optimization algorithm proposed in this article.
In order to analyze the cost reduction opportunity, the information of the shipments delivered by 10 couriers from each of 12 cross-docks intuitively for 10 days and without using route optimization was obtained. In light of this information, the mileage traveled by the transporters was calculated using the distribution order. Then, the regular shipments of each courier were optimized with a
-based heuristic algorithm and customized forehead algorithm within the framework of this paper. These optimization outputs’ distance and time costs were compared with the actual non-optimized distribution costs. In the context of this comparison, a cost reduction of almost
was observed in terms of the total distance traveled by the company’s couriers. A cost reduction of the same rate was also observed in terms of travel time. It can be estimated that the distance traveled by the relevant couriers will decrease by an average of 20 km per day if the route optimization output were to be used. The results of this experiment are shown in
Table 3.
5.2. Experiment 2
The algorithm proposed in this paper has rather complex properties when compared with the classical simple vehicle routing problem. As a result, some questions can occur regarding the measurability of the routing algorithm’s performance. In order to test the performance and effectiveness of the route optimization model developed for this paper, a performance analysis was conducted on the most cited routing sample problem set. The Solomon benchmark sets, one of the most used data sets, was used since there is not yet a benchmark test covering all the features for the unique problem that was aimed to be solved in this framework. The problem set used was first produced in [
46] to test the time-windowed vehicle routing problems and then adapted to many different problem types. The solutions for problems of type R are shown in
Table 4, while the solutions for problems of type RC are shown in
Table 5.
5.3. Experiment 3
In this study, a detailed experiment was carried out for the pilot implementation and performance measurement of the proposed framework. Simulation data for two cross-docks for 1 day were prepared for two problems of different sizes. Within the framework of this study, an initial optimization was obtained with multi-vehicle route optimization for all the instant deliveries and standard deliveries known to the cross-dock at the beginning of the working day. Then, the distribution simulation was performed until a new instant order was received, taking into account the service times and travel times. The optimization was updated with the new order information every time new order information came in. New dynamic orders coming within the framework of the simulation were dynamically assigned to the couriers, and the distribution simulation continued. While there are 19 couriers in one cross-dock, the other one amounted to only 3. Consequently, a total of 788 and 73 standard packages were distributed by these cross-docks, respectively. Each cross-dock served two different markets. In addition, 179 and 18 market shipments were simulated during the day, respectively. All optimizations for a cross-dock with 19 couriers had a run time of 19 min. For the three-courier problem simulation, the total optimization run time was 84 s. For both sets of experiments, it can be seen that the couriers distributed all the shipments within the framework of their capacities, and in addition, the couriers in these fleets were crowd-sourced, meaning that they were paid based on the number of packages they collected and delivered. It can also be observed that the total distances traveled by the transporters were relatively uniform. The results for the problem with 19 couriers are shown in
Table 6, while the results for the problem with three are shown in
Table 7.
For the problem of capacity planning, the proposed GBT regression model performance is illustrated in
Figure 3 for delivery capacity predictions in April. The overall
value of the capacity prediction model was
. The average
values for each cross-dock are given in
Table 8, and the best prediction was obtained from cross-dock 2.
To measure the distance reduction that led to cost reduction provided by the framework, we conducted additional experiments on the dataset used in Experiment 3. Our approach was based on the assumption that there were enough hypothetical discrete carriers that could deliver fast deliveries with time windows. In this scenario, cross-dock couriers would handle regular deliveries, while hypothetical couriers would handle fast time-framed deliveries.
Table 9 shows a 33.5% distance reduction for the full dataset, indicating that the proposed approach satisfied a significant amount of cost reduction. Additional fixed and overhead costs for the hypothetical couriers need to be considered for real-life cost reduction calculations.