1. Introduction
Today, people are changing their shopping behavior. Recent advances in information and communication technologies have introduced new shopping channels and models, which make possible the expansion of e-commerce and, consequently, the emergence of new challenges in operational research, transportation, and logistics areas. Specifically, regarding modern e-commerce business models, new decision variables and constraints have been incorporated in them, leading to emerging variants of distribution problems in supply chain management.
Unlike brick-and-mortar stores, where salespeople are available to support and help customers to make their purchases, the popularization of mobile devices with access to the Internet has promoted the use of different shopping channels. The online channel is an example that has emerged as a competitive marketing channel to that of traditional retail centers, transforming e-commerce into a global trend and an important tool for every business worldwide [
1]. With e-commerce, customers are immersed in an environment of a plethora of information, opinions, and access to a vast combined supply of stock, which together allows them to browse through different stores in an online environment. According to some experts, the online shopping channels were predicted to kill off the physical ones. However, they co-exist and have completely transformed the way customers shop nowadays [
2]. The use of a variety of shopping channels is referred to as ’omnichannel retailing,‘ where, instead of having only the single option of physically visiting a store to buy products, consumers can also buy them via online shopping to be delivered at their homes. Hence, the company also becomes responsible for delivering goods ordered online to consumers. Therefore, this new retailing environment has introduced the need to design integrated distribution systems for both serving online customers and for replenishing the stocks of retailer stores [
3].
This paper focuses on the emerging omnichannel vehicle routing problem (OM-VRP) concept, which is an extension of the classical vehicle routing problem (VRP) in which a two-echelon distribution network is considered. The VRP aims to design cargo vehicle routes with minimum transportation costs, and sometimes the problem also considers facility location costs [
4]. The generated routes are designed to distribute goods between depots and a set of final customers during a planning period [
5]. Since the OM-VRP tackles simultaneously the replenishment of a set of retailer stores and the direct shipment of the products from those retailer stores to customers in an omnichannel retailing environment, the OM-VRP can be seen as an integrated problem that combines both the VRP and the pick-up and delivery problem (PDP). In the OM-VRP, which was first introduced by Abdulkader et al. [
1], a single distribution center supplies a group of retailer stores–i.e., the first-echelon. In turn, this set of retail centers serve a set of online customers–the second-echelon–by using a single fleet of vehicles in both delivery levels in order to reduce transportation costs. Apart from reducing operating costs and improving supply chain competitiveness, the optimization of this two-echelon problem holds the potential to improve customer service levels and to enhance the on-time delivery of customer orders [
6]. VRPs and PDPs are frequently considered in the literature as deterministic problems, where customers’ demands and travel times are constant values. In real-life, however, it is frequent that both demands and travel times are exposed to some degree of uncertainty. In these scenarios, it is more accurate to model these constant variables as random variables. In this paper, we extend the previous OM-VRP formulation by replacing the deterministic travel times by stochastic ones. To the best of our knowledge, it is the first time that a stochastic version of the OM-VRP has been considered.
Since the VRP and the PDP are both
NP-hard problems [
7,
8], the OM-VRP is, consequently,
NP-hard as well. Therefore, we propose a multi-start procedure, which employs a savings-based [
9] biased-randomization heuristic technique [
10]. The multi-start approach is able to generate a variety of good and feasible solutions in short computing times. Finally, in order to deal with the stochastic OM-VRP, the multi-start approach is combined with Monte Carlo simulation and extended into a simheuristic algorithm [
11]. Simulation-optimization methods, and simheuristics in particular, allow us to properly deal with stochastic versions of combinatorial optimization problems [
12,
13], such as the one proposed in this work, where travel times are modeled as random variables. The simheuristic approach is also employed to measure the ‘reliability’ level of the proposed distribution plan, i.e., to measure the probability that the plan can be deployed, without any route failure, in a realistic scenario under uncertain travel times. To the best of our knowledge, this is the first time that a stochastic version of the OM-VRP has been considered in the scientific literature.
The remainder of the paper is arranged as follows:
Section 2 presents a brief literature review on related topics;
Section 3 describes, in more detail, the addressed problem;
Section 4 introduces the proposed solving methodologies;
Section 5 presents an analysis of the results and a comparison between the proposed heuristic and another solution methodology; finally,
Section 6 highlights the main conclusions of this work and proposes some future lines of research.
2. Literature Review
With recent advances in information and communication technologies, different shopping channels have emerged which have attracted the attention of customers. Nowadays, customers use different channels to shop, giving them a multichannel shopping environment [
14]. The transition from multichannel to omnichannel retailing has been discussed by Verhoef et al. [
15] and Hübner et al. [
16]. On the one hand, in multichannel retailing, the online and offline channels are treated as separate businesses. On the other hand, the use of both channels in an omnichannel environment is completely integrated, providing consumers with a seamless experience [
17]. Therefore, the customer experience is different although multichannel and omnichannel retailing environments are often considered the same. Beck and Rygl [
18] presented a complete categorization and definition of retailer channels: multi, cross, and omnichannel are clearly defined. According to Hübner et al. [
3], there are several operations within the omnichannel distribution system which are responsible for its excellence, such as expanding delivery modes, increasing delivery speed, and service levels.
As mentioned, the OM-VRP combines the VRP and a PDP. The classical VRP, originally proposed by Dantzig and Ramser [
19], has been extensively studied by practitioners and academics due to its wide applications in several areas. The VRP belongs to a set of
NP-hard combinatorial optimization problems (COPs) [
7]. Therefore, the use of exact algorithms is efficient only for solving small-sized VRP instances. Mostly, these exact approaches are based on the combination of column and cut generation algorithms [
20,
21]. On the other hand, approximate algorithms, such as metaheuristics, are frequently very efficient for solving large-sized instances of COPs. Several metaheuristics have been proposed to solve the VRP, which include tabu search (TS), genetic algorithms (GA), ant colony optimization (ACO), and some hybrid methodologies [
22,
23,
24,
25]. Pick-up and delivery problems have also been studied for more than 30 years. These problems incorporate some route order dependencies in which some nodes should be visited before others in order to transfer inventory between them. Similar to the VRP, the PDP is also an
NP-hard problem [
8], and some exact methodologies, based on branch-and-cut and branch-and-cut-and-price algorithms, have been developed to optimally solve small-sized PDP instances [
26,
27]. Moreover, several heuristics and metaheuristics algorithms have been developed to solve the PDP and some of its variants. We highlight the use of TS [
28], GA [
29], large neighborhood search heuristics (LNS) [
30], adaptive LNS (ALNS) [
31,
32], particle swarm optimization (PSO) [
33], and greedy clustering methods (GCM) [
34]. A literature review and classification of PDPs is presented by Berbeglia et al. [
35].
To the best of our knowledge, Abdulkader et al. [
1] were the first authors to address the omnichannel VRP as a combination of the VRP and the PDP in an omnichannel retailing context. For solving this novel integrated problem, the authors proposed a two-phase heuristic, based on: (i) inserting consumers into retailers routes and on correcting infeasible solutions, and (ii) on joining the routes through the maximum-savings criterion, i.e., the Clarke & Write Savings heuristic (CWS) [
36]. Apart from this two-phase heuristic, a multi-ant colony algorithm (MAC) was proposed. A complete set of instances have been generated to test their methodologies, and the MAC outperformed the heuristic’s performance. Martins et al. [
9] proposed a simple and deterministic heuristic for solving the OM-VRP. Although no best-known solutions were found, the heuristic showed to be promising since feasible solutions were provided in short computational time and no repair operations were needed. Recently, the same problem have been also solved by Bayliss et al. [
37] and Martins et al. [
38]. In contrast to Bayliss et al. [
37], Martins et al. [
38] have framed the problem in the humanitarian logistics field, providing an agile optimization solving methodology–which combines parallel computing with biased-randomized heuristics–which was proposed for solving it in real-time. Competitive results were found in milliseconds, enabling the agile optimization technique able to outperform other heuristics from the literature. On the other hand, Bayliss et al. [
37] formulated the OM-VRP as a mixed-integer program (MIP) and proposed a two-phase local search with a discrete-event heuristic for solving the problem. In contrast to Abdulkader et al. [
1]’s model, their formulation avoids the need to directly assign retailers to vehicles, since the pick-up and delivery constraints are modeled through routing variables. The first phase of the approach employs a discrete-event constructive heuristic, while the second phase aims to refine the most promising solutions obtained in stage one, by applying a sequence of local search neighborhoods. The authors have found new best-known solutions for the vast majority of problem instances, and improved lower bounds for a set of small instances were obtained.
Constructive heuristics are simple deterministic procedures that follow a logical sequence of decisions and which always generate the same solution when starting from the same point. Biased-randomized algorithms incorporate non-symmetric random sampling in order to diversify the behavior of a base constructive heuristic [
39]. At each stage of the base constructive heuristic, a list of candidate decisions is considered. The list is sorted in decreasing order of the benefit concerning an objective function, and a candidate is then randomly selected. In this random selection process, the higher-ranked candidates receive a higher probability of being selected. Biased-randomized algorithms have been employed for solving different COPs in transportation [
40,
41,
42], scheduling [
43,
44], and facility location problems [
39,
45].
Simheuristic algorithms combine the use of approximate methods, such as heuristics or metaheuristics, with simulation techniques, in order to cope with stochastic combinatorial optimization problems. They have proven to be an efficient approach to solve stochastic problems [
46]. The extension of traditional metaheuristics into simheuristics is gaining popularity, and they have already been applied to solve a wide set of stochastic optimization problems, in different fields such as permutation flow-shop [
47], facility location problems [
48], inventory routing problems [
48,
49,
50], telecommunication networks [
51] and finances [
52].
Other examples of simheuristics include Lam et al. [
53], Lopes et al. [
54], and Santos et al. [
55]. Lam et al. [
53] employs a simheuristic approach for evolving agent behavior in the exploration of novel combat tactics. They use a genetic algorithm to find the states, during a flight maneuver, at which an aircraft should transit into the next phase of the maneuver. Lopes et al. [
54] tackles a stochastic assembly line problem using a simheuristic approach. Demands for different product types are stochastic and occur in real-time. The optimization task lies in assigning tasks to stations in such a way that task precedence constraints are respected and the expected throughput is maximized. Santos et al. [
55] develops a simheuristic based decision support tool for an iron ore crusher circuit. They propose and validate a simulation model of throughput efficiency depending on the amount of equipment active in each phase of the crusher circuit. The simheuristic integrates the simulation model with an iterated local search (ILS) metaheuristic. Their solutions improve throughput and reduce energy consumption compared to the all active equipment solution.
Despite the fact that travel times are stochastic in most real-life transportation activities, in the past only deterministic versions of the OM-VRP have been considered. Hence, our work goes one step further by considering random travel times and proposing a simheuristic algorithm to solve the stochastic version of the OM-VRP.
3. Details on the Stochastic OM-VRP
Usually, retail stores must be supplied from the depot with a large number of products, which are packed and frequently measured in terms of the number of pallets. In turn, and in contrast to retail stores’ demand, online customer orders are of negligible size but require processing at a physical retail store before their delivery. Since orders must be processed at retail stores before delivery, products ordered online cannot be shipped directly from the central warehouse to online customers.
In the OM-VRP, a single fleet of vehicles is employed to simultaneously perform three different operations: (i) bulk deliveries to retail stores from a main central depot; (ii) the pick-up of online customer orders from these retail stores; and finally, (iii) the deliver of online customers’ orders. Different product types are available at each retail store. Likewise, each retail store has a limited inventory of processed products that can be delivered to online customers. Therefore, the vehicle responsible for a particular online order must first visit a retail store with that product in stock. The available processed inventory and bulk demand for each retail store are known in advance. It is assumed that bulk inventory does not contribute to a retailer’s available inventory since bulk deliveries cannot be processed within the drop time of delivery. As a simplifying assumption, each online customer orders a single product. When more than one item is ordered, the consumer is replicated in the same location as a ‘virtual’ consumer, and each product is delivered separately. According to Abdulkader et al. [
1], this strategy of considering single-item orders by consumers guarantees the solution feasibility and also minimizes the distribution cost.
Figure 1 depicts how the processes of visiting retail centers and online customers are performed by the same fleet of vehicles. A route starts from the single and central depot, and the same cargo vehicle visits a set of retail centers and customers. Deliveries to customers cannot be directly performed from the depot. Hence, drivers must meet precedence constraints in order to pick-up customer orders from the retail centers before they can be delivered. Since retailers are supplied by the depot, these nodes are both delivery and pick-up points. On the other hand, customers require only the delivery of items from these retail centers, making them, delivery nodes. Apart from precedence and inventory constraints, the service of each route is limited by a maximum tour length and vehicle capacity limit.
We can define the distribution network of the OM-VRP as directed graph
. The set of vertices
is composed of a single depot (0),
r retail centers, and
c online customers. In other words, the problem is represented by a set
R of retail stores, which are supplied by the depot 0, and a set
C of online customers, which are posteriorly supplied by the retail centers. The set
comprises both sets
R and
C, in which
and
. The same fleet of capacitated vehicles, initially stationed at the central depot at the start time
, is employed for serving both the retail centers and consumers. For the complete mathematical formulation of this problem, readers are referred to Abdulkader et al. [
1]. In this work, we extend previous work on this topic by considering the case in which edge-traversal times are stochastic. We define the time to traverse edge
as
where
is the deterministic edge-traversal time (which represents the edge-traversal time under ‘ideal’ traffic conditions),
is a log-normally distributed delay term, and
denotes the distribution of edge-traversal times. The objective of this extended problem is to minimize the total travel cost of the vehicle routes such that: (i) every route starts and ends at the depot; (ii) the routes do not violate the maximum tour length; (iii) every node
is visited by only one vehicle and only once; (iv) the total load of the vehicle does not exceed the vehicle capacity; (v) the total consumer demand to be fulfilled by a vehicle for a specific product does not exceed the inventory picked up by the vehicle; (vi) the retail store determined to satisfy the consumer’s demand must be visited before the consumer and by the same vehicle; and (vii) the routes must have a reliability level greater than a user-set parameter,
. The reliability level of a set of routes is defined as the probability that all routes are completed within the maximum time limit of
.
Two alternative mathematical formulations of the deterministic OM-VRP can be found in Abdulkader et al. [
1], Bayliss et al. [
37]. The differences in this case are a stochastic objective function and probabilistic constraint regarding route completion reliability. Let
be a binary decision variable indicating whether or not vehicle
f in the fleet
F traverses edge
in the
node visit in its route. The stochastic objective function is given in Equation (
1), where
is the stochastic delay associated with traversing edge
.
The probabilistic route completion reliability constraint is given in Equation (
2).
Equation (
2) also accounts for the drop times (
) that are required when performing deliveries and pickups. In this work we employ a simheuristic approach for addressing the stochastic aspects of the OM-VRP. The main novelty of the OM-VRP, in comparison to other VRPs, is the precedence constraints regarding the collection of customer orders from retailers and subsequent delivery to customers. The vehicle capacity constraint regarding the retailer demands that can be satisfied are expressed by Equation (
3), where
denotes the number of ordered product units of node
j (which is zero for customer nodes) and
H denotes the vehicle capacity.
The customer order precedence constraints are expressed by Equation (
4), where
denotes the number of items of type
ordered by node
j (which is zero for retailer nodes) and
denotes the number of items picked up of type
at node
j (which is zero for customer nodes).
Table 1 provides the details for a small-sized instance, which is composed of one central warehouse, four retail stores, and nine online customers. For this instance,
Table 1 provides: the geographic coordinates (
X and
Y) of each node, the demand required from the depot by each retail center
, the demands for each customer
j for each product type
q (
), and the inventory of each available product
,
at the retail centers. Since the products are negligible in terms of capacity, they do not affect the vehicle load capacity. Accordingly,
Figure 2 presents the optimal solution for the provided example. The routes are performed with a total cost of
distance units, and a fleet of vehicles each with a capacity of 100 weight units is employed for performing the routes. In the first route, customers 8, 5, and 9 are served by retail center 1, while customers 12 and 13 are served by retail 3. The route requires
distance units, and the vehicle starts with 93 loaded demand units. Note that the retail centers are first visited by the vehicle in order to deliver the required demand from the depot and to pick-up the products ordered by the customers. Similarly, in the second route, customers 11 and 10 are supplied by retail center 4 while customers 6 and 7 are served by retail store 2. The route requires
distance units, starting with 79 loaded demand units.
5. Results and Discussion
To test the proposed methodologies, we have used the 60 problem instances introduced in Abdulkader et al. [
1]. These instances differ in the number of retail centers (
), online customers (
), and also in the inventory scenarios of the retail centers (tight, relaxed, and abundant). Therefore, for each inventory scenario, 20 different problem combinations are generated. The maximum tour length of the routes and the vehicle capacity are set to 8 h and 100 weight units, respectively. While the BRLH is guided by a single parameter,
, the simheuristic approach is also controlled by the maximum running time
, the acceptance margin of worst solutions
m, and the number,
and
, of simulation repeats in short and long simulation runs, respectively. The stochastic travel times for each edge are set by the log-normal distribution parameters,
and
.
Table 2 summarizes the setup of the parameters employed during the computational experiments. For calibrating these parameters, we have used the methodology proposed in Calvet et al. [
65], which is based on a general and automated statistical learning procedure. Regarding the maximum run time
, the value was set depending on the size of the instance
, which leads to a maximum execution time of 60 s in the case of the largest instance. This stopping criterion is employed in both the multi-start and simheuristic strategies.
The algorithms were coded in Java and all tests were performed on an Intel Core i7-8550U processor with 16 GB of RAM.
Note that three different values have been considered for the parameter, which is used to modify the deterministic travel times. Since the maximum tour length is fixed independently of the size of the instances, small-sized instances are more likely to generate short routes. Therefore, a larger value for introduces more variability in the travel times, which increases route failure rates. On the other hand, large-sized instances are often composed of larger routes, then a small value for is introduced. In particular we have, for small instances (composed of 25 customers), for large instances (composed of 150 customers), and for the remaining medium-sized ones. This approach ensures that small, medium, and large instances each have similar levels of difficulty with respect to the risk of route failure.
The initial analysis aims to compare the solutions generated by our greedy heuristic (LH) and by our MS
BRLH–in which
is (uniformly) randomly selected in the interval
–with the solutions obtained by the two-phase heuristic (AH) and multi-ant colony metaheuristic (MAC) proposed by Abdulkader et al. [
1]. Their methodologies were performed on four 2.1 GHz processors with 16-cores each and a total of 256 GB RAM. That is, we initially focus on comparing each algorithm in terms of deterministic travel cost.
Table 3,
Table 4 and
Table 5 present the results obtained for tight, relaxed, and abundant inventory scenarios, respectively. For each problem instance (I), we present results for: the cost of the best-found solution obtained by the different methodologies; the average cost of our MS
BRLH; the CPU time (in seconds) required by each methodology; and their percentage gaps. The best results returned by the solution methodologies are highlighted in bold.
Figure 4 and
Figure 5 present how both the gap and the cost of the solutions, i.e., the objective function (OF) value, behave according to the employed solution approach and inventory scenario, respectively.
By analyzing
Table 3,
Table 4 and
Table 5, we can observe that our MS
BRLH algorithm is able to improve previous results (from LH) by 9–12%, on average (column
gap (2)–(1)). When comparing the MS
BRLH with the AH heuristic (column
gap (3)–(2)), our approach is able to reduce solution costs by up to 26% in short computational times (about 18 s on average). On the other hand, when comparing our results with those generated by the MAC approach (column
gap (4)–(2)), our solutions are between 9% and 21% worse, on average. Particularly, in the abundant inventory scenario, MS
BRLH’s results are only 9% worse than MAC. Notice, however, that the processing time required by MAC is substantially larger in all inventory scenarios. By analyzing
Figure 4, it is evident that MS
BRLH performs better in the abundant inventory scenario, being able to find one better solution and several others with a maximum gap of 8%. Moreover, we can observe a variability of around 10% in the gap between MS
BRLH and MAC on average. This variability is reduced to around 8% for the relaxed and abundant scenarios. These results demonstrate the robustness of our solution approach for the deterministic case. When analyzing
Figure 5, which presents the overall performance of each solution approach for each inventory scenario, we can observe that our multi-start strategy is more efficient than both LH and AH heuristics, by generating solutions with a lower cost. To complement these box-plots, an ANOVA test was run for each inventory scenario. The
p-values associated with the tight, relaxed, and abundant inventory scenarios were, respectively, of
,
, and
. Also, the Fisher’s LSD test suggests significant differences in all tree scenarios between MAC and AH, between MS-BRLH and AH, as well as between MAC and LH. However, cost differences between MAC and MS-BRLH were not significant in any scenario, despite the fact that MAC employed a noticeably higher amount of computing time than MS-BRLH.
Next, in
Figure 6, we present the convergence of three problem instances’ solutions, including one from each different inventory scenario (instances
,
, and
), by comparing the MS
BRLH with the best-known solutions, in terms of gap. As introduced, these instances require approximately 15 s of processing time, given their magnitude.
As we can see in
Figure 6, the instances demonstrate the same convergence behavior as solution time increases. It is noticeable that the convergence rate is abrupt during the first few seconds. However, contrary to the tight and relaxed inventory scenarios, where the solutions continue improving over time, the search demonstrates more stable convergence during the remaining execution time in the abundant scenario. Being more efficient in the more flexible inventory case, our multi-start approach quickly achieves its best solutions.
Next, we want to compare the quality of the solutions generated for the deterministic scenario (MS
BRLH) against the solutions generated for the stochastic scenario (Sim
BRLH). Since the only difference between the deterministic and stochastic scenarios is that stochastic delays are added to edge traversal times, we can consider the deterministic cost of the best solutions generated by MS
BRLH as a lower bound (LB) of the stochastic travel times of the best Sim
BRLH solution. Moreover, since MS
BRLH does not account for stochastic travel times, we can consider the stochastic travel time of the best MS
BRLH solution as an upper bound (UB) for the stochastic travel time of the best Sim
BRLH solutions.
Table 6 provides both LBs and UBs values and the best-found stochastic travel times obtained by our Sim
BRLH. The solutions reported in the Sim
BRLH column are the best-found stochastic travel times.
As we can see in
Table 6, all the Sim
BRLH solution costs are between the LB and the UB, as expected. For 24 problem instances, the solution returned by our simheuristic is better than the best deterministic solution when it is tested in the stochastic scenario (the
column). From this, we can assert that our Sim
BRLH is able to generate competitive results for the stochastic scenario. The reliability value is calculated by simulation for each solution and represents the probability that all routes are completed within maximum tour duration. For visualizing this trade-off between the deterministic cost of the solutions and their reliability rate, which are conflicting objectives, a Pareto frontier of non-dominated is presented. Accordingly,
Figure 7a–c present the non-dominated solutions for three different instances (
,
, and
), each one belonging to a different inventory scenario. A solution is non-dominated if, no other solution has a greater reliability and a lower or equal travel cost, or if no other solution has a lower travel cost and a greater or equal reliability level. The
solution was randomly chosen, while the
and
solutions are for the same problem but set in the two other inventory scenarios. The square orange dot represents the best deterministic solution found by our MS
BRLH, while the remaining ones, round and blue, represent different solutions with a higher reliability rate, but with higher operating costs.
As we can see in
Figure 7, despite being the solutions with the lowest cost, the best deterministic solutions (square dots) are the least reliable ones for stochastic scenarios. Particularly in
Figure 7a, the best deterministic solution is approximately only 21% reliable under stochasticity. By selecting higher-cost solutions, the reliability rate reaches more than 70%. The same behavior is noticed in
Figure 7b,c, however, the deterministic
solution is reasonably reliable. Usually, low-cost solutions are made up of a small number of large vehicle routes, in terms of travel distance and time. Therefore, when increasing the travel time variability in those scenarios, the risk of exceeding the time constraint is higher. On the other hand, higher cost solutions are built from a larger number of smaller routes, and smaller routes exhibit a lower risk of violating the maximum tour duration constraint in stochastic scenarios. In this way, decision-makers should consider that low-cost solutions under deterministic scenarios might not necessarily be the best option when stochasticity is taken into account.
6. Conclusions and Future Work
With the emergence of online retail channels and the popularization of mobile devices, new retailing modes have become popular. Some of these retailing practices allow customers to browse through different online stores and, then, to get the items bought directly delivered to their homes. Hence, new versions of the vehicle routing problem (VRP) considering additional decision variables and constraints have emerged. Omnichannel retailing leads to an integrated problem combining the VRP and the pick-up and delivery problem. In omnichannel distribution systems, a set of retail stores need to be replenished and, at the same time, products have to be sent from these stores to final customers. The resulting omnichannel VRP consists in two stages: (i) a group of retail stores that must be served from a distribution center; and (ii) a set of online consumers who must be served, by the same fleet of cargo vehicles, from these retail stores.
For solving the deterministic OM-VRP, a simple heuristic was initially introduced. This heuristic was then extended into a multi-start biased-randomized algorithm, which is tested against the state-of-the-art methodologies. Our biased-randomized algorithm performs reasonably well in a set of 60 instances of the deterministic OM-VRP, which allows us to extend it into a full simheuristic for solving the stochastic version of the problem. This is a more realistic version of the OM-VRP where travel times are modeled as random variables following a log-normal distribution. Our simheuristic approach is also capable of measuring the reliability of any proposed solution when it is employed in a stochastic scenario. To the best of our knowledge, this is the first time that such a stochastic variant of the problem has been solved in the literature. Regarding the simheuristic results, we conclude that the best deterministic solutions may perform badly when used in a stochastic scenario. Those solutions are often not reliable in terms of completing all routes within a time limit. On the other hand, our simheuristic approach was able to generate reliable and competitive results for these stochastic scenarios. Therefore, our methodology enables decision makers to choose the solution that better fits his or her utility function in terms of cost and reliability level.
Two-echelon distribution systems, as the one considered here, are typically characterized by the use of different fleets of vehicles at each distribution level. Apart from considering a single fleet of vehicles for serving both delivery levels, the products ordered by customers were assumed to not affect the vehicles’ capacity. Therefore, future directions of research include the consideration of non-negligible sized online customer orders, heterogeneous vehicle fleets, and the incorporation of positive demands for multiple product types for each customer. Regarding the solution approach, different perturbation stages and local search operators could be tested in order to speed up the convergence process towards near-optimal solutions. This might be particularly relevant when considering large-sized instances.