Next Article in Journal
Exploring Facial Somatosensory Distortion in Chronic Migraine: The Role of Laterality and Emotion Recognition—A Cross-Sectional Study
Previous Article in Journal
Improving Quantum Optimization Algorithms by Constraint Relaxation
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Dynamic Scheduling Optimization of Automatic Guide Vehicle for Terminal Delivery under Uncertain Conditions

School of Transportation Engineering, Shenyang Jianzhu University, Shenyang 110168, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2024, 14(18), 8101; https://doi.org/10.3390/app14188101
Submission received: 13 August 2024 / Revised: 2 September 2024 / Accepted: 4 September 2024 / Published: 10 September 2024
(This article belongs to the Section Transportation and Future Mobility)

Abstract

:
As an important part of urban terminal delivery, automated guided vehicles (AGVs) have been widely used in the field of takeout delivery. Due to the real-time generation of takeout orders, the delivery system is required to be extremely dynamic, so the AGV needs to be dynamically scheduled. At the same time, the uncertainty in the delivery process (such as the meal preparation time) further increases the complexity and difficulty of AGV scheduling. Considering the influence of these two factors, the method of embedding a stochastic programming model into a rolling mechanism is adopted to optimize the AGV delivery routing. Specifically, to handle real-time orders under dynamic demand, an optimization mechanism based on a rolling scheduling framework is proposed, which allows the AGV’s route to be continuously updated. Unlike most VRP models, an open chain structure is used to describe the dynamic delivery path of AGVs. In order to deal with the impact of uncertain meal preparation time on route planning, a stochastic programming model is formulated with the purpose of minimizing the expected order timeout rate and the total customer waiting time. In addition, an effective path merging strategy and after-effects strategy are also considered in the model. In order to solve the proposed mathematical programming model, a multi-objective optimization algorithm based on a NSGA-III framework is developed. Finally, a series of experimental results demonstrate the effectiveness and superiority of the proposed model and algorithm.

1. Introduction

Because of its convenient and fast characteristics, takeout has become an essential part of the daily life of the working class. According to the data, the market size of the online takeout terminal delivery industry (China) will grow to CNY 941.74 billion in 2022, with a year-on-year growth of 19.8% [1]. Such large-scale market growth will inevitably bring huge terminal delivery increases [2], which is also an important reason why AGVs are widely used in the field of takeout delivery. However, effective AGV scheduling can improve efficiency, reduce costs, and enhance enterprise competitiveness [3]. Therefore, as an important part of food delivery, AGV scheduling can effectively improve the service level for customers and the competitiveness of food delivery companies.
In the process of AGV delivery, the vehicle will face off-peak hours and peak hours [4,5,6]. In the off-peak hours, AGVs can typically fulfill the delivery demand due to the lower order volume. However, during peak hours, the rapid surge in order volume may surpass the available AGVs, leading to longer customer wait times. In addition, long delivery times can reduce customer trust in the platform. Although increasing the number of AGVs can meet the delivery demand during peak hours, there will be an excessive amount of idle AGVs during off-peak hours, which will lead to a significant increase in costs.
Due to the real-time and random nature of online orders, an AGV needs to carry out dynamic route scheduling in the process of delivery and pick-up. Because the delivery platform is based on a rolling scheduling framework for order allocation [7,8,9,10,11,12,13,14], the route of the AGV delivery is also optimized on a rolling basis. From the microscopic route structure level, the insertion of new orders is always carried out in the process of AGV operation, and this insertion will inevitably have an impact on the planned route of the AGV. Therefore, in dynamic scheduling, most routes based on previous decisions cannot be fully executed.
Due to a variety of complex factors, the meal preparation time of the merchant is uncertain. At the same time, the time when the AGV will reach the customer is also uncertain [15,16]. Therefore, these uncertainties will cause difficulties for AGV scheduling. In order to improve delivery efficiency, the uncertainty of meal arrive time should be fully considered in the route planning of the AGV.
In practice, combined meal pick-up and combined meal delivery are common operation modes, which can improve the efficiency of delivery. Specifically, different orders from the same pick-up point can be combined; different orders for the same customer can be combined for delivery. This mode of operation can significantly reduce the number of redundant driving routes. Obviously, route consolidation increases the difficulty of AGV route planning.
In view of the difficulties above caused by AGV scheduling, this paper makes the following contributions: (1) In order to achieve AGV scheduling optimization, a stochastic programming model is embedded in the rolling framework to solve the uncertain control problem under dynamic conditions in takeout delivery. (2) According to the characteristics of decentralized transportation organization, this paper proposes a method of integrating an open chain structure and order combination mechanism, so that an AGV can realize action combination in the process of dynamic route updating. (3) An after-effect control method is proposed, which does not rely on historical data and demand forecasting. At the same time, the delivery system can adapt to the changes of a dynamic environment through flexible real-time adjustment.
The remainder of this paper is organized as follows. In Section 2, the relevant literature is introduced. In Section 3, detailed problems are described and a mathematical description of the model is given. We then outline our metaheuristic in Section 4. Section 5 presents results from numerical experiments, and the last section provides conclusions and future work.

2. Literature Review

At present, the academic research on AGV scheduling mainly focuses on the application scenarios of the manufacturing industry, warehouse, workshop, and container terminals, and its application in takeout delivery has not been included in any research results. Considering the similarities in scheduling principles, AGV route planning models in other fields still have reference significance in the field of takeout delivery. On the other hand, the field of non-AGV delivery is rich in research results related to delivery models such as the vehicle routing problem (VRP). Therefore, in terms of improving delivery efficiency and delivery quality, it still has reference value for AGV takeout delivery.
The transportation organization models discussed in the current research results mainly fall into two categories. One is the centralized transport organization model with a unified cargo delivery hub, which mainly adopts the VRP variant model for scheduling optimization. The other type is the decentralized transport organization model without a unified cargo delivery hub, which mainly adopts a non-VRP variant model for scheduling optimization.
In recent years, most scholars have used VRP variant models to conduct research on pick-up and delivery route optimization [7,8,13,16,17,18,19,20,21,22,23,24,25,26,27,28]. In the field of AGV research, reducing the driving distance and the delivery cost are the core issues [18,19,20,21,22,23,24,25,26]. In order to improve delivery efficiency, Cheng et al. [19] and Gao et al. [20] set the minimum completion time as the goal and proposed a solution for the multiple AGV dispatching problem (AGVDP). Chen et al. [22] focused on optimizing the material handling and delivery processes of AGVs, aiming to minimize energy consumption and increase customer satisfaction. In the study of non-AGV takeout delivery, although there have been studies on takeout route planning based on known order information [6], these methods are mainly applicable to pre-orders (static situation) and have difficulty coping with real-time changes in order demand. In order to deal with the randomness of customer ordering and the uncertainty of meal preparation time, Ulmer et al. [17] proposed a time buffer using a parametric cost function approximation (CFA). Zheng et al. [28] adopted the fuzzy set theory to obtain the time interval for meal preparation. In addition, considering the randomness of customer demand, Reyes et al. [7] and Yildiz et al. [8] adopted rolling scheduling decisions for takeout delivery in uncertain demand environments (i.e., randomness of customer order).
During the peak hours, some scholars used non-VRP variant models to conduct research on pick-up and delivery route optimization [5,9,10,11,12,14]. However, at present, there is no research using a non-VRP model to solve the AGV scheduling problem. As for the study of non-AGV takeout delivery, Liu et al. [5] studied the optimization of crowdsourcing delivery routes in a deterministic environment, which did not consider dynamic scenarios. Rolling scheduling decisions are more suitable in dynamic situations. Considering the random generation of order demand and carrying passengers, Du et al. [9] proposed a crowdsourcing framework for taxis carrying passengers and delivering takeout at the same time. The framework adopts the method of rolling cycle repeat matching for passengers and food parcels. Considering the real-time dynamics of the order and the inertial action of delivery when the algorithm is involved, Zachary et al. [10] planned the pick-up and delivery path based on the rolling framework. Sun et al. [11] established the coupling relationship between order allocation and path planning based on the delivery process through the real-time dynamics of order allocation and path optimization. A dynamic optimization algorithm is designed to solve the model. In order to optimize delivery routes, Tu et al. [12] proposed an online dynamic optimization framework with an order collection time window based on order dynamics and delivery status. Yu et al. [14] established a real-time order allocation and path optimization model by taking into account the real-time and timeliness of orders.
Based on demand uncertainty, Liu et al. [3] solved the scheduling problem of AGVs by considering the uncertainty of the replenishment time of the workstation. However, in the study of non-VRP takeout delivery, only a small number of scholars have considered the uncertainty factors in the online order delivery problem [17,27,28]. Considering the randomness of customer orders and the uncertainty of food preparation time, a time buffer setting using a parametric cost function approximation (CFA) was proposed by Ulmer et al. [17]. In order to deal with the uncertainty of meal preparation time of merchants, Zheng et al. [27] used the fuzzy set theory to obtain the time interval for meal preparation. Zheng et al. [28] established a Gaussian mixture model reflecting the distribution of uncertain service times. A hybrid distribution estimation algorithm is developed to solve the problem.
In the field of takeout delivery, order combination refers to the combined pick-up and combined delivery of multiple orders. At present, order combination is only discussed in the centralized transportation organization model. On this basis, this paper introduces the concept of action combination. In order to optimize the efficiency of delivery, order combination is considered under the decentralized transportation organization model. In the academic research of AGVs, the principle of one single match is mainly adopted, but the related content of action combination has not been fully considered. However, in the field of non-AGV takeout delivery, scholars have paid attention to the issue of order combination (i.e., bundling and unified distribution of multiple orders) [7,8,13,17]. In addition, Reyes et al. [7] and Yildiz et al. [8] considered the combined take-up of action combination in the VRP mode. Liao et al. [13] proposed to combine the delivery of a group of orders that meet the time window and load requirements. At the same time, a Principal Component Analysis (PCA) and K-means cluster were adopted to decide whether to combine orders.
In route planning, after-effect refers to the consideration of the impact of the previous decision on the current decision. For the study of AGVs, no scholars have specifically discussed the issue of after-effect. In the field of non-AGV takeout delivery, some scholars have paid attention to the importance of after-effect. By predicting the distribution of order demand in the future period, they consider the future order demand in the scheduling process [10,17]. Based on the order quantity data of each restaurant, Ulmer et al. [17] set a time buffer to consider future order demand and generate the probability of customers placing orders. Zachary et al. [10] studied the fairness and decentralized decision making of delivery personnel to accommodate the problem of matching new orders with delivery personnel.
Based on the description above of the existing study, we integrate our research into Table 1. In Table 1, “One-time decision” refers to not considering whether a new customer appears, but only considering whether the customer’s needs fluctuate. “Rolling decision *” means to consider not only the dynamics, but also the inertia routings of delivery personnel. Y means to consider the uncertain factors of the meal preparation time of restaurants, N means not to consider the uncertain factors of the meal preparation time of restaurants. c represents order combination, i.e., under the centralized transportation organization model; C represents action combination, that is, under the decentralized transportation organization model. f represents the scheduling after-effect based on prediction. F represents the scheduling after-effect based on no prediction. S represents a single objective, S* represents the normalization of multiple objectives into a single objective, and M represents multiple objectives.
According to Table 1, the following conclusions can be made: (1) Aiming at the uncertainties in takeaway delivery, some scholars adopt stochastic programming to make one-time scheduling decisions. At present, there is no relevant research to deal with uncertain scheduling optimization under dynamic conditions based on a rolling framework. (2) As for the combination problem in takeout delivery scheduling, the existing studies only discussed the order combination under the centralized transportation organization mode, while the action combination under the decentralized transportation organization mode was not mentioned. (3) For the consideration of after-effect, the existing results are based on statistical models for demand forecasting. This method performs well in macro-order demand forecasting but has difficulty predicting micro discrete events (e.g., customers, meal delivery times, etc.).

3. Problem Formulation

3.1. Problem Description

3.1.1. Takeout Delivery Flow

The takeout delivery process involves three stakeholders: customers, AGVs, and restaurants. The connection among them is established by the takeout platform. Figure 1 shows the flow chart of takeout delivery. Customers place orders on the takeout platform through a mobile application (Step 1, information flow). After the platform receives orders, it sends the order information to the restaurant (Step 2, information flow). The restaurants provide feedback to the platform on whether to accept the order in a short period of time (Step 3, information flow). After receiving feedback, the platform assigns orders to the appropriate AGV (Step 4, information flow). The corresponding AGV goes to the restaurant nodes to collect the food based on its working state (Step 5, physical flow). Then, the AGV delivers the food to the customer’s location within a period (Step 6, physical flow). Finally, they provide feedback to the takeout scheduling platform (Step 7, information flow).

3.1.2. Real-Time Dynamic and Rolling Scheduling

As universally acknowledged, the order time in takeout delivery exhibits significant variability. The AGV scheduling platform needs to continuously receive and assign orders.
Therefore, it is necessary to adjust the AGV’s path constantly. In other words, when a new order is inserted, the algorithm should intervene to re-plan the allocation path of the AGVs. The insertion time of the new order is called the decision time of rolling scheduling. The AGV should not immediately change the current action when performing delivery operations upon receipt of a new order to ensure consistency of operation. This action being carried out by the AGV is called inertia action. Therefore, after each rolling schedule, the starting time of the readjusting path should be the end time of the inertial action. It is important to note that the completion of the current inertia of the AGV does not equal the completion of the current order. Figure 2 shows an example of the order task sequence and routing combination based on the rolling scheduling framework.

3.1.3. Uncertain Preparation Times

The process of an AGV completing delivery tasks involves numerous uncertain factors, including traffic resistance, meal preparation time of restaurants, time of the AGV serving customers, etc. Among them, the uncertainty of meal preparation time of restaurants is the factor that has the greatest impact on the system [27]. Figure 3 shows the difference between the expected delivery time and the actual delivery time. PT1 represents the travel time for the AGV to pick up order 1 from restaurant node 1. PT2 represents the travel time for the AGV to pick up order 2 from restaurant node 2. DT1 represents the travel time of the AGV to customer node 1. DT2 represents the travel time of the AGV to customer node 2. WT1 represents the waiting time of the AGV at restaurant node 1. WT2 represents the waiting time of the AGV at restaurant node 2. ST represents the time of the AGV serving customers. If the food arrives at the restaurant node too late, the waiting time of the AGV at the restaurant node will become longer, which will lead to an order timeout, that is, the actual arrival time of the AGV is later than the latest delivery time of the order, and the customer satisfaction will decline. Therefore, it is necessary to consider the uncertainty of the arrival time, which can better guarantee the robustness of the system.

3.1.4. Routing Combination

In terms of order delivery, for different orders with a small order time span and the same restaurant node, the AGV can wait slightly to pick up the food together; for different orders with the same customer involved, the AGV can unify the collection and delivery. This mode of operation is known as action consolidation (AC). Figure 4 shows the rolling pick-up and delivery path considering routing combination. The original path of the AGV is 1-2-3-4-5-6. A new order is received at customer node 2 (a new restaurant node, 7, is inserted). Since restaurant node 7 and restaurant node 3 correspond to the same customer node, 4, the AGV will merge the food delivery at customer node 4. After the first rolling dispatching, the path is changed to 1-2-7-3-4-5-6 (the executed path is 1-2). A new order is received on restaurant node 7 (a new customer node, 8, is inserted). Since customer node 8 and customer node 4 correspond to the same restaurant node, 3, the AGV merges the food pick-up at restaurant node 3. After the second rolling scheduling, the path is changed to 1-2-7-3-4-8-5-6 (the executed path is 1-2-7). Although taking AC decisions into account in route planning will increase the complexity of optimization, it can significantly improve delivery efficiency and customer satisfaction.

3.1.5. Look-Forward

In real life, route planning is often considered based on look-forward processes. An AGV generally gives priority to the delivery of orders that are closer to their current location, and orders that are relatively far away will be slightly shelved. The purpose is to wait for future orders that may be able to be combined with current orders.

3.2. Modeling

To solve the problem studied, the following multi-objective rolling pick-up and delivery route planning (MO-RPDRP) model is developed.
The model is based on the following assumptions:
  • The inertial action currently being executed cannot be changed.
  • The preparation time of the restaurant is subject to Gaussian distribution.
  • The AGV’s meal pick-up time occupies a small amount of time in the entire delivery process, so it is not considered.
In the MO-RPDRP model, N represents the order set involved in the current rolling period. N 1 represents the current set of orders that have been picked up but not delivered. If the inertial action of the AGV is picking up food, the order involved is also included in N 1 . If the inertial action is food delivery, the order is considered completed and it is not included in N 1 . N 2 represents the current order set that has not been picked up. I represents the set of restaurant nodes. J represents the set of customer nodes. A represents the set of all nodes. If the orders involve one-to-one mapping to both the restaurant nodes and the customers, N 1 + 2 N 2 = A . If there are multiple orders for the same restaurant node or multiple orders for the same customer, N 1 + 2 N 2 > A . If the AC strategy is not executed, the length of the pick-up and delivery path is N 1 + 2 N 2 . Otherwise, the actual length of the pick-up and delivery path is in the interval A , N 1 + 2 N 2 . To make orders and restaurant nodes, orders and customers are mapped one-to-one, and the virtual restaurant nodes and virtual customer nodes need to be set. The set of virtual restaurant nodes is represented as I v i r t u a l , which, along with the set I , forms the set of restaurant node expansion I , I = I I v i r t u a l . The number of the virtual restaurant node set I v i r t u a l continues with the number of the restaurant node set I . The set of virtual customer nodes is represented as J v i r t u a l , which, along with the set J , forms the set of customer node expansion J , J = J J v i r t u a l . The number of virtual customer node set J v i r t u a l continues with the number of the customer node set J . The set of all node expansion is represented as A = I J . For n N 1 , there is a unique customer node corresponding to it. For n N 2 , the only restaurant node can be found, and it has a customer node corresponding to it. The length of the virtual open chain (pick-up and delivery path) is A .
The essence of the MO-RPDRP decision is the sequence of all nodes in the set A and the identification of the AC strategy. The decision variable x i j is set as 0–1 variable. If the AGV travels from node i to node j , the decision variable x i j is equal to 1. Otherwise, it is equal to 0. Decision variable x i j can execute the AC strategy while deciding the sequence of virtual nodes. The identification of the AC strategy is reflected by 0–1 variable, p l 1 and p l 2 . Other sets, parameters, and variables are shown in detail in Table 2.
Objective function 1 minimizes the expected order timeout rate; ω is the timeout rate for all orders.
min F 1 = E ω
Objective function 2 minimizes the expected total customer waiting time. From the perspective of the customer, the customer needs the AGV to deliver the meals as soon as possible. The derivation of the completion time g n of the order n is shown in (14).
min F 2 = E n N g n r n
Objective function 3 maximizes the scheduling after-effect. A look-forward objective is introduced to adjust the impact of this decision on subsequent scheduling. See Section 3.1.5 for a detailed description. In terms of scheduling, orders are completed intensively in the early stage, while orders are completed relatively dispersed in the later stage. In the objective function, it is expressed as (3). Figure 5 shows the relationship between the number of remaining nodes and the number of unfinished orders.
min F 3 = E l = 1 | A | λ l ( a l a l 1 )
Constraints (4) and (5) delineate the deduction constraints governing the sequential visitation order of the order nodes. The access sequence s l of the l th node is deduced by decision variables x i j . Obviously, s | A | is the last access node.
s 1 = j A j x s 0 j
s l + 1 = j A j x s l j , l { 1 , 2 , , | A | 1 }
Constraints (6) and (7) are constraints on the volume of takeout carried by the AGV. The volume of takeout carried upon arrival cannot exceed the maximum capacity of the AGV. If the AGV passes through the restaurant node s l (pick-up stage), the takeout volume is increased. The takeout volume d l + 1 carried by the AGV upon arrival s l + 1 is the sum of d l and v arg α ( s l ) . If the AGV passes through the customer node s l (the delivery stage), the takeout volume is reduced. The takeaway volume d l + 1 carried by the AGV on arrival s l + 1 is the difference between d l and v arg α ( s l ) .
d 0 = n N 1 v n d l + 1 = d l + v arg α ( s l ) s l I d l v arg β ( s l ) s l J l { 1 , , | A | 1 }
d l V , l { 1 , , | A | 1 }
Constraint (8) imposes limitations on the visiting sequence of the order nodes. For the same order, the AGV should visit the restaurant node α n first, then the customer node β n is visited. It is expressed that the time when the AGV arrives at the restaurant node a arg s ( α n ) should be less than the time when it arrives at the customer node a arg s ( β n ) .
a arg s ( α n ) < a arg s ( β n ) , n N
Constraints (9)–(11) are the AC identification constraints of the order.
p l 1 = 1 , m a p s l 1 = m a p s l + 1 1 0 s l , s l + 1 I , l { 1 , | A | 1 }
p l 2 = 1 , m a p s l 2 = m a p s l + 1 2 0 s l , s l + 1 J , l { 1 , | A | 1 }
p l 1 + p l 2 1 , l { 1 , | A | 1 }
Constraint condition (14) is the constraint of the completion time of the order; g n is the sum of a arg s ( β n ) and t n s . The derivation of a l is shown in (13). If there is a combination of routings between s l and s l 1 (merged meal pick-up or delivery), a l 1 is equal to a l . If s l 1 is the restaurant node, which does not produce a merged meal pick-up with the last node, a l is the sum of a l 1 and t s l 1 s l d . If s l 1 is the restaurant node, which produces a merged meal pick-up with the last node, s e t l is expressed as the sequence set of s l 1 and the last merged meal restaurant node. The expression is (12). s e t l is the sequence set of s l 1 and the last merged meal restaurant node. a l 1 is the maximum sum of the ordering time in s e t l 1 and the meal preparation time of the restaurant. The restaurant’s meal preparation time t arg α ( s w ) p follows the Gaussian distribution. In other words, for different restaurants, the Gaussian distribution is obeyed with a different expectation μ i and variance σ i 2 . If s l 1 is the customer node, it does not produce a merged meal delivery with the last node; a l is the sum of a l 1 , t arg β ( s l ) s , and t s l 1 s l d . If s l 1 is the customer node, it produces a merged meal delivery with the last node; a l 1 is equal to a l 2 . The service time of the same customer node is the same, so the calculation of a l is not affected.
s e t l = { l | l = 1 l 1 p l 1 = 1 } l
a 1 = T 0 + t s 0 s 1 d a l = a l 1 p l 1 1 + p l 1 2 = 1 max ( a l 1 , max w s e t l 1 ( r arg α ( s w ) + t arg α ( s w ) p ) ) + t s l 1 s l d p l 1 1 + p l 1 2 = 0 s l 1 I a l 1 + t arg β ( s l 1 ) s + t s l 1 s l d p l 1 1 + p l 1 2 = 0 , s l 1 J
g n = a arg s ( β n ) + t n s , n N
Constraint condition (15) is the quantity constraint of uncompleted orders.
λ 1 = N λ l + 1 = λ l s l I λ l 1 s l J
Constraint conditions (16)–(19) are open chain structure constraints. To maintain the flow balance between intermediate nodes, (16)–(17) traverse all restaurant nodes and customer nodes. (18) and (19) are open chain endpoint constraints; the two endpoints of the path are not connected. (20) is the constraint to eliminate a sub-loop.
i A s 0 x i j = 1 , j A
j A x i j = 1 , i A s A s 0
i A s 0 x s | A | i = 0
i A x i s 0 = 0
i H j H x i j H 1 , H A , 2 H A 2
The established MO-RPDRP is a stochastic programming model. Three objective functions in the model are derived as follows.
min F 1 = E ( ω ) = E ( n N max ( 0 , sgn ( g n b n ) ) N ) = 1 N n N E ( max ( 0 , sgn ( g n b n ) ) ) = 1 N n N 0 + sgn ( g n b n ) f ( g n b n ) d ( g n b n ) = 1 N n N 0 + f ( g n b n ) d ( g n b n )
where f ( g n b n ) represents the probability density function of g n b n .
min F 2 = E n N g n r n = E ( n N g n ) n N r n
min F 3 = E l = 1 | A | λ l ( a l a l 1 ) = l = 1 | A | λ l E a l a l 1 = l = 1 | A | λ l E ( a l ) E ( a l 1 )
Obviously, according to (21)–(23), the core of the derivation of the three objective functions is the derivation of E ( a l ) . So we derive the expression (24) of E ( a l ) according to the recursive form of (13).
a l = a l 1 p l 1 1 + p l 1 2 = 1 max ( a l 1 , max w s e t s l 1 ( r arg α ( s w ) + t arg α ( s w ) p ) ) + t s l 1 s l d p l 1 1 + p l 1 2 = 0 s l 1 I a l 1 + t arg β ( s l 1 ) s + t s l 1 s l d s l 1 J , p l 1 1 + p l 1 2 = 0
Obviously, when p l 1 1 + p l 1 2 = 1 , E ( a l ) = E ( a l 1 ) when p l 1 1 + p l 1 2 = 0 and s l 1 J , E ( a l ) = E ( a l 1 ) + t arg β ( s l 1 ) s + t s l 1 s l d .
When p l 1 1 + p l 1 2 = 0 and s l 1 I , l > 1 , due to the different forms of a l 1 under l . When the AGV did not pass through the restaurant node before arriving at s l 1 , a l 1 is a definite constant. When the AGV passes through the restaurant node before arriving at s l 1 , a l 1 is a random variable.
(1) If the AGV did not pass through the restaurant node before arriving at s l 1 , a l 1 is a constant. The expectation of a l can be obtained by referring to the derivation of E ( a 2 ) .
(2) As can be seen from the above, if the AGV passes through the restaurant node before arriving at node s l 1 , a l 1 is a random variable. Obviously, the original function of the E ( a l ) integrand in the expression is a non-elementary function [29]; it is difficult to solve this problem by directly finding the original function.
Therefore, we choose to use the discretization method to estimate the expectation of a l [30]. In other words, the meal preparation time obeying the Gaussian distribution is transformed into a series of discrete time points. The expectation of a l is estimated by using the rules and linear operations in Algorithms 1–3. Obviously, the time when the AGV arrives at any node is a time set, and the time when it leaves any node is also a time set.
Algorithm 1. Comparison rule 1.
Input: (1) Time set of AGV arriving at restaurant node i : A arg s ( i ) a r r i v a l
(2) The meal preparation time set of restaurant node i : C arg α ( i ) d i n i n g
Output: The set of times when the AGV leaves restaurant node i : Z arg s ( i ) d e p a r t u r e
Z arg s ( i ) d e p a r t u r e = [ ]
For a arg s ( i ) a r r i v a l   A arg s ( i ) a r r i v a l
  For c arg α ( i ) d i n i n g   C arg α ( i ) d i n i n g
    If a arg s ( i ) a r r i v a l > c arg α ( i ) d i n i n g
            Z arg s ( i ) d e p a r t u r e = Z arg s ( i ) d e p a r t u r e   a arg s ( i ) a r r i v a l
    else
            Z arg s ( i ) d e p a r t u r e = Z arg s ( i ) d e p a r t u r e c arg α ( i ) d i n i n g
    End if
  End for
End for
Algorithm 2. Comparison rule 2.
Input: (1) Time set of AGV arriving at restaurant node i : A arg s ( i ) a r r i v a l
(2) Total consolidated orders at restaurant node i : W
(3) The meal preparation time set of restaurant node i : C arg α ( i ) w d i n i n g
Output: The set of times when the AGV leaves restaurant node i : Z arg s ( i ) d e p a r t u r e
Z arg s ( i ) d e p a r t u r e = [ ]
w = 1

For a arg s ( i ) a r r i v a l A arg s ( i ) a r r i v a l
   Z arg s ( i ) d e p a r t u r e = F ( w , W , a arg s ( i ) a r r i v a l , C arg α ( i ) w d i n i n g , Z arg s ( i ) d e p a r t u r e )
End
   F ( W , a arg s ( i ) a r r i v a l , C arg α ( i ) w d i n i n g , Z arg s ( i ) d e p a r t u r e )
Input: (1) Current level: w
   (2) Total order number: W
   (3) Time set of AGV arriving at restaurant node i : a arg s ( i ) a r r i v a l
   (4) The meal preparation time set of restaurant node i : C arg α ( i ) w d i n i n g
   (5) The set of times when the AGV leaves restaurant node i : Z arg s ( i ) d e p a r t u r e
Output: Result
v a l u e = a arg s ( i ) a r r i v a l
If w < W
  For c arg α ( i ) d i n i n g C arg α ( i ) w d i n i n g
    v a l u e 2 = max ( v a l u e , c arg α ( i ) d i n i n g )
    Z arg s ( i ) d e p a r t u r e = F ( w + 1 , W , v a l u e 2 , C arg α ( i ) ( w + 1 ) d i n i n g , Z arg s ( i ) d e p a r t u r e )
  End
Else
  For c arg α ( i ) d i n i n g C arg α ( i ) w d i n i n g
     v a l u e 2 = max ( v a l u e , c arg α ( i ) d i n i n g )
     Z arg s ( i ) d e p a r t u r e = Z arg s ( i ) d e p a r t u r e   v a l u e 2
  End
End
Result = Z arg s ( i ) d e p a r t u r e
Algorithm 3. Selection rule.
Input: (1) The set of times when the AGV leaves restaurant node i : Z arg s ( i ) d e p a r t u r e
  (2) The upper limit of time node density: γ
Output: The set of times when the AGV leaves restaurant node after deletion: Z arg s ( i ) d e p a r t u r e
Z arg s ( i ) d e p a r t u r e = [ ]
n u m = [ ]
q = ( max ( Z arg s ( i ) d e p a r t u r e ) min ( Z arg s ( i ) d e p a r t u r e ) ) / γ
If Z arg s ( i ) d e p a r t u r e > q
   k d e l = Z arg s ( i ) d e p a r t u r e q
   k r e m = mod ( Z arg s ( i ) d e p a r t u r e / k d e l )
   m = Z arg s ( i ) d e p a r t u r e / k d e l
   n = ( Z arg s ( i ) d e p a r t u r e k r e m ) / m
  If k r e m > 0
    Randomly generate a 0–1 q vector with length m . The sum of elements in vector is k r e m
    For i in vector
      If i = 1
         n u m = n u m i n + 1
      Else
         n u m = n u m i n
       End if
  End for
  Divide the [ max ( Z arg s ( i ) d e p a r t u r e ) , min ( Z arg s ( i ) d e p a r t u r e ) ] into m intervals based on the number in n u m
  Randomly delete a time point within every interval
  Union of deleted intervals is Z arg s ( i ) d e p a r t u r e
 Else
  Divide the [ max ( Z arg s ( i ) d e p a r t u r e ) , min ( Z arg s ( i ) d e p a r t u r e ) ] evenly into m intervals
  Randomly delete a time point within every interval
  Union of deleted intervals is Z arg s ( i ) d e p a r t u r e
End if
Else
   Z arg s ( i ) d e p a r t u r e = Z arg s ( i ) d e p a r t u r e
End if
By adopting the discretization method above, on the one hand, the difficulty of directly finding the original function is avoided in the process of finding the expectation of a l ; on the other hand, since this problem is a dynamic problem based on the rolling framework, the discretization method can reduce the time complexity. Anyway, the calculation efficiency can be improved by adjusting the parameters τ and γ .
Obviously, we need to modify the objective function. For objective function (1):
min F 1 = E ( ω ) = 1 N n N E ( max ( 0 , sgn ( g n b n ) ) ) = 1 N n N a arg s ( β n ) a r r i v a l A arg s ( β n ) a r r i v e max ( 0 , sgn ( a arg s ( β n ) a r r i v e + t n s b n ) ) A arg s ( β n ) a r r i v e
For objective function (2):
min F 2 = E n N g n r n = 1 N n N a arg s ( β n ) a r r i v e A arg s ( β n ) a r r i v e ( a arg s ( β n ) a r r i v e + t n s ) A arg s ( β n ) a r r i v e E ( n N r n ) = 1 N n N a arg s ( β n ) a r r i v e A arg s ( β n ) a r r i v e ( a arg s ( β n ) a r r i v e + t n s ) A arg s ( β n ) a r r i v e n N r n
For objective function (3):
min F 3 = E l = 1 | A | λ l ( a l a l 1 ) = l = 1 | A | λ l E ( a l ) E ( a l 1 )
E ( a l ) = 1 A l a r r i v e a l a r r i v e A l a r r i v e a l a r r i v e

4. The Proposed AC-NSGA-III Algorithm

4.1. NSGA-III

The constructed MO-RPDRP model is essentially a Traveling Salesman Problem (TSP) with complex constraints and an AC strategy. Obviously, it falls into the category of NP-hard problems, which does not have an exact polynomial-time algorithm. Furthermore, the three objective functions considered in the model trade off of each other, so this problem is a multi-objective optimization problem. To solve the MO-RPDRP model, we propose the AC-NSGA-III algorithm, which is based on the NSGA-III [31], one of the multi-objective intelligent optimization algorithms. We have a different approach to algorithm encoding. The flow chart of the algorithm is shown in Figure 6.

4.2. Operator Design

4.2.1. IRC Coding

Chromosome encoding is a pivotal component of the AC-NSGA-III algorithm. To ensure that the generated chromosome encodings correspond to feasible solutions and to prevent infeasible solutions resulting from crossover operations that violate model constraints (such as delivery priority, vehicle capacity limits, etc.), a decoding rule based on a cellular array structure was designed.
Path generation rule is shown in Algorithm 4. The cellular array Q represents a chromosome, Q = [ x , y ] . The design of the chromosome structure leverages the concept of constructive heuristic methods, constructing the solution on an order-by-order basis. The variable x represents the construction sequence of the generated orders. The variable y is composed of two components, y = [ y 1 , y 2 ] , where the values indicate the insertion positions of the corresponding nodes. Specifically, y 1 denotes the periodic insertion positions of the restaurant nodes associated with x , and y 2 denotes the periodic insertion positions of the customer nodes associated with x . Both x and y 2 are one-dimensional arrays of length |N|, while y 1 is a one-dimensional array of length |N2|.
The values in y 1 and y 2 exhibit periodic reducibility, meaning they are handled cyclically. When these values exceed the total number of available insertion positions, the insertion position for the corresponding order node is determined by the remainder of this value divided by the total number of insertion positions. If the remainder is zero, the insertion position is the last available slot.
The decoding process for the chromosome involves inserting the restaurant and customer nodes corresponding to the order into the delivery route according to the values specified in y 1 and y 2 . This ensures that the resulting solutions adhere to the model constraints while facilitating effective order construction and scheduling.
Algorithm 4. Path generation rule.
Input: Q; Q = [x, y] %chromosome
    x; % the construction sequence of the generated orders
     y = [y1, y2]; %arrays representing node insertion locations, including periodic insertion
    locations for pick-up and customer nodes
    N; %The number of orders involved
Output: RESULT; %node access order
   L = []; %initially empty set, storing the temporary path (to be inserted)
   Pick (); %function of getting the restaurant node
   Delivery (); %function of getting the customer node
   P (); %function of getting the list of insertable locations
   Insert Node (); %node insertion function
   i = 1;
   L = [pick(x(i)), delivery(x(i))];
   for i = 2: N
    M Node = pick(x(i)); % gets the restaurant node
      n1 = P (M Node, L) %gets the list of insertable locations
      % Calculate the insertion location of the restaurant node
      if mod(y1(i), length(n1) =0
        insert Index=n1(end);
      else
       insert Index = n1((mod(y1(i), length(n1)));
      L = insert Node (L, M Node, insert Index); % inserts the node at the insertion position
    C Node = delivery(i); % gets the customer node
      n2 = P (C Node, L) %gets the list of insertable locations
      % Calculate the customer node insertion location
      if mod(y2(i), length(n2) = 0
        insert Index = n2(end);
      else
       insert Index = n2((mod(y1(i), length(n2)));
    L = insert Node(L, C Node, insert Index); % insert a node at the insertion position
   end
    RESULT = L
(1) In this process, the AC strategy must be followed, and node insertion must not violate constraint (9), meaning that if the actual positions represented by adjacent order nodes in L are the same, they should be combined for pick-up or delivery.
(2) Node insertion must not violate constraints (6)–(8), meaning it should not breach the delivery volume constraints or the order node and visit sequence restrictions.
(3) Since the decoded results are real number sequences, the open chain structure constraint and sub-loop elimination constraints (16)–(20) can be naturally avoided. The positions filtered by this restriction rule are the actual insertable positions.
Figure 7 shows an example of IRC, which includes three orders involving restaurant nodes 1, 2, and 3 (denoted as M1, M2, and M3) and customer nodes 1, 2, and 3 (denoted as C1, C2, and C3). x = ( 2 , 1 , 3 ) ; y = 5 , 7 , 5 3 , 5 , 8 . In the encoding, the process starts with restaurant node 2 (M2) and customer node 2 (C2) associated with the first order, forming the path M2-C2. Then, according to the value 5 in y 1 , the total number of available insertion positions is 2, 5% 2 = 1, so restaurant node M1 is inserted into the first available position. The visit sequence after insertion is M2-M1-C2.
Next, according to the value 5 in y 2 , customer node C1 associated with order 1 is inserted. Based on constraint 8, restaurant node 1 must be visited before customer node 1, so the total number of available insertion positions is 2. 5 mod 2 = 1, so customer node C1 is inserted into the first available position. The visit sequence after insertion is M2-M1-C1-C2.
Finally, according to the value 5 in y 1 , restaurant node M3 associated with order 3 is inserted. Since M3 and M2 correspond to the same actual pick-up location, they are merged for pick-up. Similarly, C3 is inserted following the same principle, and thus the complete delivery and pick-up route for all orders is formed.

4.2.2. Initial Population

The population size is N ; the initial population P 0 is generated randomly.
x is a randomly generated sequence. Every random positive integer in y 1 , y 2 is not greater than A 1 .

4.2.3. Non-Dominant Hierarchical Ranking of Population

Non-dominated sorting effectively distinguishes individuals’ relative advantages in multi-objective optimization, aiding in the selection and retention of high-quality individuals, thereby maintaining population diversity and evolutionary effectiveness. Multi-objective optimization algorithms generate selection pressure through population sorting, encouraging the population to continuously approach the Pareto front [32,33].

4.2.4. Reference Point Evaluation System

In order to evaluate the population diversity, the reference point selection method of NSGA-III is adopted [34].

4.2.5. Binary Tournament Selection

Two individuals from the population are randomly selected to compete. If the non-dominant ranks of them are different, the low-ranking individual is selected to be put into the mating pool. Otherwise, the individual whose associated reference points have a smaller number of niches is select to be put into the mating pool. The selection is repeated N times.

4.2.6. OBX and SBX Mixed Crossover

For x , order-based crossover (OBX) is used, ensuring that no elements are missing after crossover, with a crossover probability of P c . It is shown in Figure 8. In order to better deal with the sparsity of individual space in the process of dealing with multi-objective optimization problems, simulated binary crossover (SBX) [35] is adopted for y 1 , y 2 . The non-integer solution generated by SBX was rounded in the way of rounding off.

4.2.7. Mutation Operator

Exchange mutation is adopted for x . When an exchange mutation is applied to chromosomes based on a binary or integer, two genes are randomly selected. Then, their values are exchanged. When the random number is less than the given mutation rate P m , chromosome mutation occurs [36]. Non-uniform variation was adopted for y 1 , y 2 . The original gene value was randomly distributed. The result was taken as the new gene value. After every locus is calculated with the same probability, it is equivalent to the entire solution vector making a slight change in the solution space. In the operation of a non-uniform mutation from X = x 1 x 2 x 3 x k x 1 to X = x 1 x 2 x 3 x k x 1 , if the value range of mutation point x k is U min k , U max k , the new gene value x k will be determined by (28), where t is the current iteration number.
x k = x k + Δ ( t , U max k x k ) ,   i f   r a n d o m ( 0 , 1 ) = 0 x k Δ ( t , x k U min k ) ,   i f   r a n d o m ( 0 , 1 ) = 1

4.2.8. Elite Strategy

Parent population P t and offspring population Q t are merged into a population R t .
Fast non-dominant hierarchical sorting is when individuals in R t are divided into ranks ( F P t 1 , F P t 2 , ). A new population of S t is constructed from F P t 1 . When F P t L is put in it, S t = N or S t > N .
Association operation is when the line between the origin and the reference point is taken as the reference line. The individuals in population P t + 1 ( P t + 1 = S t / F P t L ) are associated with the reference points on the nearest reference line [37,38]. The number of individuals associated with each reference point (i.e., the number of niches) is calculated; it is denoted as ρ j (the niches number of the j th reference point is ρ j ).
Individual retention operation is when elite individuals from F P t L are selected. Then, they enter P t + 1 . In order to maintain the diversity of the population, the reference point of ρ j with the minimum is denoted as j (if ρ j of multiple reference points is equal and minimum, one of them is denoted as j ¯ at random).

4.2.9. Termination Condition of the Algorithm

The convergence algebra threshold is set as θ , and the inverse generation distance is introduced to identify the convergence algebra of the algorithm [39]. IGD is the average value of the minimum distance between the point (feasible solution) on the front surface of each theoretical Pareto and the non-dominated solution set obtained by the algorithm. Similarly, the distance between the non-dominated solution set of two successive generations of populations can be expressed. Since the elite strategy of AC-NSGA-III makes the non-dominated solution set of the progeny population not inferior to that of the parent population, the IGD value is calculated based on the non-dominated solution set of the progeny population. β is defined as the threshold value. In successive generations θ , when the IGD between the parent generation and the child generation is smaller than β , the algorithm converges. The calculation formula of IGD is shown in (29). Where Q t is the child non-dominant solution set, P t is the parent non-dominant solution set, and d ( v , Q t ) is the minimum distance from v to P t of the individual in Q t .
I G D ( Q t , P t ) = 1 Q t v Q t d ( v , P t )

5. Experiments

5.1. Basic Data

In order to explore the effectiveness of the proposed model and algorithm, the following experiments were conducted. According to the one-month actual investigation of a delivery enterprise, it is calculated that the number of orders received by AGVs is the largest during the peak hours (11:00–13:00) in one day. Therefore, the information of the top four orders of peak hours in the month were selected as examples, and the details of the examples are shown in Appendix A. The design of fixed parameters in the model is shown in Table 3. The hardware environment of the numerical experiment is Intel i9-12900K/KS CPU, DDA4-3600MH-32G(16G*2) memory, 970-EVO-Plus-NVMe-M.2-1T solid state disk, windows 10 operating system. The proposed algorithms are implemented in Python language.

5.2. Experimental Design

(1) Comparison of optimization effects considering uncertainty factors.
Under the same algorithm conditions, to verify the superiority of the proposed model, the uncertainty model considering the meal preparation time and the model without considering the meal preparation time are compared.
(2) Comparison of order combination modes.
Under the same algorithm conditions, to verify the superiority of the proposed order combination method, the AC strategy is compared with other order combination strategies (OCS [13] and NSDP [10]).
(3) Comparison of look-forward optimization effects.
Under the same algorithm conditions, to verify the cumulative optimization effect of look-forward processes in the rolling optimization process, the model considering objective function 3 is compared with the model without considering objective function (3).
(4) Comparison of scheduling methods.
In order to verify the superiority of the proposed scheduling method, comparative tests are conducted based on the example in 5.1. We carefully selected three scheduling methods with the same transport organization framework but different strategies. These scheduling methods come from different literature sources and have the following characteristics:
  • Scheduling method A (Du et al., 2019 [22]): This method focuses on multi-objective optimization, but does not take order consolidation strategy into account, and adopts deterministic scheduling strategy for solution under dynamic environments.
  • Scheduling method B (Yu et al., 2022 [28]): This method also focuses on multi-objective optimization, does not incorporate order consolidation strategy, and implements deterministic scheduling method in dynamic environments.
  • Scheduling method C (Zachary, 2019 [23]): This method focuses on single-objective optimization, does not involve an order consolidation strategy, and adopts a deterministic scheduling strategy in a dynamic environment with additional attention to after-effects.
  • The method proposed in this paper: the multi-objective optimization framework is introduced, the order consolidation strategy is considered comprehensively, and the uncertain scheduling method is adopted in a dynamic environment. In addition, this study further discusses the problem of after-effect, which improves the comprehensive adaptability and practicability of scheduling strategy.
We selected objective function 1, expected order timeout rate, and objective function (2), expected customer waiting time, as evaluation indicators and analyzed the performance of each model through comparative experiments.

5.3. Comparison of Results

  • Comparison of optimization effect considering uncertainty factors.
To evaluate the effectiveness of the proposed model, numerical experiments were conducted on the deterministic model (model-C) and the uncertainty model (Model-Unc) under each example. The two models were run 100 times, respectively. The data characteristics of total customer waiting time and order timeout rate were obtained as shown in the figure below.
Figure 9 is box chart, showing the median, lower quartile, upper quartile, minimum, and maximum values. In the figure, the dashed lines represent the expected values of the solutions of model-C and Model-Unc under each example. As can be seen from the figure, on the one hand, the expected value of the solution under Model-Unc is better. On the other hand, the model-C solution has a larger fluctuation range than Model-Unc solution, and the unfeasible rate of model-C solution is higher. The uncertainty model proposed by us has a good immune effect to uncertainty factors. In other words, the solution of Model-Unc has good robustness in uncertain environment.
To better reflect the optimization effect of the proposed model in uncertain environment, A1 is taken as an example. The mean value of the meal preparation time remained unchanged. As the standard deviation of the meal preparation time increased, the standard deviation trend of the solutions of model-C and Model-Unc was calculated. The experimental results are shown in the figure below.
In Figure 10, as the standard deviation of meal preparation time increases, the standard deviation of total customer waiting time under Model-Unc is much smaller than that under model-C. When the standard deviation of meal preparation time of restaurants is 1.2, the scheduling scheme solved by Model-Unc reduces the standard deviation of total customer waiting time by 38.1% compared with model-C. The experimental results of the order timeout rate are similar, thus they will not be described here.
  • Comparison of order combination patterns.
The solution of the MO-RPDRP model is a three-dimensional Pareto front. In order to enable the Pareto front to be located on a two-dimensional plane, the look-forward objective dimension is removed in the following experiments. In this experiment, the model considers the look-forward objective.
In this experiment, the rolling scheduling with a large number of remaining unfinished orders is taken as an example. It makes the comparative effect of Pareto frontier more obvious under different order combination modes. Figure 11 shows the change of the number of remaining unfinished orders under each rolling schedule of model-1 (four examples). In each group of examples, the 7th and 8th rolling scheduling have the largest number of remaining orders, so they are selected as examples. Figure 12 takes A1 as an example; it shows Pareto frontiers of three order modes under model-1. It can be seen from the figure that the Pareto frontier of the model in AC strategy is significantly better than the other two.
  • Comparison of look-forward optimization effects.
The model considering the look-forward objective is denoted as model-1. A model that does not consider the look-forward objective is denoted as model-2.
Taking A1 as an example, to compare the solutions of mode-1 and mode-2, the orders of the 4th, 7th, 11th, and 15th rolling scheduling are selected as experimental examples. The solution extracted from the Pareto frontier obtained by mode-1 is represented as solu-1. The solution extracted from the Pareto frontier obtained by mode-2 is represented as solu-2. The experimental results are shown in the figure below.
In Figure 13, on the fourth rolling scheduling, the solutions extracted from Pareto front of model-2 dominate the solutions extracted from Pareto front of model-1. With the increasing of rolling schedule times, the scheduling after-effect of look-forward objective increases gradually. Pareto frontier obtained by model-1 considering look-forward objective is significantly better than Pareto frontier obtained by model-2 without look-forward objective. The solutions extracted from the Pareto front of model-2 no longer dominate the solutions extracted from the Pareto front of model-1.
  • Comparison of advantages of scheduling methods
When the experiment is completed, the four scheduling methods are compared comprehensively. The results are shown in Table 4. The experimental data clearly show that the proposed scheduling method shows significant advantages in objective function 1 and objective function 2.

6. Conclusions

In this paper, we considered the uncertainty of the meal preparation time and the AGV route optimization problem was addressed. Based on the realistic constraints, such as order node access sequence restrictions and vehicle volume, a MO-RPDRP model integrating an AC strategy and node sequence is developed. In the model, customer waiting time and order timeout rate are taken into account. Meanwhile, an improved NSGA-III algorithm is designed. In the numerical experiment, the actual survey data of a regional delivery enterprise during the peak hours are taken as examples. The results show that the following: (a) Compared with the deterministic model, the solution obtained by the MO-RPDRP model has good robustness and a better optimization effect in an uncertain environment. (b) The Pareto frontier of the model is better under the AC strategy. (c) The Pareto frontier of models that consider after-effects is better. (d) The proposed scheduling method has advantages over existing methods in terms of the order timeout rate, customer waiting time, and algorithm efficiency.
Combined with the abovementioned results, our research demonstrates robustness, an AC strategy and after-effect strategy, and the improvement of algorithm efficiency in an uncertain environment. These findings not only promote the development of theoretical research, but also provide more effective solutions for practical scheduling problems, in addition to providing important practical significance for improving the efficiency of transportation organizations and customer satisfaction. As far as we know, based on an uncertain environment, the literature considering order combination for the rolling optimization of takeout, pick-up and delivery paths is still in the initial stages. Further research in related areas can be carried out in the future. For example, to achieve higher delivery efficiency, the collaborative optimization problem of multiple AGVs could be studied.

Author Contributions

Conceptualizing J.M., methodology Q.S., validation, T.L., data curation, P.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Basic scientific research Project of colleges and universities of Liaoning Province Department of Education in 2024, grant number LJ212410153031 and the APC was funded by the Liaoning Province Education Department.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Part of the data used in the experiment is shown in Appendix A. Please contact us for more detailed data.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Example 1 (A1).
Table A1. Example 1 (A1).
The Number of Rolling SchedulesOrder TimeRestaurant Preparation Time Is Subject to Distribution ExpectationsRestaurant Preparation Time Follows the Standard Deviation of the DistributionRestaurant Node NumberCustomer Node NumberThe Latest Delivery Time
(1)11:00:2220.511411:30
(2)11:19:5830.221511:45
(3)11:23:192.4131611:53
(4)11:25:242.80.831711:55
(5)11:31:2520.641812:00
(6)11:47:143.4251912:00
(7)11:51:5451.662012:20
(8)11:56:4251.562112:25
(9)11:59:3250.862212:30
(10)12:05:492172312:35
(11)12:11:3640.482412:40
(12)12:20:2020.692412:55
(13)12:27:145.41.7102512:55
(14)12:30:2831.3112613:10
(15)12:31:546.21122713:15
(16)12:39:426.21.2122813:20
(17)12:58:492.30.9132913:30
Table A2. Example 2 (A2).
Table A2. Example 2 (A2).
The Number of Rolling SchedulesOrder TimeRestaurant Preparation Time Is Subject to Distribution ExpectationsRestaurant Preparation Time Follows the Standard Deviation of the DistributionRestaurant Node NumberCustomer Node NumberThe Latest Delivery Time
(1)11:04:0820.911311:35
(2)11:20:0830.221411:50
(3)11:25:5851.731512:00
(4)11:28:243.80.841612:00
(5)11:35:1920.951712:10
(6)11:46:473251812:15
(7)11:50:0750.461912:25
(8)11:58:495.21.572012:30
(9)12:20:322.4182112:55
(10)12:31:2531.292213:00
(11)12:39:3620.992313:20
(12)12:40:2041.792413:25
(13)12:47:145.51102513:30
(14)12:51:2821112613:30
(15)12:51:5461122613:35
Table A3. Example 3 (A3).
Table A3. Example 3 (A3).
The Number of Rolling SchedulesOrder TimeRestaurant Preparation Time Is Subject to Distribution ExpectationsRestaurant Preparation Time Follows the Standard Deviation of the DistributionRestaurant Node NumberCustomer Node NumberThe Latest Delivery Time
(1)11:10:426.21.211311:45
(2)11:20:492.30.921311:50
(3)11:25:0821.731411:55
(4)11:28:134.41.641512:00
(5)11:31:254.60.451612:00
(6)11:40:5920.961712:15
(7)11:55:243171812:30
(8)12:10:192.10.481912:40
(9)12:26:173.70.282012:55
(10)12:29:243292113:00
(11)12:32:0641102113:00
(12)12:45:045.52.4112213:25
(13)12:50:435.51.9122313:30
Table A4. Example 4 (A4).
Table A4. Example 4 (A4).
The Number of Rolling SchedulesOrder TimeRestaurant Preparation Time Is Subject to Distribution ExpectationsRestaurant Preparation Time Follows the Standard Deviation of the DistributionRestaurant Node NumberCustomer Node NumberThe Latest Delivery Time
(1)11:00:3651.511211:30
(2)11:05:084.60.421311:35
(3)11:17:472.4131411:50
(4)11:29:244141511:55
(5)11:39:2020.641612:00
(6)11:48:473.4251712:15
(7)11:53:075.52.461812:25
(8)12:19:2451.571912:40
(9)12:27:324.60.481912:55
(10)12:31:072192013:00
(11)12:44:3952.4102113:20
(12)12:54:2020.8112213:30

References

  1. Analysis of the Size and Development Trend of China’s Online Takeout Market in 2021. Available online: https://bg.qianzhan.com/report/detail/300/211112-19960858.html (accessed on 13 November 2021).
  2. Giuffrida, N.; Fajardo-Calderin, J.; Masegosa, A.D.; Werner, F.; Steudter, M.; Pilla, F. Optimization and machine learning applied to last-mile logistics: A Review. Sustain. 2022, 14, 5329. [Google Scholar] [CrossRef]
  3. Liu, L.; Qu, T.; Thurer, M.; Ma, L.; Zhang, Z.; Yuan, M. A new knowledge-guided multi-objective optimisation for the multi-AGV dispatching problem in dynamic production environments. Int. J. Prod. Res. 2022, 61, 6030–6051. [Google Scholar] [CrossRef]
  4. Practical Operational Optimization of Meituan’s Intelligent Delivery System. Available online: https://tech.meituan.com/2020/02/20/meituan-delivery-operations-research.html (accessed on 12 August 2024).
  5. Liu, Y.; Guo, B.; Chen, C.; Du, H.; Yu, Z.; Zhang, D.; Ma, H. FooDNet: Toward an Optimized Food Delivery Network based on Spatial Crowdsourcing. IEEE Trans. Mob. Comput. 2019, 18, 1288–1301. [Google Scholar] [CrossRef]
  6. Xue, G.; Wang, Z.; Wang, G. Optimization of Rider Scheduling for a Food Delivery Service in O2O Business. J. Adv. Transp. 2021, 2021, 5515909. [Google Scholar] [CrossRef]
  7. Reyes, D.; Erera, A.; Savelsbergh, M.; Sahasrabudhe, S.; O’Neil, R. The Meal Delivery Routing Problem. Optim. Online 2018, 6571, 2018. [Google Scholar]
  8. Yildiz, B.; Savelsbergh, M. Provably High-Quality Solutions for the Meal Delivery Routing Problem. Transp. Sci. 2019, 53, 1372–1388. [Google Scholar] [CrossRef]
  9. Du, J.; Guo, B.; Liu, Y.; Wang, L.; Han, Q.; Chen, C.; Yu, Z. CrowDNet: Enabling a Crowdsourced Object Delivery Network Based on Modern Portfolio Theory. Internet Things J. IEEE 2019, 6, 9030–9041. [Google Scholar] [CrossRef]
  10. Steever, Z.; Karwan, M.; Murray, C. Dynamic courier routing for a food delivery service. Comput. Oper. Res. 2019, 107, 173–188. [Google Scholar] [CrossRef]
  11. Sun, G.; Tian, Z.; Liu, R.; Jing, Y.; Ma, Y. Research on Coordination and Optimization of Order Allocation and Delivery Route Planning in Take-Out System. Math. Probl. Eng. 2020, 2020, 7248492. [Google Scholar] [CrossRef]
  12. Tu, W.; Zhao, T.; Zhou, B.; Jiang, J.; Xia, J.; Li, Q. OCD: Online Crowdsourced Delivery for On-Demand Food. IEEE Internet Things J. 2019, 7, 6842–6854. [Google Scholar] [CrossRef]
  13. Liao, W.; Zhang, L.; Wei, Z. Multi-objective green meal delivery routing problem based on a two-stage solution strategy. J. Clean. Prod. 2020, 258, 120627. [Google Scholar] [CrossRef]
  14. Yu, H.; Luo, X.; Wu, T. Online pickup and delivery problem with constrained capacity to minimize latency. J. Comb. Optim. 2020, 43, 1–20. [Google Scholar] [CrossRef]
  15. Abramov, I.V.; Abramov, A.I.; Nikitin, Y.R.; Sosnovich, E.; Božek, P.; Stollmann, V. Diagnostics of electrical drives. In Proceedings of the 2015 International Conference on Electrical Drives and Power Electronics (EDPE), The High Tatras, Slovakia, 21–23 September 2015; pp. 364–367. [Google Scholar]
  16. Tapak, P.; Kocur, M.; Rabek, M.; Matej, J. Periodical vehicle inspections with smart technology. Appl. Sci. 2023, 13, 7241. [Google Scholar] [CrossRef]
  17. Ulmer, M.W.; Thomas, B.W.; Campbell, A.M.; Woyak, N. The Restaurant Meal Delivery Problem: Dynamic Pickup and Delivery with Deadlines and Random Ready Times. Transp. Sci. 2021, 55, 75–100. [Google Scholar] [CrossRef]
  18. Xin, J.; Wei, L.; Wang, D.; Xuan, H. Receding horizon path planning of automated guided vehicles using a time-space network model. Optim. Control. Appl. Methods 2020, 41, 1889–1903. [Google Scholar] [CrossRef]
  19. Cheng, W.; Meng, W. An efficient genetic algorithm for multi AGV scheduling problem about intelligent warehouse. Robot. Intell. Autom. 2023, 43, 382–393. [Google Scholar] [CrossRef]
  20. Gao, Y.; Chen, C.H.; Chang, D. A Machine Learning-Based Approach for Multi-AGV Dispatching at Automated Container Terminals. J. Mar. Sci. Eng. 2023, 11, 1407. [Google Scholar] [CrossRef]
  21. Qiu, L.; Wang, J.; Chen, W.; Wang, H. Heterogeneous AGV routing problem considering energy consumption. In Proceedings of the IEEE International Conference on Robotics and Biomimetics, Zhuhai, China, 6–9 December 2015; pp. 1894–1899. [Google Scholar]
  22. Chen, J.; Wu, Y.; Huang, S.; Wang, P. Multi-objective optimization for AGV energy efficient scheduling problem with customer satisfaction. AIMS Math. 2023, 8, 20097–20124. [Google Scholar] [CrossRef]
  23. Xin, J.; Wei, L.; D’Ariano, A.; Zhang, F.; Negenborn, R. Flexible time-space network formulation and hybrid metaheuristic for conflict-free and energy-efficient path planning of automated guided vehicles. J. Clean. Prod. 2023, 398, 136472. [Google Scholar] [CrossRef]
  24. Zou, W.Q.; Pan, Q.K.; Meng, T.; Gao, L.; Wang, Y.L. An effective discrete artificial bee colony algorithm for multi-AGVs dispatching problem in a matrix manufacturing workshop. Expert Syst. Appl. 2020, 161, 113675. [Google Scholar] [CrossRef]
  25. Li, G.; Zeng, B.; Liao, W.; Li, X.; Gao, L. A new AGV scheduling algorithm based on harmony search for material transfer in a real-world manufacturing system. Adv. Mech. Eng. 2018, 10, 1687814018765560. [Google Scholar] [CrossRef]
  26. Wang, Y.J.; Liu, X.Q.; Leng, J.Y.; Wang, J.J.; Meng, Q.N.; Zhou, M.J. Study on scheduling and path planning problems of multi-AGVs based on a heuristic algorithm in intelligent manufacturing workshop. Adv. Prod. Eng. Manag. 2022, 17, 505–513. [Google Scholar] [CrossRef]
  27. Zheng, J.; Wang, S.; Wang, L.; Chen, J.F.; Wang, L.; Hao, J.; He, R.; Sun, Z. A two-stage algorithm for fuzzy online order dispatching problem. In Proceedings of the 2020 IEEE Congress on Evolutionary Computation (CEC), Glasgow, UK, 19–24 July 2020; pp. 1–8. [Google Scholar]
  28. Zheng, J.; Wang, L.; Wang, S.; Chen, J.F.; Wang, X.; Duan, H.; Liang, Y.; Ding, X. Modeling stochastic service time for complex on-demand food delivery. Complex Intell. Syst. 2022, 8, 4939–4953. [Google Scholar] [CrossRef]
  29. Advanced Mathematics; Higher Education Press: Beijing, China, 2014.
  30. Jiang, S.Y.; Li, X.; Zheng, Q.; Wang, L.X. Approximate equal frequency discretization method. In Proceedings of the 2009 WRI global Congress on Intelligent Systems, Xiamen, China, 19–21 May 2009; Volume 3, pp. 514–518. [Google Scholar]
  31. Deb, K.; Jain, H. An evolutionary many-objective optimization algorithm using reference-point-based nondominated sorting approach, part I: Solving problems with box constraints. IEEE Trans. Evol. Comput. 2013, 18, 577–601. [Google Scholar] [CrossRef]
  32. Jamil, M.A.; Nour, M.K.; Alotaibi, S.S.; Hussain, M.J.; Hussaini, S.M.; Naseer, A. Software Product Line Maintenance Using Multi-Objective Optimization Techniques. Appl. Sci. 2023, 13, 9010. [Google Scholar] [CrossRef]
  33. Zhao, X.; Zhu, Y.; Mei, Z. A Many-Objective Joint Parallel Simulation Method for Acoustic Optimization Design of Sound-Absorbing Structures. Appl. Sci. 2022, 12, 6892. [Google Scholar] [CrossRef]
  34. Das, I.; Dennis, J.E. Normal-Boundary Intersection: A New Method for Generating the Pareto Surface in Nonlinear Multicriteria Optimization Problems. SIAM J. Optim. 1998, 8, 631–657. [Google Scholar] [CrossRef]
  35. Sato, M.; Oyama, A. Comparative Study of Crossovers for Decision Space Diversity of Non-Dominated Solutions. In Proceedings of the 2021 IEEE Symposium Series on Computational Intelligence (SSCI), Orlando, FL, USA, 5–7 December 2021; pp. 1–8. [Google Scholar]
  36. Park, H.; Son, D.; Koo, B.; Jeong, B. Waiting strategy for the vehicle routing problem with simultaneous pickup and delivery using genetic algorithm. Expert Syst. Appl. 2021, 165, 113959. [Google Scholar] [CrossRef]
  37. Zhang, H.; Wang, G.G. Improved NSGA-III using transfer learning and centroid distance for dynamic multi-objective optimization. Complex Intell. Syst. 2021, 9, 1143–1164. [Google Scholar] [CrossRef]
  38. Gu, Z.M.; Wang, G.G. Improving NSGA-III algorithms with information feedback models for large-scale many-objective optimization. Future Gener. Comput. Syst. 2020, 107, 49–69. [Google Scholar] [CrossRef]
  39. Sun, Y.; Yen, G.G.; Yi, Z. IGD Indicator-Based Evolutionary Algorithm for Many-Objective Optimization Problems. IEEE Trans. Evol. Comput. 2019, 23, 173–187. [Google Scholar] [CrossRef]
Figure 1. The order dispatching process.
Figure 1. The order dispatching process.
Applsci 14 08101 g001
Figure 2. Rolling pick-up and delivery path sequence and routing combination.
Figure 2. Rolling pick-up and delivery path sequence and routing combination.
Applsci 14 08101 g002
Figure 3. Comparison of expected delivery time and actual delivery time.
Figure 3. Comparison of expected delivery time and actual delivery time.
Applsci 14 08101 g003
Figure 4. Rolling pick-up and delivery path considering routing combination.
Figure 4. Rolling pick-up and delivery path considering routing combination.
Applsci 14 08101 g004
Figure 5. Changes in the number of orders and nodes to be delivered.
Figure 5. Changes in the number of orders and nodes to be delivered.
Applsci 14 08101 g005
Figure 6. Algorithm flow chart.
Figure 6. Algorithm flow chart.
Applsci 14 08101 g006
Figure 7. Decoding structure and insertion restriction rules.
Figure 7. Decoding structure and insertion restriction rules.
Applsci 14 08101 g007
Figure 8. Order-based crossover, OBX.
Figure 8. Order-based crossover, OBX.
Applsci 14 08101 g008
Figure 9. Experimental results. (a) Total customer waiting time; (b) Order timeout rate.
Figure 9. Experimental results. (a) Total customer waiting time; (b) Order timeout rate.
Applsci 14 08101 g009
Figure 10. Experimental result (total customer waiting time).
Figure 10. Experimental result (total customer waiting time).
Applsci 14 08101 g010
Figure 11. Change in the number of remaining unfinished orders.
Figure 11. Change in the number of remaining unfinished orders.
Applsci 14 08101 g011
Figure 12. Superiority comparison of order combination mode.
Figure 12. Superiority comparison of order combination mode.
Applsci 14 08101 g012
Figure 13. Superiority comparison of look-forward process.
Figure 13. Superiority comparison of look-forward process.
Applsci 14 08101 g013
Table 1. Related literature.
Table 1. Related literature.
Model TypeAuthorModelScheduling FrameworkUncertain TimesAction ConsolidationLook-ForwardMulti-Objective/Single-Objective
Xin et al.
(2020) [18]
MVPDOne-time decisionN S
Cheng et al.
(2023) [19]
MAGV-SPOne-time decisionN S*
Gao et al.
(2023) [20]
MADPOne-time decisionN S*
Qiu et al.
(2015) [21]
EHARPOne-time decisionN S*
Chen et al.
(2023) [22]
AGVEESCOne-time decisionN M
Xin et al.
(2022) [23]
ARPOne-time decisionN
Zou et al.
(2020) [24]
AGVDPOne-time decisionN S*
Li et al.
(2018) [25]
AGVSPOne-time decisionN S*
Wang et al.
(2022) [26]
MARPOne-time decisionN S
Liu et al.
(2023) [3]
AGVDPOne-time decisionY M
Xue et al.
(2021) [6]
The pick-up
and delivery problem
One-time decisionN S
Ulmer et al.
(2021) [17]
DPDP, SDDPOne-time decisionYcfS
Reyes et al.
(2018) [7]
MDRPRolling decisionNc S
Yildiz et al.
(2019) [8]
DVRPsRolling decisionNc S
Zheng et al. (2020) [27]PDPOne-time decisionY S*
Liao et al.
(2020) [13]
GMDRPRolling decisionNc M
Zheng et al. (2022) [28]On-demand food deliveryOne-time decisionY S*
Non-VRPLiu et al.
(2018) [5]
Spatial crowdsourcingOne-time decisionN S; S*
Du et al.
(2019) [9]
Spatial crowdsourcingRolling decisionN M
Zachary (2019) [10]VFCDPRolling decision *N fS
Sun et al.
(2020) [11]
TRDRRolling decisionN S
Tu et al.
(2019) [12]
Spatial crowdsourcingRolling decisionN S*
Yu et al.
(2022) [14]
DRVRPRolling decisionN M
The title Rolling decision *YCFM
Table 2. Sets, parameters, and variables.
Table 2. Sets, parameters, and variables.
SetsDescription
N 1 the current set of orders that have been picked up but not delivered
N 2 the current order set that has not been picked up
N set of orders involved in the current rolling period, N = N 1 N 2 , n N
I set of restaurant nodes, I = 1 , 2 , , | I |
I v i r t u a l set of virtual restaurant nodes, I v i r t u a l = { | I | + 1 , | I | + 2 , , | N 2 | }
J set of customer nodes, J = { | N 2 | + 1 , | N 2 | + 2 , , | N 2 | + J }
J v i r t u a l set of virtual customer nodes, J v i r t u a l = { | N 2 | + | J | + 1 , | N 2 | + | J | + 2 , , | N 1 | + 2 | N 2 | }
A set of all nodes, A = I J
I set of restaurant nodes expansion, I = I I v i r t u a l
J set of customer nodes expansion, J = J J v i r t u a l
A set of all nodes’ expansion, A = I J
m a p i 1 virtual (actual) restaurant node i mapping of the actual restaurant node, i I
m a p j 2 virtual (actual) customer node j mapping of the actual customer node, j J
α n the restaurant node number corresponding to the order n ; if the restaurant node number corresponding to the order n is i , then there is i = α n , and there is an inverse mapping n = arg α ( i ) , i I , n N
β n the customer node number corresponding to the order n ; if the customer node number corresponding to the order n is j , then there is j = β n , and there is an inverse mapping n = arg β ( i ) , j J , n N
Parameters
V maximum capacity of delivery vehicle
t i j d travel time from node i to node j , i I , j J
s 0 number of the initial node
T 0 the ready time of the AGV
r n order time for order n , n N
b n the latest delivery time for order n , n N
v n the volume occupied by commodities for order n , n N
t n p meal preparation time for order n ; it follows the Gaussian distribution, t n p ~ ( μ i , σ i 2 ) , n N
t n s customer service time for order n , n N
Decision variable
x i j if the deliverer travels from node i to node j , the decision variable is equal to 1, otherwise x i j is equal to 0, i A s 0 , j A
Other variables
s l the number of the l th node to be visited; if the sequence of the node i in the path is l , i = s l , and there is an inverse mapping l = arg s ( i ) , l { 1 , 2 , , | A | } , i A
p l 1 if the l th node and the next node merge to pick up food, p l 1 is equal to 1, otherwise p l 1 is equal to 0, l { 1 , , | A | 1 }
p l 2 if the l th node and the next node merge to deliver food, p l 2 is equal to 1, otherwise p l 2 is equal to 0, l { 1 , , | A | 1 }
a l time to arrive s l , l { 1 , 2 , , | A | }
d l takeout volume carried on arrival at s l , l { 1 , 2 , , | A | }
λ l the number of unfinished orders before visit s l , l { 1 , 2 , , | A | }
g n the completion time of order n , n N
ω the timeout rate of the order
Table 3. Values of some parameters in the numerical experiment.
Table 3. Values of some parameters in the numerical experiment.
ParametersValue
V 68.45 dm3
T 0 11:00
t n s 5 min
N 10
P c 0.8
P m 0.06
Table 4. Comparison of advantages of scheduling method.
Table 4. Comparison of advantages of scheduling method.
Order Timeout Rate (%)Total Customer Waiting Time (min)
OptimumWorstMeanOptimumWorstMean
Spatial Crowdsourcing;
Heuristic Algorithm (A)
0.691.210.9821.3830.6427.63
DRVRP;
WR+WI (B)
0.540.940.8720.1328.8025.38
VFCDP;
Heuristic Algorithm (C)
0.510.990.8218.7527.6721.47
MO-RPDRP;
AC-NSGA-III (D)
0.370.780.5914.6122.9621.47
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Shao, Q.; Miao, J.; Liao, P.; Liu, T. Dynamic Scheduling Optimization of Automatic Guide Vehicle for Terminal Delivery under Uncertain Conditions. Appl. Sci. 2024, 14, 8101. https://doi.org/10.3390/app14188101

AMA Style

Shao Q, Miao J, Liao P, Liu T. Dynamic Scheduling Optimization of Automatic Guide Vehicle for Terminal Delivery under Uncertain Conditions. Applied Sciences. 2024; 14(18):8101. https://doi.org/10.3390/app14188101

Chicago/Turabian Style

Shao, Qianqian, Jiawei Miao, Penghui Liao, and Tao Liu. 2024. "Dynamic Scheduling Optimization of Automatic Guide Vehicle for Terminal Delivery under Uncertain Conditions" Applied Sciences 14, no. 18: 8101. https://doi.org/10.3390/app14188101

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop