1. Introduction
Unmanned ground vehicles (UGVs) are vehicles that can drive and perform certain tasks on a variety of roads or in the field, but with no operator inside the vehicle. The rapid development of advanced mechatronic, computing, and communication technologies in the last two decades enables a new robotic system that can interact with both humans and other UGVs in a cooperative manner. This technology is called multiple unmanned vehicles (MUVs) [
1]. Compared with single vehicles, MUVs have many advantages, such as faster completion of multiple tasks, stronger system fault tolerance, better overall flexibility, and so on. It is due to these advantages that MUVs are increasingly used in the military and social field to perform reconnaissance and surveillance, alert patrol, positioning guidance, high-risk operations, special operations, and material transportation, showing a broad application prospect in the military field [
2].
For all practical purposes, MUVs are usually a collection of large numbers of vehicles. These vehicles are required to form predetermined formations to travel to a location to perform support or combat missions. For ease of management and subsequent formation control, each vehicle has its own group and number in the formation and needs to form a defined formation as required prior to the mission. Therefore, the formation and coordination control of MUVs has become a hot issue at present, because a group of robots can accomplish the mission more effectively by maintaining a predefined formation.
The formation control problems can be mainly summarized as: (1) formation shape generation; (2) formation tracking; (3) formation reconfiguration and selection; (4) task assignment in formation [
3].
Among the above problems, the most widely studied is formation shape generation and tracking. Formation shape generation refers to a certain design method to guide multiple unmanned vehicles to certain designated target locations and form a predefined formation from here; while formation tracking is to design a formation controller to maintain the predefined formation shape stably after the formation is completed. In this regard, various formation control strategies have been proposed. The most common approaches are virtual structure, the behavior-based approach, leader–follower approach, graph-based approach, and artificial potential approach.
In L Yang, Y Jia [
4], a distributed D-type iterative learning scheme is developed for multi-agent systems with unknown nonlinear dynamics. This presents an efficient solution to the multi-agent formation control problem. The switching time and order of this learning scheme vary according to the actual trajectory of the agents to ensure that a predefined formation is always formed after some iterations.
In Giroung and Dongkyoung [
5], a decentralized behavior-based formation control algorithm is designed to accomplish the formation task using only the relative position information between robots and obstacles and between neighboring robots.
Sida et al. [
6] present a hybrid formation control of wheeled mobile robots (WMRs), which is based on model predictive control (MPC) and adaptive terminal sliding mode control (ATSMC). The MPC is used to ensure the stability of the formation, while the ATSMC is used to compensate for external interference. Sarrafan and Shojaei [
7] propose a formation tracking control for non-autonomous wheeled mobile robots (WMRs) based on a leader–follower strategy using an inverse dynamics control technique that combines a high-gain observer and a radial basis function (RBF) network to achieve the desired formation.
Dongyu et al. [
8] propose a hierarchically distributed finite-time estimator to allow agents in each layer to obtain their target positions and velocities based on the information of the agents in their previous layer. A model-based control method is then utilized in order to achieve multi-layer formations.
In Liu et al. [
9], a full potential field and a local potential field are combined to design a formation potential field for multi-agent formation control. The goal is to control a set of agents to automatically generate and maintain a specific formation while avoiding internal collisions and collisions with spatial constraints.
These traditional and commonly used methods all have some capability of formation shape generation that actually comes from the formation tracking control in
Figure 1. When performing formation, it is only necessary to give the geometric configuration of the desired formation, and by having the vehicles track the nodes of each geometric configuration, the desired formation target is eventually formed. When the number of vehicles in the MUVs is small, these methods are fast and effective and facilitate further formation tracking control. When each vehicle is not required to have a fixed position within the formation, the proximity principle can also be used to make the target point as close as possible to the initial position of each vehicle while allowing the vehicle to track the nearest node, which can effectively reduce collisions between vehicles and speed up the formation assembly.
When the number of vehicles in MUVs increases, the density of vehicles in the spatial plane increases, and the distribution area of vehicles increases, which makes it possible for the vehicles to intersect in the process of tracking the expected formation geometry nodes using the traditional UGV formation control method, and therefore it is very easy to lead to the occurrence of collision among vehicles, and the more the number of vehicles, the greater the risk of collision. At this point, the formation shape generation of MUVs is transformed from a target point tracking problem to an obstacle avoidance problem in the process of moving to the target point, i.e., the path planning problem of vehicles. Since the traditional formation control algorithm cannot effectively solve the obstacle avoidance problem, the unmanned vehicle path planning algorithm, which has better results in the obstacle avoidance problem, is considered to complete the formation shape generation of MUVs. Collision avoidance algorithms can be categorized into the following major methods: (1) geometric methods, (2) force-field methods, and (3) optimization-based methods [
10,
11]. Among these methods, optimization-based methods include path planning algorithms. There are many unmanned vehicle path planning algorithms, such as the multi-objective genetic algorithm proposed by Luis et al. [
12], the EBS-A* algorithm proposed by Wang et al. [
13], the ant colony algorithm improved by M Miao C et al. [
14], and so on. Among these algorithms, the rapidly exploring random tree (RRT) has been widely used in the field of UGV path planning because of the advantages of rapid path search and significant collision avoidance, but there are also two problems brought about by the algorithm itself that need improvement.
Non-optimal path. RRT returns the path result as soon as the target point is reached, without considering whether the path is the optimal solution;
Strong randomness. The nature of random exploring in space makes the path results obtained from repeated runs inconsistent.
To improve the adverse effects of the RRT, researchers have explored a number of different optimization approaches. Jingu et al. [
15] proposed a method based on triangular inequality rewiring for the RRT-connect robot path planning algorithm to both guarantee the planning time and make its path results closer to optimal. Jung J W [
16] proposed a post-processing method called bidirectional interpolation to improve the RRT-connect created paths. Feng et al. [
17] proposed an RRT path planning algorithm based on the A* bootstrap domain, which effectively combines A* with RRT and uses A* to improve the sampling efficiency of RRT. J Wang et al. [
18] proposed a convolutional neural network (CNN)-based optimal path planning algorithm, the neural RRT* (NRRT*), using the non-uniform sampling distribution generated from the CNN model. It can be seen that RRT and its improved algorithms are usually used in existing studies for path planning of a single vehicle. Additionally, there is little mention of the research on multiple vehicle path planning using RRT.
RRT has its unique advantages, but there are also some discordant and unavoidable disadvantages. RRT is suitable for path planning of a single vehicle under a known map where the obstacles are stationary. That is, RRT cannot handle dynamic obstacle avoidance in the case of multiple vehicles moving simultaneously. In addition, when the number of vehicles increases, more vehicles are seen as obstacles during the movement. The surrounding environment becomes more complex, and the path results obtained from multiple runs of the algorithm are more likely to expose the problem of non-optimal routes and excessive randomness of RRT. Therefore, in response to the various problems that occur in the formation shape generation of a large number of UGVs, this paper proposes a method for the formation of a large number of UGVs based on dual RRT optimization. By improving the traditional RRT algorithm, the algorithm path result is shortened and the problem of strong randomness of RRT is reduced. In this way, the path planning of obstacle avoidance is performed for a large number of UGVs, so that they can be arranged into the expected formation according to their respective routes and avoid collisions between vehicles.
2. Methods
In order to solve the problem of collision avoidance during formation shape generation for a large number of MUVs, and reduce the adverse effects of RRT itself defects on the path planning results, this paper proposes dual RRT optimization, applicable to a large number of UGVs in formation shape generation. The algorithm as a whole can be divided into two subparts:
Dual RRT optimization: firstly, the RRT-connect is used for global planning, and then the RRT* is used to locally optimize the unsmooth part of the initial path after obtaining the initial path, so as to achieve a better path result by ensuring the efficiency of the algorithm while shortening the path as much as possible;
Introduction of the evaluation function: dual RRT optimization is run several times simultaneously at the starting point of the path to obtaining multiple path results, and then the path evaluation function is set to select the best path as the output of the algorithm.
Firstly, we introduce the advantages and disadvantages of RRT and the commonly used RRT improvement algorithms.
2.1. RRT
As a common path search algorithm, RRT takes the starting point of the path as the root node and the target node as the leaf node and expands based on the starting point to grow in all directions, generating a search tree with branches growing randomly. When a route linking the starting point and the target point similar to the red line in
Figure 2 is obtained during the search, i.e., a solution is found in the space, at which point the expansion of each point stops. In the process of search, once a new expansion node encounters an obstacle, the node will no longer expand and select the previous node of the current node to continue to expand, which ensures that the final search result obtained will not collide with the obstacle.
Since RRT has the characteristic of probabilistic completeness, the algorithm can definitely find the path when the solution exists in that space, that is, when there exists the path to avoid obstacles required for path planning. Moreover, RRT is highly adaptable and still has excellent obstacle avoidance in the face of different environments. However, RRT lacks an optimization mechanism and only pursues the result of “arrival”, so the obtained path is not optimal and the path is often not smooth. In addition, when the number of vehicles increases, the run number of the algorithm increases accordingly due to the strong randomness in the expansion process of the algorithm itself. Even if the algorithm is run repeatedly for the same start and end points, it is difficult to get the same path results, and the difference in results is usually large. Those are the two drawbacks that RRT needs to address.
To address the above disadvantages of RRT, the following RRT improvement algorithms are commonly used.
Extend RRT, which introduces a collection of path points to speed up the convergence and improve the stability of the paths, like in [
19].
RRT-connect, which generates two trees from the initial point and the target point, and when the two trees are connected together, the algorithm converges to generate the path, and this improved type increases the efficiency of the algorithm, like in [
20,
21].
RRT*, which improves the parent node selection on the original RRT, like in [
22,
23]. The RRT* will have a reconnection (rewire) process when selecting the parent node, using a cost function to select the node with the smallest cost in the domain of the expanded node as the parent node, and at the same time, the nodes on the existing tree are reconnected after each iteration, thus ensuring computational complexity and asymptotically optimal solutions.
These improved algorithms have their unique advantages and also reduce the impact of the two main drawbacks of RRT to some extent, but do not eliminate them. For example, the RRT-connect algorithm has a very fast algorithm running efficiency and can obtain the planning path of multiple vehicles in a very short time, but this algorithm still has the problems of non-optimal solutions and excessive randomness, and even the pursuit of efficiency makes the defects more obvious; while the RRT* algorithm can obtain more excellent path results, compared with other algorithms, the results tend to be more asymptotically optimal, at the cost of longer running time, while still not guaranteeing the randomness problem. For a large number of fleets, the complex operating environment caused by the increase in the number of vehicles, the increased chance of collisions between vehicles, and the multiple runs of the algorithm further amplify the shortcomings of the RRT algorithm, which seriously affects the path planning results of the vehicles and thus has a negative effect on the overall queueing arrangement of the fleet. Therefore, this paper addresses these two drawbacks separately to obtain an improved RRT algorithm suitable for the formation of a large number of fleets.
2.2. Dual RRT Optimization
For the path planning task, the fundamental goal is to obtain a path that avoids obstacles and allows the vehicle to reach the target point safely, while the length of this path should be as short as possible, i.e., the optimal solution of the path. RRT is widely used in the field of UGV path planning because of its excellent obstacle avoidance ability. While ensuring the obstacle avoidance ability, how to get the optimal solution of the path needs to be considered. Among the commonly used RRT improvement algorithms, the path result of RRT* is asymptotically optimal, but the running time is also significantly longer than other algorithms. On the contrary, RRT-connect has excellent running efficiency, and the path results are often unsatisfactory. Therefore, in this paper, we combine the advantages of each of the two algorithms and propose dual RRT optimization to ensure the efficiency of the algorithm while shortening the path as much as possible to get better path results.
The dual RRT optimization algorithm divides the path planning process into two phases: (1) global path planning for the RRT-connect algorithm, and (2) local path optimization for the RRT* algorithm.
The vehicle path planning is generally expected to reach the target point with the shortest possible route, and RRT-connect often leads to redundant paths in the path results for efficiency, which are generally manifested by large or frequent corners, and these corners also increase the vehicle energy consumption and wear on mechanical parts. First of all, define as the path corner threshold, when the corner amplitude of a section of the path is larger than , it is considered that the section of the path has a large corner.
Since RRT itself is a tree-like structure, the final path result is obtained as nodes connected to nodes. Let a node in the path result be
, then
denotes the path between node
to node
. After giving the starting point
and the target point
, the initial path result
obtained using RRT-connect can be expressed as:
Because of the characteristics of the RRT-connect,
is not smooth at this time, and there are a number of nodes with large corner turns, as shown in
Figure 3a. Define the corner judgment function
, and use
to make corner judgment for each node of
, then the corner judgment result of the
-th node
on
is:
in which
and
are the forward direction angles of node
and node
, respectively. When the corner judgment result
of a node is equal to 0, it is considered that a large corner occurs at this node
and the path near it needs to be locally optimized. Then,
. nodes are taken forward to node
lk−a
and
nodes are taken backward to node
for node
. The path
between the two nodes is the path to be locally optimized, as shown in
Figure 3b. Take node
and node
as the starting point and end point of the locally optimized path, respectively, and use the RRT* algorithm to generate a new path, i.e.,
Due to the asymptotically optimal nature of the RRT*, the new path
generated is flatter than the original path, so the original path
is replaced by the path
to obtain the new path
, as shown in
Figure 3c.
Continue to make corner judgments on the remaining nodes of
. If large corners are encountered again, continue to optimize them, and finally obtain a more excellent path result
compared to the initial path
. The algorithm framework of dual RRT optimization is described in Algorithm 1.
Algorithm 1: Dual RRT Optimization |
1 | Input: , |
2 | Output: |
3 | ← |
4 | for I = 1→m |
5 | if =0 |
6 | ←- |
7 | ← |
8 |
else |
9 | = 0 |
10 |
end |
11 | ←+ |
12 | end |
13 | ← |
14 | return |
2.3. Dual RRT Optimization with Evaluation Function
The dual RRT optimization proposed in the previous subsection combines two commonly used RRT improvement algorithms, which enables the new algorithm to achieve faster running efficiency as well as path results that tend to be asymptotically optimal. This solves the first defect of the RRT algorithm effectively. However, the dual RRT optimization still suffers from the strong randomness of the path results when the number of vehicles increases and the algorithm needs to be run repeatedly several times. This requires further improvement of the algorithm.
In order to improve the strong randomness of the algorithm, this paper introduces a path result evaluation function to further improve the stability of the dual RRT optimization. As
Figure 4 shows, that the dual RRT optimization algorithm is run multiple times at the starting point to obtain multiple path results, and the path evaluation function is set to select the best path as the output of the algorithm. The evaluation function needs to consider the path length, the number of corners, and the corner amplitudes, so that the final path has a shorter length and fewer and smaller turns than other paths as much as possible. Assuming that
expected paths are repeatedly generated by the dual RRT optimization, where the
-th expected path
contains
nodes, the evaluation function of the path is as follows:
In Equation (5),
is the evaluation index of the
-th expected path;
is the total path length of the
-th expected path;
is the angle between the forward direction of the current
. -th node of the
-th expected path and the forward direction of the previous node;
is the corner threshold when
is greater than
, it is considered that a corner will occur when the vehicle travels along this expected path;
denotes the number of corners;
,
, and
indicate the weights of each index, respectively. By comparing the path evaluation index of each expected path, the path with the smallest evaluation index is selected as the final output of the algorithm. The specific algorithm framework is shown in Algorithm 2.
Algorithm 2: Evaluation Function |
1 | Input: , |
2 | Output: |
4 | while i = 1 < n |
5 | ←Dual RRT Optimization (, ) |
6 | ← |
7 | end |
8 | = |
9 | ← |
10 | return |
Based on the dual RRT optimization, the algorithm further shortens the path length and smoothes the curve by introducing the path evaluation function. Repeatedly running the algorithm many times can still get approximate results, effectively reducing the impact of strong randomness.
2.4. MUV Formation Shape Generation Based on Dual RRT Optimization
For a large number of MUVs with randomly distributed initial positions in a finite spatial plane, it is very easy to cause collisions among vehicles when the traditional formation control method is used to control the vehicles to form the expected formation at a specified location, and the increase in the number of vehicles makes the surrounding environment of each vehicle become very complex during the driving process. As a common path regularization algorithm, RRT has the problems of non-optimal routes and excessive randomness, which also makes the path results obtained using RRT full of more uncertainties when the number of vehicles increases.
To solve the above problems, this paper uses the dual RRT optimization algorithm for path planning for each vehicle within the MUVs, driving sequentially to the target point according to the vehicle markers. While ensuring collision avoidance among vehicles and completing the formation arrangement, the algorithm shortens and smoothes the path results, while improving the stability of the path results.
For vehicles on the move, the obstacles that need to be avoided are not only those originally existing on the map but also the rest of the vehicles within the MUVs, which makes the vehicle’s own environment more complex during the driving process. In order to ensure that a safe distance is maintained during the driving process to avoid collision, this paper adopts the way of expanding the radius to set the obstacle avoidance area of the vehicle, as shown in
Figure 5a, which is regarded as maintaining the safe distance when the vehicle is driving outside the area. In the whole experimental process, each vehicle and its obstacle avoidance area can be simplified as shown in
Figure 5b, the red circle center represents the unmanned vehicle, and the black dashed part represents the obstacle avoidance area formed by this unmanned vehicle based on the obstacle avoidance radius, where
indicates the vehicle length and
indicates the obstacle avoidance radius
When MUVs move towards an area on the map and form a formation, the density of vehicles in the area increases, and the more the number of vehicles, the more obvious this phenomenon is. In addition, RRT and its improved algorithms essentially obtain paths based on the known surroundings, and the RRT algorithms are unable to update the path results if the obstacles move during the vehicle’s progress. Therefore, RRT is suitable for path planning with static obstacle maps rather than dynamic path planning. Considering the above two factors, this paper chooses to plan the path for each vehicle in the MUVs in order of vehicle number after the target position is obtained, and while a vehicle is moving, the other vehicles remain stationary. This has the advantage of avoiding overly complex surroundings caused by many vehicles moving towards the target points at the same time, and making full use of the characteristics of RRT to reduce the possibility of collisions between vehicles. As more and more vehicles complete planning, there are fewer and fewer obstacles on the map, which makes the planning task easier for subsequent vehicles and improves the efficiency of the algorithm.
First, on the premise of ensuring good communication among vehicles, the position of each vehicle in the current coordinate system is obtained, which is the starting point of each vehicle. Then, the position point coordinates are generated in the target area according to the geometric configuration of the expected formation, and according to the label assigned to each vehicle in the MUVs, as the respective target point. Subsequently, based on the starting and target points, the path of the first vehicle is obtained using dual RRT optimization, and the vehicles will follow this path to reach the desired target location. Completing path regularization is considered that the vehicle has moved to the target point, and the vehicle that has completed planning no longer exists on the map for subsequent vehicle planning. The above process is repeated until all vehicles have completed the path regularization. At this point, all vehicles reach their respective target positions and form the MUVs’ desired formation shape.