Next Article in Journal
Design and Verification of Piano Playing Assisted Hand Exoskeleton Robot
Next Article in Special Issue
A Decomposition-Based Multi-Objective Flying Foxes Optimization Algorithm and Its Applications
Previous Article in Journal
A Systematic Review of Human Amnion Enhanced Cartilage Regeneration in Full-Thickness Cartilage Defects
Previous Article in Special Issue
An Improved Multi-Strategy Crayfish Optimization Algorithm for Solving Numerical Optimization Problems
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Multi-UAV Cooperative Coverage Search for Various Regions Based on Differential Evolution Algorithm

1
Xinjiang Institute of Engineering, College of Information Engineering, Urumqi 830091, China
2
Hubei SME Mathematical Intellectualization Innovation Development Research Center, Wuhan Business University, Wuhan 432000, China
3
College of Physics and Information Engineering, Minnan Normal University, Zhangzhou 363000, China
*
Author to whom correspondence should be addressed.
Biomimetics 2024, 9(7), 384; https://doi.org/10.3390/biomimetics9070384
Submission received: 16 April 2024 / Revised: 12 June 2024 / Accepted: 21 June 2024 / Published: 25 June 2024

Abstract

:
In recent years, remotely controlling an unmanned aerial vehicle (UAV) to perform coverage search missions has become increasingly popular due to the advantages of the UAV, such as small size, high maneuverability, and low cost. However, due to the distance limitations of the remote control and endurance of a UAV, a single UAV cannot effectively perform a search mission in various and complex regions. Thus, using a group of UAVs to deal with coverage search missions has become a research hotspot in the last decade. In this paper, a differential evolution (DE)-based multi-UAV cooperative coverage algorithm is proposed to deal with the coverage tasks in different regions. In the proposed algorithm, named DECSMU, the entire coverage process is divided into many coverage stages. Before each coverage stage, every UAV automatically plans its flight path based on DE. To obtain a promising flight trajectory for a UAV, a dynamic reward function is designed to evaluate the quality of the planned path in terms of the coverage rate and the energy consumption of the UAV. In each coverage stage, an information interaction between different UAVs is carried out through a communication network, and a distributed model predictive control is used to realize the collaborative coverage of multiple UAVs. The experimental results show that the strategy can achieve high coverage and a low energy consumption index under the constraints of collision avoidance. The favorable performance in DECSMU on different regions also demonstrate that it has outstanding stability and generality.

1. Introduction

Coverage path planning (CPP) tasks [1] are playing an increasingly important role in many real applications, such as surveillance, surveying, and rescue. Generally, the CPP tasks are mainly divided into two categories, i.e., (1) searching for a specific target from a target region and (2) covering a target region as much as possible with (or without) prior knowledge. In recent years, the use of unmanned aerial vehicles (UAVs) to address the CPP tasks has become very popular since the UAV is a flexible and lightweight aircraft that does not require a pilot. Furthermore, compared to a manned aircraft, a UAV has higher safety and fault tolerance when performing dangerous tasks. Therefore, recently, more scholars have been investigating the use of UAVs to solve CPP tasks [2,3,4].
When using multiple UAVs to address CPP tasks, the main goal is to plan efficient coverage paths for UAVs [4,5]. Each path should enable a UAV to avoid obstacles and maintain a safe distance from other UAVs. Meanwhile, each UAV’s trajectory should satisfy some dynamic constraints [4,6,7], such as the deflection angle, maximum load, hoverability, and so on.
In the last few decades, the zigzag pattern [8] and spiral pattern [9] have become popular methods for planning coverage paths for UAVs or robots in regular or convex regions. These methods have the advantages of fewer calculations, strong fusibility, and low energy consumption. In real-life scenarios, however, most coverage search tasks are non-convex, and some tasks are even discrete. Thus, in various complicated regions, the traditional zigzag pattern and spiral pattern are no longer feasible or effective. Although some area decomposition strategies [10,11] can enhance the performance of the zigzag pattern and spiral pattern on CPP tasks to a certain extent, they have shortcomings in universality. Moreover, the strategies are unable to deal with the damage to UAVs, which is a common problem in a multi-UAV system.
In real applications, when planning an efficient search (or coverage) path for a UAV, it is necessary to use some way to represent the state of the UAV and its environment. For example, a structure called configuration space (C-space) was introduced in [4], in which information about the position and direction of each unmanned ground vehicles existed in the form of a point in the C-space at each time point. In addition, physical obstacles can also be mapped into the C-space. Thus, based on the C-space, a complicated motion problem can be converted into a point-to-point motion planning problem. In contrast, Zhang [7] broke down the search problem into three sub-problems, i.e., information fusion, task assignment, and multi-UAV behavior decision-making. Accordingly, the cooperation process among the sub-problems was conducted in a dynamic environment in order to enhance the cooperative behavior of multiple UAVs.
Considering that evolutionary algorithms (EAs) have been shown to exhibit high reliability and an outstanding performance in many scientific research areas and engineering applications, many researchers are investigating the capacity of EAs to solve the UAV path planning problems [12]. For instance, Ref. [13] designed a breeder genetic algorithm for a terrain with known final destinations. In [14], a genetic algorithm (GA) was used to optimize the path planning results and to improve the search performance of a UAV. To overcome the shortcomings of traditional EAs, the authors of [15] proposed a multiple swarm fruit fly optimization algorithm, in which several populations evolve simultaneously and compete with each other. Based on the parallel evolution mechanism, the algorithm enhances the exploration ability of a UAV and significantly reduces the probability of the planning being classified as a local optimal solution.
Although the abovementioned studies manifest that utilizing a UAV to deal with a CPP task is a promising method, a single UAV can not fulfill a complicated CPP task efficiently due to the limited endurance capability of the UAV as a result of its limited battery life [16]. Thus, using multiple UAVs to solve the CPP tasks has become a research hotspot in recent years [2,4,17]. Fortunately, the popularity of the 5G network as well as the improvement in UAVs’ performance in recent years have made it possible to conduct research on CPP tasks based on the multi-UAV cooperation mechanism. The core idea of the mechanism is to realize the cooperation of search process through information interaction between multiple UAVs so as to realize the intelligent emergence of a large-scale unmanned system [18].
In the cooperative control of large-scale unmanned systems, a primary problem is how to control the emergence of swarm intelligence. Generally, the problem can be divided into three categories [19]: (1) what kind of group behavior will the whole system produce after making local rules for individuals?; (2) what are the individual local rules for a given group behavior?; and (3) how can group behavior be controlled based on the given individual local rules? Generally, the relationship between local rules and group behavior is interactive. Thus, the design of proper local rules and control strategies is crucial for generating desired cluster system behaviors.
In recent years, there has been some research on the local rules of the unmanned system. In 2001, Desai et al. [20] studied the frame transition behavior of mobile robot formation when encountering obstacles. In the study, each robot had its own controllers and sensors, and then various local rules were formulated for different robots to realize obstacle climbing, which can be regarded as a kind of distributed control. Furthermore, Couzin et al. [21] studied a self-organization model, known as the R-A model, in a three-dimensional space. In the model, each individual has its own local rule. Concretely, each individual takes its position as the center to establish three circles with different radii: the smallest circle is called the zone of repulsion (ZoR), the largest circle is called the zone of attraction (ZoA), and the middle circle is called the zone of orientation (ZoO). Based on the three circles, each individual has three distinct movement rules to keep itself in the ZoA during cooperative movement. In 2015, Ji [22] presented a distributed collaborative search algorithm for multi-UAV clusters with limited perception and communication capabilities in a non-convex environment. An adaptive density function was designed based on a real-time updated probability map and an uncertainty map. Relying on the density function, each UAV can adjust its search direction during the search process, and the probability maps of two UAVs can be fused if the two UAVs are in a predefined communication range.
From the above research, it can be concluded that under the premise of good local rules, using a distributed system is a reasonable cluster collaboration strategy. In fact, although a centralized framework can obtain the global optimal solution for some issues [23,24,25], the increasing complexity of a problem and a large number of UAVs may cause the load of the computing center to increase significantly. As a result, the search performance of the centralized framework will deteriorate rapidly. Hence, utilizing a distributed collaborative framework, in which the total computing load is allocated on each unit, is a promising choice to deal with large-scale CPP problems.
In most studies of online path planning for UAVs, the goal is to find one (or more) optimal solution. However, it is very hard to obtain the optimal solution due to the fact that the environmental information in a dynamic and unknown environment is often incomplete. Moreover, even if the true optimal solution can be obtained, the solution may be unsuitable in many real applications. For instance, for some online path planning problems, the real-time ability is more important than an optimal route. Thus, the improvement of the real-time ability has become a research focus for such problems [26,27].
The differential evolution (DE) algorithm is an outstanding heuristic optimization algorithm, and some favorable characteristics of DE, such as a low number of parameters, easy implementation, and parallelism, have enabled it to be successfully applied in many fields. Some studies have shown that DE can offer more promising results than other evolutionary algorithms, such the particle swarm optimization algorithm and genetic algorithm. Thus, in this study, a DE-based cooperative search strategy of multiple UAVs, named DECSMU, is proposed. Its main contributions are detailed as follows:
A Distributed Model Predictive Control (DMPC) strategy is used to complete the cooperative behavior among multiple UAVs. Furthermore, the complexity of the problem is reduced by decomposing a problem into multiple time slices so as to establish the basis of UAV real-time performance.
For each UAV in a single time slice, a simplified DE is used for online path planning.
According to characteristics of different types of regions, a dynamic reward function, which can be changed with the state of the UAV, is designed to enhance the coverage ability and energy saving ability of UAVs.
The remainder of this paper is organized as follows. Section 2 introduces the problem of this paper, give the methods and conditions of map update and fusion. In Section 3, a simplified differential planning algorithm is explained, and the flowchart of DMPC is given. Section 4 introduces the dynamic reward function and gives the pseudocode of DECSMU. In addition, three sets of experiments are designed to verify the proposed method, and the simulation results, along with the analysis, are shown in Section 5. Finally, the conclusions are presented in Section 6.

2. Modeling of the Swarm of UAVs

In this study, each UAV is equipped with a Field of View (FOV) sensor. Multiple UAVs in a swarm are required to fly on the same plane to ensure that the proportions of the images are uniform. Moreover, two UAVs in the swarm can exchange their information when they are within a communication range, but they cannot transfer the information. The swarm of UAVs adopts a distributed collaborative search framework, in which each UAV formulates its search path based on its local information and other UAVs’ strategy information within the communication range. If necessary, the search path can be adjusted.
It is worth noting that, in real environments, an island diagram may be non-convex and discrete. Thus, when utilizing UAVs to cover and search an island terrain, the UAVs need to have the ability to search across the non-search area. Although there are some non-mission areas within an island terrain, the UAVs can fly across the non-mission areas to cover and search the island terrain more effectively.

2.1. Model of Mission Area

In order to facilitate the design and simulation of the search algorithm, this study uses a grid-based method to simplify satellite photographs of the islands. For example, the real island map demonstrated in Figure 1a can be rasterized as a simplified map shown in Figure 1b.
In Figure 1b, the task area M is placed in an Lx × Ly sized raster, where the position of any grid g can be represented as Equation (1).
g x , y = x , y x , y M
In general, this study uses discretized raster location coordinates to represent geographical locations; for example, (x, y) in Equation (1) means that the grid g is located in the row x and the column y. The search state for raster gx,y at time t can be expressed as Equation (2).
S ( g x , y , t ) = 0 ,       g x , y M n 1 ,       g x , y M s 1 ,       g x , y M u
where Mn, Ms, and Mu denote the non-mission area, searched area, and unsearched area, respectively.

2.2. Configuration Transfer Model

In this study, the C-space idea proposed by [4] is used to configure the state of a UAV. Suppose N is the number of UAVs in a swarm, UAVi, i ∈ [1, NU] stands for the i-th UAV, and the configuration of the i-th UAV at time t can be expressed as Equation (3).
U A V i = x ( t ) ,     y ( t ) ,   θ ( t ) | x ( t ) ,     y ( t ) M ,   θ ( t ) 0 ,   2 π
where x(t) and y(t) are the horizontal and vertical coordinates of the UAVi at time t and θ(t) is the deflection angle of the UAVi at time t. Thus, the configuration transfer equation of the UAVi can be obtained as Equation (4).
x ( t + 1 ) = x ( t ) + Δ t V cos θ ( t ) y ( t + 1 ) = y ( t ) + Δ t V sin θ ( t ) θ ( t + 1 ) = θ ( t ) + Δ u ( t )
where ∆t is a unit time slice, V is the flight velocity of the UAVi, and ∆u is the change value of deflection angle.
According to this study, UAVs can be separated into two types, i.e., hoverable and non-hoverable UAVs. When a hoverable UAV flies within a grid map, it has eight candidate search directions at each time point, i.e., front, back, left, right, left-front, right-front, left-rear, and right-rear. On the contrary, a non-hoverable UAV only has three candidate search directions at each time point, i.e., front, left-front, and right-front of the current flight directions. In this study, non-hoverable UAVs are considered, and the motion model of each UAV is shown in Figure 2, in which each red circle means the new position of the UAV and each black arrow denotes the UAV’s fly directions.
Generally, a UAV has a view window during the coverage search process. Concretely, based on the grid diagram, the view window of UAVi can be represented by a view radius r. For example, at time t, if the position of UAVi is (x(t) and y(t)); then, the view window Wi(t) of the UAVi can be represented as (x(t) ± r and y(t) ± r). In this case, we can assume that the regions within the view window have been searched. The view window of the UAVi at time t, denoted as Wi(t), in a grip map is shown in Figure 3.

2.3. Map Update and Integration

When using multiple UAVs to perform a coverage search task, each UAV needs to know the current coverage state according to its own local map. If two UAVs are within their communication range, they can share their maps. In other words, two local maps need to be integrated into a new map, which can be shared by the two UAVs.
For example, the local map of UAVi is Mi, which is initialized as Mi = M. Then, the local map status at time t is represented by Si(gx,y, t). Based on the search window, UAVi can move to the next position and then update its local map information. The update rule of the local map information can be defined as Equation (5).
S i ( g x , y , t ) = 1 ,         if   g x , y W i ( t )   and   S i ( g x , y , t 1 ) 0 0 ,         if   g x , y W i ( t )   and   S i ( g x , y , t 1 ) = 0
During the search process, if two UAVs within a same communication range and the local maps of the two UAVs can be integrated, then, more complete environmental information is obtained. Because the communication network changes dynamically with time, we need to use a dynamic communication topology matrix to represent this relationship. In this study, the communication condition at t is defined as Equation (6).
c i j = 0 ,       d i s t x , y t > c d 1 ,       s d < d i s t x , y t c d
where cij is the communication condition between UAVi and UAVj; distx,y(t) represents the distance between UAVi and UAVj at t; cd represents the communication distance; and sd represents the safety distance.
When cij = 1, the UAVi and UAVj can communicate with each other. In this case, the maps of two UAVs can be integrated into one map, which can be shared by the two UAVs in the following search process. The integration operator in this study is defined as , which can be described as Equation (7).
S i ( g x , y , t ) S j ( g x , y , t ) 1 1 = 1 1 1 = 1 0 0 = 0
Obviously, the integration operator satisfies the commutative law and associative law.

3. DE-Based Cooperative Search Strategy of Multiple UAVs

3.1. Planning Strategy Collaboration Using DMPC

Model predictive control (MPC), also known as rolling time domain control, is an optimal control method that divides the long-time span control problem into several short-time span control problems. When using UAVs to solve CPP problems, the path plan is a dynamic process. Concretely, each UAV in a swarm needs to optimize its search path at each time point t according to other UAVs’ conditions and the task completion of the CPP. Thus, in this study, DMPC is applied to the UAV-based path planning problem.
The core concept of MPC is the division of an entire problem into current and future problems in a control cycle. Accordingly, the current problem can be solved or optimized based on the current system state, while some states of the future problems can be predicted. When using UAVs to solve a CPP problem, the coverage search problem is a dynamic optimization problem. Thus, we combine a distributed framework with MPC, named DMPC, to establish the UAV decision-making process, which is illustrated in Figure 4.
In Figure 4, DMPCi and DMPCj (ji) represent the model predictive control system of UAVi and other UAVs, respectively. Figure 4 shows that, in the control cycle t, the decision-making process of UAVi is divided into three parts, as follows:
  • Current status acquisition. Each UAVi obtains its current local environment map Mi(t) through local decision-making. Meanwhile, an environment map Mj(t) of other UAVj in time t can also be obtained through a communication network.
  • System state prediction. Based on the current states Mi(t) and Mj(t), the predicted state M’j(t + 1) at time t can be obtained.
  • Decision optimization. The proposed DE is used to optimize the current decision, i.e., path planning, according to the prediction state and constraint quantity, and to obtain the final decision Mi(t + 1) at time t + 1.
The system will update continuously with time according to the above decision-making process and obtain the final solution after all the time slices are used up.
In Figure 4, it can be seen that the DE-based decision optimization part plays a crucial role in improving the coverage search performance of multiple UAVs. Based on DMPC, the flowchart of a UAV’s path planning framework is shown in Figure 5, in which Pit denotes the best solution obtained (i.e., the best path) for UAVi at time t.

3.2. Path Generation Based on DE

DE is an outstanding heuristic optimization algorithm, and some favorable characteristics of DE, such as a low number of parameters, easy implementation, and parallelism, enable it to be successfully applied in many fields [28,29,30,31]. Thus, in this study, DE is adopted as an optimizer to plan UAVs’ coverage search paths. A standard DE consists of four distinct steps, i.e., initialization, mutation, crossover, and selection, the details of which are introduced as follows.

3.2.1. Initialization

When utilizing DE to optimize a problem, the first issue is to obtain an initial population. Considering that a distributed path planning framework is adopted in this study, we design an improved DE based on multiple subpopulations. Then, each UAV evolves in a separate subpopulation Pti locally to find an optimized coverage search path.
In the initial stage of the search process, NP individuals X i , n t with H genes are randomly generated for each UAVi, where t represents the number of iterations and H denotes the steps in each plan path. In X i , n t , each element represents the direction of each step; for example, X i , n t = {1, 1, 0, 0, −1} indicates that the turning directions of the next five successive steps are as follows: turn left, turn left, go straight, go straight, and turn right. The generalization formula of X i , n t in any number of iterations t be described as Equation (8).
X i , n t = x i , n , 1 t , x i , n , h t , , x i , n , H t P i t
where 1 i N U , 1 n N P , 1 t T , and 1 h H .
The initialization of the population is generated using a uniform random function. For example, the h-th position of an individual can be randomly initialized according to Equation (9).
x i , n , h 1 = x min + r a n d x max x min
where xmin and xmax represent the lower and upper boundaries, respectively, and rand is a random value in the interval [0, 1].

3.2.2. Mutation

In each generation, each individual X i , n t undergoes a mutation operator to obtain a mutant vector V i , n t . Generally, there are six widely applicable mutation operators. In this study, the basic mutation operator “DE/rand/1” is selected by each individual, and the operator is defined as Equation (10).
V i , n t = X r 1 , n t + F X r 2 , n t X r 3 , n t
where X r 1 , n t , X r 2 , n t , and X r 3 , n t are three randomly selected individuals from a subpopulation that the individual X i , n t belongs to. F ( 0, 1) can be regarded as a scaling factor, and V i , n t = v i , n , 1 t , v i , n , 2 t , v i , n , H t is the generated mutant vector for the individual X i , n t .

3.2.3. Crossover

After the mutation operation, the original individual X i , n t and its mutant vector V i , n t need to be crossed to generate a new individual X i , n t + 1 = x i , n , 1 t + 1 , x i , n , 2 t + 1 , x i , n , H t + 1 . Because there is no selection operation, the new individual will replace the old individual in the corresponding position in the original population directly. The formula of the crossover operation is described as Equation (11).
x i , n , h t + 1 = v i , n , h t ,   r a n d C R x i , n , h t ,   o t h e r
where CR is a crossover rate, which determines whether x i , n , h t + 1 is copied from x i , n , h t or v i , n , h t .
Finally, each floating-point vector needs to be converted into an integer vector according to Equation (12).
x i , n , h t + 1 = 1 , x i , n , h t > 1 3 x max 0 ,   other   1 , x i , n , h t < 1 3 x max
where x i , n , h t + 1 = 1 ,     0 ,   1 indicates whether UAVi will turn left, right, or straight in the current direction in the h-th step.

3.2.4. Selection

After obtaining the new path X i , n t + 1 , its performance needs to be evaluated. Detailed information about the metrics and evaluation methods is provided in Section 4. Based on the results of the evaluation, the optimized path can be selected based on Equation (13)
X i , n t + 1 = X i , n t + 1 ,   if   f i t ( X i , n t + 1 ) f i t ( X i , n t ) X i , n t ,   else  
After T iterations of the above operators, i.e., mutation, crossover, and selection operators, we can obtain NP candidate coverage search paths. Then, the best path X i , b e s t t + 1 measured by the evaluation metrics introduced in Section 4 is adopted as the search path of UAVi in the next search process.

4. Fitness of a Planning Path

When using DE to optimize search paths of multiple UAVs, the first issue is to determine how to evaluate a search path. In this study, a dynamic fitness function is proposed, in which energy consumption and coverage are two optimization objectives. The details of the fitness function are discussed as follows.

4.1. Coverage Rate

When performing a coverage search task, the coverage rate of a UAV is a core index that is used to evaluate its performance. The higher the coverage rate is, the greater the possibility that the task can be completed.
To compute the coverage rate of a planning path of UAVi, we need to determine the coverage of the two consecutive steps; the coverage increment of the UAV at the time t can be calculated according to Equation (14).
Δ C o v i ( t ) = C o v i ( t ) C o v i ( t 1 )
where the coverage rate C o v i ( t ) is calculated according to Equation (15).
C o v i ( t ) = x = 1 L x y = 1 L y S i ( g x , y , t )
Finally, the coverage evaluation function can be defined as Equation (16).
R 1 i ( t ) = Δ C o v i ( t ) 5 ( H 1 ) + ( 2 r + 1 ) 2
where H is the number of search steps in each planning path, and r is the radius of a UAV’s view window. In this study, H and r are set as 7 and 1, respectively.
This equation is the normalization of the coverage increment intended to facilitate the calculation of the total reward function. The denominator of Equation (16) is the maximum coverage increment of the UAV fleet after H steps.

4.2. Energy Consumption Estimation

The energy consumption of a UAV is an important index for determining its performance. There are many factors that affect the energy consumption of a UAV, but there is no doubt that frequently changing direction always consumes more energy than a straight-line flight with a constant speed. Therefore, this study takes the total turning times of a UAV in each complete planning path as a criterion to evaluate the UAV’s energy consumption. The energy consumption evaluation function can be defined as Equation (17).
R 2 i ( t ) = 1 t u r n N ( t ) H
where turnN(t) ∈ {1, 2, …, H} is the number of turns in the planning path at time t and H is the number of search steps in each planning path.
The maximum number of turns that a UAV completes in a single planning path is H. Thus, R 2 i ( t ) epresents the ratio of the number of turns without turning to the step. The larger R 2 i ( t ) indicates that the UAV has fewer turns in a single planning step and lower energy consumption.

4.3. Dynamic Fitness Function

In this study, we aim for multiple UAVs to perform efficient coverage searches in the search area. In practical applications, an island map may be non-convex and discrete. Thus, it is unavoidable that a UAV flies across the non-search areas. Therefore, in this study, it is also important to determine how to effectively deal with this situation and achieve better search results.
For example, if UAVi is located in a search area at time t, it should pay more attention to coverage as well as energy consumption. Conversely, if UAVi is located in a non-search area at time t, the priority of the UAV should be to find a route back to the search area rather than the coverage rate and energy consumption. In this case, the UAV needs to perform more turns to re-enter the search area. Thus, when a UAV is out of the search area, the evaluation metric can be defined as Equation (18).
R 3 i ( t ) = t u r n N ( t ) H
Based on the three evaluation metrics, which are separately defined as Equations (16)–(18), the fitness of UAVi can be evaluated based on Equation (19).
f i t ( X i , n t ) = ω 1 R 1 i ( t ) + ω 2 R 2 i ( t ) ,   if   X i , n t   M n ω 3 R 1 i ( t ) + ω 4 R 3 i ( t ) ,   else  
where ω 1 ~ ω 4 are four weights of the different evaluation metrics in different conditions.
The dynamic fitness function is used to evaluate the performance of each planning path of UAVi, and the path with the largest fitness value is the optimal solution for UAVi. Generally, the optimal solution of UAVi at time t, recorded as X i , b e s t t , is regarded as a real search path of UAVi.

4.4. Map Integration

To realize the cooperative coverage search of multiple UAVs, different UAVs need to share their search information through a communication process. Specifically, two different UAVs share and integrate their local maps and then update their map information according to Equation (7). Consequently, each UAV plans its own subsequent search path based on the updated map. Without loss of generality, this study defines the communication range of each UAV as a standard circle. When the Euclidean distance between two UAVs is in a preset communication range, the two UAVs can communicate with each other, and then, their map information can be integrated and shared.

4.5. Framework of DECSMU

Finally, based on the aforementioned discussions, the pseudocode of the proposed DECSMU is given in Algorithm 1.
Algorithm 1. DECSMU
Input: Initialization: t = 1, ω 1 , ω 2 , ω 3 , ω 4 , X i , n t (1 ≤ iNU, 1 ≤ nNP);
   Mi, = M (1 ≤ iNU), where M is a rasterized map of the task area;
1: while not satisfy stop conditions do
2:   for i = 1: NU do // Nu is the number of UAVs
3:    for j = 1: NP do // Np is the number of paths for each UAV
4:     Random generate X i , j t ;
5:     Performing Mutation, Crossover, and Selection operators on X i , j t (see Section 3);
6:     Generate X i , j t + 1 ;
7:    end for
8:    Obtain X i , b e s t t + 1 from all the X i , j t + 1 ;
9:   end for
10:   Each UAV performing its search path according to the obtained X i , b e s t t + 1 ;
11:   for i = 1: NU do
12:     for j = 1: NU do
13:      Perform map cooperation based on Mi and Mj (see Section 4.4);
14:     end for
15:   end for
16:   t = t + 1;
17: end while
18: Integrate local maps of all UAVs into a single map M according to Equation (7);
19: Return M.
Note that, in Algorithm 1, the mutation, crossover, and selection operators in line 5 are executed in multiple iterations rather than in a single iteration. In other words, the evolution process of DE is described by line 5 in Algorithm 1. Then, the optimal path (i.e., X i , j t + 1 in line 6) is obtained after the optimization process of DE.

5. Experiments and Discussion

5.1. Performance of Parameters

In DECSMU, many parameters determine its behavior. In this part, the performance of the two key parameters, i.e., the number of UAVs (NU) and the search step (H), are examined by a set of experiments. In the experiments, four different NU (NU = 2, 3, 4, and 5) and seven different H (H = 3, 5, 7, 9, 11, 13, and 15) are tested, and the experimental results are demonstrated in Figure 6. Note that all the experiments in this study are conducted based on the following systems:
  • OS: Windows 10
  • CPU: Intel i7-7700K, 4.20 GHz
  • RAM: 16 GB
  • Language: Matlab R2020a
In the experiments, two metrics are used to measure the performance of different values of the two parameters. One of the metrics is the coverage rate, which is the main objective of DECSMU. The other one is the average turn time of each UAV, which can be utilized to evaluate the energy consumption of the UAV.
In Figure 6a, we can see that DECSMU with two UAVs achieves the lowest coverage ratio, while DECSMU with five UAVs yields the highest coverage ratio. The results suggest that more UAVs can bring a higher coverage ratio. Meanwhile, it can also be seen that as the number of UAVs increases, the difference in the coverage ratio of DECSMU becomes smaller. For instance, the difference in the coverage ratio between four UAVs and five UAVs is less than 5% for the entire tested search step. Moreover, the experimental results also indicate that H = 5 or 7 can result in very favorable performance.
In addition, the experimental results illustrated in Figure 6b show that the average turn time of each UAV rises rapidly with an increase in plan steps. Moreover, we also notice that when using two UAVs, each one has a higher turn time in almost all the planned steps. In fact, the phenomenon does not mean that each UAV has a higher turn time than other conditions (such as three, four, and five UAVs). On the contrary, the phenomenon indicates that when using fewer UAVs to cooperate in the search process, each UAV needs to fly a longer distance.
According to the experimental results presented in Figure 6a,b, we can obtain the following preliminary conclusion. In this study, four and five UAVs can offer outstanding performance when H = 5 or 7. On the one hand, the configuration can bring a coverage ratio of more than 90% in the predefined search process. On the other hand, when utilizing four or five UAVs in DECSMU, each UAV has a lower turn time if H = 5 or 7. As a result, in the following experiments, four UAVs and H = 7 are adopted in DECSMU since we assume that the configuration can bring a balance between the coverage ratio and energy consumption.

5.2. Experimental Setup

To verify the performance of DECSMU, we set up a simulation environment for the coverage search of multiple UAVs and designed three experiments to verify the robustness of DECSMU in different search regions.
In the experiments, four UAVs, which are placed in different locations on the simulation task map, execute the coverage search. The size of the simulation task map is 100 × 100, the view radius of each UAV is r = 1, and the search step is H = 7, replacing time t with the number of iterations. The stop condition of DECSMU is that each UAV performs path planning 100 times. The three different search maps used in the experiments are illustrated in Figure 7, in which the blue part denotes the area that needs to be searched, while the white part represents the area that does not need to be searched. In this study, DE is utilized to optimize the coverage search path for each UAV. The parameters in DE are important for the coverage task. Thus, in this study, some popular parameters are set in advance, and a set of optimal parameters is adopted in the following study. These parameters include NP = 100, T = 100, H = 7, F = 0.5, and CR = 0.1. Four weights are set as follows: ω 1 = 0.6, ω 2 = 0.4, ω 3 = 1.0, and ω 4 = 0.
In this study, two convex regions (i.e., a circular region and a rectangular region) and a non-convex region are adopted to evaluate the performance of DECSMU. The maps of the three regions are illustrated in Figure 7.

5.3. Experimental Results and Discussion

In this section, the experimental results and discussion are presented. To verify the overall performance, DECSMU is executed through 30 independent runs on each map, and the results, in terms of the coverage ratio, are presented in Table 1.
In the table, we can see that DECSMU exhibits a very reliable performance for the different maps. Concretely, it achieves a coverage ratio of more than 90% for the three different maps, and each optimal search step can be obtained within 2 s. Moreover, it displays more outstanding convex maps, i.e., circular and rectangular maps. Although the results for the non-convex map are dominated by the results for the other two maps, DECSMU still offers acceptable performance for the complicated map.
In order to describe the characteristics of DECSMU more intuitively, the coverage results for the three regions are demonstrated in Figure 8, Figure 9 and Figure 10, respectively. In the three figures, each symbol ‘*’ in the map represents the starting location of a UAV, and each symbol ‘★’ in the map represents the ending location of a UAV. Each UAV’s path is identified by a line of a different color, and the black area denotes the coverage area.

5.3.1. Experimental Results for the Circular Region

Figure 8a shows that all four UAVs search within the circular region after they enter this region. Moreover, Figure 8b shows that the coverage rate of multiple UAVs is higher than 98% after 100 search steps. Figure 8b also shows that the coverage rate is more than 70% after 50 search steps. In Figure 8c,d we can see that all UAVs have similar energy consumption when they perform a coverage search within the circular region, except UAV4, which consumes more energy than the other three UAVs outside the circular region. However, after a few search steps, the UAV flies towards the circular region based on the dynamic reward function.

5.3.2. Experimental Results for the Rectangular Region

The rectangular region is another convex area adopted in this experiment. From the result demonstrated by Figure 9a, we can observe that the cooperation of four UAVs can attain a very favorable coverage performance in the rectangular region. Concretely, in Figure 9b, one can see that the coverage rate of the four UAVs is more than 95% of the region after 100 search steps. Similar to the results for the circular region, it also can be observed from Figure 9b that the coverage rate is more than 70% after 50 search steps. Moreover, in Figure 9c–f, we can observe an interesting phenomenon, i.e., each UAV mainly searches within a region. For instance, UAV1 pays more attention to the upper region, and UAV2 focuses on the lower right corner of the rectangular region, while UAV3 focuses on the left corner of the region. The planned paths of all four UAVs indicate that multiple UAVs can cooperate effectively.

5.3.3. Experimental Results for the Non-Convex Region

In the above experiments, two convex regions, i.e., the circular region and the rectangular region, were adopted. In fact, non-convex regions are more common in the real environment. Moreover, a non-convex region is more difficult to cover than a convex region. Thus, in this section, a non-convex region is selected to evaluate the performance of DECSMU, and the results of the experiments are demonstrated in Figure 10.
The experimental results demonstrated in Figure 10a show that the majority of the non-convex region was searched by the four UAVs, except for a few spikes on the region’s edge. Figure 10b indicates that the coverage rate of the four UAVs is about 95% of the difficult region after 100 search steps, which is slightly worse than the results for the rectangular region.
Furthermore, from the search paths of the four UAVs demonstrated in Figure 10c–f, we can observe that each UAV can rapidly re-fly into the target areas if the UAV has flown outside the target areas. The results show that the dynamic reward function is helpful for a UAV to plan efficient paths in different conditions.

6. Conclusions

In this paper, we proposed a multi-UAV-based cooperative coverage search algorithm to deal with the coverage problem in various regions. In the proposed algorithm, named DECSMU, each UAV independently uses DE to plan and optimize its search path. To reduce the complexity of the coverage problem, we introduced a DMPC strategy to establish the UAV decision-making process. Based on DMPC, the complexity of the coverage problem can be reduced effectively. Moreover, in order to solve some constraints in real environments, a dynamic reward function is proposed, in which the turning angle, communication constraints, energy consumption, and coverage are considered.
In order to evaluate the performance of the proposed method, two typical regions, i.e., a circular region and a rectangular region, and a non-convex region are adopted to verify the robustness, error correction ability, and energy saving ability of multiple UAVs. The experimental results demonstrated that DECSMU yields an outstanding and reliable performance in the three regions. Moreover, the favorable property of the dynamic reward function was also verified by a set of experiments. In future studies on DECSMU, we will improve its application in a more realistic communication environment, such as how to search more effectively when considering the communication tower.

Author Contributions

Software and writing—original draft preparation, H.Z.; writing—original draft preparation and editing, L.T.; and data curation and supervision, X.X. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Natural Science Foundation of Xinjiang Uygur Autonomous Region (Grant No. 2021D01A67), the Knowledge Innovation Program of Wuhan-Shuguang Project (Grant No: 2022010801020426), and the Natural Science Foundation of Fujian Province (Grant No. 2021J011008).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data are contained within the article.

Conflicts of Interest

All the authors declare no conflicts of interest.

References

  1. Azpurua, H.; Freitas, G.M.; Macharet, D.G.; Campos, M.F.M. Multi-robot coverage path planning using hexagonal segmentation for geophysical surveys. Robotica 2018, 36, 1144–1166. [Google Scholar] [CrossRef]
  2. Zhu, S.; Wang, D. Adversarial ground target tracking using uavs with input constraints. J. Intell. Robot. Syst. 2012, 65, 521–532. [Google Scholar] [CrossRef]
  3. Wang, N.; Li, Z.; Liang, X.L.; Hou, Y.Q.; Wu, A. Cooperative search algorithm for UAV swarm based on search intention interaction. J. Beijing Univ. Aero. Astro. 2022, 48, 454–463. [Google Scholar] [CrossRef]
  4. Aminzadeh, A.; Khoshnood, A.M. Multi-UAV cooperative search and coverage control in post-disaster assessment: Experimental implementation. Intel. Serv. Robot. 2023, 16, 415–430. [Google Scholar] [CrossRef]
  5. Giesbrecht, J. Global Path Planning for Unmanned Ground Vehicles; Technical Report; Defence Research and Development Suffield: Alberta, AB, Canada, 2004. [Google Scholar]
  6. Hasircioglu, I.; Topcuoglu, H.R.; Ermis, M. 3-D path planning for the navigation of unmanned aerial vehicles by using evolutionary algorithms. In Proceedings of the 10th Annual Conference on Genetic and Evolutionary Computation, Atlanta, GA, USA, 12–16 July 2008; ACM Press: New York, NY, USA; pp. 1499–1506. [Google Scholar]
  7. Zhang, M.G.; Wu, X.N.; Li, J.; Wang, X.K.; Shen, L.C. Integrated design of cooperative area coverage and target tracking with multi-UAV system. J. Intell. Robot. Syst. 2023, 108, 77. [Google Scholar] [CrossRef]
  8. Maza, I.; Ollero, A. Multiple uav cooperative searching operation using polygon area decomposition and efficient coverage algorithms. In Proceedings of Distributed Autonomous Robotics Systems; Springer Press: Tokyo, Japan, 2007; pp. 221–230. [Google Scholar]
  9. Acevedo, J.J.; Arrue, B.C.; Maza, I.; Ollero, A. Cooperative large area surveillance with a team of aerial mobile robots for long endurance missions. J. Intell. Robot. Syst. 2013, 70, 329–345. [Google Scholar] [CrossRef]
  10. Balampanis, F.; Maza, I.; Ollero, A. Area decomposition, partition and coverage with multiple remotely piloted aircraft systems operating in coastal regions. In Proceedings of the 2016 International Conference on Unmanned Aircraft Systems (ICUAS), Arlingto, VA, USA, 7–10 June 2016; IEEE Press: Piscataway, NJ, USA; pp. 275–283. [Google Scholar]
  11. Balampanis, F.; Maza, I.; Ollero, A. Coastal areas division and coverage with multiple UAVs for remote sensing. Sensors 2017, 17, 808. [Google Scholar] [CrossRef] [PubMed]
  12. Aggarwal, S.; Kumar, N. Path planning techniques for unmanned aerial vehicles: A review, solutions, and challenges. Comput. Commun. 2020, 149, 270–299. [Google Scholar] [CrossRef]
  13. Nikolos, I.K.; Valavanis, K.P.; Tsourveloudis, N.C.; Kostaras, A.N. Evolutionary algorithm based offline/online path planner for uav navigation. IEEE Trans. Syst. Man Cyber. Part B 2003, 33, 898–912. [Google Scholar] [CrossRef]
  14. Li, B.; Patankar, S.; Moridian, B.; Mahmoudian, N. Planning large-scale search and rescue using team of uavs and charging stations. In Proceedings of the 2018 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), Philadelphia, PA, USA, 6–8 August 2018; IEEE Press: Piscataway, NJ, USA; pp. 1–8. [Google Scholar]
  15. Wu, X.J.; Xu, L.; Zhen, R.; Wu, X.L. Global and local moth-flame optimization algorithm for UAV formation path planning under multi-constraints. Int. J. Control Autom. Syst. 2023, 21, 1032–1047. [Google Scholar] [CrossRef]
  16. He, Y.; Wang, M.R. An improved chaos sparrow search algorithm for UAV path planning. Sci. Rep. 2024, 14, 366. [Google Scholar] [CrossRef] [PubMed]
  17. Xing, P.Z.; Zhang, H.; Ghoneim, M.E.; Shutaywi, M. UAV flight path design using multi-objective grasshopper with harmony search for cluster head selection in wireless sensor networks. Wireless Netw. 2023, 29, 955–967. [Google Scholar] [CrossRef]
  18. Liang, X.; Sun, Q.; Yin, Z.; Wang, Y.; Liu, P. Review on large-scale unmanned system swarm intelligence control method. Appl. Res. Comput. 2015, 32, 11–16. [Google Scholar]
  19. Han, J.; Li, M.; Guo, L. Soft control on collective behavior of a group of autonomous agents by a shill agent. J. Syst. Sci. Complex. 2006, 19, 54–62. [Google Scholar] [CrossRef]
  20. Desai, J.P.; Ostrowski, J.P.; Kumar, V. Modeling and control of formations of nonholonomic mobile robots. IEEE Trans. Robot. Autom. 2001, 17, 905–908. [Google Scholar] [CrossRef]
  21. Couzin, I.D.; Krause, J.; James, R.; Ruxton, G.D.; Franks, N.R. Collective memory and spatial sorting in animal groups. J. Theor. Biol. 2002, 218, 1–11. [Google Scholar] [CrossRef] [PubMed]
  22. Ji, X.; Wang, X.; Niu, Y.; Shen, L. Cooperative search by multiple unmanned aerial vehicles in a nonconvex environment. Math. Probl. Eng. 2015, 32, 196730. [Google Scholar] [CrossRef]
  23. Li, J.; Li, X.; Yu, L. Multi-uav cooperative coverage path planning in plateau and mountain environment. In Proceedings of the 2018 33rd youth academic annual conference of Chinese association of automation (YAC), Nanjing, China, 18–20 May 2018; IEEE Press: Piscataway, NJ, USA; pp. 820–824. [Google Scholar]
  24. Lin, W.; Zhu, Y.; Zeng, W.; Wang, S. Track planning model for multi-uav based on new multiple ant colony algorithm. In Proceedings of the 2018 Chinese Automation Congress (CAC), Xi’an, China, 23–25 November 2018; IEEE Press: Piscataway, NJ, USA; pp. 3862–3867. [Google Scholar]
  25. Su, X.H.; Zhao, M.; Zhao, L.L.; Zhang, Y.h. A novel multi stage cooperative path re-planning method for multi uav. In Proceedings of the Pacific Rim International Conference on Artificial Intelligence, Phuket, Thailand, 22–26 August 2016; Springer Press: Berlin/Heidelberg, Germany; pp. 482–495. [Google Scholar]
  26. Chen, J.; Zha, W.; Peng, Z.; Zhang, J. Cooperative area reconnaissance for multi-uav in dynamic environment. In Proceedings of the 2013 9th Asian Control Conference (ASCC), Istanbul, Turkey, 23–26 June 2013; IEEE Press: Piscataway, NJ, USA; pp. 1–6. [Google Scholar]
  27. Zhu, M.; Chen, H.; Xiong, G. A model predictive speed tracking control approach for autonomous ground vehicles. Mech. Syst. Signal Process 2017, 87, 138–152. [Google Scholar] [CrossRef]
  28. Storn, R.; Price, K. Differential evolution–a simple and efficient heuristic for global optimization over continuous space. J. Glob. Optim. 1997, 11, 341–359. [Google Scholar] [CrossRef]
  29. Xia, X.; Tong, L.; Zhang, Y.; Xu, X. NFDDE: A novelty-hybrid-fitness driving differential evolution algorithm. Inf. Sci. 2021, 579, 33–54. [Google Scholar] [CrossRef]
  30. Xia, X.; Gui, L.; Zhang, Y.; Xu, X.; Yu, F. A fitness-based adaptive differential evolution algorithm. Inf. Sci. 2021, 549, 116–141. [Google Scholar] [CrossRef]
  31. Zhou, X.G.; Peng, C.X.; Liu, J.; Zhang, Y.; Zhang, G.J. Underestimation-assisted global-local cooperative differential evolution and the application to protein structure prediction. IEEE Trans. Evol. Comput. 2019, 24, 536–550. [Google Scholar] [CrossRef] [PubMed]
Figure 1. A simplified rasterized map for an island map. (a) A real island map. (b) A grid map of the island.
Figure 1. A simplified rasterized map for an island map. (a) A real island map. (b) A grid map of the island.
Biomimetics 09 00384 g001
Figure 2. Candidate flight directions of a non-hoverable UAV in a grid map.
Figure 2. Candidate flight directions of a non-hoverable UAV in a grid map.
Biomimetics 09 00384 g002
Figure 3. View window (view radius r = 1) of the UAVi in a grid map. In the grid map, the dark grid denotes the UAV’s current position, and the gray grids denote the regions that have been searched by the UAV.
Figure 3. View window (view radius r = 1) of the UAVi in a grid map. In the grid map, the dark grid denotes the UAV’s current position, and the gray grids denote the regions that have been searched by the UAV.
Biomimetics 09 00384 g003
Figure 4. Decision-making process diagram of multi-UAV cooperative control based on DMPC.
Figure 4. Decision-making process diagram of multi-UAV cooperative control based on DMPC.
Biomimetics 09 00384 g004
Figure 5. The flowchart of a UAV’s path planning framework.
Figure 5. The flowchart of a UAV’s path planning framework.
Biomimetics 09 00384 g005
Figure 6. Performance of the number of UAVs and the plan steps in terms of the coverage rate and average turn time.
Figure 6. Performance of the number of UAVs and the plan steps in terms of the coverage rate and average turn time.
Biomimetics 09 00384 g006
Figure 7. Maps of three different search regions in this study.
Figure 7. Maps of three different search regions in this study.
Biomimetics 09 00384 g007
Figure 8. Coverage results within the circular region.
Figure 8. Coverage results within the circular region.
Biomimetics 09 00384 g008
Figure 9. Coverage results within the rectangular region.
Figure 9. Coverage results within the rectangular region.
Biomimetics 09 00384 g009
Figure 10. Coverage results within the non-convex region.
Figure 10. Coverage results within the non-convex region.
Biomimetics 09 00384 g010
Table 1. The coverage results of DECSMU for the 3 maps.
Table 1. The coverage results of DECSMU for the 3 maps.
Results Avg.
(%)
Std.Dev.
(%)
Best
(%)
Time Usage
(s)
Maps
Circular98.90.1199.31.1
Rectangular97.50.1298.11.3
Non-Convex94.40.1595.61.4
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zeng, H.; Tong, L.; Xia, X. Multi-UAV Cooperative Coverage Search for Various Regions Based on Differential Evolution Algorithm. Biomimetics 2024, 9, 384. https://doi.org/10.3390/biomimetics9070384

AMA Style

Zeng H, Tong L, Xia X. Multi-UAV Cooperative Coverage Search for Various Regions Based on Differential Evolution Algorithm. Biomimetics. 2024; 9(7):384. https://doi.org/10.3390/biomimetics9070384

Chicago/Turabian Style

Zeng, Hui, Lei Tong, and Xuewen Xia. 2024. "Multi-UAV Cooperative Coverage Search for Various Regions Based on Differential Evolution Algorithm" Biomimetics 9, no. 7: 384. https://doi.org/10.3390/biomimetics9070384

Article Metrics

Back to TopTop