1. Introduction
Every year, avalanches in Europe result in at least one hundred victims. Avalanche transceivers have been demonstrated to reduce mortality from 70.6% to 55.2% by decreasing the rescue time from about 125 to 25 min [
1]. However, this time is still longer than the maximum survival phase duration of 18 min, during which 91% of people can be successfully rescued.
Currently, the use of transceivers is mandatory in several countries and regions of the globe (for example, see Italian law 363/2003), but search missions still involve humans, making these operations dangerous and slow.
The advent of UAVs was an important addition to search-and-rescue operations, but the use of single UAVs is not always the most efficient solution for completing missions [
2,
3,
4,
5]. Depending on the type of operation, the collaboration of more agents with the same objectives can enhance overall efficiency. A typical example arises from the decision to employ smaller and lighter vehicles to reduce costs and increase reliability. The costs of a single aircraft exponentially increase with the take-off weight [
6,
7,
8,
9], whereas the use of several smaller UAVs can be effective in reducing the production and operating costs [
10,
11].
On the other hand, system efficiency can be improved by deploying various sensors on multiple UAVs, thereby reducing the payload of each individual unit and, in the case of redundancy [
2,
3,
4,
12], increasing the robustness of the whole system, which would no longer be related to the reliability of a single aircraft. Furthermore, involving several smaller vehicles with enhanced maneuverability in confined spaces allows for wider areas to be surveyed compared to a larger single aircraft, even when equipped with higher quality sensors. Furthermore, the use of several vehicles can reduce the mission duration, which represents one of the most important challenges to address.
These features can be summarized into three desirable characteristics, namely flexibility, scalability and robustness [
13], which provide improved performance when compared to the use of a single aircraft. Specifically, flexibility is the capability to adapt to changing environmental requirements; robustness is defined as the ability of a formation to continue operations in the presence of partial failures or other abnormal conditions, such as the loss of a vehicle; and finally, scalability represents a self-organized mechanism that supports larger or smaller numbers of individuals without impacting performance [
14].
One of the most significant challenges in deploying a formation of robots in emergency scenarios is the lack of autonomy. This issue weakens system scalability, since UAVs are typically teleoperated by operators at a one-to-one ratio, requiring a large and well-trained team of search-and-rescue operators to be engaged in UAV guidance.
Information sharing and cooperation among UAVs are the fundamental requirements for creating a formation able to act as a unique system. To ensure global situational awareness, a multi-sensor data fusion grid must be designed [
15,
16], with individual UAVs serving as both producers and consumers of information.
These data are shared through a distributed and dynamic communication graph. The gathered data enable awareness of the surrounding environment, granting UAVs the capability to identify obstacles and targets and locate the whole formation in rapidly changing dynamic environments. The measurement data from each agent, with multiple perspectives distributed throughout the environment, can enhance the “detect & avoid” capabilities of each UAV.
The main architectures employed in multi-sensor fusion problems are centralized, decentralized and distributed schemes [
17].
For centralized fusion architectures, the extended Kalman filter (EKF) is the preferred solution for state estimation [
18,
19,
20,
21]. In this approach, all network nodes broadcast their information to a central unit, which processes it and provides an estimation of the overall state. However, this solution becomes impractical when the number of nodes increases due to the high computational and communication burden associated with a single central processing unit and point-to-point transmission of extensive data. Moreover, if the central node becomes unreachable, it leads to a system failure. Therefore, in scenarios requiring fault tolerance against system failures and the minimization of computation time to operate swiftly in emergency situations, it is imperative to decentralize and distribute the computational workload among all network nodes.
A decentralized scheme involves multiple centers of information that are able to communicate with their neighbors, partially mitigating vulnerability to failure [
22,
23,
24]. Although multi-sensor fusion through a centralized approach is ideally Bayesian-optimal in terms of tracking performance [
17], this kind of architecture lacks scalability and requires reliable sensors, making it expensive.
In a distributed architecture, each node performs estimation using data obtained from locally connected neighbors. This kind of architecture is inherently redundant, enhancing robustness against failure. Furthermore, it presents a lower communication burden since data need not be transmitted to a central unit.
In decentralized and distributed approaches, achieving convergence to the same value for all local state estimates can be obtained through consensus theory, originally introduced from a computer science perspective by Lynch in [
25]. Broadly speaking, consensus is a form of agreement among several spatially distributed agents regarding a certain output value, without resorting to a central processing unit. The main contribution to solving consensus problems in networked systems was given in [
26,
27,
28,
29]. Another significant contribution appears in [
30], where the authors, starting from a previous work [
31], investigate the alignment problem of a set of
N autonomous agents and provide a mathematical analysis of convergence behavior. In [
32], the authors provide a series of interaction rules known as consensus protocols or algorithms, which detail the exchange of information between an agent and its neighbors. The problem of cooperative estimation for a swarm of UAVs using consensus protocols has been addressed in [
33,
34] among others.
Over the years, several forms of consensus strategies have been proposed. The two main instances are the consensus on information (CI) and the consensus on measurement (CM) approaches, particularly adopted for their stability properties. Both methods have complementary characteristics. CI guarantees stability of the solution, even for a single consensus step [
35]. However, as the correlations between measurements from different nodes are completely unknown, the boundedness of the estimation error is not guaranteed. On the other hand, CM can overcome the conservative issue of the CI approach, tending towards optimal performance similar to centralized solutions, but it can only guarantee stability with a large number of consensus steps [
36]. To combine the benefits of both CI and CM, in [
37], the authors introduced a hybrid approach called hybrid consensus on measurements and consensus on information (HCMCI), proving the effectiveness of the proposed algorithm in target tracking problems [
36].
However, while a decentralized/distributed approach decreases the communication burden, avoiding the use of a star network graph with a single central node, it cannot completely address the problem of reducing computational burden. Indeed, in a distributed scheme, the cardinality of the state vector considers the overall system, leading to an excessive computational load on each local CPU in the presence of large number of agents.
The main goal of this paper is to present a distributed navigation algorithm for a formation of UAVs capable of operating in complex and uncertain scenarios, such as search-and-rescue missions. The aim of the proposed estimation method is to be distributed, scalable and fault-tolerant, in order to achieve a high level of situational awareness among UAVs. This method must be suitable to formations involved in critical and hazardous missions, where both the number of aircraft and the reliability of the system can be key factors for the success of the mission.
The proposed algorithm combines the use of Kalman filtering and consensus with a process known as internodal transformation theory (ITT) [
38]. ITT effectively reduces the order of the local estimated state vector, thus significantly lowering the computational complexity of the problem.
The navigation model exploits the benefits of the HCMCI technique and the advantages of ITT in terms of computational efficiency. This enables the estimation of the global state of the formation, supports the guidance and control algorithm and facilitates the detection of targets in the operational scenario.
In this paper, a sensitivity analysis of the main parameters for the proposed algorithm is presented. These parameters take into account the network complexity and the number of consensus steps. Additionally, several numerical results are illustrated to prove the robustness against communication delays and the presence of faults or communication losses. The objective is to illustrate how the performance of the navigation algorithm is affected in realistic situations.
The paper is structured as follows:
Section 2 provides the problem statement, including details about assumptions and model equations; in
Section 3, the distributed target localization algorithm is explained, starting from a centralized version of the Kalman filter and progressively incorporating ITT and consensus;
Section 4 presents the numerical results obtained by running the proposed algorithm through five simulation tests designed to assess its capabilities in achieving situational awareness under several parameter settings, communication delays or interruptions and the presence of a non-cooperative target;
Section 5 offers a discussion of the main findings; finally,
Section 6 contains the conclusions of this paper.
2. Problem Statement
Let us consider a formation of N autonomous UAVs. These UAVs are in flight while surveying a designated area to detect the possible presence of a target. Each aircraft is equipped with its own flight control system and autopilot to ensure stability and maneuverability. At a high level, they can be treated as material points.
For each UAV, the objective of the navigation system is to estimate the position of the other UAVs within the formation and the target. This estimation is based on data provided by on-board sensors such GPS, a transponder for measuring distances to others UAVs, and a dedicated transponder for measuring the distance to the target (apparecchio di ricerca in dei travolti in valanga—ARTVA—or avalanche transceiver or détecteur de victime d’avalanche—DVA).
To achieve this goal, each UAV is equipped with an on-board navigation algorithm based on the consensus Kalman filter, which can fuse information provided by neighbors through dedicated communication channels.
This Kalman filter uses a discrete-time kinematic model, described in the inertial reference frame by the following equations:
where
k is the time step,
represents the vector of the spatial coordinates,
indicates the velocity vector of the
i-th UAV,
is the process disturbance vector and
is the sample time. The process disturbance vector is assumed as a zero-mean white Gaussian noise.
Considering the specific application, let us consider the target as a static point on the map, with zero speed and its position denoted as .
The global state vector
can be defined by incorporating the 3D position and velocity of each aircraft, along with the 3D position of the target, as follows:
The set of sensors is modeled as follows:
The GPS, to measure the position and the speed of the aircraft in the airspace as
where
is the measurement noise and
is the speed of the aircraft, computed as the norm of the velocity vector.
The N transponders, to evaluate the relative distance to any other formation agent. Given that
is the measurement noise and
the Euclidean distance between the aircraft
i and
j, the output
of each transponder is
The ARTVA, to determine the distance to a potential target in the operational scenario. The device considered in this work is capable of measuring only the distance to the target, and it does not incorporate additional functionalities, such as the direction estimation provided by modern digital ARTVAs. This choice was made to test the algorithm under the most challenging conditions, simulating an older analog ARTVA, whose pulses can only be used for distance estimation. Denoting
as the measurement error and
as the distance to the target, the output
of each ARTVA is
The measurement vector of the overall set of sensors is represented by
with
as the overall measurement noise vector. The measurement noise is modeled as a Gaussian process, whose mean value is unknown.
Imagine the UAV formation as a network of sensor nodes. At any time instant k, the communication structure can be depicted as a directed graph , where represents the set of nodes corresponding to the UAVs, and denotes the set of edges describing communication links between the aircraft i and j. The i-th UAV can receive data from the j-th agent if there exists an arc . For each aircraft i, represents the set of its neighbors, which includes the i-th UAV. Let indicate the degree of node i, where denotes the cardinality of a given set.
3. Distributed Target Localization
Let be the global state vector, containing information to describe the behavior of each UAV in the swarm, and be the set of variables measured by the i-th agent.
Considering the definitions of the state vector (
2) and the local state model (
1), the global state equation can be expressed as follows:
where
represents the overall process noise vector, with zero mean and covariance matrix
.
Similarly, the measurement model for the
i-th UAV is defined as:
where the measurement noise is modeled as a white Gaussian noise, with covariance matrix
.
Let us denote with
the overall set of measurements at time
k:
To effectively address the navigation problem for a swarm of aircraft, a multi-sensor fusion approach can be adopted, merging information and measurements from multiple aircraft to provide, at each time instant k, an estimate of the overall system state.
In a centralized approach based on the extended Kalman filter (EKF), the estimate of the global state is obtained through two distinct phases: the prediction phase and the correction phase.
In the prediction phase, the a priori state estimate and its covariance are derived using the mathematical model and measurements acquired up to the time instant . In the correction phase, the a-posteriori state estimate and its covariance are computed by updating and with the most recent set of measurements, denoted as .
The information Kalman filter (IKF) [
38] is a variant of the EKF expressed in terms of information. It is obtained by reformulating the state estimation vector and covariance matrix in terms of the information matrix and information state vector.
The a priori information matrix and information state vector are defined as follows:
The predicted information vector can be computed as:
The a priori information matrix is computed as:
with
At each time step
k, the information associated with the measurement
is:
with
the covariance matrix of measurement noise
,
By using the information associated to
, it is possible to compute the a posteriori information vector:
The a posteriori information matrix and state estimate can be defined as follows:
The proposed distributed estimation strategy combines the HCMCI model with the principles of ITT to ensure stability and convergence to a common state estimate. This is done under the assumption of minimal requirements in terms of connectivity and collective observability [
39]. This strategy allows for the decentralization of the estimation process, providing an estimate of a reduced state vector
for each agent
i, where
. This vector encompasses the state of the
i-th aircraft and its respective neighbors, denoted by
. This approach reduces the computational load of individual agents while simultaneously enhancing the formation fault tolerance in case of central node failures.
In accordance with the ITT model, in order to preserve dynamic equivalence between the global and local models, at any time
k, the local state vector is related to the global one as follows:
where
represents the linear nodal transformation matrix that selects either all the states or a linear combination of them from the global state vector. This selection depends on the UAVs visible from the
i-th aircraft at time
k. The matrix consists of elements that are either 1 or 0, activating or deactivating states, and it is time-varying to take into account changes in the formation topology. Generally, the size of
is
with
. For an effective model reduction, it must be
. In such cases, it admits only the pseudo-inverse matrix, denoted with a superscript plus,
.
On the basis of local measurements, the
j-th node provides the information matrix
and the information vector
with
The information contribution at the
i-th node due to the observations supplied by the
j-th node, expressed in terms of the information matrix
, is the following.
Analogously, the information vector
is defined as follows:
where
The matrix
is the state-space transformation matrix. It defines which UAV needs to communicate its information and indicates which piece of information must be exchanged between each pair of UAVs. Meanwhile, the matrix
represents the internodal transformation matrix within the information space, responsible for selecting and mapping all relevant information from UAV
j to UAV
i. Additionally, the local error covariance matrix for node
i, given the observation from node
j, is defined as:
The corresponding local estimate of the state vector is:
Given the definitions provided above, it is possible to use the ITT to formulate a scalable solution to the estimation problem.
In accordance with the proposed model, the
i-th agent of the swarm generates the a priori state estimate,
, and updates the information matrix
and the information vector
as follows:
where
with
Let us assume that, at the first consensus step (
), the matrices and vectors of information are
and
, respectively. Each aircraft evaluates the average values of
and
as follows:
for
consensus steps.
Coefficients
can be computed using the
Metropolis formula [
40]:
so that, in order to determine the weight, each agent
i does not require any knowledge of the communication graph but only the degrees of its neighboring nodes, denoted as
, with
.
UAVs use the consensus on measurement to update the a priori state estimate using the information associated with the measurements from neighboring aircraft. By employing its local measurements, the
i-th UAV computes the vectors
and
following (
23) and (
24).
Let us assume that, at the first consensus step (
), the matrix and the vector of information are
and
, respectively. At each consensus step
, the
i-th aircraft merges the information sent by its neighbors as follows:
At the end of the consensus procedure, the information matrix and vector,
and
, and the a-posteriori state estimate
are computed as:
where
, is the cardinality of the set
.
A representation of the algorithm flow and the relationship among phases can be seen in
Figure 1.
4. Numerical Simulations
To assess the performance of the proposed algorithm, five simulation tests were conducted with formations of UAVs flying over a designated area of interest. The first simulation test involved a sensitivity analysis to evaluate the impact of key operational parameters, such as the number of consensus iterations L and the communication link schemes , on situational awareness. The second simulation test examined the algorithm resilience by introducing communication disruptions among certain UAVs, demonstrating its ability to resume estimation once communication was reestablished. The third simulation assessed the algorithm performance in estimating the position of a stationary target within the operational scenario. The fourth example explored the robustness of the algorithm to communication delays. Finally, the fifth test case examined the capability of the algorithm to be integrated in a complete guidance, navigation and control algorithm and involved in a realistic post-avalanche scenario. Additionally, a centralized EKF was employed as a benchmark to compare the performance trade-off between centralized and decentralized techniques.
The numerical simulations presented in this study were conducted using MATLAB routines with a time step size of s. Initially, each aircraft knows only its own position information, and the overall state estimate was initialized with random values. This assumption allowed for the evaluation of algorithm convergence towards the correct estimate and the assessment of its ability to achieve consensus.
To take into account measurement errors, they were simulated by introducing biases and white Gaussian noise into the simulation output. The main parameters are summarized in
Table 1.
In order to evaluate the performance of the proposed algorithm, two indicators were defined: standard deviation (SD) and mean absolute error (MAE) [
41]. The estimation error made by the
i-th UAV, relative to the position of the
j-th agent at the time instant
k, is:
where
indicates the position of the
j-th agent estimated by the aircraft
i The average error in a time window composed by
time steps is
The mean absolute error
made by the
i-th UAV is defined as follows:
Similarly, it is possible to define the standard deviation
with respect to the average error as follows:
4.1. Simulation Test #1
Simulation test #1 presents the outcomes of a preliminary sensitivity analysis aimed at examining the impact of key operational parameters, specifically the number of consensus steps L and the communication link schemes , on the performance of the proposed algorithm.
As an illustrative scenario, consider a swarm of 10 UAVs with their initial positions summarized in
Table 2. Assume that all UAVs in the swarm fly in the same direction at a constant speed of 2 m/s, maintaining a fixed altitude of 100 m above the terrain. Throughout the flight, several maneuvers are performed in both the horizontal and vertical planes, involving changes in altitude. These maneuvers enable the evaluation of the navigation algorithm capability to estimate trajectory modifications.
The sensitivity analysis was conducted by considering four distinct configurations of consensus steps, namely
, and three communication schemes (
,
,
), characterized by an increasing number of communication links (refer to
Figure 2).
As a representative result,
Table 3 and
Table 4 present the standard deviation (
54) and the mean absolute error (MAE) (
53) of the estimated position of
performed by
. These results provide insights into the influence of consensus steps and communication links on the estimation process. It is observed that performance improvement is not consistently achieved by increasing the parameter value. Notably, a higher number of communication links among UAVs leads to better performance, indicated by lower values of standard deviation and mean absolute error, and exhibits limited sensitivity to an increase in consensus steps. This characteristic of the algorithm is further depicted in
Figure 3, which visually illustrates the findings from the analysis of
Table 3 and
Table 4.
Figure 4 presents an example of the estimated trajectory of UAVs as viewed from
compared with reference trajectories. In this example, with an arbitrary communication configuration
and
, only
,
and
are visible from
, as indicated in
Figure 2. Despite the presence of maneuvers, each agent is capable of estimating the positions of the visible UAVs. Specifically, the scenario includes left/right turn maneuvers in the horizontal plane at
s and
s, followed by pull up/pull down maneuvers in the vertical plane at
s and
s.
4.2. Simulation Test #2
For simulation test #2, consider the same scenario with 10 UAVs as in simulation test #1. The UAVs start from the initial positions specified in
Table 2 and fly in the same direction at a constant speed of 2 m/s, maintaining a fixed altitude of 100 m above the terrain. The trajectories of the UAVs, depicted in
Figure 5, include several maneuvers in the horizontal plane before reaching the destination point.
The primary objective of simulation test #2 is to evaluate the algorithm capability to resume trajectory estimation for UAVs following communication interruptions among the aircraft. To illustrate such a feature, the proposed scenario introduces communication link failures during the flight due to obstacle avoidance. Specifically, at two specified time intervals,
s and
s, lasting
s, the communication links among UAVs are disrupted. The affected aircraft include
,
,
and
.
Figure 6 displays the communication links before and after the interruptions, highlighting the formation of two sub-swarms. When the data transmission between
and
, as well as between
and
, is lost, the two sub-swarms reorganize into separate cycle graphs to accurately estimate the state of their neighboring UAVs [
42,
43].
Figure 7 depicts the estimated coordinates of
by
compared with the reference trajectory. As observed in the figure, the position estimation made by
is interrupted solely during the two designated time intervals of communication loss, and it is promptly resumed once the connection among the aircraft is re-established.
4.3. Simulation Test #3
Simulation test #3 aims to evaluate the navigation algorithm ability to estimate the position of a stationary target within the operational scenario. To validate its estimation performance, a comparison was conducted with a centralized version of the KF.
In this test, a sample scenario was considered, involving a formation of 10 UAVs flying together to survey the area of interest and locate the target. Each aircraft maintains a constant speed of 2 m/s, flying in the same direction at an altitude of 100 m above the terrain. Communication capabilities are limited to two neighboring agents, forming a cycle graph, while each UAV is equipped with an additional receiver to detect the signal emitted by the target within a range of
m. To account for the noise in the target transmitter signal, the simulation parameters outlined in
Table 1 are augmented with the bias and noise covariance values specified in
Table 5.
The aircraft navigate towards the target position located at
m, starting from their respective initial positions as presented in
Table 6 and
Figure 8.
Figure 9,
Figure 10 and
Figure 11 depict the estimated components of the target position vector, providing a comparison among the actual value, the estimation obtained from the centralized KF and that derived from the proposed decentralized KF. As observed in the figures, during the initial phase of the simulation, the target position remains unknown as it lies outside the sensor range of the UAVs.
At
s, the target enters the sensor range of
, but localization is not yet feasible. It is worth noting that for localization problems based on fusion algorithms, a minimum of three aircraft measurements is required to accurately determine the target location [
44,
45,
46].
From s, the target comes within the sensor range of , , and , thus becoming localizable.
The estimated target position definitively converges around s.
The estimation error made by the
i-th UAV, relative to the position of the target at the time instant
k, is:
where
indicates the position component of the target estimated by the aircraft
i. The average error in a time window composed by
time steps is
The mean absolute error
made by the
i-th UAV is defined as follows:
Similarly, it is possible to define the standard deviation
with respect to the average error as follows:
Table 7 shows mean absolute error and standard deviation computed by UAVs for any component
of the target position vector.
Figure 12 and
Figure 13 show the uncertain target volume detected by the proposed algorithm with respect to the centralized KF and the real target position.
4.4. Simulation Test #4
Simulation test #4 aims to assess the navigation algorithm’s capability to estimate the position of a stationary target within the operational scenario in the presence of communication delays among the swarm of UAVs. The same scenario shown in the previous subsection was considered (see
Table 6), where the communication channel among UAVs is not considered ideal but introduces a delay typical of wireless communication.
Considering the size of UAVs and the typical size of the considered scenarios, three different constant communication delays among aircraft in the formation were considered, s, s and s, to stress the procedure in the worst situations.
As already shown, results do not depend on the number of consensus steps when , while the communication scheme can give a performance improvement.
As a representative result,
Figure 14 shows the capability of the algorithm to localize the target also in the configuration
, considered as the worst possible because it shows the slowest convergence in previous results. It is worth noticing that the algorithm is robust to any communication delay until
s, with a convergence speed that decreases by increasing the length of the delay. Without delays, the estimation of the target position converges to the real value at
s (see
Figure 9,
Figure 10 and
Figure 11), while with a delay of
s, the convergence is assured only after
s.
4.5. Simulation Test #5, a Post-Avalanche Scenario
The primary objective of this section is to prove the effectiveness of the proposed navigation algorithm in a SAR mission for locating a missing skier. An illustrative example is shown in
Figure 15, where a swarm of UAVs identifies the missing individual, points towards the target and subsequently awaits the rescue phase by hovering over the skier.
To address this issue, consider a swarm of
N UAVs flying at a fixed altitude whose guidance algorithm is based on the DTG-based approach [
47]. This approach ensures that the the swarm remains distributed and scalable, without a leader–follower architecture, while maintaining a local triangular topology that is highly efficient in terms of the packing problem.
The integrated GNC scheme is depicted in
Figure 16, which provides an overview of the overall architecture and the information flow between the high-level DTG-based guidance and control and the consensus-based navigation modules.
The proposed scenario represents a simplified environment with no disturbances, such as wind. It is also assumed that the swarm is homogeneous, consisting of the same type of quadrotors. We have assumed perfect conditions for communication and reconnaissance of the search area. Therefore, no communication delays and no loss of precision over greater distances are considered. Furthermore, the take off and landing of UAVs are not modeled; it is assumed that all UAVs have already been launched at the beginning of the scenario and are flying at a fixed altitude. Battery consumption is not included in the model.
The case study takes into account real research techniques in post avalanche scenarios. The most known search strategies are the parallel track and creeping line search. They involve maintaining parallel tracks, with the former being parallel to the major axes and the latter parallel to the minor axes. These strategies are suitable for uncertain target locations and flat terrain, ensuring uniform coverage.
The typical post-avalanche zones to be surveyed are usually small [
48]; in this section, the operational scenarios are defined over an area of
m. Two scenarios are provided, with the same area and different positions of the skier, as summarized in
Table 8.
Given the search area, a formation of 5 UAVs was deployed, flying at a fixed altitude of 5 m and cruising at a speed of 3 m/s.
To cover the selected search area, two coverage techniques were explored, as they are the most suitable for this type of mission: the parallel track technique was used in scenario #1 (
Figure 17) and the creeping line in scenario #2 (
Figure 18). Both search paths have the same length and guarantee complete surveillance of the defined area.
UAV starting positions for both operational scenarios are given in
Table 9.
The ARTVA receiver is the device responsible for detecting the presence of the skier under the snow. Its operational range is currently about 60 m [
49]. However, although the most modern versions of these devices are valid and efficient, their performance in real operating conditions are degraded due to several factors, like ambient noise and interference with other electronic devices or other beacons. Considering an altitude of 5 m, the projected detection area over the terrain has a diameter of
m.
Table 10 shows the sensor biases and the measurement noise covariance matrices.
At the beginning of the simulation, it is assumed that each UAV knows only its position, initializing the overall state estimate with random values. Such an assumption is useful for evaluating algorithm convergence to the correct estimate and ability to reach the consensus.
The proposed scenarios were simulated in MATLAB routines with a step size of s, consensus steps and a simulation period of s.
Simulation parameters are presented in
Table 11.
Figure 19 represents the trajectories of the UAV swarm following the parallel track (
Figure 19a) and the creeping line (
Figure 19b) search patterns. Green solid lines indicate the obtained formation graph. The distance among aircraft tends to remain near the desired value,
, for all the duration of the simulation and for both the considered research methodologies, as also confirmed by the analysis of
Figure 20. The purpose of the swarm is to fly over the area near the missing person, once its position is identified. This behavior is highlighted by the continuous flight above the skier position at
s (see
Figure 17 and
Figure 18), as shown in
Figure 19.
Figure 21 and
Figure 22 show trajectories of
and
as estimated by
, compared with reference paths. As can be seen,
is always able to estimate its neighbors’ trajectories. It is worth remembering that each UAV in the swarm, at the beginning of the simulation, knows only its position, and the estimates start at random values. As can be seen, the estimates made by
converge to the correct values after a few time instants, necessary for the convergence of the consensus algorithm.
Figure 23,
Figure 24 and
Figure 25 show the effectiveness of the proposed algorithm in the estimation process of the skier position, comparing the real value with the estimation made by
, for both the analyzed case studies. As shown, at the beginning of the simulation the skier position is unknown, still being far from the range of the receivers onboard the UAVs.
Once the skier ARTVA transmitter enters the range of at least three aircraft receivers (at s (#1) and s (#2)), the estimates tend to the real position, converging definitively after the time necessary for the consensus phase.
In case study #1, it is worth noticing that a first convergence estimation is reached at
s. This is due to the particular research path; in fact, as shown in
Figure 19a, at
s the skier enters the sensor range of only one UAV of the swarm, whose measurement is still not sufficient to correctly estimate the position of the missing person (see
Section 4.4).
Table 12 shows mean absolute error (Equation (
53)) and standard deviation (Equation (
54)) for any component of the target position vector,
. These uncertainties in the estimation are graphically represented in
Figure 26 and
Figure 27 for both the scenarios.
The data presented in this section represent the results regarding the accuracy of the proposed algorithm, with standard deviations never exceeding 1 m and MAE values consistently below 0.1 m for the estimation of the X and Y coordinates. Furthermore, the potential application of the proposed GNC model in SAR scenarios is demonstrated. The aircraft can promptly estimate the position of the missing skier when trilateration is feasible. Additionally, the swarm consistently follows the predetermined search path, using the situational awareness provided by the local estimates.
5. Limitations and Discussion
Several numerical simulations were carried out to highlight the advantages and disadvantages of the proposed procedure in the following contexts: (1) the ability to achieve situational awareness regarding the position and speed of each agent in the formation; (2) the capability to localize a static target in a distributed manner using the transceivers on each UAV; (3) resilience to communication delays and interruptions; and (4) integration with a comprehensive guidance, navigation and control procedure, simulating a realistic post-avalanche search-and-rescue mission.
Regarding the ability to achieve situational awareness, a sensitivity analysis of the main parameters showed that the algorithm is nearly insensitive to the number of communication steps, at least when dealing with a limited number of UAVs. Instead, performance is primarily influenced by the communication scheme. As expected, the topology of the communication graph is crucial, representing how the agents share information among themselves. The communication scheme is considered the worst-case scenario and is thus employed in the other tests.
From the perspective of target localization capability, a comparison with a centralized EKF showed that the proposed algorithm achieves nearly identical performance in terms of target localization, as presented in
Table 7, for both the
and standard deviation
. The distributed procedure is only 5 s slower than the centralized one, as depicted in
Figure 9,
Figure 10 and
Figure 11.
The robustness to communication delays and interruptions was confirmed through numerical simulations. These assessments included evaluating the algorithm capability to resume UAV position estimation following disruptions in communication links at specified time intervals (
Figure 7) and showing the convergence capabilities in the presence of three different constant communication delays among UAVs (
Figure 14). As observed, the position estimation is temporarily interrupted during communication loss but promptly resumes once the connection among agents is re-established. The procedure remains resilient to communication delays, albeit affecting the convergence time for target localization, as expected.
Finally, the last set of simulation results proves the algorithm capability to be employed within a comprehensive guidance navigation and control procedure. It has been demonstrated to be useful in estimating UAV positions, which are essential for trajectory tracking, collision avoidance and target localization. The results show a localization error of less than 1 m, with no faults during the mission.
6. Conclusions
The main objective of this paper was to develop a distributed navigation algorithm for a swarm of UAVs to estimate the position of a target in the operational scenario. The aim of the algorithm is to provide situational awareness to the aircraft formation and eliminate the need for a central unit, thereby making it scalable and computationally efficient.
The proposed algorithm combined the advantages of the hybrid consensus on measurements and consensus on information (HCMCI) technique with the internodal transformation theory (ITT). This combination allowed for the distributed estimation of the swarm overall state and the detection of the target position. The algorithm was designed to be adaptable to various swarm architectures and environments, enabling the swarm to self-organize and adjust its configuration over time.
Numerical analyses were conducted to evaluate the algorithm effectiveness. A sensitivity analysis was performed to assess its performance by varying the main operational parameters and communication link schemes.
Results highlight that performance is not significantly improved by increasing the number of consensus steps which, on the other hand, increases the time required to execute the consensus algorithm. Furthermore, as expected, the graph topology can improve algorithm convergence by optimizing communication.
Regarding reliability, it is observed that communication delays can decrease the convergence capability, as intuitively expected. However, in realistic scenarios with small delays (approximately s), it does not adversely affect performance.
Furthermore, some simulation tests were also conducted to evaluate the algorithm’s ability to resume estimation after communication interruptions and to estimate the position of a fixed target. A comparison with a centralized version of the EKF was included to evaluate the performance trade-offs of the decentralized technique.
The proposed navigation algorithm successfully achieved the objectives outlined in this paper, demonstrating its capability to estimate the target position and maintain overall situational awareness.
Future research in this field should explore the possibility of detecting and isolating multiple targets, which would represent another important factor in enhancing the effectiveness of such algorithms in emergency scenarios.