Next Article in Journal
Automatic Calibration of Odometry and Robot Extrinsic Parameters Using Multi-Composite-Targets for a Differential-Drive Robot with a Camera
Next Article in Special Issue
Unsupervised Machine Learning for Advanced Tolerance Monitoring of Wire Electrical Discharge Machining of Disc Turbine Fir-Tree Slots
Previous Article in Journal
Optimization-Based Wi-Fi Radio Map Construction for Indoor Positioning Using Only Smart Phones
Previous Article in Special Issue
Signal Processing for Time Domain Wavelengths of Ultra-Weak FBGs Array in Perimeter Security Monitoring Based on Spark Streaming
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Network Distance-Based Simulated Annealing and Fuzzy Clustering for Sensor Placement Ensuring Observability and Minimal Relative Degree

by
Daniel Leitold
1,2,
Agnes Vathy-Fogarassy
1,2 and
Janos Abonyi
2,*
1
Department of Computer Science and Systems Technology, University of Pannonia, Egyetem u. 10, H-8200 Veszprém, Hungary
2
MTA-PE Lendület Complex Systems Monitoring Research Group, University of Pannonia, Egyetem u. 10., POB. 158, H-8200 Veszprém, Hungary
*
Author to whom correspondence should be addressed.
Sensors 2018, 18(9), 3096; https://doi.org/10.3390/s18093096
Submission received: 30 June 2018 / Revised: 11 September 2018 / Accepted: 11 September 2018 / Published: 14 September 2018
(This article belongs to the Special Issue Artificial Intelligence and Machine Learning in Sensors Networks)

Abstract

:
Network science-based analysis of the observability of dynamical systems has been a focus of attention over the past five years. The maximum matching-based approach provides a simple tool to determine the minimum number of sensors and their positions. However, the resulting proportion of sensors is particularly small when compared to the size of the system, and, although structural observability is ensured, the system demands additional sensors to provide the small relative order needed for fast and robust process monitoring and control. In this paper, two clustering and simulated annealing-based methodologies are proposed to assign additional sensors to the dynamical systems. The proposed methodologies simplify the observation of the system and decrease its relative order. The usefulness of the proposed method is justified in a sensor-placement problem of a heat exchanger network. The results show that the relative order of the observability is decreased significantly by an increase in the number of additional sensors.
PACS:
02.10.Yn; 02.30.Yy; 02.40.Pc; 02.40.Re; 02.50.Cw; 02.50.Sk; 07.05.Mh; 89.20.Ff; 89.75.-k; 89.75.Da; 89.75.Fb; 89.75.Hc; 89.75.Kd
MSC:
20N25; 28E10; 93B07; 93B51

1. Introduction

The placement of sensors significantly affects the performance of identification, state estimation as well as fault detection and isolation (FDI) algorithms. The goal-oriented placement of sensors for dynamical systems is a challenging task [1]. Parameter estimation-oriented information entropy-based optimal sensor placement was investigated in [2]. Based on robust information entropy, a Bayesian sequential sensor placement algorithm for multi-type of sensor is also proposed [3]. Computational Fluid Dynamics (CFD) models were generated to predict wind-flow that used the data of a sensor placement utilised with prediction-value joint entropy [4]. For fault detection and isolation, an incremental analytical redundancy relation (ARR)-based algorithm was introduced in [5]. Structural analysis-based sensor placement strategies place additional sensors to isolate otherwise undisguisable faults [6]. In the initial sensor-constrained problems, the number of sensors is given and the detection time has to be minimized [7], while, as far as time-constrained problems are concerned, the aim is to minimize the number of required sensors that ensure a predefined detection time. In the case of dynamical systems, the previously mentioned detection time is related to the relative degree of the system [8]. Besides good FDI performance, the optimal placement of the sensors should also ensure the observability of the systems [9].
Network representation-based structural controllability and observability analysis of dynamical systems is a hot topic in literature [10,11,12,13,14,15,16] since the introduction of this new approach [17]. The methodology utilizes the maximum matching algorithm to determine the minimum set of actuators and sensors required to grant structural controllability and observability for an arbitrary dynamical system. Although the results are very promising, the methodology can overestimate the number of necessary actuators [18] when the details of the edge dynamic are neglected [14].
When the observability analysis of the system is performed on a properly defined network, the problem concerning the observability analysis is reduced to a reachability problem [19]. In some cascaded systems, this highlights that the number of necessary sensors is significantly smaller than the size of the state variables, which makes the observation vulnerable and the relative degree, i.e., the minimum number of derivatives that is necessary to observe at least one signal of inputs, is very high. To deal with the vulnerability, Liu et al. proposed a methodology to grant robustness in the undirected representation of the system [20].
There is an almost infinite number of objectives that can be used to define a sensor placement problem [21]. In most of the cases, the optimization problem is formulated as that the instrumentation cost should be minimized and the precision of reconciled values should be ensured in a reliable, resilient and robust manner [22]. As the problem is NP-hard (non-deterministic polynomial-time hard), the mixed-integer programming-based approach is suitable to small-to-medium sized networks [23]. To handle the complexity of the problem, a heuristic approach is mostly followed. For example, to reconstruct the internal states of biochemical reaction systems with the minimum number of sensors, a graphical method has been developed [24]. Among population-based optimization algorithms, genetic algorithms (GAs) are the most frequently applied, e.g., GA was utilized to find a sensor configuration that minimizes cost and maximizes the reliability and observability of the system [25]. The main benefit of these gradient-free heuristic optimization methods is that they can be utilized with a wide range of models [26]. Simulated annealing (SA) is also widely employed optimization algorithm in sensor placement [27]. SA lends itself to be combined with other search algorithms, e.g., with local search heuristics [28] or other SA inner loop, which results in dual representation simulated annealing (DRSA) [29]. These techniques are often combined also in sensor placement, e.g., GA and SA were successfully integrated [30]. Recently, Particle Swarm Optimization (PSO) was applied in an optimal sensor location problem (OSLP) [31] and in the design of an ultrasonic Structural Health Monitoring (SHM) system [32]. According to these publications, heuristic optimization techniques are efficient in finding a small number of sensors with less time-consuming computations [33].
In this paper, to ensure observability and minimize the relative order of dynamical systems, two methodologies for sensor placement are introduced. The presented methods combine the metaheuristic SA optimization method with clustering. SA is chosen as it exhibits excellent levels of performance with regard to graph-represented combinatorial optimization (CO). The use of simulated annealing is strengthened by the fact that the relative degree of the system can be interpreted as a path in a network, and simulated annealing models a random walk on the search graph. Based on this concept, SA was applied to place micro-hydropowers in a water supply network [34] and network alignment in biological systems [35].
As sensors practically group the observed state variables, sensor placement can be considered as a special k-medoid clustering problem. This concept has been utilized in Ref. [36], where fuzzy c-means clustering algorithm was utilized to characterize the spatial distribution of the sensor placement problem.
The Clustering Large Applications (CLASA) algorithm [37] developed as a fast and robust solution to the well-known k-medoid clustering problem. The fundamental idea of the paper is that instead of considering the distance between the cluster centers and the clustered objects, a goal-oriented sensor-placement algorithm can be derived by the introduction of a problem-relevant objective function into the scheme of the CLASA algorithm. The second novelty of the paper is that the random search of the resulted algorithm is fine-tuned by calculating the selection probabilities based on network distance-related fuzzy membership functions.
The structure of the paper is the following. In Section 2.1, the theoretical foundations and nomenclature of the relative degree of multiple-input and multiple-output (MIMO) dynamical systems are introduced. Section 2.2 presents the maximum matching-based sensor placement algorithm that ensures structural observability. In Section 2.3, the proposed optimization algorithms are introduced. Section 3 provides examples of application. Finally, Section 4 concludes the paper.

2. Sensor Placement to Ensure Observability and Minimal Relative Order

In Section 2.1, the relative degree of MIMO systems is defined and the sensor placement task is considered to be an optimization problem. Section 2.2 presents the first step of our methodology, which places the sensors as the structural observability of the system is guaranteed. Section 2.3 presents the proposed simulated annealing-based clustering algorithms that optimally place the additional sensors to minimize the relative degree of the system.

2.1. Problem Formulation

A nonlinear MIMO system can be represented by state-space model:
x ˙ ( t ) = f ( x ( t ) ) + g ( x ( t ) ) u ( t ) ,
y ( t ) = h ( x ( t ) ) ,
where the vectors x , u and y stand for the state variables, inputs and outputs, respectively.
The model can be linearized and the resultant linear approximation represented by a linear state-space model [38]:
x ˙ ( t ) = A x ( t ) + B u ( t ) ,
y ( t ) = C x ( t ) .
When the number of state variables is denoted by N, the number of actuators by M and the number of sensors by K, then the matrices A N × N , B N × M and C K × N define how state variables influence each other, how the actuators influence the state variables, and how the sensors record the state variables, respectively.
For each pair of output y i and input u j ( i = 1 , , K and j = 1 , , M ), the relative degree of the system r i , j can be defined as the minimum number of derivatives of y i ( t ) that is directly influenced by the change in u j ( t ) . The relative degree for an arbitrary output i is defined as r i = min j = 1 M r i , j , which requires the observer to observe the effect of at least one of the inputs.
When the aim of the observer is to estimate the effect of the d ( t ) N disturbance vector that independently influences all state variables,
x ˙ ( t ) = A x ( t ) + E d ( t ) ,
y i ( t ) = C x ( t ) ,
then E is an identity matrix I N and the relative order of the system is r = m a x i = 1 K r i .
As Figure 1 illustrates the relative degree of an x j state variable can be interpreted as the d i s t ( i , j ) length of the path between the j-th state variable and the nearest sensor, C ( i ) , selected as arg min j d i s t ( i , j ) , while the relative order of the whole system is defined by the maximum of these minimal distances.
As the generation of a balanced placement of sensors is of interest, the set of sensor nodes S is determined by minimizing the following cost function:
min S c o s t ( G , S , β ) = β max i = 1 K r i + ( 1 β ) i = 1 K r i N ,
where parameter β = [ 0 , 1 ] weights between the maximum and the balance-related average of the relative order of the system.

2.2. Placing Sensors to Ensure Observability

Besides the minimisation of the relative order, the observability of the system should also be ensured. Based on the Kalman rank criterion, a linear dynamical system is said to be structurally observable, if and only if the observability matrix O = [ C ] T , [ CA ] T , [ CA 2 ] T , , [ CA N 1 ] T T is of full rank ( r a n k ( O ) = N ) [39].
Based on the state-transition matrix A , a graph G ( V , E ) can be constructed where the set of vertices V represent the state variables and the edges E = V × V are determined by the nonzero elements of A (see Figure 2). In the graph representation, the relative degree r i , j can be defined as the distance between the input/disturbance d j and the output y i , which is the distance between the j-th and i-th nodes in the directed network G ( V , E ) , so the the optimization problem can be considered as a special graph partitioning problem. This graph-based representation is beneficial as the maximum matching of M M E can be used to determine the minimal number and places of the sensor nodes that are required to ensure the observability of the system.
As directed graphs can be represented as bipartite graphs, the maximum matching of the related bipartite graph was studied [14,17]. The endpoints of the directed edges in M M are the matched nodes ( V ( M M ) ), while others are the unmatched nodes ( V ( M M ) ¯ ), thus V ( M M ) V ( M M ) ¯ = V . Based on [17], the unmatched nodes V ( M M ) ¯ of G ( V , E ) , where E is determined by A T , define the driver nodes, i.e., where the actuators should be placed. Furthermore, unmatched nodes of G ( V , E ) , where E is defined by A , should be the sensor nodes. In other words, the nodes where sensors should be placed. If the determined driver nodes or sensor nodes are assigned to the system, then it becomes a structurally controllable or structurally observable system [14]. As the aim of this paper is to deal with system observability, G ( V , E ) was used with the adjacency matrix based on A . To ensure the structural observability of the system, the path-finding method was used to define the minimum number of mandatory sensors and their location [14].
In order to generate the output matrix C such that the relative order is minimal and the system observable, first the unmatched set of nodes S f is generated, followed by the set of candidate sensor nodes S c . The cardinality of the candidate sensor nodes is K + ( K + = | S c | ). The resulting C is designed based on the set of sensor nodes S = S f S c such that for each x i S , c j is a one-hot row vector whose i-th element is non-zero, so C = [ c 1 T , , c K T ] T .
As the number of sensors which can be allocated without any restrictions is K + , K + ! | V \ S f | ! ( | V \ S f | K + ! ) different combinations of possible placements of sensors exists. To solve this NP-hard problem, simulated annealing-based heuristic optimization algorithms are proposed in the following section.

2.3. Simulated Annealing and Fuzzy Clustering-based Output Configuration Design

To optimize the relative order and smoothness of these orders, the objective function (Equation (7)), which can be considered as a linear combination of the maximum and average distance of the sensors from the observed state variables was defined. Based on this interpretation, the minimisation problem can also be seen as a k-medoid clustering problem, where the centroid elements are the sensor nodes and the members of the clusters are the observed state variables.
To determine the locations of the additional sensors two approaches are proposed inspired by Clustering Large Applications based on Simulated Annealing algorithm (CLASA) [37] and the Geodesic Distance-based Fuzzy c-Medoid Clustering method (GDFCM) [40]. In the first algorithm, the CLASA algorithm is modified as follows. The original objective function of CLASA that calculates the distances between the objects and the medoids (in our case, the distances between the state variables and the sensors) is replaced by what was proposed in Equation (7). With the new objective function, not just the minimum relative order can be granted, but different sensor configurations with the same relative order can be distinguished to balance the load of the sensors. The search mechanism should be fine-tuned as the medoids of the fixed sensors ( S f ) have fixed positions to ensure the observability of the system. The second algorithm enhances the random search of the resulted mCLASA algorithm by the introduction of distance-dependent selection probability calculated based on the Geodesic Distance-based Fuzzy c-Medoid Clustering method (GDFCM) [40].
In both algorithms, firstly the S f fixed sensor nodes are determined to grant the observability property based on the unmatched set of nodes. Then, the candidate sensor nodes S c are generated randomly, and the cost is calculated. Following this, in each iteration, one of the candidate sensor nodes ( s ) is randomly changed, and the new cost is calculated. From this, the difference between the new cost and the previously applied cost is calculated ( Δ ). The new placement of the sensors is accepted if Δ < 0 .
If Δ 0 , the worsened placement of the sensor is accepted only with the probability defined as exp ( Δ T i ) , where T i denotes the temperature of the cooling process of the SA in iteration i, whose dynamics can be seen in Equation (8). This randomization of the search allows for the algorithm to explore the search space and converge with the increase of the number of iterations:
T i + 1 = α T i , α = T m i n T m a x 1 / m a x i t e r .
The pseudo code of the proposed modified Clustering Large Applications based on Simulated Annealing algorithm (mCLASA) can be seen in Algorithm 1. The algorithm possesses the following inputs and parameters: G denotes the network representation of the system from Equation (5), β ( 0 , 1 ) stands for the parameter of the cost function from Equation (7), K + represents the number of additional sensors, m a x i t e r is the number of iterations of the simulated annealing, T m a x denotes the maximum, i.e., the initial temperature, and T m i n stands for the minimum temperature. The following values were used for the parameters: β = 0.5 , m a x i t e r = 500 , T m a x = 10 and T m i n = 10 2 .
Algorithm 1 Pseudo code of the modified CLASA algorithm (mCLASA).
1:
proceduremCLASA(G, β , K + , m a x i t e r , T m a x , T m i n )
2:
     α = T m i n T m a x 1 / m a x i t e r , T 1 = T m a x
3:
     S f = g e t S e n s o r s ( G )               ▷ using the path finding method: [14]
4:
    Let S c be the set of the K + randomly chosen elements from V \ S f
5:
     c o s t = c o s t ( G , S f S c , β )                        ▷Equation (7)
6:
    for i = 1 to m a x i t e r do
7:
        Let s be a randomly selected element from S c
8:
         S c = S c \ { s }
9:
        Let s + be a randomly chosen node from V \ ( S f S c )
10:
         S c = S c { s + }
11:
         n e w c o s t = c o s t ( G , S f S c , β )
12:
         Δ = n e w c o s t c o s t
13:
        if Δ < 0 then
14:
            c o s t = n e w c o s t
15:
            S c = S c
16:
        else
17:
           if r a n d o m ( ) < exp ( Δ T i ) then
18:
                c o s t = n e w c o s t
19:
                S c = S c
20:
           end if
21:
        end if
22:
         T i + 1 = α T i
23:
    end for
24:
    return c o s t
25:
end procedure
The second method utilizes the GDFCM method [40] to extend the previously introduced mCLASA method. This extension increases the probability that in each iteration potentially better neighboring sensors are selected to replace the sensors placed in the previous iteration steps s . The algorithm considers the sensors as cluster centers (medoids), determines the cluster assignments of the state variables, and swaps the medoid (the sensor) with a randomly selected state variable. The random selection utilizes distance-based fuzzy membership values. The cluster memberships μ i , j of the state variables R ( s j ) are determined by the fuzzy membership function (see Equation (9) in the description of the algorithm), where the distances are calculated in the undirected version of the graph ( G u ). A roulette wheel selection [41] selects the new medoid according to the membership values. The search space of the algorithm is also controlled by the fuzzy exponent m which is decreased in each iteration by α m , in the same manner as α in Equation (8).
The pseudo code of the suggested Geodesic Distance-based Fuzzy c-medoid Clustering with Simulated Annealing algorithm (GDFCMSA) can be seen in Algorithm 2. The algorithm possesses the following inputs and parameters: G denotes the network representation of the system, β ( 0 , 1 ) stands for the balance parameter of the cost function, K + represents the number of additional sensors, m a x i t e r is the number of iterations of the simulated annealing, m m a x denotes the maximum fuzzy component value and m m i n stands for the minimum fuzzy component value. The following values were used for the parameters: β = 0.5 , m a x i t e r = 500 , T m a x = 10 , T m i n = 10 2 , m m a x = 10 and m m i n = 2 .
Algorithm 2 Pseudo code of the Geodesic Distance-based Fuzzy c-Medoid Clustering with Simulated Annealing algorithm (GDFCMSA).
1:
procedureGDFCMSA(G, β , K + , m a x i t e r , T m a x , T m i n , m m a x , m m i n )   
2:
     α = T m i n T m a x 1 / m a x i t e r , T 1 = T m a x , α m = m m i n m m a x 1 / m a x i t e r , m 1 = m m a x
3:
     S f = g e t S e n s o r s ( G )                 ▷ using the path finding method: [14]  
4:
    Let S c be the set of the K + randomly chosen state elements from V \ S f
5:
     c o s t = c o s t ( G , S f S c , β )                         ▷ Equation (7)
6:
    for i = 1 to m a x i t e r do
7:
        Calculate the fuzzy membership values for G u , 1 i K and 1 j N :
μ i , j = 1 k = 1 K r i , j r k , j 2 m i 1
8:
        Let s be a randomly selected sensor node from S c
9:
         S c = S c \ { s }
10:
         R ( s ) = { x j | x j V , s = arg max k = 1 K μ k , j }
11:
        for all s j R ( s ) do
12:
            P ( s j ) = μ s , s j
13:
        end for
14:
         s + = { x j | x j = r o u l e t t e _ w h e e l _ s e l e c t i o n ( P ) }
15:
         S c = S c { s + }
16:
         n e w c o s t = c o s t ( G , S f S c , β )
17:
         Δ = n e w c o s t c o s t
18:
        if Δ < 0 then
19:
            c o s t = n e w c o s t
20:
            S c = S c
21:
        else
22:
           if r a n d o m ( ) < exp ( Δ T i ) then
23:
                c o s t = n e w c o s t
24:
                S c = S c
25:
           end if
26:
        end if
27:
         T i + 1 = α T i , m i + 1 = α m m i
28:
    end for
29:
    return c o s t
30:
end procedure

3. Results

In this section, four case studies will be introduced to demonstrate the applicability of the previously presented methods. Firstly, the details of the case studies will be given. We evaluate the proposed methods from two aspects. Firstly, the effect of the additional sensor nodes on the cost function and to the relative order of the system is studied; secondly, the speed of converge is analyzed.

3.1. Description of the Case Studies

The first case study is related the control of Heat Exchanger Networks (HENs). HENs are widely studied dynamical systems because the complexity of interlinked heat exchangers requires advanced process monitoring and control algorithms. The network topology of the studied HEN is shown in Figure 3. The network consists of six hot streams, two cold streams, ten heat exchanger cells and two utility coolers [42]. The state-transition matrix A of the problem can be easily determined based on the structure of the process [19], which results in results in 22 state variables.
Following the detailed analysis of the problem with regard to the placement of sensors in the HEN, three other dynamical systems will be analyzed to illustrate the applicability of the methods on larger examples. These benchmark examples are not typical control-relevant problems as they are used to study model-reduction algorithms [43], but these problems illustrate how the algorithms are able to detect structurally efficient placements. The first two additional examples are based on Modified Nodal Analysis (MNA) [44] and have 578 and 980 state variables, respectively. The last case study is based on the state space model of a partial element equivalent circuit (PEEC) of a patch antenna structure with 172 inductances, 6990 mutual inductances, and 2100 capacitances that define 480 state variables [45]. Further information about the case studies can be found in the Supplementary Material of the paper.
The parameters of the resulted case studies are summarized in Table 1. The network representations of the case studies and the resulted sensor placements are presented in the Supplementary Material of the paper.
As random search-based techniques were evaluated, each result was evaluated based on 100 independent runs of the algorithms. The parameters of the algorithms were the same in each scenario: β = 0.5 , T m a x = 10 , T m i n = 10 2 , m m a x = 10 , m m i n = 2 . The reduction rates α and α m were determined by Equation (8), as can be seen in Algorithms 1 and 2.

3.2. Effect of the Additional Nodes and the Robustness of the Optimization Algorithms

In all the examples, firstly the S f set of fixed sensors that are required to ensure the observability of the system is determined. The number of sensors added to improve the dynamical properties of the system is represented by the cardinality of | K + | .
To evaluate the robustness of the random search-based algorithms, the results of 100 independent runs were visualized in boxplots to show the distribution of the optimized cost functions. As can be seen in Figure 4 and Figure 5, the algorithms are consistent in most of the cases; however, the presence of outliers show that the algorithms can fail in local optimums.
It is visible that in all cases both the cost functions and the relative orders of the observed state variables decrease significantly as the number of sensors increases, so a relatively small quantity of additional sensors can cause a noticeable improvement in terms of the relative order compared to the observer designed only to ensure observability. It can also be pointed out that additional sensors ( K + = 25 or 35) do not significantly reduce the order of the system.
The heights of the box plots show the interquartile ranges of the optimized cost functions. Higher box plots reflect higher uncertainty in the solutions which could indicate the increased difficulty of the sensor placement problem. For example, in Figure 4, for K + = 7 , the boxplot shows a much wider distribution than for other K + that can be explained there is a transient in the structure of the solutions.

3.3. Convergence Analysis

In the second phase of the analysis, the convergence of the proposed methods was tested. In Figure 6, the convergences of the methods can be seen for the HEN case study with the parameter setting K + = 3 . The results and the network representation of the other case studies are presented in the Supplementary Material. In the figure, it can be seen that how fast the methods converge in terms of the number of iterations. With the addition of three sensors, the cost function was reduced by more than 50% over the 500 iterations, which illustrate well the benefit of the proposed methods.
In this figure, it can also be seen that it is worth selecting a high value of the parameter m a x i t e r because, even during the 500th iteration, the cost functions still show a small decrease.
From this figure, it is evident that the fuzzy extension of the algorithm improves the speed of convergence which is also reflected from the slightly better performances shown in Figure 4 and Figure 5.
The durations of the calculations were also measured with m a x i t e r = 500 to provide an illustrative comparison between the algorithm and the complexity of the problem. The results are presented in Table 2. The durations of calculations are valid for our MATLAB implementation (R2016a, The MathWorks, Natick, MA, USA) when run on a notebook using Windows 10 (Microsoft, Redmond, WA, USA), an Intel Core i7-6600U processor (Santa Clara, CA, USA) and 16 GB of RAM.
The short running times shown in Table 2 raise the question of whether the exhaustive enumeration of the solutions could handle the problem and if the heuristic search is necessary. The problem is combinatorially complex, for each K + N | S f | K + possible solutions should be examined. In the HEN case study for K + = 9 , the SA examines only 500 solutions instead of the 92,378 possible configurations. For larger problems, the difference is more significant, e.g., the example of MNA_4 with K + = 35 would require 978 35 2.4 × 10 64 function evaluations, although this problem cannot be considered as a huge system.

4. Conclusions

To ensure the structural observability in addition to fast and robust observer response, two clustering- and simulated annealing-based methods were proposed.
Additional sensors are placed into the system based on the CLASA algorithm by the mCLASA algorithm. The placements of additional sensors is further improved by the Geodesic Distance-based Fuzzy c-Medoid Clustering with Simulated Annealing algorithm (GDFCMSA) as their positioning is based on a geodesic distance-based fuzzy membership functions. Simulated annealing is applied by both algorithms to minimize the cost function, which in turn minimizes the maximum and average of the relative orders of the system to generate a balanced placement of sensors.
A slightly better solution is provided by the GDFCMSA algorithm than the mCLASA method at the expense of some additional computational resources for the evaluation of the membership degrees. Solutions to massive problems are generated by both methods over a short period of time, which illustrates the applicability of these methods in an industrial setting.
Although this paper deals with the placement of sensors, the proposed methodology can be applied to control configuration design as well, since this is the dual problem of the design of the output configuration.

Supplementary Materials

The supplementary material of the paper (additional figures, MATLAB source codes, the details of the networks) is available from https://www.abonyilab.com/network-science/controlability-and-observability.

Author Contributions

D.L. reviewed the literature concerning the observation of dynamical systems, developed the algorithms, in addition to designing, performing and evaluating the experiments. Moreover, he wrote the related sections of the article. A.V.-F. reviewed the literature concerning the applied clustering and artificial intelligence methods, participated in the formalization of the formulas and algorithms and wrote as well as corrected the related sections of the paper. J.A. conceived and designed the core concept, developed the algorithms and proofread the paper.

Funding

This research was supported by the National Research, Development and Innovation Office NKFIH, through the project OTKA-116674 (process mining and deep learning in the natural sciences and process development) and the EFOP-3.6.1- 16-2016- 00015 Smart Specialization Strategy (S3) Comprehensive Institutional Development Program.

Conflicts of Interest

The authors declare no conflicts of interest. The founding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; nor in the decision to publish the results.

Abbreviations

Abbreviations used in this article:
CLASA:Clustering Large Applications based on Simulated Annealing
GDFCM:Geodesic Distance-based Fuzzy c-Medoid Clustering method
GDFCMSA:Geodesic Distance-based Fuzzy c-Medoid Clustering with Simulated Annealing method
HEN:Heat Exchanger Network
mCLASA:modified Clustering Large Applications based on Simulated Annealing
MIMO:Multiple-input and multiple-output
SA:Simulated Annealing

Nomenclature

x vector of state variables
u vector of inputs
d vector of disturbances
y vector of outputs
A state-transition matrix
B input matrix
E disturbance matrix
C output matrix
O observability matrix
Nnumber of state variables
Mnumber of inputs
Knumber of outputs
K + number of additional sensor nodes
b j jth column of matrix B
b j jth row of matrix B
r i , j relative degree of y i and u j
r i relative degree of y i
rrelative order of the system
G ( V , E ) network representation of the system
Vset of vertices of the network representation of the system
Econnections between the state variables
G u undirected representation of G
Sset of sensor nodes
S f set of fixed sensor nodes
S c set of candidate sensor nodes
β weighting parameter of cost function
M M edges of a maximum matching
V ( M M ) matched nodes
V ( M M ) ¯ unmatched nodes
Δ difference between cost functions in SA
T m a x , T m i n maximum and minimum temperature in SA
m m a x , m m i n maximum and minimum fuzzy exponent in SA
T i temperature in iteration i
m i fuzzy exponent in iteration i
α reduction rate of temperature
α m reduction rate of the fuzzy exponent
m a x i t e r number of iterations of SA
R ( s j ) set of the state variables of the cluster of sensor node s j
μ i , j fuzzy membership function

References

  1. Chmielewski, D.J.; Palmer, T.; Manousiouthakis, V. On the theory of optimal sensor placement. AIChE J. 2002, 48, 1001–1012. [Google Scholar] [CrossRef]
  2. Papadimitriou, C.; Lombaert, G. The effect of prediction error correlation on optimal sensor placement in structural dynamics. Mech. Syst. Signal Process. 2012, 28, 105–127. [Google Scholar] [CrossRef]
  3. Yuen, K.V.; Kuok, S.C. Efficient Bayesian sensor placement algorithm for structural identification: A general approach for multi-type sensory systems. Earthq. Eng. Struct. Dyn. 2015, 44, 757–774. [Google Scholar] [CrossRef]
  4. Papadopoulou, M.; Raphael, B.; Smith, I.F.; Sekhar, C. Hierarchical sensor placement using joint entropy and the effect of modeling error. Entropy 2014, 16, 5078–5101. [Google Scholar] [CrossRef]
  5. Rosich, A.; Sarrate, R.; Puig, V.; Escobet, T. Efficient optimal sensor placement for model-based FDI using an incremental algorithm. In Proceedings of the 46th IEEE Conference on Decision and Control, New Orleans, LA, USA, 12–14 December 2007; IEEE: Piscataway, NJ, USA, 2007; pp. 2590–2595. [Google Scholar]
  6. Düştegör, D.; Frisk, E.; Cocquempot, V.; Krysander, M.; Staroswiecki, M. Structural analysis of fault isolability in the DAMADICS benchmark. Control Eng. Pract. 2006, 14, 597–608. [Google Scholar] [CrossRef]
  7. Berger-Wolf, T.Y.; Hart, W.E.; Saia, J. Discrete sensor placement problems in distribution networks. Math. Comput. Model. 2005, 42, 1385–1396. [Google Scholar] [CrossRef]
  8. Isidori, A. Nonlinear Control Systems; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
  9. Boukhobza, T.; Hamelin, F. State and input observability recovering by additional sensor implementation: A graph-theoretic approach. Automatica 2009, 45, 1737–1742. [Google Scholar] [CrossRef]
  10. Yan, G.; Tsekenis, G.; Barzel, B.; Slotine, J.J.; Liu, Y.Y.; Barabási, A.L. Spectrum of controlling and observing complex networks. Nat. Phys. 2015, 11, 779–786. [Google Scholar] [CrossRef] [Green Version]
  11. Ruths, J.; Ruths, D. Control profiles of complex networks. Science 2014, 343, 1373–1376. [Google Scholar] [CrossRef] [PubMed]
  12. Pósfai, M.; Liu, Y.Y.; Slotine, J.J.; Barabási, A.L. Effect of correlations on network controllability. Sci. Rep. 2013, 3, 1067. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Leitold, D.; Vathy-Fogarassy, Á.; Süle, Z.; Manchin, R.; Abonyi, J. Measuring sustainability—Network science based analysis of water resources models. In Proceedings of the 7th EDSI Conference, Helsinki, Finland, 24–27 May 2016; p. 189. [Google Scholar]
  14. Leitold, D.; Vathy-Fogarassy, Á.; Abonyi, J. Controllability and observability in complex networks—The effect of connection types. Sci. Rep. 2017, 7, 151. [Google Scholar] [CrossRef] [PubMed]
  15. Zhang, X.; Lv, T.; Yang, X.; Zhang, B. Structural controllability of complex networks based on preferential matching. PLoS ONE 2014, 9, e112039. [Google Scholar] [CrossRef] [PubMed]
  16. Kang, L.; Liu, Y. Design of Control Structure for Integrated Process Networks Based on Graph-Theoretic Analysis. Chem. Eng. Trans. 2017, 61, 1711–1716. [Google Scholar]
  17. Liu, Y.Y.; Slotine, J.J.; Barabási, A.L. Controllability of complex networks. Nature 2011, 473, 167–173. [Google Scholar] [CrossRef] [PubMed]
  18. Müller, F.J.; Schuppert, A. Few inputs can reprogram biological networks. Nature 2011, 478, E4–E5. [Google Scholar] [CrossRef] [PubMed]
  19. Varga, E.; Hangos, K.; Szigeti, F. Controllability and observability of heat exchanger networks in the time-varying parameter case. Control Eng. Pract. 1995, 3, 1409–1419. [Google Scholar] [CrossRef]
  20. Liu, X.; Mo, Y.; Pequito, S.; Sinopoli, B.; Kar, S.; Aguiar, A.P. Minimum robust sensor placement for large scale linear time-invariant systems: A structured systems approach. IFAC Proc. Vol. 2013, 46, 417–424. [Google Scholar] [CrossRef]
  21. Sharon, L.P.; Rex, K.K. Optimization Strategies for Sensor and Actuator Placement; Technical Report; NASA Langley Research Center: Hampton, VA, USA, 1999.
  22. Bagajewicz, M.J. Design and retrofit of sensor networks in process plants. AIChE J. 1997, 43, 2300–2306. [Google Scholar] [CrossRef]
  23. Watson, J.P.; Hart, W.E.; Berry, J.W. Scalable high-performance heuristics for sensor placement in water distribution networks. In Impacts of Global Climate Change; American Society of Civil Engineers: Reston, VA, USA, 2005; pp. 1–12. [Google Scholar]
  24. Liu, Y.Y.; Slotine, J.J.; Barabási, A.L. Observability of complex systems. Proc. Natl. Acad. Sci. USA 2013, 110, 2460–2465. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  25. Carballido, J.A.; Ponzoni, I.; Brignole, N.B. CGD-GA: A graph-based genetic algorithm for sensor network design. Inf. Sci. 2007, 177, 5091–5102. [Google Scholar] [CrossRef]
  26. Aguirre-Salas, L.; Begovich, O.; Ramirez-trevino, A. Sensor assignment for observability in interpreted petri nets. IFAC Proc. Vol. 2004, 37, 441–446. [Google Scholar] [CrossRef]
  27. Chiu, P.; Lin, F.Y. A simulated annealing algorithm to support the sensor placement for target location. In Proceedings of the 2004 Canadian Conference on Electrical and Computer Engineering, Niagara Falls, ON, USA, 2–5 May 2004; IEEE: Piscataway, NJ, USA, 2004; Volume 2, pp. 867–870. [Google Scholar]
  28. Martin, O.C.; Otto, S.W. Combining simulated annealing with local search heuristics. Ann. Oper. Res. 1993, 63, 57–75. [Google Scholar] [CrossRef]
  29. Torres-Jimenez, J.; Izquierdo-Marquez, I.; Garcia-Robledo, A.; Gonzalez-Gomez, A.; Bernal, J.; Kacker, R.N. A dual representation simulated annealing algorithm for the bandwidth minimization problem on graphs. Inf. Sci. 2015, 303, 33–49. [Google Scholar] [CrossRef]
  30. Worden, K.; Burrows, A. Optimal sensor placement for fault detection. Eng. Struct. 2001, 23, 885–901. [Google Scholar] [CrossRef]
  31. Qin, B.Y.; Lin, X.K. Optimal sensor placement based on particle swarm optimization. Adv. Mater. Res. Trans. Technol. Publ. 2011, 271, 1108–1113. [Google Scholar] [CrossRef]
  32. Blanloeuil, P.; Nurhazli, N.A.; Veidt, M. Particle swarm optimization for optimal sensor placement in ultrasonic SHM systems. In Nondestructive Characterization and Monitoring of Advanced Materials, Aerospace, and Civil Infrastructure 2016; International Society for Optics and Photonics: Bellingham, WA, USA, 2016; Volume 9804, p. 98040E. [Google Scholar]
  33. Zhang, X.; Li, J.; Xing, J.; Wang, P.; Yang, Q.; Wang, R.; He, C. Optimal sensor placement for latticed shell structure based on an improved particle swarm optimization algorithm. Math. Probl. Eng. 2014, 2014. [Google Scholar] [CrossRef]
  34. Samora, I.; Franca, M.J.; Schleiss, A.J.; Ramos, H.M. Simulated annealing in optimization of energy production in a water supply network. Water Res. Manag. 2016, 30, 1533–1547. [Google Scholar] [CrossRef]
  35. Mamano, N.; Hayes, W. SANA: Simulated Annealing Network Alignment Applied to Biological Networks. arXiv, 2016; arXiv:1607.02642. [Google Scholar]
  36. Zhang, X.X.; Li, H.X.; Qi, C.K. Spatially constrained fuzzy-clustering-based sensor placement for spatiotemporal fuzzy-control system. IEEE Trans. Fuzzy Syst. 2010, 18, 946–957. [Google Scholar] [CrossRef]
  37. Chu, S.C.; Roddick, J.F.; Pan, J.S. A Comparative Study and Extension to K-Medoids Algorithms; Contemporary Development Company: Hongkong, China, 2001; pp. 1–10. [Google Scholar]
  38. Sastry, S.S.; Isidori, A. Adaptive control of linearizable systems. IEEE Trans. Autom. Control 1989, 34, 1123–1131. [Google Scholar] [CrossRef]
  39. Kalman, R.E. Mathematical description of linear dynamical systems. J. Soc. Ind. Appl. Math. Ser. A Control 1963, 1, 152–192. [Google Scholar] [CrossRef]
  40. Király, A.; Vathy-Fogarassy, Á.; Abonyi, J. Geodesic distance based fuzzy c-medoid clustering–searching for central points in graphs and high dimensional data. Fuzzy Sets Syst. 2016, 286, 157–172. [Google Scholar] [CrossRef]
  41. Goldberg, D.E. Genetic Algorithms in Search, Optimization, and Machine Learning; Addison-Wesley: Boston, MA, USA, 1989. [Google Scholar]
  42. Westphalen, D.L.; Young, B.R.; Svrcek, W.Y. A controllability index for heat exchanger networks. Ind. Eng. Chem. Res. 2003, 42, 4659–4667. [Google Scholar] [CrossRef]
  43. Chahlaoui, Y.; Van Dooren, P. A Collection of Benchmark Examples for Model Reduction of Linear Time Invariant Dynamical Systems; The University of Manchester: Manchester, UK, 2002; pp. 1–26. [Google Scholar]
  44. Odabasioglu, A.; Celik, M.; Pileggi, L.T. PRIMA: Passive reduced-order interconnect macromodeling algorithm. In Proceedings of the 1997 IEEE/ACM International Conference on Computer-Aided Design, Santa Clara, CA, USA, 9–13 November 1997; pp. 58–65. [Google Scholar]
  45. Heeb, H.; Ruehli, A.E.; Bracken, J.E.; Rohrer, R.A. Three dimensional circuit oriented electromagnetic modeling for VLSI interconnects. In Proceedings of the IEEE 1992 International Conference on Computer Design: VLSI in Computers and Processors, ICCD’92, Cambridge, MA, USA, 11–14 October 1992; pp. 218–221. [Google Scholar]
Figure 1. Illustration of the concept of relative degrees of the state variables and how one additional sensor decreases the relative degree of the system. (a) relative degrees in the case of two sensors that are necessary to grant observability; (b) relative degrees in the case of one additional sensor.
Figure 1. Illustration of the concept of relative degrees of the state variables and how one additional sensor decreases the relative degree of the system. (a) relative degrees in the case of two sensors that are necessary to grant observability; (b) relative degrees in the case of one additional sensor.
Sensors 18 03096 g001
Figure 2. (a) example system x ( t ) = A x ( t ) ; (b) graph representation of (a); state variables { x 1 , x 2 , x 3 } are matched by M M = { a 13 , a 21 , a 32 } = { ( x 1 , x 3 ) , ( x 2 , x 1 ) , ( x 3 , x 2 ) } and the system is observable, since a sensor on x 4 can observe all state variables. If x 4 is removed from the system, then V ( M M ) ¯ is empty, which would mean that no sensor node could be identified as belonging to the system. This lack of the maximum matching-based approach can be eliminated by the path-finding method [14].
Figure 2. (a) example system x ( t ) = A x ( t ) ; (b) graph representation of (a); state variables { x 1 , x 2 , x 3 } are matched by M M = { a 13 , a 21 , a 32 } = { ( x 1 , x 3 ) , ( x 2 , x 1 ) , ( x 3 , x 2 ) } and the system is observable, since a sensor on x 4 can observe all state variables. If x 4 is removed from the system, then V ( M M ) ¯ is empty, which would mean that no sensor node could be identified as belonging to the system. This lack of the maximum matching-based approach can be eliminated by the path-finding method [14].
Sensors 18 03096 g002
Figure 3. (a) the structure of the studied heat exchanger network represents how cold and hot streams influence each other; (b) the network representation of the state variables are related to the temperatures of the streams depicted in (a). Colors indicate strongly connected components.
Figure 3. (a) the structure of the studied heat exchanger network represents how cold and hot streams influence each other; (b) the network representation of the state variables are related to the temperatures of the streams depicted in (a). Colors indicate strongly connected components.
Sensors 18 03096 g003
Figure 4. The values of the cost functions and relative degrees with regard to the number of added sensors in the case studies HEN and MNA_1. The box plots show the interquartile ranges and the medians of the cost functions, while the scatter plots visualize the minimum value of the relative order for 100 independent runs. For each value of the x-axis, the results provided by the mCLASA are shown on the left-hand side, and the results provided by the GDFCMSA algorithm are shown on the right-hand side. The triangles represent the relative orders, while the plus signs denote the outliers (if they exist) of the corresponding boxes.
Figure 4. The values of the cost functions and relative degrees with regard to the number of added sensors in the case studies HEN and MNA_1. The box plots show the interquartile ranges and the medians of the cost functions, while the scatter plots visualize the minimum value of the relative order for 100 independent runs. For each value of the x-axis, the results provided by the mCLASA are shown on the left-hand side, and the results provided by the GDFCMSA algorithm are shown on the right-hand side. The triangles represent the relative orders, while the plus signs denote the outliers (if they exist) of the corresponding boxes.
Sensors 18 03096 g004
Figure 5. The values of the cost functions and relative degrees with regard to the number of added sensors in the case studies MNA_4 and peec. The box plots show the interquartile ranges and the medians of the cost functions, while the scatter plots visualize the minimum value of the relative order for 100 independent runs. For each value of the x-axis, the results provided by the mCLASA are presented on the left-hand side, while the results provided by the GDFCMSA algorithm are shown on the right-hand side. The triangles denote the relative orders, while the plus signs represent the outliers of the corresponding boxes.
Figure 5. The values of the cost functions and relative degrees with regard to the number of added sensors in the case studies MNA_4 and peec. The box plots show the interquartile ranges and the medians of the cost functions, while the scatter plots visualize the minimum value of the relative order for 100 independent runs. For each value of the x-axis, the results provided by the mCLASA are presented on the left-hand side, while the results provided by the GDFCMSA algorithm are shown on the right-hand side. The triangles denote the relative orders, while the plus signs represent the outliers of the corresponding boxes.
Sensors 18 03096 g005
Figure 6. Analysis of the convergence with regard to the proposed methods in the case of the HEN. The colour red denotes the method mCLASA while the colour blue denotes the method GDFCMSA. It is clearly visible that the value of m a x i t e r should be high because the methods also converge after 500 iterations. GDFCMSA provides a slightly better solution.
Figure 6. Analysis of the convergence with regard to the proposed methods in the case of the HEN. The colour red denotes the method mCLASA while the colour blue denotes the method GDFCMSA. It is clearly visible that the value of m a x i t e r should be high because the methods also converge after 500 iterations. GDFCMSA provides a slightly better solution.
Sensors 18 03096 g006
Table 1. The main parameters of the networks studied in the examples. | V | denotes the number of state variables, | E | stands for the number of the connections between the state variables, | S f | represents the number of fixed sensor nodes, and K + denotes the number of additional nodes to be analyzed.
Table 1. The main parameters of the networks studied in the examples. | V | denotes the number of state variables, | E | stands for the number of the connections between the state variables, | S f | represents the number of fixed sensor nodes, and K + denotes the number of additional nodes to be analyzed.
NetworkDescription | V | | E | | S f | K +
HENA simple heat exchanger network.22563 0 , , 15
MNA_1Modified Nodal Analysis of a multiport, voltage sources.57816943 0 , , 35
MNA_4Modified Nodal Analysis of a multiport, voltage sources.98028722 0 , , 35
peecPartial element equivalent circuit (PEEC) model.48013461 0 , , 35
Table 2. Durations of calculations in seconds in terms of the optimal placement of K + additional sensors. The presented results are the means of 100 independent runs.
Table 2. Durations of calculations in seconds in terms of the optimal placement of K + additional sensors. The presented results are the means of 100 independent runs.
K + HENMNA_1MNA_4peec
mCLASAGDFCMSAmCLASAGDFCMSAmCLASAGDFCMSAmCLASAGDFCMSA
10.0158 s0.0164 s0.1311 s0.1396 s0.1795 s0.1892 s0.0704 s0.0715 s
20.0173 s0.0183 s0.1752 s0.1689 s0.2240 s0.2306 s0.0945 s0.0955 s
30.0205 s0.0227 s0.2230 s0.2120 s0.2676 s0.2922 s0.1176 s0.1179 s
50.0233 s0.0280 s0.2638 s0.2999 s0.3724 s0.3749 s0.1602 s0.1614 s
100.0310 s0.0315 s0.3965 s0.4106 s0.6353 s0.7106 s0.2829 s0.2850 s
150.0427 s0.0428 s0.5984 s0.6721 s0.9049 s0.9176 s0.4631 s0.4237 s
25n.a.n.a.0.8052 s0.8170 s1.5417 s1.4754 s0.6385 s0.6862 s
35n.a.n.a.1.3258 s1.3404 s2.1008 s2.0255 s0.9229 s1.0077 s

Share and Cite

MDPI and ACS Style

Leitold, D.; Vathy-Fogarassy, A.; Abonyi, J. Network Distance-Based Simulated Annealing and Fuzzy Clustering for Sensor Placement Ensuring Observability and Minimal Relative Degree. Sensors 2018, 18, 3096. https://doi.org/10.3390/s18093096

AMA Style

Leitold D, Vathy-Fogarassy A, Abonyi J. Network Distance-Based Simulated Annealing and Fuzzy Clustering for Sensor Placement Ensuring Observability and Minimal Relative Degree. Sensors. 2018; 18(9):3096. https://doi.org/10.3390/s18093096

Chicago/Turabian Style

Leitold, Daniel, Agnes Vathy-Fogarassy, and Janos Abonyi. 2018. "Network Distance-Based Simulated Annealing and Fuzzy Clustering for Sensor Placement Ensuring Observability and Minimal Relative Degree" Sensors 18, no. 9: 3096. https://doi.org/10.3390/s18093096

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop