Next Article in Journal
Game Design to Measure Reflexes and Attention Based on Biofeedback Multi-Sensor Interaction
Previous Article in Journal
Cooperative Environment Scans Based on a Multi-Robot System
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Sparse Component Analysis Using Time-Frequency Representations for Operational Modal Analysis

Department of Precision Machinery and Precision Instrumentation, University of Science and Technology of China (USTC), Hefei 230027, China
*
Author to whom correspondence should be addressed.
Sensors 2015, 15(3), 6497-6519; https://doi.org/10.3390/s150306497
Submission received: 27 November 2014 / Revised: 20 February 2015 / Accepted: 27 February 2015 / Published: 17 March 2015
(This article belongs to the Section Physical Sensors)

Abstract

: Sparse component analysis (SCA) has been widely used for blind source separation(BSS) for many years. Recently, SCA has been applied to operational modal analysis (OMA), which is also known as output-only modal identification. This paper considers the sparsity of sources' time-frequency (TF) representation and proposes a new TF-domain SCA under the OMA framework. First, the measurements from the sensors are transformed to the TF domain to get a sparse representation. Then, single-source-points (SSPs) are detected to better reveal the hyperlines which correspond to the columns of the mixing matrix. The K-hyperline clustering algorithm is used to identify the direction vectors of the hyperlines and then the mixing matrix is calculated. Finally, basis pursuit de-noising technique is used to recover the modal responses, from which the modal parameters are computed. The proposed method is valid even if the number of active modes exceed the number of sensors. Numerical simulation and experimental verification demonstrate the good performance of the proposed method.

1. Introduction

During the last few decades, modal analysis has been a useful analytical tool in linear dynamical systems. Modal information, including natural frequency, damping ratio and mode shape, can be extracted through modal identification to completely characterize the dynamics of a linear system. Output-only modal identification or operational modal analysis (OMA) has attracted the interest of many researchers in structural dynamics because of its capacity to extract modal information from output measurements alone [13].

Modal identification coincides with the concept of blind source separation(BSS), which is the process of extracting original sources and mixing matrix from output signals without prior knowledge of the mixing procedures or sources. The linear instantaneous-mixing BSS model can be formulated as

x ( t ) = A s ( t )
where x(t) is a column vector of m, zero mean, and output observations that represent an instantaneous linear mixture of source signals in column vector s(t); A ∈ ℝm×n is an m × n matrix, which is referred to as the mixing matrix. The observation vector x(t) may be construed as a superposition of the columns of matrix A loaded by individual sources si(t), i = 1, … , n. The objectivity of BSS is to recover sources s(t) from observations x(t) only without any knowledge of A. From the perspective of modal expansion, the BSS problem has a similar form to OMA, in which the output responses are a linear combination of decoupling monotone modes viewed as source signals.

Many studies have attempted to use BSS for OMA [48]. [4] was the first to recognize the correlation between the vibration modes of mechanical systems and the sources computed through independent component analysis(ICA); ICA is effective in weakly damped modes(with less than 1% damping). Classical second-order BSS(SOBSS) algorithms such as multiple unknown signal extraction(AMUSE) and second-order blind identification(SOBI) were successfully applied to modal identification and enhanced damped mode separation [5,6]. [7,9] highlighted the limitations of SOBI and proposed a framework using Hilbert transform to estimate complex mode shapes. [10] proposed a method that combines SOBI and state-space realization on the basis of the non-Hermitian joint approximate diagonalization of the correlation matrices at several time lags; the method established a theoretical link between SOBSS and stochastic subspace identification(SSI) [2,11], which is a popular modal identification algorithm.

Unlike traditional methods, BSS is straightforward, computationally efficient and can function unsupervised; however, it can estimate only as many active modes as the number of output responses [12]. The BSS problem that occurs when the number of sensors is less than the number of the sources is referred to as underdetermined BSS(UBSS), which has received much attention in recent years [10,13]. Sparse component analysis(SCA) [1416] was recommended to solve UBSS. Unlike traditional BSS methods that rely on source independence, SCA requires sources to have sparsity in the time domain or some transformed domain. Many studies proposed algorithms based on the sparsity in the TF domain [1722]. In [17], singular value decomposition was used on wavelet coefficients to identify frequencies and damping ratios; this method is faster than other ridge detection algorithms. In [19], principal component analysis was performed on the wavelet coefficients to produce the partial mixing matrix of UBSS.

Speech signals are highly transient, consisting of voiced segments exhibiting a well-defined frequency structure, which are separated by noise bursts. In each short period of time, only a small number of frequency bands hold most of the energy, while other frequency bands contain little energy. Therefore, if the speech signals are transformed to time-frequency domain, magnitude of most of the time-frequency coefficients in the time window are approximate to zero, which meets the sparsity condition of SCA; hence SCA has been widely studied in speech processing [2330]. [23,24] propsed the degenerate types of the unmixing estimation techniques under the assumption that the W-disjoint orthogonal condition occurs across the whole TF region. [25,26] proposed the types of time-frequency ratio of mixtures(TIFROM) methods to relax the restrictive orthogonal assumption. The TF coefficients of sources were allowed to overlap, but adjacent TF regions where only one source occurs must exist. In [27], the restriction of the TIFROM algorithm was further relaxed because only two adjacent points with a single source contribution in the same frequency bin were demanded. This relaxation was based on the fact that the magnitude of the mixture vector at each single-source-point(SSP) is proportional to that of one of the mixing column vectors. [28] reported a simple algorithm of identifying SSPs through hierarchical clustering. This algorithm achieved high estimation performance which could be further improved by using highly suitable clustering algorithms.

In OMA, the sources are exponentially-decaying sinusoids with different instinct frequencies, the sparse property of the sources in frequency or TF domain makes it suitable to execute modal identification using SCA. [31] studied SCA by using the sparsity of modal responses in the frequency domain, and estimated the mixing matrix and modal parameters. Instead of exploring frequency-domain sparsity, the present study investigates sparse representations in the TF domain. The proposed SCA consists of two steps: estimation of the mixing matrix and reconstruction of sources. The mixed signals are first transformed into the TF domain. The scatter plot of the TF coefficients reveals the hyperlines that correspond to the column vectors of the mixing matrix because of sparsity. The mixing matrix can be estimated through cluster analysis. SSP techniques [28] and K-hyperline clustering are combined to improve the performance of SCA because the estimation of the mixing matrix plays a key role in the latter reconstruction of sources. Finally, the theory of linear equations for determined BSS or basis pursuit de-noising (BPDN) for UBSS is used to recover the modal responses. Modal parameters are then estimated by using the single-degree-of-freedom(SDOF) parameter fitting method. The main contribution of this paper is to introduce the TF-SSP technique to OMA. The procedure of extracting SSPs can eliminate the points which don't lie in hyperlines and make estimation of mixing matrix more accurate, especially in the presence of noise. Numerical and experimental results show the accurate and robust identification performance of the method.

The paper is structured as follows. Section 2 introduces the theory of modal expansion, and Section 3 presents the derivation of SCA. Sections 4 and 5 provide the numerical simulations and experimental verification, respectively. The conclusion is drawn in Section 6.

2. Modal Expansion and BSS

For an n-degree-of-freedom linear system, the governing equations can be written as

M x ¨ ( t ) + C x ˙ ( t ) + K x ( t ) = f ( t )
where M ∈ ℝ n×n is the mass matrix, C ∈ ℝ n×n is the damping matrix, and K ∈ ℝ n×n is the stiffness matrices; f(t) is the external force, x is the n-dimensional displacement response vector, and dot denotes derivatives with respect to time.

On the basis of modal expansion theory, the system responses x(t) = [x1(t),… ,xn(t)]T with diagonalizable damping that can be expressed as

x ( t ) = Φ q ( t ) = i = 1 n φ i q i ( t )
where Φ ∈ ℝ n×n is the modal matrix that consists of the mode shape vector φi ∈ ℝn , and q(t) = [q1(t),…, qn(t)]T is a matrix of the corresponding modal coordinates. In particular, in free vibration, that is f(t) = 0, the single-mode signals can be expressed as monotone exponentially-decaying sinusoids
q i ( t ) = u i exp - ς i ω i t cos ( ω d i t + θ i ) , i = 1 , , n
where ωi and ζi are natural frequencies and damping ratios, respectively; ωdi is the damped natural frequency, ω d i = ω i 1 - ς i 2; and ui and θi are constants under corresponding initial conditions.

To identify Φ and q(t) on the sole basis of x(t), a parametric model must be established by using traditional identification methods. The common method of SSI [2,11] is based on the state space model and uses the theory of subspace identification to identify the parameters of the model structure. However, this method has some drawbacks in terms of model order determination and spurious mode problems, which require the standard stabilization diagram to obtain exact result. Note that Equations (1) and (3) share the same format. The modal coordinate q(t) is a special case of the general sources s(t) with time structure, whereas Φ corresponds to the mixing matrix A. Furthermore, distinct modal coordinates are uncorrelated and thus automatically satisfy the requirement of sources in BSS. The coordinates are also spectrally monotone in the frequency domain and thus meet the sparsity assumption of SCA . Therefore, Φ and q(t) can be obtained from the output x(t) by applying BSS and SCA, both of which provide a straightforward and efficient alternative for output-only modal identification.

3. Sparse Component Analysis (SCA)

In this section, we discuss all techniques used in the proposed SCA. First, the sparsity of signals in different transformed domains is illustrated, and the reason for the use of TF domain is explained. Then, SSP detection and K-hyperline clustering are presented to improve the accuracy of the mixing matrix estimation. Finally, the algorithm for sources reconstruction is discussed.

3.1. Sparsity in Transformed Domain

A signal is considered sparse in some ξ-domains if only a small number of coefficients in the ξ-domain significantly differ from zero. The linear transformation that changes the time-domain BSS model to the sparse ξ-domain can be represented as follows:

x ( ξ ) = A s ( ξ ) = i = 1 n a i s i ( ξ )

If s(ξ) is sufficiently sparse and disjoint at some ξk, that is, sj(ξk) ≠ 0 and si(ξk) = 0 for ij, then

x ( ξ k ) = i = 1 n a i s i ( ξ k ) = a j s j ( ξ k )

At these points (i.e., SSPs), only one source is active and the mixture vector x(ξk) is proportional in magnitude to one of the mixing matrix columns. Therefore plotting the magnitude of the mixture vectors in ξ-domain as a scatter diagram (e.g., a plot of x1(ξ) versus x2(ξ)) will form clusters and show a clear orientation toward the directions of the column vectors of the mixing matrix A. The number of active sources can be counted by recognizing significant directions. The clustering algorithm can be applied to estimate A.

A simple example is presented to display the orientation-clustering appearances in different ξ-domains. Three sources with distinct frequencies, namely, s1(t) = cos(2π·0.2t), s2(t) = cos(2π·0.6t), and s3(t) = cos(2π · 1.3t), are mixed by the matrix A = [ 4 - 2 1 1 2 - 3 ].

Two mixtures are generated as

x 1 ( t ) = 4 s 1 ( t ) - 2 s 2 ( t ) + s 3 ( t ) x 2 ( t ) = s 1 ( t ) + 2 s 2 ( t ) - 3 s 3 ( t )

The two mixtures are respectively transformed into the frequency and TF domain for comparison. Instead of Fourier transform, discrete cosine transform (DCT) is used in the frequency domain on account of the reason reported in [31]. In the TF domain, short-time Fourier transform (STFT), which has been successfully applied in speech signals, is used because of its simplicity and intuitiveness. Mirror operation and window function with long overlap are applied to solve the problems of edge effect and low accuracy. In this study, STFT is conducted with a Hanning window(length is 256 and overlap is 192). In order to better compare the result between DCT and STFT, the coefficient matrix xi(t, f),i = 1, 2 obtained by STFT is first reorganized to a vector x(ξ),i =1, 2. The scatter diagrams of x1(t) versus x2(t) or x1(ξ) versus x2(ξ) are plotted in Figure 1.

The scatter points in the time domain are disordered and reveal minimal information about the mixing matrix. However, the points in the frequency and TF domains form clusters around three significant directions, which correspond to the three columns of the mixing matrix A. Furthermore, the STFT result is superior to that of the DCT because only a small fraction of points remarkably reveal the directions in the DCT scatter diagram. DCT performance is thus sensitive to noise and outliers. In this study, we choose to perform SCA in the TF domain by using STFT.

3.2. Mixing Matrix Estimation

Estimation of the mixing matrix is important in SCA. Estimation accuracy can be improved in two ways: SSP detection and K-hyperline clustering.

3.2.1. SSP Detection

The SSP is the point where only one source is active, and the direction of the magnitude of the mixture vectors is proportional to that of the columns of the mixing matrix. SSP detection can thus reduce the effect of noise and outliers as well as improve clustering performance.

Equation (1) can be expressed in the TF domain using STFT as

X ( t , k ) = AS ( t , k ) = i = 1 n a i S i ( t , k )
where X(t, k) = [X1(t, k),…, Xm(t, k)]T and S(t, k) = [S1(t, k),…, Sn(t, k)]T represent the STFT coefficients of the mixtures and sources, respectively, in the kth frequency bin at time frame t; ai = [a1i, … , ami]T is the ith column of the mixing matrix A. We assume a case with two sources and two sensors. If the (t1, k1) in the TF domain is an SSP, that is, s1(t1, k1) ≠ 0 and s2(t1, k1) = 0, then the following can be obtained from Equation (7):
X ( t 1 , k 1 ) = a 1 S 1 ( t 1 , k 1 )

The real and imaginary parts can be equated through plural unfolding as

Re { X ( t 1 , k 1 ) } = a 1 Re { S 1 ( t 1 , k 1 ) }
Im { X ( t 1 , k 1 ) } = a 1 Im { S 1 ( t 1 , k 1 ) }

In this case, the absolute directions of the real and imaginary parts are the same as that of a1. This condition is derived from the fact that only one source is active at an SSP. The real and imaginary parts of a non-SSP (t2, k2) at which both sources are active can be written as

Re { X ( t 2 , k 2 ) } = a 1 Re { S 1 ( t 2 , k 2 ) } + a 2 Re { S 2 ( t 2 , k 2 ) }
Im { X ( t 2 , k 2 ) } = a 1 Im { S 1 ( t 2 , k 2 ) } + a 2 Im { S 2 ( t 2 , k 2 ) }

If the absolute directions of the real and imaginary parts need to be the same, then the following condition must be satisfied:

Re { S 1 ( t 2 , k 2 ) } Im { S 1 ( t 2 , k 2 ) } = Re { S 2 ( t 2 , k 2 ) } Im { S 2 ( t 2 , k 2 ) }

When extending to the general case of multiple mixtures and multiple sources, the condition that the absolute directions of Re{X(t, k)} and Im{X(t, k)} at any point (t, k) are the same requires that the ratios between the real and imaginary parts of the STFT coefficients of all source signals at those points must also be the same. However, the probability of satisfying this requirement is low and decreases as the number of sources increases [28]. Therefore, we can conclude that a point is an SSP if the absolute directions of Re{X(t, k)} and Im{X(t, k)} at a point (t, k) are the same; otherwise, the point is a multiple-source-point. However, the points that accurately satisfy this condition are limited. Therefore, [28] developed an algorithm that relaxes this condition for SSP detection. In the TF plane, the points where the difference between the absolute directions of Re{X(t, k)} and Im{X(t, k)} is less than Δθ are taken as SSPs. This condition can be expressed as

| Re { X ( t , k ) } T Im { X ( t , k ) } Re { X ( t , k ) } Im { X ( t , k ) } | > cos ( Δ θ )
where | · | represents the absolute value, and y = y T y. The SSPs that satisfy Equation (14) can be preserved for the estimation of the mixing matrix.

Figure 2 illustrates an example of speech utterances with five sources and two mixtures. All STFT coefficients are shown in Figure 2a, whereas the SSPs detected by using Equation (14) are shown in Figure 2b. The directions in Figure 2b are much clearer than those before SSP detection, and the scale of points for clustering is also reduced. Therefore, SSP detection makes SCA simple, effective, and robust to noise and outliers.

3.2.2. K-Hyperline Clustering

After SSP detection, estimating the mixing matrix A can be cast into a hyperline clustering problem. Several standard clustering methods have been used to extract directions; examples of these methods include K-means, fuzzy-C clustering [32] and the linear geometric ICA-based method [33]. However, these methods require the sources to be sufficiently sparse, and their performance in practice may be affected by many factors such as noise, outliers, and insufficient sparseness of sources, as mentioned in [16].

Considering these problems, [34,35] proposed and studied K-hyperline clustering method. In contrast to the K-means clustering method, K-hyperline allows each data sample to have a coefficient value that corresponds to the cluster centroid, and the cluster centroids are normalized to unit 2–norm. Clustering can be rapidly realized by solving an optimization problem using singular value decomposition or an iterative linear scheme. Interestingly, this method is robust to the outliers and performs well even in noisy cases. Further discussion on this clustering method is not presented in this paper due to space limitation.

3.3. Reconstruction of Sources

Once the mixing matrix is estimated and denoted as , the sources can be directly recovered by solving (t) = −1x(t) for m = n or by using the least squares method for m > n. However, for the case of UBSS(i.e., m < n), an infinite amount of feasible solutions may be obtained in accordance with the theory of linear equations. From these solutions, we want to find the one that has the least error with our real sources. Given the sparsity of the sources in ξ-domain, the solution is required to have the fewest non-zero elements, which can be formulated as: (P0) : (ξ) = argmin ‖s(ξ)‖0, where ‖x‖0 denotes the 0-norm of x. However, (P0) is subject to its non-convexity and high-discreteness and is therefore difficult to executed in practice. Replacing the subscript 0 with 1, we obtain the problem (P1) : (ξ) = argmin‖s(ξ)‖1, which is also called the basis pursuit(BP) problem. [36] proved that (P1) is a good approximation to (P0). (P1) can be realized by a well-defined convex optimization problem, whose solution can be guaranteed as global optimum and can be solved efficiently by using standard linear programming techniques. For the noiseless case, BP is a effective solution to recover the sparse sources. However, BP can not find a good solution in the presence of noise. Thus, we use BPDN technique to recover the sources in this manuscript. The optimization problem of BPDN is formulated as follows:

s ˜ ( ξ ) = arg min s ( ξ ) 1 subject to x ( ξ ) - A ˜ s ( ξ ) 2 < σ

The case σ = 0 corresponds to a solution of BP. The MATLAB solver SPGL1 is used to recover the sources from the time-frequency coefficients. For the noiseless case in Sections 4.1 and 4.2, the BP method is used, and the BPDN method is used in the experimental part and the noise case of the simulation part. Additional description of the 1-minimization technique can be found in [36-40].

When the source signals (ξ) in ξ-domain are recovered, the time-domain sources (t) in Equation (1) can be calculated by the corresponding inverse transform as

s ˜ ( t ) = T - 1 ( s ˜ ( ξ ) )

Then the modal parameters can be extracted from the recovered sources by using the SDOF parameter fitting method or logarithmic decrement method.

4. Numerical Simulations

A 3-dof mass-spring-damper system (Figure 3) is considered in this section to validate the effect of the proposed TF-domain SCA method. Different system parameters are selected for various situations, such as well-separated modes, highly-damped modes, and closely-spaced modes. In each situation, determined and underdetermined identification are both considered. The robustness to noise is studied in Section 4.3.

The proposed algorithm is performed under the following settings. The simulation outputs are first transformed into the TF plane using STFT with a Hanning window length of 512 and an overlap of 384. Then, the SSPs in the TF plane are found by the value Δθ of 10°. a time-frequency trade-off occurs when using STFT. The width of the windowing function relates to how the signal is represented-it determines whether there is good frequency resolution or good time resolution. a wide window gives better frequency resolution but poor time resolution. a narrower window gives good time resolution but poor resolution. The methods for obtaining high resolution STFT have been studied in [41,42]. [41] proposed a method to separate nonstationary and stationary signals overlapping in time-frequency domain; time and frequency varying windows and overlapped STFT was used to improve time and frequency resolution. [42] developed a high-resolution time-frequency representation to achieve robust estimation of multiple frequency hopping signals based on sparse Bayesian method. The source signals in OMA are exponentially-decaying sinusoids. Thus, the time resolution is not so important and we can select a wide window. Meanwhile, the overlap is set at 75% of the window length to obtain adequate single source points. In practice, the minimal natural frequency is another factor that should be considered in deciding the window length. As the model in Section 4.1 shows, the minimal natural frequency is 0.0895 (the period is 112 points at the sampling frequency 10 Hz); thus, the window length should contain at least several periods. On the basis of the reasons above, the window length and overlap are set as the ones above. Δθ is set to 10° to make sure that we can obtain adequate clean SSPs. The mixing matrix is obtained through K-hyperline clustering. With the estimated mixing matrix, the modal responses are directly recovered by solving Equation (1) in the time domain for the determined case. For the underdetermined case, the frequency-domain modal responses are first recovered by BPDN. Then the overlap add (OLA) method is used to execute inverse STFT to get the time sequence, details of OLA method can be found in [43]. After the inverse STFT is carried out, the time-domain modal responses are obtained. Finally, the natural frequency and damping ratio of each estimated modal response can be calculated by using either the simple SDOF parameter fitting or logarithmic decrement method.

The modal assurance criterion (MAC) is used to determine the errors of the estimated mode shape as

MAC ( φ ˜ i , φ i ) = ( φ ˜ i T φ i ) 2 ( φ ˜ i T φ ˜ i ) ( φ ˜ i T φ i )
where φ̃i and φ̃i are the ith estimated and theoretical mode vectors, respectively. The MAC value ranges from 0 to 1, in which 0 means no correlation and 1 indicates complete correlation. For the underdetermined case, the estimated mode vectors have lower dimensions than the theoretical ones. The corresponding rows of the theoretical mode vectors are selected to constitute the partial mode shapes. Thus the MAC indicates the estimating accuracy of the partial mode shapes.

4.1. Well-Separated Modes

First, the case of well-separated modes is considered. The system parameters, identical to those in [31], are set as follows:

M = [ 2 0 0 0 1 0 0 0 3 ] , K = [ 1 0 0 0 1 0 0 0 1 ]

The damping matrix is determined by the mass matrix as C = αM, where α = 0.08, 0.13 to account for different damping levels. The free response is adopted, and the initial states are x(0) = [0 0 0]T and x¨(0) = [0 1 0]T.

4.1.1. Determined Modal Identification

In this section, all sensor measurements are used for identification. The responses of the free vibration are shown in Figure 4, and the scatter diagram of the identified SSPs is shown in Figure 5, from which we can see that the SSPs form three clear directions. The mixing matrix, that is, the matrix of mode shape vectors, can then be accurately estimated through K-hyperline clustering. The performance can be indicated by the high MAC values presented in Table 1. The recovered time-domain modal responses, as shown in Figure 6, have waveforms that are similar to those of monotone exponentially decaying sinusoids. The frequencies and damping ratios are estimated from these modal responses, as shown in Table 1. Compared with the theoretical values, the proposed algorithm obtains excellent results regardless of damping levels.

4.1.2. Underdetermined Modal Identification

In the underdetermined case, the first two measurements are utilized. Figure 7 shows the 2D scatter of the SSPs. Furthermore, three clear directions represent the mode shape vectors. The 1-optimization algorithm is used to recover the time-domain modal responses, and the results of the case where α = 0.08 are shown in Figure 8. The high MAC values and the accurately identified parameters are presented in Table 1. These results reveal that the proposed algorithm is also suitable for underdetermined modal identification with limited sensors.

The TF-domain SCA in this study achieves a better performance compared with the frquency-domain SCA in [31]. On the one hand, more points remarkably reveal the orientation, which make the directions more obvious and recognizable. This feature is helpful in enhancing estimation accuracy of the mixing matrix. On the other hand, the system parameters and mode shapes are identified with high accuracy.

4.2. Closely-Spaced Modes

Closely-spaced modes are studied in this section to further evaluate the capacity of the proposed SCA. The model is set up with the following parameters:

M = [ 1 0 0 0 2 0 0 0 1 ] , K = [ 5 - 1 0 - 1 4 - 3 0 - 3 3.5 ] , C = [ 0.0894 - 0.0084 0.0003 - 0.0084 0.1301 - 0.0244 0.0003 - 0.0244 0.0772 ]

Figure 9 shows the free vibration of the model under the same initial conditions as those in Section 4.1. As indicated in the spectrum, the second and third modes cannot be clearly separated. However, the scatter plot of the identified SSPs in Figure 10 reveals three significant directions. Although the angle between the two of them is slightly small, high MAC values can be obtained as presented in Table 2, where the estimated natural frequencies and damping ratios in the determined and underdetermined cases are also displayed. The identified parameters match the theoretical ones well. Figure 11 shows the recovered modal responses in the underdetermined case.

Similar to the frequency-domain SCA, the TF-domain SCA is effective for closely-spaced modes.

4.3. Robustness to Noise

To investigate the robustness to noise, the simulation outputs are corrupted by Gaussian white noise, the root mean square(RMS) amplitude of which is set to a certain percentage of the signal RMS value. The model is the same as that used in Section 4.1. The identified SSPs and the recovered modal responses under 10% noise are shown in Figures 12 and 13. Although the addition of noise results in a confusing scatter, many points are still distributed around the direction lines, and their average-value are used to reduce the effect of noise. BP and BPDN are used to recover the sources. Comparison shows that BPDN performs better than BP in the presence of noise. To evaluate the performance of the proposed algorithm with increasing noise percentage, 5%, 10%, 15% and 20% RMS noise are added to the original source. Fifty groups of sources with different white noise under each noise level are processed with the proposed algorithm because of the non-deterministic characteristic of the noise. The mean values of the MAC of the successful identifications are shown in Table 3. The identification is considered successful when the MAC of each identified mode is greater than 0.95. The high MAC values shown in Table 3 indicate that the proposed algorithm can also achieve good results in the presence of noise. The proposed algorithm may fail when the noise level increases to a certain extent (20% for this example).

The scatter diagram of one failed case under 20% noise leve is shown in Figure 14a. The valuable single source points that correspond to the shortest hyperline are flooded in the noise, and the direction vector cannot be found in this case. There is another condition in which the method may fail due to noise exists. We have examined the performance for the close-separated modes in Section 4.2. The scatter plot of the mixture when the sources are polluted by noise (10%) is shown in Figure 14b. This figure shows that the single source points deviate from their original position(due to noise) and become mixed. As a result, two hyperlines are identified as one, and the other direction vector may be arbitrary. The MAC values [0.9972 0.4851 0.9995] validate this finding. If not in these two conditions, the proposed algorithm can behave well enough.

5. Experimental Verification

5.1. Experimental Setup

A steel cantilever beam that measures 0.31 m × 0.0012 m × 0.0004 m is used to validate the effectiveness of the proposed modal identification algorithm. The steel beam is made of carbon tool steel whose Young's modulus and density are 2.06 × 1011 N ·m−2 and 7.85 × 103 kg · m−3, respectively. One end of the steel beam is fixed on a table, and the length from the fulcrum to the end is 0.28 m.

When excitation is provided by human finger tapping, the steel beam begins to vibrate. To capture the displacement responses, a high-speed camera is used instead of traditional accelerometer. Figure 15a shows the experimental setup, and Figure 15b shows the high-speed camera head. Twenty-one locations with a spacing of approximately 0.01 m are marked for displacement measurement. Gray 8-bit images can be transferred at 500 fps (in a 150 × 1200 pixel area, one pixel corresponds to 0.0002 m) by using the device. The camera head is connected to a PC. After image acquisition, the displacement responses at marked locations are extracted by image processing. The displacement responses extracted from some of the locations are shown in Figure 16.

5.2. Cantilever Beam Theory

The vibration of the cantilever beam satisfies Euler-Bernoulli beam theory, by which the governing differential equation without damping is formulated as

4 y x 4 + a 2 y t 2 = 0
where a = E J / ρ A, E is Young's modulus, J is the second moment area of the beam cross-section, ρ is the material density, and a is the cross-sectional area of the beam. Under the boundary conditions of the cantilever beam, the natural frequency function and corresponding mode shape function can be calculated as
ω i = ( β i l ) 2 l 2 a
Y i ( x ) = cos ( β i l ) + c h ( β i l ) sin ( β i l ) + s h ( β i l ) [ sin ( β i x ) - s h ( β i x ) ] + c h ( β i x ) - cos ( β i x )
where βil is determined by cos(βl)ch(βl) = − 1, and l is the length of the cantilever beam. The natural frequencies of the former three orders based on the substitution of the actual parameters of the steel beam are shown in Table 4. The corresponding mode shapes are shown in Figure 17.

5.3. Results Analysis

To accomplish the modal analysis of the steel cantilever beam, the proposed SCA algorithm is applied to the displacement responses. Figure 18 depicts the scatter diagram of the identified SSPs using two of the measurements, which reveals three significant directions. This result indicates that three modes of the cantilever beam are excited. The modal responses recovered in the determined case using all the displacement responses as well as that using only two measurements (1st and 10th markers) with the proposed SCA are shown in Figures 19 and 20. The recovered responses are similar to the monotone exponentially decaying sinusoids.

As a common and effective method for modal identification, SSI is also used in this study for comparison with the proposed SCA. To avoid the drawbacks of the SSI, all displacement responses are used as the input in subspace identification. The detailed results of the corresponding natural frequencies and damping ratios are listed in Table 4. The identified mode shapes by both SSI and the proposed SCA in the determined case are shown in Figure 17 for contrast with the theoretical ones.

In Table 4, the identified natural frequencies and damping ratios obtained by the proposed SCA in both the determined and underdetermined cases agree very well with those obtained by SSI. The identified frequencies also match the theoretical results. In addition, the identified mode shapes by the two methods show a high correlation with the theoretical waveforms in Figure 17, as also indicated by the MAC values. a comparison between the underdetermined case and the determined one shows that the former uses fewer measurements but attains a similarly good performance.

This experiment successfully validates the effectiveness of the proposed SCA for OMA in practice. Especially for the underdetermined case, similar performance can be obtained despite the use of limited measurements, whereas the SSI method needs more measurements to ensure good results. The proposed method provides a new way for the output-only modal identification.

6. Conclusions

This paper presents a new SCA method for OMA. Compared with existing methods, the proposed method explores the sparse representations in the TF domain to perform output-only modal identification. Some clusters are formed when the mixed signals are transformed into the TF plane. SSP detection and K-hyperline clustering are combined to improve the estimation accuracy of the mixing matrix. The SSP is the point in which only one source is active, and the direction of the magnitude of the mixture vectors are proportional to those of the columns of the mixing matrix. Therefore, SSP detection can reduce the effect of noise and outliers as well as improve the clustering performance. After SSP detection, the estimation of the mixing matrix is cast into a hyperline clustering problem. Therefore, K-hyperline clustering is introduced to fulfill the clustering task. After the mixing matrix is estimated, the theory of linear equations for determined BSS or the 1-minimization technique for UBSS is used to recover the modal responses, from which the modal information (natural frequency, damping ratio, and mode shapes) can be extracted. The numerical simulations and experimental results demonstrate the effectiveness of the proposed SCA method in various cases. a comparison of the proposed method with SSI reveals that the two methods show equal performance in the experimental section. Further investigation is required to explore the applications of the proposed SCA in complex modes cases, which are not covered in this study.

Acknowledgements

This work was supported by the National Key Basic Research Program of China (973 Program) under Grant No.2014CB049500 and the Key Technologies R&D Program of Anhui Province under Grant No.1301021005.

Author Contributions

Qin and Guo provided the initial idea and conception; Qin implemented the software and conducted the numerical simulation; Guo carried out the experimental validation; Qin and Guo wrote the main manuscript. Zhu reviewed and edited the manuscript. All authors reviewed the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mevel, L.; Hermans, L.; van der Auweraer, H. Application of a subspace-based fault detection method to industrial structures. Mech. Syst. Signal Process. 1999, 13, 823–838. [Google Scholar]
  2. Peeters, B.; de Roeck, G. Reference-based stochastic subspace identification for output-only modal analysis. Mech. Syst. Signal Proc. 1999, 13, 855–878. [Google Scholar]
  3. Reynders, E. System Identification Methods for (Operational) Modal Analysis: Review and Comparison. Arch. Comput. Methods Eng. 2012, 19, 51–124. [Google Scholar]
  4. Kerschen, G.; Poncelet, F.; Golinval, J.C. Physical interpretation of independent component analysis in structural dynamics. Mech. Syst. Signal Proc. 2007, 21, 1561–1575. [Google Scholar]
  5. Poncelet, F.; Kerschen, G.; Golinval, J.C.; Verhelst, D. Output-only modal analysis using blind source separation techniques. Mech. Syst. Signal Proc. 2007, 21, 2335–2358. [Google Scholar]
  6. Zhou, W.; Chelidze, D. Blind source separation based vibration mode identification. Mech. Syst. Signal Proc. 2007, 21, 3072–3087. [Google Scholar]
  7. McNeill, S.I.; Zimmerman, D.C. A framework for blind modal identification using joint approximate diagonalization. Mech. Syst. Signal Proc. 2008, 22, 1526–1548. [Google Scholar]
  8. Antoni, J. Blind separation of vibration components: Principles and demonstrations. Mech. Syst. Signal Proc. 2005, 19, 1166–1180. [Google Scholar]
  9. McNeill, S.I. An analytic formulation for blind modal identification. J. Vib. Control 2012, 18, 2111–2121. [Google Scholar]
  10. Antoni, J.; Chauhan, S. A study and extension of second-order blind source separation to operational modal analysis. J. Sound Vib. 2013, 332, 1079–1106. [Google Scholar]
  11. Peeters, B.; de Roeck, G. Stochastic system identification for operational modal analysis: A review. J. Dyn. Syst. Meas. Control Trans. Asme 2001, 123, 659–667. [Google Scholar]
  12. Poncelet, F.; Kerschen, G.; Golinval, J. Experimental modal analysis using blind source separation techniques. Proceedings of International Conference on Noise and Vibration Engineering (ISMA2006), Heverlee, Belgium, 18–20 September 2006; pp. 3315–3329.
  13. Hazra, B.; Sadhu, A.; Roffel, A.J.; Paquet, P.E.; Narasimhan, S. Underdetermined Blind Identification of Structures by Using the Modified Cross-Correlation Method. J. Eng. Mech. Asce 2012, 138, 327–337. [Google Scholar]
  14. Belouchrani, A.; Amin, M.G. Blind source separation based on time-frequency signal representations. IEEE Trans. Signal Proc. 1998, 46, 2888–2897. [Google Scholar]
  15. Bofill, P.; Zibulevsky, M. Underdetermined blind source separation using sparse representations. Signal Proc. 2001, 81, 2353–2362. [Google Scholar]
  16. Li, Y.Q.; Amari, S.I.; Cichocki, A.; Ho, D.W.C.; Xie, S.L. Underdetermined blind source separation based on sparse representation. IEEE Trans. Signal Proc. 2006, 54, 423–437. [Google Scholar]
  17. Le, T.P.; Paultre, P. Modal identification based on the time-frequency domain decomposition of unknown-input dynamic tests. Int. J. Mech. Sci. 2013, 71, 41–50. [Google Scholar]
  18. Hazra, B.; Narasimhan, S. Wavelet-based blind identification of the UCLA Factor building using ambient and earthquake responses. Smart Mater. Struct. 2010, 19. [Google Scholar] [CrossRef]
  19. Sadhu, A.; Hazra, B.; Narasimhan, S.; Pandey, M.D. Decentralized modal identification using sparse blind source separation. Smart Mater. Struct. 2011, 20, 125009. [Google Scholar]
  20. Hazra, B.; Sadhu, A.; Roffel, A.J.; Narasimhan, S. Hybrid Time-Frequency Blind Source Separation Towards Ambient System Identification of Structures. Comput. Aided Civ. Infrastruct. Eng. 2012, 27, 314–332. [Google Scholar]
  21. Li, Z.; Crocker, M.J. A study of joint time-frequency analysis-based modal analysis. IEEE Trans. Instrum. Meas. 2006, 55, 2335–2342. [Google Scholar]
  22. Yan, B.F.; Miyamoto, A.; Bruhwiler, E. Wavelet transform-based modal parameter identification considering uncertainty. J. Sound Vib. 2006, 291, 285–301. [Google Scholar]
  23. Jourjine, A.; Rickard, S.; Yilmaz, O. Blind separation of disjoint orthogonal signals: Demixing N sources from 2 mixtures. Proceedings of the 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing, Istanbul, Turkey, 5–9 June 2000; Volumes 5, pp. 2985–2988.
  24. Yilmaz, O.; Rickard, S. Blind separation of speech mixtures via time-frequency masking. IEEE Trans. Signal Proc. 2004, 52, 1830–1847. [Google Scholar]
  25. Abrard, F.; Deville, Y. Blind separation of dependent sources using the “time-frequency ratio of mixtures” approach. Proceedings of the Seventh International Symposium on Signal Processing and Its Applications, Paris, France, 1–4 July 2003; Volume 2, pp. 81–84.
  26. Abrard, F.; Deville, Y. A time-frequency blind signal separation method applicable to underdetermined mixtures of dependent sources. Signal Proc. 2005, 85, 1389–1403. [Google Scholar]
  27. Bofill, P. Identifying Single Source Data for Mixing Matrix Estimation in Instantaneous Blind Source Separation. Artif. Neural Netw. ICANN 2008, 5163, 759–767. [Google Scholar]
  28. Reju, V.G.; Koh, S.N.; Soon, I.Y. An algorithm for mixing matrix estimation in instantaneous blind source separation. Signal Proc. 2009, 89, 1762–1773. [Google Scholar]
  29. Pappu, V.; Pardalos, P.M. High-Dimensional Data Classification. In Clusters, Orders, and Trees: Methods and Applications; Springer: New York, NY, USA, 2014; pp. 119–150. [Google Scholar]
  30. Syed, M.N.; Georgiev, P.G.; Pardalos, P.M. A hierarchical approach for sparse source Blind Signal Separation problem. Comput. Oper. Res. 2014, 41, 386–398. [Google Scholar]
  31. Yang, Y.C.; Nagarajaiah, S. Output-only modal identification with limited sensors using sparse component analysis. J. Sound Vib. 2013, 332, 4741–4765. [Google Scholar]
  32. Zibulevsky, M.; Pearlmutter, B.A. Blind source separation by sparse decomposition in a signal dictionary. Neural Comput. 2001, 13, 863–882. [Google Scholar]
  33. Theis, F.J.; Jung, A.; Puntonet, C.G.; Lang, E.W. Linear geometric ICA: Fundamentals and algorithms. Neural Comput. 2003, 15, 419–439. [Google Scholar]
  34. He, Z.S.; Cichocki, A.; Li, Y.Q.; Me, S.L.; Sanei, S. K-hyperline clustering learning for sparse component analysis. Signal Proc. 2009, 89, 1011–1022. [Google Scholar]
  35. Thiagarajan, J.J.; Ramamurthy, K.N.; Spanias, A. Optimality and stability of the K-hyperline clustering algorithm. Pattern Recognit. Lett. 2011, 32, 1299–1304. [Google Scholar]
  36. Donoho, D.L. Compressed sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306. [Google Scholar]
  37. Chen, S.S.B.; Donoho, D.L.; Saunders, M.A. Atomic decomposition by basis pursuit. Siam Rev. 2001, 43, 129–159. [Google Scholar]
  38. Tsaig, Y.; Donoho, D.L. Extensions of compressed sensing. Signal Proc. 2006, 86, 549–571. [Google Scholar]
  39. Van den Berg, E.; Friedlander, M.P. Probing the Pareto frontier for basis pursuit solutions. SIAM J. Sci. Comput. 2008, 31, 890–912. [Google Scholar]
  40. Van den Berg, E.; Friedlander, M.P. SPGL1: A solver for large-scale sparse reconstruction, 2007. Available online: http://www.cs.ubc.ca/labs/scl/spgl1 (accessed on 29 January 2015).
  41. Stankovic, L.; Orovic, I.; Stankovic, S.; Amin, M. Compressive sensing based separation of nonstationary and stationary signals overlapping in time-frequency. Signal Proc. IEEE Trans. 2013, 61, 4562–4572. [Google Scholar]
  42. Zhao, L.; Wang, L.; Bi, G.; Zhang, L.; Zhang, H. Robust Frequency-Hopping Spectrum Estimation based on Sparse Bayesian Method. Wirel. Commun. IEEE Trans. 2015, 14, 781–793. [Google Scholar]
  43. Allen, J.B.; Rabiner, L. A unified approach to short-time Fourier analysis and synthesis. IEEE Proc. 1977, 65, 1558–1564. [Google Scholar]
Figure 1. Scatter diagram of two mixtures with three sources in (a) time domain; (b) frequency domain by DCT; and (c) TF domain by STFT.
Figure 1. Scatter diagram of two mixtures with three sources in (a) time domain; (b) frequency domain by DCT; and (c) TF domain by STFT.
Sensors 15 06497f1 1024
Figure 2. Scatter diagram of an example of speech utterances with five sources and two mixtures: (a) all STFT coefficients; (b) the detected SSPs.
Figure 2. Scatter diagram of an example of speech utterances with five sources and two mixtures: (a) all STFT coefficients; (b) the detected SSPs.
Sensors 15 06497f2 1024
Figure 3. Three-dof linear model.
Figure 3. Three-dof linear model.
Sensors 15 06497f3 1024
Figure 4. Free-vibration system responses and their power spectral density in the case of well-separated modes, α = 0.08
Figure 4. Free-vibration system responses and their power spectral density in the case of well-separated modes, α = 0.08
Sensors 15 06497f4 1024
Figure 5. Scatter diagram of the identified SSPs using all three sensors in the case of well-separated modes, α = 0.08.
Figure 5. Scatter diagram of the identified SSPs using all three sensors in the case of well-separated modes, α = 0.08.
Sensors 15 06497f5 1024
Figure 6. Recovered modal responses using all three sensors in the case of well-separated modes, α = 0.08.
Figure 6. Recovered modal responses using all three sensors in the case of well-separated modes, α = 0.08.
Sensors 15 06497f6 1024
Figure 7. Scatter diagram of the identified SSPs using the first two sensors in the case of well-separated modes, α = 0.08.
Figure 7. Scatter diagram of the identified SSPs using the first two sensors in the case of well-separated modes, α = 0.08.
Sensors 15 06497f7 1024
Figure 8. Recovered modal responses using the first two sensors in the case of well-separated modes, α = 0.08.
Figure 8. Recovered modal responses using the first two sensors in the case of well-separated modes, α = 0.08.
Sensors 15 06497f8 1024
Figure 9. Free-vibration system responses and their power spectral density in the case of closely-spaced modes.
Figure 9. Free-vibration system responses and their power spectral density in the case of closely-spaced modes.
Sensors 15 06497f9 1024
Figure 10. Scatter diagram of the identified SSPs using two sensors in the case of closely-separated modes.
Figure 10. Scatter diagram of the identified SSPs using two sensors in the case of closely-separated modes.
Sensors 15 06497f10 1024
Figure 11. Recovered modal responses using two sensors in the case of closely-separated modes.
Figure 11. Recovered modal responses using two sensors in the case of closely-separated modes.
Sensors 15 06497f11 1024
Figure 12. Scatter diagram of the identified SSPs using two sensors with 10% RMS noise.
Figure 12. Scatter diagram of the identified SSPs using two sensors with 10% RMS noise.
Sensors 15 06497f12 1024
Figure 13. Under the condition of two sensors, 10% RMS noise, the modal responses recovered by different methods: (a) BP; (b) BPDN.
Figure 13. Under the condition of two sensors, 10% RMS noise, the modal responses recovered by different methods: (a) BP; (b) BPDN.
Sensors 15 06497f13 1024
Figure 14. Two conditions when the proposed method fails because of noise.
Figure 14. Two conditions when the proposed method fails because of noise.
Sensors 15 06497f14 1024
Figure 15. Experimental Setup: (a) experimental setup; (b) high-speed camera head.
Figure 15. Experimental Setup: (a) experimental setup; (b) high-speed camera head.
Sensors 15 06497f15 1024
Figure 16. System responses of the steel beam extracted by image processing.
Figure 16. System responses of the steel beam extracted by image processing.
Sensors 15 06497f16 1024
Figure 17. Theoretical and identified mode shapes.
Figure 17. Theoretical and identified mode shapes.
Sensors 15 06497f17 1024
Figure 18. Scatter diagram of the identified SSPs using only two measurements.
Figure 18. Scatter diagram of the identified SSPs using only two measurements.
Sensors 15 06497f18 1024
Figure 19. Modal responses recovered by the proposed SCA using all of the 21 measurements.
Figure 19. Modal responses recovered by the proposed SCA using all of the 21 measurements.
Sensors 15 06497f19 1024
Figure 20. The modal responses recovered by the proposed SCA using only two measurements.
Figure 20. The modal responses recovered by the proposed SCA using only two measurements.
Sensors 15 06497f20 1024
Table 1. Modal parameters identified by the proposed SCA for the simulation model in the case of well-separated modes.
Table 1. Modal parameters identified by the proposed SCA for the simulation model in the case of well-separated modes.
αModeFrequency (Hz)Damping Ratio(%)MAC



TheoreticalDeterminedUnder-TheoreticalDeterminedUnder-DeterminedUnder-
0.0810.08950.08920.08867.117.117.501.00001.0000
20.14580.14560.14884.374.374.870.99980.9998
30.25220.25210.25222.522.522.551.00001.0000
0.1310.08950.08890.087611.5511.5511.791.00001.0000
20.14580.14540.15177.107.097.571.00000.9996
30.25220.25200.25214.104.104.131.00001.0000
Table 2. Modal parameters identified by the proposed SCA to the simulation model in the case of closely-spaced modes.
Table 2. Modal parameters identified by the proposed SCA to the simulation model in the case of closely-spaced modes.
ModeFrequency (Hz)Damping Ratio(%)MAC



TheoreticalDeterminedUnder-TheoreticalDeterminedUnder-DeterminedUnder-
10.10390.10380.10384.004.004.591.00001.0000
20.34250.34240.34352.002.001.770.99960.9963
30.37130.37120.37182.002.001.711.00000.9983
Table 3. Accuracy of mixing matrix estimation(MAC) and modal parameters identified by the proposed SCA to the simulation model under different noise levels.
Table 3. Accuracy of mixing matrix estimation(MAC) and modal parameters identified by the proposed SCA to the simulation model under different noise levels.
Noise Percentage (%)Mode 1Mode 2Mode 3Successful Identifications
510.99990.999950/50
100.99980.99980.999850/50
150.99950.99950.999547/50
200.99800.97780.998018/50
Table 4. Modal parameters identified by the SSI and the proposed SCA for the experimental steel beam.
Table 4. Modal parameters identified by the SSI and the proposed SCA for the experimental steel beam.
ModeFrequency (Hz)Damping Ratio(%)MAC



TheoreticalSSIDeterminedUnder-SSIDeterminedUnder-DeterminedUnder-
14.224.244.334.331.481.481.461.00001.0000
226.4626.6526.7626.760.360.360.361.00001.0000
374.0974.8374.9274.920.260.260.260.99980.9998

Share and Cite

MDPI and ACS Style

Qin, S.; Guo, J.; Zhu, C. Sparse Component Analysis Using Time-Frequency Representations for Operational Modal Analysis. Sensors 2015, 15, 6497-6519. https://doi.org/10.3390/s150306497

AMA Style

Qin S, Guo J, Zhu C. Sparse Component Analysis Using Time-Frequency Representations for Operational Modal Analysis. Sensors. 2015; 15(3):6497-6519. https://doi.org/10.3390/s150306497

Chicago/Turabian Style

Qin, Shaoqian, Jie Guo, and Changan Zhu. 2015. "Sparse Component Analysis Using Time-Frequency Representations for Operational Modal Analysis" Sensors 15, no. 3: 6497-6519. https://doi.org/10.3390/s150306497

APA Style

Qin, S., Guo, J., & Zhu, C. (2015). Sparse Component Analysis Using Time-Frequency Representations for Operational Modal Analysis. Sensors, 15(3), 6497-6519. https://doi.org/10.3390/s150306497

Article Metrics

Back to TopTop