Next Article in Journal
Robust Direct Power Control of Three-Phase PWM Rectifier with Mismatched Disturbances
Previous Article in Journal
Effect of In-Situ H Doping on the Electrical Properties of In2O3 Thin-Film Transistors
Previous Article in Special Issue
PCA-Based Preprocessing for Clustering-Based Fetal Heart Rate Extraction in Non-Invasive Fetal Electrocardiograms
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

On the Regularization of Recursive Least-Squares Adaptive Algorithms Using Line Search Methods

by
Cristian-Lucian Stanciu
1,*,
Cristian Anghel
1,
Ionuț-Dorinel Fîciu
1,2,
Camelia Elisei-Iliescu
1,
Mihnea-Radu Udrea
1 and
Lucian Stanciu
1
1
Department of Telecommunications, National University of Science and Technology Politehnica Bucharest, 1–3, Iuliu Maniu Blvd., 061071 Bucharest, Romania
2
Academy of Romanian Scientists, Ilfov 3, 050044 Bucharest, Romania
*
Author to whom correspondence should be addressed.
Electronics 2024, 13(8), 1479; https://doi.org/10.3390/electronics13081479
Submission received: 15 February 2024 / Revised: 26 March 2024 / Accepted: 10 April 2024 / Published: 13 April 2024

Abstract

:
Stereophonic acoustic echo cancellation (SAEC) requires the identification of four unknown impulse responses corresponding to four loudspeaker-to-microphone pairs. Recent developments in the field of adaptive filters for SAEC setups have allowed for the handling of a single complex-valued adaptive impulse response, instead of the four classical real-valued adaptive filters. With the simplified framework provided by the widely linear (WL) model, more advanced versions of recursive least-squares (RLS) were employed in order to take advantage of their superior tracking speeds when working with highly correlated input signals (such as speech). Despite the performances and numerical stability provided by using exponentially weighted versions of the RLS family in combination with line search methods (LSMs), the SAEC configurations have limited capabilities in mitigating the negative effects caused by high-level disturbances affecting the two microphone signals. Such is the case of double-talk scenarios, which considerably reduce the update accuracy of the adaptive system. This paper analyzes a regularization technique for the named WL-RLS-LSM adaptive filters by adjusting the correlation matrix associated with the input signals and creating a reaction in the update process. The proposed method is designed to considerably slow (or even freeze) the adaptation process while the disturbance is manifested. Simulation results are discussed in order to validate the theoretical content.

1. Introduction

In the telecommunications domain, every audio terminal is equipped with a number of M microphones and L loudspeakers. When aiming to create the sensation of acoustic directionality, the values of M, respectively of L, are higher than 1. Correspondingly, every loudspeaker-to-microphone pair can form an undesired acoustic echo path and generate an associated acoustic echo signal [1,2,3,4]. The stereophonic configuration is obtained when employing the configuration M = L = 2 , and M × L = 4 unknown impulse responses must be estimated using four adaptive filters in order to cancel the effects of the associated echo signals [5,6,7,8,9,10].
In [2,6,11], the stereophonic acoustic echo cancellation (SAEC) problem was recast using the widely linear (WL) model, and the four adaptive filters were combined into a single adaptive impulse response with complex-valued coefficients. The WL framework maintains the same overall arithmetic effort and performance, with respect to the classical approach. Moreover, the new model can be applied to both the least-mean-square (LMS), respectively the recursive least-squares (RLS), families of adaptive algorithms [6,12,13,14].
From a practical point of view, SAEC applications have to manage hundreds or thousands of filter coefficients. Correspondingly, the convenient solution for the associated system identification problems is the LMS methods, which have poor performance when working with highly correlated input signals (e.g., with speech) [6,15,16,17,18]. As an alternative, the RLS methods have been the subject of ample research efforts because they can reduce the mentioned correlation problem and generate superior results. However, most RLS versions generate a direct solution for a system of N equations by using a complexity proportional to O ( N 3 ) (in terms of multiplications), where N is the length of the associated adaptive filter. The approach is prohibitive for most modern chips. Even when applying less complex solutions to decrease the arithmetic workload, such as the one based on the matrix inversion lemma or the fast-RLS algorithm (complexities of O ( N 2 ) , respectively O ( N ) ), they tend to have numerical stability issues proportional to the reduction that is applied on the complexity side [6,12].
An alternative approach for the RLS adaptive algorithms with direct estimation of coefficients is to exploit the statistical properties of the input signal and replace the mentioned system of equations with an auxiliary system, which can be solved using iterative methods [2,19,20,21,22]. The result determined at each filter iteration is added to the previous filter estimate in order to generate the newest set of coefficients [19]. The approach is convenient for employing various line search methods (LSMs), which comprise several arithmetically efficient versions well suited for processing systems of equations specific to the complex-valued RLS adaptive algorithms [13].
Despite the fact that the named WL-RLS-LSM adaptive algorithms do not manifest numerical stability issues and are attractive for SAEC practical applications, they manifest the same vulnerabilities in high-noise or double-talk (DT) scenarios [23,24,25,26,27,28,29,30]. In such occurrences, the microphone signals are affected by other speech signals, and the update process of the complex-valued adaptive filter loses most of its accuracy. This paper studies a variable regularization (VR) technique for the RLS-specific correlation matrix designed to slow down or freeze the update operation [31,32]. We combine the mentioned regularization method with several complex-valued LSM alternatives, and we analyze the resulting algorithms from several points of interest: tracking speeds, accuracy at steady state, and efficiency in DT scenarios, respectively, and arithmetic complexity.
Traditional approaches require performing DT detection (DTD) on the four adaptive filters, estimating the four unknown impulse responses. The corresponding process of slowing down (or freezing) the update process of the coefficients depends on four associated indicators and can affect the performances of the entire mechanism. By interpreting the four adaptive filters as a single complex-valued filter, the decision is simplified, and the methodology can be focused on the efficiency of fewer decision indicators and parameters (e.g., thresholds, etc.). Consequently, for the case of the WL-RLS-LSM adaptive systems, the VR methodology is applied for a single correlation matrix instead of four such matrices.
The paper is organized as follows. Section 2 describes the system model corresponding to the SAEC setup working with RLS adaptive algorithms employing LSMs. In Section 3, the principle of the variable regularized RLS-LSM methods working within the WL framework is discussed. Section 4 presents several LSMs by taking into consideration the associated arithmetic complexities, and Section 5 analyzes the simulation results with the associated compromises between performances and complexity. Several conclusions are drawn in Section 6 regarding the possible practical applications of the WL-RLS-LSM family of adaptive algorithms.

2. System Model

This section describes the SAEC configuration using the WL model with respect to its corresponding particularities for the complex-valued RLS family of adaptive methods. To this purpose, the basic notation and the main relations that define the WL framework for SAEC from [2,6] are briefly described in this section, summarizing the required background for the upcoming developments. Moreover, using the simplified handling provided by the WL model, the exponentially weighted RLS adaptive algorithm using LSMs (i.e., the WL-RLS-LSM) is presented for solving the corresponding system of equations. The arithmetic workloads are also discussed.

2.1. The Widely-Linear Model

For the SAEC setup, the signals generated by the two loudspeakers represent the input information. We denote the discrete time index as n, and we write the corresponding acoustic signals by using x L c ( n ) (for the left channel) and x R c ( n ) (for the right channel). The WL model combines the two real-valued signals into a single complex-valued notation, and we write
x ( n ) = x L c ( n ) + j x R c ( n ) ,
where j = 1 [6,11]. Correspondingly, we can express the two N × 1 vectors comprising the last samples associated with the two acoustic channels as
x L c ( n ) = [ x L c ( n ) x L c ( n 1 ) x L c ( n N + 1 ) ] T ,
x R c ( n ) = [ x R c ( n ) x R c ( n 1 ) x R c ( n N + 1 ) ] T ,
where the superscript T represents the transpose operator. We can also cumulate all input information into a single N × 1 complex-valued vector as
x ( n ) = x L c ( n ) + j x R c ( n ) .
Furthermore, we denote the four N × 1 impulse responses associated with the loudspeaker-to-microphone echo paths by using h t , L c L c , h t , L c R c , h t , R c L c with h t , R c R c , respectively. The output samples corresponding to the left and right channels are obtained by employing the inputs in (2) and (3) [2,11] and writing
y L c ( n ) = h t , L c L c T x L c ( n ) + h t , R c L c T x R c ( n ) ,
y R c ( n ) = h t , L c R c T x L c ( n ) + h t , R c R c T x R c ( n ) ,
where we can also express a complex-valued output signal in the form of
y ( n ) = y L c ( n ) + j y R c ( n ) .
The four real-valued acoustic impulse responses can be combined into two N × 1 complex-valued systems
h α = h t , L c L c + h t , R c R c 2 + j h t , R c L c h t , L c R c 2 ,
h β = h t , L c L c h t , R c R c 2 j h t , R c L c + h t , L c R c 2 ,
and we introduce a single notation for handling all the real-valued loudspeaker-to-microphone echo paths in the form of a 2 N × 1 complex-valued impulse response using the interleaving operation:
h t = h α , 0 h β , 0 h α , N 1 h β , N 1 T ,
where h α , l and h β , l , with l = 0 , 1 , , N 1 , are the elements of the vectors h α and h β , respectively [6].
The input information can also be arranged in a suitable manner by employing the interleaving operation. We write the 2 N × 1 complex-valued vector
x in ( n ) = x ( n ) x * ( n ) x ( n N + 1 ) x * ( n N + 1 ) T ,
and we can obtain a simpler expression for the complex output as
y ( n ) = h t H x in ( n ) .
where H denotes the Hermitian operator [6].
With the improved handling provided by the WL model, we also write the complex reference signal as
d ( n ) = y ( n ) + w ( n ) ,
where w ( n ) = w L c ( n ) + j w R c ( n ) represents the complex noise affecting the microphone channels. Its corresponding components (i.e., w L c ( n ) and w R c ( n ) ) are uncorrelated with the loudspeaker signals. Moreover, considering the complex estimate y ˜ ( n ) generated by the SAEC system for the output y ( n ) , by employing a single adaptive filter, h ˜ ( n ) , with 2 N × 1 complex-valued coefficients, the a posteriori error can be defined as
e ( n ) = d ( n ) y ˜ ( n ) = d ( n ) h ˜ H ( n ) x in ( n ) .
The described WL model simplifies the handling of the SAEC setup by employing the minimum possible number of notations for all the involved signals. Additionally, four unknown acoustic impulse responses, which would normally require four adaptive algorithms, are now combined into a single complex-valued system estimated using a single adaptive filter. The overall arithmetic complexity and the associated performances do not change [2,6].

2.2. The WL-RLS-LSM Algorithm

The exponentially weighted versions of the RLS adaptive methods are developed based on the cost function
J LS h ˜ ( n ) = i = 1 n λ n i d ( i ) h ˜ H ( n ) x in ( i ) 2 ,
where 0 < λ 1 denotes the forgetting factor [6,12]. The value of λ is associated with a compromise made between the tracking capabilities of the algorithm, on one side, with respect to the accuracy it has at steady state on the other side. For lower values of λ , the memory of the algorithm is diminished, and the contributions associated with older information become negligibly faster. In such cases, the algorithm adapts faster to any changes that might occur in the unknown system to be identified. When the value of λ is closer to 1 (i.e., to infinite memory), the tracking of changes is slower. However, more information about loudspeakers and the respective microphone signals is stored at once. Thus, the accuracy of the RLS is better under a steady state, and the misadjustment decreases [13]. We employ the forgetting factor and express the recursive notations
R ( n ) = λ R ( n 1 ) + x in ( n ) x in H ( n ) ,
p ( n ) = λ p ( n 1 ) + x in ( n ) d ( n ) ,
where R ( n ) is the 2 N × 2 N estimate of the correlation matrix associated with the complex input signal, and p ( n ) represents the 2 N × 1 cross-correlation vector between the input signal and the complex reference, respectively [6,12]. Consequently, we can employ the notations mentioned above, and the minimization of the cost function in (15) leads to the system of equations
R ( n ) h ˜ ( n ) = p ( n ) .
The direct solution for (18) requires computing the inverse of the matrix R ( n ) at every time index, n. Such an approach is prohibitive for modern chips because the usual number of filter coefficients for SAEC applications is in the range of thousands. A popular alternative to the direct solution is the complex-valued RLS method based on the matrix inversion lemma (also known as Woodbury’s identity), which is often used as a reference for research work [12]. In the rest of the paper, we will call the mentioned algorithm the WL-RLS method. It requires a number of complex multiplications proportional to 4 N 2 , as well as 4 N real-valued divisions for every filter iteration. Despite the reduction in computational effort by an order of magnitude, the overall complexity is still considered difficult to approach for SAEC applications, which might also be affected by the associated numerical stability issues.
Fundamental research led to the reinterpretation of the system in (18) in order to exploit the statistical properties of the loudspeaker signals, as they are reflected in the correlation matrix R ( n ) [19,20,21]. Consequently, the exponentially weighted RLS methods can be combined with different LSM variants in order to solve an auxiliary system of equations. The resulting WL-RLS-LSM algorithms can provide performances comparable to the WL-RLS without manifesting numerical stability issues [2,19]. Several versions also have lower arithmetic complexities with respect to the WL-RLS. The steps of the WL-RLS-LSM general method are presented in Table 1, where the corresponding amounts of necessary real-valued multiplications are also displayed.
The initialization stage runs once and employs the 2 N × 1 zero-valued vector 0 2 N × 1 to set the values of the filter coefficients h ˜ ( 0 ) of the 2 N × 1 residual vector, r ( 0 ) . Moreover, the 2 N × 2 N identity matrix I 2 N is used to fill the correlation matrix estimate R ( 0 ) with zeroes on all positions, excepting the main diagonal, where the small, real-valued positive constant Φ is used to avoid the singular property in the initial stages of the algorithm. In the form presented in Table 1, the effect of Φ becomes negligible after the WL-RLS-LSM runs for a certain number of iterations due to the effect of the forgetting factor.
Steps 1 to 7 run each time for every time index n, and the first four stages are the same as the other exponentially weighted RLS methods working with complex values in the SAEC framework [6,11]. Steps 1 and 2 perform the update of the input vector x in ( n ) with respect to the correlation matrix R ( n ) . Apparently, the computation of the latter using (16) requires an amount of multiplications proportional to the square of 2 N . However, the input vector has the t i m e - s h i f t property and can be expressed as
x in ( n ) = x ( n ) x * ( n ) x in , 1 : 2 N 2 T ( n 1 ) T ,
where x in , 1 : 2 N 2 ( n 1 ) is the ( 2 N 2 ) × 1 vector comprising the last 2 N 2 complex-valued input samples from time index n 1 . Correspondingly, R ( n ) can be updated by exploiting its highly redundant structure. The matrix is Hermitian, and every consecutive two columns, starting from the first one, comprise the same information, each pair of values being conjugated and having switched positions in the neighboring column [2,19,33]. The same rules apply to the numerical composition of the rows. Thus, by copying the upper left ( 2 N 2 ) × ( 2 N 2 ) submatrix of R ( n 1 ) to the lower right ( 2 N 2 ) × ( 2 N 2 ) submatrix of R ( n ) , and computing the first column R : , 1 ( n ) as
R : , 1 ( n ) = λ R : , 1 ( n 1 ) + x * ( n ) x in ( n ) ,
then the second column, or the first two rows, respectively, can be determined by copying values from the first column and considering the enumerated properties. We considered that λ can be chosen using the form λ = 1 1 / ( 2 N K ) , where N, respectively the positive constant K, can be expressed as powers of 2. Thus, any multiplication performed with λ can be replaced with a bit-shift and a subtraction. Moreover, it can be deduced that approximately a quarter of the complex values comprising R ( n ) have to be stored in memory. The complexity of step 2 is, therefore, proportional to N in terms of multiplications.
Considering that the filter output computation in step 3 requires 8 N multiplications, and step 4 (the determination of the a priori error) employs no arithmetic operations more complex than additions, the first four steps require only additions and multiplications. The corresponding overall effort is proportional to the adaptive filter’s length, i.e., O ( N ) .
The discussion regarding the rest of the arithmetic complexity is more elaborate when referring to steps 5, 6, and 7, which implement the auxiliary solution associated with the system in (18). The auxiliary system of equations
R ( n ) Δ h ( n ) = p 0 ( n ) ,
that must be solved in step 6 provides an estimate for the solution vector  Δ h ˜ ( n ) and requires the computation of the so-called residual component denoted by p 0 ( n ) (step 5) by employing the residual vector, r ( n 1 ) , generated at the previous filter iteration (weighted using the forgetting factor) and cumulated with the product between the error and the input signal vector [2,19]. Considering that λ can have the form mentioned when discussing step 2, and the redundant structure of x in ( n ) , the only multiplications necessary for step 5 correspond to the multiplication between the conjugate form of e ( n ) and half the values comprising the vector x in ( n ) .
As mentioned before, the system to be solved in step 6 generates the estimate solution vector Δ h ˜ ( n ) and an updated version of the residual vector. The latter will be employed again in the next adaptive filter iteration (in step 5). Moreover, considering the unknown complex-valued system as being fixed for a given number of iterations, as the adaptive filter converges, the absolute values of the real and imaginary parts comprising r ( n ) tend to decrease [33]. The task in step 6 is suitable for LSMs, which can compute both mentioned vectors, Δ h ˜ ( n ) and r ( n ) . There are multiple methods that can be employed in this stage of the WL-RLS-LSM [20]. As they rely more on the specific properties of the input signal of the correlation matrix R ( n ) , with respect to r ( n ) , the arithmetic complexity tends to decrease. However, the correlation properties of some signals, such as speech, tend to be stationary for limited amounts of time, such as the corresponding variations determined in R ( n ) . Validating some low-complexity LSM variants is necessary, especially when combining them with other features. Finally, in step 7, the solution vector is added to the previous set of filter coefficients in order to generate a new estimate.

3. The Variable Regularized WL-RLS-LSM

In [31], the cost function expressed in (15), corresponding to the exponentially weighted RLS methods, was enhanced to take into account a permanent addition to the main diagonal of the correlation matrix. The modification aims to alter the purpose of the regularization parameter Φ , which is traditionally inserted in the initialization stage in order to avoid the non-singular property of R ( n ) . Consequently, the value of Φ will not be arbitrarily chosen, and it will be determined at every time index, n, as a function of the estimated echo-to-noise (ENR) ratio. The proposed approach targets to mitigate the effects of the DT scenarios when the value of the ENR is usually very low.
We define the new cost function as [31,34]
J LS REG h ˜ ( n ) = J LS h ˜ ( n ) + Φ h ˜ ( n ) 2 2 ,
where · 2 denotes the 2 norm. For the minimization of (22), the updated equation of the filter coefficients can be developed using (12), with respect to (14), and then emphasizing two important components as
h ˜ ( n ) = Q ( n ) h ˜ ( n 1 ) + h ̲ ( n ) ,
where we employed the notations
Q ( n ) = I 2 L + R ( n ) + Φ I 2 N 1 x in ( n ) x in H ( n ) ,
h ̲ ( n ) = R ( n ) + Φ I 2 N 1 ( n ) x in ( n ) d ( n ) .
It can be noticed that both occurrences of the matrix R ( n ) are added with the unit matrix I 2 N multiplied by the contribution of the regularization parameter Φ . Furthermore, the notation in (25) can be regarded as the solution for the system of equations
R ( n ) + Φ I 2 L h ̲ ( n ) = x in ( n ) d ( n ) ,
which is relevant because it comprises the contribution of the complex microphone signal d ( n ) and (implicitly) the noise source influencing the performances of the SAEC system. The associated complex-valued error for the newly employed system of equations can be written as
e ̲ ( n ) = d ( n ) h ̲ H ( n ) x in ( n ) ,
where the unknown system h ̲ would be estimated using the set of coefficients h ˜ ̲ . The corresponding update process is influenced by the already introduced complex-valued noise, w ( n ) .
For the system described in (26), with respect to (27), we intend to employ a variable Φ ( n ) at each filter iteration in order to influence the update process. The feature has an equivalent effect on any solutions for (18) or (21), including the WL-RLS-LSM approach. Thus, we impose the equality between the variances of the complex-valued signals e ̲ ( n ) and w ( n ) [31,32] by employing the mathematical expectation E · as
E e ̲ ( n ) 2 = σ w 2 = E w ( n ) 2 .
When replacing (27) in (28), we also employ several approximations. We consider that the complex input signal is stationary, with the statistical properties of white noise. After a reasonable amount of iterations has passed, we can approximate for (16):
R ( n ) ( 1 λ ) x in ( n ) x in H ( n ) σ x in 2 I 2 N ,
where σ x in 2 is the variance of x in . We can write R ( n ) + Φ I 2 N as
R ( n ) + Φ I 2 N σ x in 2 1 λ + Φ I 2 N ,
and we limit the effect of the forgetting factor to the one of a rectangular window with the length of the complex-valued adaptive filter (i.e., to 2 N ). Correspondingly,
i = 1 λ i = 1 1 λ = 2 N ,
where it is reasonable enough to assume that 2 N 1 , and the value of λ can be written as
λ = 1 1 2 N .
The expression in (32) implies that the forgetting factor from (30) can be replaced using the length of the adaptive filter, and we can write
R ( n ) + Φ I 2 L 2 N σ x in 2 + Φ I 2 N .
Additionally, we can use the variance of the input signal to write the approximation
x in H ( n ) x in ( n ) 2 N σ x in 2 .
If we denote the variance of the complex reference signal d ( n ) as σ d 2 , after some calculations, we obtain the equation
σ d 2 2 2 N σ x in 2 2 N σ x in 2 + Φ σ d 2 + 2 N σ x in 2 2 2 N σ x in 2 + Φ 2 σ d 2 = σ w 2 ,
where Φ is considered the unknown quantity. In order to simplify (35), we can denote α = 2 N σ x in 2 , and we obtain
σ d 2 2 α α + Φ σ d 2 + α 2 α + Φ 2 σ d 2 = σ w 2 .
Furthermore, we can express the variance of the reference signal as
σ d 2 = σ y 2 + σ w 2 ,
where the samples of y ( n ) , with respect to w ( n ) , are uncorrelated. By considering the ENR as
ENR = σ y 2 σ w 2 ,
after performing some calculations, the expression in (36) becomes
Φ 2 ENR 2 α Φ α 2 = 0 ,
with the possible solutions
Φ 1 = α 1 + 1 + ENR ENR , Φ 2 = α 1 1 + ENR ENR .
By taking into account that the values on the main diagonal of R ( n ) are always real and positive, the regularization process based on adding Φ to all the values on the main diagonal can be applied by using only positive values [31]. Consequently, from (40), the form of Φ 1 is usable in practice.
A practical concern is the determination of the variances σ x in 2 , σ d 2 , and σ w 2 . An estimate, σ ˜ w 2 , corresponding to σ w 2 is not straightforwardly available. However, after the adaptive filter has converged to a certain point, we can use the output y ˜ ( n ) and write
σ ˜ w 2 = σ ˜ d 2 σ ˜ y ˜ 2 .
At every iteration of the algorithm, for x ( n ) , d ( n ) , and y ˜ ( n ) , which can be generically denoted by c ( n ) , we can use an exponential window for the estimation of the associated variances:
σ ˜ c 2 ( n ) = γ σ ˜ c 2 ( n 1 ) + ( 1 γ ) c ( n ) 2 ,
where 0 < γ 1 is a real-valued, positive, and sub-unitary parameter used to control the memory of the estimates.
A variable-regularized (VR) version of the WL-RLS-LSM is presented in Table 2, which estimates the value of the ENR at every time index, n t, with respect to the variance σ x in 2 . We considered the filter length 2 N as a power of two, and we excluded the corresponding multiplications (they are performed using bit-shifts, just as in the case of Table 1).
We also employed a new column for real-valued divisions, which are necessary in steps 6 and 7. With respect to the WL-RLS-LSM algorithm, the VR-WL-RLS-LSM requires two divisions and one square root operation per filter iteration (see step 7). The impact of the VR portion on the overall arithmetic workload does not change the corresponding orders of degree between Table 1 and Table 2 (for additions, complex multiplications, etc.). With the minimal extra complexity, the VR-WL-RLS-LSM is expected to better mitigate the effects of low ENR scenarios, such as the DT situations. The next section will describe several WL-RLS-LSM versions and demonstrate the practical advantages of performing the associated matrix regularization. The analysis will comprise performances and arithmetic complexities.

4. VR-WL-RLS-LSM Versions

The complex-valued exponentially weighted RLS method can be combined, as mentioned in the previous sections, with different LSMs in step 6 of Table 1, with respect to step 10 of the algorithm presented in Table 2. The LSMs are suited to solve the auxiliary system of equations in (21) by minimizing the cost function [19,35]
F Δ h ˜ ( n ) = 1 2 Δ h ˜ H ( n ) R ( n ) Δ h ˜ ( n ) p 0 ( n ) Δ h ˜ ( n ) .
When considering (43), the residual vector is also named the gradient vector, and can be expressed as
r ( n ) = F Δ h ˜ ( n ) = p 0 ( n ) R ( n ) Δ h ˜ ( n ) ,
where ∇ denotes the gradient operator [19,20,35].
This section will describe several such iterative methods, which generate estimates for the solution vector Δ h ˜ ( n ) , with respect to the residual vector r ( n ) . The presented LSMs have different arithmetic complexities and exploit (up to certain levels) the statistical properties of the input data. They initialize Δ h ˜ ( n ) with 0 2 N × 1 , with respect to the transitory residual vector r tmp , by using the residual component p 0 ( n ) , and this iteratively updates the two complex vectors for a number of N u iterations. At every iteration, k, with k = 1 N u , the step-size (denoted by α and an associated subscript) is determined and employed in order to perform the mentioned updates. As the value of k progresses towards the value of N u (within an adaptive filter’s iteration n), the absolute values of the real and imaginary parts comprising r tmp are expected to decrease. When the LSM iterations are finished, the last set of 2 N values comprising r tmp are output as r ( n ) . Moreover, as mentioned before, when considering the unknown system h t as fixed, the same observation regarding the constituent real and imaginary parts should be true for r ( n ) as h ˜ ( n ) converges towards h t .
For the next three subsections, we will alternatively denote (between parantheses) the time index n or the corresponding LSM iteration k at the associated adaptive filter iteration n. Furthermore, we will use a subscript to denote the position of an element in a vector, with respect to the column of a matrix or the coordinates of a single element in a matrix (when employing 2 coordinates in the subscript). In perspective, the employed LSM performs a number of N u iterations for every time index, n.

4.1. The Conjugate Gradient LSM

The conjugate gradient (CG) method is presented in Table 3 [19,35]. It recursively determines a 2 N × 1 direction vector d ( k ) using r tmp ( k 1 ) (from the previous CG iteration), with respect to the values of the parameter δ at iterations k 1 and k 2 , which are computed as the squared 2 norms of the corresponding r tmp iterations. The orthogonality of the residual vector (i.e., r tmp H ( k ) r tmp ( i ) = 0 for i = 0 k 1 ) and the form in step 1 ensure the R ( n ) conjugacy for the direction vector, i.e.,
d H ( k ) R ( n ) d ( i ) = 0 , i k .
Consequently, the step size α CG determined in step 3 minimizes the cost function in (43), and an update is made for the solution vector, with respect to the residual vector (steps 4 and 5). Every iteration is ended with the computation of the newest value for δ k , corresponding to the latest form r t m p ( k ) .
Despite being considered too complex for acoustic applications, which require hundreds or thousands of filter coefficients, the CG is popular for its attractive performances, especially the tracking speeds. It requires one division in step 1 and another one in step 3, summing up to a total amount of 2 N u of such operations for all N u iterations. However, step 2 is the most costly from an arithmetical point of view, with a multiplication effort proportional to the square of the adaptive filter’s length. It makes the named WL-RLS-CG and VR-WL-RLS-CG algorithms have a complexity of O ( 4 N 2 ) , which is prohibitive for applications requiring long adaptive filters.

4.2. The Coordinate Descent LSM

The coordinate descent (CD) method (Table 4) is designed to exploit the statistical properties of the correlation matrix [19,21,35]. The values comprising R ( n ) are real and positive on the main diagonal. Moreover, the absolute values of the real and imaginary parts corresponding to the other elements of the matrix are much smaller than the values situated on the main diagonal (from a statistical point of view).
In step 1, the CD algorithm searches for the position with the maximum absolute value in the residual vector r t m p ( k ) , and determines the step size α CD using the associated value on the main diagonal of R ( n ) (step 2). Step 3 updates a single value (its real or imaginary part) in the solution vector Δ h ˜ t m p , and step 4 adjusts r t m p ( k ) correspondingly.
The CD method requires one division per iteration and an amount of multiplications proportional to the adaptive filter’s length. Together with the CG, the CD iterations are called exact line search methods. Nevertheless, the arithmetic workloads of the WL-RLS-CD, respectively VR-WL-RLS-CD, are of O ( 2 N N u ) multiplications, which makes them more suitable for hardware implementations of acoustic applications.

4.3. The Dichotomous Coordinate Descent LSM

The dichotmous coordinate descent (DCD) iterations are presented in Table 5 [2,19,21,22,35]. It is an inexact LSM, which employs a greedy approach when solving the auxiliary system in (21). By searching at each iteration for the maximum absolute value among the real and imaginary components of the residual vector r tmp (step 1), the DCD determines the most likely position on which the method could perform a so-called successful iteration. With respect to the CG and CD methods, for the DCD, the significance of parameter N u is slightly different. It represents the maximum number of successful updates.
When selecting the initial value H max for the step size as a power of two, all the multiplication operations with α DCD can be replaced by bit-shifts. The updates performed in steps 6 and 7 are decided based on the condition assessed in step 4, where the associated value on the main diagonal of R ( n ) scaled with α DCD is used as a reference for comparison. The method attempts to execute updates equivalent to setting bits in a binary numerical representation up to a depth of M b bits. One update represents a single bit set to the value 1 in a real or imaginary position from the solution vector. When the current bit to be processed (i.e., the variable m) indicates a value greater than M b , the algorithm stops, even if k < N u (step 3).
The DCD is the most efficient from an arithmetical point of view. It requires only additions [ O ( 2 N N u ) ], and no multiplications or divisions. With a correct use of the value H max , the multiplications are all replaced by bit-shifts. Considering that the DCD method can have unsuccessful iterations, it is possible that many of the DCD runs employ less than N u updates (especially when the adaptive algorithm approaches convergence). Consequently, the namely WL-RLS-DCD, respectively the VR-WL-RLS-DCD, are the most attractive variants from the point of view of necessary chip areas for SAEC hardware applications.

5. Simulations

This section presents simulations performed in the context of SAEC. It is comprised of two subsections, which describe the practical aspects regarding the considered setup, respectively the presentation of the actual simulation results and the associated interpretations.

5.1. Practical Considerations

For the SAEC setup using different WL-RLS-LMS/VR-WL-RLS-LMS variants, we employed different types of input signals with different correlation properties, such as Gaussian noise sequences filtered through an autoregressive system with a single pole [AR(1)] with the value 0.99 or speech signals. Moreover, for each simulation, the original signal is filtered through two real-world acoustic paths in order to simulate the propagation from the acoustic source to each of the two microphones.
In comparison to single-channel acoustic echo cancellation (AEC) scenarios, the SAEC system has to mitigate the correlation between the two input channels, which can drastically reduce the performance of the adaptive algorithm. Therefore, after generating the two acoustic inputs, we use the predistortion block introduced in [6,11] in order to reduce the mentioned correlation. The samples of the new input signal can be expressed using the half-wave rectifier, as
x ^ L c ( n ) = x L c ( n ) + α r x L c ( n ) + | x L c ( n ) |
x ^ R c ( n ) = x R c ( n ) + α r x R c ( n ) | x R c ( n ) | ,
where we can have the predistortion parameter  α r < 0.5 . In practice, the effect of α r is considered adequate for the interval ( 0 , 0.25 ] in order to keep the distortion effect within acceptable limits. The new signals, x ^ L c ( n ) and x ^ R c ( n ) , are computed before being reproduced by the loudspeakers and become the inputs of the complex-valued adaptive filter.
The four acoustic impulse responses generating the echo signals are also real-world measured impulse responses decimated to obtain the lengths N = 128 and N = 256 . After combining them according to (8)–(10), the obtained complex echo path h t is employed at every filter iteration for measuring the normalized misalignment(NM) [6]:
NM ( n ) = 10 lg h t h ˜ ( n ) 2 2 h t 2 2 [ dB ] .
Finally, when displaying simulation results using the performance criterion expressed in (48), we do not consider the initial convergence as relevant.

5.2. Results

For the first set of experiments, the WL-RLS-CG adaptive algorithm was studied in tracking scenarios with four acoustic echo paths having the length N = 128 . In order to simulate the acoustic echo impulse response changes, the corresponding sets of coefficients were shifted by 25 positions after some simulation time. For the input signals, we employed Gaussian noise filtered through the AR(1) system mentioned above with respect to a speech sequence.
The results are displayed in Figure 1a,b. The performance of the WL-RLS based on Woodbury’s identity was also added as a reference. It can be noticed that increasing the number of iterations N u leads to better tracking speeds, with the compromise of lower accuracy at a steady state. In Figure 1a, the WL-RLS-CG with the highest values of N u converge in less than 2 s and match the tracking speed of the WL-RLS. At steady state, the WL-RLS-CG with N u = 4 outperforms the WL-RLS by more than 5 dB for the NM value. The gap in performance between the different values of N u is narrower in Figure 1b, where the input is speech. As expected, the steady-state values for the NM are higher (i.e., lower accuracy with highly correlated input signals). The curves for N u 2 , with respect to WL-RLS, have very similar tracking speeds, with only N u = 2 having consistent NM values below 20 dB at a steady state. Although for N u = 1 the NM approaches 25 dB for most of the time, its corresponding tracking speed is 2.5 seconds longer than the cases for N u 2 (i.e., double).
In both cases, above a certain value for the parameter N u , the associated extra arithmetic workload does not provide satisfactory results (i.e., the tracking performances become capped for the higher values of N u ). The WL-RLS is the algorithm to match during the tracking period, and has the worst accuracy after convergence is achieved.
The scenarios discussed in the first set of experiments were repeated for the WL-RLS-CD adaptive algorithm (Figure 2a,b) with respect to the WL-RLS-DCD (Figure 3a,b). The differences between the tracking performances with respect to the value of the parameter N u are more evident in comparison to the WL-RLS-CG method. For the case of Figure 2a (the input is an AR(1) sequence), the WL-RLS has superior tracking speeds with respect to all versions of the WL-RLS-CD, including the variant with N u = 8 , which requires two more seconds to reach a steady state after the echo paths change. In the scenario corresponding to speech input (i.e., for Figure 2b), the WL-RLS and the WL-RLS-CD with N u = 8 re-converge approximately at the same time. However, all the CD variants are more accurate at a steady state and attain NM values of less than 25 dB, while the WL-RLS rarely achieves misalignment values of 20 dB.
In Figure 3a,b, the WL-RLS-DCD benefit from the greedy nature of the DCD method and are able to match the tracking speeds of the WL-RLS algorithm for N u = 4 , with respect to N u = 8 . Similarly to the previous two figures, at a steady state, the WL-RLS-DCD outperforms the WL-RLS. The accuracy at steady state is better for both the WL-RLS-CD and the WL-RLS-DCD, with gaps of up to 9 dB for all values of N u in comparison to the reference WL-RLS.
For the following simulations, we consider that the tracking performances associated with all presented LSMs are suitable for N u > 1 and that a number of iterations, N u = 8 , does not provide satisfactory improvements in performance in order to justify the corresponding arithmetic workload. Consequently, we will analyze the behavior of the different VR-WL-RLS-LSM versions for N u = 2 and N u = 4 .
In Figure 4, the WL-RLS-CG and VR-WL-RLS-CG methods are studied when the acoustic echo paths change, and a DT situation occurs later in the same simulation. The tracking happens at the moment t 0 = 120 s, and the DT manifests in the interval [ 230 , 234 ] seconds (i.e., another speech sequence is added to the microphone signals). The input signal is also speech. It can be noticed that the VR-WL-RLS-CG versions track the echo path changes approximately 10 s slower than the non-VR counterparts. However, they have better accuracy at steady state, and the VR-WL-RLS-CG with N u = 2 outperforms the VR variant with N u = 4 .
When the DT situation occurs, the NM values associated with the WL-RLS-CG versions increase up to 14.5 dB. At the same time, both VR-WL-RLS-CG variants perform the update process in a slower manner, and the corresponding spikes in accuracy loss are situated at approximately 11 dB (more than 25 dB lower). Consequently, both VR variants return to the accuracy levels from before the DT occurrence with a delay of approximately 4 s (i.e., around a time of 238 s). Considering that the non-VR algorithms need more than 10 s to re-converge and attain previous accuracies at time 244 s, the results demonstrate that the accuracy penalties imposed on the VR-WL-RLS-CG filters are greatly diminished.
The same observations can be made when repeating the scenario for WL-RLS-CD and VR-WL-RLS-CD (Figure 5) with respect to WL-RLS-DCD and VR-WL-RLS-DCD (Figure 6). The VR versions suffer losses in terms of their tracking capabilities. Firstly, the WL-RLS-CD variants completely track the echo path changes approximately 10 s faster than their VR counterparts with the same N u values. However, when the DT occurs, the differences between accuracies are around 24 dB in favor of the VR-WL-RLS-CD variants.
Secondly, for the DCD-based algorithms, the differences in tracking capabilities are less influenced by the value of N u and more by employing the VR approach. Consequently, the VR-WL-RLS-DCD has similar tracking speeds for N u = 2 and N u = 4 and a 12–13 s delay for fully estimating the complex echo path after it changes (with respect to the WL-RLS-DCD). Nevertheless, in the DT interval, the non-VR algorithms have an NM of up to 13 dB, while the VR counterparts have accuracies no worse than 12 dB.
In Figure 7, all the VR versions from the three previous simulations are compared in the same scenario (i.e., first the tracking and then the DT period). It can be noticed that the VR-WL-RLS-CG has better convergence speeds, with a penalty in terms of accuracy at a steady state, with respect to the CD and the DCD versions. The VR-WL-RLS-DCD with both N u values and the VR-WL-RLS-CD with N u = 4 track the changes in the echo paths with a 5–6 s delay, and the VR-WL-RLS-CD with N u = 2 has the slowest reaction with a more than 15 s delay, with respect to the VR-WL-RLS-CG. However, at a steady state, the latter has NM values with minimum values of 22 dB, while the VR-WL-RLS-DCD and the VR-WL-RLS-CD with N u = 4 can easily reach 23 dB. The best steady-state results are obtained for VR-WL-RLS-CD for N u = 2 with NM values of around 24 and 25 dB. In the DT interval, all the methods have similar performances for both values of N u (the worst performance is situated around 12 dB). From a practical point of view, the VR-WL-RLS-CD and the VR-WL-RLS-DCD are more attractive because they require fewer arithmetic resources and deliver satisfactory results.
For the next experiments, the tracking and DT scenario was simulated again with the ENR set to 10 dB. In Figure 8 (WL-RLS-CG vs. VR-WL-RLS-CG), Figure 9 (WL-RLS-CD vs. VR-WL-RLS-CD), and Figure 10 (WL-RLS-DCD vs. VR-WL-RLS-DCD), it can be noticed that the difference in performance between the VR and non-VR algorithms has increased. At a steady state, all the algorithms have worse NM values with respect to the case of the scenarios with ENR = 25 dB. Although the delay in tracking manifests for all VR methods (like in previous cases), the performance reductions in the DT occurrences are smaller. The VR-WL-RLS-CG has a slight jump from 11.5 dB (at a steady state) to 9.5 for 3–4 s during the DT interval. Similarly, the VR-WL-RLS-CD has a performance reduction of 2.5 dB, and the VR-WL-RLS-DCD has a performance reduction of 2 dB, with the same re-convergence interval of less than 4 s. Moreover, we can notice that the discrepancies in performances between the VR and non-VR algorithms are higher than in the ENR = 25 dB scenarios. From the differences in 3–4 dB, the VR versions perform well and are more than 7 dB better in Figure 8, 4.5 dB in Figure 9, and 6 dB in Figure 10. The regularization leads to better accuracy at a steady state for all VR algorithms, and the update processes are almost unaffected by the DT occurrences.
Moreover, when only the VR versions are compared in Figure 11, the misalignment curves are very similar. Considering the compromise between performance and chip area costs, the VR-WL-RLS-DCD method can be considered the most attractive as the ENR decreases.

6. Conclusions

The paper describes and analyzes several RLS-LSM adaptive algorithms for SAEC applications. The WL framework employs complex-valued variables and simplifies the handling of the system. The exponentially weighted complex-valued RLS algorithm working within the WL model is combined with several LSMs in order to replace the classical system of equations with an auxiliary set of equations. Three types of LSMs are combined with the VR mechanism applied to the complex-valued RLS. The CG is the most costly method, which leads to an overall arithmetic complexity proportional to the square of the VR-WL-RLS-CG adaptive filter’s length. The CD and the DCD methods are progressively simpler in terms of arithmetic workloads and are designed to exploit the statistical properties of the input signals. Both VR-WL-RLS-CD and VR-WL-RLS-DCD require O ( 2 N ) multiplications, with an advantage for the latter, which solves the auxiliary system of equations using only additions and bit-shifts.
The simulation results demonstrate that the VR variants of WL-RLS-LMSs decrease the NM by approximately 25 dB during the DT intervals, with the compromise of reducing the tracking speeds of the corresponding algorithms. Moreover, the VR-WL-RLS-LMSs have better accuracy at a steady state with respect to the non-VR versions (at least 2–3 dB in terms of NM for ENR = 25 dB, and more than 7 dB in terms of NM for ENR = 10 dB). The performance gap at a steady state is better in favor of the VR variants as the ENR decreases.
The simulation results also showed that the greedy nature of the DCD iterations partially compensates for the tracking loss of VR-WL-RLS-DCD and allows it to match the tracking performances generated by VR-WL-RLS-CG. The associated robust performances in low-ENR conditions (including DT situations), combined with the overall arithmetic complexity (which is directly proportional to the adaptive filter’s length), make VR-WL-RLS-DCD an attractive candidate for hardware implementations for SAEC. Future research will concentrate on in-depth analysis for efficient hardware implementations (fixed-point vs. floating-point) of WL-RLS-DCD incorporating a VR approach.

Author Contributions

Conceptualization, C.-L.S., M.-R.U., and C.E.-I.; Formal Analysis, C.A., C.E.-I., and C.-L.S.; Methodology, C.-L.S., and M.-R.U.; Software, I.-D.F., L.S., and C.-L.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Program for Research of the National Association of Technical Universities—GNAC ARUT 2023 under Grant 36/09.10.2023, code 133, and in part by the Ministry of Research, Innovation and Digitization, CNCS–UEFISCDI, under Project PN-III-P1-1.1-TE-2021-0393, within PNCDI III. The work of Ionuț-Dorinel Fîciu was also supported by a grant of the Academy of Romanian Scientists (AOŞR), within AOŞR-TEAMS-III, project registration code 131/2024.

Data Availability Statement

The data presented in this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Benesty, J.; Huang, Y. Adaptive Signal Processing–Applications to Real-World Problems; Springer: Berlin/Heidelberg, Germany, 2003. [Google Scholar]
  2. Stanciu, C.; Benesty, J.; Paleologu, C.; Gänsler, T.; Ciochină, S. A widely linear model for stereophonic acoustic echo cancellation. Signal Process. 2013, 93, 511–516. [Google Scholar] [CrossRef]
  3. Benesty, J.; Morgan, D.; Sondhi, M. A better understanding and an improved solution to the specific problems of stereophonic acoustic echo cancellation. IEEE Trans. Speech Audio Process. 1998, 6, 156–165. [Google Scholar] [CrossRef]
  4. Diniz, P.S.R. Adaptive Filtering: Algorithms and Practical Implementation, 4th ed.; Springer: New York, NY, USA, 2013. [Google Scholar]
  5. Sondhi, M.; Morgan, D.; Hall, J. Stereophonic acoustic echo cancellation-an overview of the fundamental problem. IEEE Signal Process. Lett. 1995, 2, 148–151. [Google Scholar] [CrossRef] [PubMed]
  6. Benesty, J.; Paleologu, C.; Gänsler, T.; Ciochină, S. A Perspective on Stereophonic Acoustic Echo Cancellation; Springer: Berlin/Heidelberg, Germany, 2011; Volume 4. [Google Scholar] [CrossRef]
  7. Hong, J. Stereophonic Acoustic Echo Suppression for Speech Interfaces for Intelligent TV Applications. IEEE Trans. Consum. Electron. 2018, 64, 153–161. [Google Scholar] [CrossRef]
  8. Cho, B.J.; Park, H.M. Stereo Acoustic Echo Cancellation Based on Maximum Likelihood Estimation with Inter-Channel-Correlated Echo Compensation. IEEE Trans. Signal Process. 2020, 68, 5188–5203. [Google Scholar] [CrossRef]
  9. Schneider, M.; Kellermann, W. Multichannel acoustic echo cancellation in the wave domain with increased robustness to nonuniqueness. IEEE/ACM Trans. Audio Speech Lang. Process. 2016, 24, 518–529. [Google Scholar] [CrossRef]
  10. Lv, S.; Zhao, H.; Xu, W. Robust widely linear affine projection M-estimate adaptive algorithm: Performance analysis and application. IEEE Trans. Signal Process. 2023, 71, 3623–3636. [Google Scholar] [CrossRef]
  11. Stanciu, C.; Benesty, J.; Paleologu, C.; Gänsler, T.; Ciochină, S. A novel perspective on stereophonic acoustic echo cancellation. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Kyoto, Japan, 25–30 March 2012; pp. 25–28. [Google Scholar] [CrossRef]
  12. Haykin, S. Adaptive Filter Theory; Prentice-Hall Information and System Sciences Series; Prentice Hall: Hoboken, NJ, USA, 2002. [Google Scholar]
  13. Sayed, A. Adaptive Filters; Wiley: New York, NY, USA, 2008. [Google Scholar]
  14. Chu, Y.J.; Chan, S.C. A New Local Polynomial Modeling-Based Variable Forgetting Factor RLS Algorithm and Its Acoustic Applications. IEEE/ACM Trans. Audio Speech Lang. Process. 2015, 23, 2059–2069. [Google Scholar] [CrossRef]
  15. Romoli, L.; Cecchi, S.; Peretti, P.; Piazza, F. A Mixed Decorrelation Approach for Stereo Acoustic Echo Cancellation Based on the Estimation of the Fundamental Frequency. IEEE Trans. Audio Speech Lang. Process. 2012, 20, 690–698. [Google Scholar] [CrossRef]
  16. Romoli, L.; Cecchi, S.; Piazza, F. A novel decorrelation approach for multichannel system identification. In Proceedings of the 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Florence, Italy, 4–9 May 2014; pp. 6652–6656. [Google Scholar] [CrossRef]
  17. Paleologu, C.; Benesty, J.; Ciochină, S. Data-Reuse Recursive Least-Squares Algorithms. IEEE Signal Process. Lett. 2022, 29, 752–756. [Google Scholar] [CrossRef]
  18. Hänsler, E.; Schmidt, G. Acoustic Echo and Noise Control—A Practical Approach; Wiley: Hoboken, NJ, USA, 2004. [Google Scholar]
  19. Zakharov, Y.V.; White, G.P.; Liu, J. Low-Complexity RLS Algorithms Using Dichotomous Coordinate Descent Iterations. IEEE Trans. Signal Process. 2008, 56, 3150–3161. [Google Scholar] [CrossRef]
  20. Liu, J.; Zakharov, Y.V.; Weaver, B. Architecture and FPGA Design of Dichotomous Coordinate Descent Algorithms. IEEE Trans. Circuits Syst. I Regul. Pap. 2009, 56, 2425–2438. [Google Scholar] [CrossRef]
  21. Zakharov, Y.V.; Nascimento, V.H. DCD-RLS Adaptive Filters With Penalties for Sparse Identification. IEEE Trans. Signal Process. 2013, 61, 3198–3213. [Google Scholar] [CrossRef]
  22. Yu, Y.; Lu, L.; Zheng, Z.; Wang, W.; Zakharov, Y.; de Lamare, R.C. DCD-Based Recursive Adaptive Algorithms Robust Against Impulsive Noise. IEEE Trans. Circuits Syst. II Express Briefs 2020, 67, 1359–1363. [Google Scholar] [CrossRef]
  23. Paleologu, C.; Benesty, J.; Ciochină, S. A Robust Variable Forgetting Factor Recursive Least-Squares Algorithm for System Identification. IEEE Signal Process. Lett. 2008, 15, 597–600. [Google Scholar] [CrossRef]
  24. Malik, S.; Wung, J.; Atkins, J.; Naik, D. Double-Talk Robust Multichannel Acoustic Echo Cancellation Using Least-Squares MIMO Adaptive Filtering: Transversal, Array, and Lattice Forms. IEEE Trans. Signal Process. 2020, 68, 4887–4902. [Google Scholar] [CrossRef]
  25. Zhang, Y.; Wu, T.; Zakharov, Y.V.; Li, J. MMP-DCD-CV based sparse channel estimation algorithm for underwater acoustic transform domain communication system. Appl. Acoust. 2019, 154, 43–52. [Google Scholar] [CrossRef]
  26. Liao, M.; Zakharov, Y.V. DCD-based joint sparse channel estimation for OFDM in virtual angular domain. IEEE Access 2021, 9, 102081–102090. [Google Scholar] [CrossRef]
  27. Yu, Y.; Lu, L.; Zakharov, Y.; de Lamare, R.C.; Chen, B. Robust sparsity-aware RLS algorithms with jointly-optimized parameters against impulsive noise. IEEE Signal Process. Lett. 2022, 29, 1037–1041. [Google Scholar] [CrossRef]
  28. Yu, Y.; Huang, Z.; He, H.; Zakharov, Y.; de Lamare, R.C. Sparsity-aware robust normalized subband adaptive filtering algorithms with alternating optimization of parameters. IEEE Trans. Circuits Syst. II Express Briefs 2022, 69, 3934–3938. [Google Scholar] [CrossRef]
  29. Niedźwiecki, M.; Gańcza, A.; Shen, L.; Zakharov, Y. Adaptive identification of sparse underwater acoustic channels with a mix of static and time-varying parameters. Signal Process. 2022, 200, 108664. [Google Scholar] [CrossRef]
  30. Yu, Y.; Ye, J.; Zakharov, Y.; He, H. Robust proportionate subband adaptive filter algorithms with optimal variable step-size. IEEE Trans. Circuits Syst. II Express Briefs 2024, in press. [CrossRef]
  31. Benesty, J.; Paleologu, C.; Ciochina, S. Regularization of the RLS Algorithm. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 2011, E94.A, 1628–1629. [Google Scholar] [CrossRef]
  32. Elisei-Iliescu, C.; Stanciu, C.L.; Paleologu, C.; Benesty, J.; Anghel, C.; Ciochină, S. Robust Variable-Regularized RLS Algotihms. In Proceedings of the 2017 Hands-Free Speech Communications and Microphone Arrays (HSCMA), San Francisco, CA, USA, 1–3 March 2017. [Google Scholar] [CrossRef]
  33. Stanciu, C.; Anghel, C.; Stanciu, L. Efficient FPGA implementation of the DCD-RLS algorithm for stereo acoustic echo cancellation. In Proceedings of the 2015 International Symposium on Signals, Circuits and Systems (ISSCS), Iasi, Romania, 9–10 July 2015; pp. 1–4. [Google Scholar] [CrossRef]
  34. Elisei-Iliescu, C.; Paleologu, C.; Benesty, J.; Stanciu, C.; Anghel, C.; Ciochină, S. Recursive Least-Squares Algorithms for the Identification of Low-Rank Systems. IEEE/ACM Trans. Audio Speech Lang. Process. 2019, 27, 903–918. [Google Scholar] [CrossRef]
  35. Chang, P.S.; Willson, A.N. Analysis of Conjugate Gradient Algorithms for Adaptive Filtering. IEEE Trans. Signal Process. 2000, 48, 409–418. [Google Scholar] [CrossRef]
Figure 1. Misalignment of the WL-RLS and WL-RLS-CG for different values of N u . The input signal is an AR(1) sequence with the pole 0.99 for (a), with respect to a speech sequence for (b). The input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 128 and λ = 1 1 / ( 16 N ) . The echo paths change at time index t a = 50 s for scenario (a), with respect to t b = 60 s. The ENR is experimentally set to 25 dB.
Figure 1. Misalignment of the WL-RLS and WL-RLS-CG for different values of N u . The input signal is an AR(1) sequence with the pole 0.99 for (a), with respect to a speech sequence for (b). The input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 128 and λ = 1 1 / ( 16 N ) . The echo paths change at time index t a = 50 s for scenario (a), with respect to t b = 60 s. The ENR is experimentally set to 25 dB.
Electronics 13 01479 g001
Figure 2. Misalignment of the WL-RLS and WL-RLS-CD for different values of N u . The input signal is an AR(1) sequence with the pole 0.99 for (a), with respect to a speech sequence for (b). The input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 128 and λ = 1 1 / ( 16 N ) . The echo paths change at time index t a = 50 s for scenario (a), with respect to t b = 60 s. The ENR is experimentally set to 25 dB.
Figure 2. Misalignment of the WL-RLS and WL-RLS-CD for different values of N u . The input signal is an AR(1) sequence with the pole 0.99 for (a), with respect to a speech sequence for (b). The input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 128 and λ = 1 1 / ( 16 N ) . The echo paths change at time index t a = 50 s for scenario (a), with respect to t b = 60 s. The ENR is experimentally set to 25 dB.
Electronics 13 01479 g002
Figure 3. Misalignment of WL-RLS and WL-RLS-DCD for different values of N u . The input signal is an AR(1) sequence with the pole 0.99 for (a), with respect to a speech sequence for (b). The input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 128 and λ = 1 1 / ( 16 N ) . The echo paths change at time index t a = 50 s for scenario (a), with respect to t b = 60 s. The ENR is experimentally set to 25 dB.
Figure 3. Misalignment of WL-RLS and WL-RLS-DCD for different values of N u . The input signal is an AR(1) sequence with the pole 0.99 for (a), with respect to a speech sequence for (b). The input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 128 and λ = 1 1 / ( 16 N ) . The echo paths change at time index t a = 50 s for scenario (a), with respect to t b = 60 s. The ENR is experimentally set to 25 dB.
Electronics 13 01479 g003
Figure 4. Misalignment of WL-RLS-CG and VR-WL-RLS-CG for different values of N u . The input signal is a speech sequence. Input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and for the VR algorithm, it is γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 25 dB.
Figure 4. Misalignment of WL-RLS-CG and VR-WL-RLS-CG for different values of N u . The input signal is a speech sequence. Input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and for the VR algorithm, it is γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 25 dB.
Electronics 13 01479 g004
Figure 5. Misalignment of WL-RLS-CD and VR-WL-RLS-CD for different values of N u . The input signal is a speech sequence. Input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and for the VR algorithm, it is γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 25 dB.
Figure 5. Misalignment of WL-RLS-CD and VR-WL-RLS-CD for different values of N u . The input signal is a speech sequence. Input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and for the VR algorithm, it is γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 25 dB.
Electronics 13 01479 g005
Figure 6. Misalignment of the WL-RLS-DCD and VR-WL-RLS-DCD for different values of N u . The input signal is a speech sequence. Input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 and λ = 1 1 / ( 64 N ) , and for the VR algorithm, it is γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 25 dB.
Figure 6. Misalignment of the WL-RLS-DCD and VR-WL-RLS-DCD for different values of N u . The input signal is a speech sequence. Input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 and λ = 1 1 / ( 64 N ) , and for the VR algorithm, it is γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 25 dB.
Electronics 13 01479 g006
Figure 7. Misalignment of the VR-WL-RLS-CG, VR-WL-RLS-CD, and VR-WL-RLS-DCD for different values of N u . The input signal is a speech sequence. The input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 25 dB.
Figure 7. Misalignment of the VR-WL-RLS-CG, VR-WL-RLS-CD, and VR-WL-RLS-DCD for different values of N u . The input signal is a speech sequence. The input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 25 dB.
Electronics 13 01479 g007
Figure 8. Misalignment of WL-RLS-CG and VR-WL-RLS-CG for different values of N u . The input signal is a speech sequence. The input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and for the VR algorithm, it is γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 10 dB.
Figure 8. Misalignment of WL-RLS-CG and VR-WL-RLS-CG for different values of N u . The input signal is a speech sequence. The input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and for the VR algorithm, it is γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 10 dB.
Electronics 13 01479 g008
Figure 9. Misalignment of the WL-RLS-CD and VR-WL-RLS-CD for different values of N u . The input signal is a speech sequence. Input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and for the VR algorithm γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 10 dB.
Figure 9. Misalignment of the WL-RLS-CD and VR-WL-RLS-CD for different values of N u . The input signal is a speech sequence. Input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and for the VR algorithm γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 10 dB.
Electronics 13 01479 g009
Figure 10. Misalignment of WL-RLS-DCD and VR-WL-RLS-DCD for different values of N u . The input signal is a speech sequence. Input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and for the VR algorithm, it is γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 10 dB.
Figure 10. Misalignment of WL-RLS-DCD and VR-WL-RLS-DCD for different values of N u . The input signal is a speech sequence. Input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and for the VR algorithm, it is γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 10 dB.
Electronics 13 01479 g010
Figure 11. Misalignment of VR-WL-RLS-CG, VR-WL-RLS-CD, and VR-WL-RLS-DCD for different values of N u . The input signal is a speech sequence. The input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 10 dB.
Figure 11. Misalignment of VR-WL-RLS-CG, VR-WL-RLS-CD, and VR-WL-RLS-DCD for different values of N u . The input signal is a speech sequence. The input signals are predistorted with α r = 0.175 . The length of the four unknown impulse responses is N = 256 , λ = 1 1 / ( 64 N ) , and γ = 0.999 . The echo paths change at time index t 0 = 120 , and a DT situation occurs in the time interval [230, 234] s. The ENR is experimentally set to 10 dB.
Electronics 13 01479 g011
Table 1. WL-RLS-LSM Algorithm.
Table 1. WL-RLS-LSM Algorithm.
StepActions×
Initialization :
h ˜ ( 0 ) = 0 2 N × 1 ; 0
r ( 0 ) = 0 2 N × 1 ; 0
R ( 0 ) = Φ I 2 N , Φ > 0 0
For n = 1 , 2 , 3
1      Update x in ( n ) using (1) and (11)0
2      Update R ( n ) using time-shift
       R : , 1 ( n ) = λ R : , 1 ( n 1 ) + x * ( n ) x in ( n ) 4 N
3       y ˜ ( n ) = h ˜ H ( n 1 ) x in ( n ) 8 N
4       e ( n ) = d ( n ) y ˜ ( n ) 0
5       p 0 ( n ) = λ r ( n 1 ) + e * ( n ) x in ( n ) 4 N
6       R ( n ) Δ h ( n ) = p 0 ( n ) LSM Δ h ˜ ( n ) , r ( n ) ...
7       h ˜ ( n ) = h ˜ ( n 1 ) + Δ h ˜ ( n ) 0
Table 2. VR-WL-RLS-LSM Algorithm.
Table 2. VR-WL-RLS-LSM Algorithm.
StepActions×/
Initialization :
h ˜ ( 0 ) = 0 2 N × 1 ; 00
r ( 0 ) = 0 2 N × 1 ; 00
R ( 0 ) = Φ ˜ ( 0 ) I 2 N , Φ ˜ ( 0 ) > 0 00
σ ˜ x 2 ( n ) = 0 ; σ ˜ d 2 ( n ) = 0 ; σ ˜ y ˜ 2 ( n ) = 0 ; σ ˜ w ˜ 2 ( n ) = 0 00
For n = 1 , 2 , 3
1      Update x in ( n ) using (1) and (11)00
2      Update R ( n ) using time-shift
       R : , 1 ( n ) = λ R : , 1 ( n 1 ) + x * ( n ) x in ( n ) 4 N 0
3      Update σ ˜ x in 2 ( n ) , σ ˜ d 2 ( n ) using (42); α = 2 N σ ˜ x in 2 ( n ) 80
4       y ˜ ( n ) = h ˜ H ( n 1 ) x in ( n ) 8 N 0
5      Update σ ˜ y ˜ 2 ( n ) using (42)40
6       σ ˜ w 2 ( n ) = σ ˜ d 2 ( n ) σ ˜ y ˜ 2 ( n ) ; ENR ˜ = σ ˜ y ˜ 2 ( n ) σ ˜ w 2 ( n ) 01
7       Φ ˜ ( n ) = 1 + 1 + ENR ˜ α / ENR ˜ 11
8       e ( n ) = d ( n ) y ˜ ( n ) 00
9       p 0 ( n ) = λ r ( n 1 ) + e * ( n ) x in ( n ) 4 N 0
10       R ( n ) + Φ ˜ ( n ) I 2 N Δ h ( n ) = p 0 ( n ) LSM Δ h ˜ ( n ) , r ( n ) ......
11       h ˜ ( n ) = h ˜ ( n 1 ) + Δ h ˜ ( n ) 00
Table 3. Complex-valued CG algorithm.
Table 3. Complex-valued CG algorithm.
StepActions×/
Initialization :
Δ h tmp = 0 2 N × 1 ; r tmp ( 0 ) = p 0 ( n ) ; δ ( 0 ) = r tmp H ( 0 ) r tmp ( 0 )
For k = 1 , 2 , , N u :
1       d ( k ) = ( k > 1 ) ? r tmp ( k 1 ) + δ ( k 1 ) δ ( k 2 ) d ( k 1 ) : r tmp ( 0 ) 4 N 1
2       v = R ( n ) d ( k ) 16 N 2 0
3       α CG = δ ( k 1 ) / [ d H ( k ) v ] 8 N 1
4       Δ h tmp = Δ h tmp + α CG d ( k ) 4 N 0
5       r tmp ( k ) = r tmp ( k 1 ) α CG v 4 N 0
6       δ ( k ) = r tmp H ( k ) r tmp ( k ) 8 N 0
Table 4. Complex-valued CD algorithm.
Table 4. Complex-valued CD algorithm.
StepActions×/
Initialization :
Δ h tmp = 0 2 N × 1 ; r tmp ( 0 ) = p 0 ( n )
For k = 1 , 2 , , N u :
1       Get val , pos and real / imaginary status
       ( v ; p ) = max Re { r tmp ; i ( k 1 ) } , Im { r tmp ; i ( k 1 ) } ,00
       i = 1 2 N
       s = v a l is Re { r tmp ; p ( k 1 ) } ? 1 : j
2       α CD = v / R p , p 01
3       Δ h tmp ; p = Δ h tmp ; p + s α CD 00
4       r tmp ( k ) = r tmp ( k 1 ) s α CD R p 4 N 0
Table 5. Complex-valued DCD iterations with a leading element.
Table 5. Complex-valued DCD iterations with a leading element.
StepActions+
Initialization :
Δ h tmp = 0 2 N × 1 ; r tmp = p 0 ( n ) ; α DCD = H max , m = 0
For k = 1 , 2 , , N u :
1       Get val , pos and real / imaginary status
       ( v ; p ) = max Re { r tmp ; i } , Im { r tmp ; i } , i = 1 2 N 2 N 1
       s = v a l is Re { r tmp ; p } ? 1 : j
2       α DCD = α DCD / 2 ; m + +
3      If m > M b  → RETURN Δ h tmp and r tmp
4      If v ( α DCD / 2 ) R p , p jump Step 21
5       β k = sign { v } s α DCD
6       Δ h tmp ; p = Δ h tmp ; p + β k 1
7       r tmp = r tmp β k R p 4 N
RETURN Δ h tmp and r tmp
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Stanciu, C.-L.; Anghel, C.; Fîciu, I.-D.; Elisei-Iliescu, C.; Udrea, M.-R.; Stanciu, L. On the Regularization of Recursive Least-Squares Adaptive Algorithms Using Line Search Methods. Electronics 2024, 13, 1479. https://doi.org/10.3390/electronics13081479

AMA Style

Stanciu C-L, Anghel C, Fîciu I-D, Elisei-Iliescu C, Udrea M-R, Stanciu L. On the Regularization of Recursive Least-Squares Adaptive Algorithms Using Line Search Methods. Electronics. 2024; 13(8):1479. https://doi.org/10.3390/electronics13081479

Chicago/Turabian Style

Stanciu, Cristian-Lucian, Cristian Anghel, Ionuț-Dorinel Fîciu, Camelia Elisei-Iliescu, Mihnea-Radu Udrea, and Lucian Stanciu. 2024. "On the Regularization of Recursive Least-Squares Adaptive Algorithms Using Line Search Methods" Electronics 13, no. 8: 1479. https://doi.org/10.3390/electronics13081479

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop