1. Introduction
With the rapid development of mobile intelligent terminals, wearable devices, and industrial monitoring in 5G applications, the amount of mobile data traffic is growing exponentially [
1]. To ensure the reliability and real-time performance of user services, higher transmission rates are required to achieve real-time data interaction. The research on sixth-generation (6G) mobile communication aims to establish a powerful network to cover ground, air, and sea communications and provide a solution superior to 5G that meets the demands of high throughput, large capacity, and low latency. Cognitive radio (CR) is an effective way to improve spectrum utilization [
2]. Spectrum sensing aims to achieve real-time monitoring of spectrum holes, which is a prerequisite in CR. In cognitive networks, the secondary users (Sus) can continuously monitor and opportunistically use the licensed idle frequency bands that are not being used by the primary user (PU) [
3]. For the Sus, once the PU signal is detected to re-access the currently authorized frequency, the SU must immediately give up using the frequency to avoid affecting the normal communication of the PU. The shortage of spectrum resources is widespread in various communication systems, and spectrum sensing has also been found to be applied in many fields, not only in cellular communication but also, for instance, in cognitive radar systems [
4,
5,
6,
7].
Generally speaking, different detecting techniques require different amounts of wireless environment information. According to the amount of prior information required, spectrum sensing methods can be divided into un-blind detection, semi-blind detection, and full-blind detection. The likelihood ratio test (LRT) based on the Neyman-Pearson criterion is currently known as the optimal detection method, which requests a known distribution of the received signal that is generally related to the PU information, the noise variance, and channel statistic characteristics. Therefore, the LRT is an un-blind detection method [
8]. Conversely, when the detection method relies only on the obtained signal samples without any prior information, it is referred to as a fully blind detection method. Cyclostationary detection utilized the periodic stationarity of communication signals to distinguish the PU signals from noise by analyzing the difference in cyclostationary spectra [
9]. As long detection times and high computational complexity are required for cyclostationary detection, it is difficult to apply them in practice. Similarly, the maximum minimum eigenvalue (MME) detection also demanded a significant amount of computation as it involved matrix calculation [
10,
11,
12].
Owing to the intricacy of the full-blind approach, several studies employed a semi-blind approach. The semi-blind detection method expects knowledge of certain parameter values such as noise variance, PU information, or channel occupancy. Energy detection (ED) is widely used in spectrum sensing due to its easy implementation. However, the detection accuracy is susceptible to noise uncertainty, as the noise variance should be known to calculate its threshold value [
13]. In addition, a generalized likelihood ratio test (GLRT) has been developed to employ maximum likelihood estimation to replace the SNR of the received signal in LRT [
14]. However, the detection performance of GLRT cannot be guaranteed when the channel environment changes rapidly. There were also studies to discuss the application of GLRT in multi-antenna scenarios, which involved more complex operations [
15,
16]. Two-step GLRT can make the calculation simple, Ref. [
17] utilizes a two-step GLRT method to achieve adaptive detection under the sub-Gaussian symmetric alpha-stable sea clutter background, considering the unknown parameters of the signal complex amplitude and covariance matrix. A semi-blind detection method for estimating the SNR using the expectation maximization (EM) algorithm was derived in [
18], which achieved stable estimation performance in the presence of channel variations. Nonetheless, it only considered unknown SNR and ignored scenarios where noise variance and channel occupancy rate could not be obtained.
Compared with full-blind detection, semi-blind detection has the advantage of being easy to implement and relies on the assumption of known noise variance. In practical scenarios, it is difficult for semi-blind detection methods to obtain real-time noise variance values, which makes semi-blind detection face challenges in practice where noise uncertainty exists. The semi-blind detection method with known noise variance can availably serve as a full-blind detection method in spectrum sensing when noise variance estimation is conducted. In noise variance estimation, some special channels for noise variance estimation can effectively avoid interference from the PU signal, but this estimation method sacrifices spectrum utilization [
19,
20]. Moreover, the noise variance was estimated by using the eigenvalues of the covariance matrix of the received signal samples, which leads to high computational complexity [
21]. In [
22], a noise uncertainty estimation method was proposed to improve the threshold set using the estimated noise uncertainty interval. However, due to its demand for a huge quantity of signal samples, a large detection delay is usually generated. In [
23], empirical mode decomposition was used for noise variance estimation, which transformed the traditional semi-blind ED and maximum eigenvalue detection (MED) into blind detectors. In the end, wavelet decomposition was used to separate the signal from the noise. A suboptimal cooperative sensing method based on wavelet de-noising was proposed in [
24]. Furthermore, the noise variance was estimated by subtracting the de-noised signal, which was reconstructed from the original received signal [
25,
26].
Due to the limitations of path loss and multipath fading, local spectrum sensing performance may be affected. When reusing the licensed spectrum, it is usually hoped to minimize the impact on the PU, which relies on highly reliable spectrum sensing techniques for global sensing of the wireless channel. Cooperative spectrum sensing (CSS) using spatial diversity among SUs is a good solution to achieve highly reliable spectrum sensing [
27]. In CSS, there are two types of fusion criteria based on the different forms of data that local SUs upload to the fusion center (FC), which are hard fusion criteria and soft fusion criteria [
28]. Hard fusion criteria represent the local decision results as 0 or 1 and upload them to the FC. The commonly used hard decision criteria include OR, AND, and K-out-of-M criteria [
29]. Classic soft fusion methods include equal gain combining (EGC), selecting combining (SC), and maximum ratio combining (MRC) [
30,
31,
32]. The weighting coefficients of MRC and SC are dependent on the SNR of the received signal for each secondary user, and the weighting coefficient of EGC is usually set to a constant value of 1. Soft fusion methods generally acquire abundant local detection information, which leads to superior performance compared to hard fusion criteria [
33].
In this paper, we investigate full-blind cooperative spectrum sensing based on the classical optimal LRT. In order to eliminate the coupling between the unknown parameter estimates, we independently discuss the noise variance estimation problem and combine it with the EM estimation algorithm to achieve fully-blind detection. The main contributions are: (1) in un-blind optimal likelihood ratio detection, many parameters need to be known in advance. Traditional full-blind detection approaches are hard to use in practice as they have high computational complexity. When the parameters are unknown, the detection problem becomes a composite hypothesis. In order to achieve full-blind spectrum sensing with multiple unknown parameters, the EM algorithm is used to estimate the noise variance, SNR of the received signal, and channel prior occupancy rate. (2) Besides, a simple noise variance estimation method based on the lifting wavelet transform is used in the EM algorithm to further shorten the detection time and improve the detection performance and fitness for actual scenarios with noise uncertainty. (3) Finally, the stream learning strategy based on historical information is used in estimating SNR and the channel prior occupancy rate, making the proposed method adaptable to the scenario where the SU has mobility.
The remainder of this paper is outlined as follows: The corresponding system model and the optimal detection problem are described in
Section 2.
Section 3 presents the formula for multi-parameter EM estimation and detection and proposes a Lifting Wavelet-assisted EM estimation method. The simulation results under different scenarios are presented in
Section 4. Finally, the work of this paper is summarized in
Section 5.
2. System Model
It is assumed that the cognitive radio network consists of one PU, one FC, and
J SUs, which is shown in
Figure 1; each SU with only one antenna is not affected by interference from other SUs. Meanwhile, the reporting channel between SUs and FCs is error-free [
34].
Table 1 presents the main parameters used in the analysis, along with a description of each parameter.
According to Neyman-Pearson criteria, the spectrum sensing problem can be modeled as a binary hypothesis test, and the received signal sample at the SU
j is defined as [
35]
where
represents the channel state,
and
represent the absence and presence of the PU signal, respectively. The channel gain from the PU to SU
j is denoted by
,
(
) represents the
m-th sampling point of the PU signal, which follows a Gaussian distribution with a zero mean.
is the zero-mean Gaussian white noise at the receiver of SU
j with variance
. It can be assumed without loss of generality that
and
are independent and uncorrelated. The received signal obeys the following distribution:
where
is the transmit power of the PU signal.
To better track the parameter changes in mobile scenarios, the local detection statistics at
j-th SU represented by
is computed by using small samples, and the corresponding distribution can be described as
where the random variable
represents a central chi-square distribution with
degrees of freedom, and
is the SNR of the received signal at SU
j.
From (3), the probability density function (pdf) of
and its logarithmic forms are:
The detection statistic, SNR, and noise variance for all SUs in the entire network are denoted as
,
,
, respectively. The logarithmic probability density function for all SUs is the sum of each SU probability density function and can be represented as
According to the likelihood ratio detection principle, the decision expression based on the log-likelihood ratio function for the whole work can be written as [
36]
where
is the threshold value of the detection. If the difference is larger than
, it indicates the channel is occupied by the PU. Otherwise, the channel is absent and can be used by the SUs.
By substituting (5) and (7) into (9), the decision expression can be simply described as
where,
is the decision threshold, which is related to the received SNR of
and the threshold of
. According to (10), the global statistic is a linear combination of the detection statistic of
, the SNR and the noise variance.
3. Blind Spectrum Sensing Based on the EM Estimation
According to (10), the decision can be made if the SNR and the noise variance for each SU are known. However, these parameters usually vary in a dynamic environment. In order to determine the state of the channel, it is necessary to estimate the SNR and the noise variance of each SU. The unknown parameter vector is defined as
, where
. According to the maximum likelihood estimation theory, when the exact distribution of
is known, the optimal estimation of unknown parameters could be obtained by applying the logarithmic maximum likelihood estimation, which can be represented as
Since the channel occupancy is unknown when SUs acquire the detection statistic of
, therefore
is a hidden variable during estimation. The expectation-maximization (EM) algorithm can be used to solve the maximum likelihood estimation problem with hidden variables [
37]. In the EM algorithm, two steps are used to obtain the estimation parameter values, which are called the E-step and M-step, respectively. The E-step aims to obtain the conditional expectation, whereas the conditional expectation maximum is conducted to produce a new set of parameter estimates for the next round of detection in the M-step.
In the
n-th detection, the
is defined as the posterior probability of the channel occupancy state.
represents the posterior probability of the channel being occupied based on the value of
for the
n-th detection statistic for the
j-th SU. According to Bayes’ rules,
can be expressed as
where
and
are channel prior occupancy rate.
which can be seen as the marginal distribution of
with respect to
.
The conditional expectation of (8) given the posterior probability of the channel occupancy state of
is denoted as
In the M-step, the corresponding estimation parameters of
,
, and
in
n-th round of detection are obtained by maximizing the
, where the posterior probability is calculated using the estimates got from the last detection:
From (15), the conditional expectation
involves optimizing multiple parameters. In order to reduce the complexity of the estimation process, when one parameter is estimated, the other parameters are assumed to have fixed values. The noise variance is estimated first by setting
, and the estimated value is
By substituting (18) into (15) and let the
:
Finally, we compute the
and
by setting
:
The detailed solution to the optimization problem is given in
Appendix A.
According to the above derivation, this multi-parameter estimating method using the EM principle is denoted as a fully Blind-EM estimation and detection method.
3.1. Noise Variance Estimation Based on Lifting Wavelets
Because the estimated values of noise variance and SNR are coupled with each other, the performance of multi-parameter estimation will be affected in fully Blind-EM estimation and detection. Consequently, we consider decoupling the two estimates. Since the noise variance in the received signal is mainly concentrated in the high-frequency part and the wavelet transform can realize the variable resolution decomposition of the signal, the noise variance value can be estimated by using the high-frequency coefficient decomposed by the wavelet transform.
The dynamic environment caused by SU mobility puts forward a higher demand for detection delay; a shorter detection delay will help the fusion center estimate and track the changes in various parameters promptly. The traditional wavelet transform relies on the Fourier transform and has complex operations, so it is not suitable for dynamic scenarios. The lifting wavelet can realize the specific wavelet function by designing different prediction operators and update operators, which has the advantages of fast decomposition speed and small memory consumption and is completely equivalent to the traditional Mallat algorithm [
38]. A low-complexity noise variance estimation method based on lifting wavelets can play a role in the EM blind estimation and detection method to enhance the performance and accelerate the convergence speed of the algorithm.
The decomposition process of the
p-th layer is shown in
Figure 2. The lifting wavelet mainly uses prediction and update to decompose the original signal.
and
represent the high-frequency sub-band and low-frequency sub-band coefficients of lifting wavelet decomposition, respectively. First, the signal
is divided into even sequence
and odd sequence
, where
represents the decomposition number of the layer, and then the repeated prediction and updating processes are conducted to complete the wavelet decomposition. Both
and
can be initialized separately using
and
, respectively. In terms of prediction,
can be predicted by using a prediction operator
and
, which can be expressed as
which can be seen as the lifting of the high-pass sub-band using the low-pass sub-band, and
k represents the number of prediction times.
Similarly,
can be updated by using an update operator
and
, which can be expressed as
where
reflects a coarse version of the original signal, which can be seen as a process of lifting the low-frequency sub-band through the high-frequency sub-band.
If the case of
p = 1 is considered, the db2 wavelet transform can be achieved by performing two predictions and one update, and the corresponding operator can be written as
where
and
represent the left and right shifts of the signal in the time domain, respectively.
The specific calculation process is as follows:
In the end, by multiplying the normalization factors by the final wavelet factorization coefficient, the following can be obtained:
The noise variance can be estimated using the high-frequency coefficients obtained from the current period’s decomposition [
25].
where
indicates the median value of the
.
The noise variance estimation conducted in the Blind-EM algorithm is replaced by the estimation through the lifting wavelet, which is called the lifting wavelet-assisted EM iteration joint estimation method (WEMJD), which performs estimation of other unknown parameters by setting
. The new decision expression can be denoted as
where
is the threshold value of WEMJD, which can be determined using a constant false alarm probability, is a detailed description, please refer to
Section 3.3.
3.2. The SNR and CPOR Estimation
Compared with batch learning, which needs to wait until all the observed data are obtained, online learning is better suited for scenarios where data arrives continuously [
39]. The parameters can be estimated and predicted over time using online streaming learning to enhance the tracking capability of parameter estimation by utilizing historical available information in dynamic environments. The related variables
and
for parameter estimation in (19) and (21) can be written in the following form:
where
is the learning rate and
and
are the predicted values of
and
, respectively.
By replacing (19) and (21) with (29), the final estimating expression of SNR and channel prior occupancy rate can be expressed, respectively, as
As the noise samples are independent of each other and do not have a correlation, it should be noted that historical learning is not suitable for predicting and tracking the irregular changes in the noise variance. Hence, only the sliding window is used to smooth the noise variance estimation results.
3.3. Initialization and Decision Thresholds
3.3.1. Initialization
Due to the sensitivity of the EM algorithm to the initial values, reliable initial values can help the EM algorithm converge to the global maximum and greatly reduce the iteration time of the algorithm. Since the PU activity status is independent of the noise variance and SNR, it is assumed that the probability of the channel being occupied and unoccupied is equal at the initial time, therefore
is set. For the initial noise variance estimation, we consider that FC has no prior knowledge of PU activity. The initial estimate of the noise variance can be acquired by (27). The initial value of the instantaneous SNR in the WEMJD method is consistent with the initial estimate value adopted in the GLRT algorithm, which can be obtained using maximum likelihood estimation and expressed as
3.3.2. Threshold Calculation
To evaluate the performance of a detection method, its ability to correctly detect the presence of a target signal can be determined by using the false alarm probability and detection probability.
The ability of the detection method to correctly detect the presence of a target signal can be described by using the false alarm probability and detection probability. The detection probability is the probability of correctly detecting the existence of the target signal. The false alarm probability refers to the probability that the SU falsely claims PU activity when the channel is not occupied by the PU signal.
Based on the Neyman-Pearson criterion, the threshold value should maximize the detection probability with constraint , which is known as constant false alarm rate (CFAR) detection. In CFAR detection, the threshold setting is related to the preset false alarm probability. The detection statistic in (10) is the positive weighted sum of chi-square random variables: , . The signal energy of here follows a non-central chi-square distribution, hence, .
In general, the exact distribution of the weighted sum is difficult to obtain, So we approximate
by the chi-square distribution with
d degrees of freedom forms like
[
40,
41,
42]. The first three cumulative moments of
and
is:
where
, and
Appendix B provides an explanation.
Let the first three cumulative moments of
and
be the same, it is obtained that
The approximate distribution of
is given by:
The value of
in (38) is generally not an integer. The distribution of
is actually a gamma distribution with a scale parameter of 2 and a shape parameter of
. Therefore, the chi-square distribution in (39) can be approximated by a gamma distribution and replaced as
Refer to
Appendix C, the global detection threshold can be expressed as follows
Figure 3 shows the algorithm execution process of WEMJD at the secondary users and the fusion center.
The algorithm complexity is evaluated under the case of J SUs, M sampling points, and N detection times. At FC, for WEMJD, Semi-blind EM and GLRT have the same computational complexity is , while the complexity of matrix inversion for eigenvalue detection is . In the case of a small sample size, M is the same size as N, while J is usually smaller than M and N, so the proposed method has a lower degree of complexity. At SUs, we assume that the length of the prediction operator is and the length of the update operator is , and the number of decomposition layers is p, the complexity of the noise estimation part is . Considering the case of , the total complexity of the proposed algorithm is .
4. Simulation Results and Analysis
In this section, the simulation results under different settings are shown. To make the results more realistic and reliable, we performed Monte Carlo experiments to obtain the final results by setting the threshold based on the CFAR, where it is assumed that all SUs have the same false alarm probability of .
In the network, due to the random distribution of SUs, differences in geographic location also bring about differences in received SNR. Therefore, we define the global average SNR as
where
represents the average SNR at SU
j.
In order to evaluate the estimation performance of the proposed algorithm, the mean squared error (MSE) is used as the evaluation metric. The MSE for the
n-th detection is defined as
Results and Discussions
To discuss the convergence of the method, it is first assumed that the secondary users are stationary and have the same nominal noise variance of
. The simulation parameters are
,
,
, and
.
Figure 4 shows the MSE for SNR estimations vary with different iteration times of
. It can be seen that the smaller MSE is obtained if the smaller
is chosen as the iteration time of
is increased. Clearly, a smaller
allows the historical data to be more thoroughly trained and it will slow down the convergence speed. In addition, since too small
will lead to over-learning of history data, it will not further enhance the detection performance by blindly reducing the value of
. Through considering the compromise between the convergence speed and the estimated performance,
is selected for subsequent simulation.
Figure 5 plots the relationship between detection probability and the number of iterations for different methods. The simulation parameters are set as follows
,
,
, and
. As the SemiBlind-EM method requires fewer joint parameter estimations compared to Blind-EM, it indicates faster method convergence. Compared with the Blind-EM which estimates all parameters by using the EM algorithm, the WEMJD algorithm exhibits a faster convergence speed as it utilizes the lifting wavelet to obtain noise variance information. Due to the limited historical information for learning in the early stages of detection, the performance of SemiBlind-EM, Blind-EM, and WEMJD methods is poor. However, as historical data accumulates, the method’s performance gradually stabilizes. It can be observed that both SemiBlind-EM and the proposed WEMJD outperform the semi-blind GLRT method, which does not incorporate learning strategies. Furthermore, the fully blind WEMJD achieves performance comparable to SemiBlind-EM while maintaining an adjacent convergence speed. Considering the convergence speed and detection performance of the method, WEMJD exhibits significant advantages, which we will mainly discuss its performance below.
The setting of the false alarm probability is related to the threshold value. From
Figure 5, a lower probability of false alarms usually results in a higher threshold, which leads to the detection probability reducing with the decrease in false alarm probability. In
Appendix B, during the threshold calculation, we employ the estimated value instead of the true value, which also causes the calculation of the threshold to be affected by the estimation error. So here, the threshold is related not only to the false alarm probability but also to the estimation algorithm used. The detection probability analysis reveals that the likelihood ratio test (LRT) exhibits the lowest sensitivity to false alarm probability, meaning its detection performance is less affected by changes in false alarm probability. On the other hand, the detection probability of Blind-EM, which involves the estimation of more parameters, experiences the most significant reduction as the false alarm probability increases.
By taking account of the scenario of changing channel conditions, it is assumed that the SUs exhibit mobility, which means that the received SNR at the SUs changes over time. In the presence of
and
, the estimated SNR for each user using the WEMJD method is shown in
Figure 6. It can be seen that the estimated value does not conform to the change law of instantaneous SNR at the beginning. When the required number of iterations is reached, the WEMJD can effectively track and estimate the instantaneous SNR of each user’s received signal.
To observe the sustained performance of the method, the simulation parameters are set as
,
,
, and
. As shown in
Figure 7, the detection probability increases as the SNR increases. It can also be observed that the detection performance continuously improves as the sample size of
increases with the same SNR. Moreover, the difference in detection probability between the WEMJD and SemiBlind-EM gradually becomes larger as the sample size of
decreases. It can be interpreted that the estimation performance of the proposed WEMJD method will be degraded as the noise variance estimation becomes poorer as the number of sample points used becomes smaller.
Figure 8 shows the relationship between detection probability and global average SNR with different numbers of users. Obviously, as the number of users involved in cooperation increases, the detection performance improves. Moreover, with a fixed SNR, the performance improvement is more pronounced as the number of users increases from 3 to 5 compared to the increase from 5 to 7 users, which indicates that simply increasing the number of SUs in the network cannot continuously enhance the detection performance. Furthermore, the detection performance for the proposed method approximately approaches that of the optimal LRT method as more secondary users participate in spectrum sensing.
To observe the performance of different estimation and detection methods with noise uncertainty. It is assumed that the noise variance follows a uniform distribution within a certain range, where is the fluctuation ratio, and the noise uncertainty is defined as . In practice, it is challenging to obtain real-time accurate values of noise variance for semi-blind detection methods, while it is relatively easy to obtain the nominal power and fluctuation range of the noise variance. Therefore, the weighting coefficients are calculated by using the maximum noise variance fluctuation value for the semi-blind method to minimize the impact on PU.
Figure 9 illustrates the interrelationship among detection probability, false alarm probability, and SNR in the presence of noise uncertainty. The simulation parameters are set as
,
,
,
. From
Figure 9a,c, it can be seen that the detection probabilities of the SemiBlind-EM and GLRT decrease with the increase in noise uncertainty. Compared to the two semi-blind methods, the proposed WEMJD has a more stable detection probability under noise uncertainties as it has real-time noise variance estimation capabilities. For the ideal detection LRT, the real-time variation of the known noise variance is considered, so the performance of the LRT is not affected by the noise uncertainty.
It may be noted that the false alarm probability of the proposed method fluctuates around the set value as the global average SNR is varied, which is shown in
Figure 9b. When the SNR is greater than −0.5 dB, the actual false alarm probability of WEMJD is less than the set value of 0.01, conversely, when
, the actual false alarm probability exceeds 0.01. The observed phenomenon can be attributed to the fact that at high SNR, the noise variance estimates increase, causing the SNR estimate in (31) to descend. Consequently, the weighting coefficient in (37) of each user decreases, which will lead to a lower false alarm probability compared to the predetermined threshold value. On the contrary, the
will become higher at a smaller SNR. For safety considerations, we adopted
to obtain a lower decision threshold. Therefore, in the presence of noise uncertainty, the actual false alarm probability of the SemiBlind-EM and GLRT is less than 0.01 in
Figure 9d. Moreover, by comparing
Figure 9b,d, it can be seen that the false alarm probability of WEMJD has increased as noise uncertainty exists. Comparing
Figure 9a,c, this increase in false alarm probability can be seen as sacrificing a certain level of false alarm probability to maintain the detection probability.