1. Introduction
With the increasingly complex electromagnetic interference environment, the anti-interference capability of communication systems must meet higher and more stringent requirements to ensure the reliable transmission of information [
1]. While traditional frequency hopping communication has strong anti-interference and anti-interception capabilities, complex electromagnetic interference and intelligent interference have a significant impact on its information transmission performance. Therefore, an intelligent anti-interference frequency hopping communication system is needed to cope with complex and changing interference environments and ensure normal communication between the two parties. By dynamically changing parameters such as frequency hopping sequence, hopping speed, and signals bandwidth through learning and intelligent decision-making, the transmitter achieves intelligent anti-interference based on full awareness of the electromagnetic environment. An intelligent anti-interference decision algorithm based on a Deep Q-Network with priority experience replay based on Pareto sample (PPER-DQN) for bivariate frequency hopping mode has been proposed to intelligently generate high-performance frequency hopping patterns of the frequency hopping communication system [
2]. However, due to the intelligent frequency hopping decision of the transmitter, the receiver does not know the hopping pattern and cannot guarantee the proper reception of the message. Therefore, the study of intelligent reception of frequency-hopping communication holds significant theoretical significance and application value.
The current research on intelligent communication reception has primarily focused on each processing module of conventional communication reception systems by using deep learning techniques to improve the performance of the communication system. In terms of channel estimation, in [
3] neural networks were employed to learn the channel structure from a significant amount of training data in a large-scale Multiple-Input Multiple-Output (MIMO) system. In [
4], a channel estimator utilizing deep learning was initially trained offline via simulated data, after which it was dynamically adjusted online to enhance its generalization capabilities. In [
5] a deep learning-based channel estimation method was proposed under time-varying Rayleigh fading channels, which outperformed traditional methods in terms of mean square error (MSE) performance. Several studies have investigated the use of deep learning methods to address the challenges associated with channel estimation in orthogonal frequency division multiplexing (OFDM) systems [
6,
7,
8]. Regarding channel equalization, an equalizer based on Multi-Layer Perceptron (MLP) networks was introduced [
9], which demonstrates better error performance than traditional equalization methods. Concerning signal demodulation, convolutional neural networks were used [
10] to demodulate binary phase-shift keying signals with transmission rates faster than the Nyquist rate. Furthermore, deep confidence networks and stacked self-encoders were employed [
11] to implement signals demodulation for near-range multipath channels. The long short-term memory (LSTM) unit-aided intelligent DNN(Deep Neural Networks)-based deep learning (DL) demodulator for orthogonal frequency division multiplexing-aided differential chaos shift keying (OFDM-DCSK) systems was proposed [
12]. In terms of channel decoding, Recurrent Neural Networks (RNNs) were utilized as decoders for convolutional codes under correlated noise [
13]. It extracts sequence information of convolutional codes using a bidirectional Gate Recurrent Unit (GRU) network and calculates decoding results by a fully connected neural network. Although this approach outperforms traditional algorithms, it is not suitable for storing convolutional codes of long length. Another study [
14] proposed a minimum-sum decoding algorithm based on a Back Propagation (BP) network, which reduces the computational complexity of the BP algorithm and improves decoding speed.
In addition to optimizing individual processing modules of the receiver with deep learning, several studies utilized deep learning to optimize multiple modules simultaneously. In [
15], a fully connected network was employed to simultaneously perform equalization and decoding in multipath channels. However, its performance is worse than the minimum MSE method with known channel statistics. In [
16], DNN (Deep Neural Networks) network structure was utilized to implement channel equalization and symbol detection in OFDM (Orthogonal Frequency Division Multiplexing) receivers, and experimental results demonstrate its superiority over traditional methods. To achieve end-to-end optimization of wireless communication systems, [
17,
18] have investigated the use of deep learning to optimize the entire communication system, excluding RF (Radio Frequency) A/D and D/A. Additionally, an end-to-end wireless communication system using DNN was developed [
19] and a conditional generative adversarial network (GAN) was used to represent channel effects and to implement key functions such as encoding, decoding, modulation, and demodulation. In [
20] a deep receiver was proposed that optimally implements the entire information recovery process for conventional reception using deep neural networks, and the BER (Bit Error Rate) of the proposed deep receiver outperforms that of the conventional receiver in the presence of various factors, such as noise, power amplifier nonlinear distortion, frequency bias, multipath fading, co-channel interference, and dynamic environment.
All of the above studies are related to conventional communication reception systems. In the detection of frequency hopping signals, in [
21], a threshold selection method based on K-means clustering was proposed to achieve fast detection of frequency hopping signals under low signal-to-noise ratio conditions. In [
22,
23], the CNN networks are used to determine the presence of frequency hopping signals. However, they cannot estimate the specific hopping frequencies. In [
24], the Radial Basis Function (RBF) neural network is employed to facilitate the nonlinear prediction of Frequency Hopping (FH) sequences and to enable the discrimination of multiple FH sequence signals. In [
25,
26], the LSTM networks are employed to predict frequency hopping sequences, but most of these sequences are generated using
m-sequences, which are pseudo-random sequences with certain regularities. In contrast, the hopping sequences in this paper are intelligently generated by the transmitter based on different interference environments, and the deep reinforcement learning method utilized ensures that the hopping sequences are completely random. As a result, it is not possible to identify patterns from existing frequency hopping sequences for prediction. There is less research on intelligent reception of frequency hopping communication. In [
27], a convolutional neural network-gated recurrent unit (CNN-GRU) network was designed to estimate multi-time slot frequency hopping sequences, and the simulation results demonstrated the network’s good generalization ability and robustness. However, this method suffers from a long delay in estimating frequency hopping sequences. So we will study the estimation of the frequency point of frequency hopping communication in a one-time slot.
In summary, the main contributions of this paper are as follows.
We design a CVDP network that exhibits robust generalization performance for the estimation of hopping frequency during the one-time slot. This network reduces the delay and overcomes the difficulty of training due to few features and a small data set of frequency hopping signals in the one-time slot.
The CVDP network combines the inductive bias property of CNN networks with the global inductive modeling capability of VIT networks [
28] to solve the problem of poor convergence of VIT networks on small data sets and to improve the stability and robustness of VIT networks. A dual multi-head self-attention mechanism is proposed for extracting the time-frequency features of different low-dimensional subspaces to improve the network performance. Meanwhile, the idea of
self-attention [
29] is introduced to calculate only the self-attention of key points to reduce the computational complexity.
Extensive simulation experiments have been conducted to evaluate the performance of the CVDP network in various scenarios, including multi-tone interference, single-band interference, multi-band interference, swept frequency interference, mixed interference, and channel fading, encompassing both frequency-flat Rayleigh fading and frequency-selective Rayleigh fading. The simulation results indicate that the CVDP network exhibits a high level of generalization performance across all of these environments. Moreover, the intelligent receiver system, which incorporates the CVDP network, performs remarkably close to an ideal receiver when dealing with an unknown frequency hopping sequence.
3. Algorithm Simulation and Performance Analysis
3.1. Data Set Generation and Hyperparameter Setting
The set of frequency hopping frequencies is used for training the network, the frequency set is in (10 MHz, 20 MHz) with 25 KHz interval, the number of frequencies sets is 400, and the hopping frequency is selected randomly when generating samples, and the hopping speed is 2000 hop/s. The training data set contains two kinds of hopping signals’ time-frequency spectrograms. The one kind is the hopping signals under Gaussian white noise background with SNR (Signal-Noise Ratio) ranging from −10 dB to 10 dB at 2 dB intervals, generating a total of 13,200 sets of signals. The other kind is the frequency hopping signals under four different interferences, which are multi-tone interference, single-band interference, multi-band interference, and swept frequency interference with SIR (Signal-Interference Ratio) ranging from −10 dB to 10 dB at 2 dB intervals, and 1200 sets of hopping signals are generated in each SIR, resulting in a total of 52,800 sets of signals. A signal is processed by STFT with a window length of 1024 points and is corresponding to a hopping frequency. A training dataset containing 66,000 samples of time-frequency spectrograms is obtained.
The test data set contains the hopping signals under different interference and Rayleigh fading channels. The interference of the above four interfering signals superimposed is called mixed interference signals. SIR ranges from −16 dB to 2 dB at 2 dB intervals, and 500 sets of hopping signals under each of five kinds of interferences are generated in each SIR, resulting in a total of 25,000 sets of signals. Frequency hopping signals under frequency flat Rayleigh fading channels and frequency selective Rayleigh fading channels are generated with SNR ranging from −16 dB to 2 dB at 2 dB intervals, and the number of signal sets per SNR is 500. So the total sample size of time-frequency spectrograms of the test data set is 35,000.
MATLAB software was used to generate the training and test datasets. The experimental environment and configuration are shown in
Table 2.
To ensure the network training effect while speeding up the training speed, let the initial learning rate be 0.001 and the total of rounds be . After each round of learning, the learning rate decreases by . Adam is chosen for the optimizer and the training batch size is 64.
3.2. Estimation Performance of Hopping Frequency
The accuracy of the CVDP network’s hopping frequency estimation in the one-time slot determines its performance.
3.2.1. Experiment 1
Estimation of hopping frequency in the one-time slot by CVDP network under multiple interference environments. To analyze the role of each module, four comparison networks are built based on the network model in this paper (as shown in
Figure 2). CNN network only retains the CNN module of CVDP, and the global pooling layer and the fully connected layer are added later. CNN-VIT0 network is that the dual multi-head
self-attention mechanism of CVDP is replaced by the original self-attention mechanism. CNN-VIT1 network is that the dual multi-head
self-attention mechanism of CVDP is replaced by the dual multi-head self-attention mechanism. CNN-VIT2 network is that the dual multi-head
self-attention mechanism of CVDP is replaced by the single multi-head
self-attention mechanism. The above four networks, the CNN-GRU network [
25], and the CVDP network of this paper were trained under the same training set with the same hyperparameter settings. Under five interference environments including multi-tone interference, single-band interference, multi-band interference, swept frequency interference, and mixed interference, the generalization ability of the six networks for hopping frequency estimation in the one-time slot is tested respectively. To further investigate the optimal setting for parameter
h, the performance of CVDP networks with different values of
h was tested under mixed interference environments. To more audiovisual demonstrate the difference between the interfering signals and the frequency hopping signals,
Figure 3 shows the time-frequency spectrograms in 20-time slots under five interfering environments. The accuracy of hopping frequency estimation is shown in
Figure 4 when time-frequency spectrograms in the one-time slot are input to the network.
As seen in
Figure 4, the four improved VIT networks (CNN-VIT0, CNN-VIT1, CNN-VIT2, and CVDP) have better network performance than CNN networks and CNN-GRU networks, this is because after combining the inductive bias characteristics of CNN networks and the global inductive modeling capability of VIT networks, the VIT networks demonstrate approximately 15 to 20% improvement in convergence, stability, and robustness on small datasets. Moreover, the self-attention mechanism of the VIT network makes the network focus on the frequency hopping signals region and minimizes the influence of interfering signals. The CNN network and CNN-GRU network cause serious performance degradation because they do not accurately extract the differences between frequency hopping signals and interfering signals in terms of time-frequency characteristics. Among the four improved VIT networks, the CNN-VIT1 network and the CVDP network have better performance, this is because the CNN-VIT1 and CVDP, through a dual multi-head self-attention mechanism, can better extract the positional features of frequency hopping signals in the entire spectrum, capturing global dependencies. The CNN-GRU network is suitable for extracting joint features of multiple time slots, but they are not included in the one-time slot. So its performance deteriorates due to the inability to extract joint features. Therefore, CNN-VIT1 and CVDP networks are more suitable for solving the task of a one-time slot frequency hopping frequency estimation. In addition, the accuracy of the hopping frequency estimation of the CVDP network is only slightly lower than that of the CNN-VIT1 network, indicating that the
self-attention mechanism can achieve the purpose of reducing the computation while not affecting the network performance. Furthermore, it was found that the
h should not be too large or too small. Therefore, in the case of a total dimension of 256, the
h of one multi-head self-attention layer is set to 8 heads, while the
h of the other multi-head self-attention layer is set to 16 heads, as this combination achieves the best performance.
3.2.2. Experiment 2
Estimation performance of hopping frequency in the one-time slot by CVDP network under the influence of fading channels. The propagation of signals can be influenced by various factors such as terrain and obstacles, which may lead to multipath fading in the received signals. Additionally, the relative motion between the transmitter and receiver can cause a Doppler shift, as demonstrated in Equation (
13)
where
f is the frequency of the hopping signals,
v is the relative speed between the transmitter and receiver,
represents the angle between the direction of hopping signals motion and the incident direction, and
is the speed of light.
The network’s ability to generalize over both frequency flat and frequency selective Rayleigh fading channels was tested due to the potential for severe signal distortion caused by various factors. The maximum Doppler shift was set to 10 Hz, and for frequency-selective Rayleigh fading, three paths were used with path delays of 0 s, 1.5 µs, and 90 µs, respectively, and the average path gains were 0 dB, −3 dB, and −6 dB. The accuracy of hopping frequency estimation in the one-time slot was measured in
Figure 5, and the average running time required for training and testing the six networks was shown in
Figure 6 and
Figure 7 to comprehensively evaluate the network performance.
From
Figure 5, it can be seen that the CVDP and CNN-VIT1 networks exhibit superior accuracy compared to other networks in frequency-flat Rayleigh fading channels, indicating excellent generalization capabilities. However, the network’s performance generally degrades in frequency-selective Rayleigh fading channels due to inter-symbol interference and signal fading. Despite this, the CVDP and CNN-VIT1 networks still exhibit the highest accuracy, and the performance gain is more significant than that observed in flat fading channels. Moreover, the CVDP network maintains a similar level of accuracy as the CNN-VIT1 network but requires less computational amount, leading to the best overall performance under both fading channels.
The running speed of the CVDP network, as shown in
Figure 6 and
Figure 7, is faster than that of the CNN-VIT1 network which utilizes only a dual multi-head self-attention mechanism. Additionally, the running time of the CVDP network is near to that of the CNN-VIT0 network. These results indicate that the incorporation of
self-attention reduces computation complexity. CNN networks and CNN-GRU networks have simple structures and consequently exhibit the shortest running times.
In summary, the CVDP network in this paper has superior estimation accuracy and a shorter running time compared to other comparison networks.
3.3. Performance of Frequency Hopping Intelligent Reception
The CVDP network is applied to the frequency hopping reception system shown in
Figure 1 to realize the intelligent reception when the frequency hopping sequence is unknown. A 100,000-bit (0,1) sequence is sent randomly to test the reception performance of the frequency hopping system separately under BPSK (Binary Phase-Shift Keying) modulation and QPSK (Quadrature Phase Shift Keying) modulation. The experiment was conducted in seven environments, namely multi-tone interference environment, single-band interference environment, multi-band interference environment, swept frequency interference environment, mixed interference environment, frequency flat Rayleigh fading channel, and frequency selective Rayleigh fading channel. The BER curves of the CVDP-based frequency hopping intelligent receiver system and the conventional receiver system are shown in
Figure 8 and
Figure 9.
Conventional reception refers to an ideal reception scenario in which a receiver is aware of the frequency hopping sequence transmitted from the transmitter. It can be seen from
Figure 8 and
Figure 9 that the intelligent reception system performs well under both BPSK modulation and QPSK modulation. The intelligent reception system exhibits remarkable reception performance in five different interference environments, and its BER is only slightly different from that of the conventional reception system when the SIR is low. Despite severe signal distortion caused by multipath fading and Doppler shift in two fading channel environments, the CVDP network ensures accurate hopping frequency estimation and enables the intelligent receiver system to attain the performance of a conventional receiver system.
When the transmitter of a frequency hopping system makes intelligent decisions according to an electromagnetic interference environment, the hopping sequence is no longer fixed [
2]. Consequently, a conventional receiver is unable to predict the hopping sequence, and its BER may increase up to 50% or more, resulting in communication un normally. However, an intelligent reception system based on CVDP for hopping signals can achieve optimal reception performance when the hopping period and signal bandwidth are known, and it has better adaptability and interference immunity.