1. Introduction
The underwater acoustic (UWA) channel is a dual-selective fading channel and has time-varying and space-varying characteristics, which brings difficulties to underwater information interaction based on the acoustic wave [
1]. Direct-sequence spread spectrum (DSSS) communication technology has low spectrum density and strong resistance to multipath fading and the Doppler effect. Thus, it is widely used in UWA communication scenarios [
2,
3]. Compared with the conventional DSSS system, the cyclic shift keying spread spectrum (CSK-SS) modulation can provide a higher data rate. He et al. proposed a passive time reversal with CSK-SS using hyperbolic frequency-modulated (HFM) waveform for a reliable point-to-point UWA communication system [
4]. Jing et al. proposed a novel interleave-division multiple access system based on CSK-SS modulation for multiuser UWA communications [
5]. The communication rate of the above two systems is higher than that of the conventional DSSS communication system.
Recently, deep learning (DL) has received more and more attention as it can transform the original data features through multi-step feature conversion to obtain a higher and more slightly abstract feature representation, and further input to the prediction function to obtain the final result [
6,
7]. Due to the neural network models being able to easily solve credit assignment problems [
8], the neural network model has become the main model used in DL. Many neural network models such as long- and short-term memory (LSTM) have been widely used in natural language processing [
9,
10,
11]. In addition, DL is also good at discovering intricate structures in high-dimensional data, so it has gradually become widely used in acoustic signal processing and other fields [
12,
13,
14,
15].
At present, DL has been successfully applied in the field of communication, such as in orthogonal frequency division multiplexing (OFDM) communication systems. Ye et al. proposed a channel estimation and symbol detection method using a deep neural network in the OFDM communication system [
16]. They pointed out that in OFDM wireless communication with complex channel distortion and interference, the DL-based method can solve the problem of channel distortion and detect transmission symbols. Gao et al. proposed a model-driven DL method, which combines DL with expert knowledge to replace the existing wireless OFDM communication receiver [
17]. They explained that a bidirectional LSTM (BiLSTM) recurrent neural network could utilize the internal relationship of intersymbol interference (ISI) between sequence data. Zhang et al. proposed a DL-based OFDM UWA communication system, which replaces the receiving module in the conventional OFDM UWA communication system with the deep neural network (DNN) [
18]. They pointed out that after the neural network has been sufficiently trained, the transmitted symbols can be recovered directly through the neural network. There is no need to use explicit channel estimation and equalization as in conventional UWA communication. However, there are few applications of DL in spread spectrum UWA communication. Qasem et al. proposed an autonomous underwater vehicles communication scheme based on DL coding index modulation spread spectrum (CIM-SS) [
19]. In this system, the DNN model is used as the de-mapper to demodulate the baseband signal, avoids the deterioration in the performance of CIM-SS over long tap delay UWA channel, and improves the system data rate as well as the energy efficiency and the bit error rate (BER) performance.
For the spread spectrum communication system, the spreading gain will severely affect the communication rate of the DSSS UWA communication system. However, the CSK-SS UWA communication system uses a circular cyclic shift of the spreading sequence to carry the information, which breaks the limitation of spreading gain on data rate [
4]. In the process of demodulation, the conventional CSK-SS system performs correlation processing on the received baseband signal and the local cyclic shifted spreading sequence during the demodulation process and selects the maximum correlation value for decision. The position of the peak value is the information modulated on the code phase to realize the recovery of the source information. However, it needs to be pointed out that the improvement of data rate comes at the expense of destroying the partial autocorrelation function (PACF) characteristics of the sequence in the DSSS system [
20]. Due to the influence of the noise and UWA multipath fading channel, the size of the correlation peak in the demodulation process of the CSK-SS UWA communication system changes, resulting in inaccurate decision results, making demodulation results wrong and affecting the performance of the CSK-SS system.
Inspired by the application of DL-based methods in the field of acoustic signal processing [
16,
17], in this paper, a DL-based CSK-SS UWA communication system is proposed, which innovatively applies DL to complete the demodulation of CSK-SS UWA communication signals and obtain several times higher communication rates than DSSS. More importantly, it avoids the performance limitations of conventional CSK-SS systems due to UWA multipath channels and noise environments. Furthermore, compared with the conventional system, this system can directly demodulate the received signal at the receiving end without completing the de-carrier and despreading operation of the received signal, which simplifies the processing flow of the received signal to a certain extent. By sufficiently training the LSTM architecture-based neural network model in the offline stage, a large number of random data samples modulated by CSK-SS and a large number of channel impulse responses (CIRs) generated under the ray acoustics model are used as the training datasets of the network, which gives the trained model the ability to remember and analyze CSK-SS time domain signals affected by noise and multipath fading. This paper is aimed at the application scenario of signal transmission and command control between fixed nodes of the underwater communication network [
21]. It is based on the actual measured sound speed profile (SSP), adjusts the horizontal distance and vertical depth of transceiver position for a specific sea area, and uses BELLHOP [
22] to generate multiple groups CIRs through spatial position change. It provides a prerequisite guarantee for the sufficient training of LSTM architecture-based neural network models. Therefore, when the trained model is deployed online in a specific sea area practical application scenario, by inputting the time domain waveform of the unknown random CSK-SS signal affected by multipath fading into the neural network model, the LSTM architecture-based neural network completes the information processing through the internal LSTM cell according to the input received signal time sequence data. Lastly, the classifier completes the output of its category to realize the demodulation of the signal.
The main contribution of this paper is to propose a new DL-based CSK-SS UWA communication system for the application scenario of underwater fixed node acoustic communication. Taking the LSTM architecture-based neural network model as the receiving module of the system, the CSK-SS communication system can overcome the influence of low signal-to-noise ratio (SNR) and complex shallow water acoustic channels. Meanwhile, the system uses a shorter spreading sequence and allowing each spreading sequence to carry multiple bits. While increasing the communication rate of the system, it avoids the degradation of the CSK-SS system performance under the influence of complex multipath fading. In addition, the robustness of the DL-based system is evaluated to analyze the impact on the system performance when the change of marine environment causes the sample mismatch in the training and test stages. Moreover, a water tank experiment was carried out, and some suggestions for future experiments are provided according to the analysis of experimental data.
The rest of this paper is organized as follows. In
Section 2, the structure of the conventional CSK-SS UWA communication system, the structure of LSTM cell in LSTM neural network, and the structure of DL-based CSK-SS UWA communication system are introduced. In
Section 3, a detailed description of the environment configuration and parameter settings in the simulation and the simulation results are given. In
Section 4, we provide a water tank experiment and data analysis, and share some suggestions for future sea trials.
Section 5 summarizes this paper and gives prospects for the future.
3. Analysis of Simulation Results
3.1. Environment Configuration and Parameter Settings
In the simulation, two neural network models are discussed. Similarly, both models comprise an input layer, hidden layer, fully connected layer, softmax layer, and output layer. Different from each other, the hidden layer is LSTM and BiLSTM, which respectively complete unidirectional propagation and bidirectional propagation of information according to the input time series. The size of the input layer of the two neural network models is 744, the number of hidden units in the LSTM cells is 30, and the output size of the fully connected layer and the number of neural units in the output layer are determined by the number of types of tags. The training dataset is generated by using the m-sequence with a spreading gain of 31 as a spreading sequence and generated under SNR = 5 dB. The training dataset and the test dataset are divided according to a ratio of 3:1.
In the offline training stage, in order to sufficiently train the neural network models and make them have the ability to remember and analyze complex UWA channels, based on the SSP actually collected in a specific sea area, the spatial positions of the transmitting transducer and the receiving hydrophone are continuously adjusted according to a specific step. In addition, the BELLHOP model is used to obtain multiple groups of CIRs according to the combination of different positions of the transmitting transducer and the receiving hydrophone. In addition, in the online test stage, the spatial positions of the transmitting transducer and the receiving hydrophone will be further adjusted to obtain a variety of position combinations to generate multiple groups of CIRs used in the test stage.
Figure 6 shows an environmental configuration for generating CIRs.
Figure 7 shows the SSP obtained through the actual collection in the Yellow Sea of China in May 2020.
The depth of seawater is 26.5 m. In
Figure 6, the orange square represents the position of the transmitting transducer in the offline training stage. At this time, the depth range of the transmitting transducer is 1–26 m and the depth step between transmitting transducers is set to 5 m. The pink triangle represents the position of the receiving hydrophone, the depth range is 1–26 m, and the depth step between them is set to 5 m. The horizontal distance between the transmitting transducer and the receiving hydrophone varies in the range of 4–5 km and the distance step is set to 200 m. The green circle represents the position of the transmitting transducer in the online test stage. Their positions are randomly arranged within the depth range of 1–26 m and the horizontal distance from the receiving hydrophone range of 4–5 km.
The parameter settings of the UWA channel simulation are given in
Table 1.
In the simulation, the ocean bottom parameters [
32] are set. In this paper, the flat seabed with very fine sand is considered, in which the density ratio is 1.268, the sound velocity ratio is 1.0568, and the attention coefficient is 0.01875
.
Figure 8 shows the shallow water CIR obtained through SSP through BELLHOP software.
Figure 9a–c shows CIRs obtained by controlling the change of the position of the transmitting transducer and the change of the horizontal distance between the transmitting transducer and the receiving hydrophone. The shallow water environment generally has the characteristics of high temporal and spatial variability. The propagation of acoustics in shallow water is mainly the repeated interaction with the sea surface and seabed.
Figure 10a–f shows the difference in acoustic transmission loss in shallow water when the transmitting transducer is located at different depths.
3.2. Performance Analysis of the System without Preprocessing in Shallow Water Channels
In the test stage, the input size of the information sequence in each time step of the neural network model is determined by the symbol length of the signal. By comparing the output of the neural network model with the source information, the BER curve is obtained and the performance of the DL-based CSK-SS UWA communication system is evaluated.
In CSK-SS modulation, the number of bits carried by each symbol determines the symbol rate of the system. When each symbol carries 4 bit, 3 bit, and 2 bit information, the symbol rates are 275.86 bps, 206.89 bps, and 137.9 bps, respectively. The simulation results of the DL-based CSK-SS communication system in shallow water acoustic channels are shown in
Figure 11. Due to the random selection of several UWA channels which are not in the range of training samples in the test stage (like the green circle in
Figure 6), the BER curves of the conventional system and the DL-based system represent the mean BER of various channels under different SNR.
Without channel equalization preprocessing, the performance of DL-based systems on two different neural network models is better than that of the conventional system in the SNR range of −14 dB to 0 dB. Meanwhile, the anti-noise ability of the LSTM neural network model is improved by 4.5 dB, 8 dB, and 9 dB when the symbol rate is 275.86 bps, 206.89 bps, and 137.9 bps, and the magnitude of the BER is 10−2. Compared with the conventional system, the anti-noise ability of the BiLSTM neural network model is improved by 7 dB, 10 dB, and 10 dB. It can be seen that the BiLSTM neural network model has better performance than the LSTM neural network model.
From one perspective, the BiLSTM network increases the vertical depth of the network compared with the unidirectional LSTM network by adding a network layer that transmits information in reverse time to enhance the capability of the network. From another perspective, under the condition of a complex shallow water channel with a serious multipath effect, the BiLSTM network can use the internal relationship of ISI in sequential data to reduce the impact of ISI on performance.
3.3. Performance Analysis of the System after Preprocessing in Shallow Water Channels
In the preprocessing stage, firstly, the CIR is reconstructed by the OMP algorithm, then the channel equalization is realized by VTRM technology, and finally, the processed signal data are input into the neural network model to complete the signal demodulation. Through simulation analysis, it was found that the performance of the preprocessed DL-based system is better than that of the preprocessed conventional system.
The simulation results of the preprocessed DL-based CSK-SS communication system in shallow water channels are shown in
Figure 12. When each symbol carries 4 bit, 3 bit, and 2 bit information, and the magnitude of the BER is 10
−3, the SNR of the LSTM neural network model is about 9.5 dB, 10 dB, and 10.5 dB lower than that of the conventional system, respectively. In addition, when each symbol carries 4 bit and 3 bit information, the SNR required by the BiLSTM neural network model to achieve the same system performance is reduced by about 1 dB and 0.5 dB, respectively, compared with the LSTM neural network model.
When each symbol carries 2 bit information, the performance of the two models is similar, but the BiLSTM neural network model still has a slight advantage at −8 dB. The whys and wherefores are that after preprocessing, the performance of the DL-based system is better than that of the conventional system and the BiLSTM neural network model has more advantages in performance.
As shown in
Figure 13, compared with the DL-based CSK-SS UWA communication system without VTRM technology, for the BiLSTM neural network model, the required SNR is reduced by about 7 dB, 3.5 dB, and 2 dB, respectively. VTRM technology suppresses the ISI caused by multipath expansion of the UWA channel so that the multipath signal energy is superimposed to obtain the focusing gain. In addition, VTRM can make the signal components coherently superimposed and the noise components incoherently superimposed to increase the SNR of the signal. Therefore, after the preprocessing of the communication signal is completed by using this technology, the performance of the system will be further improved when demodulated by the neural network model.
3.4. System Robustness Analysis in Specific Application Scenarios
For the specific application scenarios of fixed UWA communication nodes, the robustness of the system needs to be analyzed. In the previous section, the test channel samples were taken outside the range of the training samples. This is because the underwater nodes will be affected by ocean currents and tides, which will lead to the change of node position. In order to analyze the impact of the sample mismatch between the offline training stage and the online test stage on the system performance, this section will first select the training channel sample space and use it as the test channel to obtain the system performance when the two-stage samples match each other. Secondly, by changing the transmitting transducer depth (
), receiving hydrophone depth (
), and the distance (
), the CIR outside the training sample space is obtained to analyze the system performance when the two-stage samples mismatch. Only the robustness of the BiLSTM model is analyzed in this section. The simulation results are shown in
Figure 14. By comparing the BER curves, it can be seen that based on the simulation results in this application scenario, due to the change of underwater node position, the UWA channel sample mismatch between the training stage and the test stage does not have significant damage to the performance of the DL-based CSK-SS UWA communication system.
4. Analysis of Experimental Results
4.1. Experimental Scene Construction and Parameter Setting
In this section, to verify the performance of the DL-based CSK-SS communication system, the demodulation of the actual signal is tested in the water tank experiment. The experiment also consists of two stages: offline training and online test. The size of the water tank is 45 m (length) ∗ 6 m (width) ∗ 5 m (depth). There are anechoic tiles installed on the walls on both sides of the water tank, and the bottom of the water tank is covered with smooth tiles, which will make the sound waves reach the receiving end of the system after multiple reflections from the bottom and the water surface, thereby simulating the propagation process of the signal in the shallow water channel.
Figure 15 shows the scene construction of the experiment, in which A represents the position of the transmitting transducer, B1, B2, B3, and B4 represent the position of the receiving hydrophone in the offline training stage, and C represents the position of the receiving hydrophone in the online test stage. In the experimental equipment, the power amplifier (PA) is B&K2713 [
33], the operating frequency band of the transmitting transducer is 8–16 kHz, the attenuator is BEHRINGER DI-100 [
34], and the receiving hydrophone is ST300HF [
35].
The transmitted signal used in the offline training stage is composed of several data packets. The signal in each data packet is composed of an HFM signal and communication signal, in which the communication signal is the known source information modulated by CSK-SS. After being transmitted by the transducer, the hydrophone receives the signal after passing through the water tank acoustic channel. Finally, the received signal is processed as the training dataset of the neural network model to complete the training of the neural network model.
In the online test stage, the hydrophone receives the transmitted signal and is directly input into the trained neural network model after resampling to complete the signal demodulation.
The analysis of the simulation results shows that in the DL-based CSK-SS UWA communication system, the BiLSTM network has better performance than the LSTM network. Therefore, the neural network model used in the water tank experiment is BiLSTM and the parameter settings in the neural network are consistent with those in the simulation.
The primary parameter setting and data structure of the transmitted signal are shown in
Table 2 and
Table 3.
4.2. Analysis of Experimental Results
In the water tank experiment, the performance of the conventional system and the DL-based system are compared. Moreover, in CSK-SS modulation, three schemes with communication rates of 180.45 bps (each symbol carries 4-bit), 146.34 bps (each symbol carries 3-bit), and 106.19 bps (each symbol carries 2-bit) are analyzed. In addition, in experimental data processing, noise interference is added artificially to obtain the condition of a low SNR.
In the experiment, the training data samples of the neural network are received by hydrophones at positions B1, B2, B3, and B4, respectively. The data used to test the performance of the DL-based system are received by the hydrophone at position C. Due to the different locations of the hydrophone, the UWA channel structures in the training and test stage are also different. Due to the change of the position of the receiving hydrophone, the different CIRs is shown in
Figure 16.
The performance of the conventional and the DL-based systems without preprocessing are compared experimentally. The scatter diagram of BER under three CSK-SS modulation schemes under different SNRs is given in
Figure 17. The BER curve is given by calculating the mean of scatter points under each SNR. A more intuitive representation is given in
Figure 18. Through the analysis of the data, it can be found that under the three CSK-SS modulation schemes, the performance of the DL-based system is better than that of the conventional system, and the anti-noise performance is improved by 1 dB to 3 dB.
The experiment also compares the performance of the conventional and the DL-based systems after preprocessing.
Figure 19 shows the scatter diagram of BER under three CSK-SS modulation schemes under different SNRs. Similarly, the BER curve is given through
Figure 20. The data analysis shows that the preprocessing improves the performance of the DL-based CSK-SS UWA communication system. Under the three CSK-SS modulation schemes, compared with the preprocessed conventional system, the preprocessed DL-based system improves the anti-noise ability by 9 dB to 10 dB. When the SNR is greater than −8 dB, each symbol in CSK-SS modulation carries 4 bits, 3 bits, and 2 bits, the BER of the DL-based system is less than 2.5 × 10
−3, 1.7 × 10
−3, and 0.8 × 10
−3, respectively. Under the condition of SNR = −14 dB, when each symbol in CSK-SS modulation carries 4 bit, 3 bit, and 2 bit information, the BER is 8.6 × 10
−2, 7.6 × 10
−2, and 4.5 × 10
−2, respectively.
Table 4 and
Table 5 respectively shows the BER of the conventional and DL-based systems with or without preprocessing operations under different SNRs.
The experimental results show that the performance of the DL-based CSK-SS UWA communication system is better than that of the conventional CSK-SS UWA communication system. In addition, for the DL-based system, when the UWA channel structure in the offline training stage is inconsistent with that in the online test stage, the BiLSTM network still has a certain generalization ability. It is worth noting that under the condition of a low SNR, the channel equalization preprocessing method can significantly improve the performance of the DL-based system. Therefore, the DL-based CSK-SS UWA communication system can realize reliable signal transmission in complex shallow water acoustic channels under the condition of a low SNR. The memory and analysis of the received signal are completed through the BiLSTM network model, which solves the problems in the signal demodulation process of the conventional CSK-SS UWA communication system affected by multipath fading and noise.
4.3. Suggestions for Future Experiments
By processing and analyzing the actual data in the water tank experiment, we have thought about the problems that may be encountered in future experiments. In order to deal with these problems, we will provide suggestions and references for all scholars to pay attention to the process of experimental verification in the future.
- −
Influence of experimental equipment
It is necessary to consider the negative impact of the PA on the transmitted signal in the experiment. When the PA amplifies the signal, it usually introduces the corresponding nonlinear distortion [
36]. The additional high-order components obtained after the PA will cause certain distortion to the amplitude and phase of the signal. This influence will produce phase modulation components and cause clutter interference. Digital pre-distortion technology [
37] can be used to compensate for the nonlinear distortion caused by the PA by distorting the signal before passing through the PA in the digital domain to reduce the impact on the performance of the neural network. In addition, the frequency response of the preamplifier, filter, and transducer will also affect the signal waveform. It is necessary to adopt more strict standards to screen the instruments and equipment that may be used in the test process.
- −
Variation of UWA channel and coping strategies
The environment of the water tank is relatively stable and the channel conditions will not change much. However, the marine environment is more complex and changeable, and the channel will change due to marine dynamic factors such as sea surface floating. It is necessary to “saturation train” the network model. For the specific application scenario of underwater fixed communication nodes, it can be considered to wake up the underwater nodes regularly in non-working hours, send training signals to the receiver to obtain more diverse UWA channel information, and periodically strengthen the neural network model to improve the performance of the system.
5. Conclusions and Prospect
In this paper, the neural network model is used as the receiver structure of the DL-based CSK-SS UWA communication system to demodulate the signal. The neural network model is trained based on the training samples with distortion caused by the influence of UWA channels and noise in the offline stage.
The simulation results show that the DL-based CSK-SS UWA communication system has better reliability than the conventional system in the complex shallow water acoustic channels with low SNR. The neural network model-based receiver module will reduce the impact of the multipath effect on the performance of the conventional CSK-SS system in fading channel. Furthermore, the neural network model has good generalization ability. When the online deployment conditions do not precisely agree with the offline training conditions, the neural network model can still work effectively to a certain extent. In other words, the model can analyze and memorize the complex characteristics of the UWA channel. In addition, the focusing gain brought by VTRM will introduce new beneficial features to the neural network, which will bring more reliable performance to this system.
The performance of the DL-based CSK-SS UWA communication system is verified by a water tank experiment. The experimental results show that the performance of the DL-based system is improved compared with the conventional system. Especially for the DL-based system after channel equalization preprocessing, when SNR is greater than −8 dB, the BER is less than 2.5 × 10−3. It should be pointed out that the BER of the communication system can be further reduced with appropriate coding under the condition of low BER. In addition, some suggestions are put forward for the problems that scholars may encounter in future experiments.
In the future, we can also consider using convolutional neural network, combined with transfer learning, few-shot learning, and other technologies to expand the application scope of the system further and reduce the cost and complexity of network training. Moreover, further exploration of the application of the system in practical engineering will be considered.