1. Introduction
In recent years, underwater acoustic (UWA) communication has received much attention, with numerous applications emerging in environmental monitoring, ocean exploration, and military missions [
1,
2,
3]. However, the UWA channel fluctuates and causes a time-varying multipath, which may result in intersymbol interference (ISI) and Doppler shifts and spreads [
4,
5].
Recently, several commercial UWA modems have already been introduced. In the past, the Teledyne Benthos ATM-886 model had a baud rate of 360 bps, and it has 1/2 rate convolution coding, a multipath guard period, multiple frequency shift keying (MFSK), and phase shift keying (PSK) modulation schemes [
6]. LinkQuest’s SoundLink UWA modem uses broadband acoustic spread spectrum technology. The data rate achieved is up to 38,400 baud [
7]. AquaSeNT provides a UWA communication modem using orthogonal frequency division multiplexing (OFDM) technology. These modems can operate in two modes: command mode and data mode [
8]. DSPComm Aquacomm Gen2 modems can now operate below the noise floor at a signal-to-noise ratio (SNR) of −10 dB to −15 dB with +/−10 knot Doppler compensation in highly reflective and noisy environments. It is available in 100 bps to 1000 bps by the direct sequence spread spectrum (DSSS)/OFDM. The possible distance was tested to 8 km [
9]. The Micron modem working frequency band is 20~28 kHz for the chirp spread spectrum (CSS). The data rate is 40 bps, and the micron data modem provides multipath noise rejection and compact size, with low error rates and 500 m and 150 m horizontal and vertical ranges [
10]. The EvoLogics underwater communication modem is based on sweep spread technology, which provides full duplex communication. The modem has data rates up to 13.9 kbps over a 3500 m range with a frequency of 18~34 kHz [
11].
The spread spectrum technique is robust to underwater wireless channel distortions; for this reason, it has been widely used in UWA communications [
12]. For example, there is the frequency hopping spread spectrum (FHSS), the DSSS, and the CSS method. The CSS compensates for the drawbacks of other spread spectrum techniques and uses a wide bandwidth against various frequency selective fading; thus, it is widely used. In general, the conventional CSS method consists of linear frequency modulation (LFM) waveforms. Low sidelobe levels can be achieved without SNR loss by employing a nonlinear frequency modulation (NLFM) waveform [
13].
There are many types of NLFM, such as sinusoidal frequency modulation (SFM), in which the instantaneous frequency (IF) function has a sinusoidal form [
14]. However, it contains many high sidelobes in the ACF due to the periodicity of the IF. For this reason, generalized sinusoidal frequency modulation (GSFM) was suggested to eliminate the drawbacks of the SFM [
15,
16]. The GSFM waveform is a generalized form of the SFM waveform and uses a variable exponent parameter. Unlike the SFM waveform, the broadband ambiguity function (AF) of this variant GSFM waveform has a distinct mainlobe centered at the origin without a peak sidelobe, and the AF is very similar to a thumbtack. Designing GSFM waveforms with different parameter values (determined by the user) can produce a family of waveforms that occupy the same band of frequencies and are nearly orthogonal to each other [
15]. For this reason, the GSFM waveform was recently researched in active sonar by Hague and Buck [
16]. The integrated waveform based on the GSFM waveform was researched for continuous active sonar detection and communication, which was called GSFM-com [
17]. That method was modulated by multiplying the baseband binary pulse amplitude modulation signal with the original GSFM signal. Therefore, it must consider phase synchronization, and the configuration of the receiver becomes complicated. In [
18], a UWA communication using two types of GSFM signals was proposed.
In this paper, we propose a method with a GSFM waveform in UWA communication using four different types of waveforms: forward type, reverse-time type, forward-time/flipped-frequency type and reverse-time/flipped-frequency type. For waveforms that have near orthogonality relative to each other, the waveform minimizes the ISI by sharing public bandwidth, which means maximizing the time-bandwidth product. There is a difference from the conventional CSS method in symbols representing using multiple GSFM waveforms. Its characteristics are demonstrated using the AF and correlation in this paper. Using this orthogonality of each waveform and distinct mainlobe regions in the AF and ACF, we expect robustness in the ISI-generated multipath channel. We design the receiver, which consists of the bank of the matched filter (MF). To demonstrate the performance of the CSS method by a GSFM waveform, we evaluate the performance through an experimental comparison with the conventional CSS via a simulation, lake trial and sea trial. In this paper, the conventional CSS method is divided into two bands to match the data rate and time bandwidth product with the GSFM method. In the simulation, we consider multipath and additive white Gaussian noise (AWGN) channels, and the multipath is programmed by VirTEX (Virtual Time series Experiment), which is modeled by the Bellhop ray tracing code [
19]. VirTEX was designed to model the propagation through the underwater sound channel of a known time series transmitted from a hypothetical source. The lake trial was conducted with a distance of 300~400 m between the transmitter and receiver. The sea trial was conducted at a distance of approximately 20 km between the transmitter and receiver and at a depth of 1500 m. According to [
20], the conducted range of communication can be regarded as long-range communication. From the results of various experiments, this paper demonstrates the performance of the proposed method in UWA communication.
The rest of this research article proceeds as follows.
Section 2 introduces the GSFM waveform and the proposed method.
Section 3 simulates the proposed method and conventional method and demonstrates the comparison according to the SNRs.
Section 4 demonstrates the performance of the proposed method using a lake trial and sea trial, and the results are given.
Section 5 concludes the paper.
2. UWA Communication by Generalized Sinusoidal Frequency Modulation
The GSFM, which has a generalized form of SFM, is suggested for robust UWA communication. The GSFM waveform has much-lower-range sidelobes and maintains the mainlobe width without using the tapering function in the ACF compared with the LFM waveform.
- A.
Signal Design for UWA Communication
The waveform signal
and IF function
are expressed as shown in Equations (1) and (2), respectively [
16,
17].
in Equation (1) is the normalizing factor;
is the carrier frequency; and
is the instantaneous phase (IP) of the signal.
and
are the IP and IF functions of the GSFM, respectively, and their expressions are as shown in Equations (3) and (4).
is a variable exponent parameter that can be used to adjust the asymmetric IF function of the waveform.
is a modulation index and
is the frequency modulation term.
In using the same index
, the waveform of the GSFM can be used to generate various types of waveforms by reversing the time and frequency domains. In this paper, we use 4 types of waveforms, which are represented in the spectrograms of
Figure 1. In
Figure 1, (a) represents the forward type, (b) represents the flipped frequency of type (a) and is the same as the modulated cosine function, (c) is the reverse time variant of type (a), and (d) has reversed time and a flipped frequency compared with type (a).
In this case, we assume that
is 16 kHz, the bandwidth is 2 kHz and the waveform length T is 1 s. In
Figure 1,
and
is the same as
, which determines the number of cycles
. The four types of GSFM waveforms are orthogonal to each other.
We expect the UWA communication performance to be more effective as a result of using the orthogonal characteristic of the GSFM AFs than that of the conventional LFM. In this paper, we use Equation (5), which is expressed as
Here, is the forward-type waveform, is the forward-time/flipped-frequency-type waveform, is the reverse-time-type waveform and is the reverse-time/flipped-frequency-type waveform of the GSFM. is the n-th bit sequence, T is the length of the symbol, and each symbol represents a 2-bit sequence.
The block diagram of the proposed method is shown in
Figure 2. The received signal that passes through a channel leaves only the band energy using a band pass filter and uses preamble for fine synchronization of the data packet. Finally, the outputs of each matched filter are compared to find the maximum value.
- B.
Orthogonality
The orthogonality between the waveforms representing the symbol plays an important role in the theoretical background of communication performance. The orthogonality of these waveforms determines the performance of the communication system. To determine the relation of each waveform, the AF is used, and its expression is shown in Equation (6).
is the Doppler scaling factor, and
is the time delay.
Figure 3a represents the auto-AF of the FT, and
Figure 3b–d represent the cross-AF of the FT/FF, RT and RT/FF with FT, respectively. In the auto-AF and the cross-AF of
Figure 3, the time delay
range is
, and the Doppler scaling factor
range is
. In
Figure 3a, the auto-AF has a high main level and low sidelobe level; on the other hand, even when using the same bandwidth and parameter, the cross-AF has a low correlation level.
Furthermore, the orthogonality of the four types of GSFM waveforms can also be represented in the correlation function. To determine the correlation of each waveform, we use Equation (7) to Equation (9). Equation (7) represents the basic definition of the correlation function. By the Wiener-Khintchine Theorem, Equation (7) can be represented as Equations (8) and (9).
is the inverse Fourier transform of
, and
is the spectrum of
. In Equation (7), the spectrum of the GSFM waveform is represented by Equation (10) [
16].
Figure 4 represents the correlation of the FT.
Figure 4a represents the ACF,
Figure 4b–d represent the cross-correlation function of the FT/FF, RT and RT/FF with FT, respectively.
3. Simulation Results
This simulation was compared with that of the conventional CSS method to demonstrate the performance of the proposed method, and the simulation considered two channel characteristics: multipath propagation and AWGN channels. To match the data rate, the CSS method is divided into 2 bands that also match the time bandwidth product with the proposed GSFM method. The simulation parameters are presented in
Table 1.
In the simulation, we used the VirTEX simulator with the Bellhop model for a UWA multipath channel. The simulation channel is represented in
Figure 5 and assumes that the depths of the transmitter and receiver are approximately 5 m and 25 m, respectively, the water depth is 50 m and the distance between the transceivers is 400 m. In
Figure 5a, the red lines represent the 1st path (direct path), the black lines represent the 2nd path and the blue lines represent the 3rd path. In this channel,
Figure 5b represents the channel impulse response and represents the received signal’s arrival time depending on each path. No channel coding technique was applied to the data for simulation. The simulation result is represented in
Figure 6, which shows the difference in performance according to the SNR.
In
Figure 6, the horizontal axis represents the scale of the SNR, and the vertical axis represents the uncoded bit error rate (BER). It is analyzed according to the SNR level in the multipath channel. The red line represents the GSFM, which is the proposed method, and the blue line represents the conventional CSS method. We could confirm that the proposed methods have a better BER performance than the conventional CSS method. In the figure, it can be seen that the proposed method has an average gain of approximately 2~3 dB compared with the conventional method. In particular, the performance of the proposed method is relatively good in an environment with a low SNR.
5. Conclusions
In this paper, we proposed UWA communication using GSFM waveforms, which offers advantageous modulation in a fluctuating UWA channel, and the proposed method’s performance was demonstrated by a comparison with that of the conventional CSS method. According to the AF and ACF, we demonstrated that the GSFM waveform has a low sidelobe level and a distinct mainlobe level simultaneously, which has better performance in signal interference. Regarding the reliability of the proposed method, experiments were conducted using three kinds of methods: a simulation that considered the multipath propagation and AWGN channel, a lake trial and a sea trial. In this paper, multiple GSFM waveforms orthogonal to each other are used to transmit data in UWA communication. This is what sets it apart from past studies and is our original contribution on this topic.
The performance of UWA communication is not simply proportional to the trans-mission distance. It is affected by many factors, such as the SSP, the placement of the transceiver, the sea state and the geometry of the channel. In a lake environment, multipath propagation is very severe, affecting the UWA communication performance. This is also shown in the experimental results presented in this paper. The trial environment was a deep sea, and since it is an open space, it was less affected by such multipath propagation. As a result, the proposed method is more robust in UWA communication with multipath propagation and AWGN channels than the conventional CSS method. The Doppler channel was not taken into account in the simulation, but the acquired data from the lake and sea trial contained Doppler shifts of several Hz. If it is to be applied in a severe Doppler channel, a Doppler shift frequency estimation and a modified correlation function will probably be added. There is a need for research on performance verification and improvement in the Doppler channel in future research.