1. Introduction
The demand for high-speed underwater acoustic (UWA) communication has progressively intensified in tandem with advancements in marine engineering and technology [
1,
2,
3]. The conventional phase-shift keying (PSK) modulation scheme faces significant limitations due to the constrained bandwidth of UWA channels and the restricted power output of UWA transducers. Continuous phase modulation (CPM) [
4,
5], which has found widespread application in aviation telemetry and satellite communications [
6,
7,
8], offers a compelling alternative. CPM enhances the bandwidth and power efficiency of UWA communication by virtue of its constant envelope, minimal out-of-band radiation, inherent robustness, and intrinsic coding gain.
Among the various wireless communication channels, UWA channels are among the most intricate, leading to substantial signal losses [
9,
10]. The continuous phase and the nonlinear nature inherent to CPM signals further complicate the design of signal receivers. Ideally, a CPM-modulated signal is detected using the Viterbi algorithm for maximum likelihood sequence estimation, involving a search within a singular super grid [
11]. However, the practical realization of this approach is impeded by the algorithm’s complexity, which escalates exponentially with increasing signal length and the varying characteristics of the communication channel. Duel-Hallen and Heegard [
12] mitigated this complexity by employing a delayed decision-feedback sequence estimation method. Nevertheless, significant challenges remain, particularly when addressing long-delay spread channel responses with relatively high complexity. Numerous studies have sought to simplify the receiver algorithm by decoupling equalization and detection, as proposed by Laurent [
13] and Mengali and Morelli [
14], through CPM decomposition. Additionally, minimum mean square error (MMSE)-based suboptimal linear equalization has been applied to counter inter-symbol interference.
Tan and Stuber [
15] employed Laurent and orthogonal decompositions to achieve frequency domain equalization (FDE) of CPM signals in frequency-selective fading channels. A straightforward differential detection scheme for CPM signals with a modulation index of 1/2 was introduced, significantly reducing the complexity of symbol detection. The effectiveness of the algorithm was validated using minimum shift keying (MSK) and Gaussian MSK (GMSK) signals. Pancaldi and Vitetta [
16] further advanced the development of minimum mean square error (MMSE) equalization for CPM signals in the frequency domain by implementing linear, decision feedback, and turbo equalization techniques. Additionally, leveraging the inherent characteristics of CPM, they proposed an iterative symbol detection algorithm without channel coding, which was integrated with decision feedback equalization. The binary 3-RC format was adopted for performance verification.
Several research efforts have focused on simplifying the process for CPM signals. Park et al. [
17] proposed a simplified decision-feedback equalization scheme, which effectively mitigates the computational complexity associated with the Fourier transform and the Viterbi decoder. Van Thillo et al. [
18] approximated the autocorrelation matrix of a CPM signal as a diagonal block matrix, thereby simplifying the MMSE equalization into a low-complexity zero-forcing equalizer, significantly reducing the computational burden of FDE. Building on the insights from these studies, Saleem and Stüber [
19] further reduced complexity by simplifying low-energy pulses derived from Laurent decomposition. Chayot et al. [
20] leveraged the fractional interval representation of CPM to formulate MMSE-based FDE, which approximates the channel matrix and the pseudo-symbol correlation matrix of the CPM signal. Meanwhile, through the use of shaped-offset quadrature PSK, Rice and Perrins [
21] demonstrated that the FDE techniques proposed by Tan and Stüber, Pancaldi and Vitetta, and Van Thillo et al. exhibit comparable performance in weak multipath communication channels.
Numerous studies have explored the integration of channel estimation [
22,
23,
24,
25] and coding techniques [
26,
27] to enhance system performance. Van Thillo et al. [
22,
23] introduced a novel block structure designed to ensure the phase continuity of a single symbol block, replacing the traditional training sequence for channel estimation with a cyclic prefix (CP). This approach reduces transmission information redundancy while maintaining channel estimation performance. Ozgul et al. [
26] developed a receiver based on turbo equalization combined with various channel coding methods, though they continued to employ the Viterbi algorithm for symbol detection in CPM signals. While existing FDE algorithms for CPM signals demonstrate effectiveness in frequency-selective fading channels, their application to symbol detection becomes increasingly complex when using the Viterbi algorithm. Furthermore, simple phase detection techniques fail to account for the inherent memory of CPM signals, which leads to a degradation in performance.
This study addresses the issues of high complexity in the Viterbi algorithm and performance loss due to simple phase detection in CPM signal reception by proposing a posterior probability-based symbol detection algorithm based on the CPM symbol detection concept from [
16]. The posterior probability-based symbol detection combined with FDE (PS-FDE) can significantly enhance the reception performance of CPM signals while incurring only a marginal increase in computational complexity. This symbol detection algorithm relies solely on the inherent coding gain of CPM signals and does not depend on additional channel coding. After passing through FDE, the CPM signal obtains the initial likelihood values for each symbol via simple phase detection. Then, based on the characteristics of CPM signals, the contained information is separated, delayed, and merged to obtain the symbol log-likelihood value after the information merging. The estimated log-likelihood value can be used as the initial likelihood value to adjust the weight of information merging. PS-FDE detection, after 2–3 rounds of the information merging, can effectively enhance the reception performance of CPM signals.
The remainder of this paper is organized as follows:
Section 2 introduces the system model of CPM UWA communication based on Laurent decomposition that includes the frame structure and preprocessing before PS-FDE;
Section 3 describes the proposed algorithm and analyzes the computational complexity;
Section 4 presents the numerical simulations used to evaluate the proposed algorithm performance;
Section 5 discusses the sea trial performed to verify the proposed algorithms; and
Section 6 concludes the paper.
4. Numerical Simulation
Numerical simulations were conducted to evaluate the performance of PS-FDE when applied to MSK and GMSK, comparing it with traditional FDE employing simple phase detection [
15]. The system parameters used in the simulation (
Table 2) were configured as follows: a 48 kHz sampling frequency, a 6 kHz center frequency, a 2 kbps transmission rate, a 4 kHz bandwidth, a signal sequence length of 1024 bits, and a CP length of 512 bits. The designed signal sequence ensured compliance with the modulated signal phase and maintained phase continuity after CP insertion. Consequently, no additional tail symbols were required.
4.1. Channel Model
UWA channels are characterized by severe multipath interference and time-varying properties. This paper primarily discusses the symbol detection algorithm for CPM based on posterior probabilities in conjunction with FDE; hence, the numerical simulations only consider the multipath interference of the UWA channels. The simulations were performed over a selection of representative channels, including a typical sparse channel, denoted as ChA [
15], a measured shallow-sea channel referred to as ChB, and a measured deep-sea channel designated as ChC.
Table 3 presents the channel impulse response of the sparse channel ChA, with delay values expressed in symbol intervals. The response intervals correspond to integer multiples of the symbol duration. At a transmission rate of 2 kbps, the maximum delay spread is 12.5 ms, and the majority of the channel’s energy is concentrated within the first three symbol intervals.
ChB was obtained through a sea trial conducted over a communication distance of approximately 5.3 km in a trial area with a water depth of around 163 m. The transmitter was deployed at a depth of 80 m, and the receiver was positioned at approximately 97 m. The measured channel response was derived from a transmitted 4–8 kHz chirp signal.
Figure 4a illustrates the normalized time–domain response waveform of the measured channel after applying a 4–8 kHz bandpass filter, while
Figure 4b shows the corresponding normalized frequency–domain response. The principal energy of ChB was highly concentrated, with a maximum delay spread of approximately 25 ms.
ChC was obtained through a sea trial conducted over a communication distance of approximately 10 km in a trial area with a water depth of around 3700 m. The transmitter was located at a depth of 50 m, and the receiver was positioned at approximately 3505 m. The channel response was derived from a transmitted 4–8 kHz chirp signal.
Figure 5a presents the normalized time–domain response waveform of the measured channel after applying a 4–8 kHz bandpass filter, while
Figure 5b depicts the corresponding normalized frequency–domain response. A notable characteristic of ChC is that its primary energy is distributed across four distinct clusters, with a maximum delay spread of approximately 120 ms.
4.2. Minimum Shift Keying
MSK is a distinctive form of CPM signal characterized by a 1/2 modulation index and a memory length of
L = 2. Through Laurent decomposition, the correlation function is expressed as
. Then,
If
Lm = 1, then
is given as
where the weight coefficients are
,
.
Table 4 presents the state transitions to different
l with normalization.
Figure 6 presents the simulation results of the MSK signal over channels ChA, ChB, and ChC. The dotted line represents the theoretical bit error rate (BER) of the MSK signal in an AWGN channel, while the black line illustrates the BER performance of traditional FDE with simple phase detection [
17]. The red, green, and blue curves correspond to the BER of PS-FDE for
k = 1,
k = 2, and
k = 3, respectively.
Figure 6a shows the performance of PS-FDE over ChA. During symbol detection, the first two information merges (i.e.,
k = 1 and
k = 2) can significantly enhance performance. At BER = 10
−4,
k = 1 and
k = 2 improved the performance by 0.6 and 0.4 dB, respectively, compared with the traditional FDE [
15]. Compared to the first two information merges, the third merging (
k = 3) exhibited a modest performance enhancement of approximately 0.2 dB.
As illustrated in
Figure 6b, for ChB, the BER of the MSK signal exhibited a decreasing trend as the number of information merges increased. At BER = 10
−4, the performance of FDE was 11 dB, while PS-FDE outperformed FDE by 0.1 dB, 0.15 dB, and 0.2 dB after the first, second, and third information merging, respectively.
Figure 6c presents the results for ChC, where the performance of PS-FDE after three iterations of information merging showed improvements of 0.1 dB, 0.25 dB, and 0.45 dB, respectively, compared with the FDE performance.
The simulations conducted across the three channels consistently demonstrated that the first two iterations of information merging resulted in a significant performance improvement, whereas the third iteration exhibited a comparatively smaller performance gain.
4.3. Gaussian Minimum Shift Keying
A GMSK signal featuring a memory length of L = 3 and a bandwidth–time product of 0.3 was employed. The correlation function is represented as follows with Laurent decomposition:
Then,
here
where
Q(
x) is a complementary error function.
The detection gain is low because the symbol energy of
l = ±4 is low. Thus,
Lm = 3 was used to calculate the
from (29):
where
.
Table 4 provides the state transitions to a different
l with normalization.
Figure 7 depicts the simulation results of the GMSK signal over ChA, ChB, and ChC. The dotted line represents the theoretical value of the GMSK signal over the AWGN channel, while the black line displays the BER of traditional FDE with simple phase detection [
17]. The red, green, and blue curves correspond to the BER of PS-FDE after
k = 1,
k = 2, and
k = 3 information merging, respectively.
Figure 7a shows the simulation results of the GMSK signal within ChA. At a BER of 10
−4, the performance of FDE was 13 dB, while PS-FDE, after three iterations of information merging, exhibited improvements of 1.4 dB, 2 dB, and 2.3 dB, respectively.
Figure 7b presents the BER curve within ChB. At a BER of 10
−4, FDE performance yielded 10.5 dB, whereas that of PS-FDE following three iterations of information merging improved by 0.2, 0.3, and 0.3 dB, respectively. Note the similarity of the performance in the third information merging to that in the second.
Figure 7c displays the BER curve within ChC. At a BER of 10
−4, FDE performance was 11.2 dB, whereas PS-FDE resulted in enhancements of 0.3, 0.45, and 0.45 dB, respectively. Overall, the first two iterations of information merging resulted in significant improvements in symbol detection performance, whereas the third iteration provided only a marginal enhancement in BER performance.
The simulations conducted on the aforementioned channels with MSK and GMSK signals effectively validated the performance of PS-FDE. PS-FDE achieved substantial improvements during the first two iterations of the information merging, while the incremental gains became significantly limited in the third iteration. Given that the information merging of PS-FDE was predicated on the memory inherent within CPM signals, the information provided by CPM in the information merging gradually converged as the iteration count increased; the error correction capability also gradually weakened. The inherent coding gain of CPM signals was comparably lower than the traditional channel coding, limiting the extent of the performance augmentation. Moreover, tailoring a suitable κ for the information merging necessitates due consideration that corresponds to the diverse CPM signals and the distinct channel structures at play.
5. Sea Trial
A sea trial of MSK UWA communication was conducted during winter in the South China Sea to verify the proposed algorithms.
Figure 8 shows that the average depth of the test area was 208 m, and the communication distance was approximately 5.3 km. A transducer operating at an 80 m depth transmitted MSK signal at three distinct source levels of 196, 190, and 184 dB. The hydrophone positioned at a 97 m depth received the signal.
Figure 9 presents the measured structure of the UWA channel, which was obtained by applying the OMP algorithm to the MSK signal. The maximum delay of the channels was approximately 23 ms, which could be divided into four multiple-path clusters. Throughout the course of the sea trial, 210 groups of MSK signals were transmitted at 6 kHz center frequency, 4–8 kHz transducer bandwidth, 2 kbps communication rate, and 256 bits per frame transmission information sequence. FDE and PS-FDE were used to receive the MSK signal, wherein the FDE error bit was equivalent to that of the PS-FDE at
k = 0.
Table 5 presents the number of bit errors in the received signal at different signal-to-noise ratios (SNRs) within the 4–8 kHz bandwidth. Variations in source levels and propagation losses during UWA communication led to differences in the computed SNR values. The BER of different algorithms exhibited a significant decline as SNR increased. At
k = 0, the performance of PS-FDE was equivalent to that of traditional FDE (FDE with simple phase detection). Compared with traditional FDE, PS-FDE reduced the number of bit errors by 10, 34, and 44 at
k = 1,
k = 2, and
k = 3, respectively, demonstrating its effectiveness. As the number of information merging iterations
k increases, the amount of effective symbol information obtained in the final merging process decreases. Consequently, the detection gain approached saturation, leading to a gradual reduction in performance improvement.
6. Conclusions
CPM was applied to improve the bandwidth and the power efficiency of UWA communication, and PS-FDE was proposed to further improve CPM signal detection performance compared with the traditional FDE. It has a lower complexity than the Viterbi algorithm and uses simple phase detection to extract preliminary information. Combined with the correlation function of CPM, PS-FDE effectively harnesses the inherent coding gain within CPM, ultimately improving the symbol detection performance. Simulations were conducted using MSK and GMSK signals over UWA channels, and the results show that PS-FDE significantly improves detection performance after 2–3 information mergings, compared with that of FDE [
15]. The performance improvement for MSK signals ranged from 0.2 to 1.2 dB under different channel conditions, while for GMSK signals, it ranged from 0.3 to 2.3 dB. The results of the sea trial conducted to prove the effectiveness of the algorithms revealed that compared with that of FDE [
15], the BER of PS-FDE exhibited a 15.83% reduction after the third information merging.
Due to the time-varying nature of UWA channels, the FDE and OMP channel estimation algorithms used in this paper struggle to effectively track channel variations. Consequently, the BER of sea trials is relatively high. Due to the high compatibility of the symbol detection algorithm proposed in this paper with channel equalization, future work will consider integrating this symbol detection algorithm with channel equalization algorithms capable of tracking time-varying channels to enhance the capability to combat time-varying UWA channels.