1. Introduction
With the proliferation of electronic appliances, the electromagnetic environment has become congested and complex. The operating frequencies and bandwidths of communication and radar systems keep increasing, which poses great challenges to the comprehensive reconnaissance and reception of electromagnetic signals in electronic warfare (EW). Consequently, receiving systems with low SWaP (size, weight, and power) and a large dynamic range have attracted tremendous attention. Traditional electronic receivers are limited by the sampling-rate bottleneck of analog-to-digital converters (ADCs), which makes it difficult to directly receive and digitize wideband electromagnetic signals. Generally, wideband signals can be sampled by low-speed ADCs through serial [1] or parallel processing [2]. Alternatively, an electrical sub-Nyquist receiver folds the broadband RF input prior to direct digitization by a narrowband ADC [3,4,5], which enables information recovery with low-complexity algorithms based on compressive sensing (CS) principles [6]. However, the efficiency of electrical down-conversion is unsatisfactory due to high timing jitter [7]. Besides, electrical cables suffer from high transmission loss and poor immunity to electromagnetic interference.
The microwave photonic technique has been investigated as a promising option for characterizing and processing wideband RF signals [8]. It offers the optical merits of wide bandwidth, low loss, immunity to electromagnetic interference, and long-distance transmission [9]. Contemporary microwave photonic receivers with wide instantaneous bandwidths can be roughly classified into photonic channelization receivers [10,11] and sub-Nyquist photonic receivers [12]. Compared to channelization systems, sub-Nyquist photonic receivers employ a single low-speed ADC [13], which obviates redundant hardware and saves power consumption [14,15].
The architecture of sub-Nyquist photonic receivers [16] originates from the sampled (pulse-based) analog photonic link [17]. The sampled analog photonic link has been applied in wideband radio frequency (RF) signal receivers with carrier frequencies of up to 40 GHz and a bandwidth of 20 MHz [18]. Nevertheless, the sinusoidal transfer function of the electro-optic modulator (EOM) inherently leads to nonlinear distortions [19,20]. In addition, the low-noise amplifiers (LNAs) for the RF signals also introduce unexpected nonlinear distortions and inevitable noise. The resulting harmonic distortions (HD) and intermodulation distortions (IMD) are folded into the first Nyquist zone (NZ) of the sampled analog photonic link, and the spurious harmonics of signals with large frequency intervals may produce new aliasing and intermodulation components. Moreover, the sampled analog photonic link down-converts the thermal noise and shot noise from all Nyquist zones into the first Nyquist zone. These effects put immense pressure on the back-end signal processing and seriously deteriorate the working bandwidth, reconnaissance distance, signal-to-noise ratio (SNR), and bit error rate (BER). Hence, the intricate spurious distortions and noise of the sampled analog link restrict its practical application in wideband receivers.
Considerable work has focused on linearizing the sampled analog photonic link against the nonlinear distortions introduced by the EOM. Clark et al. achieved 25 dB suppression of third-order intermodulation distortions over an instantaneous bandwidth of 500 MHz [21]. Urick et al. applied pre-distortion linearization to improve the spur-free dynamic range (SFDR) of an externally modulated, long-haul analog fiber-optic link by 6 dB over 6–12 GHz, which may have introduced significant imperfections at the same time [22]. Duan et al. proposed a digital compensation method that suppressed the third-order intermodulation distortion (IMD3) by more than 20 dB [20]. Liang et al. published digital post-compensation methods for a photonic sampling link that suppressed the cross-modulation distortion (XMD) and the IMD3 by 33 dB and 25 dB, respectively, but these could perform poorly under the large dispersion accumulated after long-distance transmission [23]. Lam et al. proposed a digital post-processing linearization technique that efficiently suppresses dynamic distortions added to a wideband signal in an analog optical link, achieving IMD suppression of up to 35 dB over multiple octaves of signal bandwidth [24]. Dai et al. proposed a digital linearization that directly acquires the output third-order intercept point from the system hardware, improving the SFDR with high-performance pre- and post-amplifiers [25].
These linearization schemes generally require either a well-designed distortion-matched hardware path with careful synchronization between the distortion and compensation signals [26] or additional optical devices [27]. Moreover, most of them can only eliminate the nonlinearities caused by the EOM; the nonlinear distortions caused by other devices (such as amplifiers) and the noise interference are rarely handled simultaneously. Therefore, it remains a great challenge to automatically suppress the nonlinearity and noise from various sources in sub-Nyquist photonic receivers for wideband signals.
In recent years, deep learning (DL) has made remarkable achievements in image denoising [28,29], speech enhancement [30,31], optical image reconstruction [32,33], and other related fields [34,35]. Applying deep neural networks to photonic processing tasks that are difficult to realize with traditional algorithms is therefore highly attractive. Deep neural networks can automatically extract the characteristics of signals and analyze the extracted features by establishing a mapping between inputs and outputs in a regression task. By training the models toward the minimum of a loss function with adequate datasets, deep neural models can adaptively convert untrained inputs into the expected target outputs and dispense with complicated explicit mathematical operations.
Recent developments in applying deep learning to photonic links have attracted much attention. Wan et al. proposed an artificial neural network (ANN) equalizer mainly to mitigate the nonlinear impairments of optical fibers [36]. Zhou et al. proposed a microwave photonic instantaneous frequency measurement scanning receiver, in which deep neural network frequency estimation was used to deal with system defects and improve accuracy [37]. Xu et al. proposed two deep neural networks to correct the nonlinear effects and channel mismatch of a two-channel photonic ADC, which improved the effective number of bits (ENOB) for single-tone signals with an input frequency of 23.332 GHz [38]. Furthermore, they applied the modified photonic ADC to real scenarios with complicated signals using the same setup, improving the SFDR by ~18 dB [39]. Subsequently, Yi et al. proposed a transfer-learning network in a 20 GS/s photonic system for denoising multiband signals [40]. Besides, Zou et al. investigated a simplified convolutional recurrent autoencoder (CRAE) network for the mismatch compensation of a channel-interleaved photonic analog-to-digital converter (PADC), which improved both the frame rate and the dynamic range [41].
In this paper, we applied a deep residual network (Resnet) in a sub-Nyquist photonic receiver to realize adaptive linearization and lower the noise floor at the same time. Compared to conventional linearization methods, the deep learning method requires no complicated hardware setups or miscellaneous digital processing steps, and it is suitable for dealing with multi-source nonlinear spurious distortions. To our knowledge, this is the first demonstration of adaptive linearization for a sub-Nyquist photonic receiver with RF signals ranging up to 40 GHz, and the method is applicable to single-tone signals, multi-carrier signals, wireless communication signals, and linear frequency modulation (LFM) signals. More importantly, owing to its adaptive intelligence, the deep learning method is well suited to non-cooperative communication scenarios and to time-varying electromagnetic environments.
2. Materials and Methods
A proof-of-concept experimental setup based on the sub-Nyquist photonic receiving system is illustrated in Figure 1. The optical source was a self-developed mode-locked fiber laser (MLL) based on a semiconductor saturable absorber mirror with long-term stability. The optical pulse width was 247 fs and the spectral width was 10.10 nm. The MLL output was amplified by an erbium-doped fiber amplifier (EDFA) (Accelink, EDFA array series, Wuhan, China) to an output power of 20 dBm. The RF signals modulated the MLL pulses through a Mach–Zehnder intensity modulator (MZM) (iXblue, MXLN-40, Paris, France), and the RF signal frequency ranged up to 40 GHz. Subsequently, photoelectric conversion was carried out by a packaged high-speed positive–intrinsic–negative (PIN) photodetector (PD) (Bonphot Optoelectronics, PD03, Suzhou, China). The output down-converted RF signals were amplified by a tailor-made wideband low-noise amplifier (LNA, YQAMP01040), and the high-frequency components were then filtered out by a low-pass filter (LPF). The received signals were sampled and observed by a 20 GS/s oscilloscope (Rohde & Schwarz, RTO2044, Muenchen, Germany). After analog-to-digital conversion, the digital signals were processed by digital signal processing (DSP) and then fed into the deep learning (DL) module, where the signals polluted by multi-source distortions could be automatically linearized.
The down-conversion principle of the received signals with deep learning-assisted linearization is also shown in Figure 1. In the frequency domain, the mode-locked laser behaves as an optical frequency comb (OFC) with a repetition rate of ~1030 MHz, so the Nyquist zones, indexed from 1 to N, each have a width of 515 MHz. To maximize the down-conversion efficiency, the RF signals were copied onto every comb line through the MZM operating in dual-sideband modulation mode. After propagation through the optical fiber, the loaded RF signals beat with the nearby comb lines during photoelectric conversion. Copies of the down-converted RF signals, harmonic distortions, and intermodulation distortions emerged via direct detection at the PD. Subsequently, the high-frequency components were filtered out by the low-pass filter, and all the frequency components falling into the first Nyquist zone could be sampled by a low-speed ADC. Ultimately, the harmonics and intermodulation distortions were suppressed by the deep learning method.
As deep learning is a data-driven approach, generating appropriate datasets for training and testing is essential. The datasets were derived from the down-converted duplicates of single-tone sinusoidal signals, dual-tone sinusoidal signals, wireless communication signals, and linear frequency modulation (LFM) radar signals. The single-tone and dual-tone sinusoidal signals were generated by a microwave analog signal generator (Sinolink Technologies, SLFS-440D, Beijing, China) with various frequencies (0–40 GHz) and amplitudes (0–20 dBm). The wireless communication signals were generated and transmitted from a universal software radio peripheral (USRP) (National Instruments, USRP-2794, Austin, TX, USA) with multiple modulation formats and carrier frequencies (0–6 GHz). The LFM signals were generated by an arbitrary waveform generator (Tektronix, AWG5202, Tokyo, Japan).
To produce the training targets (reference data) for the deep neural network, the acquired raw digital signals were preprocessed using digital signal processing (DSP) filtering algorithms in the time domain. Afterward, each processed reference signal and its raw signal were treated as one reference/raw data pair, and all such pairs constituted the total dataset. As shown in Figure 2, the deep learning module was divided into two processes: training and testing. Correspondingly, the dataset was randomly split into training and testing sets without overlap, with a testing ratio of 0.2 (169 samples for training and 47 samples for testing). The testing set also served as the validation set to monitor the fitting state of the network. The deep learning layers and training process were implemented with the Keras framework from TensorFlow on an Nvidia TITAN RTX GPU.
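For concreteness, such a split can be reproduced with a few lines of Python; the array names, file layout, and shapes below are illustrative assumptions rather than the exact code used in this work.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Hypothetical arrays holding the 216 raw/reference waveform pairs
# (file names and shapes are assumptions for illustration).
raw = np.load("raw_waveforms.npy")        # shape (216, 49900)
ref = np.load("reference_waveforms.npy")  # shape (216, 49900)

# Random split with a testing ratio of 0.2, without overlap.
x_train, x_val, y_train, y_val = train_test_split(raw, ref, test_size=0.2)
```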
In this work, we designed a deep residual network (Resnet) inspired by the field of computer vision (CV). In principle, a deeper network can extract more complex feature patterns as the number of layers increases. However, increasing the depth may cause gradient vanishing or explosion, which makes the deep learning model difficult to train. The Resnet alleviates this gradient degradation problem through residual learning, so that much deeper networks can be trained [42]. From a mathematical point of view, the residual unit can be expressed as:
$$y_l = h(x_l) + F(x_l, W_l) \tag{1}$$
$$x_{l+1} = f(y_l) \tag{2}$$
In Equations (1) and (2), $x_l$ and $x_{l+1}$ represent the input and output of the $l$-th residual unit, respectively. The residual function $F(x_l, W_l)$ represents the learned residual, $h(x_l) = x_l$ is the identity mapping, and $f(\cdot)$ is the activation function. Based on the above formulas, the feature learned from unit $l$ to unit $L$ is:
$$x_L = x_l + \sum_{i=l}^{L-1} F(x_i, W_i) \tag{3}$$
Using the chain rule, the backward gradient can be obtained as Equation (4):
$$\frac{\partial \mathrm{loss}}{\partial x_l} = \frac{\partial \mathrm{loss}}{\partial x_L}\cdot\frac{\partial x_L}{\partial x_l} = \frac{\partial \mathrm{loss}}{\partial x_L}\left(1 + \frac{\partial}{\partial x_l}\sum_{i=l}^{L-1} F(x_i, W_i)\right) \tag{4}$$
The first factor $\partial \mathrm{loss}/\partial x_L$ in Equation (4) is the gradient of the loss function at unit $L$. The term 1 in the parentheses indicates that the residual mechanism propagates this gradient losslessly, while the other residual gradient has to pass through the weighted layers and is not transmitted directly.
Figure 2 presents the schematic diagram of the designed deep Resnet, which was constructed with 20 residual blocks. Every residual block was formed by two cascaded 1D convolutional layers, each consisting of 32 filters with a kernel size of 16. In the residual learning scheme, the output of every block was summed with the block input. To keep the output at the same scale as the input, a final 1D convolutional layer with a single filter was added. Specifically, since the signals take both positive and negative amplitude values, the leaky rectified linear unit (LeakyReLU) was employed as the activation at the end of every 1D convolutional layer to avoid gradient vanishing. The LeakyReLU function is depicted in Equation (5), where $\alpha$ is a small constant:
$$\mathrm{LeakyReLU}(x) = \begin{cases} x, & x \ge 0 \\ \alpha x, & x < 0 \end{cases} \tag{5}$$
During the training process, we conducted supervised learning of the deep neural network by minimizing the mean absolute error (MAE) between the network output (the rectified data) and the reference data, as shown in Equation (6):
$$\mathrm{MAE} = \frac{1}{n}\sum_{i=1}^{n}\left|\hat{y}_i - y_i\right| \tag{6}$$
where $y_i$ and $\hat{y}_i$ denote the reference and network-output waveform samples, respectively, and $n$ is the number of samples. In detail, as shown in Figure 2, the input of the deep Resnet was the raw time-domain signal waveform, and the training target was the reference time-domain signal waveform. In the testing process, untrained raw data were fed into the trained model, and the output of the model was the linearized time-domain waveform.
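The architecture described above can be sketched in Keras roughly as follows; the initial channel-lifting convolution, the LeakyReLU slope, and the padding mode are assumptions made here to keep the tensor shapes consistent, not details reported in this paper.

```python
import tensorflow as tf
from tensorflow.keras import layers, Model

def residual_block(x, filters=32, kernel_size=16, alpha=0.1):
    # Two cascaded 1D convolutions with LeakyReLU, summed with the block input.
    y = layers.Conv1D(filters, kernel_size, padding="same")(x)
    y = layers.LeakyReLU(alpha)(y)
    y = layers.Conv1D(filters, kernel_size, padding="same")(y)
    y = layers.LeakyReLU(alpha)(y)
    return layers.Add()([x, y])

def build_resnet(length=49900, n_blocks=20):
    inp = layers.Input(shape=(length, 1))
    # Assumed channel-lifting layer so the identity shortcut matches 32 channels.
    x = layers.LeakyReLU(0.1)(layers.Conv1D(32, 16, padding="same")(inp))
    for _ in range(n_blocks):
        x = residual_block(x)
    out = layers.Conv1D(1, 16, padding="same")(x)  # final single-filter layer
    return Model(inp, out)

model = build_resnet()
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-3), loss="mae")
```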
The hyperparameter settings are shown in Table 1. The input signal data were truncated to a length of 49,900 samples before being fed into the Resnet. Adam was adopted as the optimizer, and the initial learning rate was set to 0.001. To ensure adequate learning capacity, the maximum number of epochs was set to 10,000. Meanwhile, an early stopping mechanism was introduced to prevent overfitting, stopping the training when the loss did not improve for 150 epochs. In addition, we applied an adaptive learning rate reduction mechanism (ReduceLROnPlateau), in which the learning rate was reduced by 1% whenever the validation loss had not decreased for 50 epochs.
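A minimal sketch of this training configuration in Keras is shown below, reusing the model and data split from the earlier snippets; the batch size, the monitored quantity, and the interpretation of the 1% learning rate reduction are assumptions.

```python
from tensorflow.keras import callbacks

early_stop = callbacks.EarlyStopping(monitor="val_loss", patience=150,
                                     restore_best_weights=True)
# factor=0.01 reads "reduced by 1%" as "reduced to 1% of its value";
# this interpretation is an assumption.
reduce_lr = callbacks.ReduceLROnPlateau(monitor="val_loss", patience=50,
                                        factor=0.01, min_lr=1e-6)

history = model.fit(x_train[..., None], y_train[..., None],
                    validation_data=(x_val[..., None], y_val[..., None]),
                    epochs=10000, batch_size=4,   # batch size: assumed
                    callbacks=[early_stop, reduce_lr])
```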
3. Results and Discussion
To ensure reasonable learning capacity during the training process, the loss functions of the training dataset and validation dataset were monitored. Figure 3a depicts the training and validation loss curves of the Resnet model. The training and validation losses were calculated as the mean absolute error between the network output and the reference data on the training and validation sets, respectively. Owing to the early stopping scheme, which remedied overfitting and saved training time, the training process lasted 5535 epochs. The training loss was reduced from 14.450 to 0.0560, while the validation loss stabilized at 0.0604; the training loss showed the same convergence trend as the validation loss. Furthermore, Figure 3a also shows the learning rate curve, and the optimal model was obtained when the learning rate had been reduced to 10^−6. The convergence of the two loss curves indicates that the training behaved as expected, with no sign of overfitting or underfitting. Figure 3b,c depict the distributions of the per-sample loss values on the training set and testing set, respectively. The loss distributions of the two sets were essentially the same, with most samples concentrated between 0.005 and 0.020. The training process took about 71,244.4 s.
In the sampled analog photonic link, all the high-frequency components are down-converted into the first Nyquist zone (0–515 MHz). Hence, it was sufficient to set a frequency observation window that just accommodated the first Nyquist zone. According to the down-conversion principle of the sampled analog photonic link, the mapping of the input frequency to the output frequency is described by Equation (7):
$$f_{\mathrm{out}} = \left| f_{\mathrm{in}} - n \cdot f_{\mathrm{rep}} \right| \tag{7}$$
where $n$ is the index of the optical frequency comb line nearest to the signal and $f_{\mathrm{rep}}$ is the repetition frequency of the OFC.
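As a quick check of Equation (7), the folding can be computed numerically; the short function below is an illustrative sketch in which the comb index is simply taken as the nearest integer multiple of the repetition rate.

```python
def folded_frequency(f_in_ghz, f_rep_ghz=1.030):
    """Image of an input RF frequency in the first Nyquist zone per Eq. (7)."""
    n = round(f_in_ghz / f_rep_ghz)      # index of the nearest comb line
    return abs(f_in_ghz - n * f_rep_ghz)

# Example consistent with Figure 5a,b: a 0.60 GHz tone folds to ~0.43 GHz,
# and its 2nd/3rd/4th harmonics fold to ~0.17, 0.26, and 0.34 GHz.
print([round(folded_frequency(k * 0.60), 2) for k in (1, 2, 3, 4)])
```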
Figure 4 displays the deep learning-assisted spurious elimination effects for single-tone sinusoidal signals, formed by splicing the spectra of several representative frequency bands from 0–40 GHz. All the input frequency components were down-converted to the first Nyquist zone (0–515 MHz) after being received by the sub-Nyquist photonic link. It should be noted that when the power of the input signal is high, the harmonic and intermodulation distortions also fall into the first Nyquist zone. As shown in Figure 4, with the aid of the deep learning linearization, intricate distortions such as the down-converted second-order and third-order harmonics were automatically suppressed without any knowledge of the signal frequency. Significantly, the deep learning method also reduced the noise floor (NF) and thereby improved the SNR in all covered frequency bands.
Figure 5a,b show the intuitive effects of eliminating the spurs of a single-tone sinusoidal signal in the time domain and frequency domain, respectively. The frequency of the input signal was 0.60 GHz, which was folded to 430 MHz by the sub-Nyquist photonic link. Simultaneously, the down-converted second harmonic (170 MHz), third harmonic (260 MHz), and fourth harmonic (340 MHz) appeared due to the nonlinearity of the MZM and LNA. As shown in Figure 5a, the signal rectified by the Resnet model was much closer to a standard sinusoid, as a result of dispelling the intricate spurious distortions and noise floor of the raw signal.
As depicted in Equation (8), the spurious suppression ratio (SSR) of the sampled analog photonic link can be equivalently defined as the ratio of the power of the signal to the power of the largest harmonic or intermodulation distortion, which can be calculated from the spectra:
$$\mathrm{SSR} = 10\log_{10}\frac{P_{\mathrm{signal}}}{P_{\mathrm{spur,max}}} \tag{8}$$
Similarly, the SNR can be calculated as the ratio of the power of the signal to the power of the noise, as depicted in Equation (9):
$$\mathrm{SNR} = 10\log_{10}\frac{P_{\mathrm{signal}}}{P_{\mathrm{noise}}} \tag{9}$$
From the frequency-domain perspective in Figure 5b, the SSR improved by about 42.15 dB owing to the deep learning method. Specifically, the second harmonic was suppressed by about 64.26 dB, and the third harmonic by about 41.98 dB. Simultaneously, the noise floor in the first NZ declined from about −80 dBm to about −95 dBm on average.
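Equations (8) and (9) can be estimated directly from the digitized waveform; the sketch below is illustrative, and the evaluation bandwidth and noise-floor estimator are assumptions rather than the exact procedure used for the figures.

```python
import numpy as np

def ssr_snr_db(waveform, fs_hz, f_signal_hz, bw_hz=1e6):
    """Rough SSR (Eq. (8)) and SNR (Eq. (9)) estimates from a power spectrum."""
    psd = np.abs(np.fft.rfft(waveform)) ** 2
    freqs = np.fft.rfftfreq(len(waveform), d=1.0 / fs_hz)
    in_band = np.abs(freqs - f_signal_hz) < bw_hz / 2
    p_signal = psd[in_band].sum()
    rest = psd[~in_band]
    ssr = 10 * np.log10(p_signal / rest.max())                     # vs. largest spur
    snr = 10 * np.log10(p_signal / (np.median(rest) * rest.size))  # crude noise power
    return ssr, snr
```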
In addition, the intermodulation distortions in multi-carrier scenarios can be effectively suppressed by the proposed deep learning method. In the proof-of-concept experiments, the input multi-carrier RF signals were emulated by dual-tone sinusoidal signals at frequencies of 200 MHz and 220 MHz, as shown in Figure 5c,d. The input power of each tone was 16 dBm. Correspondingly, the third-order intermodulation distortions (IMD3) were located at 180 MHz and 240 MHz, respectively.
Figure 5c shows a comparison of the measured waveform of the raw dual-tone signals with sophisticated spurs and the rectified waveform after the deep learning linearization. Figure 5d depicts the spurious suppression effects of the deep learning method in the frequency domain. The SSR of the first Nyquist zone was optimized from 16.93 dB to 40.98 dB, an increase of 24.05 dB. The IMD3 at 180 MHz was suppressed from −35.06 dBm to −59.20 dBm. Interestingly, the second harmonics (400 MHz and 440 MHz), the difference-frequency product of the two tones (20 MHz), and the fifth-order intermodulation distortions (IMD5, 160 MHz and 260 MHz) were also suppressed.
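For reference, the frequencies of these mixing products follow directly from the two tone frequencies; the short calculation below reproduces the values quoted above.

```python
f1, f2 = 0.200, 0.220   # GHz, dual-tone inputs of Figure 5c,d
products_ghz = {
    "IMD3 (2f1-f2, 2f2-f1)":   (2*f1 - f2, 2*f2 - f1),        # 0.180, 0.240
    "IMD5 (3f1-2f2, 3f2-2f1)": (3*f1 - 2*f2, 3*f2 - 2*f1),    # 0.160, 0.260
    "2nd harmonics (2f1, 2f2)": (2*f1, 2*f2),                 # 0.400, 0.440
    "sum frequency (f1+f2)":   (f1 + f2,),                    # 0.420
    "difference (f2-f1)":      (f2 - f1,),                    # 0.020
}
for name, vals in products_ghz.items():
    print(name, [round(v, 3) for v in vals])
```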
Figure 6a displays the measured powers of the folded third-order intermodulation distortion (IMD3), the fifth-order intermodulation distortion (IMD5), and the sum-frequency sideband as a function of the input RF power, with and without deep learning-assisted signal linearization. Owing to the deep learning method, the IMD3 was suppressed by up to 28.9 dB, the IMD5 by up to 21.3 dB, and the sum-frequency component (420 MHz) by up to 44.1 dB. This demonstrates that the Resnet can automatically recognize the dual-tone signals and suppress all the stray components while preserving the signals, in a single pass and without any prior knowledge.
The third-order spurious-free dynamic range (SFDR3) is an essential evaluation criterion of a photonic link and can be calculated according to Equation (10):
$$\mathrm{SFDR}_3 = \frac{2}{3}\left(\mathrm{OIP}_3 - N_0\right) \tag{10}$$
where $\mathrm{OIP}_3$ is the output third-order intercept point of the third-order nonlinear distortion component, and $N_0$ is the output noise of the link normalized to a 1 Hz bandwidth. By varying the input RF power, the powers of the target intermediate frequency (IF) and the IMD3 sidebands, which dominated the spectrum, were monitored. Figure 6b depicts the comparison of the SFDR3 with and without the deep learning method. The slope of the third-order intermodulation component was 3. The measured raw noise floor was −133 dBm/Hz, which was reduced to −148 dBm/Hz after the deep learning processing. The SFDR3 was improved from 96.92 dB·Hz^2/3 to 115.08 dB·Hz^2/3, an enhancement of 18.16 dB. It should be noted that the current SFDR3 is also restricted by the quantization noise of the electrical ADC, which is costly to improve through hardware upgrades.
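As a sanity check, Equation (10) can be inverted to back out the effective OIP3 implied by the reported noise floors and SFDR3 values; the OIP3 numbers below are inferred from this calculation rather than reported measurements.

```python
def sfdr3_db(oip3_dbm, n0_dbm_per_hz):
    """Eq. (10): third-order spur-free dynamic range in dB*Hz^(2/3)."""
    return (2.0 / 3.0) * (oip3_dbm - n0_dbm_per_hz)

def oip3_from_sfdr3(sfdr3, n0_dbm_per_hz):
    """Invert Eq. (10) to estimate the effective OIP3."""
    return 1.5 * sfdr3 + n0_dbm_per_hz

print(oip3_from_sfdr3(96.92, -133))    # ~12.4 dBm, raw link (inferred)
print(oip3_from_sfdr3(115.08, -148))   # ~24.6 dBm, after linearization (inferred)
```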
Figure 7a,b show the effects of the signal linearization tested on wireless communication quadrature phase-shift keying (QPSK) signals. The carrier frequency of the QPSK testing example was 0.14 GHz. The results confirm that the deep Resnet model generalizes convincingly to wideband wireless communication signals. In detail, the burr noise was eliminated in the time-domain waveforms in Figure 7a. In Figure 7b, the SSR was improved from 25.67 dB to 65.46 dB for the wideband QPSK communication signals, an improvement of 39.86 dB. Specifically, the second harmonic (280 MHz) and third harmonic (420 MHz) were almost eliminated. The noise floor dropped by about 15 dB, which improved the SNR without any loss of signal intensity.
Besides, LFM radar signals with wide bandwidths were also used to validate the proposed model. The testing LFM signal was centered at 50 MHz with a bandwidth of 80 MHz. As with the other kinds of signals, the burr noise was eliminated in the time-domain waveforms in Figure 7c. In Figure 7d, the SSR was improved by about 25.39 dB, and at the same time, the noise floor dropped by about 15 dB.
4. Conclusions
In conclusion, inherent hardware imperfections inevitably cause the transmitted signal to deviate from its ideal representation in sampled analog photonic links. In particular, the HD and IMD introduced by the MZM and LNAs severely deteriorate the quality of the received signals. To solve these problems, a deep Resnet was designed to adaptively mitigate the multifactorial spurious distortions and noise while preserving the essential patterns of the signals.
The experimental results validated that the deep learning linearization method can automatically suppress spurious distortions for signals with frequencies of 0–40 GHz. It worked for different signal generators and modulation types, including single-tone signals, dual-tone signals, wireless modulated communication signals, and wideband LFM signals. The proposed method improved the SSR and SNR by up to 42.15 dB and ~15 dB, respectively.
In contrast to previous linearization methods, the deep learning method can synchronously handle multi-source spurious distortions and noise interference that are intractable for traditional digital signal processing techniques. Meanwhile, it saves miscellaneous hardware resources and is expected to lower the SWaP of the receiver. The capabilities shown in processing high-frequency and multi-carrier signals are also promising for ultra-wideband (UWB) microwave signal processing.
In the future, the proposed deep learning algorithms could be ported to embedded platforms, such as field-programmable gate arrays (FPGAs), for real-time signal processing. Besides, we could explore applying the deep learning linearization method to other photonic links as well.