1. Introduction
Efficiency is of paramount importance in modern wireless communication systems, affecting different aspects of their implementation. On the one hand, high spectrally efficient modulation schemes are required, such as Orthogonal Frequency Division Multiplexing (OFDM), in order to satisfy the ever-increasing demand for data rate and connectivity. On the other hand, the evolution towards more sustainable communications involves the search for energy-efficient transceivers, with the power amplifier (PA) being the most critical subsystem of the transmitter in terms of power consumption. There is an inherent trade-off between linearity and power efficiency in PAs [
1]. Applying a certain backoff from the PA saturation point in order to limit the nonlinear distortion is a simple option with the cost of reducing its efficiency. In contrast to this, the use of linearization techniques such as digital predistortion (DPD) can help to tip the balance in favor of efficiency [
2]. Nevertheless, the design of a DPD relies on a behavioral model that veraciously resembles the inverse function of the PA.
Behavioral models of PAs have evolved from the traditional characterization based on the AM/AM and AM/PM static curves to the modern discrete-time behavioral models. The latter are tailored to take into account not only nonlinear distortion but also memory effects. The relevance of memory effects is emphasized by the waveforms based on OFDM, with scalable numerology and augmented bandwidths, which are defined by the fifth generation New Radio (5G-NR) air interface. As the discrete-time models are better suited for the implementation of DPDs, the correct selection of the model structure plays a key role to optimize the linearization performance.
Diverse approaches can be adopted for the discrete-time modeling of nonlinear systems, such as multidimensional polynomial filters or artificial neural networks. Volterra series [
3,
4] is also a useful mathematical tool for the analysis of nonlinear microwave and wireless systems [
5]. It can be considered the extension of Taylor series to nonlinear systems with memory, whose
n-order terms are obtained by means of a nonlinear generalization of the convolution integral that is commonly employed for the study of linear time invariant (LTI) systems. Volterra series produce behavioral models that are linear with respect to the parameters, thus allowing the use of linear regression methods for their identification [
6]. With the purpose of modeling wireless nonlinear systems that are driven by communications signals, Volterra series expansions are considered in a discrete, time-sample form. The input complex envelope is sampled at time
, with
T being the sampling period and
, thus producing the discrete-time sequence
.
Based on the baseband equivalent Volterra model [
7], the discrete-time amplifier output at the fundamental frequency zone is described by a linear combination of regressors. These regressors are given by odd-order monomials of the discrete-time input complex envelope
and its conjugate
with different delays that are denoted by
etc. If this univariate Volterra series is truncated to a maximum nonlinear order and memory depth, the resulting structure is denoted as the full Volterra (FV) model. Pioneering publications of baseband DPDs have also proposed the memory polynomial MP [
8] and the generalized memory polynomial GMP models [
9]. The MP and GMP models involve terms with the cross product of the signal and its exponentiated envelope, e.g., the MP model contains terms
and the GMP model contains terms
for all
n, odd and even. Since only terms with odd
n values are expected in a conventional FV model, some researchers have been reticent with regard to the use of
n-even terms notwithstanding their beneficial effect on the model fitting capabilities [
10,
11]. Apart from these widely popular heuristic approaches, several models derived from previous knowledge of the device have been proposed [
12,
13,
14]. In particular, the kernel structure of the model published in [
14] was derived on basis of the available equivalent-circuit information of a typical FET device with thermal power feedback. This circuit-knowledge based Volterra model is referred to here to as the CKV model. Like the FV model, the CKV model contains only conventional odd-order terms, which impose a limitation to its accuracy and a pertinent question arises: can the univariate FV and CKV models also be made richer with some kind of “beneficial” circuit-knowledge derived terms?
In the present paper, a novel structure with beneficial terms is demonstrated following a procedure based on a circuit knowledge perspective. First, an internal mechanism which is responsible for the signal envelope generation is identified and this new subnetwork is included in the CKV approach to augment its kernel structure. A basic example of the mentioned enhancement was advanced in [
15], for a simple case of the bivariate CKV model due to the space limitations in a conference paper. In addition to the benefits of these terms in a general CKV model, this new communication also reveals the enhancement of the FV model when the bivariate structure is included.
The existence of the internal envelope waveform provides a rationale to its use as a second control signal in a double Volterra series proposal [
16]. In
Section 2, a general bivariate model is derived from the conventional baseband FV model, the bivariate FV, with an enhanced regressor set. Additionally, the demonstration goes further into the PA circuit knowledge to provide a less complex bivariate Volterra series model, the bivariate CKV. Compressed-sensing techniques are applied to identify the optimum set of active regressors of the proposed models in different experimental scenarios. In
Section 3, the modeling performance of this approach is contrasted with the conventional Volterra models in the case of two amplifiers, a class AB and a class J. For comparison, the results for the GMP and the simplified model presented in [
15] are also included. Finally, the aptitude of the bivariate models in the design of DPDs to linearize PAs is considered in
Section 4 and a summary is presented in the last section.
3. PA Modeling Performance
The experimental setup shown in
Figure 3 was employed for the comparative assessment of the proposed approach with bivariate models versus the use of their conventional counterparts. The probing signal, designed according to the 5G-NR standard [
22] with a 30 MHz bandwidth, was created by an SMU200A vector signal generator (VSG) from Rohde & Schwarz with built-in arbitrary waveform generator. With a peak-to-average power ratio (PAPR) of 10.5 dB, it contained over 180,000 samples, corresponding to a sampling frequency of 92.16 MSample/s. The VSG was followed by two cascaded Mini-Circuits TVA-4W-422A+ preamplifiers in order to drive the PA under test into a mildly nonlinear operation, while keeping the behavior of the modulator sufficiently linear.
Two different PAs were employed in the experiments. The first PA under test was the evaluation board of a class AB amplifier based on the CGH40010 GaN HEMT from Cree Inc. (Durham, NC, USA), operated with a drain-to-source current of about 200 mA at 3.6 GHz and an average output power of 27.4 dBm. The second PA under test was a continuous-mode class J amplifier designed over the CGH35015F GaN HEMT from Cree Inc., providing an average output power of 26 dBm at 850 MHz. The output of the PAs was fed to a PXA-N9030A vector signal analyzer (VSA) from Keysight Technologies through a directional coupler and an attenuator in order to avoid introducing undesired distortion from the equipment. The RF output signal was downconverted to baseband and acquired in the VSA, where the measurement dynamic range was optimized by averaging 300 repetitions of the measured signal. Finally, the signals were time-aligned in order to synchronize the input and output data sets.
First, we compare the different modeling performances of the CKV, FV, bi-CKV and bi-FV models for the class AB PA. An important drawback of the bi-FV model compared to the bi-CKV model is its greater complexity in terms of number of regressors (and coefficients, correspondingly). In order not to go beyond the limits of computing capacity when predicting the PA output, all the models were selected with a ninth nonlinear order and a memory of three samples. Furthermore, the memory is considered only for regressors with nonlinear order below 7 in the case of the FV (univariate and bivariate) models.
The reduction in the NMSE between the modeled and the measured output of the class AB PA as the DOMP algorithm incorporates new regressors to the active set is shown in
Figure 4. For the conventional CKV model (plotted with a dashed line), the best NMSE according to the BIC rule is about
dB. An FV model with the same order and memory improves the NMSE (dashed line with circles), an expected result considering its richer regressors set. On the other hand, the improvement is marginal, indicating that the CKV model is quite appropriate despite its sparseness. For comparison purposes, the GMP model curve is also plotted (dotted line), showing the effect of its beneficial terms. The better accuracy of the bi-CKV model (
10) (displayed in the figure with a solid line) and the further NMSE improvement obtained with the bi-FV model (solid line with circles) are clear results of the bivariate terms.
The NMSEs of the conventional CKV and FV models do not reach the level of error of the corresponding bivariate models, even in the case that more regressors are incorporated, evidencing that without bivariate terms, the stock of regressors is insufficient.
Since the extension of nonlinear order and memory length is impracticable for the bivariate FV model, in the following the focus will be put in the comparison of the conventional and bivariate CKV models with optimized order and memory.
3.1. Signal Alignment
Notice that a perfect time alignment of the envelope
and the input
is assumed in the bi-CKV model (
10), i.e., the envelope delays
. In case that the envelope is advanced with respect to
, the lower limit of
must be a negative index. A second presumption is a perfect synchronization of the acquired output
and the input. This signal alignment is commonly performed with a postprocessing procedure based on the best cross correlation between
and
[
6]. An error in the timing coincidence can yield a raw signal
advanced in time, and the bi-CKV model would require noncausal terms with a dependence on “future” input samples. Alternatively, it is possible to refine the synchronization by delaying the coarse aligned signal
, the number of samples necessary to optimize the NMSE. The procedure is illustrated in
Figure 5, where the NMSE of a bi-CKV model with ninth-order and 10 samples of memory length are plotted for the raw output signal of the class AB PA under the same operating conditions. When repeating the modeling with the signal delayed one sampling time, the NMSE improvement is appreciable. The optimum result is given for a delay of two samples, with an error reduction of about 1 dB.
3.2. Comparison to a Simplified bi-CKV
In the conference paper [
15], a simplified case of the bi-CKV model (
10) was proposed assuming
and
. When grouping separately the terms with the envelope raised to an even power and the terms with the envelope raised to an odd power, the output is given by
The prima (
) indicates that only odd
n indices are included in the first sum and the double prima (
) indicates that only even
n indices are included in the second sum. Under this perspective, the GMP model can be considered a particular case of the bivariate CKV model. In
Figure 6, the results of the model published in [
15] are plotted with asterisks marks, exhibiting a superior performance compared to the univariate CKV model. The NMSE of the present bi-CKV model, indicated with a solid line in the same figure, displays an appreciable further improvement.
The NMSE results for a class AB and a class J amplifiers are depicted in
Figure 7. The evolution of the error versus the number of coefficients that the DOMP algorithm adds to the active regressors set under the restriction of the BIC rule demonstrates an optimum number between 15 and 20 regressors for the class AB PA model. The behavior of the class J PA is more complex—it exhibits not only gain compression but also a notable gain expansion for low power levels, and the corresponding optimum model contains a number between 20 and 25 regressors fixed by the BIC. The data show that conventional models are outperformed by the bivariate models in the two PA cases. It can be observed that even in the case that we continue adding more coefficients, the conventional models do not improve the NMSE. This fact indicates that, without bivariate terms, their stock of regressors lacks the necessary terms to reach a better precision.
4. Application of the Bivariate Model to DPD
With the aim of highlighting the enhancement in linearization capabilities after the inclusion of bivariate model terms, a direct learning architecture (DLA) [
23] was executed in the DPD of a commercial PA. The experimental setup of
Figure 3 was employed for that purpose, where the selected operation point for the class AB PA was characterized by an average output power of 31.2 dBm (39.8 dBm of peak output power) and a gain compression of 2.8 dB.
First, a search of the optimum structure with the DOMP technique was executed over the GMP and the CKV models in the conventional and bivariate forms. The training signals for this experiment were the input and output waveforms of the PA without DPD. In order to have a comparable set of initial regressors, the models were set to seventh order and maximum memory depth of seven taps. The initial numbers of coefficients for the GMP, the CKV and the bi-CKV models were 248, 328 and 812, respectively. The learning curve of the DOMP algorithm with respect to the number of coefficients is shown in
Figure 8, where the expectable decreasing error with the number of coefficients shows the best modeling capabilities in the bivariate CKV structure. The enhancement with respect to the conventional version motivates the increase in number of initial coefficients. The bi-CKV model also shows an enhancement with respect to the GMP model, since it represents a richer regressor structure. The BIC was executed over the sweep in number of components to set the optimum number of coefficients of 99, 165 and 210 for the GMP for the CKV and the bi-CKV models, respectively. These selected regressors were fixed and a DPD was designed based on them in a DLA scheme.
The DPD training was executed for 30 iterations with a step size of
. In this experiment, the linearization performance was measured by means of the adjacent channel power ratio (ACPR) of the output signal. The linearization NMSE exhibits a similar behavior and it is omitted for simplicity. The DPD performance with respect to DLA iterations is shown in
Figure 9, where the linearization capabilities of each algorithm in comparison follow a similar trend closely related to the search of the optimum structure.
Finally, once the DPD coefficients were trained and fixed, a new validation signal was set as the input to feature the algorithm’s linearization performance by their ability to work with signals not used in the model identification stage. The normalized power spectral densities (PSDs) of the output waveform without DPD and with the benchmarked algorithms are presented in
Figure 10. This figure reveals the effectiveness of the DPD exhibiting a spectral regrowth reduction in accordance to the attained performance in finding the model structure and training the DPD coefficients.
5. Conclusions
A bivariate proposal for PA modeling has been presented in this work. The approach considers an internally generated variable which operates as the second input to a bivariate Volterra system, giving a PA output expressed as a double Volterra series. In the present exposition, the beneficial terms with the envelope raised to an odd power respond to a bivariate Volterra model structure and are formally obtained following a procedure based on previous circuit model knowledge. Starting from a general representation, the particular case of the GMP model seen as a bivariate Volterra model is demonstrated.
Experimental data show that the modeling capabilities of a reduced-order bivariate Volterra model applied to both a class AB and a class J PAs outperform those of the univariate Volterra model. Finally, the linearization results for a 30 MHz 5G-NR signal and the class AB PA operating with an average output power of 31.2 dBm (gain compression of 2.8 dB) indicate that the NMSE and ACPR for the upper and lower adjacent channels achieved by the conventional CKV model are −41.9 dBc and −48.5/−48.1 dBc, respectively, compared to the values of −47.0 dB and −54.8/−54.2 dBc for the GMP model, and values of −47.4 dB and −55.7/−55.5 dBc for the bivariate CKV model.
Further development of the present approach, constructed with more complex models (e.g., the FV model), can provide bivariate models with a richer set of regressors and a reasonably improved performance.