Machine-Learning-Based Framework for Coding Digital Receiving Array with Few RF Channels

Xiao, Lei; Han, Yubing; Weng, Zuxin

doi:10.3390/rs14205086

Open AccessArticle

Machine-Learning-Based Framework for Coding Digital Receiving Array with Few RF Channels

by

Lei Xiao

,

Yubing Han

^*

and

Zuxin Weng

School of Electronic and Optical Engineering, Nanjing University of Science and Technology, Nanjing 210094, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2022, 14(20), 5086; https://doi.org/10.3390/rs14205086

Submission received: 17 August 2022 / Revised: 27 September 2022 / Accepted: 9 October 2022 / Published: 12 October 2022

(This article belongs to the Special Issue Radar Techniques and Imaging Applications)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

A novel framework for a low-cost coding digital receiving array based on machine learning (ML-CDRA) is proposed in this paper. The received full-array signals are encoded into a few radio frequency (RF) channels, and decoded by an artificial neural network in real-time. The encoding and decoding networks are studied in detail, including the implementation of the encoding network, the loss function and the complexity of the decoding network. A generalized form of loss function is presented by constraint with maximum likelihood, signal sparsity, and noise. Moreover, a feasible loss function is given as an example and the derivations for back propagation are successively derived. In addition, a real-time processing implementation architecture for ML-CDRA is presented based on the commercial chips. It is possible to implement by adding an additional FPGA on the hardware basis of full-channel DRA. ML-CDRA requires fewer RF channels than the traditional full-channel array, while maintaining a similar digital beamforming (DBF) performance. This provides a practical solution to the typical problems in the existing low-cost DBF systems, such as synchronization, moving target compensation, and being disabled at a low signal-to-noise ratio. The performance of ML-CDRA is evaluated in simulations.

Keywords:

coding digital receiving array; machine learning; low-cost DBF system; few-RF-channel; encoding and decoding

Graphical Abstract

1. Introduction

Digital beamforming (DBF) is a key requirement in modern wireless systems. The cost and power consumption are the main challenges for the wide application of DBF [1], due to the higher number of radio frequency (RF) channels and analog-to-digital converters (ADC) that are required. Therefore, it is critical to develop a novel, low-cost, power-efficient DBF system with few RF channels.

To reduce RF channels, the received array signals have to be compressed/encoded at the beginning of the front-end. However, the decrease in sampled signal dimension will lead to an unacceptable DBF performance. Thus, it is necessary to recover/decode the original received full-array signals to ensure that the signal dimension for DBF is consistent with the traditional full-channel array. Considerable efforts [2,3,4,5,6,7,8,9,10,11] have been made to reduce the digital receiving array (DRA) cost based on the above encoding and decoding architecture. According to the difference in codec, these approaches can be classified into the compressed-sensing-based coding DRA (CS-CDRA) [2,3,4,5,6] and the orthogonal-coding-based coding DRA (OC-CDRA) [7,8,9,10,11,12], as shown in Table 1.

For a typical CS-CDRA, the received full array signals are encoded by a measurement matrix that complies with the restricted isometry property (RIP) [13], such as random Gaussian matrix, random Bernoulli matrix, partial Hadamar matrix, and partial Fourier matrix. Since the received array signal is compressible, it could be reconstructed by solving a convex optimization problem [14]. However, it is impossible to recover the received array signal in real-time due to the iteration requirements of the existing sparse reconstruction algorithms [15,16,17]. In addition, the sparse reconstruction algorithms cannot work stably with a low signal-to-noise ratio (SNR).

The OC-CDRA introduces a code division multiplexing (CDM) [18] technique to identify each signal path associated with every unique antenna element. Specifically, the received signal from each antenna element is mixed with a high-speed spread spectrum code, such as the Walsh–Hadamard and Gold codes, and decorrelated with the same spread spectrum code after digitization. According to the mixer position, the OC-CDRA can be further divided into on-site coding (OSC) architecture [7,8,9] and time sequence phase weighting (TSPW) architecture [10,11,12]. The advantage of OSC architecture is that the received full array signals are encoded at an intermediate frequency (IF) after down-conversion, which is friendly to the implementation of the encoding network. The requirements of ADCs and parallel input/output (I/O) channels at the digital back-end are significantly reduced, but the RF channels are not. Therefore, the OSC architecture is not efficient in terms of cost reduction. However, the TSPW architecture encodes the received array signal at RF before down-conversion. The encoded signals are sampled by few RF channels and ADCs, which significantly reduce the system budget. However, both TSPW and OSC face problems in code synchronization (in both encoding and decoding) and moving target compensation [19]. More importantly, each information bit is decoded from L chips, where L is the code length. the recovered signal in OC-based architecture is only

1 / L

data volume of the full-channel array, which inevitably leads to data loss after decoding. The system-accumulated gain will be decreased

10 l o g (L) dB

.

Different from the above coding DRA (CDRA), there is a generalized coding array, called antenna selection (AS) architecture [20,21,22]. The fundamental idea behind AS is to reduce the number of RF chains by judiciously activating a subset of antennas. This can realize a reconfigurable array beamforming through element selection (not combination) without decoding. However, meanwhile, the array accumulated gain will be decreased according to the number of discarded array elements, which is also a form of data loss.

Artificial intelligence has shown great potential in various fields, and beamforming systems are no exception. A beamforming neural network (BFNN) is proposed in [23] to optimize the beamformer to maximize the spectral efficiency with hardware limitations. It has strong robustness to imperfect channel state information (CSI). Ref. [24] proposes a beamforming prediction network (BFNet) to jointly optimize the power allocation and virtual uplink beamforming of Multiple-Input and Multiple-Output (MIMO) systems. This can get rid of the complexity caused by excessive iteration to realize real-time calculation. For the local scattering caused by sea surface, [25] introduce a convolutional neural network (CNN) framework to estimate the transmitter’s incident angle.

This paper proposes a novel, low-cost, power-efficient DBF system framework called machine-learning-based coding DRA (ML-CDRA). The received full-array signals are encoded into a few RF channels, and decoded by an artificial neural network (ANN) in real-time without any data volume loss. The proposed ML-CDRA can work stably at a low SNR, as verified in simulations at

SNR = - 60 dB

. Since the recovered signals are decoded by a single snapshot, the moving target compensation problem will not bother ML-CDRA. Moreover, we present a generalized loss function of the decoding network, which is carried out from three directions: maximum likelihood, signal sparsity, and noise (including noise power suppression, equal power constraint, and noise whitening). A feasible loss function is given as an example and the derivations for back propagation are successively derived. At the end, the implementation of ML-CDRA is also discussed based on the existing technologies and devices. A real-time processing architecture for ML-CDRA, with the decoding network implemented by a field-programmable gate array (FPGA), is presented.

It should be emphasized that the proposed ML-CDRA is a novel, low-cost, DBF system framework. The form of encoding and decoding networks are not fixed, which leads to a systematic trade-off between cost, resources, and performance. The loss function of decoding network should also be modified according to the applications.

2. Materials and Methods

As shown in Figure 1, the proposed ML-CDRA is composed of two networks: encoding and decoding networks. The signals received from each antenna are selected and combined into a few RF channels through an encoding network. After digitization, the sampled signals are decoded by a decoding network based on machine learning to recover the originally received full-array signals.

2.1. Signal Model

The encoded signals sampled by ADCs are:

\begin{matrix} y & = G Φ (x + n_{1}) + n_{2} \end{matrix}

(1)

where

y \in C^{M}

, G is the gain of receiver module,

Φ \in C^{M \times N}

is the encoding matrix (

M < N

, M is the number of few RF channels, N is the number of antenna elements),

x \in C^{N}

is the received array signal,

n_{1} \sim N (0, Σ_{n_{1}})

is the independent and identically distributed Gaussian noise in wireless channels, and

n_{2} \sim N (0, Σ_{n_{2}})

is the random thermal noise in RF channels.

The traditional array signal model supposes that the noise in each channel is independent and identically distributed. However, this is totally different in CDRA, which is correlated due to the encoding. The same wireless channel noise may be mixed into two different RF channels.

Thus, it is necessary to clarify the composition of the noise in sampled signals.

η = G Φ n_{1} + n_{2}

(2)

where

η

is the noise in the sampled signal.

The power of

n_{2}

is determined by the noise figure F of the digital receiver module. For example, considering the typical value

F = 3 dB

, the introduced thermal noise

n_{2}

has equal power with the input noise after being amplified. In addition, the power of

n_{2}

in each RF channel may be different, which is controlled by the encoding matrix

Φ

.

The covariance matrix of noise

η

is

\begin{matrix} Σ_{η} & = G^{2} Φ Σ_{n_{1}} Φ^{H} + Σ_{n_{2}} \\ = G^{2} Φ Σ_{n_{1}} Φ^{H} + γ G^{2} (Φ Σ_{n_{1}} Φ^{H}) \circ I \end{matrix}

(3)

where

γ = 10^{(F / 10)} - 1

,

{(\cdot)}^{H}

denotes the conjugate transpose, ∘ denotes the Hadamard product, and

I

is the identity matrix.

Obviously, the noise in the sampled signal is correlated.

2.2. Encoding Network

The encoding network can significantly reduce the requirement of RF channels. The received array signals are compressed into a few RF channels according to the encoding matrix

Φ

, which describes the connection between antenna elements and RF channels. The form of encoding matrix

Φ

is diverse. Different encoding matrices will bring a different spatial sensitivity to CDRA.

Consider the single far-field target case; the received array signal

x

can be expressed as

x = a (θ) s

(4)

where

a (θ) \in C^{N}

is the steering vector, and

θ

is the direction of arrival.

The encoded signal of

m^{t h}

channel is

\begin{matrix} y_{m} & = G φ_{m}^{H} (x + n_{1}) + n_{2, m} \\ = G φ_{m}^{H} a (θ) s + (G φ_{m}^{H} n_{1} + n_{2, m}) \end{matrix}

(5)

where

φ_{m}^{H}

is the

m^{t h}

row vector of

Φ

, and

n_{2, m}

is the thermal noise of the

m^{t h}

channel.

Therefore, the SNR of

m^{t h}

channel is

\begin{matrix} {SNR}_{m} = \frac{P_{s}}{P_{n}} & = \frac{G^{2} {| φ_{m}^{H} a (θ) |}^{2} σ_{s}^{2}}{D (G φ_{m}^{H} n_{1}) + σ_{n_{2}, m}^{2}} \end{matrix}

(6)

where

σ_{s}^{2}

is the signal power,

σ_{n_{2}, m}^{2}

is the thermal noise power of mth channel,

| \cdot |

denotes the magnitude, and

D (\cdot)

denotes the variance.

It is easy to find that the signal power

P_{s}

is a function of the direction. This means that the signal from different directions may suffer losses after the encoding network compared with the phased coherent combination. This loss is determined by the array arrangement, direction of arrival, and encoding matrix. It will eventually be reflected in the maximum SNR after DBF according to the weight of the corresponding direction, namely, spatial sensitivity. Thus, the encoding network should be carefully designed in different applications.

According to the difference in implementation, the encoding network can be divided into the fixed encoding network and the tunable encoding network. As shown in Figure 2a, the fixed encoding network is implemented by a multiple input–multiple output (MIMO) feed network without any additional equipment. The tunable encoding network introduces additional RF switches or phase shifters to the feed network, as shown in Figure 2b,c. An RF switch or phase shifter can provide more freedoms for signal processing. Therefore, the received array signals can be encoded in various schemes to obtain optimal performance in different applications by changing the encoding matrix.

The main difficulty of the encoding network is the topological structure overlapping of the feed network. The overlapping wiring can be realized by a multi-layer printed circuit board (PCB) with through holes. Nevertheless, the design of an overlapping feed network is still a challenge when the topological structure is too complex. The RF micro-electromechanical systems (RF-MEMS) switches matrix [26] is one possible scheme to realize the overlapping feed network. However, the RF-MEMS switch matrix still needs further development, especially regarding large-scale and RF performance loss.

The most common feed network without overlap is the adjacent subarray structure. A typical passive array antenna structure is given in Figure 3a. Each group of four adjacent elements is combined into a single RF channel by a fixed feed network. The subarray pattern of this structure is immutable, and the main lobe is aligned to the broadside of the array. Figure 3b is a typical phased array antenna with subarray structure, and each group of four adjacent elements is combined into a single RF channel after the phase shifter. However, its subarray pattern can be changed by switching the phase shifter.

2.3. Decoding Network

Due to (1) is an under-determined equation, which has an infinite number of solutions. It is impossible to recover the original received array signal from the few-channel signal by a linear transformation with a single snapshot. Therefore, non-linear processing is an inevitable choice for the decoding network.

Considering the computational complexity and the real-time processing requirements in engineering, the existing sparse reconstruction algorithms are unsuitable for CDRA [15,16,17]. ANN, which has been rapidly developing in this decade, has the ability to cope with these difficulties.

The decoding network of the proposed ML-CDRA is carried out by an ANN in this paper. The forward propagation of ANN is implemented by limited multipliers and adders with a low latency. The pipeline architecture can achieve single-snapshot, real-time decoding in a pressureless manner. It should be noted that the specific network structure of ANN is open for ML-CDRA. This is a bargain between resources and performance.

Here, a generalized loss function of the decoding network based on the maximum a posteriori estimation (MAP) is given as

J = J_{1} + α J_{2} + β J_{3}

(7)

where

J_{1}

is derived from the maximum likelihood estimation.

J_{2}

and

J_{3}

are the constraints for signal and noise, respectively.

α

and

β

are hyperparameters. The details of the generalized loss function are presented in the next section.

The decoding network is trained based on back propagation, which is carried out by gradient descent as

Z_{i + 1} = Z_{i} + μ \frac{\partial}{\partial Z^{*}} J (Z, Z^{*})

(8)

where

Z_{i}

is the complex network parameter

Z

in

i^{t h}

iteration,

μ

is the iteration step-size,

\frac{\partial}{\partial Z^{*}} J (Z, Z^{*})

is the partial derivative of

J (Z, Z^{*})

to

Z^{*}

,

J (Z, Z^{*})

is the loss function, and

{(\cdot)}^{*}

denotes the conjugate.

Figure 4 describes the training of a decoding network based on the proposed generalized loss function. The forward propagation of the network is performed by single snapshot. Therefore, the reconstructed noise is independent in the time dimension. This ensures that the time accumulation gains of ML-CDRA will not deteriorate compared with the full-channel DRA. The back propagation of the network is performed by batch processing. According to the authors’ experience, it is sufficient to accomplish the training of the decoding network within 1000 batches. Moreover, the training time can be shorter if an offline pre-training is performed based on a simulation or limited actual data. Considering that the non-cooperative objects are more common, the online training ability is essential for the decoding network.

2.4. An Example for Generalized Loss Function

It should first be emphasized that the form of loss function for the decoding network is diverse. The basic idea of the proposed generalized loss function is to focus on the signal and noise simultaneously, especially the noise.

A feasible loss function is given as an example based on the proposed generalized loss function, as follows. The back propagation of the decoding network is carried out by gradient descent, which is derived in Appendix A.

J = J_{1} + α J_{2} + β J_{3}

(9)

where

\begin{matrix} J_{1} & = \sum_{t = 1}^{L} {∥y (t) - G Φ \hat{x} (t)∥}_{Σ^{- 1}}^{2} \\ J_{2} & = K - \sum_{n = 1}^{N} exp (- \frac{λ_{n}^{2} (R_{\hat{x}})}{2 δ_{s}^{2}}) \\ J_{3} & = {∥R_{\hat{η}} - δ_{n}^{2} I∥}_{F}^{2} \end{matrix}

where

{∥Z∥}_{Σ^{- 1}}^{2} = Z^{H} Σ_{η}^{- 1} Z

,

{(\cdot)}^{- 1}

denotes the inverse of matrix, L is the number of snapshots,

R_{\hat{x}} = \frac{1}{L} \sum_{t = 1}^{L} \hat{x} (t) {\hat{x}}^{H} (t)

is the covariance of the decoded signal

\hat{x}

,

{∥\cdot∥}_{F}

denotes the

l_{F}

-norm,

λ_{n} (R_{\hat{x}})

is the

n^{t h}

eigenvalue of

R_{\hat{x}}

, K is the sparsity,

δ_{s}

is the shape parameter,

R_{\hat{η}}

is the covariance of the decoded noise, and

δ_{n}

is the noise power-suppression coefficient.

2.4.1. Maximum Likelihood Estimation

According to Section 2.1, the sampled signals are

y = G Φ x + η

(10)

where noise

η

obeys the normal distribution with mean

0

and covariance matrix

Σ_{η}

.

The maximum likelihood estimation is obtained by maximizing the Gaussian density function

f (\hat{x}; x (t)) = {(\frac{1}{\sqrt{{(2 π)}^{M} {| Σ_{η} |}_{\det}}})}^{L} exp (- \frac{1}{2} \times \sum_{t = 1}^{L} {(y (t) - G Φ \hat{x} (t))}^{H} Σ_{η}^{- 1} (y (t) - G Φ \hat{x} (t)))

(11)

where

{| \cdot |}_{\det}

denotes the determinant.

Therefore, the maximum likelihood estimation can be regarded as

J_{1} = \sum_{t = 1}^{L} {∥y (t) - G Φ \hat{x} (t)∥}_{Σ^{- 1}}^{2}

(12)

2.4.2. Sparsity Constraint for Signal

The eigenvalues of the received array signal covariance are sparse, since the targets are limited. Therefore, the sparsity of signal can be constrained as

J_{2} = K - \sum_{n = 1}^{N} {∥λ_{n} (R_{\hat{x}})∥}_{0}

(13)

where

{∥\cdot∥}_{0}

denotes the

l_{0}

-norm.

As the

l_{0}

-norm minimization is an NP-hard problem, it should be replaced by other models, such as

l_{1}

-norm and smoothed

l_{0}

-norm (SL0) [27]. The SL0 approximates

l_{0}

-norm by a smooth Gaussian function, which is differentiable during the back propagation of the decoding network. (13) can be rewritten by SL0 as

J_{2} = K - \sum_{n = 1}^{N} exp (- \frac{λ_{n}^{2} (R_{\hat{x}})}{2 δ_{s}^{2}})

(14)

2.4.3. Noise Reduction and Whitening

Suppose the received array signals are accurately reconstructed by the constraints of

J_{1}

and

J_{2}

. As we know, the SNR is determined by signal and noise. Thus, the noise constraint can be designed as

J_{3} = {∥R_{\hat{η}} - δ_{n}^{2} I∥}_{F}^{2}

(15)

Noise reduction and whitening are carried out by

δ_{n}

and

I

, respectively. In addition, the

δ_{n}^{2} I

also implies an equal-power constraint for different channels.

The existing reconstruction algorithms [15,16,17] often neglect the statistical characteristics of the reconstructed noise, which is crucial to signal accumulation (including array signal processing and time-domain accumulation). The reconstructed noise power and correlation directly determine the SNR after DBF. We can clearly point out that the upper limit of the array signal reconstruction problem is determined by the noise constraint.

3. Results

To evaluate the performance of the proposed ML-CDRA, a 48-element 12-channel ML-CDRA is studied and compared with different DBF systems as shown in Figure 5, including a 48-element, traditional, full-channel DRA (48-DRA, which has the same antenna elements), a 12-element, traditional, full-channel DRA (12-DRA, which has the same RF channels), and a 48-element, single-channel TSPW array [10] with code length

L = 48

.

As the receiving antenna of each scheme is a uniform, linear array antenna with half-wavelength inter-sensor spacing, each antenna element is omnidirectional. The encoding network of ML-CDRA is the 4-in-1 subarray structure without a phase shifter, as given in Figure 3a. The decoding network is a three-layer, fully connected, multi-layer perceptron (MLP) with 12 neurons in the input layer, 1024 neurons in the hidden layer, and 48 neurons in the output layer. The loss function is given in Section 2.4. The other details of the decoding network are given in Appendix A, including the activation function and the gradient descent.

3.1. SNR after DBF

Assume that a far-field target at

θ_{1} = 5^{\circ}

with single frequency. Both wireless channel noise

n_{1}

and RF channel noise

n_{2}

are Gaussian noise. The noise figure of the digital receiver module

F = 3 dB

. The beamforming weight

w = (1, exp (- j 2 π \frac{d sin (θ_{1})}{λ}), \dots,

exp {(- j 2 π \frac{(N - 1) d sin (θ_{1})}{λ})}^{T})

, where

d = λ / 2

.

As shown in Figure 6, the SNR after the DBF of ML-CDRA is almost consistent with 48-DRA, but only

1 / 4

RF channels are required. This is nearly

6 dB

higher than 12-DRA, which has the same RF channels. In addition, the simulation results also reflect that the system-accumulated gain decreases in TSPW caused by data loss are

10 l o g (L) = 16.8 dB

. More importantly, the ML-CDRA works stably even if the input SNR is as low as

- 60 dB

. According to the simulation results, it can be reasonably speculated that the proposed ML-CDRA will still work effectively with a lower SNR.

Furthermore, the spectrum of the decoded signal after DBF is given in Figure 7 to prove that the proposed ML-CDRA can recover the received array signal perfectly. In the normalized spectrum diagram, the noise power of ML-CDRA is basically the same as 48-DRA, and lower than 12-DRA. It should be noted that the noise of ML-CDRA is approximately uniformly distributed across the spectrum.

3.2. Beamforming Performance

The normalized array pattern of different DBF systems are given in Figure 8. The proposed ML-CDRA has the same beamforming performance compared with 48-DRA, and a narrower beamwidth compared with 12-DRA, which means a better directivity. The traditional subarray, which has the same 4-in-1 structure without phase shifter as shown in Figure 3a, can also achieve a similar narrow beam compared with ML-CDRA. However, the spatial filtering cannot be effectively realized when the undesired targets appear at the gate lobe. This problem was solved in the proposed ML-CDRA by recovering the received array signal.

3.3. Spatial Sensitivity

Different from the antenna pattern, the spatial sensitivity describes the relationship between the maximum SNR after DBF and the directions. Signals from different directions with the same input SNR are fed into ML-CDRA in turn. After decoding, the recovered array signals are combined according to the DBF weight of the corresponding direction, e.g.,

w = (1, exp (- j 2 π \frac{d sin (θ)}{λ}), . . .,

exp {(- j 2 π \frac{(N - 1) d sin (θ)}{λ})}^{T}

, where

θ

is the direction of arrival.

Figure 9 shows several subarray structure-encoding networks, and the spatial sensitivities of these schemes are given in Figure 10a. Wherever the signal comes from, the full-channel DRA can achieve the same maximum DBF gain by phased coherent combination. Therefore, both 48-DRA and 12-DRA appear as a flat line. It is easy to see that the subarray structure ML-CDRA has a better performance than 12-DRA around the broadside direction. The SNR after the DBF of the 4-in-1 structure is profitable in about

(- 20^{\circ}, 20^{\circ})

compared with 12-DRA. Moreover, if the phase-shifters are introduced into the encoding network, as shown in Figure 3b, the ML-CDRA can obtain the maximum SNR after DBF in any direction. As shown in Figure 10b, the maximum gain of ML-CDRA is achieved at

15^{\circ}

by adjusting the phase shifters.

For the ML-CDRA with random encoding structure, as in Figure 2a, the received array signals are combined into a few RF channels according to a random Bernoulli matrix. Figure 11 shows that the SNR of this structure after DBF is similar to the 48-DRA in the broadside direction, and fluctuates around the 12-DRA in other directions while obtaining a narrower beamwidth, as discussed in Section 3.2. The random encoding structure ML-CDRA can take care of the entire space simultaneously, and the SNR fluctuation in different directions is more robust than the subarray structure.

A different encoding structure means a different spatial sensitivity, which may be an advantage in some scenes. For example, in the case of target tracking, the subarray structure ML-CDRA can obtain the maximum gain in the desired direction using phase shifters, as in Figure 10b. The signal from other directions will be attenuated due to the spatial sensitivity. Therefore, the ML-CDRA can naturally realize spatial anti-jamming.

4. Implementation

To further prove the “low-cost” property, the implementation of the proposed ML-CDRA is discussed based on the commercial chips in this section. The complexity of the decoding network is also studied.

A real-time processing implementation architecture for ML-CDRA is presented in Figure 12. The forward propagation of the decoding network is performed by multipliers and adders in FPGA. The back propagation is carried out by gradient descent in a digital signal processor (DSP) and updated with a coherent processing cycle (CPI). For the full-channel DRA, the DBF weight calculation is also performed in DSP based on the array signals, as shown in Figure 12a. Therefore, the gradient descent of ML-CDRA can be integrated into the same DSP chip without any additional data-transfer overhead.

To realize the real-time decoding, each multiplication and addition operation of the decoding network needs to be configured with independent resources. The pipeline architecture shown in Figure 13, which can execute all the operations of each layer of the decoding network in each clock cycle, is more suitable to realize the real-time decoding compared with the instruction set architecture.

To be more specific, the resources consumed to implement the decoding network can be divided into registers, adders, and multipliers. For a scale-limited network, the logic cells and configurable logic blocks (CLB) in FPGA can supply the registers and adders required in the decoding network. The activation functions of the hidden layer neurons (such as Sigmoid, ReUL, sech, etc.) can be implemented by the look-up tables (LUT). In addition, the multiplication ability can be evaluated by multiply accumulate (MAC). Many operations (such as convolution, dot product, and matrix operations) can be converted to multiple MAC operations. Therefore, the networks requiring convolution operation, such as convolutional neural network (CNN), also have the possibility to be used in the proposed ML-CDRA.

Consider a N-element M-channel ML-CDRA, realized by a three-layer, fully connected MLP with M neurons in the input layer, K neurons in the hidden layer, and N neurons in the output layer. The system clock is

f_{c}

. For the super-heterodyne receiver, the received signal is divided into I&Q branches after down-conversion. Therefore, the number of multiply accumulate operations performed per second is

2 (M K + K N) \times f_{c}

(16)

Take a 48-element 12-channel ML-CDRA as an example, which is studied in the simulation. The decoding network is realized by a three-layer, fully connected MLP with 12 neurons in the input layer, 1024 neurons in the hidden layer, and 48 neurons in the output layer. Suppose the system clock is 100 MHz. Therefore, the number of multiply accumulate operations performed per second is 2 * (12 * 1024 + 1024 * 48) * 100 MHz = 12,288 GMAC/s.

Table 2 gives several available commercial FPGAs [28,29]. The Virtex UltraScale+ FPGA VU11P and VU13P are powerful enough to implement the above decoding network using only one chip. The Virtex-7 XC7VX690T can also be used to implement the decoding network with lower complexity. In addition, these FPGAs have enough I/O to assign a dedicated pin to each ADC without time division multiplexing. It should be noted that the decoding network mainly consumes the DSP Slice resources. The remaining resources in FPGA can still be used for other functions.

Hence, the proposed ML-CDRA can be implemented by an additional FPGA on the basis of full-channel DRA. As shown in Figure 12, the required RF channels, variable gain amplifiers (VGA), ADCs and I/Os of ML-CDRA are significantly reduced compared with the full-channel DRA. Moreover, the space requirement, heat dissipation and crosstalk of RF channels can also be alleviated. Considering that the additional cost of the decoding network is lower than that of the reduced RF channel overhead, the proposed ML-CDRA is attractive.

Furthermore, for the large-scale decoding network, adding one chip cannot implement the ML-CDRA. A multiple FPGA cascade is a possible means to solve this problem. In addition, the sparsely connected networks may be a potential direction to reduce the redundant computing and resource consumption. Resource reuse is also a good choice in scenes with a low sampling rate. Some new hardware architectures/devices designed for artificial intelligence, such as the Adaptive Compute Acceleration Platform (ACAP) of Xilinx, may solve the real-time decoding challenge from the hardware level.

5. Discussion

As shown in Figure 10, the maximum SNR of the ML-CDRA with 6-in-1 overlapping is higher than 48-DRA in the broadside direction. This is an interesting result of the proposed encoding and decoding architecture. We have two explanations for this result. The first key point is the broadside direction, while the signal comes from the broadside of the array antenna, there is no phase difference between the received array signals. Hence, the array signal is lossless during the encoding. Another key point is the loss function of the decoding network. The proposed generalized loss function includes the maximum likelihood estimation, the signal sparsity constraint, and the noise constraint, where the noise constraint has noise suppression and whitening abilities. Therefore, the proposed ML-CDRA architecture should be considered to have denoising abilities. In conclusion, it is possible to exceed the full channel array in theory.

The encoding and decoding network optimizations may be a key piece of research for the ML-CDRA in the future. This paper does not require the specific network structure of the decoding network, but its importance is indisputable. The system performance may be further improved by using the network structures proposed in recent years. As explained in Section 2.2, the spatial sensitivity of the ML-CDRA is determined by the encoding network.

6. Conclusions

This paper proposes a novel, low-cost DBF system framework called ML-CDRA. The received full array signals are encoded into a few RF channels, and decoded by an ANN in real-time without any data volume loss. The low-cost power-efficient ML-CDRA framework has a similar digital beamforming performance compared with the traditional full-channel receiving array. The encoding and decoding networks can be flexibly designed according to their requirements in different applications. Moreover, they can be implemented based on the existing commercial chips.

Author Contributions

Conceptualization, L.X. and Y.H.; Data curation, L.X.; Formal analysis, L.X.; Funding acquisition, Y.H.; Investigation, L.X.; Methodology, L.X. and Y.H.; Project administration, L.X.; Resources, Y.H.; Supervision, L.X. and Y.H.; Validation, L.X. and Z.W.; Visualization, L.X. and Z.W.; Writing—original draft, L.X.; Writing—review & editing, L.X. and Y.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Derivatives of the Loss Function

The decoding network is implemented by a three-layer, fully connected MLP network, as shown in Figure A1, which is composed of an input layer, a hidden layer, and an output layer. There are K neurons in the hidden layer, and the activation function of the

k^{t h}

neuron is

g_{k} (t) = sech [v_{k}^{T} \times (y (t) - c_{k})]

. The decoded signal

\hat{x} (t) = Wg (t)

, where

g (t) = {[g_{1} (t), g_{2} (t), . . ., g_{K} (t)]}^{T}

.

Figure A1. The structure of three-layer fully connected MLP network.

The loss function given in Section 2.4 is

J = J_{1} + α J_{2} + β J_{3}

(A1)

where

\begin{matrix} J_{1} & = \sum_{t = 1}^{L} {∥y (t) - G Φ \hat{x} (t)∥}_{Σ^{- 1}}^{2} \\ J_{2} & = K - \sum_{n = 1}^{N} exp (- \frac{λ_{n}^{2} (R_{\hat{x}})}{2 δ_{s}^{2}}) \\ J_{3} & = {∥R_{\hat{η}} - δ_{n}^{2} I∥}_{F}^{2} \end{matrix}

The derivatives of this loss function to

W^{*}

,

v_{k}^{*}

, and

c_{k}^{*}

are derived as follows.

Appendix A.1. The Preliminary

The derivatives

D_{Z} J (Z, Z^{*})

and

D_{Z^{*}} J (Z, Z^{*})

are defined by the following differential expression

\begin{matrix} d J (Z, Z^{*}) = & D_{Z} J (Z, Z^{*}) d vec (Z) \\ + & D_{Z^{*}} J (Z, Z^{*}) d vec (Z^{*}) \end{matrix}

(A2)

where

\begin{matrix} D_{Z} J (Z, Z^{*}) & = {vec}^{T} (\frac{\partial}{\partial Z} J (Z, Z^{*})) \\ D_{Z^{*}} J (Z, Z^{*}) & = {vec}^{T} (\frac{\partial}{\partial Z^{*}} J (Z, Z^{*})) \end{matrix}

where

vec (\cdot)

is the vectorization operator,

{(\cdot)}^{T}

is the transpose operator.

Appendix A.2. The Derivatives of J₁

\begin{matrix} J_{1} & = \sum_{t = 1}^{L} {∥y (t) - G Φ \hat{x} (t)∥}_{Σ^{- 1}}^{2} \\ = \sum_{t = 1}^{L} {(y (t) - G Φ \hat{x} (t))}^{H} Σ_{η}^{- 1} (y (t) - G Φ \hat{x} (t)) \end{matrix}

(A3)

(1) The derivative

D_{W^{*}} J_{1}

is

D_{W^{*}} J_{1} = D_{\hat{x}} J_{1} \times D_{W^{*}} \hat{x} + D_{{\hat{x}}^{*}} J_{1} \times D_{W^{*}} {\hat{x}}^{*}

(A4)

where

D_{W^{*}} \hat{x} = 0

Therefore,

D_{W^{*}} J_{1} = D_{{\hat{x}}^{*}} J_{1} \times D_{W^{*}} {\hat{x}}^{*}

Deriving the derivatives

D_{{\hat{x}}^{*}} J_{1}

and

D_{W^{*}} {\hat{x}}^{*}

, respectively. The total differential of (A3) is

\begin{matrix} d (J_{1}) = & \sum_{t = 1}^{L} {(y (t) - G Φ \hat{x} (t))}^{H} Σ_{η}^{- 1} (- G Φ) d (\hat{x}) \\ + & \sum_{t = 1}^{L} {(y (t) - G Φ \hat{x} (t))}^{T} {(Σ_{η}^{- 1})}^{T} (- G Φ^{*}) d ({\hat{x}}^{*}) \end{matrix}

Therefore, the derivative

D_{{\hat{x}}^{*}} J_{1}

is

D_{{\hat{x}}^{*}} J_{1} = \sum_{t = 1}^{L} {(y (t) - G Φ \hat{x} (t))}^{T} {(Σ_{η}^{- 1})}^{T} (- G Φ^{*})

The derivative

D_{W^{*}} {\hat{x}}^{*}

is

D_{W^{*}} {\hat{x}}^{*} = g^{H} (t) \otimes I_{N}

where ⊗ is the Kronecker product;

I_{N}

is the N-order identity matrix.

Therefore, the derivative of

J_{1}

to

W^{*}

is

\begin{matrix} D_{W^{*}} J_{1} = \\ \sum_{t = 1}^{L} {(y (t) - G Φ \hat{x} (t))}^{T} {(Σ_{η}^{- 1})}^{T} (- G Φ^{*}) (g^{H} (t) \otimes I_{N}) \end{matrix}

(A5)

(2) The derivative

D_{v_{k}^{*}} J_{1}

is

D_{v_{k}^{*}} J_{1} = D_{\hat{x}} J_{1} \times D_{v_{k}^{*}} \hat{x} + D_{{\hat{x}}^{*}} J_{1} \times D_{v_{k}^{*}} {\hat{x}}^{*}

(A6)

where

\begin{matrix} D_{v_{k}^{*}} \hat{x} & = D_{g_{k}} \hat{x} \times D_{v_{k}^{*}} g_{k} + D_{g_{k}^{*}} \hat{x} \times D_{v_{k}^{*}} g_{k}^{*} \\ D_{v_{k}^{*}} {\hat{x}}^{*} & = D_{g_{k}} {\hat{x}}^{*} \times D_{v_{k}^{*}} g_{k} + D_{g_{k}^{*}} {\hat{x}}^{*} \times D_{v_{k}^{*}} g_{k}^{*} \end{matrix}

Due to

\begin{matrix} D_{g_{k}} \hat{x} & = w_{k} \\ D_{g_{k}^{*}} \hat{x} & = 0 \\ D_{g_{k}} {\hat{x}}^{*} & = 0 \\ D_{g_{k}^{*}} {\hat{x}}^{*} & = w_{k}^{*} \end{matrix}

where

W = [w_{1}, w_{2}, . . ., w_{K}]

.

Therefore,

D_{v_{k}^{*}} J_{1} = D_{\hat{x}} J_{1} \times D_{g_{k}} \hat{x} \times D_{v_{k}^{*}} g_{k} + D_{{\hat{x}}^{*}} J_{1} \times D_{g_{k}^{*}} {\hat{x}}^{*} \times D_{v_{k}^{*}} g_{k}^{*}

Deriving the derivatives

D_{v_{k}^{*}} g_{k}

and

D_{v_{k}^{*}} g_{k}^{*}

, respectively. The total differential of the activation function is

\begin{matrix} d (g_{k} (t)) = & {sech}^{'} [v_{k}^{T} (y (t) - c_{k})] {(y (t) - c_{k})}^{T} d (v_{k}) \\ + & {sech}^{'} [v_{k}^{T} (y (t) - c_{k})] (- v_{k}^{T}) d (c_{k}) \\ d (g_{k}^{*} (t)) = & {({sech}^{'} [v_{k}^{T} (y (t) - c_{k})])}^{*} {(y (t) - c_{k})}^{H} d (v_{k}^{*}) \\ + & {sech}^{'} [v_{k}^{T} (y (t) - c_{k})] (- v_{k}^{H}) d (c_{k}^{*}) \end{matrix}

where

{sech}^{'} (\cdot)

is the derivative of

sech (\cdot)

.

Therefore,

\begin{matrix} D_{v_{k}^{*}} g_{k} & = 0 \\ D_{v_{k}^{*}} g_{k}^{*} & = {({sech}^{'} [v_{k}^{T} (y (t) - c_{k})])}^{*} {(y (t) - c_{k})}^{H} \\ D_{c_{k}^{*}} g_{k} & = 0 \\ D_{c_{k}^{*}} g_{k}^{*} & = - {({sech}^{'} [v_{k}^{T} (y (t) - c_{k})])}^{*} v_{k}^{H} \end{matrix}

Therefore, the derivative of

J_{1}

to

v_{k}^{*}

is

\begin{matrix} D_{v_{k}^{*}} J_{1} = & D_{{\hat{x}}^{*}} J_{1} \times D_{g_{k}^{*}} {\hat{x}}^{*} \times D_{v_{k}^{*}} g_{k}^{*} \\ = & \sum_{t = 1}^{L} {(y (t) - G Φ \hat{x} (t))}^{T} {(Σ_{η}^{- 1})}^{T} (- G Φ^{*}) w_{k}^{*} \\ {({sech}^{'} [v_{k}^{T} (y (t) - c_{k})])}^{*} {(y (t) - c_{k})}^{H} \end{matrix}

(A7)

(3) The derivative

D_{c_{k}^{*}} J_{1}

is

\begin{matrix} D_{c_{k}^{*}} J_{1} = & D_{{\hat{x}}^{*}} J_{1} \times D_{g_{k}^{*}} {\hat{x}}^{*} \times D_{c_{k}^{*}} g_{k}^{*} \\ = & \sum_{t = 1}^{L} {(y (t) - G Φ \hat{x} (t))}^{T} {(Σ_{η}^{- 1})}^{T} (- G Φ^{*}) w_{k}^{*} \\ {({sech}^{'} [v_{k}^{T} (y (t) - c_{k})])}^{*} (- v_{k}^{H}) \end{matrix}

(A8)

The derivation of derivative

D_{c_{k}^{*}} J_{1}

is the same as

D_{v_{k}^{*}} J_{1}

.

Appendix A.3. The Derivatives of J₂

\begin{matrix} J_{2} & = K - \sum_{n = 1}^{N} exp (- \frac{λ_{n}^{2} (R_{\hat{x}})}{2 δ_{s}^{2}}) \end{matrix}

(A9)

where

R_{\hat{x}} = \frac{1}{L} \sum_{t = 1}^{L} \hat{x} (t) {\hat{x}}^{H} (t)

.

(1) The derivative

D_{W^{*}} J_{2}

is

D_{W^{*}} J_{2} = D_{\hat{x}} J_{2} \times D_{W^{*}} \hat{x} + D_{{\hat{x}}^{*}} J_{2} \times D_{W^{*}} {\hat{x}}^{*}

(A10)

According to the derivatives of

J_{1}

\begin{matrix} D_{W^{*}} \hat{x} & = 0 \\ D_{W^{*}} {\hat{x}}^{*} & = g^{H} (t) \otimes I_{N} \end{matrix}

Therefore,

D_{W^{*}} J_{2} = D_{{\hat{x}}^{*}} J_{2} \times D_{W^{*}} {\hat{x}}^{*}

where

D_{{\hat{x}}^{*}} J_{2} = D_{λ_{n}} J_{2} \times D_{R_{\hat{x}}} λ_{n} \times D_{{\hat{x}}^{*}} R_{\hat{x}}

Deriving the derivatives

D_{λ_{n}} J_{2}

,

D_{R_{\hat{x}}} λ_{n}

and

D_{{\hat{x}}^{*}} R_{\hat{x}}

, respectively.

D_{λ_{n}} J_{2} = \sum_{n = 1}^{N} \frac{λ_{n} (R_{\hat{x}})}{δ_{s}^{2}} exp (- \frac{λ_{n}^{2} (R_{\hat{x}})}{2 δ_{s}^{2}})

According to [30], the derivative

D_{R_{\hat{x}}} λ_{n}

is

D_{R_{\hat{x}}} λ_{n} = {vec}^{T} (\frac{v_{n}^{*} u_{n}^{T}}{v_{n}^{H} u_{n}})

where

v_{n}^{H}

and

u_{n}

are the left and right eigenvector of

R_{\hat{x}}

, respectively.

\begin{matrix} R_{\hat{x}} u_{n} & = λ_{n} u_{n} \\ v_{n}^{H} R_{\hat{x}} & = v_{n}^{H} λ_{n} \end{matrix}

The derivative

D_{{\hat{x}}^{*}} R_{\hat{x}}

is

D_{{\hat{x}}^{*}} R_{\hat{x}} = \frac{1}{L} \sum_{t = 1}^{L} (I_{N} \otimes \hat{x} (t))

Therefore,

\begin{matrix} D_{W^{*}} J_{2} = \sum_{n = 1}^{N} & \frac{λ_{n} (R_{\hat{x}})}{δ_{s}^{2}} exp (- \frac{λ_{n}^{2} (R_{\hat{x}})}{2 δ_{s}^{2}}) {vec}^{T} (\frac{v_{n}^{*} u_{n}^{T}}{v_{n}^{H} u_{n}}) \\ \frac{1}{L} \sum_{t = 1}^{L} (I_{N} \otimes \hat{x} (t)) (g^{H} (t) \otimes I_{N}) \end{matrix}

(A11)

(2) The derivative

D_{v_{k}^{*}} J_{2}

is

\begin{matrix} D_{v_{k}^{*}} J_{2} = D_{{\hat{x}}^{*}} J_{2} \times D_{v_{k}^{*}} {\hat{x}}^{*} \\ = & \sum_{n = 1}^{N} \frac{λ_{n} (R_{\hat{x}})}{δ_{s}^{2}} exp (- \frac{λ_{n}^{2} (R_{\hat{x}})}{2 δ_{s}^{2}}) {vec}^{T} (\frac{v_{n}^{*} u_{n}^{T}}{v_{n}^{H} u_{n}}) \\ \frac{1}{L} & \sum_{t = 1}^{L} (I_{N} \otimes \hat{x} (t)) w_{k}^{*} {({sech}^{'} [v_{k}^{T} (y (t) - c_{k})])}^{*} {(y (t) - c_{k})}^{H} \end{matrix}

(A12)

The derivation of derivative

D_{v_{k}^{*}} J_{2}

is the same as

D_{v_{k}^{*}} J_{1}

.

(3) The derivative

D_{c_{k}^{*}} J_{2}

is

\begin{matrix} D_{c_{k}^{*}} J_{2} = D_{{\hat{x}}^{*}} J_{2} \times D_{c_{k}^{*}} {\hat{x}}^{*} \\ = & \sum_{n = 1}^{N} \frac{λ_{n} (R_{\hat{x}})}{δ_{s}^{2}} exp (- \frac{λ_{n}^{2} (R_{\hat{x}})}{2 δ_{s}^{2}}) {vec}^{T} (\frac{v_{n}^{*} u_{n}^{T}}{v_{n}^{H} u_{n}}) \\ \frac{1}{L} & \sum_{t = 1}^{L} (I_{N} \otimes \hat{x} (t)) w_{k}^{*} {({sech}^{'} [v_{k}^{T} (y (t) - c_{k})])}^{*} (- v_{k}^{H}) \end{matrix}

(A13)

The derivation of derivative

D_{c_{k}^{*}} J_{2}

is the same as

D_{c_{k}^{*}} J_{1}

.

Appendix A.4. The Derivatives of J₃

J_{3} = {∥R_{\hat{η}} - δ_{n}^{2} I_{N}∥}_{F}^{2}

(A14)

where

R_{\hat{η}} = \frac{1}{L} \sum_{t = 1}^{L} \hat{η} (t) {\hat{η}}^{H} (t)

.

(1) The derivative

D_{W^{*}} J_{3}

is

D_{W^{*}} J_{3} = D_{\hat{η}} J_{3} \times D_{W^{*}} \hat{η} + D_{{\hat{η}}^{*}} J_{3} \times D_{W^{*}} {\hat{η}}^{*}

(A15)

According to the derivatives of

J_{1}

\begin{matrix} D_{W^{*}} \hat{η} & = 0 \\ D_{W^{*}} {\hat{η}}^{*} & = g^{H} (t) \otimes I_{N} \end{matrix}

Therefore,

D_{W^{*}} J_{3} = D_{{\hat{η}}^{*}} J_{3} \times D_{W^{*}} {\hat{η}}^{*}

where

D_{{\hat{η}}^{*}} J_{3} = D_{R_{\hat{η}}} J_{3} \times D_{{\hat{η}}^{*}} R_{\hat{η}} + D_{R_{\hat{η}}^{*}} J_{3} \times D_{{\hat{η}}^{*}} R_{\hat{η}}^{*}

Deriving the derivatives

D_{R_{\hat{η}}} J_{3}

,

D_{{\hat{η}}^{*}} R_{\hat{η}}

,

D_{R_{\hat{η}}^{*}} J_{3}

and

D_{{\hat{η}}^{*}} R_{\hat{η}}^{*}

, respectively.

\begin{matrix} D_{R_{\hat{η}}} J_{3} & = {vec}^{H} (R_{\hat{η}} - δ_{n}^{2} I_{N}) \\ D_{R_{\hat{η}}^{*}} J_{3} & = {vec}^{T} (R_{\hat{η}} - δ_{n}^{2} I_{N}) \\ D_{{\hat{η}}^{*}} R_{\hat{η}} & = \frac{1}{L} \sum_{t = 1}^{L} (I_{N} \otimes \hat{η} (t)) \\ D_{{\hat{η}}^{*}} R_{\hat{η}}^{*} & = \frac{1}{L} \sum_{t = 1}^{L} (\hat{η} (t) \otimes I_{N}) \end{matrix}

Therefore,

\begin{matrix} D_{W^{*}} J_{3} = [{vec}^{H} (R_{\hat{η}} - δ_{n}^{2} I_{N}) \frac{1}{L} \sum_{t = 1}^{L} (I_{N} \otimes \hat{η} (t)) \\ + & {vec}^{T} (R_{\hat{η}} - δ_{n}^{2} I_{N}) \frac{1}{L} \sum_{t = 1}^{L} (\hat{η} (t) \otimes I_{N})] (g^{H} (t) \otimes I_{N}) \end{matrix}

(A16)

(2) The derivative

D_{v_{k}^{*}} J_{3}

is

\begin{matrix} D_{v_{k}^{*}} J_{3} = [ & {vec}^{H} (R_{\hat{η}} - δ_{n}^{2} I_{N}) \frac{1}{L} \sum_{t = 1}^{L} (I_{N} \otimes \hat{η} (t)) \\ + & {vec}^{T} (R_{\hat{η}} - δ_{n}^{2} I_{N}) \frac{1}{L} \sum_{t = 1}^{L} (\hat{η} (t) \otimes I_{N})] \\ \times w_{k}^{*} ( & {sech}^{'} [v_{k}^{T} (y (t) - c_{k})])^{*} {(y (t) - c_{k})}^{H} \end{matrix}

(A17)

The derivation of derivative

D_{v_{k}^{*}} J_{3}

is the same as

D_{v_{k}^{*}} J_{1}

.

(3) The derivative

D_{c_{k}^{*}} J_{3}

is

\begin{matrix} D_{c_{k}^{*}} J_{3} = [ & {vec}^{H} (R_{\hat{η}} - δ_{n}^{2} I_{N}) \frac{1}{L} \sum_{t = 1}^{L} (I_{N} \otimes \hat{η} (t)) \\ + & {vec}^{T} (R_{\hat{η}} - δ_{n}^{2} I_{N}) \frac{1}{L} \sum_{t = 1}^{L} (\hat{η} (t) \otimes I_{N})] \\ \times w_{k}^{*} ( & {sech}^{'} [v_{k}^{T} (y (t) - c_{k})])^{*} (- v_{k}^{H}) \end{matrix}

(A18)

The derivation of derivative

D_{c_{k}^{*}} J_{3}

is the same as

D_{c_{k}^{*}} J_{1}

.

Appendix A.5. The Derivatives of J

Finally, the derivatives of J to

W^{*}

,

v_{k}^{*}

, and

c_{k}^{*}

are

\begin{matrix} D_{W^{*}} J & = D_{W^{*}} J_{1} + α D_{W^{*}} J_{2} + β D_{W^{*}} J_{3} \\ D_{v_{k}^{*}} J & = D_{v_{k}^{*}} J_{1} + α D_{v_{k}^{*}} J_{2} + β D_{v_{k}^{*}} J_{3} \\ D_{c_{k}^{*}} J & = D_{c_{k}^{*}} J_{1} + α D_{c_{k}^{*}} J_{2} + β D_{c_{k}^{*}} J_{3} \end{matrix}

(A19)

References

Cantrell, B.; de Graaf, J.; Willwerth, F.; Meurer, G.; Leibowitz, L.; Parris, C.; Stapleton, R. Development of a Digital Array Radar (DAR). IEEE Aerosp. Electron. Syst. Mag. 2002, 17, 22–27. [Google Scholar] [CrossRef] [Green Version]
Zhang, R.; Shim, B.; Wu, W. Direction-of-Arrival Estimation for Large Antenna Arrays with Hybrid Analog and Digital Architectures. IEEE Trans. Signal Process. 2022, 70, 72–88. [Google Scholar] [CrossRef]
Ibrahim, M.; Ramireddy, V.; Lavrenko, A.; König, J.; Römer, F.; Landmann, M.; Grossmann, M.; Del Galdo, G.; Thomä, R.S. Design and analysis of compressive antenna arrays for direction of arrival estimation. Signal Process. 2017, 138, 35–47. [Google Scholar] [CrossRef] [Green Version]
Shen, Q.; Liu, W.; Cui, W.; Wu, S. Underdetermined DOA Estimation Under the Compressive Sensing Framework: A Review. IEEE Access 2016, 4, 8865–8878. [Google Scholar] [CrossRef]
Wang, J.; Sheng, W.; Han, Y.; Ma, X. Adaptive beamforming with compressed sensing for sparse receiving array. IEEE Trans. Aerosp. Electron. Syst. 2014, 50, 823–833. [Google Scholar] [CrossRef]
Bajor, M.; Haque, T.; Han, G.; Zhang, C.; Wright, J.; Kinget, P.R. A Flexible Phased-Array Architecture for Reception and Rapid Direction-of-Arrival Finding Utilizing Pseudo-Random Antenna Weight Modulation and Compressive Sampling. IEEE J. Solid-State Circuits 2019, 54, 1315–1328. [Google Scholar] [CrossRef]
Rahman, M.R.; Bojja-Venkatakrishnan, S.; Alwan, E.A.; Volakis, J.L. Spread Spectrum Techniques with Channel Coding for Wideband Secured Communication Links. In Proceedings of the 2020 IEEE International Symposium on Antennas and Propagation and North American Radio Science Meeting, Toronto, ON, Canada, 5–10 July 2020; pp. 1783–1784. [Google Scholar] [CrossRef]
Alwan, E.A.; Khalil, W.; Volakis, J.L. Ultra-wideband on-site coding receiver (OSCR) for digital beamforming applications. In Proceedings of the 2013 IEEE Antennas and Propagation Society International Symposium (APSURSI), Orlando, FL, USA, 7–13 July 2013; pp. 620–621. [Google Scholar] [CrossRef]
Ullah, K.; Venkatakrishnan, S.B.; Volakis, J.L. Low Power and Low Cost Millimeter-Wave Digital Beamformer Using an Orthogonal Coding Scheme. In Proceedings of the 2021 USNC-URSI Radio Science Meeting (USCN-URSI RSM), Honolulu, HI, USA, 9–13 August 2021; pp. 64–68. [Google Scholar] [CrossRef]
Zhang, J.; Wu, W.; Fang, D.G. Single RF Channel Digital Beamforming Multibeam Antenna Array Based on Time Sequence Phase Weighting. IEEE Antennas Wirel. Propag. Lett. 2011, 10, 514–516. [Google Scholar] [CrossRef]
Zhang, D.; Zhang, J.D.; Cui, C.; Wu, W.; Fang, D.G. Single RF Channel Digital Beamforming Array Antenna Based on Compressed Sensing for Large-Scale Antenna Applications. IEEE Access 2018, 6, 4340–4351. [Google Scholar] [CrossRef]
Ma, Y.; Miao, C.; Wu, W.; Li, Y. Hyper Beamforming in Time Modulated Linear Arrays. In Proceedings of the 2020 IEEE Asia-Pacific Microwave Conference (APMC), Hong Kong, China, 8–11 December 2020; pp. 448–450. [Google Scholar] [CrossRef]
Ding, J.; Chen, L.; Gu, Y. Perturbation Analysis of Orthogonal Matching Pursuit. IEEE Trans. Signal Process. 2013, 61, 398–410. [Google Scholar] [CrossRef] [Green Version]
Donoho, D.L. Compressed sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306. [Google Scholar] [CrossRef]
Bian, S.; Zhang, L. Overview of Match Pursuit Algorithms and Application Comparison in Image Reconstruction. In Proceedings of the 2021 IEEE Asia-Pacific Conference on Image Processing, Electronics and Computers (IPEC), Dalian, China, 14–16 April 2021; pp. 216–221. [Google Scholar] [CrossRef]
Yaghoobi, M.; Wu, D.; Davies, M.E. Fast Non-Negative Orthogonal Matching Pursuit. IEEE Signal Process. Lett. 2015, 22, 1229–1233. [Google Scholar] [CrossRef] [Green Version]
Han, B.; Jiang, H. An Improved Stagewise Orthogonal Matching Pursuit-Recursive Algorithm for Power Quality Signal Reconstruction. In Proceedings of the 2019 Computing, Communications and IoT Applications (ComComAp), Shenzhen, China, 26–28 October 2019; pp. 293–298. [Google Scholar] [CrossRef]
Hanzo, L.L.; Yang, L.L.; Kuan, E.L.; Yen, K. CDMA Overview. In Single and Multi-Carrier DS-CDMA: Multi-User Detection, Space-Time Spreading, Synchronisation, Networking and Standards; IEEE Press: Piscataway, NJ, USA, 2004; pp. 35–80. [Google Scholar] [CrossRef]
Chen, Q.; Zhang, J.D.; Wu, W.; Fang, D.G. Enhanced Single-Sideband Time-Modulated Phased Array with Lower Sideband Level and Loss. IEEE Trans. Antennas Propag. 2020, 68, 275–286. [Google Scholar] [CrossRef]
Wang, X.; Greco, M.S.; Gini, F. Adaptive Sparse Array Beamformer Design by Regularized Complementary Antenna Switching. IEEE Trans. Signal Process. 2021, 69, 2302–2315. [Google Scholar] [CrossRef]
Tabrikian, J.; Isaacs, O.; Bilik, I. Cognitive Antenna Selection for Automotive Radar Using Bobrovsky-Zakai Bound. IEEE J. Sel. Top. Signal Process. 2021, 15, 892–903. [Google Scholar] [CrossRef]
Garcia-Rodriguez, A.; Masouros, C.; Rulikowski, P. Efficient large scale antenna selection by partial switching connectivity. In Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 5–9 March 2017; pp. 6269–6273. [Google Scholar] [CrossRef] [Green Version]
Lin, T.; Zhu, Y. Beamforming Design for Large-Scale Antenna Arrays Using Deep Learning. IEEE Wirel. Commun. Lett. 2020, 9, 103–107. [Google Scholar] [CrossRef] [Green Version]
Huang, H.; Peng, Y.; Yang, J.; Xia, W.; Gui, G. Fast Beamforming Design via Deep Learning. IEEE Trans. Veh. Technol. 2020, 69, 1065–1069. [Google Scholar] [CrossRef]
Zhao, X.; Yang, Y.; Chen, K.S. Direction-of-Arrival Estimation over Sea Surface from Radar Scattering Based on Convolutional Neural Network. Remote Sens. 2021, 13, 2681. [Google Scholar] [CrossRef]
Zhou, W.W.; Sheng, W.X.; Cui, J.; Han, Y.B. SR-Crossbar topology for large-scale RF MEMS switch matrices. IET Microwaves Antennas Propag. 2019, 13, 231–238. [Google Scholar] [CrossRef]
Komolafe, T.E.; Wang, K.; Du, Q.; Hu, T.; Yuan, G.; Zheng, J.; Zhang, C.; Li, M.; Yang, X. Smoothed L0-Constraint Dictionary Learning for Low-Dose X-Ray CT Reconstruction. IEEE Access 2020, 8, 116961–116973. [Google Scholar] [CrossRef]
Xilinx. 7 Series FPGAs Data Sheet: Overview; Xilinx: San Jose, CA, USA, 2020; Rev. 2.6.1. [Google Scholar]
Xilinx. UltraScale Architecture and Product Data Sheet: Overview; Xilinx: San Jose, CA, USA, 2022; Rev. 4.1.1. [Google Scholar]
Hjorungnes, A.; Gesbert, D. Complex-Valued Matrix Differentiation: Techniques and Key Results. IEEE Trans. Signal Process. 2007, 55, 2740–2746. [Google Scholar] [CrossRef]

Figure 1. The block diagram of the proposed ML-CDRA.

Figure 2. Different encoding networks. (a) Fixed encoding network. (b) Tunable encoding network with RF switches. (c) Tunable encoding network with phase shifters.

Figure 3. The encoding networks with subarray structure. (a) Subarray without phase shifter. (b) Subarray with phase shifter.

Figure 4. The training of decoding network.

Figure 5. Different DBF systems studied in simulations.

Figure 6. The SNR after DBF with different input SNR.

Figure 7. The normalized spectrum diagram of the decoded signal after DBF with input

SNR = - 20 dB

.

Figure 7. The normalized spectrum diagram of the decoded signal after DBF with input

SNR = - 20 dB

.

Figure 8. The normalized array pattern of different DBF systems.

Figure 9. Different subarray structure studied for spatial sensitivity.

Figure 10. The spatial sensitivity of subarray structure with input SNR =

- 20 dB

. (a) Without phase shifter. (b) With phase shifter and pointing at

15^{\circ}

.

Figure 10. The spatial sensitivity of subarray structure with input SNR =

- 20 dB

. (a) Without phase shifter. (b) With phase shifter and pointing at

15^{\circ}

.

Figure 11. The spatial sensitivity of random encoding structure with input SNR

= - 20 dB

.

Figure 11. The spatial sensitivity of random encoding structure with input SNR

= - 20 dB

.

Figure 12. The implementation architectures for the full-channel DRA and the ML-CDRA. (a) A typical architecture for full-channel DRA. (b) A real-time processing architecture for ML-CDRA with the decoding network implemented by FPGA.

Figure 13. Types of processing architecture. (a) The pipeline architecture. (b) The instruction set architecture.

Table 1. Different architectures for low-cost DBF systems.

	Type	RF Channels	ADCs	Cost ¹	Real-Time Processing	Low SNR Available	Without Data Loss	Without Moving Target Compensation	Without Gate Lobe
	DRA	Massive	Massive	Expensive	√	√	√	√	√
	Subarray	Few	Few	Middle	√	√	√	√	×
Coding-DRA	CS-based	Few	Few	Middle	×	×	√²	√²	√
	OC-based TSPW	Single	Single	Cheap	√	√	×	×	√
	OC-based OSC	Massive	Single	Expensive	√	√	×	×	√
	ML-based	Few	Few	Middle	√	√	√	√	√

¹ System cost is mainly determined by the requirement of RF channels. ² Data loss will occur if the reconstruction is based on multi-snapshots. In the meantime, moving target compensation is necessary.

Table 2. Several available commercial FPGAs for ML-CDRA.

	Device	Logic Cells	CLB			DSP			RAM		I/O
	Device	Logic Cells	Slices	Flip-Flops	LUTs	DSP Slices	Fmax /MHz	DSP Performance GMAC/s ¹	Distributed RAM/Mb	Block RAM /Mb
Virtex-7	XC7VX 690T	693.12 K	108.3 K	866.4 K	433.2 K	3600	741	5335	10.8	52.92	1000
Virtex UltraScale+	VU11P	2835 K	324 K	2592 K	1296 K	9216	891	16,422	36.2	70.9	624
Virtex UltraScale+	VU13P	3780 K	432 K	3456 K	1728 K	12,288	891	21,897	48.3	94.5	832

¹ Peak DSP performance numbers are based on symmetrical filter implementation, DSP performance = DSP Slices × Fmax × 2.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Xiao, L.; Han, Y.; Weng, Z. Machine-Learning-Based Framework for Coding Digital Receiving Array with Few RF Channels. Remote Sens. 2022, 14, 5086. https://doi.org/10.3390/rs14205086

AMA Style

Xiao L, Han Y, Weng Z. Machine-Learning-Based Framework for Coding Digital Receiving Array with Few RF Channels. Remote Sensing. 2022; 14(20):5086. https://doi.org/10.3390/rs14205086

Chicago/Turabian Style

Xiao, Lei, Yubing Han, and Zuxin Weng. 2022. "Machine-Learning-Based Framework for Coding Digital Receiving Array with Few RF Channels" Remote Sensing 14, no. 20: 5086. https://doi.org/10.3390/rs14205086

APA Style

Xiao, L., Han, Y., & Weng, Z. (2022). Machine-Learning-Based Framework for Coding Digital Receiving Array with Few RF Channels. Remote Sensing, 14(20), 5086. https://doi.org/10.3390/rs14205086

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Machine-Learning-Based Framework for Coding Digital Receiving Array with Few RF Channels

Abstract

1. Introduction

2. Materials and Methods

2.1. Signal Model

2.2. Encoding Network

2.3. Decoding Network

2.4. An Example for Generalized Loss Function

2.4.1. Maximum Likelihood Estimation

2.4.2. Sparsity Constraint for Signal

2.4.3. Noise Reduction and Whitening

3. Results

3.1. SNR after DBF

3.2. Beamforming Performance

3.3. Spatial Sensitivity

4. Implementation

5. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Derivatives of the Loss Function

Appendix A.1. The Preliminary

Appendix A.2. The Derivatives of J₁

Appendix A.3. The Derivatives of J₂

Appendix A.4. The Derivatives of J₃

Appendix A.5. The Derivatives of J

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Machine-Learning-Based Framework for Coding Digital Receiving Array with Few RF Channels

Abstract

1. Introduction

2. Materials and Methods

2.1. Signal Model

2.2. Encoding Network

2.3. Decoding Network

2.4. An Example for Generalized Loss Function

2.4.1. Maximum Likelihood Estimation

2.4.2. Sparsity Constraint for Signal

2.4.3. Noise Reduction and Whitening

3. Results

3.1. SNR after DBF

3.2. Beamforming Performance

3.3. Spatial Sensitivity

4. Implementation

5. Discussion

6. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Derivatives of the Loss Function

Appendix A.1. The Preliminary

Appendix A.2. The Derivatives of J1

Appendix A.3. The Derivatives of J2

Appendix A.4. The Derivatives of J3

Appendix A.5. The Derivatives of J

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Appendix A.2. The Derivatives of J₁

Appendix A.3. The Derivatives of J₂

Appendix A.4. The Derivatives of J₃