Next Article in Journal
Joint Time-Reversal Space-Time Block Coding and Adaptive Equalization for Filtered Multitone Underwater Acoustic Communications
Previous Article in Journal
Optimization of Ring Laser Gyroscope Bias Compensation Algorithm in Variable Temperature Environment
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Complex MIMO RBF Neural Networks for Transmitter Beamforming over Nonlinear Channels

by
Kayol Soares Mayer
*,
Jonathan Aguiar Soares
and
Dalton Soares Arantes
Department of Communications, School of Electrical and Computer Engineering, University of Campinas, Campinas 13083-852, Brazil
*
Author to whom correspondence should be addressed.
Sensors 2020, 20(2), 378; https://doi.org/10.3390/s20020378
Submission received: 3 December 2019 / Revised: 30 December 2019 / Accepted: 7 January 2020 / Published: 9 January 2020
(This article belongs to the Section Intelligent Sensors)

Abstract

:
The use of beamforming for efficient transmission has already been successfully implemented in practical systems and is absolutely necessary to even further increase spectral and energy efficiencies in some configurations of the next-generation wireless systems and for low earth orbit satellites. A remarkable capacity increase is then achieved and spectral congestion is minimized. In this context, this article proposes a novel complex multiple-input multiple-output radial basis function neural network (CMM-RBF) for transmitter beamforming, based on the phase transmittance radial basis function neural network (PTRBFNN). The proposed CMM-RBF is compared with the least mean square (LMS) algorithm for beamforming with six dipoles arranged in a uniform and circular array and with 16 dipoles in a 2D-grid array. Simulation results show that the proposed solution presents lower steady-state mean squared error, faster convergence rate and enhanced half-power beamwidth (HPBW) when compared with the LMS algorithm in a nonlinear scenario.

1. Introduction

In the last decades, artificial neural networks (ANNs) have attracted much attention, performing specific tasks in different applications, such as clustering, prediction, classification, pattern recognition, machine learning and artificial intelligence. As ANNs are mainly designed to mimic the human brain, a considerable number of approaches only handle real-valued signals [1,2,3,4]. However, some engineering problems are intrinsically dependent on complex-valued signals (e.g., channel equalization and beamforming). In order to circumvent this limitation, ANN algorithms based on complex numbers have already been proposed for some applications, such as channel equalization [5,6,7] and adaptive beamforming for wireless receivers [8,9,10,11].
Digital communication systems over wireless channels may suffer severe signal distortions due to multipath propagation, additive white Gaussian noise (AWGN) [12,13], Doppler effects and, not infrequently, nonlinearities at the receiver front-end and at the transmitter high power amplifier [5,7]. Since nonlinear impairments usually worsen the performance of linear channel equalizers, nonlinearities in the channel are better dealt with using robust nonlinear equalizers [5,7,14]. In this context, based on the phase transmittance radial basis function neural network (PTRBFNN) equalizer [7], a blind fuzzy controller algorithm was applied to increase the concurrent neural network equalizer (CNNE) convergence speed and decrease the residual mean squared error (MSE) [5]. Another equally important technique is the butterfly neural equalizer (BNE) which, when applied to optical communications with two-dimensional digital modulation, is able to mitigate nonlinearities in the photo-electric converters and simultaneously compensate chromatic and polarization mode dispersions [6].
For beamforming at the receiver, ANN architectures such as the bi-dimensional neural beamformer with joint error (BNB-JE), the butterfly neural beamformer (NB-Butterfly), and the beamformer neural network (BNN) are potential algorithms for improving receiver performance [9,10,11]. On the other hand, beamforming for efficient transmission is necessary to increase spectral and energy efficiencies in some configurations of the next-generation wireless systems [15]. In addition, for low earth orbit (LEO) satellites, the use of beamforming techniques is economically important to mainly reduce the power consumption and to increase the data throughput [16,17]. Also, nonlinear beamfoming algorithms can play a key role for band-limited systems employing nonlinear power amplifiers, as in satellite communication systems [18].
In communication systems with beamforming at the transmitter, arrays with several antennas are applied to focus the electromagnetic signal towards the desired receiver [19]. These arrays can be controlled by three different architectures: digital, analog, and hybrid. Usually, in digital beamforming the channel knowledge is required at the transmitter and some precoding technique is necessary [20,21,22,23,24,25], impacting the hardware with very high computational complexity and energy consumption. [26]. On the other hand, in analog beamforming the RF signals are manipulated by means of controlling phase shifters and/or variable gain amplifiers (VGAs). Although this architecture has low computational complexity and power consumption, it is less flexible and presents inferior results when compared with digital beamforming [26,27].
In the context of LEO satellites, the use of the classical digital beamforming techniques [20,21,22,23,24,25] is prohibitive due to the power and computational complexity constraints, which is why analog beamforming techniques are employed in this area [16,17]. However, by means of a digital beamforming without channel knowledge and precoding, as proposed here, it is possible to perform a low power digital architecture which is more flexible than the analog one. This problem can be modeled as a set of electric currents whose phases and amplitudes are modulated in such a way that the antenna radiation pattern points to the correct direction. A useful method to determine the beamformer electric currents is via the least mean square (LMS) algorithm. In this linear method, the filter weights, which represent the array of electric currents, are updated by a convex cost function [28] to minimize the error between the obtained and the desired radiation patterns [29]. However, LEO satellites frequently operate with high power amplifiers which suffer severe signal distortion due to the nonlinearities [18]. This nonlinear scenario reduces the LMS performance because of its linear design.
Differently from the LMS, neural networks can operate like nonlinear filters [30]. The nonlinear structure of a neural network is modeled by nonlinear activation functions in multilayer perceptrons (MLPs) or by Gaussian neurons in radial basis function neural networks (RBFNNs) [30]. The RBFNNs Gaussian neurons have two free parameters, namely the Gaussian centers and the variances. Besides, there is a linear free vector parameter of weights, which linearly weighs the output of the neurons to yield the network output [7]. Via these three free parameters, RBFNNs can represent high-order nonlinear spaces without the necessity of increasing the number of layers, reducing its complexity in comparison with deep neural networks. Although artificial neural networks have been employed for beamforming, the ANN architectures presented in the literature are unfeasible for the proposed application, since they are designed for beamforming in receiver devices or require channel information and/or precoding.
In such context, this article proposes a novel architecture of RBFNN, based on a complex multiple-input multiple-output (MIMO) RBFNN (CMM-RBF) for beamforming transmitters, in contrast to [9,10,11] which are designed to beamforming receivers. The proposed system applies a MIMO variation of the multiple-input single-output (MISO) phase transmittance RBFNN (PTRBFNN) [7] to a beamforming structure to generate a unified nonlinear solution. The PTRBFNN model was chosen due its lower computational complexity in comparison with deep neural networks, and due to its important role in avoiding any phase invariance at the output of the neurons in comparison with a complex RBFNN [7]. Results show that the proposed architecture achieves enhanced half-power beamwidth (HPBW), faster convergence rate and lower steady-state mean squared error (MSE) when compared with LMS beamforming in a nonlinear scenario.
The remainder of this article is organized as follows. In Section 2, a mathematical modeling is described for a general arrangement of antennas. The LMS algorithm and the proposed complex MIMO radial basis function for beamforming are presented in Section 3 and Section 4, respectively. In Section 5, simulation results of the CMM-RBF are compared to results obtained by LMS, considering half-power beamwidth (HPBW), steady-state mean squared error, and convergence rate in a nonlinear scenario. Conclusions are discussed in Section 6.

2. Antenna Array Modeling

In a transmitter, when operating with an antenna array with P dipoles of length l and an arrangement of Q sensors around the antenna array, the matrix of steering vectors Ψ = [ ψ 1 T ψ 2 T ψ Q T ] T is
Ψ = ζ V     C Q × P ,
in which [ · ] T is the transpose operator. The qth steering vector ψ q C P × 1 expresses the radiation pattern towards the qth sensor. The matrix of relative intensity of the electric field ζ R Q × Q is
ζ = ζ 1 0 0 0 ζ 2 0 0 0 ζ Q .
The qth element of the main diagonal of ζ is
ζ q = cos ( π l cos ( θ q ) λ 1 ) cos ( π l λ 1 ) sin ( θ q ) ,
in which λ = c / f is the signal wavelength, c = 299 , 792 , 458 m/s is the speed of light in vacuum, f is the frequency of the transmitted signal, and θ q is the zenith angle of the qth sensor. Figure 1 presents the angular position of the qth sensor ( θ q , ω q ) and the related radiation pattern ( d q ), for any arrangement of dipoles.
The matrix V = [ v 1 T v 2 T v Q T ] T C Q × P is
V = exp ( 𝚥 2 π λ 1 Ω C ) ,
where exp ( · ) is the scalar exponential function, C R 3 × P is the matrix of Cartesian coordinates (x, y, z) of the dipoles:
C = x 1 x 2 x P y 1 y 2 y P z 1 z 2 z P ,
and Ω R Q × 3 is the sensors matrix of angular position:
Ω = sin θ 1 cos ω 1 sin θ 1 sin ω 1 cos θ 1 sin θ 2 cos ω 2 sin θ 2 sin ω 2 cos θ 2 sin θ Q cos ω Q sin θ Q sin ω Q cos θ Q ,
in which ω r is the azimuth angle of the r t h sensor.
Note that this modeling is applicable for any array setup in three dimensions, taking into account the matrix of Cartesian coordinates Equation (2) and the sensors matrix of angular position Equation (3). This Section was based on [31] (Chapter VI), in which the array equations are presented in a generalized matrix structure.

3. Least Mean Square Algorithm for Beamforming

Considering a beamforming transmission with P antennas and an arrangement of Q sensors around the antenna array, then the vector of radiation pattern g towards the sensors is given by
g = Ψ i     C Q × 1 ,
where i C P × 1 is the vector of antenna electric currents. Figure 2 presents the LMS architecture for beamforming.
In order to control the array boresight, it is chosen a set of radiation conditions d C Q × 1 , verified by Q sensors, which well describes the desired radiation pattern. Thus, with d , g and ψ , the LMS algorithm can be used to estimate i to the qth sensor at the uth training epoch by the minimization of the following cost function:
J q [ u ] = 1 2 | d q g q [ u ] | 2 ,
where | · | stands for absolute value and ( d q , g q ) are the qth target components of ( d , g ) .
Thus, by means of the steepest descent algorithm, the update of the pth electric current of the LMS algorithm, to the qth steering vector, is given by:
i p [ u ] = i p [ u ] η l i J q [ u ] ,
in which η l is the LMS adaptive step and i is the complex gradient operator of i p .
Applying the complex gradient operators ( i ) to Equation (4) yields:
i p [ u ] = i p [ u ] + η l ϵ q [ u ] ψ q , p * ,
where [ · ] * denotes the complex conjugate operator and ϵ q [ u ] = d q g q [ u ] is the instantaneous error for the qth sensor at the uth training epoch. Finally, generalizing Equation (6) for i [ u ] :
i [ u ] = i [ u ] + η l ϵ q [ u ] ψ q * .
In this LMS algorithm, each training epoch is composed of Q updates over i , and at the beginning of each training epoch i [ u ] i [ u 1 ] . However, if u = 0 , then the vector of currents starts with i [ 0 ] 0 + 𝚥 0 .

4. Complex MIMO Radial Basis Function Neural Network for Beamforming

As in the LMS beamforming technique, the input signal to the CMM-RBF algorithm is the set of steering vectors of Ψ , as shown in Figure 3. The CMM-RBF architecture, with N neurons, has three free parameters: the matrix of synaptic weights W C P × N , the matrix of center vectors Γ C N × P and the vector of variances σ 2 C N × 1 . Besides the fact that the CMM-RBF is an extension of the PTRBFNN for multiple outputs, the key difference between both architectures is the linear layer which relates the obtained vector of electric currents with the desired radiation pattern.
The output vector of electric currents is then given by
i [ u ] = W [ u ] ϕ [ u ] .
Following the complex-valued radial basis function presented in [7], the nth neuron output of the CMM-RBF ( ϕ n ), for the qth steering vector of Ψ , is
ϕ n = exp Re { ψ q } Re { γ n } 2 2 Re { σ n 2 } + 𝚥 exp Im { ψ q } Im { γ n } 2 2 Im { σ n 2 } ,
where · 2 is the operator which returns the Euclidean norm of its argument and Re { · } and Im { · } are the respective real and imaginary parts of their arguments. Additionally, as shown in Figure 3, the output of the neurons can be represented by the vector ϕ = [ ϕ 1 ϕ 2 ϕ N ] T C N × 1 . This kernel partitioning into real and imaginary components has an important role in avoiding any phase invariance at the output of the neurons. As the steering vector phase is important to define the electric currents to the desired boresight, a complex RBFNN is not suitable for this application. As addressed in [7], the kernel of the complex RBFNN is not partitioned into real and imaginary parts, which implies that the Euclidean norm eliminates the phase component of the input signal. Consequently, a complex RBFNN is only suitable for phase independent systems.
Thus, by means of the steepest descent algorithm, the update of the CMM-RBF free parameters, to the qth steering vector, is given by:
w p , n [ u ] = w p , n [ u ] η w w J q [ u ] ,
γ n [ u ] = γ n [ u ] η γ γ J q [ u ] ,
σ n 2 [ u ] = σ n 2 [ u ] η σ σ J q [ u ] ,
in which η w , η γ and η σ are the adaptive steps of w p , n , γ n and σ n 2 , respectively. Also, w , γ and σ are, respectively, the complex gradient operators of w p , n , γ n and σ n 2 .
The CMM-RBF cost function is the same utilized in the LMS algorithm Equation (4). Applying the complex gradient operators ( w , γ and σ ) to Equation (4) yields:
w J q [ u ] = ϵ q [ u ] ψ q , p * ϕ n [ u ] * ,
γ J q [ u ] = ξ n [ u ] ( Re { α n [ u ] } + Im { α n [ u ] } ) + ξ n [ u ] * ( Re { α n [ u ] } Im { α n [ u ] } ) ] ,
σ J q [ u ] = ξ n [ u ] ( Re { β n [ u ] } + Im { β n [ u ] } ) + ξ n [ u ] * ( Re { β n [ u ] } Im { β n [ u ] } ) ] ,
in which ϵ q [ u ] = d q g q [ u ] , as in the LMS algorithm. The synaptic transmittance of the nth neuron ξ n [ u ] is
ξ n [ u ] = ϵ q [ u ] * p = 1 P ψ q , p w p , n [ u ] .
The nth element of the vector of weighted kernel β [ u ] = [ β 1 [ u ] β 2 [ u ] β N [ u ] ] T C N × 1 is:
β n [ u ] = Re { ϕ n [ u ] } Re { ψ q } Re { γ n [ u ] } 2 2 Re { σ n 2 [ u ] } 2 + 𝚥 Im { ϕ n [ u ] } Im { ψ q } Im { γ n [ u ] } 2 2 Im { σ n 2 [ u ] } 2 .
Similarly, the matrix of weighted centers is represented as A [ u ] = [ α 1 T [ u ] α 2 T [ u ] α N T [ u ] ] T C N × P , where the nth vector of A [ u ] is
α n [ u ] = Re { ϕ n [ u ] } ( Re { ψ q } Re { γ n [ u ] } ) Re { σ n 2 [ u ] } + 𝚥 Im { ϕ n [ u ] } ( Im { ψ q } Im { γ n [ u ] } ) Im { σ n 2 [ u ] } .
Finally, applying Equation (9) into Equation (8), the update of the CMM-RBF free parameters for beamforming is expressed as follows:
w p , n [ u ] = w p , n [ u ] + η w ϵ q [ u ] ψ q , p * ϕ n [ u ] * ,
γ n [ u ] = γ n [ u ] + η γ [ ξ n [ u ] ( Re { α n [ u ] } + Im { α n [ u ] } ) + ξ n [ u ] * ( Re { α n [ u ] } Im { α n [ u ] } ) ] ,
σ n 2 [ u ] = σ n 2 [ u ] + η σ [ ξ n [ u ] ( Re { β n [ u ] } + Im { β n [ u ] } ) + ξ n [ u ] * ( Re { β n [ u ] } Im { β n [ u ] } ) ] .
Generalizing Equation (13) to matrix structures, results in:
W [ u ] = W [ u ] + η w ϵ q [ u ] ψ q * ϕ [ u ] H ,
Γ [ u ] = Γ [ u ] + η γ [ Ξ [ u ] ( Re { A [ u ] } + Im { A [ u ] } ) + Ξ [ u ] H ( Re { A [ u ] } Im { A [ u ] } ) ] ,
σ 2 [ u ] = σ 2 [ u ] + η σ [ Ξ [ u ] ( Re { β [ u ] } + Im { β [ u ] } ) + Ξ [ u ] H ( Re { β [ u ] } Im { β [ u ] } ) ] .
in which [ · ] H denotes the transpose conjugate operator and Ξ [ u ] is the diagonal matrix of synaptic transmittance:
Ξ [ u ] = ξ 1 [ u ] 0 0 0 ξ 2 [ u ] 0 0 0 ξ N [ u ]     C N × N .
Although Equation (14) minimizes the error between the obtained and the desired radiation patterns, as the neurons are dependent on exponential functions, a risk of instability is assumed if the exponential argument is positive. In order to circumvent this issue, based on Theorem A1 (Appendix A), the real and imaginary parts of each scalar component of the vector of variances is lower bounded by the limit μ > 0 , which, consequently, bounds the real and imaginary parts of the neurons output from 0 to 1.
As in the LMS algorithm, each training epoch is composed of Q updates, due to the Q sensors, and at the beginning of each training epoch W [ u ] W [ u 1 ] , Γ [ u ] Γ [ u 1 ] , and σ 2 [ u ] σ 2 [ u 1 ] . However, for u = 0 , the CMM-RBF free parameters are initialized following some criterion defined by the user (e.g., based on the probability distribution of the input data).

5. Simulations and Discussion

The proposed complex MIMO RBF architecture was evaluated and compared with the LMS algorithm for beamforming at 2.4 GHz. Simulations consider two array arrangements with dipoles of equal length l = 0.5 λ = 6.25 cm and separation distances s d = 0.25 λ = 3.12 cm: (1) uniform and circular array (UCA) with P = 6 dipoles; and (2) 2D-grid array (2D-GA) with P = 16 dipoles. For each array arrangement, the matrix of steering vectors is computed via Algorithm A1 (Appendix B). The vectors of angular position θ R Q × 1 and ω R Q × 1 have their number of components (Q) defined by the number of sensors, which is selected by the user in a manner to well describe the desired radiation pattern. Besides, each component of the desired radiation pattern d is defined between 0 and 1, in which 0 is used for nulls and 1 is used for the maximum value of the radiation pattern.
As practical systems may suffer severe signal distortion due to occurrence of nonlinearities at the transmitter high power amplifier, based on [32], the nonlinearities are introduced as: g q [ u ] = ψ q T ( ρ 1 i [ u ] + ρ 2 i 2 [ u ] + ρ 3 i 3 [ u ] ) , where ρ = [ ρ 1 ρ 2 ρ 3 ] T = [ 0.8 0.3 0.2 ] T is the coefficient vector of nonlinearities, i 2 [ u ] = [ i 1 [ u ] 2 i 2 [ u ] 2 i P [ u ] 2 ] T and i 3 [ u ] = [ i 1 [ u ] 3 i 2 [ u ] 3 i P [ u ] 3 ] T . Note that, if ρ 1 = 1.0 and ρ 2 = ρ 3 = 0.0 , it is the linear case, since g q [ u ] = ψ q T i [ u ] .
The LMS and the CMM-RBF were implemented using Algorithms A2 and A3 (Appendix B), respectively. The vector of electric currents is initialized with i [ 0 ] 0 + 𝚥 0 , for both beamforming techniques. The free parameters of the CMM-RBF are initialized as: W [ 0 ] = 0 + 𝚥 0 ; each center vectors of Γ [ 0 ] starts with an unique steering vector of Ψ divided by 10; and σ 2 [ 0 ] = 5 + 𝚥 5 . Also, as discussed in Section 4, μ = 0.1 to bound the real and imaginary outputs of the neurons of the CMM-RBF. In order to well represent the set of inputs (steering vectors) into the nonlinear space of the proposed neural network, while maintaining a low computational complexity, we have found by trial and error that N = 4 neurons implemented in the CMM-RBF is sufficient. By trial and error, the adaptive steps of the LMS and the CMM-RBF were found to be around η l = 0.1 , η w = 0.7 , η γ = 0.5 , and η σ = 0.5 for the linear and circular arrays. On the other hand, for the 2D-grid array, we found that the adaptive step of the LMS should be reduced to η l = 0.035 . These adaptive steps were chosen to maximize the convergence rate and the HPBW, maintaining the side lobe of the radiation pattern smaller than 20  dB.
The performance of the proposed CMM-RBF is evaluated against LMS by means of the resulting MSE and HPBW for a specified boresight ω ^ . The MSE is computed considering an average of 500 simulations for each ω ^ . In addition, via the 500 MSE computations, the mean of the normalized radiation patterns were selected to graphically illustrate the HPBW.

5.1. Uniform and Circular Array

In this scheme, six dipoles of equal length are spatially distributed as shown in Figure 4. Both of the presented beamforming techniques share the same Q = 10 restrictions, given by the position of the sensors depicted in Table A1 (Appendix C). Restrictions were generalized for the boresight angle ω ^ .
Figure 5 presents the evolution of the simulated mean squared error (MSE) of the LMS and CMM-RBF algorithms for UCA with ω ^ = 160 ° . As a fast convergence rate characteristic is extremely important for LEO satellites, it is assumed here that convergence for both algorithms is reached when the respective MSE drops below 35 dB. Notice that the MSE for both algorithms decrease similarly up to the second training epoch, after that, the MSE of the CMM-RFB decreases at a much faster rate. It is clear, therefore, that the CMM-RBF achieves a faster convergence rate and delivers a 2.3 dB lower residual MSE (after only three epochs) in comparison with the LMS.
Figure 6 shows the radiation diagram of the LMS and CMM-RBF algorithms for θ = 90 ° and ω ^ = 160 ° . One may note that both algorithms presented similar performance, however, the CMM-RBF enhanced HPBW by 1.12° in comparison with LMS.
In addition, maintaining the same initialization scheme, but varying the boresight angle from 0 ° to 360 ° , in steps of 10 ° , a number of 500 simulations were performed for each boresight. The mean HPBW and MSE of each boresight is used to compute the statistical results presented in Table 1. The proposed CMM-RBF algorithm is able to enhance the HPBW of the antenna arrays by about 4.15° when operating under the same conditions as the LMS, taking into account the nonlinearities of the transmitter power amplifier.

5.2. 2D-Grid Array

In this scheme, 16 dipoles of equal length are spatially distributed in a squared grid, as shown in Figure 7. The Q = 10 restrictions are depicted in Table A2 (Appendix C).
Figure 8 presents the evolution of the simulated mean squared error (MSE) of the LMS and CMM-RBF algorithms for 2D-GA with ω ^ = 160 ° . Convergence is reached when the MSE drops below 35 dB. In this case, diferently from the UCA simulation, the MSE of the CMM-RFB decreases at a faster rate since the first training epoch. At the 3rd epoch, the CMM-RBF delivers a 2.27 dB lower residual MSE in comparison with the LMS.
Figure 9 shows the radiation diagram of the LMS and CMM-RBF algorithms for θ = 90 ° and ω ^ = 160 ° . Both algorithms presented similar performances, however, the CMM-RBF enhanced HPBW by 3.85° in comparison with LMS.
Moreover, varying the boresight angle from 0 ° to 360 ° , in steps of 10 ° , a number of 500 simulations were also performed for each boresight and for both architectures, as in Table 1. Results for the mean and standard deviation are presented in Table 2. As in the UCA, the proposed CMM-RBF algorithm is able to enhance the HPBW of the 2D-grid array by about 4.15° when operating under the same conditions as the LMS, taking into account the nonlinearities of the transmitter power amplifier. In 77.78% of the simulations, the CMM-RBF was able to maintain the side lobes of the radiation pattern smaller than 20 dB; on the other hand, the LMS only achieved this condition for 44.45% of the simulations.

6. Conclusions

This work presented a novel artificial neural network beamforming scheme for wireless transmission affected by nonlinearities of the transmitter power amplifier. The proposed complex MIMO RBFNN (CMM-RBF) is an extension of the PTRBFNN, used for channel equalization, which is able to handle multiple complex-valued outputs, keeping the phase transmittance information. With the proposed architecture it is possible to simultaneously achieve fast convergence rate, lower MSE and enhanced HPBW in comparison with the LMS in nonlinear scenarios.
The performance of the proposed approach was compared with the LMS for beamforming with a uniform and circular array (6 dipoles of equal length) and a 2D-grid array (16 dipoles of equal length), operating at 2.4 GHz. A set of 36 boresight angles was evaluated and from each generated radiation diagram the respective HPBW was obtained; besides, for each boresight the convergence rate was estimated for the minimum number of epochs possible, in order to obtain a faster tracking.
The proposed MIMO artificial neural network architecture proved to be robust, independent of the boresight angle, achieving faster convergence rate in only three training epochs (after crossover with the LMS) and reducing the MSE by about 2 dB when compared with the LMS algorithm. As for the HPBW, the results obtained with the CMM-RBF are 4.15° better than with the LMS. As the nonlinear behavior of the transmitter power amplifier becomes more prominent when the number of antennas is increased and when operating with a more complex architecture (2D-GA), the LMS beamforming presented a poor performance regarding side lobe restrictions, correctly operating in less than half of the cases. Conversely, the CMM-RBF achieved the side lobe restrictions in more than three quarters of the simulations.
The proposed algorithm finds potential applications in some configurations of the next-generation wireless systems and in satellite communications. For LEO satellites, the CMM-RBF can be implemented using low-power graphical processing units (LPGPUs), taking advantage of the neuron’s parallelism. Thus, in the proposed architecture the CMM-RBF can work with low-power consumption, with the ability to handle the distortions of nonlinear power amplifiers while maintaining a fast convergence rate. It should be emphasized that a fast convergence characteristic is extremely important for LEO satellites, since they orbit non-stationarily at low altitudes.

Author Contributions

All the authors have contributed to various degrees to ensure the quality of this work: Conceptualization, K.S.M.; methodology, K.S.M. and J.A.S.; formal analysis, K.S.M., J.A.S. and D.S.A.; writing—original draft preparation, K.S.M.; writing—review and editing, K.S.M., J.A.S. and D.S.A.; supervision, D.S.A. All authors have read and agreed to the published version of the manuscript.

Funding

This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior—Brasil (CAPES)—Finance Code 001.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Theorem A1.
For any two vectors of same size x , y R n × 1 and a scalar z R , a kernel function f ( x , y , z ) exp x y 2 2 z is bounded between 0 and 1, if z > 0 .
Proof. 
By the properties of nonnegativity and definiteness, which state that the norm is always nonnegative (see [33], p. 46), let d = x y 2 0 . Also, let g ( d ) = d 2 / z , where z is a constant scalar. As g ( d ) is continuous for any z > 0 (see [34], p. 43), its natural domain is ( , + ) . In order to define the extremes of the image of g ( d ) it is necessary to assess the extremes of the natural domain and the second derivative test theorem (see [34], p. 186) of g ( d ) . Firstly, we verify the extremes of g ( d ) for z > 0 :
lim d g ( d ) = ,
lim d + g ( d ) = .
We then analyze the first derivative, equating its result to zero:
g ( d ) d = 2 d z = 0 ,
which implies that g ( d ) has only one point of minimum, maximum, or inflection. Due to this, verifying the second derivative of g ( d ) for d = 0 :
2 g ( d ) d 2 = 2 z < 0 ,
thus, d = 0 is the maximum point of g ( d ) , which yields g ( 0 ) = 0 and, consequently, g ( d ) 0 .
Finally, with h ( g ) exp ( g ( d ) ) , we can verify the image of h ( g ) :
lim g ( d ) h ( g ( d ) ) = 0 ,
lim g ( d ) 0 h ( g ( d ) ) = 1 ,
which yields h ( g ( d ) ) ( 0 , 1 ] d R and z > 0 . □

Appendix B

Algorithm A1 Steering vectors computation.
1:
procedureComputeSteeringVectors( C , θ , ω , Q, f, l)
2:
   c 299 , 792 , 458 m/s
3:
   λ c / f
4:
   Ω compute Equation (3) with θ and ω
5:
   V exp ( 𝚥 2 π λ 1 Ω C )
6:
  for q 1 , Q do
7:
     ζ q sin ( θ q ) 1 [ cos ( π l cos ( θ q ) λ 1 ) cos ( π l λ 1 ) ]
8:
  end for
9:
   ζ compute Equation (1) with ζ 1 ζ 2 ζ Q
10:
   Ψ ζ V
11:
  return Ψ
12:
end procedure
Algorithm A2 LMS for beamforming.
1:
procedureLMS( i [ 0 ] , Ψ , d , η l , ρ , Q, U)
2:
   for u 1 , U do
3:
    i [ u ] i [ u 1 ]
4:
   for q 1 , Q do
5:
      i 2 [ u ] [ i 1 [ u ] 2 i 2 [ u ] 2 i P [ u ] 2 ] T
6:
      i 3 [ u ] [ i 1 [ u ] 3 i 2 [ u ] 3 i P [ u ] 3 ] T
7:
      g q [ u ] ψ q T ( ρ 1 i [ u ] + ρ 2 i 2 [ u ] + ρ 3 i 3 [ u ] )
8:
      ϵ q [ u ] d q g q [ u ]
9:
      i [ u ] i [ u ] + η l ϵ q [ u ] ψ q *
10:
   end for
11:
  end for
12:
  return i [ U ]
13:
end procedure
Algorithm A3 CMM-RBF for beamforming.
1:
procedureCMM-RBF( i [ 0 ] , W [ 0 ] , Γ [ 0 ] , σ 2 [ 0 ] , η w , η γ , η σ , μ , ρ , Ψ , d , Q, U, N)
2:
  for u 1 , U do
3:
     i [ u ] , W [ u ] , Γ [ u ] , σ 2 [ u ] i [ u 1 ] , W [ u 1 ] , Γ [ u 1 ] , σ 2 [ u 1 ] , respectively.
4:
    for q 1 , Q do
5:
       i 2 [ u ] [ i 1 [ u ] 2 i 2 [ u ] 2 i P [ u ] 2 ] T
6:
       i 3 [ u ] [ i 1 [ u ] 3 i 2 [ u ] 3 i P [ u ] 3 ] T
7:
       g q [ u ] ψ q T ( ρ 1 i [ u ] + ρ 2 i 2 [ u ] + ρ 3 i 3 [ u ] )
8:
       ϵ q [ u ] d q g q [ u ]
9:
      for n 1 , N do
10:
        ϕ n [ u ] compute Equation (7) with γ n [ u ] , σ n 2 [ u ] and ψ r [ u ]
11:
        ξ n [ u ] compute Equation (10) with ψ q , W [ u ] , and ϵ q [ u ]
12:
        β n [ u ] compute Equation (11) with ψ q , γ n [ u ] , ϕ n [ u ] , and σ n 2
13:
        α n [ u ] compute Equation (12) with ψ q , γ n [ u ] , ϕ n [ u ] , and σ n 2
14:
      end for
15:
       Ξ [ u ] compute Equation (15) with ξ 1 [ u ] ξ 2 [ u ] ξ N [ u ]
16:
       W [ u ] W [ u ] + η w ϵ q [ u ] ψ q * ϕ [ u ] H
17:
       Γ [ u ] Γ [ u ] + η γ [ Ξ [ u ] ( Re { A [ u ] } + Im { A [ u ] } ) + Ξ [ u ] H ( Re { A [ u ] } Im { A [ u ] } ) ]
18:
       σ 2 [ u ] σ 2 [ u ] + η σ [ Ξ [ u ] ( Re { β [ u ] } + Im { β [ u ] } ) + Ξ [ u ] H ( Re { β [ u ] } Im { β [ u ] } ) ]
19:
      for n 1 , N do
20:
        if Re { σ n 2 [ u ] } < μ then
21:
           Re { σ n 2 [ u ] } μ
22:
        end if
23:
        if Im { σ n 2 [ u ] } < μ then
24:
           Im { σ n 2 [ u ] } μ
25:
        end if
26:
      end for
27:
       i [ u ] W [ u ] ϕ q [ u ]
28:
    end for
29:
  end for
30:
  return i [ U ]
31:
end procedure

Appendix C

Table A1. Beamformer restrictions of the uniform and circular array (UCA) with 6 dipoles: ω ^ is the generalized boresight.
Table A1. Beamformer restrictions of the uniform and circular array (UCA) with 6 dipoles: ω ^ is the generalized boresight.
Sensor (r)Zenith θ r Azimuth ω r Desired Radiation Pattern d r
190° ω ^ 1.0
290° ω ^ + 180°0.0
390° ω ^ + 50°0.5
490° ω ^ 50°0.5
590° ω ^ + 60°0.2
690° ω ^ 60°0.2
790° ω ^ + 115°0.0
890° ω ^ 115°0.0
990° ω ^ + 155°0.0
1090° ω ^ 155°0.0
Table A2. Beamformer restrictions of the 2D-grid array (2D-GA) with 16 dipoles: ω ^ is the generalized boresight.
Table A2. Beamformer restrictions of the 2D-grid array (2D-GA) with 16 dipoles: ω ^ is the generalized boresight.
Sensor (r)Zenith θ r Azimuth ω r Desired Radiation Pattern d r
190° ω ^ 1.0
290° ω ^ + 180°0.0
390° ω ^ + 30°0.5
490° ω ^ 30°0.5
590° ω ^ + 40°0.2
690° ω ^ 40°0.2
790° ω ^ + 70°0.0
890° ω ^ 70°0.0
990° ω ^ + 105°0.0
1090° ω ^ 105°0.0

References

  1. Wohwe Sambo, D.; Yenke, B.O.; Förster, A.; Dayang, P. Optimized Clustering Algorithms for Large Wireless Sensor Networks: A Review. Sensors 2019, 19, 322. [Google Scholar] [CrossRef] [Green Version]
  2. Daoud, M.; Mayo, M. A Survey of Neural Network-Based Cancer Prediction Models from Microarray Data. Artif. Intell. Med. 2019, 97, 204–214. [Google Scholar] [CrossRef] [PubMed]
  3. Sultana, N.; Chilamkurti, N.; Alhadad, R. Survey on SDN Based Network Intrusion Detection System Using Machine Learning Approaches. Peer-to-Peer Netw. Appl. 2019, 12, 493–501. [Google Scholar] [CrossRef]
  4. Nassif, A.B.; Shahin, I.; Attili, I.; Azzeh, M.; Shaalan, K. Speech Recognition Using Deep Neural Networks: A Systematic Review. IEEE Access 2019, 7, 19143–19165. [Google Scholar] [CrossRef]
  5. Mayer, K.S.; De Oliveira, M.S.; Müller, C.; De Castro, F.C.; De Castro, M.C. Blind Fuzzy Adaptation Step Control for a Concurrent Neural Network Equalizer. Wirel. Commun. Mob. Comput. 2019, 2019, 9082362. [Google Scholar] [CrossRef]
  6. De Sousa, T.F.B.; Fernandes, M.A.C. Butterfly Neural Equalizer Applied to Optical Communication Systems with Two-Dimensional Digital Modulation. Opt. Express 2018, 26, 30837–30850. [Google Scholar] [CrossRef]
  7. Loss, D.; De Castro, M.C.F.; Franco, P.R.G.; De Castro, F.C.C. Phase Transmittance RBF Neural Networks. Electron. Lett. 2007, 43, 882–884. [Google Scholar] [CrossRef]
  8. Savitha, R.; Suresh, S.; Sundararajan, N. A Fully Complex-Valued Radial Basis Function Network and Its Learning Algorithm. Int. J. Neural Syst. 2009, 19, 253–267. [Google Scholar] [CrossRef]
  9. De Sousa, T.F.B.; Fernandes, M.A.C. Butterfly Neural Filter Applied to Beamforming. IEEE Access 2019, 7, 96455–96469. [Google Scholar] [CrossRef]
  10. De Sousa, T.F.B.; Arantes, D.S.; Fernandes, M.A.C. Adaptive Beamforming Applied to OFDM Systems. Sensors 2018, 18, 3558. [Google Scholar] [CrossRef] [Green Version]
  11. Xia, W.; Zheng, G.; Zhu, Y.; Zhang, J.; Wang, J.; Petropulu, A.P. Deep Learning Based Beamforming Neural Networks in Downlink MISO Systems. In Proceedings of the 2019 IEEE International Conference on Communications Workshops (ICC Workshops), Shanghai, China, 20–24 May 2019. [Google Scholar]
  12. Haykin, S. Digital Communication Systems, 1st ed.; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2013; p. 87. [Google Scholar]
  13. Proakis, J.G.; Salehi, M. Digital Communications, 5th ed.; McGraw-Hill: New York, NY, USA, 2008; p. 8. [Google Scholar]
  14. Mayer, K.S.; Müller, C.; de Castro, M.C.F.; de Castro, F.C.C. Nonlinear Modified Concurrent Equalizer. J. Commun. Inf. Syst. 2019, 34, 201–205. [Google Scholar] [CrossRef] [Green Version]
  15. Agiwal, M.; Saxena, N.; Roy, A. Towards Connected Living: 5G Enabled Internet of Things (IoT). IETE Tech. Rev. 2019, 36, 190–202. [Google Scholar] [CrossRef]
  16. Ivanov, A.; Stoliarenko, M.; Kruglik, S.; Novichkov, S.; Savinov, A. Dynamic Resource Allocation in LEO Satellite. In Proceedings of the 15th InternationalWireless Communications & Mobile Computing Conference (IWCMC), Tangier, Morocco, 24–28 June 2019; pp. 930–935. [Google Scholar]
  17. Ivanov, A.; Stoliarenko, M.; Savinov, A.; Novichkov, S. Physical Layer Representation in LEO Satellite with a Hybrid Multi-Beamforming. In Proceedings of the 15th International Wireless Communications & Mobile Computing Conference (IWCMC), Tangier, Morocco, 24–28 June 2019; pp. 140–145. [Google Scholar]
  18. Amat, A.G.I.; Nour, C.A.; Douillard, C. Serially Concatenated Continuous Phase Modulation for Satellite Communications. IEEE Trans. Wirel. Commun. 2009, 8, 3260–3269. [Google Scholar] [CrossRef]
  19. Anderson, M.G.; Thielens, A.; Wielandt, S.; Niknejad, A.; Rabaey, J. Ultralow-Power Radio Frequency Beamformer Using Transmission-Line Transformers and Tunable Passives. IEEE Microw. Wirel. Compon. Lett. 2019, 29, 158–160. [Google Scholar] [CrossRef]
  20. Love, D.J.; Heath, R.W.; Strohmer, T. Grassmannian Beamforming for Multiple-Input Multiple-Output Wireless systems. IEEE Trans. Inf. Theory 2003, 49, 2735–2747. [Google Scholar] [CrossRef] [Green Version]
  21. Lu, L.; Li, G.Y.; Swindlehurst, A.L.; Ashikhmin, A.; Zhang, R. An Overview of Massive MIMO: Benefits and Challenges. IEEE J. Sel. Top. Signal Process. 2014, 8, 742–758. [Google Scholar] [CrossRef]
  22. Chung, J.; Hwang, C.S.; Kim, K.; Kim, Y.K. A Random Beamforming Technique in MIMO systems Exploiting Multiuser Diversity. IEEE J. Sel. Areas Commun. 2003, 21, 848–855. [Google Scholar] [CrossRef]
  23. Larsson, E.G.; Edfors, O.; Tufvesson, F.; Marzetta, T.L. Massive MIMO for Next Generation Wireless Systems. IEEE J. Sel. Areas Commun. 2014, 52, 186–195. [Google Scholar] [CrossRef] [Green Version]
  24. Zhang, S.; Guo, C.; Wang, T.; Zhang, W. On-Off Analog Beamforming with Per-Antenna Power Constraint. In Proceedings of the 2017 IEEE 85th Vehicular Technology Conference (VTC Spring), Sydney, Australia, 4–7 June 2017. [Google Scholar]
  25. Zhang, Y.; Gao, J.; Yang, L. MRT Precoding in Downlink Multi-User MIMO Systems. EURASIP J. Wirel. Commun. 2016, 2016, 241. [Google Scholar] [CrossRef] [Green Version]
  26. Reil, M.; Lloyd, G. Millimeter-Wave Beamforming: Antenna Array Design Choices & Characterization; Technical Report; Rhode & Schwarz: Munich, Germany, 2016. [Google Scholar]
  27. Roh, W.; Seol, J.Y.; Park, J.; Lee, B.; Lee, J.; Kim, Y.; Cho, J.; Cheun, K.; Aryanfar, F. Millimeter-Wave Beamforming as an Enabling Technology for 5G Cellular Communications: Theoretical Feasibility and Prototype Results. IEEE Commun. Mag. 2014, 52, 106–113. [Google Scholar] [CrossRef]
  28. Mandic, D.P.; Goh, V.S.L. Complex Valued Nonlinear Adaptive Filters: Noncircularity, Widely Linear and Neural Models, 1st ed.; John Wiley & Sons, Inc.: Chippenham, WT, UK, 2009; p. 73. [Google Scholar]
  29. Godara, L.C. Improved LMS Algorithm for Adaptive Beamforming. IEEE Trans. Antennas Propag. 1990, 38, 1631–1635. [Google Scholar] [CrossRef]
  30. Haykin, S. Neural Networks and Learning Machines, 3rd ed.; Prentice-Hall: Upper Saddle River, NJ, USA, 2009; pp. 2, 236–237. [Google Scholar]
  31. Balanis, C.A. Antenna Theory: Analysis and Design, 4th ed.; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2016; pp. 285–367. [Google Scholar]
  32. Al-Awami, A.T.; Zerguine, A.; Cheded, L.; Zidouri, A.; Saif, W. A New Modified Particle Swarm Optimization Algorithm for Adaptive Equalization. Digit. Signal Process. 2011, 21, 195–207. [Google Scholar] [CrossRef]
  33. Boyd, S.; Vandenberghe, L. Introduction to Applied Linear Algebra: Vectors, Matrices, and Least Squares, 1st ed.; Cambridge University Press: Cambridge, UK, 2018; p. 46. [Google Scholar]
  34. Anton, H.; Bivens, I.; Davis, S. Calculus: Early Transcendentals, 11st ed.; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2015; pp. 43, 186. [Google Scholar]
Figure 1. Representation of the angular position of the qth sensor around the array of antennas.
Figure 1. Representation of the angular position of the qth sensor around the array of antennas.
Sensors 20 00378 g001
Figure 2. Least mean square (LMS) architecture for beamforming.
Figure 2. Least mean square (LMS) architecture for beamforming.
Sensors 20 00378 g002
Figure 3. Complex multiple-input multiple-output radial basis function neural network architecture for beamforming.
Figure 3. Complex multiple-input multiple-output radial basis function neural network architecture for beamforming.
Sensors 20 00378 g003
Figure 4. Uniform and circular array (UCA) with six dipoles, where l is the dipole length and s d is the distance between dipoles.
Figure 4. Uniform and circular array (UCA) with six dipoles, where l is the dipole length and s d is the distance between dipoles.
Sensors 20 00378 g004
Figure 5. Mean squared error (MSE) of the complex multiple-input multiple-output radial basis function neural network (CMM-RBF) and least mean square (LMS) algorithms for three training epochs: uniform and circular array (UCA) with target ω ^ = 160°.
Figure 5. Mean squared error (MSE) of the complex multiple-input multiple-output radial basis function neural network (CMM-RBF) and least mean square (LMS) algorithms for three training epochs: uniform and circular array (UCA) with target ω ^ = 160°.
Sensors 20 00378 g005
Figure 6. Normalized radiation pattern of the complex multiple-input multiple-output radial basis function neural network (CMM-RBF) and least mean square (LMS) algorithms for the uniform and circular array (UCA) with θ = 90° and ω ^ = 160°.
Figure 6. Normalized radiation pattern of the complex multiple-input multiple-output radial basis function neural network (CMM-RBF) and least mean square (LMS) algorithms for the uniform and circular array (UCA) with θ = 90° and ω ^ = 160°.
Sensors 20 00378 g006
Figure 7. 2D-grid array (2D-GA) with 16 dipoles, where l is the dipole length and s d is the distance between dipoles.
Figure 7. 2D-grid array (2D-GA) with 16 dipoles, where l is the dipole length and s d is the distance between dipoles.
Sensors 20 00378 g007
Figure 8. Mean squared error (MSE) of the complex multiple-input multiple-output radial basis function neural network (CMM-RBF) and least mean square (LMS) algorithms for three training epochs: 2D-grid array (2D-GA) with target ω ^ = 160°.
Figure 8. Mean squared error (MSE) of the complex multiple-input multiple-output radial basis function neural network (CMM-RBF) and least mean square (LMS) algorithms for three training epochs: 2D-grid array (2D-GA) with target ω ^ = 160°.
Sensors 20 00378 g008
Figure 9. Normalized radiation pattern of the complex multiple-input multiple-output radial basis function neural network (CMM-RBF) and least mean square (LMS) algorithms for the 2D-grid array (2D-GA) with θ = 90° and ω ^ = 160°.
Figure 9. Normalized radiation pattern of the complex multiple-input multiple-output radial basis function neural network (CMM-RBF) and least mean square (LMS) algorithms for the 2D-grid array (2D-GA) with θ = 90° and ω ^ = 160°.
Sensors 20 00378 g009
Table 1. Half-power beamwidth (HPBW) and mean squared error (MSE) statistical results for the uniform and circular array (UCA), varying the boresight angle from 0° to 360°.
Table 1. Half-power beamwidth (HPBW) and mean squared error (MSE) statistical results for the uniform and circular array (UCA), varying the boresight angle from 0° to 360°.
Algorithmmean HPBWstd HPBWmean MSEstd MSE
LMS81.00°0.52°−33.94 dB0.15 dB
CMM-RBF76.85°0.64°−36.05 dB0.37 dB
Table 2. Half-power beamwidth (HPBW) and mean squared error (MSE) statistical results for the 2D-grid array, varying the boresight angle from 0° to 360°.
Table 2. Half-power beamwidth (HPBW) and mean squared error (MSE) statistical results for the 2D-grid array, varying the boresight angle from 0° to 360°.
AlgorithmMean HPBWStd HPBWMean MSEStd MSE
LMS52.57°0.88°−35.36 dB1.28 dB
CMM-RBF48.42°0.33°−37.80 dB1.04 dB

Share and Cite

MDPI and ACS Style

Soares Mayer, K.; Aguiar Soares, J.; Soares Arantes, D. Complex MIMO RBF Neural Networks for Transmitter Beamforming over Nonlinear Channels. Sensors 2020, 20, 378. https://doi.org/10.3390/s20020378

AMA Style

Soares Mayer K, Aguiar Soares J, Soares Arantes D. Complex MIMO RBF Neural Networks for Transmitter Beamforming over Nonlinear Channels. Sensors. 2020; 20(2):378. https://doi.org/10.3390/s20020378

Chicago/Turabian Style

Soares Mayer, Kayol, Jonathan Aguiar Soares, and Dalton Soares Arantes. 2020. "Complex MIMO RBF Neural Networks for Transmitter Beamforming over Nonlinear Channels" Sensors 20, no. 2: 378. https://doi.org/10.3390/s20020378

APA Style

Soares Mayer, K., Aguiar Soares, J., & Soares Arantes, D. (2020). Complex MIMO RBF Neural Networks for Transmitter Beamforming over Nonlinear Channels. Sensors, 20(2), 378. https://doi.org/10.3390/s20020378

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop