1. Introduction
Millimeter wave (mmWave) frequencies constitute a major component of standalone (SA) 5G networks, supporting high data rates in enhanced mobile broadband (eMBB). One key advantage at these bands is the large amount of contiguous available spectrum. However, the aggregated path losses impose the use of beamforming techniques to achieve higher link gains. This results in directional transmission and reception modes, which yield prolonged access times. Meanwhile, the International Mobile Telecommunications (IMT) framework specifies 10 millisecond (ms) latency levels for eMBB in 5G systems [
1]. Hence, a major challenge is to provide fast access schemes that feature ultralow access times, along with reduced power and energy consumption. Additionally, these access schemes need to account for channel fluctuations and variations in link status due to blockage, as well as mobility effects.
Currently, conventional schemes dictate that the mobile station (MS) and base station (BS) perform a spatial search over all directions in order to determine the beamforming and combining vectors with the highest received signal level. For example, the work in [
2] proposes a hierarchical codebook for iterative search that uses wide beams in the initial search stages; refinement is then conducted in subsequent stages using narrow beams. However, this technique can suffer from reduced directivity, outages and sensitivity to blockage due to the low gains achieved in the initial codebook stage. Moreover, the work in [
3,
4] uses metaheuristics, e.g., generalized pattern search and Hooke–Jeeves methods, in efforts to shorten access times and reduce energy consumption. The work in [
5] exploits the sidelobe information to retrieve the direction of the main lobe. However, this scheme is limited to line-of-sight (LoS) and single-ray channels. Moreover, the work in [
6] exploits grating lobes for simultaneous transmission to increase directivity. However, this scheme features a complex beamforming structure with a large number of antennas and high power requirements.
Furthermore, the geolocation-assisted access scheme in [
7] utilizes the global positioning system (GPS) at the MS to determine the BS location. However, this context-based scheme is limited to outdoor settings with permanent GPS connectivity, and it requires the BS to conduct an exhaustive beam search to locate the MS. The work in [
8] proposes a single radio frequency (RF) chain architecture for multi-user operation that uses downlink (DL)–uplink (UL) and DL–DL beam-training techniques, where a subset beam group is trained in a single time slot. However, the use of a single RF chain at the BS limits the number of connected MSs and the scalability. Finally, the work in [
9] proposes a subarray-cooperation multiresolution codebook design. It features a beam alignment scheme that adaptively selects initial layers based on various simultaneous signal-to-noise ratio (SNR) levels. Hence, it quickly aligns the desired beam pairs under single-dominant-path channels by using hybrid beamforming. Overall, the aforementioned schemes still yield high computational complexity, prolonged access times and high power and energy requirements.
Various studies have used deep learning to solve the problem of beam management for mmWave communication systems. First, the authors of [
10] use a deep neural network (DNN) to predict the beam direction at the BS, while implementing an omnidirectional antenna at the MS. The work only studies the accuracy of the DNN algorithm using 24 beams at the BS and aims to outperform the exhaustive beam mechanism. Additionally, the work in [
10] considers an omnidirectional MS and a directional BS. However, the omnidirectional mode at the MS presents various challenges in terms of signal quality and throughput, which require further investigation. By contrast, the work proposed in this paper uses 64 beams and considers overall system performance, including access times, power and energy consumption. Furthermore, results are compared to the fastest beam access schemes reported in the literature. In addition, this paper proposes beamforming models at the MS and BS, which are then used with the deep learning network to study comprehensive performance metrics.
Furthermore, deep-learning-based beam selection is proposed in [
11] to reduce the time overhead by exploiting sub-6 GHz channel information. A DNN algorithm is used to estimate the power delay profile (PDP) of the sub-6 GHz channel, which then acts as an input to the DNN. Overall, this work relies on the support of sub-6 GHz connections, thus limiting the ability of mmWave networks to operate separately. This becomes inefficient for a 5G New Radio network operating at FR2. Moreover, the work assumes that the sub-6 GHz link is already established, which makes the access time incomplete, i.e., it is required here to study the time complexity for beam association from the instant a MS joins the network until the start of the data plane. There is also a lack of comprehensive beamforming designs at the MS and BS, as the work is limited to a conventional discrete Fourier transform (DFT)-based codebook. Similarly, the work in [
12] also relies on sub-6 GHz channel vectors for initial beam access and blockage prediction in mmWave systems. As opposed to the methodology of [
11], which extracts spatial channel characteristics at the sub-6 GHz band and then uses them to reduce the mmWave beam training overhead, here the mapping functions are predicted directly from the sub-6 GHz channel. Specifically, the model leverages transfer learning to reduce the learning time overhead. However, the estimation of the mapping functions is often complicated and requires a large neural network to achieve accuracy. In addition, the work again relies on sub-6 GHz bands to realize beam access at mmWave, i.e., dual-band systems (microwave and mmWave transceivers) are needed at the BS and MS. Here, the power consumption and access time need to be further investigated.
For mmWave vehicular communication, the authors of [
13] propose a beam alignment procedure based on a fingerprinting approach, in which a set of beam pairs constitutes the fingerprint of a given location. Deep learning is deployed at the BS to adapt and update these fingerprints. Moreover, a plurality mechanism is proposed for the beams that satisfy the received signal strength requirement, i.e., to achieve multiplexing and diversity gains. The outcomes aim to improve the fidelity as compared to exhaustive beam search fingerprinting without deep learning.
Another use of DNNs is beam management and interference coordination in dense indoor mmWave networks for IEEE 802.11ay in [
14], where the beam directions, beamwidths and transmit power are optimized. The goal is to reduce the computational complexity and time while obtaining a sum-rate comparable to conventional methods. However, the scheme uses a beamforming training mechanism to establish the directional links between the mobile access point (MAP) and stationary access point (SAP), which is subsequently used to generate training data for the DNN to mitigate interference. Therefore, the deep learning network is not used here for initial access. Moreover, the implementation is limited to indoor wireless local area networks (WLAN) and is not applied to outdoor settings at larger separation distances.
The authors of [
15] describe a specific dataset for beam selection techniques in vehicle-to-infrastructure communications using millimeter waves. A methodology for channel data generation in mmWave multiple-input multiple-output (MIMO) scenarios is presented, aiming to simplify the creation of data in mobility scenarios by invoking a traffic simulator and a ray-tracing simulator. However, the context here is different and unrelated to initial access between the MS and BS cells, i.e., the propagation channel dataset developed here by ray tracing is specific to vehicle-to-infrastructure mmWave networks. Overall, the work in [
15] focuses on modeling mobility only and lacks the analysis of the downlink performance. By contrast, the proposed work in this paper focuses on standalone mmWave networks, considering link performance with beamforming architectures at the BS and MS.
Furthermore, deep learning is also used for beam training in [
16] for a mmWave massive MIMO system. The nonlinear properties of channel power leakage are used in the estimation process, where a DNN predicts the best beam combination that yields the strongest channel path based on a probability vector, in efforts to improve the success and achievable rates at a lower overhead. However, this work lacks latency and power models, as well as comprehensive beamforming modeling at the BS/MS.
Moreover, the work in [
17] presents a beam alignment technique with partial beams using neural networks for multi-user mmWave massive MIMO systems, in efforts to improve the spectral efficiency at a reduced training overhead as compared to hierarchical search and compressed sensing methods. Offline training is conducted on the channel model, after which online prediction of the beam distribution vector is achieved using partial beams. The dominant indices obtained from the beam distribution vector are then used to align the beams for the multiple users. The work in [
18] combines machine learning and situational awareness to learn the power and optimal beam index, whereby the angles of arrival (AoAs) are first estimated based on the location and then used as input to the neural network for beam selection. However, the requirement for user location information (prior knowledge) for training weakens the proposed algorithm and adds to the system complexity.
A joint beamforming approach between distributed BSs is developed in [
19] that deploys machine learning to simultaneously serve a mobile MS. The MS transmits a single uplink training sequence to the participating BSs using omni or quasi-omni beam patterns to develop location signatures. The signatures are then used at the deep learning stage to estimate the beamforming vectors at these BSs, thus reducing the training overhead. The limitation of this work is the use of wide beams (omni or quasi-omni), which makes it inefficient in blockage scenarios during user mobility. These wide beams also yield low channel gains and throughput levels. Out-of-band information is likewise used in [
20] for a deep learning beam prediction scheme that minimizes the training complexity. Namely, a dual-band (sub-6 GHz and mmWave links) approach is implemented, where the optimal beam in the mmWave band is estimated from sub-6 GHz channel state information (CSI). The work focuses on testing the network accuracy without investigating the beamforming and channel models. One limitation here is the assumption of similar spatial features between the channels of the two bands, which is inaccurate.
Overall, existing deep-learning-based beam access schemes (summarized in
Table 1) still rely on key operating assumptions that conflict with the objectives of the FR2 NR of 5G systems. First and foremost, some models assume an omnidirectional mode at the MS, where the beam discovery is limited to the BS. Others depend upon multiple BSs, the MS location or sub-6 GHz bands, and thus fail to operate mmWave as a standalone network. Other limitations include indoor-only implementation and marginal enhancement over existing conventional methods (e.g., beam sweeping and exhaustive searches). Overall, these models lack time delay and power consumption models in the control plane, and hence work is needed to investigate the delay in standalone beamforming-based mmWave networks.
In light of the above, this paper proposes a first use of a deep learning network model for initial beam access in mmWave communications, with the goal of developing one of the fastest beam access schemes. The model operates in learning and training modes and aims to predict the best beam index over subsequent time steps, where these indices are affiliated with specific beamforming and combining vectors.
The key practical application of the proposed work is enhancing mmWave networks as part of the 5G FR2 New Radio. Current 5G implementations rely on conventional sub-6 GHz bands and leverage mmWave bands as a supplementary component, e.g., dual bands and carrier aggregation. However, this is projected only for the first phase of 5G, as the mmWave bands are expected to work independently starting in 2022–2024, contingent upon the development of mature technologies and optimization to support the targeted throughput and latencies. Hence, the mmWave bands are projected to provide standalone service without dependency on the microwave (legacy) bands. Thus, the beamforming capability enhances the channel quality and throughput for the user. Furthermore, the deep learning algorithm reduces the access times and control-plane latencies, meeting the ultra-low delays targeted by the 3GPP at 1 ms. This improves the quality of service (QoS) and enables the implementation of mmWave standalone networks.
Moreover, the technique can be adopted in wireless local area networks (WLAN) as part of the IEEE 802.11ay standard, as well as in mmWave links for vehicle-to-everything (V2X) after adding the mobility component. Furthermore, the deep learning algorithm enables the use of highly directional beams without reliance on wide-beam codebooks; this in turn eliminates the vulnerability to beam blockage due to low directivity. Namely, the MS will be able to use narrow beams when transiting from sleep (off) mode to idle or active mode in the control plane, thus helping to reduce the beam search time. As a result, high data rates can be supported here, i.e., leveraging the high channel capacities and aggregated antenna gains.
This paper is organized as follows.
Section 2 presents the beamforming, signals and channel models. Then the beam access scheme is proposed in
Section 3. Performance evaluation is presented in
Section 4, along with conclusions in
Section 5.
3. Beam Prediction Access Scheme
The key processing elements of the proposed BRNN-LSTM deep learning model for beam prediction are now presented. First, a unidirectional recurrent neural network (RNN) updates hidden layers based upon information received from the input layer as well as the activation state. However, a limitation of the unidirectional RNN is that it learns from the past only. Hence, a bidirectional approach is adopted in this work to improve the RNN. In this merge, one direction learns the past state, whereas the other learns the future state, and the two outputs are then combined for an enhanced estimate. Therefore, the bidirectional feature enables the long short-term memory (LSTM) network to train each input sequence in disjoint forward and backward states that are subsequently connected to the same output layer. This strengthens the ability of the LSTM to retrieve additional beam index contextual information as compared to the conventional LSTM method. The processes at the backward and forward states are similar at each of the bidirectional units.
Another limitation of RNN networks during the training process is the vanishing gradient problem for long data sequences. Hence, LSTM networks are adopted to solve this problem by introducing memory blocks (units) that are comprised of self-connected memory cells and multiplicative gates, thus enabling the learning of long-term dependencies. Therefore, the work here combines the strengths of past/future (backward and forward) state information in the BRNN with the powerful memory blocks of the LSTM, for extended training periods and more information, to improve the quality and accuracy of the beam prediction. Furthermore, four BRNN-LSTM layers are stacked (chained) to achieve higher precision. The proposed BRNN-LSTM method has three phases, i.e., input, hidden layers and output, where each hidden layer is represented by a bidirectional LSTM cell. Along these lines, the prediction scheme combines the BRNN and LSTM to achieve a suitable solution for time-series prediction of variable sequence lengths, i.e., duplicating the training on the input sequences (information from the dataset) by leveraging forward and backward states. The architecture for the proposed scheme is presented next.
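To illustrate the forward/backward fusion described above, the following minimal NumPy sketch uses a plain RNN cell as a stand-in for the full LSTM unit; the weights, dimensions and the `simple_rnn_pass`/`bidirectional_pass` helpers are hypothetical, for exposition only:

```python
import numpy as np

def simple_rnn_pass(x_seq, w_x, w_h):
    """Run a plain RNN over a sequence; return the hidden state at each step."""
    h = np.zeros(w_h.shape[0])
    states = []
    for x in x_seq:
        h = np.tanh(w_x @ x + w_h @ h)
        states.append(h)
    return states

def bidirectional_pass(x_seq, w_x, w_h):
    """Forward and backward passes are computed independently, then the two
    hidden states at each time step are concatenated (fused) for the output layer."""
    fwd = simple_rnn_pass(x_seq, w_x, w_h)
    bwd = simple_rnn_pass(x_seq[::-1], w_x, w_h)[::-1]  # reverse back to align steps
    return [np.concatenate([f, b]) for f, b in zip(fwd, bwd)]

rng = np.random.default_rng(0)
seq = [rng.standard_normal(3) for _ in range(5)]   # 5 time steps, 3 features each
w_x = rng.standard_normal((4, 3))                  # input-to-hidden weights (4 hidden units)
w_h = rng.standard_normal((4, 4))                  # hidden-to-hidden weights
out = bidirectional_pass(seq, w_x, w_h)
print(len(out), out[0].shape)                      # 5 fused states, each of size 2 x 4
```

In a real bidirectional layer the forward and backward directions carry separate weights; a single shared weight set is used here only to keep the sketch short.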
3.1. Network Architecture
The network architecture for the proposed scheme is presented in
Figure 2. It is composed of the input sequences and four BRNN-LSTM layers, where each layer is composed of 50 cells (neurons).
Each LSTM cell in the BRNN-LSTM model is composed of input (i_t), input modulation (g_t), forget (f_t) and output (o_t) gates that determine the information entering the cell state; see
Figure 3.
The output of the last BRNN-LSTM layer is fed as the input of the dense layer (as per
Figure 2), which uses a linear activation function. The output of the dense layer is the output of the proposed scheme, i.e., the beam index prediction at time step t + 1. The process by which this prediction is achieved is now presented.
3.2. Operating Modes
The network operates in two modes, i.e., the learning (Mode I) and training (Mode II) modes.
Learning Mode (Mode I): The network here operates in the normal mode, where beam scanning is performed at the MS and BS using conventional schemes, e.g., codebook-based iterative search. Namely, once a MS transits from sleep to active mode and joins the mmWave network, a search is conducted over all beamforming and combining vectors to determine the best beam index and its affiliated direction at time step t, i.e., the pair yielding the highest received signal level.
Thereafter, the BS and MS feed the best beam index at every time step to the model for use in the training mode. Once the model is well trained, the MS and BS leverage it to predict the next best beam, as presented next in Mode II.
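The conventional search of Mode I can be sketched as follows; the DFT codebooks, array sizes and random channel matrix below are illustrative assumptions, not the system model of this paper:

```python
import numpy as np

def dft_codebook(n_ant, n_beams):
    """Columns are DFT beamforming vectors (a common codebook choice)."""
    k = np.arange(n_ant)[:, None] * np.arange(n_beams)[None, :]
    return np.exp(-2j * np.pi * k / n_beams) / np.sqrt(n_ant)

def exhaustive_search(H, F, W):
    """Scan all (combining, beamforming) pairs and return the index pair
    with the highest received signal level |w^H H f|."""
    gains = np.abs(W.conj().T @ H @ F)   # rows: MS beams, cols: BS beams
    return np.unravel_index(np.argmax(gains), gains.shape)

rng = np.random.default_rng(1)
F = dft_codebook(n_ant=16, n_beams=64)   # BS codebook (hypothetical sizes)
W = dft_codebook(n_ant=8, n_beams=16)    # MS codebook
H = rng.standard_normal((8, 16)) + 1j * rng.standard_normal((8, 16))  # MS x BS channel
ms_beam, bs_beam = exhaustive_search(H, F, W)
print(ms_beam, bs_beam)
```

Every codebook pair is evaluated, which is exactly the cost the prediction scheme of Mode II is designed to avoid at subsequent time steps.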
Training Mode (Mode II): Given the sequences of the most selected beam indices over the time steps, retrieved from the dataset, the MS and BS now predict the next most likely beam to be used at time step t + 1. Namely, the prediction scheme leverages parametric information from previous time steps (periods) and then labels the next step to predict the beam index that returns the highest signal. The BRNN-LSTM scheme recursively processes the beam sequences at every time step of the input. It then maintains a hidden state which is a function of the previous state and the current input.
Problem Formulation: Consider the prediction status at time step t. The beam access problem is then defined as the prediction of the best beam direction at time step t + 1, given the status at time step t. Thus, the goal is to maximize the probability of successful beam prediction at the BS using the proposed BRNN-LSTM deep learning model.
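As a hypothetical illustration of this objective, the empirical success probability can be estimated as the fraction of time steps at which the predicted beam index matches the true best index (the index values below are made up):

```python
import numpy as np

# True best beam indices per time step vs. the model's predictions (hypothetical).
true_beams = np.array([12, 12, 13, 14, 14, 15, 15, 16])
pred_beams = np.array([12, 12, 13, 13, 14, 15, 15, 16])

# Empirical probability of successful beam prediction.
success_prob = np.mean(pred_beams == true_beams)
print(success_prob)  # 0.875 (one mismatch out of eight steps)
```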
3.3. BRNN-LSTM Deep Learning Model
The beam prediction algorithm relies on the BRNN-LSTM network in two stages, i.e., the outer processing stage between the layers and the inner stage inside the LSTM cell. For the outer stage between the layers, the input data is first processed by both the bidirectional forward and backward layers to obtain the hidden states. These data reflect contextual information about the beam indices and affiliated power levels. Following this step, the hidden states are fused to obtain the output layer. Here, the bidirectional property enables the LSTM to retrieve additional beam index contextual information as compared to the conventional LSTM, i.e., obtaining the current and future time steps via the backward and forward states. The processes at the backward and forward states are similar at each of the bidirectional LSTM units, where each LSTM cell is composed of input (i_t), input modulation (g_t), forget (f_t) and output (o_t) gates that determine the information entering the cell state. Now consider the details inside the LSTM cell.
For the inner stage inside the LSTM unit, the cell state c_t at time step t, which specifies the information carried to the next sequence, is first modified by the input gate i_t in the sigmoid layer placed underneath it, which is in turn adjusted by the input modulation g_t that delivers the new candidate cell state. The forget gate f_t receives the hidden-state vector h_{t−1} (output vector of the LSTM unit) at time step t − 1 and the input vector x_t at time step t as its inputs. This gate then produces an output number between 0 and 1 for each number in the previous cell state c_{t−1}. Namely, the output of f_t instructs the cell state on which information to forget or discard by multiplying a position in the matrix by 0. Meanwhile, if the output of f_t is 1, then the information is kept in the cell state. Here, a sigmoid function σ is applied to the weighted input and previous hidden state. Equations (7)–(11) represent the i_t, g_t, f_t, o_t, and c_t formulations at time step t, respectively, expressed as [22]

i_t = σ(W_i · [h_{t−1}, x_t] + b_i), (7)
g_t = tanh(W_g · [h_{t−1}, x_t] + b_g), (8)
f_t = σ(W_f · [h_{t−1}, x_t] + b_f), (9)
o_t = σ(W_o · [h_{t−1}, x_t] + b_o), (10)
c_t = f_t ⊙ c_{t−1} + i_t ⊙ g_t. (11)
The parameters W_i, W_g, W_f and W_o are the weight matrices and b_i, b_g, b_f and b_o are the bias vectors for i_t, g_t, f_t and o_t, respectively, i.e., learnt during the training mode. Finally, the hidden-state layer output h_t (working memory) is modeled as h_t = o_t ⊙ tanh(c_t). Note that the key components of the model are the logistic sigmoid σ and the hyperbolic tangent (tanh) nonlinear activation functions for each gate, used to predict the probability of the output. Foremost, i_t uses a sigmoid function with range [0, 1] that can only add memory. Namely, the sigmoid function is unable to forget/clear memory, since the cell state equation is a summation between the gated previous cell state and the new candidate state. Therefore, g_t is activated with a tanh function with a [−1, 1] range, which allows the cell state to forget memory. Overall, the training settings for the model include four BRNN-LSTM layers and one dense layer, as presented in
Figure 2.
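A minimal NumPy sketch of the per-cell update in Equations (7)–(11) is given below; the dimensions and randomly initialized weights are hypothetical, for illustration only:

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x_t, h_prev, c_prev, W, b):
    """One LSTM cell update: the four gates act on the concatenated
    previous hidden state and current input."""
    z = np.concatenate([h_prev, x_t])
    i_t = sigmoid(W["i"] @ z + b["i"])   # input gate, range (0, 1): adds memory
    g_t = np.tanh(W["g"] @ z + b["g"])   # input modulation, range (-1, 1)
    f_t = sigmoid(W["f"] @ z + b["f"])   # forget gate: 0 discards, 1 keeps
    o_t = sigmoid(W["o"] @ z + b["o"])   # output gate
    c_t = f_t * c_prev + i_t * g_t       # new cell state (long-term memory)
    h_t = o_t * np.tanh(c_t)             # hidden-state output (working memory)
    return h_t, c_t

rng = np.random.default_rng(2)
n_in, n_hid = 1, 4                       # hypothetical small dimensions
W = {k: rng.standard_normal((n_hid, n_hid + n_in)) for k in "igfo"}
b = {k: np.zeros(n_hid) for k in "igfo"}
h, c = lstm_step(np.array([0.5]), np.zeros(n_hid), np.zeros(n_hid), W, b)
print(h.shape, c.shape)
```

Note how the cell state is a sum of the gated previous state and the gated candidate, matching the discussion of why only the tanh-activated term can drive the state back toward zero.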
The dropout layer is used to regularize the hidden layers, where the dropout rate in each layer is set at 0.2. The model is trained with 350 epochs over a period of two weeks. A data structure is created with 60 time steps, each of 10 min, and a single output is created, since LSTM cells store a long-term memory state. Hence, in each training stage, there are 60 previous training set elements for each taken sample. Consequently, in the testing stage, the first 60 samples are needed for an accurate estimate of the subsequent best beam index. Overall, the training objective is to compute the weight matrices and bias vectors that minimize the loss function over all training time steps, as shown next. See
Table 2 for the parametric settings chosen for the layers in the BRNN-LSTM model.
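The 60-step data structure described above can be sketched as a sliding-window construction; the `make_windows` helper and the synthetic beam log are hypothetical stand-ins for the real dataset:

```python
import numpy as np

def make_windows(series, n_steps=60):
    """Build supervised pairs: each sample holds the previous n_steps beam
    indices, and its label is the index at the next time step."""
    X = np.array([series[i:i + n_steps] for i in range(len(series) - n_steps)])
    y = series[n_steps:]
    return X, y

beam_log = np.arange(200) % 64           # hypothetical logged best-beam indices
X, y = make_windows(beam_log, n_steps=60)
print(X.shape, y.shape)                  # (140, 60) (140,)
```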
Dataset: The dataset used in this paper is part of the
BigData Challenge in [
23], recorded over a period of two weeks. Namely, this dataset reveals the MS traffic volumes and the used beam indices in sectorized geographical grids.