1. Introduction
Rate-splitting multiple access (RSMA) has become a promising multiple-access framework that is being considered for implementation in future 6G networks and beyond [
1]. Recent studies have shown that RSMA outperforms traditional multiple-access techniques, such as non-orthogonal multiple access (NOMA) and space-division multiple access (SDMA), in terms of energy and spectral efficiency for multi-antenna systems [
1,
2,
3,
4]. The concept behind RSMA involves dividing user messages into multiple parts, which are transmitted using superposition coding at the transmitter and decoded using successive interference cancellation (SIC) at the receivers. The most common approach is based on single-layer rate splitting (RS), involving two parts: a common part and a private part. The common parts from all users are transmitted simultaneously using a shared codebook and must be decoded by all users, whereas the private parts are transmitted using private codebooks. This approach allows RSMA to transmit all the messages simultaneously and in the same frequency band using the power and spatial domains. Consequently, RSMA offers flexibility, where interference can be treated as noise or fully decoded, making it a versatile framework encompassing SDMA and power-division NOMA as special cases [
3,
5].
Modern wireless systems consider communication trends, such as the Internet of Things (IoT) and machine-type communications (MTC). The key priorities in wireless communications involve advancing green technology and reducing device power consumption [
6]. Various studies have been undertaken to establish self-sustainable communication systems using energy-harvesting (EH) techniques. One of the EH technologies contributing to these objectives is simultaneous wireless information and power transfer (SWIPT) [
7]. SWIPT is an effective technology that enables the base station (BS) to transmit energy and information simultaneously to wireless users. Within SWIPT, the most well-known architectures are the power-splitting (PS) architecture and time-switching (TS) architecture, where both schemes consider an EH module and an information decoding (ID) module at the receiver. In the TS architecture, the receiver periodically alternates between the ID and EH modules based on a TS sequence. In the PS architecture, the incoming radio frequency (RF) signal is divided into two streams based on a PS ratio, which are sent to the ID and EH modules. The authors of [
7] focused on the PS architecture because it has been established in the literature as achieving the best balance between energy harvesting and information decoding.
A cost-effective solution was proposed in [
8] using low-cost stations called power beacons (PBs) for wirelessly recharging devices using RF energy. PBs primarily serve as a source of wireless energy, extending the operational lifespan of battery-powered devices through wireless recharging. PBs do not necessitate complex computations and have low backhaul link requirements, enabling cost-effective and adaptable placement, making them a valuable addition to the system.
SWIPT has been widely studied, ranging from single-antenna systems [
9] to multiple-antenna systems [
10,
11]. Shi et al. [
10] optimized the precoding vectors and PS ratios in a multiuser multiple-input single-output (MU MISO) system to minimize the transmission power while considering both the minimum signal-to-interference-plus-noise ratio (SINR) and minimum EH at the user side. An extension to a multiuser multi-input multi-output (MIMO) system was explored in [
11], considering the EH constraints and a maximum tolerable mean square error (MSE) for received information. SWIPT has also been implemented along with the NOMA method in multi-antenna systems to maximize data rates [
12], minimize transmit power [
13], and enhance energy efficiency [
14]. On the other hand, there is limited research on the performance of networks implementing SWIPT in conjunction with the RSMA method.
In the context of RSMA, a pioneering study introduced the concept of RS [
2]. The authors elucidated the principal limitations of conventional methods while emphasizing the potential advantages of RS in terms of spectral and energy efficiencies compared to traditional techniques. The initial investigations of RS in a multiuser MISO system [
15,
16] aimed to maximize the minimum rate and maximize the sum rate. The solutions to these optimization problems were based on the weighted minimum mean square error (WMMSE) method and the alternate optimization (AO) algorithm for optimizing the precoding vectors and common rate variables. These studies reported that the RS framework consistently outperformed conventional methods, as evidenced by a comprehensive analysis of data rates and complexity in scenarios involving imperfect channel state information at the transmitter (CSIT). In [
17], an extension analysis was conducted on massive MIMO systems to maximize the minimum achievable rate of the common message. A hierarchical rate-splitting approach was proposed to address the challenges posed by the extensive array of antennas at the transmitter. The simulation results revealed the superior performance of RS over conventional broadcasting methods, considering perfect and imperfect CSIT. The RSMA method was initially introduced for downlink multiuser MISO systems [
3]. The authors highlighted the generality of RSMA compared to SDMA and NOMA. They also addressed the maximization of the weighted sum rate while considering the minimum rate constraints and power limitations. The validity of the authors’ assertions was confirmed by the simulation results, emphasizing the superior performance of RSMA across various scenarios with different network loads and numbers of users. An exhaustive analytical analysis was conducted for a two-user case in [
5], demonstrating the generality of RSMA by showcasing how it encompasses SDMA, NOMA, and orthogonal multiple access (OMA) as particular cases. Furthermore, the energy efficiency maximization and sum-rate maximization in multiuser MISO systems examined using RS emphasized the outstanding performance of RSMA, demonstrating its superior spectral and energy efficiency compared to SDMA and NOMA [
18].
An initial study of RSMA and SWIPT was conducted to maximize the sum rate of users [
19], considering two types of users: one for decoding information only and the other for harvesting energy only. A previous study [
20] investigated the performance of RSMA with SWIPT in multiuser MISO systems, considering that users can decode information and harvest energy based on a PS factor. Furthermore, the scenario of RSMA with SWIPT when an IRS is deployed in the system was investigated in [
21]. These studies reported significant improvements in performance provided by RSMA compared to traditional SDMA and NOMA methods. On the other hand, the aforementioned studies primarily focused on users close to the BS, driven by the EH requirements imposed by the optimization problem. This limitation becomes evident in real-world deployments, where users may be located far from the BS, posing challenges in meeting the EH requirements. Therefore, this paper investigated the deployment of a PB with RSMA as an efficient approach to address the previously mentioned challenge.
Huang et al. [
22] introduced a system involving PBs for wireless power transfer (WPT). These PBs can wirelessly charge receivers and be strategically placed alongside femtocell base stations (BSs) to provide short-range SWIPT to wireless devices. A PB-assisted wireless-powered communication network (WPCN) was proposed in [
23], consisting of a single-antenna PB and several single-antenna access points (APs). The PB provides RF energy to the APs, which transmit their information using the energy harvested from the PB. An extension of the PB-assisted WPCN, considering multi-antenna PBs, was reported in [
24] to optimize the energy beamformer vector and maximize the spectrum efficiency. A previous study [
25] considered the coexistence of a multi-antenna PB with a multi-antenna BS, assuming one ID user and several EH users, where the authors addressed the maximization of the total harvested energy. On the other hand, the aforementioned works did not consider SWIPT users equipped with a PS architecture and did not incorporate the RSMA framework.
Regarding state-of-the-art SWIPT systems assisted by a PB, the minimization of transmit power in a single-antenna system was investigated in [
10]. Extensions to a multiuser MISO system incorporating SDMA and NOMA were introduced in [
26,
27], respectively. Vu et al. [
28] considered a scenario in which a multi-antenna PB transmits RF energy to a single-antenna transmitter, serving two users with several relays. The transmitter employed the NOMA method and harvested energy from the PB to transmit the two messages. The relay applied the PS architecture to harvest energy and decode the message for the distant user, which was then forwarded. A part of this investigation was presented in a conference article [
29], where the SWIPT system with RSMA was introduced with the aid of a PB, where the optimal scenario of decoding the whole interference from the PB at the user side was assumed. On the other hand, this paper considered the general case of treating the interference from the PB as noise. In addition, a low-complexity and efficient scheme was proposed based on deep learning (DL).
DL is gaining popularity as a technique for resource optimization in wireless networks because of its ability to significantly reduce computational complexity compared to traditional optimization methods. A review of the most common schemes based on machine learning and reinforcement learning methods for network optimization problems was published in [
30,
31]. A deep neural network (DNN) was introduced as an approximation method for solving the sum-rate maximization problem using the WMMSE algorithm in a single-antenna system [
32]. The results of the simulations demonstrated the effectiveness of the DNN, which closely approximated the solution of the WMMSE algorithm while reducing the computational time. Xia et al. [
33] addressed three popular optimization problems in a multiuser MISO system with SDMA. Their proposed solution leveraged a neural network module to predict key features based on the channel vectors and a beamforming module to construct the beamforming vector from the predicted vital features. Building upon the previous system model, the sum-rate maximization problem was investigated in [
34], and two types of schemes based on a DNN were proposed: one where the beamformers are directly generated by the DNN output and the other utilizing a beamformer recovery module after the DNN module. The simulation results demonstrated performance close to traditional optimization methods, with the model incorporating the beamformer recovery module yielding the best results. On the other hand, the aforementioned work did not consider SWIPT technology and the RSMA method. The solutions are problem-dependent and rely on expert knowledge. Furthermore, state-of-the-art methods consider the channel vectors as direct inputs to the deep learning module. This approach involves using hundreds of features as the number of antennas and users increases, increasing the complexity of the deep learning model. A previous work [
35] considered a MISO SWIPT system with RSMA, where a deep learning-based solution was proposed. The scheme comprised three modules: an autoencoder for dimension reduction, a DNN to predict a set of target variables, and a precoding module to obtain the precoding vectors. On the other hand, the authors did not account for EH users or the deployment of a PB, which significantly increased the number of input features. Furthermore, the autoencoder module required problem-dependent adjustments of its hyperparameters, such as the number of hidden layers, hidden nodes, activation functions, and learning rate. These factors made it difficult to generalize the scheme to other scenarios.
The RSMA framework has showcased significant enhancements in spectral and energy efficiencies compared to conventional SDMA and NOMA techniques. This establishes RSMA as a promising candidate for future 6G networks, prompting a thorough examination of its performance across diverse multi-antenna systems. Moreover, the deployment of PBs holds vital significance in extending the lifespan of wireless devices, especially in IoT scenarios where battery replacement poses challenges in hard-to-access areas. However, the interference generated by PBs during the information decoding process at the user side necessitates a comprehensive investigation into an efficient beamforming design at both the BS and PB. Consequently, this study aims to develop a high-performance, low-complexity solution to jointly optimize the beamforming vectors and PS ratios in a multi-antenna system. The proposed system model integrates a multi-antenna transmitter implementing RSMA with SWIPT, along with a multi-antenna PB. The primary objective of this study is to minimize the total transmission power while fulfilling data rates and EH requirements for EH and PS users. The main contributions of this paper are as follows:
In the considered system model, the objective is to minimize the total transmission power of the BS and PB while meeting the minimum EH requirements for EH and PS users and ensuring a minimum data rate for PS users.
A two-step approach is adopted to address the non-convex problem presented in this study. In the first step, we optimize the common rate variables through a DL-based scheme that combines the principal component analysis (PCA) technique for dimensionality reduction with a DNN. The second step focuses on optimizing the beamforming vectors of the BS and PB, along with the PS factors. The SDR technique is used to accomplish this.
As a comparative scheme, the proposed minimization problem is addressed using a PSO-SDR approach, which is an iterative-based method that provides near-optimal solutions to the proposed problem. This scheme serves as a reference to analyze the performance of the proposed DL-based solution.
Simulation results show that the proposed DL-based method can perform similarly to traditional iterative-based schemes while significantly reducing computational complexity. Furthermore, the proposed RSMA solution is compared with NOMA and SDMA, showing that the RSMA approach achieves the lowest transmit power. Moreover, the generalization performance of the proposed DL-based method is validated by testing its performance across several challenging scenarios not included in the DL model training.
The remainder of this paper is structured as follows.
Section 2 introduces the system model and formulates the problem.
Section 3 outlines the proposed solution, and presents the comparative schemes.
Section 4 provides the simulation results, and
Section 5 reports the conclusions.
2. System Model and Problem Formulation
A MU MISO RSMA system assisted by a PB with SWIPT is considered, as shown in
Figure 1. The number of antennas at the BS is
, the number of antennas at the PB is
, and there are
K single-antenna PS users and
G single-antenna EH users. EH users are exclusively dedicated to harvesting RF energy, whereas PS users are equipped with a PS mechanism, allowing them to partition the incoming RF signal into two components, serving both information decoding and EH based on a designated PS ratio,
. The BS transmits information signals to both harvest energy at the EH module and decode information at the ID module for PS users. The PB, on the other hand, transmits energy-carrying signals used for energy harvesting by EH users and the EH module of PS users. However, these energy signals are considered interference at the ID module of PS users.
In one-layer RSMA, the original message intended for the
kth PS user,
, is separated into a common message,
, and a private message,
. A super-common message is produced by combining all the
K common messages and encoding them into the common stream
, which needs to be decoded by all PS users. The
K private messages are independently encoded into
K private streams
to be decoded by their respective user. At the PB, the energy-carrying signals are represented by
,
. Therefore, the signals transmitted at the BS and PB after precoding are
and
, respectively, where
and
represent the information beamforming vectors, and
corresponds to the energy beamforming vector for
,
, and
.
Figure 2 shows the RSMA transmission scheme in the considered system model.
At the user side, the
kth PS user first decodes the common stream
with an achievable rate given by
where
represents the channel vector from the BS to the
kth PS user;
is the channel vector from the PB to the
kth PS user;
is the PS factor;
represents the Gaussian noise at the antenna of the
kth PS user; and
is the data processing noise at the ID module of the
kth PS user.
After decoding
, RSMA employs the SIC procedure to eliminate interference due to the common stream
. The
kth PS can decode its private stream
with an achievable rate given by
The rate at which is transmitted, expressed as , must satisfy because the common stream needs to be decoded by all users. Moreover, the rate is composed of the rates to transmit each and can be expressed as , where is the rate to transmit the common part of the kth message, .
The energy harvested at the EH module of the
kth PS user can be expressed as follows:
where
is the EH efficiency at the
kth PS user. Moreover, at the
gth EH user, the harvested energy can be expressed as
where
represents the EH efficiency at the
gth EH user, and
and
are the channel vectors from the PB and BS to the
gth EH user, respectively.
We seek to optimize the beamforming vectors,
; the common rate variables,
; and the PS ratios,
, which together achieve a minimum rate requirement, and at the same time, they can harvest a required minimum EH for future use. The minimization of the sum transmission power of the PB and BS can be formulated as follows:
Constraint (5b) ensures that the kth PS user achieves a minimum rate requirement, denoted as . Constraint (5c) is set to guarantee that the common stream can be decoded by all PS users. Constraints (5d) and (5e) ensure that each kth PS user and gth EH user can harvest a minimum EH requirement, denoted as and , respectively. Solving the power minimization problem (5) is challenging because of its non-convex constraints (5b)–(5e). In the following, a near-optimal solution is proposed based on DL techniques and the SDR method.
3. Proposed Approach for Addressing Problem (5)
A DL-based scheme and SDR method are developed to address the non-convex problem (5). First, problem (5) is reformulated into two subproblems, denoted as follows:
where the first subproblem, represented in (6a), optimizes the common rate variables,
, for the given beamforming vectors,
, and PS ratios,
.
Section 3.1 presents a DNN-based scheme with PCA to solve this first subproblem. The second subproblem, represented by
in (6b), optimizes the beamforming vectors,
, and the PS ratios,
, for the given common rate variables,
. The proposed solution for the second subproblem is based on the SDR technique with the penalty function method and is described in
Section 3.2.
Figure 3 illustrates the overall procedure of the proposed scheme during both the training and online stages. In the training phase, the initial step involves generating the dataset. The dataset is generated by solving a minimization problem using conventional optimization methods, such as combining the PSO algorithm and SDR method. The features of the dataset consist of the rate and EH requirements, along with a reduced representation of the channel vectors obtained through PCA. The target values in this dataset correspond to the common rate,
. Subsequently, the K-fold cross-validation method is used to partition the dataset into training and validation subsets. These subsets are used to determine the optimal hyperparameters for the DNN module, including the number of hidden layers, hidden nodes, learning rates, batch size, and others. Once the DNN is trained, it is deployed at the BS for the online stage. During the online stage, the current channel vectors undergo dimensionality reduction via PCA. The reduced representations, rate, and EH requirements serve as inputs for the trained DNN. The result of the DNN is the prediction of the common rate variables,
, which are used in the SDR module to optimize the PS factors,
, and the beamforming vectors,
.
Moreover, by utilizing the predicted common rate variables,
, the minimization problem (5) can be reformulated as follows:
where
, and
.
3.1. Deep Learning-Based Scheme for Optimizing
This subsection describes the PCA module for dimensionality reduction and the DNN-based method to predict the common rate variables, .
Figure 4 presents the PCA module used to derive a reduced representation of each channel vector. The PCA technique [
36] projects the input data onto a lower-dimensional subspace to maximize the variance of the projected data. The training dataset is defined as
, where
is the total number of channel vectors used as the training dataset, and
is a
W-dimensional vector composed of the real and imaginary components of the
nth channel. The PCA module maps a channel vector onto an
L-dimensional subspace that satisfies
. The projection of
can be represented by
, where
with
for
. A
that maximizes the trace of the covariance matrix of
is needed to maximize the variance of
, which can be expressed as:
where
, and
. Let
represent the covariance matrix of
, and
. The solution of (8), described in [
36,
37], is obtained when
which means that the variance of the projected data is maximized when
is an eigenvector of
. Therefore, the optimal
is a matrix composed of the first
L eigenvectors of
as columns. Algorithm 1 summarizes the steps for performing the PCA technique.
Algorithm 1 PCA for dimensionality reduction |
- 1:
inputs: Training dataset containing the wireless channels and the number of components to keep, L. - 2:
Evaluate the mean . - 3:
Calculate the covariance matrix . - 4:
Decompose to obtain the eigenvectors and eigenvalues. - 5:
Select L eigenvectors corresponding to the L largest eigenvalues and create . - 6:
Map the W-dimensional channel vector into an L-dimensional representation by . - 7:
output: Reduced representation of the channel vectors by performing step 6 for each channel vector.
|
Figure 5 shows the DNN model for the proposed approach, composed of an input layer,
hidden layers, and an output layer. The number of nodes in the input layer is determined by the user’s rate requirements, EH requirements, and the reduced representation of the channel vectors after the PCA module,
. Each hidden layer consists of
hidden nodes, where the number of hidden layers and the number of hidden nodes are determined by fine-tuning the hyperparameters, as discussed in
Section 4. The common rate variables specify the number of nodes in the output layer,
. During the training stage, the weights of the DNN are optimized using the backpropagation algorithm [
38], given a training dataset with
as the real-valued targets. In conjunction with the grid search method, the K-fold cross-validation method is employed to select the best hyperparameters, including the number of hidden layers, hidden nodes, learning rates, and activation functions. Subsequently, during the online stage, after successfully training the DNN, the predicted common rate variables for the current channel vectors are constrained to feasible ranges, defined by the constraints (5b) and (5g), and are expressed as follows:
The modified common rate variables are passed to the SDR-based scheme to optimize the PS ratios, , and the beamforming vectors, . Within the DNN, we analyze the computational complexity in the online stage, as the training process takes place offline. This complexity is closely tied to the number of nodes and layers within the DNN. In particular, there are a total of nodes in the input layer, K nodes in the output layer, and each ith hidden layer has nodes, where . Consequently, the computational complexity of the DNN module can be expressed as + , even though this can be approximated to when the DNN parameters are considered fixed.
3.2. SDR-Based Method for Optimizing
We introduce an approach based on the SDR technique to jointly optimize the PS factors and beamforming vectors, with a fixed value for
. The matrix variables are denoted as
,
,
,
,
, and
. In addition, the matrix variable
enforces the conditions
,
, while matrix
enforces
and
. Problem (7) can be transformed into a convex problem by removing the rank-one constraints and given
, as follows:
Problem (11) is convex and can be solved efficiently using the CVX toolbox in MATLAB [
39]. Problem (11) involves
matrices of size
and
G matrices of size
, along with the
linear constraint variables. Therefore, the computational complexity of addressing problem (11) amounts to
while maintaining a solution accuracy of
[
40]. The optimal solutions are represented in problem (11) as
. When the matrix solutions have a rank of one, the optimal beamforming vectors,
, are given by
where
represents the largest eigenvalue of the matrix
, and
denotes its corresponding eigenvector. If
are rank-one, alternative approaches, such as the penalty function method [
41] or the Gaussian randomization technique [
20], can be employed to approximate the beamforming vectors.
The following details the penalty function method for the scenario where the matrix variables
are not rank-one. The proposed penalty-based method is based on the definition of
being positive semidefinite matrices, which satisfy the conditions
and
, where the matrices
are rank-one if
and
. Hence, the proposed penalty-based approach aims to minimize
and
by introducing a penalty factor,
, and incorporating these terms into the objective function as follows:
Problem (13) is non-convex because of the terms
and
. Therefore, the following inequality that holds for any
is used to handle the aforementioned non-convex terms:
where
represents the unit-norm eigenvector associated with the largest eigenvalue of
. Hence, (14) is used to approximate
based on a feasible matrix
at the
jth iteration as follows:
By applying a similar procedure as in (15), the term
can be equivalently transformed. As a result, problem (13) can be reformulated as follows:
Note that problem (16) is convex, and the solution can be obtained using the CVX toolbox in MATLAB. Algorithm 2 lists the proposed iterative scheme based on the penalty method, which is applied when the solutions to problem (11),
, are not rank-one.
Algorithm 2 Penalty-based method for solving problem (11) |
- 1:
inputs: Matrix solutions of problem (11), , tolerance value, , penalty factor, , channel vectors, rate, and EH requirements. - 2:
Set iteration counter . - 3:
Set initial feasible matrices, . - 4:
repeat - 5:
Solve problem (16) given the feasible matrices and denote the solutions as . - 6:
Increase counter . - 7:
Update the feasible matrices for the next iteration: . - 8:
until - 9:
Obtain the beamforming vectors with (12a) and (12b). - 10:
Output: .
|
3.3. PSO-Based Approach for Optimizing with a Given
This subsection presents a comparative scheme for optimizing the common rate variables using a PSO-based approach [
20] with given PS factors and beamforming vectors. PSO is a potent metaheuristic algorithm inspired by the social behavior of flocking birds, where the collective knowledge of the swarm guides each particle through the search space to discover the optimal solution. In the proposed PSO-based scheme, there is a population of
S particles whose position represents the common ratio variables to be optimized, i.e., the position of the
sth particle is given by
,
. The position of each particle is initialized randomly within the range
. The local best position for the
sth particle, denoted as
, represents the best location of the
sth particle. Moreover, the global best position, denoted as
, represents the best location of all the swarm particles. The location of the
sth particle is based on its velocity and is expressed as
where
w is the inertia weight;
are the acceleration parameters; and
. Subsequently, the position of the
sth particle is modified according to the following:
The objective function,
, is defined by the sum transmission power (7a), obtained by solving problem (11), with
being determined by the position of the
sth particle,
. The computational complexity of the PSO-based method is given by
, where
represents the complexity of solving problem (7) using the SDR method detailed in
Section 3.2. Algorithm 3 presents the algorithm based on PSO designed to optimize
.
Algorithm 3 Comparative scheme based on PSO for optimizing |
- 1:
inputs: Number of particles, S, maximum number of iterations, , rate, and EH requirements. - 2:
Set iteration counter and initialize the position of each particle and its velocity, . - 3:
Set the initial and . - 4:
While
do - 5:
for , do - 6:
Update the velocity and the position based on (17) and (18) to obtain and . - 7:
Restrict the value of the position based on (10). - 8:
Solve problem (11) with the common rates given by to get . - 9:
if then end if - 10:
end for - 11:
- 12:
Increase counter . - 13:
end while - 14:
Output: Best common rates, , defined by .
|
4. Simulation Results
Numerical simulations were conducted to assess the performance of the proposed DL-based solution in minimizing the sum transmission power of the BS and PB in the considered MISO SWIPT system with RSMA. The simulation parameters were set to
PS users,
EH users,
antennas at the BS,
antennas at the PB,
,
,
, and
. The channel vector between the
kth PS user and the BS is given by
where
defines the path-loss exponent;
;
denotes the distance between the BS and the
kth PS user; and
is subject to independent Rician fading.
are channels established according to (13). The BS is positioned at coordinates (8 m, 20 m), whereas the PB is located at (13 m, 7 m). PS users are randomly distributed within a designated region defined by
and
. Similarly, EH users are randomly distributed within an area bounded by
and
.
The effectiveness of the proposed DL-based method was evaluated by comparing it with a PSO-SDR method and conventional techniques, such as SDMA and NOMA. The proposed scheme is denoted as DNN-RSMA, and the comparative scheme based on PSO and SDR is denoted as PSO-RSMA. In SDMA [
26], the
kth PS user’s message,
, is encoded directly into the data stream,
, without common rate variables, and interference originating from other users is considered noise. In addition, a comparative scenario was incorporated by assuming the absence of the power beacon (PB) deployment in the network.
In NOMA [
27], interference originating from other users is mitigated by employing multiple layers in the SIC process. The message intended for the
kth PS user,
, is encoded into the private stream,
. The transmitted signal at the BS is given by
, where
is the information beamforming vector for
. The signal transmitted at the PB is the same as in the proposed RSMA-based framework. On the user side, the decoding order for PS users is determined by their respective channel strengths, denoted as
. In particular, the
kth PS user begins by decoding messages intended for the
Kth,
th, …,
th PS users. Subsequently, the
kth PS user proceeds to decode its intended message while treating the interference from the remaining messages as noise. The achievable rate to decode the
lth message at the
kth PS user is given by
Two datasets were considered in the simulation analysis: a training/validation dataset and a testing dataset. As described in
Section 3, the PSO-SDR method was used to process all the datasets considered. A fivefold cross-validation method was used to divide the training/validation dataset into separate training and validation datasets. These datasets were used to determine the optimal hyperparameters for the DNN module, including the number of hidden layers, hidden nodes, learning rates, activation functions, and batch sizes. Subsequently, the testing dataset was used to evaluate the ability of the model to generalize. The testing dataset encompassed scenarios not encountered during the training and validation phases to assess the generalization performance of the model. These scenarios included different user distances and varying numbers of antennas at the BS. The training/validation dataset consisted of 14,000 samples, covering data rates ranging from
bits/s/Hz to
bits/s/Hz, EH requirements from
dBm to
dBm, eight antennas at the BS, and the previously mentioned range of distances. The specifics of the testing dataset for each scenario are outlined later in this paper. In addition, the exponential linear unit (ELU) was selected as the activation function for the hidden layers, and the Adam algorithm was used as the optimizer. The training data for the PCA module were composed of 6000 channel vectors. Regarding the PSO-SDR method, the simulation parameters considered for the PSO algorithm after fine-tuning the parameters were
,
,
, and
.
Firstly, this paper presents an analysis of the convergence behavior of the DNN module while considering different hyperparameters.
Figure 6 shows the convergence performance of the DNN module under varying learning rates (lr), numbers of hidden layers, and numbers of principal components (PCs) in the PCA module. In the legend in
Figure 6, the last term represents the number of units × the number of hidden layers, where all hidden layers have the same number of hidden units. Regarding the learning rate, a high learning rate leads to rapid convergence, but it can also result in overfitting, as observed in the case of lr = 0.005. On the other hand, low learning rate values require a significantly higher number of epochs to achieve convergence, and they may not guarantee the lowest error value, as observed in the case of lr = 0.0001. Regarding the number of PCs, it is important to note that the PCA module was applied to each channel vector, resulting in each channel vector being represented by a certain number of PCs. The lowest error was achieved when the number of PCs was 2 or 3. In particular, when PC = 2, it implies a lower number of required input nodes in the DNN module. For the remainder of the simulations, a learning rate of lr = 0.001 and PC = 2 were selected based on these observations.
We utilized the recurrent neural network (RNN) model [
38] in our simulation results for comparative analysis. RNN, a type of neural network, is specifically designed to analyze sequences of vectors, where values in successive vectors exhibit interrelationships. However, it is noteworthy that each sample in our dataset comprised a singular vector containing the user’s rate requirements, EH requirements, and the reduced representation of the channel vectors post-PCA module. To incorporate the RNN into our simulations, we configured the number of vectors in the sequence to one and set the size of the hidden layer to 150.
Figure 7 shows the variation of the sum transmission power of the PB and BS with respect to the data rate requirements,
, considering a minimum EH requirement of
dBm. The data rate requirement serves as a constraint in the proposed optimization problem (5) and must be satisfied by the BS for all users in the system. As the rate requirement increases, the BS must allocate more transmission power to meet the specified rate, thereby increasing the total transmission power. Although the PB does not transmit information signals, their energy-carrying signals aid in meeting the EH requirements but are considered interference at the ID module of PS users. In particular, we observed that deploying the PB enabled a reduction of up to 3 dBm in the total transmission power for all the multiple-access methods compared. Furthermore, the RSMA scheme achieved significantly lower transmission power with and without the PB compared to their respective SDMA- and NOMA-based counterparts. Moreover, the proposed DNN-RSMA-based solution achieved a similar result to the near-optimal scheme of the PSO-SDR RSMA while significantly reducing the computational complexity. In particular, the complexity of the PSO-SDR method is denoted as
, and the complexity of the DNN-based method is represented as
, where
is the computational complexity of solving problem (7) using the SDR method, and
represents the computational complexity of predicting the common rate variables. Consequently, the proposed DNN-based method was approximately
times faster than the PSO-SDR near-optimal scheme. In addition, we observed that the RNN model achieved performance comparable to that of the proposed DNN. This similarity arises because when the input is a single vector, the RNN exhibits a layer-wise architecture similar to that of a basic neural network. Furthermore, the proposed DNN demonstrated lower computational time compared to the RNN. Specifically, the RNN required a computational time of 191.232 s for training and 0.002 s for testing, whereas the proposed DNN required a computational time of 40.295 s for training and 0.001 s for testing. These simulations were conducted on a computer equipped with an Intel Core i7-6700 CPU and 16 GB of RAM. Therefore, the proposed DNN emerges as the most suitable neural network architecture for solving the proposed optimization problem.
RSMA outperformed SDMA because of its capability to decode a portion of the interference through the SIC process on the common message (
Figure 7), whereas SDMA treats all interference as noise. In the NOMA scheme, the difference arises from using the SIC process. In RSMA, the SIC process aims to cancel the interference from the common message. In contrast, in the NOMA scheme, the SIC process is employed to eliminate interference from the messages of users with weaker channel strengths. On the other hand, achieving excellent performance under the NOMA scheme in multi-antenna systems necessitates that users whose messages will be decoded by SIC have weaker channel strengths and sufficiently aligned channels. This alignment requirement is uncommon in real-world deployments and the channel model considered in Equation (
13). Furthermore, as the rate requirement of the PS users increases, the transmission power needed for the beamforming vectors also increases, leading to performance degradation in the NOMA scheme after reaching
bits/s/Hz.
The variation of the sum transmission power of the PB and BS with respect to the EH requirements,
, is presented in
Figure 8, considering a minimum rate requirement of
bits/s/Hz. Similar to
Figure 7, a significant reduction of approximately 3 dBm was observed in the total transmission power due to the deployment of the PB. Furthermore, the benefit of applying RSMA compared to conventional methods, such as NOMA and SDMA, in considerably reducing the transmission power was demonstrated. This is because RSMA employed the SIC procedure, which improved the data rate at the ID module of PS users while simultaneously reducing the PS factor to enhance harvested energy, thus reducing the transmission power. Furthermore, the proposed DNN-based approach performed similarly to the PSO-based method while significantly reducing computational complexity.
Next, we analyzed the generalization performance of the proposed DNN-based method across considerably different scenarios from those used in the training phase. In particular, we kept the trained DNN module fixed and tested its generalization performance by varying the number of antennas, resulting in a modification of the components of the channel vectors, and by varying the distance from the BS, resulting in a significant alteration of the channel strengths. The channel vectors, along with the requirements for data rates and energy harvesting, served as inputs for the PCA and DNN modules in the proposed scheme to generate the common rate variables, as detailed in
Section 3.1.
Figure 9 presents the variation of the sum transmission power of the PB and BS with respect to the number of antennas equipped at the BS, considering a required minimum rate of
bits/s/Hz and a required minimum EH of
dBm. As the number of antennas at the BS increased, there was a reduction in the transmission power due to the increased degrees of freedom. In the case of the DNN-based method, the training dataset solely consisted of samples representing a scenario with a BS equipped with eight antennas, as detailed at the beginning of
Section 4. In contrast, the testing data in
Figure 9 contained samples encompassing scenarios with varying numbers of antennas at the BS. The utilization of the same trained DNN module with different numbers of antennas can be attributed to the PCA module. In particular, independent PCA modules for each scenario were trained based on the number of antennas while maintaining the same number of principal components,
L. Consequently, despite variations in the dimensionality of the channel vectors, the output of the PCA module remained consistent across all different numbers of antennas. Training a PCA module is a straightforward task because it only requires channel vector samples and does not necessitate labels or target values.
The proposed DNN-based scheme demonstrated remarkable performance comparable to that of the PSO-based method, even when the number of antennas differed from the training scenarios (
Figure 9). This indicates strong generalization capabilities and robustness to environmental changes. Furthermore, the RSMA-based methods consistently achieved lower total transmission power compared to the traditional NOMA and SDMA methods. Moreover, the deployment of the PB resulted in a significant decrease in the transmission power, even with an increasing number of antennas at the BS. This can be attributed to the ability of the PB to mitigate signal attenuation because of the distance from the transmitter, a critical factor in scenarios with EH requirements for users.
Figure 10 shows the variation of the sum transmission power of the PB and BS with respect to the position of the PS users, considering a minimum rate requirement of
bits/s/Hz and a required EH of
dBm. In particular, the range of the position of the PS users on the x-axis,
, was varied. This position was randomly selected within the region of
, where
varied to analyze performance as the distance between the PS users and the BS increased. The transmission power increased as
increased because the average distance from the user to the BS also increased with higher values of
. Additionally, the difference in the transmission power between scenarios with and without the PB diminished as the value of
increased, owing to the fixed location of the PB across all
values. As
increased, users were positioned farther from the PB, resulting in increased attenuation of the energy signal. Although the energy signal at the PS users was utilized for energy harvesting at the EH module, it was considered interference for the ID module. Consequently, as the received energy signal power from the PB diminished due to increased distance, it contributed less to the EH at the PS users. Meanwhile, reliance on the received power from the information signal transmitted by the BS became more dominant, leading to a reduction in the impact of the PB deployment. Moreover, as shown in the previous figures, the RSMA method consistently achieved a considerable decrease in the total transmission power in comparison to the NOMA and SDMA methods.
In the case of the DNN-based method, the training dataset consisted solely of samples representing a scenario with
m, as detailed at the beginning of
Section 4. In contrast, the testing data in
Figure 10 contained samples encompassing scenarios with varying values of
. The proposed DNN-based scheme exhibited high generalization performance, achieving comparable transmission power to the PSO-based method. Furthermore, this study analyzed the effect of slightly changing the number of PCs in the PCA module. Similar to the results shown in
Figure 6, the cases of PC = 2 and PC = 3 achieved similar results, with PC = 2 having lower transmission power and demonstrating the best generalization performance.