Next Article in Journal
A Novel Integrating Data Envelopment Analysis and Spherical Fuzzy MCDM Approach for Sustainable Supplier Selection in Steel Industry
Next Article in Special Issue
Validation of Parallel Distributed Adaptive Signal Processing (PDASP) Framework through Processing-Inefficient Low-Cost Platforms
Previous Article in Journal
Building Damage Assessment Based on Siamese Hierarchical Transformer Framework
Previous Article in Special Issue
Performance of Dense Wireless Networks in 5G and beyond Using Stochastic Geometry
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Scalable Cell-Free Massive MIMO with Multiple CPUs

School of Information Science and Technology, Nantong University, Nantong 226019, China
*
Author to whom correspondence should be addressed.
Mathematics 2022, 10(11), 1900; https://doi.org/10.3390/math10111900
Submission received: 28 April 2022 / Revised: 26 May 2022 / Accepted: 31 May 2022 / Published: 1 June 2022

Abstract

:
In this paper, we consider the uplink of a scalable cell-free massive MIMO (CF-M-MIMO) system where user equipments (UEs) are served only by a subset of access points (APs). All APs are physically divided into predetermined “real clusters”, which are linked to different cooperative central processing units (CPUs). Based on the cooperative nature of the considered communications framework, we assume that each UE is affiliated with a “virtual cluster”, which is associated with some APs coming from different real clusters. Thanks to the degrees of cooperation among multiple CPUs, the uplink spectral efficiencies (SEs) of four different levels are analyzed. To achieve system scalability, the CF-M-MIMO system with multiple CPUs is introduced, which leads to lower SE. To this end, we design a joint combining method based on statistical channel state informations (CSIs), which not only has low complexity but also improves the SE of the system. Simulation results indicate that the average rate of our proposed method can be improved by about 30%.

1. Introduction

The demand of user equipments (UEs) for high transmission rates and wide communication coverage are boosting the evolution of wireless network architecture. The distributed communication system [1] cooperates through multiple access points (APs) to realize coherent combinations, reduce interference and supply macro diversity gain, so as to obtain high spectral efficiency. Distributed antenna systems (DAS), coordinated multipoint with joint transmission (CoMP-JT) and network multiple-input multiple-output (Network MIMO) are suggested as some examples of cellular distributed systems with the joint transmission of multiple APs [2,3,4]. Theoretically, the interference between cells can be mitigated by designing the cooperation of adjacent clusters, in which the APs belonging to different clusters jointly serve all UEs in the region. However, the cooperation requires perfect channel state informations (CSIs) to be exchanged, which is not scalable in practice [5]. In addition, the inter-cell or out-of-cluster interference caused by the cellular mode cannot be suppressed, which also results in performance limitations. In fact, the standardization of CoMP-JT in cellular networks has not been widely promoted.
Recently, massive MIMO has become the key technology of 5G. Massive MIMO has the ability to improve the spectral efficiencie (SE) by about 10 times over small cell networks [6,7], by improving the base station (BS) hardware instead of increasing the number of BSs. The SE gain benefits from a lot of antennas in each BS, which are applied for digital beamforming and, particularly, to spatially multiplex multiple UEs on the same time-frequency resource [8,9]. With the development of massive MIMO technology [10], the interest in distributed massive MIMO systems has been revived. Distributed massive MIMO systems are equipped with many BSs, and each BS has an array formed by a large number of antennas. Similar to CoMP-JT, distributed massive MIMO systems can be designed for joint transmission [7,11] through multiple BSs, and are expected to achieve better performance than cellular systems. Therefore, many different architectures based on massive MIMO have been proposed.
A recent system architecture, referred to as cell-free massive MIMO (CF-M-MIMO), was proposed in [12,13,14,15]. The basic idea is that a large number of APs are distributed in a wide area to serve a small number of UEs. A central processing unit (CPU) connects to these APs, which can also be called an edge-cloud processor [16] or cloud radio access network (C-RAN) data center [17].

1.1. Related Work

Compared with traditional small cell systems, CF-M-MIMO systems have made changes in the way of serving the UE. Unlike small cell systems in that each UE is served by only one AP, CF-M-MIMO systems allow the CPU to connect multiple APs for digital beamforming [11], which provides coordinated transmission for several UEs on the same time-frequency resource. Early studies have shown that appropriate signal processing methods, such as regularized zero-forcing (RZF) or minimum mean-squared error (MMSE) precoding in the downlink, can be used separately at each CPU to suppress the interference from other APs connected to the same CPU [18,19,20], rather than cooperate with other CPUs.
CF-M-MIMO combines the benefits of using time division duplex (TDD) mode [10], distributed massive MIMO technology and user-centric transmission. The TDD operation relies on channel reciprocity to obtain CSIs [21], which can reduce the fronthaul burden. Thanks to the antennas with uniform amplitude distribution that can more effectively combat shadow fading, the distributed massive MIMO systems can have greater coverage than collocated massive MIMO systems [22]. In addition, the user-centric approach enables CF-M-MIMO systems to mitigate the inter-cell interference [23]. Without cell boundaries, all the UEs are considered to be surrounded by a large number of APs, thus the boundary effect can be eliminated [13,23]. Although recent works have shown that CF-M-MIMO systems have significant advantages in many aspects, there is still a big gap between theoretical analysis and practical application. For example, the key assumption that all the APs are controlled by a single CPU is hard to achieve scalability in CF-M-MIMO.
Actually, the implementation of the scalable CF-M-MIMO mainly faces two problems.
  • Despite [18] having proved that excellent performance can be achieved when all the APs are connected to a single CPU for joint transmission, it is difficult to let a single CPU control all the APs when the number of APs is large. Moreover, joint transmission requires a single CPU to centralize the signal processing, which puts high demands on the CPU’s processing capacity.
  • The traditional precoding and power control of CF-M-MIMO act on the overall APs and UEs, which are difficult to realize in practice. Although a series of studies have proved the superiority of the final results [18,24,25], the complexity of these algorithms grows polynomially with the number of APs or UEs. Moreover, each AP needs to transmit instantaneous CSIs to the CPU, which is also difficult to achieve scalability when there are a large number of APs in the CF-M-MIMO systems.
The scalability of the power control or precoding algorithms has been considered in [26,27,28,29,30]. The article [26] concentrated on the scalability of CF-M-MIMO with a single CPU. If the CPU only controls a partial number of APs serving each UE, the complexity of precoding or power control is not increased with the number of APs. The introduction of the dynamic cooperation clustering (DCC) network makes the above method possible. In papers [28,29], the authors proposed the power allocation strategies based on long-term fading, which avoid a large amount of information exchanging among multiple CPUs and APs. The local partial zero-forcing precoding and combining were considered in [27,30], which depend on the large-scale fading and act on partial UEs and APs. These papers provide feasible methods to achieve scalable implementations. However, these papers mentioned above only focus on the first issue, but do not discuss the second one.
In [31], the author considered the scenario of multiple CPUs and discussed the scalability aspects of CF-M-MIMO. However, cooperation among multiple CPUs is less understood in the literature and still open for investigation. In order to enhance the competitiveness of multiple CPUs systems, how to exchange the information for cooperation among multiple CPUs is an essential problem. As the above considerations demonstrate, exchanging instantaneous CSIs for joint transmission achieves high performance gain [18,32,33]. However, this method puts a great burden on the backhaul, thus it is unscalable. Since the statistical CSIs remain unchanged for a long time, it is considered to be available in a scalable network. In summary, how to use statistical CSIs for cooperative transmission is a key problem. In [34], the authors did not fully exploit the potential performance of CF-M-MIMO with multiple CPUs when statistical CSIs can be exchanged among multiple CPUs. In [34], the estimation of signals from UEs was calculated by simply taking the average of all the signals, which led to a large spectral efficiency (SE) loss. In addition, the authors did not take into account the DCC network, which made it difficult for the network to achieve scalability. All of these motivate the current work to focus on cooperation among multiple CPUs with statistical CSIs. The contributions of this paper are summarized as follows:
  • We consider a taxonomy with four different implementations of CF-M-MIMO with multiple CPUs, which are classified by different degrees of cooperation among the CPUs. The four different levels of cooperation can be called centralized connectivity, distributed connectivity and complex processing, distributed connectivity and simple processing, and no connectivity, respectively. The difference of these levels is shown in Table 1. We derive novel SE expressions for different levels of multiple CPUs cooperation in the uplink transmission. In addition, unlike most scenarios that specify the number of CPUs participating in the service, we consider a completely user-centric way to select APs.
  • We propose a novel signal processing algorithm for cooperation among multiple CPUs. Each CPU processes the local information from its APs, and then transmits these signals to a CPU for final decoding. Based on the generalized Rayleigh quotient, we use simple weighted processing to linearly combine received signals from multiple CPUs with statistical CSIs.
  • We compare the performance of different cooperation levels. Monte Carlo simulation results show that our proposed distributed connectivity scheme can achieve scalability with lower backhual burden, and the performance loss is negligible compared to the centralized connectivity scheme.

1.2. Paper Structure

The rest of this paper is organized as follows. Section 2 defines the uplink of CF-M-MIMO system with multiple CPUs. Next, Section 3 presents the four levels of CPU cooperation. Simulation results compare the performance of the four levels of CPUs cooperation in Section 4. Finally, Section 5 concludes this paper.

2. System Model

Notation: Boldface lowercase letters denote column vectors and boldface uppercase letters denote matrices. The superscripts () * , ( ) T , and () H indicate transpose, conjugate and transpose. Since we cannot guarantee full rank of matrices, we use · to denote the matrix pseudoinverse. The n × n identity matrix stands for I N . The diag h 1 , , h n is used to denote a block-diagonal matrix. z CN 0 , σ 2 denotes a circularly symmetric complex Gaussian random variable (RV) z with zero mean and variance σ 2 . | Z | is applied to denote the cardinality of the set Z .
We consider a CF-M-MIMO system operating in TDD mode, which consists of K single-antenna UEs, L single-antenna APs and U CPUs as shown in Figure 1. All of these APs are divided into U predetermined real clusters (RCs). Each RC is connected to a different CPU via error-free fronthaul links. The indexes of the APs belonging to the u-th RC are included in the subsets P u . All CPUs are interconnected through error-free backhual links and can be cooperated with each other. The generic k-th UE is served by a subset of the APs and the subset is termed virtual cluster (VC). We define by Z k the sets consisting of AP indexes in the k-th VC. Although the RCs are non-overlapped, due to the connection between different CPUs, the VCs can be overlapped so that CPUs can cooperate to serve UEs in the network. In addition, A k u is used to define the set including the AP indexes belonging to both the u-th RC and the k-th VC, i.e.,
A k u = l P u : l Z k .
If the set A k u , is not empty, i.e., at least one AP in P u serves the k-th UE, then the u-th CPU serves UE k.
The channel between AP l and UE k can be modeled as:
h k l = β k l 1 / 2 g k l ,
where β k l is the large-scale fading coefficient related to geometric path loss and shadow fading and g m k is the small scale fading. The large-scale fading coefficient β k l changes tardily, and thus, can be traced and acquired. We assume that g k l , l = 1 , , L , k = 1 , , K , are independent and identically distributed (i.i.d.) CN ( 0 , 1 ) RVs.

2.1. Channel Estimation

We consider the uplink with τ c channel use, which consists of τ p channels for transmitting pilots and τ c τ p channels for payload data. In the pilot transmission phase, τ p mutually orthogonal τ p -length pilot signals φ 1 , φ 2 φ τ p are applied for channel estimation, where φ n 2 = τ p . The case of practical interest is a big network with τ p < K , which means that more than one UE is assigned to each pilot. This is so-called pilot contamination that degrades the system performance. We suppose that t k 1 , , τ p is the index of the pilot assigned to UE k and G k 1 , 2 . . . K denotes the subset of UEs which are assigned the same pilot as UE k. When all UEs send their pilots, the received signal at AP l is:
y l p i l o t = i = 1 K p i h i l φ t k H + n l ,
where p i is the transmission power of UE i, n l C τ p × 1 is the noise vector whose elements are i.i.d. C N ( 0 , σ 2 ) RVs. According to the received signal y l p i l o t , AP l places it on the project with φ t k * to obtain y ^ l , which is given by:
y ^ l = i = 1 K p i τ p h i l φ t k H φ t k * + 1 τ p n l φ t k * = i G k p i τ p h i l + 1 τ p n l φ t k * .
Applying standard formulas from paper [13], the minimum mean-squared error (MMSE) estimate of h k l is expressed as:
h ^ k l = p k τ p β k l Ψ t k l 1 y ^ l ,
where:
Ψ t k l = E y ^ l y ^ l H = i G k τ p p i β i l + σ 2 .
We define the estimation error as h ˜ k l = h k l h ^ k l , where h ^ k l CN 0 , p k τ p β k l Ψ t k l 1 β k l and h ˜ k l CN 0 , C k l with:
C k l = β k l p k τ p β k l Ψ t k l 1 β k l .
The channel estimation error mainly caused by pilot contamination.

2.2. Uplink Payload Transmission

During the uplink transmission, all the UEs transmit signals to the APs. In the AP cluster A k u , the signal received by AP l is:
y l ul = i = 1 K h i l s i + n u , l ,
where s i is information-bearing signal sent by UE i with E s i 2 = 1 and n u , l CN 0 , 1 is additive noise at AP l.

2.3. Dynamic Cooperation Clustering Network

The DCC network was considered in [20,26] to achieve scalability in CF-M-MIMO, and we use it to set the VCs. It is obtained by stipulating a number d i l , for i = 1 , , K and l = 1 , , L , which set the subset of APs that serve UEs. More specifically, d i l is 1 if AP l is allowed to convey and decode signals from UE i and 0 otherwise. The DCC network does not alter the received uplink signals since all the APs physically receive the signals from all the UEs. However, only a subset of the APs is participating in signal detection. This is equivalent to setting d i l as:
d i l = 1 if l Z i 0 if l Z i .

3. Multiple CPUs Cooperative Transmission

In order to obtain better performance, each VC appoints a M a s t e r C P U that possesses high computational resources to receive information from other CPUs for joint processing. Four different levels of cooperation are considered below.

3.1. Level 4: Centralized Connectivity

The highest level of multiple CPUs cooperation network is that all the CPUs share their instantaneous CSIs with the Master CPU. Thus, the Master CPU can be capable of designing an overall MMSE combining matrix. In other words, all the CPUs except the Master CPU play as relays without processing signals from the APs and forward all signals to the Master CPU [35]. In fact, it is equivalent to setting a large VC for each UE, where all the APs in the large VC are managed by a single CPU. In the uplink transmission phase, the received signal at the Master CPU is expressed as:
s ^ k = l = 1 L v k l H d k l y l u l = v k H D k h k s k + i = 1 , i k K v k H D k h i s i + v k H D k n ,
where D k  = diag [ d k 1 , , d k L ] C L × L is a block diagonal matrix, n = n 1 , , n L H C L × 1 denotes noise vector at the k-th VC and h k = h k 1 , , h k L H C L × 1 is the collective channel vector at the k-th VC.
Although the capacity of Level 4 with perfect CSIs is supposed to be known in some cases [35], the ergodic capacity is usually unknown when taking into account imperfect CSIs. Nevertheless, we can utilize the lower bounds of standard capacity to analyze the performance [36], which can be used to achieve the following SE.
Proposition 1.
At Level 4, an achievable SE for UE k is:
SE k ( 4 ) = 1 τ p τ c E log 2 1 + SINR k ( 4 ) ,
and a lower bound of capacity to signal-to-interference-and-noise ratio (SINR) for UE k is given by:
SINR k ( 4 ) = p k v k H D k h ^ k 2 i = 1 , i k K p i v k H D k h ^ i 2 + v k H L k v k
with:
L k = D k i = 1 K p i C i + σ 2 I L D k ,
where C i = diag [ C i 1 , , C i L ] C L × L .
Proof. 
This proof follows the same steps as the proof of [18] for CF-M-MIMO with a single CPU.    □
The pre-log factor 1 τ p τ c in (11) denotes the fraction of channel uses which are utilized for uplink data transmission. The term SINR k ( 4 ) uses the form as effective instantaneous SINR [24]. The numerator is the desired signal power received over the estimated channel, and the denominator is interference plus noise. We note that SINR k ( 4 ) is in the form of the generalized Rayleigh quotient. Hence, the MMSE combining can be achieved to maximize (12).
Corollary 1.
According to the generalized Rayleigh quotient [24], the SINR k ( 4 ) (12) for UE k is maximized by the MMSE combining:
v k = p k i = 1 K p i D k h ^ i h ^ i H D k + L k D k h ^ k ,
which obtains the maximum value:
SINR k ( 4 ) = p k D k H h ^ k i = 1 K p i D k h ^ i h ^ i H D k + L k D k h ^ k .
Proof. 
It follows from [24] (Lemma B.10), and note that (12) is a generalized Rayleigh quotient with respect to v k .    □
Table 2 shows the number of complex scalars that the other CPUs send to the Master CPU through the backhaul, in which A M C k Z k includes the indexes of the APs that serve UE k and are controlled by the Master CPU. In each coherence block, all the CPUs except the Master CPU need to send τ c Z k τ c A M C k complex scalars to the Master CPU. In addition, the collective channel D k h k : k = 1 , , K are supposedly available at the Master CPU, which can be expressed as ( K Z k ) / 2 complex scalars.
Although centralized processing can obtain the best performance in most cases, it puts high demands on bandwidth and the delay of backhaul. This poses a difficulty in achieving scalability in CF-M-MIMO. In addition, it is difficult for the Master CPU to handle a large amount of instantaneous CSIs. Therefore, a more practical connection mode is considered below.

3.2. Level 3: Distributed Connectivity and Complex Processing

This solution requires that statistical CSIs can be exchanged among multiple CPUs. Instead of setting a large VC and sending all the signals to the Master CPU for unified processing, we select a set of CPUs for each UE to reduce the pressure of backhaul. We assume that the CPU in each RC can deal with its signals collected from APs, and then these signals are transferred to the Master CPU for final decoding.
Based on the neighborhood criterion, we suppose that the corresponding setting of the DCC framework in the u-th RC is D k u = diag [ b k 1 , , b k | P u | ] C | P u | × | P u | , where b k l can be written as:
b k l = 1 if l A k u 0 if l A k u .
We assume by U k the sets consisting of the CPU indexes in the k-th VC. Unlike Level 4, each CPU can utilize the local channel estimation to design the combining. Let v k u C | P u | × 1 be the local combining vector that CPU u determines for UE k. The local estimates at CPU u is:
s ^ k u = v k u H D k u h k u s k + i = 1 , i k K v k u H D k u h i u s i + v k u H D k u n k u ,
where v k u = v k 1 , , v k | P u | H C | P u | × 1 denotes the local combining for UE k designd by CPU u, h k u = h k 1 , , h k | P u | H C | P u | × 1 is the collective channel vector at CPU u and n k u = n 1 , , n | P u | H C | P u | × 1 denotes the collective noise vector at CPU u.
Then, the local estimates s ^ k u : u U k need to be sent to the Master CPU, and the Master CPU is capable of using the weights a k u : u U k for linear combination to obtain s ^ k = u U k a k u * s ^ k u . Let g k i = v k 1 H D k 1 h i 1 . . . v k | U k | H D k | U k | h i | U k | H C | U k | × 1 be the | U k | -dimensional vector with the receive-combined channels between UE k and each of the CPUs. Equation (17) can be written as:
s ^ k = a k H g k k s k + i = 1 , i k K a k H g k i s i + n k ,
where n k = u U k a k u * v k u H n k u , a k = a k 1 , , a k | U k | H C | U k | × 1 denotes the weighting coefficient vector and a k H g k i : i = 1 , , K represents weighted signal. Using the same combining as Level 4, v k u is expressed as:
v k u = p k i = 1 K p i D k u h ^ i u h ^ i u H D k u + L k u D k u h ^ k u ,
where:
L k u = D k u i = 1 K p i C i + σ 2 I | P u | D k u .
Note that a k is optimized by the Master CPU through statistical CSIs to maximize the SEs. This approach is similar to a continuation of the large-scale fading decoding (LSFD) framework in [18,33,37,38]. We make it further developed to CF-M-MIMO with multiple CPUs. Although the Master CPU does not know the effective channel a k H g k i , it can be replaced by its average a k H E g k i . Note that a k H E g k i is non-zero. Hence, SINR k ( 3 ) for UE k can be achieved below.
Proposition 2.
At Level 3, an achievable SE for UE k is:
SE k ( 3 ) = 1 τ p τ c E log 2 1 + SINR k ( 3 ) ,
and the effective SINR k ( 3 ) can be written as:
SINR k ( 3 ) = p k a k H E g k k 2 i = 1 K p i E a k H g k i 2 p k a k H E g k k 2 + σ 2 a k H N k a k ,
where N k = diag E v k 1 H D k 1 2 , , E v k | U k | H D k | U k | 2 C | U k | × | U k | .
Proof. 
The proof is given in Appendix A.    □
Note that SINR k ( 3 ) conforms to the generalized Rayleigh quotient [24] (Lemma B.10). The deterministic weighting vector a k can be calculated to maximize SINR k ( 3 ) .
Corollary 2.
Weighting vector a k is expressed as:
a k = i = 1 K p i E g k i g k i H + σ 2 N k E g k k ,
which leads to the maximum result:
SINR k ( 3 ) = p k E g k k H i = 1 K p i E g k i g k i H + σ 2 N k p k E g k k E g k k H E g k k .
The above SINR k ( 3 ) holds for any combining scheme and only requires statistical CSIs. Although it can not approximate a k H g k i with its mean value a k H E g k i completely when the number of antennas is small, it is still the best available capacity bound. The pseudo-code for this algorithm is shown in Algorithm 1.
Algorithm 1 Optimization algorithm for Level 3
Input: Channel gain h k l , MMSE combining v k u , noise variance σ 2 , DCC matrix D k u
Output: a k
  1:  Initialization: calculate U k -dimensional vector g k i = v k 1 H D k 1 h i 1 , . . . , v k | U k | H D k | U k | h i | U k | H
  2:  for u = 1:U do
  3:        for k = 1:K do
  4:              if  D k u = 1  then
  5:                    calculate diagonal matrix N k = diag E v k 1 H D k 1 2 , , E v k | U k | H D k | U k | 2
  6:                    update g k k = v k 1 H D k 1 h k 1 , . . . , v k | U k | H D k | U k | h k | U k | H
  7:                    calculate a k by (23).
  8:              end if
  9:        end for
10: end for
In each coherence block, all the CPUs except the Master CPU are required to send K ( τ c τ p ) to the Master CPU, which become K ( τ c τ p ) ( | U k | 1 ) totally. Moreover, the computation of (23) demands for the knowledge of | U k | -dimensional complex scalar g k i , of the | U k | × | U k | complex matrix E g k i g k i H and of the real-valued K × Z k matrix N k for k = 1 , , K . Total K Z k + ( | U k | 2 K 2 + K | U k | ) / 2 , as shown in Table 2.
Compared with Level 4, Level 3 has much milder requirements for backhaul overhead. Owing to the large-scale information being constant for a long time, exchanging statistical CSIs among multiple CPUs is a feasible method.

3.3. Level 2: Distributed Connectivity and Simple Processing

Although Level 3 can achieve high SEs among schemes with local combining at each CPU, it requires the knowledge of a great quantity of statistical parameters, which is also a large number of expense. A large number of statistical parameters needs to be sent to the CPU by APs and then exchanged among multiple CPUs. It may be difficult, especially when there are a lot of CPUs in the network. To confront the challenge, the Master CPU can calculate the estimation of signals s ^ k by simply averaging the local estimates without statistical parameters, as considered in early articles [13,32]. It can be written as:
s ^ k = 1 | U k | u U k s ^ k u ,
where s ^ k u is given by (17). This is equivalent to putting up a k = [ 1 / | U k | , , 1 / | U k | ] H . The achievable SE is shown as follows.
Proposition 3.
At Level 2, an achievable SE for UE k is:
SE k ( 2 ) = 1 τ p τ c E log 2 1 + SINR k ( 2 ) .
The effective SINR k ( 2 ) can be expressed as:
SINR k ( 2 ) = p k E g k k 2 i = 1 K p i E g k i 2 p k E g k k 2 + σ 2 N k ,
where g k i = v k 1 H D k 1 h i 1 , . . . , v k | U k | H D k | U k | h i | U k | H C | U k | × 1 .
Proof. 
This proof follows the same steps as the proof of Proposition 3. □
As for Proposition 3, the above SE can also be calculated by using statistical CSIs. Actually, this method is similar to “ w e a k c o n n e c t i v i t y ” in paper [34].
The number of complex scalars that need to be exchanged among multiple CPUs is the same as Level 3. The main difference between Levels 2 and 3 is that the Master CPU does not need statistical parameters in Level 2. Thus, Level 2 further reduces data requirements compared to Level 3, as shown in Table 2.

3.4. Level 1: No Connectivity

The lowest level is that exchanging the CSIs and signals is only performed between APs and CPUs in each VC. In this case, there is no signal exchange among multiple CPUs. In reality, this network is equivalent to arranging some RCs for UE k and selecting one of them to serve UE k. Note that UE k is only served by a single CPU. This method is similar to Level 4 but the VC contains fewer APs.
Proposition 4.
At Level 1, we let the CPU providing the highest SINR to UE k be responsible for supplying the signal. An achievable SE for UE k is:
SE k ( 1 ) = 1 τ p τ c max u U k E log 2 1 + SINR k u ( 1 ) .
For CPU u, SINR k u ( 1 ) can be expressed as:
SINR k u ( 1 ) = p k v k u H D k u h ^ k u 2 i = 1 , i k K p i v k u H D k u h ^ i u 2 + v k u H L k u v k u ,
where L k u = D k u i = 1 K p i C i + σ 2 I | P u | D k u . The expected value is related to the channel estimation value. Similar to (15), the maximum value of (29) is given by:
SINR k u ( 1 ) = p k D k u H h ^ k u i = 1 K p i D k u h ^ i u h ^ i u H D k u + L k u D k u h ^ k u ,
when v k u = p k i = 1 K p i D k u h ^ i u h ^ i u H D k u + L k u D k u h ^ k u obtained.
Proof. 
This proof follows the same steps as the proof of Proposition 1. □
Table 3 shows the computational complexity of different level of cooperation per coherence block. The proof follows [24] (Lemma B.1).

4. Simulation Results

In this section, we have made a series of comparisons on the uplink performance of CF-M-MIMO with multiple CPUs, with different levels of multiple CPUs cooperation and either using DCC or not, and power control. In addition, when the total number of APs remains unchanged, the effect of the number of CPUs in the VCs on the SEs is investigated.
We consider a CF-M-MIMO network where 300 APs, 20 UEs, and 30 CPUs are randomly distributed in a 2000 × 2000 m square. Each CPU controls the 10 APs closest to it. By using a wrap-around structure, we ensure that there are a large number of APs around each UE. In addition, we assume each UE is served by 20 APs. We use the same propagation model as in [26] with τ c = 200 , τ p = 10 , p i = 100 mW, and 20 MHz bandwidth.

4.1. Uplink Transmission

Figure 2 shows the cumulative distribution function (CDF) of the SEs at different levels of multiple CPUs cooperation. Level 4 provides the highest SEs but it does not have significant advantages at high possible SE points. The reason for this phenomenon is that the interference between UEs is very large and cannot be completely eliminated due to the few APs in the VCs. By contrast, although Level 3 also cannot completely suppress interference, it can use weights to linearly combine the received information, which achieves better performance. Since Level 2 does not have the above functions, it lags behind Level 3 in performance. Level 1 did not cooperate with multiple CPUs, which leads to the lowest SEs.
Figure 3 considers the same setup but all the APs serve all the UEs. It indicates that Level 4 performs the best all the time. The reason is that there are enough APs provided for Level 4 to suppress the interference. It shows that Level 4 relies on the cooperation of multiple APs, thus it is more sensitive to the number of APs. Level 1 still performs the worst in terms of SE. Levels 2 and 3 are roughly in the middle of Levels 1 and 4.

4.2. Power Allocation

A power allocation scheme is considered in [29,39], which is carried out in the uplink transmission phase and only related to statistical CSIs, so it is a feasible scheme for Level 3. The power allocation algorithm can be expressed as:
p k = η l Z k β k l p max ,
where η = min 1 i K l Z i β k l and p max denotes the maximum uplink power of each UE. This algorithm is applied to Level 3 and the results are shown in Figure 4.
Figure 4 exposes that Level 3 with power allocation improves the 90%-likely throughput. However, Level 3 with with full power can achieve higher SEs, which means the power allocation algorithm can obtain better SEs of the worst UEs at the expense of total SE loss.

4.3. Varying Numbers of CPUs

Figure 5 plots the CDF of the SEs of Level 3 when varying numbers of CPUs in the VCs. We assume that the total number of APs and CPUs serving for each UE is 24 and U ¯ , respectively. To facilitate calculation, we specify that each CPU provides the same number of APs in the VCs (i.e., when U ¯ = 3 , 8 APs are selected from the RC of each CPU to associate with the UE.) To satisfy fairness, we shrink the simulation area by 100 times to ensure that each AP can provide the effective signals. The result shows that increasing the number of U ¯ leads to decreasing the SEs generally, which are caused by reducing the degree of freedom for suppressing the interference in each RC.

4.4. Varying Numbers of UEs

Figure 6 demonstrates the CDF of the SEs of Level 3 under different numbers of UEs. As can be shown in Figure 6, we can see that the performance of Level 3 only slightly degrades as more UEs are served. It indicates that Level 3 can effectively suppress the interference coming from more UEs.

4.5. Varying the Uplink Transmit Power

Figure 7 plots the CDF of the SEs of Level 3 when varying the uplink transmit power for p i = p . Although Level 3 can deal with the interference that comes from UEs, it is greatly affected by the uplink transmit power. The result shows that the performance of Level 3 decreases with the degrades in uplink transmit power.

5. Conclusions

This paper has studied a scalable and user-centric CF-M-MIMO with multiple CPUs. All the APs were divided into several RCs linked to different CPUs and each UE was affiliated with a VC. According to the different degrees of multiple CPUs cooperation, we have derived SE expressions for four different levels of cooperation among multiple CPUs. In particular, we have proposed a low complexity optimization algorithm based on statistical CSI (i.e., Level 3). The results have shown that Level 3 can achieve scalability with lower backhual burden, and the performance loss is negligible compared to Level 4.

Author Contributions

Conceptualization, F.L. and Q.S.; methodology, F.L.; software, F.L.; validation, F.L. and Q.S.; formal analysis, F.L.; investigation, F.L.; resources, Q.S.; data curation, F.L.; writing—original draft preparation, F.L. and Q.S.; writing—review and editing, F.L., Q.S., X.J. and X.C.; visualization, F.L.; supervision, Q.S.; project administration, Q.S.; funding acquisition, Q.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by National Nature Science Foundation of China under Grant 61971467, in part by the Key Research and Development Program of Jiangsu Province of China under Grant BE2021013-1, in part by the Qinlan Project of Jiangsu Province, and in part by Natural Science Research Program of Nantong Vocational University under Grant 21ZK01.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Appendix A

Proof of Proposition 2.
Since the Master CPU does not know the effective channel, it needs to treat the average channel gain a k H E { g k i } as the true deterministic channel. Hence, the signal model can be written as:
s ^ k = a k H E { g k k } s k + I k ,
which is a “deterministic” channel with the additive interference plus noise term:
I k = a k H g k k a k H E { g k k } s k + i = 1 , i k K a k H g k i g k i H a k s i + n k .
The interference term I k has zero mean and is uncorrelated with the signal term in (A1) since:
E { a k H g k k a k H E { g k k } } = 0 E { | s k | 2 } = 0 .
Therefore, we can apply [24] (Cor. 1.3) to obtain the achievable SE in Proposition (2).

References

  1. Dai, H.; Molisch, A.F.; Poor, H.V. Downlink capacity of interference-limited MIMO systems with joint detection. IEEE Trans. Wirel. Commun. 2004, 3, 442–453. [Google Scholar] [CrossRef]
  2. Choi, W.; Andrews, J.G. Downlink performance and capacity of distributed antenna systems in a multicell environment. IEEE Trans. Wirel. Commun. 2007, 6, 69–73. [Google Scholar] [CrossRef]
  3. Andrews, J.G.; Choi, W.; Heath, R.W. Overcoming interference in spatial multiplexing MIMO cellular networks. IEEE Wirel. Commun. Mag. 2007, 14, 95–104. [Google Scholar] [CrossRef]
  4. Irmer, R.; Droste, H.; Marsch, P.; Grieger, M.; Fettweis, G.; Brueck, S.; Mayer, H.P.; Thiele, L.; Jungnickel, V. Coordinated multipoint: Concepts, performance, and field trial results. IEEE Commun. Mag. 2011, 49, 102–111. [Google Scholar] [CrossRef] [Green Version]
  5. Lozano, A.; Heath, R.W.; Andrews, J.G. Fundamental limits of cooperation. IEEE Trans. Inf. Theory 2013, 59, 5213–5226. [Google Scholar] [CrossRef] [Green Version]
  6. Björnson, E.; Kountouris, M.; Debbah, M. Massive MIMO and small cells: Improving energy efficiency by optimal soft-cel coordination. In Proceedings of the 20th International Conference Telecommunication (ICT 2013), Casablanca, Morocco, 6–8 May 2013; pp. 1–5. [Google Scholar]
  7. Truong, K.T.; Heath, R.W. The viability of distributed antennas for massive MIMO systems. In Proceedings of the 45nd Asilomar Conference Signals, Systems and Computers, Pacific Grove, CA, USA, 3–6 November 2013; pp. 1318–1323. [Google Scholar]
  8. Björnson, E.; Sanguinetti, L.; Hoydis, J.; Debbah, M. Optimal design of energy-efficient multi-user MIMO systems: Is massive MIMO the answer. IEEE Trans. Wirel. Commun. 2015, 14, 3059–3075. [Google Scholar] [CrossRef] [Green Version]
  9. Larsson, E.G.; Tufvesson, F.; Edfors, O.; Marzetta, T.L. Massive MIMO for next generation wireless systems. IEEE Commun. Mag. 2014, 52, 186–195. [Google Scholar] [CrossRef] [Green Version]
  10. Marzetta, T.L.; Ngo, H.Q. Fundamentals of Massive MIMO; Cambridge University Press: Cambridge, UK, 2016. [Google Scholar]
  11. Björnson, E.; Larsson, E.G.; Debbah, M. Massive MIMO for maximal spectral efficiency: How many users and pilots should be allocated? IEEE Trans. Wirel. Commun. 2015, 15, 1293–1308. [Google Scholar] [CrossRef] [Green Version]
  12. Ngo, H.Q.; Ashikhmin, A.; Yang, H.; Larsson, E.G.; Marzetta, T.L. Cell-free massive MIMO: Uniformly great service for everyone. In Proceedings of the 2015 IEEE 16th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Stockholm, Sweden, 28 June–1 July 2015; pp. 201–205. [Google Scholar]
  13. Ngo, H.Q.; Ashikhmin, A.; Yang, H.; Larsson, E.G.; Marzetta, T.L. Cell-free massive MIMO versus small cells. IEEE Trans. Wirel. Commun. 2017, 16, 1834–1850. [Google Scholar] [CrossRef] [Green Version]
  14. Interdonato, G.; Björnson, E.; Ngo, H.Q.; Frenger, P.; Larsson, E.G. Ubiquitous cell-free massive MIMO communications. EURASIP J. Wirel. Commun. Netw. 2019, 2019, 197. [Google Scholar] [CrossRef] [Green Version]
  15. Zhang, J.; Björnson, E.; Matthaiou, M.; Ng, D.W.K.; Yang, H.; Love, D.J. Prospective Multiple Antenna Technologies for Beyond 5G. IEEE J. Sel. Areas Commun. 2020, 38, 1637–1660. [Google Scholar] [CrossRef]
  16. Burr, A.; Bashar, M.; Maryopi, D. Ultra-dense Radio Access Networks for Smart Cities: Cloud-RAN, Fog-RAN and cell-free Massive MIMO. arXiv 2018, arXiv:1811.11077. [Google Scholar]
  17. Li, S.; Tolbert, L.M.; Wang, F.; Peng, F.Z. Stray inductance reduction of commutation loop in the P-cell and N-cell-based IGBT phase leg module. IEEE Trans. Power Electron. 2013, 29, 3616–3624. [Google Scholar] [CrossRef]
  18. Björnson, E.; Sanguinetti, L. Making cell-free massive MIMO competitive with MMSE processing and centralized implementation. IEEE Trans. Wirel. Commun. 2019, 19, 77–90. [Google Scholar] [CrossRef] [Green Version]
  19. Nguyen, L.D.; Duong, T.Q.; Ngo, H.Q.; Tourki, K. Energy efficiency in cell-free massive MIMO with zero-forcing precoding design. IEEE Commun. Lett. 2017, 21, 1871–1874. [Google Scholar] [CrossRef] [Green Version]
  20. Björnson, E.; Jorswieck, E. Optimal Resource Allocation in Coordinated Multi-Cell Systems; Now Publishers Inc.: Delft, The Netherlands, 2013. [Google Scholar]
  21. Kaltenberger, F.; Jiang, H.; Guillaud, M.; Knopp, R. Relative channel reciprocity calibration in MIMO/TDD systems. In Proceedings of the 2010 Future Network & Mobile Summit, Florence, Italy, 16–18 June 2010; pp. 1–10. [Google Scholar]
  22. Zhou, S.; Zhao, M.; Xu, X.; Wang, J.; Yao, Y. Distributed wireless communication system: A new architecture for future public wireless access. IEEE Commun. Mag. 2003, 41, 108–113. [Google Scholar] [CrossRef]
  23. Buzzi, S.; D’Andrea, C. Cell-free massive MIMO: User-centric approach. IEEE Commun. Lett. 2017, 6, 706–709. [Google Scholar] [CrossRef]
  24. Björnson, E.; Hoydis, J.; Sanguinetti, L. Massive MIMO networks: Spectral, energy, and hardware efficiency. Found. Trends Signal Process. 2017, 11, 154–655. [Google Scholar] [CrossRef]
  25. Sanguinetti, L.; Björnson, E.; Hoydis, J. Toward massive MIMO 2.0: Understanding spatial correlation, interference suppression, and pilot contamination. IEEE Trans. Commun. 2019, 68, 232–257. [Google Scholar] [CrossRef] [Green Version]
  26. Björnson, E.; Sanguinetti, L. Scalable cell-free massive MIMO systems. IEEE Trans. Commun. 2020, 68, 4247–4261. [Google Scholar] [CrossRef] [Green Version]
  27. Zhang, J.; Zhang, J.; Björnson, E.; Ai, B. Local partial zero-forcing combining for cell-free massive MIMO systems. IEEE Trans. Commun. 2021, 69, 8459–8473. [Google Scholar] [CrossRef]
  28. Interdonato, G.; Buzzi, S. Conjugate beamforming with fractional-exponent normalization and scalable power control in cell-free massive MIMO. In Proceedings of the 2021 IEEE 22nd International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Lucca, Italy, 27–30 September 2021; pp. 396–400. [Google Scholar]
  29. Nikbakht, R.; Mosayebi, R.; Lozano, A. Uplink fractional power control and downlink power allocation for cell-free networks. IEEE Wirel. Commun. Lett. 2020, 9, 774–777. [Google Scholar] [CrossRef]
  30. Interdonato, G.; Karlsson, M.; Björnson, E.; Larsson, E.G. Local partial zero-forcing precoding for cell-free massive MIMO. IEEE Trans. Wirel. Commun. 2020, 19, 4758–4774. [Google Scholar] [CrossRef] [Green Version]
  31. Interdonato, G.; Frenger, P.; Larsson, E.G. Scalability aspects of cell-free massive MIMO. In Proceedings of the ICC 2019—2019 IEEE International Conference on Communications (ICC), Shanghai, China, 20–24 May 2019; pp. 1–6. [Google Scholar]
  32. Nayebi, E.; Ashikhmin, A.; Marzetta, T.L.; Yang, H.; Rao, B.D. Precoding and power optimization in cell-free massive MIMO systems. IEEE Trans. Wirel. Commun. 2017, 16, 4445–4459. [Google Scholar] [CrossRef]
  33. Nayebi, E.; Ashikhmin, A.; Marzetta, T.L.; Rao, B.D. Performance of cell-free massive MIMO systems with MMSE and LSFD receivers. In Proceedings of the 2016 50th Asilomar Conference on Signals, Systems and Computers, Pacific Grove, CA, USA, 6–9 November 2016; pp. 203–207. [Google Scholar]
  34. Riera-Palou, F.; Femenias, G. Decentralization issues in cell-free massive MIMO networks with zero-forcing precoding. In Proceedings of the 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 24–27 September 2019; pp. 521–527. [Google Scholar]
  35. Aguerri, I.E.; Zaidi, A.; Caire, G.; Shitz, S.S. On the capacity of cloud radio access networks with oblivious relaying. IEEE Trans. Inf. Theory 2019, 65, 4575–4596. [Google Scholar] [CrossRef] [Green Version]
  36. Biglieri, E.; Proakis, J.; Shamai, S. Fading channels: Information-theoretic and communications aspects. IEEE Trans. Inf. Theory 1998, 44, 2619–2692. [Google Scholar] [CrossRef]
  37. Ngo, H.Q.; Tataria, H.; Matthaiou, M.; Jin, S.; Larsson, E.G. On the performance of cell-free massive MIMO in Ricean fading. In Proceedings of the 2018 52nd Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, CA, USA, 28–31 October 2018; pp. 980–984. [Google Scholar]
  38. Adhikary, A.; Ashikhmin, A.; Marzetta, T.L. Uplink interference reduction in large-scale antenna systems. IEEE Trans. Commun. 2017, 65, 2194–2206. [Google Scholar] [CrossRef]
  39. Nikbakht, R.; Lozano, A. Uplink fractional power control for cell-free wireless networks. In Proceedings of the ICC 2019—2019 IEEE International Conference on Communications (ICC), Shanghai, China, 20–24 May 2019; pp. 1–5. [Google Scholar]
Figure 1. Example of virtual clusters in the CF-M-MIMO with multiple CPUs. The yellow area, red area and blue area represent virtual cluster, real cluster and the intersection of virtual cluster and real cluster, respectively.
Figure 1. Example of virtual clusters in the CF-M-MIMO with multiple CPUs. The yellow area, red area and blue area represent virtual cluster, real cluster and the intersection of virtual cluster and real cluster, respectively.
Mathematics 10 01900 g001
Figure 2. Comparison of CF-M-MIMO with different levels of multiple CPUs cooperation in the case that partial APs serve the UEs.
Figure 2. Comparison of CF-M-MIMO with different levels of multiple CPUs cooperation in the case that partial APs serve the UEs.
Mathematics 10 01900 g002
Figure 3. Comparison of CF-M-MIMO with different levels of multiple CPUs cooperation in the case that all APs serve the UEs.
Figure 3. Comparison of CF-M-MIMO with different levels of multiple CPUs cooperation in the case that all APs serve the UEs.
Mathematics 10 01900 g003
Figure 4. In the case that partial APs serve the UEs, the SE comparison of Level 3 between full power and power allocation.
Figure 4. In the case that partial APs serve the UEs, the SE comparison of Level 3 between full power and power allocation.
Mathematics 10 01900 g004
Figure 5. When the total number of APs is fixed, the SE comparison of different number of CPUs.
Figure 5. When the total number of APs is fixed, the SE comparison of different number of CPUs.
Mathematics 10 01900 g005
Figure 6. The SE comparison of different number of UEs.
Figure 6. The SE comparison of different number of UEs.
Mathematics 10 01900 g006
Figure 7. The SE comparison under different uplink transmit power.
Figure 7. The SE comparison under different uplink transmit power.
Mathematics 10 01900 g007
Table 1. Type of CSIs that needs to exchange among CPUs, and the comparison of different levels of cooperation.
Table 1. Type of CSIs that needs to exchange among CPUs, and the comparison of different levels of cooperation.
Type of CSIsLevel of
Computational Complexity
Level 4 centralized connectivityinstantaneous CSIshigh
Level 3 distributed connectivity
and complex processing
statistical CSIsmedium
Level 2 distributed connectivity
and simple processing
statistical CSIslow
Level 1 no connectivitylowest
Table 2. Number of complex scalars that other CPUs send to the Master CPU through the backhaul, either in each coherence block or for each realization of the UE locations/statistics.
Table 2. Number of complex scalars that other CPUs send to the Master CPU through the backhaul, either in each coherence block or for each realization of the UE locations/statistics.
Each Coherence BlockStatistical Parameters
Level 4 τ c Z k τ c A M C k ( K Z k ) / 2
Level 3 K ( τ c τ p ) ( | U k | 1 ) K Z k + ( | U k | 2 K 2 + K | U k | ) / 2
Level 2 K ( τ c τ p ) ( | U k | 1 )
Level 1
Table 3. Computational complexity of different level of cooperation per coherence block. Only complex multiplications and complex divisions are considered, and additions/subtractions are neglected.
Table 3. Computational complexity of different level of cooperation per coherence block. Only complex multiplications and complex divisions are considered, and additions/subtractions are neglected.
Computing Combining VectorsComputing Weighted Vectors
MultiplicationsDivisionsMultiplicationsDivisions
Level 4 3 Z k 2 + Z k K 2 + Z k 3 Z k 3 + Z k τ p τ p K | Z k |
Level 3 u U k ( 3 P u 2 + P u K 2 + P u 3 P u 3 + P u τ p τ p K ) u U k | P u | | U k | 2 + | U k | K 2 + | U k | 3 | U k | 3 + | U k | 2 | U k |
Level 2 u U k ( 3 P u 2 + P u K 2 + P u 3 P u 3 + P u τ p τ p K ) u U k | P u |
Level 1 3 A M C k 2 + A M C k K 2 + A M C k 3 A M C k 3 + A M C k τ p τ p K | A M C k |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Li, F.; Sun, Q.; Ji, X.; Chen, X. Scalable Cell-Free Massive MIMO with Multiple CPUs. Mathematics 2022, 10, 1900. https://doi.org/10.3390/math10111900

AMA Style

Li F, Sun Q, Ji X, Chen X. Scalable Cell-Free Massive MIMO with Multiple CPUs. Mathematics. 2022; 10(11):1900. https://doi.org/10.3390/math10111900

Chicago/Turabian Style

Li, Feiyang, Qiang Sun, Xiaodi Ji, and Xiaomin Chen. 2022. "Scalable Cell-Free Massive MIMO with Multiple CPUs" Mathematics 10, no. 11: 1900. https://doi.org/10.3390/math10111900

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop