Next Article in Journal
The Index System for the Development Level Evaluation of Regional Construction Industrialization: A Case Study in Jiangsu, China
Next Article in Special Issue
Energy-Efficient Caching for Mobile Edge Computing in 5G Networks
Previous Article in Journal
Mechanical Performance Monitoring for Prestressed Concrete Piles Used in a Newly-Built High-Piled Wharf in a Harbor with Fiber Bragg Grating Sensor Technology When Pile Driving
Previous Article in Special Issue
An Initial Load-Based Green Software Defined Network
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Joint Relay Selection and Resource Allocation for Energy-Efficient D2D Cooperative Communications Using Matching Theory

1
School of Electrical and Electronic Engineering, North China Electric Power University, Beijing 102206, China
2
State Grid Laboratory of Electric Power Communication Network Technology (Global Energy Interconnection Research Institute), Beijing 102209, China
3
Instituto de Telecomunicações, 1049-001 Aveiro, Portugal
4
University of South Wales, Pontypridd CF37 1DL, UK
*
Author to whom correspondence should be addressed.
Appl. Sci. 2017, 7(5), 491; https://doi.org/10.3390/app7050491
Submission received: 26 January 2017 / Revised: 10 April 2017 / Accepted: 26 April 2017 / Published: 10 May 2017
(This article belongs to the Special Issue Green Wireless Networks)

Abstract

:
Device-to-device (D2D) cooperative relay can improve network coverage and throughput by assisting users with inferior channel conditions to implement multi-hop transmissions. Due to the limited battery capacity of handheld equipment, energy efficiency is an important issue to be optimized. Considering the two-hop D2D relay communication scenario, this paper focuses on how to maximize the energy efficiency while guaranteeing the quality of service (QoS) requirements of both cellular and D2D links by jointly optimizing relay selection, spectrum allocation and power control. Since the four-dimensional matching involved in the joint optimization problem is NP-hard, a pricing-based two-stage matching algorithm is proposed to reduce dimensionality and provide a tractable solution. In the first stage, the spectrum resources reused by relay-to-receiver links are determined by a two-dimensional matching. Then, a three-dimensional matching is conducted to match users, relays and the spectrum resources reused by transmitter-to-relay links. In the process of preference establishment of the second stage, the optimal transmit power is solved to guarantee that the D2D link has the maximized energy efficiency. Simulation results show that the proposed algorithm not only has a good performance on energy efficiency, but also enhances the average number of served users compared to the case without any relay.

1. Introduction

1.1. Background and Motivation

As the demand for wireless data services grows dramatically, revolutionary technologies that focus on improving network coverage and throughput within the limited system resources have been constantly emerging. Device-to-device (D2D) communication that enables user equipment (UEs) to communicate with each other through direct links without the base station (BS) relaying [1] is considered to be one of the key technologies in the future 5G wireless system [2]. In the underlay mode, D2D links share the same spectrum resources with traditional cellular UEs (CUEs), providing significant increases in spectrum efficiency and system throughput [1,3]. There are two basic transmission types in D2D communication: single-hop transmission, in which data are transmitted from the transmitter to the receiver directly as the channel condition between them meets the quality of service (QoS) requirement; and multi-hop transmission, where some devices act as relay stations, in which data are transmitted from the transmitter to the receiver hop by hop, and the route must be opportunistically optimized to guarantee the performance [4]. In general, because of the high channel quality of short-range D2D links, the D2D UEs can achieve a high single-hop transmission rate [5,6]. However, there exists a possibility that the channel condition between two potential D2D UEs cannot support a direct single-hop link. For this reason, D2D cooperative relay has naturally become an important application that can assist two UEs with an inferior direct link to communicate [7,8,9].
Though applying underlying mode D2D communication in cellular networks can bring some advancements such as spectrum efficiency and the coverage of communication, the UEs can no longer neglect the co-channel interference caused by spectrum resource reusing [10,11]. Since the underlay mode D2D communication brings new problems and challenges to the cellular networks, one of the crucial issues is the intelligent resource allocation strategy for both D2D UEs and CUEs to guarantee their QoS [12,13,14]. Due to the low cost and the few modifications to current infrastructures, the D2D relay may work more efficiently than fixed relays [15]. Nonetheless, the co-channel interference caused by spectrum resource sharing exists in D2D cooperative relay transmissions, which is identical to the single-hop transmission. As the relay communication requires two-hop spectrum resources, the resource management has become more complicated than in the single-hop case. Moreover, relay selection needs to be optimized jointly with the spectrum allocation and power control for both the first hop and the second hop links, which can realize the system performance maximization. There have already been some works that focus on relay selection [16,17] and resource allocation [18,19,20,21,22] to optimize the performance of D2D relay communications. In [20], the joint resource allocation and relay selection problem was formulated as a binary integer nonlinear programming problem, and the authors in [21] also considered the joint optimization and proposed a two-stage method to maximize system throughput. However, few works above have covered the power control problem, which is closely related to energy efficiency (EE). As the booming wireless services consume a large amount of energy from UE batteries and UEs are handheld equipment with limited battery capacity, one key issue is to reduce the energy consumption for both D2D transmitters (TXs) and D2D relay stations (RSs). An iterative Hungarian method was proposed to solve the relay and spectrum allocation problem with optimal transmit power at each node in [23]. Nevertheless, EE has not been considered in this work, and the cellular spectrum resources are allowed to be reused by only one of the two hops in the D2D relay link.
Therefore, we put emphasis on energy-efficient resource management in D2D cooperative relay communications. The problem is formulated as the joint relay selection, spectrum allocation and power control optimization for two-hop D2D relay transmission underlay cellular networks. Both the first hop from D2D TX to D2D RS and the second hop from D2D RS to the D2D receiver (RX) are allowed to reuse the spectrum of CUEs to improve spectrum efficiency. Our objective is to find an approach to determine the optimal relay, spectrum and transmit power of TXs and RSs, which maximizes the EE of D2D links, while guaranteeing the QoS of all of the links.
The problem is difficult to solve, as it not only includes continuous and discrete variables, but also involves four-dimensional resource allocation indicators [24]. It is a challenge to solve the problem with four-dimensional variables directly, since it is infeasible to achieve a solution in polynomial time for this NP-hard problem [25]. Besides, the transmit power of D2D TXs and RSs must be controlled to guarantee the QoS of UEs and achieve the maximum EE simultaneously, which also increases the complexity of the problem. Compared with our previously proposed works [11,12,26,27], this work focuses on the multi-hop D2D links, which was not investigated before, and the optimization problem in this work is a four-dimensional matching among D2D pairs, relay selections and resource blocks (RBs) reused in the first and second hop of two-hop D2D links, which is much more complicated than the previous works. Since the problem involved in this work is a joint optimization problem with multidimensional variables, it cannot be solved by using the previously proposed algorithm directly. To give a tractable solution of this four-dimensional matching problem, based on the previously proposed three-dimensional matching algorithm, we proposed an iterative four-dimensional matching algorithm.

1.2. Contributions

Noting that D2D TX-RX pairs (TRs), RSs, the first hop spectrum and the second hop spectrum should be matched effectively, thus, in our paper, we solve the mixed integer nonlinear programming problem using matching theory [28], which considers two side preferences in a traditional form and provides a low-complexity solution. However, the proposed joint optimization involves four dimension variables that cannot be solved by a traditional matching algorithm. Therefore, we first decompose the problem into a two-dimensional matching and a three-dimensional matching with a low dimension reduction error and then solve the two matching problems based on a pricing mechanism. The power control process is coupled with the three-dimensional matching, and the optimal transmit power maximizes the total EE of the two-hop link. The auction algorithm has been used in some previous works to solve the resource allocation problems under the D2D scenario [29,30]. In [29], the authors proposed a two-phase auction-based resource allocation algorithm for underlying D2D communication. Additionally, in [30], the authors proposed a two-step distributed algorithm of channel allocation and mode selection for mobile stations with the objective of capacity maximization. Compared to these works, we consider a much more complicated problem with multidimensional variables that cannot be solved by the above-mentioned auction algorithm, which includes only two dimension variables, i.e., bidders and bidding items. The similarity is just that we use the pricing strategy when the conflicts appear in the matching algorithm, and we employ the idea of price rising in an auction.
Remark 1.
The Bertsekas auction algorithm and Munkres’ Hungarian algorithms are not considered in this work. However, the joint optimization involved in this work is a high complexity problem with multidimensional variables, which cannot be solved directly by a traditional matching algorithm or auction algorithm. Therefore, we decompose the original problem into a two-dimensional matching and a three-dimensional matching with a low dimension reduction error and then solve the two matching problems based on a pricing mechanism. Although the employed pricing-based matching algorithm is similar to the auction algorithm, to solve the joint optimization problem, both of them should be modified in the same way, which is the focus of this work. The innovation of this work is not only the proposed pricing-based matching algorithm, but also the idea of problem simplification.
The main contributions of this paper are summarized as follows:
  • We design a two-hop D2D relay strategy for the transmitter-receiver pair with an inferior direct link to improve system performance. Specifically, we propose a joint relay selection, power optimization and spectrum allocation problem, which involves the matching between D2D TRs and relay stations and the matching between two-hop D2D links and the respective spectrum resources and can be formulated as four-dimensional matching to maximize the EE of D2D UEs.
  • The joint relay selection and resource allocation problem is intractable and belongs to the class of NP-hard problems; thus, we simplify the problem into one two-dimensional matching between D2D TRs and the second hop spectrum resources and one three-dimensional matching among D2D TRs, relays and the first hop spectrum resources. By solving these two matching problems, we can achieve a sub-optimal solution, which can approach the performance of the exhaustive optimal algorithm with a much lower complexity. The two-dimensional matching problem is solved by using a pricing strategy directly, which decides the winner when more than one D2D TRs propose the same spectrum resource for the second hop. For the three-dimensional matching, we transform it into a two-sided matching, in which the preference lists of D2D TRs from one side over the combinations of RSs and spectrums for the first hop from the other side are established based on the achievable maximum EE. Therefore, the three-dimensional matching can use the pricing strategy to achieve a sub-optimal solution. In the three-dimensional matching process, we also consider the power optimization for both D2D TXs and RSs to maximize the EE of D2D links, as well as avoid excessive interference to CUEs.
  • The properties of the proposed three-dimensional matching and two-dimensional matching algorithm, such as optimality, stability, convergence and complexity, are analyzed theoretically. In the simulation, we compare the proposed matching algorithm with the exhaustive optimal and random matching algorithm in terms of the achieved EE for D2D UEs under different system configurations. Numerical results show that our proposed scheme can achieve a considerable performance gain, and the average percentage of served users is substantially improved compared with that without any relay.
The rest of this paper is organized as follows. In Section 2, we provide a brief review of the related works. In Section 3, we describe the system model. The formulation of the four-dimensional matching problem is introduced in Section 4. In Section 5, we present the proposed pricing-based matching algorithm, and the simulation results are shown in Section 7 with related discussions. Finally, we conclude the paper in Section 8.

2. Related Works

The purpose of this paper is to solve the joint relay selection, channel assignment and power optimization problem by using the pricing-based matching algorithm. In [31,32,33,34], the proper relay stations were selected to improve the performance of the multi-hop D2D network while simultaneously considering the co-channel interference. In [35], the authors presented a theoretical framework for multi-hop D2D communications, which enables modeling of interference and performance analysis. In [36], the authors proposed a novel routing protocol for underwater sensor networks (UWSNs), which was named the location-aware routing protocol (LARP) and could improve packet delivery ratio and reduce normalized routing overhead. In [37], the authors modeled the problems of relay selection and channel allocation as a classical weighted bipartite graph matching problem and solved it by using the Hungarian algorithm. In [38], a novel quantum particle swarm optimization-based relay selection scheme was proposed to maximize the system throughput of the cooperative relay networks and reduce the computational complexity compared with the exhaustive search. The above works mainly focus on how to maximize the throughput of the system through relay selection with the presence of interference. In particularly, the channel allocation problem was also considered in [37].
In [39,40,41,42,43], the resource allocation problem was solved simultaneously with the relay selection problem. In [20], the joint resource allocation and relay selection problem was formulated as a binary integer non-linear programming problem, and the optimal solution of the random linear network coding aided D2D communications was obtained by introducing the concept of D2D cluster. In [44], an algorithm was proposed to maximize the throughput of a D2D pair by finding the optimum multi-hop routes and the corresponding transmit powers. To achieve simultaneous downlink transmission for multiple users, in [45], the authors proposed a novel MAC scheme based on an opportunistic channel-aware scheduling policy. In [46], the authors employed the Lyapunov optimization theory to decompose the dynamic communication mode selection problem and provide a lower bound of the system throughput and proposed an algorithm to achieve approximate maximization. In [47], the joint relay selection and resource allocation problem were solved in a multi-user, multi-carrier and multi-cellular network, thus increasing the throughput of users in the cell-edge. As the co-channel interference exists in the underlying mode of the D2D communication network, the resource allocation problem must be solved under the interference constrains to guarantee the quality of communications [48,49]. In [12], a three-dimensional matching algorithm was applied to maximize the weighted channel rate of the system, which explored the information from both the social layer and the physical layer. However, the EE performance was neglected.
To extend the lifetime of the rapidly-drained battery of user equipment, when matching D2D users with relays, it is necessary to consider the EE [26,27,50]. In [51], the selection of the energy harvesting relay has been studied for D2D communications in a public safety environment. In [23], the iterative Hungarian method was devised to solve the associated relay selection and resource assignment problems, and it showed that the optimal power allocation problem can be solved in a closed-form. In [52], the authors have proven that threshold-based power control policies function efficiently, and this would benefit the limited battery of users’ handheld equipment.
However, most of the above works have not considered EE of D2D links during the optimization process. Although in [26,51], the EE issue has been considered, the multi-hop D2D communication was neglected, and the application scenario is different from our study. Considering the energy efficiency problem, the purpose of the original four-dimensional matching among D2D pairs, relay selections and resource blocks (RBs) reused in the first and second hop of two-hop D2D links is to maximize the EE of D2D links.
In [11], the authors formulated the joint resource allocation problem as a one-to-one matching problem under two-sided preferences and employed the Gale–Shapley (GS) algorithm to match D2D pairs with cellular UEs. In [12], a three-dimensional iterative matching algorithm was proposed to maximize the sum rate of D2D pairs weighted by the intensity of social relationships. The work in [26] focused on the energy-efficient context-aware resource allocation problem. The authors transformed the NP-hard optimization problem into a one-to-one matching problem under two-sided preferences and proposed an energy-efficient matching algorithm. In [27], a two-stage EE optimization problem, which consists of a joint spectrum and power allocation problem in the first stage and a context-aware D2D peer selection problem in the second stage, was investigated. The authors proposed an iterative power allocation algorithm to optimize EE under a specific match and proposed an iterative matching algorithm to solve the combinatorial problem. The difference between previous works and this work is summarized as follows. First of all, we focused on the multi-hop D2D scenario, which was not investigated by [11,12,26,27]. Second, the optimization problem involves a four-dimensional matching among D2D pairs, relay selections and resource blocks (RBs) reused in the first and second hop of two-hop D2D links, which is much more complicated than the problems studied in [11,12,26,27]. Last, but not least, the solutions proposed in previous works cannot be directly applied to this work. To provide a tractable solution, we firstly decompose the origin problem into a two-dimensional matching among D2D TRs and RBs for the second hop and a three-dimensional matching problems among D2D TRs, RSs and RBs for the first hop. Then, we employ the proposed pricing-based iterative two-dimensional matching and pricing-based iterative three-dimensional matching algorithm to provide a tractable solution. In summary, the system model, problem formulation and proposed solution of this work are completely different from our previous works in [11,12,26,27].

3. System Model

We consider a D2D relay-assisted cellular network, which consists of a BS, CUEs, D2D TX-RX pairs (TRs) and idle users acting as D2D RSs. Each CUE is allocated with an orthogonal uplink resource block (RB), and each RB can be reused by at most one D2D link. In this paper, we only consider the static scenario, and the moving speeds of users are not studied. Actually, the fast mobility of users has a significant impact on the network topology, mode selection and channel gains. This is a new problem and is left for future study. Nevertheless, our work is still valid for users with walking speeds. The reason is that since the optimization is performed in a slot by slot fashion, the position variations caused by the low mobility of users during channel coherence time are negligible. For instance, assuming that the moving speeds of users are an average of 1.5 m/s and the time slot for the proposed algorithm to optimize is 1 ms, therefore, after one time slot, the moving distance of the users is 0.0015 m, and this is ignorable when compared to the distance between the transmitters and the receivers.
We focus on the D2D TRs, which suffer from bad channel qualities and cannot achieve direct single-hop links. To improve network coverage and throughput, D2D communication is operated in a wo-hop fashion via D2D RSs, which is shown in Figure 1. That is, a D2D TX first sends the signal to a D2D RS, and then, the RS forwards the signal to the D2D RX. In each hop, the RB of a CUE is reused by a D2D link. We also assume that each D2D RS can serve at most one D2D TR, and each D2D TR can utilize only one RS.
It is assumed that there are M D2D TRs, R D2D RSs and C CUEs in the system. D2D TXs and RXs are denoted by the sets M T = { 1 , , m t , , M } and M R = { 1 , , m r , , M } , respectively. D2D RSs and CUEs (i.e., RBs) are denoted by the sets R = { 1 , , r , , R } and C , respectively. We assume that D2D TX m t , D2D RS r and D2D RX m r form a two-hop D2D link. The RBs reused by the first hop and second hop D2D links are denoted as c f and c s , respectively, which also represent the c f -th and c s -th CUEs, c f , c s C . All the meaning of used variables are given in Table 1. For the channel model, we use the Rayleigh fading to model the small-scale fading and employ the free space propagation path-loss to model the large-scale fading [53,54]. The free space propagation path-loss model employed in this work is an ideal model, but it does not impact the solution structure of the proposed algorithm. In other words, the proposed algorithm is adaptive to other more complicated path-loss models. The signal to interference plus noise ratio (SINR) received by RS r can be expressed as:
γ r = P m t h m t r 2 P c f h c f r 2 + N 0 = P m t d m t r α h s , m t r 2 P c f d c f r α h s , c f r 2 + N 0 .
Here, P m t and P c f are the transmit power of D2D TX m t and CUE c f , respectively. h m t r and h c f r denote the channel responses of the first hop D2D link and the interference link from CUE c f , respectively. d m t r is the transmission distance between D2D TX m t and RS r, while d c f r is the distance between CUE c f and RS r. α is the path-loss exponent corresponding to the large-scale fading of the transmission channel. h s , m t r and h s , c f r are the small-scale fading (Rayleigh) channel coefficients, which obey the complex Gaussian distribution CN ( 0 , 1 ) . N 0 represents the one-sided power spectral density of the additive white Gaussian noise (AWGN).
The SINR received by D2D RX m r is given by:
γ m r = P r h r m r 2 P c s h c s m r 2 + N 0 = P r d r m r α h s , r m r 2 P c s d c s m r α h s , c s m r 2 + N 0 ,
where P r and P c s are the transmit power of D2D RS r and CUE c s , respectively. h r m r and h c s m r represent the channel responses of the second hop D2D link and the interference link from CUE c s , respectively. d r m r is the transmission distance between RS r and D2D RX m r , and d c s m r is the distance between CUE c s and D2D RX m r . h s , r m r and h s , c s m r are the corresponding small-scale fading channel coefficients.
Lemma 1.
Accordingly, the effective SINR of the two-hop D2D link from TX m t to RX m r through RS r by reusing RBs of CUE c f and CUE c s can be calculated by [23]:
γ m t , m r = γ r γ m r γ r + γ m r + 1 .
The proof of Lemma 1 can be found in Appendix A.
For the cellular transmissions, the SINR received by the BS corresponding to the CUE whose RB is reused by the first hop D2D link, i.e., c f , is given by:
γ c f = P c f h c f B 2 P m t h m t B 2 + N 0 = P c f d c f B α h s , c f B 2 P m t d m t B α h s , m t B 2 + N 0 ,
and the SINR received by the BS corresponding to the CUE whose RB is reused by the second hop D2D link, i.e., c s , is given by:
γ c s = P c s h c s B 2 P r h r B 2 + N 0 = P c s d c s B α h s , c s B 2 P r d r B α h s , r B 2 + N 0 .
Here, h c f B , h m t B , h c s B and h r B are the channel responses of the cellular link from CUE c f , the interference link from TX m t , the cellular link from CUE c s and the interference link from RS r, respectively. d c f B , d m t B , d c s B and d r B are the corresponding transmission distances. h s , c f B , h s , m t B , h s , c s B and h s , r B are the corresponding small-scale fading channel coefficients.

4. Problem Formulation

The purpose of this work is to improve network coverage by D2D cooperative relay communications with the aim of maximizing the total EE of the two-hop D2D TR pairs, while simultaneously satisfying the QoS requirements of both D2D and cellular links. Relay selection, spectrum allocation and power control are optimization variables or methodologies rather than the optimization objectives. To achieve the EE maximization, we need to design an approach to jointly optimize relay selection, spectrum allocation and power control for the two-hop links. We employ a four-dimensional variable X = { X m , r , c f , c s } to denote the allocation of relay and spectrum resources, where m M = { 1 , 2 , , M } , X m , r , c f , c s { 0 , 1 } . If X m , r , c f , c s = 1 , the signal transmitted from TX m t to RX m r is relayed by RS r, and meanwhile, RB c f and RB c s are reused by the first hop and the second hop D2D links, respectively. We define the EE as the ratio of SE to the total power consumption. The definition of SE is how many bits can be transmitted per second per Hz. The system bandwidth is 5 MHz in simulations. Then, the total throughput needs to be divided by the system bandwidth to derive the SE. Therefore, when X m , r , c f , c s = 1 , the EE of the two-hop D2D link can be obtained by:
E E m , r , c f , c s = log 2 1 + γ m t , m r P m t + P r + 2 P c i r ,
and substituting (3) into (6), we have:
E E m , r , c f , c s = log 2 1 + γ r γ m r γ r + γ m r + 1 P m t + P r + 2 P c i r ,
where P c i r is the circuit power of D2D TXs or RSs. The numerator of (6), i.e., log 2 1 + γ m t , m r , represents the SE, which is calculated based on the effective SINR of the two-hop D2D link, defined by (3). The denominator of (6), i.e., P m t + P r + 2 P c i r , represents the total power consumption, which consists of the transmission power of D2D TX, the transmission power of RS and the circuit power.
Based on the above derivation, we jointly design the binary allocation variable { X m , r , c f , c s } , the continuous power variables P m t and P r to maximize the total EE of the two-hop D2D links, thus formulating a mixed integer nonlinear programming (MINLP) problem as:
max { P m t , P r , X } m = 1 M r = 1 R c f = 1 C c s = 1 C X m , r , c f , c s E E m , r , c f , c s s . t . C 1 : P m i n , m t P m t P m a x , m t , m t M T , P m i n , r P r P m a x , r , r R , C 2 : X m , r , c f , c s { 0 , 1 } , m M , r R , c f , c s C , C 3 : r R , c f , c s C X m , r , c f , c s 1 , m M , m M , c f , c s C X m , r , c f , c s 1 , r R , m M , r R X m , r , c f , c s 1 , c f , c s C , C 4 : m i n γ r , γ m r , γ m t , m r γ m i n D , m i n γ c f , γ c s γ m i n C , m t M T , m r M R , r R , c f , c s C .
C 1 gives the transmit power constraints of D2D TXs and RSs. P m i n , m t , and P m a x , m t represent the lower and upper bounds on the transmit power for D2D TX m t . P m i n , r and P m a x , r represent the lower and upper bounds on the transmit power for RS r. The specific values of P m i n , m t , P m a x , m t , P m i n , r and P m a x , r are analyzed and determined in Section 5. The inequalities in C 3 ensure that there is a one-to-one correspondence among the D2D TR pair, the selected RS, the RB reused by the first hop D2D link and the RB reused by the second hop link. C 4 guarantees the QoS requirements of links. γ m i n D and γ m i n C denote the SINR threshold of D2D links and cellular links, respectively.

5. Energy-Efficient Resource Matching Algorithm for D2D Relay Communications

As the mixed integer programming (8) is an NP-hard problem that cannot be solved in polynomial time, we investigate an approach based on matching theory to provide a sub-optimal solution. However, the MINLP problem with a four-dimensional variable X leads to a four-dimensional matching, which is highly complicated. The problem defined in (8) is a mixed integer nonlinear program because it involves integer optimization variables for relay selection and spectrum allocation and continuous optimization variables for power allocation. By observing (1) and (2), we notice that in the first hop, both signal power and interference power received by the RS are related to the result of relay selection, while in the second hop, only signal power received by the RX is related to the result of relay selection. Thereupon, we start from the optimization of RB selection in the second hop link, which is independent of relay selection. First of all, each two-hop D2D link is matched with an RB for its second hop D2D transmissions, which can minimize its received co-channel interference, and then, it is matched with an RS and another RB for its first hop D2D transmission, which can maximize the EE of two-hop D2D links. Thus, the original four-dimensional matching can be approximatively decomposed into a two-dimensional matching between D2D TRs and RBs for the second hop and a three-dimensional matching among D2D TRs, RSs and RBs for the first hop. Note that we ignore the effect of dimensionality reduction on CUEs for simplification. For each matching problem, firstly, we introduce how to establish the preference list and then propose a pricing-based iterative matching algorithm.

5.1. Two-Dimensional Matching

We start from the two-dimensional matching problem in the second hop D2D link, which involves a two-sided matching with M D2D TRs on one side and C RBs on the other side. We have the following definition:
Definition 1.
A matching X ^ is a one-to-one correspondence from the set M C onto itself, which is X ^ : M C M C , such that X ^ ( m ) = c s means that D2D TR m is matched with the RB c s and X ^ ( m ) = m means that m remains single. X ^ ( m ) is regarded as the matching partner of m.
We consider a matching X ^ , in which individuals m and c s prefer each other to their matching partners, but are not matched with each other under X ^ , namely c s m X ^ ( m ) and m c s X ^ ( c s ) . Thus, m and c s form a blocking pair for the matching X ^ , that is, ( m , c s ) blocks the matching. We say that matching X ^ is not stable because both m and c s have the desire to disrupt the matching X ^ in order to be matched with each other.
Definition 2.
The matching X ^ is stable when there exists no blocking pair.

5.1.1. Preference Establishment

In the two-dimensional matching problem, D2D TRs on one side propose to establish pairs with RBs on the other side based on their preference lists. As mentioned above, the RB selection for the second hop D2D link is independent of relay selection. Thus, regardless of which RS is selected, D2D TR m prefers to be matched with the RB that maximizes the reciprocal of interference power received by RX m r , i.e.,
V m , c s = 1 P c s d c s m r α h s , c s m r 2 ,
which is defined as the preference value of TR m on RB c s . By temporarily paring each D2D TR pair with each RB, the reciprocal of the interference corresponding to each TR-RB pair can be obtained. The preference value of TR m on each RB in descending order is denoted as E ^ m = { E ^ m , 1 , E ^ m , 2 , . . . , E ^ m , C } , which is firstly calculated as (9), and the corresponding RB is O ^ m = { O ^ m , 1 , O ^ m , 2 , . . . , O ^ m , C } , O ^ m , i C , i [ 1 , C ] , which is defined as the preference list of TR m. The set E ^ m and O ^ m will be updated in each matching iteration.

5.1.2. Matching Algorithm

In the matching process, D2D TR pairs can propose their most preferred RBs based on the established preference lists. However, it is possible that more than one D2D TR proposes the same RB. To solve this problem, we design a pricing strategy to resolve the conflict, which utilizes the concept of price to represent the matching cost for D2D TRs. The prices of RBs are virtual, without any physical significance, and are set to zero at the beginning. Let F ^ = { F ^ 1 , , F ^ c s , , F ^ C } , c s C denote the price set of RBs for the second hop D2D links. The matching algorithm proceeds iteratively. In each iteration, the actual preference of D2D TR m on c s is updated as V m , c s F ^ c s , and the preference list O ^ m should be updated accordingly. Each D2D TR proposes its most preferred RB in O ^ m . If any RB receives more than one request, there are conflicts in the matching process. Then, the conflicting RBs raise their prices with step s until they receive no more than one request. The algorithm ends if there exist no new requests from D2D TRs. We summarize the two-dimensional matching algorithm in Algorithm 1 and provide an illustration of the preference establishment process and the derived one-to-one matching result for the second hop D2D link in Figure 2; the flowchart is given in Figure 3.
Algorithm 1The Two-Dimensional Matching Algorithm
1:Input: M , C , s.
2:Output: X ^ .
3:Initialization:
4:Every D2D TR m M builds its preference list.
5:Set X ^ ( m ) = , m , Ω ^ = ∅, s = 0.1.
6:while X ^ ( m ) = do
7:    for m M do
8:        D2D TR m proposes to its most preferred RB in updated O ^ m .
9:    end for
10:    Find the RBs that have received more than one requests and put it into Ω ^ .
11:    if Ω ^ = then
12:        Match TRs with the requested RBs directly.
13:    else
14:        for c s Ω ^ do
15:            RB c s increases its price F ^ c s with the price step s, and D2D TRs would update
            the preference lists O ^ m . RB c s would be matched with the last remaining TR m
            that proposes to it, denoted as X ^ ( m ) = c s . Delete c s from Ω ^ .
16:        end for
17:    end if
18:Update: Update O ^ m and E ^ m for m , X ^ ( m ) = by deleting the matched RB.
19:end white

5.2. Three-Dimensional Matching Algorithm

Given the RB allocation of the second hop, the joint optimization problem (8) is equivalent to a three-dimensional matching problem involving the matching among D2D TRs, RSs and RBs for the first hop. To simplify the problem, we combine one RS and one RB (CUE) as a one-dimensional unit RC. Owing to the existence of R RSs and C RBs, there are R × C different combinations, denoted as RC = { R C r , c f } r = 1 , c f = 1 r = R , c f = C . Thus, the three-dimensional matching problem is reduced to a two-sided matching with M D2D TRs on one side and R × C RC units on the other side, which can also be solved by the proposed pricing-based matching approach. We give the definition below:
Definition 3.
A matching X ˜ is a one-to-one correspondence, denoted as X ˜ : M RC M RC , such that X ˜ ( m ) = R C r , c f means that D2D TR m is matched with unit R C r , c f and X ˜ ( m ) = means that m remains single. X ˜ ( m ) is the matching partner of m.
Since the matching in the first hop D2D link is a one-to-one three-dimensional matching among D2D TRs, RSs and RBs, if X ˜ ( i ) = R C r , c f , for i M { i } , we have X ˜ ( i ) = { RC R C r , c f } { } . Similar to the definition of the two-dimensional stable matching, the three-dimensional matching is stable if there exists no blocking pair. That is, there exists no TX and RC unit that have not been matched with each other under X ˜ and that prefer each other more than their current matching partners.

5.2.1. Preference Establishment

In the three-dimensional matching, D2D TRs on one side propose to establish pairs with RC units on the other side based on their preference lists. As the ultimate optimization goal is to maximize the EE of D2D links, the preference values of D2D TR on RC units can be obtained by solving a power control problem with the optimization of transmit power P m t and P r . When calculating the preference of the same D2D TR towards another RC unit, the formulated power control problem is different from the previous one, and the derived optimum power control strategy would not be the same. Hence, the determination of optimal power control strategy cannot be handled separately from matching problems because it depends on the specific matching assumption. To establish the preference lists, each D2D TR is temporarily paired with each RC unit to obtain the optimum EE and corresponding transmit power, which is also restricted to meet the QoS requirements of both D2D and cellular links. When TR m has been matched with RB c s for the second hop, the EE of TR m can be rewritten as E E ˜ m , r , c f , c s . The preference value of TR m on each RC unit in a descending order is denoted as the set E ˜ m = { E ˜ m , 1 , E ˜ m , 2 , . . . , E ˜ m , R × C } , which is firstly calculated as the achievable EE of the two-hop link, and the corresponding RC unit is O ˜ m = { O ˜ m , 1 , O ˜ m , 2 , . . . , O ˜ m , R × C } , O ˜ m , i RC , i [ 1 , R × C ] , which is the preference list of TR m. For a specific matching X ˜ ( m ) = R C r , c f , the maximum achievable EE of the two-hop link can be obtained by solving the following power control problem:
max { P m t , P r } E E ˜ m , r , c f , c s s . t . C 1 : P m i n , m t P m t P m a x , m t , P m i n , r P r P m a x , r , C 2 : m i n γ r , γ m r , γ m t , m r γ m i n D , m i n γ c f , γ c s γ m i n C .
Since (7) is concave with respect to P m t and P r , the transmit power ( P ˜ m t , P ˜ r ) that maximizes the EE can be obtained by:
E E ˜ m , r , c f , c s ( P m t , P r ) P m t = 0 , E E ˜ m , r , c f , c s ( P m t , P r ) P r = 0 .
The upper and lower bounds in the transmit power constraints are calculated as:
P m a x , m t = m i n P m a x , P c f h c f , B S 2 γ m i n C N 0 h m t , B S 2 ,
P m a x , r = m i n P m a x , P c s h c s , B S 2 γ m i n C N 0 h r , B S 2 ,
P m i n , m t = m a x γ m i n D 1 + γ r γ r γ m i n D A , γ m i n D P c f h c f , r 2 + N 0 h m t , r 2 ,
P m i n , r = m a x γ m i n D 1 + γ m r γ m r γ m i n D B , γ m i n D P c s h c s , m r 2 + N 0 h r , m r 2 .
The upper bounds are derived based on the QoS requirements of CUEs, and the lower bounds are derived based on the QoS requirements of D2D links. P m a x denotes the maximum transmit power allowed for any mobile equipment. A and B in (14) and (15) are given by, respectively,
A = h m t , r 2 / P c f h c f , r 2 + N 0 ,
B = h r , m r 2 / P c s h c s , m r 2 + N 0 .
Finally, the optimum power of D2D TX m t and RS r can be expressed as:
P m t * = m a x m i n P ˜ m t , P m a x , m t , P m i n , m t ,
P r * = m a x m i n P ˜ r , P m a x , r , P m i n , r ,
respectively. Note that if the calculated upper bound is lower than the lower bound, i.e., P m a x , m t < P m i n , m t or P m a x , r < P m i n , r , then TR m cannot be matched with unit R C r , c f .

5.2.2. Matching

Similar to the two-dimensional matching problem, a conflict occurs if more than one D2D TR proposes the same RS or RB. Thus, the units contain the conflicting RSs, and RBs are denoted as Ω ˜ ; and the proposed pricing strategy can also be used to resolve the conflict. Let F ˜ R = { F ˜ 1 R , , F ˜ r R , , F ˜ R R } and F ˜ C = { F ˜ 1 C , , F ˜ c f C , , F ˜ C C } denote the prices of RSs and RBs, respectively. The price of RC unit R C r , c f is the sum price of RS r and RB c f , and the price set of all of the RC units is denoted as F ˜ R C = { F ˜ r , c f R C } r = 1 , c f = 1 r = R , c f = C . In each iteration, the actual preference of D2D TR m towards unit R C r , c f is updated as E E ˜ m , r , c f , c s F ˜ r , c f R C , and the preference list O ˜ m should be updated accordingly. Any D2D TR m that has not been matched with any RC unit would propose its most preferred RC unit in O ˜ m . If both RS and RB in one RC unit receive only one request, the unit would be directly matched with the D2D TR that sends the request. If Ω ˜ , the conflicting elements in Ω ˜ would raise their prices with the price step s, until there is only one request left. The iterative process ends when there are no new requests from D2D TRs.
We summarize the three-dimensional matching algorithm in Algorithm 2, and an illustration of the preference lists’ establishment and a stable matching that we expected is shown in Figure 4; the flowchart is given in Figure 5.
Algorithm 2The Three-Dimensional Matching Algorithm
1:Input: M , R , C , s.
2:Output: X ˜ .
3:Initialization:
4:Every D2D TR m M builds its preference list on RC .
5:Set X ˜ ( m ) = , m , Ω ˜ = ∅, s = 0.1.
6:while X ˜ ( m ) = do
7:    for m M do
8:        D2D TR m proposes to its most preferred RC unit in updated O ˜ m .
9:    end for
10:    Count the amount of RSs and RBs that have received requests and put the conflicting ele-
    ments that have received more than one request into Ω ˜ .
11:    if Ω ˜ = ϕ then
12:        Match the RC unit with its requesting D2D pair directly.
13:    end if
14:    if Ω ˜ ϕ then
15:        for R C r , c f RC do
16:            if RS r and RB c f receive requests from more than one D2D pair then
17:            RS r and RB c f in Ω ˜ increase their prices F ˜ r R and F ˜ c f C with the price step s, and D2D TR
            update the preference list O ˜ m . After this process, RC unit which includes RS r and RB
             c f would be matched with the last remaining D2D TR m that proposed to them,
            denoted by X ˜ ( m ) = R C r , c f . Delete RC unit R C r , c f from Ω ˜ .
18:            end if
19:        end for
20:    end if
21:Update: Update: Update O ˜ m and E ˜ m for m , X ˜ ( m ) = by deleting the matched RC unit.
Set F ˜ R = { 0 } , F ˜ C = { 0 } , F ˜ RC = { 0 } .
22:end white

5.3. Implementation

In the real system, the BS is a major work component of the implementation of the proposed algorithm. At the beginning, the transmitters send some packets with detection signals, and then, the BS would obtain the feedback of the CSIfrom each terminal, i.e., D2D and cellular users. After that, the four-dimensional matching process would be conducted at the BS. Finally, after the matching process ends, the control signals would be forwarded to each node, and there is no signaling interaction among the network nodes during the process of matching.

6. Properties of the Matching Algorithm

In this section, the properties of the three-dimensional and the two-dimensional matching algorithm, including convergence, stability, optimality, and complexity, are analyzed in detail.
The analyses of the convergence, stability and optimality of the three-dimensional matching algorithm are given as follows:
Theorem 1.
The matching process would end with finite iterations.
Theorem 2.
The proposed Algorithm 2 can converge to a two-sided stable matching X ˜ in finite iterations.
Theorem 3.
The solution of three-dimensional matching X ˜ is weak Pareto optimal for D2D TRs on combinations of RSs and RBs.
The proof of Theorems 1–3 can be found in Appendix B, Appendix C and Appendix D, respectively.
The analysis of the optimality of two-dimensional matching algorithm is given as follows:
Theorem 4.
For any set of prices when the matching process ends, the result of the matching process has the maximum total utility of all of the possible assignments of RBs to D2D TRs.
The proof of Theorem 4 can be found in Appendix E. The proofs of the convergence and stability of two-dimensional matching are similar to Theorems 1 and 2, respectively, which can also be referred to Appendices Appendix B and Appendix C. Next, we discuss the complexity of the proposed algorithm.
For the three-dimensional matching algorithm, because each D2D TR would firstly calculate the achieved EE for each RC unit, in the process of preference establishment, the computational complexity for any TR i M to obtain the preferences is O ( R C ) . The preference list can be derived by sorting the preference values for each TR, and the computational complexity is O ( R C log ( R C ) ) . In Algorithm 2, the complexity of each price raising process, in which TRs that remain single propose their most preferred RC units, is M ˜ l o o p . M ˜ l o o p is the required number of iterations by the price rising process to resolve the conflict, i.e., during M ˜ l o o p iterations, the conflicting elements assignment are finished when Ω . We have M ˜ l o o p = 1 when Ω = . Then, the computational complexity of the matching process is O ( M M ˜ l o o p ) ( M C ) or O ( C M ˜ l o o p ) ( C M ) .
For the two-dimensional matching algorithm, since each D2D TR would calculate the interference for each RB and for any TR i M , the computational complexity to obtain the preference is O ( C ) . The complexity of each price rising process, in which TRs that remain single propose their most preferred RBs, is M ^ l o o p . M ^ l o o p is the required number of iterations by the price rising process to resolve the conflict. Then, the computational complexity of the matching process is O ( M M ^ l o o p ) ( M C ) or O ( C M ^ l o o p ) ( C M ) .

7. Simulation Results and Discussion

In this section, the performance of the proposed pricing-based iterative matching algorithm is validated through simulations. Table 2 presents the simulation parameters. We employ mathematical simulation software MATLAB as our simulation environment. We consider a single cellular network with a radius of R = 300 m, which contains M D2D TRs, R RSs and C CUEs. The CUEs are randomly distributed around the BS, and the D2D TRs and the RSs are deployed in a circular hot spot area with a radius of r = 30 m. The Monte Carlo simulation is used, and in general, we set the number of simulations as 2000. We compare the proposed algorithm with three heuristic algorithms, i.e., the exhaustive search with power control, matching with fixed power and random allocation with fixed power. The exhaustive search algorithm that examines all of the possible combinations to find the optimum solution is used to serve as an upper performance benchmark. In the proposed matching with the fixed power algorithm, the power allocation problem is not considered, and the transmit power for any D2D TX or RS is always fixed as the maximum transmit power P m a x .
Remark 2.
The random matching with the fixed power algorithm also employs the maximum transmit power. In addition, the D2D TRs, RSs and RBs are matched in a random way, which is used to indicate the lower performance benchmark. This is a conventional algorithm for the lower bound that is also utilized in many works [12,27,30].
For the optimum exhaustive matching algorithm, the computational complexity increases rapidly with the number of users increasing, which makes it impossible to simulate the optimum exhaustive matching algorithm with a large scale of users. For this reason, we use 2 ∼ 6 pairs of D2D TRs as an example to indicate the performance gap between the proposed algorithm and the optimum one.
Figure 6 shows a snapshot of user locations with M = R = 4 and C = 8 . Figure 7 shows the overall EE performance of all of the D2D pairs versus the number of D2D TR pairs. The performance gap between the proposed algorithm and the optimum exhaustive matching algorithm is smaller compared with the gaps of the other heuristic algorithms, such as matching with fixed power and random allocation with fixed power. For instance, when M = 6 , the proposed algorithm achieves more than 80 % of the optimum performance, and outperforms matching with fixed power and random allocation with fixed power by 68.75 % and 145.45 % , respectively. Besides, the computational complexity of the proposed matching algorithm is an order of magnitude lower than that of the exhaustive algorithm. The EE of D2D pairs of matching with fixed power is worse than that with power control. This proves that the matching with fixed power performance gain achieved by maximizing transmit power in an interference-limited environment is not able to compensate the corresponding EE loss. The random matching with fixed power performs the worst due to the following two reasons. Firstly, the relay selection and spectrum allocation are not jointly optimized, and the power control has not been taken into consideration. Secondly, some potential D2D TR pairs are blocked when the QoS requirements of either D2D TR pairs or CUEs are not satisfied under the random matching.
Figure 8 shows the number of iterations required to reach a stable matching. Simulation results show that it takes only three iterations and four iterations for the proposed algorithm to converge when M = 4 and M = 5 , respectively. As the number of D2D pairs increases, it takes more iterations for the proposed algorithm to converge. The reason is that both the number of matching candidates and the number of matching conflicts increase along with the number of D2D TR pairs. Thus, it takes more iterations for the proposed algorithm to find a suitable RC unit and reach a stable matching.
Figure 9 shows the percentage of served users, which is defined as the number of D2D pairs served successfully by relay stations divided by the total number of D2D pairs. We compare the percentage of served users under two scenarios, i.e., with RSs and without RSs. Matching without RSs means that the D2D TX and the D2D RX communicate directly, and it reuses only one RB. The D2D TX in matching without RSs adopts the optimal transmit power by solving the power allocation problem to maximize the EE. If the QoS requirements of both D2D TR pairs and CUEs cannot be satisfied simultaneously, the D2D TR pairs are blocked and not served. Simulation results show that the proposed algorithm outperforms matching with power control, but without relay and random allocation with fixed power by 18.57 % and 72.92 % , respectively, when the SINR threshold of D2D links is 28 dB. Figure 9 also shows that the average percentage of served users degrades as the SINR threshold increases. The reason is that it is more difficult to find suitable RBs and RSs for the D2D pairs to satisfy the QoS requirements of both D2D pairs and CUEs simultaneously.
Figure 10 and Figure 11 show the influence of changing the number of D2D RSs or CUEs to the average energy efficiency of D2D pairs. We compare the proposed algorithm with two heuristic algorithms, i.e., matching with the fixed power algorithm and random allocation with the fixed power algorithm. In Figure 10, when R = 10 , the proposed algorithm outperforms the matching with the fixed power algorithm and the random allocation with the fixed power algorithm by 129.82 % and 567.84 % , respectively. It is noted that increasing the number of RSs could significantly improve the performance of the proposed algorithm. For instance, when the number of D2D RSs is increased from 6–10, the performance of proposed algorithm is improved by 29.58 % , while the performances of the other two heuristic algorithms are only improved by 24.00 % and 2.58 % . In Figure 11, when C = 16 , the proposed algorithm outperforms the matching with the fixed power algorithm and the random allocation with the fixed power algorithm by 134.62 % and 767.21 % , respectively. Similar to Figure 10, it is clear that increasing the number of CUEs could also dramatically improve the performance of the proposed algorithm. For instance, when the number of CUEs is increased from 12–16, the performance of the proposed algorithm is improved by 47.08 % , while the performances of the other two heuristic algorithms are only improved by 39.49 % and 2.37 % . Thus, the proposed algorithm could receive more benefits from the increasing number of RS and CUE compared to the other two heuristic algorithms.
We also consider another single cellular network with a radius of R = 100 m, for which the D2D TRs and the RSs are deployed in a circular hot spot area with a radius of r = 10 m. In fact, the simulation parameters do not impact the performance results of the proposed algorithm. In other words, the proposed algorithm is adapted to other scenarios, including an indoor scenario.

8. Conclusions

In this paper, we considered the energy-efficient resource management issue in D2D cooperative relay communications and proposed a pricing-based matching approach to jointly optimize relay selection, spectrum allocation and power control. First of all, we formulated the joint optimization problem as an NP-hard four-dimensional matching problem. Secondly, to provide a tractable solution, we proposed a low-complexity two-stage optimization approach, which decomposes the original joint problem into a two-dimensional matching between D2D RXs and RBs for the second hop and a three-dimensional matching among D2D TRs, RSs and RBs for the first hop. Thirdly, for the matching problem in each stage, we proposed a pricing-based iterative matching algorithm to maximize the EE while guaranteeing the QoS requirements of both D2D TRs and CUEs. Finally, the proposed algorithm was compared with some heuristic algorithms through simulations. It was demonstrated that the proposed algorithm cannot only dramatically improve the energy efficiency performance, but also significantly enhance the network coverage with low computational complexity.

Acknowledgments

This work was partially supported by the National Science Foundation of China (NSFC) under Grant Numbers 61601180, 61601181; the Fundamental Research Funds for the Central Universities under Grant Numbers 2016MS17, 2017MS13; supported by Beijing Natural Science Foundation (4174104); Beijing Outstanding Young Talent under Grant Number 2016000020124G081; SGRIXTMMXS[2016]586 2016 State Grid Corporation Science and Technology Program: “Research on communication access technology for the integration, protection, and acquisition of multiple new energy resources”.

Author Contributions

Zhenyu Zhou designed the architecture and system model. Chen Xu provided the idea and wrote the paper. Junhao Feng wrote some sections and performed simulations. Biyao Huang analyzed the experimental results. Shahid Mumtaz and Jonathan Rodriquez developed the simulation system and wrote some sections.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof of Lemma 1.
First of all, we use x m t and x c f to denote the symbols transmitted by D2D TX m t and CUE c f , respectively, and we assume that E [ | x m t | 2 ] = E [ | x c f | 2 ] = 1 . Thus, the received signal of D2D RS r is:
y r = P m t h m t r x m t + P c f h c f r x c f + n 0 ,
where n 0 is the noise terms with complex Gaussian distribution as CN ( 0 , σ 2 ) . The received signal y m r of D2D RX m r can be: expressed as
y m r = P r h r m r β r y r + P c s h c s m r x c s + n 0 ,
where x c s are the symbols transmitted by CUE c s , and the power normalization factor β r of D2D RS r can be expressed as:
β r = 1 / P m t h m t r 2 x m t + P c f h c f r 2 x c f + N 0 .
Thus, by substituting β r into (A2), the received SINR of D2D TR m can be expressed as γ m t , m r = γ r γ m r / ( γ r + γ m r + 1 ) , where γ r = P m t h m t r 2 / ( P c f h c f r 2 + N 0 ) and γ m r = P r h r m r 2 / ( P c s h c s m r 2 + N 0 ) . ☐

Appendix B

Proof of Theorem 1.
First, we define the achieved maximum E E ˜ m , r , c f , c s as utility w m , r , c f , c s of R C r , c f for TR m and the price F ˜ r , c f R C of R C r , c f as the matching cost for TRs. Then, we formulate the payoff of TR m for matching with R C r , c f as w m , r , c f , c s F ˜ r , c f R C . In the matching process, there exist conflicts among TRs, which prefer to be matched with the same RS or RB, and the pricing mechanism is used to solve such conflicts. At the beginning, the prices of RSs and RBs are initialized as zero. During the conflict period, the prices of conflicting RSs and RBs are increased by the step size s gradually. As a result, D2D TRs, which proposed the same RS, would find that this RS is no longer their most preferred RS and, hence, change to propose an alternate RS with the maximum payoff:
( r , c f ) = a r g max r R , c f C ( w m , r , c f , c s F ˜ r , c f R C ) .
According to the rules of matching shown in Algorithm 2, the last TR that remains in the request queue at the end of the price increasing process will be matched with the conflicting RC unit. By assuming that TR m is matched with the RC unit R C r , c f , given the step size s and the total matching benefit w m , r , c f , c s , we can conclude that the conflict process must terminate within w m , r , c f , c s / s steps. The reason is that during the conflict period, the payoff decreases with the prices of a conflicting RC unit increasing. Since the favorite RC unit of TR m keeps raising its price, the suboptimal choice of TR m would have better performance after several iterations of price raising, and then, TR m turns to its suboptimal choice, leaving its favorite RC unit to other competitors. ☐

Appendix C

Proof of Theorem 2.
According to Definition 2, the matching X ˜ would be stable if there exists no blocking pair. To prove that the matching X ˜ is stable, we firstly assume that TR i and RC unit R C j , k form a blocking pair. That is, TR i and RC unit R C j , k have not been matched with each other, but prefer to be matched, and we have X ˜ ( i ) R C j , k , R C j , k i X ˜ ( i ) , i R C j , k X ˜ ( R C j , k ) .
During the matching process, since every TR aims to maximize its individual payoff, which is the different between matching benefit and cost, the payoff optimization problem for each TR can be formulated as m a x j R , k C ( w i , j , k , c s F ˜ j , k R C ) , i M . With the assumption that R C j , k X ˜ ( i ) , TR i must have already proposed to RC unit R C j , k based on the matching rules. However, according to the pricing strategy given in Algorithm 2, there is no X ˜ ( i ) = R C i , j in the matching result, which represents that TR ihas given up the request to R C i , j during the process of rising prices because the final payoff for TR i matched with R C i , j is zero. Moreover, the winner in the contention for R C j , k is X ˜ ( R C j , k ) , i.e., X ˜ ( R C j , k ) R C j , k i . Therefore, if R C j , k i X ˜ ( i ) , the condition i R C j , k X ˜ ( R C j , k ) cannot hold, and this means that TR i and R C j , k cannot form a blocking pair. The analysis result above contradicts the assumption. Thus, the matching X ˜ obtained from Algorithm 2 is stable. ☐

Appendix D

Proof of Theorem 3.
First, we assume that a Pareto improvement exists for the matching X ˜ . The improvement for TR i is defined as R C j , k ; thus, we can obtain R C j , k i X ˜ ( i ) . If there exists an RC unit R C j , k that has not been matched with any TR under the matching X ˜ , i.e., X ˜ ( R C j , k ) = . It is obvious that i R C j , k X ˜ ( R C j , k ) . That is, TR i and R C j , k prefer to be matched with each other and form a blocking pair. This contradicts Theorem 1 that X ˜ is stable. On the contrary, if there exists a R C j , k that has already been matched with TR i , then RS icannot be matched with R C j , k due to the one-to-one matching rule. Since TRs i and i propose the same RC unit, the algorithm would turn to the process of rising prices to solve the conflict. During the price raising process, the matching cost of the RC unit is increased by s at each step, and the payoff of i to be matched with R C j , k would reduce correspondingly. As a result, R C j , k i X ˜ ( i ) can no longer held.
Based on the above analysis, we prove that there exists no Pareto improvement, and the matching X ˜ is weak Pareto optimal for D2D transmitters. ☐

Appendix E

Proof of Theorem 4.
First, we assume that a matching process ends with the prices of F ^ and the matching result X ^ , then the total payoff of this matching is denoted by:
m M , c s C x m , c s ( V m , c s F ^ c s ) .
Here, x m , c s = 1 when X ^ ( m ) = c s . Since each D2D TR wants to maximize its individual payoff, the matching result X ^ would generate the maximum total payoff due to Theorem 4. The total payoff of the matching X ^ can be calculated by subtracting the sum of prices from the total value of X ^ . It is noted that the sum of prices F ^ is independent of the matching X ^ . Therefore, the matching X ^ that maximizes the total payoff is also the one that maximizes the total utility. ☐

Abbreviations

The following abbreviations are used in this manuscript:
D2DDevice to device
UEsUser equipment
BSBase station
CUEsCellular user equipment
EEEnergy efficiency
QoSQuality of serve
TXsD2D transmitters
RSsD2D relay stations
RXsD2D receivers
TRsD2D transmitter-receiver pairs

References

  1. Doppler, K.; Rinne, M.; Ribeiro, C.B.; Hugl, K. Device-to-device communications as an underlay to LTE-advanced networks. IEEE Commun. Mag. 2009, 47, 42–49. [Google Scholar] [CrossRef]
  2. Tehrani, M.N.; Uysal, M.; Yanikomeroglu, H. Device-to-device communication in 5G cellular networks: Challenges, solutions, and future directions. IEEE Commun. Mag. 2014, 52, 86–92. [Google Scholar] [CrossRef]
  3. Xu, C.; Song, L.; Han, Z. Resource Management for Device-to-Device Underlay Communication, Series: Springer Briefs in Computer Science; Springer: New York, NY, USA, 2014. [Google Scholar]
  4. Nunes, I.O.; de Melo, P.O.S.V.; Loureiro, A.A.F. Leveraging D2D multihop communication through social group meeting awareness. IEEE Wirel. Commun. 2016, 23, 12–19. [Google Scholar] [CrossRef]
  5. Kartsakli, E.; Verikoukis, C.V.; Alonso, L. Performance analysis of the distributed queuing collision avoidance (DQCA) protocol with link adaptation. IEEE Trans. Wirel. Commun. 2009, 8, 644–647. [Google Scholar] [CrossRef]
  6. Yuan, D.; Angelakis, V.; Chenand, L.; Karipidis, E.; Larsson, E.G. On optimal link activation with interference cancelation in wireless networking. IEEE Trans. Veh. Technol. 2013, 62, 939–945. [Google Scholar] [CrossRef]
  7. Nishiyama, H.; Ito, M.; Kato, N. Relay-by-smartphone: Realizing multihop device-to-device communications. IEEE Commun. Mag. 2014, 52, 56–65. [Google Scholar] [CrossRef]
  8. Wei, L.; Hu, R.Q.; Qian, Y.; Wu, G. Enable device-to-device communications underlying cellular networks: Challenges and research aspects. IEEE Commun. Mag. 2014, 52, 90–96. [Google Scholar] [CrossRef]
  9. Kartsakli, E.; Antonopoulos, A.; Lalos, A.S.; Tennina, S.; Di Renzo, M.; Alonso, L.; Verikoukis, C. Reliable MAC design for ambient assisted living: Moving the coordination to the cloud. IEEE Commun. Mag. 2015, 53, 78–86. [Google Scholar] [CrossRef]
  10. Zhou, Z.; Dong, M.; Ota, K.; Shi, R.; Liu, Z.; Sato, T. Game-theoretic approach to energy-efficient resource allocation in device-to-device underlay communications. IEEE IET Commun. 2015, 9, 375–385. [Google Scholar] [CrossRef]
  11. Zhou, Z.; Ota, K.; Dong, M.; Xu, C. Energy-efficient matching for resource allocation in D2D enabled cellular networks. IEEE Trans. Veh. Technol. 2016, PP, 1–13. [Google Scholar] [CrossRef]
  12. Xu, C.; Gao, C.; Zhou, Z.; Chang, Z.; Jia, Y. Social network-based content delivery in device-to-device underlay cellular networks using matching theory. IEEE Access. 2016, 5, 924–937. [Google Scholar] [CrossRef]
  13. Ren, Y.; Shen, J.; Wang, J.; Han, J.; Lee, S. Mutual verifiable provable data auditing in public cloud storage. J. Internet Technol. 2015, 16, 317–323. [Google Scholar]
  14. Fu, Z.; Sun, X.; Liu, Q.; Zhou, L.; Shu, J. Achieving efficient cloud search services: Multi-keyword ranked search over encrypted cloud data supporting parallel computing. IEICE Trans. Commun. 2015, 98, 190–200. [Google Scholar] [CrossRef]
  15. Granelli, F.; Gebremariam, A.A.; Usman, M.; Cugini, F.; Stamati, V.; Alitska, M.; Chatzimisios, P. Software defined and virtualized wireless access in future wireless networks: Scenarios and standards. IEEE Commun. Mag. 2015, 53, 26–34. [Google Scholar] [CrossRef]
  16. Ma, B.; Shah-Mansouri, H.; Wong, V.W.S. A matching approach for power efficient relay selection in full duplex D2D networks. In Proceedings of the 2016 IEEE International Conference on Communications (ICC), Kuala Lumpur, Malaysia, 23–27 May 2016; pp. 1–6. [Google Scholar]
  17. Zhang, M.; Chen, X.; Zhang, J. Social-aware relay selection for cooperative networking: An optimal stopping approach. In Proceedings of the 2014 IEEE International Conference on Communications (ICC), Sydney, Australia, 10–14 June 2014; pp. 2257–2262. [Google Scholar]
  18. Pei, Y.; Liang, Y.C. Resource allocation for device-to-device communications overlaying two-way cellular networks. IEEE Trans. Wirel. Commun. 2013, 12, 3611–3621. [Google Scholar] [CrossRef]
  19. Cao, Y.; Jiang, T.; Wang, C. Cooperative device-to-device communications in cellular networks. IEEE Trans. Wirel. Commun. 2015, 22, 124–129. [Google Scholar] [CrossRef]
  20. Zhao, Y.; Li, Y.; Chen, X.; Ge, N. Joint optimization of resource allocation and relay selection for network coding aided device-to-device communications. IEEE Commun. Lett. 2015, 19, 807–810. [Google Scholar] [CrossRef]
  21. Zhao, M.; Gu, X.; Wu, D.; Ren, L. A two-stages relay selection and resource allocation joint method for d2d communication system. In Proceedings of the 2016 IEEE Wireless Communications and Networking Conference, Doha, Qatar, 3–6 April 2016; pp. 1–6. [Google Scholar]
  22. Granelli, F.; Pawelczak, P.; Prasad, R.V.; Subbalakshmi, K.P.; Chandramouli, R.; Hoffmeyer, J.A.; Berger, H.S. Standardization and research in cognitive and dynamic spectrum access networks: IEEE SCC41 efforts and other activities. IEEE Commun. Mag. 2010, 48, 71–79. [Google Scholar] [CrossRef]
  23. Kim, T.; Dong, M. An iterative hungarian method to joint relay selection and resource allocation for d2d communications. IEEE Wirel. Commun. Lett. 2014, 3, 625–628. [Google Scholar]
  24. Zhou, Z.; Wang, Y.; Wu, Q.M.J.; Yang, C.; Sun, X. Effective and efficient global context verification for image copy detection. IEEE Trans. Inf. Forensics Secur. 2017, 12, 48–63. [Google Scholar] [CrossRef]
  25. Yuan, D.; Joung, J.; Hoand, C.K.; Sun, S. On tractability aspects of optimal resource allocation in OFDMA systems. IEEE Trans. Veh. Technol. 2013, 62, 863–873. [Google Scholar] [CrossRef]
  26. Zhou, Z.; Dong, M.; Ota, K.; Chang, Z. Energy-efficient context-aware matching for resource allocation in ultra-dense small cells. IEEE Access. 2015, 3, 1849–1860. [Google Scholar] [CrossRef]
  27. Zhou, Z.; Ma, G.; Dong, M.; Ota, K.; Xu, C.; Jia, Y. Iterative energy-efficient stable matching approach for context-aware resource allocation in D2D communications. IEEE Access. 2016, 4, 6181–6196. [Google Scholar] [CrossRef]
  28. Roth, A.; Sotomayor, M. Two-Sided Matching: A Study in GameTheoretic Modeling and Analysis; Cambridge University Press: Cambridge, UK, 1992. [Google Scholar]
  29. Islam, M.T.; Taha, A.M.; Akl, S.; Choudhury, S. A two-phase auction-based fair resource allocation for underlying D2D communications. Proceedings of tne IEEE International Conference on Communications (ICC), Kuala Lumpur, Malaysia, 22–27 May 2016; pp. 1–6. [Google Scholar]
  30. Swain, S.; Mishra, N.; Rath, S.; Sahoo, B.P.S. Spectrum sharing for D2D communication in 5G cellular networks: An auction-based model. In Proceedings of the IEEE India Conference (INDICON), Bengaluru, India, 16–18 December 2016; pp. 1–6. [Google Scholar]
  31. Steri, G.; Baldini, G.; Fovino, I.N.; Neisse, R.; Goratti, L. A novel multi-hop secure LTE-D2D communication protocol for iot scenarios. In Proceedings of the 2016 International Conferenceon Information and Communication Technology and Systems (ICTS), Thessaloniki, Greece, 16–18 May 2016; pp. 1–6. [Google Scholar]
  32. Astaneh, S.A.; Gazor, S. Resource allocation and relay selection for collaborative communications. IEEE Trans. Wirel. Commun. 2009, 8, 6126–6133. [Google Scholar] [CrossRef]
  33. Cai, Y.; Ni, Y.; Zhu, H. Performance for device-to-device communication with three-time-slot two-way amplify-and-forward relay protocol. China Commun. 2015, 12, 1–11. [Google Scholar]
  34. Xiao, S.; Zhou, X.; Feng, D.; Wu, Y.Y.; Li, G.Y.; Guo, W. Energy-efficient mobile association in heterogeneous networks with device-to-device communications. IEEE Trans. Wirel. Commun. 2016, 15, 5260–5271. [Google Scholar] [CrossRef]
  35. Wang, S.; Guo, W.; Zhou, Z.; Wu, Y.; Chu, X. Outage probability for multi-hop D2D communications with shortest path routing. IEEE Commun. Lett. 2015, 19, 1997–2000. [Google Scholar] [CrossRef]
  36. Shen, J.; Tan, H.; Wang, J.; Wang, J.; Lee, S. A novel routing protocol providing good transmission reliability in underwater sensor networks. J. Internet Technol. 2015, 16, 171–178. [Google Scholar]
  37. Alizadeh, A.; Forouzan, N.; Ghorashi, S.A.; Sadough, S. M.-S. A graph-based approach for relay selection and resource allocation in cognitive two-way relay networks. In Proceedings of the 2011 Wireless Advanced (WiAd 2011), London, UK, 20–22 June 2011; pp. 101–105. [Google Scholar]
  38. Cao, J.; Zhang, T.; Zeng, Z.; Liu, D. Interference-aware relay selection scheme in cooperative relay networks. In Proceedings of the 2013 16th International Symposium on Wireless Personal Multimedia Communications (WPMC), Atlantic City, NJ, USA, 24–27 June 2013; pp. 1–6. [Google Scholar]
  39. Yuan, H.; Guo, W.; Wang, S. D2D multi-hop routing: Collision probability and routing strategy with limited location information. In Proceedings of the 2015 IEEE International Conference on Communication Workshop (ICCW), Krugersdorp, South Africa, 24–25 March 2015; pp. 670–674. [Google Scholar]
  40. Yuan, H.; Guo, W.; Wang, S. Emergency route selection for D2D cellular communications during an urban terrorist attack. In Proceedings of the 2014 IEEE International Conference on Communications, Sydney, Australia, 10–14 June 2014; pp. 237–242. [Google Scholar]
  41. Zhang, G.; Wang, R.; Wu, S.; Chen, L.; Dai, B.; Yang, K.; Zhao, L. Joint relay selection and resource allocation for D2D-enabled cellular communications. In Proceedings of the 15th IEEE International Conference on Computer and Information Technology (CIT 2015), Liverpool, UK, 26–28 October 2015; pp. 1186–1192. [Google Scholar]
  42. Zainaldin, A.; Halabian, H.; Lambadaris, I. Joint resource allocation and relay selection in LTE-advanced network using hybrid co-operative relaying and network coding. IEEE Trans. Wirel. Commun. 2016, 15, 4348–4361. [Google Scholar] [CrossRef]
  43. Chang, Z.; Ristaniemi, T.; Niu, Z. Radio resource allocation for collaborative OFDMA relay networks with imperfect channel state information. IEEE Trans. Wirel. Commun. 2014, 13, 2824–2835. [Google Scholar]
  44. Bhardwaj, V.; Murthy, C.R. On optimal routing and power allocation for D2D communications. In Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brisbane, Australia, 19–24 April 2015; pp. 3063–3067. [Google Scholar]
  45. Kartsakli, E.; Zorba, N.; Alonso, L.; Verikoukis, C.V. A threshold-selective multiuser downlink MAC scheme for 802.11n wireless networks. IEEE Trans. Wirel. Commun. 2011, 10, 857–867. [Google Scholar] [CrossRef]
  46. Li, J.; Xia, W.; Xing, S.; Shen, L. Transmission scheduling and congestion control for multi-hop D2D underlying cellular networks. In Proceedings of the 2014 IEEE 25th Annual International Symposium on Personal, Indoor, and Mobile Radio Communications (PIMRC), Washington, DC, USA, 2–5 September 2014; pp. 996–1000. [Google Scholar]
  47. Deng, J.; Dowhuszko, A.A.; Freij, R.; Tirkkonen, O. Relay Selection and Resource Allocation for D2D-Relaying under Uplink Cellular Power Control. In Proceedings of the 2015 IEEE Globecom Workshops (GC Wkshps), San Diego, CA, USA, 6–10 December 2015; pp. 1–6. [Google Scholar]
  48. Wang, L.; Peng, T.; Yang, Y.; Wang, W. Interference constrained D2D communication with relay underlying cellular networks. In Proceedings of the 2013 IEEE 78th Vehicular Technology Conference (VTC Fall), Las Vegas, NV, USA, 2–5 September 2013; pp. 1–5. [Google Scholar]
  49. Wang, L.; Peng, T.; Yang, Y.; Wang, W. Interference constrained relay selection of D2D communication for relay purpose underlying cellular networks. In Proceedings of the 2012 8th International Conference on Wireless Communications, Networking and Mobile Computing (WiCOM), Shanghai, China, 21–23 September 2012; pp. 1–5. [Google Scholar]
  50. Gong, J.; Zhou, S.; Niu, Z. Optimal power allocation for energy harvesting and power grid coexisting wireless communication systems. IEEE Trans. Wirel. Commun. 2013, 61, 3040–3049. [Google Scholar] [CrossRef]
  51. Munir, D.; Shah, S.T.; Lee, W.J.; Chung, M.Y.; Hasan, S.F. Selection of relay ue with energy harvesting capabilities in public safety environment. In Proceedings of the 2016 International Conference on Information Networking (ICOIN), Kota Kinabalu, Malaysia, 13–15 January 2016; pp. 40–44. [Google Scholar]
  52. Zhou, S.; Chen, T.; Chen, W.; Niu, Z. Outage minimization for a fading wireless link with energy harvesting transmitter and receiver. IEEE J. Sel. Areas Commun. 2015, 33, 496–511. [Google Scholar] [CrossRef]
  53. Haija, A.A.A.; Vu, M. Spectral Efficiency and Outage Performance for Hybrid D2D-Infrastructure Uplink Cooperation. IEEE Trans. Wirel. Commun. 2015, 14, 1183–1198. [Google Scholar] [CrossRef]
  54. Wei, L.; Hu, R.Q.; Qian, Y.; Wu, G. Energy Efficiency and Spectrum Efficiency of Multihop Device-to-Device Communications Underlaying Cellular Networks. IEEE Trans. Veh. Technol. 2016, 65, 367–380. [Google Scholar] [CrossRef]
Figure 1. System model of D2D cooperative relay communications underlying the cellular network.
Figure 1. System model of D2D cooperative relay communications underlying the cellular network.
Applsci 07 00491 g001
Figure 2. An illustration of the two-dimensional matching in the second hop D2D link.
Figure 2. An illustration of the two-dimensional matching in the second hop D2D link.
Applsci 07 00491 g002
Figure 3. Flowchart of the two-dimensional matching algorithm.
Figure 3. Flowchart of the two-dimensional matching algorithm.
Applsci 07 00491 g003
Figure 4. An illustration of the three-dimensional matching in the first hop D2D link.
Figure 4. An illustration of the three-dimensional matching in the first hop D2D link.
Applsci 07 00491 g004
Figure 5. Flowchart of the three-dimensional matching algorithm.
Figure 5. Flowchart of the three-dimensional matching algorithm.
Applsci 07 00491 g005
Figure 6. A snapshot of user locations for a single cellular network with C CUEs, M D2D pairs and R RSs ( C = 8 , M = 4 , R = 4 ; the cell radius is 300 m, and the size of the spot hot is 30 m, respectively).
Figure 6. A snapshot of user locations for a single cellular network with C CUEs, M D2D pairs and R RSs ( C = 8 , M = 4 , R = 4 ; the cell radius is 300 m, and the size of the spot hot is 30 m, respectively).
Applsci 07 00491 g006
Figure 7. Energy efficiency of D2D pairs vs. the number of D2D pairs.
Figure 7. Energy efficiency of D2D pairs vs. the number of D2D pairs.
Applsci 07 00491 g007
Figure 8. Energy efficiency of D2D pairs vs. the number of matching iterations.
Figure 8. Energy efficiency of D2D pairs vs. the number of matching iterations.
Applsci 07 00491 g008
Figure 9. Average percentage of served users vs. the threshold of SINR.
Figure 9. Average percentage of served users vs. the threshold of SINR.
Applsci 07 00491 g009
Figure 10. Energy efficiency of D2D pairs vs. the number of RSs.
Figure 10. Energy efficiency of D2D pairs vs. the number of RSs.
Applsci 07 00491 g010
Figure 11. Energy efficiency of D2D pairs vs. the number of D2D CUEs.
Figure 11. Energy efficiency of D2D pairs vs. the number of D2D CUEs.
Applsci 07 00491 g011
Table 1. Nomenclature.
Table 1. Nomenclature.
VariableMeaning
m t D2D transmitter
m r D2D receiver
rD2D relay station
cCellular user
c f The cellular user whose resource blocks are reused in first hop D2D link
c s The cellular user whose resource blocks are reused in second hop D2D link
P m t Transmit power of D2D transmitter
P r Transmit power of relay station
P c f Transmit power of cellular user c f
P c s Transmit power of cellular user c s
d m t r Transmission distance between D2D transmitter and relay station
d c f r Transmission distance between cellular user and relay station
h m t r Channel response of the first hop D2D link
h c f r Channel response of the interference link from cellular user c f
α Path-loss exponent
h s , m t r Small-scale fading (Rayleigh) channel coefficients between D2D transmitter and relay station
h s , c f r Small-scale fading (Rayleigh) channel coefficients between cellular user and relay station
N 0 One-sided power spectral density of the additive white Gaussian noise
γ r SINR received by relay station r
γ m r SINR received by D2D receiver m r
γ m r , m r SINR of the two-hop D2D link
γ c f SINR of the cellular user whose resource blocks are reused by the first hop D2D link
γ c s SINR of the cellular user whose resource blocks are reused by the second hop D2D link
E E m , r , c f , c s Energy efficiency of the two-hop D2D link
Table 2. Simulation parameters.
Table 2. Simulation parameters.
Simulation ParameterValue
Cell radius R300 m
Radius of the hot zone r30 m
Path loss exponent α 4
Max transmit power of D2D TXs and RSs P m a x 23 dBm
Transmit power of CUEs P c 20 dBm
Noise power N 0 −114 dBm
Number of D2D TR pairs M2 ∼ 6
Number of relay R2 ∼ 6
Number of CUEs C4 ∼ 12
SINR threshold γ m i n D 25 dB
SINR threshold γ m i n C 25 dB
Step size s0.1

Share and Cite

MDPI and ACS Style

Xu, C.; Feng, J.; Huang, B.; Zhou, Z.; Mumtaz, S.; Rodriguez, J. Joint Relay Selection and Resource Allocation for Energy-Efficient D2D Cooperative Communications Using Matching Theory. Appl. Sci. 2017, 7, 491. https://doi.org/10.3390/app7050491

AMA Style

Xu C, Feng J, Huang B, Zhou Z, Mumtaz S, Rodriguez J. Joint Relay Selection and Resource Allocation for Energy-Efficient D2D Cooperative Communications Using Matching Theory. Applied Sciences. 2017; 7(5):491. https://doi.org/10.3390/app7050491

Chicago/Turabian Style

Xu, Chen, Junhao Feng, Biyao Huang, Zhenyu Zhou, Shahid Mumtaz, and Jonathan Rodriguez. 2017. "Joint Relay Selection and Resource Allocation for Energy-Efficient D2D Cooperative Communications Using Matching Theory" Applied Sciences 7, no. 5: 491. https://doi.org/10.3390/app7050491

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop