Next Article in Journal
Machine Learning Techniques Applied to Dose Prediction in Computed Tomography Tests
Next Article in Special Issue
Cognitive Interference Cancellation with Digital Channelizer for Satellite Communication
Previous Article in Journal
Electronic Tongues for Inedible Media
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Resource Allocation in Cognitive Radio Wireless Sensor Networks with Energy Harvesting

Department of Communication Engineering, University of Science and Technology Beijing, Beijing 10083, China
*
Author to whom correspondence should be addressed.
Sensors 2019, 19(23), 5115; https://doi.org/10.3390/s19235115
Submission received: 4 November 2019 / Revised: 14 November 2019 / Accepted: 20 November 2019 / Published: 22 November 2019

Abstract

:
The progress of science and technology and the expansion of the Internet of Things make the information transmission between communication infrastructure and wireless sensors become more and more convenient. For the power-limited wireless sensors, the life time can be extended through the energy-harvesting technique. Additionally, wireless sensors can use the unauthored spectrum resource to complete certain information transmission tasks based on cognitive radio. Harvesting enough energy from the environments, the wireless sensors, works as the second users (SUs) can lease spectrum resource from the primary user (PU) to finish their task and bring additional transmission cost to themselves. To minimize the overall cost of SUs and to maximize the spectrum profit of the PU during the information transmission period, we formulated a differential game model to solve the resource allocation problem in the cognitive radio wireless sensor networks with energy harvesting, considering the SUs as the game players. By solving the proposed resource allocation game model, we found the open loop Nash equilibrium solutions and feedback Nash equilibrium solutions for all SUs as the optimal control strategies. Ultimately, series numerical simulation experiments have been made to demonstrate the rationality and effectiveness of the game model.

1. Introduction

With the development of the Internet of Things (IoTs), all things will have the ability to perceive the environment. Through the deep integration of the physical world and the digital world [1], the IoTs revolutionizes connection and interaction among all objects. Based on the IoT, the physical equipment and physical world are no longer cold but become a vitality community. In addition, wireless sensor networks (WSN), as the nerve ending of IoTs, have attracted extensive attention and research interest by many scholars in recent years [2], and their theoretical and applied research has received more and more attention [3].
WSNs are composed of a series of sensor nodes. Generally, communication between the sensor nodes is completed using the common frequency band. However, with the development of wireless communication, this frequency band is increasingly crowded, and the communication between sensor nodes receives interference not only by their own nodes, but also increasingly serious and uncontrollable interference from other application type networks. As interconnected devices increase rapidly, sensor nodes with multiple applications overlapping coverage areas will suffer from more severe interference. Over the past few years, cognitive radio networks have been proposed as an effective way to alleviate the spectrum scarcity problem [4]. Many scholars have realized that cognitive radio technology can be an effective means to improve the utilization of spectrum resources and have done a great deal of research on it [5].
By configuring the cognitive radio module to the sensor node, we can detect the state of the licensed spectrum and opportunistically utilize the idle licensed spectrum for data transmission [6]. A wireless network of sensor nodes with cognitive radio modules is also known as a cognitive radio wireless sensor network (CRWSN), which is no longer subject to signal interference from the public frequency band. Although CRWSNs are no longer subject to transmission interference caused by a lack of spectrum, nodes in the sensor network need to consume additional energy to implement cognitive radio functions, such as spectrum detection, spectrum switching, etc., which are generally powered by difficult-to-replace batteries. For sensor nodes, the problem of insufficient energy becomes more severe. Therefore, compared with traditional wireless sensor networks, the network lifetime problem of CRWSNs caused by insufficient node energy becomes more urgent [7]. In order to overcome the lack of nodes and ensure that the network can continue to operate effectively, energy-harvesting technology has begun to be adopted by sensor nodes. Using energy-harvesting technology, sensor nodes can collect ambient conditions such as solar energy, wind energy, and vibration energy [8,9]. The nodes continuously obtain energy from the surrounding environment, and the lifetime of the sensor network can be effectively extended [10].
Cognitive radio wireless sensor networks with energy harvesting can achieve a continuous supply of spectrum resources and energy resources to make up for the shortcomings of traditional sensor networks [11]. Compared with traditional sensor networks, CRWSNs with energy harvesting have obvious advantages and great development potentials, which brings new opportunities to the development of sensor networks and lays a foundation for the development of the Internet of Things. However, both energy and spectrum resources in the CRWSNs are dynamic and difficult to predict. The sensor node performs spectrum detection by consuming energy to obtain transmission opportunities. Under the guarantee of sufficient residual energy, the sensor node can use this opportunity to transmit the collected data; otherwise, even if there is a transmission opportunity, the sensor node will not have enough energy and no data transmission or data collection.
Therefore, the resource allocation of CRWSNs with energy harvesting needs to consider the dynamic energy-harvesting process and the random available spectrum resources. Researchers have made some achievements in the research of the above problems. Firstly, there are three popular resource allocation schemes [12] in the field of cognitive radio sensor networks: centralized, cluster-based, and distributed scenarios. The centralized scheme means that the central node of the cognitive radio network is the network backbone, which can control the power of other sensing nodes in the network and make decisions on resource allocation. The centralized scheme can be found in the literature [13,14,15,16], which achieves multiple optimization goals through efficient decision making, such as improving energy efficiency and maximizing throughput. However, the control channel transmits information that requires higher power, and the network is susceptible to central node failure. The cluster-based scheme means that the network is divided into multiple clusters, and each cluster has one cluster head and multiple intracluster nodes. After the cluster head aggregates the information, it makes the decision of resource allocation and communicates to each node. A number of cluster-based approaches have been adopted in some of the literature [17,18,19,20]. There are fewer members in the cluster, which can reduce the signaling overhead for each cluster head, but if the cluster head node fails, the connections to neighboring nodes in the cluster may be invalid. The distributed scenarios means that each node in a distributed system makes decisions either autonomously or in cooperation with neighboring nodes. In a time-varying environment, it is possible to quickly adapt to node failures and ensure robustness. However, this solution is based on local information decisions, which are susceptible to interference and malicious attacks, resulting in inefficient solutions [21,22,23].
Additionally, lots of productive work has been done on the resource allocation problem under the cognitive radio networks with energy harvesting. Previous work focused mainly on the optimal spectrum sensing algorithms to improve spectrum utilization efficiency [24]. There are also some studies that focused on transmission strategies, which aim to improve energy use efficiency and choose different channel models to improve the efficiency of energy collections [25,26]. Recently, cognitive radio network energy-harvesting systems based on spectrum sharing have become a significant research direction. A novel energy cooperation transmission scheme was proposed for cognitive spectrum-sharing-based D2D communication system in [27]. Li et al. [28] considered a spectrum-sharing scheme based on simultaneous wireless information and power transfer (SWIPT). Zhang et al. [29] proposed a new cooperative spectrum-sharing protocol with dynamic time-slot allocation based on energy harvesting. However, none of the related works mentioned above takes into account resource allocation in CRWSNs with energy harvesting.
In CRWSNs with energy harvesting, the wireless sensors can be considered as the secondary users (SUs), and the base station or infrastructure can be seen as the primary user (PU). The SUs can capture the spectrum from the PU. Before information transmission, the SUs should harvest enough energy for information transmission. Then, when the channels are not occupied by the PU, the SUs will use the harvest energy to transmit information. The available information transmission spectrum mainly depends on the spectrum leased from the PU. The SUs should control their spectrum requirements to lower the spectrum lease cost, while completing the information transmission tasks. In this paper, we investigated the optimal resource allocation problem for the PUs and SU based on a differential game. The SUs control their spectrum requirements to minimize the cost.
In this paper, we propose a differential game-based resource allocation problem in CRWSN. The system state of the proposed CRWSN is the capacity of the spectrum resource that the PU wants to lease. All the wireless sensors that can be seen as SUs should control the resource level leased from the PU to minimize their cost during the information transmission. We obtained open loop and feedback Nash equilibria from the SUs. The numerical results are given to present the correctness of the differential game analysis. The whole paper is organized as follows: Section 2 is the system differential game model and problem formulation, which consists of two parts, that is, system model and game formulation. Section 3 is the game analysis, which consists of three parts, that is, open loop Nash equilibrium, feedback Nash equilibrium under the finite horizon, and feedback Nash equilibrium under the infinite horizon. Section 4 is a numerical simulation and analysis. Finally, there is the conclusion about the main work and the corresponding summary in Section 5.

2. System Model and Problem Formulation

2.1. System Model

As shown in Figure 1, here, we consider a CRWSN with one primary user (PU), one fusion center (FC), and N secondary users (SUs) with N 2 . All the SUs in the SU system share the same spectrum band with the PU. Meanwhile, the SUs communicates with the FC via the same channel of the PU. We assume that the SU is equipped with the energy harvest circuits. The SUs can harvest energy from the ambient environment (e.g., solar, wind, and radio frequency) to increase the lifetime. The harvest energy is stored in a rechargeable battery with finite energy storage capacity. With the harvest energy, the SUs can achieve information transmission with the FC. The SUs can harvest energy when the PU’s spectrum is busy. The status of the spectrum is updated by the PU. When the spectrum is not used by the PU, the PU will announce its status and lease its spectrum to the SUs. The spectrum status can be seen by the SUs through the available spectrum idle status map [30], which can be dynamically updated by the PU through collaboration with SUs. Using the leased spectrum, the SUs can directly transmit data to the FC with the harvest energy. The work mode of SUs can be considered as the harvest-then-transmit mode.
Let u i ( t ) denote the leased spectrum of SU i at time t , for i N , where each SU controls its spectrum band from the PU. Let U i be the set of admissible spectrum band. The harvest-then-transmit mode in our proposed model includes two phases as shown in Figure 2, the energy-harvesting phase and the information transmission phase, respectively. In the energy-harvesting phase, the SUs will harvest enough energy for information transmission. In the information transmission phase, the SUs will use the harvested energy and leased spectrum to complete the information transmission tasks. Let β i denote the normalized channel idle period, and ( 1 β i ) is the normalized channel busy period. When the channel is idle, β i is the time fraction of the SUs for information transmission. When the channel is busy, ( 1 β i ) is the time fraction of the SUs for energy harvesting. The harvest energy during the ( 1 β i ) period will be stored in the battery of the SUs for information transmission during the β i period. For example, the normalized idle period for the Disney TV channel is approximately 0.25 (25%), which means β i = 0.25 for the SUs in the Disney TV channel. Let π p denote the spectrum price of the PU when leasing the spectrum to SUs. The PU can adjust its price continuously as a seller during the idle period to maximize the spectrum income. The SUs should adjust the corresponding leased spectrum according to the needs and the spectrum price announced by the PU.

2.2. Game Formulation

We formulate the resource allocation problem in the CRWSN with energy harvesting as a differential game as follows:
  • Players: The set of N SUs in the proposed CRWSN are the players of the differential game;
  • State: The system state of the proposed CRWSN is the capacity of the spectrum resource the PU wants to lease;
  • Strategy: The strategy of each SU is the leased spectrum resource from the PU. Accordingly, the strategy set can be denoted as U i = { u 1 , u 2 , , u N } .
We aim at minimizing the overall cost of the SUs during the energy-harvesting and information transmission period. The overall cost function for each SU is given by the following equation:
U i ( t ) = ω i s p U i s p ( t ) + ω i e h U i e h ( t ) + ω i d i s U i d i s ( t ) ,
where U i s p ( t ) is the spectrum cost for the spectrum leased from the PU, which is a linear function of the spectrum leased from the PU and is given by β i π p u i ( t ) . If the spectrum price π p is announced by the PU, the SUs should pay for the leased spectrum to the PU with payment β i π p u i ( t ) . Because the battery capacity of the SUs is limited, each SU is equipped with the energy-harvesting circuit to harvest enough energy before the information transmission. U i e h ( t ) is the cost of energy harvesting of SU i to have enough energy for information transmission and can be given by ( 1 β i ) ε i 2 u i 2 ( t ) / η i , where η i is the conversion efficiency of the harvest energy and ε i is the QoS requirements of SU i , such as energy efficiency, or spectrum efficiency. Based on the harvest-then-transmit mode, there are two costs during the harvest-then-transmit process, the spectrum cost U i s p ( t ) and the energy-harvesting cost U i e h ( t ) . U i d i s ( t ) is the discrepancy cost between the spectrum requirements and available capacity of the spectrum resource. Without loss of generality, we use the quadratic function to express the discrepancy cost between the spectrum requirements and the available capacity, and the cost is denoted by ( u i ( t ) x ( t ) ) 2 . ω i s p , ω i e h , and ω i d i s are weighted parameters. In the above equation, we find that we do not consider the interference cost, because we have the spectrum cost due to the spectrum leased from the PU.
The SUs will acquire the availability of spectrum resource at the beginning of the idle time slot from the PU and will share the spectrum resource with the other SUs. To avoid interference among SUs, each of the spectrum resources can only be allocated to one SU. Meanwhile, at the beginning of the idle time slot, the PU will announce a unit price for the leased spectrum. Then, the SUs should decide the spectrum requirements based on the price and their acquirements. Let x ( t ) be the capacity of the spectrum resource at time t . In particular, we have U i R + , x ( t ) > 0 , and U i = 0 for x ( t ) = 0 . The evolution of spectrum capacity x ( t ) can be given by the following differential equation:
d x ( t ) = [ i = 1 N α i u i ( t ) + δ x ( t ) ] d t ,
where α i is a negative constant which means the spectrum leasing efficiency of SU i from the PU. δ is the spectrum loss rate during the spectrum leasing. From Equation (2), we can find that the dynamic variation of the spectrum capacity is mainly controlled by the control variables u i ( t ) . The evolution of spectrum capacity is an ordinary differential equation of variables u i ( t ) .
Based on the above models, we formulate the optimal resource allocation problem in the proposed CRWSN. Assuming the observing time for the proposed game is [ t 0 , T ] , the objective for SU i can be written as differential game as follows:
J i ( t ) = min u i ( t ) { t 0 T U i ( t ) e r t d t + Φ ( x T ) } = min u i ( t ) { t 0 T [ ω i s p U i s p ( t ) + ω i e h U i e h ( t ) + ω i d i s U i d i s ( t ) ] e r t d t + Φ i ( x ( T ) ) } ,
subject to:
d x ( t ) d t = i = 1 N α i u i ( t ) + δ x ( t ) ,
where Φ i ( x ( T ) ) is the terminal cost after the game period that the SUs can achieve, which is given by the spectrum capacity x ( t ) at the end of the time interval. r is the discount rate and e r t is the discount factor. SU i is to find the optimal spectrum strategy that can minimize its overall cost function over the time interval [ t 0 , T ] . In the following section, the equilibrium to the differential game is given based on the Hamilton-Jacob-Bellman (HJB) function.

3. Game Analysis

In this section, we try to find the equilibrium solution to the proposed differential game. Based on the spectrum price controlled by the PU, the SUs should control their spectrum requirements to minimize the overall cost given by Equations (3) and (4). Firstly, assuming the initial state of the spectrum capacity is known by all the SUs, we can find the open loop Nash equilibrium for each SU. Then, the feedback solutions to the proposed game are discussed based on Bellman dynamic programming, when the game players know the exact system state at time instant t .

3.1. Open Loop Nash Equilibrium

First, we discuss the open loop solutions. In order to get the open loop equilibrium solutions, some definitions are needed as follows.
Definition 1.
The allocated spectrum resource u i * ( t ) from PU to SU is optimal under the open loop condition if the following inequality holds for all control variables u i ( t )   u i * ( t ) in the set of admissible spectrum U i ,
J i ( t , u i * ( t ) ) J i ( t , u i ( t ) ) .
Definition 2.
The Hamiltonian function of each SU in the proposed differential game in the time period [ t 0 , T ] can be given as follows:
H i ( t , x ) = U i ( t ) e r t + Λ i ( t ) d x d t ,
where Λ i ( t ) is the costate function and is given by:
d Λ i ( t ) d t = H i ( t , x ) x .
Theorem 1.
The allocated spectrum resource u i * ( t ) provides an open loop Nash equilibrium to the proposed resource allocation game in Equations (3) and (4) if there are constate functions Λ i ( t ) , satisfying the following equations:
u i * ( t ) = a r g min u i ( t ) U i H i ( t , x ) =   a r g min u i ( t ) U i [ U i ( t ) e r t + Λ i ( t ) d x d t ]
Considering the optimal control problem given by Equations (3) and (4), based on the Pontryagin’s maximum principle, we can have the Nash equilibrium solutions of the optimal resource allocation problem for each SU.
Theorem 2.
The optimal resource allocation strategy of SU i is given by:
u i * ( t ) = ( 2 ω i d i s + 2 ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( Λ i e r t α i + ω i s p β i π p 2 ω i d i s x ( t ) ) ,
where [ x ( t ) , Λ i ( t ) ] are the solutions to the following Riccati function:
d Λ i ( t ) d t = 2 ω i d i s [ u i * ( t ) x ( t ) ] e r t Λ i ( t ) δ ,
d x ( t ) d t = i = 1 N α i u i * ( t ) + δ x ( t ) ,
In Equation (9), the allocated spectrum resource for SU i is a linear function of the system state x ( t ) , and affected by the unit spectrum price π p controlled by the PU. The SU should make decision for the spectrum from the PU based on the available system capacity x ( t ) at time t and should consider the influence of unit resource price π p .
Proof. 
For the open loop equilibrium, Pontryagin’s maximum principle can be used as the necessary condition to find the optimal strategies. The Hamiltonian function of the SU is given by Equation (6), and take the derivative of the Hamiltonian function yields:
u i * ( t ) = ( 2 ω i d i s + 2 ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( α i Λ i e r t + ω i s p β i π p 2 ω i d i s x ( t ) ) ,
which is the optimal strategy for the SUs under the open loop condition. Based on Equation (12), we can allocate the spectrum resource to SUs for the proposed game durations [ t 0 , T ] .  □

3.2. Feedback Nash Equilibrium under Finite Horizon

In the open loop Nash equilibrium, the system state is not known by all the game players. The game players only know the initial system state. The SUs make decisions on the optimal allocated resource based on the time instant s and the initial system state x ( 0 ) . Next, we try to find the feedback strategies when the system state is known by the SUs. The optimal solutions to the proposed game under feedback condition depend on the current time and current system state. In order to have the feedback solutions, some definitions are needed.
Definition 3.
The allocated spectrum resource u i * ( t , x ) from PU to SU is optimal under feedback condition if the following inequality holds for all control variables u i ( t , x )   u i * ( t , x ) in the set of admissible spectrum U i :
J i ( t , u i * ( t , x ) ) J i ( t , u i ( t , x ) ) .
Theorem 3.
The allocated spectrum resource u i * ( t , x ) provides a feedback Nash equilibrium to the proposed resource allocation game in Equations (3) and (4) if there are continuously differentiable functions V i ( t , x ) , satisfying the following set of partial differential equations:
d V i ( t , x ) d t = min u i ( t , x ) U i { U i ( t , x ) e r t + d V i ( t , x ) d x d x d t } ,
V i ( T , x ( T ) ) = Φ i ( x ( T ) ) ,
where V i ( t , x ) is the game equilibrium payoff of SU i at time t [ t 0 , T ] with the system state being x , which is called value function of SU i in the proposed resource allocation problem.
Definition 4.
The value function of each SU under feedback control can be given as follows:
V i ( t , x ) = min u i { t T [ ω i s p U i s p ( t ) + ω i e h U i e h ( t ) + ω i d i s U i d i s ( t ) ] e r s d s + Φ i ( x ( T ) ) } = t T [ ω i s p U i s p ( s , u i * ( s , x ) , x * ( s ) ) + ω i e h U i e h ( s , u i * ( s , x ) , x * ( s ) ) + ω i d i s U i d i s ( t , u i * ( s , x ) , x * ( s ) ) ] e r s d t + Φ i ( x ( T ) ) ,
and satisfying the boundary condition:
V i ( T , x ( T ) ) = Φ i ( x ( T ) ) .
Given another resource allocation strategy u i u i * , with the corresponding system state x , then we can have the following equations:
U i ( t , u i , x ) e r t + d V i ( t , u i , x ) d x d x d t + d V i ( t , u i , x ) d t 0 ,
and:
U i ( t , u i * , x * ) e r t + d V i ( t , u i * , x * ) d x d x d t + d V i ( t , u i * , x * ) d t = 0 .
Integrating the above expressions from t 0 to T , we obtain:
t 0 T U i ( s , u i , x ) e r s d s + V ( T , x ( T ) ) V ( t 0 , x 0 ) 0 ,   and t 0 T U i ( s , u i * , x * ) e r s d s + V ( T , x * ( T ) ) V ( t 0 , x 0 ) = 0 .
Performing the indicated minimization in Equations (14) and (15) yields:
u i * ( t ) = ( 2 ω i d i s + 2 ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( α i d V i ( t , x ) d x e r t + ω i s p β i π p 2 ω i d i s x ( t ) ) .
Substituting Equation (18) into Equations (16) and (17) and solving Equations (16) and (17), one obtains:
V i ( t , x ) = e r t ( 1 2 A i ( t ) x 2 + B i ( t ) x + C i ( t ) ) ,
where A i ( t ) , B i ( t ) and C i ( t ) satisfy the following differential equations:
1 2 d A i ( t ) d t + 1 2 r A i ( t ) = ( ω i d i s + ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( α i A i ( t ) 2 ω i d i s ) ( 3 2 ω i d i s 1 4 α i A i ( t ) ) + ω i d i s + δ A i ( t ) + A i ( t ) j = 1 N α j ( 2 ω j d i s 2 ω j e h ( 1 β j ) ε j 2 η j 1 ) 1 ( α j A j ( t ) 2 ω j d i s )
d B i ( t ) d t + r B i ( t ) = δ B i ( t ) 1 2 ω i s p β i π p ( ω i d i s + ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( α i A i ( t ) 2 ω i d i s ) + 1 2 α i A i ( t ) ( ω i d i s + ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( α i B i ( t ) + ω i s p β i π p ) + A i ( t ) j = 1 N α j ( 2 ω j d i s 2 ω j e h ( 1 β j ) ε j 2 η j 1 ) 1 ( α j B j ( t ) + ω j s p β j π p ) + B i ( t ) j = 1 N α j ( 2 ω j d i s 2 ω j e h ( 1 β j ) ε j 2 η j 1 ) 1 ( α j A j ( t ) 2 ω j d i s ) .
d C i ( t ) d t + r C i ( t ) = 1 4 ( ω i d i s + ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( ( α i B i ( t ) ) 2 ( ω i s p β i π p ) 2 ) + B i ( t ) j = 1 N α j ( 2 ω i d i s 2 ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( α j B j ( t ) + ω i s p β i π p ) .
Theorem 4.
The optimal resource allocation strategy of SU i under the feedback control situation is given by:
u i * ( t ) = ( 2 ω i d i s + 2 ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( α i ( A i ( t ) x + B i ( t ) ) + ω i s p β i π p 2 ω i d i s x ( t ) ) ,
where A i ( t ) and B i ( t ) are the solutions of the differential equations given by Equations (20)–(22).

3.3. Feedback Nash Equilibrium under Infinite Horizon

We now turn the proposed game to the infinite-horizon autonomous game with a constant discount factor. In this subsection, the observing time for the differential game is infinity. The objective function and system state function, which are given by Equations (3) and (4), are both non-autonomous. The game problem in Equations (3) and (4) is changed to a problem with infinite horizon as follows:
J i ( t ) = min u i ( t ) { t 0 U i ( t ) e r t d t } = min u i ( t ) { t 0 [ ω i s p U i s p ( t ) + ω i e h U i e h ( t ) + ω i d i s U i d i s ( t ) ] e r t d t } ,
subject to:
d x ( t ) = [ i = 1 N α i u i ( t ) + δ x ( t ) ] d t .
Under the infinite horizon, the solutions are independent of time-instant and dependent only on the system state at the starting time. A feedback solution for the infinite horizon game in Equations (24) and (25) can be characterized as follows.
Theorem 5.
The allocated spectrum resource u i * ( t , x ) provides a feedback Nash equilibrium to the proposed resource allocation game in Equations (24) and (25) if there are functions W i ( t , x ) , satisfying the following set of partial differential equations:
r W i ( x ) = min u i ( x ) U i { ω i s p U i s p + ω i e h U i e h + ω i d i s U i d i s + d W i ( x ) d x d x d t } .
Theorem 6.
The optimal resource allocation strategy of SU i is given by:
u i * = ( 2 ω i d i s + 2 ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( d W i ( x ) d x α i + ω i s p β i π p 2 ω i d i s x ) ,
Proof. 
Performing the indicated minimization in Equation (27), we can obtain:
u i * = ( 2 ω i d i s + 2 ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( d W i ( x ) d x α i + ω i s p β i π p 2 ω i d i s x ) .
Incorporating the solution u i * into Equations (24) and (25), and solving the equations, we can obtain:
W i ( x ) = 1 2 r W A i ( x ) x 2 + r W B i ( x ) x + r W C i ( x ) ,
where W A i ( x ) , W B i ( x ) , and W C i ( x ) satisfy the following equations:
1 2 r W A i ( x ) = ω i d i s + δ W A i ( x ) + 1 4 ( ω i d i s + ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( ( α i W A i ( x ) ) 2 ( 2 ω i d i s ) 2 ) + W A i ( x ) j = 1 N α j ( 2 ω j d i s 2 ω j e h ( 1 β j ) ε j 2 η j 1 ) 1 ( α j W A j ( x ) 2 ω j d i s )
r W B i ( x ) = δ W B i ( x ) + 1 2 α i W B i ( x ) ( ω i d i s + ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( α i W A i ( x ) 2 ω i d i s ) + ω i d i s ( ω i d i s + ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( α i W B i ( x ) + ω i s p β i π p ) + W A i ( x ) j = 1 N α j ( 2 ω j d i s 2 ω j e h ( 1 β j ) ε j 2 η j 1 ) 1 ( α j W B j ( x ) + ω j s p β j π p ) + W B i ( x ) j = 1 N α j ( 2 ω i d i s 2 ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( α j W A j ( x ) 2 ω j d i s )
r W C i ( x ) = 1 4 ( ω i d i s + ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( ( α i W B i ( x ) ) 2 ( ω i s p β i π p ) 2 ) + W B i ( x ) j = 1 N α j ( 2 ω i d i s 2 ω i e h ( 1 β i ) ε i 2 η i 1 ) 1 ( α j W B j ( x ) + ω j s p β j π p )

4. Numerical Simulations

In this section, a series of numerical simulation experiments have been done using a mathematical software named MATLAB, version R2016a, to show the optimal strategy’s change over time about each of the SUs’ resource leased through the differential game model formulated and solved above. In the next portion, we comprehensively analyze the differential game model open loop Nash equilibrium solution and feedback Nash equilibrium solution to SUs about the spectrum band resource leased. In the simulations, for both the open loop and feedback situation, three SUs are chosen to make the simulation environment, and to show the dynamic changing process of the system strategy based on the proposed differential game model. All the differential game model simulation parameters that are used in the experiment process are shown in Table 1, where we see that there are some parameters (discount rate r, the unit price π p that the PU appointed to lease their spectrum resource, the spectrum loss rate δ during the process of spectrum leasing) that are the same for the three users; on the one hand, to simplify the simulations, we consider some special scenarios.
Firstly, we analyze the open loop solution and feedback solution u i * ( t ) of the differential game model that are formulated in this paper, and from Section 3, we have got the optimal resource allocation strategy expression of SU about the open loop Nash equilibrium and feedback Nash equilibrium. Through the optimal strategy expression and the parameters’ setting, we get the strategy simulation figure, and via the simulation in Figure 3a and Figure 4a, we see that the changing of the resource allocation strategy u i * ( t ) over time is decreasing gradually and stabilizing to a certain value in both the open loop and feedback solution simulation figure, which satisfies the actual fact that we know u i * ( t ) represents the spectrum resource that SU i leased from the PU at t instantaneous, and in the information transmission stage, the leased spectrum and the harvested energy are used by the SUs to complete certain information transmission tasks; as time goes by, some information transmission tasks may tend to be finished or stable, which results in that fewer spectrum resources may be needed with time than the previous moment, so the spectrum resource that needs to be leased from the PU may decrease gradually, and with transmission tasks finished or tending toward a stable state, some permanent spectrum resource may just be needed to maintain the message transmission status, so the curve trend is stable near to a certain value.
Secondly, we analyze the variation of x ( t ) over time under the optimal resource allocation strategy u i * ( t ) in both the open loop and feedback solution situation; from Formula (4) and same as with the analysis description of u i * ( t ) in the above, we know x(t) represents the spectrum resource capacity that the PU wants to rent externally at t instantaneous and the variations of x(t) are relevant to not only the optimal strategy u i * ( t ) but also x(t) itself. Here, through Formula (4), the optimal strategy u i * ( t ) , and the parameters’ setting, we get the changing simulation figure about x ( t ) , Figure 3b that represents the change of x ( t ) with time under an open loop solution, and Figure 4b that represents the change of x ( t ) with time under a feedback solution situation. In addition, from Figure 3b and Figure 4b, we see that the trend of x ( t ) over time gradually decreases and stabilizes to a certain value, which satisfies the actual fact that, on the one side, with the information transmission tasks near to being finished, the resource of the spectrum band that the SU wants to lease from the PU may be reduces, which result in the consequence that the resource the PU wants to rent outside may be depressed to achieve the maximum of resource utilization. On the other side of the shield, long-term rental resources may cause the reduction of the spectrum used by the PU, which may bring about the reduction of its work efficiency to the PU; therefore, the spectrum that the PU rents externally may decrease over time. The optimal solutions in the feedback situation under an infinite horizon are also analyzed, which are given in Figure 5. Based on the results given in Figure 5, we can find that the system state will be changed with the time varying, and the optimal solutions would be changed, and there will be more optimal solutions for the users to choose.

5. Conclusions

In this article, the secondary users that have the energy harvest function can achieve the information transmission task via its stored energy and the spectrum resource leased from the PU. The act of leasing a spectrum resource brings about a cost increase for SUs. To minimize the cost, firstly, we formulated a differential game model to represent it mathematically and figuratively. Secondly, by solving the game model, the open loop Nash equilibrium solutions and the feedback Nash equilibrium solutions were obtained, which illustrates the fact that optimal resource allocation strategies for SUs exist. Finally, a certain number of numerical simulations was done to verify the correctness of the differential game model.

Author Contributions

H.X. conceived the main idea and the differential game theory model; all authors contributed to data analysis and wrote the paper.

Funding

This work was supported by the National Science Foundation Project of China (Nos. 61971032) and Fundamental Research Funds for the Central Universities (No. FRF-TP-18-008A3).

Acknowledgments

All of the authors would like to thank the editor and all the anonymous reviewers for their valuable suggestions that improved the quality of the article.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Mishra, D.P.; Kumar, S.; Ashu, A. Internet of Things: A Survey on Enabling Technologies, Application and Standardization. In Proceedings of the 3rd International Conference on Internet of Things and Connected Technologies (ICIoTCT), Jaipur, India, 26–27 March 2018. [Google Scholar]
  2. Hui, H.; Zhou, C.; Xu, S.; Lin, F. A Novel Secure Data Transmission Scheme in Industrial Internet of Things. China Commun. 2020, 17. [Google Scholar]
  3. Dener, M. Developing a Low-Cost Security System with Wireless Sensor Networks. In Proceedings of the 2018 International Conference on Advances in Computing and Communication Engineering (ICACCE), Paris, France, 20–23 June 2018; pp. 69–73. [Google Scholar] [CrossRef]
  4. Das, A.; Das, N. Cooperative Cognitive Radio for Wireless Opportunistic Networks. In Proceedings of the 2019 11th International Conference on Communication Systems & Networks (COMSNETS), Bengaluru, India, 7–11 January 2019; pp. 574–576. [Google Scholar] [CrossRef]
  5. Kakalou, I.; Psannis, K.E. Sustainable and Efficient Data Collection in Cognitive Radio Sensor Networks. IEEE Trans. Sustain. Comput. 2018, 4, 29–38. [Google Scholar] [CrossRef]
  6. Özger, M.; Alagoz, F.; Akan, O.B. Clustering in Multi-Channel Cognitive Radio Ad Hoc and Sensor Networks. IEEE Commun. Mag. 2018, 56, 156–162. [Google Scholar] [CrossRef]
  7. Guo, J.; Jafarkhani, H. Movement-Efficient Sensor Deployment in Wireless Sensor Networks With Limited Communication Range. IEEE Trans. Wirel. Commun. 2019, 18, 3469–3484. [Google Scholar] [CrossRef]
  8. Lakshmi, P.S.; Jibukumar, M.G.; Neenu, V.S. Network lifetime enhancement of multi-hop wireless sensor network by RF energy harvesting. In Proceedings of the 2018 International Conference on Information Networking (ICOIN), Chiang Mai, Thailand, 10–12 January 2018; pp. 738–743. [Google Scholar]
  9. Li, L.; Xu, Y.; Zhang, Z.; Yin, J.; Chen, W.; Han, Z. A prediction-based charging policy and interference mitigation approach in the wireless powered Internet of Things. IEEE J. Sel. Areas Commun. 2019, 37, 439–451. [Google Scholar] [CrossRef]
  10. Zhang, D.; Awad, M.K.; Zhou, H.; Shen, X.S.; Chen, Z.; Zhang, N. Utility-optimal Resource Management and Allocation Algorithm for Energy Harvesting Cognitive Radio Sensor Networks. IEEE J. Sel. Areas Commun. 2016, 34, 3552–3565. [Google Scholar] [CrossRef]
  11. Ren, J.; Zhang, Y.; Deng, R.; Zhang, N.; Zhang, D.; Shen, X. Joint channel access and sampling rate control in energy harvesting cognitive radio sensor networks. IEEE Trans. Emerg. Top. Comput. 2019, 7, 149–161. [Google Scholar] [CrossRef]
  12. Ahmad, A.; Ahmad, S.; Rehmani, M.H.; Hassan, N.U. A Survey on Radio Resource Allocation in Cognitive Radio Sensor Networks. IEEE Commun. Surv. Tutor. 2015, 17, 888–917. [Google Scholar] [CrossRef]
  13. Chriki, A.; Touati, H.; Snoussi, H.; Kamoun, F. Centralized Cognitive Radio Based Frequency Allocation for UAVs Communication. In Proceedings of the 2019 15th International Wireless Communications & Mobile Computing Conference (IWCMC), Tangier, Morocco, 24–28 June 2019; pp. 1674–1679. [Google Scholar] [CrossRef]
  14. Zhao, Y.; Peng, M.; Li, H. A channel bonding scheme with packet dropping mechanism in centralized cognitive radio networks. In Proceedings of the 2018 Chinese Control and Decision Conference (CCDC), Shenyang, China, 9–11 June 2018; pp. 1548–1551. [Google Scholar] [CrossRef]
  15. Aggarwal, M.; Velmurugan, T.; Karuppiah, M.; Hassan, M.M.; Almogren, A.; Ismail, W.N.; Nagy, W. Probability-Based Centralized Device for Spectrum Handoff in Cognitive Radio Networks. IEEE Access 2019, 7, 26731–26739. [Google Scholar] [CrossRef]
  16. Salama, A.M.; Samy, I.; el Shafie, A.; Mohamed, A.; Khattab, T. Centralized and Distributed Cognitive Relay-Selection Schemes for SWIPT Cognitive Networks. IEEE Trans. Commun. 2019, 67, 7431–7443. [Google Scholar] [CrossRef]
  17. Sachan, A.; Nigam, S.; Bajpai, A. An Energy Efficient Virtual-MIMO Communication for Cluster Based Cooperative Wireless Sensor Network. In Proceedings of the 2018 9th International Conference on Computing, Communication and Networking Technologies (ICCCNT), Bangalore, India, 10–12 July 2018; pp. 1–6. [Google Scholar] [CrossRef]
  18. Wang, C.; Li, J.; Yang, Y.; Ye, F. Combining Solar Energy Harvesting with Wireless Charging for Hybrid Wireless Sensor Networks. IEEE Trans. Mob. Comput. 2018, 17, 560–576. [Google Scholar] [CrossRef]
  19. Salout, N.; Awin, F.; Abde-Raheem, E.; Tepe, K. Combined Fusion Schemes for Cluster-Based Spectrum Sensing in Cognitive Radio Networks. In Proceedings of the 2018 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT), Louisville, KY, USA, 6–8 December 2018; pp. 218–222. [Google Scholar]
  20. Moghaddam, S.S.; Shirvanimoghaddam, M.; Habibzadeh, A. Clustering-based Handover and Resource Allocation Schemes for Cognitive Radio Heterogeneous Networks. In Proceedings of the 2018 28th International Telecommunication Networks and Applications Conference (ITNAC), Sydney, NSW, Australia, 21–23 November 2018; pp. 1–6. [Google Scholar]
  21. Li, J.; Zhao, H.; Wei, J.; Ma, D.; Zhou, L. Sender-Jump Receiver-Wait: A Simple Blind Rendezvous Algorithm for Distributed Cognitive Radio Networks. IEEE Trans. Mob. Comput. 2018, 17, 183–196. [Google Scholar] [CrossRef]
  22. Lu, Y.; Duel-Hallen, A. A Sensing Contribution-Based Two-Layer Game for Channel Selection and Spectrum Access in Cognitive Radio Ad-hoc Networks. IEEE Trans. Wirel. Commun. 2018, 17, 3631–3640. [Google Scholar] [CrossRef]
  23. Smith, P.J.; Senanayake, R.; Dmochowski, P.A.; Evans, J.S. Novel distributed spectrum sensing techniques for cognitive radio networks. In Proceedings of the 2018 IEEE Wireless Communications and Networking Conference (WCNC), Barcelona, Spain, 15–18 April 2018; pp. 1–6. [Google Scholar]
  24. Lee, H.S.; Ahmed, M.E.; Kim, D.I. Optimal spectrum sensing policy in RF-powered cognitive radio networks. IEEE Trans. Veh. Technol. 2018, 67, 9557–9570. [Google Scholar] [CrossRef]
  25. Mishra, N.; Kundu, S.; Mondal, S.; Roy, S.D. Cognitive Machine to Machine Communication with Energy Harvesting in IoT networks. In Proceedings of the 2019 11th International Conference on Communication Systems & Networks (COMSNETS), Bengaluru, India, 7–11 January 2019; pp. 672–677. [Google Scholar]
  26. El-Malek, A.H.A.; Aboulhassan, M.A.; Salhab, A.M.; Zummo, S.A.; El-Malek, A.H.A. Performance Analysis and Power Optimization for Spectrum-Sharing Mixed RF/FSO Relay Networks with Energy Harvesting. IEEE Photonics J. 2019, 11, 1–17. [Google Scholar] [CrossRef]
  27. Zhao, D.; Cui, Y.; Tian, H.; Zhang, P. A Novel Information and Energy Cooperation Transmission Scheme in Cognitive Spectrum Sharing-Based D2D Communication Systems. IEEE Access 2019, 7, 72316–72328. [Google Scholar] [CrossRef]
  28. Li, Q.; Zhang, X.; Pandharipande, A.; Zhang, J.; Zhanga, J. Cooperative Spectrum Sharing on SWIPT-Based DF Relay: An Energy-Aware Retransmission Approach. IEEE Access 2019, 7, 120802–120816. [Google Scholar] [CrossRef]
  29. Zhang, Z.; Lu, Y.; Huang, Y. Simultaneous Wireless Information and Power Transfer for Dynamic Cooperative Spectrum Sharing Networks. IEEE Access 2019, 7, 823–834. [Google Scholar] [CrossRef]
  30. Marko, H.; Aarne, M.; Marina, E.; Marja, M.; Juha, K.; Jaakko, O.; Jaakko, S.; Reijo, E.; Roger, B.; Dennis, R. Spectrum occupancy measurements: A survey and use of interference maps. IEEE Commun. Surv. Tutor. 2016, 18, 2386–2414. [Google Scholar]
Figure 1. Cognitive radio wireless sensor network (CRWSN) system model.
Figure 1. Cognitive radio wireless sensor network (CRWSN) system model.
Sensors 19 05115 g001
Figure 2. Harvest-then-transmit mode.
Figure 2. Harvest-then-transmit mode.
Sensors 19 05115 g002
Figure 3. (a) Variation of u i * ( t ) with time t for the open loop solution; (b) variation of x ( t ) with time t for the open loop solution.
Figure 3. (a) Variation of u i * ( t ) with time t for the open loop solution; (b) variation of x ( t ) with time t for the open loop solution.
Sensors 19 05115 g003
Figure 4. (a) Variation of u i * ( t ) with time t for the feedback solution; (b) variation of x ( t ) with time t for the feedback solution.
Figure 4. (a) Variation of u i * ( t ) with time t for the feedback solution; (b) variation of x ( t ) with time t for the feedback solution.
Sensors 19 05115 g004
Figure 5. (a) Variation of u i * ( t ) for the feedback solution under infinite horizon at t = 10; (b) variation of u i * ( t ) for the feedback solution under infinite horizon at the end of the game.
Figure 5. (a) Variation of u i * ( t ) for the feedback solution under infinite horizon at t = 10; (b) variation of u i * ( t ) for the feedback solution under infinite horizon at the end of the game.
Sensors 19 05115 g005
Table 1. Simulation parameters setting in the differential game model.
Table 1. Simulation parameters setting in the differential game model.
Parametersi ω i d i s ω i e h ω i s p β i ε i η i r π p δ α i Φ i T
Value10.40.80.10.10.30.0280.150.2−0.15−0.60.1100
Value20.50.40.30.40.50.030.150.2−0.15−0.50.6100
Value30.60.30.60.70.70.0340.150.2−0.15−0.40.8100

Share and Cite

MDPI and ACS Style

Xu, H.; Gao, H.; Zhou, C.; Duan, R.; Zhou, X. Resource Allocation in Cognitive Radio Wireless Sensor Networks with Energy Harvesting. Sensors 2019, 19, 5115. https://doi.org/10.3390/s19235115

AMA Style

Xu H, Gao H, Zhou C, Duan R, Zhou X. Resource Allocation in Cognitive Radio Wireless Sensor Networks with Energy Harvesting. Sensors. 2019; 19(23):5115. https://doi.org/10.3390/s19235115

Chicago/Turabian Style

Xu, Haitao, Hongjie Gao, Chengcheng Zhou, Ruifeng Duan, and Xianwei Zhou. 2019. "Resource Allocation in Cognitive Radio Wireless Sensor Networks with Energy Harvesting" Sensors 19, no. 23: 5115. https://doi.org/10.3390/s19235115

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop