Joint channel allocation and transmit power control for underlay EH‐CRNs: A clustering‐based multi‐agent DDPG approach

To address the concerns of energy supply and spectrum scarcity for wireless devices, energy harvesting cognitive radio networks have been proposed. To improve spectrum utilization, secondary users (SUs) access the licensed spectrum in underlay mode, which may cause severe interference to primary use...

Celý popis

Uloženo v:
Podrobná bibliografie
Vydáno v:IET communications Ročník 18; číslo 19; s. 1574 - 1587
Hlavní autoři: Liu, Xiaoying, Kuang, Xinyu, Li, Zefu, Zheng, Kechen
Médium: Journal Article
Jazyk:angličtina
Vydáno: Wiley 01.12.2024
Témata:
ISSN:1751-8628, 1751-8636
On-line přístup:Získat plný text
Tagy: Přidat tag
Žádné tagy, Buďte první, kdo vytvoří štítek k tomuto záznamu!
Popis
Shrnutí:To address the concerns of energy supply and spectrum scarcity for wireless devices, energy harvesting cognitive radio networks have been proposed. To improve spectrum utilization, secondary users (SUs) access the licensed spectrum in underlay mode, which may cause severe interference to primary users and SUs. The focus is on the underlay energy harvesting cognitive radio networks with multiple pairs of SUs, and formulate the long‐term secondary throughput maximization problem as a mixed‐integer non‐linear programming problem. As traditional approaches could hardly solve the mixed‐integer non‐linear programming problem well, a centralized deep deterministic policy gradient (C‐DDPG) approach is proposed that achieves satisfactory throughput performance. To reduce the computational complexity of C‐DDPG, we further propose a clustering‐based multi‐agent DDPG (CMA‐DDPG) approach that combines the advantages of the centralized deep reinforcement learning approach and the distributed deep reinforcement learning approach. In the CMA‐DDPG, a novel interference‐based clustering algorithm is proposed, which partitions the SUs that cause severe mutual interference into one cluster, and the sizes of state space and action space are smaller than those in C‐DDPG. Numerical results validate the superiority of the proposed approaches in terms of the throughput and outage probability, and validate the clustering performance of the interference‐based clustering algorithm in terms of the outage probability of the secondary network. This article focuses on the underlay energy harvesting cognitive radio networks with multiple pairs of secondary users, and formulate the long‐term secondary throughput maximization problem as a mixed‐integer non‐linear programming problem. Authors propose a centralized deep deterministic policy gradient (C‐DDPG) approach that achieves satisfactory throughput performance. To reduce the computational complexity of C‐DDPG, authors further propose a clustering‐based multi‐agent DDPG approach that combines the advantages of the centralized deep reinforcement learning approach and the distributed deep reinforcement learning approach.
ISSN:1751-8628
1751-8636
DOI:10.1049/cmu2.12852