Fluid Antenna System Liberating Multiuser MIMO for ISAC via Deep Reinforcement Learning

The aim of this paper is to enhance the performance of an integrated sensing and communications (ISAC) system in the multiuser multiple-input multiple-output (MIMO) downlink in which a two-dimensional (2D) fluid antenna system (FAS) with multiple activated ports is employed at the base station (BS)...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on wireless communications Jg. 23; H. 9; S. 10879 - 10894
Hauptverfasser: Wang, Chao, Li, Guo, Zhang, Haibin, Wong, Kai-Kit, Li, Zan, Ng, Derrick Wing Kwan, Chae, Chan-Byoung
Format: Journal Article
Sprache:Englisch
Veröffentlicht: New York IEEE 01.09.2024
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Schlagworte:
ISSN:1536-1276, 1558-2248
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The aim of this paper is to enhance the performance of an integrated sensing and communications (ISAC) system in the multiuser multiple-input multiple-output (MIMO) downlink in which a two-dimensional (2D) fluid antenna system (FAS) with multiple activated ports is employed at the base station (BS) to maximize the sum-rate of the downlink users subject to a sensing constraint. The unique feature of this setup is that the locations of the antenna ports at the FAS can be optimized jointly with the precoding design to achieve a higher sum-rate. The required optimization problem is however NP-hard. To overcome this, we start by considering the perfect channel state information (CSI) scenario where all the port CSI is available. Deep reinforcement learning is utilized to build an end-to-end learning framework for the joint optimization problem. In particular, by fixing the activated ports, we adopt a primal-dual based learning algorithm to design a constraint-aware neural network for optimizing the ISAC precoder. Then, by using the neural precoding network to calculate the reward, we adopt the deep reinforcement learning algorithm to design the port selection and precoder jointly. An advantage actor and critic (A2C) algorithm is proposed to train the policy, in which the actor network uses the pointer network to learn the stochastic policy and the critic network adopts the Long Short-Term Memory (LSTM) encoder architecture to learn the expected reward from the observations. Afterwards, the partial CSI case is addressed, where we propose a masked autoencoder (MAE) induced channel extrapolation for predicting all the CSI to facilitate the joint design. Simulation results demonstrate the promising performance of using FAS for multiuser MIMO and also validate the proposed learning-based scheme.
Bibliographie:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1536-1276
1558-2248
DOI:10.1109/TWC.2024.3376800