TY - JOUR
T1 - Deep reinforcement learning for near-field wideband beamforming in STAR-RIS networks
AU - Wang, Ji
AU - Sun, Jiayi
AU - Fang, Wei
AU - Chen, Zhao
AU - Liu, Yue
AU - Liu, Yuanwei
N1 - Publisher Copyright:
© Zhejiang University Press 2024.
PY - 2024/12
Y1 - 2024/12
N2 - A simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) assisted multiuser near-field wideband communication system is investigated, in which a robust deep reinforcement learning (DRL) based algorithm is proposed to enhance the users’ achievable rate by jointly optimizing the active beamforming at the base station (BS) and passive beamforming at the STAR-RIS. To mitigate the beam split issue, the delay-phase hybrid precoding structure is introduced to facilitate wideband beamforming. Considering the coupled nature of the STAR-RIS phase-shift model, the passive beamforming design is formulated as a problem of hybrid continuous and discrete phase-shift control, and the proposed algorithm controls the high-dimensional continuous action through hybrid action mapping. Additionally, to address the issue of biased estimation encountered by existing DRL algorithms, a softmax operator is introduced into the algorithm to mitigate this bias. Simulation results illustrate that the proposed algorithm outperforms existing algorithms and overcomes the issues of overestimation and underestimation.
AB - A simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS) assisted multiuser near-field wideband communication system is investigated, in which a robust deep reinforcement learning (DRL) based algorithm is proposed to enhance the users’ achievable rate by jointly optimizing the active beamforming at the base station (BS) and passive beamforming at the STAR-RIS. To mitigate the beam split issue, the delay-phase hybrid precoding structure is introduced to facilitate wideband beamforming. Considering the coupled nature of the STAR-RIS phase-shift model, the passive beamforming design is formulated as a problem of hybrid continuous and discrete phase-shift control, and the proposed algorithm controls the high-dimensional continuous action through hybrid action mapping. Additionally, to address the issue of biased estimation encountered by existing DRL algorithms, a softmax operator is introduced into the algorithm to mitigate this bias. Simulation results illustrate that the proposed algorithm outperforms existing algorithms and overcomes the issues of overestimation and underestimation.
KW - Deep reinforcement learning
KW - Near-field beamforming
KW - Simultaneously transmitting and reflecting reconfigurable intelligent surface (STAR-RIS)
KW - TP391.4
KW - Wideband beam split
UR - http://www.scopus.com/inward/record.url?scp=86000529854&partnerID=8YFLogxK
U2 - 10.1631/FITEE.2400364
DO - 10.1631/FITEE.2400364
M3 - Article
AN - SCOPUS:86000529854
SN - 2095-9184
VL - 25
SP - 1651
EP - 1663
JO - Frontiers of Information Technology and Electronic Engineering
JF - Frontiers of Information Technology and Electronic Engineering
IS - 12
ER -