TY - GEN
T1 - Risk-Aware Reinforcement Learning Based Federated Learning Framework for Io V
AU - Chen, Yuhan
AU - Liu, Zhibo
AU - Lu, Xiaozhen
AU - Xiao, Liang
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - Federated learning helps protect data privacy for Internet of vehicles (Io V) by selecting a number of participated nodes but suffers from performance degradation such as low model training accuracy in the highly dynamic and large-scale Io V systems under selfish attacks. In this paper, we propose a risk-aware reinforcement learning based federated learning framework against selfish attacks for Io V,which jointly optimizes the training policy (i.e., the selection of participated vehicles and the corresponding local training data size) based on the state including the global model training accuracy, local model quality, training latency, data rate, and participation rate. By designing a punishment function to evaluate the immediate risk of each choosing training policy, this scheme avoids risky policies that result in extremely low training accuracy and high training latency to satisfy the requirements of local tasks such as the quality of service requirements. An evaluated neural network involved fully connected layers is designed to fast extract the global and local training features and thus accelerate the convergence speed. Experimental results based on both the MNIST and CIFAR-10 datasets verify that our scheme outperforms the benchmarks with higher training accuracy and less training latency.
AB - Federated learning helps protect data privacy for Internet of vehicles (Io V) by selecting a number of participated nodes but suffers from performance degradation such as low model training accuracy in the highly dynamic and large-scale Io V systems under selfish attacks. In this paper, we propose a risk-aware reinforcement learning based federated learning framework against selfish attacks for Io V,which jointly optimizes the training policy (i.e., the selection of participated vehicles and the corresponding local training data size) based on the state including the global model training accuracy, local model quality, training latency, data rate, and participation rate. By designing a punishment function to evaluate the immediate risk of each choosing training policy, this scheme avoids risky policies that result in extremely low training accuracy and high training latency to satisfy the requirements of local tasks such as the quality of service requirements. An evaluated neural network involved fully connected layers is designed to fast extract the global and local training features and thus accelerate the convergence speed. Experimental results based on both the MNIST and CIFAR-10 datasets verify that our scheme outperforms the benchmarks with higher training accuracy and less training latency.
KW - IoV
KW - federated learning
KW - reinforcement learning
KW - selfish attacks
UR - http://www.scopus.com/inward/record.url?scp=85193273219&partnerID=8YFLogxK
U2 - 10.1109/WCNC57260.2024.10571032
DO - 10.1109/WCNC57260.2024.10571032
M3 - Conference contribution
AN - SCOPUS:85193273219
T3 - IEEE Wireless Communications and Networking Conference, WCNC
BT - 2024 IEEE Wireless Communications and Networking Conference, WCNC 2024 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th IEEE Wireless Communications and Networking Conference, WCNC 2024
Y2 - 21 April 2024 through 24 April 2024
ER -