TY - JOUR
T1 - Joint Service Placement and Resource Allocation for Long-Term DNN Inference Accuracy in Dynamic MEC Networks
AU - Zhang, Zhihan
AU - Zhang, Tiankui
AU - Shi, Tianyi
AU - Wang, Yapeng
N1 - Publisher Copyright:
© 2025 IEEE. All rights reserved.
PY - 2025
Y1 - 2025
N2 - By leveraging deep neural networks (DNNs), mobile edge computing networks can integrate advanced intelligent computing capabilities. Due to temporally dynamic changes in wireless channel and service requesting distribution, the long-term inference accuracy of DNN services can be severely affected over time. To address the above challenge, this correspondence aims to maximize the long-term inference accuracy by jointly optimizing service placement, bandwidth allocation, and wireless device association. By modeling the relationship between data size and inference accuracy, we employ regression techniques to derive the fitting curve for the deployed services. A two-timescale long-term optimization problem is transformed into a series of subproblems using Lyapunov analysis. We propose an alternating optimization algorithm to tackle with the subproblems, in which the convex-concave procedure is utilized for bandwidth allocation, while the branch-and-bound method is employed for service placement. Moreover, a low-complexity penalty-based method is further developed for service placement. Simulation results show that the proposed methods outperform baselines in inference accuracy and system fairness.
AB - By leveraging deep neural networks (DNNs), mobile edge computing networks can integrate advanced intelligent computing capabilities. Due to temporally dynamic changes in wireless channel and service requesting distribution, the long-term inference accuracy of DNN services can be severely affected over time. To address the above challenge, this correspondence aims to maximize the long-term inference accuracy by jointly optimizing service placement, bandwidth allocation, and wireless device association. By modeling the relationship between data size and inference accuracy, we employ regression techniques to derive the fitting curve for the deployed services. A two-timescale long-term optimization problem is transformed into a series of subproblems using Lyapunov analysis. We propose an alternating optimization algorithm to tackle with the subproblems, in which the convex-concave procedure is utilized for bandwidth allocation, while the branch-and-bound method is employed for service placement. Moreover, a low-complexity penalty-based method is further developed for service placement. Simulation results show that the proposed methods outperform baselines in inference accuracy and system fairness.
KW - DNN inference
KW - MEC network
KW - dynamic resource allocation
KW - edge-end cooperation
KW - service placement
UR - https://www.scopus.com/pages/publications/105022657521
U2 - 10.1109/TVT.2025.3633791
DO - 10.1109/TVT.2025.3633791
M3 - Article
AN - SCOPUS:105022657521
SN - 0018-9545
JO - IEEE Transactions on Vehicular Technology
JF - IEEE Transactions on Vehicular Technology
ER -