跳至主導覽 跳至搜尋 跳過主要內容

Optimizing job scheduling by using broad learning to predict execution times on HPC clusters

  • Zhengxiong Hou
  • , Hong Shen
  • , Qiying Feng
  • , Zhiqi Lv
  • , Junwei Jin
  • , Xingshe Zhou
  • , Jianhua Gu

研究成果: Article同行評審

7 引文 斯高帕斯(Scopus)

摘要

Small and middle size high-performance computing clusters are very popular for various applications. How to utilize the accumulated log data generated in the past to optimize job scheduling using machine learning techniques is an interesting problem. Most of the current work use the common machine learning algorithms, such as the multivariate linear regression and polynomial model, to predict job runtime and optimize job scheduling. They either ignore the interference among job features or require a high time overhead for improving the prediction accuracy. In this paper, we propose to implement and improve broad learning algorithm for predicting the execution times of new coming jobs more accurately and efficiently. The experimental results showed that the proposed method can obtain high prediction accuracy with a negligible time overhead. And the predicted job execution time can help improve the efficiency of job scheduling and HPC systems.

原文English
頁(從 - 到)365-377
頁數13
期刊CCF Transactions on High Performance Computing
6
發行號4
DOIs
出版狀態Published - 8月 2024
對外發佈

指紋

深入研究「Optimizing job scheduling by using broad learning to predict execution times on HPC clusters」主題。共同形成了獨特的指紋。

引用此