Clustering based Probabilistic I/O Scheduling for Burst-Buffers Equipped HPC

Benbo Zha, Hong Shen, Hankz Hankui Zhuo, Zhijian Luo

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Modern High-Performance Computing (HPC) platforms usually consist of an intermediate high-throughput layer, Burst-Buffers (BBs), between computing nodes and underlying shared Parallel File System (PFS) to absorb the I/O bursts caused by concurrent I/O requests from different applications. As concurrent applications increase I/O demand, BBs may experience I/O contention due to its limited capacity. The existing probabilistic I/O scheduling method can schedule I/O under limited BBs' capacity, which can sense BBs' congestion via the Markov-Chain-based probability model. However, the probability model requires consistent I/O characteristics of applications, including similar I/O duration and longer application length, to obtain an accurate I/O load estimation. These consistency conditions do not often hold in realistic situations.In this paper, we proposed a probability I/O scheduling framework based on application clustering (PIOS) to eliminate the consistency requirement. The framework first clusters all applications by 1-D K-means according to their I/O phrase length. Next, the expected I/O workload of each cluster is calculated and then the BBs's capacity is partitioned according to the expected I/O workload. Finally, the probabilistic I/O scheduling is applied to each application cluster. The simulation results demonstrate our framework can adapt to inconsistency and show more efficiency.

Original languageEnglish
Title of host publicationProceedings - 2023 The 14th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2023
PublisherIEEE Computer Society
ISBN (Electronic)9798350371024
DOIs
Publication statusPublished - 2023
Event14th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2023 - Beijing, China
Duration: 24 Nov 202326 Nov 2023

Publication series

NameProceedings - International Symposium on Parallel Architectures, Algorithms and Programming, PAAP
ISSN (Print)2168-3034
ISSN (Electronic)2168-3042

Conference

Conference14th International Symposium on Parallel Architectures, Algorithms and Programming, PAAP 2023
Country/TerritoryChina
CityBeijing
Period24/11/2326/11/23

Keywords

  • Application clustering
  • Burst-buffering
  • High-performance computing
  • I/O scheduling

Fingerprint

Dive into the research topics of 'Clustering based Probabilistic I/O Scheduling for Burst-Buffers Equipped HPC'. Together they form a unique fingerprint.

Cite this