標題: Reinforcement Learning based Fragment-Aware Scheduling for High Utilization HPC Platforms
作者: Chen, Lung-Pin
Wu, I-Chen
Chang, Yen-Ling
資訊工程學系
Department of Computer Science
關鍵字: High-performance computing;malleable task;reinforcement learning;scheduling
公開日期: 1-Jan-2019
摘要: Due to high capacity and complex scheduling activities, a HPC platform often creates resource fragments with low usability. This paper develops a novel fragment-aware scheduling approach which improves system utilization by fitting elastic lightweight tasks to the fragments of resources dynamically. The new approach employs a threshold to determine the balancing factor between the length of tasks and the degree of granularity of the resource fragments. We employ the PPO reinforcement learning approach to train a neural network that can compute the threshold precisely. With the threshold that is adaptive to the changing system states, the PPO-based scheduler is able to utilize the idle resources and maximize the execution success rate of the tasks.
URI: http://hdl.handle.net/11536/154060
ISBN: 978-1-7281-4666-9
ISSN: 2376-6816
期刊: 2019 INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI)
起始頁: 0
結束頁: 0
Appears in Collections:Conferences Paper