A survey on scheduling techniques in computing and network convergence
S Tang, Y Yu, H Wang, G Wang, W Chen… - … Surveys & Tutorials, 2023 - ieeexplore.ieee.org
The computing demand for massive applications has led to the ubiquitous deployment of
computing power. This trend results in the urgent need for higher-level computing resource …
computing power. This trend results in the urgent need for higher-level computing resource …
Deep reinforcement agent for scheduling in HPC
Cluster scheduler is crucial in high-performance computing (HPC). It determines when and
which user jobs should be allocated to available system resources. Existing cluster …
which user jobs should be allocated to available system resources. Existing cluster …
GARLSched: Generative adversarial deep reinforcement learning task scheduling optimization for large-scale high performance computing systems
J Li, X Zhang, J Wei, Z Ji, Z Wei - Future Generation Computer Systems, 2022 - Elsevier
Efficient task scheduling has become increasingly complex as the number and type of tasks
proliferate and the size of computing resource grows in large-scale distributed high …
proliferate and the size of computing resource grows in large-scale distributed high …
Mirage: Towards Low-interruption Services on Batch GPU Clusters with Reinforcement Learning
Q Ding, P Zheng, S Kudari, S Venkataraman… - Proceedings of the …, 2023 - dl.acm.org
Accommodating long-running deep learning (DL) training and inference jobs is challenging
on GPU clusters that use traditional batch schedulers, such as Slurm. Given fixed wall clock …
on GPU clusters that use traditional batch schedulers, such as Slurm. Given fixed wall clock …
[HTML][HTML] A machine learning approach for an HPC use case: The jobs queuing time prediction
Abstract High-Performance Computing (HPC) domain provided the necessary tools to
support the scientific and industrial advancements we all have seen during the last decades …
support the scientific and industrial advancements we all have seen during the last decades …
{BlockFlex}: Enabling Storage Harvesting with {Software-Defined} Flash in Modern Cloud Platforms
Cloud platforms today make efficient use of storage resources by slicing them among multi-
tenant applications on demand. However, our study discloses that the cloud storage is still …
tenant applications on demand. However, our study discloses that the cloud storage is still …
A deep reinforcement learning approach to resource management in hybrid clouds harnessing renewable energy and task scheduling
The use of cloud computing for delivering application services over the Internet has gained
rapid traction. Since the beginning of the COVID-19 global pandemic, the work from home …
rapid traction. Since the beginning of the COVID-19 global pandemic, the work from home …
[PDF][PDF] On the Study of Curriculum Learning for Inferring Dispatching Policies on the Job Shop Scheduling.
This paper studies the use of Curriculum Learning on Reinforcement Learning (RL) to
improve the performance of the dispatching policies learned on the Job-shop Scheduling …
improve the performance of the dispatching policies learned on the Job-shop Scheduling …
Deep reinforcement learning for minimizing tardiness in parallel machine scheduling with sequence dependent family setups
B Paeng, IB Park, J Park - IEEE Access, 2021 - ieeexplore.ieee.org
Parallel machine scheduling with sequence-dependent family setups has attracted much
attention from academia and industry due to its practical applications. In a real-world …
attention from academia and industry due to its practical applications. In a real-world …
Dras: Deep reinforcement learning for cluster scheduling in high performance computing
Y Fan, B Li, D Favorite, N Singh… - … on Parallel and …, 2022 - ieeexplore.ieee.org
Cluster schedulers are crucial in high-performance computing (HPC). They determine when
and which user jobs should be allocated to available system resources. Existing cluster …
and which user jobs should be allocated to available system resources. Existing cluster …