A survey on scheduling techniques in computing and network convergence

S Tang, Y Yu, H Wang, G Wang, W Chen… - … Surveys & Tutorials, 2023 - ieeexplore.ieee.org
The computing demand for massive applications has led to the ubiquitous deployment of
computing power. This trend results in the urgent need for higher-level computing resource …

Deep reinforcement agent for scheduling in HPC

Y Fan, Z Lan, T Childers, P Rich… - 2021 IEEE …, 2021 - ieeexplore.ieee.org
Cluster scheduler is crucial in high-performance computing (HPC). It determines when and
which user jobs should be allocated to available system resources. Existing cluster …

GARLSched: Generative adversarial deep reinforcement learning task scheduling optimization for large-scale high performance computing systems

J Li, X Zhang, J Wei, Z Ji, Z Wei - Future Generation Computer Systems, 2022 - Elsevier
Efficient task scheduling has become increasingly complex as the number and type of tasks
proliferate and the size of computing resource grows in large-scale distributed high …

Mirage: Towards Low-interruption Services on Batch GPU Clusters with Reinforcement Learning

Q Ding, P Zheng, S Kudari, S Venkataraman… - Proceedings of the …, 2023 - dl.acm.org
Accommodating long-running deep learning (DL) training and inference jobs is challenging
on GPU clusters that use traditional batch schedulers, such as Slurm. Given fixed wall clock …

[HTML][HTML] A machine learning approach for an HPC use case: The jobs queuing time prediction

C Vercellino, A Scionti, G Varavallo, P Viviani… - Future Generation …, 2023 - Elsevier
Abstract High-Performance Computing (HPC) domain provided the necessary tools to
support the scientific and industrial advancements we all have seen during the last decades …

{BlockFlex}: Enabling Storage Harvesting with {Software-Defined} Flash in Modern Cloud Platforms

B Reidys, J Sun, A Badam, S Noghabi… - 16th USENIX Symposium …, 2022 - usenix.org
Cloud platforms today make efficient use of storage resources by slicing them among multi-
tenant applications on demand. However, our study discloses that the cloud storage is still …

A deep reinforcement learning approach to resource management in hybrid clouds harnessing renewable energy and task scheduling

J Zhao, MA Rodríguez, R Buyya - 2021 IEEE 14th International …, 2021 - ieeexplore.ieee.org
The use of cloud computing for delivering application services over the Internet has gained
rapid traction. Since the beginning of the COVID-19 global pandemic, the work from home …

[PDF][PDF] On the Study of Curriculum Learning for Inferring Dispatching Policies on the Job Shop Scheduling.

Z Iklassov, D Medvedev, RSO De Retana, M Takac - IJCAI, 2023 - ijcai.org
This paper studies the use of Curriculum Learning on Reinforcement Learning (RL) to
improve the performance of the dispatching policies learned on the Job-shop Scheduling …

Deep reinforcement learning for minimizing tardiness in parallel machine scheduling with sequence dependent family setups

B Paeng, IB Park, J Park - IEEE Access, 2021 - ieeexplore.ieee.org
Parallel machine scheduling with sequence-dependent family setups has attracted much
attention from academia and industry due to its practical applications. In a real-world …

Dras: Deep reinforcement learning for cluster scheduling in high performance computing

Y Fan, B Li, D Favorite, N Singh… - … on Parallel and …, 2022 - ieeexplore.ieee.org
Cluster schedulers are crucial in high-performance computing (HPC). They determine when
and which user jobs should be allocated to available system resources. Existing cluster …