Optimus: an efficient dynamic resource scheduler for deep learning clusters

Y Peng, Y Bao, Y Chen, C Wu, C Guo - Proceedings of the Thirteenth …, 2018 - dl.acm.org
Deep learning workloads are common in today's production clusters due to the proliferation
of deep learning driven AI services (eg, speech recognition, machine translation). A deep …

Online task resource consumption prediction for scientific workflows

RF Da Silva, G Juve, M Rynge, E Deelman… - Parallel Processing …, 2015 - World Scientific
Estimates of task runtime, disk space usage, and memory consumption, are commonly used
by scheduling and resource provisioning algorithms to support efficient and reliable …

A new toolbox for scheduling theory

Z Scully - ACM SIGMETRICS Performance Evaluation Review, 2023 - dl.acm.org
Queueing delays are ubiquitous in many domains, including computer systems, service
systems, communication networks, supply chains, and transportation. Queueing and …

HFSP: bringing size-based scheduling to hadoop

M Pastorelli, D Carra, M Dell'Amico… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Size-based scheduling with aging has been recognized as an effective approach to
guarantee fairness and near-optimal system response times. We present HFSP, a scheduler …

PSBS: Practical size-based scheduling

M Dell'Amico, D Carra… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
Size-based schedulers have very desirable performance properties: optimal or near-optimal
response time can be coupled with strong fairness. Despite this, however, such systems are …

Trace-based analysis and prediction of cloud computing user behavior using the fractal modeling technique

S Chen, M Ghorbani, Y Wang… - … Congress on Big …, 2014 - ieeexplore.ieee.org
The problem of big data analytics is gaining increasing research interest because of the
rapid growth in the volume of data to be analyzed in various areas of science and …

Job scheduling without prior information in big data processing systems

Z Hu, B Li, Z Qin, RSM Goh - 2017 IEEE 37th international …, 2017 - ieeexplore.ieee.org
Job scheduling plays an important role in improving the overall system performance in big
data processing frameworks. Simple job scheduling policies, such as Fair and FIFO …

Scheduling with service-time information: The power of two priority classes

Y Chen, J Dong - arXiv preprint arXiv:2105.10499, 2021 - arxiv.org
Utilizing customers' service-time information, we study an easy-to-implement scheduling
policy with two priority classes. By carefully designing the classes, the two-class priority rule …

Flexible scheduling of distributed analytic applications

F Pace, D Venzano, D Carra… - 2017 17th IEEE/ACM …, 2017 - ieeexplore.ieee.org
This work addresses the problem of scheduling user-defined analytic applications, which we
define as high-level compositions of frameworks, their components, and the logic necessary …

Scheduling jobs with estimation errors for multi-server systems

R Mailach, DG Down - 2017 29th International Teletraffic …, 2017 - ieeexplore.ieee.org
When scheduling single server systems, Shortest Remaining Processing Time (SRPT)
minimizes the number of jobs in the system at every point in time. However, a major …