On the convergence of malleability and the HPC PowerStack: exploiting dynamism in over-provisioned and power-constrained HPC systems

E Arima, AI Comprés, M Schulz - International Conference on High …, 2022 - Springer
Abstract Recent High-Performance Computing (HPC) systems are facing important
challenges, such as massive power consumption, while at the same time significantly under …

Maximizing system utilization via parallelism management for co-located parallel applications

Y Cho, CAC Guzman, B Egger - … of the 27th International Conference on …, 2018 - dl.acm.org
With an increasing number of cores and memory controllers in multiprocessor platforms, co-
location of parallel applications is gaining on importance. Key to achieve good performance …

nos-v: Co-executing hpc applications using system-wide task scheduling

D Álvarez, K Sala, V Beltran - 2024 IEEE International Parallel …, 2024 - ieeexplore.ieee.org
Future Exascale systems will feature massive parallelism, many-core processors and
heterogeneous architectures. In this scenario, it is increasingly difficult for HPC applications …

Intelligent colocation of HPC workloads

FV Zacarias, V Petrucci, R Nishtala, P Carpenter… - Journal of Parallel and …, 2021 - Elsevier
Many HPC applications suffer from a bottleneck in the shared caches, instruction execution
units, I/O or memory bandwidth, even though the remaining resources may be underutilized …

Dynamic co-scheduling driven by main memory bandwidth utilization

J Breitbart, S Pickartz, S Lankes… - 2017 IEEE …, 2017 - ieeexplore.ieee.org
Most applications running on supercomputers achieve only a fraction of a system's peak
performance. It has been demonstrated that the co-scheduling of applications can improve …

Prospects and challenges of virtual machine migration in HPC

S Pickartz, C Clauss, J Breitbart… - Concurrency and …, 2018 - Wiley Online Library
The continuous growth of supercomputers is accompanied by increased complexity of the
intra‐node level and the interconnection topology. Consequently, the whole software stack …

Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach

U Saroliya, E Arima, D Liu… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
GPU-based heterogeneous architectures are now commonly used in HPC clusters. Due to
their architectural simplicity specialized for data-level parallelism, GPUs can offer much …

M3at: Monitoring agents assignment model for data-intensive applications

V Kashansky, D Kimovski, R Prodan… - 2020 28th Euromicro …, 2020 - ieeexplore.ieee.org
Nowadays, massive amounts of data are acquired, transferred, and analyzed nearly in real-
time by utilizing a large number of computing and storage elements interconnected through …

Intelligent colocation of workloads for enhanced server efficiency

FV Zacarias, V Petrucci, R Nishtala… - 2019 31st …, 2019 - ieeexplore.ieee.org
Many server applications achieve only a fraction of their theoretical peak performance due to
bottlenecks in the shared caches, instruction execution units, I/O or memory bandwidth, even …

Application migration in HPC—a driver of the exascale era?

S Pickartz, S Lankes, A Monti, C Clauss… - … Conference on High …, 2016 - ieeexplore.ieee.org
Application migration is valuable for modern computing centers. Apart from a facilitation of
the maintenance process, it enables dynamic load balancing for an improvement of the …