On the convergence of malleability and the HPC PowerStack: exploiting dynamism in over-provisioned and power-constrained HPC systems
E Arima, AI Comprés, M Schulz - International Conference on High …, 2022 - Springer
Abstract Recent High-Performance Computing (HPC) systems are facing important
challenges, such as massive power consumption, while at the same time significantly under …
challenges, such as massive power consumption, while at the same time significantly under …
Maximizing system utilization via parallelism management for co-located parallel applications
With an increasing number of cores and memory controllers in multiprocessor platforms, co-
location of parallel applications is gaining on importance. Key to achieve good performance …
location of parallel applications is gaining on importance. Key to achieve good performance …
nos-v: Co-executing hpc applications using system-wide task scheduling
Future Exascale systems will feature massive parallelism, many-core processors and
heterogeneous architectures. In this scenario, it is increasingly difficult for HPC applications …
heterogeneous architectures. In this scenario, it is increasingly difficult for HPC applications …
Intelligent colocation of HPC workloads
Many HPC applications suffer from a bottleneck in the shared caches, instruction execution
units, I/O or memory bandwidth, even though the remaining resources may be underutilized …
units, I/O or memory bandwidth, even though the remaining resources may be underutilized …
Dynamic co-scheduling driven by main memory bandwidth utilization
Most applications running on supercomputers achieve only a fraction of a system's peak
performance. It has been demonstrated that the co-scheduling of applications can improve …
performance. It has been demonstrated that the co-scheduling of applications can improve …
Prospects and challenges of virtual machine migration in HPC
The continuous growth of supercomputers is accompanied by increased complexity of the
intra‐node level and the interconnection topology. Consequently, the whole software stack …
intra‐node level and the interconnection topology. Consequently, the whole software stack …
Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach
U Saroliya, E Arima, D Liu… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
GPU-based heterogeneous architectures are now commonly used in HPC clusters. Due to
their architectural simplicity specialized for data-level parallelism, GPUs can offer much …
their architectural simplicity specialized for data-level parallelism, GPUs can offer much …
M3at: Monitoring agents assignment model for data-intensive applications
Nowadays, massive amounts of data are acquired, transferred, and analyzed nearly in real-
time by utilizing a large number of computing and storage elements interconnected through …
time by utilizing a large number of computing and storage elements interconnected through …
Intelligent colocation of workloads for enhanced server efficiency
Many server applications achieve only a fraction of their theoretical peak performance due to
bottlenecks in the shared caches, instruction execution units, I/O or memory bandwidth, even …
bottlenecks in the shared caches, instruction execution units, I/O or memory bandwidth, even …
Application migration in HPC—a driver of the exascale era?
Application migration is valuable for modern computing centers. Apart from a facilitation of
the maintenance process, it enables dynamic load balancing for an improvement of the …
the maintenance process, it enables dynamic load balancing for an improvement of the …