Revisiting I/O behavior in large-scale storage systems: The expected and the unexpected
Large-scale applications typically spend a large fraction of their execution time performing
I/O to a parallel storage system. However, with rapid progress in compute and storage …
ElastiSim: a batch-system simulator for malleable workloads
As high-performance computing infrastructures move towards exascale, the role of resource
and job management systems is more critical now than ever. Simulating batch systems to …
PRIONN: Predicting runtime and IO using neural networks
For job allocation decision, current batch schedulers have access to and use only
information on the number of nodes and runtime because it is readily available at …
Zero-cycle loads: Microarchitecture support for reducing load latency
TM Austin, GS Sohi - Proceedings of the 28th annual …, 1995 - ieeexplore.ieee.org
Untolerated load instruction latencies often have a significant impact on overall program
performance. As one means of mitigating this effect we present an aggressive hardware …
Influence of noisy environments on behavior of HPC applications
Many contemporary HPC systems expose their jobs to substantial amounts of interference,
leading to significant run-to-run variation. For example, application runtimes on Theta, a …
Quantifying I/O and communication traffic interference on dragonfly networks equipped with burst buffers
HPC systems have shifted to burst buffer storage and high radix interconnect topologies in
order to meet the challenges of large-scale, data-intensive scientific computing. Both of …
Evaluation of an interference-free node allocation policy on fat-tree clusters
Interference between jobs competing for network bandwidth on a fat-tree cluster can cause
significant variability and degradation in performance. These performance issues can be …
Performance characterization of scientific workflows for the optimal use of burst buffers
Scientific discoveries are increasingly dependent upon the analysis of large volumes of data
from observations and simulations of complex phenomena. Scientists compose the complex …
GIFT: A coupon based Throttle-and-Reward mechanism for fair and efficient I/O bandwidth management on parallel storage systems
Large-scale parallel applications are highly data-intensive and perform terabytes of I/O
routinely. Unfortunately, on a large-scale system where multiple applications run …
Evaluating burst buffer placement in HPC systems
Burst buffers (BBs) are increasingly exploited in contemporary supercomputers to bridge the
performance gap between compute and storage systems. The design of BBs, particularly the …