An ephemeral burst-buffer file system for scientific applications
Burst buffers are becoming an indispensable hardware resource on large-scale
supercomputers to buffer the bursty I/O from scientific applications. However, there is a lack …
supercomputers to buffer the bursty I/O from scientific applications. However, there is a lack …
Exploration of lossy compression for application-level checkpoint/restart
The scale of high performance computing (HPC) systems is exponentially growing,
potentially causing prohibitive shrinkage of mean time between failures (MTBF) while the …
potentially causing prohibitive shrinkage of mean time between failures (MTBF) while the …
Hermes: a heterogeneous-aware multi-tiered distributed I/O buffering system
Modern High-Performance Computing (HPC) systems are adding extra layers to the memory
and storage hierarchy named deep memory and storage hierarchy (DMSH), to increase I/O …
and storage hierarchy named deep memory and storage hierarchy (DMSH), to increase I/O …
Scalable I/O-aware job scheduling for burst buffer enabled HPC clusters
The economics of flash vs. disk storage is driving HPC centers to incorporate faster solid-
state burst buffers into the storage hierarchy in exchange for smaller parallel file system …
state burst buffers into the storage hierarchy in exchange for smaller parallel file system …
Efficient user-level storage disaggregation for deep learning
On large-scale high performance computing (HPC) systems, applications are provisioned
with aggregated resources to meet their peak demands for brief periods. This results in …
with aggregated resources to meet their peak demands for brief periods. This results in …
Managing I/O interference in a shared burst buffer system
In this work, we investigate the problem of inter-application interference in a shared Burst
Buffer (BB) system. A BB is a new storage technology for HPC architectures that acts as an …
Buffer (BB) system. A BB is a new storage technology for HPC architectures that acts as an …
Hpc storage service autotuning using variational-autoencoder-guided asynchronous bayesian optimization
Distributed data storage services tailored to specific applications have grown popular in the
high-performance computing (HPC) community as a way to address I/O and storage …
high-performance computing (HPC) community as a way to address I/O and storage …
Survey of storage systems for high-performance computing
In current supercomputers, storage is typically provided by parallel distributed file systems
for hot data and tape archives for cold data. These file systems are often compatible with …
for hot data and tape archives for cold data. These file systems are often compatible with …
Quantifying i/o and communication traffic interference on dragonfly networks equipped with burst buffers
HPC systems have shifted to burst buffer storage and high radix interconnect topologies in
order to meet the challenges of large-scale, data-intensive scientific computing. Both of …
order to meet the challenges of large-scale, data-intensive scientific computing. Both of …
Performance characterization of scientific workflows for the optimal use of burst buffers
Scientific discoveries are increasingly dependent upon the analysis of large volumes of data
from observations and simulations of complex phenomena. Scientists compose the complex …
from observations and simulations of complex phenomena. Scientists compose the complex …