CloudCmp: Shopping for a Cloud Made Easy
Cloud computing has gained much popularity recently, and many companies now offer a
variety of public cloud computing services, such as Google AppEngine, Amazon AWS, and …
Performance analysis of a hybrid MPI/CUDA implementation of the NAS-LU benchmark
We present the performance analysis of a port of the LU benchmark from the NAS Parallel
Benchmark (NPB) suite to NVIDIA's Compute Unified Device Architecture (CUDA), and …
Model-driven scheduling for distributed stream processing systems
Distributed Stream Processing Systems (DSPS) are “Fast Data” platforms that allow
streaming applications to be composed and executed with low latency on commodity …
Rethinking hardware-software codesign for exascale systems
J Shalf, D Quinlan, C Janssen - Computer, 2011 - ieeexplore.ieee.org
The rapid and disruptive changes anticipated in hardware design over this next decade
necessitate a more agile development process, such as the hardware-software co-design …
On the role of co-design in high performance computing
RF Barrett, S Borkar, SS Dosanjh… - Transition of HPC …, 2013 - ebooks.iospress.nl
Preparations for Exascale computing have led to the realization that future computing
environments will be significantly different from those that provide Petascale capabilities …
Cap3: A cloud auto-provisioning framework for parallel processing using on-demand and spot instances
Cloud computing has drawn increasing attention from the scientific computing community
due to its ease of use, elasticity, and relatively low cost. Because a high-performance …
An unstructured CFD mini‐application for the performance prediction of a production CFD code
AMB Owenson, SA Wright, RA Bunt… - Concurrency and …, 2020 - Wiley Online Library
Maintaining the performance of large scientific codes is a difficult task. To aid in this task, a
number of mini‐applications have been developed that are more tractable to analyze than …
Using simulation to design extreme-scale applications and architectures: programming model exploration
CL Janssen, H Adalsteinsson, JP Kenny - ACM SIGMETRICS …, 2011 - dl.acm.org
A key problem facing application developers is that they are expected to utilize extreme
levels of parallelism soon after delivery of future leadership class machines, but developing …
SIMCAN: A flexible, scalable and expandable simulation platform for modelling and simulating distributed architectures and applications
In this paper we propose a new simulation platform called SIMCAN, for analyzing parallel
and distributed systems. This platform is aimed to test parallel and distributed architectures …
Durango: Scalable synthetic workload generation for extreme-scale application performance modeling and simulation
CD Carothers, JS Meredith, MP Blanco… - Proceedings of the …, 2017 - dl.acm.org
Performance modeling of extreme-scale applications on accurate representations of
potential architectures is critical for designing next generation supercomputing systems …