[PDF][PDF] {CloudCmp}: Shopping for a Cloud Made Easy

A Li, X Yang, M Zhang - 2nd USENIX Workshop on Hot Topics in Cloud …, 2010 - usenix.org
Cloud computing has gained much popularity recently, and many companies now offer a
variety of public cloud computing services, such as Google AppEngine, Amazon AWS, and …

Performance analysis of a hybrid MPI/CUDA implementation of the NASLU benchmark

SJ Pennycook, SD Hammond, SA Jarvis… - ACM SIGMETRICS …, 2011 - dl.acm.org
We present the performance analysis of a port of the LU benchmark from the NAS Parallel
Benchmark (NPB) suite to NVIDIA's Compute Unified Device Architecture (CUDA), and …

Model-driven scheduling for distributed stream processing systems

A Shukla, Y Simmhan - Journal of Parallel and Distributed Computing, 2018 - Elsevier
Abstract Distributed Stream Processing Systems (DSPS) are “Fast Data” platforms that allow
streaming applications to be composed and executed with low latency on commodity …

Rethinking hardware-software codesign for exascale systems

J Shalf, D Quinlan, C Janssen - Computer, 2011 - ieeexplore.ieee.org
The rapid and disruptive changes anticipated in hardware design over this next decade
necessitate a more agile development process, such as the hardware-software co-design …

On the role of co-design in high performance computing

RF Barrett, S Borkar, SS Dosanjh… - Transition of HPC …, 2013 - ebooks.iospress.nl
Preparations for Exascale computing have led to the realization that future computing
environments will be significantly different from those that provide Petascale capabilities …

Cap3: A cloud auto-provisioning framework for parallel processing using on-demand and spot instances

H Huang, L Wang, BC Tak, L Wang… - 2013 IEEE Sixth …, 2013 - ieeexplore.ieee.org
Cloud computing has drawn increasing attention from the scientific computing community
due to its ease of use, elasticity, and relatively low cost. Because a high-performance …

An unstructured CFD mini‐application for the performance prediction of a production CFD code

AMB Owenson, SA Wright, RA Bunt… - Concurrency and …, 2020 - Wiley Online Library
Maintaining the performance of large scientific codes is a difficult task. To aid in this task, a
number of mini‐applications have been developed that are more tractable to analyze than …

Using simulation to design extremescale applications and architectures: programming model exploration

CL Janssen, H Adalsteinsson, JP Kenny - ACM SIGMETRICS …, 2011 - dl.acm.org
A key problem facing application developers is that they are expected to utilize extreme
levels of parallelism soon after delivery of future leadership class machines, but developing …

SIMCAN: A flexible, scalable and expandable simulation platform for modelling and simulating distributed architectures and applications

A Núñez, J Fernández, R Filgueira, F García… - … Modelling Practice and …, 2012 - Elsevier
In this paper we propose a new simulation platform called SIMCAN, for analyzing parallel
and distributed systems. This platform is aimed to test parallel and distributed architectures …

Durango: Scalable synthetic workload generation for extreme-scale application performance modeling and simulation

CD Carothers, JS Meredith, MP Blanco… - Proceedings of the …, 2017 - dl.acm.org
Performance modeling of extreme-scale applications on accurate representations of
potential architectures is critical for designing next generation supercomputing systems …