Performance analysis of high performance computing applications on the Amazon Web Services cloud

KR Jackson, L Ramakrishnan, K Muriki… - 2010 IEEE second …, 2010 - ieeexplore.ieee.org
Cloud computing has seen tremendous growth, particularly for commercial web
applications. The on-demand, pay-as-you-go model creates a flexible and cost-effective …

The Scalasca performance toolset architecture

M Geimer, F Wolf, BJN Wylie… - Concurrency and …, 2010 - Wiley Online Library
Scalasca is a performance toolset that has been specifically designed to analyze parallel
application execution behavior on large‐scale systems with many thousands of processors …

Sustained petascale performance of seismic simulations with SeisSol on SuperMUC

A Breuer, A Heinecke, S Rettenberger, M Bader… - … Conference, ISC 2014 …, 2014 - Springer
Seismic simulations in realistic 3D Earth models require peta- or even exascale computing
power to capture small-scale features of high relevance for scientific and industrial …

Usage of the SCALASCA toolset for scalable performance analysis of large-scale parallel applications

F Wolf, BJN Wylie, E Abraham, D Becker… - Tools for High …, 2008 - Springer
Scalasca is a performance toolset that has been specifically designed to analyze
parallel application behavior on large-scale systems, but is also well-suited for small- and …

Early evaluation of IBM BlueGene/P

S Alam, R Barrett, M Bast, MR Fahey… - SC'08: Proceedings …, 2008 - ieeexplore.ieee.org
BlueGene/P (BG/P) is the second generation BlueGene architecture from IBM, succeeding
BlueGene/L (BG/L). BG/P is a system-on-a-chip (SoC) design that uses four PowerPC 450 …

Communication requirements and interconnect optimization for high-end scientific applications

S Kamil, L Oliker, A Pinar, J Shalf - IEEE Transactions on …, 2009 - ieeexplore.ieee.org
The path toward realizing next-generation petascale and exascale computing is increasingly
dependent on building supercomputers with unprecedented numbers of processors. To …

Automated mapping of regular communication graphs on mesh interconnects

A Bhatelé, GR Gupta, LV Kalé… - … Conference on High …, 2010 - ieeexplore.ieee.org
Network contention has a significantly adverse effect on the performance of parallel
applications with increasing size of parallel machines. Machines of the petascale era are …

Characterizing parallel scientific applications on commodity clusters: An empirical study of a tapered fat-tree

EA León, I Karlin, A Bhatele, SH Langer… - SC'16: Proceedings …, 2016 - ieeexplore.ieee.org
Understanding the characteristics and requirements of applications that run on commodity
clusters is key to properly configuring current machines and, more importantly, procuring …

[BOOK][B] Automating topology aware mapping for supercomputers

A Bhatele - 2010 - search.proquest.com
Petascale machines with hundreds of thousands of cores are being built. These machines
have varying interconnect topologies and large network diameters. Computation is cheap …

[BOOK][B] Performance tuning of scientific applications

DH Bailey, RF Lucas, S Williams - 2010 - books.google.com
Describing useful current research in modern performance science and engineering, this
book helps real-world users of parallel computer systems to better understand both the …