Performance analysis of high performance computing applications on the Amazon Web Services cloud
KR Jackson, L Ramakrishnan, K Muriki… - 2010 IEEE second …, 2010 - ieeexplore.ieee.org
Cloud computing has seen tremendous growth, particularly for commercial web
applications. The on-demand, pay-as-you-go model creates a flexible and cost-effective …
The Scalasca performance toolset architecture
M Geimer, F Wolf, BJN Wylie… - Concurrency and …, 2010 - Wiley Online Library
Scalasca is a performance toolset that has been specifically designed to analyze parallel
application execution behavior on large‐scale systems with many thousands of processors …
Sustained petascale performance of seismic simulations with SeisSol on SuperMUC
A Breuer, A Heinecke, S Rettenberger, M Bader… - … Conference, ISC 2014 …, 2014 - Springer
Seismic simulations in realistic 3D Earth models require peta- or even exascale computing
power to capture small-scale features of high relevance for scientific and industrial …
Usage of the SCALASCA toolset for scalable performance analysis of large-scale parallel applications
Abstract: Scalasca is a performance toolset that has been specifically designed to analyze
parallel application behavior on large-scale systems, but is also well-suited for small- and …
Early evaluation of IBM BlueGene/P
S Alam, R Barrett, M Bast, MR Fahey… - SC'08: Proceedings …, 2008 - ieeexplore.ieee.org
BlueGene/P (BG/P) is the second generation BlueGene architecture from IBM, succeeding
BlueGene/L (BG/L). BG/P is a system-on-a-chip (SoC) design that uses four PowerPC 450 …
Communication requirements and interconnect optimization for high-end scientific applications
The path toward realizing next-generation petascale and exascale computing is increasingly
dependent on building supercomputers with unprecedented numbers of processors. To …
Automated mapping of regular communication graphs on mesh interconnects
Network contention has a significantly adverse effect on the performance of parallel
applications with increasing size of parallel machines. Machines of the petascale era are …
Characterizing parallel scientific applications on commodity clusters: An empirical study of a tapered fat-tree
Understanding the characteristics and requirements of applications that run on commodity
clusters is key to properly configuring current machines and, more importantly, procuring …
[BOOK][B] Automating topology aware mapping for supercomputers
A Bhatele - 2010 - search.proquest.com
Petascale machines with hundreds of thousands of cores are being built. These machines
have varying interconnect topologies and large network diameters. Computation is cheap …
[BOOK][B] Performance tuning of scientific applications
DH Bailey, RF Lucas, S Williams - 2010 - books.google.com
Describing useful current research in modern performance science and engineering, this
book helps real-world users of parallel computer systems to better understand both the …