A survey of communication performance models for high-performance computing

JA Rico-Gallego, JC Díaz-Martín… - ACM Computing …, 2019 - dl.acm.org
This survey aims to present the state of the art in analytic communication performance
models, providing sufficiently detailed descriptions of particularly noteworthy efforts …

GPU register file virtualization

H Jeon, GS Ravi, NS Kim, M Annavaram - Proceedings of the 48th …, 2015 - dl.acm.org
To support massive number of parallel thread contexts, Graphics Processing Units (GPUs)
use a huge register file, which is responsible for a large fraction of GPU's total power and …

Extending τ-lop to model concurrent MPI communications in multicore clusters

JA Rico-Gallego, JC Díaz-Martín… - Future Generation …, 2016 - Elsevier
Achieving optimal performance of MPI applications on current multi-core architectures,
composed of multiple shared communication channels and deep memory hierarchies, is not …

Adaptive-compi: Enhancing mpi-based applications' performance and scalability by using adaptive compression

R Filgueira, DE Singh, J Carretero… - … Journal of High …, 2011 - journals.sagepub.com
This paper presents an optimization of MPI communication, called Adaptive-CoMPI, based
on runtime compression of MPI messages exchanged by applications. The technique …

C-Lop: Accurate contention-based modeling of MPI concurrent communication

Z Wang, H Chen, W Cai, X Dong, X Zhang - Parallel Computing, 2022 - Elsevier
MPI communication optimization is a crucial stage to optimize high-performance
applications. As a formal analysis of MPI communication, the communication performance …

Predictive models for bandwidth sharing in high performance clusters

V Jérôme, M Maxime, V Jean-Marc… - 2008 IEEE …, 2008 - ieeexplore.ieee.org
Using MPI as communication interface, one or several applications may introduce complex
communication behaviors over the network cluster. This effect is increased when nodes of …

[HTML][HTML] Analyse et modélisation des communications concurrentes dans les réseaux haute performance

M Martinasso - 2007 - inria.hal.science
La croissance des capacités de calcul des processeurs se poursuit, non plus par
l'augmentation des fréquences d'horloge, mais par la multiplication d'unités de traitement …

Model of concurrent MPI communications over SMP clusters

M Martinasso, JF Méhaut - 2006 - inria.hal.science
SMP clusters are one of the most common HPC platform used by scientific applications. The
nodes of SMP cluster contain several computing elements. Scientific applications may be …

Modeling contention and mapping effects in multi-core clusters

JA Rico-Gallego, JC Díaz-Martín… - European Conference on …, 2015 - Springer
Modeling and formal analysis of parallel algorithms contribute to optimize their performance.
Modern multi-core are complex machines composed of heterogeneous shared …

Scalable and Energy Efficient Execution Methods for Multicore Systems

D Li - 2011 - vtechworks.lib.vt.edu
Multicore architectures impose great pressure on resource management. The exploration
spaces available for resource management increase explosively, especially for large-scale …