A survey of communication performance models for high-performance computing

JA Rico-Gallego, JC Díaz-Martín… - ACM Computing …, 2019 - dl.acm.org
This survey aims to present the state of the art in analytic communication performance
models, providing sufficiently detailed descriptions of particularly noteworthy efforts …

Improving TCP congestion control over internets with heterogeneous transmission media

C Parsa, JJ Garcia-Luna-Aceves - … International Conference on …, 1999 - ieeexplore.ieee.org
We present a new implementation of TCP that is better suited to today's Internet than TCP
Reno or Tahoe. Our implementation of TCP, which we call TCP Santa Cruz, is designed to …

Models of parallel computation: a survey and classification

Y Zhang, G Chen, G Sun, Q Miao - Frontiers of Computer Science in China, 2007 - Springer
In this paper, the state-of-the-art parallel computational model research is reviewed. We will
introduce various models that were developed during the past decades. According to their …

Modeling communication in cache-coherent SMP systems: a case-study with Xeon Phi

S Ramos, T Hoefler - Proceedings of the 22nd international symposium …, 2013 - dl.acm.org
Most multi-core and some many-core processors implement cache coherency protocols that
heavily complicate the design of optimal parallel algorithms. Communication is performed …

BSF: A parallel computation model for scalability estimation of iterative numerical algorithms on cluster computing systems

LB Sokolinsky - Journal of Parallel and Distributed Computing, 2021 - Elsevier
This paper examines a novel parallel computation model called bulk synchronous farm
(BSF) that focuses on estimating the scalability of compute-intensive iterative algorithms …

Synchronous parallel kinetic Monte Carlo for continuum diffusion-reaction systems

E Martínez, J Marian, MH Kalos, JM Perlado - Journal of Computational …, 2008 - Elsevier
A novel parallel kinetic Monte Carlo (kMC) algorithm formulated on the basis of perfect time
synchronicity is presented. The algorithm is intended as a generalization of the standard n …

Performance analysis and optimization of MPI collective operations on multi-core clusters

B Tu, J Fan, J Zhan, X Zhao - The Journal of Supercomputing, 2012 - Springer
Memory hierarchy on multi-core clusters has twofold characteristics: vertical memory
hierarchy and horizontal memory hierarchy. This paper proposes new parallel computation …

Detailed modeling of heterogeneous and contention-constrained point-to-point MPI communication

A Thune, SA Reinemo, T Skeie… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The network topology of modern parallel computing systems is inherently heterogeneous,
with a variety of latency and bandwidth values. Moreover, contention for the bandwidth can …

Model-based estimation of the communication cost of hybrid data-parallel applications on heterogeneous clusters

JA Rico-Gallego, AL Lastovetsky… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Heterogeneous systems composed of CPUs and accelerators sharing communication
channels of different performance are getting mainstream in HPC but, at the same time, they …

Extending τ-Lop to model concurrent MPI communications in multicore clusters

JA Rico-Gallego, JC Díaz-Martín… - Future Generation …, 2016 - Elsevier
Achieving optimal performance of MPI applications on current multi-core architectures,
composed of multiple shared communication channels and deep memory hierarchies, is not …