A survey of communication performance models for high-performance computing
JA Rico-Gallego, JC Díaz-Martín… - ACM Computing …, 2019 - dl.acm.org
This survey aims to present the state of the art in analytic communication performance
models, providing sufficiently detailed descriptions of particularly noteworthy efforts …
models, providing sufficiently detailed descriptions of particularly noteworthy efforts …
Improving TCP congestion control over internets with heterogeneous transmission media
C Parsa, JJ Garcia-Luna-Aceves - … International Conference on …, 1999 - ieeexplore.ieee.org
We present a new implementation of TCP that is better suited to today's Internet than TCP
Reno or Tahoe. Our implementation of TCP, which we call TCP Santa Cruz, is designed to …
Reno or Tahoe. Our implementation of TCP, which we call TCP Santa Cruz, is designed to …
Models of parallel computation: a survey and classification
In this paper, the state-of-the-art parallel computational model research is reviewed. We will
introduce various models that were developed during the past decades. According to their …
introduce various models that were developed during the past decades. According to their …
Modeling communication in cache-coherent SMP systems: a case-study with Xeon Phi
Most multi-core and some many-core processors implement cache coherency protocols that
heavily complicate the design of optimal parallel algorithms. Communication is performed …
heavily complicate the design of optimal parallel algorithms. Communication is performed …
BSF: A parallel computation model for scalability estimation of iterative numerical algorithms on cluster computing systems
LB Sokolinsky - Journal of Parallel and Distributed Computing, 2021 - Elsevier
This paper examines a novel parallel computation model called bulk synchronous farm
(BSF) that focuses on estimating the scalability of compute-intensive iterative algorithms …
(BSF) that focuses on estimating the scalability of compute-intensive iterative algorithms …
Synchronous parallel kinetic Monte Carlo for continuum diffusion-reaction systems
E Martínez, J Marian, MH Kalos, JM Perlado - Journal of Computational …, 2008 - Elsevier
A novel parallel kinetic Monte Carlo (kMC) algorithm formulated on the basis of perfect time
synchronicity is presented. The algorithm is intended as a generalization of the standard n …
synchronicity is presented. The algorithm is intended as a generalization of the standard n …
Performance analysis and optimization of MPI collective operations on multi-core clusters
Memory hierarchy on multi-core clusters has twofold characteristics: vertical memory
hierarchy and horizontal memory hierarchy. This paper proposes new parallel computation …
hierarchy and horizontal memory hierarchy. This paper proposes new parallel computation …
Detailed modeling of heterogeneous and contention-constrained point-to-point mpi communication
A Thune, SA Reinemo, T Skeie… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The network topology of modern parallel computing systems is inherently heterogeneous,
with a variety of latency and bandwidth values. Moreover, contention for the bandwidth can …
with a variety of latency and bandwidth values. Moreover, contention for the bandwidth can …
Model-based estimation of the communication cost of hybrid data-parallel applications on heterogeneous clusters
JA Rico-Gallego, AL Lastovetsky… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Heterogeneous systems composed of CPUs and accelerators sharing communication
channels of different performance are getting mainstream in HPC but, at the same time, they …
channels of different performance are getting mainstream in HPC but, at the same time, they …
Extending τ-lop to model concurrent MPI communications in multicore clusters
JA Rico-Gallego, JC Díaz-Martín… - Future Generation …, 2016 - Elsevier
Achieving optimal performance of MPI applications on current multi-core architectures,
composed of multiple shared communication channels and deep memory hierarchies, is not …
composed of multiple shared communication channels and deep memory hierarchies, is not …