KNEM: A generic and scalable kernel-assisted intra-node MPI communication framework

B Goglin, S Moreaud - Journal of Parallel and Distributed Computing, 2013 - Elsevier
The multiplication of cores in today's architectures raises the importance of intra-node
communication in modern clusters and their impact on the overall parallel application …

Framework for scalable intra-node collective operations using shared memory

S Jain, R Kaleem, MG Balmana… - … Conference for High …, 2018 - ieeexplore.ieee.org
Collective operations are used in MPI programs to express common communication
patterns, collective computations, or synchronization. In many collectives, such as …

Salar: Scalable and adaptive designs for large message reduction collectives

M Bayatpour, JM Hashmi, S Chakraborty… - 2018 IEEE …, 2018 - ieeexplore.ieee.org
Message Passing Interface (MPI), thus far, has remained a dominant programming model to
program large-scale scientific applications. Collective communication operations in MPI are …

Contention-aware kernel-assisted MPI collectives for multi-/many-core systems

S Chakraborty, H Subramoni… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Multi-/many-core CPU based architectures are seeing widespread adoption due to their
unprecedented compute performance in a small power envelope. With the increasingly …

Designing efficient shared address space reduction collectives for multi-/many-cores

JM Hashmi, S Chakraborty, M Bayatpour… - 2018 IEEE …, 2018 - ieeexplore.ieee.org
State-of-the-art designs for the hierarchical reduction collective operation in MPI that work on
the concept of distributed address spaces incur the cost of intermediate copies inside the …

Gait analysis for human identification in frequency domain

S Yu, L Wang, W Hu, T Tan - … on Image and Graphics (ICIG'04), 2004 - ieeexplore.ieee.org
In this paper, we analyze the spatio-temporal human characteristic of moving silhouettes in
frequency domain, and find key Fourier descriptors that have better discriminatory capability …

Process distance-aware adaptive MPI collective communications

T Ma, T Herault, G Bosilca… - 2011 IEEE International …, 2011 - ieeexplore.ieee.org
Message Passing Interface (MPI) implementations provide a great flexibility to allow users to
arbitrarily bind processes to computing cores to fully exploit clusters of multicore/many-core …

Benefits of cross memory attach for mpi libraries on hpc clusters

J Vienne - Proceedings of the 2014 Annual Conference on …, 2014 - dl.acm.org
With the number of cores per node increasing in modern clusters, an efficient
implementation of intra-node communications is critical for application performance. MPI …

HierKNEM: An adaptive framework for kernel-assisted and topology-aware collective communications on many-core clusters

T Ma, G Bosilca, A Bouteiller… - 2012 IEEE 26th …, 2012 - ieeexplore.ieee.org
Multicore Clusters, which have become the most prominent form of High Performance
Computing (HPC) systems, challenge the performance of MPI applications with non uniform …

A placement-aware soft error rate estimation of combinational circuits for multiple transient faults in CMOS technology

GI Paliaroutis, P Tsoumanis… - … on Defect and Fault …, 2018 - ieeexplore.ieee.org
A considerable disadvantage that comes with the downscaling of the CMOS technology is
the ever-increasing susceptibility of Integrated Circuits (ICs) to soft errors. Therefore, the …