An overview of the Trilinos project

MA Heroux, RA Bartlett, VE Howle… - ACM Transactions on …, 2005 - dl.acm.org
The Trilinos Project is an effort to facilitate the design, development, integration, and
ongoing support of mathematical software libraries within an object-oriented framework for …

The ELPA library: scalable parallel eigenvalue solutions for electronic structure theory and computational science

A Marek, V Blum, R Johanni, V Havu… - Journal of Physics …, 2014 - iopscience.iop.org
Obtaining the eigenvalues and eigenvectors of large matrices is a key problem in electronic
structure theory and many other areas of computational science. The computational effort …

Elemental: A new framework for distributed memory dense matrix computations

J Poulson, B Marker, RA Van de Geijn… - ACM Transactions on …, 2013 - dl.acm.org
Parallelizing dense matrix computations to distributed memory architectures is a well-
studied subject and generally considered to be among the best understood domains of …

[PDF][PDF] The parallel BGL: A generic library for distributed graph computations

D Gregor, A Lumsdaine - Parallel Object-Oriented Scientific …, 2005 - researchgate.net
This paper presents the Parallel BGL, a generic C++ library for distributed graph
computation. Like the sequential Boost Graph Library (BGL) upon which it is based, the …

[图书][B] Parallel scientific computation: a structured approach using BSP and MPI

RH Bisseling - 2004 - books.google.com
This is the first text explaining how to use the bulk synchronous parallel (BSP) model and the
freely available BSPlib communication library in parallel algorithm design and parallel …

Coded computing: Mitigating fundamental bottlenecks in large-scale distributed computing and machine learning

S Li, S Avestimehr - Foundations and Trends® in …, 2020 - nowpublishers.com
We introduce the concept of “coded computing”, a novel computing paradigm that utilizes
coding theory to effectively inject and leverage data/computation redundancy to mitigate …

The science of deriving dense linear algebra algorithms

P Bientinesi, JA Gunnels, ME Myers… - ACM Transactions on …, 2005 - dl.acm.org
In this article we present a systematic approach to the derivation of families of high-
performance algorithms for a large set of frequently encountered dense linear algebra …

Parallel out-of-core computation and updating of the QR factorization

BC Gunter, RA Van De Geijn - ACM Transactions on Mathematical …, 2005 - dl.acm.org
This article discusses the high-performance parallel implementation of the computation and
updating of QR factorizations of dense matrices, including problems large enough to require …

Heterogeneous distribution of computations solving linear algebra problems on networks of heterogeneous computers

A Kalinov, A Lastovetsky - Journal of Parallel and Distributed Computing, 2001 - Elsevier
This paper presents and analyzes two different strategies of heterogeneous distribution of
computations solving dense linear algebra problems on heterogeneous networks of …

[图书][B] Handbook of parallel computing and statistics

EJ Kontoghiorghes - 2005 - books.google.com
This unique reference weaves together the principles and theoretical models of parallel
computing with the design, analysis, and application of algorithms for solving statistical …