Fine-grained parallel incomplete LU factorization
This paper presents a new fine-grained parallel algorithm for computing an incomplete LU
factorization. All nonzeros in the incomplete factors can be computed in parallel and …
factorization. All nonzeros in the incomplete factors can be computed in parallel and …
ShyLU: A hybrid-hybrid solver for multicore platforms
S Rajamanickam, EG Boman… - 2012 IEEE 26th …, 2012 - ieeexplore.ieee.org
With the ubiquity of multicore processors, it is crucial that solvers adapt to the hierarchical
structure of modern architectures. We present ShyLU, a “hybrid-hybrid” solver for general …
structure of modern architectures. We present ShyLU, a “hybrid-hybrid” solver for general …
Structure-adaptive parallel solution of sparse triangular linear systems
Solving sparse triangular systems of linear equations is a performance bottleneck in many
methods for solving more general sparse systems. Both for direct methods and for many …
methods for solving more general sparse systems. Both for direct methods and for many …
一种分层并行计算的软件化雷达系统.
赵帮, 李武旭, 赵浩然 - Telecommunication Engineering, 2022 - search.ebscohost.com
为解决软件化雷达系统实时处理大规模数据的问题, 提出了一种分层级的分布式并行计算方法,
并设计了一种低延时大规模数据处理能力的软件化雷达系统. 该系统采用三层并行计算方法 …
并设计了一种低延时大规模数据处理能力的软件化雷达系统. 该系统采用三层并行计算方法 …
[PDF][PDF] Parallel incomplete-LU and Cholesky factorization in the preconditioned iterative methods on the GPU
M Naumov - Nvidia Technical Report NVR-2012-003, 2012 - research.nvidia.com
A novel algorithm for computing the incomplete-LU and Cholesky factorization with 0 fill-in
on a graphics processing unit (GPU) is proposed. It implements the incomplete factorization …
on a graphics processing unit (GPU) is proposed. It implements the incomplete factorization …
INMOST parallel platform for mathematical modeling and applications
K Terekhov, Y Vassilevski - … 2018, Moscow, Russia, September 24–25 …, 2019 - Springer
In the present work we present INMOST, the programming platform for mathematical
modelling and its application to a couple of practical problems. INMOST consists of a …
modelling and its application to a couple of practical problems. INMOST consists of a …
Exploiting task and data parallelism in ILUPACK's preconditioned CG solver on NUMA architectures and many-core accelerators
We present specialized implementations of the preconditioned iterative linear system solver
in ILUPACK for Non-Uniform Memory Access (NUMA) platforms and many-core hardware co …
in ILUPACK for Non-Uniform Memory Access (NUMA) platforms and many-core hardware co …
Tools and methods for measuring and tuning the energy efficiency of HPC systems
Energy costs nowadays represent a significant share of the total costs of ownership of High
Performance Computing (HPC) systems. In this paper we provide an overview on different …
Performance Computing (HPC) systems. In this paper we provide an overview on different …
Parallelization of multilevel ILU preconditioners on distributed-memory multiprocessors
In this paper we investigate the parallelization of the ILUPACK library for the solution of
sparse linear systems on distributed-memory multiprocessors. The parallelization approach …
sparse linear systems on distributed-memory multiprocessors. The parallelization approach …
Assessing the impact of the CPU power-saving modes on the task-parallel solution of sparse linear systems
We investigate the benefits that an energy-aware implementation of the runtime in charge of
the concurrent execution of ILUPACK—a sophisticated preconditioned iterative solver for …
the concurrent execution of ILUPACK—a sophisticated preconditioned iterative solver for …