[图书][B] Automatic performance tuning of sparse matrix kernels

RW Vuduc - 2003 - search.proquest.com
This dissertation presents an automated system to generate highly efficient, platform-
adapted implementations of sparse matrix kernels. We show that conventional …

Performance optimizations and bounds for sparse matrix-vector multiply

R Vuduc, JW Demmel, KA Yelick… - SC'02: Proceedings …, 2002 - ieeexplore.ieee.org
We consider performance tuning, by code and data structure reorganization, of sparse
matrix-vector multiply (SpM× V), one of the most important computational kernels in scientific …

When cache blocking of sparse matrix vector multiply works and why

R Nishtala, RW Vuduc, JW Demmel… - Applicable Algebra in …, 2007 - Springer
We present new performance models and more compact data structures for cache blocking
when applied to sparse matrix-vector multiply (SpM× V). We extend our prior models by …

[PDF][PDF] Automatic performance tuning and analysis of sparse triangular solve

R Vuduc, S Kamil, J Hsu, R Nishtala, JW Demmel… - 2002 - academia.edu
Automatic Performance Tuning and Analysis of Sparse Triangular Solve Page 1 Automatic
Performance Tuning and Analysis of Sparse Triangular Solve Richard Vuduc, Shoaib Kamil …

[PDF][PDF] Performance modeling and analysis of cache blocking in sparse matrix vector multiply

R Nishtala, RW Vuduc… - … of California, Tech …, 2004 - digitalassets.lib.berkeley.edu
We consider the problem of building high-performance implementations of sparse matrix-
vector multiply (SpM× V), or y= y+ A· x, which is an important and ubiquitous computational …

Memory hierarchy optimizations and performance bounds for sparse ATAx

R Vuduc, A Gyulassy, JW Demmel… - Proceedings of the 2003 …, 2003 - dl.acm.org
This paper presents uniprocessor performance optimizations, automatic tuning techniques,
and an experimental analysis of the sparse matrix operation, y= AT Ax, where A is a sparse …

Memory Hierarchy Optimizations and Performance Bounds for Sparse A T Ax

R Vuduc, A Gyulassy, JW Demmel… - … Science—ICCS 2003 …, 2003 - Springer
This paper presents uniprocessor performance optimizations, automatic tuning techniques,
and an experimental analysis of the sparse matrix operation, y= AT Ax, where A is a sparse …

Investigation and development of implicit numerical methods for building energy simulation

ME Crowley - 2005 - doras.dcu.ie
A variety of building energy analysis and simulation tools are increasingly used to determine
peak heating and cooling loads, size thermal plant, anticipate annual energy consumption …

[PDF][PDF] Performance Optimizations and Bounds for Sparse Matrix Kernels

R Vuduc, JW Demmel, KA Yelick, S Kamil, R Nishtala… - bebop.cs.berkeley.edu
Building high-performance implementations of sparse matrix-vector multiply (SpM× V), an
important and ubiquitous computational kernel, is fundamentally limited by a variety of …

[PDF][PDF] Performance Modeling and Analysis of Cache Blocking in Sparse Matrix Vector Multiply

R Vuduc, K Yelick - Citeseer
We consider the problem of building high-performance implementations of sparse matrix-
vector multiply (SpM× V), or y= y+ A· x, which is an important and ubiquitous computational …