[BOOK][B] Automatic performance tuning of sparse matrix kernels
RW Vuduc - 2003 - search.proquest.com
This dissertation presents an automated system to generate highly efficient, platform-
adapted implementations of sparse matrix kernels. We show that conventional …
Performance optimizations and bounds for sparse matrix-vector multiply
We consider performance tuning, by code and data structure reorganization, of sparse
matrix-vector multiply (SpM×V), one of the most important computational kernels in scientific …
When cache blocking of sparse matrix vector multiply works and why
We present new performance models and more compact data structures for cache blocking
when applied to sparse matrix-vector multiply (SpM×V). We extend our prior models by …
[PDF][PDF] Automatic performance tuning and analysis of sparse triangular solve
Richard Vuduc, Shoaib Kamil …
[PDF][PDF] Performance modeling and analysis of cache blocking in sparse matrix vector multiply
R Nishtala, RW Vuduc… - … of California, Tech …, 2004 - digitalassets.lib.berkeley.edu
We consider the problem of building high-performance implementations of sparse matrix-vector
multiply (SpM×V), or y = y + A·x, which is an important and ubiquitous computational …
Memory hierarchy optimizations and performance bounds for sparse A^T A x
This paper presents uniprocessor performance optimizations, automatic tuning techniques,
and an experimental analysis of the sparse matrix operation, y = A^T A x, where A is a sparse …
Memory Hierarchy Optimizations and Performance Bounds for Sparse A^T A x
This paper presents uniprocessor performance optimizations, automatic tuning techniques,
and an experimental analysis of the sparse matrix operation, y = A^T A x, where A is a sparse …
Investigation and development of implicit numerical methods for building energy simulation
ME Crowley - 2005 - doras.dcu.ie
A variety of building energy analysis and simulation tools are increasingly used to determine
peak heating and cooling loads, size thermal plant, anticipate annual energy consumption …
[PDF][PDF] Performance Optimizations and Bounds for Sparse Matrix Kernels
Building high-performance implementations of sparse matrix-vector multiply (SpM×V), an
important and ubiquitous computational kernel, is fundamentally limited by a variety of …
[PDF][PDF] Performance Modeling and Analysis of Cache Blocking in Sparse Matrix Vector Multiply
We consider the problem of building high-performance implementations of sparse matrix-vector
multiply (SpM×V), or y = y + A·x, which is an important and ubiquitous computational …