Featgraph: A flexible and efficient backend for graph neural network systems
Graph neural networks (GNNs) are gaining popularity as a promising approach to machine
learning on graphs. Unlike traditional graph workloads where each vertex/edge is …
learning on graphs. Unlike traditional graph workloads where each vertex/edge is …
Performance-aware model for sparse matrix-matrix multiplication on the sunway taihulight supercomputer
General sparse matrix-sparse matrix multiplication (SpGEMM) is one of the fundamental
linear operations in a wide variety of scientific applications. To implement efficient SpGEMM …
linear operations in a wide variety of scientific applications. To implement efficient SpGEMM …
Sage: Parallel semi-asymmetric graph algorithms for NVRAMs
Non-volatile main memory (NVRAM) technologies provide an attractive set of features for
large-scale graph analytics, including byte-addressability, low idle power, and improved …
large-scale graph analytics, including byte-addressability, low idle power, and improved …
SAGE: A Storage-Based Approach for Scalable and Efficient Sparse Generalized Matrix-Matrix Multiplication
Sparse generalized matrix-matrix multiplication (SpGEMM) is a fundamental operation for
real-world network analysis. With the increasing size of real-world networks, the single …
real-world network analysis. With the increasing size of real-world networks, the single …
Spectral inference for large stochastic blockmodels with nodal covariates
In many applications of network analysis, it is important to distinguish between observed and
unobserved factors affecting network structure. To this end, we develop spectral estimators …
unobserved factors affecting network structure. To this end, we develop spectral estimators …
Practical wavelet tree construction
We present new sequential and parallel algorithms for wavelet tree construction based on a
new bottom-up technique. This technique makes use of the structure of the wavelet trees …
new bottom-up technique. This technique makes use of the structure of the wavelet trees …
ZAKI+: A machine learning based process mapping tool for SpMV computations on distributed memory architectures
Smart cities and other cyber-physical systems (CPSs) rely on various scientific, engineering,
business, and social applications that provide timely intelligence for their design, operations …
business, and social applications that provide timely intelligence for their design, operations …
Arrow Matrix Decomposition: A Novel Approach for Communication-Efficient Sparse Matrix Multiplication
L Gianinazzi, AN Ziogas, L Huang… - Proceedings of the 29th …, 2024 - dl.acm.org
We propose a novel approach to iterated sparse matrix dense matrix multiplication, a
fundamental computational kernel in scientific computing and graph neural network training …
fundamental computational kernel in scientific computing and graph neural network training …
Slice-and-Forge: Making Better Use of Caches for Graph Convolutional Network Accelerators
Graph convolutional networks (GCNs) are becoming increasingly popular as they can
process a wide variety of data formats that prior deep neural networks cannot easily support …
process a wide variety of data formats that prior deep neural networks cannot easily support …
Optimizing partitioned CSR-based SpGEMM on the Sunway TaihuLight
General sparse matrix-sparse matrix (SpGEMM) multiplication is one of the basic kernels in
a great many applications. Several works focus on various optimizations for SpGEMM. To …
a great many applications. Several works focus on various optimizations for SpGEMM. To …