Optimization techniques for GPU programming
In the past decade, Graphics Processing Units have played an important role in the field of
high-performance computing and they still advance new fields such as IoT, autonomous …
high-performance computing and they still advance new fields such as IoT, autonomous …
Graph processing on GPUs: A survey
In the big data era, much real-world data can be naturally represented as graphs.
Consequently, many application domains can be modeled as graph processing. Graph …
Consequently, many application domains can be modeled as graph processing. Graph …
A scalable processing-in-memory accelerator for parallel graph processing
The explosion of digital data and the ever-growing need for fast data analysis have made in-
memory big-data processing in computer systems increasingly important. In particular, large …
memory big-data processing in computer systems increasingly important. In particular, large …
GCNAX: A flexible and energy-efficient accelerator for graph convolutional neural networks
Graph convolutional neural networks (GCNs) have emerged as an effective approach to
extend deep learning for graph data analytics. Given that graphs are usually irregular, as …
extend deep learning for graph data analytics. Given that graphs are usually irregular, as …
Scalable GPU graph traversal
D Merrill, M Garland, A Grimshaw - ACM Sigplan Notices, 2012 - dl.acm.org
Breadth-first search (BFS) is a core primitive for graph traversal and a basis for many higher-
level graph analysis algorithms. It is also representative of a class of parallel computations …
level graph analysis algorithms. It is also representative of a class of parallel computations …
A quantitative study of irregular programs on GPUs
GPUs have been used to accelerate many regular applications and, more recently, irregular
applications in which the control flow and memory access patterns are data-dependent and …
applications in which the control flow and memory access patterns are data-dependent and …
CuSha: vertex-centric graph processing on GPUs
Vertex-centric graph processing is employed by many popular algorithms (eg, PageRank)
due to its simplicity and efficient use of asynchronous parallelism. The high compute power …
due to its simplicity and efficient use of asynchronous parallelism. The high compute power …
Green-Marl: a DSL for easy and efficient graph analysis
The increasing importance of graph-data based applications is fueling the need for highly
efficient and parallel implementations of graph analysis software. In this paper we describe …
efficient and parallel implementations of graph analysis software. In this paper we describe …
Energy efficient architecture for graph analytics accelerators
Specialized hardware accelerators can significantly improve the performance and power
efficiency of compute systems. In this paper, we focus on hardware accelerators for graph …
efficiency of compute systems. In this paper, we focus on hardware accelerators for graph …
Medusa: Simplified graph processing on GPUs
Graphs are common data structures for many applications, and efficient graph processing is
a must for application performance. Recently, the graphics processing unit (GPU) has been …
a must for application performance. Recently, the graphics processing unit (GPU) has been …