{NeuGraph}: Parallel deep neural network computation on large graphs

L Ma, Z Yang, Y Miao, J Xue, M Wu, L Zhou… - 2019 USENIX Annual …, 2019 - usenix.org
Recent deep learning models have moved beyond low dimensional regular grids such as
image, video, and speech, to high-dimensional graph-structured data, such as social …

Powerlyra: Differentiated graph computation and partitioning on skewed graphs

R Chen, J Shi, Y Chen, B Zang, H Guan… - ACM Transactions on …, 2019 - dl.acm.org
Natural graphs with skewed distributions raise unique challenges to distributed graph
computation and partitioning. Existing graph-parallel systems usually use a “one-size-fits-all” …

Gluon: A communication-optimizing substrate for distributed heterogeneous graph analytics

R Dathathri, G Gill, L Hoang, HV Dang… - Proceedings of the 39th …, 2018 - dl.acm.org
This paper introduces a new approach to building distributed-memory graph analytics
systems that exploits heterogeneity in processor types (CPU and GPU), partitioning policies …

Graphh: A processing-in-memory architecture for large-scale graph processing

G Dai, T Huang, Y Chi, J Zhao, G Sun… - … on Computer-Aided …, 2018 - ieeexplore.ieee.org
Large-scale graph processing requires the high bandwidth of data access. However, as
graph computing continues to scale, it becomes increasingly challenging to achieve a high …

Alleviating irregularity in graph analytics acceleration: A hardware/software co-design approach

M Yan, X Hu, S Li, A Basak, H Li, X Ma… - Proceedings of the …, 2019 - dl.acm.org
Graph analytics is an emerging application which extracts insights by processing large
volumes of highly connected data, namely graphs. The parallel processing of graphs has …

Subway: Minimizing data transfer during out-of-GPU-memory graph processing

AHN Sabet, Z Zhao, R Gupta - … of the Fifteenth European Conference on …, 2020 - dl.acm.org
In many graph-based applications, the graphs tend to grow, imposing a great challenge for
GPU-based graph processing. When the graph size exceeds the device memory capacity …

Hyperscale FPGA-as-a-service architecture for large-scale distributed graph neural network

S Li, D Niu, Y Wang, W Han, Z Zhang, T Guan… - Proceedings of the 49th …, 2022 - dl.acm.org
Graph neural network (GNN) is a promising emerging application for link prediction,
recommendation, etc. Existing hardware innovation is limited to single-machine GNN (SM …

EMOGI: Efficient memory-access for out-of-memory graph-traversal in GPUs

SW Min, VS Mailthody, Z Qureshi, J Xiong… - arXiv preprint arXiv …, 2020 - arxiv.org
Modern analytics and recommendation systems are increasingly based on graph data that
capture the relations between entities being analyzed. Practical graphs come in huge sizes …

PCGCN: Partition-centric processing for accelerating graph convolutional network

C Tian, L Ma, Z Yang, Y Dai - 2020 IEEE International Parallel …, 2020 - ieeexplore.ieee.org
Inspired by the successes of convolutional neural networks (CNN) in computer vision, the
convolutional operation has been moved beyond low-dimension grids (eg, images) to high …

Grus: Toward unified-memory-efficient high-performance graph processing on gpu

P Wang, J Wang, C Li, J Wang, H Zhu… - ACM Transactions on …, 2021 - dl.acm.org
Today's GPU graph processing frameworks face scalability and efficiency issues as the
graph size exceeds GPU-dedicated memory limit. Although recent GPUs can over-subscribe …