CHARM: C omposing H eterogeneous A ccele R ators for M atrix Multiply on Versal ACAP Architecture

J Zhuang, J Lau, H Ye, Z Yang, Y Du, J Lo… - Proceedings of the …, 2023 - dl.acm.org
Dense matrix multiply (MM) serves as one of the most heavily used kernels in deep learning
applications. To cope with the high computation demands of these applications …

Exploring the Versal AI engines for accelerating stencil-based atmospheric advection simulation

N Brown - Proceedings of the 2023 ACM/SIGDA International …, 2023 - dl.acm.org
AMD Xilinx's new Versal Adaptive Compute Acceleration Platform (ACAP) is an FPGA
architecture combining reconfigurable fabric with other on-chip hardened compute …

Codg-reram: An algorithm-hardware co-design to accelerate semi-structured gnns on reram

Y Luo, P Behnam, K Thorat, Z Liu… - 2022 IEEE 40th …, 2022 - ieeexplore.ieee.org
Graph Neural Networks (GCNs) have attracted wide attention and are applied to the real
world. However, due to the ever-growing graph data with significant irregularities, off-chip …

AIM: Accelerating Arbitrary-precision Integer Multiplication on Heterogeneous Reconfigurable Computing Platform Versal ACAP

Z Yang, J Zhuang, J Yin, C Yu… - 2023 IEEE/ACM …, 2023 - ieeexplore.ieee.org
Arbitrary-precision integer multiplication is the core kernel of many applications including
scientific computing, cryptographic algorithms, etc. Existing acceleration of arbitrary …

MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine

E Taka, A Arora, KC Wu… - … Conference on Field …, 2023 - ieeexplore.ieee.org
The increasing computational and memory requirements of Deep Learning (DL) workloads
has led to outstanding innovations in hardware architectures. An archetype of such …

HyScale-GNN: A scalable hybrid GNN training system on single-node heterogeneous architecture

YC Lin, V Prasanna - 2023 IEEE International Parallel and …, 2023 - ieeexplore.ieee.org
Graph Neural Networks (GNNs) have shown success in many real-world applications that
involve graph-structured data. Most of the existing single-node GNN training systems are …

Exploiting on-chip heterogeneity of versal architecture for GNN inference acceleration

P Chen, P Manjunath, S Wijeratne… - … Conference on Field …, 2023 - ieeexplore.ieee.org
Graph Neural Networks (GNNs) have revolutionized many Machine Learning (ML)
applications, such as social network analysis, bioinformatics, etc. GNN inference can be …

A Survey of Graph Pre-processing Methods: From Algorithmic to Hardware Perspectives

Z Lv, M Yan, X Liu, M Dong, X Ye, D Fan… - arXiv preprint arXiv …, 2023 - arxiv.org
Graph-related applications have experienced significant growth in academia and industry,
driven by the powerful representation capabilities of graph. However, efficiently executing …

Accelerating graph neural networks in Pytorch with HLS and deep dataflows

J Nunez-Yanez - International Symposium on Applied Reconfigurable …, 2023 - Springer
Graph neural networks (GNNs) combine sparse and dense data compute requirements that
are challenging to meet in resource-constrained embedded hardware. In this paper, we …

Graph-opu: A highly integrated fpga-based overlay processor for graph neural networks

R Chen, H Zhang, S Li, E Tang, J Yu… - 2023 33rd International …, 2023 - ieeexplore.ieee.org
Field-programmable gate array (FPGA) is an ideal candidate for accelerating graph neural
networks (GNNs). However, FPGA reconfiguration is a time-consuming process when …