CHARM: C omposing H eterogeneous A ccele R ators for M atrix Multiply on Versal ACAP Architecture
Dense matrix multiply (MM) serves as one of the most heavily used kernels in deep learning
applications. To cope with the high computation demands of these applications …
applications. To cope with the high computation demands of these applications …
Exploring the Versal AI engines for accelerating stencil-based atmospheric advection simulation
N Brown - Proceedings of the 2023 ACM/SIGDA International …, 2023 - dl.acm.org
AMD Xilinx's new Versal Adaptive Compute Acceleration Platform (ACAP) is an FPGA
architecture combining reconfigurable fabric with other on-chip hardened compute …
architecture combining reconfigurable fabric with other on-chip hardened compute …
Codg-reram: An algorithm-hardware co-design to accelerate semi-structured gnns on reram
Graph Neural Networks (GCNs) have attracted wide attention and are applied to the real
world. However, due to the ever-growing graph data with significant irregularities, off-chip …
world. However, due to the ever-growing graph data with significant irregularities, off-chip …
AIM: Accelerating Arbitrary-precision Integer Multiplication on Heterogeneous Reconfigurable Computing Platform Versal ACAP
Arbitrary-precision integer multiplication is the core kernel of many applications including
scientific computing, cryptographic algorithms, etc. Existing acceleration of arbitrary …
scientific computing, cryptographic algorithms, etc. Existing acceleration of arbitrary …
MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine
The increasing computational and memory requirements of Deep Learning (DL) workloads
has led to outstanding innovations in hardware architectures. An archetype of such …
has led to outstanding innovations in hardware architectures. An archetype of such …
HyScale-GNN: A scalable hybrid GNN training system on single-node heterogeneous architecture
YC Lin, V Prasanna - 2023 IEEE International Parallel and …, 2023 - ieeexplore.ieee.org
Graph Neural Networks (GNNs) have shown success in many real-world applications that
involve graph-structured data. Most of the existing single-node GNN training systems are …
involve graph-structured data. Most of the existing single-node GNN training systems are …
Exploiting on-chip heterogeneity of versal architecture for GNN inference acceleration
P Chen, P Manjunath, S Wijeratne… - … Conference on Field …, 2023 - ieeexplore.ieee.org
Graph Neural Networks (GNNs) have revolutionized many Machine Learning (ML)
applications, such as social network analysis, bioinformatics, etc. GNN inference can be …
applications, such as social network analysis, bioinformatics, etc. GNN inference can be …
A Survey of Graph Pre-processing Methods: From Algorithmic to Hardware Perspectives
Graph-related applications have experienced significant growth in academia and industry,
driven by the powerful representation capabilities of graph. However, efficiently executing …
driven by the powerful representation capabilities of graph. However, efficiently executing …
Accelerating graph neural networks in Pytorch with HLS and deep dataflows
J Nunez-Yanez - International Symposium on Applied Reconfigurable …, 2023 - Springer
Graph neural networks (GNNs) combine sparse and dense data compute requirements that
are challenging to meet in resource-constrained embedded hardware. In this paper, we …
are challenging to meet in resource-constrained embedded hardware. In this paper, we …
Graph-opu: A highly integrated fpga-based overlay processor for graph neural networks
Field-programmable gate array (FPGA) is an ideal candidate for accelerating graph neural
networks (GNNs). However, FPGA reconfiguration is a time-consuming process when …
networks (GNNs). However, FPGA reconfiguration is a time-consuming process when …