FlooNoC: A 645 Gbps/link 0.15 pJ/B/hop Open-Source NoC with Wide Physical Links and End-to-End AXI4 Parallel Multi-Stream Support

T Fischer, M Rogenmoser, T Benz… - arXiv preprint arXiv …, 2024 - arxiv.org
The new generation of domain-specific AI accelerators is characterized by rapidly increasing
demands for bulk data transfers, as opposed to small, latency-critical cache line transfers …

Spatz: Clustering Compact RISC-V-Based Vector Units to Maximize Computing Efficiency

M Cavalcante, M Perotti, S Riedel, L Benini - arXiv preprint arXiv …, 2023 - arxiv.org
The ever-increasing computational and storage requirements of modern applications and
the slowdown of technology scaling pose major challenges to designing and implementing …

OpenGeMM: A High-Utilization GeMM Accelerator Generator with Lightweight RISC-V Control and Tight Memory Coupling

X Yi, R Antonio, J Dumoulin, J Sun, J Van Delm… - arXiv preprint arXiv …, 2024 - arxiv.org
Deep neural networks (DNNs) face significant challenges when deployed on resource-
constrained extreme edge devices due to their computational and data-intensive nature …

[图书][B] Fighting Back the Von Neumann Bottleneck with Small-and Large-Scale Vector Microprocessors

M Cavalcante - 2023 - books.google.com
In his seminal Turing Award Lecture, Backus discussed the issues stemming from the word-
at-a-time style of programming inherited from the von Neumann computer. More than forty …