A survey on hardware accelerators: Taxonomy, trends, challenges, and perspectives

B Peccerillo, M Mannino, A Mondelli… - Journal of Systems …, 2022 - Elsevier
In recent years, the limits of the multicore approach emerged in the so-called “dark silicon”
issue and diminishing returns of an ever-increasing core count. Hardware manufacturers …

A survey of coarse-grained reconfigurable architecture and design: Taxonomy, challenges, and applications

L Liu, J Zhu, Z Li, Y Lu, Y Deng, J Han, S Yin… - ACM Computing …, 2019 - dl.acm.org
As general-purpose processors have hit the power wall and chip fabrication cost escalates
alarmingly, coarse-grained reconfigurable architectures (CGRAs) are attracting increasing …

An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems

Y Gan, Y Zhang, D Cheng, A Shetty, P Rathi… - Proceedings of the …, 2019 - dl.acm.org
Cloud services have recently started undergoing a major shift from monolithic applications,
to graphs of hundreds or thousands of loosely-coupled microservices. Microservices …

On the tool manipulation capability of open-source large language models

Q Xu, F Hong, B Li, C Hu, Z Chen, J Zhang - arXiv preprint arXiv …, 2023 - arxiv.org
Recent studies on software tool manipulation with large language models (LLMs) mostly rely
on closed model APIs. The industrial adoption of these models is substantially constrained …

ExTensor: An accelerator for sparse tensor algebra

K Hegde, H Asghari-Moghaddam, M Pellauer… - Proceedings of the …, 2019 - dl.acm.org
Generalized tensor algebra is a prime candidate for acceleration via customized ASICs.
Modern tensors feature a wide range of data sparsity, with the density of non-zero elements …

AutoSA: A polyhedral compiler for high-performance systolic arrays on FPGA

J Wang, L Guo, J Cong - The 2021 ACM/SIGDA International Symposium …, 2021 - dl.acm.org
While systolic array architectures have the potential to deliver tremendous performance, it is
notoriously challenging to customize an efficient systolic array processor for a target …

Intelligence beyond the edge: Inference on intermittent embedded systems

G Gobieski, B Lucia, N Beckmann - Proceedings of the Twenty-Fourth …, 2019 - dl.acm.org
Energy-harvesting technology provides a promising platform for future IoT applications.
However, since communication is very expensive in these devices, applications will require …

CoSA: Scheduling by constrained optimization for spatial accelerators

Q Huang, M Kang, G Dinh, T Norell… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Recent advances in Deep Neural Networks (DNNs) have led to active development of
specialized DNN accelerators, many of which feature a large number of processing …

Spatial: A language and compiler for application accelerators

D Koeplinger, M Feldman, R Prabhakar… - Proceedings of the 39th …, 2018 - dl.acm.org
Industry is increasingly turning to reconfigurable architectures like FPGAs and CGRAs for
improved performance and energy efficiency. Unfortunately, adoption of these architectures …

Sharing, Protection, and Compatibility for Reconfigurable Fabric with AmorphOS

A Khawaja, J Landgraf, R Prakash, M Wei… - … USENIX Symposium on …, 2018 - usenix.org
Cloud providers such as Amazon and Microsoft have begun to support on-demand FPGA
acceleration in the cloud, and hardware vendors will support FPGAs in future processors. At …