[HTML][HTML] A survey on hardware accelerators: Taxonomy, trends, challenges, and perspectives
In recent years, the limits of the multicore approach emerged in the so-called “dark silicon”
issue and diminishing returns of an ever-increasing core count. Hardware manufacturers …
issue and diminishing returns of an ever-increasing core count. Hardware manufacturers …
A survey of coarse-grained reconfigurable architecture and design: Taxonomy, challenges, and applications
As general-purpose processors have hit the power wall and chip fabrication cost escalates
alarmingly, coarse-grained reconfigurable architectures (CGRAs) are attracting increasing …
alarmingly, coarse-grained reconfigurable architectures (CGRAs) are attracting increasing …
An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems
Cloud services have recently started undergoing a major shift from monolithic applications,
to graphs of hundreds or thousands of loosely-coupled microservices. Microservices …
to graphs of hundreds or thousands of loosely-coupled microservices. Microservices …
On the tool manipulation capability of open-source large language models
Recent studies on software tool manipulation with large language models (LLMs) mostly rely
on closed model APIs. The industrial adoption of these models is substantially constrained …
on closed model APIs. The industrial adoption of these models is substantially constrained …
Extensor: An accelerator for sparse tensor algebra
K Hegde, H Asghari-Moghaddam, M Pellauer… - Proceedings of the …, 2019 - dl.acm.org
Generalized tensor algebra is a prime candidate for acceleration via customized ASICs.
Modern tensors feature a wide range of data sparsity, with the density of non-zero elements …
Modern tensors feature a wide range of data sparsity, with the density of non-zero elements …
AutoSA: A polyhedral compiler for high-performance systolic arrays on FPGA
While systolic array architectures have the potential to deliver tremendous performance, it is
notoriously challenging to customize an efficient systolic array processor for a target …
notoriously challenging to customize an efficient systolic array processor for a target …
Intelligence beyond the edge: Inference on intermittent embedded systems
Energy-harvesting technology provides a promising platform for future IoT applications.
However, since communication is very expensive in these devices, applications will require …
However, since communication is very expensive in these devices, applications will require …
Cosa: Scheduling by constrained optimization for spatial accelerators
Recent advances in Deep Neural Networks (DNNs) have led to active development of
specialized DNN accelerators, many of which feature a large number of processing …
specialized DNN accelerators, many of which feature a large number of processing …
Spatial: A language and compiler for application accelerators
D Koeplinger, M Feldman, R Prabhakar… - Proceedings of the 39th …, 2018 - dl.acm.org
Industry is increasingly turning to reconfigurable architectures like FPGAs and CGRAs for
improved performance and energy efficiency. Unfortunately, adoption of these architectures …
improved performance and energy efficiency. Unfortunately, adoption of these architectures …
Sharing, Protection, and Compatibility for Reconfigurable Fabric with {AmorphOS}
Cloud providers such as Amazon and Microsoft have begun to support on-demand FPGA
acceleration in the cloud, and hardware vendors will support FPGAs in future processors. At …
acceleration in the cloud, and hardware vendors will support FPGAs in future processors. At …