AUTO-PRUNE: Automated DNN pruning and mapping for ReRAM-based accelerator
Emerging ReRAM-based accelerators support in-memory computation to accelerate deep
neural network (DNN) inference. Weight matrix pruning of DNNs is a widely used technique …
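The weight-matrix pruning these ReRAM-oriented papers start from is, in its simplest form, magnitude-based: drop the smallest-magnitude entries. A minimal NumPy sketch of that baseline follows; it is a generic illustration, not AUTO-PRUNE's algorithm, which additionally co-designs how the pruned matrix is mapped onto crossbars.

```python
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude fraction of weights.

    Generic baseline only; AUTO-PRUNE's contribution is automating the
    pruning/mapping co-design for ReRAM crossbars, not this primitive.
    """
    k = int(weights.size * sparsity)
    if k == 0:
        return weights.copy()
    # Threshold at the k-th smallest absolute value; prune at or below it.
    threshold = np.partition(np.abs(weights).ravel(), k - 1)[k - 1]
    return weights * (np.abs(weights) > threshold)

# Example: prune 80% of a random 256x256 weight matrix.
w = np.random.randn(256, 256)
w_pruned = magnitude_prune(w, sparsity=0.8)
print(f"achieved sparsity: {1 - np.count_nonzero(w_pruned) / w_pruned.size:.2f}")
```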
Sense: Model-hardware codesign for accelerating sparse CNNs on systolic arrays
W Sun, D Liu, Z Zou, W Sun, S Chen… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Sparsity is an intrinsic property of convolutional neural networks (CNNs), worth exploiting for
CNN accelerators. However, the extra processing involved comes with hardware overhead …
APQ: Automated DNN Pruning and Quantization for ReRAM-Based Accelerators
S Yang, S He, H Duan, W Chen… - … on Parallel and …, 2023 - ieeexplore.ieee.org
Emerging ReRAM-based accelerators support in-memory computation to accelerate deep
neural network (DNN) inference. Weight matrix pruning is a widely used technique to reduce …
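APQ searches over pruning ratios and quantization bit-widths jointly. The sketch below shows only the quantization primitive in that search space, assuming symmetric per-tensor uniform quantization; it is not the paper's automated search itself.

```python
import numpy as np

def uniform_quantize(weights: np.ndarray, n_bits: int) -> np.ndarray:
    """Symmetric per-tensor uniform quantization, simulated in float.

    One primitive from the kind of search space APQ explores; the paper's
    contribution is choosing per-layer sparsity and bit-width automatically.
    """
    qmax = 2 ** (n_bits - 1) - 1
    scale = np.abs(weights).max() / qmax
    q = np.clip(np.round(weights / scale), -qmax, qmax)
    return q * scale  # dequantize so the error can be measured in float

w = np.random.randn(128, 128)
w4 = uniform_quantize(w, n_bits=4)
print(f"max abs quantization error: {np.abs(w - w4).max():.4f}")
```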
Kernel Shape Control for Row-Efficient Convolution on Processing-In-Memory Arrays
Processing-in-memory (PIM) architectures have been highlighted as one of the viable
solutions for faster and more power-efficient convolutional neural networks (CNNs) …
MIME: adapting a single neural network for multi-task inference with memory-efficient dynamic pruning
Recent years have seen a paradigm shift towards multi-task learning. This calls for memory-
and energy-efficient solutions for inference in a multi-task scenario. We propose an …
An efficient CNN accelerator for pattern-compressed sparse neural networks on FPGA
Y Zhang, H Wang, Z Pan - Neurocomputing, 2025 - Elsevier
Currently, the sparsity of weights and activations is mainly utilized to improve the energy
efficiency and computational performance of CNN accelerators. However, the irregular …
KERNTROL: Kernel Shape Control Toward Ultimate Memory Utilization for In-Memory Convolutional Weight Mapping
Processing-in-memory (PIM) architectures have been highlighted as one of the most viable
options for faster and more power-efficient computation. Paired with a convolutional weight …
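The "kernel shape control" idea in this and the earlier PIM entry constrains which kernel positions survive pruning so that every filter occupies the same crossbar rows when unrolled. The toy sketch below enforces one shared spatial pattern per layer; it is an illustration of the general idea under that assumption, not KERNTROL's mask-selection algorithm.

```python
import numpy as np

def shared_shape_mask(kernels: np.ndarray, keep: int) -> np.ndarray:
    """Force all kernels in a layer to share one sparsity pattern.

    Toy illustration of shape-controlled pruning (not KERNTROL itself):
    keep the `keep` spatial positions with the largest total magnitude
    across all filters, so every filter occupies the same rows when
    unrolled onto a crossbar column.
    """
    # kernels: (out_ch, in_ch, kh, kw); score each spatial position globally.
    scores = np.abs(kernels).sum(axis=(0, 1)).ravel()   # shape (kh*kw,)
    mask = np.zeros(scores.size, dtype=bool)
    mask[np.argsort(scores)[-keep:]] = True             # top positions
    mask = mask.reshape(kernels.shape[2:])              # back to (kh, kw)
    return kernels * mask                               # broadcast over filters

k = np.random.randn(64, 32, 3, 3)
k_shaped = shared_shape_mask(k, keep=6)  # all filters share a 6-entry shape
```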
HFMRE: Constructing Huffman Tree in Bags to Find Excellent Instances for Distantly Supervised Relation Extraction
M Li, C Shao, G Li, M Zhou - Findings of the Association for …, 2023 - aclanthology.org
Since the introduction of distantly supervised relation extraction methods, numerous
approaches have been developed, the most representative of which is multi-instance …
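HFMRE's distinctive step is building a Huffman tree over the instances in each bag. The sketch below shows only the standard Huffman construction it adapts; the toy symbol frequencies are a hypothetical stand-in for the paper's instance scores.

```python
import heapq
from itertools import count

def huffman_tree(freqs: dict):
    """Standard Huffman tree construction (the primitive HFMRE adapts to
    sentence bags; instance scores would replace these toy frequencies)."""
    tie = count()  # tie-breaker so tuples never compare non-numeric nodes
    heap = [(f, next(tie), sym) for sym, f in freqs.items()]
    heapq.heapify(heap)
    while len(heap) > 1:
        f1, _, left = heapq.heappop(heap)    # two lowest-frequency nodes
        f2, _, right = heapq.heappop(heap)
        heapq.heappush(heap, (f1 + f2, next(tie), (left, right)))
    return heap[0][2]  # nested tuples; leaves are the original symbols

tree = huffman_tree({"s1": 5, "s2": 9, "s3": 12, "s4": 13})
print(tree)
```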
High area/energy efficiency RRAM CNN accelerator with pattern-pruning-based weight mapping scheme
Resistive random access memory (RRAM) is an emerging device for processing-in-memory
(PIM) architectures to accelerate convolutional neural networks (CNNs). However, due to the …
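Pattern pruning, as named in this entry, restricts every convolution kernel to one of a small library of sparsity patterns, which keeps the pruned layer regular enough to map densely onto RRAM columns. A minimal sketch, assuming 3x3 kernels and a caller-supplied pattern library; the paper's actual pattern set and crossbar mapping scheme are not reproduced here.

```python
import numpy as np

def pattern_prune(kernels: np.ndarray, patterns: np.ndarray) -> np.ndarray:
    """Assign each 3x3 kernel the candidate pattern preserving the most
    magnitude (a generic pattern-pruning sketch; the paper's contribution
    is mapping such patterns onto RRAM crossbars, not shown here).

    kernels:  (n, 3, 3) float array
    patterns: (p, 3, 3) boolean masks
    """
    # Score every (kernel, pattern) pair by retained absolute weight.
    scores = np.einsum('nij,pij->np', np.abs(kernels), patterns.astype(float))
    best = scores.argmax(axis=1)        # best-fitting pattern per kernel
    return kernels * patterns[best]     # apply each kernel's chosen mask

kernels = np.random.randn(16, 3, 3)
patterns = np.zeros((2, 3, 3), dtype=bool)
patterns[0, 1, :] = True                # pattern 0: keep middle row
patterns[1, :, 1] = True                # pattern 1: keep middle column
pruned = pattern_prune(kernels, patterns)
```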
DSCU: Accelerating CNN inference in FPGAs with dual sizes of compute unit
Z Bao, J Guo, W Zhang, H Dang - Journal of Low Power Electronics and …, 2022 - mdpi.com
FPGA-based accelerators have shown great potential in improving the performance of CNN
inference. However, the existing FPGA-based approaches suffer from a low compute unit …