RAELLA: Reforming the arithmetic for efficient, low-resolution, and low-loss analog PIM: No retraining required!

T Andrulis, JS Emer, V Sze - … of the 50th Annual International Symposium …, 2023 - dl.acm.org
Processing-In-Memory (PIM) accelerators have the potential to efficiently run Deep Neural
Network (DNN) inference by reducing costly data movement and by using resistive RAM …
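
The snippet ends before the method, but as context for what a low-resolution analog PIM accelerator computes, here is a minimal numpy sketch of a ReRAM-crossbar matrix-vector product where each column current is read through a low-bit ADC. The bit-width, ADC range, and function names are illustrative assumptions, not RAELLA's scheme.

```python
import numpy as np

def adc(analog_value, full_scale, bits=4):
    """Quantize an analog column current to a low-resolution digital code."""
    levels = 2 ** bits - 1
    code = np.round(np.clip(analog_value / full_scale, 0.0, 1.0) * levels)
    return code / levels * full_scale

def crossbar_matvec(W, x, adc_bits=4):
    """Dot products computed 'in-array'; one low-res ADC read per column."""
    full_scale = W.sum(axis=0) * x.max()   # assumed per-column ADC range
    analog = x @ W                          # currents summed on the bitlines
    return adc(analog, full_scale, adc_bits)

rng = np.random.default_rng(0)
W = rng.uniform(0.0, 1.0, (64, 8))          # conductances are non-negative
x = rng.uniform(0.0, 1.0, 64)
print(np.abs(crossbar_matvec(W, x) - x @ W).max())  # ADC quantization error
```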

Modular design automation of the morphologies, controllers, and vision systems for intelligent robots: a survey

W Li, Z Wang, R Mai, P Ren, Q Zhang, Y Zhou, N Xu… - Visual Intelligence, 2023 - Springer
Design automation is a core technology in industrial design software and an
important branch of knowledge-worker automation. For example, electronic design …

PELA: Learning parameter-efficient models with low-rank approximation

Y Guo, G Wang, M Kankanhalli - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Applying a pre-trained large model to downstream tasks is prohibitive under resource-
constrained conditions. Recent dominant approaches for addressing efficiency issues …
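
A minimal sketch of the generic idea in the title: replace a dense weight matrix W with a rank-r factorization A @ B obtained from a truncated SVD. The rank and shapes are illustrative assumptions; this is not PELA's training recipe.

```python
import numpy as np

def low_rank_factorize(W, r):
    """Best rank-r approximation of W (in Frobenius norm) via SVD."""
    U, S, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :r] * S[:r]            # (out, r)
    B = Vt[:r, :]                   # (r, in)
    return A, B

rng = np.random.default_rng(0)
W = rng.normal(size=(768, 768))
A, B = low_rank_factorize(W, r=64)
# Parameter count drops from 768*768 to 2*768*64; error is the SVD tail.
print(W.size, A.size + B.size, np.linalg.norm(W - A @ B) / np.linalg.norm(W))
```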

FedACQ: adaptive clustering quantization of model parameters in federated learning

T Tian, H Shi, R Ma, Y Liu - International Journal of Web Information …, 2024 - emerald.com
For privacy protection, federated learning based on data separation allows
machine learning models to be trained on remote devices or in isolated data devices …
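
A hedged sketch of clustering-based parameter quantization in a federated setting: each client replaces its update with a small codebook plus per-weight cluster indices before upload. The 1-D k-means step and the fixed k are assumptions for illustration, not FedACQ's adaptive scheme.

```python
import numpy as np

def cluster_quantize(params, k=16, iters=20):
    """Compress a parameter tensor to a k-entry codebook + uint8 indices."""
    flat = params.ravel()
    centers = np.quantile(flat, np.linspace(0, 1, k))   # codebook init
    for _ in range(iters):                              # 1-D k-means
        idx = np.abs(flat[:, None] - centers[None, :]).argmin(axis=1)
        for j in range(k):
            if np.any(idx == j):
                centers[j] = flat[idx == j].mean()
    return centers, idx.astype(np.uint8).reshape(params.shape)

rng = np.random.default_rng(0)
update = rng.normal(scale=0.01, size=(256, 64))
centers, idx = cluster_quantize(update)
print(np.abs(centers[idx] - update).mean())   # reconstruction error
```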

Differentiable Neural Architecture, Mixed Precision and Accelerator Co-Search

KT Chitty-Venkata, Y Bian, M Emani… - IEEE …, 2023 - ieeexplore.ieee.org
Quantization, effective Neural Network architecture, and efficient accelerator hardware are
three important design paradigms to maximize accuracy and efficiency. Mixed Precision …
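
An illustrative sketch of the differentiable mixed-precision idea the snippet alludes to: keep one architecture parameter per candidate bit-width and mix the quantized outputs with a softmax, so the precision choice receives a gradient. The candidate widths and the uniform fake-quantizer are assumptions, not the paper's exact formulation.

```python
import torch

def fake_quant(x, bits):
    """Uniform symmetric quantizer (straight-through estimator in practice)."""
    scale = x.abs().max() / (2 ** (bits - 1) - 1)
    return torch.round(x / scale) * scale

bits_choices = [2, 4, 8]
alpha = torch.zeros(len(bits_choices), requires_grad=True)  # arch params

w = torch.randn(128, 128)
probs = torch.softmax(alpha, dim=0)
w_mixed = sum(p * fake_quant(w, b) for p, b in zip(probs, bits_choices))
loss = (w_mixed - w).pow(2).mean()   # stand-in for the task loss
loss.backward()
print(probs.detach(), alpha.grad)    # gradient flows to the precision choice
```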

Empirical Guidelines for Deploying LLMs onto Resource-constrained Edge Devices

R Qin, D Liu, Z Yan, Z Tan, Z Pan, Z Jia, M Jiang… - arXiv preprint arXiv …, 2024 - arxiv.org
The scaling laws have become the de facto guidelines for designing large language models
(LLMs), but they were studied under the assumption of unlimited computing resources for …
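
For reference, the commonly used parametric form of the scaling law the snippet refers to, as fit by Hoffmann et al. (2022), with N the parameter count and D the number of training tokens. The snippet's point is that the constants E, A, B, alpha, beta were fit assuming unconstrained compute, not edge deployment:

```latex
L(N, D) = E + \frac{A}{N^{\alpha}} + \frac{B}{D^{\beta}},
\qquad N = \text{parameters}, \quad D = \text{training tokens}
```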

IDKM: Memory Efficient Neural Network Quantization via Implicit, Differentiable k-Means

S Jaffe, AK Singh, F Bullo - arXiv preprint arXiv:2312.07759, 2023 - arxiv.org
Compressing large neural networks with minimal performance loss is crucial to enabling
their deployment on edge devices. Cho et al. (2022) proposed a weight quantization method …
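
A toy sketch of making the cluster assignment differentiable, in the spirit of the title: soft (softmax) assignments let gradients reach the codebook. IDKM's actual contribution is implicit differentiation through the k-means fixed point, which this simplified version does not implement.

```python
import torch

def soft_kmeans_quantize(w, centers, temperature=100.0):
    """Replace each weight by a softmax-weighted mix of cluster centers."""
    d = (w[:, None] - centers[None, :]).pow(2)      # squared distances
    a = torch.softmax(-temperature * d, dim=1)      # soft assignments
    return a @ centers                               # differentiable lookup

w = torch.randn(1000)
centers = torch.linspace(-2, 2, 8, requires_grad=True)
loss = (soft_kmeans_quantize(w, centers) - w).pow(2).mean()
loss.backward()
print(centers.grad)                                  # codebook gets gradients
```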

A Bag of Tricks for Scaling CPU-based Deep FFMs to more than 300m Predictions per Second

B Škrlj, B Ben-Shalom, G Gašperšič, A Schwartz… - arXiv preprint arXiv …, 2024 - arxiv.org
Field-aware Factorization Machines (FFMs) have emerged as a powerful model for click-
through rate prediction, particularly excelling in capturing complex feature interactions. In …
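
A small sketch of the field-aware factorization machine score the paper scales, following the standard FFM formulation (Juan et al., 2016): each feature i keeps one latent vector per field, and the pair (i, j) interacts through v[i, field(j)] . v[j, field(i)]. The shapes and data here are toy assumptions.

```python
import numpy as np

def ffm_score(x, fields, w0, w, V):
    """x: feature values, fields: field id per feature, V: (n, f, k)."""
    s = w0 + w @ x                     # bias + linear terms
    n = len(x)
    for i in range(n):
        for j in range(i + 1, n):      # field-aware pairwise interactions
            s += V[i, fields[j]] @ V[j, fields[i]] * x[i] * x[j]
    return s

rng = np.random.default_rng(0)
n, f, k = 6, 3, 4                      # features, fields, latent dim
x = rng.random(n)
fields = rng.integers(0, f, n)
w0, w, V = 0.0, rng.normal(size=n), rng.normal(size=(n, f, k))
print(ffm_score(x, fields, w0, w, V))
```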

DNA-TEQ: An Adaptive Exponential Quantization of Tensors for DNN Inference

B Khabbazan, M Riera… - 2023 IEEE 30th …, 2023 - ieeexplore.ieee.org
Quantization is commonly used in Deep Neural Networks (DNNs) to reduce the storage and
computational complexity by decreasing the arithmetical precision of activations and …
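
A hedged sketch of the generic idea named in the title: quantize tensor values to signed powers of a base, storing only the sign and a small integer exponent. The base and clipping range are illustrative; DNA-TEQ's adaptive per-tensor search for these parameters is not shown.

```python
import numpy as np

def exp_quantize(x, base=2.0, exp_bits=4, eps=1e-12):
    """Snap each value to the nearest signed power of `base`."""
    sign = np.sign(x)
    e = np.round(np.log(np.abs(x) + eps) / np.log(base))
    e = np.clip(e, -(2 ** (exp_bits - 1)), 2 ** (exp_bits - 1) - 1)
    return sign * base ** e

x = np.random.default_rng(0).normal(size=10_000)
xq = exp_quantize(x)
print(np.abs(x - xq).mean())           # error of the exponential grid
```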

A Survey on Image Classification of Lightweight Convolutional Neural Network

Y Liu, P Xiao, J Fang, D Zhang - 2023 19th International …, 2023 - ieeexplore.ieee.org
In recent years, deep neural networks have achieved tremendous success in image
classification in both academic and industrial settings. However, the high hardware …
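
One concrete example of the trade-off such surveys cover: the parameter count of a standard 3x3 convolution versus the depthwise-separable version popularized by MobileNet. The channel sizes below are arbitrary assumptions.

```python
def conv_params(k, c_in, c_out):
    return k * k * c_in * c_out                 # standard convolution

def dws_params(k, c_in, c_out):
    return k * k * c_in + c_in * c_out          # depthwise + 1x1 pointwise

k, c_in, c_out = 3, 128, 256
print(conv_params(k, c_in, c_out))              # 294912
print(dws_params(k, c_in, c_out))               # 33920 (~8.7x fewer)
```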