VecQ: Minimal loss DNN model compression with vectorized weight quantization

C Gong, Y Chen, Y Lu, T Li, C Hao… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Quantization has been proven to be an effective method for reducing the computing and/or
storage cost of DNNs. However, the trade-off between the quantization bitwidth and final …

Algorithm/Accelerator co-design and co-search for edge AI

X Zhang, Y Li, J Pan, D Chen - IEEE Transactions on Circuits …, 2022 - ieeexplore.ieee.org
The world has seen the great success of deep neural networks (DNNs) in a massive number
of artificial intelligence (AI) applications. However, developing high-quality AI services to …

HiKonv: High throughput quantized convolution with novel bit-wise management and computation

X Liu, Y Chen, P Ganesh, J Pan… - 2022 27th Asia and …, 2022 - ieeexplore.ieee.org
Quantization for Convolutional Neural Network (CNN) has shown significant progress with
the intention of reducing the cost of computation and storage with low-bitwidth data inputs …

Elastic significant bit quantization and acceleration for deep neural networks

C Gong, Y Lu, K Xie, Z Jin, T Li… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Quantization has been proven to be a vital method for improving the inference efficiency of
deep neural networks (DNNs). However, it is still challenging to strike a good balance …

HiKonv: Maximizing the Throughput of Quantized Convolution With Novel Bit-wise Management and Computation

Y Chen, J Pan, X Liu, J Xiong, D Chen - arXiv preprint arXiv:2208.00763, 2022 - arxiv.org
Quantization for CNN has shown significant progress with the intention of reducing the cost
of computation and storage with low-bitwidth data representations. There are, however, no …

Efficient AI hardware acceleration

X Zhang - 2022 - ideals.illinois.edu
The great success of artificial intelligence (AI) has been driven in part by the continuous
improvement of deep neural networks (DNNs) with deeper and more sophisticated model …

Energy-Efficient Fixed-Point Hardware Accelerator for Embedded DNNs

M Mastalizade, A Ansarmohammadi, N Nazari… - Journal of Information …, 2024 - jour.aicti.ir
Deep Neural Networks (DNNs) have demonstrated remarkable performance in various
application domains, such as computer vision, pattern recognition, and natural language …

Energy-Efficient Fixed-Point Hardware Accelerator for Embedded DNNs

M Mastalizade, A Ansarmohammadi, N Nazari… - Journal of Information …, 2024 - jour.aicti.ir
Deep Neural Networks (DNNs) have demonstrated remarkable performance in various
application domains, such as computer vision, pattern recognition, and natural language …

Resource-efficient FPGA acceleration for machine learning applications through HLS

X Liu - 2022 - ideals.illinois.edu
The rapidly growing machine learning development has demonstrated its great capability
and effectiveness in handling complicated real-world problems such as computer vision and …

An Adaptive Logarithm Quantization Method for DNN Compression

Y Wang, Z He, C Tang, Z Wang, W Zhu - … 8–12, 2021, Proceedings, Part V …, 2021 - Springer
The size and complexity of Neural Network models grow rapidly in recent years, which
makes the inference of these models require more computational and memory resources. To …