VecQ: Minimal loss DNN model compression with vectorized weight quantization
Quantization has been proven to be an effective method for reducing the computing and/or
storage cost of DNNs. However, the trade-off between the quantization bitwidth and final …
storage cost of DNNs. However, the trade-off between the quantization bitwidth and final …
Algorithm/Accelerator co-design and co-search for edge AI
The world has seen the great success of deep neural networks (DNNs) in a massive number
of artificial intelligence (AI) applications. However, developing high-quality AI services to …
of artificial intelligence (AI) applications. However, developing high-quality AI services to …
HiKonv: High throughput quantized convolution with novel bit-wise management and computation
Quantization for Convolutional Neural Network (CNN) has shown significant progress with
the intention of reducing the cost of computation and storage with low-bitwidth data inputs …
the intention of reducing the cost of computation and storage with low-bitwidth data inputs …
Elastic significant bit quantization and acceleration for deep neural networks
C Gong, Y Lu, K Xie, Z Jin, T Li… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Quantization has been proven to be a vital method for improving the inference efficiency of
deep neural networks (DNNs). However, it is still challenging to strike a good balance …
deep neural networks (DNNs). However, it is still challenging to strike a good balance …
HiKonv: Maximizing the Throughput of Quantized Convolution With Novel Bit-wise Management and Computation
Quantization for CNN has shown significant progress with the intention of reducing the cost
of computation and storage with low-bitwidth data representations. There are, however, no …
of computation and storage with low-bitwidth data representations. There are, however, no …
Efficient AI hardware acceleration
X Zhang - 2022 - ideals.illinois.edu
The great success of artificial intelligence (AI) has been driven in part by the continuous
improvement of deep neural networks (DNNs) with deeper and more sophisticated model …
improvement of deep neural networks (DNNs) with deeper and more sophisticated model …
Energy-Efficient Fixed-Point Hardware Accelerator for Embedded DNNs
M Mastalizade, A Ansarmohammadi, N Nazari… - Journal of Information …, 2024 - jour.aicti.ir
Deep Neural Networks (DNNs) have demonstrated remarkable performance in various
application domains, such as computer vision, pattern recognition, and natural language …
application domains, such as computer vision, pattern recognition, and natural language …
Energy-Efficient Fixed-Point Hardware Accelerator for Embedded DNNs
M Mastalizade, A Ansarmohammadi, N Nazari… - Journal of Information …, 2024 - jour.aicti.ir
Deep Neural Networks (DNNs) have demonstrated remarkable performance in various
application domains, such as computer vision, pattern recognition, and natural language …
application domains, such as computer vision, pattern recognition, and natural language …
Resource-efficient FPGA acceleration for machine learning applications through HLS
X Liu - 2022 - ideals.illinois.edu
The rapidly growing machine learning development has demonstrated its great capability
and effectiveness in handling complicated real-world problems such as computer vision and …
and effectiveness in handling complicated real-world problems such as computer vision and …
An Adaptive Logarithm Quantization Method for DNN Compression
The size and complexity of Neural Network models grow rapidly in recent years, which
makes the inference of these models require more computational and memory resources. To …
makes the inference of these models require more computational and memory resources. To …