A comprehensive review of binary neural network
Deep learning (DL) has recently changed the development of intelligent systems and is
widely adopted in many real-life applications. Despite their various benefits and potentials …
widely adopted in many real-life applications. Despite their various benefits and potentials …
A survey of quantization methods for efficient neural network inference
This chapter provides approaches to the problem of quantizing the numerical values in deep
Neural Network computations, covering the advantages/disadvantages of current methods …
Neural Network computations, covering the advantages/disadvantages of current methods …
A comprehensive survey on model quantization for deep neural networks in image classification
Recent advancements in machine learning achieved by Deep Neural Networks (DNNs)
have been significant. While demonstrating high accuracy, DNNs are associated with a …
have been significant. While demonstrating high accuracy, DNNs are associated with a …
Adabin: Improving binary neural networks with adaptive binary sets
This paper studies the Binary Neural Networks (BNNs) in which weights and activations are
both binarized into 1-bit values, thus greatly reducing the memory usage and computational …
both binarized into 1-bit values, thus greatly reducing the memory usage and computational …
Intraq: Learning synthetic images with intra-class heterogeneity for zero-shot network quantization
Learning to synthesize data has emerged as a promising direction in zero-shot quantization
(ZSQ), which represents neural networks by low-bit integer without accessing any of the real …
(ZSQ), which represents neural networks by low-bit integer without accessing any of the real …
Bibench: Benchmarking and analyzing network binarization
Network binarization emerges as one of the most promising compression approaches
offering extraordinary computation and memory savings by minimizing the bit-width …
offering extraordinary computation and memory savings by minimizing the bit-width …
Recu: Reviving the dead weights in binary neural networks
Binary neural networks (BNNs) have received increasing attention due to their superior
reductions of computation and memory. Most existing works focus on either lessening the …
reductions of computation and memory. Most existing works focus on either lessening the …
Pruning Networks With Cross-Layer Ranking & k-Reciprocal Nearest Filters
This article focuses on filter-level network pruning. A novel pruning method, termed CLR-
RNF, is proposed. We first reveal a “long-tail” pruning problem in magnitude-based weight …
RNF, is proposed. We first reveal a “long-tail” pruning problem in magnitude-based weight …
Data-free knowledge distillation for image super-resolution
Convolutional network compression methods require training data for achieving acceptable
results, but training data is routinely unavailable due to some privacy and transmission …
results, but training data is routinely unavailable due to some privacy and transmission …
Estimator meets equilibrium perspective: A rectified straight through estimator for binary neural networks training
Binarization of neural networks is a dominant paradigm in neural networks compression.
The pioneering work BinaryConnect uses Straight Through Estimator (STE) to mimic the …
The pioneering work BinaryConnect uses Straight Through Estimator (STE) to mimic the …