A survey on approximate edge AI for energy efficient autonomous driving services
Autonomous driving services depends on active sensing from modules such as camera,
LiDAR, radar, and communication units. Traditionally, these modules process the sensed …
LiDAR, radar, and communication units. Traditionally, these modules process the sensed …
Convolutional neural network pruning with structural redundancy reduction
Z Wang, C Li, X Wang - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Convolutional neural network (CNN) pruning has become one of the most successful
network compression approaches in recent years. Existing works on network pruning …
network compression approaches in recent years. Existing works on network pruning …
Resrep: Lossless cnn pruning via decoupling remembering and forgetting
We propose ResRep, a novel method for lossless channel pruning (aka filter pruning), which
slims down a CNN by reducing the width (number of output channels) of convolutional …
slims down a CNN by reducing the width (number of output channels) of convolutional …
Towards efficient tensor decomposition-based dnn model compression with optimization framework
Advanced tensor decomposition, such as Tensor train (TT) and Tensor ring (TR), has been
widely studied for deep neural network (DNN) model compression, especially for recurrent …
widely studied for deep neural network (DNN) model compression, especially for recurrent …
Compressing neural networks: Towards determining the optimal layer-wise decomposition
L Liebenwein, A Maalouf… - Advances in Neural …, 2021 - proceedings.neurips.cc
We present a novel global compression framework for deep neural networks that
automatically analyzes each layer to identify the optimal per-layer compression ratio, while …
automatically analyzes each layer to identify the optimal per-layer compression ratio, while …
Rgp: Neural network pruning through regular graph with edges swapping
Deep learning technology has found a promising application in lightweight model design, for
which pruning is an effective means of achieving a large reduction in both model parameters …
which pruning is an effective means of achieving a large reduction in both model parameters …
Model compression of deep neural network architectures for visual pattern recognition: Current status and future directions
S Bhalgaonkar, M Munot - Computers and Electrical Engineering, 2024 - Elsevier
Abstract Visual Pattern Recognition Networks (VPRNs) are widely used in various visual
data based applications such as computer vision and edge AI. VPRNs help to enhance a …
data based applications such as computer vision and edge AI. VPRNs help to enhance a …
Validating the lottery ticket hypothesis with inertial manifold theory
Despite achieving remarkable efficiency, traditional network pruning techniques often follow
manually-crafted heuristics to generate pruned sparse networks. Such heuristic pruning …
manually-crafted heuristics to generate pruned sparse networks. Such heuristic pruning …
Deep neural network compression by Tucker decomposition with nonlinear response
Y Liu, MK Ng - Knowledge-Based Systems, 2022 - Elsevier
Deep neural networks have shown impressive performance in many areas, including
computer vision and natural language processing. Millions of parameters in deep neural …
computer vision and natural language processing. Millions of parameters in deep neural …
Pela: Learning parameter-efficient models with low-rank approximation
Applying a pre-trained large model to downstream tasks is prohibitive under resource-
constrained conditions. Recent dominant approaches for addressing efficiency issues …
constrained conditions. Recent dominant approaches for addressing efficiency issues …