A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

Filter pruning via geometric median for deep convolutional neural networks acceleration

Y He, P Liu, Z Wang, Z Hu… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Previous works utilized" smaller-norm-less-important" criterion to prune filters with smaller
norm values in a convolutional neural network. In this paper, we analyze this norm-based …

Survey: Exploiting data redundancy for optimization of deep learning

JA Chen, W Niu, B Ren, Y Wang, X Shen - ACM Computing Surveys, 2023 - dl.acm.org
Data redundancy is ubiquitous in the inputs and intermediate results of Deep Neural
Networks (DNN). It offers many significant opportunities for improving DNN performance and …

Learning filter pruning criteria for deep convolutional neural networks acceleration

Y He, Y Ding, P Liu, L Zhu… - Proceedings of the …, 2020 - openaccess.thecvf.com
Filter pruning has been widely applied to neural network compression and acceleration.
Existing methods usually utilize pre-defined pruning criteria, such as Lp-norm, to prune …

A good student is cooperative and reliable: CNN-transformer collaborative learning for semantic segmentation

J Zhu, Y Luo, X Zheng, H Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we strive to answer the question'how to collaboratively learn convolutional
neural network (CNN)-based and vision transformer (ViT)-based models by selecting and …

Protopshare: Prototypical parts sharing for similarity discovery in interpretable image classification

D Rymarczyk, Ł Struski, J Tabor… - Proceedings of the 27th …, 2021 - dl.acm.org
In this work, we introduce an extension to ProtoPNet called ProtoPShare which shares
prototypical parts between classes. To obtain prototype sharing we prune prototypical parts …

Deep ensemble learning for human activity recognition using wearable sensors via filter activation

W Huang, L Zhang, S Wang, H Wu… - ACM Transactions on …, 2022 - dl.acm.org
During the past decade, human activity recognition (HAR) using wearable sensors has
become a new research hot spot due to its extensive use in various application domains …

Leveraging filter correlations for deep model compression

P Singh, VK Verma, P Rai… - Proceedings of the …, 2020 - openaccess.thecvf.com
We present a filter correlation based model compression approach for deep convolutional
neural networks. Our approach iteratively identifies pairs of filters with the largest pairwise …

Manipulating identical filter redundancy for efficient pruning on deep and complicated cnn

T Hao, X Ding, J Han, Y Guo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
The existence of redundancy in convolutional neural networks (CNNs) enables us to remove
some filters/channels with acceptable performance drops. However, the training objective of …

Deep neural network pruning method based on sensitive layers and reinforcement learning

W Yang, H Yu, B Cui, R Sui, T Gu - Artificial Intelligence Review, 2023 - Springer
It is of great significance to compress neural network models so that they can be deployed
on resource-constrained embedded mobile devices. However, due to the lack of theoretical …