LoRAPrune: Pruning meets low-rank parameter-efficient fine-tuning
Large pre-trained models (LPMs), such as LLaMA and GLM, have shown exceptional
performance across various tasks through fine-tuning. Although low-rank adaptation (LoRA) …
Pruning's effect on generalization through the lens of training and regularization
Practitioners frequently observe that pruning improves model generalization. A long-
standing hypothesis based on the bias-variance trade-off attributes this generalization …
Fast as CHITA: Neural network pruning with combinatorial optimization
The sheer size of modern neural networks makes model serving a serious computational
challenge. A popular class of compression techniques overcomes this challenge by pruning …
SInGE: Sparsity via integrated gradients estimation of neuron relevance
The leap in performance in state-of-the-art computer vision methods is attributed to the
development of deep neural networks. However, it often comes at a computational price …
FALCON: FLOP-Aware Combinatorial Optimization for Neural Network Pruning
The increasing computational demands of modern neural networks present deployment
challenges on resource-constrained devices. Network pruning offers a solution to reduce …
Register Tiling for Unstructured Sparsity in Neural Network Inference
Unstructured sparse neural networks are an important class of machine learning (ML)
models, as they compact model size and reduce floating-point operations. The execution …
UFKT: Unimportant filters knowledge transfer for CNN pruning
As deep learning models have been widely used in recent years, there is a high demand
for reducing the model size in terms of memory and computation without much compromise …
MULTIFLOW: Shifting Towards Task-Agnostic Vision-Language Pruning
While excellent in transfer learning, Vision-Language models (VLMs) come with high
computational costs due to their large number of parameters. To address this issue …
UPSCALE: unconstrained channel pruning
As neural networks grow in size and complexity, inference speeds decline. To combat this,
one of the most effective compression techniques, channel pruning, removes channels from …
Jointly Training and Pruning CNNs via Learnable Agent Guidance and Alignment
Structural model pruning is a prominent approach used for reducing the computational cost
of Convolutional Neural Networks (CNNs) before their deployment on resource-constrained …