Pruning from scratch

D Ghimire, D Kil, S Kim - Electronics, 2022 - mdpi.com

Over the past decade, deep-learning-based representations have demonstrated remarkable
performance in academia and industry. The learning capability of convolutional neural …

被引用次数：157 相关文章所有 5 个版本

[PDF] arxiv.org

A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org

Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

被引用次数：40 相关文章所有 2 个版本

[PDF] thecvf.com

Tinyclip: Clip distillation via affinity mimicking and weight inheritance

K Wu, H Peng, Z Zhou, B Xiao, M Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com

In this paper, we propose a novel cross-modal distillation method, called TinyCLIP, for large-
scale language-image pre-trained models. The method introduces two core techniques …

被引用次数：28 相关文章所有 5 个版本

[PDF] thecvf.com

Combined depth space based architecture search for person re-identification

H Li, G Wu, WS Zheng - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com

Most works on person re-identification (ReID) take advantage of large backbone networks
such as ResNet, which are designed for image classification instead of ReID, for feature …

被引用次数：144 相关文章所有 6 个版本

[PDF] mdpi.com

TinyML: Enabling of inference deep learning models on ultra-low-power IoT edge devices for AI applications

NN Alajlan, DM Ibrahim - Micromachines, 2022 - mdpi.com

Recently, the Internet of Things (IoT) has gained a lot of attention, since IoT devices are
placed in various fields. Many of these devices are based on machine learning (ML) models …

被引用次数：98 相关文章所有 10 个版本

[PDF] thecvf.com

Chex: Channel exploration for cnn model compression

Z Hou, M Qin, F Sun, X Ma, K Yuan… - Proceedings of the …, 2022 - openaccess.thecvf.com

Channel pruning has been broadly recognized as an effective technique to reduce the
computation and memory cost of deep convolutional neural networks. However …

被引用次数：74 相关文章所有 6 个版本

[PDF] arxiv.org

Bringing AI to edge: From deep learning's perspective

D Liu, H Kong, X Luo, W Liu, R Subramaniam - Neurocomputing, 2022 - Elsevier

Edge computing and artificial intelligence (AI), especially deep learning algorithms, are
gradually intersecting to build the novel system, namely edge intelligence. However, the …

被引用次数：126 相关文章所有 6 个版本

[PDF] arxiv.org

Dsa: More efficient budgeted pruning via differentiable sparsity allocation

X Ning, T Zhao, W Li, P Lei, Y Wang, H Yang - European Conference on …, 2020 - Springer

Budgeted pruning is the problem of pruning under resource constraints. In budgeted
pruning, how to distribute the resources across layers (ie, sparsity allocation) is the key …

被引用次数：125 相关文章所有 11 个版本

[PDF] arxiv.org

Weight-sharing neural architecture search: A battle to shrink the optimization gap

L Xie, X Chen, K Bi, L Wei, Y Xu, L Wang… - ACM Computing …, 2021 - dl.acm.org

Neural architecture search (NAS) has attracted increasing attention. In recent years,
individual search methods have been replaced by weight-sharing search methods for higher …

被引用次数：94 相关文章所有 4 个版本

[PDF] thecvf.com

Gdp: Stabilized neural network pruning via gates with differentiable polarization

Y Guo, H Yuan, J Tan, Z Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com

Abstract Model compression techniques are recently gaining explosive attention for
obtaining efficient AI models for various real time applications. Channel pruning is one …

被引用次数：56 相关文章所有 5 个版本

A survey on efficient convolutional neural networks and hardware acceleration

A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

Tinyclip: Clip distillation via affinity mimicking and weight inheritance

Combined depth space based architecture search for person re-identification

TinyML: Enabling of inference deep learning models on ultra-low-power IoT edge devices for AI applications

Chex: Channel exploration for cnn model compression

Bringing AI to edge: From deep learning's perspective

Dsa: More efficient budgeted pruning via differentiable sparsity allocation

Weight-sharing neural architecture search: A battle to shrink the optimization gap

Gdp: Stabilized neural network pruning via gates with differentiable polarization

高级搜索

引用