A survey on efficient convolutional neural networks and hardware acceleration

D Ghimire, D Kil, S Kim - Electronics, 2022 - mdpi.com
Over the past decade, deep-learning-based representations have demonstrated remarkable
performance in academia and industry. The learning capability of convolutional neural …

A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

Tinyclip: Clip distillation via affinity mimicking and weight inheritance

K Wu, H Peng, Z Zhou, B Xiao, M Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
In this paper, we propose a novel cross-modal distillation method, called TinyCLIP, for large-
scale language-image pre-trained models. The method introduces two core techniques …

Combined depth space based architecture search for person re-identification

H Li, G Wu, WS Zheng - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Most works on person re-identification (ReID) take advantage of large backbone networks
such as ResNet, which are designed for image classification instead of ReID, for feature …

TinyML: Enabling of inference deep learning models on ultra-low-power IoT edge devices for AI applications

NN Alajlan, DM Ibrahim - Micromachines, 2022 - mdpi.com
Recently, the Internet of Things (IoT) has gained a lot of attention, since IoT devices are
placed in various fields. Many of these devices are based on machine learning (ML) models …

Chex: Channel exploration for cnn model compression

Z Hou, M Qin, F Sun, X Ma, K Yuan… - Proceedings of the …, 2022 - openaccess.thecvf.com
Channel pruning has been broadly recognized as an effective technique to reduce the
computation and memory cost of deep convolutional neural networks. However …

Bringing AI to edge: From deep learning's perspective

D Liu, H Kong, X Luo, W Liu, R Subramaniam - Neurocomputing, 2022 - Elsevier
Edge computing and artificial intelligence (AI), especially deep learning algorithms, are
gradually intersecting to build the novel system, namely edge intelligence. However, the …

Dsa: More efficient budgeted pruning via differentiable sparsity allocation

X Ning, T Zhao, W Li, P Lei, Y Wang, H Yang - European Conference on …, 2020 - Springer
Budgeted pruning is the problem of pruning under resource constraints. In budgeted
pruning, how to distribute the resources across layers (ie, sparsity allocation) is the key …

Weight-sharing neural architecture search: A battle to shrink the optimization gap

L Xie, X Chen, K Bi, L Wei, Y Xu, L Wang… - ACM Computing …, 2021 - dl.acm.org
Neural architecture search (NAS) has attracted increasing attention. In recent years,
individual search methods have been replaced by weight-sharing search methods for higher …

Gdp: Stabilized neural network pruning via gates with differentiable polarization

Y Guo, H Yuan, J Tan, Z Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract Model compression techniques are recently gaining explosive attention for
obtaining efficient AI models for various real time applications. Channel pruning is one …