A survey on efficient convolutional neural networks and hardware acceleration
Over the past decade, deep-learning-based representations have demonstrated remarkable
performance in academia and industry. The learning capability of convolutional neural …
performance in academia and industry. The learning capability of convolutional neural …
A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …
massive model sizes that require significant computational and storage resources. To …
Tinyclip: Clip distillation via affinity mimicking and weight inheritance
In this paper, we propose a novel cross-modal distillation method, called TinyCLIP, for large-
scale language-image pre-trained models. The method introduces two core techniques …
scale language-image pre-trained models. The method introduces two core techniques …
Combined depth space based architecture search for person re-identification
Most works on person re-identification (ReID) take advantage of large backbone networks
such as ResNet, which are designed for image classification instead of ReID, for feature …
such as ResNet, which are designed for image classification instead of ReID, for feature …
TinyML: Enabling of inference deep learning models on ultra-low-power IoT edge devices for AI applications
NN Alajlan, DM Ibrahim - Micromachines, 2022 - mdpi.com
Recently, the Internet of Things (IoT) has gained a lot of attention, since IoT devices are
placed in various fields. Many of these devices are based on machine learning (ML) models …
placed in various fields. Many of these devices are based on machine learning (ML) models …
Chex: Channel exploration for cnn model compression
Channel pruning has been broadly recognized as an effective technique to reduce the
computation and memory cost of deep convolutional neural networks. However …
computation and memory cost of deep convolutional neural networks. However …
Bringing AI to edge: From deep learning's perspective
Edge computing and artificial intelligence (AI), especially deep learning algorithms, are
gradually intersecting to build the novel system, namely edge intelligence. However, the …
gradually intersecting to build the novel system, namely edge intelligence. However, the …
Dsa: More efficient budgeted pruning via differentiable sparsity allocation
Budgeted pruning is the problem of pruning under resource constraints. In budgeted
pruning, how to distribute the resources across layers (ie, sparsity allocation) is the key …
pruning, how to distribute the resources across layers (ie, sparsity allocation) is the key …
Weight-sharing neural architecture search: A battle to shrink the optimization gap
Neural architecture search (NAS) has attracted increasing attention. In recent years,
individual search methods have been replaced by weight-sharing search methods for higher …
individual search methods have been replaced by weight-sharing search methods for higher …
Gdp: Stabilized neural network pruning via gates with differentiable polarization
Abstract Model compression techniques are recently gaining explosive attention for
obtaining efficient AI models for various real time applications. Channel pruning is one …
obtaining efficient AI models for various real time applications. Channel pruning is one …