Communication-efficient edge AI: Algorithms and systems
Artificial intelligence (AI) has achieved remarkable breakthroughs in a wide range of fields,
ranging from speech processing, image classification to drug discovery. This is driven by the …
ranging from speech processing, image classification to drug discovery. This is driven by the …
Deep convolutional neural networks for image classification: A comprehensive review
Convolutional neural networks (CNNs) have been applied to visual tasks since the late
1980s. However, despite a few scattered applications, they were dormant until the mid …
1980s. However, despite a few scattered applications, they were dormant until the mid …
Fnet: Mixing tokens with fourier transforms
We show that Transformer encoder architectures can be sped up, with limited accuracy
costs, by replacing the self-attention sublayers with simple linear transformations that" mix" …
costs, by replacing the self-attention sublayers with simple linear transformations that" mix" …
Binary neural networks: A survey
The binary neural network, largely saving the storage and computation, serves as a
promising technique for deploying deep models on resource-limited devices. However, the …
promising technique for deploying deep models on resource-limited devices. However, the …
Patient knowledge distillation for bert model compression
Pre-trained language models such as BERT have proven to be highly effective for natural
language processing (NLP) tasks. However, the high demand for computing resources in …
language processing (NLP) tasks. However, the high demand for computing resources in …
A survey of model compression and acceleration for deep neural networks
Deep neural networks (DNNs) have recently achieved great success in many visual
recognition tasks. However, existing deep neural network models are computationally …
recognition tasks. However, existing deep neural network models are computationally …
Learning efficient object detection models with knowledge distillation
Despite significant accuracy improvement in convolutional neural networks (CNN) based
object detectors, they often require prohibitive runtimes to process an image for real-time …
object detectors, they often require prohibitive runtimes to process an image for real-time …
Nisp: Pruning networks using neuron importance score propagation
To reduce the significant redundancy in deep Convolutional Neural Networks (CNNs), most
existing methods prune neurons by only considering the statistics of an individual layer or …
existing methods prune neurons by only considering the statistics of an individual layer or …
Model compression and acceleration for deep neural networks: The principles, progress, and challenges
In recent years, deep neural networks (DNNs) have received increased attention, have been
applied to different applications, and achieved dramatic accuracy improvements in many …
applied to different applications, and achieved dramatic accuracy improvements in many …
Recent advances in convolutional neural network acceleration
In recent years, convolutional neural networks (CNNs) have shown great performance in
various fields such as image classification, pattern recognition, and multi-media …
various fields such as image classification, pattern recognition, and multi-media …