A survey of accelerator architectures for deep neural networks
Recently, due to the availability of big data and the rapid growth of computing power,
artificial intelligence (AI) has regained tremendous attention and investment. Machine …
artificial intelligence (AI) has regained tremendous attention and investment. Machine …
A survey on deep neural network compression: Challenges, overview, and solutions
Deep Neural Network (DNN) has gained unprecedented performance due to its automated
feature extraction capability. This high order performance leads to significant incorporation …
feature extraction capability. This high order performance leads to significant incorporation …
ELSA: Hardware-software co-design for efficient, lightweight self-attention mechanism in neural networks
The self-attention mechanism is rapidly emerging as one of the most important key primitives
in neural networks (NNs) for its ability to identify the relations within input entities. The self …
in neural networks (NNs) for its ability to identify the relations within input entities. The self …
Nonuniform-to-uniform quantization: Towards accurate quantization via generalized straight-through estimation
The nonuniform quantization strategy for compressing neural networks usually achieves
better performance than its counterpart, ie, uniform strategy, due to its superior …
better performance than its counterpart, ie, uniform strategy, due to its superior …
Dota: detect and omit weak attentions for scalable transformer acceleration
Transformer Neural Networks have demonstrated leading performance in many applications
spanning over language understanding, image processing, and generative modeling …
spanning over language understanding, image processing, and generative modeling …
Transforming large-size to lightweight deep neural networks for IoT applications
Deep Neural Networks (DNNs) have gained unprecedented popularity due to their high-
order performance and automated feature extraction capability. This has encouraged …
order performance and automated feature extraction capability. This has encouraged …
Machine learning in real-time Internet of Things (IoT) systems: A survey
Over the last decade, machine learning (ML) and deep learning (DL) algorithms have
significantly evolved and been employed in diverse applications, such as computer vision …
significantly evolved and been employed in diverse applications, such as computer vision …
[HTML][HTML] AI augmented Edge and Fog computing: Trends and challenges
In recent years, the landscape of computing paradigms has witnessed a gradual yet
remarkable shift from monolithic computing to distributed and decentralized paradigms such …
remarkable shift from monolithic computing to distributed and decentralized paradigms such …
Efficient AI system design with cross-layer approximate computing
Advances in deep neural networks (DNNs) and the availability of massive real-world data
have enabled superhuman levels of accuracy on many AI tasks and ushered the explosive …
have enabled superhuman levels of accuracy on many AI tasks and ushered the explosive …
{ALERT}: Accurate learning for energy and timeliness
An increasing number of software applications incorporate runtime Deep Neural Networks
(DNNs) to process sensor data and return inference results to humans. Effective deployment …
(DNNs) to process sensor data and return inference results to humans. Effective deployment …