A review of convolutional neural network architectures and their optimizations

S Cong, Y Zhou - Artificial Intelligence Review, 2023 - Springer
The research advances concerning the typical architectures of convolutional neural
networks (CNNs) as well as their optimizations are analyzed and elaborated in detail in this …

Colbertv2: Effective and efficient retrieval via lightweight late interaction

K Santhanam, O Khattab, J Saad-Falcon… - arXiv preprint arXiv …, 2021 - arxiv.org
Neural information retrieval (IR) has greatly advanced search and other knowledge-
intensive language tasks. While many neural IR methods encode queries and documents …

Autoregressive image generation using residual quantization

D Lee, C Kim, S Kim, M Cho… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
For autoregressive (AR) modeling of high-resolution images, vector quantization (VQ)
represents an image as a sequence of discrete codes. A short sequence length is important …

Momask: Generative masked modeling of 3d human motions

C Guo, Y Mu, MG Javed, S Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
We introduce MoMask a novel masked modeling framework for text-driven 3D human
motion generation. In MoMask a hierarchical quantization scheme is employed to represent …

[HTML][HTML] Design implementations of ternary logic systems: A critical review

F Zahoor, RA Jaber, UB Isyaku, T Sharma, F Bashir… - Results in …, 2024 - Elsevier
In the electronics industry, binary devices have played a critical role since the development
of solid-state transistors. While binary technology associates devices' inherent ability to be …

A survey of model compression strategies for object detection

Z Lyu, T Yu, F Pan, Y Zhang, J Luo, D Zhang… - Multimedia tools and …, 2024 - Springer
Deep neural networks (DNNs) have achieved great success in many object detection tasks.
However, such DNNS-based large object detection models are generally computationally …

Tas: ternarized neural architecture search for resource-constrained edge devices

M Loni, H Mousavi, M Riazati… - … , Automation & Test …, 2022 - ieeexplore.ieee.org
Ternary Neural Networks (TNNs) compress network weights and activation functions into 2-
bit representation resulting in remarkable network compression and energy efficiency …

BISDU: A Bit-Serial Dot-Product Unit for Microcontrollers

D Metz, V Kumar, M Själander - ACM Transactions on Embedded …, 2023 - dl.acm.org
Low-precision quantized neural networks (QNNs) reduce the required memory space,
bandwidth, and computational power, and hence are suitable for deployment in applications …

X-nvdla: Runtime accuracy configurable nvdla based on applying voltage overscaling to computing and memory units

H Afzali-Kusha, M Pedram - … on Circuits and Systems I: Regular …, 2023 - ieeexplore.ieee.org
This paper investigates a runtime accuracy reconfigurable implementation of an energy
efficient deep learning accelerator. It is based on voltage overscaling (VOS) technique which …

Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision

X Luo, D Liu, H Kong, S Huai, H Chen… - ACM Transactions on …, 2024 - dl.acm.org
Deep neural networks (DNNs) have recently achieved impressive success across a wide
range of real-world vision and language processing tasks, spanning from image …