A review of convolutional neural network architectures and their optimizations
The research advances concerning the typical architectures of convolutional neural
networks (CNNs) as well as their optimizations are analyzed and elaborated in detail in this …
networks (CNNs) as well as their optimizations are analyzed and elaborated in detail in this …
Colbertv2: Effective and efficient retrieval via lightweight late interaction
Neural information retrieval (IR) has greatly advanced search and other knowledge-
intensive language tasks. While many neural IR methods encode queries and documents …
intensive language tasks. While many neural IR methods encode queries and documents …
Autoregressive image generation using residual quantization
For autoregressive (AR) modeling of high-resolution images, vector quantization (VQ)
represents an image as a sequence of discrete codes. A short sequence length is important …
represents an image as a sequence of discrete codes. A short sequence length is important …
Momask: Generative masked modeling of 3d human motions
We introduce MoMask a novel masked modeling framework for text-driven 3D human
motion generation. In MoMask a hierarchical quantization scheme is employed to represent …
motion generation. In MoMask a hierarchical quantization scheme is employed to represent …
[HTML][HTML] Design implementations of ternary logic systems: A critical review
In the electronics industry, binary devices have played a critical role since the development
of solid-state transistors. While binary technology associates devices' inherent ability to be …
of solid-state transistors. While binary technology associates devices' inherent ability to be …
A survey of model compression strategies for object detection
Z Lyu, T Yu, F Pan, Y Zhang, J Luo, D Zhang… - Multimedia tools and …, 2024 - Springer
Deep neural networks (DNNs) have achieved great success in many object detection tasks.
However, such DNNS-based large object detection models are generally computationally …
However, such DNNS-based large object detection models are generally computationally …
Tas: ternarized neural architecture search for resource-constrained edge devices
Ternary Neural Networks (TNNs) compress network weights and activation functions into 2-
bit representation resulting in remarkable network compression and energy efficiency …
bit representation resulting in remarkable network compression and energy efficiency …
BISDU: A Bit-Serial Dot-Product Unit for Microcontrollers
D Metz, V Kumar, M Själander - ACM Transactions on Embedded …, 2023 - dl.acm.org
Low-precision quantized neural networks (QNNs) reduce the required memory space,
bandwidth, and computational power, and hence are suitable for deployment in applications …
bandwidth, and computational power, and hence are suitable for deployment in applications …
X-nvdla: Runtime accuracy configurable nvdla based on applying voltage overscaling to computing and memory units
H Afzali-Kusha, M Pedram - … on Circuits and Systems I: Regular …, 2023 - ieeexplore.ieee.org
This paper investigates a runtime accuracy reconfigurable implementation of an energy
efficient deep learning accelerator. It is based on voltage overscaling (VOS) technique which …
efficient deep learning accelerator. It is based on voltage overscaling (VOS) technique which …
Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision
Deep neural networks (DNNs) have recently achieved impressive success across a wide
range of real-world vision and language processing tasks, spanning from image …
range of real-world vision and language processing tasks, spanning from image …