Trq: Ternary neural networks with residual quantization

S Cong, Y Zhou - Artificial Intelligence Review, 2023 - Springer

The research advances concerning the typical architectures of convolutional neural
networks (CNNs) as well as their optimizations are analyzed and elaborated in detail in this …

被引用次数：165 相关文章所有 5 个版本

[PDF] arxiv.org

Colbertv2: Effective and efficient retrieval via lightweight late interaction

K Santhanam, O Khattab, J Saad-Falcon… - arXiv preprint arXiv …, 2021 - arxiv.org

Neural information retrieval (IR) has greatly advanced search and other knowledge-
intensive language tasks. While many neural IR methods encode queries and documents …

被引用次数：378 相关文章所有 5 个版本

[PDF] thecvf.com

Autoregressive image generation using residual quantization

D Lee, C Kim, S Kim, M Cho… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

For autoregressive (AR) modeling of high-resolution images, vector quantization (VQ)
represents an image as a sequence of discrete codes. A short sequence length is important …

被引用次数：275 相关文章所有 5 个版本

[PDF] thecvf.com

Momask: Generative masked modeling of 3d human motions

C Guo, Y Mu, MG Javed, S Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com

We introduce MoMask a novel masked modeling framework for text-driven 3D human
motion generation. In MoMask a hierarchical quantization scheme is employed to represent …

被引用次数：24 相关文章所有 3 个版本

[HTML] sciencedirect.com

[HTML][HTML] Design implementations of ternary logic systems: A critical review

F Zahoor, RA Jaber, UB Isyaku, T Sharma, F Bashir… - Results in …, 2024 - Elsevier

In the electronics industry, binary devices have played a critical role since the development
of solid-state transistors. While binary technology associates devices' inherent ability to be …

被引用次数：2 相关文章

[PDF] springer.com

A survey of model compression strategies for object detection

Z Lyu, T Yu, F Pan, Y Zhang, J Luo, D Zhang… - Multimedia tools and …, 2024 - Springer

Deep neural networks (DNNs) have achieved great success in many object detection tasks.
However, such DNNS-based large object detection models are generally computationally …

被引用次数：20 相关文章所有 2 个版本

[PDF] mdu.se

Tas: ternarized neural architecture search for resource-constrained edge devices

M Loni, H Mousavi, M Riazati… - … , Automation & Test …, 2022 - ieeexplore.ieee.org

Ternary Neural Networks (TNNs) compress network weights and activation functions into 2-
bit representation resulting in remarkable network compression and energy efficiency …

被引用次数：28 相关文章所有 8 个版本

[PDF] acm.org

BISDU: A Bit-Serial Dot-Product Unit for Microcontrollers

D Metz, V Kumar, M Själander - ACM Transactions on Embedded …, 2023 - dl.acm.org

Low-precision quantized neural networks (QNNs) reduce the required memory space,
bandwidth, and computational power, and hence are suitable for deployment in applications …

被引用次数：4 相关文章所有 3 个版本

X-nvdla: Runtime accuracy configurable nvdla based on applying voltage overscaling to computing and memory units

H Afzali-Kusha, M Pedram - … on Circuits and Systems I: Regular …, 2023 - ieeexplore.ieee.org

This paper investigates a runtime accuracy reconfigurable implementation of an energy
efficient deep learning accelerator. It is based on voltage overscaling (VOS) technique which …

被引用次数：6 相关文章

[PDF] acm.org

Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision

X Luo, D Liu, H Kong, S Huai, H Chen… - ACM Transactions on …, 2024 - dl.acm.org

Deep neural networks (DNNs) have recently achieved impressive success across a wide
range of real-world vision and language processing tasks, spanning from image …

被引用次数：1 相关文章所有 3 个版本