A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms

R Gong, Y Ding, Z Wang, C Lv, X Zheng, J Du… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) have achieved remarkable advancements in natural
language processing, showcasing exceptional performance across various tasks. However …

Binaryvit: Towards efficient and accurate binary vision transformers

J Xiao, Z Li, J Li, L Yang, Q Gu - IEEE Transactions on Circuits …, 2024 - ieeexplore.ieee.org
Vision Transformers (ViTs) have emerged as the new fundamental architecture for most
computer vision fields. However, the considerable memory and computation costs also …

An Analysis on Quantizing Diffusion Transformers

Y Yang, J Wang, X Dai, P Zhang, H Zhang - arXiv preprint arXiv …, 2024 - arxiv.org
Diffusion Models (DMs) utilize an iterative denoising process to transform random noise into
synthetic data. Initally proposed with a UNet structure, DMs excel at producing images that …

Mixed Non-linear Quantization for Vision Transformers

G Kim, J Lee, S Park, Y Kwon, H Kim - arXiv preprint arXiv:2407.18437, 2024 - arxiv.org
The majority of quantization methods have been proposed to reduce the model size of
Vision Transformers, yet most of them have overlooked the quantization of non-linear …

Performance Comparison of Vision Transformer-and CNN-Based Image Classification Using Cross Entropy: A Preliminary Application to Lung Cancer Discrimination …

E Matsuyama, H Watanabe, N Takahashi - Journal of Biomedical Science …, 2024 - scirp.org
This study evaluates the performance and reliability of a vision transformer (ViT) compared
to convolutional neural networks (CNNs) using the ResNet50 model in classifying lung …

SI-BiViT: Binarizing Vision Transformers with Spatial Interaction

P Yin, X Zhu, J Song, L Gao, HT Shen - ACM Multimedia 2024 - openreview.net
Binarized Vision Transformers (BiViTs) aim to facilitate the efficient and lightweight utilization
of Vision Transformers (ViTs) on devices with limited computational resources. Yet, the …