Bi-ViT: Pushing the Limit of Vision Transformer Quantization

R Gong, Y Ding, Z Wang, C Lv, X Zheng, J Du… - arXiv preprint arXiv …, 2024 - arxiv.org

Large language models (LLMs) have achieved remarkable advancements in natural
language processing, showcasing exceptional performance across various tasks. However …

BinaryViT: Towards efficient and accurate binary vision transformers

J Xiao, Z Li, J Li, L Yang, Q Gu - IEEE Transactions on Circuits …, 2024 - ieeexplore.ieee.org

Vision Transformers (ViTs) have emerged as the new fundamental architecture for most
computer vision fields. However, the considerable memory and computation costs also …

An Analysis on Quantizing Diffusion Transformers

Y Yang, J Wang, X Dai, P Zhang, H Zhang - arXiv preprint arXiv …, 2024 - arxiv.org

Diffusion Models (DMs) utilize an iterative denoising process to transform random noise into
synthetic data. Initially proposed with a UNet structure, DMs excel at producing images that …

Mixed Non-linear Quantization for Vision Transformers

G Kim, J Lee, S Park, Y Kwon, H Kim - arXiv preprint arXiv:2407.18437, 2024 - arxiv.org

The majority of quantization methods have been proposed to reduce the model size of
Vision Transformers, yet most of them have overlooked the quantization of non-linear …

Performance Comparison of Vision Transformer- and CNN-Based Image Classification Using Cross Entropy: A Preliminary Application to Lung Cancer Discrimination …

E Matsuyama, H Watanabe, N Takahashi - Journal of Biomedical Science …, 2024 - scirp.org

This study evaluates the performance and reliability of a vision transformer (ViT) compared
to convolutional neural networks (CNNs) using the ResNet50 model in classifying lung …

SI-BiViT: Binarizing Vision Transformers with Spatial Interaction

P Yin, X Zhu, J Song, L Gao, HT Shen - ACM Multimedia 2024 - openreview.net

Binarized Vision Transformers (BiViTs) aim to facilitate the efficient and lightweight utilization
of Vision Transformers (ViTs) on devices with limited computational resources. Yet, the …