Efficientformer: Vision transformers at mobilenet speed
Abstract Vision Transformers (ViT) have shown rapid progress in computer vision tasks,
achieving promising results on various benchmarks. However, due to the massive number of …
achieving promising results on various benchmarks. However, due to the massive number of …
Rethinking vision transformers for mobilenet size and speed
With the success of Vision Transformers (ViTs) in computer vision tasks, recent arts try to
optimize the performance and complexity of ViTs to enable efficient deployment on mobile …
optimize the performance and complexity of ViTs to enable efficient deployment on mobile …
Edgevits: Competing light-weight cnns on mobile devices with vision transformers
Self-attention based models such as vision transformers (ViTs) have emerged as a very
competitive architecture alternative to convolutional neural networks (CNNs) in computer …
competitive architecture alternative to convolutional neural networks (CNNs) in computer …
Repvit: Revisiting mobile cnn from vit perspective
Abstract Recently lightweight Vision Transformers (ViTs) demonstrate superior performance
and lower latency compared with lightweight Convolutional Neural Networks (CNNs) on …
and lower latency compared with lightweight Convolutional Neural Networks (CNNs) on …
FastViT: A fast hybrid vision transformer using structural reparameterization
The recent amalgamation of transformer and convolutional designs has led to steady
improvements in accuracy and efficiency of the models. In this work, we introduce FastViT, a …
improvements in accuracy and efficiency of the models. In this work, we introduce FastViT, a …
Efficientvit: Memory efficient vision transformer with cascaded group attention
Vision transformers have shown great success due to their high model capabilities.
However, their remarkable performance is accompanied by heavy computation costs, which …
However, their remarkable performance is accompanied by heavy computation costs, which …
Edgenext: efficiently amalgamated cnn-transformer architecture for mobile vision applications
In the pursuit of achieving ever-increasing accuracy, large and complex neural networks are
usually developed. Such models demand high computational resources and therefore …
usually developed. Such models demand high computational resources and therefore …
Towards efficient vision transformer inference: A first study of transformers on mobile devices
Convolution neural networks (CNNs) have long been dominating the model choice in on-
device intelligent mobile applications. Recently, we are witnessing the fast development of …
device intelligent mobile applications. Recently, we are witnessing the fast development of …
Separable self-attention for mobile vision transformers
S Mehta, M Rastegari - arXiv preprint arXiv:2206.02680, 2022 - arxiv.org
Mobile vision transformers (MobileViT) can achieve state-of-the-art performance across
several mobile vision tasks, including classification and detection. Though these models …
several mobile vision tasks, including classification and detection. Though these models …
Patch slimming for efficient vision transformers
This paper studies the efficiency problem for visual transformers by excavating redundant
calculation in given networks. The recent transformer architecture has demonstrated its …
calculation in given networks. The recent transformer architecture has demonstrated its …