- Academic resource search

A survey of techniques for optimizing transformer inference

KT Chitty-Venkata, S Mittal, M Emani… - Journal of Systems …, 2023 - Elsevier

Recent years have seen a phenomenal rise in the performance and applications of
transformer neural networks. The family of transformer networks, including Bidirectional …

Cited by 28 · Related articles · All 6 versions

Vision transformers for dense prediction: A survey

S Zuo, Y Xiao, X Chang, X Wang - Knowledge-Based Systems, 2022 - Elsevier

Transformers have demonstrated impressive expressiveness and transfer capability in
computer vision fields. Dense prediction is a fundamental problem in computer vision that is …

Cited by 36 · Related articles · All 3 versions

[PDF] arxiv.org

Segment anything is not always perfect: An investigation of SAM on different real-world applications

W Ji, J Li, Q Bi, T Liu, W Li, L Cheng - 2024 - Springer

Recently, Meta AI Research approaches a general, promptable segment anything
model (SAM) pre-trained on an unprecedentedly large segmentation dataset (SA-1B) …

Cited by 125 · Related articles · All 6 versions

[PDF] neurips.cc

SegFormer: Simple and efficient design for semantic segmentation with transformers

E Xie, W Wang, Z Yu, A Anandkumar… - Advances in neural …, 2021 - proceedings.neurips.cc

We present SegFormer, a simple, efficient yet powerful semantic segmentation framework
which unifies Transformers with lightweight multilayer perceptron (MLP) decoders …

Cited by 3888 · Related articles · All 12 versions

[PDF] neurips.cc

Transformer in transformer

K Han, A Xiao, E Wu, J Guo, C Xu… - Advances in neural …, 2021 - proceedings.neurips.cc

Transformer is a new kind of neural architecture which encodes the input data as powerful
features via the attention mechanism. Basically, the visual transformers first divide the input …

Cited by 1550 · Related articles · All 7 versions

[PDF] thecvf.com

Pyramid vision transformer: A versatile backbone for dense prediction without convolutions

W Wang, E Xie, X Li, DP Fan, K Song… - Proceedings of the …, 2021 - openaccess.thecvf.com

Although convolutional neural networks (CNNs) have achieved great success in computer
vision, this work investigates a simpler, convolution-free backbone network useful for many …

Cited by 3749 · Related articles · All 9 versions

[PDF] thecvf.com

TransReID: Transformer-based object re-identification

S He, H Luo, P Wang, F Wang, H Li… - Proceedings of the …, 2021 - openaccess.thecvf.com

Extracting robust feature representation is one of the key challenges in object
re-identification (ReID). Although convolution neural network (CNN)-based methods have …

Cited by 860 · Related articles · All 8 versions

[PDF] aaai.org

TransFG: A transformer architecture for fine-grained recognition

J He, JN Chen, S Liu, A Kortylewski, C Yang… - Proceedings of the …, 2022 - ojs.aaai.org

Fine-grained visual classification (FGVC) which aims at recognizing objects from
subcategories is a very challenging task due to the inherently subtle inter-class differences …

Cited by 379 · Related articles · All 11 versions

[PDF] arxiv.org

Multi-compound transformer for accurate biomedical image segmentation

Y Ji, R Zhang, H Wang, Z Li, L Wu, S Zhang… - … Image Computing and …, 2021 - Springer

The recent vision transformer (i.e., for image classification) learns non-local attentive
interaction of different patch tokens. However, prior arts miss learning the cross-scale …

Cited by 149 · Related articles · All 5 versions

[PDF] arxiv.org

Dex-NeRF: Using a neural radiance field to grasp transparent objects

J Ichnowski, Y Avigal, J Kerr, K Goldberg - arXiv preprint arXiv:2110.14217, 2021 - arxiv.org

The ability to grasp and manipulate transparent objects is a major challenge for robots.
Existing depth cameras have difficulty detecting, localizing, and inferring the geometry of …

Cited by 141 · Related articles · All 4 versions