Axial attention in multidimensional transformers

R Azad, EK Aghdam, A Rauland, Y Jia… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Automatic medical image segmentation is a crucial topic in the medical domain and
successively a critical counterpart in the computer-aided diagnosis paradigm. U-Net is the …

被引用次数：252 相关文章所有 2 个版本网页快照

[PDF] sci-hub [HTML] sciencedirect.com [ 下载加速 ]

[HTML][HTML] A survey of transformers

T Lin, Y Wang, X Liu, X Qiu - AI open, 2022 - Elsevier

Transformers have achieved great success in many artificial intelligence fields, such as
natural language processing, computer vision, and audio processing. Therefore, it is natural …

被引用次数：1398 相关文章所有 4 个版本网页快照

[PDF] sci-hub [PDF] thecvf.com [ 下载加速 ]

Biformer: Vision transformer with bi-level routing attention

L Zhu, X Wang, Z Ke, W Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

As the core building block of vision transformers, attention is a powerful tool to capture long-
range dependency. However, such power comes at a cost: it incurs a huge computation …

被引用次数：656 相关文章所有 10 个版本网页快照

[PDF] sci-hub [HTML] nature.com [ 下载加速 ]

[HTML][HTML] Discovering faster matrix multiplication algorithms with reinforcement learning

A Fawzi, M Balog, A Huang, T Hubert… - Nature, 2022 - nature.com

Improving the efficiency of algorithms for fundamental computations can have a widespread
impact, as it can affect the overall speed of a large amount of computations. Matrix …

被引用次数：645 相关文章所有 11 个版本网页快照

[PDF] sci-hub [PDF] science.org [ 下载加速 ]

Evolutionary-scale prediction of atomic-level protein structure with a language model

Z Lin, H Akin, R Rao, B Hie, Z Zhu, W Lu, N Smetanin… - Science, 2023 - science.org

Recent advances in machine learning have leveraged evolutionary information in multiple
sequence alignments to predict protein structure. We demonstrate direct inference of full …

被引用次数：2207 相关文章所有 9 个版本网页快照

[PDF] sci-hub [PDF] arxiv.org [ 下载加速 ]

Maxvit: Multi-axis vision transformer

Z Tu, H Talebi, H Zhang, F Yang, P Milanfar… - European conference on …, 2022 - Springer

Transformers have recently gained significant attention in the computer vision community.
However, the lack of scalability of self-attention mechanisms with respect to image size has …

被引用次数：725 相关文章所有 8 个版本网页快照

[PDF] sci-hub [PDF] neurips.cc [ 下载加速 ]

Video diffusion models

J Ho, T Salimans, A Gritsenko… - Advances in …, 2022 - proceedings.neurips.cc

Generating temporally coherent high fidelity video is an important milestone in generative
modeling research. We make progress towards this milestone by proposing a diffusion …

被引用次数：1389 相关文章所有 8 个版本网页快照

[PDF] sci-hub [PDF] thecvf.com [ 下载加速 ]

Simvp: Simpler yet better video prediction

Z Gao, C Tan, L Wu, SZ Li - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com

Abstract From CNN, RNN, to ViT, we have witnessed remarkable advancements in video
prediction, incorporating auxiliary inputs, elaborate neural architectures, and sophisticated …

被引用次数：261 相关文章所有 6 个版本网页快照

[PDF] sci-hub [PDF] mlr.press [ 下载加速 ]

Transformer quality in linear time

W Hua, Z Dai, H Liu, Q Le - International conference on …, 2022 - proceedings.mlr.press

We revisit the design choices in Transformers, and propose methods to address their
weaknesses in handling long sequences. First, we propose a simple layer named gated …

被引用次数：255 相关文章所有 5 个版本网页快照

[PDF] sci-hub [PDF] thecvf.com [ 下载加速 ]

Cswin transformer: A general vision transformer backbone with cross-shaped windows

X Dong, J Bao, D Chen, W Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com

Abstract We present CSWin Transformer, an efficient and effective Transformer-based
backbone for general-purpose vision tasks. A challenging issue in Transformer design is …

被引用次数：1194 相关文章所有 7 个版本网页快照