Improved transformer for high-resolution gans

Y Lu, D Chen, E Olaniyi, Y Huang - Computers and Electronics in …, 2022 - Elsevier

In agricultural image analysis, optimal model performance is keenly pursued for better
fulfilling visual recognition tasks (eg, image classification, segmentation, object detection …

被引用次数：205 相关文章所有 7 个版本

[PDF] arxiv.org

Maxvit: Multi-axis vision transformer

Z Tu, H Talebi, H Zhang, F Yang, P Milanfar… - European conference on …, 2022 - Springer

Transformers have recently gained significant attention in the computer vision community.
However, the lack of scalability of self-attention mechanisms with respect to image size has …

被引用次数：725 相关文章所有 8 个版本

[PDF] thecvf.com

Maxim: Multi-axis mlp for image processing

Z Tu, H Talebi, H Zhang, F Yang… - Proceedings of the …, 2022 - openaccess.thecvf.com

Recent progress on Transformers and multi-layer perceptron (MLP) models provide new
network architectural designs for computer vision tasks. Although these models proved to be …

被引用次数：553 相关文章所有 10 个版本

[PDF] thecvf.com

Uformer: A general u-shaped transformer for image restoration

Z Wang, X Cun, J Bao, W Zhou… - Proceedings of the …, 2022 - openaccess.thecvf.com

In this paper, we present Uformer, an effective and efficient Transformer-based architecture
for image restoration, in which we build a hierarchical encoder-decoder network using the …

被引用次数：1722 相关文章所有 7 个版本

[PDF] thecvf.com

Styleswin: Transformer-based gan for high-resolution image generation

B Zhang, S Gu, B Zhang, J Bao… - Proceedings of the …, 2022 - openaccess.thecvf.com

Despite the tantalizing success in a broad of vision tasks, transformers have not yet
demonstrated on-par ability as ConvNets in high-resolution image generative modeling. In …

被引用次数：276 相关文章所有 7 个版本

[PDF] thecvf.com

Mage: Masked generative encoder to unify representation learning and image synthesis

T Li, H Chang, S Mishra, H Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Generative modeling and representation learning are two key tasks in computer vision.
However, these models are typically trained independently, which ignores the potential for …

被引用次数：139 相关文章所有 6 个版本

[PDF] neurips.cc

Transgan: Two pure transformers can make one strong gan, and that can scale up

Y Jiang, S Chang, Z Wang - Advances in Neural …, 2021 - proceedings.neurips.cc

The recent explosive interest on transformers has suggested their potential to become
powerful``universal" models for computer vision tasks, such as classification, detection, and …

被引用次数：483 相关文章所有 9 个版本

[PDF] openreview.net

Diffit: Diffusion vision transformers for image generation

A Hatamizadeh, J Song, G Liu, J Kautz… - European Conference on …, 2025 - Springer

Diffusion models with their powerful expressivity and high sample quality have achieved
State-Of-The-Art (SOTA) performance in the generative domain. The pioneering Vision …

被引用次数：44 相关文章所有 3 个版本

[PDF] ieee.org

A comprehensive review of deep learning-based real-world image restoration

L Zhai, Y Wang, S Cui, Y Zhou - IEEE Access, 2023 - ieeexplore.ieee.org

Real-world imagery does not always exhibit good visibility and clean content, but often
suffers from various kinds of degradations (eg, noise, blur, rain drops, fog, color distortion …

被引用次数：29 相关文章所有 2 个版本

[PDF] neurips.cc

Class-aware adversarial transformers for medical image segmentation

C You, R Zhao, F Liu, S Dong… - Advances in …, 2022 - proceedings.neurips.cc

Transformers have made remarkable progress towards modeling long-range dependencies
within the medical image analysis domain. However, current transformer-based models …

被引用次数：132 相关文章所有 8 个版本