Generative adversarial networks (GANs) for image augmentation in agriculture: A systematic review
In agricultural image analysis, optimal model performance is keenly pursued for better
fulfilling visual recognition tasks (eg, image classification, segmentation, object detection …
fulfilling visual recognition tasks (eg, image classification, segmentation, object detection …
Maxvit: Multi-axis vision transformer
Transformers have recently gained significant attention in the computer vision community.
However, the lack of scalability of self-attention mechanisms with respect to image size has …
However, the lack of scalability of self-attention mechanisms with respect to image size has …
Maxim: Multi-axis mlp for image processing
Recent progress on Transformers and multi-layer perceptron (MLP) models provide new
network architectural designs for computer vision tasks. Although these models proved to be …
network architectural designs for computer vision tasks. Although these models proved to be …
Uformer: A general u-shaped transformer for image restoration
In this paper, we present Uformer, an effective and efficient Transformer-based architecture
for image restoration, in which we build a hierarchical encoder-decoder network using the …
for image restoration, in which we build a hierarchical encoder-decoder network using the …
Styleswin: Transformer-based gan for high-resolution image generation
Despite the tantalizing success in a broad of vision tasks, transformers have not yet
demonstrated on-par ability as ConvNets in high-resolution image generative modeling. In …
demonstrated on-par ability as ConvNets in high-resolution image generative modeling. In …
Mage: Masked generative encoder to unify representation learning and image synthesis
Generative modeling and representation learning are two key tasks in computer vision.
However, these models are typically trained independently, which ignores the potential for …
However, these models are typically trained independently, which ignores the potential for …
Transgan: Two pure transformers can make one strong gan, and that can scale up
The recent explosive interest on transformers has suggested their potential to become
powerful``universal" models for computer vision tasks, such as classification, detection, and …
powerful``universal" models for computer vision tasks, such as classification, detection, and …
Diffit: Diffusion vision transformers for image generation
Diffusion models with their powerful expressivity and high sample quality have achieved
State-Of-The-Art (SOTA) performance in the generative domain. The pioneering Vision …
State-Of-The-Art (SOTA) performance in the generative domain. The pioneering Vision …
A comprehensive review of deep learning-based real-world image restoration
Real-world imagery does not always exhibit good visibility and clean content, but often
suffers from various kinds of degradations (eg, noise, blur, rain drops, fog, color distortion …
suffers from various kinds of degradations (eg, noise, blur, rain drops, fog, color distortion …
Class-aware adversarial transformers for medical image segmentation
Transformers have made remarkable progress towards modeling long-range dependencies
within the medical image analysis domain. However, current transformer-based models …
within the medical image analysis domain. However, current transformer-based models …