[HTML][HTML] Transformers in medical image analysis

K He, C Gan, Z Li, I Rekik, Z Yin, W Ji, Y Gao, Q Wang… - Intelligent …, 2023 - Elsevier
Transformers have dominated the field of natural language processing and have recently
made an impact in the area of computer vision. In the field of medical image analysis …

A comprehensive review on deep learning based remote sensing image super-resolution methods

P Wang, B Bayram, E Sertel - Earth-Science Reviews, 2022 - Elsevier
Satellite imageries are an important geoinformation source for different applications in the
Earth Science field. However, due to the limitation of the optic and sensor technologies and …

Exploiting diffusion prior for real-world image super-resolution

J Wang, Z Yue, S Zhou, KCK Chan, CC Loy - International Journal of …, 2024 - Springer
We present a novel approach to leverage prior knowledge encapsulated in pre-trained text-
to-image diffusion models for blind super-resolution. Specifically, by employing our time …

Learning a sparse transformer network for effective image deraining

X Chen, H Li, M Li, J Pan - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Transformers-based methods have achieved significant performance in image deraining as
they can model the non-local information which is vital for high-quality image reconstruction …

SwinFusion: Cross-domain long-range learning for general image fusion via swin transformer

J Ma, L Tang, F Fan, J Huang, X Mei… - IEEE/CAA Journal of …, 2022 - ieeexplore.ieee.org
This study proposes a novel general image fusion framework based on cross-domain long-
range learning and Swin Transformer, termed as SwinFusion. On the one hand, an attention …

Restormer: Efficient transformer for high-resolution image restoration

SW Zamir, A Arora, S Khan, M Hayat… - Proceedings of the …, 2022 - openaccess.thecvf.com
Since convolutional neural networks (CNNs) perform well at learning generalizable image
priors from large-scale data, these models have been extensively applied to image …

Srformer: Permuted self-attention for single image super-resolution

Y Zhou, Z Li, CL Guo, S Bai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Previous works have shown that increasing the window size for Transformer-based image
super-resolution models (eg, SwinIR) can significantly improve the model performance but …

Mm-diffusion: Learning multi-modal diffusion models for joint audio and video generation

L Ruan, Y Ma, H Yang, H He, B Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose the first joint audio-video generation framework that brings engaging watching
and listening experiences simultaneously, towards high-quality realistic videos. To generate …

SwinSUNet: Pure transformer network for remote sensing image change detection

C Zhang, L Wang, S Cheng, Y Li - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Convolutional neural network (CNN) can extract effective semantic features, so it was widely
used for remote sensing image change detection (CD) in the latest years. CNN has acquired …

Uformer: A general u-shaped transformer for image restoration

Z Wang, X Cun, J Bao, W Zhou… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, we present Uformer, an effective and efficient Transformer-based architecture
for image restoration, in which we build a hierarchical encoder-decoder network using the …