Transformer models for enhancing AttnGAN based text to image generation

[HTML] A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications

L Alzubaidi, J Bai, A Al-Sabaawi, J Santamaría… - Journal of Big Data, 2023 - Springer

Data scarcity is a major challenge when training deep learning (DL) models. DL demands a
large amount of data to achieve exceptional performance. Unfortunately, many applications …

Cited by 316 · Related articles · All 9 versions

A survey of the vision transformers and their CNN-transformer based variants

A Khan, Z Rauf, A Sohail, AR Khan, H Asif… - Artificial Intelligence …, 2023 - Springer

Vision transformers have become popular as a possible substitute to convolutional neural
networks (CNNs) for a variety of computer vision applications. These transformers, with their …

Cited by 62 · Related articles · All 6 versions

Transformer-based generative adversarial networks in computer vision: A comprehensive survey

SR Dubey, SK Singh - IEEE Transactions on Artificial …, 2024 - ieeexplore.ieee.org

Generative Adversarial Networks (GANs) have been very successful for synthesizing the
images in a given dataset. The artificially generated images by GANs are very realistic. The …

Cited by 26 · Related articles · All 4 versions

The research landscape on generative artificial intelligence: a bibliometric analysis of transformer-based models

G Marchena Sekli - Kybernetes, 2024 - emerald.com

Purpose: The aim of this study is to offer valuable insights to businesses and facilitate better
understanding on transformer-based models (TBMs), which are among the widely employed …

Cited by 1 · Related articles

Multi-sentence complementarily generation for text-to-image synthesis

L Zhao, P Huang, T Chen, C Fu, Q Hu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Generating realistic images based on text descriptions remains challenging in computer
vision. Existing multi-stage generation methods are sufficient to generate high-resolution …

Cited by 6 · Related articles

[HTML] Research on automatic classification and detection of mutton multi-parts based on swin-transformer

S Zhao, Z Bai, S Wang, Y Gu - Foods, 2023 - mdpi.com

In order to realize the real-time classification and detection of mutton multi-part, this paper
proposes a mutton multi-part classification and detection method based on the Swin …

Cited by 4 · Related articles · All 8 versions

A Survey of Cross-Modal Visual Content Generation

F Nazarieh, Z Feng, M Awais, W Wang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Cross-modal content generation has become very popular in recent years. To generate
high-quality and realistic content, a variety of methods have been proposed. Among these …

Cited by 1 · Related articles · All 2 versions

VTM-GAN: video-text matcher based generative adversarial network for generating videos from textual description

R Mehmood, R Bashir, KJ Giri - International Journal of Information …, 2024 - Springer

Text-to-video synthesis has garnered significant attention as a challenging task in the
domain of vision computing. With the advent of unsupervised learning techniques, text-to …

Cited by 3 · Related articles

A deep learning based cross model text to image generation using DC-GAN

G Kasi, S Abirami, RD Lakshmi - 2023 12th International …, 2023 - ieeexplore.ieee.org

In recent times, Generative Adversarial Networks have successfully synthesized images
through text descriptions. In the domain of image processing, deep convolutional generative …

Cited by 3 · Related articles

[HTML] Controllable image synthesis methods, applications and challenges: a comprehensive survey

S Huang, Q Li, J Liao, S Wang, L Liu, L Li - Artificial Intelligence Review, 2024 - Springer

Controllable Image Synthesis (CIS) is a methodology that allows users to generate
desired images or manipulate specific attributes of images by providing precise input …