Vision-language matching for text-to-image synthesis via generative adversarial networks

Q Cheng, K Wen, X Gu - IEEE Transactions on Multimedia, 2022 - ieeexplore.ieee.org
Text-to-image synthesis is an attractive but challenging task that aims to generate a photo-
realistic and semantic consistent image from a specific text description. The images …

Cagan: Text-to-image generation with combined attention generative adversarial networks

H Schulze, D Yaman, A Waibel - DAGM German Conference on Pattern …, 2021 - Springer
Generating images according to natural language descriptions is a challenging task. Prior
research has mainly focused to enhance the quality of generation by investigating the use of …

Video description with GAN

M Wang - 2020 IEEE 3rd International Conference on …, 2020 - ieeexplore.ieee.org
Video description is to convert rich information of video data into text information, Which has
been attracting broad research attention in the Artificial Intelligence Community. Deep …