Self-supervised image-to-text and text-to-image synthesis
A comprehensive understanding of vision and language and their interrelation is crucial to
realize the underlying similarities and differences between these modalities and to learn …
Self-supervised Image-to-Text and Text-to-Image Synthesis
AS Das, S Saha - International Conference on Neural Information …, 2021 - dl.acm.org
Self-Supervised Image-to-Text and Text-to-Image Synthesis
AS Das, S Saha - arXiv preprint arXiv:2112.04928, 2021 - arxiv.org