Iti-gen: Inclusive text-to-image generation

C Zhang, X Chen, S Chai, CH Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Text-to-image generative models often reflect the biases of the training data, leading to
unequal representations of underrepresented groups. This study investigates inclusive text …

Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation

N Ruiz, Y Li, V Jampani, Y Pritch… - Proceedings of the …, 2023 - openaccess.thecvf.com
Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-
quality and diverse synthesis of images from a given text prompt. However, these models …

A-star: Test-time attention segregation and retention for text-to-image synthesis

A Agarwal, S Karanam, KJ Joseph… - Proceedings of the …, 2023 - openaccess.thecvf.com
While recent developments in text-to-image generative models have led to a suite of high-
performing methods capable of producing creative imagery from free-form text, there are …

Toward verifiable and reproducible human evaluation for text-to-image generation

M Otani, R Togashi, Y Sawai… - Proceedings of the …, 2023 - openaccess.thecvf.com
Human evaluation is critical for validating the performance of text-to-image generative
models, as this highly cognitive process requires deep comprehension of text and images …

Training-free consistent text-to-image generation

Y Tewel, O Kaduri, R Gal, Y Kasten, L Wolf… - arXiv preprint arXiv …, 2024 - arxiv.org
Text-to-image models offer a new level of creative flexibility by allowing users to guide the
image generation process through natural language. However, using these models to …

Aligning text-to-image models using human feedback

K Lee, H Liu, M Ryu, O Watkins, Y Du… - arXiv preprint arXiv …, 2023 - arxiv.org
Deep generative models have shown impressive results in text-to-image synthesis.
However, current text-to-image models often generate images that are inadequately aligned …

Discriminative probing and tuning for text-to-image generation

L Qu, W Wang, Y Li, H Zhang, L Nie… - Proceedings of the …, 2024 - openaccess.thecvf.com
Despite advancements in text-to-image generation (T2I) prior methods often face text-image
misalignment problems such as relation confusion in generated images. Existing solutions …

Human preference score v2: A solid benchmark for evaluating human preferences of text-to-image synthesis

X Wu, Y Hao, K Sun, Y Chen, F Zhu, R Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent text-to-image generative models can generate high-fidelity images from text inputs,
but the quality of these generated images cannot be accurately evaluated by existing …

Dall-eval: Probing the reasoning skills and social biases of text-to-image generation models

J Cho, A Zala, M Bansal - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Recently, DALL-E, a multimodal transformer language model, and its variants including
diffusion models have shown high-quality text-to-image generation capabilities. However …

ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation

S Hao, K Han, S Zhao, KYK Wong - arXiv preprint arXiv:2306.00971, 2023 - arxiv.org
Personalized text-to-image generation using diffusion models has recently emerged and
garnered significant interest. This task learns a novel concept (eg, a unique toy), illustrated …