相关文章- 学术资源搜索

Iti-gen: Inclusive text-to-image generation

C Zhang, X Chen, S Chai, CH Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com

Text-to-image generative models often reflect the biases of the training data, leading to
unequal representations of underrepresented groups. This study investigates inclusive text …

被引用次数：26 相关文章所有 5 个版本

[PDF] thecvf.com

Dreambooth: Fine tuning text-to-image diffusion models for subject-driven generation

N Ruiz, Y Li, V Jampani, Y Pritch… - Proceedings of the …, 2023 - openaccess.thecvf.com

Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-
quality and diverse synthesis of images from a given text prompt. However, these models …

被引用次数：1540 相关文章所有 6 个版本

[PDF] thecvf.com

A-star: Test-time attention segregation and retention for text-to-image synthesis

A Agarwal, S Karanam, KJ Joseph… - Proceedings of the …, 2023 - openaccess.thecvf.com

While recent developments in text-to-image generative models have led to a suite of high-
performing methods capable of producing creative imagery from free-form text, there are …

被引用次数：15 相关文章所有 7 个版本

[PDF] thecvf.com

Toward verifiable and reproducible human evaluation for text-to-image generation

M Otani, R Togashi, Y Sawai… - Proceedings of the …, 2023 - openaccess.thecvf.com

Human evaluation is critical for validating the performance of text-to-image generative
models, as this highly cognitive process requires deep comprehension of text and images …

被引用次数：46 相关文章所有 7 个版本

[PDF] arxiv.org

Training-free consistent text-to-image generation

Y Tewel, O Kaduri, R Gal, Y Kasten, L Wolf… - arXiv preprint arXiv …, 2024 - arxiv.org

Text-to-image models offer a new level of creative flexibility by allowing users to guide the
image generation process through natural language. However, using these models to …

被引用次数：11 相关文章所有 3 个版本

[PDF] arxiv.org

Aligning text-to-image models using human feedback

K Lee, H Liu, M Ryu, O Watkins, Y Du… - arXiv preprint arXiv …, 2023 - arxiv.org

Deep generative models have shown impressive results in text-to-image synthesis.
However, current text-to-image models often generate images that are inadequately aligned …

被引用次数：136 相关文章所有 2 个版本

[PDF] thecvf.com

Discriminative probing and tuning for text-to-image generation

L Qu, W Wang, Y Li, H Zhang, L Nie… - Proceedings of the …, 2024 - openaccess.thecvf.com

Despite advancements in text-to-image generation (T2I) prior methods often face text-image
misalignment problems such as relation confusion in generated images. Existing solutions …

被引用次数：5 相关文章所有 3 个版本

[PDF] arxiv.org

Human preference score v2: A solid benchmark for evaluating human preferences of text-to-image synthesis

X Wu, Y Hao, K Sun, Y Chen, F Zhu, R Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org

Recent text-to-image generative models can generate high-fidelity images from text inputs,
but the quality of these generated images cannot be accurately evaluated by existing …

被引用次数：72 相关文章所有 2 个版本

[PDF] thecvf.com

Dall-eval: Probing the reasoning skills and social biases of text-to-image generation models

J Cho, A Zala, M Bansal - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

Recently, DALL-E, a multimodal transformer language model, and its variants including
diffusion models have shown high-quality text-to-image generation capabilities. However …

被引用次数：79 相关文章所有 5 个版本

[PDF] arxiv.org

ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation

S Hao, K Han, S Zhao, KYK Wong - arXiv preprint arXiv:2306.00971, 2023 - arxiv.org

Personalized text-to-image generation using diffusion models has recently emerged and
garnered significant interest. This task learns a novel concept (eg, a unique toy), illustrated …

被引用次数：32 相关文章所有 3 个版本