相关文章- 学术资源搜索

Imagereward: Learning and evaluating human preferences for text-to-image generation

J Xu, X Liu, Y Wu, Y Tong, Q Li… - Advances in …, 2024 - proceedings.neurips.cc

We present a comprehensive solution to learn and improve text-to-image models from
human preference feedback. To begin with, we build ImageReward---the first general …

被引用次数：175 相关文章所有 6 个版本

[PDF] arxiv.org

Human preference score v2: A solid benchmark for evaluating human preferences of text-to-image synthesis

X Wu, Y Hao, K Sun, Y Chen, F Zhu, R Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org

Recent text-to-image generative models can generate high-fidelity images from text inputs,
but the quality of these generated images cannot be accurately evaluated by existing …

被引用次数：72 相关文章所有 2 个版本

[PDF] arxiv.org

Aligning text-to-image models using human feedback

K Lee, H Liu, M Ryu, O Watkins, Y Du… - arXiv preprint arXiv …, 2023 - arxiv.org

Deep generative models have shown impressive results in text-to-image synthesis.
However, current text-to-image models often generate images that are inadequately aligned …

被引用次数：136 相关文章所有 2 个版本

[PDF] thecvf.com

Toward verifiable and reproducible human evaluation for text-to-image generation

M Otani, R Togashi, Y Sawai… - Proceedings of the …, 2023 - openaccess.thecvf.com

Human evaluation is critical for validating the performance of text-to-image generative
models, as this highly cognitive process requires deep comprehension of text and images …

被引用次数：46 相关文章所有 7 个版本

[PDF] neurips.cc

Pick-a-pic: An open dataset of user preferences for text-to-image generation

Y Kirstain, A Polyak, U Singer… - Advances in …, 2023 - proceedings.neurips.cc

The ability to collect a large dataset of human preferences from text-to-image users is
usually limited to companies, making such datasets inaccessible to the public. To address …

被引用次数：126 相关文章所有 5 个版本

[PDF] thecvf.com

Rich human feedback for text-to-image generation

Y Liang, J He, G Li, P Li, A Klimovskiy… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract Recent Text-to-Image (T2I) generation models such as Stable Diffusion and Imagen
have made significant progress in generating high-resolution images based on text …

被引用次数：9 相关文章所有 3 个版本

[PDF] neurips.cc

Optimizing prompts for text-to-image generation

Y Hao, Z Chi, L Dong, F Wei - Advances in Neural …, 2024 - proceedings.neurips.cc

Well-designed prompts can guide text-to-image models to generate amazing images.
However, the performant prompts are often model-specific and misaligned with user input …

被引用次数：92 相关文章所有 5 个版本

[PDF] thecvf.com

Iti-gen: Inclusive text-to-image generation

C Zhang, X Chen, S Chai, CH Wu… - Proceedings of the …, 2023 - openaccess.thecvf.com

Text-to-image generative models often reflect the biases of the training data, leading to
unequal representations of underrepresented groups. This study investigates inclusive text …

被引用次数：26 相关文章所有 5 个版本

[PDF] thecvf.com

Grounded text-to-image synthesis with attention refocusing

Q Phung, S Ge, JB Huang - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Driven by the scalable diffusion models trained on large-scale datasets text-to-image
synthesis methods have shown compelling results. However these models still fail to …

被引用次数：42 相关文章所有 3 个版本

[PDF] aaai.org

Compositional text-to-image synthesis with attention map control of diffusion models

R Wang, Z Chen, C Chen, J Ma, H Lu… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

Recent text-to-image (T2I) diffusion models show outstanding performance in generating
high-quality images conditioned on textual prompts. However, they fail to semantically align …

被引用次数：27 相关文章所有 4 个版本

Imagereward: Learning and evaluating human preferences for text-to-image generation

Human preference score v2: A solid benchmark for evaluating human preferences of text-to-image synthesis

Aligning text-to-image models using human feedback

Toward verifiable and reproducible human evaluation for text-to-image generation

Pick-a-pic: An open dataset of user preferences for text-to-image generation

Rich human feedback for text-to-image generation

Optimizing prompts for text-to-image generation

Iti-gen: Inclusive text-to-image generation

Grounded text-to-image synthesis with attention refocusing

Compositional text-to-image synthesis with attention map control of diffusion models

相关搜索

高级搜索

引用