Imagereward: Learning and evaluating human preferences for text-to-image generation
We present a comprehensive solution to learn and improve text-to-image models from
human preference feedback. To begin with, we build ImageReward---the first general …
human preference feedback. To begin with, we build ImageReward---the first general …
Human preference score v2: A solid benchmark for evaluating human preferences of text-to-image synthesis
Recent text-to-image generative models can generate high-fidelity images from text inputs,
but the quality of these generated images cannot be accurately evaluated by existing …
but the quality of these generated images cannot be accurately evaluated by existing …
Aligning text-to-image models using human feedback
Deep generative models have shown impressive results in text-to-image synthesis.
However, current text-to-image models often generate images that are inadequately aligned …
However, current text-to-image models often generate images that are inadequately aligned …
Toward verifiable and reproducible human evaluation for text-to-image generation
Human evaluation is critical for validating the performance of text-to-image generative
models, as this highly cognitive process requires deep comprehension of text and images …
models, as this highly cognitive process requires deep comprehension of text and images …
Pick-a-pic: An open dataset of user preferences for text-to-image generation
The ability to collect a large dataset of human preferences from text-to-image users is
usually limited to companies, making such datasets inaccessible to the public. To address …
usually limited to companies, making such datasets inaccessible to the public. To address …
Rich human feedback for text-to-image generation
Abstract Recent Text-to-Image (T2I) generation models such as Stable Diffusion and Imagen
have made significant progress in generating high-resolution images based on text …
have made significant progress in generating high-resolution images based on text …
Optimizing prompts for text-to-image generation
Well-designed prompts can guide text-to-image models to generate amazing images.
However, the performant prompts are often model-specific and misaligned with user input …
However, the performant prompts are often model-specific and misaligned with user input …
Iti-gen: Inclusive text-to-image generation
Text-to-image generative models often reflect the biases of the training data, leading to
unequal representations of underrepresented groups. This study investigates inclusive text …
unequal representations of underrepresented groups. This study investigates inclusive text …
Grounded text-to-image synthesis with attention refocusing
Driven by the scalable diffusion models trained on large-scale datasets text-to-image
synthesis methods have shown compelling results. However these models still fail to …
synthesis methods have shown compelling results. However these models still fail to …
Compositional text-to-image synthesis with attention map control of diffusion models
Recent text-to-image (T2I) diffusion models show outstanding performance in generating
high-quality images conditioned on textual prompts. However, they fail to semantically align …
high-quality images conditioned on textual prompts. However, they fail to semantically align …