Large-scale text-to-image generation models for visual artists' creative works
Large-scale Text-to-image Generation Models (LTGMs) (e.g., DALL-E), self-supervised deep
learning models trained on a huge dataset, have demonstrated the capacity for generating …
Vision-language models in remote sensing: Current progress and future trends
The remarkable achievements of ChatGPT and GPT-4 have sparked a wave of interest and
research in the field of large language models for Artificial General Intelligence (AGI). These …
Scaling up GANs for text-to-image synthesis
The recent success of text-to-image synthesis has taken the world by storm and captured the
general public's imagination. From a technical standpoint, it also marked a drastic change in …
Attend-and-excite: Attention-based semantic guidance for text-to-image diffusion models
Recent text-to-image generative models have demonstrated an unparalleled ability to
generate diverse and creative imagery guided by a target text prompt. While revolutionary …
MasaCtrl: Tuning-free mutual self-attention control for consistent image synthesis and editing
Despite the success in large-scale text-to-image generation and text-conditioned image
editing, existing methods still struggle to produce consistent generation and editing results …
StyleGAN-T: Unlocking the power of GANs for fast large-scale text-to-image synthesis
Text-to-image synthesis has recently seen significant progress thanks to large pretrained
language models, large-scale training data, and the introduction of scalable model families …
Optimizing prompts for text-to-image generation
Well-designed prompts can guide text-to-image models to generate amazing images.
However, the performant prompts are often model-specific and misaligned with user input …
RAPHAEL: Text-to-image generation via large mixture of diffusion paths
Text-to-image generation has recently witnessed remarkable achievements. We introduce a
text-conditional image diffusion model, termed RAPHAEL, to generate highly artistic images …
ERNIE-ViLG 2.0: Improving text-to-image diffusion model with knowledge-enhanced mixture-of-denoising-experts
Recent progress in diffusion models has revolutionized the popular technology of text-to-
image generation. While existing approaches could produce photorealistic high-resolution …
Glaze: Protecting Artists from Style Mimicry by Text-to-Image Models
Recent text-to-image diffusion models such as MidJourney and Stable Diffusion threaten to
displace many in the professional artist community. In particular, models can learn to mimic …