Diffusion models: A comprehensive survey of methods and applications
Diffusion models have emerged as a powerful new family of deep generative models with
record-breaking performance in many applications, including image synthesis, video …
record-breaking performance in many applications, including image synthesis, video …
A complete survey on generative ai (aigc): Is chatgpt from gpt-4 to gpt-5 all you need?
As ChatGPT goes viral, generative AI (AIGC, aka AI-generated content) has made headlines
everywhere because of its ability to analyze and create text, images, and beyond. With such …
everywhere because of its ability to analyze and create text, images, and beyond. With such …
Instructpix2pix: Learning to follow image editing instructions
We propose a method for editing images from human instructions: given an input image and
a written instruction that tells the model what to do, our model follows these instructions to …
a written instruction that tells the model what to do, our model follows these instructions to …
Diffusiondet: Diffusion model for object detection
We propose DiffusionDet, a new framework that formulates object detection as a denoising
diffusion process from noisy boxes to object boxes. During the training stage, object boxes …
diffusion process from noisy boxes to object boxes. During the training stage, object boxes …
ediff-i: Text-to-image diffusion models with an ensemble of expert denoisers
Large-scale diffusion-based generative models have led to breakthroughs in text-
conditioned high-resolution image synthesis. Starting from random noise, such text-to-image …
conditioned high-resolution image synthesis. Starting from random noise, such text-to-image …
Augmented language models: a survey
This survey reviews works in which language models (LMs) are augmented with reasoning
skills and the ability to use tools. The former is defined as decomposing a potentially …
skills and the ability to use tools. The former is defined as decomposing a potentially …
Inference-time intervention: Eliciting truthful answers from a language model
Abstract We introduce Inference-Time Intervention (ITI), a technique designed to enhance
the" truthfulness" of large language models (LLMs). ITI operates by shifting model activations …
the" truthfulness" of large language models (LLMs). ITI operates by shifting model activations …
Rodin: A generative model for sculpting 3d digital avatars using diffusion
This paper presents a 3D diffusion model that automatically generates 3D digital avatars
represented as neural radiance fields (NeRFs). A significant challenge for 3D diffusion is …
represented as neural radiance fields (NeRFs). A significant challenge for 3D diffusion is …
Flow straight and fast: Learning to generate and transfer data with rectified flow
We present rectified flow, a surprisingly simple approach to learning (neural) ordinary
differential equation (ODE) models to transport between two empirically observed …
differential equation (ODE) models to transport between two empirically observed …
A survey on generative diffusion models
Deep generative models have unlocked another profound realm of human creativity. By
capturing and generalizing patterns within data, we have entered the epoch of all …
capturing and generalizing patterns within data, we have entered the epoch of all …