Patch diffusion: Faster and more data-efficient training of diffusion models

Z Wang, Y Jiang, H Zheng, P Wang… - Advances in neural …, 2024 - proceedings.neurips.cc
Diffusion models are powerful, but they require a lot of time and data to train. We propose
Patch Diffusion, a generic patch-wise training framework, to significantly reduce the training …

Alias-free generative adversarial networks

T Karras, M Aittala, S Laine… - Advances in neural …, 2021 - proceedings.neurips.cc
We observe that despite their hierarchical convolutional nature, the synthesis process of
typical generative adversarial networks depends on absolute pixel coordinates in an …

Neural fields in visual computing and beyond

Y Xie, T Takikawa, S Saito, O Litany… - Computer Graphics …, 2022 - Wiley Online Library
Recent advances in machine learning have led to increased interest in solving visual
computing problems using methods that employ coordinate‐based neural networks. These …

Stylegan-v: A continuous video generator with the price, image quality and perks of stylegan2

I Skorokhodov, S Tulyakov… - Proceedings of the …, 2022 - openaccess.thecvf.com
Videos show continuous events, yet most--if not all--video synthesis frameworks treat them
discretely in time. In this work, we think of videos of what they should be--time-continuous …

Epigraf: Rethinking training of 3d gans

I Skorokhodov, S Tulyakov, Y Wang… - Advances in Neural …, 2022 - proceedings.neurips.cc
A recent trend in generative modeling is building 3D-aware generators from 2D image
collections. To induce the 3D bias, such models typically rely on volumetric rendering, which …

From data to functa: Your data point is a function and you can treat it like one

E Dupont, H Kim, SM Eslami, D Rezende… - arXiv preprint arXiv …, 2022 - arxiv.org
It is common practice in deep learning to represent a measurement of the world on a
discrete grid, eg a 2D grid of pixels. However, the underlying signal represented by these …

Generating videos with dynamics-aware implicit generative adversarial networks

S Yu, J Tack, S Mo, H Kim, J Kim, JW Ha… - arXiv preprint arXiv …, 2022 - arxiv.org
In the deep learning era, long video generation of high-quality still remains challenging due
to the spatio-temporal complexity and continuity of videos. Existing prior works have …

Implicit neural representations for image compression

Y Strümpler, J Postels, R Yang, LV Gool… - European Conference on …, 2022 - Springer
Abstract Implicit Neural Representations (INRs) gained attention as a novel and effective
representation for various data types. Recently, prior work applied INRs to image …

Vidm: Video implicit diffusion models

K Mei, V Patel - Proceedings of the AAAI conference on artificial …, 2023 - ojs.aaai.org
Diffusion models have emerged as a powerful generative method for synthesizing high-
quality and diverse set of images. In this paper, we propose a video generation method …

Diffcollage: Parallel generation of large content with diffusion models

Q Zhang, J Song, X Huang, Y Chen… - 2023 IEEE/CVF …, 2023 - ieeexplore.ieee.org
We present DiffCollage, a compositional diffusion model that can generate large content by
leveraging diffusion models trained on generating pieces of the large content. Our approach …