Machine learning for synthetic data generation: a review

Y Lu, M Shen, H Wang, X Wang, C van Rechem… - arXiv preprint arXiv …, 2023 - arxiv.org
Machine learning heavily relies on data, but real-world applications often encounter various
data-related issues. These include data of poor quality, insufficient data points leading to …

Differentially private diffusion models

T Dockhorn, T Cao, A Vahdat, K Kreis - arXiv preprint arXiv:2210.09929, 2022 - arxiv.org
While modern machine learning models rely on increasingly large training datasets, data is
often limited in privacy-sensitive domains. Generative models trained with differential privacy …

Sok: Privacy-preserving data synthesis

Y Hu, F Wu, Q Li, Y Long, GM Garrido… - … IEEE Symposium on …, 2024 - ieeexplore.ieee.org
As the prevalence of data analysis grows, safeguarding data privacy has become a
paramount concern. Consequently, there has been an upsurge in the development of …

Differentially private latent diffusion models

MF Liu, S Lyu, M Vinaroz, M Park - arXiv preprint arXiv:2305.15759, 2023 - arxiv.org
Diffusion models (DMs) are one of the most widely used generative models for producing
high quality images. However, a flurry of recent papers points out that DMs are least private …

Differentially private synthetic data via foundation model apis 1: Images

Z Lin, S Gopi, J Kulkarni, H Nori, S Yekhanin - arXiv preprint arXiv …, 2023 - arxiv.org
Generating differentially private (DP) synthetic data that closely resembles the original
private data is a scalable way to mitigate privacy concerns in the current data-driven world …

Pre-trained perceptual features improve differentially private image generation

F Harder, MJ Asadabadi, DJ Sutherland… - arXiv preprint arXiv …, 2022 - arxiv.org
Training even moderately-sized generative models with differentially-private stochastic
gradient descent (DP-SGD) is difficult: the required level of noise for reasonable levels of …

Private gans, revisited

A Bie, G Kamath, G Zhang - arXiv preprint arXiv:2302.02936, 2023 - arxiv.org
We show that the canonical approach for training differentially private GANs--updating the
discriminator with differentially private stochastic gradient descent (DPSGD)--can yield …

A unified view of differentially private deep generative modeling

D Chen, R Kerkouche, M Fritz - arXiv preprint arXiv:2309.15696, 2023 - arxiv.org
The availability of rich and vast data sources has greatly advanced machine learning
applications in various domains. However, data with privacy concerns comes with stringent …

Functional Rényi differential privacy for generative modeling

D Jiang, S Sun, Y Yu - Advances in Neural Information …, 2023 - proceedings.neurips.cc
Differential privacy (DP) has emerged as a rigorous notion to quantify data privacy.
Subsequently, Renyi differential privacy (RDP) becomes an alternative to the ordinary DP …

Dynamic differential privacy-based dataset condensation

Z Wu, X Gao, Y Qian, Y Hao, M Chen - Neurocomputing, 2024 - Elsevier
With the continuous expansion of data scale, data condensation technology has emerged as
a means to reduce costs related to storage, time, and energy consumption. Data …