Typology of risks of generative text-to-image models

C Bird, E Ungless, A Kasirzadeh - Proceedings of the 2023 AAAI/ACM …, 2023 - dl.acm.org
This paper investigates the direct risks and harms associated with modern text-to-image
generative models, such as DALL-E and Midjourney, through a comprehensive literature …

The ethical implications of generative audio models: A systematic literature review

J Barnett - Proceedings of the 2023 AAAI/ACM Conference on AI …, 2023 - dl.acm.org
Generative audio models typically focus their applications in music and speech generation,
with recent models having human-like quality in their audio output. This paper conducts a …

Trustworthy LLMs: A survey and guideline for evaluating large language models' alignment

Y Liu, Y Yao, JF Ton, X Zhang, RGH Cheng… - arXiv preprint arXiv …, 2023 - arxiv.org
Ensuring alignment, which refers to making models behave in accordance with human
intentions [1, 2], has become a critical task before deploying large language models (LLMs) …

Understanding and mitigating copying in diffusion models

G Somepalli, V Singla, M Goldblum… - Advances in …, 2023 - proceedings.neurips.cc
Images generated by diffusion models like Stable Diffusion are increasingly widespread.
Recent works and even lawsuits have shown that these models are prone to replicating their …

Counterfactual memorization in neural language models

C Zhang, D Ippolito, K Lee… - Advances in …, 2023 - proceedings.neurips.cc
Modern neural language models that are widely used in various NLP tasks risk memorizing
sensitive information from their training data. Understanding this memorization is important …

Silo language models: Isolating legal risk in a nonparametric datastore

S Min, S Gururangan, E Wallace, W Shi… - arXiv preprint arXiv …, 2023 - arxiv.org
The legality of training language models (LMs) on copyrighted or otherwise restricted data is
under intense debate. However, as we show, model performance significantly degrades if …

The data provenance initiative: A large scale audit of dataset licensing & attribution in ai

S Longpre, R Mahari, A Chen, N Obeng-Marnu… - arXiv preprint arXiv …, 2023 - arxiv.org
The race to train language models on vast, diverse, and inconsistently documented datasets
has raised pressing concerns about the legal and ethical risks for practitioners. To remedy …

Copyright protection in generative ai: A technical perspective

J Ren, H Xu, P He, Y Cui, S Zeng, J Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Generative AI has witnessed rapid advancement in recent years, expanding their
capabilities to create synthesized content such as text, images, audio, and code. The high …

Attention scheme inspired softmax regression

Y Deng, Z Li, Z Song - arXiv preprint arXiv:2304.10411, 2023 - arxiv.org
Large language models (LLMs) have made transformed changes for human society. One of
the key computation in LLMs is the softmax unit. This operation is important in LLMs …

On the generalization properties of diffusion models

P Li, Z Li, H Zhang, J Bian - Advances in Neural Information …, 2023 - proceedings.neurips.cc
Diffusion models are a class of generative models that serve to establish a stochastic
transport map between an empirically observed, yet unknown, target distribution and a …