An Empirical Study of Memorization in NLP | X Zheng, J Jiang | ACL 2022, 2022 | 12 | 2022 |
Intriguing Properties of Data Attribution on Diffusion Models | X Zheng, T Pang, C Du, J Jiang, M Lin | ICLR 2024, 2023 | 10 | 2023 |
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast | X Gu*, X Zheng*, T Pang*, C Du, Q Liu, Y Wang, J Jiang, ... | ICML 2024, 2024 | 6 | 2024 |
Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses | X Zheng, T Pang, C Du, Q Liu, J Jiang, M Lin | NextGenAISafety @ ICML 2024, 2024 | 2 | 2024 |
RegMix: Data Mixture as Regression for Language Model Pre-training | Q Liu*, X Zheng*, N Muennighoff, G Zeng, L Dou, T Pang, J Jiang, ... | https://arxiv.org/abs/2407.01492, 2024 | | 2024 |