关注
Oam Patel
Oam Patel
在 college.harvard.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Inference-time intervention: Eliciting truthful answers from a language model
K Li, O Patel, F Viégas, H Pfister, M Wattenberg
Advances in Neural Information Processing Systems 36, 2024
1492024
The wmdp benchmark: Measuring and reducing malicious use with unlearning
N Li, A Pan, A Gopal, S Yue, D Berrios, A Gatti, JD Li, AK Dombrowski, ...
arXiv preprint arXiv:2403.03218, 2024
192024
Defending Against Unforeseen Failure Modes with Latent Adversarial Training
S Casper, L Schulze, O Patel, D Hadfield-Menell
arXiv preprint arXiv:2403.05030, 2024
62024
Designing a Dashboard for Transparency and Control of Conversational AI
Y Chen, A Wu, T DePodesta, C Yeh, K Li, NC Marin, O Patel, J Riecke, ...
arXiv preprint arXiv:2406.07882, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–4