Title | Authors | Venue | Cited by | Year |
Jailbroken: How does LLM safety training fail? | A Wei, N Haghtalab, J Steinhardt | Advances in Neural Information Processing Systems 36, 2024 | 342 | 2024 |
Human-level play in the game of Diplomacy by combining language models with strategic reasoning | Meta Fundamental AI Research Diplomacy Team (FAIR), A Bakhtin, ... | Science 378 (6624), 1067-1074, 2022 | 173 | 2022 |
Optimal Robustness-Consistency Trade-offs for Learning-Augmented Online Algorithms | A Wei, F Zhang | Advances in Neural Information Processing Systems, 8042-8053, 2020 | 80 | 2020 |
Better and Simpler Learning-Augmented Online Caching | A Wei | Approximation, Randomization, and Combinatorial Optimization 176, 60:1-60:17, 2020 | 69 | 2020 |
More than a toy: Random matrix models predict how real-world neural representations generalize | A Wei, W Hu, J Steinhardt | International Conference on Machine Learning, 23549-23588, 2022 | 53 | 2022 |
Learning equilibria in matching markets with bandit feedback | M Jagadeesan, A Wei, Y Wang, MI Jordan, J Steinhardt | Journal of the ACM 70 (3), 1-46, 2023 | 36 | 2023 |
Predicting out-of-distribution error with the projection norm | Y Yu, Z Yang, A Wei, Y Ma, J Steinhardt | International Conference on Machine Learning, 25721-25746, 2022 | 32 | 2022 |
Learning in Stackelberg games with non-myopic agents | N Haghtalab, T Lykouris, S Nietert, A Wei | Proceedings of the 23rd ACM Conference on Economics and Computation, 917-918, 2022 | 20 | 2022 |
Designing approximately optimal search on matching platforms | N Immorlica, B Lucier, V Manshadi, A Wei | Proceedings of the 22nd ACM Conference on Economics and Computation, 632-633, 2021 | 20 | 2021 |
TCT: Convexifying federated learning using bootstrapped neural tangent kernels | Y Yu, A Wei, SP Karimireddy, Y Ma, M Jordan | Advances in Neural Information Processing Systems 35, 30882-30897, 2022 | 18 | 2022 |
Allocation for Social Good: Auditing Mechanisms for Utility Maximization | T Lundy, A Wei, H Fu, SD Kominers, K Leyton-Brown | Proceedings of the 2019 ACM Conference on Economics and Computation, 785-803, 2019 | 15 | 2019 |
An Interscholastic Network To Generate LexA Enhancer Trap Lines in Drosophila | L Kockel, C Griffin, Y Ahmed, L Fidelak, A Rajan, EP Gould, M Haigney, ... | G3: Genes, Genomes, Genetics 9 (7), 2097-2106, 2019 | 11 | 2019 |
Optimal Las Vegas Approximate Near Neighbors in ℓp | A Wei | Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete …, 2019 | 9 | 2019 |
Varying the Number of Signals in Matching Markets | M Jagadeesan, A Wei | International Conference on Web and Internet Economics, 232-245, 2018 | 9 | 2018 |
Learning and Decision-Making in Complex Environments | A Wei | UC Berkeley, 2023 | | 2023 |
Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation | D Halawi, A Wei, E Wallace, TT Wang, N Haghtalab, J Steinhardt | Forty-first International Conference on Machine Learning | | |