LoRA: Low-Rank Adaptation of Large Language Models EJ Hu, Y Shen, P Wallis, Z Allen-Zhu, Y Li, S Wang, L Wang, W Chen International Conference on Learning Representations, 2022 | 5173 | 2022 |
Tensor Programs IV: Feature learning in infinite-width neural networks G Yang, EJ Hu International Conference on Machine Learning, 11727-11737, 2021 | 256* | 2021 |
Randomized smoothing of all shapes and sizes G Yang, T Duan, EJ Hu, H Salman, I Razenshteyn, J Li International Conference on Machine Learning, 10693-10705, 2020 | 204 | 2020 |
Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation A Poliak, A Haldar, R Rudinger, EJ Hu, E Pavlick, AS White, B Van Durme Proceedings of the 2018 Conference on Empirical Methods in Natural Language …, 2018 | 163 | 2018 |
Tuning large neural networks via zero-shot hyperparameter transfer G Yang, EJ Hu, I Babuschkin, S Sidor, X Liu, D Farhi, N Ryder, J Pachocki, ... Advances in Neural Information Processing Systems 34, 17084-17097, 2021 | 159* | 2021 |
GFlowNet foundations Y Bengio, S Lahlou, T Deleu, EJ Hu, M Tiwari, E Bengio Journal of Machine Learning Research 24 (210), 1-55, 2023 | 152 | 2023 |
Improved Lexically Constrained Decoding for Translation and Monolingual Rewriting EJ Hu, H Khayrallah, R Culkin, P Xia, T Chen, M Post, B Van Durme Proceedings of the 2019 Conference of the North American Chapter of the …, 2019 | 144 | 2019 |
ParaBank: Monolingual bitext generation and sentential paraphrasing via lexically-constrained neural machine translation EJ Hu, R Rudinger, M Post, B Van Durme Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 6521-6528, 2019 | 86 | 2019 |
Large-scale, Diverse, Paraphrastic Bitexts via Sampling and Clustering EJ Hu, A Singh, N Holzenberger, M Post, B Van Durme Proceedings of the 23rd Conference on Computational Natural Language …, 2019 | 67 | 2019 |
GFlowNets and variational inference N Malkin, S Lahlou, T Deleu, X Ji, EJ Hu, K Everett, D Zhang, Y Bengio arXiv preprint arXiv:2210.00580, 2022 | 54 | 2022 |
GFlowNet-EM for learning compositional latent variable models EJ Hu, N Malkin, M Jain, KE Everett, A Graikos, Y Bengio International Conference on Machine Learning, 13528-13549, 2023 | 28 | 2023 |
Amortizing intractable inference in large language models EJ Hu, M Jain, E Elmoznino, Y Kaddar, G Lajoie, Y Bengio, N Malkin arXiv preprint arXiv:2310.04363, 2023 | 18 | 2023 |
Improved Image Wasserstein Attacks and Defenses EJ Hu, A Swaminathan, H Salman, G Yang arXiv preprint arXiv:2004.12478, 2020 | 16 | 2020 |
Efficient computation of deep nonlinear infinite-width neural networks that learn features G Yang, M Santacroce, EJ Hu International Conference on Learning Representations, 2022 | 9 | 2022 |
Iterative paraphrastic augmentation with discriminative span alignment R Culkin, EJ Hu, E Stengel-Eskin, G Qin, B Van Durme Transactions of the Association for Computational Linguistics 9, 494-509, 2021 | 6 | 2021 |
Differentiable Tree Operations Promote Compositional Generalization P Soulos, EJ Hu, K McCurdy, Y Chen, R Fernandez, P Smolensky, J Gao International Conference on Machine Learning, 32499-32520, 2023 | 4 | 2023 |
GFlowNets for Causal Discovery: an Overview DC Manta, EJ Hu, Y Bengio ICML 2023 Workshop on Structured Probabilistic Inference & Generative …, 2023 | 2 | 2023 |
NIST TAC SM-KBP 2019 System Description: JHU/UR Framework. Y Chen, S Ebner, T Chen, P Xia, E Stengel-Eskin, TR Su, EJ Hu, ... TAC, 2019 | 1 | 2019 |