Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model | S Smith, M Patwary, B Norick, P LeGresley, S Rajbhandari, J Casper, ... | arXiv preprint arXiv:2201.11990, 2022 | 676* | 2022 |
Self-refine: Iterative refinement with self-feedback | A Madaan, N Tandon, P Gupta, S Hallinan, L Gao, S Wiegreffe, U Alon, ... | Advances in Neural Information Processing Systems 36, 2024 | 649 | 2024 |
Style transfer through back-translation | S Prabhumoye, Y Tsvetkov, R Salakhutdinov, AW Black | arXiv preprint arXiv:1804.09000, 2018 | 473 | 2018 |
The second conversational intelligence challenge (convai2) | E Dinan, V Logacheva, V Malykh, A Miller, K Shuster, J Urbanek, D Kiela, ... | The NeurIPS'18 Competition: From Machine Learning to Intelligent …, 2020 | 373 | 2020 |
A dataset for document grounded conversations | K Zhou, S Prabhumoye, AW Black | arXiv preprint arXiv:1809.07358, 2018 | 248 | 2018 |
Five sources of bias in natural language processing | D Hovy, S Prabhumoye | Language and Linguistics Compass 15 (8), e12432, 2021 | 230 | 2021 |
Politeness transfer: A tag and generate approach | A Madaan, A Setlur, T Parekh, B Poczos, G Neubig, Y Yang, ... | arXiv preprint arXiv:2004.14257, 2020 | 185 | 2020 |
Exploring controllable text generation techniques | S Prabhumoye, AW Black, R Salakhutdinov | arXiv preprint arXiv:2005.01822, 2020 | 92 | 2020 |
Equity beyond bias in language technologies for education | E Mayfield, M Madaio, S Prabhumoye, D Gerritsen, B McLaughlin, ... | Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building …, 2019 | 59 | 2019 |
Topological sort for sentence ordering | S Prabhumoye, R Salakhutdinov, AW Black | arXiv preprint arXiv:2005.00432, 2020 | 46 | 2020 |
Focused Attention Improves Document-Grounded Generation | S Prabhumoye, K Hashimoto, Y Zhou, AW Black, R Salakhutdinov | arXiv preprint arXiv:2104.12714, 2021 | 42 | 2021 |
SPRING: GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning | Y Wu, SY Min, S Prabhumoye, Y Bisk, R Salakhutdinov, A Azaria, ... | arXiv preprint arXiv:2305.15486, 2023 | 39* | 2023 |
“My Way of Telling a Story”: Persona based Grounded Story Generation | K Chandu, S Prabhumoye, R Salakhutdinov, AW Black | Proceedings of the Second Workshop on Storytelling, 11-21, 2019 | 35* | 2019 |
Multi-Stage Prompting for Knowledgeable Dialogue Generation | Z Liu, M Patwary, R Prenger, S Prabhumoye, W Ping, M Shoeybi, ... | arXiv preprint arXiv:2203.08745, 2022 | 34 | 2022 |
Towards content transfer through grounded text generation | S Prabhumoye, C Quirk, M Galley | arXiv preprint arXiv:1905.05293, 2019 | 32 | 2019 |
Generating interactive worlds with text | A Fan, J Urbanek, P Ringshia, E Dinan, E Qian, S Karamcheti, ... | Proceedings of the AAAI Conference on Artificial Intelligence 34 (02), 1693-1700, 2020 | 26 | 2020 |
Plan, Eliminate, and Track--Language Models are Good Teachers for Embodied Agents | Y Wu, SY Min, Y Bisk, R Salakhutdinov, A Azaria, Y Li, T Mitchell, ... | arXiv preprint arXiv:2305.02412, 2023 | 24 | 2023 |
Case study: Deontological ethics in NLP | S Prabhumoye, B Boldt, R Salakhutdinov, AW Black | arXiv preprint arXiv:2010.04658, 2020 | 24 | 2020 |
The first conversational intelligence challenge | M Burtsev, V Logacheva, V Malykh, IV Serban, R Lowe, S Prabhumoye, ... | The NIPS'17 Competition: Building Intelligent Systems, 25-46, 2018 | 24 | 2018 |
AutoBiasTest: Controllable Sentence Generation for Automated and Open-Ended Social Bias Testing in Language Models | R Kocielnik, S Prabhumoye, V Zhang, RM Alvarez, A Anandkumar | arXiv preprint arXiv:2302.07371, 2023 | 21* | 2023 |