Scaling instruction-finetuned language models HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ... Journal of Machine Learning Research 25 (70), 1-53, 2024 | 2124 | 2024 |
Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1335 | 2023 |
The flan collection: Designing data and methods for effective instruction tuning S Longpre, L Hou, T Vu, A Webson, HW Chung, Y Tay, D Zhou, QV Le, ... ICML 2023, 2023 | 435 | 2023 |
Question rewriting for conversational question answering S Vakulenko, S Longpre, Z Tu, R Anantha WSDM 2021, 355-363, 2021 | 152 | 2021 |
Open-domain question answering goes conversational via question rewriting R Anantha, S Vakulenko, Z Tu, S Longpre, S Pulman, S Chappidi NAACL 2021, 2020 | 143 | 2020 |
Entity-based knowledge conflicts in question answering S Longpre, K Perisetla, A Chen, N Ramesh, C DuBois, S Singh EMNLP 2021, 2021 | 141 | 2021 |
The bigscience roots corpus: A 1.6 tb composite multilingual dataset H Laurençon, L Saulnier, T Wang, C Akiki, A Villanova del Moral, ... Advances in Neural Information Processing Systems 35, 31809-31826, 2022 | 133 | 2022 |
MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering S Longpre, Y Lu, J Daiber TACL 2021, Vol 9, 2020 | 117 | 2020 |
Octopack: Instruction tuning code large language models N Muennighoff, Q Liu, A Zebaze, Q Zheng, B Hui, TY Zhuo, S Singh, ... arXiv preprint arXiv:2308.07124, 2023 | 94 | 2023 |
How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers? S Longpre, Y Wang, C DuBois Findings of the Association for Computational Linguistics: EMNLP 2020, 2020 | 92 | 2020 |
You reap what you sow: On the challenges of bias evaluation under multilingual settings Z Talat, A Névéol, S Biderman, M Clinciu, M Dey, S Longpre, S Luccioni, ... Proceedings of BigScience Episode# 5--Workshop on Challenges & Perspectives …, 2022 | 81 | 2022 |
Huai hsin Chi, Jeff Dean, Jacob Devlin, Adam Roberts, Denny Zhou, Quoc V HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, E Li, X Wang, ... Le, and Jason Wei, 2022 | 77 | 2022 |
A pretrainer's guide to training data: Measuring the effects of data age, domain coverage, quality, & toxicity S Longpre, G Yauney, E Reif, K Lee, A Roberts, B Zoph, D Zhou, J Wei, ... arXiv preprint arXiv:2305.13169, 2023 | 58 | 2023 |
Prometheus: Inducing fine-grained evaluation capability in language models S Kim, J Shin, Y Cho, J Jang, S Longpre, H Lee, S Yun, S Shin, S Kim, ... The Twelfth International Conference on Learning Representations, 2023 | 55 | 2023 |
An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering S Longpre, Y Lu, Z Tu, C DuBois Proceedings of the 2nd Workshop on Machine Reading for Question Answering …, 2019 | 50 | 2019 |
The foundation model transparency index R Bommasani, K Klyman, S Longpre, S Kapoor, N Maslej, B Xiong, ... arXiv preprint arXiv:2310.12941, 2023 | 42 | 2023 |
Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP A Chen, P Gudipati, S Longpre, X Ling, S Singh ACL 2021, 2021 | 39 | 2021 |
Mixture-of-experts meets instruction tuning: A winning combination for large language models S Shen, L Hou, Y Zhou, N Du, S Longpre, J Wei, HW Chung, B Zoph, ... arXiv preprint arXiv:2305.14705, 2023 | 37 | 2023 |
Aya model: An instruction finetuned open-access multilingual language model A Üstün, V Aryabumi, ZX Yong, WY Ko, D D'souza, G Onilude, N Bhandari, ... arXiv preprint arXiv:2402.07827, 2024 | 36 | 2024 |
A comparison of question rewriting methods for conversational passage retrieval S Vakulenko, N Voskarides, Z Tu, S Longpre Proceedings of the 43rd European Conference on IR Research, ECIR 2021, 2021 | 34 | 2021 |