Finetuned language models are zero-shot learners J Wei, M Bosma, VY Zhao, K Guu, AW Yu, B Lester, N Du, AM Dai, QV Le arXiv preprint arXiv:2109.01652, 2021 | 2528 | 2021 |
Scaling instruction-finetuned language models HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, Y Li, X Wang, ... Journal of Machine Learning Research 25 (70), 1-53, 2024 | 2253 | 2024 |
Lamda: Language models for dialog applications R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ... arXiv preprint arXiv:2201.08239, 2022 | 1311 | 2022 |
Adversarial attacks and defences competition A Kurakin, I Goodfellow, S Bengio, Y Dong, F Liao, M Liang, T Pang, ... The NIPS'17 Competition: Building Intelligent Systems, 195-231, 2018 | 356 | 2018 |
Large dual encoders are generalizable retrievers J Ni, C Qu, J Lu, Z Dai, GH Ábrego, J Ma, VY Zhao, Y Luan, KB Hall, ... arXiv preprint arXiv:2112.07899, 2021 | 277 | 2021 |
Mixture-of-experts with expert choice routing Y Zhou, T Lei, H Liu, N Du, Y Huang, V Zhao, AM Dai, QV Le, J Laudon Advances in Neural Information Processing Systems 35, 7103-7114, 2022 | 173 | 2022 |
Rarr: Researching and revising what language models say, using language models L Gao, Z Dai, P Pasupat, A Chen, AT Chaganty, Y Fan, VY Zhao, N Lao, ... arXiv preprint arXiv:2210.08726, 2022 | 143 | 2022 |
Promptagator: Few-shot dense retrieval from 8 examples Z Dai, VY Zhao, J Ma, Y Luan, J Ni, J Lu, A Bakalov, K Guu, KB Hall, ... arXiv preprint arXiv:2209.11755, 2022 | 142 | 2022 |
Huai hsin Chi, Jeff Dean, Jacob Devlin, Adam Roberts, Denny Zhou, Quoc V HW Chung, L Hou, S Longpre, B Zoph, Y Tay, W Fedus, E Li, X Wang, ... Le, and Jason Wei, 2022 | 79 | 2022 |
Dialog inpainting: Turning documents into dialogs Z Dai, AT Chaganty, VY Zhao, A Amini, QM Rashid, M Green, K Guu International conference on machine learning, 4558-4586, 2022 | 49 | 2022 |
Mixture-of-experts meets instruction tuning: A winning combination for large language models S Shen, L Hou, Y Zhou, N Du, S Longpre, J Wei, HW Chung, B Zoph, ... arXiv preprint arXiv:2305.14705, 2023 | 40 | 2023 |
Dr. icl: Demonstration-retrieved in-context learning M Luo, X Xu, Z Dai, P Pasupat, M Kazemi, C Baral, V Imbrasaite, VY Zhao arXiv preprint arXiv:2305.14128, 2023 | 28 | 2023 |
Identification of patients with carotid stenosis using natural language processing X Wu, Y Zhao, D Radev, A Malhotra European Radiology 30, 4125-4133, 2020 | 28 | 2020 |
Junjiajia Long, Yerkebulan Berdibekov, Takuya Akiba, Seiya Tokui, and Motoki Abe A Kurakin, IJ Goodfellow, S Bengio, Y Dong, F Liao, M Liang, T Pang, ... Adversarial attacks and defences competition. CoRR, abs/1804.00097 5, 7, 2018 | 25 | 2018 |
Flan-moe: Scaling instruction-finetuned language models with sparse mixture of experts S Shen, L Hou, Y Zhou, N Du, S Longpre, J Wei, HW Chung, B Zoph, ... arXiv preprint arXiv:2305.14705 2, 2023 | 22 | 2023 |
Finetuned language models are zero-shot learners. arXiv 2021 J Wei, M Bosma, VY Zhao, K Guu, AW Yu, B Lester, N Du, AM Dai, QV Le arXiv preprint arXiv:2109.01652, 2023 | 22 | 2023 |
Finetuned language models are zero-shot learners, 2021 J Wei, M Bosma, VY Zhao, K Guu, AW Yu, B Lester, N Du, AM Dai, QV Le URL https://openreview. net/forum, 0 | 22 | |
Conditional adapters: Parameter-efficient transfer learning with fast inference T Lei, J Bai, S Brahma, J Ainslie, K Lee, Y Zhou, N Du, V Zhao, Y Wu, B Li, ... Advances in Neural Information Processing Systems 36, 8152-8172, 2023 | 21 | 2023 |
Attributed text generation via post-hoc research and revision L Gao, Z Dai, P Pasupat, A Chen, AT Chaganty, Y Fan, VY Zhao, N Lao, ... arXiv preprint arXiv:2210.08726, 2022 | 18 | 2022 |
Multi-vector retrieval as sparse alignment Y Qian, J Lee, SMK Duddu, Z Dai, S Brahma, I Naim, T Lei, VY Zhao arXiv preprint arXiv:2211.01267, 2022 | 12 | 2022 |