Opt: Open pre-trained transformer language models S Zhang, S Roller, N Goyal, M Artetxe, M Chen, S Chen, C Dewan, ... arXiv preprint arXiv:2205.01068, 2022 | 1884 | 2022 |
Multilingual denoising pre-training for neural machine translation Y Liu, J Gu, N Goyal, X Li, S Edunov, M Ghazvininejad, M Lewis, ... Transactions of the Association for Computational Linguistics 8, 726-742, 2020 | 1628 | 2020 |
Xi Victoria Lin, Todor Mihaylov, Myle Ott, Sam Shleifer, Kurt Shuster, Daniel Simig, Punit Singh Koura, Anjali Sridhar, Tianlu Wang, and Luke Zettlemoyer. 2022 S Zhang, S Roller, N Goyal, M Artetxe, M Chen, S Chen, C Dewan, ... Opt: Open pretrained transformer language models 1, 2022 | 683 | 2022 |
Multilingual translation with extensible multilingual pretraining and finetuning Y Tang, C Tran, X Li, PJ Chen, N Goyal, V Chaudhary, J Gu, A Fan arXiv preprint arXiv:2008.00401, 2020 | 335 | 2020 |
Few-shot learning with multilingual language models XV Lin, T Mihaylov, M Artetxe, T Wang, S Chen, D Simig, M Ott, N Goyal, ... arXiv preprint arXiv:2112.10668, 2021 | 321* | 2021 |
TWC LOGD: A portal for linked open government data ecosystems L Ding, T Lebo, JS Erickson, D DiFranzo, GT Williams, X Li, J Michaelis, ... Journal of Web Semantics 9 (3), 325-333, 2011 | 260 | 2011 |
Flowseq: Non-autoregressive conditional sequence generation with generative flow X Ma, C Zhou, X Li, G Neubig, E Hovy arXiv preprint arXiv:1909.02480, 2019 | 203 | 2019 |
A corpus for multilingual document classification in eight languages H Schwenk, X Li arXiv preprint arXiv:1805.09821, 2018 | 158 | 2018 |
On evaluation of adversarial perturbations for sequence-to-sequence models P Michel, X Li, G Neubig, JM Pino arXiv preprint arXiv:1903.06620, 2019 | 142 | 2019 |
Multilingual speech translation with efficient finetuning of pretrained models X Li, C Wang, Y Tang, C Tran, Y Tang, J Pino, A Baevski, A Conneau, ... arXiv preprint arXiv:2010.12829, 2020 | 132 | 2020 |
Data-gov wiki: Towards linking government data L Ding, D DiFranzo, A Graves, JR Michaelis, X Li, DL McGuinness, ... 2010 AAAI spring symposium series, 2010 | 126 | 2010 |
Multilingual translation from denoising pre-training Y Tang, C Tran, X Li, PJ Chen, N Goyal, V Chaudhary, J Gu, A Fan Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021 | 125 | 2021 |
Self-rewarding language models W Yuan, RY Pang, K Cho, S Sukhbaatar, J Xu, J Weston arXiv preprint arXiv:2401.10020, 2024 | 120 | 2024 |
Self-alignment with instruction backtranslation X Li, P Yu, C Zhou, T Schick, L Zettlemoyer, O Levy, J Weston, M Lewis arXiv preprint arXiv:2308.06259, 2023 | 107 | 2023 |
Lifting the curse of multilinguality by pre-training modular transformers J Pfeiffer, N Goyal, XV Lin, X Li, J Cross, S Riedel, M Artetxe arXiv preprint arXiv:2205.06266, 2022 | 90 | 2022 |
TWC data-gov corpus: incrementally generating linked government data from data. gov L Ding, D DiFranzo, A Graves, JR Michaelis, X Li, DL McGuinness, ... Proceedings of the 19th international conference on World Wide Web, 1383-1386, 2010 | 81 | 2010 |
Efficient large scale language modeling with mixtures of experts M Artetxe, S Bhosale, N Goyal, T Mihaylov, M Ott, S Shleifer, XV Lin, J Du, ... arXiv preprint arXiv:2112.10684, 2021 | 80 | 2021 |
Jingfei Du, et al. 2021. Few-shot learning with multilingual language models XV Lin, T Mihaylov, M Artetxe, T Wang, S Chen, D Simig, M Ott, N Goyal, ... arXiv preprint arXiv:2112.10668, 35-40, 2021 | 80 | 2021 |
Cross-lingual retrieval for iterative self-supervised training C Tran, Y Tang, X Li, J Gu Advances in Neural Information Processing Systems 33, 2207-2219, 2020 | 68 | 2020 |
Findings of the first shared task on machine translation robustness X Li, P Michel, A Anastasopoulos, Y Belinkov, N Durrani, O Firat, P Koehn, ... arXiv preprint arXiv:1906.11943, 2019 | 67 | 2019 |