Unsupervised cross-lingual representation learning at scale A Conneau, K Khandelwal, N Goyal, V Chaudhary, G Wenzek, F Guzmán, ... arXiv preprint arXiv:1911.02116, 2019 | 5694 | 2019 |
Beyond English-centric multilingual machine translation A Fan, S Bhosale, H Schwenk, Z Ma, A El-Kishky, S Goyal, M Baines, ... Journal of Machine Learning Research 22 (107), 1-48, 2021 | 689 | 2021 |
CCNet: Extracting high quality monolingual datasets from web crawl data G Wenzek, MA Lachaux, A Conneau, V Chaudhary, F Guzmán, A Joulin, ... arXiv preprint arXiv:1911.00359, 2019 | 544 | 2019 |
No language left behind: Scaling human-centered machine translation MR Costa-jussà, J Cross, O Çelebi, M Elbayad, K Heafield, K Heffernan, ... arXiv preprint arXiv:2207.04672, 2022 | 519 | 2022 |
The FLORES-101 evaluation benchmark for low-resource and multilingual machine translation N Goyal, C Gao, V Chaudhary, PJ Chen, G Wenzek, D Ju, S Krishnan, ... Transactions of the Association for Computational Linguistics 10, 522-538, 2022 | 346 | 2022 |
CCMatrix: Mining billions of high-quality parallel sentences on the web H Schwenk, G Wenzek, S Edunov, E Grave, A Joulin arXiv preprint arXiv:1911.04944, 2019 | 207 | 2019 |
Trans-gram, Fast Cross-lingual Word-embeddings J Coulmance, JM Marty, G Wenzek, A Benhalloum Proceedings of the 2015 Conference on Empirical Methods in Natural Language …, 2015 | 113 | 2015 |
Unsupervised cross-lingual representation learning at scale. CoRR abs/1911.02116 (2019) A Conneau, K Khandelwal, N Goyal, V Chaudhary, G Wenzek, F Guzmán, ... URL: http://arxiv.org/abs/1911.02116, 2019 | 74 | 2019 |
Unsupervised cross-lingual representation learning at scale. arXiv 2019 A Conneau, K Khandelwal, N Goyal, V Chaudhary, G Wenzek, F Guzmán, ... arXiv preprint arXiv:1911.02116, 2019 | 54 | 2019 |
SeamlessM4T-Massively Multilingual & Multimodal Machine Translation L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, PA Duquenne, ... arXiv preprint arXiv:2308.11596, 2023 | 52 | 2023 |
Generating fact checking briefs A Fan, A Piktus, F Petroni, G Wenzek, M Saeidi, A Vlachos, A Bordes, ... arXiv preprint arXiv:2011.05448, 2020 | 52 | 2020 |
Seamless: Multilingual Expressive and Streaming Speech Translation L Barrault, YA Chung, MC Meglioli, D Dale, N Dong, M Duppenthaler, ... arXiv preprint arXiv:2312.05187, 2023 | 43 | 2023 |
Facebook AI's WAT19 Myanmar-English translation task submission PJ Chen, J Shen, M Le, V Chaudhary, A El-Kishky, G Wenzek, M Ott, ... arXiv preprint arXiv:1910.06848, 2019 | 28 | 2019 |
Findings of the WMT 2021 shared task on large-scale multilingual machine translation G Wenzek, V Chaudhary, A Fan, S Gomez, N Goyal, S Jain, D Kiela, ... Proceedings of the Sixth Conference on Machine Translation, 89-99, 2021 | 19 | 2021 |
Findings of the WMT’22 shared task on large-scale machine translation evaluation for African languages D Adelani, MMI Alam, A Anastasopoulos, A Bhagia, MR Costa-jussà, ... Proceedings of the Seventh Conference on Machine Translation (WMT), 773-800, 2022 | 15 | 2022 |
How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training? S Zhang, V Chaudhary, N Goyal, J Cross, G Wenzek, M Bansal, ... arXiv preprint arXiv:2204.14268, 2022 | 11 | 2022 |
Analyse d’opinions de tweets par réseaux de neurones convolutionnels JM Marty, G Wenzek, E Schmitt, J Coulmance Actes de DEFT, Caen, France: TALN, 2015 | 6 | 2015 |
stopes-Modular Machine Translation Pipelines P Andrews, G Wenzek, K Heffernan, O Çelebi, A Sun, A Kamran, Y Guo, ... Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 2 | 2022 |
Method for automatically constructing inter-language queries for a search engine G Wenzek, J Coulmance, JM Marty US Patent 11,055,370, 2021 | | 2021 |