Data Augmentation for Low-Resource Neural Machine Translation M Fadaee, A Bisazza, C Monz Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017 | 574 | 2017 |
Back-translation sampling by targeting difficult words in neural machine translation M Fadaee, C Monz arXiv preprint arXiv:1808.09006, 2018 | 83 | 2018 |
mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset L Henrique Bonifacio, V Jeronymo, H Queiroz Abonizio, I Campiotti, ... arXiv preprint arXiv:2108.13897, 2021 | 77 | 2021 |
InPars: Data augmentation for information retrieval using large language models L Bonifacio, H Abonizio, M Fadaee, R Nogueira arXiv preprint arXiv:2202.05144, 2022 | 74 | 2022 |
InPars: Unsupervised dataset generation for information retrieval L Bonifacio, H Abonizio, M Fadaee, R Nogueira Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022 | 65 | 2022 |
InPars-v2: Large language models as efficient dataset generators for information retrieval V Jeronymo, L Bonifacio, H Abonizio, M Fadaee, R Lotufo, J Zavrel, ... arXiv preprint arXiv:2301.01820, 2023 | 62 | 2023 |
When less is more: Investigating data pruning for pretraining LLMs at scale M Marion, A Üstün, L Pozzobon, A Wang, M Fadaee, S Hooker arXiv preprint arXiv:2309.04564, 2023 | 37 | 2023 |
Examining the tip of the iceberg: A data set for idiom translation M Fadaee, A Bisazza, C Monz arXiv preprint arXiv:1802.04681, 2018 | 37 | 2018 |
Aya model: An instruction finetuned open-access multilingual language model A Üstün, V Aryabumi, ZX Yong, WY Ko, D D'souza, G Onilude, N Bhandari, ... arXiv preprint arXiv:2402.07827, 2024 | 36 | 2024 |
No parameter left behind: How distillation and model size affect zero-shot retrieval GM Rosa, L Bonifacio, V Jeronymo, H Abonizio, M Fadaee, R Lotufo, ... arXiv preprint arXiv:2206.02873, 2022 | 28 | 2022 |
Aya dataset: An open-access collection for multilingual instruction tuning S Singh, F Vargus, D Dsouza, BF Karlsson, A Mahendiran, WY Ko, ... arXiv preprint arXiv:2402.06619, 2024 | 23 | 2024 |
In defense of cross-encoders for zero-shot retrieval G Rosa, L Bonifacio, V Jeronymo, H Abonizio, M Fadaee, R Lotufo, ... arXiv preprint arXiv:2212.06121, 2022 | 20 | 2022 |
Learning Topic-Sensitive Word Representations M Fadaee, A Bisazza, C Monz Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017 | 20 | 2017 |
Back to basics: Revisiting REINFORCE style optimization for learning from human feedback in LLMs A Ahmadian, C Cremer, M Gallé, M Fadaee, J Kreutzer, A Üstün, ... arXiv preprint arXiv:2402.14740, 2024 | 19 | 2024 |
The unreasonable volatility of neural machine translation models M Fadaee, C Monz arXiv preprint arXiv:2005.12398, 2020 | 15 | 2020 |
Aya 23: Open weight releases to further multilingual progress V Aryabumi, J Dang, D Talupuru, S Dash, D Cairuz, H Lin, B Venkitesh, ... arXiv preprint arXiv:2405.15032, 2024 | 14 | 2024 |
Elo uncovered: Robustness and best practices in language model evaluation M Boubdir, E Kim, B Ermis, S Hooker, M Fadaee arXiv preprint arXiv:2311.17295, 2023 | 11 | 2023 |
Data augmentation for low-resource neural machine translation M Fadaee, A Bisazza, C Monz arXiv preprint arXiv:1705.00440, 2017 | 9 | 2017 |
Automatic WordNet Construction Using Markov Chain Monte Carlo M Fadaee, H Ghader, H Faili, A Shakery Polibits, 13-22, 2013 | 7 | 2013 |
A New Neural Search and Insights Platform for Navigating and Organizing AI Research M Fadaee, O Gureenkova, F Rejon-Barrera, C Schnober, W Weerkamp, ... arXiv preprint arXiv:2011.00061, 2020 | 6 | 2020 |