Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1439 | 2023 |
Participatory research for low-resourced machine translation: A case study in african languages W Nekoto, V Marivate, T Matsila, T Fasubaa, T Kolawole, T Fagbohungbe, ... arXiv preprint arXiv:2010.02353, 2020 | 159 | 2020 |
Quality at a glance: An audit of web-crawled multilingual datasets J Kreutzer, I Caswell, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ... Transactions of the Association for Computational Linguistics 10, 50-72, 2022 | 117 | 2022 |
Nisansa de Silva J Kreutzer, I Caswell, L Wang, A Wahab, D Van Esch, N Ulzii-Orshikh, ... Sakine Çabuk Ballı, Stella Biderman, Alessia Battisti, Ahmed Baruwa, Ankur …, 2022 | 91 | 2022 |
MasakhaNER: Named entity recognition for African languages DI Adelani, J Abbott, G Neubig, D D’souza, J Kreutzer, C Lignos, ... Transactions of the Association for Computational Linguistics 9, 1116-1131, 2021 | 82 | 2021 |
Naijasenti: A nigerian twitter sentiment corpus for multilingual sentiment analysis SH Muhammad, DI Adelani, IS Ahmad, I Abdulmumin, BS Bello, ... Proceedings of LREC 2022, 2022 | 74 | 2022 |
Afrisenti: A twitter sentiment analysis benchmark for african languages SH Muhammad, I Abdulmumin, AA Ayele, N Ousidhoum, DI Adelani, ... arXiv preprint arXiv:2302.08956, 2023 | 53 | 2023 |
SemEval-2023 task 12: sentiment analysis for african languages (AfriSenti-SemEval) SH Muhammad, I Abdulmumin, SM Yimam, DI Adelani, IS Ahmad, ... arXiv preprint arXiv:2304.06845, 2023 | 42 | 2023 |
A few thousand translations go a long way! leveraging pre-trained models for african news translation DI Adelani, JO Alabi, A Fan, J Kreutzer, X Shen, M Reid, D Ruiter, ... arXiv preprint arXiv:2205.02022, 2022 | 34 | 2022 |
Quality at a glance: An audit of web-crawled multilingual datasets I Caswell, J Kreutzer, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ... arXiv e-prints, arXiv: 2103.12028, 2021 | 34 | 2021 |
A survey on machine learning techniques in movie revenue prediction IS Ahmad, AA Bakar, MR Yaakub, SH Muhammad SN Computer Science 1 (4), 235, 2020 | 28 | 2020 |
Masakhaner 2.0: Africa-centric transfer learning for named entity recognition DI Adelani, G Neubig, S Ruder, S Rijhwani, M Beukman, C Palen-Michel, ... arXiv preprint arXiv:2210.12391, 2022 | 21 | 2022 |
Nisansa de Silva I Caswell, J Kreutzer, L Wang, A Wahab, D van Esch, N Ulzii-Orshikh, ... Sakine Çabuk Ballı, Stella Biderman, Alessia Battisti, Ahmed Baruwa, Ankur …, 2021 | 20 | 2021 |
SemEval-2024 task 1: Semantic textual relatedness for african and asian languages N Ousidhoum, SH Muhammad, M Abdalla, I Abdulmumin, IS Ahmad, ... Proceedings of the 18th International Workshop on Semantic Evaluation …, 2024 | 16 | 2024 |
Masakhanews: News topic classification for african languages DI Adelani, M Masiak, IA Azime, J Alabi, AL Tonja, C Mwase, O Ogundepo, ... arXiv preprint arXiv:2304.09972, 2023 | 15 | 2023 |
Massive open online courses: awareness, adoption, benefits and challenges in Sub-Saharan Africa SH Muhammad, A Mustapha, K Haruna International Journal of ICT and Managemant 4 (2), 60-68, 2016 | 14 | 2016 |
Bibletts: a large, high-fidelity, multilingual, and uniquely african speech corpus J Meyer, DI Adelani, E Casanova, A Öktem, DWJ Weber, S Kabongo, ... arXiv preprint arXiv:2207.03546, 2022 | 13 | 2022 |
Hausa visual genome: A dataset for multi-modal English to Hausa machine translation I Abdulmumin, SR Dash, MA Dawud, S Parida, SH Muhammad, IS Ahmad, ... arXiv preprint arXiv:2205.01133, 2022 | 11 | 2022 |
SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 14 Languages N Ousidhoum, SH Muhammad, M Abdalla, I Abdulmumin, IS Ahmad, ... arXiv preprint arXiv:2402.08638, 2024 | 9 | 2024 |
Afriqa: Cross-lingual open-retrieval question answering for african languages O Ogundepo, TR Gwadabe, CE Rivera, JH Clark, S Ruder, DI Adelani, ... arXiv preprint arXiv:2305.06897, 2023 | 6 | 2023 |