Contemporary Amharic corpus: Automatically morpho-syntactically tagged Amharic corpus

AM Gezmu, BE Seyoum, M Gasser… - arXiv preprint arXiv …, 2021 - arxiv.org
We introduced the contemporary Amharic corpus, which is automatically tagged for morpho-
syntactic information. Texts are collected from 25,199 documents from different domains and …

Automated all in one misspelling detection and correction system for Ethiopian languages

WB Demilie, AO Salau - Journal of Cloud Computing, 2022 - Springer
In this paper, a misspelling detection and correction system was developed for Ethiopian
languages (Amharic, Afan Oromo, Tigrinya, Hadiyyisa, Kambatissa, and Awngi). For some of …

Extended parallel corpus for Amharic-English machine translation

AM Gezmu, A Nürnberger, TB Bati - arXiv preprint arXiv:2104.03543, 2021 - arxiv.org
This paper describes the acquisition, preprocessing, segmentation, and alignment of an
Amharic-English parallel corpus. It will be helpful for machine translation of a low-resource …

Beqi: Revitalize the Senegalese Wolof language with a robust spelling corrector

D Mbaye, M Diallo - arXiv preprint arXiv:2305.08518, 2023 - arxiv.org
The progress of Natural Language Processing (NLP), although fast in recent years, is not at
the same pace for all languages. African languages in particular are still behind and lack …

Enhancing Sentiment Analysis in Amharic: Leveraging Transformer-Based Language Model for Low-Resource African Languages

N Raychawdhary, A Das, S Bhattacharya… - SoutheastCon …, 2024 - ieeexplore.ieee.org
One of the most extensively researched applications in natural language processing (NLP)
is sentiment analysis. While the majority of the study focuses on high-resource languages …

Amharic sentence-level word sense disambiguation using transfer learning

N Mossa, M Meshesha - International Conference on Advances of Science …, 2022 - Springer
Word sense disambiguation (WSD) plays an important role in increasing the performance of
NLP applications such as information extraction, information retrieval, and machine …

Manually annotated spelling error corpus for Amharic

AM Gezmu, TT Lema, BE Seyoum… - arXiv preprint arXiv …, 2021 - arxiv.org
This paper presents a manually annotated spelling error corpus for Amharic, lingua franca in
Ethiopia. The corpus is designed to be used for the evaluation of spelling error detection and …

Optimizing Multilingual Sentiment Analysis in Low-Resource Languages with Adaptive Pretraining and Strategic Language Selection

N Raychawdhary, A Das… - 2024 IEEE 3rd …, 2024 - ieeexplore.ieee.org
In the realm of natural language processing (NLP), sentiment analysis has traditionally
focused on resource-rich languages like English, often overlooking the linguistic diversity …

Subword-based Neural Machine Translation for low-resource fusion languages

AM Gezmu - 2023 - repo.bibliothek.uni-halle.de
Neural approaches, which are currently state-of-the-art in many areas, have contributed
significantly to the exciting advancements in machine translation. However, Neural Machine …

Towards Afrocentric natural language processing

I Adebara - 2024 - open.library.ubc.ca
This dissertation centers on Natural Language Processing (NLP) for African languages,
endeavoring to unravel the progress, challenges, and future prospects within this linguistic …