Moroccan dialect-darija-open dataset

A Outchakoucht, H Es-Samaali - arXiv preprint arXiv:2103.09687, 2021 - arxiv.org
Darija Open Dataset (DODa) is an open-source project for the Moroccan dialect. With more
than 10,000 entries DODa is arguably the largest open-source collaborative project for …

Goud. ma: a news article dataset for summarization in Moroccan Darija

A Issam, K Mrini - 3rd Workshop on African Natural Language …, 2021 - openreview.net
Moroccan Darija is a vernacular spoken by over 30 million people primarily in Morocco.
Despite a high number of speakers, it remains a low-resource language. In this paper, we …

Moroccan dialect “Darija” automatic speech recognition: a survey

M Labied, A Belangour - 2021 IEEE 2nd International …, 2021 - ieeexplore.ieee.org
Nowadays, human-machine interaction is growing swiftly, and Automatic Speech
Recognition is gaining immense interest to make the daily routines much easier. This could …

Towards a computational lexicon for Moroccan darija: Words, idioms, and constructions

J Laoudi, C Bonial, L Donatelli, S Tratz… - Proceedings of the …, 2018 - aclanthology.org
In this paper, we explore the challenges of building a computational lexicon for Moroccan
Darija (MD), an Arabic dialect spoken by over 32 million people worldwide but which only …

The Evolution of Darija Open Dataset: Introducing Version 2

A Outchakoucht, H Es-Samaali - arXiv preprint arXiv:2405.13016, 2024 - arxiv.org
Darija Open Dataset (DODa) represents an open-source project aimed at enhancing Natural
Language Processing capabilities for the Moroccan dialect, Darija. With approximately …

Putting figures on influences on moroccan darija from Arabic, French and Spanish using the wordnet

K Mrini, F Bond - Proceedings of the 9th Global Wordnet …, 2018 - aclanthology.org
Moroccan Darija is a variant of Arabic with many influences. Using the Open Multilingual
WordNet (OMW), we compare the lemmas in the Moroccan Darija Wordnet (MDW) with the …

Acquiring Domain-Specific Knowledge for WordNet from a Terminological Database

A Simões, X Gómez Guinovart - 8th Symposium on Languages …, 2019 - drops.dagstuhl.de
In this research we explore a terminological database (Termoteca) in order to expand the
Portuguese and Galician wordnets (PULO and Galnet) with the addition of new synset …