The Penn Arabic Treebank: Building a large-scale annotated Arabic corpus

M Maamouri, A Bies, T Buckwalter… - NEMLAR conference on …, 2004 - marefa.org
From our three year experience of developing a large-scale corpus of annotated Arabic text,
our paper will address the following:(a) review pertinent Arabic language issues as they …

Overview of the SPMRL 2013 shared task: A cross-framework evaluation of parsing morphologically rich languages

D Seddah, R Tsarfaty, S Kübler, M Candito… - Proceedings of the …, 2013 - hal.science
This paper reports on the first shared task on statistical parsing of morphologically rich
languages (MRLs). The task features data sets from nine languages, each available both in …

Machine translation experiments on PADIC: A parallel Arabic dialect corpus

K Meftouh, S Harrat, S Jamoussi… - Proceedings of the …, 2015 - aclanthology.org
We present in this paper PADIC, a Parallel Arabic DIalect Corpus we built from scratch, then
we conducted experiments on cross-dialect Arabic machine translation. PADIC is composed …

Prague Arabic Dependency Treebank: Development in data and tools

J Hajic, O Smrz, P Zemánek… - Proc. of the NEMLAR …, 2004 - catalog.ldc.upenn.edu
Prague Arabic Dependency Treebank not only consists of multi-level linguistic
annotations over the language of Modern Standard Arabic, but even provides a variety of …

Automatic dialect and accent recognition and its application to speech recognition

F Biadsy - 2011 - search.proquest.com
A fundamental challenge for current research on speech science and technology is
understanding and modeling individual variation in spoken language. Individuals have their …

Issues in Arabic orthography and morphology analysis

T Buckwalter - Proceedings of the workshop on computational …, 2004 - aclanthology.org
This paper discusses several issues in Arabic orthography that were encountered in the
process of performing morphology analysis and POS tagging of 542,543 Arabic words in …

A hybrid approach to Arabic named entity recognition

K Shaalan, M Oudah - Journal of Information Science, 2014 - journals.sagepub.com
In this paper, we propose a hybrid named entity recognition (NER) approach that takes the
advantages of rule-based and machine learning-based approaches in order to improve the …

Generating complex morphology for machine translation

E Minkov, K Toutanova, H Suzuki - … of the 45th annual meeting of …, 2007 - aclanthology.org
We present a novel method for predicting inflected word forms for generating
morphologically rich languages in machine translation. We utilize a rich set of syntactic and …

A pipeline Arabic named entity recognition using a hybrid approach

M Oudah, K Shaalan - Proceedings of COLING 2012, 2012 - aclanthology.org
Most Arabic Named Entity Recognition (NER) systems have been developed
using either of two approaches: a rule-based or Machine Learning (ML) based approach …

Creating language resources for under-resourced languages: methodologies, and experiments with Arabic

M El-Haj, U Kruschwitz, C Fox - Language Resources and Evaluation, 2015 - Springer
Language resources are important for those working on computational methods to
analyse and study languages. These resources are needed to help advancing the research …