Parts-of-speech tagging of Nepali texts with Bidirectional LSTM, Conditional Random Fields and HMM

A Pradhan, A Yajnik - Multimedia Tools and Applications, 2024 - Springer
Abstract Parts-of-Speech (POS) Tagging is one of the fundamental and pre-processing steps
for Natural Language Processing (NLP) tasks such as Text Summarization, Name Entity …

Robust French syntax analysis: reconciling statistical methods and linguistic knowledge in the Talismane toolkit

A Urieli - 2013 - theses.hal.science
In this thesis we explore robust statistical syntax analysis for French. Our main concern is to
explore methods whereby the linguist can inject linguistic knowledge and/or resources into …

A robust transformation-based learning approach using ripple down rules for part-of-speech tagging

DQ Nguyen, DQ Nguyen, DD Pham… - AI …, 2016 - content.iospress.com
In this paper, we propose a new approach to construct a system of transformation rules for
the Part-of-Speech (POS) tagging task. Our approach is based on an incremental …

Crowdsourcing complex language resources: Playing to annotate dependency syntax

B Guillaume, K Fort, N Lefebvre - International Conference on …, 2016 - inria.hal.science
This article presents the results we obtained on a complex annotation task (that of
dependency syntax) using a specifically designed Game with a Purpose, ZombiLingo. We …

Socioeconomic dependencies of linguistic patterns in twitter: A multivariate analysis

JL Abitbol, M Karsai, JP Magué, JP Chevrot… - Proceedings of the 2018 …, 2018 - dl.acm.org
Our usage of language is not solely reliant on cognition but is arguably determined by
myriad external factors leading to a global variability of linguistic patterns. This issue, which …

Corpus vs. lexicon supervision in morphosyntactic tagging: the case of Slovene

N Ljubešić, T Erjavec - … of the Tenth International Conference on …, 2016 - aclanthology.org
In this paper we present a tagger developed for inflectionally rich languages for which both a
training corpus and a lexicon are available. We do not constrain the tagger by the lexicon …

The CoMeRe corpus for French: structuring and annotating heterogeneous CMC genres

T Chanier, C Poudat, B Sagot, G Antoniadis… - Journal for language …, 2014 - shs.hal.science
The CoMeRe project aims to build a kernel corpus of different Computer-Mediated Com-
munication (CMC) genres with interactions in French as the main language, by assembling …

TCOF-POS: un corpus libre de français parlé annoté en morphosyntaxe

C Benzitoun, K Fort, B Sagot - JEP-TALN 2012-Journées d'Études sur …, 2012 - hal.science
Résumé This article details the creation of TCOF-POS, the first freely available corpus of
spontaneous spoken French. We present here the methodology that was followed in order to …

New ways of analyzing complementizer drop in Montréal French: Exploration of cognitive factors

Y Liang, P Amsili, H Burnett - Language Variation and Change, 2021 - cambridge.org
In this paper, we return to the well-studied yet still puzzling phenomenon of complementizer
omission in a large spoken corpus of Quebec French, with the help of modern computational …

Applying phraseological complexity measures to L2 French: A partial replication study

N Vandeweerd, A Housen… - International Journal of …, 2021 - jbe-platform.com
This study partially replicates Paquot's (,) study of phraseological complexity in L2 English
by investigating how phraseological complexity compares across proficiency levels as well …