Word sense disambiguation: a uinified evaluation framework and empirical comparison

A Raganato, J Camacho-Collados… - Proceedings of the 15th …, 2017 - iris.uniroma1.it
Abstract Word Sense Disambiguation is a longstanding task in Natural Language
Processing, lying at the core of human language understanding. However, the evaluation of …

Mulan: Multilingual label propagation for word sense disambiguation

E Barba, L Procopio, N Campolungo… - Proceedings of the …, 2020 - iris.uniroma1.it
The knowledge acquisition bottleneck strongly affects the creation of multilingual sense-
annotated data, hence limiting the power of supervised systems when applied to multilingual …

[PDF][PDF] The knowledge acquisition bottleneck problem in multilingual word sense disambiguation

T Pasini - Proceedings of the Twenty-Ninth International …, 2021 - ijcai.org
Abstract Word Sense Disambiguation (WSD) is the task of identifying the meaning of a word
in a given context. It lies at the base of Natural Language Processing as it provides semantic …

Towards a seamless integration of word senses into downstream NLP applications

MT Pilehvar, J Camacho-Collados, R Navigli… - arXiv preprint arXiv …, 2017 - arxiv.org
Lexical ambiguity can impede NLP systems from accurate understanding of semantics.
Despite its potential benefits, the integration of sense-level information into NLP systems has …

Eurosense: Automatic harvesting of multilingual sense annotations from parallel text

CD Bovi, J Camacho-Collados… - Proceedings of the …, 2017 - aclanthology.org
Parallel corpora are widely used in a variety of Natural Language Processing tasks, from
Machine Translation to cross-lingual Word Sense Disambiguation, where parallel sentences …

[HTML][HTML] Train-o-matic: Supervised word sense disambiguation with no (manual) effort

T Pasini, R Navigli - Artificial Intelligence, 2020 - Elsevier
Abstract Word Sense Disambiguation (WSD) is the task of associating the correct meaning
with a word in a given context. WSD provides explicit semantic information that is beneficial …

Goldfish: Monolingual Language Models for 350 Languages

TA Chang, C Arnett, Z Tu, BK Bergen - arXiv preprint arXiv:2408.10441, 2024 - arxiv.org
For many low-resource languages, the only available language models are large
multilingual models trained on many languages simultaneously. However, using FLORES …

EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models

S Ji, Z Li, I Paul, J Paavola, P Lin, P Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
In this work, we introduce EMMA-500, a large-scale multilingual language model continue-
trained on texts across 546 languages designed for enhanced multilingual performance …

A short survey on sense-annotated corpora

T Pasini, J Camacho-Collados - arXiv preprint arXiv:1802.04744, 2018 - arxiv.org
Large sense-annotated datasets are increasingly necessary for training deep supervised
systems in Word Sense Disambiguation. However, gathering high-quality sense-annotated …

A proposed scheme for sentiment analysis: Effective feature reduction based on statistical information of SentiWordNet

S Tofighy, SM Fakhrahmad - Kybernetes, 2018 - emerald.com
Purpose This paper aims to propose a statistical and context-aware feature reduction
algorithm that improves sentiment classification accuracy. Classification of reviews with …