Crosslingual generalization through multitask finetuning

N Muennighoff, T Wang, L Sutawika, A Roberts… - arXiv preprint arXiv …, 2022 - arxiv.org
Multitask prompted finetuning (MTF) has been shown to help large language models
generalize to new tasks in a zero-shot setting, but so far explorations of MTF have focused …

Language models are multilingual chain-of-thought reasoners

F Shi, M Suzgun, M Freitag, X Wang, S Srivats… - arXiv preprint arXiv …, 2022 - arxiv.org
We evaluate the reasoning abilities of large language models in multilingual settings. We
introduce the Multilingual Grade School Math (MGSM) benchmark, by manually translating …

Recent trends in word sense disambiguation: A survey

M Bevilacqua, T Pasini… - … Joint Conference on …, 2021 - researchportal.helsinki.fi
Abstract Word Sense Disambiguation (WSD) aims at making explicit the semantics of a word
in context by identifying the most suitable meaning from a predefined sense inventory …

State-of-the-art generalisation research in NLP: a taxonomy and review

D Hupkes, M Giulianelli, V Dankers, M Artetxe… - arXiv preprint arXiv …, 2022 - arxiv.org
The ability to generalise well is one of the primary desiderata of natural language
processing (NLP). Yet, what 'good generalisation' entails and how it should be evaluated is …

XL-LEXEME: WiC pretrained model for cross-lingual LEXical sEMantic changE

P Cassotti, L Siciliani, M DeGemmis… - Proceedings of the …, 2023 - aclanthology.org
The recent introduction of large-scale datasets for the WiC (Word in Context) task enables
the creation of more reliable and meaningful contextualized word embeddings. However …

Aya model: An instruction finetuned open-access multilingual language model

A Üstün, V Aryabumi, ZX Yong, WY Ko… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent breakthroughs in large language models (LLMs) have centered around a handful of
data-rich languages. What does it take to broaden access to breakthroughs beyond first …

Ten years of BabelNet: A survey

R Navigli, M Bevilacqua, S Conia, D Montagnini… - IJCAI, 2021 - iris.uniroma1.it
The intelligent manipulation of symbolic knowledge has been a long-sought goal of AI.
However, when it comes to Natural Language Processing (NLP), symbols have to be …

WiC-ITA at EVALITA2023: Overview of the EVALITA2023 Word-in-Context for ITAlian Task.

P Cassotti, L Siciliani, LC Passaro, M Gatto, P Basile - EVALITA, 2023 - ceur-ws.org
WiC-ita is a shared task proposed at the EVALITA 2023 campaign. The task focuses on the
meaning of words in specific contexts and has been modelled as both a binary classification …

XL-WSD: An extra-large and cross-lingual evaluation framework for word sense disambiguation

T Pasini, A Raganato, R Navigli - … of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org
Transformer-based architectures brought a breeze of change to Word Sense
Disambiguation (WSD), improving models' performances by a large margin. The fast …

Analysis and evaluation of language models for word sense disambiguation

D Loureiro, K Rezaee, MT Pilehvar… - Computational …, 2021 - direct.mit.edu
Transformer-based language models have taken many fields in NLP by storm. BERT and its
derivatives dominate most of the existing evaluation benchmarks, including those for Word …