OCNLI: Original Chinese natural language inference

H Hu, K Richardson, L Xu, L Li, S Kübler… - arXiv preprint arXiv …, 2020 - arxiv.org
Despite the tremendous recent progress on natural language inference (NLI), driven largely
by large-scale investment in new datasets (e.g., SNLI, MNLI) and advances in modeling, most …

KorNLI and KorSTS: New benchmark datasets for Korean natural language understanding

J Ham, YJ Choe, K Park, I Choi, H Soh - arXiv preprint arXiv:2004.03289, 2020 - arxiv.org
Natural language inference (NLI) and semantic textual similarity (STS) are key tasks in
natural language understanding (NLU). Although several benchmark datasets for those …

ZeroBERTo: Leveraging zero-shot text classification by topic modeling

A Alcoforado, TP Ferraz, R Gerber, E Bustos… - … Processing of the …, 2022 - Springer
Traditional text classification approaches often require a good amount of labeled data, which
is difficult to obtain, especially in restricted domains or less widespread languages. This lack …

Pirá: A bilingual Portuguese-English dataset for question-answering about the ocean

AFA Paschoal, P Pirozelli, V Freire… - Proceedings of the 30th …, 2021 - dl.acm.org
Current research in natural language processing is highly dependent on carefully produced
corpora. Most existing resources focus on English; some resources focus on languages …

BLUEX: A benchmark based on Brazilian leading universities entrance exams

TS Almeida, T Laitz, GK Bonás, R Nogueira - Brazilian Conference on …, 2023 - Springer
One common trend in recent studies of language models (LMs) is the use of standardized
tests for evaluation. However, despite being the fifth most spoken language worldwide, few …

A survey on textual entailment: Benchmarks, approaches and applications

Y Alharahseheh, R Obeidat, M Al-Ayoub… - … on Information and …, 2022 - ieeexplore.ieee.org
Textual Entailment Recognition (TER), also known as natural language inference, is a
crucial task in natural language processing that combines many fundamental aspects of …

Recognizing textual entailment: challenges in the Portuguese language

G Rocha, H Lopes Cardoso - Information, 2018 - mdpi.com
Recognizing textual entailment comprises the task of determining semantic entailment
relations between text fragments. A text fragment entails another text fragment if, from the …

PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese

T Osório, B Leite, HL Cardoso, L Gomes… - arXiv preprint arXiv …, 2024 - arxiv.org
Leveraging research on the neural modelling of Portuguese, we contribute a collection of
datasets for an array of language processing tasks and a corresponding collection of fine …

Sentence similarity recognition in Portuguese from multiple embedding models

AC Rodrigues, RM Marcacini - 2022 21st IEEE International …, 2022 - ieeexplore.ieee.org
Distinct pre-trained embedding models perform differently in sentence similarity recognition
tasks. The current assumption is that they encode different features due to differences in …

PTT5-Paraphraser: Diversity and meaning fidelity in automatic Portuguese paraphrasing

LFAO Pellicer, P Pirozelli, AHR Costa… - … Processing of the …, 2022 - Springer
Paraphrasing is a fundamental technique for many text applications. Typically, this task is
performed through models that perform lexical and translation operations, which tend to …