ASSIN: Avaliação de similaridade semântica e inferência textual

H Hu, K Richardson, L Xu, L Li, S Kübler… - arXiv preprint arXiv …, 2020 - arxiv.org

Despite the tremendous recent progress on natural language inference (NLI), driven largely
by large-scale investment in new datasets (e.g., SNLI, MNLI) and advances in modeling, most …

Cited by 99; 4 versions

KorNLI and KorSTS: New benchmark datasets for Korean natural language understanding

J Ham, YJ Choe, K Park, I Choi, H Soh - arXiv preprint arXiv:2004.03289, 2020 - arxiv.org

Natural language inference (NLI) and semantic textual similarity (STS) are key tasks in
natural language understanding (NLU). Although several benchmark datasets for those …

Cited by 113; 4 versions

ZeroBERTo: Leveraging zero-shot text classification by topic modeling

A Alcoforado, TP Ferraz, R Gerber, E Bustos… - … Processing of the …, 2022 - Springer

Traditional text classification approaches often require a good amount of labeled data, which
is difficult to obtain, especially in restricted domains or less widespread languages. This lack …

Cited by 36; 10 versions

Pirá: A bilingual Portuguese-English dataset for question-answering about the ocean

AFA Paschoal, P Pirozelli, V Freire… - Proceedings of the 30th …, 2021 - dl.acm.org

Current research in natural language processing is highly dependent on carefully produced
corpora. Most existing resources focus on English; some resources focus on languages …

Cited by 18; 7 versions

Bluex: A benchmark based on Brazilian leading universities entrance exams

TS Almeida, T Laitz, GK Bonás, R Nogueira - Brazilian Conference on …, 2023 - Springer

One common trend in recent studies of language models (LMs) is the use of standardized
tests for evaluation. However, despite being the fifth most spoken language worldwide, few …

Cited by 5; 4 versions

A survey on textual entailment: Benchmarks, approaches and applications

Y Alharahseheh, R Obeidat, M Al-Ayoub… - … on Information and …, 2022 - ieeexplore.ieee.org

Textual Entailment Recognition (TER), also known as natural language inference, is a
crucial task in natural language processing that combines many fundamental aspects of …

Cited by 5

Recognizing textual entailment: challenges in the Portuguese language

G Rocha, H Lopes Cardoso - Information, 2018 - mdpi.com

Recognizing textual entailment comprises the task of determining semantic entailment
relations between text fragments. A text fragment entails another text fragment if, from the …

Cited by 26; 5 versions

PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese

T Osório, B Leite, HL Cardoso, L Gomes… - arXiv preprint arXiv …, 2024 - arxiv.org

Leveraging research on the neural modelling of Portuguese, we contribute a collection of
datasets for an array of language processing tasks and a corresponding collection of fine …

Cited by 2; 6 versions

Sentence similarity recognition in Portuguese from multiple embedding models

AC Rodrigues, RM Marcacini - 2022 21st IEEE International …, 2022 - ieeexplore.ieee.org

Distinct pre-trained embedding models perform differently in sentence similarity recognition
tasks. The current assumption is that they encode different features due to differences in …

Cited by 2; 2 versions

PTT5-Paraphraser: Diversity and meaning fidelity in automatic Portuguese paraphrasing

LFAO Pellicer, P Pirozelli, AHR Costa… - … Processing of the …, 2022 - Springer

Paraphrasing is a fundamental technique for many text applications. Typically, this task is
performed through models that perform lexical and translation operations, which tend to …

Cited by 5; 7 versions