Ocnli: Original chinese natural language inference
Despite the tremendous recent progress on natural language inference (NLI), driven largely
by large-scale investment in new datasets (eg, SNLI, MNLI) and advances in modeling, most …
by large-scale investment in new datasets (eg, SNLI, MNLI) and advances in modeling, most …
KorNLI and KorSTS: New benchmark datasets for Korean natural language understanding
Natural language inference (NLI) and semantic textual similarity (STS) are key tasks in
natural language understanding (NLU). Although several benchmark datasets for those …
natural language understanding (NLU). Although several benchmark datasets for those …
ZeroBERTo: Leveraging zero-shot text classification by topic modeling
Traditional text classification approaches often require a good amount of labeled data, which
is difficult to obtain, especially in restricted domains or less widespread languages. This lack …
is difficult to obtain, especially in restricted domains or less widespread languages. This lack …
Pirá: A bilingual portuguese-english dataset for question-answering about the ocean
AFA Paschoal, P Pirozelli, V Freire… - Proceedings of the 30th …, 2021 - dl.acm.org
Current research in natural language processing is highly dependent on carefully produced
corpora. Most existing resources focus on English; some resources focus on languages …
corpora. Most existing resources focus on English; some resources focus on languages …
Bluex: A benchmark based on Brazilian leading universities entrance exams
One common trend in recent studies of language models (LMs) is the use of standardized
tests for evaluation. However, despite being the fifth most spoken language worldwide, few …
tests for evaluation. However, despite being the fifth most spoken language worldwide, few …
A survey on textual entailment: Benchmarks, approaches and applications
Textual Entailment Recognition (TER), also known as natural language inference, is a
crucial task in natural language processing that combines many fundamental aspects of …
crucial task in natural language processing that combines many fundamental aspects of …
Recognizing textual entailment: challenges in the Portuguese language
G Rocha, H Lopes Cardoso - Information, 2018 - mdpi.com
Recognizing textual entailment comprises the task of determining semantic entailment
relations between text fragments. A text fragment entails another text fragment if, from the …
relations between text fragments. A text fragment entails another text fragment if, from the …
PORTULAN ExtraGLUE Datasets and Models: Kick-starting a Benchmark for the Neural Processing of Portuguese
Leveraging research on the neural modelling of Portuguese, we contribute a collection of
datasets for an array of language processing tasks and a corresponding collection of fine …
datasets for an array of language processing tasks and a corresponding collection of fine …
Sentence similarity recognition in Portuguese from multiple embedding models
AC Rodrigues, RM Marcacini - 2022 21st IEEE International …, 2022 - ieeexplore.ieee.org
Distinct pre-trained embedding models perform differently in sentence similarity recognition
tasks. The current assumption is that they encode different features due to differences in …
tasks. The current assumption is that they encode different features due to differences in …
Ptt5-paraphraser: Diversity and meaning fidelity in automatic portuguese paraphrasing
Paraphrasing is a fundamental technique for many text applications. Typically, this task is
performed through models that perform lexical and translation operations, which tend to …
performed through models that perform lexical and translation operations, which tend to …