Parsinlu: a suite of language understanding challenges for persian

F Yu, H Zhang, P Tiwari, B Wang - ACM Computing Surveys, 2024 - dl.acm.org

This survey article proposes a clearer view of Natural Language Reasoning (NLR) in the
field of Natural Language Processing (NLP), both conceptually and practically …

被引用次数：47 相关文章所有 3 个版本

[PDF] arxiv.org

XTREME-R: Towards more challenging and nuanced multilingual evaluation

S Ruder, N Constant, J Botha, A Siddhant… - arXiv preprint arXiv …, 2021 - arxiv.org

Machine learning has brought striking advances in multilingual natural language processing
capabilities over the past year. For example, the latest techniques have improved the state …

被引用次数：143 相关文章所有 8 个版本

[PDF] arxiv.org

Massive: A 1m-example multilingual natural language understanding dataset with 51 typologically-diverse languages

J FitzGerald, C Hench, C Peris, S Mackie… - arXiv preprint arXiv …, 2022 - arxiv.org

We present the MASSIVE dataset--Multilingual Amazon Slu resource package (SLURP) for
Slot-filling, Intent classification, and Virtual assistant Evaluation. MASSIVE contains 1M …

被引用次数：90 相关文章所有 8 个版本

[PDF] arxiv.org

Scandeval: A benchmark for Scandinavian natural language processing

DS Nielsen - arXiv preprint arXiv:2304.00906, 2023 - arxiv.org

This paper introduces a Scandinavian benchmarking platform, ScandEval, which can
benchmark any pretrained model on four different tasks in the Scandinavian languages. The …

被引用次数：21 相关文章所有 3 个版本

[PDF] neurips.cc

This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish

L Augustyniak, K Tagowski… - Advances in …, 2022 - proceedings.neurips.cc

The availability of compute and data to train larger and larger language models increases
the demand for robust methods of benchmarking the true progress of LM training. Recent …

被引用次数：11 相关文章所有 5 个版本

[PDF] aclanthology.org

Superlim: A Swedish language understanding evaluation benchmark

A Berdičevskis, G Bouma, R Kurtz… - Proceedings of the …, 2023 - aclanthology.org

We present Superlim, a multi-task NLP benchmark and analysis platform for evaluating
Swedish language models, a counterpart to the English-language (Super) GLUE suite. We …

被引用次数：8 相关文章所有 3 个版本

[PDF] arxiv.org

Farstail: A persian natural language inference dataset

H Amirkhani, M AzariJafari, S Faridan-Jahromi… - Soft Computing, 2023 - Springer

With the considerable achievements of data-hungry deep learning methods in natural
language processing tasks, a great amount of effort has been devoted to develop more …

被引用次数：39 相关文章所有 5 个版本

[PDF] ieee.org

Persianquad: the native question answering dataset for the Persian language

A Kazemi, J Mozafari, MA Nematbakhsh - IEEE Access, 2022 - ieeexplore.ieee.org

Developing Question Answering systems (QA) is one of the main goals in Artificial
Intelligence. With the advent of Deep Learning (DL) techniques, QA systems have witnessed …

被引用次数：32 相关文章所有 5 个版本

[HTML] mdpi.com

[HTML][HTML] Investigating the Challenges and Opportunities in Persian Language Information Retrieval through Standardized Data Collections and Deep Learning

S Moniri, T Schlosser, D Kowerko - Computers, 2024 - mdpi.com

The Persian language, also known as Farsi, is distinguished by its intricate morphological
richness, yet it contends with a paucity of linguistic resources. With an estimated 110 million …

FaBERT: Pre-training BERT on Persian Blogs

M Masumi, SS Majd, M Shamsfard, H Beigy - arXiv preprint arXiv …, 2024 - arxiv.org

We introduce FaBERT, a Persian BERT-base model pre-trained on the HmBlogs corpus,
encompassing both informal and formal Persian texts. FaBERT is designed to excel in …

被引用次数：4 相关文章所有 2 个版本