A comprehensive survey on pretrained foundation models: A history from BERT to ChatGPT

C Zhou, Q Li, C Li, J Yu, Y Liu, G Wang… - International Journal of …, 2024 - Springer
Abstract: Pretrained Foundation Models (PFMs) are regarded as the foundation for various
downstream tasks across different data modalities. A PFM (e.g., BERT, ChatGPT, GPT-4) is …

A comprehensive survey on applications of transformers for deep learning tasks

S Islam, H Elmekki, A Elsebai, J Bentahar… - Expert Systems with …, 2024 - Elsevier
Abstract: Transformers are Deep Neural Networks (DNNs) that utilize a self-attention
mechanism to capture contextual relationships within sequential data. Unlike traditional …
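For reference, the self-attention mechanism this abstract alludes to reduces to a few lines. Below is a minimal NumPy sketch of single-head scaled dot-product attention; the matrix names, dimensions, and random inputs are illustrative, not taken from the paper.

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Single-head scaled dot-product attention.
    X: (seq_len, d_model); Wq, Wk, Wv: (d_model, d_k) projections."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = Q @ K.T / np.sqrt(K.shape[-1])          # token-pair similarities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)   # row-wise softmax
    return weights @ V                               # contextualized token representations

rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))                          # 4 tokens, d_model = 8
Wq, Wk, Wv = (rng.normal(size=(8, 8)) for _ in range(3))
print(self_attention(X, Wq, Wk, Wv).shape)           # -> (4, 8)
```

Each output row is a mixture of all value vectors, weighted by how strongly that token attends to every other token; this is how the mechanism captures contextual relationships across the whole sequence.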

A bibliometric review of large language models research from 2017 to 2023

L Fan, L Li, Z Ma, S Lee, H Yu, L Hemphill - ACM Transactions on …, 2024 - dl.acm.org
Large language models (LLMs), such as OpenAI's Generative Pre-trained Transformer
(GPT), are a class of language models that have demonstrated outstanding performance …

[PDF] mT5: A massively multilingual pre-trained text-to-text transformer

L Xue - arXiv preprint arXiv:2010.11934, 2020 - fq.pkwyx.com
The recent" Text-to-Text Transfer Transformer"(T5) leveraged a unified text-to-text format and
scale to attain state-of-the-art results on a wide variety of English-language NLP tasks. In this …
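The "unified text-to-text format" means every task is cast as string-in, string-out. A hedged sketch of that usage pattern via the Hugging Face transformers library follows; "google/mt5-small" is a published checkpoint, but note that the raw pretrained model is trained only on span corruption, so meaningful outputs require task fine-tuning first.

```python
from transformers import MT5ForConditionalGeneration, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("google/mt5-small")
model = MT5ForConditionalGeneration.from_pretrained("google/mt5-small")

# Every task is phrased as text-to-text: here, a summarization-style prompt.
# The prompt prefix is illustrative; T5-style models learn task prefixes
# during fine-tuning rather than having them built in.
inputs = tokenizer("summarize: mT5 extends T5 to 101 languages.", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```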

AMMUS: A survey of transformer-based pretrained models in natural language processing

KS Kalyan, A Rajasekharan, S Sangeetha - arXiv preprint arXiv …, 2021 - arxiv.org
Transformer-based pretrained language models (T-PTLMs) have achieved great success in
almost every NLP task. The evolution of these models started with GPT and BERT. These …

ARBERT & MARBERT: Deep bidirectional transformers for Arabic

M Abdul-Mageed, AR Elmadany… - arXiv preprint arXiv …, 2020 - arxiv.org
Pre-trained language models (LMs) are currently integral to many natural language
processing systems. Although multilingual LMs were also introduced to serve many …

[HTML] Pre-trained models for natural language processing: A survey

X Qiu, T Sun, Y Xu, Y Shao, N Dai, X Huang - Science China …, 2020 - Springer
Recently, the emergence of pre-trained models (PTMs) has brought natural language
processing (NLP) into a new era. In this survey, we provide a comprehensive review of PTMs …

IndicNLPSuite: Monolingual corpora, evaluation benchmarks and pre-trained multilingual language models for Indian languages

D Kakwani, A Kunchukuttan, S Golla… - Findings of the …, 2020 - aclanthology.org
In this paper, we introduce NLP resources for 11 major Indian languages from two major
language families. These resources include: (a) large-scale sentence-level monolingual …
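The pre-trained model released alongside these resources (IndicBERT) is distributed through the Hugging Face hub. A minimal loading sketch follows; the checkpoint name "ai4bharat/indic-bert" is assumed from the project's public release and should be verified against the AI4Bharat organization on the hub.

```python
from transformers import AutoModel, AutoTokenizer

# Checkpoint name assumed from the AI4Bharat public release.
tokenizer = AutoTokenizer.from_pretrained("ai4bharat/indic-bert")
model = AutoModel.from_pretrained("ai4bharat/indic-bert")

inputs = tokenizer("यह एक उदाहरण वाक्य है।", return_tensors="pt")  # a Hindi example sentence
outputs = model(**inputs)
print(outputs.last_hidden_state.shape)  # (1, seq_len, hidden_size)
```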

Spanish pre-trained BERT model and evaluation data

J Cañete, G Chaperon, R Fuentes, JH Ho… - arXiv preprint arXiv …, 2023 - arxiv.org
The Spanish language is one of the five most widely spoken languages in the world. Nevertheless,
finding resources to train or evaluate Spanish language models is not an easy task. In this …
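The model described here is commonly known as BETO. A minimal masked-language-modeling sketch is below; the checkpoint name "dccuchile/bert-base-spanish-wwm-cased" is assumed from the authors' public release on the Hugging Face hub.

```python
from transformers import pipeline

# Checkpoint name assumed from the authors' release; BETO uses the
# standard BERT-style [MASK] token.
fill = pipeline("fill-mask", model="dccuchile/bert-base-spanish-wwm-cased")
for pred in fill("Madrid es la [MASK] de España."):  # "Madrid is the [MASK] of Spain."
    print(pred["token_str"], round(pred["score"], 3))
```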

[CITATION] Transformers: State-of-the-Art Natural Language Processing

T Wolf - arXiv preprint arXiv:1910.03771, 2020
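This citation-only entry refers to the Hugging Face Transformers library (arXiv:1910.03771). Its headline abstraction is the high-level pipeline API, sketched below; the default sentiment-analysis model is downloaded automatically and is illustrative rather than specified by the paper.

```python
from transformers import pipeline

# pipeline() bundles tokenizer, model, and post-processing behind one call;
# with no model argument it falls back to a library-chosen default checkpoint.
classifier = pipeline("sentiment-analysis")
print(classifier("Pretrained models are easy to reuse with this library."))
# e.g. [{'label': 'POSITIVE', 'score': 0.999...}]
```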