- 学术资源搜索

A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arXiv preprint arXiv …, 2023 - arxiv.org

Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

被引用次数：229 相关文章所有 3 个版本

[HTML] mdpi.com

[HTML][HTML] A survey on text classification algorithms: From text to predictions

A Gasparetto, M Marcuzzo, A Zangari, A Albarelli - Information, 2022 - mdpi.com

In recent years, the exponential growth of digital documents has been met by rapid progress
in text classification techniques. Newly proposed machine learning algorithms leverage the …

被引用次数：129 相关文章所有 6 个版本

[PDF] hal.science

Bloom: A 176b-parameter open-access multilingual language model

T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow… - 2023 - inria.hal.science

Large language models (LLMs) have been shown to be able to perform new tasks based on
a few demonstrations or natural language instructions. While these capabilities have led to …

被引用次数：1334 相关文章所有 16 个版本

[PDF] neurips.cc

Language model tokenizers introduce unfairness between languages

A Petrov, E La Malfa, P Torr… - Advances in Neural …, 2024 - proceedings.neurips.cc

Recent language models have shown impressive multilingual performance, even when not
explicitly trained for it. Despite this, there are concerns about the quality of their outputs …

被引用次数：39 相关文章所有 8 个版本

[PDF] aclanthology.org

Character-aware models improve visual text rendering

R Liu, D Garrette, C Saharia, W Chan… - arXiv preprint arXiv …, 2022 - arxiv.org

Current image generation models struggle to reliably produce well-formed visual text. In this
paper, we investigate a key contributing factor: popular text-to-image models lack character …

被引用次数：38 相关文章所有 9 个版本

[PDF] thecvf.com

Clippo: Image-and-language understanding from pixels only

M Tschannen, B Mustafa… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Multimodal models are becoming increasingly effective, in part due to unified components,
such as the Transformer architecture. However, multimodal models still often consist of many …

被引用次数：22 相关文章所有 6 个版本

[PDF] arxiv.org

Linguistically inspired roadmap for building biologically reliable protein language models

MH Vu, R Akbar, PA Robert, B Swiatczak… - Nature Machine …, 2023 - nature.com

Deep neural-network-based language models (LMs) are increasingly applied to large-scale
protein sequence data to predict protein function. However, being largely black-box models …

被引用次数：25 相关文章所有 4 个版本

[HTML] mit.edu

[HTML][HTML] mGPT: Few-Shot Learners Go Multilingual

O Shliazhko, A Fenogenova, M Tikhonova… - Transactions of the …, 2024 - direct.mit.edu

This paper introduces mGPT, a multilingual variant of GPT-3, pretrained on 61 languages
from 25 linguistically diverse language families using Wikipedia and the C4 Corpus. We …

被引用次数：5 相关文章所有 5 个版本

[PDF] arxiv.org

The SIGMORPHON 2022 shared task on morpheme segmentation

K Batsuren, G Bella, A Arora, V Martinović… - arXiv preprint arXiv …, 2022 - arxiv.org

The SIGMORPHON 2022 shared task on morpheme segmentation challenged systems to
decompose a word into a sequence of morphemes and covered most types of morphology …

被引用次数：27 相关文章所有 9 个版本

[PDF] arxiv.org

Text generation with text-editing models

E Malmi, Y Dong, J Mallinson, A Chuklin… - arXiv preprint arXiv …, 2022 - arxiv.org

Text-editing models have recently become a prominent alternative to seq2seq models for
monolingual text-generation tasks such as grammatical error correction, simplification, and …

被引用次数：29 相关文章所有 7 个版本