ChatGPT for good? On opportunities and challenges of large language models for education

E Kasneci, K Seßler, S Küchemann, M Bannert… - Learning and individual …, 2023 - Elsevier
Large language models represent a significant advancement in the field of AI. The
underlying technology is key to further innovations and, despite critical views and even bans …

A comprehensive survey of AI-generated content (AIGC): A history of generative AI from GAN to ChatGPT

Y Cao, S Li, Y Liu, Z Yan, Y Dai, PS Yu… - arXiv preprint arXiv …, 2023 - arxiv.org
Recently, ChatGPT, along with DALL-E-2 and Codex, has been gaining significant attention
from society. As a result, many individuals have become interested in related resources and …

Cross-lingual summarization via ChatGPT

J Wang, Y Liang, F Meng, Z Li, J Qu… - arXiv preprint arXiv …, 2023 - researchgate.net
Given a document in a source language, cross-lingual summarization (CLS) aims to
generate a summary in a different target language. Recently, the emergence of ChatGPT …

Llama 2: Open foundation and fine-tuned chat models

H Touvron, L Martin, K Stone, P Albert… - arXiv preprint arXiv …, 2023 - arxiv.org
In this work, we develop and release Llama 2, a collection of pretrained and fine-tuned large
language models (LLMs) ranging in scale from 7 billion to 70 billion parameters. Our fine …

QLoRA: Efficient finetuning of quantized LLMs

T Dettmers, A Pagnoni, A Holtzman… - Advances in Neural …, 2024 - proceedings.neurips.cc
We present QLoRA, an efficient finetuning approach that reduces memory usage enough to
finetune a 65B parameter model on a single 48GB GPU while preserving full 16-bit …

A survey of large language models

WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou… - arXiv preprint arXiv …, 2023 - arxiv.org
Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …

MiniGPT-4: Enhancing vision-language understanding with advanced large language models

D Zhu, J Chen, X Shen, X Li, M Elhoseiny - arXiv preprint arXiv …, 2023 - arxiv.org
The recent GPT-4 has demonstrated extraordinary multi-modal abilities, such as directly
generating websites from handwritten text and identifying humorous elements within …

Large language models encode clinical knowledge

K Singhal, S Azizi, T Tu, SS Mahdavi, J Wei, HW Chung… - Nature, 2023 - nature.com
Large language models (LLMs) have demonstrated impressive capabilities, but the bar for
clinical applications is high. Attempts to assess the clinical knowledge of models typically …

The Flan Collection: Designing data and methods for effective instruction tuning

S Longpre, L Hou, T Vu, A Webson… - International …, 2023 - proceedings.mlr.press
We study the design decisions of publicly available instruction tuning methods by
reproducing and breaking down the development of Flan 2022 (Chung et al., 2022) …

Mathematical capabilities of ChatGPT

S Frieder, L Pinchetti, RR Griffiths… - Advances in neural …, 2024 - proceedings.neurips.cc
We investigate the mathematical capabilities of two iterations of ChatGPT (released 9-
January-2023 and 30-January-2023) and of GPT-4 by testing them on publicly available …