Related articles - Academic Resource Search

Is ChatGPT a general-purpose natural language processing task solver?

C Qin, A Zhang, Z Zhang, J Chen, M Yasunaga… - arXiv preprint arXiv …, 2023 - arxiv.org

Spurred by advancements in scale, large language models (LLMs) have demonstrated the
ability to perform a variety of natural language processing (NLP) tasks zero-shot, i.e., without …

Cited by: 532; related articles; all 4 versions

[PDF] arxiv.org

Towards making the most of ChatGPT for machine translation

K Peng, L Ding, Q Zhong, L Shen, X Liu… - arXiv preprint arXiv …, 2023 - arxiv.org

ChatGPT shows remarkable capabilities for machine translation (MT). Several prior studies
have shown that it achieves comparable results to commercial systems for high-resource …

Cited by: 165; related articles; all 8 versions

[PDF] arxiv.org

Language models as science tutors

A Chevalier, J Geng, A Wettig, H Chen… - arXiv preprint arXiv …, 2024 - arxiv.org

NLP has recently made exciting progress toward training language models (LMs) with
strong scientific problem-solving skills. However, model development has not focused on …

Cited by: 2; related articles; all 5 versions

[PDF] arxiv.org

OLMo: Accelerating the science of language models

D Groeneveld, I Beltagy, P Walsh, A Bhagia… - arXiv preprint arXiv …, 2024 - arxiv.org

Language models (LMs) have become ubiquitous in both NLP research and in commercial
product offerings. As their commercial importance has surged, the most powerful models …

Cited by: 29; related articles; all 2 versions

[PDF] ieee.org

A review on large language models: Architectures, applications, taxonomies, open issues and challenges

MAK Raiaan, MSH Mukta, K Fatema, NM Fahad… - IEEE …, 2024 - ieeexplore.ieee.org

Large Language Models (LLMs) recently demonstrated extraordinary capability in various
natural language processing (NLP) tasks including language translation, text generation …

Cited by: 47; related articles; all 7 versions

[PDF] arxiv.org

The impact of large language models on scientific discovery: a preliminary study using GPT-4

Microsoft Research AI4Science, Microsoft Azure Quantum - arXiv preprint arXiv:2311.07361, 2023 - arxiv.org

In recent years, groundbreaking advancements in natural language processing have
culminated in the emergence of powerful large language models (LLMs), which have …

Cited by: 18; related articles

[PDF] arxiv.org

Sharpness-aware minimization improves language model generalization

D Bahri, H Mobahi, Y Tay - arXiv preprint arXiv:2110.08529, 2021 - arxiv.org

The allure of superhuman-level capabilities has led to considerable interest in language
models like GPT-3 and T5, wherein the research has, by and large, revolved around new …

Cited by: 77; related articles; all 7 versions

[PDF] arxiv.org

An empirical study of instruction-tuning large language models in Chinese

Q Si, T Wang, Z Lin, X Zhang, Y Cao… - arXiv preprint arXiv …, 2023 - arxiv.org

The success of ChatGPT validates the potential of large language models (LLMs) in artificial
general intelligence (AGI). Subsequently, the release of LLMs has sparked the open-source …

Cited by: 10; related articles; all 4 versions

[PDF] arxiv.org

Is a question decomposition unit all we need?

P Patel, S Mishra, M Parmar, C Baral - arXiv preprint arXiv:2205.12538, 2022 - arxiv.org

Large Language Models (LMs) have achieved state-of-the-art performance on many Natural
Language Processing (NLP) benchmarks. With the growing number of new benchmarks, we …

Cited by: 33; related articles; all 5 versions

[PDF] arxiv.org

LLaMA beyond English: An empirical study on language capability transfer

J Zhao, Z Zhang, Q Zhang, T Gui, X Huang - arXiv preprint arXiv …, 2024 - arxiv.org

In recent times, substantial advancements have been witnessed in large language models
(LLMs), exemplified by ChatGPT, showcasing remarkable proficiency across a range of …

Cited by: 23; related articles; all 2 versions