Is ChatGPT a general-purpose natural language processing task solver?

C Qin, A Zhang, Z Zhang, J Chen, M Yasunaga… - arXiv preprint arXiv …, 2023 - arxiv.org
Spurred by advancements in scale, large language models (LLMs) have demonstrated the
ability to perform a variety of natural language processing (NLP) tasks zero-shot, i.e., without …

Towards making the most of ChatGPT for machine translation

K Peng, L Ding, Q Zhong, L Shen, X Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
ChatGPT shows remarkable capabilities for machine translation (MT). Several prior studies
have shown that it achieves comparable results to commercial systems for high-resource …

Language models as science tutors

A Chevalier, J Geng, A Wettig, H Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
NLP has recently made exciting progress toward training language models (LMs) with
strong scientific problem-solving skills. However, model development has not focused on …

OLMo: Accelerating the science of language models

D Groeneveld, I Beltagy, P Walsh, A Bhagia… - arXiv preprint arXiv …, 2024 - arxiv.org
Language models (LMs) have become ubiquitous in both NLP research and in commercial
product offerings. As their commercial importance has surged, the most powerful models …

A review on large language models: Architectures, applications, taxonomies, open issues and challenges

MAK Raiaan, MSH Mukta, K Fatema, NM Fahad… - IEEE …, 2024 - ieeexplore.ieee.org
Large Language Models (LLMs) recently demonstrated extraordinary capability in various
natural language processing (NLP) tasks including language translation, text generation …

The impact of large language models on scientific discovery: a preliminary study using GPT-4

Microsoft Research AI4Science, Microsoft Azure Quantum - arXiv preprint arXiv:2311.07361, 2023 - arxiv.org
In recent years, groundbreaking advancements in natural language processing have
culminated in the emergence of powerful large language models (LLMs), which have …

Sharpness-aware minimization improves language model generalization

D Bahri, H Mobahi, Y Tay - arXiv preprint arXiv:2110.08529, 2021 - arxiv.org
The allure of superhuman-level capabilities has led to considerable interest in language
models like GPT-3 and T5, wherein the research has, by and large, revolved around new …

An empirical study of instruction-tuning large language models in Chinese

Q Si, T Wang, Z Lin, X Zhang, Y Cao… - arXiv preprint arXiv …, 2023 - arxiv.org
The success of ChatGPT validates the potential of large language models (LLMs) in artificial
general intelligence (AGI). Subsequently, the release of LLMs has sparked the open-source …

Is a question decomposition unit all we need?

P Patel, S Mishra, M Parmar, C Baral - arXiv preprint arXiv:2205.12538, 2022 - arxiv.org
Large Language Models (LMs) have achieved state-of-the-art performance on many Natural
Language Processing (NLP) benchmarks. With the growing number of new benchmarks, we …

LLaMA beyond English: An empirical study on language capability transfer

J Zhao, Z Zhang, Q Zhang, T Gui, X Huang - arXiv preprint arXiv …, 2024 - arxiv.org
In recent times, substantial advancements have been witnessed in large language models
(LLMs), exemplified by ChatGPT, showcasing remarkable proficiency across a range of …