A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

Challenges and applications of large language models

J Kaddour, J Harris, M Mozes, H Bradley… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …

Qwen technical report

J Bai, S Bai, Y Chu, Z Cui, K Dang, X Deng… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have revolutionized the field of artificial intelligence,
enabling natural language processing tasks that were previously thought to be exclusive to …

Leandojo: Theorem proving with retrieval-augmented language models

K Yang, A Swope, A Gu, R Chalamala… - Advances in …, 2024 - proceedings.neurips.cc
Large language models (LLMs) have shown promise in proving formal theorems using proof
assistants such as Lean. However, existing methods are difficult to reproduce or build on …

Active retrieval augmented generation

Z Jiang, FF Xu, L Gao, Z Sun, Q Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
Despite the remarkable ability of large language models (LMs) to comprehend and generate
language, they have a tendency to hallucinate and create factually inaccurate output …

Octopack: Instruction tuning code large language models

N Muennighoff, Q Liu, A Zebaze, Q Zheng… - arXiv preprint arXiv …, 2023 - arxiv.org
Finetuning large language models (LLMs) on instructions leads to vast performance
improvements on natural language tasks. We apply instruction tuning using code …

Copiloting the copilots: Fusing large language models with completion engines for automated program repair

Y Wei, CS Xia, L Zhang - Proceedings of the 31st ACM Joint European …, 2023 - dl.acm.org
During Automated Program Repair (APR), it can be challenging to synthesize correct
patches for real-world systems in general-purpose programming languages. Recent Large …

Enhancing retrieval-augmented large language models with iterative retrieval-generation synergy

Z Shao, Y Gong, Y Shen, M Huang, N Duan… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models are powerful text processors and reasoners, but are still subject to
limitations including outdated knowledge and hallucinations, which necessitates connecting …

Qwen2. 5-coder technical report

B Hui, J Yang, Z Cui, J Yang, D Liu, L Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
In this report, we introduce the Qwen2. 5-Coder series, a significant upgrade from its
predecessor, CodeQwen1. 5. This series includes six models: Qwen2. 5-Coder-(0.5 B/1.5 …

Codeplan: Repository-level coding using llms and planning

R Bairi, A Sonwane, A Kanade, A Iyer… - Proceedings of the …, 2024 - dl.acm.org
Software engineering activities such as package migration, fixing error reports from static
analysis or testing, and adding type annotations or other specifications to a codebase …