A comprehensive overview of large language models
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …
natural language processing tasks and beyond. This success of LLMs has led to a large …
Challenges and applications of large language models
Large Language Models (LLMs) went from non-existent to ubiquitous in the machine
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …
learning discourse within a few years. Due to the fast pace of the field, it is difficult to identify …
Qwen technical report
Large language models (LLMs) have revolutionized the field of artificial intelligence,
enabling natural language processing tasks that were previously thought to be exclusive to …
enabling natural language processing tasks that were previously thought to be exclusive to …
Leandojo: Theorem proving with retrieval-augmented language models
Large language models (LLMs) have shown promise in proving formal theorems using proof
assistants such as Lean. However, existing methods are difficult to reproduce or build on …
assistants such as Lean. However, existing methods are difficult to reproduce or build on …
Active retrieval augmented generation
Despite the remarkable ability of large language models (LMs) to comprehend and generate
language, they have a tendency to hallucinate and create factually inaccurate output …
language, they have a tendency to hallucinate and create factually inaccurate output …
Octopack: Instruction tuning code large language models
Finetuning large language models (LLMs) on instructions leads to vast performance
improvements on natural language tasks. We apply instruction tuning using code …
improvements on natural language tasks. We apply instruction tuning using code …
Copiloting the copilots: Fusing large language models with completion engines for automated program repair
During Automated Program Repair (APR), it can be challenging to synthesize correct
patches for real-world systems in general-purpose programming languages. Recent Large …
patches for real-world systems in general-purpose programming languages. Recent Large …
Enhancing retrieval-augmented large language models with iterative retrieval-generation synergy
Large language models are powerful text processors and reasoners, but are still subject to
limitations including outdated knowledge and hallucinations, which necessitates connecting …
limitations including outdated knowledge and hallucinations, which necessitates connecting …
Qwen2. 5-coder technical report
In this report, we introduce the Qwen2. 5-Coder series, a significant upgrade from its
predecessor, CodeQwen1. 5. This series includes six models: Qwen2. 5-Coder-(0.5 B/1.5 …
predecessor, CodeQwen1. 5. This series includes six models: Qwen2. 5-Coder-(0.5 B/1.5 …
Codeplan: Repository-level coding using llms and planning
Software engineering activities such as package migration, fixing error reports from static
analysis or testing, and adding type annotations or other specifications to a codebase …
analysis or testing, and adding type annotations or other specifications to a codebase …