Leandojo: Theorem proving with retrieval-augmented language models

K Yang, A Swope, A Gu, R Chalamala… - Advances in …, 2024 - proceedings.neurips.cc
Large language models (LLMs) have shown promise in proving formal theorems using proof
assistants such as Lean. However, existing methods are difficult to reproduce or build on …

Toolqa: A dataset for llm question answering with external tools

Y Zhuang, Y Yu, K Wang, H Sun… - Advances in Neural …, 2023 - proceedings.neurips.cc
Abstract Large Language Models (LLMs) have demonstrated impressive performance in
various NLP tasks, but they still suffer from challenges such as hallucination and weak …

Open-world story generation with structured knowledge enhancement: A comprehensive survey

Y Wang, J Lin, Z Yu, W Hu, BF Karlsson - Neurocomputing, 2023 - Elsevier
Storytelling and narrative are fundamental to human experience, intertwined with our social
and cultural engagement. As such, researchers have long attempted to create systems that …

Dense text retrieval based on pretrained language models: A survey

WX Zhao, J Liu, R Ren, JR Wen - ACM Transactions on Information …, 2024 - dl.acm.org
Text retrieval is a long-standing research topic on information seeking, where a system is
required to return relevant information resources to user's queries in natural language. From …

Inferfix: End-to-end program repair with llms

M Jin, S Shahriar, M Tufano, X Shi, S Lu… - Proceedings of the 31st …, 2023 - dl.acm.org
Software development life cycle is profoundly influenced by bugs; their introduction,
identification, and eventual resolution account for a significant portion of software …

Repocoder: Repository-level code completion through iterative retrieval and generation

F Zhang, B Chen, Y Zhang, J Keung, J Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
The task of repository-level code completion is to continue writing the unfinished code based
on a broader context of the repository. While for automated code completion tools, it is …

Crosscodeeval: A diverse and multilingual benchmark for cross-file code completion

Y Ding, Z Wang, W Ahmad, H Ding… - Advances in …, 2024 - proceedings.neurips.cc
Code completion models have made significant progress in recent years, yet current popular
evaluation datasets, such as HumanEval and MBPP, predominantly focus on code …

Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit

Y Wan, Z Bi, Y He, J Zhang, H Zhang, Y Sui… - ACM Computing …, 2024 - dl.acm.org
Code intelligence leverages machine learning techniques to extract knowledge from
extensive code corpora, with the aim of developing intelligent tools to improve the quality …

Pangu-coder2: Boosting large language models for code with ranking feedback

B Shen, J Zhang, T Chen, D Zan, B Geng, A Fu… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models for Code (Code LLM) are flourishing. New and powerful models
are released on a weekly basis, demonstrating remarkable performance on the code …

Skcoder: A sketch-based approach for automatic code generation

J Li, Y Li, G Li, Z Jin, Y Hao, X Hu - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
Recently, deep learning techniques have shown great success in automatic code
generation. Inspired by the code reuse, some researchers propose copy-based approaches …