CodeBERT: a pre-trained model for programming and natural languages (2020)

Copiloting the copilots: Fusing large language models with completion engines for automated program repair

Y Wei, CS Xia, L Zhang - Proceedings of the 31st ACM Joint European …, 2023 - dl.acm.org

During Automated Program Repair (APR), it can be challenging to synthesize correct
patches for real-world systems in general-purpose programming languages. Recent Large …

被引用次数：65 相关文章所有 6 个版本

[PDF] acm.org

How effective are neural networks for fixing security vulnerabilities

Y Wu, N Jiang, HV Pham, T Lutellier, J Davis… - Proceedings of the …, 2023 - dl.acm.org

Security vulnerability repair is a difficult task that is in dire need of automation. Two groups of
techniques have shown promise:(1) large code language models (LLMs) that have been pre …

被引用次数：50 相关文章所有 6 个版本

[PDF] arxiv.org

Chatgpt: A study on its utility for ubiquitous software engineering tasks

G Sridhara, S Mazumdar - arXiv preprint arXiv:2305.16837, 2023 - arxiv.org

ChatGPT (Chat Generative Pre-trained Transformer) is a chatbot launched by OpenAI on
November 30, 2022. OpenAI's GPT-3 family of large language models serve as the …

被引用次数：29 相关文章所有 2 个版本

[PDF] software-lab.org

Finding the dwarf: recovering precise types from WebAssembly binaries

D Lehmann, M Pradel - Proceedings of the 43rd ACM SIGPLAN …, 2022 - dl.acm.org

The increasing popularity of WebAssembly creates a demand for understanding and reverse
engineering WebAssembly binaries. Recovering high-level function types is an important …

被引用次数：27 相关文章所有 5 个版本

[PDF] arxiv.org

Codex hacks hackerrank: Memorization issues and a framework for code synthesis evaluation

A Karmakar, JA Prenner, M D'Ambros… - arXiv preprint arXiv …, 2022 - arxiv.org

The Codex model has demonstrated extraordinary competence in synthesizing code from
natural language problem descriptions. However, in order to reveal unknown failure modes …

被引用次数：16 相关文章所有 2 个版本

[PDF] arxiv.org

Is a single model enough? mucos: A multi-model ensemble learning approach for semantic code search

L Du, X Shi, Y Wang, E Shi, S Han… - Proceedings of the 30th …, 2021 - dl.acm.org

Recently, deep learning methods have become mainstream in code search since they do
better at capturing semantic correlations between code snippets and search queries and …

被引用次数：32 相关文章所有 5 个版本

[PDF] arxiv.org

DeepVulSeeker: A novel vulnerability identification framework via code graph structure and pre-training mechanism

J Wang, H Xiao, S Zhong, Y Xiao - Future Generation Computer Systems, 2023 - Elsevier

Software vulnerabilities can pose severe harms to a computing system. They can lead to
system crash, privacy leakage, or even physical damage. Correctly identifying vulnerabilities …

被引用次数：11 相关文章所有 4 个版本

[PDF] arxiv.org

Pre-training code representation with semantic flow graph for effective bug localization

Y Du, Z Yu - Proceedings of the 31st ACM Joint European Software …, 2023 - dl.acm.org

Enlightened by the big success of pre-training in natural language processing, pre-trained
models for programming languages have been widely used to promote code intelligence in …

被引用次数：9 相关文章所有 3 个版本

[PDF] acm.org

The growing cost of deep learning for source code

VJ Hellendoorn, AA Sawant - Communications of the ACM, 2021 - dl.acm.org

The growing cost of deep learning for source code Page 1 JANUARY 2022 | VOL. 65 | NO. 1 |
COMMUNICATIONS OF THE ACM 31 viewpoints IMA GER YB Y OZZ DE SIGN uniquely …

被引用次数：20 相关文章

A code centric evaluation of c/c++ vulnerability datasets for deep learning based vulnerability detection techniques

R Jain, N Gervasoni, M Ndhlovu, S Rawat - Proceedings of the 16th …, 2023 - dl.acm.org

Recent years have witnessed tremendous progress in NLP-based code comprehension via
deep neural networks (DNN) learning, especially Large Language Models (LLMs). While the …

被引用次数：12 相关文章所有 2 个版本