Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit

Y Wan, Z Bi, Y He, J Zhang, H Zhang, Y Sui… - ACM Computing …, 2024 - dl.acm.org
Code intelligence leverages machine learning techniques to extract knowledge from
extensive code corpora, with the aim of developing intelligent tools to improve the quality …

Context-aware code change embedding for better patch correctness assessment

B Lin, S Wang, M Wen, X Mao - ACM Transactions on Software …, 2022 - dl.acm.org
Despite the capability in successfully fixing more and more real-world bugs, existing
Automated Program Repair (APR) techniques are still challenged by the long-standing …

An exploratory literature study on sharing and energy use of language models for source code

M Hort, A Grishina, L Moonen - 2023 ACM/IEEE International …, 2023 - ieeexplore.ieee.org
Context: Large language models trained on source code can support a variety of software
development tasks, such as code recommendation and program repair. Large amounts of …

On the significance of category prediction for code-comment synchronization

Z Yang, JW Keung, X Yu, Y Xiao, Z Jin… - ACM Transactions on …, 2023 - dl.acm.org
Software comments sometimes are not promptly updated in sync when the associated code
is changed. The inconsistency between code and comments may mislead the developers …

CCT5: A code-change-oriented pre-trained model

B Lin, S Wang, Z Liu, Y Liu, X Xia, X Mao - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Software is constantly changing, requiring developers to perform several derived tasks in a
timely manner, such as writing a description for the intention of the code change, or …

Practitioners' expectations on automated code comment generation

X Hu, X Xia, D Lo, Z Wan, Q Chen… - Proceedings of the 44th …, 2022 - dl.acm.org
Good comments are invaluable assets to software projects, as they help developers
understand and maintain projects. However, due to some poor commenting practices …

CCRep: Learning code change representations via pre-trained code model and query back

Z Liu, Z Tang, X Xia, X Yang - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
Representing code changes as numeric feature vectors, i.e., code change representations, is
usually an essential step to automate many software engineering tasks related to code …

A novel refactoring and semantic aware abstract syntax tree differencing tool and a benchmark for evaluating the accuracy of diff tools

P Alikhanifard, N Tsantalis - ACM Transactions on Software Engineering …, 2024 - dl.acm.org
Software undergoes constant changes to support new requirements, address bugs,
enhance performance, and ensure maintainability. Thus, developers spend a great portion …

The devil is in the tails: How long-tailed code distributions impact large language models

X Zhou, K Kim, B Xu, J Liu, DG Han, D Lo - arXiv preprint arXiv …, 2023 - arxiv.org
Learning-based techniques, especially advanced Large Language Models (LLMs) for code,
have gained considerable popularity in various software engineering (SE) tasks. However …

Automating the removal of obsolete TODO comments

Z Gao, X Xia, D Lo, J Grundy… - Proceedings of the 29th …, 2021 - dl.acm.org
TODO comments are very widely used by software developers to describe their pending
tasks during software development. However, after performing the task developers …