A survey on deep learning for software engineering

Y Yang, X Xia, D Lo, J Grundy - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
In 2006, Geoffrey Hinton proposed the concept of training “Deep Neural Networks (DNNs)”
and an improved model training method to break the bottleneck of neural network …

Deep learning for source code modeling and generation: Models, applications, and challenges

THM Le, H Chen, MA Babar - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
Deep Learning (DL) techniques for Natural Language Processing have been evolving
remarkably fast. Recently, the DL advances in language modeling, machine translation, and …

Graph neural networks: foundation, frontiers and applications

L Wu, P Cui, J Pei, L Zhao, X Guo - … of the 28th ACM SIGKDD Conference …, 2022 - dl.acm.org
The field of graph neural networks (GNNs) has seen rapid and incredible strides over the
recent years. Graph neural networks, also known as deep learning on graphs, graph …

Codexglue: A machine learning benchmark dataset for code understanding and generation

S Lu, D Guo, S Ren, J Huang, A Svyatkovskiy… - arXiv preprint arXiv …, 2021 - arxiv.org
Benchmark datasets have a significant impact on accelerating research in programming
language tasks. In this paper, we introduce CodeXGLUE, a benchmark dataset to foster …

A transformer-based approach for source code summarization

WU Ahmad, S Chakraborty, B Ray… - arXiv preprint arXiv …, 2020 - arxiv.org
Generating a readable summary that describes the functionality of a program is known as
source code summarization. In this task, learning code representation by modeling the …

Improved code summarization via a graph neural network

A LeClair, S Haque, L Wu, C McMillan - Proceedings of the 28th …, 2020 - dl.acm.org
Automatic source code summarization is the task of generating natural language
descriptions for source code. Automatic code summarization is a rapidly expanding research …

Spt-code: Sequence-to-sequence pre-training for learning source code representations

C Niu, C Li, V Ng, J Ge, L Huang, B Luo - Proceedings of the 44th …, 2022 - dl.acm.org
Recent years have seen the successful application of large pre-trained models to code
representation learning, resulting in substantial improvements on many code-related …

A neural model for generating natural language summaries of program subroutines

A LeClair, S Jiang, C McMillan - 2019 IEEE/ACM 41st …, 2019 - ieeexplore.ieee.org
Source code summarization--creating natural language descriptions of source code
behavior--is a rapidly-growing research topic with applications to automatic documentation …

Reacc: A retrieval-augmented code completion framework

S Lu, N Duan, H Han, D Guo, S Hwang… - arXiv preprint arXiv …, 2022 - arxiv.org
Code completion, which aims to predict the following code token (s) according to the code
context, can improve the productivity of software development. Recent work has proved that …

Retrieval augmented code generation and summarization

MR Parvez, WU Ahmad, S Chakraborty, B Ray… - arXiv preprint arXiv …, 2021 - arxiv.org
Software developers write a lot of source code and documentation during software
development. Intrinsically, developers often recall parts of source code or code summaries …