A novel neural source code representation based on abstract syntax tree

J Zhang, X Wang, H Zhang, H Sun… - 2019 IEEE/ACM 41st …, 2019 - ieeexplore.ieee.org
Exploiting machine learning techniques for analyzing programs has attracted much
attention. One key problem is how to represent code fragments well for follow-up analysis …

Modular tree network for source code representation learning

W Wang, G Li, S Shen, X Xia, Z Jin - ACM Transactions on Software …, 2020 - dl.acm.org
Learning representation for source code is a foundation of many program analysis tasks. In
recent years, neural networks have already shown success in this area, but most existing …

Infercode: Self-supervised learning of code representations by predicting subtrees

NDQ Bui, Y Yu, L Jiang - 2021 IEEE/ACM 43rd International …, 2021 - ieeexplore.ieee.org
Learning code representations has found many uses in software engineering, such as code
classification, code search, comment generation, and bug prediction, etc. Although …

Learning program semantics with code representations: An empirical study

JK Siow, S Liu, X Xie, G Meng… - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Program semantics learning is the core and fundamental for various code intelligent tasks
eg, vulnerability detection, clone detection. A considerable amount of existing works …

Treecaps: Tree-based capsule networks for source code processing

NDQ Bui, Y Yu, L Jiang - Proceedings of the AAAI Conference on …, 2021 - ojs.aaai.org
Recently program learning techniques have been proposed to process source code based
on syntactical structures (eg, abstract syntax trees) and/or semantic information (eg …

A grammar-based structural cnn decoder for code generation

Z Sun, Q Zhu, L Mou, Y Xiong, G Li… - Proceedings of the AAAI …, 2019 - ojs.aaai.org
Code generation maps a program description to executable source code in a programming
language. Existing approaches mainly rely on a recurrent neural network (RNN) as the …

Bridging pre-trained models and downstream tasks for source code understanding

D Wang, Z Jia, S Li, Y Yu, Y Xiong, W Dong… - Proceedings of the 44th …, 2022 - dl.acm.org
With the great success of pre-trained models, the pretrain-then-finetune paradigm has been
widely adopted on downstream tasks for source code understanding. However, compared to …

A neural model for generating natural language summaries of program subroutines

A LeClair, S Jiang, C McMillan - 2019 IEEE/ACM 41st …, 2019 - ieeexplore.ieee.org
Source code summarization--creating natural language descriptions of source code
behavior--is a rapidly-growing research topic with applications to automatic documentation …

Integrating tree path in transformer for code representation

H Peng, G Li, W Wang, Y Zhao… - Advances in Neural …, 2021 - proceedings.neurips.cc
Learning distributed representation of source code requires modelling its syntax and
semantics. Recent state-of-the-art models leverage highly structured source code …

Language-agnostic representation learning of source code from structure and context

D Zügner, T Kirschstein, M Catasta, J Leskovec… - arXiv preprint arXiv …, 2021 - arxiv.org
Source code (Context) and its parsed abstract syntax tree (AST; Structure) are two
complementary representations of the same computer program. Traditionally, designers of …