Unsupervised translation of programming languages

B Roziere, MA Lachaux… - Advances in neural …, 2020 - proceedings.neurips.cc
A transcompiler, also known as source-to-source translator, is a system that converts source
code from a high-level programming language (such as C++ or Python) to another …

A convolutional attention network for extreme summarization of source code

M Allamanis, H Peng, C Sutton - International conference on …, 2016 - proceedings.mlr.press
Attention mechanisms in neural networks have proved useful for problems in which the input
and output do not have fixed dimension. Often there exist features that are locally translation …

Suggesting accurate method and class names

M Allamanis, ET Barr, C Bird, C Sutton - … of the 2015 10th joint meeting …, 2015 - dl.acm.org
Descriptive names are a vital part of readable, and hence maintainable, code. Recent
progress on automatically suggesting names for local variables tantalizes with the prospect …

A review on source code documentation

S Rai, RC Belwal, A Gupta - ACM Transactions on Intelligent Systems …, 2022 - dl.acm.org
Context: Coding is an incremental activity where a developer may need to understand a
code before making suitable changes in the code. Code documentation is considered one of …

DOBF: A deobfuscation pre-training objective for programming languages

MA Lachaux, B Roziere… - Advances in Neural …, 2021 - proceedings.neurips.cc
Recent advances in self-supervised learning have dramatically improved the state of the art
on a wide variety of tasks. However, research in language model pre-training has mostly …

Learning natural coding conventions

M Allamanis, ET Barr, C Bird, C Sutton - Proceedings of the 22nd ACM …, 2014 - dl.acm.org
Every programmer has a characteristic style, ranging from preferences about identifier
naming to preferences about object relationships and design patterns. Coding conventions …

Mining source code repositories at massive scale using language modeling

M Allamanis, C Sutton - 2013 10th working conference on …, 2013 - ieeexplore.ieee.org
The tens of thousands of high-quality open source software projects on the Internet raise the
exciting possibility of studying software development by finding patterns across truly large …

On the localness of software

Z Tu, Z Su, P Devanbu - Proceedings of the 22nd ACM SIGSOFT …, 2014 - dl.acm.org
The n-gram language model, which has its roots in statistical natural language processing,
has been shown to successfully capture the repetitive and predictable regularities …

Eye movements in code reading: Relaxing the linear order

T Busjahn, R Bednarik, A Begel… - 2015 IEEE 23rd …, 2015 - ieeexplore.ieee.org
Code reading is an important skill in programming. Inspired by the linearity that people
exhibit while natural language text reading, we designed local and global gaze-based …

Learning to spot and refactor inconsistent method names

K Liu, D Kim, TF Bissyandé, T Kim… - 2019 IEEE/ACM 41st …, 2019 - ieeexplore.ieee.org
To ensure code readability and facilitate software maintenance, program methods must be
named properly. In particular, method names must be consistent with the corresponding …