Learning natural coding conventions

M Allamanis, ET Barr, P Devanbu… - ACM Computing Surveys …, 2018 - dl.acm.org

Research at the intersection of machine learning, programming languages, and software
engineering has recently taken important steps in proposing learnable probabilistic models …

被引用次数：1051 相关文章所有 10 个版本

[PDF] mdpi.com

Natural language generation and understanding of big code for AI-assisted programming: A review

MF Wong, S Guo, CN Hang, SW Ho, CW Tan - Entropy, 2023 - mdpi.com

This paper provides a comprehensive review of the literature concerning the utilization of
Natural Language Processing (NLP) techniques, with a particular focus on transformer …

被引用次数：79 相关文章所有 9 个版本

[PDF] arxiv.org

Program synthesis with large language models

J Austin, A Odena, M Nye, M Bosma… - arXiv preprint arXiv …, 2021 - arxiv.org

This paper explores the limits of the current generation of large language models for
program synthesis in general purpose programming languages. We evaluate a collection of …

被引用次数：1294 相关文章所有 3 个版本

[PDF] neurips.cc

Unsupervised translation of programming languages

B Roziere, MA Lachaux… - Advances in neural …, 2020 - proceedings.neurips.cc

A transcompiler, also known as source-to-source translator, is a system that converts source
code from a high-level programming language (such as C++ or Python) to another …

被引用次数：259 相关文章所有 6 个版本

[PDF] acm.org

code2vec: Learning distributed representations of code

U Alon, M Zilberstein, O Levy, E Yahav - Proceedings of the ACM on …, 2019 - dl.acm.org

We present a neural model for representing snippets of code as continuous distributed
vectors (``code embeddings''). The main idea is to represent a code snippet as a single fixed …

被引用次数：1491 相关文章所有 8 个版本

[PDF] arxiv.org

On the robustness of code generation techniques: An empirical study on github copilot

A Mastropaolo, L Pascarella… - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org

Software engineering research has always being concerned with the improvement of code
completion approaches, which suggest the next tokens a developer will likely type while …

被引用次数：99 相关文章所有 8 个版本

[PDF] smu.edu.sg

Deep code comment generation

X Hu, G Li, X Xia, D Lo, Z Jin - Proceedings of the 26th conference on …, 2018 - dl.acm.org

During software maintenance, code comments help developers comprehend programs and
reduce additional time spent on reading and navigating source code. Unfortunately, these …

被引用次数：785 相关文章所有 12 个版本

[PDF] arxiv.org

Learning to represent programs with graphs

M Allamanis, M Brockschmidt, M Khademi - arXiv preprint arXiv …, 2017 - arxiv.org

Learning tasks on source code (ie, formal languages) have been considered recently, but
most work has tried to transfer natural language methods and does not capitalize on the …

被引用次数：1029 相关文章所有 6 个版本

[PDF] acm.org

Big code!= big vocabulary: Open-vocabulary models for source code

RM Karampatsis, H Babii, R Robbes, C Sutton… - Proceedings of the …, 2020 - dl.acm.org

Statistical language modeling techniques have successfully been applied to large source
code corpora, yielding a variety of new software development tools, such as tools for code …

被引用次数：259 相关文章所有 13 个版本

[PDF] miamioh.edu

Deep learning code fragments for code clone detection

M White, M Tufano, C Vendome… - Proceedings of the 31st …, 2016 - dl.acm.org

Code clone detection is an important problem for software maintenance and evolution. Many
approaches consider either structure or identifiers, but none of the existing detection …

被引用次数：763 相关文章所有 10 个版本