Large language models for software engineering: A systematic literature review
Large Language Models (LLMs) have significantly impacted numerous domains, including
Software Engineering (SE). Many recent publications have explored LLMs applied to …
Software Engineering (SE). Many recent publications have explored LLMs applied to …
A survey on deep learning for software engineering
In 2006, Geoffrey Hinton proposed the concept of training “Deep Neural Networks (DNNs)”
and an improved model training method to break the bottleneck of neural network …
and an improved model training method to break the bottleneck of neural network …
Program synthesis with large language models
This paper explores the limits of the current generation of large language models for
program synthesis in general purpose programming languages. We evaluate a collection of …
program synthesis in general purpose programming languages. We evaluate a collection of …
Graph neural networks: foundation, frontiers and applications
The field of graph neural networks (GNNs) has seen rapid and incredible strides over the
recent years. Graph neural networks, also known as deep learning on graphs, graph …
recent years. Graph neural networks, also known as deep learning on graphs, graph …
SantaCoder: don't reach for the stars!
The BigCode project is an open-scientific collaboration working on the responsible
development of large language models for code. This tech report describes the progress of …
development of large language models for code. This tech report describes the progress of …
MultiPL-E: a scalable and polyglot approach to benchmarking neural code generation
Large language models have demonstrated the ability to generate both natural language
and programming language text. Although contemporary code generation models are …
and programming language text. Although contemporary code generation models are …
An extensive study on pre-trained models for program understanding and generation
Automatic program understanding and generation techniques could significantly advance
the productivity of programmers and have been widely studied by academia and industry …
the productivity of programmers and have been widely studied by academia and industry …
Multi-task learning based pre-trained language model for code completion
Code completion is one of the most useful features in the Integrated Development
Environments (IDEs), which can accelerate software development by suggesting the next …
Environments (IDEs), which can accelerate software development by suggesting the next …
The adverse effects of code duplication in machine learning models of code
M Allamanis - Proceedings of the 2019 ACM SIGPLAN International …, 2019 - dl.acm.org
The field of big code relies on mining large corpora of code to perform some learning task
towards creating better tools for software engineers. A significant threat to this approach was …
towards creating better tools for software engineers. A significant threat to this approach was …
Perfection not required? Human-AI partnerships in code translation
Generative models have become adept at producing artifacts such as images, videos, and
prose at human-like levels of proficiency. New generative techniques, such as unsupervised …
prose at human-like levels of proficiency. New generative techniques, such as unsupervised …