A Catalog of Data Smells for Coding Tasks

A Vitale, R Oliveto, S Scalabrino - ACM Transactions on Software …, 2024 - dl.acm.org
Large Language Models (LLMs) are increasingly becoming fundamental in supporting
software developers in coding tasks. The massive datasets used for training LLMs are often …

Test-case-driven programming understanding in large language models for better code generation

Z Tian, J Chen, X Zhang - arXiv preprint arXiv:2309.16120, 2023 - arxiv.org
Code generation is to automatically generate source code conforming to a given
programming specification, which has received extensive attention especially with the …

A systematic assessment of openai o1-preview for higher order thinking in education

E Latif, Y Zhou, S Guo, Y Gao, L Shi… - arXiv preprint arXiv …, 2024 - arxiv.org
As artificial intelligence (AI) continues to advance, it demonstrates capabilities comparable
to human intelligence, with significant potential to transform education and workforce …

Agents in Software Engineering: Survey, Landscape, and Vision

Y Wang, W Zhong, Y Huang, E Shi, M Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, Large Language Models (LLMs) have achieved remarkable success and
have been widely used in various downstream tasks, especially in the tasks of the software …

Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning

D Paul, R West, A Bosselut, B Faltings - arXiv preprint arXiv:2402.13950, 2024 - arxiv.org
Large language models (LLMs) have been shown to perform better when asked to reason
step-by-step before answering a question. However, it is unclear to what degree the model's …

SABER: Model-agnostic Backdoor Attack on Chain-of-Thought in Neural Code Generation

N Jin, Z Li, Y Guo, C Su, T Zhang, Q Zeng - arXiv preprint arXiv …, 2024 - arxiv.org
Recent studies have proposed integrating Chain-of-Thought (CoT) reasoning to further
enhance the reliability of Code Language Models (CLMs) in generating code, a step-by-step …

Aligning language models to code: exploring efficient, temporal, and preference alignment for code generation

M Weyssow - 2024 - papyrus.bib.umontreal.ca
Pre-trained and large language models (PLMs, LLMs) have had a transformative impact on
the artificial intelligence (AI) for software engineering (SE) research field. Through large …