Evaluating defect prediction approaches: a benchmark and an extensive comparison

M D'Ambros, M Lanza, R Robbes - Empirical Software Engineering, 2012 - Springer
Reliably predicting software defects is one of the holy grails of software engineering.
Researchers have devised and implemented a plethora of defect/bug prediction approaches …

Improving code readability models with textual features

S Scalabrino, M Linares-Vasquez… - 2016 IEEE 24th …, 2016 - ieeexplore.ieee.org
Code reading is one of the most frequent activities in software maintenance; before
implementing changes, it is necessary to fully understand source code often written by other …

A comprehensive model for code readability

S Scalabrino, M Linares‐Vásquez… - Journal of Software …, 2018 - Wiley Online Library
Unreadable code could compromise program comprehension, and it could cause the
introduction of bugs. Code consists of mostly natural language text, both in identifiers and …

Automatically identifying code features for software defect prediction: Using AST N-grams

T Shippey, D Bowes, T Hall - Information and Software Technology, 2019 - Elsevier
Context: Identifying defects in code early is important. A wide range of static code metrics
have been evaluated as potential defect indicators. Most of these metrics offer only high …

Natural language is a programming language: Applying natural language processing to software development

MD Ernst - 2nd Summit on Advances in Programming Languages …, 2017 - drops.dagstuhl.de
A powerful, but limited, way to view software is as source code alone. Treating a program as
a sequence of instructions enables it to be formalized and makes it amenable to …

Integrating conceptual and logical couplings for change impact analysis in software

H Kagdi, M Gethers, D Poshyvanyk - Empirical Software Engineering, 2013 - Springer
The paper presents an approach that combines conceptual and evolutionary techniques to
support change impact analysis in source code. Conceptual couplings capture the extent to …

Sublinear time low-rank approximation of positive semidefinite matrices

C Musco, DP Woodruff - 2017 IEEE 58th Annual Symposium …, 2017 - ieeexplore.ieee.org
We show how to compute a relative-error low-rank approximation to any positive
semidefinite (PSD) matrix in sublinear time, ie, for any nxn PSD matrix A, in Õ (n· poly (k/ε)) …

Can lexicon bad smells improve fault prediction?

SL Abebe, V Arnaoudova, P Tonella… - 2012 19th Working …, 2012 - ieeexplore.ieee.org
In software development, early identification of fault-prone classes can save a considerable
amount of resources. In the literature, source code structural metrics have been widely …

The effect of lexicon bad smells on concept location in source code

SL Abebe, S Haiduc, P Tonella… - 2011 IEEE 11th …, 2011 - ieeexplore.ieee.org
Experienced programmers choose identifier names carefully, in the attempt to convey
information about the role and behavior of the labeled code entity in a concise and …

Investigating the use of code analysis and NLP to promote a consistent usage of identifiers

B Lin, S Scalabrino, A Mocci, R Oliveto… - 2017 IEEE 17th …, 2017 - ieeexplore.ieee.org
Meaningless identifiers as well as inconsistent use of identifiers in the source code might
hinder code readability and result in increased software maintenance efforts. Over the past …