A systematic literature review on bad smells–5 w's: which, when, what, who, where

EV de Paulo Sobrinho, A De Lucia… - IEEE Transactions …, 2018 - ieeexplore.ieee.org
Bad smells are sub-optimal code structures that may represent problems needing attention.
We conduct an extensive literature review on bad smells relying on a large body of …

Assessing the generalizability of code2vec token embeddings

HJ Kang, TF Bissyandé, D Lo - 2019 34th IEEE/ACM …, 2019 - ieeexplore.ieee.org
Many Natural Language Processing (NLP) tasks, such as sentiment analysis or syntactic
parsing, have benefited from the development of word embedding models. In particular …

Clones: what is that smell?

F Rahman, C Bird, P Devanbu - Empirical Software Engineering, 2012 - Springer
Clones are generally considered bad programming practice in software engineering
folklore. They are identified as a bad smell (Fowler et al. 1999) and a major contributor to …

[HTML][HTML] A systematic mapping study of clone visualization

M Hammad, HA Basit, S Jarzabek, R Koschke - Computer Science Review, 2020 - Elsevier
Knowing code clones (similar code fragments) is helpful in software maintenance and re-
engineering. As clone detectors return huge numbers of clones, visualization techniques …

Can pre-trained code embeddings improve model performance? Revisiting the use of code embeddings in software engineering tasks

Z Ding, H Li, W Shang, THP Chen - Empirical Software Engineering, 2022 - Springer
Word representation plays a key role in natural language processing (NLP). Various
representation methods have been developed, among which pre-trained word embeddings …

Assessing the refactorability of software clones

N Tsantalis, D Mazinanian… - IEEE Transactions on …, 2015 - ieeexplore.ieee.org
The presence of duplicated code in software systems is significant and several studies have
shown that clones can be potentially harmful with respect to the maintainability and …

Detector adaptation by maximising agreement between independent data sources

CO Conaire, NE O'Connor… - 2007 IEEE conference on …, 2007 - ieeexplore.ieee.org
Traditional methods for creating classifiers have two main disadvantages. Firstly, it is time
consuming to acquire, or manually annotate, the training collection. Secondly, the data on …

Towards learning generalizable code embeddings using task-agnostic graph convolutional networks

Z Ding, H Li, W Shang, TH Chen - ACM Transactions on Software …, 2023 - dl.acm.org
Code embeddings have seen increasing applications in software engineering (SE) research
and practice recently. Despite the advances in embedding techniques applied in SE …

Studying the relationship between logging characteristics and the code quality of platform software

W Shang, M Nagappan, AE Hassan - Empirical Software Engineering, 2015 - Springer
Platform software plays an important role in speeding up the development of large scale
applications. Such platforms provide functionalities and abstraction on which applications …

An empirical study of faults in late propagation clone genealogies

L Barbour, F Khomh, Y Zou - Journal of Software: Evolution …, 2013 - Wiley Online Library
Two similar code segments, or clones, form a clone pair within a software system. The
changes to the clones over time create a clone evolution history. In this work, we study late …