Code authorship attribution: Methods and challenges
Code authorship attribution is the process of identifying the author of a given code. With
increasing numbers of malware and advanced mutation techniques, the authors of malware …
increasing numbers of malware and advanced mutation techniques, the authors of malware …
Authorship attribution for neural text generation
In recent years, the task of generating realistic short and long texts have made tremendous
advancements. In particular, several recently proposed neural network-based language …
advancements. In particular, several recently proposed neural network-based language …
Human factors in security research: Lessons learned from 2008-2018
Instead of only considering technology, computer security research now strives to also take
into account the human factor by studying regular users and, to a lesser extent, experts like …
into account the human factor by studying regular users and, to a lesser extent, experts like …
Misleading authorship attribution of source code using adversarial learning
In this paper, we present a novel attack against authorship attribution of source code. We
exploit that recent attribution methods rest on machine learning and thus can be deceived by …
exploit that recent attribution methods rest on machine learning and thus can be deceived by …
Ropgen: Towards robust code authorship attribution via automatic coding style transformation
Source code authorship attribution is an important problem often encountered in
applications such as software forensics, bug fixing, and software quality analysis. Recent …
applications such as software forensics, bug fixing, and software quality analysis. Recent …
Robustness, security, privacy, explainability, efficiency, and usability of large language models for code
Large language models for code (LLM4Code), which demonstrate strong performance (eg,
high accuracy) in processing source code, have significantly transformed software …
high accuracy) in processing source code, have significantly transformed software …
Authorship attribution of source code: A language-agnostic approach and applicability in software engineering
E Bogomolov, V Kovalenko, Y Rebryk… - Proceedings of the 29th …, 2021 - dl.acm.org
Authorship attribution (ie, determining who is the author of a piece of source code) is an
established research topic. State-of-the-art results for the authorship attribution problem look …
established research topic. State-of-the-art results for the authorship attribution problem look …
Identifying Authorship in Malicious Binaries: Features, Challenges & Datasets
J Gray, D Sgandurra, L Cavallaro… - ACM Computing …, 2024 - dl.acm.org
Attributing a piece of malware to its creator typically requires threat intelligence. Binary
attribution increases the level of difficulty as it mostly relies upon the ability to disassemble …
attribution increases the level of difficulty as it mostly relies upon the ability to disassemble …
De‐anonymizing Ethereum blockchain smart contracts through code attribution
Blockchain users are identified by addresses (public keys), which cannot be easily linked
back to them without out‐of‐network information. This provides pseudo‐anonymity, which is …
back to them without out‐of‐network information. This provides pseudo‐anonymity, which is …
PART: Pre-trained Authorship Representation Transformer
Authors writing documents imprint identifying information within their texts: vocabulary,
registry, punctuation, misspellings, or even emoji usage. Finding these details is very …
registry, punctuation, misspellings, or even emoji usage. Finding these details is very …