Chain-of-thought prompting of large language models for discovering and fixing software vulnerabilities

Y Nong, M Aldeen, L Cheng, H Hu, F Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Security vulnerabilities are increasingly prevalent in modern software and they are widely
consequential to our society. Various approaches to defending against these vulnerabilities …

A Catalog of Data Smells for Coding Tasks

A Vitale, R Oliveto, S Scalabrino - ACM Transactions on Software …, 2024 - dl.acm.org
Large Language Models (LLMs) are increasingly becoming fundamental in supporting
software developers in coding tasks. The massive datasets used for training LLMs are often …

MoreFixes: A large-scale dataset of CVE fix commits mined through enhanced repository discovery

J Akhoundali, SR Nouri, K Rietveld… - Proceedings of the 20th …, 2024 - dl.acm.org
Vulnerability datasets have become an important instrument in software security research,
being used to develop automated, machine learning-based vulnerability detection and …

Automated software vulnerability patching using large language models

Y Nong, H Yang, L Cheng, H Hu, H Cai - arXiv preprint arXiv:2408.13597, 2024 - arxiv.org
Timely and effective vulnerability patching is essential for cybersecurity defense, for which
various approaches have been proposed yet still struggle to generate valid and correct …

Learning to Detect and Localize Multilingual Bugs

H Yang, Y Nong, T Zhang, X Luo, H Cai - Proceedings of the ACM on …, 2024 - dl.acm.org
Increasing studies have shown bugs in multi-language software as a critical loophole in
modern software quality assurance, especially those induced by language interactions (ie …

A Cross-Silo Vulnerability Federated Learning Approach Based on Content Chunking

W Zhang, J Zhang, S Yu, M Duan… - IEEE Internet of Things …, 2024 - ieeexplore.ieee.org
The proliferation of vulnerable code poses a significant threat to software system security
and user privacy. Given the inefficiency inherent in manual vulnerability analysis, there has …

Exploring RAG-based Vulnerability Augmentation with LLMs

SS Daneshvar, Y Nong, X Yang, S Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Detecting vulnerabilities is a crucial task for maintaining the integrity, availability, and
security of software systems. Utilizing DL-based models for vulnerability detection has …

Improving VulRepair's Perfect Prediction by Leveraging the LION Optimizer

B Kishiyama, Y Lee, J Yang - Applied Sciences, 2024 - mdpi.com
In current software applications, numerous vulnerabilities may be present. Attackers attempt
to exploit these vulnerabilities, leading to security breaches, unauthorized entry, data theft …

Enhancing Pre-Trained Language Models for Vulnerability Detection via Semantic-Preserving Data Augmentation

W Qi, J Cao, D Poddar, S Li, X Wang - arXiv preprint arXiv:2410.00249, 2024 - arxiv.org
With the rapid development and widespread use of advanced network systems, software
vulnerabilities pose a significant threat to secure communications and networking. Learning …

Improving Long-Tail Vulnerability Detection Through Data Augmentation Based on Large Language Models

X Deng, F Duan, R Xie, W Ye… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
The ability of automatic vulnerability detection models largely depends on the dataset used
for training. However, annotating these datasets is costly and time-consuming, leading to a …