Deepwukong: Statically detecting software vulnerabilities using deep graph neural network

X Cheng, H Wang, J Hua, G Xu, Y Sui - ACM Transactions on Software …, 2021 - dl.acm.org
Static bug detection has shown its effectiveness in detecting well-defined memory errors, eg,
memory leaks, buffer overflows, and null dereference. However, modern software systems …

What do they capture? a structural analysis of pre-trained language models for source code

Y Wan, W Zhao, H Zhang, Y Sui, G Xu… - Proceedings of the 44th …, 2022 - dl.acm.org
Recently, many pre-trained language models for source code have been proposed to model
the context of code and serve as a basis for downstream code intelligence tasks such as …

Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit

Y Wan, Z Bi, Y He, J Zhang, H Zhang, Y Sui… - ACM Computing …, 2024 - dl.acm.org
Code intelligence leverages machine learning techniques to extract knowledge from
extensive code corpora, with the aim of developing intelligent tools to improve the quality …

Path-sensitive code embedding via contrastive learning for software vulnerability detection

X Cheng, G Zhang, H Wang, Y Sui - Proceedings of the 31st ACM …, 2022 - dl.acm.org
Machine learning and its promising branch deep learning have shown success in a wide
range of application domains. Recently, much effort has been expended on applying deep …

Tfix: Learning to fix coding errors with a text-to-text transformer

B Berabi, J He, V Raychev… - … Conference on Machine …, 2021 - proceedings.mlr.press
The problem of fixing errors in programs has attracted substantial interest over the years.
The key challenge for building an effective code fixing tool is to capture a wide range of …

D2a: A dataset built for ai-based vulnerability detection methods using differential analysis

Y Zheng, S Pujar, B Lewis, L Buratti… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
Static analysis tools are widely used for vulnerability detection as they understand programs
with complex behavior and millions of lines of code. Despite their popularity, static analysis …

A survey on machine learning techniques for source code analysis

T Sharma, M Kechagia, S Georgiou, R Tiwari… - arXiv preprint arXiv …, 2021 - arxiv.org
The advancements in machine learning techniques have encouraged researchers to apply
these techniques to a myriad of software engineering tasks that use source code analysis …

Self-supervised contrastive learning for code retrieval and summarization via semantic-preserving transformations

NDQ Bui, Y Yu, L Jiang - Proceedings of the 44th International ACM …, 2021 - dl.acm.org
We propose Corder, a self-supervised contrastive learning framework for source code
model. Corder is designed to alleviate the need of labeled data for code retrieval and code …

Ai-assisted programming tasks using code embeddings and transformers

S Kotsiantis, V Verykios, M Tzagarakis - Electronics, 2024 - mdpi.com
This review article provides an in-depth analysis of the growing field of AI-assisted
programming tasks, specifically focusing on the use of code embeddings and transformers …

Stealthy backdoor attack for code models

Z Yang, B Xu, JM Zhang, HJ Kang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Code models, such as CodeBERT and CodeT5, offer general-purpose representations of
code and play a vital role in supporting downstream automated software engineering tasks …