Flow2vec: Value-flow-based precise code embedding

X Cheng, H Wang, J Hua, G Xu, Y Sui - ACM Transactions on Software …, 2021 - dl.acm.org

Static bug detection has shown its effectiveness in detecting well-defined memory errors, eg,
memory leaks, buffer overflows, and null dereference. However, modern software systems …

被引用次数：254 相关文章所有 5 个版本

[PDF] arxiv.org

What do they capture? a structural analysis of pre-trained language models for source code

Y Wan, W Zhao, H Zhang, Y Sui, G Xu… - Proceedings of the 44th …, 2022 - dl.acm.org

Recently, many pre-trained language models for source code have been proposed to model
the context of code and serve as a basis for downstream code intelligence tasks such as …

被引用次数：125 相关文章所有 9 个版本

[PDF] acm.org

Deep Learning for Code Intelligence: Survey, Benchmark and Toolkit

Y Wan, Z Bi, Y He, J Zhang, H Zhang, Y Sui… - ACM Computing …, 2024 - dl.acm.org

Code intelligence leverages machine learning techniques to extract knowledge from
extensive code corpora, with the aim of developing intelligent tools to improve the quality …

被引用次数：10 相关文章所有 5 个版本

[PDF] uts.edu.au

Path-sensitive code embedding via contrastive learning for software vulnerability detection

X Cheng, G Zhang, H Wang, Y Sui - Proceedings of the 31st ACM …, 2022 - dl.acm.org

Machine learning and its promising branch deep learning have shown success in a wide
range of application domains. Recently, much effort has been expended on applying deep …

被引用次数：91 相关文章所有 4 个版本

[PDF] mlr.press

Tfix: Learning to fix coding errors with a text-to-text transformer

B Berabi, J He, V Raychev… - … Conference on Machine …, 2021 - proceedings.mlr.press

The problem of fixing errors in programs has attracted substantial interest over the years.
The key challenge for building an effective code fixing tool is to capture a wide range of …

被引用次数：134 相关文章所有 9 个版本

[PDF] arxiv.org

D2a: A dataset built for ai-based vulnerability detection methods using differential analysis

Y Zheng, S Pujar, B Lewis, L Buratti… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org

Static analysis tools are widely used for vulnerability detection as they understand programs
with complex behavior and millions of lines of code. Despite their popularity, static analysis …

被引用次数：153 相关文章所有 7 个版本

[PDF] arxiv.org

A survey on machine learning techniques for source code analysis

T Sharma, M Kechagia, S Georgiou, R Tiwari… - arXiv preprint arXiv …, 2021 - arxiv.org

The advancements in machine learning techniques have encouraged researchers to apply
these techniques to a myriad of software engineering tasks that use source code analysis …

被引用次数：96 相关文章所有 5 个版本

[PDF] arxiv.org

Self-supervised contrastive learning for code retrieval and summarization via semantic-preserving transformations

NDQ Bui, Y Yu, L Jiang - Proceedings of the 44th International ACM …, 2021 - dl.acm.org

We propose Corder, a self-supervised contrastive learning framework for source code
model. Corder is designed to alleviate the need of labeled data for code retrieval and code …

被引用次数：120 相关文章所有 7 个版本

[PDF] mdpi.com

Ai-assisted programming tasks using code embeddings and transformers

S Kotsiantis, V Verykios, M Tzagarakis - Electronics, 2024 - mdpi.com

This review article provides an in-depth analysis of the growing field of AI-assisted
programming tasks, specifically focusing on the use of code embeddings and transformers …

被引用次数：6 相关文章所有 2 个版本

[PDF] arxiv.org

Stealthy backdoor attack for code models

Z Yang, B Xu, JM Zhang, HJ Kang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Code models, such as CodeBERT and CodeT5, offer general-purpose representations of
code and play a vital role in supporting downstream automated software engineering tasks …

被引用次数：52 相关文章所有 6 个版本