Deep learning-based software engineering: Progress, challenges, and opportunities

X Chen, X Hu, Y Huang, H Jiang, W Ji, Y Jiang… - arXiv preprint arXiv …, 2024 - arxiv.org
Researchers have recently achieved significant advances in deep learning techniques,
which in turn has substantially advanced other research disciplines, such as natural …

A review on source code documentation

S Rai, RC Belwal, A Gupta - ACM Transactions on Intelligent Systems …, 2022 - dl.acm.org
Context: Coding is an incremental activity where a developer may need to understand a
code before making suitable changes in the code. Code documentation is considered one of …

Slide4n: Creating presentation slides from computational notebooks with human-ai collaboration

F Wang, X Liu, O Liu, A Neshati, T Ma, M Zhu… - Proceedings of the 2023 …, 2023 - dl.acm.org
Data scientists often have to use other presentation tools (eg, Microsoft PowerPoint) to
create slides to communicate their analysis obtained using computational notebooks. Much …

Documentation matters: Human-centered ai system to assist data science code documentation in computational notebooks

AY Wang, D Wang, J Drozdal, M Muller, S Park… - ACM Transactions on …, 2022 - dl.acm.org
Computational notebooks allow data scientists to express their ideas through a combination
of code and documentation. However, data scientists often pay attention only to the code …

Telling stories from computational notebooks: Ai-assisted presentation slides creation for presenting data science work

C Zheng, D Wang, AY Wang, X Ma - … of the 2022 CHI Conference on …, 2022 - dl.acm.org
Creating presentation slides is a critical but time-consuming task for data scientists. While
researchers have proposed many AI techniques to lift data scientists' burden on data …

Execution-based evaluation for data science code generation models

J Huang, C Wang, J Zhang, C Yan, H Cui… - arXiv preprint arXiv …, 2022 - arxiv.org
Code generation models can benefit data scientists' productivity by automatically generating
code from context and text descriptions. An important measure of the modeling progress is …

Developer-intent driven code comment generation

F Mu, X Chen, L Shi, S Wang… - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
Existing automatic code comment generators mainly focus on producing a general
description of functionality for a given code snippet without considering developer intentions …

DistilKaggle: A distilled dataset of kaggle jupyter notebooks

M Mostafavi Ghahfarokhi, A Asgari… - Proceedings of the 21st …, 2024 - dl.acm.org
Jupyter notebooks have become indispensable tools for data analysis and processing in
various domains. However, despite their widespread use, there is a notable research gap in …

Towards a taxonomy of Roxygen documentation in R packages

M Vidoni, Z Codabux - Empirical Software Engineering, 2023 - Springer
Software documentation is often neglected, impacting maintenance and reuse and leading
to technical issues. In particular, when working with scientific software, such issues in the …

A survey of deep learning models for structural code understanding

R Wu, Y Zhang, Q Peng, L Chen, Z Zheng - arXiv preprint arXiv …, 2022 - arxiv.org
In recent years, the rise of deep learning and automation requirements in the software
industry has elevated Intelligent Software Engineering to new heights. The number of …