A survey of flaky tests
O Parry, GM Kapfhammer, M Hilton… - ACM Transactions on …, 2021 - dl.acm.org
Tests that fail inconsistently, without changes to the code under test, are described as flaky.
Flaky tests do not give a clear indication of the presence of software bugs and thus limit the …
Flaky tests do not give a clear indication of the presence of software bugs and thus limit the …
Context-aware patch generation for better automated program repair
The effectiveness of search-based automated program repair is limited in the number of
correct patches that can be successfully generated. There are two causes of such limitation …
correct patches that can be successfully generated. There are two causes of such limitation …
Deepfl: Integrating multiple fault diagnosis dimensions for deep fault localization
Learning-based fault localization has been intensively studied recently. Prior studies have
shown that traditional Learning-to-Rank techniques can help precisely diagnose fault …
shown that traditional Learning-to-Rank techniques can help precisely diagnose fault …
Toga: A neural method for test oracle generation
Testing is widely recognized as an important stage of the software development lifecycle.
Effective software testing can provide benefits such as bug finding, preventing regressions …
Effective software testing can provide benefits such as bug finding, preventing regressions …
Automated patch correctness assessment: How far are we?
Test-based automated program repair (APR) has attracted huge attention from both industry
and academia. Despite the significant progress made in recent studies, the overfitting …
and academia. Despite the significant progress made in recent studies, the overfitting …
Automatic repair of real bugs in java: A large-scale experiment on the defects4j dataset
Defects4J is a large, peer-reviewed, structured dataset of real-world Java bugs. Each bug in
Defects4J comes with a test suite and at least one failing test case that triggers the bug. In …
Defects4J comes with a test suite and at least one failing test case that triggers the bug. In …
Can large language models write good property-based tests?
Property-based testing (PBT), while an established technique in the software testing
research community, is still relatively underused in real-world software. Pain points in writing …
research community, is still relatively underused in real-world software. Pain points in writing …
An industrial evaluation of unit test generation: Finding real faults in a financial application
Automated unit test generation has been extensively studied in the literature in recent years.
Previous studies on open source systems have shown that test generation tools are quite …
Previous studies on open source systems have shown that test generation tools are quite …
Boosting spectrum-based fault localization using pagerank
Manual debugging is notoriously tedious and time consuming. Therefore, various automated
fault localization techniques have been proposed to help with manual debugging. Among …
fault localization techniques have been proposed to help with manual debugging. Among …
How many of all bugs do we find? a study of static bug detectors
Static bug detectors are becoming increasingly popular and are widely used by professional
software developers. While most work on bug detectors focuses on whether they find bugs at …
software developers. While most work on bug detectors focuses on whether they find bugs at …