Coz: Finding code that counts with causal profiling
C Curtsinger, ED Berger - Proceedings of the 25th Symposium on …, 2015 - dl.acm.org
Improving performance is a central concern for software developers. To locate optimization
opportunities, developers rely on software profilers. However, these profilers only report …
Learning to parallelize with OpenMP by augmented heterogeneous AST representation
Detecting parallelizable code regions is a challenging task, even for experienced
developers. Numerous recent studies have explored the use of machine learning for code …
Kremlin: Rethinking and rebooting gprof for the multicore age
Many recent parallelization tools lower the barrier for parallelizing a program, but overlook
one of the first questions that a programmer needs to answer: which parts of the program …
SD3: A scalable approach to dynamic data-dependence profiling
As multicore processors are deployed in mainstream computing, the need for software tools
to help parallelize programs is increasing dramatically. Data-dependence profiling is an …
Precise calling context encoding
WN Sumner, Y Zheng, D Weeratunge… - Proceedings of the 32nd …, 2010 - dl.acm.org
Calling contexts are very important for a wide range of applications such as profiling,
debugging, and event logging. Most applications perform expensive stack walking to …
Kismet: parallel speedup estimates for serial programs
Software engineers now face the difficult task of refactoring serial programs for parallel
execution on multicore processors. Currently, they are offered little guidance as to how much …
Profiling data-dependence to assist parallelization: Framework, scope, and optimization
A Ketterlin, P Clauss - 2012 45th Annual IEEE/ACM …, 2012 - ieeexplore.ieee.org
This paper describes a tool using one or more executions of a sequential program to detect
parallel portions of the program. The tool, called Parwiz, uses dynamic binary …
Unveiling parallelization opportunities in sequential programs
Z Li, R Atre, Z Huda, A Jannesari, F Wolf - Journal of Systems and Software, 2016 - Elsevier
The stagnation of single-core performance leaves application developers with software
parallelism as the only option to further benefit from Moore's Law. However, in view of the …
Multi-view learning for parallelism discovery of sequential programs
Identifying suitable parallelizable regions in sequential programs is a crucial task for
performance optimizations. Traditional methods like static and dynamic analysis have flaws …
Towards parallelism detection of sequential programs with graph neural network
Y Shen, M Peng, S Wang, Q Wu - Future Generation Computer Systems, 2021 - Elsevier
Development of the parallel processing technology is necessary to solve problems
created by programs with complex structures that are computation- and data-intensive. In the …