How much parallelism is there in irregular applications?

M Kulkarni, M Burtscher, R Inkulu, K Pingali… - ACM sigplan …, 2009 - dl.acm.org
Irregular programs are programs organized around pointer-based data structures such as
trees and graphs. Recent investigations by the Galois project have shown that many …

Kremlin: Rethinking and rebooting gprof for the multicore age

S Garcia, D Jeon, CM Louie, MB Taylor - ACM SIGPLAN Notices, 2011 - dl.acm.org
Many recent parallelization tools lower the barrier for parallelizing a program, but overlook
one of the first questions that a programmer needs to answer: which parts of the program …

Practical parallelization of scientific applications with OpenMP, OpenACC and MPI

M Aldinucci, V Cesare, I Colonnelli, AR Martinelli… - Journal of parallel and …, 2021 - Elsevier
This work aims at distilling a systematic methodology to modernize existing sequential
scientific codes with a little re-designing effort, turning an old codebase into modern code, ie …

Speculative parallelization using software multi-threaded transactions

A Raman, H Kim, TR Mason, TB Jablin… - Proceedings of the …, 2010 - dl.acm.org
With the right techniques, multicore architectures may be able to continue the exponential
performance trend that elevated the performance of applications of all types for decades …

HELIX: Automatic parallelization of irregular programs for chip multiprocessing

S Campanoni, T Jones, G Holloway, VJ Reddi… - Proceedings of the …, 2012 - dl.acm.org
We describe and evaluate HELIX, a new technique for automatic loop parallelization that
assigns successive iterations of a loop to separate threads. We show that the inter-thread …

Parallelizing sequential applications on commodity hardware using a low-cost software transactional memory

M Mehrara, J Hao, PC Hsu, S Mahlke - ACM Sigplan Notices, 2009 - dl.acm.org
Multicore designs have emerged as the mainstream design paradigm for the microprocessor
industry. Unfortunately, providing multiple cores does not directly translate into performance …

HELIX-RC: An architecture-compiler co-design for automatic parallelization of irregular programs

S Campanoni, K Brownell, S Kanev, TM Jones… - ACM SIGARCH …, 2014 - dl.acm.org
Data dependences in sequential programs limit parallelization because extracted threads
cannot run independently. Although thread-level speculation can avoid the need for precise …

HELIX-UP: Relaxing program semantics to unleash parallelization

S Campanoni, G Holloway, GY Wei… - 2015 IEEE/ACM …, 2015 - ieeexplore.ieee.org
Automatic generation of parallel code for general-purpose commodity processors is a
challenging computational problem. Nevertheless, there is a lot of latent thread-level …

Dynamic trace-based analysis of vectorization potential of applications

J Holewinski, R Ramamurthi, M Ravishankar… - Proceedings of the 33rd …, 2012 - dl.acm.org
Recent hardware trends with GPUs and the increasing vector lengths of SSE-like ISA
extensions for multicore CPUs imply that effective exploitation of SIMD parallelism is critical …

Kismet: parallel speedup estimates for serial programs

D Jeon, S Garcia, C Louie, MB Taylor - Proceedings of the 2011 ACM …, 2011 - dl.acm.org
Software engineers now face the difficult task of refactoring serial programs for parallel
execution on multicore processors. Currently, they are offered little guidance as to how much …