A survey on thread-level speculation techniques
A Estebanez, DR Llanos… - ACM Computing Surveys …, 2016 - dl.acm.org
Thread-Level Speculation (TLS) is a promising technique that allows the parallel execution
of sequential code without relying on a prior, compile-time-dependence analysis. In this …
Automatic construct selection and variable classification in OpenMP
A major task of parallelization with OpenMP is to decide where in a program to insert which
OpenMP construct such that speedup is maximized and correctness is preserved. Another …
A machine learning method to variable classification in OpenMP
Y Shen, M Peng, Q Wu, R Li - Future Generation Computer Systems, 2023 - Elsevier
OpenMP is a parallel computing framework that provides programmers with a set of
directives and clauses to use when writing parallel applications. The most important task in …
A job sizing strategy for high-throughput scientific workflows
B Tovar, RF da Silva, G Juve… - … on Parallel and …, 2017 - ieeexplore.ieee.org
The user of a computing facility must make a critical decision when submitting jobs for
execution: how many resources (such as cores, memory, and disk) should be requested for …
Using hardware-transactional-memory support to implement thread-level speculation
J Salamanca, JN Amaral… - IEEE transactions on …, 2017 - ieeexplore.ieee.org
This paper presents a detailed analysis of the application of Hardware Transactional
Memory (HTM) support for loop parallelization with Thread-Level Speculation (TLS) and …
Tasking framework for adaptive speculative parallel mesh generation
Handling the ever-increasing complexity of mesh generation codes along with the intricacies
of newer hardware often results in codes that are both difficult to comprehend and maintain …
Anthropomorphic diagnosis of runtime hidden behaviors in OpenMP multi-threaded applications
W Wang, D Li, W Luo, Y Kang, L Wang - Journal of Parallel and Distributed …, 2023 - Elsevier
Extreme-scale computing involves hundreds of millions of threads with multi-level
parallelism running on large-scale hierarchical and heterogeneous hardware. Some …
Toward emotion-aware computing: A loop selection approach based on machine learning for speculative multithreading
B Liu, J He, Y Geng, L Huang, S Li - IEEE Access, 2017 - ieeexplore.ieee.org
Emotion-aware computing can recognize, interpret, process, and simulate human affects.
These programs in this area are compute-intensive applications, so they need to be …
A unified framework for parallel anisotropic mesh adaptation
C Tsolakis - 2021 - search.proquest.com
Finite-element methods are a critical component of the design and analysis procedures of
many (bio-) engineering applications. Mesh adaptation is one of the most crucial …
A Proposal for Supporting Speculation in the OpenMP taskloop Construct
J Salamanca, A Baldassin - … Conquering the Full Hardware Spectrum: 15th …, 2019 - Springer
Parallelization constructs in OpenMP, such as parallel for or taskloop, are typically restricted
to loops that have no loop-carried dependencies (DOALL) or that contain well-known …