A survey on thread-level speculation techniques

L Gonnord, L Henrio, L Morel, G Radanne - ACM Computing Surveys, 2023 - dl.acm.org

Parallelism is often required for performance. In these situations an excess of non-
determinism is harmful as it means the program can have several different behaviours or …

被引用次数：8 相关文章所有 6 个版本

[PDF] gatech.edu

Batch-aware unified memory management in GPUs for irregular workloads

H Kim, J Sim, P Gera, R Hadidi, H Kim - Proceedings of the Twenty-Fifth …, 2020 - dl.acm.org

While unified virtual memory and demand paging in modern GPUs provide convenient
abstractions to programmers for working with large-scale applications, they come at a …

被引用次数：81 相关文章所有 3 个版本

[PDF] arxiv.org

Block-stm: Scaling blockchain execution by turning ordering curse to a performance blessing

R Gelashvili, A Spiegelman, Z Xiang… - Proceedings of the 28th …, 2023 - dl.acm.org

Block-STM is a parallel execution engine for smart contracts, built around the principles of
Software Transactional Memory. Transactions are grouped in blocks, and every execution of …

被引用次数：44 相关文章所有 8 个版本

SIMD programming using Intel vector extensions

H Amiri, A Shahbahrami - Journal of Parallel and Distributed Computing, 2020 - Elsevier

Single instruction multiple data (SIMD) extensions are one of the most significant capabilities
of recent General Purpose Processors (GPPs) which improves the performance of …

被引用次数：62 相关文章所有 4 个版本

[PDF] mit.edu

T4: Compiling sequential code for effective speculative parallelization in hardware

VA Ying, MC Jeffrey, D Sanchez - 2020 ACM/IEEE 47th Annual …, 2020 - ieeexplore.ieee.org

Multicores are now ubiquitous, but programmers still write sequential code. Speculative
parallelization is an enticing approach to parallelize code while retaining the ease of …

被引用次数：35 相关文章所有 13 个版本

[PDF] researchgate.net

Adaptive parallel Louvain community detection on a multicore platform

M Fazlali, E Moradi, HT Malazi - Microprocessors and microsystems, 2017 - Elsevier

Community detection is a demanded technique in analyzing complex and massive graph-
based networks. The quality of the detected communities in an acceptable time is an …

被引用次数：43 相关文章所有 5 个版本

Parallel computing: review and perspective

Y Li, Z Zhang - … Conference on Information Science and Control …, 2018 - ieeexplore.ieee.org

As parallelism on different levels becomes ubiquitous in today's computers, it seems
worthwhile to provide a review of the wealth of every aspect of parallel computing that has …

被引用次数：15 相关文章所有 2 个版本

[PDF] mit.edu

Harmonizing speculative and non-speculative execution in architectures for ordered parallelism

MC Jeffrey, VA Ying, S Subramanian… - 2018 51st Annual …, 2018 - ieeexplore.ieee.org

Multicore systems should support both speculative and non-speculative parallelism.
Speculative parallelism is easy to use and is crucial to scale many challenging applications …

被引用次数：21 相关文章所有 10 个版本

[PDF] arxiv.org

Specx: a C++ task-based runtime system for heterogeneous distributed architectures

P Cardosi, B Bramas - arXiv preprint arXiv:2308.15964, 2023 - arxiv.org

Parallelization is needed everywhere, from laptops and mobile phones to supercomputers.
Among parallel programming models, task-based programming has demonstrated a …

被引用次数：3 相关文章所有 4 个版本

[PDF] mit.edu

SAM: Optimizing multithreaded cores for speculative parallelism

M Abeydeera, S Subramanian… - 2017 26th …, 2017 - ieeexplore.ieee.org

This work studies the interplay between multithreaded cores and speculative parallelism
(eg, transactional memory or thread-level speculation). These techniques are often used …

被引用次数：21 相关文章所有 9 个版本