A survey on parallelism and determinism

L Gonnord, L Henrio, L Morel, G Radanne - ACM Computing Surveys, 2023 - dl.acm.org
Parallelism is often required for performance. In these situations an excess of non-
determinism is harmful as it means the program can have several different behaviours or …

Batch-aware unified memory management in GPUs for irregular workloads

H Kim, J Sim, P Gera, R Hadidi, H Kim - Proceedings of the Twenty-Fifth …, 2020 - dl.acm.org
While unified virtual memory and demand paging in modern GPUs provide convenient
abstractions to programmers for working with large-scale applications, they come at a …

Block-stm: Scaling blockchain execution by turning ordering curse to a performance blessing

R Gelashvili, A Spiegelman, Z Xiang… - Proceedings of the 28th …, 2023 - dl.acm.org
Block-STM is a parallel execution engine for smart contracts, built around the principles of
Software Transactional Memory. Transactions are grouped in blocks, and every execution of …

SIMD programming using Intel vector extensions

H Amiri, A Shahbahrami - Journal of Parallel and Distributed Computing, 2020 - Elsevier
Single instruction multiple data (SIMD) extensions are one of the most significant capabilities
of recent General Purpose Processors (GPPs) which improves the performance of …

T4: Compiling sequential code for effective speculative parallelization in hardware

VA Ying, MC Jeffrey, D Sanchez - 2020 ACM/IEEE 47th Annual …, 2020 - ieeexplore.ieee.org
Multicores are now ubiquitous, but programmers still write sequential code. Speculative
parallelization is an enticing approach to parallelize code while retaining the ease of …

Adaptive parallel Louvain community detection on a multicore platform

M Fazlali, E Moradi, HT Malazi - Microprocessors and microsystems, 2017 - Elsevier
Community detection is a demanded technique in analyzing complex and massive graph-
based networks. The quality of the detected communities in an acceptable time is an …

Parallel computing: review and perspective

Y Li, Z Zhang - … Conference on Information Science and Control …, 2018 - ieeexplore.ieee.org
As parallelism on different levels becomes ubiquitous in today's computers, it seems
worthwhile to provide a review of the wealth of every aspect of parallel computing that has …

Harmonizing speculative and non-speculative execution in architectures for ordered parallelism

MC Jeffrey, VA Ying, S Subramanian… - 2018 51st Annual …, 2018 - ieeexplore.ieee.org
Multicore systems should support both speculative and non-speculative parallelism.
Speculative parallelism is easy to use and is crucial to scale many challenging applications …

Specx: a C++ task-based runtime system for heterogeneous distributed architectures

P Cardosi, B Bramas - arXiv preprint arXiv:2308.15964, 2023 - arxiv.org
Parallelization is needed everywhere, from laptops and mobile phones to supercomputers.
Among parallel programming models, task-based programming has demonstrated a …

SAM: Optimizing multithreaded cores for speculative parallelism

M Abeydeera, S Subramanian… - 2017 26th …, 2017 - ieeexplore.ieee.org
This work studies the interplay between multithreaded cores and speculative parallelism
(eg, transactional memory or thread-level speculation). These techniques are often used …