Leveraging Cache Coherence to Detect and Repair False Sharing On-the-fly

V Patel, S Biswas, M Chaudhuri - 2024 57th IEEE/ACM …, 2024 - ieeexplore.ieee.org
Performance bugs due to false sharing do not man-ifest as observable correctness errors,
and hence are challenging to detect and repair. Prior approaches aim to both detect and …

CELLO: Compiler-Assisted Efficient Load-Load Ordering in Data-Race-Free Regions

S Singh, J Feliu, ME Acacio… - 2023 32nd …, 2023 - ieeexplore.ieee.org
Efficient Total Store Order (TSO) implementations allow loads to execute speculatively out-of-
order. To detect order violations, the load queue (LQ) holds all the in-flight loads and is …

Neat: Low-Complexity, Efficient On-Chip Cache Coherence

R Zhang, S Biswas, V Balaji, MD Bond… - arXiv preprint arXiv …, 2021 - arxiv.org
Cache coherence protocols such as MESI that use writer-initiated invalidation have high
complexity and sometimes have poor performance and energy usage, especially under …

Peacenik: Architecture support for not failing under fail-stop memory consistency

R Zhang, S Biswas, V Balaji, MD Bond… - Proceedings of the Twenty …, 2020 - dl.acm.org
Modern shared-memory systems have erroneous, undefined behavior for programs that are
not well synchronized. A promising solution is to provide fail-stop memory consistency …

[图书][B] Architecture and Compiler Support for Parallel Consistency, Coherence, and Security

R Zhang - 2020 - search.proquest.com
The widespread use of multicore processors has made parallelism a necessity for
performance. However, parallelism allows programs to share physical computing resources …