Leveraging Cache Coherence to Detect and Repair False Sharing On-the-fly
V Patel, S Biswas, M Chaudhuri - 2024 57th IEEE/ACM …, 2024 - ieeexplore.ieee.org
Performance bugs due to false sharing do not man-ifest as observable correctness errors,
and hence are challenging to detect and repair. Prior approaches aim to both detect and …
and hence are challenging to detect and repair. Prior approaches aim to both detect and …
CELLO: Compiler-Assisted Efficient Load-Load Ordering in Data-Race-Free Regions
Efficient Total Store Order (TSO) implementations allow loads to execute speculatively out-of-
order. To detect order violations, the load queue (LQ) holds all the in-flight loads and is …
order. To detect order violations, the load queue (LQ) holds all the in-flight loads and is …
Neat: Low-Complexity, Efficient On-Chip Cache Coherence
Cache coherence protocols such as MESI that use writer-initiated invalidation have high
complexity and sometimes have poor performance and energy usage, especially under …
complexity and sometimes have poor performance and energy usage, especially under …
Peacenik: Architecture support for not failing under fail-stop memory consistency
Modern shared-memory systems have erroneous, undefined behavior for programs that are
not well synchronized. A promising solution is to provide fail-stop memory consistency …
not well synchronized. A promising solution is to provide fail-stop memory consistency …
[图书][B] Architecture and Compiler Support for Parallel Consistency, Coherence, and Security
R Zhang - 2020 - search.proquest.com
The widespread use of multicore processors has made parallelism a necessity for
performance. However, parallelism allows programs to share physical computing resources …
performance. However, parallelism allows programs to share physical computing resources …