CoNDA: Efficient cache coherence support for near-data accelerators
Specialized on-chip accelerators are widely used to improve the energy efficiency of
computing systems. Recent advances in memory technology have enabled near-data …
computing systems. Recent advances in memory technology have enabled near-data …
SPACE: Sharing pattern-based directory coherence for multicore scalability
H Zhao, A Shriraman, S Dwarkadas - Proceedings of the 19th …, 2010 - dl.acm.org
An important challenge in multicore processors is the maintenance of cache coherence in a
scalable manner. Directory-based protocols save bandwidth and achieve scalability by …
scalable manner. Directory-based protocols save bandwidth and achieve scalability by …
SPATL: Honey, I shrunk the coherence directory
H Zhao, A Shriraman, S Dwarkadas… - 2011 International …, 2011 - ieeexplore.ieee.org
One of the key scalability challenges of on-chip coherence in a multicore chip is the
coherence directory, which provides information on sharing of cache blocks. Shadow tags …
coherence directory, which provides information on sharing of cache blocks. Shadow tags …
Practical Mechanisms for Reducing Processor–Memory Data Movement in Modern Workloads
A Boroumand - 2020 - search.proquest.com
Data movement between the memory system and computation units is one of the most
critical challenges in designing high performance and energy-efficient computing systems …
critical challenges in designing high performance and energy-efficient computing systems …
SpongeDirectory: Flexible sparse directories utilizing multi-level memristors
Cache-coherent shared memory is critical for programmability in many-core systems.
Several directory-based schemes have been proposed, but dynamic, non-uniform sharing …
Several directory-based schemes have been proposed, but dynamic, non-uniform sharing …
CANDY: Enabling coherent DRAM caches for multi-node systems
C Chou, A Jaleel, MK Qureshi - 2016 49th Annual IEEE/ACM …, 2016 - ieeexplore.ieee.org
This paper investigates the use of DRAM caches for multi-node systems. Current systems
architect the DRAM cache as Memory-Side Cache (MSC), restricting the DRAM cache to …
architect the DRAM cache as Memory-Side Cache (MSC), restricting the DRAM cache to …
Hierarchical cache directory for CMP
As more processing cores are integrated into one chip and feature size continues to shrink,
the average access latency for remote nodes using directory-based coherence protocol …
the average access latency for remote nodes using directory-based coherence protocol …
Zero directory eviction victim: Unbounded coherence directory and core cache isolation
M Chaudhuri - 2021 IEEE International Symposium on High …, 2021 - ieeexplore.ieee.org
A directory structure is traditionally employed for tracking coherence information of the
privately cached blocks in a cache-coherent chip-multiprocessor (CMP). The eviction of a …
privately cached blocks in a cache-coherent chip-multiprocessor (CMP). The eviction of a …
Manager-client pairing: A framework for implementing coherence hierarchies
As technology continues to scale, the need for more sophisticated coherence management
is becoming a necessity. The likely solution to this problem is the use of coherence …
is becoming a necessity. The likely solution to this problem is the use of coherence …
Sharing pattern-based directory coherence for multicore scalability (“SPACE”)
Z Hongzhou, A Shriraman, S Dwarkadas - US Patent 9,411,733, 2016 - Google Patents
A method and directory system that recognizes and represents the subset of sharing
patterns present in an application is provided. As used herein, the term sharing pattern …
patterns present in an application is provided. As used herein, the term sharing pattern …