Managing cache coherency in a data processing apparatus
US PATENT DOCUMENTS 6,272.520 B1 8/2001 Sharangpani et al. 6,338,123 B2* 1/2002
Joseph et al.................. 711/144 6,704,845 B2* 3/2004 Anderson et al.............. 711.146 …
Joseph et al.................. 711/144 6,704,845 B2* 3/2004 Anderson et al.............. 711.146 …
An optimized 3D-stacked memory architecture by exploiting excessive, high-density TSV bandwidth
DH Woo, NH Seong, DL Lewis… - HPCA-16 2010 The …, 2010 - ieeexplore.ieee.org
Memory bandwidth has become a major performance bottleneck as more and more cores
are integrated onto a single die, demanding more and more data from the system memory …
are integrated onto a single die, demanding more and more data from the system memory …
Cooperative caching for chip multiprocessors
J Chang, GS Sohi - ACM SIGARCH Computer Architecture News, 2006 - dl.acm.org
This paper presents CMP Cooperative Caching, a unified framework to manage a CMP's
aggregate on-chip cache resources. Cooperative caching combines the strengths of private …
aggregate on-chip cache resources. Cooperative caching combines the strengths of private …
DeNovo: Rethinking the memory hierarchy for disciplined parallelism
B Choi, R Komuravelli, H Sung… - 2011 International …, 2011 - ieeexplore.ieee.org
For parallelism to become tractable for mass programmers, shared-memory languages and
environments must evolve to enforce disciplined practices that ban" wild shared-memory …
environments must evolve to enforce disciplined practices that ban" wild shared-memory …
Heterogeneous system coherence for integrated CPU-GPU systems
Many future heterogeneous systems will integrate CPUs and GPUs physically on a single
chip and logically connect them via shared memory to avoid explicit data copying. Making …
chip and logically connect them via shared memory to avoid explicit data copying. Making …
SCORPIO: A 36-core research chip demonstrating snoopy coherence on a scalable mesh NoC with in-network ordering
In the many-core era, scalable coherence and on-chip interconnects are crucial for shared
memory processors. While snoopy coherence is common in small multicore systems …
memory processors. While snoopy coherence is common in small multicore systems …
Increasing the effectiveness of directory caches by deactivating coherence for private memory blocks
To meet the demand for more powerful high-performance shared-memory servers,
multiprocessor systems must incorporate efficient and scalable cache coherence protocols …
multiprocessor systems must incorporate efficient and scalable cache coherence protocols …
CHOP: Adaptive filter-based DRAM caching for CMP server platforms
As manycore architectures enable a large number of cores on the die, a key challenge that
emerges is the availability of memory bandwidth with conventional DRAM solutions. To …
emerges is the availability of memory bandwidth with conventional DRAM solutions. To …
A tagless coherence directory
A key challenge in architecting a CMP with many cores is maintaining cache coherence in
an efficient manner. Directory-based protocols avoid the bandwidth overhead of snoop …
an efficient manner. Directory-based protocols avoid the bandwidth overhead of snoop …
Zero-content augmented caches
J Dusser, T Piquet, A Seznec - … of the 23rd international conference on …, 2009 - dl.acm.org
It has been observed that some applications manipulate large amounts of null data.
Moreover these zero data often exhibit high spatial locality. On some applications more than …
Moreover these zero data often exhibit high spatial locality. On some applications more than …