Managing cache coherency in a data processing apparatus

E Özer, SD Biles, SA Ford - US Patent 7,937,535, 2011 - Google Patents
US PATENT DOCUMENTS 6,272.520 B1 8/2001 Sharangpani et al. 6,338,123 B2* 1/2002
Joseph et al.................. 711/144 6,704,845 B2* 3/2004 Anderson et al.............. 711.146 …

An optimized 3D-stacked memory architecture by exploiting excessive, high-density TSV bandwidth

DH Woo, NH Seong, DL Lewis… - HPCA-16 2010 The …, 2010 - ieeexplore.ieee.org
Memory bandwidth has become a major performance bottleneck as more and more cores
are integrated onto a single die, demanding more and more data from the system memory …

Cooperative caching for chip multiprocessors

J Chang, GS Sohi - ACM SIGARCH Computer Architecture News, 2006 - dl.acm.org
This paper presents CMP Cooperative Caching, a unified framework to manage a CMP's
aggregate on-chip cache resources. Cooperative caching combines the strengths of private …

DeNovo: Rethinking the memory hierarchy for disciplined parallelism

B Choi, R Komuravelli, H Sung… - 2011 International …, 2011 - ieeexplore.ieee.org
For parallelism to become tractable for mass programmers, shared-memory languages and
environments must evolve to enforce disciplined practices that ban" wild shared-memory …

Heterogeneous system coherence for integrated CPU-GPU systems

J Power, A Basu, J Gu, S Puthoor… - Proceedings of the 46th …, 2013 - dl.acm.org
Many future heterogeneous systems will integrate CPUs and GPUs physically on a single
chip and logically connect them via shared memory to avoid explicit data copying. Making …

SCORPIO: A 36-core research chip demonstrating snoopy coherence on a scalable mesh NoC with in-network ordering

BK Daya, CHO Chen, S Subramanian… - ACM SIGARCH …, 2014 - dl.acm.org
In the many-core era, scalable coherence and on-chip interconnects are crucial for shared
memory processors. While snoopy coherence is common in small multicore systems …

Increasing the effectiveness of directory caches by deactivating coherence for private memory blocks

BA Cuesta, A Ros, ME Gómez, A Robles… - Proceedings of the 38th …, 2011 - dl.acm.org
To meet the demand for more powerful high-performance shared-memory servers,
multiprocessor systems must incorporate efficient and scalable cache coherence protocols …

CHOP: Adaptive filter-based DRAM caching for CMP server platforms

X Jiang, N Madan, L Zhao, M Upton… - HPCA-16 2010 The …, 2010 - ieeexplore.ieee.org
As manycore architectures enable a large number of cores on the die, a key challenge that
emerges is the availability of memory bandwidth with conventional DRAM solutions. To …

A tagless coherence directory

J Zebchuk, V Srinivasan, MK Qureshi… - Proceedings of the 42nd …, 2009 - dl.acm.org
A key challenge in architecting a CMP with many cores is maintaining cache coherence in
an efficient manner. Directory-based protocols avoid the bandwidth overhead of snoop …

Zero-content augmented caches

J Dusser, T Piquet, A Seznec - … of the 23rd international conference on …, 2009 - dl.acm.org
It has been observed that some applications manipulate large amounts of null data.
Moreover these zero data often exhibit high spatial locality. On some applications more than …