Adaptive history-based memory schedulers

S Zhuravlev, JC Saez, S Blagodurov… - ACM Computing …, 2012 - dl.acm.org

Chip multicore processors (CMPs) have emerged as the dominant architecture choice for
modern computing platforms and will most likely continue to be dominant well into the …

被引用次数：232 相关文章所有 9 个版本

System, method, and computer program product for improving memory systems

MS Smith - US Patent 9,432,298, 2016 - Google Patents

H01L25/18—Assemblies consisting of a plurality of individual semiconductor or other solid
state devices; Multistep manufacturing processes thereof the devices being of types …

被引用次数：848 相关文章所有 2 个版本

[PDF] ieee.org

DAMOV: A new methodology and benchmark suite for evaluating data movement bottlenecks

GF Oliveira, J Gómez-Luna, L Orosa, S Ghose… - IEEE …, 2021 - ieeexplore.ieee.org

Data movement between the CPU and main memory is a first-order obstacle against improv
ing performance, scalability, and energy efficiency in modern systems. Computer systems …

被引用次数：95 相关文章所有 10 个版本

[图书][B] Memory systems: cache, DRAM, disk

B Jacob, D Wang, S Ng - 2010 - books.google.com

Is your memory hierarchy stopping your microprocessor from performing at the high level it
should be? Memory Systems: Cache, DRAM, Disk shows you how to resolve this problem …

被引用次数：1162 相关文章所有 6 个版本

[PDF] wisc.edu

3D-stacked memory architectures for multi-core processors

GH Loh - ACM SIGARCH computer architecture news, 2008 - dl.acm.org

Three-dimensional integration enables stacking memory directly on top of a microprocessor,
thereby significantly reducing wire delay between the two. Previous studies have examined …

被引用次数：934 相关文章所有 19 个版本

[PDF] cmu.edu

Thread cluster memory scheduling: Exploiting differences in memory access behavior

Y Kim, M Papamichael, O Mutlu… - 2010 43rd Annual …, 2010 - ieeexplore.ieee.org

In a modern chip-multiprocessor system, memory is a shared resource among multiple
concurrently executing threads. The memory scheduling algorithm should resolve memory …

被引用次数：562 相关文章所有 28 个版本

[PDF] arxiv.org

Figaro: Improving system performance via fine-grained in-dram data relocation and caching

Y Wang, L Orosa, X Peng, Y Guo… - 2020 53rd Annual …, 2020 - ieeexplore.ieee.org

Main memory, composed of DRAM, is a performance bottleneck for many applications, due
to the high DRAM access latency. In-DRAM caches work to mitigate this latency by …

被引用次数：87 相关文章所有 22 个版本

[PDF] cam.ac.uk

Self-optimizing memory controllers: A reinforcement learning approach

E Ipek, O Mutlu, JF Martínez, R Caruana - ACM SIGARCH Computer …, 2008 - dl.acm.org

Efficiently utilizing off-chip DRAM bandwidth is a critical issuein designing cost-effective,
high-performance chip multiprocessors (CMPs). Conventional memory controllers deliver …

被引用次数：646 相关文章所有 20 个版本

[PDF] cmu.edu

ATLAS: A scalable and high-performance scheduling algorithm for multiple memory controllers

Y Kim, D Han, O Mutlu… - HPCA-16 2010 The …, 2010 - ieeexplore.ieee.org

Modern chip multiprocessor (CMP) systems employ multiple memory controllers to control
access to main memory. The scheduling algorithm employed by these memory controllers …

被引用次数：573 相关文章所有 24 个版本

[PDF] psu.edu

Parallelism-aware batch scheduling: Enhancing both performance and fairness of shared DRAM systems

O Mutlu, T Moscibroda - ACM SIGARCH Computer Architecture News, 2008 - dl.acm.org

In a chip-multiprocessor (CMP) system, the DRAM system isshared among cores. In a
shared DRAM system, requests from athread can not only delay requests from other threads …

被引用次数：751 相关文章所有 10 个版本