NDPBridge: Enabling Cross-Bank Coordination in Near-DRAM-Bank Processing Architectures
Various near-data processing (NDP) designs have been proposed to alleviate the memory
wall challenge for data-intensive applications. Among them, near-DRAM-bank NDP …
HoPP: Hardware-Software Co-Designed Page Prefetching for Disaggregated Memory
Memory disaggregation is a promising direction to mitigate memory contention in
datacenters. To make memory disaggregation practical, prior efforts expose remote memory …
Polynesia: Enabling High-Performance and Energy-Efficient Hybrid Transactional/Analytical Databases with Hardware/Software Co-Design
A growth in data volume, combined with increasing demand for real-time analysis (using the
most recent data), has resulted in the emergence of database systems that concurrently …
Experiences with ML-Driven Design: A NoC Case Study
J Yin, S Sethumurugan, Y Eckert… - … Symposium on High …, 2020 - ieeexplore.ieee.org
There has been a lot of recent interest in applying machine learning (ML) to the design of
systems, which purports to aid human experts in extracting new insights leading to better …
Active-Routing: Compute on the Way for Near-Data Processing
The explosion of data availability and the demand for faster data analysis have led to the
emergence of applications exhibiting large memory footprint and low data reuse rate. These …
Toward More Efficient NoC Arbitration: A Deep Reinforcement Learning Approach
The network on-chip (NoC) is a critical resource shared by various on-chip components. An
efficient NoC arbitration policy is crucial in providing global fairness and improving system …
String Figure: A Scalable and Elastic Memory Network Architecture
Demand for server memory capacity and performance is rapidly increasing due to
expanding working set sizes of modern applications, such as big data analytics, in-memory …
Stream-Based Data Placement for Near-Data Processing with Extended Memory
The data access bottleneck in memory-intensive applications has motivated various
architectural innovations in the main memory system, with Near-Data Processing (NDP) and …
TAFE: Thread Address Footprint Estimation for Capturing Data/Thread Locality in GPU Systems
K Punniyamurthy, A Gerstlauer - … of the ACM International Conference on …, 2020 - dl.acm.org
In multi-GPU and multi-chiplet GPU systems exhibiting NUMA behavior, information about
addresses accessed by threads is crucial for various optimizations such as data/thread co …
Enabling High-Performance and Energy-Efficient Hybrid Transactional/Analytical Databases with Hardware/Software Cooperation
A growth in data volume, combined with increasing demand for real-time analysis (using the
most recent data), has resulted in the emergence of database systems that concurrently …