NDPBridge: Enabling Cross-Bank Coordination in Near-DRAM-Bank Processing Architectures

B Tian, Y Li, L Jiang, S Cai… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Various near-data processing (NDP) designs have been proposed to alleviate the memory
wall challenge for data-intensive applications. Among them, near-DRAM-bank NDP …

HoPP: Hardware-Software Co-Designed Page Prefetching for Disaggregated Memory

H Li, K Liu, T Liang, Z Li, T Lu, H Yuan… - … Symposium on High …, 2023 - ieeexplore.ieee.org
Memory disaggregation is a promising direction to mitigate memory contention in
datacenters. To make memory disaggregation practical, prior efforts expose remote memory …

Polynesia: Enabling High-Performance and Energy-Efficient Hybrid Transactional/Analytical Databases with Hardware/Software Co-Design

A Boroumand, S Ghose, GF Oliveira… - 2022 IEEE 38th …, 2022 - ieeexplore.ieee.org
A growth in data volume, combined with increasing demand for real-time analysis (using the
most recent data), has resulted in the emergence of database systems that concurrently …

Experiences with ML-driven design: A NoC case study

J Yin, S Sethumurugan, Y Eckert… - … Symposium on High …, 2020 - ieeexplore.ieee.org
There has been a lot of recent interest in applying machine learning (ML) to the design of
systems, which purports to aid human experts in extracting new insights leading to better …

Active-routing: Compute on the way for near-data processing

J Huang, RR Puli, P Majumder, S Kim… - … symposium on high …, 2019 - ieeexplore.ieee.org
The explosion of data availability and the demand for faster data analysis have led to the
emergence of applications exhibiting large memory footprint and low data reuse rate. These …

Toward more efficient NoC arbitration: A deep reinforcement learning approach

J Yin, Y Eckert, S Che, M Oskin… - Proc. IEEE 1st Int …, 2018 - academia.edu
The network on-chip (NoC) is a critical resource shared by various on-chip components. An
efficient NoC arbitration policy is crucial in providing global fairness and improving system …

String figure: A scalable and elastic memory network architecture

M Ogleari, Y Yu, C Qian, E Miller… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
Demand for server memory capacity and performance is rapidly increasing due to
expanding working set sizes of modern applications, such as big data analytics, in-memory …

Stream-Based Data Placement for Near-Data Processing with Extended Memory

Y Li, B Tian, Y Ren, M Gao - 2024 57th IEEE/ACM International …, 2024 - ieeexplore.ieee.org
The data access bottleneck in memory-intensive applications has motivated various
architectural innovations in the main memory system, with Near-Data Processing (NDP) and …

TAFE: Thread address footprint estimation for capturing data/thread locality in GPU systems

K Punniyamurthy, A Gerstlauer - … of the ACM International Conference on …, 2020 - dl.acm.org
In multi-GPU and multi-chiplet GPU systems exhibiting NUMA behavior, information about
addresses accessed by threads is crucial for various optimizations such as data/thread co …

Enabling high-performance and energy-efficient hybrid transactional/analytical databases with hardware/software cooperation

A Boroumand, S Ghose, GF Oliveira… - arXiv preprint arXiv …, 2022 - arxiv.org
A growth in data volume, combined with increasing demand for real-time analysis (using the
most recent data), has resulted in the emergence of database systems that concurrently …