FireSim: FPGA-accelerated cycle-exact scale-out system simulation in the public cloud

S Karandikar, H Mao, D Kim, D Biancolin… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org
We present FireSim, an open-source simulation platform that enables cycle-exact
microarchitectural simulation of large scale-out clusters by combining FPGA-accelerated …

Softsku: Optimizing server architectures for microservice diversity@ scale

A Sriraman, A Dhanotia, TF Wenisch - Proceedings of the 46th …, 2019 - dl.acm.org
The variety and complexity of microservices in warehouse-scale data centers has grown
precipitously over the last few years to support a growing user base and an evolving product …

The mondrian data engine

M Drumond, A Daglis, N Mirzadeh, D Ustiugov… - ACM SIGARCH …, 2017 - dl.acm.org
The increasing demand for extracting value out of ever-growing data poses an ongoing
challenge to system designers, a task only made trickier by the end of Dennard scaling. As …

Memory hierarchy for web search

G Ayers, JH Ahn, C Kozyrakis… - … Symposium on High …, 2018 - ieeexplore.ieee.org
Online data-intensive services, such as search, serve billions of users, utilize millions of
cores, and comprise a significant and growing portion of datacenter-scale workloads …

Mitigating wordline crosstalk using adaptive trees of counters

SM Seyedzadeh, AK Jones… - 2018 ACM/IEEE 45th …, 2018 - ieeexplore.ieee.org
DRAM technology scaling has the undesirable side effect of degrading cell reliability. One
such concern of deeply scaled DRAMs is the increased coupling between adjacent cells …

Rebooting virtual memory with midgard

S Gupta, A Bhattacharyya, Y Oh… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Computer systems designers are building cache hierarchies with higher capacity to capture
the ever-increasing working sets of modern workloads. Cache hierarchies with higher …

COAXIAL: A CXL-Centric Memory System for Scalable Servers

A Cho, A Saxena, M Qureshi… - … Conference for High …, 2024 - ieeexplore.ieee.org
The memory system is a major performance determinant for server processors. Ever-
growing core counts and datasets demand higher memory bandwidth and capacity. DDR …

A case for cxl-centric server processors

A Cho, A Saxena, M Qureshi, A Daglis - arXiv preprint arXiv:2305.05033, 2023 - arxiv.org
The memory system is a major performance determinant for server processors. Ever-
growing core counts and datasets demand higher bandwidth and capacity as well as lower …

Scale-out ccNUMA: Exploiting skew with strongly consistent caching

V Gavrielatos, A Katsarakis, A Joshi, N Oswald… - Proceedings of the …, 2018 - dl.acm.org
Today's cloud based online services are underpinned by distributed key-value stores (KVS).
Such KVS typically use a scale-out architecture, whereby the dataset is partitioned across a …

Near-memory address translation

J Picorel, D Jevdjic, B Falsafi - 2017 26th International …, 2017 - ieeexplore.ieee.org
Memory and logic integration on the same chip is becoming increasingly cost effective,
creating the opportunity to offload data-intensive functionality to processing units placed …