Lord of the Ring(s): Side channel attacks on the CPU on-chip ring interconnect are practical

R Paccagnella, L Luo, CW Fletcher - 30th USENIX Security Symposium …, 2021 - usenix.org
We introduce the first microarchitectural side channel attacks that leverage contention on the
CPU ring interconnect. There are two challenges that make it uniquely difficult to exploit this …

RAMBDA: RDMA-driven Acceleration Framework for Memory-intensive µs-scale Datacenter Applications

Y Yuan, J Huang, Y Sun, T Wang… - … Symposium on High …, 2023 - ieeexplore.ieee.org
Responding to the "datacenter tax" and "killer microseconds" problems for memory-intensive
datacenter applications, diverse solutions including Smart NIC-based ones have been …

Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks

A Farshin, A Roozbeh, GQ Maguire Jr… - 2020 USENIX Annual …, 2020 - usenix.org
Memory access is the major bottleneck in realizing multi-hundred-gigabit networks with
commodity hardware, hence it is essential to make good use of cache memory that is a …

MeshUp: Stateless cache side-channel attack on CPU mesh

J Wan, Y Bi, Z Zhou, Z Li - 2022 IEEE Symposium on Security …, 2022 - ieeexplore.ieee.org
Cache side-channel attacks lead to severe security threats to the settings where a CPU is
shared across users, e.g., in the cloud. The majority of attacks rely on sensing the micro …

RSS++: load and state-aware receive side scaling

T Barbette, GP Katsikas, GQ Maguire Jr… - Proceedings of the 15th …, 2019 - dl.acm.org
While the current literature typically focuses on load-balancing among multiple servers, in
this paper, we demonstrate the importance of load-balancing within a single machine …

A closer look at Intel Resource Director Technology (RDT)

P Sohal, M Bechtel, R Mancuso, H Yun… - Proceedings of the 30th …, 2022 - dl.acm.org
Unarbitrated contention over shared resources at different levels of the memory hierarchy
represents a major source of temporal interference. Hardware manufacturers are …

nanoBench: A low-overhead tool for running microbenchmarks on x86 systems

A Abel, J Reineke - … on Performance Analysis of Systems and …, 2020 - ieeexplore.ieee.org
We present nanoBench, a tool for evaluating small microbenchmarks using hardware
performance counters on Intel and AMD x86 systems. Most existing tools and libraries are …

APT-GET: profile-guided timely software prefetching

S Jamilan, TA Khan, G Ayers, B Kasikci… - Proceedings of the …, 2022 - dl.acm.org
Prefetching, which predicts future memory accesses and preloads them from main memory,
is a widely-adopted technique to overcome the processor-memory performance gap …

PacketMill: toward per-Core 100-Gbps networking

A Farshin, T Barbette, A Roozbeh… - Proceedings of the 26th …, 2021 - dl.acm.org
We present PacketMill, a system for optimizing software packet processing, which (i)
introduces a new model to efficiently manage packet metadata and (ii) employs code …

The NEBULA RPC-optimized architecture

M Sutherland, S Gupta, B Falsafi… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
Large-scale online services are commonly structured as a network of software tiers, which
communicate over the datacenter network using RPCs. Ongoing trends towards software …