Lord of the ring (s): Side channel attacks on the {CPU}{On-Chip} ring interconnect are practical
We introduce the first microarchitectural side channel attacks that leverage contention on the
CPU ring interconnect. There are two challenges that make it uniquely difficult to exploit this …
CPU ring interconnect. There are two challenges that make it uniquely difficult to exploit this …
RAMBDA: RDMA-driven Acceleration Framework for Memory-intensive µs-scale Datacenter Applications
Responding to the" datacenter tax" and" killer microseconds" problems for memory-intensive
datacenter applications, diverse solutions including Smart NIC-based ones have been …
datacenter applications, diverse solutions including Smart NIC-based ones have been …
Reexamining Direct Cache Access to Optimize {I/O} Intensive Applications for Multi-hundred-gigabit Networks
Memory access is the major bottleneck in realizing multi-hundred-gigabit networks with
commodity hardware, hence it is essential to make good use of cache memory that is a …
commodity hardware, hence it is essential to make good use of cache memory that is a …
MeshUp: Stateless cache side-channel attack on CPU mesh
Cache side-channel attacks lead to severe security threats to the settings where a CPU is
shared across users, eg, in the cloud. The majority of attacks rely on sensing the micro …
shared across users, eg, in the cloud. The majority of attacks rely on sensing the micro …
RSS++ load and state-aware receive side scaling
While the current literature typically focuses on load-balancing among multiple servers, in
this paper, we demonstrate the importance of load-balancing within a single machine …
this paper, we demonstrate the importance of load-balancing within a single machine …
A closer look at intel resource director technology (rdt)
Unarbitrated contention over shared resources at different levels of the memory hierarchy
represents a major source of temporal interference. Hardware manufacturers are …
represents a major source of temporal interference. Hardware manufacturers are …
nanoBench: A low-overhead tool for running microbenchmarks on x86 systems
We present nanoBench, a tool for evaluating small microbenchmarks using hardware
performance counters on Intel and AMD x86 systems. Most existing tools and libraries are …
performance counters on Intel and AMD x86 systems. Most existing tools and libraries are …
APT-GET: profile-guided timely software prefetching
Prefetching which predicts future memory accesses and preloads them from main memory,
is a widely-adopted technique to overcome the processor-memory performance gap …
is a widely-adopted technique to overcome the processor-memory performance gap …
PacketMill: toward per-Core 100-Gbps networking
We present PacketMill, a system for optimizing software packet processing, which (i)
introduces a new model to efficiently manage packet metadata and (ii) employs code …
introduces a new model to efficiently manage packet metadata and (ii) employs code …
The NEBULA RPC-optimized architecture
Large-scale online services are commonly structured as a network of software tiers, which
communicate over the datacenter network using RPCs. Ongoing trends towards software …
communicate over the datacenter network using RPCs. Ongoing trends towards software …