Survey of scheduling techniques for addressing shared resources in multicore processors
S Zhuravlev, JC Saez, S Blagodurov… - ACM Computing …, 2012 - dl.acm.org
Chip multicore processors (CMPs) have emerged as the dominant architecture choice for
modern computing platforms and will most likely continue to be dominant well into the …
modern computing platforms and will most likely continue to be dominant well into the …
ZSim: Fast and accurate microarchitectural simulation of thousand-core systems
D Sanchez, C Kozyrakis - ACM SIGARCH Computer architecture news, 2013 - dl.acm.org
Architectural simulation is time-consuming, and the trend towards hundreds of cores is
making sequential simulation even slower. Existing parallel simulation techniques either …
making sequential simulation even slower. Existing parallel simulation techniques either …
{mTCP}: a highly scalable user-level {TCP} stack for multicore systems
Scaling the performance of short TCP connections on multicore systems is fundamentally
challenging. Although many proposals have attempted to address various shortcomings …
challenging. Although many proposals have attempted to address various shortcomings …
The multikernel: a new OS architecture for scalable multicore systems
Commodity computer systems contain more and more processor cores and exhibit
increasingly diverse architectural tradeoffs, including memory hierarchies, interconnects …
increasingly diverse architectural tradeoffs, including memory hierarchies, interconnects …
PacketShader: a GPU-accelerated software router
We present PacketShader, a high-performance software router framework for general packet
processing with Graphics Processing Unit (GPU) acceleration. PacketShader exploits the …
processing with Graphics Processing Unit (GPU) acceleration. PacketShader exploits the …
OpenPiton: An open source manycore research framework
Industry is building larger, more complex, manycore processors on the back of strong
institutional knowledge, but academic projects face difficulties in replicating that scale. To …
institutional knowledge, but academic projects face difficulties in replicating that scale. To …
Traffic management: a holistic approach to memory placement on NUMA systems
M Dashti, A Fedorova, J Funston, F Gaud… - ACM SIGPLAN …, 2013 - dl.acm.org
NUMA systems are characterized by Non-Uniform Memory Access times, where accessing
data in a remote node takes longer than a local access. NUMA hardware has been built …
data in a remote node takes longer than a local access. NUMA hardware has been built …
Energy efficient allocation of virtual machines in cloud data centers
A Beloglazov, R Buyya - … on Cluster, Cloud and Grid Computing, 2010 - ieeexplore.ieee.org
Rapid growth of the demand for computational power has led to the creation of large-scale
data centers. They consume enormous amounts of electrical power resulting in high …
data centers. They consume enormous amounts of electrical power resulting in high …
[PDF][PDF] An analysis of Linux scalability to many cores
S Boyd-Wickizer, AT Clements, Y Mao… - … USENIX Symposium on …, 2010 - usenix.org
This paper analyzes the scalability of seven system applications (Exim, memcached,
Apache, PostgreSQL, gmake, Psearchy, and MapReduce) running on Linux on a 48-core …
Apache, PostgreSQL, gmake, Psearchy, and MapReduce) running on Linux on a 48-core …
Everything you always wanted to know about synchronization but were afraid to ask
This paper presents the most exhaustive study of synchronization to date. We span multiple
layers, from hardware cache-coherence protocols up to high-level concurrent software. We …
layers, from hardware cache-coherence protocols up to high-level concurrent software. We …