Amdahl's law in the multicore era

MD Hill, MR Marty - Computer, 2008 - ieeexplore.ieee.org
Augmenting Amdahl's law with a corollary for multicore hardware makes it relevant to future
generations of chips with multiple processor cores. Obtaining optimal multicore performance …

LogTM: Log-based transactional memory

KE Moore, J Bobba, MJ Moravan… - … Symposium on High …, 2006 - ieeexplore.ieee.org
Transactional memory (TM) simplifies parallel programming by guaranteeing that
transactions appear to execute atomically and in isolation. Implementing these properties …

Multithreaded processors

T Ungerer, B Robič, J Šilc - The Computer Journal, 2002 - academic.oup.com
The instruction-level parallelism found in a conventional instruction stream is limited. Studies
have shown the limits of processor utilization even for today's superscalar microprocessors …

CoNDA: Efficient cache coherence support for near-data accelerators

A Boroumand, S Ghose, M Patel, H Hassan… - Proceedings of the 46th …, 2019 - dl.acm.org
Specialized on-chip accelerators are widely used to improve the energy efficiency of
computing systems. Recent advances in memory technology have enabled near-data …

Smart memories: A modular reconfigurable architecture

K Mai, T Paaske, N Jayasena, R Ho, WJ Dally… - Proceedings of the 27th …, 2000 - dl.acm.org
Trends in VLSI technology scaling demand that future computing devices be narrowly
focused to achieve high performance and high efficiency, yet also target the high volumes …

Piranha: A scalable architecture based on single-chip multiprocessing

LA Barroso, K Gharachorloo, R McNamara… - ACM SIGARCH …, 2000 - dl.acm.org
The microprocessor industry is currently struggling with higher development costs and
longer design times that arise from exceedingly complex processors that are pushing the …

A scalable approach to thread-level speculation

JG Steffan, CB Colohan, A Zhai, TC Mowry - ACM SIGARCH Computer …, 2000 - dl.acm.org
While architects understand how to build cost-effective parallel machines across a wide
spectrum of machine sizes (ranging from within a single chip to large-scale servers), the real …

DMP: Deterministic shared memory multiprocessing

J Devietti, B Lucia, L Ceze, M Oskin - Proceedings of the 14th …, 2009 - dl.acm.org
Current shared memory multicore and multiprocessor systems are nondeterministic. Each
time these systems execute a multithreaded application, even if supplied with the same …

ACE: exploiting correlation for energy-efficient and continuous context sensing

S Nath - Proceedings of the 10th international conference on …, 2012 - dl.acm.org
We propose ACE (Acquisitional Context Engine), a middleware that supports continuous
context-aware applications while mitigating sensing costs for inferring contexts. ACE …

[图书][B] Transactional memory

JR Larus, R Rajwar - 2022 - books.google.com
The advent of multicore processors has renewed interest in the idea of incorporating
transactions into the programming model used to write parallel programs. This approach …