Amdahl's law in the multicore era
Augmenting Amdahl's law with a corollary for multicore hardware makes it relevant to future
generations of chips with multiple processor cores. Obtaining optimal multicore performance …
generations of chips with multiple processor cores. Obtaining optimal multicore performance …
LogTM: Log-based transactional memory
KE Moore, J Bobba, MJ Moravan… - … Symposium on High …, 2006 - ieeexplore.ieee.org
Transactional memory (TM) simplifies parallel programming by guaranteeing that
transactions appear to execute atomically and in isolation. Implementing these properties …
transactions appear to execute atomically and in isolation. Implementing these properties …
Multithreaded processors
T Ungerer, B Robič, J Šilc - The Computer Journal, 2002 - academic.oup.com
The instruction-level parallelism found in a conventional instruction stream is limited. Studies
have shown the limits of processor utilization even for today's superscalar microprocessors …
have shown the limits of processor utilization even for today's superscalar microprocessors …
CoNDA: Efficient cache coherence support for near-data accelerators
Specialized on-chip accelerators are widely used to improve the energy efficiency of
computing systems. Recent advances in memory technology have enabled near-data …
computing systems. Recent advances in memory technology have enabled near-data …
Smart memories: A modular reconfigurable architecture
Trends in VLSI technology scaling demand that future computing devices be narrowly
focused to achieve high performance and high efficiency, yet also target the high volumes …
focused to achieve high performance and high efficiency, yet also target the high volumes …
Piranha: A scalable architecture based on single-chip multiprocessing
LA Barroso, K Gharachorloo, R McNamara… - ACM SIGARCH …, 2000 - dl.acm.org
The microprocessor industry is currently struggling with higher development costs and
longer design times that arise from exceedingly complex processors that are pushing the …
longer design times that arise from exceedingly complex processors that are pushing the …
A scalable approach to thread-level speculation
JG Steffan, CB Colohan, A Zhai, TC Mowry - ACM SIGARCH Computer …, 2000 - dl.acm.org
While architects understand how to build cost-effective parallel machines across a wide
spectrum of machine sizes (ranging from within a single chip to large-scale servers), the real …
spectrum of machine sizes (ranging from within a single chip to large-scale servers), the real …
DMP: Deterministic shared memory multiprocessing
Current shared memory multicore and multiprocessor systems are nondeterministic. Each
time these systems execute a multithreaded application, even if supplied with the same …
time these systems execute a multithreaded application, even if supplied with the same …
ACE: exploiting correlation for energy-efficient and continuous context sensing
S Nath - Proceedings of the 10th international conference on …, 2012 - dl.acm.org
We propose ACE (Acquisitional Context Engine), a middleware that supports continuous
context-aware applications while mitigating sensing costs for inferring contexts. ACE …
context-aware applications while mitigating sensing costs for inferring contexts. ACE …