A survey of techniques for dynamic branch prediction

S Mittal - Concurrency and Computation: Practice and …, 2019 - Wiley Online Library
Branch predictor (BP) is an essential component in modern processors since high BP
accuracy can improve performance and reduce energy by decreasing the number of …

Design space exploration for 3D architectures

Y Xie, GH Loh, B Black, K Bernstein - ACM Journal on Emerging …, 2006 - dl.acm.org
As technology scales, interconnects have become a major performance bottleneck and a
major source of power consumption for microprocessors. Increasing interconnect costs …

Managing wire delay in large chip-multiprocessor caches

BM Beckmann, DA Wood - 37th International Symposium on …, 2004 - ieeexplore.ieee.org
In response to increasing (relative) wire delay, architects have proposed various
technologies to manage the impact of slow wires on large uniprocessor L2 caches. Block …

A case for (partially) tagged geometric history length branch prediction

A Seznec, P Michaud - The Journal of Instruction-Level Parallelism, 2006 - inria.hal.science
It is now widely admitted that in order to provide state-of-the-art accuracy, a conditional
branch predictor must combine several predictions. Recent research has shown that an …

A new case for the tage branch predictor

A Seznec - Proceedings of the 44th Annual IEEE/ACM …, 2011 - dl.acm.org
The TAGE predictor is often considered as state-of-the-art in conditional branch predictors
proposed by academy. In this paper, we first present directions to reduce the hardware …

Fast path-based neural branch prediction

DA Jiménez - … . 36th Annual IEEE/ACM International Symposium …, 2003 - ieeexplore.ieee.org
Microarchitectural prediction based on neural learning has received increasing attention in
recent years. However, neural prediction remains impractical because its superior accuracy …

Pointer cache assisted prefetching

J Collins, S Sair, B Calder… - 35th Annual IEEE/ACM …, 2002 - ieeexplore.ieee.org
Data prefetching effectively reduces the negative effects of long load latencies on the
performance of modern processors. Hardware prefetchers employ hardware structures to …

Thermal herding: Microarchitecture techniques for controlling hotspots in high-performance 3D-integrated processors

K Puttaswamy, GH Loh - 2007 IEEE 13th International …, 2007 - ieeexplore.ieee.org
3D integration technology greatly increases transistor density while providing faster on-chip
communication. 3D implementations of processors can simultaneously provide both latency …

Tage-sc-l branch predictors again

A Seznec - 5th JILP Workshop on Computer Architecture …, 2016 - inria.hal.science
Outline In this study, we explore the performance limits of these TAGE-SC-L predictors for
respectively 8Kbytes and 64Kbytes of storage budget. For a 8KB storage budget, our …

Analysis of the o-geometric history length branch predictor

A Seznec - … Symposium on Computer Architecture (ISCA'05), 2005 - ieeexplore.ieee.org
In this paper, we introduce and analyze the Optimized GEometric History Length (O-GEHL)
branch Predictor that efficiently exploits very long global histories in the 100-200 bits range …