A survey of techniques for dynamic branch prediction

S Mittal - Concurrency and Computation: Practice and …, 2019 - Wiley Online Library
Branch predictor (BP) is an essential component in modern processors since high BP
accuracy can improve performance and reduce energy by decreasing the number of …

A case for (partially) tagged geometric history length branch prediction

A Seznec, P Michaud - The Journal of Instruction-Level Parallelism, 2006 - inria.hal.science
It is now widely admitted that in order to provide state-of-the-art accuracy, a conditional
branch predictor must combine several predictions. Recent research has shown that an …

[PDF][PDF] BOOMv2: an open-source out-of-order RISC-V core

C Celio, PF Chiu, B Nikolic, DA Patterson… - First Workshop on …, 2017 - eecs.berkeley.edu
This paper presents BOOM version 2, an updated version of the Berkeley Out-of-Order
Machine first presented in [3]. The design exploration was performed through synthesis …

Increasing processor performance by implementing deeper pipelines

E Sprangle, D Carmean - ACM SIGARCH Computer Architecture News, 2002 - dl.acm.org
One architectural method for increasing processor performance involves increasing the
frequency by implementing deeper pipelines. This paper will explore the relationship …

Virtual machine showdown: Stack versus registers

Y Shi, K Casey, MA Ertl, D Gregg - ACM Transactions on Architecture …, 2008 - dl.acm.org
Virtual machines (VMs) enable the distribution of programs in an architecture-neutral format,
which can easily be interpreted or compiled. A long-running question in the design of VMs is …

Design tradeoffs for the Alpha EV8 conditional branch predictor

A Seznec, S Felix, V Krishnan, Y Sazeides - ACM SIGARCH Computer …, 2002 - dl.acm.org
This paper presents the Alpha EV8 conditional branch predictor The Alpha EV8
microprocessor project, canceled in June 2001 in a late phase of development, envisioned …

FabScalar: Composing synthesizable RTL designs of arbitrary cores within a canonical superscalar template

NK Choudhary, SV Wadhavkar, TA Shah… - ACM SIGARCH …, 2011 - dl.acm.org
A growing body of work has compiled a strong case for the single-ISA heterogeneous multi-
core paradigm. A single-ISA heterogeneous multi-core provides multiple, differently …

Fetch directed instruction prefetching

G Reinman, B Calder, T Austin - … of the 32nd Annual ACM/IEEE …, 1999 - ieeexplore.ieee.org
Instruction supply is a crucial component of processor performance. Instruction prefetching
has been proposed as a mechanism to help reduce instruction cache misses, which in turn …

[PDF][PDF] The impact of delay on the design of branch predictors

DA Jiménez, SW Keckler, C Lin - Proceedings of the 33rd annual ACM …, 2000 - dl.acm.org
Modern microprocessors employ increasingly complicated branch predictors to achieve
instruction fetch bandwidth that is sufficient for wide out-of-order execution cores. While …

A scalable front-end architecture for fast instruction delivery

G Reinman, T Austin, B Calder - ACM SIGARCH Computer Architecture …, 1999 - dl.acm.org
In the pursuit of instruction-level parallelism, significant demands are placed on a
processor's instruction delivery mechanism. Delivering the performance necessary to meet …