A survey of techniques for dynamic branch prediction
S Mittal - Concurrency and Computation: Practice and …, 2019 - Wiley Online Library
Branch predictor (BP) is an essential component in modern processors since high BP
accuracy can improve performance and reduce energy by decreasing the number of …
accuracy can improve performance and reduce energy by decreasing the number of …
A case for (partially) tagged geometric history length branch prediction
A Seznec, P Michaud - The Journal of Instruction-Level Parallelism, 2006 - inria.hal.science
It is now widely admitted that in order to provide state-of-the-art accuracy, a conditional
branch predictor must combine several predictions. Recent research has shown that an …
branch predictor must combine several predictions. Recent research has shown that an …
[PDF][PDF] BOOMv2: an open-source out-of-order RISC-V core
This paper presents BOOM version 2, an updated version of the Berkeley Out-of-Order
Machine first presented in [3]. The design exploration was performed through synthesis …
Machine first presented in [3]. The design exploration was performed through synthesis …
Increasing processor performance by implementing deeper pipelines
E Sprangle, D Carmean - ACM SIGARCH Computer Architecture News, 2002 - dl.acm.org
One architectural method for increasing processor performance involves increasing the
frequency by implementing deeper pipelines. This paper will explore the relationship …
frequency by implementing deeper pipelines. This paper will explore the relationship …
Virtual machine showdown: Stack versus registers
Virtual machines (VMs) enable the distribution of programs in an architecture-neutral format,
which can easily be interpreted or compiled. A long-running question in the design of VMs is …
which can easily be interpreted or compiled. A long-running question in the design of VMs is …
Design tradeoffs for the Alpha EV8 conditional branch predictor
This paper presents the Alpha EV8 conditional branch predictor The Alpha EV8
microprocessor project, canceled in June 2001 in a late phase of development, envisioned …
microprocessor project, canceled in June 2001 in a late phase of development, envisioned …
FabScalar: Composing synthesizable RTL designs of arbitrary cores within a canonical superscalar template
NK Choudhary, SV Wadhavkar, TA Shah… - ACM SIGARCH …, 2011 - dl.acm.org
A growing body of work has compiled a strong case for the single-ISA heterogeneous multi-
core paradigm. A single-ISA heterogeneous multi-core provides multiple, differently …
core paradigm. A single-ISA heterogeneous multi-core provides multiple, differently …
Fetch directed instruction prefetching
Instruction supply is a crucial component of processor performance. Instruction prefetching
has been proposed as a mechanism to help reduce instruction cache misses, which in turn …
has been proposed as a mechanism to help reduce instruction cache misses, which in turn …
[PDF][PDF] The impact of delay on the design of branch predictors
Modern microprocessors employ increasingly complicated branch predictors to achieve
instruction fetch bandwidth that is sufficient for wide out-of-order execution cores. While …
instruction fetch bandwidth that is sufficient for wide out-of-order execution cores. While …
A scalable front-end architecture for fast instruction delivery
In the pursuit of instruction-level parallelism, significant demands are placed on a
processor's instruction delivery mechanism. Delivering the performance necessary to meet …
processor's instruction delivery mechanism. Delivering the performance necessary to meet …