Tiny but mighty: designing and realizing scalable latency tolerance for manycore SoCs

M Orenes-Vera, A Manocha, J Balkind, F Gao… - Proceedings of the 49th …, 2022 - dl.acm.org
Modern computing systems employ significant heterogeneity and specialization to meet
performance targets at manageable power. However, memory latency bottlenecks remain …

The marenostrum experimental exascale platform (meep)

A Fell, DJ Mazure, TC Garcia, B Perez… - Supercomputing …, 2021 - superfri.susu.ru
Abstract Nascent Open Source Instruction Set Architectures such as OpenPOWER or RISC-
V, allow software/hardware co-designers to fully utilize the underlying hardware, modify it or …

Post-fabrication microarchitecture

C Kumar, A Seshadri, A Chaudhary… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
Microarchitectural enhancements that improve performance generally, across many
workloads, are favored in superscalar processor design. Targeting general performance is …

Timely, Efficient, and Accurate Branch Precomputation

A Deshmukh, LC Cai, YN Patt - 2024 57th IEEE/ACM …, 2024 - ieeexplore.ieee.org
Out-of-order cores rely on high-accuracy branch predictors to supply useful instructions to
the processor backend. However, there remains a large fraction of mispredictions caused by …

Criticality driven fetch

A Deshmukh, YN Patt - MICRO-54: 54th Annual IEEE/ACM International …, 2021 - dl.acm.org
Modern OoO cores achieve high levels of performance using large instruction windows.
Scaling the window size improves performance by making visible more of the parallelism …

[PDF][PDF] Using Convolutional Neural Networks to Improve Branch Prediction

SZ Kamali - 2022 - hps.ece.utexas.edu
My advisor, Prof. Yale Patt has been instrumental in the completion of my Ph. D. program.
Thank you for inspiring my interest in computer architecture, for convincing me to pursue my …

By-software branch prediction in loops

M Goudarzi, R Azimi, J Humecki… - IEEE Computer …, 2023 - ieeexplore.ieee.org
Load-Dependent Branches (LDB) often do not exhibit regular patterns in their local or global
history and thus are inherently hard to predict correctly by conventional branch predictors …

Navigating Heterogeneity and Scalability in Modern Chip Design

M Orenes-Vera - 2024 - search.proquest.com
Computing systems have become ubiquitous in the modern world but their design is far from
one-size-fits-all. From battery-powered devices to supercomputers, deployment …

Using convolutional neural networks to improve branch prediction

S Zangeneh Kamali - 2022 - repositories.lib.utexas.edu
The state-of-the-art branch predictor, TAGE, remains inefficient at identifying correlated
branches deep in a noisy global branch history. This dissertation argues this inefficiency is a …

Maintaining high performance in the presence of impossible-to-predict branches

SM Pruett - 2022 - repositories.lib.utexas.edu
High performance microprocessors have relied on accurate branch predictors to maintain
high instruction supply for over 30 years. Unfortunately, as instruction windows and pipeline …