[PDF] SonicBOOM: The 3rd generation Berkeley out-of-order machine

J Zhao, B Korpan, A Gonzalez… - Fourth Workshop on …, 2020 - people.eecs.berkeley.edu
We present SonicBOOM, the third generation of the Berkeley Out-of-Order Machine (BOOM).
SonicBOOM is an open-source RTL implementation of a RISC-V superscalar out-of-order …

SISA: Set-centric instruction set architecture for graph mining on processing-in-memory systems

M Besta, R Kanakagiri, G Kwasniewski… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
Simple graph algorithms such as PageRank have been the target of numerous hardware
accelerators. Yet, there also exist much more complex graph mining algorithms for problems …

Analysis and optimization of the memory hierarchy for graph processing workloads

A Basak, S Li, X Hu, SM Oh, X Xie… - … Symposium on High …, 2019 - ieeexplore.ieee.org
Graph processing is an important analysis technique for a wide range of big data
applications. The ability to explicitly represent relationships between entities gives graph …

Towards a sustainable artificial intelligence: A case study of energy efficiency in decision tree algorithms

M Ferro, GD Silva, FB de Paula… - Concurrency and …, 2023 - Wiley Online Library
Artificial intelligence has been showing accelerated growth due to its use in solving
problems in several application domains. This success results from the convergence of large …

Designing low-power, low-latency networks-on-chip by optimally combining electrical and optical links

S Werner, J Navaridas, M Luján - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Optical on-chip communication is considered a promising candidate to overcome latency
and energy bottlenecks of electrical interconnects. Although recently proposed hybrid …

A specialized architecture for object serialization with applications to big data analytics

J Jang, SJ Jung, S Jeong, J Heo, H Shin… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
Object serialization and deserialization (S/D) is an essential feature for efficient
communication between distributed computing nodes with potentially non-uniform execution …

Edge-connected Jaccard similarity for graph link prediction on FPGA

P Sathre, A Gondhalekar… - 2022 IEEE High …, 2022 - ieeexplore.ieee.org
Graph analysis is a critical task in many fields, such as social networking, epidemiology,
bioinformatics, and fraud detection. In particular, understanding and inferring relationships …

Manycore simulation for peta-scale system design: Motivation, tools, challenges and prospects

J Zarrin, RL Aguiar, JP Barraca - Simulation Modelling Practice and Theory, 2017 - Elsevier
The architecture design of peta-scale computing systems is complex and presents lots of
difficulties to designers, as current tools lack support for relevant features of future scenarios …

FASTA: Revisiting Fully Associative Memories in Computer Microarchitecture

E Garzón, R Hanhan, M Lanuzza, A Teman… - IEEE Access, 2024 - ieeexplore.ieee.org
Associative access is widely used in fundamental microarchitectural components, such as
caches and TLBs. However, associative (or content addressable) memories (CAMs) have …

[BOOK] A highly productive implementation of an out-of-order processor generator

CP Celio - 2017 - search.proquest.com
General-purpose serial-thread performance gains have become more difficult for industry to
realize due to the slowing down of process improvements. In this new regime of poor …