[PDF][PDF] Sonicboom: The 3rd generation berkeley out-of-order machine
J Zhao, B Korpan, A Gonzalez… - Fourth Workshop on …, 2020 - people.eecs.berkeley.edu
We present SonicBOOM, the third generation of the Berkeley Outof-Order Machine (BOOM).
SonicBOOM is an open-source RTL implementation of a RISC-V superscalar out-of-order …
SonicBOOM is an open-source RTL implementation of a RISC-V superscalar out-of-order …
Sisa: Set-centric instruction set architecture for graph mining on processing-in-memory systems
Simple graph algorithms such as PageRank have been the target of numerous hardware
accelerators. Yet, there also exist much more complex graph mining algorithms for problems …
accelerators. Yet, there also exist much more complex graph mining algorithms for problems …
Analysis and optimization of the memory hierarchy for graph processing workloads
Graph processing is an important analysis technique for a wide range of big data
applications. The ability to explicitly represent relationships between entities gives graph …
applications. The ability to explicitly represent relationships between entities gives graph …
Towards a sustainable artificial intelligence: A case study of energy efficiency in decision tree algorithms
M Ferro, GD Silva, FB de Paula… - Concurrency and …, 2023 - Wiley Online Library
Artificial intelligence has been showing accelerated growth due to its use in solving
problems in several application domains. This success results from the convergence of large …
problems in several application domains. This success results from the convergence of large …
Designing low-power, low-latency networks-on-chip by optimally combining electrical and optical links
Optical on-chip communication is considered a promising candidate to overcome latency
and energy bottlenecks of electrical interconnects. Although recently proposed hybrid …
and energy bottlenecks of electrical interconnects. Although recently proposed hybrid …
A specialized architecture for object serialization with applications to big data analytics
Object serialization and deserialization (S/D) is an essential feature for efficient
communication between distributed computing nodes with potentially non-uniform execution …
communication between distributed computing nodes with potentially non-uniform execution …
Edge-connected jaccard similarity for graph link prediction on fpga
P Sathre, A Gondhalekar… - 2022 IEEE High …, 2022 - ieeexplore.ieee.org
Graph analysis is a critical task in many fields, such as social networking, epidemiology,
bioinformatics, and fraud de-tection. In particular, understanding and inferring relationships …
bioinformatics, and fraud de-tection. In particular, understanding and inferring relationships …
Manycore simulation for peta-scale system design: Motivation, tools, challenges and prospects
The architecture design of peta-scale computing systems is complex and presents lots of
difficulties to designs, as current tools lack support for relevant features of future scenarios …
difficulties to designs, as current tools lack support for relevant features of future scenarios …
FASTA: Revisiting Fully Associative Memories in Computer Microarchitecture
Associative access is widely used in fundamental microarchitectural components, such as
caches and TLBs. However, associative (or content addressable) memories (CAMs) have …
caches and TLBs. However, associative (or content addressable) memories (CAMs) have …
[图书][B] A highly productive implementation of an out-of-order processor generator
CP Celio - 2017 - search.proquest.com
General-purpose serial-thread performance gains have become more difficult for industry to
realize due to the slowing down of process improvements. In this new regime of poor …
realize due to the slowing down of process improvements. In this new regime of poor …