SpZip: Architectural support for effective data compression in irregular applications

Y Yang, JS Emer, D Sanchez - 2021 ACM/IEEE 48th Annual …, 2021 - ieeexplore.ieee.org
Irregular applications, such as graph analytics and sparse linear algebra, exhibit frequent
indirect, data-dependent accesses to single or short sequences of elements that cause high …

Meshtaichi: A compiler for efficient mesh-based operations

C Yu, Y Xu, Y Kuang, Y Hu, T Liu - ACM Transactions on Graphics (TOG), 2022 - dl.acm.org
Meshes are an indispensable representation in many graphics applications because they
provide conformal spatial discretizations. However, mesh-based operations are often slow …

Graphitron: A domain specific language for fpga-based graph processing accelerator generation

X Zhang, Z Feng, S Liang, X Chen, C Liu, H Li… - arXiv preprint arXiv …, 2024 - arxiv.org
FPGA-based graph processing accelerators, enabling extensive customization, have
demonstrated significant energy efficiency over general computing engines like CPUs and …

Scalable, Programmable and Dense: The HammerBlade Open-Source RISC-V Manycore

DC Jung, M Ruttenberg, P Gao… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
Existing tiled manycore architectures propose to convert abundant silicon resources into
general-purpose parallel processors with unmatched computational density and …

Compiling for vector extensions with stream-based specialization

N Neves, JM Domingos, N Roma, P Tomas… - IEEE Micro, 2022 - ieeexplore.ieee.org
To overcome the current performance wall, data streaming and data-flow computing
paradigms have been gradually making their way into the general-purpose domain …

Beyond static parallel loops: Supporting dynamic task parallelism on manycore architectures with software-managed scratchpad memories

L Cheng, M Ruttenberg, DC Jung… - Proceedings of the 28th …, 2023 - dl.acm.org
Manycore architectures integrate hundreds of cores on a single chip by using simple cores
and simple memory systems usually based on software-managed scratchpad memories …

A tensor processing framework for CPU-manycore heterogeneous systems

L Cheng, P Pan, Z Zhao, K Ranjan… - … on Computer-Aided …, 2021 - ieeexplore.ieee.org
Future CPU-manycore heterogeneous systems can provide high peak throughput by
integrating thousands of simple, independent, energy-efficient cores in a single die …

[图书][B] A Complete Open Source Network Stack For BlackParrot

YM Chueh - 2022 - search.proquest.com
Dennard scaling has come to an end. General-purpose architecture now can hardly have
major improvements in power efficiency. Therefore, recently researchers have been actively …

Enabling Vector Load and Store Instructions on HammerBlade Architecture

R Ramstad - 2024 - search.proquest.com
Traditionally, computer architecture has been dominated by overly complex instruction sets
that created a” solution” to every problem by adding another instruction. If these complex …

The Performance Cost of Disintegrated Manycores: Which Applications Lose and Why?

IR Brkić - 2023 - search.proquest.com
Recent industry manycores have transitioned to disintegrated designs with multiple chips
within a package. Disintegration can make larger and higher performance systems …