SpZip: Architectural support for effective data compression in irregular applications
Irregular applications, such as graph analytics and sparse linear algebra, exhibit frequent
indirect, data-dependent accesses to single or short sequences of elements that cause high …
Meshtaichi: A compiler for efficient mesh-based operations
Meshes are an indispensable representation in many graphics applications because they
provide conformal spatial discretizations. However, mesh-based operations are often slow …
Graphitron: A domain specific language for FPGA-based graph processing accelerator generation
FPGA-based graph processing accelerators, enabling extensive customization, have
demonstrated significant energy efficiency over general computing engines like CPUs and …
Scalable, Programmable and Dense: The HammerBlade Open-Source RISC-V Manycore
Existing tiled manycore architectures propose to convert abundant silicon resources into
general-purpose parallel processors with unmatched computational density and …
Compiling for vector extensions with stream-based specialization
To overcome the current performance wall, data streaming and data-flow computing
paradigms have been gradually making their way into the general-purpose domain …
Beyond static parallel loops: Supporting dynamic task parallelism on manycore architectures with software-managed scratchpad memories
L Cheng, M Ruttenberg, DC Jung… - Proceedings of the 28th …, 2023 - dl.acm.org
Manycore architectures integrate hundreds of cores on a single chip by using simple cores
and simple memory systems usually based on software-managed scratchpad memories …
A tensor processing framework for CPU-manycore heterogeneous systems
Future CPU-manycore heterogeneous systems can provide high peak throughput by
integrating thousands of simple, independent, energy-efficient cores in a single die …
[BOOK][B] A Complete Open Source Network Stack For BlackParrot
YM Chueh - 2022 - search.proquest.com
Dennard scaling has come to an end. General-purpose architecture now can hardly have
major improvements in power efficiency. Therefore, recently researchers have been actively …
Enabling Vector Load and Store Instructions on HammerBlade Architecture
R Ramstad - 2024 - search.proquest.com
Traditionally, computer architecture has been dominated by overly complex instruction sets
that created a "solution" to every problem by adding another instruction. If these complex …
The Performance Cost of Disintegrated Manycores: Which Applications Lose and Why?
IR Brkić - 2023 - search.proquest.com
Recent industry manycores have transitioned to disintegrated designs with multiple chips
within a package. Disintegration can make larger and higher performance systems …