Chopper: A compiler infrastructure for programmable bit-serial simd processing using memory in dram

X Peng, Y Wang, MC Yang - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Increasing interests in Bit-serial SIMD Processing-Using-DRAM (PUD) architectures amplify
the needs for a compiler to automate code generation, credited to their ultra-wide SIMD …

StreamPIM: Streaming Matrix Computation in Racetrack Memory

Y An, Y Tang, S Yi, L Peng, X Pan… - … Symposium on High …, 2024 - ieeexplore.ieee.org
Racetrack memory (RM) techniques have become promising solutions to resolve the
memory wall issue as they increase memory density, reduce energy consumption and are …

Refresh fpgas: Sustainable fpga chiplet architectures

P Zhou, J Zhuang, S Cahoon, Y Tang, Z Yang… - Proceedings of the 14th …, 2023 - dl.acm.org
There is a growing call for greater amounts of increasingly agile computational power for
edge and cloud infrastructure to serve the computationally complex needs of ubiquitous …

Sustainable AI processing at the edge

S Ollivier, S Li, Y Tang, S Cahoon, R Caginalp… - IEEE Micro, 2022 - ieeexplore.ieee.org
Edge computing is a popular paradigm for accelerating light-to medium-weight machine
learning algorithms initiated from mobile devices without requiring the long communication …

Count2Multiply: Reliable In-memory High-Radix Counting

JPC de Lima, BF Morris III, AA Khan, J Castrillon… - arXiv preprint arXiv …, 2024 - arxiv.org
Big data processing has exposed the limits of compute-centric hardware acceleration due to
the memory-to-processor bandwidth bottleneck. Consequently, there has been a shift …

Towards Error Correction for Computing in Racetrack Memory

P Brazzle, BF Morris III, E McKinney, P Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
Computing-in-memory (CIM) promises to alleviate the Von Neumann bottleneck and
accelerate data-intensive applications. Depending on the underlying technology and …

A multi-domain magneto tunnel junction for racetrack nanowire strips

P Dutta, A Lee, KL Wang, AK Jones… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Domain-wall memory (DWM) has SRAM class access performance, low energy, high
endurance, high density, and CMOS compatibility. Recently, shift reliability and processing …

Pod-racing: bulk-bitwise to floating-point compute in racetrack memory for machine learning at the edge

S Ollivier, X Zhang, Y Tang, C Choudhuri, J Hu… - IEEE Micro, 2022 - ieeexplore.ieee.org
Convolutional neural networks (CNNs) have become a ubiquitous algorithm with growing
applications in mobile and edge settings. We describe a compute-in-memory (CIM) …

SPIMulator: A Spintronic Processing-In-Memory Simulator for Racetracks

P Bera, S Cahoon, S Bhanja, A Jones - ACM Transactions on Embedded …, 2024 - dl.acm.org
In-memory processing is becoming a popular method to alleviate the memory bottleneck of
the von Neumann computing model. With the goal of improving both latency and energy cost …

Domain wall motion at low current density in a synthetic antiferromagnet nanowire

CEA Barker, S Finizio, E Haltz, S Mayr… - Journal of Physics D …, 2023 - iopscience.iop.org
The current-driven motion of magnetic domain walls (DWs) is the working principle of
magnetic racetrack memories. In this type of spintronic technology, high current densities are …