Simplepim: A software framework for productive and efficient processing-in-memory

J Chen, J Gómez-Luna, I El Hajj… - 2023 32nd …, 2023 - ieeexplore.ieee.org
Data movement between memory and processors is a major bottleneck in modern
computing systems. The processing-in-memory (PIM) paradigm aims to alleviate this …

The Landscape of Compute-near-memory and Compute-in-memory: A Research and Commercial Overview

AA Khan, JPC De Lima, H Farzaneh… - arXiv preprint arXiv …, 2024 - arxiv.org
In today's data-centric world, where data fuels numerous application domains, with machine
learning at the forefront, handling the enormous volume of data efficiently in terms of time …

C4CAM: A Compiler for CAM-based In-memory Accelerators

H Farzaneh, JPC De Lima, M Li, AA Khan… - Proceedings of the 29th …, 2024 - dl.acm.org
Machine learning and data analytics applications increasingly suffer from the high latency
and energy consumption of conventional von Neumann architectures. Recently, several in …

Special Session-Non-Volatile Memories: Challenges and Opportunities for Embedded System Architectures with Focus on Machine Learning Applications

J Henkel, L Siddhu, L Bauer, J Teich… - Proceedings of the …, 2023 - dl.acm.org
This paper explores the challenges and opportunities of integrating non-volatile memories
(NVMs) into embedded systems for machine learning. NVMs offer advantages such as …

SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems

K Gogineni, SS Dayapule, J Gómez-Luna… - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward
signals from experience datasets. However, RL training often faces memory limitations …

Analysis of Distributed Optimization Algorithms on a Real Processing-In-Memory System

S Rhyner, H Luo, J Gómez-Luna, M Sadrosadati… - arXiv preprint arXiv …, 2024 - arxiv.org
Machine Learning (ML) training on large-scale datasets is a very expensive and time-
consuming workload. Processor-centric architectures (eg, CPU, GPU) commonly used for …

SongC: A compiler for hybrid near-memory and in-memory many-core architecture

J Lin, H Qu, S Ma, X Ji, H Li, X Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Building hybrid systems that incorporate various processing-in-memory (PIM) devices and
processing-near-memory (PNM) technologies can offer complementary advantages in both …

Smoothing Disruption Across the Stack: Tales of Memory, Heterogeneity, & Compilers

M Niemier, Z Enciso, M Sharifi, XS Hu… - … , Automation & Test …, 2024 - ieeexplore.ieee.org
Multiple research vectors represent possible paths to improved energy and performance
metrics at the application-level. There are active efforts with respect to emerging logic …

Programming Model Extensions for General-Purpose Processing-In-Memory

H Hong, L Sommer, B Kim, M Kashkarov… - ISC High …, 2024 - ieeexplore.ieee.org
The performance of many applications is limited by the available memory bandwidth. One
approach to improve the performance of such memory-bound applications is to move the …

[PDF][PDF] Programming abstractions and optimizing compilers for energy-efficient computing

J Castrillon - 2023 - netzero.sysnet.ucsd.edu
Programming abstractions and optimizing compilers for energy-efficient computing Page 1
Programming abstractions and optimizing compilers for energy-efficient computing Jeronimo …