Simplepim: A software framework for productive and efficient processing-in-memory
Data movement between memory and processors is a major bottleneck in modern
computing systems. The processing-in-memory (PIM) paradigm aims to alleviate this …
computing systems. The processing-in-memory (PIM) paradigm aims to alleviate this …
The Landscape of Compute-near-memory and Compute-in-memory: A Research and Commercial Overview
In today's data-centric world, where data fuels numerous application domains, with machine
learning at the forefront, handling the enormous volume of data efficiently in terms of time …
learning at the forefront, handling the enormous volume of data efficiently in terms of time …
C4CAM: A Compiler for CAM-based In-memory Accelerators
Machine learning and data analytics applications increasingly suffer from the high latency
and energy consumption of conventional von Neumann architectures. Recently, several in …
and energy consumption of conventional von Neumann architectures. Recently, several in …
Special Session-Non-Volatile Memories: Challenges and Opportunities for Embedded System Architectures with Focus on Machine Learning Applications
This paper explores the challenges and opportunities of integrating non-volatile memories
(NVMs) into embedded systems for machine learning. NVMs offer advantages such as …
(NVMs) into embedded systems for machine learning. NVMs offer advantages such as …
SwiftRL: Towards Efficient Reinforcement Learning on Real Processing-In-Memory Systems
Reinforcement Learning (RL) trains agents to learn optimal behavior by maximizing reward
signals from experience datasets. However, RL training often faces memory limitations …
signals from experience datasets. However, RL training often faces memory limitations …
Analysis of Distributed Optimization Algorithms on a Real Processing-In-Memory System
S Rhyner, H Luo, J Gómez-Luna, M Sadrosadati… - arXiv preprint arXiv …, 2024 - arxiv.org
Machine Learning (ML) training on large-scale datasets is a very expensive and time-
consuming workload. Processor-centric architectures (eg, CPU, GPU) commonly used for …
consuming workload. Processor-centric architectures (eg, CPU, GPU) commonly used for …
SongC: A compiler for hybrid near-memory and in-memory many-core architecture
Building hybrid systems that incorporate various processing-in-memory (PIM) devices and
processing-near-memory (PNM) technologies can offer complementary advantages in both …
processing-near-memory (PNM) technologies can offer complementary advantages in both …
Smoothing Disruption Across the Stack: Tales of Memory, Heterogeneity, & Compilers
Multiple research vectors represent possible paths to improved energy and performance
metrics at the application-level. There are active efforts with respect to emerging logic …
metrics at the application-level. There are active efforts with respect to emerging logic …
Programming Model Extensions for General-Purpose Processing-In-Memory
The performance of many applications is limited by the available memory bandwidth. One
approach to improve the performance of such memory-bound applications is to move the …
approach to improve the performance of such memory-bound applications is to move the …
[PDF][PDF] Programming abstractions and optimizing compilers for energy-efficient computing
J Castrillon - 2023 - netzero.sysnet.ucsd.edu
Programming abstractions and optimizing compilers for energy-efficient computing Page 1
Programming abstractions and optimizing compilers for energy-efficient computing Jeronimo …
Programming abstractions and optimizing compilers for energy-efficient computing Jeronimo …