RecNMP: Accelerating personalized recommendation with near-memory processing

L Ke, U Gupta, BY Cho, D Brooks… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
Personalized recommendation systems leverage deep learning models and account for the
majority of data center AI cycles. Their performance is dominated by memory-bound sparse …

Analyzing and increasing the reliability of convolutional neural networks on GPUs

FF dos Santos, PF Pimenta, C Lunardi… - IEEE Transactions …, 2018 - ieeexplore.ieee.org
Graphics processing units (GPUs) are playing a critical role in convolutional neural networks
(CNNs) for image detection. As GPU-enabled CNNs move into safety-critical environments …

A survey of techniques for improving error-resilience of DRAM

S Mittal, MS Inukonda - Journal of Systems Architecture, 2018 - Elsevier
Aggressive process scaling and increasing demands of performance/cost efficiency have
exacerbated the incidences and impact of errors in DRAM systems. Due to this …

Experimental and analytical study of Xeon Phi reliability

D Oliveira, L Pilla, N DeBardeleben… - Proceedings of the …, 2017 - dl.acm.org
We present an in-depth analysis of transient faults effects on HPC applications in Intel Xeon
Phi processors based on radiation experiments and high-level fault injection. Besides …

On the efficacy of ECC and the benefits of FinFET transistor layout for GPU reliability

C Lunardi, F Previlon, D Kaeli… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Using error-correcting codes (ECCs) is considered one of the most effective ways to mask
the effects of radiation-induced faults in memory and computing devices. Unfortunately, with …

Driving into the memory wall: the role of memory for advanced driver assistance systems and autonomous driving

M Jung, SA McKee, C Sudarshan… - Proceedings of the …, 2018 - dl.acm.org
Autonomous driving is disrupting conventional automotive development. Underlying
reasons include control unit consolidation, the use of components originally developed for …

SEFI protection for nanosat 16-bit chip onboard computer memories

A Sánchez-Macián, P Reviriego… - … on Device and …, 2017 - ieeexplore.ieee.org
Plans to launch miniaturized satellite missions have been increasing in the last few years.
Space missions like these are exposed to radiation, which is a cause of errors in electronic …

Mitigation of 1-Row Hammer in BCAT Structures Through Buried Oxide Integration and Investigation of Inter-Cell Disturbances

YS Kim, MW Kwon - Electronics, 2024 - mdpi.com
Dynamic random-access memory (DRAM) is crucial for high-performance computing due to
its speed and storage capacity. As the demand for high-capacity memory increases, DRAM …

Memory Controller with Adaptive ECC for Reliable System Operation

M Stefani, C Marcon, F Silva… - 2023 36th SBC/SBMicro …, 2023 - ieeexplore.ieee.org
Memory errors can cause crashes and data loss, which are unacceptable for various
computing systems, mainly large servers. Memory controllers can mitigate these errors by …

Improving DRAM Reliability Using a High Order Error Correction Code

W Li, M Zhang, T Gui, Z Fang, C Xie… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Dynamic random access memory (DRAM) is being upgraded iteratively, and as a result, its
transmission rate and bandwidth are rising quickly. Simultaneously, as the DRAM process …