RecNMP: Accelerating personalized recommendation with near-memory processing
Personalized recommendation systems leverage deep learning models and account for the
majority of data center AI cycles. Their performance is dominated by memory-bound sparse …
Analyzing and increasing the reliability of convolutional neural networks on GPUs
FF dos Santos, PF Pimenta, C Lunardi… - IEEE Transactions …, 2018 - ieeexplore.ieee.org
Graphics processing units (GPUs) are playing a critical role in convolutional neural networks
(CNNs) for image detection. As GPU-enabled CNNs move into safety-critical environments …
A survey of techniques for improving error-resilience of DRAM
S Mittal, MS Inukonda - Journal of Systems Architecture, 2018 - Elsevier
Aggressive process scaling and increasing demands of performance/cost efficiency have
exacerbated the incidences and impact of errors in DRAM systems. Due to this …
Experimental and analytical study of Xeon Phi reliability
We present an in-depth analysis of transient faults effects on HPC applications in Intel Xeon
Phi processors based on radiation experiments and high-level fault injection. Besides …
On the efficacy of ECC and the benefits of FinFET transistor layout for GPU reliability
C Lunardi, F Previlon, D Kaeli… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
Using error-correcting codes (ECCs) is considered one of the most effective ways to mask
the effects of radiation-induced faults in memory and computing devices. Unfortunately, with …
Driving into the memory wall: the role of memory for advanced driver assistance systems and autonomous driving
Autonomous driving is disrupting conventional automotive development. Underlying
reasons include control unit consolidation, the use of components originally developed for …
SEFI protection for nanosat 16-bit chip onboard computer memories
A Sánchez-Macián, P Reviriego… - … on Device and …, 2017 - ieeexplore.ieee.org
Plans to launch miniaturized satellite missions have been increasing in the last few years.
Space missions like these are exposed to radiation, which is a cause of errors in electronic …
Mitigation of 1-Row Hammer in BCAT Structures Through Buried Oxide Integration and Investigation of Inter-Cell Disturbances
YS Kim, MW Kwon - Electronics, 2024 - mdpi.com
Dynamic random-access memory (DRAM) is crucial for high-performance computing due to
its speed and storage capacity. As the demand for high-capacity memory increases, DRAM …
Memory Controller with Adaptive ECC for Reliable System Operation
Memory errors can cause crashes and data loss, which are unacceptable for various
computing systems, mainly large servers. Memory controllers can mitigate these errors by …
Improving DRAM Reliability Using a High Order Error Correction Code
W Li, M Zhang, T Gui, Z Fang, C Xie… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Dynamic random access memory (DRAM) is being upgraded iteratively, and as a result, its
transmission rate and bandwidth are rising quickly. Simultaneously, as the DRAM process …