TensorDIMM: A practical near-memory processing architecture for embeddings and tensor operations in deep learning Y Kwon, Y Lee, M Rhu Proceedings of the 52nd Annual IEEE/ACM International Symposium on …, 2019 | 211 | 2019 |
Tensor Casting: Co-designing algorithm-architecture for personalized recommendation training Y Kwon, Y Lee, M Rhu 2021 IEEE International Symposium on High-Performance Computer Architecture …, 2021 | 40 | 2021 |
SmartSAGE: Training large-scale graph neural networks using in-storage processing architectures Y Lee, J Chung, M Rhu Proceedings of the 49th Annual International Symposium on Computer …, 2022 | 32 | 2022 |
Understanding the implication of non-volatile memory for large-scale graph neural network training Y Lee, Y Kwon, M Rhu IEEE Computer Architecture Letters 20 (2), 118-121, 2021 | 8 | 2021 |
Neural network acceleration system and operating method thereof M Rhu, Y Kwon, Y Lee US Patent App. 16/922,333, 2021 | 1 | 2021 |
PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models Y Lee, H Kim, M Rhu arXiv preprint arXiv:2406.14571, 2024 | | 2024 |
FPGA-Accelerated Data Preprocessing for Personalized Recommendation Systems H Kim, Y Lee, M Rhu IEEE Computer Architecture Letters, 2023 | | 2023 |