ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning SC Kao, G Jeong, T Krishna In Proc. of the 53rd Annual IEEE/ACM International Symposium on …, 2020 | 101 | 2020 |
TurboFlux: A fast continuous subgraph matching system for streaming graph data K Kim, I Seo, WS Han, JH Lee, S Hong, H Chafi, H Shin, G Jeong In Proc. of the 44th International Conference on Management of Data (SIGMOD …, 2018 | 62 | 2018 |
Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication GE Moon, H Kwon, G Jeong, P Chatarasi, S Rajamanickam, T Krishna IEEE Transactions on Parallel and Distributed Systems (TPDS), 2021 | 21 | 2021 |
RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU G Jeong, E Qin, A Samajdar, CJ Hughes, S Subramoney, H Kim, ... In Proc. of the 58th Annual Design Automation Conference (DAC), 2021 | 16 | 2021 |
Extending Sparse Tensor Accelerators to Support Multiple Compression Formats E Qin, G Jeong, W Won, SC Kao, H Kwon, S Srinivasan, D Das, GE Moon, ... In Proc. of the 35th IEEE International Parallel & Distributed Processing …, 2021 | 16 | 2021 |
Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operations on Spatial Accelerators G Jeong, G Kestor, P Chatarasi, A Parashar, PA Tsai, S Rajamanickam, ... In Proc. of the 30th International Conference on Parallel Architectures and …, 2021 | 15 | 2021 |
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM H Kang, Q Zhang, S Kundu, G Jeong, Z Liu, T Krishna, T Zhao arXiv preprint arXiv:2403.05527, 2024 | 13 | 2024 |
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs G Jeong, S Damani, AR Bambhaniya, E Qin, CJ Hughes, S Subramoney, ... In Proc. of the 29th IEEE International Symposium on High-Performance …, 2023 | 12 | 2023 |
Characterization of Data Compression in Datacenters G Jeong, B Sharma, N Terrell, A Dhanotia, Z Zhao, N Agarwal, ... In Proc. of the 24th IEEE International Symposium on Performance Analysis of …, 2023 | 3 | 2023 |
Algorithm-Hardware Co-Design of Distribution-Aware Logarithmic-Posit Encodings for Efficient DNN Inference A Ramachandran, Z Wan, G Jeong, J Gustafson, T Krishna In Proc. of the 61st Annual Design Automation Conference (DAC), 2024 | 2 | 2024 |
Demystifying Platform Requirements for Diverse LLM Inference Use Cases A Bambhaniya, R Raj, G Jeong, S Kundu, S Srinivasan, M Elavazhagan, ... arXiv preprint arXiv:2406.01698, 2024 | 1 | 2024 |
Understanding Data Compression in Warehouse-Scale Datacenter Services G Jeong, B Sharma, N Terrell, A Dhanotia, Z Zhao, N Agarwal, ... In Proc. of the 23rd IEEE International Symposium on Performance Analysis of …, 2022 | 1 | 2022 |
Bridging the Frequency Gap in Heterogeneous 3D SoCs through Technology-Specific NoC Router Architectures JM Joseph, L Bamberg, G Jeong, RT Chien, R Leupers, A Garía-Ortiz, ... In Proc. of the 26th Asia and South Pacific Design Automation Conference …, 2021 | 1 | 2021 |
SDQ: Sparse Decomposed Quantization for LLM Inference G Jeong, PA Tsai, SW Keckler, T Krishna arXiv preprint arXiv:2406.13868, 2024 | | 2024 |
Generating sparse neural networks G Jeong, PA Tsai, JM Pool US Patent US20240152407A1, 2024 | | 2024 |
Abstracting Sparse DNN Acceleration via Structured Sparse Tensor Decomposition G Jeong, PA Tsai, AR Bambhaniya, SW Keckler, T Krishna arXiv preprint arXiv:2403.07953, 2024 | | 2024 |