Geonhwa Jeong
Research Scientist, Meta
Verified email at meta.com
Title
Cited by
Year
ConfuciuX: Autonomous Hardware Resource Assignment for DNN Accelerators using Reinforcement Learning
SC Kao, G Jeong, T Krishna
In Proc. of the 53rd Annual IEEE/ACM International Symposium on …, 2020
Cited by 101, 2020
TurboFlux: A fast continuous subgraph matching system for streaming graph data
K Kim, I Seo, WS Han, JH Lee, S Hong, H Chafi, H Shin, G Jeong
In Proc. of the 44th International Conference on Management of Data (SIGMOD …, 2018
Cited by 62, 2018
Evaluating Spatial Accelerator Architectures with Tiled Matrix-Matrix Multiplication
GE Moon, H Kwon, G Jeong, P Chatarasi, S Rajamanickam, T Krishna
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2021
Cited by 21, 2021
RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU
G Jeong, E Qin, A Samajdar, CJ Hughes, S Subramoney, H Kim, ...
In Proc. of the 58th Annual Design Automation Conference (DAC), 2021
Cited by 16, 2021
Extending Sparse Tensor Accelerators to Support Multiple Compression Formats
E Qin, G Jeong, W Won, SC Kao, H Kwon, S Srinivasan, D Das, GE Moon, ...
In Proc. of the 35th IEEE International Parallel & Distributed Processing …, 2021
Cited by 16, 2021
Union: A Unified HW-SW Co-Design Ecosystem in MLIR for Evaluating Tensor Operations on Spatial Accelerators
G Jeong, G Kestor, P Chatarasi, A Parashar, PA Tsai, S Rajamanickam, ...
In Proc. of the 30th International Conference on Parallel Architectures and …, 2021
Cited by 15, 2021
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
H Kang, Q Zhang, S Kundu, G Jeong, Z Liu, T Krishna, T Zhao
arXiv preprint arXiv:2403.05527, 2024
Cited by 13, 2024
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
G Jeong, S Damani, AR Bambhaniya, E Qin, CJ Hughes, S Subramoney, ...
In Proc. of the 29th IEEE International Symposium on High-Performance …, 2023
Cited by 12, 2023
Characterization of Data Compression in Datacenters
G Jeong, B Sharma, N Terrell, A Dhanotia, Z Zhao, N Agarwal, ...
In Proc. of the 24th IEEE International Symposium on Performance Analysis of …, 2023
Cited by 3, 2023
Algorithm-Hardware Co-Design of Distribution-Aware Logarithmic-Posit Encodings for Efficient DNN Inference
A Ramachandran, Z Wan, G Jeong, J Gustafson, T Krishna
In Proc. of the 61st Annual Design Automation Conference (DAC), 2024
Cited by 2, 2024
Demystifying Platform Requirements for Diverse LLM Inference Use Cases
A Bambhaniya, R Raj, G Jeong, S Kundu, S Srinivasan, M Elavazhagan, ...
arXiv preprint arXiv:2406.01698, 2024
Cited by 1, 2024
Understanding Data Compression in Warehouse-Scale Datacenter Services
G Jeong, B Sharma, N Terrell, A Dhanotia, Z Zhao, N Agarwal, ...
In Proc. of the 23rd IEEE International Symposium on Performance Analysis of …, 2022
Cited by 1, 2022
Bridging the Frequency Gap in Heterogeneous 3D SoCs through Technology-Specific NoC Router Architectures
JM Joseph, L Bamberg, G Jeong, RT Chien, R Leupers, A García-Ortiz, ...
In Proc. of the 26th Asia and South Pacific Design Automation Conference …, 2021
Cited by 1, 2021
SDQ: Sparse Decomposed Quantization for LLM Inference
G Jeong, PA Tsai, SW Keckler, T Krishna
arXiv preprint arXiv:2406.13868, 2024
2024
Generating sparse neural networks
G Jeong, PA Tsai, JM Pool
US Patent US20240152407A1, 2024
2024
Abstracting Sparse DNN Acceleration via Structured Sparse Tensor Decomposition
G Jeong, PA Tsai, AR Bambhaniya, SW Keckler, T Krishna
arXiv preprint arXiv:2403.07953, 2024
2024