Coleman Hooper
Graduate Student (UC Berkeley)
Verified email at berkeley.edu
Title
Cited by
Year
EdgeBERT: Sentence-level energy optimizations for latency-aware multi-task NLP inference
T Tambe, C Hooper, L Pentecost, T Jia, EY Yang, M Donato, V Sanh, ...
MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture …, 2021
Cited by: 91; Year: 2021
SqueezeLLM: Dense-and-sparse quantization
S Kim, C Hooper, A Gholami, Z Dong, X Li, S Shen, MW Mahoney, ...
arXiv preprint arXiv:2306.07629, 2023
Cited by: 84; Year: 2023
Full stack optimization of transformer inference: a survey
S Kim, C Hooper, T Wattanawong, M Kang, R Yan, H Genc, G Dinh, ...
arXiv preprint arXiv:2302.14017, 2023
Cited by: 51; Year: 2023
S-LoRA: Serving thousands of concurrent LoRA adapters
Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ...
arXiv preprint arXiv:2311.03285, 2023
Cited by: 38; Year: 2023
9.8 A 25mm² SoC for IoT Devices with 18ms Noise-Robust Speech-to-Text Latency via Bayesian Speech Denoising and Attention-Based Sequence-to-Sequence …
T Tambe, EY Yang, GG Ko, Y Chai, C Hooper, M Donato, PN Whatmough, ...
2021 IEEE International Solid-State Circuits Conference (ISSCC) 64, 158-160, 2021
Cited by: 32; Year: 2021
KVQuant: Towards 10 million context length LLM inference with KV cache quantization
C Hooper, S Kim, H Mohammadzadeh, MW Mahoney, YS Shao, ...
arXiv preprint arXiv:2401.18079, 2024
Cited by: 17; Year: 2024
AI and memory wall
A Gholami, Z Yao, S Kim, C Hooper, MW Mahoney, K Keutzer
IEEE Micro, 2024
Cited by: 15; Year: 2024
SPEED: Speculative pipelined execution for efficient decoding
C Hooper, S Kim, H Mohammadzadeh, H Genc, K Keutzer, A Gholami, ...
arXiv preprint arXiv:2310.12072, 2023
Cited by: 11; Year: 2023
A 16-nm SoC for noise-robust speech and NLP edge AI inference with Bayesian sound source separation and attention-based DNNs
T Tambe, EY Yang, GG Ko, Y Chai, C Hooper, M Donato, PN Whatmough, ...
IEEE Journal of Solid-State Circuits 58 (2), 569-581, 2022
Cited by: 10; Year: 2022
22.9 A 12nm 18.1 TFLOPs/W sparse transformer processor with entropy-based early exit, mixed-precision predication and fine-grained power management
T Tambe, J Zhang, C Hooper, T Jia, PN Whatmough, J Zuckerman, ...
2023 IEEE International Solid-State Circuits Conference (ISSCC), 342-344, 2023
Cited by: 7; Year: 2023
KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
C Hooper, S Kim, H Mohammadzadeh, MW Mahoney, YS Shao, K Keutzer, A Gholami
arXiv preprint arXiv:2401.18079, 2024
Cited by: 6; Year: 2024
Full Stack Optimization of Transformer Inference: a Survey
S Kim, C Hooper, T Wattanawong, M Kang, R Yan, H Genc, G Dinh, ...
arXiv preprint arXiv:2302.14017, 2023
Cited by: 5; Year: 2023
9.8 A 25mm² SoC for IoT Devices with 18ms Noise-Robust Speech-to-Text Latency via Bayesian Speech Denoising and Attention-Based Sequence-to-Sequence DNN Speech Recognition in …
T Tambe, EY Yang, GG Ko, Y Chai, C Hooper, M Donato, PN Whatmough, ...
IEEE, 2021
Cited by: 5; Year: 2021
Property-aware multi-speaker data simulation: A probabilistic modelling technique for synthetic data generation
TJ Park, H Huang, C Hooper, N Koluguri, K Dhawan, A Jukic, J Balam, ...
arXiv preprint arXiv:2310.12371, 2023
Cited by: 2; Year: 2023
SM6: A 16nm System-on-Chip for Accurate and Noise-Robust Attention-Based NLP Applications: The 33rd Hot Chips Symposium – August 22-24, 2021
T Tambe, EY Yang, GG Ko, Y Chai, C Hooper, M Donato, PN Whatmough, ...
2021 IEEE Hot Chips 33 Symposium (HCS), 1-13, 2021
Cited by: 2; Year: 2021
Quantifying and maximizing the benefits of back-end noise adaption on attention-based speech recognition models
C Hooper, T Tambe, GY Wei
arXiv preprint arXiv:2105.01134, 2021
Cited by: 1; Year: 2021
SLoRA: Scalable Serving of Thousands of LoRA Adapters
Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ...
Proceedings of Machine Learning and Systems 6, 296-311, 2024
Year: 2024
Learned Best-Effort LLM Serving
S Jha, C Hooper, X Liu, S Kim, K Keutzer
arXiv preprint arXiv:2401.07886, 2024
Year: 2024
Full Stack Optimization of Transformer Inference
S Kim, C Hooper, T Wattanawong, M Kang, R Yan, H Genc, G Dinh, ...
Architecture and System Support for Transformer Models (ASSYST @ ISCA 2023), 2023
Year: 2023
Combining observations and simulations into improved assessments of tidal resources
R Karsten, G Trowse, A Bharath, C Hooper, J Locke, M Guerra, ...
PAN AMERICAN MARINE ENERGY CONFERENCE PAMEC 2020, 6, 0