关注
Kshitij Gupta
Kshitij Gupta
在 umontreal.ca 的电子邮件经过验证
标题
引用次数
引用次数
年份
Broken Neural Scaling Laws
E Caballero, K Gupta, I Rish, D Krueger
arXiv preprint arXiv:2210.14891, 2022
502022
Illinimet: Illinois system for metaphor detection with contextual and linguistic information
H Gong, K Gupta, A Jain, S Bhat
Proceedings of the Second Workshop on Figurative Language Processing, 146-153, 2020
442020
ARB: Advanced Reasoning Benchmark for Large Language Models
T Sawada, D Paleka, A Havrilla, P Tadepalli, P Vidas, A Kranias, JJ Nay, ...
arXiv preprint arXiv:2307.13692, 2023
312023
Continual Pre-Training of Large Language Models: How to (re) warm your model?
K Gupta*, B Thérien*, A Ibrahim*, ML Richter, Q Anthony, E Belilovsky, ...
arXiv preprint arXiv:2308.04014, 2023
282023
Temporal latent bottleneck: Synthesis of fast and slow processing mechanisms in sequence learning
A Didolkar, K Gupta, A Goyal, NB Gundavarapu, AM Lamb, NR Ke, ...
Advances in Neural Information Processing Systems 35, 10505-10520, 2022
92022
Simple and Scalable Strategies to Continually Pre-train Large Language Models
A Ibrahim*, B Thérien*, K Gupta*, ML Richter, Q Anthony, T Lesort, ...
arXiv preprint arXiv:2403.08763, 2024
72024
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the US Executive Order
T Nakamura, M Mishra, S Tedeschi, Y Chai, JT Stillerman, F Friedrich, ...
arXiv preprint arXiv:2404.00399, 2024
22024
系统目前无法执行此操作,请稍后再试。
文章 1–7