Broken Neural Scaling Laws. E. Caballero, K. Gupta, I. Rish, D. Krueger. arXiv preprint arXiv:2210.14891, 2022. Cited by 50.
IlliniMet: Illinois System for Metaphor Detection with Contextual and Linguistic Information. H. Gong, K. Gupta, A. Jain, S. Bhat. Proceedings of the Second Workshop on Figurative Language Processing, pp. 146-153, 2020. Cited by 44.
ARB: Advanced Reasoning Benchmark for Large Language Models. T. Sawada, D. Paleka, A. Havrilla, P. Tadepalli, P. Vidas, A. Kranias, J. J. Nay, et al. arXiv preprint arXiv:2307.13692, 2023. Cited by 31.
Continual Pre-Training of Large Language Models: How to (re)warm your model? K. Gupta*, B. Thérien*, A. Ibrahim*, M. L. Richter, Q. Anthony, E. Belilovsky, et al. arXiv preprint arXiv:2308.04014, 2023. Cited by 28.
Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning. A. Didolkar, K. Gupta, A. Goyal, N. B. Gundavarapu, A. M. Lamb, N. R. Ke, et al. Advances in Neural Information Processing Systems 35, pp. 10505-10520, 2022. Cited by 9.
Simple and Scalable Strategies to Continually Pre-train Large Language Models. A. Ibrahim*, B. Thérien*, K. Gupta*, M. L. Richter, Q. Anthony, T. Lesort, et al. arXiv preprint arXiv:2403.08763, 2024. Cited by 7.
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the US Executive Order. T. Nakamura, M. Mishra, S. Tedeschi, Y. Chai, J. T. Stillerman, F. Friedrich, et al. arXiv preprint arXiv:2404.00399, 2024. Cited by 2.