Samrat Phatale: Scholar profile

Cited by

	All	Since 2019
Citations	260	259
h-index	3	3
i10-index	1	1

Citations per year (chart): 2020: 1, 2021: 1, 2022: 1, 2023: 57, 2024: 198

Co-authors

Hassan Mansoor, Software Engineer, Google (verified email at google.com)
Abhinav Rastogi, Google DeepMind (verified email at google.com)
Thomas Mesnard, Research Scientist at Google DeepMind (verified email at google.com)
Victor Cărbune, Google (verified email at google.com)
Colton Bishop, Google DeepMind (verified email at google.com)
Harrison Lee, Google DeepMind (verified email at google.com)
Kellie Lu, Columbia University (verified email at columbia.edu)
Lucas Dixon, PAIR, Google Research (verified email at google.com)
Liangchen Luo, Google DeepMind (verified email at google.com)
Harsh Lara, Google DeepMind (verified email at google.com)
Sushant Prakash, Google (verified email at google.com)
Raghav Gupta, Google Research (verified email at google.com)
Simral Chaudhary, Alumni (verified email at alumni.cmu.edu)
Renat Aksitov, Google DeepMind (verified email at google.com)
Johan Ferret, Research Scientist, Google DeepMind (verified email at google.com)

Samrat Phatale

Google DeepMind

Verified email at google.com

Machine Intelligence, Natural Language Processing


Title	Cited by	Year
RLAIF: Scaling reinforcement learning from human feedback with AI feedback. H Lee, S Phatale, H Mansoor, K Lu, T Mesnard, C Bishop, V Carbune, ... arXiv preprint arXiv:2309.00267, 2023	247	2023
RLAIF: Scaling reinforcement learning from human feedback with AI feedback, 2024. H Lee, S Phatale, H Mansoor, T Mesnard, J Ferret, K Lu, C Bishop, E Hall, ... URL https://openreview.net/forum	5
Prose for a painting. P Kashyap, S Phatale, I Drori. arXiv preprint arXiv:1910.03634, 2019	3	2019
PERL: Parameter Efficient Reinforcement Learning from Human Feedback. H Sidahmed, S Phatale, A Hutcheson, Z Lin, Z Chen, Z Yu, J Jin, ... arXiv preprint arXiv:2403.10704, 2024	2	2024
Improve Mathematical Reasoning in Language Models by Automated Process Supervision. L Luo, Y Liu, R Liu, S Phatale, H Lara, Y Li, L Shu, Y Zhu, L Meng, J Sun, ... arXiv preprint arXiv:2406.06592, 2024	1	2024
SAFE: Software-defined authentication framework. AV Kamath, K Kataoka, N Vijayvergiya, GB Reddy, S Phatale. Proceedings of the 12th Asian Internet Engineering Conference, 57-63, 2016	1	2016
RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback. H Lee, S Phatale, H Mansoor, T Mesnard, J Ferret, KR Lu, C Bishop, ... Forty-first International Conference on Machine Learning	1
Conversational Recommendation as Retrieval: A Simple, Strong Baseline. R Gupta, R Aksitov, S Phatale, S Chaudhary, H Lee, A Rastogi. arXiv preprint arXiv:2305.13725, 2023		2023
