Nathan Lambert 个人学术档案

引用次数

	总计	2019 年至今
引用	1752	1745
h 指数	18	18
i10 指数	28	28

820

410

205

615

20182019202020212022202320245 16 43 120 204 539 817

开放获取的出版物数量

查看全部

2 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Roberto CalandraProfessor, TU Dresden, Centre for Tactile Internet with Human-in-the-Loop (CeTI)在 tu-dresden.de 的电子邮件经过验证
Kristofer PISTERUC Berkeley在 berkeley.edu 的电子邮件经过验证
Daniel S. DrewUniversity of Utah在 utah.edu 的电子邮件经过验证
Tom ZickHarvard在 berkeley.edu 的电子邮件经过验证
Thomas Krendl GilbertNew York Academy of Sciences在 nyas.org 的电子邮件经过验证
Brandon AmosMeta在 fb.com 的电子邮件经过验证
Sarah DeanCornell在 cornell.edu 的电子邮件经过验证
Luis PinedaResearch Engineer, Facebook AI Research在 fb.com 的电子邮件经过验证
Craig B. SchindlerUniversity of California, Berkeley在 berkeley.edu 的电子邮件经过验证
Lydia LeeSandia National Laboratories在 sandia.gov 的电子邮件经过验证

关注

Nathan Lambert

Research Scientist, Allen AI

在 allenai.org 的电子邮件经过验证 - 首页

Reinforcement Learning Machine Learning Robotics Responsible AI


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
[Github] Diffusers: State-of-the-art diffusion models P von Platen, S Patil, A Lozhkov, P Cuenca, N Lambert, K Rasul, ... https://github.com/huggingface/diffusers, 2022	256*	2022
Zephyr: Direct distillation of lm alignment L Tunstall, E Beeching, N Lambert, N Rajani, K Rasul, Y Belkada, ... arXiv preprint arXiv:2310.16944, 2023	200	2023
Low Level Control of a Quadrotor with Deep Model-Based Reinforcement Learning N Lambert, DS Drew, J Yaconelli, R Calandra, S Levine, KSJ Pister IEEE Robotics and Automation Letters 4 (4), 4224-4230, 2019	167	2019
Open LLM Leaderboard E Beeching, C Fourrier, N Habib, S Han, N Lambert, N Rajani, ... URL https://huggingface. co/spaces/HuggingFaceH4/open_llm_leaderboard, 2023	158	2023
On the importance of hyperparameter optimization for model-based reinforcement learning B Zhang, R Rajan, L Pineda, N Lambert, A Biedenkapp, K Chua, F Hutter, ... International Conference on Artificial Intelligence and Statistics, 4015-4023, 2021	106	2021
Objective Mismatch in Model-based Reinforcement Learning N Lambert, B Amos, O Yadan, R Calandra Learning for Dynamics and Control (L4DC), 2020	91	2020
[Github] Trl: Transformer reinforcement learning L von Werra, Y Belkada, L Tunstall, E Beeching, T Thrush, N Lambert https://github.com/lvwerra/trl, 2020	82*	2020
[Blog] Illustrating reinforcement learning from human feedback (RLHF) N Lambert, L Castricato, L von Werra, A Havrilla https://hf.co/blog/rlhf, 2022	77*	2022
Toward controlled flight of the ionocraft: a flying microrobot using electrohydrodynamic thrust with onboard sensing and no moving parts D Drew, N Lambert, C Schindler, K Pister IEEE Robotics and Automation Letters 3 (4), 2807-2813, 2018	76	2018
Camels in a changing climate: Enhancing lm adaptation with tulu 2 H Ivison, Y Wang, V Pyatkin, N Lambert, M Peters, P Dasigi, J Jang, ... arXiv preprint arXiv:2311.10702, 2023	64	2023
Mbrl-lib: A modular library for model-based reinforcement learning L Pineda, B Amos, A Zhang, NO Lambert, R Calandra arXiv preprint arXiv:2104.10159, 2021	49	2021
Learning generalizable locomotion skills with hierarchical reinforcement learning T Li, N Lambert, R Calandra, F Meier, A Rai IEEE International Conference on Robotics and Automation (ICRA), 413-419, 2020	47	2020
The challenges of exploration for offline reinforcement learning N Lambert, M Wulfmeier, W Whitney, A Byravan, M Bloesch, V Dasagi, ... arXiv preprint arXiv:2201.11861, 2022	40	2022
Reward reports for reinforcement learning TK Gilbert, N Lambert, S Dean, T Zick, A Snoswell, S Mehta Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, 84-130, 2023	31	2023
Olmo: Accelerating the science of language models D Groeneveld, I Beltagy, P Walsh, A Bhagia, R Kinney, O Tafjord, AH Jha, ... arXiv preprint arXiv:2402.00838, 2024	30	2024
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning N Lambert, A Wilcox, H Zhang, K Pister, R Calandra IEEE Conference on Decision and Control (CDC), 2880-2887, 2021	25	2021
Dolma: An Open Corpus of Three Trillion Tokens for Language Model Pretraining Research L Soldaini, R Kinney, A Bhagia, D Schwenk, D Atkinson, R Authur, ... arXiv preprint arXiv:2402.00159, 2024	23	2024
The alignment handbook L Tunstall, E Beeching, N Lambert, N Rajani, AM Rush, T Wolf GitHub repository, 2023	22	2023
[HuggingFace] H4 Stack Exchange Preference Dataset N Lambert, NR Lewis Tunstall, T Thrush https://huggingface.co/datasets/HuggingFaceH4/stack-exchange-preferences, 2023	18*	2023
Investigating compounding prediction errors in learned dynamics models N Lambert, K Pister, R Calandra arXiv preprint arXiv:2203.09637, 2022	18	2022

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用