Xiuyuan Lu Academic Profile - Academic Resource Search

Citations

	All	Since 2019
Citations	505	489
h-index	11	11
i10-index	11	11

Citations per year
Year	2018	2019	2020	2021	2022	2023	2024
Citations	14	14	37	55	109	154	120

Xiuyuan Lu

Google DeepMind

Verified email at google.com

Reinforcement learning


Title	Authors	Venue	Cited by	Year
Ensemble sampling	X Lu, B Van Roy	Advances in Neural Information Processing Systems 31, 2017	146	2017
Epistemic neural networks	I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ...	Advances in Neural Information Processing Systems 37, 2023	86	2023
Reinforcement learning, bit by bit	X Lu, B Van Roy, V Dwaracherla, M Ibrahimi, I Osband, Z Wen	Foundations and Trends® in Machine Learning 16 (6), 733-865, 2023	68	2023
Information-theoretic confidence bounds for reinforcement learning	X Lu, B Van Roy	Advances in Neural Information Processing Systems 33, 2019	57	2019
Hypermodels for exploration	V Dwaracherla, X Lu, M Ibrahimi, I Osband, Z Wen, B Van Roy	International Conference on Learning Representations, 2020	44	2020
Efficient online recommendation via low-rank ensemble sampling	X Lu, Z Wen, B Kveton	Proceedings of the 12th ACM Conference on Recommender Systems, 460-464, 2018	21	2018
An analysis of ensemble sampling	C Qin, Z Wen, X Lu, B Van Roy	Advances in Neural Information Processing Systems 36, 2022	18	2022
The neural testbed: Evaluating joint predictions	I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, M Ibrahimi, ...	Advances in Neural Information Processing Systems 36, 12554-12565, 2022	16	2022
Approximate Thompson Sampling via Epistemic Neural Networks	I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ...	Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence, 2023	13	2023
Ensembles for uncertainty estimation: Benefits of prior functions and bootstrapping	V Dwaracherla, Z Wen, I Osband, X Lu, SM Asghari, B Van Roy	Transactions on Machine Learning Research, 2023	12	2023
From predictions to decisions: The importance of joint predictive distributions	Z Wen, I Osband, C Qin, X Lu, M Ibrahimi, V Dwaracherla, M Asghari, ...	arXiv preprint arXiv:2107.09224, 2021	11	2021
Evaluating High-Order Predictive Distributions in Deep Learning	I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, B Van Roy	Proceedings of the 38th Conference on Uncertainty in Artificial Intelligence, 2022	7	2022
Information-directed sampling for reinforcement learning	X Lu	Stanford University, 2020	4	2020
Exploration using hyper-models	B Van Roy, X Lu, VR Dwaracherla, Z Wen, M Ibrahimi, IDM Osband	US Patent App. 17/639,504, 2022	1	2022
Robustness of epinets against distributional shifts	X Lu, I Osband, SM Asghari, S Gowal, V Dwaracherla, Z Wen, B Van Roy	arXiv preprint arXiv:2207.00137, 2022	1	2022
RLHF and IIA: Perverse Incentives	W Xu, S Dong, X Lu, G Lam, Z Wen, B Van Roy	arXiv e-prints, arXiv: 2312.01057, 2023		2023

Articles 1–16
