关注
Xiuyuan Lu
Xiuyuan Lu
Google DeepMind
在 google.com 的电子邮件经过验证
标题
引用次数
引用次数
年份
Ensemble sampling
X Lu, B Van Roy
Advances in Neural Information Processing Systems 31, 2017
1462017
Epistemic neural networks
I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ...
Advances in Neural Information Processing Systems 37, 2023
862023
Reinforcement learning, bit by bit
X Lu, B Van Roy, V Dwaracherla, M Ibrahimi, I Osband, Z Wen
Foundations and Trends® in Machine Learning 16 (6), 733-865, 2023
682023
Information-theoretic confidence bounds for reinforcement learning
X Lu, B Van Roy
Advances in Neural Information Processing Systems 33, 2019
572019
Hypermodels for exploration
V Dwaracherla, X Lu, M Ibrahimi, I Osband, Z Wen, B Van Roy
International Conference on Learning Representations, 2020
442020
Efficient online recommendation via low-rank ensemble sampling
X Lu, Z Wen, B Kveton
Proceedings of the 12th ACM Conference on Recommender Systems, 460-464, 2018
212018
An analysis of ensemble sampling
C Qin, Z Wen, X Lu, B Van Roy
Advances in Neural Information Processing Systems 36, 2022
182022
The neural testbed: Evaluating joint predictions
I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, M Ibrahimi, ...
Advances in Neural Information Processing Systems 36, 12554-12565, 2022
162022
Approximate Thompson Sampling via Epistemic Neural Networks
I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ...
Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence, 2023
132023
Ensembles for uncertainty estimation: Benefits of prior functions and bootstrapping
V Dwaracherla, Z Wen, I Osband, X Lu, SM Asghari, B Van Roy
Transactions on Machine Learning Research, 2023
122023
From predictions to decisions: The importance of joint predictive distributions
Z Wen, I Osband, C Qin, X Lu, M Ibrahimi, V Dwaracherla, M Asghari, ...
arXiv preprint arXiv:2107.09224, 2021
112021
Evaluating High-Order Predictive Distributions in Deep Learning
I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, B Van Roy
Proceedings of the 38th Conference on Uncertainty in Artificial Intelligence, 2022
72022
Information-directed sampling for reinforcement learning
X Lu
Stanford University, 2020
42020
Exploration using hyper-models
B Van Roy, X Lu, VR Dwaracherla, Z Wen, M Ibrahimi, IDM Osband
US Patent App. 17/639,504, 2022
12022
Robustness of epinets against distributional shifts
X Lu, I Osband, SM Asghari, S Gowal, V Dwaracherla, Z Wen, B Van Roy
arXiv preprint arXiv:2207.00137, 2022
12022
RLHF and IIA: Perverse Incentives
W Xu, S Dong, X Lu, G Lam, Z Wen, B Van Roy
arXiv e-prints, arXiv: 2312.01057, 2023
2023
系统目前无法执行此操作,请稍后再试。
文章 1–16