Title | Authors | Venue | Cited by | Year |
Ensemble sampling | X Lu, B Van Roy | Advances in Neural Information Processing Systems 31, 2017 | 151 | 2017 |
Epistemic neural networks | I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ... | Advances in Neural Information Processing Systems 37, 2023 | 99 | 2023 |
Reinforcement learning, bit by bit | X Lu, B Van Roy, V Dwaracherla, M Ibrahimi, I Osband, Z Wen | Foundations and Trends® in Machine Learning 16 (6), 733-865, 2023 | 75 | 2023 |
Information-theoretic confidence bounds for reinforcement learning | X Lu, B Van Roy | Advances in Neural Information Processing Systems 33, 2019 | 59 | 2019 |
Hypermodels for exploration | V Dwaracherla, X Lu, M Ibrahimi, I Osband, Z Wen, B Van Roy | International Conference on Learning Representations, 2020 | 47 | 2020 |
An analysis of ensemble sampling | C Qin, Z Wen, X Lu, B Van Roy | Advances in Neural Information Processing Systems 36, 2022 | 22 | 2022 |
Efficient online recommendation via low-rank ensemble sampling | X Lu, Z Wen, B Kveton | Proceedings of the 12th ACM Conference on Recommender Systems, 460-464, 2018 | 22 | 2018 |
The neural testbed: Evaluating joint predictions | I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, M Ibrahimi, ... | Advances in Neural Information Processing Systems 36, 12554-12565, 2022 | 18 | 2022 |
Approximate Thompson Sampling via Epistemic Neural Networks | I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ... | Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence, 2023 | 15 | 2023 |
Ensembles for uncertainty estimation: Benefits of prior functions and bootstrapping | V Dwaracherla, Z Wen, I Osband, X Lu, SM Asghari, B Van Roy | Transactions on Machine Learning Research, 2023 | 13 | 2023 |
From predictions to decisions: The importance of joint predictive distributions | Z Wen, I Osband, C Qin, X Lu, M Ibrahimi, V Dwaracherla, M Asghari, ... | arXiv preprint arXiv:2107.09224, 2021 | 11 | 2021 |
Evaluating High-Order Predictive Distributions in Deep Learning | I Osband, Z Wen, SM Asghari, V Dwaracherla, X Lu, B Van Roy | Proceedings of the 38th Conference on Uncertainty in Artificial Intelligence, 2022 | 7 | 2022 |
Information-directed sampling for reinforcement learning | X Lu | Stanford University, 2020 | 4 | 2020 |
Exploration using hyper-models | B Van Roy, X Lu, VR Dwaracherla, Z Wen, M Ibrahimi, IDM Osband | US Patent App. 17/639,504, 2022 | 1 | 2022 |
Robustness of epinets against distributional shifts | X Lu, I Osband, SM Asghari, S Gowal, V Dwaracherla, Z Wen, B Van Roy | arXiv preprint arXiv:2207.00137, 2022 | 1 | 2022 |
RLHF and IIA: Perverse Incentives | W Xu, S Dong, X Lu, G Lam, Z Wen, B Van Roy | ICML 2024 Workshop on Models of Human Feedback for AI Alignment, 2023 | | 2023 |