Maxmin Q-learning: Controlling the estimation bias of Q-learning Q Lan, Y Pan, A Fyshe, M White International Conference on Learning Representations (ICLR), 2020 | 189 | 2020 |
Variational quantum soft actor-critic Q Lan arXiv preprint arXiv:2112.11921, 2021 | 18 | 2021 |
A deep top-k relevance matching model for ad-hoc retrieval Z Yang, Q Lan, J Guo, Y Fan, X Zhu, Y Lan, Y Wang, X Cheng China Conference on Information Retrieval (CCIR), 2018 | 16 | 2018 |
Loss of plasticity in deep continual learning S Dohare, JF Hernandez-Garcia, Q Lan, P Rahman, AR Mahmood, ... Nature 632 (8026), 768-774, 2024 | 14 | 2024 |
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo H Ishfaq*, Q Lan*, P Xu, AR Mahmood, D Precup, A Anandkumar, ... International Conference on Learning Representations (ICLR), 2024 | 12 | 2024 |
Overcoming policy collapse in deep reinforcement learning S Dohare, Q Lan, AR Mahmood European Workshop on Reinforcement Learning (EWRL), 2023 | 10 | 2023 |
Memory-efficient reinforcement learning with value-based knowledge consolidation Q Lan, Y Pan, J Luo, AR Mahmood Transactions on Machine Learning Research (TMLR), 2023 | 9* | 2023 |
Model-free Policy Learning with Reward Gradients Q Lan, S Tosatto, H Farrahi, AR Mahmood International Conference on Artificial Intelligence and Statistics (AISTATS), 2022 | 9 | 2022 |
Reducing selection bias in counterfactual reasoning for individual treatment effects estimation Z Zhang, Q Lan, L Ding, Y Wang, N Hassanpour, R Greiner NeurIPS 2019 CausalML Workshop, 2019 | 8 | 2019 |
A PyTorch Reinforcement Learning Framework for Exploring New Ideas Q Lan https://github.com/qlan3/Explorer, 2019 | 7 | 2019 |
Learning to Optimize for Reinforcement Learning Q Lan, AR Mahmood, S Yan, Z Xu Reinforcement Learning Conference (RLC), 2024 | 6 | 2024 |
Elephant Neural Networks: Born to Be a Continual Learner Q Lan, AR Mahmood ICML Workshop on High-dimensional Learning Dynamics, 2023 | 2 | 2023 |
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling I Haque, Y Tan, Y Yang, Q Lan, J Lu, AR Mahmood, D Precup, P Xu Reinforcement Learning Conference (RLC), 2024 | | 2024 |
Weight Clipping for Deep Continual and Reinforcement Learning M Elsayed, Q Lan, C Lyle, AR Mahmood Reinforcement Learning Conference (RLC), 2024 | | 2024 |
More Efficient Randomized Exploration for Reinforcement Learning via Approximate Sampling H Ishfaq, Y Tan, Y Yang, Q Lan, J Lu, AR Mahmood, D Precup, P Xu Reinforcement Learning Conference (RLC), 2024 | | 2024 |
Predictive Representation Learning for Language Modeling Q Lan, L Kumar, M White, A Fyshe arXiv preprint arXiv:2105.14214, 2021 | | 2021 |
Gym Compatible Games for Reinforcement Learning Q Lan https://github.com/qlan3/gym-games, 2019 | | 2019 |