A deep reinforcement learning framework for rebalancing dockless bike sharing systems L Pan, Q Cai, Z Fang, P Tang, L Huang Thirty-Third AAAI Conference on Artificial Intelligence (AAAI), 2019 | 177 | 2019 |
Softmax deep double deterministic policy gradients L Pan, Q Cai, L Huang Thirty-Fourth Conference on Neural Information Processing Systems (NeurIPS), 2020 | 88 | 2020 |
Reinforcement learning with dynamic Boltzmann softmax updates L Pan, Q Cai, Q Meng, W Chen, L Huang, TY Liu Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI), 2019 | 47 | 2019 |
Plan better amid conservatism: Offline multi-agent reinforcement learning with actor rectification L Pan, L Huang, T Ma, H Xu Thirty-Ninth International Conference on Machine Learning (ICML), 2022 | 42 | 2022 |
Better training of GFlowNets with local credit and incomplete trajectories L Pan, N Malkin, D Zhang, Y Bengio Fortieth International Conference on Machine Learning (ICML), 2023 | 35 | 2023 |
Let the flows tell: Solving graph combinatorial optimization problems with GFlowNets D Zhang, H Dai, N Malkin, A Courville, Y Bengio, L Pan Thirty-Seventh Conference on Neural Information Processing Systems (NeurIPS), 2023 | 32* | 2023 |
Regularized softmax deep multi-agent Q-learning L Pan, T Rashid, B Peng, L Huang, S Whiteson Thirty-Fifth Conference on Neural Information Processing Systems (NeurIPS), 2021 | 32* | 2021 |
Generative augmented flow networks L Pan, D Zhang, A Courville, L Huang, Y Bengio Eleventh International Conference on Learning Representations (ICLR), 2023 | 29 | 2023 |
Stochastic generative flow networks L Pan, D Zhang, M Jain, L Huang, Y Bengio Thirty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI), 2023 | 19 | 2023 |
Distributional GFlowNets with quantile flows D Zhang, L Pan, RTQ Chen, A Courville, Y Bengio Transactions on Machine Learning Research (TMLR), 2024 | 16 | 2024 |
Network topology optimization via deep reinforcement learning Z Li, X Wang, L Pan, L Zhu, Z Wang, J Feng, C Deng, L Huang IEEE Transactions on Communications (TCOM), 2023 | 12 | 2023 |
Learning to scale logits for temperature-conditional GFlowNets M Kim, J Ko, D Zhang, L Pan, T Yun, W Kim, J Park, Y Bengio Forty-First International Conference on Machine Learning (ICML), 2024 | 10 | 2024 |
RLx2: Training a sparse deep reinforcement learning model from scratch Y Tan, P Hu, L Pan, J Huang, L Huang Eleventh International Conference on Learning Representations (ICLR), 2023 | 10 | 2023 |
Pre-training and fine-tuning generative flow networks L Pan, M Jain, K Madan, Y Bengio Twelfth International Conference on Learning Representations (ICLR), 2024 | 8 | 2024 |
Effective multi-user delay-constrained scheduling with deep recurrent reinforcement learning P Hu, L Pan, Y Chen, Z Fang, L Huang Twenty-Third International Symposium on Theory, Algorithmic Foundations, and …, 2022 | 6 | 2022 |
Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning H He, C Bai, L Pan, W Zhang, B Zhao, X Li arXiv preprint arXiv:2402.14407, 2024 | 5 | 2024 |
Beyond conservatism: Diffusion policies in offline multi-agent reinforcement learning Z Li, L Pan, L Huang arXiv preprint arXiv:2307.01472, 2023 | 5 | 2023 |
QGFN: Controllable greediness with action values E Lau, SZ Lu, L Pan, D Precup, E Bengio arXiv preprint arXiv:2402.05234, 2024 | 4 | 2024 |
Multi-path policy optimization L Pan, Q Cai, L Huang Nineteenth International Conference on Autonomous Agents and Multi-Agent …, 2020 | 4 | 2020 |
Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning P Hu, Y Chen, L Pan, Z Fang, F Xiao, L Huang IEEE/ACM Transactions on Networking (TON), 2024 | 3 | 2024 |