关注
Ling Pan
标题
引用次数
引用次数
年份
A deep reinforcement learning framework for rebalancing dockless bike sharing systems
L Pan, Q Cai, Z Fang, P Tang, L Huang
Thirty-Third AAAI Conference on Artificial Intelligence Conference (AAAI), 2019
1772019
Softmax deep double deterministic policy gradients
L Pan, Q Cai, L Huang
Thirty-Fourth Conference on Neural Information Processing Systems (NeurIPS), 2020
882020
Reinforcement learning with dynamic boltzmann softmax updates
L Pan, Q Cai, Q Meng, W Chen, L Huang, TY Liu
Twenty-Ninth International Joint Conference on Artificial Intelligence (IJCAI), 2019
472019
Plan better amid conservatism: Offline multi-agent reinforcement learning with actor rectification
L Pan, L Huang, T Ma, H Xu
Thirty-Ninth International Conference on Machine Learning (ICML), 2022
422022
Better training of gflownets with local credit and incomplete trajectories
L Pan, N Malkin, D Zhang, Y Bengio
Fortieth International Conference on Machine Learning (ICML), 2023
352023
Let the flows tell: Solving graph combinatorial optimization problems with gflownets
D Zhang, H Dai, N Malkin, A Courville, Y Bengio, L Pan
Thirty-Seventh Conference on Neural Information Processing Systems (NeurIPS), 2023
32*2023
Regularized softmax deep multi-agent q-learning
L Pan, T Rashid, B Peng, L Huang, S Whiteson
Thirty-Fifth Conference on Neural Information Processing Systems (NeurIPS), 2021
32*2021
Generative augmented flow networks
L Pan, D Zhang, A Courville, L Huang, Y Bengio
Eleventh International Conference on Learning Representations (ICLR), 2023
292023
Stochastic generative flow networks
L Pan, D Zhang, M Jain, L Huang, Y Bengio
Thirty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI), 2023
192023
Distributional gflownets with quantile flows
D Zhang, L Pan, RTQ Chen, A Courville, Y Bengio
Transactions on Machine Learning Research (TMLR), 2024
162024
Network topology optimization via deep reinforcement learning
Z Li, X Wang, L Pan, L Zhu, Z Wang, J Feng, C Deng, L Huang
IEEE Transactions on Communications (TCOM), 2023
122023
Learning to scale logits for temperature-conditional GFlowNets
M Kim, J Ko, D Zhang, L Pan, T Yun, W Kim, J Park, Y Bengio
Forty-First International Conference on Machine Learning (ICML), 2023
102023
Rlx2: Training a sparse deep reinforcement learning model from scratch
Y Tan, P Hu, L Pan, J Huang, L Huang
Eleventh International Conference on Learning Representations (ICLR), 2023
102023
Pre-training and fine-tuning generative flow networks
L Pan, M Jain, K Madan, Y Bengio
Twelfth International Conference on Learning Representations (ICLR), 2024
82024
Effective multi-user delay-constrained scheduling with deep recurrent reinforcement learning
P Hu, L Pan, Y Chen, Z Fang, L Huang
Twenty-Third International Symposium on Theory, Algorithmic Foundations, and …, 2022
62022
Large-Scale Actionless Video Pre-Training via Discrete Diffusion for Efficient Policy Learning
H He, C Bai, L Pan, W Zhang, B Zhao, X Li
arXiv preprint arXiv:2402.14407, 2024
52024
Beyond conservatism: Diffusion policies in offline multi-agent reinforcement learning
Z Li, L Pan, L Huang
arXiv preprint arXiv:2307.01472, 2023
52023
Qgfn: Controllable greediness with action values
E Lau, SZ Lu, L Pan, D Precup, E Bengio
arXiv preprint arXiv:2402.05234, 2024
42024
Multi-path policy optimization
L Pan, Q Cai, L Huang
Nineteenth International Conference on Autonomous Agents and Multi-Agent …, 2020
42020
Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning
P Hu, Y Chen, L Pan, Z Fang, F Xiao, L Huang
IEEE/ACM Transactions on Networking (TON), 2024
32024
系统目前无法执行此操作,请稍后再试。
文章 1–20