Mean Field Multi-Agent Reinforcement Learning Y Yang, R Luo, M Li, M Zhou, W Zhang, J Wang ICML 2018, Oral presentation, 2018 | 742 | 2018 |
Magent: A many-agent reinforcement learning platform for artificial collective intelligence L Zheng, J Yang, H Cai, M Zhou, W Zhang, J Wang, Y Yu Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018 | 230 | 2018 |
SMARTS: Scalable Multi-Agent Reinforcement Learning Training School for Autonomous Driving M Zhou, J Luo, J Villela, Y Yang, D Rusu, J Miao, W Zhang, M Alban, ... Conference on Robot Learning 2020 (Best System Paper Award), 2020 | 183* | 2020 |
CoRide: Joint Order Dispatching and Fleet Management for Multi-Scale Ride-Hailing Platforms J Jin, M Zhou, W Zhang, M Li, Z Guo, Z Qin, Y Jiao, X Tang, C Wang, ... Proceedings of the 28th ACM international conference on information and …, 2019 | 109 | 2019 |
Multi-agent reinforcement learning for order-dispatching via order-vehicle distribution matching M Zhou, J Jin, W Zhang, Z Qin, Y Jiao, C Wang, G Wu, Y Yu, J Ye Proceedings of the 28th ACM International Conference on Information and …, 2019 | 101 | 2019 |
Factorized Q-Learning for Large-Scale Multi-Agent Systems M Zhou, Y Chen, Y Wen, Y Yang, Y Su, W Zhang, D Zhang, J Wang International Conference on Distributed Artificial Intelligence 2019, Oral …, 2018 | 71 | 2018 |
Malib: A parallel framework for population-based multi-agent reinforcement learning M Zhou, Z Wan, H Wang, M Wen, R Wu, Y Wen, Y Yang, Y Yu, J Wang, ... Journal of Machine Learning Research 24 (150), 1-12, 2023 | 40 | 2023 |
Model-based Multi-agent Policy Optimization with Adaptive Opponent-wise Rollouts W Zhang, X Wang, J Shen, M Zhou IJCAI 2021, 2021 | 29 | 2021 |
Multi-Agent Interactions Modeling with Correlated Policies M Liu, M Zhou, W Zhang, Y Zhuang, J Wang, W Liu, Y Yu The International Conference on Learning Representations 2020, 2020 | 22 | 2020 |
Efficient Policy Space Response Oracles M Zhou, J Chen, Y Wen, W Zhang, Y Yang, Y Yu, J Wang arXiv preprint arXiv:2202.00633, 2022 | 10 | 2022 |
Generative adversarial exploration for reinforcement learning W Hong, M Zhu, M Liu, W Zhang, M Zhou, Y Yu, P Sun Proceedings of the First International Conference on Distributed Artificial …, 2019 | 7 | 2019 |
On realization of intelligent decision-making in the real world: A foundation decision model perspective Y Wen, Z Wan, M Zhou, S Hou, Z Cao, C Le, J Chen, Z Tian, W Zhang, ... arXiv preprint arXiv:2212.12669, 2022 | 4 | 2022 |
Promoting quality and diversity in population-based reinforcement learning via hierarchical trajectory space exploration J Miao, T Zhou, K Shao, M Zhou, W Zhang, J Hao, Y Yu, J Wang 2022 International Conference on Robotics and Automation (ICRA), 7544-7550, 2022 | 4 | 2022 |
Signal Instructed Coordination in Team Competition L Chen, H Guo, H Zhang, F Fang, Y Zhu, M Zhou, W Zhang, Q Wang, Y Yu International Conference on Distributed Artificial Intelligence, 185-205, 2019 | 3 | 2019 |
Building open-ended embodied agent via language-policy bidirectional adaptation S Zhai, J Wang, T Zhang, F Huang, Q Zhang, M Zhou, J Hou, Y Liu arXiv preprint arXiv:2401.00006, 2023 | 1 | 2023 |