Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning Y Chen, Y Yang, T Wu, S Wang, X Feng, J Jiang, SM McAleer, H Dong, ... NeurIPS 2022, 2022 | 72 | 2022 |
Vehicle trajectory prediction using intention-based conditional variational autoencoder X Feng, Z Cen, J Hu, Y Zhang 2019 IEEE Intelligent Transportation Systems Conference (ITSC), 3514-3519, 2019 | 53 | 2019 |
Towards effective context for meta-reinforcement learning: an approach based on contrastive learning H Fu, H Tang, J Hao, C Chen, X Feng, D Li, W Liu Proceedings of the AAAI Conference on Artificial Intelligence 35 (8), 7457-7465, 2021 | 49 | 2021 |
Neural Auto-Curricula X Feng*, O Slumbers*, Y Yang, Z Wan, B Liu, S McAleer, Y Wen, J Wang NeurIPS 2021, 2021 | 46* | 2021 |
MRI reconstruction with interpretable pixel-wise operations using reinforcement learning W Li*, X Feng*, H An, XY Ng, YJ Zhang Proceedings of the AAAI Conference on Artificial Intelligence 34 (01), 792-799, 2020 | 30 | 2020 |
AlphaZero-like tree-search can guide large language model decoding and training X Feng, Z Wan, M Wen, SM McAleer, Y Wen, W Zhang, J Wang Forty-first International Conference on Machine Learning, 2024 | 28 | 2024 |
Heterogeneous-agent mirror learning: A continuum of solutions to cooperative MARL JG Kuba, X Feng, S Ding, H Dong, J Wang, Y Yang JMLR, 2022 | 26* | 2022 |
CMML: Contextual modulation meta learning for cold-start recommendation X Feng, C Chen, D Li, M Zhao, J Hao, J Wang Proceedings of the 30th ACM International Conference on Information …, 2021 | 23 | 2021 |
ChessGPT: Bridging Policy Learning and Language Modeling X Feng, Y Luo, Z Wang, H Tang, M Yang, K Shao, D Mguni, Y Du, J Wang Advances in Neural Information Processing Systems 36, 2024 | 14 | 2024 |
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning X Feng*, B Liu*, J Ren, L Mai, R Zhu, J Wang, Y Yang NeurIPS 2022, 2021 | 12* | 2021 |
Contextual Transformer for Offline Meta Reinforcement Learning R Lin, Y Li, X Feng, Z Zhang, XHW Fung, H Zhang, J Wang, Y Du, Y Yang NeurIPS 2022 FMDM workshop, 2022 | 10 | 2022 |
Autonomous lane change decision making using different deep reinforcement learning methods X Feng, J Hu, Y Huo, Y Zhang CICTP 2019, 5563-5575, 2019 | 10 | 2019 |
Pangu-Agent: A fine-tunable generalist agent with structured reasoning F Christianos, G Papoudakis, M Zimmer, T Coste, Z Wu, J Chen, ... arXiv preprint arXiv:2312.14878, 2023 | 7 | 2023 |
TorchOpt: An efficient library for differentiable optimization J Ren*, X Feng*, B Liu*, X Pan*, Y Fu, L Mai, Y Yang JMLR Open Source Software, 2022 | 6 | 2022 |
MANSA: Learning fast and slow in multi-agent systems DH Mguni, H Chen, T Jafferjee, J Wang, L Yue, X Feng, SM McAleer, ... International Conference on Machine Learning, 24631-24658, 2023 | 3 | 2023 |
Natural Language Reinforcement Learning X Feng, Z Wan, M Yang, Z Wang, GA Koushiks, Y Du, Y Wen, J Wang arXiv preprint arXiv:2402.07157, 2024 | 1 | 2024 |
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models Z Hu, C Liu, X Feng, Y Zhao, SK Ng, AT Luu, J He, PW Koh, B Hooi arXiv preprint arXiv:2402.03271, 2024 | 1 | 2024 |