Baichuan 2: Open large-scale language models A Yang, B Xiao, B Wang, B Zhang, C Bian, C Yin, C Lv, D Pan, D Wang, ... arXiv preprint arXiv:2309.10305, 2023 | 254* | 2023 |
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset J Ji*, M Liu*, J Dai*, X Pan, C Zhang, C Bian, R Sun, Y Wang, Y Yang arXiv preprint arXiv:2307.04657, 2023 | 125 | 2023 |
Safe rlhf: Safe reinforcement learning from human feedback J Dai, X Pan, R Sun, J Ji, X Xu, M Liu, Y Wang, Y Yang arXiv preprint arXiv:2310.12773, 2023 | 80 | 2023 |
Omnisafe: An infrastructure for accelerating safe reinforcement learning research J Ji, J Zhou, B Zhang, J Dai, X Pan, R Sun, W Huang, Y Geng, M Liu, ... arXiv preprint arXiv:2305.09304, 2023 | 22 | 2023 |
Mate: Benchmarking multi-agent reinforcement learning in distributed target coverage control X Pan, M Liu, F Zhong, Y Yang, SC Zhu, Y Wang Advances in Neural Information Processing Systems 35, 27862-27879, 2022 | 20 | 2022 |
Proactive Multi-Camera Collaboration For 3D Human Pose Estimation H Ci*, M Liu*, X Pan*, F Zhong, Y Wang The 11th International Conference on Learning Representations (ICLR), 2023 | 8 | 2023 |