关注
Mickel Liu
Mickel Liu
其他姓名刘墨杨
在 stu.pku.edu.cn 的电子邮件经过验证
标题
引用次数
引用次数
年份
Baichuan 2: Open large-scale language models
A Yang, B Xiao, B Wang, B Zhang, C Bian, C Yin, C Lv, D Pan, D Wang, ...
arXiv preprint arXiv:2309.10305, 2023
254*2023
BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset
J Ji*, M Liu*, J Dai*, X Pan, C Zhang, C Bian, R Sun, Y Wang, Y Yang
arXiv preprint arXiv:2307.04657, 2023
1252023
Safe rlhf: Safe reinforcement learning from human feedback
J Dai, X Pan, R Sun, J Ji, X Xu, M Liu, Y Wang, Y Yang
arXiv preprint arXiv:2310.12773, 2023
802023
Omnisafe: An infrastructure for accelerating safe reinforcement learning research
J Ji, J Zhou, B Zhang, J Dai, X Pan, R Sun, W Huang, Y Geng, M Liu, ...
arXiv preprint arXiv:2305.09304, 2023
222023
Mate: Benchmarking multi-agent reinforcement learning in distributed target coverage control
X Pan, M Liu, F Zhong, Y Yang, SC Zhu, Y Wang
Advances in Neural Information Processing Systems 35, 27862-27879, 2022
202022
Proactive Multi-Camera Collaboration For 3D Human Pose Estimation
H Ci*, M Liu*, X Pan*, F Zhong, Y Wang
The 11th International Conference on Learning Representations (ICLR), 2023
82023
系统目前无法执行此操作,请稍后再试。
文章 1–6