Jiayi Zhou
Ph.D. student, Peking University
Verified email at stu.pku.edu.cn
Title, Cited by, Year
AI alignment: A comprehensive survey
J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang, Y Duan, Z He, J Zhou, ...
arXiv preprint arXiv:2310.19852, 2023
Cited by: 112; Year: 2023
Safety Gymnasium: A unified safe reinforcement learning benchmark
J Ji, B Zhang, J Zhou, X Pan, W Huang, R Sun, Y Geng, Y Zhong, J Dai, ...
Advances in Neural Information Processing Systems 36, 2023
Cited by: 43*; Year: 2023
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research
J Ji, J Zhou, B Zhang, J Dai, X Pan, R Sun, W Huang, Y Geng, M Liu, ...
arXiv preprint arXiv:2305.09304, 2023
Cited by: 24; Year: 2023
Rethinking information structures in RLHF: Reward generalization from a graph theory perspective
T Qiu, F Zeng, J Ji, D Yan, K Wang, J Zhou, H Yang, J Dai, X Pan, Y Yang
arXiv preprint arXiv:2402.10184, 2024
Cited by: 4; Year: 2024
Language Models Resist Alignment
J Ji, K Wang, T Qiu, B Chen, J Zhou, C Li, H Lou, Y Yang
arXiv preprint arXiv:2406.06144, 2024
Year: 2024