关注
Dingwen Kong
Dingwen Kong
在 mit.edu 的电子邮件经过验证
标题
引用次数
引用次数
年份
Online sub-sampling for reinforcement learning with general function approximation
D Kong, R Salakhutdinov, R Wang, LF Yang
arXiv preprint arXiv:2106.07203, 2021
322021
RLHF from Heterogeneous Feedback via Personalization and Preference Aggregation
C Park, M Liu, D Kong, K Zhang, AE Ozdaglar
ICML 2024 Workshop: Aligning Reinforcement Learning Experimentalists and …, 0
7*
Provably feedback-efficient reinforcement learning via active reward learning
D Kong, L Yang
Advances in Neural Information Processing Systems 35, 11063-11078, 2022
62022
Learning Rationalizable Equilibria in Multiplayer Games
Y Wang, D Kong, Y Bai, C Jin
arXiv preprint arXiv:2210.11402, 2022
12022
系统目前无法执行此操作,请稍后再试。
文章 1–4