From demonstrations to task-space specifications. Using causal analysis to extract rule parameter...

文章

学术资源搜索

获得 2 条结果（用时0.03秒）

我的图书馆

From demonstrations to task-space specifications. Using causal analysis to extract rule parameter...

在引用文章中搜索

[PDF] wiley.com Full View

Avoiding collaborative paradox in multi‐agent reinforcement learning

H Kim, S Kim, D Lee, I Jang - ETRI Journal, 2021 - Wiley Online Library

The collaboration productively interacting between multi‐agents has become an emerging
issue in real‐world applications. In reinforcement learning, multi‐agent environments …

被引用次数：8 相关文章所有 5 个版本

[PDF] openreview.net

Learning Invariant Reward Functions through Trajectory Interventions

I Ovinnikov, E Bykovets, JM Buhmann - openreview.net

Inverse reinforcement learning methods aim to retrieve the reward function of a Markov
decision process based on a dataset of expert demonstrations. The commonplace scarcity of …