Learning human rewards by inferring their latent intelligence levels in multi-agent games: A theory-of-mind approach with application to driving data
Reward function, as an incentive representation that recognizes humans' agency and
rationalizes humans' actions, is particularly appealing for modeling human behavior in …
rationalizes humans' actions, is particularly appealing for modeling human behavior in …
Expressing diverse human driving behavior with probabilistic rewards and online inference
In human-robot interaction (HRI) systems, such as autonomous vehicles, understanding and
representing human behavior are important. Human behavior is naturally rich and diverse …
representing human behavior are important. Human behavior is naturally rich and diverse …
Joint goal and strategy inference across heterogeneous demonstrators via reward network distillation
Reinforcement learning (RL) has achieved tremendous success as a general framework for
learning how to make decisions. However, this success relies on the interactive hand-tuning …
learning how to make decisions. However, this success relies on the interactive hand-tuning …
Analyzing the suitability of cost functions for explaining and imitating human driving behavior based on inverse reinforcement learning
Autonomous vehicles are sharing the road with human drivers. In order to facilitate
interactive driving and cooperative behavior in dense traffic, a thorough understanding and …
interactive driving and cooperative behavior in dense traffic, a thorough understanding and …
Inferring non-stationary human preferences for human-agent teams
One main challenge to robot decision making in human-robot teams involves predicting the
intents of a human team member through observations of the human's behavior. Inverse …
intents of a human team member through observations of the human's behavior. Inverse …
Learning task-relevant representations via rewards and real actions for reinforcement learning
L Yuan, X Lu, Y Liu - Knowledge-Based Systems, 2024 - Elsevier
The input of visual reinforcement learning often contains redundant information, which will
reduce the decision efficiency and decrease the performance of the agent. To address this …
reduce the decision efficiency and decrease the performance of the agent. To address this …
Multiagent inverse reinforcement learning via theory of mind reasoning
We approach the problem of understanding how people interact with each other in
collaborative settings, especially when individuals know little about their teammates, via …
collaborative settings, especially when individuals know little about their teammates, via …
Human-level reinforcement learning through theory-based modeling, exploration, and planning
PA Tsividis, J Loula, J Burga, N Foss… - arXiv preprint arXiv …, 2021 - arxiv.org
Reinforcement learning (RL) studies how an agent comes to achieve reward in an
environment through interactions over time. Recent advances in machine RL have …
environment through interactions over time. Recent advances in machine RL have …
Preferences implicit in the state of the world
R Shah, D Krasheninnikov, J Alexander… - arXiv preprint arXiv …, 2019 - arxiv.org
Reinforcement learning (RL) agents optimize only the features specified in a reward function
and are indifferent to anything left out inadvertently. This means that we must not only …
and are indifferent to anything left out inadvertently. This means that we must not only …