When humans aren’t optimal: Robots that collaborate with risk-aware humans. In 2020 15th ACM

DJ Hejna III, D Sadigh - Conference on Robot Learning, 2023 - proceedings.mlr.press

While reinforcement learning (RL) has become a more popular approach for robotics,
designing sufficiently informative reward functions for complex tasks has proven to be …

Cited by 76 Related articles All 6 versions

Inverse preference learning: Preference-based rl without a reward function

J Hejna, D Sadigh - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Reward functions are difficult to design and often hard to align with human intent. Preference-
based Reinforcement Learning (RL) algorithms address these problems by learning reward …

Cited by 31 Related articles All 9 versions

Learning zero-shot cooperation with humans, assuming humans are biased

C Yu, J Gao, W Liu, B Xu, H Tang, J Yang… - arXiv preprint arXiv …, 2023 - arxiv.org

There is a recent trend of applying multi-agent reinforcement learning (MARL) to train an
agent that can cooperate with humans in a zero-shot fashion without using any human data …

Cited by 27 Related articles All 7 versions

Rap: Risk-aware prediction for robust planning

H Nishimura, J Mercat, B Wulfe… - … on Robot Learning, 2023 - proceedings.mlr.press

Robust planning in interactive scenarios requires predicting the uncertain future to make risk-
aware decisions. Unfortunately, due to long-tail safety-critical events, the risk is often under …

Cited by 15 Related articles All 5 versions

A ranking game for imitation learning

H Sikchi, A Saran, W Goo, S Niekum - arXiv preprint arXiv:2202.03481, 2022 - arxiv.org

We propose a new framework for imitation learning--treating imitation as a two-player
ranking-based game between a policy and a reward. In this game, the reward agent learns …

Cited by 16 Related articles All 8 versions

Self-adapting simulated artificial societies

B Gower-Winter - 2023 - open.uct.ac.za

Abstract Agent-Based Models (ABM) are computational models that utilize autonomous
agents to interact and adapt to the environments in which they occupy. They are used in …

Cited by 1 Related articles All 3 versions

[BOOK][B] Design of Intuitive and Risk-Perception-Aware Robotic Navigation Algorithms

A Suresh - 2022 - search.proquest.com

As robots become more integrated into society, their reasoning and actions will invariably be
evaluated by human decision makers. Thus, robots need to perceive, act, and reason like …

Cited by 2 Related articles All 2 versions