查看文章

arxiv.org 中的 [PDF]

Learning Human Rewards by Inferring Their Latent Intelligence Levels in Multi-Agent Games: A Theory-of-Mind Approach with Application to Driving Data

作者

Ran Tian, Masayoshi Tomizuka, Liting Sun

发表日期

2021/6/30

期刊

2021 IEEE International Conference on Intelligent Robots and Systems (IROS 2021)

简介

Reward function, as an incentive representation that recognizes humans’ agency and rationalizes humans’ actions, is particularly appealing for modeling human behavior in human-robot interaction. Inverse Reinforcement Learning is an effective way to retrieve reward functions from demonstrations. However, it has always been challenging when applying it to multi-agent settings since the mutual influence between agents has to be appropriately modeled. To tackle this challenge, previous work either exploits equilibrium solution concepts by assuming humans as perfectly rational optimizers with unbounded intelligence or pre-assigns humans’ interaction strategies a priori. In this work, we advocate that humans are bounded rational and have different intelligence levels when reasoning about others’ decision-making process, and such an inherent and latent characteristic should be accounted for in reward learning …

引用总数

被引用次数：12

20212022202320242 2 7 1

学术搜索中的文章

Learning human rewards by inferring their latent intelligence levels in multi-agent games: A theory-of-mind approach with application to driving data

R Tian, M Tomizuka, L Sun - 2021 IEEE/RSJ International Conference on Intelligent …, 2021

被引用次数：12 相关文章所有 8 个版本