State2explanation: Concept-based explanations to benefit agent learning and user understanding

D Das, S Chernova, B Kim - Advances in Neural …, 2023 - proceedings.neurips.cc
As more non-AI experts use complex AI systems for daily tasks, there has been an
increasing effort to develop methods that produce explanations of AI decision making that …

Logic-based explainability in machine learning

J Marques-Silva - … Knowledge: 18th International Summer School 2022 …, 2023 - Springer
The last decade witnessed an ever-increasing stream of successes in Machine Learning
(ML). These successes offer clear evidence that ML is bound to become pervasive in a wide …

Autonomous capability assessment of sequential decision-making systems in stochastic settings

P Verma, R Karia, S Srivastava - Advances in Neural …, 2023 - proceedings.neurips.cc
It is essential for users to understand what their AI systems can and can't do in order to use
them safely. However, the problem of enabling users to assess AI systems with sequential …

Symbols as a lingua franca for bridging human-ai chasm for explainable and advisable ai systems

S Kambhampati, S Sreedharan, M Verma… - Proceedings of the …, 2022 - ojs.aaai.org
Despite the surprising power of many modern AI systems that often learn their own
representations, there is significant discontent about their inscrutability and the attendant …

Validating metrics for reward alignment in human-autonomy teaming

L Sanneman, JA Shah - Computers in Human Behavior, 2023 - Elsevier
Alignment of human and autonomous agent values and objectives is vital in human-
autonomy teaming settings which require collaborative action toward a common goal. In …

Theory of Mind abilities of Large Language Models in Human-Robot Interaction: An Illusion?

M Verma, S Bhambri, S Kambhampati - Companion of the 2024 ACM …, 2024 - dl.acm.org
Large Language Models (LLMs) have shown exceptional generative abilities in various
natural language and generation tasks. However, possible anthropomorphization and …

Explainable reinforcement learning (XRL): a systematic literature review and taxonomy

Y Bekkemoen - Machine Learning, 2024 - Springer
In recent years, reinforcement learning (RL) systems have shown impressive performance
and remarkable achievements. Many achievements can be attributed to combining RL with …

Goal Alignment: Re-analyzing Value Alignment Problems Using Human-Aware AI

M Mechergui, S Sreedharan - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
While the question of misspecified objectives has gotten much attention in recent years,
most works in this area primarily focus on the challenges related to the complexity of the …

Explainable multi-agent reinforcement learning for temporal queries

K Boggess, S Kraus, L Feng - arXiv preprint arXiv:2305.10378, 2023 - arxiv.org
As multi-agent reinforcement learning (MARL) systems are increasingly deployed
throughout society, it is imperative yet challenging for users to understand the emergent …

Relative behavioral attributes: Filling the gap between symbolic goal specification and reward learning from human preferences

L Guan, K Valmeekam, S Kambhampati - arXiv preprint arXiv:2210.15906, 2022 - arxiv.org
Generating complex behaviors that satisfy the preferences of non-expert users is a crucial
requirement for AI agents. Interactive reward learning from trajectory comparisons (aka …