Bridging the gap: Providing post-hoc symbolic explanations for sequential decision-making...

D Das, S Chernova, B Kim - Advances in Neural …, 2023 - proceedings.neurips.cc

As more non-AI experts use complex AI systems for daily tasks, there has been an
increasing effort to develop methods that produce explanations of AI decision making that …

被引用次数：19 相关文章所有 6 个版本

[PDF] arxiv.org

Logic-based explainability in machine learning

J Marques-Silva - … Knowledge: 18th International Summer School 2022 …, 2023 - Springer

The last decade witnessed an ever-increasing stream of successes in Machine Learning
(ML). These successes offer clear evidence that ML is bound to become pervasive in a wide …

被引用次数：45 相关文章所有 5 个版本

[PDF] neurips.cc

Autonomous capability assessment of sequential decision-making systems in stochastic settings

P Verma, R Karia, S Srivastava - Advances in Neural …, 2023 - proceedings.neurips.cc

It is essential for users to understand what their AI systems can and can't do in order to use
them safely. However, the problem of enabling users to assess AI systems with sequential …

被引用次数：9 相关文章所有 7 个版本

[PDF] aaai.org

Symbols as a lingua franca for bridging human-ai chasm for explainable and advisable ai systems

S Kambhampati, S Sreedharan, M Verma… - Proceedings of the …, 2022 - ojs.aaai.org

Despite the surprising power of many modern AI systems that often learn their own
representations, there is significant discontent about their inscrutability and the attendant …

被引用次数：47 相关文章所有 8 个版本

[PDF] sciencedirect.com

Validating metrics for reward alignment in human-autonomy teaming

L Sanneman, JA Shah - Computers in Human Behavior, 2023 - Elsevier

Alignment of human and autonomous agent values and objectives is vital in human-
autonomy teaming settings which require collaborative action toward a common goal. In …

被引用次数：10 相关文章所有 2 个版本

[PDF] arxiv.org

Theory of Mind abilities of Large Language Models in Human-Robot Interaction: An Illusion?

M Verma, S Bhambri, S Kambhampati - Companion of the 2024 ACM …, 2024 - dl.acm.org

Large Language Models (LLMs) have shown exceptional generative abilities in various
natural language and generation tasks. However, possible anthropomorphization and …

被引用次数：15 相关文章所有 4 个版本

[PDF] springer.com

Explainable reinforcement learning (XRL): a systematic literature review and taxonomy

Y Bekkemoen - Machine Learning, 2024 - Springer

In recent years, reinforcement learning (RL) systems have shown impressive performance
and remarkable achievements. Many achievements can be attributed to combining RL with …

被引用次数：4 相关文章所有 4 个版本

[PDF] aaai.org

Goal Alignment: Re-analyzing Value Alignment Problems Using Human-Aware AI

M Mechergui, S Sreedharan - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org

While the question of misspecified objectives has gotten much attention in recent years,
most works in this area primarily focus on the challenges related to the complexity of the …

被引用次数：6 相关文章所有 5 个版本

[PDF] arxiv.org

Explainable multi-agent reinforcement learning for temporal queries

K Boggess, S Kraus, L Feng - arXiv preprint arXiv:2305.10378, 2023 - arxiv.org

As multi-agent reinforcement learning (MARL) systems are increasingly deployed
throughout society, it is imperative yet challenging for users to understand the emergent …

被引用次数：6 相关文章所有 7 个版本

[PDF] arxiv.org

Relative behavioral attributes: Filling the gap between symbolic goal specification and reward learning from human preferences

L Guan, K Valmeekam, S Kambhampati - arXiv preprint arXiv:2210.15906, 2022 - arxiv.org

Generating complex behaviors that satisfy the preferences of non-expert users is a crucial
requirement for AI agents. Interactive reward learning from trajectory comparisons (aka …

被引用次数：6 相关文章所有 6 个版本