Explainable artificial intelligence for autonomous driving: A comprehensive overview and field guide for future research directions

S Atakishiyev, M Salameh, H Yao, R Goebel - IEEE Access, 2024 - ieeexplore.ieee.org
Autonomous driving has achieved significant milestones in research and development over
the last two decades. There is increasing interest in the field as the deployment of …

A survey on explainable reinforcement learning: Concepts, algorithms, challenges

Y Qing, S Liu, J Song, H Wang, M Song - arXiv preprint arXiv:2211.06665, 2022 - arxiv.org
Reinforcement Learning (RL) is a popular machine learning paradigm where intelligent
agents interact with the environment to fulfill a long-term goal. Driven by the resurgence of …

State2explanation: Concept-based explanations to benefit agent learning and user understanding

D Das, S Chernova, B Kim - Advances in Neural …, 2023 - proceedings.neurips.cc
As more non-AI experts use complex AI systems for daily tasks, there has been an
increasing effort to develop methods that produce explanations of AI decision making that …

A survey on interpretable reinforcement learning

C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao… - Machine Learning, 2024 - Springer
Although deep reinforcement learning has become a promising machine learning approach
for sequential decision-making problems, it is still not mature enough for high-stake domains …

Validating metrics for reward alignment in human-autonomy teaming

L Sanneman, JA Shah - Computers in Human Behavior, 2023 - Elsevier
Alignment of human and autonomous agent values and objectives is vital in human-
autonomy teaming settings which require collaborative action toward a common goal. In …

The Utility of “Even if” semifactual explanation to optimise positive outcomes

E Kenny, W Huang - Advances in Neural Information …, 2024 - proceedings.neurips.cc
When users receive either a positive or negative outcome from an automated system,
Explainable AI (XAI) has almost exclusively focused on how to mutate negative outcomes …

Evaluation and improvement of interpretability for self-explainable part-prototype networks

Q Huang, M Xue, W Huang, H Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Part-prototype networks (eg, ProtoPNet, ProtoTree, and ProtoPool) have attracted broad
research interest for their intrinsic interpretability and comparable accuracy to non …

Refining diffusion planner for reliable behavior synthesis by automatic detection of infeasible plans

K Lee, S Kim, J Choi - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Diffusion-based planning has shown promising results in long-horizon, sparse-reward tasks
by training trajectory diffusion models and conditioning the sampled trajectories using …

Interpretable deep reinforcement learning for optimizing heterogeneous energy storage systems

L Xiong, Y Tang, C Liu, S Mao, K Meng… - … on Circuits and …, 2023 - ieeexplore.ieee.org
Energy storage systems (ESS) are pivotal component in the energy market, serving as both
energy suppliers and consumers. ESS operators can reap benefits from energy arbitrage by …

Mixture of Gaussian-distributed Prototypes with Generative Modelling for Interpretable Image Classification

C Wang, Y Chen, F Liu, DJ McCarthy, H Frazer… - arXiv preprint arXiv …, 2023 - arxiv.org
Prototypical-part interpretable methods, eg, ProtoPNet, enhance interpretability by
connecting classification predictions to class-specific training prototypes, thereby offering an …