Explainable reinforcement learning: A survey and comparative review

S Milani, N Topin, M Veloso, F Fang - ACM Computing Surveys, 2024 - dl.acm.org
Explainable reinforcement learning (XRL) is an emerging subfield of explainable machine
learning that has attracted considerable attention in recent years. The goal of XRL is to …

Explainable reinforcement learning (XRL): a systematic literature review and taxonomy

Y Bekkemoen - Machine Learning, 2024 - Springer
In recent years, reinforcement learning (RL) systems have shown impressive performance
and remarkable achievements. Many achievements can be attributed to combining RL with …

强化学习的可解释方法分类研究.

唐蕾, 牛园园, 王瑞杰, 行本贝… - Application Research of …, 2024 - search.ebscohost.com
强化学习能够在动态复杂环境中实现自主学习, 这使其在法律, 医学, 金融等领域有着广泛应用.
但强化学习仍面临着全局状态空间不可观测, 对奖励函数强依赖和因果关系不确定等诸多问题 …

Experiential explanations for reinforcement learning

A Alabdulkarim, M Singh, G Mansi, K Hall… - arXiv preprint arXiv …, 2022 - arxiv.org
Reinforcement Learning (RL) systems can be complex and non-interpretable, making it
challenging for non-AI experts to understand or intervene in their decisions. This is due in …

Reinforcement learning and game theory based cyber-physical security framework for the humans interacting over societal control systems

Y Cao, C Tao - Frontiers in Energy Research, 2024 - frontiersin.org
A lot of infrastructure upgrade and algorithms have been developed for the information
technology driven smart grids over the past twenty years, especially with increasing interest …

Personalized Path Recourse

D Hong, T Wang - arXiv preprint arXiv:2312.08724, 2023 - arxiv.org
This paper introduces Personalized Path Recourse, a novel method that generates recourse
paths for an agent. The objective is to achieve desired goals (eg, better outcomes compared …

Explaining black box reinforcement learning agents through counterfactual policies

M Movin, GD Junior, J Hollmén… - … Symposium on Intelligent …, 2023 - Springer
Despite the increased attention to explainable AI, explainability methods for understanding
reinforcement learning (RL) agents have not been extensively studied. Failing to understand …

Building Interpretable Machine Learning Models for Sequential Data

D Hong - 2023 - search.proquest.com
Abstract Machine learning is progressing at an astounding rate. The past decade has seen
an explosion in the amount of machine learning research, including deep learning …

[PDF][PDF] Counterfactual Explanations of Learned Reward Functions

J Wehner - 2023 - repository.tudelft.nl
As AI systems become widely employed this technology will profoundly impact society. To
ensure this impact is positive it is essential to align these systems with the values and …