A survey on explainable reinforcement learning: Concepts, algorithms, challenges

Y Qing, S Liu, J Song, H Wang, M Song - arXiv preprint arXiv:2211.06665, 2022 - arxiv.org
Reinforcement Learning (RL) is a popular machine learning paradigm where intelligent
agents interact with the environment to fulfill a long-term goal. Driven by the resurgence of …

Cooperative exploration for multi-agent deep reinforcement learning

IJ Liu, U Jain, RA Yeh… - … conference on machine …, 2021 - proceedings.mlr.press
Exploration is critical for good results in deep reinforcement learning and has attracted much
attention. However, existing multi-agent deep reinforcement learning algorithms still use …

Learning transformer programs

D Friedman, A Wettig, D Chen - Advances in Neural …, 2024 - proceedings.neurips.cc
Recent research in mechanistic interpretability has attempted to reverse-engineer
Transformer models by carefully inspecting network weights and activations. However, these …

Compositional reinforcement learning from logical specifications

K Jothimurugan, S Bansal… - Advances in Neural …, 2021 - proceedings.neurips.cc
We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …

Generative AI for Self-Adaptive Systems: State of the Art and Research Roadmap

J Li, M Zhang, N Li, D Weyns, Z Jin, K Tei - ACM Transactions on …, 2024 - dl.acm.org
Self-adaptive systems (SASs) are designed to handle changes and uncertainties through a
feedback loop with four core functionalities: monitoring, analyzing, planning, and execution …

Integrating the traffic science with representation learning for city-wide network congestion prediction

W Zheng, HF Yang, J Cai, P Wang, X Jiang, SS Du… - Information …, 2023 - Elsevier
Recent studies on traffic congestion prediction have paved a promising path towards the
reduction of potential economic and environmental loss. However, at the city-wide scale …

Program synthesis guided reinforcement learning for partially observed environments

Y Yang, JP Inala, O Bastani, Y Pu… - Advances in neural …, 2021 - proceedings.neurips.cc
A key challenge for reinforcement learning is solving long-horizon planning problems.
Recent work has leveraged programs to guide reinforcement learning in these settings …

Web question answering with neurosymbolic program synthesis

Q Chen, A Lamoreaux, X Wang, G Durrett… - Proceedings of the …, 2021 - dl.acm.org
In this paper, we propose a new technique based on program synthesis for extracting
information from webpages. Given a natural language query and a few labeled webpages …

2021 年无人机热点回眸

段海滨, 何杭轩, 赵彦杰, 王寅, 牛轶峰, 袁莞迈… - 科技导报, 2022 - kjdb.org
随着人工智能技术的发展以及机载任务平台性能的提升, 2021 年无人机技术与应用呈现新的
发展态势. 从无人机政策法规, 自主控制, 反无人机, 应用领域等多方面回眸了2021 …

Programming-by-demonstration for long-horizon robot tasks

N Patton, K Rahmani, M Missula, J Biswas… - Proceedings of the ACM …, 2024 - dl.acm.org
The goal of programmatic Learning from Demonstration (LfD) is to learn a policy in a
programming language that can be used to control a robot's behavior from a set of user …