Stackelberg actor-critic: Game-theoretic reinforcement learning algorithms
The hierarchical interaction between the actor and critic in actor-critic based reinforcement
learning algorithms naturally lends itself to a game-theoretic interpretation. We adopt this …
learning algorithms naturally lends itself to a game-theoretic interpretation. We adopt this …
Near-optimal multi-agent learning for safe coverage control
M Prajapat, M Turchetta… - Advances in Neural …, 2022 - proceedings.neurips.cc
In multi-agent coverage control problems, agents navigate their environment to reach
locations that maximize the coverage of some density. In practice, the density is rarely …
locations that maximize the coverage of some density. In practice, the density is rarely …
Submodular reinforcement learning
In reinforcement learning (RL), rewards of states are typically considered additive, and
following the Markov assumption, they are $\textit {independent} $ of states visited …
following the Markov assumption, they are $\textit {independent} $ of states visited …
TSGS: Two-stage security game solution based on deep reinforcement learning for Internet of Things
X Feng, H Xia, S Xu, L Xu, R Zhang - Expert Systems with Applications, 2023 - Elsevier
The lack of effective defense resource allocation strategies and reliable multi-agent
collaboration mechanisms lead to the low stability of Deep Reinforcement Learning (DRL) …
collaboration mechanisms lead to the low stability of Deep Reinforcement Learning (DRL) …
[HTML][HTML] Security defense strategy algorithm for Internet of Things based on deep reinforcement learning
X Feng, J Han, R Zhang, S Xu, H Xia - High-Confidence Computing, 2024 - Elsevier
Currently, important privacy data of the Internet of Things (IoT) face extremely high risks of
leakage. Attackers persistently engage in continuous attacks on terminal devices to obtain …
leakage. Attackers persistently engage in continuous attacks on terminal devices to obtain …
Robust reinforcement learning: A constrained game-theoretic approach
Deep reinforcement learning (RL) methods provide state-of-art performance in complex
control tasks. However, it has been widely recognized that RL methods often fail to …
control tasks. However, it has been widely recognized that RL methods often fail to …
Risk-minimizing two-player zero-sum stochastic differential game via path integral control
This paper addresses a continuous-time risk-minimizing two-player zero-sum stochastic
differential game (SDG), in which each player aims to minimize its probability of failure …
differential game (SDG), in which each player aims to minimize its probability of failure …
Neural tree expansion for multi-robot planning in non-cooperative environments
We present a self-improving, Neural Tree Expansion (NTE) method for multi-robot online
planning in non-cooperative environments, where each robot attempts to maximize its …
planning in non-cooperative environments, where each robot attempts to maximize its …
Stackelberg games for learning emergent behaviors during competitive autocurricula
Autocurricular training is an important sub-area of multi-agent reinforcement learning
(MARL) that allows multiple agents to learn emergent skills in an unsupervised co-evolving …
(MARL) that allows multiple agents to learn emergent skills in an unsupervised co-evolving …
Reinforcement learning for unified allocation and patrolling in signaling games with uncertainty
Green Security Games (GSGs) have been successfully used in the protection of valuable
resources such as fisheries, forests and wildlife. While real-world deployment involves both …
resources such as fisheries, forests and wildlife. While real-world deployment involves both …