Stackelberg actor-critic: Game-theoretic reinforcement learning algorithms

L Zheng, T Fiez, Z Alumbaugh, B Chasnov… - Proceedings of the AAAI …, 2022 - ojs.aaai.org
The hierarchical interaction between the actor and critic in actor-critic based reinforcement
learning algorithms naturally lends itself to a game-theoretic interpretation. We adopt this …

Near-optimal multi-agent learning for safe coverage control

M Prajapat, M Turchetta… - Advances in Neural …, 2022 - proceedings.neurips.cc
In multi-agent coverage control problems, agents navigate their environment to reach
locations that maximize the coverage of some density. In practice, the density is rarely …

Submodular reinforcement learning

M Prajapat, M Mutný, MN Zeilinger… - arXiv preprint arXiv …, 2023 - arxiv.org
In reinforcement learning (RL), rewards of states are typically considered additive, and
following the Markov assumption, they are $\textit {independent} $ of states visited …

TSGS: Two-stage security game solution based on deep reinforcement learning for Internet of Things

X Feng, H Xia, S Xu, L Xu, R Zhang - Expert Systems with Applications, 2023 - Elsevier
The lack of effective defense resource allocation strategies and reliable multi-agent
collaboration mechanisms lead to the low stability of Deep Reinforcement Learning (DRL) …

[HTML][HTML] Security defense strategy algorithm for Internet of Things based on deep reinforcement learning

X Feng, J Han, R Zhang, S Xu, H Xia - High-Confidence Computing, 2024 - Elsevier
Currently, important privacy data of the Internet of Things (IoT) face extremely high risks of
leakage. Attackers persistently engage in continuous attacks on terminal devices to obtain …

Robust reinforcement learning: A constrained game-theoretic approach

J Yu, C Gehring, F Schäfer… - Learning for Dynamics …, 2021 - proceedings.mlr.press
Deep reinforcement learning (RL) methods provide state-of-art performance in complex
control tasks. However, it has been widely recognized that RL methods often fail to …

Risk-minimizing two-player zero-sum stochastic differential game via path integral control

A Patil, Y Zhou, D Fridovich-Keil… - 2023 62nd IEEE …, 2023 - ieeexplore.ieee.org
This paper addresses a continuous-time risk-minimizing two-player zero-sum stochastic
differential game (SDG), in which each player aims to minimize its probability of failure …

Neural tree expansion for multi-robot planning in non-cooperative environments

B Riviere, W Hönig, M Anderson… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org
We present a self-improving, Neural Tree Expansion (NTE) method for multi-robot online
planning in non-cooperative environments, where each robot attempts to maximize its …

Stackelberg games for learning emergent behaviors during competitive autocurricula

B Yang, L Zheng, LJ Ratliff, B Boots… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Autocurricular training is an important sub-area of multi-agent reinforcement learning
(MARL) that allows multiple agents to learn emergent skills in an unsupervised co-evolving …

Reinforcement learning for unified allocation and patrolling in signaling games with uncertainty

A Venugopal, E Bondi, H Kamarthi, K Dholakia… - arXiv preprint arXiv …, 2020 - arxiv.org
Green Security Games (GSGs) have been successfully used in the protection of valuable
resources such as fisheries, forests and wildlife. While real-world deployment involves both …