Competitive policy optimization

L Zheng, T Fiez, Z Alumbaugh, B Chasnov… - Proceedings of the AAAI …, 2022 - ojs.aaai.org

The hierarchical interaction between the actor and critic in actor-critic based reinforcement
learning algorithms naturally lends itself to a game-theoretic interpretation. We adopt this …

被引用次数：41 相关文章所有 9 个版本

[PDF] neurips.cc

Near-optimal multi-agent learning for safe coverage control

M Prajapat, M Turchetta… - Advances in Neural …, 2022 - proceedings.neurips.cc

In multi-agent coverage control problems, agents navigate their environment to reach
locations that maximize the coverage of some density. In practice, the density is rarely …

被引用次数：15 相关文章所有 10 个版本

[PDF] arxiv.org

Submodular reinforcement learning

M Prajapat, M Mutný, MN Zeilinger… - arXiv preprint arXiv …, 2023 - arxiv.org

In reinforcement learning (RL), rewards of states are typically considered additive, and
following the Markov assumption, they are $\textit {independent} $ of states visited …

被引用次数：9 相关文章所有 5 个版本

TSGS: Two-stage security game solution based on deep reinforcement learning for Internet of Things

X Feng, H Xia, S Xu, L Xu, R Zhang - Expert Systems with Applications, 2023 - Elsevier

The lack of effective defense resource allocation strategies and reliable multi-agent
collaboration mechanisms lead to the low stability of Deep Reinforcement Learning (DRL) …

被引用次数：4 相关文章所有 2 个版本

[HTML] sciencedirect.com

[HTML][HTML] Security defense strategy algorithm for Internet of Things based on deep reinforcement learning

X Feng, J Han, R Zhang, S Xu, H Xia - High-Confidence Computing, 2024 - Elsevier

Currently, important privacy data of the Internet of Things (IoT) face extremely high risks of
leakage. Attackers persistently engage in continuous attacks on terminal devices to obtain …

被引用次数：5 相关文章

[PDF] mlr.press

Robust reinforcement learning: A constrained game-theoretic approach

J Yu, C Gehring, F Schäfer… - Learning for Dynamics …, 2021 - proceedings.mlr.press

Deep reinforcement learning (RL) methods provide state-of-art performance in complex
control tasks. However, it has been widely recognized that RL methods often fail to …

被引用次数：16 相关文章所有 8 个版本

[PDF] arxiv.org

Risk-minimizing two-player zero-sum stochastic differential game via path integral control

A Patil, Y Zhou, D Fridovich-Keil… - 2023 62nd IEEE …, 2023 - ieeexplore.ieee.org

This paper addresses a continuous-time risk-minimizing two-player zero-sum stochastic
differential game (SDG), in which each player aims to minimize its probability of failure …

被引用次数：4 相关文章所有 4 个版本

[PDF] arxiv.org

Neural tree expansion for multi-robot planning in non-cooperative environments

B Riviere, W Hönig, M Anderson… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org

We present a self-improving, Neural Tree Expansion (NTE) method for multi-robot online
planning in non-cooperative environments, where each robot attempts to maximize its …

被引用次数：14 相关文章所有 6 个版本

[PDF] arxiv.org

Stackelberg games for learning emergent behaviors during competitive autocurricula

B Yang, L Zheng, LJ Ratliff, B Boots… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org

Autocurricular training is an important sub-area of multi-agent reinforcement learning
(MARL) that allows multiple agents to learn emergent skills in an unsupervised co-evolving …

被引用次数：6 相关文章所有 3 个版本

[PDF] arxiv.org

Reinforcement learning for unified allocation and patrolling in signaling games with uncertainty

A Venugopal, E Bondi, H Kamarthi, K Dholakia… - arXiv preprint arXiv …, 2020 - arxiv.org

Green Security Games (GSGs) have been successfully used in the protection of valuable
resources such as fisheries, forests and wildlife. While real-world deployment involves both …

被引用次数：12 相关文章所有 11 个版本

Stackelberg actor-critic: Game-theoretic reinforcement learning algorithms

Near-optimal multi-agent learning for safe coverage control

Submodular reinforcement learning

TSGS: Two-stage security game solution based on deep reinforcement learning for Internet of Things

[HTML][HTML] Security defense strategy algorithm for Internet of Things based on deep reinforcement learning

Robust reinforcement learning: A constrained game-theoretic approach

Risk-minimizing two-player zero-sum stochastic differential game via path integral control

Neural tree expansion for multi-robot planning in non-cooperative environments

Stackelberg games for learning emergent behaviors during competitive autocurricula

Reinforcement learning for unified allocation and patrolling in signaling games with uncertainty

高级搜索

引用