Multi-agent reinforcement learning: Methods, applications, visionary prospects, and challenges

Z Zhou, G Liu, Y Tang - arXiv preprint arXiv:2305.10091, 2023 - arxiv.org
Multi-agent reinforcement learning (MARL) is a widely used Artificial Intelligence (AI)
technique. However, current studies and applications need to address its scalability, non …

Efficient adversarial attacks on online multi-agent reinforcement learning

G Liu, L Lai - Advances in Neural Information Processing …, 2023 - proceedings.neurips.cc
Due to the broad range of applications of multi-agent reinforcement learning (MARL),
understanding the effects of adversarial attacks against MARL model is essential for the safe …

Multiagent Reinforcement Learning: Methods, Trustworthiness, Applications in Intelligent Vehicles, and Challenges

Z Zhou, G Liu, Y Tang - IEEE Transactions on Intelligent …, 2024 - ieeexplore.ieee.org
Multiagent Reinforcement Learning (MARL) plays a pivotal role in intelligent vehicle
systems, offering solutions for complex decision-making, coordination, and adaptive …

Implicit poisoning attacks in two-agent reinforcement learning: Adversarial policies for training-time attacks

M Mohammadi, J Nöther, D Mandal, A Singla… - arXiv preprint arXiv …, 2023 - arxiv.org
In targeted poisoning attacks, an attacker manipulates an agent-environment interaction to
force the agent into adopting a policy of interest, called target policy. Prior work has primarily …

The Bandit Whisperer: Communication Learning for Restless Bandits

Y Zhao, T Wang, D Nagaraj, A Taneja… - arXiv preprint arXiv …, 2024 - arxiv.org
Applying Reinforcement Learning (RL) to Restless Multi-Arm Bandits (RMABs) offers a
promising avenue for addressing allocation problems with resource constraints and …

Rampart: Reinforcing autonomous multi-agent protection through adversarial resistance in transportation

MT Hossain, H La, S Badsha - Journal on Autonomous Transportation …, 2024 - dl.acm.org
In the field of multi-agent autonomous transportation, such as automated payload delivery or
highway on-ramp merging, agents routinely exchange knowledge to optimize their shared …

Corruption-Robust Offline Two-Player Zero-Sum Markov Games

A Nika, D Mandal, A Singla… - … Conference on Artificial …, 2024 - proceedings.mlr.press
We study data corruption robustness in offline two-player zero-sum Markov games. Given a
dataset of realized trajectories of two players, an adversary is allowed to modify an $\epsilon …

Towards Offline Opponent Modeling with In-context Learning

Y Jing, K Li, B Liu, Y Zang, H Fu, Q FU… - The Twelfth …, 2023 - openreview.net
Opponent modeling aims at learning the opponent's behaviors, goals, or beliefs to reduce
the uncertainty of the competitive environment and assist decision-making. Existing work …

Data Poisoning to Fake a Nash Equilibria for Markov Games

Y Wu, J McMahan, X Zhu, Q Xie - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
We characterize offline data poisoning attacks on Multi-Agent Reinforcement Learning
(MARL), where an attacker may change a data set in an attempt to install a (potentially …

On faking a Nash equilibrium

Y Wu, J McMahan, X Zhu, Q Xie - arXiv preprint arXiv:2306.08041, 2023 - arxiv.org
We characterize offline data poisoning attacks on Multi-Agent Reinforcement Learning
(MARL), where an attacker may change a data set in an attempt to install a (potentially …