A study of FMQ heuristic in cooperative multi-agent games.

Independent reinforcement learners in cooperative markov games: a survey regarding coordination problems

L Matignon, GJ Laurent, N Le Fort-Piat - The Knowledge …, 2012 - cambridge.org

In the framework of fully cooperative multi-agent systems, independent (non-communicative)
agents that learn by reinforcement must overcome several difficulties to manage to …

被引用次数：542 相关文章所有 13 个版本

[PDF] jmlr.org

Lenient learning in independent-learner stochastic cooperative games

E Wei, S Luke - Journal of Machine Learning Research, 2016 - jmlr.org

We introduce the Lenient Multiagent Reinforcement Learning 2 (LMRL2) algorithm for
independent-learner stochastic cooperative games. LMRL2 is designed to overcome a …

被引用次数：107 相关文章所有 6 个版本

[PDF] ieee.org

MARL-based distributed cache placement for wireless networks

X Lin, Y Tang, X Lei, J Xia, Q Zhou, H Wu, L Fan - IEEE Access, 2019 - ieeexplore.ieee.org

We investigate a distributed caching strategy based on multi-agent reinforcement learning
(MARL) in a cache-aided network, where all wireless nodes have limited storage capacity …

被引用次数：48 相关文章所有 2 个版本

[PDF] arxiv.org

Multi-agent coordination in adversarial environments through signal mediated strategies

F Cacciamani, A Celli, M Ciccone, N Gatti - arXiv preprint arXiv …, 2021 - arxiv.org

Many real-world scenarios involve teams of agents that have to coordinate their actions to
reach a shared goal. We focus on the setting in which a team of agents faces an opponent in …

被引用次数：16 相关文章所有 10 个版本

[PDF] plos.org

Moderate confirmation bias enhances decision-making in groups of reinforcement-learning agents

C Bergerot, W Barfuss… - PLOS Computational …, 2024 - journals.plos.org

Humans tend to give more weight to information confirming their beliefs than to information
that disconfirms them. Nevertheless, this apparent irrationality has been shown to improve …

[PDF][PDF] The dynamics of reinforcement social learning in cooperative multiagent systems

J Hao, H Leung - Twenty-Third International Joint Conference on …, 2013 - Citeseer

Coordination in cooperative multiagent systems is an important problem in multiagent
learning literature. In practical complex environments, the interactions between agents can …

被引用次数：35 相关文章所有 4 个版本

[PDF] mdpi.com

Improved Q-learning algorithm based on approximate state matching in agricultural plant protection environment

F Sun, X Wang, R Zhang - Entropy, 2021 - mdpi.com

An Unmanned Aerial Vehicle (UAV) can greatly reduce manpower in the agricultural plant
protection such as watering, sowing, and pesticide spraying. It is essential to develop a …

被引用次数：11 相关文章所有 8 个版本

[PDF] hal.science

Designing decentralized controllers for distributed-air-jet mems-based micromanipulators by reinforcement learning

L Matignon, GJ Laurent, N Le Fort-Piat… - Journal of intelligent & …, 2010 - Springer

Distributed-air-jet MEMS-based systems have been proposed to manipulate small parts with
high velocities and without any friction problems. The control of such distributed systems is …

被引用次数：27 相关文章所有 14 个版本

[PDF] psu.edu

A convergent multiagent reinforcement learning approach for a subclass of cooperative stochastic games

T Kemmerich, H Kleine Büning - International Workshop on Adaptive and …, 2011 - Springer

We present a distributed Q-Learning approach for independently learning agents in a
subclass of cooperative stochastic games called cooperative sequential stage games. In this …

被引用次数：12 相关文章所有 9 个版本

[PDF] psu.edu

Reflections on remote reflection

M Richmond, J Noble - Proceedings 24th Australian Computer …, 2001 - ieeexplore.ieee.org

The Java programming language provides both reflection and remote method invocation:
reflection allows a program to inspect itself and its runtime environment, remote method …

被引用次数：22 相关文章所有 10 个版本