Independent reinforcement learners in cooperative markov games: a survey regarding coordination problems
L Matignon, GJ Laurent, N Le Fort-Piat - The Knowledge …, 2012 - cambridge.org
In the framework of fully cooperative multi-agent systems, independent (non-communicative)
agents that learn by reinforcement must overcome several difficulties to manage to …
agents that learn by reinforcement must overcome several difficulties to manage to …
Lenient learning in independent-learner stochastic cooperative games
We introduce the Lenient Multiagent Reinforcement Learning 2 (LMRL2) algorithm for
independent-learner stochastic cooperative games. LMRL2 is designed to overcome a …
independent-learner stochastic cooperative games. LMRL2 is designed to overcome a …
MARL-based distributed cache placement for wireless networks
X Lin, Y Tang, X Lei, J Xia, Q Zhou, H Wu, L Fan - IEEE Access, 2019 - ieeexplore.ieee.org
We investigate a distributed caching strategy based on multi-agent reinforcement learning
(MARL) in a cache-aided network, where all wireless nodes have limited storage capacity …
(MARL) in a cache-aided network, where all wireless nodes have limited storage capacity …
Multi-agent coordination in adversarial environments through signal mediated strategies
Many real-world scenarios involve teams of agents that have to coordinate their actions to
reach a shared goal. We focus on the setting in which a team of agents faces an opponent in …
reach a shared goal. We focus on the setting in which a team of agents faces an opponent in …
Moderate confirmation bias enhances decision-making in groups of reinforcement-learning agents
C Bergerot, W Barfuss… - PLOS Computational …, 2024 - journals.plos.org
Humans tend to give more weight to information confirming their beliefs than to information
that disconfirms them. Nevertheless, this apparent irrationality has been shown to improve …
that disconfirms them. Nevertheless, this apparent irrationality has been shown to improve …
[PDF][PDF] The dynamics of reinforcement social learning in cooperative multiagent systems
Coordination in cooperative multiagent systems is an important problem in multiagent
learning literature. In practical complex environments, the interactions between agents can …
learning literature. In practical complex environments, the interactions between agents can …
Improved Q-learning algorithm based on approximate state matching in agricultural plant protection environment
F Sun, X Wang, R Zhang - Entropy, 2021 - mdpi.com
An Unmanned Aerial Vehicle (UAV) can greatly reduce manpower in the agricultural plant
protection such as watering, sowing, and pesticide spraying. It is essential to develop a …
protection such as watering, sowing, and pesticide spraying. It is essential to develop a …
Designing decentralized controllers for distributed-air-jet mems-based micromanipulators by reinforcement learning
L Matignon, GJ Laurent, N Le Fort-Piat… - Journal of intelligent & …, 2010 - Springer
Distributed-air-jet MEMS-based systems have been proposed to manipulate small parts with
high velocities and without any friction problems. The control of such distributed systems is …
high velocities and without any friction problems. The control of such distributed systems is …
A convergent multiagent reinforcement learning approach for a subclass of cooperative stochastic games
T Kemmerich, H Kleine Büning - International Workshop on Adaptive and …, 2011 - Springer
We present a distributed Q-Learning approach for independently learning agents in a
subclass of cooperative stochastic games called cooperative sequential stage games. In this …
subclass of cooperative stochastic games called cooperative sequential stage games. In this …
Reflections on remote reflection
M Richmond, J Noble - Proceedings 24th Australian Computer …, 2001 - ieeexplore.ieee.org
The Java programming language provides both reflection and remote method invocation:
reflection allows a program to inspect itself and its runtime environment, remote method …
reflection allows a program to inspect itself and its runtime environment, remote method …