Local advantage actor-critic for robust multi-agent deep reinforcement learning

R Gorsane, O Mahjoub, RJ de Kock… - Advances in …, 2022 - proceedings.neurips.cc

Multi-agent reinforcement learning (MARL) has emerged as a useful approach to solving
decentralised decision-making problems at scale. Research in the field has been growing …

被引用次数：37 相关文章所有 5 个版本

[PDF] mdpi.com

Stock market prediction using deep reinforcement learning

AL Awad, SM Elkaffas, MW Fakhr - Applied System Innovation, 2023 - mdpi.com

Stock value prediction and trading, a captivating and complex research domain, continues to
draw heightened attention. Ensuring profitable returns in stock market investments demands …

被引用次数：15 相关文章所有 3 个版本

[PDF] mlr.press

Asymmetric DQN for partially observable reinforcement learning

A Baisero, B Daley, C Amato - Uncertainty in Artificial …, 2022 - proceedings.mlr.press

Offline training in simulated partially observable environments allows reinforcement learning
methods to exploit privileged state information through a mechanism known as asymmetry …

被引用次数：11 相关文章所有 4 个版本

[PDF] arxiv.org

Unbiased asymmetric reinforcement learning under partial observability

A Baisero, C Amato - arXiv preprint arXiv:2105.11674, 2021 - arxiv.org

In partially observable reinforcement learning, offline training gives access to latent
information which is not available during online training and/or execution, such as the …

被引用次数：21 相关文章所有 10 个版本

Learning controlled and targeted communication with the centralized critic for the multi-agent system

Q Sun, Y Yao, P Yi, YJ Hu, Z Yang, G Yang, X Zhou - Applied Intelligence, 2023 - Springer

Multi-agent deep reinforcement learning (MDRL) has attracted attention for solving complex
tasks. Two main challenges of MDRL are non-stationarity and partial observability from the …

被引用次数：6 相关文章所有 3 个版本

[PDF] springer.com

Formal Modelling for Multi-Robot Systems Under Uncertainty

C Street, M Mansouri, B Lacerda - Current Robotics Reports, 2023 - Springer

Abstract Purpose of Review To effectively synthesise and analyse multi-robot behaviour, we
require formal task-level models which accurately capture multi-robot execution. In this …

Factored Online Planning in Many-Agent POMDPs

MFL Galesloot, TD Simão, S Junges… - Proceedings of the AAAI …, 2024 - ojs.aaai.org

In centralized multi-agent systems, often modeled as multi-agent partially observable
Markov decision processes (MPOMDPs), the action and observation spaces grow …

被引用次数：1 相关文章所有 5 个版本

[PDF] openreview.net

Leveraging Mutual Information for Asymmetric Learning under Partial Observability

HH Nguyen, C Amato, R Platt - 8th Annual Conference on Robot … - openreview.net

Even though partial observability is prevalent in robotics, most reinforcement learning
studies avoid it due to the difficulty of learning a policy that can efficiently memorize past …