Recent advances in deep reinforcement learning applications for solving partially observable markov decision processes (pomdp) problems: Part 1—fundamentals …

X Xiang, S Foo - Machine Learning and Knowledge Extraction, 2021 - mdpi.com
The first part of a two-part series of papers provides a survey on recent advances in Deep
Reinforcement Learning (DRL) applications for solving partially observable Markov decision …

Pettingzoo: Gym for multi-agent reinforcement learning

J Terry, B Black, N Grammel… - Advances in …, 2021 - proceedings.neurips.cc
This paper introduces the PettingZoo library and the accompanying Agent Environment
Cycle (" AEC") games model. PettingZoo is a library of diverse sets of multi-agent …

Meta-AAD: Active anomaly detection with deep reinforcement learning

D Zha, KH Lai, M Wan, X Hu - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
High false-positive rate is a long-standing challenge for anomaly detection algorithms,
especially in high-stake applications. To identify the true anomalies, in practice, analysts or …

Policy-gnn: Aggregation optimization for graph neural networks

KH Lai, D Zha, K Zhou, X Hu - Proceedings of the 26th ACM SIGKDD …, 2020 - dl.acm.org
Graph data are pervasive in many real-world applications. Recently, increasing attention
has been paid on graph neural networks (GNNs), which aim to model the local graph …

Suspicion-agent: Playing imperfect information games with theory of mind aware gpt-4

J Guo, B Yang, P Yoo, BY Lin, Y Iwasawa… - arXiv preprint arXiv …, 2023 - arxiv.org
Unlike perfect information games, where all elements are known to every player, imperfect
information games emulate the real-world complexities of decision-making under uncertain …

Techniques and paradigms in modern game AI systems

Y Lu, W Li - Algorithms, 2022 - mdpi.com
Games have long been benchmarks and test-beds for AI algorithms. With the development
of AI techniques and the boost of computational power, modern game AI systems have …

Perfectdou: Dominating doudizhu with perfect information distillation

G Yang, M Liu, W Hong, W Zhang… - Advances in …, 2022 - proceedings.neurips.cc
As a challenging multi-player card game, DouDizhu has recently drawn much attention for
analyzing competition and collaboration in imperfect-information games. In this paper, we …

Dual policy distillation

KH Lai, D Zha, Y Li, X Hu - arXiv preprint arXiv:2006.04061, 2020 - arxiv.org
Policy distillation, which transfers a teacher policy to a student policy has achieved great
success in challenging tasks of deep reinforcement learning. This teacher-student …

Agent-pro: Learning to evolve via policy-level reflection and optimization

W Zhang, K Tang, H Wu, M Wang, Y Shen… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models exhibit robust problem-solving capabilities for diverse tasks.
However, most LLM-based agents are designed as specific task solvers with sophisticated …

Sample efficient reinforcement learning via model-ensemble exploration and exploitation

Y Yao, L Xiao, Z An, W Zhang… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
Model-based deep reinforcement learning has achieved success in various domains that
require high sample efficiencies, such as Go and robotics. However, there are some …