Rlcard: A toolkit for reinforcement learning in card games

Recent advances in deep reinforcement learning applications for solving partially observable markov decision processes (pomdp) problems: Part 1—fundamentals …

X Xiang, S Foo - Machine Learning and Knowledge Extraction, 2021 - mdpi.com

The first part of a two-part series of papers provides a survey on recent advances in Deep
Reinforcement Learning (DRL) applications for solving partially observable Markov decision …

被引用次数：59 相关文章所有 6 个版本

[PDF] neurips.cc

Pettingzoo: Gym for multi-agent reinforcement learning

J Terry, B Black, N Grammel… - Advances in …, 2021 - proceedings.neurips.cc

This paper introduces the PettingZoo library and the accompanying Agent Environment
Cycle (" AEC") games model. PettingZoo is a library of diverse sets of multi-agent …

被引用次数：367 相关文章所有 12 个版本

[PDF] arxiv.org

Meta-AAD: Active anomaly detection with deep reinforcement learning

D Zha, KH Lai, M Wan, X Hu - 2020 IEEE International …, 2020 - ieeexplore.ieee.org

High false-positive rate is a long-standing challenge for anomaly detection algorithms,
especially in high-stake applications. To identify the true anomalies, in practice, analysts or …

被引用次数：85 相关文章所有 7 个版本

[PDF] acm.org

Policy-gnn: Aggregation optimization for graph neural networks

KH Lai, D Zha, K Zhou, X Hu - Proceedings of the 26th ACM SIGKDD …, 2020 - dl.acm.org

Graph data are pervasive in many real-world applications. Recently, increasing attention
has been paid on graph neural networks (GNNs), which aim to model the local graph …

被引用次数：110 相关文章所有 5 个版本

[PDF] arxiv.org

Suspicion-agent: Playing imperfect information games with theory of mind aware gpt-4

J Guo, B Yang, P Yoo, BY Lin, Y Iwasawa… - arXiv preprint arXiv …, 2023 - arxiv.org

Unlike perfect information games, where all elements are known to every player, imperfect
information games emulate the real-world complexities of decision-making under uncertain …

被引用次数：28 相关文章所有 3 个版本

[PDF] mdpi.com

Techniques and paradigms in modern game AI systems

Y Lu, W Li - Algorithms, 2022 - mdpi.com

Games have long been benchmarks and test-beds for AI algorithms. With the development
of AI techniques and the boost of computational power, modern game AI systems have …

被引用次数：13 相关文章所有 3 个版本

[PDF] neurips.cc

Perfectdou: Dominating doudizhu with perfect information distillation

G Yang, M Liu, W Hong, W Zhang… - Advances in …, 2022 - proceedings.neurips.cc

As a challenging multi-player card game, DouDizhu has recently drawn much attention for
analyzing competition and collaboration in imperfect-information games. In this paper, we …

被引用次数：29 相关文章所有 5 个版本

[PDF] arxiv.org

Dual policy distillation

KH Lai, D Zha, Y Li, X Hu - arXiv preprint arXiv:2006.04061, 2020 - arxiv.org

Policy distillation, which transfers a teacher policy to a student policy has achieved great
success in challenging tasks of deep reinforcement learning. This teacher-student …

被引用次数：59 相关文章所有 8 个版本

[PDF] arxiv.org

Agent-pro: Learning to evolve via policy-level reflection and optimization

W Zhang, K Tang, H Wu, M Wang, Y Shen… - arXiv preprint arXiv …, 2024 - arxiv.org

Large Language Models exhibit robust problem-solving capabilities for diverse tasks.
However, most LLM-based agents are designed as specific task solvers with sophisticated …

被引用次数：26 相关文章所有 3 个版本

[PDF] arxiv.org

Sample efficient reinforcement learning via model-ensemble exploration and exploitation

Y Yao, L Xiao, Z An, W Zhang… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org

Model-based deep reinforcement learning has achieved success in various domains that
require high sample efficiencies, such as Go and robotics. However, there are some …

被引用次数：27 相关文章所有 6 个版本