Variational oracle guiding for reinforcement learning

X Zhao, SB Holden - 2022 IEEE Conference on Games (CoG), 2022 - ieeexplore.ieee.org

Mahjong is a multi-player imperfect-information game with challenging features for AI
research. Sanma, being a 3-player variant of Japanese Riichi Mahjong, possesses unique …

被引用次数：7 相关文章所有 6 个版本

[PDF] nature.com

Synergizing habits and goals with variational Bayes

D Han, K Doya, D Li, J Tani - Nature Communications, 2024 - nature.com

Behaving efficiently and flexibly is crucial for biological and artificial embodied agents.
Behavior is generally classified into two types: habitual (fast but inflexible), and goal-directed …

LsAc ^∗‐MJ: A Low‐Resource Consumption Reinforcement Learning Model for Mahjong Game

X Li, Z Wang, B Liu, J Dai - International Journal of Intelligent …, 2024 - Wiley Online Library

This article proposes a novel Mahjong game model, LsAc∗‐MJ, designed to address
challenges posed by data scarcity, difficulty in leveraging contextual information, and the …

MJ-DLVAT: A Deep Learning Value Assessment Technique for Mahjong

T Ogami, K Amano, Y Tsuruoka - 2024 IEEE Conference on …, 2024 - ieeexplore.ieee.org

In games with stochastic outcomes, evaluating agent performance from limited data is
challenging. Results of Monte Carlo sampling do not provide a reliable indicator due to the …

[PDF] arxiv.org