UniZero: Generalized and Efficient Planning with Scalable Latent World Models
Y Pu, Y Niu, J Ren, Z Yang, H Li, Y Liu - arXiv preprint arXiv:2406.10667, 2024 - arxiv.org
Learning predictive world models is essential for enhancing the planning capabilities of
reinforcement learning agents. Notably, the MuZero-style algorithms, based on the value …