Statistical Efficiency of Distributional Temporal Difference

Y Peng, L Zhang, Z Zhang - arXiv preprint arXiv:2403.05811, 2024 - arxiv.org
Distributional reinforcement learning (DRL), which models the full distribution of returns
rather than just the mean, has achieved empirical success in various domains. One of the …