作者
Adam S Lowet, Qiao Zheng, Sara Matias, Jan Drugowitsch, Naoshige Uchida
发表日期
2020/12/1
来源
Trends in Neurosciences
卷号
43
期号
12
页码范围
980-997
出版商
Elsevier Current Trends
简介
Learning about rewards and punishments is critical for survival. Classical studies have demonstrated an impressive correspondence between the firing of dopamine neurons in the mammalian midbrain and the reward prediction errors of reinforcement learning algorithms, which express the difference between actual reward and predicted mean reward. However, it may be advantageous to learn not only the mean but also the complete distribution of potential rewards. Recent advances in machine learning have revealed a biologically plausible set of algorithms for reconstructing this reward distribution from experience. Here, we review the mathematical foundations of these algorithms as well as initial evidence for their neurobiological implementation. We conclude by highlighting outstanding questions regarding the circuit computation and behavioral readout of these distributional codes.
引用总数
202020212022202320241518268
学术搜索中的文章
AS Lowet, Q Zheng, S Matias, J Drugowitsch, N Uchida - Trends in neurosciences, 2020