Reward prediction error neurons implement an efficient code for reward

HH Schütt, D Kim, WJ Ma - Nature Neuroscience, 2024 - nature.com
We use efficient coding principles borrowed from sensory neuroscience to derive the optimal
neural population to encode a reward distribution. We show that the responses of …

Reinforcement learning in populations of spiking neurons

R Urbanczik, W Senn - Nature neuroscience, 2009 - nature.com
Population coding is widely regarded as an important mechanism for achieving reliable
behavioral responses despite neuronal variability. However, standard reinforcement …

Distributional reinforcement learning in the brain

AS Lowet, Q Zheng, S Matias, J Drugowitsch… - Trends in …, 2020 - cell.com
Learning about rewards and punishments is critical for survival. Classical studies have
demonstrated an impressive correspondence between the firing of dopamine neurons in the …

Amygdala and ventral striatum population codes implement multiple learning rates for reinforcement learning

BB Averbeck - 2017 IEEE Symposium Series on Computational …, 2017 - ieeexplore.ieee.org
Standard models of reinforcement learning in the brain assume that dopamine codes reward
prediction errors, and these reward prediction errors are integrated by the striatum to …

A distributional code for value in dopamine-based reinforcement learning

W Dabney, Z Kurth-Nelson, N Uchida, CK Starkweather… - Nature, 2020 - nature.com
Since its introduction, the reward prediction error theory of dopamine has explained a wealth
of empirical phenomena, providing a unifying framework for understanding the …

Neural circuitry of reward prediction error

M Watabe-Uchida, N Eshel… - Annual review of …, 2017 - annualreviews.org
Dopamine neurons facilitate learning by calculating reward prediction error, or the difference
between expected and actual reward. Despite two decades of research, it remains unclear …

Anterior cingulate learns reward distribution

T Hong, WR Stauffer - Nature Neuroscience, 2024 - nature.com
Muller et al. demonstrate that reward signals recorded from the frontal cortex of nonhuman
primates exhibit a population-based scheme for learning probability distributions over …

Beyond simple reinforcement learning: the computational neurobiology of reward‐learning and valuation

JP O'Doherty - European Journal of Neuroscience, 2012 - Wiley Online Library
Neural computational accounts of reward‐learning have been dominated by the hypothesis
that dopamine neurons behave like a reward‐prediction error and thus facilitate …

Adaptive coding of reward prediction errors is gated by striatal coupling

SQ Park, T Kahnt, D Talmi… - Proceedings of the …, 2012 - National Acad Sciences
To efficiently represent all of the possible rewards in the world, dopaminergic midbrain
neurons dynamically adapt their coding range to the momentarily available rewards …

Focus on decision making.

H Bayer - Nature neuroscience, 2008 - nature.com
The ability to make appropriate choices is critical for survival. Successful decision making
requires the integration of sensory information, motivational states and potential outcomes to …