Reward prediction error neurons implement an efficient code for reward
We use efficient coding principles borrowed from sensory neuroscience to derive the optimal
neural population to encode a reward distribution. We show that the responses of …
neural population to encode a reward distribution. We show that the responses of …
Reinforcement learning in populations of spiking neurons
R Urbanczik, W Senn - Nature neuroscience, 2009 - nature.com
Population coding is widely regarded as an important mechanism for achieving reliable
behavioral responses despite neuronal variability. However, standard reinforcement …
behavioral responses despite neuronal variability. However, standard reinforcement …
Distributional reinforcement learning in the brain
Learning about rewards and punishments is critical for survival. Classical studies have
demonstrated an impressive correspondence between the firing of dopamine neurons in the …
demonstrated an impressive correspondence between the firing of dopamine neurons in the …
Amygdala and ventral striatum population codes implement multiple learning rates for reinforcement learning
BB Averbeck - 2017 IEEE Symposium Series on Computational …, 2017 - ieeexplore.ieee.org
Standard models of reinforcement learning in the brain assume that dopamine codes reward
prediction errors, and these reward prediction errors are integrated by the striatum to …
prediction errors, and these reward prediction errors are integrated by the striatum to …
A distributional code for value in dopamine-based reinforcement learning
Since its introduction, the reward prediction error theory of dopamine has explained a wealth
of empirical phenomena, providing a unifying framework for understanding the …
of empirical phenomena, providing a unifying framework for understanding the …
Neural circuitry of reward prediction error
M Watabe-Uchida, N Eshel… - Annual review of …, 2017 - annualreviews.org
Dopamine neurons facilitate learning by calculating reward prediction error, or the difference
between expected and actual reward. Despite two decades of research, it remains unclear …
between expected and actual reward. Despite two decades of research, it remains unclear …
Anterior cingulate learns reward distribution
T Hong, WR Stauffer - Nature Neuroscience, 2024 - nature.com
Muller et al. demonstrate that reward signals recorded from the frontal cortex of nonhuman
primates exhibit a population-based scheme for learning probability distributions over …
primates exhibit a population-based scheme for learning probability distributions over …
Beyond simple reinforcement learning: the computational neurobiology of reward‐learning and valuation
JP O'Doherty - European Journal of Neuroscience, 2012 - Wiley Online Library
Neural computational accounts of reward‐learning have been dominated by the hypothesis
that dopamine neurons behave like a reward‐prediction error and thus facilitate …
that dopamine neurons behave like a reward‐prediction error and thus facilitate …
Adaptive coding of reward prediction errors is gated by striatal coupling
To efficiently represent all of the possible rewards in the world, dopaminergic midbrain
neurons dynamically adapt their coding range to the momentarily available rewards …
neurons dynamically adapt their coding range to the momentarily available rewards …
Focus on decision making.
H Bayer - Nature neuroscience, 2008 - nature.com
The ability to make appropriate choices is critical for survival. Successful decision making
requires the integration of sensory information, motivational states and potential outcomes to …
requires the integration of sensory information, motivational states and potential outcomes to …