相关文章- 学术资源搜索

Reward prediction error neurons implement an efficient code for reward

HH Schütt, D Kim, WJ Ma - Nature Neuroscience, 2024 - nature.com

We use efficient coding principles borrowed from sensory neuroscience to derive the optimal
neural population to encode a reward distribution. We show that the responses of …

被引用次数：4 相关文章所有 7 个版本

[PDF] cyberleninka.org

Reinforcement learning in populations of spiking neurons

R Urbanczik, W Senn - Nature neuroscience, 2009 - nature.com

Population coding is widely regarded as an important mechanism for achieving reliable
behavioral responses despite neuronal variability. However, standard reinforcement …

被引用次数：125 相关文章所有 13 个版本

[PDF] cell.com

Distributional reinforcement learning in the brain

AS Lowet, Q Zheng, S Matias, J Drugowitsch… - Trends in …, 2020 - cell.com

Learning about rewards and punishments is critical for survival. Classical studies have
demonstrated an impressive correspondence between the firing of dopamine neurons in the …

被引用次数：58 相关文章所有 12 个版本

Amygdala and ventral striatum population codes implement multiple learning rates for reinforcement learning

BB Averbeck - 2017 IEEE Symposium Series on Computational …, 2017 - ieeexplore.ieee.org

Standard models of reinforcement learning in the brain assume that dopamine codes reward
prediction errors, and these reward prediction errors are integrated by the striatum to …

被引用次数：19 相关文章

[HTML] nih.gov

A distributional code for value in dopamine-based reinforcement learning

W Dabney, Z Kurth-Nelson, N Uchida, CK Starkweather… - Nature, 2020 - nature.com

Since its introduction, the reward prediction error theory of dopamine has explained a wealth
of empirical phenomena, providing a unifying framework for understanding the …

被引用次数：417 相关文章所有 13 个版本

[HTML] nih.gov

Neural circuitry of reward prediction error

M Watabe-Uchida, N Eshel… - Annual review of …, 2017 - annualreviews.org

Dopamine neurons facilitate learning by calculating reward prediction error, or the difference
between expected and actual reward. Despite two decades of research, it remains unclear …

被引用次数：363 相关文章所有 8 个版本

Anterior cingulate learns reward distribution

T Hong, WR Stauffer - Nature Neuroscience, 2024 - nature.com

Muller et al. demonstrate that reward signals recorded from the frontal cortex of nonhuman
primates exhibit a population-based scheme for learning probability distributions over …

被引用次数：1 相关文章所有 3 个版本

[PDF] wiley.com

Beyond simple reinforcement learning: the computational neurobiology of reward‐learning and valuation

JP O'Doherty - European Journal of Neuroscience, 2012 - Wiley Online Library

Neural computational accounts of reward‐learning have been dominated by the hypothesis
that dopamine neurons behave like a reward‐prediction error and thus facilitate …

被引用次数：54 相关文章所有 6 个版本

[PDF] pnas.org Full View

Adaptive coding of reward prediction errors is gated by striatal coupling

SQ Park, T Kahnt, D Talmi… - Proceedings of the …, 2012 - National Acad Sciences

To efficiently represent all of the possible rewards in the world, dopaminergic midbrain
neurons dynamically adapt their coding range to the momentarily available rewards …

被引用次数：61 相关文章所有 15 个版本

Focus on decision making.

H Bayer - Nature neuroscience, 2008 - nature.com

The ability to make appropriate choices is critical for survival. Successful decision making
requires the integration of sensory information, motivational states and potential outcomes to …

被引用次数：7 相关文章所有 7 个版本