相关文章- 学术资源搜索

Efficient coding of cognitive variables underlies dopamine response and choice behavior

A Motiwala, S Soares, BV Atallah, JJ Paton… - Nature …, 2022 - nature.com

Reward expectations based on internal knowledge of the external environment are a core
component of adaptive behavior. However, internal knowledge may be inaccurate or …

被引用次数：7 相关文章所有 5 个版本

[HTML] nature.com

[HTML][HTML] Dopamine-independent effect of rewards on choices through hidden-state inference

M Blanco-Pozo, T Akam, ME Walton - Nature Neuroscience, 2024 - nature.com

Dopamine is implicated in adaptive behavior through reward prediction error (RPE) signals
that update value estimates. There is also accumulating evidence that animals in structured …

被引用次数：7 相关文章所有 7 个版本

[HTML] cell.com Full View

[HTML][HTML] Stable representations of decision variables for flexible behavior

BA Bari, CD Grossman, EE Lubin, AE Rajagopalan… - Neuron, 2019 - cell.com

Decisions occur in dynamic environments. In the framework of reinforcement learning, the
probability of performing an action is influenced by decision variables. Discrepancies …

被引用次数：145 相关文章所有 8 个版本

[HTML] plos.org

[HTML][HTML] Reward-predictive representations generalize across tasks in reinforcement learning

L Lehnert, ML Littman, MJ Frank - PLoS computational biology, 2020 - journals.plos.org

In computer science, reinforcement learning is a powerful framework with which artificial
agents can learn to maximize their performance for any given Markov decision process …

被引用次数：36 相关文章所有 19 个版本

[HTML] plos.org

[HTML][HTML] Fidelity of the representation of value in decision-making

PM Bays, BA Dowding - PLoS computational biology, 2017 - journals.plos.org

The ability to make optimal decisions depends on evaluating the expected rewards
associated with different potential actions. This process is critically dependent on the fidelity …

被引用次数：33 相关文章所有 20 个版本

Multiplexing signals in reinforcement learning with internal models and dopamine

H Nakahara - Current opinion in neurobiology, 2014 - Elsevier

Highlights•Decision-making involves various uses of internal models, such as reflecting
reward structures.•Generalized prediction errors are used to improve value-based decision …

被引用次数：36 相关文章所有 6 个版本

[HTML] nih.gov

Learning to represent reward structure: A key to adapting to complex environments

H Nakahara, O Hikosaka - Neuroscience research, 2012 - Elsevier

Predicting outcomes is a critical ability of humans and animals. The dopamine reward
prediction error hypothesis, the driving force behind the recent progress in neural “value …

被引用次数：33 相关文章所有 10 个版本

[PDF] biorxiv.org

Computational noise in reward-guided learning drives behavioral variability in volatile environments

C Findling, V Skvortsova, R Dromnelle… - Nature …, 2019 - nature.com

When learning the value of actions in volatile environments, humans often make seemingly
irrational decisions that fail to maximize expected value. We reasoned that these 'non …

被引用次数：153 相关文章所有 11 个版本

[PDF] biorxiv.org

Reward-bases: dopaminergic mechanisms for adaptive acquisition of multiple reward types

B Millidge, Y Song, A Lak, ME Walton, R Bogacz - BioRxiv, 2023 - biorxiv.org

Animals can adapt their preferences for different types for reward according to physiological
state, such as hunger or thirst. To describe this ability, we propose a simple extension of …

被引用次数：2 相关文章所有 2 个版本

[HTML] nature.com Full View

[HTML][HTML] Belief state representation in the dopamine system

BM Babayan, N Uchida, SJ Gershman - Nature communications, 2018 - nature.com

Learning to predict future outcomes is critical for driving appropriate behaviors.
Reinforcement learning (RL) models have successfully accounted for such learning, relying …

被引用次数：97 相关文章所有 14 个版本