Dopamine responses reveal efficient coding of cognitive variables

A Motiwala, S Soares, BV Atallah, JJ Paton… - bioRxiv, 2020 - biorxiv.org
Reward expectations based on internal knowledge of the external environment are a core
component of adaptive behavior. However, internal knowledge may be inaccurate or …

Efficient coding of cognitive variables underlies dopamine response and choice behavior

A Motiwala, S Soares, BV Atallah, JJ Paton… - Nature …, 2022 - nature.com
Reward expectations based on internal knowledge of the external environment are a core
component of adaptive behavior. However, internal knowledge may be inaccurate or …

Dopamine-independent effect of rewards on choices through hidden-state inference

M Blanco-Pozo, T Akam, ME Walton - Nature Neuroscience, 2024 - nature.com
Dopamine is implicated in adaptive behavior through reward prediction error (RPE) signals
that update value estimates. There is also accumulating evidence that animals in structured …

Reward-bases: dopaminergic mechanisms for adaptive acquisition of multiple reward types

B Millidge, Y Song, A Lak, ME Walton, R Bogacz - BioRxiv, 2023 - biorxiv.org
Animals can adapt their preferences for different types for reward according to physiological
state, such as hunger or thirst. To describe this ability, we propose a simple extension of …

Stable representations of decision variables for flexible behavior

BA Bari, CD Grossman, EE Lubin, AE Rajagopalan… - Neuron, 2019 - cell.com
Decisions occur in dynamic environments. In the framework of reinforcement learning, the
probability of performing an action is influenced by decision variables. Discrepancies …

Dopamine-independent state inference mediates expert reward guided decision making

M Blanco-Pozo, T Akam, ME Walton - bioRxiv, 2021 - biorxiv.org
Rewards are thought to influence future choices through dopaminergic reward prediction
errors (RPEs) updating stored value estimates. However, accumulating evidence suggests …

Learning to represent reward structure: A key to adapting to complex environments

H Nakahara, O Hikosaka - Neuroscience research, 2012 - Elsevier
Predicting outcomes is a critical ability of humans and animals. The dopamine reward
prediction error hypothesis, the driving force behind the recent progress in neural “value …

Multiplexing signals in reinforcement learning with internal models and dopamine

H Nakahara - Current opinion in neurobiology, 2014 - Elsevier
Highlights•Decision-making involves various uses of internal models, such as reflecting
reward structures.•Generalized prediction errors are used to improve value-based decision …

Representation learning with reward prediction errors

WH Alexander, SJ Gershman - arXiv preprint arXiv:2108.12402, 2021 - arxiv.org
The Reward Prediction Error hypothesis proposes that phasic activity in the midbrain
dopaminergic system reflects prediction errors needed for learning in reinforcement …

[HTML][HTML] Mesolimbic dopamine encodes reward prediction errors independent of learning rates

A Mah, C Golden, C Constantinople - bioRxiv, 2024 - ncbi.nlm.nih.gov
Biological accounts of reinforcement learning posit that dopamine encodes reward
prediction errors (RPEs), which are multiplied by a learning rate to update state or action …