Efficient coding of cognitive variables underlies dopamine response and choice behavior

A Motiwala, S Soares, BV Atallah, JJ Paton… - Nature …, 2022 - nature.com
Reward expectations based on internal knowledge of the external environment are a core
component of adaptive behavior. However, internal knowledge may be inaccurate or …

[HTML][HTML] Dopamine-independent effect of rewards on choices through hidden-state inference

M Blanco-Pozo, T Akam, ME Walton - Nature Neuroscience, 2024 - nature.com
Dopamine is implicated in adaptive behavior through reward prediction error (RPE) signals
that update value estimates. There is also accumulating evidence that animals in structured …

[HTML][HTML] Stable representations of decision variables for flexible behavior

BA Bari, CD Grossman, EE Lubin, AE Rajagopalan… - Neuron, 2019 - cell.com
Decisions occur in dynamic environments. In the framework of reinforcement learning, the
probability of performing an action is influenced by decision variables. Discrepancies …

[HTML][HTML] Reward-predictive representations generalize across tasks in reinforcement learning

L Lehnert, ML Littman, MJ Frank - PLoS computational biology, 2020 - journals.plos.org
In computer science, reinforcement learning is a powerful framework with which artificial
agents can learn to maximize their performance for any given Markov decision process …

[HTML][HTML] Fidelity of the representation of value in decision-making

PM Bays, BA Dowding - PLoS computational biology, 2017 - journals.plos.org
The ability to make optimal decisions depends on evaluating the expected rewards
associated with different potential actions. This process is critically dependent on the fidelity …

Multiplexing signals in reinforcement learning with internal models and dopamine

H Nakahara - Current opinion in neurobiology, 2014 - Elsevier
Highlights•Decision-making involves various uses of internal models, such as reflecting
reward structures.•Generalized prediction errors are used to improve value-based decision …

Learning to represent reward structure: A key to adapting to complex environments

H Nakahara, O Hikosaka - Neuroscience research, 2012 - Elsevier
Predicting outcomes is a critical ability of humans and animals. The dopamine reward
prediction error hypothesis, the driving force behind the recent progress in neural “value …

Computational noise in reward-guided learning drives behavioral variability in volatile environments

C Findling, V Skvortsova, R Dromnelle… - Nature …, 2019 - nature.com
When learning the value of actions in volatile environments, humans often make seemingly
irrational decisions that fail to maximize expected value. We reasoned that these 'non …

Reward-bases: dopaminergic mechanisms for adaptive acquisition of multiple reward types

B Millidge, Y Song, A Lak, ME Walton, R Bogacz - BioRxiv, 2023 - biorxiv.org
Animals can adapt their preferences for different types for reward according to physiological
state, such as hunger or thirst. To describe this ability, we propose a simple extension of …

[HTML][HTML] Belief state representation in the dopamine system

BM Babayan, N Uchida, SJ Gershman - Nature communications, 2018 - nature.com
Learning to predict future outcomes is critical for driving appropriate behaviors.
Reinforcement learning (RL) models have successfully accounted for such learning, relying …