Model-based reinforcement learning under concurrent schedules of reinforcement in rodents
N Huh, S Jo, H Kim, JH Sul, MW Jung - Learning & Memory, 2009 - learnmem.cshlp.org
Reinforcement learning theories postulate that actions are chosen to maximize a long-term
sum of positive outcomes based on value functions, which are subjective estimates of future …
Choice type impacts human reinforcement learning
M Rmus, A Zou, AGE Collins - Journal of Cognitive Neuroscience, 2023 - direct.mit.edu
In reinforcement learning (RL) experiments, participants learn to make rewarding choices in
response to different stimuli; RL models use outcomes to estimate stimulus–response values …
Variable and sub-optimal responses to a choice problem are a persistent default mode
AR Hunt, W James, J Reuther, M Spilioti, E Mackay… - 2019 - osf.io
Here we report persistent choice variability in the presence of a simple decision rule. Two
analogous choice problems are presented, both of which involve making decisions about …
Recurrent networks endowed with structural priors explain suboptimal animal behavior
The strategies found by animals facing a new task are determined both by individual
experience and by structural priors evolved to leverage the statistics of natural …
Time elapsed between choices in a probabilistic task correlates with repeating the same decision
J Jabłońska, Ł Szumiec, P Zieliński… - European Journal of …, 2021 - Wiley Online Library
Reinforcement learning causes an action that yields a positive outcome more likely to be
taken in the future. Here, we investigate how the time elapsed from an action affects …
Novel behavioral paradigm reveals lower temporal limits on mouse olfactory decisions
Temporal limits on perceptual decisions set strict boundaries on the possible underlying
neural computations. How odor information is encoded in the olfactory system is still poorly …
Sequential mechanisms underlying concentration invariance in biological olfaction
TA Cleland, SYT Chen, KW Hozer, HN Ukatu… - Frontiers in …, 2012 - frontiersin.org
Concentration invariance—the capacity to recognize a given odorant (analyte) across a
range of concentrations—is an unusually difficult problem in the olfactory modality …
Experience-dependent evolution of odor mixture representations in piriform cortex
A Berners-Lee, E Shtrahman, J Grimaud… - PLoS Biology, 2023 - journals.plos.org
Rodents can learn from exposure to rewarding odors to make better and quicker decisions.
The piriform cortex is thought to be important for learning complex odor associations; …
What is a choice in reinforcement learning?
M Rmus, A Collins - CogSci, 2020 - cognitivesciencesociety.org
In reinforcement learning (RL) experiments, participants learn to associate stimuli with
rewarding responses. RL models capture such learning by estimating stimulus-response …
Choice history effects in mice and humans improve reward harvesting efficiency
JS López-Yépez, J Martin, O Hulme… - PLOS Computational …, 2021 - journals.plos.org
Choice history effects describe how future choices depend on the history of past choices. In
experimental tasks this is typically framed as a bias because it often diminishes the …