related:wc-buNtTkJEJ:scholar.lanfanshu.cn/

The successor representation in choice evaluation

EM Russek - 2018 - search.proquest.com

How does the brain make decisions in sequential, multi-step tasks? In principle, evaluating
choice options in such tasks requires simulating each sequence of states that might follow a …

The basal ganglia, reinforcement learning, and the encoding of value

K Doya, M Kimura - Neuroeconomics, 2014 - Elsevier

In this chapter we ask a number of questions regarding the brain's realization of reward-
based decision making: 1) What is the right mathematical framework to capture the actual …

被引用次数：4 相关文章

[HTML] plos.org

[HTML][HTML] Parallel representation of value-based and finite state-based strategies in the ventral and dorsal striatum

M Ito, K Doya - PLoS computational biology, 2015 - journals.plos.org

Previous theoretical studies of animal and human behavioral learning have focused on the
dichotomy of the value-based strategy using action value functions to predict rewards and …

被引用次数：27 相关文章所有 16 个版本

[图书][B] Neuroeconomics: Chapter 17. The Basal Ganglia, Reinforcement Learning, and the Encoding of Value

K Doya, M Kimura - 2013 - books.google.com

In this chapter we ask a number of questions regarding the brain's realization of reward-
based decision making: 1) What is the right mathematical framework to capture the actual …

[PDF] psu.edu

[PDF][PDF] Supplemental material for “Model-based influences on humans' choices and striatal prediction errors”

ND Daw, SJ Gershman, B Seymour, P Dayan… - Citeseer

The task consists of three states (first stage: sA; second stage: sB and sC), each with two
actions (aA and aB). The goal of both the model-based and model-free subcomponents of …

[PDF][PDF] Hidden knobs: Representations for flexible goal-directed decision-making

S Gluth, A Shenhav - shenhavlab.org

Sequential sampling models have been tremendously successful in describing mechanisms
of decision-making at the behavioral level, and at providing testable predictions at the neural …

[PDF] wiley.com

Time elapsed between choices in a probabilistic task correlates with repeating the same decision

J Jabłońska, Ł Szumiec, P Zieliński… - European Journal of …, 2021 - Wiley Online Library

Reinforcement learning causes an action that yields a positive outcome more likely to be
taken in the future. Here, we investigate how the time elapsed from an action affects …

被引用次数：2 相关文章所有 5 个版本

[HTML] frontiersin.org

[HTML][HTML] Behavioral correlates of the decision process in a dynamic environment: post-choice latencies reflect relative value and choice evaluation

J Fam, F Westbrook, E Arabzadeh - Frontiers in Behavioral …, 2015 - frontiersin.org

One characteristic of natural environments is that outcomes vary across time. Animals need
to adapt to these environmental changes and adjust their choices accordingly. In this …

被引用次数：2 相关文章所有 15 个版本

[HTML] cell.com Full View

[HTML][HTML] Stable representations of decision variables for flexible behavior

BA Bari, CD Grossman, EE Lubin, AE Rajagopalan… - Neuron, 2019 - cell.com

Decisions occur in dynamic environments. In the framework of reinforcement learning, the
probability of performing an action is influenced by decision variables. Discrepancies …

被引用次数：146 相关文章所有 8 个版本

[PDF] psu.edu

[PDF][PDF] Processing Reward: How Humans and Learning Models Compare Different Expected Reward Values in Choice Tasks

DA Worthy, WT Maddox, AB Markman - Citeseer

Several models of choice compute the probability of selecting a given option by comparing
the Expected Value (EV) of each option. However, there is a subtle difference between two …