相关文章- 学术资源搜索

Optimality-based reward learning with applications to toxicology

SJ Weisenthal, M Eckard, A Ertefaie… - arXiv preprint arXiv …, 2024 - arxiv.org

In toxicology research, experiments are often conducted to determine the effect of toxicant
exposure on the behavior of mice, where mice are randomized to receive the toxicant or not …

[图书][B] Relative Sparsity and Optimality-Based Reward Learning With Applications to Medical Decisions and Toxicology

SJ Weisenthal - 2023 - search.proquest.com

Existing statistical methods can be used to estimate a policy, or a mapping from covariates to
decisions, which can then instruct decision makers (eg, whether to administer hypotension …

[PDF] neurips.cc

Inferring learning rules from animal decision-making

Z Ashwood, NA Roy, JH Bak… - Advances in Neural …, 2020 - proceedings.neurips.cc

How do animals learn? This remains an elusive question in neuroscience. Whereas
reinforcement learning often focuses on the design of algorithms that enable artificial agents …

被引用次数：27 相关文章所有 8 个版本

[PDF] mlr.press

Scaling laws for reward model overoptimization

L Gao, J Schulman, J Hilton - International Conference on …, 2023 - proceedings.mlr.press

In reinforcement learning from human feedback, it is common to optimize against a reward
model trained to predict human preferences. Because the reward model is an imperfect …

被引用次数：225 相关文章所有 7 个版本

[PDF] cshlp.org Free from Publisher

Model-based reinforcement learning under concurrent schedules of reinforcement in rodents

N Huh, S Jo, H Kim, JH Sul, MW Jung - Learning & Memory, 2009 - learnmem.cshlp.org

Reinforcement learning theories postulate that actions are chosen to maximize a long-term
sum of positive outcomes based on value functions, which are subjective estimates of future …

被引用次数：40 相关文章所有 13 个版本

[HTML] nih.gov Free from Publisher

[HTML][HTML] Computational mechanisms underlying motivation to earn symbolic reinforcers

DC Burk, C Taswell, H Tang, BB Averbeck - bioRxiv, 2023 - ncbi.nlm.nih.gov

Reinforcement learning (RL) is a theoretical framework that describes how agents learn to
select options that maximize rewards and minimize punishments over time. We often make …

被引用次数：1 相关文章所有 5 个版本

[PDF] jneurosci.org

Computational mechanisms underlying motivation to earn symbolic reinforcers

DC Burk, C Taswell, H Tang… - Journal of …, 2024 - Soc Neuroscience

Reinforcement learning is a theoretical framework that describes how agents learn to select
options that maximize rewards and minimize punishments over time. We often make …

被引用次数：2 相关文章所有 5 个版本

[PDF] neurips.cc

How fast to work: Response vigor, motivation and tonic dopamine

Y Niv, N Daw, P Dayan - Advances in neural information …, 2005 - proceedings.neurips.cc

Reinforcement learning models have long promised to unify computational, psychological
and neural accounts of appetitively conditioned behavior. However, the bulk of data on …

被引用次数：141 相关文章所有 15 个版本

[PDF] escholarship.org

[PDF][PDF] Reinforcement Learning Leads to Risk Averse Behavior

JC Denrell - Proceedings of the Annual Meeting of the Cognitive …, 2008 - escholarship.org

Animals and humans often have to choose between options with reward distributions that
are initially unknown and can only be learned through experience. Recent experimental and …

被引用次数：1 相关文章所有 3 个版本

[PDF] github.io

[PDF][PDF] Expressing non-Markov reward to a Markov agent

D Abel, A Barreto, M Bowling, W Dabney… - … and Decision Making, 2022 - david-abel.github.io

Abstract Markov Decision Processes are the standard model of sequential decision-making
problems in reinforcement learning. However, as noted by Abel et al.[1], for some …

被引用次数：6 相关文章