Distributional reinforcement learning in the brain

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

被引用次数：256 相关文章所有 9 个版本

[HTML] nih.gov

Mesoaccumbal dopamine heterogeneity: what do dopamine firing and release have to do with it?

JW de Jong, KM Fraser, S Lammel - Annual review of …, 2022 - annualreviews.org

Ventral tegmental area (VTA) dopamine (DA) neurons are often thought to uniformly encode
reward prediction errors. Conversely, DA release in the nucleus accumbens (NAc), the …

被引用次数：40 相关文章所有 7 个版本

[HTML] cell.com Full View

[HTML][HTML] Habitual daily intake of a sweet and fatty snack modulates reward processing in humans

SE Thanarajah, AG DiFeliceantonio, K Albus… - Cell metabolism, 2023 - cell.com

Western diets rich in fat and sugar promote excess calorie intake and weight gain; however,
the underlying mechanisms are unclear. Despite a well-documented association between …

被引用次数：36 相关文章所有 15 个版本

[HTML] elifesciences.org

[HTML][HTML] Distinct temporal difference error signals in dopamine axons in three regions of the striatum in a decision-making task

I Tsutsui-Kimura, H Matsumoto, K Akiti, MM Yamada… - Elife, 2020 - elifesciences.org

Different regions of the striatum regulate different types of behavior. However, how
dopamine signals differ across striatal regions and how dopamine regulates different …

被引用次数：78 相关文章所有 12 个版本

[PDF] enseeiht.fr

[图书][B] Distributional reinforcement learning

MG Bellemare, W Dabney, M Rowland - 2023 - books.google.com

The first comprehensive guide to distributional reinforcement learning, providing a new
mathematical formalism for thinking about decisions from a probabilistic perspective …

被引用次数：118 相关文章所有 9 个版本

Informing deep neural networks by multiscale principles of neuromodulatory systems

J Mei, E Muller, S Ramaswamy - Trends in Neurosciences, 2022 - cell.com

Our brains have evolved the ability to configure and adapt their processing states to match
the unique challenges of acting and learning in diverse environments and behavioral …

被引用次数：33 相关文章所有 7 个版本

[HTML] cell.com Full View

[HTML][HTML] Striatal dopamine explains novelty-induced behavioral dynamics and individual variability in threat prediction

K Akiti, I Tsutsui-Kimura, Y Xie, A Mathis, JE Markowitz… - Neuron, 2022 - cell.com

Animals both explore and avoid novel objects in the environment, but the neural
mechanisms that underlie these behaviors and their dynamics remain uncharacterized …

被引用次数：49 相关文章所有 10 个版本

[HTML] cell.com

[HTML][HTML] Formalising social representation to explain psychiatric symptoms

JM Barnby, P Dayan, V Bell - Trends in cognitive sciences, 2023 - cell.com

Recent work in social cognition has moved beyond a focus on how people process social
rewards to examine how healthy people represent other agents and how this is altered in …

被引用次数：18 相关文章所有 14 个版本

[PDF] science.org Full View

Exponential history integration with diverse temporal scales in retrosplenial cortex supports hyperbolic behavior

BP Danskin, R Hattori, YE Zhang, Z Babic, M Aoi… - Science …, 2023 - science.org

Animals use past experience to guide future choices. The integration of experiences
typically follows a hyperbolic, rather than exponential, decay pattern with a heavy tail for …

被引用次数：2 相关文章所有 7 个版本

[PDF] sciencedirect.com

Interoception as modeling, allostasis as control

E Sennesh, J Theriault, D Brooks, JW van de Meent… - Biological …, 2022 - Elsevier

The brain regulates the body by anticipating its needs and attempting to meet them before
they arise–a process called allostasis. Allostasis requires a model of the changing sensory …

被引用次数：65 相关文章所有 14 个版本