Towards continual reinforcement learning: A review and perspectives
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …
Mesoaccumbal dopamine heterogeneity: what do dopamine firing and release have to do with it?
Ventral tegmental area (VTA) dopamine (DA) neurons are often thought to uniformly encode
reward prediction errors. Conversely, DA release in the nucleus accumbens (NAc), the …
reward prediction errors. Conversely, DA release in the nucleus accumbens (NAc), the …
[HTML][HTML] Habitual daily intake of a sweet and fatty snack modulates reward processing in humans
SE Thanarajah, AG DiFeliceantonio, K Albus… - Cell metabolism, 2023 - cell.com
Western diets rich in fat and sugar promote excess calorie intake and weight gain; however,
the underlying mechanisms are unclear. Despite a well-documented association between …
the underlying mechanisms are unclear. Despite a well-documented association between …
[HTML][HTML] Distinct temporal difference error signals in dopamine axons in three regions of the striatum in a decision-making task
I Tsutsui-Kimura, H Matsumoto, K Akiti, MM Yamada… - Elife, 2020 - elifesciences.org
Different regions of the striatum regulate different types of behavior. However, how
dopamine signals differ across striatal regions and how dopamine regulates different …
dopamine signals differ across striatal regions and how dopamine regulates different …
[图书][B] Distributional reinforcement learning
The first comprehensive guide to distributional reinforcement learning, providing a new
mathematical formalism for thinking about decisions from a probabilistic perspective …
mathematical formalism for thinking about decisions from a probabilistic perspective …
Informing deep neural networks by multiscale principles of neuromodulatory systems
Our brains have evolved the ability to configure and adapt their processing states to match
the unique challenges of acting and learning in diverse environments and behavioral …
the unique challenges of acting and learning in diverse environments and behavioral …
[HTML][HTML] Striatal dopamine explains novelty-induced behavioral dynamics and individual variability in threat prediction
Animals both explore and avoid novel objects in the environment, but the neural
mechanisms that underlie these behaviors and their dynamics remain uncharacterized …
mechanisms that underlie these behaviors and their dynamics remain uncharacterized …
[HTML][HTML] Formalising social representation to explain psychiatric symptoms
Recent work in social cognition has moved beyond a focus on how people process social
rewards to examine how healthy people represent other agents and how this is altered in …
rewards to examine how healthy people represent other agents and how this is altered in …
Exponential history integration with diverse temporal scales in retrosplenial cortex supports hyperbolic behavior
Animals use past experience to guide future choices. The integration of experiences
typically follows a hyperbolic, rather than exponential, decay pattern with a heavy tail for …
typically follows a hyperbolic, rather than exponential, decay pattern with a heavy tail for …
Interoception as modeling, allostasis as control
The brain regulates the body by anticipating its needs and attempting to meet them before
they arise–a process called allostasis. Allostasis requires a model of the changing sensory …
they arise–a process called allostasis. Allostasis requires a model of the changing sensory …