Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

Mesoaccumbal dopamine heterogeneity: what do dopamine firing and release have to do with it?

JW de Jong, KM Fraser, S Lammel - Annual review of …, 2022 - annualreviews.org
Ventral tegmental area (VTA) dopamine (DA) neurons are often thought to uniformly encode
reward prediction errors. Conversely, DA release in the nucleus accumbens (NAc), the …

[HTML][HTML] Habitual daily intake of a sweet and fatty snack modulates reward processing in humans

SE Thanarajah, AG DiFeliceantonio, K Albus… - Cell metabolism, 2023 - cell.com
Western diets rich in fat and sugar promote excess calorie intake and weight gain; however,
the underlying mechanisms are unclear. Despite a well-documented association between …

[HTML][HTML] Distinct temporal difference error signals in dopamine axons in three regions of the striatum in a decision-making task

I Tsutsui-Kimura, H Matsumoto, K Akiti, MM Yamada… - Elife, 2020 - elifesciences.org
Different regions of the striatum regulate different types of behavior. However, how
dopamine signals differ across striatal regions and how dopamine regulates different …

[图书][B] Distributional reinforcement learning

MG Bellemare, W Dabney, M Rowland - 2023 - books.google.com
The first comprehensive guide to distributional reinforcement learning, providing a new
mathematical formalism for thinking about decisions from a probabilistic perspective …

Informing deep neural networks by multiscale principles of neuromodulatory systems

J Mei, E Muller, S Ramaswamy - Trends in Neurosciences, 2022 - cell.com
Our brains have evolved the ability to configure and adapt their processing states to match
the unique challenges of acting and learning in diverse environments and behavioral …

[HTML][HTML] Striatal dopamine explains novelty-induced behavioral dynamics and individual variability in threat prediction

K Akiti, I Tsutsui-Kimura, Y Xie, A Mathis, JE Markowitz… - Neuron, 2022 - cell.com
Animals both explore and avoid novel objects in the environment, but the neural
mechanisms that underlie these behaviors and their dynamics remain uncharacterized …

[HTML][HTML] Formalising social representation to explain psychiatric symptoms

JM Barnby, P Dayan, V Bell - Trends in cognitive sciences, 2023 - cell.com
Recent work in social cognition has moved beyond a focus on how people process social
rewards to examine how healthy people represent other agents and how this is altered in …

Exponential history integration with diverse temporal scales in retrosplenial cortex supports hyperbolic behavior

BP Danskin, R Hattori, YE Zhang, Z Babic, M Aoi… - Science …, 2023 - science.org
Animals use past experience to guide future choices. The integration of experiences
typically follows a hyperbolic, rather than exponential, decay pattern with a heavy tail for …

Interoception as modeling, allostasis as control

E Sennesh, J Theriault, D Brooks, JW van de Meent… - Biological …, 2022 - Elsevier
The brain regulates the body by anticipating its needs and attempting to meet them before
they arise–a process called allostasis. Allostasis requires a model of the changing sensory …