An intrinsic value system for developing multiple invariant representations with incremental...

J Schmidhuber - Neural networks, 2015 - Elsevier

In recent years, deep artificial neural networks (including recurrent ones) have won
numerous contests in pattern recognition and machine learning. This historical survey …

Cited by 23796 | Related articles | All 42 versions

[PDF] frontiersin.org

Exploration, novelty, surprise, and free energy minimization

P Schwartenbeck, T FitzGerald, RJ Dolan… - Frontiers in …, 2013 - frontiersin.org

This paper reviews recent developments under the free energy principle that introduce a
normative perspective on classical economic (utilitarian) decision-making based on (active) …

Cited by 338 | Related articles | All 22 versions

[PDF] elifesciences.org

Computational mechanisms of curiosity and goal-directed exploration

P Schwartenbeck, J Passecker, TU Hauser… - eLife, 2019 - elifesciences.org

Successful behaviour depends on the right balance between maximising reward and
soliciting information about the world. Here, we show how different types of information-gain …

Cited by 224 | Related articles | All 22 versions

[PDF] oup.com

The dopaminergic midbrain encodes the expected certainty about desired outcomes

P Schwartenbeck, THB FitzGerald, C Mathys… - Cerebral …, 2015 - academic.oup.com

Dopamine plays a key role in learning; however, its exact function in decision making and
choice remains unclear. Recently, we proposed a generic model based on active (Bayesian) …

Cited by 213 | Related articles | All 28 versions

[PDF] sciencedirect.com

Curiosity as a metacognitive feeling

L Goupil, J Proust - Cognition, 2023 - Elsevier

Curious information-seeking is known to be a key driver for learning, but characterizing this
important psychological phenomenon remains a challenge. In this article, we argue that …

Cited by 47 | Related articles | All 11 versions

[PDF] arxiv.org

On learning to think: Algorithmic information theory for novel combinations of reinforcement learning controllers and recurrent neural world models

J Schmidhuber - arXiv preprint arXiv:1511.09249, 2015 - arxiv.org

This paper addresses the general problem of reinforcement learning (RL) in partially
observable environments. In 2013, our large RL recurrent neural networks (RNNs) learned …

Cited by 133 | Related articles | All 2 versions

[PDF] princeton.edu

[PDF] Boredom, information-seeking and exploration

A Geana, R Wilson, ND Daw, J Cohen - CogSci, 2016 - princeton.edu

Any adaptive organism faces the choice between taking actions with known benefits
(exploitation), and sampling new actions to check for other, more valuable opportunities …

Cited by 110 | Related articles | All 5 versions

[HTML] cell.com

[HTML] Curiosity and the dynamics of optimal exploration

F Poli, JX O'Reilly, RB Mars, S Hunnius - Trends in Cognitive Sciences, 2024 - cell.com

What drives our curiosity remains an elusive and hotly debated issue, with multiple
hypotheses proposed but a cohesive account yet to be established. This review discusses …

Cited by 17 | Related articles | All 7 versions

[PDF] tandfonline.com

The minority report: some common assumptions to reconsider in the modelling of the brain and behaviour

S Edelman - Journal of Experimental & Theoretical Artificial …, 2016 - Taylor & Francis

Reverse-engineering the brain involves adopting and testing a hierarchy of working
hypotheses regarding the computational problems that it solves, the representations and …

Cited by 82 | Related articles | All 3 versions

Value systems for developmental cognitive robotics: A survey

K Merrick - Cognitive Systems Research, 2017 - Elsevier

This paper surveys value systems for developmental cognitive robotics. A value system
permits a biological brain to increase the likelihood of neural responses to selected external …

Cited by 48 | Related articles | All 3 versions