A tutorial survey of reinforcement learning

A Gosavi - INFORMS Journal on Computing, 2009 - pubsonline.informs.org

In the last few years, reinforcement learning (RL), also called adaptive (or approximate)
dynamic programming, has emerged as a powerful tool for solving complex sequential …

被引用次数：446 相关文章所有 15 个版本

Applications of Reinforcement Learning for maintenance of engineering systems: A review

AP Marugán - Advances in Engineering Software, 2023 - Elsevier

Nowadays, modern engineering systems require sophisticated maintenance strategies to
ensure their correct performance. Maintenance has become one of the most important tasks …

被引用次数：28 相关文章所有 2 个版本

[PDF] siam.org

Actor-critic--type learning algorithms for Markov decision processes

VR Konda, VS Borkar - SIAM Journal on control and Optimization, 1999 - SIAM

Algorithms for learning the optimal policy of a Markov decision process (MDP) based on
simulated transitions are formulated and analyzed. These are variants of the well-known" …

被引用次数：308 相关文章所有 7 个版本

[图书][B] Adaptive learning by genetic algorithms: Analytical results and applications to economic models

H Dawid - 2011 - books.google.com

The fact that I have the opportunity to present a second edition of this monograph is an
indicator for the growing size of the community concerned with agent-based computational …

被引用次数：480 相关文章所有 10 个版本

[PDF] frontiersin.org

Machine learning advances in microbiology: A review of methods and applications

Y Jiang, J Luo, D Huang, Y Liu, D Li - Frontiers in Microbiology, 2022 - frontiersin.org

Microorganisms play an important role in natural material and elemental cycles. Many
common and general biology research techniques rely on microorganisms. Machine …

被引用次数：44 相关文章所有 7 个版本

[PDF] researchgate.net

From reinforcement learning to deep reinforcement learning: An overview

F Agostinelli, G Hocquet, S Singh, P Baldi - … , Boston, MA, USA, April 28-30 …, 2018 - Springer

From Reinforcement Learning to Deep Reinforcement Learning: An Overview | SpringerLink
Skip to main content Advertisement SpringerLink Account Menu Find a journal Publish with us …

被引用次数：77 相关文章所有 6 个版本

[PDF] researchgate.net

[PDF][PDF] A survey of exploration strategies in reinforcement learning

R McFarlane - McGill University, 2018 - researchgate.net

A fundamental issue in reinforcement learning algorithms is the balance between
exploration of the environment and exploitation of information already obtained by the agent …

被引用次数：78 相关文章

[PDF] spacesailing.net

Optimization of very-low-thrust trajectories using evolutionary neurocontrol

B Dachwald - Acta Astronautica, 2005 - Elsevier

Searching optimal interplanetary trajectories for low-thrust spacecraft is usually a difficult
and time-consuming task that involves much experience and expert knowledge in …

被引用次数：86 相关文章所有 13 个版本

[PDF] spacesailing.net

[PDF][PDF] Low-thrust trajectory optimization and interplanetary mission analysis using evolutionary neurocontrol

B Dachwald - Doktorarbeit, Institut für Raumfahrttechnik, Universität …, 2004 - spacesailing.net

The design and optimization of interplanetary transfer trajectories is one of the most
important tasks during the analysis and design of a deep space mission. Due to their larger∆ …

被引用次数：92 相关文章所有 9 个版本

Application of Q-learning with temperature variation for bidding strategies in market based power systems

MB Naghibi-Sistani, MR Akbarzadeh-Tootoonchi… - Energy Conversion and …, 2006 - Elsevier

The electric power industry is confronted with restructuring in which the operation
scheduling is going to be decided based on a competitive market. In this new arrangement …

被引用次数：74 相关文章所有 10 个版本