Reinforcement learning: A tutorial survey and recent advances

A Gosavi - INFORMS Journal on Computing, 2009 - pubsonline.informs.org
In the last few years, reinforcement learning (RL), also called adaptive (or approximate)
dynamic programming, has emerged as a powerful tool for solving complex sequential …

Applications of Reinforcement Learning for maintenance of engineering systems: A review

AP Marugán - Advances in Engineering Software, 2023 - Elsevier
Nowadays, modern engineering systems require sophisticated maintenance strategies to
ensure their correct performance. Maintenance has become one of the most important tasks …

Actor-critic--type learning algorithms for Markov decision processes

VR Konda, VS Borkar - SIAM Journal on control and Optimization, 1999 - SIAM
Algorithms for learning the optimal policy of a Markov decision process (MDP) based on
simulated transitions are formulated and analyzed. These are variants of the well-known" …

[图书][B] Adaptive learning by genetic algorithms: Analytical results and applications to economic models

H Dawid - 2011 - books.google.com
The fact that I have the opportunity to present a second edition of this monograph is an
indicator for the growing size of the community concerned with agent-based computational …

Machine learning advances in microbiology: A review of methods and applications

Y Jiang, J Luo, D Huang, Y Liu, D Li - Frontiers in Microbiology, 2022 - frontiersin.org
Microorganisms play an important role in natural material and elemental cycles. Many
common and general biology research techniques rely on microorganisms. Machine …

From reinforcement learning to deep reinforcement learning: An overview

F Agostinelli, G Hocquet, S Singh, P Baldi - … , Boston, MA, USA, April 28-30 …, 2018 - Springer
From Reinforcement Learning to Deep Reinforcement Learning: An Overview | SpringerLink
Skip to main content Advertisement SpringerLink Account Menu Find a journal Publish with us …

[PDF][PDF] A survey of exploration strategies in reinforcement learning

R McFarlane - McGill University, 2018 - researchgate.net
A fundamental issue in reinforcement learning algorithms is the balance between
exploration of the environment and exploitation of information already obtained by the agent …

Optimization of very-low-thrust trajectories using evolutionary neurocontrol

B Dachwald - Acta Astronautica, 2005 - Elsevier
Searching optimal interplanetary trajectories for low-thrust spacecraft is usually a difficult
and time-consuming task that involves much experience and expert knowledge in …

[PDF][PDF] Low-thrust trajectory optimization and interplanetary mission analysis using evolutionary neurocontrol

B Dachwald - Doktorarbeit, Institut für Raumfahrttechnik, Universität …, 2004 - spacesailing.net
The design and optimization of interplanetary transfer trajectories is one of the most
important tasks during the analysis and design of a deep space mission. Due to their larger∆ …

Application of Q-learning with temperature variation for bidding strategies in market based power systems

MB Naghibi-Sistani, MR Akbarzadeh-Tootoonchi… - Energy Conversion and …, 2006 - Elsevier
The electric power industry is confronted with restructuring in which the operation
scheduling is going to be decided based on a competitive market. In this new arrangement …