A survey of actor-critic reinforcement learning: Standard and natural policy gradients I Grondman, L Busoniu, GAD Lopes, R Babuska IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and …, 2012 | 1200 | 2012 |
Efficient model learning methods for actor–critic control I Grondman, M Vaandrager, L Busoniu, R Babuska, E Schuitema IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics) 42 …, 2011 | 130 | 2011 |
Model learning actor-critic algorithms: Performance evaluation in a motion control task I Grondman, L Buşoniu, R Babuška 2012 IEEE 51st IEEE Conference on Decision and Control (CDC), 5272-5277, 2012 | 29 | 2012 |
Comparison of model-free and model-based methods for time optimal hit control of a badminton robot B Depraetere, M Liu, G Pinte, I Grondman, R Babuška Mechatronics 24 (8), 1021-1030, 2014 | 26 | 2014 |
Online model learning algorithms for actor-critic control I Grondman | 25 | 2015 |
Actor-critic control with reference model learning I Grondman, M Vaandrager, L Busoniu, R Babuska, E Schuitema IFAC Proceedings Volumes 44 (1), 14723-14728, 2011 | 14 | 2011 |
Learning rate free reinforcement learning for real-time motion control using a value-gradient based policy JC Van Rooijen, I Grondman, R Babuška Mechatronics 24 (8), 966-974, 2014 | 9 | 2014 |
Model-free and model-based time-optimal control of a badminton robot M Liu, B Depraetere, G Pinte, I Grondman, R Babuška 2013 9th Asian Control Conference (ASCC), 1-6, 2013 | 9 | 2013 |
Solutions to finite horizon cost problems using actor-critic reinforcement learning I Grondman, H Xu, S Jagannathan, R Babuška The 2013 International Joint Conference on Neural Networks (IJCNN), 1-7, 2013 | 6 | 2013 |